Enterococcus faecalis Polynucleotides and Polypeptides



FIELD OF THE INVENTION 

The present invention relates to the field of molecular biology. In particular, it
relates to, among other things, nucleotide sequences of Enterococcus faecalis, contigs,
ORFs, fragments, probes, primers and related polynucleotides thereof, peptides and
polypeptides encoded by the sequences, and uses of the polynucleotides and sequences
thereof, such as in fermentation, polypeptide production, assays and pharmaceutical
development, among others.

BACKGROUND OF THE INVENTION 

Enterococci have been recognized as being pathogenic for humans since the turn
of the century, when they were first described by Thiercelin in 1899 as microscopic
organisms. The genus Enterococcus includes the species Enterococcus faecalis, or E.
faecalis, which is the most common pathogen in the group, accounting for 80-90
percent of all enterococcal infections. See Lewis et al. (1990) Eur. J. Clin. Microbiol.
Infect. Dis. 9:111-117.

The incidence of enterococcal infections has increased in recent years and
enterococci are now the second most frequently reported nosocomial pathogens.
Enterococcal infection is of particular concern because of its resistance to antibiotics.
Recent attention has focused on enterococci not only because of their increasing role in
nosocomial infections, but also because of their remarkable and increasing resistance to
antimicrobial agents. These factors are mutually reinforcing since resistance allows
enterococci to survive in an environment in which antimicrobial agents are heavily used;
the hospital setting provides the antibiotics which eliminate or suppress susceptible
bacteria, thereby providing a selective advantage for resistant organisms, and the hospital
also provides the potential for dissemination of resistant enterococci via the usual routes
of hand and environmental contamination.

Antimicrobial resistance can be divided into two general types: that which is an
inherent or intrinsic property and that which is acquired. The genes for intrinsic
resistance, like other species characteristics, appear to reside on the chromosome.
Acquired resistance results from either a mutation in the existing DNA or acquisition of
new DNA. The various inherent traits expressed by enterococci include resistance to
semisynthetic penicillinase-resistant penicillins, cephalosporins, low levels of
aminoglycosides, and low levels of clindamycin. Examples of acquired resistance include
resistance to chloramphenicol, erythromycin, high levels of clindamycin, tetracycline,
high levels of aminoglycosides, penicillin by means of penicillinase, fluoroquinolones,
and vancomycin. Resistance to high levels of penicillin without penicillinase and
resistance to fluoroquinolones are not known to be plasmid or transposon mediated and
presumably are due to mutation(s).

Although the main reservoir for enterococci in humans is the gastrointestinal 
tract, the bacteria can also reside in the gallbladder, urethra and vagina. 

E. faecalis has emerged as an important pathogen in endocarditis, bacteremia,
urinary tract infections (UTIs), intraabdominal infections, soft tissue infections, and
neonatal sepsis (Lewis 1990, supra). In the 1970s and 1980s enterococci became firmly
established as major nosocomial pathogens. They are now the fourth leading cause of
hospital-acquired infection and the third leading cause of bacteremia in the United States.
Fatality ratios for enterococcal bacteremia range from 12% to 68%, with death due to
enterococcal sepsis in 4% to 50% of these cases. See Emori, T.G. (1993) Clin. Microbiol.
Rev. 6:428-442.

The ability of enterococci to colonize the gastrointestinal tract, plus the many
intrinsic and acquired resistance traits, means that these organisms, which usually seem to
have relatively low intrinsic virulence, are given an excellent opportunity to become
secondary invaders. Since nosocomial isolates of enterococci have displayed resistance to
essentially every useful antimicrobial agent, it will likely become increasingly difficult to
successfully treat and control enterococcal infections, particularly when the various
resistance genes come together in a single strain, an event almost certain to occur at
some time in the future.

The etiology of diseases mediated or exacerbated by Enterococcus faecalis
involves the programmed expression of E. faecalis genes, and characterizing these
genes and their patterns of expression would dramatically add to our understanding of the
organism and its host interactions. Knowledge of E. faecalis gene and genomic
organization would improve our understanding of disease etiology and lead to improved
and new ways of preventing, treating and diagnosing diseases. Thus, there is a need to
characterize the genome of E. faecalis and for polynucleotides of this organism.



SUMMARY OF THE INVENTION 

The present invention is based on the sequencing of fragments of the
Enterococcus faecalis genome. The primary nucleotide sequences which were generated
are provided in SEQ ID NOS: 1-982.

The present invention provides the nucleotide sequence of hundreds of contigs of
the Enterococcus faecalis genome, which are listed in tables below and set out in the
Sequence Listing submitted herewith, and representative fragments thereof, in a form
which can be readily used, analyzed, and interpreted by a skilled artisan. In one
embodiment, the present invention is provided as contiguous strings of primary sequence
information corresponding to the nucleotide sequences depicted in SEQ ID NOS: 1-982.

The present invention further provides nucleotide sequences which are at least
95%, 96%, 97%, 98%, or 99% identical to the nucleotide sequences of SEQ ID NOS: 1-982.

The nucleotide sequence of SEQ ID NOS: 1-982, a representative fragment
thereof, or a nucleotide sequence which is at least 95% identical to the nucleotide
sequence of SEQ ID NOS: 1-982 may be provided in a variety of media to facilitate its
use. In one application of this embodiment, the sequences of the present invention are
recorded on computer readable media. Such media include, but are not limited to: magnetic
storage media, such as floppy discs, hard disc storage medium, and magnetic tape; optical
storage media such as CD-ROM; electrical storage media such as RAM and ROM; and
hybrids of these categories such as magnetic/optical storage media.

The present invention further provides systems, particularly computer-based
systems which contain the sequence information herein described stored in a data storage
means. Such systems are designed to identify commercially important fragments of the
Enterococcus faecalis genome.

Another embodiment of the present invention is directed to fragments of the
Enterococcus faecalis genome having particular structural or functional attributes. Such
fragments of the Enterococcus faecalis genome of the present invention include, but are
not limited to, fragments which encode peptides, hereinafter referred to as open reading
frames or ORFs, fragments which modulate the expression of an operably linked ORF,
hereinafter referred to as expression modulating fragments or EMFs, and fragments
which can be used to diagnose the presence of Enterococcus faecalis in a sample,
hereinafter referred to as diagnostic fragments or DFs.

Each of the ORFs in fragments of the Enterococcus faecalis genome disclosed in
Tables 1-3, and the EMFs found 5' of the initiation codon, can be used in numerous
ways as polynucleotide reagents. For instance, the sequences can be used as diagnostic
probes or amplification primers for detecting or determining the presence of a specific
microbe in a sample, to selectively control gene expression in a host, and in the
production of polypeptides, such as polypeptides encoded by ORFs of the present
invention, particularly those polypeptides that have a pharmacological activity.

The present invention further includes recombinant constructs comprising one or
more fragments of the Enterococcus faecalis genome of the present invention. The
recombinant constructs of the present invention comprise vectors, such as a plasmid or
viral vector, into which a fragment of the Enterococcus faecalis genome has been inserted.

The present invention further provides host cells containing any of the isolated
fragments of the Enterococcus faecalis genome of the present invention. The host cell
can be a higher eukaryotic host cell, such as a mammalian cell, a lower eukaryotic cell,
such as a yeast cell, or a procaryotic cell such as a bacterial cell.

The present invention is further directed to isolated polypeptides and proteins
encoded by ORFs of the present invention. A variety of methods, well known to those of
skill in the art, routinely may be utilized to obtain any of the polypeptides and proteins
of the present invention. For instance, polypeptides and proteins of the present
invention having relatively short, simple amino acid sequences readily can be synthesized
using commercially available automated peptide synthesizers. Polypeptides and proteins
of the present invention also may be purified from bacterial cells which naturally produce
the protein. Yet another alternative is to purify polypeptides and proteins of the present
invention from cells which have been altered to express them.

The invention further provides methods of obtaining homologs of the fragments
of the Enterococcus faecalis genome of the present invention and homologs of the
proteins encoded by the ORFs of the present invention. Specifically, by using the
nucleotide and amino acid sequences disclosed herein as a probe or as primers, and
techniques such as PCR cloning and colony/plaque hybridization, one skilled in the art can
obtain homologs.

The invention further provides antibodies which selectively bind polypeptides and 
proteins of the present invention. Such antibodies include both monoclonal and 
polyclonal antibodies. 

The invention further provides hybridomas which produce the above-described
antibodies. A hybridoma is an immortalized cell line which is capable of secreting a
specific monoclonal antibody.

The present invention further provides methods of identifying test samples
derived from cells which express one of the ORFs of the present invention, or a homolog
thereof. Such methods comprise incubating a test sample with one or more of the
antibodies of the present invention, or one or more of the DFs of the present invention,
under conditions which allow a skilled artisan to determine if the sample contains the
ORF or product produced therefrom.

In another embodiment of the present invention, kits are provided which contain
the necessary reagents to carry out the above-described assays.

Specifically, the invention provides a compartmentalized kit to receive, in close
confinement, one or more containers, which comprises: (a) a first container comprising
one of the antibodies, or one of the DFs of the present invention; and (b) one or more
other containers comprising one or more of the following: wash reagents, reagents
capable of detecting presence of bound antibodies or hybridized DFs.

Using the isolated proteins of the present invention, the present invention
further provides methods of obtaining and identifying agents capable of binding to a
polypeptide or protein encoded by one of the ORFs of the present invention.
Specifically, such agents include, as further described below, antibodies, peptides,
carbohydrates, pharmaceutical agents and the like. Such methods comprise steps of:
(a) contacting an agent with an isolated protein encoded by one of the ORFs of the
present invention; and (b) determining whether the agent binds to said protein.

The present genomic sequences of Enterococcus faecalis will be of great value to
all laboratories working with this organism and for a variety of commercial purposes.
Many fragments of the Enterococcus faecalis genome will be immediately identified by
similarity searches against GenBank or protein databases and will be of immediate value to
Enterococcus faecalis researchers and of immediate commercial value for the production
of proteins or to control gene expression.

The methodology and technology for elucidating extensive genomic sequences of
bacterial and other genomes has greatly enhanced, and will continue to enhance, the
ability to analyze and understand chromosomal organization. In particular, sequenced
contigs and genomes will provide the models for developing tools for the analysis of
chromosome structure and function, including the ability to identify genes within large
segments of genomic DNA, the structure, position, and spacing of regulatory elements,
the identification of genes with potential industrial applications, and the ability to do
comparative genomics and molecular phylogeny.


DESCRIPTION OF THE FIGURES 

FIGURE 1 is a block diagram of a computer system (102) that can be used to
implement computer-based systems of the present invention.

FIGURE 2 is a schematic diagram depicting the data flow and computer programs
used to collect, assemble, edit and annotate the contigs of the Enterococcus faecalis
genome of the present invention. Both Macintosh and Unix platforms are used to handle
the AB 373 and 377 sequence data files, largely as described in Kerlavage et al.,
Proceedings of the Twenty-Sixth Annual Hawaii International Conference on System
Sciences, 585, IEEE Computer Society Press, Washington D.C. (1993). Factura (AB) is a
Macintosh program designed for automatic vector sequence removal and end-trimming of



WO 98/50555 



PCT/US98/08985 




sequence files. The program Sequis runs on a Macintosh platform and parses the feature
data extracted from the sequence files by Factura to the Unix-based Enterococcus faecalis
relational database. Assembly of contigs (and whole genome sequences) is accomplished
by retrieving a specific set of sequence files and their associated features using Extrseq, a
Unix utility for retrieving sequences from an SQL database. The resulting sequence file is
processed by seq_filter to trim portions of the sequences with more than 1% ambiguous
nucleotides. The sequence files were assembled using TIGR Assembler, an assembly engine
designed at The Institute for Genomic Research (TIGR) for rapid and accurate assembly
of thousands of sequence fragments. The collection of contigs generated by the assembly
step is loaded into the database with the lassie program. Identification of open reading
frames (ORFs) is accomplished by processing contigs with GeneMark, described in
Borodovsky, M. and McIninch, J.D. (1993) Comput. Chem. 17:123-133. The ORFs are
searched against E. faecalis sequences from GenBank and against all protein sequences
using the BLASTN and BLASTP programs, described in Altschul et al., J. Mol. Biol. 215:
403-410 (1990). Results of the ORF determination and similarity searching steps were
loaded into the database. As described below, some results of the determination and the
searches are set out in Tables 1-3.
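
By way of illustration only, the 1% ambiguity threshold applied by seq_filter can be
sketched as follows. The internal logic of seq_filter is not described herein; the sketch
assumes a simple whole-read check in which any base other than A, C, G or T counts as
ambiguous, and the function names are hypothetical.

```python
# Illustrative sketch only: seq_filter's actual trimming logic is not disclosed.
# Assumption: any character other than A, C, G, or T is an ambiguous base call.

UNAMBIGUOUS_BASES = frozenset("ACGT")


def ambiguous_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are not A, C, G, or T."""
    if not seq:
        return 0.0
    ambiguous = sum(1 for base in seq.upper() if base not in UNAMBIGUOUS_BASES)
    return ambiguous / len(seq)


def passes_ambiguity_filter(seq: str, max_fraction: float = 0.01) -> bool:
    """Return True if seq has no more than max_fraction ambiguous bases."""
    return ambiguous_fraction(seq) <= max_fraction
```

A read that fails this check would be a candidate for trimming or exclusion before
assembly; a windowed variant could be used to locate the ambiguous portion of a read.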

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

The present invention is based on the sequencing of fragments of the 
Enterococcus faecalis genome and analysis of the sequences. The primary nucleotide 
sequences generated by sequencing the fragments are provided in SEQ ID NOS: 1-982. 
(As used herein, the "primary sequence" refers to the nucleotide sequence represented by 
the IUPAC nomenclature system.) 

In addition to the aforementioned Enterococcus faecalis polynucleotide and
polynucleotide sequences, the present invention provides the nucleotide sequences of SEQ
ID NOS: 1-982, or representative fragments thereof, in a form which can be readily used,
analyzed, and interpreted by a skilled artisan.

As used herein, a "representative fragment of the nucleotide sequence depicted in
SEQ ID NOS: 1-982" refers to any portion of SEQ ID NOS: 1-982 which is not
presently represented within a publicly available database. Preferred representative
fragments of the present invention are Enterococcus faecalis open reading frames
(ORFs), expression modulating fragments (EMFs) and fragments which can be used to
diagnose the presence of Enterococcus faecalis in a sample (DFs). A non-limiting
identification of preferred representative fragments is provided in Tables 1-3. As
discussed in detail below, the information provided in SEQ ID NOS: 1-982 and in Tables
1-3, together with routine cloning, synthesis, sequencing and assay methods, will enable
those skilled in the art to clone and sequence all "representative fragments" of interest,
including open reading frames encoding a large variety of Enterococcus faecalis proteins.

The present invention is further directed to nucleic acid molecules encoding
portions or fragments of the nucleotide sequences described herein. Fragments include
portions of the nucleotide sequences of Tables 1-3 and SEQ ID NOS: 1-982, at least 10
contiguous nucleotides in length, selected from any two integers, one of which
represents a 5' nucleotide position and the second of which represents a 3' nucleotide
position, where the first nucleotide for each nucleotide sequence in SEQ ID NOS: 1-982 is
position 1. That is, every combination of a 5' and 3' nucleotide position that a fragment
at least 10 contiguous nucleotides in length could occupy is included in the invention. "At
least" means a fragment may be 10 contiguous nucleotide bases in length or any integer
between 10 and the length of an entire nucleotide sequence of SEQ ID NOS: 1-982 minus
1. Therefore, included in the invention are contiguous fragments specified by any 5' and
3' nucleotide base positions of a nucleotide sequence of SEQ ID NOS: 1-982 wherein the
contiguous fragment is any integer between 10 and the length of an entire nucleotide
sequence minus 1.
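
The set of fragments defined above by 5' and 3' positions can be enumerated mechanically.
The following is a minimal sketch, assuming 1-based inclusive positions as stated above
(the first nucleotide is position 1); the function names are hypothetical and are not part
of the disclosure.

```python
# Illustrative sketch: enumerate every fragment of at least `min_len` contiguous
# nucleotides and at most (full length - 1), identified by 1-based inclusive
# 5' and 3' positions. Function names are hypothetical.

from typing import Iterator, Tuple


def iter_fragments(seq: str, min_len: int = 10) -> Iterator[Tuple[int, int, str]]:
    """Yield (five_prime, three_prime, fragment) for each qualifying fragment."""
    full_len = len(seq)
    for five_prime in range(1, full_len + 1):
        first_end = five_prime + min_len - 1
        for three_prime in range(first_end, full_len + 1):
            frag_len = three_prime - five_prime + 1
            if frag_len < full_len:  # the entire sequence itself is excluded
                yield five_prime, three_prime, seq[five_prime - 1:three_prime]


def count_fragments(seq_len: int, min_len: int = 10) -> int:
    """Count qualifying fragments: (seq_len - n + 1) placements for each length n."""
    return sum(seq_len - n + 1 for n in range(min_len, seq_len))
```

For a sequence of 12 nucleotides, for example, the qualifying fragments are the three of
length 10 and the two of length 11.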

Further, the invention includes polynucleotides comprising fragments specified by
size, in nucleotides, rather than by nucleotide positions. The invention includes any
fragment size, in contiguous nucleotides, selected from integers between 10 and the length
of an entire nucleotide sequence minus 1. Preferred sizes of contiguous nucleotide
fragments include 20 nucleotides, 30 nucleotides, 40 nucleotides, and 50 nucleotides. Other
preferred sizes of contiguous nucleotide fragments, which may be useful as diagnostic
probes and primers, include fragments 50-300 nucleotides in length which include, as
discussed above, fragment sizes representing each integer between 50 and 300. Larger
fragments are also useful according to the present invention, corresponding to most, if not
all, of the nucleotide sequences shown in SEQ ID NOS: 1-982. The preferred sizes are, of
course, meant to exemplify, not limit, the present invention, as all size fragments,
representing any integer between 10 and the length of an entire nucleotide sequence
minus 1, of each SEQ ID NO, are included in the invention.

The present invention also provides for the exclusion of any fragment, specified
by 5' and 3' base positions or by size in nucleotide bases as described above, for any
nucleotide sequence of SEQ ID NOS: 1-982. Any number of fragments of nucleotide
sequences in SEQ ID NOS: 1-982, specified by 5' and 3' base positions or by size in
nucleotides, as described above, may be excluded from the present invention.

While the presently disclosed sequences of SEQ ID NOS: 1-982 are highly
accurate, sequencing techniques are not perfect and, in relatively rare instances, further
investigation of a fragment or sequence of the invention may reveal a nucleotide
sequence error present in a nucleotide sequence disclosed in SEQ ID NOS: 1-982.
However, once the present invention is made available (i.e., once the information in SEQ
ID NOS: 1-982 and Tables 1-3 has been made available), resolving a rare sequencing error
in SEQ ID NOS: 1-982 will be well within the skill of the art. The present disclosure
makes available sufficient sequence information to allow any of the described contigs or
portions thereof to be obtained readily by straightforward application of routine
techniques. Further sequencing of such polynucleotides may proceed in like manner using
manual and automated sequencing methods which are employed ubiquitously in the art.
Nucleotide sequence editing software is publicly available. For example, Applied
Biosystems' (AB) AutoAssembler can be used as an aid during visual inspection of
nucleotide sequences. By employing such routine techniques potential errors readily may
be identified and the correct sequence then may be ascertained by targeting further
sequencing effort, also of a routine nature, to the region containing the potential error.
Even if all of the very rare sequencing errors in SEQ ID NOS: 1-982 were
corrected, the resulting nucleotide sequences would still be at least 95% identical, nearly
all would be at least 99% identical, and the great majority would be at least 99.9%
identical to the nucleotide sequences of SEQ ID NOS: 1-982.

As discussed elsewhere herein, polynucleotides of the present invention readily
may be obtained by routine application of well known and standard procedures for cloning
and sequencing DNA. A wide variety of Enterococcus faecalis strains that can be used to
prepare E. faecalis genomic DNA for cloning and for obtaining polynucleotides of the
present invention are available to the public from recognized depository institutions, such
as the American Type Culture Collection (ATCC). While the present invention is
enabled by the sequences and other information herein disclosed, the E. faecalis strain
that provided the DNA of the present Sequence Listing, Strain V586, kindly provided by
Dr. Michael Gilmore, University of Oklahoma, has been deposited in the ATCC, as a
convenience to those of skill in the art. The E. faecalis strain V586 was deposited 2 May
1997 at the ATCC, 10801 University Blvd., Manassas, VA 20110-2209, and given
accession number 55969. The provision of the deposits is not a waiver of any rights of
the inventors or their assignees in the present subject matter.

The nucleotide sequences of the genomes from different strains of Enterococcus
faecalis differ somewhat. However, the nucleotide sequences of the genomes of all
Enterococcus faecalis strains will be at least 95% identical, in corresponding part, to the
nucleotide sequences provided in SEQ ID NOS: 1-982. Nearly all will be at least 99%
identical and the great majority will be 99.9% identical.

The present application is further directed to nucleic acid molecules at least 90%,
95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in SEQ ID NOS:
1-982. The above nucleic acid sequences are included irrespective of whether they encode
a polypeptide having E. faecalis activity. This is because even where a particular nucleic
acid molecule does not encode a polypeptide having E. faecalis activity, one of skill in
the art would still know how to use the nucleic acid molecule, for instance, as a
hybridization probe. Uses of the nucleic acid molecules of the present invention that do
not encode a polypeptide having E. faecalis activity include, inter alia, isolating an E.
faecalis gene or allelic variants thereof from a DNA library, and detecting E. faecalis
mRNA expression in samples, such as environmental samples, suspected of containing
E. faecalis, by Northern blot analysis.

Preferred are nucleic acid molecules having sequences at least 90%, 95%, 96%,
97%, 98% or 99% identical to the nucleic acid sequence shown in SEQ ID NOS: 1-982,
which do, in fact, encode a polypeptide having E. faecalis protein activity. By "a
polypeptide having E. faecalis activity" is intended polypeptides exhibiting activity
similar, but not necessarily identical, to an activity of the E. faecalis protein of the
invention, as measured in a particular biological assay suitable for measuring activity of
the specified protein.

Due to the degeneracy of the genetic code, one of ordinary skill in the art will
immediately recognize that a large number of the nucleic acid molecules having a
sequence at least 90%, 95%, 96%, 97%, 98%, or 99% identical to the nucleic acid
sequences shown in SEQ ID NOS: 1-982 will encode a polypeptide having E. faecalis
protein activity. In fact, since degenerate variants of these nucleotide sequences all
encode the same polypeptide, this will be clear to the skilled artisan even without
performing the above described comparison assay. It will be further recognized in the art
that, for such nucleic acid molecules that are not degenerate variants, a reasonable number
will also encode a polypeptide having E. faecalis protein activity. This is because the
skilled artisan is fully aware of amino acid substitutions that are either less likely or not
likely to significantly affect protein function (e.g., replacing one aliphatic amino acid
with a second aliphatic amino acid), as further described below.

The biological activity or function of the polypeptides of the present invention
is expected to be similar or identical to that of polypeptides from other bacteria that
share a high degree of structural identity/similarity. Tables 1 and 2 list accession numbers
and descriptions for the closest matching sequences of polypeptides available through
GenBank. It is therefore expected that the biological activity or function of the
polypeptides of the present invention will be similar or identical to those polypeptides
from other bacterial genera, species, or strains listed in Tables 1 and 2.

By a polynucleotide having a nucleotide sequence at least, for example, 95%
"identical" to a reference nucleotide sequence of the present invention, it is intended that
the nucleotide sequence of the polynucleotide is identical to the reference sequence
except that the polynucleotide sequence may include up to five point mutations per each
100 nucleotides of the reference nucleotide sequence encoding the E. faecalis
polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence at
least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the
reference sequence may be deleted, inserted, or substituted with another nucleotide. The
query sequence may be an entire sequence shown in SEQ ID NOS: 1-982, the ORF (open
reading frame), or any fragment specified as described herein.

As a practical matter, whether any particular nucleic acid molecule or polypeptide
is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the
present invention can be determined conventionally using known computer programs.
A preferred method for determining the best overall match between a query sequence (a
sequence of the present invention) and a subject sequence, also referred to as a global
sequence alignment, uses the FASTDB computer program based on
the algorithm of Brutlag et al. See Brutlag et al. (1990) Comp. App. Biosci. 6:237-245.
In a sequence alignment the query and subject sequences are both DNA sequences. An
RNA sequence can be compared by first converting U's to T's. The result of said global
sequence alignment is in percent identity. Preferred parameters used in a FASTDB
alignment of DNA sequences to calculate percent identity are: Matrix=Unitary,
k-tuple=4, Mismatch Penalty=1, Joining Penalty=30, Randomization Group Length=0,
Cutoff Score=1, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length
of the subject nucleotide sequence, whichever is shorter.
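
For convenience, the preferred parameter values recited above may be recorded in a simple
data structure, together with the U-to-T conversion and the window-size rule. The sketch
below is illustrative only: the dictionary keys and function names are hypothetical and do
not correspond to any actual FASTDB input format or command-line option.

```python
# Illustrative sketch only. Values are those recited in the text; the key names
# and helper functions are hypothetical and are not a FASTDB interface.

FASTDB_DNA_PARAMS = {
    "matrix": "Unitary",
    "k_tuple": 4,
    "mismatch_penalty": 1,
    "joining_penalty": 30,
    "randomization_group_length": 0,
    "cutoff_score": 1,
    "gap_penalty": 5,
    "gap_size_penalty": 0.05,
}


def window_size(subject_len: int, default: int = 500) -> int:
    """Window Size is 500 or the subject length, whichever is shorter."""
    return min(default, subject_len)


def rna_to_dna(seq: str) -> str:
    """Convert an RNA sequence to DNA letters by replacing U with T (case kept)."""
    return seq.replace("U", "T").replace("u", "t")
```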

If the subject sequence is shorter than the query sequence because of 5' or 3'
deletions, not because of internal deletions, a manual correction must be made to the
results. This is because the FASTDB program does not account for 5' and 3' truncations
of the subject sequence when calculating percent identity. For subject sequences truncated
at the 5' or 3' ends, relative to the query sequence, the percent identity is corrected by
calculating the number of bases of the query sequence that are 5' and 3' of the subject
sequence, which are not matched/aligned, as a percent of the total bases of the query
sequence. Whether a nucleotide is matched/aligned is determined by results of the
FASTDB sequence alignment. This percentage is then subtracted from the percent
identity, calculated by the above FASTDB program using the specified parameters, to
arrive at a final percent identity score. This corrected score is what is used for the
purposes of the present invention. Only nucleotides outside the 5' and 3' nucleotides of
the subject sequence, as displayed by the FASTDB alignment, which are not
matched/aligned with the query sequence, are calculated for the purposes of manually
adjusting the percent identity score.
For example, a 90 nucleotide subject sequence is aligned to a 100 nucleotide query
sequence to determine percent identity. The deletions occur at the 5' end of the subject
sequence and therefore, the FASTDB alignment does not show a match/alignment of
the first 10 nucleotides at the 5' end. The 10 unpaired nucleotides represent 10% of the
sequence (number of nucleotides at the 5' and 3' ends not matched/total number of
nucleotides in the query sequence) so 10% is subtracted from the percent identity score
calculated by the FASTDB program. If the remaining 90 nucleotides were perfectly
matched the final percent identity would be 90%. In another example, a 90 nucleotide
subject sequence is compared with a 100 nucleotide query sequence. This time the
deletions are internal deletions so that there are no nucleotides on the 5' or 3' ends of the
subject sequence which are not matched/aligned with the query. In this case the percent
identity calculated by FASTDB is not manually corrected. Once again, only nucleotides
5' and 3' of the subject sequence which are not matched/aligned with the query sequence
are manually corrected for. No other manual corrections are to be made for the purposes of
the present invention.
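
The manual correction described above reduces to a single subtraction. The following is a
minimal sketch, assuming the FASTDB percent identity and the counts of unmatched query
bases lying 5' and 3' of the subject sequence have already been read from the alignment;
the function name and arguments are hypothetical.

```python
# Illustrative sketch of the manual correction for 5'/3' truncations.
# Inputs are assumed to be taken from a FASTDB alignment; names are hypothetical.


def corrected_percent_identity(
    fastdb_identity: float,
    query_len: int,
    unmatched_5prime: int = 0,
    unmatched_3prime: int = 0,
) -> float:
    """Subtract the percentage of query bases left unaligned beyond the subject ends.

    Internal deletions are not passed in here; they receive no manual correction.
    """
    if query_len <= 0:
        raise ValueError("query_len must be positive")
    overhang_percent = 100.0 * (unmatched_5prime + unmatched_3prime) / query_len
    return fastdb_identity - overhang_percent
```

Applied to the first example above (100 nucleotide query, 10 unmatched nucleotides at the
5' end, remaining 90 nucleotides perfectly matched), the function subtracts 10% from 100%
to give 90%. In the second example no terminal bases are unmatched, so the FASTDB score is
returned unchanged.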

COMPUTER RELATED EMBODIMENTS 

The nucleotide sequences provided in SEQ ID NOS: 1-982, a representative
fragment thereof, or a nucleotide sequence at least 95%, preferably at least 99% and most
preferably at least 99.9% identical to a polynucleotide sequence of SEQ ID NOS: 1-982
may be "provided" in a variety of media to facilitate use thereof. As used herein,
"provided" refers to a manufacture, other than an isolated nucleic acid molecule, which
contains a nucleotide sequence of the present invention; i.e., a nucleotide sequence
provided in SEQ ID NOS: 1-982, a representative fragment thereof, or a nucleotide
sequence at least 95%, preferably at least 99% and most preferably at least 99.9%
identical to a polynucleotide of SEQ ID NOS: 1-982. Such a manufacture provides a large
portion of the Enterococcus faecalis genome and parts thereof (e.g., an Enterococcus
faecalis open reading frame (ORF)) in a form which allows a skilled artisan to examine
the manufacture using means not directly applicable to examining the Enterococcus
faecalis genome or a subset thereof as it exists in nature or in purified form.

In one application of this embodiment, a nucleotide sequence of the present
invention can be recorded on computer readable media. As used herein, "computer
readable media" refers to any medium which can be read and accessed directly by a
computer. Such media include, but are not limited to: magnetic storage media, such as
floppy discs, hard disc storage medium, and magnetic tape; optical storage media such as
CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these
categories, such as magnetic/optical storage media. A skilled artisan can readily
appreciate how any of the presently known computer readable mediums can be used to
create a manufacture comprising computer readable medium having recorded thereon a
nucleotide sequence of the present invention. Likewise, it will be clear to those of skill
how additional computer readable media that may be developed also can be used to create
analogous manufactures having recorded thereon a nucleotide sequence of the present
invention.

As used herein, "recorded" refers to a process for storing information on
computer readable medium. A skilled artisan can readily adopt any of the presently known
methods for recording information on computer readable medium to generate
manufactures comprising the nucleotide sequence information of the present invention.
A variety of data storage structures are available to a skilled artisan for creating a
computer readable medium having recorded thereon a nucleotide sequence of the present
invention. The choice of the data storage structure will generally be based on the means
chosen to access the stored information. In addition, a variety of data processor
programs and formats can be used to store the nucleotide sequence information of the
present invention on computer readable medium. The sequence information can be
represented in a word processing text file, formatted in commercially-available software
such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file,
stored in a database application, such as DB2, Sybase, Oracle, or the like. A skilled artisan
can readily adapt any number of data-processor structuring formats (e.g., text file or
database) in order to obtain computer readable medium having recorded thereon the
nucleotide sequence information of the present invention.
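
As a minimal sketch of the two storage formats just mentioned, the Python example below
records one sequence both as an ASCII text file and as a row in a database table. The
choices here are assumptions for illustration: SQLite stands in for the database
applications named above, the FASTA-style layout is one of many possible text formats, and
the record identifier and sequence are hypothetical placeholders rather than entries from
the sequence listing.

```python
import os
import sqlite3
import tempfile

# Hypothetical placeholder record; not an actual sequence from the sequence listing.
record_id = "example_contig_1"
sequence = "ATGGCTAAAGGTGAACGTTAA"

workdir = tempfile.mkdtemp()

# Format 1: plain ASCII text file, written in a FASTA-style layout.
fasta_path = os.path.join(workdir, "sequences.fasta")
with open(fasta_path, "w", encoding="ascii") as handle:
    handle.write(f">{record_id}\n{sequence}\n")

# Format 2: a relational database table keyed by record identifier.
db_path = os.path.join(workdir, "sequences.db")
connection = sqlite3.connect(db_path)
connection.execute("CREATE TABLE sequences (id TEXT PRIMARY KEY, residues TEXT NOT NULL)")
connection.execute("INSERT INTO sequences VALUES (?, ?)", (record_id, sequence))
connection.commit()

# Read the record back from each medium.
with open(fasta_path, encoding="ascii") as handle:
    from_file = handle.read().splitlines()[1]
from_db = connection.execute(
    "SELECT residues FROM sequences WHERE id = ?", (record_id,)
).fetchone()[0]
connection.close()
```

Which structure is preferable depends, as the text notes, on how the stored information
will be accessed: a flat file suits sequential scanning, while a keyed table suits lookup
by identifier.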

Computer software is publicly available which allows a skilled artisan to access
sequence information provided in a computer readable medium. Thus, by providing in
computer readable form the nucleotide sequences of SEQ ID NOS: 1-982, a representative
fragment thereof, or a nucleotide sequence at least 95%, preferably at least 99% and most
preferably at least 99.9% identical to a sequence of SEQ ID NOS: 1-982, the present
invention enables the skilled artisan routinely to access the provided sequence
information for a wide variety of purposes.

The examples which follow demonstrate how software which implements the
BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al.,
Comp. Chem. 17:203-207 (1993)) search algorithms on a Sybase system was used to
identify open reading frames (ORFs) within the Enterococcus faecalis genome which
contain homology to ORFs or proteins from both Enterococcus faecalis and from other
organisms. Among the ORFs discussed herein are protein encoding fragments of the
Enterococcus faecalis genome useful in producing commercially important proteins, such
as enzymes used in fermentation reactions and in the production of commercially useful
metabolites, proteins to be used as vaccines or in the generation of immuno-therapeutic
reagents, or as drug screening targets.

The present invention further provides systems, particularly computer-based
systems, which contain the sequence information described herein. Such systems are
designed to identify, among other things, commercially important fragments of the
Enterococcus faecalis genome.

As used herein, "a computer-based system" refers to the hardware means, software
means, and data storage means used to analyze the nucleotide sequence information of the
present invention. The minimum hardware means of the computer-based systems of the
present invention comprises a central processing unit (CPU), input means, output means,
and data storage means. A skilled artisan can readily appreciate that any one of the
currently available computer-based systems is suitable for use in the present invention.

As stated above, the computer-based systems of the present invention comprise a
data storage means having stored therein a nucleotide sequence of the present invention
and the necessary hardware means and software means for supporting and implementing a
search means.

As used herein, "data storage means" refers to memory which can store nucleotide
sequence information of the present invention, or a memory access means which can
access manufactures having recorded thereon the nucleotide sequence information of the
present invention.

As used herein, "search means" refers to one or more programs which are
implemented on the computer-based system to compare a target sequence or target
structural motif with the sequence information stored within the data storage means.
Search means are used to identify fragments or regions of the present genomic sequences
which match a particular target sequence or target motif. A variety of known algorithms
are disclosed publicly, and a variety of commercially available software for conducting
search means is available and can be used in the computer-based systems of the present
invention. Examples of such software include, but are not limited to, MacPattern (EMBL),
BLASTN and BLASTX (NCBI). A skilled artisan can readily recognize that any one of the
available algorithms or implementing software packages for conducting homology
searches can be adapted for use in the present computer-based systems.
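
To make the notion of a search means concrete, the following Python sketch compares a
target sequence with stored sequences by exact matching on both strands. It is a
deliberately simple stand-in and is not the BLAST, BLASTX or MacPattern algorithm; the
function names and the stored sequences are hypothetical.

```python
def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_target(stored, target):
    """Return (record_id, strand, start) for every exact occurrence of target.

    start is zero-based and, for the "-" strand, counted along the
    reverse-complemented sequence.
    """
    hits = []
    for record_id, sequence in stored.items():
        for strand, text in (("+", sequence), ("-", reverse_complement(sequence))):
            start = text.find(target)
            while start != -1:
                hits.append((record_id, strand, start))
                start = text.find(target, start + 1)
    return hits


# Hypothetical stored sequences standing in for the data storage means.
stored_sequences = {
    "contig_A": "GGATCCATGAAACGTTAGGATCC",
    "contig_B": "TTTTCTAACGTTTCATAAAA",
}
hits = find_target(stored_sequences, "ATGAAACG")
```

Here the target is found on the forward strand of the first record and on the reverse
strand of the second. A homology search program would additionally tolerate mismatches
and gaps and would score each hit.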

As used herein, a "target sequence" can be any DNA or amino acid sequence of six
or more nucleotides or two or more amino acids. A skilled artisan can readily recognize
that the longer a target sequence is, the less likely a target sequence will be present as a
random occurrence in the database. The most preferred sequence length of a target
sequence is from about 10 to 100 amino acids or from about 30 to 300 nucleotide
residues. However, it is well recognized that searches for commercially important
fragments, such as sequence fragments involved in gene expression and protein
processing, may be of shorter length.
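
The relationship between target length and chance occurrence can be illustrated with a
rough estimate. The sketch below assumes, purely for illustration, a database of
independent, equally frequent residues searched on one strand; real genomes deviate from
this model, and the database size used is a hypothetical figure.

```python
def expected_random_matches(target_length, database_length, alphabet_size=4):
    """Expected number of chance exact matches under a uniform random model."""
    positions = max(database_length - target_length + 1, 0)
    return positions * (1.0 / alphabet_size) ** target_length


# Hypothetical database of 3,000,000 nucleotides.
for length in (6, 12, 30):
    print(length, expected_random_matches(length, 3_000_000))
```

Under this model each added nucleotide divides the expected number of chance matches by
four, so a six-nucleotide target is expected to recur many times by chance while a
30-nucleotide target is effectively unique.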

As used herein, "a target structural motif," or "target motif," refers to any
rationally selected sequence or combination of sequences in which the sequence(s) are
chosen based on a three-dimensional configuration which is formed upon the folding of
the target motif. There are a variety of target motifs known in the art. Protein target
motifs include, but are not limited to, enzymic active sites and signal sequences. Nucleic
acid target motifs include, but are not limited to, promoter sequences, hairpin structures
and inducible expression elements (protein binding sequences).

A variety of structural formats for the input and output means can be used to
input and output the information in the computer-based systems of the present
invention. A preferred format for an output means ranks fragments of the Enterococcus
faecalis genomic sequences possessing varying degrees of homology to the target sequence
or target motif. Such presentation provides a skilled artisan with a ranking of sequences
which contain various amounts of the target sequence or target motif and identifies the
degree of homology contained in the identified fragment.
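
A ranked output of the kind described above might be produced as in the following Python
sketch. The scoring used here, the best ungapped percent identity of the target against
any window of the fragment, is an assumption chosen for brevity and is much cruder than
the scores reported by homology search programs; the fragment names and sequences are
hypothetical.

```python
def best_ungapped_identity(fragment, target):
    """Highest percent identity of target against any same-length window of fragment."""
    if len(target) == 0 or len(fragment) < len(target):
        return 0.0
    best = 0
    for offset in range(len(fragment) - len(target) + 1):
        window = fragment[offset:offset + len(target)]
        matches = sum(a == b for a, b in zip(window, target))
        best = max(best, matches)
    return 100.0 * best / len(target)


def rank_fragments(fragments, target):
    """Return (name, percent identity) pairs, best match first."""
    scored = [(name, best_ungapped_identity(seq, target)) for name, seq in fragments.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical fragments with zero, one and three mismatches to the target.
fragments = {
    "fragment_3": "CCGTTGATACCTCC",
    "fragment_1": "CCGATGAAACGTCC",
    "fragment_2": "CCGATGATACGTCC",
}
ranking = rank_fragments(fragments, "ATGAAACG")
```

The resulting list places the exact match first and reports the degree of identity
alongside each fragment.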

A variety of comparing means can be used to compare a target sequence or target
motif with the data storage means to identify sequence fragments of the Enterococcus
faecalis genome. In the present examples, software which implements the
BLAST algorithm, described in Altschul et al. (1990) J. Mol. Biol. 215:403-410, is used
to identify open reading frames within the Enterococcus faecalis genome. A skilled
artisan can readily recognize that any one of the publicly available homology search
programs can be used as the search means for the computer-based systems of the present
invention. Of course, suitable proprietary systems that may be known to those of skill
also may be employed in this regard.

Figure 1 provides a block diagram of a computer system illustrative of
embodiments of this aspect of the present invention. The computer system 102 includes a
processor 106 connected to a bus 104. Also connected to the bus 104 are a main
memory 108 (preferably implemented as random access memory, RAM) and a variety of
secondary storage devices 110, such as a hard drive 112 and a removable medium storage
device 114. The removable medium storage device 114 may represent, for example, a
floppy disk drive, a CD-ROM drive, a magnetic tape drive, etc. A removable storage
medium 116 (such as a floppy disk, a compact disk, a magnetic tape, etc.) containing
control logic and/or data recorded therein may be inserted into the removable medium
storage device 114. The computer system 102 includes appropriate software for reading
the control logic and/or the data from the removable storage medium 116, once it
is inserted into the removable medium storage device 114.

A nucleotide sequence of the present invention may be stored in a well known
manner in the main memory 108, any of the secondary storage devices 110, and/or a
removable storage medium 116. During execution, software for accessing and processing
the genomic sequence (such as search tools, comparing tools, etc.) resides in main memory
108, in accordance with the requirements and operating parameters of the operating
system, the hardware system and the software program or programs.

BIOCHEMICAL EMBODIMENTS 

Other embodiments of the present invention are directed to isolated fragments of
the Enterococcus faecalis genome. The fragments of the Enterococcus faecalis genome
of the present invention include, but are not limited to, fragments which encode peptides,
hereinafter open reading frames (ORFs), fragments which modulate the expression of an
operably linked ORF, hereinafter expression modulating fragments (EMFs), and fragments
which can be used to diagnose the presence of Enterococcus faecalis in a sample,
hereinafter diagnostic fragments (DFs).

As used herein, an "isolated nucleic acid molecule" or an "isolated fragment of the
Enterococcus faecalis genome" refers to a nucleic acid molecule possessing a specific
nucleotide sequence which has been subjected to purification means to reduce, from the
composition, the number of compounds which are normally associated with the
composition. Particularly, the term refers to the nucleic acid molecules having the
sequences set out in SEQ ID NOS: 1-982, to representative fragments thereof as described
above, to polynucleotides at least 95%, preferably at least 99% and especially preferably
at least 99.9% identical in sequence thereto, also as set out above.

A variety of purification means can be used to generate the isolated fragments of
the present invention. These include, but are not limited to, methods which separate
constituents of a solution based on charge, solubility, or size.

In one embodiment, Enterococcus faecalis DNA can be enzymatically sheared to
produce fragments of 15-20 kb in length. These fragments can then be used to generate an
Enterococcus faecalis library by inserting them into lambda clones as described in the
Examples below. Primers flanking, for example, an ORF, such as those enumerated in
Tables 1-3, can then be generated using nucleotide sequence information provided in SEQ
ID NOS: 1-982. Well known and routine techniques of PCR cloning then can be used to
isolate the ORF from the lambda DNA library or Enterococcus faecalis genomic DNA.
Thus, given the availability of SEQ ID NOS: 1-982, the information in Tables 1, 2 and 3,
and the information that may be obtained readily by analysis of the sequences of SEQ ID
NOS: 1-982 using methods set out above, those of skill will be enabled by the present
disclosure to isolate any ORF-containing or other nucleic acid fragment of the present
invention.

The isolated nucleic acid molecules of the present invention include, but are not
limited to, single stranded and double stranded DNA, and single stranded RNA. As used
herein, an "open reading frame," ORF, means a series of triplets coding for amino acids
without any termination codons and is a sequence translatable into protein. Each
sequence of SEQ ID NOS: 1-982, however, begins and ends with a termination codon. For
purposes of numbering and reference to polynucleotide and polypeptide sequences the
entire sequence of each sequence of SEQ ID NOS: 1-982 is included with the first
nucleotide being position 1. Therefore, for reference purposes the numbering used in the
present invention is that provided in the sequence listing for SEQ ID NOS: 1-982.
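
The structure just described, a run of triplets free of termination codons and bounded
at each end by a termination codon, can be checked mechanically. The Python sketch below
assumes the standard termination codons TAA, TAG and TGA; the function names and test
sequences are hypothetical and are not taken from the sequence listing.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(dna):
    """Split a DNA string into consecutive triplets, ignoring a trailing partial codon."""
    return [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]


def is_stop_to_stop_orf(dna):
    """True if the sequence begins and ends with a stop codon and has none in between."""
    triplets = codons(dna)
    if len(dna) % 3 != 0 or len(triplets) < 3:
        return False
    inner = triplets[1:-1]
    return (
        triplets[0] in STOP_CODONS
        and triplets[-1] in STOP_CODONS
        and not any(codon in STOP_CODONS for codon in inner)
    )


bounded = is_stop_to_stop_orf("TAGATGAAACGTTAA")      # TAG ATG AAA CGT TAA
interrupted = is_stop_to_stop_orf("TAGATGTGACGTTAA")  # internal TGA breaks the frame
```

Position 1 in the numbering described above corresponds to index 0 of the string in this
sketch, that is, the first nucleotide of the leading termination codon.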

Tables 1, 2, and 3 list ORFs in the Enterococcus faecalis genomic contigs of the
present invention that were identified as putative coding regions by the GeneMark
software using organism-specific second-order Markov probability transition matrices. It
will be appreciated that other criteria can be used, in accordance with well known
analytical methods, such as those discussed herein, to generate more inclusive, more
restrictive, or more selective lists.

Table 1 sets out ORFs in the Enterococcus faecalis contigs of the present
invention that over a continuous region of at least 50 bases are 95% or more identical (by
BLAST analysis) to a nucleotide sequence available through GenBank in March, 1997.

Table 2 sets out ORFs in the Enterococcus faecalis contigs of the present
invention that are not in Table 1 and match, with a BLASTP probability score of 0.01 or
less, a polypeptide sequence available through GenBank in March, 1997.

Table 3 sets out ORFs in the Enterococcus faecalis contigs of the present
invention that do not match significantly, by BLASTP analysis, a polypeptide sequence
available through GenBank in March, 1997.

In each table, the first and second columns identify the ORF by, respectively,
contig number and ORF number within the contig; the third column indicates the
coordinate of the first nucleotide of the ORF, counting from the 5' end of the contig
strand; the fourth column indicates the coordinate of the final nucleotide of the ORF,
counting from the 5' end of the contig strand.

In Tables 1 and 2, column five lists the Reference for the closest matching
sequence available through GenBank. These reference numbers are the database entry
numbers commonly used by those of skill in the art, who will be familiar with their
denominators. Descriptions of the nomenclature are available from the National Center
for Biotechnology Information. Column six in Tables 1 and 2 provides the gene name of
the matching sequence.

In Table 1, column seven provides the nucleotide BLAST percent identity score
from the comparison of the ORF and the GenBank sequence, column eight indicates the
length in nucleotides of the highest scoring segment pair identified by the BLAST identity
analysis, and column nine provides the total length of the ORF in nucleotides.

In Table 2, column seven provides the protein BLAST percent similarity of the
highest scoring segment pair identified, column eight provides the percent identity of the
highest scoring segment pair, and column nine provides the total length of the ORF in
nucleotides.

The concepts of percent identity and percent similarity of two polypeptide
sequences are well understood in the art. For example, two polypeptides 10 amino acids in
length which differ at three amino acid positions (e.g., at positions 1, 3 and 5) are said to
have a percent identity of 70%. However, the same two polypeptides would be deemed to
have a percent similarity of 80% if, for example at position 5, the amino acid moieties,
although not identical, were "similar" (i.e., possessed similar biochemical characteristics).
Many programs for analysis of nucleotide or amino acid sequence similarity, such as FASTA
and BLAST, specifically list percent identity of a matching region as an output parameter.
Thus, for instance, Tables 1 and 2 herein enumerate the percent identity of the highest
scoring segment pair in each ORF and its listed relative. Further details concerning the
algorithms and criteria used for homology searches are provided below and are described in
the pertinent literature highlighted by the citations provided below.
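
The 70% identity and 80% similarity figures in the example above can be reproduced with
a few lines of Python. The grouping of "similar" residues below is an assumption made
for this sketch and is not the scoring matrix of FASTA, BLAST or any other cited program;
the two peptides are hypothetical.

```python
# Illustrative grouping of residues with similar biochemical character; the grouping
# is an assumption for this sketch, not a scoring matrix used by any cited program.
SIMILAR_GROUPS = [set("ILVM"), set("FWY"), set("KRH"), set("DE"), set("ST"), set("NQ")]


def is_similar(a, b):
    """True if two residues are identical or fall in the same illustrative group."""
    return a == b or any(a in group and b in group for group in SIMILAR_GROUPS)


def percent_identity_and_similarity(first, second):
    """Return (percent identity, percent similarity) for two equal-length peptides."""
    if len(first) != len(second) or not first:
        raise ValueError("sequences must be non-empty and of equal length")
    identical = sum(a == b for a, b in zip(first, second))
    similar = sum(is_similar(a, b) for a, b in zip(first, second))
    return 100.0 * identical / len(first), 100.0 * similar / len(first)


# Hypothetical 10-residue peptides differing at positions 1, 3 and 5;
# only the position-5 pair (L versus I) falls in the same group.
peptide_a = "MKALLGSTDE"
peptide_b = "GKPLIGSTDE"
identity, similarity = percent_identity_and_similarity(peptide_a, peptide_b)
```

Seven of the ten positions are identical, and the conservative substitution at position
5 raises the similarity count to eight.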

It will be appreciated that other criteria can be used to generate more inclusive
and more exclusive listings of the types set out in the tables. As those of skill will
appreciate, narrow and broad searches both are useful. Thus, a skilled artisan can readily
identify ORFs in contigs of the Enterococcus faecalis genome other than those listed in
Tables 1-3, such as ORFs which are overlapping or encoded by the opposite strand of an
identified ORF in addition to those ascertainable using the computer-based systems of the
present invention.

As used herein, an "expression modulating fragment," EMF, means a series of 
nucleotide molecules which modulates the expression of an operably linked ORF or EMF. 

As used herein, a sequence is said to "modulate the expression of an operably
linked sequence" when the expression of the sequence is altered by the presence of the
EMF. EMFs include, but are not limited to, promoters, and promoter modulating
sequences (inducible elements). One class of EMFs are fragments which induce the
expression of an operably linked ORF in response to a specific regulatory factor or
physiological event.

EMF sequences can be identified within the contigs of the Enterococcus faecalis
genome by their proximity to the ORFs provided in Tables 1-3. An intergenic segment,
or a fragment of the intergenic segment, from about 10 to 200 nucleotides in length,
taken from any one of the ORFs of Tables 1-3 will modulate the expression of an
operably linked ORF in a fashion similar to that found with the naturally linked ORF
sequence. As used herein, an "intergenic segment" refers to fragments of the
Enterococcus faecalis genome which are between two ORF(s) herein described. EMFs also
can be identified using known EMFs as a target sequence or target motif in the
computer-based systems of the present invention. Further, the two methods can be
combined and used together.
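
Locating intergenic segments from ORF coordinates of the kind tabulated in columns three
and four can be sketched as follows. This Python example is illustrative only: the
coordinates are hypothetical, a reverse-strand ORF is assumed to be listed with its first
coordinate larger than its last, and filtering whole gaps to 10-200 nucleotides is a
simplification, since the text also contemplates fragments of longer intergenic segments.

```python
def intergenic_segments(orf_coordinates, min_length=10, max_length=200):
    """Return 1-based inclusive (start, end) gaps between neighbouring ORFs on a contig."""
    # Normalize each ORF to (left, right) regardless of strand, then sort along the contig.
    spans = sorted((min(a, b), max(a, b)) for a, b in orf_coordinates)
    segments = []
    covered_to = None
    for left, right in spans:
        if covered_to is not None and left > covered_to + 1:
            gap = (covered_to + 1, left - 1)
            if min_length <= gap[1] - gap[0] + 1 <= max_length:
                segments.append(gap)
        covered_to = right if covered_to is None else max(covered_to, right)
    return segments


# Hypothetical ORF coordinates; the third ORF lies on the opposite strand.
orfs = [(1, 300), (351, 900), (1400, 905), (1405, 2000)]
candidates = intergenic_segments(orfs)
```

Only the 50-nucleotide gap between the first two ORFs passes the length filter; the two
4-nucleotide gaps are discarded. Candidate segments found this way would still need to be
confirmed experimentally.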

The presence and activity of an EMF can be confirmed using an EMF trap vector.
An EMF trap vector contains a cloning site linked to a marker sequence. A marker
sequence encodes an identifiable phenotype, such as antibiotic resistance or a
complementing nutrition auxotrophic factor, which can be identified or assayed when the
EMF trap vector is placed within an appropriate host under appropriate conditions. As
described above, an EMF will modulate the expression of an operably linked marker
sequence. A more detailed discussion of various marker sequences is provided below.

A sequence which is suspected as being an EMF is cloned in all three reading
frames in one or more restriction sites upstream from the marker sequence in the EMF
trap vector. The vector is then transformed into an appropriate host using known
procedures and the phenotype of the transformed host is examined under appropriate
conditions. As described above, an EMF will modulate the expression of an operably
linked marker sequence.

As used herein, a "diagnostic fragment," DF, means a series of nucleotide
molecules which selectively hybridize to Enterococcus faecalis sequences. DFs can be
readily identified by identifying unique sequences within contigs of the Enterococcus
faecalis genome, such as by using well-known computer analysis software, and by
generating and testing probes or amplification primers consisting of the DF sequence in
an appropriate diagnostic format which determines amplification or hybridization
selectivity.

The sequences falling within the scope of the present invention are not limited to
the specific sequences herein described, but also include allelic and species variations
thereof. Allelic and species variations can be routinely determined by comparing the
sequences provided in SEQ ID NOS: 1-982, a representative fragment thereof, or a
nucleotide sequence at least 99% and preferably 99.9% identical to SEQ ID NOS: 1-982,
with a sequence from another isolate of the same species. Furthermore, to accommodate
codon variability, the invention includes nucleic acid molecules coding for the same
amino acid sequences as do the specific ORFs disclosed herein. In other words, in the
coding region of an ORF, substitution of one codon for another which encodes the same
amino acid is expressly contemplated.
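
Whether two coding sequences differ only by such synonymous codon substitutions can be
tested by translating both and comparing the results. The Python sketch below builds the
standard genetic code table and applies it to two hypothetical sequences; the function
names are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order; "*" marks termination.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate consecutive triplets, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def encode_same_polypeptide(first, second):
    """True if two coding sequences translate to the same amino acid sequence."""
    return translate(first) == translate(second)


# Hypothetical coding sequences: the variant swaps in synonymous codons for K, R and L.
original = "ATGAAACGTCTGTAA"
variant = "ATGAAGAGATTATAA"
same_product = encode_same_polypeptide(original, variant)
```

The two sequences differ at five nucleotide positions yet encode the same polypeptide,
which is the sense in which codon variability is accommodated.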

Any specific sequence disclosed herein can be readily screened for errors by
resequencing a particular fragment, such as an ORF, in both directions (i.e., sequence both
strands). Alternatively, error screening can be performed by sequencing corresponding
polynucleotides of Enterococcus faecalis origin isolated by using part or all of the
fragments in question as a probe or primer.

Each of the ORFs of the Enterococcus faecalis genome disclosed in Tables 1, 2
and 3, and the EMFs found 5' to the ORFs, can be used as polynucleotide reagents in
numerous ways. For example, the sequences can be used as diagnostic probes or diagnostic
amplification primers to detect the presence of a specific microbe in a sample,
particularly Enterococcus faecalis. Especially preferred in this regard are ORFs such as
those of Table 3, which do not match previously characterized sequences from other
organisms and thus are most likely to be highly selective for Enterococcus faecalis. Also
particularly preferred are ORFs that can be used to distinguish between strains of
Enterococcus faecalis, particularly those that distinguish medically important strains, such
as drug-resistant strains.

In addition, the fragments of the present invention, as broadly described, can be
used to control gene expression through triple helix formation or antisense DNA or RNA,
both of which methods are based on the binding of a polynucleotide sequence to DNA or
RNA. Triple helix formation optimally results in a shut-off of RNA transcription from
DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into
polypeptide. Information from the sequences of the present invention can be used to
design antisense and triple helix-forming oligonucleotides. Polynucleotides suitable for
use in these methods are usually 20 to 40 bases in length and are designed to be
complementary to a region of the gene involved in transcription, for triple-helix
formation, or to the mRNA itself, for antisense inhibition. Both techniques have been
demonstrated to be effective in model systems, and the requisite techniques are well
known and involve routine procedures. Triple helix techniques are discussed in, for
example, Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456
(1988); and Dervan et al., Science 251:1360 (1991). Antisense techniques in general are
discussed in, for instance, Okano, J. Neurochem. 56:560 (1991) and
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca
Raton, FL (1988).
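
The complementarity requirement for an antisense oligonucleotide can be illustrated with
a short Python sketch. It only derives the reverse complement of a chosen mRNA region
within the 20 to 40 base range given above; it does not model hybridization strength,
secondary structure or specificity, and the function name and mRNA sequence are
hypothetical.

```python
def antisense_oligo(mrna, start, length=20):
    """Return a DNA oligonucleotide complementary to mrna[start:start + length]."""
    if start < 0 or not 20 <= length <= 40:
        raise ValueError("start must be non-negative and length between 20 and 40 bases")
    # Work in DNA letters so the oligonucleotide is reported as DNA.
    region = mrna[start:start + length].upper().replace("U", "T")
    if len(region) != length:
        raise ValueError("target region runs past the end of the sequence")
    # Complement each base, then reverse so the oligo reads 5' to 3'.
    return region.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Hypothetical mRNA sequence, written 5' to 3'.
mrna = "AUGAAACGUCUGGCUAAAGGUGAACGUUAA"
oligo = antisense_oligo(mrna, 0, 20)
```

The returned oligonucleotide pairs base for base, in antiparallel orientation, with the
first 20 nucleotides of the hypothetical mRNA.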

The present invention further provides recombinant constructs comprising one or
more fragments of the Enterococcus faecalis genomic fragments and contigs of the
present invention. Certain preferred recombinant constructs of the present invention
comprise a vector, such as a plasmid or viral vector, into which a fragment of the
Enterococcus faecalis genome has been inserted, in a forward or reverse orientation. In
the case of a vector comprising one of the ORFs of the present invention, the vector
may further comprise regulatory sequences, including for example, a promoter, operably
linked to the ORF. For vectors comprising the EMFs of the present invention, the
vector may further comprise a marker sequence or heterologous ORF operably linked to
the EMF.

Large numbers of suitable vectors and promoters are known to those of skill in
the art and are commercially available for generating the recombinant constructs of the
present invention. The following vectors are provided by way of example. Useful
bacterial vectors include phagescript, PsiX174, pBS SK (+ or -), pBS KS (+ or -), pNH8a,
pNH16a, pNH18a, pNH46a (available from Stratagene); pTrc99A, pKK223-3, pKK233-3,
pDR540, pRIT5 (available from Pharmacia). Useful eukaryotic vectors include
pWLneo, pSV2cat, pOG44, pXT1, pSG (available from Stratagene); pSVK3, pBPV, pMSG,
pSVL (available from Pharmacia).

Promoter regions can be selected from any desired gene using CAT
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two
appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters
include lacI, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and
mouse metallothionein-I. Selection of the appropriate vector and promoter is well
within the level of ordinary skill in the art.

The present invention further provides host cells containing any one of the
isolated fragments of the Enterococcus faecalis genomic fragments and contigs of the
present invention, wherein the fragment has been introduced into the host cell using
known methods. The host cell can be a higher eukaryotic host cell, such as a
mammalian cell, a lower eukaryotic host cell, such as a yeast cell, or a procaryotic cell,
such as a bacterial cell.

A polynucleotide of the present invention, such as a recombinant construct
comprising an ORF of the present invention, may be introduced into the host by a
variety of well established techniques that are standard in the art, such as calcium
phosphate transfection, DEAE-dextran mediated transfection and electroporation, which
are described in, for instance, Davis, L. et al., BASIC METHODS IN MOLECULAR
BIOLOGY (1986).

A host cell containing one of the fragments of the Enterococcus faecalis genomic
fragments and contigs of the present invention can be used in conventional manners to
produce the gene product encoded by the isolated fragment (in the case of an ORF) or can
be used to produce a heterologous protein under the control of the EMF. The
present invention further provides isolated polypeptides encoded by the nucleic acid
fragments of the present invention or by degenerate variants of the nucleic acid
fragments of the present invention. By "degenerate variant" is intended nucleotide
fragments which differ from a nucleic acid fragment of the present invention (e.g., an
ORF) by nucleotide sequence but, due to the degeneracy of the Genetic Code, encode an
identical polypeptide sequence.

Preferred nucleic acid fragments of the present invention are the ORFs depicted 
in Tables 2 and 3 which encode proteins. 

A variety of methodologies known in the art can be utilized to obtain any one of
the isolated polypeptides or proteins of the present invention. At the simplest level, the
amino acid sequence can be synthesized using commercially available peptide synthesizers.
This is particularly useful in producing small peptides and fragments of larger
polypeptides. Such short fragments as may be obtained most readily by synthesis are
useful, for example, in generating antibodies against the native polypeptide, as discussed
further below.

In an alternative method, the polypeptide or protein is purified from bacterial
cells which naturally produce the polypeptide or protein. One skilled in the art can
readily employ well-known methods for isolating polypeptides and proteins to isolate and
purify polypeptides or proteins of the present invention produced naturally by a bacterial
strain, or by other methods. Methods for isolation and purification that can be employed
in this regard include, but are not limited to, immunochromatography, HPLC,
size-exclusion chromatography, ion-exchange chromatography, and immuno-affinity
chromatography.

The polypeptides and proteins of the present invention also can be purified from
cells which have been altered to express the desired polypeptide or protein. Preferred
polypeptides and proteins of the present invention are polypeptides and proteins coded
for by the polynucleotides of SEQ ID NOS: 1-982, wherein the polypeptides and proteins
are coded in the same frame as the termination codon at the end of each sequence of SEQ
ID NOS: 1-982. As used herein, a cell is said to be altered to express a desired polypeptide
or protein when the cell, through genetic manipulation, is made to produce a polypeptide
or protein which it normally does not produce or which the cell normally produces at a
lower level. Those skilled in the art can readily adapt procedures for introducing and
expressing either recombinant or synthetic sequences into eukaryotic or prokaryotic cells
in order to generate a cell which produces one of the polypeptides or proteins of the
present invention.

The polypeptides of the present invention are preferably provided in an isolated
form, and preferably are substantially purified. A recombinantly produced version of the
E. faecalis polypeptide can be substantially purified by the one-step method described by
Smith et al. (1988) Gene 67:31-40. Polypeptides of the invention also can be purified
from natural or recombinant sources using antibodies directed against the polypeptides of
the invention in methods which are well known in the art of protein purification.

The invention further provides for isolated E. faecalis polypeptides comprising
an amino acid sequence selected from the group including: (a) the amino acid sequence of
a full-length E. faecalis polypeptide having the complete amino acid sequence from the
first methionine codon to the termination codon of each sequence listed in SEQ ID
NOS: 1-982, wherein said termination codon is at the end of each SEQ ID NO: and said
first methionine is the first methionine in frame with said termination codon; and (b) the
amino acid sequence of a full-length E. faecalis polypeptide having the complete amino
acid sequence in (a) excepting the N-terminal methionine.
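
The rule in (a) and (b), translation from the first methionine codon that is in frame
with the terminal termination codon, with or without the N-terminal methionine, can be
sketched in Python as follows. The sketch assumes the standard genetic code and treats
only ATG as a methionine codon; the function name and the test sequence are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order; "*" marks termination.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def full_length_polypeptide(dna, keep_initial_methionine=True):
    """Translate from the first ATG in frame with the final stop codon up to that stop."""
    # The reading frame is fixed by the termination codon at the 3' end.
    offset = len(dna) % 3
    triplets = [dna[i:i + 3] for i in range(offset, len(dna), 3)]
    if not triplets or CODON_TABLE[triplets[-1]] != "*":
        raise ValueError("sequence must end with a termination codon")
    residues = "".join(CODON_TABLE[t] for t in triplets[:-1])
    # Keep only residues downstream of the last upstream termination codon.
    reading_frame = residues.rsplit("*", 1)[-1]
    start = reading_frame.find("M")
    if start == -1:
        return ""
    polypeptide = reading_frame[start:]
    return polypeptide if keep_initial_methionine else polypeptide[1:]


# Hypothetical stop-to-stop sequence: TAG GCT ATG AAA CGT ATG CTG TAA
example = "TAGGCTATGAAACGTATGCTGTAA"
with_met = full_length_polypeptide(example)
without_met = full_length_polypeptide(example, keep_initial_methionine=False)
```

In the hypothetical sequence the second, internal methionine is retained as an ordinary
residue, because numbering begins at the first in-frame methionine.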

The polypeptides of the present invention also include polypeptides having an
amino acid sequence at least 80% identical, more preferably at least 90% identical, and
still more preferably 95%, 96%, 97%, 98% or 99% identical to those described in (a) and
(b) above.

The present invention is further directed to polynucleotides encoding portions or
fragments of the amino acid sequences described herein as well as to portions or fragments
of the isolated amino acid sequences described herein. Fragments include portions of the
amino acid sequences described herein that are at least 5 contiguous amino acids in length
and are specified by any two integers, one representing an N-terminal position and the
other a C-terminal position. The initiation codon of the polypeptides of the present
invention is position 1. The initiation codon (position 1) for purposes of the present
invention is the first methionine codon of
each sequence of SEQ ID NOS: 1-982 which is in frame with the termination codon at the
end of each said sequence. Every combination of an N-terminal and C-terminal position
that a fragment at least 5 contiguous amino acid residues in length could occupy, on any
given amino acid sequence encoded by a sequence of SEQ ID NOS: 1-982, is included in the
invention, i.e., from the initiation codon up to the termination codon. "At least" means a
fragment may be 5 contiguous amino acid residues in length or any integer between 5 and
the number of residues in a full length amino acid sequence minus 1. Therefore, included
in the invention are contiguous fragments specified by any N-terminal and C-terminal
positions of an amino acid sequence set forth in SEQ ID NOS: 1-982 wherein the length of
the contiguous fragment is any integer between 5 and the number of residues in a full
length sequence minus 1.

Further, the invention includes polypeptides comprising fragments specified by
size, in amino acid residues, rather than by N-terminal and C-terminal positions. The
invention includes any fragment size, in contiguous amino acid residues, selected from
integers between 5 and the number of residues in a full length sequence minus 1. Preferred
sizes of contiguous polypeptide fragments include about 5 amino acid residues, about 10
amino acid residues, about 20 amino acid residues, about 30 amino acid residues, about 40
amino acid residues, about 50 amino acid residues, about 100 amino acid residues, about
200 amino acid residues, about 300 amino acid residues, and about 400 amino acid
residues. The preferred sizes are, of course, meant to exemplify, not limit, the present
invention as all size fragments representing any integer between 5 and the number of
residues in a full length sequence minus 1 are included in the invention. The present
invention also provides for the exclusion of any fragments specified by N-terminal and
C-terminal positions or by size in amino acid residues as described above. Any number of
fragments specified by N-terminal and C-terminal positions or by size in amino acid
residues as described above may be excluded.
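
The set of fragments described above can be made concrete with a small sketch. It is offered only as an illustration, assuming 1-based, inclusive residue positions with the initiating methionine at position 1; the function names fragments and count_fragments are hypothetical.

```python
from typing import Iterator, Tuple


def fragments(full_length: int, min_len: int = 5) -> Iterator[Tuple[int, int]]:
    """Yield every (N-terminal, C-terminal) position pair, 1-based and
    inclusive, for fragments of min_len up to full_length - 1 residues."""
    for n_term in range(1, full_length + 1):
        for c_term in range(n_term + min_len - 1, full_length + 1):
            # Skip the single pair that spans the whole sequence.
            if c_term - n_term + 1 < full_length:
                yield (n_term, c_term)


def count_fragments(full_length: int, min_len: int = 5) -> int:
    """Count the same fragments by size: a fragment of k residues can occupy
    full_length - k + 1 positions on the parent sequence."""
    return sum(full_length - k + 1 for k in range(min_len, full_length))


# For a 7-residue sequence the allowed sizes are 5 and 6 residues.
print(list(fragments(7)))
print(count_fragments(7))
```

Specifying fragments by position pair and specifying them by size describe the same set, which is why the two counts agree.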

The above fragments need not be active since they would be useful, for example, 
in immunoassays, in epitope mapping, epitope tagging, to generate antibodies to a 
particular portion of the protein, as vaccines, and as molecular weight markers. 

Further polypeptides of the present invention include polypeptides which have at
least 90% similarity, more preferably at least 95% similarity, and still more preferably at
least 96%, 97%, 98% or 99% similarity to those described above.

A further embodiment of the invention relates to a polypeptide which comprises
the amino acid sequence of an E. faecalis polypeptide having an amino acid sequence which
contains at least one conservative amino acid substitution, but not more than 50
conservative amino acid substitutions, not more than 40 conservative amino acid
substitutions, not more than 30 conservative amino acid substitutions, and not more than
20 conservative amino acid substitutions. Also provided are polypeptides which comprise
the amino acid sequence of an E. faecalis polypeptide, having at least one, but not more
than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino acid substitutions.

By a polypeptide having an amino acid sequence at least, for example, 95%
"identical" to a query amino acid sequence of the present invention, it is intended that the
amino acid sequence of the subject polypeptide is identical to the query sequence except
that the subject polypeptide sequence may include up to five amino acid alterations per
each 100 amino acids of the query amino acid sequence. In other words, to obtain a
polypeptide having an amino acid sequence at least 95% identical to a query amino acid
sequence, up to 5% of the amino acid residues in the subject sequence may be inserted,
deleted (indels), or substituted with another amino acid. These alterations of the
reference sequence may occur at the amino or carboxy terminal positions of the
reference amino acid sequence or anywhere between those terminal positions, interspersed
either individually among residues in the reference sequence or in one or more contiguous
groups within the reference sequence.
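
The arithmetic of this definition can be sketched as follows. This is an illustration only, assuming whole-number percentages and rounding down to a whole number of residues; the function name max_alterations is hypothetical.

```python
def max_alterations(query_length: int, min_identity_pct: int) -> int:
    """Largest number of altered residues (insertions, deletions or
    substitutions) allowed for a subject sequence to remain at least
    min_identity_pct percent identical to a query of query_length residues."""
    if query_length < 0 or not 0 <= min_identity_pct <= 100:
        raise ValueError("length must be >= 0 and percentage within 0..100")
    # Integer arithmetic avoids floating-point rounding at the boundary.
    return (100 - min_identity_pct) * query_length // 100


# "At least 95% identical" to a 100-residue query allows up to 5 alterations.
print(max_alterations(100, 95))
```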

As a practical matter, whether any particular polypeptide is at least 90%, 95%,
96%, 97%, 98% or 99% identical to the amino acid sequences encoded by the sequences
of SEQ ID NOS: 1-982, as described herein, can be determined conventionally using
known computer programs. A preferred method for determining the best overall match
between a query sequence (a sequence of the present invention) and a subject sequence,
also referred to as a global sequence alignment, can be determined using the FASTDB
computer program based on the algorithm of Brutlag et al., (1990) Comp. App. Biosci.
6:237-245. In a sequence alignment the query and subject sequences are both amino acid
sequences. The result of said global sequence alignment is in percent identity. Preferred
parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2,
Mismatch Penalty=1, Joining Penalty=20, Randomization Group Length=0, Cutoff
Score=1, Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window
Size=500 or the length of the subject amino acid sequence, whichever is shorter.

If the subject sequence is shorter than the query sequence due to N- or C-terminal
deletions, not because of internal deletions, the results, in percent identity, must be
manually corrected. This is because the FASTDB program does not account for N- and
C-terminal truncations of the subject sequence when calculating global percent identity. For
subject sequences truncated at the N- and C-termini, relative to the query sequence, the
percent identity is corrected by calculating the number of residues of the query sequence
that are N- and C-terminal of the subject sequence, which are not matched/aligned with a
corresponding subject residue, as a percent of the total residues of the query sequence.
Whether a residue is matched/aligned is determined by results of the FASTDB sequence
alignment. This percentage is then subtracted from the percent identity, calculated by
the above FASTDB program using the specified parameters, to arrive at a final percent
identity score. This final percent identity score is what is used for the purposes of the
present invention. Only residues to the N- and C-termini of the subject sequence, which
are not matched/aligned with the query sequence, are considered for the purposes of
manually adjusting the percent identity score. That is, only query amino acid residues
outside the farthest N- and C-terminal residues of the subject sequence are considered.

For example, a 90 amino acid residue subject sequence is aligned with a 100 residue
query sequence to determine percent identity. The deletion occurs at the N-terminus of
the subject sequence and therefore, the FASTDB alignment does not match/align with the
first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of the
sequence (number of residues at the N- and C-termini not matched/total number of
residues in the query sequence) so 10% is subtracted from the percent identity score
calculated by the FASTDB program. If the remaining 90 residues were perfectly matched
the final percent identity would be 90%. In another example, a 90 residue subject
sequence is compared with a 100 residue query sequence. This time the deletions are
internal so there are no residues at the N- or C-termini of the subject sequence which are
not matched/aligned with the query. In this case the percent identity calculated by
FASTDB is not manually corrected. Once again, only residue positions outside the N- and
C-terminal ends of the subject sequence, as displayed in the FASTDB alignment, which
are not matched/aligned with the query sequence are manually corrected. No other
manual corrections are to be made for the purposes of the present invention.
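
The manual correction described above amounts to a single subtraction, sketched below. The sketch does not call FASTDB; it assumes the uncorrected percent identity and the counts of unmatched query residues at each terminus have already been read from a FASTDB alignment. The function and parameter names are hypothetical.

```python
def corrected_percent_identity(
    fastdb_identity_pct: float,
    query_length: int,
    unmatched_n_term: int = 0,
    unmatched_c_term: int = 0,
) -> float:
    """Subtract from the reported percent identity the percentage of query
    residues lying outside the N- and C-terminal ends of the subject sequence
    that are not matched/aligned. Internal gaps are not counted here."""
    if query_length <= 0:
        raise ValueError("query_length must be positive")
    unmatched = unmatched_n_term + unmatched_c_term
    if unmatched_n_term < 0 or unmatched_c_term < 0 or unmatched > query_length:
        raise ValueError("unmatched residue counts are out of range")
    penalty_pct = 100.0 * unmatched / query_length
    return fastdb_identity_pct - penalty_pct


# First example: 10 unmatched N-terminal residues of a 100-residue query,
# with the remaining 90 residues perfectly matched.
print(corrected_percent_identity(100.0, 100, unmatched_n_term=10))

# Second example: internal deletions only, so no correction is applied
# (the input value 90.0 here is purely illustrative).
print(corrected_percent_identity(90.0, 100))
```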

The above polypeptide sequences are included irrespective of whether they have 
their normal biological activity. This is because even where a particular polypeptide 
molecule does not have biological activity, one of skill in the art would still know how to
use the polypeptide, for instance, as a vaccine or to generate antibodies. Other uses of
the polypeptides of the present invention that do not have E. faecalis activity include, 
inter alia, as epitope tags, in epitope mapping, and as molecular weight markers on 
SDS-PAGE gels or on molecular sieve gel filtration columns using methods known to 
those of skill in the art. 

As described below, the polypeptides of the present invention can also be used to
raise polyclonal and monoclonal antibodies, which are useful in assays for detecting E.
faecalis protein expression or as agonists and antagonists capable of enhancing or
inhibiting E. faecalis protein function. Further, such polypeptides can be used in the
yeast two-hybrid system to "capture" E. faecalis protein binding proteins which are also
candidate agonists and antagonists according to the present invention. See, e.g., Fields et
al. (1989) Nature 340:245-246.

Any host/vector system can be used to express one or more of the ORFs of the
present invention. These include, but are not limited to, eukaryotic hosts such as HeLa
cells, CV-1 cells, COS cells, and Sf9 cells, as well as prokaryotic hosts such as E. coli and B.
subtilis. The most preferred cells are those which do not normally express the particular
polypeptide or protein or which express the polypeptide or protein at a low natural level.

"Recombinant," as used herein, means that a polypeptide or protein is derived 
from recombinant (e.g., microbial or mammalian) expression systems. "Microbial" refers 
to recombinant polypeptides or proteins made in bacterial or fungal (e.g., yeast)
expression systems. As a product, "recombinant microbial" defines a polypeptide or
protein essentially free of native endogenous substances and unaccompanied by associated
native glycosylation. Polypeptides or proteins expressed in most bacterial cultures, e.g.,
E. coli, will be free of glycosylation modifications; polypeptides or proteins expressed in
yeast will have a glycosylation pattern different from that expressed in mammalian cells.

"Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. 
Generally, DNA segments encoding the polypeptides and proteins provided by this 
invention are assembled from fragments of the Enterococcus faecalis genome and short
oligonucleotide linkers, or from a series of oligonucleotides, to provide a synthetic gene 
which is capable of being expressed in a recombinant transcriptional unit comprising 
regulatory elements derived from a microbial or viral operon. 

Recombinant expression vehicle or "vector" refers to a plasmid, phage, virus
or other vector for expressing a polypeptide from a DNA (RNA) sequence. The expression
vehicle can comprise a transcriptional unit comprising an assembly of (1) genetic
regulatory elements necessary for gene expression in the host, including elements required
to initiate and maintain transcription at a level sufficient for suitable expression of the
desired polypeptide, including, for example, promoters and, where necessary, an enhancer
and a polyadenylation signal; (2) a structural or coding sequence which is transcribed into
mRNA and translated into protein; and (3) appropriate signals to initiate translation at
the beginning of the desired coding region and terminate translation at its end. Structural
units intended for use in yeast or eukaryotic expression systems preferably include a
leader sequence enabling extracellular secretion of translated protein by a host cell.
Alternatively, where recombinant protein is expressed without a leader or transport
sequence, it may include an N-terminal methionine residue. This residue may or may not
be subsequently cleaved from the expressed recombinant protein to provide a final
product.

"Recombinant expression system" means host cells which have stably integrated a 
recombinant transcriptional unit into chromosomal DNA or carry the recombinant
transcriptional unit extrachromosomally. The cells can be prokaryotic or eukaryotic.
Recombinant expression systems as defined herein will express heterologous polypeptides
or proteins upon induction of the regulatory elements linked to the DNA segment or
synthetic gene to be expressed.

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other
cells under the control of appropriate promoters. Cell-free translation systems can also
be employed to produce such proteins using RNAs derived from the DNA constructs of
the present invention. Appropriate cloning and expression vectors for use with
prokaryotic and eukaryotic hosts are described in Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, New York (1989), the disclosure of which is hereby incorporated by reference in
its entirety.

Generally, recombinant expression vectors will include origins of replication and
selectable markers permitting transformation of the host cell, e.g., the ampicillin
resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a
highly expressed gene to direct transcription of a downstream structural sequence. Such
promoters can be derived from operons encoding glycolytic enzymes such as
3-phosphoglycerate kinase (PGK), alpha-factor, acid phosphatase, or heat shock proteins,
among others. The heterologous structural sequence is assembled in appropriate phase
with translation initiation and termination sequences, and preferably, a leader sequence
capable of directing secretion of translated protein into the periplasmic space or
extracellular medium. Optionally, the heterologous sequence can encode a fusion protein
including an N-terminal identification peptide imparting desired characteristics, e.g.,
stabilization or simplified purification of expressed recombinant product.

Useful expression vectors for bacterial use are constructed by inserting a structural 
DNA sequence encoding a desired protein together with suitable translation initiation and
termination signals in operable reading phase with a functional promoter. The vector will
comprise one or more phenotypic selectable markers and an origin of replication to 
ensure maintenance of the vector and, when desirable, provide amplification within the 
host. 

Suitable prokaryotic hosts for transformation include strains of E. coli, B. subtilis,
Salmonella typhimurium and various species within the genera Pseudomonas and
Streptomyces. Others may also be employed as a matter of choice.

As a representative but non-limiting example, useful expression vectors for
bacterial use can comprise a selectable marker and bacterial origin of replication derived
from commercially available plasmids comprising genetic elements of the well known
cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example,
pKK223-3 (available from Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1
(available from Promega Biotec, Madison, WI, USA). These pBR322 "backbone"
sections are combined with an appropriate promoter and the structural sequence to be
expressed.

Following transformation of a suitable host strain and growth of the host strain to
an appropriate cell density, the selected promoter, where it is inducible, is derepressed or
induced by appropriate means (e.g., temperature shift or chemical induction) and cells are
cultured for an additional period to provide for expression of the induced gene product.
Thereafter cells are typically harvested, generally by centrifugation, disrupted to release
expressed protein, generally by physical or chemical means, and the resulting crude
extract is retained for further purification.

Various mammalian cell culture systems can also be employed to express
recombinant protein. Examples of mammalian expression systems include the COS-7
lines of monkey kidney fibroblasts, described in Gluzman, Cell 23:175 (1981), and other
cell lines capable of expressing a compatible vector, for example, the C127, 3T3, CHO,
HeLa and BHK cell lines.

Mammalian expression vectors will comprise an origin of replication, a suitable
promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation
site, splice donor and acceptor sites, transcriptional termination sequences, and 5'
flanking nontranscribed sequences. DNA sequences derived from the SV40 viral genome,
for example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites
may be used to provide the required nontranscribed genetic elements.

Recombinant polypeptides and proteins produced in bacterial culture are usually
isolated by initial extraction from cell pellets, followed by one or more salting-out,
aqueous ion exchange or size exclusion chromatography steps. Microbial cells employed
in expression of proteins can be disrupted by any convenient method, including
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents. Protein
refolding steps can be used, as necessary, in completing configuration of the mature
protein. Finally, high performance liquid chromatography (HPLC) can be employed for
final purification steps.

The present invention further includes isolated polypeptides, proteins and nucleic
acid molecules which are substantially equivalent to those herein described. As used
herein, substantially equivalent can refer both to nucleic acid and amino acid sequences,
for example a mutant sequence, that varies from a reference sequence by one or more
substitutions, deletions, or additions, the net effect of which does not result in an adverse
functional dissimilarity between reference and subject sequences. For purposes of the
present invention, sequences having equivalent biological activity, and equivalent
expression characteristics are considered substantially equivalent. For purposes of
determining equivalence, truncation of the mature sequence should be disregarded.

The invention further provides methods of obtaining homologs, from other
strains of Enterococcus faecalis, of the fragments of the Enterococcus faecalis genome of
the present invention and homologs of the proteins encoded by the ORFs of the present
invention. As used herein, a sequence or protein of Enterococcus faecalis is defined as a
homolog of one of the Enterococcus faecalis fragments or contigs, or of a protein
encoded by one of the ORFs of the present invention, if it shares significant homology to
one of the fragments of the Enterococcus faecalis genome of the present invention or a
protein encoded by one of the ORFs of the present invention. Specifically, by using the
sequences disclosed herein as probes or as primers, and techniques such as PCR cloning and
colony/plaque hybridization, one skilled in the art can obtain homologs.

As used herein, two nucleic acid molecules or proteins are said to "share
significant homology" if the two contain regions which possess greater than 85%
sequence (amino acid or nucleic acid) homology. Preferred homologs in this regard are
those with more than 90% homology. Especially preferred are those with 93% or more
homology. Among especially preferred homologs those with 95% or more homology are
particularly preferred. Very particularly preferred among these are those with 97% and
even more particularly preferred among those are homologs with 99% or more
homology. The most preferred homologs among these are those with 99.9% homology
or more. It will be understood that, among measures of homology, identity is particularly
preferred in this regard.
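
The graded thresholds above can be summarized as a simple lookup, sketched here for illustration only. The function name homology_tier is hypothetical, and treating the 97% level as "97% or more" is an assumption, since the text does not state whether that bound is inclusive.

```python
def homology_tier(percent_homology: float) -> str:
    """Map a percent homology value to the preference level named in the text."""
    if percent_homology >= 99.9:
        return "most preferred"
    if percent_homology >= 99:
        return "even more particularly preferred"
    if percent_homology >= 97:  # assumed inclusive; the text says only "97%"
        return "very particularly preferred"
    if percent_homology >= 95:
        return "particularly preferred"
    if percent_homology >= 93:
        return "especially preferred"
    if percent_homology > 90:   # "more than 90%"
        return "preferred"
    if percent_homology > 85:   # "greater than 85%"
        return "significant homology"
    return "below the significant homology threshold"


print(homology_tier(96.2))
```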

Region specific primers or probes derived from the nucleotide sequence provided
in SEQ ID NOS: 1-982 or from a nucleotide sequence at least 95%, particularly at least
99%, especially at least 99.5% identical to a sequence of SEQ ID NOS: 1-982 can be used
to prime DNA synthesis and PCR amplification, as well as to identify colonies containing
cloned DNA encoding a homolog. Methods suitable to this aspect of the present
invention are well known and have been described in great detail in many publications
such as, for example, Innis et al., PCR Protocols, Academic Press, San Diego, CA
(1990).

When using primers derived from SEQ ID NOS: 1-982 or from a nucleotide
sequence having an aforementioned identity to a sequence of SEQ ID NOS: 1-982, one
skilled in the art will recognize that by employing high stringency conditions (e.g.,
annealing at 50-60°C in 6X SSPC and 50% formamide, and washing at 50-65°C in 0.5X
SSPC) only sequences which are greater than 75% homologous to the primer will be
amplified. By employing lower stringency conditions (e.g., hybridizing at 35-37°C in 5X
SSPC and 40-45% formamide, and washing at 42°C in 0.5X SSPC), sequences which are
greater than 40-50% homologous to the primer will also be amplified.

When using DNA probes derived from SEQ ID NOS: 1-982, or from a nucleotide
sequence having an aforementioned identity to a sequence of SEQ ID NOS: 1-982, for
colony/plaque hybridization, one skilled in the art will recognize that by employing high
stringency conditions (e.g., hybridizing at 50-65°C in 5X SSPC and 50% formamide, and
washing at 50-65°C in 0.5X SSPC), sequences having regions which are greater than 90%
homologous to the probe can be obtained, and that by employing lower stringency
conditions (e.g., hybridizing at 35-37°C in 5X SSPC and 40-45% formamide, and washing
at 42°C in 0.5X SSPC), sequences having regions which are greater than 35-45%
homologous to the probe will be obtained.

Any organism can be used as the source for homologs of the present invention so
long as the organism naturally expresses such a protein or contains genes encoding the
same. The most preferred organisms for isolating homologs are bacteria which are closely
related to Enterococcus faecalis.

ILLUSTRATIVE USES OF COMPOSITIONS OF THE INVENTION 

Each ORF provided in Tables 1 and 2 is identified with a function by homology to
a known gene or polypeptide. As a result, one skilled in the art can use the polypeptides
of the present invention for commercial, therapeutic and industrial purposes consistent
with the type of putative identification of the polypeptide. Such identifications permit
one skilled in the art to use the Enterococcus faecalis ORFs in a manner similar to the
known type of sequences for which the identification is made; for example, to ferment a
particular sugar source or to produce a particular metabolite. A variety of reviews
illustrative of this aspect of the invention are available, including the following reviews
on the industrial use of enzymes, for example, BIOCHEMICAL ENGINEERING AND
BIOTECHNOLOGY HANDBOOK, 2nd Ed., MacMillan Publications, Ltd. NY (1991) and
BIOCATALYSTS IN ORGANIC SYNTHESES, Tramper et al., Eds., Elsevier Science
Publishers, Amsterdam, The Netherlands (1985). A variety of exemplary uses that
illustrate this and similar aspects of the present invention are discussed below.

1. Biosynthetic Enzymes 

Open reading frames encoding proteins that mediate the catalytic
reactions of intermediary and macromolecular metabolism, the biosynthesis of
small molecules, cellular processes and other functions can be used for industrial
biosynthesis. These include enzymes involved in the
degradation of the intermediary products of metabolism, enzymes involved in central
intermediary metabolism, enzymes involved in respiration, both aerobic and anaerobic,
enzymes involved in fermentation, enzymes involved in ATP proton motive force
conversion, enzymes involved in broad regulatory function, enzymes involved in amino
acid synthesis, enzymes involved in nucleotide synthesis, and enzymes involved in cofactor
and vitamin synthesis.

The various metabolic pathways present in Enterococcus faecalis can be identified 
based on absolute nutritional requirements as well as by examining the various enzymes 
identified in Tables 1-3 and SEQ ID NOS: 1-982.

Of particular interest are polypeptides involved in the degradation of
intermediary metabolites as well as non-macromolecular metabolism. Such enzymes
include amylases, glucose oxidases, and catalase.

Proteolytic enzymes are another class of commercially important enzymes.
Proteolytic enzymes find use in a number of industrial processes including the processing
of flax and other vegetable fibers, in the extraction, clarification and depectinization of
fruit juices, in the extraction of vegetable oil and in the maceration of fruits and
vegetables to give unicellular fruits. A detailed review of the proteolytic enzymes used in
the food industry is provided in Rombouts et al., Symbiosis 21:19 (1986) and Voragen et
al. in Biocatalysts In Agricultural Biotechnology, Whitaker et al., Eds., American
Chemical Society Symposium Series 389:93 (1989).

The metabolism of sugars is an important aspect of the primary metabolism of
Enterococcus faecalis. Enzymes involved in the degradation of sugars, such as,
particularly, glucose, galactose, fructose and xylose, can be used in industrial
fermentation. Some of the important sugar transforming enzymes, from a commercial
viewpoint, include sugar isomerases such as glucose isomerase. Other metabolic enzymes
have found commercial use, such as glucose oxidases, which produce ketogulonic acid
(KGA). KGA is an intermediate in the commercial production of ascorbic acid using
Reichstein's procedure, as described in Krueger et al., Biotechnology 6(A), Rhine et al.,
Eds., Verlag Press, Weinheim, Germany (1984).

Glucose oxidase (GOD) is commercially available and has been used in purified
form as well as in an immobilized form for the deoxygenation of beer. See, for instance,
Hartmeir et al., Biotechnology Letters 7:21 (1979). The most important application of
GOD is the industrial scale fermentation of gluconic acid. Gluconic acids are marketed
for use in the detergent, textile, leather, photographic, pharmaceutical, food, feed
and concrete industries, as described, for example, in Bigelis et al., beginning on page 357
in GENE MANIPULATIONS AND FUNGI; Benett et al., Eds., Academic Press, New
York (1985). In addition to industrial applications, GOD has found applications in
medicine for quantitative determination of glucose in body fluids and, more recently, in
biotechnology for analyzing syrups from starch and cellulose hydrolysates. This
application is described in Owusu et al., Biochem. et Biophysica. Acta. 572:83 (1986), for
instance.

The main sweetener used in the world today is sugar which comes from sugar beets
and sugar cane. In the field of industrial enzymes, the glucose isomerase process shows
the largest expansion in the market today. Initially, soluble enzymes were used and later
immobilized enzymes were developed (Krueger et al., Biotechnology, The Textbook of
Industrial Microbiology, Sinauer Associated Incorporated, Sunderland, Massachusetts
(1990)). Today, the use of glucose-produced high fructose syrups is by far the largest
industrial business using immobilized enzymes. A review of the industrial use of these
enzymes is provided by Jorgensen, Starch 40:301 (1988).

Proteinases, such as alkaline serine proteinases, are used as detergent additives and
thus represent one of the largest volumes of microbial enzymes used in the industrial
sector. Because of their industrial importance, there is a large body of published and
unpublished information regarding the use of these enzymes in industrial processes. (See
Faultman et al., Acid Proteases Structure Function and Biology, Tang, J., ed., Plenum
Press, New York (1977) and Godfrey et al., Industrial Enzymes, MacMillan Publishers,
Surrey, UK (1983) and Hepner et al., Report Industrial Enzymes by 1990, Hel Hepner &
Associates, London (1986)).

Another class of commercially usable proteins of the present invention are the
microbial lipases, described by, for instance, Macrae et al., Philosophical Transactions of
the Chiral Society of London 310:221 (1985) and Poserke, Journal of the American Oil
Chemist Society 61:1758 (1984). A major use of lipases is in the fat and oil industry for
the production of neutral glycerides using lipase catalyzed inter-esterification of readily
available triglycerides. Applications of lipases include use as a detergent additive to
facilitate the removal of fats from fabrics in the course of the washing procedures.

The use of enzymes, and in particular microbial enzymes, as catalysts for key steps
in the synthesis of complex organic molecules is gaining popularity at a great rate. One
area of great interest is the preparation of chiral intermediates. Preparation of chiral
intermediates is of interest to a wide range of synthetic chemists, particularly those
scientists involved with the preparation of new pharmaceuticals, agrochemicals,
fragrances and flavors. (See Davies et al., Recent Advances in the Generation of Chiral
Intermediates Using Enzymes, CRC Press, Boca Raton, Florida (1990)). The following
reactions catalyzed by enzymes are of interest to organic chemists: hydrolysis of
carboxylic acid esters, phosphate esters, amides and nitriles, esterification reactions,
trans-esterification reactions, synthesis of amides, reduction of alkanones and
oxoalkanoates, oxidation of alcohols to carbonyl compounds, oxidation of sulfides to
sulfoxides, and carbon bond forming reactions such as the aldol reaction.

When considering the use of an enzyme encoded by one of the ORFs of the
present invention for biotransformation and organic synthesis it is sometimes necessary
to consider the respective advantages and disadvantages of using a microorganism as
opposed to an isolated enzyme. The pros and cons of using a whole cell system on the one
hand or an isolated partially purified enzyme on the other hand have been described in
detail by Bud et al., Chemistry in Britain (1987), p. 127.

Amino transferases, enzymes involved in the biosynthesis and metabolism of
amino acids, are useful in the catalytic production of amino acids. The advantage of
using microbial based enzyme systems is that the amino transferase enzymes catalyze the
stereoselective synthesis of only L-amino acids and generally possess uniformly high
catalytic rates. A description of the use of amino transferases for amino acid production
is provided by Roselle-David, Methods of Enzymology 755:479 (1987).

Another category of useful proteins encoded by the ORFs of the present
invention includes enzymes involved in nucleic acid synthesis, repair, and recombination.

2. Generation of Antibodies 

As described here, the proteins of the present invention, as well as homologs
thereof, can be used in a variety of procedures and methods known in the art which are
currently applied to other proteins. The proteins of the present invention can further be
used to generate an antibody which selectively binds the protein.

E. faecalis protein-specific antibodies for use in the present invention can be
raised against the intact E. faecalis protein or an antigenic polypeptide fragment thereof,
which may be presented together with a carrier protein, such as an albumin, to an animal
system (such as rabbit or mouse) or, if it is long enough (at least about 25 amino acids),
without a carrier.

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is
meant to include intact molecules, single chain whole antibodies, and antibody fragments.
Antibody fragments of the present invention include Fab and F(ab')2 and other fragments
including single-chain Fvs (scFv) and disulfide-linked Fvs (sdFv). Also included in the
present invention are chimeric and humanized monoclonal antibodies and polyclonal
antibodies specific for the polypeptides of the present invention. The antibodies of the
present invention may be prepared by any of a variety of methods. For example, cells
expressing a polypeptide of the present invention or an antigenic fragment thereof can
be administered to an animal in order to induce the production of sera containing
polyclonal antibodies. As a further example, a preparation of E. faecalis polypeptide or
fragment thereof is prepared and purified to render it substantially free of natural
contaminants. Such a preparation is then introduced into an animal in order to produce
polyclonal antisera of greater specific activity.

In a preferred method, the antibodies of the present invention are monoclonal
antibodies or binding fragments thereof. Such monoclonal antibodies can be prepared
using hybridoma technology. See, e.g., Harlow et al., ANTIBODIES: A LABORATORY
MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); Hammerling, et al., in:
MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS 563-681 (Elsevier,
N.Y., 1981). Fab and F(ab')2 fragments may be produced by proteolytic cleavage, using
enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab')2
fragments). Alternatively, E. faecalis polypeptide-binding fragments, chimeric, and
humanized antibodies can be produced through the application of recombinant DNA
technology or through synthetic chemistry using methods known in the art.

Alternatively, additional antibodies capable of binding to the polypeptide antigen
of the present invention may be produced in a two-step procedure through the use of
anti-idiotypic antibodies. Such a method makes use of the fact that antibodies are
themselves antigens, and that, therefore, it is possible to obtain an antibody which binds
to a second antibody. In accordance with this method, E. faecalis polypeptide-specific
antibodies are used to immunize an animal, preferably a mouse. The splenocytes of such
an animal are then used to produce hybridoma cells, and the hybridoma cells are screened
to identify clones which produce an antibody whose ability to bind to the E. faecalis
polypeptide-specific antibody can be blocked by the E. faecalis polypeptide antigen.
Such antibodies comprise anti-idiotypic antibodies to the E. faecalis polypeptide-specific
antibody and can be used to immunize an animal to induce formation of further E.
faecalis polypeptide-specific antibodies.

Antibodies and fragments thereof of the present invention may be described by
the portion of a polypeptide of the present invention recognized or specifically bound by
the antibody. Antibody binding fragments of a polypeptide of the present invention
may be described or specified in the same manner as for polypeptide fragments discussed
above, i.e., by N-terminal and C-terminal positions or by size in contiguous amino acid
residues. Any number of antibody binding fragments of a polypeptide of the present
invention, specified by N-terminal and C-terminal positions or by size in amino acid
residues, as described above, may also be excluded from the present invention. Therefore,
the present invention includes antibodies that specifically bind a particularly described
fragment of a polypeptide of the present invention and allows for the exclusion of the
same.

Antibodies and fragments thereof of the present invention may also be described
or specified in terms of their cross-reactivity. Antibodies and fragments that do not bind
polypeptides of any species of Enterococcus other than E. faecalis are included in
the present invention. Likewise, antibodies and fragments that bind only species of
Enterococcus, i.e., antibodies and fragments that do not bind bacteria from any genus
other than Enterococcus, are included in the present invention.

3. Diagnostic and Detection Assays and Kits 

The present invention further relates to methods for assaying enterococcal
infection in an animal by detecting the expression of genes encoding enterococcal
polypeptides of the present invention. The methods comprise analyzing tissue or body
fluid from the animal for Enterococcus-specific antibodies, nucleic acids, or proteins.
Nucleic acid specific to Enterococcus is assayed by PCR or hybridization
techniques using nucleic acid sequences of the present invention as either hybridization
probes or primers. See, e.g., Sambrook et al. Molecular Cloning: A Laboratory Manual
(Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 reference); Eremeeva et al.
(1994) J. Clin. Microbiol. 32:803-810 (describing differentiation among spotted fever
group Rickettsiae species by analysis of restriction fragment length polymorphism of
PCR-amplified DNA) and Chen et al. (1994) J. Clin. Microbiol. 32:589-595 (detecting
B. burgdorferi nucleic acids via PCR).

Where diagnosis of a disease state related to infection with Enterococcus has
already been made, the present invention is useful for monitoring progression or
regression of the disease state, whereby patients exhibiting enhanced Enterococcus gene
expression will experience a worse clinical outcome relative to patients expressing the
gene(s) at a lower level.

By "biological sample" is intended any biological sample obtained from an animal,
cell line, tissue culture, or other source which contains Enterococcus polypeptide, mRNA,
or DNA. Biological samples include body fluids (such as saliva, blood, plasma, urine,
mucus, synovial fluid, etc.), tissues (such as muscle, skin, and cartilage) and any other
biological source suspected of containing Enterococcus polypeptides or nucleic acids.
Methods for obtaining biological samples such as tissue are well known in the art.

The present invention is useful for detecting diseases related to Enterococcus
infections in animals. Preferred animals include monkeys, apes, cats, dogs, birds, cows,
pigs, mice, horses, rabbits and humans. Particularly preferred are humans.

Total RNA can be isolated from a biological sample using any suitable technique
such as the single-step guanidinium-thiocyanate-phenol-chloroform method described in
Chomczynski et al. (1987) Anal. Biochem. 162:156-159. mRNA encoding Enterococcus
polypeptides having sufficient homology to the nucleic acid sequences identified in SEQ
ID NOS: 1-982 to allow for hybridization between complementary sequences is then
assayed using any appropriate method. These include Northern blot analysis, S1 nuclease
mapping, the polymerase chain reaction (PCR), reverse transcription in combination
with the polymerase chain reaction (RT-PCR), and reverse transcription in combination
with the ligase chain reaction (RT-LCR).

Northern blot analysis can be performed as described in Harada et al. (1990) Cell
63:303-312. Briefly, total RNA is prepared from a biological sample as described above.
For the Northern blot, the RNA is denatured in an appropriate buffer (such as
glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to agarose gel
electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been
linked to the filter by a UV linker, the filter is prehybridized in a solution containing
formamide, SSC, Denhardt's solution, denatured salmon sperm DNA, SDS, and sodium
phosphate buffer. An E. faecalis polynucleotide sequence shown in SEQ ID NOS: 1-982
labeled according to any appropriate method (such as the ³²P-multiprimed DNA labeling
system (Amersham)) is used as probe. After hybridization overnight, the filter is washed
and exposed to x-ray film. DNA for use as probe according to the present invention is
described in the sections above and will preferably be at least 15 nucleotides in length.

S1 mapping can be performed as described in Fujita et al. (1987) Cell 49:357-367.
To prepare probe DNA for use in S1 mapping, the sense strand of an above-described E.
faecalis DNA sequence of the present invention is used as a template to synthesize labeled
antisense DNA. The antisense DNA can then be digested using an appropriate restriction
endonuclease to generate further DNA probes of a desired length. Such antisense probes
are useful for visualizing protected bands corresponding to the target mRNA (i.e., mRNA
encoding Enterococcus polypeptides).
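
The probe preparation just described (synthesizing antisense DNA from a sense-strand template, then cutting it with a restriction endonuclease) can be illustrated in silico. The following is a minimal sketch only: the function names and the 33-base example sequence are hypothetical and are not taken from SEQ ID NOS: 1-982, and the digestion is modeled as a simple cut of a single strand written 5' to 3', whereas an actual digest acts on double-stranded DNA. EcoRI (recognition site GAATTC, cleavage after the G) is used purely as an example enzyme.

```python
# Illustrative sketch; names and the example sequence are assumptions.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense(sense: str) -> str:
    """Return the antisense strand (reverse complement), written 5'->3'."""
    return sense.upper().translate(COMPLEMENT)[::-1]


def digest(dna: str, site: str, cut_offset: int) -> list[str]:
    """Cut dna at every occurrence of site.

    cut_offset is the number of bases of the site that stay on the
    upstream fragment (1 for EcoRI, which cuts G^AATTC).
    """
    fragments = []
    start = 0
    pos = dna.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(dna[start:cut])
        start = cut
        pos = dna.find(site, pos + 1)
    fragments.append(dna[start:])
    return fragments


# Hypothetical sense-strand template containing two EcoRI sites.
sense = "ATGGCTGAATTCAAAGGTACCGAATTCTTGCAT"
probe = antisense(sense)
pieces = digest(probe, "GAATTC", 1)

for piece in pieces:
    print(len(piece), piece)
```

Because GAATTC is palindromic, both sites survive on the antisense strand, and the digest yields three shorter candidate probes whose lengths can be compared with the desired probe length.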

Levels of mRNA encoding Enterococcus polypeptides are assayed, e.g., using
the RT-PCR method described in Makino et al. (1990) Technique 2:295-301. By this
method, the radioactivities of the "amplicons" in the polyacrylamide gel bands are
linearly related to the initial concentration of the target mRNA. Briefly, this method
involves adding total RNA isolated from a biological sample to a reaction mixture
containing an RT primer and appropriate buffer. After incubating for primer annealing,
the mixture can be supplemented with an RT buffer, dNTPs, DTT, RNase inhibitor and
reverse transcriptase. After incubation to achieve reverse transcription of the RNA, the
RT products are then subjected to PCR using labeled primers. Alternatively, rather than
labeling the primers, a labeled dNTP can be included in the PCR reaction mixture. PCR
amplification can be performed in a DNA thermal cycler according to conventional
techniques. After a suitable number of rounds to achieve amplification, the PCR reaction
mixture is electrophoresed on a polyacrylamide gel. After drying the gel, the
radioactivity of the appropriate bands (corresponding to the mRNA encoding the
Enterococcus polypeptides of the present invention) is quantified using an imaging
analyzer. RT and PCR reaction ingredients and conditions, reagent and gel
concentrations, and labeling methods are well known in the art. Variations on the
RT-PCR method will be apparent to the skilled artisan. Other PCR methods that can
detect the nucleic acid of the present invention can be found in PCR PRIMER: A
LABORATORY MANUAL (C.W. Dieffenbach et al. eds., Cold Spring Harbor Lab Press,
1995).

The polynucleotides of the present invention, including both DNA and RNA, may
be used to detect polynucleotides of the present invention or Enterococcal species
including E. faecalis using bio chip technology. The present invention includes both high
density chip arrays (>1000 oligonucleotides per cm²) and low density chip arrays (<1000
oligonucleotides per cm²). Bio chips comprising arrays of polynucleotides of the present
invention may be used to detect Enterococcal species, including E. faecalis, in biological
and environmental samples and to diagnose an animal, including humans, with an E.
faecalis or other Enterococcal infection. The bio chips of the present invention may
comprise polynucleotide sequences of other pathogens including bacterial, viral, parasitic,
and fungal polynucleotide sequences, in addition to the polynucleotide sequences of the
present invention, for use in rapid differential pathogenic detection and diagnosis. The
bio chips can also be used to monitor an E. faecalis or other Enterococcal infection and
to monitor the genetic changes (deletions, insertions, mismatches, etc.) in response to
drug therapy in the clinic and drug development in the laboratory. The bio chip
technology comprising arrays of polynucleotides of the present invention may also be
used to simultaneously monitor the expression of a multiplicity of genes, including those
of the present invention. The polynucleotides used to comprise a selected array may be
specified in the same manner as for the fragments, i.e., by their 5' and 3' positions or
length in contiguous base pairs and include from. Methods and particular uses of the
polynucleotides of the present invention to detect Enterococcal species, including E.
faecalis, using bio chip technology include those known in the art and those of: U.S.
Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 5527681,
5451683, 5424186, 5607646, 5658732 and World Patent Nos. WO/9710365,
WO/9511995, WO/9743447, WO/9535505, each incorporated herein in its entirety.
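
As an illustration of how an array element might be specified by its 5' and 3' positions, and how an array might be classed as high or low density by the >1000 and <1000 oligonucleotides per cm² thresholds stated above, a minimal sketch follows. The class and function names, the 1-based inclusive coordinate convention, and the example contig are assumptions made for illustration; the text does not assign a class to a density of exactly 1000 per cm², so the sketch reports that case as unspecified.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ArrayProbe:
    """An array element specified by 5' and 3' positions (assumed 1-based, inclusive)."""
    source_id: str    # hypothetical label for the contig or ORF
    five_prime: int
    three_prime: int

    @property
    def length(self) -> int:
        # Length in contiguous bases
        return self.three_prime - self.five_prime + 1

    def sequence(self, source: str) -> str:
        # Convert 1-based inclusive coordinates to a Python slice
        return source[self.five_prime - 1:self.three_prime]


def density_class(n_oligos: int, area_cm2: float) -> str:
    """Classify an array by oligonucleotides per square centimeter."""
    density = n_oligos / area_cm2
    if density > 1000:
        return "high"
    if density < 1000:
        return "low"
    return "unspecified"  # exactly 1000 per cm2 is not assigned by the text


# Hypothetical contig sequence, not taken from the sequence listing.
contig = "ATGGCTGAATTCAAAGGTACCGAATTCTTGCAT"
element = ArrayProbe("example_contig", 4, 23)

print(element.length, element.sequence(contig))
print(density_class(5000, 2.0), density_class(500, 1.0))
```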

Biosensors using the polynucleotides of the present invention may also be used to
detect, diagnose, and monitor E. faecalis or other Enterococcal species and infections
thereof. Biosensors using the polynucleotides of the present invention may also be used
to detect particular polynucleotides of the present invention. Biosensors using the
polynucleotides of the present invention may also be used to monitor the genetic changes
(deletions, insertions, mismatches, etc.) in response to drug therapy in the clinic and drug
development in the laboratory. Methods and particular uses of the polynucleotides of the
present invention to detect Enterococcal species, including E. faecalis, using biosensors
include those known in the art and those of: U.S. Patent Nos. 5721102, 5658732,
5631170, and World Patent Nos. WO97/35011, WO/9720203, each incorporated herein
in its entirety.

Thus, the present invention includes both bio chips and biosensors comprising 
polynucleotides of the present invention and methods of their use. 

Assaying Enterococcus polypeptide levels in a biological sample can occur using
any art-known method, such as antibody-based techniques. For example, Enterococcus
polypeptide expression in tissues can be studied with classical immunohistological
methods. In these, the specific recognition is provided by the primary antibody
(polyclonal or monoclonal) but the secondary detection system can utilize fluorescent,
enzyme, or other conjugated secondary antibodies. As a result, an immunohistological
staining of tissue sections for pathological examination is obtained. Tissues can also be
extracted, e.g., with urea and neutral detergent, for the liberation of Enterococcus
polypeptides for Western-blot or dot/slot assay. See, e.g., Jalkanen, M. et al. (1985) J.
Cell. Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell. Biol. 105:3087-3096. In this
technique, which is based on the use of cationic solid phases, quantitation of an
Enterococcus polypeptide can be accomplished using an isolated Enterococcus
polypeptide as a standard. This technique can also be applied to body fluids.

Other antibody-based methods useful for detecting Enterococcus polypeptide gene
expression include immunoassays, such as the ELISA and the radioimmunoassay (RIA).
For example, Enterococcus polypeptide-specific monoclonal antibodies can be used
both as an immunoabsorbent and as an enzyme-labeled probe to detect and quantify an
Enterococcus polypeptide. The amount of an Enterococcus polypeptide present in the
sample can be calculated by reference to the amount present in a standard preparation
using a linear regression computer algorithm. Such an ELISA is described in Iacobelli et
al. (1988) Breast Cancer Research and Treatment 11:19-30. In another ELISA assay, two
distinct specific monoclonal antibodies can be used to detect Enterococcus polypeptides
in a body fluid. In this assay, one of the antibodies is used as the immunoabsorbent and
the other as the enzyme-labeled probe.
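
The calculation of polypeptide amount by reference to a standard preparation using linear regression can be sketched as follows. This is an illustrative example only, not the algorithm of the cited reference: the function names are hypothetical, the standard concentrations and absorbance readings are invented numbers chosen to lie on a straight line, and an ordinary least-squares fit over the linear range of the assay is assumed.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def concentration_from_signal(signal, slope, intercept):
    """Invert the standard curve to estimate the amount in a sample."""
    return (signal - intercept) / slope


# Invented standard preparation: concentration (ng/ml) versus absorbance.
standards_ng_per_ml = [0.0, 10.0, 20.0, 40.0, 80.0]
absorbance = [0.05, 0.25, 0.45, 0.85, 1.65]

slope, intercept = fit_line(standards_ng_per_ml, absorbance)

# Estimate the concentration of an unknown sample from its absorbance.
unknown = concentration_from_signal(0.65, slope, intercept)
print(round(slope, 4), round(intercept, 4), round(unknown, 2))
```

Readings outside the range covered by the standards should not be extrapolated from such a fit; in practice the sample would be diluted and re-assayed.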

The above techniques may be conducted essentially as a "one-step" or "two-step"
assay. The "one-step" assay involves contacting the Enterococcus polypeptide with
immobilized antibody and, without washing, contacting the mixture with the labeled
antibody. The "two-step" assay involves washing before contacting the mixture with the
labeled antibody. Other conventional methods may also be employed as suitable. It is
usually desirable to immobilize one component of the assay system on a support, thereby
allowing other components of the system to be brought into contact with the component
and readily removed from the sample. Variations of the above and other immunological
methods included in the present invention can also be found in Harlow et al.,
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd
ed. 1988).

Suitable enzyme labels include, for example, those from the oxidase group, which
catalyze the production of hydrogen peroxide by reacting with substrate. Glucose oxidase
is particularly preferred as it has good stability and its substrate (glucose) is readily
available. Activity of an oxidase label may be assayed by measuring the concentration of
hydrogen peroxide formed by the enzyme-labeled antibody/substrate reaction. Besides
enzymes, other suitable labels include radioisotopes, such as iodine (¹²⁵I, ¹²¹I), carbon
(¹⁴C), sulphur (³⁵S), tritium (³H), indium (¹¹²In), and technetium (⁹⁹ᵐTc), and fluorescent
labels, such as fluorescein and rhodamine, and biotin.

Further suitable labels for the Enterococcus polypeptide-specific antibodies of the
present invention are provided below. Examples of suitable enzyme labels include malate
dehydrogenase, Enterococcal nuclease, delta-5-steroid isomerase, yeast-alcohol
dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase,
peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase,
ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and
acetylcholine esterase.

Examples of suitable radioisotopic labels include ³H, ¹¹¹In, ¹²⁵I, ¹³¹I, ³²P, ³⁵S, ¹⁴C,
⁵¹Cr, ⁵⁷To, ⁵⁸Co, ⁵⁹Fe, ⁷⁵Se, ¹⁵²Eu, ⁹⁰Y, ⁶⁷Cu, ²¹⁷Ci, ²¹¹At, ²¹²Pb, ⁴⁷Sc, ¹⁰⁹Pd, etc. ¹¹¹In is a
preferred isotope where in vivo imaging is used since it avoids the problem of
dehalogenation of the ¹²⁵I or ¹³¹I-labeled monoclonal antibody by the liver. In addition,
this radionuclide has a more favorable gamma emission energy for imaging. See, e.g.,
Perkins et al. (1985) Eur. J. Nucl. Med. 10:296-301; Carasquillo et al. (1987) J. Nucl.
Med. 28:281-287. For example, ¹¹¹In coupled to monoclonal antibodies with
1-(p-isothiocyanatobenzyl)-DPTA has shown little uptake in non-tumorous tissues,
particularly the liver, and therefore enhances specificity of tumor localization. See,
Esteban et al. (1987) J. Nucl. Med. 28:861-870.

Examples of suitable non-radioactive isotopic labels include ¹⁵⁷Gd, ⁵⁵Mn, ¹⁶²Dy,
⁵²Tr, and ⁵⁶Fe.

Examples of suitable fluorescent labels include a ¹⁵²Eu label, a fluorescein label,
an isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin label, an
allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label.

Examples of suitable toxin labels include Pseudomonas toxin, diphtheria toxin,
ricin, and cholera toxin.

Examples of chemiluminescent labels include a luminol label, an isoluminol label,
an aromatic acridinium ester label, an imidazole label, an acridinium salt label, an oxalate
ester label, a luciferin label, a luciferase label, and an aequorin label.

Examples of nuclear magnetic resonance contrasting agents include heavy metal
nuclei such as Gd, Mn, and iron.

Typical techniques for binding the above-described labels to antibodies are
provided by Kennedy et al. (1976) Clin. Chim. Acta 70:1-31, and Schurs et al. (1977)
Clin. Chim. Acta 81:1-40. Coupling techniques mentioned in the latter are the
glutaraldehyde method, the periodate method, the dimaleimide method, and the
m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of which methods are
incorporated by reference herein.

In a related aspect, the invention includes a diagnostic kit for use in screening
serum containing antibodies specific against E. faecalis infection. Such a kit may include
an isolated E. faecalis antigen comprising an epitope which is specifically
immunoreactive with at least one anti-E. faecalis antibody. Such a kit also includes
means for detecting the binding of said antibody to the antigen. In specific embodiments,
the kit may include a recombinantly produced or chemically synthesized peptide or
polypeptide antigen. The peptide or polypeptide antigen may be attached to a solid
support.

In a more specific embodiment, the detecting means of the above-described kit
includes a solid support to which said peptide or polypeptide antigen is attached. Such a
kit may also include a non-attached reporter-labeled anti-human antibody. In this
embodiment, binding of the antibody to the E. faecalis antigen can be detected by binding
of the reporter-labeled antibody to the anti-E. faecalis polypeptide antibody.

In a related aspect, the invention includes a method of detecting E. faecalis
infection in a subject. This detection method includes reacting a body fluid, preferably
serum, from the subject with an isolated E. faecalis antigen, and examining the antigen
for the presence of bound antibody. In a specific embodiment, the method includes a
polypeptide antigen attached to a solid support, and serum is reacted with the support.
Subsequently, the support is reacted with a reporter-labeled anti-human antibody. The
support is then examined for the presence of reporter-labeled antibody.

The solid surface reagent employed in the above assays and kits is prepared by
known techniques for attaching protein material to solid support material, such as
polymeric beads, dip sticks, 96-well plates or filter material. These attachment methods
generally include non-specific adsorption of the protein to the support or covalent
attachment of the protein, typically through a free amine group, to a chemically
reactive group on the solid support, such as an activated carboxyl, hydroxyl, or aldehyde
group. Alternatively, streptavidin coated plates can be used in conjunction with
biotinylated antigen(s).

The polypeptides and antibodies of the present invention, including fragments
thereof, may be used to detect Enterococcal species including E. faecalis using bio chip
and biosensor technology. Bio chips and biosensors of the present invention may
comprise the polypeptides of the present invention to detect antibodies which
specifically recognize Enterococcal species, including E. faecalis. Bio chips and biosensors
of the present invention may also comprise antibodies which specifically recognize the
polypeptides of the present invention to detect Enterococcal species, including E.
faecalis, or specific polypeptides of the present invention. Bio chips or biosensors
comprising polypeptides or antibodies of the present invention may be used to detect
Enterococcal species, including E. faecalis, in biological and environmental samples and
to diagnose an animal, including humans, with an E. faecalis or other Enterococcal
infection. Thus, the present invention includes both bio chips and biosensors comprising
polypeptides or antibodies of the present invention and methods of their use.

The bio chips of the present invention may further comprise polypeptide
sequences of other pathogens including bacterial, viral, parasitic, and fungal polypeptide
sequences, in addition to the polypeptide sequences of the present invention, for use in
rapid differential pathogenic detection and diagnosis. The bio chips of the present
invention may further comprise antibodies or fragments thereof specific for other
pathogens including bacterial, viral, parasitic, and fungal polypeptide sequences, in addition
to the antibodies or fragments thereof of the present invention, for use in rapid
differential pathogenic detection and diagnosis. The bio chips and biosensors of the
present invention may also be used to monitor an E. faecalis or other Enterococcal
infection and to monitor the genetic changes (amino acid deletions, insertions,
substitutions, etc.) in response to drug therapy in the clinic and drug development in the
laboratory. The bio chips and biosensors comprising polypeptides or antibodies of the
present invention may also be used to simultaneously monitor the expression of a
multiplicity of polypeptides, including those of the present invention. The polypeptides
used to comprise a bio chip or biosensor of the present invention may be specified in the
same manner as for the fragments, i.e., by their N-terminal and C-terminal positions or
length in contiguous amino acid residues. Methods and particular uses of the polypeptides
and antibodies of the present invention to detect Enterococcal species, including E.
faecalis, or specific polypeptides using bio chip and biosensor technology include those
known in the art, those of the U.S. Patent Nos. and World Patent Nos. listed above for
bio chips and biosensors using polynucleotides of the present invention, and those of:
U.S. Patent Nos. 5658732, 5135852, 5567301, 5677196, 5690894 and World Patent
Nos. WO9729366, WO9612957, each incorporated herein in its entirety.

4. Screening Assay for Binding Agents 

Using the isolated proteins of the present invention, the present invention
further provides methods of obtaining and identifying agents which bind to a protein
encoded by one of the ORFs of the present invention or to one of the
Enterococcus faecalis fragments and contigs herein described.
In general, such methods comprise steps of:

(a) contacting an agent with an isolated protein encoded by one of the ORFs of
the present invention, or an isolated fragment of the Enterococcus faecalis genome; and

(b) determining whether the agent binds to said protein or said fragment.

The agents screened in the above assay can be, but are not limited to, peptides,
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be
selected and screened at random or rationally selected or designed using protein modeling
techniques.

For random screening, agents such as peptides, carbohydrates, pharmaceutical 
agents and the like are selected at random and are assayed for their ability to bind to the 
protein encoded by the ORF of the present invention. 

Alternatively, agents may be rationally selected or designed. As used herein, an
agent is said to be "rationally selected or designed" when the agent is chosen based on the
configuration of the particular protein. For example, one skilled in the art can readily
adapt currently available procedures to generate peptides, pharmaceutical agents and the
like capable of binding to a specific peptide sequence in order to generate rationally
designed antipeptide peptides (for example, see Hurby et al., "Application of Synthetic
Peptides: Antisense Peptides," in Synthetic Peptides, A User's Guide, W. H. Freeman, NY
(1992), pp. 289-307, and Kaspczak et al., Biochemistry 25:9230-8 (1989)), or
pharmaceutical agents, or the like.

In addition to the foregoing, one class of agents of the present invention, as
broadly described, can be used to control gene expression through binding to one of the
ORFs or EMFs of the present invention. As described above, such agents can be
randomly screened or rationally designed/selected. Targeting the ORF or EMF allows a
skilled artisan to design sequence specific or element specific agents, modulating the
expression of either a single ORF or multiple ORFs which rely on the same EMF for
expression control.

One class of DNA binding agents comprises agents which contain base residues which
hybridize or form a triple helix by binding to DNA or RNA. Such agents can be based on
the classic phosphodiester, ribonucleic acid backbone, or can be a variety of sulfhydryl or
polymeric derivatives which have base attachment capacity.

Agents suitable for use in these methods usually contain 20 to 40 bases and are
designed to be complementary to a region of the gene involved in transcription (triple
helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456
(1988); and Dervan et al., Science 257:1360 (1991)) or to the mRNA itself (antisense -
Okano, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of
Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple helix formation optimally
results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization
blocks translation of an mRNA molecule into polypeptide. Both techniques have been
demonstrated to be effective in model systems. Information contained in the sequences
of the present invention can be used to design antisense and triple helix-forming
oligonucleotides, and other DNA binding agents.
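
To illustrate how sequence information might be used to design an antisense oligonucleotide within the 20 to 40 base range mentioned above, a minimal sketch follows. It assumes a DNA oligonucleotide that is the reverse complement of a chosen window of the target mRNA; the function name, the 0-based start coordinate, and the example mRNA are hypothetical, and practical design would also weigh factors such as target accessibility and off-target complementarity, which are not modeled here.

```python
# Maps each mRNA base to the DNA base that pairs with it.
RNA_TO_DNA_COMPLEMENT = str.maketrans("ACGU", "TGCA")

MIN_LENGTH = 20  # lower bound of the usual range stated in the text
MAX_LENGTH = 40  # upper bound of the usual range stated in the text


def antisense_oligo(mrna: str, start: int, length: int) -> str:
    """Return a DNA oligo (5'->3') complementary to mrna[start:start+length]."""
    if not MIN_LENGTH <= length <= MAX_LENGTH:
        raise ValueError("oligo length should be 20 to 40 bases")
    if start < 0 or start + length > len(mrna):
        raise ValueError("window falls outside the mRNA")
    target = mrna[start:start + length].upper()
    # Complement each base, then reverse so the oligo reads 5'->3'.
    return target.translate(RNA_TO_DNA_COMPLEMENT)[::-1]


# Hypothetical mRNA fragment, not taken from the sequence listing.
mrna = "AUGGCUGAAUUCAAAGGUACCGAAUUCUUGCAU"
oligo = antisense_oligo(mrna, 0, 20)
print(oligo)
```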

5. Pharmaceutical Compositions and Vaccines 

The present invention further provides pharmaceutical agents which can be used
to modulate the growth or pathogenicity of Enterococcus faecalis, or another related
organism, in vivo or in vitro. As used herein, a "pharmaceutical agent" is defined as a
composition of matter which can be formulated using known techniques to provide a
pharmaceutical composition. As used herein, the "pharmaceutical agents of the present
invention" refers to the pharmaceutical agents which are derived from the proteins encoded
by the ORFs of the present invention or are agents which are identified using the herein
described assays.

As used herein, a pharmaceutical agent is said to "modulate the growth and/or
pathogenicity of Enterococcus faecalis or a related organism, in vivo or in vitro," when
the agent reduces the rate of growth, rate of division, or viability of the organism in
question. The pharmaceutical agents of the present invention can modulate the growth
or pathogenicity of an organism in many fashions, although an understanding of the
underlying mechanism of action is not needed to practice the use of the pharmaceutical
agents of the present invention. Some agents will modulate the growth by binding to an
important protein, thus blocking the biological activity of the protein, while other agents
may bind to a component of the outer surface of the organism, blocking attachment or
rendering the organism more prone to attack by the body's natural immune system.
Alternatively, the agent may comprise a protein encoded by one of the ORFs of the
present invention and serve as a vaccine. The development and use of vaccines based on
outer membrane components are well known in the art.

As used herein, a "related organism" is a broad term which refers to any organism
whose growth can be modulated by one of the pharmaceutical agents of the present
invention. In general, such an organism will contain a homolog of the protein which is
the target of the pharmaceutical agent or the protein used as a vaccine. As such, related
organisms do not need to be bacterial but may be fungal or viral pathogens.

The pharmaceutical agents and compositions of the present invention may be
administered in a convenient manner, such as by the oral, topical, intravenous,
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes. The
pharmaceutical compositions are administered in an amount which is effective for
treatment and/or prophylaxis of the specific indication. In general, they are administered
in an amount of at least about 1 mg/kg body weight and in most cases they will be
administered in an amount not in excess of about 1 g/kg body weight per day. In most
cases, the dosage is from about 0.1 mg/kg to about 10 g/kg body weight daily, taking into
account the routes of administration, symptoms, etc.

The agents of the present invention can be used in native form or can be modified
to form a chemical derivative. As used herein, a molecule is said to be a "chemical
derivative" of another molecule when it contains additional chemical moieties not
normally a part of the molecule. Such moieties may improve the molecule's solubility,
absorption, biological half life, etc. The moieties may alternatively decrease the toxicity
of the molecule, eliminate or attenuate any undesirable side effect of the molecule, etc.
Moieties capable of mediating such effects are disclosed in, among other sources,
REMINGTON'S PHARMACEUTICAL SCIENCES (1980) cited elsewhere herein.

For example, such moieties may change an immunological character of the
functional derivative, such as affinity for a given antibody. Such changes in
immunomodulation activity are measured by the appropriate assay, such as a competitive
type immunoassay. Modifications of such protein properties as redox or thermal
stability, biological half-life, hydrophobicity, susceptibility to proteolytic degradation or
the tendency to aggregate with carriers or into multimers also may be effected in this way
and can be assayed by methods well known to the skilled artisan.

The therapeutic effects of the agents of the present invention may be obtained by
providing the agent to a patient by any suitable means (e.g., by inhalation, intravenously,
intramuscularly, subcutaneously, enterally, or parenterally). It is preferred to administer
the agent of the present invention so as to achieve an effective concentration within the
blood or tissue in which the growth of the organism is to be controlled. To achieve an
effective blood concentration, the preferred method is to administer the agent by
injection. The administration may be by continuous infusion, or by single or multiple
injections.

In providing a patient with one of the agents of the present invention, the dosage
of the administered agent will vary depending upon such factors as the patient's age,
weight, height, sex, general medical condition, previous medical history, etc. In general,
it is desirable to provide the recipient with a dosage of agent which is in the range of from
about 1 pg/kg to 10 mg/kg (body weight of patient), although a lower or higher dosage
may be administered. The therapeutically effective dose can be lowered by using
combinations of the agents of the present invention or by combining them with another
agent.

As used herein, two or more compounds or agents are said to be administered "in
combination" with each other when either (1) the physiological effects of each
compound, or (2) the serum concentrations of each compound can be measured at the
same time. The composition of the present invention can be administered concurrently
with, prior to, or following the administration of the other agent.

The agents of the present invention are intended to be provided to recipient
subjects in an amount sufficient to decrease the rate of growth (as defined above) of the
target organism.

The administration of the agent(s) of the invention may be for either a
"prophylactic" or "therapeutic" purpose. When provided prophylactically, the agent(s)
are provided in advance of any symptoms indicative of the organism's growth. The
prophylactic administration of the agent(s) serves to prevent, attenuate, or decrease the
rate of onset of any subsequent infection. When provided therapeutically, the agent(s)
are provided at (or shortly after) the onset of an indication of infection. The therapeutic
administration of the compound(s) serves to attenuate the pathological symptoms of the
infection and to increase the rate of recovery.

The agents of the present invention are administered to a subject, such as a
mammal, or a patient, in a pharmaceutically acceptable form and in a therapeutically
effective concentration. A composition is said to be "pharmacologically acceptable" if
its administration can be tolerated by a recipient patient. Such an agent is said to be
administered in a "therapeutically effective amount" if the amount administered is
physiologically significant. An agent is physiologically significant if its presence results
in a detectable change in the physiology of a recipient patient.

The agents of the present invention can be formulated according to known
methods to prepare pharmaceutically useful compositions, whereby these materials, or
their functional derivatives, are combined in a mixture with a pharmaceutically acceptable
carrier vehicle. Suitable vehicles and their formulation, inclusive of other human
proteins, e.g., human serum albumin, are described, for example, in REMINGTON'S
PHARMACEUTICAL SCIENCES, 16th Ed., Osol, A., Ed., Mack Publishing, Easton PA
(1980). In order to form a pharmaceutically acceptable composition suitable for
effective administration, such compositions will contain an effective amount of one or
more of the agents of the present invention, together with a suitable amount of carrier
vehicle.

Additional pharmaceutical methods may be employed to control the duration of
action. Controlled release preparations may be achieved through the use of polymers to
complex or absorb one or more of the agents of the present invention. The controlled
delivery may be effectuated by a variety of well known techniques, including formulation
with macromolecules such as, for example, polyesters, polyamino acids,
polyvinylpyrrolidone, ethylenevinylacetate, methylcellulose, carboxymethylcellulose, or
protamine sulfate, adjusting the concentration of the macromolecules and the agent in
the formulation, and by appropriate use of methods of incorporation, which can be
manipulated to effectuate a desired time course of release. Another possible method to
control the duration of action by controlled release preparations is to incorporate agents
of the present invention into particles of a polymeric material such as polyesters,
polyamino acids, hydrogels, poly(lactic acid) or ethylene vinylacetate copolymers.
Alternatively, instead of incorporating these agents into polymeric particles, it is possible
to entrap these materials in microcapsules prepared, for example, by coacervation
techniques or by interfacial polymerization with, for example, hydroxymethylcellulose or
gelatine-microcapsules and poly(methylmethacrylate) microcapsules, respectively, or in
colloidal drug delivery systems, for example, liposomes, albumin microspheres,
microemulsions, nanoparticles, and nanocapsules or in macroemulsions. Such techniques
are disclosed in REMINGTON'S PHARMACEUTICAL SCIENCES (1980).

The invention further provides a pharmaceutical pack or kit comprising one or
more containers filled with one or more of the ingredients of the pharmaceutical
compositions of the invention. Associated with such container(s) can be a notice in the
form prescribed by a governmental agency regulating the manufacture, use or sale of
pharmaceuticals or biological products, which notice reflects approval by the agency of
manufacture, use or sale for human administration.

In addition, the agents of the present invention may be employed in conjunction 
with other therapeutic compounds. 

The present invention also provides vaccines comprising one or more
polypeptides of the present invention. Heterogeneity in the composition of a vaccine
may be provided by combining E. faecalis polypeptides of the present invention.
Multi-component vaccines of this type are desirable because they are likely to be more
effective in eliciting protective immune responses against multiple species and strains of
the Enterococcus genus than single polypeptide vaccines.

Multi-component vaccines are known in the art to elicit antibody production to
numerous immunogenic components. See, e.g., Decker et al. (1996) J. Infect. Dis.
174:S270-275. In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent
vaccine has recently been demonstrated to elicit protective levels of antibodies in human
infants against all four pathogenic agents. See, e.g., Aristegui, J. et al. (1997) Vaccine
15:7-9.

The present invention in addition to single-component vaccines includes 
multi-component vaccines. These vaccines comprise more than one polypeptide, 
immunogen or antigen. Thus, a multi-component vaccine would be a vaccine comprising 
more than one of the E. faecalis polypeptides of the present invention. 
Further within the scope of the invention are whole cell and whole viral vaccines.
Such vaccines may be produced recombinantly and involve the expression of one or more
of the E. faecalis polypeptides described in SEQ ID NOS: 1-982. For example, the E.
faecalis polypeptides of the present invention may be either secreted or localized
intracellularly, on the cell surface, or in the periplasmic space. Further, when a
recombinant virus is used, the E. faecalis polypeptides of the present invention may, for
example, be localized in the viral envelope, on the surface of the capsid, or internally
within the capsid. Whole cell vaccines which employ cells expressing heterologous
proteins are known in the art. See, e.g., Robinson, K. et al. (1997) Nature Biotech.
15:653-657; Sirard, J. et al. (1997) Infect. Immun. 65:2029-2033; Chabalgoity, J. et al.
(1997) Infect. Immun. 65:2402-2412. These cells may be administered live or may be
killed prior to administration. Chabalgoity, J. et al., supra, for example, report the
successful use in mice of a live attenuated Salmonella vaccine strain which expresses a
portion of a platyhelminth fatty acid-binding protein as a fusion protein on its cell
surface.

A multi-component vaccine can also be prepared using techniques known in the
art by combining one or more E. faecalis polypeptides of the present invention, or
fragments thereof, with additional non-Enterococcal components (e.g., diphtheria toxin
or tetanus toxin, and/or other compounds known to elicit an immune response). Such
vaccines are useful for eliciting protective immune responses to both members of the
Enterococcus genus and non-Enterococcal pathogenic agents.

The vaccines of the present invention also include DNA vaccines. DNA vaccines
are currently being developed for a number of infectious diseases. See, e.g., Boyer et al.
(1997) Nat. Med. 3:526-532; reviewed in Spier, R. (1996) Vaccine 14:1285-1288. Such
DNA vaccines contain a nucleotide sequence encoding one or more E. faecalis
polypeptides of the present invention oriented in a manner that allows for expression of
the subject polypeptide. For example, the direct administration of plasmid DNA
encoding B. burgdorferi OspA has been shown to elicit protective immunity in mice
against borrelial challenge. See, Luke et al. (1997) J. Infect. Dis. 175:91-97.

The present invention also relates to the administration of a vaccine which is
co-administered with a molecule capable of modulating immune responses. Kim et al.
(1997) Nature Biotech. 15:641-646, for example, report the enhancement of immune
responses produced by DNA immunizations when DNA sequences encoding molecules
which stimulate the immune response are co-administered. In a similar fashion, the
vaccines of the present invention may be co-administered with either nucleic acids
encoding immune modulators or the immune modulators themselves. These immune
modulators include granulocyte macrophage colony stimulating factor (GM-CSF) and
CD86.

The vaccines of the present invention may be used to confer resistance to
Enterococcal infection by either passive or active immunization. When the vaccines of
the present invention are used to confer resistance to Enterococcal infection through
active immunization, a vaccine of the present invention is administered to an animal to
elicit a protective immune response which either prevents or attenuates an Enterococcal
infection. When the vaccines of the present invention are used to confer resistance to
Enterococcal infection through passive immunization, the vaccine is provided to a host
animal (e.g., human, dog, or mouse), and the antisera elicited by this vaccine are recovered
and directly provided to a recipient suspected of having an infection caused by a member
of the Enterococcus genus.

The ability to label antibodies, or fragments of antibodies, with toxin molecules
provides an additional method for treating Enterococcal infections when passive
immunization is conducted. In this embodiment, antibodies, or fragments of antibodies,
capable of recognizing the E. faecalis polypeptides disclosed herein, or fragments thereof,
as well as other Enterococcus proteins, are labeled with toxin molecules prior to their
administration to the patient. When such toxin-derivatized antibodies bind to
Enterococcus cells, toxin moieties will be localized to these cells and will cause their
death.

The present invention thus concerns and provides a means for preventing or
attenuating an Enterococcal infection resulting from organisms which have antigens that
are recognized and bound by antisera produced in response to the polypeptides of the
present invention. As used herein, a vaccine is said to prevent or attenuate a disease if its
administration to an animal results either in the total or partial attenuation (i.e.,
suppression) of a symptom or condition of the disease, or in the total or partial immunity
of the animal to the disease.

The administration of the vaccine (or the antisera which it elicits) may be for
either a "prophylactic" or "therapeutic" purpose. When provided prophylactically, the
compound(s) are provided in advance of any symptoms of Enterococcal infection. The
prophylactic administration of the compound(s) serves to prevent or attenuate any
subsequent infection. When provided therapeutically, the compound(s) are provided upon
or after the detection of symptoms which indicate that an animal may be infected with a
member of the Enterococcus genus. The therapeutic administration of the compound(s)
serves to attenuate any actual infection. Thus, the E. faecalis polypeptides, and
fragments thereof, of the present invention may be provided either prior to the onset of
infection (so as to prevent or attenuate an anticipated infection) or after the initiation of
an actual infection.

The polypeptides of the invention, whether corresponding to a portion of a native
protein or a functional derivative thereof, may be administered in pure form or may be
coupled to a macromolecular carrier. Examples of such carriers are proteins and
carbohydrates. Suitable proteins which may act as macromolecular carriers for enhancing
the immunogenicity of the polypeptides of the present invention include keyhole limpet
hemocyanin (KLH), tetanus toxoid, pertussis toxin, bovine serum albumin, and ovalbumin.
Methods for coupling the polypeptides of the present invention to such macromolecular
carriers are disclosed in Harlow et al., ANTIBODIES: A LABORATORY MANUAL,
(Cold Spring Harbor Laboratory Press, 2nd ed. 1988).

A composition is said to be "pharmacologically or physiologically acceptable" if
its administration can be tolerated by a recipient animal and is otherwise suitable for
administration to that animal. Such an agent is said to be administered in a
"therapeutically effective amount" if the amount administered is physiologically
significant. An agent is physiologically significant if its presence results in a detectable
change in the physiology of a recipient patient.

While in all instances the vaccine of the present invention is administered as a
pharmacologically acceptable compound, one skilled in the art would recognize that the
composition of a pharmacologically acceptable compound varies with the animal to
which it is administered. For example, a vaccine intended for human use will generally
not be co-administered with Freund's adjuvant. Further, the level of purity of the E.
faecalis polypeptides of the present invention will normally be higher when administered
to a human than when administered to a non-human animal.

As would be understood by one of ordinary skill in the art, when the vaccine of
the present invention is provided to an animal, it may be in a composition which may
contain salts, buffers, adjuvants, or other substances which are desirable for improving the
efficacy of the composition. Adjuvants are substances that can be used to specifically
augment a specific immune response. These substances generally perform two functions:
(1) they protect the antigen(s) from being rapidly catabolized after administration and (2)
they nonspecifically stimulate immune responses.

Normally, the adjuvant and the composition are mixed prior to presentation to
the immune system, or presented separately, but into the same site of the animal being
immunized. Adjuvants can be loosely divided into several groups based upon their
composition. These groups include oil adjuvants (for example, Freund's complete and
incomplete), mineral salts (for example, AlK(SO4)2, AlNa(SO4)2, AlNH4(SO4), silica,
kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and
certain natural substances (for example, wax D from Mycobacterium tuberculosis, as well
as substances found in Corynebacterium parvum, or Bordetella pertussis, and members of
the genus Brucella). Other substances useful as adjuvants are the saponins such as, for
example, Quil A (Superfos A/S, Denmark). Preferred adjuvants for use in the present
invention include aluminum salts, such as AlK(SO4)2, AlNa(SO4)2, and AlNH4(SO4).

Examples of materials suitable for use in vaccine compositions are provided in
REMINGTON'S PHARMACEUTICAL SCIENCES 1324-1341 (A. Osol, ed., Mack
Publishing Co., Easton, PA (1980)) (incorporated herein by reference).

The therapeutic compositions of the present invention can be administered
parenterally by injection, rapid infusion, nasopharyngeal absorption
(intranasopharyngeally), dermoabsorption, or orally. The compositions may
alternatively be administered intramuscularly, or intravenously. Compositions for
parenteral administration include sterile aqueous or non-aqueous solutions, suspensions,
and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene
glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate.
Carriers or occlusive dressings can be used to increase skin permeability and enhance
antigen absorption. Liquid dosage forms for oral administration may generally comprise a
liposome solution containing the liquid dosage form. Suitable forms for suspending
liposomes include emulsions, suspensions, solutions, syrups, and elixirs containing inert
diluents commonly used in the art, such as purified water. Besides the inert diluents, such
compositions can also include adjuvants, wetting agents, emulsifying and suspending
agents, or sweetening, flavoring, or perfuming agents.

Therapeutic compositions of the present invention can also be administered in
encapsulated form. For example, intranasal immunization can be carried out using
vaccines encapsulated in biodegradable microspheres composed of
poly(DL-lactide-co-glycolide). See, Shahin, R. et al. (1995) Infect. Immun.
63:1195-1200. Similarly, orally administered encapsulated
Salmonella typhimurium antigens can also be used. Allaoui-Attarki, K. et al. (1997)
Infect. Immun. 65:853-857. Encapsulated vaccines of the present invention can be
administered by a variety of routes including those involving contacting the vaccine with
mucous membranes (e.g., intranasally, intracolonically, intraduodenally).

Many different techniques exist for the timing of the immunizations when a
multiple administration regimen is utilized. It is possible to use the compositions of the
invention more than once to increase the levels and diversities of expression of the
immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple
immunizations are given, they will be given one to two months apart.

According to the present invention, an "effective amount" of a therapeutic
composition is one which is sufficient to achieve a desired biological effect. Generally,
the dosage needed to provide an effective amount of the composition will vary depending
upon such factors as the animal's or human's age, condition, sex, and extent of disease, if
any, and other variables which can be adjusted by one of ordinary skill in the art.

The antigenic preparations of the invention can be administered by either single
or multiple dosages of an effective amount. Effective amounts of the compositions of
the invention can vary from 0.01-1,000 µg/ml per dose, more preferably 0.1-500 µg/ml
per dose, and most preferably 10-300 µg/ml per dose.

6. Shot-Gun Approach to Megabase DNA Sequencing 

The present invention further demonstrates that a large genome can be sequenced
using a random shotgun approach. This procedure, described in detail in the examples
that follow, has eliminated the up-front cost of isolating and ordering overlapping or
contiguous subclones prior to the start of the sequencing protocols.

Certain aspects of the present invention are described in greater detail in the
examples that follow. The examples are provided by way of illustration. Other aspects
and embodiments of the present invention are contemplated by the inventors, as will be
clear to those of skill in the art from reading the present disclosure.

ILLUSTRATIVE EXAMPLES 

LIBRARIES AND SEQUENCING

1. Shotgun Sequencing Probability Analysis

The overall strategy for a shotgun approach to whole genome sequencing follows
from the Lander and Waterman (Lander and Waterman, Genomics 2:231 (1988))
application of the equation for the Poisson distribution. According to this treatment, the
probability, P0, that any given base in a sequence of size L, in nucleotides, is not
sequenced after a certain amount, n, in nucleotides, of random sequence has been
determined can be calculated by the equation $P_0 = e^{-m}$, where m is n/L, the fold
coverage. For instance, for a genome of 2.8 Mb, m = 1 when 2.8 Mb of sequence has been
randomly generated (1X coverage). At that point, $P_0 = e^{-1} = 0.37$. The probability that
any given base has not been sequenced is the same as the probability that any region of
the whole sequence L has not been determined and, therefore, is equivalent to the fraction
of the whole sequence that has yet to be determined. Thus, at one-fold coverage,
approximately 37% of a polynucleotide of size L, in nucleotides, has not been sequenced.
When 14 Mb of sequence has been generated, coverage is 5X for a 2.8 Mb genome and the
unsequenced fraction drops to 0.0067 or 0.67%. 5X coverage of a 2.8 Mb sequence can be
attained by sequencing approximately 17,000 random clones from both insert ends with
an average sequence read length of 410 bp.
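
The arithmetic of the preceding paragraph can be checked with a short calculation. The
following Python sketch is illustrative only; the function names are chosen for this
example and are not part of the disclosure. It evaluates the Poisson estimate of the
unsequenced fraction and the amount of raw sequence obtained from a given number of
clones read from both insert ends.

```python
import math


def unsequenced_fraction(bases_sequenced: float, genome_size: float) -> float:
    """Poisson estimate P0 = exp(-m), with fold coverage m = n / L."""
    fold_coverage = bases_sequenced / genome_size
    return math.exp(-fold_coverage)


def bases_from_clones(clones: int, read_length: int, ends_per_clone: int = 2) -> int:
    """Raw sequence (in bases) from reading each clone at `ends_per_clone` ends."""
    return clones * ends_per_clone * read_length


if __name__ == "__main__":
    genome = 2_800_000  # 2.8 Mb genome, as in the text
    for raw in (2_800_000, 14_000_000):
        print(f"{raw / genome:.0f}X coverage: "
              f"unsequenced fraction = {unsequenced_fraction(raw, genome):.4f}")
    raw = bases_from_clones(clones=17_000, read_length=410)
    print(f"17,000 clones x 2 ends x 410 bp = {raw:,} bases "
          f"({raw / genome:.2f}X coverage)")
```

Under these assumptions, 17,000 clones sequenced from both ends give 13.94 Mb of raw
sequence, which is just under the 14 Mb (5X) figure quoted above.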

Similarly, the total gap length, G, is determined by the equation $G = Le^{-m}$, and the
average gap size, g, follows the equation $g = L/n$, where n here denotes the number of
random sequence reads rather than the number of nucleotides sequenced. Thus, 5X
coverage leaves about 240 gaps averaging about 82 bp in size in a polynucleotide 2.8 Mb
long.

The treatment above is essentially that of Lander and Waterman, Genomics 2:231
(1988).
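
The gap estimates can be reproduced in the same way. The Python sketch below is
illustrative only and rests on two assumptions that are not stated explicitly in the
text: that the quantity dividing L in the average gap size is the number of sequence
reads (here 34,000, i.e. 17,000 clones read from both ends), and that the expected
number of gaps is the total gap length divided by the average gap size.

```python
import math


def gap_statistics(genome_size: float, n_reads: int, read_length: float):
    """Return (fold coverage, total gap length, average gap size, number of gaps)."""
    fold_coverage = n_reads * read_length / genome_size   # m
    total_gap = genome_size * math.exp(-fold_coverage)    # G = L * exp(-m)
    average_gap = genome_size / n_reads                   # g = L / (number of reads)
    n_gaps = total_gap / average_gap                      # equals n_reads * exp(-m)
    return fold_coverage, total_gap, average_gap, n_gaps


if __name__ == "__main__":
    m, total_gap, average_gap, n_gaps = gap_statistics(
        genome_size=2_800_000, n_reads=34_000, read_length=410
    )
    print(f"coverage {m:.2f}X, total gap {total_gap:,.0f} bp, "
          f"average gap {average_gap:.0f} bp, about {n_gaps:.0f} gaps")
```

With these inputs the average gap comes out at about 82 bp and the number of gaps falls
in the range of roughly 230 to 240, in line with the approximate figures given above.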

2. Random Library Construction 

In order to approximate the random model described above during actual
sequencing, a nearly ideal library of cloned genomic fragments is required. The following
library construction procedure was developed to achieve this end.

Enterococcus faecalis DNA is prepared by phenol extraction. A mixture
containing 200 µg DNA in 1.0 ml of 300 mM sodium acetate, 10 mM Tris-HCl, 1 mM
Na-EDTA, 50% glycerol is processed through a nebulizer (IPI Medical Products) with a
stream of nitrogen adjusted to 35 kPa for 2 minutes. The sheared DNA is ethanol
precipitated and redissolved in 500 µl TE buffer.

To create blunt-ends, a 100 µl aliquot of the resuspended DNA is digested with 5
units of BAL31 nuclease (New England BioLabs) for 10 min at 30°C in 200 µl BAL31
buffer. The digested DNA is phenol-extracted, ethanol-precipitated, redissolved in 100 µl
TE buffer, and then size-fractionated by electrophoresis through a 1.0% low melting
temperature agarose gel. The section containing DNA fragments 1.6-2.0 kb in size is
excised from the gel, and the LGT agarose is melted and the resulting solution is extracted
with phenol to separate the agarose from the DNA. DNA is ethanol precipitated and
redissolved in 20 µl of TE buffer for ligation to vector.

A two-step ligation procedure is used to produce a plasmid library with 97%
inserts, of which >99% are single inserts. The first ligation mixture (50 µl) contains 2
µg of DNA fragments, 2 µg pUC18 DNA (Pharmacia) cut with SmaI and
dephosphorylated with bacterial alkaline phosphatase, and 10 units of T4 ligase
(GIBCO/BRL) and is incubated at 14°C for 4 hr. The ligation mixture then is phenol
extracted and ethanol precipitated, and the precipitated DNA is dissolved in 20 µl TE
buffer and electrophoresed on a 1.0% low melting agarose gel. Discrete bands in a ladder
are visualized by ethidium bromide-staining and UV illumination and identified by size as
insert (I), vector (v), v+I, v+2I, v+3I, etc. The portion of the gel containing v+I DNA is
excised and the v+I DNA is recovered and resuspended into 20 µl TE. The v+I DNA then
is blunt-ended by T4 polymerase treatment for 5 min. at 37°C in a reaction mixture (50
µl) containing the v+I linears, 500 µM each of the 4 dNTPs, and 9 units of T4
polymerase (New England BioLabs), under recommended buffer conditions. After phenol
extraction and ethanol precipitation the repaired v+I linears are dissolved in 20 µl TE.
The final ligation to produce circles is carried out in a 50 µl reaction containing 5 µl of
v+I linears and 5 units of T4 ligase at 14°C overnight. After 10 min. at 70°C the
following day, the reaction mixture is stored at -20°C.

This two-stage procedure results in a molecularly random collection of single-
insert plasmid recombinants with minimal contamination from double-insert chimeras
(<1%) or free vector (<3%).

Since deviation from randomness can arise from propagation of the DNA in the
host, E. coli host cells deficient in all recombination and restriction functions (A.
Greener, Strategies 3(1):5 (1990)) are used to prevent rearrangements, deletions, and loss
of clones by restriction. Furthermore, transformed cells are plated directly on antibiotic
diffusion plates to avoid the usual broth recovery phase which allows multiplication and
selection of the most rapidly growing cells.

Plating is carried out as follows. A 100 µl aliquot of Epicurian Coli SURE II
Supercompetent Cells (Stratagene 200152) is thawed on ice and transferred to a chilled
Falcon 2059 tube on ice. A 1.7 µl aliquot of 1.42 M beta-mercaptoethanol is added to the
aliquot of cells to a final concentration of 25 mM. Cells are incubated on ice for 10 min.
A 1 µl aliquot of the final ligation is added to the cells and incubated on ice for 30 min.
The cells are heat pulsed for 30 sec. at 42°C and placed back on ice for 2 min. The
outgrowth period in liquid culture is eliminated from this protocol in order to minimize
the preferential growth of any given transformed cell. Instead the transformation
mixture is plated directly on a nutrient rich SOB plate containing a 5 ml bottom layer of
SOB agar (5% SOB agar: 20 g tryptone, 5 g yeast extract, 0.5 g NaCl, 1.5% Difco Agar
per liter of media). The 5 ml bottom layer is supplemented with 0.4 ml of 50 mg/ml
ampicillin per 100 ml SOB agar. The 15 ml top layer of SOB agar is supplemented with 1
ml X-Gal (2%), 1 ml MgCl2 (1 M), and 1 ml MgSO4/100 ml SOB agar. The 15 ml top
layer is poured just prior to plating. Our titer is approximately 100 colonies/10 µl aliquot
of transformation.
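
The working concentrations implied by the volumes in the plating protocol can be
verified with a simple dilution calculation. The following Python sketch is
illustrative only; the helper name is chosen for this example, and the calculation
assumes ideal mixing with additive volumes.

```python
def final_concentration(stock_conc: float, stock_volume: float, base_volume: float) -> float:
    """Concentration after adding `stock_volume` of stock to `base_volume` of diluent.

    The result is in the units of `stock_conc`; both volumes must share one unit.
    """
    return stock_conc * stock_volume / (stock_volume + base_volume)


if __name__ == "__main__":
    # 1.7 µl of 1.42 M (1420 mM) beta-mercaptoethanol added to 100 µl of cells.
    bme_mM = final_concentration(stock_conc=1420.0, stock_volume=1.7, base_volume=100.0)
    print(f"beta-mercaptoethanol: {bme_mM:.1f} mM")

    # 0.4 ml of 50 mg/ml ampicillin added per 100 ml of SOB agar.
    amp_mg_per_ml = final_concentration(stock_conc=50.0, stock_volume=0.4, base_volume=100.0)
    print(f"ampicillin: {amp_mg_per_ml * 1000:.0f} µg/ml")
```

On these assumptions the beta-mercaptoethanol concentration is close to 24 mM,
consistent with the nominal 25 mM stated above, and the ampicillin concentration in
the bottom layer is approximately 200 µg/ml.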

All colonies are picked for template preparation regardless of size. Thus, only
clones lost due to "poison" DNA or deleterious gene products are deleted from the library,
resulting in a slight increase in gap number over that expected.

3. Random DNA Sequencing 

High quality double stranded DNA plasmid templates are prepared using a "boiling
bead" method developed in collaboration with Advanced Genetic Technology Corp.
(Gaithersburg, MD) (Adams et al., Science 252:1651 (1991); Adams et al., Nature
355:632 (1992)). Plasmid preparation is performed in a 96-well format for all stages of
DNA preparation from bacterial growth through final DNA purification. Template
concentration is determined using Hoechst Dye and a Millipore Cytofluor. DNA
concentrations are not adjusted, but low-yielding templates are identified where possible
and not sequenced.

Templates are also prepared from an Enterococcus faecalis lambda genomic
library in the vector DASH II (Stratagene). In particular, Enterococcus faecalis DNA (>
100 kb) is partially digested in a reaction mixture (200 µl) containing 50 µg DNA, 1X
Sau3AI buffer, 20 units Sau3AI for 6 min. at 23°C. The digested DNA is phenol-
extracted and fractionated by sucrose density gradient centrifugation. Fractions of the
sucrose gradient containing 15 to 25 kb are recovered in a final volume of 6 µl. One µl
of fragments is used with 1 µl of lambda DASH II vector (Stratagene) in the recommended
ligation reaction. One µl of the ligation mixture is used per packaging reaction following
the recommended protocol with the Gigapack II XL Packaging Extract (Stratagene,
#227711). Phage are plated directly without amplification from the packaging mixture
(after dilution with 500 µl of recommended SM buffer and chloroform treatment). Yield
is about 2.5x10^3 pfu/µl. An amplified library is prepared by infecting restrictive NM539
host E. coli cells with approximately 1x10^4 phage particles and recovering the progeny
phage particles. The recovered phage is stored frozen in 7% dimethylsulfoxide. The
phage titer is approximately 1x10^9 pfu/ml.

For high throughput sequencing of individual lambda phage clones, liquid lysates 
(100 µl) are prepared from randomly selected plaques (from the unamplified library) and 
template is prepared by long-range PCR using T7 and T3 vector-specific primers. 

Sequencing reactions are carried out on plasmid and/or PCR templates using the 
AB Catalyst LabStation with Applied Biosystems PRISM Ready Reaction Dye Primer 
Cycle Sequencing Kits for the M13 forward (M13-21) and the M13 reverse (M13RP1) 
primers (Adams et al., Nature 368:474 (1994)). Dye terminator sequencing reactions are 
carried out on the lambda templates on a Perkin-Elmer 9600 Thermocycler using the 
Applied Biosystems Ready Reaction Dye Terminator Cycle Sequencing kits. T7 and T3 
primers are used to sequence the ends of the inserts from the Lambda DASH II library. 
Sequencing reactions are performed by eight individuals using an average of fourteen AB 
373 DNA Sequencers per day. All sequencing reactions are analyzed using the Stretch 
modification of the AB 373, primarily using a 34 cm well-to-read distance. The overall 
sequencing success rate is approximately 85% for M13-21 and M13RP1 
sequences and 65% for dye-terminator reactions. The average usable read length is 485 
bp for M13-21 sequences, 445 bp for M13RP1 sequences, and 375 bp for dye-terminator 
reactions. 

Richards et al., Chapter 28 in AUTOMATED DNA SEQUENCING AND 
ANALYSIS, M. D. Adams, C. Fields, J. C. Venter, Eds., Academic Press, London, (1994) 
described the value of using sequence from both ends of sequencing templates to facilitate 
ordering of contigs in shotgun assembly projects of lambda and cosmid clones. We 
balance the desirability of both-end sequencing (including the reduced cost of lower total 
number of templates) against shorter read lengths for sequencing reactions performed 
with the M13RP1 (reverse) primer compared to the M13-21 (forward) primer. 
Approximately one-half of the templates are sequenced from both ends. Random reverse 
sequencing reactions are done based on successful forward sequencing reactions. Some 
M13RP1 sequences are obtained in a semi-directed fashion: M13-21 sequences pointing 
outward at the ends of contigs are chosen for M13RP1 sequencing in an effort to 
specifically order contigs. 

4. Protocol for Automated Cycle Sequencing 

The sequencing was carried out using ABI Catalyst robots and AB 373 Automated 
DNA Sequencers. The Catalyst robot is a publicly available sophisticated pipetting and 
temperature control robot which has been developed specifically for DNA sequencing 
reactions. The Catalyst combines pre-aliquoted templates and reaction mixes consisting 
of deoxy- and dideoxynucleotides, the thermostable Taq DNA polymerase, 
fluorescently-labelled sequencing primers, and reaction buffer. Reaction mixes and 
templates are combined in the wells of an aluminum 96-well thermocycling plate. Thirty 
consecutive cycles of linear amplification (i.e., one primer synthesis) steps are performed 
including denaturation, annealing of primer and template, and extension; i.e., DNA 
synthesis. A heated lid with rubber gaskets on the thermocycling plate prevents 
evaporation without the need for an oil overlay. 

Two sequencing protocols are used: one for dye-labelled primers and a second for 
dye-labelled dideoxy chain terminators. The shotgun sequencing involves use of four 
dye-labelled sequencing primers, one for each of the four terminator nucleotides. Each 
dye-primer is labelled with a different fluorescent dye, permitting the four individual 
reactions to be combined into one lane of the 373 DNA Sequencer for electrophoresis, 
detection, and base-calling. ABI currently supplies pre-mixed reaction mixes in bulk 
packages containing all the necessary non-template reagents for sequencing. Sequencing 
can be done with both plasmid and PCR-generated templates with both dye-primers and 
dye-terminators with approximately equal fidelity, although plasmid templates generally 
give longer usable sequences. 

Thirty-two reactions are loaded per AB373 Sequencer each day, for a total of 960 
samples. Electrophoresis is run overnight following the manufacturer's protocols, and the 
data is collected for twelve hours. Following electrophoresis and fluorescence detection, 
the ABI 373 performs automatic lane tracking and base-calling. The lane-tracking is 
confirmed visually. Each sequence electropherogram (or fluorescence lane trace) is 
inspected visually and assessed for quality. Trailing sequences of low quality are removed 
and the sequence itself is loaded via software to a Sybase database (archived daily to 8mm 
tape). Leading vector polylinker sequence is removed automatically by a software 
program. Average edited lengths of sequences from the standard ABI 373 are around 400 
bp and depend mostly on the quality of the template used for the sequencing reaction. 
ABI 373 Sequencers converted to Stretch Liners provide a longer electrophoresis path 
prior to fluorescence detection and increase the average number of usable bases to 
500-600 bp. 

INFORMATICS 

1. Data Management 

A number of information management systems for a large-scale sequencing lab 
have been developed. (For review see, for instance, Kerlavage et al., Proceedings of the 
Twenty-Sixth Annual Hawaii International Conference on System Sciences, IEEE 
Computer Society Press, Washington, D.C., 585 (1993).) The system used to collect and 
assemble the sequence data was developed using the Sybase relational database 
management system and was designed to automate data flow wherever possible and to 
reduce user error. The database stores and correlates all information collected during the 
entire operation from template preparation to final analysis of the genome. Because the 
raw output of the ABI 373 Sequencers was based on a Macintosh platform and the data 
management system chosen is based on a Unix platform, it was necessary to design and 
implement a variety of multi-user, client-server applications which allow the raw data as 
well as analysis results to flow seamlessly into the database with a minimum of user effort. 

2. Assembly 

An assembly engine (TIGR Assembler) developed for the rapid and accurate 
assembly of thousands of sequence fragments is employed to generate contigs. The TIGR 
Assembler simultaneously clusters and assembles fragments of the genome. In order to 
obtain the speed necessary to assemble more than 10^4 fragments, the algorithm builds a 
hash table of 10 bp oligonucleotide subsequences to generate a list of potential sequence 
fragment overlaps. The number of potential overlaps for each fragment determines 
which fragments are likely to fall into repetitive elements. Beginning with a single seed 
sequence fragment, TIGR Assembler extends the current contig by attempting to add the 
best matching fragment based on oligonucleotide content. The contig and candidate 
fragment are aligned using a modified version of the Smith-Waterman algorithm which 
provides for optimal gapped alignments (Waterman, M. S., Methods in Enzymology 
164:165 (1988)). The contig is extended by the fragment only if strict criteria for the 
quality of the match are met. The match criteria include the minimum length of overlap, 
the maximum length of an unmatched end, and the minimum percentage match. These 
criteria are automatically lowered by the algorithm in regions of minimal coverage and 
raised in regions with a possible repetitive element. 
Fragments representing the boundaries of repetitive elements and potentially chimeric 
fragments are often rejected based on partial mismatches at the ends of alignments and 
excluded from the current contig. TIGR Assembler is designed to take advantage of clone 
size information coupled with sequencing from both ends of each template. It enforces 
the constraint that sequence fragments from two ends of the same template point toward 
one another in the contig and are located within a certain range of base pairs (definable 
for each clone based on the known clone size range for a given library). 
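
The oligonucleotide hash table step described above can be illustrated with a minimal sketch. It is not the TIGR Assembler implementation: the function names, the dictionary-of-sets layout and the toy fragments are assumptions made for illustration, and reverse-complement matches and the subsequent Smith-Waterman alignment are left out.

```python
from collections import defaultdict
from itertools import combinations


def kmer_index(fragments, k=10):
    """Map every k-mer to the set of fragment ids that contain it."""
    index = defaultdict(set)
    for frag_id, seq in fragments.items():
        for i in range(len(seq) - k + 1):
            index[seq[i:i + k]].add(frag_id)
    return index


def candidate_overlaps(fragments, k=10):
    """Count distinct shared k-mers for each fragment pair sharing at least one."""
    shared = defaultdict(int)
    for ids in kmer_index(fragments, k).values():
        for a, b in combinations(sorted(ids), 2):
            shared[(a, b)] += 1
    return dict(shared)


def overlap_counts(fragments, k=10):
    """Number of candidate partners per fragment; unusually high counts hint at repeats."""
    partners = defaultdict(set)
    for a, b in candidate_overlaps(fragments, k):
        partners[a].add(b)
        partners[b].add(a)
    return {frag_id: len(partners[frag_id]) for frag_id in fragments}


if __name__ == "__main__":
    # Hypothetical toy fragments; k is shortened to 8 so the short example overlaps.
    toy = {
        "f1": "ACGTACGTTAGCCATG",
        "f2": "TAGCCATGGGATCCAA",
        "f3": "TTTTTTTTTTTTTTTT",
    }
    print(candidate_overlaps(toy, k=8))
    print(overlap_counts(toy, k=8))
```

In this sketch only fragment pairs that share at least one k-mer are ever considered, which is what keeps the candidate list small; each candidate pair would still need a gapped alignment and the match criteria described above before a contig is extended.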

The process resulted in 982 contigs as represented by SEQ ID NOs: 1-982. 

3. Identifying Genes 

The predicted coding regions of the Enterococcus faecalis genome were initially 
defined with the program GeneMark, which finds ORFs using a probabilistic classification 
technique. The predicted coding region sequences were used in searches against a database 
of all Enterococcus faecalis nucleotide sequences from GenBank (March, 1997), using the 
BLASTN search method to identify overlaps of 50 or more nucleotides with at least a 
95% identity. Those ORFs with nucleotide sequence matches are shown in Table 1. The 
ORFs without such matches were translated to protein sequences and compared to a 
non-redundant database of known proteins generated by combining the Swiss-Prot, PIR and 
GenPept databases. ORFs that matched a database protein with BLASTP probability less 
than or equal to 0.01 are shown in Table 2. The table also lists assigned functions based 
on the closest match in the databases. ORFs that did not match protein or nucleotide 
sequences in the databases at these levels are shown in Table 3. 
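
The three-way sorting of ORFs into Tables 1-3 can be sketched as a simple decision rule. The thresholds (50 nucleotides, 95% identity, BLASTP probability of 0.01) come from the text above; the `Hit` record, the function names and the example values are hypothetical and do not reflect any particular BLAST output format.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """Hypothetical summary of one BLASTN match for an ORF."""
    orf_id: str
    subject_id: str
    overlap_len: int          # aligned nucleotides
    percent_identity: float   # 0-100


def passes_nucleotide_cutoff(hit, min_overlap=50, min_identity=95.0):
    """Overlap of 50 or more nucleotides with at least 95% identity."""
    return hit.overlap_len >= min_overlap and hit.percent_identity >= min_identity


def assign_table(nucleotide_hits, best_blastp_probability, max_probability=0.01):
    """Return 1, 2 or 3 following the order of tests described in the text.

    best_blastp_probability is None when the ORF has no protein match at all.
    """
    if any(passes_nucleotide_cutoff(h) for h in nucleotide_hits):
        return 1
    if best_blastp_probability is not None and best_blastp_probability <= max_probability:
        return 2
    return 3


if __name__ == "__main__":
    strong = Hit("orf1", "subject_A", 120, 98.5)   # illustrative values only
    print(assign_table([strong], None))
```

The nucleotide test is applied first, so an ORF that already matches a known E. faecalis sequence is never passed on to the protein comparison.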

ILLUSTRATIVE APPLICATIONS 

1. Production of an Antibody to an Enterococcus faecalis Protein 

Substantially pure protein or polypeptide is isolated from the transfected or 
transformed cells using any one of the methods known in the art. The protein can also 
be produced in a recombinant prokaryotic expression system, such as E. coli, or can be 
chemically synthesized. Concentration of protein in the final preparation is adjusted, for 
example, by concentration on an Amicon filter device, to the level of a few 
micrograms/ml. Monoclonal or polyclonal antibody to the protein can then be prepared 
as follows. 

2. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and isolated as 
described can be prepared from murine hybridomas according to the classical method of 
Kohler, G. and Milstein, C., Nature 256:495 (1975) or modifications of the methods 
thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of the selected 
protein over a period of a few weeks. The mouse is then sacrificed, and the antibody 
producing cells of the spleen isolated. The spleen cells are fused by means of 
polyethylene glycol with mouse myeloma cells, and the excess unfused cells destroyed by 
growth of the system on selective media comprising aminopterin (HAT media). The 
successfully fused cells are diluted and aliquots of the dilution placed in wells of a 
microtiter plate where growth of the culture is continued. Antibody-producing clones are 
identified by detection of antibody in the supernatant fluid of the wells by immunoassay 
procedures, such as ELISA, as originally described by Engvall, E., Meth. Enzymol. 70:419 
(1980), and modified methods thereof. Selected positive clones can be expanded and 
their monoclonal antibody product harvested for use. Detailed procedures for monoclonal 
antibody production are described in Davis, L. et al., Basic Methods in Molecular Biology, 
Elsevier, New York, Section 21-2 (1989). 

3. Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single 
protein can be prepared by immunizing suitable animals with the expressed protein 
described above, which can be unmodified or modified to enhance immunogenicity. 
Effective polyclonal antibody production is affected by many factors related both to the 
antigen and the host species. For example, small molecules tend to be less immunogenic 
than others and may require the use of carriers and adjuvant. Also, host animals vary in 
response to site of inoculations and dose, with both inadequate and excessive doses of 
antigen resulting in low titer antisera. Small doses (ng level) of antigen administered at 
multiple intradermal sites appear to be most reliable. An effective immunization 
protocol for rabbits can be found in Vaitukaitis, J. et al., J. Clin. Endocrinol. Metab. 
33:988-991 (1971). 

Booster injections can be given at regular intervals, and antiserum harvested when 
antibody titer thereof, as determined semi-quantitatively, for example, by double 
immunodiffusion in agar against known concentrations of the antigen, begins to fall. See, 
for example, Ouchterlony, O. et al., Chap. 19 in: Handbook of Experimental 
Immunology, Weir, D., ed., Blackwell (1973). Plateau concentration of antibody is 
usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 µM). Affinity of the 
antisera for the antigen is determined by preparing competitive binding curves, as 
described, for example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 
second edition, Rose and Friedman, eds., Amer. Soc. For Microbiology, Washington, D.C. 
(1980). 

Antibody preparations prepared according to either protocol are useful in 
quantitative immunoassays which determine concentrations of antigen-bearing substances 
in biological samples; they are also used semi-quantitatively or qualitatively to identify 
the presence of antigen in a biological sample. In addition, antibodies are useful in various 
animal models of enterococcal disease as a means of evaluating the protein used to make 
the antibody as a potential vaccine target or as a means of evaluating the antibody as a 
potential immunotherapeutic or immunoprophylactic reagent. 

4. Preparation of PCR Primers and Amplification of DNA 

Various fragments of the Enterococcus faecalis genome, such as those of Tables 
1-3 and SEQ ID NOS: 1-982, can be used, in accordance with the present invention, to 
prepare PCR primers for a variety of uses. The PCR primers are preferably at least 15 
bases, and more preferably at least 18 bases in length. When selecting a primer sequence, 
it is preferred that the primer pairs have approximately the same G/C ratio, so that 
melting temperatures are approximately the same. The PCR primers and amplified DNA 
of this Example find use in the Examples that follow. 
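
The preference for primer pairs with similar G/C ratio and melting temperature can be checked with a short calculation. The sketch below uses the Wallace rule of thumb (Tm = 2 x (A+T) + 4 x (G+C), in °C), which is an assumption introduced here for illustration and is not named in the text; the 4 °C tolerance and the example primer sequences are likewise hypothetical.

```python
def gc_fraction(primer):
    """Fraction of G and C bases in the primer (0.0 to 1.0)."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def wallace_tm(primer):
    """Rule-of-thumb melting temperature in degrees C for short oligonucleotides."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


def compatible_pair(forward, reverse, min_length=15, max_tm_difference=4):
    """True if both primers meet the minimum length and have similar estimated Tm."""
    long_enough = len(forward) >= min_length and len(reverse) >= min_length
    similar_tm = abs(wallace_tm(forward) - wallace_tm(reverse)) <= max_tm_difference
    return long_enough and similar_tm


if __name__ == "__main__":
    fwd = "ATGGCTAGCTAGGATCCA"   # hypothetical 18-mer
    rev = "TTAGCGGCCGCATTAGCT"   # hypothetical 18-mer
    print(gc_fraction(fwd), wallace_tm(fwd))
    print(gc_fraction(rev), wallace_tm(rev))
    print(compatible_pair(fwd, rev))
```

The rule of thumb is only a rough guide for primers of roughly the length range given above; salt-corrected or nearest-neighbor estimates would give different absolute values.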

5. Isolation of a Selected DNA Clone From the Deposited 
Sample of E. faecalis 

Three approaches can be used to isolate an E. faecalis clone comprising a 
polynucleotide of the present invention from any E. faecalis genomic DNA library. The 
E. faecalis strain V586 has been deposited as a convenient source for obtaining an E. 
faecalis strain, although a wide variety of E. faecalis strains known in the art can be 
used. 

E. faecalis genomic DNA is prepared using the following method. A 20 ml 
overnight bacterial culture is grown in a rich medium (e.g., Trypticase Soy Broth, Brain 
Heart Infusion broth or Super broth), pelleted, washed two times with TES (30 mM 
Tris-pH 8.0, 25 mM EDTA, 50 mM NaCl), and resuspended in 5 ml high salt TES (2.5 M 
NaCl). Lysostaphin is added to a final concentration of approx. 50 µg/ml and the mixture 
is rotated slowly for 1 hour at 37°C to make protoplast cells. The solution is then placed 
in an incubator (or in a shaking water bath) and warmed to 55°C. Five hundred microliters 
of 20% sarcosyl in TES (final concentration 2%) is then added to lyse the cells. Next, 
guanidine HCl is added to a final concentration of 7 M (3.69 g in 5.5 ml). The mixture is 
swirled slowly at 55°C for 60-90 min (solution should clear). A CsCl gradient is then set 
up in SW41 ultra clear tubes using 2.0 ml 5.7 M CsCl and overlaying with 2.85 M CsCl. The 
gradient is carefully overlaid with the DNA-containing GuHCl solution. The gradient is 
spun at 30,000 rpm, 20°C for 24 hr and the lower DNA band is collected. The volume is 
increased to 5 ml with TE buffer. The DNA is then treated with protease K (10 µg/ml) 
overnight at 37°C, and precipitated with ethanol. The precipitated DNA is resuspended in 
a desired buffer. 
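
As a quick arithmetic check on two of the concentrations quoted in this preparation, the sketch below recomputes the guanidine HCl molarity and the sarcosyl percentage. The molar mass of guanidine hydrochloride (95.53 g/mol) is supplied here and is not stated in the text; the function names are hypothetical, and the calculation ignores the volume change on dissolving the solid.

```python
GUANIDINE_HCL_G_PER_MOL = 95.53  # molar mass of guanidine hydrochloride (not from the text)


def molarity(mass_g, volume_ml, g_per_mol):
    """Moles of solute per liter, ignoring any volume change on dissolving."""
    return mass_g / g_per_mol / (volume_ml / 1000.0)


def percent_after_mixing(stock_percent, stock_ml, sample_ml):
    """Final percentage when stock_ml of stock is added to sample_ml of sample."""
    return stock_percent * stock_ml / (stock_ml + sample_ml)


if __name__ == "__main__":
    # 3.69 g of guanidine HCl in 5.5 ml, as quoted in the text
    print(round(molarity(3.69, 5.5, GUANIDINE_HCL_G_PER_MOL), 2))
    # 0.5 ml of 20% sarcosyl added to the 5 ml cell suspension
    print(round(percent_after_mixing(20.0, 0.5, 5.0), 2))
```

Under these assumptions the quoted mass corresponds to roughly 7 M, and the sarcosyl addition comes out slightly under the nominal 2%.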

In the first method, a plasmid is directly isolated by screening a plasmid E. 
faecalis genomic DNA library using a polynucleotide probe corresponding to a 
polynucleotide of the present invention. Particularly, a specific polynucleotide with 
30-40 nucleotides is synthesized using an Applied Biosystems DNA synthesizer according 
to the sequence reported. The oligonucleotide is labeled, for instance, with 32P-γ-ATP 
using T4 polynucleotide kinase and purified according to routine methods. (See, e.g., 
Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold 
Spring Harbor, NY (1982).) The library is transformed into a suitable host, as indicated 
above (such as XL-1 Blue (Stratagene)) using techniques known to those of skill in the 
art. See, e.g., Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold 
Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR 
BIOLOGY (John Wiley and Sons, N.Y. 1989). The transformants are plated on 1.5% 
agar plates (containing the appropriate selection agent, e.g., ampicillin) to a density of 
about 150 transformants (colonies) per plate. These plates are screened using Nylon 
membranes according to routine methods for bacterial colony screening. See, e.g., 
Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring 
Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR 
BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to those of skill 
in the art. 

Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends of 
a polynucleotide of SEQ ID NOS: 1-982 are synthesized and used to amplify the desired 
DNA by PCR using an E. faecalis genomic DNA prep as a template. PCR is carried out 
under routine conditions, for instance, in 25 µl of reaction mixture with 0.5 µg of the 
above DNA template. A convenient reaction mixture is 1.5-5 mM MgCl2, 0.01% (w/v) 
gelatin, 20 µM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 
Unit of Taq polymerase. Thirty-five cycles of PCR (denaturation at 94°C for 1 min; 
annealing at 55°C for 1 min; elongation at 72°C for 1 min) are performed with a 
Perkin-Elmer Cetus automated thermal cycler. The amplified product is analyzed by 
agarose gel electrophoresis and the DNA band with expected molecular weight is excised 
and purified. The PCR product is verified to be the selected sequence by subcloning and 
sequencing the DNA product. 

Finally, overlapping oligos of the DNA sequences of SEQ ID NOS: 1-982 can be 
chemically synthesized and used to generate a nucleotide sequence of desired length using 
PCR methods known in the art. 

6(a). Expression and Purification of Enterococcal Polypeptides 
in E. coli 

The bacterial expression vector pQE60 was used for bacterial expression of some 
of the polypeptide fragments of the present invention which were used in the soft tissue 
and systemic infection models discussed below. (QIAGEN, Inc., 9259 Eton Avenue, 
Chatsworth, CA, 91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and 
contains a bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome 
binding site ("RBS"), six codons encoding histidine residues that allow affinity purification 
using nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) and 
suitable single restriction enzyme cleavage sites. These elements are arranged such that 
an inserted DNA fragment encoding a polypeptide expresses that polypeptide with the six 
His residues (i.e., a "6 X His tag") covalently linked to the carboxyl terminus of that 
polypeptide. 

The DNA sequence encoding the desired portion of an E. faecalis protein of the 
present invention was amplified from E. faecalis genomic DNA using PCR 
oligonucleotide primers which anneal to the 5' and 3' sequences coding for the portions 
of the E. faecalis polynucleotide shown in SEQ ID NOS: 1-982. Additional nucleotides 
containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5' 
and 3' sequences, respectively. 

For cloning the mature protein, the 5' primer has a sequence containing an 
appropriate restriction site followed by nucleotides of the amino terminal coding sequence 
of the desired E. faecalis polynucleotide sequence in SEQ ID NOS: 1-982. One of ordinary 
skill in the art would appreciate that the point in the protein coding sequence where the 5' 
and 3' primers begin may be varied to amplify a DNA segment encoding any desired 
portion of the complete protein shorter or longer than the mature form. The 3' primer 
has a sequence containing an appropriate restriction site followed by nucleotides 
complementary to the 3' end of the polypeptide coding sequence of SEQ ID NOS: 1-982, 
excluding a stop codon, with the coding sequence aligned with the restriction site so as to 
maintain its reading frame with that of the six His codons in the pQE60 vector. 
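
The two requirements placed on the amplified coding segment for a carboxyl-terminal tag (no stop codon, and a reading frame that continues into the six His codons) can be expressed as a simple check. This is a minimal sketch under stated assumptions: the function names and example sequences are hypothetical, the standard stop codons TAA, TAG and TGA are assumed, and the bases contributed by the restriction site between the insert and the tag are not modeled.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(coding_sequence):
    """Split a coding sequence into complete in-frame codons, starting at position 0."""
    seq = coding_sequence.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def reads_through_to_tag(coding_sequence):
    """True if the segment is a whole number of codons and contains no in-frame stop."""
    if len(coding_sequence) == 0 or len(coding_sequence) % 3 != 0:
        return False
    return not any(codon in STOP_CODONS for codon in codons(coding_sequence))


if __name__ == "__main__":
    print(reads_through_to_tag("ATGGCTAGC"))   # hypothetical insert without a stop codon
    print(reads_through_to_tag("ATGGCTTAA"))   # same insert ending in a stop codon
```

A segment that fails either test would terminate translation before the tag or shift the His codons out of frame, which is why the 3' primer is designed to exclude the stop codon.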

The amplified E. faecalis DNA fragment and the vector pQE60 were digested 
with restriction enzymes which recognize the sites in the primers and the digested DNAs 
were then ligated together. The E. faecalis DNA was inserted into the restricted pQE60 
vector in a manner which places the E. faecalis protein coding region downstream from 
the IPTG-inducible promoter and in-frame with an initiating AUG and the six histidine 
codons. 

The ligation mixture was transformed into competent E. coli cells using standard 
procedures such as those described by Sambrook et al., supra. E. coli strain M15/rep4, 
containing multiple copies of the plasmid pREP4, which expresses the lac repressor and 
confers kanamycin resistance ("Kanr"), was used in carrying out the illustrative example 
described herein. This strain, which was only one of many that are suitable for expressing 
an E. faecalis polypeptide, is available commercially (QIAGEN, Inc., supra). 
Transformants were identified by their ability to grow on LB agar plates in the presence 
of ampicillin and kanamycin. Plasmid DNA was isolated from resistant colonies and the 
identity of the cloned DNA confirmed by restriction analysis, PCR and DNA sequencing. 

Clones containing the desired constructs were grown overnight ("O/N") in liquid 
culture in LB media supplemented with both ampicillin (100 µg/ml) and kanamycin (25 
µg/ml). The O/N culture was used to inoculate a large culture, at a dilution of 
approximately 1:25 to 1:250. The cells were grown to an optical density at 600 nm 
("OD600") of between 0.4 and 0.6. Isopropyl-β-D-thiogalactopyranoside ("IPTG") was 
then added to a final concentration of 1 mM to induce transcription from the lac 
repressor sensitive promoter, by inactivating the lacI repressor. Cells subsequently were 
incubated further for 3 to 4 hours. Cells then were harvested by centrifugation. 

The cells were then stirred for 3-4 hours at 4°C in 6 M guanidine-HCl, pH 8. The 
cell debris was removed by centrifugation, and the supernatant containing the E. faecalis 
polypeptide was loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin 
column (QIAGEN, Inc., supra). Proteins with a 6 X His tag bind to the Ni-NTA resin 
with high affinity and can be purified in a simple one-step procedure (for details see: The 
QIAexpressionist, 1995, QIAGEN, Inc., supra). Briefly, the supernatant was loaded onto 
the column in 6 M guanidine-HCl, pH 8, the column was first washed with 10 volumes of 
6 M guanidine-HCl, pH 8, then washed with 10 volumes of 6 M guanidine-HCl, pH 6, and 
finally the E. faecalis polypeptide was eluted with 6 M guanidine-HCl, pH 5. 

The purified protein was then renatured by dialyzing it against phosphate-buffered 
saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the 
protein could be successfully refolded while immobilized on the Ni-NTA column. The 
recommended conditions are as follows: renature using a linear 6M-1M urea gradient in 
500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors. 
The renaturation should be performed over a period of 1.5 hours or more. After 
renaturation the proteins can be eluted by the addition of 250 mM imidazole. 
Imidazole was removed by a final dialyzing step against PBS or 50 mM sodium acetate 
pH 6 buffer plus 200 mM NaCl. The purified protein was stored at 4°C or frozen at 
-80°C. 

Some of the polypeptides of the present invention were prepared using a 
non-denaturing protein purification method. For these polypeptides, the cell pellet from 
each liter of culture was resuspended in 25 ml of Lysis Buffer A at 4°C (Lysis Buffer A = 
50 mM Na-phosphate, 300 mM NaCl, 10 mM 2-mercaptoethanol, 10% Glycerol, pH 7.5 
with 1 tablet of Complete EDTA-free protease inhibitor cocktail (Boehringer Mannheim 
#1873580) per 50 ml of buffer). Absorbance at 550 nm was approximately 10-20 
O.D./ml. The suspension was then put through three freeze/thaw cycles from -70°C 
(using an ethanol-dry ice bath) up to room temperature. The cells were lysed via 
sonication in short 10 sec bursts over 3 minutes at approximately 80W while kept on ice. 
The sonicated sample was then centrifuged at 15,000 RPM for 30 minutes at 4°C. The 
supernatant was passed through a column containing 1.0 ml of CL-4B resin to pre-clear 
the sample of any proteins that may bind to agarose non-specifically, and the 
flow-through fraction was collected. 

The pre-cleared flow-through was applied to a nickel-nitrilo-tri-acetic acid 
("Ni-NTA") affinity resin column (QIAGEN, Inc., supra). Proteins with a 6 X His tag bind 
to the Ni-NTA resin with high affinity and can be purified in a simple one-step procedure. 
Briefly, the supernatant was loaded onto the column in Lysis Buffer A at 4°C, and the 
column was first washed with 10 volumes of Lysis Buffer A until the A280 of the eluate 
returned to the baseline. Then, the column was washed with 5 volumes of 40 mM Imidazole 
(92% Lysis Buffer A / 8% Buffer B) (Buffer B = 50 mM Na-Phosphate, 300 mM NaCl, 10% 
Glycerol, 10 mM 2-mercaptoethanol, 500 mM Imidazole, pH of the final buffer should be 
7.5). The protein was eluted off of the column with a series of increasing Imidazole 
solutions made by adjusting the ratios of Lysis Buffer A to Buffer B. Three different 
concentrations were used: 3 volumes of 75 mM Imidazole, 3 volumes of 150 mM 
Imidazole, 5 volumes of 500 mM Imidazole. The fractions containing the purified 
protein were analyzed using 8%, 10% or 14% SDS-PAGE depending on the protein size. 
The purified protein was then dialyzed 2X against phosphate-buffered saline (PBS) in 
order to place it into an easily workable buffer. The purified protein was stored at 4°C or 
frozen at -80°C. 
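
The imidazole steps are made by blending Lysis Buffer A (no imidazole) with Buffer B (500 mM imidazole), so the required proportion of Buffer B follows from a linear mixing calculation. The sketch below reproduces the 92% / 8% ratio quoted for the 40 mM wash and applies the same rule to the elution steps; the function names and the 30 ml example volume are hypothetical.

```python
def buffer_b_fraction(target_mm, buffer_b_mm=500.0, buffer_a_mm=0.0):
    """Fraction of Buffer B (0.0 to 1.0) needed to reach the target imidazole concentration."""
    if not buffer_a_mm <= target_mm <= buffer_b_mm:
        raise ValueError("target must lie between the two buffer concentrations")
    return (target_mm - buffer_a_mm) / (buffer_b_mm - buffer_a_mm)


def mix_volumes(target_mm, total_ml, buffer_b_mm=500.0, buffer_a_mm=0.0):
    """Return (ml of Lysis Buffer A, ml of Buffer B) for the requested total volume."""
    fraction_b = buffer_b_fraction(target_mm, buffer_b_mm, buffer_a_mm)
    return total_ml * (1.0 - fraction_b), total_ml * fraction_b


if __name__ == "__main__":
    for target in (40, 75, 150, 500):
        percent_b = 100.0 * buffer_b_fraction(target)
        print(f"{target} mM imidazole: {100.0 - percent_b:.0f}% Buffer A / {percent_b:.0f}% Buffer B")
    print(mix_volumes(75, 30.0))   # hypothetical 30 ml batch
```

Because the two buffers are otherwise identical in composition, blending them changes only the imidazole concentration.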

The following alternative method may be used to purify E. faecalis polypeptides 
expressed in E. coli when they are present in the form of inclusion bodies. Unless 
otherwise specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10°C and the cells are harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate amount 
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 mM 
EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a high shear 
mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfluidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is 
then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5 M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine 
hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the 
pellet is discarded and the E. faecalis polypeptide-containing supernatant is incubated at 
4°C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by 
vigorous stirring. The refolded diluted protein solution is kept at 4°C without mixing for 
12 hours prior to further purification steps. 

To clarify the refolded E. faecalis polypeptide solution, a previously prepared 
tangential filtration unit equipped with 0.16 µm membrane filter with appropriate surface 
area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted with 
250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a stepwise 
manner. The absorbance at 280 nm of the effluent is continuously monitored. Fractions 
are collected and further analyzed by SDS-PAGE. 

Fractions containing the E. faecalis polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of 
tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion 
(Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated 
with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium 
acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column 
volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 
M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected under constant A280 
monitoring of the effluent. Fractions containing the E. faecalis polypeptide (determined, 
for instance, by 16% SDS-PAGE) are then pooled. 

The resultant E. faecalis polypeptide exhibits greater than 95% purity after the 
above refolding and purification steps. No major contaminant bands are observed from 
Coomassie blue stained 16% SDS-PAGE gel when 5 µg of purified protein is loaded. The 
purified protein is also tested for endotoxin/LPS contamination, and typically the LPS 
content is less than 0.1 ng/ml according to LAL assays. 

6(b). Alternative Expression and Purification of Enterococcal 
polypeptides in E. coli 

The vector pQE10 was alternatively used to clone and express some of the 
polypeptides of the present invention for use in the soft tissue and systemic infection 
models discussed below. The difference is that an inserted DNA fragment 
encoding a polypeptide expresses that polypeptide with the six His residues (i.e., a "6 X 
His tag") covalently linked to the amino terminus of that polypeptide. The bacterial 
expression vector pQE10 (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311) 
was used in this example. The components of the pQE10 plasmid are arranged such that 
the inserted DNA sequence encoding a polypeptide of the present invention expresses the 
polypeptide with the six His residues (i.e., a "6 X His tag") covalently linked to the 
amino terminus. 

The DNA sequences encoding the desired portions of a polypeptide of SEQ ID 
NOS:1-982 were amplified using PCR oligonucleotide primers from genomic E. faecalis 
DNA. The PCR primers anneal to the nucleotide sequences encoding the desired amino 
acid sequence of a polypeptide of the present invention. Additional nucleotides 
containing restriction sites to facilitate cloning in the pQE10 vector were added to the 5' 
and 3' primer sequences, respectively. 

For cloning a polypeptide of the present invention, the 5' and 3' primers were 
selected to amplify their respective nucleotide coding sequences. One of ordinary skill in 
the art would appreciate that the point in the protein coding sequence where the 5' and 3' 
primers begin may be varied to amplify a DNA segment encoding any desired portion of 
a polypeptide of the present invention. The 5' primer was designed so the coding 
sequence of the 6 X His tag is aligned with the restriction site so as to maintain its reading 
frame with that of the E. faecalis polypeptide. The 3' primer was designed to include a stop codon. 
The amplified DNA fragment was then cloned, and the protein expressed, as described 
above for the pQE60 plasmid. 

The DNA sequences encoding the amino acid sequences of SEQ ID NOS:1-982 
may also be cloned and expressed as fusion proteins by a protocol similar to that described 
directly above, wherein the pET-32b(+) vector (Novagen, 601 Science Drive, Madison, 
WI 53711) is preferentially used in place of pQE10. 

The above methods are not limited to the polypeptide fragments actually 
produced. The above method, like the methods below, can be used to produce either full 
length polypeptides or desired fragments thereof. 



6(c). Alternative Expression and Purification of Enterococcal 
polypeptides in E. coli 

The bacterial expression vector pQE60 is used for bacterial expression in this 
example (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311). However, in this 
example, the polypeptide coding sequence is inserted such that translation of the six His 
codons is prevented and, therefore, the polypeptide is produced with no 6 X His tag. 

The DNA sequence encoding the desired portion of the E. faecalis amino acid 
sequence is amplified from an E. faecalis genomic DNA prep or the deposited DNA clones 
using PCR oligonucleotide primers which anneal to the 5' and 3' nucleotide sequences 
corresponding to the desired portion of the E. faecalis polypeptides. Additional 
nucleotides containing restriction sites to facilitate cloning in the pQE60 vector are added 
to the 5' and 3' primer sequences. 

For cloning an E. faecalis polypeptide of the present invention, 5' and 3' primers 
are selected to amplify their respective nucleotide coding sequences. One of ordinary skill 
in the art would appreciate that the point in the protein coding sequence where the 5' and 
3' primers begin may be varied to amplify a DNA segment encoding any desired portion 
of a polypeptide of the present invention. The 5' and 3' primers contain appropriate 
restriction sites followed by nucleotides complementary to the 5' and 3' ends of the 
coding sequence, respectively. The 3' primer is additionally designed to include an 
in-frame stop codon. 
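
The primer layout just described can be sketched, purely for illustration, as follows. The function names, the 18-nucleotide annealing length, the TAA stop codon, the short coding sequence, and the BamHI (GGATCC) and HindIII (AAGCTT) recognition sequences are assumptions chosen for the example. They are not the sequences or sites used in the procedures herein, and a practical primer would normally also carry a few extra 5' bases and be checked for melting temperature.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def design_primers(coding_seq, site_5, site_3, anneal_len=18, stop_codon="TAA"):
    """Return (forward, reverse) primers written 5'->3'.

    forward: restriction site followed by the first `anneal_len` coding bases.
    reverse: restriction site followed by the reverse complement of the last
             `anneal_len` coding bases plus an in-frame stop codon.
    """
    if len(coding_seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    if len(coding_seq) < anneal_len:
        raise ValueError("coding sequence is shorter than the annealing length")
    forward = site_5 + coding_seq[:anneal_len]
    reverse = site_3 + reverse_complement(coding_seq[-anneal_len:] + stop_codon)
    return forward, reverse


# Hypothetical 27-nt coding region and placeholder restriction sites.
fwd, rev = design_primers("ATGAAACCCGGGTTTAAACCCGGGTTT", "GGATCC", "AAGCTT", anneal_len=9)
print(fwd)
print(rev)
```

Varying where `coding_seq` begins and ends corresponds to amplifying a different portion of the polypeptide coding sequence.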

The amplified E. faecalis DNA fragments and the vector pQE60 are digested with 
restriction enzymes recognizing the sites in the primers and the digested DNAs are then 
ligated together. Insertion of the E. faecalis DNA into the restricted pQE60 vector 
places the E. faecalis protein coding region, including its associated stop codon, 
downstream from the IPTG-inducible promoter and in-frame with an initiating AUG. 
The associated stop codon prevents translation of the six histidine codons downstream of 
the insertion point. 
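
The effect of the in-frame stop codon can be illustrated with a minimal reading-frame check. The sequences below are hypothetical placeholders, not sequences of the invention or of the pQE60 vector, and the helper name `codons_before_stop` is likewise an assumption made for the example.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons_before_stop(orf):
    """Return the codons read from the first base up to, but not including, the first in-frame stop."""
    orf = orf.upper()
    codons = []
    for i in range(0, len(orf) - len(orf) % 3, 3):
        codon = orf[i:i + 3]
        if codon in STOP_CODONS:
            break
        codons.append(codon)
    return codons


insert = "ATGAAACCC"   # hypothetical coding region beginning at the initiating ATG
his_codons = "CAT" * 6  # six histidine codons downstream of the insertion point

# Insert carries its own stop codon: the histidine codons are never translated.
print(codons_before_stop(insert + "TAA" + his_codons))

# Without a stop codon in the insert, translation reads through the histidine codons.
print(codons_before_stop(insert + his_codons + "TAA"))
```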

The ligation mixture is transformed into competent E. coli cells using standard 
procedures such as those described by Sambrook et al. E. coli strain M15/rep4, containing 
multiple copies of the plasmid pREP4, which expresses the lac repressor and confers 
kanamycin resistance ("Kanr"), is used in carrying out the illustrative example described 
herein. This strain, which is only one of many that are suitable for expressing E. faecalis 
polypeptide, is available commercially (QIAGEN, Inc., supra). Transformants are 
identified by their ability to grow on LB plates in the presence of ampicillin and 
kanamycin. Plasmid DNA is isolated from resistant colonies and the identity of the 
cloned DNA confirmed by restriction analysis, PCR and DNA sequencing. 

Clones containing the desired constructs are grown overnight ("O/N") in liquid 
culture in LB media supplemented with both ampicillin (100 µg/ml) and kanamycin (25 
µg/ml). The O/N culture is used to inoculate a large culture, at a dilution of approximately 
1:25 to 1:250. The cells are grown to an optical density at 600 nm ("OD600") of 
between 0.4 and 0.6. Isopropyl-β-D-thiogalactopyranoside ("IPTG") is then added to a 
final concentration of 1 mM to induce transcription from the lac repressor sensitive 
promoter, by inactivating the lacI repressor. Cells subsequently are incubated further for 
3 to 4 hours. Cells then are harvested by centrifugation. 
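
As a worked illustration of the dilution and induction figures above, the volumes involved can be computed as follows. The 1 liter culture volume and the 1 M IPTG stock are assumptions made only for the example, the function names are hypothetical, and a "1:N" dilution is taken here to mean one part inoculum in N parts total.

```python
def inoculum_volume_ml(culture_volume_ml, dilution):
    """Volume of overnight culture for a 1:`dilution` inoculation (one part in `dilution` parts total)."""
    return culture_volume_ml / dilution


def stock_volume_ml(final_conc_mM, culture_volume_ml, stock_conc_mM):
    """Stock volume needed to reach `final_conc_mM` (C1*V1 = C2*V2, ignoring the small added volume)."""
    return final_conc_mM * culture_volume_ml / stock_conc_mM


culture_ml = 1000.0  # assumed culture volume
print("Inoculum at 1:25 (ml):", inoculum_volume_ml(culture_ml, 25))
print("Inoculum at 1:250 (ml):", inoculum_volume_ml(culture_ml, 250))
print("1 M IPTG stock for 1 mM final (ml):", stock_volume_ml(1.0, culture_ml, 1000.0))
```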

To purify the E. faecalis polypeptide, the cells are then stirred for 3-4 hours at 
4°C in 6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the 
supernatant containing the E. faecalis polypeptide is dialyzed against 50 mM Na-acetate 
buffer pH 6, supplemented with 200 mM NaCl. Alternatively, the protein can be 
successfully refolded by dialyzing it against 500 mM NaCl, 20% glycerol, 25 mM Tris/HCl 
pH 7.4, containing protease inhibitors. After renaturation the protein can be purified by 
ion exchange, hydrophobic interaction and size exclusion chromatography. 
Alternatively, an affinity chromatography step such as an antibody column can be used to 
obtain pure E. faecalis polypeptide. The purified protein is stored at 4°C or frozen at 
-80°C. 

The following alternative method may be used to purify E. faecalis polypeptides 
expressed in E. coli when they are present in the form of inclusion bodies. Unless otherwise 
specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10°C and the cells are harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate amount 
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 mM 
EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a high shear 
mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfluidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is 
then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine 
hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the 
pellet is discarded and the E. faecalis polypeptide-containing supernatant is incubated at 
4°C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by 
vigorous stirring. The refolded diluted protein solution is kept at 4°C without mixing for 
12 hours prior to further purification steps. 

To clarify the refolded E. faecalis polypeptide solution, a previously prepared 
tangential filtration unit equipped with a 0.16 µm membrane filter with appropriate surface 
area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0, is employed. The 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted with 
250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a stepwise 
manner. The absorbance at 280 nm of the effluent is continuously monitored. Fractions 
are collected and further analyzed by SDS-PAGE. 

Fractions containing the E. faecalis polypeptide are then pooled and mixed with 4 
volumes of water. The diluted sample is then loaded onto a previously prepared set of 
tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak cation 
(Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated 
with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium 
acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column 
volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 
M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected under constant A280 
monitoring of the effluent. Fractions containing the E. faecalis polypeptide (determined, 
for instance, by 16% SDS-PAGE) are then pooled. 
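
For illustration, the nominal NaCl concentration at any point in the 10 column volume linear gradient described above can be computed as shown below. The function name is hypothetical, and the sketch assumes an ideal linear gradient with no mixing delay or system dead volume.

```python
def gradient_nacl_molar(column_volumes, start=0.2, end=1.0, length_cv=10.0):
    """Nominal NaCl concentration (M) after `column_volumes` of an ideal linear gradient."""
    if not 0 <= column_volumes <= length_cv:
        raise ValueError("column_volumes must lie within the gradient")
    return start + (end - start) * column_volumes / length_cv


# Nominal concentration at every second column volume.
for cv in range(0, 11, 2):
    print(f"{cv:2d} CV: {gradient_nacl_molar(cv):.2f} M NaCl")
```

A fraction's elution position in column volumes can thus be related to the approximate salt concentration at which the polypeptide elutes.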

The resultant E. faecalis polypeptide exhibits greater than 95% purity after the 
above refolding and purification steps. No major contaminant bands are observed from 
Coomassie blue stained 16% SDS-PAGE gel when 5 µg of purified protein is loaded. The 
purified protein is also tested for endotoxin/LPS contamination, and typically the LPS 
content is less than 0.1 ng/ml according to LAL assays. 






6(d). Cloning and Expression of E. faecalis in Other Bacteria 

E. faecalis polypeptides can also be produced in: E. faecalis using the methods of 
S. Skinner et al., (1988) Mol. Microbiol. 2:289-297 or J. I. Moreno (1996) Protein Expr. 
Purif. 8(3):332-340; Lactobacillus using the methods of C. Rush et al., 1997 Appl. 
Microbiol. Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods of Chang et 
al., U.S. Patent No. 4,952,508. 

7. Cloning and Expression in COS Cells 

An E. faecalis expression plasmid is made by cloning a portion of the DNA 
encoding an E. faecalis polypeptide into the expression vector pDNAI/Amp or pDNAIII 
(which can be obtained from Invitrogen, Inc.). The expression vector pDNAI/Amp 
contains: (1) an E. coli origin of replication effective for propagation in E. coli and other 
prokaryotic cells; (2) an ampicillin resistance gene for selection of plasmid-containing 
prokaryotic cells; (3) an SV40 origin of replication for propagation in eukaryotic cells; 
(4) a CMV promoter, a polylinker, an SV40 intron; (5) several codons encoding a 
hemagglutinin fragment (i.e., an "HA" tag to facilitate purification) followed by a 
termination codon and polyadenylation signal arranged so that a DNA can be 
conveniently placed under expression control of the CMV promoter and operably linked 
to the SV40 intron and the polyadenylation signal by means of restriction sites in the 
polylinker. The HA tag corresponds to an epitope derived from the influenza 
hemagglutinin protein described by Wilson et al. 1984 Cell 37:767. The fusion of the HA 
tag to the target protein allows easy detection and recovery of the recombinant protein 
with an antibody that recognizes the HA epitope. pDNAIII contains, in addition, the 
selectable neomycin marker. 

A DNA fragment encoding an E. faecalis polypeptide is cloned into the polylinker 
region of the vector so that recombinant protein expression is directed by the CMV 
promoter. The plasmid construction strategy is as follows. The DNA from an E. faecalis 
genomic DNA prep is amplified using primers that contain convenient restriction sites, 
much as described above for construction of vectors for expression of E. faecalis in 
E. coli. The 5' primer contains a Kozak sequence, an AUG start codon, and nucleotides 
of the 5' coding region of the E. faecalis polypeptide. The 3' primer contains 
nucleotides complementary to the 3' coding sequence of the E. faecalis DNA, a stop 
codon, and a convenient restriction site. 

The PCR amplified DNA fragment and the vector, pDNAI/Amp, are digested with 
appropriate restriction enzymes and then ligated. The ligation mixture is transformed 
into an appropriate E. coli strain such as SURE™ (Stratagene Cloning Systems, La Jolla, 
CA 92037), and the transformed culture is plated on ampicillin media plates which then 
are incubated to allow growth of ampicillin resistant colonies. Plasmid DNA is isolated 
from resistant colonies and examined by restriction analysis or other means for the 
presence of the fragment encoding the E. faecalis polypeptide. 

For expression of a recombinant E. faecalis polypeptide, COS cells are transfected 
with an expression vector, as described above, using DEAE-dextran, as described, for 
instance, by Sambrook et al. (supra). Cells are incubated under conditions for expression 
of E. faecalis by the vector. 

Expression of the E. faecalis-HA fusion protein is detected by radiolabeling and 
immunoprecipitation, using methods described in, for example, Harlow et al., supra. To 
this end, two days after transfection, the cells are labeled by incubation in media 
containing 35S-cysteine for 8 hours. The cells and the media are collected, and the cells 
are washed and then lysed with detergent-containing RIPA buffer: 150 mM NaCl, 1% 
NP-40, 0.1% SDS, 0.5% DOC, 50 mM TRIS, pH 7.5, as described by Wilson et al. 
(supra). Proteins are precipitated from the cell lysate and from the culture media using 
an HA-specific monoclonal antibody. The precipitated proteins then are analyzed by 
SDS-PAGE and autoradiography. An expression product of the expected size is seen in 
the cell lysate, which is not seen in negative controls. 


8. Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of E. faecalis polypeptide in this 
example. Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession No. 
37146). The plasmid contains the mouse DHFR gene under control of the SV40 early 
promoter. Chinese hamster ovary cells or other cells lacking dihydrofolate reductase activity that 
are transfected with these plasmids can be selected by growing the cells in a selective 
medium (alpha minus MEM, Life Technologies) supplemented with the chemotherapeutic 
agent methotrexate. The amplification of the DHFR genes in cells resistant to 
methotrexate (MTX) has been well documented. See, e.g., Alt et al., 1978, J. Biol. 
Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys. Acta, 1097:107-143; 
Page et al., 1991, Biotechnology 9:64-68. Cells grown in increasing concentrations of 
MTX develop resistance to the drug by overproducing the target enzyme, DHFR, as a 
result of amplification of the DHFR gene. If a second gene is linked to the DHFR gene, it 
is usually co-amplified and over-expressed. It is known in the art that this approach may 
be used to develop cell lines carrying more than 1,000 copies of the amplified gene(s). 
Subsequently, when the methotrexate is withdrawn, cell lines are obtained which contain 
the amplified gene integrated into one or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) of 
the Rous Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al. (1985) 
Mol. Cell. Biol. 5:438-447; plus a fragment isolated from the enhancer of the immediate 
early gene of human cytomegalovirus (CMV), Boshart, et al., 1985, Cell 41:521-530. 
Downstream of the promoter are the following single restriction enzyme cleavage sites 



WO 98/50555 PCT/US98/08985 


that allow the integration of the genes: Bam HI, Xba I, and Asp 718. Behind these 
cloning sites the plasmid contains the 3' intron and polyadenylation site of the rat 
preproinsulin gene. Other high efficiency promoters can also be used for the expression, 
e.g., the human β-actin promoter, the SV40 early or late promoters or the long terminal 
repeats from other retroviruses, e.g., HIV and HTLVI. Clontech's Tet-Off and Tet-On 
gene expression systems and similar systems can be used to express the E. faecalis 
polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, Proc. Natl. Acad. 
Sci. USA 89:5547-5551). For the polyadenylation of the mRNA other signals, e.g., from 
the human growth hormone, or globin genes can be used as well. Stable cell lines carrying 
a gene of interest integrated into the chromosomes can also be selected upon 
co-transfection with a selectable marker such as gpt, G418 or hygromycin. It is 
advantageous to use more than one selectable marker in the beginning, e.g., G418 plus 
methotrexate. 

The plasmid pC4 is digested with the restriction enzymes and then 
dephosphorylated using calf intestinal phosphatase by procedures known in the art. The 
vector is then isolated from a 1% agarose gel. The DNA sequence encoding the E. 
faecalis polypeptide is amplified using PCR oligonucleotide primers corresponding to the 
5' and 3' sequences of the desired portion of the gene. A 5' primer containing a 
restriction site, a Kozak sequence, an AUG start codon, and nucleotides of the 5' coding 
region of the E. faecalis polypeptide is synthesized and used. A 3' primer, containing a 
restriction site, stop codon, and nucleotides complementary to the 3' coding sequence of 
the E. faecalis polypeptides is synthesized and used. The amplified fragment is digested 
with the restriction endonucleases and then purified again on a 1% agarose gel. The 
isolated fragment and the dephosphorylated vector are then ligated with T4 DNA ligase. 
E. coli HB101 or XL-1 Blue cells are then transformed and bacteria are identified that 
contain the fragment inserted into plasmid pC4 using, for instance, restriction enzyme 
analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for 
transfection. Five µg of the expression plasmid pC4 is cotransfected with 0.5 µg of the 
plasmid pSV2-neo using a lipid-mediated transfection agent such as Lipofectin™ or 
LipofectAMINE™ (Life Technologies, Gaithersburg, MD). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme that 
confers resistance to a group of antibiotics including G418. The cells are seeded in alpha 
minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are trypsinized and 
seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus MEM 
supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418. After about 
10-14 days single clones are trypsinized and then seeded in 6-well petri dishes or 10 ml 
flasks using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 
nM, 800 nM). Clones growing at the highest concentrations of methotrexate are then 
transferred to new 6-well plates containing even higher concentrations of methotrexate 
(1 µM, 2 µM, 5 µM, 10 µM, 20 µM). The same procedure is repeated until clones are 
obtained which grow at a concentration of 100-200 µM. Expression of the desired gene 
product is analyzed, for instance, by SDS-PAGE and Western blot or by reversed phase 
HPLC analysis. 

9. Quantitative Murine Soft Tissue Infection Model for 
E. faecalis 

Compositions of the present invention, including polypeptides and peptides, are 
assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E. faecalis) using the following quantitative murine 
soft tissue infection model. Mice (e.g., NIH Swiss female mice, approximately 7 weeks 
old) are first treated with a biologically protective effective amount, or immune 
enhancing/stimulating effective amount of a composition of the present invention using 
methods known in the art, such as those discussed above. See, e.g., Harlow et al., 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd 
ed. 1988). An example of an appropriate starting dose is 20 µg per animal. 

The desired bacterial species used to challenge the mice, such as E. faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10^8 
cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diluted 1:2 with sterilized Cytodex 3 microcarrier beads preswollen in 
sterile PBS (3 g/100 ml). Mice are anesthetized briefly until docile, but still mobile, and 
injected with 0.2 ml of the Cytodex 3 bead/bacterial mixture into each animal 
subcutaneously in the inguinal region. After four days, counting the day of injection as 
day one, mice are sacrificed and the contents of the abscess are excised and placed in a 15 
ml conical tube containing 1.0 ml of sterile PBS. The contents of the abscess are then 
enzymatically treated and plated as follows. 

The abscess is first disrupted by vortexing with sterilized glass beads placed in the 
tubes. 3.0 ml of prepared enzyme mixture (1.0 ml Collagenase D (4.0 mg/ml), 1.0 ml 
Trypsin (6.0 mg/ml) and 8.0 ml PBS) is then added to each tube followed by a 20 min. 
incubation at 37°C. The solution is then centrifuged and the supernatant drawn off. 0.5 
ml dH2O is then added and the tubes are vortexed and then incubated for 10 min. at room 
temperature. 0.5 ml media is then added and samples are serially diluted and plated onto 
agar plates, and grown overnight at 37°C. Plates with distinct and separate colonies are 
then counted, compared to positive and negative control samples, and quantified. The 






method can be used to identify compositions and determine appropriate and effective 
doses for humans and other animals by comparing the effective doses of compositions of 
the present invention with compositions known in the art to be effective in both mice 
and humans. Doses for the effective treatment of humans and other animals, using 
compositions of the present invention, are extrapolated using the data from the above 
experiments in mice. It is appreciated that further studies in humans and other animals 
may be needed to determine the most effective doses using methods of clinical practice 
known in the art. 
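
The quantification of plate counts in this model can be illustrated with the following sketch. The function names, the colony count, the dilution factor, the plated volume and the control and treated values are all hypothetical and serve only to show the arithmetic.

```python
def cfu_per_ml(colony_count, dilution_factor, plated_volume_ml):
    """Colony forming units per ml of the undiluted sample.

    `dilution_factor` is the total fold dilution of the plated sample,
    e.g. 1000 for a 10^-3 dilution.
    """
    if plated_volume_ml <= 0:
        raise ValueError("plated_volume_ml must be positive")
    return colony_count * dilution_factor / plated_volume_ml


def percent_reduction(control_cfu, treated_cfu):
    """Percent reduction in recovered bacteria relative to the control group."""
    return 100.0 * (control_cfu - treated_cfu) / control_cfu


# Hypothetical plate: 50 colonies from 0.1 ml of a 10^-3 dilution.
print(cfu_per_ml(50, 1000, 0.1))
# Hypothetical comparison of a treated group against a control group.
print(percent_reduction(1000.0, 100.0))
```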

10. Murine Systemic Neutropenic Model for E. faecalis Infection 

Compositions of the present invention, including polypeptides and peptides, are 
assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E. faecalis) using the following qualitative murine 
systemic neutropenic model. Mice (e.g., NIH Swiss female mice, approximately 7 weeks 
old) are first treated with a biologically protective effective amount, or immune 
enhancing/stimulating effective amount of a composition of the present invention using 
methods known in the art, such as those discussed above. See, e.g., Harlow et al., 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd 
ed. 1988). An example of an appropriate starting dose is 20 µg per animal. 

Mice are then injected with 250 - 300 mg/kg cyclophosphamide intraperitoneally. 
Counting the day of CP injection as day one, the mice are left untreated for 5 days to 
begin recovery of PMNLs. 
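
As an illustration of the dosing arithmetic only, a weight-based dose can be converted to an amount per animal as follows. The 25 g body weight is an assumed example value, not a figure from this disclosure, and the function name is hypothetical.

```python
def dose_mg(dose_mg_per_kg, body_weight_g):
    """Amount of compound (mg) for one animal, given a mg/kg dose and body weight in grams."""
    return dose_mg_per_kg * body_weight_g / 1000.0


assumed_weight_g = 25.0  # assumption for illustration only
for mg_per_kg in (250, 300):
    print(f"{mg_per_kg} mg/kg -> {dose_mg(mg_per_kg, assumed_weight_g):.2f} mg per animal")
```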

The desired bacterial species used to challenge the mice, such as E. faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10^8 
cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diluted 1:2 in 4% Brewer's yeast in media. 

Mice are injected with the bacteria/Brewer's yeast challenge intraperitoneally. The 
Brewer's yeast solution alone is used as a control. The mice are then monitored twice 
daily for the first week following challenge, and once a day for the next week to ascertain 
morbidity and mortality. Mice remaining at the end of the experiment are sacrificed. 

The method can be used to identify compositions and determine appropriate and 
effective doses for humans and other animals by comparing the effective doses of 
compositions of the present invention with compositions known in the art to be 
effective in both mice and humans. Doses for the effective treatment of humans and 
other animals, using compositions of the present invention, are extrapolated using the 
data from the above experiments in mice. It is appreciated that further studies in humans 
and other animals may be needed to determine the most effective doses using methods of 
clinical practice known in the art. 

The disclosures of all publications (including patents, patent applications, journal 
articles, laboratory manuals, books, or other documents) cited herein are hereby 
incorporated by reference in their entireties. 

The present invention is not to be limited in scope by the specific embodiments 
described herein, which are intended as single illustrations of individual aspects of the 
invention. Functionally equivalent methods and components are within the scope of the 
invention, in addition to those shown and described herein, and will become apparent to 
those skilled in the art from the foregoing description and accompanying drawings. Such 
modifications are intended to fall within the scope of the appended claims. 

SEQUENCE LISTING PLACE INDICATOR 



PAGES 280 TO 2076, WHICH ARE THE COMPLETE SEQUENCE 
LISTINGS FOR THIS APPLICATION ARE LOCATED AFTER THE 
DESCRIPTION, CLAIMS, ABSTRACT & DRAWINGS. 

(i) APPLICANT: Charles Kunsch 

Patrick J. Dillon 
Steven C. Barash 

(ii) TITLE OF INVENTION: Enterococcus faecalis Polynucleotides and 

Polypeptides 

(iii) NUMBER OF SEQUENCES: 982 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 

(B) STREET: 9410 Key West Avenue 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) ZIP : 20850 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

(B) COMPUTER: HP Vectra 486/33 

(C) OPERATING SYSTEM: MSDOS version 6.2 

(D) SOFTWARE: ASCII Text 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: herewith 
(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY /AGENT INFORMATION : 

(B) REGISTRATION NUMBER: 36,373 

(C) REFERENCE/DOCKET NUMBER: PB369PCT 



(vi) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: (301) 309-8512 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AAGTCAATCA CTTGCAAGTC GTTTTCTGTC ATATGGCCGA CTAGCTCAGT GACACCAGGA 

ATATAGCCAC CAGGGAAAAT ATAACGATTA ATCCAAGCAT TTTTAGCCCC ACCTTGTTGG 

CGACTGATCC CATGAATCAA CGCCGTACCT TTAGGCGCTA AATTCCGCTG AACGACATCA 

AAATATTCAT GTAGATTTTC CGCACCGACA TGTTCAAACA TCCCAACACT CGTAATATGG 

TCAAAAGACT CTCCTTTTAA ATCACGATAA TCCATCAATT TGACAGTCAT TCGATCTTGT 

AGATCTTCTT TTTCTATAAT ATGGCGAATA TGATGAAATT GCTCTTCACT TAATGTAATC 

CCAGTTGCTT TGGCTCCATA TTCTTTCACC GCAGTTAAAA TTAACGTGCC CCAGCCGCAG 

CCAATATCCA GTAAAGTGTC GCCCTCTTTG ATAAACAATT TATCTAAAAT ATGATGAACT 

TTATTCACTT GCGCTTGTTC TAATGTATCT TCAGGCGTTT TAAAATAAGC ACATGAATAC 

GTCATTGTTT GGTCAAGCCA TTTTTTGTAA AAATCATTTC CTAGATCGTA ATGGCTGTGA 

ATATCCTCTT GCGAACGTTT TTTTGAATGA CTTTCTTTAG GAAGCCATTT AATAAATTTA 

GCATTGTGTA AAAAGCTATC CTTTTGGTTA TACACATCAT AAATCAGsGC TTGGATATCG 

CCTTCGATTT CAATTTTGCG ATCCATGTAG GCTTCCCCTA AAGTTAACGA AGCGTTATTC 

AGTAAATCCT TCACAGGAAT TTTTTCATTG AATACAATTT TAAAAACCGG ATCCCCCGAC 
CCTTGCCCAT ACTCTTTGAC GGTACCATCC CAGTATGTGA CTTGTGTCTT TTTTGAAAAA 

GACCATTTAA ACAGTTGACT GTACGTTTCT TTTTCTAACA TTGCATTCCC TCCATTAAAT 

ACCATTTGAA GGCAAAACAA «™»™ ma ° H> 

ostoconc taaactoaag cacagaaaag ™t«cccc t T c T a T gttc «n«m 

TICTGCAATTa' CAGTTCTATT GTACTCCTCT TTTAAAAATT TGAACATTCT T1TAACGTAA 
TACCTACTAT TGTTATTCTT TATCACAAAA AAACtAGAGC CAGTCCTTGA CAGACTCCTC 
t «~CT»* TATTATGCTT TCTTACGCAT CCGTTGTTCC GCATGAGTGT AAGCGCCATG 
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CCATACGTGC 


CCCACATAAG 


GATTAACTTG 


AATACCGTGT 


TTAATCGCCG 


CTGCTACAAA ■ 


1320 


TTTTTTTGCT 


AAAGTTACTG 


CTTCTAACAC 


CGAATAACCT 


TTCGCCAAGC 


CAGCTGTGAT 


1380 


TGCCGCTGAA 


AAAGTACAAC 


CTGCACCATG 


ATTATAATCA 


G TAG GAT AT A 


ATTCATTTTC 


1440 


CAAAAGATGC 


GCGGTGTGAC 


CATCGTAAAA 


CAAGTCCAGT 


GCTTTTTCAC 


CAGCTAAGCG 


1500 


ATGTCCCCCT 


TTAACCACGA 


CATGCTTGGC 


TCCCATTTGT 


ACAATTCGTT 


TTGCCGCTTC 


1560 


TTCCATCTCC 


GCCACGGAAG 


AAATTTCGCC 


TAAACCAGAT 


AAGATGCCCG 


CTTCAATTAA 


1620 


ATTAGGCGTG 


GCCACTAATG 


CTAATGGCAG 


TAAATCGTTT 


TTTAGGCCTT 


CCACACTTTT 


1680 


GGGTTGCAGA 


ATTTGTGCCG 


TTCCCTTACA 


AGCAATGACT 


GGGTCAATCA 


CGACTTTTTG 


1740 


AATTTTCTCT 


TGTTTAATGT 


ACTTACTAGC 


CATTTTAATA 


TTTTGTTCAT 


TACCCATCAT 


1800 


CCCTGTTTTC 


AAAGCCGCTA 


CTGGACCGCC 


TGCAAAAACC 


GAAATCAATT 


GTTTTTCTAA 


1860 


GAGCGTTTCT 


GGCAATTCAG 


TTACTTCATG 

X X *lw X X X \J 


TGACCAACCT 


GTCGTAGGAT 


CCATCGTCAC 


1920 


AATCGAGGTT 


AAACTTGAAA 


ATCCAAAAAC 


TCCATACTCT 


TCAAATGTTT 


TTAAATCTGC 


1980 


TTGAATCCCT 

x x unn x v» v> x 


GCCCCTCCAG 


TTGAATCGGA 


GCCTGCAATC 


GTCAATACTT 


TTTCCATTAA 


2040 


ATCACCTAAC 


CTTTTTCTCC 

X X X X X X \m* V 


AAGTATACTG 


AAGAAACAAG 


TCTGCTAAAA 


CAGCCAATTG 


2100 


GCTTATTTTT 


TAGCCAGCCA 


ATTTCTAAAC 


AAAAAAAAGA 


CCAGAGAATA 


AATTCTCTGG 


2160 


TCTTACGTCC 


GAATACCCCA 


GTTTTTTACG 


CTGGTTAAAA 


CTATAGTTAA 


AAAGTTAATT 


' 2220 


ATTTAACGAT 


TTCAGTAACA 


ACGCCTGAAC 


CTACAGTACG 


TCCGCCTTCA 


CGAATAGAGA 


2280 


AACGAGTTCC 


GTCTTCGATA 


GCGATTGGGT 


GAATTAATTC 


AACGTCCATA 


GCAACGTTAT 


2340 


CACCAGGCAT 


TACCATTTCA 


GTACCTTCTG 


GCAATTCTAC 


AACACCAGTA 


ACGTCTGTTG 


2400 


TACGGAAGTA 


GAATTGAGGA 


CGGTAGTTAG 


TGAAGAATGG 


AGTGTGACGT 


CCGCCTTCTT 


2460 


CTTTTGATAA 


TACGTATACT 


TCAGCTTkGA ATTTgGTGTG 


TGGAGTGATT 


GTAGCTGGTT 


2520


TAGCTAATAC 



TyGyCCACGT 


TCGATATCTT 


CACGTGCTAC 


ACCACGTAtA 


AAGCACCGAT 


2580 


GTTGTCGCCT 


GCTTCAGCGT 


AGTCTAATAA 


TTTACGGAAC 


ATTTCAACAC 


CTGTAACAGT 


2640 


TGTTTTAGAT 


GTTTCGTCTT 


TAATACCAAC 


GATTTCAACT 


TCGTCACCAA 


CGCGAACTTC 


2700 


ACCACGTTCA 


ACACGGCCTG 


TAGCAACAGT 


ACCACGTCCA 


GTGATTGAGA ATACgTCTTC 


2760 


GACTGGCATC 


ATGAATGGTT 


TGTCAGTATC 


ACGTTCTGGA 


GTTGGGATAT 


ATTCGTCAAC 


2820 


TGCAGCCATT 


AATTCTAAGA 


TTTTTTCTTC 


ATAAGACTCG 


TCGCCTTCTA 


AAGCTTTCAA 


2880 


AGCAGAACCT 


GCGATAACTG 


GAACATCATC 


GCCTGGGAAA 


TCGTATTCTG 


ATAATAAGTC 


2940 


ACGAACTTCC 


ATTTCTACTA 


ATTCTAATAA 


TTCTTCGTCA 


TCAACCATAT 


CCATTTTGTT 


3000 


TAAGAATACA 


ACGATGTATG 


GTACACCAAC 


GTTACGTGAT 


AATAAGATAT 


GTTCACGTGT 


3060 


TTGAGGCATA 


GGACCATCAG 


CAGCAGAAAC 


TACTAAGATA 


GCTCCGTCCA 


TTTGAGCAGC 


3120 


ACCAGTGATC 


ATGTTTTTAA 


CGTAGTCCGC 


ATGTCCTGGG 


CAGTCaACGT 


GTGCaTAGTG 


3180 


ACGAGTTwCA 


GTTTCATATT 


CGATATGAGA 


AGTGTTGATT 


GTGATTCCAC 


GTTCTTTTTC 


3240 
TTCTGGAGCG


TTATCGATAG 


AATPG'IAGGT 


TTGTGCTTCC 


CCGCCACCGT 


GTTTTGATAA 


3300 


TAPAP.TAGPA 


ATTGCAGCTG 


TTAATGTAGT 


TTTACCATGG 


TCAACGTGTC 


CGATAGTACC 


3360 


A ATP.TTA APA 


TGGGATTTAG 


AAPGGTCAAA 


TTTTTCTTTT 


GCCATTTTGA 


ATGTTCCTCC 


3420 


TZiAAAATATA 


AATTTTATTG



TTTGTTTTAT 



ATGTGATAGA


CGTTACTAAA


AATCGTCTGA 


3480 


TPZVTPAT A l\CZ 


PTATTTTAPA 


PGAAATPTTG 


AAAAAAGGAT 


AGTTCTAAAA 


ACAAACCTTT 


3540 


lni 1 1 \*£\e\\3 


P.A A AATTATP, 


PGT1-GPPGPP 


GTTTTTCTTG 


ATGATTTCTT 


CTTGAACAGA 


3600 


rtvjo 1 rtV-o 


1^11 V^rt 1 rtu 1 


GPTPAA ATAP 


PATPATAAAT 


GTACCACGTC 


CTTGTGTTGC 


3660 




PTTPTAPPAT 
ol lol nuLn 1 


A APPA A APAT 


TTPAGPTAAA 


GGTACCATCG 


CATTAACGAT 


3720 


1 1 bnbnb 1 In 


rrprrTr:rTT 



PP AT APPTTP 


AAPAPGTCCA 


CGACGACTTG 


TTACGTGTCC 


3780 


Urt 1 VJrt 1 rt 1 L.rt 


PP T Zi Zi PT A Q T 


PTTPAGGAAP 


AGTAATTGTA 



ACTTTCATCA 


TTGGTTCTAA 


3840 


An 1 AAL 1 vjuvj 


111 uLn 1111 


T APPPPPAGP 


TTTPAATGCP 


ATAGAAGCAG 


CTACACGGAA 


3900 


uuLtul i ±V-rt 


TTTPA ATPPA 


PATPGTGGTA 



TGAACCATCG 


TAAAGTTTTG 



CTTTGATATC 


3960 


AACTAATGGA 


TAACCAGCAA 


GAACACCGTT 


GTTCATAGAG 


TCTTCTAACC 


CTTTTTCAAC 


4020 


TGCTGGGaTG 


TATTCACGAG 


GAACCACACC 


ACCGACAATT 


GCGTTTTCAA 


ACTCGAAGCC 


4080 


TTTACCTTCT 


TCATTTGGTG 


TAAATTCAAC 


CCaTAcGTGA 


CCGTATTGTC 


CTTTACCACC 


4140 


AGACTGACGA 


ACGAATTTAC 


CTTCTGCTTT 


AGTCGCAGCA 


CGGAATGTTT 


CGCGGTAAGA 


4200 


TACTTGAGGA 


GCACCTACGT 


TAGCTTCAAC 


TTTGAATTCA 


CGTTTCATAC 


GGTCTACTAA 


4260 


AACGTCTAAG 


TGCAATTCAC 


CCATaCCAGA 


GATAACTGTT 


TCACCAGTTT 


CAACC 


4315 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2716 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATTGTGCCAT AGATTTGACC ATCTGCTGCT GTTGTAGTTT CCCAGTTTGT AATTTGGTAA 60 

TGTTGTAAGA nCTTAGTTGG GACATTAATG GCAAAAGTTT TTTGGCGATA GGTATTAAAA 120 

TTGGTTTTAC CCACCCAACC TTCATAGGCT TCAAATTCAG CTCCACCTAA CAAAGGGTTG 180 

TCGGAAATAA TCCAAGTGTC TTTTTGATGA TTCCGAATAA AACGAACGAT TGTACTGTTA 240

AAGCCTTTCG TTGTTGTACC GCCGTAATCT TTTTCTGCAG ACCAATGGTT CCAAACCGAG 300 

TCATACTCTA AAGCATTCGG AAATTCATTG GTTGTAAACC AACCGAGTGA ATTAATTTCT 360 

TTACTTAGTT GACGACTTGC CCAACCTGAT TCGCCTTGGT TGCCCCAAAC ATCAACATAA 420

ATATAATCTA GATTCGGTGc TTTTtGTTTT AATcnTTAAA GCGCTCATAG CGACGACCAC 480

TCAATGTAtC GGGtCTTkGT TTAATAAAAT AAGACGGATC TAACCAATCC CAGCCACGCT 540 
TCGTTGGATC


AACTAATTCC 


TCATTAAATC 


CTTTTGCTTC 


TGGGTAAGAC 


TCGGTGTCAT


600 


TAATATGCAC 


ACCGAAAACG 


GCATTTAATT 


TATGTCCTTC 


ATCAATTAAT 


TGATTCAACG 


660 


CTTGTTCCCC 


ACCAGGTCGC 


TGACCAATAG 


CACCGTAATC 


TGGATGCGCA 


GAATCATGTC 


720 


CTTCATTTTG 


ATACCCTTTT 


AGTAAATTCA 


TTTGTCCTAA 


TCCATCTGTT 


AAATTGTAAA 


780 


TACGTTTTGA 


TTCGTCTAAC 


GTCACTAAGA 


ATGGGTTTGT 


CGCCTGACTA 


GCAAAGTTAA 


840 


AAGGAATCCG 


TTGGTTGACT 


AATTCAGGGA 


CTTTTTCCGC 


ACCCATTGGG 


TTATTCATAA 


900 


TTGAACGGAA 


CCCAATTGCC 


GCATCTTGCC 


AATCCACCCG 


ATTGTCGTCG 


TTGCTATCTT 


960 


TTGAGAATTT 


AACTTTCACT 


TCTGGCGTTT 


CTCCAGTTGT 


GTAATCTTCC 


GGTGCATCAA 


1020 


ATGGTCGATA 


GGTCCATGCC 


CCACTTGACA 


AGGTTGTTAC 


AAAACCAGTC 


GCCGCTTCTT 


1080 


TTGTTTGTTT 


ATGGATTCGA 


TTGTTGTCAC 


TACCATCGAC 


AGAGCCGTCT 


CCATAGGCGT 


1140 


TCGTCCAAAA 


ACTTGCAGCA 


TAATTCGCCG 


TATTTAAGAA 


ACCATACATA 


TATTTTTTCG 


1200 


TTTCTTCTTG 


GCTTGTTGTA 


GTTAAATCTT 


GGATAGTATC 


ACCATTTTTA 


TTTCCGTTAT 


1260 


TTGCATTTGT 


CCCAGTATTC 


ATCACAACGC 


CAGCAAAAGT 


CGCCCCTTCA 


TCTGTACTAT 


1320 


TGACAGAAAT 


CAAGCCTTGA 


TTTGGAATTG 


AAATTGTGTG 


TATTTTGGTT 


CCTTCTTCAC 


1380 


GAATATCTGA 


GAGTTTCATA 


GCTAATTCTT 


GGCCTTCGGA 


AACAGTCAAT 


GTCGTTGTGA 


1440 


AACGCAAACC 


AATTTCTGGG 


ACAGCTACGT 


TATAAACGGC 


TTGaTTTGCT 


TCaCTCTTCT 


1500


GTTTTTcAGC 


AGTTGCTTTA 


tATGCTTTGC 


CAyCAAyCAT 


TACTTGGTCa 


ATTTTCTCgT 


1560 


GTTGGCCATT 


AAAAATCACT 


TGATTACTCT 


GAGGGTCTTC 


GTAGCGAATA 


ATTCGTGGAA 


1620 


ATTGTGGATC 


AACCACGGCT 


GTTAAATCTG 


CGGAAGAGAT 


TTTGTCGCCT 


TTTTCAGCTT 


1680 


CATTGTCTTT 


AGCAACAACA 


ATGCTGGCTT 


TTGCTTTGAT 


GTTTGTATTT 


TCCAAAGTAC 


1740 


CGTCGACTTC 


AAAAGTTCCT 


GGTTCTTTAT 


ATGCATCAGG 


ATCGATTTCA 


TTCCAACGGA 


1800 


CTGCTTCATT 


GGCTTCTTTA 


CCAGTATTAT 


AGGTCACTTT 


TACTTGTTTC 


GGtAATTTTG 


1860 


GTGCGACACC 


AATTTTTGTA 


AAAACTTTGA 


CTGGCGCAAT 


TTGTGTCACT 


TCTGGCACAA 


1920 


TTTCTGGAAT 


GCTATCTACG 


GGGCCATTTT 


TAAAATAGTC 


ATAATTGACA 


ATTTTTTTGT 


1980 


CGTACCACAA 


ACGGACACCC 


ACATGTCCGG 


CTTCTGTTGG 


AATTTTGTCT 


CCATTGGCCA 


2040 


AAACAGGTTC 


TCCTTCGTAA 


ATCAACGTGG 


TGTTAAGCCA 


GATAGTAATT 


TTTTCACCAA 


2100 


CATAACGTAC 


TTTCAACAAA 


TAATTCGTAT 


CTTCATTTAA 


CGTTGGCCCA 


GAAATTGAAT 


2160 


CATTCCACGA 


ATTAGGGCTT 


TCAACTAACC 


ATTTGCCATT 


f /-» rp rp T\ rp 7\ 

CGCGTTATACa 


r»/*"TA A A A A A 


2220


CCCAGCTATC 


TTTGGTATTA 


CCACGTATAA 


TCACACCCGT 


TCGACCGCTT 


CCAGCAGTGT 


2280 


ATTTAAACTT 


TGTTTCTACT 


TCACCATCAG


CTAATTTAGG 


GGAATCTAAA 


TTTAAAGATA 


2340 


CGGCATTATT 


ACCTGCTGCT 


GCATCTCGTG 


AAATCGCCAA 


TGACTCTCCT 


GCTAATTCTC 


2400


GATTTGTTTT 


ACCAATCACA 


TCTTGCCACT 


CACCAGTTTT 


CCCTCCTGGA 


AAATCAGATT 


2460 


GCCACAATTT 


TTCTTCTGTT 


GCGGTAGGTT 


CCACCATTGA 


TGAGCTTGTT 


TCAGTGCTGC 


2520 
TGTTTGTTGT TTCTTCTGCA GACGTACTTA ATGGTACTAA AATCGTTGCG CTTGCCAATA 2580

GTGTCAATGT ACTAAATCGT TTTATTTTTC CATGTTTCAT CTGaTAAATC AGCCCCCTGA 2640

ACATTTTTGG AACACGCTTT CATTTATTTA ACAATTAAAG GGGGGTTnCC AAAAATTAAA 2700 
TTGTnTTCAT TTTTGG 2716 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



TTGAAATTGC 


TAGCGTTTTC 


CGCCATTGGC 


TATGTTTAGT 


CGCAACGGAG 


GGACCsCtGA 


60 


CTTTTTAGCG 


AAGCATcgTT 


GCCTGTTAAG 


aAAGTGACGA 


AAATTTCTAA 


TCCTGAAGGG 


120 


GAAACCGTGC 


TGGATGTCAT 


TCGCAATGGG 


AATGCGCAAG 


TGGTAATCaG 


TACCATGGAT 


180 


AAAAATCGTT 


CCAGTGcCAA 


CCAAGATGGA 


TTTTCAATCC 


GTCGGGAAGC 


AGTCGAACAT 


240 


GGGATTCCAT 


TGTTTACCTC 


ACTTGACACA 


GCAAATGCGA 


TTTTAAAAGT 


TTTAGAATCA 


300 


AGAGCCTTTA 


CGACAGAAGC 


CATTTAAGCA 


AAAATCTGGA 


ACAATCGTCT 


GAGGAGGACT 


360 


GTTCCAGATT 


CTTTTACGAA 


GAAATAGAAA 


ATCATTTATA 


GGGGAATCAG 


AGAATAAAGG 


420 


AGTGAAGGAT 


ACTGATGCAG 


AGAAAGCAAG 


AAATGATGAC 


CATTGTAGCT 


CAAAAACAGT 


480 


TAGCGCCACG 


GATTTATCAG 


TTAGACTTAC 


AAGGCGAGTT 


AGTGAAAGAG 


ATGACACGAC 


540 


CTGGACAGTT 


TGTGCATATC 


AAGGTTCCGC 


GTGCAGATTT 


ACTTTTGCGG 


CGACCGATTA 


600 


GCATCAACCA 


AATTGATCAT 


TCGAACGAAA 


CCTGTCGTTT 


GATTTATCGA 


GTCGAAGGCG 


660 


CTGGAACGGA 


AGTGTTTGCC 


ACCATGAAGG 


CTGGTGAACA 


ATTAGATATT 


TTAGGTCCTT 


720 


TAGGAAACGG 


CTTTGATATA 


ACTACCGTAG 


CAGCTGGTCA 


AACTGCTTTT 


ATCGTTGGCG 


780 


GAGGAATCGG 


GATTCCCCCA 


TTGTATGAAT 


TATCAAAACA 


ACTCAATGAA 


AAAGGCGTGA 


840 


AAGTGATTCA 


TTTTCTTGGT 


TATGCATCAA 


AAGAGGTTGC 


TTATTACCAA 


CAAGAATTTA 


900 


TGGCATTAGG 


AGAAACACAT 


TTTGCCACAG 


ATGACGGCTC 


GTTTGGCGCT 


CATGGCAACG 


960 


TGGGCCGTTT 


ATTGTCAGAA 


GCGTTAGCAA 


AAGGACGGAT 


CCCTGATGCA 


GTGTATGCTT 


1020 


GTGGTGCGAA 


TGGTATGTTA 


AAAGCAATTG 


ATTCTTTATT 


TCCAACACAT 


CCACACGTTT 


1080 


ACCTTTCTTT 


AGAAGAACGG 


ATGGCCTGTG 


GAATTGGGGC 


TTGCTATGCC 


TGCGTTTGCC 


1140 


ATAAAAAAGG 


AGACACTACT 


GGAGCAAAAA 


GTGTCAAGGT 


CTGTGATGAA 


GGTCCGATTT 


1200 


TTAAAGCAAG 


TGAGGTTATC


TTATGATGAA AAAyCCTTTA GCCGTTTCAA TTCCAGGTTT 


1260 


AACATTAAAA 


AATCCAATTA 


TTCCCGCCAG 


TGGTTGCTTT 


GGGTTCGGGG 


A 


1311 



(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6948 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

TCCTCTGATG AACCGATTGA AAGAATTAAG GAAAGAAnAA AATATTACTT TAGTTGAGTT 60

AAGTGAGGAG TTAGGTATTC CACGTTCCAC TCTTAATAGG TATGAAAACG AAGATAGCGA 120 

ACCAAAACAA GAAaCTTGGG AAAAATTAGC TGATTATTAT GGTGTTTCTA CGGCTTATTT 180 

AATGGGGATA TCCAACCAAA AGGTTAGCGA AGAAAAAGCT TTGACGGCCG CAAAGAAAGT 240 

TTATCAAGTC TATCTTTCCG ACGACGATTT AGGAAAAGAA ATTCGAAAAG CTCTAATGTA 300 

TTTTAATAAA AATGATTTAG ATAGTGTCTT AAAACAAGCA ATGCAGCAGT ATTTTACTAT 360 

CCCAGCCGTT GAATGGAATA CAGAATTTCA ATCTTTAAAA AATACTGGTT TTCTATACAG 420

TTGGTTAGAA GGATATTTGG TAGACAGGTA TCGAAAAGAA GTGAAAACTA ATCATAATTT 480

AATTACAAAT ACGTATTATA GTATCCCTTC TATTGATGAT GTGAATGAGT ATGGAATTTT 540 

CTCTGATGAA ATTCAAATAC CGTTGACTGA AGATTTTAAA CAAATTTTCA AAGTTGAGGA 600 

TCTATCCACG AAAACACATG AAGAAATAAA AAAACTTTTT AAATCCGATT CTACGCTCAT 660 

TGcTTACAGC TATGAAAGCT CCGTAGACGA TAAATTAAAA GATGATATTA ATATGATTTT 720 

AGATAATGCT CGTTCCGAAA TTCTTAATCT AAAAGAAAAA TATCCTGATA AACCTAGTAG 780 

AATAGaACAA ACAACCGTAC TCATATCTAA AGACGAAGAT ACTTCCTTTT GGTCAAGGAA 840 

TGGCTCAAAT GATTATATAG ATGAATTGAA CTTATCTGAG GAAACTAAGA AATTATTTGT 900 

TCATTTAGGT TCAGTATTTA TTGAGGAAAA AAAACGTAAA TCAAAAGCTA AAGAATGAGT 960 

TTTTTTCTTT GTATAATCCA GTTATAGAAA CATCTCTCCC CTAACATCTA CATACTATTG 1020

GCGTGGTGGT AGATAGGAGA AAAACAATGA CACAAATTAA CAAGTATACA AAAAAAGACG 1080 

GTACAACCGC CTATATGTTT AATGCTTATG TAGGACGTCA TCCCAAAACT GGTAAAAACG 1140

TTTATCGTAA GCGGCAAGGA TTTAAAACAA AAAAACAAGC ACAAATTGCA CTTGCTGAAA 1200

TTCTGAAGGA TATTGAAGAA AATGGACTAG ATAATAAACC CTCTGTTTTG ACGTTTAAAC 1260 

AGCTTTACGA GAAATGGTTA GCACAGCAAC GGTTAACGAT TAAACCATCT TCAATCGCTG 1320 

TGAATAAGCG TTTTGCTGAA AAACACATAT TACCTTATCT AGGTGACTGC AAACTTGACG 1380 

AAATAACCGT TATACAGTGT CAGGACTTAG TTAATAAATG GTTTAATCAA GGACACAAGC 1440

AATACTCGTT CTATAGAAAA CTCACTGCTC AGATTATGCG ATATGGCGAA TCTATGGAAC 1500 

TAATGAATAC TAACCCAATG AGAAAAACAA TCCTTCCTAA GTGGAAAGAG GAAGAAACAA 1560 

AACTGGAGTA TTACACTAAA CAAGAATTGA ATCATTTTTT TGATTGTCTA AAACAATATG 1620 

GTAATTTCAA ACAGTTAGCA TTTTTTAGGT TGCTAGCCTT TACAGGTTGT CGTAAGAGTG 1680 

AAGTTCTAGC CTTGCAGTGG AAAGACCTAG ACATAACAAA CAAGAGCGTA TATATAGGCA 1740

AGACCGTAGC


ACAAGACGAA 


TTCTATAATA 


TACTCACTCA 


AACGCCCAAA 


ACAGCTTCTA 


1800 


GCACACGTTC 


TATCAGCTTA 


GACAATGAAA 


CACTTCAAAT 


TATGGCTAAA 


TGGCGTACAA 


1860 


TCCAACGTAG 


TGAcTATTTC 


CAAATGGGCT 


TTAATACATC 


TAGTGAAGAA 


CAATATATTT 


1920 


TCACGAACAA 


CCATAATAAA 


CTGCTCTCAC 


CAAATGTAGT 


TAATATATGG 


CTCAATTGCT 


1980 


TAATAAAAAA 


ATATGATTTA 


CCTAGTATTA 


CACCACATCA 


TTTTAGGCAT 


ACACATGCAA 


2040 


GTTTACTTCT 


TCAATCAGGA 


GTGCCTATTA 


AAGAAGTTGC 


TGAAAGATTA 


GGACATACTA


2100 


GCACTGCTAT 


TACTGATCGT 


ATCTATTCTC 


ATGTTATGCC 


TGAAGAAAAA 


GAAAAAACAG 


2160 


CAGATAAGTT 


CGCTCAATTC 


GTCGGTTTTT 


AATAAATGCA 


CAGACAAAAG 


CACAGATCTT 


2220 


TACTATTTTT 


ATGTAAAAAA 


AAGAGCTAGA 


ATCCTTATAA 


ATAAAGGTTT 


CTAACTGGCA 


2280 


TTAGTAACGC 


TATTAATGAA 


ACAACGTGTT 


TAATCATATA 


ATAAACCAAC 


AAATAAGTAA 


2340 


ATATCAAATT 


CTAAATTCTG 


TAAAAAACGA 


ACAATCACGA 


TTAAAAAAAG 


TATTCCTTTC 


2400 


TAGGAATACT 


TTTTAAATTA 


AATCGTTTGT 


GTGGTCGGGC 


GAACAACCAT 


TTCACTGATT 


2460 


GACATACGAT 


CTGGCGTGTC 


TATGGCAAAG 


ACCACTGCTT 


GCGCAATGTC 


TTCCGCCTTT 


2520 


AAGCCCCAAC 


TTAACTGTTC 


TAGATGTAAC 


GTTTCAGCTA 


CTACTCGATT 


AGAAATTGTT 


2580 


TGATAAAGTT 


CTGTTTGTAC 


AGCACCTGGT 


GAAATAATCG 


TTGATTTAAT 


GTTATTCTCT 


2640 


CTTTGTTCTT 


GTCGCAATCC 


TTCCATAATT 


GCTCGAACAG 


CAAACTTTGT 


TCCACAGTAG 


2700


ACAGCCGAAT 


CTGGATAAAC 


GACATGTCCT 


GCCACAGAAT 


CAGTAGCAAT 


AATCTGCCCT 


2760 


GATTTTTGTT 


CAACCATAAT 


TGGAAGAACT 


GCCGCAATGC 


CATTTAAAAC 


ACCCATAATA 


2820 


TTAATATCTA 


GCATTTGACG 


CCATTCTCCT 


TTTGGTGCTT 


CAATAAGAGG 


CGCCGTTGGC 


2880 


ATAACTCCTG 


CATTGTTAAA 


AAGAACATCA 


ATTCGTCCAT 


ATTTTTCCAT 


TGTAAGCTTG 


2940 


ATTACACGCT 


GAACTTCCTC 


TTCTTTCGTT 


ACATCTGCTT 


GTTGCACGAG 


AATCGTTGCT 


3000 


TCAGGAAGTT 


CTTTTTTTAT 


AGCAATTAAA 


CGTTCTTGTC 


GACGTGCTGC 


AATGACTAAC 


3060 


TTCGCTCCTT 


TTCTGGCAAG 


TAAACGGGCT 


GTTGCCTCAC 


CGATGCCACT 


GGAAGCGCCC 


3120 


ATGATAACAA 


TAACTTTTTC 


TGATAATGAT 


TTCATTATAT 


TTCCTCCTTA 


TCTTCCTGAA 


3180 


TCCAACGAAT 


AATTTTAGCT 


GGTGTTCCTG 


CAACAATAGC 


GTTAgcTGGc 


ACATCTTTAG 


3240 


TTACAGTCGC 


ATTAGCTGCT 


ACGATAGCAT 


TTTCACCAAT 


CGTAATACCT 


GGTAAAACAG 


3300 


TGACACCCGC 


GCCTAACCAC 


GCAAATTTTT 


TAATGGTAAT 


TGGTGCTAGC 


AATACGCCTC 


3360 


GCCGTTTCAT 


TGGCAGTTCC 


GGATGATTGA 


CACTTAAGAG 


ATTAACGCGC 


GGACCAATTA 


3420 


AAACATTGTC 


TTCAATTATA 


ATGCCCCCTA 


AATCAACAAA 


CATAGCTGCC 


CGATTAATAA 


3480 


AAATATTTTT 


ACCAAAATGT 


ATATTGCGTC 


CAAAATCTGT 


ATAAAAAGGT 


AATAAAATCG 


3540 


TAACGGTTTC 


GTCAATTTTG 


TCTTGGATAA 


TCGTGGCAAG 


AAGATTCCTT 


ACTTCATTAT 


3600 


GTGTATAGGC 


TTGTTGATTT 


AATTCTATTA 


AATAGCGACT 


ATTTTCCTCT 


TGAATCTGAT 


3660 


GAATAAGTTG 


AAATAGTGGT 


TCATCTTTAT 


CAATGCTACC 


TGTTGCAATT 


AGTTCTTGGA 


3720 
GATTTTTTGT


AAGCATTCCT 


AATCGCCTCC 


TTTTGCTAAA 


TTTATCGTAA 


TACGTGACAA 


3780 


CTCCTCTGTC 


TAATACTTAT 


AATAAATAGA TAAGTATGCG 


TTTTAGTTAT 


AGGAGGGAAA 


3840 


GTAATGGAAT 


TACGAGTGAT 


TCACTATTTC 


TTAGCAGTGG 


TCCAAGAAAA 


AACGATTAGT 


3900 


GGCGCAgcCA AACAATTGCA 


TGTATCACAA 


CCAACATTAT 


CTAAGCAATT 


AAAAGAATTA 


3960 


GAGGAAGAAT 


TAGGCGTGAC 


ATTATTTATA 


CGAGGAAATC 


GACAAATACA 


ATTAACACCT 


4020 


GAAGGAGAAT 


ATTTAGCTAA 


ACAAGGGCAA 


GATATTTTAA 


GCTTAGCGAA 


TAAAACAGTC 


4080 


ACCAACTTGT 


CGCAAAATGA 


ATTCATTAAT


GGCGAAATCA CGATTGGTGG 


CGGCGAAACA 


4140 


AAAGCTATGT 


CTTTTTTAGC 


GAATGCACTA 


CAACAAATAA 


CAAGCCAGCA 


CTCAGCTGAT 


4200 


ATTCACCTTC 


ATTTATACAG


TGGGAATGCA 


GATGATGTAA 


TAGAACGGCT 


AGACAAAGGA 


4260 


TTATTAGATT 


TTGGTTTAAT 


TATTGAGCCT 


GCACCTAAAC 


AAAAATATAG 


CTATTTAACA 


4320 


TTACCAGTTG 


TAGATACATG


GGGCTTAATT 


ACTGTAAAGG 


ACCATCCCTT 


AGCCACTAAA 


4380 


AATGTTATTA 


CTGCAGCCGA 


TTTAAAAGAA 


GAACCTTTAT 


TTATTTCTCG 


ACAAGCACAA 


4440 


GTTCCGAGCC 


AACTCTCTGA 


TTGGCTCGAA 


GCAAGTCTAG 


ATCAGTTCCG 


AATCGTTGGG 


4500 


ACCTACAACT 


TACTTTACAA 


CGCTTCACTG 


ATGGTAGAAG 


CTGGGCTAGG 


TAGCGCCCTA 


4560 


AGCATTGATG 


GTATTCTAGA 


AACAAAGCAA 


ACAAATTTAC 


GTTTTATCCC 


GCTATATCCT 


4620 


GCTTTAACAG 


CCAAAATTAG 


TTTAATTTGG 


CGCAAAAATA 


CGGTTCTTTC 


CACAGCTGCT 


4680 


GCATTATTTT 


TAGAACAAAT 


AAAAAAAAGT 


ATTCAACGCC 


CTGAATAATA 


GGTCGTTGAA 


4740 


TACTTTTTTA 


AATCATCACT


GTGATCGTAG 


CTGCTGCCAA 


AATCAAAACT 


AAGCCGATAA 


4800 


TCGTTACGGT 


CATTTCCTTG 


GCTGTTTTCT 


TTTGACCTAA 


GAACCAAATA 


CCTGTCAACG 


4860 


TTGCAAGCAC 


AACTGATGTT 


TGGGATAGAA 


TAAAGCCAGT 


TGCCAAGCCG 


TTCATATCTG 


4920 


GcTGCGCTGA 


AATTAAATAA 


GTTAACGCTG 


CAAAGGCGAA 


GAAAAAGCCT 


GAAAAAATTT 


4980 


GTTTGTAAGA 


AACGGCTTCT 


ATAAATGGGG 


ATATTTCCCC 


ACCCTTAATA 


GTGACAACCG 


5040 


CTGAGTAAAT 


AACTGCGACA 


ATTACCATGC 


CTATTGCTTG 


TGGTAAAAAG 


GCATGCATGC 


5100 


CATCAATTGC 


CGTTGCTTGA 


GGTGCTGCAG 


AGTACGCCCA 


ATAGCCGATT 


TCACCAACGG 


5160 


CTAATAGAAG AACTGCTTTT 


TTCATAATCC 


CAGCACTCTC 


AGCGGACTCC 


GTTTCCGACC 


5220 


AGACCGTCAT 


TTTGGCACCA 


ATCATGATTA 


ACACTAAGGC 


AAATGCGCCT 


AACAATTTGG 


5280 


CTGTCGCACC 


TGGCCAGTTT 


CCTAAGAAAA 


AGACACCCCA 


GAGAGAAGCA 


CCTAAAAGTT 


5340 


GGAACGCGGT 


CGTTACTGGC 


ATGGCACGTG 


ATGAACCAAT 


CATCGTGAAG 


CATTTAAACG 


5400 


TAATGATTTG 


CGCACAGGCC 


CAACCGACAC 


CAGATAAAAT 


TGAAAAGAAT 


AAATCCATGC 


5460 


CCGCTGGAAA 


AGCTAAACCA 


TTAATCATTG 


AAAAAATAGC 


GGCAAAAATC 


AATGTTCCTA 


5520 


AACTGGTACC 


GAGAATTTGA 


TTCACTGGAC 


GACCACCAAT 


TTTGGAAGCG 


ATTGTTGGGA 


5580 


AAAGCCCCCA 


ACCTAGAAGC 


GGTCCTAAAC 


CGATTAATAA 


TGCAGTCGCA 


TTCATCTCTA 


5640 


TTTCCCCTCT 


CTCGTTTAAA 


AGACAACGTT 


ACTTTCTAAT 


AAAATATTTG 


CGTAAGGAGT 


5700 
CATTTCCCCT GTCCGAATAA ATGCATGGCA GTTGTTTAAT TCTTGTTTCA TTTCTGTATG 5760

AGGAATAAAA CTGATGGGAG TTTCTGGTAA ACGAGTTTCA ATTGCTGCCA ACATGTCAGG 5820 

ATTTTCTGTT TTAATTTCTT CTGCTAAATA AATTCGTTGA ACTGCCAATT CCTCAAGGAC 5880 

ATTGTTTAAA ACTTCCATAA AACTAGGGAC GCCATTGGTA ACTGCTAAAT CAATTTTTTC 5940 

TGTAGTCCGA GGCACTGGCA TGCCTGCATC ACCAATACTT AATTTATCAA AATGACCCAT 6000 

TTGTGCGATG ACGCGGGAAA TATCTGAATT AATAACTTTT GTTTTTTTCA TGTCGCTCGC 6060 

TCCTTTTTAT TTGTCTGCCA ATTCGTGTTG ATAAGGAATC GAAGGTTGGG CTCCAAAACG 6120 

TTGAACAGTC AACGAAGACG CCTTGTTTCC ATAACGAATA GCTTCTTCCA AATTGCTAAA 6180 

ATCTTTTTCT AATATACTAC TTAATGCGCC AATAAAAGTA TCGCCAGCAG CCGTTGTATC 6240 

AACCGCTTTC ACTTTAAAAG CAGGCACAAT ACCACTTCGT CCATTGACGT CATAAAAGGC 6300 

GCCTTTACTA CCTACTGTAA TAATTACTGC TTCAAyCCCT AATTGATGAA GTGCTTCTGC 6360 

GGCTTTACGC ATACTCGCTT CaTCTGTGAT TTTAATGCCT GTTAAAATTT CGGTTTCTGT 6420

TTCATTTGGT ACAATCATAT CTGTCACGTT TAGTAATTCT TCAGGAACTT GTTCTAACGC 6480

AGGTGCGGGA TTTAAAATCG TTTTGACGCC TGCTTTTTTA GCAATTTTAA ACGCTGCAAT 6540

CGTACTATCA ATCGCACTTT CAAATTGGGC AATCACAAAA TCACTTTTTT CAATAATTTC 6600 

TTGATGTTCC TGGACTTGTT TTGGCGTAAA GGCGTTATTG GCTCCTGCGT AAATCATAAT 6660

GCTATTTTCA CCAGCGTTAT CTACCAyAAT AAAGGCTTGG CCAGTAGCTG TTTTCTCTAA 6720 

AGTGGTGACC CCAGTTAAAT TAATTTCATC TTGACTCATT AAATCAGTCA TCATAGCTCC 6780 

AGCGCCATCA TTTCCCACAG CACCAATAAA ATATGTTTCT GCGCCTGAAC GTTTCGCTGC 6840

AACTGCTTGG TTTGCTCCTT TACCACCACC TGCTGTAAAA TGTTCAATGG CGTGAATTGT 6900 

TTCGCCTGGT TTTGGCATTT CTTTGACTCT TAATGTTGTG TCTAAATT 6948
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AATTTGTTGA ACCAGCGATT GCGCnGCnAA TACTGGTTTT TCCAGTGCCA GGCGGCCCAT 60 

AAAGAATCAT CGATGAAAGC ATTTTGGCTT CGACCATCCG ACGAATAATT TTACCAGGGC 120 

CGACTAAATG TTGTTGACCA ACTACTTCGT CTAAGTGACG AGGTCGCATT CGATAGGCTA 180 

AAGGTTGTTG CATAGAAGGA CCTCCTTTTT CATAATCATC TTATTATAAC ATAAAGAACG 240

TACATyCGCT TTTTGGTGGk TTTTTGGGkT TTAGATGGAT ATAGCGGAAG GTTcGTGtAT 300 

GATAGGGAGT GGTGAAAGAA ATGAAAGAAA CGATAAATAT CTTTAACACA AAAACAACTG 360 
AAGAAGTGGC


ACAATACTTG 


TTAGGGATGT 


ATTTAGmGCA 


CGAAACAGCA 


ACAGGCGTTT 


420 


TAGGTGGGTA 


CATTGTGGAT 


GCAGAAGCTT 


ATTTAGGTCC 


TGATGATGAG 


GCGGCCCATA 


480 


GCTTTGGCTT 


GAGAAAAACG 


CCACGCTTAC 


AAGCCATGTA 


TGACAAACCA 


GGTACGATTT 


540 


ATTTATATAC 


TATGCACACG 


CATTTAATTT 


TAAATATGGT 


GACACAAGAA 


CAAGGCAAAC 


600 


CACAAGGGGT 


GATGATCCGC 


GCGATTGAAC 


CTGTTGAAGG 


TGTCGACAAA 


ATGATTGAAA


660 


ATCGCCAAGG 


ACGTCAAGGC 


GTGGAATTGA 


CCAATGGACC 


AGGTAAATTA 


GTTGCAGCCT 


720 


TAGGAATTGA 


TAAGCAATTA 


TATGGGCAGT 


CGATTTTTTC 


TAGTTCGTTA 


CGGCTCGTGC 


780 


CAGAAAAACG 


AAAATTTCCG 


AAAAAAATTG 


AGGCACTACC 


ACGAATTGGG 


ATTCCCAATA 


840 


AAGGTCGTTG 


GACAGAGTTG 


CCTTTAAGAT 


ACGTCGTGGC 


TGGCAATCCA 


TATATTTCTA 


900 


AACAAAAAAG 


AACAGCAGTA 


GACCAAATAG 


ATTTTGGCTG 


GAAGGATGAA GAAAATgAAA 


960 


AAAGCAACAA 


TGCTCACATA 


CTTAGAGGAA 


CAACTTGAAA 


AACACTTAGG 


AGATTACGAA 


1020 


gTAGGCCTTG 


ATTGGGATCG 


CAAAAACCAT 


ACCATCGAAG 


TCATTGTTcG 


TTTATATGCA 


1080 


GAAAaCAATG 


AGCmAGTGGC 


GATTGATGAT 


GTTGATGGTA 


CGCTTTCAGA 


AGwAGAATTC 


1140 


ATTGAATTTG 


AAGATGGTTT 


GTTATTTTAC 


AATCCGCAAA 


AGTCTGTCGT 


TGATGACGAA 


1200 


GAGTATTTAG 


TCACAATTCC 


TTATGAAGGG 


AAAAAAGGgC 


TACGCAAAGC 


AGTTTTAGAC 


1260 


GGATTCATTC 


ACTATTTAAA


AGTGGTTTTA 


GATGAAGGGC 


AAAGTGATTT 


GTTAGACTTT 


1320 



TTATCAGATG 


AAACAGCGGA 


AGTTTTTGAA 


TTGCATTGGG 


AGCCAGCAGA 


TTTTGAAGCG 


1380 


ATGATTAAAA 


AAGTGGCAGA 


AACAGAAAAA 


GAACAATGGA 


TTGCGTATCC 


AAGTTATTAA 


1440 


TTTAGAGGTG 


AGAACATGAA 


GTGGACAGAA GTAAAAGTCG AAACGGCTAG CGAAgCAGTT 


1500 


GAAGCAATTT 


CAAACATTAT 


GATGGnArCT 


GGCGCAAGTG 


GCGTGGCCAT 


TGAAGATGCG 


1560 


TTGGACATTG 


AAAATTTTGA 


AAGTGATCTG 


TATGGGGAAA 


TTTTAGATAA 


AGAGCAATTC 


1620 


ACCCACATTA 


AAGAAGGGGC 


GATTGTGATG 


GCTTATTTTC 


CTGAAACAAC 


CTTCTTACCA 


1680 


GAAATTTTAC 


CATTTATGAA 


AGAAAATATT 


TTACGCTTGC 


CTGAGTACGG 


CTTATCCATT 


1740 


GGTAAAAACG 


AAATGACAAT 


TAGTGAAGTA 


GCAGAAAGCG 


ACTGGGCAAC 


TGCTTGGAAA 


1800 


AAATATTATC 


ATCCTGTCCG 


TGTCACACGA 


TTCTTAACGA 


TTGTGCCAAG 


TTGGGAAGCC 


1860 


TATCACGCAC 


AAGATGAAGC 


AGAAAAAATT


ATCACATTGG 


ATCCGGGCAT 


GGCTTTCGGT 


1920 


ACAGGCACGC 


ATCCAACGAC 


TCGTTTAACC 


TTGCAAGCTT 


TAGAAaCTGT 


TTTACGTGGT 


1980 


GGCGAAACAG 


TTCTAGATGT 


AGGAACGGGT 


TCTGGTGTTT 


tGAGTATTGC 


AAGTCGCTAT 


2040 


TTAGGGGCTA 


AAGACGTCTA 


CGCATATGAT 


TTAGATGAAG 


TAGCAGTTGC 


AGCGGCAAAA 


2100 


GAAAaTATGG 


ATTTAAATCC 


CATTGCGGCC 


GATGTTCATG 


TGTCAGCCAA 


TGATTTACTA 


2160 


AAAGGGATTG 


ACCATTCTGC


TGATGTAATC 


GTCGCAAATA 


TTTTGGCAGA 


TATTATTGTG 


2220 


TTAATGATTG 


AAGATGCTTG 


GCGTTTGCTA 


AAACAAGACG 


GTACCyTCmt 


TAtTTCTGGA 


2280 


ATTATTGAAG 


wTAmAAAAGC 


aATGGTTTTA 


GAAGCACTAA 


CGAAAGTCGG 


TTTTGTGGTG 


2340 
GACCAACTCT TTAATCAAGG CGATTGGTAT GCGATTaTCT TGAAGAAACC aGAGGAAGAG 2400

TAAAaTGCAA CGGTATTTCA TGAAAGAaGA TTATCCAgAA AAgATTTGTA TAAgTCGCAG 2460

ATGAAAACTA TCATCATATT GTGCGAGTTA TGCGAATGAC GCCGAATGAT CGTTGTTATT 2520

TAGTATTTCA AAATAAAACT GCCATTTTAG CTGAGATTGT GGAAATTGAT TCAACATCTG 2580

TTTACTTTAA AGAAATTAGT AAAGAAGAAA TGGACAAAGA ATTGCCAATT GAAGTGACGA 2640

TTGCGTGTGG GTTACCAAAA GGGGACAAGT TAGAATGGAT TGTTCAAAAA GGCACGGAGC 2700

TAGGTGGCAA TCAGTTTATC GGCTTTCCTG CGAAAAtTCC GTTGTCAAAT GGGACCACAA 2760

AAAAAGAGCA GCCAAAGAAA AAAGATTACA AAAAATTGCC ACAGAAGCGG CGGAGCAATC 2820

GCATCGACAG CAGACACCAA GTGTTTCTTT GGTGGAAAAA ACACAAGAAA TCATTGCGCA 2880

ATTTGACTCA TATGACACAG TCTTGGTTGC GTATGAAGAA TCAGCAAAAC AAGGAGAAAA 2940

AAGTCAATTA GCACAAGTAT TATCCACTTG CCAGCCAGGC GCACGTTTGT GTGTGCTCTT 3000

TGGACCTGAA GGTGGCTTTG CACCGCAAGA AATTGAACAA TTTCTGCAGG CTGGGGCGAA 3060

ACTTTGCGGG TTGGGTCCGA GAATTTTACG AGCAGAAACA GCACCGTTGT ATCTTTTAAG 3120

TGTGGTTAGC TACCAAATGG AACTTTTAAA TTAAACGAAA GCCCGCCTGT AAAATTCTTG 3180

TAGGCATTCT CTTTAAATAA ACAGAGAGTA AAAAAGCGAC AGTGATTGAA ATTTAAGGGA 3240

AATTTAGTAT AATAGAGTTG TTGAAAGTGC TACTGTTTCA AGTGGTACTT TTCTTTTTTA 3300

TTAAAATAGA TAAAACGTGA ACGAATgnTC TTTATCCAAA GGTCACCTTC ATAGAGGAGT 3360

GAATACCATG CCAAAAGAAG AAATACTGAC AGGCCCAGCG GTCATCAAAA TTGTAAGTAC 3420

CTATATGGGA CCAGAACATG TTGAGTTGGT TCAAAAAGCA TTAACTTATG CTGAAAAAGC 3480

ACACGAAGGT CAAGTCCGAC AATCAGGTGA ACCTTACATC ATCCATCCGA TTCAAGTGGC 3540

AGGCATTCTG GCTGAATTAC ATATGGATCC CCATACTGTA GCAACAGGAT TTTTACATGA 3600

TGTTGTCGAA GATACGGACG TAACACTGGA CGATTTAAAA GAAGAATTTG GCGAAGACAT 3660

CGCAATGTTA GTCGATGGCG TGACGAAACT AGGAAAAATT AAATATAAAT CTCATGAAGA 3720

ACAATTAGCT GAAAATCATC GGAAAATGTT ATTGGCAATG GCGCAAGATT TACGGGTCAT 3780

TATGGTGAAA CTAGCCGATC GTTTGCACAA TATGCGCACA TTAAAACATT TACGTGAAGA 3840

TAAGCAACGA CGGATTGCAC AAGAAACTTT AGAAATTTAT GCGCCTTTAG CACATCGCTT 3900

AGGGATTAGT CGAATTAAAT GGGAATTAGA AGATACTGCC TTACGTTATA TTAACCCCAA 3960

CCAATACTAT CGAATTGTTA ATTTGATGCA AAGCAAACGC GATGAACGTG AAGCTTATGT 4020

GGCAGAAGCA GTCGAAGATA TTCGTTTAGC yACGGAAGAT TTAGAAATTT ACGCrGAAAT 4080

ATATGGTCGA CCAAAACATA TTTATTCAAT CTATCGTAAA ATGAAGGACC AAAAGAAACA 4140

GTTTAATGAA ATCTATGACT TATTAGCCAT nCGTGTTATt GTCGATTCCA TTAAAsaTTG 4200

CTATGCCGTG TTAGGGcGAk CATACACGTT GGACACCAAT GCCAGGTCGC TTTAAAGATT 4260

ATATTGCTAT GCCAAAAGCG AATATGTATC AATCTATCCA TACGACAGTT ATTGGACCAA 4320
AAGGGAATCC


AGTAGAGGTC 


CAAATTCGGA 


CCCATGAGAT 


GCACCAAATC 


GCCGAATTTG 


4380 


GGGTGGCGGC 


TCACTGGGCC 


TATAAAGAAG 


GCAAAGCGGA 


AAAAATTGAA 


ACAGATGAAC 


4440 


TGACGAAGCA 


AGTCGATTGG 


TTCCATGAAA 


TTATTGAATT 


ACAAGATGAA 


AGCTATGATG 


4500 


CTTCTGAATT 


TATGCAAGGT 


GTGAAAGAAG 


ACATTTTTAG 


CGACAAAGTC 


TATGTCTTTA 


4560 


CGCCAAGTGG 


AGATGTAACC 


GAACTGCCAA 


AAGGCTCCGG 


ACCACTAGAT 


TTTGCTTACA 


4620 


GTATCCATAC 


TGAAATTGGG 


AATAAAACAA 


CCGGTGCGAA 


GGTCAACGGC 


AAAATGGTAC 


4680 


AGTTAGACTA 


TGTCCTAAAA 


AATGGGGATA 


TTATTGAAGT 


CCTAACATCA 


CCGAATTCTT 


4740 


TCGGACCTAG 


TCGCGACTGG 


CTGAAAATGG 


TTAAAACAAG 


TAAAGCCAAA 


AATAAAATCA 


4800 


AACGTTTCTT 


TAAAGAACAA 


GACCGTGAAG 


ATAACATTAT 


TAAAGGGCAT 


GATGCTGTCA 


4860 


TTAAATACAT 


GACAGAAATC 


GGCTTTACAC 


CGAAAGAATT 


TTTAACGAAG 


AATAAAATGG 


4920 


CGGAAGTGTT 


AGATAAATTT 


AAATTTCAAA 


CGGAAGATGA 


TTTATTTGCG 


GCAGTAGGTT 


4980 


ATGGCGAAGT 


AAGTGCGCAA 


GTAGTGTGCA 


ATCGTTTAAC 


AGAAAAAGAA 


CGCCGCGAAC 


5040 


AAGAGTTGGA 


ACGCCAACGT 


CAAGAAGCAG 


AAGAGTTGTT 


AAATCAGCCA 


GCTAAAAAAG 


5100 


AATCAGAAAA 


AATGAAAGTT 


CGTCATGAAG 


GCGGCATCGT 


GATTCAAGGG 


GTGGACAACC 


5160 


TATTGATTCG 


TTTAAGCCGC 


TGTTGTAATC 


CAGTGCCTGG 


TGATGAAATC 


GTTGGTTACA 


5220 


TTACCAAAGG 


ACGTGGCGTT 


TCAATTCATC 


GTGCGGACTG 


TCCTAACGTA 


CAACATCAAG 


5280


AAGAACTAGC 


ACAACGTCTA 


ATTGAAGTTG 


AGTGGGAAGA 


TACCGAGCAC 


AGTCGCAAAG 


5340 


AATACGCTGC 


CGATTTGGAG 


ATTTACGGCT 


ATGATCGTAG 


TGGCTTACTG 


AGTGATGTTT 


5400 


TACAAGTCAT 


TAGTTCGATG 


ACAAAAAATT 


TAGTCGGCGT 


TGAAGCTCGT 


CCATCCAAAG 


5460 


ATAAGATGGC 


CCAGATTCAT 


GTTACTGTGA 


AAATTCAAAA 


CCTTTCTCAT 


TTAAAAACCA 


5520 


TCGTTGACAA 


AATTAAAAGT 


GTACCAGATG 


TTTATAGTGT 


TCGTCGAACC 


AATGGGTAAG 


5580 


TGTTAAAAGT 


GAGGCAAAAA 


AGATGAAAGT 


AGTTATTCAA 


AGAGTCAGTC 


AAGCACAAGT 


5640 


TGCGATAGAA 


GAGCAGATAG 


TGGGACAAAT 


CAAACAAGGC 


TTCATGGTGC 


TGGTGGGGAT 


5700 


TCATCAAGAA 


GACACGCCAG 


AAGATGTGGC 


TTATGTTGTT 


GGCAAAATCA 


GTAAATTGCG 


5760 


TGTTTTTGAA 


GATGACGAAG 


GAAAAATGAA 


TCGCAGTATT 


CAAGAGATTG 


AAGGATCAAT 


5820 


TTTGAGTATC 


TCTCAATTTA 


CCCTCTATGC 


AAAAACTAAA 


AAAGGCAATC 


GTCCCAGTTT 


5880 


TATTGAAGCG 


GCTCGTCCAG 


ATGTGGCGAT 


TCCTTTGTAT 


GAATTATTTA 


ATCAACAATT 


5940 


AGAAGCAGAA 


GGAATAGCGG 


TAGCCACAGG 


AGAATTTGGC 


GCGGATATGC 


AAGTGTCGTT 


6000 


GACCAATGAT 


GGTCCAGTGA 


CCATTGTGAT 


TGATACACGA 


GAAAAATAAT 


TGATTTTCCT 


6060 


GTCGGAACCC 


CTTGACTTTG 


TAAGAGAACT 


TTAGTAGTTT 


ATAGATAATA 


TTAAAAATCT 


6120 


ACGAAGGAAC


GAGTAAGTTT 


CATCTGTCTG 


CGAAGAGAGA 


GTTGCCGTGG 


CTGAGAGCAA 


6180 


CTTCAACAGT 


GAAAGTGAAA 


GACATTCTGG 


AGATGGACGG 


AAGAAAAGCC 


GTCCCGTTAC 


6240 


GAGCGTTAAT 


CGTTGAGGAA 


AGAGTAAAAT 


CTTTCAAAAC 


TAGGTGGTAC 


CGTGTAAAAG 


6300 
AGACTTTTAC GCCCTTGTAT GTTCAACATA CAAGGGCTTT TTTCATTTTT AGAAAAAAGA 6360

AAATGGCGCA CACATTTTTG TGGAATGATA GAAGGAGAGA AAGACAATGA GTTATCAAAA 6420 

ACCAAAAGGA ACAAACGATA TTTTGCCAGG AACTTCTGAA AAATGGCAAT TTGTGGAAGA 6480 

AACAGCTCGT TTGATTTTTA AAGATTATCA ATACCAAGAA ATCAGAACCC CGATTTTTGA 6540 

ACATTATGAA GTAATATCTC GCAGTGTTGG CGATACCACA GATATTGTTT CAAAAGAAAT 6600 

GTATGATTTT TATGATAAAG GAGACCGTCA CGTGACGCTA CGTCCTGAGG GGACAGCGCC 6660 

aATTGTTCGG GCCTTCGTTG AAAATAAATT ATATGGTCCG GAATATACGA AACCATATAA 6720 

AACCTATTAC ATGGGGCCGA TGTTCCGCTA TGAACGCCCA CAAGCTGGTC GTTTGCGTCA 6780 

ATTCCATCAA ATTGGTGTGG AAGCATTTGG TAGTGAAAAC CCAGCATTGG ATGTTGAAAT 6840 

CATGGCTATG GCTTTGGACT TCTTCAAACA ATTAGGCATC CAACAAATCA AATTAGTTAT 6900 

TAATTCCTTG GGGGATAAAG AAACACGTGC TACGTACCGT CAAGCATTAA TCGATTATTT 6960 

AGAGCCCCAT ATGGCAGAAT TAAGCGAGGA TTCACAACGT CGCTTACACG AAAACCCATT 7020 

GCGGGTGTTA GACAGCAAAG ATAAAAAAGA CAAGGTGATT GTCGCAGAAG CGCCCTCCAT 7080 

TTTGGATTAT TTAAATGAAC CATCTAAAGC ACATTTTGAA GCAGTAACTG ATATGTTAGA 7140 

TTTACTAGAA ATTCCTTATG AAATTGATAG TAATATGGTT CGTGGCCTGG ATTATTATAC 7200

ACACACAATT TTTGAAATTA TGAGTGAAGC GCCTAAAATG GGTGCGCAAT CAACTATTTG 7260

TGCAGGAGGC CGATACAATG GTTTAGTTGA AGAATTAGGC GGCCCAGACA CACCAGGTTT 7320 

TGGTTTTGGT ATGGGCATTG AGCGAGTGTT GTTAACAATG GAAGCTGAAG AAGTTGTGAT 7380 

TCCAGCGTTA TCTGAATTAG ACGCATATGT GGTTGGGATT GGTTCAGACA CCAACGTCGC 7440

AGCTTTGCAA CTTGTTCAAA GCATTCGTAA CTTTGGTTTC TCAGCTGATC GTGATTACAT 7500 

GAATCGCAAA CCAAAAGCGC AATTTAAAAC GGCCGATAAA TTACAAGCAA AATTAGTTTT 7560

AACAATCGGT GAAAATGAAT TGAATGAAGG CATTGTCAAC GTAAAATCAA TGGCAACACG 7620

CGAAGAAAAA GCCTTCCCGT TAAGTGCTAT TCATGATTCA TTTGATGAAG TGTATGACGA 7680 

AATGATGACA AAAATGATTG AAGAATAAAA AGGAGAGATT GAGAAATGGA AAAACGCACA 7740

ACCTATTGCG GCAATGTATC TGCCGAATTT ATTGAAAAAG AAGTTGTATT AAAAGGATGG 7800 

GTTCAAAAAC GTCGTGACTT AGGAGGCGTT ATCTTTATTG ATTTACGTGA CCGTGAAGGA 7860 

ATCGTCCAAG TGGTCTTTAA CCCAGAAAAG TCAAAAGAAG CATGGGAAAT TGCTGATAAA 7920

TGTCGTAGTG AATACGTCAT CGAAGTTAAA GGTCAAGTCG TTTATCGTGA TAAAGAAGCG 7980

ATTAATCCCA AAATGAAAAC TGGCGAGTTT GAAGTGATGG CGACTGACAT TACGATTTTA 8040

AATACTGCGA AAACAACGCC ATTTACCATT GAAGATGACA ATAACGTTAA TGACGAATTG 8100 

CGCATGAAAT ATCGTTATTT AGATTTACGT CGTCCATCAA TGACAAACAA CATTAAATTA 8160

CGTCACCAAG TAACGAAAAC AATCCGTCAC TATTTAGATA ATCATGACTT TTTAGACATT 8220 

GAAACTCCTT ATTTAGGCAA ATCAACACCA GAAGGCGCTC GCGATTATTT GGTTCCTTCT 8280 
CGTGTCCATG CCGGTCATTT TTATGCTTTA CCACAATCAC CACAACTCTT TAAACAATTA 8340

TTAATGGGGG CAGGCTTCGA TCGCTATTAC CAAATCGTTC GTTGTTTCCG TGATGAAGAT 8400

TTACGAGGCG ACCGTCAGCC AGAATTTACA CAAATCGATA TTGAAACGAC CTTTTTAACA 8460

CCAGAAGAAA TTCAAACATA TACAGA 8486



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

AATGTCAAAA TTATTAGTTG TTAAAGCACA TCCATTAACA AAAGAAGAAT CACGCTCAGT 60 

TCGTGCGTTA GAAACATTTT TAGCTTCATA CCGTGAAACA AATCCATCAG ACGAAATCGA 120 

AATTTTAGAT GTTTATGCCC CAGAAACAAA CATGCCGGAA ATCGATGAAG AATTATTATC 180 

TGCTTGGGGT GCACTTCGCG CTGGCGCAgc ATTTGAAACA TTAAGCGAAA ACCAACAACA 240 

AAAAGTGGCT CGTTTTAATG AATTAACTGA TCAATTTTTA TCTGCAGACA AAGTAGTAAT 300 

TGCTAATCCA ATGTGGAACT TAAACGTACC GACACGCTTA AAAGCTTGGG TAGATACAAT 360 

CAACGTTGCT GGAAAAACAT TCCAATATAC TGCAGAAGGA CCAAAACCTC TAACAAGTGG 420 

TAAAAAAGCC TTACACATCC AATCAAATGG CGGCTTCTAC GAAGGAAAAG ATTTTGCTTC 480

TCAATACATT AAAGCGATTC TAAACTTTAT TGGCGTTGAT CAAGTTGACG GATTATTCAT 540

CGAAGGAATC GATCATTTCC CTGATCGCGC GGAAGAACTT TTAAATACTG CCATGACCAA 600 

AGCAACTGAA TACGGTAAAA CATTCTAAAT TTTCCGAGTG ATTTACATCA AAAaGAGACA 660 

AGCAGTTCGT TCACTGCTTG TCTCTTTTGT TACGTAGATT CTTGTTGaTC TGGcTTATCt 720 

TCATcATACA AACCGGaAAC GaTTTGGtAC CACTCaAAAA TGTGTGTAAT ATCACnTTAG 780 

CTGCGGCAT 789

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4159 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGTCGTTTAT TACGTCGTGA CGGTAATGCA CGGGAAAAAA ATTAGGCATT AATGAAGCCT 60 

TTTTATACAA ATTAGTGCCT GTTGTTGGCG AAATCATGGT AAGCTACTAT CCAGAAGTAT 120

TACAACAAAA AGACTTCATC GAAAAAGTGG TTCGGACAGA GGAAGAAnGT TTCCATGAAA 180 

CCATCAATGA AGGTTTAAGC ATGTTAAATG AGGTTATTAA AGAAGTTAAA GATGCAAAAG 240

GCGATACATT AGATGGAAAA ATTATCTTCA AACTTTATGA TACTTTTGGc TtCCCCGTAG 300

AATTAACGGA AGAAGTTGCA GAAGATGAAG GCTtGAAAGT CGACCAtGCT GGGTTTGAAA 360 

CAGAAATGGA AGCACAGCgT GAACGTGCTC GTTCTGCCCG CAGTAAAGAA ACTTCAATGG 420 

GTGTTCAATC TGCTTTATTA ACAGATATTA AAGTAGAAAG TAAATTTGTT GGCTATACAG 480

AATTAACACA TGATAGTGAA TTATTTGTTA TTATTCAAGG TGACGCACTA GTAAATGAAG 540 

CATCTGCAGG AACAGCCGAA TTAATTTTTG CTGAAACACC ATTTTATGCT GAAATGGGTG 600 

GACAAATTGC TGACCGCGGC TATGTAAAAA ATACCGCAGG GgAAGTCGTT GCCAACGTGG 660 

TGGATGTGAA AAAAGCACCA AACGGTCAAT TTTTACACAA AGTAGAAGTT TTGGCGCCAT 720 

TAGCAGAAGG TCAAATTTAT CAATTGCAAG TGGACGAACG GATGCGGACA CGTaTTTTGA 780 

AAAATCATAC GGCAACCCAT TTATTGCATC GTGCTTTAAA AGATGTGTTA GGGGaGCATG 840 

CCAACCAAGC AGGCTCATTA GTTGCACCAG GACATCTACG TTTTGACTTT ACTCATTTTG 900 

GGCAAGTTAC ATCAGAAGAA CTAGCGCGGA TGGAAGCCAT CGTTAACGAA AAAATCTGGG 960 

AAGCTATTCC TGTTGTCACA ATTGAAACAG ATATTGATAC AGCGAAAAAC ATGGGCGCAA 1020 

TGGCGTTATT TGGCGAAAAA TATGGCAAAG AAGTCCGTGT AGTTAATATT GGGGATTACT 1080 

CTATCGAATT ATGTGGTGGA ACACACGTTG CTAATACAGA AGATATCGGG ATTTTCAAAA 1140 

TTGTTTCTGA ATCTGGAATT GGTGCAGGAG TACGCCGTAT TGAAGCAGTG ACGAGTAAAG 1200 

AAGCGTATCA ACTTTTACAA GAAGAAGAAC GTCAATTGAA AGAGATTGCT ACATTAGTTA 1260

AATCACCTCA ATTAAAAGAA GTGGTTACGA AAACAGAGCA ATTGCAACAA CAATTACGTG 1320 

ATCTACAAAA AGAAAACGAA CAATTGGCAG GCAAATTAGC GAACCAACAA GCTGGTGACA 1380 

TTTTTAAAGA TGTCAAAGAC ATCAATGGTG TTCGTTATAT TGCCGCTCAA GTCAATGTTA 1440

AAGACATGAA TCAATTACGC CAATTGGCTG ACCAATGGAA ACAAAAAGAA TTGTCTGATG 1500 

TACTAGTTCT AGCAACTGCA CAAGATGAAA AAGTAAGCTT GTTAGCGGCT ATGACAAAAG 1560 

ACATGAACGG AAAAGGCTTA AAAGCCGGTG ACTTAATCAA AGCAATTGCG CCAAAAGTTG 1620 

GCGGTGGCGG CGGTGGTCGT CCTGATATGG CTCAAGCTGG TGGGAAAAAT CCAGCAGGGA 1680 

TTGCGGATGC CTTAGCAGAA GTTGAAAACT GGTTGGCAAA TGCCTAATTA AAAATGTTTT 1740 

CCCTATTATT AATATAAGGA GAATGCTCTT TATAGCaATA ATGACAGCCC TCTTCAAAGT 1800 

TATGAAATTG ATTTTGAAGG GGGTTCTTTT TTTtAAATGA TTAAGGAGAG GAAACAGTAG 1860

ATTGCACGAA TGTGTAAGTT TTCATCTTCA ATTGACAATG GTATAATAAG CAACGGAGGA 1920 

ATCAAAAATA TGAATGAAAA TCAATTaTCa AAACGTCTCG CAACGGTTGG TGATTTAATC 1980 

CCTAGAGGAA GTCGCTTAGC AGACATTGGT TCAGACCATG CTTATTTACC AGTTGCTTTA 2040 

ATGTTAGAAA ATAAACTGTC TTTTGCTGTG GCTGGGGAAG TGGTGGAAGG CCCCTATCAA 2100

TCTGCCAAAA CCCAAGTCAG TAAATCCAAT TTGACGGATA AAATTATCGT ACGTTTAGCG 2160

AACGGGCTAG ATGCAATTGA ACCTGAAGAT CAmATAGACG TCATTAGTAT TTGTGGAATG 2220

GGCGGCACAT TAATCCGAGA TATTTTAGAA GCTGGACGCA AAaAAAATCG TTTAACAGGG 2280

AAGGAACGCC TTGTTTTACA ACCGAaTATT GGTGAACCAA CTTTACGCCG CTGGTTAATG 2340

GCAAATGACT ACAGTATTAT TGATGAAACA ATCGTGGAAG AAAACCGTAA ATTGTATGAA 2400

ATTATCGTAG CCGAAAAAAC AGAGCAATCC GTTTCTTATA CGGACCAAGA ATTACTATTT 2460

GGGCCAGTTT TAATTAAAAA ACAAGGACCG GTTTTTACTA AAAAATGGCA ACGAGAATTA 2520

AAACAGCGCA AAACTGTTCT TGCACAACTA GCAAAAGCAA GTGGAGAACA TATTGAAAAG 2580

CAAGCGAAAC TACAGCAAGA CCAACAATTA ATTGAGGAGG TACTGGCCAA TGGCTGTGAA 2640

CGGTAAGACA TTGATTCGAC GCTTCAATGA CTATTGTCCC GAATGGTTAG CAGAAACGGG 2700

AGATCCTGTG GGCTTGCACA TTGGTACATT AGATAAACCA ATTGAAAATG TGATGGTTAC 2760

CTTAGATGTT CGTCCTGAGG TCGTTGCAGA AGCAATTGAG AAGCAGGTTG ATTTAATTAT 2820

TGCGAAACAC CCACCTATTT TCCGCCCAGT TAAGCGATTA ACCACAGATA ATTTTCAAGA 2880

AAAAATGTAC GCTGATTTAT TAAAACACGA TATTGCTGTT TATGCAGCGC ATACTAACAT 2940

GGATATTATT GACAATGGGC TAAATGATTG GTTTTGTGAA TTATTGGGAA TTAAACAGAC 3000

CACCTTTTTA ACTAAAACAC ATACTGTTCC TTATAAAAAA TTAGCCGTTT TTGTGCCAAT 3060

TGATGAAgCT CCGCAAATGA GAGAAGCTTT AGGTCTAGCT GGCGCCGGGT CTCAAGGTGA 3120

TTATTCTAAA ACAAGTTACT CTTTAATTGG AACAGGACGT TTTACACCAA CACAAGGAGC 3180

AAATCCAACG ATTGGCGAAA TTGGTCAAGA AAGTGTCGTC CAAGAAGCAA AAATCGAAGT 3240

GATTTTCCCT GAAACTAAGC AAGAACAAGT TCTTGCAGCT ATGTTACAGG CGCATCCTTA 3300

TGAAGAACCG GcGTATGATG TTTATACCAT TGAAAATCAA TCAAAAGAAT TTGGTTTAGG 3360

TCGGGTTGGA GTATTAGACA AGCCTGTGAG ATTCTCTTAT TTTGTACaGC AAGTCAAGGa 3420

AGCTTTCCaA TTGGATGGCT TGCGGkTCAT TGCAAAAGAT GATACTAAAA TGATTCAACG 3480

GGTAGCCATT TGTGGTGGAA GTGGTGAAAA ATTCTATCAT GATGCATTAC GTAAACAAGC 3540

CGATGTCTAT ATTACTGGTG ATGTGTATTA TCATACGGCA CATGACATGA TTGCTGAAGA 3600

TTTACCAGTT ATCGATCCTG GTCATTACAT TGAAGCGCTT TGTAAGCCAr AACTTGTGGA 3660

ATTAATGAAT CAATGGAAAC mAGAAAaTGA GTGGgCTGTA TCCATTTTkG AaTCAGAAGC 3720

GAATACCAAC CCATTTCGTT TTAAATAAAA AAATGAAGAT TAAAGGAGTG TTCTTTTATG 3780

TATGAAAATC TATTACCTCG TTTTTTACGG TATGTGAAaA CAGAAACACG CTCGGATGCA 3840

ACTAGCACAA CGACACCATC AACACAAACA CAAGTAGCCT TTGCACAAAC CTTAAAAAAA 3900

GAATTAGAAG AATTAGGGAT GAGCGATGTT ATTTATAACG AAACAAATGG TTTTGTGATn 3960

GCTACGTTAC CTAGTAACGT GGAGAAAGAC GTTCGTTCAA TCGGCTTTAT TGCCCATATG 4020

GATACCGCTG ATTTTAATGC AGTGAATGTT TCTCCGCAAA TTGTTGAGAA CTATGACGGA 4080

GAATCAACGA TTCCTTTAGA TAAAGAAGGC AAATTCACTT TAAACACGAA AGACTTTCCT 4140

AACTTAAAAA ATTATCGTG 4159

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



ATCGATACAA 


T ryi T TT T n"* T T 


GTPAPA A PAP 


A AP A AC A ATT 


ATTApQPATT 
nl 1 nonOn 1 1 


TTA A A nrATP 

1 lAAAAbATb 


60 


CCTTAAAAGA 


TAAAGPPTTA 


GTTGTPAPTA 


PZ\TTAP.TPAP 
On 1 1 nu 1 bAb 


x rtHHLHAt J, A 


A 1 b AbAbbGb 


120 


CGAAAGAATT 




APAPPPTTPT 


1 lAbA 


ttt a ATrrrr 


CCATTTTTTG 


180 


AATTAATTPA 


AGPPAAAPPP 


P P. A P. T A P. Zi T P 


pt a TTfRnrn 
J, bAAb A 


btL 1 bbAbbA 


CGbbACbAAC 


240 


TAGATCGTGC 


PTATTTTGAT 


A A A ATPTPAP 
nAnri x O 1 OAb 


PPA TTPl ATT 
bbAl IbnMl x 


TrPTPTA AAA 

1 1 b 1 AAAA 


1 AlbaTGATG 


300 


GC AAAAAT P P 


TPAAPGPTTT 


PTTPATTPTP 


1 b 1 1 


bl I Abbbb 1 f 


TCbCGGACCT 


360 


CAAAGACGCC 


AGTCAGTATG 


TATTT APPPA 
J. n X 1 XnuOwn 


ATP A APPPT Z\ 
n. 1 b AAbbb x A 


PPPPPTTTPT 

bbbbb 1 1 1L1 


AAL 1 I AbbA I 




TAATTCCAGA 


AGTTPPATTG 


PPPPP A A TTT 


TPP A AP A A AT 

1 b b AAb AAA i 


/"^fAT^T^A^A A 

bbAl bbAbAA 


AAAATGATTG 


4 80 


GTTTAGTTTG 

W XXX XXX w 


TTPGPPAGAA 


APATTAPPAP 


A.P A TTPPT A P 
AuHl IbblAb 


bAb 1 bbb lib 


bbi IbbI TAG 


54 0 


GTTTAGGTAA 


TGAGAPPAGT 


TAT A PP A ATP 


TTPA QPPP AT 


TPA APA APA A 


i i Abb x I ATb 


600 


CCGAAGAGAT 


TTTTGPGAAG 


TATGGPATPP 


P APTC ATTP A. 


tpt a a rrrpn 
1 b i AAbbbbA 


AAA TPfPTrr 

AAA lb lblbb 


DOO 


AAGAAACAGC 


PTTTTTAATT 

V*> J. J. A A X n^i X X 


AAAGAAAAAP 


TAPATPAAQP 
1 nun x o\AAMb 


AAATTAAAAA 

nnnl IrtHHnA 


PPTTPPPA A/"* 

bbl IbbbAAb 


/zu 


CTCACTAAAT 


TTCAGTGAAG 


TTGPPAAGPT 


TTTTATAPTT 
X XI X n X nv x 1 


A ATTf7AP.PTT 
nn 1 1 onu Oil 


blxlllbllA 


Ton 


GTCAATATTT 


TTTTGATCGA 


CAATGAGTAA 


AGPGATPATP 


APCTACAPAA 


TPPA APT APT 
1 bbA/ib 1 Mb 1 


OHVJ 


TTTAGAATAA 


AGTGTATAGC 


GGGGTTGCPA 


TTTGPTAGPA 

XXX VJv# X flOV/ri 


A A TTTTTPPT 


TPT ATTPP.PP 


on n 


TAATCCTTGG 


AAGGAGTAAA 


TTTCTGAACP 

xxx x urxn^ w 


AAATTPATAA 

ruwi x x un x nn 


ATTAAPGPAP 
n 1 X nn^ O ^no 


PA ATPPPTTP 


you 


TTGAATAAAA 


CTTTTTCTTG 


ATTGACCAAC 


GTTGGCTAAP 


GGAGPPATPP 


PTA A A TTPA A 
U 1 nnn X 1 unn 




ATAATGAATA TTTTCTGTTT TCATATATTC AAATAAATGA ATAAATAAGA AATCCATACT 1080

TCCAGAAGGG GCTTTTTCTT TATGGTGACG CATCAAGTCG ATGGTACCAA CTTCATTTGT 1140

GTAACTAGGA ATGATATTAG CGAAAGAGAC CATTTCTCCT TCAGCATTTC GGACAACAGC 1200

AATTGGGTTT CTTTGAAGAT ACGCTTCATC GAAAAATCCT AGTGAAAAGC CTTTTTCTTT 1260

TCGATTGTCT AACCAACTAT CCGAAATAGC TTTTAATTCG TGCATTTGTT CGGTACTAAA 1320

AGGAGGTTGG AGTACGTCAA ATGAATAGCC GGCTTTAGTA ATTTTATTAA ACACTGCGCG 1380

GGTACCTTTG AATTTTTTCC CAGACATCGT AAATGTTTCT AGATTAACTA AGGCTTCTTC 1440

GCCCATTTTG ATAAATTCGT AGCCATATTC GTGTAGTAAC ATCACGCTGT CCTCATTGGT 1500

TTCGTAAAAG ACGGGTACAT AGCCGAAACG ATCTGTTTCT TTAAGGAATG CATCAATCGC 1560

CGCAGGAAAA TCTTCTTTGT TCCCAGAAGG GTTGCCCATC ACAACGCATT TATTATTAAT 1620

GGTGTTGAAT TGTAAAAAGA CAGTCGGTTC TTCTTTGGGA TATAAGAAAA CCTGTTTATC 1680

ATGAAGAAAG ACAAGTTCAC TGTCAATATT CCCGCCATAA GTTGTTAAGA TATTATGCAA 1740

GATACTATCT TCTGGAAACT CACCAACTTG GTGTTTTTTC CCTTGGAGGT AGCGAACAAA 1800

TAGAAAGTTA AAACTCATAA CTAAGAGGAT AGCGATAAAT CCTGACAACC AGATCCGTTC 1860

GGAAGGGAAC AAGAAAAATT CCACAAAATG ATGACGATGG TGTGGGAAGT TCGGTAAATT 1920

ATAAACACCA ATTACCAGGT ATAACAAAGT CAACAAGCCA AAGATAAAGC CATCAATCGT 1980

CATCCACTCC CAAGAGTAAA CCAGTTGTTC ACGGTACAAT TCTGATTTTG AAAAAATAAC 2040

GATAAATAAT AAAATAGCTA AATAGAAAAT GGAAAACATA CTAAAGTCAA CAACAAATGA 2100

ATACAAGATA GCACCAATTA ATAAAATAAT CGTTGGATAA TAGGCTCTTT tAaCmCGATC 2160

AGCATCCCcG TCCCATAATT mATAATGCAA AGCCTAAAAC AATACTCGGA ATTTGnATAA 2220

TAATATGTGA ACGGAAGGGG TTAATATCTC TTAACCAATG CACTTGAGTG AAGGCTTCTG 2280

GAATAGTGGC TAGGAGCACC ATCATAATTC CTGAAAAATA AAGTAAAACC ACTTCTAGCT 2340

TGTGGGCAAT TTCCAAAGTA ATTTGTTTTA GCAGTTGACG ATAATGAGTA TCCAACTTTT 2400

GGCTTAAGTG TTTTGTAAAG AACAAACAGC CAATTAAGAA CGGAATAATA TAATAGAATA 2460

AACGATAGAG CAATAACCAC AAAACAATAA TTTCACGATC GACACCTAAG TTGCTCAGTC 2520

CTAAAATCAT CATGACATCA AAACTACCTA AAGCCCCTGG AATCATCGAA GCAATCCCAA 2580

TAATCGAAGC TGCTACATAT AAAGGAACAA TATCAATTAA AGGAATCGGT ACATCTAATA 2640

AGTAACCAAT ACTGATAAAG GTAATTAAAA CACCTGTCCA TTCTAAAAAC GAaGAAACAA 2700

CTAGAGATAA GCGATTTTTT GCATCGAGGT TACCAAAGGA ACTATTTTTT TGGAAATGTA 2760

AAATAAGATA AAGTAACGGG AAATAAAGAC CGCCACCTAA TAACCAAATC CAATATTGCT 2820

GTAAATAAGC CATCTTTGGA TCAAACTGAA CAAGGAAAAA GCAAAGAATA CTATAAATAG 2880

ATAGACCGGA TAACACAAAA AGTAAAATAT GGGTTAACGC AGGTAAAATT TTCTTTTCCT 2940

CTGTTTTATT TCCATAAAAT TCAGAACGAA GTCCCATGCT GACCAAACCA CCGAATCCAC 3000

CAaTATTATT GATTGTATTG ACAATCCAGC TAGATTCAAG GAGTTCCCGT TTTGAGACTT 3060

TCAAGTTTAA AATGCGATTT AATGTTAAGT CATAATTTAG CATCGGAGTA ACGGAAACTA 3120

AACCGATAAC CATCATCAAT AACAATTTCC ATAGCGGAAT CTCATCAAAA ATCTGTTTTA 3180

ACTGTTCAAA TGAGATCGTT TTACCAATTG ATAAAAGTTG GGCAACAACA ATAATTGAGA 3240

CCGCAATAAC AAACAGTGTT TTTAGTAATC CTGTATGCGT TTTGATCCAA TGAATAATTT 3300

GTTTCATTTA GACACCTTCA TTTCCTTTAA GAATTTTTCG ATTGTTTTAT TAAAAATATC 3360

TGGTGAGTTT TTGGCAAATA AATGACCCTG TTTATTTACC AAGACCAATT TGCCGGCAGG 3420

AATGGcATCA GCAATCGCCC GTGAATGGGA AAGCGCAATT AAATCTCGAC TGCCAGTAAT 3480

AACAAGAGTA GGACAGGGGA TGTGTTCTAA ATCTTGTTTT GTCAGTCCAG TATCTTCAAA 3540

TAACAGAGCC ACGATAGGGA GCCGTGCGCG CAATTTTTTT GAAAAAAAGG CGCCAATACG 3600

ACAAAACCAA TATTGAAAAT AAGTGGCAAT CGTTACATAT TTTTTCAGAC CAGACATGGT 3660

GACATTTCCA GAATTTAAGA CGAGTTTGTT GACGAAATCA GGATATAAAT GGGTAAAAAC 3720

CATGGCAATA TTTGCTCCGT CACTAAAGCC CAATAAATTA ATTTTTGTTA AGtGTTCTTG 3780

TTGAATAATA CCAGCTAAAT CTTCTGCCAA AAGCGGAAAA GTTAATTGAC TTTGTGTATT 3840

GGTGGAACGT CCATGTC^GC GGGTATCCAC CGCAATAACT TGATACTCCT TGGAGAAATA 3900

AGAAATTTGA TAATTAAAAA AAGTATGATC ATTGCCATTA CCATGAAGTA ACACAAGGGG 3960

GTAATTTGTT kCTTTGCCAA AAATTTGAyA AAAAATTTGA CTGCCATCTA TTGTTTGAAA 4020

ATATTTTTCT ATTGCCATTC TCCGTCACCT ATCTCATCTA TAATCAATCG AGTCCTCTCT 4080

CATTATAGAG TCTTTTGTCT TTTTTTTGTA ATAAAAAACA GAAAATTTTC CGAACAGTTT 4140

TGTAAGGTGT TGTAATATAG TGTGATATAG TAGAAGATAG CCACCTAAAA ATTAGACTAG 4200

AGGAGGGAAA GTTGATGTCT TTAATCGTTT TAATTGGTGC CCAAGCTGTC GGGAAAATGA 4260

CCGTTGGCAA GGCTTTAGAA AAGCAGTTAG ATGCTAAATT ACTATTTAAT CATCAAACCA 4320

TTGACTTATT TGCAAACTAT CTTGGGTATA CGGAACGTGc ATTTCAATTG TCTGATTCCG 4380

TTAGAAAGGA ACTTTTTCAT GCCTTTGTCG AAAATCCAGC CACAAATACA ACAAAAAACA 4440

TTATTTTTAC AGTGGTGATT GCTTTTGATC AAGAACAGGA TTTTCAATTT TTAGAAGAAA 4500

TATCAGAAAT TTTTCTTAGT AAACATGAGT CCGTTTATTT CGTCGAACTC GTCTCTTCAA 4560

CATCAATCCG CCTAGAACGA AATGTTCATC CAGAACGGCT AGAGGCAAAA CCTTCAAAAC 4620

GCGATGTCGA CTTTTCTCGA AAAGAATTAC TCAATACCTT TAAAGAACAT CGATTAGTAA 4680

GCCACGAaGG TGAACTAGAA AGTATATTTC CAAATGTAAA AATAGCGAAA ATAAATAATT 4740

CAAACATGAC ACCAGAAGAA GTAGCCACGA AGATAATTGC TACTTTTGGT TTGACATAAT 4800

GTGAAAGGAA CATCGGAATC AATGAAAAAG TTTTTTAAAT ATCTCGGTAG CTTGATGGTC 4860

CTTCTTTTAA GCATAGCTTT AATTTTGATA GGATATTTAA CCTTCCGAGA GTTTCGACCA 4920

CGAGCGGTTG AACCAGTTGC TATAAATAAA CAATTTTCAG AGCAAACCAT CGAAAAAAAG 4980

CAATCCATTT CTTTAGTCAC GTATAACATA GGCTATGCCG GCCTAGGACA AACAGAAGAT 5040

TTTTTTATGG ACGGTGGCAA AACTGTCCAA CCAGTTAATA AGGCAATGGT CCAACAGAAT 5100

TTAACAGGCA TTGAAGAAAC CTTAAAAGAA TTACCAGCGA TGATTTACTT ATTCCAAGAA 5160

GTCGATCGTC GCTCGCAACG TTCTTATGAA GTCAATCAAG AGGAAGAATT ACAAAAACAG 5220

CTACAGCTGA ACAGCGCATT TGCTTATAAT TTTAAGGTAG ATTATGTGCC CATTCCTTGG 5280






AAGTGGTTTA 


TTAACATTAT 




AATAACAGAG 


5340 


GCCAAACGCA TTGCTTTACC CAATCCCTTT CGTTGGCCCG TTAGCATTAG TAACTTAAAA 5400

CGAGCACTTC TAGAAaCGCG CTTTCCAATT AAAGGCACAG ACAAAGAATT AGTGGTTTTT 5460

AACTTACATT TGGAAGCCTA TGACAATGGG GAAGGGAAGA TTGCTCAAAG CAAAAAGTTA 5520

GCTGACGTAT TATCCCAAGA ATACGCTAAA GGAAACTATG TAATTGCTGG AGGTGACTTT 5580

AATCaAGTTT TCaAAGGCAG TCATCGATAT CCTGACTTAG GGGAAGCTGG TTGGGTACCA 5640

PPAGAAATTG


ATCCAGCAGA 


CTTACCTAAA 


CATTTTTCAC 


TTGCTTATGA 


TGATCAACAA 


5700 


PPGAPTGTGC 


G G G TGC T T AA 


"taagccttac 


ACTGGCTCTT 


ATGAAACATC 


CCAAGTGTAT 


5760 


GTAATTGATG 


GTTTTATTGT 

\j X X X X a* X X X 


TTCTGATAAT 


GTGGCTGTTC 


ATAGCGTGAA 


AACAAAGGAC 


5820 


GAACAATTTA 


AA T AT AC AG A 

nn x a* x **\^*avj ** 


TCACCAACCT 


GTGAAGATGG 


AAGTTGGGTT 


GAAATAGAAA 


5880 


A A^n AG GAG A 


TAGATGAAAT 


taggggttat 


TGCCGCAATG 


GATTTAGAGC 


TTCAAAAATT 


5940 


ATTP.P.AATAT 


TTTPPACCAC 


AACGAAAGAT 


ACAGCTAGCA 


AAAAATACAT 


TTTATATTTA 


6000 


TPA APA A A A A 


APATPGPAAG 

nv>r\A • w vy/uivj 


TCATTATGGT 


TTGTGCAGGA 


CAAGGAAAAA 


CGAATGCTAC 


6060 


P.PTATATAGT 

X rt A rt X rtVJ X 


PAAATACTTA 


TTGATTCATT 


TCAAATAGAA 


CAACTAATTA 


ATATCGGTAT 


6120 


1 iyiyuuioi 


TTAAATGAAA 


AACTGCAACT 

nnu x vj v^nnv x 


CTTTGAGATG 


GTGCTAGGTG 


AGGAATACTG 


6180 


PP ATT APGAT 


ATTPGTGAAA 


GACAGTCAAT 

\J,t\\-'**\J X wiMl X 


CAATAAATTT 

X ****** X A. X 


CCTTATCAGC 


TGTACTATAA 


6240 


ii pp. a ft: st a a a 


GGAATGPTGG 


PPGAAPTTCA 


ACATCTAGAT 


GAAAAGATAA 


AGTGTGTTCG 


6300 


1111 bunn^ 1 


P.PAPAAPPPT 


TPPTTTGTGA 
1 v^*j x x x o x on 


CACAACTGAA 


AAAAGAAACT 


TAATTGAACA 


6360 


Al l^lLlnl 1 


PAPTPTPTPP 
VjriV^ lulul \3<3 


AT ATPPAATP 


PGPAGCTATT 


GCACAATGCT 


GCTTTTTAAA 


6420 


1 tztt.1 1 oL-LjU 1 


TTTPTTTPTA 
1 1 lul 1 lulrt 


TAAGAATTAT 

1 t\r\\j£\r\ 1 lnl 


TTGPGATAGA 

X 1 VjOVJfl a nun 


GCGGATGCTA 


ACGCAGTCAA 


6480 




caTararaaa 

un 1 rt^rt^n-rVrt 


PAPAPPPAAT 


GGAAAAAGTA 


TTTGAATTGG 


TCTGTGCATA 


6540 


TAT A A ATT A A 


AAAAP.AAAPA 


GPTTGGAACA 


AAAATCCAGA 


AAGATTTTTG 


TTCCAAGCtT 


6600 


A A A A APT A AT 


ATAGGTTGGG 


GATAAAAGTA 


ACATTGTTAT 

nvn x x Vp? x x a a. x 


TGTTCCAATT 


ATTTTACAGA 


6660 


TAGCCTTTAG 


TAATAGAATG 

x nn x nwnr* x vj 


AGTGGAATTT 


AGCATATTTG 


CTGGGGTTCC 


TGTAAAAAGA 


6720 


AGPTGTPCGC 


PATTTTTTPC 

W li X X X X X X S^V* 


ACCTTCGGGA 


CCTAGTTCAA 


TCAACCAGTC 


GGCACTTTTA 


6780 


ATCACATCAA 


TGTGATGTTC 

X v_J X \J *1 X %J X X 


TAATAAAATC 


AAGCTGTTTC 


CTTCTTCTAC 


CATTCGATTG 


6840 


AAAAGTTGAA 


GACTTTGTTG 

unv/ x x x vj x x \j 


GATATCAATT 


AAGTGTAAAC 


CATCTGTAGG 


TTCATCCATT 


6900 


a A AT AG AT AG 


PTTTTTTTTG 

V^XXXxXXxXVj 


GTGAAGTGTG 


TCTGCTAATT 


TTACCCTTTG 


CAATTCGCCA 


6960 


GGAGATAATG 


TTGAGAGCGA 

x x un\jnu ^— » wn 


TTGATTGAGT 


TTAAGATAGC 


TTAAGCCAAC 


TTCCAGTAAA 


7020 


TTTTTTAATP 
i x x x x i x \3 


AAAGAGCAAA 


AGGCTGGTCT 


TTGAAAAAAT 


CAT AGC CATC 


TTTAACGCTT 


7080 


AAAGCTAGTA CTTCAACGAT ATTTTTTCCG TTATACAGAT AATGAAGCAC CTCTTCTTTG 7140

TAACGCGTTC CGTGACAGGT TTCGCAAATA CTAGTAACAT CCTCCATAAA AGACATATCG 7200

GAGACAATTA TGCCTTTCCC CTTGCAAGTA GGACAGGCGC CTTTGGAATT ATAGCTAAAT 7260

AAAGCTGGAC TAACATGATT TTCTTCCGCA AACAGTTTGC GGACCTTATC AAAAATATTA 7320

AGATAGGTCA TAGGTGTGGA TCGTAAATTT GCGGTAATGC TTTTTTGTGA AAGATGAATA 7380

ATTTCTTGGT TATCAGCCTG GGCTTTTTGA TAAATTTCTT CCGCTAATGA TGATTTACCT 7440

GAACCAGCGA CCCCACAAAT AACAGTCAAA ACTCCTAACG GAACTTCGAC AGATAGATTG 7500

TTTAAATTAT GCAGTGTCGC ATGTTCGATT GATAAAGACT TCCTTGCTTT TCTTGGTTGG 7560

TCGTTTAAAG GGAGCGGCTC TTGAAGCGCT TGACTAGTCA AGGTTTTGGA GACTAAAAAT 7620

GCGTCATACG TTCCTGAAAA CTGGACATGG CCACCATTTT CGCCTGCGAA AGGTCCGATA 7680

TCGATGATAA AATCAGCTTC TCTAATTAAT TGTGGATTAT GTTCCACGAG AACCACGGTG 7740

TTTCCTTTAT TTTTTAAATT GAGCAATGCA CGACTGATCC GTTCGATGTC CTTTGGATGT 7800

AAGCCAGCAC TTGGTTCATC TAAAATATAC ATAATATCAT TTAAGGCGCT GTTTACATAC 7860

TTAGCAATTT TAATCCGCTG TGCTTCACCA CCAGAAAGTG TTTCAGTTGC TCGTCCTAAT 7920

GTGAGGTAGG AGAGACCGAT ATTAATTnAT GCTTCTAGAC GTACAAGTAG CTCTTCTTGA 7980

ATAGTTTTTA TTAAGGATAG GTCCATTGAA CGAATAAAAG AATGTAATTC AGTGAGTGGC 8040

ATGTCAACAG CATCAGCAAT ACTTTTCTGA TTAATTTTGC AGCTACGAAC ACGTTCATTG 8100

ACTCTACTTC CTAAACAATC AGGACATCTT TTTACGGTAA CAAAGTGATT AAGGTATTTT 8160

TGATGACGTT TGCCTTCGTC TGTATGCAAT ATGCTACGTT GCATACGCGG GACGATTCCT 8220

TCATACAAAG CTGTATGAGG CCACTCTTTG GGTGGATTAG CTAGTTTTTG TTGTGGAGCa 8280

TATAArmrTA ATGCTAACTC TTCAGGAGmw TAGTCCTTGA TTTTwkTATC CAAATCAAAT 8340

AGACCACTAT GGGCATAACG CTTCCACCGC CAGTTGC 8377
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GTTTTTAAAC ACGAACGTTA AACGGCCGAA TGTTCGGTTT TTACGTTATT TTTGTAAATA 60 

ACTAGGAGGC AGTAAGTATG AAAGTTTTTG ATTATGAAGA TGTCCAATTA ATTCCCAATA 120 

AATGTATCGT GAATAGTCGT TCTGAATGTG ATACTACAGT AACTTTAGGA AAACATTCGT 180

TtAAAATGCC TGTCkTcCmG CGAACmTGCm AACatTATgr TGrAACgATT GCGGAAcCCT 240

TGCTGAAAtG GTTATTTTTA TATCATGCAT CGGTTTGGAT GAAGAGGCGC GAGTGCCaTT 300

TATTAAAAAA ATGcAACAAA AGGGCTTGAT CACGTCAATT AGTGTGGGTG TGAAAGAAGG 360

GGAATACGCT TTTGTTGAAA CGTTAGCCCG AGAAGGTTTA GTTCCTGACT ATGTGACAAT 420

TGATATCGCC CATGGTCATT CAAATGCGGT GATTAATATG ATTCAACACT TGAAAAAATC 480

TTTACCAGAA ACGTTCGTTA TTGCTGGCAA TGTTGGAACA CCAGAAGCCG TTCGCGAATT 540

AGAAAACGCT GGAGCCGATG CCACCAAAGT GGGGATTGGA CCAGGAAAAG TTTGTATTAC 600

TAAGATTAAG ACTGGCTTTG GGACAGGTGG CTGGCAGTTA GCGGCTTTAC GTTGGTGTGC 660

CAAAGCTGCT CGGAAACCAA TTATTGCAGA TGGTGGAATT CGGACCCATG GAGACATTGC 720

AAAATCTGTG CGTTTTGGGG CaACTATGGT CATGATTGGT TCaTTATTTG CTGGACATGA 780

AGAATCACCa GgTGrAACaA AAGTGGaAGA TGGTGTTGTT TACAAAGAAT ACTTTGGTAG 840

CGCCTCTGAA TTCCAAAAAG GCGAAAAGAn AAACGTTGAA GGCAAAAAAA TCTGGTTACG 900

CCATAAAGGG AAGTTGGCAG ATACTTTAGT CGAAATGCAA CAAGATTTAC AATCTTCCAT 960 

TTCATACGCG GGAGGTCGCG ATTTAGAAGC GATTCGTAAA GTTGATTATG TGATTGTAAA 1020 

AnATTCTATT TTTAATGGTG ATACCATTTA ACATTCAACT ATAGACTGAA AAGAAGACCG 1080 

CACGTAGnAA GGAAATTAAA AT 1102 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1009 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:



n/i/ilJ-ruA 1 rttb 






A TT ft A T ft AT A 


TTTTAfiAARA 


TAAAGAGATT 


60 


CAAATCGTTG TGGAATTAAT GGGAACCATT GAACCAGCCA AAACGTATAT CATGAAAGCT 120

TTAGAAAAGG GCAAACATAT TGTGACTGCC AATAAAGATT TACTGGCGCA ACATGGGAGT 180

GAATTAGTAG CGTTAcCCAA AAACATCATT GTGATTTATA TTACGAAGCA AGTGTTGCTG 240

GCGGGATTCC AATTTTAAGA ACAATTGCTA ATAGCTTAGC GGCAGACAAT ATTCAACAAG 300

TTTTAGGGAT TGTGAACGGT ACGACGAATT ATATGCTTAC GCAAATGGTT TCTGCTGATA 360

AAAGTTATGA AGAAGCTTTA GCAGAAGCGC AAgcGTTGGG GTTTGCTGAA GCGGATCCGA 420

CAAATGATGT CGATGGAATT GACGCAGCCT ATAAAATGGT TATTTTAAGT CAATTTGCTT 480

TTGGGATGAA TGTTTCATTA CCACAAGTTG ATATTCGTGG AATTCGCGGA TTGTCTTTAG 540

ATGATGTCGC TATGGCTAAA CAACTGGGGT ATGAAATCAA ATTGATTGGC TCTGCTGAAC 600

AAAATGAGAA TAGTATCTCG GTGGAAGTGG CGCCTATGCT AGTCAATCAA AAGCACCCTA 660

TTGCATCTGT TCGGAATGAG TACAATGCAG TATTTATTAA AAGTGCAGGC GTGGGGGAAT 720

CAATGTATTA CGGTCCAGGA GCTGGAGCTA AACCGACAGC AACCAGTGTG GTCAGTGATT 780

TAATTACGAT TGCTAAAAAC ATTCGCTTGG CAACGACTGG CCATATGTTC AATTCGTATC 840

AACATAAAAC ACAATTGACG AGTTCCGAAA ATGTATTTGG GCAGTACTAT TTTyCATTAG 900

aTGTTCCAGA TACACCAGGA CaATTTTTGC AGTTGACGCA ATTAATGACT AAAGCAGAAG 960

TCAGCTTTGA TCAtTAGTTC AACAGaAATC AGATGGACAG CGAGCAAGA 1009



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5558 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

nCGGnTAAAA TTTTTTCAGC ATCCTTATAT TACCAGTTTG GTCTTTCAAg AcTACTTGcG 60

ATcAGCaCtT TgTAcAgTTA aTTGTGCGCA ATATGCCAaC AaTCCAGaGC TTTtAACTaG 120

TCAaTTATTT GGACATCTTA AAGgCGCTTT TACGGGAGCA GaTTCCGATA AAATTGGGsC 180

CTTTCAAAGT GCAGAAGGGG GCGTTTTATT TTTAGATGAA GTTCATCGTT TGAGTCCTGA 240

AGGGCAGGAA AAATTGTTTA CATTTTTAGA TCAGGGTATT ATTTATCGAA TGGGTGAAAC 300

CAATCGGCCC ATTCCTGTTA CGTGTCGGTT GTGCTTTGCC ACAACTGAAG AAATTTCTAG 360

CACCTTTTTA ACGACATTTT TAAGACGAAT TCCTATTCAA ATTAAAATCC CaAGTCTTGC 420

TGAACGCACT CAAGCGGAAA GAAAACAATT AATTATGCGA GCATTTTATG AGGAACAGCA 480

AGCTATTCAA AAAGCAGTTA CTATCACGCC ACAAGTTATC CAACTTTTAG AAAACCACCA 540

TTATGTAGGC AATGTAGGTG AATTAAGAAA TAATATAAAA ATTATTACTG CACGCAGCTT 600

TGCCGCCAAT CTTGATAAAT GTGTTATTCC AATCACTTTG CACGATCTAC CGAAAGAATT 660

TCTGGATCAA TCAATCAAAC TAGTTCCTGA TGAAAATGAA TTGCCTATTC GTTTAGATGA 720

GCAAACAAAT CTGGTTAGTT TGTTAGAGGA AACAGAGCTT GCGCAArGAA GAATTATCCA 780

GAGTTATGAA CGAATCCTTC GTCTTTATGT CTCTCACCAT CATCATCTTT CCACTGCTAA 840

TAACGATATC TCGAAAGAAA TTGAACGCTT ATTCGATGAT TTATTATTTG AAAAGAAGCG 900

AGAGAAAAAT CACGAAATGC TATTGTTTAT TACGCAAAAT ATCCGTCAAT TATTGGAAAC 960

GATTGAATCG TCTTATCAAA TTCGGTTTAA TGGCAGTTTA GTCTATGCAT TAAGCACCTA 1020

TCTGTTTCAA CGAAGATGCA TTGATTGGTT TCCAGAACAA GAACCTACAA CAGTCATTGA 1080

CGAACTTTTA ACAGAAGTTC AAACCAAATT AGCAACTAGT TATGGtACGC TGAACAACTG 1140

TTAACTTTAG TGAAACGTTC ATTAGATATT GAACTATCCC AGATGGAwCG AATTATTGTC 1200

ACCATTTATC TTCACTATTC GGGTTCAGTA AAGAAAAGTC ACTATCCGAA AGCGGTGATT 1260

GTTGCCCACG GATATGCCAC AGCAAGTAGT ATTGCGAATG TTGCAAATCG GCTGTTAAAT 1320

GTGCCAATTT TTCAATCGTT TGACATGCCG TTAGATGTTA CTCCCArAAA AATTTCCGAA 1380

CACTTAATTC ACTATATGGA ACGTCAAGAA ACGAGGAATG GTTTAGTAAT TCTTTTTGAT 1440

ATGGGTTCAT TGAAAGAwAT TTATCAGTAT TTtCCGGCAG AAGrAGAAGG kCCtTTTTTA 1500

tTAATGraTa ATGtGACmAC mAGcTTGGCT CTGkCTaTwG GAGrAGCCmT TwAAGmCGAA 1560

kTTCTTTTGA AGAATTgCCC CAAAAAGcGT TAmCGGTyCA TCCmAATGaA TGGGAAaTCa 1620

TATTACCTGA AAATAAAACG GAACgTGTCA TTTTAACAAC GTGTAgTACC GGAATTGGCA 1680

CAGCCGTTAA AATTAGAGAT TTACTTGAAA AAAGTTTACC TGCGGAAGCA CAATTAAAGA 1740

TTATTCCTTG TGAATATAAT CAATTACGTA ATGCAGAGTC GATCAAAGAA AGTTTTCCAG 1800

AATATGAGAT TGTCGGCATC ATCGGTACAA ACAACCCGTC TTCAAATGAC CTCCCGTATA 1860

TCTCTTTAGA AGAGTTGATT GCAGGAAAAG GAATTACGAC CCTTTTAGAA TGGACAAAGA 1920

GAGAATTAAC TAAAGATATG CTGTCCTATG TAAATCATGA GTTGATTCGT AaCTTTTCAT 1980

TAGATCGGGT CATTCAATCA GTGACGATTT TAGATACAGA AAAAATTATT CGACAAGTAG 2040

AGGTCTTTTT AATTCAATTA GAAGAGCGGT GGCAACAAAC AATTCAAAAT GATCGTAAAT 2100

TAGCGGTATA TGTCCATGTC AGTTGTTTAA TTGAACGATT AATCCGGAAT GAACCTATTG 2160

AAAATTATAA TGGCGCTGAA CAACTAAAGC AGTGTCAAAG AACGGTCTTG CAAGAGTTAA 2220

AAGAGGCGTT TAGTGTCATT GAAAAAGTTT ATAGTGTCAA TATACCTGAA TCGGAGCTTT 2280

TTTATGTCTA TGATGTGTTG TTTGGAAAAA CGGAGTTTAA CAACGCAGAA AGCGATTTTT 2340

AAAATAAAAA GGAGGGAGTG CCACTTTTTT GGCACTCTTT TTGCTATATA AAGAAGTAAG 2400

TGAGGTGAAG AAAATGGAAC GTAAAATTAT TCTGGCAACA CATGGTAATT TCGCTGCAGG 2460

GATTCAGACA TCATTAGCTT TAATTTGTGG TGAAACAAAG AATGTGGAAT GTCTTTGCGC 2520

TTATATGGAA CAACCTTATG ATTTAGCTAA GACGGTCAAA GACATTTTAA AGAACAATCA 2580

ACAAAATGAA TTAATTGTGA TTACAGACCT ATTTGGCGGA AGCGTGAATA ACGAATTCTT 2640

TAATTATGTG GGGAAACAAG CAATGCATTT AATTTCAGGA TTAAATTTAG CAACATTAAT 2700

TGAAATATAT ACCCAAATAA ATACTGTTGA TTCATTAGTT GATTTAGTAA AACGTGCCGT 2760

TGAAAACGGA CAAGAAAGTC TTTGTTATTG TAATGAGTTA TCTAGTAAAG AAATTTTAGA 2820

AGAAGaATTT TAGGAgGAat gAAnTGATTA AATTGTTAAG AGTCGATCAT CGCTTGTTAC 2880

ATGGTCAAGT GGCTTTTTCA TGGACCCAAG GCTTAGGAGC CGATTGTATT TTAATCGCAA 2940

ATGATGACGT ACCTAAAAAT GATATTCGAA AAACAACCAT TAAGCTAGCT AAACCAGCAG 3000

GAGTAAAACT GGTCATTAAA ACGATTGAAG ATGCGATTGC CGCACTAAAA AGTGGTGTTA 3060

CAGAAAAATA CAAACTTTTT ATTGTAGTTG AAAGTGTGGA AGATGCTTAT CGATTGGCTA 3120

CTGCTTATCC AGAAATAAAA GAAATTAACT TAGGAGGAAT GAAAGTAAGG GAAAATACTC 3180

GAAACATTTC TAAAGCAATT AATATCACAC CGGATGAAGA AAAAATGGTT AAAGAATTGG 3240

TCGCAAATGG TTGTGAAGTT GAAATTCGAC AAGTCCCAAA TGACAAAAAA GTAGCAGCAT 3300

TAAATGTCTT GTAGAAAGGA GAAATAAAAA TGATAGGACA AGCGATTCTT TTAGGTTGCA 3360

TCGCTTTTAT TGCGCAGTCG GAATATGCTT TAGGAACCTC GTTGCTTTCA AGACCAATTG 3420

TAACCGGTTT ATTTGTCGGA ATTGTATTAG GTGATGTCAA AACGGGCGTA ATCATGGGCG 3480

CGACCTTAGA GTTAGCTTTC ATCGGTTCTT TTTCAGTGGG CGCATCAATC CCACCTGATG 3540

TAGTGACTGG TGGGATTCTT GGTACAGCGT TTGCTATTAC TGCTGGCGCA GGAACAGAAA 3600

CAGCCTTAGT CTTAGGCTTG CCAATCGCAA CGTTAACGTT GaTTCTTAAA AATATTTATT 3660

TGGGTTTGCT CATCCCAATC ATGAGTCAAA AAGCTGATCA GTATGCTGAA GAAGGAGCAT 3720

ATAAAGGGGT GGAGCGCATG CATTTACTTG CAGGCTTTGG CTTATCGTTC aTGctAGCGC 3780

TCATTGTAAC TATTTCATTT GCCGTGGGCA GCAACACCAT TAAAGGCTTA CTTGATCTGA 3840

TCCCTGAATT TATTCAACAT GGATTGTCTG TTGCAACGGG GATTATTCCA GCGCTAGGAT 3900

TTGCGATGCT TGCACGCTTA CTGATTAACA AACAAGTGGC GCCATATTTC TTTTTAGGTT 3960

TTPTPTTAAT


GGCCTACTTT 


AAGTTACCTG 


TAACAGGAGT 


AGCAATTTTG 


GGTGCCATTA 


4020 


c a rr zi P.T A A T 


TPiTTPTGAAT 

X O 1 1 O 1 1 


ATCATGAATT 


ATGCCAAAGG 


AAAGAATGAA 


ACAATACAAA 


4080 


p a a p a 21 ^ T P P 


flGAGGTAATT 


GATGATGACG 


AAGACTTCTA 


AAACTAAGAC 


AGAAAACTTA 


4140 




AAGAACTTAA 


TCAAGTCTTT 


TGGCGTTCAT 


TTCAAATGGA 


ATTTTCATGG 


4200 


J\nTT5 TP A AP 
Art. 1 1 A 1 Vjr\rl\»< 


fITPAGATGAA 


TATGGCTTAT 


GTCTATGCCA 


TGATCCCTAT 


TTTAAAGAAA 


4260 


1 I Ai A1U<- 1 C 


a aaaapaaga 


GATGGCGGCA 


GCTTTAAAAC 


GGCATCTTGC 


TTTCTTTAAT 


4320 




ATATPPTPAP 


GTTTATTTTA 


GG AATTAATG 


CTGCAATGGA AGAAGAAAAT 


4380 


GT1CI bbAlU 


AAAATTTTPA 
/\rxAA 1111 Ort 


AGTTGGAACA 


ATCGATAGTA 


TTAAAACCTC 


GTTAATGGGA 


4440 


LL All nbL 1 0 


PTTTAPGTGA 


TTCCTTTTTC 


TGGGGAACCT 


TACGCTTGAT 


TGCAACAGGT 


4500 


GTTGGAACTT 


CGTTAGCTTT 


ACAAGGGAAT 


GTGTTAGGGC 


CAATTCTATT 


T T TACT C ATT 


4560 


TTTAACGTCC 


CCCATGTGAT 


TPTTPPTTAT 
loll 1 x n X 


TTACTGAATC 


GTTGGGGCTA 


TAAATTAGGA 


4620 


ACTGGTTTTC 


TCAAAAAGAT 


TP A APP A A AT 


GGCATGATGG 


AAAGTCyCAC 


ATTAGGCGCT 


4680 


TCmATAATTG 


GCyTGATrGt 


PPVArGTPPv 


ATGaCCGCCT 


CCATGATTGA 


TTTAACTATC 


4740 


CCgATTACAA TCAGCgGAAG 


TP. P P a AAA AT 


PPaGTCACGG 


TyCAAAATAT 


CyTTGACGAT 


4800 


ATCATGCCTA AATTATTACC GCTTGCATCT TTTGGTTTTG TTTTTTATTT ATTGnAAAAA 4860

GAAGTAAAAC CTTTAGCGAT TTTAGGTGGC ATGGCAATTG TCGGAATTCT TGGCTCATTA 4920

ATTGGTCTAT TTTAGGAGGC AAAATATGAA AACGATGCTG GATTATrTAA ATGAGGAGCA 4980

aGCAGCTCTA rCTCAGrTTC TTAACCAATT TCAaTTAACT GATACAGACG TTTCAAAGGT 5040

CACGCACTGt CTAATTTTAG CAACTGGCTC GTCACATAAC GyTTkTCAGG CGGCTAAArT 5100

TTACCTTGAA GAAGTGGCGC CTATTTATGT GGAAATCCAA GAGCCGTTCA ATTTTGCTCA 5160

TTATGGCAAA GTGGATGATC GGTTTGATCT GGTCGTAGCG GTTTCGCAAA GTGGTAAAAG 5220

TGCTTCAACA ATTGGATGCC ATTGCAAAAA TTAAAAACCA AACGTCTGTA AAAACAGTGG 5280

CCTTGgACAA GTGaTGTCAC CAGCCCGATT ACGGaGGTTG TGGaTGaAGT CCTGGaTTTa 5340

AAAATAGGGc TTGAAACAGT TGGtTTTGTC ACAAAAGGGt ATACGGcAAC ACTTTTaAAT 5400

TTAtTTCtAt TTGGtTTgAA ACTGGGCTAT CAAAAGCATC AATTAACAAA AGTAGAGGTT 5460

GAAGAAGAAT TAGCAAAACT TnACCAAAGC AATTCAGGAA ATCGATTCGG TGATTTATAA 5520

AACCGAACAA TTTTGGGCAA AAAACGGAAA AAAGGAAn 5558


(2) INFORMATION FOR SEQ ID NO: 12: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
AnATnTGAAT CGTATAGCGA AGCAATTCTC TGATCAGAAT nATTTAACTT CGTTTGGGAA 60
AAATTTCTGG CGCAAAACGT ATAACGGAAA ATCACCCACT CAGGAACTAC TGAAGGTTGC 120

AACGGAGGAT TTAGTACAAC ATAAAATGAT TAAACAGCTG GCAACGGAGT TAGAAATAGC 180

CCACTCACAA ACATTTGGGc AAGAAAAAAA AGAATGGCAA GACCAAAAGA GCAGTTTAAC 240 

GCTCTGGCAA TTTTTAGATG CAAAAGATCA ACAATTACAG GACCAAATAA AAGAGAAATT 300 

AATGGAGAAA GAAAAGCCAA CGCAAAAAGA ATTGCGGcAG GCCTTTGAGC AATTGGATGA 360 

TAAATwTAAA AAGACCGATT ACTTTGTTGA AGCAATTGAA ATTCCAAATT TTTCAGGTAA 420

GCAAGCAGAA TTAGAAAAAA TAGCCGAAGC AATCTCTCCG AATCTATCTT ATGAGGAGAC 480 

GTTATTGGAG TGGCAAAATA AATTACCTAA TCTTGTTATT GAATCGTATC AATTGAAGAG 540

TGCAGAAATC CAAAAAGAGG ATATTTATTC GTTGTCTGTT GGGGAGATTC TTAGTGAAAA 600 

AGCTGTTGGT ACCGTGGTGA AAGGATACCA CGAAAACCAA TACTTTTATA TTTTTAATAA 660 

GGAAGGAGGG CAACTTTTAC AGTTTGAAGA AGCACCACAA TTTGGCAAAA ATGCCTATAT 720 

CAACACACGC TACAAAGAGA AaTTAGCTAC GTATCmACmA GCAACCAAAG TAGACTTGTT 780 

GGAGAAAGAC AGAGAAAAGT TTTTTCAAAA TTATCAAAAT AAAAACAGTT AAGGAGCAGA 840

AAATGAGAAA AAAAGCGTAT TTTAAAAGTG TAACCATCTT CCTTTCTATT TTGCTTGTTT 900 

TTTCCCAGCT TCAATTATTT AGTCTTCCTG CTTATGGGGA AACAGTGAGT GATCAACCGT 960 
TACTTTTTCG TTCAGTAGGT GTCGCTCAAA GTGGGACGAC CTACTATGTG GATGGGGAAG 1020

GAGGAAATAA TGCCAATGAT GGGCAGTCGC CAGCCAGTGC TTGGCGTGAT TTTGAAAAAG 1080 

TGAATCAGAC GGAGTTTCAG CCAGGCGACC ATGTTCTGCT GAATGCGCAA AGTACCTGGA 1140

ATAACCAATT GTTGCATCCA AAAGGGAATG GGACAGCGGC ACAAAAGATT GTCATTGATT 1200 

TTTATGATAC GAATGACAAA GGAGAAACCA TTTTTGAAAC AACACGGCGT CCGATTATTA 1260 

ACGGTGGTGG CACCTACAGC ACAGGGACTT TCAAACGTGC AATCTCAGGG GCCGTGCAGT 1320 

TAGTGAACCA AGAATATTGG GATATTTCAA ATTTAGAAGT GACAAATACA CCAGAGCTAG 1380 

ATAATCTCGA AGGCTATAAG AAACCAGGAG ATGCTCAAAG AGCGGGGATA TTAGTACTAG 1440

GATATGAACA AAATCGAACG TTCAATAGTG TCACGATAAG GAATAACTAT GTTCATGATG 1500 

TTCAAACAGA ATATTATTTA AATCTAAGTG GGAATACTGC AACCAAACGA TTAAAAGCGG 1560 

TAGGTGGAAT TATTGTTTTA GGAAGTTGGT TTGATGAGAA TGGCAATGTA GTAACGGCAG 1620 

CTAATGATCA TCGAACAACG ACTGGATTTA ATGATATTTT GATTGAAAAT AATGTGATTC 1680

AACGAGTGGG ACTTGAAGGA ATTCGAACAA AAGCTGATTC AGATACCtCA AGAGGCAACA 1740

CTTTCTACAA AACGTTTTCT AATATTACGA TTCGTAATAA CTATTTAGAA GATATTGCTG 1800 

GCGACGGGAT TGTCTTGTCA GAAGCCAAAA GTGGCGGTGT CGTCGAAGGA AATGTTGCTG 1860

TCCGAATGTG CAATGCCGAT TATGGTACAC AAAATTACGC GGGCGTTTGG GCAATGTCTG 1920 

TAGATGACGG ACTGTTCCAA TATAACGAAg TATACGGAAT CAAATATGGA TTTAACGATG 1980 

CTGAAGCTTA TGATGTGGAC ATGCAGTCAA ACAATGTTAT TTATCAATAT AACTATTCAC 2040
ATCATAATAC TGGCGGCTTT TTATTGCTTA TGTCTGATCA AAAAAATTCG GTTATTCGGT 2100

ACAATATCTC AGCTAATGAC GGCGGTGGTA ACCGAGGAAC AGGTAAAGAC AATCCAGGCG 2160

GAGCTGGTGG CTACAATTAT AAAGAACAAA GCATTTTCCA TTATTGGGTG AAAAATGATG 2220

GAGCGGCAAT GCCAACTATC CACAATAATA CGATTTATGT TGGCGACGGT ATTTCAACAT 2280

CATTATTTGG TGAAGGAAaT AGTTCTGACA ACAGTGGTAC AGTAGCCAAT TTTTACAATA 2340

ATATTTTATA CAAAGAAGGT ACCGGACAAT TAAAATTCTT ATCTAATTAT CCAACAAATG 2400

GGACACAACC AATTGAACGT AAAATGGTCG ATAATCCAGA AAAATATTTC AAAAATAATG 2460

TTATTTGGCC AAAAGAAATT GCTACAGAAA AAAGTGGTGC AACTGTTGAA AAATTAGTAT 2520

CAAGTGGAAA CATTTTTGAA AAACCACAAT TAGAAATCAC AGATAACCCT GAAAAAGTGA 2580

AAGAGTTGGC AGAGCAAGAA TTTACGACAT TGAAACCAAC AAAAGATAAT GTGGTTGAAT 2640

TTACTTCTAA AGAACGCTTA CGTCAGCGAG CACAAATGTT CCGATTAAAA GAAAACTCTC 2700

CAGCGATTGG TAAAGGCTTG TCGGAAGTGA ACAGTCCGGC CGAAGACTTC TTTGGAAATA 2760

GCTTGAAAAA TAAAGTTTTA GATATTGGTG CGCAACAAGC ATCCACAATT GAAAAAAGTA 2820

TTCGTTATCA AAATCAAGTG TTAGAAATTA GTTCGGCAAC AGGTGTGTAT CCTAATTTGC 2880

CAGAGCAAGT TGAATTAACG TACGAAGAAG TCGTTAATGA AGAAGTGGTT GCGACAGGCA 2940

AAAAAGAATT TAAGTTACAA TGGGAAGCAA TCCCTCAAGA AAAAATTAAT ACAGCAGGCA 3000

CAGTTGAAGT TGCAGCAACA GTTATCGGCT TACCGATAGA TGCTGTCAAA GTAACAGCCA 3060

AGGTCTCTTT TGAGGGTGAA CTTGGTGAAG GAAAAGACAC CGTTAAACTA AAAACAGCTC 3120

AAACTGCCTA TGTTCAAAAG AGTGATGGTA ACCGAGCATA CTCTGCTATA GCAGGCGGTA 3180

CCGCTGCCAT TAGTTCaGGA GACGCCTATA AATATCCTTA TGGAGTAAAC TATACGGGCA 3240

ATTACGCATT AAAATTAAAA AATGCTAGTT CTGCGGGGTA TAATCGCCGG ATTTACGTTG 3300

AAATTGATAC ACAAGAATTG AAGAACTATC AATCACTAAA AAGTGCCAAT TTAGAACTAA 3360

ATGTGATGCG TTATGATGCG TGGAATGGTG CTGGAAATAC AAATGATGAA CGTTTGAAAA 3420

ATACGCAGTT CCAAGTAGAT GTTTATGGAa CAGATACTAA TTGGATGAGT AACACCATTA 3480

CTTGGAACAA TGGTCCAAAT AATTTAAATG TACCAAATGA AGAATTTATT GCTCGACAAT 3540

CTTTTACAAA TAGCTCAATT ATGAATAATC AAAATACAAT CTCTATTGAT ATTAGTAACT 3600

ATCTACGGAA GCTAATTCAA AGTGGTGAGA AAATTCCAGC CAAGTTAAGT TTCTTATTAG 3660

CGATTAGAGA CAGTCGTTTG CCTGGTTATG ATTCTGATAA TGCTGGGTTT GATGCTTTTT 3720

CAAAAGAAGG CGCACAAAAG GCCTATCAAG ACTTTTTAAC AGGCAAATTG ACCTTGCCAA 3780

CTGGTCAACA GCTAGCAGAG GATAGTTTAG CTCCTAAAAT TGTTTTATCG AATGTTTTCG 3840

AAGTCAAACA TGAGTCGATT GAGGTCACGA CAGAAGCGGG TCAAGCCCCT AAACTACCAG 3900

AAAAAACAAC TATTTTTTAT TCGGATGGTT CACAACGAGA AGTGACTGTC AATTGGTCAG 3960

AAGTTCCAGC ATCTTCTTAT CAAAAAGAAG GAATATTCAC TGTTGTAGGA CGAGCAGCAG 4020
G7G7C7CAAT SCCAATTATT GCCAA7GTAA AAG7CACCGC TAAACATATT GTTGGATTCA
AGGAAT7GCC 7GCGC7GGAT CGACTAACAG GCACCTCGCG AGGCGAACTG AATTTGCCAA 
C7GAAGTAA7 7GCMAATTA GATGATGGTA GCGAAAC3AA ATTGAAGGTT ATTTCTTGGG 
A7GACGA7GT TAGTAAT7AT TCGCCAAGTA GCCCGCCAGG TACTTACCAG TTCCCTGCTG 
CAGTAGAAGA AAAAATAGGA ATTGCGAATC C7GA7GAACG AAAAATCTTC CAAGTCGTGC 
AAACTCA73C GATTCC'.GAA AGAATTCAAT TTGCTACAGA AACAGCCACT ATAAAATCAG 
GCGAAAATTA 7CAAA7TCAA TCTAAAGTGA 77GG7CAAGC GCCGCACACA GAAACGGATG 
C77GGAGC7C GCAAGTAACG TATGAGTTGG 77ACACCAGA TGCTGGCAAT ACAGTAAGTG 
TAGATGAAAA CGGTC73ATT AGGACAGAAG CAACGACTGC TGCAGGAAAT TATCAAATCA 
AAGTAACAAG CAAAGTTTTG CCGATAGTGA CTGCACAATT TTCAATTAAA GTCACAAAAT 
AAACGAACTG 7TTGAAGAGA CTGGAACAAG AATTGCTGTA CCTTGCAATC TTTCTAGTCT 
CT7CGA77AT CTAGAATCAA TAAATATTCT A7TGCCTACT ATTCATTCTT TTTAAAACAA 
GGTAAAA7AA 7AAATGAAAG TAAACGATAA AGGTGTGTAG ACAATGAATA CTATCTTAAC 
AGGTGATCGT CCAACTGGTA AATTGCACCT AGGACATTAC GTAGGTTCCT TAAAAAAACG 
AGTAGAAATG CAAGCAGACC CTACAAATCA A7TATTTGTC ATGATTGCAG ATTTACAAGC 
ATTGACAGAT AAT GCAAGAA ATCCCGAAAA AGTTTCAGCC AATGTTTTAG AAGTTGCTTT 
GGAC7A777A GCASTTSGTT 7AGACCCTAC GAAAACAACA A7TTTTATCC AA7CGCAAAT 
TCCACAG7TA GC7GAATTAA CAATGTAC7A 7CTAAA7TTA GTGACCACGT CACGTGTTCG 
TCGGAA7CCA ACGG7CAAAG CAGAAA7TGA GCAAAAAAAA TTTGGGGAAG GTGTTCCGAC 
AGGATTC7TT A7ATATCCCG 7T7CACAAGC AGCTGATA7T AC7GCCT7TC AAGCGAATTT 
AGTCCCGG77 C-GGGAA3ACC AAAAACCAAT G7TGGAAC-AA GCGCAAGAAA T7GTGCATAG 
TT7TAA7CAG ACGTACGGTG AAGTTTtAGT 7AGACCGGAA GCTGTGTTAC CACCAAAAGG 
CATGGGACGC 77GCCAGGAA 7TGATGGCAA CGGAAAAATG AGTAAGTC77 TAGGCAA7GG 
TATT7A7C7T 7CAGA7CCAG CTGAAGTGGT ACAGAAAAAA GTAATGAGTA TGTATACCGA 
TCCAAATCA7 ATTCGTATAG AGGACCCAGG ACAAG7TGAA GGAAATATGG TCTTTACTTA 
CTTGGA7GTT 7TTGGGAAAA A7AAAGAATA TATTGAAGAA TTAAAAGAGC ACTATCGCCA 
TGGCGGTTTA GGAGATG7GA AAATTAAACG T7ACTTAATT GATGTCCTAG AAGAAGAACT 
CGCACCGATT CGTCGACGTC GTGAAGAGCT GGCTAAAAAT CCAGAAGCGA TTATGGAAAT 
GTTGCATAAA GGAAGTCTTG CTGC7GAAAA AG7GGCAGCA CAAACCTTAA CAGAAGTCAA 
AAAAGCAATG GGAATTCAAT A7TTCTAAGC AA7AAAAGAA CTGGAACGAG TCAAAAGAAG 
AATGACTTG7 7CCAG7TCT7 7TTAAGTGGT 77GAAATTGA CTATTATATA AGCTGGCG7A 
AAGAG7CGGA GC7TG7AA7A GTTCCTGATG CGTTCCTT7T TCAATAATAT CGCCATTTTT 
CATCAC7AAA AT7AAATCAG CA77TTCAAT 7G7GGATAA7 CGATGAGCGA 7AACGAAGCT 
TGTTCGATTT TCTGTGACAG TTTCCATCGC TTTTTGAATA TGGGCTTCTG TTCGTGTGTC 6060

CACACTCGAA GTTGCTTCAT CGAGAATAAC AACGGGCGGA TTTGCTAAAA TGATTCGGGC 6120

GATGGTTAAT AACTGTTGTT GCCCTTGTGA TAATGCACCA TTTTCGCTAG AAATAATTGT 6180

GTCATATCCT TGAGGAAGGG TCCGAATAAA ATGATCACAT TGAGCAATTT TAGCTGCTTC 6240

AATTATTTCT TCACGAGAAG CATCTTTTTT TCCATAGGCA ATATTATCTG CTACGGTTCC 6300

TTCAAATAGC CAAGTGTTTT GTAAAACCAT GCCAAATAAA TTTCTTAGAT TTTGTCGAGA 6360

AAGTTTCGTA ATATCAATCC CATCAAAAGT AATGGCACCT TGATTTATTT CATAGAAACG 6420

CATTAATAAG TTGACTAATG TTGTTTTACC AGCGCCGGTG GGTCCCACAA TGGCCACTGT 6480

TTTTTTCGGT TGAACAGAAA AATCAACATT CTTCATTAAA ATTTTTTCTG GTGTGTAGCC 6540

AAATTGAACA TTTTTAAATT CAATGGCTCC TTTAGGAGAA GAAATAGTTT CTAAATGAGT 6600

TGCTTCTGGC TGTTCATCAG CTTCATCTAA GATAACAAAA ATCCGATCAA TGGAAGCCAT 6660

CGCTGCTTGA ATTGAGTTAA TGACATAAGA AGCGGTAGAA ATCGGCTCAG AAATTTGGTT 6720

GATATATTGC AAATACGCTT GCAAGAAACC AATCGTAATA CCACCAGATA AAACGAGCAT 6780

TGCGCCTAAG ATAGCACTAA TAATAAAAGC CAACTGATTA ATAAAACGAA TAGCTGGATA 6840

AATCGCAAAA TTTAGAAACT GTGCTTTTTT AAAGGcACGA TAATGTTGTT GATTAACAGC 6900

ATCAATTGTT TTTTCAGCAT TTTGCTGTTG ATTAAAGGTT TTTGTGACCA AATTTCCTGC 6960

TAAATATTCT TCCATCTTAT TATTTAATTG ACCTAATTCA GCTTGACTTT GATCAGCAAA 7020

CACCTTGTTT TTATTCGCGA TTTTTGTCGT CATGAAAGTA CTGCCGCCAA TTAGGAGCAA 7080

CACTAAAATG GTTAATTTTG CATCAATATA GAACAACATG ATTCCTGCAA AAAGGATAGT 7140

CACGACGGAA GTGAAAAATT GGTTGATGCC TGTTAAAAGA ACTTGTGACA ATTGGTTTAA 7200

GCCAGTTGTT GAACGACTAA TGATATCGCC CACTTGGTGA TTGTCAAAGA AGGCCATTGG 7260

TAAAGTTTTA AACTTTTTTG TCACTTCTTT TCTAATTCTT AAAGTAACCC GTTCGCTTAA 7320

AGAAGCCATT GCTCGTTCCT GAATAAAAGA AGTGATACTA CTAATGATTG AAAAGAGAAT 7380

TAAAAGTAAG ACGGGACCTA AGAGCGCTTC TTCAACTAAA GGAAGTGTCA TACCTTTGAG 7440

CCCGACACGC TTGATAGCTT CTAAGAGGTT ATCAATCCCG ATTGCCATAA TATAGGTCAT 7500

GGCAACAATT AATAAGTTTC CAATTAAACT GCAGATCATT AGACCATAAA AGATTGGATG 7560

CTCAGGTTTA ATCATTTTCC AGAAACGTTT AAAGCTTTGA AGATTCATTG TTTTTTTCAT 7620

TGTATGGCTT GAAAAGAAAA CTTTCTTTCA AGCAGTCACA TCCTTTCTTT AATTACTTGG 7680

GTAAGATACC TTGCGATTTA GCAAAGTCTT GGTAATAGCT ATTAGTGGTA AGTAAATCAG 7740

CGTGGGTGCC TTGACCAACA ATCCTTCCTT CATCTAGGAC GATAATGTTG TCAGCGTTCA 7800

TGATTGTACT TAACCGTTGA GCAACAATGA GTAAAGTTTT GTCCGACATT TGTGCATGTA 7860

AAGCGGCACG TAGAGCGGCA TCAGTTTTGT AATCTAATGC GGAAAAGCTA TCATCGAAAA 7920

TATAAACGTC TGCCGGTTTG ATTAAGGCTC GTGCAATACA CATTCTTTGT TTTTGACCAC 7980
CAGAATAGTT GGACCCGCCT TGTGCTACGA AACTTTCAAT CCCTTGTGGT AAGGAATCGA 8040

TAAATTCAGA AGATTGTGAA ATTTCTAGTG CCGTTCTTAT TTCTTCTGTA GTTGCTTTGG 8100 

CATTTCCCAT TAATAAGTTT GATAAGATTG TCCCACTGAA AAGAAAGGCT TTTTGTGGCA 8160 

CATAACTGAT GACTTGGCGA ATCGTTTGCT GAGATAATGA GCGGATATCT GTGCCAGAAT 8220 

AGCTAATCGT GCCGGCTGTG ACCTCATTTA TTCGTAAAAG TAACTTAACT AAAGTACTTT 8280 

TACCAGCGCC AGTTGCACCA ACAATCGCCG TTGTTTTTCC TTTAGGAATG ACAAAACTAA 8340 

CATTTTCTAA AACAGGATCA GCTGTTTCTG TGTATTGAAA CGTGACGTGA TCGAAGCAAA 8400

TCAAGGCTTT CTCAGGATGA TAAGCAATGG TTGCCTCAGT ATGAGGATCG CTAATTTCAA 8460

TTTCTTCTGA CAAAACTTCT TGCAAGCGAT TCAAGGAAGC AACGGACCGT GGAATGACAA 8520

CTAATACCAT AGCCGCGATC ATTAAGTAGG CCAAAGTTAG AAGCGTATAT TCAATCACAG 8580 

CTGTAACGGT GCCAATTTGT AGGGTGCCTA CTGCAACTAA ATTACCGCCA AACCACAAAA 8640

TAGAGGCGTA CACAACGCCC ATTAATAAAA AGGCAACGGG AGTAATCCAg CAAAGATTTG 8700 

ATTCACTTTA CTCATGCGTT CAGCATAATT TTTAAATGTT TGATTGGTTC GCTGCTCTTC 8760

AAAATCTTGA TTATTGAACG CACGAATCAT ATTAATTCCA GTAAAGAACT CTCGTAAAGT 8820 

CGTAGTAATT CGATCCATTT TTGGTTGAAT TTTTAAAGAT AAAGGGGTAC CTTTTTTCAT 8880 

TAAAAAATAA ACAACGACTA AGTAAAATAC GATGGAAGCC AATGTTACCC AAGCGAGTGA 8940

AACTGAATAA GAAAATGTCA TCACTAAGGC AAATAAACTA ATAATAGGCG CTGGAAAGAT 9000 

TAATTGGCAA AACAGCACAA TCATTGTTTG AACATTATCT ACATCGT 9047 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

TTGTAATTTT ATTGGGTTAA CGTTATTTTT TGGTATCGGG ATCTTAGTTG CGAATGGCGC 60 

CAAAATTGTA GCGAATAAAG GGAAAAAGAG TAAGGGgACC ATTCTATTAA TGGTGATGGA 120 

ATTGCTCTTG tTAGGTATCT TTTTATTTGT GaCTTTGCCa AGTAAAAATC TTTCaACCGT 180 

GTTATGGATT AATCTGATTG GCGCAGTGAT TGCTAGTGGG GTGTTATTAA GTACACGCTT 240

AGCTAGTAAA GTCGAAGTTC CTGGTATCCA AAAGGGCCGC AAAGAAGTAG GGAAGAATAC 300 

CAACAAAACG GCTAACAAAG ACTGGTCTGG CAAGATTTTG GGTGTTGCCG TTTTGGCAAT 360 

GATTTTGTTG ACTATTGTCA GTAGTATTAC CAGAATCTCT TCTATTGATG AAATTTATCA 420

AACGATTCCT TTAAAAACGG AAGAAAAAGC TGAAGTTTTA ACTTCGGCGA AAGAAACACC 480 

CaTTGCGATT GCACCACAAA CGGCGAAACG AAAAATGTTA CAGAAATTTT CTGtCATTCC 540
CAATTcCAAT


ATGTTTACTT 


ni y - * _ m ^ TV fry 

TGGaTGGtAT 


AACGGCACAA 


G 1 1 b 1 LrtHi^ 




600 


TTATGTAGCm 


ACAGTCGAAT 


TTAATGGCx y 


prpmmjv TV T\ rp^/"" 

CTTTAAATGG 






660 


TGGCTATTTC 


ATCATTAGTG 


CG AC AG A I Al 


L AAG G G AL AG 


OLAuAAl 1 lu 


•rZXriA AAAACC 


720 


AATTGTTTAT 


ACTCCTTCAG 


CTTACTTTGG 


C AAAG A I GCC 




111 n\~ x V^^ov- 


780 


TTATCCAGGT 


TATGCAGCAA 


CGGGCACAAT 


C AA1 T 1 AGAG 


L. 1 bGALoALL 


rtnubonn 1 


840 


TTACTATATT 


CAAACGCTTT 


ACAAAGAGTA 


ni /"* f~* n"* rp rfi v"! rp 

TGGGG I 1 1 G I 


UG 1 LobAlbC 


AL i A 1 rvri 1 Vj A 




ATTTAAAACG 


GCAGTATTGA 


ATGCTACAAC 


AGGAGAAG 1 1 


AAGGi 1 inlu 


ALAu 1 LAAAA 




AGCCCCAAGT 


TTCGTTGATG 


CGCCGATTAC 


TAGCTGAGGG 


GGG AA I AvjL 1 


lAAAlVjAAi 1 




CTTTGGTCGT 


TACAGTCAAG 


rr> /"* m y y tv ta 

GCTGGTGGAA 


CCAGACaATG 


T I i GG IGLvA 


AAAAALj A ibl 


1UOU 


CAAAATTCCA 


ACTGAAAATG 


GTATTTATGC 


■n m /~1 TV ^% TV ^ T\ TV 

ATCAGGACAA 


ATTALGGGGA 


TP ATT A APR 7\ 

I GAi GAALAA 


X X ft u 


GGAAGGGAAC 


CAGTTGCTTT 


ATTTTACGGA 


CTTTACAAGT 


AG T G AAAAAG 


AGGAAGAI IG 




CGCTTTAGGT 


TATTCCTTAA 


TCAATGCCCG 


AACAGGCCAA 


m m *r\ m TV rn 

GTTACGTATT 


ATCGTGAIAG 


1 £DU 


TAAAGTCGGC 


ATTATGGATA 


GTGATGGCGC 


CATTTCTATC 


/-» ^ m rn ^> TV TV TV 7\ TV 

GCTTCAAAAA 


TTTATCCAGA 




AAAGAAATGG 


AAAGCTAGTA 


TGCCAGTCCT 


TTATAACATT 


GATGGCG 1 GG 


GAAGG 1 uljft 1 


1 JOU 


TGTTTCGTTA 


ATGGACAGTA 


AAGGGATTTT 


CAAAAAATAT 


/■» m m TV rn TV rp 7\ TV 

GTCTATAT. AA 


AGGGGG 1 AGA 


i 4 a n 

X fl fl u 


TAATGATATT 


GTAATTGATG 


CAGACACGGC 


ACAAGGGGGA 


1 lAwACoLl 1 




1500 


GTTAGTAACA AAAGGAAGCA ACAATGAAAA TACAGACAAA GCCAACTTGA AAACGATGCA 1560

AGGTAAAGTA GCTCGTGTCT CAGTAGTAGC TGGGGAAGCG CAAACGGTAG TTTCATTTTT 1620

ACTAGAAGAT GAAAAAaCCA TTTTTAGCGT AACGaCTAaT AACAGTcCcT GGCGCTATTC 1680

TTAAAaGAGG GCGaCCAAGT GACaTTTAAA GCCGTAGtGA CTGAAAATGC TAAAGCAGCG 1740

AATGTTGAAG A 1751



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3928 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CGTTTATTTT ATCAAACCAA AATGGATTTG TAAAACATTT CACATGTTAT TTAATAAACG 60 

ACTATTACTA GAAAAAAGAC AAGCCAAGTT TCCTTTATTT AGGAAACTTG GCTTGTCTTT 120 

TTATTCTTTT TACTTCGTTA TTTTTTGATA GTTTGCGAAT AATTGATTCG CATCTCCTTC 180 

TCCATCAATT TTCTTGAGAT GGCCGTTATC TATTTTAAAC ATAGATGGCA CATGTAAAAC 240

TTTATTTTCT CGGATAAATG TTGACCATTC CTTCATGGTC TCtTTGTTGG TTAATTCTAC 300 

ATAGTAAACC TGTTTAGGTG CTTGTAATAG TTTCTCGAAA TCCTGACAAT CTGAACAGTT 360 

TtTCTTTTTT a^AACAACAT AAAATGTTTC GCCAGAGTGT TTTTTTTCTT GATACTCATT 420
GAAACTTAAT TCCTTTTTAT GCGGAATAAA TGTAAAACCT ACCAACAAAC TAAGTAACAA 480

AATTAACAGA CTGAACGAAA TTTTTTTATT CATGTTTTTT CTTCCTTTTT AGTTTGTCTA 540

ATTTTTTCTT TTTTTATTTT TTGTTATGCC AAAGCTAATA GCCACTATAA GGACAATTAA 600

ACCTGCGATC ATGTAAAGTG GATTTACTGT TGTATTAGTC ATTGGTAAAT GAATTCCTTT 660

ACTTGCTGaT TTTCCAAAAT TGaCTGGGTT CGTTGGTGCT TGATGATTTG TAGGCGTCAA 720

CGGTTTTTTC GGTTTTAGCG GTTGTTTAGG TTCAGGTTTT TTTGGGTCTT CTGGCGTTGG 780

TGCAGGCGGT GTTACGGTTG GCTTGTTGGA ATGCAACACG TCACCTTCGT TGCCAAAGTT 840

TAAGTCGGCT TGGTTGGGAA TCCCGCCTTG TTCAATATAA GGCGCTAATT CTTCATCCGT 900

TGCGTCAGTT TTAATTTTAG TGGTAATAGT CATTGTGTAT GTATGACCAG CTAAGTAAGA 960

GTAGCTGTCA TCTTTTTTGT TCATAGTAAA AGTTACTTTG TTATTTTCTT GTGTTACTAT 1020

GCCATTATCT GTAACATCTT TGCCATTTTC mTCAGwAACT TTCACGTCTG TGATGTCTAA 1080

CACTTTATTA ATGTCATCCA CCATGCTGGC TtGGGTCCAT GTGCTTGTTT CGTTACCGAA 1140

AGCTGTTTtG ACGTTCCATT TAAATTCTTG GTCACGGTTG GTTAAATCTA AATGTTCTTG 1200

GCCTTCGATA TCTTTTGTAA TCGTTGGATC TTCTGGcGTT GGTGCAGGTG GTGTTACGGT 1260

TGGTTTGTTG GAATGCAACA CGTCACCTTC GTTGCCAAAG TTCAAGTCGG CTTGGTTGGG 1320

AATGCCACCT TGTTCAATAT AAGGTGCTAA TTCTTCGTCC GTTGCGCTAg CTTTGATTTT 1380

AGTAGTAATG GTCATTGTGT ACGTATGACC ACTTAAATAG TCATAGCTGT CkGCtTGkTT 1440

GtTCATTTCA AAAGTTACTT TGTTATTTTC TTGTGTTACT TTGCCGTTAG CTGTAACATC 1500

TTTACCATTT TCATCTGTGA CTTTTACATC AGTGATGTCT AACACTTTAT TAATGTCATC 1560

TACCATGCTG GCTTGGGTCC AAGTGCTTGT TTCGTTACCG AAAGCTGTTT TGACGTTCCA 1620

TTTAAATTCT TGATCGCGGT TGGTTAAATC TAAATGTTCT TGTCCTTCTA CGTCTTTAGC 1680

AATATTTGGA TCAACTGGCG GTGGTGTTAC GGTTGGTTTG TTGGAATGTA ACACGTCACC 1740

TTCATTGCCA AAGTTTAAGT CGGCTTGGTT GGGAATCCCG CCTTGTTCAA TGTAAGGCGC 1800

TAATTCTTCG TCCGTTGCGT CAGTTTTAAT TTTAGTGGTG ATAGTCATTG TATACGTATG 1860

ACCACTTAAA TAGTCATAGC TGTCTGCTTG TTTGTTCATT TCAAAAGTTA CTTTGTTATT 1920

TTCTTGTGTT ACTGTGCCGT TAGCTGTAAC ATCTTTACCA TTTTCGTCGG TGACTTTCAC 1980

ATCAATGATA TCTAGCACTT TATTAATGTC ATCCACCATG CTGGCTTGGG TCCAAGTACT 2040

GGTTTCGTTG CCAAAGGCTG TTTTGACATG CCAATCGAAA CTATCTTCAC GATTGGTTAA 2100

ATCTAAGTGT TCTTGATTTT CGATATCTTT TGTAATCGTT GGATCTTCTG GCGTCACGTC 2160

ACACGCATGC GCATTGTCAT TTTTAGGAAT CGTCGCTTCT GGAAATTCCT TAGCAATTGT 2220

TCCTTGATTC ATTGTCCCAC TACTTACAAG GGTTGCTGCA TCAATCGCTG TATTTTCTTT 2280

GACTTCGTAG TGAATCGTGA TGTTTCCTAC GTAACCTTCT GTACTAGTTG CTGAAATGGT 2340

TTGTCCGTTA ATTTGAACAG GCACATCATT TCCAGCATCG TCCGTAGCGG TCGCAGATTG 2400
AATATCAAAT TGATTGGCAA TCGTTAATGA AGCTGTTGCT GGTGTCGATT GCGCCAGACG 2460

ATCTTTGACA ATTTGTTTTA ATTGGGTTGT AAAATCATCA ATAGATTGGC TTGTAATAAA 2520

ATCTTCTGGT GTGCTAGACC CTTGTTGCAA CTCTTGTTTT ACAAAAGGAC CCACTTCTGT 2580

TTTATATTTA TCAAAGTATG AATTCACTGA ACTTAAAGAT TCAACACTTT CCCAATACGC 2640

ATTAATCATT TCATAGCCTT GGTTAGTAAT TTCTTGGTTT AACGCTAAAA CTTCTGCTGC 2700

TGCACCTTGG TAGTCATTAC TATATTCCAC TGAGACTTGA AGAGGATGTC TTGGATCTGG 2760

ATATTCATTG ATTGAATCAT TGGTATTGGT CTTATGCAAG TAACCATCTA AACGTGTATT 2820

AGCGACCCCA TCTGTCACTA ATAGGAAATA CGTTTTTCGA TTCGTTAAAT CTCCGTGTGT 2880

TTGATTGTAC GTATCTAAAG CGAGTTTCAA TCCTGGGGCG GTTGGCGTAC CACCATACGT 2940

CCGAACGTCT CCAAAACCAG AGACAAATTG GCTTTTATCA TAAGTCAATT GCGTATTGAC 3000

GCGCACATTC ATATCATAAT CAGCTGAATT AATTTTTGTC TTTCCATCAG GAAACATAAA 3060

TTGTTTTCCG CCGCGATATG AAGCCAGCAT CACGCGGTCT TGATCAGATA AGCCTTGAAC 3120

CACTTCATCA ATCGCTTGTC TTACATGTGG AAAATTATCT GAAAAACTAC CACTAGCATC 3180

TTCAACAACT ACTAAATCGA CTGGTTCCGT TTCACCCGCT TGAACTGGGA AAGTAGCTTG 3240

TCCTTCCAAT ATCCGTTTAC AATCAGCCAG ACTGTCGTCT TTCACTTTCA CCGTATTGGT 3300

CATTCCTTCT GTATCTCCTG CTTTAACCGC TTCTTCAGCT AAAGCCAATG TGGGACTGAA 3360

ATTTCCCACA ACTGTTCCAG CGACAATCAC CATTGTCGAT AATAATTTCG TTGCTTTCTT 3420

AAACATTTTA ATCCTCCCTA TTTTCTTTCG CTAGTGTTTG GGACACTTGC TTCTCTTTAC 3480

TTCAACTACA GCTCCTTAAA TCTTTCATCA AGTCGATGTG TCTCCTTGTT TTTGTGACAC 3540

ATCTCTTACT CAATTTCTGA TTCTATCACC TTTAAAGGCT AAAACTTGGC AGTATTCGGC 3600

TCTTGAAAAC GTAAAATTTG TCTTTTAAAA ATTAATGAAA AGACTATCCA CTGACAAAGT 3660

TGGTTTTCAC TTACTTTTTG CTACTTGATT CCTTTTAAGA CTCTTTTTTT ACACAAAAAA 3720

AGTAAAAGAA CCTTTACAGT CCTTTTACTT AATTGTTAGA TGCTATTCAT AATTTTCTTA 3780

GTTTTGTTCT ATAAGCTGAA TAGTTTAGCC TGTTTATTTC AACAATATTT GACGTTAGTC 3840

AGCCGTTTCG TTATAGTTAA AGAAAAGGCT TTTTGGTAGG AGTGAATACG ATGAATGAGT 3900

GGCCCCATCA GTTAATGACG CTTTTAAC 3928



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5259 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGTTAGGTAC ATTGCTGGTT AATGGAACAA GTGAACAGTT ATATGAAACA AaCGAmaCCA 60
TTCACTTGGC TTTTCCAGCT GAGAAATTAG TTATTTTAGA AAAATAAAAA CCATTTTGGA 120

GGAAGATTTA AAAATGAAAA AACGCGTAAT TTTAGGGACA TTAGTCGCTG CAACGTTATT 180

AATGACTGCT TGTGGAAACA GCGAAgcAAC TACGAAAAGC GAGAGCAAAG GTGGAAGTAA 240

TGCTTTAGTC GTTTCAACTT TCGGATTAAG TGAAGATATT GTCAAAAAAG ACATTATCGC 300

TCCATTTGAA AAAGAGAATG AAGCGAAAGT TACCTTAGAA GTAGGCAATA GCGCAGACCG 360

CTTTACGAAA TTAAAAAATA ATCCCAATGC GGGAATTGAT GTCATTGAAT TAGCACAAGC 420

AAATGCAGCA CAAGGTGGAA AAGATGGGTT ATTTGAAAAA ATTACAGAAA AAGAAGTACC 480

TAATTTAAGT CAGTTAACGC CGGGAGCAAA AGAGGTTTTT GAAAGTGGTG CTGGCGTACC 540

AATCGCTGTA AACAGTATCG GGATTGTTTA CAACAAAGAA AAATTAGGCA AAGAAATTAA 600

AAACTGGGAT GACTTATGGT CAGCTGATTT GAAAGGTAAA ATTTCTGTTC CAGACGTTGC 660

CACGACGGCA GGTCCTTTAA TGTTATACGT TGCTAGTGaA CATGCTGGTC AAGATATTAC 720

AAAAGATAAC GGGAAGGCCG CTTTTGAAGC GATGAAAGAA TTAAAACCAA ACGTTGTTAA 780

AACGTATTCA AAATCGTCAG ACTTAGCyAA TATGTTCCAA TCTGGTGAAA TTGAAGCAGC 840

TGTGGTTGCT GATTTTGCGG TTGATATTAT TCAAGGCGCA CAGAAAACGT GACGTATATC 900

GTTCCAGAAA GCGGCACGTA TGCCAACTAT AATACAGTCA ATATTCCTAA AAAAGCACAA 960

AACAAAGAAA CCGCCTTTAA ATTTGTTAAC GCACGGATTA GTGAAGAATC ACAAAAGGCC 1020

AAAGCGATTT CGTTAAATGA AGGACCTACA AATCAACAAG TAACGTTAAG CGAAAAAGAA 1080

GCGAAAAATA AAACATATGG TGCCATTGCC GAACGTGCCA AAACAGTCGA TTTTAATTTT 1140

ATTAATAGTC AACTAGCGGA TTGGATTGAT CAATGGAATC GGACGATGAA TCAATAGTTT 1200

TAGAAAAGAG GGACAAAATG AACGTCGATG TATTACTAAA AAATGTGTGG CTGTATCAAA 1260

CAGTCACTCA AACATTCGTT CAACGAAATG TTGCGATTAA AAATGATAAA TTTTATTACA 1320

TTTATGAAGA AGAGAACGTC AATTTGCAGC CCCAAAAAAC AATTAATGCC GAGAATCAAT 1380

GGATGATTCC CGGCTTAATT GATGCACACA TGCATATTGA AAGTTCCATG ACGACACCTA 1440

CTATTTTTTC CAAAGCAGTA GTGCGTTAcG GTGTGACAAC GGTGATTGCC GATGCCCATG 1500

AAATGGCTAA CGTATTTGGC TTAGAAGGCT TAAAAGCTTT TATGGCCGCT GAAACAGAAC 1560

TAGATATCTT TCACGCAATT CCTTCCTCGG TTCCTTCGAC CACTCCCGAA TTAGAAACAA 1620

CTGGTGGGAT TATCGGCTTA GCAGAAGTAG CAGAATTACT CAAAGAGCCA AAAGTGATTT 1680

GTTTAGGGGA AGCCATGAAT TTTAAAGGGA TTTCCTATGA GCCAGATTCT TTAATTCGTC 1740

AAATTATCGA TTTGTGTCAA AAACAACGAC CAACCATGCC TTTAGAAGGC CACTGTCCTA 1800

AAATCGAAGA TCAAGAGTTA GCAGATTTTC TATATAGTGG TATCACTTCA GATCATACCC 1860

ATCAATTTCC TAAAACGTTA AAAGAAAAAA TTGAAGCGGG CGTCTTTATT CAATTTCAAA 1920

ATAAATCAAT CACTCCTGAA AATATTCAGG TGATAGTAGA CAATGACTTC TACAATTATG 1980

CAAGTATTAT CACGGATGAC GTGATGGCTG ATGATTTGCT AAAAGGTCAT CTAAATGAAA 2040
ATGTCAAAAA AGCTGTTCAT GCAGGGTTGC CTATCGAAAA AGCCATTTAT ATGGCAACTT 2100

ATACGCCAGC CAAACGAATG GGTCTCCATG ATCGCGGCGA AATTGCACCT GGTAAAAAAG 2160 

CTGACTTTTT GTTATTAAAT GACTTAGAAA GTTTTGATAT TAATACGGTT TATAAATCTG 2220 

GAAAGGTCGT TTTTGAAAAA GGCGAGCCCT TCCATTATCC AGAAAAAATC GAAGAGTTTC 2280 

CAGCCACTTA TCAACAAACC ATCCAATGTA AAAAGTTAAC AGAAGAAGAT TTACTTTTAA 2340 

AGGTGGCTAC AACAAAAGAA ACAGTTCGTT GTAATGTGAT TCAAAAACAA GAAATTGGCA 2400

CATTTACAGA GCGAATTACA AAAGAAATTC CTGTTGAAAA TGGATTGTTG CAATGGCAAA 2460

AAGCCAACTG CGCCTTATTA ATTGTGATGG AACGTTACGG AAAAAATGGT AATATTTCTT 2520 

TTTCTTTAAT GGATCAACCT CTTTCCGAGA AAGGAGCCAT TGCGACAACC TGGGCCCATG 2580 

ATCATCATAA TTTAATGGTG ATGGGTAATA CGATTGAGGA TATTCTGTTA GCGCAGAATG 2640 

AATTATTAGC GATGCAAGGC GGCTATTTGG TTGCTTCGGA TCAGCAAGTG ATGGCAACTT 2700 

GTCCTTTACC AATTGGTGGG ATTTTATCGC AAGCACCGAT TGAACAATTA GGCGCTTCTT 2760

TACAAAAAGT CCGACAAGCT ATGCAAGCGT TAGGCTATCA AAATATGAAT GAAATCATGT 2820 

CCTTTTCTAC ACTATCATTA CCTGTCTCGC CAGCCATTAA AGTGACAGAT TTTGGGATGA 2880

TGGATACGAA GTCACAACGT TTCTATCCAT TAGTTTTTCC AGAGGATGGA GTTCTTTTGC 2940 

ATGAAAACAC TCATTAAAAA TGTGCATATA CTAACAATGG ATGAGCAATT TTCAGAAATC 3000 

AAAGCCGGCT ATTTAGTGAT TGAAGAAGAC ACAATTGTGG AATTAGCACC AATGACCACA 3060 

CTTGATGAAA AGCGAATGGC TGCAAATCAA GTAATCGATG GTCAAAATGG AATTTTAATG 3120 

CCTGGGATGA TTAACACCCA TACCCATGTA GGCATGATTC CGTTTCGTTC GTTAGGAGAC 3180 

GATGTGCCAG ATCGACTCCG GCGTTTTCTT TTTCCATTAG AACAATTCAT GACAAAAGAA 3240

TTAGTAGGAT GCAGTAGTGA TTATGCAATT GCCGAAATGT TACTGAGTGG TATTACGAGC 3300

TTTTGTGATA TGTATTATTT TGAAGATGAA ATTGCTAAAA GTTGTGAAAA AATGAGTGTT 3360 

CGTGCTTTGC TCGGAGAGAC GATCATTGAT ATGCCCACTT GTGATAGTCC GGAGCCTTCA 3420

GGCGGtCTTT TTTACGCGGA AACCTTTATT CGCAAGTGGC AAGGCCATCC GTTGATTACG 3480 

CCTATGCTTG CGCCACATGC ACCGAATACC AATTCACGAG AAGTGTTGGC GAAAATTATT 3540 

GAACTTAGTC GGCAATACCA AGTTCCTGTG ACCATGCACG TTGCTGAAAT GACTTATGAA 3600 

ATGGCTGAGT TTGAAAAAGC CTATCAAAAA ACACCAATTG CTTTCTTAGA AGAACTGGGT 3660 

TATTTGAGCG AGCCGTTTAT TTTAGCGCAT TGTATTTTGG CAACAGATGA AGATCTTGCG 3720 

AGTTTAGCTG CTACTAATGG AAAAGCGCGT GTCGCTCATT GTATCGGTGC GAATACTAAA 3780 

TCAGCCAAAG GCGTAgcGCC GATTAAGCAA ATGCTTGATC AAGGGATTAT TGTCGGTTTA 3840

GGCACGGATG GACCTAGTAG TGGGAATACA TTAGATTTAT TCACCCAAAT GCGCATGGTT 3900

GCGAATTTTC ATAAGACAGC ACACCAAGAT CGCTCCTTGT TTCCTGCTAA AGAAATTGTT 3960 

TATCTGGCAA CGATGGGGGG CGCTAAAACG TTAGGCTTGG CGGAGCAAGT CGGCTCATTG 4020
GAAGTGGACA AAAAAGCGGA TATAACATTA ATTGAAACGC AATCAGTTAA TATGTTTCCG 4080

ATTTTTGATG CCTATTCAGC GTTGGTTTAT TCAGCAAATG CTAGCAATGT TGAAGCCGTT 4140

TGGGTGAACG GTCAACAGTT AGTTGCCAAT AAAGAATTAC AACAAGCTAA TCTCAAAGAA 4200

ATCAAGGaAA AATTATATCA GGCCATGAAT ACGTTTGTGA AAGAAGCTAA AAAAAGAGCT 4260

GCTCTCTAAA CAGAACAAGT AGGACGTATC GGATTGATTA AATAACGCTG GATTGACGTT 4320

TAGTGATTCA GGTTCTAATA GAGCGCAGAC AAACTTTTAA TGAAAGAGTT GTCTGTGTTC 4380

TTTTTTGTTA TAAACATTTT TCTTTTTCTA TATATATTTT TAAGGTAAGT TGTATTATAG 4440

TAAGTTTGAG AAGATGAGAA TGAGAGGAAA AGCAATGAAT ATACAACGGT TTATAGAAAA 4500

ACGCAAAGCA CGTGGTTTGT CGCAAAGTGA ATTAGCTAAA GGGATTTGTA CGCAAGTTAC 4560

GGTTAGTCGC TTTGAAAAGA ATGGTCAAGT CCCAACTTTA AAAATATTGA TCCaACTATG 4620

TAATCGTTTA GAGCTACCTT TGGGGGAACT CTTCCCGCGC GTAGGGATTA AACAACCTGA 4680

AATTCTTGAA AAAATGGAAG AAGCAGAATT CTTCTTGATT ACCAGCGAAC ATGACCAATT 4740

GCAAACAATT TTAAAAAATA TTCCTTTTGA TGAAATCAAA GATTCCCAAC TGTTATTAGA 4800

ATATTATTAT TTACAAGGCT TTGTCATGAT TTTTCAAAAT GCTTCGTTGA TGGATTGCTT 4860

ATTTACCTTT GAAAAACTTC TGTTTGAAGA GCAAAAATAC ACGAGCGATA TTTACCGTTT 4920

GTTAGCTTTT ACTGGAATTG GCATGGCCTA TGCCAAAGAA GGCGAAATTG AGAAAGCCGA 4980

ATTTTATTTC AATAAGGTCT TCAAGGAAAT TTATCTCTAT ACTATTCAAT CAATGGAAGA 5040

TACTTGGCGA GTATTAAATG TTGTCTTTCA CTGTGGTGTT TTTTATGCAG AAAAAGGCGA 5100

TTTAGAAACA AGTGATGCTT TGTTAGAGTA TGCGATTTCC ATCTGTTCTG ATAATCACGT 5160

CACGTATTAC TTGGCAAGAG CAGCTTTCCA ACTTGCGAAA AATGCCTTAG CTGAAGAAAA 5220

GCCGCAAGAG CAAATTTTAG AACTACTACA AGATGCCCG 5259



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

AGGCCGTnTG AAGTGGaCCc TnGAgGAaTT wGtTCaAAAr GCmAAgGnTT TAATTGGaGA 60

AGGCcAmCGT GGAACnTGCG CaACmATTcA TTGrAGAACA TAAAGaTGAA TTAGGCGAAC 120 

ACTATCATAA AATTACAGCG ATGTTAGCTG ATACAACgAC CgATGgTTTG TTTGATAAAA 180

TTAAAGGATT GTTTCAATAG TTGAAAAAAA TCTGAGATAA AAGTGTTTAG CTCCAAAAAC 240 

CAAGTAGGTA CGATGATTTC ATTTGTTCAT GCTTCACGGT GAAACATTGC AACTCTCGCC 300 

GTTTAACAGT TTTTGAGCGT GGAGCAAAAA TCCAAAGTGA TTTTTGTCCC ACGCTCTTCT 360 

TTTTTTATTT AACTTTTAAC CATTGTTTTG GCAACACATA CGCAATTGGA TTAAAAAGAT 420 
AATCCGCTCC CCGTTTCACC ACAGGCATAT TCGTAATAGC TAAAATGAAG AGGGCTAAGA 480

AGAAAAGTGC CAGGATTCGG AAAGTAAACC ATTCTTTAAA GAGTGTGTCT AATTGCAAAC 540

CACTTTCCAA CAGAATAATT AAGGGCATAT GCAAGAGATA AGGGTAGAGG GTATCTTTTC 600

CGAGGGCGCT TAAAGGTAAC TTTCGTTTCG GCAATAATGC AAACCAAGAA AGAGACACCA 660

ATGCTGCAGC AACGTAAAAG AACAGCTGAA TGAAACTACT GTATTCAAAG GCCGGACTAA 720

TTyCAGCGTA GGAAGCCCAA CCAGTTAAGA aGCCCCAAAA TTTCGTTGCC AATGACCATT 780

GACTATTTAC AAATAAAAAG GAGCTTAAAA TTACGCCTAT CGCTGCCCAT TTGACTTTGC 840

GATGATCTAA AAAGGTTTGT AGCTTTTGAA TATCCATTAA GTAGCCACAA TAAAAGAAAG 900

GGAAGAAGCC CAGAGTCCGT CCTAAAGACA AGAAACCTTC CACAGGTAAA TAGCCTGAGG 960

CGCAgCcGAT AACAATGATG ATTGGCAACA TGATTTTCGG CTGAACGCGC TGAAGAAAGA 1020

TGGTAATTAA GTAGAAACTT GCTAAGGAAA GTAAGTACCA AGGAACGCCT GTTTCGTTCA 1080

GTAAATCAAA ATTCATTTTA TTTGCAAAGG CGAAATTAGC AAGCGTAATC AACAACTTCG 1140

CTAAGAGATA AAGCCAAACA AAGGAAAGTA AGCGTAGATA TTTTCGTTGT TTAATGGCAT 1200

GTTTACTAAA GAGTCCAGAG ACAAAAATAA AGGCTGGCAT ATGGAATGAA TAGATCCAAA 1260

AGTTGATCGT TTCCATGAGT CGTGAAGGAG AAGCTTCACG ATCTAAAATA TGCCCAAAAA 1320

TCACTAAGAA AATTAAAAAA AGTTTCACGT TGTCCCATTT ATAAATTCGC TTGTCGTTTT 1380

CCATAACGAT CACTCCTTTT AGTAGAAAAC AGTTCACTTT TGTTTAAGGC AGAAGAGGAC 1440

AATTTTTCCA TTCATTGAGT AGATAATTGT AGATACGAAC CGGCAGCCGT TAAATGAGCC 1500

CCCACATACT TGCGCTTAAA ACAAAAAATT AGGCAGGATG TTAGTCTTAA ATTATACTGA 1560

AAGAGTGACG ATCCTCCAAA GATTTTGTTT ATTAAGAAGT GAAAGATGCG GTGAAAATAA 1620

ATGATAAAAT TTCAAAATGT TTCAAAAGTA TATCAAGAAA ATGCCGTTTT GTCCGATGTG 1680

AATTTAACAA TCCAACAAGG CGAATTTTTT GTATTAGTAG GGCCGAGTGG TAGTGGAAAA 1740

ACGACGACTT TGAAAATGAT TAATCGGCTC ATTGAGCCAA CGACAGGTGA AGTGGCTCTC 1800

AATGAACAAG CGGTAACCAA TTATCCTTTA CGTGAATTAC GACTAAAAAT TGGCTATGTT 1860

CTTCAGCAAA TTGCCTTATT TCCTAACTTA ACCGTGGCGG AAAATATTGA GTTGATTCCT 1920

GAAATGAAAA AATGGCCCAA AGAGCAACGC AGAGAGCGAA CCATTGAACT ATTGAAAAAA 1980

GTTCAGCTGG ATCCTGAGGA ATATCTGCAT CGTAAGCCAG CAGCGCTTTC AGGAGGCGAG 2040

CAACAACGTA TCGGGATTTT ACGCGCCATT GCAGCCGAAC CAGAAGTGAT TTTAATGGAT 2100

GAACCTTTTA GCGCGCTAGA TCCGATTTCA AGACATCATC TTCAACGTTT GGTGAAAGAA 2160

CTTCATCAGG AATTAGCGAG TACCATTGTT TTTGTGACCC ATGATATGAA TGAAGCTTTA 2220

CTGTTAGGAG AGCGCATCTG TATTATGGAC ACTGGCAAAA TTATTCAAGT CGATACGCCA 2280

GAAGCCATTC AAAAACATCC GAAAAATAAT TTTGTCGGTC AATTTTTCAA ACAATCTTCA 2340

CCAGAACTAG CGAACTATTG TGGCGCTGAC TTGACTGCAT TTTTAGAAGC CGCAGATGAG 2400
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ACGATTGAGC CAGTGCCGAC TGTTTTCATG GAAACCAATT TAAAAGAGAT TGTCCAACTC . 24 60 

ATTAATGCGC AAGAGAAAGC AAATTTAGTA GAAAAACAAC AAGTTTTAGG GTGTATCACC 2520 

TCAAAAACGA TTACTCGTTT TATGGAACAG CTGTTGGAAA GCGAGGGACA GTAGATGCAA 2580 

GCACTGATTG AAACTTTTTT CGCTCGAAAA GAAGAATATC TGCAAGCGTT AATCGAGCAT 2640

ATGCAATTGT CCTTTATCTC GTTATTATTC GCTGCGTTGA TTGCAATTCC TTTAGCCATT 2700 

TATTTAACAA GTCATCGTAA GTTAGCGGAA AGTCTGTTAC AAGTGACTGG GATTTTCCAA 2760

ACCATCCCTT CTTTAGCCTT GCTAGGCTTA CTGATTCCGT TAATTGGTAT TGGTCGACCA 2820 

CCGGCAATTG TTGCGCTAGT GATTTATGCC TTATTCCCAA TTTTACAAAA CACCTATACT 2880 

GGTTTGACGG AAATTGATCC TTCGTTAGAA GAAGCTGCAG AAGCATTTGG GATGAGTAAA 2940

AAAGAAAAAC TGTTTAAGTT TGAGTTGCAA ATGGCGTTAC CATTTATTAT TTCTGGGATT 3000

CGAACGGCTA CCGTTTTAAT TATTGGTACG GCCACGTTGG CGGCGTTAAT TGGAGCAGGT 3060 

GGTTTAGGGA CCTTCATTTT ATTAGGAATT GATCGAAATA ATTTATCGTT GATTTTTATT 3120 

GGAGCCCTTT CATCGGCTGC ACTAGCAGTC TTATTTAATT ATGGAATTCA TTGGTTGGAA 3180 

AAAGCGAATG GGCGGCGACT AATCATTGGC GGTACTATTT TAGGCCTTTT GTTAGGTGGT 3240

TCATTTTTCT GGAATCAACA AACTTCTTCA AAGGAAAAGC AGCTGACAAT TGCTGGCAAA 3300 

TTAGGAGCGG AACCAGATAT TATTATTAAT ATGTATAAAG CCTTAATTGA AGAAAATAGC 3360

GACATTCAGG TGACGTTAAA GCCAAACTTT GGCAAGACTA CTTTTTTATA TAATGCCTTG 3420 

AAATCAGATG AAATCGATCT TTATCCGGAA TTTACAGGAA CAGTGATCGA AACCTTTTTA 3480

AAAAATCCTC CTCAGCTAGA CAATCAACCT CAAGTAGTCT ATGAAGCGGC ACGCGCTGGC 3540

TTGAAAAAAC AAGAAAACCT TACTTTGTTG AAACCGATGC GCTATGAAAA TACGTATGCC 3600 

GTCGCAGTTA AGCGCTCATT TGCAAAAGCG CATCAATTAA AAACGATTAG TGATTTACAG 3660 

AAAATTAGCA ATCAGTTGAA AGCAGGTTTT ACGTTAGAAT TTATTGATCG GCAAGATGGT 3720 

TATAAGGGAT TACAAGAAAA ATATCATTTA AATCTGAATG TCCAATCGAT GGAACCGGCG 3780 

TTACGCTATC AGGCGATTAA TAATGGTGAG GTAAATGTGA TTGATGCCTA TTCAACGGAT 3840 

AGTGAATTGA GACAATACGA TTTAGTAACG TTAGAAGATG ATCAAGCGCT ATTTCCGCCT 3900 

TATCAAGGAG CACCACTTAT TAAAACAGCA ACGTTGGAAA AATATCCAGA ATTAGCAGAA 3960 

ATATTGAATA AATTGGCAGG AAAAATTAGC GAAGAAGAAA TGAGTGAAAT GAATTATCAA 4020

GTCAATGTGG AAGGGCAAGA TCCTAGCATA GTGGCGAAAG ACTATTTAAA AGAAAAGAAT 4080 

CTTCTTAAAT AAGTATTTTT TACTGAAAAA GACTGGTATA TAAACATTTA TCCAGTCTTT 4140

TTCAGATAAA TAAAGAGTTT AATGTGAAAA GTACCTGTAT TTTTACTCAT TTTTGGATAT 4200

AATAAAAGAA ATGAATGGAG AGGAGTGGTC TATATGAAAA AAGCAGAACG TCAACGTTTG 4260

ATTAAACAAT TAATTATGCA ACAAGAAATT GAAACACAAG ATGAATTAAT CACTCGTTTG 4320

GAAGAAATTG GCGTCCGAGC CACTCAAGCG ACTGTTTCAA GAGATATCCG TGAAATGAGT 4380



WO 98/50555 

ATTGTTAAAA CACATGGTGC AGATGGTCGT GTCAAATATG CAATTTTTTC TCAAGCACAA 4440

GGCACAAGCA GTGAAGAGAA ATTACGAGAA TCAGTAAAAG ATTCAGTTGT TCGAATGGAG 4500

CGAGTACAAT TTATTGTCAT CTTGCATACA GAGATGGGAA ATGCCGATGT GGTGAGTAAC 4560

TTTTTAGATG AAGTAGCTTA TCCAGAAGTG GCTGGTACAG TGGCTGGTGC GGATACAATT 4620

ATTGTGATTA CACGTTCAGA AGAGGATGCG GAACACTTTA TCGAACGCAT CGAAAATATG 4680

ATTTTTTAAA ATTGAAAGCA TCAATAGAAG GGAGTTTTTT TCTTGGCAGA ATTGAAAGCG 4740

CTACAAAATG GTTCGGATAT TCGGGGAATT GCTTTGGATA CGGAAGAGCA AACTGCAACA 4800

TTAACCGCAA CAGCAGTAGC CGAGATTGCG GTGGGCGTGG TTCGTTGGTT ACAGGATAAA 4860

AAACAGCTGC CACGAAAGGC ACAACAGCGC CTGACAATTG CAATCGGACA TGATAGTCGT 4920

CTAACGGCGG AGcatTAAGC AAGCATTAGT AGATACGTTT CTTTCATTAG GTATTCAGGT 4980

GATTGACGTT GGTTTAGCTA CGACGCCTGC TATGTTTATG GCTACTCAAT TTCCAACGCT 5040

TCAATGTGAT GCAGCGATTA TGATTACAGC TAGCCATTTG CCTTATTACT TTAATGGCTT 5100

GAAGTTCTTC ACAGCAGAAG GTGGAGCAGA GAAAGAAGAT ATTCGGTATA TTTTATCTCA 5160

TACTGATCCG CTGACGGCCA ATGAAAATGG CACACTGATG AAGCAAGAGT TACTGCCAAT 5220

TTATGCGGAG CATTTGGTGG AAAAAATCCG CCAAGGAATT CATTCGCCAG AAGAGAAACC 5280

TTTACAGGGC TTTCGAATTA TCGTTGATGC GGGTAATGGG GCTGGGGGCT TCTTTGCGGA 5340

GCAGGTTCTT CAAGTTTTGG GCGCTGATAC GACGGGATCA CAGTTTTTAG AACCAGATGG 5400

GCATTTTCCT AATCAT^i'AC CTAATCCAGA TAATTCTGAA GCAATGAAAA GTATTCAAAC 5460

GGCTGTTTTA GCTAATCAAG CAGATTTGGG CATTATCTTT GATACAGATG TAGATCGTTC 5520

GGCCGTGGTG GACCAATCAG GTGAGGTACT AAACCGAAAT AATTTAATTG CTGTTTTAGC 5580

GGCAATTGTT TTAAAAGAAG CGCCTGGTAG TTATATTGTG ACAAATTCGC CAACCTCAAG 5640

TCATTTAAAA ACATTTATTG AAGAAAAAGG CGGCCAACAG ATtCGTTATA TTTCTGGTTA 5700

CCGAAaTGTC ATTAATAAaA TGATTGAATT GAaTCATGGG GGCTTTCAGA CACCGTTAGC 5760

CATTGAAACC AGTGGTCATG CAGCGTTTCA AGAAAACTAC AATCTGGATG ATGGTGCATA 5820

TGTCGTGGCG AAAATTTTAA TGTTGTTACC AGAATTGAAA CAAAACAATC AAACGTTAGG 5880

AGATCTGATT GCTACATTGA AGCAACCGGC AGAAACCAAC GAATTTCGCT TCAAAATTAC 5940

TGCGGAGGAT GTTACTTGTT ATGGGCAACA AGTCTTGCGA GACTTTGAAT TGTTTGTAGA 6000

AAATCAAGCA GACTTTGCTG TGGACCGTGA AAACCAAGAA GGTGTTCGGG GAAACGTGTC 6060

TGGTCAATAT GGCAGTGGTT GGTTTtTGTA CGCTTAAGTC TGCATGAACC GTTGCTTGTT 6120

CTTCAGGTAG AGAATGACCA AAGCGATAAA AATGCGTGTG TCATCGAAAA AATTGCCACT 6180

TTTTTACAAA AATACGAGGA AATAGATAGT CAGCAAATAG AAAAATAATC TAGGAAATTA 6240

AAGGTGTTCT TATGGAAAAT GTTTTAAAAG GAAAACACGT TGGCTTTTCT CATTTTAGAG 6300

AAGAATATAT CGAACATATT GCAAAACAGC AATGGGATAA TGAATTGCTT CGGCATTTGA 6360

GCTGGGATGC TTTACATCCG TGGGGGCTTG AAGAATGGAA AGATTTCACA ATTAATAAAG 6420

GAGAAGATGA TCGATTTCTA TTTGCTATTT tagaaaatac GACAGAGGAA TTTATTGGTT 6480

GGGTTTCGTT GTCTGATGTA CAGTTGAAAA ATAGAGGGGC CAATTTAGGG ATTGCTATTT 6540

TACAAAAAGA GCAACGAGGT CAAGGATGTG GCTTTGAGGC GGTGTCTTTA ATTTGTAAAT 6600

TTGCTTTTTA TGAATTGGGT TTGCACAAAA TCAGATTAGC TGTCAATAGT AACAATCAAA 6660

AAGCGATTCA TGTTTATGAA AAAGTAGGCT TCAAAAAAGA AGGAATTGAT AGAGAAGCGT 6720

TATTTCAAGA TGGCCAATGG CTGGATATCT ACAATTACGG AATTTTACAA AAAGAGTGGC 6780

TACAAATGAT TAAGGCTGAG AGTTAGCTCA GGAACTGTTT AACACCAAGA CACTGAGCAA 6840

ACGCAGATAT GCCGTTCTCA GCAGTGAATG TGCGGTAAAA TAAAAAAGCA GAAGAAATCG 6900

GCGGTTACCG TCCGATTTCT TCTGCTTTTT TTTACCATAG GCCAGGTGTT GCAGTCACAA 6960

CGATTTGTTT TTTTGTAAAC GGATGGAGTA AATGAAGTTG TGAAGCATGG AGCATTAAAC 7020

GAGCTGCCTG CGATTTTGGG TTGTATAACG GATCACCAAT AATTGGATGC CCTTCGCTAG 7080

CTAAATGGAC CCGAATTTGG TGCGTCCGAC CTGTTTCTAA AACACAGGTA ACTGCGGTTT 7140

GATTATTAGA AAGAGCTTGG ACCACTTGGA CATGTGTAAT CGCTGAATCC CCTTTTTGAG 7200

GATCAATCCG TCGTTTCCGG CGATCATGTC GATCACGACC GATCGGCTGA TTGATTGTCT 7260

GCTCTTTTTG GCTAAAGTGA CCAGCTACAA TCGCTTGATA TTGGCGGTAA ATTTGTTTTG 7320

TTTCTAATAA ACGACCTAAA ATAGGTAAGA CAAAAGGATT TTTAGCAAAG ACAATCGCAC 7380

CACTGGTTTC TTTATCTAAT CGATGAACGA CATAAGGGAC CTGATTTTTC TCGGCTAAAT 7440

AGGCAGCAAC ATGATTTAAC AAGGTGTCTG TTTCCTGTGG CTGATTTGGA TGGGTTTTAA 7500

TGCCATAAGG TTTATTTACA ATCATTAAAT GTTCGTCTTC ATATAAAACC TTCACATGCG 7560

TTGCTTCCCC CAAATGAATA GTGGGACGTA CATAATCGGT TTCTTCAAAC GTTAAGGTGA 7620

CACAATCGCC AGCTTGGACT GGAAAATGAA AAGCAGCTGT TTCTCCGTTG ATTTTGACAT 7680

TTTT 7684



(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGGAACACCT GnCGCCTATA AAAAGGTTGG CCAACAAACC CTGATGGCTG GGGCGATTGA 60 

TAAACCGAAC ACTTACACGG TCAAAwTTga TGTAGAAAcC GAGGGTTCTT TAAAAATCAA 120 

AAAAGTTGAT AAAGAATCAG GTGCTATTGT ACCAGGAACG GTTTTCCATT TAGACTTTGG 180 

AAAAACTTTA CCTGCAAAAG ACGTGACTAC TGACAAAGAA GGCATTGCTA CATTGGATGG 240

GATTCCTCAT GGTACAAAGG TAACCATTAC CGAAAAATCG GTGCCGGCGC CTTATACGAT 300

TGATACCATA CCCATGACTG CTACGATTAA AGCAGGCGAA ACCATTTTTA TGACTTCAAA 360

AAATACGCGG GAAAAAGGTC AAATCATTCT GGATAAAACC GGGATTGAAA CAGGAAGTGA 420

TCTTTGGAAC GACAATTACT CTTTAGCTGG AAACACGTTT GCTATTCGCA AGGACAGTCC 480

TACTGGTGAA ATTGTTCAAG AAATGACCAC GGATGAAAAA GGTCATGCAG AAACACCAAA 540

AGAGATTGCT AATGCATTGG AATTGGGAAC TTACTACGTG ACCGAAACCA AAGCATCTAA 600

TGGCTTCGTG AATACCTTTA AAGCAGTAAA AGTCGAATTG AAGTATGCCA ATCAAACCGT 660

GGCCCTTGTG ACCAGCAATA TCAAAGGGCA AAACCAAGAA ATTACTGGGG AAACAACTCT 720

GATAAAAGAA GACAAGGATA CTGGAGATAA AACACAAGGC AGGGCAGTCT TTGAGGGGAC 780

TGAATATACC TTGTTTACTG CTAAGGATGG AAAAGCGGTT AAATGGAGTG AAGCTTTTAA 840

ACCAGAATTG GTAAAGGGAA CGAAAGCTTC TGATGAAACA GTGACTCTGG CTTTAGATGA 900

AAAGAaTCAA GCTGCCGTTa AACACtTAGC CATCaACGAG TACTACTGGC AAGAAACCAA 960

AGCACCTGAA GGATATACCT TGGaTGaAAC GAAGTATCCT GTwTCyATCA AAAArGTwGA 1020

TgATAACGAA AAAAaTGCCG TrATTACyCG AGATGTTACG GCAAAAGAAC aGaTTATTCG 1080

TTTTGGGTTT GATTTCTTCA AATTTGCyGG ATCrGCyGmT GGCACTGCCG AAACTGGGTT 1140

TAATGACTTA ACCTTTAAAG TGTCGCCATT rGAAGGGACC AATGAGATCA CAGGTGCTGA 1200

AGATrAAGCA ATCACAGCTT ATAATGAGCA ATTAGGCTTT GATGGCTACG GAAAGTTTGA 1260

AAATCTTCCT TATGGGGATT ATTTACTTGA AGAAGTAGAA GCGCCAGAAG GATTTCAAAA 1320

AATTACGCCA TTAGAAATCC GTTCTACTTT TAAGGAAAAC AAAGAAGACT TTGTGAAGAG 1380

TGAGTATGTC TTTACCATTA CCGAACAAGG CCAAAAGCAA CCAATTAAAA CGGTGACCGT 1440

TCCTTACGAA AAGCTGACGA ACAAAGCATT TTCTGTTAGT TTGAACCGTT TGATGCTCTA 1500

TGATTTGCCT GAGGAAGAAG ATAGTTTGAC TTCTCTTGCA ACTTGGAAGG ACGGAAATAA 1560

AAAATTGAAT ACCCTTGATT TTACCGAGCT AGTTGATAAA TTGAGATATA ACTTGCATGA 1620

AATCAAAGAA GACTGGTATG TCGTAGCTCA AGCCATTGAT GTGGAAGCCA CAAAAGCTGC 1680

CCAAGAAAAA GACGAAAAAG CCAAACCAGT GGTAATTGCC GAAACAACGA CCACTTTGGC 1740

GAACAAAGAG AAAACAGGTA CTTGGAAAAT TCGGCATAAA TTAACGACCG AACAAGTTTT 1800

GGATAAAACT ATCGTCTTGT TTAACTATGT GTATGAAAAC AAGGAAGCTT TTGAAGCAGG 1860

CGATGAGCCA GTAGCGAAGG ATGCTAGCTT GAATAATCAA GCCCAAACTG TTAATTGTAC 1920

GGTGGAACGT CATGTTTCCA TCCAAACAAA AGCCCACCTA GAAGATGGTT CTCAGACCTT 1980

TACTCATGGT GATGTGGTGG ATATGTTTGA TGATGTATCG ATTACCCATG ATGTGCTGGA 2040

TGGATCAAAA GAGGCTTTCG AAACAATTCT GTATGCACTA CTACCGGATG GTACAAACAA 2100

AGAAATTTGG AAATCAGGCA AGATTGATTA TGAAGTAAAT GATAAAGAAT TTACCAAAAC 2160

TATGCTTGCG GAAAAAGTAG ATACCGGAAA GTTTCCAGAA AGTACTAAGT TTACTTTTGC 2220

AGAAATCAAT TATGATAAAG ATGAGAATAT CAATGGGAAA CACAATGAGG ATCTAAAAGA 2280

AAAAAATCAA ACCTTAACGC CCAAAAAAGT TCCAACCACA CCACGTACGC CGAAACAACC 2340

GGAAACACCA ACTGTTCCAA GTGGCTATCA AGAATCTAGT CCCACAGTGA AGACATTCCC 2400

TCAAACTGGG GAGAAAAATT CCAATGTTTT ACTGTTCATT GGTTTTACTT TGATTTTTGC 2460

GACGGCGGGC TATTATTTCT GGAATCGCCG TAACTAAGGT GATGAGATGA TACAAGTAAA 2520

TCCTAAGAAC CAACGTGCGC GACCGCCTCC TAACAAAGTG GTCGCGCTAT TTTTGTATCC 2580

AAAACTAACA ATGAAAA^AG GAAAAGAAGA TGGAATTGAA ATTTGTTGTG CCAGATATGG 2640

CCGAAACGTT TGGAAAGTTA AGTTATGCCG GTGAAGGAGA AATTTTAACA GAAGGATATG 2700

GACGGAATAC TACGGTGATT GGTCGCAGTT ACCATCTATA CTCCAGCAAA CAACGGGCCG 2760

ATGATATTGA GGTGGTAGTA GCTGCGGAAG CCGGTGAAAA GGATTTTGAT CAAGACCAAC 2820

CGCTAAAAGC CGTGAATCCC CATTTGGTTG CCAAAGGCTA TGAGATTGAA AATCGTGGGT 2880

TTACTGATTA CGTGTTGTAT GTTGATGATT TAGTGAAAGA ATAGGAGGAA AaGAAGATGA 2940

GATTAACAGA AGGTATTGTC GTAGATTCAG TATTAACATT TGGAAAGTTG CGTTTTTCCG 3000

CATTACGCCG AGAAGTACGG AAGCAAAATG AGGATGGAAC GGTTAGTAAT GAGGTAAAAG 3060

AACGAACATA TAATTTGAAA TCCTCTGCAC AAGGGCGAAT GATTCAAgTT AGTwTCCCGG 3120

CTAATGTGCC CTTACGCGAG TTTGCTTATG ATGCAGAGGT AGAGTTGGTC AATCCAATCG 3180

TGGATACAGT CGCCAATTAT GTTTTTCGAG AAGGAACTAC TGTCAATTGG TTTATTAAGG 3240

CAGACGATTT AGTcTTGAAA CGACAACCAA ATCAAGGAAA TTCTACGAAC CAAATTGAAG 3300

GAAAGAAATA GGTGAAGATT ATGGAAATTA ATGTTGAACA TATTTTAGAT TGTCTCGATC 3360

AATATGGAAA AGGTGAGTTA ACTGAGGAAC AACTCGTTAA AGCATTAACC TATGATGAAA 3420

AAATGTTTCT GATTATGCAT CAAGGTTTAC TTGGAGAAAA TAGCGACAGT GAAGACGACT 3480

TTGATGTACA AGACTGGTTA GAAGAAGAAG AGTCCTTCTT TATGGTTGTC GAAGTAAATG 3540

AAAACTTATG TCGGCAGGCA GAATCCGTCT TGGAAGAAAT CGGTGTGGAG ATGCCAGATG 3600

CCATTGAAGG CTTTTTAAAT CAATTGGTTG AAAAGAAACA ATTCCCCGTA GTAATAAATA 3660

GCTAGCATAA CTATATGAAA ATGCCCAATG AAATCCTATC GGGCGTTTTT TGATCGGAGG 3720

TGGAAACCAT CAAAATGTAC AAAGAACATC GCATACGTGC AAGAGATCAG CATTTGGTTT 3780

ATCACTTTAT TCTTGGCTGG CTAATTGCTT TGTTAATCAG TTGGATGGGT GTCTTTTATT 3840

TTCAAGAGTT TAGACAATTT GATATTTCTA GGGTATCACT TTCTACTATT GAAACTGTTT 3900

GGTCCATGAA GGAGTTAATC TGCTTACTTG GAAGTCTCGG TTTTTCAGGA GCTATGCTGT 3960

TGCTTTATAT TCACTTTTTT CCTGATCATT GGCGAAGCTT GTGGCATCGG CAAAAACTAG 4020

CTCGGATGAT TCTAGAAAAC CATTGGTATG AAGTGAAACA AACTCAAAGT GAGGGCTTTT 4080

TCAAAGATTT AAATAGTAGT CGAACCAGAG AGACTATCAG TTACTTCCCC AAAATCTATT 4140

ATCGAATGAA GGAGGGTTTA CTTTCTATTC GCGTTCAAAT TTCACTAGGG AAATATCAGG 4200
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AGCAGCTATT GAAGTTGGAA AAGAAGCTAG AAAGTGGATT GTATTGTGAG TTAGTAGAAA 4260 

AGGAACTCAA AGACTCCTAT GTGGAGTATA CTTTACTGTA TGATATGATC GCCAATCGAA 4320

TTGGGATAGA CGAAGTAGTG GCAGAAAATG GGACTCTGCG ATTGATGAAA AATCAAGTAT 4380

GGGCTTATGA TTCTTTACCC CATATGTTAA TCGCTGGTGG CACAGGTGGG GGGAAGACCT 4440

ATTTCCTCCT CACCATTATC GAAGCACTAT TGAAGTCAGA TGCAGAATTG TTTATCCTCG 4500

ATCCCAAAAA TGCAGATTTA GCTGATTTAG GTACGGTGAT GCCTCACGTA TATTCTCAAA 4560

AGGAAGAAAT TTCTGCTTGT GTGGAAGATT TTTATGAACG CATGATAGCT CGTAGTAAGG 4620

CAATGAAAGA AATGCCCAAC TACAAACCAG GAGAAAATTA TGCCTATCTT GGACTTCCAC 4680

CAAACTTTTT AATCTTTGAT GAATACGTGG CTTATATGGG TGCGAATCGT TTTCCTACAA 4740

GCATCGAATA GATGTGAAAT GGAAGCCTAT TGAGATTATC AATAGTGACA ACTAATGTTT 4800

TGAGAGATGG AATGAAAGCC TTCAAGTTGA GCTGAAACAT CTCCGCTAAT ACTCCTACAT 4860

GCTCTAATCA ATGAAGTAGA TTGGTTTTGG TATGAAGCTA GGTGAAGTCG TCTGAAATCT 4920

GCCTAAGTGT TTGAGCATAA GGCTGACTAT AGGGCGGGCG GCTGAAAAAG ATAACTATGG 4980

TAAGAATGTC TGAAAATGAC TGACGAATCT GCGAATGTAC GGGTCTAAAA GGCTGATCTT 5040 

GTCGAAAGAT AGGAACATAC GGACGTATGG TGAAATACCG ACGAGtaagG TTCCATGATA 5100 

TGAAAATCGG GATAGCTAAC GAAGAAGCTA TAATTTCACA GGCTCATAGT AGATACCTAA 5160 

GGTTATACAT GATAGCTAAA AACATTGGAA CGATGTAAGC AGAAAACTAT CAAATAAATG 5220 

ACTACTTCTA GATAGGTGCA TATAAGTAAT AACGAAATTG CTTTGTTTCT GTGAAAGTAA 5280 

GGGCACAGTA CCGATGAAGC GTCTAACAAA CGTGGAGGGA TAGCCCTAAG TCTATTTAAT 5340 

AACAATGAAT GTTGAAACAG ATGAAGTTCA AGGATCTGGT AAGAACGTGG GGAAATCCGT 5400

TATGGAGGTG CCACGCTTGA ACACAAAAAT ACGTTACTGG GAATATJACG ATCTACAAGA 5460

AACATTCGAT CAGCTATTTA CTCAAAGTAA AAACGGGAAA AAGTTTTATC AATTATACGA 5520 

GTTAATCATT TCTGAAAATA ATATTCTTTT AGCATACCGC ACTATCAAGG CAAACAAGGG 5580 

GTCTTCGACA CCAGGAACGG ACTCCTTCAC AATTGATAAC TATAAGGAAA TGAaCCAAGC 5640

AGAGTTTATA CATCTTATTC TCAGTCAGTT AGAAAATTAC AAACCAAAAT CCATAAAGCG 5700 

AGTAATGATA CCTAAACCCA ACGGCGAGAA ACGCCCATTA GGAATTCCaT GCATGATTGA 5760 

TCGAaTTATC CAACAGATGT TCAAGCaGGT ACTGGAGCCA ATATGTGAAG CAAAGTTTTA 5820 

TGAACATAGT TACGGGTTCA GACCATTGAG AAGTGCAAAA CATGCTCTAG GTCGCATTAT 5880 

GTATCTCATA AACATCAGTA AGATGCACTA TGCAGTAGAC ATTGATATCA AGGGTTTCTT 5940

TGATAATGTA AATCACCGTT TACTGATAAA ACAATTATGG AATATAGGAA TTTGTGACAA 6000 

ACGAGTCTTA GCTATTCTTT CAAAATCTCT GAAATCACCC ATTCAGGGAG AAGGTATTTC 6060

GAGCAAAGGA ACGATCCAAG GAGGTATCAT TTCTCCACTA TTATCTAATG TTGTTCTAAA 6120 

CGACTTAGAC CATTGGGTCT CAAAGCAGTG GCATACCTTT GAAACAAAGT ATCCCTATAC 6180 



WO 98/50555 



PCT/US98/08985 



GAAAGGTTAC AATAAATTTC GTGCTTTAAG AGATACAAAT CTAAAGCAAG GCTATATTGT 6240

CAGGTACGCA GATGATTTTA AAATCATGAC TAATGATTAT CCGTCCGCAT TAAAATGGTT 6300

CCACGCTGTC AAGCTCTACC TAAAAGACAG GCTTAAATTA GATATCTCCA ATGAGAAGTC 6360

TAAAATCGTC AACTTGCGTA AACGCAAATC GGAATTTTTA GGATTCACCA TTTGCGTGAA 6420

ACAAAAGGGG AAAAATGGGT TTGTAATTCG CATATTTCTG ACAAAAAGAA AAATCAAATT 6480

AAAAAAGAGA TTAAACAAAG AATTAAAGAT ATTCAAAAGA GCCCTACAGC ACAAAATGCT 6540

CTCCTATTTA ATGCTCTAGT GCTTGGTGTT CATAATTATT TCAGGTATGC CACTCATATT 6600

AACTTAGATT TGAATAGGAT GAACTACGTT CTTTCATTCA CTACGTATAA CCGATTATCA 6660

AATTGTAGTA CAAGATGTTA TCCCAACAAA GCTTCCCCTA CTTACAAAAA GTTTTACTCG 6720

ACAACGATGA AAACGTATAA AGTAGCCGGA GTCTATTTGT ACCCTCTTTG CGACGTATCT 6780

ACaAAAACAA TTTACGGATT TAATACGGAA GATACACCGT TTACCCCGTT TGGtCGTCaA 6840

aGGTTGGAGC ACAAAAGCTT ACAGTCTCTT GTATACCAAG AATTAAGAAA ATTGATGGAA 6900

TCGAAAATAC TGAATAGAAT GGTTGAGTAT TTAGATAATA GAATATCTCG TTACTCAATG 6960

AAATCAGGCA AGTGTGAGAT TACAAAACAG TTTTTGCCAG CAAAAGCCGT TCACTGTCAT 7020

CACTATTTAC CAAAAAGTCT TGGTGGAGAC GACAAATTTG ATAACTTGAG AATTATTCAT 7080

AAGGATATCC ATCTTTTGAT CCATACCACA AATAAAATGA TAATCGATCA CTACGTCAAT 7140

GAACTGAAAC TGCTGCCAGA GCAAATAGCA AAAATCAACC TTTATCGAAA AATGTGTAAT 7200

CTGCAAAACA TTCAATAGAA AATAAATAGA TGGAACGCCG TATGAGGTGA AAGGCTCACG 7260

TACGGTGTGG AGCGGGGGAA AAGATGGAGA TTATATCAAA TTCTTACCTA TCGTTATAAA 7320

TGTTAACGAC GAAAGAAAGT GCGGTGATTT TAAATAAGCT CAAACAAATC GTGATGTTGG 7380

GTCGTCAGTC TGGTTTCTTT CTAATTCTGG CTTGTCAAAG ACCGGATGCA AAATATTTAG 7440

GAGACGGGAT TCGCGATCAA TTTAACTTCC GAGTGGCTTT GGGTCGTATG AGTGAACTCG 7500

GGTATTCCAT GATGTTTGGA GAAGTTGATA AAAATTTCTT TATGAAACGC ATCAAAGGTC 7560

GAGGCTATGT GGATACTGGA GGGAGTGTGA TTAGTGAGTT TTATACGCCA CTTGTGCCAA 7620

AAGGGTATGA TTTGTTAGAA TCCATCAAAC AAGTAGCACA ATCAAAAGAA AAATGACAGA 7680

TACAGAAAAA TTTAATCTAC TAGTTCATCG GGTTAAAAGT ATGATAAACT AGCTTTGAAG 7740

GTGTCAGGGA GGTGCAATAA AAATGACAGA GGATGATTCT TTACAATATG CCaAGGAGAA 7800

CAAGAAGGAA ATTATTTCTT CTGTTATTGA TGGAAAAGAA AAAGAAGAGG AAAAAACTGC 7860

CATTTTTATG GCTGGAAGTC CTGGGGCTGG CAAGACTGAG GCTGCCCAAA CTTTAACGGT 7920

ATTAAATAGT AATCTTTGTG TGATTGATGC TGATAAGTTT CGTGTATTAT TTCCAGGTTA 7980

TGTTGGGAAT AATTCAGATG AATTTCAACG AGGATCATCA CTCTTGGTAG ATGCAGCTCT 8040

CGATTTAGTT CTTAAAAAAG GCTATAGTTT TATTCTTGAT GCTACTTTTG CGACCTCTAA 8100

GGTAAAACAA AATATTGAAC GGGCACTGAA AAAGAATTAC AATGTGCTTG TCTATTATGT 8160

TTATCAAGAT CCATTTATTG CTTGGGATTT TACCAAAAAA TGAGAAGTGG TTGAAGGGCG 8220

TTTTGTTTCT AAAGAGCGTT TTATCAATGC TTTTTTCCAA TCACGAAAAA ATTTGATGCG 8280

TGTGAAAGAG CAAATTTCAA GATCAAGTGA CAATCAATAT CTTAGTGAAG GACTTTCAAA 8340

ATACGATTTC TGATTTTCTG ATGGATATTG ATAATGTTGA GTTGACATTG CCAATCAGTT 8400

ATACAAAAGA AATATTGGAG GAAGAGCTAC ATGACTGAGA AAGAAAAAGT TGAGGAAATT 8460

ATGGAAAAGT ATAACCGTAA CTTTTCCACA TTACAAAAGA ATGCTTCTGC GAAAGAGCTG 8520

AAGACAGTCT TTAAATTTGT AGCAGATGAA TCCAACCGAA AACAACGTGA ATTGATTGGT 8580

TTGGATAAAG AAAAATAAAT ATGAAATGAG AATAGTTGTA GAAGCCAAGA GCAAAAATGT 8640

TCTTGGTTTT TTTCTATGCA ATGAAAGTAG ATATGGAAGG AAAGGAAATC ACAATCAGCA 8700

ACTCAACTGA TAGGCACTGA CCGCGGAGGA TTTGCGAAgc AGAAATCGTC CGTGGTCCTG 8760

TGCCTGTTCA TGGACGGCGC TAGcgGCCAT GATGTCTCAC CCCCCGCATC TAACAGGGGG 8820

GTACAAAAAC ACAAGAAATA AGCAAGCAAC GATTAAGAAT TGCTGGTGTA GCAAGGATAT 8880

TAAGTAATTT GCATAGGTGT TAACTCATTT GAAAAAGTTA ACACCTTGAA TTTGGAGGTG 8940

AGAACTTGGT ACAACGGAAT TTAGATTATC GGTTATTAAA AGATCGTCGC AATGAATATG 9000

GTATTTCGCA AAATAAATTA GCTACCGCTT GTGGGCTTAG TCGACCCTAT TTAAATCAGA 9060

TAGAAAACGG TGGCGTAACT GCCTCTACAA AAACCATGAG AAAAATTTTT AACCAACTGG 9120

AAAGTTTTAA TCCCGATTTA CCTTTATCGT TACTGTTTGA CTATGTAAGG ATTCGTTTCC 9180

CCACAACGGA TGCACGGAAA ATCATTCAAG AGATTCTCCA CCTGAAATTT GATTATATGC 9240

TCCACGAAGA TTACGCCTTT TACTCCTATC AAGAACAATA TGTCATGGGC GATATTGTGG 9300

TGATGTTGTC TCATGAAGAA GATAAAGGAG TTCTTTTGGA ATTAAAGGGT CGTGGTTGTC 9360

GGCAGTTTGA AACTTTTTTA CTCGCCCAAA AGCGTAgcTG GtACGACTTT TTCGAAGATT 9420

GTCTAAAAGC CGGTGGCGTA ATGAAACGCT TAGATTTGGC AATCAATGAT CTAGTAGGGC 9480

TATTGGATAT TCCTGATTTA ACGAAAAAAT GTCAGAAGGA AGAATGTATT TCCTTGTTTC 9540

GTACCTTTAA AAGTTACCGT TCGGGGGAGC TGTTGAAAGC TGATGAAAAG GATGGTATGG 9600

GGAATACCTT ATACATTGGC AGTCTCAAAA GTGAAGTTTA TTTTTGTTTG TACGAAAAAG 9660

ATTATGAACA ATATATCAAG CTAGGGATTC CATTAGACAA AACTGAAACG AAAAATCGCT 9720

TTGAGATTCG ACTAAAAAAT GACCGTGCCT ATCATGCAAT TCAAGATTTG TTAAAGGGTC 9780

GTAGCATTGA AAGTACCACG TTTTCCATTA TCAATCGTTA CTTGCGATTT GCGGATAAAG 9840

TAGAAGGCAA AAGGCGAACT AACTGGCCAT TAAATGAACA ATGGGGGCGT TTTATTGGAC 9900

GGAATCGGAA AGAGATACAA CTGACTTCTG AGCCGAAACC GTATACCATT GAACGAACCT 9960

TAAATTGGCT AGGTAGACAA GTCGCCCCTA CGTGGAAGAT GGCCAAGGAG TTGGATCGGT 10020

TGAATCAGAC AACCTATATC CAGGATATGG TACAGAATGC GCGCTTATCA GACCGGCATA 10080

AAAAGATTTT GGAACAACAA AGTATGGCAA TCGAAAACTT GATTATATGA AAAGAGGAAT 10140

TGTGGTAGGA AGTGTAAAAG ATATGTTATT GGCATTTGGT GGCAGCATGC TTAGTGTTAC 10200

CGTGATGTGC TTGATGCATG CAAGTGTAGC GGCGAACCAA GCAATGGAAA AGAAAGTCTA 10260

GTGGTAATAA GTCAAGATAA TTTTTCAAAA GATTTTTAAA GGTAGTTAAT TTTTGAAGCG 10320

CACTGAACTA AACCCGACCA AAATTAAAAA TTTTTTCGGG TACTTCCAAG GATTAGCGTC 10380

AATAACATAC TGTTATTTAT CTATCACGCT CCTTGGTGGC ATATAGAGTT GATGGTTACG 10440

TAGTAGCGTA TCCACCAGAC GCGTAAATTT TCTTGCGGTT AGTACGATGG CTCGTTTGTG 10500

TTTATGCTTG GGTGTTTCAG CGTATTTTTT CTTATAATAC GCATGGTACT CAGGCTCGTA 10560

TCTCCTTACG GAGTTGGTGG CTTCAACTAG ATAATAACGG AGATAACGAT TGCCTCGTTT 10620

GGCCATAGGA GTATTTTCAT ACTCAGAGTT ACCTGATTGG TTCTGTCGCC AATATAAACC 10680

GGCATATTTA GCCAATTTGC TTTCATCTGA AAAGCGATTG ATAGGGCCAA TTTCAGCGAG 10740

TATACCCGCA GCATAAACAG GACCGATTCC TGGAATACTG GTTAAACATT GATATTCAGG 10800

TAGCACAACG ACTAAATCGT CAATGGCTTT GTCTAAGTCT TTAATTAACT TTTCAAGACT 10860

ACGAATTTCT TTGGCTAGAA CACCCAAAAC AATATCTACA GAGTCCTGAT GGACTTTTCC 10920

TAACCGGTAG GAACTACGTA TGGCTTTAGA AATTGCCTTT GCAATTCCtT CGGGaTTTTT 10980

AAAACGATAC TTCCCTAGTT TTTGAATTAG GTTCGCAAAC TCTTCTAATG ACAGTTCGGC 11040

AAGTTGGTCC ATTGTGTAAT CTTCCGTCAT TAGAGTTACT AATGTGGcAC TTAAAACGGA 11100

TCCACCTTCT GCTTTTAGTT CTTTAGAAAG CGTGTTGCAT TTATAATAAA TGTTTTCGAT 11160

AAAGTGTTGC TTTGTACGAA CCAGCTGTTT AATTAGCTGA TAACGCGTTC TAGTTAAGTG 11220

TTGGAGGGCA AGGTATTCTT CCTCTTTAAT GATGGAGTTG ACTTGCCTTT GGATGCGAAA 11280

ATAGTCAGCG ATATAAAAAG CATCAATTTG ATCGTTTTTG TTTTCTTCAA AAATATCACG 11340

ATACTTTTTA ATTTTATTTG GTTGTTCAAC TGAAACCATA AGGTTCAATT GATTTAACTC 11400

AAGATCTTCT TTAAAAAACA TAGCTGGATG AAAGCTATAT AAAGATGTGG CCTCCATTCC 11460

AATCACTAAG CTCTCTATCT CAAATTTTTG AGAAAATTCG AGAATGTATT CTTTGATCTG 11520

GCTTGCGCCA AGTTGACTAT TTCCATAAGA TGCTTCTTTT AAAACAGAGC AAGTTGAATC 11580

GTCTGTCATA AAGCAAGCAT CTAATTTTTC AGAACTAACG TCTAAACCGA CAAATAGTTT 11640

CATGTGGAAG GACCTCCTTT CATTGATTAT TCGGAAATTG CTTGGGTACT GTGGGATATC 11700

CCAGAATCAC GTTGTCTAAC GACAGCCTCG CATAATAGAG TTTCATTCAT TAAACCTCTA 11760

AGCCACCCAA TCTcTACTAG GaTTGGTAGA GGTCAATGGa CTCAGCTAGT GTGTTGGaGG 11820

TTATACCAAC GGTTCGGGTC ACAGACTTTC TGTCGCAGTA AAACTGCAGA AGGTGATAGC 11880

GTCTTCCCGT AAGTCCTTTC CAAGACTTAT TATTCAGGAA TATCCTACAG TATCCAAGTA 11940

GTAATCGTTA TTTTATTTAA ACATATTAGG AATAGCCCCT AAGGGmGwTT GAAGGAATGA 12000

AAGTTTCTTC AAAATATATA TTACGAGGAA AGTGACAAGA TGAATTTCGG ACAAAATTTG 12060

TATCAATGGT TTCTAACCAA TGCCCAATCC TTAGTCTTAC TGGCAATTGT GGTCATTGGT 12120

TTGTTTTTAG GATTTAAGCG GGAGTTTTCT AAATTGaTTG GCTcCTGGTG ATTGCCTTGA 12180

TTGCGGTAGG TCTGGTCTTT AATGCTTCtG GTGTAAAAGA TGTCyTACTC aATTTATTCa 12240

ATCgAATCGT CGGTGCGTAA GGAGGAGTCA CGATGGAACa GATGCGTGTT TATATTGCGa 12300

ACTTAGGTCG ATATAACGAA GGCGaATTAG TTGGTGcTTG gTTTACACCA CCTGTTGATT 12360

TTGATGmGaT AAAAGAACGA ATTGGATTAA ATAATGAATA TGAGGAATAT GCGATTCATG 12420

ATTATGAACT TCCTTTTGAG ATTGATGAGT ATACACCAAT TGAAGAGATT AATCGGTTGT 12480

GTGGTTTAGC GGAAGAGCTT GAAGGAACGC CGATTGGTGA GGTTGCTTCA GAGATTCAAC 12540

ACGCTTTCTT CAATTCATTT GAAGAAATGG TGGAGCATCT GGATAATATT GTTTATTATC 12600

CAGATTGTAA TGACATGAGT GATTTAGCGT ATTACTTGAT TGAGGAAGCA GGCGCTTTAG 12660

GTGAAGTTCC TACTCATTTA CAAAATTATA TTGACTATGA GGCTTATGGT CGAGATTTAG 12720

AAATTGAAGG GAATTATTTG GTTACGAGTC GTGGGGTCTT TGAGCTATGT GAGTGAGAGT 12780

ATATTGAATA CTGCCCTTAA AGATACTACA ATAGAAAATA TCGAGTAGTA TTTCAAAGGA 12840

GTGGAAGTGG TGAATATTGA TGAGGCAAAA CAGCTTGCAA ATTATTTTTC AGACGGATCT 12900

CCTGGATATA TATTGGAATT CAATAATCTT GAATGGGACC TATATACTAA GAAAATAGTT 12960

GGATTTGGAA TTCAAGAAAG ATTTGGTGGC TCTAAAGGAA ATTCCTTCAA AGAATTTCTT 13020

GTCTCAGAGG ATATTAGTGA TCACCTAAAG TTCAAATTAA TATACTCATa AATGaAACTA 13080

TGAAGCAAAT gCTTCTATTC GCCAATTTGa AAATCCAAGT TCaTATAAGT CaCCTAAAAk 13140

GAATCCACTT GtGACTCyAw TGaTTgAAGA GGCaTATCaT AAGaAAGATA GTGCaAGATT 13200

GCtTAATGGT ACTAAGTTTT CGAAAGAAGT GAATTTAGAG TATATTCAAC AACTTCCTAC 13260

TCGTATAAGT AAGGATATTC AAAATAAAGA TTTCGATTCT GTTTTAACAA AATCAAACAC 13320

TATGATTGAT GAAGTCTTGA AATATATCAT TGAGAACACA AAAGGAAAAA GCATTGATAA 13380

TAATACTGTA AGGTCTAAAG ACTTAAGAAA TCAAGCCTTT GAAAGACTAA ATATAAAAAT 13440

TGACAAAGAT ATGGATAmAa GAATAAAaGC TTTAGTTGGT GCTCTAAACA CTTTGyCAGA 13500

CAAAATTCTy GAGATGAGGA ATAGTCAAGG TGATGCCCAT GCTCAGGGCT CCAGGAGGAT 13560

TGTAATTAAT GAAGAAGAAG CTATCCTTAC TGCGAACTCA GCAATGATTT TATGTGAGTA 13620

TCTATTTCGA AAGTATGAAA AGAGAAAAAG GTTCAGTGAT GCAAAAGAGA GTTGATTtTT 13680

ATCAGCTCTC ATTTTCTATC TTTyCAATTT TTGCGAATAG CTTCATAAGA ATACTAGTTt 13740

GTTCTATAAC AAATTCTACT TCTAGGTCAG TGAATGTTTG ACTATGTTTA ACATTTTCAT 13800

TTTGATAGCT TATAAGTCCt TTAATAATTA AACGGACCAA ATTGATTATT TCTTTTGGAA 13860

TGCCCtTTTT TCTCAATAAG GGCAGGAGTT CTCCTTCGAT AGGCTGGTTT TCTAAAGACT 13920

TTGAGTTTTT TAAAATATCC TGCATAAAAA ACTCAAAGGC CAGCCTGATA CTGTCTAAAG 13980

TATAACGGTT ATAGATGCTG TTGTCGTAAG ACTGTATGGC TTTTTCAAAT TCTTCTTTTG 14040

CTCTTGAGGA AGCAGTCAAC CAAGTGTTAG TTTCATTGAT GATATTTCTA TCTATTTTAT 14100

CATTAGAATA AACTGAACCA TAATTTTTTA GTAACAGTTT TAAAATCTTA GCAACGTCTT 14160

TATTTGCTGG ATTTTGAGCA AGCACACTTT CACAAAGCTC TTTTATAATT TTAAATTGCT 14220

GTTCTTCTGT GAAAGATTGT AAATTATTTT GAATCGCGGT AGCCTTAGAA GTAAAGTTCG 14280

gAAAGGGAGT CGAAATAAAG GGAACAGAAg AGTTAAATTC TGCTCCATAT TCAGCAAATA 14340

AAGTTGCAAT CTCAGTTCCC GAGTAAACTT CGTTAAGTGT TCCACTTGCC ATTGTTATTA 14400

AGTTTTGCAT ATATACCTCC ATAAAAAATT AAGCATGATG TTTATATTAT ATTAGAnAGT 14460

GAGCCGATTG AATATGAAAA AAATAAAAAG CTACACCAGT ATCTGGGCAG TTGAAAAGGT 14520

CATTTATGCA ATCAATGACT TTCAGCTACC TTTTCCTGTG ACCTTTAATC AGATGGCTTG 14580

GTTTGTTTTG TCCTTGCTGT TTGTCATCGT TTTTGCCCAT GTGCCTCCGC TATCTATGAT 14640

AGAGGGGGCC TTTTTAAAAT ACTTAGGCAT ACCGGTGGTG GTCACTTGGT TCATGTCGCA 14700

AAAAACCTTT GATGGTAAGA AACCACTGGG ATTTTTACGA TCCTTTCTCA GTTATCACTT 14760

ACGGTTTAAG GTCACTTTTG CAGGAAAGAA GGTCAAAGAA CAGAAGAAAC GATGGGATGA 14820

ACCCATCACT CTAGTAAGGA GTGTGAACTA TGTACCCGAT TAAGTATGTT GAAAATAATC 14880

TTGTCTTTAA tCAAGAGGGa GAATGTTTTG CCTACTAwGA ACTTGTGCCT TAwAACTATT 14940

CCTTCTTATC GCCTGAACAG AAGTATCAAG TGCACGATAA CTTTCGTCAA TTGATTGCTC 15000

AGAATCGTGA AGGCAAGATC CATGCCTTAC AAATTGCCAC AGAAAGTAGT ATTCGTGCCA 15060

CGCAAGAACG ATCAAAAAAG GAAATTACAG GCAGACTCAA AGAAGTCGCG AAACAACGAA 15120

TTGACCTTCA AACAGATGCA TTAGTATCTA TGATTGGCGA TAGTCAGATT GATTATCGCT 15180

TTTTTATTGG CTTTAAACTA ATTGCAACAG ATGAAGAGGT CAATTTAAAG AGTCTCAAGA 15240

AATCTTTCCT TTCTGGGCTT CAAGAATTTG TCTATGGGGT GAACCATCAT TTGATGGGTG 15300

ATTTTGTTTC CTTATCCAAT GAAGAAATTC GCCGATATAC CAAGCTTGAA AAACTAATGG 15360

AAAGTAAATT GGCTAGAAGA TTCAAAGTCA GACGGGTCAC ACCGAGTGAT CTGACGTATT 15420

TAATTGAACA TATTTATGGC GAAAAAGGCA CACCGTTTGA AGAATATGAA TTTCAGCTAC 15480

CAAAGAAAAA GCTAAAATCA GAAACGCTGG TGAAACGTTA TGATTTATTA CATCCTAGTC 15540

GTTGCTTGAT TGAAGAAAAT CCTCGGTACT TACGCATGGA ACATGAAAAT CmTGAATCGT 15600

ATGTAGCCTA TTTGACAATC AATACGATTG TAGGGGAGAT GGAGTTTCCT TCTTCAGAAC 15660

TTTTTTATTA CCAGCAACAA CAATTCACTT TTCCCATTGA TACTTCAATG AATGTGGAGA 15720

TTGTGACCAA TAAAAAAGCT CTTGCCACTG TTCGCAATAA GAAAAAAGAA CTGAAAGATT 15780

TAGACAACCA TGCCTACCAA TCCGATAATG AGACAAATTC CAATGTCTTA GATGCTTTGG 15840

ATTCAGTAAA TGAGTTGGAA ACTACCTTGG ATCAATCCAA GGAGTCGATG TACAAATTAA 15900

GTTATGTAGT TCGAGTGAGT GCGGAATCCG TGGATGAATT AAAACGCCGT TGTGATGAAG 15960

TGTTCGATTT TTATGATGAT ACTAATGTAA AACTAGTCCG ACCCTTTGGC GATATGATGG 16020

GGCTACATGA AGAATTCTTG CCCTCAAGCA AACGCTATAT GAACGATTAT ATCCAATATG 16080
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TTACTTCTGA CTTTTTAGCG GGACTTGGTT TTGGGGCTAC TCAAAAGCTA GGGGAATTAG 16140 

AAGGCATCTA CTTTGGCTAC AATGTAGATA CGGGGCGGAA TGTTTACCTG AAACCGGCAT 16200 

TAGCTTCACA AGGGGTCAAG GGTTCAGTTA CCAATGCTTT GGCTGCTGCG TTCCTTGGTT 16260 

CACTCGGTGG CGGAAAATCC TTTAGTAACA ATCTCTTGGT CTACTATGCT GTTCTTTTTG 16320 

GTGGCCAAGC TGTGATTGTC GATCCAAAAG GAGAACGAGG TGGTTGGAAA GAAAACTTGC 16380 

CAGAAATTGC GGATGAAATC AATATTCTCA ACTTGACTAG TAAACCAGAA AATCAAGGCT 16440 

TACTTGATCC TTATGTGATT ATGAAGAAAA AGAAAGATTC AGAAAGTTTA GCAATTGATA 16500 

TCCTGACTTT TTTAACAGGA ATTTCCAGTC GGGATGGGGA GAAGTTTCCA GTATTGCGCA 16560 

AAGCCATTCG GACGGTTACC CAAAGTGATC AACGAGGATT GCTGCAAGTC ATTAATGAGC 16620 

TTCGAAAGGA GAACTCTCCA GTCGCGGAAA ATATCGCCGA TCATATCGAG TCCTTTACCG 16680 

ATTATGATTT TGCCCATTTG CTTTTCTCAG ATGGTTCGAT TACCCAATCA ATTAGTTTAG 16740

ATAAGCAGTT GAATATCATT CAAGTAGCGG ATTTGGTTTT ACCAGATGCA GAAACCACTT 16800 

TTGAAGAATA TACCACGATG GAGTTGCTAA GTGTCTCTAT GCTAATCGTC ATTTCCACGT 16860 

TTGCCTTGGA TTTTATCCAT AGTAATCGAG AAATCTTCAA GATTGTGGAC CTGGATGAGG 16920 

CTTGGAGTTT TCTTCAAGTG GCTCAAGGAA AGACCCTTTC AAATAAGCTC GTTCGTGCTG 16980 

GTCGGGCCAT GAATGCCGGT GTTTACTTTG TCACGCAAAA TAGTGACGAT TTACTGGATG 17040

AACGCTTGAA GAATAATATC GGATTGAAAT TTGCTTTTCG TAGTACCGAT ATTCATGAAA 17100 

TCAAGAAGAC TTTGGAATTT TTTGGGTTGG ATCAAGAAGA TGAAAGTAAT CAAAAACGCC 17160 

TACGAGATTT GGAAAACGGC CAATGCTTGA TGCAGGATTT GTATGGTCGG GTAGGCGTGG 17220

TTCAGGTACA TCCAGTCTTT GAAGAACTCT TCCATGCCTT TGATACCCGT CCACCTGTTC 17280 

AAGAGAGAAT GGAGTGAGGG GATGAAAAAG AAACTTTTTC GGATTGTGGG AATCGTTTTT 17340

ATTAGCATAC TCACTCTGAT AGTTCTATTG AGTTTAGTTG GTACGGTAGC CGAGGCGACA 17400 

GGGCTAGTTG ATGATACAGT TGAATCAGGA AATCTATATT CCAAATATTC ACTGAACAAC 17460

TATCAACTAG ATTTCTTTGT GGATAGCTCT TGGGATTGGT TGCCTTGGAA TTGGGGTGAT 17520 

GGCTTAGGGA AAAGCGTGAT GTATGGTCTT TATGCCATTA CTAATTTCAT TTGGACGGTG 17580 

AGTTTGTATT TGTCGAACGC CACGGGTTAT GTGGTTCAAG AAGCCTATAA ATTAGATTTC 17640 

ATTTCGGATA CGGCTGAAAG TATTGGAAAG AATATCCAAA CATTGGCTGA TATTACTGAA 17700 

AATGGACTCC AGACCTCCGG ATTTTATTTT GGGTTTCTAT TATTGATGAT TTTAGCTCTG 17760

GGGGTGTATG TGGCGTATAC CGGATTATTA AAACGAGAAA CCACGAAAGC AGTTCGAGCC 17820

GTGATTAATT TTGTGATGAT TTTTCTACTA TCGGGTTCGT TCATTGCTTA TGCGCCTACC 17880 

TATATCACAA AAATCAATGA CTTTAGCTCA GATGTTAGTG AAGCGGCCCT TACCCTTGGA 17940

ACCGAGATCG TTGTACCCAA TTCAGAAAGC CAAGGAAAAG ATAGTGTGGA TTTGATTCGG 18000 

GATAGTTTGT TTTCGATTCA AGTCCAACAA CCTTGGTTAC TACTGCAATT TGATGATTCC 18060 
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AACATAGAAG AAATTGGTGA AGACCGGGTC AACAAGATTC TATCTGTGAG TCCAGATGAA 18120 

AACAAAGGCA AAGATCGGGA AGAAGCGGTC AAGGCAGAGA TTGAAGACAA TGACAATGCG 18180 

AATCTCAGTA TTACCAAGAC CATGAACCGT TTAGGGATCG TGGTCTTTTT GGTATTGTTC 18240 

AATATTGGCA TTTCCTTCTT TGTTTTTCTT CTAACAGGGA TTATGTTGTT CTCGCAAATT 18300 

CTCTTCATTA TCTTTGCGAT GTTTCTGCCA ATTAGTTTCT TATTGTCCAT GTTGCCGACT 18360 

TATGAGAGTT TGGGCAAGAT GGCGATTATT CGTTTATTCA ATACGATTAT GATGCGAGCG 18420 

GGTGTCACCT TGGTGATTAC GACTGCATTC AGTATTTCTA CGATGTTTTT CAATATTTCG 18480 

GCTACCTATC CCTTCTTTAT GGTAGCTTTT CTACAAATTG TGACCTTTGC CGGTATCTAT 18540 

TTCAAATTGG GTGACATTAT GAGTATGTTC AACCTTCAGA GTAATGACAG TCAGTCAATG 18600 

GGCAGACGGG TCATGCGAAG ACCGCAAATG TTGATGAACC GTAAATTGCG GCAACTCAAT 18660 

CGAAATGTGG GCCGAACTCT GGCTTTTGGC GGTGCTGCAG CAGTygGaaT GGCAAAATGC 18720 

ATTACtCTGA TTTATGGTAC AATTTTCCCA TTAATCCGCC GTATGTACCA ACGGAAAACC 18780 

CAACAGGTAT TTACAAACGG ACATTTGCTA TAGATGAAAC ATTTCACGAT AAAAAAATCA 18840 

TTTTACGTTT TTGTGGCGTA GATTCAGCTT ATCATGTTTG GGTGAACGGC CATGAAGTTG 18900 

GCTACAGTAA AGGTGCACGA AATGAAGCAG AATTTGACAT TACGCCTTAC GCTAAAATTG 18960 

GTGAAACAAA TGATTTAACG GTGCGAGTCT ATCAATGGTC AGATGGTACG TATTTAGAGG 19020 

ATCAAGATAT GTGGTGGTTA AGTGGTATTT TTCGTGATGT TGAATTACTC GGTGTACCAG 19080 

AAAATGGGCT GGAAGATTTC TTTATTATTT CCGATTTAGA TGATTCGTAT CAAAATGGGC 19140 

ACTTGGCAAT TACGGGTAAA TTTTGGCAAG ACAAAGGACA ACAAGTTCAA TTGGAGTTGA 19200 

TGGATCAGCA GGGAAAAACG GTTCTTAAAG AGACTGTCGC AGGAAATCAA GGTACGGTTG 19260 

AATTTTCAGC AAGTTTGCCG TCTGTCACTG CTTGGTCAGC TGAAAAACCA TACTTGTATC 19320 

AACTATTTAT CACTGTTTTT TCTGAAGGGG AAGTCGTCGA AGTTATTCCG CAAAAAGTTG 19380 

GTTTCAGAAA TATTCATGTA TCTGGTGAGA CTTTCTTAGT CAATGGTGTG GCCATTAAAC 19440 

TAAAAGGAAT GAATCGTCAT GATTATAATC CGAAAAATGG TCGAGTGGTT AGTCGTGAAG 19500 

AAATTGAAAA AGATATTCGT TTAATGAAAC AATTTAATAT TAATGCCATT CGAACCAGTC 19560 

ATTATCCAGC ATCTGCTTAT TTCTATGATT TATGCGATGA ATATGGAATG TATGTGATTG 19620 

ATGAAACTGA TTTGGAATGT CACGGCTTTG AATTAACCGG AGAATATGAT TGGATTAGCA 19680 

ACGATCCAGA ATGGGAAACG GCTTATGTTT CTCGGATGGT ACGGATGATT CAACGGGATA 19740 

AAAATCATCC ATCAATTCTC TTTTGGTCGT TAGGGAATGA ATCTGCGTTT GGTCATAACT 19800 

TTATTGAAAT GGCCCGTATT GCAAAAGAAA TGGATCCAAC ACGTTTGGTT CATTATGAAG 19860 

GTGATTTTGA GGCAGAGGTC ACGGATGTCT ATTCTACAAT GTATACGTGG TTGGAACACC 19920 

CAACTAGAGA ATTACTGATG AATACGATTA TTGAAAATTC AAAAAAACCG CATATTTTAT 19980 

GTGAATATTG TCATGCGATG GGTAATGGCC CTGGGAATTT AAAAGAATAC CAAGAGCTAT 20040 

TTTATGCTCA TGATAAATTA CAAGGTGGCT TTATTTGGGA ATGGTTTGAC CATGGCATTG 20100 

AATCTGTAAC AGATAACGGT GAAGTTTATT ATCGTTATGG CGGCGATTTC GGCGATGATC 20160 

CAAGTAACAA AGATTTTTGT ATTGATGGGA TGTTGATGCC TGACCGTACG CCATCTCCTA 20220 

GCTTGTATGA ATACAAAAAA GTCATTGAAC CCATTACAAC TAGCGCTATT GACGTTCTGT 20280 

CAGGTGAATT CTCATTATTG AGTCGGTTTG ACTTTGAAAA CTTGGCTATC TTTAAGCTGG 20340 

TTTATACAAT TACTGAAGAT CAAACAGTGA TTCAATCAGG GACAGTAGCA GTGCCAGCAA 20400 

TTGCTGCTAG AGCAGAAGGA AGACTGCACT TACCTTATCA TTTAGATTTT CCTAAAAAGG 20460 

CAGGAGCCGC CTATTATTTA ACTCTTTCAT ATCAATTGAA AGAAACGACT GCCTATGCAT 20520 

CTGCTGGTCA TGAATTAGCT ACCGCACAGT TTGAATTGCC AATTGCAACA CCAGGGATTG 20580 

AGATCACACC AGTTGGCTCG CTAATGGCAA AGGAAATTGG CCCTCATTTA TACATTGAAG 20640 

GACCAAACTT TTCAATTAAT TTTGATAAAG TGAAGGGCGC CTTAACGAAT GTGACACGAG 20700 

ATGGTAAAAA ATTATTGCAT AAAGGGCCGA AATTTACATT TTGGCGGGCA CCAATCAGTA 20760 

ATGATATGGA AATTATTGAT GAAATGAAGA AGAAATATTT CTTACATTTA GAGCATGAAA 20820 

TTGTCCGTTC TTTTGAGTGG AAAAAAGTTG ATGATTTTAT CCAAGTCATC GTAAAAACAA 20880 

TTAACGGAAC GACTAACAGT GCTTGGCATT ATCAATGTAC CTATCAGTAT TTAATTGCAC 20940 

CAAATGGCGA AATTTTCTTC GACTTAAAAG GTAGTCCTGC TGGTAAAATT GAAAATGCGC 21000 

CAGATATGTT ACCACGTTTG GGTGTGACTC TCCATTTAGA CAAGTCTCTA TCAGAAGTAA 21060 

AATATTTCGG TAAAGGGCCG AGAGAAAACT ATGTAGATTC ACAAGAAGCT GGTTTACTTG 21120 

GCGTTTATGA TGCTACAGTG GCGGAAATGT TCACTAATTA TGTTGTTCCA CAAGCCAATG 21180 

GTAACCACAT GGCGACTAAA TGGTCCGCAT TCACGGATGA TCGCGGCCAA GGGGTAGTAG 21240 

CAACAGCGGC TGATTCATAT AATTTTAGTG TTTCTTATTT TGAAGAACAA GCGCTAGATG 21300 

TTGCTAAGCA TACCAATGAA TTACAAGAAA GTGAGTATGT TGTTTTAAAT ATCGATTACA 21360 

AACAAAACGC CCTAGGTAGC TACTCTTGCG GCCAGTGGCA ATTAGAAAAA TACCGAACTA 21420 

CGTTTGAAGA ATTCCAATTG GCATTCCGCT TAACACCTTT TAATAATAAA GAGATTCAAG 21480 

CAGCGGATGT CGCACATGAA CGAGTAAAAC GACCAACAAT TTCTTAAATA AAAGCTCAGA 21540 

ACAGTGCACT TCGGTCTCCT GTTCTGAGTT TTTTTAATGA AATTATCAGA AATTATTGCT 21600 

TTTTTTTCAT GGAAATAAGC AAACTATTTT GTTATCTAAT AAAAATAGTT CATAATAGTA 21660 

ACATAGCGAA TTATTCGCTA GATAAAGCAT TTAGGAGGAA TTATTTTATG CATACTTTAT 21720 

GGGTTTTAAT CGTTGGTGCT GTTATCGGTG CTAtCgcTGG CGCGCTTACA AGCAAAGGTC 21780 

AATCAATGGG TTGGATTGCG AATATCGTTG CCGGTTTAGT TGGTTCATTT ATCGGAGAAA 21840 

AAATCTTAGG TAGCTGGGGT CCTCACGTAG CAGGTATGGC AATTGTGCCA TCAATTATTG 21900 

GGGCAGTAAT TTTAGTTGCT GTGGTTTCAT TCTTCTTCGG ACGAAAAAAT AGCTAATTAA 21960 

GTAAATAATT GTTTAATTAG TTAAAGGAGG TCGGGACAGA ACCGTTTAGC TCCGAGAAAT 22020 

AAGAAGAAAT TTCCGAAAAT TGCTTTTTAA TTTTTGGATA ATTTCGGCTT ATTTCCGAAG 22080 

GAGTTGGTTC TGTTCCCGCC GTTTATCAGT TTTTGAGCGT GGAGCAAAAA TCCAAAGTGA 22140 

TTTTTGTCCC ACGCTCCTTT TTTTATTTTC TCGTTAATTT TACGACAACT TCACAACGAG 22200 

CGGTTTGTGG AAACATATCA ACAGATTGTA AATAGTCTAC TTGATATACT TTGGCTAATT 22260 

GAACTAAATC ACGTGCTAAC GTGGATACAT TACAAGAAAT ATACACCATT TTCTTAGGTG 22320 

GTTGTTTCAA AAGGGCCGTT AATAATTTAC GATCCAAACC AGTCCGTGGG GGATCAACTA 22380 

CAATGGCATC AGGTTTAAAG CCTTCTTTGA ACCATTTAGG TAATAAGTCT TCAGCTGTTC 22440 

CAACAGCATA ATGTGTGTTC GTAACACCAA GTCGTTTTGC ATTGAATCTA GCATCGTCAA 22500 

TTGCGGCGGG AATGGTATCC ATACCACGAA CTTGATGCGC TTGTTTCGCT AAACTTAAGC 22560 

CAATGGTCCC GACGCCACAA TAAGCATCAA CAACAGTTTC GTTTGGTTGT AAATCTAAGG 22620 

CCTTGATTCC TTCATTATAT AAAACTTCCG TTTGTTCTGG ATTCAATTGG AAAAATGCTC 22680 

GCGGCGATAA ATCAAAGACA ACGTCGTTAA TGTGTTCTTC AATGCTTTCT TTGCCCCATA 22740 

AGTGAAGGGT ATCATCACCC ATTACTAGAG AGGTTTTTTT ATTTTGGACA TTTTGCATAA 22800 

TAGAAACAAC TTCTGGTAAT TGTTCATTAA TTGCGCGAAC CATTTTTTCT TTTTGTGGAA 22860 

ATTTCTGCGA TTGTGTAATG AACACTACTT GGACTTCGCC AGTTTGGATG CCCACTCGAA 22920 

CCATTAATGT TCGAAAGATA CCACTATTTT TACGTTCATC ATAAATTGGT AATTGAAAGT 22980 

CATTCAATAA ATCAACTAAT GTATTCATCA CCTTGGTTGT CGCAGGCTGT TGGACCAAAC 23040 

AGTTATCAAT CGGTACTAAT TGATGTGAAT CAGCTTGATA GAGACCAGCT TCTACTTGGC 23100 

CATCGATTTG GCGTAATTGG AATTGGGCTT TATTTCGATA ATTCCAAGGA TTATCCATCC 23160 

CAATTGTTTT TCGTAGTTCA TAGTTTTGAT AATTGGCTGG TTTAAATTTG TTAAGCGCCT 23220 

GCTTTAATAA ATCTTTTTTA AAGTCTAATT GTGCTTTGTA GGCTAAATGT TGTAGTTGGC 23280 

AGCCGCCACA TGTTTCATAG ACAGGACAAG GAGGGACCAC GCGATCTTTA GCAGCTTTTT 23340 

TGACAGATTG AAGGGTTCCT TCAGCAAATT TTGGTGTAAC ATTTGTGATC GTGGCAGTCA 23400 

CTTCTTCTTT CGGTAGCGCC CCAGGAACGA AAATAATTAA TTTTTTATAA TAGCCGATAC 23460 

CTTCGCCATT AATTCCTAAT CGTTTAATTT TTAAAGAAAT TTGTTGGCCT TTTTTTACTA 23520 

GTTGTTGGGT CATATTCTTT CATCCTTTAG TCATCTCACA TTTTTCACTT CCTCCCATTT 23580 

TAGCACATTT TTATTTATTT GGCTTCCAAG GATTCTATGG TATAGTAGAA TATATGTTCT 23640 

GATTTAGAGG GGCAGGAGAA TAAGATGACA ACAATCACAC GAATTAGTAA AGATAAAGGT 23700 

GAATTTTATC TACTTTGGCT ATCTTCAGGC GAAAAACTGC GTGTCTCGGA AGATATTTTA 23760 

GTACGGCAGC GTCTGCTAAA AGGCCAAGAG TTATCAGATA CTTTAATAGA AGAAATAAAA 23820 

AAAGCCAGTT CTTATGATGT CGGGTTACAA ATGGCGATGA ATTATTTAAG CTATCAATTA 23880 

CGTTCAAAGA AAGAGATTTT TACTTATTTA AAAGAGAAAG AAATTGTACC TGAAGATCGA 23940 

GTAAAAATCG TTCAACGTTT AGAAGAGCTG CGGTTGTTGG ATGATGCAAT TTTTAGTGAA 24000 

AGTTACGTAC GGACAGCTAT GCGAACCAGT GATAAAGGAC CACGAAATGT GGCACAACAA 24060 

TTAAAGCAAA AAGGTATTAG TGAAGAAGAT ATTCAGCATG GTTTGACCTT TTATACGTTA 24120 

GATGAACAAC TTAATGTTGC CACAGCAaCT GCAGAAAAAG CCATGAAACG CTATCGAACT 24180 

AAAAGTTTTa AAGATGCACT TCaGAAAATG CGTCTGCATC TGATGCAAAA GGGCTTTACA 24240 

AATGAAATCA TTGATCTAGC TTTGGAATCT TTAGCTTTTG AAAAGGACGA AGAACAAGAA 24300 

CAACAAGCGT TAGACAAGGA AGGCGAACGT TTATGGCGTG CCAATCAGCG ATTTGATTTT 24360 

TCTAAAAAAG TCCAAAAAGT CAAACAAAGC TTATTCCAAA AAGGGTTTGA CTATGATTTG 24420 

ATTCAGCAAT TTATTAGCGA AAAGGAAGTA GAACATGACG AATAAAGAGC CTTGGGAGGC 24480 

GTGGAGTCCC GCAAAAGTTA GTTCATTCCA GGAAGATTTT TTAGCTTGGT ATGAACGAGA 24540 

AAAACGCAAT TTACCTTGGC GAGCGAATAC AGATGCATAT CGTATTTGGA TTTCTGAAAT 24600 

TATGCTACAA CAAACTCGCG TAGATACAGT CATTGATTAT TTTTATCGAT TTATGGAATG 24660 

GTTTCCGACG ATTCAAGATT TAGCGGAAGC GCCAGATGAT AAGTTGTTGA AAGCTTGGGA 24720 

AGGGTTAGGT TaCTATTCAC GAGCGCGTAA TTTAAAAGTG GCAGCGCAAC AGATTGTTTC 24780 

AGAATTTGGT GGGAAAATGC CTGACACAAT CGAAGATATT CGGAGTTTAA AAGGAATCGG 24840 

CCCCTATACG GCTGGTGCGA TTGGCAGTAT TGCCTTTAAT CTCCGCGAAC CAGCAATTGA 24900 

CGGTAACGTC ATGCGGGTAG TGAGCCGTTT ATTTGAAATT GATGCAGATA TTGCAAAAGC 24960 

GAGTAGTCGT AAAGTATTTG AAGCGGCGAT GCTTAAAATC ATCGATCGTG AGCGTCCAGG 25020 

GGATTTTAAT CAAGCTTTAA TGGATCTAGG CTCAGCCGTC TGCACCCCAA CTTCCCCTAA 25080 

GTGTGAAAGT TGTCCGCTTC AGCAATATTG TGCGGCCTAT CAGGCAGATA AAATGACCGC 25140 

TTATCCAGTC AAATCAAAAA AAGTTAAGCC GAAAGATGTG TATTATGTCG GCACTATTAT 25200 

TGAAAATAAA AAGCAAGAAT TTTTATTAGA ACAACGCCCT GAAACCGGTT TGTTAGCCAA 25260 

TATGTGGCTG TTTCCGATTG AAGAAATTAG CAAAAAACAA TTTCAACAAT TGCAAAAACT 25320 

GGCGCAACCA GCAGAAACGG AAAAACAATT AACTTTAGAG TTAGAGCCTG TAACAGAGCC 25380 

TTTGGTCGCT GAAGAACCTG TCAACTTTTT TACGGACTAT GAAACAGTTG TCTGGCAAAA 25440 

ACGTACCTTA GGCGAAGTCG TTCATATTTT CAGTCATTTA AAATGGCATA TCTTAGTCTT 25500 

TTATGGCCGA AATACAGGCG AATTAGCCAC ACTAGAGTCG CAGCGATGGG TGGCAGCACA 25560 

ACAATTTTCA GACTATGTTT TTCCAAAACC ACAGCAAAAA ATGGTGGAAT TGTTCAAAAA 25620 

AGAACACGAA AAAAAATAAC AAAAAAGAGA GGCTTTTTAA CAAACAAGCC CCTCTTTTTT 25680 

GTTTCCTCTA GGTTATACCC CTATATAAAT TCTGCTATGT TTAGTTTTGA AAGGGACTGG 25740 

TAAAATAGAC TTGTCTCTTG TCTGCTTAGA GGAGTAGTAA TGATGGTGGA GACTTTTTAA 25800 

GAGAGAGGAA GTACAGCCAA TGAGTAGGAA GCGAAAAATC AGCTTAATTA GTTTAGTCAT 25860 

CATTTTGGTT TTTGTCACAG TCGGCTCAGC ATACTTTGCT GTAGCGGGTA GCTATTTAAA 25920 

GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT TATAATGAAG CGCAAAATAA 25980 






AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA ATTGAACGGA AATTAGGCAC 26040 

AACTAGGACT GATGCTATGA TGGTGATTAC CGTGAATAAC AAGACGAAGA AAATAACCTA 26100 

TTTAAGTTTG CCACGGGATA GTTTTGTTCA AATTGATGCG AAAAATTACC AAGGGATGCA 26160 

GCGAATTGAA GCCGCCTATA GCTACGATGG ACCAACAGCT TCTGTTAACA CAGTTGAGAA 26220 

ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC TTTTTATCTT TTATTAAGTT 26280 

AATTGATGCG GTTGGCGGCA TAGATGTCAA TGTCAAGCAG GCGTTTGATG GTGTCACCAA 26340 

AGACGGGCCA GGATCCATTC ATTTTGATGC AGGGAAACAG CATTTAGATG GTACGAAAGC 26400 

TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT ATGCGTGGAT TCCGACAACA 26460 

AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTGGT CAATCAATCA TGAAAATAAT 26520 

GGACATTATT GATTCGTTAA ATGGAAACAT TCAAACTGAT GTGGATTCCA ATGAATTGAC 26580 

TCATTTAGTC AAAGAAGGTT TGACTTGGAC CAATTATGAT AAACAACAGC TTTCTTTTGA 26640 

CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA CTATACCCAG ATAGTATTGA 26700 

AAATGTCCGT CATCAATTAC GTGTGTCTTT AAATTTAGAA AAGCCAGATG AACGAGATCA 26760 

AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT CAAAGTGATT ATACCGTTCA 26820 

AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTCCATCAAC GGCAATACGT ATATTGGTGT 26880 

TCCTGGTAAT ACACAGACCG GCCCGTTGCC ATCAGTTAAA ACGGAAAATG GCTTTATAAA 26940 

ATAATTTATT TTCCAAGTGT AGAAAATCCG AACATAACAG ATTAAATAAT CCGCTTACTT 27000 

ATTATTCTTC TAAGGAAAAA ACGAAGGATT TTAGCAAGGA AACAGCGTAT CATTTGATAT 27060 

AAACCCAACA TTCCCTTTAT AATGAAAGTA ACAAGCAAAT TTTTGTTACT TAGAAGGGGA 27120 

AGGGAGCAAG GAGTTATGTT TAGTTATACA GATACCATTT TGTCAATTAT GCAAAGAATT 27180 

GAAGTATACA ACGAAATTTT CAAAGCCATC TCTAAAGAAG TTCAAGAATA CGGTTGTAAT 27240 

CAGGCTGCAC ATCGGAGAGG CCGTGACACT TATATGTTTT GCCGAAATAA TGTGAATCGT 27300 

TTTTTTACAG AAGAAATGAT GTTTCGTAAA AATTTAGAGC CGTATTATGA TAAAGAAGCA 27360 

ACGAAAATCT TGCTAGAAGG ATTAGATACG TATAAAGAAG GAGTATTTTT CTGGTTAGAG 27420 

GCATTAAGTG AATCTTGCCA AGTGACAGAT GAATTGAAAT ATCAATTAGG AATCCGCCAA 27480 

AAAGAATCCT CTTATTTATT AATTAATCAA GCTTGTAAAG AAGCCTGTGA GGGCATTCAA 27540 

AGCGCTCATA GTGTTCATAA GATGTAAAAA GTTAGGGATA GCTGTCAATC TGGCAGTTAT 27600 

CCCTAACTTT TTTATTTCAA TATCATCTGA TAAGCATAAC GTTCGCCATC GTTAACATCT 27660 

AAAATCACGT CACCTTGATA AACAAAGCCA GCTTTTTTAA TCAGGTGCTG CATAGCGAGA 27720 

TTTTGTGGAT GTGTATCGAT TCGAATATCA GTAGCACCTT GCAAACGAGC GGCTGTAACC 27780 

AGTGAGCGCA AAAGCACCAA AGCCAGTCCT TGACCTTGAA AGGAACTTTT TAGAGCAACA 27840 

CAATGAATTG CCGTATACGT AGTTTGAGAA TGAGGTTGCC AAGAACCATT AAAAATCGCT 27900 

TCATAAGCAG GATCAATCTC CGAACATAAA ACACCAACGC CTGCCAAGGC ATCCTCAACC 27960 

ATTAAAATAT AGCAAGTTTG CTGGTCGATA TCTTTTGCTA AAATTTCTTG ATTCGGGCCA 28020 

TCGCCGTTTT GCCATTGGGG rATGCCACTT TTTTGTAAGG TGCGGCGTCC ATCTTCAATG 28080 

ATAGCAAGGA TGTCAGGCAA ATCAGAAGGT TCTGCTTTTC TTAAGTAAAC AGTCAAAAAA 28140 

TATTCCCTCC TCTTAGAAAC CTGTTGTTGT TTTCAATTGG TAAATTTTTG AAGGGCGGCC 28200 

AATGCCAACG GGACGTTCAC CTAACTCTTC AAAATATTGT GTCATGGCTT TTTTAAAATT 28260 

TGAATGGTCA ATGCTGCGAT AATCAACGCC GAGGAACTTC GCAAAAACTT TACGGGCTTC 28320 

CGTAATGGTG AAATCTTTTC CTAAGACTTG AAGCACTTGC GGTTCATGTT CCATTTTATC 28380 

AACAACACGA TTAAACGCCT TAATTATAAT TTCACTATGG TCGAAGGCTA ATGTATCTTT 28440 

ACCAAGAGAG GCCGCTGTTT TCAAGTCTAA AGTAATTTCC ACATCTTCAT GAGATAACGT 28500 

GATGTGTTGA CCATGCCGTT CAAGATTGAA CCAATGGACT TCTTTGGCAT CGTCGCCAGC 28560 

AATTAAAGGT TCTTCACCAA TAAAGGCTAA ATAGCTGACT GTCACGACCC AGCCACGAGG 28620 

ATCTCGATCT GGGCGACTAA AGCTGTGTAA TTGTTCAATA TTTTCTTGAG AAATCACCAC 28680 

ACCAGTTTCC TCTTTTGTTT CTCTTAAAAC ACTATCTTCA GTAGATTCGT TGCGATTAAC 28740 

AAAACCGCCA GGAAGCGCCC ATGAGTTTCT AAAGGGGTGA CCTTTACGTT GAATCAGTAA 28800 

GACTTTTAAT TGATCAGCTT CCTTGTTGTA ACAAAGAAGA ACCATATCAA CGGTTAAAGA 28860 

GGGTTTTTCA TATTCAGGTA ATTCTTGTTG ATGATACCAA GTTAAAAACT CTGCTAAGCT 28920 

AGCTTGGCGT TCATAATAAT TTTTTTCTTC GGCTTTAGAA GCAAATTGTG GCACAATTGC 28980 

TCCTCCTTTC AAAAAGAAAC ATGGTTTTCT TTTATTGTAT CATTAAGTGG CTGAAATACA 29040 

ATATTCAAGC GTAAAAAGAA AAGCAAGCCT GGGACAAAAA TCACTTTGGA TTTTTGTCCC 29100 

AGGCTTGTAC TATAAGAAGT TTGTTCTAGA CGTTATTTAT TGTGTTAATT CTAATAATTT 29160 

ATTTTTCAAA TCTTCTTCCA TATGGCCAAG TTCGATTTCG GCAGCTTGGC GTTTTTCTTT 29220 

TCCTTCTTGT TGAATACGTA ATGTTTCTTG GATAGTTTCA ATTAAATCAT TTTGTGTTTT 29280 

TTGtAATGTT TCAATATCGA CTAGGCCACG TTCGTTTTCT TTTGCTGTTT CAATTGCAGA 29340 

AATTTTTAGC ATTTCTGAAT TTTTCTTCAG TAAGTCATTG GTGGTTTCAG AGACTTGACG 29400 

TTGAGCAGTC ACAGCATCTT TTTGGCGGAG TAAGGTTAAG GCGATGACTA CTTGGTTTTT 29460 

CCATAACGGA ATCGCCGTAT TCACCGATGA TTGAATTTTC TCAGCCAAAG CTTGGTTAGT 29520 

ATTTTGGATT AAACGAATTT GTGGGGCTTG TTGAATCGTA ATTTGACGAG CCAAGCGTAA 29580 

ATCATGGGTT CTTTTTTCTA AACGGTCTAG GAATTGGTTA TAATCATTAA CAATTTGGAC 29640 

ATCCATTTGA TCTCCAGATT CTTCCGCTTT TTTCATGGCT TCAGGGATAA TTTGCGTCCG 29700 

TAGTTCTTCT GCTTTTAATT CCCCAGCGGC AATATAGATG TTTAAGGCAT CAAAATAATC 29760 

TTTATTTTTT TGATACAATT GATCTAACAT GGCATTATCT TTTAAAAGAC CATCTTTTTC 29820 

ATGGGTTAAT TTGACAGCAA TTTTGTCAAT TTGCGCACCA ATTTTTTGAT ATTTGGCGGT 29880 

AATTTCGTAA ACGGATTGTT TGACTTTACC TAACATTTTT TGAATAAAGT TGCCTTCGCC 29940 

AGCACGTAAT TCATTGGGAT TGGCTTCATT TAAACGAAAC ATTAGATCGG TCAGTGAATC 30000 

ACCAACAGGA CCAATGTCTT GTGCTTGAAC GTGGTTTAAC ATCGCTTGCG AAAATTCGCC 30060 

GAGTTTGATT TGGGCATTTG AACCATAGGT AATTACTGCT TGTGAATCTT GGACATCAAT 30120 

TTTTGAAGCT AATTCTTTTG CTTGGGCTTG ACGTTCTTGT GGTAATTTAT CAATTAAACG 30180 

AGGAGCTGCT TGCTGTTCTT TTAACTCAGA TAGTTCATTT TGTTGCGTTG GTGTTAGTTC 30240 

ATCCATTGAT GAAAATGGAT TACTTAATAA ATCGTCCAAT GTATTGGTTA CAGATGTTGT 30300 

TTCTTTGGGT TGTTGCTCAG TCATTGTTTT TCCTCCCTAA TGATTATTGC TGGTCTTCGT 30360 

TTACTTGAGG TGATTCCTCA GTTGCAGCTT TTTGCGACAA GCTGCTTTTA GCGATCGACA 30420 

TTTCGACATC TAAATCGTCT AAGTCATCGG AAACGATTTC CTCATAATCA TTTTTAACTA 30480 

ATTTTGACAA TTGGTCAATG ATTTGTGCGC TTTCTTCTAA TTTTTCATAC GTTTGTTTGT 30540 

TTTTTACTTC GTGTTGTTCG ATTTCTAAAT GTTTACTTGT TAAGTCAACG ATATTTGGTA 30600 

AATGTGTATA GAGAAAGTGA TTGGCTAAGT GTAACTTTTT AGGTTCTTTC ACTAACTCTT 30660 

TAAACAGAGC TTTAGAAACC TTCGTAGTAT CATTACGTAA GTCAATCGCC CGTAATTTAG 30720 

TTGAACGATT CATGTTTTCT TGCAATTGTA TGATTTGTTT TTTGGCTGTG CTCATTGTAC 30780 

TTCTGAAAAA ATCAATTTCT TGTGGAGATA ACCCCAACTC ACGATAATGG GCTTCTTTAT 30840 

CTTTCGTTAA GTAAGGCATT TGATCGCTTT TGTCGGCGGC ATTTTTCTTT TTTCGATAGA 30900 

GAACGAATAC TAAAAAGCCT ACCAGAATCA CAAAAGCGAA GAAGCCATAT TCTCTAGCGA 30960 

CAGcAATGaT TAAGGCTCCT AAAAGAATAT AGCCAATCAG TAATTTTGTA TTTAATTTGT 31020 

TCATTTATAT AGTTCCTCCT ATCTATCTAA TCCTCGAAAG AGTCAGCTCT CGTTACAATA 31080 

GTTATTTTAG CATACTCTGT GAGAGAAAAG ACATACAGCG GAAGATGTAT TTTATGATTT 31140 

AGTGAAATCC TAAGAAAAAA AGTATAAAAC GCAAAAAAAG ACCCGTCCTA ATTGGGGGTA 31200 

GGACGGGAAA GGAGTTAAAA ATGAAAAAGT GTTTTGTTAG GGTTGTTTGT TGGTATGCTT 31260 

AAATAATACA AGGAAGTTGT GAATTTTTTA TGTCTATTTT TTTTCGAAAA TGTAATTTTT 31320 

TTCTAAAGCA ATAAATTTTT TTCTTTGaAA AGGGTTTAAC TAGAGAAGAA GGGGTGCGAT 31380 

TTTTATCATT TTAGGGTATG ATAAACAAGA AAGGTTGGTG AAAGtATGTT AAATTATGAG 31440 

GACTTCGAAG rAAAAACACT TCmACgTCGA GrAATTTTTA AAGGrAAGrT TATTGATGTT 31500 

TTTTTGGATG ACGTGGCTTT ACCGACAgCG GCACAGCTAA ACGCGAATTG GTTTTTCATT 31560 

CAGGCGCAGT AGCTATGATT CCGTTAACCG CAGAAGGAAA AATCGTCTTA GTTAAACAAT 31620 

TTAGAAAACC CTTGGAACAA GTAATTTTAG AAATTCCGGC GGGAAAAATT GATCCTGGAG 31680 

AAGAAAATCA ACTGGAAACG ACGGCTATGC GGGAATTAGA AGAAGAAACT GGGTATCGTG 31740 

CAGGTCAGCT TACCTACATA AATTCAATGT ATCTGTCACC AGGATTTGCT AATGAAAAAT 31800 

TGGCTCTCTA TTTGGCAACG GATTTGCAGA AAGTCGAAAA TCCGCGTCCT CAAGATGAAG 31860 

ATGAAATCTT GGAACTTTAC GAGCTAACAA TAGCTGAAGC CAAAGCAGAG GTGGCAAAAG 31920 






GCACGATTTG TGATGCCAAA ACACTTTTTG CTATACAATA CTGGGAATTG TATTTACTGC 31980 

AGAGACAGTT TAAGGAGGAC ACGCAATGAG CAAAGGTCCG TTAGTCACTC GGACAGAGCT 32040 

TCGCAAACGC AGAGAAGCAG AAGAAAAAGA AGCGGAACGT CGTCAGCAAG AAGAGCAGAA 32100 

GCTGGCGGAA AAAGCGTATA AGCGAAAAGA AAAAGAAATT TCGACGTTTT ATCGTAAAGA 32160 

AAAGAAAAAA CaAAAACCGA TCAACAAGTC ACGAGTAGGA GAATACTCGA AGCGTCGAGA 32220 

ACGGAGTACT TGGTTAAACA AGGCAATTAT TATTGTAGCG ATTTTATTAG CCGTTGTGGC 32280 

ATATATCGTT TTGAATTTAT AGAAAAGAGG AATCACTATG AAAATTGGAA TTATTGGAGC 32340 

AATGGATCAA GAGGTCAAAA TTCTAAAAGA AAAATTGACA GACACGATGT CATGGGAACG 32400 

AGCAGGCGCT TTATTTGTTT CTGGTTCGTT AGGAAGACAT GAGGTGATTG TGGTTCGTTC 32460 

AGGAATTGGT AAAGTGGCCT CAGCTGTGAC CACGACTTTA TTGATTCATC AATATGGCGT 32520 

AAATATGGTG ATTAATACAG GTTCTGCTGG TGGTATCGGT GAAGGACTAA AAGTTGGCGA 32580 

TTTAGTCATT GCAGACAAAT TAGCTTATTT TGATGTTGAT GTCACTGGTT TTGGTTATGC 32640 

CTATGGACAA TTGCCTGGTG GTGTGCCTTT GTATTTTGAA ACAAGTGAAT ACTTGCGGAT 32700 

GGAAATTGCG AAAGCCGCTG AAAAAACTGG TTTAGCTATT CGTAAAGGAT TGATTGTTAC 32760 



TGGAGATA 32768 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6930 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



AAGTAGAACG TAAATGGTAC GTAGTAGATG CAACAGATGT TCCTTTAGGA CGTCTATCAG 60 

CAGTTGTTGC TTCTGTACTA CGTGGTAAAA ATAAACCAAC ATTCACACCA CATGTGGATA 120 

CTGGAGATTT CGTAATCGTA ATTAACGCTG ATAAAGTAAA ATTAACAGGT AAAAAAGCTA 180 

CAGACAAAAT TTACTACCGT CACTCAATGT ATCCTGGAGG ATTAAAATCA GTGACTGCTG 240 

GTGAATTACG CGACAAAAAC TCTCGCCGTC TTATCGAAAC TTCTGTAAAA GGTATGCTAC 300 

CTAAAAACAC TTTAGGACGT AAACAATTTA CTAAATTGAA CGTATACGGT GGAGCTGAGC 360 

ATCCACATGC TGCACAACAA CCAGAAGTAT TAGATATCAC GAACTTAATT TAAGGAGGGA 420 

AATTCATTGG CACAAGTACA ATATAGCGGC ACAGGCCGTC GTAAAAATGC AGTTGCCCGC 480 

GTACGCTTAG TACCAGGTAC TGGTAAAATT ACTGTAAACA AAAAAGACGT AGAAGAATAT 540 

ATTCCACATG CTGACTTGCG TGAAGTTATC AACCAACCAT TTGGCGTAAC TGAAACAAAA 600 

GGCGCTTACG ATGTAATCGT AAACGTAAAC GGTGGTGGCT ATGCTGGACA ATCAGGAGCT 660 

ATCCGTCACG GAATCGCTCG TGCATTGTTA CAAGTAGATC CTGACTTCCG TTCTGCTTTA 720 

AAACGCGCTG GGTTACTTAC TCGTGACGCA CGTATGGTTG AACGTAAAAA ACCAGGTCTT 780 

AAGAAAGCCC GTAAAGCTTC ACAGTTCTCA AAACGTTAAT ATATCTTATA TATCAACATC 840 

TCAAGACACT TTCCAAATTT TGGAGGGTGT CTTTTTTTGT GTTTTTAGGT ACGACAATGG 900 

GTACGAGTTT TTTAAACCAG CTATTCATTT TATATTTTTT GCTTCTGCTA ATCGTTGTTC 960 

AAATTTTTCA TAAGTGGATA GATCAACGAT GTTTGGTGTA TTCACATAGA CTTCTGTTGT 1020 

TTTTGTATCA GAATGGGTGA GCGCTCTAGA AATTTGTTCC ATGGTTGCGC CACCTTCATA 1080 

TGCTAGCGTA CTAAAAGTAT GTCTTAACTT ATGAGAAGAA GTATTTATTA AGTGTTTGTG 1140 

TCGACGTTTA ACTGAGTTGA TGCGGTAGTT AAGATAGTCT ATGTGTACCG GGACATTCAC 1200 

ATTTCCCTTT CTATCAATAT AAGTGAAAAG AAATTGCTCT GCATTTTGnT tAATTCCAAG 1260 

TTGTAAAAGT TCTTGTGCTT GTTCTTTTTT CCATTTCATT AACAACTCTA TAAGGGAGGA 1320 

AGGCAATTGG aATTTAGTGT TTTTTCTTCC TTTGGTAGGA GTTAAATTTC CTAGCTTGwC 1380 

TTTGCTTTGG ACAAGTAGTA TTGTTCCATT CTGTAAATCG ACATGTTTCC ATTGCAGAGC 1440 

ATAACTTTCA CTTTTACGAT CGCCTAAATT TAATGTTAAG GTAAAAAGGA CAAAATCTTG 1500 

AAAGGTTAGA TGTTTGTTAT TGTAGTCTTG TTGAATGGCT TGAATCCACT CAAGAAGTTG 1560 

TTCTGCGGTT AGTGCTTCTC CTTTTAGCGT TCACTCCATT TTTCTACGTT GTTTTAAAGG 1620 

ATCTCCTACG TAACGAGTAA TCTTAGCAAC ATGATTGTAT TCGATGTAAT CAAGAAGTTC 1680 
TGCAATATCA AACAATTGAT TTACATAGCT TTTAATCGTT TTGATATTTG CATATTTTTG 1740 

CGCTTTCGCA TTTAATTTTC GCAAAACTAA ATCTTTATTT GTATTCAATT CAAAGAGAGA 1800 

GTAACTACCG AACATAGGTA GAATATGAAG ACGAAACAGG TCTTTAGTGT TTAAAATCGT 1860 

TTGTTCTGAT GGAATCATTC GGTTTCTCCC ACTAGAACCA TTGCAATACA TGTCTAACCA 1920 

GACCTGTTCA TAAAATTCTT TGAAAGAAAT ATTTGCTTTT AATTCAAAAG CTCGTTCATG 1980 

TTTTTCAAGT TGTACTTTTT CAATTTTTTT AAGAATATCT TTTTCAGCTA CCAGAGCTTC 2040 

ATTTTTGGAA GAGAACGTTT TTCTGTATCG CTGTGAAGTA ACACCCAGAA TTTCTCTTAC 2100 

TTCTTTAGGA TAAAATACTT CGACTTTATA ACTTCCGCTT TTCATTTTTT TGATTGCCAT 2160 

TAATATTTCC TACTTTCTTG CGTGAATCAG AACGTAGAAA GACTATCGTT TGATTATAGA 2220 

ATAGATACTT GAATGTTTAT AGGTTGGTTT AAGCATATTT TTTTAACCAA TTATCAATCT 2280 

CCAACTTATC ATAAAGTACT ATTCCGTTAA TACTGATTCG TGGAAGTCCA ACTGTAATTA 2340 

ATTTATCAAT AGTATTGTTG GAGAGATTAA GATATTGGCA TAGCTGATGC TTTTTAAGAT 2400 

AACGATTATT TAATCCTAAA TCAATTTGAA ATTGTGTGAT TTCATTTTCT AGTAGTTTGT 2460 

GAATGGCTTC CTGTAATCTT TGTTTTTGAG TGTCAGACAA TAAAATAGTG AGTTCGTTCA 2520 

TTTATGCGTT CCTCCATTTT TCTTGATTTT TGTGTAACGA ACATTTTGTT CTTCAAAACC 2580 

TATTTCTTTG TTTGGGATAT TTTGTTCATT CCAGTCTTGA AAGAAGACAG GTCTTTTACT 2640 

TACATAATTG TTGGTAAATA AGACTTTTTG ATCTTTTACT TCATAAGGTC GATCTGTCAG 2700 

TCTTTTTATT TCGAGCGGAT GCTTCAAGTT TTGAGATTTA AAGAAGGCTT TTTTCTTATG 2760 
TATCAAAATA CTTGCTGACA TACCGTCCTC TATTTTCAGC 2820 

TGATTTTGAT TAGTCCATGT CCCCAAATAG CTGTTAGTTC 2880 

AGCTGAATAA AATAACGTGA TAATGAATAG CGCCATGTTT 2940 

GGTACTTAAT TTGGGCTTTC TTTGTCTTGT ATAAATGGTA 3000 

TTTTGAACTC ACTATTTGTA TAGGCTATAT CTGTTATATT 3060 

AAAATTTTGT CTGGTTATCC AAATTCATAT CAATGAGTCG 3120 

TTTTGTAATG TTTTTGTTTT CTCTTTAAAC TGTCATATTG 3180 

CAAATGTCTG TTTTGTAGTT TTTAGCCAAT CAGGTTTTTC 3240 

TTTTTGGTTT TGTTAAGATT AATTGTTCAT ATTCCCAAAT 3300 

CAATGATTTT TTGATTGTAT CCATGTAGCA ATCGCAATCA 3360 

TGTTCTTACT GGTTATTCAG TTGTCAAAGA ACGAGTGTAT 3420 

GATAATGTTC TATAAATCAA GTTAAGTAGG AAGAAAGCCT 3480 

TCATCCATCG TGTGGTTGAT TTTCGAGCGG AAAGCGGCTT 3540 

CTTTCGTCGA AACCACCCGC TGGAAGTCTG CTCTTGGGTG 3600 

AGCCAAGCGT TGTTCTAATG TTTGATAGCT TCTTTCCAAT 3660 

TGCTTGAAAC AACTCCTCTT TATTTGCCAA CTTAACTTCA 3720 

TAGTAATTCA CTATGATTCA TGGTATCTAA ATGATGTTTC 3780 

AGAAGTATTT CTGCCCAATA TAGGGATCAG TTTTATTACT 3840 

TTCCAGTGAA AACATTCTGG TTTTGCGGTG ATTGATCGTT 3900 

GGCAAAACGA ATATAATCAT AGTGAAATTG GTAATCGCAC 3960 

CCATTTGTTT AAAGTAGAGA GGGAATTCAG CCCCTCTACT 4020 

GGTGAAGTAA TCAGTCATTT GTTTGGAACT AGTGTGGCTT 4080 

CCATAGCTAA CAAATTGTCC ATTTACACGA CCCCAAGGAG 4140 

CGAACGATTG CCATACCTTT TTCATCTAAT GTTTCTTTCG 4200 

TTAACTTTGA CACTGATTTT TTCAAATCGG AGTTTTGGAA 4260 

CCCAACGTTC GCCGTCTTTT TGCCAAAGGG AGACTTGGAC 4320 

GCTTTTCTTT AGGTAAAGTT AAATCGTTAA TTTTTACTGC 4380 

TTTTAGTACA TTTTTTAGTA CAAACATATT CTAATGGAGC 4440 

TTTTAGCACA AAAATTAGTA CAAAGCTGTT TTACTCTTAG 4500 

TTTTGATAAA AATTGGCTTT TAACCATTGA GGAGATTGAG 4560 

AAAAGTAGGC GAAAGAATTC GAGAAATTAG GACAAATCTA 4620 

TGGAAGTTTA TTAGGTGATT CTCCACGAAt TCTGTGAACA 4680 

ATTCCTAAAA GAGATAAACT AGAAAAAATT GCGATATTAG 4740 
GGAATATGTT ACCAGACCAG ATACTATATG GTCGAGCAGA TGAATATTTA TATGATTTAA 4800 

TTGATAGCAG TTTCAATGTT AAATTCAGCG ATAATATTCT TTATGAAATT TTTGAAAGTG 4860 

TACCACCTGA AAAGCGTTCT TATGATGATT TGATGTGGTT AGCCGTTGCG AAATATTTTA 4920 

TTGATAATGG TACATATGGA AAAGTTGTTG GAAAATTCAA TTATATTTCA ATGCTAGGTA 4980 

TACCGAACCT TTATACAGGA AGCTATCAAA ATGAATTTAT TGAACAACGT GAAGATTATA 5040 

AACAACAAGA AGTTAAATAC TATGTATATG CAGATCCGGA AAAAAACATA TTACACATTA 5100 

TGCCATTTGT TCCTAATGAG AAGAATATAG AACTGCTTTA TGAATTTCCT GAATTATTAG 5160 

AACAACAAGC AGATCACGAA GTATTTACTG CAAATTTTGA ACAAATTGGG CTTACCTTAA 5220 

AAGGATCAAC CATTGTGTAT TATGGAATTG ATAAACAAAA ATTAGAAGCT AAAACACAGG 5280 

CATATAAATA TGATGAAGCG GTTGATTTAT ATAGAATAGC TCAACTACCA GAAACTTTAT 5340 

CAATGTTTGA TGATGAAATC CAAAAGCAAA TTTCGTTTTT AAAAAGTAAA TAATTGCATA 5400 

TAAAAAGAAG TCAACAGTGT TTCATAACTG TTGGCTTCTT TGTTTTGCTA TTTAGTTTAT 5460 

AAGTTAGGCC AGAATCAAGT ATATCGTAAG TCTAAATTCT AACCTTTTCT CTTCTTTTTA 5520 

AATTTTCAAG CTGTTTTTCT AAACTCTCAG TACGTTGTAG AAAGTCATTG AGAAAGATTT 5580 

TGAATATTTG GGCATTTGCA TGGTATTTTC GAATGATCTT TTTTTTATTA ATTGTTTCTT 5640 
GTTCAAAATC AAAAAATTTG TTTAATATCT CTAATGTTGA TGTCCATGAT TGAAGTTGTT 5700 

CAAGTGTTAT TTCTAGCGCA TTGATATCTT GTTGAGTAAT GATTTTAATC GGCTGATTCT 5760 

TCTTTGTTAG TAATTCTTTA TTTTCTTTTT CCGATGTCAT AAGGTTTCCT CCTGCTATTT 5820 

TACAAAATCC ATGTCTATAC GAACAACAAT CTAAAATTAT TATACTATAC CATGTTTATA 5880 

TAGACAACAC GGGGAGTGAA TATGAATGAA CGAAAATGAA GGTGGTAGAA AAGTGGAAGT 5940 

GCTGATCAAC GATTTATTAG CAGAACGTCA AATGTCTTTA AGAGAACTCG CAAGATTGTC 6000 

GGGAATTGAG CCTTCAAATT TAAGCAATTT GGCAAATGGa AAAAGACAGA AAATTTATTT 6060 

AGAACACATT GAAAGAATCG CAGATGCGTT GGAGATTGAT GATATATCGA GGATATTAAA 6120 

GTTGAATAGG AAGTAGTATT GGGGATATAA AGAATAAATA TAAAAATAGT TAAAGATATT 6180 

GTTAAATACT TTAGATTAAA TGAAAAAATT ACAGAAATAA ATTTGATAGA AGAAAAGTAT 6240 

AAAGCGAACT ATTTGATTTT TAATATTTAC TGATGTTTAC GATGAAGGTG GAATATAAAT 6300 

GAAGAATGAT AACACTCTTT CTGAAGAAGG AATTGATACA GATAAAGGAA TAGTTGGTTC 6360 

AATTTTTGAA AGTATGCTAT CGGATACTTT ATCAGAAATT AAAAATTTAG ATAATGTTAA 6420 

AATACAGAAG ATAGTAGAAA ATATAGGGAA ATATGATAAA AAAGAGTTTT TGCAAAGACT 6480 

TGCTGCACTA AGAATTCCCT TTGAAAATAG AGATAAGGCT GTATTATTGG ATGCAACAAC 6540 

TACAGCAACA CTGAATTGGT TATCGGAAAA CAACTGGAAT TTTAATGGGC TTTCAATGAG 6600 

CTATGGTAAA TTCAAAAAAG TTATTCAACA GATTAACCAA TTAGACTCTA AAATGGCTAT 6660 

TGATCCATTG GATAATCCCT ATATAGATAA TATCCaaTTT TATGGTAATC ATAAGGTAAT 6720 

GCCTGGAATA AATTTTGCAA GCTCGTATAA TTTGCAAATG ATGATCCAGT CTATATTTTT 6780 

ATCAAATCGT ACAAAGTTAT CTGATGAGAA AAATAGATAT ATATCCATTT CATTAAATGA 6840 

TAACCTTATG GTTTCTACCG ATGTTTGTAA AAATGTCAGA TTCCAAACGA ATTnCCATCA 6900 

TTCACTCGCG AAATATnTAT nCCTAATAAA 6930 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

TGTACTTTCG TCCAATATTT ATTTCTTCTT CAAGCTGATA AATAGACACA TCTCTATCTT 60 

TTGCTAGTTT TTTTATACGT TCCAAAAGGT TCATATGAAA ACATTCTCCT TCTAGTTACg 120 

mGTTATTTmC AAAAAAAGTT AGTAAAAATG TTTGACATTT ACAATATTTG TTAGTAATGT 180 

AATCACGTAA GCTAATTTAT TAGCTAATAA GTTCTCAAAT AAAACCTATA AACAAAACTT 240 

GAAAATCGTT GGGGAACGGT AAAAGTATTG TTTTAGAAGG CTTTTAAAGT CTTATTTAGC 300 

TATGGGTTCA TTTTACAATA TGTGTTAGAA AGTGTCAACG AATTTTATTA AATTAGCTAA 360 

TTTTTTAGCT TACAAATTAA AAAGAAGGAA TGGAAGAAAT GAAAATTGAC GAaGCTGTTT 420 

CAAAAGCAAT GAAAAAAGGa AAATATATTT ACAGGGAGTC TGAAAATGAC TGTTCAGCTC 480 

ATGTAAATAT ACTTCCTACA AATACCTATG ATTGTTGTCT ATTACTACAA GAAAATAGTA 540 

ATTCTGTTGG CAAACGGTGG AACCCTACAG CAAATGATTT AATGGCGAGT GACTGGGAAA 600 

TACGTTAGTC AAGGCCCAGT GTTTTTGTAA TAACTTTAGC TGCGATTTCA CTCATTATGT 660 

CAATTGAGAC ACTTGcTAGT TTGGATGATA TTnTTTTTGA TTCCTTCCAA tTTTCGGATC 720 

TCGAATATTG TCCAAATATT TATGACCATC GAATGTAAGG GAAGAGATAC ATGTCATATA 780 

AGGTTTGTTA GATGCAAACT TAACATATCC TTTTATAAGT TGTGCATCGT CATCACCTAA 840 

TTTGGAGACG GCATAATTAA TTTCCTCTGA ACTATATTTT TTTAGTAGAT CTGATTTAAA 900 

TATTTTATCA CCAAAAGCCG GTTCATTGTA CGGAAGGCTT TCGCTAAATA ATAAAATATC 960 

ACGAATACAA TCATGAGACA GTTTCATATC ATTCACCACC TTATCAGTTA TTTCAGCAGA 1020 

CCACTTGCTG ATAAGGAAAT TATATCAAAA AAAGAAAGTG AGGTAATTAA ATGTCACAAG 1080 

ATTTAGCTAT TGAAGTAAGG GCAGCGCTAA TTCGTGCAAA GAAAACTCAA TCTTGGTTAG 1140 

CGAAACAATT AGGGATTTCA AGTCCGTACT TATCTGATAT TCTTCATGGT CGTAGACGTT 1200 

CAGAAGAGCA AGTTCGGAAT ATCAmAAAAA TATTAGATAT TAGATAGGAG GTGTAGGTAA 1260 

AAGTGGAAGA AGCGATTATC AAAGTTGACT TACAAAACTT AAAGAAGTTG ATCAAACAAG 1320 

CAAAGGAGCA AGCTGACCAA CTTCAAAAAA CTTTAGATGA GATAAATAAA ACTAAAATCC 1380 

TAATTTCTTA GCTACCCATT CTGATCCAGC AGCAGACTTC ATATCTTCCC ACGAATCGAA 1440 

GTTTGTGTTT GAGCTAATGA AATTATCTAA TTTATTGTCA TCTATTGATT CCATATCAGA 1500 

GAAATCAAAT CCGGATTTTT CAATAAACTC ATCAATATTA GAGAATTTAG TATTTTCAAT 1560 

CATAAATTTT TTAGTGAATA GTTTATCGAA TGGTACAGAA TGTTCGTCAT CTAAAGACTC 1620 

GCCATTTTTA GCAAATTGAT TTAGCTCATG TTGTAATTCG TCAAATCCAT TTAATTCGAA 1680 

TTTCATAATA TTTCACCTCG CTTTCAAATT CATTTTACCA AGAGGTGAAT TGTAAAACAA 1740 

TCAGTATAGG AGGTGATAGC AATGGAAGTG ATTTTAACTC CAGAAAATGA AGCTTCTCTA 1800 

AGAGATTTTG TACACGGAAT TATTGTTGAT GAAATAGAAA AAGCaCGaAG aGaTACCGcA 1860 

GTTGaTAAgC gaGtCtTAAA tCaAACaGaG rTTGcaAAAT AtTTcaATGt TTCCACAACA 1920 

ACAATAAGGG AATGGGAGAA GCTAGGTCTT CCACATGGAT CAGTAAGTAA ACAAGGGAAG 1980 

TTCTACGACA AAGAAGGGTG TCGGAGATGG CTTCTATCAC AAAAAAGATA AATCTTGGGC 2040 

AAGCGAAATC AAGGGAGGAA ATAAATATGA AAAAAATATA TCACTTAAGA CGTATGGCGG 2100 

CACTGTTGGT TGTATTCGGA CTAGGTCTAT TGGTAGGTGG CAATATTGGC CCATTAATCC 2160 

AAAACATATA TATAGCAGCT TTTATCATTT GGTTGCTCTA CTACGATTTA GCGTTGGAAG 2220 

ATCGAGAAGA AAAAAAACAA AAATAAAGAC CCACTTCGAC GGCCATCAAA GTAGGTCAGT 2280 

TACAAATATC AAATTCAAGG AGAGTGTACC ACATGAATAG AAAAATTGAA AGAATGATTA 2340 

TTGAACTTGA AAAAGAATGT AAGGCACAGA ATGTTGAACT TCTTTTATGT GCTACAAATT 2400 

TTGAAACAGG CCAAGGAAGT ACTGCGTTTT GTGGTTCAGT TATCGGTTTA GCTATACTCT 2460 

TGCAAAAACT TGTAGGTGAT CTAAAAGAGC AATTAAGTAT AAGCGAATCT TGTGATTGTC 2520 

CAGAATGTGT AGCAGAAAAA GCCGAAGATG CTGCAAATGA AAAATCTATG GATGAACTAC 2580 

TAACTGCATT TTTACGAGGT GAACTGCAAT GATTGAAGTA AGAGGTTTAA GTGATGATGT 2640 

TTACGAATTA ATGTTAGCGA ATGCTCAAAA TAGGATTATT CAATCAATTC GAACTGCAGC 2700 

AGCAAATGGT AATACAAGTT GCGTAGTGAA TAGTAAAGGT CTTACATCAA CGTTTTTATC 2760 

TCAATTAGAA ACAGAAGGAT TTGATCACGT TGAACTTGAA GAAAACAAAA CGAAAATATT 2820 

TTGGGAGTGG TGAAAATGCC TGAATTTGAT TCATTAGGGG CTAGACAAGA ACCTCCAGAA 2880 

GAAAAAGAAG CATTAGAGCC AACATGGGAA TATGACGAAG AAGAGGAGAA TGACAATGAG 2940 

TAACGATTTA ACACAAATAA CACAACGATC TTTAGATGAA CAAGTCATCG GAAATTTGAA 3000 

TAGATTGCAA GAGCAGGGAT TAGAAATGCC ACCAGGTTAT AGTCCACAGA ATGCATTGAA 3060 

AAGTGCTTTC TTTGAACTAA CCAACAATTC AGGAGGAAAC CTTCTTCAGT TGGCAGCTAA 3120 

CAATCCAGAA ACTAAAACAT CTATTTCTAA CGCCTTGCTT GATATGGTCA TCCAAGGATT 3180 

ATCGCCAGCT AAAAAACAAT GTTATTTCAT CAAATATGGA AATAAAGTTC AGCTTATGCG 3240 

CTCATATTTC GGAACCATGG CTGTATTAGA TCGAGTAACT GGAGGGGCAG ATATCACGCC 3300 

TGTTGTAGTA AGAGAAGGCG ATGTATTTGA AATTGCTATG GATGGTCCCG ACTTAGTTGT 3360 

TGCTAAACAT GAAACGGCCT TCGAAAACCT AGACAACGAC ATTAAGGCTG CTTATGTGGT 3420 

TATTAAGCTA GCAAATGGTA AAGAAGTAAC AACCGTCATG ACAAAGAAAC AAATTGATAA 3480 

GTCATGGAGC AAAGCAAAAA CAAAAAATGT TCAGAATGAT TTTCCAGAAG AAATGGCAAA 3540

AAGAACTGTC ATCAATCGAG CTGCTAAATA TTTAATCAAT ACTAGTAACG ATAATGATTT 3600 

ATTTGTGCAA GCCGCTAAAG ACACACTCGA AAATGAATTC GAACGAAAAG ATGTAACACC 3660 

AGAGCGAGAA GAGCAAGCTG CGGTACTTGA AGAAAAACTA TTTTCCAACA ATATAAAAGC 3720 

TGTTGATCAA GAAAACGAGA ATGAACGAAT TACACGTGTA GCTGACGTAC CAGAGCAACC 3780 

CGATATTGAA CAAGCCAAAC CAATTGAAAA AGATAATTTA ACGAAAGTAG CTGACCAAAT 3840

TTTAGAAGAA CCAGTTCAAG AAACTTTAGA TGTGATGGCT GGTTATGAAA CCAATCAGAA 3900 

AGAGAGTGAA GCTGATGTCT CAACGATTGA AGAAGACGAT TATCCTTTCT GATGAAAATT 3960 

ATTATTCACA AGAAGCGGAC CTAGCTTATA TGTCTGTCTC TCAATATAAA AAATTTCTTG 4020

AATGTGAAGC TGCAGCTCTT GCCAAGTTAA AAGGCGAATG GACACCAGAA AACGATCCTA 4080 

AAGCATTGCT AGTTGGTAAT TATGTTCATT CTTACTTTGA ATCACCAGAA ATTCATGAAG 4140

CATTTAAAGA AGAAAATAAA AGCAAGATGT TTTCTTCAAG AAAACCGTTT GGCCTACTGA 4200 

AAGATTTCCA AATTGCGGAG CAGATGATTG AAAGATTAAA ACAAGAAGAA GCCTTTTTAA 4260 
ATATTTATCA AGGTGAAAAA GAAGTGATCG TCACAGGTGA AATTGGTGGT GCAATGTGGA 4320

AAGGGAAAAT TGATTGTTTG AATTTAGAAG AAAAGTATTT TGTAGACATC AAAACAACCA 4380 

AAGATATGCA TGAGAAGAAA TGGGATGAAC GTTTAAACAG AAAAGCAAAC TTCATTGAAC 4440 

GCTTCGGTTA CGTGTTACAA ATGGCTGTTT ATTGCGAACT GCTTCGGCAA CAATATGACA 4500 

AAATTTTTCT TCCTCTCATT GCAGCCGTTT CGAAACAAAC ACCTAGTGAA GCAAAACTAA 4560

TCACTCTTAG CGAAGAAAAA ATGATTTACG AATTAGAAGA ATTAAAAGAA AACATCGAGC 4620

ATGTTGTGCG AGTGAAAAAC GGCGAAGAAG AACCAGTTAG TTGTGGGATT TGTGAATATT 4680

GTAGAGGACA CAACAAAATT ACAAATTTTA CCAGTATGGA CGATTTATAA AAGGAGGCGA 4740

GCAGGTTGGC CATTGGAGGT TGGATAAAAC TTTATAGGAC CATTCAAGAT CACTGGATTT 4800 

GGGAAAATCC TCAATACCTT AAATGGTGGC TTGATTTAAT ATTCATGGCC AtCACCAgAC 4860

AGAaAGATTC TTTTTGATGG AGAGTTAAAA cGGTGAAAGT CGGAGAAAGA wTTACATCCG 4920

AAAAAAAaCT TtCTGAAaGG TGGGAaGTAG CCGAAACACC GTCAGAAAAt TTTTAGATTT 4980

GCTTGTTAAA GATGACATGA TTGAATTAAG CAGAAGTAGA CAAAACGGGA CATGGTACAA 5040 

AGTCAGTAAC TACGCAGAAT ATCAAAACTT TTCTGAAATA AAAAAACAAC GAACTGAACA 5100 

ACGAAGGGAA CAACAAGCTG CACATCAAAC TGAACAACGA ACTGAACATA AACAAGAACC 5160 

TAAAGAACCT AAGAACCTAA GAAATAATAA TAATAACAAT AAAGGGGCGT TCATTCGTTC 5220

AATTTGGGAA AATAACGGAT TTGGATTGAT GTCGTCTAAA ACCATGACCG ATTTTGATTA 5280 

TTGGATTTCT GATTTTGAAA AAATCGGAGC TAGTCAAAAA GAAGCTGAAC AATTAATTGT 5340

TAAAGCTATT GAAATTGCTA TTGATGCAAA CGCAAGAAAC TATAACTATA TCAATGCCAT 5400

ATTGAAAGAT TGGGAACAAA GAGGGTTCAA ATCTGTTGAG GAACGAGAAG CGGCAAGGAA 5460

GCAAAAGAAA ACAACCAAAA AACAGAAATC AAACACAGGT CATTCTGATT ACGATGATCT 5520 

TGGATTTTAG GAAGTGAAAG AATGCAGTCA GCATCAGATG GATTTTCAAA AATGATTAAA 5580

ACGTTGCTTT ATATCACGCC TGATCCATGT CCAGAGTGCG GAGGAAATCT TTATGCTTGG 5640

CGTGCAAAAA ACAAAGATGG GTCCGATAGG TGTCCGCCAA CTTGCATGGA ATGTGGCTAT 5700 

AAAGCACGCA AAAAAGCAGA AGACCTTGAA ACAGAGAAAA TGTTTAACGA TAGTTTGAAA 5760

GCCAGAGCGA TTAATTATCT GAAATATAGT TCGCTTTACA CCGACAAAAA TTTAATTAAT 5820 

TGTCGTTTTA AAACTTACAA AACAGTAGAC ACAGAAACTA AGCTTGCTTT TGAAATTGCA 5880 

AATCGAGCCA CAACTGAAAT TCTTTTGAAT AAACCAATTC ATATGATTCT TTCAGGCAAA 5940

AGTGGTGTTG GAAAAAGTCA TTTAGCTATG TCAACGGCTT GGGAAGTGTT GGAGAAATCA 6000 

AACTATGATA AACGCTGCTT ATTTATTAGC TATGCGGAAC TCTTAGAACA ACTAAAATTT 6060 

GCGATGAATG ATGAGCAAGC CAGAAAAGAG ATAACAGGTA GTTTGATGGC AGAAATCAAA 6120 

AGCGCAGATT TAGTAGTTCT GGACGATTTA GGAGCCGAGT TGGGAGTTAA ACAAATTGAT 6180 

GATAGGAATA AAAGTACTAA CTTCAATAAC GACACCTTGA ATCGCATTGT AGAAGCTCGG 6240

CAGAATAAAG CAACTATTTT TACTACGAAC TTAACTGGTA AAGAAATGAG TCAAGCTTAT 6300 

GGGGAGAGAA TTCTTTCTCG TATCATGAGT AATTCACAAG GTTTTGTGAT GAAAATTGAA 6360 

GGGACATCAG ACAAACGAGT AGCAGGCATC TAAAATGTTA TTTTTAGCGA ATATATTCAG 6420 

CGTAGAGCGG TTTTACAATC AAGTGAATAT AAATAGATAC AAAGAAAGAA AAACGGATTA 6480 

AAACGCATTT TAAAGCCtTA AAAACAAATC GATAGAAAGG GGAATCATTC AATGCCGTAT 6540

GTAGTGAAAA TTTCAGCCTA TCTTGGCAAA GATGGTCGAC CTGTAGCCAA TTTAAAAGAT 6600 

GCTGTGCTAT TTGAGCAAAA AGAGACAGCA GCTATTGCAA CAATCGTATC TGGCGGAACC 6660 

GTTTCAGAAG TAAAGGAAGC CATTATAATA CCAGAAAAAC CAAAGAAACA TATAGGGAAA 6720 

TCTATTAAAC GGATTGATAA GAAGGAACCG ACCGAAAAAG CTACCAAAAG TAATCAAGCC 6780 

TGGATGAAAG GGGCTAAATA AGAATGAAGT GTGTTAGATG TCAAGATCAG CGCGTGATTT 6840

GGGGCAAAGA CAGATTTAAT TATGCAACAC CTATTCCATG TCCTGAATGC AACAAAGATG 6900 

GAAAAGCAGT TCGAGCGGAA ACTGCGACCA AGGAAAGGGA GTTAAAACAA TGCAATCACC 6960 

AACAGCCCTG AATAAGCGAG GAAACAAAGT CACAATTGAT GGTTACACAT TTGATAGCCA 7020 

GAAGGAAGCT AACTTTTATA CAAAGTTTGT CAAAAATTGT GGGTTACCTT TTGAAGTTCA 7080 

TCCGCGTTTT AGACTAACCG AACTTACACC AACTGCGGAT GGTATAGGCA AAATTTCGGC 7140

GATAGCTTAT TCACCTGACT TCATCATAAA AAACTTAGAT GGGAGTTGGA GACATGTCAT 7200

TGATATTAAA AACTCTTTTG GCGTGTATGG TATTGACCAA TCCGTTAAGC TTCGTTTTCG 7260 

TCTATTTGCC CTTCGATATG GTCATCCAGT TGAAGCGATT GTTGTTCGTG CTAGAGATTT 7320

TAAAGTGATC ACTCAAGGTG TAACTAAGCC TTTAAACGAA AAAAGACCAT TCATAACCGA 7380

TAATTTCGAT TACGAATGGA AAGATGCAAC TAATTATTAA ACGAAAGTAG GAAAATAAAA 7440

TGACAAAACA AGTAAATTTC AGACCAGAAG TGAAAAAAGT GACATCTAAA TCAAACGGAA 7500 

ATATCGAAGT GCTATTAGTG GTTAGCAACG CTTCATTAAA AGGGAAATAT GAAAGTTTAA 7560

ACGAATTTTT AGGTAAAACA GTATCAACGA CCATTGAGCC AGAAACAGTA GAATACAaGG 7620 

TACCAGTTAA CAAGCrGACC AwTAAACCGn ATGTCGAATA TATTGyGmnT AACGACGGAA 7680 

CAGTTGAAGT TCTAAAAGAA GAACAAACTT CTTTAGAAAT GGGCGATGAT GTGCAAGAAG 7740 

TTGAAGAAGT TGCTGTGCAA GTATCGAAAG AAACCATTGA CGAATTCATC AAGAAGGCAA 7800 

CAACAATCGA ATGGCCAGAA TCAGTAACAA TCAACGTTCG TGGCGTGTTG CATCGAATCG 7860

ATGAAGGGGA AGCCTTAGAA GAAATTGCAG CTGATCATGA TGTTTCAGTT GAAAATCTAA 7920

TCAACCAAGT TGAACTTGCA CGCCAACATT TTGCACCGTT TGCAGATTCT TGGAGCAAAA 7980 

ATAAAGAGAA CATCATTTTC CCTGAAAAGA CAGTTGAAGA TGATGAAGAA GAAATCGAAT 8040

AATAATCTCG TAGAAAGTGA GTGTTCATTT TGCTTGAGAT TTATTATACG CCAACATCCG 8100 

CAATTATTGC GGATGCACTG GCTAAAACAT ATGAAGTCGT TTCTTTAGAA ACAGCTAGAA 8160 

ATATTGCCAA GAAATTTAAG GCTAGCTTGA AGCAGAAAAC GGATCTTTAT GTAATTGAGG 8220 

GAATTTTGAT TGATGCTGGT TATAAAAAAG AGCCAGTGAA TTTATAGAAG GGAGTGGAGG 8280 

TTTGCGGCCG CGATTAAAAA GCTTTTTACT CCGTTGAGAT TATGAATATT TACCAAAAGA 8340 

AAATGGGTAA AGTTATGAAA GTCCTTTGCG GTAAAAGAAG TGAAAACATT GGGTAAATAA 8400 

AAAGTGAAGA AGAGTAGCTT TTTACTACTC TTCTTCGCTA GAAAAAATTT TTGAACAGAA 8460

TAAAAAGCCA AGGTCAGTTA CATGTATGGA GCCACGAATA CTTATAATGC TAAACCCTGT 8520 

ATCTTTAAAA AAAGAACTGT TTGCAAAAAA ACTATTAAGA TTATATTTCA AAGTTTTAAA 8580 

ATAAGTTGAA TTTTCAAAAT CATCATAAAT TTTTGTATGT TTTTTTTGTG TTAATTCAAT 8640 

AGAAGGTCTT ATATCAATTA ATCCATAAGA GCGTAAAGTT TGTAAAGAGG GACTTAAATC 8700

AGAAAAATTA TTTTCAACAA TTCCTTTCTG AGTAAGGATT AAATTTTCTA TATAATCTAT 8760

ACCGTTACCA TTAGGAGCCT GTACTCTAAC CTTTATAGTT GGGGTAATAC TAAGTTTATA 8820 

CAATTTTTTT AATAGATTTG CATCTATAGT CGTTAGTTCA GATAAAATAA TTGAAAACCT 8880

TGGCGAAGAA TCTTGATTTT TTCTGTTATC AACTAATCCA GCAaGTAAAT TAGAAAAGTA 8940 

TTCACGCATA TCTTCATCTT CTAACTGATA TCTAGCGTCT TCCATTGTTT TTAAAGCAAA 9000 

CCCTAATTTC GAATCATCTC TGTTATCTAC AGGGATACTA TCGGTTTTAG AATTAATCTT 9060 

TTCTACAAAA TCAATATAAG ATTTTTCACT AACAACATTT AGTTTTCTAA GAGGACCTAC 9120 

AACATAATTT ACAATCCCGC CAAGGCCTTC TCCAATTGAT TTTGTAGCAG GTCCAAAGAG 9180

ATTAGTTATC AATTCCTTTG GTAATAGTTC AATGTTAATT AAAGATTTTT TATCGTCAGA 9240

CATTTAAACA TCTCCTTTTT CAAAATATTA TACCAATAAA GAAAGGAATT TTCTATGCAA 9300 
CTTAGACCAT ATCAAGAAGA ATCTCGTTCC GCTGTTCAAA ATGAATGGGA GAATAAGAAA 9360
AAGAAAACAT CATTAGTTTT GCCTGCAGGA TGAATCTGAA TGAAAATATA TTGATATGAA 9420
AGTTCAAAAA ATACTGGGTG TTGGACAGAA GGCCTAGATC ACTCAAATAA AAAATTTGAA 9480
AAGGAAAAAG GCTACTTGAT GGTTGACGAT CCTGAGGAAG ATGAAACAGT AAAACTAGTC 9540
ACAATTGAGG TGATAAACGT TTTAAAATCC TTAAGAATGA CCGTAACTAT CAAAAAGGCG 9600
ATATCTTACG CTTAAACGAA TATCAAGACG GACAGTATAC AGGTGATGTT CATGTCTCAG 9660
AAATAACATA CATTACAGAT TATACTCAAC AAGATGGTTA TCTAGTGTTA GGAATTAAGT 9720
GAGGAGGATT AATAAATGGA ACAATTCTTA TTAACAAAAA CTGGTAAAAA ACAAATTGAT 9780
ATAAAGGGTA CGGGAATGGA TCATAACGAA ATTGTCTTTA CGTTAGCTGC TACTTTAGTT 9840
GGATATAGCA AAGAATTAGG CCTAACAAAA GCAATACTAA ACGAAAGTAT GTACGTGCTG 9900
TGGAAAGATG GTGAATAAAT GAAACGTAAC TGTAAAAGTG TAATAAATAA AGTTAGTGGC 9960
ATTGCAATAA TGATTCTTGT AGCAAAAGCA ACCGTGAGCC ATTTCGTGTA TGGAAATGAC 10020
ATAACAAGCA GTGACCTTGT TTATTTCCTT TCATGCTCGT TTATTTTGGG ATTAGGGCTA 10080
TATTTAGGAG GTTCCAGTGT ATGAGTTATC CAGAAGTTTA TATCATAGGA AGGCAAGTCG 10140
ATGGCGTTTA TGTTGAATAC TTACATGGAG CAGAGCAAGC CGATTTATTT TTCGATTATA 10200
CGATAGCTCG TGATGAAAGA AATCATATGA ATAAAACCAA TATGAAAGAT GGTACTTGGA 10260
AAATTTTGAA ATATGGTAGA CCGATAACGG TCGAATTAAA CAATAAAAAA GCCAACCGAC 10320
CAATGGTTGA CTAAGAAGAA TATTTTACCA GAAAAGTGGT AGCTTGTGAT ATGTGAGGTT 10380
ACTTTGCCCC AAACATTGGT CACAATAAAA ATATTTTATC ATGAGTAAAG AAAGCTGCCA 10440
ACAAAAAAAG CCGGATTTCT CCGACTGTGG ATAATGTGTG GATAACCTTG TGGAAAAGTC 10500
GTTATCATTA TACCACAAAT CACAAACGGG GGAATCAAAG GATGGTACTT TTTGACGTAA 10560
AGAnATATGA AACACCGGAT GCAAAGGACG TAGATATGGA GCAAACTAAA CATAACGTCA 10620
GTGTGTTCCT GTCTGCCTAT CTTGCTGCTA GATGTCGTGT TGGCCAGCCG AGGGAACCAA 10680
AAGTAACAGC TTCATTCTCT TTGGTTCCAC CATCCACGGC CAAAAACACT TTTGAAGCCG 10740
AGCAAATGTT AATCCAGAAA GAAGAAGCCC AAGAAGAGTT TGATTATTTG CATAAGCTTT 10800
TTGTTAGAGG TTATTCTGCG ATTCAGCATC CGCACAAACC AGATGTTACC GAGCGAAGAA 10860
AAAGAATCTT CTATGATCGA TATATCAACG GTAATCCAAT CTATCTAGCA GCGCAACGAA 10920
ACTGCATCAG CGAAGAATCA GTGAAACAAG AATCTAACAT GATTATTGTT CAATTTGCTT 10980
CAGCGCTGGA ACTGGTTGCT TTTAAGTAGC CATTTATTAC ACTTTTTATA CCTCTTTTAT 11040
ACACTTTATC TACACTTCAT ATACCTTCTA AACGAGTTAT TATGATAGTG TCAAAAAAAT 11100
AAGAAATGCG ACACACTTAC ACAAAATTTA AACGGAACGA TTGCCTACTT ATTTTTTTGA 11160
TTTGAGATTA CAAGGAAGTA AAAAAATTCT ACTTTCTTCG TTTAGTCACT TGTGATCTCA 11220
TTTAAATTCT CTCGCAAACC ACCAATTATA AAACTAAAGA AGTGAGGTGA ATTTCCTCTC 11280
77777777C7 ACAGGTTTGC GAGAGTTAAT GGAGCATAGC TTAATCGGCA GAGCAGCGGT 11340
.--~-_-.-C G7TGGTATAG GTTCGAGTCC TATTGTTCCA GTAAGTGGCA TAAGCTGCTT 11400
AJLA7AAAA7A GATCGTCAAT AAATGTTCGG ACAAACAAAT TGGCGCTACT ACCTTTCACT 11460
AGGG^TGCAT T7ATATGCAG TCCTTTTTGT TTTAAGTGTA GTAGATTTTT CATTTTGAAA 11520
G^GAT^G7 AAGACAATGC GTGTATTAAT TAGAAGTTCA GCATCTGGTT CAGAGTATTG 11580
GGA7ACCGAA GAAAAAAGAA ATGTGTTTGT ACCTAAAGGT CAAGAACCTG ATTTTGAAGT 11640
7AC7GAAAAT CCTGAATCAA TGCTAAGTAA AGAAGCTGAT TTACATGTTG GTGGATTACC 11700
[illegible] GGGAATGTAA CGATTGATAC TGATGGAATT AAAGGCGAAC GATTATTAAC 11760
[illegible] GCTGCTGATG ATGAAGAACA AGATGAGCTT GTTCCGTCTG ATGATGAATC 11820
[illegible] GAAGAAATGA ATGTAAAAGA ATTGCGTGAA TATGCAAAAC GAAAAGGTAT 11880
TGAGATTCCA AGTGCTGTAC GTGCAAAAGG TGAAATTCTC AATATTATTA AAGAATCTGA 11940
A7AATGCGCT ATTGCCAGTT TGAAGGTTGC TCTAATACAA CAGAAAAAGG AGCTTATTGT 12000
TGCGAACATG CTAGGAAGTC AAGAAAAAAG AAAAAGCCAA TCAATGTTTA TCATCATGAT 12060
AACAAATCAT TTTATCGAAC AAAAGAATGG CAAGATGTCG CTGACTTTGT CTATGAAAGA 12120
GAAGGTGGTT GCTGTCAAAG ATGTGGCCGT TTTGTATTTG GAAGGCAAGC GCATCGGCAT 12180
CATGTGA7TC CAATCAAGAA GAACGAAATG CTCAAACTTG ATCCAAACAA TATTCGTTTG 12240
T7GTGTCCGA AGTGTCATGT GATTGAAGAA AATGAAGCAG ATGAGAAAAA AGTTTTTCCA 12300
[illegible] AAAAATGAAG CCCCCCTATC AAATCTGATT CAAATTTTTT GTCGGGGGAT 12360
AGGGTAGGGG GCAGTCACGC GTGTCGTTAG GTCAAAAATT TTAAAAATAA AAGGGGGGTG 12420
TATAAAAAAT GACCACAAAA GCGCAACGCA AAGCGATTAT TGATGAAAAA GTTAATCACG 12480
AAAAAACGCG AATTTTAGAA ATTATGCGCA AGTCTGATTT ATACACTATT ACTCTTGATC 12540
CATTGATTGA ATCATACTTG GATATTTTTG AAGTTTACCA ATACAAATAC ATGCTGTGGA 12600
AAGAAAAAGG ATTTCCCGAA ACCCAAAAAA CAACAAACAA GGyTGGAGmT AyTAACAATA 12660
GCAAGCATCC ACTAGCGCAA CAAGTCGAAG TTTGGGCCGA TArAAAAATG AAAGCATTGG 12720
A7TTATTAGG ATTGACCAAT AAGTCAAAAA CAGGCAGACA AATTACTGGT GGTTCAACAG 12780
CTAGAGCAGA TGnaGAAATG AAACGGCCAG AAGAAAAGCC TGTAAATGAA TTGGCAGAAC 12840
ATCGGAAAAA ATGGCGTAAA AAGGCAGGGA ATGAAACATG ATTGAACCTG GTGTAAATTA 12900
TGCTGATTTA TTTGCGAAAG AAGTTCGAAA ACATCCTAAG AAATATCCGA AAACGGTTCG 12960
TTtAGCAATA GATCGTTGGT ATCGATGGAA GAAACGAAAA GATATTTGGT TTGATGTTGA 13020
TCGTGCAAAT GAAATGATGG ACTGGGTAGA ATCTTTTATT GTTCaTACAA AAGGCGATAT 13080
GGTAGGTAAA CCATTTCTTT TAGAGCCATG GGAAAAATTC ATTTATTCTT GGATTTATGG 13140
C7GGGTAAAA GAAAATGAAA AAGGGCAAGT AGTCCGTGTT ACTCGTGAGG CATACGTACA 13200
AATACCAAAG AAAAATGGGA AAACATTAAT AGCCGTAGGG GCGTTGGGAT ATGCTATGTA 13260
TGGCGAAGGT GCCTTATCTG TCGATTGCTA TGCATGTGCT TCTGATTTTG CGCAAGCCCA 13320
GTATGCTGCT AAGCCTTTTG CCGCTACTAT CCTAAATAAT CCAGTGCTAT TAGATGGGAC 13380
TAAAATATTT AAAGGTCCAA AAGGCACCGT TTCAAGTATT ACGTATGACT ATTTACATGG 13440
AGATATGGCT TATACAAATA AGTTTATTGT TCAGACAAAA AACATTGATA ACATAGAAGG 13500
TTCCAATCCA TATTTTGTTT TAAATGATGA GCTGCATAAA CAAGAGAAAA TGGAACAGTA 13560
CGATAACTTT AAATCTGCAC AAATTTCATT GCCACAGCCG TTAATGTTTA ATATTTCTAC 13620
AGCTGGTAAA GGAAGTAGTT CGGTTGGTAT TCGTGTTTAT AAAGAAGCAA AAGAAGTCTT 13680
GAAGCGCGAT GATAATGATT CAAACTTTGT TTTGATTTAT GAACCAAATA AGGGATACGA 13740
TTGGACaGAT AGAAAAGTTT GGGAAATGTG CAATCCTAAC TGGGGAATAT CTGTTGATTT 13800
GTCTGCTTTA GAATCAGCCT TTAAAACTGC GCAACGTTCC GCTCACTCGA AAGCTGAATT 13860
TTTaACGAAG CATTTAGATG TGTTTGTGAA TGGTGCAGAT AATTTCTTTG AACAAGATCa 13920
AGTGGAACCG TGTTTGGTTC CCACAAATGA ATTAGGTAAC TTAAGTGGTG AGCCATGTTG 13980
GATTGGTTTG GACTTATCTA AAAGTCGAGA TTTAACTTGT GTATCATTAA ATTTTCCTAC 14040
ATGGGATGCC GAAGGAAAAG CGATACTCAA AGTAAAACAA TTATATTTTA TTCCTAGTGA 14100
AAATATTGAT TTTCGAGAAA AGGAAGATAA TGTGCCATAT TCTGAATTAG CAGAACAAGG 14160
ATTTGTTGAA TTTTGCGATG GTAAGTTAAT TGATCAAGAA CAAATATTTC ATTTTATTGA 14220
AGATTGCATG GATTTTTATG ATGTTCAACA AGTCAATTAT GATCCAGCGA TGAGTGACCG 14280
ATTAGTTGAA AAATTGGAAA ATTTAGGCTT GGAATGTGTG CAAGTTGATC AGTACGCAAG 14340
AGTATTGAAC TCGCCGCTTG AAGATGCCGA GCGATTATTT TATGAGCAAA GGATTATGTT 14400
TGATAATCCT TTATTTTTGT ATTGCGCTTT AAATGTGGTT GTCAAAATGG ATTTTCAAGG 14460
CCGTAAAGTA CCAAGTAAAA ACCAGTCAAA GAGAAAGATT GATGGATTTG TTHCTTTCCT 14520
TTGTGCGCAT AAGGAAACAA TGGATCAAAT GATTGATGTC AACGAAGATG ATATGGATGA 14580
ATATTTAGAT TCTATCTATC GATAATAGAG AGGCGGTGAG ATTTTGAAGC TAAGAGATAG 14640
ACTTTCAAAT GCAGTCTATG GATTTTTGGA AAAGCGTGGC TGGATTGAAG ATATTTATGG 14700
CAATGTAACA AGATATTCAC AACGTTTTGT TAACGATTCT TCTATTATGG AATCGTCTGA 14760
TGTTTATGAA TTGGTACAAG ATATTTCTAA TCAAGTTGCA CTAGCAGAAC CAGTAGTAAT 14820
TGGGCCTGAT GGCGAGGAAG TGAAAAATCA TTTCTTGTTA AACATATTGA AAAATCCTAA 14880
TGATTATTTA ACTGGTTTTG AATTTGCAAA GCTTGAAACA AATACATTGT TAATCAATGG 14940
AGAAGCTTTT CCTATTACAG ATAATGACCA GTTACATTTG GGATATGGTG TTCAAACGAA 15000
ATTAGATGAT CGTTTGATTG AAAGATTTTC AATGAATGGC CAACCAATAC CAGGGAGTAT 15060
GATTCGTCAT ATAAAAAATA TTGGTGTGGA TTCCTTAAAA GGTGCTGGAA TTATTGATCT 15120
TGCAAAAAGC ACACTAGAAG GTGTTTTAAG TGCTGAAAAG GTTTTGACAG AAAAATATAA 15180
GAAAGGCGGC TTGCTCGCTT TCTTGTTAAA GCTGGATGCG CATATCAATC CAAATAACAG 15240
CGCTCAACAA AAGA7AGTAA AAGC7ATTT7 AAATCAGTTG GAAGAAACGC AAGATAATGA 15300
TAGTCATTCA STT^A^ATGA T7CCTTTGGG CAAAGGATAC TCAATTGATA CTTTAAAAAG 15360
CCCAA7TGA7 SATSCAGCTA TTCtTAATTA TTtGC-GTGTT cACAAAAAAG ACCTAGGAAA 15420
tTTTTAGGAA 7AGATGTAAA TACT7ATCAc CATTAATGAG AACaGaTATT GnAAAAGCAa 15480
TGATGTATC7 GCACAACAAA C-CAaTTAAmC CAATATTAAA AAATAAGAGC 3AGCATTACT 15540
CGGCTCTT77 TTTTATGCCT AAT7CTGGT7 ATCGAGT3GA ATGGAAAATT AATATTTTGG 15600
ACTTTGTACC TTATTCCACC AAAACAAATA TTGGGTACAA CATTGTTCGA ACTGGTATTA 15660
CCAGTCCTGA TAATGTGGCA C-AAATGCTTG GTTTTCCTAG ACAAAATACT GAAGCAACAC 15720
AAGCCGTCTA fATTTCAAAT GATTTAACGG AAATCGGCAA AAAGAATGCT ACCGATAACT 15780
CATTGACAAC AGAG3ATGAC 77GAAGG3AG GTGGTAAGAA TGAAGAAACA GGAAATTCGG 15840
ACATTTGACA TCACAAACCT TAAAACAAGA AGCGAAGAAG ATAGTCAAAC ACAGATTGTT 15900
ACTGGTTAT3 CGGCGGTGTT TAATAGTCCA ACAGAATTAT GGGAAGGCCT AAATGAAGTG 15960
ATTAAGCCTG 3AGC77TCAG 7CG7GCTTTG TCAAATTCTG ATGTTCGTTG TTTATTCGAT 16020
CATGACTGGG GCAAAGTAT7 AGGGCGCACA AGAAGTGGAA CTTTGAAACT TGAAGAAGAT 16080
GATAAGGGAC 7ACGA7TTGA AGTTGAGTTG cCCCAATACA ACTGTTGCCA ATGACTfcGAT 16140
TCAATCAATG 7CACG7SGGG -CATTAvTCA G7GTA3CTTT GGTTTTTATC CaACGGAAGA 16200
AACTTGGGA7 7ATAGTTCAG ACCCAG77TT AAGAACTATC CaTGAAGTCG AATTGTATGA 16260
AGTTTCTA77 3TTTCT11AC CTGCTTATGA AGATACAGAA GCAGCACTAT CAAGAAACAA 16320
ACAAGAAATG AAGCAA3ATA TTAAAACTAG AAAAAAATTA ATTGAAAAAA TCAAAACAGC 16380
GCTTGAAGC3 7AGC-AGGAAT TTATTATGAA CAAAGAATTA TTGCGTCAAT JACAAGCTCG 16440
TCACGAGAAA CGATTAAGTG ATTTACAAGG CAAAA7TGAA TCTGGAGAAG TGCGTGAAGC 16500
AGATTTAGAT TCAC-TTAATG AAGAAATTGA TGGTTTAATC GATGAATTAA AAGCCATTAA 16560
AGCTGAATTA 3GGC-ATGATA A7TCAGAATC TGGTGATGGT GAAGGCGATG ACGGGACCGC 16620
TAAATCTGAT AATACTGATC- ATGAAAGTAA AGAAGATCGT GAGAAAGATA CGAACGGAAA 16680
CAACGATGAT AAAAACGAAG AAAA7CGTGG CGGCATGATT AGCCAAGAAC AGCGTGATGG 16740
TTTGTTACGC ACAATTCATG AAGGAATGGA GGCTASAAAT GTGATGTCTA ATGAACAACG 16800
TGAAAAACAA ATTC3TAAAG CATT7GCTGA TTTTGTTATT GGTAATATTT CAGAAAGTGA 16860
AGCACGTGCA TTAGGTATTG AAACAGGCAA TGGTTCAGTG ACAGTACCAG AAGTGATTGC 16920
ATCCGAAGTG ATTTCITATG CTCAAGAAGA AAACCTA7TG CGTAAATACG GAACTGTGAT 16980
TCGCACGGCT GGCGATGTGA AGTA7CCAAT TCTTGTGAAA AAAGCAGAGG CTAATGTAAA 17040
CAAAAAAGAA CGTACGACAG ATAT7ACTGA AACAGCGATT CAATTTGATG AAATTCTACT 17100
TGATCCAGCA GAATTTGATG CATTAGCAAC TGTAACGAAA AAACTATTAA AAATGTCTGG 17160
TGTGCCAGTA GAA3ATATTG TTGTAGAAGA ATTGAAAAAA GCATATGTTC GCAAAGAAAT 17220
TAATTATATG TTTAATGGCG ACGATGCAGG aAATGAAAAC CCAGGAGCTT TAGCTAAAAA 17280

AGCTGTTGCA TTTGAAAAAC CTGTAGATTT AACAGCTGCA GGTGCTGGTC AAAAATTATA 17340 

TGATGCATTG ATTGAATTTA AAAATACACC AGTAACAGAA GTAATGAAAA AAGGTCGTTT 17400

TATTATTAAT CGTGCAGCTT TAACTGCTAT TGAAAAAATG AAAACAGATG ATGGATTCCC 17460

ATTGTTACGA CCATTCACGC AAGCAGAAGG TGGTATTGGT TATCAATTAG TTGGTTATCC 17520 

AGTTGATTGG ACCGATGCAG CAGATAAAAA AGGTGAACCA GATACACCAG TATTATATTT 17580

TGGTGATTTT TCTGCTTTCA AAATTCAAGA AGTTATTGGA GCGTTAGAAA TTCAAAAATT 17640

GGTTGAAAAA TTCTCTGGAA AAAATCAAGT TGGTTTCCAA ATTTACAACT TGTTAGATGG 17700 

CCAATTAGTT TACTCACCAT TCGAGCCGGC TGTTTATCGT TATGAAATTA CAAAACCAGT 17760 

AGGTGGTTAA TGTGAATAAC GAaGCTGAAA CATTATCTTT AGAAGAAAAA TTCAAAGCAC 17820

ATATTCATTT TGAAGAGGGG ATGGATGATT CCAtGCTCTC TTTTtATTTA AATATGGCAA 17880 

AAAATTATGT AAAAACTGCa ACTGGAGGGC AAGAAGAATA TTtAATTTTG ATGGTTGCTG 17940

GTATgCTTAT GAaTATCGTG TTTCAGAAGA TGAATTAGAT AAGGCGTTGA ATGCGATCAC 18000 

GCCATTTATC ATCCAAGGAG TGATTCAACA TGCCGAAGAG GCAGACGAAT AGGTTTCGAT 18060 

GGAAAGCGGA CTTGCTAAAT GTAAAAGAAG AAACAGATTC GAACGATAAA GTAGTTACGA 18120 

CCTATAAACT TAATAGGCTT TTATGGTACG AAGATATTGG AGTAACTGCA CAAGAAAAAT 18180 

ATCTTTCACA GCAAGCCAAA ACAGACGTTG TCAGACGGAT TAAAGTGAGA TTGGATAAAT 18240 
CTATCACAGA AAAGTTTAGC GCTGTTAGAA TCGATTCTGT GACCTATAAA ATCACTCGTA 18300 
TTTACACAAA TATGGATAAA CGAGAAATGG AGTTGAGTTT GGCTTATGTC GATTAGTTTT 18360
GAAAAATTGA GAATAACTCT GAAATCAGTA GGCGTACCTG TGACACGTGA TAAAGCGGAA 18420
AAAGGAACGG ACTATCCATA TATTGTTTAT TCAAATGTTA GTCAAGGTAA AAAAATGGCA 18480
TCGTCTAAAG TGCATAGACG AATGCCCTAC TATCAAATCT CTTTTTATAC AACAGGTACT 18540
GAAAAGGATT TAATTGCTTT AGAAAATGCA TTGGAGGAAG CTGGTATTCC TTACACTGAT 18600
TTTGTAGGCA TTCAAGGCGA TGAAAATGAT GATACTGTGA CAAATTTTTA CACATATGTG 18660
AGGTGTATTG AAGATGGAAA ATAATAATGG TTTTGCAGAT ATGGCAGACT ATTTAGGAAA 18720 
GCTTTCGCAA GTAGATGCGA CAAAATTATC AATAGAATCA TTAACCGCTG CAGCTAATTT 18780 
TTATATGGAA AAATTACTAC CCAATATACC TAAATCGCTT CTAAAAAAGA AGCACATGGT 18840 
TGATCAAGTG AAAGTAAATA TTAAAGATAA TGAAGTACAA GTAGCTTTTG AAGATACAGC 18900 
TTTTTATTGG CGATTCGCTG AAAATGGAAC AGTAAATCAA AAAGCGCAAC ATTTTGCCAG 18960 
TGGAACATTT GAACAAAATA AAGATCAAAT TGAAAAAATT ATGACTCAAC AAATATTAGA 19020 
TTTATGGAAA GGATGAGTAA ATTGGGAAAA CAAGATGTGT ATTATTTTGA AGGCTTAGAT 19080
GACATCTTAA TTGCCATGAT GACCACAAAA GATGCAGTAG GTACAGAACC AGCATTTGGC 19140
GAGGTTGTTC GTTTGCCAAT AGCCACAAAA TTGGGAATTA AAGGAAATGG AACAGCTTTA 19200 
b AC Ab 1 1 Ab I


Al HwAAbvjolw 


1 Q"7 A n 


ATCAGTTTCC 


CATTTTGTTA 


ACGATGGAAA 


r~> tv TV rp/^/"T\ /^mA 

CAATGbAb i A 


1 Al 1 bG 1 bAl 


PTTT ATPATft 
b 1 1 1 A 1 bn 1 \j 


1 3u 


ATGACTATTC 


tv tv tv tv m m m ^ tv -tv 

AAAATTTGAA 


tv ta m tv m Z~* TA 

GCAGATATGA 


71 mrT\ tv TV rri A T 1 

ATGAAA1 GAI 


1 AAG AAG AG 1 


PPTPPTPPT A 
bb 1 bb 1 bb 1 A 


1 QQ CO 
IjODvJ 


TTTCATCAAA 


tv z*^ t\ m m rn tv rn z"** tv 

AGATTTATCA 


GCTTCTGATT 


TAAAAAT'T'A'T 1 

T AAAAA I 1A1 


GGGlbbAl lb 


tv rp m rn TV p Pp A 

All 1 RA^oov^H 


1 QQ?0 
x ^ ^ c yj 


TGCTAAGAAC 


my** m z™* z"* t\ m m z"» 

TGGTGGATTG 


TV TV 7V TV TV f-y /*i /~1 

GAAGAAACGC 


TV 7V A A /^*/^A T 

CAGAAAGGA i 


rnrpmrpTV T\ Z -1 T T T 
1 1 1 1 AAb 111 


TT SPPTBTP A 
1 lAbOlAlori 


i qqo pi 


GTGCCACAAT 


TGTTGAAATT 


TATGGTGCAT 


OfTITl m^o )\ TVAm 

GTATGGAAAT 


A rp rn rn f* r*' tv t\ a f 

ATTTGGAAAG 


/-i tv p A ATTTT/** 
GAGAAi 1 1 lb 


onnzi Pi 

Z UUfl u 


AAGTTGAAGA 


CTTAAAAAAA 


m /"^ TV TV Z^ TA TV Z"^ Z"^ 

TCCAAGAAGC 


• tv tv tv /"■» tv m rn A 

C AC AAG AT T A 


TCAAAG1 GGG 


A A A A A A BTP A 

AAAAAAAbbA 


Pi Pi 


AAAACAAAAA 


GAAAAAGCCT 


TV TV Z"^ Z*^ /"^ TV TA TA HTl 

. CAACGGAAAT 


AG G AACG C GG 


m z*^ z*^ fTi rn rri 

I GGGG 1 1 1 1 1 


Al 1 lAlAlbl 


00 1 fin 


AGCTCTCACT 


CTTTTGGGAT 


Z™* Zi Ti z^ m Z"* TV Z"" 1 TV 

GGAGTGAGGA 


CTTTTTTTTA 


a a a r*r* A APTP 
AAAGG AAG 1 G 


c c a n C T T P T c 
bbArt^ llolu 




GCTTAAGTCA 


m ta «ti t\ rn z—> tv /"* m 

TATATTCAGT 


GGTTAGTAAG 


I AA 1 AGGGAG 


TTTPA. A PPAP 
ill vjAAvvLnv^ 


PPA(^AAP,Tf;T 


20280 


GACAATGGAT 


tv t\ tv tv m z~* on rn 

AAAAGTCCTT 


GGTGGTAGGA 


AAGGAGGGG 1 


A APPTPTrA A 

A/ibb 1 b 1 bAA 


A AP A AC A ATP 


U J *i u 


CGATGTTGTC 


TTAAATTTTA 


TV r"** A m A A rn /-> r-" 

AGATGAATGG 


A A A A T 1 A A A O 

AG AAA I AAAG 


1A1 IbAbbAA 


PA ATT A A AP. A 


20400 


CATTAATAAA 


GAAATGAACT 


TAGCGGCTAC 


/"•r^A/^TA^A A A 

GG AG 1 AGAAA 


A APPBPfTZiT 

AAGGAbb 1 A 1 


PPP.P A ATP,P,A 


?ri4 fin 

c. \J *a U vj 


TAAAAATGCA 


ACTCAAACTG 


TV TV TV TV TV m m TV TV Z*^ 

AAAAATTAAC 


TV m TV A TV A/^ 

AGCTACAAAG 


A AAAAATTAP* 

AAAAAAI lAb 


A A A A AP a ATT 
AAAAAlwAAl 1 


9 n ^9 n 


ATCTTTAGCT 


Z^ TV TV Z™* TV TV TV Z"^ TV TV 

GAACAAAGAA 


/-i ti tv tv ti m m tv mm 

CAAAATTATT 


AC G T GAG G AA 


rp Tv p"/— tv A AA AT 

1 AGGAAAAA 1 


paPTaaa ap a 

bAb 1 AAAAurt 




AACTGGTGAA 


TATTCAGAGC 


A ATOA^AAA A 

AAT C AC AAAA 


GG 1 G I Ai AAG 


ppt tt a TT(t:rt 

bb 1 1 1 A 1 1 oo 


A ATPPPAAAP 


20640 


AGGTGAAAAT 


ta tv tv /^m z - ^ rn m 

AAACTGCGTT 


CTGCATTGCA 


AA/*^ r PAr~ , /^*AA r T 1 

AAG 1 AGG AA I 


PAATPTTTPA 

bAAbbl 1 Ion 


A APAPP A APP 


20700 


TAATTTATCA 


TV m TV TV TV T\ Ti /""* TV r*** 

ATAAAAACAG 


r ,, TT , AAAAA/™*T l 

G I G AAAAAG 1 


AbbbAAAA 1 1 


r*A A A A APr*Tf^ 


nACArAAAAT 


20760 


I AAA 1 GAG 1 1 


rrrp?\)\7\ a a p* 
GGGCAAAAAb 


rn z"* rn z™ 1 rr* np rn z^ Z^ 

IGiGlGi 1 GG 


a TT 71 Ti r* TA c c n 


VwV^rVril i- r\l. ou 


nAATTGGTGC 


20820 


TGCTTCTATT 


/"" , l r '*r- , ^"P' , ATT/" , A 

GCCGCA1 IGA 


A Af'A ATTSrS 

AAGAfvl 1AGA 


i b AA 1 b 1 I lb 


flATAATATAA 


PAArAGCAAP 


20880 


AGGGGCTACT GGTAGTCAGC TAGAATCTTT ACAAGCCAGT TTTAAAACAG TAGCAGGTCA 20940
AATACCCGCG GATATGCAAG ATATATCAAC TGGTATTGGT GAAGTAAATA CTCAATTTGG 21000
CTTAATGGAT AAGCAATTGG AAGATACAAC AGGCCGAATG CTTAAATTTT CAGAAATTAA 21060
TGGCTCAGAT GTTTCTCAAT CAACTATTAA TGCAAAAAAA TCAATGGACC TTTTTAGGTT 21120
GTCTATTGAG GATTTGCCAA TGATTTTAGa TTCAGTATCT AAAACTAGCC AAGATACTGG 21180
a p t a p n. n fi t a
Ab I Aouuu 1 A 


uAltaol lnl 


TTP ATP.PPP.T 
1 1 bAl o*~.v-o J. 


r\r\r\ Lnunuu l 


GCGCCCCAAC 


TCAAAGCTAT 


21240 


bbbAV- 1 1 ou 1 


TTTTfT^B AT 
1 1 1 1 k* 1 u/in 1 


PAAPTAPGTT 


AATAGGTCAA 


ATGGAAaAAG 


CTGGTATTGA 


21300 


nr» t p a pp a p p. & 
1 I bAbbAbbA 




n tttp. P.P a a a 

Al 1 lbuLnnn 




GTATATGCGA 


AAGATAATAA 


21360 


t\ ti /- , /^ > a tpp a a 
AACb A i bb HA 


PAaPPPPTTA 
bAAbbbbl 1A 


P.PP.P. A APT AT 
uLbunnL lnl. 




AAAGGGGCCA 


CAACTGAACA 


21420 


AG AAAAAb i b 


APTaTTPPTA 
AL 1 A 1 1 bb I A 


PTPAaPTTTT 
b 1 uMo 1111 




GCTGCTTCAA 


AAATGGTAGA 


21480 


AGCTATTGAT 


AbbbbAgbb i 


TPTpaaTPPa 

1 b 1 b AA 1 buH 




GATTCAGCAA 


AAaACGCAGC 


2154 0 


TGGCACTGTG 


GAT bAb Abb I 


1 1 Ab 1 bAl Al 


TPTTP. ATPP A 


ATTGACCAAG 


CAAAGTTGGC 


21600 


ACAAAATCAA 


rnrnfTl T\ 7\ TV Kfp^P 

TTTAAAATAb 


/"■AAT/^PPTPA 

bAAIbbbi bA 


a pttpp a p a a 

Ab 1 1 bbAuAA 


PAAP.TAPAAA 


TAGGATTATT 


21660 


ACCAGCATTT 


/"< TV T\ /*l /--i m TV TV 

CAAGCTGCAA 


CGGATGCAAT 


A A A nTVHUPTT 

AAAAAAAb 1 1 


TPaP A ATP-P-T 
1 bAbAA 1 u^s 1 


TTP.P. A AP.TTT 

1 1 \3\3t\r\\3 111 


21720 


AACAGATAGT 


CAAAAGCAAA 


CCATACTGAA 


A ATAPPTPPT 

AAT. Abb 1 bb 1 




PTZjTPP,GTPP 


21780 


AGTATTAGTA 


GTTTTAGGAA 


CACTTGC I AG 


T>n">PPAT ,r P7AP r r 

I 1 bbAl 1 Ab i 


APTTTP-ATTP 

A(j 111 Ort 1 Iky 


P AGTTATTGP 
Lnvj i 1 1\ i i 


218 40 


TTTTATTGCG 


TCGCCAATTG 


GTTTAGTAAT 


i bbbbbbb 1 1 


oLLoL, 1 1 VjVjVs 


t- AGPTGP A AT 
Unlaw 1 oUnn 1 


21900 


CGTAGTTGCA 


ITlTvrnTV TV m TV TV TV T\ 

TATAATAAAA 


TCGGTTGGTT 


TAPPPflTTTT 

1 AbbbAl 111 


A TP A AT APPT 


PPTTTAAAGT 

Ill rtrtrtVj 1 


21960 


AATTAAAGAT 


ATTGTGGTTG 


GTGTATTTAA 


m^mmrpnp/'^A. a tv 
1 G I 1 1 1 G AAA 


P AT APPAPAA 


A ATPT APTTT 
nrt 1^1 nvv 111 


22020 


TGATTTCATC 


tv ^ 7v ^ 7V rp rn m tv 

ACAGGATTTA 


TTGG IGG l GG 


P7\TPPJTPPP 

bA 1 bbA 1 IjoIj 


PPTP.PA A A A A 


TTATTGGPGA 

1 lnl lOvjvwVJri 


22080 


TTATGTAAAT 


GAAATTAAGC 


/■"< m TV m rn rprp rn f~* /— » 

GTATTTTTGG 


PPPT]\TPPTT 

GbbI AJlbb 1 1 


P ATTTTPT A a 
bAl 1 1 luiAH 


PGGG AGT ATT 
tuvjonvjlnl 1 


22140 


TACTGGAGAC 


T G G T C AAG AG 


CGTGGGAAGG 


TPTTPTTPIP 

1 b 1 1 b 1 1 bAb 


ATTTTTP.GTG 
Al 1 1 1 1 oo 1 u 


GT ATTTTTGA 
Valnl l l l l On 


22200 


AGGCATCGCT 


GCAGTAGCTA 


A A f rp /"•/*"* A A T 1 

AAbG i GGAA 1 


PA 7\TPPPSTP 

bAAl bbbAl b 


ATTAPP.TT A A 
ri 1 1 rtVrfO 1 1 rtrt 


TPAATGGATT 


2226.0 


TATTGGTGGA 


TTAAACA \TA 


mTVTVT\A7\TITV 

i AAAAA I AGG 


rn tv tv tv rn^/-»/^rp 

1 AAA 1 bbb 1 la 


PP A PPA A TTG 


GPGGTAAAGG 


22320 


ATTTCATATT 


/"•/"•TV A TV TV 7l(PPP 

GGAAAAA I GG 


G 1 I A 1 1 1 AGG 


A P A APPTPP a 

AbAAoo 1 IjvjH 


APT ATTPT A A 


ATGGPPAAGC 


22380 


CATTGTTGG1 


GAAGbl GG1 G 


b 1 b AAb i A 1 1 


a appppt a a a 


A APGGPAAGA 


GAAP AGT AAC 


22440 


TCCATTGTCA 


r~*r"* T\/**T\T\/* , A7\ A 

CCAGAAGAAA 


AAGG I Gblbb 


AAl l bb 1 1 


PPTTTGAAAG 


GTGGPAAPAP 


22500 


TATTGAGCAA 


CATGTTCATA 


T1GGGGAAGI 


APATPPA A AT 

AbA 1 bbAAA 1 


A ATPPnAGTG 
AA 1 v*\yvj/ilj 1 \j 


AGTTAGATPG 
no 1 1 no;n 1 v» O 


22560 


tv tv m tv t\ rn /""•/"" 

AATGAATCGC 


AAGCTTTATA 


A TlPP7\ nPTPP 

AAGG AAb 1 bb 


PPa aPPTTTP 
bbAAbb 1 1 1 


T ATP. APTT^G 


GAGGTGTTCC 

OnVJO 1 VJ l l v^n_^ 


22620 


aACGTGATTT 


TTATGAATGG 


1 bA 1 bAAbbA 


AAl 1 IbAl 1 1 


CP. A A AG ATTT 


GAATGPAGTT 

Onn l uv^nu i i 


22680 


CG T GAT AT GG 


/-"» /Tp /■"*/"*> T\ HirpTirp 

GGTGCATTAI 


pp a a a a t*p rp 
bbAAAA i bAb 


PTPTPafZAP.P. 
b 1 b 1 Vw AuAVjVj 


TTTTAGGAAA 
1111 nv-/Vjnnn 


TAAAPGATAT 


22740 


bAAAbb 1 A 1 1 


bbAl I Al bbb 


a apa &PTPP.T 

AAbAAb 1 ou 1 


P. A ATTT AATfi 


AAAGGTTCAA 


TGATTATGAA 


22800 


bbb III bAl 1 


TvTrJV A A TTP A 

A 1 b AAA 1 1 b A 


apaTP.TZvaPT 

Ab A lul /\Hiw 1 


ATTPPATATG 


AAAATTTAGC 


GGCAGTCAAA 


22860 


AGATGGTTAA CTGGTAAAAG TAAACTTATT ACTCACAATG ATGAAGATAA ATATTTAGAT 22920
GCTATTTGTA CAATGAGTAA ACCAACTTCA TTCAAAAATG AATGGGGTGT TTTTTATACC 22980
TTTAACGTTG AATTTAGATG TCAACCGTTC AAAAGAAAAG TAAACGAACA ACCAAAAGTG 23040
ATTAAAACAA AATCAATTGA AATTACTGAT CACGGTGATG AAATTGCTTT TCCTTATATC 23100
GAAATTAATT CAAAAGGTGG CGATATTACG TTAAACATTG GTAGTAACTC ACTAACGATT 23160
TTGCGTACAC


AATCAGGAAT 


CGTCACTATT 


GATACCGAAA AGGGAAAAGC 


AA 1 AO AAu AM 


23220 


GGAAATCCAC 


TATTTACACG 


CGGCAGTTGG 


ATAAAAACGA 


ATCCCGGTCA_ 


AAA1AAM1 1A 


23280 


AATATATCAG 


GAAATTTTAT 


H Tl TV /"* /"l m TV TV 

AGAAGCTAAG 


TTTTGGAATA 


GGAGCGCGTA 


1 1 1 Al bAL.AU 




AAAATTTTAT 


TTATGCCTAT 


ACTGCTATTC 


CTGAAAATTT 


AAACGATAAC 


/"•/■"■ft Tv TTTT'TT 

GGAAj. bbO I I 


o"\a nn 

Z J fi uu 


TGCCAGATTG 


GCAAGATTTA 


CCAGAAATTA 


ACCGTGTGTT 


AAATGGTGTG 


TATLGAI 10 1 


Z 0 ft DU 


ATGGTAACTA 


TGCAAGAGAT 


GGCCAGTATC 


GCTCATACTT 


AAAAAAAGGA 


AACTTTCTAA 




AGGCACAAGT 


TGAAGATGGA 


TCATATCAAT 


ATTTTGAGAT 


TTACAATATT 


AAAAAAAAI U 


O *3 C Q A 


TGCAGTCAGT 


TTCAGTGACA 


GCGAGACACA 


TTGGTTTTAT 


GGCAAATAAG 


AATTTCA1 1 A 


ZJOl u 


TTGATTCGTT 


CACTGCTAAC 


GGAAATGGCA 


CGCAAATTAT 


GAATAATTTA 


AAGGCTGCAT 


z J / UU 


TAACGTTTAA 


GCAACGGTTT 


AACTATTTGT 


CGAATGTCGG 


TACTACACAT 


C AAT T T AC AG 


/ DU 


CAAAACAAGT 


AGGTCCAATC 


GATGCAATTA 


TTGGTTCTAA 


CAATGGCAAT 


t* ta tv ta mntfn tv tv 

CAAAATTTAA 


O *3 Q o n 
Z OOZU 


CAGGTGTTAC 


TGGTGGAGAA 


TTAGAGATGG 


ATAACTTTAA 


TTTGAAATTA 


rn TV 7V TV TV 7\ TV 7\ 

GTAAAACaAA 


0 O Q Q n 


TTGGAGCAGA 


TAATGGCTTT 


AGAATTGATT 


TTGGAATTAA 


TTTGGAAGCT 


AT AG AT GAG G 




ACTATGACGA 


CGAATCAATT 


ATAAACAGTC 


TTTTTCTTAT 


CGGTGGCGTA 


^ TV TA *n TV rn 

CCAGACAATG 


Z4 UU U 


ATTATGACCA 


AGATAAAGAG 


CCAATCACGT 


ATGGCTTTTT 


AGAAATTGCT 


GGTGTAAATG 




ATAGTAACAG 


AAGAATTGGA 


AAACGTGAAA 


ATTCGGAATG 


TAAAACAGTT 


G AT GAGL I 1 A 




AAAAATGGGG 


CCAGTCATTG 


TTTGATAAAG 


ACCGCATTCA 


TGAACCAAAA 


GTAACGCACA 


o /i i o n 
z ^ i o u 


CTATTAGCAT 


GGTAGCATTA 


GAACACACTT 


TGGAGTATGA


AGACATGTAC


GAAGAbCl 1 1 


24240


CTTCTTTGCA 


TTTTGGTGAT 


GTAGTACATG 


TTAGAGCAAA 


AGAAGTCGAT 


ATTGAAb 1 AA 




CGGAGCGCAT 


GGTGGAATAT 


ACTTGGTTTC 


CGACTTTAGG 


CAAATTTAAA 


AATATTbl I 1 


24360


TGGGGAATGA 


TTTATCACTT 


TACACCTCAA 


CAGTAAATAA 


TCAAACTCAA 


GAGC I AAAAO 


24420


AAAAAATTGA 


TAATCGAACA 


GAAACATTAG 


TACAAAATGT 


TTTAAATGCA 


ACGbCAI bbA 


24480


TTACTGGAAA 


CAGTGGTGGA 


CATGTCGTTT 


TTCGTCCAGA 


AAAGGCCCCG 


J. CTGAAA lib 




TTATCATGGA 


CAAAAACAAA 


GTAGCTACCG 


CAAAACGTGT 


GTGGCGCTGG 


AAC1 lAAAlb 


24600


GTTTGGGTTA 


TTCCTCCAAC 


GGCGTGAATG 


GTCCGTTCGA 


ACTTGCTATG 


AL- 1 i L i AAbb 


24660


GAGAAATCGT 


TGCTGATTTT 


ATTAAAGTGG 


GCATTATTAA 


CGCGAATGTT 


1 lACAAAOMA 




GCTTTAATAA 


AGCAACAGAC 


GATGTACTAA 


AATTAGTAGC 


TGGTGCTTTG 


L/Wil 1 1 bbA 


24780


ACAATAAGAA 


AAAAATCATG 


GAATTGACTA 


AAAAAGGGAT 


GGAATTTTGG 


AAIboLlL I A 


24840


GTCATATTGG 


GACAATTGGT 


ACGAAAGGAA 


ATCCCTTTCC 


AGGGGTAGTA 


GATAAAAATG 


24900 


GAAATCCTGT 


AGTTTCTGAT 


GGGAATTCAT 


TGCTATTAGT 


CGCAGATAAT 


CCCAAAAAAA 


24960 


TTATTGGTTT 


GTCTAACCAA


TCAGGAACAG 


GACATTTAAT 


TACTGGTCCT 


ACACAGTTTT 


25020 


TTGTTGGAAA 


TAATTTTAAC 


TTTTTTGGTC 


CGAATGGAAG 


TAAAGCAATT 


CTGACAGTTG 


25080 


ATCGATTGAT 


TGTGGGCGGC 


AAAGAAGTTA 


TACCTGGTCA 


AAATGGTGGT 


GGCGGTTCTG 


25140 
GAGCTGGAAC AGGTGGTTAT CCATCAGAAG TTACAAGCGA TGCAGATAAA TTTGCTTGGG 25200

ACTTATGGAG TTACCTATTA GCTAACGGAT ACAGCAAAGC AGCTGCTGCA GGTATCCTyG 25260 

GAAATGTACA AGGAGAAGTT GGTCCAAGTA TGAACCCAGA TACCGAACAA ATAGGCGGTC 25320 

CAGCTTACGG ATGGGTTCAA TGGGACGGTT CAGCATATCC ATTGGTAGGC GCACCAACTT 25380 

GGAATGGCCG AGAATATGTA CAACGCTTAA TCGCAGCTGC AGGTATCAAA CAAGACTATA 25440

GGACGTCATT AGCCCAAGCT CAATTAATTA ATTGGTGTAT GTTCAATGGG CAATGGTTAG 25500 

GACAAGTAAG TCCATTAACa GTTGATGaAT TTaAAnnTCT ACGGTCTTTG GTTGGTAACG 25560 

TTACTTCTGT TTTTTCAGAA 25580 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

ACTTAAATAC AAATGTTGGT CGTGCACAAT GGGAACCTCA AAGTATCGAA GCGCCCAAAA 60 

ATACGTTTAT TACTGGTAAA TTAGCCAGTG CAGGTGTCTC AGGTTTTGAA CCTTTGGATG 120 

AACAAACAGC TACTCGTTGG CACAAAACAA ACATTACAAC AGGTCCCCTT GACATCACTT 180

GGAACTTAAC TGCCCAACAT AGAACTGCTT CTTGGGATTA CTATATTACT AAAAATGGCT 240 

GGAATCCCAA TCAACCATTA GACATTAAAA ACTTCGACAA AATTGCTTCA ATCGACGGCA 300 

AACAAGAAGT TCCTAATAAA GTTGTTAAAC AAACAATTAA TATTCCGACA GACCGCAAAG 360 

GTTATCATGT CATTTACGCT GTCTGGGGCA TTGGTGATAC GGTGAACGCC TTTTACCAAG 420

CGATTGATGT GAACATTCAG TAAAAATACT CACATCTTAA ACTAAATTAA AGGAGTTCTC 480

AAAAAATGAA GCTAAAAAAA ATAATTCCTG CTTTTCCCCT TCTTTCAACC GTTGCAGTTG 540

GCTTGTGGTT AACGCCTACT CAAGCTTCTG CAGATGCTGC GGATACGATG GTAGATATCT 600 

CTGGCAAAAA AGTGTTGGTT GGATATTGGC ATAACTGGGC CTCAAAAGGA CGCGATGGTT 660 

ACAAACAAGG AACATCAGCA TCACTAAACC TTTCAGAAGT AAATCAAGCC TACAATGTCG 720 

TACCGGTTTC CTTCATGAAA AGCGATGGCA CGACACGGAT TCCTACGTTC AAGCCTTATA 780 

ACCAAACGGA CACTGCCTTC CGACAAGAAG TCGCACAATT AAATAGTCAA GGTCGCGCAG 840

TTTTATTGGC ACTTGGTGGA GCAGATGCAC ATATTCAATT AGTCAAAGGC GATGAACAAG 900 

CCTTTGCGAA TGAAATCATT CGTCAAGTGG AAACATACGG CTTTGATGGT TTAGACATCG 960 

ACTTAGAGCA ATTGGCGATT ACTGCTGGCG ACAACCAAAC CGTCATCCCT GCTACGTTGA 1020 

AAATAGTCAA AGACCATTAT CGAGCACAAG GAAAAAATTT CATCATTACG ATGGCACCAG 1080 

AATTCCCTTA TTTAAAACCT GGTGCCGCTT ATGAAACATA CATTACTTCC CTAAATGGTT 1140 

ATTATGATTA CATTGcCCCA CAATTATATA ACCAAGGCGG CGACGGtGTC TGGGttGATG 1200 
AAGTTATGAC TTGGGTTGCT CAAAGCAACG ATGCTCTAAA ATACGAGTTC CTCTATnATA 1260
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



m/iOTl TV f~" f CT 1 


TATPP.ATT AP 


AAPAAPTGGA 


AGTtCTTTTA 

X X X X X *i 


AATCAAGCTT 


GCTTTTTCCA 


60 


Al iAbAlKLt 


t a Tczrzn a arr 


A TP A wT A AT A 


ATATATTCGT 


CTTTTTTATC 


ATCACAATAG 


120 






TPPP^TPTTTT 


AAAATCGATA 


ATTCTAAAAG 


CTCCATTTCA 


180 






PTTATAAPTA 


TTACTTTTTA 

A X * X X X X X * * 


TTTTCTCTAC 


TGGAACCATT 


240 


TTT7\rn r r r p r rzi 




ATPAATCTTG 


TTCATTGATT 

X X X^XX X A X X 


TCTCTCCTAA 


AAAATAAATA 


300 


7\ T , *T'f^T""P f r^ r PP 

Ai 1 1_ 1 1 HjlO 




HTTTAAAAPA 


ATAATGTTTC 


AGATTATTTA 


AGTACTGGTC 


360 


1 i 1 i L 1 l/i/V-i 


P. A A A T TPPT A 


PAPAPTTTTT 


TTCACATTTC 


TTGTCTATGA 


AAAATAAAAC 


420 




A A ATAPTTTP 


PAPTATPflTP 


ATTTTPATCT 

C\ X X X X V> X w X 


TCAACAAAAA 


CATCTCCAAT 


480 


ATCAATAAAC 


TCATTGGAAA 


TTTGAATGGA 


AGAAAATCCA 


ATTATCCTTT 


CCTTATTCAT 


540 


TAACAAAAAC 


CAAATAGTAC 


TAGATGTATT 


TATTAAATAA 


GGTAATTTTC 


TTCTATATAC 


600 


TCTTTCAGCA 


AAATATTTCC 


CCATTAAACT 


GTAAAAATTT 


TGTyTATTGT 


ATGGGGGTAT 


660 


ATAAACTTTA 


ATATCTATTT 


TGCTCATTTC 


ATTCCTCCTA 


TTTTTTGTAC 


AATATTTTCG 


720 


TATTTCTTTC 


CTTTTAATTG 


ATTTATCCCC 


AAAAAGTCTT 


TATATTCTTT 


AAACATTTCA 


780 


TATAGTTCAG 


CTTCATCTTT 


TTTCGTTAAA 


CTATAATTTC 


TAGTAGATAA 


AACAAAGTCA 


840 


TTTTTTTCTA 


AAGCTTTAGC 


TATCCCTTTC 


CAGTCATTCC 


AAAGCCTTtC 


ATPTTTT ATP 


900 


CAATCTTTTC 


TTTTTGATTC 


ATCTGGCATA 


TCTTGGATAT 


CCATATtATA 


tGTATtCktG 


960 


TAATAaGCAA 


AAaGAATACA 


TATCTTTCTG 


TAATAATGAT 


TCATTAAAAG 


AGGAGAATAT 


1020 


AATCCTAATG 


TTTCTAGTAA 


AAAAATAGAA 


TATCTTTGCC 


AAGTCATAAA 


TGAAGGTTTT 


1080 


TGAGAAGTAT 


TTAATCCTAA 


AAGACTAGTT 


TTACCATATA 


TATTTCCAAA 


ATTAACTCCA 


1140 


CTTACTCTTT TTACTAATTT ATCCCATGTT TCAgGTTCAA GTTCAGCAAA 


TTGATTTAAC 


1200 


GCTTTTCTTT 


GTTGTAATCC 


GAAAGGCTGG 


CATACTCTCA 


TGTCCCGGTA 


CTTATATCCA 


1260 


ATCTTATACA 


TTTTCTCATA 


GATATAATTA 


TATTCTAGTT 


CACATTTAAA 


AGTTGCAGCC 


1320 


CAAATGTCAC 


CAACTTTCCA 


ATCAAATAAA 


GGATAAAACA 


AAATATTTCC 


TTCTAAAGTT 


1380 


ATATTTATCC 


AACATCGATT


CTTATATTTA 


TTTTCACCTT 


TTGTAACAGC 


ACTATGTCTG 


1440 


TGTAAACTTT 


CTTGTGCCCT 


TATACCTACA 


CCACATGATA 


TAGCACTTTG 


GGGATATTGT 


1500 


TCTTTATACC 


AATTTTGAAA 


ATAGCGAAGA 


AATCTGTCGG 


GATTACCATT 


TGATAATGAA 


1560 
TAATATTTTT TTAAACCGTC ATCAATATTT TCCAAAGTTA TGGCATTTTT AGGAAGAGAA 1620

TGAATCCATT TTTCTTTTTC GCTTGGATCC CACATTTTCC ATTGAGGTTG AAAAAAACTA 1680 

GTGTTGTTAT CCTCAAAAAA AGGTAAGCAA AAATGATAAA TATTTTTTAC TCTAGGTAAC 1740

TGTTCAATTT TTTTTATAAA ATCAACAGTT GCTGTGTAAT TTGCTTCAAT ATTTAATATT 1800 

AATAGATCAA AATTTCTATC TAATTTTTCT GCAACTAGAT TAGCTAACTG AACCATTAGC 1860 

CCACTATCCT TACCGCCACT AAATGAAAAA TACACCTTAT CAAATTCTTC AAATATGTAT 1920 

TCTAATCTCT TTAAAGTAGC ATCAAAGACA TTCATTTTCC ATCCTCCCTA AAAAATAAGC 1980 

ATTTAATATC AATGGTAATA CATTAAAAAC TTTGTCAATA ATAAGGATAT ATTCCAAAAT 2040 

CCATTCATTG GAAATAGAGA CCTTCTTTAT TAATGTATTA TTACTTACAT AGCGAAgcAA 2100 

TaCAAACATT ATCCAATAGT TTCATCTACA AAAAATAAAT TACTTTTAAA ACATCTTGAT 2160 

TATGAATTCA TTTCCCAAAG GCCTTTTAAA ATTGATACAG ATGTTCGTTC ACTTTTTTTA 2220 

GACTTATTGT ACAAGTCTTT TTAATAGTCC CAGAACAAAA TAGTTTGTTT ATCCAGCTTC 2280 

AAATCATTAA AGGCTAATGA ACGGTTTATT TTTTCGATTA CTTTAGATAA AGCTACAGAA 2340 

ATTTTTATAT TTTCACAATG CTCAGTTTGA ATTTCAATAA TTATATCATC AATTAGTTTT 2400

TGAATTTCTT ACCTATCAAC ATATTCAGTC TTAGAAAGAG AATCTTTTAA AGCTACTAGT 2460

GATTCAACGA TTTCAGACAT TTTCTTCCTC CTTTCTAGTG GCCTGTAAAA CCGTGTAGAA 2520 

CAGAGTCCAT CGAACCGGTA ATTACACTAT CTACaGCGCC ATTCCAATTT ACTCTAcATT 2580 

TTCATTTTTC AGATTTGTCA GCCCATACAG ATAGCCTTTT TGACGTTCAT ATTCGGATTA 2640 

TACCATTCAG CCGCAGAAGC AGATTCGCTC GACACAAACA ATCCCATACC AACAAAACCA 2700 

ATACACATCA CACTATAGCC AACTAGTTTT TTCCTCTCGk ATCCATCTCC TAAAAAATAG 2760

AATAGTtCTT GTAATTATTC ATAAGATATA CTTTTCCGTT TTTTATCCTT TCTGCTTTAT 2820 

AACAACTTTT TTCAAGTAAA GAATGACATA CTTCTTCTTA AAAAAACAGT CTGTTTTTTT 2880 

CTATTTTTTC GGTGTATAAT TAATACGAAT GTTCATTTTT TGTTTTAATA CGCACTCCGT 2940

ACATAATAGA TCTTGATTGC ATCAATCTTT TGTACTTTTT GATTTATCTT AAGCTGAATG 3000 

AATAGAAACC TAGTTACCGC ATTACTTTTT TGTTCTCCAA ATAACAAGTA ATGATCCGTT 3060 

TGGGTTATAT CTTGTGAAAC ATTATGAATA CGACCAATGT TTTTTTGTTG TACATGTGGA 3120 

TCGTTGTCAA TTCCATTTTC TTTTATACAA CAATCGATTA AATAGTTTTT TACGGATTGA 3180 

TTTCGTTCGT ATATGTCCGT GTAATTTACC CATTTTCTTC CAAAATCCTT TAACAAATGT 3240

TGAATTACTT CAAATATGTT GGATTTTATT TTGTTTTCCT GATTTCTTTT ACTTTCTATA 3300 

CTTAAACTTT CTGCTTGCAT CATCGTTTCT CTTGATATTG AGGAAACTGC TGTTTTTTTC 3360 

TTCAGAAGAA ACCCACCTAC AACTTCTAAG GACAAACAAA AGCAACCAAG AAAAAACAAT 3420

ACTTTTTCAC AATTGTTTCA TCTCCCTTTT CTACTAAAAA TTTTCATTGA GAGAATTCTT 3480

TTATTTGGCT TTACGAATCA AACTTCGAAT TTTCGTTTGG TTATAGGTGC GTTGATACAG 3540
TGCTATGATT CCCCTTGCTT GGCGTTCTGT TCCACAGTTG AAAAATAATT CTCACTATCT 3600

GAAACATCCG TTATGATACC GGTGTGTCCA TAGATTCCAG GAGACAATTG GCTTTACCAG 3660 

TTGGCCATGA AAGGATTCTT TTAAACGACA CTGAAATAAA AGAAAAGAAG CCAAATAATT 3720 

ATAAGGCTTC CTTCCTTGAA ATTGTTTTCG TGTCATTGAA AAAGCAAATA ACCACGGACA 3780 

AAAAATTTGT CGAGTAATCG TATGTTCCAT ATCAAATGGA AAAATTCGAA AAATCCCCCC 3840 

CAAAAAAGCC AATCCTAAAC CATACCACAA ATGTTTGTGT TACGGGACGA GGTAACGCGA 3900 

CCCCGTCCCA CAATTTATAA AACACTCGCT CTATATACCA AATCTTTGTC CAATTGCGTA 3960 

TATTGCTCAT TTTATCGCCC CCATGACCGC ATGCGATCCA GAATCCAATT GAATACTCCA 4020 

GACGTATCTC CAACTAACCA AACAGCTACT GCCAGCACAA GAATCAATGG AATGGCTTTG 4080 

CTTGCTTTTC GTTCAAAGGC AATATAGGCC GCACCACCCC AAGCATCGCC AATATAATCA 4140 

AATAATTAAA ATCCCCAACA AAATTGGTAA ACACATTTTG AAAAAAATCA TTTGCTTTTT 4200 

TCCTCCCTGT TAATACACAT TGAACTAATG ACAATACACA TCACCACAGA GCACAACTGA 4260

TAAAAATTTC ACTTGATATT CCTCATTATG CATTTGTTCT GTTCTTTGAT TGCTGTTTTT 4320

ACGTAAGAAT TATAAATAAT TTCTCACACG GCGATACATT ATTTTCACAG GAACATGCGT 4380

ACAATAAATC CGGTTCTGCG TACTTTTCTT TGTTTTAGGA TCAAGTTTCA TAGAATAGAA 4440

ATCATAAAAA TATTTACCAC TTTTTTATCA TATGTTCCTT CGATCACTAG TGAAAAGCTA 4500 

GGGTGGTAAA TCCAACAAAT ATAATTGATA CTTCCCTCTA CTGGATAAAG GGACAAAACC 4560 

AAGTACCTTT AAATGGTCTG ATACAGTCAT AAACACGATC TATCTCCTCC CAAATAAAAC 4620

AAAAAAAGGC TGTAATCGCA TTTTGATTAC AGTCTTTTTT TGTAAATGTA TTAATTAAAG 4680

CTTTACTTCC GGTCTTTTTA TTTAATGACT TTAGTCAGTT GAAATCTCTA AAGAGATTTC 4740

GACAAAAAAA CCACGCGTAA TGCGCGTGGA GGTCAGAGCT TTAGCGATAA AAGTTTAGCA 4800 

CATACTCGTT GGAATTGTAA GTATCATCTT GTGTTTACAC CAAAATATAG AAGAAAAGTA 4860

ATCTATGGTC AATTGAGGCG AGATATAGGG AAAATACTAA GAAAATTGTG TGAGATGAAA 4920

GAAGTCGAAA TCATTGAAGC GCATGCAATG CCCGACCATA TCCATATGTT GGTAAGAATT 4980

CCACCGAAGT TGAGTGTATC AGGTTTTATG GGTTTCTTAA AAGGAAGAAG TGCAGTAATC 5040 

ATTCATGAAC GTCATGCGAA TTTAAAATAC AATTATGGAA ATCGATCCTT TTGGTCAAAA 5100 

GGTTATTATG TGAGCACTGT AGGACTGAAC CAAAAAACGA TAGCAAAGTA TATACGAGAA 5160 

CAAGAAGCAG AAGATCGAGT AAGGGATAGT ATAAACAAAA GAGAGTATAA AGATCCATTT 5220 

AGGAAGTAAC AGCTAGATAA GAATAGAAAG GTAGCGGTTG GCGGTCTCTA AAAAAGCTCC 5280 

CTTAAAGGGA GCAGCAGGTA GCGAGCCCTT CTAGGGCTAT TaAAAAGCCA CCCGCTATGC 5340

GGGTGGATAT TTACTTAATA AATCAATTTT CCCAAATTAT TTTCTTTAGC TTTCTTAACG 5400

ATTTCTGTTG CAACTACCAC ATCTAATACA GCCGAACCAA CTGTTTTGAA GATCGTGATT 5460

TGTTCTCCAC TTGTCCGACC GACCAATTCT TCATTGATCA GTTGGCCTAG TTCTCCATGA 5520 
TAGPTGTCTC


TTTGAATATA 

x x x unn inin 


TPPATPTTGP 


AGCGGTGAAA 


TGaAGTCCCC 


AGCTTCTTTT 


5580 


AGAACACCGT 


CCATGGTATC 


AAAAATAACA 


ACATCTGCGG 


CTTTAATTAT 


TTCTCGTGGa 


5640 


ATTTCACACA 


TTTPPGGTGT 


GTAAGCACCG 


ACTCCATTGA 


TGTGCGCACC 


TTTTTTGACC 


5700 


PAPTPAGPAG 


AGAATGTPGP 


APGTTTTGAA 


GTGGTCACAC 


TTGTGATGAT 


GTCTGCTCCT 


5760 


TP" a aPPPaTT 
IbAAbbbAl A 


PTTPaPTPPT 


TTTTPTTPP.P 


PGPATPGTTA 


CAGAAAATTG 


TTGCATCATT 


5820 


1L1 1 b AbC/\fA 


ATTPPTTPPP 


APGPTPAAAA 


TPAATATCAA 


AAATCCGAAC 


TTCTTCTAAT 


5880 


1 I AbbAAbbb 


TPaapaTTPP 

1LAAUA1 luv- 


TTPT a aTTPT 


PAPTGTGCTT 

\Jf\v_> X O X vJ\-> X X 


GTCCACCGGT 


TCCAATCAGC 


5940 


bbbbbbAl 1 1 


TanraTPTTT 


TTTTPPAAPA 
1111 oL-rtnun 


AGTTPTGTTG 
no x i\>iui xo 


CTGCTCCTTG 


AACAGCGCCT 


6000


b 1 1 b 1 1 AA 1 i 


rTrTpaa aTa 

b 1 b 1 b AAA 1 A 


Zi PT Zi PP Zi TP T 
Ab 1 MCbrt 1 C 1 


A Zi APAAGPGG 


ATACCATACC 


CGTTTCTGGA 


6060 


I bbAA I ALbA 


xL.AJ.Cbl Ibb 


TppTaraPTT 

1 bb 1 AbAb 1 1 


pr.Taar.TTTT 

Co X rtrib X X X X 


TPTTPATPTT 
1^1 i \Dr\ lull 


TTTAGGATAP 


6120 


ar'ppaa appa 


TTTTPaP aPP 
1111 bAbAbb 


TZiPTPPTPPT 
1 Ab 1 Cb 1 CC X 


TPGPPAPPTT 


PTACATAAGC 


TGGCATATAT 

X \J\J\-rr\ X C\ X C\ X 


6180 


AAov* 1 1 Iblt 


Lui. 1 AC 1111 


TPPPUP ATPP 


ATATTPPTPP 
nlnl 1 oo 1 o>w 


GAAGAGGCAC 


AATTGPCTTT 

> X X XIX 


6240 


bb I ut 1 oM 1 


flPZiZlTPZiPliZV 
ALAR 1 oALM 


TPPTTTTTTP. 
1 bb nil lib 


TPTGPATPGA 


TPGPTTPTPT 


CATTGAAAAA 


6300 


1 I 1 1 1 b A i AA 


A Al b 1 A b 1 1 1 


TTTZiZlPZiZiZiT 
1 1 innbrt/irt 1 


APPATTTGAA 


TPGPTPPTTA 

X^O^X^u>^X l r\ 


ATTAGCTGCT 


6360 


b 1 Abb 1 1 I I 1 


1 . 1 AA 1 1 bAb 1 


PPTPZiZiPATP 


GTATATGPAP 


TGTATAGTAA 

X \3 X n x r\\j x An 


GACCAAAATC 


6420 


ALbAiLLAl 1 


1 b Ab AA 1 A 1 A 


PPTZiTPTZiZiZi 
bb 1 A 1 b 1 AAA 


PaTTTPAPTA 
\3r\ 1 X X Vjrtv^ X r\ 


GGAATAPAGP 


GAPPAATACC 


6480 


bb 1 AA 1 /iCriv- 


PP a a PPTTPa 


AGTGAAAAGT 


GTGATTTTTC 


TGCTGTATTC 


ATCAAATTTA 


6540 


A JL oM 1 1 b AA 


PaPTPPPPQT 
bAb 1 bbbbA 1 


CGGTACTGAG 


AATGTACAAG 


CCCCCATCAT 


AATAGGAAAG 


6600 


rr"v r* n a a c t p 

1 bbAAb 1 b 


pflTTpaappp 

bAl 1 LAAuLL 


TAGCAAGTAG 


ACAGTTACCA 


TTGTTAACGC 


ATAAGAACCA 


6660 


Al bbbbAl A A 


TPTTP&STPP 
1 b 1 1 LAA1 bb 


GCCATAGACA 


AAGAGTAAAA 


ATCCTGCCAA 


GATTAATTTT 

VJ*x i. x nn x x x x 


6720 


bbbbbAl AbA 


TiTTPRPTPPP 

All bAb 1 bbb 


TGTTCCGTTT GAGgGAAkCC AATTCATTTG 


TCCCATAAAA 


6780 


ZiTTaaPaaTP 
All AAb AA 1 b 


pappaaTPap 

b Abb AA 1 bAb 


CAATCCTATG 


CCGACAAACA 


GTTTGATCGT 


TTTCTCTGGT 

XXX v*-* X W X x 


6840 


AAL 1 1 L-n^un 


pa a atpttpp 

CAAA 1 b A 1 bb 


TCCTAAATAT 


GCACCGATAA 


TTTGGCAGAT 


AATACAAACA 


6900 


PPT a aTPTPT 


TPATPPPT AP 


GTTGATAGCA 


GTGATATAAG 


ACAATGCCAT 


GACGGCTACC 


6960 


GGTAPTAPAP 


a TTPPPTLVTT 


CAACGTTCCG 


GGTAGTTTTT 


TCATGCTGAC 


CCATTTCAAC 


7020 


TTTGGATAAA 


PPAPPPTTPP 


GATAGCAAAG 


TCGGAAACGC 


CAAATGTTGA 


AAGAAAGAAG 


7080 


GTGATAAATG 


ftPPTPAAPPP 


CAACAGACGC 


ATATCCCCAG 


GTTCTTCTTT 


GAAATTTTGT 


7140 


TTPTTTTTP A 

1XV7XXXXX \jr\ 


AAAPPTPTPT 

AAAUU 1 ^ 1 ^ 1 


TAGAAAGTAA 


ATAATAAAAT 


ATAGATTGAT 


TGCGACAATT 


7200 


AGTGCCAGCA 


AAATATTCCG 


AATCATAAAA 


ATTCCTCCTA 


CTAGTAAATC 


TTTCTGATTC 


7260 


CCTCAAAGTT 


CTCACCATAT 


TTTGCTTTAA 


GTTCTTCAGA 


AGATTCTGCA 


ATCACTTTTA 


7320 


GTGTATATGC 


ATCATGTTTG


ACATATTCTT 


TAGCTAGATT 


GACCATGGCG 


TGGTTTTTTG 


7380 


TCCAAGTGAC 


ATCGGTTGAT 


TGCAGTTCTT 


CATTAATGGT 


GCCAAACACG 


ACTTCTTGCT 


7440


CATCGGCGAC 


GATGTTGATC 


CATCTTGCCC 


CAAAGTCATC 


AAGTTTGTCT 


GTTTCAAAAC 


7500 
CATGAATGTA ATACTTTTCT AAGGGAAGAT CATATTCATG GGTAGAACTG AATAAAATCA 7560

GTACAAATTT ATCGACTTTT TTTTCGGCCT GCTGCAATGC CTTTAGCAGT GAATCCgTGA 7620

GGTTCTCATG CCAAACTTGG ATGAGCAAAC TTTCTTTTGC ATTTTtCACC AAATGCTCTG 7680 

CTTaTCCaGA ACATACTcGA TGCCATCAAC TTTnCCACAG CATGTCTTCT TCATCTTTTT 7740 

CCTTAATCAT TcCTAGATTC TTTGTAAGAa AAGaAAGATC aTTTTTAACA CTTTTTTCTA 7800 

ATTTCTCTAA AAACTCaTTT GCCGAGATGG CATGATAAAG TTTAGGTTCT GATTTATTGA 7860

CGACAATCAA ATTCTTTTTT AATAGTTTTT CCAAAATGTT ATATACTTTC GAGCGTGGCA 7920 

CcCTGAGATT TTACTTACTT CATATCCTGT AAGATTACCT TTTTCTAATA GGGTGGTATA 7980 

CACTCTTGTT TCCATCTCTG AAAAATCATA TTTTTTCATG ATTTGAATCA TCGTTTCCAT 8040

TGATTAATCC TCACTTCATT TTTAGTAGCT ACTACGGTAA CAACTATAAT TTTTTTTTGA 8100 

TCATCTGTCA AGAGTTAGTT CAATAAAACA CATATAAGAA TTAGCTTAAA AACAAGCTTT 8160 

tTATTAAAaA TaAATaATAT AATAGGGA 8188 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3467 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

TGATGAGCAT AAGGAGGACA GGCATGAACA AAAAGAATAA TGGCATGAAA AATGGTTTAT 60

ACTACGTCTT AGTAGTACTT GCCATGGTCA GnGTTGTCTA TTTCATTTTT GGAAATAACA 120 

ACCAACAATC GCCAGACATC GATTACTCAA CCTTCCAACA ACAATTGGAA GACGGAAAAG 180 

TCAAAGACAT GACCATTCAA CCAACTAATG GGGTTTACCG GATTGAAGGT CAGTATAAAG 240

AAAAACAAGA AGTGAAAGAC ACAGGTGGTC TGTCATTGTG GGGCTCAACG CAAGCGTCGT 300 

CAAAAGGCTT CACAACAACC GTTTTACCTA GTGATACAAC CTTAGCTGGC ATTCAAGATG 360 

CGGCACAAAA TAACAAGGTG AAGCTAGTTG TCAAAGAACA ATCAACAAGT GGTGCTTGGT 420

TGTCACTGTT GTTTAGTTTC TTACCATTAG TGATTATTTT CTTCTTCTTC TACATGATGA 480

TGAGCCAACA AGGCGGTGGC GGTGGCGGCG GTGGCCGTGT GATGAACTTC GGTAAATCAA 540

AAGCCAAAGA AGCCGATAAA AAAGCTAACC GTGTCCGCTT CTCAGATGTA GCTGGTGCGG 600 

AAGAAGAAAA ACAAGAATTA GTTGAAGTAG TGGAATTCTT AAAAGATCCA CGTCGTTTCG 660 

CTGAATTAGG CGCACGGATT CCTGCGGGCG TACTATTAGA GGGACCTCCA GGGACTGGTA 720 

AAACGTTACT TGCTAAAGCC GTAGCAGGTG AAGCAGGTGT GCCATTTTAC TCAATCTCAG 780 

GTTCAGATTT CGTTGAAATG TTTGTCGGGG TCGGTGCTAG CCGTGTCCGG GATTTATTTG 840

AAACAGCGAA GAAAAACGCG CCTGCAATTA TCTTTATCGA TGAAATCGAT GCAGTTGGTC 900 

GTCAACGTGG TGCTGGTATG GGTGGCGGAC ACGACGAACG TGAACAAACC CTTAACCAAT 960 
TATTAGTTGA


GATGGATGGT 


TTTGATGGCA 


ATGAAGGAGT 


TATCGTAATT 


GCTGCAACGA 


1020 


ACCGTTCAGA 


CGTGTTAGAC 


CCAGCGTTGT 


TACGTCCAGG 


TCGTTTTGAC 


CGCCAAATCT 


1080 


TGGTAGGACG 


TCCAGATGTC AAAGGTCGTG AAGsgATTTT 


aCGTGTTCAT 


GCGAAAAACA 


1140 


AACCGTTAGC 


TGATGACGTT 


GATTTGAAAG 


TTGTTGCACA 


ACAAACACCA 


GGTTTTGCTG 


1200 


GTGCTGATTT 


AGAAAAT r.TC 


TTAAACGAAG 


CAGCCTTAGT 


TGCCGCTCGT 


CGTAACAAGA 


1260 


AAAAAATTGA 


TGCTTCTGAT 


GTGGATGAAG 


CCGAAGACCG 


CGTAATTGCT 


GGTCCAGCTA 


1320 


AAAAAgATCG 


CGTGaTTAAT 


AAAAAAGAAC 


GCGAAATGGT 


GGCTTACCAT 


GaAGCAGGAC 


1380 


ACACAATTGT 


TGGTTTAGTC 


TTAAGCCGTG 


CTCGTGTTGT 


CCaCAAAGTA 


ACAATTATCC 


1440 


CTCGTGGTCG 


CGCAGGCGGT 


TATATGATTG 


CTTTACCAAA 


AGAaGATCAA 


TTCTTAATGA 


1500 


CTAAAGAAGA 


TATGTTTGAA 


CAAATCGTTG 


GCTTACTTGG 


TGGACGTACA 


GCAGAAGAAA 


1560 


TTATCTTTGG 


CGTTCAATCA 


ACAGGGGCTT 


CAAATGACTT 


TGAACAAGCA 


ACAGGGATTG 


1620 


CTCGTAGCAT 


GGTAACTGAA 


TATGGAATGA 


GCGACAAATT 


AGGACCTGTT 


CAATATGAAG 


1680 


GAAACCATCA 


AGTCTTTGTC 


GGTCGTGATT 


ATGGTCAAAC 


AAAAGCTTAC 


TCAGAACAAG 


1740 


TGGCGTTTGA 


AATCGATCAG 


GAAGTTCGTC 


GTATCTTAAT 


GGATGCTCAT 


ACGAAAGCGC 


1800 


ATGAAATTAT 


CGAAGCGCAC 


CGTGAACAAC 


ATAAACTAAT 


CGCTGAAAAA 


CTATTAGAAT 


1860 


ATGAAACATT 


AGATGCAAAA 


GCAATCAAAT 


CATTATTTGA 


AACAGGTAAA 


ATGCCTGAAG 


1920 


GTGCTGACAG 


CGACTATCCT 


AGTGAAAAAG 


AAGCGCAAAC 


ATTCGAAGAA 


GCAAAACGTG 


1980 


CTTTAGAAGA 


AAAAGAA^CG 


CAAAAACAAG 


TTGAAGAAAA 


ACAAGACTTT 


GAAGAAGCGA 


2040 


AAAAAGAATT 


ACATGATGAA 


GCAGAAGAAG 


TCAAAGTAGA 


AAGCGAACAA 


ACAGAAAAAG 


2100 


AAGTCCAATC 


TGAAGAGAaA 


AAAGATTCAG 


ATTCCAATTC 


TGAATATGAT 


CGCAATAACT 


2160 


ACGAAGATCG 


TTATAAATAA 


CCAATTAGTA 


GAAGGTGCAG 


TATTTGTCGA 


GTTGACGAAT 


2220 


ACTGCACCTT 


CTATAATATT 


AAGATATTCT 


ATAGAAGGAA 


GATTATTTGG 


TTGAAAAAAC 


2280 


ACCTGATTCC 


TAGTATTATC 


yGACTAATGG 


aCmAAAAAGC 


ACCAAGAGAG 


nGAGGAAAAA 


2340 


GAATGGAAGA 


TTATTTAGTA 


AAAGCCCTTT 


GTTATAAAGG 


TTCAATACGA 


GCATATGCCA 


2400 


TCAGTGCCAC 


TGAAACTGTT 


AGCGAAcGCa 


aAGAAGACAT 


GATACATGGA 


GTTCTTCTAC 


2460 


TGCAGCGTTA 


GGACGCACAT 


TGATTGGAGC 


CCTTTTACTA 


GGAGCGACTT 


TAAAAGGTGA 


2520 


CGATAAATTA 


ACTGTAAAAG 


TACAAGGAAA 


TGGCCCAGCT 


GGCGCTATTA 


TTGTCGATAG 


2580 


TAATGGCCGC 


GGAGAAACAA 


AAGGCTACAT 


TAAAAATCCA 


CATGTCAGCT 


TAAAGTTGAA 


2640 


TGCAACAGGT 


AAAATTGATG 


TACGTGGTGC 


TGTCGGTAAC 


GAAGGAATTT 


TCACAGTCAT 


2700 


TAAAGATTTA 


GGATTAAAAG 


AAACCTTTTC 


TGGACAAACA 


CCTATTGTTT 


CTGGTGAAAT 


2760 


CGGTGAAGAC 


TTTACATATT 


TTATGGCGGT 


CTCTGAACAA 


GTTCCTTCAG 


CTATCGGTTT 


2820 


AGGTGTATTG 


GTGGACACGG 


ATGAATCTGT 


GAAAGCAGCC 


GGTGGTTTTA 


TGATTCAAGT 


2880 


AATGCCAGGT 


GCCGATGAAA 


GTACCATTGA 


TTTTATTGAA 


CAGCGTCTAG 


CGGAAGTGCC 


2940 
ACCAATTTCA CAATTACTAG AAAATGGAGA AACACCTGAA CAAGTCTTGT ATCGCTTATT 3000

GGGAGAAGAC GAAGTAGAAA TTTTAGAAAA AATGCCTGTA CAATTTAAAT GTGACTGTTC 3060 

CAAAGAAAAA TTTGCGACCG CTTTGATTGC TGTTGGGATC GATGAATTAA ATGCGATGAT 3120 

TGATGAAGAT CATGGAGCAG AAGCAGTTTG TCAATTCTGT AACAATAAAT ATGACTATAG 3180 

TGAAGAAGAA TTAATTGAAc TTCGCGATGA AGCGATTCGC AATACGAAAC AAAAGTAGGA 3240

GGAAAGACTA TGTGGAAAAT TGGAGACGTT GAAATCCCCA ACCGTGTTGT CGTAGCTCCG 3300 

ATGGCGGGTA TTAGTAACGC CGCTTTTCGT GTGACTGTAA AAGAATTTGG CGCaGGATTA 3360 

GTCGTTTgTG AAATGATTAG TGACAAAGGa ATTAAATTAA GAAACAAAAA AACCTTAGAA 3420

tGCTATATAT TGATGAGCGG GAATAnCCTT TAAGTGTACA ATTTTGG 3467
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CTTGAnACAA TTGGGGCAAA AAATGTTTTG GCAAATGAAA AAGGCTGGTT ACCAGTGACA 60

GAAGAAGCGG CCATTGCTGC AAAACCAGAG GTCATTTTAA CCAATGTTAA CTATATGAAA 120 

GATCCCGCCA AAGAAATTTT AGCTCGTAAA AACTGGGAAA GCGTTCCAGC AGTTCAAAAC 180 

AAAGAAGTTT TCGAAATCGA TAACATGTCT AGCTCTTTAC CAAATAACCA CATTACCAAA 240

GCATTAAAAC AAATGGCAAA AGCTGTTTAT CCAGAGGAAT ACAAGGATTT AAAAGATGAG 300 

TAAAAAAACG TGTCTCATAC TGATGCTCTT GATTAGTATG AGCATGCTCT TTTTGGGTAT 360

TAAAATTGGA AGTGTTCAGA TTTCGTTAAC GGATTTGCTT CAAATTTTAG TGAAAAAGAC 420

GGTTTCAGAA GATGGGATTT TAGAAGGAAT CATCTGGAAT GTTCGGTTAC CGCGTGTCGT 480

GATGGCCTAT CTCGTGGGCG CAGGCTTAGC AGTAAGTGGC ACGGTGATGC AATCGCTCTT 540 

AGGTAATCCT TTAGCTTCTT CGTATACTTT GGGCGTATCA GCGGGAGCAT CCTTAGGTGC 600

TGCGTTAATC ATGGTAACTG GAGTAACTGC TTCGATCTTA GGGGCATTTT TATtGCCTTT 660

AACTGGTTTT ATCTTTGGCT TAGCAACGGT CTTTTTAGTT TTGTTATTTA CGCAAACAAT 720 

GGATAGTCAA ATGTCCAATC AAACAGTTGT TTTAGTAGGA ATGATTATGA CCTTGTTTGT 780 

CGGTGCTATT TTGACGTTGA TTACAGCACT TTTCCAAGAT TATTTGAAAC AATTAGTCTT 840

TTGGCAAATG GGTAGTTTTT CTGGCAGTAA TTGGCAAAAA ATTGCTATTT ATTGTCCAAT 900 

CTTGCTTGTC AGCAGTCTGT TTCTTTGGTT TGATGCCAAC GCATTGGATG TTCTCGGTTT 960 

AGGAGAAGAA CATGCGATGT TAGCGGGAGT AGAAGTCAAA ACAGCAAAAC TACGCATTAT 1020 

TCTTTTGGCT AGTTTATTGG CAGGAAGTGC GGTTTCATTT GTCGGTGTGA TTGGTTTTGT 1080 
GTTTAAATTG GACAGCCAAA ATTAACGAAG AGGTATTAAT TTTTACCAAT CGTTTATCAG 3120

ATTATTTTTA TGCGTTGGCT CGCTGTTTAA ATGTTCAAGC ACAACGACCT GATGTTTTTT 3180 

ATGAACGAAG CGAAATGGTT TTCCATAAAA TTAAAGAAGA TGGATTGTAA ACAGTTGCTT 3240

TTCTAAAAAA CTTATGGTAA ATTTCAGATA GaTTAAaCGA ATATTTTATG AaAAATATGA 3300 

ATGGAAGCCA GTGAGAATCT GGCACGGTCC CGCCACTGTG AAGAAGCAAG GTTGCTTTTA 3360 

AGTCAGGTCT TTTCATTTTT TCATTATTGG GCATGCTGTT TCGAGGCAAA ACAGGATGTT 3420 

TCTTAACAAC GCTTGTGTTG AAAATCCAGC CCAAGATATT TGTATTAATC CAATTAATGG 3480

CACGAGGAAG TGCTTTTGGG AATTTTGGGC TGGTCTTTCA ATGTTGAAGG AGATTTCAGG 3540

TACAATGACG TACTGCTATA TTCCCTTTTT TTATTGACAA TTAATTAAAG GCGTTGTACT 3600 

CTATGGATAA GGTTTCGTGT ACAATGGCGT ATACATAAGG AAGCAAAGAC GCTTCAGACA 3660 

GATGATTTTT CGTTTGTTTG AAGTGTCTTT TTGTATTTTA TTAAGGAGGA ATTGAGATGA 3720 

AAACAATTCA TTTCCCTACC GAACTATGGG TAGGAGAAGG CGCATTAGCC AATTTGGAAA 3780 

CACTTCATGA TAGACGTGTT TTCATTGTCA CAGATCCATT CATGGTTGAC TCTGGATTTG 3840

TGAATGAAGT GACAAAGCAT TTAACAAAAA GTGAATGGCA AATTTTTAGT GATATCATCC 3900 

CCGATCCACC AATTGATAAG ATTGCTGCGG GAATTAAGCA TTTAGCGACA TTTCAAGGAG 3960 

ATACGATATT AGCTTTAGGT GGGGGATCCG CTATTGATGC GGCCAAAGCA ATGAAATTTT 4020

TTGGCAAACG TACTTTGCAA ACTCAGATTG CTGAGTTTAT TGCGGTTCCT ACAACTAGTG 4080

GAACTGGTTC AGAAGTTACG AATTTTTCAG TGATTACGGT TGCTGAAACT GGCACCAAAA 4140 

TCCCGCTTGT AACGGATGAA ATTCAACCAG AAATTGCGAT TTTAGATACA AATTTAGTGA 4200 

TGAGTGTCCC ACCAAAPAIC ACAGCGGATA CAGGAATGGA CGTCTTAACG CATGTTATTG 4260

AAGCGTATGT TTCAACAGAA GCAAATCCGA TTTCAGATGC GTTGTGCGAA AAAGTCGTTC 4320

GCTTAGTATT CGACAATTTA GAAATAGCCT TTAATGAGGG TAGCAACCAA CAAGCCCGTG 4380

AAAATATGCA TTTAGCTTCA TGTATGGCAG GGATGGCGTT TAATGTTACT tCGTTAGGCT 4440

TGAATCACGG TATTGCTCAT GCGGCCGGTG CACGTTTACA TGTGCCACAT GGACGAATGA 4500

ATGCAATGCT CTTACCTGAA GTCATTGCAT ATAACAGTGG CTTAGCGAAT GGCAAAGTCA 4560

CCAACGAACC AACAGCTAAA CGTTATGCGC AGTTAGCAAA TTGTCTAAAT GATACACAGA 4620

CAACCAATGC ACGGATTGGT GTTCAACAAT TTATTCGACA AATCAAGCAA CTCCGTCAAA 4680

AATTAAACAT GCCTGCCACT TTTAGTGACT ATGGTTTACC AAAAGAAGAA GTCCAGGCTG 4740 

CAATCCCAAA AATAGCTGAA GGCGCATTGA TGGATGGCTG CACCAAAACA AATCCTGTCC 4800 

AACCAACAGC AGCAGAAGTG ACAAAAATTT TAAATTCTAT CTTATAAGCG ATTGACAGGA 4860

TGAAATCGAT ArCATATAaT GTGGGTAGCA CAACGGCGTG CTTCmAAATT TAAGAGCAAA 4920

GAAGCTCCTT AGTAGAAAGC GAAAGTGGAA CCTTTACGTT TTTTATTGAG GATTTTTTAT 4980

TTGCTAAATT AAAGGAGCGA GTCTGGTTGG AAGAAAAACA ACGAATGATT CAAGAATATG 5040
ATTTPPTTTa
Ai 1 1 \J\J X 1 1 A 


A ATPPAGAAA 


TTTTAGPTGA 


9720 


AGLALAAGA 1 


1 1AGLGL 1 1L 


A r* fiPTPPTP A 

ALAL 1 LL 1 LA 


appappTPPT 

Hbbnbb 1 Ijiu 1 


PPT A ATGTGA 


TGTATTTTGA 

1 U 1 / i 1 X X 1 \JC\ 


9780 


AACAGGTCAA 


GGCT C AG AAL 


rp rp rp rp rri z*' rp /-* 7\ 

I11L11L1 GA 


TV r^^p A APTTT 

AGLLAAL 111 


uuLbLnunbb 


A APTP APA AT 


9840 


GGAAGCGCGT 


TGTTATGGCT 


TAPPPA AA A A 

1 AGLLAAAAA 


TV rn rrrnr* a TPPP 
Al 1 1LA1LLL 


TaTaTTPTaa 

1H1 nl X VJ X/xri 


ATAPPGTTGT 
c\ X r\\w- v^o i lui 


9900 


CGGCTTTATT 


GGACCAGAA 1 


ATPTTTAPPA 

A 1 L 1 1 1 ALGA 


rp(pp A A A AO A A 

1 1 LAAAALAA 


PTPaTPPPPP 
o 1 bnl v^ULjCvj 


PTGPTTTAGA 
x uu x x xrvvjrt 


9960 


AGATCATTTC 


^TfTTT'TV A AT 1 

ATGGGLAAA 1 


TA !\PTPP7\RT 

1 AAL 1 GGAAi 


1 yCAAl bbbb 


TPTPaTPTPT 


GPTAPAPPAA 


10020 


LLAI A1GAAA 


PPTPJITPBBB 

GL 1 GA 1 LAAA 


ATPBTATPPD 

A 1 LA 1 A 1 LLA 


a a aPTTAP,PA 


ATGTTATTAG 

r\ X Vj X X X X 


CAACAGCTGG 


10080 


i 1L1 ALL 1 A I 


STPITPPPPS 

A1LAIGGGGA 


TTPPTPDTP.P 
1 1 LL 1 tHlbb 


PPATP.ATGTP 
k-oA 1 VJrt 1 VJ 1 C 


ATGTT A A ATT 


ATCAAAPAAC 


10140 


1 GGG 1 i LLAI 


r* A A A C^VCCC* A 
LAAAL 1 LULA 


PP ATTPPTP7A 
IwLjAI lUoloA 


AnPATTAP,P,P 


TTAPPTPGAA 


TTAAAGAATT 

X X *lll4*VJiJ4l ^ X 


10200 


1 GAAG AA 1 GG 


ATPP7\ A AA AA 

Al LLAAAAAA 


TPPPPTTZi AT 
Ibbbbl 1AA1 


PPA A A APfiP, A 


A AATTAAPAA 


GTCGPGPTGG 


10260 


LGA1LLA1 LI 


L1A1 1 1A1 1A 


AATZiDZiPPZiP 
AA 1 nnnL o Avj 


P.TPAPA A TAP 


AAATGAAPGA 


AAAAGAATTA 

rUinn\j*Lri x x n 


10320 


AAAGAAA 1 LA 


TTPPPPPTl A T 

1 1LLLLLAA1 


TTTZYZiPHPAIi 
1 1 1 nrtvyftbnn 


ATPP.TPPPTP, 
A X ou X X \J 


APAATPAAGC 


GGTTTPGACT 


10380 


LLLAL Ha 1 laA 


PTPPTPBBP A 

L 1 ot 1 biAALA 


Zi a 21 a PPHPTP 
AAAAv^L* nu 1 V^, 


APAAPAPAPP, 


TPAPTGAAAC 


GACAGAAATT 

vjnv^nvjnnn i ^ 


10440 


L AAL AAL L L L 


TA ATTPPTPS 

1 AA 1 1 LL 1 GA 


PBTT BPTP& Q 
LA 1 1 AL 1 LAA 


PTaPATTTAP 
b> 1 Hon 111 nb 


GTAAAPAATT 


APTATTGAAA 


10500 


a aTPPPPTap 

AAiuU^o 1 Abj 


Hi LL 1 LAALL 


apTaTTaaaa 

nLlnl 1AAAA 


A TP A A APPTT 


TPTCGCCAGC 


TCGTTTAGGC 

x xxx nwwv 


10560 


PTTPPTPPTP 


PZiPPP a PPPP 
LALLLALLLL 


TT AT a TP. APT 
1 1 A 1 A 1 vj AC 1 


AP.TTPAAPAT 


TACGTTTCCG 


AGCAGACCAT 


10620 


r*r*r* ecu rear* 

\j b. L b3 L. AbiL AL 


A A P A TPPBPT 
HAbA 1 LLAL 1 


pTTTTPapap 

bl 1 1 1 bnbnb 


PTA APPPAAP 


APTTGGTAAA 

r\\^ X X x nnn 


AGAAATGAAT 


10680 


TTTATTTCTA 


CAAAAACAAT 


CTGTAACAGT 


AAAGATGAAT 


ATTTAACACG 


TCCAGATTAT 


10740


GGTCGTCAAT 


TTGATGAAGA


AAATAGTGAA 


ATCATTCGGA 


AAAATACGAC 


ACCAAAAGCC 


10800 


AAAATCCAAA 


TGGTCGTAGG


GGATGGCCTT 


AGTTCAGCTG 


CTATCGAAGC 


GAATATTAAA 


10860 


GAAGTCCTAC 


CaGCmATTAA 


ACmAGGATTA 


AACATGTACm 


ACTTaGATTT 


CGACmATGTA 


10920 


GkTTTTGtTA AAtATTGtCg 


CGTGcCAGCA 


ATGGaTaAAA 


TTGGTGAAAT 


CACTGACGCT 


10980 
\jr\ loi loll 1


bLl 1 \j 1 X 


tppppa A PPT 


PCTGGTTTAG 

V^\-f J. VjVJ J. J. low 


TAACAGCTGA 


ATCTATGAGT 


11040 


P.PP T A P A T TP. 


PTTAPAAAPP 


A APTPTPPPT 

JL Vj 1 v_,0 O J. 


ATGCCAGAAG 


CACGGCGGAC 


AGTTATTTCA 


11100 


A ATATTPAPA 


A APPTPPPAP 


APPPPPTPTT 


GAAGCAGGGG 


CCTACATTGC 


CGAAATTATT 


11160 


AAA AAAATPT 


TAGACAAGAA 


AAAATCTGGT 


ATTGATTTAA 


AAGAAGCTGA 


ATAAAGGAGG 


11220 




AAAAATGATC 


GTCTAGGCGC 


AAATGTCCTA 


AGTATGAAAG 


TCATTCCCAA 


11280 




GCATTAGCAA 


AAGAATTATC 


ATTGAAACCA 


GAACATCGCT 


CATTAGGAAT 


11340 


1 G I 1 AG 1 1 GG 


GATTGTGATG 


ATGTCACTTA 


TGTTGCATTA 


GATGAAGCAA 


CCAAAGCCGC 


11400 


CGAl Gil AG i 


GTAGTTTACG GTAAAAGTAT GTATGCTGGC GCAgCAAATG 


CTTCAACCAA 


11460


ATTAGGGGGI 


GAAGTGATTG 


GCATTATTGC 


TGGTCCGAGT 


CCTGCCGAAG 


TCAATAGTGG 


11520 


CTTGAGTGTG 


ATTACGCAAG 


TCATTGAAGA 


AGAAGCCAGC 


TTCTATAGTG 


CTAACGAAGA 


11580 


1GAGAGGA1 1 


GTTTATTTTG 


CACATGTAGT 


ATCAAGAACG 


GGTAGCTTTT 


TATCTGAACA 


11640 


AGGGAA1A1 I 


CCTGAAGGTG 


AAGCGATTGC 


TTATCTGATt 


GCACCACCGC 


TAGAAPPAAT 


11700 


GG I IGGG 1 I A 


GATGCAGCAA 


TGAAAGCTGC 


CGATGTTCAA 


ATGGGTGTCT 


TTTATGGCCC 


11760 


ACL 1 1 G 1 GAA 


ACGAACTTTG 


GCGGCGCCTT 


ATTGACAGGT 


AGTCAATCTG 


PATPTAAAGC 


11820 


TGCTTGGAGI 


GCGTTTGAAC 


AAGTCATTCA 


AAATATCGCT 


GACAATCCGT 


TPAPTTATTA 


11880 


AGGA1 GIGAG 


CTGAATGAAA 


GCATTAGGAA 


TGGTCGAAGT 


CAAAGGATTT 


TTAGGTGCCA 


11940 


1 IAG1G1GGG 


CGATGCGGCA 


TTGAAAGCCG 


CCGACGTTAC 


TTTGTTAAAA 


GCTGAAATTA 


12000 


1 1 AAJ. GGGGG 


TTTAACAACT 


GTGGAACTGA 


TTGGTGACGT 


TGCCGCGGTT 


CAAGCTGCCa 


12060 


1 i GAAG 1 AGG 


TACGGAAGTA 


GCTAAAGAAT 


TAAATTGTTT 


AATTGCTCAT 


CATGTGATCT 


12120 


GAGGGG 1 1 GA 


TGCTCAAACG 


GAAGTGATTT 


TATCAGATCC 


AGAACCCAAG 


TCAGCACCTG 


12180 


AAGGGA 1 GGA 


ACAAGTAGAA 


G I AA I i GAAG 


AAGAGATTGA 


AACACCAGAT 


TTGGAAGAAC 


12240 


AG AG AG \j nL 


ACGCCAAAAA 


TTAGAAACAC 


AACGCGTGGT TGATTTACGC


AAACAAGCCT 


12300 


AG AAAA 1 GAA 


CCTAAGTAGT 


TTAAAGAAAA 


GTGAGATTAA 


ATTTGCCAAT 


AAAGAAGCTT 


12360 




AATCATGGCT 


GAAATAGAAA 


GAAGTGAAGA 


CGAGTGGAAT 


TAGTAGATAA 


12420 


1 i ntb 1 


TCCATCCAAG 


AAACAAGAAA 


TTTAATCAGA 


AAAGCCAAAG 


AGGCACAGCA 


12480 


aptattappa 


ACATTTTCGC 


AAAAACAAAT 


CGATGCGATT 


GTTCAAGCGG 


TTTCAGAGGC 


12540 


TAPATTTAAP 


CAACGAGAAA 


AGTTAGCGAA 


AATGGCCAAT 


GAAGAAACAG 


GATTTGGTAT 


12600 


PTATPAAPAT 


AAAATCATCA 


AAAATGCTTT 


CGCTTCAAAA 


GTAGTATATG 


ACGAAATGAA 


12660 


GGACAAAGCT 


ACAGTTGGTG 


TGATCCATGA 


TGATGCTGCC 


AAAAAAGTCA 


CTGAAATTGC 


12720 


TGTCCCAGTG 


GGTGTAATTG 


CAGGATTAAT 


TCCTTCTACC 


AACCCGACAT 


CAACAGTCAT 


12780 


TTACAAAGCA 


TTAATTTCAT


TAAAAGCTGC 


CAACAGCATT 


GTTTTTTCAC 


CACATCCCAA 


12840 


TGCGTTGAAA 


TCGATTATTG 


AAACAGTGGA 


AATTATcCAA 


AAAGCGGCTA 


TTGCTGCAGG 


12900 


GGCACCAGAA 


GGTTGTGTCT 


CGGTCATCAA 


AACGCCAACC 


ATGCAAGCGA 


CCAGCGAATT 


12960 
AAI GAAAAA 1


AAAoAAAGAA 


Al 1 1AAI ill 


not 1 nlv 1 uo 1 




TGGTTAAAGC 


13020 


GGGG1AG1G J. 


1 G 1 GGAAbAb 


r , ap*r , 7AaTPPP 
UAoGAA Hub 


loll OUOLL 1 




CAGCCTACAT 


13080 


CGAAGGAAGG 


P*P* 7\ 7A 7A r PP , TT/* > 
ob AAA 1 o 1 1 G 


U 1 G A 1 otHu 1 


u/vinv^nnrl x r\ 


ATfiGATTCAA 


AAACTTTCGA 


13140 


TAACGG1 AGA 


Al 1 1 G 1 bLAl 


UGGAAbAA 1 V- 


riri 1 Ini lo i ± 


P.AAACTGTCA 


ACCGTGAAGC 


13200 


GGTCaAAGAA 


GAGTTAAI I A 


AGGAGGGAGG 


vjlAl 1 i 1 lrt 


AP,Tpppr;ppn 


AAGPPHAPAA 


13260 


ATTAGCTAAA 


TTTATTTTAC 


G AG G AAAG G G 


1 MLnn 1 unrtC 


PPP.PAAATPH 


TP,P,fIAPC;TTP 


13320 


CGTTCAACAT 


ATTGCTTCAT 


TAGTTGG1 1 1 


A1G1A1 ILLrt 


Zi AaPaTPPPP 


PPTTAATTHT 
ov_f i Inni lul 


i J JOU 


TGCCGAAGAA 


tv 7v — _ tv m m tv 

AC Ac AT GT AG 


/"•OTirpP" TV 7\ TV Ti 7\ 

GCTTGAAA I A 


1GGA1 1 1 IGA 


APnPr*AHAaT 


TP,r;pAPPfi^T 


13440 


TATTGCTTTC 


TATACAGTTG 


AAAACTGGGA 


AGCAS 1 1 GGG 


Lbl 1A1UAA1 


1 ortrtrV 1 1 1 1 o 


13500 


AAAGGTGAAG 


GTGCGGGTCA 


m tv ta t\ m /"** o 

TACAATGGGC 


AT 1 GAG AG AG 




ZiPTPATTPCT 
nU 1 V^rl 1 1 VrfO 1 


13560 


GAATTTGGCT 


TAAGAAAACC 


r« ^> m m ^ rn 

CGTCTCTCGT 


TTGG 1 1 G 1 1 A 


TaPTAPPTPTPP 
AGAL/Lf 1 b 1 Vjo 


TZiPZiPTTPPP 
1 ritr\t 1 1 UUvy 


13620 


GGCATTGGTG 


CTTCAACTAA 


r> rn m tv m TV 

CTTAGTGCCA 


GGA1 1AAGAG 


TTPPPTPTP.P. 
1 1 ubl lol ova 


PPPZXPTTPPT 
v^otfto 1 1 OO 1 


13680 


GGCAGCTCAA 


CTTCTGATAA 


m tv m rn /■*/"* o rn m 

TATTGGGGTT 


G AAAA 1 G 1 G 1 


1 L.AA ill ntb 


TPPTPTTPP A 


13740 


TATGGCGTGC 


GTGATTTAGA 


tv tv /■* ta m rn rn 

AGAGATTCG1 


G AAG AA I 1 lb 


C TP n a 7i P A TP 


A2XPPAPATPT 


13800 


GTGGCTACTT 


^1 niftt^" TV TV TV 

CTTGCGAAAC 


Ti T* /"*/'"' TV TV rp/~* 7V 7V 

AACCAATGAA 


G AAGAAG 1 Avj 




PPTAPPTPAA 

OO 1 nuvv X VyiVil 


13860 


GTTTTAGCTC 


/■■* ^ rn m tv tv tv /■* m t\ 

GCTTAAACTA 


AC AAG AAAG G 


AGGGAG 1 AAo 


ZiPT&ZXAAPAA 


AT A A ATT ATT 


13920 


GGATTTAGGG 


TTTGGGAGCT 


tv tv z* 1 m/*** T\ Tt 7V rp rn 

AAG TCAAAT 1 


GG i AA1 1 G 1 G 


PP TA Ti 71 TTTTT 
L*^/viril 1111 


A A A A AT A A A A 


13980 


CATACATCTT 


AGGAGGAAAT 


m tv rn tv t\ rn /"■■ rn r"*' tv 

TATAATGTGA 


ATTATA 7\ ATP 

AG i AG AAA 1 o 


PTTT Zi CP. A Z\ T 
till rtbunn i 


PATPPtAAAPT 


14040 


CGTGGTTTAG 


TAGGTGCTGT 


TGAAGCAGCG 


GA1GGAA1 GG 


TP7A2i7APPPPP 


A a A PPT T APT 


14100 


TTAATTGGAA 


AAGAACAAGT 


CGGTGGCGG 1 


1 1 AG 1 1 AG AG 


T2\ 7ATPPTTPP 
lAAlool H-u 


PPPTPAPPTT 


14160 


GGTGCTGTAA 


AAGCAGCAAC 


AGATGGTGG 1 


GG 1 GG 1 uGAb 


papa 7APPTPT 


TP.PPPAATTA 


14220 


TTATCTGTCC 


ATGTAATTCC 


ACGTCCACAT 


AGAGAAG 1 AG 


AGGG 1 Al 111 


APPAPAPPPA 


14280 


ACAACTGAAT 


TATAATTTTG 


tv #n rn m m rn rn r*** 

ATTTTTTGGG 


GGAGAGG 1 AG 


oil G A 1 1 1 


TPTPTPPTA A 


14340 


ACTTTCATTG 


AGGAGGGGTG 


TTATGACAG1 


1 A 1 I AbAb AA 


PZiTZiTPPTTP 
oA 1 A 1 oLr 1 iL 


PPATPAAAAG 


14400


/-l rrt TV TA TA TA TV Tl TV 

GCTaAaAAAA 


GAACTAGAAG 


A I GGG AGGGA 




PPTPTAP,nPA 


PTTTTPTGAP 


14460


GCCAGCCGCC 


AAAAGCTATG 


1 AGGGGAGGA 


T end D. TT 2V JXT 


AfiPAGPTTCT 


CTTCAACAAA 


14520 


CCCATTGGTT 


TCTGAGGI I G 


All GAG 1 G AA 




PAAAAACAAG 


AACATTTATC 


14580 


7\ 7\ Tv /""* 7\ TV -,/■"■ TV 

G AAAG AAa C A 


ATTTATGAGA 


1 1GGGGAG 1 1 


Zi BPPPZXTPT21 




CTTTCTTAAC 


14640 


GAATGAAGCG TTATCAACAG AAGGTTGGTT ATATTTCGAA CAACAACAAC AGTGGTTAGA 14700

GAAATTTATT TTAGAACAGT CGCTTGTAAT GGAACCAGTC ATTTTGGAAT CAAAGGTAAC 14760

TATTCTAACT AACCAACAGA GACAATGGCG CTATTCCTCA AAAGAAATTC AATTACAACT 14820

AGATAAAATT ATTCTCCAGT TAAAAGGAGA ATCACAACTG TTCGGACTTT TTCAAGCTTG 14880

GGCTACAGAA TTAATTAGCA CCATGGCTGA AATATCACGA AACTAGGAGA TGAACAAATT 14940

GGATTbAA 1 b


A A T/" 1 A A ATTr 

AAlbAAAi lb 


TTPAAPAAPT 


lbl lAAALbA 


a m m p TV r* n A a r> 


AACAACAAAA 


15000 


TACTTT 1 bAA 


bl 1 bAA^L 1 1 


b 1 bbbbbAbA 


X b 1 1 Lrtl lib 


1 J. J- ^dnw 


AGATTGATGC 


15060 


t\ rn FT* tv ryi rn rn ^ 7\ 

AT TAT a I bbA 


bbAbbAl Al b 


AAPTAAPa&& 
AAb 1 AAbMAA 


ApTPaaaPaT 

rib 1 b AAAb A a 




rTGGACAGTT 


15120 


CGTTTGTAAA 


/-•TV 7> /"""» TV TV rp rp TV 

G AACGAAT T A 


bbb 1 lbb 1 bb 


7\PPTn A APPA 

Abb 1 AAAbbA 


PTrTTTPAAA 
^ 1 X 1 1 l*rirlri 




15180 


TTTAGGCCCT 


GAACGTAGCG 


T\ 7S *T> f" 1 A P"" A A PT^ 

AA 1 bAbAAb 1 


PPA APTTTPT 

bbAAb I 1 lbl 




v_. O V-O 1 J. X 1 1 


15240 


AGGAATTAAT 


GCCCCAGTTA 


GAG AG AG lbb 


bAAbAbAbAA 


bbbAbHbb 1 b 


OwU 1 wriwww 1 


i J JUU 


GATGAACGGA 


TCAGCAGTTG 


m/""» tv /"•Ti rnrriTi TV /"* 

TGACATTAAG 


rn/N* PrrPTT7\ 

TbAbbbbl 1 A 


HI i b 1 1 bbbrt. 


AAV^O 1 w A 1 1 


1 J JUU 


TCACATGACA 


CCCGAAGACG 


7\ Tv TV 7\ 

CCTTGAAAAA 


1 AAAb 1 AAbb 


AAlAbLLArtn 


TTCZTCIC A AHT 


15420 


CAAAGTGGAA 


GGCACACGTC 


CGTTAATCTT 


TGATbATGTb 


PTTPTTPP A A 

bl 1 bl bbbArV 


1 wAO i UVjUVyVJ 


1 JIOU 


ATTTGCAACG 


TACATGCACA 


TTGACTACGA 


mpiv Tv /^« /~» /~" TV Tv /"■» 

TGAAGbbAAb 


bbl lbbbbbl 


tq acta AAr:n 


15540 


AGCTCGTGGC 


TACATTCTTA 


TV V™* m TV TV TV TV TV 

AGTAAAAGGA 


/*"■/■*• y» TV 7\ TV 7\ A 

GGCAAGACAA 


1 baAbb 1 i 1 b 


1 blTAAl y bAA 


15600 


AATTTGATTC 


t* i* t\ ^ ta rn TV * 

AACAGATTAC 


my' tv m/"» A t\ tv rp m 

TGATCAAATT 


rp/^rp/^A A/"* A AT 1 

TGTbAAb AA 1 


TPPTPTP^Tf 
1 bb lbl bb 1 b 


biH Inl 1 


15660 


GTGGCTTTAG 


CAGAAGCAAA 


tv tv tv ta m m m 

AGAAGCATTT 


/~i /~» rp TV TV A A A 

CCTGAAGAAA 


Tb 1 1 1 bl lbb 


1111 bb 1 bAl 


1 ^7?0 
x. ^ / £• yj 


GTAAATTGGG 


TAACCmAGGA 


GCAGAAGCTA 


GCACGAGGb 1 


1 bb 1 1 b 1 AAA 


bAbbb 1 AAbb 




ATTTCCCAAG 


TAAATGCGAT 


TGCGCATTTA 


CAGGAAACGb 


A TP* A ATTAfT 

A 1 b AA 1 1 Ab 1 


p a a a d. a p n t p 

wAAAAA^-AX v> 


15840


TTAGCATTCT 


TATTTGAAGG 


TAAGCCgGTA 


TTGGTTTTAA 


/->/■"»/-» z-' Tv tv rp Tv C 

bCbbAA I Abb 


TTPBPTPIiCT 
1 1 b Ab 1 bAb x 


15900 


AAAAATTCAC 


GATTGAAGTA 


TCGTTTGAAA 


TV Tt TV /™t TV m T> /"* 

C AAACG AT T C 


APPA A A A CC^ T 

AbbAAAAbb 1 


ppapaTf^TCP 


15960 


CAACAGTTTG 


GGCTGATTTT 


m m tv m tv m t\ rrv 

■TTATCATGAT 


rn/-»TV f TV A A A rp rri 

TCAGAAAA1 1 


A r™'P , P ,, T 1 P^T , f^ r T ,r P 

Abbb lb 1 bl 1 


TPAfZf^PP.P.PP 


16020 


TGTCAAAAAC 


AAtTCGTTCT 


TTGGCTGAGA 


TV TV TV /""I/""* TV TV 

CCAAACGAAb 


PT APATTAfT 

b 1 AbAI 1 Abl 


paanaapaaT 


16080 


TGATAAGAAT 


GACAGAATCA 


GGTGTTTCTT 


rpT\Ar"^rpAAAAA 

TAAGTAAAAA 


bbbb 1 Al 1 1 A 


BPPPPP,TT AH 


16140


CGAAAGATTA 


TGCTAGGAAA 


CATCAATTGC 


m /■■> tv /"t/->rp t\ 7V/^/^ 

TGACGTAAGG 


AbbAAAAAbb 




16200 


GAAAAGTGAC 


TGGTAGCCTT 


TGGTCAACAA 


G AAAAG A 1 b A 


AAAA r PT2i7i& r P 

AAAA1 1AAA1 


P-P-PTCf^AAAT 


16260 


TTATGATGGT 


TGATATTTTG 


AAT C AAG AG b 


AbbAAAAbbA 


AbbAl i 1 lln 


ATTP.rTP.CTG 


16320 


ACAACGCTGG 


AGCTGGTGTC 


/-» /*-• /— T\ y* TV Tv T\ /"• 

GGGGACAAAb 


Tb 1 1 AA 1 1 Ab 


1 bAAbbbAAl 


npTGprrGTA 


16380 


TTTCGGCAGA 


AGATCCGAAT 


bibbbAAl lb 


H. 1 bbrirt 1 b A I 


1 b 1 1 bb 1 Vj l 


ATTGATTCTA 
r\ x x un x x w i *i 


16440 


CAGCbbAAGA 


A I Abb b Abb A 


LbAbALA i bA 


uLA 1 1 /AH 1 bM 


MA 1 AMI IrtlVj 


TATATTATGG 


16500 


TTCTTTTCAT 


/-* ti mm T\ pn rrt /-> z^ 1 t\ 

GAT TAT T b b A 


bbAAl bbAl A 


A A r PP ,r PA r P r Pr"r^ 

AA1 bl HI i bb 


AAbAArtv^ l r\ 


gc;tttagggg 

vj \j xxx nvj www 


16560 


AGCAATTTGA 


T\ TV /** TV fT> fit 

GGAAGGGATT 


TV rn/^/^r"T TV <T>/^/™" 

ATGGCTATGG 


PTTPTPTAPP 

b 1 1 b 1 b 1 Abb 


bb 111 b 1 Al b 


p,TAP,r;r;ATTA 


16620 


TCACTTTAGC GCCTGTCTTA GCCAATCTTT TAAGTCCGAT TGTCGTTCCT GTCTATGAAT 16680

TACTAGGGGC TGATCCAGCA ATGTTTGCCA CAACCTTATT GGCCAATGAT ATGGGCGGCT 16740

TTGCGTTAGC GCAACAATTA GCCAATGATC CGCAAGCTGG TTTATTTGCG GGGGCAATCT 16800

TAGGTGCAAT GATGGGACCA ACGTTAGTCT TTACAATTCC AGTAGCGCTT GGCATyATCC 16860

AAAAAGACGA CCAACAATTC TTAGCAACAG GtGTTTTgAG CGGGaTTATT ACaATTCCTT 16920

TTGGTTTATT


AGCAGGTGGT 


TT AAG AGG I G 


PPATPPP ATT 


PTPATTP ATT 


ATTCCTAAPT 


16980 


. . TV 7> rp rp TV 7\ rp 

WAATTCGAAT 


TATTATTGTT 


GCCGCT r i AA 


TTATT+-+- APP 


TTT ATPPTT A 


PPPPPAAAGG 
vj v\jv v<nnnv u 


17040


/-^ 7\ m /■* TV rn TV TV 

GCATGAT a AA 


AGGCTTCCAA 


TV rnrprT»T*'PPPTP 

ATI T 1 1 GG I G 


a arrrpT apt 


PATTPTPP,PT 


ATPTTTPP^Tv 


17100 


TAGTTGTGGG 


GGCGATTCAA 


/-« m -j\ m rn tv m rp tv z'*' 

CTATTATTAG 


G AA 1 J. AG AG 1 


P ATTPPTPPP 
Oft 1 1 \-,\~> 1 OUVj 


ATTPPAPPAP 


17160 


TTACTGAAGG 


AATTGAAGTC 


GTTGGCGGGA 


1 1 GL 111 AAG 


TTT A PPTPPP 
1 1 1 AGG 1 vjGG 


PPTTTPTPTT 
GL. 1 1 ILiljl 1 


noon 


TAGTAGCTGT 


GATTACGAAA 


GTCTTTAATA 


tv tv /"""P A rri rp /■** t\ rn 

AAGGA1 1 GAI 


p a a 5\rT arra 
GAAAG 1 AGuA 


A 7i APTTTT AP 
AAAG 1111 nu 


1 / lOU 


GAATGAATGA 


AGTAGCGGCT 


GCAGGAATGG 


fTt rp rp TV /"* 7\ fTirn 

TTGC 1 AGA1 i 


A Pr*T A AP A AT 
AGG I AAGAA i 


ATTPPP ATP V 


A ' J4u 


kCCAAATGTT 


AAAAGACATG 


GATAATCGAG 


GGAAAATTAl 


rn TV ATATPrfT 

1 AA1A1GGG 1 


TT TP P T P 4- T +• 
111 (jl— 1 vjL 1 1- 


i 7 a on 


CGGCTTCTTT 


CGTTTTAGGT 


GACCACCTTG 


GTTTTACTGC 


AGGGG i AGAA 


A A PA A A TP AT 

AAG AAA 1 GA 1 


i 7d fin 


TTTTCCAATG 


ATTGTTGGCA 


AATTAGTTGG 


AGGGATTACG 


GGGA1 1 i I 1G 


TPPPTATPTT 
1 vjvjL IrVl LI 1 


1 7 ^90 


TATGGCAAAT CGAATGCTAG 


GGAAACCCGA 


TV TV TV TV TV m 

AACAGAAGTG 


AAAIAA1 GAG 


PTPT A A AT AT 
GIG 1 AAAlril 


1 7 ^Rn 


P A P A P A A TT A 

GAGAGAA1 1A 


A ATPPTP A A A 


TGATTGAAAC 


GCTTGTTCGA 


f~> T\ tv ATTPTPA 

CAAA1 1G1GA 


PAPAA7AAATT 


i 7 fi4 n 


a a r T > (^crT' apt 
AA1 LLUj/iL 1 


A A AP A APTPP 


ATCCAAGTGG 


tv t\ m m 7v m ^ rn O z^ 1 

AATTATGTCC 


ATPTl A7\ TT AP 

A 1 G AAA 1 1 AG 


PPP A A ATPPA 


1 7700 

X / ' vj VJ 


PPTAAPAPAA 


P. AP P, A r r CI <~ T 


T AG AT AC AG G 


CAACCCAGGG 


/-i tv /-* TV rp rp rp iti fp 
GAGA 1 1 G 1 1 1 


APAPAQAZiPZi 


17760




TTAPAAPAAA 


GTCCGCGCCT 


TGGTTGCGGT 


TTAA1 GG1GA 


TPAAAPATAP 

I GAAAG A 1 AG 


X / O £. \J 


PAPPTTPPAP 


TP, GAP APT AG 


AATACGATGA 


tv m rn tv rn rn TV m 

AATTGATTAT 


7V rprpTV rp rp rp f TV 

ATT. AT 1 1GAG 


PAPAATTAPA 
GAGAA1 1/\LiM 


17880 


TATAPTPATT 


PAPPPAPP.TA 


AAATTTCTGC 


AACAGCCGGC 


/■» Tv 7V TV rp TA rp rP I 1 

GAAATTA1 1 1 


T A A TTPPP A A 

1AA1 1GGGAA 


1 7 Qd 0 


AAGTAGCAAA 


ATTCAATTTT 


CAGTGACGGG 


TGATGCCCGC 


Tl TGI G 1 AGG 


TP A PTT AT PP 
1 GAG 1 1 A 1 L»U 


18000 


AGCAGATTGG 


CAAAGCCAAT 


AATTATTTTA 


TCGGCAG1 CA 


/-• TV /-"• A ATAATT 

GAGAAlAAi 1 


TTPPP A A A AP 


18060 


CCTTTACTCA 


GCGTTATAAT 


AAATAAGAAC 


GAAAAAAGGG 


PTTPAPTPZiA 

Gil l bHrt 


TP AP ATTTPA 


18120 


GAATCCAAGT 


CAAAATCAAA 


TTTTTGATAT 


rprnrp tv t\ j\t\P7\ tv 

Tl 1AAAA.LAA 


PPAAAAAATA 


TTPPPPTPPT 

A A vjVrfVju A vjo A 


18180 


TGGTTTGAGT 


AACAAGCCCG 


ACCGCACAAG 


fpTTiPTl TV 7\ TV rprn 

TTACAAAA1 i 


GGGoAAAl 1 I 


TAPAPPAAPA 


18240 


TGGCTATCGT 


GTATTGCCTG 


TCAATCCAGT 


m m rn tv tv tv t\ rT*T 

TTTAAAAGG1 


GAAAGGG I GG 


TTPPPP A A AP 


18300 


GGTTTATGGG 


ACATTAACAG 


AGATTCCTGA 


TV TV 7\/""'7\ r P r P/" , 7\ T* 

AAAG AT TG A 1 


A TTPTTP APP 
Al 1G 1 luHtb 


TPTTTPPPPP 
ALIA ALVjVjIwVJ 


18360 


GAGTGAATTT 


TTGCCAGAAG 


TGGCCGCAGA 


71 t\ (~* TV T 1 Tr/"* n TV tv 

AACAI iGAAA 


ZiP TiP A APPPP 


PTPTPTTTTP 

vi vj 1 v^ 1 A A A \J 


18420 


GGCCCAACTA 


GGTTTGGAAA 


AC G AAG AG G C 


tv r^f^r' tv tv i^T'TP 
AGGGAAG 1 1 G 


TTATTA A ATP 
1 1 Al 1 \3 


AAfiPiTCGTCA 


18480 


AGTTATTATG 


GACCGTTGTA 


m rn tv tv ta tv m rn tv 

T T AAAAT T G A 


Al 1GGGAAAA 


ATPPP AP ATP 
A 1 ovaC non 1 u 


PTfJAATAAAC 


18540 


ACGTGTAATT 


TTATAAAGAA 


AGAGCGTCCT 


/"TV TV TV TV A 

G AAG GG AAAG 


G1G111G111 


TTTPAPPPAP 
AAA onvyLLnw 


18600 


ATAGAGTATT ATTGTAATGA TGGTTTTAGA AGGAGCGAAC GGAAAACGCA AGAAACCGGC 18660

TCACTCTTCT TCTAATTCTT TAAGrraTAG TTAATGTACG CTTATTAACT AAACGAGAAT 18720

CTArGTATAA GGAGTTTTTT CTTTGGCTAA AAAAATTAAC AATGAGTACA ATGATGCCTC 18780

TATCCAAGTC CTAGAAGGAT TGGAAGCAGT GAGGAAAAGA CCTGGTATGT ATATCGGTTC 18840

AACCGATAGC CGTGGACTCC ATCACTTAGT CTATGAAATT GTCGATAACG CAGTCGATGA 18900

AGCGTTATCT GGCTACGGGA ATGAAATCAA CGTAACTATC CAAAAAGATA ATAGTATTTG 18960

TGTTGCAC-AC TCTGGCCGCG GAATGCCAAC AGGGATGCAT GCTTCTGGTA TCCCAACAGT 19020 

AGAAGTAATT TTTACTGTCT TACATGCGGG TGGTAAGTTT GGCCAAGGTG GCTATAAAAC 19080 

TTCAGGCGGA CTCCATGGGG TAGGTGCCAG TGTTGTTAAT GCCTTATCTA AGTGGTTAGA 19140

AGTTCATATT GTTCGTGATG GCGTAGAATA CATGGAACGA TTTGAGGACG GCGGTAAACC 19200 

AGTTGGTACC TTAAAAAAAA TCGGCAAAAC AAAAAAACGC AACGGTACCT CCGTTACTTT 19260

TTTACCTGAC GACACCATTT TCTCTACGAC AAATTTTTCT TACGAAATTT TAGCAGAACG 19320 

TTTGAGAGAA TCAGCTTTCT TATTAAAAGG CGTCAAAATT ACTTTAACAG ATGAACGTGG 19380 

GGAAGAACCC AAAGAAGAAG TCTTCCACTA CGAAGAAGGT ATTAAAGAAT TTGTTGCCTA 19440 

TCTAAATGAA GAAAAAGATA CATTAACACC CGTTGTTTAT TTTTCAGGAG CCAAAGAAGG 19500 

TATTGAAGTT GAATTAGCTT ACCAATACAA CGATGGCTAT TCTGAAAATG TTTTATCTTT 19560 

CGTTAACAAT GTCCGCACAA AAGATGGCGG TACCCACGAA gTCGGTATGA AAACGTCAAT 19620 

GACGAAGGCG TACAATGAAT ATGCACGGAA AGTGGGACTA TTAAAAGAAA AAGATAAAAA 19680 

TTTAGAAGGT AGCGACTTCC GTGAAGGCTT AGCAGCCGTC CTTTCTATTC GAGTACCAGA 19740 

AAACTTACTA CAGTTTGAGG GCCAAACCAA AGGAAAATTA GGTACACCTT TAGCAAGAAC 19800 

CGTTGTCGAT AACGTCGTAG GCGAACAAAT GGGCTTCTAT TTACAAGAAA ACAGTGAAAT 19860

GAGTCAATCA TTAATTCGGA AAGCGATTAA AGCCCGTGAA GCCCGTGAAG CGGCTCGTAA 19920 

AGCCCGTGAA GAAAGCCGGA ACGtAAAAAA CGGAAAAAAG GTGAATCGTT ACTTTCTGGA 19980 

AAATTAACAC CGGCTCAATC CAGAAATCCT AAGAAAAATG AACTATATCT TGTCGAAGGA 20040

GACTCGGCCG GTGGCTCAGC AAAACAAGGG CGCGACCGGA AATTCCAAGC CATTTTACCG 20100 

CTACGAGGAA AAGTCATTAA TACGGAAAAA GCGAAAATGC AaGACATCTT GAAAAACGAa 20160 

GAAATCAATA CTATGATCTA TACGATTGGC GCAGGCGTTG GTCCTGAATT TTCAATTGAA 20220 

GATTGTAATT ATGATAAAAT CATCATCATG ACCGATGCGG ATACCGATGG TGCCCATATT 20280 

CAAGTGTTGC TATTAACGTT CTTCTATCGT TATATGAAAC CATTAATTGA AGCCGGCAAA 20340

GTCTACATTG CCTTACCACC ACTTTATAAA GTCTCTAAAG GAACTGGAAA AAAATCAGTC 20400 

ATTGAGTATG CTTGGACCGA TGGTGAATTG GCAGAAGTGA TTGATAAAGT TGGTAAAGGC 20460

TATATGCTCC AACGCTATAA AGGTCTGGGG GAAATGAATG CAGAACAACT TTGGGAAACA 20520 

ACGATGGATC CTGAAACTAG AACCTTAATT CGTGTTCGGA TTGATGATGC AGCTCAAGCA 20580 

GAACGCCGTG TGACCACATT AATGGGCGaC AAAGTCgAAC CTCGTCGAAA ATGGATTGAA 20640 

CAACATGTTC AATTTACGTT GGAAGAAGAT GGCAGTATTT TAGATCGTTC AGAAGAAGAC 20700 

ACTTCTGCAC CAACAGGAGA AAGTCTTTTA GATGCGGAAA AAACAAAAGA AGCAGAACAA 20760 

ACCGATGACA CTGAAATCAG TTTGTTTGAC ATAGAGTAGG GAGTGACAAT TTTGGAAAAA 20820 

CGCCAAGAAG TTCAAGAATT AACCCTTGAA GAAGTAATGG GCGATCGTTT TGGAAGATAT 20880 
TCCAAATACA TCATTCAAGA ACGGGCCCTA CCAGATATTC GAGATGGGCT AAAACCAGTA 20940

CAACGCCGCA TTCTCTTTTC AATGAATAAA GACGGCAATA CCTTTGATAA AGGCTTTCGT 21000 

AAATCAGCAA AATCTGTCGG AAACATTATG GGGAATTATC ATCCCCATGG CGACAGCAGT 21060 

ATTTATGAAG CGATGGTGCG TCTAAGTCAA GACTGGAAAT TACGGGAAGT ACTAATTGAA 21120 

ATGCAC 21126 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3222 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

nAGAGAAAnC CGGTAGCTTT TTACGGTTTT CTCTTTTTTG ACTTTCAAAG TCAAATCAGG 60 

TACGACTTCT TTTTAAACAG ATAAGaATGT GGTATCATTG TAAAGGACTA TATAATAAAG 120 

GAGATGGACG TTTTTTGACA GATAATCAGT CAAAATCTTT GGAATTAATA CTTACTGATT 180 

CTGATGATGC ACACATGCTT TTAGGAACAC ATGATAAACA TATCAAATTT TTAGAAGAGA 240 

ATACTCACGT AACGATTAAT AGTCGTGGCG AAGTTATTCA GTTAATTGGC GAATCATCAG 300 

AGGTAGAGTT AGTCGCTTCC GTGTTAAGAG CGCTACAAAC ATTAATTCAA CGAGGCATCA 360 

AAGTGCATAC GCCAGATGTC GTTTCTGCTT TAAAAATGGC CAAAGCTGGA AATTTAGAGG 420

CGTTTATTGC CATGTATGAA GAAGAAATTA TGAAGGACCA TCATGGTCGG GCCATCCGAA 480

TTAAGAATGT TGGTCAAAAA AAATATATCG ATGCAGTTAA AACTCATGAC GTCATTTTCG 540

GGGTGGGACC TGCGGGGACA GGAAAAACCT TTTTAGCCGT GGTGATGGCA GTAGCAGCTT 600 

TAAAAAAAGG CGAAGTACAA AAAATTATTT TAACTCGTCC AGCGGTGGAA GCTGGCGAAA 660 

GCTTGGGCTT TTTACCAGGT GACTTAAAAG AAAAAGTCGA TCCTTATTTA CGTCCAGTTT 720 

ACGATGCCTT GTATCAAATT TTTGGTATGG ATCATACAAA TCGTTTAATG GAACGCGGCG 780

TTATTGAAAT TGCCCCGCTT GCTTATATGC GTGGTCGTAC ATTGGATGAT GCTTTTGTCA 840

TTCTAGATGA AGCACAAAAT ACCACGGTTG CCCAAATGAA AATGTTTTTA ACTCGTTTAG 900 

GGTTTAGTTC AAAAATGATT GTTAATGGAG ATACTAGTCA GATTGACTTG CCACGTGGTG 960 

TAACAAGTGG TTTAGTTAAC GCCGAGCGTA CGTTAAAAGA CATTGAAAAA ATTGCCTTTG 1020 

TAAATTTTGA AGCCAGTGAC GTTGTGCGTC ATCCTGTTGT AGCACAAATT ATTCAAGCCT 1080 

ATGAAAAAGA ACAACAAAAG CATTCCTAAG CAGTTAGGGA AGGAGTGAAA GTACCTTTTG 1140

AAAAAAGTAC TTACAAAGTT AGGGGATCGT TTAGGTAAAG CCTATTTACC TATTTTATTA 1200 

GCCGTTTTTT CTCTTCTTTT ATTTATGATT ATGTTTGGCA GCGTCCATCA GAAAAGAGTA 1260 

GAAATTAAAG AAGGTCAACT GGCCGAGAAA ACCAttCGTG CAAATAAAAA TATCGAGAAT 1320 

ACCTACGAAA CAGAACAAAG AAAAAAATTG GCAGCAGAAG CTGTCACACC CGAATATATT 1380 
TATCAAGAAG ATACGGCATC TGTCCAACAT AATCGTATTG ATAAATTATT TAAATTGATT 1440

GATTCTGCGA ATGAAAAAGT AGACAAAGAG TACAGCAACA AGCAAGCGAA GGCCAAAAAG 1500

GAAGAAACGA TTCCTGCGCC GACCGTGGAA GAACGAGTAG CTAGCTTAAA ATCaTTGTTT 1560

GAATCTTTAC CGCAAGATGA AGTTACGTTT TATCaAAGTT TTCCTAATGT TTTTTATCaA 1620

ACAATCTTTA CATTAACCTC TGAGCAATTG GATAAGGTTC GTTCAGAAAG TTTGATGTTG 1680

GTGGATGACG CGATGCAAAA TCACGTTCGA GAATCTGATT TGGATAAAAT TCGCCAAGAA 1740

GCTAACGGTA AAATTCAATA CTTAGATATT ACGAGTACCA TGCAACAAGT CATTCGTTAC 1800

ATTGTTAACC AAGGGATTGT GGTAAACGAT ATTGCCAATG AAAAACGAAC AGAAGAgctA 1860

CGCCAAAAAG CGATGAATGC TGTTCAACCC GCAATGATTT ATCAAGGCGA AATTATTGTT 1920

CGTGAAGGTA CTCAAATTGA TGCAAAAGCG GTCGAAAAAC TTGAGCTACT AGGAATGACT 1980

AGTCAAAATA CCTCTATTTT TCCGATGGTG GCATTAGCAT TAGCTATTTT ACTTCAAGTT 2040

GAAGTGTTAA TTTTCTTTAC AAAACAAGTG ACTGAGCCTT CGCGACAACG CTCGTTTATT 2100

ATCTTTTACA CGGGTGCTAT GCTAATCAGT GTTATTTTAA TGAAATTCTT CCAGATTTTC 2160

CAAACAGAGC AGTTGATGTA TATTCCCTTG TTCTATCCGG CAGCCTTTGC GCCGCTGATT 2220

CTAAATCATT TTGTAAATCG TCGTTCAGGG ATAATCGCTG CTATTTTCCA AGTCGTTTTT 2280

GCGTTATTTA TCTTTTATAA TTCTATCGGC ACAAACTCGT TGACGGTTAT TCTGATTATG 2340

TACTTATTCT CAGGATTTCT AGCAACAGTT GTTAAACGGA AACGGATGAG TGAGCAAGTT 2400

TTCCCAGCTT TAATGTGGGT AGTGGTCTTT CCTGTTTTCA TGGCGGTTGT CTTAATGATT 2460

TATCAAGGGA TGAGTTTAAC AGATGGTAAA ACGTGGACAG CTTTAATTTG TGCAAGTGCA 2520

GGAACGGTAC TTTCATTTTT AGCAACAATG GGCTTGCATC CATATATCGA ATTATTAGTG 2580

ACCGATGATA GTATGATTGT CTTGAATGAA TTAAGTAATC CGAACCATCC GTTGTTAAAA 2640

CAACTGTTAG AAGAAGCGCC AGGTACCTAT CATCATAGCA TGATGGTGGC TAGTTTGAgC 2700

GCTAACGCTG TTGCTGAGAT TGGCGGGCGG TCGCTATTAA CACGAGTTGC TTGTTATTAC 2760

CATGACATTG GTAAAATTAA ACACGCCAAT TTCTTTGTGG AAAACTTACC TGCTGGTGCA 2820

GAAAATCCAC ATAACTTTTT ATTACCAGAA GATAGTAAAC AAATTATTTT TGGCCACGTA 2880

ATTGATGGCG CCAAAATTTT AGAAGAGTAC AATATGCCGC AAATGGTGAT TGACATTTGT 2940

CGTCAACATC ATGGCACAAC ATTGATGAAA TTTTTCTATG TGAAAGCAAA AGAACGCAAT 3000

CCTGaAATTA AAGAATCCGA CTTCCGTTAT CCTGGaCCAC GTCCACAAAC aCGAGaAGCa 3060

GGGATTGTGa GTATTGcTGA TACTTGTGAG GCTGCTGTGC GGGCGATGGA TCATCCCACC 3120

AACGAAAAAA TTnCAAGCCT TTGTGCATAA TGTGATTCAA GACAGnATTT CAGATGGCCA 3180

ATTAGATGAA TGTGGGTTAA CGATGAAAGA GATTCGCATC AT 3222



(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7812 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



GGACATGATT ACGTTAGACT TAAACTTACC TAGTATGGAT GGGATGGCTG TTTGTCGTGA 60


AATTCGCAAA 


GTTTCAGCCA 


ATGTTCCCAT 


TATTATGTTG 


ACCGCTCGTG 



aTTCTGAAAb 


120


TGACCAAGTA ATTGGCCTTG AAATGGGTGC GGATGATTAT GTGACAAAAC CATTTAGTCC 180

ATTAACTTTG ATTGCACGGA TGAAAGCTCT ACATCGTCGT GCCGAGGTGG CAGAGGCTGC 240

GCATGATACG TCTGAGAATA CAGATGAAAC ATTTGACGTG ATTACGGATC ATTTTAAAAT 300

GAATACGAAA ACACGAGAAA CATATTTAGA TAATCAATTA ATTGAAGGCT TAACACCTAA 360

AGAATTTGAT TTGCTATACA CTTTAGcAAA AAAACCACGC CAAGTCTTTT CACGcGAACA 420

ACTACTAGAA TTGGTTTGGG ACTATCAATA CTTTGGCGAT GAACGGACAG TTGATGCCCA 480

TATCAAAAAA TTACGACAAA AAATTGAAAA AGTGGGTCCT CAAGTCATCC AAACGGTTTG 540

GGGCGTAGGG TATAAATTCG ACGATTCAGG TGTTGCTTAA TGAAGTATTT GTATCAACAA 600

TTACTTGCTT TTATAGGTGT GATCGCTCTG ATTATTCTCA TTGTCGGAAC GTCTTTCACA 660

CAATTGACCA AGCGCACAAT GCAAGAAAAT AACTATGAAC AGCTGTATGG TTATGCAGAA 720

TCGGCTTTAG AAACACGTGA CTTTTTTATT AATGTCGCAG GCGTTTCAGA TCGAGATGTT 780


TTATCCTATT 


CATTtCAGTT 


GACTGAGCGC 


GTACTCCAGA 


AGCAAGATGT 


GCAATTTblb 


840


TTTATTAACA 


AGGACCGCGA 


AGTGCAGTAC 


CCACCAGTTG 


ATAGCACAAA 


AAAAI lAbAb 


900


TTTTCCTTAA 


TCGATA^ >VA 


TTGGGATCAA 


ATCATGAAAG 


GCAATCGTGT 


T a T G C AAb Ab 


960


AAAACATTGA 


TATTTACGGG 


GCCCGAAATA 


CTTCGTCCTA 


TGTAATGTTA 


bubo 1 b i Hi b 


1020


CATCAAATCA 


GTCTTCTGAT 


AAAAAAGTGA 


TTATTGGTTC 


ACTAGTTATT 


AbAbAAbb 1 b 


1080


CCAAAAACGT 


TGACCGAAGT 


GTTCAGTCCG 


TGACACAAAA 


^> m m ^* tv m rr* TV T\ TV 

CTTGATTAAA 


bbbl I 1AJ. I I 


1140


TTTCTGGTGT 


GATTGCCTTG 


TTACTAAGCT 


ATCTATTTGC 


GACTTTCCAA 


b 1 bAAAbbHA 


1200


TAAATCGGAT GCGTAAAGCC ACCAAAGAAA TTACTAGTGG AAATTTTGAT ATTCAATTAC 1260

CGGTTCATGA CAAAGATGAA TTTGATGACC TAGCAGAAGA TTTTAATAAA ATGGCCGCGT 1320

CTTTAAAAGA ATCACAAGAA GAAATCAATC GGCAAGAAGA GCGCCGTCGT CAATTTATGG 1380

CGGATGCCTC TCATGAAATG CGGACGCCTT TAACAACTAT TAATGGCTTG TTAGAGGGAT 1440

TGCAGTATAA TGCTATTCCT GAAAATCAAA AAGAGAATGC GATTAAACTG ATGCAAAATG 1500

AGACCGCTCG CTTGATTCGC TTAGTCAATG AAAATCTTGA TTATGAAAAA ATTCGGACCA 1560

ACCAGATTCA AATTGTTGTG AAGAAGTTCA ACGGAACAGA AGCTTTAGAA AATATTGTGA 1620

CTCAACTAAC GGCCAAAGCT GAAGCCGCCG GTAATCAATT GTATTTAGAC ACCACCGAAC 1680

CAATTGATGT GTATGCCGAT TATGATCGTT TCGTTCAAGT GGTTGTCAAT ATTGTTCAAA 1740
ATGCGATCCA ATTTACTGAA AATGGCGAAA TTCATATTGC TTTAGAAAAA GGCTATTTAG 1800

AAACCATTGT CCGTATTTCT GATACAGGAA TCGGCATGAC GGAAGAGCAA ATTCTGAATA 1860 

TCTGGGATCG TTATTACAAA GTAGATCCAT CCCGTAAAAA TACGAAATAT GGAGAATCAG 1920 

GATTAGGCTT ACCAATCGTT CAACAATTGG TCCGTTTGCA TAAAGGAAAA ATTAATGTTG 1980 

AGAGTGAACT TGGTAAAGGC ACCACGTTTA TTATTTCATT TCCTGATGTT GAAATTACTG 2040 

AAAACTAAAG AAAACGTCTG GTCACAATCA GTGCCAGACG TTTTCTTTAG TTTAATAACG 2100 

CATGTAGAGA CAGGGGGTGC ACATAGGGAC TATCAATAAT TTGTGGATCT TTTGTCAGAA 2160 

ATAGCAAGCG CTTGTTTCTT TTTTGAACTG CTATTTTTAA TTGAGGTAAA ATTTCTTGAC 2220 

GTTGTCCAAT CGTTAATGAA GAAAAGATGT CATCAATCAC TAACGTACTT GTTTTTGAAA 2280 

GCAAAAGTTG TATTATGCGC ATTTTAATTT TTTCAAAAGT CGTAAAAGAA GTAAAGGAAC 2340

GACCAAGTAA GGCAGGTTCT AATTGGAACA CCTGGAAAGC TTCGGCTAGA ACTACTTTTC 2400 

GATCCCGTTC TTTGATCGTA CTAGCGATAA ATAAATTTTC TTCTAATGGC AAGTAGGGCA 2460

AAAAGGTTTC ACTAGAAGAA AGCCTAGTAA TGGTTAATTG GTTTGTTTGG AAAAAATCGG 2520 

CCAATGCGAG TTCTTGTTCT GGAGCAGAAT AAATGATTCC ATAAACTTCT CGCAAAGctA 2580 

ACGCGTCGAA TGAGGATTTT TTTTCGGTTA TTTTTTTGAA TGATGGTGTG ACATTCATAG 2640 

AAAATCCCTC CCTTATTTCT TATGTGTACG TAAATAAATC AACGTGAAAG GAGTCCCGCA 2700 

AACAAGAGGA ATGCCAAGCA AGATTAAGCA GTTAACAAAG CAATTACGGA TTGTATAAAG 2760

TGACTGCATA CTTTTAAAAT CAAAAGAAAA CGCTTGGATC GTTTGAATCC CTAAATGCTG 2820 

GATATTTGTA ATCGTGACAT CCTGGTCGGT ACTTTTAGCT AAAAATGCTG GAGTTTCCAA 2880 

ATGAGACCAC TGTAAAATAG CATCACGAGC AGCCCAAAGA ACAGTATTAT ACGTTTCTTG 2940

AAACATTAAA GTGAAAAGTA AAAAACTGAA AATTCCTAAT AGAGCCGGTA AACATAACTC 3000 

TAGTAAGAGC TGTTTGGTGA CATACATATT TGAAAAGTGC ATGTGATACC ATGTATGTAT 3060 

TTCTTCCTTT TTTATGCGTA ATGTCACATA CATCAACAGA AGAAAAGTAA CACCAAAGAA 3120 

AATGACACTA CCTAGATAGA GGTGCCAATA GTATTGGATC AGTTGGTAAT GGTAACTAGT 3180 

TTGTTTTTCA ATATCGACAA AGTCTGGCAT CGTTCGTAAA AACTCTTTTT CAAAATCAAT 3240

TAAGTTAAAC AGCCCAGTCA ATAAGAAAGA GAACAGGAAG AAGAAGAGTG CGGTGATTAA 3300 

CGTGATTTTT TTATGATAAA AACTACTAGC TGTAGCGTAA CGGACACTTC TCATCAACTT 3360 

TCACCTCTCT AAACTATCTT TAGTATAGTG aAATAATATG AATTAwTTAk GAaAAAGtAG 3420

CAAAAaGCTT GAGCAAATkG CTCAAGCTTT TTGAAGATTt ATTtACCCGT TTTTtGATAG 3480 

CCTtGATAAC TGTCAGtGTG ATATtCATCA ACGGTtGATT TATtATkGTT TTTACTGTAA 3540

ACACTCGTTG ATTTTnCACC nTTTcTTTTT CAATTGCTTC CAGTTGTTGC AATTGGTTCT 3600

TGTAATCATA GTCTTCTGGG TTAACTGGTT TTAAGCCGCT ATTGGTATAG AAACGAAGCA 3660 

AGTCACCGTT AGTAATTTGG TCAGACGTTT CTAATTGTTT CGTCGCTTTG GCTTTTAAAT 3720
t AbLbAL 1 1G


TTTPTTAAPT 

1 1 1 rtrt W A 


TCTTCGGTTG 


GCTCTGTAAT 


TAAAGTGCCT 1 


GTTTTAGTAT 


3780 


prrnn j\ 7\ TPPT 


bpaapptaaa 

AortrtL-v* X rtrtrt 


AT AG TAT ATT 


TAGGTGTTAC 


AACATTGCCG 


TTGCGGAAAG 


3840


tMLbAi 1 1 G 


ZiTTPTPiTTPrT 


TTAGAGAATA 


AATCTTGGCC 


TAACTGAATG 


TAGTTTTTCG 


3900 


TATGGACAGG 


n nrrB Z1P.TP.T 


AGAAGGGTTG 


GTAAGGCATC 


AACTTGGCCG 


CCATACGTAT 


3960 


GGTTGAGAGG 


VjCCA 1 1 1 1^1 


TGACCAGGAA 


CGTGAATCAT 


ATAaGGAACA 


CGTTGCATAT 


4020 


TGGCG1 1 G 1C 


7a n n 7ATPP.TTP 
AAAAiCLjl ±L- 


PAATPTGCTT 


TGGTTTTACC 


TACTAACTCT 


GCTAAGTTTT 


4080 


GGTTTC1 IGA 




PPATAP.TGAT 

v^v^rtX-rtO 1 un x 


CGCCATAAAG 


AACAATGACT 


GAGTTTTCAT 


4140 


AG AG ACL 1 GA 




TAGTTAAAGA 

1 rtU A X rtrtrtOrt 


ATTCTTCGAC 


CGCTTTATCT 


AAATAATTCG 


4200 


CTG1 IGCAAA 


PTH TAPPTATT A 


rt 1 101 1 1 V^O A- 


CAGAGGTTTT 


TGCAATTGGG 


AATCCTGCTT 


4260 


/—i 7\ m /■* /"» rp rp rp /— , rn 

CATCGTTTG1 


AAA 1 1 G AG AA 


T D. 21 A T A A T 
1 AALJv3 A 1 Art 1 


HATTGGACAC 


AGCAATAAAT 


TTAGAATAGA 


4320 


T\ r** rn rp /■"» rp rp /"» 

ACGGTTGTTG 


1 AAA1G 1 1 G 1 


TXUnTATTf^AA 
AAA1AI 1 VJ AA 


PQGATTGATT 

^UVJrtX ± VJrt 1 J. 


AAAGAATGGT 


TTATCGTGTA 


4380 


ACCCATATTG 


GAA 1 GAG I 1 A 


1 Ly. 1 Lj AA 1 1 AA 


PATPATAATA 

^rt 1 wrt 1 rtrt X rt 


ACTAGCATCA 


AAGAAATAGT 


4440 


/-ill mlv P/^/Tn^ 7\ 

CATAGCCTAA 


AGGI I 1A1AG 


Lj 1 1 1 Un 1 HL 


fiATTPPAGAA 


GTTACCGGCA 


TTTCCGTGGA 


4500 


AAGCAGCGCT 


AG 1 G 1 AAOC 1 




TTAAAATATP 


GGGTGCTGCT 


TCAAACGTAT 


4560 


TTTTGCCAGG 


AAG 1 1 G 1 G 1 A 


AAortVj 1 OrtrtL* 


PTTGGTCTAA 


ACCAAATAGT 


GAATTTTCAA 


4620 


/imn T\ r"* rn (Tt rp 

GTAACGTTTC 


GGGA 1 LAL- 1 i 


PTTTTTPPTT 


flAPPAAPTTG 


GTGGAAGAAG 


TTATCAAAGC 


4680 


TAAAGGTACT 


Tl I AG 1 Al iG 


1 AAAVj AC 1 A 1 


TP,ATAAATP,H 

X Ort X rtrtrt 1 VJU 


TGTGACTTCG 


TGTTCTACGC 


4740 


CATTTTCAIG 


rpfri rn rp 71 Tv T* T* T* 

111 1 AA1 1 ILj 


T7121TT7AAPTA 
1 AA 1 1 ArtL. 1 rt 


AAAATTPiTTG 

rtrtrtrt X X VJ X X \J 


GAAACTTTCT 


AAATGGATAT 


4800 


AAATTACAI I 


mmrp/^^/^rprnrprp 
111 LjGO 1 1 1 J. 


PPZiHT APP A A 


APTTAPTATC 


ATCGGGTGCC 


GCATATTGTT 


4860 


GT 1 1 CAUG 1 A 


A 1 L, 1 x LnrtL 1 


TP, TTTP AT AT 

lull! ^rt 1 rt 1 


CGTTAGCACT 


GGCTTCGGCA 


CGGACTTGGT 


4920 


T 1 G i 1 I GG 1 A 


A P T P P T 21 & T P. 
Au 1 L« I AA 1 0 


PPATPflTACA 


CGGTAAAGGC 


ATTTAAGCCT 


AAGAATTTCA 


4980 


GGA I A 1 AG 1 G 


AL-Li AL3AL3 AA 1 


TPAPP.TPPTA 


ATAATTGGGG 


ACGTTCTGTT 


TCAGCCATAA 


5040 


AT AG A I I GAA 


1 AA0AAAAA0 


APAAPAHAAA 
nL rtrt v> rto rtrtrt 


GGAGTGTGAC 


AGAGACTGCT 


ACACGTGCGC 


5100 


GGAG 1 GGAGG 


GGGA lLlbl 1 


TT H ZiTTTTTT 
l x rtrt 1 x 1 1 x 1 


TCGTTAATAA 


TAATACGCCA 


ATAATAATGA 


5160 


AG I G 1 AAG AA 


7A T 7A 7A 7A TP B PP. 

Al AAA1 onLu 


TPATAAP,HTP 

1 ^rt A rtrt VJ U 1 ^ 


TAAAGAGACG 


TAAGGCACTT 


TCGCCAAGAC 


5220 


/-irpT\r~>/*>rpT\/-< 

L.LjL- 1 AtaL. 1 AL. 


111 bLLnuLn 


PPAAPiPATPG 

L^^rtrtO^rtX 


TATTAACTGT 

jl A* x x nov x w x 


GATGAAGTCA 


GTAAATTCTC 


5280 


bH 1 AH 1 AL, AC 


cT^r'Tk tttp 

LiACAl x luno 


AAPiAPiPAftTA 


AACTCATCAA 


GAAGTAAATA 


ATCATCATTG 


5340 


i Art 1 A 1 AALjA 


AL)G 1 1 1 1 1 1 A 


PP. P.PP. A AP AT 


APAAAGCGAT 


GGCTAATAAA 


AAGACCGTTG 


5400 


TGGCAATTGG 


ATTAATTAAG 


AGTATGAAGT 


ACTCGTTAAT 


ATTTTCCAAA 


LG 1AAA1 GGA 


fin 


AATCAACGGC GTACGCAAAG AGGTTTTTTA GCCACAGTAA GATGGCGATT AGCGAGAATA 5520

ACCCCAGTCG CGTGTTCAAA ATATTTGTTT TAAATAATTT TTTCAAAATA AACGTCCTTT 5580

CTAAACGATG ATTCTATAAA AGTAAAATTA GGTAAATTTC GTATTTTACA ATATCGCTTT 5640

AATTTTACTC TTTCACAAAT TTTATGTCAA TTCTTACCGA AGTGAGGAAT GGTGATGTTA 5700

/UibnAn^i^ 1 1


aptatttppt 

AL X ILL I 


TZiZi APTTTTT 
I rvrvrtO i J. x x x 


AGTTCGTTAA 


ATGACGTGAA 1 


GCTTGAATCC 


5760 


tttaaapapa 

I 1 lAAAVjALrt 


PATPPPPPTT 


TTTTTPPTAT 

11111 \J\j X n x 


ACT AG AT AG G 


ACTTTGAAAA 


GTAGGTGTAC 


5820 


A P A a TP A A AP 


tt p a a p t a a p 


TZi AAAAAPPA 


GAACATAAAT 


TTAAAAAAGG 


ATATCCTTTA 


5880 


AT ILAonHrtb 


AAuAl 1 IIjLA 


APAAPTPPPA 


GCGCCGTTGC 


CAACAGATTG 


GCTTACTTTG 


5940 


ATlbAlAbl A 


t\r\K3 \J L LA 0 L Vj 


TT T A PPPP A A 


GGATATTTAG 


GCGAACAAAA 


TAAAGGAATT 


6000 


m ffl ^ ni /"* z" 1 

GGTTGGTlbb 


lul LA i bbLn 


TPPAPP A ATT 


AATPAATCTT 


TTTTTCAACA 


ACTCTTTGAA 


6060 


7* m* TV Z^" 1 rp /~i rn /"■■ 

ATAAGlbblb 


a p a zi rrr<7 a p 


PTPTTTTPA A 


AAflGATTCGC 


TAACAACCGC 


GTATCGATTA 


6120 


TTTAATGGAb 






PTAATPATPG 


ATCGCTATGC 


AGATTACGCC 


6180 


GTTTTTTCAT 


bb 1 A 1 AA 1 bA 


7A TiPPPTTT AT 
HnvvLL 1 1 Inl 


PAPA AAAAAPi 


PPGAACTTTT 


GACGGCGTTT 


6240 


CGCACAGTTT 


Abbb 1 bAb A 1 


T ATTPPTPPP 
i Al 1 bu i VjUU 


T APHAA A AP^A 


TTPGCTTTTC 

X X \J X X X X 


CACAAAAGAT 


6300 


TTACCAGAAT 


L Ab Ab 11111 


P T A m PP PP Ti A 
b 1 A 1 bbbb AA 


PA APPAPPPP 


APPPATTACT 


TGTCACTGAA 


6360 


AATGGCGTTC 


TV Tv frv rn rp /""• rp tv /"> 

AA ill bb 1 AG 


L1A1 1 1AAAC 


P A APPATT A A 


TPAPAGGCAT 


CTTTTTAGAT 


6420 


CAAAAAGAAG 


TT. CbTbbl GG 


ATTTiPTAPBT 
All Ab 1 Ab A 1 


PPTTTTPPAP 


TAPGTAAAAC 


AGTCTTAAAT 


6480 


ATGTTTAGTT 


ATACAGGTGG 


fTi fjy fT* fT> rp z" 1 TV f*^ T 1 7\ 

11111 b Ab 1 A 


PPTPPPPPT A 
Ljb 1 IjCv^^aL' 1 


TPPPPPPTGC 


CGTGGCCACC 


6540 


ACTAGTGTGG 


7\mmrp7\/*" , /~'rpT\ TV 

ATT TAG G TAA 


APP A APPTTTA 

AbbAAbb 1 1 A 


PPP AAA APP A 


PTPAGCAATT 


TGAAGTCAAT 


6600 


CATTTAAATC 


TTGCCGGGGA 


A A 7v -v- Ti rp A rprp 

AAAbr 1 1 Al 1 


PTTATPPATP 
ul IrllobHlu 


TTTTnACTAC 

1111 y rt^* x nv^ 


TTTAAATATc 


6660 


TTCGCGTAAA 


GGCTTGmGl 1 


A TP AP A TP AT 

A 1 b Ab A 1 bA 1 


PATTTTPPAT 


PPPPCGAnTT 


TTGCTCGCAA 


6720 


*"■» TV TV TV TV TV 7\ 7\ 7\ 

CAAAAAGAAA 


rp rn rp rp rn rp r^fC* 

Gl 11111 LLb 


TTPPT A A A A A 
1 IbL I AAAAA 


TTATPPPPAP 


PTAGTGAAAG 


ATTCTATTGA 


6780 


TATT11 AAbb 


P7\mn A PPP 7A TV 

GA 1 AAbbb AA 


PPPTT ATTP.P 

bbbl 1A1 lU\y 


PTPAAPPAaT 


GCGGCAAATT 


TATCCTTAGC 


6840 


/— * ^\ Ti TV fTl TV 7\ TV 

CAAATATCAA 


AAAA1 bb 1 bA 


1 1 Ab 1 bLL 1 1 


PP A A PAP AAA 


AATGTCCGTT 


ACAAGATTAC 


6900 


GGACACGTAT 


bAAl lALLbb 


P AP APTTTPA 
bnbnb 111 V^,rt 


APTPAATPPT 


AATTTTCCAG 


AAGGTAATTA 


6960 


CTTAAAAb 1 b 


G1G1 1 lAltb 


A A ATTP A A A A 


AT A ATTTTTT 

n X r\r\ 1111X1 


ACACCAAGCA 


AAAGTCCCAG 


7020 


CGCCGC 1 AAi 


bbblnl Ibbb 


APTTTTPPTT 


PPTnTTTATT 

vjUilll 1 1 X 1 


TTATAGTAAC 


TTTTGGGCTT 


7080 


TTTGAb lb 1 1 


1 bb AAAA 1 b C 


APTTPPPPT A 
Avj i 1 LuUu 1 A 


PTTTPPTA AT 
\j x 1 1 1 nn 1 


AAGTCTAGGC 


CGCCTTTTTT 


7140 


/"» T\ TV 7\ 7\ rp TV (~*ff~' 

CAAAA I Abbb 


/T , /" , pp r PAP r r tr r 
bbbbb 1 AC 1 1 


PPTPTP ATTT 


PPTPPPAPPA 


CCAGAGGGCA 


CAGAGAAACC 


7200 


LAA 1 1 b i ol«H 


P7Jl ^^^^ PT ^^ TP'T , 
on llUlllul 


PT A APTPTTP 


TAAAAAAGCA 


GCGCGGCTAA 


TAATAGAGGC 


7260 


pppappaapp 


PPA APATP AT 


APTPPTPTPP 


TTTAPTGACG 

XXX ixvj X \JAbVJ 


AAGAACAATT 


TTTCAGTGAC 


7320 


1 1 bn 111111 


TP ATT A PPA A 
Ibnl lALuAA 


PAT ATTTPPP 


ATA ATT ATTT 

1 rLT\ X 1 XXX 


TCAGGGGTGA 


ACTGATCAAT 


7380 


TAAAATCCCT 


TCTGGCTTTG 


TGGGAGCCAA 


TTCTTGTAGC 


AACAGATAAA 


1 bbb 1 Ibb 1 1 


7 4 4 n 

( 4 4 vJ 


ATGCAAAGCG ACTTTCATGT GTACAGCATT GTAGTTTGGT TGGATCTCAT TGTATTTTTT 7500
GGGTTCAACG ATTAATAACT TGTAAGGAAT CAGTTCTTTA ATGACATGTG ACAATTGAAT 7560
AATTTGTGGG TCGGTTAATT CTTTGGAATC ACGAACACCT AATGATTTTA GTTTGCTAAT 7620
CATGnTTTTG TCAACATAGG CAGCACAGAC AGTTACTGGA CCAAAATaGC TACCATTaCC 7680
GACTTCATCG GAACCtATCA CAGgACCcAT TTGCCCAAAA TCCAGCAGGG CAGTTGGTAG 7740
TTAGTCTGGG GGAAGAAACT TTTTnTCTTC TTGGGTGTTG GTTACCTTCG GTTTCCCTTT 7800
CCCCCAAAAC CG 7812

(2) INFORMATION FOR SEQ ID NO: 26:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TTCAGCGACA GGGATGTTAT TGATGATTAA ACGTTATCAA TCGCAATTCG TGTTTTGGCT 60 

ACTAGGAAAT ATTTTTTCAA TTTTGTTATG GTTCCGAGCA GGGACGCACG CTGGTGGTGA 120 

TTATGCGATA TTTGTTATGT ATTGTATGTA TACTTTTAAC TCAATTTTTG GTATGTTTAA 180 

TTGGTTGAAG ATAAAAAATA AAATGGAAAA AAATAGGTAG GGAGTTCGAT GGCAGTGaTT 240 

GTTTTAGCAG GTACGATTGG CGCTGGTAAA TCTAGTTTAA CAGCTTTAAT TGCAAACCGT 300 

TTAGGTTCAG AAGCCTTTTA CGAATCAGTT GATGATAATG AGGTATTACC TTTATTTTAT 360 

GCGGAACCGG AAAAGTACGC ATTTTTATTA CAAATTTATT TTTTAAATAA ACGTTTTGAT 420

AGTATTAAAC AAGCATTAaC GCATGAAAAT AATGTATTGG ATCGCTCGAT TTATGAAGAT 480

TCATTACTTT TTCATTTAAA TGCAGATTTA GGTCGTGCCA ACGAAACAGA GGTCAAAGTG 540 

TACGATGACT TATTACAAAA CATGCTCCAA GAATTGCCTT ATGCTGCGCA TAAAAAAAGA 600 

CCTGATTTGT TAGTTCACAT TCGTATTTCA TTTCCCAAAA TGTTGGAACG CATCAAAAAA 660 

CGTGGCCGTC CTTATGAACA AATTGAGACA GATCCAACGT TGTATGATTA CTATCAAATG 720

TTGAACGAGC GTTATGATCA ATGGTATGAA GACTATGATG AAAGTCCTAA AATTCAAATT 780 

GATGGCGATA AATATGATTT TGTCGAAGAT CCTGAAGCGT GTCAATATGT CTTGGCGTTA 840 
ATTGAGAAGA AAATTGAAGA ATTAGAAAGA TAAATAAAAA GAAGCAAGTC CGGACTTGCT 900 
TCTTTTTGAC TTAAAATGTG GATAGATAGG CTAAAGTTGC TACTAAATAA AGGAGTGTAG 960 

TGCTAAATTA TAGAATCGGA TTTTAACCTT CCCCTAAGTT AAAATCGGAT AATCTATCTA 1020 

TCGACAGTTT AAGTTTGTAT AAAATAGCGG TAGCTATTAT CAACGAAGTC ATTCATCGGT 1080 

TGACCTGACT TCTCTAGAAG TATACCATCG AAAATAACCG ACAGTTGTTC AAAATTTGGA 1140 

CTTAGTAAGT GTAAATTTTG ATTGAAAGTT TCTGTAAATC AATCATTTAT TCAAAAAATA 1200 

ATGCTCGAAA GAAGGAAGAT AAATGAGCCA AATTTGGTTT GAACGCATTC TACGTTGGCG 1260 

CAAAAAAGCG CAACGATTTT TACAAGGACG TTATGCCCGG TTCGATGAAT TGAATCGAAC 1320 

GCTGTTGCTA ACAAGTGTCG TGcTGGCGCT CATCAATATT TTTACTGGCT ATCTTTGGGT 1380 

TCGGTTATTG ATTTTAATCA CAGTCGCCTA CGTGTATTAT CGCTTTTTTT CAAAACATAT 1440

TCACCCGCGA TTGAATGAAA ATCAATGGTT TATTCTTCGA AAACAACGCC TAATGCAAAA 1500
GGTTCATGCT TTTCGCAAGC 3ACATA7GAC TCAGAAAGAG TATCGTTATT TTAAATGTCC 1560
AAAATGTCAG CAGTCAT7AC GTGCGCCGAA AGGACGAGGA ACAATCAAAG TCACTTGTTC 1620
AAACTGTCAA AATCAATTTA 7CAAAAAAGT CTGATCTTCT GTGTTTTTTG GTCTAGGCAA 1680 
CTTTCGGACT TGTGGTATAC 7TC-AGAAAAG AAACCA7TGG ATATTCTAAA AGGGGAATAT 1740 
CTTGkTAAAA AAGGAG.-.CrA AAAACAATG 
(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1903 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

CnGTCGTTGA CGCATCAACG CCTCnC7CGA GTATATGnGT TCACCTTCAC CGCGTTTAAT 60 

AGcAAAcGcT GTTaCCGTTA AATTTTCATT GGCATTAGCT AATCGATTTG TTACTTCTGC 120 

AATCGTTGCT TCGACATTTT CCTTTTCTTT GGCAACAATA AGACTGACAG AAGACTTATA 180 

GGTAATCACC GCAAAAATGG TAAAAACTAC AAATATAAAG AAAGAACTTG CGAAAGCCCA 240 

CTTTATAGTT AAAGATGGGC CTTCCAGTTC TTTTTTAATC GTTCTTTTCA TTAATTAGTT 300 

CTCACGAACG CATAACG7AG CCAGTTCCAC GGACAGTTTG GATGTAGCTT TCTTCTCCAG 360 

GTACGTCAAT TTTATT'JV/T .-_-.GTAG CGGA TATACACATC CACAACGTTT GTTTCTACTT 420 

CTGTTTCATA GCCCCAAACT TTATTTeGTA ACACATCACG TGCCAAGACA ACGTTCACGT 480

TTTCCATTAG CGTTAAZAGT AATcGTAtTC GCGkTTkGTt AATTCAATCA TTTCAGAATT 540 

ACGACGAACG ACACGA7TTT CTTTTTCAAT TGTTAAGTCA CGATATGTAA TCGTTGTTTG 600 
TTTTGCAACG TTTTTATCGC CCTCAA7ATC AATACGACGA AgTAACGCAC GTAAACGAGC 660

TAACAATTCT TCAATTGCAA AIC-GTT:AAC AATATAATCA TCCGCTCCAT GGTCTAAGCC 720

AGAAACACGG TCAATTACTG AATCACGCGC AGTCATCATA ATAATTGGTG TATTTTTCAC 780

TTGGCGAACA CGGCGACATA CTTCTAATCC ATTTAATTCT GGTAACATCA AATCAAGAAG 840

GATAGCATCC CATTCGTTGT TAAGAGCGGC TTCCAATCCT GTACGACCAT TGTAGTGTAC 900

TTCTGTCGTA TACCCCTCAT C-TTTTAVTTC AAGCTCAACG AATCTCGCTA AGTTCTTTTC 960

ATCTTCAATA ATTAAAATGT TGCTCATTTG ATTAATTGTC CTTTCTCTTT AAAAGCTATA 1020

CTCAAGCTCA ACCTTTTTAT TCCTTACTAC TTATTAGAGA ATTACTACTC AATCTATCTG 1080

GTTCTTTTTT AAGCCACAGT AAAACCATGA GCATGGTGAG CCACACTCAT GGTCTCCAGC 1140

GTGTTTATTC TTCGTTGTAC CAGCTGTAAT GGTAGA7TCC TTCTTTGTCT GTGCGTTCGT 1200

ACGTATGAGC ACCGAAGTAA 7CACGT7GTG CTTGAATTAA ATTCGCTGGT AAGCGATCTG 1260

AACGATAAGA ATCGTAATAA GCAATTGCAG ATGAGAATGT TGGTACAGGA ACACCTGCTT 1320

GAACAGCaAT CGCaACAACT TCACGAACGG CTTGTTGATA TTTTyTCGTA ATTTC7ACSA 1380

AGTATTCATC TAACAACAAG TTTTCAAGTG CTGGATTTTT TTCATATGCA TCGGTAATTT 1440

TTTGTAAGAA TTGTGCACGG ATAATACAAC CTGCTCGCCA GATTTTTGCA ATTTCACCAA 1500

ATGGTAAATC CCATCCGTAT TCTTCAGAAG CTGCACGGAG TTGTGCAAAA CCTTGCSCAT 1560

AACTCATTAA TTTACTAAAG TATAAaTgTT CGCGAATTTT TTCAATCAAT TCTTTTTTAT 1620

CGCCTGCAAA GTTAAAGGCT GCAGGTTTTG ATAAAATGCC ACTTGCTTTT ACGCGT7CTT 1680

CTTTGTAAGC CGAAATAAAA CGTGCAAATA CTGATTCTGT AATTAATGGT AATGGTACGC 1740

CAAGATCTAA CGCACTTTGG CTTGTCCATT TACCTGTTCC TTTGTTGCCT GCTGCATCAA 1800

GAATAACATC AACGATTGGT TGGTCGGTTC CTTCATCATC TTTACGTGTC AAAATGTCAG 1860

CCGTGATCTC GATAAGGTAA CTGTCTAATT CGCtTCGTTC CATTCTTT 1903
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

TCTTCTATkG TTAAtTGCaw CTTGTCCAAC TCCTTGyCTT CTTCTACATT ATAACTAAST 60

TAGGTGACTT TTTCACTAAT AACCAATTGA AACTTTTTCT TGAAAACTTT TTGCATTTTT 120 

CATAATTTTG CGTTATTAGC GGATACATTA ATGAACTTAT GGTAAACTAA CATTCTGGAT 180

GGTTCATTCC AGCCAGCCAA TTAAAAACTA TTTTTAGAGC GTTCAACACG CTTATTCATG 240 

ATTTTTTTAT AAAGGTGGAC TTATTTTGAT GAAAAACAAA TCGAAGCTTT TATGGTTAGG 300 

AAGTTTTTTC TTGCCGTTTT TACTTCTGTT GCTTGTATGG ATGACATTAC AACTTGCGCC 360 

TTTTGGTGAC AACAACTTAT TAGTTAGTGA TTTAGGCACA CAATACATGC CTTTCTTGAG 420 

TTTTTTAAAA CGCTCTTTTC ACGAGGGAAT AACTACATTC TACTCTTTTT CCAATGAGAT 480

TGGTGAATCG ATTGTTCCTT TGGCAGCCTA TTACTTACTG AGCCCTTTTA ATGTGTTGGC 540 

TTTCTTTTTT CCTTATGAAC AATTACCTAT AGCAATTTTA TGGATTATTA CCTTAAAACT 600 

TGCCTTGATG GGCACTACGA TGTTTAGCTA TTTAAAATAC ACCTATCAAA AAGTTGATGG 660 

CACCACGTTA CTTTTTTCAA CGTCCTATAG TTTCTGTGGC TTTGTCACCG TCTATAGCCA 720 

AAACTTTATG TGGTTGGATG CACTGATTCT ATTCCCGCTT ATCTTATTGG GACTCCAACG 780

TTTATGGGAT CAACGCAAAT GGGGCTTATA CAGTATCACC CTATTTTTAG CGATTGTGAC 840 

GAATTACTAT ATGGGTTATA TGATTTGTCT CTTTGCTGTT TTGTACAGTA TCTATTGGTT 900
CTTTAAAAAG AATACCAAAG CCCATGCTAT TCGGCAATTT TTTAAACAAA GTCCGTTATT 960
CATCCTCGTT TCTTTTTTAA CAGGAACCGC AACTAGCTTT TTATTATTAG CTGCAGCAGA 1020
AGGGATGCTC TATACAAAAA AAGCAGACTT TGATGTCTCG ACGTTCTTTT TAACGCCTAA 1080
GTTCAACACT TCATTCTTCT CGCAATTAGG CTTAGGTTCT ATTAATTATG AGTTGCGTCT 1140
agaccattta CCAACCGTTT TTGCTGGATT ATTTGTGACA CTCCTCTGTG TTGCCTATTT 1200
TCAAACGAAA CAAATTGCGT TAAAAGAAAG AATAGCTTCA GCAATTCTTT TATTCATTCT 1260
TTTTTTAAGC TTTTGGTTAG AAGCCTTTAA TACTGTCTGG CACATGTTTC AAAGCCCAGC 1320
TGGTTTCCCT TACCGAAATG TCTTTATTTT TAGCTTTTTA TTGATTGTCT TTGCCTATGA 1380
AGTTTGGCTT AAAAAAGTGA CCATTCCTTG GACCGCACCT ATTATTTTTA GTCTGCTGTT 1440
AGTCATTGGC TACGGATCTT TGTATTATGG TCCACAAAAG AATCTCTTAA TTTCGATCAA 1500
TTATTTATGG CTAAGTTTAC TTTTTATTTG GTTGATTTTC TTTTGCTTGC GTTTAGCGCA 1560
CAAAAAGGCC TTAAGAAACT ATGTAGTCGT GGCTTTATTC TTACTGGTCA GCACAGAATT 1620
AACAACGAAT TTCTGGATTT CTTTTAAACA CATGCCCTTT GGTAGCCAAG CAACATTTGC 1680
CCAAGATTAC CGAAAACACA GCCAATTAAT CGATGAAAAA ATGGCCTCAG CACCAGAACT 1740
TTATCGCATG AAACAAGTCA TTCCTTCCAA AGAAACAGGG TTCCGTGAAA TCAATAACGG 1800
CTACAATAAT CCTTTACTTT ATGGCTATGC TGGCGTTTCT AGCTATACTT CGACTTTAAC 1860
TGCCACTACC CAAGACACAT TAAGTGCGCT AGGTTTATAT CGAAAAAATG ATCGCCGGAT 1920
TGCTTATGTG GATAATTCAC AACTAACTAA TTTACTTTTA AATGTGAAAT ATGACTTTTT 1980
ACCAATTGAA AAACCTACCA GTGAAAAATT GCTAAAAACa GTTGGTTCCA CGAAAATCAT 2040
GGaAAACGAT GAAGCGATCG GTATGGGCTT TTTGGCGCCA ACCGCATTAA CCAAGTTAAA 2100
ATTAGCAAAA AATAATCCTT TAGATGCCCA AGAAGaACTC CTGCAAACGC TCGTGCCGAC 2160
GGATAAACCT TACTTTAAAA CAGCTTCGTT GATAAATGAA CCTCATCATA CCAACGAAAC 2220
AATTGAAGCA ACCTTTAAAG TGAATAGCAC TGGTGACTTG CATCTTTATA TTCCGAATTT 2280
AAAATGGAAA AAAGTTACAC AATTGAAGGT AAACCAACAA GTTATCTCGA CGCCGATTTA 2340
TATTGCAACC AATCAACTGT TCAATTTAGG GCATTTTGAA AAAGGAACTA CCGTGACCCT 2400
GTCACTAACT GCTGAACAAG TGGTTGATTT AACCAATTGG CAACTTCAAA CTTTAGACCA 2460
AACAGCCTTT AATCGTGCGG TTGACAAATT ACGCCAACAA GCTCTTCACG TGAATGCAAC 2520
TAAAAAAGGC CATTTAAATG GCGCACTGAA TGTACCTGGA AATGACACTC AGTTACTGTA 2580
TACGTCCATT CCTTATGACC AAGATTGGCA AGTCAAATCG TCGCTACAAA AAGAACCCTT 2640
AAAAACACAA CGCATTCTAG GTGGCTTTTT AGCTGTTGAA GTCCCAGCTG GCAAACAGCA 2700
GTTAACCTTT GCCTATCATC CAAGAATGAT TTATCTCGGC ACTGCCGTCA GCGGAACGAT 2760
TTTACTGGGA ACGGCTGGTT ATCTTGGGTT TAAAAAATAC CGTCGAAAAC GTCAGGAGGC 2820
CACTCATGAC TAATCAGAAA AAATTTTTAA ATGGACTCAT TCGGTGGTTA CCCCTAGTTG 2880
GACTGGTGCT TTTCTTCGGC TTAATTCTAT GGGGATATTC CAGAGGAATT TTTCATTCTG 2940
TTGCTTCCTT GCAAGCTTTT ATTAAACAGT TTGGTAACTA TGCGGTTCTG TGTTTCATTC 3000
TTTTACAAAT CGTCCAAGTG ATTATCCCTA TTCTACCTGG AGGAATCTCT TCTGTTGCAG 3060
GAATGTTGAT GTTTGGCAAT TTGCAAGGAC TTTTGTATAG CTATCTTGGA TTGATTAkTG 3120
GTGAGTTTAT TGGCTTTTTA CTGGTTCGTT ACTATGGACG TAGCTTTGTA AAAATAATCC 3180
TTTCACCCAA TAAATACAAA AAATTTGAAG AGATTTTGGA CAAAAATGAA CACAATGTCA 3240
AAAAGCTCTT AATCTTTACT ATGTTAGTCC CCTTCGCTCC AGATGATATT GTTTGTTTAG 3300
TGGCGGGAAT TACAGATATT TCTTTAAAAG AATTCATGAA AATTGTTCTT TTATTGAAAT 3360
TCTGGTCTGT GGCTACTTAC AGCTATTTAA TGTTGTATTT GTTTCAATTA TTTGGCAAAT 3420
TATAACAAAA AGACAACAAA ACCAACGTTG GTTTTGTTGT CTTTTCTTTT TTAGATTGAA 3480
CGTAATTGGG CAATCCGTTC TTTGACAGCC TCTTGTTTTT CAAGGTAGTC CTTTTCTTTT 3540
GCACGTTCTG CTTCCACGAC CTCATCTGGT GCATTTGACA CAAAACGTTC ATTTGAAAGT 3600
TTGCCTTGCA CACGTTTGAC TTCTTTTGTC CATTTGTCTA ATTCTTTTTC TAAGCGAGCA 3660
ATTTCTTCCT CGATATTAAT CAACCCAGCT AATGGTAGGA ATAGTTCTGC GCCTGTTAAA 3720
ACAGCGGACA TGGCTAATTC TGGCGCTTCT ATTTCACGAC TGATGACTAA TTCCTCTGGG 3780
TTACAGAAGC GTTCCAAGTA GCTAGTATTA GCTGTTAAAA ATTCTTCTAC TTCTGTATCG 3840
TTTGTTTTAA TTAACAAGGT AATTGGTTTA GAAAGTGGTG TATTCACTTC TGCTCGAATA 3900
TTCCGAACCG AACGAATAAC TTCTTTCAGA ACTTCCATTC CTCGAGCTGC TGCTTCATCG 3960
TTAAATTCTT CATGAACAAC TGGGTACTCT GCCACAACCA ATGACTCACC TTGATGCGGA 4020
ATTTTTTCCC AAATTTCTTC TGTTACAAAC GGCATAATTG GATGCAATAA GCGTAAGATT 4080
TGATCCAATG TGTACACCAA GATGCTCCGT GTTGTTTGTT TAGCTGCTTC ATTGTCTCCG 4140
TAAAGAATTT ctttact:at TTCAATATAC CAATCACAGA AATCATCCCA GATGAAGTTG 4200
TATAATTGGC GACCTGCTTC ACCGAATTCA AAGCGATCAA ATAATTCTGT CACACGGGCG 4260
ACTGTTTCGT TTAAGCGCGT TAAAATCCAA CGATCAGCGA CAGTCTTTTC ACCACTAAAG 4320
TCAATATCTG CAGCAGTCAT ACCTTCTACA TTCATGATAA CGAAACGACT TGCATTCCAG 4380
ATCTTATTGA TAAAGTTCCA AGAAGCATCC ATTTTTTCAT AACTAAAACG AACATCTTGA 4440
CCAGGTGCTG AACCATTTGA TAAGAACCAA CGTAACGCAT CTGCGCCATA TTTTTCGATG 4500
ACATCCATTG GATCAATCCC GTTCCCCAAT GATTTACTCA TTTTGCGTCC TTGCTCATCA 4560
CGAATTAAGC CGTGAATTAA GACATTTTGG AAGGGACGTT CACCTGTAAA CTCTAAGCTT 4620
TGGAAAATCA TGCGACTTAC CCAGAAGAAG ATGATGTCAT AGCCAGTAAC TAATGTGCTT 4680
GTTGGGAAGT AACGTTGGTA ATCTTCGCTT GCTTCATTCG GCCAACCCAT CGTTGAAAAT 4740
GGCCATAAAG CAGAACTAAA CCATGTATCT AAGACGTCAC TATCTTGAAC CCAGTTTTCA 4800
CTGTCGGCTG GCTCTTCCAT CCCAACATAC ATCTCGCCCG TTTCTTTGTG ATACCAAGCA 4860
GGAATTTGGT GGCCCCACCA TAATTGTCGC GAAATAACCC AGTCATGTAC ATTTTCCATC 4920
CAACGTAAAA ATGTTTGATT GAAGCGTGGT GGATAAAATT CAACAGCGTC TTCTGTTTCT 4980
TGATTTTTCA TTGCTTTTTC AGCTAATGGT CCCATTTTTA CAAACCATTG GGTCGATAAA 5040
CGAGGTTCCA CGACAACGCC TGTTCGTTCT GAATGGCCCA CGCTGTGATT CATTGTTTCG 5100
ATCTTAATTA AGCGACCTAG TTCTTTCAAG TCAGAGACAA TCGCTTTACG TGCAGCAAAA 5160
CGATCCATTs CCTcGTATTT TCCAGCCAGT TCATTCATcG TGCCGTCTTC GTTCATAACA 5220
TTTACTCGAG GTAAGTCATG ACGATTACCG ACTTCAAAGT CATTTGGATC ATGGGCAGGT 5280
GTGATTTTTA CAACCCCTGT TCCAAATTCC ATATCTACAT AGTCATCTGC AATAATTGGA 5340
ATTTCTTTAT CGACTAATGG CAAGACAACT GTCTTACCAA TAAGTTCTTG ATACCGTTCA 5400
TCTTCTGGAT GGACCGCAAT CGCCGTATCT CCCAACATTG TTTCAGGACG CGTTGTCGCA 5460
ATTTCAACAA CGCCAGAACC ATCTGATAAT GGATAGCTCA TGTGGTAAAA AGCTCCTTCA 5520
ATATCTTTAT GGATTACTTC AATATCAGAC AATGCTGTTT TCGCTTTTGG ATCCCAGTTA 5580
ATGATATATT CACCACGGTA AATCAAATCT TTCTCGTAAA GAGAGACAAA CACTTTACGA 5640
ACGGCCTCAG ATAGCCCTTC ATCTAAAGTA AAACGTTCAC GGCTGTAATC TAAAGAAAGA 5700
CCCATTTTTG CCCACTGTTC ACGAATGTGA GAAGCATATT CTTCTTTCCA TTCCCACACT 5760
TGATCAACAA ATTTTTCACG ACCTAAGTCG TAACGTGAAA TTCCTTGTTG TGCTAGTTTC 5820
TCTTCTACTT TTGCTTGCGT GGCAATTCCG GCATGGTCCA TTCCTGGCAG CCATAACGTA 5880
TCAAAGCCTT GCATTCTTTT TTGGCGGATA ATCATATCTT GTAATGTTGT ATCCCAAGCA 5940
TGCCCTAAGT GTAGTTTCCC TGTTACATTG GGTGGTGGAA TAACAATTGA ATAGGGctTC 6000
GCTTTTTTGT CGCCACTAGG TTTAAATAAA TCTTGATCTA GCCATTTTTG ATAACGGCCA 6060
GCTTCGATTT CTGTCGGTTG ATATTTCGTT GGTAAGTTTT TTTCTTCTGA CATTTTTTCC 6120
CCTTCTTTCT TAAAATACGC GATTTGGAGC GGTAATTATT TTACAAATGA GTTTTTTAGG 6180
CTAAACACTA AAATACGCCC TAAAATACTT TCGTATTTTA GGgCsTAGaT TCATCTTATT 6240
TACGCGGTAC CACCTAAATt GCATTtGAAT tGAATtCCTT CAAATGCCAC TCATtCGTGA 6300
GATAACGTTC ACGAAACGAA TCGTTCTACT AACGTACTGT GTTCAAACCA TTGACTCACA 6360
AGCTACCTTC TCACTGGTTG TGTAAAAGTC TTGCACCTGA ACGACTTTTT CTCTAAAACG 6420
ACGCCAATGA TACTCCTCTT GTTCGCTGTC TTTGCTTCAT TCTACAATGA AGCGCAAgcA 6480
AAGTCAATTT GTAATCTAGA GGGATTACAC TTTTTTTGCA GATTCTAACG CTTGTTGATA 6540
ATCGGGTTCC GACGAAACTT CTGAGACGAT TTCTTCATAA ACTAACGTTC CTTCTGGGTC 6600
AATCACAAAA ATAGCACGTG CTAAACGGCC CATTTCTGGA ATATACAAAC CATAAGCTGC 6660
ACCAAATGAG TCTTCAGTAT CATGAAGCAT TTCCATTTCG ACACCCTCAG CGGCACACCA 6720
ATTTGCTTGT TCTTCAACTG TGTTATTGGA AATCGTGATA ATCTGAACAC CGTCTAATTT 6780
CGCCGCTTCT TGATTGAAAC GTTTCGTTTG CAGTGAACAA ACACGTGTGT CGATATCAGG 6840
AACCACACTA ATTAAAACCG TTTTACCTTT ATAATCTGCC AAGTTGATTT CTTGGTTGTT 6900
TAAATTTTTC AAAGAAAAGA CGGGAGCTTT TGTGCCAACT TTAGGCTGCT CACCTGTCAA 6960
TTCTAATACG TGCCCTTTTC TTGTAACATT CATTCTGAAA TCCTCCTTTT CCACTTTCAA 7020
TATTTGCTAG TAATAAGTAT AGTAGAGGAT TTTTATGAAG ACAATTTTAT TGCTTTGGGC 7080
CAATATATTG TGAAAAAGTT TCATTAAAAT AAATCAGTGT TTGCAGTTCC GTTGTTAAAT 7140
TCGCAGTCAC CACACTAACG TTAGCTGGCA CACTGACACG GTCTGGCGCA AAATTTAAAA 7200
CTGTAGAAAT ACCCGCTTGT ACAATTTTGT CGATGGCTTT TTGAGCATGG TGACTTGGTA 7260
CATGAACTAA GGCTACCGTG ACCCCTTCTT GTCGAACGAA TGCTTCTAAC TCAGACATAT 7320
AGACAATATT TAATCCATTA ATCGTGGTGC CAACTAAAGC CGAATCATTA TCAAAAGCAC 7380
TGCCACAACC CAAATTCTCG TTGCGTCTAA AATTATTTTT TAATAATGCC TTGCCTAAAT 7440
CCTCAATTAA AATCAGAGCA ATTCTTTTTT CTTCTTGCGT ATTTAAAATA TTACTGAATA 7500
ATACgGCACA TCGTn 7515

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7156 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

ATATGGCCTT CACTTTAGTG AAGAGATTTA ATTGTCTTTA CAGATAAACG CTTAATTTTA 60
GTCGATAAAC AAGGAATTAC AGGTAAAAAA GTTGATTATA AATCGATTCC CTACAAATCT 120
ATTTCACGCT TTTCTGTTGA AACTAGTGGA CATTTCGACC TAGATGCAGA ACTAAAGATA 180
TGGATTTCAA GTACTGAGTT ACCTTCAGTA AGTCTTCAAT TTAGAAAAGA TAAAGATATT 240
GTTGCTATTC AACAAGCTTT AGCAGCTGCT GTATTATCTT GAATACACAA AAAAAGACAG 300
GAAAGCGACC AACTTTCCTG TCTTTTTAAT TktAGATAAA ATAAAAAAGG CTTCTCAGCC 360
TTGATACGTT CCTTATTCAA TCAGACACAT GGCGGCACTT GCTTAGTGCT GCTTCCTTCC 420
GGATCTGACA CAATTCACTA GCCCGCCATT GTCCTAGCCT TTCGGCAAAA CCTAGTATAA 480
CGTATTTTTA TTCTTTTGAC TAGTCTTTCT TTTGATTTTT TAAAAGTTTC TCTTCTTTTT 540
TCTTTGCCCG TAACTTTTTA AAGAACTGAA CTAGCAATAA GCGACACTCT TCTTCTAAAA 600
CGCCTGCTTC GACATAGGCC ACATGATTGA AGCGTTCATC TTCTAATAAA TTCATCAATG 660
TGCCAGCAGT GCCTGCTTTA GGATCAGTTG CGCCATAATA AACTTCTGGA ATTCGTGCCA 720
AAAGCATCGC CCCACTACAC ATTGGACAAG GCTCTAAAGT TACAAATAAT TGGGTTTCTT 780
CTAGACGCCA ATTTTCTATC CCGCGACACG CTTCTTGGAT TGCATACATT TCTGCATGCG 840
CCGTTGCTTC TTGCCGTGCT TCTCTCAGAT TATGCCCGCG GCCAATAATT TCCCCTTGCC 900
GAACAATTAC AGCTCCAATA GGCACCTCAG CCAAACCTTC TGCTTTTTTC GCTTCTGCAA 960
TGGCTTCTCG CATAAAAAAT TCTTTTTCTT CTTGTGTTAA CGTTGTTTCT TTTTTTCCCA 1020
ATTAATTTCC ACCTTTCGGC TTTTTCCCAC TAAACTTCTG TCTTGCTTCA TGGTACAATA 1080
AATCAGTGAG GTGTGTCTAT GATTATTGAA AATTTATCTC AGCTTTATCC CAAAGGCTAT 1140

TTATCAAAAA CAATTGGTTC CGAAGCCACC TATTCAATTC CTGTTGACGA CCAATTTTTC 1200

ATTGTGAATA AAGCTTTCTT AAGTATTAAA GAACAGCATT TGTTGGAAGC GCTTTTCCCA 1260

ATTTCTGAAA aCCCAACAAT TACTGGTAAC CATCCTTGGT TCAGTTACTT ATTTCAACAG 1320 

GCAAAATTAC CTGCCGAAGG GACCTTTCGT ATGTTGCAAA TTCAAACGAA TGTCACAAAA 1380 

GAATTACAAG CTGAATGGCA ATTTAATTTA ACAAAGATGT TTCCAGATAC AGTTGATTGT 1440

TTCAGTCCAA GTAATAATAT GTATATTTTA GTTGAAGAAC AGTCAAAAAA TACCTTTCAA 1500 

CAAGAAGAAA TTCAAG^AT CTTTTTAACA TTAGATACAG ACTTTGATTG CACTAGTGCT 1560 

GTATTTGTTG GCAATTTTTA TTCCTCTGAA GACATTCTCC GGCGTTGTTT CCATGAAGAA 1620 

CAACGTATTT TTTCAGAAGA ACTTAATTCC TCTAGTCGGA CGACTGTTTT TAGTTTAACT 1680 

GATGTTGCCC TCCACTATTT TACCAAGGAA GCCATGAGCC AAAATGTACT TGTTGAGTAT 1740 

TATCGTCGTT TATTAAATAA AGACACCGAT ATTCAGCCAA TCATCAGCGC TTTATGGAAA 1800 

AATCAAGGCA ATATCAGCTC CACTGCAAAA GATTTATTTA TGCATCGAAA TACTTTACAC 1860 

TATCGGTTGG AAAAGTTTTT TGAACAAACT GGCTTATCTT TAAAAAAAAT GGATGATTTA 1920 

ATCTTTTGTT ATTTGTTATT AAGAAAGTAG TACGTCCATT TTTTACTCGT ATGTTTTTTA 1980 

TTCCCAATTA GAAAGTAGGC GTTCCAATGA AAATTACCGC ATATGTCGCT AGTGCATTTA 2040

GTAAAAACCA CGAAGGTGGC AACAAAGCAG GCGTTGTTTT TATGACACAG CCCTTAACGA 2100 

GAACCCAAAA AATGGCCATC GCCAAAGAAC TAGGATTTGC AGAAACTGCT TTTATTTCAG 2160 

ACTCAAAGAT TGCTGATTTC CGCCTTGAAT ATTTTACACC AAAAGAAGAA GTCGACTTAT 2220 

GTGGCCACGC GACAATTGGC GCTTTCACCA TTATGAAGCA CTTAAATAAA TTATTGCAAA 2280 

CAGCATATAC AATCGAaACG AACAGCGGTT TGCTCGCAAT CACGGCTAAC GGTGAACAAC 2340
TATTTATGGA ACAAACTGTC CCAACTTTTT ATGATACGCT CACTTTAGAA GAATTGAGCG 2400
GCTGCTTTGA TTTAAACGCA GTGgCAACTG CTTTTCCTAT CCAGATTGTG TCAACGGGCC 2460

TAAAAGATAT TTTGGTTCCC ATTAAAAATG CGCAATTATT ACAGGATTTA CAACCAAACT 2520 
TTGAAAAAAT TAAGTTACTT AGTCAAAAGT ATGGAGTTAT TGGGATGCAT CTTTTTGCTT 2580 
TGTCTGAAGA AAATATCATT TGTAGGAATT TTGCACCCCT TTATGATATC GATGAAGAAG 2640 
CCGCTACTGG TACATCAAAT GGTGCGTTAg CTTGTTATCT GCACCATCAC CAAATTTTAC 2700 
AAAAAACCAA TTACACCTTT GCCCAAGGTT ATACCTTAAA CGCTCCTTCA GAAGTCTTAG 2760

TGCGTTTGAA CACTACATCC CAAAAGACAA TTGATCAAGT TCTAGTTGGT GGCAGAGGCT 2820 
ATTATGTTGA AACAAAAGAA TTATTTTTAT AAAAAAAAGA TGGcCTAGGC CAcTTTTTTa 2880 
ACTTCaTTGC ATCTTCAATA AATGTGAAAA AGCGTTCTTT TAAAGAATCT GTATTGTCTT 2940
TAACTTTTTC GCCATATTCA GCTAATTTTG AACCGCCATC TTTCACACGA TCCACAACAT 3000
TTAAAACAGA ATCTAATTCA TCATCGGATA AATCATCGAC AATCGATAAT AATTTTTGGT 3060
TTCCATCAAA TTTATCGTCT ACAAACTTTT TAACTTTATA ACGATTGGAA ACATGAGATA 3120

CCTTCTTAAT AATTTTTTCA GAAGCGATGA CTGCAACAGA GACACTTGCa ACTGCAGcAA 3180 

TGCTTAAACC GATACCAATT TTTGTTGATG CTTTCaTTTC CTTCAGCTCC TCTGTTATTT 3240

TCTACTTTAA TCATAACATA TTATTACTTT wAkGCGTAwT rGAAaGrCCT TTACTAGCAC 3300 

TTAGTCTCGT CGGTTATTGC CACCAAATAA AATAACCAAA CGTAAAAGTT GTAAAAAGGT 3360 

CGAAAGCGCC GCTGCTACGT AGGTAAGTGC AGCTGCAAAA AGAACTTTTC TGGCCATTGG 3420

GACTTCTTCT TCTGTTAATA ACCCACCTTC AGAAAGAATA CTTAATGCAC GTCTAGACGC 3480

ATTAAACTCC ACCGGTAAAG TCACCAATTG AAAAATCAAG GCTAATGAAA AAGCCAAAAT 3540

TCCAATGTTG ATTAATGTTT GATTCCAACT AAATAAAACA CCAAGCAAAA TTAGTGGAAA 3600 

GGAAATGGTT GAACCGATGT TGGCAACTGG AACAATAGCT GCACGAAGAC GTAAAGGCAC 3660 

ATAGTTCGTA TGATCTTGTA CCGCATGTCC GCATTCATGA GCCGCCACGC CAATAGCCGC 3720 

GACCGATGTC GATTGAGCAG TGGCTTCGGA TAAGCTCAAC ATTTTGTTTC CAGAATTATA 3780 

ATTGTCGGTT AAATCACCCG CAATTTGCTG AACTCCAACA TCATTAATTT GTTCTTTTTG 3840

TAAAATATAT TGTGCTGCCT GCGTGCCTGT AACATGTTTA CTGCTTCGTA CTTGATCATA 3900 

TTTACGAAAG GTACTGTTAA CATATGCGGA GGCCGCACCA GAAATCGCAA TTCCGATAAT 3960 

TACCAATATA TAGGTTGGAT CGATTCCAAA ATAAAAGGGC ATCATTGTCC TCTACCTCCA 4020 

TCTTTTTTTC TTTATTGTAG CATGATAGTG AAAAAAAGGG AGCGACAAAC GTTGTCTTTT 4080 

GTTTAAAAAA ATGATGAATT TTTCAAGAAA GTGCCGAAAC TAACAAAATT GTCACTCAAT 4140

TAACGGAAAA AAATATGGCA AATTTCCTAA AAAAAAGGCA TACTATTCCT TGAGGTGAGG 4200

AAAATGAAAA CATTACTAAA AATATTAATT CCCATAGTCA TTTTAGCCGG TGGTAGCTGG 4260

GGCGTTTATA ATTACTATTA CGGTGGAGAA GCCTATTATA CACAAATCAC AACATCTGGT 4320

GAAAAGAAAG ATGAAAAAAC AAATTCTGGT GAAGCAATGA CCATTTATTA CTATCAACAA 4380

CCTGCGTTTA ATAAAAACGG TGAAGAAAAA ACCGTGGAAT TAAACGAATC CCGTGATCAA 4440

CCTTTGCGTA TGAAAGCCTA TCTAAAATTG AAAGTTAATC CCCGTAAAGG CGTGATCAGC 4500

TGGAATGAAG TGACAGAAAA AGAAGTCCCT GAAAAAGCTT TAGAAAAATT AAAATAAAGA 4560

AAAAACCGTT GGCAAGCGAC CAACGGTTTT TTCTTTATAA TTGATTACTA ATTTTTTCGA 4620

TATTTTTTAG CATAATGGAC ACTGTGCCAT CTGGATTATT AACAAATTCA ATTAAATCGC 4680

CATTGCGGTA AACATCTAAC GGAACAATTA ATTCAATACC GTTATCCATT TTTAACTTCT 4740

GTTTGCCAAA TTTCTTTTCA GAAATTTCGC GAGATTCTCT GACTGGCGGT GTTTTGTCGA 4800

CAAACTTCGA TTCATTCACT TCTTCTTGGT AAGCTAATTT AGCTGATACA TTTTCTTTAA 4860

ACACTTGCTC AGCAATCTGC TCATTGTCAA TCGTCCCCGT TTGCTCAATG CTATCATACA 4920

CCGcTTCTTT TACAGAAGCC ATTAGTTCAA ACTCATCTTC GTTAAATTTC TTTCCAATGC 4980

GTTTGACTGC CTTTTTAATT TCTTTGATAT TTTCTTCCAT GGAAGGTGCC GCTTGCGTCT 5040
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UAA 1 o AL ill 


t t p a p, a p. a a a 


T A PP AC. A TTT 1 


TTTCTCCTGA , 


AAATTCATAT 


CTTTTCTCTA 


5100 


biAAl 1 L. AAA 


tijU Innl bin 


TPTZi ACTPAA 


PPPTAATCCC 


TTCATCTGGT 


TTTTGTGATT 


5160 


Tl GAAbAbAA 


aaTTPPTPPP 


TTP.aTA ATP A 


P,TTTC5TTATA 

O XXX vj x x n x o 


AACGGATTCT 


TCTTCATAGT 


5220 


CGACG 1 AA1G 


PPTPTZi APTT 
bb lul AAb 1 I 


lol i x rV x r\\j x 


TTAATTTAAT 


CATTGCGACA 


TGTAACACCG 


5280 


TATC1 AA1 1L 


AT a fa a a 2Tr 


aaPaAAATAT 


ranrACTAGG 


CGCATCTTCG 


CTTCCTGAAT 


5340 


AAACATLA 1 A 


PPa ZiTPTTPP 




VJVJVrf X X X X S-f X X 


TATAAAATCA 


CTAGGAATCC 


5400 


m >^ rp TV TV r"^ rp 

CACGTAALG I 


PTP A AP A A AP. 
b 1 b AAb AAA O 


P. A APT ATTTT 


pppppaaTGT 


TCCAGTCTTT 


GTTTGTGCAG 


5460 


TCGAAAGTTT 


I bAAAi 1X11 




n\^\3 1 1 VjVjrtrt x 


ATACTCGGTC 


GTTAAATCTA 


5520 


GTTCCTTTTC 


APAPAAAAPP 


UOCvj 1 V^Vw.V^M.0 


TTTfTffi ATP 
x x x \w x \jr\ x 


AATAATGTGT 


AAAATTGCTT 


5580 


TTTTTAAATA 


n Tv m tv TOPTiTT 

AATAIbbAl 1 


TTTAPPPTPT 
111 AUGU 1 b 1 


PPTTTTPTP a 
1 1 I ivy 1 v^-r\ 


APPTTTTTAG 

r^VrfVw X X X X X *V w 


TCTAACTGAA 


5640 


AGAGAGAAAA 


t\ t\ /"* V" /"*• 7V 7\ rp Tv r^ 

AAGCCAA 1 AG 


rp rp rp rp 7\ rp 7V 7V 7\ 

1 1 1 L 1 Al/\fVA 


aapar-p.aana 


ATTPGTAGAC 


AAATGTCTAC 


5700 


GAATTCtTCT 


rp rp TV r"* rn rp f* tv tv tv 

TTAL 1 1 LAAA 


AAPPPATTAP 


PPTTPTTPPP 


TGTATAACAT 


CCCTGTTGCT 


5760 


TCATCAACTG 


/"•A rp /-i/-' 7. ^-t rn rp rn 

GATbbAb ill 


TV rnrp TV A A A ATP 

A I 1 AAAAA1 Lj 


v^vv i J. it! i 1L 


TnAAAATATA 


AGTTAAGCCC 


5820 


ATTGGTACGA 


m TV TV rn TV TV rp 

TAATGGCAAT 


CAvwt-Al IGCo 


ppta aaaapp. 


PPAAGTAATA 


TTGTGGCACA 


5880 


ATAGATAAAA 


TTCCTGGCAA 


GbUAUbGAUvj 


ppp.aTaPTPP 


TPGCACGAAC 


GCCCATTAAA 


5940 


GTGGCAAACA 


rr> rn r~* r** <T» r*^ f~* TV rp 

TTCCTGCCAT 


Trri 1 !" a a pp a 
TGC 1 GAALCA 


ATPaTPPPPP. 


PAAPAAATGG 


ATAAACATAT 


6000 


TTTAAGTTGA 


rn /"» /*■ Tv TV tv /"* Tv rp 

T CbCAAAbA 1 


Lbblbbl ltn 


pTpapappTA 


P, G T AA P AAG A 


AATcATCGCT 


6060 


#*i -m y\ rn T\ T\ T\ 

GGAATCGAAA 


/■*trpcn/-«rp +. /-» rp rp /-> 

CTTGTtb lib 


11111 Kb 111 


pPTnpp,aTnPA 


TAAAGaTAAC 


CGCTAAAACA 


6120 


GCTGAACCTT 


rn /"</-« TV t\ rn 7\ rp rp 

GTbLAAlAl 1 


TP AT 1 A P APPTi 

1 G A 1 AbAbLA 


AT»PaTTP,P,PP 
A1LA1 lobUL 


ATAAGTTGGT 


TGACTGAAAG 


6180 


m TV /""•/*"• T\ TV rp /~« TV 

TCAGCAATCA 


AL 1 G 1 AAA 1 b 


a a TPP.PZVTTZ1 


PTPATATHAT 


GCAACCCAGT 


GATGACTAAT 


6240 


GGTGCGTACA 


rp A AAf/^OAAA 

TAAAbbbAAA 


unLbuLbuLn 


A ATAAPPAAT 


TAAATGTTGA 


CGTTAATCCT 


6300 


GCATTTACGA 


r PA r P/" , APAAAP 
1 A 1 b Ab AAAb 


PP Zi Zi PT Zi PP Zi 




PGATTGGTCC 


TAAAATGACA 


6360 


TGTGCAGCCA 


Ibi 1 bb 


T Zi PT Zi Zi a P.P a 
1 r\v_^ 1 rulrtov^rt 


AAPAATGGGA 


CAAAAATCAT 


TGAAATTGCT 


6420 


TGAGbAA 1 CA 


/^rpmrpnn^~«rp7\ T\ A 

b 1 1 1 1 b 1 AAA 


pa a a aTTTPa 


AP,ATAP,nPAA 


GCAAGAAGCC 


GGCTAACATG 


6480 


rp /-■/—•/-» t\ rp tv Tv 

GLiGGbAl AA 


LI HjLvjLI lb 


a t a a pp a a tp 


aTPTTAAPTT 


GAGCAAACCC 


--AAAATCCCAG 


6540 


AAAbbAAl 1 I 


b 1 1 111 


aPTPPTPP.pa 

/iVj 1 1 v_vjv_,rt. 


APTP»A ATAAPi 


C ATT C AG AAG 


TTGTGGTGAG 


6600 


T\ rp tv T\ rn rp tv 71 

AL I AA Ibi AA 


1 LtL 1 HouHO 


a a t p pp p a a P. 




TTCCCATTTT 


CTTCGTAATA 


6660 


/TT - l/"'/r-»TV O/"* rp TV Tv 

CTbCAbb I AA 


I bbbAAb 1 


pa apaaPTPP 


a a a attpttp 


GCCGATTAAC 


CATAAGAATG 


6720 


AATTGACTCC ATTCCAAAAT TGAGAAACTT CCACGATTGT TTGACCATCA AGAAAGCCAA 6780
ATGGAACCCC TTCCAAAATA TTGCGGAACC CTAAAATCAA TCCCCCGATA ACAAGTGCAG 6840
GAATCAATGG TGTAAAAATC TCAGCTAAAA CAGAAACAAC TCGTTGAACA GGACTCAAAT 6900
TACTTTTGGC AGCTGCTTTC CCTTCTTCTT TTGAAACCCC TTCTAAACCG GAAATTGCAG 6960
AAAAATCATT ATAAAATTGT GGGACTTCAT TCCCAATAAT TACTTGAAAC TGTCCAGCAT 7020
TTGTAAAGGT TCCTTTTACA GAAGGAATAT CTTCAATCAC AGGTACATCC GCTTTTTTTG 7080
GATCTTTTAA TACAAAACGC ATTCGAGTTG CACAATGAGA AACAGCTGAA ATATTTTCTT 7140
TGCCTCCTAC TGCTTT 7156
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

ATTCGTTGTA ATTCCAACTG GAATTCCTAA TACGAACTTA GAAACGTCGT ATCCATTTCT 60 

TGCTTTAATG TCAGTTGTTT TTGGACCAGT AGCTGGAGGG CTAATTGGTT TGATCgGTCA 120 

CACgTTAAAA GATTTTACCA CGTATGGTAG CGCTTGGTGG AGCTGGATTA TTTGTTCAGG 180 

AATTATTGGT ATTATTTTTG GCTTCGCTGG TCGTAAAATG GATCTTCAAC ATGGTGAATT 240 

CACAACAAAT GACATGGTGC GTTTCAATAT TTTCCAAGCC TTTGGCAATA TCGTCGTTTG 300 

GGGCTTAATC GCGCCTAGCT TGGATATTTT GATTTATAGT GAACCCGCAA GTAAAGTGTT 360 

TACACAAGGC GTATTCGCGA CTGTTTCAAA TATTGTCGCT GTGGGGATTA TTGGAACATT 420

GTTGATGAAA GCTTATGCAT CAACAAGAAC AAAGAAAGGC AGTTTATCAA AAGATTAAGA 480 

AACTGTCGAA ATAATTTAAA TGGTTAAGGG TCTGCCCTAG TAATTACTTT GCTAATGCTC 540 

AGACCCTTTC TTTTATGGAG AAACTGAGGT GAGGCCGAAT GAAAAAACCA ATCATTACTT 600 

TTAATAATTT TTCCTTTCAA TATCATAGCC AATCTGAACC AACATTAAAG GGGATTCAAT 660 

TAACCATCTA TGAAGGAGAA AAGGTTTTAA TCGTTGGGCC TAGTGGTAGT GGTAAATCAA 720 

CATTGGCGCA ATGTATCAAC GGGTTAATTC CCAATATTTA TGAAGGTGAG ATTCAAGGAA 780

CAGCTACGGT TGCTGGTAAA AATATCCAAG AAACAAGTCT ATTTGATTTG TCTTTTGATG 840 

TAGGAACTGT TTTACAAGAT ACCGATGGGC AGTTTATCGG TTTAACGGTT GCTGAGGATA 900 

TTGCTTTTGC TTTAGAAAAC GATGCAGTTG AACAAGCAGA AATGAAAAAA GCGGTTCAGA 960 

AATGGTCCGA GATTGTTGAG CTGAATCAGT TATTGCAACA CCGTCCACAA GATTTGTCAG 1020 

GTGGTCAAAA ACAACGTGTA TCTATGGCGG GCGTTCTAAT TAACCAATCA AAAATTTTAC 1080 

TTTTTGATGA ACCTTTGGCT AATCTTGACC CACGAGCGGG ACAAGAAACA ATGACTTTGA 1140

TTGACACCAT TCAACAAGAA ACAAAAGCGA CTGTTTTAAT AATAGrACaT CGCTTAGAGG 1200 

ACGTtCTTTG CGAATCAGTT GATCGAATTA TTGTCATGAA TGAgGGCACa ATTATTcTGA 1260

TACAACGCCA GATGAATTGT TA 1282 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5840 base pairs 

(B) TYPE: nucleic acid 




(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



TAAGGATTGT 


TACTGCTTTG 


CAGGCAAAAC 


C AbAb I L 1 1 b 


CATTAACACA ATCAGTGGAG 


60 


GGATTCTGGG 


CTTTTTTTAT 


AAAAaAGTGA 


GGTTGCA1 1C 


ATCATGAGGA 


TTGAAAAAGT 


1 70 


TTTAAATAAT AATGTCGTGC TTTCAAGAAA TGAGCAAGGC GAAGAAATTG TTTATATGGG 180
ACGCGGCTTA GyCTTTCAAA AGAAGATTGG AGACACGATC AATCCAGATT ATGTTGAAAA 240
GGAATTCGTG TTAAAGGACT CCGCTATGGC AGGGCAATTC CAGCAGCTGT TTGAAGACGT 300
CCCCACCAAA GAAGTAGAAG TCGTCAAACA AATCGTTGAT TTAGCAGAAA CAACGCTAGC 360
GATTGAGCTT TCATCAAATA TTTATTTAAC TTTAACGGAT CATATTCACT ATGCGTTACT 420
TCGAGCCAAA GAAGGAATTG ATATTCCGAA TCCTCTTGTA TTCGAAACAC GCAAATTTTA 480
TCCTAAAGAA TTTGAAATCG CAAAGCAAGC ATTAGCGATT ATTGCTGAAA AGCTAGGGGT 540
TCAATTTTCA GAAAATGAAG CGGGCTTCAT TGCTTTTCAT ATTGTGAATG CTGAACAAGG 600
AAATGGCAAT ATGGAAGTTA CCATGGAAGC AACCAAGATG GTTCGTGATA TTTTAACGAT 660
TATTAGTCGC TACTTTGGAC AGGTGTTTGA TGAAGATTCA TTGAACTATC AACGAATTGT 720
TACACATCTT CaATATTTTG CGCAACGTTA TTTAAAACAA GAAGCGCATG aTGAGGAGGA 780
CGAATTTCTT TTtGCATtGG TCCAAGGAAA GTATCCTAAG GCGTTTCAAG CGGTCCAACG 840
TATTAATGAG TATTTATTAA AAAGCTATGA TCGACCAATT GATCAAGCGG AAATGATTTA 900


TTTAACCATT 


LA J. A 1 


GAGTGGTCAA 


TGATAAAAAA 


GCATAACAAC 


CAATAATCGC 


960 


GbAl X b 1 1 AL 


I bn 1 Art 1 oL-ri 


GGCAAAACCT 


AGATTGAAAA 


AACCGATAAC 


GATTTTCGCA 


1020 


GAG I GbbbAL 




AGGATCTGGC 


TCTTTTTCAA 


TGTAGGTTTT 


TTTGATATTA 


1080 


AAAATTAATG 


GAAATGAGGt 


CTAGAGATGA 


m TV TA 7A 7A 7\ m^** TV 

GTAAAAA TUA 


AGAAATTGCA 


GCACGTGTGT 


1140 


TAAAAGCTGT 


TGGGGGCGAA 


GsCAmCGTTA 


AlAbKbl ibK 


TCACTGTGCy 


ACACGCTTAC 


1200 


GtTTTAAATT 


AAAaGrTGAA 


AATAAAGCAG 


A 1 AbAbb i 


ATTGAATGCT 


GACCCAGATG 


1260 


TTATTCAGGT 


AGTTCAAAGT 


GCAGGACAAT 


ATbAAb 1 bb 1 


lAi IbbbHAjl 




1320 


TGTCTATAAG 


GATTTGATGG 


CCAACAGTGG 


ACTAGGTAAT 


GATTCTGATA 


ATAGAGAAAA 




AGAAAGTGGC 


GGAAATATCT 


TTAACCGTTT 


AATTGATATT 


ATTTCTTCTA 


TTTTTACACC 


1440 


GTTTTTAGGT 


GCAATGGCCG 


GCGCTGGGGT 


TCTAAAAGGC 


TTCTTAACAT 


TGGCTGTAAC 


1500 


TATGGGCTGG 


TTAGCTGACA 


CTTCAGGCGT 


ATATCTTGTT 


CTCTTTTCGA 


TTGCAGATGG 


1560 


TTTATTTACT 


TTCTTACCAA 


TTATGTTAGC 


GTTTACGGCT 


GCTAAAAAAT 


TCAATACTAA 


1620 


TCCCTTTTTA 


GCGGTTGCTT 


TGGCGATGGC 


CTTGGTTCAT 


CCCAGCATTA 


CTGCTTTAGC 


1680 


AGGCAAAACA 


ATTAGCTTTG 


CTGGTCTTCC 


AGTGATTATT 


GGGCCAAGTG 


GCTATACTTC 


1740 


TTCGGTTTTA 


CCAATTATTT 


nGGCTGtTTT 


TGCACAAAGT 


1 TATGTAGAAC 


: GTTTCTTCAA 


1800 
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AAAAGTAATT CCGAGTTTTC TACAAATTAT CTGCGTTCCG TTAGCTGTTT TCCTTATTAT I8 60 
GGCACCTGTT ACCTTCTTAG CTATTGGTCC TATCGGGACG GTCATCGGCG ATTGGTTGGG ' 1920 

ACAAGGATAC AACGCAATTT ACGCTTTTAG TCCAATTATC GCAGGGTTAT TAATGGGTTC 1980 

GTTGTGGCAA GTCTTGGTAA TGTTTGGTAT GCATTGGGGC TTTGTACCAA TTATGATGTT 2040

AAACTTAACA CAAGGTGGCG ATACGATGGT ACCGATGTTA TTACCAGCCG TTATTGCACA 2100 

AGGCGGGGCT GCTTTAGCTG TCTTTTTCCT AACAAAAAAT GTGAAACTAA AAGGTTTGGC 2160 

TTTGTCTTCA AGTATTACGA CTATTTTTGG AATTACTGAA CCAACTGTAT ATGGCGTGAC 2220 

TTTACCATTG AAAAAACCAT TTATTGCAGC TTGTATTGGT GGCGGTATCG GTGGTGCATT 2280 

TGTGGCTATG AATCACGTGA AAAACTTTAC GTTTGGCTTG GTTAGTATGT TGAGCTTGCC 2340 

TGGCTTTATT CCTGCAGAGA CAAAAGATAC TGCACCGATG ATTACTGGTG CAATTGGTGC 2400 

CGGAATTGCC TTTATCATTG CGTTTGTCTT AACGTTTGTC TTACGTTTTG AAGATCAACC 2460

TAATCCAGAA ACAGCAACTG AAAAAACAGA AACTGATAAG ATGGTGGCAC CTGTAAAAAC 2520 

GAATCAAGAA GACAAAATTA TTTTAGCAAG TCCACTTCAA GGTGAAATTT TACCGCTAGA 2580 

AAAAGTACAA GACCCTGTTT TTGCTTCAGG TGCTTTAGGA AAAGGTGTTG CAATTGAGCC 2640 

GACTGAAGGC AAACTGTATG CACCCGCAGA TGGTGAAATC ACCACATTAT TTCCGACAGG 2700 

ACATGCTGTT GGCTTGACGA CAACAGAGGG CGTTGAATTA TTAATGCATA TTGGCATGGA 2760 
TACGGTCGAA TTAGATGGTA AAGGCTTTGA ATTATCAGTG AAACAAGGTG ATTCTGTTAA 2820 
AAAAGGAGAT TTGCTAGTTA CTTTTGATAT TGCTGCCATT AAAGAAGCTG GTTATCCGGT 2880 
AGTTACACCG ATTGTGGTAA CGAATACGAA TGATTATTTA GATGTTTTGG ATATGAACCA 2940

AACAGACGTG TTACATGGTG AAGACTTTTT AGCCATTATT AAATAAAAGA CAATTGGAGG 3000 
AGGAACAAGT ATGACAACAA CGACAAAATT TCCAAAAGGA TTTTTATGGG GCGGTGCGAC 3060 
CGCAGCCAAT CAATTAGAAG GGGCTTACTT AGCCGATGGT AAAGGGTTAT CTGTGGCAGA 3120 
TGCGATGCCG GGAGGAAAAC AACGTTTCGC TATTTTAGGA GATGAAGCGT TTGATTGGAC 3180 
GATTGmCGAA ACGAAATATC GCTACCCTAA TCATACAGGA ATTGATCACT ATGATCGCTT 3240 
TAAAGAGGAC ATTGCGTTGT TTGCTAAAAT GGGCTTCAAA TGTTATCGCT TTTCAATTGC 3300 
GTGGTCGCGT ATTTTTCCAC AAGGAGATGA AACACAACCG AATGAAGCCG GCTTGAAATT 3360 
TTATGATGCG GTTATCGATG AATGTTTAAG CTATAACATT GAACCAGTAA TTACGATTTC 3420 
TCATTATGAA ATGCCACTGC ATTTAGCGAA AGAATACGGC GGGTGGAAAA ATCGTCAATT 3480 
AGTTGACTTT TATGAACGTT TTGCGGAAAC TGTTTTAACA CGTTACCACC AAAAAGTTAA 3540

CTATTGGATG ACATTCAATG AAATTAATTC AGCTTTTCAT TTCCCAGCGC TTAGTCAAGG 3600 
ATTGGTAAAA AGCAATGGAG CCAATGATTA TCAAAACATT TTCCAAGCAT GGCATAATCA 3660
ATTTGTTGCA AGTAGTAAAG CTGTAAAAAT TGGCCATGAA TTAAATCCAG AATTACAAAT 3720
TGGATGTATG ATTATTTATG CCACAACGTA TGGCATTGAT TCGAATCCAG TCAATCAAGT 3780
TGCTACAATG ATTGAAAACC AAGAGTTTAA TTATTATTGT ACAGATGTTC AAGTTCGTGG 3840
TGAATATCCA GCATACGCAG AaCGCATGTA TCAAAAATAT GCTGTGAAAG ACTTAGTTAT 3900

TGAAGAAGGA GATTTAGAAT TATTAAAAGA ATACCCAGTG GATTACATTG GCTTTAGTTA 3960 

TTATATGTCG ACTGCTGTCG ATGTTACTGG TACTACCAAC GATACAGCAA ATGGGAATCT 4020

TTTAGGCGGC GTTAAAAATC CCTTCTTAGA AGCAAGTGAA TGGGGTTGGC AAATTGATCC 4080

CGAAGGCTTA CGAATCGCCC TAAATGAATT ATATAATCGT TATCAAAAAC CTTTATTTAT 4140

TGTGGAAAAC GGCTTAGGTG CAATTGATAA AGTAGACGAA AATTTCTATG TGGCAGACGA 4200 

CTATCGGATT GATTATTTAC GTCGTCATAT TGAAGCAATG GCAGAAGCAG TTGCAGACGG 4260

CGTCGATTTA ATGGGCTATA CGCCGTGGGG CTGTATCGAT TTAGTCAGTG CTTCCACAGG 4320

TGAAATGAGT AAACGGTATG GGTTTATTTA CGTTGATTTA GACGATGAAG GAAATGGTAC 4380

TAAAAACCGT TATGAAAAAA AATCGTTTAA TTGGTACAAA CAAGTTATTG AAACTAATGG 4440

ACAAAACTTA GACTAGTTAT TGTAATTAAA CAAAACAAGA TAAGCGTAGT AGGTTGTTGC 4500

TTCTGTTTCC GCTGTTTATT AATTTTTAAG CTTGGACCAA AAATCTAAAG TGATTTTTGG 4560

CCCAAGCTCT TTTTTGTGAA TTTTTCAAAA TAGTGCACAA TACGTTATAA TAGAAACAAC 4620

TGAGAGGAGC GTTTCTAATG ATTAAGCAAT TACAACATCC ATTTTTTGTT ATTATGGGTC 4680

CGAGTGGTTC TGGCAAGACC GCAATTACAA GTAAGGTTTT TCCGAAAAAC TATAAAGTTA 4740 

TCTCGCATAC TACGAGGAAA AAACGATTGG ATGAAGTGGA TGGCGTGGAT TACTATTTTG 4800 

AAACAAAAGC AAGTTTTCAA GCCTTGATTG AAACCAATCA GTTAGTGGAG TATGATTTTT 4860

ACCATGGAAA CTATTACGGT GTCGGTGTAG CTGCCATCGT TGAAACAACG AAAGAGCATC 4920

CAGCCTATAA TGCGTTAACT TTTCCAGGCT TTCAAGCGGT CTTTGAGCGT TTCGGCGAGT 4980

CCGTCATTCC AGTTTTTTTT GACGTTTCAA AAGAAAATAT TTATCAACGT TTAAAACAAC 5040

GAGAATCTGA TCCTAAAATT ATTGAAGAAC GTTTAAACCT TTATGATCAA GAAATTCTTA 5100 

TCAAAAACCA ATTGGAACAA TATCCTAATT ATCAACGAAT AGATGCCAAT GGACCTATAA 5160 

AAGAGGTCGC TGGCTTGTTA CAAGAATGCA TCAAACACTA TTATTAATCA AGCAACTGGT 5220 

GAGACAGAAT CTAACATCAA TTTTGTCTTG CTAGGTGCTT GATTTTTTTT GTAAAAAACT 5280 

TGCATTCTGA ATAGAGAGTG CTATAATGAC ATGGTGTCAT ATGACAATGT GTCATAAAAG 5340

GGAGGAACAC AAATATGCCT ACGAATACGT TTTTTCATTT GCCAGAAGAA AAGCAACAAC 5400 

GCTTATTAGA TGCTGCGCAA ATTGAATTTT CGCGCCATTC TTTACAAGAA GCGTCAATTG 5460

CAAATATTGT TAAATTAGCG GGAATTCCGC GCGGTAGCTT TTATCAATAT TTTGAGAACA 5520 

AAGAAGATTT GTATTTTTAT TACTTTGCGA CATTAAGAAA AAATAGTGAA CGAGATTTAG 5580 

AGAAACAAAT TATTGCGGAA AATGGTGATT TAATCGAAGC CATGGATGTC TATTTCTCCA 5640

AAATGATTGT CGAAGTGCTA ACTGGaGAGA ATGCTTCTTT CTATCGGAAT TTATTTGTCA 5700 

ACATGGACTA TCGTGCtTCA cGTCGGGTGA CGGGATnAAT TTTAGCCGAC TGGGGGAAGA 5760






GGGAAAAAAA ATAGGGAAAC CAGCCACTGT CCATnAAGCC TAGAnGGGAG GAAAAAGGnC 5820 
CATGCCTGCC CCATGGCGGA 5840 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CGATCTTTTT GACGACGTTC AATTTTACGT TTTTTCTTAT TGCTGTCTTC AATTGCCCAA 60 

TTAATTTTTT TCTTGTATCC TGGTTTAATT TTTTTCTTTT TCTTTTTCAC TAAACCAATT 120 

AACGTTGGGT CTAATTCTTC TCGAGATTTT TCTCGTTTTG TTCGACGATT ACGATCATAC 180 

GTCTCTACAA TCTCACCATT TTTAATTTCT TTTGGTTTGA AAGAAACACC TAACTGCTCA 240 

ATCTGAGTAA tCGCTTCATC ATCTgCTGGT GAATATAATG TTATGGCAGT CCCATTcAAT 300 

CCATTACGTC CTGTGCGGCC AACCCGATGG ATAAAGAAGT CCAATTCATG TGGAACTTCT 360 

GCATTGATGA CATGTGAAAC CCCTTCAATA TCAATCCCAC GTGCGGCTAA ATCAGTAGCT 420

ACTACATATT GGTAGTCAAG ATTTTGAACT TGACGCATCA CACGTTTTCT TTCGCGTGGT 480
GTAATGTCTC CATGAATTTT GGCAACTTTT AGGCCCTGAT CTTTTAAGTA ATCGGTGATT 540

TCATCAACTC TTTGTTTAGT ATTTGCAAAA ACAATCGCTA AATAAGGATG CCCGATAGTC 600 

AATAATTGAT AAATTATTTT ATTGCTGTTT TTTCCTTTGG TGGAAATCAA CCAATTGTCA 660 

ATTGTTTCTG AAATAACTGC TTTTGGTTTG ATATGTTCAA TCACTGGATT TTCTAAATAT 720 

TTTTTTAAAA ATGGTCGTAA TTTTTCTGGA ATGGTTGCAG AAAAAACGAG CATTTGTAAT 780 

TTTTCTGGCA AACGACCCGC AATTTGATCG ACTTCTGCTA AAAAGCCCAT ATCTAATGTC 840

ATGTCTGCTT CATCGACAAC AAAGGCAAAA GCCGTATGCA CTTTCAAGGC TTGTTCATTC 900 
ATCATATCTA AAATACGCCC TGGTGTACCA ATCACCACAT GAGGTTGCTG ATGCTTTAAT 960 

TTATTTAATT GACGTTGTTT ATCTGTTCCG CCAACAAAAT TTGACACTCG AATTTCTGkT 1020 

TGACTGAAAC GAGCCAGTTG tGCGCTTCTT GATAAATTTG ATTGGCTAAT TCACGACTAG 1080 

GCGCAGTAAT CACAATTTGT ACTTCGTCGA TAGTTGGCTT AACTTTATCC ATTAATGGAA 1140

GTAAAAAAGT ATGCGTCTTT CCACTTCCTG TTTGCGATTG TCCAATAACA CTTTTCCCTT 1200 

TCTTAATGAT TGGGATTAAT TTTtCTTGGA CTTCTGTTGG TTCTTCAAAC CCTTTTTCTG 1260 

CGAGTGCTTC GTTGATAAAA GGTTGAAATT GAAATTGTTT AAATGAAGGC ACTTGCCTCA 1320 

CCTCTTCTCG TTAAATTCTT TATGCACTTT GGCATTATAA CACAAAAACA ACTTTTACTC 1380 

ACCTGCTGAT TCCTTTGTTT TCGCAACTTC TAATGTACTG ACTGTATAAG TGTGGCTGGC 1440

TTTTTCATAA AACAAACGCC GTTTTTTACG AATCAAGGTT AACAAAACGT TGATCAAAAA 1500 

TAGTAAATAT AAAATTAAAC AGAACAAATA CAGAAGCAGT GACACCATTT GCAACATCCT 1560

ATTAGAAGAA TTCAATAACT CAGTGGCATT GGCTAGATAA ATACTAATTA AACCATACAC 1620

AAAATACAAA TAACCGTAGC GAACAAACAA GGCTTTTAAA CGAATTTTTT CATGTCCTGT 1680

TTCAACCACC CGAATGCGAA CAACTTTTTT CCCTAACGTT TGACCATTTG TCAGTTTCAT 1740

CATGACGATA AAGACCGCAA ACACCATGAT AAAGAACCAC ATCCGATCTG AAAAAATCGG 1800

ATAATTGGTT GCCCAATTTT TATAAGCTGG AATGAAATTT AGTCCAAAAA CGAGCAGGCC 1860

TTGCACAAAG CCAATCACCA ACCAATCAAT TCCCCATGCA AAgAAaCGTC TTAACAGGCT 1920

AACTGTCTGC CCTTTTTCAT AAGAgGTGGC ATCAATTTCT TCTCTTGTTG GCAATAAAAA 1980

GGTAAAGAGC GGCGTCAACC AATAACCAAC TATGCCTCCG ACGGTATTAT TAATTAAATC 2040

ATTGACATCC GCCAAGCGAT ACGAACGTGG ATAAATAAAA TAAAGGCCAG ATAACTGTGT 2100

TAATTCAnAA AATAATGACA ATAAAAAACT CGCAAGAACT GTTTTTAAAA ATGAACATTT 2160

GAAATAATAC CGTAAATACA CACCAAAAGG TAAAACTAAT AAraACATTAA ACAGTGGTTC 2220

TAAAACGGCA TGCTGTTTTA ATGCTGGCAA ATACGTACTA GGATCTTTGA TCGAAAAAAC 2280

AGTTTCGTTT AAAATCCCTG AAATAAAATG AAAAGGCCGC AGTTCCATTG TTGGTCCTGT 2340

GTATTGAGCC ACTTCTGCTC GAGATGGCAA CGGTAAAATT ACCAAGAAAT AGGCACATAA 2400

TAAATAAAAA ACAAATGAAT ACAAAATCAT TGCTCGTGTC CAAACAAATG AACCATATTT 2460

GCGGTATTCA TAGATTAATA ATGCCCCAGA AATAAAAAAT GCTAGAAACG GGAAAATAAT 2520

GATTGCCCAT TTAATTGGTT CGCTATATAC TGCCATAAAA ATCCCTCCAA GATAAACAAT 2580

TTCAAGTATG AATGCCTCAA TAACACTATA CTTGTTTCTA AAAAAATGTA CCAACAATAA 2640

ATTTTATTTT GCAAATCCTC TCCTTTGCAG TATCATTAAG AAAAAGGAGG ACGTTCTCAT 2700

GTTACAGCCA TATTTTowGT TTGGAGTGCC GCTATTTTTA CTTGTGCTCT ACTTACTCTT 2760

TGCACTGATT CACCGTCAAA CAACGATTCA TTATTTACGG TTTATTTTGT TACTTATTTC 2820

AACATTTCTC ATGGTTTTTA GTTTCCAAGT ATTGCAAGAA TCATGGACCA TCAACCCTGA 2880

AACGTTAAAA GATGCCGCTT ATTCTCCGCA ATGGTTATGG ATTCCTTTAG GTATCGGGTT 2940

AATCCTCACA CTTTACAACG CTTGGCACGG CCTTCGTACA ATGATAAAAT ACAAGACCGA 3000

TAAACATTAG TAAGAGTCTA GGACGATGGT TGTGTCCTAG ACTCTTACTG TCTCATCATT 3060

CAAATGAAAT TTAATTTTTC TTAACAAACT CTGATTTTAA CTTCATTGGA CCAAAGCCAT 3120

CTACCTTACA ATCGATGTTA TGATCACCTT CCACTAAACG AATGTTTTTC ACTTTTGTGC 3180

CTTGTTTAAT TGCGCCACTG GCACCTTTAA CTTTCAAATC TTTGATTACA GTAACACTGT 3240

CACCATCCGC TAAAAGATTG CCATTTGAAT CCTTAACTAC TAAGCCTTCT TCTGCAACTT 3300

CTTCTGTTGG TGACCATTCA TGACCGCATT CTGGACAAAT AAATAATCCG CGATCTTCAT 3360

AGGCATATGC CGACCCACAT TCAGGGCAGT TTGGTAATTG CTCTGTCATT AACGTTCTCC 3420

TTTGTAAAAA ATAATTCTTT TTCATTTTAC CTTATTTTCT AGTGAATGAA AAGTAGTCCT 3480

AATGATTTAT CAACGTCCGA TAACTACTTT CTAATTACTT ATTTATTTTT GAGCCCACTC 3540

TTGGCAAGCT GCTTGGATTT CTTGGTAAAT CACTTCAATT TCAGCCGCAT CACGTGCATT 3600

AGCGCCACTT GCCAAAGGAT GGCCGCCGCC ATGATGTTTT TTAGCAATTG TATTAATGAT 3660 

TGGTCCTTTG GAACGTAACC GAACGCGATA ATAACCTTCC GGCTGTTCAA CAAAGATTGC 3720 

CCATGAGAGA ACATCTTCAA TTTTTCCTGG CATTGAAACA ATGGCTGCCG TTTCAGAATC 3780 

GACAATATCA TAACTGTTTA AAATACTTTG TGGTAAAACG ACACGAGCAG CGCCATTTGC 3840

GTCGATTTCT AAATTTTGAT AAATATAACC CGCTAATTTA GCAACCTTTA AAGGCATTTG 3900 

GTCTAACTCA CGATTTAAGT CTGCGGCATT AAAGTTAAAC GTACGCAATT GCGCACGACA 3960 

GCGAGtGTAT GCGCCGAaGT GGaAGGATAA AGAAAACGTC CTGTATCCCC CACAATCCCA 4020 

GCATACAATA AACGAGCTGC ATTATCTGTC ATCGTTAATT CTGCTAAATG CTGTgCCAAA 4080 

AATCAACAAT AATTTCACTA CAACTACTTG CATTTGTATT CACCCAAACT AAATCTCCAT 4140

AAGGTTCATC ATTCGGATGA TGATCGATTT TAATTAATTG ATCGGCTAAA GAAAAACGCG 4200 

CATCACTAAT TCGTGGAGCA TTTGCAGTAT CTGTTACAAT GACTAATGCA CCTCGATAGA 4260 

CATCGTCAGT AATAACGTCC ATTTCAGCTA AAAACTCTAA GCCTTCCACT GGTCCGCCTA 4320

CTTGATAaAT ATTTTTTyCA GGAAAGCTCG CTCTTAAGAG TTCTGCTAAC CCAACCTGTG 4380

AGCCAATTGC ATCTGGATCC GGCCTTTGAT GTCTATGAAT AATAATTGTT TCATATTGTT 4440

TAATTGCTGC CATAATTTCT TTTACTACGT CCATTCTTTG TAGCCTCCTA CGTTCTCTCC 4500

ATCACTTGGC AAACAACAAT TGCTTTTGCC ACAATGACAT TTTCTAAATA AACTTCAATA 4560

TCTAATTtTG CAGAGCGACG GCCAATTTCT AAAATCCGTG GACGAATGTC CAATTCACTT 4620

TCCAACTGGA TTAGCCGCAA ATAATGTAGA TTCACTTGTT CAATTAGTAC ATTTCTTTTT 4680

TGATTCATCA ACATTGTTCG TTGTGTCACG TTAGAAATAA TTTCACTTAG GACACCGAAC 4740

GAGATGGTAC CAACACTGTT AACCATTTGG GGGGCAACCG AAAAACGAAA ACGGGTTGTT 4800

TCTTTGCCAT TTTCTTCAAC AGGCATCACT TCCCCGGAAA TTTGATCTGA AATTGTATCT 4860

GCAATTTGTG GTTGTCGTTG CACCAATTGC ATAGCTTTCA TCACATCTTG GCGAGAAACA 4920

ATCCCAACAA GCGCTAAATC ATCTTCCACC ACCGGCATGA CTTCTAGACC ATCCCAAATC 4980

ATTTGATGAC TGACACTCGC TACACTCATC ATCTTTTTCA CAACATTGGG ATCCTTTGTC 5040

ATGACTTTAT CCACGGTTAA ATGTTCACTT TTACCAATGA CATCTTTGGC TGTGACAATC 5100 

CCCACTAAAC GTAATTTCTT ATTTACTACT GGGAAACGAG AATGATGGGT ACTTTCAGAT 5160 

AATGCCTGAT AATCTGCAAT CGTCTGAGTT GTCAATAAAT AATTGGtTTT TTCAAGCGCT 5220 

GTATAAATAT CACTGACGAG TAAAATGTCT TTTTTAATCA GCTGATCACT TAAAGCACGG 5280

TTAATCATAG TAGCCACAGT AAATGTATCA TATGTGGTTC TCAAGACAGG CATACCCAGA 5340

CGATCTGCTA AcTGCGCAAT CTCTTCAGAT GTTTCAAATC CACCTGTAAT TAAAACGGCT 5400 

GCCCCATTTT CTAAAGCAAG CTTTTGCACA CcTTGGCGGT TACCAACAAT CATTAAGGAT 5460

CCGGGAGTAA TATATCGCGT CATCGCGGTT TCTGTCATCG CACCAATTAC AAATTTATTA 5520
AGGACTTTGT CTAGACCAGA GcTACCGCCT AACACATCCC CTTCAATAAT CCGGACCACT 5580

TCACCAAATG TTAATTTTTC AATATGTTTT TTTAACTTCC GCTCAATTCG TATOGTACCA 5640

ACACGTTGAA TGGTGGAAAC AAGCCCAATA TTTTCTGCAT CTTTGATGGC CCGATACGCT 5700 

GTTCCTTCAC TAACACTTAA ACTTTTGGCA ATGCTTCGAA CAGAGATTCG ATCGCCAATT 5760

GGTAAGTCTT CAATATGTTT TAAAATTTGA TCATGTTTCG TCGCCATAAT TTCCTCCTAA 5820 

ACTAGGATCG TCTCTCCTGG GTTTAATACT TTCCCGCCAA CTACTTGCTC TACGAATATT 5880 

GCTGGATTTT GTTGGATAAC AGGAAAGGTA TTGTAATGAA TTGGAATGGT TATTTCGGGA 5940

TTTAAGTAAG CGATTGCTTC TAGCGCATCC TCTGGTCCCA TCGTATAATT ATCGCCAaTT 6000

GGTAAGAACG CTACATCAAT GGATTTGTCT TTTGCAAAAA GGCGCATATC TGAAAACAAC 6060 

GCCGTATCAC CAGCGTGATA AATTTTTTtA TCTTCTGCTT CTAAAATAAT CCCACTTGCT 6120 

TCGCCCATGT AGGTCATTAC GCCGTCTATT TCATAACCGG AACTATGCAA AGCTGGCACA 6180 

AATTTAACCG AACCAAAAGG AAAGACATAA CGCCCACCTA GATTCATTCC ATGCGCTTTT 6240

ACACCTTGCG ACTGCGCATA CATAGCCACT TCAGCTATCG CAATAATCGT TGCCTTGTTG 6300 

GCTTGAGCAA TTGCTAGCAT ATCGCCAATA TGATCGCTGT GTCCGTGGGT GATTAAGATG 6360 

TAATCAGGGT GTAAATCTGC AAGTGAAACG TcCGCTAAAG GGTTGCCATT AATAAAAGGA 6420 

TCAAATAAAA GGTTCGTGCC GTCGTTTAGA CGAATTTCAA TACATGAATG TCCATGTCCT 6480 

ATCAGTTCCA TTTGTATTCC TCCTACGGTA TTTTCTACAC AATCTTATGA TAATTAATCG 6540

kCAAATTCAT yCTaAAAGTA GkaCaGTTaG TACAGTTACC TTCCTTAATG 6590 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATAAAGAGGT AAATGAAGGA GAAATGCGAT GAAAAAAAGA AAGGCAAGAT ATTTGAAGTT 60

GGTATGTATT GGAATTTTAG GGATTTATGC GGGCGCCAGT TACTACAATT CGGAACCATT 120 

AATCGCTCAA GAAGAGACCG AAACGAGGGA CGAAAGTCGA CAGCTATCGC TGGAAGAAAA 180 

AAGAGCGGAC CAAGCCAATC AAGACCAAGG AACCCTTACT TCTTCAACGC TGAACGAAAC 240

AATAATAAGT TCGTCTGAAC AACAAGAAAG TTCAACACAA AATGACAGCA GTGACCATCA 300 

GGAAAATAAT GTGGAACCAT TAGAATCTGA AAATAGTTTA CCGCCACAAG AAAAGACAAT 360 

TGCTAGTACT AAACCAACCG AATCGTTACA AAATGAGCAA CCAGTATCTG AAGAGCGTAA 420

AGCGCTCACT GAGAAGTCAC CTCATAATCA GGAACCACAA AATTCTACAT TAACTGAAAA 480

GACCAAAGAG CCGTTGGTAC ATTCAGAGGT AGACAATAGT CAAGAAAAGG AAAAGCAAGC 540

AGGAATTCAA CAACAAGCCG AGGCAAATTT TGTTGTCCAA GAGAATGCTT CTACTTCTGA 600

GTTTATTAGA AAAATCGGTG AAGAAGCACG AGTTATTGGT CAGCGACATG ATCTATACGC 660

TTCTGTAATG ATAGCTCAAG CTATTTTAGA GAGTGGCTCG GGCAATAGTG CGTTAGCGGC 720

GGCGCCAAAT TATAATTTGT TTGGCATTAA AGGAGCTTAC CATGGTCAAA GTGTCAGCTT 780

TCCCACACAA GAAGATGATG GTAAAGGGAA GATGACCACC ATTCATGCCG ATTTTCGTCA 840

GTACCCATCC TATAAAGAGT CATTAACAGA CTATAGTAAA TTAATCATCA ATGGGCTGGC 900

GGGTAATCCA ACGTTTTATC ACGGCGTTTT GAAAGCAAAC ACGACGAATT ATCAGCAGGC 960

AACAAAATTT TTAACAGGTC GTTATGCAAC GGATACATAT TACGATAAAA AATTAAACGC 1020

GTTGATTGAA ACCTATCAAT TAACTGAATA CGATCAAGAA AAGAAGAAAC CAGTAGTGAC 1080

TAATTTGGCA GAAGAAAAGA AACCTTCGTT TGATAAGGAA GCAGTCAAAG AACAGCTAAA 1140

AAAAGAAGCA GTCGTCTATG AAGTAAGCAA AGGAGATTCT CTCGCCACCA TTAGTCAGAC 1200

GTTTGGTGTG TCAGCAACAG CCATCTTAAA ACAAAATTCA AAGACTCAAG AAATGTTTTA 1260

TATTGGCCAA AAGATCACGA TTCCTCAACA TACAGCGGCG ACTTCTCTAG AACCGAAAGA 1320

ACAAGCGTTA CTTCAATCAC TAATAATCAG TAAATCGGTG ACTAATGCTT TGAAGACAGC 1380

AGAGCAATCA AATGGTAAAG CGAATGACTC AAAAACAGCG TCTACTGAGT ATTATGAAGT 1440

AAAAAGAGGA GAGACACTCG CAAAGATTGC TAAAAAAACG GGCTATTCCT TAACCGCCCT 1500

CAAGCAGGCA AATGACTTAA CGTATTCTGT ATTAACCGAA GaCAAACGAT AGCACTTCCT 1560

AAGCTAAAGG ACTAATGCTG ATTTAACTTA TCGGTCGATG GAGTCAATAT ATTATTAGgG 1620

TAAGAGAAGA GGAGAAGTTC ATCCATGGAT GAACTTCTCC TCTTCTCTTA CTATGTTTTT 1680

CATAAATACC TTAATAAATA GTAAAATTAG CAAGAGAATT GTGTCTTTAC ACAAAATAGA 1740

TAGGTTTTCT TGCCGAAGTT TGATAAAATG ATTAGTTGAA AGGATGCGTG AAAATTGAAA 1800

GAATTATTTC GTAAATACCG AGAAGTACTG GCTTATTTAT TTTTTGGTGG TGCAACGACG 1860

GTCGTCAACT TAGTTGTGTT TTTTGTTTGT CAAAACGTGT TAGGGCTGGA TTACAAGATA 1920

AGTAACGCAA TTAGTTGGTT TTTATCTGTT CTATTTGCCT TTTTCACGAA TAAATATTGG 1980

GTATTCGCAA GTAAGCATGA AAGCATAGCT GGTTTTTTTA AAGAAATGGG CTTATTTTAT 2040

TGGTACCGAA TTTTATCTTT TGTGGCTGAT ATGGGTTTAA TGATTCTCCT AATTGATGGC 2100

ATTCATTTTT CATCGTTTTG GGCGAAAATG ATTACACAAG TTGTAGTAGT GATTTTAAAT 2160

TATTTCTTCA GTAAATTTTn CATTTTTaAA GAAAAAGAGG tCTAAGCAGA TTTtAGGCTA 2220

AAAAAAATTT TTTTGTTATA ATATAAACAA GTTTTTAATA CAAAGAATAG AGGTAGAAAa 2280

CGATGGACTA TACATATGCA GATGATAGTT TGACATTGCA TACAGATATG TATCAAATCA 2340

ATATGATGCA AACATACTGG GAATTAGGGA GAGCAGATTT ACATGCTGTT TTTGAATGTT 2400

ACTTTAGAGA GATGCCCTTT AATCATGGct ACGCAATTTT TGCAGGTTTA GAACGTTTAG 2460

TCAATTATCT CGAAAATTTA ACGTTCACAG AATCTGATAT TGCATATCTC CGAGAAGTTG 2520

AAGAATATCC AGAAGATTTC TTGACGTATT TAGCAAATTT TGAATTTAAA TGTACTGTAC 2580

GCTCTGCTTT AGAAGGGGAT TTGGTTTTTA ATAACGAACC GTTAATTCAA ATTGAAGGTC 2640

CTTTGGCACA ATGTCAATTG GTAGAAACTG CCTTGCTAAA TATGGTGAAT TTCCAAACAT 2700 

TAATTGCGAC AAAAGCGGCT CGAATTAAAT CCGTGATAGG CGATGATCCT TTATTGGAAT 2760

TTGGAACACG CCGGGCACAA GAATTAGATG CTGCCATTTG GGGCACAAGA GCCGCCTATA 2820 

TTGGTGGGGC GGACGCGACA AGTAATGTTC GTGCTGGTAA AATTTTTGGT ATTCCGGTTA 2880 

GTGGGACTCA CGCCCATTCA TTGGTTCAGT CGTATGGAAA TGATTATGAA GCATTCATGG 2940

CGTATGCCAA AACACATCGG GATTGTGTTT TCCtTGTTGA TACGTATGAT ACTTTGAAAG 3000 

CGGGAGTGCC AAGCGCAATT CGAGTAGCTC GAGAAATGGG GGATAAAATT AATTTCCTAG 3060 

GTGTGCGGAT TGATAGTGGC GATATGGCTT ATATTTCTAA ACGAGTTCGT GAACAACTGG 3120 
ACGAAGCAGG CTTTACTGAA GCAAAAATTT ATGCATCGAA TGATTTAGAT GAAAATACGA 3180

TTTTGAATCT AAAAATGCAA AAATCAAAAA TTGATGTATG GGGCGTGGGT ACCAAGCTGA 3240

TTACCGCCTA TGATCAACCG GCTTTAGGAG CGGTGTTTAA ATTAGTTTCA ATTGAAGGTG 3300 

AAGATGGTCA AATGAAAGAT ACCATTAAAC TATCTAGCAA TGCTGAAAAA GTGACAACAC 3360 

CAGGAAAAAA ACAAGTCTGG CGCATTACAC GTAAATCTGA TAAAAAGTCA GAAGGGGACT 3420 

ATGTGACACT TTGGAACGAA GATCCTCGCC AAGAAGAAGA AATTTATATG TTCCATCCCG 3480

TTCATACCTT TATTAATAAG TATGTCCGTG ATTTTGAAGC ACGGCCAGTA CTCCAAGATA 3540 

TTTTCGTTGA AGGAAAACGT GTGTACGAAT TACCAACTTT AGATGAAATT AAACAATATG 3600 

CAAAAGAAAA TTTAGATTCA CTTTGGGAAG AATACAAGCG GGACTTGAAT CCGCAGAAAT 3660 

ATCCaGTGGA CCTTTCAACA GATTGTTGGA ACCACAAAAT GAATTTATTG GAAAAAGTGC 3720

GTAAAGACGT TAAACATTTA ACGGAAACGG TGAATAAGGA GGCCTAGAAA TGACAACACT 3780 

TCAAGAAAAA ATCATTCAAG AGTTAGGTGT GTTACCGACA ATTGATCCAA AAGAAGAAGT 3840

TCGTAAAAGT ATTGATTTTC TGAAAGCTTA TTTAACTAAA CATCCTTTTT TAAAAACCTT 3900 

TGTCTTGGGA ATTAGTGGCG GACAAGATTC AACATTAGCT GGGCGCTTAG CACAATTAGC 3960 

GATGACTGAA ATGCGCGAAG AAACAGGAGA TATGTCGTAT CAATTTATTG CGATTCGCCT 4020 

TCCTTACGGT GAGCAAGCAG ATGAAGCTGA TGCACAAGCC GCGCTAGCCT TTATCCAGCC 4080 

AGATGTTTCT TTACGAGTGG ATATTAAACC AGCAGTGGAT GCAATGGTTG GTTCGCTAGA 4140

AAACGCTGGC GTTCAAATTA GCGATTTTAA TAAAGGGAAC ATGAAAGCAC GTCAACGTAT 4200 

GATTACACAA TATGCAGTAG CTGGCGAAAA TGCTGGCGCA GTAATTGGCA CAGACCACGC 4260

AGCCGAAAAT GTGACAGCTT TCTTCACCAA ATATGGCGAT GGCGGAgcGG ATATTTTACC 4320

GTTATTTCGT TTAAATAAGC GCCAAGGAAA AGCTTTGCTG AAAGAACTTG GGGCACCAGA 4380

AGCCtTGTAT CTAAAAATAC CAACAGCTGA TCTAGAAGAT GACAAACCGT TAGTTGCAGA 4440

TGAAgTTGCC TTAkGTGTCA CGTATGATGC CATTGATGAC TATTTGGAAG GCAAAAAAGT 4500

CTCGGAAACA GATCAACAAA CAATTGAAAA CTGGTATAAA AAAGGACAAC ACAAACGGCA 4560

TTTACCCATT ACAATTTTTG ATGATTTTTG GAAATAAAAA CACAGACCAT CGGCGAATGA 4620

ACGATGGTCT GTGTTTTTTT ACATAACTAC TTTTTAGAAA GTAAACGTTG CTACAGCAAC 4680

TATCCCATGA TAATGAAAGT CGTTTTCATT TAGGCGTTTT CACTTTCTTT TTCAAAACAG 4740

AAAAGGTACG CTCAATAATG TAAAGTGAAT GATGGGAGAG AAAAAGAGAT GAAGCATGTA 4800

ACAAAATtGG GGATTACAAT tATAACAGGA GTTTTGGCAT TATTATTTGA ATTTATTTTA 4860

CATCAGCCGA ATTGGGCGTA TGGCATTATT TTAATAACAG GTTCTGTAAT GGCGTTAATG 4920

ATGTTCTGGG AAATGATTCA AACCTTACGT GAAGGAAAAT ATGGTGTCGA TATTTTAGCG 4980

ATTACCGCTA TCGTTGCAAC CTTAGCTGTG GGAGAATACT GGGCCAGTTT GATGATTTTA 5040

ATTATGTTGA CTGGTGGTGA TTCATTAGAA GACTATGCCG CTGGAAAAGC TAACCAAGAG 5100

CTGAAGTCAT TATTGGATAA CTCGCCACAA AAAGCTCATC GCTTGAATGG CGAAAATTTA 5160

GAAGATGTTT CTGTTGAGGA AATCAATGTT GGCGATGAAT TAGTAGTAAA ACCAGGGGAA 5220

CTAGTTCCAG TTGATGGCTT GGTAAAAACC GGGACATCAA CAGTCGATGA ATCTTCATTA 5280

ACAGGAGAAT CAAAACCAAT TGAAAAAAAT CCTGGGGATG AATTAATGTC GGGTTCCGTG 5340

AATGGTGACG GCTCTTTGAA AATGGTTGCT GAAAAAACTG TAGCAGACAG TCAATATCAA 5400

ACAATTGTGA ACTTAGTGAA AGAATCTGCG GCGCGTCCAG CTCATTTTGT ACGTTTAGCA 5460

GATCGCTATG CGGTACCTTT TACACTAGTT GCCTACCTAA TTGCAGGTGT TGCTTGGTTT 5520

GTTTCAAAAA GTCCGACACG TTTTGCGGAA GTCTTAGTTG TTGcTTCGCC GTGTCCTTTA 5580

ATTCTATCTG CCCCAATTGC TTTAGTGGCA GGGATGGGTC GTTCAAGTCG TCATGGGGTC 5640

GTTATTAAAT CGGGAACGAT GGTCGAAAAA TTAGCTTCTG CAAAAACGAT TGCGTTTGAT 5700

AAAACAGGCA CGATTACGCA AGGACAACTT TCTGTTGATC AAGTCCAACC AATCAATGCT 5760

GGAATAACTG CTGCTGAATT AGTGGGATTG GCAGCAAGCG TGGAACAAGA ATCAAGTCAT 5820

ATTTTAGCTA GATCAATTGT TGCTTATGCC AGAAAGCAAG ATGTCCCATT AAAAAATATT 5880

ACAGATCTAG CGGAAGTTTC TGGTGCTGGC GTGAAGGCAT TTGTGGATGG TGCTGAGATA 5940

CGGGTAGGTA AAAAGAATTT TGTGACACAA GAGTCTCAAG AAACTGAAAA AATTGATAAA 6000

ACGACTATTC ATATTTCACG TAATGGCACA TATTTAGGCC GAATTACTTT TACAGACACT 6060

GTACGCCCAG AAGCAAAAGA GACTATGGAA AAATTACACC AATTACATCT TCAACGAATT 6120

TTAATGCTGA CGGGGGATCA AGAATCCGTT GCAGAAACGA TTGCTGCAGA AGTAGGAATT 6180

ACCGAAGTAC ATGGGGAATG TTTACCACAA GATAAATTAA CTATTCTAAA AGAATTGCCT 6240

AAAGAAAATC ATCCAGTCAT CATGGTAGGA GATGGTGTAA ATGATGCACC TTCGCTTGCT 6300

GCTGCAGACG TAGGTATTGC TATGGGTGCT CATGGAGCTA CTGCGGCTAG TGAAACTGCT 6360

GACGTTGTTA TTTTAAAAGA TGACTTAAGT AAAGTCAGCC AAGCGGTCGA AATTGCCCAA 6420

GATACCATGA AAATTGCCAA ACAATCTGTA TTAATCGGAA TTTTTATCTG CGTTTTACTA 6480

ATGTTAATTG CTAGTACCGG GATCATTCCG GCGCTAATCG GGGCTATGCT ACAAGAAGTC 6540

GTGGACACTG TGTCAATCTT ATCTGCTTTG CGTGCTCGTC GAATTGGCCA GTAAGGTGGA 6600

ACAATTAATT GCCTGAATGT CCAGCTTAGT GGTAAAwTAA AGGTACGTTT TTAATATTAA 6660

TAAGTTaCAG GGGAGAGATT GTGCAATGAA AGGAAACCAA ATAGATGGGC AACCAGTCAC 6720

GGAAGAACAA CTATTAAAAG AAGGCTACCG TAAATATACT GGGGAAGGAA TTGATATCTT 6780

TTATAATAAA GATATTT'JTG AACATATTGG TAATTGTGTT CGTGGAAATC CAGAAGTCTT 6840

TGAAGTTGGT CGGAAACCAT GGATTATTCC AGATAACGGA AGTGTTGAAA ATGACATGAT 6900

TGTGGTAGAT TCATGTCCAA GTGGTGCTCT AAAATATATT CGGAAAGCAG GGAACGAATA 6960

TGCAAATTAA AGAAGAAAAC AATCGTTTTG CTCTATATAA CGATGAAAAC CAAGAAATTG 7020

GTGAAATGAC TTGGTCAGAT GCTGGCGAAA GCATGATGAT TATTGATCAT ACGTTTGTTG 7080

ATCCAACTTA TCGTGGACAA AAATTAGCTG AAAAACTTGT TTTAGCCGGT GTTGAAAAAG 7140

CGAGAAAAGA GCATAAAAAA ATTATTCCGC TATGTCCATT TGCCAAAAAA GAATTTGATA 7200

CTAAACCTGA ATACGGAGAC GTATTGAGAA AATAAAGAAG AAACTacAAC AGCCGCAAGT 7260

AGAAATTTGA TTCCTACTTG CGGCTGTTGC TATTAGAGAC AAATAAGGAA AGACATCAAA 7320

TTTCACTGGT GTTTACAGAT AAAATATGCT ACATTAAAGT AGATGAATTT CGCATATTTG 7380

TATAAAAAGG AGAATTAACG CATGTATAAT CTAATCTTAA CGtTGGTCAT CATTTTATCT 7440

GTTTTAATCG TCATTACTGT TATGAnGCAA CCAAGCAAAC AAAACAGTGC AGCAAGTGCC 7500

TTCACTGGCG GTGCAGATAA ATTATTTGGG AAGCAAAAAG CGCGCGGTTT TGAAGCTGTT 7560

ATGCAACGCT CAACAGCAGT TCTTGGCGCA ATTTGGATGA TTTTATTATT TGTTTTGGCG 7620

TTCTTATCTT CAAAATAAGT ACGTAATCTT TTAAAAT 7657

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7434 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

TGTCAATGAA ATTTTCTTGA CAGCCCTTTC TTATTTCCTG TTTTTCTTTC GTTTTTTTTA 60

TAATTGTTTT TCTGTTGAAG ATAGCTTAGC GCTTATTTAA TTAAATAACA AAATTTTTGT 120

GTGAAATAAC ATAGATAATT TAAAACTAAG ACATAATTTT ACTATTTGCG CCCAACTCTT 180

TATTTACTGA ATACGTGTTT TCTGAACATT TCGTACTACT GTCTTCCTTG TAAAAAAAGC 240

TGAGACAAAC TGACGAATCG GTTTGTCCCA GCTTTGCGTC CGAGTACATA TTGCTTAAAA 300

TCAAATGATT GAATTAATTA GACTTAGATA GCTGTTGtTG CTCCGCGGtA AACAATACCA 360

CGACGAGGaT CaACAGTAAT CACTTCGTCA TTATTAATTA ATGAkGTkGC ATCTGCAGCA 420

CCTACGATAA CTGGAATATT TkGCGCAATC GCtACaACAG CTGCATGAGA AGTTAAACCA 480

CCTTCTTCAA CAACTAATGC TGATGCTTTT TCAATTGCTG GCATGTATTC TTTATCTGTT 540

GTTIXIGTAA CTAAAATAGC TCCTTCAGTC GCTTTAGCAA CTGCTTCTTC AGCAGTTGCT 600

GCCACAACGG CTTTTGCAAT AATTGCTTCT TCGCCGACAC CTTGACCTTG AACTAATTTA 660 

GAACCGATCA TTTGAATTTT CATTAAGTTT GTTGTCCCTT TTTCGCCAAC TGGAACACCA 720 

GCAGTGATAA TGATTAAATC ACCTTCTGAA GCATAACCTT CTTCTTGTGA TACTTTTGAA 780 
GCTAAATTAA ACATTtCATC TGTGCTTGAT GGkTTGTCAG CAACTGTTGC GTAAACACCC 840 
CATGATAATG ACAAGCTACG TGCTTTTTGT TCTGAGAACG TAATTGCAAC GATATGTGCT 900 
TnTGGACGGT ATTTAGAGAT CATACGTGCT GTATGGCCTG ATTCTGTTGC CGCAACGATT 960 

GTTTGGATAC CCAAGTTACG TGCTGTATGG CCAACAGATT GTCCGATTGC TTCAGTCATA 1020 

TCTGTTTTGC TATATAATTT CAATGCGAAT GAATCTTGGT TAACTAATGT TTCTTCTGTA 1080 

CGAACTGCAA TACGAGCCAT TGTTTGAACA GCTTCTAATG GGTAATCCCC TGCTGCAGTT 1140 

TCGCCTGATA ACATTACTGC ATCTGTTCCA TCGTAGATTG CGTTTGCTAC GTCATTTGCT 1200 

TCCGCACGTG TTGGACGTGG GTTACGTTGC ATTGAATCTA ACATTTGTGT TGCAGTGATA 1260 

ACTGGTTTAC CTAAAGCATT ACATTTTTTA ATTAAAGCTT TTTGGACAAC TGGAACATCT 1320 

TCTGTTGGAA TTTCAACACC CATGTCACCA CGAGCAACCA TTAAACCATC AGAAACTTTT 1380 

AAGATTTCAT CGATATTATC GATACCTTCT TGGTTTTCGA TTTTAGGAAT GATTTGGATG 1440 

TGTGTTGCGT TTTCTTCTTC TAAAATTTTA GTGATTTCTA AAACGTCAGA AGCACGACGA 1500 

ACGAAGCTAG CTGCGATAAA GTCAATTCCT tGACCAATTC CGAAACGGAT ATCGTTAGCA 1560 

TCTTTTTCAG TGATCCCTGG TAAGTTTACA GAAACGCCTG GTACGTTAAC GCCTTTTTTG 1620 

TTTTTCAAGA CGCCTCGTTT TTAACAACTG TTACGATTTC GTTTGCATCA CGGTCGATAT 1680 

CAGTTACTTC TAAATCGATT AAACCATCAT CTAAAAGAAT GTGTGAACtA CGTTTACATC 1740 

ATTGATTAAT TCTGGATAAG TGATTGAGAA TTTTTCATTT GTTCCTAAAA CTTCTGTCAT 1800 
AGAAATACGT ACTGAGTCAC CAATTTTCAT TGTGATTGCG CCATTTTCCA TGTCATTTGT 1860 
GCGGATTTCA GGACCTTTTG TATCTAGTAA GATCGCTACG CGTTTCCCTG TAATTTTTAC 1920 
AGCTTCACGA ATGTTTTTAA TACGAGCACC ATGCTCTTCG TAGTCACCAT GTGAAAAGTT 1980 
CAAACGGCAA ACATTCATAC CTGCATTAAT TAAGTTTACT AGCATATCAA CGGATTCACT 2040 
AGCTGGTCCA ATCGTACATA CGATTTTCGt TTTTTTCATT ACTAAAACGC TCCTATTCCA 2100 
AATTTGTTTA TACAAAGCTA ATAAGATTAG CTAATATACA AGATTAAAAA GAAATTTCAT 2160 
GATTTAACTC ATAAAGAGAT AAATCTGGTT TATGTTTATT GTTTTCTAAC GTGTCAATAA 2220 
TATCAGCTGC AACTACTTGG TTATCTAACA TACCGATACA TAAACCGCCT TTGCCTTCTT 2280 
TTAGAAGTTC AACAGCATAT GACCCAAATT TACTTGCTAA TACACGGTCA CGAGCACTTG 2340

GCGAGCCCCC ACGAACCACG TGTCCTAAAA TTGATACACG CGTATGGAAA TCACCGTATT 2400 
CAGAAAGTTT ATCAGCAAAT TCATTTCCGC CCATTACGCC TTCTGCTAAA ATGATTAAGC 2460
AATGTTTTTT CCCGCGGTCG CGGCCTTCTT GAATACGTTT GGCAACGTTT TTCATATCAA 2520

AATCATGTTC AGGAATAATG ATTTCATCTG CTCCGCCTGC CACGCCTGAC CACAATGCGA 2580 

TATCACCAGC GTTACGTCCC ATTACTTcGA TAACGAATGT ACGAACATGA GAAGTCGCTG 2640

TATCACGAAT TCGGTCAATT GATTCCAAAA CAGTGTTAAT CGCTGTATCA AAACCGATTG 2700 

TGAAATCTGT TCCAGGAATA TCATTATCAA TCGTTCCAGG AATTCCAACA GCGGGGAAAC 2760

CACGTTTTGT CAAAGCCATT GCGCCATGAT AAGAACCGTC TCCACCAATA ACAACTAAGC 2820 

CTTCAATACC GAATTTTTTC AATTGTTCGA TTCCTTTTAA TTGCCCTTCT TCAGTAGCAA 2880 

ATTCAGGATA ACGAGCAGAA TACAAGAATG TTCCACCACG TTGAATCTTA TCGCCAACGT 2940

CAGCAACATC TAAACGACGG ATGTCGCCTG CGACTAAACC GGCAAATCCA TAGTTAATAC 3000 

CGTAAACTTC GATTCCATCA AAAATACTTT TGCGTACGAC TGCACGAATT GCAGCGTTCA 3060 

TTCCAGGGGC ATCTCCCCCG CTGGTTAAAA TACCAATGCG TTTCATTCGC TATTTCACCT 3120 

CATATAATAT ACTTTCAACT GCCAAAAGAA ACCTTTTCGC AAAAATTGAT ATACAAAAAT 3180 

ATTACGTCAT TCATTTTACC ATTAAAGAAC CAAAATGTCT TGAGTTAAGA AGAAAAAAAT 3240 

GAAAAAAGTT GAAAACAACT AGTATTTTTT ACCGTAAATC ATTTGAAAAT GACGTTTTGC 3300 

TCGCCTAAAA CGTAAGCTAA TTGTTGCTGA CTAGCGCTTG TATGAGCGAC CCAATTTTCT 3360 
TCATTTAATA CTATTTTCTT ACCTGTTTTT TCAAAATAGA CAATCACTGG AATGTGTCCA 3420

GCGTGACGAC TGAGTACCTC TTTCATTTTC TGAAAAACGT CCGTTTGTTC GACATCTTTT 3480 

GTAATTCTGA TAAAACACGT TTCGTCACTA ATTGATGTTT CTACTTGTTG CGCTTTTTCT 3540 

ATTCGTTGAA CCAATATTTG CAATTCTTGA TTGTACGTAC TGCGTTCCAC TTTTCCTTCC 3600 

ACATAATAAA CTTGCCCTTT CTCGACACTC TGTCTTAACT GGCGAAAAAC GGTTGGGaAC 3660 

AACGtTAGTG AAATACTTCC TGTTGtATCA TCCCCATCAA CAAAGGCCAT TTGTTCTCCC 3720

TTTTTGGtCC GAATCGTCCG AATGTTTTTG ACATAAAyCA ATAATTTCgT GGGcTGATTT 3780 

TCAACAaGTT CATGAACAAG TTGCGTTTGC TTGGCTAAGC GTGTTTTTTT GAACTCTTCT 3840 

GTCGGATGTC CTGATAAATA GACCCCTAAA TATTGTTCTT CTTGTTCTAG TCGCTCTTCT 3900 

AGCGGATAAT CGGCAATTTC TTCTTCCTTG GGCTTCAACG TTTCTAACAA ATCCATACTA 3960 

CCACCGCTGA GAAGAATACT TTGGATATTA CTCTCTAAAC CTACCGCTAA TTGACGGCGA 4020

TTAGGCGCAA TTTCGTCAAA TGCACCGATG GCAATCAATG GTTCGATATT TTCAACTTTC 4080 

AACCACTTAC GGTCAATTCG CAAAAGAAAC TCATCGAACG TTTTGTACGG TCCATTTTCT 4140 

CGGCGTTCAT TTAAAATATT TTGAATGAAG TCTCGGCGGA TTCCTTTTAA AGAACTAAAA 4200

CCAAATAAAA TTTGCTGTTG CTGATTTAGG TAAAAGCTGT AGTGACTTTG GTTAATTGAC 4260

GGCTGCAAAA TCGTCTGCTT GTTTTTTCTA GcTTCGCCAA TATATTCTTT AATTTTTGTT 4320 

GGATTATGAC GAACAGAATG CAACAATGCG ACATAAAAAG CACTCGGATA ATGAACCTTC 4380

AAGTAAGCCA TTTGAAAACC AATAAAGGAA TAGGCAAATG CATGCGAGCG GTTAAAGCCA 4440
TAATTAGCAA AGCGCTCTAT ATAATCGTAC ACTTGGTTAG CTGTTTCTTG GGGATAGCCT 4500

TGTTGTAACG CACCATTAAC AAAATGATTT CGTTCTTCAT CTAGCACATC TTTTTTCTTT 4560

TTACTAATGG CCCGACGTAA AATGTCAGCT TGCCCTAAAG AGAAGCCTGC CATTTTCGCA 4620

GCAACTTGCA TAATCTGCTC TTGGTAAACA ATAATTCCGT ACGTATTTTC CAAAATGGGG 4680

GCGAGACTAG GCTCTGGATA ACTAATTTGT TCCAAGCCTT TTTTCCGGCG GATGaAAACA 4740

TCAATATTTT GCATCGGACC AGGACGATAC AAGGCATTGA CAGCAGCAAT ATCTTCAATG 4800

CTAGTTGGCC CTAATTTTCT TAATACATTC CGAATTCCAG CAGATTCAAA CTGGAAAACA 4860

CCACTTGTTT CCCCTTTTCT AAATAAAGCC AGCGTTGTTT CGTCATCTAA TGGAATCTGA 4920

TTTAAACGAA TGGTTCGATT ATAGACGCGT TTAACAGCTG TGAGGGTATC ATCawkGATG 4980 

GATAAATTTC tTAAGCCCAa GaAATCCATT TTCAGAAGAC CAATTTTTTC AACATCATTC 5040 

ATAGTAAATT GGGTCAATAA AATTTCATTC GATCCTGGTT GTAACGGAAC CAAATTCAAA 5100 

AGATTTTCAT CaCTAATTAC CACACCTGCT GCGTGCGTAG AAACATGGCG AGGCAATCCT 5160 

TCTAGCTGAA CAGCTGTTTT ATAGAGCAAT TGATTATTTG GTGAAAAATT GACCAATTCT 5220 

TGCATCCgTT TTGACTCTTG GTATGCTTCT TCTAACGTAA TTTTCAGCTT GTTGGGGACA 5280 

GCTGCCGACC AACGATTCGC TTCACTCTGG GACAACCCGA AAACACGTGC AAcGTCCCGT 5340 

AAAACCATTT TCGCAGCCAT CGTACCAAAC GTGGCaATTy sCGCCATGTG ATAATGACCA 5400

TATTTTTGAC TGACATAAGC TAAAACTTCC TCCCGCCGAT TATCAGGAAT ATCTAAATCA 5460

ATATCGGGCA TCGTGTAGCG TTCCGGATTC AAAAAACGTT CAAAGAGCAA ATCATATTTG 5520

ATTGGATCTA CATCTGTAAT TGATAGCACA TAGGACACTA AAGAGCCTGC AGCTGAACCT 5580

CGGCCAGCGC CTGTAACAAT TTGATTGTTA TGACAGAAGT CCATCACATC CCAAACGATT 5640

AAAAAGTAAT CATCAAAGCC CATATTGTGA ATGATGGTTA ACTCTT-TTTC TAATCGTTTT 5700

TGGTAAACTT CATTTAGATG AGGCACTCTT TCTGGTAATT TTTCTAAACA CAATGTGCGT 5760

AAATAAGAAC CAGCCGTTTG ATTATCAGGC ACAGGAAAAT GCGGTAATAA TTTTTGATGT 5820 

AAAGGAATGT CAACTCGGCA ATTTTCAGCT AGCTGAATCG TATTCTGGAC CACTTCTGCA 5880 

CCAGCTATTT GAAGCATTTG TTCCATCGTC GTCTCTGCTT CACGCAAGTA ATTCAGACCT 5940

TCTATTTCCG GTGTTTGGCT TGTAATATTC AATTGATTAT TTTCTTTGAT GCTCTCAAGA 6000 

ACAGTTACGG CAAAGCCTTC TTCCCTTGTT AAATGACGTA CCTCTTGCAA AGCCACGCCA 6060 

GGTAATTGTT CTCTTTGGTA AAAAGAAAAG AGCGCTGGAT TACGTTGTTG ATTCATTGAA 6120 

AAAGAATGTT CAACAAATAA CGACTGGGGA TCAAAATATG AAGCAAGTTG TTGGAAAGCG 6180 

TGAACCGCTT GTTGTTCATC TTCAAAAAAT GTTTGGGCGA CTTCTCCTTT AGCTGACAAA 6240

GTCACGGCAA ATAAATCACT TAATAAGGAT TGATGTTCCG CTAAGTAAAA GGGACCCTCT 6300 

GTCGTCATTT TGCTACTAGA TAGTTGCATT AATTGCTGAT ACCCTTTTAA ATTTTTAGCA 6360 

AATAAAAGCA ACTCGGAATG CCCTTCTTTT TCTGCTGGGG TATATTCTAA TCTTAAACCG 6420
ATAATTGGTT GAATTCCCGC TTCTTGGCAT GCTTCATAAA ATTCAACCGC CCCATGCAAG 6480

ACATTGATAT CGGTAATCGC TAATGTTTGA TAACCTAACg CTTTCGCTTG TTTTACATAG 6540

GAACGAATCC TAACCGTACT AGATAAAAGC GAATAGGAAG TAACGGTATG CAATTGTGGA 6600

AATGACATTG GACAGCTCCT TTTTGAATCC TTGCTATTTT TTCCTCTTCT CCTATTTTCC 6660

TATCGCGGAT TCTTTACTTT TAAAGAAAAT ATGCTAAACT AAGAGCTAGA AAAGAGGTGC 6720

TCATATGCGT TATCTTGTTG TATTTTTCTG GACCTTCGCA TTAGGCCAAG TCGTTGGATA 6780

TATCGGCGGT GCATTATCTG GTGGTTCATA CGACTTTAAA ATGACAACGA TTCTATCGTT 6840

GGCAACAGCC GTTATTATTT TATTAATTGC aCACTTTGCA GTACCGAAAG AAACCAAACC 6900

TGTTAAATAA GAAAAAACCA GTTACTAAAG TTATTGTAAC TGGTTTTTTT ATCTTTTCAA 6960

CTGTTTAGAG CTTTTTATTT AAAACTTTTC GGAAATTAAA GAAGGACAGC CCAATGTTTC 7020

CTAAAACGAA ACAAGCGGTT ACCAGAAAAA CAGAAGAATA GCCAAAACCA TGGGCAACTG 7080

TCGACCCTAC CATCGGTCCT AAAACTTGAC CGAAATTACT ACACATCTGA TTGTAACTAT 7140

AAACTCTACT TACGCCCTCT GTTGGCGTTA TTTTACTAAT TAACGTATTA ATTGATGGCA 7200

TTAAAGCACC TGTTGAAAAA CCTAAAAGGA ATCGTAAAAC ACCTAACTGA AAAGGTGTTT 7260

GCACAAACGC CATGGGAATG TAACAAATAA AAGAAAGAAT TAATCCGCCT AATAAAACTT 7320

TTTGGTTCCC AATGCGGTCC CCAATTTTTC CCAAAGTTGG AGAAGAAATA ATTnnTGAAA 7380

CGCCGGCAAT CGAAACAATC AAACCACTAA CAAATAATAC ATTCTCTGTA TCGC 7434

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2732 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

TCGGAACAAG CTTTTAAGTG AAATTGAGAT TTTATATAGT GAAGAATTTG CTATTGCTCA 60

ATGGGCTGTA GAATATTTAA CACAAACCTT AGAGATTCCA TTTAGTTATG ATGAAGCGGG 120

GTATATTGCG ATTCATATCC ATAGTGCTCG CAGCGGGCGT ACTGATAATA GTAAAAGTAT 180

CCGTGAAGTT ACAATCGTTT CTGAAATTAT TCATTTAATC GAGCAGGAAT TGGCTATTGA 240

TATTCATGAT GATAAAAATA GTCTCAGTTA TTCACGTTTG GTGAATCATT TACGTTTGTT 300

TATTCATCGC TTCCAACAAA ATCAATACGC TGTTTTAGAT GAAGAAATTT TGGAAGTCGT 360

AAAGAAAAAG TATGCTGAAA GCTATGAAAT CTCAAAAAAA GTACAAGTAT TATTAATGAG 420

AAATTTTCAT TATCAAGTAC CAAATGAAGA ACTTGGTTAT TTATCGATTC ACATTGAACG 480

ATTAAGAATG ACCAAATAAT AAAAATTAAA TAGTTTATGT GAGGAGTTAC GTATAATGAA 540

AGCGTATATG CAAAGAATGG GACGTTCGTT AATGTTACCA GTTGCTGTAT TACCGGCGGC 600
TTCTTTATTA GTCGGTATTG CTAACTGGAT CGTGGGAACA AfCGGTGCTA GTCCAGCCAC 660

AACTTTTCTA ATGAATGGTG GTTTGGCTAT CTTAAACAAC TTAGCATTAT TATTTGCGGT 720

TGGTTTAGCG TTAGGAATGT CAAAAGATAA AGATGGATCG GCAGCATTAG CTGGTTTAGT 780 

CGCTTATTTA GTGCCAAAAA CTGTTTTAGC GCCTGCTTCT ATCCAAGCCA TTAAAGGCTT 840 

CAAGGATATT GCTGAAGTAA ACCCAGCCTT CAACAGCATG GACAACAATG TCTTTGTCGG 900 

GATTGTCGCT GGTTTAGTTG CCGCGGCAAT GTATAATCGT TTCTCTGGTG TGAAATTGCC 960 

AATGGCTTTA TCATTCTTTA GTGGAAAACG TTTAGTTCCT ATTATGTCAG CTATTTCGAT 1020 

GTTAGCAATC TCAGCTGTCT TATTCTTCTT CTGGCCAGTT GTTTATAACG GATTAGTTGC 1080 

ATTTGGTAAA GGAATCTCTA GCTTAGGTTT CGTAGGAGCT GGCTTGTATG GTTTCTTTAA 1140 

CCGACTATTA ATTCCAACAG GATTACACCA TGCCTTGAAT TCTGTTTTCT GGTTTGATGT 1200 

TGCTGGTATC AATGATATTG GGAACTTTTT AGCTGGTCAA CAAGCGTTAG ATACTGGTAA 1260 

AGCAATTGTA GGACAAACAG GGATGTACCA AGCTGGTTTC TTCCCAGTAA TGATGTTCGG 1320 

TTTACCAGCA GGTGCTTTTG CTATTTATCA ATGTGCACGT CCTGAGAAGA AAAAAGTTAC 1380 

AGCTTCATTA ATGTTAGCTG CTGGTTTTGC GGCTTTCTTT ACAGGCGTTA CTGAACCTTT 1440

AGAATTCTCA TTTATGTTCG TTGCTTGGCC TTTATATGTT TTACATGCGG TCTTTACAGG 1500 

TATTTCTTTA GCATTCGCTG CGTTTATGCA TTGGACAGCT GGCTTTGCCT TTAGTGCAGG 1560 

TTTTGTAGAC TTCTTCTTAA GTTTGAAAAA CCCAGTCGCG AATCATCCAA TGATGTTGGT 1620 

GGTTCAAGGA CTTGTCTTTG CAGCAATTTA CTACTTCGGT TTCCGTTTTG CAATCACTAA 1680 

GTTTAACTTA ATGACACCAG GACGTGAAGA AGGCGATGGC GAAGAAACGC CAGATGTTGC 1740 

AGAAGGTGAC AACAAATTTG CTTCTCTAGC TCGTCGTATT TATGATGGTT TAGGCGCTGA 1800 

TGCTAATGTA ACTTCTATTG ATAACTGTAC AACTCGTTTA CGCTTAACAG TGAAAGATAC 1860

AGGCAAAGTG GATCAAGCAA AAATTAAAGC AACTGGGGTT CCTGGTGTGA AAGTCATTGA 1920 

TGACACCAAT ATCCAAGTGA TCGTTGGTAC GGAAGTACAA TTTGTTGCTG ACGAAATGCA 1980 

ACGATTATAT AACCACCAAG CGCCAGCAAC ACCAGTCAAA GAAACACCTG TTTCTCAATC 2040

AGTTGTTGAA GAAAAAGCAC CAGTTTCAAC GAAAGAAACT GAATTATATT CTGTAGCGAA 2100 

TGGGAAAGTT ATTCCTATTT CTGAAGTACC AGATGATGTT TTCTCAGCAA AAATGATGGG 2160 

TGACGGTTTT GCTGTCGTAC CAACAGACGG CGAAGTTTCA ACACCCGTTG CTGGTAAAAT 2220 

TACTAGCATT TTCCCAACAA AACATGCGTT AGGTATCCAA ACAGATTCAG GCATTGAAGT 2280

ATTATTACAC ATGGGTCTAG ATACTGTTGA GTTACAAGGT GGACCATTTA CATTACATGT 2340 

TGAAGAAGGC CAAGTTGTAA AACAAGGCGA TAAAATTGCA ACCATTGATT TAGCGGCATT 2400 

AGAACAAGCT GGCAAAAAAT CAGATTTAAT TGTTGTTTTC ACAAATCAAG ATATTGTTGC 2460

TCAATACGAT TTACAAAAAG CAGGTCAAAC GACTTCAATG AATGATGTAA TCGGAAATGT 2520 

TACCGTAAAA TAAAAGTAAA AAAGGGCTAG CCGTAAGGCA GCCCTTTTTT TGATGAAATT 2580 
CGATAAAATT ACTTGTTATT CTATTCTAGG TGAAAACGTT CAATCTTGCG AAGTTACGGT 2640

TTGTTCACAA AAAAGTTACT TTTATTAAAT AAAACTAAAG AAGTAACAAA AACTTCGCnT 2700 
TTTTCAGTTA TTTCAAGTAA CATAGAACGT AT 2732 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CCAATGGCTG GATGGAACAG AACAAGACGT GGTATCACGA TTACATCCGT GCGACAACTG 60 

CACAGTGGAA GGCTACCGTG TAATATCATC GATACTCCAG GACACGTGAC TTCACAAnTG 120 

AGTACACGTT CTCTACGCGT ATTAGATGGT GCGGTAACAG TTCTTGACTC TCAATCAGGT 180 

GTAGAaCCTC AAACAGAAAC TGTTTGGcGT CAAGCGACTG AATATAAAGT TCCTCGTATT 240

GTATTCTGTA ACAAAATGGA TAAAATCGGT GCGGACTTCT TCTATTCAGT AGAATCATTA 300 

CATGATCGTT TACAAGCAAA CGCTCATCCA ATTCAAATTC CAATCGGAGC TGAAGAAGAC 360 

TTCACTGGAA TCATTGACTT GATTAAAATG AAAGCTGAAA TCTACACAAA TGATTTGGGA 420 
ACAGATATTC AAGAAACTGA TATTCCTGAA GATTATTTAG AAAAAGCGCA AGAATGGCGC 480

GAAAAATTAG TTGAAGCTGT TGCTGAAACT GACGAAGACT TAATGATGAA ATATCTTGAA 540 

GGTGAAGAAA TCACTGAAGA AGAATTAGTC GCTGGTATCC GTCAAGCAAC AATCAACGTT 600

GAGTTCTTCC CTGTATTAGC TGGTTCAGCA TTCAAGAATA AAGGGGTTCA ATTAATGTTG 660 

GACGCTGTTC TTGATTACTT ACCATCTCCA TTAGATATCG ATGCTATCAA AGGGATCGAT 720 

ACTAAAACAG ACGAAGAAAC AACTCGTCCA GCTGATGACG AAGCACCTTT CGCTTCATTA 780

GCATTTAAAG TTATGACTGA CCCATTCGTA GGTCGTTTAA CTTTCTTCCG TGTTTATTCT 840

GGTGTCCTTG AAAGTGGTTC ATACGTATTG AACGCTTCAA AAGGCAAAAA AGAACGTATC 900 

GGTCGTATCC TACAAATGCA CGCCAACACT CGTCAAGAAA TTGACAAAGT GTACTCTGGT 960 

GATATCGCTG CTGCTGTTGG TCTAAAAGAC ACAACAACTG GTGATACATT ATGTGCATTA 1020 

GATGCACCAG TTATTCTTGA ATCtATTGAA TTCCCTGACC CAGTTATCCA AGTCGCTGTT 1080 

GAACCTAAAT CAAAAGCTGA CCAAGATAAA ATGGGCGTGG CTCTACAAAA ACTTGCTGAA 1140 

GAAGATCCAT 1150 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:



/"■ft PPPTZi ZiTA 
bAbbb irirVl , 


aapaaptaaa 




SACTATATGG ' 


"CACCGATGA : 


:gttcaaaaa 


60 


AbAA llol b A 


& apapttacp 


T T A TTT A PPP 


ATCACAGACA 1 


7GCAAGTAGA ACGTGATGTA 


120 


AAT G bAAAAb 


tab ptp & a a a 
1 AAb 1 bAAAA 


ziTziAAfAAAA 


AAACTCAGCC ■ 


rATAAACAGG J 


-.TTGGCTGAG 


180 


TTTTTTTAbb 


CTTCTTTTTT 


rpprprp TV A A P.P.P 


AAGGTG^TTG 


TAAAAGTGGT J 


CCCTAAGCCT 


240 


AAGTGACTAT 


bAAl bbbbAl 




AATAA T TCTG 


TGTAATTTTG i 


GACAATAGCT 


300 


AGACCTAGAC 


CGG1 bbbAbb 


ATR A TP A CnCl 

AbAA 1 ortbbo 


An APGTGCTT 


T^-CGACACG , 


ATAGAACCGT 


360 


TCAAAGATCC 


G 1 xbl ibb lb 


fpTTTTP APT A 

Hill brVb 1 rV 




CAGTATCTTG 


AACAGATAAA 


420 


CGTAATTGCT 


GATTTTCTAT 


A f~" r~< A T A a P A 7A 
Abb A 1 AAbArt 


rt 1 AO x nn lux 


GACCATTTTC 


TTGGGAAxAC 


480 


TGAATGGCAT 


TTTbAAl X Ab 


A TTPTTP APfl 

A 1 1 b X 1 bHbb 




ACAATTCGTA 


TTTGGTGAAA 


540 


AACTGGCTCT 


C T G bC Ab Ab b 


A 1 bAAbbrtb 1 


ul 1 1 1 wrtoo x 


GTTTTGCTGC 


CAATTGTTGC 


600 


TGATAAGAAC 


CGAGAATTTb 


1 b 1 AAi AAA 1 


TTTTPTS A AP 


TAATTTCTTG 


TTCTGCA7AA 


660 


GGAATTGTCG 


CGCTGCCCbG 


A/"'AAAf , TTPT 

Ab AAAb 1 1 b 1 


A A A ATTTPTT 
hnnn 1 X 1V_» x x 


G AAT T AAT TG 


TTGCAGACGC 


720 


TGGGCATCTT 


TTTGCATGAT 


b 1 bbAAAAA 1 


TP ATT A AP^A 


CTTCCGGATC 


TTCTTTGGCA 


780 


CCGTCTATGA 


GTGTCTCGGT 


tv jv t\ (~* (~< 7\ a a 
AAAGCbbAAA 


JS.r ACAAPTPA 


CAGGCGTTTT 


TAATTCATGa 


840 


GACACGTTTG 


ATACAAAATC 


TTT 1 I bbAl 1 


111 1 \* 1 nu 1 1 


GGCGAACGCG 


CGTTAAAtCA 


900 


TAGGCAATTC 


CTAACACTTG 


ATAAtbAI lb 


nr*PPTTTPAT 

UUuL- 1 X XOnX 


CA7CCAAGTA 


ACGAAGAGAA 


960 


AGATCTAATC 


GTAAAGGTGT 


Ibblbbl Ibl 


PTPAPAPTPA 
u 1 v^rt v_r*.o x un 


TTTCTTGATG 


TAAACTGTTT 


1020 


TTTTCAGCGA 


CGAbTibAxb 


P&TTZiZlATPA 
bAl lAAAlb-rt 


ATPAAATCGG 


CGTCTTGAAT 


CACTTCGAGA 


1080 


TAATTGCTTT 


GCATA X I 1 1 I 


nrrar ziczapt 


PPTAACATTT 

X rtflVrf^l X X X 


CCTGCATTTT 


AGGGTTCAAC 


1140 


AGACGAAGCC 


GTbbTTbbb I 


A TP A TP7A T Z\ 
Al brirw bril/1 


AAPAPACPAA 


TCATCAACTC 


ATCTAAAAGT 


1200 


TCGTGAAAI 1 


bb I LA 1 bbbb 




PPTTTATACG 


TT7GACTTAC 


TTGTTGaCTC 


1260 


AGTAGGT1AA 


mTV/^rprprp^^niTV 

1 Ab 1 1 ibb 1 A 




PATTCAGAAG 


ATTGCTGGaT 


AATGGATCGA 


1320 


TTTTCATCTG 


b A I b 1 1 1 AAb 


bril 1 bo 111L 


APAATCGGTA 


AG AC AG TC AC 


TAGCGGTTGA 


1380 


TTTTTTTGCC 


GTAAb AA A AA 




AAPPTAATTA 


GTAGGAACAA 


GATGCCAAAT 


1440 


GTAAAbAl AA 


1 A J. Abb b Abb 




ATATTTTTAG 


AAAATAGCGC 


GGTTTGTTCC 


1500 


bAAA I bbbAA 


1 AAi bbbAAb 


vjM. 1 1 lLi 1 


PATTT7AAAA 

uri x X X x rmm* 


CAGGCGTGGC 


AACGTACAGA 


1560 


AGTTCTTTAT 


TTAACGTTTT 


ACTCTTTCGT 


AAAGCTGAAC 


CATAGGTAGC 


TCCTGATAGA 


1620 


ACCGCAGCGA 


TTTCAGGTCG 


TTTTTTGTGC 


GACTCTAGCG 


GTTCATTTTG 


GCTGCTGTCA 


1680 


AAAAAGACGG 


TACCATTTTT 


TTGCAACAAA 


GTTACCCGTT 


CTTCTGCATC 


ATCTAACGCA 


1740 


tGGGTGaCAA 


GGAGTTGTTG 


TTGGTTTGTC 


AGTTGCCCTT 


7TTCGAAAAC 


GGAAGGGGGC 


1800 


AACTGGTCTG 


TTAAAAGGGT 


ATTTTTTCTT 


CTTAAATAGG 


7TTCTTGCTG 


AGCAACCATT 


1860 


TCTTTTTGAA 


, AGAAAAAATT 


1 GGTCGCTGCfl 


t ATACTCCCGA 


. CAAATA--TAA 


k AAGCATGAC? 


1920 
GCCGCAACTA AAAAATATTC AATCCGCAGT CTCTTTTTCA TCjGTTTTGGC TCCTGAAAAC 1980

GGTAGCCAAA GCCACGGACA GTTACTAAGT AAACGGGCCG TTTGGGATCG GGCTCGATTT 2040 

TATCACGTAA ATGACTAATG TGGACATCGA CGATGCGACT TTGTCCAGCG AAGTCATATT 2100 

GCCAAATTCG TTCCAACAAG GTTTCCCGAT TAATGACCCG ATCTTTGCGT TTCATGAAGT 2160 

AAACGAGTAA TTCAAATTCT TTCGGTGTTA GCTCAATGGG CTGATTGCGC ACAAAGACTT 2220 

CATAATTTTG TTCATCCACA CGAATCTCGC CAATCACAAG CGGAgCTTTA GGTGTGTCCT 2280 

CTTGAAGCGT TTCGGTCGTG GTAGGTTTTA AACGACGGAA AATAGCTTTC ATGCGTGCTA 2340

GGACTTCTCT GGGACTAAAA GGTTTTGTTA AATAATCATC TGCCCCAATT TCTAAGCCGA 2400 

TGATTTTATC TACTTGTTCA TCTTTGGCAG TTAAAATCAA AATAGGCGTG TCAATTTTTT 2460

CTCGACGAAG TGcTTTGGTA ATTTCTAAGC CATCCATACC AGGAAGCATT ACATCTAAAA 2520 

TTATAAAATC ATATTGATTA GACAAAGCTA ATTCGAAGCC GTTTTTACCA TCTTCGGAAG 2580 

TAGTCACCTG ATAGCCTTCT TTTTCCAAAT TAAAGGTTAA CAAAGTTAAA ATCGATGGTT 2640

CATCATCGAC GACAAGAACT TTTTTCATTT ACATAACTCC TTCAAGAAAT TTCTGAAAAT 2700 

AAGTGTATAT TTCACTATTG TTAGTTTAAC ATATTTTTAG AGGAAGAACG TACTTTAAAG 2760

TAAGCGAGAA TGCTACTTTT GAGAAAAGTC AACTCATGGT ATAATAGAGA AATGAAATGA 2820 

GGAGAAGCGA ATGATTGGTA TTAGTAGTTG CTTAGGTGGG ATTTGTTGCC GTTACGATGG 2880 

AAAATCTCAA GAACAAGAAG CGTTAAAACA ACTTGTAACT AATGGAAAAG CAATAATGAT 2940 

TTGTCCCGAA GTTTTAGGGG AGTTGCCTAT TCCACGTCCG CCAGCAGAAA TCaTTGGCGG 3000 

GGATGGCTTT GATGTTTGGG CGAATACTGC TTATGTATAT ACGGATAACG GGGAAGATGT 3060 

GACTGAGACA TTTAAAGCAG GGGCACAACG CGCGTATCAA AAATTAAAAG CCCTACAAAT 3120

TACAACCTTG ATTTTGAAAG AAAAAAGTCC CagCTGTGGC AGCCAGCTGA TTTATGATGG 3180 

ACAATTTTCT GGCATTAAAA AAGCTGGAGT TGGAGtCGCA ACTGCTTATT tTATTCAACA 3240 

AGGAATGACC GTTTATTCTG AAGAAACATG GCAACTTGCG GACATTTCTT TTTAAAAGGT 3300 

AAAAACTTAG TAAAAAAAGA TTGACTATTC CCTATGAAAT CGGTAaCCTA TAAGGGATTG 3360 

AaGATGGGAG CACGGCTCAT GGAAaTGGmA AAAACAAACC GCATGAATGC GTTATTTGAA 3420

TTTTATTCGA CATTATTAAC GGAAAAGCAG ATGAATTATA TGGAGCTTTA CTATGCTGAT 3480

GACTTTTCTT TAGGAGAAAT TGCGGAAGAA TATGAGGTTA GTCGCCAAGC GGTTTATGAT 3540

AACATTAAAC GAACCGAAAA AATTTTAGAA GAATACGAAA AAAAGTTACA TCTTTATTCA 3600 

AACTATGTTG TACGGAAAGA ATTTCTTGAA CAACTACAGC AGTATGTTCA AGAAACTTAT 3660 

CCCAAAGATA GTGAAATCAA AAAATATATT GAACAAATTC AAGAAATAGA GGAATGAAAC 3720 

TATGGCTTTT GAGAGTTTAA CAAACCGCCT ACAGCAGGCA ATGAGTAAAA TCCGTCGTAA 3780 

GGGAAAAGTT TCCGAAGCCG ACGTAAAAGA AATGATGCGA GAAATCCGTT TGGCTTTATT 3840 

AGAAGCCGAC GTTAATTTAC AAGTGGTCAA AGATTTCACA AAACGTGTCA GAGAACGGGC 3900 






AGTAGGAGTC GAAGTATTAG AAAGCTTATC ACCAGCCCAA C^AATTGTAA AAATTGTTGA 3960

TGAAGAATTA _ACGAAAACGT TAPPTTPAPA AACGGTTGAA CTGAATAAAT CTCCAAAAAT 4020

CCCGACAGTG ATTATGATGA CAGGGTTACA AGGGGCTGGT AAAACAACTT TTACTGGTAA 4080

ATTAGCAAAA CACTTAATGA AAACTGAAAA CGCTCGTCCG CTTTTAATCG CTGGTGACGT 4140

TTATCGTCCA GCAGCGATTG ATCAGTTGAA GGTTTTAGGT CAACAATTAG AAGTTCCCGT 4200

TTTTGATATG GGaACAGATG CTAATCCAGT GGAAATTGTT CGTCAAGGGT TAGCATTAGC 4260

AAAAGAAAAG ArAAATGATT ATGTCTTAAT TGATACGGCC GGCCGTTTAC ACATTGACGA 4320

AGCTTTAATG GACGAATTGA AACAAATTAA AGAGTTGGCT GATCCCAATG AAATTCTGTT 4380

AGTTGTTGAT GCGATGACGG pppaapatpp TPTGAACGTT GCAGATAGTT TTAATGAACA 4440

GCTTGGAATT ACTGGGGTTG ttattappaa AATPPATGPP GATACTCGTG GGGGGGCTGC 4500

GCTGTCAATT CGGGCAGTAA rrrrTCCTCC PATTAAATTT GTCGGTTCTG GTGAAAAATT 4560

AACCGATTTA GAAATTTTCC nTrrrrnTrr TATPTPPAPT PGTATCCTAG GTATGGGGGA 4620

CATGTTGACG CTAATTGAAA aappppaapa APATTAPPAT GAGAAAAAAG CAGAAGAACT 4680

TGCTCAAAAA ATGAAAGAAA apapttttpa PTTTAAPGAT TTCaTTGAGC AATTGGATCa 4740

AGTTATGGGC ATGGGACCGA ttpaapaptt ATTAAAAATG ATCCCTGGAA TGAGTAACAT 4800

GCCTGGTATT GAAAATGTCA aaptppatpp AAAAPATGTG GCACGGAAAC GGGCGATGGT 4860

CTTATCAATG ACCCCTGCAG AAPPTPAAAA TPPTGATCTA TTAAATCCTA GTCGCCGCCG 4920

CAGAATTGCA GCTGGTTCAG PAAATAPTPT PPTTGAAGTC AATCGTATGA TTAAACAATT 4980

TAAAGAATCC AAAAAAATGA tppaapaaat PTPPAAAGGG GATATGAACA TTCCTGGTAT 5040

GGATCaAATG TTAGGTGGCG pppttaaapp PAAGPTAGGT AAAATGGCCA TGAATCGTAT 5100

GATGAAGAAA AACAAAAAGA APaaAAAPAA AAAAAGATAA ACAGGCTTTA GTTAATTATT 5160

GGAAGAAACa GAAAGTGCGT patpaptptp TTPPAPTGAT GGCGCACTTT CTGTTTAAGC 5220

GTAGTTTTGG ATAATTGCGT VTAAPPPTTP PPPAAAAPCA GCGCCGTGTT GTTCATAATA 5280

TGACGTAAAT CGTTCATCGC PAAPATAPAT AAGGCCTAAC CCTTTGTGTG CTTCTGGCGA 5340

ATAGCTTGAC CAAGAGTACA TTAACCAATC TTTGTGTAAT TGGAAAATTT CTTGCGCAAG 5400

TTGGCTAGTC AACTTGGGTG ATTTTTGATA CATGGCTAAC TTAGAAAATA AATCTTTTTC 5460

TGTCTTTTTC CGTCAAGTTT AAATATTTTT TGTTGGCTTG 5520

TTCAAtCGTT TCGTTGCCAT ATTTTTCCCG AATCTCTTTA CCATATTTTT CTtCGTTTTC 5580

TTGAACTTTT TGTTGTTTGA ATGCTTCGAA TTTTTGCTGA TCTGACATGI llAlClul^u 5640

TTTGtAATGG TGTAAGGTTT GTTGAACAGA GGCTAAAAGA CGATCAATTT CATTGCGCTT 5700

TTCCAATAGT TTTTGTTGAT GCTCGTATAA AGCCTGTTCA ACATTAAAGT CAGGTTGCTC 5760

CAAAATTTCT TTAATTTCAT CTAGTTTCAG ATCCAATTCA CGGTAAAAAA GAATTTGTTG 5820

CAAACGATTC ACCTCGGCTT GCCCGTAAAT TCGATAACCA GAAGAATTAA TTCTGGCTGG 5880
TTGAAGAAGA CCAATTTCAT CATAATATCG TAACGTCCGT GCACTGACGC CAGATAGTGA 5940

AGCCATTTTT TTAATTGTAT ATTCCATCAT TCATTCCTCC TGACAAAATC TATCATAAAC 6000 

CTTTACGTAA CGAGAGGGTC AAGCAAAAAA ATAAAAAAAG TGAGACAACC TGTAAAAAGT 6060 

TGCCTCACTT TTTCTCGATT GATTCAATTA GGCAGTTAAG CGATTAAAGA AATCCACGTA 6120 

AGCATCTACA ATTGATTGTA AGAAGCCAAC AGTGCCATCA ATAATTTTAC CATCTTCACC 6180 

AATTAAGTTC GTAATACCGC CAATATAGGC TTCAGGTTGT TGTAAAGTTG GAACATTTAA 6240

GAAGACTAAT GATTGTCTTA AATGATGGTT CGCACCaAAT CCGCTAATkG CaCCTGGtGa 6300 

AACACTCaCC aCAAGTCCtG GTTTTTayCC CCAAACGcTT TCgCCATACG GACGTGAACC 6360 

GACATCCCAA TGCATTTTTT AAAACAGCTG GTACTGAACG GTTATATTCT GGTGTTACAA 6420

ACATTACGCC ATCTACTTGT TTAACTTCTT CACGGAATGT TGTCCAAGCT TCGGTGGGGG 6480

TTCCTTCaTC ATCCmAATCT TGATTGTATA ATTCTAAATC ACCTATGTTA ATGAAAACAG 6540 

GTTCGAAATC AGCTGGAAAT AAGTCCGCCA TTGTTTTTGC TACCAATTTG TTAAATGAAT 6600 

CTTTTCTTAA ACTCCCTACA AAAATACCAA TTTTTTTGTC ATGTAAATCC CTCCATTTTT 6660 

CTTTTACTTA CTTTATGATA GTACTTTTTC AAAAAAAGCG CAAACGTTTT ATCCTTGTAC 6720 

ATTGTTGAA 6729 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

AACtGTTCAC TTGtTCCATT AACCAGCAAT GTACCTAACG CTGTTTCCAC TTCATATTGA 60 

TAGCTTTTCC CTAAAAATGT TCGGACAGCG ACAGTACCTG CGACAGTATT AGTAGCAGCT 120 

TCTGAAACAA TCTCAATGTC CTCTGGAcGA AtCGTTGCCA CCGTCGTTGT AACATTTGGT 180 

TCAGGATTAG TCGTTTTCAC TATTTGATTT GTTTGTGTCC GATAATGACC GGCTTCCATT 240 

TTTTCGACAG AGAAAAAGTT TTCAAAACCG ATAAATTCTG CAACAAACTT AGTACGAGGC 300 

AAACGATAAA TCGTTTCTGG TGAATCATAT TGTTCAATCA CGCCATTATT CATAATGGCC 360 

ACTTTATCTG AAATAGAAAA ACATTCTTCT TGGTCATGTG TGACAAAAAC TGTCGTAATG 420

CCTAATTGTT GTTGAATCCG TTTGATTTCA ATCCGCATTG CCACTCGCAA TTTTGCATCT 480

AAATTACTCA ACGGTTCATC TAGTAGTAAT AGTTTAGGTT CGATAATCAG AGCACGTGcT 540 

AGCGCAACCC GCTGTCTTTG ACCACCAGAT AATTGTTTTG GATAACGGTC GCCCAAGCTT 600 

TCTAAACCAC ATACTTTTAG CATTTCCGTG ACTTTTTGGT CCATGGTTTG TTTGTTTTCT 660 

TTGCGTAATT TCAGCCCAAA AGCGACATTT TCTGCAATCG TTAAATG 707
(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4985 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 







o J. uunl o x \J\*> 


(TGGTTCCAT 


TCnAGGTGAT 


TTAGCCAAAC 


60 


GTTCCTTATG CGTAAGCCTC TGAAGTCAGC CACCTTTACG AACGAAAAAG AGAGCGCAGC 120

AACGATTGCT TATTTAAGGG AATTGATTGA AAAAGTTGGT TTCAmCCCTA AATATTATAC 180

TGCCATTAAT TCAAGCTATG ATTTACCTTA CGATTTTTAT CGTCCAAATA AAGACCGTCA 240

TCGTACACAA ATTGAATTGA TGCAAAAAGA TGGCTCATTA GTGGAATTGG CAACGGTCAG 300

TCCCCTAGTT GCAGCACTTG CTGGTCAGAG TCAAGGAGAT GAACGTTTCT ATTTCCCTAA 360

AGAAATGCTT GACCAAGGAA ATAAAAAACA TTACGATTTA TTTGATGAGA CGTATCGTGA 420

GTTTTCAAGT TACATTCATA ATGGTGCGTT AGTTTTAAAA AAATAGGTTT TCTAGGAGGA 480

TTCTATGTCA ATTAAGTTAG TTGCTATTGA TATCGACGGA ACATTGCTAA ATTCACAACA 540

CAAGATTtAC CCCACGGGTC AAAGAAGCGC TCCAAAAAGC AAATGAGCAA GGTGTTCGTA 600

TTGTTTTATG TACAGGCCGT CCTTTGCCAr gCGTGAAAGA ACAATTGGAT GAATTAGCCT 660

TATATGGTGA AAATGATTtC GTGATtACCT ACAACGGTTC GCTTGTCCAA GCAACCAAAG 720

ATAATACGAT TATTTCACGC TATACCTTGA GTTATGAGGA TTTTTaGAAA TTGAAATGTA 780

TTCTCGTAAA GTCGGCGCTC ACTTGCATAC AATTGATGAT TCCGCTATTT ACACTGCCAA 840

TCGCAATATT GGTAAATATA CGATTCACGA AGCATCTTTA GTGAACATGC CTTTAAAATA 900

TCGTACGGTG GATGAAATGA CACCAGAGAT GAACATTATT AAAATGATGA TGATTGATGA 960

GCCGGAAGTT TTAGATCCTG CCATTGCAAA ATTACCATTA CATTTTACCG AAAAATATAC 1020

GACTGTTAAA AGTACGCCTT TTTACTATGA AATCATGAAT AAAAATGCTA GCAAAGGCAA 1080

TGCTCTAGCA AAATTGGCAG ACCATTTAGG CTTAAATAAA GACGAAGTGA TGC3OUAJ. IbVj 1140

TGACAATGAA AATGACTTAT CCATGATTGA TTACGCTGGG ATTGGTGTTG CGATGGGCAA 1200

TGCGACAGAA AATGTTAAAA CAATTGCCGA TGTGCATACC ACTAGTAATG ACGAAGATGG 1260

TGTCGCTCAA ATTATTGAAA AAATGGTTTT AATTTAAACA TTCTTTTCAA TAACAATAAC 1320

CCTCAATCTA ATCAACATAG ATTGAGGGTT ATTCGTTATT TTATGATGCT GCCGCTAGCC 1380

ATTTTTCAAC GGCCCAACGG TGGCTTCCTC TTTTTAATAA ACGGACTGCT TCTTCTGGTG 1440

CAACCCAGTG TAAAGTATTG GTCCTTTCTA AGGGTTCGCT AAGTTGCCGC CAAGTATTCG 1500

CTACATAAAA ATATCCAGGA TTATAATAGG CAGTTTGGCG ATGGTTCGAA TAAAAATATT 1560

CATCAGCTTC CCCTAAATAA CAGCCAATTT CTACACTAAT TCCTAATTCT TCTAGCACTT 1620

CTCGGTGGAT CGCTTCTTCT TTCGTTTCGG TGCCTTCAAT TTCGCCTCCT GGTAAAAAAT 1680
AAGCTCCATT TGGCGCTTGT ACCAGAACCA TTGTGTTATT TTCTGGCTTT GAAACAATAA 1740

TGTATGCAGC ATATCTTGTT TGATAGGTCA ATGTCTCTTC TCTTTTTCCA AACGTTGGCG 1800 

TTTTCATTGG CGCACCTCTT CTCTTTTCAT CTATTTTACC TGAGAAATTC CTTTAGAACA 1860 

AGTCAGAAAC CAAGCGTTTT cTCTACATTT CACGTTTATC TATGCTACTA TTAACGTTGG 1920 

AAATATAATT AAGTTTGAAA GGGAGTTTTT AACTATGAAA ATGGCTCATA CATGTGTGCG 1980 

TGTAAAAGAT TTAGAGGCAT CATTAGATTT CTATCAAAAA GCCTTCAATT TTGAAGAAAG 2040 

TCGCCGTCGT GATTTTCCAG AAAATAAATT TACTTTGGTT TATTTAACTT TACCAGGAGA 2100 

CGACTACGAA CTAGAACTAA CTTATAATTA TGATCATGAA GCATACGATT TAGGAAACGG 2160 

TTATGGTCAT ATCGCCATCA GTACAGATGA TTTAGAAGGA TTACACGAAA AACATCAAGC 2220 

GGCTGGCTTT ACTGTCACTG ACTTAAAAGG CTTACCTGGA ACTGCACCAT CTTATTACTT 2280 

TGTTGTCGAC CCAGATGGTT ATAAAATCGA AGTCATTCGT GAACGTTAAA AAGTTTCACT 2340 

TGTTCACTTA CTGAGCTTTT TGTGTTTGAG AGCTGTCCGA GACAAAACTT GTCTCGACTT 2400 

CTTCTCAAAC ACTTTTTTCA AAAAAAGTGC TACAATAGAA CGTATGAATT TATTGAGGAT 2460

GTGATGTTAT TATGACAAAA AAAATTATTG GAATCGCTGG CAATCAACTT TTGCAGGCAG 2520

CTGAAGTGTT TCACGGTAAC CAAGTGACGT ACACCCCACA AGGTTTTGTC AGCGCTGTTC 2580 

AAGCCGCAGt GGCGTTCCTC TCGTTTTGCC AATTGGCCCC AAAGAATTAG CCGCTACGTA 2640 

TATACAACAA ATTGATAAAC TACTTTTAGC AGGCGGTCAA GATGTTGCCC CTGAATTTTA 2700 

CGGACAAGAA CCCCATATAA AATTAGAAGA AACGAATCGT GATCGTGACG AGTTTGAGTT 2760 

AGCCCTAATT GAAGAAGCCT TAAAACAAAA CAAACCTATT TTTGCGGTTT GCCGTGGAAT 2820 

GCAATTAGTC AATGTTGCTT TAGGTGGCAC ATTATATCAA GATTTATCGA TGTATCCTCA 2880 

ATGGTCAGTT AAGCACGGAC AACAACCCAC TCAACCAATC TTTGCGACGC ATCGTATTGA 2940 

TGTTGAACCA GACAGTCAAT TATCAAACAT ATATGGTACA ACTGGTCAAG TCAACTCGTA 3000 

CCATCACCAA GCACTACATA CTTTAGGAAA AGATTTACGC GTTACTGCTT GGTCCTCTGA 3060 

TGGCCTTGCA GAAGCAGTGG AATCAACAAA TGAACAGCAG CCTTTATTAG CTGTTCAATG 3120 

GCATCCTGAA CTAATGTACG CACGTGATGC CAAATCCCAA GCACTCTTTA ACTACTTTGT 3180 

TCAAAAATTA TAGTAAAAAC GGmGGGACGT CTTTCGCCCC TCCGTTTTTA CTTGCtATCA 3240

TACTTAAAAA TCAAAATTAT CTGGAtCTGC ACCAAAACGA TGATCCTTAT TTAACTGATT 3300 

AATTGCGGTT ATtTCTTCTG GTGTTAATTC AAAATCAAAG ACAGCAAAAT TTTCCTGAAT 3360 

TCTTTTTTCG TGAACAGATT TGGGAATCAC GATAATATCA TTTTGTAAGT GCCAACGAAT 3420 

AATCACTTGG GCCGCTGATT TACCATATTT TTCACCAATT TTCACTAGTA CTGGTTCAGA 3480 

TAATAAATTA CCTTGTCCTA GAGGAGACCA AGCCTCCACG GCAATCTTAT TTTCGGCTAA 3540

ATATTTTCGT AGCGGTTCTT GTGTTAACGT AGGATGTAAT TCAATTTGAT CCACCATTGG 3600 

CACAACATTT GCTTCTGTTA ATAAATCTTC CAAATGATGT TGATGAAAAT TGGAAACGCC 3660 
AATTGCTTTG GCACGACCAC TTGCGTAAAT TTCTTCTAAT GCTTTCCATG ATTCTTTATA 3720

TTTACCAGCA ACAGGCCAGT GAATTAAATA TAAATCAACA TAGTCTAACC CTAATTTTTC 3780

TAAGCTTTTA TCAAACGCTT TAATCGCATT TTCATAGCCT TGATCTTCAT TCCATAATTT 3840

CGTCGTCACG AAAATTTCTT CGCGGGGAAT ACCCGATTGA CGAATGCCTT CGCCCACCCC 3900

TACTTCGTTT TTATAAACGG CTGCTGTATC AATCAAGCGA TAACCCACTT CTAAAGCCCA 3960

TTTTACTGAA TTTGTTGCTT CTGAGCCATC TTCTACACGC CAAACACCTA AACCTAGGCG 4020

TGGCATTTCA TTGCCGTTTG CTAATTTTTT TGTTTTATCC AAAGTCATGT GTTTCTTCAG 4080

TCCCTTCATT TTCATTACTT TTAGTATACT ACGCATTCTT CTTTCACGAA ACAATCACGA 4140

CTTGTCAGGT CATTTTCTTT ATTTTTAATA AATAACGACG CCAATAAGTC AGCACTCCGA 4200

TTAGTGCCAC AACTACAATT GTTCGAaCCG TAACGCTTCC AATCCACAGT TTAACTGCAA 4260

GTAACCAGAT ACAAAAGAGA TAAACACAGG ACCAATTAAA GGAAACTTTC AATGTACATG 4320

ATTCCGCAAT TTCGATTATC TGACTACTAA AAAAACGACT CCCTAATCGT AAAGCGACTG 4380

GATTATGGAA GATGAATGTT CGTCGCTTGT CAGCCGTCAA GCGAGTAGCA CATCGCTCAT 4440

CAATAAAAAT TGTTACAGGT AATTTAGAAA GTAAATCATT TTTTGCCCAT TCAATTGAAA 4500

CTTTCACGCA GCACTCACAA CCTTTAACAA AATTATAAAA TATATTATTT ATTTACAAAA 4560

TTATAACATT CCTAAAAAAA CATGTTAAGT CTTAAAAAAA AACTTTCTAT TTTCTTCCCC 4620

TTTCGCCGAG AGATTTAGTA TACTAAAAGA AGAAATTTTA GAGAAAGTTG GATTTTTATG 4680

TTGATACTTG GTTTAGCCAG TACTGAAACA GTCGATTCTT CTGCTCCTGA TTTAGCAGAA 4740

GCGACCGTAA AAAAAGTGAG CGCGCTACAA CGCTTTTGGA ACAATATTAA TTGGGATGCC 4800

ATTGTCGCTA CCTTAATTGA AAAAAGTTTA TCAATTTTAT TTTTAATTTT TCTATTTTTT 4860

ATTATTCAGC GAATCGGGAA GTATTTAATT GATCGCACCT ATGCAAATTA TAGTAAAAAG 4920

CAACATTTTA GTGAAAGCCG TTTAAAAACG TTACACACTT TAATCATTAA TGCATTTCAA 4980

TATAC 4985

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1346 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

CGTCTGTCCT TATCTCGCAC AGCTATTTGG AAAGCGATTA ATGAATTGAA AAAAGATGGC 60

TATCAAATTA CTAGCGTGCA AAACAAAGGG TATCGCCTTG AAAAATCAGA TGTCCTTTCT 120

GCAGAAGGCA TTCAGTTGGC ATTATGGCCA CAAACACCGC CTTTAACAAT TACCGTCCTA 180

GAAACATCTG AATCTACCAT GACCGATGCT AAGCGGGCGA TTTTAGACCA GACGCCCGAC 240

AATACTTTGA TTGTTGCTGA CATGCAAGAA ATGCCTCGGG GCAGGTTCGG CCGGCCGTTT 300
TTCGCTACGC CAGGTAAAGG GATTTACATG AGCATGGTGC TTCAACCGAA CCAAAATTTT 360

GAAGAAATCG CTCAATATAC TGTGATTATG GCCGTAGCGG TCGCTCGAGC AATGGATGCT 420 

CTGGCAGGCG TTCACACAGA AATTAAATGG GTCAACGATC TTTATTTAAA TGGCaAAAAA 480

GTTTGCGGGa TTTTGTCTGw AGCCATGAGC AATGTGGAAA CAGGGCAGAT TTCTAATGTT 540

ATCATCGGCA TGGGGATTAA TTTTTCAATT ACAGAAACCG AATTCCCTGA GGACATTCGG 600 

CAGAAAGTGA CTTCCCTTTT CCCAGATGGT GAACCGACGG CGACGCGAAA TGAATTAGTC 660 

GCCAACATTT GGAATCATTT TTACCAAATT TTAGACGAAC GTTCAACGAA AGACTTTTTA 720 

GAAGAATACC GGCAACGCTC CTTTGTGATT GGCAAAGACG TCGCTTTTAC CCAAAACGGG 780

CAAGACTTCC GCGGCGTTGC GACAACCATT AATGGAAATG GGGAATTGAT TGTCCAGTTG 840

CCTGATGGCA CGGCCAAAAC ATTGTCTTCT GGAGAAATTA GTTTGGACCA GATTGGTGAA 900 

TGGCGTCGAG GATAAGTGGG GAAGTTTAGA AAAAAGCAAG AAAGCACCTG TCTAAATTTA 960 

AACAGATGCT TTCTTGCTCT TTAAAAATCA ACTTCTTTTT GTGCTAACAT TTCGCACAAT 1020 

TCTTTGGGGA AATTCGTATA GGGATTTTCA TTGATTTTCA TTTCTTCTAA AGGAGACAAC 1080 

CAGTTTTTAT TCGTATTCTG AAAAGCAATA GCTACTTTTT CTTCAAGATA CTTACATACT 1140

TCTGCGCCTT TAAAATTTCG ATAACTGTCT TTTAAATGCA GAACTTTCTC TAATCTCCCA 1200 

TATAAATtTT CTCTAGTCAT GTATCGATTG ACTTCCTCAA AGTGAAGTAG TAACCAGAGT 1260 

TCAAAGCATT CATTTGAAAA AATTGTCTTG ATTTCAGGAC GATGTCGTTG TGCACATCTn 1320 

ACCGCGTTTT CAATCGATTC nGTGGT 1346 
(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1959 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CCAAAATATk GaTGCAATGG TTtTGkTTyC CTGaCAATAG nTGayCGATC GTCACAAATT 60 

CGTATCCTTC ATTTTTTAAC ATAGTGATCA ATCCTGATAC AGCTTTGACT GATTCATGAT 120 

GGATATCGTG GATTAAGATG ATTGCGCCAT TAAAGACATT CTGTTTGACA ACATTATTGA 180 

TTGCTTCAGG ATTTCTTGTT TTCCAATCTA GCGAATCAAT ATTCCATTGA ATAATTGGCA 240 

TGCCAATTGT TTCAGCACTT TTTTTATCAA TAGCGCCATA AGGAGGACGG ATATTGCGAG 300 

GCAAGACGCC CGTAGCCTTA AAAATTGCTT TATTAGCTTT ATTCATTTCA CTTTGTAATT 360 

CATCTGTCGA AAGTGTATTC AACTGTGGAT GAGAATACGA GTGACATGCT ACCTCATGAC 420

CCTCGTCGTG GACTTGTTTT GCAACATCTG GGTTTTGATC CACCATCTGT CCTAACATGA 480

AAAAAGTAGC CTTCACGTTG TTTGTTTTTA GTATATTCAA CAAATCAAGT GTTGAGCTAT 540

TATTGGGTCC ATCATCAAAA GTAAGAGCGA CATACTTTTT ATTTTCATCT AATGCAGGTA 600

ACGCATCTTT TATACTTTCT TGCGAAAGCA GATCTG-TATG AATAAAAGGC GCAATGTCTC 660 

TGTATTTTAG AATAACTTTA GTAGTTCCAG TTGTGTTCTT TGGTAGGTCG ATGGTCAATT 720 

CATCAGGTGT ATAAGTCATT TTACTCTCGT AAGAAATTCT ATCCATCGTT AATACAGCAT 780 

CAATTATTTT TTTAGGCTCT TTGGCGTGAT CAAGAATGGA TTGCTGAATC ACCTGTTGAA 840

TACCTAAGAG ACTTCCTTCA TCCGGAATCA AATCCTTATT TGTGATTTCT TTACCAGTTT 900 

TTTCGGAAAC ATAAATGGCT TTCTCAGAAA TACTGTCTGG CTTTTTAAAA TTTTCCTTAC 960 

TGCGATCCCA GATAAAACTA TCTGCTTCAA TTCTATACTG ACCAATTTTA GAAGATATTG 1020 

CTGTTGCTTT TATTCTGGCA ACTGTAAGCA CTTCTTTTTC ACGATGCTTC TTTTGTTGCT 1080 

TTTGAGCAAG CTCGTCTAAG AGCTGGTTTG CATTTTTAAA AGGAACAGTA CTTTCATAAA 1140 

CTGGACGGTA AATAAGGCAC TTATTTTCTC CGTCATTTAT AGTATCTATT TTCGTTGAAA 1200 

TTCCACTTTG GTCTTGCTCT TCTTGGATTG AAGCCATGAG ATTTTTTACA GCAGTTTCAT 1260 

ATTTGGCTTG AGCTTCTACT TCTTGCTGTT TTAATGTATA AACAATATAG CCGCCAGTAG 1320 

CGACAATTAC TAGAATAAGT GCGGTAAACA ATGCGAATAT ATACTTGCTA TAATTAGATT 1380 

TTTTCTTCGG TATTTTTGGT TGTTTGTTTT CGTTTCCGGT TTTGGGCTTT ACAGCTTCAT 1440

TCAATTGATT TGCATTTTCA CAGATATCAG GTTCTTGTTT GTGACGTTGA CTACGACGTG 1500

CATAGATATT TTTATCAGAC ATATGACTTT ATCCTTCCTT TTATCATCAT AAAGTAAAAT 1560

GAAACAGCGC AATTGTAATA AAAAATAATT TATAGATAAt TTTTACAAGA AAGTTTTCTC 1620 

TATTATTCtT TTTCtTTTGG CGTAGTAATT ATCGCTAATT CCTCACCATT TTAGATATTG 1680 

TGGCTACTTA ATGTTAAACT CAATTTTGAT ATCATTAGAA TCACTAGTTT TGATTTTAGT 1740 

TTTTGGAATT GTACCCTTCG ATAGGTTAAT CACTATGGTT GCAGTCTTTG AATCACCAAT 1800 

AATGGTCTAG GATTAAAAAT ATAAACTATT ATCCAATCTT GTTCTATTTG ATTAAATATA 1860 

CTACTTTAAT ACCTTTAATT TGTGATCAAT TCCAACTTCA CCAAACTCAT TTACAATTAT 1920 

AATAATTTCT TCCCTGGGAT GGGGTGAGAC TTGCAAGGC 1959 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3625 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

TTTTTATTTG GAGGGATTTA TTGTGTTAAA ACCATTAGGC GATCGCGTCG TAATTAGAGT 60 

CGCGAAAGAA GAAGAAAAAA CTGTTGGAGG AATTGTTCTT GCATCCGTTG CAAAAGAAAA 120 

ACCACAAACA GGTGAAGTTA TCGCAGTAGG TGAAGGTCGT GTGCTTGAAA ATGGCACAAA 180

AGTTCCGATG GAAGTAAAAA TTGGTGACAC AGTAATGTTT GAAAAATATT CAGGAACAGA 240
AGTGAAATAC GAgGCGTAGA ATACTTAATT GTATCAGCCA AAGACATTAT TGCCACTGTT
GAATAATAAA CAACTTTTAC AGAaTAATAT AATAGAAkAA AAAAATTATG AGGTG'aAGAA
ATCATGGCAA AAGAGnTTAA ATTTGCAGAA GATGCACGTG CAGCAATGCT ACGCGGAGTA 
GATGTATTAG CAGATACAGT GAAAGTGACA TTAGGCCCTA AAGGTCGTAA CGTTGTTTTA 
GAAAAATCAT TTGGTTCACC ATTGATTACT AACGATGGAG TAACAATTGC TAAGGAAATT 
GAATTGGAAG ATCATTTTGA AAACATGGGC GCAAAATTAG TTTCAGAAGT TGCTTCTAAA 
ACGAATGATA TTGCTGGTGA CGGAACAACA ACAGCGACTG TTTTGACACA AGCCATTGT7 
CGTGAAGGCT TAAAAAACGT AACTGCTGGA GCCAACCCAT TAGGTATTCG TCGTGGGAT? 
GAATTAGCAA CAAAAACAGC AGTAGAAGAA TTACACAATA TTTCATCTGT AGTTGATTCA 
AAAGAAGCGA TTGCACAAGT CGCTGCTGTT TCATCAGGTT CTGAAAAAGT CGGCCAATTA 
ATTGCCGATG CAATGGAAAA AGTTGGTAAC GACGGCGTAA TTACCATTGA AGAATCAAAA 
GGGATTGAAA CAGAATTAGA TGTGGTTGAA GGAATGCAAT TCGACCGCGG TTATTTATCT 
CAATACATGG TTACTGACAA CGATAAAATG GAAGCTGTTT TAGAAAATCC ATATATCTTA 
ATTACCGACA AAAAAATCTC AAATATTCAA GATATCTTAC CTTTATTAGA ACAAATTCTA 
CAACAAAGCC GTCCACTATT GATTATTGCG GATGATG7TG ATC-GGGAAGC 7CTACCAACA 
TTAGTATTGA ACAAAATCCG TGGTACATTT AATGTTGTCG CAGTAAAAGC GCCAGGATTT 
GGTGACCGCC GCAAAGCGAT GCTTGAAGAT ATTGCTATTT TAACAGGTGG TACAGTAATC 
ACTGACGACT TAGGGTTAGA GTTAAAAGAC ACAACTATTG AAAACTTAGG AAATGCTAGC 
AAAGTAGTTG TCGACAAAGA TAACACAACA ATTGTCGAAG GTGCTGGTTC AAAAGAAGCC 
ATTGATGCCC GCGTTCATTT AATTAAAAAC CAAATCGGCG AAACAACGTC T GA TTTTGAT 
CGTGAAAAAT TACAAGAACG TTTAGCTAAA TTAGCTGGTG GGGTTGCTGT CGTTAAAGTC 
GGTGCTGCAA CTGAAACAGA ATTAAAAGAA TTAAAATTAC GAATTGAAGA TGCATTAAAC 
GCAACACGTG CCGCTGTAGA AGAAGGCATG GTTTCTGGTG GTGGTACCGC ACTGGTCAAT 
GTAATTGGTA AAGTCGCTGC GCTAGAAGCT GAAGGCGATG TGGCAACAGG GATCAAGATT 
GTCGTTCGTG CATTAGAAGA ACCAATCCGT CAAATCGCTG AAAATGCTGG TTATGAAGGA 
TCAGTGATTG TTGACAAACT AAAAAATGTT GACTTAGGTA TCGGATTCAA 7GCAGCTAAC 
GGTGAATGGG TAAACATGGT TGAAGCCGGT ATTGTTGACC CAACAAAAGT AACTCGTTCT 
GCCTTACAAA ATGCAGCTTC TGTGTCAGCT TtATTATTAA CAACTGaAGC AGTTGTTGCA 
GACAAACCAG AACCAGCTGC ACCAGCTCCT ATGATGGATC CATCAATGGG CATGGGCGGT 
ATGATGTAAA AATAAAAAAG TATTCTAAAA GCTTACGATT GAAAAATCGT AAGCTTTTTT 
TTACATAAAG'nATAAATAAT CTAAATAATA TCTTTGCTTA TGGTGTTTTC 7CAAATATTT 
ACTCAATTAA CAAAAAAGAT GATAATTTAA AGGCTTCGTT TTAGCTAGAC AGCC TTGTTT 
TTATTCGATA AAATTTAGTA ATAACAATCA TTTTTTGAAA ACTAGTAGTT GGTAAT7TTT 



300 
"360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
ATTTATTAAT AGAATATAAA TAAAAATTAT TATATTTTGA AGGGGAATAT TAGGAAGATT 2280

GAGGTGGAAA AATGGAAAAA TATACAATTG GTGTCGTTCC TTTGACAATT GAAACAAAAG 2340

AATACTGTCA AAATTGGTTT GTTAGTAATC AAGTAGAATG TCAGATCATC CAAGGAACAA 2400

ATATACTTCT TCATTTGCAC CATATTGATG GATTAGTGAT TGAAGTTACT GACCACCAGT 2460

AAGTAAATAC TTGTTGTGAG TTATTAATGA CGATTCGCAA GCAGTCAGAT TTACCGATTT 2520

GGTTATTTTC TCGGACAGAA GTAATTAGTA AAGTCAATCG AATTATTTAT TTACAATTAG 2580

GTGCAGATGG CGTTTTTGAT TATTCTTATG ATCGTCAAGA ATGCATGTTA AGCATGAGTA 2640

ATTTATTGCA ACGTGTAAAA AGACGTTTTT ATCCTAAATT AGCTATTGCA AACGAGGAAC 2700

AAACAGTTAC TAAGAATCTT TCTGAGAGAT TGTATTTAAT TGCGCCAAAT TTGAGTGTTT 2760

GCTTGGGAAG TGGAGAAGAA ATTCTTTTAA CAAAATTGGA ATTTTTCACA ATAGAGTATT 2820

TATACAAACA CGCAGGACAA ACAATTACAT ATGAAGAACT CTACAAAAAT GTCTGGAAAG 2880

ATACTGCGAA TGAACGAAAA TATCGTGTCG CTAATGTGAT TTTCCATTTG CGCAAAAAAA 2940

TTGAACAAGA TGTGAACAAA CCACAATATA TCAAAACAAT TCGTTCAAAA GGCTATATGT 3000

TAACGGTGTA AAAGGTACGA AGGAGGAGAA AAAGATGTGG CAATTTTTGA CTAAAAACAA 3060

TGAAGATGAG GAGTATGAGA AGATGCCAGA TTTTGAAAAT GAACAATCGA TGAAAGCACA 3120

ACCAACGGTA ACAGAAGAAG CAATCTTTGA TGACGAGGTC GTTTATGCAA AAGAGCAATC 3180

CGCAAAAGTA AAACAGGTCC AGCAACAAGT GCCGACTGAA CTGTCAGAAG AGTCATTTTC 3240

TTTTGTTGAA AAACTAGAAG AAGAAAATCA ACTATATCAG CAATCCATTA AAAAGCTAAC 3300

GGAGCAATTA TTAATGCAAA CCAATGAAGT GGAAGCATTA CAAAAACAAG TAGTCGAAAA 3360

AGATGTTCAA CTTAAACATG TTAAAGAAAC ATTAAGTGAT AAAGAAACAA CTATCACTTC 3420

TTTACAGAAA CAATTGTCTG AAGAAAAGAT GCAACAGAGA CAGACCAGTG aAGAGrATTT 3480

AGACACAGCC GTTACGCTTT CTCAAAAAGA AATTGGCGAG TGTTATTAGA AGCCAAACGT 3540

CAAGCAAAAG AAACATTAGT CAGCCAACCA CCAGTTGCAC ATTCATGAGA ATGGACACGT 3600

TGCACTTTAC ACGCTGAGCA GTGGn 3625


(2) INFORMATION FOR SEQ ID NO: 43: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3758 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CCnCCCCCCT TTTTTTTCCG GTCCCnAATn CCTCCGGCCn TTATTTTAGG TTTCCATTTG 60

GTAAAAATAG TATAGGrGsw AAgCatCaGT TATGTATtAt TAGTGamAGA AGGTTGCcGT 120

GAAGTAAAwA TAAAmAGGAG AGCcAAAGGA TGTTAAmAAA TGTAGAAGTG TTCTGGCAAA 180

ACTTTTTAGA TAAACATGAA TTAGATATGT TGATGCCCGA TQTGTGGATG TTTGGTGATG 240

GCTCTTCTGA GATGGGTAAT CGTTTGGGTC AATTGGTTGT TTCTGGTCGT AAGACAGCGA 300 

CATGCTCTTC TTTAGATATT TATAAGATGG AAGAAGAACA ACTCCCTAAA GCTGGTCAAT 360 

ACGACATAAT TTTAGATGGT CAGTCTCAAC CGCTGGCTAT TATTCGAACA ACCAAAGTTG 420 

AAATTATGCC CATGAACAAG GTCTCAGAAT CTTTTGCGCA AGCGGAAGGT GAAGGAGATT 480 

TGACGTTGGA TTATTGGTAT GAAGAGCATG CGCGGTTCTT TAAAGAAGAA CTGGCCCCCT 540 

ATCAGTTACA ATTTTATCCC GATATGCTGT TAGTTTGCCA AAGTTTTGAA GTAGTGGATC 600 

TTTATACAGA AAAAGAATAG ACAGAGAAGC GGTGACTGAC ACGATTGAAA CGAGCAGTCG 660 

CCGCTTTTTT TTTACTGATT TTTTGAAAAA GTTGCGTATG TTTATATAGA CAGGGTGATA 720 

GAAAAGAGAT GTGTATTTTA AGTGTCGAGG AAAAGATGGT ACAATAACCT AAGGAGAATA 780 

CTAACAAGGA GTGAATGTTA TGAAAGAAGA ATTTTCATAT GAAATTTTAG AAGAAGTCGC 840 

TGTGTTATCT GAGAATGCTC GTGGCTGGCG TAAAGAATTG AATTTAATTA GTTGGAATGG 900 

TCGTCCACCA AAATTTGATT TGCGTGAGTG GGCCCCTGAT CACGAAAAAA TGGGTAAAGG 960 

CATCACCTTA ACTAACGAAG AATTTGCTGA ATTAmGCAAA ACAATTAmaT CCATGTAAAA 1020 

GGAGCACCCT GTCAATGAAA AGTATGACTG GATTTGGAAA AAAGACCATT CAAAATGAAA 1080 

ATTATCAATT GGATATCGAA GTCAAAAGTG TCAACCAGCG GTTTTTAGAT ATTCAATTGC 1140

GTATGCCCAA GGAACTGAAT GCCTATGAAC TGGTGATTCG GCAAGTAATC AAACGCACGT 1200 

TAAAACGTGG GCGTGTGGAA GTCTATGTAA ATCTCCAAAA AATCGGCAAC AATCAAAAAG 1260 

AAGTGCGCGT GCAGTGGGAC TTAATTGATC AATTATTAAC TTCGGTAGAT CAACATTTAA 1320 

AGGAAAAATA TCCGGAAGCA ACGTTTGATG CAGGGGACAC GGTCAACCAT TTATTAAAGC 1380 

AAAATGATTT TGTCGAAATA GTGGAAGCAG AAATTGTTGA TCAGACATTT GAACCATTTC 1440

TTGTTCAAGC ATTTGAAGCA GCCATCGCTA GTCTTGACCA AAGCCGTGTG CAAGAAGGAA 1500 

CGCAAATTAA ACAAGTCCTG CTTGACTATG TAGCAGTGTT GACCCAATCA ATTCAAGAAC 1560 

TGCAAGCGTT TGTGGGCGTT TTTGAACAAG AATACCGTCA ACGTTTTGAA GCGAAATTAA 1620 

ACGAATGGTT AGGTAGCCAA GTGGATGAAA CACGCTTATT AACAGAAATG GCCATTTTAT 1680 

TGGAAAAAGG CGATATTCAT GAAGAATTGG ATCGTTTGGA TATCCATATT GATAAGTTGC 1740 

ATCAGTTGCT TGACGAAACA GAACCAGTGG GCCGAGAACT AGATTTCTTA ATTCAAGAAA 1800 

TGAACCGTGA AGTCAATACG ATTGGTTCGA AATCAAGTCC AATTGAAATA AAAAATAGTG 1860 

TCGTTCAAAT GAAAACAACC TTAGAAAAGA TTCGTGAACA AATTCAAAAT GTCGAATAGA 1920 

GAAAATGGTT TCGTGTTTTT CCAGAAAACG CTATAATATA GAAGTAACGA ATAAATGACA 1980 

GAGGTGAAAG CAATGACTAA ATTTTATGAT GTAACCTTTC AAGAATTAAG TGGTCGTTCT 2040

GTGGTAAAAA CAGAAGTCGC TTCAGACAGA GAACCATTTG ACGTCTGGCA AGATGCTTGT 2100 

GCAAGCTATT CTGAAACAGA ATTAAATATC CAAATTAACG AAGACACGTT TGTTACTTTG 2160 
AATCGTCACT TTGTTGTTCG CATTGACGTG AAAGAAGTTG ACGGTCCTGT CGACAAACAA 2220

GTTCGTCGTC GCGATGAATT GATGAACGTG GTCAATACGC TTTCTAATAT GGGCTTATAA 2280

AAAATACCAC CAAGCATTTC ACGCTTGGTG GTATTTTTTT TAATTAGAAT AAAAGAAACA 2340

AGCCAACGGC GCATAGGAAA GAAAGAACTG CGGCACTTAA AAAGATTGTC CGCATGGATG 2400

PAGTTTTACT GCTTTCTTTG CCGTGTTTTT TCTGACGTTC TAAACGTTTT TGAAGCTGAC 2460

P.GGTAATTAC TGAGAAAATC AGCCCGAAAA GGACTAAAAC ACTTGTGGCA ATCGATTTGG 2520

PAATAAAATG ATTCGGTTCT GCAACGGTGG TATCCCCAGA ATGTTCGTTT GATTCTTCCg 2580

PAP.PTTPPGP TGTTGTTTGT TTTGGCTTAA CTTCTTGGTA AGCCACTTTG GGTGCTTGAA 2640

TAACAATGTC AAGGGCGTGT CGACTTGCAC TTGCCCTGCT ACTAATTTAA 2700

anPTAP.P.PTP, APTGPCtTTG GGAACGACAC CGTATAAATC GTTAGCCAAA GTAACTTTTT 2760

TP.Tf2ATPAAT GGTTTGTGGT CCTTTGTCTA AAAGTTTTTT GCGTTCGTAT GTCGTATAGG 2820

p&tzxp.tppap. PAAAGPATTC CCAGCCACAT GCCGTTCGTA TTCACCATCT TGGTTTTGCC 2880

7iTiTTZlPPAAP GPPTAAAATG ACTTCAATTA AACGGAAATC TCCTTGTTTG ACAGTCGCAA 2940

TZiTZifiTTAAA GCTGCCTGTT GGACTAGAGC CTGTTTTTAA ACCATCCACG CCTTGGTAGC 3000

p7\TAPTTAGP TPPTGGTAAA GAATAATTGT ACGCTTCAAA CGTTTCCTCG TAAGGCGTGC 3060

PP.APPATTGT TTTPAPCACT GGTTTATTGG TGATTTCTAA AATTTCTGGA TAATTTTTCA 3120

PaAAATP,ATA GGTTAGTGAA GCTAAATCTC TAGCTGTAGA GACGTTATCG CCATTTGGGT 3180

PAATTPPTTG fATTTGATAC AGGCCGTTAA AGGCACTTGC TTGCGCTCCA CTGCAGTTGT 3240

AATAGGTAGT ATTTGTCATC CCTAGTTCAG CCGCTTTTTG ATTCATTAAA TGAACAAAGT 3300

PAGTAGGCTG GTTCCCTGAA ATTAAGTTTG CCAACATGAG AGTCGCCACA TTAGAAGAGG 3360

GGAPAGPAAT CATTTTTAAC AGTTCACGGA CTGGATAAGC AACACCTAAC GTAATTTTGT 3420

TATTAPTAAT GGCATATATT TTAGAAATAT CGACATCTTT TTGCGTAGCA GTCACAGTCG 3480

TATCCATTGT AAATTTTCCT TGCTCCATTG CTTCAAAGGC CAAGTACATG GTCATCACTT 3540

TGGCAATACT GGCAGGATTC CACGCTAAAT CTGGTTGCTG CGACCAGAGA ATTTCTCCAG 3600

AATCAGCATC AACAACGATG GCTGCTTTGG GAATGCCATC CGCTTTCAAA GTCGTCCCCA 3660

TTTTTTTAGC AATCGCGGTA ATATCTTCTT CTGCGTGAGT CACAGTAGGT TGGGCAAAAA 3720

AGGCCCCACa ACTGACAACT AAGCCTAGCA GAAACAAT 3758


(2) INFORMATION FOR SEQ ID NO: 44: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TCTTCTAATC CTGATGCATT aGCTGGCGTG GATGCACAAC GTGTAGCAGC TTTCCAAGCG 60

GCTAATGGGA AAGCATTGGT CAATCTACGA AAAGCAACGC AAGCCAACAA AGTAAGTTGG 120

ACTGTTGTAG CTGCTGCTAG TGAAGGCTGG GCAGCTAAAG TCTTTCCAGA ATTAGCAACA 180 

AGTGAAGAAC AAGTTGATGC ACTTTGGAAT GAAATTTTCA AAACAACACG AATTTATGAA 240

GAAAACCCAG TCATTGCTTG GGATATCCAT GATAAAAAAT TACAAGAAAA AGCGGCTGAA 300 

TTAAACGAAC AACAATTTAC TGCTTTACAC TACACAGCTC CTGGGACAGA TTTAACGATT 360 

GGTTTGCCTA AAAACCATTT ATGGGAAGGC GCTGGTAGTT ATAACGCTCG TGGGGAAGAA 420 

TTCATGGCTA ATATGCCAAC AGAAGAAGTT TTCACCGCCC CAGATAGTCG CCGCGTTGAT 480 

GGTTATGTTT CTAGTACAAA ACCACTAAGT TATGCTGGCA CAATCATTTC GGGAATGAAA 540 

TTTACATTTA AAGATGGAAA AGTTGTTGAC TTCTCAGCTG AACAAGGCGA AGAAGCATTG 600 

AAAAATCTCT TAGCTATTGA TGAAGGTGCC AAACACTTAG GAGAAGTTGC TTTAGTACCA 660 

GACCCTTCAC CAATTTCACA ATCTGGTTTG ATTTTTTATA ATACATTGTT TGATGAAAAT 720 

GCCTCTAATC ATTTAGCCTT TGGTTCGGCC TATGCCTTCA ACTTACAAGG TGGCACGGAG 780 

ATGAGCGAAG AAGAATTAGC TGAAGCTGGT TTAAATCGTA GCCAAACACA TGTGGACTTT 840 

ATGGTAGGAT CTGACAAAAT GAATATTGAC GGCATTAAGG AAGACGGGAC GATTGTTCCA 900 

GTTTTTAGAA ATGGCGATTG GGCATAGTCC TACCAAATGT TGAAACCAAA GTTTTTAGCT 960 

CCAAGAAATA AGAAGAAACT GCTTTTATTC AGTAGGTTCT TTTAAAATTT AGAAAGTGAG 1020 

GCCTAAGTTG CAGCGACTTA GGCCTCACTT TCTTATGCTA AAAAATCAAT TGCATTCGAA 1080 

GGACCTTTAA CGTTATAATC AAATTGAAAA CGGATTCTTT TGCACGATAT AAGTTTACTT 1140 

TTAAATAAAA GAATTTTTTT CAAAAAAGTA GAATTGGAGA ATTAACATGG GATTATTTGA 1200 

TGGATTATTA GGAAATGCAA CACAAAATAA TAATGAAACA GCTGAaAAAG AaTTACGGGA 1260 

TGTTTTAATC CCTAACGrAA AAGTAGATAT GGCCTTCACT TTAGTAAGAG ATTTAATTGT 1320 

CTTTACAGAT AAACGCTTAA tTTTrGtCGA TAACCAGGA 1359 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15614 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

ATTAAnATAn ACCAGGTTTT AGATAGTATT TATTCAAGTT TACCTCnAAG GTATTAATCT 60 

GAAACCCkGT TATtTtAtTa GAGAGTTCyT TTTCaAATAT ATGTAAATrG cTAACaACrG 120 

GGTTCTGTTT GTTGTTGGAC tATTTTTTTT GATCcCCAmC CGTTCTTGAA AAAAAAcAAA 180 

AAAATAAACA GTGyTCGCTT GATTAATCGA ACGTTTGTTC GTATAATGAT TTTGCaAGTG 240

AGGTGATGTT TGTGAAATTT GATTATGAAA AAGAACCTAG TCGCGATATT CTTTGTATTG 300 
ATTGCAAAAG TTTTTACGCT AGTGTAGAAT GCGCAGAACG TG?GTTAGAT CCTTTGAAAA 360

CGATGTTGGT AGTGATGAGT AATGCCGAAA ATGCGGGGGG ATTAGTTTTA TCAGCATCGC 420

CAATGGCAAA AAAAGTTTTA GGTGTTTCAA ATGTTACGCG TAAAAATCAA GTGCCCGCTC 480

ATCCTGATTT ATATGTGGTT CCGCCACGAA TGAATTTGTA CATGAAAAAA AATCAAGAAA 540

TCAATAATTT ATATAAAAAG TTTGTCGCAG ATGAAGATCA CTCTGTGTTT AGTGTAGATG 600

AATCATTTCT TGATGTAACT GCTTCTTTAA GTTATTTTCA TTGTGAAACT GCCTATAAAA 660

TGGCTCGGAT TATCCAAAGA GTCGTTTATA ATCATGTGGG TATTTATGTG ACCGTTGGGA 720

TAGGGGACAA TCCGTTACTG GCTAAATTGG CGTTGGACAA TGGCGCCAAA CATTCGCCAG 780

ATTTTATTGC AGAATGGCGG TATGATCGTG TACCAGATAC TGTCTGGCAG TTGCCTTCTT 840

TGACAGATTT TTGTGGGATT GGTCGCAGAA TGGCTAAACG ACTGAATCGT TTAGGAATTG 900

ATTCTGTATA TGAGTTAGCG CAACGAaTCC GCATCTTTTA CAAGAAACAT TTGGCATGAT 960

GGGGCTTCAA TTGTATGCTC ATTCATGGGG GATTGATCGT ACCTTTTTAG GGAAAAAAGC 1020

ACCACATAAA GCTGAAAAAT CATTTGGGAA TAGTCAGATT TTACCGAGAG ATTATGCGCG 1080

AAGAGATCAG ATTGAACTAG TCTTAAAGGA ACTGACAGAA CAAGTGGCTG CTCGTTTAAG 1140

AAAAGCGCAT TGTCAAACGG AATGTATCAC AGTCTACGTT GGTTATTCCA AAGGTCAAAT 1200

TGACCGTGAA GGACGAACTG GTTGGCGAAA ACAACAAACG ATTCCTGCAA CAAACAATAC 1260

AAAAGTTTTG ATCACCTATG TATTAGCCCT TTTTAGAGAA CATTATCTAG CGGGAACCGA 1320

TGTCCGGCAA TTAGGGCTTT CATATGGGAA ATTAGTTTGG AATGAATCTT TACAATTAGA 1380

TTTATTTTCG GAACCAGAAG AGCAAATTAG TGAAATGGAG TTAAACTATC TGATTGATAA 1440

AATCCGACAA AAATTTGGCT TTCAAGCTTT AATTCACACC TCTTCTTTGT TGGAAGGCGC 1500

GACGGCAATT CATCGTTCAG GATtAGtCGG TGGACATGCT GGAGGAAATG TCGGTTTAGG 1560

AGGATAACAT CATGAAACGA ACAAAAAAAG AmTTTACACC TTATAATGAG TACCATGaTC 1620

GGCCCTTTgA ACTGAAATGG GCGACCGCTT TTGCATTGGG GGAGTTACAA GAAGGAATCG 1680

AAGCTTCTGA CGAATATTCT AAGAGAGCGT TTGAACGCTT ACCGCAACAG ACAACAGAAG 1740

AAATTGAAGA ATACTTAGAG CAGTCTTTAA AAAATAACCA AGTTTTAGAa ATCCAATTGA 1800

ATACATTGGA TGATTTTGGC CGTGTTCAAC CGCCAATTGT TGGTGTTTTT GAAGGGATGA 1860

GTTCAATGGA GTATGTAAAA GTTGGACATA CACATATTTT ATTTGAAGAG ATTCGGCATA 1920

TTAAAAGTCA TGAGTTTACA U^hsV^11vJil1XJTi. AAAAACTATT GTTATACGTG 1980

CGTTTTGTGA AAACATATGA GATAATGTTC ATAGGTAATA GAAATAAAAA TTATGGAGGT 2040

GGCTAGGATG GGACATCATT TAAAAGGACA CCATGGCAAA AAAACAAACG CAAATGGCGG 2100

CTTTTTAACA AAGTTATTGG CAGGTGGCGC CTTAGTGGGC GTAGGAAAAA AGTTATACGA 2160

CAATCGTGAA AAAATTAAAG AGCTGTTAAG TGACGACAAA GAAAAAGGCfl ACGACACTGA 2220

AAACAAATAA AGGACTGGAT TGACGGAATG GTAAAAGATG CAGCGTATAT TATTGAAGAG 2280
ATTACCACGA TTTTAGAAGG AACCTATTTT GTGATAACCA AG^AAGCGAA TAAGCGCCGC 2340

ATGTGTTTAC ATGTTTCAGA TACCGCGAAA GAATTTTTAC GTTGTCGACT GGAGTATGAA 2400

aCATTCTTAG AGCAACAGTT TGATTATGCC aCAATGTTTT TAGAAACATG TACGGAAGAT 2460

GTTTCTGTTG AAATTCAAAC GGGACTACTT GAAGAAACAA ATTATCAAGC ATTAACCGTG 2520 

GAAAAATTAC AACAGATAGC CACTAAAATG TAACACAGCT GTTGTGTTAG TTTTGATTCT 2580 

TAATTTGGTT AAGGAGGCGG ATAAAATGTC AGTATATCAA CATATTTTGG TTGCTCTTGA 2640 

TGGCTCAGAT CAATCTGAAA AAGCTTTTCA TGAAGCTGTC AGGATTGCCA AAGAGGAACA 2700 

AGCCACTCTT TATTTAGCTA CAATTATTAA TGATGCTGAA TTTACGACGA GTCCATTTTC 2760 

TTTTGAAGAA TTGTATGATT TAGAAAAACA CAAGTCAGAA GAGATGCTCA CTGGAAAGGC 2820 

AAAACAAGCG AGTGAAATTG GTGTAAAAAC AGTCAAAAAA ATTGTTGAGC TTGGGAGCCC 2880

AAAACGTTAT CTTGCCAACA CGATTTCTGA AAATTATGCC ATAGATTTAA TTGTGCTAGG 2940 

TGCCACGGGA AGAGGTGCTA TTCAACGAAC CTTGATTGGT TCTACTACAG ATTATGTGGT 3000 

TAACCATGCT TTGTGCAATG TCTTAGTCGT TCGCTAATTG TGAGTAAATA ATCAAGGAAA 3060

CAATGTTTTA TCGGGACGTT TTTTTAAATT AAACGAGAGG TGAGTTTATG AAAACAAAAA 3120 

TCGGAAAAAC AGTTATCTTG TCAGCATTTT TATTCACAAG TTTCCTTTTA CTGAGTGGTT 3180 

GTACCTCGGC TGGCGAAGAG ATGGAAAAAA CAATTGATCG ACAGAAAGAA AAAGTCGATA 3240 

AAACGGTCGA TAAGCAGAAA CATAAAAATG AAAATTCCAT GGAAAGTTAC GACGAAAAAG 3300 

TTGACCGTTC TTTAGATAGT CAAGAAGACA AAATCGATAC TACTGAGTAA CGGGAAAGCT 3360 

TTCAACTATC GGCTAG7AAA TAAATAATAT ACAAAAGGGA GCATCCACTG CAATTATTTC 3420

TCACAGCGGA TCCTCTCTTT TGTGGTTAAA AAGAAAAAGT TAACTGCTCC CCAACAGTTA 3480 

ACTTTTAATC ATGTTGCACA TAGTTAAAAC ACTCTATATA TCATTTATAT CATTTGAAAC 3540

GGGTAGTGTA AAAAGAAATC TCTTGAAAAG AGTCTCCATT TATAGGGAAG ATTTCCCATA 3600 

AAAAAGAGAA ACGTATAATC ACAGAGAGTT AGAGAAAGCA AAGGGAAATA AGTATGTTAA 3660 

AATAAGAAGC AAGAAAGAAT ACAAGTATGC AGAAAATAAC AACTAATTAG GTGAAAACAA 3720 

TGGAAATACA TTTTGAAAAA GTAACAAGCG ATAATCGTAA AGCAGTGGAA AACCTACAAG 3780 

TTTTCGCTGA ACAACAAGCC TTCATTGAAT CAATGGCAGA AAATCTAAAA GAATCAGACC 3840 

AATTTCCAGA ATGGGAATCG GCAGGTATTT ATGACGGCAA TCAATTAATC GGCTATGCCA 3900 

TGTATGGACG TTGGCAAGAT GGGCGCGTTT GGCTAGACCG CTTTTTAATT GATCAACGGT 3960 

TCCAAGGACA AGGTTATGGG AAAGCAGCCT GTCGACTTTT AATGTTAAAG CTAATAGAAA 4020

AATACCAAAC AAACAAGCTA TACTTGAGTG TTTATGATAC GAACAGCTCa GCAATTCGGT 4080 

TGTATCAGCA ACTCGGTTTT GTATTTAATG GTGAATTAGA TACGAATGGT GAGCGGGTAA 4140

TGGAATGGAC ACATCAGAAT AAGTAGATAG AACAGGTGAG ACAAAAGCCA TAGCGCATTT 4200 

TGTTTCATCT TTTTCTTTTA ATTGGCTTCA TTTGTTTACT TTTAGTAAGT AAAATGGTAA 4260

ATTTAGTTCA ATCTGTTTTG AAAAAAGAGT TACTGGAGGG TAAGGATGAA AATAGAATTT 4320

TTCCATGATG TTATTTGTAG TTTTTGTTTT CCAATGTCTG ACCGCATGCA TGAGATTCAA 4380

CAAGAATTTC CTGAAATTGA ATTGATTCAT CGGTCGTTTG CGTTAGGATG GTCAGCAAAC 4440

GATTTTGAAA CAATGTTTGG CTCACGAGCA GCAGTAAAAG AAGAAGTTTT GACGCATTGG 4500

GTGCATGCCA ATCAAAACGA TTCAAAGCAT CGTTTTAATA TTGAAGGGAT GAGGAAGCAA 4560

CGATTTGATT TTCCaACATC AAAGAATGTA TTATTAGCAG CAAAAGCAGC TGGATACATT 4620

GGCAACCAAG ACACCTATTG GTTACTGTTT GATAAGTTAC AAGAAGGTTT ATTTGTGAGA 4680

AGTTTAAATA TTGAAGAGCC AGAAGTAATT GAGAAGCTAG TAAAAGAGAC AACGATTGAT 4740

TTTGCTTTAT GGAAAGAAGC TGTTGCTTCC GAGGCTGTTT GGACGGCGGT TCAAGAAGAT 4800

TTTGCGCTAG CTTCTGCTTA TGGTTTACAA GGAGTTCCCG CACTAATCAT TAATCAAAAA 4860

TATCTAATTA ATGGTGCAGT ACCGAAACAG CAAATCAGCC AAACTATTCA AAAAATTCTC 4920

GCAGAAGAAA AACAACAGCA ACCGTTAGTT TCATTAACTC CTTCTTCTGA AGAGTCAGCT 4980

TCGTGTGAGT TTAAGGAAGG CACATGGACG TGTCATTAAG CAGCTTAAAA TAACGTTATT 5040 

CTTTGACAAA ACGAAAGACT TCTGTAATAG TTAATGCGGG GGTGATAATA TGTTTAACGC 5100 

ATATAGAGAA TACTGGAATA ATATTACAAC AATGAATGCG TCACAACaCG AGCACAATAT 5160 

TGGTGGCCAC AATTAATTAA TTATTTGGTG TTAGGGATTT ATTCAGCAAT TACGGGCGTT 5220 

TACAGATATA TTGAAGTAAC ACCCAATGAT GGAACAATCA TTAAAGAATG GAATACTGTG 5280 

ACCATTATAT TTTTCTTGCT GACTGCTTTA ATTTGGTTGG CAAACTTTAC TGTTCGAGCA 5340

AGAAGATTGC ATGATCGCGA CCACAGTAAC TGGTGGATTT TATTCTATTT ACTTCCTTTT 5400 

ATCGGTACTT TAGTGATCTT CATTACATTA ATTTTGCCAA GTAAAAGTCA CACAAGATGG 5460

CCAGTGAATC AATCAGAAAT TTAATTTGCT GGTTTCgTTT AGCTGATTAT CTTAAGAGTG 5520 

AAAATAGGAC ATAAGTCATT TCGACTTATG TCCTATTTTC TTTTTTATTT CAGAAAAATA 5580 

AAAATATCTT CTGTATGATG GCACCTCACA TTACGCGGTT CTTGATAAAG GGGCTATCTT 5640 

TTTGGTAGAT AGGGAAAACC ATTAAGACGT TAAAAGTTTA ATAAATCGAG CAATTCTCTT 5700 

TTGTTAGAAA ATAAGATAGC ATCATTAATT AAAATACTTG GAACACTGGT TTCAATAATT 5760

GGTTGATTGG TAATAATCAA GTCAAATTCT TCGAGTCGAT GGTCACCTAA TTGTTCTTCC 5820 

GTAATTGCTA ACGTATTTGA ACGAATCCGA TGCAAAGCTA AAAtATCGTG GaCATAGTTT 5880 

CCAATATATT GACTATGATT CAGGCCCATG TCGCTAATAA TCAAAATCTT TTTTGAGTCG 5940

AAACTCTGGT TAATTTCTGG AAAAGTATTA AGCAACCAAA AAATATAAGA AGATGAATAG 6000 

TTTAATAGAT TTGTTTCTAA AAGAGCGGAA AAGAAATGCA ATTCTGCTTT TAATCGTGCA 6060 

TATAAGTGAA GCTGAGTAAA ATGAATTTGT CGAGTGAAAA AGGTGGCCCT TTGTGTCAAT 6120

TCGGAGGTGT CAAAAGGCAC TAATTGAGAA AACAAATAAA GACAGGTTAA CAAGTGAACA 6180 

GCGAAGTCGA CGACTTCTTC TGTTAATTGT TCAATCTGTG AAAAAGCATT AAATTTTAAA 6240
TTTTCACGTA TTCTTTGACT GATGACTTCT TGGTCTTCGG AGJAGTATTG CTGAAAAGCG 6300

CCTCTTTGAA AATATCGGAG AACTTCCCGA TAGGTTTCAC GGCTGAGTCC AGTGAAATGA 6360

CGACTGAGTA AAAGATAATC TTCCATTTTG ATTGGACTCT TTTTATCTGT TTTTTTCTGG 6420

AAACGTTGTT GAAATCCTTG ATGCTCTCTT AATAAATCAA CAATAAAATA CATAATAAAG 6480

AAATCGTGTT CATAACTGTT ATCCATaCTA ATAATATTTA ATTGGCATTG ATTACGACGC 6540

ATTAATTGTT TGATTTCCTG CTTATCTAAT TGATAAGGCA ATTCATAATG ACCATAATAC 6600

ATTAAAAAGA AAAAGGTTGT AAAAATACGT AAAGGCAATT CTTCTGTTTG GCTAGTTAAT 6660

TGATAGCCTT GATCAACGAT AACGCGAAGA GAGTAAGTTT CCAAGACCTG GTTACACTGC 6720

TTCAATTTTC GAGCAAAAGA AGAGCGACTA ATATTGATGG TATCGCAATA ATATTCGATG 6780

GTTTTGCCAG GATTTTGGAT AAACTCATGG AGCAATTGGG TGGTCGTTGA CGTTTTTGCC 6840

ATATCCGTCA GCACGTTTAA ATATAAGGAC ACATTAAAGT TTTCTAAACG ATAGCCTAAT 6900

TTATGCGATG TTTTAATGGT TAAACTTTCG CCCCAACGCT GTTTAATCTC CTCAATATAA 6960

CGAATAAAAG TAGACTCAGA ACTACCAATA TATTTTGCAA GTTCTTTTTT AGTCATCCAT 7020

TTATCAGAAA TGGATAAGAC ACGAATGATT TCTTCCATTT TCCGATAATT TTTTTCTAAA 7080

AACCTAATCA TATTGTTTCC TCCCAAGTCC TTTACTTTAA TGAAACAATC ATCGAAACAA 7140

TAACGCAATT TCAGAAATGA AACCGCTTGG TATGAAAGCT GGTGTGATTT AATGAGTTCC 7200

TTCTCTGTAT AATAATTATC AGATTAGAAT AATGAAATTT TACGTGCTAG GCAATTCtaA 7260

TTTAACTGkT AAwTCmTaAA AGgAAAGGAG AGAACAGACA TTAAATTACT AAAGAmATTT 7320

GGGGTTTTCG GCGTAGTCTT TTTACTTGTC TCATCCTATT TCATACCGCT CATTGGCTAy 7380

GCAGAAACAG CCAAAGAAGT TGAAATTACA TCCGCTCAAA TGATAACAGA TGAGAATGAT 7440

AAAACGAATA TTAATATCGA GTTAAATCTT CTCAACCAAA CAGAGCAGCC ATTACAACGA 7500

GAAATTCAAT TGAAAAATGC ACAGTTCATG GATACTGCTG TAATTGAAAA AGACGGATAT 7560

TCTTACCAAG TGACTAATGG TACGCTTTAT CTGACTTTGG ACGCACAAGT AAAAAAGCCG 7620

GTACAGCTTT CGTTAGCTGT TGAGCAAAGT TCGCTTCAAA CAGCTCAGCC ACCTAAGTTA 7680

TTGTATGAAA ACAACGAATA TGATGTTTCA GTTACTTCTG AAAAAATAAC AGTAGAGGAT 7740

TCTGCTAAAG AATCAACTGA ACCAGAAAAA ATAACTGTAC CAGAAAATAC GAAAGAAACT 7800

AACAAAAATG ATTCGGCTCC AGAAAAAACA GAACAGCCGA CCGCAACAGA AGAGGTAACC 7860

AATCCATTTG CAGAAGCAAG AATGGCGCCA gCTACTTTGA GAGCGAATCT GGCACTGCCT 7920

TTAATTGCAC CACAATACAC GACGGATAAT TCTGGGACTT ATCCGACAGC TAATTGGCAG 7980

CCCACAGGCA ATCAAAATGT GTTAAACCAT CAAGGGAATA AAGACGGTAG TGCACAATGG 8040

GACGGCCAAA CGAGTTGGAA TGGGGACCCT ACTAATCGCA CAAATTCTTA TATTGAGTAT 8100

GGCGGTACAG GAGACCAAGC CGATTATGCC ATCCGAAAAT ATGCTAGAGA AACAACAACA 8160

CCAGGGCTTT TTGATGTATA TCTTAATGTG CGTGGGAATG TTCAGAAAGA AATCACGCCA 8220
TTGGATTTGG TCTTAGTCGT TGACTGGTCC GGTAGTATGA ATJ3AAAACAA TCGGATTGGT 8280

GAAGTTCAAA AAGGAGTGAA CCGTTTTGTT GATACATTGG CAGATAGCGG TATTACCAAT 8340

AACATCAACA TGGGCTATGT TGGCTACTCA AGTGACGGTT ATAATAACAA CGCCATTCAA 8400

ATGGGGCCGT TTGATACAGT CAAAAATCGA ATTAAAAATA TTACGCCAAG TAGCACTAGA 8460

GGAGGAACTT TCACTCAAAA AGCATTAAGA GATGCTGGTG ATATGTTAGC AACGCCAAAT 8520

GGACATAAGA AAGTCATTGT ACTTTTAACG GATGGCGTCC CAACCTTCTC TTATAAAGTG 8580

AGTCGAGTTC AAACAGAGGC GGATGGTCGC TTTTACGGGA CACAATTTAC GAATCGACAA 8640

GATCAACCAG GTAGCACTTC TTATATCTCT GGTAGCTATA ATGCGCCAGA TCAAAACAAT 8700

ATCAATAAAC GGATTAACAG TACGTTTATC GCCACGATAG GTGAGGCAAT GGTCTTAAAA 8760

CAACGTGGGa TTGAAATACA TGGATTGGGC ATTCAATTGC AAAGCGATCC ACGAGCTAAT 8820

TTATCTAAAC AACAAGTTGA AGATAAAATG CGTGAGATGG TGTCAGCCGA TGAAAATGGA 8880

GACCTTTATT ATGAATCCGC GGATTATGCA CCAGACATTT CTGATTATTT AGCGAAAAAA 8940

GCCGTTCAGA TTTCAGGAAC GGTTGTAAAC GGAAAAgTAG TTGATCCAAT TGCTGAACCT 9000

TTTAAATACG AGCCAAATAC ATTATCAATG AAAAGTGTGG GTCCTGTTCA GGTTCAAACA 9060

TTACCAGAAG TGTCGCTAAC AGGCGCTACA ATTAATAGTA ATGAGATTTA TTTGGGTAAA 9120

GGGCAAGAAA TTCAAATTCA TTATCAAGTA CGTATTCAAA CAGAGTCAGA AAACTTCAAA 9180

CCTGATTTTT GGTATCAAAT GAATGGTCGG ACAACGTTTC AGCCATTAGC CACGGCCCCT 9240

GAAAAAGTTG ATTTTGGGGT TCCTTCGGGA AAAGCACCTG GCGTGAAGTT AAACGTGAAA 9300

AAAATCTGGG AAGAGTATGA TCAAGACCCG ACAAGTCGGC CAGATAATGT GATTTATGAA 9360

ATTAGTAGAA AGCAAGT^AC TGACACAGCC AACTGGCAAA CTGGGTATAT TAAATTATCA 9420

AAACCAGAAA ATGATACCAG CAATAGTTGG GAGCGCAAAA ATGTAACCCA ACTTTCCAAA 9480

ACCGCGGATG AAAGCTATCA AGAAGTTCTT GGGCTTCCCC AATACAACAA TCAAGGACAA 9540

GCTTTCAATT ATCAAACAAC CCGTGAATTA GCAGTTCCTG GTTACAGTCA AGAAAAAATC 9600

GACGATACTA CTTGGAAAAA CACGAAGCAG TTCAAGCCAT TAGATTTAAA AGTAATCAAA 9660

AATTCTTCCT CAGGTGAGAA AAACTTAGTG GGAGCCGTCT TTGAATTGAG TGGTAAAAAT 9720

GTTCAAACAA CATTAGTGGA CAATAAAGAT GGTAGCTATT CCTTGCCAAA AGATGTGCGC 9780

CTACAAAAAG GGGAACGCTA TACATTAACT GAAGTAAAAG CACCTGCAGG ACATGAGTTA 9840

GGCAAGAAAA CGACTTGGCA AATTGAGGTG AGTGAGCAAG GCAAAGTAAG CATCGATGGA 9900

CAAGAAGTGA CCACCACAAA TCAAGTTATT CCATTGGAAA TTGAAAATAA ATTTTCTTCT 9960

TTGCCAATCA GAATTAGAAA ATACACCATG CAAAATGGCA AACAAGTGAA CTTAGCAGAG 10020

GCGACTTTTG CGTTGCAAAG AAAAAATGCT gCAAGGAAGT TACCAAACTG TGGCAACTCA 10080

AAAAACAGAT ACTACAGGAT TGAGCTATTT TAAAATTAGT GAACCTGGTG AGTATCGAAT 10140

GGTGGAACAA TCAGGACCAT TAGGCTACGA CACTCTTGCT GGAAATTATG AATTTACTGT 10200
TGATAAATAT GGGAAAATTC ACTATGCAGG CAAAAATATT GAAGAAAAIb LbCbAbAAIb 10260

GACACTGACA CATCAAAATA ATTTGAAACC TTTTGACTTA ACAGT1AA1A AAAAAbbbbA 10320

TAATCAGACG CCACTTAAAG GAGCGAAATT CCGTTTAACA GGACbAbA1A bbbAT-AlIbA 10380

ATTACCAAAA GATGGCAAAG AAACGGATAC TTTTGTTTTT GAAAACTTAA AACCAGGGAA 10440

ATATGTTCTA ACAGAAACCT TTACGCCAGA AGGATATCAG GGGTTAAAAG AACCAATCGA 10500

ATTAATAATT CGTGAAGATG GTTCAGTCAC GATAGATGGG GAAAAAGTAG CAGATGTTTT 10560

AATTTCTGGA GAGAAGAATA ATCAAATTAC TTTAGACGTT ACGAACCAAG CAAAGGTTCC 10620

TTTACCTGAA ACTGGTGGCA TAGGACGCTT GTGGTTTTAC TTGATAGCGA TTAGTACATT 10680

CGTGATAGCG GGTGTTTATC TCTTTATTAG ACGACCAGAA GGGAGTGTGT AATCAATGAA 10740

AAACGCACGT TGGTTAAGTA TTTGCGTCAT GCTACTCGCT CTTTTCGGGT TTTCACAGCA 10800

AGCATTAGCA GAGGCATCGC AAGCAAGCGT TCAAGTTACG TTGCACAAAT TATTGTTCCC 10860

TGATGGTCAA TTACCAGAAC AGCAGCAAAA CACAGGGGAA GAGGGAACGC TGCTTCAAAA 10920

TTATCGGGGC TTAAATGACG TCACTTATCA AGTCTATGAT GTGACGGATC CGTTTTATCA 10980

GCTTCGTTCT GAAGGAAAAA CGGTCCAAGA GGCACAGCGT CAATTAGCAG AAACCGGTGC 11040

AACAAATAGA AAACCGATCG CAGAAGATAA AACACAGACA ATAAATGGAG AAGATGGAGT 11100

GGTTTCTTTT TCATTAGCTA GCAAAGATTC GCAGCAACGA GATAAAGCCT ATTTATTTGT 11160

TGAAGCGGAA GCACCAGAAG TGGTAAAGGA AAAAGCTAGC AACCTAGTAG TGATTTTGCC 11220

TGTTCAAGAT CCACAAGGGC AATCGTTAAC GCATATTCAT TTATATCCAA AAAATGAAGA 11280

AAATGCCTAT GACTTACCAC CACTTGAAAA AACGGTACTC GATAAGCAAC AAGGCTTTAA 11340

TCAAGGAGAG CACATTAACT ATCAGTTAAC GACTCAGATT CCAGCGAATA TTTTAGGATA 11400

TCAGGAATTC CGTTTGTCAG ATAAGGCGGA TACAACGTTG ACACTTTTAC CAGAATCAAT 11460

TGAGGTAAAA GTGGCTGGAA AAACAGTTAC TACAGGTTAC ACACTGACGA CGCAAAAGCA 11520

TGGATTTACG CTTGATTTTT CAATTAAAGA CTTACAAAAC TTTGCAAATC AAACAATGAC 11580

TGTGTCGTAT CAAATGCGTT TAGAAAAGAC CGCTGAACCT GACACTGCGA TTAACAACGA 11640

AGGACAATTA GTCACGGACA AACATACCTT GACTAAAAGA GCCACAGTTC GTACAGGCGG 11700

CAAGTCTTTT GTCAAAGTTG ATAGTGAAAA TGCGAAAATC ACCTTGCCAG AGGCTGTTTT 11760

TATCGTCAAA AATCAAGCGG GGGAATACCT CAATGAAACA GCAAACGGGT ATCGTTGGCA 11820

AAAAGAAAAA GCATTAGCTA AAAAATTCAC GTCTAATCAA GCCGGTGAAT TTTCAGTTAA 11880

AGGnnTTAAA AGATGGCCAG TACTTCTTGG AAGAAATCTC TGCACCAAAA GGTTATCTTC 11940

TGAATCAAAC AGAAATTCCT TTTACGGTGG GAAAAAATTC TTATGCAACG AACGGACAAC 12000

GAACAGCACC GTTACATGTA ATCAATAAAA AAGTAAAAGA GTCAGGCTTC TTACCAAAAA 12060

CAAATGAAGA ACGTTCTATT TGGTTGACGA TTGCAGGCCT GCTAATCATT GGGATGGTAG 12120

TCATTTGGCT ATTTTATCAA AAACAAAAAA GAGGAGAGAG AAAATGAAGC AATTAAAAAA 12180
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AGTTTGGTAC 


ACCGTTAGTA 


CCTTGTTACT 


AATTTTGCCA 


CTTTTCACAA 


GTGTATTAGG 


12240 


GACAACAACT 


GCATTTGCAG 


AAGAAAATGG 


GGAGAGCGCA 


CAGCTCGTGA 


TTCACAAAAA 


12300 


GAAAATGACG 


GATTTACCAG 


ATCCGCTTAT 


TCAAAATAGC 


GGGAAAGAAA 


TGAGCGAGTT 


12360 


TGATAAATAT 


CAAGGACTGG 


CAGATGTGAC 


GTTTAGTATT 


TATAACGTGA 


CGAACGAATT 


12420 


TTACGAGCAA 


CGAGCGGCAG 


GCGCAAGCGT 


TGATGCAGCT 


AAACAAGCTG 


TCCAAAGTTT 


12480 


AACTCCTGGG 


AAACCTGTTG 


CTCAAGGAAC 


CACCGATGCA 


AATGGGAATG 


TCACTGTTCA 


12540 


GTTACCTAAA 


AAACAAAATG 


GTAAAGATGC 


AGTGTATACC 


ATTAAAGAAG 


AACCAAAAGA 


12600 


GGGTGTAGTT 


GCTGCTACGA 


ATATGGTGGT 


GGCGTTCCCA 


GTTTACGAAA 


TGATCAAGCA 


12660 


AACAGATGGT 


TCCTATAAAT 


ATGGAACAGA 


AGAATTAGCG 


GTTGTTCATA 


TTTATCCTAA 


12720 


AAATGTGGTA 


GCCAATGATG 


GTAGTTTACA 


TGTGAAAAAA 


GTAGGAACTG 


CTGAAAATGA 


12780 


AGGATTAAAT 


GGCGCAGAAT 


TTGTTATTTC 


TAAAAGCGAA 


GGCTCACCAG 


GCACAGTAAA 


12840 


AT AT AT CC AA 


GGAGTCAAAG 


ATGGATTATA 


TACATGGACA 


ACGGATAAAG 


AACAAGCAAA 


12900 


ACGCTTTATT 


ACTGGGAAAA 


GTTATGAAAT 


TGGCGAAAAT 


GATTTCACAG 


AAGCAGAGAA 


12960 


TGGAACGGGA 


GAATTAACAG 


TTAAAAATCT 


TGAGGTTGGT 


TCGTATATTT 


TAGAAGAAGT 


13020 


AAAAGCTCCA 


AATAATGCAG 


AATTAATTGA 


AAATCAAACA 


AAAACACCAT 


TTACAATTGA 


13080 


AGCAAACAAT 


CAAACACCTG 


TTGAAAAAAC 


AGTCAAAAAT 


GATACCTCTA 


AAGTTGATAA 


13140 


AACAACACCA 


AGCTTAGATG 


GTAAAGATGT 


GGCAATTGGC 


GAAAAAATTA 


AATATCAAAT 


13200 


TTCTGTAAAT 


ATTCCATTGG 


GGATTGCAGA 


CAAAGAAGGC 


GACGCTAATA 


AATACGTCAA 


13260 


ATTCAATTTA 


GTTGATAAAC 


ATGATGCAGC 


CTTAACTTTT 


GATAACGTGA 


CTTCTGGAGA 


13320 


GTATGCTTAT 


GCGTTATATG 


ATGGGGATAC 


AGTGATTGCT 


CCTGAAAATT 


ATCAAGTGAC 


13380 


TGAACAAGCA 


AATGGCTTCA 


CTGTCGCCGT 


TAATCCAGCG 


TATATTCCTA 


CGCTAACACC 


13440 


AGGCGGCACA 


CTAAAATTCG 


TTTACTTTAT 


GCATTTAAAT 


GAAAAAGCAG 


ATCCTACGAA 


13500 


AGGCTTTAAA 


AATGAGGCGA 


ATGTTGATAA 


CGGTCATACC 


GACGACCAAA 


CACCACCAAC 


13560 


TGTTGAAGTT 


GTGACAGGTG 


GGAAACGTTT 


CATTAAAGTC 


GATGGCGATG 


TGACAGCGAC 


13620 


ACAAGCCTTG 


GCGGGAGCTT 


CCTTTGTCGT 


CCGTGATCAA 


AACAGCGACA 


CAGCAAATTA 


13680 


TTTGAAAATC 


GATGAAACAA 


CGAAAGCAGC 


AACTTGGGTG 


AAAACAAAAG 


CTGAAGCAAC 


13740 


TACTTTTACA 


ACAACGGCTG 


ATGGATTAGT 


TGATATCACA 


GGGCTTAAAT 


ACGGTACCTA 


13800 


TTATTTAGAA 


GAAACTGTAG 


CTCCTGATGA 


TTATGTCTTG 


TTAACAAATC 


GGATTGAATT 


13860 


TGTGGTCAAT 


GAACAATCAT 


ATGGCACAAC 


AGAAAACCTA 


GTTtCACCAG 


AAAAAGTACC 


13920 


AAACAAACAC 


AAAGGTACCT 


TACCTTCAAC 


AGGTGGCAAA 


GGAATCTACG 


TTTACTTAGG 


13980 


AAGTGGCGCA 


GTCTTGCTAC


TTATTGCAGG 


AGTCTACTTT 


GCTAGACGTA 


GAAAAGAAAA 


14040 


TGCTTAATTT 


CTAGCATCAC


CGAAGAAATT 


TTTAGAAAAA 


CAAAGAGCCT 


GGGCCAATCA 


14100 


CTGTCCCAGG 


CTCTCATGCT 


TTATTTTTAA 


GGAGGAAGCA 


ATGAAGTCAA 


AAAAGAAACG 


14160 
TCGTATCATT


GATGGTTTTA 


TGATTCTTTT 


ACTGATTATT 


GGAATAGGTG 


CATTTGCGTA 


14220 


TCCTTTTGTT 


AGCGATGCAT 


TAAATAACTA 


TCTGGATCAA 


CAAATTATCG 


CTCaTTATCA 


14280 


AGCAAAAGCA 


AGCCAAGAAA ACACCAAAGA AATGGCTGAA 


CTTCAAGAAA 


AAATGGAAAA 


14340 


GAAAAACCAA 


GAAT I AGCGA 


AAAAAb G C Ab 


CAA 1 v^L, 1 own 


TTAGATCCTT 


TTTCTGAAAC 


14400 


GCAAAAAACA 


ACGAAAAAAC 


C AG AC AAA I C 


T 7A T T T T ft 21 7A 


AGTCATACGA 


TTGGTGTTTT 


14460 


AACCATTCCA 


AAAATAAATG 


TCCGTTTACC 


An 1 i 1 1 1 1 


AAAACGAATG 


CATTGCTATT 


14520 


GGAAAAAGGA 


AGCTCCTTGT 


TAGAAGGAAC


C 1 CC inl ^V- 1 


ACAGGTGGTA 


CGAATACACA 


14580 


TGCGGTCATT 


TCAGGCCATC 


GTGGTCTCCC 


TCAAGLCAAA 


TTATTTACAG 


ATTTGCCAGA 


14640 


ATTAAAAAAA 


GGCGATGAAT 


TTTATATCGA 


AG 1 C AA 1 GGG 


AAGACGCTTG 


CTTATCAAGT 


14700 


AGATCAAATA 


AAAACCGTTG 


AACCAACTGA 


T AC AAAAG A I 


TTACACATTG 


AGTCTGGCCA 


14760 


AGATCTCGTC 


ACTTTATTAA 


CTTGCACACC 


GT AT A 1 G A 1 A 


AACAGTCATC 


GGTTATTAGT 


14820 


TCGAGGACAT 


CGTATCCCAT 


ATCAACCAGA 


AAAAGCAGCA


GCGGGGATGA 


AAAAAGTGGC 


14880 


ACAACAACAA 


AATTTACTAT 


TATGGACATT


ACTTTTAATT 


GCCTGTGCGT 


TAATTATTAG 


14940 


CGGCTTCATT 


ATCTGGTACA 


AGCGACGGAA 


AAAGACGACC


AGAAAACCAA 


AGTAGTATGA 


15000 


CGAAAAGGCT 


AAACATACTA 


AAAAAAAGAG 


TAAAAAAATA 


GCTTTTCAAT 


TTTTAATCCT 


15060 


CCTTATCGTG 


CATAATTGAA 


CCAGAGAAAC 


AGAAGTATIA 


ACGAAATAAC 


TAAAAGAGCA 


15120 


AGCCCTGAAT 


AAAAAGCGAC


AAAbooCCAA 




GTTTAAATTC 


CTGCCAAGTT 


15180 


TGGATTTTTC 


TGTTTTTTTT 






GTAAATAATT 


CAATAGTAAG 


15240 


AGGAGTAGCA 


ACACCGTGAA 


ATCATTTGTG 


GTAAAAAGCA


CATGTAAAAA 


TAGAATGACA 


15300 


AAGACAACAC 


GGGATAACAC 


TCGATTCCGC 


AAAATTAAAA 


ATAACTTAGC 


ACGCATAATA 


15360 


AACCACCATT 


TCTTaTCAGA 


GATAATGAAT 


CTGTTTTTGT 


CTACTCTTTA 


GTTATATCAT


15420

AAAATTCTTA 


ATAATGAAAA 


AATGACTCGA 


GAAAATAATT 


GAAAAAAGTT 


TTTTTTCCTG 


15480 


AATCATTATT 


TTCGTAAATA 


AAGAATAAAC 


GTGTTACTCT 


TGGCTTATCA 


AATTTGGAAG 


15540 


GAGTGTTAAA 


AATGAAATAT 


CTGGATATTA 


TTGCTTTAAT 


TTTATTGATT 


GTCGGAGGTT 


15600 


TAAACTGGTT 


ATTA










15614 


(2) INFORMATION FOR SEQ ID NO: 46:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TTCACAAAAA AATGAAAACA CAAATTTAAC TTGGAGGATG TTGAGTAGTG AGCAACGGGA 60
AAAAGATTTT nATTAGTCAT AGTTCAAAGG ATcAAGAATA TGTGGATGCG TTTATCCAAT 120
TATTGAAAAA GTTTGGGTTT CGAACACAGG ATATTTTTTA TAGTTCCACT ATTGAAACAG 180
GTGTACAGCC AGGAGAATTA ATTTTTGACA CGATTAAAAG GGAGCTAACC AATCAACCTG 240
TCATGCTGTA TTTTTTATCT GATCATTATT ATCAGAGTAT TCCATGTTTA AACGAAATGG 300
GTGCTTCTTG GATGCTATCA GATAAACACT ATCCAATTGC GTTAAATAAT TTTTCTATGA 360
AAGATATGAA AGGTGTTATT AGTAGTGAGC GTTTGGCGAT TGCGTTTAAT GATAAGACAA 420
GTACTAATGA AATAAACTGc TTATTAAAAA AGTTATCTCA TGAGACAGAT GTACAAGCTG 480
AGCCAGATTT TGAACTAAAC GTTGAGAAAA ATATTCAGCC ATTCCAAAAT AAACTGACAC 540
AACTGATTCG GCAAGCAAGT TATTTAAAAC CTGATGAAAA GGGATATTTT GAAACGrTTC 600
tTAGCACTCA TCGTCCTGTG TATGGCACAG CAAAAGGGGT CTATGACTGT TTTAAATTAC 660
CTAGCTTAAT TGAGCCTAAA AGTTTAGGTT TAGACACGCT ATCAGAGGAT GAAAGCCATT 720
GGCTATTCTT TTTTCTTACG TGGGGAACTT TTCAAGAAGG CGAAAAAGTG CGGTTTAAAt 780
TGAAAAAGGa TAAGGcGTAT AACAATCGGG AATTCAGTGA TATTGGTAAA TGTAAAAATA 840
TTTACGTCTC TTATTTAGAG AAGGTAGAGT AATTGGAGGA GGTAATAAAT GAAAATGAAT 900
TGGcGTCAGT TGGTTGGTAC ATATCGTATT TCTGATAATG GTAAACAATA TCAAAATGAC 960
TGGTTAAATA AGCAAGTTGA TTTAGCCACA GAACTTCGTA AATCTTTTGA TAGTGACTAT 1020
CGTAGAGTAA TTAAAAGTGA TGCATTTaGA CGCTTwnCAa GAkAwAACAC cAAGTTTTyC 1080
CTTTwGrAnC GTAATGaTTT TgGTAAGAAC aCGTTTAACA CATAGCTTAG AAGTGGCAAT 1140
GCATAGCAGG GATATACTTA GATTAGTTAT TTCACAGTTG AACGAACGAA ACATTGAGGT 1200
TATGGAATTA TCAGAATGCT ATCGGTTGTT AGAAACAGCT GCATTAATCC ATGATATCGG 1260
CAATCCCCCT TTTGGACATT TTGGGGAAGA AGCCATTCGT ATTTGGTTTG AAAAAAATGG 1320
TCCTTTGTAT GCTTGTTGGT CTGATTTTTC GGAGCAACAA CAAAATGATT TTCTTAACTT 1380
TGAAGGAAAC GCTCAAACCA TCCGACTTTT AACAAAACTG CATCATGATA ATGGGACTTC 1440
TAAAGCAGGA ATGAAACTAA CAGCCACGAC ATTAGATACA GTTATCAAAT ATACGGCATC 1500
TTCAGATCAA TTAAATAAGC AACAGCTTTT GTCTAAAAAA GTTGGCTATT TTTACTCGGA 1560
AAAACAAGTG TTTAACGCCA TTAAATTTGC GACAGGAACA CmAAATAAGC GACACCCATT 1620
AGTCtTTTTG TTAGAAGCTT CAGATGATAT CGCCTATACT TTTTCAGATA TCGAAGATGC 1680
CTATAATTAT GGTTTATACA GCTATCaAGA TTTAAAGCaG TTTGTTGATG CGCAAACAGG 1740
AACAAAAAAA TTTTTAATGT TAGAAAAAGA TAAAACAGAA GAAGCTACCT TACAAGAATT 1800
TTTGAGAAAG ACACAAAGAG CTGTTTGTCA AAGCGTAGCG AAAAATTTTG CTAATCATTA 1860
TGACTTAATT ATGAGAGGTA TCTATCATAA TGAAATTATC ATTGAAGATT GTGATGAAGT 1920
GAAATGCTTT CAGGCATTAA AAAACTTTTC TAGAGTCTAT GTTTTTCAAA CAAAAACAAT 1980
TTTGGATCAA GAGGTATTGG GCTTCAATAT CATCAATCGT TTACTAGATG AATTTGTTCC 2040
AGTCGTACTG AAATATGAAA AAGTTTCCAT GAATAAGTAT GAGGAGCGAA TCTTTAATAA 2100
TATTTCGGAA AGCGCGAAAG CGTTGTATAG AAGAGAAGCA AAGAACGCTA CAGAAGCAGA 2160
2220
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3169 



AAAAGACTAT 


TATCGTTTGA 


AAATGGCGGT 


GGACTTTGTA 


TGTAATATGA 


CGGATGGATA 


TGCCAAAAAA 


GTTTATGATA 


CATTATTCAC 


TTAAAATTTG 


AGAAAAAGCT 


TCTCAGCGGT 


TGCCGCCGCT 


GAAGAAAACT 


AGTTCATTGC 


ACAGGCTGAT 


CGTTTCAGTA 


AAAATTTTAA 


AGATGCTAGT 


AATTTCACTG 


AAAATCGGGT 


ACACTAAAAT 


TAGCTAGTGT 


AACTAGCAAA 


AAAGGCAACT 


GGAATAGriTA 


GGTTGTGCTG 


TCTATGAAGG 


GCTTTGTTTG 


TCAGGTGGTG 


CTGACATGCA 


AAGTGCCTTT 


TTTCTTTTTC 


TCAACATTTT 


AAGGACAAAA 


AACGATTGCA 


ACATGTCACA 


ATCGCTAACC 


TCGGAGAAAA 


AGGAATTTTT 


CTCCAATGGC 


CATTTCGATA 


ACCTGTTGTG 


TGTTTCGAAA 


CATTCAGAAC 


GGTGGTGCGT 


TCTTTCAGTG 


CCTAATTATC 


CAATTCGTTT 


CTTGTTCCTC 


TATTTACAGT 


ATACTGAATG 


AAAGCCTATT 


CTGCAATGAT 


AGAATCAGGT 


TCCTGTGATT 


AAATAACATA 


AAAAATGGGC 


TCCTTATATA 


AAGAAAAATA 


ACAAAAGAAA 


TTTTGAATAT 


TGACGTTTTT 


TTTAGGTACG 


TTATGATAAG 


CGAGAGGTGA 


111 Inlul In 


r\r\r\r\ \- O ^ rtrVrt 


TTftTTfUBTT Jl 
1 lui 1 ort i in 


TTTAPfiPAAT 


CAAACAGCTT 


TTTTTGAACC 


ATCTTTAGTC 


AGTGAAATTT 


TCACAGCAAG 


CAATATTGCA 


ACGACATTTT 


CTATCAAACG 


AAATACAGCT 


AGTCACTATT 


TAAATCAGTT 


AAACGAAGAA 


GGAATTCTAG 


TTAAGATCAA 


TACACGACCG 


GTTTACTTTT 


TTCATAAAgA 


AGCGTTTCAG 


CAACAAAACT 


ATCTTTTAAA 


AAGGAACTGT 


TTATCCAAGT 


TTCCAAGAAA 


TGATCGATGA 


ACAGCCTGTT 


TTTGGATCGC 


CAAAGnGGAn 


TTTTnTTCAG 


TCGGGTCATn 


GGGTATCGCG 


GGTAGTTTA 





(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17087 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

GTCTATACAA GTTCACAAGG AGGCGCCAAA CATGACCATT TTAGAATTAA CATTCGCAAC 60 

GAACCAACTA ACGACGACTT TTTTTAACCA TAAATTGGAA CCGCTTCACC AAAAAACATT 120 

CCCTTTTGAA GCCTTGACTA TCGAAGAAAC GGTTCCTGAA CTGTGCCAAT TGATTTTGGC 180 

AGAGGCGGCA CCGATTCTGG GGGGGATTGT GGCTGATTTC CCAGCGGACT TCTTTTCGCC 240 

AGAGGAAACG TTGGCGCTGG TTCCGCAAGC GGTGATTCAA GCATTCGCGG AACAGTTGAA 300 

CACGCCACTT TTAACGGCAG CCGAACAAGC GGCGGCACCG AATCTGCTAG AAGCATTTGA 360 

AGAAAAACGG cAGTATCTGG CTTGGCATTT GGACTATTAT GGCTACGTGC CTGGGAAAAA 420

TGAATATGCG GTGGAGTCCC TGTTAACTGT CGGCAATGGC TTTTTAGGCT TGCGCGGCAC 480

GACACCTGAG ATGACGATTT CTGATGATCA TTATCCAGCG ACCTACATTG CGGGGCTTTA 540

TAATACAGCG GCTTCGGAAG TTGCGGGTCA AGTAGTGGAA AATGAAGATT TCGTCAATGC 600 
TCCTGACAAC


CAACACATCG 


CTTTAAAAAT 


TGGCGATGCC 


ACTGACTGGT 


TAACGATTTC 


660 


TCCAGACACG 


CTACAACAAT 


TACATCGTCA 


ATTGAACTTG 


AAAACGGGGT 


TGTTTGTCGC 


720 


AGAAATGATT 


TTGAAAGATG 


CGGACAATCA 


GCAAATTAAA 


TTAACGACGA 


AAAAAATCGC 


780 


CAATATGGCA 


CAGCCGAATG 


ACTATCATCT 


GCAATATACG 


TTTGAGCCGT 


TAAACTTTTC 


840 


AGCCCCGATT 


ACTTTGAAAA 


CCGTCACGGA 


TGGCAGTGTC 


TACAACTATA 


ACGTGGCGCG 


900 


GTATCGCAAC 


TTAACAGCGA 


AACATTTTCA 


GGTGACAGCA 


TTAAGCGCGC 


AGGAAAACAA 


960 


AACCGTGATT 


GAGGTTTGTA 


CCAATCAATC 


CAACTTAAGC 


GTCCGAGAAA 


CGTCTTTGAT 


1020 


AACCGGTGAT 


TTTTTTGAAA 


AAGAAGCCAT 


TACGATTCAA 


GAAGAAGCGG 


AGAAAATCGC 


1080 


CCAAGTGGTG 


ACCGTGATGG 


CGCACCAAGG 


AACGCGCTAT 


ACGCTAGAAA 


AACAAGTCTT 


1140 


TGTACAAGCC 


AGCCACGCCG 


AACAAAGTTG 


GCAGGTACCG 


TTTACGCCGA 


AGGACTCTTT 


1200 


TGCGGCCGCC 


GCTCAGGAAA 


GCGCCCGCGC 


TTGGCAAACT 


CTTTGGCAAC 


AAGCCAACAT 


1260 


TACCGTGACA 


GGCGATTTGA 


TGTCTCAAAA 


ACTTTTACGG 


ATTCACAGTT 


ATCACTTGCT 


1320 


GGCTTCGGCT 


TCGCCTTTTA 


GTAACCAAGC 


GCAAGCATTG 


GACGTCTCGA 


TTACCGCCCG 


1380 


AGGCTTGCAT 


GGTGAAGCGT 


ATCGTGGACA 


TATTTTCTGG 


GATGAAATTT 


TTATTTTACC 


1440 


TTTTTATATT 


CAACATTATC 


CTGACACAGC 


GAAACAATTG 


TTGCTGTACC 


GCTACCACCG 


1500 


ATTAGAGAAG 


GCGAAGGAAA 


ATGCGGCGGC 


CAGCCAGTAT 


CGTGGCGCAA 


TGTATCCGTG 


1560 


GCAATCAGGG 


CGTGATGGGC 


GTGAAACCAC 


GCAAAAACTT 


CACTTAAATC 


CGTTGAATGG 


1620 


ACATTGGGGC 


GAAGATCATA 


GTATTTTGCA 


ACGGCATGTT 


TCCTTGGCGA 


TTGCTTACAA 


1680 


TGCTTGGTTG 


TATTGGCATA 


GTACACAGGA 


TCATGAATTT 


ATGAAACAAT 


ATGGCGGCGA 


1740 


AATGTTGTTA 


GAAATCGCTC 


AATTTTGGAA 


CAGCGCTGCG 


ACGTTAGATG 


ACGCAACAGG 


1800 


ACGGTTCTTT 


ATTGATAAAG 


TGATGGGACC 


TGATGAGTTT 


CACGAGGGCT 


ATCCAGACCA 


1860 


AGCGGAAAGC 


GGCTTGAAAA 


ACAACGCCTA 


TACGAATTTG 


ATGGTGGTTT 


GGCTATTCGA 


1920 


AGAATTAACC 


AACATTTTGG 


CACTTTTCTC 


TGAGGAAGAA 


CAGGCACAAC 


TTTTTGCCAA 


1980 


AACGCAGACC 


ACTTCGGCTG 


ATTTAGCCCG 


CATGCAGCAA 


ATTCAAAATT 


CACTGGAAAT 


2040 


TGAAGTTAAT 


TCTGACGGTA 


TTATTGCGCA 


ATATGAAGGG 


TATTTCGGCT 


TAAAAGAGAT 


2100 


TGACTGGACA 


GCGATGAAAG 


AGAAATACGG 


CAATATTTAC 


CGCATGGATC 


GAATTTTAAA 


2160 


AGCCGAAGGA 


GAATCCCCCG 


ATGATTACAA 


AGTGGCGAAA 


CAAGCCGACA 


CGTTAATGTT 


2220 


GTTCTACAAC 


TTAGACAAAA 


CTCGGGTGGA 


TCAAATTTTA 


GAAGATTTAG 


GGTATCAATT 


2280 


GCCTGCGGAT 


TATTTGGAAA 


AAAATCTTTT 


GTATTATCTA 


AAACGGACGT 


CTCATGGTTC 


2340 


GACTTTGTCA 


CGAATTGTTC 


ATGCCCAATT 


GGCAGAAATG 


GCGCAATTCC 


ATGAGCTATC 


2400 


TTGGCAACTT 


TATCAAGAAG


CGTTGTATTC 


CGATTATCGT 


GACATTCAAG 


GTGGCACGAC 


2460 


CGCTGAAGGG 


ATTCATACAG 


GGGTCATGGC 


TGCGACGATC 


CACGTTACTT 


TAGCGACGTA 


2520 


TGCAGGCGTT 


GACACGCGCC 


AGAACGAGTT 


ATCAATTTGT 


CCAAACTTAC 


CTGAACATTG 


2580 
GCAAGCGCTG


GCCTTCCAGT 


TTATTCATCA


AGGCGTGACG 


TATCAATTTT 


CATTAACACA 


2640 


AACCAGTGTC 


ACGATTACAG 


CAGATAGAGA 


CACACAATTA 


TTGGTTCAGG 


GCGCGTTGAT 


2700 


TCCGTTAACC 


GCCGAACGAC 


CAAAAGAAGT 


ACATTATCAA 


TAAAGGAGCG 


AACCACAATG 


2760 


ACGATTGGAT 


TTATTTTTGA 


TTTGGATGGT 


GTCATTACTG 


ACACCGCCAA 


ATTTCATTAT 


2820 


CAAGCTTGGA 


AAGCGTTGGC 


CGATTCGCTA 


GGTATCCCCA 


TTGACGAAAC 


CTTCAACGAA 


2880 


ACATTAAAAG 


GCATTAGTCG 


GATGGATTCC 


TTGGACCGTA 


TTTTAGTCCA 


TGGCCACCGC 


2940 


GAAAATGCGT 


TTACTCCAGC 


AGAAAAAGAA 


GCCTTAGCTC 


AGCAAAAAAA 


TGAnCaCTAC 


3000 


GTTCAATTAT 


TAGAACAATT 


AACAACAGAA 


GATGTTTTAC 


CTGGTGTCGT 


GCCATTGCTA 


3060 


CAGCAAGCCC 


AAGCACGTCA 


CATTCCCTGC 


GCCGTGGCTT 


CGGCTTCAAA 


AAACGCGCCC 


3120 


CTGATTTTGG 


AAAAATTAGG 


CGTGCGCGCG 


TAcTTCGCCA 


CAATTGTCGA 


TCCCGACTCG 


3180 


TTAAGTAAAG 


GCAAACCTGA 


TCCTGAAATC 


TTTTTAGCCG 


CCGCTGACAG 


TATTGGCGTG 


3240 


ctACCGCAAA 


ACGCCATTGG 


CTTTGAAGAT 


GCACAATCAG 


GCATTGACGG 


CTTGAAAGCC 


3300 


GCGGGCATCT 


ATGCCGTCGG 


CTTGTCGGCC 


AGCCAACCCT 


TAATCGGAGC 


CGACATGCAA 


3360 


GTTTCTGAAA 


TGACTGAACT 


AAGCGTTGAT 


GCACTGCTCA 


ATCGCTAAAG 


AATGACAAGA 


3420 


ACCTTTAGCA 


GCAGGCACCT 


CTTTTCATCT 


ACATTGTATC 


TATGCTAGAA 


TGTAGATGAA 


3480 


AAGAGAGAGA 


TGCGTCAACA 


TACCTCTCTA 


GTGTAGAGCC 


GTTTGAGACG 


GTGACCAATT 


3540 


TAGTTATAAA 


AAATAACCGT 


ACCGGTCAAA 


GTAGACGGTT 


ATTTTTTATT 


GTCATTACAC 


3600 


TCCATTATTG 


GATTAAAAAG 


GATTGAGTTG 


AACGTGGTTC 


TGTGGTAGAC 


TTTTATTAGA 


3660 


AACAAGTAGA 


TGTTTCTTGA 


TGATCAGGCA 


TCATAATAAC 


TCAATACTGA 


AATCGGAAGT 


3720 


AATAGCCGAT 


AGTTGGCGAG 


CAACAGAAAA 


ACTCGATATA 


GAATTGGACG 


TAGAGCCAAT 


3780 


AGTTGGTGAG 


CAACAGAAAA 


ACCTGAATTA 


TATTTGAAGT 


TTAAAGAAGG 


CAAGCTAAGC 


3840 


TTGGCTTCTT 


TATTTATAGG 


AGAAACTAAT 


GATTGAGAAA


AAAGATTTTG 


ACACAATTAG 


3900 


AAAGGCGTTC 


TTAATTTATG 


ACACCAACTT 


CAATAATCAT 


ATGTTCACTT 


ATACATATCA 


3960 


ACATCGGAAA 


ACAAAGAAAT 


TGTTAGAAAT 


GAACGTTCAA 


TTTGAAGCTG 


GGAATTTTAT 


4020 


GCATTTATGC 


GGAGTTGACT 


ATTATCAAAT AGATAAACAA gGGAGCTGTA AATTTCGTTT


4080 


TAAGGCTACT 


CAATTTTACA 


GAGCTTTAAA 


AAATAATAAA 


GTTTCACTGC 


GCGGGATTAA 


4140 


ACCTAAAGAT 


GATGGAACCA 


CAGGCCAAAA 


ATTACAAGTA 


ATTCCTCTAT 


TAGAAATGCT 


4200 


TATTTCTCCA 


GGTGTACGAA 


TATGCGATGG 


AGGAAAATTT 


TATAATTTAC 


AGTATGAAAA 


4260 


AGCAATTCGT 


TCAGGCAAAA 


TGATTGTGGC 


GTTAACATGT 


AAAGAGAATA 


ACAAAAAGTA 


4320 


CGTTCCACAA 


TCTTTATTAT 


CGTTAATAAA 


TCAGCCACGA 


AAGAGCCAAT 


CGAAATCTTT 


4380 


AACTGAATCC 


CATGAAGTAA


TTAAGATAAG 


TAAGTCTGAG 


CTGAATAGCA 


CCTCAGTAAT 


4440 


TGAAGTTTAC 


GATAAATTTT 


AACAAAAAAG 


CGGGCTAAAC 


AGACTATAGG 


CAAAGAGAAA 


4500 


AAGCCCAGAG 


GTTTTTCTCC 


TCTTTAACAG 


CTTTTAGGAt 


TTCACGGATT 


AACGCTAAAT 


4560 
TTATTCGTAC


TTTTCGGGAA 


AATGCGCTAT 


AATGAAAATG 


AAAAGAGAGA 


TATGCTTCAA 


4620


CATACCTCTC 


TGATGTAGAG 


CCGTTTAAGA 


CGGTGACTTT 


TTGAATTATT 


TAAAAATAAC 


4680 


CGTACTCGGT 


CAAAGTTGAC 


GGTTATTTTT 


TATTGTCATT 


TTTAACAGCT 


TTTTAGTTTC 


4740 


CAAGGACCAA 


TACTTTACTA 


TTAGCAACGA 


AGTTCACGGT 


CCTTTGTGCT 


TGTCTCCACG 


4800 


TTCATTTCTT 


TTGCTTATTG 


TTTAAATCCT 


ACTAACGGAA 


AAATGAAATC 


AACGAATTTT 


4860 


ATCCCCGCAT 


AAAAACATAG 


GGCAAACATC 


AAAATCTTAA 


TTAAGATTTT 


GACAATTGGT 


4920 


CGTTCAGTTA 


TCTTTT'iCAT 


TTGTATACTC 


CTTTCCaATA 


TTTTTAAAGC 


TCCAACCTTT 


4980 


TTAATCTTTT 


TGACATCTAT 


ATTTTATCTA 


ATTATAATAA 


AACTCACAAT 


AGAGGAAGCA 


5040 


TCATTTTTTC 


ATTTCCTTTT 


TATTCCTTGA 


TTAGTAGTTC 


AATTTGTAAA 


CATATCATTC 


5100 


TGACAAAATA 


GAAAAGGAGG 


TTGTTATAGT 


ATCGAAACAA 


AATTATCTAC 


AAAGAAAAAA 


5160 


ACACTTCTTT 


ATCTCCTGTT 


CACTTTTTTG 


ACAGCCCTAC 


CCAGCATAGA 


AGCATTAACC 


5220 


GGAATGAGCT 


TAATAGCTAT 


AAAAAATAAC 


GAAAAAGGCT 


TGATTATCTT 


TGATAGTGTA 


5280 


CTATCTTTAA 


TTATTATTAT 


CGTGCTGTCA 


GTTTTCTATA 


AAAAAATAGT 


AAAGAAAAAA 


5340 


AAGCAAGCTA 


TCTTTAAAAG 


TCTATTACTA 


GCTATTGGCA 


TGTTCTTATT 


TTCTATTATT 


5400 


CTAAATTTAC 


TTTTCTCACG 


CATTCCTACT 


CCAGAAAATC 


AACATTATAT 


TAATGAAGTC 


5460 


CGAAATGTAG 


CACCTGTTTT 


AATGGCCCTT 


CATACTAGAC


TCTTTTCACC 


TATAATAGAA 


5520 


GAACTCCTGA 


CTCGTGGCAT 


TTTTATGAAC 


TTATTTTTTA 


TAAAAAACaC 


CTCTCGAAGT 


5580 


GTCTTATGTA 


AAATATTTTT 


TTCTGGTCTT 


TTTTTTTGGT 


CTTAGCCATG 


GACTATTACC 


5640 


ATCAATTTCC 


ATGCTTTTTT 


ATTGTGGAAT 


GGGCTGGGTT 


TTAGCCTCTA 


CCTATTATTT 


5700 


CTGTAATAAC 


AATCTCACGT 


ATCCTCTTGC 


TATTCATTTG 


CTATTAAATA 


ATTTATAATA 


5760 


CACAAAAAaC 


CCGCCCACAT 


CAGAAAAATG 


ATGTGGGCGG 


GTTTTATAAT 


TATTTATTTT 


5820 


TkGTTTCAAA 


TAACGGACTA 


ACTGGTTTGT 


TTTCGTGGAT 


ACGTTTAATC 


GCATCTCCGA 


5880 


TTAATTCACC 


AACGCTGACT 


TCGTCAATTT 


TaACAATTTT 


ACGATCATCA 


GATAGGTAAA 


5940 


TAGAGTCTGT 


TACAACTAAG 


CGTTCAATTG 


CTGAATCTTC 


AATACGTTGT 


AAGGCTGGTC 


6000 


CTGATAAAAC 


GGGATGTGTA 


CAAGAAGCGT 


AAACGTCTTT 


TGCTCCGGCT 


TCTTTCAATG 


6060 


CGTTTGCAGC 


AAGAGAAATT 


GTCCCCGCTG 


TGTCAATCAT 


GTCATCAATT 


AAGACACATG 


6120 


TTTTGCCTTC 


CACATGACCA 


ATGATATTCA 


TTACTTCTGC 


TACGTTCGCT 


TTTGGACGAC 


6180 


GTTTGTCAAT 


GATCGCAATT 


GGTGCTTTTA 


AGAATTCTGC 


CAATTTACGC 


GCACGCGTTA 


6240 


CACCACCATG 


GTCAGGGGAA 


ACAACAACGA 


CGTCATCGCC 


TTGAATGCCG 


TGTTCGATGA 


6300 


AGTAATCGGC 


AATTAATGGT 


GCACCCATCA 


AATGATCCAC 


TGGAATATCA 


AAGAATCCTT 


6360 


GAATTTGGAC 


CGCATGTAAG


TCCAATGTCA 


ACATTCTTGT 


TGCACCAGCT 


TTTTCAATCA 


6420 


TGTTTGCAAC 


AAGTTTAGCA 


GTGATTGGTT 


CCCGGGCACG 


TGCTTTACGG 


TCTTGACGTG 


6480 


CATACCCATA ATAAGGCATA ACCACGTTAA TTGTTTTAGC 


ACTCGCACGT 


TTTAACGCAT 


6540 
CAATCATGAT TAATAGTTCC ATTAAGTTAT CGTTAACAGG GCTACTTGTT GATTGAATGA 6600

CGTAAACATG TGCACCACGA ATACTTTCTT CAATATTCAC TTGAATTTCG CCATCACTAA 6660

ATTGTGTTAC AGAGCACTTT CCTAGCTCCA CACCCACAGC ATCCGCAATT TTTTCTGCTA 6720

AGGGACGATT AGAGTTCAAA GAAAAGATTT TCAATCTTGG ATCAAAATAA TGTTTCGACA 6780

TTAGAACCTC CACTTTCTAT TTCAGCCAAT TCGTTAAAAC TGaACAATAC TTnCCTTCTA 6840

AAATAATAGT AGTCCTAGCC mAATAAATCa aGAGaTTTTC tGtATTTCAT ACmAaGATAT 6900

TCmTcAAAAG tCGcTTTCTT CTAACTATTT TATCGTATTT GTCTAAAAAA TGAGGATTAT 6960

TTGAATAAAG AAAGGTACTC GCCATAGCCT TCTTTTTCTA AATCTGCCAC TGGCACAAAT 7020

CTCAAAGCTG CCGCATTAAT GCAATAACGT AAGCCACCTT CTTGGAGTGG GCCATCAGTG 7080

AAAACATGCC CTAAATGTGA ATCCGCTTCT TGACTGCGAA CTTCTACTCG GTGCATGCCG 7140

TGACTAAAAT CAGCTTTTTC TTTGACGCCA CGTTTTTCAA TTGGTTTGGT AAAGGATGGC 7200

CAGCCGCAAC CAGCATCGTA TTTGTCCAGG GAGCTAAACA ACGGTTCGCC ACTAACAATG 7260

TCTACATAGA TTCCGTCTTG GTAAAAGTCA TCATATTCTC CTGAAAAAGG GCGTTCTGTT 7320

GCGTTTTCTT GTGTGACGGC ATATTGAAGA TCCGTCAAGG TTTGTTTTAA TTCTTCTTCT 7380

GTTGGTTTTG TCATTCTGTC CACTCCTTTT CTTCTGTTCA ATTTATTGGG TTCAATGATA 7440

CCACGTTTGC CAAACAGGCA CCTaATAGTT TGCATAGTAA AAGACAGTCC TTTCTCTTTT 7500

TGACGAATGA AAAGGACTGT CTTTTATTAG TTGAATTCTT TTAACAGTCG AGCTACTTTG 7560

GCAATTGGTA AGCCCATAAT GGCGTAATAA TCGCCAGCGA TTGCTTCAAT TAGTAATGCT 7620

CCTTGTCCTT GGATACCATA AGCGCCTGCT TTGTCGGCAT ATTCTGCGGT GTCTAAGTAA 7680

GCATGGATTT CCGTGTCTGT CAGTGGATAA AACGTCACGG TTGAATGAAC TGTCGCGGAg 7740

cGCTcTTTTT CCCCTTGTTT CAACGTCACA CTTGTATACA CGTCATGTGT TTTGCCACTT 7800

AGGAGGCGCA ACATGCGATA ACCGTCTTCC CGAGAAGTGG GTTTGCCGAG GATCTCTCCC 7860

GCCAATGCCA CAATTGTATC GCAGCCAATG ACTAGTGCTT CGGGGGATTG CTCTGCTATA 7920

GCCGCTGCTT TTTGGGCAGC CATTTGCGCC ACATATTCTG CTGGTAAgCC GTCTTTGCCA 7980

ACAGTTTCAT CGATGTCGGC TGGTGCAATC GTAAAAGTTG GAACCACACG CTTTAATAAC 8040

TCCTGACGGC GAGGAGATTG GGATGCTAAA ATAATTTGCA TAAATTGATG ACGCCTCGTT 8100

TCTTTTTAAT GGGGGTCTTG AATGCGTTTA AACATTCGTT CCATATCTGA ATTCGTAAAA 8160

TGAATCAGCA CCGGACGTCC ATGCGGACAA TTAAACGGAT TTTCGCACAA GGCTAAATCT 8220

TTTAATAGCA CGCGTGCTTG TTGCTCATTG AGGTAATGAT TGGCTTTGAT GGATCGCTTA 8280

CAGCTCATCA TAATGGCAGT TGCTTCACGG AATTTTTTCA CGCTGACCGA GCCAGTTGTT 8340

AGCAACATGT CAATCATCTC GCGAACAATG CTTTCCTCTT CGCCTGCAGG ATACCAAGTT 8400

GGGTGCGCCC GCACAATGAA ACTATTTTGT CCAAAGTCTT CCAAATGGAT GCCAACTTCC 8460

GCAAGGGTTT CTTTCTGCTC TTTAATTTTC AAGGCATCGC TATTTGGATA ATCAATAACG 8520



WO 98/50555 



PCT/US98/08985 



ATTGGTACCA ATAGTTCTTG TAAATCATCA CTGACTTCGC CAATTTTTTC ACGGAAGTAC 8580

TCGTATTTAA TTCGTTCTTG GGCAGCGTGT TGGTCGATAA TMACAGGCC ATCTTTGCTT 8640

TGTGCGAATA AATAAGTTCC GTGCATTTGA CCGAAGTATT CTAGCTCAGG AAAACGCTCT 8700

GTCGGTCGTT CTTCTTCCAA CTTGTTCAGG GCTTGTTTCA GTTCTCTTTG ACTCGCAGCG 8760

CTATTGGCAT CGAATTCTGG GTGAAGTTTC ATTTCTTCTT GGAAAACCTC GTCAGCAGCC 8820

TCTGATGAAC GCAGAGAAGG TTCTGAATCA GGCACTTCAG TTATTTTCGT TTCTTCGGCT 8880

ATTACCTGTT GCTCATTACT AGCTGTTTCC CAATGATTTG CGGAATCAGC AGCTGCCGAA 8940

GAGGTTTCGT AAGCTGACGT TTCTTCGGCT ACACTCGTTG ATTCTGGCAC TGGCGGCGCA 9000

AATGGATCGT CCATTGGGTC CACTGCTCTT TCTGTCTGGT TGACAGCTTC AGGGGCGCTT 9060

GAGGCTGGTG TTTCGTCTGT CGCATGTTCC GCATAGAAGC GGCCTGTTGC GGGATTATAG 9120

CTTAAGCTGC CCGGTCGTTG TTTTACCGCA GgTTGTTCAG TCGATAAAGG AATTTCCATT 9180

TGTTCGGCTT TAGGTTCTGG CGGTAGTTTC TTTTTAAAAC GCAAATTGTC TGCCGCATTG 9240

GGAATCAACT GTTCTTGGCT CAACACTTCT TGAATCGCCT GTTCAATTAA TGCCATTAAA 9300

TCTTTTTCTT TACTTAAACG AACTTCTTGC TTAGTAGGAT GGACATTGAC ATCGACTAAC 9360

AGCGGATCCA TTTCAATTTC TAACACAGCA ATTGGAAAAC GCCCCACCAT TAGTTTCGAA 9420

CCATAGCCGG CTACAATTGC TTTATTTAAC GCGAAATTTT TAATATAGCG ACCATTGATA 9480

ATCGTTGATA AATAGTTGCG GCTGGCCCGT GTGACTTCTG GCAATGAGAC ATACCCTGTC 9540

AAAGTAAAAT CAAGGTCTTT TCCTTCAATT TTCAACATTT TCTTTGCGGT GCTAATGCCA 9600

TAAATCCCCG CAATCGTTTG TTTTAAATCA CCATTGCCTG TTGTACTCAT CATTTTGTGG 9660

CCATCATGAA CTAAGCGAAA AGCAACTTTA GGGTGGCTTA ACGCCAGACG ATTGACAATG 9720

TCACCGATAT TGGCCAATTC CGTTTGGATC GTTTTGACGT ATTTCAAACG AGCGGGCGTA 9780

TTATAAAATA AATTAGAAAC GGTCATTTTT GTGCCTTTAC GGAGGGCTGC TGGTCGGTTT 9840

TCTTCGACCT TGCCCCCTTT TAGAATCACA TAGCTGCCTT CTTCTTCCTC AGCTGTGGCA 9900

GTTTCGACAA TCATTTCAGA AACTGAGGCA ATACTTGGtA AnGcTTCGCC TCGGAACCCC 9960

AAGCTGCGAA TTCGGAACAA ATCATCTCTT GTATGAATTT TGCTTGTGGC ATGGCGCTTG 10020

AAGGCATTCA GTACATCTTC TTTGGCAATT CCTTCACCAT TATCAATAAT TTGAATGGTT 10080

TTTAAGCCAG CTTCTTCAAT AAAAATATCA ATTTGAGTGC TTCCTGCATC CAGTGCATTT 10140

TCCACCAATT CTTTGACGAC AGAAGCGGGA CGTTCAACAA CTTCACCAGC AGCAATCTGG 10200

TTGGCTAGTT GTTCGGATAG TTCTTGAATT TTCCCCATTT CACGACCTCC TAACCGGTTC 10260

GGTTCCTAAT TAGATACGTT TTTGCAATTG ATGAAGCATG TTTAACGCAT CTAGTGGTGT 10320

CATTTCTAAT AGATTCATTT TCTTCAATGT GTCAATGACA CTTAATTCTT CTGTTGAAAC 10380

TTCTTTAAAT AATGAAAGTT GCTCGGTTTC TTCATGTACT TCTGAAACTT CTTCATGATG 10440

AACTGAAATT GGAATCGTTG TTTCTTCCGC TTCTAAAGCG GATAAAATGG TGGCTGCTCG 10500
TTCTAAAAGC GGGCTTGGTA AACCAGCGAT TTTAGCAACA TGTATCCCAT AGCTCTTATC 10560

AGCAGGTCCT TCCATCATTT TATGAAGAAA GACCACTTCG CCATCTTTTT CGACAGCGCr 10620

CACGTGAATA TTCTTCAAGC CTTTTAAGGT TTCATCTAGA ACAGTTAGTT CATGATAATG 10680

TGTAGAGAAT AGCGTTTTGG CTTGTACTTC ACGATGGATA TATTCAATGA TGGCTTGCGc 10740

TAGCGCCATT CCATCATAGG TTGCCGTGCC GCGTCCTAAT TCATCAAATA AAATTAGGCT 10800

GTTAGGCGTG GCGTGCCGCA GTGCTTGGTT GGCTTCCATC ATTTCTACCA TAAAGGTACT 10860

TTGCCCTGCA ATTAGATCAT CAGAGGCCCC AATCCGTGTA AAAATTTGAT CGAAAATTGG 10920 

CATTTCAGCA CTTTCCGCTG GTACAAAACA GCCAATTTGC GCCATGACCA CCGTCAAAGC 10980 

CAATTGACGC ATGTAGGTAC TTTTCCCTGA CATGTTTGGT CCAGTAATCA AGAGAATATC 11040 

TGTTTCTGGG TTCATGCGGA TACTGTTTGG AATGTATTCT TGATGCCCTA AAACTTTTTC 11100 

CACGACTGGG TGTCGTCCTT CGACAATCGC CAAATTTTTC GTGTTACTTC TTAACGTTGG 11160 

TCGAACGTAT TGATACCGTT CACTAATTGT TGCAAAAGAT TGTAAAACAT CAACCGCACT 11220 

AATGGTTTTA GCCAGTGTTT GCAACCGATC AATGTTCGTT TTCACTTGTT CTCTCACGGC 11280 

CAAGAACAAT TGATATTCTA GTTCAACTGA TTTTTCTTCG GCTTCTAAGA TTAAACGTTC 11340 

TAATTCTTTT AGCTCTGGCG TAATAAAGCG ctCCGCATTA GCTAATGTTT GctTACGCTC 11400 

GTATTTTCCT TCTTCTAAAT TCGCTAAATT TGACTTAGTG ATTTCAATGA AATAACCAAA 11460

GACCCGATTG TAGCCAATCT TCAAGTTTTT AATCCCTGTT TCTTGGCGTT CTTTCGCTTC 11520 

TAACTCCGCT AACCATTGCT TACCATTGCG CATGGCATCA CGGTATTCAT CTAATTGGTC 11580 

GTTATAGCCG TCTTTAATAA CCTTTCCTTC AGTAATTTGT AAAGGGGCTT CTTCATTAAT 11640 

CGCCGTCGCA ATCAATGCCA CTAAATCTTC AACAGGGTTT AAGTCGACTA ACAAATCATC 11700 

CCATTCCCCT TGATTGATCC CGACAATTAA CTGTCGAATC GTTGGTACTT GTTCTAAAGA 11760 

GGTCCGTAAT TGAATTAAAT CACGCCCATT CACACTGCCG AATGCCACGC GTCCTGCTAA 11820 

CCGCTCTAAA TCATAAACGT TCGTCAATGC TGCTTGCAAA TCCAAGCGTT CAAAATAGGC 11880 

ATTTAATAAG GATTGCACCA TTTCTTGACG CGCCTTAATT TGACGTTCCT GAATCAGCGG 11940 

ACGATCCAAC CATTGTTTTA ATAAACGGCC GCCCATAGCT GTTTTCGTtT CATCTAATAG 12000 

CCACAATAGA GTCCCTTTTT TCTGTCCTGT TCGAATCGAC TGACTCAATT CTAAATTGAA 12060 

TTTAGAATAA TGATCCATTT TCAAGAAATG ATCTGGCTGA TATTCCACAG CTTTCTGAAT 12120 

ATGGGACAAG CTCCGCTTTT GGGTAACGGA TAAATAAGAT AATAATTTGC CTGTAATTTC 12180 

AATTTCTAAT GGATTAATCA GTTCATTGGT TAAAAAACTA AACTCGGCAT TTTCTTCCAC 12240 

CGTTTCTTGT GTCGAAAAGA TGATATTTAA ACGTAAACTT AAGTTCTCTT TTAACGATTC 12300 

AGGGATCCCT GAACCTAGGA CCATTTCTTT TGTTTGCAAC GCAGAAGCTT CGTTCAAAAC 12360 

GCCCTCTTCA TCAGCCAAGA CTGCTGTTTT TAATTCGCCT GTACTCAAAT CTACATAGGC 12420

AAAACCGAAC TGATTACCAT CTGTTAGCAC AGCGGTTAAG TAATTATTGT CTTTGGCTTC 12480
CAAGCCTTTA CTATTCATCA CCGTTCCTGG CGTTACCAAC TGGACCACTT CTCGTTTGAC 12540

CATGCCCTTG GTAGTCTTTG GATCTTCCAC TTGCTCACAA ATCGCTACTT TATAGCCTTG 12600

CTCAATCAAC GTATCAATAT AGCCTTGCGC GGCATGATAA GGAACGCCAC ACATAGGAAT 12660

CGGATCATCC GCATTGCGGT TACGACTGGT TAACGTCAAT TCTAAGATTT GCGCTGCATT 12720

GATAGCATCT TCATAAAATA ATTCATAAAA ATCACCTAAG CGATAAAATA AAAACGCATC 12780

CTTGTATTGG GCCTTAATGG ATAAATATTG TTCCATCATG GGTGTATTTT TTGTCTTTTG 12840

TGGCATTTAA TTCCCTTCCT TCTCTAGCTC TTCGTTCACG CGGTATTGAA TGAGCGCCGT 12900

GACATGTTGG ACTAAGTCAT TGGCTTCGAT CAGTTGTTCG CGGTAGGCTA CAACGAGTGG 12960

GTGCTCATCA AATTCTTTTG TTTTCGCATC GGCTCGTTGA ATGGCTTCTT TTTCAGCAGT 13020

TGGTTTGCCA TAGTGAGCAA ATTGAACCGC ATCTTTTTGC GCAGCTTTAA TTTCTTCCAC 13080

TAATTCTGTC AGCTTTTTGT TTTTCTTTAC CTTTTCTTCA ATCGCTTGGT AGCGCTGAAT 13140

GACTTCATCT TGTCCAATTA AATGTAACAA TTTTTCTAAT TCTTTATTAC TTTGTGCATC 13200

TAAATTCAAC GGCTCCATTT TTTTCACCTT CATCTTTCAC CGGCACTTTT TCTATAAGAG 13260

CAGAAAAAGT TAACCTCTTA ATTATCCGAG ATTAACTTTT CTTCTATTAT TCTTGAAACG 13320

GGCGATCTTC GCTAATCTGA ATCGGCTCAA TCATTTTCGC TTGGCCCGTT TGGTCATCAA 13380

TCTCTAAAAT ACAGGCAGAT AAAATTGTCC GCCCTGTTTC GACTACCTCA AAGCGCTTCG 13440

GCAATGCTGT CATAAACTTC TCAATAATCG GTTCACGCTT CATTCCTAAA ATCCCATCAT 13500

AAGGACCGGT CATACCGACA TCACTTAAGA AGGCTGTCCC TTTCGGTAAA ATCCGAGCGT 13560

CATTTGTTTG AACATGCGTA TGTGTTCCAA CAACGGCTGA AACTTGGCCA TCTAAAAACC 13620

AGCTCATGGC TTGTTTTTCA CTCGTCGTTT CGCCATGAAA ATCCACAAAA ATGAGCGGTG 13680

TCCGTTTGCG AGCGATAGcT ACTAACTCGG CGGCTTTCTT GAAAGGATCT TCCAATGGCA 13740

CCATAAATGT GCGGGCCTGT AAATTAATCA CCGCTAGTTC CAATTGATTG ACTTTCACAA 13800

AAACCATTCC TTGACCAGGC ACATCTTCTG GAAAGTTCGC TGGACGAACC ATTTTCTTCG 13860

CATCATCAAT GAATTCAAAG ATATCCTTGT TGTCCCACGT ATGGTTACCC ATCGTAACAA 13920

CATCGACACC ATCTTGCAAA AATTTCTTAT AAATCTTGCC AGTAATTCCC CGACCAGATG 13980

CCGCATTTTC ACCATTGGCA ATGGTCACTT GTGGACGATA TTTTTTCTTT AATTTAGGGA 14040

CATACGTTGC TAAGGCTTCT CTTCCTAAAG AACCGACAAC ATCTCCTATA AATAATATAC 14100

GCAAAAAGGT TCCTCTTCTC TTTTTAACAT CTCTTGTTTA TTATAGTTGT TTTCGCTTTG 14160

AAAAGAAAGT TTTGGTTTTA AAAGATTAAA AAAAGtGTCT CAACGAACAA TCCTTGCAAT 14220

TGTTCGTTGA GACaCTACAC GCGATACTTA TTCGCTTTTT ATTTTATTTC GCATAGTCTA 14280

CGGCACGTGT TTCACGGATA ACCGTTACTT TGATATGTCC TGGATAATCC AAGTCATCTT 14340

CAATTTTCTT CCGAATGTCT CGAACTAAAC GAACAGATTC TAAATCAGAA ATTTCTTCTG 14400

GTTTCACCAT GACACGAACT TCACGTCCTG CTTGAACAGC AAAACTTGAA TCAACGCCTT 14460



CAAAACTATT AGAAATATTT TCTAAGTTTT CTAAGCGGCG AATGTAGTTT TCTAACGATT 14520

CcTACGTGCA CCTGGACGGG CTGCAGATAA CGCATCTGCG GCTGCCACTA AAACAGAAAT 14580

AACTGAGGTT GCTTCCACAT CACCATGATG CGAGGCAATG GCATTAATAA CTACTGAATT 14640

TTCTTTATAT TTCGCAGCTA ATTCTGCGcg AATCtCAACG TGAGAACCTT CAATTTCATG 14700

ATCCAATGCT TTCCCAATAT CATGTAGTAA ACCTGCACGT TTGGCTAATT GAACATCTTC 14760

GCCTAATtCG GCGGCTAAAA CGCCAGCTAG TTTTGCTACT TCGATTGAAT GGTTCAAGAC 14820

ATTTTGACCA TAACTTGTTC TGAAACGTAA ACGGCCTAAA ATTTTAATTA AATCAGGATG 14880

TAACGTATGA GCACCTACTT CAAAGGCTGC TTGTTCTCCG TATTCACGAA TCCGTTCATC 14940

CATTTCTTTA CGTGATTTTT CTACCATTTC TTCGATACGA GCGGGATGAA TCCGACCATC 15000

TTGGATTAAT TTTTCCAAGG TCATTCGAGC GATTTCTCGT CTGATTGGAT CAAATCCGGA 15060

TAACACAACC GCTTCTGGCG TATCATCAAT GATTAAGTCA ATCCCTGTTA ATGTTTCCAA 15120

TGTACGAATA TTACGTCCTT CACGACCAAT AATGCGACCT TTCATCTCAT CATTTGGTAA 15180

AGTGACCACG GAAACAGTTG TTTCAGAAAC AGAATCGGCG GCGCAACGTT GAATTGCTAG 15240

GGAAAGCAAG TTcTTCGCTT TGCGATCAGA TTCTTCTTTG gCACGTTGTT CTGACTCCTT 15300

GACCATCAAG GTTAGTTCAT GATTCAACTC TTCTTCTGTT GACTTCATAA TAATCGATTT 15360

TGCTTCTTCT TTTGAAAGGG AAGCAATTCG TTCTAATTCT TGATGTTGCT GATTAATTAA 15420

GTTTTCTACT TCTTTTTCTC GCTCATCAAT TAATTGCTGT CTAGCACCAA GTTTCTCTTC 15480

TTTTTCTTCC AGTGAGCCTT CACGTTTTTC mAGrGAGTCA TCTTTACGAT CCAACGTTTG 15540

TTCCCGTTGA ATTAAACGAT TTTCTTGAGA TTTAAGTTCT AATCTGCTTT CTTTCAACTC 15600

ACTTTcAATT TCTGAACGAT ACTTCTGGTT CTCTTCCTTA GCTTCTAGCA ATGCTTCTTT 15660

TTTCAGTGTT TCTGATTCTT TcCGAGCGTT CTCGAGAATA cTCGACGCTG AAATTTTTGC 15720

ACCATTTATT GCTTTATCGT GACGTGACTT TGCAATAACA AATCCTAAAC CAAGACCGAC 15780

AATCAAACCG ACGATAGCGA GGAGAATATT TAAAACCATA TTTCCACCTC CATACTATCT 15840

TTTTGTTAAT TCATATCTTG TAGTTTTTTG ATAAACTAAA ATCAATTATC AATATGCCTA 15900

CTTGCATATT TGTGCATACA CACAATTTTA TTCTAAtGTT TGTCGTGAAA GGTGTCAACT 15960

TAAAAAAGCC CTTTCTTTTG GAAAGGGCrG CAAATATTAG ACAACGTCAC ATTTTTAGGT 16020

TTCCGCCGTC ACGAATTGTT TAAATTTTAG AACAAACGCC GCATAGTCTG CTGTTTGACT 16080

AAGCTGCTTG ATGTGGATGG GTTCGGACAT GGACAACAGT CTATTTAACA ATTTAGTTCT 16140

AAACAAAAAA AGACAACACC TGTCCTGTTC AGTAAGTGTT GTCTTTCTTC TTAACGAGTC 16200

GTATTCACTC GTTATCTATT CATCTAAGGG TAATTCTTCT TGACCTTCTG CTTCTTCTGT 16260

AATGGTTGAA CCGTCACCGA TGCCGTATGC ATCACGGACC AATTTAGACA CTTCTGCCAT 16320

CATTTCAGGA TGATCCGCCA TGTATTGTTT GGCATTTTCA CGGCCTTGGC CAATTCGTTC 16380

TTCTTTGTAG CCATACCAAG CACCACTTTT ACTGATAAGG TCTTTTTCCA CAGCCATATC 16440
TAGTAATTCG


CCTTCTTGGG 


AAATACCTTG 


GCCATACATG 


ATATCAACTT 


CTGCCACTTT 


16500 


AAAT.GGTGGC ..GCCACTJ'TGt 


TTTTAACGAC 


TTTaATTTTT 


GTGCGGTTAC 


CGACGATGtC 


16560 




mrprpTi T\ /Tp f~>rp rp 

1 1 lMnL lbl 1 


CTGCACGACG 


GACTTCTAGA 


CGAACCGTTG 


CGTAGAATTT 


16620 




K*n\* C/iVj La/io 


TTGTTTCAGG 


ATTTCCGAAC 


ATCACGCCAA 


CTTTTTCACG 


16680 


AATTTGGTTA 


ATGAAAATAG 


CAATTGTTTT 


TGTCTTATTA 


ATTGAGCCTG 


ATAATTTACG 


16740 


TAGTGCTTGA 


GACATTAGTC 


GAGCTTGTAA 


GCCGACATGG 


CTCGCTCCCA 


TCTCACCATC 


16800 


AATCTCTGCA 


CGAGGAACTA 


ACGCAGCAAC 


CGAGTCGATG 


ACAACGATGT 


CAATCGCACC 


16860 


ACTTGAAACT 


AAGGCATCGG 


CAATCTCTAA 


GCCTTGCTCG 


CCCGTATCTG 


GTTGAGATAA 


16920 


AAGTAATTCA 


TCGATGTTAA 


CGCCTAGTTT 


CTCCGCATAT 


TGAGGATCCA 


ATGCATGCTC 


16980 


AGCATCGATA 


AAAGCGGCTG 


TTCCACCATT 


TCGTTGCACT 


TCTGCAATTG 


CGTGTAAAGA 


17040 


AACAGTTGTT 


TTACCTGAAC 


TCTCAGGACC 


ATATACTTCA 


ATAATTC 




17087 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:



AAATGAATAT 


TTATTCTGCT 


TTATTnTCAA 


TACTTTGTTT 


TCATAGTTGC 


TTTAGCTGTT 


60 


CCTTTTTTGA 


ATAAGATTAA 


AATGAATAAA 


TAACAAGCAA 


AATATAAGAT 


TTTTTTGTTT 


120 


AACTCTTTTT 


GTTTCAATAG 


GAAGTGCTTA 


GTGTTTTTTC 


AGAATTGAAA 


AATGAATACG 


180 


ACAAAAAATT 


TATCTATATA 


ATAGAAGGAA 


AAATAAAAAT 


AGTGAATGAT 


AAAAAAAATT 


240 


CAAAGAAAAC 


ACCCGTTCTT 


TTTAGGCCTG 


AGTTTTTGAA 


AAAAACATTC 


TCATGTCTAG 


300 


TATCTTTAAC 


ATTTTGTTAC 


TTTTTGACTG 


AAACGAATGT 


AAAAAAACCG 


ACATTTAAGG 


360 


GCTAATCAAG 


GGTTAGAAAA 


CGGTTTCTCG 


AAAATAAAAT 


CCTGTTTTTT 


TAACTTTTTT 


420 


TCGCAAAATA 


TAAACAGGAT 


AATATCAAGC 


TTCCAGCAGC 


TTTTTATTAC 


AAAATGTAAC 


480 


AAAACCGACA 


TTCTAGTAAT 


AAATTGTCAC 


GAGATATTCA 


CtACGGCCGG 


CCATTCGATG 


540 


ATAAAGTATG 


TGAAGTCGTT 


TCGATAAAGC 


GAAGCAGACG 


TGGTTGTTGC 


CGAGTGCTAA 


600 


GAGCAACAAT 


CAAATATTTT 


TAATTACAAA 


GGAGAAACAT 


ACATGAAATC 


AATCAAAACG 


660 


ATTTTACTTG 


GAACAACTTT 


GGCCGCAGtT 


TAGGATTATT 


CTTAGGAACA 


GACGCAAACG 


720 


CAGAAAGCTT 


ATATACAGTT 


AAAGCAGGAG 


ATACTTTATC 


AACAATTTCT 


CATCAATTTG 


780 


CAGGAGACAA 


TAGCTTAATT 


CAAAAAATTG 


CTTCTGATAA 


CAAATTGCCA 


AACCTTGATT 


840 


TAATTTTTGA 


AGGAGAACAA 


TTAGTTATTC 


GTTCAGAAAA 


AGAAgTTGCT AATACTCCAG 


900 


CACCAGCTGT 


AGAAGTTGCA 


CCAGTTCAAC 


AAGTAGTTGA 


ACAACCTGTT 


GCACAACCAG 


960 


TACAACAAGA 


AGTACAACAA 


CCTGTTGCTC 


AAGAAGTAGC 


GCAACCAGCA 


GCACCTGCTG 


1020 




CAAGCAGTGA TGCAAAAGAG TGGATCGCAC AACGTGAATC TAGTGGTTCT TACGATGCAA 1080

CAAACGGTCA GTATATCGGT CGTTACCAAT TATCTGCCTC TTATTTAAAT GGTGACTATT 1140

CACCTGCCAA CCAAGAACGC GTGGCAGATC AGTACGTAGC AGGTCGCTAT GGTTCATGGG 1200

ATGCAGCTAA ATCATTCTGG TTAGCAAATG GTTGGTACTA AAATTAAATA AACGArTAAT 1260

CTAATTACTT ATTAAAAACC ATAAACAAAA AAACAGTAAC CTGTAAAGAT TACTGTTTTT 1320

TTTGTCGATT TGTTAAGGTA CAACGCCCAA CGCTCGTCAA TTGGTGGTCT GCGTTATAAA 1380

TTTTTGCTTC CCAAACTTGA AGGGTCTTTC CCGAGTGATC GGGCGTCGCA ATCACGGTAA 1440

GGCTACCATC ATGGACACTA CGAAGATGAT TTACTTGTAA GTCTATGCCG ACAGCAAAAG 1500

TGTTTTCAGG AACGTTTTCG TTCGCACCTA AACTACAGGC AGTTTCAATT AAGACGCCGT 1560

TTAAGCCCCC GTGGACAAGG CCATAGGGTT GTTTATGCAC GTCTTTTACC GTTAAAGTAA 1620

GCTGACATTT CTCAGCAGAA ACCTGTTGAA TCGTGATGCC TAAATACTCT AAAAGATGAA 1680

TAGTAATCaG TCCTTTCTAA TAAAAGAGAG AATAAAAGGT TTTTaTTaTT ATACAGGAAT 1740

GAAGAAAATT kGAAATAGCC AkGTTTTTtC TTAAAATtCa AGCGAmTTTT TCyCtAAAGg 1800

CCGATTCTgk GkATAwTrGA AAaGATGrGA TtAATGAAGG AGGTTTCCTA ATGAGAAACT 1860

GGACAACAAT TAAAGAATAT GATGAAATTT TATTTGAGCA AGCAGGCAAA GTAGCAAAAA 1920

TTACTATCAA CCGCCCGCAT GTTCACAACG CATTCACACC AAAAACAGTC ATGGAAATGA 1980

TTGATGCATT CAACATTAGT CGTGATAAAG AAGATGTCGG GGTTATTATT TTAACAGGCG 2040

CAGGTGACCA AGCATTCTGT TCTGGTGGCG ATCAAAAAGT TCGAGGTAAT GGTGGTTACG 2100

TTGGCGAAGA CAACATTCCA CGTTTAAATG TGTTAGATTT ACAACGGTTA ATCCGTGTCA 2160

TTCCCAAACC AGTCATCGCA ATGGTGAAAG GCTGGTCCAT TGGTGGCGGC AATGTTTTAC 2220

AATTAGTTTG TGATTTAACA ATTGCGGCGG AAAATGCTAA GTTTGGGCAA ACAGGACCAA 2280

ATGTCGGTAG CTTTGATGGT GGATACGGCT CAGGTTACTT AGCGCGTGTT ATCGGCCATA 2340

AAAAAGCAAA AGAAGTTTGG TTTATGTGTA AACAGTATTC TGCTCAAGAA GCTTTAGATA 2400

TGGGCTGGAT TAATACAGTT GTTCCATTGG AACAAGTAGA AGATGTGACA ATGGAGTGGG 2460

CAGAAGAAAT GTTAACCAAA AGCCCGATTG CTTTACGGAT GATTAAAGCC TCTTTAAATG 2520

CCGATACAGA TGGCTTGGCT GGTGTGCAAC AATTAGCTGG TGATGCGACA CTTCTTTACT 2580

ATACAATGGC TGAAGCACAA GAAGGCCGAG ATGCATTTAA AGAAAAACGG ACACCAGATT 2640

TCGATCAATT CCCTAAATTC CCATAAACCA ATGACTTGGT TAAATAAACA AGTACAAAAA 2700

CGTCCCGATC ATCCTGCTTT TTATTTTCAG GATGAATCTT GGACGTTTTT AGAAGTGcAA 2760

CAAGAAGTCA GCCATTGGGT AGCCACATAT CAACAGGTGC TTGCACCAGA AGAAAAACGT 2820

GTGGCTTTAT TCAGTAAGAA CAGCAAAGAG TTGTATTTTT CAATTCTAGC CTTATGGGAA 2880

TTGGGGAAAG AGTTATTATT TTTAAATACG CATTTGACAC TTGCTGAACT GACTTTTCAG 2940

TTAAAAGATG CGCAAGTTAA AACGATTATT GGTGCGCCTG AAACGCAGGC TTTGTTAGAA 3000
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GAAATTTCTT 


TTGTCGACGT 


TCAGCCAATG 


ATTAAAAAAC 


AGCATAGTCT 


TTCACATCAG 


3060 


GAGTTTCAAC 


AGCCAAGTGA 


CTTAGAGTCG 


GTGGCTTCTA 


TTATGTATAC 


ATCTGGAACT 


3120 


ACAGGACAGC 


CTAAAGGGGT 


CTTGCAACGG 


TTTAAAAATC 


ATTTGGCCAG 


TGCTAGAGGC 


3180 


ACTCAAGAAA 


ATATGGGGAT 


TACTGCTGAA 


GATTGTTGGT 


TGTGTGCAGT 


CCCATTATTT 


3240 


CACATTAGCG 


GATTGTCGAT 


TGTTGTTCGA 


CAACTCGTCT 


TAGGTTGTAG 


TATTCGCCTT 


3300 


TACGATAAAT 


TTGATGAACA 


GCAAGTGACA 


CAAGATTTAC 


AAGAAGGCCG 


TGGAACCGTC 


3360 


ATTTCAGTGG 


TTGCGACAAT 


GTTACAACAA 


TTATTGTCTG 


TCTATCCTGA 


AGCTGGCTAC 


3420 


AGCGCCAgCT 


TTaAAGGAAT 


GCTATTAGGC 


GGCGGACCGA 


TTGCTCCGGA 


TAAATTAGCG 


3480 


CAGTGTGAGG 


AAAAAGGGAT 


TCCAGTCATT 


CAATCTTACG 


GGATGACGGA 


AACGTGTTCG 


3540 


CAAGTTGTTG 


CCTTGAAATT 


TGAAGATGCG 


GCACTGAAAA 


TCGGCTCTGC 


TGGACAACCG 


3600 


TTAAAAGATA 


TGCAAATCAA 


AATTGTTGAT 


GAACTAGGAC 


AAGAACAGCC 


AGAAAAGCAA 


3660 


GTCGGGGAAA 


TTTTACTTAA 


GGGACCAAAC 


GTTGTTTCAG 


GTTACCTTAA 


CCAGCGTCAA 


3720 


CCTGAAAAAT 


GGACAGCAGA 


TGGTTGGTTT 


AAAACAGGCG 


ACATGGGCTA 


TTTAGATGCA 


3780 


CAAAGTTACC 


TGTATTTGGT 


TAGTCGTTTA 


AGTGAACTCA 


TTATCTCTGG 


TGGTGAAAAT 


3840 


ATTTACCCCA 


CTGAGGTTGA 


ACAGGTGTTA 


CAGGCGATAA 


CAGGAATTAA 


AGCAGCGGCA 


3900 


GTTGTAGGAG 


AACCAGATGC 


GCAATGGGGa 


GCCGTCCCAG 


TTGCATATGT 


GATTAGTGAC 


3960


CAAGAAATCA 


CCTTAGCGCA 


AATTCAAGAC 


CAGTGTTCAC 


GAAAACTGGC 


AAAATATAAA 


4020 


CGACCGAAAC 


GCATTTATTT 


TTGTCATTCT 


TTCCCGCAAA 


CAGCAAGTGG 


AAAGATTGCC


4080 


AAACATCGTT 


TCATGACAGA 


AGAAAGAGAG 


GCGTTTTTAA 


TCAGATGAAA 


TTGCCTGATG 


4140 


AATTACGACA 


AGCCTATCAG 


AAGAATGCCC 


GTCAGTTCAG 


TTGGGTTTTT 


CCATTAGAAA


4200


CAGAACAAAC


AGCCTTGGCA 


ATTTTTGCCG 


CTGGTGAAAA 


GGCATACCAA 


GGAGAACGTT 


4260 


TTTTTTGGCA 


AACGCCGAAA 


AAAGATTTTG 


CTTTGGTTGG 


CTTCGGATAT 


GAAACTATCT 


4320 


TAAAAGGCGC 


AGAAACAGAA 


ACGCATCAAT 


TGAACAACTT 


TTTAAAAAAA 


GAGGCGGCAA 


4380 


CTCGTTTTCA 


AAATATGTCT 


ATTTCTGGTA 


CGGGGGCCCT 


TCTGTTTGGC 


GGTTTGCCTT 


4440 


TTGATACAGA 


ACAAACGCCT 


ACTGAGGCCT 


GGGGAGAGTT 


AGGGGAAGGT 


TGGTTTTATC 


4500 


TACCAAGTAT 


TCTATTTACT 


TTTTCAGGTT 


CACGTATTTA 


TGGAACGCTG 


AACTTTTCAG 


4560 


GTTCTAGTGA 


ACAAGAAGTC 


GAGGAACGTT 


GGTCGACTTT 


GGTGGCTCAG 


TTTGATCGCT 


4620 


lull nuL I 








AAAGATTGAC


AAGGAAGAAG 


4680 


TTGCCGTGAC 


TGAATGGCTA 


CAAGCTGTCA 


ACGAAACAGT 


AGCAGTCTTA 


AGAGAAGATG 


4740 


GCCCTTTAAA 


AAAAGTGGTT 


TTGGCTCGAC 


AACTTGCTGT 


GAGCAGTCCT 


GAAAAAATTC 


4800 


GAAGTAACCA 


AGTGCTCGTT 


AATTTAATGG 


CACAACAACA 


AAATACGTAT 


TTGTTTGCTT 


4860 


TAGAAGCCAA 


AGACACAAGT 


TTCATTGGTG 


CAACACCAGA 


ACGACTTCTG 


TTGGGTACTA 


4920 


AAGAAACGTT 


TGCGACCGCT 


TGTATTGCAG 


GAACCATTAA 


AACGGGTCAA 


ACACCAGAAG 


4980 
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AAACGAAAAC TTTAGGAGCA CAACTTTTAC AAGACCGCAA AAATACCGGA GAACACCAAA 5040 

TTGTGGTGGA ACGTTTAGCG AAAGAGTTGG CGAAGATGAC GACTTCTGAA AACAGTATTC 5100 

AAGCACCCAT TATTTTGGAA AATCGAGATG TACAACATCT TTATGTGCCA ATCTCGGGTC 5160 

AACGTAAACC GGGAATTTCA TTTTTGGAAA GTGTGATGCA GCTACACCCA ACGCCAGCTT 5220 

TAGGTGGTGA GCCCAAAGAG TTGGCAGTCG AATGGATTCG ACAATATGAG CCAGGGAGTC 5280 

GTGGGTTATA TGGTGCGCCA ATCGGTTGGA TTTCAGGGAA TGATGATAGT GGCGAGTTCG 5340 

CTGTGGCCTT GCGTTCAGGT GTCTTTGCTG GTCAGCAAGG CGTTCTTTAT GCAGGTTGTG 5400 

GTATTGTTGC GGATTCCCAA GCAGAGCTAG AAAGAGAAGA AACGAAAATA AAATTTCAAC 5460

CGATGTTACG AGGAATTGGA GGTCAGGTTT AATGAATCAT CAAGAAACAA TGACCGATTA 5520 

TTTAATGGCC TTTATTGAAG GCTTAAAAAA TAGTGGGGTT GAACAAGCAG TCATTAGTCC 5580 

AGGGTCCCGG TCCACACCAC TGGCTTTATT ACTTCATCGG GAAACCGCAA TCCAAACTTT 5640

TGTCGATGTC GATGAACGTT CCGCTGCTTT TTTTGCTTTA GGATTAAGTA AAGCGAGTCA 5700 

AAAACCAGTG GTCTTACTTT GTACCTCTGG AACAGCAGCT GCAAACTATT ATCCAGCAAT 5760

TTGTGAAGCC AATATTAGCC ATGTGCCTTT AGTCGTTTTA ACAACAGATC GGCCGCATGA 5820 

GTTACGTCAA GTTGGTGCTC CTCAAGCCAT GGACCAATTA CAAATGTATC AAAATCACGT 5880 

GAAATTATTT GTTGAAATGG CTTTGCCAGA AGCAACAGAA GAGATGTTGA ATTACGCCTA 5940 

TTGGCAAGGC GCTAAAGGAG CAGCATTCGC GCAACAAACG CCTGCAGCAC CTGTTCATTT 6000 

GAATTTTCCT TTGCGGGAAC CATTATTACC TGACTTAGAG AGAAAAACAA AGAGTTCTCA 6060 

ACAAACGGCT TTATTCGCTG GTCAATCAAT CCTTTCTACA GAACAAGTGC AACAGTTAGC 6120 

GGATCAATGG TACCAGAAAA ACGGCGTCTT GGTTGTCGGT GGCAGTCATA CAGAGGAAGA 6180 

AGCGACGTTA TTTATTCAAT TAGCCGAAGC TTTGCAATGG CCGTTACTAG CGGATCCTTT 6240

AGCAAACATT GTGACACATG GGCAAAATAG TGAAGTTGTT ATTGCACACA GTGATCTGTT 6300 

TCTAAATGTG GCGACCTTGC CACAAGAACC AGAAGTGGTA GTGCGTTTTG GTTCGCTGCC 6360 

AATTTCTAAA AACATCATGT TATGGTTGAA ACGCTTAGCA ACGACAGAGA CTGCTTTTTA 6420 

TTTTGTGGAC GAAAATGGGC AATGGCAAGA GCAATTGAAA AAATCACAAA CAGTTATTCA 6480

GGCAAAGGAA ACCACGTTTG TAGAACAACT ACTAACGGTT GTCAAACCAA CAGAAGCGAC 6540 

TTGGTTAGCA CAATGGCTCC TGCTGGAAAA AACGGTCTCT GAAGTTCTTC TAGAAACGCT 6600 

GAACGCGACA GAATTAAATG AAACAACAGC CAGCTTAGCT GTTCACCAAA CAATGAAAGA 6660 

AAATGGTCAG TTATTTGTAT CTAATAGTAT GGCTATCCGC TACTTAGATC GATTTATGGA 6720 

TAGTCGCCCT TATCGGATGT TTGGAAATCG CGGCATCAAC GGGATTGATG GGATTGTTTC 6780 

AACTGCTTTA GGAATGAGTG CCATAGCACC GACACAGCAA AACGTTCTAC TGATTGGTGA 6840

TTTAGCGCTA TATCATGATA TGAATGGTTT ATTTTTGGCT AAACGTTATC AGTTGCCGTT 6900 

AACAATTGTT TTATTAAATA ACAATGGTGG CGGGATTTTC TCTTTTCTAT CTCAACGAAC 6960 
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TTTACGTGAG 


GACGATTTCG 


AGCCTTTATT 


TGGAACACCG 


CTTGATTTAG 


ATTTTTCGTT 


7020 


AGTTGCTGAA 


CTTTACGGTG 


CTTCGTACCA 


AGAAGTCAAA 


ACAATCGCGG 


AgCTCAAGCA 


7080 


AATTTTACAA 


GCAGCGGCTG 


AAGAACCACA 


ATTTCAAGTG 


ATAGAAGTCA 


AAGGAAATCG 


7140 


CCAAGAGAAT 


GTTCATTTGT 


ACGAATCAAT 


TTTAGCTGAA 


ATTGGTAGAC 


GAGTAGAAAG 


7200 


ACAGGGGATT 


TCATGGAACG 


GTTAATTCGG 


GGGATGCAGT 


ACCATTATCA 


ATGGTTAACA 


7260 


CCTTTTGATG 


CAAAGCGAAC 


GACAGTAGTC 


TGTTTACACG 


GTTTCACAGG 


AACGTTAGCC 


7320 


ACGTTTGCAG 


CCGTTTTTCC 


CAGTCAAACA 


CCTTATAACG 


TTTTAGGAAT 


TGACTTACCT 


7380 


GGGCATGGCG 


CGACTGCTAG 


TTTGGTAGCG 


CCAGAGCGTT 


ATACCATGAA 


ACAGGTTTGC 


7440 


CATGATATTG 


CTGAACTAAC 


CGAATCGCTA 


AATTTGCCTT 


GCTTTTGTTT 


ATTAGGCTAT 


7500 


TCAATGGGCG 


CTCGAACAGC 


TTTAGGTTTT 


GCATTACATT


ATCCACAAAA


GGTACAGCAT 


7560 


CTTTTATTGG 


AAAGTGGATC 


ACCAGGTTTG 


GCCACTGCCG 


CAGAACGTCA 


GGCTCGTATC 


7620 


TGTCAAGATC 


ATCGTTTGGC 


GGAACGTCTT 


TTAGAGGAAC 


CGCTGGTGGA 


TTTTATTGAT 


7680 


TTTTGGCAAG 


AATTACCGTT 


ATTTCAAACG 


CAAAAAGCCT 


TGTCTGTGGC 


ACAGCAAATG 


7740 


GCCATTCGTC 


AGGAACGTTT 


GAGCCAATCG 


GCCTTTGGAT 


TGGTTAGTAG 


TTTGTGGTAT 


7800 


ATGGGCACAG 


GAGCGCAAGA 


GAGTTACTGG 


GAACGTCTAG 


CAGAATTGCA 


GCCGATTCCT 


7860 


ACCGATTTAT 


TGGTAGGCGG 


CGAAGATCAA 


AAATTTATTG 


GAATTGCGAA 


AAAAATGCAG 


7920 


GCACGTCAAC 


CGTTGTTGAG 


GCTGACAATT 


TTTCCTGAAG 


CAGGACATTG 


TATCCATTTA 


7980 


GAACAACCAA 


CGATTTTCTA 


TGAAAAGGTG 


ACAGCGTTAT 


TGGAAGGAGC 


CATCTGAATG 


8040 


AACATTCAAT 


CAATTGAAAC 


CTATCAGGTT 


CGTTTGCCTT 


TAAAAACGCC 


GTTTGTCACG 


8100 


AGCTATGGTC 


GCCTGGAAGA 


AAAAGCGTTT 


GATCTATTTG 


TAATAACGGA 


TGAGCAAGGA 


8160 


AATCAAGGGT 


TTGGCGAATT 


AGTGGCtTTT 


GAGCAACCTG 


ATTATGTGCA 


AGAAACGTTG 


8220 


GTAACCGAAC 


GCTTCATCAT 


CCAACAGCAT 


TTAATTCCGT 


TACTTTTAAC 


AGAAGCGATT 


8280 


GAACAGCCGC 


AAGAGGTTTC 


AACAATTTTT 


GAAGAAGTTA 


AAGGGCATTG 


GATGGGAAAA 


8340 


GCTGCGTTAG 


AAACTGCCAT 


CTGGGATTTA 


TATGCCAAAC 


GGCAACAGAA 


GAGTTTAACT 


8400 


GAATTTTTTG 


GACCAACGCG 


GAGGAAAATT 


CCTGTAGGGA 


TTAGTTTAGG 


GATTCAAGAA 


8460 


GACTTACCTC 


AATTACTTAA 


ACAAGTTCAG 


TTAGCTGTAG 


AAAAAGGGTA 


TCAACGAGTG 


8520 


AAGCTGAAAA 


TTCGTCCCGG 


TTATGATGTG 


GAACCTGTTG 


CTTTGATTCG 


TCAGCATTTT 


8580 


CCTAACTTGC 


CTTTAATGGT 


CGATGCAAAT 


TCAGCGTATA 


CATTGGCAGA 


TTTGCCTCAA 


8640 


TTGCAACGTT 


TAGATCACTA 


TCAATTAGCG 


ATGATTGAAC 


AGCCTTTTGC 


TGCAGATGAT 


8700 


TTTTTAGACC 


ACGCACAATT 


ACAAAGAGAA 


CTGAAGACAA 


GAATTTGTTT 


GGATGAAAAT 


8760 


ATTCGTAGTC 


TCAAAGACTG 


CCAAGTGGCG 


TTAgcATTGG GCAGTTGCCG 


TAGTATCAAT 


8820 


TTGAAGATTC 


CTCGTGTAGG 


TGGGATTCAC 


GAGGCGTTGA 


AGATTGCCGC 


ATTTTGCCAA 


8880 


GAAAATGATT 


TATTGGTTTG 


GTTAGGTGGT 


ATGTTTGAGT 


CTGGTGTTGG 


TCGCGCATTG 


8940 
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AATTTGCAAT 


TTGCGTCACA 


GCCTACCTTT 


TCCTTTCCAG 


GTGATATTTC 


AGCAACGGAA 


9000 


CGCTATTTTT 


ATGAAGATAT 


CATCACGGAA 


CCTTTTATTT 


TAGAGCAAGG 


GACAATGACC 


9060 


GTGCCTCAGG 


GACTTGGTAT 


TGGTGTTACA 


CTTTCCCAGA 


CGAATTTGCT 


AAAATACAGT 


9120 


CAGTATCAAA 


AAATCATGTA 


ATGATAACAA 


AATTGCGCCT 


ATGACTATGC 


AGAGAAGCAT 


9180 


AATCTAGGCG 


CAATTTTTTA 


TTTTTCTAAT 


TGACTCGCTG 


CGGCTTCATC 


CAAGAGAACC 


9240 


GTTAAATTTG


GATGGAGTTG 


AAGAATAGAG 


GCAGGATACG 


TTTCTGTGAC 


AGGTCCTTGG 


9300 


AGCACTTGTT 


TGACCATCTG 


AGCTTTTTGT 


TCACCATTTA 


CAATTAGGAC 


AAGATGCTTA 


9360 


ACTTTCATGA 


TGCTGGCTGG 


CCCCATCGTA 


ACAAAGGTCA 


TGCCTGGCTG 


CATCATTTCA 


9420 


GGAACAAACC 


AAGGGTCTGA 


ACCAGTGACG 


GTCACTTGGT 


AGGTTTCATT 


TTTAAAATGA 


9480 


GTGGTAGTGG


GCATGTTCCC 


GCAAAAATGA 


CCATCTGCTC 


CTAAGCCAAT 


TAACATCAAA 


9540 


TCTAAGCCGC 


CTGCGTCAGC 


AATCCGTTGT 


TCTTGTTGTT 


GAAAGTTTTC 


AACAGTTAAT 


9600 


GGATGGATGT 


TTTGCTCGGA 


TATTTCCGCA 


GGTGTCAAAT 


ACAGCGTCCG 


CAAATCTGTC 


9660 


AGAGTAATGC 


CTTCCTTTTG 


ATGAGGAACG 


GGGATTTCAT 


CAAAGTTATA 


ATAATGAACA 


9720 


TTATCAAAGT 


CAGATGAATT 


TTTCACAATC 


GAGGCTAATT 


TTTGGTAAAC 


AAGTGCTGGT 


9780 


GTTTTGCCAG 


CGGTAATTGA 


TAAATTCACT 


CGTTTATCTT 


GGCTCATCGT 


GCTTAGAAGT 


9840 


AACATTTTTG 


CCCATTCGCT 


CATCGTGTCG 


AAATCTTGTT 


TAATTAAAAT 


TTTCATACTT 


9900


GCTCACTCCA 


TTCTTTGAAT 


AAGCTAAGTA 


TAATTTGTGG 


GAACCATCCC 


ATGTCAAGCT 


9960 


CTTCACAAAA 


AGGATTTAAT 


TAACGGTAGG 


CAAAAGAAAG 


AGGAGCAGCC 


GTCAAAACGG 


10020 


CGACTGCCCC 


TCTTTTTAAA 


ACTTAAGATA 


ACCTACAACG 


AGTTAACTTT 


TTAGCTTTCT 


10080 


GTTCCACTTG 


CTCCAACGCT 


GGTCTAAACG 


TTCTTTTGAC 


CAACACTGTT 


TAACAAAGAA 


10140 


TTTTATTTTA 


TTGTTCTTTC 


AACCATTTTT 


CTAGTTGTAA 


ACGTTGCACT 


TTCATAGTAG 


10200 


CTGTGTGAGG 


GATTTCATCA 


AAAGCCATAA 


TGATAGGTTC 


GTTTAAATGT 


GGTAAATCGG 


10260 


CTACTTGTGT 


CCACCAAGCG 


TCCCAATCCA 


TTTCTTCCCC 


AGGGACAACC 


GCTAAAATTG 


10320 


GCTGAGGTTC 


ATTTTCTTTG 


CCACGGACAA 


TAATCACTTC 


TTCTAAGAAA 


TCTAATGCAT 


10380 


CCAATAAATG 


ATCTTCAATA 


GCTAAATTAC 


TGTCGATGTT 


TTCAATTAGA 


TCTACTTGGC 


10440 


GATCTTTCAA 


GAATAAGTGG 


CCGCGTTCAT 


CTAACATTCC 


GTAATCGCCG 


CTGTCCCACC 


10500 


ATTCGCCATA 


GACATTTTCT 


GCAAAGCGGG 


CATCTTCTTT 


GTAATATGTT 


AAGGCCCGTC 


10560 


CTTTTGATAA 


GAATTGGATA 


TGGCCGTCCG 


TCATCGCAGG 


TAAAACTTGT 


CCTTTTTCAT 


10620 


CGGTAATTCG 


AGCTTGTGTT 


AAATCCTCTA 


AACCAACGCC 


CATGTCACGA 


GCATCTGAAT 


10680 


CTTTTAGGGA 


TTCTAAAGTA 


TGCGCTCGTA 


AGATCATTGG 


CCCGCATTCA 


CTTTGTCCAT 


10740 


AAACTTGTAG 


GAAAATTGGT 


TCATTGGCAG 


CAGATGCTTT 


TAAGAAAGCC 


ACCATTGTAG 


10800 


CGTTGTTAAT 


TGCATCAAAA 


GTAGAATGAT 


AATAATGGAC 


ACTAGCGAAA 


GCTTCAGGTT 


10860 


TTTCTTTGGC 


CAAACGAACC 


CATTGAACGA 


AGTTATTGGG 


GTGTGTTTCT 


ACAGCGATTG 


10920 






GTTGATAGGT 


GCTGAACATT 


TCAGCGACTT 


TTTCGCTCGA 


AGCATTTGCT 


AAAGGCATCA 


10980 


TAGGGAAGCC 


CATGGCCATT 


AAAGAAGACA 


CGCCAATGTT 


AAAACGAGAA 


TGAACTGGCG 


11040 


AAATATGAAA 


GCCAACTAAT 


TTCTTTTCAG 


CAATTTTTGT 


AAAAATGGTT 


TTTTGCCATT 


11100 


TTGTACGCCA 


ACCCATAGAG 


TGATTGGAAT 


GACAGATAAG 


TTTAGGAATA 


CCTGTTGTTC 


11160 


CAGAGGTGTG 


GGTCATGTAG 


GCAATTTCGT 


CTTTAGTCAG 


TTCGTCTTGA 


GGAACTTCCG 


11220 


TGGCATCAGC 


TTCTAAAAGA 


GAAGCAACCG 


AAAGTTTTTT 


ATTGGCAGAG 


CTATTACGAA 


11280 


CAGCTTGAAC 


TTTTTCAGCA 


GTCACATCAT 


CAAATAAAAT 


ATAAGGATCT 


TCTAAACGAT 


11340 


CAACGAATAC 


TTCAATTGTT 


TCAAAAGGTA 


AATGGTATGA 


AACCATTACA 


GGAACGGCTG 


11400 


CTAGATATGA 


GGCTGCAACA 


GCTAGTAAAT 


ACGTGTCGAA 


TTTAGGACTT 


TTATAAATAA 


11460 


TAATTTTGTC 


GCCTTTTTTT 


ACACCTAAAG 


TGGCTAATTG 


ATATGCCCGT 


TTCAAAATTT 


11520 


CTTGATGGCT 


TTCACCATAA 


GTTGTTTCTA 


AGCCAAGTGC 


AGGAAAAGCA 


GGCAAAGATT 


11580 


CATCAAAAAT 


AATTGGAACA 


GTTGGTGTTT 


TTTCTGCGGC 


TTCTTGATAG 


TTTGTATAGA 


11640 


GGTTTAATGG 


TTGATAATCA 


AGTAAGTGTG 


TCAAGGGAAA 


GTCTCCTTTG 


TAAnATAGTT 


11700 


TGTGTACGAA 


TGTACAAGTT 


CACTTTATCC 


TATCATAGAA 


TGGGTCTTTT 


TCTACGTCAn 


11760 


AAAT 












11764 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11307 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



TCCAATATAA 


AGACACAACA 


GACAAACGaT 


GAGGAGTGTG 


AAwTAAtATG 


TTaACAATTA 


60 


TTATTGGGTT 


TATCTTTTGG 


ACAATGACAC 


TAATGTTAGG 


TTATCTAATT 


GGTGAAAGAG 


120 


AAGGCCGTAA 


ACATGAGTAA 


TTTAACAAAA 


CGTAAAAAAG 


ATTTATTTGA 


AATGAAAAGC 


180 


GTTGTATTTA 


AAGATATTTC 


AAAGCAACAA 


AGCGAAAAAG 


CACAAAAAAG 


AAAACGACTC 


240 


TTACAACTAA 


TGAATCAATA 


TCCCGATTGG 


GCAAGTCAAA 


AAAATAAACT 


TATTATGCAG 


300 


GAAATTCAAG 


AATTAGGACA 


AGCAATCGGT 


AATTGGTCGA 


TGGATCAATC 


AAGACCCATC 


360 


CAATCCATCA 


AGGCCGCATC 


GTTTACAAAA 


AGCGAGTATC 


TCTATTTAAT 


TTGGCTCGGT 


420 


TATTCAGATG 


AAGCGATTCG 


TCACGGCTTA 


GACATGTCGA 


AAGAGTGTTA 


TTTTATTTAT 


480 


CGATTAACAC 


TTTTAAATGA 


ATAAAAGTAA 


AGGAGATTAA 


CCAATGCGTA 


CGTCAACATT 


540 


TAATTATATC 


AAAGATATTT


TAGCAGACTT 


TTATAAAACA 


GAAGAGTATA 


TCCGTCAACG 


600 


GGAAGAAGAA 


TTACGGCACC 


CTTATCAAGA 


AGCAGATTTA 


AATGCTGGTA 


TTAGAGGACA 


660 


AGGACTTCAC 


TCTGTAGTGA 


CCGAACGAAT 


GGCGATTACG 


ATAGCTATGG 


aTCGTCGTCT 


720 
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GTGGAACTTa 


GAGAGAAATC 


GAGACATTAT 


CAAAAATTGT 


TTAGCCGAAG 


CGGaTGAACA 


780 


AACGCGCGTG 


ATTATTGAAG 


AACTATATAT 


GAAAAAACGG 


CCCTCTTTAA 


CATTAATTGG 


840 


ACTTGCCCAG 


CAATTATTTA 


TTAGTAAAAG 


CCAAGCCTAT 


AAATTAAGAA 


ATCATTTCTT 


900 


TGAAGCGGTG 


GCGGATGAAC 


TAGGGATGTA 


AACATGGAAA 


AAGCGTGGAA 


TTTTTTCAGG 


960 


TGTCAACATG 


GTAAATTAAT 


AGTGTCGAAA 


GAGATAGATA 


AACGTGAGGC 


AACCAAAAAA 


1020 


ATGAAGACAC 


GGAATTCTAT 


GATTTTGACT 


GCTTTCTTGT 


GTCAGCTATG 


AAGGAGCAGA 


1080 


AAATGCCGGC 


TACTTTCAAG 


ATCCTTCATT 


TTGACTAGAA 


GAGAGCCAAT 


TTGTTAACCA 


1140 


ATCCTGAATT 


TTTTGAATGG 


AAAGGTGGCG 


CTAAAAATGA 


ATGAAGCGGA 


ACAAGAGTTA 


1200 


TATGAAGCCC 


TTGTTGCAAT 


CTGCCAGACG 


TCaGGATTTT 


TGTTGCTAGA 


GGAACTGCCG 


1260 


ACAGATTTAC 


CAGATCAGCC 


ATTTGTTTAC 


TTAGGTGATA 


GTAAAGAATT 


ACCTAAGCCA 


1320 


ACTAAATCAG 


CTATTTTGGG 


AGAGATTGAA 


TTAATAATGC 


ATGTTTATGG 


TGCGTTATCT 


1380 


GAACGACAAC 


AAATTTCTAC 


AATTAAAGGA 


ACGATTTTAC 


GGCAGGCAAC 


CAGTAACTTA 


1440 


AAACGAACGG 


CTCATTTTAA 


TTGGGGTATC 


AAACATCAAG 


AAGTCAAAGC 


ACAAATGGTA 


1500 


AAAGATACCA 


AACAAATGAA 


AAAAACAATT 


TGGCATGCTG 


TACTACCATT 


ACACATGCAA 


1560 


TTTTACTAGG 


AGGAATTATC 


AATGGGAGAA 


GTTATGCAAG 


GAAAAGACCG 


TATTTTATTA 


1620 


GTTCGTCGCT 


TGGATGAAGC 


AGCGACAAAG 


AAAGCAATGA 


AACCCTTATT 


TCAAATTGAA 


1680 


CATGAATGGG 


AATTCTCACG 


TGAATCGAGC 


GGTACGCAAA 


CAAAAGATGG 


CGTCGCGAAT 


1740 


GCTGTTTCTG 


GTTTAGAAGT 


TACGTTATCG 


TTAAGCGGTT 


TAGCCTCTCG 


AGATGATGAA 


1800 


AATTTATACA TGaAAGACGC AgcGAaGATG 


GCATCTTAAT 


GGAATTTTGG 


GATGTTGATT 


1860 


TAAAAGGTGA 


AAAAAATGCG 


GAAGGTAAAT 


ATCCAGCAAT 


TTATGCCCAA 


GGTTATGTAA 


1920 


ATTCATGGAG 


TTTACCAGCC 


AATGTAGAAG 


AATTAGTAGA 


AATCGAAACA 


GAAGCCTCTA 


1980 


TTAATGGCAA 


GCCACAAGAT 


GGCTTTGCAA 


CAGTAGAAGC 


AGATATTATT 


GCAGAAGCAC 


2040 


AATATGCGTT 


CCAAGATACC 


GTTCCAGATA 


AAGCACCACA 


ACCTGGCGAA 


TAATCAAAAA 


2100 


GTGTTGAATT 


TTAGGAGGAT 


AAAAAATGAA 


TTTAGAGATT 


AACGGAAAAA 


CAATTGAAGT 


2160 


GAAATTTACG 


ATTGGCGCGA 


TTCGCGAATT 


AGATAAACGT 


TACCAAATTG 


AAAATGGCGC 


2220 


TGCCAAATTC 


GGCATGGGCA 


TCAGTTCAGC 


AATGATTTAT 


TTACGCCAAT 


ACAATCCAGT 


2280 


AATCTTAGTT 


GACATCATGG 


AAGCTTTACA 


AAGTGGGCAA 


TTAAAAATAG 


GTAAGTCGGA 


2340 


AATTGAAGCA 


TGGTTAATGA 


CCCAAGATGT 


CAAAAAACTT 


TCAGATGATT 


TGCTTAAAGA 


2400 


AATGGGAAAG 


CAACCTCTTA 


CAAAACCAAT 


GATCGATCAG 


TTCAGCAAAG 


AAGCGAAGAA 


2460 


AGCAGAAGCG CAAssgACCA ACTAATTAAA 


ACGAGCGATG 


ACGTGTATCA 


CGACATCGCT 


2520 


CTTTCTGCTT 


TTCGCTACTT


AGGCTGTCGT 


TCATTTGAAG 


AAGTGGATCA 


GATGACCATG 


2580 


TCTGAATTTG 


AATTACGAAT 


GATTGCTTTT 


AATTTAGCAG 


AAGTAGATGA 


AGAGCGGAAA 


2640 


AGGCACGAGC 


TTGCCTACTT 


AAATGTTAAA 


GCGCaAGCGA 


CAAACAAAAA 


AGGAAAACCC 


2700 
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GTTTTTGAAA 


GCTTTAAAAG 


TTTTTATGAT 


TATGAAAAAC 


GAGTTGCTGA


AGTTCTGGCA 


2760 


GCTAACCAGC 


CACAACGAAC 


AAAATTAAAT 


GAGCGGAAAA 


AAACGCAACT 


TGCCACTGTG 


2820 


GCAGAgCGTC 


TACGCCGCTA 


TCGAGAAGGG 


AGGAGAGTAG 


ATGGAGAATG 


ACAAAGAAAA 


2880 


AACGCCGTTA 


TCGGAGGCAA 


AGAAAAGCcT 


TGCAGGCGTC 


CAACAAGCAT 


TAAAAAGTAT 


2940 


GAGCGGTGAG 


TATGCCTTAT 


TAAGTGGATA 


TTTAGGGAAA 


ATTAGTGCGG 


GTGTCAATCA 


3000 


GTCAGCCACG 


GTCATGAACA 


CATTTAAAAC 


CGTCATGCAA 


CAATCTGGAG 


AAACAGTGAA 


3060 


AAAAACAGGA 


GACGAAACAG 


CAAAGGCAGC 


AGATCAAATG 


AACACAGCGT 


TAACAGATTC 


3120 


TGCTGAACAA 


GCCGGTGaAG 


CAGCTAAAAA 


AGCGGGGAAA 


GAAACCTCTG 


ATGGCTTTAC 


3180 


TAATGCACAA 


AATAATATGC 


TGAGCTTTGG 


GACGGCCATG 


ACTAGTGCCG 


TTTCCTTACC 


3240 


TATGCTGAAC 


GTTTTAAAAA 


CAGCTATGGG 


CGTCGGTGCT 


GGGGTCAGTG 


GCGAATTTCA 


3300 


AGGAATGCAA 


GGACTGATTA 


TGGCCAGTGC 


AGGAGGGATT 


TCTGATTCAT 


TGCAAGGCGA 


3360 


GTTGCAAGGG 


GCATTGACTC 


AGATGAATCA 


ATCATTTGAA 


GCGGCGGCAC 


AAGTGATTCA 


3420 


AAGCGTGATG 


GCTCCAGGAA 


TGGAAATTTT 


GGTTCAAGTG 


GTTATTACAG 


TCGTCAAAGG 


3480 


CATTACAGCT 


TTGGTTAATT 


TATTTATCAA 


ATTACCAAAA 


CCCGTCCAAG 


TTTTTATTGT 


3540 


TGCCATTATG 


GGCATTTTAG 


CCGCCATTGG 


GCCCATGTTG 


ATTATGGTAA 


CGATGGCTCA 


3600 


GCTAAAATtT 


CAACAGTTTA 


GTGCTGGTTT 


GGCTCTTGTA 


CAAGGAAACA 


TTGGGAAGTT 


3660 


AGGTGGTGGC 


TTATCAAAAC 


TAAGTGCTAG 


TTTTAGTGCC 


TTAGGTGGAG 


GACCATTAAT 


3720 


TTTAATTGTA 


GCAGCCGTTT 


TAGCAGCGGT 


AGCAGCGTTT 


ATTTATTTCT 


ATAAAACCAA 


3780 


TGaAACaTTT 


AGAAATAGTA 


TCAATAGCTT 


AGCTAGTGCC 


ATTCmAGGaG 


CTGTTTCAGC 


3840 


GGCGTTTGGC 


AAATTGGTAG 


GATTGCTACA 


ACAGATCCAG 


CCGGCCTTTC 


AGCAAGTAAT 


3900 


GGCAGTTTTT 


AAACAATTTT 


TTGCAGTAGG 


CTTAGAGAAA 


ATGGCGACTA 


TTTTTTCAAC 


3960 


AATTGGTCGT 


GTGCTAGCAG 


GCGTTTTTGC 


CAGCGGTTTG 


CAATTAGGTA 


GTAACTTATT 


4020 


AGGGCAATTT 


GGTGGCACCT 


TTGACAAAGC 


TGGTTTAGCG 


GTTGGTCTTT 


TGGTAAAAGT 


4080 


TCTGACAAAG 


GTTGCACTGG 


CTGCATTAGG 


AATTCTGGGC 


CGTTTGGTCT 


AATTATTTCC 


4140 


TTGATTGTTT 


CATTCGTGAC 


GGCCTGGATG 


AAAACCGGTG 


ATTTGAGTGC 


GGGTGGTATT 


4200 


ACCCAAGTCT 


TTGATAATTT 


AGGTAACACG 


ATTACATCGG 


TTACAACAAT 


GCTGGCAACT 


4260 


AATCTACCGA 


AAGTTATACA 


ACTTTTTACA 


ACAGTCTTAA 


CCAGTATTCT 


CGGGAAAATA 


4320 


ACAGAAGCTA 


TTCCAAGCAT 


CGTAACCGCG 


TTATCTAGTT 


TAATTACGTT 


AATTGTTGGT 


4380


GCGATCGTTG 


CCAATTTGCC 


AGTCTTAATT 


GAAGCGGCAA 


CACAAATTAT 


TACTACGTTG 


4440 


ATTCAGGGGA 


TTACAACAGT 


CTTACCAATG 


TTGATAGAAG 


TTGGTTTGAG 


CTTATTAATG 


4500 


ACTTTAGTTA 


ATGCGATTGT 


CACCGCCTTG 


CCAACAATTA 


CAACTGCAGC 


GATTAATATC 


4560 


ATCACTACAT 


TAGTGACAGC 


TTTTGTCACA 


GCGTTACCAA 


TGCTAGTTAC 


AGCAGGTGTT 


4620 


TCAATTATCA 


CGGCCTTAGT 


CAATGCATTT 


GTTACTATGT 


TACCGTTGAT 


TTTGACTGCT 


4680 
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Pfmmmivpjl Tv Tv 

GG T f i AGAAA 


1X1 1GA1GGG 


TV TV ATPRPT 

All AA 1 GAG 1 


PPP 7A TT 7A TP B 
GGGA 1 1 A 1 GA 


GGAl i I InLL 


TPBP.TTBaTT 


4740 


TV 7\ HPr™* 7\ P P P 

C AAT GAG G G G 


TP 7A PP 7A T T Tv T 

I GAGG A 1 1A1 


T , f"T , 7\/-'/T""P r P TV 

iGl AGGG 1 1 A 


G i GAGAGGG 1 


T P B T 7A (2 CL T CI f* 
1 GA1 AGG 1 G*w 


PTTappapap, 


4800 


ATTATCAGGG 


ptv rnTTr^ n 
GAGG1G1 GAA 


7V TTPTT A tv rp /— 
Al 1 Gl 1 AA1G 


GGG 1 1 AA1 1G 


AAGGAAi Iril 


TTPP.BTTTTa 
1 1 Lunl 1 1 1M 


4860


CCAACCTTAG 


TT. GGGGGAgG 


TATTACCTTA 


ATTTTGACAT 


TPPTaaaTPP 
1 GG 1 AAA 1 GG 


PTTBBTTPP.T 
G 1 1 AA1 1 GG 1 




GCCTTGCCAC 


TV TV _ . fTl /"» TV rn /*■ TV /"» 

AAaTCATCAG 


CGCaGGCGTC 


tv Tv T\ rn rp /-^ /~i rn Tv tv 

AAATiGC TAA 


1 GGG 111 G A 1 


PPTV APPPfsTT 

GGAAGGG3 1 1 


4980


ATTTCAATTT 


TACCGCAACT 


GGTTACTGCA 


TV TV fTl fTl TV /""'P/** 

GCAATTACGC 


rp tv nrfi'PTVPPPP 

1AA1 1AGGGG 


1 1 1 AA1 GGG1 




GCGTTAATCA 


ATGCGTTGCC 


ACAGTTGTTA 


AGTGGTGGGA 


TTP7\7\PTPI\T 

I 1 GAAG 1 G A 1 


TPS1PPPTTB 

1GAAGGG1 Irt 


5100


ATTAATGGTG 


TACTCAGTCT 


ATTGGGTGCC 


TTGCTGTCCG 


PTiPPnppTi TVP 

CAGCAGGAAG 


TVTTTV ATPTPa 

Al 1 AA1 GIGA 


5160


CAAATGATCA 


CGAAGATTGG 


TTCTTATTTT 


GGTCAACTGT 


TAGCTTCGGG 


PPPTVPTVPTTTi 

CGGAG AG 1 1 A 


5220


GTTGAAAATA 


TCAAAAATGG 


GGTTACCAAT 


TV P TV TV fTl/™ 1 

G C AG C C AAT C 


Tv r** rn TV Tv TV Tv TV TV 

AGGTAAAAAA 


rp^PPTVrnrT'r-'P'P 
1 GGGAi 1 GG1 


5280


TCTGTAATTG 


AAGGTGCTTG 


GCAAGCAATC 


GAAGGTTGGT 


TTTCAAAATT 


P 7\ f~* Ct^ Ti TPPP 

GAGGGA 1 GGG 




GGTGCGAATA 


TTGTCGGCAT 


GATTGCTGAT 


TV TV m m TV TV /"^ 

G G AAT T AC AG 


r^rtf^r^Ti tv rn m <r*» 

GCGCAATTGG 


-%7V7VTVPPPT\7VA 

aAAAGGGAAA 


5400


GAAGCAATCG 


ATGGAGTTGT 


tv rn 7\ tv tv tv m m 

CAGTAAAATT 


CGTAACTTTT 


rpTV/^OTVrprflfprp/^ 

TACGATTT TG 


AGGAGGAAAA 


5460


GAAGGTCCCT 


TATCTGATTT 


GCATAAATTG 


AATTTCGGCG 


/~> Tvr^/^T\ mmpp 

GCACGATTGC 


/->APPPPPT\ TT 

GAGGGGGA1 1 


5520


TATGCAGGCG 


AAACAGCCGT 


TAGTAGAGCA 


tv m ^ ^^r^mm y^m tv 

ATGGCTTCTA 


TTTTAGATTT 


Tv /-^/-'/"■r'^rp/^rprpTV 

AGCGCTG1 1A 




AATGATTTTG 


CCTTGGACTT 


AGCTGGTCGA 


/-i tv tv tv m m tv 

GGAAACTTCA 


/"» TV TV /-*/"• tv rn 

CGGCAACGAT 


Tr7\PP?\ TPPT 

TGACCArGGl 


5640


TTAGAAAATG 


ATGCATACAA 


TCGACCATTA 


mm tv ^ t\ 

TTTGTGACAG 


rnTV/^TV/^m/^A TVr^ 

TAGAGTCAAC 


PTTTVPATPPA 

GTTAGA1GGA 


5700


AAAGTTGTCG 


CAGCAACTAC 


GGCGCCTTAT 


m m TV /-»/"* TV TV TV /"^ 

TTAGCAACAG 


AGTTACAACG 


TV P TV TVP A A PTP 

AG AAG AAG 1 G 


5760


AAACAAAATA 


ACCGCTTAGG 


TV TV<""«/">7\/"'TV/-«**"'TV 

AAGGAGAGGA 


m tv tv tv fr* ti m 

TAACATGTAT 


tv tv T\rTinnni^rprn^» 

AAATTTGi IG 


TATTAPPTA TV TP a 

A 1 AGGAA 1 GA 


5820


AGCAACTCAT 


TCAACGCCTC 


TTCCTTCAGA 


tv rTT'TTPn TV rp 

AGCGTTGAAT 


TTaAAGGGGG 


TA7ATTTTTBP.B 
AA1 111 IMurt 




AAAAGTCATC 


CCTGGCTATC 


Tv TV TV /"» TV TV TI T\ 071/™t 

AAACATTATC 


tv O rp rn rp/-i tv /■*/■"• n 

AGTTTCAGGA 


PP7VP7V 7\ T* T* TV P 

GGAGAA1 1AG 


m mpp a a ppp a 


5940


AATTGAAAGC 


m m /■** tv tv rn m tv /** 

TATCAATTAG 


^/~" TV rn rn /"» m TV 

GGATTCGTGA 


rp /-•/-• rp Tv Tv Tv /"">/—• rp 

TGG TAAACG 1 


GAGG 1 1 1 A 1 G 


ppppaaTTPP 

GGGGrtMl ll~C 


6000


AGAACGAGAA 


TTAACAGTCA 


TV AT 7\ TPPPPT 

AA1 A1GGGG1 


T 1 TP TV r/^TPTr 
1 1 GAGG 1 G 1 G 


a a fa btpb ar 

Art. i Art 1 


V^M 111 VrfUnyn 


6060 


T GC AT TT AAT 


CATTTAAACG 


TTGCTTTGTT 


TAPfPTVA A7V7V 

1 AG GGAAAAA 


.PTV PPTTf PT a 

GAGG 1 1 Irt 


TTTP.PTTTaa 
111 VJVJ 111 MM 


6120 


CGATGAACCG 


GAAATGCTGT 


^/""•TTTPPOA P 

GGTT1GGGAG 


rp tv nprppmmoT. 
1 AAG 1 G 1 1 G A 


PTP Ti PTP a TP 

G 1 GAG 1 G A 1 G 


Tapppcaap.t* 


6180 


GTTAACCAAG 


TAACAGGCAC 


CTTTACTTTA 


1 1GG1 1 1G 1G 


TVTPPPTfiTBB 

Al GGG 1 rtirt/4 


a t a bp ap p,p, 


6240 


AGkATgGCGA CTAGTGTGAT 


GTGGGG TTGG 


GGAAGGA1 1 A 


PTV TTTPH a CP 

GA1 1 IGrtrtvaU 


OMM 1 1 MV-» AIM 


6300 


ATGGGGAATA 


CAGGCTCAGG 


TGCATTTGAT 


TTTCCAATTT 


rn TV TV rp rp 7\ T\pp 

TAA1 IGAAGG 


PPPPPPTTBT 
GGGoVji^l 1M1 


6360 


TGGGGATCAA 


CCATGATTAC 


CTTTCAAAAT 


CGGGCTTACA 


CGATGGGGGA 


TTTAGGCAAA 


6420 


GAAGTTCGGC 


CAATTGAAAT 


TTATCCTACG 


GTTGAAGGAT 


TAAAAGTCAA 


ACCGACCATT 


6480 


ATTTTAACAG 


GAACCGGACG


TGGTGTTTGG 


ATTAAAACAC 


GGAACGATAC 


AATTAACTTA 


6540 


GGAGACTTTG 


ATCGTTCGGA 


AATTATTATC 


GATACTGAAA 


ATTTTTATCT 


GACaAAAAAT 


6600 


GGTGCACCGA 


TGATTCGACC 


AATGAACGAT 


TTTTATCTAT 


ATCCCAATGA 


ACCGCTGTAT 


6660 
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ATTCAAGCCA AAGATAGCGA CTTCCGCTTG ACGATTCGCT ATCCTAACCG ATTTGTGTAG 6720 

GAGGGTGATT AAATGTTAAT GGCGCTGGAT TTGAAAAGAA CATATACGGC AATCTTGGAT 6780 

AATGCCTATC AAGTCAGTTA TGAAAAAATA GAGAACAAAA TTGGCAGTTT AGATTTTACC 6840

ATGCCACTAG ATGATCCTAA AAATGAATTT ATTGCAGAAA TGCAATGGGT GGAACTGACC 6900 

GACAATGAGA ATGAATAYAT TGGTTTATAT CGCGTGATGC CAACCACAAT TAAGAAAGAT 6960 

GCGAACAATA ATCAAATTCA TTACTCTGCC ACAGAAGCAT TATGTACCTT AGGTGATACT 7020 

GTCCTTTTTG GTTGTCACGA AATTAAAAAC AAAACAACGA AAGAGGCCAT TCAATTTTTA 7080 

TTGAATAAAC AAAAAACAAA GCATTGGGTC CTAAAAAAAT GTGATTTTTC AAGGAAATTA 7140

ACCTATAAAT GGGAAAATGA AAATGGGTTA GTTGAGCCTT TATTTAGCAT CCCAGCCGAT 7200 

TTCGAAGaGG AATATCTTTG GCAATGGAAT ACAGAGGTCT ATCCTTTTGA ACTTTCATTA 7260 

GTCAAACCGC CAACAGAACC AGTTGCGCGA ATTCAAGAAG GTTACAACAT GCAAGGATTT 7320 

GAAATAGAAC GTAATCCCAA GATGCTAATC AATCGGATTT ATCCATTAGG TTCAGGCGAA 7380 

GGTGTTAACA AAGTCAATAT TCGCTCGGTC AATCAAGGGG TTCCGTATTT AGAGAACAAG 7440

GCCGCAATTG ACCGCTATGG TTTATTGGAG TCAATTTGGG TGGAACAGCG TTTTTCTGAT 7500 

CCCAAGGCAT TAAAGGAAAA TGCTTTGCGA ATGTTAGAAG AATGGACCAA GCCACAAGTT 7560 

TCTTGGGTAG TGACTGCAGC TGATTTAATT AAATTAACAG ATCAACCTTT GGCAATCGAT 7620 

CGTTTGCGGT TGGGCACGGT TATCATGATT AATACGAATG AATTTGGGAG TGTCAACCTT 7680 

CGTATTAAAA AAGAAAGJ;-iA AAAAGATGTC TTTGGTGCCC CCCAAGACAT TCAGCTAGAG 7740

TTGGGAAACC TGCAAGAAAC AATTCATAGT ACCATGACAG CTTTCAGTCG GAAACAAGAG 7800 

ATTAACGAAA CTTACGCACA AGGGGCGACG ACACTTTTAA ATCGTTCAAT ACAAGGAGAA 7860 

CTTAGCAAGA CACAGCCAGT GGAGCTGAAT TTATACTTTG ACGAGGACAT TCTTTATATA 7920

AACACCGCAG AATTAACGTT CAAGGCAACT GCTAAAGGAC CTTCGCATTC TGTAACGAAC 7980 

ATTGATTTGG TAGTGGATGG CAAAAAATTA CCCCAACTAT CATTGCAACA ACAACGGCTA 8040

AACATTTTGA GTTATTTACG AAAAACAACA GATGAAAAAA TCGAACGCGG CAATCACACG 8100 

CTTCAATTTT TCTCTCATCA GCCACTATGG TTGGATGCTT CGGTCATCTG TCGTGTGTAT 8160 

ATTCAATCCC AATTGGGTGG CCAGTTTTAA TAAAATAATG AAAACTAGAG GAGTGTGACG 8220 
AAATGTCAGT AGAACATATT GAAGAATTAG ATACCCTGAA TCAAGGTCGC CTTAAAATCA 8280 
ATGCAATCTT GGATCAGTCG AATGCATCAG CTGAGAAAGT AGATGCTTAC CAAGTCCAGT 8340

TAACGAATGG AATTTCTGAA GCGAAAAACA TAGCAGATGA AGCTGGCAAA GAAGCCGTAC 8400

AAATTGCCAC CGATGCAGGC AATCAAGCmA aTGAAACAGC CAACCAAGCG aTGaACaATG 8460

CCAAAACAGC AaTCATGATT GCAGGaAATG CAGTTTCAAC GGCAAATAAT AATAAACAAG 8520 
AATTTGATAC TTTACGAAAT GATTTCGATC AATTAGTAGC AGAAGCGGGT GATAGTAATC 8580 
CAGAAATTGT CCAAGCACGT ACAGATACAC AAGGCATCAA ACAAGCTACC TTAGCGAATC 8640
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GTCTTCAAAT TGATTTGAAT GACCGTATGA CAAAAGCAGA CGGTATTTCT TTATTGGCTA 8700 
JVGCCAACTAC TGTCAAAATG AAGTTAGACT TTAACGGTAA AACGGCCGGC AATACAGCCA 8760

CCAATGCAAA CAGTTATTCC ACTGATTTTA CGGCAAAAAT TCTTAAGAAG CCAACAGAAG 8820 

TTTGGGAGGA AGTTTCCCAA GCGGACTACA ATAAAATGGC CAGCCGTGAT GATGAGGGCG 8880 

TGAAAACAGG TTCCACCCAA AGCGGTGTGA TTCCGCAACA ATTAGCGGCC TTCAATCTCG 8940 

TTGAAGCCGC TAAAAAATTA ATTCCACAAA TGTTTGAAAC AGTCACAACT GACGAGGCGG 9000 

TGGCATTTAT TCGCCAGAAC GTTCAATTTT TTACGATTAA TCAACGTGTG AAAGCCGCTG 9060 

CGCCCAATAA TCAAACGATT AAAATCGCTA CGTATTTACC AACTACGGAT AATTGGGTAA 9120 

CTCAAATCCA AGAATCAGCA AAAGAGTTTG GCGATTTTTC AATTCAAATC AATGATCAGA 9180 

ATTTTATCAC AGATGAAGGT TTCATTTATT TAATGAGCTA TACAGATTCA TCGAATGGGG 9240 

TAACGCCAGC TAGCTTAGAA GTTGATTACG TGGGGCTTCA TATTGGTCTG TCTGTTGATG 9300 

CCCAAGCGGT TTTAGCGAAG AGTGGTTTTG TTCAAGCAGA GCAACTCAAT ACCCATATGG 9360 

AAAATCAAGA TAACCCGCAC CAAGTAACCG CTGAACAAGT GGGGCTAGGC AATGTAGAAA 9420 

ATTATGGnTT CGCATCAGAC AGCGAAGCAG TCGCGGGAAC TTTAACGAGT AAATATATGC 9480 

ACCCGAAAAA CGTTGCGGAA GCGATTAAAG GTCAAGCTGT GACACAAACA GGTGATCAAG 9540 

AGATTGCTGG GGTGAAGAAT TTTGTAACTA TGCCAACCGT CAATGGTGTG CCTTtGGAAT 9600 

CCTCTAGAAT GGCCATTTAT GAAGCTAnTG GAGTTGGTGA AGyCGAGGCA AAATATCAGg 9660 

CGGCCTTTAA TAAGGAtAAT ATGAAATTTG TATTAATTAG GGTAGGAAAT CGTGTCGATG 9720 

CATTTGTAAG ATGTAATTTG AGTGATCCaA CGAAATTGAA TAATAATTTG GTTAAAGTGT 9780 

TTACTGTTCC AACAGGATAT ACATTaTCGA CGAAGATTAC AAAGGGAATA TGGAATyTGG 9840 

CGTTAACTGC TATGCAATAT ACATTCCCTC AACCGAATTG TGCTGGTTTA TATGAGATGG 9900 
GAAATCAAGG AATTCTTTTT GGTGCTAACC GTGCTGGAAA TATTTACCTA CAAGGAAGTT 9960 

GGTACACGGA CGATCCGTTT CCAACAAAAT AACAAGCTAG TTTAGGAGTC TTTTTATGGA 10020 

ACGTTATCTC AACACAATAA CAATGCTTTT AAGCATTTTC GGTGGGATTG TCGTACGTTT 10080 

ATTAGGCGGA TTAGATCAAT TGTTGGATGT CTTCCTCTTT TTAATTATTG TCGATTTCAT 10140 

CACAGGTTGG ATTAAGGCAA TCGCCACAAA AGAATTGTCC AGTCGGATTG GTATGCTCGG 10200 

AATTGCGAAA AAAGTGACGA TGTTATTTGT GGTTGCCGTA GCGGTTCGTG TTGAAAAAGT 10260 

TGTGGGGAAC AATTTGCCAA TTCGGGAAAT GGTTCTGATT TTTTACATTG CGAACGAAGG 10320 

ACTTTCTTTT TTTGAAAACA TTGCGACCTT TATTCCTATG CCGAAAAAGT TAAAAGAGTT 10380 

ATTTATTCAG TTAAAAAATA AAGATGATTA AGTAGAAGTG GTCGGGACAA ACGTAGAACT 10440 

TTCGcTGATT GCCGAAGAAA TTACTTCTGT CCCGCCATTT ATCTGCAGGT TTAAGCCGTG 10500 

GAAGGGAAGT TATTTTGACT TTCCTTTCAT GGCTTTTTTA AGAAAGGAGC ATGCTATGTT 10560 

TAAAAAATTA ATGATTCAAC TTGCTTTAGT GATTGGTTTA AGTTTAACGA TTCCGATGAC 10620 
GGctTrCGCT TACACCATCG AAGCGGATCC AATCAACTTT ACJTATTTTC CCGGCTCTGC 10680 

AAGCAATGAA TTAATTGTTT TACATGAATC TGGAAACGAG CGGAACCTAG GACCACACAG 10740 

TTTAGACAAT GAAGTGGCCT ATATGAAACG AAATTGGTCA AATGCTTATG TCTCATATTT 10800 

TGTCGGATCT GGTGGACGAG TGAAaCAATT AGCTCCTGCT GGCCAAATTC AATATGGCGC 10860 

AGgTTCTTTA GCTAATCAAA AAGCCTATGC GCAAATCGAA TTGGCTCGAA CGAATAATGC 10920 

GGCGACATTT AAAAAAGATT ATGCTGCCTA TGTTAATTTG GCCCGTGATT TGGCTCAGAA 10980 

CATTGGTGCT GaTTTTTCTC TGGaCGATGG AACAGGTTAT GGCaTAGTCa CTCATGATTG 11040 

GaTTACAAAA AATTGGTGGG GAGATCATAC AGATCCTTAT GGtTATTTAG CGCGTgGGGG 11100 

ATtAGTAAAG CGCAtTGGcA CmAGATTTaC aAcGGGCGTT TCyGrAACAg GTGaGACTGG 11160 

TCATTATTCA GCCAGGTAAA CCTAATGCGC CAAAATATCA AGTAGGACAA GCAATTCGTT 11220 

tCACTTCAAT CTATCCAACA CCAGATGCTT TAATCAATGA ACATCTATnA GCAGAGGCAC 11280 

TTTGGACACA AGTAGGAACA ATTACAG 11307 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

AAAGCACAAA TGCAACAAGT TGTGATAGCT ATTCAAAATA CGGAAGCCTT TCAGTTATTG 60 

AATGTAATGA AGGTTATAGG AGATGAGTAA ACATGTATGA AGGTTTGTTA AAACAGTATC 120 

AAGCGTATTT GCCCGTAACA GAAAAAACAC CAATGATTTC TTTAGCCGAA GGAAATACGC 180 

CACTGATTCC TTTACCTAAT TTATCCAAAG AATTAGGTAT TCAATTATAT GGAAAATACG 240 

AAGGCTTAAA TCCCACTGGC TCATTCAAAG ATCGCGGGAT GGTGATGGCT GTAGCAAAAG 300 

CGGTAGAAGA AGGTGCCAAA GCGATTGTTT GTGCTTCTAC AGGAAATACT AGTGCCGCCG 360 

CTGCTGCGTA TGCCACAAGA GCAGGCATTA AAGCGTATGT AGTGATTCCA GAAGGGAAAA 420 

TCGCACTGGG AAAATTGGCG CAAgcTATTA TGTATGGGGC GGACATTATT TCGATTCCAG 480 

GTAACTTTGA TGAAGCATTA AAAGCGGTGC GAGAGATTGC GAAAACTGAG GCAGTTGCGC 540 

TAGTTAATTC AGTGAATCCT TATCGCTTAG AAGGGCAAAA AACAGCAGCT TTTGAAGTTT 600 

GTGAACAATT AGGTCAAGCG CCCGATGTCT TAGCGATTCC AGTTGGAAAT GCCGGCAATA 660 

TTTCCGCTTA CTGGAAAGGC TTTAAAGAAT GGCATGAAAA ACAAGGAACA ACTTTACCAA 720 

GAATGCATGG TTTTGAAGCA GAAGGTGCCG CAGCGATCGT GAAAGGCCAA GTGATTGAGC 780 

AACCTGAAAC CGTGGCAACA GCGATCCGGA TTGGGAATCC TGCTAGTTGG CAATTAGCAG 840 

AACAGGCACG GGATGAATCC GGTGGTTTCA TTGATGCCGT GACAGATCAA GAAATTTTAA 900 

CGGCTTACCG AAAAATTGCT GCTCAAGATG GGGTTTTCAT TGAACCTGGT TCGGCTGCTT 960 
CATTAGCTGG AGTTATTCAA CATGTTAAAA GCGGTAAAAT CAAAGCAGGT GAAACCGTCG 1020 

TGGCAGTCTT TACTGGTAAC GGACTGAAAG ATCCTGATAC AGCAAT GGACTACAGAAGTAG 1080 

CTATTTCAAA AATGAGTGAC GTGGAAGaAA TGCGTCTGcA CTTACgAAAA GGAGTCGCAA 1140 

CACTATGaAA ATAAGAGTAC CTGCCaCtAG TGCCaACTTA GGACCTGGCT TTGATTCATG 1200 

TGGCATTGCA TTATCCGCTT ATTTAACAAT CAACGTTTTA GGTGAAAGTG AATTTTGGGA 1260 

AATTCAACAT ACTCTGGGAG AAGAGATTTC TACTAATGAA GAGAATTTAT TAATTCAAAC 1320 

GGCATTGAAA ATTGCACCAG AATTAACACC AAAAGTTATT CGGATGGTTT CTGACATTCC 1380 

CCTTGCAAGA GGACTAGGCA GTAGCTCAAG TGTGATTGTA GCAGGCATTG AATTAGCTAA 1440 

CCGACTAGCA CATTTAAATT TGTCCCCTAA AGAAAAAGTC CGTTTGGCCA CAGAGATGGA 1500 

AGGACATCCG GATAATGTGG CCCCAGCTAT TTTAGGAGAT TTCGTGGTGG CTAGCCATGT 1560 

AGAAAACCAA GTCTACCATG TCAAACATCA CTTTcCAATG tGTGATGtCA TTGcATTTAT 1620 

TCCAGAGGAA CCATTATTTA CCGAAAAAAG TCGCGCTGTT TTGCCCGAAA AATTGGCTTA 1680 

TAAAGAAGCG GTCGCGGCCA GCTCAATTGC CAATGTTATG ATTGCCGCAA TTTTAAACGG 1740 

GGATTTACCT TTAGCAGGCA AAATGATGGA ACAAGATAAA TGGCATGAAA CGTATCGTCG 1800 

TTCATTGGTT CCTCACTTAA AAGAAATTCG TCGTTTAACT CAGCAAAAAG GTGCTTATGG 1860 

CTCTTTCTTA AGCGGTGCTG GTCCAACTGT GCTAATCTTG TCGCCAGAAG AACGAACCAA 1920 

CGAAATTGTT CAAAGCTTGG AGAAATTATC TACCAAAGCG AGCATCCAAA TTTTCAATAT 1980 

TGATCAAGAA GGTGTTCAAG TCTTTTAAAA AAAACGAGAA AAAATCACTA GTGATTTTTT 2040 

CTCGTTTTTT TTAATCCTCG GATGTTTCCG TGGAATAATA ACTATGATCT TCTAAGTTCA 2100 

AAGCATTAAG AATCATTGAG GCAGTTTCTT CAATTGAAAG AGAAGCGACA TCAATAACGA 2160 

TACAACCTAA TTTTTGATAA AGATCATTAG CAAAAGCCAG TTCTCGCCGA ATTTTTTCAA 2220 

TATCTGAATA AGAAGTATTT TCAGGCAATC CATATGCGCG CATTCGTTCT TTACGAATCC 2280 

CGTTTAAAAC ATTGGGATCA TTGGTTAAGC CAACAATTTT TTTAGGGTCC ATTTCGAATA 2340 

ATTGTTTCGG TAAATGTGCT TCAGGAATTA GTGGCAAATT AGCAACTTTT AAGTTCTTGT 2400 

TGGCTAAAAA GAGACTTAGC GGTGTTTTAG AAGTTCGGGA AACGCCAAGA AGTAACACAT 2460 

CCGCTTCTAA AAAGCCTCGT GGGTCCTTGC CATCGTCATA TTTGACAGCA AATTCCATTG 2520 

CTTCAATTCG TTTAAAATAA TTTTCATTCA GATGATGTAA GGCACCTGGT TCCCGCATGG 2580 

GCGCGACACC TGTTAATCGT TCGATTTCCG CAACTGGTGG TGTCAAAATG TCAAAATGGA 2640 

ACAATTGATG CTCATTGAAA AATTGATTAG CAATCGCCAC GAGATGATCA TTAATGATms 2700 

tATGCAGTAC CATGGCCTCT GCTTTTTTGG CATCTTCTAA CGCTTGCACC AGTTTATCTT 2760 

CGTCTTTAAC GAATGTTCGT CTAATCAAAG TAAAATCAAC AGTGGGGTAC TGGGCCATTG 2820 

AAGCAGCCGC CAATTTTGAA GCGGTCTCGC CAGCAGAATC TGAGAT 2866 
(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



GATTTnAAAn nAATTAATTG AAAATACCCG CTGsCAtGaC ACGGGAAGGG aTTACGAAAG 60 
AAATTCCAAT GGACGAAATT GTTCCAGGAG ACATTGTAAC TTTAGCTACA GGAGaCATGA 120 
TTCCTGCAGA TGCCGTGTTG ATTTGGACCa AAGATTTATT CGTTAATCAA TCgTCGCTAA 180 
CAGGCGAATC CATGcCyswC GAAAAATTTG TGGATGCAGG TGTGGATCGT CAACAAACAG 240 
AGGTTTCTGC ATTAGATATG CAAGACCTAG TGTTTATGGG CACAGATGTC TTGAGTGGAC 300 
AAGGAAAAGC GATTATTTTA AAAACTGGGC AACATACATT TTTTGGTGAT ATTGCTAAAA 360 
ATGCTACAAC ACAACGAGGA AAAACTAGTT TTGACTTAGG CTTAGCAAAA GTCAGTAAGT 420 
TTTTATTACG AATGGTAATG ATTTTGTTTC CAATCGTTTT CTTGATTAAT GGTTTAACTA 480 
AAGGTGCTTG GGGCGAAGCG TTTTTCTTCG CGATTGCCGT TGCAGTTGGT TTGACACCAG 540 
AAATGTTGCC AATGATTGTT ACGAGTAACT TAGCCAAGGG TGCATTATCC TTATCGAAAC 600 
ATAAAGTCAT TGTTAAAGAA TTAGCGGCTA TCCAAAATTT AGGTGGTATG GATGTGCTCT 660 
GTACAGATAA GACTGGAACG ATTACCGAAG ATCGTGTAGT TTTGGTTCAA CATTTGAATC 720 
CATTAGGTGA TTTGAGTGAC GAGGTTCTAA ATTTAGCCTA CTTAAATTCT TCTTATCAAA 780 
CAGGTTGGAA AAATTTAATG GATATTGCCG TTATTAATTT TtATGAAGAG CATCAATGGA 840 
AAACGCCGTT TAAAAATGTC ACTAAAATTG ATGAAATTCC TTTTGATTTT TCTAGACGCC 900 
GCTTAACGGT TGTCGTGAAT GCAGATGATC ATCAAI lb A. I \J O OvwVJ A, \j\J 960 
AAGAAATGGA AGAAGTGTGC ACGCATGCGC AAATTAATGG CGAAATTGTA CCATTAAGCA 1020 
CTGCTGTACG AGAAGAACTA CGCCGAGTGA ATGTTCAAAT GAACAAACAA GGGATGCGTG 1080 
TGTTAGCGGT AGCTGTAAAA AAAGATGTCC ATAAAGAAGC AGTTTATTCA GTTGAAGATG 1140 
AAAAAGAGAT GACATTGATT GGGTTTATGG GTTTTCTTGA TCCAGCAAAA GAGTCAGCCG 1200 
TCAGCGCTAT CCGCTCCTTA CATGAACATG GTGTCAACGT GAAAGTGCTA ACGGGCGATA 1260 
ATGACATTGT CGCCAAAAAA GTCTGTAAAG ATGTAGGCAT TGAAGTTTCC CATGTGTTAT 1320 
TAGGTTCTCA AATTGAAGCA ATGACAsnTG AAGAGTTGCG GGCGCAAGTA GAAGAAACTA 1380 
ATTTATTTGC CAAATTAAAT CCGATGCAAA ATCTAAAATT ATTGAATTGC TACAAGCAAA 1440 
AGGTCATACG GTTGGTTTTA TGGGGGATGG CATTAATGAT GCACCGGCAT TACGAAAAGC 1500 
TGATGTGGGC ATTTCAGTAG ATACTGCAGC TGATATTACT AAAGATGCTA GCTCGATTAT 1560 
TTTACTGGAA AAAAGTTTAA ATGTACTGGA ATCTGGTGTG ATTGAAGGAC GAAAAGTATT 1620 
TAGCAATATG ATGAAATATA TTAAAATTAC CATCAGTTCT AAT 1663 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

CTTAATGGnG ATGGnAAGGG GATATTATTG GTTATTTTTT GAGnATATAT GATATCCTTT 60 

ATTATGTAGG AATCnTATTA TATGGsAGGm ACGAGCGATG AAAGTTATTG TTTTAGGATC 120 

ATCACATGGA GGTTATGAAG CGGTAGAGGA ATTACTAAAT CTTCATCCTG ATGCAGAAAT 180 

TCAATGGTAT GAGAAAGGTG ATTTTATCTC ATTCTTGTCT TGTGGCATGC AGTTGTACCT 240 

AGAAGGAAAA GTGAAAGATG TTAATTCTGT TCGCTATATG ACTGGCGAAA AAATGGAGAG 300 

CCGTGGTGTA AATGTCTTTT CTAATACTGA AATTACAGCG ATTCAACCAA AAGAACATCA 360 

AGTGACAGTG AAAGATTTAG TGTCAGGTGA AGAACGTGTT GAAAATTATG ATAAATTAAT 420 

CATCAGTCCC GGAGCTGTCC CATTTGAATT AGATATTCCA GGTAAAGATT TGGATAATAT 480 

TTACTTGATG CGTGGTCGTC AATGGGCCAT TAAATTAAAA CAAAAAACAG TAGATCCAGA 540 

AGTCAATAAT GTGGTTGTGA TTGGTAGTGG TTATATTGGG ATTGAAGCTG CCGAAGCATT 600 

TGCAAAAGCC GGCAAAAAGG TTACTGTTAT TGACATTTTA GATCGTCCAT TAGGGGTATA 660 

TCTAGATAAA GAATTTACAG ATGTTTTAAC AGAAGAGATG GAAGCTAATA ATATTACCAT 720 

TGCAACTGGT GAAACAGTTG AACGTTACGA AGGCGACGGT CGTGTGCAAA AAATCGTTAC 780 

AGATAAAAAT GCGTACGATG CTGATTTGGT CGTTGTAGCG GTTGGTGTCC GTCCAAACAC 840 

TGCTTGGTTA AAAGGTACCT TGGAATTACA TCCGAATGGC CTAATCAAGA CGGATGAATA 900 

CATGCGGACA AGTGAGCCGG ATGTATTTGC AGTAGGGGAT GCTACGTTAA TTAAATACAA 960 

TCCTGCAGAC ACAGAAGTAA ATATTGCCTT AGCAACGAAT GCTCGTAAAC AAGGTCGCTT 1020 

TGCTGTGAAA AACCTAGAGG AACCAGTTAA ACCTTTCCCT GGTGTTCAAG GATCTTCTGG 1080 

CTTGGCCGTC TTTGATTATA AATTTGCTTC AACAGGGATT AACGAAGTCA TGGCTCAAAA 1140 

ATTAGGAAAA GAAACAAAAG CGGTGACAGT AGTAGAAGAC TACTTGATGG ACTTTAATCC 1200 

AGACAAACAA AAAGCTTGGT TTAAATTAGT GTATGATCCT GAAACAACAC AAATTTTAGG 1260 

CGCTCAATTA ATGTCGAAAG CAGATTTAAC TGCAAACATT AATGCTATTT CATTAGCGAT 1320 

TCAAGCCAAA ATGACGATTG AAGACTTAGC CTATGCGGAC TTCTTCTTCC AACCAGCGTT 1380 

TGACAAACCT TGGAATATTA TTAATACAGC GGCTTTAGAA GCGGTGAAAC aAGAACGTTA 1440 

ATAGAAGAGA mAATAAAACC CATGAGTATT TATAGTACTC GTGGGTTTtA TTTTTATTTT 1500 

GTATTCAATT GATTTTTCAA CTCTTGTTGG ACACGAGTTA ATTCTTGTTC ATCCACACGT 1560 

TGATAGGAAA CACCATCTTG CATAAAACCA GTACCTTGAA GTTGGTCTTG TTTCACTTTA 1620 

CCAAAGGCAC TGCGATAATC TAAGGCAATT TTTTTCATGT CATCAAAACT TAAATCTGTC 1680 
TTCATATTAT CAGAAACAGC TGTTAAAATT TCTTGaTAGT TGCTTACGCT GTTAAGACTT 1740 
AAGACTTTTt GGACGATGCC TTCAATCACT TTTCTTTGAC GTTCTTGGCG GCCGTAGTCA 1800 
CCATTAGGGT CTTCGTAACG CATTCTTGAA TAGGAGAGTG CTTGTTCACC ATCCAATGAA 1860 
ATTTTACCAA TCGTAAAATC ATATCCGTCT TGAGAAAAAG TCAGATTATT GTTCACTTCG 1920 
ATTCCGCCAA CCGCGTTGAC TAATTCTTTT AAACCAGCCA TATTAATTGA AACATAATGA 1980 
TTAATAGGTA TGTTTAAATA GTTTTCAACT GTGTCCATTG CTAAAGATGC GCCACCAAAA 2040 
GCATAGGCGT GATTCAATTT ATCTTGTTTT CCTTGACCTG GAATATCAAC ATAGGTATCG 2100 
CGAGCAAGAC TGACTAAGGT TGTTTGCTTG TCACGAGGAT TAACTGTTGC AACAATTGTT 2160 
GTATCCGAAC GACCTTGCTC GACACGCCCA TCATCGCCTG TATCAATCCC TAATAATAAA 2220 
ACAGAAAAAG GCTCCTTATT GTTTAAATTG ACCTGACTTT TTTTAGATCG TTCTACTGTT 2280 
TCATAGGTTT TATCCATTGA TTTAGAAACA TCCCAATAAA GTTTTGCTCC CATtCCAaCA 2340 
ACTGCAaGAA aTAGAmCCAa GATAATTCCA ArGATAATGG CAAAAATTTT CTTCCCTTTT 2400 
GACATATTTT yCCTCCAATA ATTTATCACA GGTTGTGATT TTTTTCTTAT AGGTTAATCT 2460 
TACTAATTTT TAGTGAAGAA GTATATAATA ATTGTTGAAA GACTATAAAA CTTTAAGGTA 2520 
TTTTGCATTA TTTTTGCTTA ATCTTTTGAA ATATCTCAAT TCTATTAAAA TATGCGAATT 2580 
AGTATATTAC ACGTTTCCAA TGTGTTTCAT TGTTGATTAA TTAGAATTTT AGTGGTATTT 2640 
TTGTATTTGT AATTAGTACA GATTTTAATT GGAGGTATAA CAGTGAGTAA AAAAGGATCA 2700 
GATATCATAG TAGAAALTTT AATCAATCAC GAAGTGCCCT ATATTTTTGG GATTCCCGGT 2760 
GCAAAAATTG ATGGGGtTTT TGATGCATTA GTtGACAAGG TCCAGAATTA ATTTTAGCTC 2820 
GACATGAmCA AAaCGCTGCC TtTATGGCAC nAGGGaTTGG TCGTTTGACG GGAGAGCCTG 2880 
GCTTAGTTAT TGCAACTAGT GGACCTGGTG CCAGCAATTT AGCAACTGGG TTAGTAACTG 2940 
CGACTGCAGA AGGGGACCCT GTTTTAGCAT TGGCTGGACA GGTGAAACGG TCCGATTTGT 3000 
CAAAATTAAC TCACCAAAGC ATGAATAATG CTGCATTATT TAGTCCAATA ACAAAATTCA 3060 
GTACAGAAAT TCAAGATCCA GAAACATTAT CTGAAAATAT TGCCAATGCT TACCGTATCG 3120 
CCAAAACAGC GAAAAAAGGC GCAACATTCT TATCGATCCC ACAAGATGTC ACAGATAGTC 3180 
CGGTGACAGG CGAAgcAATT AAGCCTTTGT CGGCTCCGAA ACTGGGCCAT GCATCAGCAG 3240 
GAACAATCGC AGCTTTAGCT GAACGAATCA AGCAAGCGAA ATTACCTGTT TTACTTTTAG 3300 
GAATGCGGGC CTCTGCACGA AAAGTTACCG CAGCCATTCG TGAATTGGTT GCTATAACTG 3360 
GTTTGCCAGT CGTGGAAACG TTCCAAGGAG CAGGCATTAT TTCAAAAGAA TTAGAAGAAC 3420 
ATTTCTTTGG GCGTGTTGGT TTATTTCGAA ACCAACCTGG GGACCGTTTG TTAAAAAGAA 3480 
GTGATTTAGT GATTGCGATT GGTTATGACC CAATTGAATA TGAGGCACGT AATTGGAACG 3540 
CAGAAAAAGA TGCGCGCGTT ATTGTTATTG ATGAAGTTCA AATGGAAATT GATCAATATA 3600 
TGCAACCGGA AGAAGAGCTA ATTGGTGATA TGTCTAAAAA TATTTTGAAA TTATCAGAAG 3660 
CATTTAGTGA ACCAATATTA ACAGAAGATG CGCAAGATTA TTTAGAAACG CTTCAAGAAA 3720 

AATTAACGAT TAAAGAAGTA AAAACAAGTA CAATTGAAAA TCGATTACAT CCATTAGAAA 3780 

TTGTTCAAAC GCTTCAAGAA AAAACCACGA ATGAAATGAC AGTTACTGTT GACGTAGGGA 3840 

GTCACTACAT TTGGATGGCT CGCCACTTTA GAAGTTATGA ACCGCGTCAT TTATTATTTA 3900 

GTAACGGCAT GCAAACATTA GGTGTCGCTT TGCCTTGGGC AATCTCAGCT GCACTAGTTC 3960 

GTCCAAATAC ACAAATTATC TCTGTTTCTG GGGACGGCGG TTTCCTATTT TCGGCGCAAG 4020 

AATTAGAAAC AGCGGTTCGT TTAAAGCAAA ATATCGTGCA TATTATTTGG AATGATGGTA 4080 

GTTATGATAT GGTTAAATTC CAAGAAGAAA TGAAATATAA TCGATCCTCT GGCGTAGATT 4140 

TTGGTCCAGT TGATTTTGTG AAGTATGCTG AAGCTTTTGG AGCCAAAGGG GTTCGAGTTC 4200 

ATTCACAAGC AGAATTTGCC GCAGCTTTAG AAGAAGGAAT GCAAACAGAA GGCCCAGTGA 4260 

TTATTGATGT TCCTGTGGAT TATTCAGATA ATCmAGAATT AGGTAAAaCA TTGTTACCAG 4320 

ATCAACTTTA TTAGGAGATG GAAAAATGAG CGAACAATAT GTTTATCAAC ATGGTACGCT 4380 

AGGCGGTTTA ATGGAAAGTT TGATGGCAGG AACGGCAGAA ATTGGTACGT TACTCACGCA 4440 

AGGTGATTTT GGAATTGGGA CATTAGAAGG CTCAAATGGC GAAATTATTT TATTGGACGG 4500 

TACATTGTAT CATGCAAATC AAACTGGCGA AATTACTATT CTAGAAGGCG AAGAATTAAC 4560 

GCCATATGCC GCAGTTACTC GTTTTCAAGA AGATGGCGCA TTCCCTGTAT CAACGGAAAC 4620 

CGATGAAAAT ATTAAAGCAC AAATTTTAGA AAAAATTAGT CCTAATTTTT TTGCGGCAAT 4680 

TAAAATTAGC GGTCTTTTTG CGAAGATGCA TGTCCGTGTG GCACCTAAAC AAGAAAAACC 4740 

GTATCCACCA TTTGTAGAAG CAGCACGCAA TCAACCTGAA TTTACAGCGG AGAATATTCA 4800 

GGGAACAGTT GTAGGGTTCT TTACACCTAA ATTATTTCAT GGTGCCTCTG CCGCAGGgTT 4860 

TCATTTGCAC TTTATCAGTG AGGATCACCA ATTTGGTGGG CACATTCTTG ATTTTGGCAT 4920 

TAAACAAGGG ACTGTTTCGT GGATGGAAAC AGCAGAATTG CGACAGCATT TTCCAGTTCA 4980 

TGATGCTGAT TATCGGAATA AAGAAATTGA TATTGCAAAA GCTTTGTCCG CAATTGAAGA 5040 

AGCGGAATAA CTGAATAACA AGGCACATTT CTCCACGscC ATTCAAGTTG ACTAACTAGG 5100 

TCAACTTGAA TGGGCTTATT GCTGTTCGTC GCTTTTTGTT GTTATTTTTC AAAAAAAAAT 5160 

AGTATCCTTT TTATAATAAG CAAATAACAA TTGGAAAGGA TGGCGTAATG GACACACTTT 5220 

ATAGAAGTTG GCAATTGTCT GGTTGGCTCT ATCATGATAT TTTTGTTATC ATTGTAGCAA 5280 

TTATTTTTAT AGTAATTAGT GGTATTTTGG TTATTTCCTT AATAAGAAGG CGGAGTACAA 5340 

GGAGACTGGT TCCCTATGCG TTAATTTTAT TGGTTTATCT TGCAGTGGTA CATTTTGCTG 5400 

GTTTAATCTT TTTCGGAATG TTTCGTTCAG TAACGATTGA AGAGAAAAGT GCGACGTTTT 5460 

ATTCTGAAAA AACAAAAGGC TTAACTTCAA TCGAGCGAAT GATTATTCCA AATGGACGAA 5520 

CGAATGGCAT TAGCACCAGT AACTCATTGT TCCCAAGTGG ATTTCnGTTA ATAGTCAGAC 5580 

GGGCCGAACG AATGTGGnAG TAAACGTTTA AGTTGGCCGT GATTAATTTA ATGGnCCAAC 5640 




AGATCCATAG GGGGTTTTGA nAA 5663 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

AAACAGCATA GACAGCAATT GAGTTATGGC CmCgGTTtGA AGcwTwAAGG rAACgGCCaT 60 

CAGArGAAaT ACGrATCGsA GcmCCmCCmT TAAACsCATC ATAATCAGCA GGTAAAGTAG 120 

AAACTTTTTG TAAtTCaGTG AAAGaACCAT CTGTTyCATC ATATGAAAGA ACTGAAACAG 180 

TGCTATCTAA TTCACCAAAT AAATAAGCCA ATGATTTTGT TGGGTGGAAC ACTAAATGAC 240 

GTGGACCAGT CCCTGGCTCA GTCACGAATT GAGCGATTTC AGAAAGCTTT CCATCTAAGC 300 

TAACATTATA CGTATACACA CGATCGGTTC CTAGATCACA GACGACTAAG CGTTGATCAG 360 

GCGTCAAATC TGTGTAATGA ACATGGGCAT TGTTTTGGTT TTCATGAGGA CCAGTTGGCT 420 

CTGTATGAAT CACTTTGTCA GCATCTTCCA AAGAACCATC TTCTAAGATA CGATACACGT 480 

GAACGATACC TTGGTGATAA TTTGCTGCaT AGACAAGTTG TCgGTtyTCG TCaACAGCTA 540 

cATAAcATGG CGGCGCACCT TCCtCaGTTA CTTTAtTCaA TAAGTTAAAA tGGTCCATTT 600 

CTGGTTGATA AGCGGCAGTG CCACCTTCAC CGTCAACACT AGTGACGCTA AAAAGTTCGC 660 

CCTTTTGACT CAGAGCAAGA TAGGTAGGAC TCGTTTCTTC TAACAAAAGT GTCGCTTCTG 720 

TTAAACGACC TTGTGTCGTA TCTAATGCAA TTTTATAAAT GCCTTGGCTG GCTCTTCGAG 780 

TATAGGTACC TAGTAAAATT TGTTCTAACA TCACAATTCC TCCAATTTGT TTCAACACAA 840 

TCACTTTTTA TGAAAGTGCT TTTATATAAT GTAGTATACC aTAAGAAGTC TAAGAACAAT 900 

AAAACTTTCT ATCTGAGATG TTCgCaTTTT AAAgtGGTTT GtGCtATTAT GaTGaGaGAC 960 

tGTGaTGACa GTCaTTTTTT gaTTTTTAGA GGGGTGACGA ACATGTACGA CaATACgGAT 1020 

AATAACGGgA CCGTTCGTTG TTCATTTTGT GGGAAAACaC AAGAAGAAGT AAAAAAAATT 1080 

GTTGCTGGTC CGGGTGTCTA TATTTGTAAT GAATGTATTG ATCTATGTAA AGAGATTATT 1140 

GATGAAGAAT TTTATGATGA AGCAGTCCGA GAATTAACGG ATGTGCCAAA ACCCCAAGAA 1200 

ATTTTAAATG TTTTAAATGA ATATGTTATT GGACAAGAAC GTGCAAAACG CACACTTTCT 1260 

GTTGCTGTTT ACAATCACTA TAAACGTGTG AATCAATCAG AAACAGCTGC AACTCAAGAT 1320 

GACGTTGAAT TGCAAAAAAG TAACATTTGT TTAATTGGAC CAACAGGTTC AGGTAAAACT 1380 

TTCTTGGCTC AAACGTTAGC AAAAACATTA AATGTACCTT TTGCAATCGC TGATGCGACC 1440 

AGTTTAACAG AAGCAGGTTA TGTAGGGGAA GATGTTGAAA aCATCCTTTT AAAATTACTA 1500 

CAATCTGCTG ACTATAATGT TGAACGTGCG GAAAAGGGCA TTATCTATAT TGATGAGATT 1560 
GATAAAATTG CCCGTAAGAG TGAAAATGTT TCGATTACTC GAGATGTTTC aGGTGAGGGT 1620 
GTTCAACAAG CCTTACTAAA AATTCTTGAA GGAACAGTTG cAAGTGTACC ACCACAAGGG 1680 
GGCCGCAAAC ATCCACATCA AGAATTTATC CAAATTGACA CAACGAATGT TCtCTTCCAT 1740 
CGTTGGTGGG AGCATTTTGA TGGGAATTGG AAACCATGGT TTAAAAACCC GCCTTAGGC 1799 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11266 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

CGCCAAGfTC GTCAATAATC GGTCAGATTG TTTTCAATCA GGAAAtcAcG TTTTcGCCGC 60 
TTCTTTCAGg TgAAAATTTT CGCCCAGCGT TCATAGCCTG CGCCnTTTtC CGGCGGCAAG 120 
TTTCGCCTGA ATGTCAATCA GGAGATTGAC CTTTCTGCCG GTTCGCTTCG GCGCTGTTTT 180 
TCCTGACGTC AAACGTCTGC GCTGGCTGGT AATTCTGGCG CGCAAGGCGG TCTTGGTATA 240 
ATCGTCGCCC AGCCGGTTCA CCCTTGTAGG GCGATCCCAT CCGGGCAGAC GGAATCCGGG 300 
ATTTTTGCCG TCCACAACCT CATATCCCTC AGCCCGCATC TTTGCCAGAA ATTCCTCATA 360 
ATGATTGCAC TCTGGCAGCA GCCGGTCAAT GGCTTGCTGT AACTGCTTTT TGTGgCTGAC 420 
GCCGTACTGC GCCGCCGCCA TTTCCTTTTG GGGCCTGGCT TTTTTCTTCG GATTCTCAAT 480 
CACCGAAAGG CCGTGGGCGC GGCACAGTTC ATCATTCATG CGTCGCAGGA TAAAAGCGGA 540 
GTTTTTCACG TTCTGGA.VTT TTGCGTCACA GTTCAGGTTG GTGCTGTTAA ATTCAATGTG 600 
GGTATGGATA TGCCGCTTAT CCGTGTGGGT GGCCACCACA AACTGATGCT GCCCGCCGGT 660 
AAATTTCATC GCCAATTCAC AGCCGATTTT GTGCGCCTCC TCCGGGCTGA CTTCATCGGG 720 
TTTGAATGAC TGGATAATGC GGTACATGAT AATGTCATGC TTTTCTGAmC TGCTGCGCCC 780 
GGTAAGCTGG GCGTACAGTT TTTTGCTGAT TTCAAATTCT TCGGCGGCAG TTTCCGGGCT 840 
GCACATATAG GAGGAAACCA GTTCGCCGCC GTCTGTCTTT TCTTCTTTCT GGTCATAGTC 900 
GTGGCGTTCT GCCATTGACT GCTGACGGGT GCGGTTTTTC ATAATCTCTC GTGGTAAAAT 960 
ATTAGAGATA GCCCAATGGT TCCTCCTTTC AGAAAAAAGG CTATCCATAG CCTAAATAGT 1020 
AGCATGGGTA GCCTACTTTT GCAAGCCATA TTTTGTATGC TTGATACnAA AGGGGGGCAG 1080 
GATACAGCTA TGGATACGAT TTCAGCGATT AAGACCCGGA TACTCCGTTT ATGTGCGGAG 1140 
CGCAACATTA CCATCAACAA ACTGGCTACG ATGTCGGCGC TGCCGCCATC CTCTGTCAAA 1200 
AACATCTTGT ATGGAAAGAG TACCGATCCT AAAATTTCAA CCATCAAGAA AATATGTGAC 1260 
GGTTTGGATA TGACGTTGAT AGATTTTTTC GATGCGCCCG AATTTGAAAA TTTAGArCCG 1320 
GAAATTAAAT AGCGTTACCG CGCCGCGTCC CGCTGCCGCT GTGCCGTCCG TACCGCTTTA 1380 
ACGTGGGTGC AGAGAAACTC GTTATAGTCC TTGCCCTGCT TCGGCGGGTT GGGGTAAACC 1440 
AGCTTCACCC GCCTGGAAAG TTCCGCATCG TTCTGGATTT CCTGGGTGAG CCGTTCCATG 1500 
CCCTCCAAGC CGGCCTTGTC ATTGTCGAGG CACAGACTGA CTTGGGTAAC GTGGGGGTGG 1560 
TCGTGAAGAA ACTGTATCAG GGCGCGGGGG CCGGTTCCGC CCAGGGATAA ATAATGGCTG 1620 
TCCCGTGGTT CGCCGCCGGA CAGCTTTACT AATGTGGCGA GGGAGAGCGC GTCAATAGGG 1680 
CTTTCCGCCA CCGCCAGCCG GGGGCAGTCT GCCTTGCCTG CCGGCAGACA GAAGCTGTAC 1740 
CGCTTGTCGC TTCCCTCCAC ATCCATGCGG AAGCCGTCCC GCGTCCCCCG CArGCTGGCG 1800 
GCGCGGGCGT TCCCTGCTGT GTCGCGCCCA ACAAAGACGC AgTTCTGGTA TTTGCAGCTC 1860 
TCGTATAAAA TCCCGGCTTC CATGCACACG CCAAGCAGTT CCGGGTCAAT GCCGCGATCC 1920 
TGCAAATAGC CCACAACAGC GGATGCACAT CGGCTGGCCT GGGGCAGTCG GAAGGGCTTT 1980 
GGCTGTTTTT CTTTTATGGG CGGCGCTTTT TCGGGCGGTG GCGCACAGTA GCCGCACAGG 2040 
GTTTCCACCG CGCCTACAAA GTCCATGCCG CGCACCTTGA TCAGATAATC AAGGGCCGTG 2100 
CGCCCACCGA TGCCCCGGCT GTTCCAGCAC CATTTCCCGT TGGAGATTTT TAAGCTGTCA 2160 
TGGGTTCTGG TGCAGTATTC GCGGGGGCCG CTTTTTTTCA GTTCCTGCGG CTCATAGGCC 2220 
TGCAAATAAG AGAGCAGATC CCATTCCTTT GCTTTTGCAA TCTGCTCTTT GGTTACGCCT 2280 
GCCAAGTAAT CACCTCCAAA TATGAAAAAA GGGCGGCGGG AGCATCTTTC TTTTGCCCCC 2340 
GCCGCAGTTG TTTACAGTGc TTCAAATTTC TTCAGCAGGT CGTTCACGCC CTGCCAGATT 2400 
TCAGAGTAGC CCTGCCGCAA GTCCTCCACA TCCTGCTCAT AGAGGTTGTG GGTGCTGTTG 2460 
CACsGGCGGG CGATCTGGTT GACGTTGCTG GAAACGGAAc GCAGCAATTT TACCAGTTCC 2520 
AGCACCGCGC CCATGTCAAG CTGCACGATG TAGCCGTCCA CCGCCATCTT GCGTAGGTAA 2580 
GCCCGCAGgT TGGAGGTTCC CAGCAATTCC ATTTTGCGGT CAATCGCCGC CTTTTCTTCC 2640 
GGGGAAACTT TGGTGAACAG CCCCACCGTG CGCCCCTCCT TTATCCTCAC AGCACCGCCT 2700 
CCTTTCGCGG GGTGCGCTCC ATGCCGGAAT GTACCGGCGT GGGCGGTTCA TAGTTACGGA 2760 
TTGCCTGACG GATAGAGGGT TTATAGCCAG CCGCCACAGC GGGCATCGGC ACAACCGGCG 2820 
CGGGTTTTCC GGCGTCCTCC AAATCAGGGC TTTCTTCAAC CTCCTGCTCA TCCTCCTTTT 2880 
TGTCCGGCCT GGACAGTTCC CGGTTAAGCT GGTTTAAGCG GGCGGATTTT TCCCGCAGTT 2940 
CTTCTTCTTT TGGGAATGGC CGGTCGGCTT CTTCTTTGGC GTTTTCCAGT TCGCTTTCCA 3000 
GTGTGGCAAG CCGTGCTTTT TGTGCGTCCA ACCGCTCCGT CAGGCTTTCC AGCGCATTGT 3060 
TAATACGGGT GACGTTGCCC ACCGCATCGT CAGACAGTTC GGCGGAATAG GTCAGGTTCT 3120 
GCTTCATGGT GACGCGGAAT TTGCTGCCGT CACACCTTAG CTGCATGGGA AAGCCGCGAA 3180 
ACTCGCCCAA ATCCACGGTT TTTTCCGGGT CGTCCATCAG CATACACGCT TTAATCACCG 3240 
cTTCGCCCGC CGCCTTGCGC TCCGTGTAGG TCTGCCCCAT GACAGTCATG GAAAACGCAT 3300 
CGTCCTTGAC CGGGTGGGCC TGCACGGTTG GCAAATCTGC CGTCAGCGCT TCGATTAACA 3360 
GCTTGGTTTC CGCGATTTTC TGCGGGTAGT ATTTCAGCGC TTTGTCCTGC ATTTCGTACT 3420 
GTTGGGAGGT GTGGTTGGCT TTGAGCATTT TCAGCTTGGA AACCTGCACA TCCAAATCCA 3480 
TTTTTTCGCG GATACGGTCG TCGCCGGTGG CAAGGGCCTT GACCTCAGCA TAGGAAAGGG 3540 
CAGTGGCGTC CACATCCTCG GCAGAGCGGG CAGGGCTTTT AGAGGTCATG ATCTGACCGA 3600 
TGAATTTCTG TTTGTTTTCC ACCAGCCCCC AGTTGTAAGC GTCGAATGTA CCCTTGGTGA 3660 
CATACTTGAA CATTTTCACC GACGCATTCA TGTTGCCCTG GCGCAGAGAT CGGCCCGCCC 3720 
GCTGTTCCAA ATCGGCAGGA CGCCAGGGGC AATCCAGGTC ATGGGACGCG ACAATGCGGT 3780 
TCTGCACATT CGTGCCTGCG CCCATTTTTG CCGTGGAGCC GATTAAGACA CGCACCTGCC 3840 
CCCGCCGCAC CTTGGCGAAC AGTTCTGCCT TTTGGGTTTC CGTGTTGGCG TCGTGGATAA 3900 
AGGCGATTTC CGCTTCGGGG ATGCCTTTGG CAATCAGCTT TTCCTTGATG TCATCATAGA 3960 
CATTGAATGA GCCGTCCCCC TTGGGTGTGC TGAGGTCGCA AAACACTAAT TGTGTTCCTA 4020 
AAATCTCCGT GCTGGCCTGC CATTCCTGCA CGATATTTTT CACGCAGGCG TTGACCTTGC 4080 
TGTCCGGGTC GTCGGGCAGG AGCGGATTTT GCAGCCGCTG ATCCAGCGCC AgCTTGCGCC 4140 
CGTCCGAAGT AATCCGCAGC ATATTGTCGA TATGGGGGTC AACCTGCCCG CCGCGCACAC 4200 
TTTCCGCACG CTCGCCAAGT TCCTCCACCA TTTCTTTCTG GAATCCGCTG GGTTCCGTGG 4260 
TGACGGTGAT ATATTCCGCC TTTGGCACGG GGAGATTCAG CATATCGGCG GTCTGGATGT 4320 
CGGCGGCTTC TTTCCAAATA GAAATAAGTT CCGGCAGGTT ATAGAATTTC GCAAACCGGG 4380 
TCTTGGAGCG GAAGCCGCTG CCCTCCGGCT TTAATTCCAT CGCCGTGACC TTTTCGCCAA 4440 
AGTCTGCCGC CCATGCGTCG AAATGGCGGT GGCCGTTCTG CTCCAGGGTG TCAAACTGGA 4500 
GGTAGCGCAT CATGGTATAC AATTCTACCA TTGAGTTACT CACCGGGGTT CCGGTAGCGA 4560 
ACACGATGCC GCGCCCGCCG GTGATTTCGT CCATGTAGCG GCACTTGGCA AACATATCGC 4620 
TGGATTTCTG CGCGTCCGTC TGGGCAATCC CCGCCACATT CCGCATTTTT GTGTGTAAAA 4680 
ACAAGTTCTT AAAGCCGTGC GCCTCGTCCA CAAACAGACG GTCAACGCCC AATTCCTCAA 4740 
AGGTAACAAC ACTGTCTTTT TTCTTTCCGT CCGCCAGCTT TTGCAGCTTG GCTTCCAGGT 4800 
TTTTCTTTGT TTTCTCCATC TGCTTGATGG TGTAGCGCTC GCCGTCCTCC GCCTTTGCTT 4860 
CGGCAATGGC GGTGACGATT TCATCAATCT GCCCCTTAAT GACGGCTTTC TGCCGTTCCG 4920 
GGGAGAGGGG GATTTTTTCA AACTGTGAGT GTCCGATAAT TATCGCGTCA TAGTCGCCGG 4980 
TGGCGATCCG GGCGCAGAAT TTCTTGCGGC GGGCAGGCTC AAAGTCCCGT TTCGTGGCAA 5040 
CCAGCACCTT TGCGCCGGGG TACAGGCGCA AAAAATCGCC GCCCCATTGT TCCGTCAGGT 5100 
GGTTGGGAAC GACGAATAGG GATTTCTGGT TCAGACCCAG CCGCTTGCCC TCCATTGCGG 5160 
CGGCAATCAT TTCAAAGGTC TTGCCCGCGC CGACACAGTG GGCAAGCAGC GAGTTTTTCC 5220 
CATAGAGCAT CCGCGCCACG GCGTTCTCTT GGTGGGTACG CAGGGAGATT TCCGGGTTCA 5280 
TCCCAGCAAA ACGGATATGG GAACCGTCAT ATTCGCGTGG ACGGATGCAG TTGAATTTCT 5340 
CGTTGTACCG CCTGCAAAGG GTTTCCCGAC GTTCCGGGTC TTTGAAAATC CAGTCCTTGA 5400 
ACGTTTCGCA CATGGCTTCC TGCTTCTGCT GGGCAATGGC GGTCTGCTTT TCGTTGAGGA 5460 
CGCGCACCTC CTTTCCGTCC ACTTCTGTCT TATCAAATAT CCGTACATCC CGCTGGTTTA 5520 
AGGTCTGCTC AAAGATTTCA TAAGCGTTGA TGCGCTTGGT GCCGTAAGTA GCATAGACGC 5580 
GCACGTTATC CCGGTTGTCT GTGCTTTTTC CCTGCACGTT CCATTCGCCG GTGGTAGGGG 5640 
AATATTGCAG CTTGATTTTT CTGTCCTGTA AATACACAGG TGTATGCAGC AGTTCAAACA 5700 
TGAATTGCTG GTAATACTCC GGGTCAATCC AAGACGCGCC GATACGGACG CTGATTTCAG 5760 
ATGCCGTCAG GTCTTTG 3GC TGCACCTGTT CCAGCTTTTC GGCATTGATG GCAAACTGCG 5820 
GGTTTTCCTC AGCGGCGGCG CGGGCAATCG CTAACTTGGC CCGGACGTTG CCGGAAAGGT 5880 
ATTCGTCCGC CGTCTGCCAG CCCTGATGCC AGTGTGCGCC GCCGTCCTCC AGGTCAAAGG 5940 
GGCCGGTGGC GGGGTTTTTG AAGATGATGC CTTTTAAGTC CTCCACAATC TGCGGGATTT 6000 
TTTCAGGGCC GCCCATGAGG GATGCCATAT AGCCCAAATC CACGCAGGCT TTTTCACCGA 6060 
TAGATACCGC CAGCGCCTCC ACAGCCGTGT CCACGCTGGT AACGGCGCGG TGGGTCTGGA 6120 
TGGTGCGCCT GGTGAACATA TCCGCTTTAC GCTCCAGCTT CCCCTCCTCG TTGATGATTT 6180 
CAAGAGAGCA GAGCAGCGGA TAGGAGGAAT CCTGCTCAAA GGCCAGCTTG TTGCCAGTGC 6240 
TGTTAAGAAG CCCGTACTTT GCGCTGAACG CATCATAGAG CCGGTTTAAT TTTGCCTGCT 6300 
CGGACTGGAT TTCTGTGTCG TCTGCGCCGG CAAGCTGTAA GTCGATCAGG CGGCGGGCGC 6360 
TGTCCCGGAT GCCGATCATG CCCCTGACGC GCTCCGTGGG GGTTTTGCCA AGGACGGCCT 6420 
GGGTCATGCG GGAGTTTTCC CGGAAGTACA GCTTATCATT GGATAGGGTG TAACTGAAAT 6480 
TGCGTACAGT GGGGTCGGCG GGGATGCTTT CCAGTTCCTC GCCGTCTTGC CCGGAATCCA 6540 
CTTCTAACAG TTCCCGGTCA GGCGGTGCGA TATGCCGGAT AGCTTCTGCA AGCTGCCCGG 6600 
AAAGGTCTGC GCCGGGGATG GGCTGGCAGG CGGTTTCAGT TTCATTGCCG TACATACTGC 6660 
GTCCCATTGT CATTGTCCCC AGCACCATTT CCGGGTGGTC TAAAAAGTAG CGGTTCAGGG 6720 
GGATGCCGTC CTCTGTCTGC CCGACCTCCA CCCATTCCGG GAGTTTTTCC GGGATGCGGT 6780 
CGCGCTTCTG GAAAAAGAGG ATGTCCGTTG TGACTTCGGT TCCGGCGTTG GCCTTGAACG 6840 
CATTGCTGGG CAGGCGGACT GCACCCAATA AATCGGCTTT TTGTGCCAGC GCCTCCCGCG 6900 
CCTTGCTGTT TTTCTTGTCC ATTGTGCCTT TGGTGGTGAT AAAGGCCACA ATGCCGCCGG 6960 
GACGCACCTT GTCCAGCGTT TTGGTGATGA AATAGTCGTG AATCATCAGG TTTTGACGGT 7020 
CGTACCGCTT GTCATGCACC TGATAGCTGC CAAAGGGGAC GTTGCCCACC GCAAGGTCAA 7080 
AGAAATCGTC CGGGTGGTCG GTGCGCTCAA ATCCGTCTAC CGTTATATCC GCCTTTTGGT 7140 
ATAGCTGTCT GGCGATGCGC CCGGTCAGGT CGTCCAGTTC CACACCATAT AACTTTGCTG 7200 
CCGCCAGTTT TTCGGGCAGC ATCCCGAAAA AGTTCCCGAT GCCCATAGAC GGCTCCAAAA 7260 
CCGTCCCCGG CGTAAAGTCC ATCTGTTCCA CCGCATTGTA GATTGCCTGG ATAACGGTGG 7320 
GGCTGGTGTA GTGGGCGTTG AGAACCGTGC TTTGTGCAGA CTGGTACTCC CCAGGGGTGA 7380 
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GCAGTTCTTT CAGTTCGGCG TATTCCTTTG CCCATTTTGG GTCGTTTGGC TCAAAGGCGT 74 4 0 

GGGAAATGCC GCCCCAGCCC ACATAACGGG ATAAAATGGC CTGTTCTTCT GGTGTGGCAA 7500 

GGCGGTTTTC GGCTTCAATC TGTTTCAGGG TGCGGATGGC TTCCACGTTG TACTGGTACT 7560 

TGGTCTTTTC GCCGCCTGCG CCTAAATTTT CATCGGTGAT GCGGAAGTTG TGGCGTTCCG 7620 

GCTCCAGACG GAGCGGCTGT AAAATAATCT CGCTGGGGAA TACCCGGCTG TCCACTTCCT 7 680 

CCTGGGCGCG GCGGCGGGTC TGTCTTTGCG CCGGGGTTTC CGTAATCTGC CCGCCGTCAA 774 0 

TCTCCACCAG TTCCGGCGGT GCTTCCTGTT CTGGAGAAAC GTCCTTCTCT GGAAAAATTT .7800 

CCGGCTCTGT TGGCTCCACG CCCTCTCTGG CACCTGCCAT TTCTGACCAC CTGATCCGCT 78 60 

GGTTCAGTTC GGCTTTTAAC CGTTCTTCAA TATCCGGTTC CACGGAAACC AGCATATCAT 7 920 

AGTCAAGGAA AGCAAGTTCC ATATCAATCA GTTCCCGCAG GCTGCCATAC TGCCGTGTTT 7980 

CCACGCATTT CCCGTCTACG GACTGGCTGA TGGAGAAATC CAGCAGGTTT ACGTCCACCT 8040 

GGACTTCTAA ACCGCCGTCC TCGGTCGTGC CATAGGCAAT CCCGATATGC TCCAAATCGG 8100 

AAAAATCCAC ATCATCGCTG CCATATTCGG CGTAGGCAAA CTGTTCAATT AGCTGTTTTG .8160 

CCGTATCTAA ATCGGAGGTT TCGGGGCGTT CCGGCTCCTG CATGGGTTCC GCTGTAAAAT 8220 

GCTCCTGCTG GACTTCCTCT ACAAACTGCC GCACAAAGGG GATGCTTTCA CTGCGGAAAA 8280 

TGGGATACCA GCCCCGCAGT TCCATATCTT GCAGGCTGAC GGTTCCGGCG TTATAGTCCA 8340 

CGCTGTCAAT CTTCATCAGC CGCCCGTCAA TGGTCAGTTC CATGCCGACA GGGATATAAT 8400 

CCGCCGCGCT GCGTTCCTTG AT GAT TT CAT AGGGGGCATC GCCGCCGCGT TCCGTGTCAG 84 60 

GCCGCGCCAG CACCACGCTT TTTCCATGTT CCAGCAGCTT TTTTAAGACG GACTGCCAGC 8520 

CATTGGCTCC GGTGATGGAC ACCATCCCCA GGTCGGGAAT ATCGCGGGTG ATGGTTTTCG 8580 

TTCCCAGCGC AGGGGCGGCT TCCTCTGCGT CTGTGCCGTA AAACAGCATG GTTTTCCCTA 8 64 0 

CCCGGACGCC TACCAGTTTG TCAGGGTATT GGGCTTTCAG GTTCAGGTAT TCCTCCACGT 8700 

TGGGCGTAAG GTTCAGACCG TCTGCGGCGG CTGTTTTTGG GGCTTCCTGG GTCTGCGGCG 87 60 

AACCTGTGCC GGTGGGTTCT GCGCCCGCCT TTTGCGCGGG GGATGGTACA ATATTGCTGT 8820 
CCCCGGTTTC CGGCTCTGGA TTGACCGCAG CGGGCGCGTC CTGCACCTTT TCCGGCTCTG 8880 
GCTCCGAGTT CCCaGGCTGC ACCCGTTCCG GCTCCTGGGG CGGGGTGGAA AATACTGGCG 894 0 

TGTCCTGGGT GGCAGAAATG GACATTTCAG TGGCAGCGGC AACCTTTTGA ATTTTTTCCC 9000 
TGATTTCCTC CGGCATATCC GCGTCGTAAA AAGTCACGGT ACGGTCAGGG GCGATATGGG 9060 
CAACTGTTTT GTAGTCGCCG TGTTCTTCCT CCAGACGGTT CCAGACGGTC AGGCCGTTCC 9120 
CCATATGGCC ATAGCCTAAA TCATAGCGGT CAGCGGGGCT TTCGGCCATT ATGGGCGCAG 9180 
CGCCTTCTGG TATGTTCGGC GTTTCCTGCG GTGGTTCCGG CTTTGATGCG ATAGGGACGG 9240

CCGTTTCAGA TGCGCCGGTT TCTGCCTGCG AAAGTTCCGG CTCCGGTTCA GGAGTTGCGG 9300 
CTGGCTGTTC TGGTTCGGAC GAAAGCCCGA ACATATCTAA AGTAAGCTGC CCGTCTGCAT 9360 
CCTCCCGTGG GGCAGGCTGT TTCTTGCGGG AAAGGGCTTT TGCCGCAGCA GCAAGGCCGG 9420

TTGCAGGTTC AGCCGTTTGT CTGGCGGAAT CCTGCCTTTC CTTCTGTTTC TGTTTCTGCT 9480

TTTTCTGCCG TTCCTCCTGC AAGACGGCAT CAGGCAGAGG GGTGAATAAG GAGTATTCGC 9540

CCCGCAGgTA TGCGCCCATT TTTTGAAGCG TCGGCTCCAT CAGGGAATAG GCGCGCTCAT 9600

CCGGGGATAC CATTGCCAGG GCTTTCACCA TTTTTTCATA GTGCGCCGCG CATTGTTCCG 9660

GGTCATCTAA CAGCGGGCGG AATACCTTTT TTGCCGCGTC AACATCCCAG TCGTGGGGGA 9720

TTGTCTTGCT GGCATCGTTG GGGTCGTAGT ATTGGAAGGT ATATAGACTC CGAGCAAGCT 9780

GCATTTTTTC ATATTCCGGC AGATATGCCT TTTCCcTGCG GTTCAGATAC CGATCCGTTT 9840

TAATCAGTCC GGCAATCCGC TTTTGTACCT GCTTCCAGTT CAGCGTAACC GTATCATAAT 9900

TGCCACCGGA AAAATCGTCG GCGCGGGACA GCTTGATGCC TTTGCCGTCG TGCCATTCAT 9960

CATAGCCAAT GTAGCCATGC CCGCCAATGC CGTATTCGTT TTTCAGAAAT TCGATGCACT 10020

CTTTGGCATC ATGCCCCTGC ATAAAGTAGG CGTAAATACG GAATTTCCCC TCGGAAACGC 10080

CAGAGCCGCC AGTGAACATC CGGGTAATCT CGTCCTCGGT GACAAAAGCT CCACGGGCAG 10140

GCGCAAAGCC CTCCACCCCA TGAAATGCCG TCACCGGCTT ATCCATATCG GTCAGGCGGG 10200

TAAACAGTTC GGAGGGCGCG GGCCGCTGGC GGAAGCGCAG CAGATTCGGG TTCJCCTGAT 10260

GGGAGATGGC AAACTGCCGG ACTTCCTGTG CAATTTTGGC TCGTTCTGCC GGGTCTTTCA 10320

GGAGCGCGGC AATTTTAGCC GTGCTGTCCG GGAAACCACC GTACAGCGCG TCAATGGTGG 10380

GCAGCAAGCC CTTTTCCTTT GCTTCCTCGC TGAAATTCTG CCGCAGAAAC CACAGTTTTT 10440

CCGACAATTC CCGAAATTCA TTGTCACGGG CCGCGTCGAT TTTGTCCTGG CTGGCAAACT 10500

GGCCGCTGTT CAAAAGCTGC CGGATCTGTG CCGCCGCCTG CTTCCAGGTG ACAAGGGAGC 10560

CGCCCCCGGC TGAGCGCCCC GGCGCGATGT GGATACCGCT ATGGCTGAAC CACAGCGAAT 10620

ACTCATGCCC GGCAATTTTA AGCCCCTTGC CGCCGCTGCC GTATTCCTTT TGCAGAAAAT 10680

CGGCGGCAGA CTGCTCCGTA GGGTCTTTCT GGAAAAAAGC GACGATACGC AGGATGCTGT 10740

CCGGCTCGTT GCCGCCGCTT GTGAGGGCGC GGCCAATGAC CGCATCCGTG AcCTTGCCGG 10800

AGGAAAcGGC GGrGCGCAGT TCCTGCTCGG TTTTCTGTGC CTGAGCAATA GTTTCAATCT 10860

GtGTTCCACA GTGGGGAATA AAGAAAACTG TGTAAATGCC GGCTTTTCAG AcGCAGAAAm 10920

GcGGACTGTT CGCCCGCCGC TTCCTGCTGG TTCTCCTGAT TTACTTGTAA ACGATCTCGT 10980

TCAGTACGGT TTCCTCCGCC TGGGCGAGAA ACAGGTTGAC TGCCTGCACC CATCCCATCT 11040

GGTCGCGGGC TTTCATCCCC TCGTmCACGC CGTTCTGTTT CTTTAATTCC GCCACGATCC 11100

GGTCTGCCTG TTCCTGCGCn TGCCGGTCGA TCTCCCGCAG GTGGGGTTCC AGCCGCCCGG 11160

TCAGCAGCAT CGAGGTGTAT GTCCCCTTGT GGTGTTCCTG AAGAAAGGTC TGCCGCAGCC 11220

TGCCGTACTT GCCCAGCGGC ATTtCCTCCA TCTCGTCCTC CCCGTC 11266



(2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



AATTTTAGTG 


TAGTAAAGAA 


GTACGTGAAT 


TTGATTTTGG 


ATATTGTATT 


AGGGATTTTA 


60 


GGAGTTTTCT 


TATTAGTATT 


TATGGCTAGT 


TATTTAGTTG 


ACCTCGCTAC 


ATATCTCATG 


120 


AAGCCAATGA 


CGCCGGATAA 


TTTTTCAGTT 


ATCATGCAAG 


AAATTATTTC 


ATTCTTTATG 


180 


TTATTTGAGT 


TT ATT AT GAT 


GATTCTTCGC 


TATATTCAAG 


AAGGACACCA 


TATTCCGATc 


240 


GTTATTgATT 


TTAATTgTTa 


ACCGCAATTT 


TACGTCAATT 


AATGGkTGCG 


CATGGCGATG 


*3 a n 

300 


GCyTGCAGAC 


TTwATkGTtA 


wCTTTAwCAA 


TTCTTTTACT 


AGTGGTTGTG 


CTATTTATTC 


o c n 

360 


TCGGGTTAAA 


TGGCAGCAAA 


TTTTAtACGC 


TAGGAAAAGC GAAAGTAGaA gAAGaTCACG 


4 20 


AGCaAGATTT 


TCayCATTAA 


TCgAACaTAA 


TPa AAHAPCG 


TCCAACGTGA 


GAATTTTTCT 


A O A 

480 


CTAACGTGGG 


GCGGTCTTTT 


TAATGCATAG 


TTAAATAGTT 

X X r\rv\ X t\\3 X x 


AACTCAATCA 


GTTCAATTGT 


54 0 


TGGTAAGACA 


CCAAAGCGTG 


CGCTGATTTT 


r\\3 1 OVJ X X \*\*\J 


ATTCCTGGGT 


GAACAATTAG 


c t\ r\ 

600 


CGGCGTTTCA 


TCTTCTAAAT 


GATAAAGCCC 




TTTTTAGCAA 

X X X X X t iw w * * 


GCTGATTTTT 


ooU 


GTAAGTGAAA 


AAAGGCAAGC 


GAATCTGTCC 


GCCGTGACTA 


rr\ r+ m fn ^* tv 

TGTCCTGCTA 


AAA 1 tjAbb 1 o 


1 O f\ 


AATGTTTTTG TTGACTGCTG AGTCTGCGAC ATCTGGCTCA TGGGTTAGTA GCAGAGTGGG 780

CGTTCCAGTA GGGCGGTTGA CTAAAGCTGC AGCTAAGTCG GTTTGTCCAT ATAACGAATC 840

TTCTAAACCA GCGACGTACA AGGAGATACG GTGTGAGACA GGCAGTAATT TGCCCGTGtT 900

TTTTaGTACG GTAAAGTCGG CAGCTTCTAA GATTGtTTCG TAAAAATTGG TATCTAAGCC 960

GCTATACTCG TGGTTGCCAT AAATAGCGAA TTTGCCATAA GGGGCGTTAA TTTTACGGAA 1020

AAGTTGACTC ACTTCATCTT GAATTTCGAT ACCGAAACGG GCATAATCAT CAAATAAGTC 1080

TCCTGTAAAA ACGACTATGT CTGGTTGTAA TGTATTTGTT TTTTCAACAA TCTGTTTTAA 1140

GGCCTTCACA GAATAATTCT TTTTTAAATG AATATCAGAG AGTTGAACAA TCCGTAAGTT 1200

TTCCCTGATT CTGTTACTTt AATTTGATTT TTGTGtATGA CTAAGCATTT AGGTTCAATA 1260

AAAAAACTAT AGCTTAAAAT GCCGATAATA ATGAAAAATA GTGCCAGAAA TAAATAAACG 1320

TTCATAATTT CCTCCTTGAA GTCGCTAAAG TAGTTTACTA CAAAAGTGTT AAGCAAAAAA 1380

TAAGGAAAGG TTAAATTTTC AGTAAACTTT CTCGATGAAG TTTTTTGGTG ATTCATGCTT 1440

TTCTGAAGTA AATTTGCTAT AATGAAAAAT GAGAAAAAAG AAAGAAGGTC GAACCAATGA 1500

TGGCAAGAGT CGAAAAGCTA CGGGAGTTAA TGAAAGAAAA TAATTTACAA GGATTTTTAG 1560

TGACAAGTCC GTATAATTTA CGTTATTTAA CCAATTTTAC TGGGACmACA GGCTTGGCAA 1620

TGATTACATT AGATAAAGCT TTTTTTGTAA CGGATTTTCG TTATaCGGAA CAAGCTGCGG 1680
AACAAGCAAa CTGGTTTTAC TATcGTTAAA AACACAGGAC ATaTTTTTGa CGAAGTAGCA 1740
GATTTAGCTG AACGTTTGCA ATTAGATAAT TTAGCTTTTG AAGAAACGCA AGTTAGTTTC 1800
GCTGATTATA GCTTATTGGA AGAAATTTTA CCATGTGAAT TAGTTCCTGT AATGGGTTTG 1860
ATTGAAGAAT TAAGAGAAGT GAAAGATGAA GAAGAAGTAG CGATTATTGA AAAAGCGTGT 1920
GCCATCGCGG ATCAAGGATT TGCCTTTGTC TTAGAGATGA TTAAACCTGG AATGACAGAG 1980
ATTGAAGTGG CTAATCAATT AGATTTCTTC ATGCGTTCAA AAGGAGCCAG TGGTGTTTCT 2040
TTTGAGACAA TTGTGGCCAG CGGTCTACGT TCAGCGATGC CACATGGCGT TGCTAGCCaT 2100
AAAGTCATCG AAAAAGGTGA GTTAATCACT TTAGATTTCG GTTGTTATTA TGAAGGCTAT 2160
GTTTCTGATA TGACACGGAC ATTCGCTATT GGTAGTATTC AGCCAAAACT AAAAGAAATT 2220
TATGACATTG TTTTAGAAGC ACAATTAAAA GTGTTAGCAG AAGCTAAACC TGGGTTGACG 2280
GGCATTCAAT TAGATGCGAT TGCACGGGAT CATATTGCTT CCTATGGCTA TGGCGATGCG 2340
TTTGGTCACA GCACTGGTCA TGGCATTGGG TTAGAAATTC ATGAAGGACC AAATGTTTCA 2400
TTCCGAGCAG ATAAACAGTT TGTACCAGGG AATGTGATTA CGGACGAACC TGGAATTTAT 2460
TTGCCAGGAA TTGGTGGCGT GCGTATTGAG GACGATTTAT TAATTACTGC TGAAGGTAAT 2520
CGTGTATTAA CACATGCGCC AAAAGAATTA ATCATTTTAT AAAGGTTGGG AGAACATGGA 2580
TATCCAAGAG TATCAGCGGT TTATTAGTGC GTTTTATAAA AAACGTAACT GGTATCAATA 2640
TGATCCGTTT ATTCGTTCCA ACTTTTTAAC AGAAGAAGTC GGCGAAGTTT GTCGGGCCAT 2700
TCGGACAATT GAAATTGGCC GTGATCGATC GGATGAAACA GTTTTGGCGG ATGAAGAGAA 2760
TCTGGAAAAT TTAACGGAAG AGTTAGGCGA TGTGTTGGAT AATTTATTTA TTTTAGCCGA 2820
TAAATATGAC ATCTCTTTAG AAATGATTAT GCAAAAGCAT CGGACGAAAC TAATGGAACG 2880
TTATCCTGAA GAATAAAGAT GTAAAAATAA GAAAGCCAAA ATTACTTGGT CTTTCTTATT 2940
TTTACGTCTA AAGGCTTGAA TTAGAAATTT ATGACAAGAA ATTATCTGTT CTTTTAGTCA 3000
GAACTTAGCT TTTCTGGTAG CGAAATAGGC TAATTAATGA TAAACTAAGA ACGAGTATTG 3060
ATAAGCAAAA TTTGGACGTT CTTTCACAAC TAACTGAAGA AACCAGTGtT GTTTTAATTT 3120
TtAAGAATCA TCCGTTGCTT AAAAGAGAAT CCTTAAGAGA CTTAGTGATT CATCAAATAA 3180
ATCGTTTATT TAACGATTTT GTGCGTAATA GATAAGGAGG ACAACCAAAT GTCAGAAGAA 3240
AAAAATCTTG TAATCAATGC AAACGATTCT TTAGGAGAAA TCGTGATCGC TCCTGAAGTC 3300
ATTGAAGTAA TTATCGGAAT CGCTGCTTCA AAAGTAGAGG GCGTTTATGG GATGCGCGGC 3360
ACGTTTGCCA ATAATGTGAA TGAATTATTA GGCCGTGCAG CACACGGCAA AGGTGTATAT 3420
TTACGAACAG AAGAAGAAGG CTTAAAAGTC GATATCTATT GTTACTTGAA CTACGGTGTT 3480
TCTGTACCAA AAGTTGCGTT AGAAATGCAA GACCGTGTGA AACAACAAGT CTTATTTATG 3540
ACAGATATTG ATTTAGTTGA AGTGAATATT CATGTTGTAG CGGTAGTACC TGAAAAAATC 3600
CCTCAACCAG ACTTAAATGA ATTATTCCCA GAAGATGAAG ATGGAGAAGA GAATGAGTAA 3660






AACAGAATTA ACCCGTCACG AGATTCGTGA AAAAGCGCTC CAAGCCCTTT TTCCTTTAGA 3720

TTTTAATGCT GATTTAACCA AAGAAGATGC CATTGATTAT GCATTGGCTT ATGATAATCG 3780

GGAAATCGTC AGTGAAGATG GCGAAGACCT AGTCCCTACG TATTTAGATT TACTTGTCGG 3840

CGGTGTGTGT AGCCGCAAAG CAGAACTAGA CGAAGTGATT ACGAACCATT TAGGTAATAA 3900

CTGGTCTATG CAACGCTTAG CAAAAATAGA TATTGTTATT TTACGTTTAG CTATTTTTGA 3960

AATGCTATAC GTTTCAGATG TGCCAAACAT TGTGGCATTA AATGAAGCCG TTGAGTTAAG 4020

TAAAAAATAC AGCGACGACC GTTCACGTAA ATTTGTTAAT GGTGTGTTAT CAAATGTAAT 4080

GAAAGAAATT GATTCAGAGG CGTAAGCTCT GAATCTTTTT TTACTATCTA ACTGGTGTTA 4140

AGATACTGGG TTATATGCTA AAATAAATAC TATTAAAGAA GTAAAGAAAC AATAGAGGGA 4200

GCGAAGGAAT GAGTACAGTA ATTAACGGTC GTGAATTAGC AGATCAAATG CAAGCAGAGA 4260

TTCAAAAAGA TGTAGAAAAG ATGACACAAC AAGGCATCCA ACCAGGATTA GTTGTTTTAT 4320

TAGTTGGGGA AAATCCTGCC AGCCAAACGT ATGTGAGAAA TAAAGAACGT GCAGCAGCCA 4380

AAATTGGCAT TCTGTCAAAG GTCGAAAAAC TGCCAGAAAC TATTTCAGAA GAAGAATTAT 4440

TGGCTGAGAT TGACAAATAT AATCAAGATT CACGCTTTCA TGGCATTCTT GTACAACTAC 4500

CTTTGCCAAA ACATATTGAT GAAGAAAAGA TTTTATTAGC GATTGATCCC AAAAAAGATG 4560

TAGATGGGTT CCATCCAATG AATTTAGGCC GTTTGTTTGT AGGCAAACCT GAAATGATTC 4620

CTTGCACGCC GTATGGAATT ATGAAGATGT TTGAAGCTTA TGATATTGAT TTAACAGGTA 4680

AACGTGCGGT GGTTATTGGT CGAAGCAATA TTGTTGGCAA GCCAATGGCA CAATTATTGT 4740

TAATGAAAAA TGCGACCGTG ACGATTGCCC ATTCTAAAAC TGAACATTTA GCAGAAGTTG 4800

CAAAAGAAGC GGATATTTTA GTTGTAGCAA TTGGACGCGG gcattttgtc accaaagagt 4860

TCGTGAAACC AGGTGCGGTA GTGATTGATG TGGGCATGAA TCGCAATCAA GAAGGAAAAC 4920

TGATTGGTGA TGTAGCATTC GATGAAGTTT CTGAAATTGC AAGCTATATC ACGCCGGTTC 4980

CCAAAGGTGT TGGCCCAATG ACGATTACCA TGTTAATGTA TCAAACAGTC GAGGCAGCAA 5040

AAAAACAAAA GTAAGGTGAA AAAATGACGC AACAGTATTT AACAGTAACA GCGTTAACGA 5100

aGTATTTAAA ACGTAAATTT GATGCAGACC CTTACTTAGG ACGTGTCTAT TTAACAGGTG 5160

AAATTTCGAA CTTTCGCTTC CGAGCGAACG CGCATCAGTA CTTCAGTTTA AAAGATGATC 5220

ATGCGAAGAT TTCTGCAATT ATGTTTAAAT CAGCCTTTCA AAAATTGAAG TTTCAACCAA 5280

AAGAAGGCAT GAAAGTCATG GTGGTGGGGC GTATTTCGCT TTATGAAAAC AGCGGCTCCT 5340

ATCAAATTTA TATTGAACAT ATGGAACCAG ATGGTGTAGG TGCACTTTAT CAAGCATTGG 5400

CAGAATTAAG AGAAAAACTA GGAAAAGAAG GACTTTTTGA AGGGCCTAAA CAACAGTTGC 5460

CTCGTTATCC CAAACGAATC GCTGTGGTTA CAAGTCCTAG TGGTGCTGTT ATTCGGGATA 5520

TCATTACGAC AGTCAAAAGA CGGTATCCGA TTGCACAACT GGTCCTTTTT CCAACCTTGG 5580

TCCAAGGCGA GCAGGCGGCG GATGACATTG TGCGGAACAT TCAGCGAGCC GATGCGCAAA 5640
GCGATTTTGA TACGATGATC ATCGGCCGTG GCGGTGGTTC AATCGAGGAT TTATGGCCAT 5700
TTAATGAAGA AAAAGTGGCT CGAGCCATTC ATGCTGCCAC CACACCGATT ATTTCATCGG 5760
TTGGTCACGA AACAGATGTG ACAATTGCCG ATATGGTAGC AGATGTTCGT GCAGCAACGC 5820
CAACGGCAGC AGCGGAACTA GCGGTACCAG TTTTAAATGA AGAACTATTA AGAATTAGTG 5880
AACGGCGTAG TCGTTTAGAA CAAAGCTTTT TATATCTCTT ACAACAACGA ACAGAACGTT 5940
TCCAACGACT ACAAAATTCT TATGTGTTTA AGCAACCTGA GCGACTTTAT GAAGGTCAAA 6000
CGATTAAATT AGATCGAATG ACGCAACGTC TTTTCCAAGC AATGACAACA ATTCATCACC 6060
AAAAACAACG TCAAGCCCAA GGAATTATCG CTCAATTGCA ACAACAGACA CCAAAGGGAC 6120
AATTAAGAGA AAGTCAGCAA CAATTGGCAT TTTTACAACG AAACTTGCAA ACACAAATGA 6180
CACAATTGTT CCTAAACAAA CAAAAGCAAT TCACGTCTGC TGTTCAACAA CTAGACTTAC 6240
TCAGCCCACT CAAAATTATG GGTCGAGGAT ATAGTTATAC TACAAAAGAG GACCGTGTCG 6300
TTAAAACTGT TACCGAGTTA CAGCCAGCAG ATCAATTAAC GATTCATTAT GCTGACGGTA 6360
CAGTTCAAGC AAACGTCGAA ACAATAACAG CGAAAAAAGA GGAGTTTTAA CATGCCAGCA 6420
AAAGAAAAAA CGTTTGAAGA ATCATTAAAT GCCTTAGAAG AAATCGTTCA GCGCTTAGAA 6480
CGAGGAGATG TTCCTTTGGA AGAAGCCTTA GCTGCCTTTC AAGAGGGCAT GGCATTAAGT 6540
AAACAATGTC AAGATACATT GGAAAAAGCG GAAAAAACTT TAACGAAAAT GATGACAGAA 6600
AACAATGAAG AAATTGTTTT TGAAGAGAGC GAGGAAGCGT AATGACGAAT TTTAGTCAAC 6660
AGCATTTACC GTTGGTTGAA AAAGTCATGG TCGATTTTAT CGCAGAATAT ACTGAAAATG 6720
AGCGTTTGAA AGAAGCAATG CTCTACTCAA TACACGCAGG TGGCAAACGA TTACGCCCGC 6780
TATTAGTGTT AACAACAGTG GCCGCTTTTC AAAAAGAGAT GGAAACACAA GACTATCaAG 6840
TGGCTGCCTC TTTAGAGATG ATTCATACGT ATTCATTAAT TCATGATGAT TTACCAGCAA 6900
TGGACGATGA TGATTTACGT CGTGGCAAAC CAACCAATCA TAAAGTGTTT GGTGAAGCGA 6960
CTGCCATTTT AGCAGGGGAC GGCTTATTAA CAGGTGCATT TCAGTTGCTT TCTTTGAGTC 7020
AATTAGGCTT AAGTGAAAAA GTTTTACTGA TGCAACAACT GGCAAAGGCA GCGGGGAATC 7080
AAGGCATGGT TTCCGGCCAA ATGGGTGATA TTGAAGGAGA AAAAGTCAGC TTGACTTTAG 7140
AAGAGTTAGC GGCTGTTCAT GAAAAGAAAA CCGGAGCGCT AATTGAATTC GCCTTAATTG 7200
CTGGGGGCGT ATTAGCCAAT CAAACAGAAG AAGTCATTGG TTTACTGACA CAGTTTGCGC 7260
ATCACTATGG CTTGGCTTTT CAAATTCGTG ATGACTTATT AGATGCCACA AGTACAGAAG 7320
CAGATTTAGG TAAAAAAGTA GGCCGAGATG AAGCGTTGAA TAAAAGTACG TATCCAGCTC 7380
TTCTAGGGAT TGCTGGCGCG AAAGATGCGC TAACACATCA ATTAGCAGAC AAGCTGTGAC 7440
CGTCTCCGGG AGCTGCATGT GTCAGAGGTT TTCACCGTCA TCACCGAAAC GCGCGAGACG 7500
AAAGGGCCTC GTGATACGCC TATTTTTA 7528
(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

TTTTTTTGCA ACGTTTCCAT GAAAGAATGC GGGAAGATAT CCCTAAAAAT TGATCTTCTT 60 

ATTTCTCTAT TCAAAATGAT TCTAGCCTAA CCTTTTTTAC TATTTCATGA TATGATGAAG 120 

CAAAGAATAT TTTAGGAGGT TTTTACATGG CTGCAAAAGA AGCAAGTTTT GATGTAnTTT 180 

CAGAGGTCAA TATGGAGGAA GTCAAAAACG CAATTCAAAT TGCCTTGAAA GAATTGAAAA 240

ATCGTTTTGA CTTTAAAGGG TCAATTGCTG ATATTAAATT GGAAAACGAT AAATTAGTAG 300 

TCGTAGCTGA AGATGATTAT AAAGTCGAGC AAGTGAAGGA TATTTTATTT GGaAAATTAG 360 

TCAAACGCAA TGTTCCGATT AAAAATATTC ATTTTTCAGA AAGTGAAAAA GCCTTAGGCG 420

GTACTGCTCG CCAATATGGT GATTTAATCA GTGGTATTGA TAAAGAAAAT GCAAAAAAAA 480

TCAACACAGC TATCAAAAAC TCTGGGATTA AAGTTAAGTC ACAAATCCAA GAAGACAAAA 540

TTCGGGTAAC TGGAAAAAGT CGGGATGATT TACAAAAAGT CATGGCTCTT TTACGAGAAT 600 

TAGATTTACC AATGGcTCTT GAGTTTAATA ATTATCGCTA AAAAAATCCT TCAACTGTTA 660 

ATGAACAGTt GAAGGaTTTT TTTGTTTtGG TACCGGCCAT TCTTGATTAT TCAATAAAaC 720 

GATTGGTCGG CGTTGGATTT TTCCTGAAAA AGCGGCAATT GCTTTGAAAA CCTTTGgCAT 780 

AAAATAAAAC GCATTtAATT CTTCTAAATC CAGTTCTTCT TCTGCCGTAA ACGAGACACA 840 

AACCTCTTGT TCTGAAAGCA TAATTTCAAT TTTCTTTAAA AATTGAAATT GCCGCATTCC 900 

TGGCACGAAA TAACCAGCCT TTGCAACACG AGAGGTTGTA ATATCTAATT TATCACTAAT 960 

GATTAGCGCT AGTGTCATTA ACTCTTCCGA ATCAAAGCCG TTACTATGAT TTTCTATAGA 1020 

AGATAAAATT TCTTCTTGAT AAGGCAAACA AATCTTTTGc TTACGAAAAt ATGCTTCTGC 1080 

AAATtGTTTC CCACGAAGTG CATGTCCTGC TTTTCCTTGT AGCGCACCCA CATCATGGAG 1140

CAGTGCCGCA ATTTTTGCCG CCTCTAAATA CTCCTCTGAA ACAGCAAGTT GTTTCAAAAT 1200 

CATTTCCGTC ATGGCTACGA CATTGGTTAC ATGAAACCAA TTATGGTATG CCCACCCAGC 1260 

TTCTTCATCT TCCATTTGAC CAATTGCATC ATAAATTTGT CGAATTTCAG TATCTTGATG 1320 

AATGCGTTCA AACAGCTTCA TTTATTTTCT CCTTAAAATT AGTTACTGTT CCAAAAATTC 1380 

TTTTAACTCT GCAGGATTTT TTTCGCTGTT TAAACCAACA ATTAAACGAA TCCGCGCTTT 1440

CGGTCCGTTT AAACCACGGG CAAAAACGAC CCCCATTTTT TTCAAGCCAA CGCCACCACC 1500 

AGCATAATCA TAGATATCTT CAGCAATGCC ATTGGAACAA CGAGAAACTA AAACGACTGG 1560 

AATCCCGTTG TCTAGCATTT TTTGTAATGC TGGGAGCGTT TCTGGCGGTA AATTACCCGC 1620 

GCCTAGCGCT TCAATTACTA ACCCATCCGT TTCTGGGGTG TTTAATAATT CAAACATCCG 1680 

TTCACCCATC CCCGCATAGG CCtTCcTACG TGAACATTTC CTTTCACAGA aGAAACATCG 1740
CAAACTTCTT GCGGTAAGAC CTCTTTCGCA AAAAAAGCAC GTTCTTTCGC AATGGTTCCT 1800
ATTGGTCCAA AAGTTGGCGT CCGAAACGTC GCTACGTTCG TTGTATGTGT TTTTGTCACA 1860
TAACGCGCCG TATGAATTTC ATCATTCATC ACGACTAAAA CACCTTTATC GTACGATTCG 1920
TCTGAACAAG CAGTCCAAAT AGCACTGATG AAATTATACA AGCCATCACT ACGGATTTCG 1980
TTACTTGACC GCATTGCTCC TGTTAAAACA ATGGGGATTT TTTTCTCTAA GGTTATATCT 2040
AAAAAATAAG CAGTTTCTTC TAATGTATCT GTTCCATGCG TAATAACCAC GCCATCAATT 2100
GGTTCAGAAT AAGCTTTTTG AATCCGTTCT TTTAATTGAA ACATCCGTTC TAAGGTCATG 2160
TGTGGTGAGG GAATATTAAA AATATCTTCA ACAACTAAAT GAACCTTGCC aGAAAATAAT 2220
GCTTCTTGTT CCATCAACGG ATTTTTTTCA TTCGGGGCAA CACTTCCTCC TGCTACTTTC 2280
GACATGGAAA TGGTCCCACC TGTATGAAGG ACTAAAATGG TCTTCATTTG TTTCCTCCTT 2340
TGTTTTCTTT ATTTTTTAAC CAAAATTGTT TCACTTGTTG AATGAGTTGT TCATTGGTGT 2400
TTGCCTTTCG ATGCGCCAAC CAATGCGGCG GAAAGAAACG CTGGACACTT GGTAATTCAA 2460
ATCGTTCGTC TAATAACAAC ACAAAGCCTT TATCTTGCGC ATCTCGAATG ACACGTCCAG 2520
CCGCTTGCAA GACTTTATTC ATGCCaGGAA TTTGATACGC AAATTGAAAC CCTTGCCCTC 2580
GCTCTTCTTG ATAAAAAGTG CGAATCAGTT CTTGCTCTGG GTTCATTTGT GGTAGCCCGA 2640
CACTAACAAT TATCGTACCA ATCAATCGGG TACCTTTCAA GTCAATTCCT TCTGAAAATA 2700
TACCACCTAA AACACAAAAA CCGATGAAAG TTTCTTGAGG ATTCTCTTTA AATTCAGCTA 2760
AAAATGCTTC TCGCTCTTTT TCGTTTAAGT CCGTTCCCTG AATTTGAGTT TTTACTTGCG 2820
GGTAACGTTG GCTAAATACT TGATAGACAT CATCCATATA GGCGTAGGAA GGAAAGAAAA 2880
TTAAATAATT ACCTGTTTTT GTTTCACTTA ACTTCCCTAA AAGTTCACTA ATCTGTGGAT 2940
AACTTTTTTC ACGGTTTTTA TAGGTTGTTT GAAGATGACT ACCAATCATC AACAATTGAT 3000
TTTCTTCCGG AAAAGGATAC GGAATCCGAT AACGTAGGCT TTCTTCACCT CCACCAAGCA 3060
CTTCTTGATA ATAATTCAGA GGTGTTAAAC TTGCCGAAAA TAAGATACTC GCTTTCCCTT 3120
TATCTAATCG TTGTGACAAT AAATAAGCCG GATCGACACA AAATTGTTTC ACAACGCAGT 3180
CATAGTTTCG CAAAGAAATA GACGTATAAA AGCCGTTGTC GTAATACTCG CTGATTCGAA 3240
GATAACTTAA ACAATCAAAA TAAAAACTGA GAATCGACGC CAATTCCTCA ATGTGTTGAT 3300
TTTCTGGCAA CCATTCTGCA ATTACTTCTG TTAATCGATG TATCTGTTTA ACTAATGAAT 3360
CAATTGGCGC GCGTTGGGCA AAAAAATCCG TGTCTTTTTC TTCACAGATT GCTTCAATAT 3420
CATCAAACGT AGTTAAGACT TTTTGACAAG CTCTTTTCAA TTGAACAGAT TCTTTTTGCA 3480
ATCTTTTTTG AATTgCAAAA CCGCATTCCT AGAAATAGCG GCCGAATACA TCTCCCGTGA 3540
ACGATTAACC AAATTATGGG TTTCATCCAC TAAAAAAATA TTCTCTTCTG cACTTTGGGG 3600
TTCTTCAAAG AAGCGCCGTA AATAAACCGT TGGATCAAAT AGATAATTGT AATCGCCAAT 3660
AATTAAATCA CACCAAAGAC TTGCATCTAG CGACAACTCG AATGGACACA ATTGGTGCTT 3720
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TTGTGCATAC 


CGTTCAATAA 


CCTCTCTCGT 


TAAATGATTT 


TCATGTTGCA 


ATAAATCCCA 


3780 


CAGTCCCTCA 


TTTAAACGAT 


CATAATAGCC 


ATCAGCAAAC 


GGGCAATGTT 


CTGGTGTACA 


3840 


ATTTCGTTCA 


GTTAAAAAAC 


AAATTTTATC 


TTTTGCTGTT 


AACGTAATAC 


TCTTAGTTTG 


3900 


PGPTTGTTTT 


TCGGACAGCG 


CTTTGACGGC 


ATCTTCAGCC 


ACTTGCCGCG 


TGATGGTCTT 


3960 


AGCTGTTAAA 


TAAAAAATCC 


GTTCACCTTC 


TTCTTCACCA 


ATTGCTTTCA 


TTGCTGGAAA 


4020 


AA^AGTGGAC 


ATTGTCTTGC 


CTGTTCCAGT 


TGGTGCTTCC 


ACGAAGAGCC 


GCTGTTTACT 


4080 


a apaattgtt 


TTATAGACGG 


CTACCGCCAA 


TTCTCGTTGC 


CCTTTGCGAT 


ACTCCCCGTA 


4140 


AGGAAAACTA 


AGATTTTTTA 


ATGATTTATT 


TCTCAAAATA 


CGCCACTCTT 


CTTTAAAAAT 


4200 


tapppactgt 


TCATAGCGCT 


TCATCAAATC 


AGTAAAAAAC 


TCCGCTAATG 


CAGCTTCTGT 


4260 


AAAATGGCGT 


TCTTGTCGTG 


TGATTTGTTT 


TGTGGTGGTC 


TGATAATAAG 


TGAGTTGAAG 


4 320 


PGTTATTTCA 


GAAAGTTGTT 


CTTGTTGGCA 


ATACATGTGA 


CCATAACACA 


TCGCTTGATA 


4380 


CCAAAACAAT 


TCGACCTTTT 


CTGGCGGCAA 


TTCTTCAAAC 


GCTGGTTCGG 


ATGTTTTAAT 


4440 


TTPATPAATG 


ACCACTTGCC 


CAGTTTCTTT 


CACAAAAATG 


CCGTCTGCAC 


GACCTTCTAC 


4500 


A r A AT AAT T A 


TGGCCATTTA 


AAGGCACCTC 


AATTTTTAAA 


GATACTTCCT 


TTTGATAAAG 


4560 




TCCTTTTGCA 

X x x x x y vn 


ACATTCGATG 


AAGGCGACTA 


CCTTCTTGTG 


GTGTATGTTC 


4620 


AC T C AC AT G G 


CGAGAATCGA 


TACTTCCCCG 


CCTTAAAATA AATTCGACCA 


CTTGACGCAC 


' 4680 


PHP AATTTTT 

U W ViiTi X X X JL X 


CTCGTTGATT 


TCACGTAATC 


CCACCTATCT 


GCCTCCCTAT 


TATAGAACAT 


4740 


ACATTCGCTA 

nun x x \*>\-j\u> x 


AACAAAAGCA 


AAAAATCCAA 


AACGAACACA 


ATCATGCGTT 


CATTCTGGAT 


4800 


TTCTCTTTAG 


ATTATCAACA 


ATTAAAAACA 


ACTTGTTTCT 


TCTAACTGTT 


TGGTTTCTTT 


4860 


TCCATTTGCT 


TTTGCCACGA 


CCAATTGCGC 


CATTTGTTTA 


AACGAATTAT 


TTAAACCAAT 


4920 


GCCGCCCTTT 


TTGGGCATCT 


TCACAATCGG 


TACTTGCTGA 


TAATAGTGAC 


TAACTGCTGG 


4980 


ATTTTCTTCT 


GCCATCGCTG 


CAATTGCAGG 


AATGCCTGCA 


GCATTAAACT 


TACAAGCTAG 


5040 


ACGAGCAGCC 


ATTTCTCCAA 


AAT TAG GAT A 


ATGCATGGCT 


GGTCCGCAAA 


GAACCGCATC 


5100 


GGCATGAAAC 


TTTTTTGCAA 


AG CC AAT AAA 


TTTCTTTGCT 


ACCTCTTCCT 


CGTTTTCCAA 


5160 


ATAATATTGG 


TCACCACAAT 


AAAGTGTCGC 


AACAATTTCC 


GCACCTTGCT 


TCTGTAAAAA 


5220 


TGGTAGCAAT 


GTTACCCCTG 


GACCTAAGGG 


AGCTTTTTTA 


CCACTTGGTT 


TCAGCTGAGC 


5280 


TTGTTCATCA 


CTGCCAAATC 


CTGCTAAACT 


GTGGTTTAAG 


ACTAAAAGGA 


TTTTTTTCAT 


5340 


GCCTATTTCC 


TCCTTATTTA 


TAAATCATCG 


AAGTCAAAGG 


AATCAAAATC 


A ATTTPTTPA 


5400 


CTCTCTtCTT 


CTTGTGCTTT 


CCATTCTTCA 


CGTAAATATT 


GGTCATCCAT 


AACTCGAATA 


5460 


AAAGGATAAT 


AAATGAACAT 


TCCTAGTACC 


AATAAGAGAG 


CTTGCAGAAT 


TGATCCTTGC 


5520 


CAACCGCTGA 


CTAAAAATCC 


T GAG AT AAT T 


GGCGGGGTCG 


TCCACGGCAA 


TTGAATACCA 


5580 


TTCGTCAATG 


GAACTAAGCC 


CCAATCCATA 


GCAAAATAGG 


CAATAATAAT 


ATTAATTGCG 


5640 


GGTACTAAGA 


CAAATGGAAT 


TAAAAGGATT 


GGATTTAAAA 


CAATTGGTAA 


CCCAAAAATG 


5700 
ATTGGTTCGT TAATCCCAAA AACACCAGGA ATTAGTGAAA GTTGCGAAAG CTTTTTAATT 5760

CGTTGaCTTT TACAGAATAA GACCATCACA ATGATTAATG AGAGTGTGGA aCCGGCGCCA 5820

CCAAATGTTG cAAcATATCT TGAAATTGTC CAGTGATGAT GTTGGGAATT TCATGACCGT 5880

CTTTAAATGC TTGTAAATTT TCAACAGACA ATGCCCGCAA AACAGGATTA AAGACTGCGC 5940

CAACCACACT ACCTCCGTTA ATGCCAAaGA ACCAAAAGAA GTGTAAAAAT AAATAAGCAA 6000

TGCTCATTGC ACCTAAGGTA TTACCAAGAC TCAATAAGGG CGTTTGTAGA AATTCAAAAA 6060

TAACATCTTG TAAATTACCA TAAGGAGTTA AAGCAGCCAG CCAATTGACT AAAAACATGA 6120

GAGTAACAAC TAAAAAACTC GGAATCAGGG CCTCAAAAGA CTTAACCACT GTTGGTGGCA 6180

CTCCTTCTGG CATTTTTATT GTCCACCCTT TACGAACAAT GGCCACAAAT AAACGAGTCG 6240

CAATTAATGC GACAATCATT CCTAAAAACA AGCCACTGGA ACCCATCCAT TTTAAAGGTA 6300

AACTAGTAAC TTCATAAACC TTTGAACTAT TTTCCGGTGT AAATAATGTT TGATAAGGCG 6360


TTACAATAAA 






TTCCTGCTTG 


CATTGCATCC 


ACATTGAATT 


6420 


GTTTTCCTAA ACTATAGGCw ATACCGACAA CGGCCAGAAT AGTCATTACA TCAAaACTAG 6480

CACTAGCTGG AACAGACATC ATTGTCGCCC AGTCTTTACC AAGCAAATGA GACATCCAAG 6540

TATCCCATTG TGGCAAAGGA AAATTAGCAA TTAATAAGAA GACCGAACCA ACAATCAACA 6600

AGGGCGTTGA TAAAAGGAAG CCATCACGAA TCGCTACTAA ATACTTATTk TCTCCTATCA 6660

CTTTTGCCAA AGGGGTCAAT AC 6682



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

AACTCCTAGA ACCAAGTAGG TACTGTTTAA CATCTATTTG CTTTGCAAAT AGTGTTTCTC 60 

AGCAATAAGA AGGGATTTCT GAAAATTGTT CTATAATTTT TGGAGACCAA GTAGGTATTG 120 

TTCATCATCG AATCACTACG TTCTTCGTGA TTTTCAACAA TTTCGGCTTA TTTCCGAAGA 180 

CCTTCAACTC TTAGAGGTTT AGCTCTTAGA AGTTGTTGAG AACGAAGCAA CGCGTAGTGT 240 

TGCTTCTGTT CCCGCCGTTT ATCAGTTTTT GAGCGTGAAG TAAAAATCCA AAGTGATTTT 300 

TATCCCACGC CCATTTTCAT TTATGACGTT GGCTTCGTGT AAATTGAGAA GTCTCGTCCT 360 

CTCTAGTGGT TGATTGCTGT GTTCGTTGTA CGGCTTGCTT TTCTGGCTCG CTCTTTTTTG 420 

CAACATCAGT CACAACGTTT GGCTTCTGAG AAGGGGGAAC AGTTGTTTTT TTGTTTAAAG 480

TTGGCGTAGT TCTTTTTGTT TCTTTCTTTT GTTTCTTTTT ATTATTTTTA TCATTTCCAG 540 

TGACTGTCAA ATAGATAATC AGAAAAATCA TTAGGATCGT TGCAACAATT AATAGAATAA 600 




ACATGTTAAA TGAAATGAAC ATGCATGGCC CTCGTTTCTC GTTTTTTCTt AGTTTACCAA 660 

AATTtCAAAA TTTCGTCACT CTTTTCTtCA GAAAAAACGA ATTTGAAAGG GGTTAAAAGA 720 

AACTTTTATA AAAAAGCCTC CTTTTTTGTG TGAAAATGCA CAAAATTTCT GGaATCGTTT 780 

GCTGTTAAAA CCAGATACTG GTAAAATAAT TATTGTATAA GaATCaTTTT AAAAAACGAT 840 

TTTTGTTACA AATAACGTAT TTGTTGAAAA GGcACTGTGA AGTAGTTAGA GACGACAGCC 900 

aAGTGACTTT TAATGACAGC CGCCCAATTA ATTTCATGGA GCCTTTTCGT AGAAAACGAG 960 

GAGGTTATTT TGCGTGVWG TGAGTATATT TTCGACATGC GTGGTGGATT TACTTTTTCC 1020 

AAATGTAGGG CAAGCAATGG TAGAGGTTTT GGAACGTTAT GGCTGTGAAA CGGTTTTACC 1080 

AACAAGCCAA ACGTGTTGTG GTCAGCCAAC GTATAATAGT GGCTATGTGA CAGAGAGCAC 1140 

GACAACGCTT AAGAATCAAA TTGATAGTTT TGACGGTGCC GATTATGTGG TGGGACCTGC 1200 

GGGGTCTTGT GTCGGCATGA TGAAAGAATA CCACAAATTT TTAGCAGATG ATCCTATTTA 1260 

TGGACCAAAA GCACAACGTT TAGCTGAGCG AACTTACGAG TTTAGTCAGT TTCTTTATCG 1320 

TGTCTTAGGG GTCAAAGATG TTGGCGCAAC GTTACATGGG AAAGCCACTT ATCATCGTTC 1380 

TTGTCATATG ACGCGAATTT TAAATGAACG AGAATCACCC TTTGTTTTGT TGGATCATGT 1440

GAAAGGATTA GAAATGATTC CTTTAGGACA TCTTGAAAAT TGTTGCGGAT TTGGCGGAAC 1500 

GTTTTCAGTG AAAATGCCAG CTATTTCTGA ACAAATGGTG ACGGAAAAAA TGAACGATGT 1560 

GATTGATACA GGCGCAGAAA TTTTAATTAG TGCGGATATG GGTTGCCTGA TGAATATCGG 1620 

GGGTAAATTC AATCGTGATG GGAAAAAGAT TAAGATTATG CATATTGCAG AAGTCCTTAA 1680 

TCACGAAGTG GATGAAGCAC GCATGGACCA ACCACAAATC ATATCGGTAG GGTAAGGGGG 1740 

AAGAACTGTG GGATTATOA CGAGCAATAA GCCGTTAAGT GAACGAATTG AAGAAAGTAA 1800 

AAAAGATGTG TTTATGCAAA AGGCTGTCGC TAAAGCACAA GATGCCCAAT GGGAGAAACG 1860 

AGAAGGTGCG CGGGAAGCTT TAGGCAATTG GCCACAATGG CGTGAACTGG GAGAACAAAT 1920 

TCGCCAACAT ACCATTCAGT ATCTGCCTGA CTATTTAGAA GAGTTCAGTG ATAACGTTGC 1980 

CAAACGAGGT GGCAAAGTCT TTTTTGCTCA AACCGCCGAA GAGGCCAATG AATATGTGAA 2040
ACAAGTGGTT CTAGAAAAAA AGGCGAAAAA AATTGTGAAA TCTAAATCAA TGGTGACGAC 2100

CGAAGTCGAT ATTGATCCAA TGTTATTGGG ATTAGACGAT GTCAGTGTGA TGGAAACAGA 2160 

CTTGGCGGAA TTCATTTTAC AAATGGATGA CTGGGATGAG CCGTCACATA TTGTTTTCCC 2220 

AAGTATTCAT AAAAATCGCG AACAGATTCG TCAGGTTTTT GCGAAAAAAT TAGGGTATCA 2280 

AGGAGATAAT GACCCGGTAA ATTTGGCACG CTGTGCACGA GAAGTGATGC GCAAATTCTT 2340

TTTGGAAGCG GAGATTGGCA TTACAGGTTG TAATTTCGCT ATTGCGGATA GCGGCTTAAT 2400 

TAACTTAAAT ACAAATGAAG GAAATGCTGA TTTGACCATC AGTATTCCTA AGACACAAAT 2460

CGTGTTAATG GGGATGGAAA GAATCGTACC AACAATGCGT GAAGCAGAGG TTTTAGATAA 2520 

CCTTTTAGCC AGAAGTGCCG TTGGTCAAAA TTTAACAACG TATGTGACGT TTGCCGGGCA 2580 
AAAAAATGCG GATGAATCCG ATGGTCCAGA AjSAATTTCAC GTGGTCATTT TGGATAACGG 2640

ACGTTCAAAA GCGTTAGGAA CAGCCTTTCA GCCAGTCTTA CAGTGTATTC GTTGTGGGTC 2700 

TTGTCTGAAT GTTTGTCCGG TCTATCGCCA TATTGGAGGG CACGGTTATG GCTCTATTTA 2760 

TCCAGGACCA ATTGGTGCTG TGCTTTCTCC AATTTTAGGT GGATATAAAC AGTTTGGGGA 2820 

ATTGCCTTAT GCATCTAGTC TTTGTGGGGC ATGTACGGAA ACGTGTCCTG TGAAAATTCC 2880 

ATTGCACGAA TTATTAATTG AACATCGTAA AGTGATGACC GATGATCTGA AAATGAAACA 2940

TGGATTTGAA GATTTCCAAA TGCGTATGGT AGGTAAAGCC ACTGGTTCAC CAGCAATGTT 3000 

TAAAGCAGCC ATGAAAGTCG ATCATGCTGC AGCGGGCATT TTAAGTAAAC AAAAAGACAT 3060 

TACTGTTGAA AACATGTACA ATCaTGGAGG CTACCTTGAT AAAGGACCAG GGTTAGTAAA 3120 

AGGTTGGACC GATGTTCGTG ATTTGCCAAG ACCACCAAAA TCTTCAGAAA ACTTCAGAAG 3180 

TTGGTTCAAA AAACATCAAG AAGGTGAAAA AAATGACTAA CGAAGCGATT CAAAATCGAG 3240

AACCTTTTTT ACAAAATCTA CGAGAAAAAT TAGACGTAGA AAAACAGCCT GTTTCGGCCC 3300 

ATCCATTTGA GCCAGTTAAT CATTTACCAG AAGAACAGTT GGCAGATAAA ACACCAGCAG 3360 

AGCTATTAAC GATTGTGAAA GAACGTGTGG AAACGATTCA CACGAACTTA GTGGAAACCA 3420 

CACAAGAAAA TTTACTGACG ACGATTCAGC AAATTGTTGC TGAGTTTGGT GGGGACAATC 3480

TTTTATTGCC AACAGATGCA CGTTTTGAAG CCTATGGATT AGCTGATTTA GCTAAGTCAT 3540

TAGAGGCTGT TTCTGTTAAG CAATGGCAAC CTGGAAGTGA GCAACGAGAA GCGAATATTC 3600

AAACAGCTGC TCaGGCaAAT ATTGCGATTG CCTTTGCTGa GTTTTTAcTA GCTGAATCaG 3660 

GAACGATTGT GGTGGAGTCC aATGCGGGAC aAGGCAGAGC CtTACATTTT TTACCGAAGC 3720 

ATTACATTTC AATTATACCG TTTAGCAAAC TCGTTCCACG TTCGACTCAG CCAGCAGCTT 3780 

TTTATACAGA AAAAATAGAA AAAGGCGAAA AAATCGGCTC AGCGATTCAC TTTATTTCAG 3840 

GGCCTTCTAA TTCAGGAGAT ATTGAAATGC AACTGGTTGT CGGGTTGCAT GGTCCTTTAG 3900 

AAGTTTGTTA CGTGGTTGTG ATGGATCGCT AAAACGCTAG AAGCATAATC AATTCAGCCC 3960 
TTTTTAGAAT TGATTATGCT TTTTTTATGA GCAGGATCAA AAAAGTAAGA ATCAAATTTG 4020

AAAGAAATAC GTTTTTTTTC TGACCATTTT GTAAAAAAAG CGTACATAAA AATTAATTGG 4080 

GTGTTAAGCA CTATAGGATT GATTTACTTT GTGGTAGAAT ATGAGACATA AGATAATAAG 4140

GTTTTTGGAG GAATGAGTTT GAAATCAAAA GAATTAATTA AAACAGTCGT CTTTTTTGCC 4200 

TGTTTAGCTT TGGGTCTGTT TTTACTGAGA CAATTTGTAT TTACGCCTGT CGTAGTGAGA 4260

GGTCATTCAA TGGATCCAAC GTTAGCAGAT GGTGAACGGG TAATTACGTT AAAAAACACA 4320

GAAATTAATC GTTTCGATAT TATTACTTTC CCAGCGCCAG ATGAACCAGA TAAAAATTAT 4380

ATTAAACGTG TGATTGGTTT ACCTGGAGAT ACAATTGCGT ACAAGGATGA TACGTTGTAC 4440

ATCAATGGAA AAGAAGTTGA CGAACCCTAT TTAGATGAAT TTAAAAAAGC CTTAACAGAT 4500

GGTCAACCTT TGACAGGCGA TTTTTCATTA AAAGAAAAAG TACCAGCAGA TAGCTACTTT 4560
GTTTTAGGTG ATAATCGACG GAATTCAAAA GACGGTCGTG TCATTGGTTT TATTCATAAA 4620

AAAGATATTT TGGGTGAAGT GAAATTTGTG ATGTGGCCAT TCTCACGGTT TGGTCCAATA 4680

CCAGAAGTGT CAAAACAATA AAAGAGACAA TGAGTAAAAG AGAAGGACGG AAATGTGTAA 4740

AAACAAACAC ATGTTTCCGT CCTTCTTTTG TGGTAAAATA AAGGCAGTGC AAAGGAAAAG 4800

GATGTGAAAC CATGAGTGTG CAGTTTATTA GAGGAACCGC CGTGGCAGAC TTAGAAGCGC 4860

CACTTATACA GGCGACAAAA CAATGGCTAG AGGAAGACGC ACAGCATGAA GTATTCTATT 4920

TAGTCCCGAA CCATATTAAG TTTGAGCAAG AAATTCAAGT CTTGCAAAAG TTACGTCAAT 4980

TACAGACAAC TACGTCTGAT TCAATAACCA GTACCCGTCT GCAGGTTTTT AGTTTTTATC 5040

GTTTGGCTTG GTACTATTTA CAACATACAC CTTTTTACTC GGCAGATGTT TTATCTGATG 5100

CAGGTGCTGC GATGATTTTT CGCAAAATTT TAGTAGAAGC AGAAGAAGAG TTACAAATTT 5160

TTAGAGGTGA AATTAACAAG CCTGGCTTTA TTCaGCAACt ATTTCAGCTT TATCaAGAAA 5220

TGCGCGAAGG CAATATTGAG ATTGCGGAGC TTTATCCATT TTTAGAAAAA CAAACGGAGA 5280

ACCCTAAAGG ACAAGATTTA CAACTTAAGT TTCAAGATTT AACGTTAATT TTTACGCGAT 5340

TCCAATTGCA AATGAGTCAG TATGGCTATG AATCAGCGGA GATTATTCAA CATTTAAGCG 5400

AGTATTTACA AACGGTTGAT TTGTCGAATG TTCAATTCGT TATTTCAGGC TATCAGCAGT 5460

TTACTGCCAG AGAATTAAAG TTGATTGAAG TGCTAATGGC GCAAgcAGGT TCTGTGAAAG 5520

TAGCCCTTTT ATTGGACAAA CAATATCCGC ATGACTTACC TGATCCGCGG TCGTTATTTT 5580

ATGAAGCGGG ACAAACCTAT CATCAGCTTT ATCAACTAGC AAGACAGAAG CAAATCCCGA 5640

TTCTTTCAGA TTATGTGGAA AAGAAAGAGG TATTAATTAC AAATCCAGAT TTGCAAGGAT 5700

TAAACGATTA TTGGATTCAA TCGCAAGAAC ACCTGCCTCC ATTAAGCACA GCTGACTGGA 5760

GGGGCGATGG CTTATTTCTG TGGCGTGCTG AAAACGTTAA AGAAGAATTA ACACATGTTG 5820

CAACCGAAAT TCGCCGTTTA GTGGTGGAAG AAGGGTATCG GTACAAAGAA ATTCAGGTGT 5880

TGACTCGTGA TTTAGATTGT TATGAAAATT TATTAGAACC GATTTTTGCG GAACACGAGA 5940

TtCCTGTTTA TGTCGACCGA GATATGGCTA TGGATCGGCA TCCTTTAGTC GAATGGATTG 6000

AATCTTTATT TGCTATTCAT TCGTACAATT ATCGTTATCG CGATGTGCTC CGTTTTTTGA 6060

GAACGGAATT ATTTATGCCA ATGAATCAAC TCGCTACTTC TGAAGAAAGC TTGACTGATT 6120

GGTTAAACCA ACGGAACGCT TGGCGCAGAA AAGTTGATAT TACCGAAAAT GTAGTCCTGG 6180

CATACGGCTA TGAAGGTTAT TATTGGTCAC AAGAAAAAGA TTGGGAATTT ATTCGGTATG 6240

ATTTTGAAGC GGAAGAGCAA GAAGATGTCG CTACGATGGA AGAAGAGTCA AATGCGATTC 6300

GTCAATCGCT TCAACGGCTG TTACCTAGTT ACTTTCAAGC AATGATAAGT GCCAAAACTG 6360

GTTTGGAAGC AGCGACCGTC TTCTATCACT TTTTACTGCA AAGTGGTGTG GCAACACAAT 6420

TAAAAATGTG GCGCTTACAA GCTATCGAAG CAGGTCAACT AGAAACAGCT CGTAACCATG 6480

AACAGACTTG GGATGCATTG ATGTCATTAC TAGATGAATA CGTCACTGTT TACGGAGAAA 6540
GTTCGTTTGA TTTTACAACG TTTCAAGAAA TATTTGTTAG TGGTTTGGAA GGTTTGCACT 6600

ATAGTAAAGT GCCGACAGCG ATTGACCAAG TACAAGTTCG CGCAATGGAT TTAACTCGAC 6660 

CTGGCGCTGC CAAGGTTACT TTTGCTATTG GGATGACGGA AGAAATTTTT CCACAAAAGA 6720 

TTGAAAATAA AACACTATTG TCTGATGAAG AACGGCAGAC AATCAATGAT ACTTTAACGG 6780 

AAAACCAATA TTTACGTGGG ACAACCGGAC GCAAAATTGC GCAAGAACCT TATGTTGCGT 6840 

ATCTTGTCTT TTCTTCAGCG CGAGAACGGT TATATTTAAC CTATCCAAGT GTTAAAGATA 6900 

CCGCTCAAGA AGTCAAACCG TCTCCTTATT TTAAAAATAT CCAAAAAGAT TTAAACTTGC 6960 

CCGTTTTTGA AAAAAATMAA ACAACCATTT TCGATGATGA AACGACCAGT TTAGCCCATA 7020 

TTAGTACGTA TCGGACGTTA ATCGGTGAAT TAACGCGTTT GAAAAGACAA CGGAAAGAAA 7080 

CACAAGAAGG ACTGTTGCCT TTCTGGTTAA ATATGGAAAA AGCGCTAATG AACCAATCAA 7140 

TTGCGCCGCT AGCCAAGCAA GTTTTCGAAA GTTTAACGCA TCAAAATATT CCTGAAAAGA 7200 

TTGATGAGGT CTTAGCAGAA CCTTTATATG GTAAAGATAT TTATACGTCC GTTTCTCGGA 7260 

TGGAGAGTTT TTATCGCTGT CAGTATCAAT ACTTTTCTCG CTATGGCTTA CGTTTGAAAG 7320 

AAAGAGATGT CTTTGGGCTT TCGCCAGCAG CTACAGGGGA ATTTTTTCAT GAAGCATTGG 7380 

ATCAGTTCTT TAAATTATTG ATTATGAATC AACGGAACTT ATCTGAATTA ACCGATCAGG 7440

AAGTCAATTT ATTGGCGGAG GAAGTACTAA ATAGTATTTT AGGGGATGCG CGTTTTTCTG 7500 

TATTAACGAC CTCTAGTCGC ATGAACTATA TTCGCTATCA ATTGAGTCAA ACAATTAAAA 7560 

AAGTCAGTTG GGCGCTGAAA CGTCAAAGCC AACGAAGTGs CATGACGACT GTTCAAACGG 7620 

AAGTGTTATT TGGCCAAATT GCCGCTAAAA AAGGAATTTC TGGTTTGGaA TTACCATTAA 7680

AAAATCAAGG GAAGATTcAT GTTCGGGGAA AAATCGACCG GATTGATCAA TTGGTTACCC 7740 

CCGAATCAAC CTATTTAGGT GTGATTGATT ATAAATCGAG TCACCGTAAA TTTAATATGA 7800 

CAGAAGCGTA TTACGGCTTA GCCATGCAAA TGCTGACGTA CTTGGATGTG GCTTTAATGG 7860 

ATGCTGTGCA ATTAGTTGGT CAAGAAGCTA AACCTGCTGG CTCACTTTAT CTGCATGTAC 7920

ACAATCCTAC ATTATCTTAT GAAGGCAAAG ACGACATTGA ACAACAAATG CTTAAAAAAT 7980 

ATCAGTTTGA TGGTCTTTTG ATGAAAGACC CAGATTTATT GGATCATTTA GATACGAGTC 8040 

TGCAAGCTAA ACAAAGTTCG TTGCTGTTTC CGATTGAAGA ATCCGCCAAA GAACAAATTA 8100 

AACCTGGTCG TCGCCAAGAA GATAAATTTG TGACAGAACC AGAATTAGGC GCGTTGCTTT 8160 

CCCATAACCG AAATAAATTT ATTGAAGCGG GGAATCAAAT TATTGGCGGC GAAGTTCAAC 8220 

TTAATCCCGC TTATCAAGGC AAGGAACGGA TTGCTTGTCG CTATTGTCCG TTTCGAAGTG 8280 

TCTGTGATTT TGATGTGATG CTAAAAGAAA ATAATTATCA TCGTATTGAG AATTTATCCA 8340 

AAGAGGAAAT TATGGCGCGC TTGTTAAACA AAGATGAGGA AGGAGCAACC GAAGATGAGT 8400

AAAACAATTC CACTACGTCC AGCCAATGAG CAATTCACCG ACAGTCAATG GCAAGCGGTG 8460

TTTGATGGCG ATGAAAATAT CTTAGTTTCT GCTTCGGCTG GCTCAGGAAA AACCACCGTT 8520 
TTAGTCCGCC GTGTCATTGA AAAAGTCAAA AGCGGCGTCG ATATCGATCG TCTCTTGATT 8580
GTGACGTACA CTGAAGCCGC CGCCCGTX5AG ATGAAAGAAC GTATCCAAGT GGCTTTACAA 8640
AAGGCAATGA ATGAAGAACA AGATCCAGAA AGAAGACGGC ATTTTTCTCG TCAAATTGCT 8700
CTTTTGCCAA CAGCCAATAT CAGCACGTTA CATGCCTTTT GTTTAACTGT TATTCGTCGT 8760
TTTTATTATT TAATTGACAT AGATCCAGTT TTTCGGATGT TAACAGATGA AACAGAAACA 8820
TTGTTATTAA AAGAAGATGT TTGGGATGCA TTACGTGAAC AATTTTATGC GGAAAATCAA 8880
GAAGCGTTTT ACCAATTAAC AGCGAATTTT TCCAATGACC GTAGTGATGA CGGACTCACT 8940
AACTTAATTT TTTCTTTTTA TGAATTTGCC AAAGCCAATC CTGATCCAGA GGCGTGGATT 9000
AATGGCTTAA CACAAGCCTA CGAAGTAGGG GaTCAGCTAG GCGAATCAAA GTTATTTCAG 9060
ACGTATTTAA AACCGCTAGC CGTTGAGACG TTACAACGAA CTCTTCAGCG TTACGAAGAA 9120
ATGGTGACCT TAACAGAGGG CGAAGAGAAA CTTCAAAAGA TTTGGTATTT AGCTCmAAAT 9180
GAAAAGGAAC AAACAAAACA ATTTTTACAA TTTTTAGAGA GAAATGATTT AGAGAGCGCC 9240
TATAATTTGA CTGAATTGTT GAGCTTTGAC CGCTACCCAA CGGTTCGTGC CGAAGAGTTG 9300
AAACCAACCG CGGAACAAGC CAAACAGTTA CGAGAACAAA ATAAAAAAGC GTTAAATGAT 9360
TTGAAAAAGC AACTGTTTAC ACTTTCCCCA GATGCCATGA AACAAGTCTT GAAAGAGGCC 9420
ACGCCCATTG TTCAAGAAAT GGCTCACGTG GGGAAACAAT TTATGGAGGC CTATGGCGCA 9480
GAAAAACGGC TTAAAAATTT AGTGGATTTT AACGATTTAG AACATTATAC GTTAGCTATT 9540
TTAGCGAAAA ATCAGGCAGA TGGCTGGCAT GCATCAGAAG CCTCGGTTTA CTATCGCGAA 9600
AAATTTGATG AAGTGTTAGT CGATGAATAC CAGGATATCA ATCAATTACA AGAATCTATT 9660
TTGTATTGGT TGCGCCGTCC GCTTAGCACA GAAGGAAACC TTTTTATGGT AGGAGATGTG 9720
AAACAGTCTA TTTATTCTTT CCGCTTGGCA GATCCTACAT TGTTTATTGA AAAATACAAT 9780
CAGTATGGGC AAGGCAAAGA GGGCAAACGA ATTATTCTAG CTGAAAATTT TCGTTCTCGC 9840
AAAGACGTTT TAGACTTTAC GAATTTAGTA TTTAGTCAAT TAATGGATGA GCGAGTGGGC 9900
CAAATTGCTT ACGATGAGTC AGCTGCTTTG GTTCATGGTT TTGATCAATT TTCGGAGGCG 9960
GCTGATTATT CAACTGAATT GCTAATTTAT GAAAAAAAAG CAACAGAATC AGTTGAATTT 10020
CCAGAATTAC AATCCCCTGA GTTGTTGATT GAAGATAAGA CAGAAGGGGA ACTTTACGTT 10080
ACTGCTTTGA AAATTCGTGA ATTAATTGAC CAAAATTTCT TAATTTATGA TAAAAAGCTG 10140
AAAACGGATC GACCGATTAC TTATCAAGAT ATTGTCTTAT TAACGCCTAC GAAGAAAAAT 10200
AATTTAACTA TTTTAGATGT TTTTAAATCA TTAGAAATTC CAGTTCAAGT CAATGATGCT 10260
CAAAATTATT TCCAAGCAAC GGAAATTCGT ACAATGATTG CCTTGCTTCA ATTGATTGAT 10320
AATCCTTATC AAGACATTCC TTTGGCAGCT GTGTTACGTT CACCAATTGT TGGATTAAAA 10380
GAAAACGAAT TAGTGCTTAT TCGTTTGGCT AATAAGGAAA CCAGTTACTA CGAAGCATTT 10440
TTAACGTTTA ATCAAAAGAT GGAACCAACC ATGGAGGAAG CAGTTGTTCA GGAAAAAACT 10500
ATACGTTTTG CTGAATCATT AGAAAAATGG CGGGAACAGG CTCGAGGGAA GGAGAl I AGG 10560
AACTTGCTGT GGACGATTTA TCGGGAAACG GGCTATCTTG ATTATG-1TGG -CGGG1 JLGUG1 10620
GTTGGCAAAC AACGCCAAGC CAATTTATAT GCGTTGGTCG AiGGGGGGGG bbtulAlbAA 10680
AAAACAACCT TCCGCGGATT ATTTCAATTT GTTCGCTTCA TCGAAAAAAI GGAAGAAAAA 10740
GACAAAGATT TAGCAGA/YCC AGTCGTTTTA AGTGAAGAGA ATGCCGTACG CGTCATGACG 10800
ATTCACGCTA GTAAAGGGTT AGAATTTCCG GTGGTATTCG TTTTAGATAT GACCAAAGAA 10860
TTTAATGTCA GTGATTTGAA TGAACGCTAT ATCTTTGAAG AAAATTTAGG GGTAGGGAi I 10920
CGTTATTTAC AACCTGAAGA ACGAGTTATG TACGATACTT TACCATTTTT GGCGATTAAA 10980
CAAGTACGAT TAAGAAAATT ACTATCCGAA GAAATGCGGA AGTTATATGT GGCGTTGACA 11040
CGTGCAGAAC AGAAACTTTT TCTGGTAGGC TCCTATAAAG ACCAAGCGGG GATGTGGAAA 11100
GAGTGGCTGA AAGTTGGCGA TGTAGAAACA CTGGTGCTAC CAGCTGAAAA TCGCTTGCAA 11160
AGTAAAAGTA GTTTAATGAA TTGGGTTGGG ATGACATTGG TACGTCATCA AAAAGCCGAT 11220
GAATATCAGC AAGAAGTAGT AGTAAGTAAC GTTCCGCAAG TCAAAAAGCA CCCAGCTAAT 11280
TTTCATATTC AGTGGTTCAA CGAAGAACAA CTACGAGCAG CCATTCAGCA ATTACAATTG 11340
CCAGAAAGAC AAGCAGAGGA TTTAGCCGAA AAAGCGCAAT TATCTGCTGA CAAAATCAAT 11400
CGTGGCCTTG CACGGTTAAG TTTCAATTAT CCGTTTGAAG TGGCTACTCG GACCACCAGT 11460
TATCAGTCTG TTTCTGAAAT CAAGCGAGTA TTTGATGACC CCGATAACAA AGAAATTGGT 11520
AAAATTGAGG TTCGGGAAGA CAATACTATT CAAGCGCAAC CGTTGATTGT CAATCGAATG 11580
ATTGAAGGAG ATTTAAGTAA GCCTAAATTT TTAGATACGG TTCAAGCGCC CTCAGCAGCT 11640
GAAATTGGGA CAGCAACCCA TTATTTATTA CAACTAATTG ATTTAAGTAA GCAACCTTC1 11700
TACGAGGAAG TGCGAGCTGT TCAAGAACGC TTAGTGGAAA ATAAATTGAT TTTACCAGCA 11760
ATTGCTGAAA AAATGAATCT GGAACAAATC GTGGCCTTCT TTGACACCGC CTTAGGAAAA 11820
CAATTGATTC AACACCATCA AACCGTTAGA AGGGAACAAC CTTTTTCTAT GCTGATTGAA 11880
GCAGAAGAGC TCATTCAAAA TTATCCAGAA ACGACACAAG ATGATTTATT AAT T C A 1 GG G 11940
ATTATTGATG GCTATATCGA ATTAGACAAT CAATGCATTT TGTATGACTA TAAAAGGGAI 12000
CATGTAAAAG GCACCTCTCC CCAAGCAATT TCTGAAATTG TCGAACGGTA TCGCGGAGAG 12060
ATGAATCTTT ATCGTAGAGC GCTACAAGAG GCAACGCATA AAGAGGTGTC GCACGTGlAi 12120
TTAATTTTAT TAAATGGAGG AGTAATTATT GACATGCAGA CmGGAAATGT TGTAGA1 1 i I 12180
ATAAAGTAGA AAAAATTAGA AGCAGAAACA CTTACATTTT GTAAGGAATG TGCTATACTC 12240
ATTGAAGAGG TGAGGAAGAT GGAAAAAGTA AAAACAGAAT CGTTTGTACT TTGTsCCgaA 12300
ATTTGAAAAA TCATTTTCAA TACTTGGTAA AAAATGGAAT GGTTTAATTA TTGATGTTTT 12360
ATTAGAACGA GGTCCTCAAC GTTTTGGCGA ATTAAAAGAA rAAATTCCAA TGTTAAGTGA 12420
TCGTGTCTTA GTAGAGCGAT TGAAGGAACT AGAAGCTGAA GGAATTATCA CCAAAGCTGT 12480
TCGTTGCGGT GAAGGAAATC GCTTTGGAnT ATTTCCTGAC CGAAAAAGGC GAAGACTTGC 12540

AATTAGCAAT GGAACAAATT CAACATTGGG CAGAAAAATG GGTAAAAGAC GAAGAATGCA 12600 

GTTGACCGCA AGCAAAACTT CTTGTAAACT AAACGTAATT AAATAACTAA AAAAACAGTG 12660 

ATGGAAACAA GTAGTTTGAA TTGACGTGTT TTAGGGAGAG TCAGCCAGAG ACTGGAAGCT 12720 

GATTTACACA GTCACCAAGc GAAATTCACT TCCTGAGTTT GTTTGTTTTC AAGAAGAAAG 12780

CAAACCGGTA ATTGCCGTTA AATGAAATGA GTGCATGACA ACAATTTGTT ATGAATTAGG 12840

ATGGTAACAC GATAGCTCGT TCCTTTGAAA GGAATGGGTT ATCGTTTTTT TTTTAGTTTA 12900 

AATAAAAGAT AGAGGGGAAC GCATAATGAC ATTACAAGCG CAATTAGAAG CTCTTAGAGA 12960

CAATACGCTC AAAGAAATCG CACAAGTTGC TACTTTAAAA GAATTAAACC AAATTCGCGT 13020 

AGAAACATTA GGGAAAAAAG GGCCAATCAC CGAAGTTTAA GAGGCATGAA AAACCTTTCA 13080 

CCAGAAGAAC GACCGGTGGT GGGGGGCTTT GCAAATGAAA TTCGTGATTT ATTAACAGAA 13140 

GCAATTGAAG CGCGCAAAGT TGTTTTAGAA AACGAAGCGT TAAATGCGGC ACTAAAAGAA 13200 

GAAAGTTTAG ATGTAACTTT ACCTGGGAAA CAAATGCCTC AAGGCACACG TCATATTTTG 13260

ACACAAGTTA TGGAAGAGAT TGAAGATATT TTCTTAGGCA TGGGTTATCA AGTAGTGGAA 13320 

GGATACGAAt AGAaTCCGAT CATTATAATT TCGAACGCAT GAATCTACCA AAAGATCATC 13380 

CGGCACGCGA CATGCAAGAT ACGTTTTATA TTTCAGATGA GATGTTAATT CGTACGCATA 13440 

CTTCACCAGT TCAAGCTCGA ACAaTGGaAA AACACGATTT TTCmAAAGGk GCATTACGAA 13500 

TGATTTCTCC TGGGAAAGTA TTCCGTCGTG ATACAGATGA TGCGACACAT AGTCACCAAT 13560

TCCATCAGAT TGAAGGCCTT GTTGTCGATA AAAATGTCAC AATGGGTGAT TTAAAAGGGA 13620

CGTTAGAAGT CATGATGaAA AAAATGTTTG GtGaAGATCG TAAAATTCGT TTACGTCCTA 13680 

GCTATTTCCC yTTTACAGAA CCTTCAGTCG AAGTAGATGT TAGCTGTTTT AAATGTGGCG 13740

GCGCAGGTTG TAACGTCTGC AAACATACAG GTTGGATTGA AATTTTAGGT GCGGGGATGG 13800 

TTCATCCAGA TGTGTTACAA ATGTCAGGGA TTGATCCAAC AGAGTACTCA GGCTTTGCCT 13860

TTGGCTTAGG ACCAGATCGC GTTGCTATGT TACGTTACGG TGTAAATGAT ATCCGTAATT 13920 

TTTATCAAAA TGATTTACGT TTCTTAAATC AATTCAAGGT AAAGGAGTAG TTGAAAAATG 13980 

TTAGTTTCTT ATAAATGGTT AAATGAATAT GTTAATCTTT CAAACGTTAC TCCGCAAGAA 14040

TTAGCGGACA AAATGTCTGT GACAGGTATC GAAGTTGAGG GCGTTGCTGT ACCTGAAGAA 14100 

GGATTAAAGA AAATTGTTGT CGGCGAAGTA AAAGAATGTG TTCCTCATCC AAACTCTGAC 14160 

CACTTATCAA TTTGCCAAGT AGACATCGGT GAAGAAGAAT TGTCACAAAT TGTTTGTGGA 14220 

GCACCAAATG TAAAAGCCGG AATTAAAGTC ATCGTAGCAT TACCTGGTTC AAGAATTGCA 14280

GGAAATCAAA AAATTAAAAA AGGTAAAATG CGTGGCGAAG TCTCTAACGG AATGATTTGT 14340

TCGTTAGAAG AGCTAGGATA TTCAGATAAT GTCGTACCAA AAGCCTATGC TGAAGGGATT 14400

TATTATTTAC CTCAAGAAGC AGTGAATGGG ACACCTGTTT TCCCTTATTT AGACATGGAT 14460

GATGCGATTA TTGAATTATC AATTACACCA AACCGTGCAG ATfCACTAAG TATGAGAGGG 14520

GTAGCCTATG AAGTTGGCGC AATTTATCGT CAAACACCTC AGTTTAATGA TCCCGAACTC 14580

AAAGAAGATG CTTCAGATAA CGTGGAAAAT TACGTAACAG TGACTGTTGA AGATTCACAA 14640

GATGCGCCAg cGTATCAAAT TCGTGTTATT AAAGATGTGA CGATTGCAGA AAGTCCTCAG 14700 

TGGTTGCAAA ACCGATTGAT GAATGAAGGA ATCCGTCCGA TTAACAATGT GGTGGACGTG 14760

ACAAATTATA TTTTATTATT ATTTGGTCAA CCATTGCATG CATTTGATTA CCAAAAATTA 14820 

GATAGCAAAG AAATCTTGGT TCGTCGAGCA ACAGCAACAG AAGAACTAAT TACATTAGAT 14880

GGTGAAACAC GTCAATTAAC AGAAGAAAAT ATCGTCATTA CGAATGGAAA AACgCCTGTC 14940 

GGCTTAGCCG GAGTAATGGG TGGGGCTAAT TCTGAAATCA GTCAAGAAAC AACAACTGTG 15000 

GCTTTAGAAG CGGCATTGTT CAATCCATTG TCTATTCGCA AAACGTCTAA ACAATTCAAT 15060 

TTACGGAGTG AATCTTCAAG TAGATTTGAA AAAGGCATCA ACCAAGCGAC AGTTGGGCTA 15120 

GCTTGTGATG TGGCAGCCGC GATGATTGCA GAGTTAGCCA ACGGAACAGT GGTCTCAGGC 15180 

ACTGCAGTTG GTTCTGAAGT GGCCGTTAAA GAAGCGCAGG TAGCTGTGAC TTTAGAGCGA 15240

ATCAACCAAT ATTTAGGCAC TGCATTAGAT GAAGCAACGG TGAATGAAAT TTTTGAAGCA 15300 

CTCGGTTTTG CTTATGAAGT AAATCAAGGC GCATATGAAA TTACGATTCC ACCAAGACGT 15360 

TGGGATATTG CGATTGAAGC AGATATTATT GAAGAAGTGG CGCGCATTTA TGGATATGAT 15420 

CATTTACCTT CAACATTGCC AAGTGGAGAA ACAGTTGCTG GAAGTCTAAC CAAAGCACAA 15480 

CATGTTACGC GCCAATTAAA GAGCTTACTG GAAGGTCATG GTACGAGTGA AGCCATCAGC 15540 

TATGCGTTGA CAACAGAAGA AAAATCCCGT CAATTTATGA TGAAAGAAAG TCAAACAACA 15600 

CGTTTGCAAT GGCCAATGAG TGAAGAGCGT TCTGTGTTAC GTATGAACTT AATTTCTGGT 15660 

TTATTAGATG ATGTCGCATA CAATGTGGCA CGTAAAAATA ATAATATCGC CTTCTACGAA 15720 

GTAGGACGCG TTTTCTACCA AACAGAAGAT CCAACAAAAA ATTTACCTAC AGAAGAAAAT 15780 
CACTTAGCAC TTGCTTTAAC TGGTAATACA ATGGTTAAAG ATTGGCAAAC AAAAGCAACA 15840 
GCCGTTGATT TTTATACAGT AAAAGGCTTA GTGGAAAGTA TTGTAGCTGT TTTAGGATTA 15900 
ACAGAAAAAA TCAGTTATCA AGCGACAACG GCGATTCCAG AAATGCATCC AGGCCGGACA 15960 
GCATGGATTT ATTTAGAAGG TGAAGTGGTT GGTTTTGTCG GACAAGTTCA CCCAACGACA 16020 
GCGAAAGCGT ACGATATTCC AGAAACATAT GTTGCTGAAT TAAACTTGCa ACAATTAGTA 16080 
GCCATAGAAG CAGGCGGGGT TACTTATGAA GCAGTTTCTA AATTCCCAGC AGTTTCTCGA 16140 
GACATTGCTT TATTAGTCGA TGAAACAGTT ACCAATCAAG AACTAGTTAA AACTATTTCA 16200 
GATAACGCAG GTAAATATTT GAAAGAGATT CATTTATTTG ATGTATATCA AGGTGAAAAA 16260 
CTAGGCGCTG GCAAAAAATC AATGGCGTAT AGTTTAACTT TCGTTAATGC AGAAGCGACA 16320
TTAGTGGATG AAGAAATTAA CCGTTCAATG GAAAAAGTTG AAAAAGCGCT AATTGAAAAA 16380 
CATCAAGTAG AAGTAAGATA AAATAAAAAC AGGAAAGTGG GGCATAAGTC GAAATGACTT 16440 
ATGCCCCACT TTCTAAATCT TAAGATAATC AGCTGAACAA AAPCAGCTCC AAGAAATAAG 16500
CCGAAATTCT CCAAAAATTA AAGAACAATT TTCGGAAATT CCTTCTTATT TCTTGGAGCT 16560
AAACACTTTT GTCTCAGCCT CCTATTTTTA ACTACATTAA TAACTTTTCG CCCAATTTCG 16620
TGAAAGCCAA CGTGAAAATT GTGAGATACT AAAGTTAATA ATGAAATAAA TTAGGGCGAC 16680
GATTCCGTAA ATAGTGAATA CTTGTCCTGT TTGTGCGTAg CGCCCCATTA AGATATAAGA 16740
TTTGCCAAAG AGTTCT? J..-A AAGCAATCAC GGAATACAAG AAACTTGTAT CCTTGATGAC 16800
AGTAACAAAC TGAGAAACAA TTGCTGGTAA GACATTGCGA ATGGCTTGAG GCAAAACAAT 16860
ATGATAAAGA ATTTGCCAAT TTGAAAAACC TTGAGAACGT GCCGCTTCAA TTTGGCCTTT 16920
GTCGACCCCA TTTAAACCAC CACGAATAAT TTCCGCTAAA GCAGCGGTGG TAAACACAGT 16980
AAAACTCACA ATTCCAGCAG GTGTGGATTT GATTTTGAAA ATCAAAAAAA TAACATAAAT 17040
CCACAATAAG TTTGGAATAT TACGCACCAC TTCTACATAA ATACTGGCAA GTAACTTCAG 17100
TGGTCCATGG GTTTGATTTC GCAAAACGGC TAGAATGGTG CCAAAAATCG TACTCAAAAT 17160
AATTGAGACG AAGG A/VAT AT AGAGCGTCAA TTTTAAGCCG TTAAATAGAA ATTGAAAATT 17220
ATTTGCTGTC AGTAATTGAC GCATTTCTTC TGAAAAAGGC ATCTGTTTTC CTCCTATCTG 17280
CTATATGCTT TCTTATTTTT TTCTTCCATT TTTCTAGCCA AATTCGCTAA TGGGAAGCAA 17340
AGAATGAAAT ATAAGAAACC AGCAAGCGCA AAGGCGGGAA TATAGTTCAA GTTGACAGAC 17400
GACCAACTAT TGGCAGCAAA CATTACGTCA GCGCCAGAAA TAATGGCAAC TGTTGACGTA 17460
TTTTTAATTA GGTTCACTAC TTGATTGGTT AAAGGCGGTA ACATAATCCG CCATGCTTGT 17520
GGTAAAATCA CATACCCjCAT CGTTTGACCG TACGTTAATC CTTGGGAGTA AGCCGCTTCA 17580
AATTGCCCTC TTGGGACAGC TCCGATTCCT GAGCGAACCA CTTCAGAAAT GTAGGCGCCA 17640
TGATAGAGGC CTACACACAG AATGGCAATC GTAGTAATAG AAAATGAAAG CAGCGGGCTT 17700
AGCAGAGGAA AGCCATAATA GACTACGATA AATTGAATCA ACAAAGGTGT GTTTTGAAAA 17760
AATTCTACAT AGACACGACT AATTGCGTGT AAAAGTTTAC TTTTGGTCGC TGATAAACTG 17820
CCAAAGAAAA TGCCTAAAGC AAAGGCAACT AAGATCGCTC CTAAAGAAAG CAGAATAGTA 17880
TAGAGAAATG CCTGACCAAA AATTCGCCAA TCTTGAAATA AAGCCTCCCA ACGATATAAG 17940
GCAAATGGCC CTTCTTCAAG CCAGGAAATA TTGGCTAACA AAAACATAAA AATCCCTCCT 18000
AAAATTTTTG ATTCTTCAAG TCTTAGGAAT GATTAATTTT TTAAGCCCCA GTCTTGATAA 18060
ATTTTTTCTA GTGTGCCGTC TTTTTCCCAT TTTTCAATTA AACCATTTAG ATAATCATTT 18120
AATTCAGTAT TTGCTTTCTT TGTGGTAATT CCATATTCTT GTGGCGAAAA ACCATCTTTT 18180
AAAATTTCTG TCTTGCCATC CACATAGCCA GATAGGATTG ATTTATCCAC TGAAAATGCA 18240
TCAATTCGTT TAGAAGTTAA CGCTGTTTTT AATTCTGGAT AGGAACCTAA TTCTTGATAA 18300
GTAAATTTGA TTCCTTCCTC TTGCGCTTTT TTCTCTAAGT TTTCTTTTGT CGTAGCAGAT 18360
TGTGCAACGC CGATTGTTTT TCCGTCTAAA TCTTTCATGC TGGAGAAGTT ATCCTCTTTG 18420
CGCACAAGGA AACCAATCTC ATCTGTATAA TAAGGAGTTG TAAAATTATA AGTCTTTTTA 18480

CGTTCATCAG TAATTGTAAA TGTAQCAATT ACCATATCTA GTTCACCATT GTCTAAAAGG 18540

GGACCGCGCG TTTTCGCTGT CACGCCCACA AATTCCACAT TCTCTTCGCT GCCGGTAATT 18600 

TCTTTCGCAA TCAAGCGTGC AATATCTgGT TCCATTCCTT CAtTTTTGTn TGATCAGGAT 18660 

TCATATAGCC AAAATTAGGA ATGCTTCTTT GACACCTACT TnAATnTnCC CTGCTTTTTT 18720 

TAAT 18724 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

AAAAATTACG TGATTCGCTT AGCCTGACCA GACTTTTGTC TAAGTCTTAA AACTTGATAA 60 

GTAATAAGAG GTCGGGACAG AAGTGCTTAA CTCCGAGAAC AAAGTAGGTA CTGTTCAACA 120 

TCTATTTGCT TTGCAAATAG TGTTTCTCAG CAATAAGAAG AAATTTCCGA AAATTGTCAT 180 

AAAACACAAC GAGGAACGAG TTGATGTTGC ATGATACACT ACACTCCGTT TCGTTCGCAT 240

CAATTTCTAA GCATGGAACA TGCTAGAATT GAAGGACCCG GTGTTCTTTA ATTTTTGGAG 300 

ACCAAGTCCT TCAATTCTTA GTGATTTATC ACTTAGAGAT TGATGAGATC GGAGCTCTGC 360 

GAAGTAGGTA TTGTTCATCA TCGAATCACT ACGTTCTTCG TGATTCTCAA CAATTTCGGC 420 

TTATTTCCGA AGGACCTTCA CCAAGTAGGT ACGATGATTT CATCTGTTCA TGCTTCACAG 480

TGAAACATCG CAACTCTTAG AGCTGTAGCT CTTTTGAAGG CTGTACTGTG ACGAGACGCA 540

TGTCGAGAAA ACACCTTAGA AGTTGTTGAG ATCGGAGCAA CGCGTAGTGT TGCTTCTGTC 600 

CCCGCCGTTT ATCAATTTTT AAGCCTGGAA CAAAAATCCA AAGTGATTTT TGTTCCAGGC 660 

tCTTACTTTC ATACTAATCT TAGGTCAGGT TGTCATCAAG TTTGCCAATA ATAGACTTTT 720 

ACAAGCGCGC CTTTTTTGAG TAGTACACCA GCTGGTTGTT CTAGCAGGCA ATTTGTTTCT 780 

AAAGTATTTG ATAAATTGGA TGAAAGATGA CTTTTTGTCA CTAAATTTAC TGTGCCATCT 840

ATTTCTTTCG CTCGGTAAAA TCGACGTAGT CTACTAGGGG TTAATTCATG CGCTAAAATT 900 

ACTGTTCGTT CCTTGAGATT TAACTCGACA CAATTAAAAA AAGTAGCTAA TAATGTCCAG 960 

TAAAAAAGAT GTAAATTAAC CATTGCCGCA AACGGATTGC CTGAAACGCT TAAAATGAGT 1020 

GTTTCTTGAA ATTGACTTCC CATAACTGGG GTTCCAGGTT TCATATTTAC AAAATGAAAC 1080 

AACTCTGTTC CTCCGATACG TTTAATTGCT AAAGGTAAAT TGTCCTTTTC ACCTACCGAA 1140 

ACACCACCAC AAGTAATTAA TACATCAAGT TTTGGTGAAA TCTCAATCAG CCTCTTCGCT 1200 

AATTCTTCAG GGTCATCTGA ACAATGGTCT AGGTGGATTA CTTCATTACC ACTCGCTTTG 1260 

ACAAAAGCTG CTAACATGTA GAGATTACTA TTATAAATTT TCCCAACTGA CAGTGGTTCT 1320 
TCCAAAGAAA CTAACTCACT ACCGGTCGCT AAAATCCCCA CCTTCATGCG CTTCAAAACA 1380
ACAACTTTAT CAATCCCTTG CATTGCCAAC AAACCAATCA TGCGACTATT AATCTGTTGA 1440
TGTTTTGCTA AAATTTTTTG CTGAAAAGCA ATATCTTCTC CAATTTCCGC ATAGTTTTGG 1500
CCCGCAGTGA CAGAGCGAAA AATTTTAACA TGGTCTTGCC CAAAATCCGT CCATTCTTGT 1560
GGCACTACAG CATCAAAATC GGTTGGAATA AAAGCACCTG TTGTTATTTT AATCGCACAG 1620
TTATTCGTCT GATTACAATA AAACATGGAA TCCCCTGCCA CTAAAGAACC GACGACTTTT 1680
AAACAACTTG GGCTTTCAGC AGTCGCACCT CGTGTTTCTA CTGCACGAAC GGCATACCCA 1740
TCCATCCCGG CACGAGGAAA ATGAGGAATC GCTGTCTCTG CAAAAATGGT TTCTCCGCTA 1800
ATCCGTCCCA AGGAATCAAG CAGCGCCACT TCTTCTGTTT CACATTGACT CTTCATATGT 1860
TGGGTCATTT TCTCTCGTGC TTCTTCCAAC TCGATCATGT TTTAACCATC CTTTTTTATT 1920
TATTCACTGG TTTAATCGCA ACGGCACAAA CTTTATATTC AGGAATTGTG GCGATATCAT 1980
CAAATACAGC ATTGGTGATT TCGTTAACAT TCCCATCTGG AAAGTGGAAT GTCATAAAGA 2040
CTTCTTGCGG AAATACTCGA TTTCCGACAG CCGCATAGGT TTCGATTTTG CCACGTCGAG 2100
AATGAACTTC AACTTTGTCA CCTTCTTGGA TTCCCAGAGC TTGTGCATCG ACTGCATTTA 2160
TCTCAATATA AGAGTGATTC GCTATTTGGT TAATCCCTTC CGTTCGTCCT GTCATTGCTC 2220
GGGTATTGTA ATGGTAAAGC ATTCGGCCTG TCGACATTAA ATACGGGTAC TCTTCATCTG 2280
GTAATTCTTG TGCTTGCTTA TACGGAATTG CTTTAAATAA ACCTTTCCCA CGAGCAAAAC 2340
TACCCACATG CATAATCGGC GTACCTGGAT CTGTCAAGCT CCGACAAGGC CATTGTAAGC 2400
TCTCCTTCTC TAATCGTTCA TAATTAATTC CACCAAACGA AGGCGTCaCA GCTGAAATTT 2460
CTTCCATAAT TTCTTTTGCA GATTCATACG CACAAGGATA GCCCATTCGT GTCATGACTT 2520
CACAAAAAAT TTCATAGTCT TCTCGCGCTT GTCCTCTTGG TTCAACAGCT TTACGGACTC 2580
TTTGCACGCG CCGTTCTGTA TTTGTAAATG TCCCGTCTTT TTCTGCATAA CTGATGCCTG 2640
GTAAAACGAC ATCTGcATAA GCAGCCGTTT CGGTCATAAA TAATTCTTGA ACAACTAAAA 2700
AGTCnTAAAC TTTCTAAAGC TTGACGAACA TGTCCTG v TGT CTGGATCCGT AACAATTGGA 2760
TCTTCTCCAA AAATATAAAG GCCTTTGACG CTTCCTGCCG TTGCTGCTGG TAAGACTTTA 2820
GTAGAAGTTA AACCTGTGTT TCGATTTAAC GGCACATGCC AAGCTTTTTC AAATTTGTCG 2880
ATCACTTCTG GATTATTGAC TTTTTGATAG CCTGGGAAAT CATAAGGCAT ACAGCCCATA 2940
TCGCAAGCAC CTTGAACATT ATTTTGACCT CGCAAAGGAT TGACACCACA GCCTGGTTTT 3000
CCAACTTTAC CAACCAACAT TGCCAAATTA GACATACTCA TAACACCCTC AGTTCCTGTC 3060
GAATGTTCCG TTACACCTAA ACAATAAATG ATTGGTGCTT TCTCTGCTTT TGCATACATG 3120
CGTGCTGCTT GAATTAAATC TTCTGGATGA ATATGACAAA TTTCTGCGAC TTTTTCCGGC 3180
GTATAATCAG CCACCATTTT TTCTAAATCT AAAAATCCTT CTGTCCGTTC TTCAATAAAA 3240
TGGCGATCTG CTAAACCTTC TTTCAAAATC ACATGCATCA TGCCATTAGC AAATGCGACA 3300
TTCGTTCCCG CCTGAACTTG TAAATGCAGC GCACTATCCT TCACTAAGTT AATTTTCCGC
GGATCAACAA CAACAACTTG CGTTCCTCTT TGGATGGCTT GGCGAATTTG CGCACCAATA 
ACTGGATGAG CTTCTTCTGG ATTGGAACCT ACTAATAAAA TCATGTCGAC ATCTTCAGTG 
ATATCAGCAA TAGGATTAGT CATCGCACCA GAGCCTAAGG TTTGGGCTAA ACGATGGACC 
GAAGCCGAAT GACAAACACG GGCACAGTTA TCAACATTAT TGGTACCAAA AGCAGCACGC 
ACCATTTTTT GAAAGACATA ATTATCTTCA TTTGTCGCTC TAGAGCACGA AAAACCAGCA 
AGAGCATCCG GACCGTTT < C TGCTTTAATT TCGTTAAATT TTGACGAAAC AAGCGTTAAG 
GCTcTTCCCA AGAAGCAGGT TCAAAAATGC CATTCCGTTa ATTAATGGCT CCGTTAAGCG 
ATCTCCTGAC CCAACAAATT TATATGAAGC AAATTTTCCT TTCACACAAA GTAAATTTTT 
ATTAGCTGGT CCATCCAGGG GTTCAACACC CACCAAACGA TTATTTTTAA CGAGTAAATT 
CATCTGACAG CCTGTCCCAC AATGCGGACA AGTTGTAGGA ATTTTCTGGG TTTCCCATTT 
GCGGTACTCT kTCGTATCTT TTGCTGTTAA CGCACCTGTA GGACAGGAAG AGACACAATT 
GCCACATGAT TCACAGATAG ATTGATCAAA CGCCTGCCCA TAACTCGGCA TCATTTTTGT 
TTCAAAGCCA CGATTAGCAA TACTAAGTAC ATCTCTTCCC TGCCGCAATT GACAGACACG 
GGCACAACGA CGACACATGA TACACTTTTC TGGATCATAA GAGAAAAATG GATTACTGGT 
ATCTTCTTGA TGGCAAGGCA TTCTTTTTCC CTCGGTAAAG CTAGTGACAT CAATCCCATA 
ATCTAGCGCA TACTGTTGTA ATTGACAATC CCCGTTTTTT CCACAAGAAA AACATTCTAA 
TTTATGATTA CTCAATAATA AATCAAGAAT AAAACGTCTG GAATCAGCAA CTTTTTCAGA 
ATGTGTTGCC ACAACCATGT CCTCTTGACA ATGTGCCGTA CAAGCAGTTG TTAAGCCACC 
ACGTCGGCCA CCTTCCACCT CAACCACGCA CATCCGACAA GAACCATCAG GTGCTAATTC 
CTTTAAATGA CAGAGCGTTG GGATTTCAAC ACCTAGCCCT TTCGCTGCTT CGAGAATGGT 
AGTCCCTTTA GGCACAGTTA CTTCTTGGTT ATCAATAGAC AACGTAACTG TTTGCGTATG 
CAACTTTGTT TTCATTTTTA CCAACCTCCT TTGCTGGTGG CATAGATTTC TGTTTCAGCA 
GTCGTTTCAA ATTCTTCTGG AAACAAAGCC AATGCACTCT TCATTGGATT tGCGACGGAT 
TGGCCCAAAC CACAAGCTGA TAATCGCGTG ACATGTgTGA GCATCTTTTc cAaTCGCGGC 
AAATCACTAG CTGGTTGCCT CGTTCCGATk GAATTTACTC AATAATTCCA AAATCCTAGT 
TGTTCCCAAT CGACAAGGGG TGCATTTACC ACAGGACTCA TGAGCAAAAA AGGCCGCCAC 
GTGTACTAAA TAGTCCACAA CATTAACTGA ATCATCCATT ACTACAATTG CACCAGAGCC 
GACAGATAAG TCATGGGCCC ATAAATCTTC GTAGGAATAC AAACAATCCC TTAACGTTGC 
GACAGCACCA ATTGGACCGG ATTGTCCACC GAAGTGAATA AATTTCAATG CTCGACCAGT 
TGCAGAGCCT CCGCCGTATT GCTCGCCATA AATTATTTCT TCTAATGGTG TTCCAAGGTT 
GACCTCGTAT AAGCCCCGAT TTTTAATATG TCCAGACAAA CAAATTAGCT TTGTTCCGCC 
ACCATCGGCA GTACCCATCT CTAAAAAGGC TTGTCCACCT TCTCTTAAAA TTACAGGAAC 
ACTTGCAAAT GATTCAACAT TATTTACTAA TGTTGGTTGC AAATATAAAC CAACATCTGC 5340
TAAGTGTGGT GGTTTGACAC GCGGCCGCCC CGTTTTTCCT TCAATTGAAT TTAAAAGTGC 5400
TGAATTTTCC CCACAAATAT AGGCGCCAGc ACCAGAAATA ATCGTAATAT CATAATTGAA 5460
TCCTTCAATT CCTAAAATGT TTTCCCCTAA AAAGCCAGCT TGTCGTGCAT TATCGAGTGC 5520
TTCTTGAAAT GTTTTTTGGA TTCTGCGATA TTCTCCTCGC ATATAAATAT AACCTGCTTT 5580
TGCTGAAAAT AAATAGCCAG CAATAATCAT TCCTTCAATC ACACTCAATG GATCTTCAGA 5640
TAAAAGTACT TTATCCTTAA AGGTCCCTGG TTCGCCTTCA TCAGCATTAC AAACAATGTA 5700
CTTTGTGGtT CCTTTTGcAT GATAAAGATG GcGCCATTTT TTTCCTAAAG GaTAGGcTGc 5760
TCCACCGsGa CCTCTTAAGT GcGcAATATC CAGTTCGTTT AAAATTTCTT CATCCGATAG 5820
TGAAATTGCT CTTTTTAAAC CATCAAATCC ATCGTATTTA CAATATTCAG CAACATCgGT 5880
CGCAGATTGC ATTTTCCCCA TTCTCTCTAA TAACATTGGT TGGTTTCTCT TAAGCATTGA 5940
CTCACCTCyT ACAGTCCTTC ATAGCAATCA TTTTGTAAAT GTTGAAwTAA TTGATAAATT 6000
TTCTCTTCTG TTAGTTGACT AAAAACGGTG TCCTTAATTT TAATTACTGG ACCTAAATCA 6060
CAAGCCCCGA TACAAGGAAT ACTATGATAC ATAAATAAGC CATCTGGTGT CGGCTGATTC 6120
TCTGGCACTT CTAAAATGGT TTCTAAAACT TCGGCAACCA TCGCCCCACC AGTATAATGA 6180
CACGGTGTGC TGTTACAAAT CTTCAAGACA TACTTCGCTT GCGGTTCCGT CTTTAAAATG 6240
GCATAAAAAG TGACGATTTC ATAGACTCGC GCTTCCGTTA AATGTAGATG CTCTGCAACT 6300
AATTGAGCTG TCTCCTGATC GATATACCCT TCCTCTGAAG CAAATTGTAA TTCAATTAAA 6360
ATATTTAAAA TTCGTTGTGG ATCAGCATCG TTTTCTAGAA TAATCGCTTC TTTTTCAATT 6420
AAGCTTAATG CACACATTCC TTTTTATTCC CCCTTTTTAG TGAAACTTGA AGTTGTTCTC 6480
CTTGATAATA GATACGCAAT TTTTTTGGTA TTTCAGCAGA TAGTAAACAA AAATCGGCTT 6540
TATCTATGAC TTTTTTTGCT GATTCTTTTT GGCGATTCTG GTTATTCATC ATAAGAAATA 6600
GTCCTGGCTG CACAACATTA CGTAATGAAT TTGATTCACA AATGATTAAT GCTTGGGCCG 6660
GCACAGCTTT CAGAAATTGC TGAATGGCTT CCGCCAAATT TTCTTGAAAA ACTTTCAATA 6720
GAAAAACTTT TTGACAACCT GCAGCTAATA ATTTCATGGT ATCTTTATTG CCAATCGTCT 6780
CTTGCTCTTC TACTAATTCA TAACCAGCTG TAATACTGGT ACAAATTCCG CATCCTTTTG 6840
TGCCTCGTTG ACACACCCCT CGTTGACCAC TAATCGTAAT AATCTTCAAG CCATAAATTG 6900
GATAGACTTC TTGGTACTGT TTAATAAGGG CTTCGGCTAA TGTCGTTTTG CCACTATTCC 6960
GACCAGTAGA ACCAATTTGA ATCATGTTTG GTATCTCAAT TGCCATAGAG CTCCTCCGTC 7020
TGAATCGTTT TCTCAACTCT TGGTTTGACA GTTGCTGGTT TTGTCATAAA GATCAATGGT 7080
ACGGCCATAA AAAGACTGCC ACCTACTGCA TTGCCTAAAA ATACCCATAA TGTATTCTGC 7140
AAATAATCCC ACCAACTGAT TGTGGTACTT TGTGAAAAAA TTGCCGCTGG AATAATAAAT 7200
GCATTAGCGA CCACGTGTTG AAACCCACTT GGGGATTACA ATTCAAAATG AGAT 7254
(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7296 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



GGGGCCAATT 1 


TaGaAAATkG ACmATTGAAT AGAAAAGGaT GTGAACaAAT GGCaAAAGAa 


60 


ACAAATGTTA AgTTAGTCaC GGTGAGTGTT TTTGTGGCAA CATTTATGAC AGCCATTGAA 


120 


GGGACCA1 L\j 


TGTCTACTGC GATGCCAACG ATTGTCGGCT CGTTACATGG CATGGAAATT 


180 


Al (jAAL. .L 


TATTTTCAAT TTATTTATTA ACGAATGCGA TGTTAACACC GATTTATGGG 


240 


AAAb 1 1 OV-OVJ 


ATAAAATTGG TCGTAAACCT GTCTTCATGA TTGGCATTAT CATTTTTATT 


300 


I 1 LitjuC 1 1 


CGTTGTGTGG CTTTGCTCAA GATATGTTGA CTTTAATTAT TGCCCGCGCA 


360 


A i lLAALrtjxLj 


TGGGGGCAGG CGCAATTTTA CCAGTTGCGT TAACGATTAT TGCCGATATG 


420 


I Al AL/ii Hjo 


ArAAGCGAGC GAAAATTTTA GGTTtAAACA GTGCCGCCTG GGGAATTGCT 


480 


Ab iAl 1 1 1 


GTCCGTTAGC AGGTGGTTTT ATTGTAGATA CAGTCGGTTG GCATTGGATT


540 




ATGTTCCTAT TGGACTTGTT TTATTGGGCT TGATTAGTAT TTTCTTAGTT 


600 




GGGAACGGAC CAAGATGCCA ATGGATATTT TGGGCAGCGT TACTTTGATG 


660 




TAACGTTATt GCTAGGTTTT CAAATGATTA GCGATAATGG TTTTACATTA 


720 


AAPATTTG 


GTTGTTTAAG TTTGAGTGTG CTCTTTTTTG TAGCATTTGT GATGATAGAA 


780 


AAACGCGCGC AAGACCCAGT GATTGATTTG CATTTATTTA ATCAACCAAC GTTTGTTTTA 840
GTAAATCTTA vrrrnrrrrT TATTAftPGGT TTCTTAATGG GGATTGATGT CTACATTCCG 900
ATGTGGATGC AAGGTGTCTT AGGAAAAAGT GCAGGAATTG GTGGCTTAGT TTTAGCGCCC 960
TATGTCGTTA CTTTGGATGG CTGGATCATT TATAGCAAGT AGTTTTATGG AAAAATATGC 1020
TATGAAAAAA GTCTTAACGA TTGGGTTATC GATTCTATTA GTCGGGGCCA TCTTTTTAGT 1080
GGTAATGCCA ATGGCCGTTC CGTTTTGGCT TTTCTTTGTA GTGTCTTCTG TCTTAGGAGT 1140
TGGTTTTGGG ATTACTATCA CAACGACTAC GGTGACAGCA CAAAGTACAG TGGAGCCTGA 1200
AAAAATGGGG GTTGCAACAT CGTTTAATAC GTTGGTGCGT ACAATTGGGC aGACTGTGAT 1260
GGTGTCAATT TTTGGTGTGA TTTTAAATGc AGGAATGTTT GCGAAATTGG AAGCGAGCGC 1320
GTTAAACGTC GATGCAGATG TCATGAATCA ATTAGTGAAT CCACATACTG CaAATTaATT 1380
CCAGCTGCGT TGTTAAAACC ATTACGCGGT ATCCTCTATG CAGGTCTGCA TAATGTTTAC 1440
TTAGTCGGTG CGGGCTTAGT TGTTGTCGCT CyTTTGTTAA ATATTTTCGC AAAAGCGCAA 1500
CGAGCGAAGG TTTAGTGGTA CACTAATAGA AAGAGTATTG GAGGGAAAAG TAGATGGCCT 1560
TTTTACAAGC AAATATTTAT TCmAATGTAT TGGAAATGGA AGTGAATGTG AACGTAATTT 1620
TACCACAAGA AACGGTAAAA AAAGTGGGCA CAAGTACCCA AGCAGCATTA ACAGATATTC 1680
CCGTGTTGTA TTTATTACAT GGGATGGGAG GGAATCACAG TGTCTGGGCT CGCAGAACCT 1740
CTATTGAGCG GTATGTCGCT GATTTGGGGA TTGCTGTCAT TATGCCGTCC ACGGATTTAG 1800
CTTGGTATAC GGACACACAA TACGATATGA ACTATTGGAC GTTTATCTCA GAAGAATTAC 1860
CGAAAATTTG TCATCAACTC TTTCCACAGT TAAGTACAAA ACGAGAAAAA ACCTTTGCCG 1920
CTGGTTTGTC AATGGGTGGT TATGGCGCCT TGAAATTAGG ATTAGCGAAA CCAGAATCTT 1980
TTGCAGCCGT TGCTTCTTTG TCTGGGGCAG TCAGTTTATC ATCCACTAGT TTTGGGGAAT 2040
TATTAAAAGT CCGTAAGCGA AgcTATTGGG AAGGTATTTT TGGCCCGTTA GATCAAATTG 2100
AAGGTTCCAT TCACGATCCG CTGTATTTAT TACAACAATT AGTAGAAAGC CAAACAGAAA 2160
TGCCCCAATT TTACCTTTGT TGTGGGGAGC AAGATATGTT ACTTTCTGCT AATCAACAGA 2220
TGGCTCAAGC GTTGGAACAG GCACAGGCAA GTTACACTTT TGAAACTGGA CCAGGAGAAC 2280
ATGACTGGGT GTTCTGGGAT GAATGGATTC AAAAAGCCTT GGCTTGGCTA CCAATACCAA 2340
AATAGAGAAA GTAGTTTTCA ATATTTATTT TAAAAGTTAA ssCTTGGTCC GTCAGTCGAA 2400
ATGACTGACG GACCAAGGGc TTTTTTTATT TAGATGAAGA TGTGTTGTCT GCTTGCAACC 2460
CATCTAAGTA GGTTTTAACG GCTTGAATAA AGGCGGATTT TTGGTGGATG GAGTGTCGAA 2520
ATTGAATGCT TTCCTCCATG TACGTTAACT CATTGTCGGA TACATATTGT TTCATGTGCA 2580
ACACTTGTTC TTTTAAATAG TCATCTCCGT TCAACACTTG CGAAACGAGC TGTTTTTCCT 2640
CCGTTGCGTC CATAATCATC CCAAATAAAA GATGTTGTAA AGACGTTACT GCTAAATGAG 2700
TCATTTCTGG TGAAAAACCT GCTTCTCTTA AAATGCCCAT CATTTGATTC AAATGGCGTA 2760
ATCGTTGGGG ATAGGCAGGG ACGGTTTGGA TTTCGATAGC TACGGCACAG GGAAATTGTT 2820
GATAGAGATC ATAATAGTTT TCCATGAAGG CTAGTAAGTC AGAATACCAT TCACCACATA 2880
ATGCAGGTTC TTGAAAGTGC TCTTCAATCG CTTCAGCCAT ACTTTGAAGT AAGGCTTGCT 2940
TATTTTTAAA ATACCmATaA ATTGcGGGCG CCTGTACACC AAGCTGTTTG GCGACTTTAC 3000
GCaTAGAAAG TTGtTCTAAA GTGGGACTTT TTTCTAACAA AGAAAATGCG GCTGCAATAA 3060
TCGTGTCTTT TGATAATTTC GGTTCCATGA TTTTAGCfCC TATTAAACGT TATCTTCGTA 3120
AGTTTATCAA AAAGAAGGTA CACTTTATGA TAGGAAAAAC TTGTGAAAAA TTAACTTGTT 3180
CGCTTGTATT TTACCACTAA AAATGGTAAA TTATGAATCG GCCTTATTTT ACAGTGTTAA 3240
ATTGGTGAAG TCAATGAGCA AATTGATTTT TTTGTAATGG GGAAGAATTC GCAGACCAAC 3300
ACTGAAGAAG CAGGGAACTA TTTTAAGAAA GTAGGGATAA GATGGACCTT ATTATTCAAC 3360
ACGCCAAAAA ATACAAAGGC TCGGTAGTTA TTGCCTTGTT GGCGGTGATC GTAATGGTTG 3420
TCTCAGCACT TTGGCAACCA AAATTGTTAC AACAAGTGTT GGAAGCAATT ATGAATGACG 3480
ATAGCGACAA AATGAAAAAC TTAGGGATTC AATTAATTGC CATTGCAGGA TTAGGACTCG 3540
TTGCAGGGGT TATTAATACA ATTTTTTCTG CCAAAGTTGC CCAAGGTGTC AGTGCTGACA 3600
TTCGTGAAGC AACTTTCCGC AAAATCCAAA CCTTTTCATT TGGCAATATC GAAAAATTTT 3660

CAGCAGGTAA CTTAGTTGTT CGTTTAACAA ATGACGTTAC TCAAATTCAA AATGTCATTA 3720 

TGATTGCGTT GCAGACACTT TTTAGAATTC CTTTTTTATT TATCGGGAGT TTTATTTTAG 3780 

CGATGCTAAC CTTGCCACAA CTATGGTGGG TGATTGTTGC ACTTGTCATC GCAGTTATTT 3840 

TGATTTCTAT GCTGTCCTTT TCGCAAATGG GCAAGCATTt CATGATTATT CAAAACTTGA 3900

TTGATAAGAT TAATGGTATT GCAAAAGAAA ATTTGTTAGG CATTCGTGTC GTCAAATCAT 3960

TTGTTCAAGA AAAAAATCAA TTGTCACGTT TTACAAAAGT GaGTGAAGAA TTaAcCACCC 4020

ATAACTTAAT CGTAGGTAGC TTATTTGCTG TGATGATTCC AGCGTTTATG TTGGTGGCGA 4080 

ACTTAGCTGT TGTAGGCTCA ATTTTCTTTG TGAGTAACTT AGTCAAGGAC GATCCAACTT 4140 

TAATCGGTGG TGTCGCTTCC TTTATGAACT ATTTAATGCA AATCATGATG GCGATTATTA 4200

TCGGCGGAAT GATGATGATG ATGACTTCTC GTGCCGCTGT TTCCATTAAG CGGATTAAAG 4260 

AAGTGATGGA AACAGAACCA GATGTTACCT ACAAAAAAGT CCCAGAACAA GAGTTAATTG 4320

GTAGTGTTGA GTTTGACCAC GTATCTTTCC GTTATCCTGG GGATGAAGAA GATACCTTGA 4380

AAGACATCTC TTTCTCTATT CAGCCAGGTG AAATGATTGG GATTGTGGGC GCAACTGGCG 4440

CTGGTAAATC TACATTGGCA CAATTAATTC CACGTTTGTT TGATCCGACA GAAGGAAAGA 4500

TTGAAGTTGG TGGTGTCGAT TTACGAGAAG TGAACGAACA CAGTTTACGA AAAACAGTTT 4560

CCTTTGTGTT GCAAAAAGCA ATCCTGTTCT CTGGAACGAT TGCGCAAAAT CTACGCCATG 4620

GTAAACGTGA CGCTTCTGAA GCGGACATGG AACGGGCTTC AGGCATCGCC CAAGCCAAGG 4680

AATTTATTGA AAAATTAGCA GAAGGTTATG ATGCACCCGT CGAAGAACGG AGTAATAACT 4740

TTTCTGGTGG ACAAAAACAA CGGTTATCAA TTACAAGAGG TGTGATTGGT GAACCTAAAA 4800 

TTTTGATTTT AGATGATAGC ACCAGCGCGT TGGaTGCTCG TTCGGAGCGG TTAGTCCGAG 4860 

AAGCGTTGGA TAAAGAGTTA AAAGAGACTA CCACGATCGT TATTGCCCAA AAAATTTCTT 4920

CTGTTGTCCA CGCAGATCGT ATTTTAGTAT TGGATAATGG ACGTTTAGTC GGGGAAGGCA 4980

CGCATGAAGA ATTAGCAGCT ACGAACCCCG TGTATCAAGA AATTTACGAA ACTCAAAAAG 5040
GAAAGGAGGA AGCTTAATGA CTGATTTAAT AAAAGCTAGT AAATTTTTCT ATCATTATTT 5100 
GAAACGTTAC AAAGTTTCGT TCCTCTTTAT TTTCTTAGCA ATTTTCGCAG CGACTTATTT 5160 
ACAAGTCAAA GCGCCGCAAT TCGTTGGGGA AGCTATTCAG GAATTAGCGA AATATGCGGT 5220 
TAATGTGATG CAAGGAAAAG ACGATAAAAG TGCGTTCGTT TCTGTCATTT GGAAACTACT 5280 
CATTTTTTAT GTCTTAACTA GTGCCGCTAG TTTCATTTAT AGTATTCTCT TTACACAAGT 5340

CGTGGGGAAA TCGACGAACC GCATGCGGAT TGGTTTGTTT AACAAATTAG AAAAATTGAC 5400 
GATTCGTTTC TTTGATTCTC ATCAAGATGG TGAAATTTTA AGTCGTTTTA CTAGTGACTT 5460

AGACAACATC CAAAATAGCT TAAACCAAGC GTTGCTACAA GTATTAACCA ATATTGCCTT 5520 
ATTGGTTGGT GTCTTAATCA TGATGTTCCG TCAAAATGTG GAACTGGCAT GGGCCACAAT 5580

TGCTTCTACG CCGATTGCGA TTTTAATTGC GGTCTTTGTG ATTAGCAAGG CGCGCAAATA 5640

TGTCGATTTA CAGCAAGATG AAGTGGGTAA ATTAAATGGC TATATGGATG AAAAAATTAG 5700

TGGGCAACGT GTGATTATCA CTAATGGCTT ACAAGAAGAA ACCATTGACG GCTTTTTAGA 5760

GCAAAATGAA AAAGTTCGTG CCGCTACGTA TAAAGGTCAA GTGTATTCAG GATTACTTTT 5820

CCCAATGATG CAAGGAP TGT CATTAGTCAA TACGGCGATT GTTATTTTCT TTGGTGGTTG 5880

GTtAGCAATC AATGGCTCTG TTGATCGTGC CGCTGCGCTA GGTTTAGTCG TTATGTTTGT 5940

CCAATATTCT CAACAATATT ATCAACCATT AATGCAAATT TCTTCAGGTT ATAGCATGAT 6000

TCAATTAGCT GTTACAGGCG CTCGTCGTTT AAATGAGATG TTTGATGAAC CAGATGAAAT 6060

TCGTCCCGAA AATGGCGAAA AATTAGAAGa GATTAACAAA GCTGTGGCTT TAAATCAtGT 6120

TGTTTTTGGC TATAATCCAG AAACGCCTGT GTTGAAAGAT GTTTCTATTC ATGTGGATAA 6180

AGGTGAAATG GTCGCTTTAG TTGGACCGAC AGGCTCAGGA AAAACAACGA TTATGAATTT 6240

AATGAATCGC TTTTATGATG TGAATGAAGG CGCCGTGACG TTTGATGGTG TGGACATTCG 6300

TGAAATGGAT TTAGATAGCT TGCGTTCACA TGTAGGCATT GTTTTGCAAG AGTCCGTTCT 6360

ATTTTCAGGA ACAATTCGTG AGAATATTGC TTTTGGTAAG CCAGAGGCAA CGGATGAAGA 6420

AATTGTTCAA GCAGCGAAAC AAGCGAATAT CCATGAATTT ATTGTAAATC TTGAGCAAGG 6480

CTACGATACT GAGATTACGG AAGAGAATAA CCTTTTCAGT ACGGGGCAAA AGCAATTAGT 6540

AAGTATTGCA CGAACGATTA TTACCAATCC AGAATTATTA ATTTTGGATG AAGCAACAAG 6600

TAATGTTGAT ACAGTAACAG AAGCTAAAAT CCAAAAAGCG ATGGAtGAAG CAATTAAAGG 6660

GCGCACCAGT TTTGTGATTG CCCATCGTTT GAAAACGATT CTAAATGCGG ATCGGATTAT 6720

TGTTTTACGA GACGGTGAAG TCATTGAAgA AGGCAACCAC CATGAACTAG TTGAACAAGA 6780

TGGATTCTAT GCGGAACTTT ATAAAAtCAA TTTGTTTTTG AATAAGTCGA AAAAGAAGCA 6840

AAATGAGTCA TTCTCATTTG CTTCTTTTTT TGCTATGCTA AATAGAAAAA GGAGAAATGT 6900

CAATGGAAAA AATGAGGAAT ATACGGCAAT TACGGCGCAC GTTTTTGTTG CTAAATACTT 6960

TATTTGTACT GAGTGTATTC AGAGAGAATC GATGGACAAA ATCGGCACTG CTTTTTTTGG 7020

TTGTTGGCTA ATAGTTTTTT ATATGGACTG CAGTAGTTGA TTTATCTTGG GGGCGAAAAA 7080

GGCTGGCTGA TACATTTGAA GCGGTGGTAT TATTGCGCGG TTCTTCTCCA AATAGCAGGC 7140

ACATTGATTG TATATTTTTC GTTTTTTGTA TAAAGGTCTC TCGTTTCTTG CCGGAGAATA 7200

TGCTAAAATA AAAAGCAGCA GGGGGCGAGA t AAACCATGGA AAAGCCnAAT nAACGCCnAG 7260

CTGGATTTAG AACGTACACG GGCCAATTAA TTTTAA 7296




(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9927 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:



GATTAAATGT CCTTGCCTCT GTTTTTGAAA ATCCTAAATC AAGCCTTTCA CACCTATATC 60

TTTTCGGTAG TACATTCCAG GAAATGAGTA ATTATCCAAT GTTTGATAAA CTTTTTTGCG 120

GGCTTCTTCA AGAGTCGAAG CAGTTGCTTC TGCTAAAAAG ATTCGTCCGC CAGCCGAAAC 180

CAGAGCCTGT TCTGAACGAG TTACCCCTGC ATAATAGATG GTTGCACCAG ACATTTCTGG 240

CAATGGTAGA TTTGTCTTAT ATTGATTAGG ATAGCCTTCT GCTGCTACAA CAACACCAAC 300

ACTCGCTAAC TCATTCCATG CAATCGTTGG TTGCTCTTTC ACCAGCAATG AAGAAATTAT 360

TTCTGAAAGG GGACTCTCTA AACGTGCCAA CACAACTTGT GTTTCTGGAT CCCCAAATCG 420

AGCATTAAAT TCAATAACCT TGGGGCCAGT TGCTGTTTGG aTTAAACCAG CATATAAAAT 480

CCCTTGAAAC GGACGTTTTT CAGCCACCAT TCCTTTAGCA GTCGGTTTCA GGACTTGGCT 540

AATGGCTTGC TGAACCGCTT CTTCTGGGAC CTGAGGCACT GGTGTATAAG CGCCCATCCC 600

CCCTGTATTC GGACCAGCAT CCCCATCGAA AATCCGCTTA TGATCTTGAG CAATCACCAT 660

CGGATATACC TTCTCATTAG AAACAAAAGC CATTAAGGAA AATTCTTCTC CTTGCAAAAA 720

TTCTTCGATT ACAACCGTTT GACCGCTTGC TCCAAATTGT TGGTCCTCCA TCATCGCAAC 780

TAGCGCCCGT TTTGCGGCTG CTTTTGTTTC GGCAACTACT ACTCCTTTTC CAGCTGCCAA 840

ACCATCTGCC TTAATAACAA TTGGTACTCC TTGCTGTTCA ATATATTCTA AGGCAGGTGC 900

CAAACGTCGA AAGACCCGAT ATTCCGCCGT TGGAATCTGG TATTTTTGCA TAAAATCTTT 960

CGCAAAACTT TTTGAACCTT CAATTATTGC AGCTGCTTTC GTCGGGCCAA AAATCAGTAA 1020

TCCCGCTGCT TGAAAGGCAT CTACAATTCC TTCAATTAAT GGCGTTTCCG GTCCGACGAA 1080

TGTCCAATCA ATCTGATTAT CCTTCGCAAA TTGAATTAGT TCCGTTTGCT GCTCTTCACC 1140

GATAGGTACA GTCCGAATAT TATCACCACT CATACCAGGA TTACCTGGTG CACAATAAAC 1200

AGTCTTGACT TTGTCTTCTT GCTGTAATTT TTTGGCGATT GCATGTTCAC GACCACCACG 1260

CCCAATCACT AATATATTCA TCGGTTTCTC CTTTAAAAAT TAATGTTTGA AATGACGAGT 1320

CCCAGTAAAA AtCATCGCAA TACCATAATG ATTCGCTAGA TCGATTGATG CTTGGTCTTT 1380

AATGCTCCCG CCTGGCTGAA TAATTGCTTG AATGCCaTGT TGTGCAGCAT ATTCCACGCT 1440

ATCTGCCATG GGGAAAAAGG CATCACTTGC TAAGACGGCC CCTTGTAATT TTCCAGCTGC 1500

TTGGGCTTGA TCAACTGCAA TTTTTACTGA ACCAATTCGA TTCATTTGAC CAGCGCCAAT 1560

CCCCACCGTT TGGCTTTCAT TGGCTAAAAC AATAGCGTTT GATTTTACAT GCTTGACTGC 1620

TTTCCAAGCA AAGTTCAATG CGTCTCTTTC GGCTGGTGTG GGTTGGCGCT CTGTAACAAC 1680

TTGCCATGCT TGGTCATTTT CAACAACTGT ATCCTGTTCT TGAATTAATA ACCCGCCCAT 1740

AACAGAAACT AATTCCACCG CTTTTTTTTC TTGAGCATGA AAATCTAACG TTAACAAACG 1800

AAGATTCTTT TTCTGAGATA ACAGTGCCAA AGCTTCTTCC TCAAAACTAG GCGCAATAAT 1860

TATCTCTAAA AAGATTTGAT GCATTTCAGC CGCAGTCGCT TGGTCCACTG GACGATTCAA 1920

AACAACAATG CCACCAA^AA TTGAAACTGG ATCTGCTTCA AATGCCTGAC GATAGGCTGC 1980

CAAAATCGTT TTTCCTGTTC CAATTCCACA TGGGTTCATA TGTTTAACAG CGACGACCGT 2040

AGGTTCTGTA AACTCGCTAG CAATCCGGAG TGCTGCATCA GCATCTCGAA TATTGTTATA 2100

TGACAGTTCT TTGCCATGAA GT7GTCGAGC GGAAGCAATG GATAATGATA CAGGTACAAC 2160

GGATTGATAA AAGGTGGCTT GT7GATGGCT ATTTTCGCCA TAACGCAATG TTTGCTTACG 2220

TTCATAAGTA AGCGTCTGTT TC7CTGGTTC AGTTTCACCC ACTTGCGTCG TTAAATAGTC 2280

AGCGATAAGC GCGTCATAAG CAGCGGTATG ACGAAAGACT TTTGCTGCTA ATCTTTTTCT 2340

AGTGGCTAAC TGGCTAGTTC CAGTGCTGCT AATTTCACTT AATACCTGTT CATAATCACT 2400

AGGATCAACC ACTGCTGTTA CG7CTTGATA ATTTTTTGCC GCTGAACGTA ACATGCTAGG 2460

ACCGCCAATA TCAATCAT7T CAATCGCTTC TGCTTCAGAG ATCGCTGGTT TTAAAATTGT 2520

TTCTTTAAAT GGATAAAGAT TCACTACTAC AAGGTCaATA AAGTGAATGC CTTGTTCAGT 2580

CAGTGCTTTT TGATGCGCTT GGTTTCCCCT TTTTGCCAAA AGTCCGCCAT GGATATTCGG 2640

ATGTAACGTT TTGACACGGC CATCTAACAT TTCTGGAAAG CCAGTGATGT CGTCAATAGC 2700

AAGTGTCGGT ACCCCCGC7T CTTCCAAGAC AGTCCTTGTT CCCCCAGTGG AGATAATTTC 2760

AAAGCCATTT GCAACTAAGC CTGCTGCAAA TGTTGTCACT CCTGTTTTGT CTGAAACGCT 2820

AATCAATGCT CTTTT3GTCA TTTCATTGTC TCCTTTACAA TTTGTGAAAT TATTTTTGGG 2880

TACCATTCGT GCTCAAGAGC ATGAATTTTC TCTGCTAATG TGTCCACTGT ATCTTCGGTA 2940

TCAATCTTGG TGGTTGTTTG GAAAATAATC GGTCCTGTAT CAACCCCAGA GTCAACaTAA 3000

TGGATAGTGA TTCCT3TTAT TTTCACTCCA TAATGAAATG CTTCTTCAAT TCCgTGAAGA 3060

CCTGGAAAAC TAGGTAACAG AGAAGGATGA ATATTGACGA TTCTCTTTGG ATAAGCCTCC 3120

AGTAAAGTCT TGCCGATAAT TCTCAAATAA CCAGCTAACA CGATTAAATC GATCTGATGC 3180

TCTTTTAAAT GTTTCAATAC TTGTTCCTCA TATTGCCCAC GAGATGAAAA ATCAGATGGC 3240

GAAAAGCAAA CAACGGGAAT CTTGCGTTTT TGGGCACGGG TCAATACATA GGCTTCAGGC 3300

TGGTCACAAA AGACTAAGCT CAAATGCCCT GCGATTTTTT TTTGGGAAAA AGCTGCAGCA 3360

ATCGCCTCAA AATTGCTACC ATTGCCAGAT GCAAAAACAG CTATCTTCAT AAGGCTTCCT 3420

TAAAAATAAC GGCTTTCGTT TTCCTAGGTA CCAAACGACC AATTTCAAAA GCTTCTGGCA 3480

AAAGCTCCTG AACTTTTTCA ACATGCTCTG GACTCACCGC TAAAATCATC CCTAAGCCCA 3540

TATTGAAAAT TTCATACATT TCTTCAGGTG GAATTTCACC GTATTTTTGA ATCAATTTAA 3600

AAATTGGTAG CACTGGCCAA CTATTTAATT GAATCTCTGC AGCTAATGCG GAAGAAAACA 3660

TTCTTGGTAG ATTTTCTAAA AACCCGCCGC CAGTAATATG CGCAGCCCCA TGCACGAGTC 3720

CTGCTTTTAA CAATGGCAAT AATTCCTTCA CATAAATTTT TGTTGGTGTT AATAATTCTT 3780

CCCCTAACGT TTTGGGTGCC AATTCTGGAA GCTTGTCCGT TGTTTTAAAC GAATGTTTCT 3840


TAAAAAAAAT 




AAAGAGTAAC 


CATTAGAATG 


AAGACCACTT 


GAAGGAAGAC 


3900 



CAATCAAGAA ATCGCCTTCT TTTACGTTAC TTGGAGTCAA GAGTTGACTT TTCTCAGCAA 3960

TACCCACGGC AAAACCAGCC ACGTCATAAG CATCTGCTTC ATACATATCA GGCATTTCAG 4020

CTGTTTCGCC CCCAATTAAG GCTGCATTCG CCTCACAACA GCCAGCCGCA ACACCCGCAA 4080

CAATTGCTTC AACTTTAGCT GGATTAACGG TCCCTAACGC TAAATAATCT AAAAAATATA 4140

ACGGTTCGGC GCCTTGTi.-CG ACGACATCAT TGACACACAT TGCTACACAA TCAATTCCGA 4200

TTGTCTCGTG TTTCTGTTCT TCAATTGCCA AGAGTAACTT TGTGCCTACG CCATCCGTTC 4260

CTGATACCAA AACTGGTTCT TTAAGCTTGT AACTACTTAA ATCAAAACAA CCACCAAAAC 4320

CACCAAGCAT GCCCAACGTT CCCGTCCGTT GTGTTTTTTG GCTATGCTTT TGAATTCGTT 4380

CTACTACTTC GTATCCCGCC TCAACATTTA CACCAGCTTT TGAATAGGCA TTTTCCATGT 4440

CAATTCCTCC TAGTTTCTTT TTTCAAGGAT GCTTGATATT GTTCTTCATA ATCATACAAA 4500

GGCGTTGGAT AGTCTCCATT AAAGTAAGCC ATACACAGCC CAGAATAAGG AGCATCATAA 4560

TTTAATTGAA TCCCATCAAT CAGCCCTGCT TCACTCAAAA AAGCCAGCGA ATCTGCGCCA 4620

ATCTGCTCTT TGATTTCTTC AATTGAATAG TTTGCCGCAA TTAGTTCGTC CCGCGTCTGT 4680

ATATCAATGC CATAAAAACA CGGATATTTT AATGGTGGTG AAGCAATTCT GACATGAACT 4740 

TCTTGCGCTT CTGCTTCTTT TAATAAATGA ATAATGCGCC GAATAGTGGT CCCGCGAACA 4800 

ATCGAATCAT CTACTAAAAT GACTTTTTTT CCTTTGACCA CGCCACGAAC TGCCGATAGT 4860

TTCATACGGA CACCTTGTTC TCTTAATTCT TGTGTCGGCT GGATGAATGT TCGAGCGACA 4920

TATTGATTTT TTACTAA^C TAGTTCGTAA GGTATCTGTG CTTCTTCTGC ATAGCCGCTA 4980

GCTGCAGACA ATGACGAGTT CGGTACACCA ATTACCATAT CAGCTGCGAC TGGACTTTCG 5040 

ATTGCCAAGT TCTTCCCCAT TCTTTTACGA GCTGTATGCA CATTAACTCC GGCAATATTT 5100 

GAATCTGGTC GTGCAAAATA AATATATTCC ATTGCACAAA TTGCCGGTTG GACTTCCGTA 5160 

GTGAAAACCT CTATTTTCAA GCCTTCATCA GAAATGATGA CTACTTCACC AGGCGCCACA 5220 

TCTTGAATAA AGCGAGCCCC AACTGTTTCT AATGCACACG TTTCGGATGC CACAACGTAC 5280 

GCCCCATTGT TCATTTGTCC GATGGAGAGA GGGCGAAAGC CGTTGGGATC CAGTGCAGCA 5340 

ATCATCATCG TTTCTGTTAA TAAAAGATAA GCAAAGCCAC CCTTAACCAA ATTCAAGCTC 5400 

TCTTTTAATT TATCGAGAAA CGAGGCAGCA GTGCTTTTTC GAATCAAGTG CATTAAAATT 5460 

TCAGTGTCTG AATTGGAATG AAAAATAGCG CCTTCTCTTT CTAGTTGTTT TCGTAAACGT 5520 

TTGGCATTTG TCAAATTGCC ATTGTGTGCT AaCCCAaTtG ACTATCATAA AATTTAAATA 5580 

AAAATGGCTG AATATTATCC ACGCTACCCG TACCAGCGGT TGCATAGCGT ACATGACCAA 5640

TAGCCGCCGT TCCTTCTAAC GCATTCAGTT GTTCTTGCTT TTGGAATACT TCTGCTAATA 5700 

AACCTAAATC ACGGTGGCCA TCTAATTTCC CAGAATTGTT CGCCACAATT CCCGCGCCTT 5760 

CTTGGCCTCG ATGCTGCAAG CTGTGTAAGC CAAAATAGGT CACTTGTGCC GCATCAGGAT 5820 

GACCCCAAAT ACCGAAAATA CCACACTCTT CATTTAAGCT TTTTGCTTCA TAAGACATGG 5880

AATCGCATCC TCCCATAAGG CTTTGGCTTC TTTTGTTGAG CAATTGATCA CATCATCTAA 5940

CGCATGAATG ACCAAACCAG TTTCAGTCAC TTTCCCAATA TGTTGAGCTT TTCTTCCGAT 6000 

AAGTGTTTCA AATGCTTCTT GATGCTGGGG ACTAACTGAA AGAATAAAAC GGGATTGCGT 6060 

CTCCGCAAAT AGATATTCTT TTTTTAATGG CAGTGTCACC TGTAACCCCA GTTCGTTAGC 6120 

AAAAGCCGAC TCGGCTAAAG CAACGGCAAC GCCACCTTCA GCGCAATCAT GCGCGCTAGC 6180 

AACTAGGCCT ACTTGAATTG CTTTTAAAAC TAATTCTTGG TTTGCTTTTT CCTCTTTTAA 6240 

ATCAAATGAG CGTAATTGCC CTTCAATTCG ACCCAGTTGT ATTTTTTGAA TTTCTGAGCC 6300 

GTTAAAATCT GCGTGGGTTT CACCGATGAG ATAAATTAAG TCATCAACTT GTTTAAACGC 6360 

TTGTGTGGTA ATTTGTGAAA CATCTTCAAT CACTCCCACC ATGCCAATCA TTGGCGTTGG 6420 

ATAAATGGCT TGACCATCTG TTTCATTGTA AAGAGAGACG TTCCCTGAAA TCACGGGTGT 6480 

TCCTAGCTGA CGGCAGGCGG CGGCGATTCC ATCCGCCGAC GTCCATAATT CCCAAAAACT 6540

TTCTGGTTTA TCTGGGGAAC CATAATTCAA GCAATCAGTG ATAGCCAAAG GTTGTGCTCC 6600 

ACTCGCTACA ATATTACGGG CTGCTTCTGC AACCGCAATT TGCCCACCAA TTTCAGGATT 6660 

TAAATACAAA TAACGGCTAT TACAATCTGT CGTCATCGCT AAGGCTTTAT TGGTGCCCCG 6720 

CACTCTTAAA ACAGCTGCAT CACTTCCTGG TTGCACCACT GTATTCGTTT GTACTTGACT 6780 

ATCATAGGTT TCGTAAACTG ATTTTTTCGA TGCAATCGTT GGCTGTTGCA ATAGGCGTTT 6840

TAATAATTCC GTACTATTCG TAATTTCTGG AATGAAGGGC GCTAATTTTT TAAAAGCACG 6900 

CATCCGCTCT GGTTCTTGAT AGGCTTTCTG GTAGACAGGG GCTTCTTCTG CCAAAGCATC 6960 

TACTGGTACA TCTGCGACTA TTTTGCCACC GTGAAAGACA CGATACTGTC CATCATCTGT 7020 

TACGGAGCCA ATCGCTACCG CTTCTAAGTC ATACGTTTGA AACAATTCAC AAACTTGCTG 7080 

TTCTGCTCCT TTTTCCACAC AGATTAACAT CCGTTCTTGG GACTCAGACA AGAGCATTTC 7140 

ATAGGGGCTC ATCTGCGTCT CTCTCTGCGG AACTTTATCT AGCTCTAGAA TCAAGCCTGA 7200 

AcCTGCTTTT GATGCCATTT CAGCACTTGA AGAAACCAAG CCAGCCGCCC CCATATCTTG 7260 

GATGCCGATT AAAATATCTT GATGATTTTT GATTAAATCA AGACAAGCTT CCAATAATAA 7320 

TTTTTCCATA AACGGATCAC CTACTTGTAC CGCAGATCGT TGCTGTTCTT CACCTTCACT 7380 

AAATTCTTCT GAAGCAAAGG TCGCACCATG AATTCCATCT CGTCCTGTTT TCGCACCTAC 7440

GTAAAGAATG CTATTTCCGA CGCCTTTTGC TTGGCCTTTT TGAATCTCAT CATGACGAAT 7500 

CAATCCCACA CACATAGCAT TAACAAGTGG GTTTCCTTGA TAACATGGCT CAAAAGCAAT 7560 

TTCGCCGCCA ACGGTTGGAA TACCAATACA ATTTCCATAA CCGCTAATCC CCGCCACAAC 7620

TTCTTGAAAC AAATAACGTG TTCGTGCATC CGTCAGTTCG CCAAATCTTA GCGAATCCAG 7680

TAAAGCAATC GGCCGCGCCC CCATACTGAA AATATCGCGA ATAATTCCAC CAACGCCTGT 7740

AGCTGCTCCT TCGTAAGGTT CTACTGCAGA AGGATGATTA TGGCTCTCTG CTTTAAAAAC 7800 

AACAGCTAAT CCGTCACCAA TATCGACAAT CCCAGCTCCT TCACCAGGGC CTTGAAGCAC 7860

TTGTGGCCCT TCCGTCGGAA ACTTTTTTAA AACGGGCTTG GAATTTTTGT AAGAGCAATG 7920

TTCACTCCAC ATCACGGAGA ATAAGCCAGT TTCTGTATAA TTAGGTtCTC TTtGCAAAAT 7980

GGTCTCAATC TGATGATATT CTTCATCTGT TAGTCCCCAT TCTGCATACA ACCGTGTTTC 8040

CTTGATTTCT TTTGGTGTTG GTTCATTCAT CATTGGCTTA ATTCCTTTCC AAAGTTTTTA 8100

ATGATTGAAG CAAAAAAGCT GCGTCCATCT GTTGAACCTA ACAATTCTTC CATCGCTCGT 8160

TCCGGATGAG GCATCATTCC TAGAACATTG CCCTGTTCAT TACAAATACC AGCAATATTC 8220

GCAAGACTAC CGTTCACCTT TTCTTGATAC GTAAAAACAA TTTGGTTCTT TTCTTTCAGC 8280

TCCTTCAACG TTTTTTCATC ACAATAGTAA TTCCCTTCGC CATGCGCTAC TGGTAAATGT 8340

ATCGAAACAT TTTCAGGATA CTCGCTTGTA AACTTTGTGT GCGTATTCAC TTTCAATGGG 8400

ACACTTTTAC AAATAAAACG AAGCGAATCA TTTCTTAACA ACGTGCCAGG AAGTAAGCCA 8460

ATTTCTGTTA AAATTTGAAA GCCATTACAA GTACCAAAAA CCATTTTGCC TTCTGTAGCA 8520

AAGCGAATCA CTTCATTGAT AATTGGCGAA AACCGAGCAA TTGCACCACA ACGTAAGTAA 8580

TCGCCATAAG AAAAACCACC AGGCAACAAT ACGCCATCAA ATTTTGCTAA AGAAGTGGCA 8640

TCGTGTCGGA CAAATTCTGC ATCGATGCCC ATCACyTCTT TGACAGCCCA CAATAAGTCT 8700

GCATCACAAT TGGATCCTGG AAAAACAATC ACTGCCAATT TCATTTAAAT TTCCTCCAAA 8760

GATAAAATTt CAAAACGATA GGATTCCATA TTTACATTGG CTAATAGTTG ATCACAAATT 8820

TGTTCAATTT TCTCTTCAAT TGCTTCTGTC TGCTGGACGT GCATTTCAAA ATACTTACCG 8880

ACACGGATTT CGGCTACTTC TGTAAAACCA AGCCGATGGA CTGCTTCCTT AACCGCTTCA 8940

CCTTGGGGAT CCAAAATTGA TTCTTTATAC GTAACATAGA CTTTGACAAA ATACaTACGC 9000

GCTTCCTCCT CAATTAGCTG TTAGTAAACG TTGTAAAACT TCTTCATATA CAGGAACTAT 9060

TTCTCCTAAT TCACGGCGAT AAACATCTTT ATCCAGATGT TCGTTTGTGT TTAAATCCCA 9120

TAGACGACAG GTGTCTGGAG AAATTTCATC TGCCAGTAAC AATTGATTGG CCTTCGTCCG 9180

ACCAATCTCG ATTTTAAAAT CGATCAAACA AATATTGAGG CGTTGAAAGA GTTGACTAAG 9240

CGCCTGATTA ATTTGTAAAG CCAATGCTTT AATCTCCACA ATTTCAGCAG GAGTGGCTAC 9300

TTTTAGAAAT TGAATATGGT CTTCGTTAAT AAAAGGATCA TCCAGACGAT CTTCTTTGTA 9360

ATAAAATTCA ATAATGGGCG TCACTAATTT AGTGCCCTCT TCAATCGCTA AACGCTTTGA 9420

AAAACTTCCT GCCGCATAAT TACGTACGAC AACTTCCAAA GGAATCaTTT CTACGACCTG 9480

AATTAATTGT TCGTGTTCAG ATACTTTTTT GATAAAATGA TTGGGAATCT TCTGTTGCTG 9540

TAAATGTTCA AAAATTAGAC TAGTGATTTG ATTATTTAAA GCGCCTTTCC CAAGAACTTT 9600

GTCTTTTTTT TGACCGTTTA AAGCTGTTGC TTGATCAAGA TATTCCACAA AAAGAACCGC 9660

AGCATTTTCT GTTTGATACA ACTTCTTCGC TTTTCCTGTA TACACAAGTG CTTTCTTTTC 9720

CATTCtTTCC CTCTTCTCGT TAGCTTCATG AACCCTATTC TTTCTAACTc AGCTGATTCA 9780

CATCaGACaA AAACTTACAT TAGTTACAAA AAAACaATAA TCGTTCGTGT TTGTCGAATG 9840

TAAAAATAAA AAACTTGCTT GGAGCAAGCA TTTATTAAGT TACGTTCCAA ATACCAACAG 9900

TCTCAATTTC CTTTAACGTT TCTGATA 9927

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7678 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:



TGTCTGCmrT rCAGTAAGGG CGTTTTTGGC TGAAGCAAGA TATGAGGTGG ATGCCTGCAC 60

AGATGGAAAC GAAGCACACA CCAAGTTCTA TGAAAACACC TATCAACTGG TTATTCTTGA 120

TATTATGCTG CCCGGTATGA ATGGGCATGA ACTTCTACGT GAATTTCGGG CGCAAAATGA 180

TACCCCCATT CTGATGATGA CAGCCCTGTC GGATGACGAA AACCAAATCC GGGCGTTTGA 240

TGCAGAGGCA GACGACTATG TAACAAAGCC ATTCAAGATG CGGATTTTAC TAAAGCGGGT 300

GGAAGCCCTG TTACGGCGCA CGGTGCGCTG GCAAAGGAAT TTCGTGTGGG CAGGCTGACA 360

CTTCTGCCGG AGGATTTTAG GGTACTTTGT GACGGTACGG AGCTGCCCCT GACACGAAAA 420

GAATTTGAAA TCCTTTTGCT GCTGGTGCAG AACAAAgGCA GAACCTTAAC CCATGAAATC 480


ATTTTGTCCC 


GCATATGGGG 


ATATGACTTT 


GACGGTGATG 


GCAGCACAGT 


/^OAOA/^T/^AT 

CCACAC jL CA1 


c ^ n 


Tl m/""» T\ TI TV T\ TV m/^ 

AT C AAAAAT C 


TGCGGGCGAA 


CTGCCGGAAA 


ATATCATCAA 


AACCATCCGC 


GGI Gl AGG1 1 


DUU 


ACCGA1 i GGA 


GGAAI CA i 1 A 


1 AATGGAAAG 


A A 7\ a /T"* z'"* 7\ np np 
AAAAGGGA1 1 


1 1GA1 IHHxjVj 


rp ni rp rp rp rrt rn t\ 

11X111 LLln 


uDu 


T AC GAT CAT T 


GTCCTGTTAC 


TGCTTGTCGG 


TGT. AACGGCA 


AC AC i G 1 1 1 G 


CACAGCAA1 1 


/ £.\J 


TGTGTCTTAT TTCAGAGCGA TGGAAGCACA GCAAACAGTA AAATCCTATC AGCCATTGGT 780

GGAACTGATT CAGAATAGCG ATAGGCTTGA TATGCAAGAG GTGGCAGGGC TGTTTCACTA 840

CAATAACCAA TCCTTTGAGT TTTATATTGA AGATAAAGAG GGAAGCGTAC TCTATGCCAC 900

ACCGAATGCC GATACATCAA ATAGTGTTAG GCCCGACTTT CTTTATGTGG TACATAGAGA 960

TGATAATATT TCGATTGTTG CTCAAAGCAA GGCAGGTGTG GGATTGCTTT ATCAAGGGCT 1020

GACAATTCGG GGAATTGTTA TGATTGCGAT AATGGTTGTA TTCAGCCTTT TATGCGCGTA 1080

TATCTTTGCG CGGCAAATGA CAACGCCGAT CAAAGCCTTA GCGGACAGTG CGAATAAAAT 1140

GGCAAACCTG AAAGAAGTAC CGCCGCCGCT GGAGCGAAAG GATGAGCTTG GCGCACTGGC 1200

TCACGACATG CATTCCATGT ATATCAGGCT GAAAGAAACC ATCGCAAGGC TGGAGGATGA 1260

AATCGCAAGG GAACATGAGT TGGAGGAAAC ACAGCGATAT TTCTTTGCGG CAgCCTcTCA 1320

TGAGTTAAAA ACGCCCATCG CGGCTGTAAg CGTTcTGTTG GAGGGAATGC TTGAAAATAT 1380

CGGTGACTAC AAAGACCATT CTAAGTATCT GCGCGAATGC ATcAAAATGA TGGACAGGCA 1440

GGGCAAAACC ATTTCCGAAA TACTGGAGCT TGTCAGCCTG AACGATGGGA GAATCGTACC 1500
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CATAGCCGAA 


CCGCTGGACA 


TAGGGCGCAC 


GGTTbCbbAb 


bl.bb lALLLo 


7ATTTTPZ1 A TAP 


1 jOU 


CT-TGGCAGAG. 


GCAAACAACC 


tv r** r** r** rnrn rp 

AGCGGTTCGT 


CAbAbAIAl 1 


pp a pppppa p 


AARTTP.TPPT 




GTCCGATCCG 


AAGCTGATCC 


AAAAGGCGCT 


tv rp /-» TV tv TT\f*rnf* 

ATCCAATb rb 


A1A1 luftAlu 




1 rlRfl 

1DOU 


CACGCCCCAG 


GGAGGTGAGG 


TACGGATATG 


GAGlbAbbb I 


p p P P P T P 7i A 71 


tv TATAPPPTPT 


17 4 0 


TTCCGTTTTG 


AACATGGGCG 


TTCACATTGA 


rp /-» TV rp TV /-•m TV 

TGATAbl bbA 


PTTTPA A APP 
bill Lft/iAovy 


TriT TP ATP PP 


1800 


ATTCTATCGC 


ATTGATCAGG 


CGCGAAgcAG 


_ _ i\ i\ "n tv rp /■■ ^* 

caAAAAGTGb 


bbbAAgCbb 1 


TTPPPPPTTP 
1 lbbbbv^l lvj 


X O \J\J 


CCATCGTACA 


AAAAACGCTG 


GATGCCATGA 


/"*/■'/"• m O Tv tv rp t\ 

GCCTCCAATA 


1 bbbb I bbAA 


7\ TAP APPTP AP. 




ATGGCGTTTT 


GTTCTGGCTG 


GATTTACCGC 


CCACATCAAC 


7\ /*> rp TV rp TV TV TV rp TV 

ACT AT AAA 1 A 


TTT A A 7A 7A PTT 
1 i i AAAAb i 1 




AAATGATTTT 


GACCGACAGG 


TATAACCCTG 


CCGGTCTTTT 


TGTTTTTCGC 


bbb 1 AbAbbrV 




AAACTACAGA 


TTGACTACAG 


GGAAAGTACA 


GATACGCTTG 


O T\ rp tv Tv rp tv tv/"* 

CCATAATAAb 


AAlbbl Abb A 




GCCACAAATC 


GTAGTTTTAT 


TGCAAAGGAG 


GCATTCAATC 


AAATGGAAAA 


AAbb AAb 1 A 1 


9T fin 


CATTCCAATG 


TGAATCATCA 


CAAACGGCAT 


Ti rrt tv 7\ 7\ t\ TV rn 

ATGAAACAAT 


rp /""/"• T\ t\ tv tv 

CTGbGbAAAA 


Abbbbb 1111 


99 90 


CTATGGGCGT 


TCATTATCTC 


GTTCACAGTC 


TGCACGCTGT 


TTTTGbbbTG 


bAbAl lbb 1 1 


99fl fl 


TCCGTATTGG 


AGGCAACACA 


GCTACCGCCC 


ATCCCTGCAA 


C Tb AT Ab Ab b 


bAbbbbbHb 1 




GGTGTAGCGG 


AGAATCCAGA 


GGAAAACACT 


CTTGCCACCG 


f*f~* TV TV Ti r 1 J\ TV r^TV 

CCAAAGAACA 


bbbAbA 1 bAA 


9 d nn 


CAGGAATGGA 


GCCTGATTTT 


AGTGAACAGG 


CAGAACCCCA 


TCCCCGbbbA 


b i AbbA 1 b 1 b 


9 4 fin 


GAACTTGAGC 


AGCTGTCAAA 


TGGTGAGCGG 


ATAGACATTC 


GGATTTCTCC 


PTT\PPTPPJ\r 

b I Abb 1 bbAb 


9 ^9n 


GATTTGTTTG 


ATGCCGCAAG 


AGCTGATGGA 


GTTTACCCGA 


TTGTbbbAIb 


bbbAi Abbou 


9 SRO 


ACAACAGAAA 


AACAGCAAGA 


AATCATGGAT 


GAAAAAGTCG 


CCGAATACAA 


GGCGAAAGGC 


9 fi4 n 

Z D H \J 


TACACCTCTG 


CACAGGCTAA 


AGCGGAAGCA 


GAAACTTGGG 


TGGCCGTGCC 


GGGAACAAGC 


£. 1 \J\J 


GAGCATCAGC 


TTGGTCTTGC 


TGTGGATATC 


AATGCGGATG 


GAATTCATTC 


AACCGGCAAC 




GAGGTTTACA GATGGCTGGA TGAAAACAgC 


TATCGCTTTG 


GTTTTATTCG 


CCGCTACCCG 




CCAGACAAGA 


CAGAGATAAC 


CGGTGTGAGC 


AACGAGCCGT 


GGCATTACCG ATATGtCGgC 




ATCGAAGCTG 


CCACAAAGAT 


ATACCACCAA 


GGGC1 I IGbb 


TTGAGGAATA 


TTTAAACACA 


2 940 


GAAAAATGAG 


AAAAGGATAT 


AATGCTATGA 


ACAbAAAAAb 


ATTGACACAG 


CGCTTCCCGT 


3000 


TCCTGCTTCC 


AATGAGACAA 


GCGCAGAGAA 


T\ A tv rp TV rp/^/^rprp 

AAATAIbbl 1 


TTATGCGGGA 


ATGAGATTTG 


3060 


ACGGCTGTTG 


CTATGCACAG 


ACGATAGGAG 


TV TV TV TV TV TV ppprp 

AAAAAAbbb I 


TCCCTATTTG 


CTCTTTGAAA 


3120 


CGGATTGTGC 


GTTATACAAC 


CACAATACCG 


O t\ rp rp rri /"*• tv r> tv rp 

GAT I i bAbAl 


GATATACCAA 


GAAAACAAGG 


3180 


TGTTCAACTT 


AAAGCTGGCG 


GCAAAGACCT 


TAAACGGCCT 


ATTGATAAAA 


CCGGGGGAAA 


3240 


CCTTTTCTTT 


CTGGCGGCTG 


GTACGCCATG 


CGGACAAAGA 


TACCCCCTAT 


AAAGACGGCC 


3300 


TTACGGTGGC 


CAATGGTAAG 


CTCACCACCA 


TGTCGGGCGG 


CGGTATGTGC 


CAGATGAGCA 


3360 


ATTTACTATT 


TTGGGTGTTC 


CTGCATACGC 


CATTGACAAT 


TATCCAGCGC AgcGGTcACG 


3420 


TAGTAAAGGA 


GTTTCCAGAG 


CCAAACAGTG 


ACGAGATCAA 


AGGGGTGGAT 


GCAACCATCT 


3480

CAGAGGGCTG GATTGATTTA AAAGTGCGAA ACGATACCGA CTGCACCTAC CAAATATGGG 3540

TGACCCTAGA TGATGAGAAA ATCATCGGTC AGGTGTTCGC CGACAAACAG CCTCAAGCAT 3600

TATACAAAAT TGCAAACGGC AGTATTCAGT ATGTCCGTGA AAGTGGCGGG ATTTATGAAT 3660

ATGCCAAGGT TGAACGGATG CAAGTTGCCT TAGGTACCGG GGAAATAATA GATTGCAAGC 3720

TGCTTTATAC AAACAAATGC AAAATCTGCT ATCCCCTCCC GGAAAGTGTG GATATTCAGG 3780

AGGCGAACCA ATGAGAAAAA GTATGGGCAT TACTGTTTTT GGATGCGAGC AGGATGAGGC 3840

AAATGCTTTC CGCACCTTAT CACCAGATTT TCATATTATC CCTACGCTGA TCAGTGATGC 3900

GATATCGGCA GACAACGCAA AATTGGCCGC TGGCAATCAA TGCATTAGCG TAGGCCATAA 3960

GTCCGAGGTT TCCGAGGCGA CAATTCTTGC GCTGAGAAAG GTCGGGGTAA AATACATTTC 4020

TACCCGCAGC ATCGGCTGCA ATCACATTGA TACGACTGCC GCCGAGAGAA TGGGGATCTC 4080

GGTTGGCACA GTTGCGTATT CGCCGGACAG CGTTGCGGAT TATGCTTTGA TGCTGATGCT 4140

GATGGCCATA CGGGGTGCAA AGTCCACCAT ACACGCCGTG GCGCAACAAA ATTTCAGACT 4200

GGATTGTGTC CGGGGGAAAG AGCTGCGGGA TATGACTGTG GGAGTTATTG GAACCGGCCA 4260

TATAGGGCAA GCGGTCGTCA AAAGGCTGCG GGGATTTGGA TGCCGTGTGC TAGCCTATGA 4320

TAACAGCCGA AAAATTGAGG CAGATTATGT CCAGCTTGAT GAGCTTCTAA AAAACAGCGA 4380

TATTGTTACG CTCCATGTGC CGCTTTGTGC GGATACCCGC CATCTGATCG GCCAGAGGCA 4440

AATCGGAGAG ATGAAGCAAG GCGCATTTTT AATCAACACT GGGCGCGGGG CGCTTGTCGA 4500

TACCGGGTCG CTGGTGGAGG CACTGGGAAG CGGAAAGCTG GGCGGTGCGG CACTGGATGT 4560

GTTGGAGGGC GAGGATCAGT TTGTTTATAC CGACTGCTCG CAGAAAGTGC TTGACCATCC 4620

CTTTTTGTCG CAGCTCCTAA GGATGCCAAA TGTGATCATC ACACCCCATA CGGCGTACTA 4680

CACCGAGCGT GTGCTGCGAG ATACCACAGA AAAAACAATC AGGAATTGTC TTAACTTTGA 4740

AAGGAGTTTA CAGCATGAAT AAAATAAAAG TCGCAATTAT CTTCGGCGGT TGCTCGGAGG 4800

AACATGATGT GTCGGTAAAA TCCGCAATAG AAATTGCTGC GAACATTAAT ACTGAAAAAT 4860

TCGATCCGCA CTACATCGGA ATTACAAAAA ACGGCGTATG GAAGCTATGC AAGAAGCCAT 4920

GTACGGAATG GGAAGCCGAT AGTCTCCCCG CCATATTCTC CCCGGATAGG AAAACGCATG 4980

GTCTGCTTGT CATGAAAGAA AGAGAATACG AAACTCGGCG TATTGACGTG GCTTTCCCGG 5040

TTTTGCATGG CAAATGCGGG GAGGATGGTG CGATACAGGG TCTGTTTGAA TTGTCTGGTA 5100

TCCCCTATGT AGGCTGCGAT ATTCAAAGCT CCGCAGCTTG CATGGACAAA TCACTGGCCT 5160

ACATTCTTAC AAAAAATGCG GGCATCGCCG TCCCCGAATT TCAAATGATT GAAAAAGGTG 5220

ACAAACCGGA GGCGAGGACG CTTACCTACC CTGTCTTTGT GAAGCCGGCA CGgTCAGGTT 5280

CGTCCTTTGG CGTAACCAAA GTAAACAGTA CGGAAGAACT AAAGGCTGCG ATAGAAGCAG 5340

CAGGACAATA TGATGGAAAA ATCTTAATTG AGCAAGCGAT TTCGGGCTGT GAGGTcGGCT 5400

GCGCGGTCAT GGGAAACGAG GATGATTTGA TTGTCGGCGA AGTGGATCAA ATCCGGTTGA 5460



GCCACGGTAT CTTCCGCATC CATCAGGAAA ACGAGCCGGA AAAAGGCTCA GAGAATGCGA 5520

TGATTATCGT TCCAGCAGAC ATTCCGGTCG AGGAACGAAA TCGGGTGCAA GAAACGGCAA 5580

AGAAAGTATA TCGGGTGCTT GGATGCAGAG GGCTTGCTCG TGTTGATCTT TTTTTGCAGG 5640

AGGATGGCGG CATCGTTCTA AACGAGGTCA ATACCCTGCC CGGTTTTACA TCGTACAGCC 5700

GCTATCCACG CATGGCGGCT GCCGCAGGAA TCACGCTTCC CGCACTAATT GACAGCCTGA 5760

TTACATTGGC GATAGAGAGG TGACCCGTAT GGAAAATGGT TTTTTGTTTT TAGATGAAAT 5820

GTTGCATGGT GTTCGTTGGG ATGCCAAGTA CGCTACATGG GATAACTTCA C'JGGAAAACC 5880

AGTGGATGGG TATGAGGTGA ATCGCATCAT CGGCACAAAG GCCGTGGCGC TTGCTCTGCG 5940

CGAAgCACAA ATCCATGCGG CAGCCCTTGG CTACGGCTTG CTTTTATGGG ATGGATATCG 6000

GCCAAAATCT GCGGTGGACT GTTTCCTGCG TTGGGCGGCG CAgcCGGAgG ACAACCTCAC 6060

AAAAGAAAAA TATTACCCCA ATATTGAGCG AGCCGAGTTG ATTACAAAGG GCTATGTGGC 6120

CTCACAATCC AGCCATAGCC GTGGAAGCAC AATTGATCTT ACGCTCTACC ACTTGGATAC 6180

AGGGGAACTT GTTTCAATGG GAAGCAACTT CGATTTTATG GACGAACGGT CGCACCATAC 6240

AGCAAAAGGG ATAGGGAATG CAGAGGCACA AAATCGAAGA TGCTTGCGTA AAATCATGGA 6300 

AAGCAGCGGA TTTCAGTCCT ATCGCTTTGA ATGGTGGCAC TATAAGTTGA TTGATGAGCC 6360 

ATACCCCGAT ACCTATTTTA ATTTTGCTGT TTCATAATGA AAGTATTTGA TTTTCTAATT 6420 

ATGTATAAGT TGGCTACAAA TTACTTAGTA TTTCATCAGA CCAATTACTC TCTTGTTTAC 6480 

AGAAAAATTC TGCGCTGATG GAATCTGCTT TATTATGCGG GCGAAAAATG AAATTGACCA 6540 

TATTTTTTCa GAACTTTACT CTGTACCGaA TTGCCTGCAA AAGCCTTATT TTAAGCTGAA 6600 
AGTTCAGGAA TTGCTTTTGT TTTTGTGTAT GCCCCTCGTG ATTTGTACAC CTATCTTAAT 6660

TGGCTTTGCA ATTCTCATTC CGTATCTCTG CTTTAAGAAT TTGGaAAAAc GAAGCATTGT 6720

GAATCGGCTG CGGGCAGAGC AAAAAGAGAA CCAGCAGAAA CAAGTCGTTC TTGCTCTGCT 6780

GATTCACTCG GAACTGTTTG ATTCGGGTTT TCGTTGAAGG TCAAGTAGCT GCTCTGTCAG 6840

GAAGTCCAGT GTGTTCAGCA GAATCTGCTG ATTGTCACGG TTGCATGACT GAAATTTTCC 6900

CATGAAACGC TGGAGTTCTT CATCCTCAAT AGAGTTTGAA GCTTGAAACA GGACATTCGG 6960

AGAAATTCCC AGTCGTTCAA TTAACCGGGA AAGTGTTTCA TAAGACGGAT TTTTTCTTCC 7020

CTTTTCGATG TCCTGAACCG TTTTGACCGC CAAGCCAGAT TGATCCGcCA GTTCCTGTTG 7080

GGTCAAGTTA CATCGAACCC TTTCGGCTCG GATGTGCCTA CCCAGAAATT TCAGGCTGTC 7140

TTCCGGCATG ATtGTTCACC TCATATATAT TCTATCGTAC CAATTTGTAT TTTGGTACTC 7200

CCTTAGAGTT GTGTTCCAAT CAGTATTATC ATGCCGGTTT TTGAGTTTAA TATGCAATAA 7260

AGCCCTGTTG GAAGCAATAA AAAAAACAGC TTCTCACGGC aGGGCCTAAT TGCCaCGCCC 7320

CCGTGGTTTG TCTGTAAAAT CATGGGGGTA TTTTATGTTC AAAAGAATAA TGGGAGGTAA 7380

TGCGTTAAAA AAAGTGTAGC TCCGCCGTGG GCATTATCCG CCCCGCTAAG CGCACTATCG 7440

TTCCTATACA ATACATATCA TCATTCCTAC TACCTTATTC GGTCGCATTG CGCCAGGAAG 7500

CCTAAGAAGC TTCCTGGCCy TTTTGTTTGT CGAATTTTGT TGAATGATAA CGAGAATTGT 7560

CGATGTCGCA GCCCGCCTTC ATCGGTTCGC AGCCCATGAA CTGATGAAAG GAGTAAACGG 7620

CTATGACCAA AGAATATTGG GTAGCGGTCT TAGAACAAGA AGATCGTnTG ATAAGTAA 7678
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4732 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

ATTTATTATA AGGGAGCGAn ATTnCATGGC ACGCGTAGAA AGTTtTGAAT TaGATCACAA 60

CACAGTAAAA GCACCATATG TTCGCCTTGC TGGCACAGAA CAAAATGGTG ATGCGTTAGT 120 

CGAAAAATAT GACTTACGTT TCTTACAACC AAACAAAGAT GCCCTACCAA CAGGCGCATT 180 

ACACACGTTG GAACATTTAT TAGCAGTTAA CATGCGTGAT GAATTAAAAG GAATCATTGA 240

CATTTCGCCA ATGGGTTGCC GCACTGGTTT TTATATGATT ATGTGGGATC AACATTCACC 300 

ACAAGAAATC CGTGATGCAT TAGTCAACGT TTTAAACAAA GTAATCAATA CAGAAGTTGT 360 

TCCAGCAGTC TCTGCAAAAG AGTGCGGAAA CTACAAAGAT CATTCTTTAT TTGCAGCGAA 420

AGAATACGCA AAAATCGTCT TAGACCAAGG AATTAGTTTA GATCCATTTG AACGTATTCT 480

GTAATCTTTA GCAATTAACG ACAGAGAAAC CCACTATTAA GCTGTGGGTT TTTCTGTTCT 540 

CTTTCTTAAT TTAAACGGAG GTTTTTTATG AATCAAACAA TCGAACAATT ACTAAGTCAC 600 

CGTTCCGTTC GTCATTTTAA AAAGCAAGCG TTAACGGACA AGCAGAAACA ACAGTTAATT 660 

ACTGCAGCTC AAGCAGGATC AAGCTCCAAT TTTTTACAAG CTTATACAAT TATAGAAATC 720 

AAAGATCCTG AGCTACGCAG AGAATTGGGA CGTTTAGCGA ATTGTGAAGA CTATGTCGTT 780 

AACACTGGTG TGTTCTATGT TTTTGTTGCT GATTTGTATC GTCATGCAAC GATTTTGTCA 840 

AAAGAGGGGC AATCATTGGA ACCCTTAAAG ACACCAGAAT CATTATTAGT AGCTGCCGTT 900 

GATACAACGA TTGCTGCGCA AAACATGGCC ATTGCGGCTG AATCTATGGA TTTAGGGATT 960 

TGTTATATTG GCGGCATTCG AAATGACTTA GACACAGTAG CCAAACGCTT ATCTTTGCCT 1020 

GAATTAACTG TACCACTTTT CGGATTAACA ATCGGTGTAC CAGAAACGTT AAATGGTGTA 1080 

AAACCACGGA TGCCTTTTGA AAATATTTTA AGTGAAAATC ACTACCAATC AGACAAATTG 1140 

ACAGATATGC ACACATATGA TGAATTATTA AAAGATTATT ATGCCAGCCG CTCAAGTAAC 1200 

GCCCAAACAG CGGATTGGTC ACAGAAATCT TTGTCCTATT TTTCTTATAA TCGTCGCCCA 1260 

GAAGTAAAAA TTTTCTTGCA AAAACAAGGA TTtGATGTCT AAACAAAAGA GCGTGGGCTT 1320 

ACGTTATTTC AACGTAAGCC CACGsTsTTT TGTTAATTTT TATTTATCTT TACTAAATTT 1380 

GTCTTTTAAG TCTTCAAAGC CTTCTTTAAC TTTATCAGTT AATTTTTCAG CGCCTTCTTT 1440
AAGGTCGTCG CCGGCATCAC
TCCTTCAAGT TCTTTGCCTT 
TTCTACTTTA TCTTTTGCAT 
TAAATAATTA ATTACATAAT 
TGAACTACGA ATAAATGGTT 
ACCGATTGTT CCTGGATTAT 
TGTTTCCACA TATCCATTGA 
ACGTTGTTTC ATGACGCCGC 
ATTGTACGnC GtTGCGCTAA 
AATAAATGTC CTTTTTTACC 
AAACAATCTC CATGATAAGG 
AACGCATGCC CCATTTCTGG 
CCAGCGCCAA CACCAGTTCC 
CCAGCAACAT ACTCACCATA 
TCAAAATGTT GTTTCATAGT 
GAAGTGATAT AACCATACGT 
ATCCCAATCG CTTTTAAAGG 
TCTGGGGTTG TTGTGGGAAA 
CCACATACAA ATTTTGTGCC 
ATATGCTCCT TCGTTTCCAT 
AACATACACG TTTAGTTTAC 
GTCAGGTTGC CCAATGAAAT 
CGAGCATTCA CGGTTATCTA 
AAATATAACA ATAGTCCGAT 
AAAAGCGTTT CTTTTAGAAT 
GTCACCAAAG AAAAGAGGAA 
AACCAACTAG CCATTATTCT 
AATGACACTA TAAATTGCCC 
CCATGTGGGT AGGACCCATT 
GTTAATCGTA AATTCTTGTT 
ACGTCCTGTA AATATATAAG 
ATTCATGGAA CTATTTGGGG 
AGAGGATCCT TTTTTTATCA 



GTGCTTTGTC 
TATCATCCGT 
CGTCAAAACG 
AAGAGTATCA 
ATTTTTGAGC 
CTTCTAAAAG 
CTAATTCAGC 
CACCTAAAAT 
ATAATAAGCT 
AGAACGACCT 
ACAATTTCCT 
GTGGCTAAAG 
AATTGTATAA 
TGCCGCAGCA 
TCCTAACAAG 
AGGAGAATCA 
ATATTGTTGG 
ACTGACACGT 
ACCGGCTTCG 
TTTACAAGTA 
CACTTTTTTG 
CTGGCAAGTT 
ACAATGAATG 
TGCAATCCAG 
ACTTCTGGGC 
AAACATAAAA 
CACTCTCCTT 
CAATTGTGAT 
TGACCGTAGC 
GAAAAGGCCA 
TCCCAGCATC 
CAAAGCGAAC 
CTTGACCTTC 



TTTTACGTCA 
AACTTTGCCT 
TCCTTTTAAA 
TTCTTAGCCA 
CATTAAAGCT 
AGGCGTCACC 
AAATTTTTCA 
GATCACTTCT 
TCTAATTCCC 
TCAACTGCTG 
GCATACGTAT 
CCTTCAATAA 
TATACACAAC 
TTCACATCCG 
TCAAAGTTAC 
ACGTGAATAT 
AAAAATTCTA 
TCTACGATGG 
ATACTTCCTA 
AGCGGATACA 
TCGTTTCGAC 
GCTAAAGAGC 
CTCGTTTTTG 
CCACCACTAA 
TCTTTTGCAT 
CAAATGACCG 
TTCGCTTGTT 
TACTAACAAC 
TTCTTGATTC 
AGACTGTTGT 
TAGCGGTTGT 
ATTTTTTAGT 
TAAACGAACT 



GCAAAGGTTG 
TGCGCTTCTT 
TCTGCCATGC 
TTTCCTTCAA 
TTTTTTGCCA 
AAGTATTTTT 
CGAACTTTCG 
GGCGCTAATA 
AAGTTTTATG 
GTCCTGCTGC 
CTTCAGGATG 
ACTCACCGTT 
TAGATGTATG 
TTGTCCAAGC 
GCCAAGCTAA 
CAATCGGACC 
TTACTTTTTT 
TCAAATCATC 
AAAGTTTTTC 
TTACTTTTGA 
AGAACAATTT 
ATTTTCCTAA 
TTTATTGATT 
GTAAAAGCAT 
GCTGAATAAA 
CTAAACCGCC 
TTTTGATAAT 
GTGTAGAAAG 
AATTTTTTTG 
TCATCTTGTT 
TTACCCAAAT 
GTCCGTTGTG 
TCTGGAAAAA 



ATTGAGCTTT 
TTGCAGTACC 
TAATCCCTCC 
ATATAATGCA 
AGGCAAAGCA 
CTAAAGGCGG 
GCATCAAATG 
ATAAAGTCGT 
ATCCTCTTCC 
AATCCCTTCT 
ACGACGAACT 
TTGAATCGCT 
TTGCCCATTT 
CATTGGCACA 
TTTTGGTGTA 
AAATGAACCA 
CATTGTTTCT 
TGTCCCAACG 
TGTCATTGCA 
TTAAATAACT 
AACAATTTCA 
AACAAAAAAA 
CTTTAGCGAA 
TAACCAACTT 
ACTCATACTA 
AGACAACCAA 
TATTGCGCCA 
TCGGTAACCA 
CTTCACGAGT 
GTAAAATAGC 
CAAGATGAAA 
CAATTTTTTT 
TAGCAGCATA 



1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 
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TGGATTTTGA 


ATGACCGCCT 


TTAGCGATTT 


GGCTCCTAAA 


GCAATTGTAG 


GTGTTACTTT 


3480 


CGCTAATTTC 


AGTTGGTTCA 


CCTCTGTTAA AGGAACATTC 


GCAGCACTCC 


TCATGGCAAT 


3540 


CGCAACATTA 


AACCCGTAGC 


GACTGGTTAA 


TCCCGTTGTT 


TGGTTCTCTT 


CCTCTGTTTT 


3600 


TTCATCTTTG 


AGATGCGCAT 


AAACCCCGCC 


CAAAACAACT 


CCTTTGATAG 


GCTGATCAAG 


3660 


TCCCTTCACC 


TTCATCTTTG 


CTGATACTGT 


TTGATGAGGC 


GGCACACTCA 


CTTCCTCATT 


3720 


TTCTAAAGTA 


ACCAAGTCTG 


TCAGCGGAAC 


CTTTAACGAT 


GAATCAAGTG 


GTTTATCAAA 


3780 


ATCATTGTAC 


ACAATCAAAC 


CATTATCATT 


TGTTGaTGCA 


TTTGCTAGAG 


TCAATACGAT 


3840 


GGaCAATTTT 


TCGTCTGTCT 


TATTTTTTAT 


GTCTAAGACA 


ATAGTGGAAG 


ATTGCTTTGG 


3900 


TGCCAGAATT 


AAATCATAAT 


AACTAGATGT 


TCCTGACACT 


TGATTTTCTG 


GTAAAACAGG 


3960 


CGTTACTGTA 


AATTTAGTCT 


TCATTTCTTC 


TGTAGCTTCT 


ACTTTTGGAA 


CAATAATAAA 


4020 


AAATGTGAAA 


AACAGAAATA 


GACAGAwAAC 


ACCGACTTTT 


TTTCTACTCA 


TTTCTTCAAG 


4080 


ATCTCCTTTA 


TAAAGCTGAA 


TGTACCTACT 


AGTCTAACAA 


ATCAGCGAAA 


CAATGCTGAT 


4140 


CAATCTTTGA 


CTTTTGGGAA 


GGATAATCTT 


GAAAAAAGTC 


TCCTTGAAAA 


ATACCAAGGA 


4200 


GACTTTTTAT 


TaTTCACTAT 


AATATAATTC 


TTTTTTTATC 


TGTGTGGTTG 


ATACTTCTGG 


4260 


CGTTCGTGGC 


AAATAAAGAA 


CCTTTGCGGT 


TGTTTCAACT 


TCTAGAAAAT 


CAAAAGCCCC 


4320 


TTGCCAATCA 


TCGCCCATGA 


CTAAAGTATC 


AATTTTAAAT 


TCAGCAATAT 


CGGAAACTTT 


. 4380 


TTGTTTCCAA 


CTAGTTTCAG 


GAATAACTAA 


ATCAACATAA 


CGAATTGCCT 


CTAATAATTG 


4 4 40 


TTTACGTTTT 


TCATATGAAA 


AATAACTTTG 


TTTCTTTTTT 


TCTAAATTAA 


AAGCATCTGT 


4500 


TGAAAGTCCA 


ACAATCAGGT 


AGTCCCCTTG 


TTGTTTCGCT 


TTTTTCAACA 


AATTAATATG 


4560 


ACCATAGTGT 


AGGAGGTCAA 


ATGTGCCGTA 


AGTAAGTATT 


TTTTTCATAA 


AAACCTCCAT 


4620 


TTGTACTTTG 


ATATTTC TTC 


GTTAATTATc 


AATTTTTaTG 


AATAGAAAAA 


AGGCTACTCT 


4680 


TGaTTGaCTT 


TGaTGGTTTT 


TCGATGCAAA 


TTTTTTATTT 


TnCGTAGCCG 


Tn 


4732 


(2) INFORMATION FOR SEQ ID NO: 63: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7294 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GGGaaAATwt TTTTAtgkGc tATGGcGTAt TACtAGTTAC gTTGTTGCCC TAGTGTCCcT 60 

TATTTAAAAt ATTTGGtGAA AAATCaGCGT TTATGGACcA TTGgATACGG CCcATTTCcA 120 

GTGGGGAATG TTTTkGcCTG TGCTTTTTGG CTTTTTCATG CTGATTGGCT TTGATCAATT 180

TTTTATTGCG TTTCATGGAT TATTTTTTAA CAACGATGCT TGGTTATTTA ATCCGCTGAC 240 

TGATCCGATT ATCAATGCCT TACCTGAAGC CTACTTTATG CATTGTTTTA TCTTGTTTTT 300 
TATTTTATTA GAAGGCCT^T TTTTGATAGG TATTTTAATC GG&AAACGTT CTTTAAAAAA 360

ATAAAGAGAG AAAAGTGAAC AATTCCAAAA TTGCTCGCTT TyCTTTTTTA TTCAaTCtAG 420

TTTTAAAAAC TAGATTGAAT AATTGTCAAA AGTATGATAT TCTTTTTTAG AAATGGTTAG 480

GAGGGTTTTT ATGGACGAGA TAGAAAAATC ACTACAAAAT TGGGGCCAAC AATTGGAAAA 540 

CGTTCATTTG CCAAGATGGC ATGAGTTACC AGATATTGAA TTATATATGG ATCAAGTGAT 600 

TACTCTGATA GAAAAATACT TATCCCCATT GATTACACTG GAAAAACATA CTTTGTTAAC 660 

GTCCTCAATG GTTAATAATT ATGTCAAACA CGGGTTAATC CCGGCCCCAG TGAAAAAGCG 720 

CTACAATCAA AAGCATTTAG CGTTTTTAAT TGCGATTACG TTGTTGAAAC AAGTGTTAAC 780 

ATTGCCAGAA ATTAAGCAAG GCATTTTATT CCAAGGAGCT ACGGTTGGTA TTCGTGAAGC 840

gTACAATTTA TTTTGTCAAG AACAGGAACG AGCGATTTAT GTGATTGCTG CTCAAGCGCA 900 

AGAAAAAGAA GTTCAGGCCA AATCACAAGA ACCTATGGGG ATTGAATATT TGGCTGTAAA 960 

AGCAGCGACA ATGTCTTTTG CTACTAAAAT GTTTAGTGAA AAAGTGATTG AATTGGAACA 1020 

AGAATACTTA AAAGAAATGG ATGAGATGAC ACATGAATAA AGAAAAAATC GCACTACTAG 1080 

TTGATTCTGG GACAGATGTT CCAGAGGCAT TAGTCAAACA ATACGGCATG TATGTCTTGC 1140 

CCTTACAGAT TATTTATCCT GAAAAAACCT ATACAGATAA GGTAGATATT ACACCAGAAG 1200 

AAGTGTATCA ACGTTTAGAG AAAGAAATCC CTAGTACTTC ATTACCTGaT GGTGCAACAA 1260 

TCCAAGCTAT TTTTGACaAA ATTAAAGAAG CGGGTTACGA AAAAGTCTTA GCGGTGACCA 1320 

TTTCTAGTGG TTTAAGTGGA ACGTATAATG TAGTTCGCTT GCTTGGAGAA CAAACCGAAG 1380 

GACTAGATGT ATTTGTTTTA GATACAAAGA ACATTGGTAT CGGGGCTGGC ATTCAGGCGA 1440

TACGAGCAGC AGAATTAATT GAAACAGGAT TAGGATGGCA AGAATTACAA CAAAAATTAA 1500

CAGAAGAAGT AGCCAACGCA AAAGTATTTT TTAATGTAGC CACTTTAGAA TATCTACAAA 1560 

AAGGTGGACG AATTGGTTTA GTCACTTCTA TTTTAGGGAA TGCGCTAAAA TTAAACCCCA 1620 

TTATTTCTTG TAATGAAGAA GGCATTTACT ATACGGTGGC TAAGTCTCGC GGCCGTAAAA 1680 

AAAGTCTTGA TAAAACGTTT GAGTTGGTGA CGAACTTTAT AGGTGAGGCA CCCCGTTTCC 1740

GTTTGGCAGT CGCTCACGGT GCGGCAGAGG AAGAAGCmAA AGCmATGATG GAACGCTTGA 1800 

AAGCAGCTTT TCCACAAGCA GAAGAAATTT ACTTTGGAAC AATTAGTCCG GCATTAGTTG 1860 

TTCATACAGG ACCAGGACTT TTAGGTGTTG GCATACAGTT ATTAAATGtT AtTAAATGAA 1920 

AAATCCCCTC AAATTTTTTT GaGGGGrTTT TTCaTTTAAG ATTyCyCTTT TyCATAGrAA 1980 

TATCaTTtAG CyGGaAAAAC aACGrAAATT wTwTTGaAAA TyTCACGAAT GTGTTACAAT 2040 

CCATCTACAA AGGGGGAtTT TTTGTGaAAA AAGCAATAGC AGAATGTTTA GGAACATTTA 2100 

TTCTTGTCTT TTTTGGGACA GGAACAGCCG TTCTAGGAAA TGGTATGGAA GGGATTGGAA 2160

CAACGGGGAT TGCCTTAGCA TTTGGGTTAA CAATCGTTGC TGCTGCCTAT AGTATTGGCA 2220 

CAATTTCAGG TGCACATCTA AACCCAGCTG TTTCTATAGG AATGTGGCTT AATAAACGAA 2280 



TGACAACAAT GGAACTAATC TACTATGTAG TGGGACAAAT TGTTGGTGGC TTGATTGCCT 2340

CATTTGCTTT ATTAAGCATT CTAAAAGGTG CTGGTAAAAG TATTGAGAAC TTAGGTCAAA 2400

ATGGCGTAGG GGAATTAAGT GTTGCTGGTG CATTGACAGT CGAAATTATT TTAACATTTA 2460

TTTTTGTTTT AGTTGTCATG ACAGTTACAA GTGCTAAGAA AGGCAATGCT TCATTAGCAG 2520

GAATCGTAAT TGGGTTAACA CTAACAATGA TTCACTTAGT CGGCATTCCT TTGACAGGAA 2580

CTTCTGTCAA CCCAGCTCGT AGCATCGCGC CAGCCGTTTT TGCGGGTGGT TCAGCGTTAT 2640

CTGAGTTGTG GATTTTTATC GTTGCGCCAT TAATTGGTGG ATTATTAGCC GCTCTTGTAG 2700

CAAAATTCGT CTTAGATACG GAAGAATAGA TTAGCGAAAG AAGAATCTTT GATAACAAGC 2760

GTCAATTGTT GTCAGAGATT CTTTTTTTGT TATACTAGGA GAACCTAAAG GCGCATTTTT 2820

AACCAAAGAT GAAAACATCA TCAAAATAAG GTATAATAGG AAAGTATCAA ATAAGATTAG 2880

AAGGAAAAAG AGGTGTCTCT TTGTGAAGAA AATTTTAGTA GTTGATGACG AGAAGCCAAT 2940

TTCAGAGATC GTTAAATATA ATTTGGTTAA AGAAGGATAT GAAGTATTTA CTGCTTATGA 3000

TGGAGAAGAA GCACTTGAAA AAGTGGAAGA AGTGGAACCA GACTTAATTA TTTTAGACTT 3060

AATGCTCCCT AAAATGGATG GCTTAGAAGT CGCGCGAGAA GTGCGCAAAA CACATGATAT 3120

GCCAATCATT ATGGTGACTG CCAAAGATTC TGAAATTGAT AAGGTTTTAG GATTGGAATT 3180

AGGAGCCGAT GACTATGTAA CGAAACCATT TTCAAATCGT GAATTAGTTG CTCGTGTAAA 3240

AGCCAATTTA CGGCGAGGTG CAACCAATGC GAAAGAAGCC GAGGTGACAA CACAATCTGA 3300

ATTAACGATT GGTGAT." iVA CCATTCATCC TGATGCATAC ATGGTCTCAA AACGGGGTGA 3360

AAAAATTGAA TTAACCCACC GTGAATTTGA GTTACTTTAT TACTTAGCAA AACATATCGG 3420

ACAAGTGATG ACTCGTGAAC ATTTATTACA AACCGTTTGG GGTTATGATT ATTTTGGGGA 3480

TGTGCGGACA GTGGACGTAA CCGTACGTCG TTTAAGAGAA AAAATTGAAG ATAGTCCAAG 3540

TCATCCAACG TATTTAGTTA CTCGTCGTGG GGTTGGTTAT TATCTAAGAA ATCCTGAACA 3600

GGAGTAATCA GGTATGAAGA AAAAAGTTCA CTTTTTTCAA TCTGTTAATT TTAAGATTGC 3660

TTTATCATTT ATTCTGCTGT TACTGATTGC GATTCAAATC ATTGGTGGCT ATTTCATTCG 3720

TGAATTGGAA GCCACTACAA TTTCCGATTT TAAAAAAAAT ATGGATTCCC AAGTTGTCCA 3780

ATTGTCAAAC ACGTTAAGTA CGCAGATGAG CAACAAAGAT CTCGAACGTA GTGACGTTGA 3840

TGCAAATTTA AAAAAAGCGT TATCTGATTT TTCAAATGCA GATATTTCTG AAGCGAGAAT 3900

TGTCGATGAT AAAGGGATTA TTCGGGCAAC CAATGATTTA AATCAACAAA ATATTATTGG 3960

GAAAAAGAAT GATTATCGTG ATTTAAATGA CTTTACGAGT AAAAAATATC AAGCTTTAGA 4020

TAATGATAAA CGCGTGTATG TGAATGTCCA GCCGATTCAA TCGCCTACTG GAGAAACAGT 4080

GATTGGCGTC CTTTATGTGA AAAGTAATTT AGAAAATAAA TACCAAGAAA TTACCAACAC 4140

AGCAAGTATC TTTTTCACTG CTTCTATTAT TGCCGCAGCA ATCTCGATTA TTGTGACTTT 4200

ACTGATTGCA CGATCAATCA CGAAGCCGAT TGGTGAAATG CGCGAGCAAG CCATTCGAAT 4260

CGCTCGTGGT GATTACGCTG GAAAAGTAGA AGTCCATGGA AAAGATGAAT TAGGCCAATT 4320

AGCAGAAACA TTTAATCAJ\T_ TATCAGAACG GATTGAAGAA GCACAAGAAA CAATGGAAGC 4380

AGAAAGAAAT CGTTTAGATA GTGTCTTAAC GCATATGACA GATGGTGTCA TTGCGACGGA 4440

TCGCCGCGGA AAGGTGATTA CGATTAATGA AATGGCCCTT TCATTATTAA ATGTAAAAAA 4500

TGAAAATGTG ATTGGGACCT CGTTATTAGA GTTGTTAGAT ATTGAAGAAG ATTACACATT 4560

GCGGAAGCTG TTAGAAGAGC CAGATGAACT GCTGATTGAT CGCTCAACGT CTGATCGTGA 4620

AGAAGACCAA ATGATTATCC GGGTAGACTT TACAATGATT CGTCGGGAAT CAGGATTTAT 4680

TACTGGCTTA GTTTGCGTAC TTCATGACGT CACAGAACAG GAAAAAAACG AACGGGAAAG 4740

ACGGGAATTT GTTTCCAATG TTTCTCATGA GTTGCGAACG CCTTTAACAA GTATGCGTAG 4800

TTATATAGAG GCTTTGA3TG AAGGAGCTTG GGAAAACCCT GAGATTGCGC CGAATTTCTT 4860

AAAAGTCACG TTAGAAGAAA CCGACCGGAT GATTCGTATG ATTAATGATT TGTTAAATTT 4920

ATCTCGGATG GACTCTGGGa ATACACATCT TCAATTAGAG TATGTGAATT TTAACGAATT 4980

GATTAATTTT GTCTTGGATC GCTTTGATAT GATGATTGAA AATGAGCAAA AAAATTACAA 5040

AATTTGCCGT GAATTTACTA AACGCGATTT ATGGGTAGAA TTAGATACAG ACAAAGTAAT 5100

TCAGGTTTTA GACAACATTT TGAACAATGC GATTAAGTAT TCGCCAGATG GCGGCGTCAT 5160

TACCTGCCGA CTAGTTGAAA CACATAATAA TGTCGTCTTT AGTATCTCGG ACCAAGGTTT 5220

GGGCATCCCT AAAAAAGATC TCGGGAAAGT CTTCGAGCGT TTTTATCGTG TGGATAAAGC 5280

ACGTGCGCGA GCACAAGGTG GGACTGGTTT AGGTTTAGCA ATTTCTAAAG AAGTAATTCG 5340

GGCCCATAAC GGGAGTATTT GGGTGGAAAG TACAGAAGGT GAAGGATCGA CTTTCTATAT 5400

TTCACTACCA TATGAACCTT ATGAAGAGGA TTGGTGGGAA TGATGAAATT ATCAGAATGG 5460

ATTACAAGAA TTGGCTTGAT TTTGATGGTC ATTTTAAGTA TATATTTTTC AGTCAATATC 5520

TGGCTGAATT CTGCCAAAAA AATACCAGAA ATGAAGTCGG GAAGCCAAGT CACAACAGCT 5580

GTCAATGAAA AAGCCATTGG CGATGTCTAT TTACCTTTGC AATTGATTCG AATAACTGAT 5640

GGAAAAGCGA TGCAAAGTAA TCGTGAAACA TTAATTAGTA ATGTTCAAAA TGATATTAAA 5700

ATGGCTACGT TTGGTAAATT GACACAAGTT GTGACAAAAA ATGCAGAGCA ACTTAAGCGC 5760

TACAACCAAA TGGAACAAGG CATTGAACTT CTTTATCAAG GtCCCTTTTT AATCtCgGAC 5820

TATGCTTCGA TTTATAATCT ATCCATTAAT TTTACTAACT TtAATGAGTT GACGGACCAG 5880

TATTTTACGA AAATTCAATT GGgAtTTTAA CGAAAATAAG ATACGTTTTT TAGATTATGA 5940

TCAAGCCAAC GTCTATGAAG CGCCCATGAC TGTCAATAAG GCGCGCTTAA TGGGAATTAT 6000

CAATAAAGAG GGATTACAAT ATCAAGACGT TTCCGAAAAT ACGCTAACCA AACAAGGGCA 6060

ATGTTATTTA ACCAATGATA TGAAGTTGAA AAAGTACAGT TATATCTTAG CTTCGCAACC 6120

AGTTACTCGT TTTAGGAATG CTTTTTTCAA TGAAACGGAA GATATCCAAA CCAATGAAGA 6180

CAGTCAAGAC TTAACCTACA CGAGTAAAGA AGAACGATTG TTTGCAGAAG AAAAACTGGG 6240

AAAAATTGAT TTTAAAGGGA CCTTGCCAGA AGAGAATAAA CGGGACTCAA TCTATAATCA 6300

AAGCTTTTCT TATGTAAAAC GTTTAGGAAC CAATATGGGG AATTTGCGTT ACIXTGATCG 6360

AACGAAAGAT AGTGTCAATT ATCGGACTTT TGTGGAAGGT TTCCCAGTGT TCAGTAATGA 6420

TTTAAAAGGC CAAGTGGATA TTCGCATCAC GAACAACGAT GGTGCTGCAC CAAGCGTAAC 6480

CATTAACACA AGTGTGAATA CGATCCAAGT GCCGATTCCT TCAGAAGAAG AAGTGACGCT 6540

GGAAAGCACG GAAAAATTGA TTAAGCGTTT AGAAACGGCT GGTGCTAAAA AGGAAAAAAT 6600

TCAATCGGCT GTTATCgGTT ATACGTGGCA GACAATTGAA GAAGTCAAAC AAGTCGTTGA 6660

TTTAAGTCCG GAATGGTATG TCCTTTATAA CAACAATTGG TACACTGCAA CGGACTTGGT 6720

AAAACAATTA CCAAGCTTGG AGGTGGGCTA ATGGATTTTA AACGAATTGA GTGGATTTTC 6780

TTCTTAGCAT TTTTAGGATT AAATATCTTT TTATTTGGAA TTTATCGGGA AGGTCAACAA 6840

GAGGAGAGCA ACGTCTCTTC TTCCAGCCAA ACAGAATCGA TTGAAAGTCG GTTGGAAAAA 6900

GATAACATCT CGTATAAAGG GACACTTTCT TCAGAACGAT TGGAAGGTTA TTATTTAAGT 6960

GGCGAACAAA CCAATTTTTC TGCTGCTTTA AAAATCCAAC GTGAAAAGAA TAAAAATTTT 7020

TTGAGAAATG GGCTGCAAAT TGCGGATAAT ACTTTAACGA GTGTACCTAG TAAAAACTAT 7080

TTTATTGATC CTAAGAAAAT TGATAAAGAT TTAAGTACCT TTTTAAATGA AAAAAATGCT 7140

TTATTATTCG GAGACGAATA TCAATACTTA CCAGAATTTT CTCATTTAAA AGAGCCGACG 7200

GCAGAAATTG TGGCTGCACA ATCGTATAAA GGAATTCCTT TTAGAGACGA CACGGCAAAA 7260

TTAAGTATTT TAGCAGATAC GTCAGGTGAA TTAT 7294



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

nATTTCATCT AACTTTTTGA CGCCnTAACA CCCTATTATC ACTTTTTACA TCTTATTTCG 60 

ATTTAAAAAT AACGATAAAA AGCCCGTTCC TAACAGAATT ACTGCTGAAA CATAATAAGG 120 

GAAAAAGTGA TTGATATCAA ATAAAGCTCC TGCTGCCATT GGTCCTAAAA TATTACCAAA 180 

ACTTGTAAAT GTCGAATTTA GTCCGTTGaT GGTTCCTTGT TGATCTCCAG CATGTTTGGA 240

TAAATAAGTA GTTACTGCTG GTCTAAACAA GTCAAACGCT AAAAAGACAA TAAACGTTGa 300 

AAATACAACA ACTAAATTAT TTTTTGTAAA GGCAATCACG GCAATAAAAA TGGCACTTGC 360 

AAAAAAGGTT AATTGGATTA AACCCATTTC ACCTATTTTT TGGACGATTG CATCAAAGAA 420

AAACAACTGA CAAATCAACG CTAAAATACC ACTAACCGTA ATCACGATTG CTATTTCACT 480

TGTGGTAAAG CCAAAATTAA TAGTCGCCAT AATACTATAA ATAGATTCGA ACGCTTGCAG 540

GCCAAAAGAG GAAATTAAGA TAATCACAAA TAATGAGGTA AACATTGGAT TTCTTAAAAT 600 






ATCCATAAAT GAACCTTTTT TCGCTTCAAC AGCGGCTAAA ATTCGTTTCT CTGGCTCCTT 660

CAAAACAGTT AATGTCAAAA TAAAACCAAT AAACGCTAAA AATGCGGCTG CAAAAAACGG 720

GGCGCGAATA CCTAAATAAG CaATAAAACC ACCAACTCCT GGTCCGATAA TAAAACCACC 780

ACTAATTGCG GCTGACACAA GCCCCATCGC TTTCGGACGT TCAGAAATCG TGGTCATATC 840

TGCCACAAAG GCTGTCACTG ACGGCATTAA TAAAGCGGCG GCAATCCCGC CTAAACCACG 900

AGAAATATAA AAACCGCTTT TCGCTTGGGC TAAACCAAAT AATAACTCAG AAATTGAAAA 960

CACCAACATG CCCGTTGCAA TCATTTTTTT CCGACCAATT TTATCCGAAA GGACACCTGC 1020

GATTGGCGAT GTGATTAATT GGGCAATCGC AAAAATAGAA ATCATCATTC CCATTGTGGT 1080

ACCAGAAAAA TGCATTTCTT CTTTTAATTG GGGAATTACG GGAATGACTA GGCCTACGCC 1140

TAGAAAAACA AGAAATAAGT TAGAAATTGC TAAGTACATC ATACTATTTT TTTTCGTCAT 1200

AATTTGCCTC CTCCATAAAA AAACAGCCAT CAAAAATGGC TATTTCTGTT TCTTTGTATG 1260

TACTTTCTTT GACAGTGTTT CGGCTGAAAC ACTCGTTGAG TCATTGAGAT ACATTTTACA 1320

CTAATTCACT GAAAATTTCA CTAGTTTTTA CTTATTTTTT TAAAGGAAAT TGTTATTTTT 1380

TAAAGACATA ACTTAAGTAT CCAAGTATTT AATGAATCGC TTTTGTTTTT TATCAGAAAC 1440

ATAGCCAATG TGTTCATAAA ATTGATGTGC TGCTGTACGC TGTTCCCCTG AATTTAAGCG 1500

AATCGCAGAC AAGTTACACT TTTTAgCTcC GTTcTAGTTT CGTCATTAAt TGtTTACCAA 1560

TTCCACGGCC TTCTGCTTCT TTTGCCACTG CTAATCCCaG AACGTtCAAT AACGTTTGGG 1620

AATAAATTGT TTCATAC\CT TGCGCATGCA CATAGCCTAA AATCTTGCCt GTCTTCGTTT 1680

CTTCATAAAC ATATAAAAAC TCTCTGTCAC TGGCTAGTAG TAATTTCTCA ATCTGTTTAC 1740

AGGTCACGTC AACGGGAACT TCATAGCCCA ATTGTTCTTT ATTAATGATA GTTAACGCCG 1800

CTGCATCTGT TGGCTTCATT TTTCTAATCA AGGGTAACAC CCTCCTTTAC ATTCTTACTT 1860

AAATAATAAC GAAAGCATAA TATCCGCAAA ATCATTTAAT TCTGAATCAT TGTATTGCTC 1920

TAACAAAACT GTTTCGCCTC GATACTCTTT TAAAGCTAGT TGTATCAGCG AATGATGATT 1980

AGTAGGAAAA TGTACTAGGC CCCATTCCCC GCCAGCCCTT TTTGATAAAA TTTTATTCTC 2040

TTGCTTAAAT GCCATTACTC TACACATATT TAAAATAACG TACATAGGTT GTTGAACAAT 2100

TTCTTTTTTT GCCTCTTTCA CATCAGCTAC AATACTTTCA AAATATTTCT TTTGATCTAT 2160

CCAGCCAAAA ACGGAATCAA TCGGTTTTCC TACGAGTACT TTTCCATATA ATTTTGTAAT 2220


PATAAf^ATfiT V^/"l X r\r\\Jt\ 1U1 TCCCATrATA TCTTCAATAT ATTTATTTTT 2280


ATTAGTCAGA TATCTGGAAC GATGTATCGG AGAAAAATGT AAATCGAAGG GAATCGGATA 2340

ACTCAAATTT ATTGTATCTT tTAATTGtAG CACAkGGAAT TCTAtCCCTT TTTTTGGJcGc 2400

CCACTTATCT AAAATATTAA TTGTATAATC TAATAATTGT TTCTTCTCAT CAATAGTAAG 2460

CTTTTCATAA ACCACAATCA GTACATCCAA ATCACTTACT GAATGACAAT AGGTTCCAAA 2520

AGCATAAGAA CCATGAAGAT ATATTCCAAC TAAATTTTCT TTTATTATTT TCATATATCC 2580




ATTTTCCATT TGAGCTAATA ATTGATGAAC CACGATTATT CCTCCATTAT TGATTGACAA 2640

TTGCTATAGT TTTTTCGATA AATAAAGAAC TAACTCATCA TCATTAACAC ACCGTTCGTT 2700 

TGGTTTCAGC TGCTTATTTT GAAACCAGAC TCCTGAACCA TCTGGCACAT ATCCTTGTTT 2760

GATATATAAT CTTTGCGCTG CACCATAACC AGGATGCAGG CCTACTCCCA AAGAAACAGT 2820 

ATCTGCATAA CTTTTGGCAA CATTTTCTGC ACGATTTAAT AATAACCGCC CCACTCCTTG 2880 

TTTTTGAAAT GGTAAGAATA CATTGAAATC AGCTATTTCA GGATATAAAT TTTTAAAAGG 2940

TCCTTCCTTA GCTAATGGCA GTAACGTCAC ATAACCTAAC AGCGTTGCCT TTTTCTCCGC 3000 

TACAAAAACC GTTCGTTGCT TGACTAATTG TTCTTCTAAA TAGTTCATTA AAATCTCTTG 3060 

GCGACTTGGC CAACCTTGTT GGrcAAATTG AGTATCCAGC GCTAAGATAT CTTTCTCTTG 3120 

CATTTCTCTA ATAACCACTA GCTCAGACAT CAGCATTTCC TCCCGAAAAG CGTTTTCATT 3180

TTATAAAACG AAGCGAATAA TTCTTCGCTC CCTTATTCAT TTTCAAAACC ATCGAGTTGA 3240 

TAATATTTGG GCAAAACAGC AACACATGTT TCTTCTGAAT TAACAGCAAT AACGAACCCT 3300 

TGCCATAAGG nCCACGTCAC CGCTTCACCT AATTTCTCTT TCACTCGTTG nATCATTTCA 3360 

GCTTGCGTTA AGTCTCGTGC AGGrAAATGT TGAACTTTAT CATTACTGAT TAAAAAAATA 3420 

AA 3422 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



CCGGACCATC AAAATCACTT TTTTCCTGCT CTTTTnAAGA AACGAAnGGA CCCATGTCTT 60

TTCAGAAAAG TTTGTTACAA TnACAAGCGT AGAGAGGATG TTAGGGTATG GATAAAATGA 120

AAGCTATTAC TTTTTTTGAT TTnGATGGAA CTTTGTTAGA TGGAACATCG CAAATAACTC 180

CTGAGATTAC CGCAGCAGTG GCGGCCTTGA AAGACAATCA AATTTTACCT TTAATTGCGA 240

CAGGACGGAC CTTGTGTGAG ATTCAGCCAA TTATGAAAGC CAGTGGCATT GATTCTGCGA 300

TTGTAATGAA CGGGCAATTT ATTCATTATG AAGGAAAAAC GATTTATTCA GATGAATTTA 360

CAACAGAGGA ATGTGTTTCA CTGCATGAAC ATGTCAAACA GCGTGGTCAT GAATTGGCTT 420

TTTACAATGA GCGCCGTATT TTTTGTACAG GGCATACTGG CACAGTGAAA CAAGCCTATG 480

ATTATATTCA TTCAGCTGTT CCAGAAATTG ATCCCACAGG TTATGAGAAT GATGCCGTCA 540

ACATGATGTT GGTCTTAAGT CAACATGGCG ATGATGATGA ATATTACTAT GAGCGGTTTC 600

CTGAATTGAC CTTTTATCGC AATGGTCCTT TTTCAATAGA TATTGTTCGT AAAGGGGTCT 660

CTAAAGGCTC TGGTGTCAAA AATCTTTTTA ACACCTTGGG ATTAAACGGC ATTCCGACTT 720

ATGCCTTTGG TGATGGCATC AATGATTTGG CGCTGTTTGA GGfcCTGTGAT TATGGAATTG 780

CCATGGGTAA TGCGCGAGAA GAACTAAAAG AAAAAGCCAC CTTTATTTCA ACTAAAAATA 840

CTGAGAATGG CATTGTTAAT GGCTTGAAAA AATTTGATTT ACTATAAGAG AAAACTACCT 900 

TCTTCTATTG ATGTACTTTA GAAGAAGGTA GTTCTTTTGT ATGTTCATTT TTCTTTTTCT 960 

GCTGTGCACA GAGAGCTACT CCCACTACTT GAATCAGCAA AATCGCTCCC AACAAACAAC 1020 

TTAATACGAT GTTTCCTGAT AGGATTGTtC TTCTGATTAA AATAATTAAT ACAACACTGC 1080 

TAATTCCGTA GAGTAACCAT TGGTTGTCCA TTTTTATTGA CCCCTTTGTT TTTCTATGTT 1140 

TAGGATAGGA TGTTtCCTAC TAAAATGCAA GGnAAGATCC TAGATTCTTA CAACTAACCC 1200 

ATAATTTTTA CTAATTCCTT ATTTGTTTTC ATTTATTCTT AAGATTCATT TCATTTTTGg 1260

TAAAAAATCA CACTTTTTTT CTATCTCTGT TTTAAATTGC GTACAGTTCT TAAAAAAAAJc 1320 

GATACACTAA AaGTGaACTC GaAAkGAArG GrGCCATtCC TTTGGtTAAT AGCGAAAAAA 1380 

GAAAACAATT AATTATTGAT ACATGCATTT TAGCTGGAAA AATTATGCTG GAAAGCGGAT 1440

CTGAAGTATA TAGAACCGAA GATACAATTA CTCGGATTGC AGCAAATGCT GGCGAGCCAG 1500 

AGAGCGTTTG TTACACGACG GCTACAGGGA TTTTCGTGGG CTTTCGTTCC AGTAATTATA 1560 

CGCAATTAGA AAACATCCCC CAGCGCTCGA TTAATTTGGA AAAAGTTAGC CTTGTCAATC 1620 

AGTTATCAAG AGAATTTGCG CAAAAAGAGA TTACTTTACC AGAGCTTTAT CAGCGCTTAA 1680 

CTCTATTAGA AACAGATACC CCCACTTTTT CAATTTCGTT ACGCTTGTTA GCAGCCGGGA 1740 

TTGTTAGTTG TACGTTAATG TACATTTTTG GTGGCACTTG GcAAGATTTT ATTGCAACTT 1800 

TTTTTGTTGG CGTGATTGGG TATGCCAGTT ATCTTTTTAC CCAAAAGTTA TTCCAAGTTC 1860

CTTACCTCGA TTCATTCGCA GCCGCTTTTG TCATTGGTTT GTTGGCCTAT TTGGCTGTGC 1920 

ATTTTCACTT AGCGGTCAAC ATCGATAATA TCATTATTGG TGCGGTAATG CCGTTAGTTC 1980 

CAGGTGTGGC GATTACGAAC TCTTTTCGCG ACATTTTAGC GGGCCACCTC ATCAGCGGGA 2040 

CGGCGCGAGG GACAGAAGCT ATTTTTATTG CTGGGTCTGT TGGCCTGGGA ATTGCCTTGA 2100 

TTTTTAAATT ATTTATGTAA GGAGTGTTCA AATGGAACTA ATTTTTCATC TTCTCTTTAG 2160 

TTTTTTAGCG ACTGTGACGT TTGGTATTAT TACGAACATT CCTCGTAAAG CACTGGTGGC 2220 

TTGTGGCATT ACTGGCATGA CAGGCTGGAT GATTTATTAT GTCTTAACGC AAACTTTTGA 2280 

TGCTAGTCAA ACCTTTGCAA ACTTTTTAGG AACGGTCGGG TTAGGAATCG CTAGTATCTT 2340 

TTTTTCCCGT TACAAAAAAA TGCCGATGAT TATTTTTTAT ATTCCGAGTT TGGTTCCTTT 2400 

AGTGCCAGGA GGCCCTGCCT ATCAAGCAGT TCGGAGCATT TTGTTAGGAA ATATTGATGA 2460

CGGTTTGCAG TTGATTTTAA AAGTTGTCTT TACTGCTGCA GCCATCGCAG CGGGCTTTAT 2520 

GGTGACCTCG CTACTGGAAC GAATTGTCAA ACGTTTTTTC CCCAAGCGAC TCTTCCGATT 2580 

GGTCAAAAAA TAAAAAAACA AGCTCATTTT CTGAGCTTGT TTTTTTATTC TACTTCTTCA 2640

AAGAACAGCA ACAAATCTTG TAGCGTTAAT TGTGCTTTTT CAGCAGCGTT TAAATCTTGA 2700 
ACAATTTTAC CTTTTTGTAA AACAATTAAA CGATTGCCAT ATQGTAAAGC ATCTTCCATT 2760

CGATGGGTAA TCATTAAACA AGTTAAGTTC CCTTCTTCAA TCCGTTGTTC CGTTAAGTGC 2820 

ATCAACTGCT TGGATGTTTT GGGATCTAAG GCAGCAGTAT GCTCATCTAA AAGTAATAAT 2880 

TCTGGTTTTG TGATAGTGGC CATTAATAAA CTTAAGGCTT GGCGTTGACC GCCAGAAAGG 2940 

TTACCTGTCG GTGTATCCAG ATGATTTTCC AAGCCATTGC CAACTTCTTG GCAGATTTTT 3000 

GTAAACAAGG CGCGTTGCTC ATTTATCTTC CGTAATGTTA AGGGACGTTT TTGTCCGCGT 3060 

TTCATGGCAA GCAATAAATT TTCTGCCACA GTCATTCTTG GTGCTGTCCC CATTTTAGGA 3120 

TCTTGAAAAA CACGCGCTAG AGATTTGGCC CGTTTGACTT CAGAAAGATT GGTGCGATTG 3180 

ACTCCTTCAA TTAAAAATTG TCCTTCTGTT AGTGTCAAAG TTCCTGCTAA ACTATTGAAT 3240

AGCGTACTCT TGCCTGCGCC ATTTCCACCT AAAACAGTAA TGAAGTCCCC AGGATAAATG 3300 

GCTAAGTTCA CATGATCCAA AATCACTTTT TTTTCATTCA TCCCATTATC AATGATTTTT 3360 

GTCGCATTTT TTAATTCAAC AACTGGTTTC ATTTCGTTTC CCCCTTATCA GAAAAGCCGT 3420 

GTAGTCGTAA TGCTTTTTTC AATTGTGGGA TCATTAAACA GATTGCTAAA ATCACTGCTG 3480 

AGAAGATTTT CAAATACGTC GTATCAAAGC CTAATTTAAT CACTAATAAA ATCAATAGTT 3540 

GATAAATAAT ACTCCCTACA ACAATCGCCA TCAGCCGTTC ACCAAACGTT AATTCTCCAA 3600 

ACAGTACTTC ACCAATAAtG AGTGAAGCCA AGCCAATCAC AATGACACCG GTTCCTTTAC 3660 

TTACATCCGC ATATCCATCA tTTTGGGCAA TCAAAGCGCC aGATaAGCCA TCAAGCCATT 3720 

CGaTaAAACC AaTCCTAAAA TTTTCATGCG ATCCGTTTGA ATGCCAATCG AGCGAGCCAT 3780 

TTCTTCATTG TCACCTGTCG CAATATAGGC TTGTCCTAAA CTAGTGTTAA AGAAAAATAA 3840

CAGTAAAATC ATTACCACAG CCAAAACACT AATCCCTAAA AAAATAGTAT CATAATAATC 3900 

GGGTAAGGCC ATTTTTTGAA AAACATCTTG AATCTTCGGT TGGTTTAATA AAGACTTATT 3960 

TGGCGACTTC ATCACAAATA AAATCACGGA ATTTAAACCA GACATCACTA AAATTCCTGC 4020 

TAAAATAATT GGAATTTTTC CTTTTGTAAA CAGTAAACcA GTCACTAAAC CGGCAAGCAT 4080 

TCCTGCTCCG ACCCCTAAAA GAGTCGCTAT AAATGGTGAA ACCCCTGTTG TAATGGCAGT 4140

TACACAAACC GCCCCCCCTA ACGGAAATGA ACCTTCTGTG GTCATATCTG GAAAATTTAA 4200

AATACGATAA GTCATAAAAA TACCTAGTCC TAACAAGGCC CATAACATTC CTTGTCCAAT 4260

TGCTGAAACA ATCATCGCTT CTCCTACTTT CTTTGATAAT TAGTTAATCA ATTTCGATTT 4320

TTTTGCTAAC TCTGCGGGAA TCGTAATTCC TAACTTTTGA GCTTGTTTTT CATTGATAAT 4380

TGTGTCGCCT GTATTAAAGG TATAAATGGG TGTAGTTGCT GGTTGACTTT TTCCAGACAG 4440 

AACCTCTGCT GCCATTTTGC CGGCTTGTAC CCCCAACGCA TGTTGATTAA TCCCTACAGT 4500 

TGCTAAACCG CCTTGTTCTA CCATTGTATC CACCGAAGGA ATAATCGGCG TCTTGGTTCG 4560

ATTTGCTTCT CCGACTACCG TTTGCATGGC ATTAGCAATC GTATTATCAG TTGGTATATA 4620

AATCACATCT GTTTCCTTAG TCATCACTTG AACCGTTTGT GCAATTTCAT TACTTGATGG 4680

CACAGCATAT GTTTTCACTG TCAATCCTTT GGCTTCAGCC GCTTTCTGCG CTTCAGCTAC 4740

TTGGTATCGA GAATTTTCTT CTGAGGAAGC ATAGAGAATA CCTATTTTTT TGGCAGCTGG 4800

TATCAATTCT TTTAGTAAAT CCATCTGGGC TGACAAAGGA GATTGATCAC TAACACCCGT 4860

GATATTACCG CCTGGCTTTT GATTGTCTTT CACCAATCCT GCACTAACAG GGTCCGTAAT 4920

GGCGCCTAAA ATGATTGGGA TTTCTTGGGT AGTGTTGGCT AAcGCTTGCG CAcTGGCGTG 4980

GCGATGCCAA TTAAAACGTC TGCCTTTTTT TGTACTAATT GTTGACTCAT GATTGCTAAC 5040 

TTACTCTGGT CGGCTTGACC ATTTTGAAAA ATAATTTCTA AGTTTTTGCC TTTGTCATAG 5100 

CCATTTTCCT TTAATCCTTC TTCGACACCT TTATAAATTT CGTCCAATGC GGGGTGACTG 5160 

ACAAACTGCA AAACCCCTAC AGTAACTGTT TCTTTCTTTG AATTCGTCAC TTTTTGATTC 5220 

ACAAAAAAAG CACCGATTAA ATAAATAGCT AATAAAGACA CGACAAACGT TAAACGTTTA 5280 

TTTTTCATTT TGCATTCTCC TAACTGCCCT TTTGTTTGGC AACTTTTTCT TTTTAATAAT 5340 

AAAACCATTA TCTATAGGGC GTTTCTTCCG TTGAAAGTGA AACTGAGATT CGTAATCCTC 5400 

TGCTTAGTCA AAATCCAAGG AACACTTTTT TACAAATGTC AGGATTTAGC TGCGACGAGG 5460

ATTGCAACTT TGGTGAGAAT CGCAGGAGTA CTTTTTCTAC GAATGGGACA ACGTTCTCTT 5520 

CGTTCACCTT GTACTACTCA TCACTCTTGA AGGCCAAACT GCGGCGAGTA TACGCAGCCT 5580 

GCATCTACTC TGTCnCTTCG TGCCAATCGT AGTGCTTGTC AGGATTTAAC TGCGACGAGG 5640 

ATCGCAACTT TGGTGAGAAT CGCAGGAGCA CTTTTCTACG AATGGGACAA CGTTCTCTTC 5700 

GCTCACCTTG TACTACTCAT CATCTACTCT GTCACTTCGT GCCAATCGTA GTGATTCGTA 5760

GCAAAAAAAC AACGACTTAT CCTAAGTGTT TAGGAGTGGT CGTTGTTTTT ATGCAAAACA 5820 

AAAACCACAT AGATGGGTCC TCTATGTGGT CGTTTTTTCT TCTTGAATAT AACCAGCATA 5880 

GATACACTTT TTAGAATAAG TCTTCTAAAG TTGTATCTAT GAAAGAATCA TACATACAAC 5940 

TATCTACGCC AGTTTTGCCA TGATTGGTGC ATTTCTTTTA CGTTAGATAA TTGTGTCATG 6000 

TATTTTCCCT CCTAGTTATG TTGTTAAAGG TAGTCTACAC TATATTGATT TTGTTCGTCA 6060 

AGAAAAAGTT TTTATTTTCT TGCATTTCTT TTACTTCTAA AAAACGAGCC ACTCTTATGA 6120

GAGGCTCGTT TTTCTTTATT TCAGCATATT TACCACTTAA TTGTTTCGGT TTCTTTCAAT 6180

TTTCACGGGA ACTTTAACTG CTTGTTTTTG CGATTGAATC AGCCATAAAC TTACTAAAAT 6240 

CGTCATTGAA ATTACTAATG CCATCGAAAA TCCCCTCTTT CTAATTCCTT AAAACTTTCT 6300 

ACAATCTAAG TTTAACATAG CTAGCTATTA TTCACCAGTT TCTGGTACTG TATCAGTGCT 6360 

TTCTGTCGTC GTTTCAGTAG ATTGAGCAAC GCCAGTCGTG CTGGTCGTCG TCTGTGTTTC 6420

CGCGGTCTTT GCTGCTTCTT TTTCAGCTTC TTGCGCCGCT TTTTGCTGGT CTTTCAAAAC 6480

GCCAGCCATT GGTAATCGAT TTAATTCTGG TATTAATTGT GTTTTTAAAT CCATAGATTT 6540

TTCGACAACC GTCCGTGTCT TTTCTTTGCC GTTTTCAAAG AAAACAAGCG TAATCTCATC 6600 

GCCGCTAACA TTGTATTGCT CTTGTGCTTT GCCAGCATTA TAAATTAACG GATAGATATA 6660 
CAGCGAACGA TGAAATTCAT TCATTTTATC TGAATTAAAC AGCTTCACCA TTTGATCGTA 6720

TTGCTTGCCA GATGGTTTAG CAAACATGAC TGACATGGCT CCTGTTTCTT GGATTTTCTT 6780 

GTCTGCTGTT TTGTAATCTA AGACACTGAC TTTCTCTGAT TTGATGGGTC CTGCGACTAA 6840 

CGATTCAACT TGAGCAGCTC GAATATTTTT TTGCATCGTT GGAATAAAAA TGATTGAGCC 6900 

AATTATTAAT ACTAATGCTA CAATAGCGGT AATCATTAAC AGTTTTCTTT GTCTTAATTC 6960 

ATAAAGTGCC TCTTTTAGTT CTTCCATAAA ATTATTCAAC TTCTAATCGC TCCTCGCTTT 7020 

TAGTTCTTAT CTATTATACT AATCATTTTC TTTCTTGCCT ACCCTTGCGC TTTAATTACT 7080

AAAAAAACCA GCAAACTTCC CCAGCTTTCT AAGGAAAACC CAACTCCTTT TTAACAAGCT 7140

TCTTTCTTAA ATTCAAAAAC TCCGCTATAA TAAAGCTATT AGTCGGAAAG AAACGAGGGA 7200 

TTTATTATGT CATTTGaTGG CGTATTTACT CATAGCATGG tTCATGAGTT GaCAGAAACA 7260 

CTCGTAAGTG GCCGGATTTC AAAAATCCAT CAACCTTATG AAAACGAAGt CGTTTTAGTC 7320 

ATTCGTGCGA AAGGTAAAAA CCATAAATTA TTATTATCAG CACATCCTAG TTATGCGCGC 7380 

ATACAACTAT CTACAATTAC CTATAGCAAT CCGGAAACAC CTCCTAATTT TGTGATGATG 7440

TTACGAAAAT TTCTAGACGG TGCCATTTTA GAAACGATTC AACAAATAGA AAATGATCGC 7500 

GTGATTCACT TCACCTTTTC AAAAAGAGAT GAACTAGGTG ACTTACAAAA TATTGTTTTG 7560 

ATTGTTGAAT TAATGGGACG TCACAGTACT ATCGTTTTAG TTAACCAATC TTCTGGAAAA 7620

ATCTTAGATG CGATAAAACA CATTGGAATG AGCCAAAATA GTTATCGTTC GTTGCTTCCT 7680 

GGAGCAACTT ATATTGAACC GCCAAAACCA ATGGGACTTA ATCCGCTCAC GGCCTCAAAA 7740

GAAGAAGTCT TTGCGCTTCT TTCTACAGCT CCTGAATTAA ACGGACGCTA TTTACAACAA 7800 

CATTTCCAAG GTCTAGGCAA AGATACAGCA GACGAATTAA GTGCCCGTTT ACTTGCACAA 7860 

CCAAATGAAA AAATGGCCAT CTGGACGGAA TTTTGGTCCT CAGTCACTGA AGCTGTTGTA 7920

CCGACATTAA CAGTTACTGA AAAAAAAGAA TATTTTACGC CCATTCCTTA TCAATCATTA 7980 

GTTGGTGAAC AAACAACGTA TGCGACACTC AGTGAATTAC TAGATGCCTT TTATTCTGGT 8040

AAAGCAGAGA AAGACCGTGT CAAACAACAA GGCGGCGAAC TAATTCGCAA AATTGAAAAT 8100 

GAACTAAAAA GAAATCAAAA GAAACTTAAA AAATTACAGC AAACCTTAGC CGATACAGAA 8160 

AACGCTGAAA ATTACCGTCG CGACGGAGAA CTATTAACAA CCTTCATGGC ACAAGTCCCA 8220 

AAAGGAGCCA CCGAGGTGGA ATTACCAAAT TATTATGAAG AAAATGCACC GTTGCGTATC 8280 

TCGCTAAATC CTGCACTTTC ACCTAATCAA AATGCACAAA AATATTTTCA AAAATACCAA 8340

AAATTGAAAA ATGCTGTCCG AGTTGTTAAA ACACAAATTC AGCAAACACA ACAAGAAATT 8400

TCTTATTTAG AGTCAGTTGT GGCACAACTT GAAATTGCGA CACCAATGGA CATTGAAGTG 8460

ATTCGTGAAG AACTGATTGA ACAAGGCTAT TTAAAACGTA AAAAAAATAA AAAACAAAAA 8520 

CAGCCGAAAA AAAGCCAACC CGATCTTTTT TATGCCACAG ATGGCACACC TATTCTCGTT 8580 

GGAAAAAACA ATTTGCAAAA TGATCAATTG ACTTTACGAA CCGCTAAAAA AACTGATTAT 8640

TGGCTCCATG CCAAAGATAT TCCTGGATCT CATGTAATTA TTCGTGACGC ACATCCTAGC 8700

GAAGAAACCC TTACTGAAGC CGCACTTCTA GCAGCTTATT TTTCAAAATA TCGACTTTCT 8760 

TCACAAGTGC CTGTTGACTA TGTCCAAGTA AAACATGTGC ATAAACCCAA TGGAGCAAAA 8820 

CCAGGTTATG TGATTTATGA AAACCAACGA ACGTTATATG TGACGCCAAG AGAAGAATCG 8880 

ATCAAAAAAA TCCAGcAAAA TAAGGCTTCT GCTTCCTAAT TGTATTAAAC TAAAAAACAA 8940

CGTACTGcCG ATTGATCGTA CGTTGTTTTT TTGCATTGGC AAAGACATTT ATCAAGTGAT 9000 

AAATAACACA AGTAAGCTCA ATTAGTTTAC AAACTAGCTA TTTCTCtCTA TACTGACTGT 9060 

TGGAGGTGAA ATTCATGGTC TTTTTACACA TTATTTTTGG GATGTTTGCC TTTATTGGCG 9120 

CTTTAGGTTT AGGTGTCGCC TTCCGCGATA GCTTTACAGA ACTTTCAACA CC 9172 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

ATTATTGATC GAGAAAATGA AATTCGAGAA TTTAAACCTG AAGAATATTG GAGCATTGAT 60 

GGGAATTTCC AAAAAGGTCG CAAGAAATTC AAAGCAAATT TCTGGGGCGT AGATGGTAAA 120 

AAGAAAAAAT TACCAAATGC AGAAGCGGTG AAAGAAATCA CTTCTCGGAT TGACGGTAAA 180 

GATTACGATG TTACAAAAGT TGAGAAAAAA GAACGGAAAC GTAATCCTGC ATTACCTTTT 240 

ACAACCAGTA GTTTACAACA AGAAGCGGCA CGTAAATTAA ATTTCCGTAC CCGTAAAACC 300 

ATGATGGTTG CGCAACAATT GTATGAGGGA ATTGCTTTAG GGAAACAAGG CACTGTGGGG 360 

CTGATTACCT ACATGCGTAC AGACTCAACC AGAATTGCTG ATTCCGCCAA AGCAGAAGCT 420 

GCTGAATTCA TTGAAAAGAC cTACGGAGAT GAGTTTTCAG CACATGGCGG CCGCAAAGCA 480 

AAAAATACGC AAGGTGCACA AGATGCCCAT GAAGCGGTTC GTCCGTCAAG CGTTTTAAGA 540 

ACACCTGATG AAATGAAGAA GTATTTAGAT AAAGATCAAT TAAAATTATA TACTCTTATT 600 

TGGTCGCGCT TTGTTGCTAG TCAAATGACA CCAGCGATTT TAGATACCAT GAaGGTCACG 660 

CTTCAACAAA ACGGCGTAAC GTTTATTGCC AATGGCTCCA AAGTGAAATT TAAAGGGTTT 720 

ATGCAAGTAT ATGTTGAAGG ACGCGATGAT GGTAAAGAnG ACAAGGAAAA ATATTTTACC 780 

TGAAnTAG 788 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TTATGATTAA GGCAACAACG CCCCAAGCAC GATATGGAAA TCCCACGCCA CGTGTAGCTG 60 

AAACACCTAG TGGTATGCTA AATGCCaTTG GATTGCAGAA TCCAGGTTTA GAAGTGGTTA 120 

TGCAAGAAAA ATTACCGAAA TTAGAAAAAT ACCCTAACTT ACCGATTATC GCAAATGTTG 180 

CAGGTGCTTG CGAAGAGGAC TATGTGGCTG TTTGCGCAAA AATTGGGCAA GCGCCTAACG 240 

TCAAAGCAAT TGAACTGAAT ATTTCCTGTC CCAATGTGAA ACATGGCGGC ATAGCTTTTG 300 

GTACAGATCC AGAGGTTGCT TTTCAATTAA CACAAGCAGT CAAAAAAGTC GCCTCTGTTC 360 

CGATTTATGT TAAACTATCA CCAAACGTAA CGGACATTGT GCCGATCGCC CAAGCAATTG 420 

AAGCAGGGGG CGCAGACGGT TTTTCTATGA TTAATACATT GCTAGGGATG CGCATTGATT 480 

TAAAAACGAG AAAACCTATT TTAGCCAATC AAACTGGCGG ACTATCTgGC CCAGCCATTA 540 

AACCTGTTGC TATTCGCTTA ATCCGACAAG TCGCCAGTGT TTCTCAGTTG CCAATTATTG 600 

GAATGGGCGG TGTTCAAACA GTGGATGATG TCTTGGAAAT GTTCATGGCA GGTGCTAGCG 660 

CTGTCGGTGT TGGTACGGCC AACTTTACAG ATCCATATAT TTGTCCGAAG TTGATTGACG 720 

GATTACCTAA GCGAATGGAA GAGTTAGGTA TTGAATCGTT GGAACAATTA ATTAAAGAAG 780 

TGAGAGAGGG GCAGCAAAAT GCACGATAGA CCAATTATTG CGTTAGATTT TCCCACGCAA 840 

AAAGAAGTGG CTGTTTTTTT AGAAAAATTT CCTAAGGAAG AAGCCTTGTT TGTTAAAGTG 900 

GGGATGGAAC TTTTTTACGC AGAAGGACCA GCGATTGTTC GCTGGCTTAA AGAACAAGGA 960 

CACGATGTTT TCTTGGATTT AAAATTACAT GATATTCCGA ATACTGTGGA AAAAGCGATG 1020 

ACAAATTTAG CAAAATTAGG CGTTGCCATT ACTAATGTCC ATGCAGCAGG GGGCGTTCGA 1080 

ATGATGCAAG CAGCTAAGGA AGGTCTAATC AAAGGGACTC AGCCAGGCGC TAAGGTTCCT 1140 

GAGTTAATTG CTGTAACCCA GTTGACTTCT ACTAGTGAAG AAGAGATGCA CCATGATCAA 1200 

TTAATCAACG TTCCTTTAGA GACAAGTGTC ATTCATTATG CTAAATGTGC GGAAAAAGCG 1260 

GGTCTGGATG GTGTGGTTTG TTCAGCATTA GAAGCACGTG GTATTCAAGA AGCAACCAAG 1320 

CAGACGTTTA TTTGCTTAAC ACCTGGTATT CGTCCAGCAG GAAGTGCAGT AGGTGACCAA 1380 

CAACGGGTCG TTACGCCGCA ACATGCTCGC GAAATTGGGT CCACATATAT CGTTGTTGGC 1440 

CGACCTATTA CCCAAGCAGA AAATCCTTAT GAAGCCTATC AAGAAATCAA AAAAGACTGG 1500 

AGCGAAAAGT AACATGACAA AAGTAGCAAA AAAGATTGCC AAAGATTTAT TAGATATTGA 1560 

AGCGGTATTT TTAAACCCTA ATGAGCCGTT TACCTGGGCA AGTGGCATAA AAAGCCCGAT 1620 

TTATTGTGAT AACCGTATCA CGATGAGTTA TCCTGCTGTG CGTAAAGAAA TTGCAGAAGG 1680 

CTTAGCGGCA AAAATTAAAG AGACTTTCCC AGAAGTCGAA GTGATTGCTG GGACAGCTAC 1740 

CGCAGGGATT CCACATGCGG CATGGGTAGC CGATATTTTA GGATTACCAA TGGTTTATAT 1800 

CCGCAGTAAA GCCAAAGATC ACGGCAAAGG GAATCAAATT GAAGGCCGAA TTTCTGAAGG 1860 

ACAAAAAATG GTAGTTATCG AGGATTTAAT TTCAACAGGC GGTAGCGTGT TAGAAGCGGC 1920 

AGAAGCGGCG GAACGTGAAG GAGCAACGGT TTTAGGCGTT GCGGCAATCT TCACTTATGA 1980 

GTTACCCAAA GGAACGGCAA ATTTTGCAGA TAAACAGATG ACGTTACTAA CGTTAACCAA 2040 

TTATTCAACT 1IAATTGATG CTGCATTAGA AGCAAATTAT ATCGAAGAAA AAGATGTTAC 2100 

CTTATTACAA CjAATCjGAAAA AAGATCCTGA AAaCTGGGGC AAaTAACCGT TGAACTAAaG 2160 

AAGGTTCTTG TCGTCTGATT TTTAGTGTCA TTTTCAGTAA CCAGATTTAA CGGTAGGAGA 2220 

ACTTAGGTCA TTTGCCGAGG AGCGTAATGA TCTAAGTTCn TTTGTTTGAC GTAGAAAAGA 2280 

TGTTAGGATA GAGTTTTAAC ATGAGTTTTA CCTGATTACT ATGTGGAAAT GrAGGaAGAG 2340 

TGTATGAAAA AAATAAAAAA TATGGACGTk GAATGGAGCT ATACAGGCCA ACGATGGACC 2400 

CAGAACATTG GCATACGCTA TGTGACTGGT TTGCAGAAGG TGCTAAGTTT GCTTATCAAT 2460 

CGCCAaTTGC TTTAGAAAAA GAAaGTGCTG AGACAGTAAA TTCTCAGATT ACGTTTCATT 2520 

ATAAAAAAGA AGAATTCACT GAAAAAGAAT TCAAAAATAC GTTTCATTTT GTACCACCAA 2580 

ATACGGAAAG TTACGTAATG TTTGAAAATG TTGCGTATCA TTTAACCGAT ATCCATTTTC 2640 

ATATGCCTAG CGAACATTTG CTATCGGGAA AGCAGTATCC GCTAGAATTC CATTTGGTTC 2700 

ATATGAACGA TGCTGGCGAA AATTTAGTTG TCGGTTGCCT GTTTACCATC ACAGAGGAAG 2760 

AAAATCGTTT TTCAGAAGCG AATCATCCGA TGGACTGGGA AAACGGCACG CACCAACAAT 2820 

GGTTTAATCC GTCAATCTTT TTACCAGAGG AGCGTCTACA CTACCATTAT GTAGGCTCGT 2880 

TAACAACACC GCCAACTAAG GGTCCAGTGA AATGGTTTGT ATTTGATACC ATTCAAAAAA 2940 

TGGATCAAGC CTTTTTAAAT AAGATTAAGG AAGGTATGTT GGCCTTTAAC AATCGGCCCC 3000 

TTCAGCCGTT GAATGGGCGT AAGATTTATT TTTCAAATGA TTAGTGTGTG GAGGAAAAAG 3060 

GGTTGAATAT TCAACAAATG AAGTATGTTG CTGCCATAGC TAACAATGGG AGTTTTCGGG 3120 

AAGCAGCCAA AAAACTATTT ATTACACAAC CTAGCTTGTC AAATAGTATT CGAGAATTAG 3180 

AAGAAGAACT AGGGATTTCC CTCTTTTTAC GTACCAATAA GGGCGCTTTT TTAACAGAAG 3240 

AAGGAATGGT TTTTTTAGAA CAAGCAGAGA AAGTTTTAGT TCAAATGGAG TTATTGGAAA 3300 

ATCGCTATCG CGAGACAGTG ACCAGCGAAC GATTCTCCAT CTCTTCGCAA CATTACGATT 3360 

TTTTAGGTGA GGTAATTGCC AAAGTTCTTA AAAAGTATGG GGATCAATAT AAAGATTTTC 3420 

GTGTCTTTGA AACCACGACC TTAAAAGTAA TTGAAGATGT CAAAGGATTT CACAGTGAGC 3480 

TGGGGATTAT TTATTTAAAT GAACAAAATA GTGTCAGTAT TGAACGGTAT TTAGAACAAG 3540 

CTAATCTAGC CTACGAGGTA ATTAGTACTT TTAATACACA CATTTTTTTA GGAAATCATC 3600 

ATCCTTTAGC CAAGCAAAAA GAAATTCATT TAGAGGAATT AGTTCCCTAT CCACAAGTTC 3660 

GCTTTAATCA AGAAGGAAGC AATTTTTCCT ATTTTTCTGA AGATTTAGTT GAAATTCCTG 3720 

AACAGGAATC GGTCATTCAT ACTACGGATC GTGGAACCTT GATGAATTTG TTGGTAGAAA 3780 

CCAATGCTTA TGCTTCTGGT TCAGGAGTCG TAACGGGTTT TACGAAGAAA GAAATTCGGT 3840 

TAGTGCCATT GGCGCCAcTT TGGAAAATCG TATTTGTTTA CTCTTTCCCA AAAATCGGGA 3900 

AATCAGTCCA ATTGGTCGCT ACTTTATCAA AGAnTTAAAA GcGTTGTTtA AAAAAgAGGT 3960 

AGATACCAAA ACATTTmGCa CTAAGGaTaA AAAGAAGAGg TCGGGACAGA AGTGTTTAAC 4020 

TCCGAGAAAT AAGAAGGAAT TTCCGAAAAT TGTTCTTTAA TTTTCGGAGA ATTTCGGCTT 4080 

ATTTCCGAAG GATTTGCTTC TGTTCCCGCC GTTTATTAGT TTTTGAGCGT GGAGCAAAAA 4140 

TCCAAAGTGA TTTTTGTCCC ACGCTCTTCT TTTTTGGCTC ATTTTGAAAC AGTAGAAATG 4200 

GTTkCTCTTT CAATTAATGT GACAGGCACT TGAATGTCTT CTATGATTTG AAGGTCTTGT 4260 

TGCATCGTTT GTAATAACAC TTCACCAGCT AGTTTTCCCA TTTCTTTAAA GTC 4313 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



ATTnTTnCAT 

X i 111 X 1 1 X 


GCGATAAATG 




TTTATrAGCC 


GTftAAftrfTA 




60 


PTATATTTTT 

^ X ** X am X X X X X 


CCGAAATTGA 


TACGGCTAAA 


TTTfSAPTTTT 
xxx \jrv^ x x x x 


ATGATftATGA 


A A AATTTHTT 


120 


TTaGATTTCT 

x x t*. wn xxx *w x 


TACACAAACA 


TCATATTTTA 

x \~*fi x *i j. x x x r\ 


TTAGTfTATG 

X X eWJ X ^Vri X nJ 




CAACTGGCAA 


180 


CAACCCGATC ACTTCCGTAT TGTTTATTTA CCGAAGATGG AAGACTTAAA AACTACAGCA 240 

GATAAAATGC GTGAATTTTT AAGTACCTAC AAACAAAAAT AAAAGAACTA CCAAAAAATA 300 

GCCTCTAACG AAGCTATTTT TTGGTAGTTC TTGTTTTATC AACAATACTA TTCTGCGGTT 360 

TCACATGCTT TTTTACTAAT AATCATTCGG TTGGTATATT TTTCTTCCAT ATCAGGACTA 420 

GGATTTGCTA CTACGATGAG ATAAGAATTT TCATACTCCT TCTCCACAAA GCCTGAATAA 480 

TTTGTTTTTT CCCAAATGAA GTGAACCTTT TTCTCTTCGG CCATACAATA ACACTCCTTT 540 

TTAATAAAAA TGAACCTATT TATTTTCTAA GAAAACCTTA GATAATGTTC TTGCAGTATA 600 

TTATGCAATA AAGTACAATA AAAGCATAAT GCGAATTCGC ATATTTTGAC AAAATCCCCA 660 

TTTTGCAAGG AAAATGGCCG ATATAAATAG GAGCTATTGT GATGAATTCT TTAGGTGCAG 720 

TTATTAAAGA AATTAGAAAA AATAGAAAAT TGACACAGAA AATGTTATCA GAAGACATCT 780 

GTTCGCAAAG CGTTTTAAGT CGCATTGAAA ACAATGAAGA GTTGCCAAAC GTTCTCGTAA 840 

TGCAACAACT TTGTGATCGT TTAGGTGTGA CTGTTGATCA GATTATGCGT TATAAATCTG 900 

GTGATGTGCA CGTTGTTACG TATTCATTTG AAAAAATGGC TGAATATTTT CGCCATAAAA 960 

AATATCAATT ATTATTAAAT TATTTAAAGG AAAATCGTAT TGAAGAACAA TTGTATTTAG 1020 

ATACGGACTG GCAAAAATAT TATTATTATT TAGGtAGTTG TGAATTGTTT GTTCTAAATG 1080 

ATTATGAAAA AGCTATTGCT AGCTTGCGTA AAGGGTTATC TTTCACCTAT AAAGCCGATA 1140 

AGTTAAATGT TTCTGATTTA GAGATTCAAT TAATTAGTTG TCTAGGTGCC ACTTATGGTT 1200 

ACATGGGCAA TCGAGTAGAA GCAGAACGAT TTTTAAGTTT GAGTATTCAT TATTTTAATC 1260 

AATTGCCAAA TGAACGCAGT AATGCAGAGT TAACGAAAAT ATTTTTTAAT TATGCTGATT 1320 

TCTTATTTAA AAATTATACA GAAAAAGATG CCGAAATTTA TGTGGATCAA GGAATTACTT 1380 

GGGCCCGCAA AAAAAATAGT TATTATTATT TAAGTGAATT ATTAAATCTA AAATATTTAG 1440 

TTTTAATGAG AAAGAATAAA CACGAAGAAG CAGAACGTTA TCTGAACTTA GCACAACAAA 1500 

TGAAAAATGT GGAAAGTGGC AATCTTTAAA AAATCCTCAG AATTGCTTAA AAGGTCTACA 1560 

CCATTTCTTG ACTTTTGTAC TGATTAAGAG TACTTTTGAA GTAAGTAAAA TAAAGAAAGA 1620 

AGGATGAAGA TGAAAACATT TATCTTTGCA GACAAGTTCT TTTTAAAAAG CGATGTTAAA 1680 

GGACCTGGTT ATTTAGAAAT CACGGATGGT ATCTTTGGTA ATTATACAAA AGATGAACCG 1740 

CAAGGGgACG TAAAAATTAT TCGTGAGGAA GGCAAATGGA TTGCCCCAGG TTTAGTCGAT 1800 

ACTCATATTG ATGGTTACAT GAATCATGAC GTAATGGATA ACGATGCAGA AGGCATTAAA 1860 

GTGATGTCTG AAGGCTTATT ATCTTGTGGT GTAACTTCAT TTTTACCAAC AACGTTAACT 1920 

TCAAGTAAAG AACGTTTAAC AGACGTTGCA CGTACGATTG GTCAAGTCTA CCAAGAAGTA 1980 

CCTGGTGCAA AAATTCAAGG GATTTACTTT GAAGGACCTT TCTTTACTGA AGAACATAAA 2040 

GGCGCACAAA ATCCAAGTTA TTTTGGAGAT CCAGACCTTG ATACATTCCA CGAATGGCAA 2100 

GAAGCTTCAG GCGGAATTAT TAAGAAAATT GCGTTGGCAC CAGAACGTAA TGGTGTGAAA 2160 

GAATTTGTTG AAACCGTTAC AGATGAAGGT GTCGTTGTTG CTTTAGGCCA TAGTAATGCG 2220 

ACCTTAGAAG AAGCTGATGT GGCAGTTGAA GCAGGCGCTA gCGTTTTTGT TCATGCATAT 2280 

AATGGGATGC GCGGCTTAAA TCACCGTGAA CCAGGGATGG TTGGTGCATT GTTAACGTTA 2340 

CAACACGTTT TCTCTGAATT AATTTGTGAT GGACACCATG TACATCCGCA AGCGGCTGAA 2400 

GTATTGATGG AAAAAGCGGG CCATGATCAC GTTGCGTTAA TTACAGACTG CATGATGGCT 2460 

GGCGGTATGC CAGATGGCAA CTATAATTTA GGTGAATTTC CTGTTGTGGT AGCAGAAGGA 2520 

ACGGCTCGTT TGGATACTGG GAACTTAGCA GGTAGTATTT TAAAACTAAA aGAaGCAaTT 2580 

AAAAATGTAG TTGATTGGGG TATTGCAACA CCTGCACAAG CAATTATGAT GGCTTCATTA 2640 

GTTCCtGCGA TTAGCTGTAA AATTGATGAT CAATGTGGCA TGATTGCAAA TGGTCGCGAT 2700 

GCTGACTTTA TCGTATTAGA ACCAACCATG GAATTAGCAG CCACTTATTT AGATGGCGTA 2760 

GAACGTTATC GAGCATAAGA CACAAGAAAA AGCTTGCGGA AATTTATCCG CAAGCTTTTT 2820 

TTATAATTCT TTTGGTAAAA TCGCAGAAGT GATTTCTTTG TTCTCGTAGG CAATAAAAAA 2880 

ATCGATTAAT TGCTTGTCAA ATTGCTCAAT ATCCTCTTTG TAAAGGGAGC TAGGAATCAC 2940 

AAAGGTTTGC TCTAGGGTGC CTGCTAGATA ATCGCCATCT AAACTTTGGG CGACAACTAA 3000 

AAATTCTTGT TCGCGCCAAT GTTCCACATC TGTTTCAGAA ATGGTTTCTG GAGAGGCTAA 3060 

AAATTTACAA TTTTCGACAA ACTGTTCATC AATTTCTTCA CTGAAAGTTA ATTCAGCATC 3120 

CTCTTCTTGA GGTTGAGAGA GGATTGCTTG CAAAAATGTT TGTGGTTTTG CATCGGGTAA 3180 

TGTTTCTTTT AGCTGTTCAA AATCCAATGA AAGTTCCTCC TTATTGTTTG TTTAAAATAG 3240 

TTACTGAAAA ATCAATGGGT GAATTTGTTT TAAGGCTTTT GGCTTCTAAG ATGATTTTTT 3300 

CAGCCAACAA ATCGTTTCCG TTATATACAG GTGTCACGCG ATAGCGAACG GTTTCGCCGC 3360 

GATTAAGAGC TGCCCGAATC TGATTTTCAT ATTTCGTCAT ATAAGGAGTG TTGACAGGAT 3420 

TCTGATACAA GGTGGTTAAG TTACGTGGAT CATCACCACT GCCTCCCATT TGACGGCCGA 3480 

TTAGGTGGCC ACGCGAATGA TTAGCTTTTC CTGAAATGAA ACCAGCTGGT CGAATATCTT 3540 

TATTAGCGGA AGTTCCTGTA TTGACCATTG CGGGTTTTAA TAGCGCATCA GCGCCTGTCG 3600 

CTCGTTTTAG ACGATCTAGA GAATGATAAG TAATCCAGCC GTTTGAGTTA TCCTTTAATT 3660 

CCTCTGCTGA AAATGTGGCT TTACCATTTT CGATGGGTnC GGGATTATCT CCTTTTGGTG 3720 

CTGGTGTGCT TGAAaGGGgT GGTTGTTTGT GGTTGAACAT TAAATAAATC TTGTAAAGCA 3780 


TCAGGAACTT TAACGCCAAC CGCGCCTAAA ATTAAAATGA TTAATGCGCC AATCAGrATG 3840 

ATGGTTGAAT TAAATGGTGG TTTCTTCTTT CTGTTTGCCA TAACGTGCTC CTCTAATTTT 3900 

ATACTTGCAA TAGTCCAAAT TTAAGTGGAC CATCTTTTCT ATTATACGTA AATTTGCTAA 3960 

AAAATCAAAA AAGGCTTTTC TTTGAAAGAC TTGTTTTCTC AGATATGTTA CTATTTTAAT 4020 

GTAGAAGGGa AgGGATAGTG GATGGCCTAT ATTGAAGTCA AAAATGAATA CAAGCGTTAP 4080 

CAAATGGGAG AGACAACTAT TACTGCGAAT GACGGCATTT CTTTTGAAGT 4140 

GAAGTAGCGG TTATTTTAGG CCCCAGCGGT GCtGGAAAAT CAACTGTTTT AAATATTPTT 4200 


GGTGGAATGG ACAGCTGTGA TGAAGGGGAA ATTATTATTG ATGGCACAGA TATCGCTCAG 4260 

TTTTCAGAGA AACAATTAAC CACTTATCGA CGAAATGATG TCGGGTTTGT TTTCCAATTT 4320 

TACAATTTAG TCCCAAATTT AACGGCAAAA GAAAATGTCG AATTGGCTTC GCAAATCGTT 4380 

GCGGATGCGT TGGATTCGAC CAATGTTTTA CAATCGGTGG GTTTAGGGGA ACGTTTAGAT 4440 

AATTTTCCAG CGCAACTTTC TGGTGGTGAA CAACAACGAG TAACCATCGC CCGTGCGATT 4500 

GCTAAAAAGC CCAAGTTACT TTTATGTGAT GAACCGACAG GAGCGATGGA TTATGAAACA 4560 

GGGAAACAAA TTTTAACCAT TTTGCAAAAC ACTGCTCGTG AAACTGGAAC GACCGTCTTA 4620 

ATTATTACCC ACAATTCAGC CATTGCAGAA ATGGCCGATC GGGTAATTCG AATTAATGAT 4680 

GCGAAAGTTC GCGAAATGAC TGTGAATGAT CAGCCTAAAT TGGTTGCTGA AATTGAATGG 4740 

TAGGTGCCTA AAATGAAGAA AACAGCGCTA ATTAAAACAA GCTTACGTGA AATTAGACAA 4800 

TCTACTACAC GATTTTTATC GATTATGGGG ATTATTTTCC TTGGTGTAAT GGTCTTTGTT 4860 

GGTTTGAAAG CTACTGGACC AGATATGATT AAAACCGCGA ATAATTATTA CCAGAAAGAG 4920 

CAGCTTCCAG ATGCGCGAAT AATTTCTACA ATGGGACTGG AAAAGAAAGA TTTAACAACA 4980 

CTACAGTCTT TAAAAGATGT CGAAACAGTT GTGCCGCGGT ATACAATAGA TGCCACGATT 5040 

GGGCCGCAAA ATAATGCAGT AAAATTGTTT GGCTATCGAA AAAATCAAGC AGGATCGGTT 5100 

AACTATCAAG TAGTTGACGG TCGGTTACCT AAACAAACAA ATGAAATTGC ATTAGATACA 5160 

TTAGCGAAAA CTAGATATGA CTATAAAATG GGTGATAAAA TCACTTTAAA CGATGCAGCG 5220 

ATTAAAGAAA AAGGGCTGAA ACAAACTCAT TTTACTGTTG TTGGATTTAT CAATTCACCA 5280 

GAATATATTG ATAATACATC CAGGGGAACC ACAACGGTCG GGTCTGGAAC GTTAAACTTT 5340 

TTTGGCGTGG TTTCGGAAAA AGCCTTTGAC AGTCAACGGT ATCCAGAATT GCTAATCTCA 5400 

TTTCGTTCAT TAAAACATCA AAATACGTAT TCCTCTGATT ATGAAAAGAA AAGAGAGCAG 5460 

GCCTTAAATC AAGTAAAAGA GGCACTGAAA AATCGGCCAG AAGAACAGGT TGCGGCTCTG 5520 

CGTGATTCGG CTCAGCCGGA CATTAACCAA GGTCGGCAAG CGCTGGAAAC GGGAAAACAA 5580 

GCACTTGCGC AGTTGGaGCA ACAACCAGGA ATTCCAGCTG AGATGCTTGA AAAGCAAAAA 5640 

GACGAATTGG CTAAACAAGA ACAACTGTTG GCTGAAAAAG AAACAGAGCT AGCCAATTTG 5700 

AAAGCCCCaA CGTATTATTA TTTTACTAGG GAAGACAATC CAGGATTTTC TGAATATCaA 5760 

GATAACGCTG ATCGGATTTC TTCTTTAGCC ACGCTGTTCC CGTTGTTCTT TTTCCTAATT 5820 

GCGGCATTAG TCAGCTTAAC TACCATGACA CGTATGGTTG AAGAAAAAAG AATGGAAATT 5880 

GGTAGTTTGA AAGCTTTAGG TTATCGTAAT CGTGAAATTG CTTCGATTTT TATTACCTAT 5940 

GCAACGGTTG CCAGCTTAAC AGGTGCATTA TTAGGATTGG CAGTTGGCTA TTATCTATTC 6000 

CCTAAAATAA TTTTTGATGc CTACGGTCAA ATGTACAATA TTCCTGATTT AGTTACACCG 6060 

TGGTACTTAA ATTACAGTTT ATGGGGCATC ATTGTAGCCT TAGCTTGTAC TGTGGGAGCG 6120 

GCCTTAGTGA CATTAAGAAT CGATTTATTA AGTACCCCAG CTACCTTGCT AAGACCCAAA 6180 

GCGCCGAAAG CAGGCCAACG GATTTTACTA GAACGAATCC GCCCTTTATG GCAACGCATG 6240 

AGCTTTATTC AAAAAGTGAC CGCTAGAAAT CTGTTGCGCT ATAAACGACG GATGCTGATG 6300 

ACCGTTATCG GAATAGCTGG TTGTATGGGG CTGTTAATTG TTGGTTTTGG TTTACGCGAT 6360 

TCAATTGTGG ATGTCGCTAC AATTCAATTT AATAAAATCT GGCATTACCA AGCAGTCGTG 6420 

ACTTTTAAAG AGCAAACAAC ACCTGAAGAG ACAAAAGAGT ATCAAGAAAC GTTGCGCAAA 6480 

ATGGATGGTT TGAATAAAAC AATTCCGTTA TATTCAGAAA TCTTTAAAAC GAAGGGCAAA 6540 

GGCGCACCAA CTCAAAATAT TACTCTTTAT GTGCCACAAG ATCCTTCGAA AATGGCTGAT 6600 

TTTCAATTAT TTAATGATCG AGTCACTGGA GAAAAATATT CCCTCAACGA TGACGGTGTA 6660 

ATCATTAATG AAAAATTAGC TAAATTATTT GGCTATAAAG TCGGAGATCA ATTAAATTTA 6720 

GAAAATAGTG ATAATCAAAC CTATCACGTG AAAATTGCCG CGATTGCAGA GAACTACACG 6780 

GGTCATTTTG TCTATATGAC GCCGAAACTC TACCAGACGA TGACGAAACA GAAGCCGGAG 6840 

TATAATACAG AATTCCTACT TTTTGACAAA AAGTTATCCT CAAAACAAGA AACCTCAATC 6900 

GGAGAGGCGT TGACAAAACA GCCGAAAGTG TTGAACATTA CGTTTTTAAC TGCGATGAAA 6960 

GGTTCGTTTG ATGATATGCT GAAAAGCCTT GATATTGTCA TTTGGGTACT AATTGCTGTT 7020 

TCTGGTTCTT TAGCGTTAAT TGTGTTATAT AATTTAACCA ACATTAATGT TTCTGAACGG 7080 

ATTCGAGAAC TTTCCACGAT TAAGGTTTTA GGGTTTTACG ATCGTGAAGT CACCACCTAT 7140 

GTTTACCGTG AAAACATTAT TTTAACGTTC ATTGGTATTA TTGTTGGCTG CTTCTTTGGG 7200 

AAAATTTTGC ATCAGTATAT TTTAGCAACC GTTGAAGTTG ATTTAATTAT GTTTTCGCCA 7260 

ATTATTCATT GGCCAAGTTA TCTGTATTCC GCAGTCATCA CCATGTGTTT CACGTTGTTT 7320 

GTGATGGTCA TTATGCATCG GAAATTGAAG AAAATCAATA TGATTGAAGC GTTAAAATCA 7380 

AATGAATAGC ACATTTTAAA AAaCGAGGAG TCCGTAGAAG TTCGATAATT CTAGGGCTGC 7440 

TCGTTTATTG CTGCTTCTTT TATTAAGAAA AAATAGGTCT TAAGATGTAT CTTTTGAGAT 7500 

TTATTTGGAA ATTACTACTC TTTTTATTTA AAAATTGGTA CACTATTACT AGGTTATAGC 7560 

ACAGAAAGGA GTGTGGGAAA AGTGTTAGAA GGTCTACCAT TGGAAACGGT TTATTTTTAT 7620 

GCATTAGTGG GGAGCsCgGT CCTCGCTTTT TTACTAATTA TTTTTGGGGA TGTTTTTAAT 7680 

TTTGATGGAC CGGTTGaCCC AATGCTGATT ATTCCATGGA TTGCTTTTAC CTCGCTGTTT 7740 

GGGTACCTTG GGGAAGAATT AACAGCGGTG AATAGTTGGC TTATTTTAAT TGTCAGTGGT 7800 

ATTCTTTCAA CCATTATTGT TTTCTTCCTA AATTTTTATG TGTTGGTCCC TTTGAAAAAT 7860 

TCGGAGGCCA CAATTTCTAT TTCGGAAAAA GACATGGAGG GTCGTGTTGC TACTGTGATT 7920 

ACCCCAATTC CTGTCCGGGG AATGGGCGAG ATTCAATTGA AGAGTGTCAC AGGTTCGTTA 7980 

AGTCGACCAG CAGCATTTTA TGTGCCACAA GAAGTGGCTG CGCCAAGAGG TAGTGAGGTA 8040 

TTGATTATTG AATTGAAAGA ACGGGTTTGT TATGTCATTC CTTACGAAGG CAGTCTAAAA 8100 

ATATAGAGTA GGAGAGAAGA AAATGGGGAT TTTGTTACCg ATTATTATTG CaGTTTTAGT 8160 

ATTATTAATG TTATTAATTG TGTTTGTTTC TAAATATCCA GACAGCCcAA ACCTGATGAA 8220 

GCGTTAATTA TCcAGCGGGA GCTATCTAGG CTCTAAAAAT GTTCATGTAG ACGAAGGTGG 8280 

CAACAAAATT AAAATCGTTC GTGGCGGTGG TGCGTTTGTC TTACCAGTGT TCCaACGTTC 8340 

AAATCGAATT AGTTTGCTTT CAAGTAAATT AGATGTTTCC ACACCAGAAG TGTACACAGA 8400 

ACAAGGGGTT CCTGTGATGT GTGATGGGAC ATCAATCATT AAAATTGGTT CGTCAGTTGA 8460 

AGAAATTGCG ACAGCAGCGG AACAATTTTT AGGAAAAACA ACGGAAGAAT TAGAAAATGA 8520 

AGCACGTGAA GTATTAGAAG GACATTTACG TTCGATTTTA GGTTCAATGA CAGTGGAAGA 8580 

AATTTACCAA AATCGTGATT AATTTAGCCA AAGTGtACAA GAAGTTGCCa GTGTTGACTT 8640 

AGCTAAAATG GGCTTAGTTA TTGTGTCGTT CACAATTAAA GAAGTTCGTG ATAAAAATGG 8700 

ATACTTGGAT TCATTAGGGA AACCAAGAAT CGCTCAAGTT AAACGTGATG CAGATATTGC 8760 

AGAAGCAGAA GCCTTGAAAG AAACTCGCAT CAAAAAAGCA GAAGCAGAAA AAGAATCACA 8820 

ACAAGCGGAA TTGCAACGTC AAACAGAAAT TGCAGAAGCT TCAAAAGAAA AAGAATTGAA 8880 

ATTAGCGTTA TACAAACAAG AACAAGATAT TGCCAAAGCC A 8921 
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 






TGCAAATGCA 


CGA6P,Aftr,AA 


PAAAApAArp/-» 


GAACArGGAA 


CAGTAAAATG 


GTTTAACGCa 


60 


GAAAAAGGTT 


T T TT A T 

X 1 UOU J. 1 l fx i. 


PTPnrp.ppaa 

^- 1 CrtL-u^Lartrt 


GACGGAAGCG 


ACGTATTCGT 


ACACTTCTCA 


120 


GCTATCCaAG 


P-TP-Zitp-Pttt 

\J I vxn 1 oo 1 I X 


U AAAAG 1 1 1 A 


GAAGAAGGTC 


AAGCAGTGAC 


TTTTGATGTT 


180 


GAAGATTCAG 


AL.L*G I GGGGG 


TP A A PP* APTT 


AACGTAGAAA 


AAAACTAATA 


AAAAATAATC 


240 


CAAAAACACC 


1 AAGAl GGG 1 


TAGGTGTTTT 


TTTGTTAGAA 


AAAATAGAGA 


AGAAAAATGG 


300 


ACAAGCATTT 


1 1G1 IG1G 1A 


rp m tv t\ 7\ *"P P" 7\ P" 7\ 

I 1AAATGAGA 


GCTTTATTGA 


TCACTAACTG 


TGGCAGTCAT 


360 


AATCATTGGC 


nrn at>p a t^pp 


GA 1 GAGGTTC 


TGTTTGTTCA 


AACAAGAAAG 


GCTGTAAAGC 


420 


GGTCGTCATG 




ATTTTGCTTC 


GGTACAGTTT 


TTATCTTTCA 


TTGCTTCACG 


480 


TAGAGCATTG 


Art 1 AAAAG AG 


Gl iggccttc 


ATGAATCATA 


TCACCAGACT 


CACGCATGTA 


540 


GACAAAACCA 


pppp a a t\ t\ a 
L,vj<jGAAAGAA 


TGTCTGGACC 


AGCCATAATT 


TCTTTATTTT 


TAATATCAAC 


600 


TGTTGGTACC 


Clevis n n a pi* a 


AACCT TCTTC 


AGATAAGATA 


CGACGATCCC 


GTAAAACAAC 


660 


ATTCCCGATA 


T P 21 PPP A f A 

1 GAG G GAG AG 


P AT>n"PPPJ\ rn r> 

GATTTCCATC 


AACGTACACG 


TCATTAGCGT 


TGAAGTGACC 


720 


AGCAGGGCGT 


GGGG 1 ATCAG 


CCGTTAAAGC 


TAAGACATCG 


CCGTTGCCCA 


TGATAAAGCA 


780 


ATTTTCTTCA 


P* A A p , p*p , p"t p* 

GGAAGGGGTG 


TATCTTGAGC 


GAGCGAAcAT 


GGATTTTTAA 


CATTCTAAAT 


840 


TCACCATGAA 


GAGGGA.' OAA 


A rr« 7\ Tfprnnr^PP 

ATATTTAGGC 


TTCATCAAAC 


GAAGCATTAA 


TTTTTGCTCT 


900 


TCTTGTCCAC 


pptp a pp" a a 
I GAGGAGA 


AGTATGGATA 


TTATTGATTT 


TTCCGTGGAT 


AACTTCTGCA 


960 


CCGGCTTCAG 


rt. i MA 1 AALa 1 I 


GAT T AAACGG 


TTAACGCTTG 


TCGTATTCCC 


AGGAATTGGT 


1020 


GAACTTGAGA 


21 A A P 2i 2i PPPT 


AIGGGG TGGT 


TGAATACTAA 


TTTGGCGGTG 


TGTACCGTTA 


1080 


GCAATGCGAC 


ta ap-pcpppjip 


PZITRPPTTPT 

GAi AGG I I L 1 


CCTTGAGAGC 


CAGTACATAA 


AATCATTGTT 


1140 


TCATTTGCTG 


nPAATTftATT 


p ApTTPPPPP 
L-rtu 1 i V^l^GGG 


GCATCGACAA 


ACGTTCCTTT 


TGGCACTTTG 


1200 


ATGTAGCCTA 




ATTPApAATP 
Ail G AG AA 1 G 


GCATTTTCCA 


TGCTACGACC 


AAAAACAGCA 


1260 


ATTTTGCGAC 


P T TTT TP A P 


uuL 1 GGAI GA 


GCAGCTTGTT 


GTAAACGGAA 


AATGTTTGAA 


1320 


GCAAAACTGG 


PAAAf^2\T21I\T 


PPP BPP A TP A 

GAG GAI GA 


ATTTTTTCAA 


AGATTTTTAA 


AATAGAGGTT 


1380 


CCAATTGTTT 


i i 1 V— oOri 1 1 1 


TPT aaa ppfpr 
1 (j 1 AAAGG 1 G 


GGTATTTCAG 


CATTGGTACT 


GTCTGAAAGA 


1440 


AGACAAAGGA CGCCTTCTTC ACCTAATTTC GCCATTCTGT GTAAGTTGGC CGGCTCACCA 1500 

ACGGGTGTAA AGTCAAACTT AAAGTCACCT GTTGCGACAA TGTTACCAGA AGGCGTTTTA 1560 

ACCACAACGC CCAAGGCATC AGGAATACTG TGAGTCGTCC GGAAAAAGCT AATCGCTGTT 1620 

TTTCTAAAAC GAATCACCGT ATCTTCATTA ATTTCGTGTA ATTCCGCTTC TCGCAATAAA 1680 

CCATGTTCAT CTAGTTTGTT GGTAATTAAA GCCAAGGCTA GTGGTCCCGC ATAGATTGGG 1740 

ATATTCGCTT GACGTAAAAG ATAAGGAACG CCACCAATAT GGTCTTCGTG ACCGTGGGTA 1800 

ATAACGAGTG CTTTGACTTT ATGCAAATTT TGAACGATGT AGCTATAATC TGGAATCACG 1860 
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TAGTCAATCC 


C AAG T AAA T C 


ATPTTPTPPP 


AaTTTPaTPP 
AA1 1 1 bAl Lb 


pappaTPaao 1 
bAbb A 1 bAA 1 


AA I bAl AATT 


1 OO A 

1920 


TCATCTTGGA 


ATTGTAPPPP 

X X o x nvwV_.VwVv 


aTATPTPTTT 
rtX/iX o i X X X 


rn rn rp A A TTT 
1 1 ibbAAl 1 1 


bbbb 1 AAAbb 


Tv/^«/-'/" , 7V rnrp/^/^'TV 

AbbbATTGCA 


1980 


AAAACGCCAG 


TTTPPTTATT 
XXI v_vj X X rt X 1 


111 lAlu 111 


Ab 111 b A 1 b 1 


rp tv 7\ 7\ TV rn /~» /~> 

1 AAAAbTbbG 


TTAGAGTGAA 


2040 


PTppnrATCiT 


TPTTPPTPPT 


a app a ap a tp 


11111 bb 1 bl 


A A T 1 A 7\ rp rp TV TV 

AA1AA1 TGAA 


CGTATTCAAT 


2100 


a TTPT A tppp 
ri i 1 ij XrlX vjVjo 


PTaTTTTPTT 


LrtHL-lrtAl lb 


bbbbAbbAl 1 


AbATCbbTAT 


CGGCTTCCAT 


2160 


p t a a. a p a p a t 

b 1 nnnvjnun 1 


TTTPTPTPTT 


1111 bb 


fp rp HP /"^ T 1 T\ rp rp 

b 1 1 1 b I bA I I 


TTTGTTTCTT 


GATAGTATAC 


2220 


1 1 1 b 1 AAA1 b 


aTPPaTTPaT 

AXLVjAI ILni 


1 1L1 bb 1 1 LA 


bb 1 Ab 1 AATA 


P«PJV/ , /^Tt TV rnrn rn 

GTAGCAATTT 


ATAGTTTTTT 


2280 


1 b A i x 1 1 MJ\\j 


apanaaaaTT 

ALAUAAAM X 1 


AAAbAbAAbA 


AAAbbAbbTG 


CTGATCACCT 


ft Tt TV T\ 71 T\ TA TV m 

AAAAAAGAAT 


2340 


ppTpaTpaaa 


aapaTPTPTT 


AA x b A 1 b AAA 


TPa APPrTpr 1 
1 bAAbbb TbG 


ACTTTGCTTC 


ATTCTGGGTA 


2400 


X i lunl 1 1 In 


X vj 1 ij X 1L1 1 1 


X 1 vvMLLb 1 b 


bbb IbTbTTG 


t>/^tv tv m TV 1\/V)I TV 

TGAATAACAA 


CAATTAAGCT 


2460 


TaTTTTaTpa 


TAPaaTPaTa 

X ttu/Ui X b A 1 A 


app a BPTaTn 
AbbAAb 1 A I A 


PAA A/** , ACT , /^A 

b AAAb Ab T b A 


CTGGACTTTT 


AAAaAAAACA 


2520 


a aPTTPTTPT 


TTPPPPTT 7i T 
X X ^bbb 1 1A1 


1 b I 1 b A 1 b AA 


bTTbbAGTGC 


CTTTTTTTTA 


ggacgccaat 


2580 


appaaPTTa a 


pppa a tp app 

OLLAAX bAbb 


1 bAbb 1 AAA 1 


A A A/^A 7\ 01/""/^ 

AAAbAATCGC 


m tv tv t\ m f m 7i 

TAAAGTCGTA 


GTCATCAACG 


2640 


XriX 1 1 LVjoAb 


aaTTaaapap 

AA1 lAAAbAb 


bbAA 1 bbbAA 


Arp7V7\TV7\OTVTVTV 

ATAAAAGAAA 


CATGGAAATA 


AATAGTTTCC 


2700 


HI 1 1 Abb ill 


AbbAbI AA1 X 


TTTTTCTCAA 


AGAAGGCTTC 


TTGGATATAT 


TTTTGATAAT 


2760 


a a p. a art a app 

Art u>iHo AAb b 


1/WirtAl 1 lb 


TTATAAAGTT 


TATCTGAACT 


ACGCATCCAA 


AAAAAGGCCG 


2820 




A a A A ppnrTr 
AAAAbbAb I b 


bTGGGTAAAA 


GCGGCAAAAA 


GATTCCTAAC 


GTGCCTAAAG 


2880 


paa APPTapa 


ppaappp apa 

b uAnL b bAb A 


AAAA 1 b AAGA 


ATAATTTTTT 


CATTCAAAGG 


GTCCTTTCAG 


2940 


TTTTTPPPZiB 
X 1 1 1 1 V/^*-nn 


TTa TPrrarn 
X i A 1 bbbAb A 


A I b AAAAbb A 


AGTAGCAATT 


TTAATCGTAT 


AAGTAAATCA 


3000 


TTa A SBTTZiT 


PTaTTTTTTa 


Pa 7A 7A 7A TTT A A 

b AAAA 1 b 1 AA 


A ATATA A ATP 

AAb Ab AAAG G 


TV rp TV TV TV TV/^TV/"* 

ATAGAAAGAG 


T\ ta Tv r*> m fn m 

AAAGGCTGTT 


3060 


TTTTPTAPPP 


APAP A PPT TP 


t a tt a ppp a t 
1A1 lAbbbAl 


rp rp rp rn rp rp rp t\ 

IlblXlII bA 


1 AbAGTGGGA 


omivOfnTvr'Tv tv tv 

G TACT AG AAA 


3120 


ATTTAAAKTft 


APATAAPA AP 


papaTPaTTa 

oAOAXbAl 1A 


All b AAb 1 1 A 


ArbTbGbGTT 


A /""» TV TV A m /"^ 

ACACAaTGCG 


3180 


TTATAAGAAA 


TT APTTTTPP 
X X nvv XXXI L^vj 


PTPat- TTTaT 
L 1 bAt 1 1 1 A 1 


rp rp /"^ rp ^ rp rp rp 

1 Ibiblbl 11 


GGTTTTATTT 


m o ^ mm ^ tv tv mm 

TGATTGAATT 


3240 




APPATTTTaP 


p app a a tp a t 

LAHjAAI bAl 


1 bAbAAAbbA 


ATTATTCCTA 


GAGATATGGA 


3300 


CTATATTT AT 


PAPpaapppa 


TTTPPaTPPT 
111 bbAl bb 1 


1 b 1 AA1 1 Abb 


A rp rp tv rn /~> /~> /-> 

ATTAbTGGCG 


fry pn ^ /"^» tv t\ m Tv tv 

TTGCAATGAA 


3360 


TATTTTftPTA 
x " x x x x uvw a rt 


PPPTaPTTTP 


b 1 bbbAbAA 1 


TAP'AA^AA A 

i Ab AAb AAAb 


ATTGTTCGTG 


Tv m t\ m m m ^ tv 

ATATTCGTGA 


3420 


TGATTTATTT 

A wii 111 *i XXX 


PA A AAPaTTP 


a a aPTTTPTP 

AAAb 1 1 1 b 1 b 


APAfBT rp/-> ti tv 

AbA 1 Ab 1 bAA 


rPTMn/^ _ tv tv rn ti 

TATGaAAGTA 


TTGGGGTTTC 


3480 


TTCCTTAATT 


APPPPaapTa 


ppa aTPaTpp 

bbAA 1 bAl bb 


TV rp TV A AATT 

Al AbbAAATT 


ATGCTCTTCA 


fTt /~» t\ T\ T\ m Ti rri 

TGGGAAATAT 


3540 


TTTACGTCTT GGCTTTATGA CGCCAGTGAT GTTTATTGCC AGTCTTTACA TGGTGATGCG 3600 

AACGAGTCCG TCGTTAGGAA TGTACGTTTT AGGTGCCTTG CCTTTTCTGC TGCTAGCAGT 3660 

TGTCGGGATT GCTCGTTTGT CAGAACCGTT ATCTAAAAAG CAACAAAAGA ACTTAGATGG 3720 

AATCAATGGG ATTTTAAGAG AAAATCTTTC TGGATTGCGG GTAATACGCG CATTTGTTAA 3780 

TGAAAAATTT GAAGAATCTC GTTTTAATAA AGTCAATGAA ACTTACACTA AAAGTTCAAA 3840 

AAGTCTGTTT CGTTTAATGG CAGCAGCCCA ACCAGGGTTC TTTTTCTTAT TTAATATTGT 3900 

GATGGTCTTA ATCATTTGGA GCGGGACTGT TCAAATCAGC CATGGGGATT TAGAAGTTGG 3960 

GAATTTAATT GCTTTTATTG AATATATTTT TCATGCGCTG TTCTCGTTTA TGTTATTTGC 4020 

CAGTGTCTTT ATGATGTATC CGCGGGCTGC GGTTTCGGCT TCACGGATTC AAGAAGCCTT 4080 

AGACATGGAA CCAGCTATTC GTGAAGAAGA AGGCGTAACA GAAACAGCTA CTAAAGGCTA 4140 

TCTAGAGTTT AAAAATGTAA CCTTTGCCTA TCCTGGACAT GCGGAAAGCC CAGTTATTCG 4200 

CAATGTAAGT TTTAAGGCGT CACCTGGTGA AACAGTGGCC TTTATTGGGA GCACAGGTAG 4260 

TGGGAAATCA ACATTAATTC AATTGATTCC ACGGTTTTAT GATGTATCAG AAGGTGAAAT 4320 

TTTAATCGAT GGAGTGAACG TAAAAGAGTA CAAACTTAGT GCATTACGCA ATAAGATTGG 4380 

CTATATTCCA CAAAAAGCGT TACTTTTTAC TGGTACAATT GCTGATAACC TTCGCTACGG 4440 

TAAGGAAGAT GCGACGTTAG AAGAAATGGA ACGAGCAATT GACATTGCTC AAGCCACAGA 4500 

ATTTGTTTCG CAAAAACCGC AAGGCTATGA TGAACCTCTT TCAGAAGGTG GCACGAACTT 4560 

TTCTGGTGGT CAAAAACAAC GTTTaGCTAT CGCACGGGCG ATTATTCGCA ATCCGGAAAT 4620 

TTATATATTT GATGATAGTT TTTCTGCGTT AGATTATCAA ACAGATGCAA ATTTACGAGC 4680 

GCGTCTGAAA AAAGAAACAA CAGAATCTAC TGTTTTAATT GTGGCACAAC GTGTTGGAAC 4740 

AATTATGCAT GCGGACCGCA TTGTTGTTTT AAACGAAGGC GACGTGGTCG GAATTGGCAC 4800 

ACATCGTGAA TTACTTGAGA CTTGTCCAAT TTACTATGAT ATTGCGGCTT CTCAATTGTC 4860 

AGAGGAGGAA TTAGCATGAA ACACGCCTTT TCTTCTATGA AACGAATCGG TCGCTATATT 4920 

AAACCGTACC GAGTGACGTT TTATTTAGTT ATTTTATTTA CAATATTAAC CGTTGCCTTT 4980 

AATGCAGCGT TGCCTTATTT GACTGGATTA CCGACGACAG AAATTAGCCG TAATATTGCG 5040 

GCCGGCGAAT CCATTAATTT TGATTATGTA ATCCAATGTT TAATTTGGAT TTTAGTTGTG 5100 

GGAACAGGTT ATTGTGTGGC ACAATTTTTG TCAGGCTTTT TAATGACGAA TGTCGTTCAA 5160 

CAATCCATGC GCGATTTGCG TCGCGATATT GAAGAAAAAA TCAATCGTTT GCCAGTTTCT 5220 

TATTTTGATA AGAACCAACA AgGAAATATT TTGTCAGGGG TGACGAACGA TGTGGATGCT 5280 

GTCAGCAATG CGATGCAACA AAGTTTTATC AATATTGTTT CAGCAGTCTT AGGTATTGTG 5340 

ATGGCGGTAG TGATGATGTT CTTAATCAAT CCGCTGATGG CGATTTTTTC AGTGATTATG 5400 

ATTCCGTTGT CTCTGATTAT TTCCAGAACA ATTGTTAAAA TCTCCCAGAA ATATTTCCAA 5460 

GGAATGCAAA ATTCTTTAGG AGAATTAAAT GTTATGTCCG GGGCCGAGnT CGAATTTTAT 5520 

AAT 5523 



(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 






ACAGTCGATT TCGAGCACCG GCAACTTTTG CTACATCAAT GATAGACGTC ATTTTTTACC 60

TCCAAAGTGG AATGTATTCT TATAAGAAGT ATAGCATACA TTTTTTTGAA AATAGTGCTT 120

GACATGGGAA CATTCCCATA ATTTATCCTA AAGATGTAAA CGATAACAAT TTAATCAAGG 180

AGGAAGAAGA CGGATGACAA GAGGAGCATT AAAAATTGCA ACAATTGGTG GCGGTTCTAG 240

TTATACACCT GAATTAATTG AAGGATACAT TAAAAGAAAA GATGAACTGC CAATTAAGGA 300

AATTTGGTTA GTCGATATTG AAGCTGGTAA AGAAAAATTG GAAATCGTTG GGGCAATGGC 360

GAAgcGCATG ATTAAAGCTG CTGGTTTAGA TTGGGAAGTT CATTTAACAT TGGACCGTGA 420

AGCAGCCTTA AAAGGGGCGG ATTTTGTTTC AACACAGTTT CGTGTAGGAT TATTAGATGC 480

TCGCATTAAA GATGAACGGA TTCCTTTGTC CCATGGAGTA TTAGGCCAAG AAACAAATGG 540

CGCGGGAGGA ATGTTTAAAG CATTTCGAAC AATTCCTGTC ATTTTAGCTA TTATTGACGA 600

TATGAAACGC CTTTGCCCAG ATGCTTGGTT AGTGAATTTT ACTAATCCAG CTGGAATGGT 660

TACGGAAGCT GCAATTAAAC ATGGTGGTTG GAAAAAAACA GTTGGCTTAT GTAATGTGCC 720

GATTGGTCAT AGAAAACAAG CGGCCGAAAT GCTGGGAATT CCAGAAGAAG ACTTATTCTT 780

TAAATTTGCT GGAATTAATC ATTTCCATTG GCACCGCGTT TGGGACAAAG AAGGACAAGA 840

GCGAACACAA GAACTCATTG ATTTAATTTA CGGaCCTAAG CAAGAGCAAG AAAGTCATTt 900

GAAAAATATT TTTGATc-JGC CGTTCCATTA TGAACAGTTA AAAGATTTAG GAATGTTACC 960

ATGTGGTTAT CATCGCTATT ATTATATTGA AGATGAGATG TTGAAGCATT CCATCGAAGA 1020

GTATGAACGG GGCGAAACAA GAGCACAAGT GGTTAAAGAA ACGGAAGGTC GTCTGTTTGA 1080

ATTGTATAAA GATCCCAAAC TAGATTACAA ACCAAAAGAA TTAGAAGAAC GGGGCGGCAC 1140

ACACTATAGC GATGCAGCTT GTGAAATGAT TGCATCAATT TATAATGATA AACGAACGGA 1200

TATGGTTGTT TCTACTGAAA ATAATGGGAC AATTACTGAT TTACCGTATG ATTGTGTTGT 1260

AGAAGTATCT GGACCTGTGA CAGCTCATGG CCATGAACCT TACAACTGGG GGGCCTTTCC 1320

GCCAGCAGCA CGAGGAATTA TTCAAAATAT GAAAGCTATG GAAGAAACGG TTATTCGAGC 1380

AGCCATCAAT GGTGATTACG GCGCAgCATT gCATGCTTTC ACTATTAATC CATTAGTTCC 1440

TGGCGGCGTC ATGGCGAAGA CCTTGTTGGA CGAATTATTA ATTGCCCACA AAGCGCATTT 1500

ACCGAATTTT GCAGATGCGA TTAATAAAAT TGAAGAAACA CAACCAGAAA CAGTCACATA 1560

TGTGGCCGAA TTAATGAAAA GTAATTAAAA AATATCGCAC GCTAAAGATT CGTTTAGTGT 1620

GCGATACTTT TTTTAGAATA GGAAGCAAAG GATTCCGTTT TTTTCTTGAC AGTAAAATGA 1680

TAGGTTTGTT GCCGGAACAG CCACTGAATA CCAGAGTAAA TTAAAATTTC AGCAATAATA 1740

AACGTCGAAA GTTTCTGTGC GGAAAGGAAT AATAGCGTTA AAATTAAAAA TAAGACGACA 1800

CAGGTATAAA AAACATGTTG CAATTTTTTA GAATTGATAA TCGTGATTTC TTTGGTATCT 1860


TCAGTAACTT CAATGCGCCG ATAAAATTGG AACCAGCCTT CATAGAACGT TAAAACATCT 1920

CCAGGATCAG CCTCAATTTC AATAGGTACA TTAGTAAAAA GTTTCCCTTG ATGCACACCA 1980

TTTTTATAAA ATGtAAAATA GCGTAATGCG CCCCAAGCTG ATGTTTTTCG TGTCAATGTA 2040

ATCTTCATGG TTTTCACCTC CGtCTTTATT TTGCCmTwGg TtTCGGTAAA ATTGgTATCC 2100

TACTAAAGTC CGATTCTCTC ATTTTAGGGT CAGAATTGAC AAAAGAAAGC AGAGGTGTTA 2160

ATGTGGGAGT AATTAAACGA GGAGGTATAA TTGTATGTTG GCATCTATCA GAATTTGGAT 2220

TGACTTTTTG ACAAAATAAA TTGAATAAAT CTTATCAAAA AGTCTGAAAT TAAAATTCAT 2280

GGGTGGCGTG TGTACTTTTA TACGAAACTA CAAGGATAGC GATTAGGCTG TCCTTTTTTG 2340

CGTACCCAAG ATTCTTTCGT TTCAGAAATG AAAGAAGAGG GCAAAATGAA AAAATTAAGT 2400

ATTCATCTGA AAGACGTAAG TATTCACTTT TCGGGAAAAC CAATTCTAGA AATTGATGAA 2460

TTATTTGTAT ATGAAAATGA AAAAATTGGG ATTATTGGTA AAAACGGCGC AGGAAAATCA 2520

ACACTGCTCA ATTTAATTAT GGGTAAGATT CAATCAGATA AAGGAAAAGT TCAAAGATTG 2580

AACGACTTTC ATTACTTGGC ACAAGTAGCT GAAGAAATAA CGAATGAATC AGAGAAAACT 2640

GACAAAAATT GTTTGCTAAA TCAGAAAAAT CAAAAACTAA GTGGTGGCGA AAAAGTCCAA 2700

AAACGCTTAG CAACATTATT TTCAGAGTAT CCAACCGGGG TTATTTTAGA TGAACCAACC 2760

ACACATTTAG ATAAAGAACA TCGTCAGTTG TTAGTGGCAG ATTTAACGTA TTATTATGGG 2820

ACCGTTTTGT TTGTTAGTCA TGATCGCTTT TTCTTGAATC AATTAGCAGA GAAAATTTGG 2880

GAAGTTTCGG ATGGACATGT CAAAGAATAT TTAGGGAACT ACGATGCGTA TTGTCGTCAA 2940

AAAGAATTGG AACAGCAAAC ACAATATAAT GTCTATCATC AGTATCAAAA GGAAAAGAAA 3000

AAATTACAGG AATCTTACGC AAAGAAACAA GCACAAGCGC AAAAATCTAG TCATGTTTCA 3060

AAAAAACAAA AACAAAAGCA AATTAAACCT AGTCGTTTAG CTGGTTCTAA ACAAAAAGAT 3120

ACCGTACAAA AAGCACTCCA AAAACAAGCG AAAGsGwTTA ATGCCAGAAT TGACAGATTA 3180

CCCGATGTTG CGCAAGCAAA ACAAGAGCGA AAAATTATTT TTCCTACTAA TAATCAGTTC 3240

TCTTTATACA ATCCATATCC AATTAGAATT GAAAATTTAA CTTTCGCTTA TGAGAATCGA 3300

ACAATTCTTA ACCAAGTGAA CGTTCAAATT CCTTTAAACG AAAAAATAGC ACTTTGTGGa 3360

AAAAATGGTG CTGGGAAGAG TACCTTCCTT CAACAAATTG AAGCCCGTCA CCCGGCCATT 3420

TACTTTTCTC CCAAAGTCCG GTTAGGAACG TaTCATCAAT TAGATTATCG ATTAAAAAAT 3480

GATGAACCAC TGTTGACTTA TTTATTGAAG CGGACAAATT ATTCTGAAAA GATTGTGCGT 3540

TCACTTTTAT ATCGACTAGG TTTTCAACAA GAAAATTTAC AGACAAAGAT ATCTTCTCTA 3600

AGTGGTGGCG AACATGAAAA ATTACTTTGG CGCAATTATT TATAGAACCA AATAATATCA 3660

TCTTATTGGA TGAACCCACC AATTTTCTTG ATTTAGACAC AATCCAAGCA CTTGAAGAAT 3720

TTATAAGTGC CTATCAAGGA ACAGTAATTT TCACTTCGCA TGATGAAACG TTTGTGGAAA 3780

AAGTAGCCAC CCGCACTATT TATTTAGAAA ACGGGAAAAT TATTGATAAA TAaAAAAAGT 3840


rAACAAAAAG TCGCTATTTT AGCGACTTTT TGTTTAGACT TCTTCAATTG GTAAATAGGT 3900

GTATTCTGCA ATTTCTGCGG CATGGGAGCC AGCAACATGT CCAGTGACAA ATGCAGCTGT 3960

AACATTGTAG CCTCCAGTAT AGCCATTAAT ATCTAAAAGT TCACCAGCAA AAAATAAACC 4020

ATTGACTAAT TTGCTCTCCA TTGTTTTAGG GGTCACTTCT TTGAGGGAAA TCCCACCGCC 4080

TGTGACAAAA GATTTTTCCA AAGGCAATGT TTTCGTAACA GTGAACTGAA AGTCTTTCAG 4140

TAGTTCCACA AAGGAAAGmC GTTGCTTTTC TGTTAGTTGT TTGGCAGGCA CTTCTTCAAA 4200

AGATTTTGTC GGAAACACAT CCAAGGCTAC CGTGACAGGT TGATTACCGT TGCGAGTTAA 4260

TTCTTGGTTA ATAAAACTAG AACAGCGGAG CGCGGCAGGT CCTGAAATGC CAAAATGTGT 4320

AAACAGCATA TCCaTTTGAT GATTAACTAA AGGTTTTCCT TTTTGGTTCA AAACAGTTAA 4380

ATTAACATCT TGTAAAGAGA GACCTTGCAA CGTTTTATCC AGGATAAAAG GTTCTTCAGA 4440

AATAATAGGT GATTCGGTAG GGTAGAGCGG GCTGATGGTA TGCCCCATTT TTTTGGCTAG 4500

TTTATAGCCA TCACCAGTTG CTCCTGTGGA AGGATAAGTG CGGCCGCCAG TTGTTAATAC 4560

AACACACGGT GCATAAATTT TTTCCAGTTC GGTTTCAACG CCAATTATTT GATCGTCTTT 4620

TCGTAGTAAT TTTGTGACCT GTGTTTTTGT AAAAACAGTG ACTCCTAATT CGTTAATGCG 4680

GTTAAATAGC GCATCAACAA TTGACTTCGA TTTATCTGTA ACAGGGAACA TGCGTCCGTG 4740

ATCTTCTTCT TTTAAGTGAA TACCATTGGA TTCAAAAAAG TTCATGATAT CATAGTTATC 4800

AAATTGTGAA AATGCGCTGT ATAAAAATTT TCCATTCCCA GGAATAAATG AAATGATTTC 4860

TTCTGCGGGC CGATTATTGG TTACATTACA GCGGCCGCCA CCAGTCATTA ATAATTTTTT 4920

CCCAACACGG CGATTTTTTT CAATCAATAA TACTTGAGCG CCTGCTTCGG CGGCCGCAAT 4980

CGTGGCCATC ATACCGCTCG TCCCAGCACC GACAATAATT ACATCAAATT TCTTCATAGT 5040

TCTCCTCCAA TTCAATAGGT AgTGTACCAT AATTACGGGA TTTTTTCTGG ATTTAAGGAA 5100

AAAATAAACG TTTGACTATG GTAATTTTTG AATTTTCAGT TAAAATAAAC CTATAACAAT 5160

GAGAGGATGA TTAAATGGAC GCAAAACAAT ATGTTAAAAA TATTCAAGAA AAAATTCACC 5220

AATTAGACCA AGGTCAAACA GAATATTTAC AAGCAGTCGA TGAGTTTTTA CCAACCGTAG 5280

AAGGTTTTTT AGAAAAAAAT CCTCAATACA TTGAAGCAAA TGTTTTAGGT GTCTTGATTG 5340

AACCTGAACG TATTTTTCAA TTTCGTGTGC CGTGGCAAGA TGACCAGGGC AATTGGCATG 5400

TGAACAGAGG CTATCGTGTC CAATATAATT CAGCGATTGG ACCATACAAA GGTGGTTTGC 5460

GGTTTCATCC AAGTGTGAAT TTAAGTGTGA TGAAGTTTTT GGCCTTTGAA CAAATTTTTA 5520

AAAATAGTTT AACTGGTTTG CCAATCGGTG GCGGCAAAGG CGGTAGTGAT TTTGATCCAA 5580

AaGGTAAGTC AGATGCTGAA GTGATGCGCT TTTGTCAAAG TTTCATGACC GAACTTyAAA 5640

AACATATTGG TCCAAGCACA GATGTTCCAG CAGGCGATAT TGGCGTAGGC GCTCGTGAAA 5700

TTGGTTATTT GTTTGGCATG TATAAAyGTT GCGCAACTAT GATGCAGGAG TGTTGACTGG 5760

CAAACCTTTA GGATACTGGG GAAGCCAAGC AAGA 5794


(2) INFORMATION FOR SEQ ID NO: 71:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4232 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:






ATTCGTATGC CaATTAVY^T gGCtaTTTGg CgAAAAAgCc AnCnGGAGAA gTCaGrGnAA 60

AATGTGGAAA ATTATAGGAA GCTACTGGGT TTTATGCTAT GATTTTTAAT AAaACAAAAT 120

GAAGGGGCGG GAAAACATGT TATTTGGTAC AGCAAAAATG AATCGGGAAA ATCATTTAGA 180

AATCGGTGGT TGTGACACCG TAAAGTTGGC GCAAAAATTC GGGACGCCGT TATTTGTCTA 240

TGATGTCGCT CATATTCGCG CACAAGCACG TGGTTTTAAA CAAACGTTAA ATCAATTAGG 300

TATCAAAAAT AAAGTGGTTT ATGCAAGTAA AGCATTTAGT TGTTTAGCTA TTTATCAAGT 360

GTTAAAGGAA GAAGATATTG CTTGTGATGT TGTCTCTGGT GGAGAACTAT TCACAGCTTT 420

AAAAGGGGGC ATGGAGCCAG CGGAAATTGA GTTCCATGGA AATAATAAAA CCCCTGAAGA 480

ATTACGTTAT GCTTTGGATA ACAAAATAGG TACAATTGTC ATTGaTAATT TTTACGAAAT 540

TGACTTGTTG GAAGAATTAT TAGCTACACG AAACCAAACA CAAAAAGTTC TTTTTCGAGT 600

GAGTCCTGGC GTGGATGCAG AAACACATGA TTACATTTTG ACAGGCCAAG AAGATTCAAA 660

ATTTGGTTTT GATGTTGCAA GTGGCCAGGC AACAGAAGCA CTCGTTCGTC TTTTATCAAA 720

CCCCTCTTTT GATGTACAAG GGGTCCATTG CCATATTGGC TCGCAAATTT T7GCTGTGGA 780

AGGCTTTTTA GCTGCTGTTG AAAAAATGTT CACTATTTTA GAAGACTGGC GGCAAGTCCA 840

TCAATTTACA GCACGTGTTT TAAATATGGG TGGTGGTTTT GGTGTGCAGT ACACGCAGCA 900

AGATGAGCCA TTAGCACCAG CAACATTTGT TGAAAAAATT GTTTATTCCT TGAAAGGTCA 960

TTGTGAACAA tTAGGGTATC CaCTGCCTGA GCTTTGGATT GAACCAGGCC GCAGTTTAAT 1020

TGCCGAAGCG GGCACAACGA TATATACAGT TGGCGCGCAA AAAGAGGTTC CTGGTGTTCG 1080

TCATTTTGTT TCCGTAGATG GTGGCATGGG TGATAATATA CGGCCTGCTT TGTATCAAGC 1140

AGTTTATGAT GGTTTTCTAG CGAATCGTGA TGGGCATGAT TCAGTCAAGG AAGTCACAGT 1200

GGTTGGTAAA TATTGCGAAT CAGGAGATGT CTTACTTCGA GATATTTTAT TGCCAGAAGT 1260

TAAAGCAGGG GATTTATTGG CCATTAGCAG TACGGGGGCT TACGGTTATT CAATGGCAAG 1320

TAATTACAAC CGAAATCCTC GACCAGCGgT GGTTTTTGTC GAAGATGGAC AAGCAAAATT 1380

AGTTGCTCGC CGTGAGACAT ATGAAGATAT GACAACGTTG GaTTGTTGAT AGATTaAAAG 1440

TAGCCGCCTC TTTTTGAGAC gGCTGTTTTT TTTCcAAAAA ATAACraAAAT GTAAGGGTGA 1500

TTTCTTTCAG AAaTGaAAAA TCCaTAGTAT ATTAAATGAA GAAGGAAATA CTTTAGAAGA 1560

GGGTGTGATT TGCAGATGAA ATGGCTCCTC GGGTTACTGA TCATTTTTGC AATAATTGGC 1620


GTTGTTTTTT CTTTTTACAT TGCCAATTAT TTTTTGAAAA TTGCGTTGCT TAAAGACAAC 1680

CCTTGGTATC ATAAAAAAGG GCATCGGTTA TTAAATCCTG ATAATTTTCA AGAACGGGAA 1740

ACAAGATATA CAAAAATAGA AGAACAACAA AAACAAGAAG GCGAAGCTTT TTGGACAGAG 1800

TCTTTTGCTG AAGATCGTTG GTTGAAAATT AAAGATGAAA CATTGTATGC ACGGTGCTTT 1860

ATTCCTTATC CAGATAACCA TCGTTGGGCG ATTTGTGTGC ATGGTTATCG TTCAAACGGA 1920

AAACGCGATA TGGCATATAC TGCATTACGT TTTGCAGAAG AAGGCTACAA TGTTTTAGTT 1980

CCCGATTTGC GTGCGCACGG GAAAAGTTCA GGGAATAAAA TTGGGATGGG ATGGCTGGAT 2040

CGGTTAGATT TATTAAGTTG GCTTAGTGAA GTTCTTGCCA TTGATATGGA AGCAGAAATT 2100

ATTTTAGTGG GCGGTTCCAT GGGAGCGGCT ACTGTCATGA TGGCAAGTGG TGAAAAGTTA 2160

CCAACGAATG TCCGCGGATT AATAGTGGAT TGTGGCTACA CTTCTGTTTA TGATGAATTT 2220

AAATACGTCC TACATGAATC TTTTCATTTA CCCGCTTTTC CTATTTTAAC AATTGCTAAT 2280

CAATTAGCAT TAAATAACTA TGGCTTTCAA TTGAAAACCG CTTCTTCCGT GAGACAATTA 2340

CACAAAAATA CGTTGCCCAC ATTCTTTATT CACGGTACAG GTGATCGCTT TGTTCCAATG 2400

ACAATGTTTG AAGAAAATTT AGCGGCAACT CAGGGGATAA AAAAAGGCCT CATTGTTGGT 2460

AAAGCACCAC ATTTATCTTC AAGTGTCTAC GAACCTGAAA ATTATTACAG TAGTATTTTT 2520

GAATTTTTGG AAGAAAACTG tCCTGCGGTA AAAACAATAT CCGACTAAAA AAGACATGGA 2580

TTTTCATGAT TCCATGTCTT TTTTTTGTCT GACTCGTTTA GTTAAATTTA AAGTGCTTAA 2640

AAAGAAAAAT AACGGCGCCA AACTACCAAG GCAAAGAAAC CATTTCATTG AATAAGTTAA 2700

CTCTTGCTGT AAAAGAGATT CTCGTATATA GCTGAAACCT TCGCCATGGA CCCGCAAATA 2760

AATATAAATG TAAGCAAAAA GAATCACTAA AGGCAAAATA CAGACTTCTA ATATTTCCAA 2820

AAGATAAGGA AATGTTTTAT TTAATAATTT GAGGCGTGTC TTATTCAATA GATAAGCAAA 2880

AATGAAAATT GACCATAAAA AACCAAAAAA ACTAATCAAA TAAAAAATCA GATTCGCACT 2940

TCCTGCATTC ATAAAAGCTT TCGGGAAAAG TAGTCTGCCG ATGAATGTAT TTTTAATTGT 3000

AAAAGGCACA TCAAAAATGG AAATATAAAG ACAAAAAAGT ACTAATGTTA ACAAGGCATG 3060

CCAAATCTTT TTCTCCATTT CATTTAAAAA ATATCTAATT ACTTGGTACA TTTCCATAAA 3120

CTGTCGCTCC GTGTTTGAAA TAATCGTAGT TTTGACTATA CCATAGCTGT TCTCAATTTT 3180

CTTGAAAAAA ATACACGCAA CCAGCCTCGA CTATACGAAA CACACAGAGT GTCTAAAAAA 3240

TAAGTTGCCG ATCGCATTTT TTCGCATGCT AGTGTTGCAC TTCTGTTTCG TTGGTGTTTT 3300

CTAATTGCTG ATTTGTTGAT TAAAAAAGGG AAGGAGGGAG CGTTTTTCTA GTATAATAAT 3360

ATAAAAGAAT ATAGAGAAAA TCGTAGGATG GGAGTTATTC AAAGTGGCGG AAATTACAGT 3420

TAAGACAATT TCAGAGAAAG ATGATTTAAA GAAGTTGTTT TTGGGTCAAG AGTATGACCA 3480

ACAGCAGCCA GTTAAAAATA TTTTAAAAGC TAGAACATCG AATGTCCATA TTGAAAAGTA 3540

CGAGGTTGTG GAGGATGTTT TTTTAAAGTT AACTGGGAAG GCGACCGCGC ACAAAGATAT 3600

TATGAACTCA TTTTGGACAA CGTATAAAAT TATGTTACAA CTTGTTTATC CCGATTTCTT 3660

TAGACCAGCG GAGGTAATTG GGGAAAATCA AGAAAGCTAT TTAGAAAAAC CTTCTGAACA 3720

GAATTTTCTT GCAGTAAATA GCAAGTATCC GCCTCATGAT TCCGGACCGA TATTTAACTT 3780

ATTTTGACCA AGTTTTTCCT GACTATTTAC CAAATCCTGT ACCAAAAAAA TATACATGGA 3840

ATGAATTTCT TCTCGACAAT TTTACTAAaT TTGAGCGAGT TCATCmGGAC CCACAaTTAA 3900

aGCGTTTTGC CgAACTTACC CaTTCAATTG GAAACaTTAC AGTAGTTCCA CTAGGATTTA 3960

ACAGCGGAAG AAGTTTATCG TTTAAAGATT ATTGGGATTA TTCTTTAGAA CAACTTTCGA 4020

TATTTTTAGC TTCATTTCAT TCATGGGAAA GCTACGTTCA TACGTATGAG ATGCAACCAT 4080

TTTTAAATGA ACAGTATCAA CCAGTAGCGC TATGGAAAAA TCATTTGAAA AAAGACTCGT 4140

TTATCTTACC TCAGAATATA GAGGAGATAA ATGAATATTT AGTTCAAGTA AATCAACGAA 4200

TAGAAAAAAG AGGGCAGCGA ATAGTGAATC GG 4232

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

TGGAnTTATG GTTGGGTTTT nAGGTTGGCC GnAGGTTGGG GCCAAAGGTT ACCCGTTATT 60 

TTCCTTGGAC CAAnCCACTT TATTTCGGAC CAAAAACCCA TTTTAAGAAA CCATTTGTAC 120 

CCAAATGGTA AAGCAGAAGA TTTAGATGCT GCTTCCGCAG AATATGAAAA AATTATCGAT 180 

GCACATCCAG TAGATATCCA AATCTTAGGT ATTGGTCAAA ATGGACACAT TGGTTTCAAC 240 

GAACCAGGCA CACCTCTAGA TAGCTTAACA CATGTCGTTG AGTTGACTGA ATCAACAATC 300 

AACGCAAACA AACGTTATTT TGACAAAGTA GAAGATGTCC CAACACGTGC TGTTTCTATG 360 

GGGATTGGCT CAATCATGAA AGGTAAAAAA ATGATTTTAA TGGCTTATGG CGAAgCAAAA 420 

GCAGAGGCCA TTAAAGGAAT GATTGACGGA CCAGTAACAA CAGACATGCC AGCAAGTGCA 480 

TTACAAAACC ATCAAGATGT TGTTGTAATC ATTGATGATG CAGCTGCAAG TAAATTATAA 540

TCGTTGATGA AAAAGAACTT GAAACATAAG AATTTCCTTA TGTTTCAAGT TCTTTTTTTG 600 

TTATACTGGA AGTAATAGAA GAAGAGGTGA AAAGGATGTT ACAGCTAACA ATTCCAGAAA 660 

TTATTTTACG GTTAGTCCTA GCGATGGTGA TGGGTGGCGC AATTGGTTTT GAAAGACAAT 720 

ATAAGAATCG CCCAGCAGGG ATGCGCACTC ACATTTTAGT TTGTATGGGC GCGGCTATTA 780 

TTGCGTTAAT TCAGTCGCAA ATTGCAATCA ATGCATTGAA AGATGCGTTA gcTAATCCGG 840 

CTTTGACGGG GGTCATTCGT TCAGATCAAG CAAGGTTGAT CGCACAAGTT GTAAGTGGAA 900 

TCGGCTTTTT AGGTGCTGGA ACCATCATTG TAACGAAACA AAGTGTCACC GGATTGACGA 960 

CAGCAGCGTC ACTATGGACC GTTGCTGGTT TAGGGATCGC GGTGGGCATG GGCTACTATA 1020 

CGATTGCCTT AACAAGTTTT ATCGGTGTAT TATTTGCCTT AACCATTGTC AAACGAATTA 1080

TTCACGTACC AACAATCAAA AAATTAGAAA TTCGCTATCA ACATAAACAA GAAACCAAAG 1140

AATTTATTAA TCAGTATTTT GAAGAAAaAC ATATTGAAAT TTATGATGTG AACTTCAATG 1200

TTTCTTGGGT GGATGATACA AAAATCTATA CGAATATCTA TACTATTGAT TTGCCTAAAG 1260

GCCTAACGTA TGCCGATGTT ATCGAAGATC TATCTGTTTC AAATAATGTA ACAAAATTAC 1320 

GATgATTAAT GTGTAGAAAC ATCTATTaTA ATTAATGACA ACCGCTATAA GTTAAAATGC 1380 

TTATGGCGGT TTTCTTCTAG GACCTAGGAC TGATTTTTTA AGTTTAATAT TTCAAAAAGA 1440

TTACAAAAGA GTTATGATTG TATTACGAGT TAGAAAGAAA GGTGATAAGT AAGATGAATA 1500 

AAAAAAATAA ATTTCTTAGT ATTATTGTTG TTGTCATTGT GATTATTGGT GCATTTGTGA 1560 

TGAAAGCTTA TAATCAATTG ATTACTTTAG ATAATAAAGT CGAAGCAGAA TGGTCACAAG 1620 

TAGAGAATGT AATGCAAAGA CGTGCTGATT TAATTCCTAA TCTTGTCAGT AGTGTTCAAG 1680 

GCAGTATGAC TCAAGAAAAA GAAGTTTTAA AAGAAATAAC GGAAGCACGT AAAGCATACG 1740

CAGGAGCAAA ATCTTCTTCT GAAAAAGGGC AAGCAAATGA GCAAATTGAA AAAGGATTAG 1800

GTAATTTTGT TACAGTATTG AATGAAGATT ACCCAAAGTT AGCTTCTTCT GAAAATGTCA 1860

AAACCTTAAT GACTCAATTA GAAGGATCTG AAAATCGAAT TTCTGTTGAA CGCCGTAACT 1920 

ATATACAAGC CGTTGAAACA TATAACCAAC AGATTTCAAA ATTCCCTGAT AAAATAATTG 1980 

CAAAACTTTT AGGATTTGAG AAGAAAACTA ACTATGTTGC GACAGAGCAG GGAAAAGAAG 2040

TGCCGCAAGT AAAATTTAAA TAAAAATTTA TAGAAAATTT GGGTTAAAAA GAAAGAAGTG 2100 

AGGAAGGTTG GATAAAAAAG TCGAAAAAAT TCAAGAAAAA GTTCGTTTGT TTTATAAAAA 2160 

GCATCAGCGG AATACAA7.0A TTGTTGTGGT AATTAGTTTA AGTATGATTT TAATATTAGG 2220

AATGGCTGCT TGGTATGAAG GAAAGGGAAT CTCTTATGAA GCAAAACAAA CTGCACTGAG 2280

AGAGAAGCAA CATGCCTTAC AAAAAAAGAG AGACGATATC TTAGCAGGGA AAATGACTGA 2340

AGCGTCAACC CAGACAACAA TAAATAACTT TTCTAAACAA GAAATAACTG GTGAAAATGA 2400

ATTAGTGACA ATTACTGGAA CGAATGGGCA GTTGAACATT AATGATGCTA ATATCTATGT 2460

TTCAGATAAT GCAGGTATTG TCTCAAATCA ACTTAAACAG AAAATGTTTC AATTGAATCA 2520 

ACAATTGTTG GAAAATGCGA ATGGCGCACA ATTTATGCTT ATTACCGTTC CTGCGTTACC 2580 

TAGTGGGGAA TCTGTTGAAT CTTACAGCAA TAAGATTGCT AATCAATTAG GTGTAGGGGA 2640 

TCGTGAAAAA AATAATGGTG TGGTTTTTCT AATGGCAATT GAGGATCGCG AGTCTCGCTT 2700 

GGAAGTTGGA TATGGCTTAG AGAGTATTTT AACGGATAGC TATGCGGATG ATATTATTAA 2760

TAATGAAGAA GTAAAAGAGG CTTTTAGAGA CGAGGATTAT AATACAGGTC TCAATAAAAT 2820 

TATTGATCAA GTATCGGCTG CAATCAATTC AAAAACCGCA CAAGTGGATA ATGAATTAAC 2880 

AAATATACAA ACAGAGTTAA ACACTAACAC AACGAAAAGA AATGTTCTTT TAATGGCAAG 2940

TATTGCTGGG ATGCTCATTT GTGCTGTTTA TATTTTACAA ATTTTAAGAA CAAGAAAACT 3000

TGTTAAAAAA ATGTATCAAG ATTATTTGAA TTGTTTACCA ACAAAGGCTG TCCTAAATAA 3060

CTCAGAACAG ACAAAGAAAG TATTAAATAA GATGAAAAAG ACTTCTTTTT ATTGTTTATA 3120

TCTTAATGGA GCAACGTTAT TTGCCACGAA AGGGAAAATA AGACGTGCCA CTAAGCGTGG 3180

TTCTTTACTA AGTATATATC CAAATGCAAA AAAACAATCT TTTGGGCGTT TATTAGTAGG 3240

AGATACTTTA TATAGTTATG ATGGTAGTGT TTTGACGTAT GCTTATTTAA ACTCACAATA 3300

CAATCCGTCT AATCATAGTA GTTCTGGTAG CGGCAGTGGT GGTTCATTCG GTGGCGGCTC 3360

ATTCGGCGGT GGCGGCGCCT CAGGAAGTTG GTAAAATAAA AAGCTAGAAA CGTTGATCGT 3420

TTCTAGCTTT TTGCTATTTA TCTAAACCGT AAAGATAGTC ATCATCTTTC ATGGCTTCAA 3480

CTGTTCCTAG TAAATAGCCA TTTCCTACTT GGGAGAAGAA GTCATGGTTG CTAGTTCCTG 3540

TCGAAATACC ATTCATTACA ATCGGGTTCA CATCATTGGC TGTATCCGCA AATAAGGGAT 3600

CCATACCTAA GTTCATTAAA GCTTTGTTGG CGTTGTAACG TAAAAAGGTT TTTACTTCTT 3660

CGGTCCAACC TAAGTCATCA TATAATTCTT CTGTATAACG TTCTTCGTTT TCGTACAATT 3720

CGTAAAGTAA ATTATACATC CAGTCTTTTA GTTTATCTTG CTCATCTTCT GGTAATTCAT 3780

TAAATCCTAA TTGGAATTTA TAGCCAATAT ACGTGCCGTG CACTGATTCA TCACGGATAA 3840

TCAATTTAAT AATTTCAGCG ACGTTCGCTA GTTTATTGTT TCCTAAATAG TAAAGAGGAG 3900

TGTAAAACCC AGAATAGAAT AAAAACGTTT CTAAAAATAC ACTAGCAATT TTTTTCTCTA 3960

ACGGAGTACC GTTTTTATAA ATTTCGTTAA TCCGTTCTGC TTTATATTGC AGGTGTTTGT 4020

TGGTATTTGT CCACTCAAAA ATATCATCAA TCTCTTTTTT TGTATTTAAT GTACTGAAGA 4080

TAGAAGAATA ACTTTTGGCA TGGACAGATT CCATGAATTG GATGTTATTT AAGACGGCTT 4140

CTTCATGAGG CGTCCGCACA TCATTTCTCA GTTGATCCAT GCCGCTTTCT GATTGAACAG 4200

TATCTAATAA AGTTAAACCG CCGAAAACGT AGCCGACAGT AGTTTTTTCT AAATCGGAAA 4260

GAGTCCGCCA ATCATCTAAG TCATTGGATA ACGGAATCCG TGTATCTAAC CAAAATTGTT 4320

CGGTTAATTT TTCCCAAGTG GATTTATCAA TGACGTCTTC AATGGCGTTC CAGTTAATTG 4380

CTTCATAATA CGTTGCCATG GTTATTCCTC CTGTATGCTG GGACAGCAGT CTGTTTGACT 4440

AGTTGTCTCG TTTCTTCGCT TAAAACGTTC TACAGAGTCC AAATAGGAGT GTGCAGCCGT 4500

TCTTCTTTGT CCAGTATATC TGAAAATGTC CTGTTTAAAA AGAGAACATT CCTAAAAAAT 4560

AAGACGCTCA AGCAGAACGC CTTATTTAAG TGCTTTTCAT TAAATCACGC AAGACTCACA 4620

TTGATTGCTG CCAATTTCTT CAGCATCATC TGTAAATGTA CGAATATAGT AAATTGATTT 4680

AATCCCTTTG TTGAACGCAT AATGACGTAA AATGTTCAAG TCACGGGTTG TTTGTTTTGG 4740

TGTTTCTTTC CATTCATACA AGCCTTCAGG GATTTCTGAA CGCATAAACA AGGTTAAACT 4800

CATTCCTTGA TCAACGTGTT GTTGCGCTGT TGCGTAGACA TCAATCACTT TGCGCATATC 4860

CATGTCGTAA GCAGAAGTAT AGTAAGGGAT TGTATCATTA GCTAAATAAG GCGCTGGATA 4920

GTAAATCTTA CCAATTTTCT TCTCTTGACG TTCTTCAATC ATACGAGTAA TTGGATGAAT 4980

GCTGGCACTA GTATCATTAA TATATGAAAT TGAACCATTT GGCGCTACGG CTAAACGATT 5040

TTGATGATAA AGACCATCTT TTTTCACAGC ATCGCGTAAC GCAGCCCAAT CTTCAGCCGT 5100

TGGAATCTGA ATGTCTTTGA AAATTTCTTT CACTTTATCA AATTTTGGTG TAAAGTTGCC 5160 

TTCAATATAT TTATCGAAGT AAGCACCAGA AGCATAATCT GATTTTTCAA AATTATGGAA 5220 

GACTTGGTTA CGTTCTTTCG CAATTTGGTT ACTTTCCATT AAAGTCCAGT AATTTAATAA 5280 

CATGAAATAA ATGTCTGTAA AGTCTAATGA TTCTTCAGAG CCGTATTCCA TATGATTTTT 5340 

AGCGAAGAAG GTGTGTAAGC CCATTGCGCC TAGTCCGATT GTATGGCTTA AACGATTTCC 5400

ATTTTGGATA GTTGGCACGA CATCAATTTC AGAAGCATCT GTTACAAAAG TTAACGCACG 5460

CGTCATAGCG CGGACAGATT TTCCAAAGTC AGGACTTTCC ATTAAATTAA CAATGTTAGT 5520 

TGAACCTAAG TTACAACTAA TATCTGTTCC TAAAACTTCA TATTCTTGTT TCCCATTAAT 5580 

AACAGAAGGC GTTTGAACTT GAAGAATTTC AGAACATAAG TTACTCATGA TAATCTTGCC 5640 

ATCAATCGGA TTGTTTTTGT TAGCTGTATC AATATTGATG ATATAAGGAT AGCCAGATTC 5700 

TTGTTGTAAT TTAGAAATTT CATTTTCTAA GTCACGGGCT TTAATTTTAC GTTTACGTAT 5760

GTTTGGATTG GCCACTAGGT TATCATATTC TTTGGTAATA TCTACATAGG AATAAGGCAC 5820 

ACCATATTCA CGTTCCACAC TATAAGGACT AAATAAATAC ATATCTTCGT TGTTACGTGT 5880 

TAATTCGTAA AATTTATCTG GAACAATAAC ACCAAGAGAC AACGTTTTTA CACGGATTTT 5940

TTCATCAGCG TTTTCTTTTT TAGCAGATAA GAACATTTCA ATATCTGGAT GGAAGACATT 6000 

TAGATAAACC ACACCAGCAC CTTGACGTTG ACCTAGTTGG TTAGAGTAGC TAAAGCTATC 6060 

TTCAAATAAT TTCATAACAG GGACAACACC ACTTGCTGCG CCATCATAAC CTTTGATTGG 6120 

TGCGCCCGCT TCACGTAAGT TGGAAAGAGT AATTCCTACG CCGCCGCCGA TTCGTGATAA 6180 

TTGAAGGGCA GAATTAATTG AACGCCCAAT GCTGTTCATA TCATCTGTTA CTTGAACTAA 6240 

GAAACAAGAA ACCAATTCAC CTCGACGTTT ACGCCCAGCG TTTAAGAAAG AAGGTGTGGC 6300 

TGGTTGATAC CGTTGATGGA TCATTTCATC CGCCAATGTT AAAGCCAACT CTTCATCGCC 6360 

ATCCGCAAAA TACAAGGCAT TAAAGGCAAC CCGATCTTCA TATGTTTCTA AATACTCTGT 6420 

TCCGGCGTTG TTCTTTAAAG CGTATTGAGA ATAAAATTTA TAAGCCGCCA TGAAAGACTT 6480

GAATTGGAAA TGTTGTTCAT CTAAGAAAGC ATATAATTTT TCAATAAAGT CCATTGAATA 6540 

TTTTTCGATG AATTCAGTTT CAATAAAATC TTCTTTCATT AAAAAGTTGA TTTTTTCTTC 6600 

AGCAGAGCTG AATTTACGTG TATTCGGTTC TACATTTTCT TTAAAGAAGG CTGCGAGTGC 6660 

TTCTTTATCT TTATTTAAAG GGATTTGACC ATCAACAGGA CGGTTAATCT CGTTATTCAG 6720 

TTTGAAATAG CTGACATCTT TAATTTCTTT TAGACTCAAT TTCTTCCACT ACTTTCTTAA 6780 

AGGTCTCCAC GTCTTCGTCT GTTCCGCTAA ACTCAAAGGA GTATAGAAGC GGAACGTGAT 6840

AATCGCGGGC AATATCTTTG GCGGTATAGA CAAATAATTC AGCAAAATTA CGGTTGCCCC 6900 

CACCAGCTAC ACCAACTAAC AGTTCTTGGT TACTTTTATA ATCAAGAAAG TCATTCACTA 6960 

CCTCTGTAAT TTCTGCATCA TAGGTAGGAA TGACAAGAAT AAAAGGCTCA TTTATTTCAA 7020

AAAAAGGATT CGCTGGCTCA AGTTCATAAG CAGGCAAATC CAATTTTTTA ATAAAGCGCC 7080

TTGTTTGACC GGTGACTGAA AAATAAACAA GTTTCATCTT AGCTAGCCAA TTGTTTTAAT 7140

TGGTCTGGAC GGAAGCCAAC AACGGTTGTT GCTTCAGATG TGATTACCGG TACGCTTTGG 7200

AACCCTTGTT CTTTTAACCA GTCGATAGCA TCTGGTTGTG CGTCGATATT GATTTCTTCA 7260

AAAGCGATGT TATTTTCACT TAGGaAACGT TTCGCCATTT TACATTGGAT ACAGTTATTT 7320

TTTGAAAAGA TTTTTACGTT CATTGaAAGT TCCTCCTCTT TTTTTCTACA TAGACTAGTA 7380

TAAATCTATG TAGAAAAAAG TCAACACTTT AACACTATAT ATTGTGGTGC GTTTTTCTCA 7440

TAATACTAGT TTTTGTGTTT TTGAAAGAAA AATGAAAACA CGTGAATCCT CGTTTTTAGG 7500 

AGAAAAAACA CATTGGTCAG TGTTTGACAA ACAAAAAAAT TAAAATTTTG AAAGAGAACA 7560 

TTTAGTGAAA AAACACTAAA TGTTGTGTTG ACATTTGAGT TAGGCACAAG AAGAAAAAAT 7620 

AAGCGGATTT GTTATTGAAT GAGTGGGATT TTACAGTAGC TAGTAACATA AAAAACAAGT 7680

GCTGACTTTT TTTCAAAAAT AGTAACCTGA AAAAAGTGTC TCTATATAGA AGGAAAATAG 7740

AAGAAAACAG GAGATACATC CAGAAATTCA TTGACTTTCC GTGTAAGCAA GCGTATTCTT 7800 

TTAATTGAGA ACGATTCTCA TTTTCGTTTG AAACTTTTTC ACATGGAGGA AAGTAAATGA 7860 

TTACGTTAGC GCAGGCAGAA GTTGGGAAAG TCTATACAGT CGAAAGCGTC CAAGCAGATA 7920

TTCAGACAAA GAAACATCTG AACAATTTAG GTGTCGTTGC AGGCCAAGCA GTTGTCCTTG 7980 

TCAATTATCA GAATCAAAAT GGTATTGTTT TATTACATAA TAGTCGAATT GCGCTGACAG 8040 

ATACGATTTT ACAAGCGATT CATGTGGAGG AGCGCTCGGC GAATGAGAAA GTTTGGGTCT 8100 

CCTTGGATAC CTTAAAAGTT GGAGAACGCG CTACAATAGT AGGGATCCAC GGACAAGGAG 8160 

CGGTGAAACG TCGGTTGATG GACATGGGCT TAACAAAAGG AACAGTAATT TTTATTCGCA 8220 

AAGTAGCTCC TTTAGCTGAT CCAATTGAAA TCAATGTACG TGGcTACGAA CTAACGTTAC 8280 

GAAAAAGTGA AGCGGaACTA ATCTTGGTTG AAAAGGAGGA AAGCGAATGA CTCAGCAAAT 8340

TGCACTAGCC GGCAACCCTA ATAGCGGTAA AACGACAACG TTCAATATGT TAACAGGAGC 8400 

CAACCAATAT GTTGGAAACT GGCCAGGGGT GACTGTGGAA CGCAAAGAGG GGGTGGCAAA 8460

GAAAGATAAG ACACTCATTA TTCAAGATTT GCCAGGGATT TACTCTCTTT CTCCTTATAC 8520 

GCCTGAAGAA ATTGTGGCAC GAGATTATCT TTTGGAGGAC CAACCTTCAG TGATTTTGAA 8580 

TATTCTCGAT GTGACCAATT TGGAACGGAA TCTTTATTTA ACAACACAAT TAATTGAAAC 8640 

TGGATTACCG GTTGTTTGTG CGTTAAATAT GATGGATTTG TTGGAAAAAA ACGGCCAAAC 8700

ATTGAATAGT GAAAAGTTAA GTTATGGCTT AGGCGTGCCT GTTGTTGAGA TTAGTGCTTT 8760 

GAAGAATCGT GGGTTGGATC ATGCTTTAAA ACAAAGTAAG CAGCGAGCGA ATGCCATAGA 8820 

GACAGAGGTT ATTTATCCTT CTTATGATAA TCGCTTAGAA GCAGCACTTG CTGAAATTGT 8880 

TGATATTTTA GGGAATACTG TTCCTGAAAC GCAACAGCGT TGGTATAGTT TAAAACTTTT 8940 

TGAACGAGAT GTTCGAACCA AAGAGCAATT ATTATTGAGT TCTTTTCAAG AAAAAGAAAT 9000

TGAAGAAGTC ATTCAAATTA CTGAAAAAAT CTTTCAAGAT GAAAGTGAAG CGATTATCAT 9060

TAATGAACGC TATGCCTTTA TTGCTCGATT AATTGCTTTG TGTGCGACGA AAAAAACAGA 9120 

AATGACATTT ACTCACAGTG ATAAGATTGA CCGAGTGGTA ACCAATCGGT GGTTGGcGTT 9180 

GCCAATCTTT GCGTTTGTGA TGTGGTTGGT CTATTACTTA TCTATTCAAA CGGTTGGTAC 9240 

AATGGGGACA GACTGGTTAA ATGATGTCTT TTTTGGGGAA TGGGTTCCTC AATTTGTGGG 9300 

AAATTGGTTG GCGCAATGgC AAGTAGCGCC ATGGATGCAA AGTTTAATTT TAGATGGGAT 9360 

CATTGCAGGT GTTGGGGCGG TGTTAGGATT TTTACCACAA CTGGCCGTCC TGTTTTTATG 9420 

TTTAGGTTTt TAGAAGATTG TGGCTATATG GCCCGGATTG CCTTTGkCAT GGATCGCTTG 9480 

TTTCGGAAAT TCGGCCTATC TGGTAAGTCG TTTATTCCAA TGTTAATTGC GACAGGTTGT 9540 

GGGGTCCCAG GCGTGATGGC TAGTCGAACG ATTGAAAACG AACGCGATCG GCGAATGACT 9600

ATTATGGTCA CGACTTTTAT GCCATGTTCA GCGAAGTTGC CAATTATCGC TCTGATTGCA 9660 

GGTGCATTTT TTCCTAACCA AAGTTGGGTC TCGCCCTCTG CTTATTTCTT AGGGGTTGCT 9720 

TCCATTGTGT TATCAGGAAT TGCTTTAAAA AAGACCAAGC ATTTTTCGGG AGATCCAGCA 9780 

CCATTCATTA TGGAACTTCC CGCCTATCAT TTGCCACAGC TTAGAAGTGT GGTTCGGCAT 9840 

GCCTATGAAC GATGTCGCTC ATTTGTAAAA AAAGCAGGTA CGATTATTTT TGTTTCCAGT 9900 

ATTCTCATTT GGTTCATGTC ACATTATTCT GTGACGTTTC AACCCGTTCC AGAAAGTCAA 9960 

AGTATTCTAG CATTTCTAGG TAAAGGGTTG GCGGTGCTAT TTATCCCATT AGGCTGGGGT 10020 

AATTGGCAAG GGGCAGTTGC CACAATTACA GGATTGATAG CGAAAGAAAA TATTATTGGT 10080 

ACCTTAGGTA TTTTATTTGG CAATGTGAAA GATGTTTCTG AAAACGGGGT AGAAGTCTGG 10140 

GGCGCACTGC AACACACCTT TACGCCAGTC GCGGCCTATT CGTTCTTAAC TTTTAACTTA 10200 

TTATGTGCAC CTTGTTTTGC TGCAATTGGC GCAATTCGTC GTGAAATGGG AGACCTCAAA 10260 

TGGACATTAG GAGCCATTGG TTATCAATGC GGTTTGGCTT ATATGGTCAG TTTTGTGATT 10320

TATCAATTAG GCCATGTGTT AGTCGAAAAA GGCACACTAA CTCTGGGGAC TTTTTTAGCG 10380 

ATGGGTGTTG TTTTAGCTGG ATTTTATTTC CTTATAAGAA AACCAAAGCC AGACAAAGAA 10440 

AGTGTCCAAG CAATCACTTC TTTAGAAAGA GGGTAATAAC GGATGGCAAC AATTATTTTA 10500 

TCCATTTTAA TTTTTGGAAC AGCAGGAATC ATCACCTATC GACGAATCAA AAAAGGGGAA 10560 

AGTTGTGAAG ATTGTCAAAC AGCTTGTCCT GTAAAAAAAG AACAATCTTC ACAaGAATAA 10620 

CAACTGAAAA AAATTCCCTC TTGTATAAAT AATAGCGGAG TGCTACAATA TTGAGGTAGC 10680 

TGAAAAGCCA TTTTATACAA CGGGGGCATA GCTCAGTTGG TTAGAGCATC TGACTCTTAA 10740

CCAAATGCTC TGCTTGTGAA AAAAGCTTTA TAAAACTTTA TAAATAGGTT TTTAGTTTGT 10800

GAATTTTTTT ATTGCCAACA TACTGGCAAC AAATTGCCAA CGAAAAAAAT TCTAAAAACA 10860

AGAAACGATA CCACGAAGAT ATTGTTTTTT TGTTTTTTAT TCTTTAAACC CAAATGCGTT 10920 

TTCTAATGCT TCTGCTGCAC GCTCTTTTAC TCCTGCAGTT AAATGTGTAT ATAAATTAGC 10980

AGTTGTCGAA TACGATGCAT GTCCCAGCCA TTCTTGAATG GATTTCATAT TCTGTCCACT 11040

ATCTAACATT ATTGAAGCAT TAGAATGACG CAAGTCATGA AAGCGAATTT TTTTCAATCC 11100

ATTTTTCTTA ATCAACTTAC TCAAGCGATG TGTTACATAT TGAGGCTTCA TCAACCGTCC 11160

ATCTTCCATA GTAAATACGT ACTCACTATC TTTTAAAATA TAATCCTCTC CTAAAAGAAT 11220

ACGATTATCT TGTTGCCATT TAGCTTGATT TACTAATGCT CGTTCAATCG TTTCAACCAA 11280

AGGCAACGTA CGATAGCTGG AAGAGTTCTT TGTTTGATCT TTCTTTATTA ACTGGGACAT 11340

ATTCTCTGGA TTTTCTATGA TTACATGGTT TATCATGATT TTTTTGTTTA AAAAATCAAT 11400

AGCAGACCAT TTTAATCCTA AAACTTCACT TCTTCGCAAT CCATAAAATG CTGTTAAAAG 11460

AACTACCAAA TAGAGCTTTT CTTCTGACAA AATATCAAGT AATTTCAAGA TCTCGTGTTG 11520

ATTATAAACT TGAGAAACAA ATTGTTTCTT TTTTGGCTTT CTTACTTTAT CTGCTACATT 11580

TGTACTTATC AATTCTGTAA TAACAGCCTC TTGAAGAGCT TTGCGTATAA ATGCGTGATA 11640

ATGAATGACC GTATTAGCTG TACAGTCATC ATTTAACATA TATTGGTAAA AATCTTGTAA 11700

ATGATAAGGC TTTAACCTAT TTAGTTTAAT TTTTTTAGGA TAGAAAAACT TGTAAATTCG 11760

ATTATGGACA TAATGCTTAT ACCCAACATA TGTGTTAAAT GCTACTTGTT TTTTTGATAC 11820

TTCTAAATAA TTAAGCAAAT AGTCATAAAA CAATAAATCT GGATTAGCAC ATTCACATTC 11880

CATAGCTAAT TGACTTTCCA ATtCTAGTAA AAATTCATCT CTAATTTTTT CtGCTCGTTT 11940

TTTATTTCCT TTCTTTTTTA ATTTTGTATC TTTCCACTTT TGTCTAATTT TTCCGTTAGA 12000

GTCTTTATAA GATACAATTA CTTGTAATGT TCCTCTTTTC TCAATTATGT TTGCTCTTAA 12060

TTTCAATTCA ATCTCTCCTC TTAAACTGAA ATATCGGCAA AAATTTTGAC TATTCTATTA 12120

TAGCATGATT TTTACCTTCA TCTTGTATTT TTAATATATA CTCAATGACA GCTTTTTTAG 12180

GTATTTTGTA CCTTGCTCCA ATTTTTATAT TTTCTATTGA TTTATCTTTC AGCAAATTAT 12240

AAGCTTTATT TTTTCCAATA TTTAACATTT TAGTTAAATC AGTAATTTTC AATATCGAGG 12300

GATAACTTTC AAATATTTTT TCATAATCTT GAAAATCCAT AAAATTCACT CCTACATTCT 12360

GATAATAAGT ATTGCACTTT TTTATTTTAG GTGTTAGAAT AACAAAGTAT TAATTTGGGC 12420

TGTTAGCTCA GTTGGTAGAG CAGATGACTC TTAATCATCG GGTCGCGGGT TCGAGCCCCT 12480

CACAGCCCAT TGGGTGCCAA ACCCTTAAAT ACAGTGTAAA CGTTGGTGCT TAGCATTTAG 12540

AAAACGTATA TACTGTATTT TTTTGTTTTA AAAAAAGACT GTAGTACCCC GCTTTTTTAC 12600

TAAAAGCCCA CTGTAGTCAT TGATATTTTT GATAAGAAAG GAACATAATT ATGAGTAATG 12660

AAATTGCTAA TATAACAATA ATCCGAGAAA CTTTACAAAA CTATACTGCA AATGAAATCT 12720

GTAAACATAC AGGATTAAAT TTGAGTACTA TAAAAAACTA AAAAGTGGAG AACGTTTAAT 12780

AGAAAAACTA AATTTACATG ATGCTATTTG TCTCACTGAA TTTGGTTTTA AAAATAATAG 12840

AAAAAGTGTT GAAATCAACA TATGGAAGTA ATTCTTAATC AGCTGCTTTT GTCAAAAGCA 12900

CATAGGAATT TCACCTCTCT CCAAGTTTAC GGAGAGCcTT ACGGAAGTAT CATTATCCCA 12960

AATTCAAATC ATCGCTTtTT ATgACTATAA AAATtaTTTT AGAATGTTtA TCGCTCCTTC 13020

TTTACGATAA GAAATCGTGG CACAACTAAA ATATGCTCCT TGGAGTGTTA ATGTACTGAT 13080 

AGAATGTATT TAATTTTCAA TGATCAATAG GTTTTCTACA TAAAAGAATT TACTAATCTT 13140 

TTAAATATTA GAAGATTATT TTAAACAGAA AAAGTGGAAA GGGGCTTTTT TTAGTTAGTT 13200 

AATTTTAAAT TTTAAGATAG CTTCTATCAA TTTAATTTCT AGCTGTCTTA GCATATACTC 13260 

ATCTACATAG GTAATAAATT GCTCCGCCTC ATTAAATCCA TTTGTCAAAC ATAACTTAAC 13320 

TATATAACTA TGATAATGTT TGAGTATCAT CTTGATCGAA ATTGGATCTT CTTCTAGAGC 13380 

AGATTGAATA ACGGATAGCG GAACTAATGG ATACTGCCGT TTTTTTTTGT TGTCTACCAT 13440 

AGGTCATACT CCGTTTTTAA TTGTTCAACT GCCTTTTTTC GTTGATACCA GATACCACTT 13500

GTACTCATGT TGAGAATTTT ACCGATTTCA GTATTATTAA AACCTGCAAA ATAATAAAGT 13560 

AGGATGACCT TTCTTCTTAT CTCTGATAAA TGATCAATGG CTGTTGATAA CTTTTGATCT 13620 

GAAATTAATA TTTCCATTCC AAACAAGAAA TAGACATCTT GTTCATCAAA ATGAGTTTGA 13680

GAGTACTTGC CtTCAGTTGA ATCTGTCAAT GGTTCCTCAT TATACTTTAT GTTAGkGTTT 13740 

CGTTTTCGAA TAGACGTAAT TTCGTTTCGT AATACTTTTT TGCAGTAGGC ATCAAATTGG 13800 

TGTTGTATAG ATTTCTCATT ATAATTTTGT TGCATGATAA TATGCTCCTT TTTCCCCTTT 13860

CCACTAATAC AACAATCAGA GAGCCCTTTT TTTTCGAAAA AACTAGAAAA AAACAAAAGA 13920 

CACGTAGTTT TTATCTACGT GTCTTCCATT TTTTTCAGAA CAACTGCAAA GAAAAGAAAT 13980 

ACATATTATA CAAAACAAAA AAATATTCTA ATTTTTTTCA TATAGAAATA ACTCCTATTT 14040 

TAAAATTATA AATCGTAACG AATGCGATTT ATCCAATGTA ATTTTACATA CTTTATTTTA 14100 

TTTGTCAACA AAAAATGAAC GATCATAATA ACGTTTTCCA AACTTATTTC GCTAGATACT 14160

ATTTTAATTA TTCTTCACTA TGACATCATA ATAAAGTTTT AATCAGAAAA TTTAGACGAT 14220 

ATGAAACGGA ATAATCAATT AAAAGCTAAA TATGCTTTGA GAAAAAATAA TTTGTTTTTA 14280

GCGGAAGACA TATAGATAAA AACTATGTGT CTTTTATTTT TCTAGCTTTT TCGAAAAAAA 14340

AGGGCTCTCT GATTGTTGTA TTAGTGGAAA GGGGAAAAAG GAGCATATTA TCATGCAACA 14400

AAATTATAAT GAGAAATCTA TACAACACCA ATTTGATGCC TACTGCAAAA AAGTATTACG 14460

AAACGmAATT ACGTCTATTC GAAAACGAAA CmCTAGAATA AAAGAAAATG AGGAACTATT 14520 

GAACGTCTCA AATGAAGCTA AGTACCCTTA CTCATTTTGA TGAACAAGAT ATCTATTTCT 14580 

TGTTTGGAAT GGAGATATTA ATTTCAAATA AAAAGTTATC AATAGCCAAA CCAGTTATTG 14640

GACATAAAAA ATTTTTTATT TTGCAAGTTT AATGATACTG AAATCGATAA AATTCTCAAT 14700 

ATAAGTATGA ATGGATTTAA TGAAGTGGTC AATTTATTAC CTATGTAGAA GAGTATACAT 14760 

TGAGGGAGTT ATAAATTTTC AGAATCTATA TTGAAATTTA AAGTTAAATA ATTAAAAAGT 14820 

CCCCTTTCAA TTATTTTGGT AGAAAATTTA TAGaTCATTG AAAATTAAAT ACATTCTAGC 14880

AGTACGTTAA CACTTCAAGG AGCATATTTT AGTTACGCCA TGATCTCTTA TCGTAAAGAA 14940
GGAGCGATAA ATATTCTAAA ATAATTTTTA TAGTCAATAA AAAGTGATGA TGTGAAGTTG 15000

GGATAATGAT ACTTCCGTAA GGCTCGCCGT AAACTTGGGG AGAGGTGAAA TTCCTATGTG 15060

CTTTTGACAA AAGCAGCTGC TTAAGAATTA GTTTTAAGAG TAAATTAAGT AAGACAAAAA 15120

GCATCATGTA AGTTTAGTTT TTAAGCACTT AATTTTGTAT ATTTAGAGAT TTCATTTGTA 15180

GTATATTTTT TAGTTTCTTG aATTTTAtTA TATTAACtAT TTCATTACTC ATAATCATGT 15240

TCtTTTTTTT AAATCtAAaT GATTATAGTT TAGTAAAAAA GAGAGGATGG AGTCCCTTTT 15300

GAGAATATAA AATACGGTAT ATACGTTTTT TATGCTAAGC ACCAATGTTT ACTCTGTATT 15360

TAATAAACTA ACATAAGAGA CTATAACCTT TGACCCGATA ATTATGATTA TAGTTATTAG 15420

TATAAATCTT TAAGTAATAG AAAAGATGTG TTGAATGAAG TATACATTCT GTAATTAACA 15480

ATGAATCATC AGTCATTGGA TAGGAGGCTT TATTTTAAAA GTGTTAAACG ATTCATCTTT 15540

AAGGTGTAGA CAAGAATAAG TAAAAATAAA AAGCTAACTA TCTCTCGAAA TTAAATATTC 15600

TACAGAATGT TGTTTAGAAA AAAGGACTAG TAACAAAAGC TACCTACATT AGGTAGCTTA 15660

AAAGAGTGAA AAGAATATAT GTTAAATTAC TAATTTTATA GCATAAAGTA AAATATTTGT 15720

CAAAAAAAAG GGGGAGGGGA GAAGGATGTA TTGTGCACGT GATCCATGCA GAAAGAATCA 15780

TTGCCTATAA AATAmAAAAA TTTACGAGGA GAATGATTCT TGAAGCAACA AATGAAAGCA 15840

AAAAAACAAT ATAAGACATA TAAAGCTAAG AATCACTGGG TAACTGTCCC TATTCTTTTT 15900

CTAAGTGTGT TAGGAGCCGT AGGATTAGCT ACTGATAATG TACAAGCCGC GGAATTAGAT 15960

ACGCAACCAG AAACAACGAC GGTTCAACCC AATAACCCCG ACCTGCAGTC AGAAAAGGAA 16020

ACACCTAAAA CGGCAGTATC TGAAGAAGCA ACAGTACAAA AAGACACTAC TTCTCAACCG 16080

ACCAAAGTAG AAGAAGTAGC GCCAGAAAAT AAAGGTACTG AACAAAGTTC AGCTACCCCA 16140

AATGATACCA CAAACGCGCA ACAACCAACA GTAGGAGCTG AAAAATCAGC ACAAGAACAA 16200

CCAGTAGTAA GCCCTGAAAC AACCAATGAA CCTCTAGGGC AGCCAACAGA AGTTGCACCA 16260

GCTGAAAATG AAGTGAATAA ATCAACGTCC ATTCCTAAAG AATTTGAAAC ACCAGACGTT 16320

GATAAAGCAG TTGATGAAGT AAAAAAAGAT CCAAACATTA CCGTTGTTGA AAAACCAGCA 16380

GAAGACTTAG GCAACGTTTC TTCTAAAGAT TTAGCTGCAA AAGAAAAAGA AGTAGACCAA 16440

CTACAAAAAG AACAAGCGAA AAAGATTGCC CAACAAGCAG CTGAATTAAA AGCCAAAAAT 16500

GAAAAAATTG CCAAAGAAAA TGCAGAAATT GCGGCAAAAA ACAAAGCrGA AAAAGAGCGy 16560

TAyGArAAAG AAGTCGCkGA ATACAACAAG CATAAGAACG AAAACAGCTA TGTCAATGAA 16620

GCGATTAGTA AAAACCTAGT GTTCGATCAA TCTGTCGTGA CGAAAGACAC TAAAATTTCG 16680

TCGATTAAAG GCGGAAAATT TATCAAAGCA ACTGATTTTA ATAAAGTAAA TGCAGGGGAT 16740

TCAAAAGATA TCTTTACAAA ATTACGGAAA GATATGGGyG GGAAAGyTAC TGGCAACTTC 16800

CAGAATTCCT TTGTAAAAGA GGCAAATCTT GGGTCTAATG GTGGGTATGC GGTTCTTTTA 16860

GAAAAAAATA AACCAGTGAC AGTGACCTAT ACAGGACTAA ACGCTAGTTA TTTAGGACGt 16920
AAAATTACAA AAGCAGAATT TGTTTATGAA CTACAATCCT CACCAAGCCA AAGTGGAACG 16980

TTAAATGCAG TATTTTCAAA CGATCCGATT ATCACrGCTT TTATTGGTAC AAACAGAGTC 17040

AATGGTAAGG ATGTTAAAAC ACGCTTAACG ATTAAGTTCT TTGATGCGTC AGGTAAAGAA 17100

GTACTACCAG ATAAAGATAG TCCATTTGCG TATGCGCTGT CTTCTTTAAA TTCAAGTTTA 17160

ACGAATAAAG GTGGCCATGC GGAATTTGTT TCTGATTTTG GGGCsAACAA TGCGTTCAAA 17220

TACATTAATG GyTCrTATGT GAAAAAACAA GCGGATGGAA AATTTTACTC ACCGGAAGAT 17280

ATTGACTATG GCACAGGACC TTCTGGATTG AAAAATAGTG ATTGGGACGC TGTAGGTCAC 17340

AAGAATGCCT ACTTTGGTTC AGGTGTAGGT CTAGCyAATG GrCGTATTTC CTTTTCTTTT 17400

GGTATGACAA CAAAAGGAAA AAGTAATGTG CCTGTATCTA GTGCGCAATG GTTTGCCTTT 17460

AGyACTAACT TAAATGCGCA ATCAGTGAAG CCTATTTTCA ATTATGGGAA TCCAAAAGAA 17520

CCAGAAAAAG CAACGATTGA ATTCAATsGA TACAAAGCCA ATGTCGTTCC TGTwCTTGTG 17580

CCrAAtAAAG AAGTCACTGA TGGyCAGAAA AATrTCAATG ATTTAAATGT GAArCGTGGC 17640

GATTCTTTAC AATACATTGT GACAGGGGAT ACGACAGAAC TTGCCAAAGT AGATCCAAAA 17700

ACAGTAACmA AACAAGGGAT TCGAGATACm TTTGATGCAG AAAAAGTGAC GATTGATTTA 17760

TCCAAAGTGA AAGTTTATCA AGCAGACGCA AGTCTrAACG ArAAAGACTy AAAAGCTGTT 17820

GCTGCAGCrA TTAATTCAGG AArAGCTAAA GACGTGACTG CTTCTTATGA yCTcAaTTTA 17880

GATCAAAACA CCGTCACAGC AATGATGAAA ACCAACGCrG ACGGyTCyGT TGTTTTAGCA 17940

ATGGGGTATA AATATTTACT TGTCTTGCCG TTTGTAGTGA AAAATGTAGA AGGCGATTTT 18000

GAAAATACAG CTGTTCAGCT GACAAAyGAT GGnGAAACGG TAACAAATAC AGTGATTAAC 18060

CATGTGCCaG GTaGTAATCC TTCCAAAGAT GTAAAAGCAG ATAAAAACGG TACAGTTGGC 18120

AGTGTTTCTC TACATGATAA AGATATTCCG TTACAAACAA AAATTTATTA TGAAGTGAAA 18180

TCTTCCGAAC GTCCAGCyAA CTATGGCGGA ATyACmGAAG AATGGGGCAT GAATGATGTC 18240

TTGGACACGA CCCATGATCG TTTCACAGGk AAATGGCACG CTATTACrAA yTATGACCTT 18300

AAAGTAGGGG AyAAAACGTT AAAAGCAGGA ACAGATATTT CTGCCTACAT TCTTTTAGAA 18360

AACAAAGACA ATAAAGACTT GACGTTTACr ATGAATCAAG CATTATTGGC mGCsTTAAAT 18420

GAAGGAAGCA ATAAAGTAGG CAAACAAGCT TGGTCTGTGT ATCTGGAAGT CGAACGGATy 18480

AAAACAGGTG ACGTAGAAAA CACGCAAACA GAAAACTACA ACAAAGAGCT TGTkCGTTCT 18540

AATACsGTGG TGACGCATAC rCCTGATGAT CCAAAACCAA CCAAAGCCGT TCATAACAAG 18600

AAAGGGGAAG AyATTAAyCA TGGAAAAGTk GCTCGTGGTG ATGTTCTTTC TTATGAAATG 18660

ACnTGGGACT TAAAAGGGTA CGATAAAGAC TTTGCCTTTG ATACAGTCGA TCTTGCGACA 18720

GGCGTTTCTT TCTTCGATGA TTACGATGAA ACGAArGTGA CACCAATCAA AGACTTACTT 18780

CGTGTCAAAG ATTCTAAAGG GGnAGACATT ACGAACCAGT TCACGATCTC wTGGGACGAT 18840

GCCAAAGGCA CGGTGACrAT yTCTGCCAAA GACCCACAAG CCTTTATTCT AGCGyATGGT 18900
GGGCAAGAAT TGCGTGTAAC rCTCCCTACA AAAGTCAAAG CcgATGTTTC TGGkGATGTT 18960

TATAATTCAG CGGAACAAAA TACATTTGGy CAACGAATTA AAACCAATAC yGTTGTCAAC 19020

CATATTCCAA AAGTGAAyCC TAAAAAAGAC GTGGTTATTA AAGTyGGTGA CAAACAAAGt 19080

CAAAATGGyG CCACAATCAA ATTAGGGGAG AArTTCTTCT ATGAATTTAC AAGTAGTGAC 19140

ATTCCTGCAG AATACGCTGG wGTTGTGGAA GAATGGTCGA TTAGCGATAA ACTAGACGTC 19200

AAACATGACA AATTTAGTGG cCAATGGTCT GTGTTTGCCA ATTCTAATTT TGTTTTAGCA 19260

GACGGAACCA AAGTGAATAA AGGGGACGAC ATTTCGAAAC TATTCACGAT GACCTTTGAa 19320

CAAgGGGTAG TGAAAATCAC GGCCAGTCAA GCCTTTTTrG ATGCGATGAA TCTAAAAGAA 19380

AACAAAAACG TTGCACACTC ATGGAAAGCG TTCATTGGTG TAGAACGAAT TGCGGCAGGA 19440

GACGTTTACA ACACAATCGA AGAATCTTTC AACAATGAGA AGATTAAAAC kAATACGGTA 19500

GTGACrCATA CGCCAGAAAA ACCACAAACr CCACCAGAAA AAACAGTGAT TGTACCACCA 19560

ACACCAAAAA CACCGCAAGC ACCAGTAGAG CCATTAGTGG TAGAAAAGGC AAGTGTrGtG 19620

CCAGAATTGC CGCAAACAGG CGAAAAACAA AATGTCTTAT TAACGGTAGC TGGTAGTTTA 19680

GCCGCAATGC TTGGCTTAGC AGGCTTAGGC TTTAAACGTA GAAAAGAAAC AAAATAATTG 19740

AAGACGAAAA GAAGGACGAA yTGmTTGTCC TTCTTTTTTT AGTTATGGAG GGGAAAmATT 19800

GGAAGCAGTA GTAGtAGAAA GAGAGGCAAA GGGGATGAAA GAAATTGCCA TCCAAGAAAA 19860

AGATTTGACC TTACAGTGGA GAGGAAACAC AGGTAAGTTA GTTAAAGTTC GATTrAArAA 19920

TACACGTGCA ATGGAAATGT GGTACAACAA ACAAATTACC GAAGAAAACA TTCAAGAGAT 19980

CACCACGTTG AATATyATTA AAAAyGGAAA ATCTyTGGCA TTAGAAGTAT ATCCAGAAAA 20040

AAGTATCTAT GTGAAACCAA ATTTAGGCAG AATCAATGTG CCTGTCTTTT TTATCAAAAC 20100

ACCTATTAAT AGAGGGATTT TTGAAGAGAT CTTCGGCGAA ACTTTAAAAT CATAAATAAA 20160

AAATTATTTG GAGGAAATTA CAATGAAAAA AATTATTTTA TCAAGCTTGT TTAGTGCAGT 20220

ACTAGTATTC GGTGGCGGAA GTATAACAGC ATTCGCTGAC GATTTAGGAC CAACAGATCC 20280

AGCAACTCCA CCAATTACCG AACCAACTGA TTCTAGTGAA CCTACGAATC CTACTGAGCC 20340

GGTGGATCCT GCAGAACCGC CAGTAATACC AACTGATCCA ACAGAACCAA GCAAGCCAAC 20400

CGAGCCTACA ACACCGAGTG AGCCAGAAAA GCCAACAGAA CCAACAACGC CAATTGATCC 20460

TGGAACGCCG GTTGAACCGA CTGAACCAAG CGAGCCAACA GAACCTAGTC AACCAACCGA 20520

GCCTACAACA CCAAGCGAAC CAGAAAAACC TGTTACTCCA GAACAACCGA AAGAACCAAC 20580

TCAACCAGTG ATTCCAGAAA AACCAGCAGA ACCAGAAACA CCAAAAACTC CTGAACAGCC 20640

CACTAAACCA ATAGACGTAG TCGTTACACC TAGTGGAGAA ATTGATAAAA CGAATCAATC 20700

GGCAGGAACA CAACCAAGTA TTCCTATTGA AACAAGCAAC TTAGCGGAGG TAACACATGT 20760

ACCAAGTGAA ACTACTCCAA TTACAACAGA AGCTGGGGAA GAAATTGTAG CAGTAGATAA 20820

AGGTGTTCCG TTAACCAAAA CACCAGAAGG ATTAAAACCA ATTAGCAGCT CGTATAAGGT 20880
TTTACCTAGC GGAAACGTTG AGGTAAAAGC AAGTGATGGA AAAATGAAAG TATTGCCACA 20940

TACAGGAGAG AAATTCACAC TCCTTTTCTC TGTATTGGGA AGCTTCTTTG TATTAATTTC 21000

AGGATTCTTT TTCTTTAAAA AGAATAAGAA AAAAGCTTAA TCAGAAAGAG TAATTTCGAT 21060

AAATTTCTGT GAGGAAAGGA GGTGCTTTGC TTGTTTATTT TATTTACTAC mAAGCnAAGT 21120

CATGTATTAC TAAATGAATT AGGGTTTAAT AGAAAAATTG ACCTGGTACA AGGTATTGAA 21180

CAGTTAACAA AAATCAATGC GGATTTAAAA AAATCGACGG ATCGAATTCT AATTACGTAT 21240

GAGATTTATT ACAACGTGAC AGACGAGAAA GCTTTTTTAA GAGAAACAAT GACTTTGCCG 21300

GTAGTTGAGG AAACTGTCCT CCAAAACATG GTACAGACTG TAGAAACCAA TGAAGAAGTA 21360

AGACAAGAAG ATTTAGAAGC GTTTCTTTAC CTGTTCAAAA GAAATGTGAG TAATAAGTCG 21420

AAGAGAATAG GCCGTTCAAC AGGTTCTTTA AAAAAGAAAG CAGAAAAGCA ACAAAAATCA 21480

ACTGTGTTTA CAACAATTTC TCGCTATAGC TATTTCTCAG TGGCTTTACT TATATTTTTA 21540

GCAGTAGGTG TTTTTACAGG TGCGTCATTT GGTTCTCAAA GTAAACCAAC GAGCCAAAAT 21600

GATTCTAAGT TAGAGGAACG ATTAACACAT TTACAAGAAC AAGTGCATGA GCAACCACAA 21660

GTTGATGTTG TTGCTCGGTT TTTCTTATCC aGTCTGTACT CTGGTGAAGA GAGTGACAAA 21720

GCCACGCAAA AGCAAGTTAA AAAGTATGTT TCTGAAGGTG TACTCAAAGA GATTAAGGGG 21780

TCTAAAGAAC AAATAAGAAC GTTATTTCTT TGGGAATCAA AAAAAGAAGG CAACCAATGG 21840

CTCCTTACGT ACGTTGTTAC CTTGAAAGTG GCTAAAGAAG AAACTGAAAT TCGTCAAATG 21900

AAAGTAGCTA TGGAAAAGAA AAACAATAAT TATCAAGTAA CAGAACTTCC AGAAGAAAAA 21960

GAATTTACTA TTAATGAATA ATAATTGAGG AGGCAATTCT ATGAAATATG AACGTCCATT 22020

AAAAAGAGAG TCACAAATCA AGGAGTTTGA GTTAGGCACG CACGCGGCAG TAATTGAAAA 22080

AGTTCAAAAG AAACGGTCTC AAAAAGGGAA TGATATGTTT TTACTCTCTC TTTTGGGAAA 22140

AAGTAACGAA AAAGGGGTTT ATTTTCTTAC CTTCGGGAAT GATTATACTG AAGATAATTT 22200

ACGCTATATT TTAGCCAGTA TCCAAGACAA CGGTGTAGAG ATTCCAGACG TTGACTTTGG 22260

CTATAATCGA GAAACTTTTG AATTTCTGAA AGGTAAAGAT GTCTATATTC AGGTTGAAGA 22320

mCAAGAGTAT AAAGGGAAAG TTAAACaTGC GGTAACaAAT TTTTTAACTC aGGATGAATT 22380

TGAAGAAAGC GAAGAAATGG AGTTTTCAGA AAGCAATACT GAAGAAGATT GGTAAAATAA 22440

GCCATTTCTT AGAATATAAA AACTGAAGGA GAGACAAAAC TATGGATATA GTCACAAAAA 22500

TTATTACACT AATTGGAGGA ATCATTGGAC TTGTTTCAGC AGTTAGTATT ATGTTTGGCG 22560

TGAAAGAAAT TCGTTCAGGT ATGTCGAATG ATGATCCACG TACGCTTGAT AAAGGAATAG 22620

AAAAAGTAGT TGTTGGTGGG GCTGTTATTT TAGCAATTGG TGGGGTAGTC GCCTATGTTA 22680

TTACACAGGT AGGaGCTATT CGTTTTTAAA AGGAGGAAAA AAATGAAGCT ATTAGTAGAA 22740

ATGATAGTCA ACGGTCAAAC TGAATGGGAA GTTGTAGAAG AAGAAAATGC GCCACAGGCA 22800

ATTATTCAGA GTCGAGGAGG CTTTTCATTT GATGAGAATG GAGAGCTCAT TGTCAATGAT 22860
GATGAAATAA GTTATACAGG CGTGTTTGAG GTTTGCGAGA CTAACTTATT GGATTTCACT 22920

GTTAAGGAAG CAGAAATCCA TAGATTTTAT CACAAAAAAT TAGAAAAGCT AGGGaTAGAT 22980 

CCTCTAACGT TTGAAAACTC GCmAGAAATA GCTAACTGAG CGGAGGGGaA GCnATGAGAA 23040 

ATCCAAAAGG AACGACCGAT CAGCTATTTA AAAGTTTAGC TGAATACmAT CCniACmACgA 23100 

aTGrAGCCAT GTCTAAAAtA GcGAAAGTCC tAGTCCCtTT AGGAATCgCT ATTTTAGGCA 23160 

TTCTTTTTAT GATTGAATTG GCGAATACCC AAAAGAAATT TCAATCAGAA GACGGTGGTT 23220 

TGACAATCGA GCTATTAACT AACATTGCAT TGAAGTATGT AATCGCGTAC GTCTGTATTA 23280 

TGGGTTCTGG CTACATTATT GATGGCATTG TTTGGTTTAC AATTCAAGCA GCCAAATGGA 23340

TTAATTCCAT TGTAACAGCG ACAGGAACAA GTGAAGCGAT CCCACAAATG GAAAAAGTTT 23400 

CTTGGTGGGC AAAACCAATT GTATTTTTCT TTCAAATATT TGCATATATT GCATTAATAT 23460

TATCTAGTAT TATTGCAAAT ATTCTTATTT TCTTACGAGG GATACAATTG TATATGATTA 23520 

AAGCTCTTGC ACCGTTGTTT GTGGCGTTTT TTGTTCATGA TGAATTACGT TCGATTGCCA 23580 

TGGGGTACAT GAAACAAATC ATGGCCTATG CGCTTCAAGG CGTTCTGTTG GTTCTATTAC 23640

TTGGGTTAAT TCCGATATTA ACAGCAAACG ATTATCTATC CTTTGAATCT CTTGAAGGTG 23700 

GCATTTTGgC AGGTGCTGGA GCGATCATCA TTAATGTTAT GACGTACTTT AATTTAATTT 23760

TAAAATATGT GGCAGTGATT ATTCTTCTAG TTGGTTCACA AGGATTTGCC AAACGATTAG 23820 

TAGGAGCGAT GTAAAATGGC GATTAGTTCA GAATTCTATA rAGATTTGTC AAAAGTAGAG 23880 

AAAAAAATCT GGGGAATTAC AGTCAGAGAA TTTAAAGCGT ATGTTTGCTT TGTTTTTATA 23940 

GGGATATTTC TTCTGTTAGA AGTATTTTTT TTGCCTGATC TTTTATTTAT GGTTTGTGCG 24000 
TTAG 24004

(2) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

ATTTGAAACC TGTTGTTGAA TCCATGAATA CTACTGGTTG GTAATTTGGA nnGATATCTT 60 

GTTTCATACG TTTTTTCGCT CCTTTGCCCT GATCCATTTG CTGTATCAGA GTTATTTTGT 120 

CGGTATACAA CAATGCTATC TTACCATGTT TTCAGTTGGT ATGCAATTAT CTTTTTGGAT 180 

TTCTTTTAAT TTGTTTGCCA AACGAAACAT CATGGAAAAC TTCAAATAAT TGTTGATTAT 240

TTTGGGTCCG TTTCAACATT TTTATGAATT GATCCGTATA TTCCAGCGAG TCTCCTGACA 300

TATTATTTCT CAATTTCCAC GTTTCTTCTA GTTGTTCTGG TGTCATTAGC AACTCTTCTT 360 

TTCGCGTACT TGATTTTTTA ATATCAATTG CTGGGAAAAT ACGGCGTTCT GCTAATTCAC 420
GAGATAAATG GAGCTCCATG TTGCCTGTAC CTTTGAATTC TTpATAGATG ACATCATCCA 480


TTP<7GPTAPP 


TGTATCCACC 


AATGCCGTGG 


CTAAAATAGT 


CAGACTGCCC 


CCTTCTTCGA 


540 


Irti i ilivjol 


AGCACCAAAA AATCGTTTAG 


GTTTGAAAAA 


CGCAGCTGGa 


TCTATTCCGC 


600 


LriL 1 1 nrtou i 


TPP.APPAPT A 


OVJ>><wvurivvri 


CTAAATTATA 


GGCCCTTGCT 


AAACGAGTGA 


660 


1 1\3 1 LL/i 1 


t a n a at a a pt 


APGTPPPGTT 


TGTCTTCAAC 


TAAGCGCATT 


GCTCGATCCA 


1 *"i r» 


7\ Air'AAT'TT* 

GAACCAAl 1L 


AGLAAL 1 GGL 


PT ATP ATTTT 


GTGGTTGTTG 


ATCAAAAGTT 


GAAGAAACCA 


780 


CATGGGGI 1 1 


AACGGl iGGG 


TPTA a ATP AG 


TPAPTTCTTC 


AGGGCGCTCG 


TCAATTAACA 


840 


T\ 7\ 7\ T\ rp TV TV 

ACAGAATCAA 


1 lCCAGAiLA 


PP a TP a TTTT 
CCA1 CAl 111 


PTP.TAATPCC 


ATTAGCAATT 


TCCTTTAAAA 


900 


CACTTGTTTT 


I CCCGGl ill 


PPPPPPPPT2X 


PAATTAAAPP 


ACGTTGCCCA 


AAACCAATTG 


960 


GGGCAAATAG 


Al GAA1 GA1 1 


PP TP? TP A TA 
CG1C1 ICrilrt 


ATPTP.PPAGP 


CGTTGTTTCT 


AACGTTAATT 


1020 


GTTTTTGAGG 


AT. AGAAGGG 1 


Gil AuLbU 1 C 


PAAAATv^TGG 


CCGCTCTTTC 


GCTTCTTCAG 


1080 


GAlCfl ICCG 


ATTGAGAGl I 


I G AAGA 1 GGA 




GTAACGTTCT 


GATTCCTTTG 


1140 


GCGGACGGGG 


LTTGlCAGlT 


AC T I I G 1 GGG 


PATTTPTPAA 

L/A 1 1 1 L 1 Lrtft 


TCCGAAACGA CGAATTTGAg 


1200 


7\ Tv t\ tv A TV TV 

AAGAAG AAA 1 


Tv rp Ty tv Tv i. /-< m /^rp 

AiAAAtGlGl 


I G 1 GCAL 1 lb 


PTPPATAATT 


AATTGGTPGP 


AAAAAGCCAT


1260 


AACCA1 CL 1 G 


7\ O 7V A 7\ Z^* 1 TV TVT'A 

AG AAAG AA I A 


i G 1 AAAMl 1 L 


PTTPPATPA A 
L 1 1 LLrt 1 Vjrlri 


GAAAAAPPPT 


TGCTTCTCAG 


1320 


CTTGTGGAGG 


AA I GAGAGG 1 


A A A/" , 7\r , R7ATT 
AAAGALAH.1 1 


PTTTTTTP.P.T 
L 1 1 1 1 1 1 oo i 


CATTTGGCAT 


AATAAGGATT 


1380 


TTAAC 1C I 1 t 


i^/^i^^rnTy^^Tvrp 

GGGG I AgGA 1 


AAA1 Al Lilt 


TA APPT APTG 


tTTTCCaATt 


CAGCCATgTa 


1440 


AAJ. AG i GGG1 


GA1 1GA1GGA 


PPTPA anPPP 
GG 1 Ln/iJlLuvj 


APPTTPTTPT 
ML LI 1L1 lol 


TCTTCGGTTT 


CTAGCATTTG 


1500 


AA 1 A 1 LAbLo 


GG I AAAGGAG 


A 1 AAL 1 1 1 1L 


PATAATPTPP, 


TCGTACCCAC 


GTAAAATATA 


1560 


1 itl AC A 1 1 A 


1 AAA 1 AC 1 CC 


TPPTDPPTTP 
1 LC 1 ALL 1 1 L 


AP.PPATPAAG 


CCGGCAGTGA 


CTAGGCAAGC 


1620 


GGCiGCGGGl 


AAA 1 L AGACu 


PPnPTLPTTP 
LuAL 1 rVL 1 1 L 


TPPAPPPTPP 
1 OLnLLU 1 VJL 


AATTGATTTG 


GGCCATTTAA 


1680 


f s A r PAATT'ATA 

GA i AA 1 GA1 A 


TTPPPTTPSIl 
1 1 CLL 1 1 LAA 


PPPaaPPaTP 
LLonnoLn 1 L 


APPPPPPATP 

rioLLLLLnl L 


CGCACCAACT 


CAGGGATATG 


1740 


rp rp 7\ ^* T> f™* 

G 1 1 AJ. I i GG 1 


tp n pp a t a a a 


TPPT ATPAAT 
1 Lu 1/11 Lrlri 1 


A ATTTPAPPA 
rtrt 1 1 1 LnLLn 


GTTCCTTGTG 


CTTTTAAAAG 


1800 


I AA1 GGGGi 1 


AAPPPTTPTT 
ftfttuv) Hull 


VjLrtrt.V3 1 L 1 U 1 


PPPAAAAPPA 

UOLrtrtrtrtLLrt 


GGATACGGAT 


AGGTTTTCAC 


1860 


Lvj 1 VyU I Lrt 1 1 


T T P a P a T P A T 


PTP,ATP,P.ATP 
o 1 vjrt 1 o on i o 


AAPTTCAATC 


ATGTCTTCTT 


CAATTGTCAT 


1920 


PTTA APPPPP 


ATTTCTTGCA 


ATTTCGCAAT 


PAAGCTTTCT 


AAATGCTCAT 


AAATAACATT 


1980 


T PP. A a P p T T P. 
1 LurYrVLL 1 lu 


ATTCCTTCAC 


CCATAGCTGC 


TGCCATAGCT


AAATAAGTCC 


CAGCTTCAAT 


2040 


TCGATCTGGA ATAATCGAGT GACGACAGCC ATGTAATTCT TCGACTCCTT CAATCCGAAT 2100

AACATCCGTT CCAGCTCCTC GAACTTTTGC ACCCATGTTA TTTAATAATG TAGCAATATC 2160

AATAATTTCT GGcTCACGCG CAGCATTTTC AATAATCGTT TTGCCTTTGG CTTTAACTGC 2220

GGCTAACATG ACATTGATTG TAGCGCCAAT AGATACCATA TCCATAAAAA TTCGGGTTCC 2280

TTGTAAGCCA GCTTCGTCTG TTCTCAAGTA CATCGCCCCA TGTTCGTTGG TTACATGAGC 2340

GCCTAATGCT TCAAATCCTT TGATATGTAA ATCAATTGGA CGAGGGCCTA AGTAACAACC 2400
GCCAGGTAAG CCGACAACGC CTTCCCCAAA TCGACCAAGA AG?GAGCCCA TAAAATAATA 2460

GGATGCACGT AAGCTATTGA TTTTTCCTTT TGGCATTGGC ACAGAAACAA CTTCTGTTGG 2520 

GTCAATAATT AATGTATTAT TTTCAAAAGT AATTTTAGCA CCCATAATTT CTAAAATTTC 2580 

AATTAATGAG TGAACATCCT GAATGTCAGG AACGCCATCT AAAGTCACGG GCGAATCTGC 2640 

TAAAATAGCA GCTGGAATTA AAGCGACCAC ACTATTTTTG GCGCCGCTAA TGGTTACTTC 2700 

GCCTTTT 2707 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



TCACCTTATT GATAGCGCTT CCTTATATTG nAATAATGCG ATTTTCTAAA CAAAAAATTA 60

TAGGTGGTAT TGATAATGAA AAATCCGTAT GTCCAAATAA CCGACCAAGA ACAATATAAA 120

AAGTCAATTC GTCGAATTCC TCCAAAAGAT CCTAATATTT TTTTAATTGA GAGACCTACA 180

GCAGGTGAGA TTATTGCTCT ATGGTGGACA ACCTCTTCCC GTGGCAAATC AACTGATAAA 240

TATATACCAA AATATTTCAA GCGTATCTAT CGAGTAGATA TGATTAAACA ACGTAAACTA 300

TTPATCAAAG CTAACGTTAT CTCCGAAGAA AATAATACTT ATGTACTGAC TGAACTAGGA 360

GAAATGTTAG TACATAAATA CTTTTATATT ATTGAAGAGC ATAATAATTC TTCCAATGAT 420

TCTTTCAATA CTGAACGAAG AAAGTTAGCT GATAAAATTC TATATTCGCT AGGCGGTGAA 480

ATATAATGCA AACGGAAGTT AAAGAGTTTA TTGCTTTTGA TATTGAAACA ACAGGTTTAG 540

ACTTCCATCA GGATGCTGTT ATTCAGATAT CAGCTGCAAG ATACGTTGAT GGTATTGAAG 600

TTGATTATTT TGATACATTG GTAAATAGTG ACTACATTCC AGATGAAATT ACCAAATTAA 660

CAGGAATCAC TAATGACCAA GTACTAAATG CGCCAACTTT AGATGAAGTT ATGCCTTATC 720

TTTTTGATTA TTTAGGAGAT ACGATTCTAG TTGGACATAA TATAAAAAGT TTTGACTTTC 780

CTTTTTTAAA AGCAAAAGGA TATAATATTG CAGAAGGACA TGAAATTTAT GATACTCGCT 840

ATTTTGCAGC CACACGAAAG CATGGGGCTG TAAACAATCA ATTAACTACA TTGAAATTAC 900

TTTTTGGTAT CGATGCGATT TCACATAATG CATTGAATGA TGTTAGGATA TCTGCAATTG 960

TATTTATGGA ATTATTAAAG ATAGAACCTC AAGAAAAAAG CTCTACTCTT AAGAAAAGTA 1020

GCTTGGACTC TTTTGAATTA GATGATGATA CTACACCTTT TTTTGAAGGA ATGTCTTTTG 1080

TAGTAACGGG GGCATTTGAT GATTCTAAAT ATAACCGTAA ACAAATTGAA ACACTGATAA 1140

AACAACATGG CGGTAGGGTA TCGTCAAGTT TATCTGCCAA GACTGATTAC TTTATCCAAG 1200

GTATTCAAAT TTCTTCCCAA CTTAAAGACG GAAAGCACAG CAGTAAAGAA CTAAAATATA 1260

TAGAGTTACG TGAAAAAGGG GTTGATATTT ATAAATTTAA TGGTAATCAG TTTTATGAAC 1320
TAATTACAAA CTATAGAAAA TTGAGGAAAA ATTAATGACA GAACTAAAGA AAATAACTTC 1380

AAAAAATGGA CATGTTTACC TCTTTAAAGA TATACAACAT AAACAACGAA ATGAGTTAGG 1440

CATAGCCTCT GGACAATCAG TACTGATTTC AATTTTTGAA CCGGATCCAT TCTATTTTAC 1500

AGAAGATATT CAACTTATTC ATTATAAAGA TATCGTTAAA ACCGAATCCG TTTCTTCAAA 1560

AGAAAATGAA TTTTTTGAAT ATGTAAAACA AAATTCACCA GTTGCTTATA AAAAGAAAAT 1620

CTCGGATAGT TATAGTATCT CTGAGTATAT TAATTATGCT CCACAACTTA AATTTAAGTA 1680

TATTATAAAA GATAGTATAC TAACTATTAC GGGGGAAATT AAAGACAGAA ATGTTAACTT 1740

TCCAATTTCT CAGCCTTACG TATATGTCGA TAAACCAGAA AATGAAATTC CGTTAGATAA 1800

TAATTTAAAA TTCCAATATG TAGTCAACGA TTTCGATATA CATGCTCCCG AATACAAAAT 1860

ATCGTTTATT TTAGGTACTT CAAAAGCCGA GTATGGTCGT TCTTTCAATA TGAAAAATTT 1920

CGAAGGTGTT TTACGCACCC CATCTGAAAT TAGTTCTATG AAAGCAAAAA AAAAGAAATA 1980

GTTACTTAAA TAAAGCAAAA AAGTTCTACG CCTAAAATTC GAATTTAGAC GTAGAACTTA 2040

TTTATTTCAA TGTCCGAACC AAATAAACTT CTTCACAAAA GCCTTTAAGT CCATTATAAA 2100

TCCGTTGTGC TCGTGTTTTC TTTTCACATA AAGCAAAGAC TGTCGGTCCA CTGCCGCTCA 2160

TCAACGCTGC ATCAGCGCCG TACTTCAGCA TGCGATCCTT GATTTGTTGC ACGATTGGAT 2220

GTCTGGCAAT CGTGACACTT TCCAACGCAT TGCCAACCGT TTGGGTCATC CCAGTATAAT 2280

CCCCATTTTC AATAGCTATT CTTAGTCCAG CAATATCTGG ATGGTGCAAT TCATCTACCG 2340

CTAAATCATT AAAAACGGTA CTAGTGGATA CACTAATTCT TGGTTTAACT AACACAATCC 2400

AACATTGAGG CATGGTCGGT AATGCTTCAA TTTTCTCCCC ACGTCCGTTT GCAAAGGCTG 2460

TTCCGCCCCT TAAACAATAA GGAACATCCA TACCAATTTG GCTGCCAATT TCACAGAGTT 2520

CGTCCATCGT CAACCCCAGA TTCCACAGTT TGTTCAAGCC TCGTAATGCT GCGGCACAGT 2580

CACTGCTCCC CCCTGCTAAA CCAGCTGCCA CGGGGATACG TTTTTCAATA TATATTTTAA 2640

TTCCTTTATG AATATTATAG GTACGCTTTA ATAAGTCAAC TGCTTGATAG ACATGATTCC 2700

GTCGATCCAC GGGTAAAAAA GAGCTGTCCG TTTCAATACG AATGATGTCT TCTTCTAGAT 2760

TTTCAAAAGT TAAATGATCG GCTAAATCTA CACTTGCAAA AATAGATTCA ACTTCATGAT 2820

AGCCATCCAC ACGTTTATGC AGAACATCTA ACCCAAGATT AATTTTCGCA GGAGCTCGTT 2880

CGATTATTTC CATCGGTTCC CTGATTCACC TTCCTTAATC TTTTGTCTAC CAACATTTTA 2940

ATCTTTTAGA TTCTATCATA TGTCCATGAA GAAGTACACC GAAGATTGCT CTTTCTCTGA 3000

CGACAGATGG CTGAAATTAA AAAGAGCTTA GGAACGAAGG GCTCTTCCTT CACTCCTAAG 3060

CGGACTTTCT AAATTATACG GCTTGACCGA TGAATTCGAT TTCAACGGTG CGTGTTAAAA 3120

CATCTGAATA GCTATAAGAT ACACGTTCAA ATGAGTTCTC GTCTGGATCC AAGTCAACAA 3180

CAAAAACAGA TGGATAAGTC TCCGTTAAGA TTCCTTTACG TTCTGTTTGG CGCTTTCTTC 3240

CAGTCTGAGC AACTAACATA ATTTCACTGC CGATACGGCC TTCCAAATCT TTCTTAATTG 3300
ACGATAACGT TGTTGGCATT GCATTCACCT CGAGAAACAT TATATCACTT TTTTTAAAAA 3360

ATTTCAAGTT TATCATACTA GAAGTCTGTT TGCAACTACT TTTGCTCGAC AAACCGGCTG 3420

TTTTGTCGAA TTTTTATTCA TTTTCTAATG TATATTTCAT TTTTGCTTTG GCACGATCAT 3480

AGGCACTTCG GACAGCTGAA AAAGTTCCCC CACGTACTTT AGCAATTTCT TCTAACGTGT 3540

AGCCTTTTAA GTATGCGTTT AAAATTTGTT GTTCCAACGA GGAAAGACGT GTCGGAAAAG 3600

CTGCCAATTT TTCTTGAATC ATTAATTGAC CTAATGGGTC GGTAGCCGCT GTCCCTAAAT 3660

AGTCCACTAA CTCTTCACCA GCGCTTTCTA ATCGTTGTTC CAGAGAAATA GACAAGGTAT 3720

CTGTTCGCCG CTTATAGGCG CATTGATGGC GAATTAAGCT ATGTACGTGA TTTTCAAAAT 3780

TTCGTTTAAA AAACTTTCCT AATGATGCTT GTTTTTCTGA ATCAAATCGT TCTAACGTCC 3840

GAAAGAAGAC AATTCGCCCT TCTTGAAGCC AGTCCTCAGT ATCATAATCT TTTAAATAGT 3900

ATTTTTTCAT GCACTTATAA ATCACTGGCT GATATTGCCC ATATAAGGCT TGAAATTGTT 3960

CTTGGTCTCC TGCTAATGCT TGTTGAATCT GTTCTAACAT AAAAAGCCCT CCTTTTCGAT 4020

AAACGTATCG AATGATTTAG GAGAGCATAG CATGTTAGAA ATTTATGACT TAAGCGAACA 4080

TTTGTTTAAT CTTTTTTTTG AGACAATTCT TCTAGCTTCT CAGAAAGTTT TGATAACTGC 4140

TCCACGTTCC ATGGAGAATT TCGACGAAAT TGCTGGAAAT GGATATCCGT TGCATGAATG 4200

GCAATCGTCT TTTCTGTTTT CTGGACAGCT TTGTAAAGTT CATTGGCCGA AGTCCGCAAT 4260

GCACCTTGTG AAAAGACCAC CCATTGTTCA GCCAAATCAC TCGTTGCGAC TGTTACTTGC 4320

GTTAAACGAT CATTTTTTTC ACCAGCTAGA CGTTCAATAT AACTATCTGC TGTTTCATCT 4380

TCTTTGGTGA AAACCACGGT CAAAAGGTAC TTTTTGTACG TTTGCTGAAT ACCTGGGACT 4440

AATTGGGCAT CAAAGACCAC AATGATTTCC AACCCTTCAT ATTTGGCGTA ATTGGACAGA 4500

CGACTAAGAA GCAGTTCACG CGCATCCTCT AAACGCTCTT GCTTTTTCAA CTGGACCAAT 4560

TCTGGCCAAG CACCAATCAT ATTATAGCCA TCAACGATTA ACAATTGTTT TTTCATACAA 4620

TCGCCTTCTT CTATAATGGG TTGCGGCCAC GATAGACTTC ATACATTAGC AAGCCTGCAG 4680

CCACACCAGC GTTTAAGCTT TGAACGTGTC CAGTCATTGG AATCGTTAAT AATTCATCCA 4740

CTTCTTTATG AAGACCTTGA CTCATGcCAC GACCTTCATT TCCGATGATT AAGGCAATTG 4800

CGCCTTGTGT ATTCCAGCGA CGATAATCCG TCCCAGACAT ATCTGTACCA AAAATCCAAA 4860

ATTGATTCTC TTTTAGCGTT GCAATACTTT GCGCTAGATT CGTTACACGA GCCACTGGAA 4920

TATGTTCCAC GGCCCCTGTT GACGCTTTGG TTACGGTTGG TGTAATTCCC ACTGCACGAT 4980

GTTTCGGAAT GATAATACCG TCCACGCCTG TCGCATCAGC TGTTCGCAAA ATTGAGCCAA 5040

AGTTATGTGG GTCCTCTAGA CTGTCTAAAA TTAGGAAAAA TGGTGTTGCT GTCTTTTCTT 5100

TTGTTTGTTC CAATAATTCT TCTAAAGTTA AATATTGATA CGCCGTGATT GCTAAAACCA 5160

TGCCTTGATG CACGCCGTGA TCACTCATCG TATCTAATTT TGCTTTAGGC ACCCATTTAA 5220

CTGGAACAGC CTGTTCTTTC GCTGCTTGTT TTAATTGCTC AATTTTCTCG CCACGAGCAT 5280
CTTCTTGTAA AAACAATTTA TTCCCGCGTC CTTGCTGCAA CGCCTCAACA GTGGCATGGA 5340

AACCAAAGAC AAAGTTATCT GCTAGCTCTT CTGAGGTTCC ATCAGCACCT GCTGTCGCTG 5400

GTCGTGGGTT CCGTTCTTTT GAACGCTTAT CTGAACGTTT CTTACCTTTT TTAAAAGGAC 5460

GAGATTTTTC ATTTTTCATT GGTTTCACCT ACTTTCTTGA TACACCAGTT AATTAATTCT 5520

TCTAAACGTT CTTTTTGTTG TGTTAAATGT AAATAACCCA TCAAGGATTC GAAACCGGTC 5580

GCTACCCGAT AGGTCGTCAC ATCAGCATTT TTGGCAAAGG TGTGACTTTT AGCATTCCGA 5640

CCGCGTTTGT ACATCACTTC TTCTTCCTCC GTTAGCATTT CTCCCGCTAA CATCGCTTGC 5700

ATTAAGCTTG CTTGGGCTTT TGCTGACACG TAATGTGTCG CCATGCGATG CAAAACATTC 5760

GGTTTCGTTT GTCCCAAGGA AACTAAATAA TCACGAATGT AAATTTCATA AATCGCATCG 5820

CCAACATAGG CCAAAGCTAA ACCGTTTAAT TGTTTGTAAT CTCTCATTCA CTTCTTCTCC 5880

ATCTCGTTCC TTGCGGAGTA TCTTCTAAAA CAATGCCTTG TTCTTTTAAT AAATCGCGGA 5940

TTTCATCACT TCGGGCAAAA TCACGGTCTT TCCGTGCTTG GTTCCGTTCT TCAATTAACG 6000

CATCGACTTG TGCATCCACT AATTCTTCGT TTTTAAAGAA TAAACCAAAA ATAGCTAACC 6060

AGCCACTGAA AAGCTTATCC ATCGCCACTA GAACTGGTTC CGAAACTGTC GCTTGTTCAC 6120

TGTACTGATT CATCATCTTC GCTAATTCAT AAACGACCGT AATCCCGTTA GCTGCATTGA 6180

AATCATCGTC CATTTCCGTA ATAAAACGTT GTTCTAATTC GTTTAATTGA GCCAAATAGT 6240

GTTCATCCTC TGGCAAACTT GCGACCGCAT CTGCTTGTCT AAAACGTAAA TTTTCAAACG 6300

CATTTTTCAA TTTTTGATAA TTGGCTGCTG CCTCTTTTAG AGTGCTTTCG CTATAACGAA 6360

TCGGGCGACG GTACTGGGTG GTGGACATAA AGAAACGTAA AATTTGCGGA TCCACTTTTT 6420

GAATCATTTC ATGAACAGTA ATGAAGTTAC CTAGGGATTT ACTCATTTTT TCATCGTCTT 6480

CACCAATTGT GACATAGCCG TTATGCATCC AGTAATTGGC AAAGGTATGG CCTGTTTTGG 6540

cTTCGCTCTG TGCAATTTCA TTCTCATGAT GTGGAAATTC TAAATCTTGT CCACCACCGT 6600

GAATATCAAT CGTCTCTTCC AAGTGTTTTG TCGCCATGAC AGAGCATTCA ATATGCCAGC 6660

CGGGACGTCC TTTCCCCCAA GGAGAATCCC AAGAAATTTC ATCTTCTTTG GCACTTTTCC 6720

AAAGAGCAAA ATCCAATGGG TCTTCTTTTA ATTGTTGTTC AACGCCAGTT CGTTGACTTG 6780

CCCCCACTTC TAATTCATCA ATCGATTGAT GGCTTAATTT ACCATAATTA GGGAATTTAC 6840

GGGTGCGATA ATACACATCA CCTGCGACCT CATAAGCAAA ACCTTTTTCA ATTAAAACTT 6900

CGATGAAAGC TAAGATATCA GGCATGTGGT CCATCACCCG AGGATGTAAC GTTGCTGGTT 6960

GGACATTCAA TGCTTGCGTG TCTTCTTCAA AGGCTTTGAT AAAACGCTCC GCCACTTCCG 7020

GCGCAGTAAT TTTCAATTCT TTAGCTGCCT TGATAATTTT ATCATCGACA TCTGTAAAGT 7080

TCGAGACATA ATTCACTTCA TACCCACGAT ATTCAAAATA ACGACGAATC GTATCAAAGG 7140

CGATCGCACT GCGCGCATTA CCGATATGGA TATAGTTATA CACGGTTGGT CCGCAGACAT 7200

ACATCCGAAC TTTACGCGCC TCAATTGGCG TAAATACTTC TTTTTCTCTG GTCAATGTAT 7260
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TATAAATTTT 


AATCATGCCC 


TTTTTCCACC 


TTTCTTCCAT 


TTAGACGAAC 


GACTTTCGCA 


7320 


GGAACACCCA 


CAGCTGTGGC 


ATCTGCTGGG 


ATGTCTGAAA 


TAACAACAGC 


ACCCGCACCA 


7380 


ATTTTTGAAC 


GTTCACCAAT 


CACCACTGGC 


CCTAAAATTT 


GAGCTCGTGC 


CGAAATCATC 


7440 


GCTCCTTTTT 


TCACTGTTGG 


ATGGCGTTTA 


CCCGTATCAC 


GACCGGTCCC 


ACGTAACGTC 


7500 


ACTCCGTGAA 


ACAAAACAAC 


ATCTTCTTCT 


ATTTCCGCGG 


TTTCCCCAAT 


CACAATGCCC 


7560 


ATGCCGTGAT 


CGATAAAAAC 


TCCCGCTCCA 


ATTGTGGCTC 


CGGGATGAAT 


TTCTACGCCC 


7620 


GTTATGAAAC 


GCCAAAATTG 


TGCATGAATC 


TTCGCCAACA AAAACAACTG 


ATGCCTATAT 


7680 


AAGAAATGAG 


ACAAGCGATG 


CCAAAACAGG 


GCGTGCAAGC 


CAGGATACGT 


TAAAAGAACT 


7740 


TCTAGTGTGG 


TCCGCGCTGC 


AGGATCGTTT 


CTTTTAACAG 


CTGCGACGCT 


ACGTTTCAAC 


7800 


CAACTCATAT 


GGTGAACTCC 


TTTCTATTAT 


ACAAAAATAA AAAACGTCTT 


TTAAGCAACA 


7860 


AAAAAACGTT 


GCTTAAAAGA 


CnCCCaTGCG 


TGGTTCCACT 


TTTCTTCACA 


AGAACTAGCT 


7920 


CTTGCCTCGA 


TACAGTTAAC 


GCCTGCCGAC 


GTCTTCGGTT 


ACTTTTTTCA 


CCAAAGCCAC 


7980 


TCATAGATGC 


ATTTCGACAA 


AAGATGCTCG 


CTGTTCTCAC 


CACCCACAGC 


GTCTCTGAAA 


8040 


TCCATCTGAT 


TGTCTACTTC 


TTCTATTCTT 


TGTGTTTTTA 


TAACACTTTA 


TTTAAATGAT 


8100 


CTAATGCTTT 


TTCTTTGCCT 


AATAATTCGA 


TGGTTTCGCC 


TAATTCAGGG 


CCATGCATTT 


8160 


GTCCAGAAAC 


AGCAATTCGA 


ATTGGCATAA 


ACAAGTTTTT 


GCCTTTAACG 


CCGGTTTCTT 


8220 


TTTGGACGGC 


TTTAATCGCT 


GCTTTAATAC 


TTGGGACATC 


GAAAACGTCC 


AAGGCTTCTA 


8280 


ATTGTTCTTT 


AAACGCATGT 


AACACAGTTG 


GCACTGTTTC 


TCCTTGCAAG 


AATTCTTTCG 


8340 


CAGCATCATC


TAAAACTGGA 


TGTTCATTAA 


AGAACAAGTT 


TGATAGTTCC 


ACAATTTCAG 


8400 


CTGCATAACT 


CATTTGTGGT 


TGATACAAGC 


TAACTAATTG 


TTTTAACCAT 


TCGATTTTTT 


8460 


CAGGAGATGG 


ATTTTCTTCA 


ACACGACCGT 


CAGCGACTAA ATAAGGAATA 


CACATATCTG 


8520 


TTAGTTCACT 


TAAGTCCATT 


TGTTTCATGT 


ATTGATTGTT 


AATCCATTCT 


AATTTTTTCG 


8580 


CATCAAAGGC 


CGCTGGTGAC 


TTGCTTAAGC 


GTTGTTCATC 


AAACAATTTA 


ATCAAGTCTT 


8640 


CTTGTGAGAA 


GATTTCTTCT 


TCGCCAACAG 


GAGACCATCC 


TAGTAAAGCA 


ATAAAGTTGA 


8700 


ACATGGCTTC 


TGGTAAGTAA 


CCTAATTCAC 


GGTATTGTTC 


AATAAATTGT 


AAAATTGTTT 


8760 


CATCACGTTT 


GCTCAATTTT 


TTGCCAGTTT 


CTGAATTAAT 


GATTAAGGTC 


ATATGGCCAA 


8820 


AGACAGGTGG 


TGTCCATTCA 


AACGCTTCGT 


AAATCATTAA 


TTGTTTTGGT 


GTGTTCGCAA 


8880


TGTGATCATC 


GCCACGTAAA 


ACATGAGAAA 


TTTTCATTAA 


GTGATCATCT 


ACTGCAACGG 


8940 


CAAAGTTATA 


CGTTGGCATC 


CCATCGCGTT 


TTTGGATGAC 


AAAATCGCCA 


CCAATGTTAT 


9000 


CTGACTCAAA 


AGTAATTTCA 


CCTTTCACGA 


TGTCATCAAA 


TTTGTATTCT 


GTATTACGTG 


9060 


GCACACGGAA 


ACGAATCACT 


GGTTCTAGAC 


CAGCCGCTTC 


TTTGGCTGCT 


TGTTCAGAAG 


9120 


GTGTTAAATT 


CGCACAAGTG 


CCTGCGTAGT 


GAGGCATTTC 


CCCACGAGCT 


TGTTGTGCTT 


9180 


CACGTTCTGC 


TTCTAATTCT 


TCTGGCGTAC 


AATAACACTT


ATACGCACGA 


TTGCTACTTA


9240 
ATAATTGATC


AATCAATGGT 


TGGTAAATTT 


CTTTACGTTC 


TGATTGACGA 


TATGGACCAT 


9300 


ATTCTCCCGG 


ATGAGCAGGT 


GATTCATCCC 


ATTCCATCCC 


TAACCATGCT 


AAGTTTTCCA 


9360 


ACTGACTTTT 


CTCGCCGTCT 


TCGATGTTCC 


GTTTTTGGTC 


GGTATCCTCG 


ATACGAATGA 


9420 


TAAATTCTCC 


GTCATTGTGG 


CGGGCAAATA 


AGTAGTTAAA 


TAATGCTGTT 


CTTGCATTCC 


9480 


CAATGTGCAA 


GTGTCCAGTT 


GGACTTGGTG 


CGTAACGCAC 


ACGAACTTTC 


GTCATGTTCA 


9540 


TTCCTCTTTC 


CATTTTTTAA 


AAACAGACGG 


TTTAACGTCG 


TTTTCAGTTA 


TTTTTCCAAT 


9600


AAAGCAACGG 


CAATTGCGCC 


GATTCCTTCT 


TTTTTACCAA 


CAAAGCCCAT 


CGTTTCCATG 


9660 


GTCGTTGCCT 


TAATGTTCAC 


TTGTGTGACC 


GCTAAGTGGC 


AACTCGCGGC 


TAGATTCTTT 


9720 


TTCATCTCTG 


CCAAGTATGG 


GCTCATCTTT 


GGCTCTTCCG 


CTAAAATTGT 


ACAATCAATA 


9780 


TTGCCAATAG 


TAAAGCCCGA 


CCGCCCAACT 


TTTTCATTTA 


CTTTTTTTAA 


TAAGTTTACA 


9840 


GAATTCGCAT 


TTTTGAACTG 


CGGATCGGTT 


TCGGGAAACA 


GTTGCCCAAT 


ATCACCCAGA 


9900 


CCAGCGGCAC 


CAAGGATTGC 


ATCAATAATT 


GCATGTGTTA AAACATCGGC 


ATCTGAGTGA 


9960 


CCTAACAGAC 


CTTTTTCATA 


AGGAAGAGTA 


ACCCCACCAA 


TAATTAAAGG 


TCGTTGCGGA 


10020 


ACTAATTGAT 


GAACATCAAA 


ACCTTGACCG 


ATTCGAATCA 


TACTAAGTCC 


TCACTTTGCT 


10080 


TAATTTTTAT 


CGTTTTTAGG 


ATGGTTATCC 


TTGCGATCAT 


TAATCCCGCG 


ACCCGCGTGC 


10140 


GCCGGTTTGG 


CAAAAATCAT 


ACGCCCAGCT 


GCGGTTTGTA AAGCACTTGT 


GACGACCACT 


10200 


TCGATTTGCT 


CATTCATGTA 


ATGCTGGCCA 


TCTTCTACCA 


CAACCATTGT 


GCCATCATCT 


10260 


AAGTAAGCGA 


CCCCTTGTTG 


ACGCTCTGTT 


CCGGCTTTTA 


CAACCATCAC 


ATTCATCGTT 


10320 


TCACCTGGAA 


TCACCACTGG 


TTTCACAGCA 


TTAGCTAAAG 


CATTAATGTT 


TAAAACAGGC 


10380 


ACATTTTGAA 


ATTCAGAGAC 


TTTATTTAAG 


TTGTAGTCAT 


TGGTCACAAC 


CACGCCATCT 


10440 


AATAATTTTG 


CTAATTTGAT 


TAATTTACTG 


TCTACTTCAG 


AGATATCCTC 


GAAATCTCCG 


10500 


TCATACATCT 


CAACGGAGAT 


ACCTTCTTCT 


TTTTGTAAAG 


CATTTCAACG 


CTGGTTCCCT 


10560 


GCCCTTTAAA 


ACTAATAATT 


TnAACTGTAC 


CGCCCATGCC 


GACAACACGT 


CTCGGAATAT 


10620 


TAATTAAGCC 


ATAACT 










10636 


(2) INFORMATION FOR SEQ ID NO: 75: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
GAAnGGGTCT TTTTTCGAGT AGACGTTTGA TnTTAGCGTA TCCAAGAAAG GAAAACCCtA
ACATAAAaAT TTTTTATTTA CGAaGGAGAC CGATTTATAT GGAAAAGAAA GAATTTCACA 
TTGTAGCAGA AACAGGAATT CACGCACGTC CAGCTACTTT ATTAGTACAA ACTGCAAGCA 
AATTTAACTC


AGATATTAAC 


TTAGAATACA 


AAGGTAAATC 


TGJTAACTTA AAATCAATCA 


240


TGGGCGTTAT


GTCTTTAGGC


GTTGGTCAAG 


GTTCTGACGT 


AACAATCACT 


GTTGATGGTG


300


CTGACGAAGC 


TGAAGGAATG 


GCAGCAATCG 


TTGAAACATT 


ACAAAAAGAA 


GGATTGGCTG 


360


AATAATGTCT 


GAGATGCTAA 


AAGGGATTGC 


CGCAAGTGAT 


GGCGTTGCTG 


TTGCTAAAGC 


420


TTACCTGCTA 


GTTCAACCTG 


ACTTGTCTTT 


CAACAAAACT 


TCCGTAGAAG 


ATACTGATGC 


480


AGAAGCAACT 


CGCTTAGACG 


ATGCTTTAGC 


AAAATCTACT 


GAAGAGTTGC 


AAGCAATTCG 


540


CGACAAAGCA 


GCTCAAAGCC 


TTGGTGAAGC 


AGAAGCGCAA 


GTTTTCGATG 


CTCATTTAAT 




GGTTTTATCT 


GACCCAGAAA 


TGGTTGGTCA 


AATCAAACAA 


AATATCCAAG 


ACAACAAAGT 


660


CAATGCAGAA 


GCTGCCTTGA 


AAGAAGTTAC 


GGATATGTAT 


ATTGGTATGT 


TTGAAGCAAT 


720 


GGaCGACAAT 


GCTTACATGC 


aAGAGCGTGC 


aGCAGATATT 


CGTGACGTTG 


CCAAACGTAT 


780


CTTAGCACAT 


TTATTAGGTG 


TGACTCTTCC 


AAATCCTTCA 


ATGATTAACG 


AAGAAGTAAT 


840


CGTGGTTGCC 


CATGACTTAA 


CACCTAGTGA 


TACTGCACAA 


TTAGACCGTA 


CGTATGTTAA 


900 


AGCCTTTGTG 


ACAGACATCG 


GCGGACGTAC 


TTCTCACTCA 


GCCATCATGG 


CTCGTTCGTT 


960 


AGAAATCCCT 


GCAATCGTTG 


GTACAAAAGA 


AATTACTGAC 


AAAGTAAAAG 


CAGGCGATAT 


1020 


TTTAGCAGTG 


AACGGAATCA 


TTGGGGATGT 


TATTATTGAC 


PCAACAGATG 


CTGAAAAATC 


1080 


TGAATTTGAA 


GCAGAAGCAA 


AAGCTTATGC 


AGATCAAAAA 


GPAGAATGGG 


ATAAACTAAA 


1140 


AAATGCAGAA 


ACAGTGACAG 


CTGACGGCAA 


ACATGTTGAG 


TTAGCTGCAA 


ACATTGGTAC 


1200 


ACCAAAAGAT 


TTAGAAGGCG 


TACACAAAAA 


CGGCGGCGAA 


PTflTTGGTTT 



ATATCGTACA 




GAATTCTTAT 


ACATGGATTC 


ATCTGACTTT 


CCAACTGAAG 


AGGACCAATA 


CCAAGCGTAT 


1320 


AAAGCAGTTC 


TTGAAGGAAT 


GGAAGGAAAA 


CCAGTTGTGG 


TTCGTACAAT 


GGATATTGGT 


1380


GGGGACAAAG 


AGCTTCCTTA 


CCTAACATTA 


CCACACGAAA 


TGAACCCATT 


CTTAGGTTAC 


1440


CGTGCATTAC 


GTATTAGCTT 


ATCAGAACTA 


GGTGACGGCA 


TGTTCCGTAC 


ACAAATGCGT 


1500


GCATTATTAC 


GTGCGTCTGT 


TCATGGTAAC 


TTACGTATCA 


TGTTCCCAAT 


GGTTGCTACT 


1560


TTGAAAGAAT 


TTAGAGCAGC 


GAAAGCAATC 


TTTGAAGACG 


AAAAACAAAA 


ATTAGTCAAC 


1620


GAAGGTGTTG 


AAGTTTCAAA 


CGATATCCAA 


GTAGGTATTA 


TGATTGAAAT 


TCCAGCAGCT 


1680


GCCGTATTAG 


CTGATAAATT 


TGCCAAAGAA 


GTTGACTTCT 


TTAGTGTTGG 


AACTAACGAC 


1740


TTAATCCAAT 


ACACAATGGC 


GGCAGACCGT 


ATGAACGAAC 


GCGTTTCTTA 


CTTATACCAA 


1800


CCATACAACC 


CATCAATCTT 


ACGTTTAATT 


AAAAATGTAA 


TTGATGCAGC 


ACACGCTGAA 


1860


GGTAAATGGG 


CTGGTATGTG 


TGGTGAAATG 


GCGGGCGATC 


AAACAGCTGT 


TCCATTACTA 


1920 


TTGGGTATGG 


GCTTAGATGA 


GTTCTCAATG 


AGTGCTACAT 


CTATCCTTAA 


AACACGTAGC 


1980 


TTGATGAAAC 


GTCTAGACAC


AACTAAAATG 


GCTGAACTTG 


CTGACCGCGC 


ATTAAAAGAA 


2040 


TGCGACACGA 


TGGAAGAAGT 


CTTTGCATTA 


GTTGAAGAAT 


ATACAAAATA 


ATTAAGTGAT 


2100 


TATGAAAGCA 


CATCGACAAT 


TGTCGATGTG 


CTTTTTTATT 


GTTGCGATTC 


ACGTATTTTT 


2160 
TTCTTGAAAT AATAAATCGT GCTTCCCAAA ATGAATAGAA AAACCAGCGT GATGCTACTC 2220
ACCAGTTTCC AAGGGAAATT GGTGTGTTTT TCAGGTGGGA GGTTGACGGC GGTAGCATTG 2280
TATTTTTTAG CTTCTTTTTC TTGGATGACT AAGTGTTTTT CCCACGAATA GTCTTGATAG 2340
CCATCGTTTG CTTTGATTTT tACAGTATAA TTTCCAGGAA CAAAAGGTTG TTCCTTTAAG 2400
TCAATCCGAT AATTAAAATT TGTATTAGGA GCCATTGTTA ACGACTCTTG GTGATTTTCG 2460
TAGCGAGGTT TGGTTTCTTT TTCATAATAC AAAGAGGCAT CGACTTGTAA TTTTTTAATC 2520
ATAGCCGCTT GTTTGTTTTG GAGATTCATT AATACGCGGT TACGACCGTT AGCTTGATCG 2580
GTTTTTACTT CATTTAAAGA TAATTCTGGC TGAACAGGTT CATCTGTTTC ACTAAGTAAG 2640
ACACCGACTA CATAAGAAAA ACGATTATTG ATGGCCACAC CGTTTTCTGA GTGAGCAGGT 2700
TGTTCATCTG ACTTTTGTTC AAAGTAGAGT CCTCCTAAAA CGTACCCATT AAATGGTTTT 2760
TCCGGTAAAG TAAGCTGACA TTCAACAGTC TTGGTTGAGT GTTTAGGTAG GTGGATTTCT 2820
TTTGGCAATT GCGCAATCTC ATTAAAAGTA AAAGGTGCAG ACGGATCTTT TTTTGTTTCA 2880
GCATGAGAAT AATCTGCCAA GCCATTATCA TTAGTAATCG CTGCATTGGC TGAAGCTAGC 2940
ACGGTAATGT CGTTGGCTGT TTGATTCGTT AGTTCGATTT GTAGCTTTTG TGTTTGGTTT 3000
GGAGCGACAC GTAAATCAAA ATAACTTGTC TCTTTGGTTC GTTGATTGTC GGGTAAAATG 3060
GCTTTTACAG AAAAAGTCAT TTCGCTTGCT TCTACAGAAT AAGAGCCAAT AAAAAATAAA 3120
AAAATGATGA CGCTAATAGC ATGGAGCCAT TTAAAAAGCT GCTGATTCAT AGTGTGCACT 3180
CCTTTTTGAC AATTTTTTAT AACAAGGTAT TGCATGAAGT TGTTTCATGC AATACCTTGT 3240
TTGTTTTCTT TTAAAGTGGT GTATCATCCA AAATCCAAGT AAGCGTTGTT TTATATTGTT 3300
TAGCTGCAAC TTTTTTCGTT GTCGCAGGAA CAGTTAATTG AATGCCTTGA GCAGCGGTcg 3360
sTCCGCTACC AAAAGCTAAT GTCCAAGTGC CCATTCCTGT TGAAAGAGCG GCAGTCGCCA 3420
CAGGAGAAGT GGCGGCACCA GTTGGATCCA AGGTTACTGG GACCGTTAAT GGCGCCAAAG 3480
AAAGTAACTG TGTTGAATTG GGTGTTGCTG CTAAAAAGGT CAAGGATGCA TTATCCAAAA 3540
CAGCGGGTGT TGAATCACTT GTAGCAAATT GCGCACTCTG TTTAACTGAT AATTTCCAGC 3600
CAAGATTTAG ACCACGTTTG TCAGTTACTT GAACATAGTT TGGCACGCTA ATTAAATCGC 3660
CAGTACTATT TTGCACTTGA TCCAGTTGTG CCGAATAGAT CGCTGTTCCG GCTTGAATTT 3720
GTTTTGATCC AAAATGGATA TTTGAAACAT AGTCAATACT AAGGGGTCCG GCTGTACCAG 3780
GTTGATGAGG ATCAGCAGGA TTAGGTGTCA CAGGCTGAGA AGGGTTCGTC GGGTTGACAG 3840
GTGGCGTGAC AGTATTATCA AGAGCAAACG TAATATCAGC GTTGGATTGA ATAGAAGCGA 3900
CTTGAGCTGC TTCAACTTTT GAACTATTTA AAAGCATAAG AGGCGTAgcA ACGAACAAAA 3960
TGCTTGAAAT GCGCACGATC TTTTTCATTT TTTATTCCTT CTTTCTTGTT TATTTTTAAC 4020
TTTTAAAAAA GCTAAATCCT AATTAGAGGG GTGTATCAGA AAGAATCCAT TGTAATTCGG 4080
CAGTGTAAGA TTCTTTGACC GTTAAATTAA GGTCAGGAGC AACGCTTAAA AAAAGTCCAT 4140
TTTGTTGGTT


CCAAGGAATT 


TCTACATTGT 


TATCGTCCGT 


TGCGGTG7G7 


TGA7AAAC77 


4200 


CGGTTGCTAA


AGAGGAGAGA 


GGAAAATCAC 


TGCCATTTTT 


TCGATAAA7G 


AAAGC7TGC7- 


4260


GGATCGTTTT 


ATTTGTAGAA 


GTTGAATGCA 


TGGGCTGACT 


^ fj* fji ^ ^ 


AACAAATGCC 


4320 


ATGGTGTTTT 


TGCAACTCGA 


CCATCGGCAA 


TGAGGAAAGG 


AGCGCTGA7G 


GTCGA7GG73 


4380


AGTATATGGT 


TCGTTTGCTG 


GGGAAAGGTA 


AATTTTTAAA 


GGAAACTG7A 


GT7C-GCATC7 


4440 


GCACAAAACG 


GAGGGTACCG 


TTTTTGATAG 


GTAAAGATTG 


TTCTTTAG7G 


AT7GGCGTAA 


4500 


TTGcGTTCCG 


CCACCATCAT 


TGCCAGTAAC 


CGTGCTAGAA 


AA7AAAAGAA 


ATTGG7TTA7 . 


4560 


CCATTCTTCT 


GTAACCGTTG 


CTGTATATTG 


TAAGGTGAAT 


TG7GTATTGG 


CTTCCGTAAA 


4620 


ATTATCTGCT 


GGCAGCGAAT 


TATGATAACT 


CCAATACGGC 


AAACCAG7GG 


AAC7A7CAG7 


4680


TAAAAGTTGA 


AGCCCAGCTA 


TTGGCGTACT 


TAAACCTGCT 


GC7GATTG77 


TTG7TAATG7 


4740 


AGCAGCTGAA 


AGTTGGGACA 


GTCCTTTAGG 


CACCGTTGCT 


CG7AATTGC7 


GATTCATCCa 


4800 


CGTTGCTGGT 


TGATTGGTTA 


AACGTGTGGT 


TAACGTAAAA 


CGAATGA777 


GACCACTTAC 


4860 


GACAGAGTTT 


GCAGCTTCCC 


AGAGCCCTTG 


TGCATTTTTC 


CGCTCAA73A 


TTTSCGTAAC 


4920 


CGTTAGTTGA 


GGCACAGGAT 


AAGCAACTGT 


TTGAGAAACA 


GTAGAACT3G 


TTAACAAAGG 


4980


CAGGCCAGTG 


CGATTGCTGA 


GACCTTGGAC 


CCACAAGCTG 


ACCGTTTC7C 


CCCCT7GGAA 


5040 


AGTGGGTAAA 


TTAGCAAATT 


GAAAATGGCC 


ATTGGAATCT 


AAGGACGT7G 


TGGAAATAGA 


5100 


ACCACCATTT 


ATTTGTACTT 


TAACTTGATA 


GGTTCCGTTA 


GA7ACTTGAG 


AATCTTTACC 


5160 


TGAAAGTTGA 


GTTGAGCCGG 


CATTGGCGAC 


TACGTTTGTG 


AC7GGTTG3A 


TTGACCAAT7 


5220 


GACACTTTGG 


TCAGCTGAGG 


GGCTGCTTTG 


AATATAGTCG 


GT7GACG777 


TTGAACGAGC 


5280


AAAGACAATT 


GTAGAGAGCG 


TGTCGCCATA 


TTCTAATGGT 


GAAGCTAACG 


ATGTCGAGAA 


5340 


ATTTCCGGCT 


GCATCGACAG 


AAATCCCTTT 


CACCACAATG 


GGGGTAG73G 


TGCCA.7CATG 


5400 


GCGGTTCAAG 


GTTAATTGAA 


CAGAGTAGTT 


ATAGTTTGGG 


TCTTGTAC77 


GTTGAATGG7 


5460 


CCCTGTTACA 


GTAGTGTCCA 


GATGATTTGG 


TTGAGTTAGT 


TTGGCTGG7G 


TGACT7GAGG 


5520 


AACGGGATAA 


TTACCTAAAC 


TAAGTTGAGC 


TTTGCTAATT 


ATGCCTGT7T 


TTGTA7TATT 


5580 


CCCAATGATA 


ATATTAATTG 


CGCCTAATCC 


aGTGCCGACA 


TCTGTAAA3G 


TTCCTCGAAT 


5640 


CGGATAGTCT 


TTGCCGTTAA 


GAGCAGGAGG 


GATTGAAATT 


CCTCGTTTAT 


AAAGTGCGCC 


5700 


CACATCAAAC 


GAAATGGCGG 


CAGACCATTT 


GGTTCCGGGA 


AGTAACACCA 


AGGTGCTACT 


5760 


TGGAAAAGTC 


AGATAAACAG 


AATGATAAGT 


GCTACTATAA 


GAGAGcTGAC 


GCCGT7AGTT 


5820 


GAACTGTGAA 


TATCTTGAGT 


GGCTGTTCCT 


GGATAACTGA 


CAGTCCCTGT 


TAAAAGAGTC 


5880 


AGAAAATCTT 


GTTGCTTTAG 


AGTGTCGTTA 


TCTAACTGGC 


TTGAAAT77C 


TGTAGGAAGT 


5940 


TGAATAGCAA 


TAATCGGTCG


ATCATTAATT 


GCTAAGCCGA 


TAAGTGTCAC 


AC7GTATTCA 


6000 


TACGAAAGTG 


TGATAATATC 


TTGGTTATAC 


GTTCCCGTGC 


TTAAGGTA7T 


TT7TAAGG7G 


6060 


CTATTACCAA 


CAGAAACACT 


AGCTGGCGTA 


GGTGCCATAG 


GT7CATCC-77 


TTCAGAGGAA 


6120 
GCATGGGTTT GCTCGGTTTC TTGTTCCACG ATATCGGTTG CTT ( GAGTTGT TTGTTGTTCT 6180

GTTGAAACTG AAGTAAATGT TTCACTAAAA GTAGAGGCGT GAGTTGTAGG GTTCAGTGTT 6240 

TCAGAATGAG CTATCAAAGG TTGAGTCAGC GATAAAAAAA GCCATGATAT GATGCAAGAA 6300 

AAGATTCCCC ATACGAACTT ATTCAAAACC ATCCTCTCCT TTTTTGaTAT TATATGtATA 6360 

ATGACGAAAA TATATAATGT GTAAATATTA AGTACATACA TTCTATAATG GGCCATnCAT 6420 

AAAGTAAAGC GATTACATCC GATGGTGnAC AGGATATTTT GTTCACAATC TTTTAGTGTT 6480 

TTAAAGAGGG AAATTTCTCG AAATTCTATG CGTTTTnAAT CCGGGAAAGA T 6531 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

TTTTCGGGAA AGAAACACCT ACCATTTTTG ATGTTTTGGG CAGGGAATAG GAGGTAATAC 60 

ATGGAATGGA GTTTTCCTTG GCTATCAATA GATGGCGACA GAATGTACGA TGATAGCGAC 120 

TTCTCGAAAT TTTTTGAGGG TCTTTTTTCT TATGGCGTTT CTCTAACAAC AGCTAACGCA 180 

TTAAAGGTTA CTGCCAGTCC TAATGGCGGG ATGAAAGTAC AAGTAGATTC AGGCTATGCA 240

TTTGTCGGTA AAGTCTTCTT GAATAGTACT ACAAAAGCTT TAAGTATTGA TGTAGCAAGC 300 

AGTATGCAAG ACAGGACGGA TAGTATTGTG GTTCGCGTGG ATAAAAGTGT GCGAGATGTA 360 

TTTTTAGCTG TTAAAAAGAA CGATACGACA GTAACCCGTA CATCGGACGT TTACGAGCTA 420 

CAGTTAGCGA CAATAAGAGT GCCCCGAAAT GTTTCTAGTA TTACAGGCGA TTTAATCACA 480 

GATAAACGGG CAGATACGAA AGTATGCGGG TATTCTTCGC CGTTTCAAAA GGTTAACGTT 540

TCAGGGTTAG AAGAGCAATA CGGTACAATA CTAAAAAAAA TTGCGGAAGC CAACAAGACG 600 

AGTTATGAAA AAATCCTAAA TGATTTTAAA AACTATGTTG CAAAAGCGCA AACCGATATG 660 

GATTCTAATA TTGAAGAAAT TATCCAATCA GGTAATGGAA AGGTAAGTGC TTTTGATGTT 720 

TTAATTCACG AATGGTTTGC AGCTTTAAAA AATGAATTAG ATACAAATCA AGCATCAAAT 780 

TTACAGAATC AAATCAATGA AATGAAAGCT ACTGAAGAGT TACCAGCTAT AGAACATAAT 840 

TTACTCGGTT ATCCTAATGT ACAAGTTTTG TATTGGGAAT ACGGTATTGG CCTATCAGGA 900 

TTAGCTAATG AGCCAACAGG TCTAGGCGGT AGCAATGTGA AAAAGATTCC TCACAGTGTA 960 

GAATATCTTG ATTTATTCAG TTTCAAAGTT AAAGTGCCAA TGAACTTTAA ATTGGTAAAT 1020 

CCAACAGTAA CAAAAATAGA TAGTCGAACT ATTCGCTTTA TTGAAGCATT TAAAGTTATA 1080 

GAAATTAAAT TTTAGGAGGA AAAGAATGTA TACATTTAAA AAAGGTGATG CAGACTACCA 1140

AGTCATGCTA AATGAAAATT TTAAAGAGAT AACTGATTCT TTCGAAGATG GTTTATTTGT 1200 

CAGAAAAAAT TCTAAAATAT TAGATTTAAA CGATGCTATA TTGCCAGGCA TTTATTCAAT 1260
CCCAGCTACA GGAGTTGATA ATAAACCTTT ACCTAATTCT GGAAGTTTAT TCGTTTGTAA 1320

AGACCCAGGA GGGGTTAGAC AATTATTTCA GACAGAAAGA ACAATATTTA TTCGACAATT 1380 

AGGGGGAGTA CCTTCATCTT GGaCGGATTG GaAAAAAGTA GCTTTTGAAA AAGAACAACC 1440 

TTTTGAAGCT TGGTATTCAC CAGGAACTAA CCATGCTGGA TTCAAAAATA AGGCAAGATA 1500 

TAATTTAGGG CCAGAATTTA GCAACATAGG CCAGCGACTT GGGTTGCCTA TGAAAAGTGA 1560 

ACCGTTGGAG TGGAATAGTG GGCGATGGCA AGCAAAAGTT CTTAGAGACT GCAAGCTAAA 1620 

TATAAGCGGA ACTGTAAAAT ACCATGTTGG TAGTTCGAGA GGTGTTCTCT ATGCTTATAC 1680 

TCATATAGAC AAAGGCCTTG ATGAAGGCGT AGGTGACTTA GGTATTGGAT CAGCAGTCGG 1740

AGCTGTTGGC GGCTTGGATT ATAAAAATGT CGCAGCTTTT GATTTAAACG TTACACTAAA 1800 

AAAAGGTGAG TATCTTGCGT TTCGCTTAGA ATTAGCAGCA GATAAGCAAC TTGATTATAC 1860

TCAATTATCT TCTATGCATA TCACAGAATT AGTATAGGAA CTGAATTTTA AAATAAAAAC 1920 

CGTTTAGCAA AAAAGCTAAG CGGTTTTTAT TTATAGAGAG AAGGAACTTT ATGTGAAGGA 1980 

TGAACTGATT CAAGATGTTG TAGAGCGTTT AGTGCGTATT GAGACAAAGC TCGATAACTA 2040 

CGAAGTGTTA AGAGAAAAGA CAGAAGATGC TAAGGAAAAG GCTGATTATG CTTTCTCTAT 2100 

TGCTAAAAAT AATGAAGAAG ATATCAAAGA AATAAAGGAA AATCAGAAAT GGTCTTGGCG 2160 
GACAATTGCG GGCATAGGGG TGTCAGTAGT TGTCTATTTA TTTACAAAAT ATTTAGGAGG 2220

TGTATAAGAA TGATTCTACC AGATAAGTAC TACAAAATTA TCAAATGGGG CGTGCTAACA 2280 

GTGCTACCTG CAAGTTCTGT TTTGGTTGCC ACACTAGGTA AAGCTTATGG ATGGCAGCAA 2340 

ACAGATATGA CTGTTTTAAC TATTAATGCC ATAGCAACTT TTTTAGGAGT GGTAACAGGA 2400 

GTATCAGCAT ATAATTTAAA AGACAAGGAG AAATAAAAAT GAAAAAGAAA ATTTTAGCAG 2460 

GAGCGCTTGT CGCTCTGTTT TTTATGCCTA CAGCTATGTT TGCCGCAAAA GGAGACCAAG 2520 

GTGTGGATTG GGCGATTTAT CAAGGTGAAC AAGGTCGCTT TGGCTATGCA CATGATAAAT 2580 

TCGCTATTGC CCAGATTGGA GGCTACAATG CTAGCGGTAT TTATGAACAA TACACATATA 2640 

AAACGCAAGT GGCAAGTGCT ATTGCCCAAG GTAAACGTGC GCATACCTAT ATTTGGTATG 2700 

ACACTTGGGG AAACATGGAC ATTGCGAAAA CAACAATGGA TTACTTTTTG CCACGTATTC 2760 

AAACGCCTAA AAATTCCATC GTTGCATTAG ATTTTGAACA TGGAGCGTTG GCTAGTGTTC 2820 

CAGATGGATA TGGAGGATAT GTAAGTTCAG ATGCCGAAAA AGCAGCAAAT ACAGAGACAA 2880

TTTTGTACGG TATGCGCAGA ATCAAACAGG CTGGCTATAC TCCAATGTAT TACAGCTATA 2940

AGCCATTTAC ACTAAATCAT GTAAACTATC AACAAATCAT CAAAGAGTTT CCTAACTCTT 3000 

TATGGATTGC TGCGTATCCT ATCGATGGTG TGTCACCATA TCCATTGTAT GCTTATTTCC 3060 

CAAGCATGGA TGGTATTGGT ATTTGGCAAT TCACATCCGC TTATATTGCA GGTGGTTTAG 3120 

ATGGTAACGT AGATTTAACA GGAATTACGG ATAGTGGTTA TACAGATACC AATAAACCAG 3180 

AAACGGATAC GCCAGCAACA GATGCAGGCG AAGAAATTGA AAAAATACCT AATTCTGATG 3240
TTAAAGTTGG


CGATACCGTC 


AAAGTGAAAT


TTAATGTAGA TGCTTGGGCA


ACTGGGGAAG 


3300 


CTATTCCGCA 


ATGGGTAAAA


GGAAACAGCT 


ACAAAGTGCA AGAAGTAACT


GGAAGCAGAG 


3360 


TATTGCTTGA 


AGGTATCTTG 


TCATGGATTA 


GCAAAGGTGA TATTGAATTA 


TTGCCAGACG 


3420 


CAACAGTCGT 


CCCTGATAAG 


CAACCAGAAG 


CGACTCATGT GGTACAATAC 


GGAGAAACAT 


3480 


TATCAAGTAT 


TGCTTATCAA 


TATGGAACAG 


ACTATCAAAC GTTGGCGGCA 


TTAAATGGAT 


3540 


TGGCTAATCC 


AAATCTTATT 


TATCCTGGTC 


AAGTTTTGAA AGTCAATGGA 


TCGGCAACAA 


3600 


GTAATGTCTA 


CACGGTTAAA 


TACGGCGATA 


ATTTATCTAG TATTGCAGCA 


AAACTTGGCA 


3660 


CTACTTATCA AGCTTTAGCT 


GCATTAAACG 


GATTAGCAAA TCCTAACTTG 


ATTTATCCAG 


3720 


GTCAAACATT 


GAATTATTAA 


TAGCTTTAAT 


ATAAAATAAG GATACACTTA 


TTTAAATTTC 


3780 


TCTCGAGTCG


CCGTCGCCAA 


GGCGGCTCTT 


TTCAGGACCA TTAGCTCAGT 


TGGTTAGAGC 


3840 


AAACGGCTCA 


TAACCGTTCG 


GTCACAGGTT 


CGAGCCCTGT ATGGTCCATA 


GCTAAAAAGG 


3900 


GCACGAAAAG 


GGCAAAAGTA 


GTAATACTTT 


GTCAGATTAT GTTGTTTTTT 


ATAAATTTTA 


3960 


TTAGTATTAT 


ATAGCATTAT 


AAATGCCTGT 


TTTACGGGCT TTTACAATTT 


TTGTGAAAAT 


4020 


TGGACCAAAT 


CCTTGGGATC 


CAATGGGCGC 


ATTAGCATAA AACAGAAAAA 


GAAGGCCCCC 


4080 


GCCGTCAGGG 


CGGGGGCCTT 


CTTTTTACGT 


TGGGCGGCGA ACTTCACGGA 


TTTCGCCCAA 


4140 


TGTTGTATAA 


TCATTGCCTG 


CTAAACGAGC 


AATCGGTTGG AGTTTTTCAG 


GTAAAATGTA 


4200 


CTGATGTTCT 


TCATCGAAAA 


CGGCTGAATC 


AAAATAGAAA TCGGTAATTT 


CTACAATGAA 


4260 


TAAATCCGTA 


ATAATTTGAC 


CTTCATGGTT 


AGCAATAGGT ACATATTGAT 


GTAAGCGAGC 


4320 


TTCCATCCGA 


ATCGGCGCGG 


CTTTGATTGC 


TGGAACAGCA ACAGTCTGGC 


TGGCAATGGT 


4380 


TTCAATACCA 


AATGTGTCAA 


TTTCACTGAT 


CTCTGCTGCT AAAGACGCCG 


AAGTCTGATT 


4440 


CATTTGAGTA 


AGAACGTCTT 


CGTTAACGAA 


ATGGATGACA AGTTCTTTAG


TAGCCAAGAT 


4500 


GTTTCGTGCC 


GTATCTTTAG 


GCTCTTCGTC 


TGGTCGTAAA ATGGACAACG 


TTGCTAAAGG 


4560 


AATTTCAGCT 


GCCGCCGCAT 


TAAAGAAACT 


AAAGGGAGCA GCGTTGACGA 


CACCTGTAGC 


4620 


TGTATCTTGT 


GTTGTAACCC 


AAGCGATTGG 


CCGAGGAATA ATACTTCCTG 


ATAAAAATTT 


4680 


ATACGCTGTT 


TTTGCAGATA 


ACTCGTTACT 


ATTGTAATGA ATCATAGGAG 


CCTCCTAGAA 


4740 


AAAATACTAA 


TTTCTTTGAA 


CATTCGCATC 


TGAAGTATCG AAAGGACGCA 


CAAATTCTTC 


4800 


AATCCCTGCC 


CGTTTTGGTT 


CTAAAAATGG 


CGGTAATGAT AATTTTTCGC 


CAGCGTGTTC 


4860 


ATAGGTTTCA 


TCCTCTAAAA 


ATCCAGGACC 


ATCGGTTGCC AGTTCAAATA 


AAACATGTGT 


4920 


TGCTGCCAAG 


AAaTATTCAG 


ATTTAAAGTA 


AAAaCGTtCC ACAAAGCCAG 


AATTAGGAAA 


4980 


TTCCAAACGA 


TTAATTTTTT 


CAATCCAGAA 


ACGTAGTGCT TCCTCATCGG 


CAACGCGCAA 


5040 


GGCTAAGTGA 


TGGACATTGC 


CGTAGCCTTC 


TTGAGCCGCT GGCAAATCTT 


CCCGATGCTC 


5100 


TACGATGATG 


CTGGCACCAT 


TCCCACCTTC 


GCCAACTTCA AATAAATGAA 


AACTACCTTC


5160 


AGCATCAATC 


AATTTGAACC 


CTAAAACTTC 


AGTTAAAACT AGTTGCAGAT


GTTCGTAATT 


5220 
TTCGACAGTT


ACAAAGGCTG 


GTCCTAAGCC 


GATTAAAGCA 


TGTTCACTAG 


GGACGTTAGA 


5280 


TTTTTTCCAA 


GGCGTGCCGG 


CTGCAACACA 


TTGGTTCTTT 


TCATCAGAAA 


TAAGTTGGTA 


5340 


ACGTTGATTG 


TCAAAGTCTT 


CAAATTCTAA 


ATACTTTTTA 


CCAAAACGTT 


CTTGAATTTC 


5400 


TCCATGATCA 


ATTGCATATT 


CATTAAAGCG 


TGATACCCAG 


TAATCCAAAG 


TAGCATCATC 


5460 


TTTTACACGA 


AAGGAAGTAC 


GAGAAATTGT 


ATTGGTTCCT 


TTGGTTCCtT 


TrGGTAAGCC 


5520 


TGGGAAGTCA 


AAGAAAGTCA 


TATCTGTTCC 


GGGAGAGCCG 


GCATCaTCAG 


CAAAAAwTAA 


5580 


aTGATAkGTT 


TCAATATCGT 


CTTGGTTGAC 


AGTCTTCTTA 


ATTAAACGTA 


GACCTAAAAT 


5640 


ATCGGTGAAA 


AAGTGATAAA 


TTTTTTCGGC 


ACTTGAAGTC 


ATGGCAGTGA 


CGTGGTGTAA 


5700 


ACCTAAAATT 


TGTGTATTCA 


TAATAAAAAT 


CCTCCTAAAA 


GAAAAATAAT 


TTTTTGATTA 


5760 


ATTAACTTAC 


TAAAAGTAAG 


TTTAAATGAT 


TAGTCCTAAA 


AATGCAAAAG 


AAATGTCTCG 


5820 


AATTCGAACT 


ATTATTTTTT 


AGTTGATTTT 


TTAAAAAAAT 


AAGAATATAA 


TAGCCTGAGA 


5880 


TATTTTAATA 


CATAAACTTT 


CGTAAACATC 


GCCGGATGCT 


TGCGACTAAA 


GTGTTAATGA 


5940


ACAGTTCTTA 


GGCCATAGCT 


GAACTGTTTG 


TTGGCACTTT 


TTTTGTTTGG 


AAGGGAAGGG 


6000 


AAACAAGATG 


AGTAAAGCTT 


GGTTAAAAGT 


TATTTTTGGC 


GCTTTTTGTG 


AAGTGATTTG 


6060 


GGTCATTGGC 


ATGAAGCATA 


GCACAACATG 


GTGGGAGATT 


TTAGGGACAG 


TAATCGCAAT 


6120 


CTTTATCAGT 


TTCTATGCGC 


TGATTAAAGC 


AGGAGAAGAA 


TTGCCAGTCG 


GTACAGCCTA 


6180 


TGCTGTATTT 


GTTGGCTTAG 


GTAGTGCTGG 


AACTATTGTG 


ACAGATCATT 


TTTTATTCCA 


6240 


CACACCATTG 


GGCATAGGAA 


AAGTTCTTTT 


TCTGTTGCTG 


TTATTAATAG 


GTaTAATTGG 


6300 


CTTAAAAATG 


GTAACTGGrA 


ATAAAGAGGA 


GGAAACAAGA 


TGAGTTGGTT 


ATACTTAGTT 


6360 


ATTGCAGGCT 


GTTTTGAAAT 


TTTTGGCGTA 


GGTTCAATT 






6399 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

AACAGGAGGA AAACTTGATG GGCATTCCAA TCACCATTGT TTCTGGCTTT TTAGGCGCAG 60 

GAAAAACAAC CTTAATTAAC CATGCGCTTG CGCAATCACC GTTTCCTAAA GAAGAAATTA 120 

TTATTATTGA AAATGAATTT GGTCAAACAG GCGTTGACCA CGCATTACTT TTACAAACCA 180 

CGGAACGAAT TATTCAATTA AATAACGGTT GCATGTGCTG CAGCTTACGA GGAGATCTTC 240 

TTGCCTCTTT ATCAGCAATC TTAGAAGTGT ATCAAGAAGA ACAACAACCC ATTGCCCAAG 300

TCATTTTAGA AACAACGGGC ATTGCTGATC CGCAACCAAT CATTCAAACA ATTTTAACGA 360 

CCCCGCACAT CAAAAATCAT TTCTATATAG ACAGTCTCTT AACGGTCGTT GATGGGCATC 420





ATTGGCAACA ACAGCTTCAA GAAGCCGAAG CQATCAAACA ACTGGCCTTA GCAGATCGCC 480

TCTTJT.TTTC TGTCAAGGAA CCAGCAGACA GCACTCAGCT ACCAACGTTC CAAGAAACCC 540 

TTCGTACCAT TAATCCTTTT GCTGATATTT TATCCTTTCA AGCCAACGAG CCTCTTTTGG 600 

CTAGTGACTT TTTTCAATTA AATAAGTTTA CGGCAACACT TACAGAACAC GAAATGACTC 660 

ATTCCCCGCA TGAGCACGCC CACGAGCATC ATCACCATAC CTTTCACTCC CTCACGCTAA 720 

CGGCGTCCTC AGCCATCAAC GAACCTCTTT TCACTCGTTG GCTTGATTGG TTAATGTATA 780 

CGCATCAGGA AAAGTTATAC CGTTTCAAAG GGATTCTCGC CTTACAGGAG CATGACTTGG 840 
CCATTGCCAT GCAAGGCGTC AACCAGCAAG TGGCCTTTCA AATGACGAAT CAACCCCCTC 900 
AAGAAACGAC AATTGTTCTG ATTGGCAAAG AATTAGAGAC AAAAAAAATT CAAGAAACCT 960 

TTCAAACCCT CAACAAAATA GCTACTCCGT AAATTAATTT TTTACGAAAT GGAGGTCCTA 1020 

TATATAATGG AAAAAACTGA CTTATCTAGC GCTTATCGTC GACTGAAAAG TCCCAACATC 1080 

AAAACACGAA AACGCGCACT TAAAATCATT AAAGAACATA AGCGCAATAA ACAAAAGAAA 1140

ATTGCGTAAT TCACTAAAGG AGCACACCTA TGAAAAAATT TACTCTTCCC CTGTTAGCCG 1200 

CCTTATCGCT AATCCTTTTC GGCGCTTGCG GCAAAACAAA CACCTCTGAT AAAACCGCTG 1260 

ACGGTAAAGA AAAACTATCC ATTGTCACGA CTTTTTATCC TATGTACGAT TTCACTAAAA 1320 

ATATTGTAGG CGATGAAGGA GACGTCAAAT TGTTAATCCC TGCTGGTTCT GAACCACACG 1380 

ATTATGAACC ATCCGCCAAA GATATGGCTA CCATCCATGA TGCGGATGTT TTCGTTTACC 1440 

ACAATGAAAA TATGGAATCT TGGGTACCAA AAGCTGCTAA AGGTTGGAAA AAAGGAGCCC 1500 

CGAACGTCAT TAAAGGTACC GAAAACATGG TCTTACTTCC CGGCAGTGAC GAAGACGGAC 1560 

ACGACCATGA CCACGAACAT GGCGAAGAAG GCCACCACCA TGAATTAGAC CCGCATACTT 1620 

GGGTTTCGCC TCATCGTGCC ATCCAAGAAG TCACAAACAT CAAAGAACAA TTAGTCAAAC 1680 

TTTACCCTAA AAAAGCCAAA ACATTTGAAA CAAACGCAGA AAAATACTTA ACAAAATTAA 1740 

CAGCCTTAGA CAAAGAGTTC CAAACAGCTT TGAAAGACGC TAAGCAAAAA AGTTTTGTTA 1800 

CCCAACATGC TGCATTTGGT TATCTTGCCT TAGATTACGG CTTAAAACAA GTGCCAATAG 1860 

CTGGTTTAAC ACCTGAACAA GAGCCAACCG CAGGcGCTTG GCAGAGTTGA AAAAATATGT 1920 

CACAGACAAC CAAATTCGCT ATATTTATTT TGAAAAAAAT GCCAACGATA AAATTGCTAA 1980 

AACGTTAGCT GACGAAGCGA ATGTTCAATT GGAAGTCCTA AACCCGCTAG AAAGTTTGAC 2040 

ACAAAAACAA ATGGACAATG GCGAAGATTA TCTTTCTGTA ATGAAAGAAA ACTTAACTGC 2100 
TTTGAAAAAA ACAACAGATA CAGCCGGGAA AGAGGTTCAG CCAGAAACCT CTGAAAAAAC 2160 
AGAAAAAACC GTGGCTAACG GATATTTCAA AGACAGTGAG GTGGCTGAGA GAACACTGAC 2220 
AGATTACGCT GGAAATTGGC AATCCGTCTA TCCTTTATTA AAAGATGGCA CATTAGACCA 2280
AGTCTTCGAT TACAAAGCGA AACTGAAAAA AGATAAAACA CCAGCCGAAT ACAAAACCTA 2340

CTATGATGCC GGCTATCAAA CCGATGTCGA CCACATCAAC ATCACTGATT CCACCATTGA 2400
ATTTCTGGTC AATGGCAAAC


CACAAAAATT 


CACCTATAAA 


GCAGCCGGTT 



ATAAAATTTT 




AAACTATGCA 



AAAGGCAACC 


GTGGCGTCCG TTTCCTTTTT


GAAACAGACG 


ATGCCAATGC 




TGGGCGGTTT 


AAATACGTCC 



AATTTAGCGA 


CCACAACA 1 L. 


GCACCAACGA 


AAGCCGCTCA 


2580


TTTCCACATC 


TTCTTCGGCG 


GCGATAGCCA 



AGAAAGTC 1 G 


TTCAATGAAA 


TGGACAACTG 


2640 


GCCAACGTAT 


TATCCAAGCG 


ACTTAAGCAA 


ACAAGAAAI I 


GCCCAAGAAA 


TGATTGCGCA 


2700 


TTAAGCATTC 


AAAAGAGAGG 


AGGTCGGGAC 


AGAAGTG il 1 


AACTCCGAGA 


AATAAGAAGA 


2760 


AATTTCCGAA 


AATTGTTCTT 


TAATTTTTGG 


AGAATTTCGG 


CTTATTTCCG 


AAGGAGTTGC 


2820 


TTCTGTTCCC 


GCCGTTTATC 


AGTTTTTGAG 


CGTGGAGCAA 


AAATCCAAAG 


TGATTTTTGT 


2880 


CCCACGCTCC 


TCTCTTTTTT 


TATTTAGTCA 



AGCACACCTG 


kTCTGTTtCA 


AACGTCGCTA 


2940 


ACamTGGCGC 


AcTTCATCCG 


TTGATTTJcGT 


ATTCATTAAT 


TGTGCTCTTA 


AATCACTGGC 


3000 


ACCTGGAAAG 


CCTTTGACAT 


AAATTTTAAA 


GAAGCGATGC 


AGCCCAACGA 


TGGAGCGAGG 


3060 


CACCAATTCC 


GCATATTTGT 


CTTGCAAATC 


CAATTGTAAA 


CGTAGCAAGC 


CCAGCAATTC 


3120 


TTGTGGCGTA 


TGTGTTTTGG 


GTTCTTTTTC 


AAAGGCATAA 


GGATTTTTAA 


AAATCCCCCG 


3180 


ACCAATCATG 


ATGCCATCTA 


CACCATATTG 


TTCTGCTAAT 


TCTAGGCCCT 


TTTGACGATC 


3240 


GGGAATATCC 


CCATTAATCG 


TAATTGTCGT 


TTGCGGTGCG 


ACACGGTCCC 


GAATAGCCAT 


3300 


AATTTGCGGA 


ATGACCTCCC 


AATGGGCATC 


CACTTTGCTC 


ATCTCTTTTC 


GTGTTCGCAA 


3360 


ATGAATGGAT 


AGATTCGCAA 


TGTCTTGCTC 


TAATAAATGC 


GTGATCCACG 


CCTCCATTTC 


3420 


CGCCATTTCA 


GTAAACCCAA 


TGCGTGTTTT 


GACACTAACA 


GGTAAGCCAC 


CTGCTTTGGC 


3480 


AGCGTCAATC 


AATTCAGCCG 


CGACTTCTGG 


GCGCAAAATT 


AGCCCGCTGC 


CTTTTCCACG 


3540 


CTCAGCCACG 


TTAGGCACAG 


G AC AG LLLAl 


ATTTATGTCT 


AACCCTTGAA 


AGCCCATCTC 


3600 


CGcTACGCCG 


ATACTCATTT 


CACGGAAAAA 


TTCCGGTTTA 


TCCCCCCAAA 


TATGTGCCAC 


3660 


CATCGGCTGT 


TCATCTTCTG 


TAAAAACGAG 


GCGCCCGCGT 


ACACTATCTT 


TTCCTTCAGG 




ATGACAATAA 


CTATCCGAAT 


TAGTAAATTC 


TGTAAAAAAC 


ACATCTGGTG 


CCCCAGCTTC 


3780 


TTTCACGACA 


TGACGAAAGA 


CCACATCAGT 


GACATCTTCC


ATCGGTGCTA 


AAACAAAAAA 


3840 


GGGCTTTGGC 


AATGTTGCCC 


AAAAATTCTT 


GCTCATCTTC ATATACTCCG 


TTTCTGATTT 


3900 


TTTCyTTAwT 


TTCAACGACA 


AwTGTCGACA 


TAATCTGGGG 


CGAAACAATG 


CTAATT 


3956 


(2) INFORMATION FOR SEQ ID NO: 78: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
AAnCAATTTT TGgATTAgCy GCTGGGTTTT CCAAnGnGTT TTTAgCkGCT AAAAGCGCTT 60 
TTgGTAAcTC TGTAACGAGG AGGCAGTAAA nTGGTGCCCt TCACTTGGAG TGAACTTnyG 120 
CGCTCGTtTG


ACTACTTCTT 


GTAATTGGTC 


TTTATTTACA 


AGGGGTACTT 


CTTTTAATTG 


180 


AGCCATTGCA 


GTTTCCAATG 


CGTTTAGAGC 


CTCTGAGACT 


TCTTCTTTTG 


TCGCCGTTTC 


240 


TTTCGCTACG 


ATTGCTTCAG 


CCGCTTGAAT 


TGCTTCTTGC 


AATTGTTTTT 


CTGTTTCTTT 


300 


TGTAAATTGG 


TATCCTTCTT 


CCGGCTTTCT 


TCCTCGGGCT 


TGAGCGATTT 


GCTCTTTTAA 


360 


GGTTTCCTTA 


TCAACGGCGA 


CGGGTTGTTC 


TTTTAATTGT 


GCAAtTGcTT 


CACGCAATGT 


420 


TTTAGTCGCT 


TGGTCTATTT 


CTTCTTGAGT 


AACTTTTTCT 


TGAGCAAATA 


CTTTTTGAGC 


480 


TGCTGCTAAT 


GACTCTTCGA 


GAACTTTTTT 


AGTTTCTTCG 


GTGAATTCTT 


GCCCATCACT 


540 


TGGTTTCTTC 


GCTAGCCCCT 


CTTGAATTGC 


TTTTTCTAAA 


TCAGCTTTAT 


TCGCTACATC 


600 


AGGCATTTTT 


TCCAACTGAC 


TTGCAGCCTT 


TTCTAGGTTT 


TCCGCGGCAG 


CAGAAACGTC 


660 


TGTCTGTGTG 


GCTTGTTCAT 


TGGCTAGAAC 


AGCTACTGCC 


TCATCTAAGA 


TTGGTTTAAA 


720 


GATTGCCCAA 


GTTTCAGGCG 


TATATTCTTT 


TTCGACACGT 


GCTTGATTTA 


GCGTAATTAG 


780 


CGCTTGTAAT 


AACTCTTTAT 


CTACTAGCTC 


TGTGGGCTTC 


ACTGTTAGAG


AAAAATCTGA 


840 


AAGTGTGGCT 


TGTTGGAATC 


CTTGGCCTAC 


TAATAAGCCT 


TGACTCATAT 


CAAATTGTAA 


900 


TGGCTGCTCA 


ACAGTTGTCG 


CACGTTTTAC 


GCGAGTTGTT 


TCTTTTACTT 


TAACTTTAAA 


960 


GGTAACTAAG 


TCCATTGAAC 


CATTTAACAA 


TTGTTTATTT 


CCTTGATTAC 


TAAGAACTAA 


1020 


ATATAGATTC 


TCCACATCAT 


TTGAATGTTT 


CCGATATTTA 


GAATAATTTT 


CCATTTGTTG 


1080 


TGTATTATTC 


GTGGTTGCAG 


GCTGACCGAC 


AAGTTCAAAT 


TTGCTAGAAT 


CAAAGGACAT 


1140 


TTTTGTGCTT 


AACGCATTAA 


TGTTTTTTAA 


GTCTTGACCT 


TTAAGTATAA 


AGGTTAATGT 


1200 


ATCGCCAGGA 


AGATAAGTAT 


CCTTGCCATT 


TTCATTAACC 


ACAGCAAGGG 


ATAAACCGCC 


1260 


CGCAATTTCT 


TCGACATCAG 


GTATCTCAAT 


CCCACCATCT 


AATTGTCGTA 


AAACATAACT 


1320 


GATATCATAA 


GCATCAATTA 


CCCCATTTTT 


ATTTAAATCA 


CCTTTTTCAA 


CATAGCCATT 


1380 


AAAGTCACTG 


TCCACAGATT 


CTAAGCCTGT 


ATAGTTTCGA 


TAACTCATTG 


CATCATTTTC 


1440 


ATCAATCGTT 


CCATCGTTAG 


TAATGTCTCC 


ATGTAAGATT 


CCTTCTGTTC 


CTGGTTGTTT 


1500 


GTAAAAGAGC 


ATCTCACGAC 


CAGAGCCAAA 


ATTACCCACC 


GaTTTTAATA 


CCTTCATTTc 


1560 


AACAAAACGa 


TACGCTTGGt 


CCTTCGTTTc 


AAkGGTTTTT 


GtCAAAGCAT 


CTTGcTTCCA 


1620 


GTTAATAGGT 


TCTGAAAACT 


CTGTCCAATT 


AGCTCCATCT 


TTCGACGTCC 


GATACTGGAG 


1680 


CTGCAAAATA 


TTTCCGTTGC 


CGGcATTGTC 


TCTTGGCAAG 


TATTCAATTT 


TATCCATTTG 


1740 


ATACTCCGCA 


CCAAGATCAA 


ATTTTAAAGA 


CAAGAAATTT 


CCATCACTTG 


GATTAGCGAT 


1800 


GCCTGTTGAC 


CAGTTCGTAT 


GCCAACCTGT 


ACTCAAATCT 


TTATCTGTTA 


ATTTCTTCAA 


1860 


TTCTGCGCCT 


GGTTGTTCTG 


GTAAATTACT 


TGTTGCTTTG 


ACTTGATTAA 


TTGTTTCTTT 


1920 


ATAGGGATCA 


TCTTGTGTTT 


TTCCTTTAAT 


CGGTTCACTC 


CATTCAGAGA 


CCCCGTTTTT 


1980 


TCCAACAGCT 


CGAACTCGAA 


AAGTATGTTC 


GCTTAGGAAG 


GAAAATCCAT 


CAAATGTTGC 


2040 


TGTATTCGTT 


TGAATATTTC 


CGrAAACAGT 


CCCGTCACGC 


TCTACTTCAT 


AGCTTGTTGC 


2100 
CTCAGTTACT


TGATCCCATT 


GAAGAGTCAA 


AGAAGATGGC 


GTTGTTTTTT 


CTTCATTGAT 


2160 


TGCGACATTT 


GCCGGAATAG 


TTAACTGGTC 


ATCGACCGTT 


GTATTTCCTC 


CGTCAACCGT 


2220 


TCCTTTATTA 


ATGAAACCTT 


TGACAGTTAT 


TTGAACATCT 


TTGGCGGTCA 


CAGATTGTTT 


2280 


CGCTAGTTTT 


ACTGATAACG 


CTGATTGGTT 


CAGTTTTTCT 


CCTGAAGCTT 


CAGAAAGATA 


2340 


CTGATTCACT 


TGGAATTCTT 


TATCAAAATA 


ATATAGATTT 


GTGCCAGCTA 


AAAATTCTTC 


2400 


TTTATTAGCT 


GCTtGTTTCA 


AGGTTACTTC 


TGTCCCCCCC 


ACCATTGCTG 


TTACGCTTTC 


2460 


GGsCGCTtCT 


GAAGCTAGCA 


AwTCAAGCGt 


GGTGCTtCGT 


tCATCAACAA 


AGnCTTtATA 


2520 


ACTTCCTTTt 


GTCGGTtCGA 


tAGTaACGGT 


CAGATCCCtT 


TTcGTTTGAT 


TTAGGGCCTT 


2580 


GACTATTTAT 


TTTAGTCGTA 


GCTGATTGCC 


CAGCTTCATA 


ACTAGTTGAA 


ATTCCGTCAT 


2640 


CTTCATACAT 


ATTAAAAGAA


GTGGTGCCAT 


TTGGATAAAT 


CAAGAAAGAT 


CGTTGATCTC 


2700 


GTTGGATTTC 


CTTTGGATTG 


TTGTTTGGAT 


TCGTCATTGG 


GATAATACTG 


CCATCTTTGA 


2760 


CGAAAACGGG 


CACCTTCCAC 


AGCGGTGTTT 


TTACTCCATT


TAAGACACGG 


CCACCTTGAT 


2820 


ATTTTTCGCC 


TGTAAATAAG 


TCAACCCAGA 


CTTGTTTTTC 


ATCTGGTAAG 


TAGATTCCAT 


2880 


CACGGATTGA 


ATTTCCTGCC 


TCATCTTGAT 


TGCCATTATA 


AATTGGGGCA 


ACGAGTAAAT 


2940 


TTGGTCCCCA 


CATATATTGG 


TATTGAGAAT 


CTTTTGTATA 


TGCTGTTCCT 


TCATTAGGAA 


3000 


ATTCTAAAGC 


CATGGCCCGT 


ACCATTGGTA 


GTCCATCAAC 


TGATTCCTTG 


GCAATACTGT 


3060 


AGTTATAAGG 


CATCATCATT


GATTTTAATT 


TTAGATACGC 


TCGGTTCAAA 


TCTGTGGCTT 


3120 


CTTGATCAAA 


AGCAAATGGC 


GTTTTTGGAT 


TGGAGCCCCA 


GCCGTCCATA 


TTTAGCTGCA 


3180 


CTGGTGTAAA 


AGTTTTCCAT 


TGGAAATCAC 


GTATGTTGAT 


CTCTTTATTT 


TTTCCACCAA 


3240 


AAATTCCATC 


CATATCTGAA 


CCGACATTTG 


GTTGTCCAGA 


CAACGAAGTC 


CCGATATATG 


3300 


TTGGGATATG 


GAAGCGAATA 


TATTCCCATT 


GACCACCCGT 


TTGATCGCCT 


GTCCAAATAC 


3360 


CAGCGTGCCG 


TTGGGTTCCG 


GCCCAACCAT 


CTAATGAAAC 


AATCATCGGA 


CGAACTGCGC 


3420 


CGTCTGTTTC 


TTTTACAAAA 


ACATTCGCGG 


CATCTTCTAC 


ACCATTTAAG 


CCAAAGGAAT 


3480 


ACCCATACCC 


GACCCACGCC 


ACGTCAGTTT 


TGAGAGCTTT 


AACACCAGCG 


ACACTAACTT 


3540 


CTTTCGCAAT 


ATCCCGTTCT 


CCTTTTTTAG 


GATTTTTAGG 


ATCCGCTGGA 


TGGAGATTAG 


3600 


ATTGAGTCCA 


CAGCCCCACT 


TCAACACCAT 


TGGCTTGAGC 


ATACTCTGTA 


AATTCTTTTA 


3660 


AGTTTTGTAC 


ATCGCCGTCC 


AATGAATCTG 


TTTGGCCATA 


GCCAGCACCG 


TAGCCGTCAT 


3720 


TTGGCAAGAA 


CCACCCTAAC 


GGCATATCGT 


TTTTCTTATA 


TCGATCAATA 


ACTGCGCGTG 


3780 


CAGAAAATTG 


ATAATTCTCT 


TTTTCTCCAT 


TTAAAGATTC 


TAGCGTGCCA 


TTTAAGTTTC 


3840 


CTAGATCGCC 


TGGTTGGTAT 


TCTTTGTAAA 


AATTACCATC 


TTCAAACTTA 


ACTGCTCCCG 


3900 


CGGTTCCTTC 


TGCTACTTTC 


ACCCAATAGT 


CCCGATTATA 


CGCGTTTAAA 


TGCGCTTCAT 


3960 


AGAACCCGTA 


TTCTGGCATC 


AAAGCTGGTT 


TCCCAGTTAA 


TTCATAGTAA 


TCTTTTAAAA 


4020 


TACCTGCTGA 


GGAATCATTA 


AAGAAATAAA 


AAGCATCAAA 


ATCTGTTCCt 


TCGTGCGTTG 


4080 
TTGTGGT7TT TTGTGGATCA TGACTTCCAA AA7CATAGTT GCCAGG77TC CATGTATTTC 4140

GAACAACGCC ATACCC7GCA GTTGACCAAT AAAATGGGTT AGGCGA7GCT ACTCCGCCAT 4200 

CTACCCAATT GTTCGTATTG ACAATCTSGA TAGCTGT7CC TTTATGCGTA AATCGACCAT 4260 

TTTGAGTTCC TCCGCCAAAA TAATTTTCTT GAC7ACT7TG TTTTAACGTT TGTGTAGCTT 4320 

TATCATTTTT AAAGGA7AGC GGAgCGGTTT C7TCTAAAAT AACTTGATTC TTTTTCTTGT 4380 

CTAGAACCTT CATTAAAGCG C77TCTTTT7 C3AACATAAT TTTTAAACCa TTATTTtCTA 4440 

GAATAAATTG GT7TCC7GAA TCTGTGACGT TCGTTTGCTC AAAGGCTTGT GTACCATAGT 4500 

CGGCCATTGT TTTAGCCGTA A777TCGCAA CATGCTTTGG ATCATTTGGC GTTGGATATT 4560 

CTTCAAATTT ACCAGT7GGA TC7AGATGAT AGCGAAAAAG ATGGTCG7TT AAAATACTAA 4620 

TTTGTGCTTT TTTTCCAGAA GCATACGTAA TGACAAAGTT TTTCTCCGCT TTTtCAATCG 4680 

ACACAATTGC TCCGAC7GAC G7TGCTGAAT CAGCTACGTC TTCTTT7GCT GTTTGCTCTT 4740 

GCGCATCAAC CACTTGAGAA TCCATCGTAA CAAACGAATG GAACGCCAAT ACAGACGAAC 4800 

TCACTAATAA AAATGTCAAC CAGTGCTTTT TACCGTAAAT TTTTTTACCC TGATACTCTT 4860 

TGTTCATTTT TTCCTCCTTA AAATTGCTCC CCATAGAATA GTTCCGACAA AATTACTCAC 4920 

AAAATCCAAA ACCCAACAAA GTTTTATCCA GATGAAAAGT CATAACGATT TTCTTAGTTT 4980

ACTGAGACTG TTAGAAAGTC ASTTTCAGTA ACCCCTTCAT AAACACATCC tTTCAATAAA 5040 

ATATTAAATG AAAACAATCT A7TTTCATCT TATATAAAAA 7TTTAG7ACA GAATCATAAT 5100 

TTCCTAA7GA ATGCGTTTTC ATTTGTTGAC TACCAAT7TT TCACTTTTTT AATAACATAA 5160 

TTATCGTATG ATGGTTTTTT TGTGTTTGTC AACGGTTAAT TATAGCAGCC ACCATTAACA 5220 

AGTATTAACC CTTGATAAAC C7GTTAA77C ACTTATAATG ATGA7T7TAT TGCGAAAATA 5280 

TCTACATCAG AAAAAGAAAG ACTTTTTCAA AAATCTAATA CCTGTTGCTC AAATCTAGTC 5340 

TATTTAAAAA 7AATCAACAC CCTCCTGCTC AGCAAGAGGG TGTTAA7TAC TCGTTTTTTA 5400 

GTGAT7GATT tGACTTGCTG GTGTGAAAAG CAAACTAA7G ATGATGCCAA CTATCAGTAC 5460

OGCTAGCAAT ATAAAAATTA ACTTTCCAAA CCGGTGATAG GGATTGATCG AAAGCCCAAT 5520 

CCCATAAACT 7TCGGAACCC ACAAAGGAGG CGTTGGATTA TCTT777GAT CTTTATAGGC 5580 

TATCTTTAAA ATCTGCCACG CAAAATACAA CAATGGTAAA ACCATAATTA CCCCAACTAT 5640

ACCTAGCCAT TTTGCTAATA ACTCACTAGT A7GCAAAAGA ATGA7TG7CA GAATTGGTAC 5700

AAACAAAAGA ATAAAGCTAA T7GTGGCTCC CACTTGTGCT TGCTTAAGCA GTCGTCTCTC 5760 

ACGGAGAGTA AGATTTGCAA ACATTCCATC CATCCT7TCA 7GGTTT7CAG ATTAAATCAA 5820 

CGAATCACTA AAAAAC7AAA T7AACTGCGA AAATCAATGA CCATGCGCCC GCGAATTTTT 5880 

CCTTG7TCCA TTTCTTCAAA AATAGCATTA A77TC7TCTG GTTGACGTGT TTGACAAACA 5940

GGAACAACTT TTCCTTCTGC ACCGAATTGG AAAGCTTCTC TTAAA7CTTG ACGGGTTCCA 6000

ACTAACGATC CGACTACG7G AATGCCATCA ACAACAAGAC G7GGAA7AGA TAAATCCATT 6060

GTCTCAACAG GCAATCCAAC GGCGACAACC GTTCCTCCCG CACGGACTGC ATCGACTGCG 6120

CCATTAAATG CTGATTTAGC TACCGCAGTG ACTACTGCTG CATGGACACC GCCAATTTCT 6180

TTTTGAAGTT GTGCAGCAAA ATCAGGATCT TTAACTGGAT TAATTAAAAG ATCAGCACCC 6240

ATTTCTTTCG CAAAAGCTAA CTGATTATCA TTCACATCAA CAGCTACTAC TTTTGCATTA 6300

AAGACATGCT TGGCATATTG TAAAGCTAGA TTACCTAGCC CACCTAATCC TGAAATCATT 6360

AGCCATTGAC CTGGACGAAT CCCAGATTCT TTTATCGCTT TGTAGGTTGT TACCCCTGCA 6420

CAAGTAATAG AGGAAGCTGG CTCTGATGCT AAATTTTCTG GCACTTTCAC TGcATAACCG 6480

TCTGTGACGA TGCATTCTTC GGCCATGGCA CCATCAACTG TATATCCAGC ATTTTTTACT 6540

GAACGACAAA GCGTTTCTCG ACCAGAAGTA CAATATTCGC AATGCCCGCA GCCTTCATAA 6600

AACCATGCAA CAGATGCACG ATCACCAATT TTTAAACTAG TAACGTCTTT TCCAACCTCC 6660

GTCACAATCC CAATTCCTTC GTGACCTAAG GTACAACCGC TAACATCGCC AAAATCTGCA 6720

TTCTTAACGT GTAAATCCGT ATGACAGACA CCACAGACTT CCATTTTCAA CCGCGCTTCA 6780

TTATCTTTCA ATGGACGCGA TTCTTTTTCA ACAATTTTTG TTGTTTTTCC TTTTGCTGCG 6840

ACAATTGCTT TATACATGGT TCTTTCCTCC TTGTTTCTAT TGTGAATCAT TGAACAATAA 6900

TAGTATAATA CTACTTTCCA AAATAAGGrA CACCAATGAC TtGCGTAAAT TAACGCTAAT 6960

TTGaCAATTT ATCCTkGCTA GACAGCTAtc TTCCCTTATG TCATAGATAA TTCATACTGG 7020

CATGAnTTTC TCTGCATAAT TGnCAACTAT TTT 7053

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

ACTTATTTTA CTAACGTCGT GACGCTAAGT TTAGGTGGAC TTGTAGCGAT AGTGATGTAT 60

ATCGATCGAA TTTATACGCC GATAGCCATA TTTAATGTTT TATTTGTTCA ATATAATTTG 120

GnATAAAGTA GCCTATCAGC GCTTGGAAGA TTTTTATAAA AAAGAAGATG ATCCTGACTT 180

AGCGGTTTCC GGCAAAGCGT TACCTGAAAT TCAAACAATT TCATTAAAAG ATGTGTGTTT 240

TAGCGTAGAT AGTCAAACGA TTATGTCCCA ACAAAATCGA CAATTTTCAA TGAATAAAAC 300

CTATGGCTTG ATTGGAAAAA GTGGGACAGG TAAATCTACC CTGATTAAAT TGATTTTAGG 360

ATTATTGAAG CCGACAGAGG GGACAGTTTA TGTGAATCAA TTTCCTCTAA CACAATTTAA 420

TTTGGAAGAT TATTATGAAA AAGTTTTTTA TTTATCTCAA GATGTGCCAA TATTTCAAGG 480

AACGTTAAAA GAAAATATTG TATTTAATCA AGAAATTTCT GATGAACAAG TAATTGAAGC 540

TATGTATCGT TTTCAGTTGG GGGAACTTTA TGAACGGTTG CCGGAAGtTT AAACACTATt 600

GTGAGTGAAA


AAGGAtGAAT 


TTTTCTGGTG 


GTGAAAAACA 


7V HC 7A T T P P T 
AbbbA J. L bb 1 


T/-APAP^PTT 


660 


AGCATTTACA


CAAGCAGAGA 


TCCTTATTTT


AGATGAGGCG 


Ab i lLibLa i 


TPPAPf^AAAA 


720 


GnCCGAAGAA 


AAAGTGTTAC 


AAGAAGTACA 


AAAATTTACb 


pT.fi1\ 7\ rp TV TV /*' rp 

L AL AA 1 AAu 1 


1 unvv 1 A I L 1 1 


780 


GGTTACTCAT 


CGACCTAAAA 


CCCTGCGATT 


TGTAGATGAA 


AJ. IA1 IbAl 1 


T A 7\ TA TP TiP TP 
1 AAA I b Ab 1 b 


840


AACTTGTTAG 


CTCTTTGGCG 


CTTCTTTTGC 


TACACTTTTT 


ATAAGAATTG 


TV TV TV ta TV 

AG AG AAAb b G 


900


GCAACTGTTG 


ATGAATAATA 


ATGAGCAAGA 


ATTAGCGGAA 


TTTTGGGATG 


AAT1 IGbAGA 


960


AGAATATGAA 


GAAATTCAAC 


AAGAATCGCC 


ATTTCCAATT 


GCTCGAGAAT 


TACGTGA1 1 1 


1020


CTTGGTCCAA 


GAAGGCATTT 


TTCCGTGTCA 


GACATTTCTG 


GATATCGCAG 


m tv r** f~* 

GTGGGAbGGG 


1080


ACGATACTTA 


CCTTTTTTTC 


AAGAGCAAGT 


CACGGAGTAT 


ACGCTAGCCG 


T\ rn Tv rp rp rp /-^ rp tv 

ATA i 1 1 L 1 LA 


1140


GAGAATGTTG 


GAGATTGCGG 


AAGCAAAAGC 


ACAAAGCAAC 


GTTGTTTTTC 


T T C AC C AAAb 




CCAAGAAAGA 


CTGATTGAGA 


CTGGGAAAAA 


ATTTCAAGTC 


GTTTTTTCAG 


CGA1 GAAbbb 




TGCTTTGGAT 


ACGCCGGAAA 


AAGTTAATGC 


ATTGTGCCAA TTAAGTGAAG 


TV t\ rp /~i /"■* rp rp HI 

AATGGTGlb I 




TATATTTCGG 


CTAGTGGAAG 


AGCAAGATTC 


GTTGTTTTCT 


CCTTTTGAGC 


AAGAGAGCAA 


1380


CCCGCAGTTA 


AAATGGATGG 


CACAATACAA 


AGCATTTCTA 


AAAAAAGAAC 


AGCGACCTT1 


1440


TTTCACAAAA 


AAATTTTTCT 


TTGAAGCATC 


AGAAGCTATT 


TCAAAAGACT 


TTTTCCGCAb 


1500


CTATTTTGAA 


GAACAATGGT 


CAGTTCCAAT 


TTTAGAACAA 


CGCGTACAAG 


AAATTTTTGG 


1560


TTCTCATGAA 


ATTAAGcAAA 


ATCAGCGTAC 


CATTATTTAT 


GAATTAATTG 


CGATTCLA1G 




TAAAAAAACA 


ACCAGCGATG 


ACTAATTATC 


GCTGGTTGTT 


TTTCGGTTAT 


rp rp T\ tv rp rp /Tp 

TTAATTbl bb 


1680


GAAAGCTTTT 


TCTGGAATCT 


CTTTTTTATC 


AATTTTGACA 


GGAGATTGGA 


m T\ tt* nr* T 1 T* HP 

TGA1 lb I 1 I 1 


1740


TTTACTAATG 


GTTGCTTCTA 


TGTAAGCATT 


TGGTTCCAAA 


GGTTGAGGAT 


rn fTl fT* rri n^^^T 1 ^^*"P 

n I IblLIbi, 




CCGTTCGTAG 


TCCATAACTT 


GCGTTTTTCC 


ATTACTTTTA 


ACAAAAGTTA 


r- t r» r* tv t 7a c t r* 
b 1 bGA I Ab 1 b 


1860


TAAAGAATAT 


TGGCCATCTA 


TGCTTTTTTG 


ATTGTCATCT 


ACTGTTTGTT 


TTPPTTP APP 
i I bb 1 1 bAbb 


1920


AATTTCTAGA 


GGAACTTGTG 


CATAGGCTTT 


TTCACCCTTA 


TATGTATCTG 


PATAATAAPG 


1980 


ATAGCCTGTA 


TAACCACCAA 


CAATGAATAA 


GCCAATAAAT 


AAAATAGCGC 


CGATTTTTTT 


2040 


CATTTTTTCT 


CCTCCTAACA 


ATTTACAAAA 


ATAGCATAGC 


AAAAGAAGAA 


GGAGATGGAA 


2100 


CATCGAAAAA 


ACGAACAGTT 


ATCTTACAAA 


ACTGTTCGTT 


TTTTTTGGTC 


TATTTAGCTT 


2160 


TCATCAGAGA


GTAGAAAGGA 


ATCATTTTCT 


TACGATAACT 


AAAGAAAATA 


ACn 


2213 


(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
ACGTTCAGGA ACTGATACCC TAGATTTGCG CAAGTTGCAT CGTCCGCAAa GATATCaTCA 
ATCACCACCA ACGCCGTGGC


ATTATTTTGA 


TGAAGAATGT 


CATCAAAAGC 


CATGTAATCT 


120 


TCAGGTTTTT 


CTAACGCAAC 


GCTTGGTAAA 


ACAGTTAATC 


CTTGCATCAT 


CATTGCTTTT 


180 


TGATAACCAA 


AGTAACGTTC 


AAAGTACAAA CTTTCATGCG 


TTGTATTGGT 


CACAAATAAA 


240 


ATATTTTTAT 


GGCCGTTTTC 


AATTAAAAAT 


TCCGTCGCTT 


GTTTCCCTAA 


TAATTGaTTA 


300 


TCATTATCCA


CGTAAACTAT 


TTCCGTTTCA 


TGTTGGTAAG 


GTTGACCGAT 


TAAAGTAAAG 


360 


GGAATGTTGT 


TTTCGTCTAA 


GTAATCAATG 


ACGGGGTCCT 


TACTATCTGA 


ATACACAAGA 


420 


ATAAAACCGT 


CCACTTGTTT 


TTGAAGATGC 


ATTCTTTTAA 


CGTTTTCCAA 


TAGTACATCA 


480 


AAACTTTTGG 


CAGTTGCAAT 


AGCAGTAGTC 


ATATCGTAAA 


GACGGGCTTC 


CTCGTTGATG 


540 


GCTTCCATAA 


TTTCTAAATA 


AAATGGATTC 


CCTAACCGTT 


CTTTCGAATC 


TAAAGGAGGT 


600 


AAAATTACGC 


CAATGGCACT 


TGAAATTCGT 


TTACCCAAAt 


TCCGTGCCGT 


CATATTTGGC 


660 


ACATAGCCTA 


AGTCATCCAT 


GACTTTTCGA 


ACTTTCTTTT 


TTGTCGATTC 


AGAAATACTT 


720 


GGATGATCGT 


TAATGACACG 


CGAAACGGTT 


GAAGTTGCAA 


CCCCCGCTTT 


CTTAGCGACA 


780 


TCTTTTACTG 


TAATTGCCAT 


TTTTGCCACG 


CCTTTCTTAG 


TAATTATAGA 


AGGTTAAATT 


840 


GATAAACCGT 


TTGGTAGGTT 


TGCTTTTTAT 


TAGCTGGTAA 


TAAAATAGAG 


CCCCACTGAG 


900 


GATGATGAAC 


AATGTCTGGA 


TACTCTTGTG 


GTTCAATTGC 


TAAGCCATAG 


TTTGAATGCA 


960 


TCGCTTGTCC 


ATTAATAGAA 


AATGGAGCCT 


CAAAACCAGT 


CGTAGAAAAT 


AATACCATAC 


1020


TTTGGTTCGT 


TGTTGCTATC 


GTCATTTTTC 


TTCCAGAAGT 


AGGCTCTTGT 


AAGCATAATT 


1080 


GTGGCTGTCT 


TAAGTCAGAG 


CCAACTAAGG 


AGAAAACATC 


ATCTAACCCA 


GTTGGATAGC 


1140 


GTTGGAGGAT 


TGTTTCGATA 


GGTGTCAGTT 


CATTAAAACG 


TATGGATAGT 


TCAGAAGCTG 


1200 


GTAAAATCCG 


TCCAGTAGGC 


AGCTTATTAT 


CGTCTAATTC 


TAACATACCT 


TGACAGTTCA 


1260 


AAGATAGCAA 


ATGTGACAGA 


ATGTCTAACT 


TTCCATCGCC 


AGATAAATTA 


AAGTAGGCGT 


1320 


GGTTCGTAGG 


ATTAAAAAGT 


GTTTCCTGTT 


TACTCCGTCC 


AGTACTGATC 


aTTCGTAAAT 


1380 


GATTATCCTC 


TAGCTGATAA 


GTGATGGTGG 


CCTCAATTGG 


TCCAGGATAG 


CCAGAAATGG


1440 


TGTCGGTTAA 


ATGAAAAGCA 


ACGCCAATGA 


TATCCTTTTT 


TTCAAAGGGC 


TCGACTGTCC 


1500 


AATATTGAAA 


AGACCACCCC 


TTTGAGCCAC 


CATGGATATG 


ATGACCGTCG 


CTGTTTTGTT 


1560 


CTAATATTCC 


ATTTTGCCAA 


TGGCCATTAC 


GTATGCGTCC 


AGCAACTGGT 


CCAACAGTGG 


1620 


CACCAAAAAA 


GCTTTGATCC 


TTTAATACAT 


CTTGATACGA 


ATCATACGAT 


AAAAGGATAT 


1680 


TTTCCCAATG 


ATTATTTTTG 


TTTGGCATCA 


AAAGTTGATG 


GAGGCGCGCG 


CCATAATTGA 


1740 


GAAGAACGAC 


TTCTAAAAAG 


TCGTTCTTCA 


AAGTAATTTT 


TTTTAAGTTT 


GTGTTTGGAA 


1800 


TATCGGTTAC 


GGTGATACGA 


TTATCCTTCA 


TGATCAAGCC 


ACACTTTTTT 


TAAGTAATCA 


1860 


AACGTTAAAG 


CGCTCGTCTC 


TGGCACGATA 


GGCAACCCTT 


CGCCTAACTC 


TTCTTTGCGG 


1920 


CCAACGCCAA 


CAGGTTGCGC 


TCCACTAGCA 


AGAATCGCTT 


GAATCCCAGC 


TTTGGCATCT 


1980 


TCAATGCCAA 


GACATTCTGC 


TGGTGTCAAA 


TCAATTTCTT 


TCGCAGCTAA 


> AATGAAAATA 


2040 
TCTGGAGCGG


GTTTACCTGC 


CTGGACGTCT 


GCTGGATTAG 


CAATGGCATC 


AAAAAGGGGC 


2100 


GTCAATTGCA 


TTTTTTCTAA 


AAGAAAAGGC 


CCATTTTTAC 


TTGCAGAAGC 


CAAGGCGATT 


2160 


TTAATATTGG 


CTTCACGTAA 


TTCCGTTAAT 


AAAGAAAGAA 


TTCCTGGGTA 


GACATCTTCA 


2220 


GGCGTAATCG 


CTTGGATCAT 


TTCCAAGTAA 


TAATCGTTTT 


TCCGTTGCGC 


TAACTGGGCA 


2280 


AATTCTTCAG 


AAGAAAACGT 


GCCTTCTTTT 


TTTCCGTATT 


TTAACAGTAA 


CTGCAGAGAA 


2340 


TCTTCACGGC 


TTACACCTTT 


CAGTTGTTCG 


TTGAAAACAC 


GGTCAATGGA 


AATTCCAATT 


2400 


TCGTTCCCTA 


ATTTTTTCCA 


AGCGTGGTAA 


TGAAATTCTG 


CTGTATCAGT 


AATAACGCCA 


2460 


TCTAAATCAA 


ACAAAACGCC 


TTTAAACATG 


AACTTCTTCC 


TTTCTGATCG 


GTAAAACTAA 


2520 


TCGATCTTTT 


AACGTGTATT 


TTTGATCGTA 


AACAGTTAAT 


GGAATTGCAG 


GACCGTCGAG 


2580 


TAAAGTGAAG 


GCCACTTGGT 


CTGCTGCAAC 


TTCAACAAAT 


AGTAAACGAT 


TCCGATAATT 


2640 


AATGTGGAAT 


GAGTAGGCTG 


TCCATGTCGC 


TGGTAAAAAG 


GGTGCAAATT 


TTAGTTGTTG 


2700 


ATGGTCCGTT 


TTCATTTGGG 


CAAAACCTTG 


AACAATCGCT 


AGCCAACTAC 


CAGTCATTGA 


2760 


GGTAATGTGT 


AAGCCATCTT 


CTGTATCATT 


ATTGTAATTA 


TCTAAGTCAA 


GGCGCGCTGT 


2820 


TCGTTGATAC 


ATTTCCACTG 


CTTTTTCTTC 


CATGCCTAAT 


TCAGCAGCTA 


ACACAGCATG 


2880 


AATGCTTGGT 


GAAAGAGAAG 


ATTCATGGAC


AGTCATTGGT 


TCGTAAAAGT 


TAAAGTTCCG 


2940 


GCGTTTTTCT 


TCTAAAGAAA 


AGGCATCATT 


GAAGAAATAA 


ATTCCTTGAA 


GAACATCGGC 


3000 


TTGTTTAATG 


AAACAAGACC 


GTAAAATTTT 


ATCCCATGAC 


CAATGTTGAT 


TTAAAGGCAG 


3060 


CTCACTAAGT 


GGTAAATCAG 


AAACAGGCAT 


TAAATCTTTA 


TCTAAGAAAG 


TGTCATGTTG 


3120 


TACAAAAATG 


CCTAATTCGT 


TATCCACTGG 


GAAGTACATA 


TTTTCTACAA 


TGTCTGCCCA 


3180 


TTTAGCTAAT 


TCGTCGTCTG 


CAATTTTGAG 


CGTTGTTTCT 


TCTTGGAATT 


TTAAATAAGA 


3240 


CTCACGGGTG 


TAACGTAGAA 


CCCACGCAGC 


GATGGTATTG 


GTATACCAGT 


TATTATTGAT 


3300 


ATTATTTTCA 


TATTCATTTG 


GCCCCGTTAC 


GCCATGAATC 


ATATATTGTT 


TGTGACGTTG 


3360 


GGAAAAGTGG 


ACCCGATCTG 


CCCAAAAACG 


AGCGATTGCT 


ACAAGTACTT 


CTAGACCAGC 


3420 


ATCTTTTAGA 


TAATCTTCGT 


CTCCTGTATA 


GTTGACATAA 


TTATAAATCG 


CATAAGCAAT 


3480 


CGCACCATTT 


CTGTGAATCT 


CTTCAAAAGT 


GATTTCCCAC 


TCATTATGAC 


ATTCGACACC 


3540 


AGTAAAAGTA 


ACCATTGGGT 


AAAGCGCACC 


CTTTAACCCT 


TGTTGTTGGG 


CATTATGAAT 


3600 


GGCTTGAGGC 


AATTGATTGT 


GGCGATATTT 


CAGTAGATTT 


TTAGTAACTT 


CGGGCTTAGC 


3660 


CAACGCCAAG 


TAAAGTGGGA 


CAGCATATGC 


TTCTGTATCC 


CAATATGTGG 


CACCTCCATA 


3720 


TTTTTCGCCC 


GTAAAGCCTT 


TTGGTCCGAT 


GTTTAAACGA 


TCGTCTTCTC 


CATAGTAGGT 


3780 


TGAGAAAAGT 


TGGAACAGAT 


TAAAACGAAT 


GCCTTGTTGG 


GCTTCGTCAT 


CTCCTTCAAT 


3840 


AACAACATCA 


GCTAATTGCC 


AACGTTTTGC 


CCAAGCGGCA 


GCTTGCCCAG 


CTTTTGCTTC 


3900 


AGGGTACAAG 


GTAGTCATTT 


CTGCAAATAA 


TTCGTTTACT 


CGTGTGATTT 


GTTGCGTTTC 


3960 


TGGCACATCA 


CGACTGGTAA 


CGACAAGCAC 


TTCTTTTGtG 


ATGGCGGTCG 


TTtCGCCtTC 


4020 
GTTTAAAGAC aCTGTCAATT CTGCGGTTAA
AACAGAGGCT CCATCGATGA AATGGCGCAT 
AAAGTTGTTA GGAATGGTCT TAGTCGTAAC 
TCTTTCTTCC CAAAAATGTT CTTCATAATT 
TTTTGAGTGA AGAGTGATTG TGCCTGTACC 
AGCTGCCTCT TTTTTTTCTA AGCTTAAAAA 
CGGTGTTTGG ATGGTGAATT GCCGAGAAAG 
AAAATCTTCG TAAGGTGTAG TGGCTAAATC 
TGCGATGAAA TTAATTGCGT TAATCACTTT 
CCAACCAACA CGTGTTTTGT CTGGATACCA 
GTCGCCAGAA TAGTGTTCTT CAAAATTTCC 
TGTTAATGAT TCTTGTAAAC GTAAATTTTC 
AGGATCAATT TGGAATAAAC GTTTGATTTG 
TGATAGGT 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4808 




AGCCAAGGGC 
GCCTGCAGTC 
AAAGCCAAGT 
GCTGTCTTCA 
TTCCAGCATT 
ACGTTCAAAA 
AACCCCGTTT 
AATTGTTTGT 
ACCAAAATAT 
AACACCTGCT 
CCGCATCCCC 
TTTATCTAAG 
TTTCATGAGA 



TGTTGTGTAT 
ACTGTAAAAC 
GTTTCTTGTG 
TTTTGAACAT 
TCAATTGTTA 
GAGAAGCGCA 
TGCATGTCTA 
CCATCAATTT 
TCAGGATAAC 
AAATAGGTTC 
ATATAGCCGT 
TGTGTAGTAC 
ATCCCCCTCA 



ATGTAGGCAC 
GCTCTATTTC 
TTTCAATAGC 
CGCCATCCAA 
AGTGGATGTA 
CTTTATTCTT 
ATTCTAAAGA 
GTAAGTCCAT 
CATTTTTCCA 
CTTGATGGTG 
TGCCAATGCT 
GGATTTTCCA 
GTGATTTTAT 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

GAGTAGGAAA CAGATATACT TAATATAGAA AATTCAGATG CACTTATTGG GAGGTTATTT 60 

TATGCAACGA AAAGATGTTA CACCTAATTC AGATAAAAAA AGTTTATTGC AAAAATTTGG 120 

GATTGGTTTA GCTGGCGGCT TACTTGGTGG CGCCCTTATT CTAGGAGGCG CTTATTCTGG 180 

GATTATTCCC ACACCCAACG GCGGAAATAA TGCGGCGACG ACGACATCCA CTAATCACGG 240

CGACACAAAA GTCAGCAATG TAAGTTACAA TGTGTCTAGC GATGTCACAA AAGCCGTGAA 300 

AAAAGTTCAA AATTCTGTGG TTTCTGTCAT TAATATGCAA AGTGCTAGCA ACAATTCTTC 360 

GGCAGATGAT CCTTTTGGGG GATTGTTCGG TGGGAATGAA GGTACTCAAG ACTCTTCCGG 420

cAACAATGGT AACGATTTAG AAGCAGCCAG TGAAGGTAGT GGGGTTATTT ATAAGAAAGA 480

TGGTAAAACG GCTTATGTCG TGACAAATAA CCACGTCGTA GATAAAGCGC AAGGATTGGA 540 

AGTTGTTTTA TCTGATGGTA CCAAAGTGAA AGGCGAATTA GTCGGAACCG ATGCTTACAC 600 

GGATTTAGCC GTGATTAAAA TTTCTTCCGA CAAAGTTGAT CAAGTCGCTG AGTTTGGAAA 660 

TTCTAGTAAA ATCACAGTCG GTGAGCCTGC TATTGCAATT GGTTCCCCTC TAGGCTCTGA 720 

TTATGCTAAC TCTGTCACAC AAGGAATCAT CTCTTCTGTG AATAGAAATA TTACCAATAA 780 
AAACGAGTCT GGTGAAcCAT


CAATATTAAT


GCTATTCAAA 


CCGATGCTGC 


CATCAATCCA 


840 


GGAAACTCTG 


GTGGTCCACT 


AATCAATATT 


GAAGGACAAG 


TCATTGGaAT 


TAACTCAGTA 


900 


AAAATTGTGC 


AATCAACTAG 


TCAAGTGAGC 


GTTGAAGGGA 


TGGGCTTTGC 


CATTCCAAGT 


960 


AATGACGTAG 


tCAACATTAT 


CAACCAATTA 


GAAAAAGATG 


GTAAAGTGAC 


GCGGCCAGCT 


1020 


CTCGGGATTA 


CGATGTCTGA 


TTTAACAGGT 


ATCTCTTCAC 


AACAACAAGA 


ACAAATTTTA 


1080 


AAAATTCCAA 


CTTCTGTAAA 


AACTGGCGTA 


GTGGTTCGTG 


GTGTTGAAGC 


AGCGACCCCT 


1140 


GCTGAAAAGG 


CTGGATTGGA 


AAAATACGAT 


GTTATCACGA 


AAGTTGACGG 


CCAAGACGTA 


1200 


AGCTCTACTA 


CAGATTTACA 


AAGCGCGCTT 


TACAAGAAAA 


AAGTTGGCGA 


CAAAATGGAA 


1260 


GTGACTTATT 


ATCGTGGTTC 


TAAAGAAATG 


AAAGCAACTA 


TCGATTTAAC 


CATTGATAAA 


1320 


TCAGCTTTAA 


CACAACAAAA 


TAATCGCAGC 


AATCAATAAA 


AGAAAGAaAC 


GACCTGTTGT 


1380 


TCAAACAACA 


GGTCGTTTCT 


TTCTTTTATT 


CTCCTACCGC 


AAATGCAGAA 


CCAACTTCTG 


1440 


CATCAAAAGG 


CAATTCTAAG 


ATTCCTTTTT 


TAGCTGGTGC 


ATTTGGTAAA 


CGTAATTCTT 


1500 


TTGCTGAACA 


AATCATTCCA 


AAACTTTCTT 


CACCACGTAA 


AACCCCTGGC 


CAAATCATTA 


1560 


AACCATCAGG 


CATCATTGCG 


CCTGGTTTGG 


CCACAACTAC 


TTTTTGTCCT 


GCTTTGATGT 


1620 


TCGGTGCGCC 


ACAAACAATT 


TGTAAGATTT 


CGCCGTTGTC 


TACTTCTGTT 


TGTGTGATAG 


1680 


ATAAATGATC 


TGAATCTGGG 


TGTTTTTTAC 


ATGATTTGAC 


AAAGCCAACC 


ACAATTTTAG 


1740


GGGTGTCGTC 


TTTTTCAAGG 


GTTTCTGAAA 


ATCCCGCATC 


TTTAATCGCT 


GTATTGACTT 


1800 


TTTCGATTTG 


TTCGTCTGTT 


AAAGTCACTT 


GGCCGGTGCC 


ATTAATGTCA 


CCTAGAAGCG 


1860 


CTGAAGCATT 


GAAAATATTC 


CAAGCTACCA 


CTGTTTCTGG 


CGCCTCAACT 


AAACTTACAC 


1920 


GTGCCACATC 


GCCGACACGT 


TTTACTTGGT 


TTTCCGCCCC 


TTGATCGTCC 


GCAACGATGA 


1980 


CAAGTAAGAC 


ATCGCCAACG 


TGTTCTTTGT 


TATATGCAAA 


AATCATTTCT 


GTTCCTCCTA 


2040 


AATTTGAAAG 


CTATAGGCTG 


GaTAGCTAAA 


TTCTCTCCTG 


TCCTATAGTA 


ACAAAAAAAG 


2100 


TGGaAAGTTG 


CACTTACAAT 


TATTGAATTT 


TAGGCAATCC 


CTAATGCTGC 


CAAAAAAATA 


2160 


GATAATAGAA 


TAACCCCAAC 


TAAAAGTGTC 


GTTGTTTTCA 


CTTTTTTCCC 


TAATAACCAG 


2220 


TAAATAACTA 


AGAAGAATAA 


AGCCGGGAAA 


AAACCTaACA 


TGATTTGGTC 


GATATATTCT 


2280 


TGTAATGGCG 


TGGCGATTTT 


TCCAGAGCCA 


ACTTTGGCAG 


TTAACTCAAA 


CACGACATTA 


2340 


GAAGCCGTCA 


TCCCACCAAT 


AACCATTAAC 


CCTAAAATAG 


AGGCAGCTGT 


GGTAACTTTT 


2400 


TCGATTACGC 


CGCTATCTTC 


AACATCACTA 


AAAAATTTAG 


CACCTAATTT 


AAACCCTTTA 


2460 


TCCAAACAGA 


TATACCGTAA 


AGCAAAATGC 


GGAATATTGA 


TAATCAGTAA 


AAAGGCAATC 


2520 


GGACCAAAAA 


TATTTCCTTG 


TTTAGACAAT 


GAAATAGCTA 


CACCTGTTGC 


GATAACTTTT 


2580 


AAGGTCCCCC 


AAAAGAAAGA 


ATCCCCAATC 


CCAGACATTG 


GTCCCATTAA 


ACTAGTTTTC 


2640 


ACTGCATTAA 


TACTAGCTGA 


GTTAAATTCA 


TGATTTTTAG 


CATTTTCTTC 


TTCCATTGCT 


2700 


CCAGAAATTC 


CTAATAATAA 


GGTAGAGATA 


TGCGGGGTGA 


CAGACATAAA 


CTCTAAATGT 


2760 
CGTTCTAAGG


CCGCTTTCTT 


CTCTTCATTA 


TTTTTATACA 


ACCTACGTAT 


AATTGGAGCC 


2820 


ATTGCATAAG 


CATAAGCTAA 


ATTTTGCATC 


CGCTCATAAT 


TCCATGATGC 


ATCTAACGTT 


2880 


AAAGAGCGAA 


AAAATACAGA 


CCAAAAATCT 


TTTTTAGTAA 


TGACAGATTT 


TGTGTTTTTA 


2940 


GAATTCATTG 


TCATCGACAA 


CAGCCTCCCT 


TGTTTGACCA 


TCACTCATAT 


TCATATATAG


3000 


TAGGACAATC 


ACGATGCCAA 


ACAAAGCAAT 


CCCCACTACT 


GAAATTTTTA 


AATAAGCAGC 


3060 


GAATAAAAAG 


CCTAATAATA 


GAAATGGAAt 


CaTCTTTTTT 


GTTACCATCA 


TTCTTGCTAA 


3120 


CATCGCAAAA 


CCAATTGCTG 


GaATaATACC 


TGCCGCAATA 


TCTAATCCaT 


GCGTAATAAA 


3180 


GTCTGGGACA 


TAAGACAATA 


CCTTTTCAAT 


AAACGGCACG 


CCTAAGTAAA 


AGCCTAATGC 


3240 


CACAATTAGA 


GCTAACAAAA 


CTTTATTGCC 


AACTCCTGCA 


ATTAGTTGAA 


TTCGCTCTAC 


3300 


CCCTTGAATA 


TTGCCTCGTT 


TTGCTTCCTT 


ATCAGCTAAA 


TGGCTAGCGC 


CTGTAAGTAA 


3360 


AAATGTCATC 


TGAAGATTAT 


CGATTAAAAG 


TACTAATGTA 


GCGATTGGAA 


CAGCTAAAAC 


3420 


AACCGCACTA 


GAAATTGATT 


GCCCACTCAT 


AATTACATAA 


GCTGTTGCAA 


TAATCGTTCC 


3480 


TGAAACAAAA 


TCTGGTGGAT 


TACTTGCGCC 


AACAGGAAAT 


GCACCCATAA 


ATACCAACTC 


3540 


TAATGTTGCG 


CCAAGCTTAA 


TTCCCGTATC 


TAAGTCTCCT 


AAAGCTAACC 


CTACTAGCGG 


3600 


AGcGACAACA


ATTGGACGGT 


TCCACATCGT 


TGAACCAAAC 


CAGCTATGCA 


TGTAGGCAAA 


3660 


AAAGGCAATT 


AAAAACATTA 


ATAGAACCTT 


GATAAATGTC 


ACTTCCATCT 


ATTTGACCCT 


3720 


CCTATTTATT 


TAATTTAATA 


ACTTGTTTAT 


AGCAACCTTT 


TGATCGTCAG 


GGACTAATTG 


3780 


AACGGTTAAT 


TCGATTCCTT 


CATTAGTTAG 


TTCTTTAATT 


TGTGCAATGT 


CAGTTTCTGA 


3840 


TAAATGAACA 


GCTTTTGAAA 


TTTGTTTGCG 


CGTTTCCCCA 


TTTTTCATTC 


CACCTAAATT 


3900 


AACTTCCTTA 


ATTGTTGGCA 


CTTCTTTGAC 


CAATTGATAC 


ATATCAGCAA 


CTGATTCACA 


3960 


TAAAATAAAT 


AGTTTATATT 


TATCTGTCAC 


CCCAGAATTT 


AACGCCTTAA 


tTGAATCATC 


4020 


AATCGTTTTC 


ATAACTAATT 


TAACACCTGA 


AGGTTTTGCC 


ATTTTCATAG 


CGCTCATTCT 


4080 


TAATGGATCC 


TCAACCAACG 


TATTAGTAGC


AATCAAAATA 


CAATCTGCTT 


CTAAAAATTT 


4140 


TGTCCAAGTA 


AACCCGACCT 


GACCATGAAT 


CAACCGATGG 


TCCACTCTTG 


TCATTTTAAT 


4200 


CATCCTTATT 


CACCTCTCAG 


TTCTTTCACA 


ATTTTCATAA 


ACGAATCAAC 


ATAAGGAGCA 


4260 


TAAATATCTC 


CCGTATCTTT 


TCTTGTATCT 


TTGAAGTTAC 


TCCCTACAAT 


TGCTGCATCG 


4320 


CAAATCGCCA 


ATTGTTCTTT 


CACACTCTTG 


TCATTTAACC 


CCGCTGCGAC 


AATTAATGGA 


4380 


AACTCAGGTA 


ATTGTTTTCT 


AAACAACTTA 


ATCTTTTCTA 


GTGATGTCTC 


TTCACCCGTT 


4440 


GCATTTyCTG 


TCACTGCAAT 


AGCATCACAA 


CGTTGCTGTG 


CAATTTTCAA 


ATCTTCTTCG 


4500 


ACTGATTTTT 


CAGACAACAT 


TGGTTGATAT 


TTGAAGCGCA 


CGCCACCAAT 


TAATTTTGCT 


4560 


GTTGTTTTTG 


CACGATATAA 


ATCAAAAAAA 


GCTTGTAAGG 


ATGCTTCATC 


CCGCGGTTTC 


4620 


ACATGACCAA 


CCACAGAATC 


AATCTGTAAA 


AATTGTAAAT 


GATATTTATT 


GGCTAAATGA 


4680 


AAACCTAATG 


GGTCAACATT 


TAACACATTT 


ACACCAATAG 


GAATCGGTAA 


ATCTAATGAA 


4740 
GTAACATATT


GGaGCGCTTT 


TTCTAGCTGA


ACATAATCCC CATAATAGTT CTCCATCAAA 


4800 


ATkGCATCTA 


CCCCATGTTC 


ACTGTAAATT 


tGAATCTCTT TTTTTGCTCG CTCTTGAATC 


4860 


TCTTGATCTG 


TTTTACCTTT 


TAAATGAATC 


ACCCCAATAA TTGGTTTTTC AACAGCAAAC 


4920 


AGCTCTAAAA 


AATCAGCTTT 


TTTCTTCATT 


AATACCCCTC CTAGAAATCA TCATTTTCTA 


4980 


TCTCTTCGAT 


ATGAACGCGT 


TGAATTCCTT 


GTCGTGCATT TCGAATAATT ACCTGAATCT 


5040 


TTTCATCTGT 


TAATTTTTCT 


GAACCTGCTA 


GCATAAGCTC TAGTGCCAAA TTCAAATTAA 


5100 


CGTCAGAGAA 


AACGATTACA 


TTTTCAAGTA 


CAGTTAGCCC CATCATAGCT GTAACCACAC 


5160 


TTGCACCAAA 


TAAATCACCC 


ATGAGGATAA 


ATTCATCTTC TGGCGCTTGT TCGGCTTTCT 


5220 


TTTTTAACTC 


TCCAAAAAAA 


TCAACTGCAC 


TTTCACCTGG ATACAGACTA TATGTATGAA 


5280 


CGGCAGCTGC 


AGAGTGTCCC 


GCAATCATTT 


CTAACGAGTG TTTAATCCCC TTAGAAAAAT 


5340 


CACCATGTGA 


AGCAATGTAA 


AACCTTCTCA 


TTTCATAAAC CTCCTTTATT TATTATATGA


5400 


ACGAATGGCT 


ACTTTGTAGG 


ATTCGCTAAG 


GATATATTGC TCATTAAAAT ATAAAATTTG 


5460 


ATTATCATTA 


TCCACAACTG 


TTTGATTAAT 


TTTGACCAAC GCTTCTTCTT CACTAATCGC 


5520 


TAATGCTTCT 


GCCAACTTTT 


CATCGGCCAA 


GGCAATTTCA ATGATATTAT CAAAATAGCT 


5580 


AGCAATTACT 


CCTCTATATT 


GCCGTAAAAT 


ATCTAATAAT GAAGTATCTG TTAATTCTAA 


5640 


GTCCAACATA 


AACTGaTCTT 


CTTTTCTAAA 


ATAATCAATC TCGAAAATTG CAGGCATGTT 


5700 


ATCTAACGAA 


CGGAGGCGCT 


CAATAACAAT 


AACTTCTGTC GCTTTTTCAA CACCCAATGC 


5760 


TTCTGCAACA 


TGCCCTGTTG 


CTTTTTTCAA 


ACAAATTGAT ATGATCTTAG TTGTTGGTTT 


5820 


ACGATTCATT 


TTTAAACCTG 


ACtCAGTAAA 


ACTCCCACCA GCTGTCATGC TCTCAACATA 


5880 


GGATGGCATT 


GAAACAAATG 


TCCCTTTTCC 


ATGCCGTTTA ATTAAAATAT TTTCATCAAC 


5940 


TAATTGTTTA 


ATCGCATTAC 


GAACGGTTAC 


TCGGCTCACT CCGTAAATGG TCATCAATTG 


6000 


TTCTTCTGAA 


GGGATTTTTT 


CTCCTACTTG 


ATATTCACCA CTTCTAACTT TATTTAAAAT 


6060 


AGCATCGTAT 


AATTGTTGAT 


AAAgCGGTGC 


TTTTTTATTT GATTCTAATT TATCCATTAA 


6120 


TCATACACCT 


CATTTTAATA 


CGTCTTAATA 


CGTCTTAAAA ATAGCAAATA AAAAAGAGAA 


6180 


TGTCAAGAAA 


GAGTTTCCTT 


TCTTGACATT 


CTCTTTTTGC TTATTTTTCT TCAGAGTGAA 


6240 


CAATTAAGAC 


ATCGCAAGGG 


GCTTGACGAA 


TAATGTAGCT ACTGACGCTT CCCATGACCA 


6300 


CACGTTCGAn 


TGCGTTTAAT 


CCTGATTGTC 


CCACCATAAT TAAATCAACA TTGTATTTTT 


6360 


TAGHTAATTC 


ATGAPTCATG 


ACTTCTTTGG 


GTGAACCAAA CATGGTAACT GTTTCAACAT 


6420 


TATGAAAATC 


AACGCTTTCA 


GCATACTTTT 


TACAATCGGC CATTAATTCT TCCGCAGCGG 


6480 


CTGTTTCTTG 


ATCAATTAAT 


TGATCATTTA 


ACGTCGAATA CCCCATCATT GTATACACTT 


6540 


GATTTTCTAA 


AATATTTGCA 


ACAATGACTC 


GGCTTCCGTT TCGACGAGCA ACCTCAATTG 


6600 


CACGTTCATA 


AGCTAAATTG 


GCTTGATCAC 


TACCGTCTAC GCCGACAAGA ATATTTTTAT 


6660 


ACATTTGTTC 


TTCCATTTTC 


GTCACCTTCC


CGATGGAGGC TGATCTAAAA GGCTTCTTTG


6720 
ACTAA6AAAC


TAGAGTCAAA 


AATGAATTTA GGACCCTTGC


GCAGAACGCC 


CTTTTGTCCA 


6780 


GCTGATTCTC 


TCTTAATTTA 


ATGGTTCCTC 


TGGTTCATGA 


AAATTAGCCT 


AAGCTAGTTA 


6840 


AAAATGCACT 


GATTTCTTCT 


TTTGTTTTTC 


GGTCTTTATT 


TACTAAGCGT 


CCTAACTCTT 


6900 


GACCGTTTTC 


AATTACGACA 


AAACTCGGAA 


TACCAAAAAT 


ATTCCACTCG 


GCAGCCACGT 


6960 


CAATAAACTG 


ATCACGGTCG 


ACTTCGATAA 


ATTGAAACGC 


TGGAAATGCT 


TCTTCAATTT 


7020 


CAGGCATGAC 


TGGTTTAATG 


AAACGACAGT 


CTCCGCACCA 


ATCAGCAGTA 


AAGAAGAAAA 


7080 


CATTTTTACC 


TTTTTCAACA 


TAAGTTGCTA 


GTTCTTCTAA 


TGAATTGGGa 


TAAATCATTT 


7140 


AAACATCCTC 


TACTTTCCGT 


AAACAATTGT 


ATTAACTGTT 


GATTTATCCA 


AGCCACGTAA 


7200 


TGCTTGTAAC 


AACATTTCCC 


GCGCTGCTTC 


ATAATCCGCA 


ATACTAAACA 


TTGTTTGATG 


7260


CGTATGGATG 


TAACGACCAC 


AAACGCCGAT 


GACTGTACTT 


GGTACACCGT 


TATTAGTTGT 


7320 


GTGCGCTGCC 


CCAGCATCCG 


TTCCGCCTTT 


AGAAACAAAA 


TATTGATACG 


GAATATTGTG 


7380 


TGTCGCTGCT 


GTATCTAATA 


AATATTCACG 


TAAGCGAGGC 


AATGTGATCA 


AACCTGGGTC 


7440 


GAAAATCCGT 


AATAACGTTC 


CTTCACCTAA 


ATGACCATAC 


GTCCCTTTTT 


TCGTGTGAAT 


7500 


ATCATCTGCC 


GCTGAACAAT 


CCACCGCAAA 


GAAGATATCT 


GGATTAAATT 


TATGAACAGA 


7560


TGGTTTAGAA 


CCACGTAAAC 


CAACTTCTTC 


TTGTACGTTT 


GCTCCAGCAA 


TTAATGTGTG 


7620 


GCCTAATTCT 


TCATTTTGTA 


ACGTTTCCAA 


TGCATCTAAT 


ACCAACGTAC 


AACCATAGCG 


7680 


GTTATCCCAT 


GATTTACTGA 


TAATATTTTT 


GCCATTGGCT 


GTTTTAATCG 


TTTCTGTTTG 


7740 


TGGAACAATG 


GAATCCCCTG 


GACGAACGCC 


AAAGCTTTCA 


GCTTCTTCTT 


TTGAAGAGAA 


7800 


TCCTGCATCA 


AATAACACAT 


CAGAGACTTC 


TAATTGTTTT 


TGACCACTCG 


TACCACGCAA 


7860 


CAAGTGTGGC 


GGAATAGATG 


AAGAAATACA 


TGGATAGTTC 


CCTTTACTTG 


TTTTTAACGT 


7920 


AAAACGTTGT 


GCTGAAACAA 


CATAGGGATT 


CCAGCCACCT 


AAAGGAACCA 


CTTGGAATAA 


7980 


ACCATTATCG 


TTAATTTGAG 


TTAACATGAA 


GCCAACTTCG 


TCCATGTGAG 


CTGCTACCAT 


8040 


TACGCGTGGC 


GCTGCTTCCA 


CTTTTGAACG 


TTTAATCCCA 


AAAATACCAC 


CTAAACCATC 


8100 


ATATTGGACA 


TCGTCCACTA 


ATGGGGTGAT 


ATGTTCTTTC 


ATGTACGCGC 


GAATGTCGTC 


8160 


TTCAAATCCG 


CTGGTTCCTT 


GTAATTCAGT 


TAATTCCTTG 


ATTCGTTGAA 


ATGTTTTTTC 


8220 


TTCCATAGAT 


GTATGTCCTA 


TTGCTTTTGA 


TTTGTTGGAA 


ATTACGCCAA 


CCAAGCGTCA 


8280 


CTTTTTCTCT 


AAAAAGTCAC 


TTTATCCTGT 


CGATAAAAAT 


AGGTACTCCT 


TTCATATGAT 


8340 


TTCACTCTTA 


TTATAACGCA 


AATACTAGAA 


AATGCACATC 


GCTTTCTCTT 


TTTAAACGTG 


8400 


ACAATGAGAA 


TGCCCTAAAA 


GAATGATTGA 


AAAACTTTTT 


TTATTTCTCA 


GCGGCCCCTT 


8460 


CTCTATGGTA 


AAATAAGAGT 


GATACTGACA


AAAGTAATAG 


TAAGCAGAAA 


CCGACCAAGG 


8520 


GAGGTCATAC 


AAATGAACGA 


AGAAAATGAG 


TTGACATACT 


TTAAAGGCGG 


CTTAGCGCTA 


8580 


GGCGTTGGTC 


TTGGTTTAGT 


AGGCGGCATT 


GCCTCAACCT 


TGTTCTACCA 


TAAGAAAAAA 


8640 


ACAATCTCCG 


CTGATCTCGT 


CTTAGAAAAT 


GTAAAAGCTG 


CTTTTTTAAA 


AGAAGGACCG 


8700 
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ATTGAAGGTT CGTGGATTGA GTTTGAGAAA AAGCCCTTGC GGAAATTCGC CATCCACTCC 8760

AAAACGTATA CTGGAGGAAT TTGCCGCATT GAAGACGACG GCATTGTTCA ATACGAATTC 8820 

ACTGCAGATG CCTACACAGG AACTATCATT GACATTCAAC GATTAAAAGA TTAAAAAGAT 8880 

TGTTACAAAA GCACTTATTT GCTTTTGTAA CAATCTTTTT TTCTTCTCAT TTGTTTTTTG 8940 

TAAAAGGGAT GTTTATTTTA TGTCAAAAAA ATCAAATCAA TGATGAATAA ATACTCAAAA 9000 

ATCGAATGCT TTCTGCATGG AAATCAGCTT AAATCTTTAC TTATCAACGT TTGTCCACTT 9060 

TTTTCATAAA AAAGATTAGA CATTTTTTAA AAATTGATAA AAGAATAATC ATTTCTAACG 9120 

ATTTTTTAAT TTCTCCTTTT CATTTTTACA TTATACTGAG AGTATTGCTA AAAATGAAAA 9180 

GGAGCGTGTC AAATGAAGAA ATCTGTTTTA TTTACTTCAT TACTTGTATT ATCAAGCTTA 9240 

GCTTTAGCnG CcTGCGGCGG TGGCAGTGAC GATAAAGGAG CTAGCAACGG CGGCAGCGAC 9300 

AACCAAGTAT ACACAATGGT TGAATCCCAA GAAATGCCTA GTGCCGATCC GTCCCTTGCG 9360 

ACAGATGAAG TGAGTTTTAC CACTTTAAAT AATGTCTACG AAGGAATCTA TCGTTTAGAT 9420 

AAAGACAACA AACCCGCGCC TGCTGGTGCA GCCGAAAAAG CGACTGTTTC AGAAGACGGT 9480

TTAGTTTACA AAGTTAAATT ACGTGAAGAA TCAAAATGGT CTGATGGCAA ACCAGTTACT 9540 

GCTGCAGATT ACGTTTACGG TTGGCAACGA ACAGTGGATC CTGCCACTGC TTCAGAATAT 9600 

GCCTACATGT TTGAACCAGT AAAAAATGCT GAAAAAATTT CTAAAGGGGA ACTACCTAAA 9660

GAAGAATTGG GCATTAAAGC AaTCAATGAT CATGAATTAG AAATCACTCT AGAAACAGCA 9720 

ACACCATATT TTGACGATTT ATTGGCTTTC CCTTCTTTCT TGCCGCAACG TCAAGATATC 9780 

GTTGAACGTT TTGGTAAaGA TTATACaAAG AGTAGCGATA AAGCAGTCTA CAaTGGTCCC 9840 
TTTACGCTAA CTGAGTTTGA TGGTCCCGGA ACAGATACTA AATGGTCTCT AACTAAAAAT 9900 
GAAGAATATT GGGACAAAGA GACGGTCAAG TTAGATAAAG TCGCTATCAA CGTGGTGAAA 9960 

GAAGCCCCAA CTGCCTTAAA TCTTTATGAA ACTGGTGAAG TGGATGATAC GTATTTATCT 10020 

GGCGAACTGG CTCAACAAAT GCAAAACTCG CCTGACTTGG TCCAATTAAA AGCCGCTTCT 10080 

TCTTTCTATT TAGAAATGAA TCAAGCAGAT GAAAAATCAC CATTGACTAA TGCAAACTTA 10140 

CGTCGTGCTA TGTCTTATGC CATCGATCGC GATTCATTAG CTAAAAATAT TTTAGCTAAT 10200 

GGCTCTCTTC CTTCACAAGG CTTCGTTCCT GTGGATGTCG CGAAATCACC AAAAACGGGT 10260 

GAAGACTTTG TTAAAGAAGC CGGCAGCGAC AAATTAGTCA AATACGACAA GAAAAAAGCT 10320 

GTGGAATACT GGAACAAAGC GAAACAAGAA CTTGGTGTTT CCAACTTAAC GGTTGATTTA 10380 

ATGGTAGACG ACTCTGAAGG CGCTAAAAAA ATGGGCGAAT ATCTTCAAGG ATCACTATCT 10440

GATACTTTGG AAGGCTTAAA AGTAACTGTG ACGCCTGTCC CTATGGCTGT TCGCTTAGAT 10500 

CGTACCTTAA AAGGGGATTT CCAAATCGCT GTTCGTGGTT GGAGTGCCGA CTATTCAGAC 10560 

CCAATTAACT TCTTAGATTT ATTAGAAAGC TCAACTTCTA ATAACCGTGG ACGTTACAGC 10620 

AATCCTGAAT ACGATAAATT CATTGCTGCG TCCAAAACCA CAGATGTTAA CGATCCTGAA 10680 
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AAACGCTGGG 


AAGATCTAAT 


CAACGCTGAA 


AAAACAGTAA 


TTGCTGATAT 


GGGTGTTGTG 


10740


CCAATTTACC 


AAAAAGCAGA 


ATCACACTTA 


CGTGCACCAA 


ACGTGAAAGA 


AATTATTTAT 


10800 


CATCCAACAG 


GTGCTAAATA 


CGACTTCAAG 


TGGGCGTATA 


AAGAATAATC 


ACTCGATTAA 


10860 


ATTTAGAAAA 


AGATTTGGAA 


CAATAGCGAT 


TTTGGCTATT 


GTTCCAAaTC 


TTTTTTATAT 


10920 


AATCCCGATT 


AAAGAACCTA 


ACAACCCAAA 


AATCACAGTA 


AAAATAATTA 


AAGCAATCGG 


10980 


CGTGCGTCCT 


TTTTTCAACA 


TAATAAAACA 


AAACAGCGTA 


AACACTAAAG 


GTAATAAATT 


11040 


GGGCATGATT 


GTATCAAACA 


ATTGTGTTTG 


AAGGTCTAAC 


TTAGTGTCTT 


TCAAGCGAAA 


11100 


AACAAAAGTT 


GTCGTTAATT 


GTACACTCTT 


CGGAATCAAC 


GCGCCCACCA 


CGGAAATTCC 


11160 


TACAATTGTT 


GCTGAACGAG 


ATAATTTTTT 


AGTACTTTCG 


CTTAAAACTG 


AAATTGCTTT 


11220 


GACACCCATT 


TTATAACCTA 


GATTAATTAA 


CCCAATTTTG 


GCAGCCATAT 


ACGGAATATT 


11280 


AAATAACAAC 


AAGAAAACAA 


TTGGTCCTAA 


GATATTGCCT 


TCTAATGATA 


GTTGTGAACC 


11340 


AATCGCTGCT 


GCAATCGGCA 


AAACGGTTAA 


ATAAAATAGC 


GCATCCCCAA 


TTCCGCCTAA 


11400 


CGGGCCCATC 


AACGCTGTTT 


TAATCGCTTG 


AATGGATTTT 


GGTGATTCTT 


TTGACTCTTC 


11460 


CATTGCAATA 


ATCACACCAT 


GCATGAACGT 


CACTAATTGT 


GGATTCGTGT 


TCATAAATTG 


11520 


TAAATGATTT 


TCCAAAGAAA 


GAGCTAAGTC 


TTCTTTATCA 


CGATGAATCT 


TTTTCAATCC 


11580 


TGGAATTAAT 


GAGTAGGTCC 


AGCCGCCTGC 


TTGCATTCTT 


TCATAATTAA 


ATGAAGATTG 


11640 


TAGCAGATAC 


GAACGGAACA 


TTATTTTTTT 


TAACTCTTTT 


TTTGTTATTA 


CTTTTTCGGA 


11700 


TATCATATCA 


TTATATGTTT 


CTTCAGATGC 


CATCTTCTTC 


AAACCCTCCA 


TTATTATTTG 


11760 


ATGCAGTGAC 


TGTATTATTT 


CTTTGTGCGT 


AAAATTCATA 


TAAAGCAATT 


GCGACAGCTA 


11820 


AACCTGCAAC 


CGCTAATGTA 


TCTAACCCTA 


AATAGGCAGC 


CATAACAAAC 


CCAAAAATAA 


11880 


AGAACGGCAC 


GTAGGATGTT 


TCCCACATCG 


TTTTTAATAA 


CATGGCGAAA 


CCTACTGCTG 


11940 


GCATCATCCC 


ACCTGCAACG 


CTCAAGCCAT 


GAATACTCCA 


TTCGGGTAGA 


ATAGAAATGA 


12000 


AACTAGAGGC 


TTTATCAGCA 


CCAAAGAAAA 


TACATAAAAA 


TACTACAAAA 


AAATAACAAA 


12060 


TGAATAAAAC 


GATTGGTAAA 


AAACTAGACA 


TTCTTTCGAT 


GCCCCTTGTA 


TCTGCTTTTT 


12120 


TTGCATAAAC 


ATCAAATTTA 


TGCATAATTG 


GTGACATCGC 


TGTATATAAC 


AAGGTAATTA 


12180 


ACGCTTGCGC 


TGCAACTGCG 


AAAGGAACAG 


CCAAACcTAC 


GGTCACTTGT 


GGGTCAGATT 


12240 


TTGTGATAAT 


TGCTAACGCT 


GTTCCAATAA 


CTCCCCCAAT 


TACTACGTTA 


GGTGGTTGTG 


12300 


CACCGCCGAT 


TGCTACTGCA 


CCAATCCAAA 


CTAACTCTAG 


CGTTGCTCCA 


GTGATTAACC 


12360 


CTGTGTTGAC 


ATCTCCCAAT 


ATTAAACCAA 


CAATCAGCCC 


TGTAACAATC 


GGACGATGAA 


12420 


TATGTGTCTG 


TAAGACAAGC 


TTATCAACCC 


CTGCTATCCC 


AGCCCAAATT 


CCTACCAAAA 


12480 


TTGCCTGTAC 


TAACATTTTT 


TTCCTCCTaT 


TAaCTTCGTC 


TACTTTTTAT 


AAACTGTTTC 


12540 


ACCATTTACT 


AAGGTTTCTA 


GCACTTCATA 


TTTCTCATCA 


ATAATTGTAA 


AATTACCTAA 


12600 


AAGCCCTAGT 


TTGATTTCTC 


CCATAGATTC 


GTTCACATTC 


AACAAACGAG 


CTGGATTTTT 


12660 
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CGTGACTGAA 


TTAATAGCAA 


TCACTTCTGG


AAGTAATGCT 


TTTTCAACAA 


GGTTGCGAAC 


12720


CATATTATTT


AAATGGTTCG 


TACTTCCAGC 


TAACTTGCCA 


TTGGCTAAAC 


GGACCACATT 


12780 


TTGTTCATCA 


ATAACCATTT 


CGATCCCTTT 


TTCTGGTTTT 


GGATAAACGC 


CTGGTTGACA 


12840 


CCCTTTGGCC 


CAGATAGAAT 


CTGTCACAGC 


AATTAAATAG 


TCCTTGCCTT 


TTAGTTTGGC 


12900 


TAACACACGT 


ACTAATGAAT 


AATCAACATG 


TACCCCATCA 


GCAATAATTT 


CAGCAAACGT 


12960 


TTCTTCACTA 


TCCAAAGCAG 


CAACTACTGC 


GGTTGGTTTC 


CGGTGACTGA 


TATCTTCCAT 


13020 


TCCATTAAAC 


GTATGGGTAA 


AACTTTTTGC 


ACCTGCTTCG 


ACAGCTGCCA 


TTGCTTGTTC 


13080 


ATAGGTTGCA 


GCCGTATGGC 


CAATTGAAAT 


GACTACATCA 


TGTTCATGAC 


AATATGTAGT 


13140 


CAAGGCATTC 


TCTACATCAT 


TTTCTGGTGC 


CAACGTCATT 


AATTTTATGT 


GCCCTTTTGC 


13200 


TAACTCTTGC 


CATTTTTTAA 


ATGTCTCAAC 


GGATGGTTTT 


ACTAATAACT 


CTGGATTATG 


13260 


AGAGCCACGA 


AATTCTTCAG 


AAATCATTGG 


TCCTTCCGCA 


TGAATCCCTA 


AGATTTGTGC 


13320 


TCCGTTGGTT 


CCTTGGTCTT 


CCTCAATAAA 


TGAACCAATC 


ACTTCAAATG 


AATGTTCCAA 


13380 


GTCTTTTGGA AATGTTGTTG 


ACGTTGTTGG 


CAAAAAAGCT 


GTAATCCCTT 


CTTCTGGAAG 


13440 


GTAAGCTTGC 


CACTCTTTGA 


TAAATTCATG 


ATTGGCATGA 


TTGGCATCCC 


CACCATGCCA 


13500 


ACCATGATCG


TGAATATCAA 


TAAATCCTGG 


AAGAATGCGC 


TTTTCTCCGT 


ATTCTTTGTC 


13560 


GACAGCTTTT 


TCATTGTACG 


GTAGAATTGC 


AACAATTCGT 


TGTTCTGACA 


CTTCAACTTG 


13620 


AGCTGGTTGA 


AATTGTTCGT 


TAATCCATAC 


GTTTTTACTT 


TGAATAATCA 


TTCTTCCCAC 


13680 


TTCTTCTCTT 


CAAAATTATT 


GTCCAACATT 


ACTTTTTTAT 


GAAAATAATG 


CATCTAAACT 


13740 


TTCTTCTGGT 


GTTGACGGTA 


CACGTTGAAT 


CGAAAGTTTA 


ACACCTAATT 


CTTGTAGCTT 


13800 


TTTAAAGGCG 


TCTACATCTG 


TTTTATTAAC 


GGCAACAGCG 


GTAGCTACTT 


GTtTTTGcCT 


13860 


TCTTCCATAT 


GCATATTACC 


AATGTTTAAT 


TTTTCAATTG 


GCACACCACC 


TTCTACCAAG 


13920 


CGCAATGCAT 


CAACAGGATT 


ATCAACTAAC 


AGAAAAATCT 


TTTGTCTTTC 


TGAGGCTTTA 


13980 


TGAATGACCT 


CAATCGTTTT 


CTTAATGGAA 


AAAAAACGGG 


TTTGTACTTC 


TGCAGGGGCC 


14040 


GCCATGTTTA 


ACAATTGTTG 


GCGCAGTGTA 


TCCTCAGAAA 


TTGCATCATT 


TGCGACTAAA 


14100 


ATTAAGTTGG 


CGCCTAAAGA 


ACCGTTCCAT 


TGTGTTGCGA 


CTTGACCATG 


AATCAAGCGA 


14160 


TTATCAATTC 


TTGTTAATAA 


AATATTCGGC 


ATAGTGCTTC 


CTCCTCTTTT 


CTAATTGGTT 


14220 


AGATTCCGTC 


TGCTTCTTCT 


TGAGTATTTG 


CTTGTGGTTG 


TTCTAGTTTT 


TTATGAACAA 


14280 


TCCCATCTCT 


GGCTGAATCT 


GTGACCATTG 


AAACAAAATC 


TTCTACGTTC 


GTGCTGATTG 


14340 


TGCGTAATGC 


AAGCCCTTCA 


AGAACCATCG 


GTAAATTTGT 


TCCTGCTAAC 


ACTTCGACAT 


14400 


TTGGATAATT 


ACTTGCGACA 


GTCATTGAAA 


TTCTAAACGG 


TGTGCCACCT 


AAAAGGTCAG 


14460 


TTAAAATGAC 


CACTGCTGGA 


CACGTCTCCA 


GCATAGAGTC 


AACGGCTTTA 


CGAATGTTTG


14520 


ACTCTAATAC 


ATCAAGCCCT 


GTATCATTGA 


AAAAAGGGAC


TGCCTGAACA 


TTTGCTTGCT


14580 


CTCCTGCAAT 


CATACTAACT 


GATTGATGCA 


TTCCAGTTGA


AAATACCCCfl


TGCCCCGTTA


14640 
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AAATAAGACC TATCATTTAT TCTTTCTCCC CTAACTATTT AsTrmGCTTA CTATATATTG 
GATAAGCAGA ACTTATTTCT CAAAGAGTGT AACAATTCTT GTGACAATTG AGTATGCTTT 
CCCATGATTT TGGACGATGa TTTCTATTTA AGAACATGTC CCTATACAGA TACAAGGCGT
CCCAACTTAA AGATGCTTTT GAAAAAATGC TCGACTACAA CCGTGAGTCC AAACGAGCCA 
ACAAATGTAC CCATATACAA CACAAGAATA ACCAAGCTGA CTGGTTGATt GGTTAAAAAC 
GGGTGGACCA GTTTCAAGAC CACTTGAAAA ATAAAAACAT TCGCTAAATA ACTGCGATAC 
GCATAAgTGG CTATCCATTT AACCATTGGT AAAAGACGAA TCTTTTTGGC TATTCCTTGc 
GTTCCCAAAT AAAAAACCAA CCCGATAATC AATAAACTAT AGGCACTTTG AAGAGGATCC 
AAATAAGGTA GATCGTTCGG AGTCATCTGT GTTCCATCAA ATAAATTCCC TTGGACTGCC 
TGCCAAAAAA TAAAaAGaAA CAACAGAAAT AGTGCCACTC TCGTCTTTTT CAaGAAAGTa 
AAAATTTTCC CTTGATACTT CGCTGCGATG ACCCCGAAAA TACCATAAAT GAGATAACTA 
AGCCCTGAAC GATCTATTAA GTACCAAAGA GAAGTTTCTA CATATGGAAA AATCATTTTC 
TGGTACCAGA AGCACCAAAG GATATGCGCA CCGATTGTAA AAAAGACAAT TGGTAAAATC 
ACTTTTTTGT TTTTTGTAAT TCTGGTTGCC AACCAAATAA AATACGGCAT AAATAATTGG 
AATTGCAACA TCATTACGGC ATACCATAGA TGTGCCGCGC CATCACCACT GAGAACTTTT
CCTAGAAATC CTAGGGACGT TTCATAGGGC ATTCGTTGTT GTAAACTAGG AAAAGCGTAT 
AGATAGAGAC TGGACCAGAC CACGTAAGGA AGCACCAATT CATAAAATTT TTCCTTTAAA 
AATTCAAGAT ACGATTCAGC TTGACTCTTT AACACCATGT TGTAAATAAT TGCAAAAATA 
AACATGGGTG CCGAATATTT AGACATT 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
TGCGTGAAAA GTATAAAGCT TTTGAACTGC CAnATAAGGT AGGAACTTCC CAGCAAAATA 
ACCTnGTAAT TGACCATAAG CAAACAATGG CGAAACATTT CTTGAAAAAA ATTGTAACTT 
GTAAAATTCT TCATAAGGAT CCCCAAAATC CCATCGATTA AAATCAATCA CACCGACCTG 
CCTAGTTGGC AAGTAAATAA AATTTCCCAC ATGGAAATCT CCATGCTGAT AAACCACAGG 
CCGTTGTTGC AATAACGGTA ATTGTTTTTC TAAATATGAA ATTATTGGCT GCTGGTACGG 
CATACAAAAT TGAGATTCCT TATACTTATT TAGTTGGCTC TGCTTTTTTT CGTATAAATT
TTGCGCAATC TCTCTTTGAC TAATCCTTTC AGGCAACAAA AGCTTATGGA TTTTGTTTAA 
ATAGCGGCCT GCCTCTACGC CTAAGTTCAG TTGTCTTTTG GGAGAAAGAG CGGGTAAAAC 



14700 
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14940 

15000 

15060 
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15240 
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15360 
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ATCGCTTAAG TCTTCACCTT CTAGATACGT TAATAACATG TATCCTTGTT CACTTTCAGG 540

AATAGGTGCA CATAAAAAAG GTTTAGAACT GGGAAATCCT AACGCCGCTA CTTTTTTGAT 600 

AAATTGATAT TCTTGTTGTT TGGCTAATAA CTCTTCTTGT TGAAAAATTC GCAATAAAAA 660 

GGTTTCTTGA TTTTTGGTAA CTTTAAACTT TAAATCAGTT GACCAACCAC GGTGAATAGC 720 

TTCAACTTTT TGCCATTGCT GAAACGCTGG AATCTTCTCA AACGCCCTCA TTACTTTCGC 780 

CTCCATTTCC CTATTGGTTT TAGTATACCA AAGAAAAAAG GCTGGAGACA ATTGGGGATA 840 

AATTGTCTCT AGCCTTCACT GGTTATCATT TTCTAGTTTT CCTTATATAA AtCGTTCCAC 900 

TCGTTACTAA GAAAGCTCCT CCTAAAACAA CCCAGAAAGT ACTAGGTACT TCTGCACCTG 960 

CTTTTGGTAA TACTTTTTTG GGAAGCTGTT CTTTTTTCCC AATCATATTT TTATCAGAAA 1020 

AAGCATGAGT ACTTTCTGTT GATTTGGTTC CCTGATTTGC TGATGAATTC AGCGAATTAT 1080 

CTAAGGATGA CTCACTAGTA GTTTTTCCAG ATTTGTCTGG GTTTTGGTTT TGGTTTTGGT 1140 

TTTGATCTTT TCCTTCGATT GGTCCTTCAC TTGTTTTTGT TTCTTTGTCT TTATCTGGTT 1200 

CGCCACTTGT ATCTAGTTCT TTATCTTTAT CTGGTTCGCC ACTTGCATCT GGGTCTTTAT 1260

CTTTGTCTGG TTCGCCACTT GCATCTGGGT CTTTATCTTT ATCCAGTTCG CCACTTGCAT 1320 

TTGGTTCTTT GTCTTTGTCT GGTTCGCCAC TTGTATCTGG TTCTTTGTCT TtGTCTGGkT 1380 

CTTCTACATG TGGTTGATTT ATCTTGTGGT TATTTTCAAA TAAAGCACCT AATTGTAGCA 1440

ATGTTCGGTC TTCATTTAAT GCACTATTAA ATTGAATCCC CAAAGGCAGT CCTTCTTTTG 1500 

AAACGTAAGT TGGTACACTT AATGCTGGAT GTCCAA 1536 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17250 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

GCCAAATTTG GTGGATCGTG TCCGCTTGGG AnnGCCTTAT GGGTGGACCC TACGTGCCAT 60 

TTGGGTGAAA ACTAGGCCAT ATTTTAAAnG GAATATCCGT GCTTTCCTTG GCCACCGTTC 120 

ATAATCCATT TCCACTGAAC CGCAAAAAGA ATCGTATCCA GAGAGATGGC AAGGCCAnAC 180

AGAGAAaTTT TyCTTkGGaA CAACTTCAAG AAACAATTCT TGAAAAAGCC AATCAACAAG 240

TAGGCAGTCG CCGGaCGACT AAAGAAGTCA TTAAATTTAC TACTAAAAAT AAAATTGGAG 300 

ATAAAATGAC TCTTGGTACG AAGAGCTTGT TAATTTTGCT ACTTTGTATG ATTTTTTATG 360 

TAAGTCTTAC CTTTGTAAAG GGTATTTCAT TAACAACAGC TTTCATTCAT CCAGCGGAAA 420

CAATTCAACG TATATTTGTT CCTGAGAAAG TAGCAGGGAA GGATACAAAT GCTGTGAAAa 480 

CGACTAAAAC TAAACCAGCT TCAACAAAAG AATCTAGGCA GCAGGAGGAA GTTCAGCCTA 540

GCCCAACGAA CGTACCAGAA AATAATAATA GCGAACAAGC TGTGTCAACG TATACTGTGG 600 
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AAGTTGGAGA 


TTCAGTCAGT 


TTGATTGCTG 


AAAATCACGG 


ACTCACTATA 


GAACAACTTC 


660 


AAACTTTGAA 


TCCAGAAATC 


ATTGAGGTAC 


CTATTTATCC 


TGGACAAGTG 


TTGAAATTGA 


720 


AAGAGGTGAC 


TGAATAATGC 


GCGAGGTAAA 


ATTAATTTTA 


GCTGaACAGT 


TTTCaTACTT 


780 


AGGCsTTGCa 


CAACGAATTG 


CTAAATACAG 


TAATAAAGCA 


TTATATCAAa 


GCCACATCTT 


840 


aGgAAATTTA 


TGGCAAATTT 


TAAGTCCTTT 


AATTCAGTTA 


GGTGTCTATT 


ATTTTGCTTT 


900 


TGGGATTGCC 


CTTGGTGGTG 


CTCGAACCGT 


AAAAGGAGGA 


GTATCCTATA 


TTGCTTGGTT 


960 


GATGGTAGGA 


TTAAGTACAT 


GGATTTTTTT 


AAGTACAGTA 


ACAAAACAAG 


CTTCTAGTTC 


1020 


TGTTTATATG 


CAAGTTGGCA 


TGGTCTCTAG 


AATGAAGTTT 


CCAATAAGTA 


TTTTGCCAAT 


1080 


GGTTAAAATA 


TTTTCTGAGT 


TACCAAGTTA 


TTTTGCATTC 


ACTACGTTAG 


CAGCGATTGT 


1140 


GTCAATCTCG 


ACAGGTGAAA 


AGATTTCTAT 


TTATTGGATT 


CAGCTACCTT


ACTATATTGT


1200 


AGCGATGATT 


ATCTTTTTAT 


ATAGTTTTAG 


TCTAATTAAT 


TCAACTATTG 


CTGCATTAAT 


1260 


TAGGGATTAT 


CAAATTTTCT 


TGAATTCTAT 


TATCCAAGTA 


TTAATGTATA 


TGTCAGGGGT 


1320 


CTTTTGGGAT 


TTGAGTACAA 


AGAATTTACC 


GTCTTGGTTA 


TCAAAAATTC 


TTATGTTGAA 


1380 


TCCTTACGCA 


TATATTATTA 


ATGGATTTCG 


AGATACATTT 


TTCTATAAaa 


GATGGTTTTT 


1440 


TGAAGATAAG 


CCCCAGCTTG 


TTGTTTTTTG 


GCTAACAACA 


TTAGTAATAT 


TTATCGTAGG 


1500 


AGCTCATTTA 


CATGTTAAGT 


TCAGAAGTAA 


ATTTGTTGAT 


TATATGTAAC 


ATGTGAAGAA 


1560 


GTAAAACCAC 


CGTATACTGA 


CAAGAATACG 


GTGGTTTTAC 


TCTTTAATTG 


ACAATAGCCT 


1620 


TCTCTTTAAA 


GAGAAGGCTA 


TCTAACGAAT 


CAATATTATT 


ATTTTAAAGT 


TGGATCAGAC 


1680 


TATTTCTAGT 


ATTCAAGTAT 


TTTTCATTTA 


AGCGATGTAT 


TCCCTACCTC 


TAATACATCC 


1740 


TTGTATCTCT 


CTAGACACGC 


CAgTTTTGAA 


TAAAATCATC AAAACAAyTG 


AATGGTTCGT 


1800 


TTAAAAATAT 


TTTTCGTGTT 


CTAATTATTA 


TAACATAAAA 


CTCGTCAACT 


GACCACTAGC 


1860 


TGgACGAGTt 


CATAATTTkT 


CCAGATAAAT 


AGACAGCCGT 


TTTCCGCCGA 


CTGTCTAAGA 


1920 


TTAAGATAGA 


TTTGTAAAGA AGTGgAATAA ATGgCTTCCT ACTATCTCtT 


TATAACAkGA 


1980 


GGrAAAgAAA 


ACkGCaAAaG 


ATAGCTGTGT 


TTTTTGAAAA 


AAAGATCAGC 


TGGTTTCCGC 


2040 


CAGCTGATCA 


TCCGTTGAAA 


AGAGATATAA 


AGAAGTGAAT 


AAAGCTTGCT 


TTACCTCTAT 


2100 


TTTATACCAT 


GTTTTGATTC 


AAATAACTAT 


TCAGTATTAG 


TTATACTTAA 


ATAATAAATA 


2160 


TGTATACGGA 


TGTATTGTTT 


TGAGGTATAG 


TCATAGAGAA 


AAGAGGGAAC 


GTTTATGAAA 


2220 


Q Q fi r P r T r P , T , f" , 'T ,p P 
H/iHl 1 1 1 \j 1 L 


TAAGTTATAG 


CTCTGGTAAA 


GATTGTTTAT 


TGGCGATGGA 


TfGATTGGTA 


2280 


CAAGCAGGAA 


ATCAGCCAGT 


TGCGTTAGTG 


ACAACATTAA 


GTGATGAAAT 


TAATCGTTCT 


2340 


TGGTTTCACG 


GCATACCAAT 


CTCCGTCTTA 


GAAGCAGCAG 


CCGAAGCGTT 


AGATTTACCA 


2400 


CTTGTTATAT 


CACATAACAA 


TGAAACAAAT 


TACACAGAAA 


AAGTGGTTGA 


GGCACTGCAG 


2460 


GAAACCAAAA 


AATTAGGTGC 


AGAAACGGTT 


TGTTTTGGCG 


ATATAGATAT 


TGAACAAAAT 


2520 


GGTGCTTGGG 


ATCGCCAAGT 


TGCGTTAAGT 


GCTGGACTTG 


AACCACAATT 


ACCTCTTTGG 


2580 
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CAGGAAAATC 


GCGAacACTT 


GTGAAAGAAT TCTTGGCAAA 


AGGGTACACA 


GCAATTATCA 


2640 


AAACGGTCAG 


CAAAGAGGCA 


GGCATTCCAA TAAAATTTTT 


AGGAGAACCG 


TTAAATGAAA 


2700 


CATTTATCAC 


GTACCTTAAA 


GAGCATCAAT TAGATATTTG 


CGGAGAAAaC 


GGGGAGTACC 


2760 


ATACGTTGGT 


AATTGACGGC 


CcGCTATTTA AAAAGAGGCy 


CAtTATTATm 


CTcAGGAATT 


2820 


aTGAAgcCTT ATGCTaTyCT TgATtATtGA TGCTTAAaTG 


AAaGTAAaaG 


GACACAAGCT 


2880 


GCcACTTTTt 


CTAGGGATAT 


gCTATAaTTG mCTtAaGAaA 


AGTAGAGAAA


GTGAGTGGTT 


2940 


AATTATGGAA 


GAAACTGTCT 


TTTTTAATTT AGGAAATGCC 


CTTGCTTCAA 


AAAGAGATCA 


3000 


AAAAGAATTA 


ATTAAAGAAG 


CCCAAATTGC GAAAGATACT 


CGAAAGATTC 


CAGGAAAACT 


3060 


GATTATCGTT 


GAAGATGAAG 


AAAATGGTAC CCATATTTTA 


TTTGAATTAG 


CTGATCAAAC 


3120 


AGAACCGTCT 


GCTGAAACAA 


AAGAATTCAA AGTGAAAAAA 


GTGATTGATT 


AATAAAAAAG 


3180 


CAACGAATTT 


TTCTAAATTC 


GTTGCTTTTT TATATTTTTT 


TACACAAATG 


ATACAAAAAG 


3240 


AATATCAAAA 


TCAAATAAAT 


GATTAGTATA CTTGAAGaAA 


CAAATAGGAA 


GCAAAGCAAA 


3300 


CGTTTTGAGG 


TGTTCTTTCA 


TGGAAAATCG AGGAATTTTA 


TTTGATAAAG 


ATGGTACGTG 


3360 


TATTCGTTTT 


GACACTCTTT 


GGCAAGCAGG TTTGCAAGCT 


TGTTTTGAAA 


CACTAAGTAT 


3420 


GTTaGCCCCT 


CATCATTCAG 


CAGAAATAAA AAAgATATTA 


GCTATTCAGG 


AGCAACGTTT 


3480 


TTTGCAAAAA 


CATTTACTTG 


ATGAAGTCCT TTATCAGGAA 


CTTTATCAGG 


AATTGGCGCA 


3540 


ATTTGAGGAA 


TTAGTCGAAC 


AGGGAATCAG CAGTCGATGG 


CTGGAGCAAT 


TTTTTTATGA 


3600 


TTATTTACGA 


AAAAATCTGA 


AAAAGATCGA ACCGATTGGT 


GATTTAAAAC 


AGTTATTTCT 


3660 


TGAGCTGAAA 


CGGAAGAACT 


ATAAAATTGG ATTAGCAACT 


TCAGATACTT 


TGCCAGCGAC 


3720 


TATGTTGATT 


ATGGAATATC 


TTGGTTTAAC AGAAATGTTT 


GATTTTATTG 


CGACAGGAGA 


3780 


TCGTTACTTA 


CCGAAACCAG 


ATGCGGACAT GCTCCAaGCC 


TTTTGTCAGT 


CATGTCAATT 


3840 


GAAGGCGACA 


GAAGTAATTA 


TGGTAGGTGA TTCGCTCGTG 


GATGTTTTTA 


TGGGAACGTG 


3900 


TCATGGCAAA 


GCAGGAATTG 


GTGTTTTAAC CGGCAACTGC 


CAGTCAACTG 


ATTTTGAAAA 


3960 


GTTTGAAGTA 


GCCTATTTCC 


GTGATATCCA TGAAATACCT 


TATCAAGAAT 


TATGGGAAAA 


4020 


TACCAAGAAG 


AAAAAAATCT 


ATTATTTTTG ATATATGTAT 


TGTGAAAAAC 


GTTTCTTTTA 


4080 


TTCGAGTATC 


TAAGCAAGCA 


TACTAGTGTA TTTCAATAAA 


ATACCATGAT 


GTTCATAGGT 


4140 


TTTATAATTT 


CAGCTCACAG 


ATTTTTGTTC AATAAGAAAA 


AAATTACAAC 


TAAAATAGAA 


4200 


AACGGGCGTG 


GGACAAAAAT 


CACTTTGGAT TTTTGTCCCA 


CGCTCAAAAA 


CTGATAAACG 


4260 


GCGGGAACAG 


AAGCAACACT 


TTCGGAAATA AGCCGAAAGT 


GACTTTCTCA 


CACAAGTGTT 


4320 


CGGGATATCT 


TAATCCTTAC 


CTACGTGGTG CTCGGAGTTA 


AACACTTCTG 


TCCTGACTTC 


4380 


GTTGTTAATA 


TTTTACAATG 


ACAAGTTTTT CTTATTTTAA 


TCCTGAACTT 


GATTTAGAAA 


4440 


TTGCACTTCC 


ATTAGAAAAA 


GAAACAGTAA CAGTAGCACC 


TAAATCACCA 


TCTAAGTTTT


4500 


TCATCCAGAT 


TAAAGTATCA 


TTTTTTTCAC CATTGATATT 


TGTAGAAGTA 


ATCCCTGTTG


4560 
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GGTCTCCTAA ATCTTTTCTA GCTTGTTCTT


CAGAATATGA 


ACCATCAGTA 


GCAATGTTAT 


4620 


TTACTTGGTC 


TGCAGTAACT 


TTATCATGT'i' 


TTGCAACCTT 


CAAGCCACTT 


ACAGATTTGC 


4680 


TCGCTGCTTT 


GCCATCAGAG 


AATGAAACAG 


TAATAGATGC 


aAGAAGATCA 


CCACCCTTGA 


4740 


CATTTGTCCA 


AGATAGAGTA 


GTGGTAGAGA 


TGCCTTGGAT 


ATCTGTTGTA 


GAAGAAGAGG 


4800 


CTGGTTCACC 


TAAAATTGCT 


TTTACTTCAT 


CTTCTGTAGA 


ACCGCCTTCG 


CTGCTGTTCA 


4860 


TAATATCACC 


GACATTAATT 


TTATCGTAAG 


TkGCTTTTAA 


ATCAGAGTTA 


TCTTTCGATT 


4920 


TATCTTCTGr 


TTTCTTTTCA 


GAGCTTtCTT 


TTTTTGTATC 


ATTTTTTGAA 


CTATTCGTAG 


4980 


CTTTTGAAGA 


AGAGGCTTCT 


GTTTTCACTT 


CTTTGGTATC 


ATTGCTTGCT 


TTTTTATCCT 


5040 


CTTTGCCAAG 


ACCACTACCA 


GCAATCCCCA 


GTACAATAAC 


TACTACAACC 


CAAAACCAAA 


5100 


TTTTCTTATA 


AAATGGTTTT 


TTTACTTTTC 


CTTCATTTGA 


ATCATTTTTT 


TTCATTTTCA 


5160


CTTTCCCTCC 


CATAGTTTAG 


TTAATGATTT 


GTAGCGCTTA 


CTAAATAGTT 


GAAAACAATC 


5220 


CCCAATTAAA 


TAACTTTGGA 


CTACAAATCA 


CTTAGTTTAA 


ATGAAAATAA 


GTATACCGCC 


5280 


ATGTGTAAAA 


ATAACACACC 


ACAGTATACT 


TATTTTATCA 


TTATTTCATA 


AAAATAACAC 


5340 


CATTATTTTT 


ATGAAATAAT 


TAAAAAAGAT 


AAAAAATCTA 


AAAACTTCGA 


CCGCCGCCAC 


5400 


CGAATGTGCC 


GCCGCCAGTT 


GAGTGGGTGG 


TGCTACCACC 


ACCGCCCATT 


CCGCCACTGC 


5460 


CGCCATTGTT 


TTTAGGAATA 


CGACGCGTAG 


TGATGAAAGA 


GTTGGTTAAC 


TGATCTGTGC 


5520


GGGAGGTTAA 


GTTTAAAGTT 


GTTTTTTCCC 


TAAAGGGATA 


TTGATAACTT 


GAAAATTTTA 


5580 


ATTGATATTT 


AGAAATATTA 


ATGCCTAAGA 


AGACCAAACT 


GAGTATCAGC 


GCAgCAGCAA 


5640 


AAGCAATTAC 


CATTTCCAGC 


GGGGTAATGA 


CTTTATAACG 


AGTGATTTTA 


CCTGTTTCGC 


5700 


TGTCCACACG 


ATAGTGCCCC 


CCAGGAACCC 


CTTTATTAAC 


AAATGCTTGA 


GTTTCCTGAA 


5760 


CAAAGGTTTG 


AGCAGCCGCG 


AAATAATTTC 


CTTGACTCAT 


ATTATCCCAT 


ATTTTATCTA 


5820 


AGGTATCATC 


AATTCGTGCA 


TCTGTCATAT 


AATCAATCAT 


GTTTCCAGAA 


GTAGAGATGT 


5880 


AGATTTTCCG 


TAAGTCCATA 


TCAATGAGAA 


AAAGAATCGC 


ATTTTGGTCC 


TTGCCAACTT 


5940 


TATTTAAAAG 


ATAATGATCT 


GCATATTCTT 


GTTCATCGCC 


ATAGGTATTA 


TTATTTGTGG 


6000 


TTACAATAAA 


GACAGAGGCT 


TTTGTTTTTT 


CTTCTAAAGG 


TTGTATCTCT 


TGCTTTAGTT 


6060 


GGTTGATTTG 


ATCAGGTGTC 


AGTAATTGTG 


CGCCATCATC 


AATTGAATTT 


TCAGCCGCCG 


6120 


AAACGGGTAG 


GGCAAGGCCA 


AAGGTAAGAA 


GTATTAGGAA 


AAAAATAGGT 


AAAAGTCGTT 


6180 


TTTTCATAGG 


AAATAACCTC 


CTAATAAAAA 


GAGTGCTAAG 


AGTCCAGCGA 


AAATGCCTAA 


6240 


CGTTGCTAGC 


CCTAAACGAG 


TGTAACTAAT 


TGGAAGTACA 


CCGCTTACTT 


TACCTGTTTG 


6300 


TCCATTCATG 


GCATAATAAT 


AAACTTTTTT 


TGATTGTTCC 


TGACTGCGAT 


AGGTGACTAA 


6360 


CCAAATTGGC 


AAAAGCATAT 


AGTGATTATT


TTCTTGGTTT 


AGAGAAATAT 


CCGTGCGAAC 


6420 


ATTTGTCAAC 


GTAGTATAAC 


CAGAAGCAGA 


GTCCCGTAAC 


AATGATTCCG 


AATAATCTTG 


6480 


TAATTCTTGT 


TGGACGTGTT 


TTTTGATTGC 


TTCATATTCA 


ATATCGCGTT 


TTTCTGCTTG 


6540 
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GAAACCTGCT 
GCCTTCCACC 
CGATAATTTT 
TACGCGCCAA 
AACGACCCAA 
ATCTCGGGGA 
TTTTTCAATT 
CAAGACAACT 
AATTTGAGCG 
TTCATTCTCA 
AAATTTTCCT 
TCATAACGAG 
TGATCTTTGG 
GTTTCCATTG 
AAAAAGGGTA 
AACCCCTTGG 
AACAATTTCG 
TGGTGTACCA 
CCAAGTGTCT 
GTTTGTTTGA 
CATAAAGCCG 
GGAACCTTGA 
CATGTTGATT 
AATTTCCATG 
TAATTCAACA 
GATTGCTGCT 
ATCTACTTGC 
TTTAATTGAG 
ATAGTTTAAT 
AAAAACTTGT 
AATCGCATCT 
ATAGCCTTCT
GTTGGGATAG 



AGGTATTGGC TTTTGAAAGG CACCGCTTGA TCAATTAAAA ATGGTTGAAC 6600 

ATTTTTTGTT GGACGTTTTT TGACAAGGCA TTTTTAACTA ATTCTTTAAA 6660 

CCTGAACGTC CCACATTGAA ACGTTTTGTT TCGGTGTATT CAATATCACC 6720 

ATGCGAATAC TTGTGCCAGT AGCTTGCAGT TGTCCGTCGA CTTCTGCATC 6780 

TAGGGGAAGT AAACCCCAGT TAATTTATCG ATTTGATTTT TGCTGAAAAA 6840 

ATGAACCATT TTTTCTTGGT CCAAGCGAGA AATTTTTCAA TCGCTTCGTC 6900 

GTAAAGGGTA AGACATTTTC TGGTAAAAAT TCGCCACTTA ATCGACCAGC 6960 

GGATTATGGC AATAATAACA GTATGTGGCA GCCGTTGTTG CATCGGTGAC 7020 

CCACAGCTTG GACAATTAAA AATTTCCATT GTGCCTACAT CTGCAGTTGT 7080 

GTGGTCTCTG TTTGTTGCGT ACGGAgGATT CTTCTTCCGC TGTGAAAGTC 7140 

CGGCAGTCGT TGTTGAGGGA TTCTCTAAAT GCGCCTCATG TTGCTGTTGC 7200 

TCACTTCTTC TTCCGTGTAA ATATTTAAAC AATATTCACA ATGAAATTTT 7260 

GATTAAATAA GAGCGGGCCG CCACAGTTGG GGCATTTATG GGTAAATGTT 7320 

CAACCGTCCT TTCTAATAGA GTGAGTAAAT AACAGGAATC TGATTCAAAG 7380 

GACTTGTGCC TGACCCTTTT CTTTAATTGT TCACTACTTG CTTATAAAGG 7440

AATGGTTTAC CACAATGTGG ACAGAACTTA GGAATACCAT TACTTAAGTC 7500 

CTACATTCAC TACATTTCAT TTCTAATTTT GGTTTCTCGT TTGTTGGTTT 7560 

CAGTTTGAAC AAAACTTCCC AGTGTTTTCA GTCCCACATT GTGGGcAAGT 7620

GTAGCCCCTT GTGCGTTTTG CTGGTTTTGT TTTTCTTGCA TTTGTTGTTG 7680 

GACGctTGCG CAAAGTAGTT GCCATTCGCA TTCATGCCCA TACCAACGCC 7740

CTCATAGCGC CACCTTCATT TTTACCAGCA GCCTCCATAC CACGAGCCAA 7800

ACATAGCCTT CGCGAACACC TGGATCGCTG AGCATAGCAC CTTCATTTCG 7860

AATTTTGTTG AATCATCTGT ATAAGAAATG CTCGCAACTG CTACAGAAAC 7920 

CCGCGATTGG TGCGCCAATC CTCATCAAGC GCAGTTTGCA TATATTTGCT 7980

CTTTTTGAGG GAACATAAGA AATACGTTGC CCATCAGCAG ACATTTGATT 8040 

TGCAATGCTG TAATGAACTC TGCCAAATAT TGTTCATTAA TATCATTAAT 8100 

GTTTGGTTTT TAGGAATCGC ATTAGTATAA AATAGCAAAG GATCTGTAAT 8160 
TAATTTCCGT GTGCACGTAA GAAAAGTTCG GCATTATAAA AATTATCAAA 8220 
GGCGCAGGTG TTCCAAATTT AATGCCTTTA ATTTCTTGCA AATTAATATA 8280 
TGTTTTTGTG GAGTTACGCC ACCAAATTTG AAACGACTGA AGGTCTCTGC 8340

TTTAAAGAAC CGTTAAACAT TGAAGGCGCA GAATCATTTT TGACTGTATA 8400 
TCAGCAGTGT AATCGATGAT TTTGCCACCA TCGACAAGAA GCATCATCAT 8460

ACATGTACAA CAGAACCATC TGTTAAAACA TCGTCCGTTC CTTTACGATT 8520 
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AGAACCACGT 


TTATCATCTT 


TACGTACAAA 


AACACCTTTT 


GTCATGACGG 


TTGTGTCCCC 


8580 


CATATTATCA 


GGTTCAATAA 


CCTCTAGCCA 


TTGATCGGCC 


AAGCCACCGC 


CAACTGAACT 


8640 


TGTTGCTGCT 


TTAATAAGTC 


CCATGTTCAT 


TCCTCCTAAT 


AAAAAACTAA 


TTGTTAAAGC 


8700 


TAGTATACCA 


CACAATGAAT 


GCAGAGAAAT 


CCAACGAAAG 


CCTGATGAAA 


TGGAAAAAAT 


8760 


TGATCAAACT 


TTATTAAGAA 


ATTAAAAAAA 


GTATAGGTGA 


AACACAAGCA 


TTTTAAAAAA 


8820 


AGTTACGTAT 


AATAGAAGAA 


AAGACATATT 


ATATAGTTAG 


GGGATTATTG 


AATGGAACGA 


8880 


AGCAATCGTA 


ATAAAAAATC 


TTCAAAAAAA 


CCACTTATTC 


TTGGTGTTTC 


TGCCTTGGTT 


8940 


CTAATCGCTG 


CTGCCGGTGG 


CGGGTATTAT 


GCTTATAGTC 


AATGGCAAGC 


CAAACAAGAA 


9000 


TTAGCCGAAG 


CGAAGAAAAC 


AGCTACTACA 


TTTTTAAACG 


TATTGTCAAA 


ACAGGAATTT 


9060 


GATAAGTTAC 


CGTCCGTTGT 


TCAAGAAGCT 


AGCTTAAAGA 


AAAATGGCTA 


TGATACTAAA


9120 


TCTGTTGTTG 


AAAAATACCA 


AGCAATTTAT 


TCAGGGATTC 


AAGCAGAAGG 


AGTCAAAGCT 


9180 


AGTGATGTTC 


AAGTCAAAAA 


GGCGAAAGAC 


AATCAATACA 


CATTTACCTA 


TAAATTATCG 


9240 


ATGAGCACGC 


CTTTAGGCGA 


AATGAAAGAT 


TTGTCTTATC 


AATCAAGTAT 


CGCCAAAAAA 


9300 


GGCGATACCT 


ACCAAATCGC 


TTGGAAGCCA 


TCTTTAATTT 


TTCCAGATAT 


GTCAGGAAAT 


9360 


GATAAAATTT 


CGATTCAAGT 


AGATAATGCC 


AAACGTGGAG 


AAATTGTCGA 


TCGTAATGGT 


9420 


AGTGGGCTAG 


CAATTAACAA 


AGTGTTTGAC 


GAAGTGGGCG 


TAGTGCCTGG 


CAAACTCGGT 


9480 


TCTGGCGCAG 


AAAAAACAGC 


CAATATCAAA 


GCTTTTAGTG 


ATAAATTCGG 


CGTTTCTGTT 


9540 


GATGAAATCA 


ATCAAAAGTT 


AAGCCAAGGA 


TGGGTCCAAG 


CAGACTCCTT 


TGTACCAATC 


9600 


ACAGTCGCTT 


CTGAACCAGT 


GACAGAATTA 


CCAACAGGGG 


CTGCGACAAA 


AGATACAGAG 


9660 


TCACGTTATT 


ATCCGCTGGG 


GGAAGCAsTG 


CGCAATTAAT 


CGGGTATACG 


GGCACCATTA 


9720 


CTGCAGAAGA 


TATTGAAAAA 


AATCCAGAGT 


TAAGTAGTAC 


GGGCGTAATT 


GGTAAAACTG 


9780 


GCTTAGAACG 


AGCGTTTGAT 


AAAGAATTAA 


GAGGGCAAGA 


TGGTGGCTCA 


CTAGTTATTT 


9840 


TAGATGACAA 


AGAGAaTGTC 


AAAAAAGCTT 


TACAAACCAA 


AGAGAAAAAA 


GATGGTCAAA 


9900 


CGATTAAATT 


AACGATTGAC 


AGTGGCGTAC 


AACAACAAGC 


TTTTGCTATA 


TTTGACAAAC 


9960 


GCCCtGGTAG 


TGCGGTCATT 


ACCGATCCTC 


AAAAAGGCGA 


TTTATTAGCA 


ACGGTTAGTT 


10020 


CACCTTCGTA 


TGATCCTAAT 


AAAATGGCCA 


ATGGCATTTC 


TCAAAAAGAA 


TATGATGCTT 


10080 


ACAATAACAA 


CAAAGATTTA 


CCATTCACAG 


CACGTTTTGC 


GACAGGCTAT 


GCACCAGGCT 


10140 


CTACCTTTAA 



AACAATTACC 


GGTGCCATTG 


GTTTAGATGC 


AGGGACATTG 


AAGCCAGATG 


10200 


AAGAGCTAGA 


AATTAATGGT 


TTGAAATGGC 


AAAAAGATAA 


ATCTTGGGGC 


GGCTATTTTG 


10260 


CGACACGTGT 


AAAAGAAGCA 


AGTCCAGTCA 


ATCTGCGTAC 


CGCTTTAGTT 


AATTCAGATA 


10320 


ATATTTATTT 


TGCCCAACAA 


ACATTACGGA 


TGGGAGAAGA 


CAAATTTAGA 


GCGGGTCTTA 


10380 


ATAAATTCAT 


TTTCGGTGAG 


GAACTAGATT 


TACCAATTGC 


CATGACGCCT 


GCTCAAATTT 


10440 


CAAATGAAGA 


TAAATTTAAT 


TCTGAAATTC 


TTTTAGCAGA 


TACGGGCTAC 


GGTCAAGGTC 


10500 
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AATTGTTAAT 


TTCGCCAATC 


CAGCAAGCTA 


CCATGTATAG 


TGTTTTCCAA 


AATAATGGAA 


10560 


CATTAGTCTA 


TCCAAAACTA 


GTTTTAGACA 


AAGAAACGAA 


GAAAAAAGAC 


AATGTAATCA 


10620 


GCGCCAACGC 


AGCGAATACA 


ATTGCCACAG 


ATCTTTTAGG 


TAGTGTGGAA 


GATCCTTCAG 


10680 


GTTATGTTTA 


TAATATGTAC 


AATCCGAACT 


TCTCATTGGC 


TGCTAAAACA 


GGAACAGCTG 


10740 


AAATAAAAGA 


TAAACAAGAT 


ACAGATGGCA 


AAGAAAACAG 


TTTCCTTCTA 


ACGTTAGATC 


10800 


GTAGTAACAA 


TAAATTCTTA 


ACAATGATTA 


TGGTTGAAAA 


TTCAGGAGAA 


AATGGTTCTG 


10860 


CAACAGATAT 


CAGTAAACCA 


TTAATTGATT 


ACTTAGAAGC 


AaCCATTAaA 


TAAaAAGAGa 


10920 


AAATGAaCGA AcGAGGTTtC 


TGTAAAAAGA 


AcCTCGTTTT 


CCCTTTTTAG 


AAAAAGGAAA 


10980 


GTGTGCTACA 


ATGAAGCGAC 


TAAATTGTAG 


AAGGAAAGGG 


ATTCGTGTGC 


GAATTTTAGC 


11040 


AATAGATACA 


TCTAACCAAA 


CATTGAGTAT 


TGCTGTGTGT 


GAAAATCAAA AAATTCTGGG


11100 


AAGTTATACA 


GCAACGGTTA 


AAAGAAATCA 


TAGCTTAACA 


TTAATGCCGG 


CAATTGACTA 


11160 


TTTGATGAGC 


CAATTGAACT 


TGGCGCCAAC 


GGCAATTGAT 


CGTTTTGTCG 


TAGCAGAAGG 


11220 


ACCTGGATCA 


TACACTGGCT 


TACGTTTAGG 


CGTGACGACT 


GCCAAAACAT 


TGGCCTATAC 


11280 


TTTAAAGAAA 


GAGCTAGTAG 


GCATTTCCAG 


TCTACAAACG 


TTAGCTGCCA 


ATTGTGTTGG 


11340 


ACAAACTGGT 


TTAATTGTGC 


CTCTTTTTGA 


TGCGCGTCGT 


AAAAACGTCT 


ATGCTGGTGC 


11400 


TTATCGGTTT 


GTCAATGGCG 


TGTGGCAAAA 


TGAGTTACCG 


GACCAACATA 


TTTcCCTGCG 


11460 


GGAGTTATTA 


GAACAACTAA 


AAAATGAACC 


AAATCTTTTT 


TTTGTTGGTG 


AAGATGTTGA


11520 


AAAATTCACT 


GAAGAAATAG 


CTCAAATCAT 


TCCTCATGGA 


GAAATTTGTG 


ATGTTCCACA 


11580 


ATGGCAAATT 


CCTAATGCCG 


CAGTTTTGGC 


TGCGTTAGGT 


AGTGTGGCTG 


AACCTGTTGA 


11640 


GAATGTTCAT 


GGGTTCTTAC 


CTGCGTATTT 


GAAAAAAGTA 


GAAGCGGAAG 


AAAATTGGTT 


11700 


GAAGACACAC 


ACGCCTAATG 


GAGAAAGCTA 


TGTGGAAAAA 


CTTTAATTTT 


GTGACAAAAT 


11760 


ACTTTACAAA


AAGACGGCAA 


CGGTATATAA 


CTAAAACCAT 


TGCAATGAAC 


CAGCAAGAAT 


11820 


TTCAATTGCG 


TGAAATCACC 


TACCAAGATA 


TTAAGGATTT 


ACTAGCAATT 


GAACGTGAAG 


11880 


TTTATGCTGG 


GGAACTTCCT 


TGGACGATGT 


CTGCTTTCAT 


GGCAGAACTT 


GGCTCAAAAG 


11940 


CGCCGCATCT 


CTATTTGTTA 


GCGGCAATTA 


ACGGAAAAAC


AGTTGGCTTT 


ATTGGTTGTC 


12000 


GGATTCAAGG 


AACGGATGCA 


CACATTACGA 


ATGTCGCTGT 


TCACACAGCA 


TACCAAGGCT 


12060 


TAGGTTTAGG 


TCGTTCTTTA 


ATAGAAGAAA 


CGCGAATTTT 


TGCCAAGAAA 


AATCGGTGTG


12120 


AAACCTTATC 


GCTAGAAGTA 


CGAATGAGTA 


ATGTTAAAGC 


CCAACGTTTA 


TATCGAAAAA 


12180 


TTGGCTTTGT 


CTCACAGACG 


GTGAAAAAAG 


GCTATTATGA 


CGAAAATAAT 


GAAGACGCAC 


12240 


TAGAAATGAT 


AATTGATTTA 


AAAGAAGAGT 


GAAGAAAAAA 


TGTTAGTAAC 


AAAAAAAGAA 


12300 


ATATCTGTAG 


CAAATATTGG 


TGAAGAATTA 


TGGCGTTTTA 


GTAATTCTTC 


TTATACAACT 


12360 


GGCTCCCCTT 


GGaCAGAAAC 


ACAATTCGCG 


GAAGATTTTG 


CTCAGGaAAA 


TAGCGAATAT 


12420 


CTTTTTTTAG 


TTGAAAATGG 


CCAGTGGTTA 


GGCTATATTG 


CCTATCATTT 


TATTTTAGAT 


12480 
GAAGCAGAAA


TTAGTCACGT 


TGTTGTGAAT 


GGTCAGAAAC 


AACACCAAGG 


AATTGGTTGT 


12540 


ccAGTTAATG 


aAAGCTTTTA 


AAGAATATGT 


TAAAAGTCgT 


GACATTACAC 


AAATTTTcTT 


12600 


AGAGGTCAGA GAAyCGAATA TACTAGCyCm AAAACtTTAk 


GrGAAAaCAG 


GTTTtCGGAA 


12660 


AGTAGCTGTT 


CGAnAAAATT 


ATTATAAAGA 


ACCGCAAGAA 


AATGCCTTTG 


TAATGTGTGC 


12720 


GAAATTAAGG 


AAGGAAGCCC 


AATGACAATT 


TTTACAAAAG 


AACGTAAATT 


ATTATTAGCT 


12780 


GTGGAAAGTA 


GCTGTGATGA 


AACAAGTGTG 


GCTGTCATTG 


AAGATGGCGA 


CAAAATTCTT 


12840 


TCCAATATCG 


TAGCTTCGcA 


AATTAAGAGC 


CATCAACGTT 


TTGGCGGAGT 


CGTTCCGGAA 


12900 


GTTGCGAGTC 


GTCATCATGT 


TGAACAAGTG 


ACGATCTGTA 


TTGAAGAAGC 


ATTAACAGAA 


12960 


GCAAAAGTCA 


CACCAGAAGA 


ACTCAGTGGC 


GTTGCTGTGA 


CCTATGGGCC 


TGGGTTAGTC 


13020 


GGTGCTTTAT 


TAATAGGACT 


TTCTGCAGCA 


AAAGCATTTG 


CTTGGGCGCA 


CCAATTACCA 


13080 


CTTATTCCAG 


TCAATCATAT 


GGCTGGGCAC 


ATTTATGCTG 


CCCGTTTTGT 


GGCACCGCTC 


13140 


GAATTTCCAT 


TGATGGCCTT 


ACTCGTTAGT 


GGAGGACATA 


CGGAATTGGT 


CTACATGAAA 


13200 


GAAGATGGCT 


CTTTCGAAAT 


TGTTGGTGAA 


ACACGTGATG 


ATGCAGCTGG 


TGAAGCGTAT 


13260 


GATAAAGTCG 


GACGTGTGTT 


AGGGTTGCCT 


TATCCTAGTG 


GCAAAGAGAT 


TGATGCTTTG 


13320 


GCCCATGAAG 


GCACAGATAC 


CTATCAATTT 


CCACGGGCGA 


TGCTAAAAGA 


GGATAACTAT 


13380 


GACTTTAGCT 


TTAGTGGTTT 


GAAAAGTGCG 


TTTATTAACA 


CGGTTCATAA 


TGCAGAGCAA 


13440 


CGTGGTGAAG 


CACTGTCTAC 


AAAGGATTTA 


GCCGCaAGTT 


TyCAGGCGAG 


TGTGGTAGAA 


13500 


GTATTAGTGA 


CTAAAACCAT 


TCGTGCTTGT 


CAAGAATATC 


CTGTTAAGCA 


ATTGCTAATC 


13560 


GCTGGGGGCG 


TTGCTGCGAA 


TCAAGGATTA 


CGAGAAGCAA 


TGAGACATGC 


AATCAGTGaA 


13620 


CAATTACCAG 


CAGTGACATT 


ACTTATTCCG 


CCATTAAAAC 


TTTGTGGGGA 


TAACGCGGCG 


13680 


ATGATTGGCG 


CTGCCGCCTT 


TATCGAAGCT 


GcAAAAGAAT 


CACTTTGCTT 


CGTATAACTT 


13740 


AAATGCAGAA 


CCTGGCGTTA 


GTTTTATGAC 


AATTAGTGAA 


GAAGGCTAGC 


CAGCTAGACA 


13800 


AAAATTTGCT 


ATAATGAAGC 


CAGTTTAACA 


ATGACATTCG 


ATGATAAAGA 


GAGTACTTAT 


13860 


TTCAACGTCA 


GCTTAGCGAT 


CTTGGGAAGG 


TGGGAGCCAA 


GAGTGATCGG 


AATGAGGAAG 


13920 


CGCACTTTTA 


GAATTCTGCT 


GAAAGCAGAC 


GGCAATCACC 


GTTATCGATG 


AAGCAGGTCT 


13980 


ATCCAAGTAG 


TATAGACAAT 


TAGAGTGGTA 


CCGCGGGTAA 


ATTCkGGGnT 


CTGACAAAAC 


14040 


AAATTATTTT 


GTTTTGTCAG 


AGACGTTTTt 


GTTTTTTTAA 


TACTTACAAA 


TTAllTCAGGA 


14100 


GGGTTCTTAT 


GAaCAACAAA 


GAAATCGTAG 


CAAAAGCGTT 


ACATGATGtT 


TTAAATGAAG 


14160 


AATTAACGAT 


GGATCAAATC 


GAACAATTAT 


TGGAAAaTCC 


TAAATCTGTT 


GATCATGGAG 


14220 


ATGTTGCTTT 


TCCAGCCTTT 


TCATTAGCTA 


AAATTTATCG 


TAAAGCACCA 


CAACAAATTG 


14280 


CAGCAGAATT 


AGCAGAAAAA 


ATTGATGGCA 


CAAATTTTGA 


AAAAATTGAA 


GTGGTGGGAC 


14340 


CTTATTTAAA 


CTTCTTCATG 


AATAAAGAAG 


CAGTCAGTCA 


AGCTGTCATT 


GGCGAAGTAG 


14400 


TCAAAGAAAA 


AAATAACTAC 


GGAAACAGTA 


CCTTTGGCAA 


TAATGGCAAT 


GTTCCTATTG 


14460 
ATATGTCTTC


ACCAAACATT 


GCTAAACCAA 


TTTCTATGGG 


GCACTTACGT 


TCTACGGTAA 


14520 


TCGGGAATTC 


AATTGCCTTT 


ATTTTAGAAA 


AAATTGGGTA 


TCAGCCAATT 


CGGATTAACC 


14580 


ACTTAGGTGA 


CTGGGGTACA 


CAGTTTGGTA 


AGTTAATTGT 


AGCTTATAAA 


AAATGGGGTT 


14640 


CTGAAGAAGC 


AGTCCGCCAA 


CAACCAATTA 


ATGAATTATT 


GCGCTTATAT 


GTGCAATTTC 


14700 


ATGAAGAAGC 


AGAAGAAAAA 


CCAGAATTAG 


AAGACGAAGC 


ACGTGCATGG 


TTTAAAAAAT 


14760 


TAGAAGAAGG 


AGACCAAGAA 


GCTAATGAAT 


TATGGAAATG 


GTTCCGTAGC 


GAGTCTCTAA 


14820 


AAGAATTTGA 


TAAAATCTAT 


TCCATGTTAG 


AAGTTGAATT 


TGATTCTTAT 


AATGGCGAAg 


14880 


cyTTTTATAA 


TGATAAAATG 


GATGAAATTG 


TTACTTTATT 


AGAAGAAAAA 


CATTTATTAA 


14940 


CAGAAAATCA 


AGGTGCAGAA 


ATTGTTGATT 


TGACAGAATA 


TAATTTAAAT 


CCAGCACTTA 


15000 


TTCGTAAATC 


AGATGGTGCG 


ACGTTATATA 


TCACACGGGA TTTAgCTgcT 


GCGCTTTACC 


15060 


GTAAACGGAC 


GTATGATTTT 


GCCAAATCGC 


TTTATGTAGT 


CGGCAACGAA 


CAAAGCAATC 


15120 


ACTTCAAACA 


ATTAAAAGCA 


GTCTTGAAAG 


AACTAGGGTT 


TGACTGGTCA 


GATGATATGG 


15180 


AACACATTCC 


GTTTGGTTTA 


ATTACTCAAG 


GCGGTAAAAA 


ATTATCCACA 


CGTAAAGGGA 


15240 


AAATCGTTTT 


ATTAGAAGAA 


GTATTGAATG 


AAGCTGTTAC 


TTTAGCAGGA 


AATCAAATTA 


15300 


ACGAGAAAAA 


CCCAGACTTA 


GCAAATCGTG 


AAGAAGTGGC 


GAAACAAGTA 


GGTGTAGGTG 


15360 


CAGTTATTTT 


CCATGACCTT 


AAAAATGATC 


GTCTAAATAA 


TTTCGATTTT 


GTATTAGATG 


15420 


AAGTTGTTCG 


TTTTGAAGGA 


GAAACTGGGC 


CATATGTTCA 


ATATACACAT 


GCTCGTGCGA 


15480 


TGAGTATTTT 


ACGTAAAGCA 


AACTTTACAC 


CAGACGCTAC 


ACAACGTTAT 


GCACTAAATG 


15540 


ACAAGGATAG 


CTGGGAAGTC 


GTGAAGTTGT 


TACAAAAATT 


CCCTGAAACT 


GTGATGCAAG 


15600 


CTGCTGAAAA 


ATATGAACCA 


TCTGTTATTG 


CGAAACATAG 


CATTCATTTA 


GCACAAGCGT 


15660 


TCAATAAATA 


TTATGCCCAT 


GTTCGGATTT 


TGGATGAAGA 


TGCTCAAAAA 


GAGGCACGCT 


15720 


TAGCGTTAGT 


TTATGCAGTA 


GCTACTGTTC 


TAAAAGAAGA 


TTTACGTTTA 


CTAGGGTTAC 


15780 


ATGCACCAGA 


AGAAATGTAG 


AAGTTTTTGT 


GAAAATTTTA 


CTGCGTTGAA 


CGCTAAACTT 


15840 


ATTAAGCAAA 


TATTAAAGTT 


AAGTTACAGA 


TTGAGAAATT 


CATAGTAATT 


AAATAGCTGG 


15900 


GGTGCTATAC 


TCGAGTCAAG 


TTATAAAAGG 


ATTAGAAAAG 


GTGGCACCCA 


ATTATATGAC 


15960 


AGAAAAATGG 


CGTGAAGACG 


aAGAATATCT 


TTCTTATGTA 


GAAGATTTAT 


TAGcAACCGA 


16020 


AGAAGTCCAA 


AAATTAGCAG 


AATACACACA 


ACATATGCAT


TCAACTCGTT 


TAGAACATTC 


16080 


AATTAGTGTT 


TCCTATCATA 


GCTATAAATT 


AGCAAAGCGT 


TGGAACGGCA 


ATGCTCGTGC 


16140 


AACAGCTCGT 


GCTGGTTTAT 


TGCATGATTT 


GTTTTACTAT 


GATTGGCGCA 


CAACAAAGTT 


16200 


TGACGAAGGC 


TCTCATGCCT 


ATGTTCaTCC 


aCGAATTGCA 


GCCAAAAATG 


CTGAAAAAAT 


16260 


TACAGAGTTG 


TCTGATTTAG 


AACGAGACAT 


TATTATTAAA 


CATATGTGGG 


GCGCAACTAT 


16320 


TGCACCACCA 


AAaTACAAAG 


AAAGCTACAT 


TGTAACCTTT 


GTTGACAAAT 


ATTGTGCTGT 


16380 


CAAAGAAGCG 


gCGTTGCCGA 


TGACGACAGC 


CATGAAAACC 


AAATGGCGTC 


AGTATTTTGG 


16440 
TCAACAGTCA


ATCTAAAAAA 


AGCTTATCCG 


TAAGGATAAG 


CTTTTTTTGT 


TTGACGTTGA 


16500 


CTAAAAAGTC 


TAGCATTCAT 


CTATTTAAAA 


TAAATATTAA 


AAGTAATTTT 


GTTCTGTTAA 


16560 


TTTCTTTTGA 


GCGTATCGAA 


AGACTGCTTC 


TTGATTTTCA 


GTTGTTAACT 


GTTTGTAAAT 


16620 


GTTCATAATA 


TTGGTTGGGA 


CAACAAATGG 


CGTTTCTTCC 


GTTTCAGGAA 


TAGCTAAGAT 


16680 


TTCAAGAGGT 


GATACATCTA 


AAATTTTAGA 


TAAGGCGACA 


ATTTTATCCC 


TCCCCATATT 


16740 


TTCAATCATT 


CCAGTTTCCC 


ATTTCCTGAC 


GGTACTTTTT 


CCAACACCAA 


CAAAATTTCC 


16800 


AACTTGTTCT 


AATGTAAGAT 


TTTTCTCCAT 


CCTTTTTTGC 


TTCAAAAAAT 


TTTCTTTCAA 


16860 


AAAAACCACC 


TCCGTAAAAA 


CATTCTAGCA 


CAAAAAGTGT 


CGCAAAAGAA 


ACATGTTTTT 


16920 


tTCAAAATGA 


AAGTAACGAA 


TATAATAGnA 


AAATCAGCAT 


TTTTTAATGA 


TATTGACGTT 


16980 


TTTTTCACAA


AATGATATAA 


TTTTAAGTGT 


CTTATTAGAC


ACTGAAAraGG 


GGGAGAGTAA 


17040 


AATGGAATCA 


AATTTAATTC 


ACTCAAAATT 


GaAAATGAAA 


GGAAAAAATG 


TTGaGTGGTT 


17100 


GGCAAATGAA 


ATGAcTGCaT 


ACGGGCAACC 


AATATCCGCT 


TGAACGATTT 


ACAAAAAATT 


17160 


AAAAGGAGAA 


GTTGTTTTCA 


AAGCATCGGA 


TATTAAATTG 


ATTTCACAAT 


TGTTGGATTT 


17220 


GAATAATGAA 


GAAATTATGG 


ACTATnTTTT 








17250 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7947 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



AGATTAnCGG 


AAnCTACCTA 


TCTGAACGTA 


CAGATAATAA 


TATTACmATG 


CAAGCAATTC 


60 


GTGATCATTA 


TGAAGTATCC 


AATCGATTTG 


GCGCAAAAGA 


AGTCTTGGCT 


TTTTCTTCTG 


120 


AACGAAAATG 


GGGAGCTATT 


GAATTTCCAG 


AAATTGGGAC 


AGTTTATTTA 


GGGGCTCCTG 


180 


AGCGTTTGGT 


AGACGATAGT 


CGGTTGCCAG 


AAGCTGTTTT 


TACAGCACAA 


GAGAATGGGT 


240 


ATCGCGTCTT 


GATGTTAGCT 


ATTGCGGAGC 


AACAACCGTT 


AAACGAAACC 


AAAATGCCTT 


300 


ATTTAGAGCC 


ATTAGCAATT 


CTAGAAATTG 


ATGATCCAAT 


TCGCCAAAAT 


GCCAAAGAAA 


360 


CACTGGCTTA 


TCTAAAAGAA 


GAAGGAATCG 


ACTTAAAAGT 


AATTTCTGGT 


GATAATCCGG 


420 


TGACTGTTTC 


AAATATTGCT 


CGTCGTGCTG 


GGCTACCCGG 


TTATGAGTCT 


TATATTGATT 


480 


TGTCGACCAA 


AACGACAGAA 


GCAGAAGTGC 


GTGAAGCAGT 


TCAGCAATAC 


ACAGTGTTCG 


540 


GACGTGTATC 


GCCTCAGCAA 


AAACGAACCA 


TTGTGCGTGA 


GTTGAAAGAC 


ACAGAGCATG


600 


TGGTGGCGAT 


GACCGGCGAT 


GGCGTTAACG 


ATGTTTTAGC 


GTTACGAGAA 


GCGGATTGTA 


660 


GTATCGCCAT 


GGCAGAGGGA 


GATGGCGCTA 


CTCGGCAAAT 


TTCTAATCTA 


GTTTTGCTAG 


720 


ATTCTGATTT 


TACAACGTTA 


CCAGATGTCT 


TGTTTGAAGG 


ACGACGTGTG 


GTAAATAATG 


780 
TAACCCGAGT


TTCCAGCGTT 


TTCTTCATTA 


AAACGATTTA 


TTCGTTTATT 


CTCTCAATCA 


840 


TTTGTGCCTT 


AACGGCCATT 


GCGTTCCCGT 


TCATTCCGAT 


TCAAGTGACC 


TTGATTGATT 


900 


TAGCAATTGA 


AGGATATCCA 


GCCTTCTTCT 


TATCTTTCGA 


AGGGGACAAA 


CGAAAAGTTG 


960 


TCGGGAAGTT 


TTTACCAACA 


GCTTTGAAAA 


ATGCGTCTGT 


TAATGCGCTT 


TTAGTTGTAG 


1020 


CGAATATCAT 


TGCTGTTTAT 


CTGATTGGAC 


AAAATCAAGG 


ATTTTCTTCT 


TTAGACACCA 


1080 


CGACGTTAAT 


GTATTATTTA 


TTAGTTGGAA 


TTAGCTGTAT 


GGCTGTTGTT 


CGAGCATGTC 


1140 


TGCCTTTAAA 


TCCACTACGC 


ATTTTCTTGG 


TTTTCAGTAC 


CATTATTGGT 


ATTTATGTAG 


1200 


CAGCTATGTT 


ATTCCACAAC 


ATTTTAGAAA 


TTGGTTTCTT 


AACTTCGCAA 


ACAATGGGTC 


1260 


TCTTCTTTAT 


TATGATGGCC 


ATTAATATTG 


TGGTGCGTGT 


AACAATTGGT 


TTCGTACAAA 


1320 


TGAAACGTGC 


TGGAAAAACA 


ATTAAAGATT 


TATAAAAAGC 


AGAGGTTGAG 


CTAAAAGCAT 


1380 


TCGTCTCCAG 


AAATGCAGTT 


CTTCTTTAAA 


TCTTGAAGGA 


AGTCGGAGCA 


AGCTTGATGT 


1440 


TTAACTGATT 


ATTTTAAGAG 


TTAGTGTCTG 


GGGCAGGAGT 


CAGGAAGACT 


TTTGTCCTAG 


1500 


ATACTTTCTT 


TTTTTGTTGA 


GTTGAAACAA 


GATACTTGAA 


AAACAGCGAT 


TATGTGATAC 


1560 


AATAAATCAT 


GTGTAAATGT 


GTCCATTGGA 


ATTCGCACTG 


AAATGACAGT 


GCGTTTTTTA 


1620 


AACGGTCAAT 


CACATGAGTT 


AGCACAATGC 


TATTCTTAGG 


AGGTGAGGAC 


AATGGCACAA 


1680 


CGCATTCCTC 


AGGAAGTCAT 


TGAGGAAGTT 


CGTCACCGAA 


CGAATATAGT 


AGATATTATT 


1740 


GGGCAGTATG 


TCCAGTTAAA 


AAAATCCGGT 


AAAAATTATA 


TGGGTTTATG 


TCCGTTTCAC 


1800 


GAAGAACGTT 


CCCCTTCATT 


TTCAGTAGCA 


GAAGACAAAC 


AAATTTTTCA 


CTGTTTTGGC 


1860 


TGCGGGAAAG 


GTGGAACTGT 


TTTTAATTTC 


TTGCAAGAAA 


TTGAAGGCAT 


TAGTTTTCCA 


1920 


GAATCCGTGA 


AACGTGTTGC 


AGATTTGGAA 


CATTTATCTG 


TGGACTTTGA 


TTGGTCAGAG 


1980 


CCGCGTGAAG 


TAGCGGATAC 


CCCAGAAAAC 


CAACAAAGAC 


GGAGTTTGTT 


ACAACTGCAT 


2040 


AGTAAAGCCG 


CAGAACTTTA 


CCATCATATT 


TTAGTGAATA 


CTAAAATTGG 


CGAACCCGCA 


2100 


TTAAACTATT 


TATTAGAGCG 


GGGGTTAACA 


CAAGAACTAA 


TTGAGACCTT 


TCAAATTGGA 


2160 


TTTGcGCCGC 


AAAAAcGTGa 


TTTCTTAAGT 


CAAGTTTTTA 


AAAACGAACA 


ACTGGATGAA 


2220 


ACGCTTTTTG 


AGCCTTCTGG 


GCTGTTTGTC 


CAAAGAGATA ACGGTACgTT 


TTTGGaTCGT 


2280 


TTCTATCAAC 


GAATCATGTT 


TCCAATTAAT 


GATCCCCAAG 


GCAATGTCAT 


CGCCTTTTCT 


2340 


GGTCGCTTGT 


TaAAAACAGC 


TGATTTTCCa 


GGGGATGAGA 


tGCCTAAATA 


CTTGAATAGT 


2400 


CCTGAAACAa 


CACTTTTTAA 


TAAGCGAGAA 


ACGCTGTTTA 


ACTTTGATAG 


AGCCAGAAAA 


2460 


GAAATTCGTA 


AAGAAAATAC 


GGTCTTGCTT 


TTTGAAGGAT 


TCATGGATGT 


TATCGCTGCT 


2520 


TGGCAATCAG 


GCGTAAAAAG 


TGGGGTGGCT 


TCAATGGGGA 


CTAGCTTGAC 


CAATGAACAA 


2580 


ATTCGGCGTT 


TGGAACGGGT 


CGCTAAGGAA 


GTAGTTATTT 


GTTACGATGG 


CGATAATGCC 


2640 


GGTGTTCAAG 


CCACAAACCG 


TGCGATTCAA 


TTATTGCAAG 


AAAACAGTCA 


TTTTGACTTG 


2700 


AGTATTGTCA 


GCATCCCTGA 


AAAGCTGGAT 


CCTGATGAAT 


ATGTGCGGAA 


ATATGGCGCA 


2760 
GAAGCTTTTC AAAATTTAGC CAATCATGGC CGAGAAACAG TTTTTAGTTT TAAAATGAAT 2820

TATCATCGGT TAACTAGAAA CATGAATAAC GAAAAAGAAC AATTGGATTA TGTGAACGAA 2880 

TTGTTACGGG AGTTAACCAA TGTTCAATCT CCCTTGGAAC GTGATCGTTA TTTAAACCAA 2940

ATTGCTCAGG AATTTCAACT TTCTGTTCAT AGTTTGGAAG AGCAATTCAA TCAATTGAAG 3000 

CAAGAGCAAC GTTCAGTCCA ACGGCAAGAA AGGCAACAAT TTTATCAAGA TGAAATGATG 3060 

CCACCACCAA TGGAAGAACC TGTTTTTGAA GAGAATCACG TCCAGAACAA GTTACCGTTA 3120 

ACCCAAGTTC AAAAAGCAGA ACGTTCTTTA TTATTTCGTT TAATGAATGA ACAAGGAGTA 3180 

CGGCAAACCA TTCAACAATT GCCAGATTTT TCATTCGCTC ATGATGAATA CCAAGAATTA 3240 

TATTTCTTGC TAGAAAGTTA TGCGACACTG CATCAAAGTT TCGACATTGC CGATTTTATT 3300 

AATTTCTTGC AAGATAATCA AACCaAACAA TTAGCAATTG AAATTGCGTA TCAGAATTTA 3360 

TCGGAAGAAA GCTCTGAGCG AGAAGTTGCC GATTTATTAC ATGTAATTGC CTTATcAAGT 3420 

ATTGCAGAGG CAATTGAGCA GAAAAAGATT CAACAGCAAG AAGCAAAACG AGTGGGCAAC 3480

CAACAGCTTG AAGCCGAATT AACAATGGAG ATTATCCAAT TGGCTCGACA GCTTAAAGCT 3540 

CAACGAACAT TTACTTAAGC AATATAATGA ATAGAACTAA GCGTTTTGTT CATTTTATTT 3600 

TGATAAGTCC CAGTGATTTA TCAGTTATTT CAAGCGTGAA CCGAAAAATG AAGGGGGCCT 3660 

TCTTTCATGG AAAAAGAAAC AAGTAAAAAA TATGAAGCTG CAGTGGCAGC ATTTATCAAA 3720 

GAAAACAAGC CTAAAGGAAC AGTGGTTTAT GATGATTTAG CGAATCAATT AGCCACACCG 3780 

TATACTTTGG ATGCAGAAGC TATGGAAAAA TTGATCCaAA AAGTTGAAGA TGCTGGTATC 3840 

AGCGTGGTCG ATGAAAATGG AGACCCAAGT GAACATAGCT TGAAAAAAGA TGAAAAAGAA 3900 

GCGGAAAAGG CCCAAGCAGA AGACTTATCT GCGCCAACAG GTGTTAAAAT TAATGATCCC 3960 

GTGCGTATGT ACCTAAAAGA AATTGGGCGC GTTCAATTAT TAACTGCCGC AGAAGAAGTT 4020

GAATTGGCAC TTAAAATTGA AGAAGGCGAT CAAGAAGCAA AACAACGTTT AGCAGAAGCT 4080 

AACTTACGTT TGGTTGTTTC AATTGCCAAA CGATATGTAG GTCGTGGTAT GCAaTTCTTG 4140

GATTTAATTC AAGAAGGTAA TATGGGCTTA ATGAAAGCTG TTGAAAAATT TGACTACCGT 4200 

AAAGGATTCA AATTCTCTAC GTATGCTACT TGGTGGATTC GTCAAGCGAT TACGCGGGCA 4260 

ATTGCGGACC AAGCTAGAAC GATTCGAATT CCTGTTCACA TGGTTGAAAC AATCAATAAA 4320

TTGATTCGGA TTCAACGCCA ACTATTGCAA GATTTAGGTA GAGAACCAAC GCCAGAAGAA 4380

ATTGGTGCGG AAATGGATTT ACCAACAGAA AAAGTTCGTG AAATCCTAAA AATCGCACAA 4440

GAGCCAGTCT CTTTAGAAAC ACCAATTGGT GAAGAAGATG ATTCACATTT AGGTGATTTC 4500

ATTGAAGACC AAGATGCTAC CAGTCCTGCT GAACATGCAG CTTACGAATT GTTAAAAGAA 4560

CAACTAGAAG ATGTTCTGGA TACTTTAACA GACCGTGAAG AAAATGTTTT ACGTTTACGT 4620

TTTGGTTTAG ATGATGGTCG GACACGTACG CTAGAAGAAG TCGGAAAAGT TTTCGGTGTC 4680

ACTCGCGAAC GTATTCGTCA AATTGAAGCA AAAGCGTTAA GAAAACTAAG ACATCCTTCT 4740
CGTTCAAAAC


AATTAAAAGA 


CTTTTTAGAA TAAACATACC


GTTTAGgGGC 


TCTTTTCGTG 


4800 


CATGCGAGAA 


GAGCCGCTTT 


TGATTTTTAA 


AAAGTCTCTA 


AATGTCATTG 


CTAGTTATTG 


4860 


GAAGAAGGCG 


CAAGTTGTGG 


TAAATTAGAA 


GTAATGGAAT 


ATTAGCAGAA 


TGTGAGGCGG 


4920 


AAGAATGACA AAAAAATGTC 


CGAAGTGCGG 


GAACGAATTC 


GATGCAGAAT 


TAACGACCTG 


4980 


CCCAACATGT 


GGCTATTCAC 


TAACTGACAC 


AACAGTTGAT 


AAAGAAGAAG 


CGGAAACGAC 


5040 


TTCAACAAAT 


ATAGATTTTG 


AAACACAGGA 


AAATGAAGAA 


CACGAAGATC 


AATTAAATGA 


5100 


GAATATTGAG 


TGGTCTGAGT 


TAAAAGATAT 


GAGCTTAGGC 


CATGTGATGG 


AACTGTTTGG 


5160 


CGAATCACCA 


GAAGAAGAAA 


GTAATGATGA 


CAAAAAAGAA 


GAGTCGACAG 


AAGATAACTT 


5220 


AATAGTCTCT 


GATTCAGAGG 


ATGTTTCTGG 


TCTAGAAGCA 


TCGCTTCAAG 


AAGGGGGTAG 


5280 


TGAGGAGACT 


CATGATTCGG 


TGGAGGAGTC 


GATACCAGCG 


ACAGAAGAGA 


CGCCTACTCA 


5340 


TTCAACAGAA 


GAAAAACTAG 


CAACGGACGA 


AGCAGTTAAT 


TTAGAGGAAA 


CAACAGAAGA 


5400 


AACAACGGAA 


GAAACAACTA 


CTGTTGAAGC 


AGAAACAGCT 


GAGGTATCTG 


AGACGGTGAA 


5460 


GTCAGAAGAG 


GAAGCTTTAA 


CAGAGATTTC 


AGGAACTGAA 


GTTATCTCTA 


CCACTTCTGA 


5520 


AGAAGAAATT 


TTTTCACAAC 


CACCGATAGA 


AGATCAAGAT 


GTAACACCAA 


ACGAAACTTT 


5580 


GCAAGCTTAT 


ATTCAAGCGC 


ATCGAGCAGA 


TACAGAAATG 


TCTGAAAATC 


CATCAGAAGA 


5640 


AACAGCAGAG 


ACCCAAGAGT 


TAGAAAACAG 


CGGTGAGGCA 


GTATTAACCC 


AAGCCGAAAC 


5700


ACCAACTGAA 


TCCATTTCAG 


ATTCTGAAGA 


AGGGTTAACA 


AGTGCAGCGT 


CGCTAGAGGC 


5760 


ACCGAGCGAC 


GCTGATGCAA 


CCGAGGGAAA 


AGTTTCAGAT 


TCAGGTGTGA 


TACCACCGAT 


5820 


GAATGAAACT 


GGGAATGCTC 


AACCAGCGCC 


TGCTCCGAAG 


AAACCATCAA 


AAAAAGTAGT 


5880 


TTTCGTTGCG 


CTTGCGGTTG 


TTTTATTAGC 


TGGAGGTAGT 


GCTTGGGCGT 


ATCATGATCA 


5940 


AACGCAAAAA 


GCTGCCGCGC 


AAGAGGCGGC 


GGCACTAACC 


AAGAAAACAG 


ATACGCTGAA 


6000 


AGATGAACTA 


GCAGCCTTTT 


ATACCACGAA 


GGAGCAAGTT 


TTTATTAAAC 


CCGATATGGT 


6060 


GACTGTTAGT 


CCAGAAAAAC 


TCTCAAAACA 


AGTCGCGGAA 


ATTAAAGATT 


CAGAAGAGTA 


6120 


TTCTCAATTA 


AATAAACAAA 


TTCAAACATT 


GAAAGAGAAG 


CAACAAACCA 


TTCAACAAAT 


6180 


CAACCAATTA 


TTTGAGGCGC 


CGATTGTCAA 


TGGCAATGAG 


TTGAAACCGG 


CGATTCTAGC 


6240 


TGCTGATCAA 


CCAATTTCCG 


TAAAGAAATT 


AACAGGAAAT 


GATCCATTTG 


ACCAATTGAT 


6300 


GAACCAAGCA 


ATCGATCAaG 


CGAACCaACA 


ATACAACCAA 


TTACAAAAAG 


CTAAAAAAGC 


6360 


GGTTGAGGTC 


ATTTATAAAG 


ATGGAAAAAC 


TACCAATCAG 


CTGAATCGAG 


ACACGTATCA 


6420 


AGCTGCTAAG 


GCAGAAGTAG 


ATAAAGTAAC 


AAGTGATAAG 


CTGAAAAAAG 


AACTGGTCAA 


6480 


ACAAGTCACG 


ACAGCTGACC 


AAGCGTTAAC 


CAAGGTTGAA 


GAAGAACAGA 


AAAGAATTGC 


6540


AGAAGAACAA 


GCTGCCGCAG 


AGCAAGCAAA 


ACAAGCAGAA 


GAACAAGCGA 


AACAAGCTGc 


6600 


TGCCGCAAAA 


AAAGAGGACG 


CTAAGAAGGA 


AGAAACAGCA 


AAAACAGAAG 


CGAATGGCTA 


6660 


CACGGCGCCG 


AATAGTGACG 


GGGTTTACAC 


TAGTCCGCTT 


TATGCGCCAG 


ATGCTGCCGA 


6720 
TATCGCTGAT AGTAGCAATC CAGCGTGGAC ATGGGCACCA GGTGTGAAAG AAAAAGTCTT 6780

AGACACAGTG ATTGCTCGTG GCTATGTGGT TCCTGGCGGA TATTCTTTAG AACCTGCCAA 6840

AATTGTGAAT GGCGAAGGTT ACTACAATCT TTATGCAACA AACAATCAAT CAAAATTATT 6900 

AGAAGGCACT ACAGAGAAAA ATGTTCACAT GTATTTAGTT ACAATCAATG CTAAAACAGG 6960 

TTGGTTCAAA GGGAACGCTT CTCGTAATGC AGGGCAATAA AAGAAAATAG AAACAATCGA 7020 

ACAAAAGCGG AAATCTTTTG TTCGATTGAT TTTCCTTGAG GAGTTAATGA TAAATGAGAG 7080 

AATTACTAGC AACAACATTT ACCGGCATTG TGGTCGATGA AAATGAACAA TTTTATTTTG 7140

TTCAAAAAAA TGGCATCACA TTCCGTTTAA AAAAAGAAGA AGGTACGCAT GCTATAGGGG 7200 

AGGCTGTAGA AGGGTTTGGT TATTTAAATC AAAAACAAGA GCCAGCACTG ACAACTACAA 7260 

TTCCTACTGT GCGCATTGGT AGCTATGGCT TTGGTACTGT CACAGGAACG AGACGTGATT 7320 

TAGGTGTCTT TGTAGACGTT GGCTTACCAG ATAAAGATGT CGTTATTTCG TTGGATGAAT 7380 

TACCAACTAT GCGGGAACTT TGGCCTAAAA AAGAAGACCG AGTAATGGTT ACTTTAAAAG 7440

TGGACAGTAA AGACCGCATT TGGGGCGAaT TAGCAGnTGA nAAAGTCTTT AAAGCAATGG 7500 

CAAAACGTGG GAATGCTGAA ATGCAAAACC AAAACTTAAC CGGGCATGTG TATCGTCTGA 7560 

AACTTGCAGG AACGTATATT TTAACTGATG ATTTATACAT TGGATTTATC CATCCTTCTG 7620

AACGATTCCA GGAACCACGT TTAGGTGAAA AAGTAGTGGG GCGTGTGATT GGCGTGCGGC 7680 

CGGACGGCGT ATTGAATCTT TCATTAAAAC CAAGAAGTCA TGAAGTAATC AGCGATGATG 7740

CGTTAATGAT TCTAACTTTC TTAGAAAGAG CAAACGATCA CCAAATTCCT TTCACCGATA 7800

AATCAAGCCC TGATGAAATT AAACAAACCT TTGGCATTAG TAAAGCTCAA TTTAAACGCG 7860

CCTTAGGCCA TTTAATGaAA CAAAAATTAA TTAAACAAGA AgATGGTAAA aCAATTCTAA 7920

TTGGcTCGTT GGrACAATCm AAAGAAA 7947
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

GCACCAGCGT GGTTGGATCA TTCTGATCAC CCGTTTCATT GGCCCGATAC TGGTGTTTAT 60 

TTATATGAAC CTGGCCAAGA AGTTGTTTTT CAATTCGATC AATTGGTAAC AGATATTCGT 120 

GAATGACCCA GAAACCAGTT AAAAAGCAAT TGCGTTTAGG GTTAACTACT TTATTTGCTG 180 

TACAATTtAT GmAAGAAaTT TCACGATTTT TAaCAACCCA TCCACATGTG AaTTTAATAT 240

TACAGCArGA CGGCTCGCCA AAaTTACAAa CGATGTTAGC AAATAAAGAA ATTGATATGG 300 

GACTAATTTC TTTCCCAAAT ACCTTACCTG AAATTATTCA TATTGAACCT TTAGAAACGA 360
CTACCAAAGG


CTACCATGTT 


TATGTCGTAG 


TTCCAGAATC 


AAATCCTCTC 


TCCCAATATG 


420 


AAAAATTAAC 


TTTTAAAGAT 


TTGAAAGATC 


AACgATTTTC ATCGTTAAGT 


GATAATTTTA 


480


TGATTGGTCG 


CCTACTACTA 


GATCGGACTC 


bbnbb 1 1 bbb 


1 Inl bnuL-^y 


AATATCATTT 


540 


TGCATAACGA 


TGATTTACAA 


GTACTTCTTT 


nl/iul 1 luUn 


t\r\r\r\rv\ i. nn. 1 


TCGATTTGTT 


600 


TGCTGCCGAT 


TGAATATTAT 


GAAGTGGGAA 


a. a. n c t c a rrr 

Anno 1 bnbbb 


rib 1 unnn 1 bb 


ATTCCtTTAA 


660 


AAGATAAGTT 


TGACTATTTC 


CCCaTCGGCA 


rp rp rp rp rp r> /"» 

1 1 bL ill buu 


bbbbbnl ill 


AGTCTGACAG 


720 


AAGATGTTAG 


AGATTTTATC 


CaAATCATTA 


AAPAAAATTA. 
nAurtnnnl In 


A 1 bun 1 AA 1 L. 


TAGCCTTGAT 


780 


AAGCAAAGAC 


GATTTTACTA 


TCATAAATTT 


Ibnnbnbl 1 1 


TTfTA 7i 7AT74 7AP 


AATTTTGAGA 


840 


ATAAACCCGC 


GACAATTGGA 


ACAACAAACC 


AAbAAAAbAb 


1 AAbbb 1 AAb 


ATCCCTAAGC 


900 


CCGCATCCAA 


TGAAGCAAGT 


GGTCCCACGA 


GTCCAACCAA 


/-»/-' 7\ Tv TV rnr>/~< tv 

bbbAAAI bbA 


GCCGATTGTG 


960 


GGGTTCCTGA 


AATGTTGAAT 


AGAACCACAG 


7\ tv m rn rn f* TV 

GAATTGCTGA 


AAT AG I I bb 1 


GTAAATAAAC 


1020 


AAGGAATTAA 


AATAATTGGG 


TATTTAAATA 


-n try t\ r+f* tv *t* 

AGTTAGGCAT 


CATCATTTTC 


ATGCCGCCCA 


1080 


AAGAGACAGC 


CAGTGTCACT 


CCCGATTGGT 


TGACATTCCA 


Tv /— » t\ tv m rn Tv tv rp /-« 

AGAATTAATC 


ACAAGAACAA 


1140 


CAGTTGTGGC 


TGCAATTCCC 


ATTGCAGCAG 


CACCTGCAGA 


CAAGCCATTC


AACTGAATCG 


1200 


CCAAACCAAT 


TGCCACAGTA 


GTGATTGGCG 


t\ *\ tv try y\ t\ m /""■ tv /■"• 

AAATAATCAG 


rn /"» TV /■"*✓'"• A A TV rp 

TGCAGCAAAi 


GAACAAGCAA 


1260 


TTAAAATACT 


CATGATAATT 


GGCTGAAAAT 


ORnni\ t\ tv 7V rnrn 

C AG T AAAAT T 


/-•rn rp tv tv rp /^» TV (■■"» rp 

GT TAA 1 CAC I 


TTCCCGATGG 


1320 


CAACCGTAAT 


TTGTGTCACA 


TATGGATAAA


TCAACA 1 bCC 


A 7\ Ti P A A A PPT 

AnAbAAAbb 1 


GCACCTACTC 


1380 


CAACAACGAT 


CGGTAAAGCA 


ATGATTTCCA 


b 1 bAAbbbnA 


11111 lHlLb 


ATGACTAATA 


1440 


AAAGTAAGAC 


TGCAACAGAA 


GCGGTAATCA 


1AAIA1 1AA1 


IjA 1 b 1 b Abb A 


GTGCCTGCAC 


1500 


CAATAAAGCC 


TTTCACTTCT 


GaATTAAATT 


1 An 1 bnbbbb 


AbAbbbAbbA 


AATGCTGCAC 


1560 


CGCCAGCAAT 


CATCATTTTC 


TGTGGCGTTA 


AbbbnAAl lb 


A A A A ("m ATP 

AAAAbblTlAl C 


AGTCCACCAA 


1620 


TAATTAAAGG 


GGTGGCTAGT 


TGGAAAATTT 


bbbbbbbAlb 


AA1»A ATOATT 

AA1 AAlbAl 1 


TCGATAATCT 


1680 


TGTATTCTGC 


AAAATATTTT 


AAAATGGCTC 


r~ ,r PAA r PA/"*ArT* 

b 1 AA I AbAbb 


HI ibbbAAl 1 


AAGGCGATAA 


1740 


TGGTGCCGGA 


AgcTGTTCCA 


GCTAACACTT 


ib 1 InAAAnA 


rp ATTPTTrrP 

Inl 1 b 1 1 bbb 


GTTAATTGAG 


1800 


CATCCAGTTT 


ATCAGTGGGC 


ATTTTCAACA 


b 1 LL Inl ivjn 


1 1H1 1 1 1 lul 


TTTTTTAGAA 


1860 


CAAGTTTGAG 


AATGATTGAA 


CTTTCTATTA 


b i\ 1 AAAb Ab b 




AAGAGCGCTT 


1920 


ATTTCTTTGA 


GGCGCTTTTT 


TTTTAACTAA 


AGCAAGGAAC 


GATTTTTAAT 


ATATATCCGT 


1980 


ATCCGCATTA 


TATTTCGCTA 


AATTATCTTT 


AACATCTCGc 


AAGAACTTAC 


CTGcTTGTTG 


2040 


TCCATCTAAA 


ATTCGATGAT 


CAATAGAAAG 


ACATAAATTA 


ACCATATCTG 


CCACTTTAAA 


2100 


GCCTCCATCT 


GCGGTTGGCA 


CAAGGCGTTT 


ATTAATTGAC 


TCTACTTGTA 


ATATCGCAGC 


2160 


TTGTGGATGG 


TTAATAATTC


CCATCGATTG 


GACGGAACCT 


AGTGTTCCAG 


TATTATTAAG 


2220 


AGTAAAGGTC 


CCTCCTTGCA 


TTTCTTTGCT 


TGCTAGCGTT 


CCTTGTCGGA 


CTTCTTGTGC 


2280 


CAAGCGATTG 


ATTTCTTTGG 


CTAGCCCTGC 


AATCGAATAG 


TTATCCGCTT 


GTTGAATCAC 


2340 



WO 98/50555 



PCT/US98/08985 




TGGCACATAA AGATGCTCGT C7GTCC-TTAC AGC7.-.7GGAT AAATTAACAT CTTTATGATA 2400 

AATAATTGAG CCATCATCCC AAGATGTATT GA77777GGA TTTTTCTTTA ACGCTTGAAT 2460 

GACTGCTTTG GCAAAAAATG GGAAGAAGCT 7AGAG.-.7AGG CCCTCTTGTT GTTTAAATTC 2520 

ATCTTTTAAT GAATTTCTAA G7TGGACTAA GTTGG7CACA TCCGCTTCCA CCATTAACCA 2580 

AGCGTGAGGG ATTTCGTTGA CGCTT7GGAC CAT777777A GCAATTGCTT TACGTACAGG 2640 

ATCTGCAGAG ACAATTTTAT CTGTGCTTGT 77CAG77GGA CTAGCTGTTG CGGCGCTTTC 2700 

AXTTTGCGAA gCAACTGGCT CCTCTGAAAT 7GAAGGC-C7A GTTCCTGGAC TCACTGTTTT 2760 

CTCAGGTGTT CTTGCCTGTG TTGGTGTAAA GXTTG7AACA 7CTTTTCTAG TAATTCGACC 2820 

ATCACGACCT GTCCCTGTAA CTTGTGTTAA A7CAA7CTTT TTTTCTTGGG CAATTTTAAG 2880 

GACTGCTGGG GAATAACGAC CATTA7TTTT 7TGATGGGAC GTCGCAGTGC TTGTTGTTTC 2940 

AGCCGTTTCA TGTTCTTGCG CTTGT7CCGC ACTTGCTTCT TTCACTGGTG CTAAAGTAGC 3000 

TACTTCTGTT TCTTCTGTTG T7TCT7CTGT CTCTAACGTC ATCACCGCTG TTCCGATGGG 3060 

CACATCTGTA TCTAGAGAAA TCAGAAATTC 7TT7ACTACA CCATCAAAAT CTGATGGGAC 3120 

TTCCGTGGTG ACTTTATCAG AAACGACTTC CATTAAGGGA TCATAgCGTT TCACTGAATC 3180 

TCCTGGTTTA ACTAACCATT GGACAATGGC 7GCC7C7GTG ACACTTTCGC CTAGATGAGG 3240 

CATTTTGATT TCTTTTGTGG CCATT7ATTT 7T7CC7CC7T GCTTCAATTA AAATTCTGCT 3300 

AATTCTTTCA TTGCTGCTAA TACTTGCTCT 7CAT7C-ATTA AAAATTCTCT TTCTAAAGGT 3360 

AATGCATAAG GCATGCTTGG ACAA7CAGGT CCTGCAAGTC GTTGAATCGG TGCATCTAAA 3420 

7CAAAGAGAG CATCTTCTGA AATCA7TGCC GCAA77TCAC TCATCACGCT GCCT7CTTTA 3480 
7TGTCTTC7G TTACGAGTAA GACCT7CCCT C-7TT7C7TAG CTGCAGCCAC TAA7G7TTCG 3540 
CGATCTAAAG GATATAATGA GCGAACATCG ACGA777CGG CATCAA7TCC TTCAGCGGCT 3600 
AGTTTTTCCG CCGCAGCTAA AGCTAA7TGC AAGG7CATGC CGTAACTAAT CACGGTTAAA 3660 
TCACTACCCG TTCTAACGAC A77GGC7T7G 7CAA77GG7A CGATATAA7C ATCTGCAGGC 3720 
ACTTCATCTT XTAGTAAACG ATA7AAACGC 77ATG7TCg7 AAAAAATGAC TGGATCATCG 3780 
GAACGAATCG CTGCTTTAAT CA7CCCTTTT GCATCATAAG GATTAGAAGG GGTCACCACT 3840 
CTCAATCCTG GTTGTCCACA AAAGACTTTT TCTG7AGACT GAGAATGATA CAGCCCACCA 3900 
CGAACACCGC CGCCATAAGG GG7CCGATAA ACGA77GGCG CAGTCCAATC GCCTTTTGTT 3960 
CTATAGCGCA TTGTCCGAGC CTC7GATAAC AATTGATTGG TTGCAGGCAA AATGTAATCC 4020 
GCAAATTGAA ATTCACCAAT TGCGCGATAG CCCA77AAAC CTAAGCCGAC AGCTAAACCA 4080 
CCAATTAAGC CTTCTGTTAA CGGTGTATTA AAACAACGCT CGTCACCGTA TTTAGCAGCC 4140
AAGCCC7TTG - 7TACACCGAA CACGCCGCCT 7TGTCGCCGC CGACATCTTC TCCAAAAATA 4200 
AC7ACTTTTT CATCACGAGC CATTTC7TCA GAAA77CC7A AGTTAAT7GC 7TCTAAATAA 4260 
GTCATCTCAG CCATTATATT TCTTCrCCTC 7TCTC7TATT TTGCATACAC TTCTTCTAAA 4320 
ATTGATGTCG


GTACAGGATC 


TGGCATTGCT 


TCTGCTTCAT 


CCGTTGCTTG 


ATTGATTTCT 


4380 


GCACGAATTT 


CTTCATCAAT 


TTTGGCAATG 


TCTTCGTCTG 


TTAAATAGCC 


CTCTTCTAAT 


4440


AATTGTTTTT 


CAAAAAGCTT 


CACTGCATCG 


TTCTTTTTCA 


TTTCTTCAAT 


TTCTTCTTTT 


4500 


GAACGATAAA 


CAGATTGATC 


ATCGTCAGCG 


GAATGAGACG 


TCAAGCGCGA 


AACCATTAAT 


4560 


TCAATCAATT 


TTGGTCCTTT 


TTTCCCGCGA 


GCCGCTTTTA 


CTGCTTCTTT 


AAATGCTAGA 


4620 


TAGACTTCAG 


TAAAATCACT 


ACCATCAACG 


GTTACACCTT 


CAAAGCCATA 


AGCTTTCGCG 


4680 


CGATCGGCCA 


TTCGTTTATT 


GGCATACTGT 


TCTTCAATTG 


GGACAGAAAT 


CGCATATTCA


4740 


TTATTTTCAA 


CAACAAAAAT 


AACTGGTAAT 


TTTTTTACGC 


CTGCAAAGTT 


CATAGCTTCT 


4800 


TGGACTTCTC 


CTTGATTGGC 


AGAGCCTTCC 


CCAGTGGTGG 


TTAATGCAAC 


AAAATCAGCT 


4860 


TTTTGAAGTT 


GCGCTGCATA 


ACCAACACCT 


GTCGCTAATG 


GCATTTGTGT 


ACTTACTGTT 


4920 


GAAGAGAAGG 


AAACAATATT 


ATGCTCTTTT 


GAACCATAAT 


GATTCGGCAT 


TTGACGACCA 


4980 


TGGGAAGAAG 


GATCCGCTTC 


TTTTCCAAAA 


GAACCCATTA 


AAATATCTTT 


GGAGGTCATG 


5040 


CCCCAAACCA 


AGCACGCGGT 


CATATCACGA 


TAATACGGTA 


AAAAATAATC 


TTTTTGAGGA 


5100 


TCAAAAGCCA 


TCGCCATCGC 


TACTTGTGCA 


ACTTCTGCCC 


CTTGACCAGA 


AATATTGAAA 


5160 


GAGGTCTTAC 


CAATCCTTGT 


TAATTGCCAC 


AACCGTTCGT 


CTAAACGaCG 


tCCTCTTAGT 


5220 


ACTTcAcGAt 


ATGCCTGaAT 


CAaTtCTTCT 


TTTGATAAAC 


CTGATTTTTT 


AAGCGCTTTC 


5280 


ATTTTCATCA 


ACCTTTCTCT 


ATTTGTGAAT 


CGCTAAGCCA 


TAAGTATCTA 


AGGCTGCTTC 


5340 


TTGCAATACT 


TCTGTCATTG 


TTGGATGCGC 


ATGGATTGCT 


TCACCAATTT 


CAATGGGCGC 


5400 


GGCATCTaAA 


TACATGGCAG 


TACTTGCTTC 


GGCAATTAAA 


TCCGTGACAT 


GTGGGCCAAT 


5460 


CATAGAAACC 


CCTAATAAAT 


CATCGGTCTT 


CTTGTCACGA 


ATTACTTCTA 


TAAAGCCATC 


5520 


TGTTTCTCCA 


TAAACAAGTG 


ATTTACCATT 


ACCATTAAAA 


TTAAAGGTGC 


CAATTACTAC 


5580 


TTCTTTTTCT 


GCTGGTAAAG 


TTTCTCTAGT 


ATAGCCGACA 


CTTGCTATTT 


CAGGATTTGT 


5640 


ATAAACACCT 


CGAGGTACGT 


TTGTATAATT 


TAAAGGTTCA 


ACTGTCTCGC 


CTAAAAGATG 


5700 


CTGAACAGCT 


AACTCTCCTT 


CTTTCATAGC 


CACATGTGCT 


AGTTGGAGTG 


TATCGATACA 


5760 


"ATCACCAATC 


GCATAAATAT 


GTCCTtCTGT 


GGTTTGATAA 


AATTCATTTA 


CTTCAATGCC 


5820 


TTTGTCAGTA 


TATTTCACTG 


AGGTATTCTG 


TAACCCTAAC 


TTATTGATAT 


TGGGCTGACG 


5880 


TCCAATTGCT 


ACCATGACTT 


TGTCAACGGT 


GAGGGTTTCT 


TGTCCTGCAA 


CTTCTACTTG 


5940 


AACCTTTTGT 


CCAGTAACTT 


TCGCTTCTTG 


GACTTTGCTT 


CCTAATAAAA 


TGTTAATTCC 


6000 


TCGTTGCTCT 


AAACGTTTCT 


TTAATTCTTT 


AGAAATTGTT 


GCACTCTCAT 


TTATAAGTAA 


6060 


ACGATCCAAA


AATTCAATGA 


TAGTTACGTT 


GACACCTAAA 


CTATTTAATA 


AAGAAGCCCA 


6120 


CTCGACACCA 


ATAACGCCAC 


CACCAATAAT 


TGCAATTGAT 


TCTGGTAGCT 


CTTCTAATTC 


6180 


AAGCATCCCA 


TCAGAAGATA 


AGATAAATTC 


TTCATCCAAT 


GGCAAATTAG 


GTAACGTTTT 


6240 


AGAACTAGAG 


CCTGTCGCAA 


TAATGACATT 


TTTAGGAcGA 


TAATTTCTtC 


TTCTCGTGTT 


6300 
GGAnCATTAA ACGTAACTGC CACTGCGCCA GAGACTGGTG AAAAAATGGA GGGACCTAAA 6360

ATCGCACCTT CG.CC.TGCT.AA A&TTgATTyT ATTTTTCTTG CATAAGCCTT CAACACCTTT 6420 

GTGTAACTGC TCAATAATTC CTTCTTTCCG TTGTTGTATT TTAGAAAAAT CAATGGACGC 6480

TGCCTCTGTT TCAATCCCAA AAGAAGCCGC TTGTTTTAAG GTATCAAAAA CCTCTGCGCT 6540

TCTTAATAAA GCTTTTGTAG GGATACAGCC TTTGTGCAAA CAAGTGCCAC CTAATTTGTA 6600 

TTTTTCAACA ATTGTGACGT TTAGCCCTTT TTGTGCAGCT CGAATTGCTG CTACATAGCC 6660 

TCCTGTTCCG CCACCTAAAA TCAGTAAATC TGTTTGTTCC GCCATTTTGA TCACTCCTGC 6720 

TCAATTTTTG CTTCACTATA ATTTAAAGCT TGTTCTTCCC CTGTTAATAC TCGATTGACG 6780 

CCTTCATATA AAGCAGCCAT CTCCATTTCA CCTGGGTAAA CCTTAATTGG TGCAATCCAA 6840

GTGACTTTTT GACTAATTTC TTGAACAACG GTTTGCGAAT AgcTGCGCCC CCTGTTAAAA 6900 

TAATTGCATC AATTGTGCCT TCAAGAACCA CGGCCATCTC GCCAATACTT TyyGCGATTT 6960 

GGTAACACAT TCCTTTTAAA TAGTAATTTG CTGTTTGATC ACCTGCAGCT ATTTGTGCTT 7020 

GGATATGCCG TAAATCTGTT TCACCTAGGT ATGATTTAAG ACCGCTATTC CCCGCAATTA 7080

GCTTTTTCAC TTGGCTAATC GTTAACTCCT GTTCAAGTAT CCATTGCGCA AATTCAACGA 7140 

GCGGTAAGGC ACCACTGCGT TCTGGCGTAT AAGGaCtTCA CCATCAgCCC ATTTACCACA 7200 

TCCACawGCG nCcTTTTgAT GCGCCCCCnA GCTAATGCCC CCACCTAAAT GAACAACGAT 7260 

AAAATTGCTT TGCTCATATG TTTTCCCtAA ATCCTCAGCG ATTTTTCGAG CGACCGCTTT 7320 

TTGATTTAAG GCGTGACCcA AaCkaCGCCG CTGAATGCCT TTTAATCCAG AAATCCTGGC 7380 

TAATGGCTGT AGCTCATCTA CAACAACAGG ATCGACAATA AAGGCTGGCA CGTGATATTT 7440

TTCTGCAAAC TCATTAG'JTA AAATCGCCCC TAAATTTGAA GCATGGGTAT TAAAGCGTTC 7500 

TGTCCGCAAG TCTTCTAGCA TTTGTTGGTC GACAAGATAC GTACCGCCAG GAATAGGCTT 7560 

AAGCAAACCA CCACGCCCTA CAACAGCGGC TAATTGTATG ATGTTATGTG TTTCTAAAAA 7620

TTCCGCGATC ATTTGTTTTC GAAATGATGT TTGACTGACA ACATTTTCAA AAGGTGCCAA 7680 

TTCTTGGACG CTATGTCTAA GCGTTTCTTC TGCCAAGCAA TCATGATTAG CAAAAAGCGC 7740 

TAACTTCGTG GAGGTCGAGC CAGGATTAAT AACCAATACT GTTTCCATTG TtTTCTGCTC 7800 

CTTkTTTtAA ACTTGTCTCA TGGCAAATCT TAATGAATGG AATTTGCTTT CTGTCGAATC 7860

GCTTC 7865
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

TGCTCCnGnT GTGCTCGATG TATTACAAGA AAGATTCCAA ATTCTTCGCA ATATTTATTG 60 
GATGCAACCT


ATTGGACGTA


GAAGTTTATC AGAAACGATG GGAATTACAG AGCGAGTCTT 


120 


GCGTACAGAA 


ACAGATGTTT 


TAAAGCAACT CAACCTGATT GAACCTTCAA AAAGCGGGAT 


180 


GACATTAACA 


GAACGCGGTT 


TGGAAGTCTA CCAAGGCTTA GAATTAGTCA TGAATCAATT 


240 


GCTTGGCATG 


CACCAAATCG 


AAAAAGAAAT GACACAGTAC TTTGGCATTC AAGGTTGTAT 


300 


CGTTGTTGCA 


GGAGATAGTG 


ATATCCAAAA GAAAGTTCTT TCGGACTTTG GAGATGTCTT 


360 


AACCAATACG 


CTGAATTTAC 


TCTTACCAAA TGGTGAGAAT ACAATTGCGG TTATGGGTGG 


420 


GACAACAATG 


GCCATGGTGG 


CCGAAAATAT GGGATCACTA GAAACTGAAA AACGTCATAA 


480 


TTTGTTTGTT 


CCAGCCAGAG 


GAGGGATTGG TGAAGCAGTT TCTGTCCAAG CCAATTCAAT 


540 


TAGTGCTGTA 


ATGGCCAACA 


AAACGGGTGG TAACTACCGG GCGTTATATG TCCCAGAACA 


600 


ATTGAGTCGT 


GAAACGTATA 


ATAGTTTATT ACAAGAACCC TCGATTCAAG AAGTTCTTAC 


660 


GTTAATTAGT 


CATGCCAACT 


GTGTGGTGCA CAGTATTGGT CGAGCGTTGC ACATGGCGGC 


720 


ACGCCGaAAA 


ATGTCTGATG 


ACGAAATGGT TATGCTCAAA CAAAAAAATG CCGTGGCAGA 


780 


ATCGTTTGGT 


TATTTCTTTG 


ATGAAGAAGG AAAAGTGGTC TATAAAATTC CACGAATTGG 


840 


CCTTCAATTA 


AAAAACTTAC 


AAGAAATTCC CTATGTCGTC GCAATTGCGG GCGGCAAAAC 


900 


GAAAGCCAAA 


GCCATTCGCG 


CATATATGAA AAATGCACCA AAACAAACGT GGcTCATCAC 


960 


AGATGAAGCT 


GCCGCAAACG 


AGATTTTAAA AGGGGTAACC CTTTAAAATA AAAAATTTTT 


1020


GATTTTCATA 


AGGAGGAAAT 


CTTTAATGAC AGTTAAAGTA GGTATTAATG GATTTGGACG 


1080 


TATCGGACGC 


TTAGCTTTCC 


GTCGTATCCA AGATGTAGAA GGAATCGAAG TAGTTGCAAT 


1140 


CAACGACTTA 


ACAGATGCTA 


AAATGTTAGC ACACTTGTTA AAATATGACA CAACTCAAGG 


1200 


GCGTTTCAAC 


GGTACAGTTG 


AAGTTCACGA AGGTTCATTC AACGTTAACG GCAAAGAAAT 


1260 


CAAAGTTTTA 


GCTAACCGTA 


ACCCTGAAGA ATTACCATGG GGCGAATTAG GCGTAGACAT 


1320 


CGTTTTAGAA 


TGTACTGGTT 


TCTTTACTTC TAAAGAAGCT GCTGAAAAAC ATTTAACTGC 


1380 


TGGTGCAAAA 


CGTGTAGTTA 


TTTCAGCTCC TGGTGGTAAC GATGTACCAA CAATCGTTTA 


1440 


CAACACAAAC 


CATGAAACAT 


TAACTGGAGA AGAAACTGTA ATTTCAGGCG CTTCTTGTAC 


1500 


TACAAACTGC 


TTAGCTCCAA 


TGGCTAAAGC TTTACATGAC AACTTTGGTG TTGTTGAAGG 


1560 


TTTAATGACA 


ACTATCCACG 


CTTACACAGG TGACCAAATG ACATTAGACG GACCACATCC 


1620 


TAAAGGCGAC 


TTCCGTCGTG 


CGCGCGCTGC TGCTGCAAAC ATCGTACCTA ACTCAACTGG 


1680 


TGCTGCTAAA 


GCAATCGGCT 


TAGTAATCCC AGAATTGAAC GGTAAATTAG ATGGCGCTGC 


1740 


TCAACGTGTT 


CCTGTAGCAA 


CTGGTTCATT AACTGAATTA GTTACTGTAT TAGACAAAGA 


1800 


AGTTACTGTT 


GATGAAGTAA 


ATGCAGTAAT GGAAAAAGCT GCTAACGAAT CTTATGGTTA 


1860 


TAACACAGAC 


GAAATCGTTT 


CTTCTGATAT CGTAGGTATG ACTTACGGTT CATTATTCGA 


1920 


TGCAACTCAA 


ACAAAAGTGA 


TGACAGTTGG CGACAAACAA TTAGTTAAAA CTGTTGCTTG


1980 


GTATGACAAC 


GAAATGTCAT 


ACACTGCTCA ATTAGTTCGT ACTTTAGAAT ACTTCGCTAA


2040 
CTTATAAGAT TCATTCTTAT CATTAGTGAA ATTTGAACAA TTGTGAAATA ATAAGCGGGG 2100

AAGCAACGCG CTTcTCCGCT TTCTTTTTTT AAAAAAGAAA AATCCTAAAA TTGACAATAC 2160 

AGGAGGTCTA CTCATGGCTA AAAAGACAAT TAAAGATGTA GATTTAAAAG ACAAAAAAGT 2220 

CCTTGTCCGT GTTGACTTTA ACGTTCCGTT AAAAGATGGC GTGATCACAA ACGATAATCG 2280 

TATCGTAGCG GCTTTACCAA CAATCAAATA CGTGATTGAA AATGGCGGAA AAGCAATTCT 2340 

TTTCTCTCAT TTAGGTCGTG TAAAAACAGA AGAAGACAAA GCTGGCAAAT CATTAAAACC 2400 

AGTGGCTGAA CGTTTAAGCG AATTACTAGG TCAACCAGTT ACTTTTGTTC CAGAAACACG 2460

TGGTAAAGAA TTAGAAGACG CTGtTAACAA TATGAAAGAT GGCGACGTTT TAGTATTTGA 2520 

AAACACTCGT TTTGAAGATG TTGACGGCAA AAAAGAAAGC GGCAATGACG CTGAATTGGG 2580 

TAAATACTGG GCTTCTTTAG GCGATGTATT TGTCAATGAT GCTTTTGGTA CTGCACACCG 2640 

TGCACATGCC TCAAACGTTG GAATTGCTTC AACTGGTATC CCAACAGTTG CTGGTTTCTT 2700 

AATGGAAAAA GAAATTAAAT TCATCGGCGA AGCTGTTGAA AATCCAAAAC GTCCATTCGT 2760

AGCAATCTTG GGTGGCGCAA AAGTTTCTGA CAAAATCGCT GTCATTGAAA ACCTAATTGA 2820 

AAAAGCCGAT AAAATTTTAA TCGGTGGCGG TATGGCTTAC ACATTCATGA AAGCACAAGG 2880 

CTATAGTGTT GGTTTATCTT TACTTGAAGA AGATAAAGTG GACTTAGCGA AAAGCTTAAT 2940 

GGAAAAAGCT GGCGACAAAT TAGTTTTACC GGTTGATACA GTTGTCTCTA AAGAGTTCAG 3000 

CAATGACGCT CCTTTCCACA CGGTTCCTTC AACTGAAATC CCAGATGATG AAGAAGGCTT 3060 

GGATATTGGT GAAAAAACAA TTGAATTATT CGCTAACGAA TTACAAGGTG CGAAAACAGT 3120 

TGTTTGGAAT GGACCAATGG GCGTATTTGA AATGAGTAAC TTTGCCAAAG GAACAATTGG 3180 

TGTTTGTGAA GCGATTGCTA ATTTAGAAGA CGCAACAACA ATCATCGGTG GCGGGGACTC 3240 

TGCTGCTGCA GCAATTCAAT TAGGCTACGA AAACAAATTC TCTCACATCT CAACAGGTGG 3300 

CGGAGCAAGT TTAGAATTAT TAGAAGGCAA AACTTTACCA GGTTTAGCTT CTATTAACGA 3360 

TAAATAATCA GTTTAAAAGA CAATATTTTT AAAAAAAGAA AAGGATGTGC TTTTCATGCG 3420 
CAAACCAATT ATCGCTGGTA ACTGGAAAAT GAACAAAACT TTAAGCGAAG CACAAAGCTT 3480 
TGCAGAAGCA GTAAAAAATG CTGTTCCATC AAACGACGTT GTTGACGCTG TGATTGGATC 3540 
TCCAGCATTA TTCTTAGCTC CTTTAGCTTG GAACCTAAAA GATTCTGAAG TGAAATTAGC 3600 
TGCACAAAAC TGCTACTGGG AAAACGCTGG TGCCTTCACT GGTGAAAACT CACCAGCTGC 3660 
AATTGCAGAT TTAGGTGTAG ACTATGTGAT TATCGGACAC TCTGAACGTC GCGAATATTT 3720 
CCACGAAACA GATGAAGATA TCAATAAAAA AGCCAAAGCA ATCTTTGCAA ATGGTATGAC 3780
ACCAATCTTC TGTTGTGGTG AAACTTTAGA AACGTATnAA GCTGGTAAAA CAGCGGAATG 3840 
GATCGAAGGT CAAATCACTA AAGGCTTAGT TGGTTTATCA AACGAACAAG TTGCTTCAAT 3900 
GGTTATCGCT TACGAACCAA TTTGGGCAAT CGGAACTGGT AAATCTGCAG ATGCTAATAT 3960 
TGCAGATGAA ATCTGTGGTG TTGTTCGTTC AACAGTTGAA AAATTATACG GCAAAGAAGT 4020
TTC 4023
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 



GGGCGGTGTC 


ACTGCGTCTA 


TTTTCGGTTA 


TCGGCCAACC 


TTTTTCATTA 


CTGGTACGAT 


60 


TTTATwACTC 


GTTTTTGTTT 


TGAGTCTCGT 


CTTTGTCCAT 


GAAGAGTTTG 


TGCCAATTGA 


120 


AAAAAATCAA 


GCCGCATCTG 


GGAAACAAAT 


TTTAAAAAAA 


CTAGAACATC 


CCCACGTGAT 


180 


TCTTGGAATG 


TTTATTACGA 


CATTAATCAT 


TCAAGCTTCC 


AATAATTCAA 


TTAGTCCCAT 


240 


CATCAGTTTA 


TATATTCAAC 


AATTGTTGGG 


TGGTCACGGA 


AATGTCACCT 


TAATTAGCGG 


300 


AGTCATTGCT 


TCTATTCCAG 


GGATTGCAAC 


GTTAATAGCC 


GCCCCTCGTT 


TCGGCCGGTT 


360 


AGGCGATCGT 


ATTGGCAGTG 


AACGCATCTT 


GACAATTGGC 


TTAATTTTAG 


CCATTTTTGT 


420 


TTACCTACCA 


ATGGCCTTTG 


TTCAAAATGT 


CTGGCAGCTT 


GCCATGCTTC 


GTTTCTTAGT 


480 


TGGGATTTCC 


GATGCCTGTT 


TACTACCAGC 


TGTTCAAACG 


TTGATCACTC 


GTTATTCCCC 


540 


AAGTGACGCT 


GCAGGTCGTA 


TTTTCAGTTA 


CAATCAATCT 


TTCCAAGCGA 


CAGGAAATGT 


600 


GATTGGTCCT 


ATGATTGGTT 


CAAGCGTATC 


TGCCGCTTTC 


GGTTATCGTG 


GGGTCTTTAT 


660 


TTCCACTTCT 


TGTCTGGTTC 


TCCTTAACCT 


TCTATGGGTT 


CGTCGAAGCA 


CCGCCGAATT 


720 


AAAAAAAGAG 


AAAAATGATG 


ACTAAGTTAT 


ACACAGAAAA 


GTTATCCACA 


ATTTGAATTT 


780 


TTAATTTTTC 


TCTTTTTCTT 


ATTTTAAAAA 


AAATAGGCAT 


ATAAAACAAA 


AAAGAAAAGC 


840 


GGATATTTAC


TGGTTTTATG 


ACCAGCATAT 


CCGCTTTTCT 


TTTGGtTAAC 


TATACGCTTT 


900 


TTGTTTCACA 


ACTCGTGAAA 


CTTTTTAGAC 


GTATTGTAGT 


TCTTTTTCAT 


CTTGGTAGGC 


960 


GCGATCAATT 


AAGCCGCCCC 


CTAAGCATTC 


CATCCCATCA 


TAAAAAACAA 


CAGCTTGACC 


1020 


TGGAGTGACT 


GCACGCGCTG 


GTTCCTTAAA 


CGTAACGGTT 


GCTTTTGTGC 


CATCTTCAGA 


1080 


TAAGGAAACA 


TGCACTGGAA 


TATCTGCTTG 


ACGATAACGG 


AATTTAGCCG 


TACAGTCAAA 


1140 


CTCTTTTGGc 


ATTGGTGTAT 


CAACAGTAAA 


ATGAACCTCA 


CTAGCTTCTA 


AATGTGtTGc 


1200 


ATATAGTTTT 


TCATGGTGAA 


ATCCTTGtCC 


CACATATAAT 


GTATTGGTAG 


TTAAATCTTT 


1260 


CCCAATCACA 


AACCATGGTT 


CTTGTGTTTT 


ACCGCCGCCA 


CCAATTCCTA 


AGCCTTGGCG 


1320 


TTGACCGATA 


GTATAATACA 


TCAGACCATC 


ATGTTGTCCT 


TTGATTTCGC 


CATCTTCTGT 


1380 


CACCATGTTG 


CCTTTTTTAG 


CTGGTAAATA 


ATTACTCAAA 


AATTCTTTGA 


AATTCTTTTC 


1440 


ACCAATAAAA 


CAAACGCCCG 


TTGAGTCTTT 


TTTCTTGGCT 


GTGGCTAATC 


CAGCACGTTC 


1500 


AGCAATTGCA 


CGAACTTCTG 


ATTTTTCCAT 


GCCACCCAAT 


GGAAACATTG 


TTTTTGCCAA 


1560 
TTGGGCTTGT


GATAATTGGC 


TTAAAAAATA 


GGTTTGGTCT 


tt;attattgt 


CAATTCCACG 


1620 


TAACATATGC 


ACTACGCCAT 


TTTCATCTGT 


TTCTACTTGC 


GCATAGTGGC 


CAGTkGCAAC 


1680 


ATAGTCAGCG 


CCTAGTTGCA 


TAGCATAATC 


TAGGAATGCT 


TTAAATTTAA 


TTTCTTTATT 


1740 


ACACATGACA 


TCTGGATTTG 


GTGTCCGTCC 


TAACCGATAT 


TCTGCTAAAA 


AATATTCAAA 


1800 


CACACGGTCC 


CAATATTCTT 


TTTCAAAATT 


GACCGAATAA 


TACGGAATCC 


CAATTTGATC 


1860 


CGCTACTTTT 


GCGACATCTT 


TATAATCCTC 


AGTTGCGGTA 


CAGACACCAT 


TTTCGTCTGT 


1920 


ATCGTCCCAA 


TTTTTCATAA 


AGATCCCAAT 


TACATCATAG 


CCTTGCTCTT 


TTAAAAGCAA


1980 


CGCTGTGACA 


GATGAATCGA 


CACCGCCACT 


CATGCCTACC 


ACGACACGCG 


TTTTGCTGTT 


2040 


ATCTGCCATT 


TTATCACCAT 


CATTTCAATT 


AGATTCTGAA 


ATCATCCACG 


ATTTGAAGGT 


2100 


CGTAACTTTT 


CAGTGCTTTT 


CATTATACTA 


TTTTTTTGAA 


ATAAGTAAAG 


CAAGCCGGGC 


2160 


GTATAATGAG 


AAAACACTTG 


GAACAAAGAC 


ACAGCCAACA 


AGCTATTGCT 


TTCAAATAAA 


2220 


CAGAAAACcT 


GmAGGAACAA 


GCTACTTTCG 


CTTGTkTCTT 


CAGGTTTCTC 


TAAATTATTC 


2280 


TACTTTCT 












2288 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:



TTTAATTTTT 


GGGAAaAAAT 


TwCCaAAGGC 


TAAGGaTaGk 


TAAAtTGgGG 


ACCwTATTTg 


60 


GAATaTTGaT 


TAAATTCTGA 


CCACTTAATT 


GtCTTTkGAA AAggTTATGC


TAAAGTTACT 


120 


TTGTAATcGA 


ATGaGGAGGG 


CTATCACATG 


GaAGAAAAGA 


ATTGGCCTGA 


GTTATTTGCT 


180 


CGTTATCTTA 


TTGTCGAAAG 


AGGCTATTCA 


GAAAAAACAA 


AAAAAGCATA 


TCAAGAAGAT 


240 


ATACAACATT 


TTTTTTCGTT 


TTTAAAGACT 


TCAGGTAATG 


ACAATTATTT 


GACTGTGGAA 


300 


CATTTGGATG 


TTCGGGCTTA 


TTTGAGCGAA 


TTATATGATC 


AGGAATATAG 


CCGCAATTCG 


360 


ATTAGTCGAA 


AAATTGCCAG 


TTTGCGTTCT 


TTTTATCAAT 


TCTTATTAAA 


AAATGAAGCA 


420 


ATCCAAGAGA 


ATCCTTTTTC 


CTATGTTCAT 


ATGAAAAAGA 


AACAGTTACG 


GTTGCCCCGT 


480 


TTTTTTTATG 


AAAAAGAAAT 


GGATGCATTA 


TTTGAGAGCG 


CACAAGGGGA 


GCAACCGCTT 


540 


GATTTGAGAA 


ATCAAGCATT 


ACTAGAAGTT 


CTGTATGGGA 


CTGGTATTCG 


GGTCAGTGAA 


600 


TGTGCGAACT 


TAACGGTGGA 


CGCAGTTGAC 


TTTCAAGCTT 


CTGTTTTATT 


GGTTCACGGA 


660 


AAAGGAAATA 


AAGACCGTTA 


TGTGCCTTTT 


GGTTCATTTG 


CACAAGACGC 


CTTAAAGGAT 


720 


TACTTAGAGA


ATGGCCGCGC 


GCTATTAATG 


ACAAAATATC 


AAAAAAAACA 


TCCGTATGTC 


780 


TTTGTTAATC 


ATCATGGTGA 


ACAAATCACG 


CCAACAGGGA 


TTGAATATGT 


CTTAAACCAA 


840 


CTAATTAAAA 


AAAGTTCATT 


AAATGCGGAA 


ATTCATCCCC 


ACATGTTACG 


CCATACATTT 


900 
GCGACACATT


TATTAAATAA 


TGGCGCCGAT 


ATGCGGACAG 


TTCAAGAATT


ATTAGGACAT 


960 


GCTAATTTAT 


CAACAACTCA 


GATATATGCG 


CATGTGACAA 


AAGAAAGTTT 


ACAAAAAAAT 


1020 


TATCGAACGT 


TTCATCCTCG 


TGCATAATGG 


AGGTAAACGT 


CTAGGAGGAA 


CAATAATGGT 


1080 


TGAATCACAA 


TTTCACTCTA 


CAACAATTTG 


TGCAGTTGAA 


AAAGATGGAA 


AATTTGCAAT 


1140 


GGCTGGTGAC 


GGCCAAGTAA 


CAATGGGCGA 


ATCTGTCGTT 


ATGAAAGGTA 


CGGCCAAAAA 


1200 


AGTGCGCCGT 


ATTTACAATG 


ATGAAGTCGT 


TGTCGGTTTT 


GCAGGAAGTG 


TGGCAGATGC 


1260 


CTTTACATTA 


GAAGAAAAGT 


TTGAAGGAAA 


ATTAAATGAA 


TACAACGGAA 


ACTTAACGAG 


1320 


AGCTGCGGTT 


GAGTTArCCC 


AAGAGTGGCG 


TACACAACAG 


TCTATGCAAA 


AATTAGAAGC 


1380 


TATGCTGATT 


GTGATGAATA 


AGGAAGAAAT 


GCTCTTAGTT 


TCAGGAACTG 


GGGAAGTCAT 


1440 


CACACCAGAC 


GATGGTATTT 


TAGCAATTGG 


tTCAGGCGGT 


AATTTTGCTT 


TGTCCGCTGC 


1500 


AAGAgCCATG 


AAAAACTTTG 


GCGATAAAGA 


AATGCCmGCG 


AAAGAGATTG 


CTAAAAATGC 


1560 


CTTAAATATT 


GCTGCAGATA 


TCTGTGTATT 


TACGAATCAT 


AATATCATTG 


TAGAAGAATT 


1620 


ATAAGCGAGG 


GAAAATCCAT 


GAATGAATTA 


AACAAAACAC 


CAAAAGAAAT 


AGTCAAAGAA 


1680 


CTAGATCAAT 


ATATTGTTGG 


ACAGCAAGCT 


GCCAAAAAAT 


CAGTGGCGGT 


AGCTTTACGT 


1740 


AACCGCTATC 


GTCGCTTGCA 


ATTAGAAGAA 


AATATGCAAC 


AAGATATTAC 


GCCTAAAAAC 


1800 


TTACTAATGA 


TTGGACCAAC 


AGGTGTTGGT 


AAAACTGAGA 


TTGCTCGTCG 


TCTCGCAAAA 


1860 


ATTGTCAATG 


CGCCTTTTGT 


AAAAGTTGAA 


GCAACCAAAT 


TTACAGAAGT 


AGGCTATGTT 


1920 


GGTCGGGATG 


TCGAATCAAT 


GGTGCGGGAT 


TTAGTAGAAA


ATGCGATCCA 


AaTTGTTGAA 


1980 


AAACAACAAT 


ACAGTCGCGT 


GTATGCGCAA 


cATTAAAAAA 


GGCCAATCAA 


CGTTTAGTCa 


2040 


AAGTATTAGT 


GCCTGGAATT 


AAAAAAGAAC 


AAAAACAGGC 


TGGCGGTAaT 


CAATTTGAaC 


2100 


AAaTGATGAA 


CATGTTTAAC 


ATGGCTCAGC 


AACAGCAAGA 


AGCACAAGAA 


GAAGTAACGG 


2160 


AAGATATTCG 


AACGAATCGC 


CGAACAATTT 


TGGAACAGCT 


GGAAAAAGGT 


CTATTAGATA


2220 


ATCGTGAAGT 


AACAATTGAA 


ATTGAAGAAC 


CGAAGAAAAC 


GATGCCAGCT 


ATGAACAATG 


2280 


GCCTAGAACA 


AATGGGTATC 


GACTTAAATG 


AAACGTTAGG 


CGCTCTGTCA 


CCAAAGAAAA 


2340 


AAATCGAACG 


TACTGTAACG 


GTGAAAGAAG 


CACAAGAATT 


ATTAGTGAAA 


GAAGAATCAG 


2400 


CAAAAATTGT 


CAATGACGCT 


GATATTCATA 


GTGAAGCTAT 


TCGTTTAGCT 


GAATCAAGCG 


2460 


GAATTATTTT 


TATCGATGAG 


ATTGATAAAA 


TCACCTCTAA 


AAGTCAACAA 


AATTCGGGCG 


2520 


AAGTCTCTCG 


TGAAGGAGTA 


CAAAGAGATA 


TTTTGCCGAT 


TGTTGAAGGC 


TCCCAAGTTA 


2580 


ACACGAAGTA 


TGGTCCTTTA 


CAAACGGATC 


ACATTTTATT 


TATCGCTTCA 


GGTGCTTTCC 


2640 


ACTTGTCAAA 


ACCAAGTGAC 


TTGATTCCAG 


AATTACAAGG 


CCGCTTCCCA 


ATTCGAGTTG 


2700 


AATTAGATGA 


TTTAACGGCG 


GATGAtTCGT 


AAGTATCTTA 


ACTGAGCCAA 


ACAATGCTTT 


2760 


AATTAAACAA 


TATGTAGCAT 


TAATTGGCAC 


AGAAAATGTT 


TCAGTCATCT 


TTACAAAAGA 


2820 


AGCAATTGAA 


CGGTTAGCAC 


ACATCGCTTA 


TGATGTAAAC


CGTGATACAG


ATAATATTGG 


2880 
GGCGCGTCGT


TTACACACAA 


TTTTAGAGCG 


TTTATTAGAA 


GATTTATTAT


ATGAAGCACC 


2940 


AGATATGCaA 


ATGGGTGAAA 


TTACGATTAC 


CGAAGCATAT 


GTCAATGAAA 


AATTGAATGA 


3000 


CATTGTTCAA 


AATGAAGATT 


TAAGTCGTTA 


CATCTTATAA 


AAAAGGCTGG 


AGGAGAAAAG 


3060 


ATGGCTACAT 


TATTAGAAAA 


AACACGTCAA 


GTGAACGAGC 


TTTTGCAAAA 


AAACAATCTG 


3120 


TTTGACGTAC 


AAGCAGAACT 


TCCGTACAAC 


AAAATGGCAA 


TGATTTTAGG 


GGATATTTTA 


3180 


GAAAGTAATG 


CGTATATTAT 


TAGCAGTTCA 


GGGGATTTGT 


TGGGGTATAC 


TGAAAAATTA 


3240 


GATGTTAATA 


ATGCCCGTAT 


CAAAAATATG 


TTTAAAGAGA 


AAAAATTTCC 


TCAAGGATAC 


3300 


ACCGAGGCCG 


TGGATATGCT 


CAAAGTAACC 


GAAGCCAATA 


TTCCAATAGA 


TAGTGATTTA 


3360 


ACTGCATTTC 


CTTTTGAATC 


ACGAGAGCTG 


TATCCGTTTG 


GTTTAACCAC 


TATTGTGCCT 


3420 


TTGTACGGTG 


CTGGCAAACG 


CTTGGGTACG 


ATTATTTTAG 


CGCGTGTTGA 


AAAATCCTTT 


3480 


AACGAGGATG 


ATTTAGTGTT 


AGCTGAATAT 


AGTGCGACCG 


TTGTCGGCAT 


GCAAATTTTG 


3540 


TATCATCAAT 


CCAGAACAAT 


TGAAGCAGAA 


GTTCGCAGTG 


CGACTGCCGT 


GCAAATGGCC 


3600 


ATCAACACTC 


TTTCCTATAG 


TGAATTAAAA 


GCCGTCCATG 


CCATTTTTGA 


AGCCTTAGAT 


3660 


GGCGAAGAAG 


GTCGTTTAAC 


TGCTTCAAGT 


ATTGCAGATG 


AAATAGGCAT 


TACTCGTTCC 


3720 


GTAATTGTGA 


ATGCCTTAAG 


AAAATTAGAA 


TCAGCTGGAA 


TTATTGAATC 


ACGTTCTCTT 


3780 


GGTATGAAAG 


GGACTTATTT 


AAAAGTATTA 


AATCAACAAT 


TTATTAAAGA 


GTTGGAAAAA 


3840


TAAGCAATTG 


CTCGTATCAA 


CTAGGAGGAC 


AACGCAATGA 


CTGTACAAAT 


TGAAAATGAA 


3900 


TTTCTAATCG 


CAACATTTGC 


TGAAGAAGGA 


GCAGAATTAG 


TAAGTTTACA 


ATCAAAAGAA 


3960 


ACCGGCATTG 


AATATATTTG 


GCAAGGAAAT 


CCTGAATTTT 


GGGCCCGCCA 


TGCGCCCGTA 


4020 


CTTTTTCCGA 


TTGTCGGTCG 


TCTAAAAGAA 


GATACGTATA 


TGTACCAAAA 


TCAAGCGTAT 


4080 


CATTTAACTC 


AACATGGTTT 


TGCTCGTGAT 


CAAGTATTTG 


ATGTGATTGA 


AAAAGGTGGA 


4140 


GAAGAAGTTT 


CTTTTTCACT 


AAAATCAACA 


AAAGAAACCA 


AGAAGAAGTA 


TCCTTTTGAT 


4200 


TTTGAACTAG 


TGATTACTTA 


TACATTAGAA 


CATCAAGAAT 


TGACTGTAAA 


CTACCAAGTA 


4260 


GAATACACAG 


GGAAAGAAGA 


GATGTATTTT 


GGGATTGGTG 


GTCATCCAGC 


CTTTAATGTA 


4320 


CCTTTAGAAT 


CTTCACTAAC 


GTTTGAAGAT 


TATTATCTAA 


GTTTTTCACC 


TAAAAAATCA 


4380 


CGGACACAAA 


TTCCCTTAGC 


AGGCCCGTTT 


ATTGATTTAG 


CGAATAAAAC 


GCTAGCCCAA 


4440 


ACGAATACCA 


GCTTTGACTT 


AACGCATCAA 


TTATTTGAAA 


ATGATGCAAT 


GATTTTTGAA 


4500 


a c a z\ a a c c t c 


a a a p a CZC a a T 


TappaTTf^pp 


apap.apf:AAA 


ftTfJA^CATrA 


AGTCACTTTG 


4560 


AGTTATCCAG 


AAATGCCTTT 


TGTGGGGATT 


TGGTCACCGA 


CACCGAAAGA 


AGCGCCTTTT 


4620 


GTCTGTATCG 


AACCTTGGTG 


CGGGATTGCT 


GATGCTGTCG 


ACGCAACTGG 


CCAACTTGCT 


4680 


GaAAAAtTTG 


GCATCaATAA 


GTTmCCAGCT 


AACGAACTAT 


TTAAAmCACA 


GTATATGATT 


4740 


AGTGTAAAAT 


AAAGAAAAAG 


GCTGTGAACA 


TTATTTGTTC 


ACAGCCTTTT 


TCTTTATTTT 


4800 


TTTTGCACGT 


AAGCCAAAAG 


AAAGACGACT 


TTCTGTTCCA 


TTTTTAATTC 


GCTGAATGTT 


4860
TTCACGATGA


CGGACAAAGA 


TAAAAGTAGT 


CAAAGCAATC 


GCAATCACGG 


TTAATAACCA 


4920 


GTTAAATGTT 


GGTAAAATAG 


CTGGGACAGT 


GAAAGGCAAA 


ATAATAGTAG 


AAAGTGTAAT 


4980 


TAGTACAGCG 


CTAATCATAC 


TTGTTAAACT 


CACCATACTG 


GTTAGATACA 


GACAAATCAC 


5040 


AAAAATAAGA 


GCAGAATAAA 


TAAAAAATGT 


GGGGCTGTAT 


GCTAATAACA 


TGGCAGCGCT 


5100 


AGTGGCTACG 


GCTTTACCAC 


CTTTGAAATT 


GGCAAAAATA 


GGGAAGGTAT 


GCCCTAAAAC 


5160 


AGCTGCTACC 


CCAAAGAAGA 


GCGGATTCAC 


GCCTTGTAAA 


CCAAACAAAT 


AGGGTAATGA 


5220 


AGTGGCTAAC 


GTTCCTTTCA 


AGATATCCAT 


TAATAATACC 


GTAATTCCGG 


CAGGTTTCCC 


5280 


TAAGACACGA 


AATGTATTGG 


TTGTTCCTGT 


ATTCCCACTC 


CCAAATTGGC 


GTATATCTTT 


5340 


TTTAAAGAAA 


AGTTTACCAA 


TCCAAACACC 


TGAGGGAATC 


GAACCTAATA 


AATAGGCAAC 


5400


AAGTAACAAA 


ATGACGATTT 


TCATCAAGTT 


AACCTCTTTC 


TTATTCAAAC 


TAGGATTTAT 


5460 


TTTATCATGA 


ATAGAAAAAC 


GGAACAACGT 


CTAATAACTA 


AAACAAGAAA 


AGTTTTTCTT 


5520 


TTGCTTGAGG 


AAGCGTTGAC 


AAATGGTTGA 


TAACCGTTAT 


AATAAATGTG 


TGTCATTAGA 


5580 


CAAAGACGTC 


TAATCACTGG 


TTTGAaAATT 


AGTTATCAAC 


GAAGATACAA 


TATATATGTA 


5640 


CTTATGAAGT 


GTCAGGGAAA 


GAGGTGCGAA 


TCCTCTACAG 


ACCTACCTAC 


TGTATGGTGG 


5700 


ATGAAACCAA 


TAAGACCACA 


GATTATTCTG 


GAAGGATTGG 


GAGTAAGAAG 


AAGCTAAGTC 


5760 


AGGATACCGG 


CTTGATAAGT 


CTAATCATTC 


TTTTGTAGGA 


AAAAGAAGGG 


GCTTTGAAGA 


5820 


TAACTGGCTG 


AGTATATCAT 


TAGCATAGTT 


AAGTGACTGC 


TTTTTTTCTA 


TTGGAGAAAA 


5880 


AAGTGGTTTT 


TTTGTATTGT 


TTTGACGTTG 


AGACAAAGGA 


GGTTCATTTC 


AGAAAATTTT 


5940 


CCCCAAAATA 


AAATAGACGA 


ATGCGAGGAT 


GAAAAAATGA 


AAAAATTTAC 


TTTAACAATG 


6000 


ATGACTTTAG 


GTTTAGTAGC 


AACACTTGGC 


TTAGCAGGAT 


GTGGTAAACA 


GGAAAAGAAA 


6060 


GCAACTACCT 


CTTCTGAAAA 


AACAGAAGTA 


ACGTTACCAA 


CCAAAGACCG 


TAGCGGCAAA 


6120 


GAAATTACTT 


TACCCAAAGA 


AGCAACCAAA 


ATTATTTCCC 


TAGTGCCATC 


AACAACAGAA 


6180 


GTGATTGAAG 


ACTTAGGTAA 


AACCGACCAA 


TTAATCGCAG 


TTGATACTCA 


AAGTAGTACA 


6240 


ATGATGACTG 


ATTTAAAAAA 


ATTACCACAA 


ATGGATATGA 


TGGCTGTCGA 


TGCCGAAAAA 


6300 


TTGATTGCCT 


TGAAACCACA 


AATTGTTTAT 


GTGAATGACA 


TCAATTTAGC 


TAGCTCAGAA 


6360 


AGTGTTTGGA 


AGCAAGTGGA 


AGATGCTGGA 


ATTACAGTCG 


TTAATATCCC 


CACTAGTACA 


6420 


AGCATCAAAG 


CAATCAAAGA 


AGACGTCCAA 


TTCATCGCTG 


ATAGCTTATC 


TGAACATGAA 


6480 


AAAGGACAAA 


AGTTAATCAA 


AACAATGGAT 


CAAGAAATCG ACGAgTAGCG 


AAAATTGGTA 


6540 


AAACTATTAA 


AArACCAAAA 


ACTGTTTTGT 


TTGAAGTGGC 


TGCCTTACCA 


GACATTTATA 


6600 


GTTTTGGTAA 


TGGGACATTT 


TTAAATGAAT 


GGATTGAACC 


AATTGGGGCA 


AAAAATGTTT 


6660 


TGGCCAATGA 


AAAGGCTGGT 


TnCCAGTGAC 


A 






6691 


(2) INFORMATION FOR SEQ ID NO: 89: 









(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20072 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



TTAnGGTAnC 


CAATCGTTCC 


CCGGTAAATT 


TGGACAGTTT 


TAATCTGTTC 


CACAAACGAT 


60 


nAACGGCTCC 


GTTTGTTAAT 


AACTGCCGTT 


CTAACATCTT 


TTCTAAATTG 


TGAAACAATT 


120 


TCTTGTTTGG 


CATAGGGCGG 


ATCCAATAAG 


ACCAAATCAA 


ATTGTAACTT 


CTCTTCATAA 


180 


AACTGTTCTA 


AGGCACGATT 


AGCATCCATT 


TTTCTAACTT 


CAAATTTTTC 


TGGTTCTTTT 


240 


GTAATCGCAA 


TATTTTCTTT 


AATTACTTTT 


AGTGCAGCAA 


AATTCTTTTC 


AATACAAATA 


300 


CTTTTGTCCA 


TTCCACGAGA 


AACGGCTTCA 


ATCGCTAACC 


CGCCACTTCC 


TGAATATAAA 


360 


TCCAGCGCCA 


TTCCGCCATC 


GAAATATGGA 


CCAATCATAT 


TAAAAATAGA 


TTCTTTTACT 


420 


TTATCTGTTG 


TGGGCCTAGT 


ATTGTCGCCA 


TCTAACGCTT 


TCAAGCGCCG 


GCCACCATAT 


480 


TCTCCAGAAA 


TAACACGCAT 


TGTTTCGACT 


CCTTTATCAA 


TCATTCTTTT 


ATAATTCATT 


540 


ATAACAAAAA 


GAGTGAAAAA 


TGTTTGAAAT 


TTCTCATAAC 


ATTTCTCACT 


CTTTTGCATT 


600 


AGACATTCTC 


TGCTAAGCGA 


ATCTCTGTAG 


ATAATTCCTC 


TACTACATAG 


CCGTCGTCTT 


660 


CTTGTTCTTG 


ATAAGCCGCT 


TTGGTACCAA 


TTTTTTCAGC 


AAAATTCATT 


TCGACATCTG 


720 


GTCGATACGA 


TTTATCGACA 


CTACGAACAA 


AATGTAACTG 


ATTAATTTTT 


TCCATGTTAC 


780 


TTTCAATATT 


TTCTTCATTC 


ATGTAAATCA 


CCACATATTT 


CATTTTTCGT 


GAAACATAGT 


840 


GAATCAGTCC 


ATATCGGCGT 


AACTGTTTTA 


ACGGTTTTAA 


TGTGTATACC 


CAAACAATTA 


900 


GGCAACGACG 


CTCTTGAAGC 


GTAAATTCTT 


TTTCTTCTTT 


AGCTTGCATG 


GCAGCTTCCT 


960 


CCACAACCTG 


AATGTTTCTT 


TCCTCGTTCA 


AAGAAAGGAT 


TTCCAGCATC 


TACTTTAATA 


1020 


TCTTCAGAAA 


TTAGTTGCGC 


AATCTTTAAA 


CCAATTGTAT 


CTAGCAACGT 


TTGAACGTTT 


1080 


GTCTCAGCAA 


AACGATAATT 


TGCAACGATT 


TCATTTAAAT 


CAAGCGCCCG 


CTTCGCTTTA 


1140 


CGTAGGGCTC 


GTTGTTTCTC 


CCGAAAATCG 


GGTGCATGAA 


TTCCATAAGC 


CTCTACTCGT 


1200 


TCAAAGGCTG 


ACTTAGCTTC 


TAAAAATGCT 


TTTTGTAACT 


GAGCAACTTC 


CGCCGAAGCA 


1260 


TACATAGCTT 


TTTTTGTTTC 


TTTATAGTTT 


TGAACAGCAT 


CACTTTCTAA 


AAGTGCTGAA 


1320 


ATTAAACATT 


CTGTTTGATC 


TTCAATTGCA 


AATAAATGTT 


CTGTTACAAT 


CATAACTTTT 


1380 


CACTACTCTC 


CGTGATTTGC 


CTATTTTCTG 


GTCGTTTCTC 


TAATAAGACT 


AACACACCTT 


1440 


CATTGGAAAA 


GGCAACGCCC 


AATTCTGTCG 


TTTCTGACTC 


AGCAAAATAG 


CTATCATATT 


1500 


GCGGGTCCGA 


AAACCATGaC 


AAAATAGTAA 


AGGTACTATC 


ATAAACAGGT 


TCTTGGAAAA 


1560 


TACCtGcTGG 


TTCTCTCAAC


GAAAGTTkGC 


TCGTTTtAAC 


TAATAAAGrA 


AATTCGkCCT 


1620 


TAGTCmAATG 


AAaGGGATGA 


ATGGCTGTAT 


GTTCCTCTCT 


CGTTTGTAAC 


AATTCTGCTT 


1680 


GTCGATTTGC 


CGACAAAATT 


GTTGGTGTAT 


GGTTAGCCAT 


CGTTTGATAC 


AATGTTTGGG 


1740 
cATCTTTTTG
GATTCATGAC 
TATCAACTGG 
CTTTATTGAC 
CAAAACTGCC 
CGACATCGTA 
CAGACAAGTC 
AGGCCTTAAT 
TCAAACCGAA 
CAAACTGGCG 
CAATTCTTCA 
GGTTGGTTTT 
AAAAACAGCA 
TCACTTCTAA 
CTACCGTGCC 
GCAACGGCTG 
CAGACATCGT 
CAGCAGACTT 
AGAACAAGAC 
ACGTTTTTCC 
ATGTTGGCGT 
AAAATACTTG 
TTTTTTCGGC 
GcCGCTCTGT 
CACCAACTGG 
GTTCTTCACC 
CCATGTCTCC 
TCACTGTGTG 
CTTGCTGCTG 
GCGGGGCATT 
GATTGATTTT 
CATAATATAG 
CGTCAACAAT 



GCTTTGTGTA 
ATAAATCGCT 
CAAAGGAGAA 
ATAATTAACT 
ATTATCAAAA 
AGCTTCATGA 
AGCAAGCTTC 
TGCGATGACT 
CGTCCATAAT 
GATAAAGATT 
TGAGGAACAG 
TCAGCAGGGA 
GTAAAGGCCC 
TAACAAATCA 
CGCAAAACGT 
ACCTTTTTCT 
TGCGCCGATT 
AATAGAGGCA 
ACGGTTTCCG 
TCGTTCAATT 
ATCTAATAAG 
TGGATACATT
TAATTCTTCT 
GAAGGCTGGT 
TTGACCTAAA 
ACGCGCATAA 
CACGACTTTT 
ATATATTTTT 
TAGATTAGAG 
TAACCCATTT 
CTGTGCATTA
ACTGTTCATA 
ATCAACGCCC 



GAAACTGAAA 
TGGTTACTTT 
CCTTCATTTA 
GCCATTAAAC 
GCAACCAGTG 
TGAAAATTAA 
ATATCTATTT 
TTATTTTCTC 
TCATACGTAC 
TCCACATCTT 
CGGTGTGAGC 
AAAAAACTGG 
AAAATCGCTT 
CCTGAGCTGA 
GCTTCGATAG 
ACTTTATCGC 
TGTTCTTTAT 
TCTTTAACTA 
TCAATATCAG 
TGAACTTCCA 
GTTATATCTC 
AAATAACTTa 
TGTACTTTCG 
CGACCTTTTA 
TCACCTTGGA 
ACATCTTGTT 
GAAGATGGCG 
TTGATTTCAT 
TATTGCCCAC 
TCAAATGGTT 
TCAATGTTAA 
CTTGGCTGGC 
GCTTTAGTAG 



AAGCAACTTG 
GAACAGGATC 
ATTGATACGG 
CGCCTGTCCG 
GTCGATAGTT 
ATGCAAAATT 
TAAACGGTTC 
GAACGTTGAC 
GACCTGTTTT 
TCCCTATGTA 
GCTGGATAAG 
CTCTAAATAT 
CATTTTTTCC 
TTGcTTCGCC 
TCGTTTCCAT 
CACGTTTGAC 
TAGTTGGTTC 
AAACCTCACG 
GTTCGCCAAT 
ATGTCTCTCC 
CAAAGGTTTC 
AAACTTCTTC 
CAAAATCAAC 
GAATAATTCG 
AAAAAGTCAC 
CTGTCAGATT 
TCACTTTTAC 
CCCAACGGTG 
CAGGCATTTC 
GATAATACAT 
TAGTTGGCGT 
TAGTTGCGCC 
CTGCTGAATA 



ACTTTCTTGc 
GAAGGTCATT 
CATTTCCGTG 
CTGATTAAAA 
CATATCTTCT 
CGAATAAATC 
TATCATTTTG 
TTCTAAGTAA 
AATTTTTTCT 
TGCTGCGTAC 
CGCCCACTTT 
CCAATGAGTA 
CCTCTCATCT
TTCTTCAACA 
TTTCATGGCT 
CAATACTTGT 
CGCCTTTTGT 
ACGTTGCCCA 
TTCATCTAAA 
TTGACGAATA 
GTATTTTTGA 
CAATTTAGGT 
AGGTGCTGcT 
TTGCAGCTCT 
AACTGATTCA 
ATTTTGAACC 
AATATCGCCA 
CCCTAAGCCT 
ATGCATATAG 
GCGCACATCT 
CCGTTCACCA 
ACTCATCGCA 
TGTGATGATC 
TGTTTAACCA


1800 


GACTGCGTTA 


1860 


AGCAACGTTT 


1920 


AATAACACCG 


1980 


TCCATCAGCT 


2040 


GTCATTAAAT 


2100 


TCATTGTTAA 


2160 


TCATTCATCG 


2220 


TTAGGATTGC 


2280 


CAGAAGCTGG 


2340 


GAGCTGTTTC 


2400 


GAACTGTTAA 


2460 


TATTTTTCTT 


2520 


TAAATGTGAT 


2580 


TCTGTAATCA 


2640 


AGAACAGAAC 


2700 


TTTACTTGAA 


2760 


TTCAAATTGA 


2820 


CGAATAATTA 


2880 


CCATTAAAGA 


2940 


CGATACTCTA 


3000 


TGaTACCCAA 


3060 


AAGTCACCTG 


3120 


TTCGGGAACC 


3180 


GGAAAACTTA 


3240 


ATGAAGAGTG 


3300 


AACATCAAAT 


3360 


ACCGCTTTTG 


3420 


ACTTCTGTTT 


3480 


TCCCAATAAT 


3540 


TTGACTAAAG 


3600 


CTCATTGCGA 


3660 


CCATTGCCAC 


3720 
TAGTGTCGTG


AGTGTGGAGA 


TGAATTGGTA 


AATCCGTGGC 


TCJCCTTTAAT 


TCACTAATTA 


3780 


AACGATAAGC 


AGCTTGTGGT


TTCAATAAGC 


CAGCCATATC 


TTTAATCGCA 


ATGATTTGTG 


3840 


CACCTAAATT 


TTCCAATTCT 


TTAGCCATAT 


CAAGGTAATA 


TTGAACATTA 


TATTTTGCTC 


3900 


GGGCTGGATC 


ATTGATGTCC 


CCAGTATAAC 


AAATTGCTGC 


TTCCGCAATT 


TTTCCGGTAT 


3960 


CCCGAACGAC 


TTGAATACTT 


TTTTCCATTT 


GAGGAATCCA 


GTTTAAGCTA 


TCAAAAATTC 


4020 


GGAAAACATC 


GACTCCTTGG 


CGAGCGGATT 


CTTTAATAAA 


TTCTTCAATA 


ACATTATCAG


4080 


GATAATTTTG 


ATAGCCTACC 


GCATTAGATC 


CTCTGAACAG 


CATTTGTAAA 


AGTGTGTTTG 


4140 


GCATCAACTG 


ACGAATTTTT 


CTTAAACGTT 


GCCATGGGTC 


TTCGGTTAAG 


AAGCGATAGG 


4200 


CAACATCGAA 


CGTAGCGCCA 


CCCCACATTT 


CACTAGAAAA 


CAGCTCAGGC 


AGGGCTGCAT 


4260 


CAGTTAGACC 


AGCAATTGCT 


TTAAAATCTT 


GTGTTCTCAC 


ACGAGTAGCT 


AATAAACTTT 


4320 


GGTGCGCATC 


ACGGAACGTG 


GTATCTGTCA 


TTAACACACT 


TTCTTGATTC 


TTAACCCAAT 


4380 


CGATGACGGC 


GGTTGCTCCT 


TGAGCGTCTA 


GAATATTTTT 


AGCTGTAATC 


ACTTTTTCTG 


4440 


GAACTTCAAT 


ATCTGTCGGC 


ACGCGCGGGG 


CTTCAAAATA 


TTTTTTCTCC 


GTTCTTTCAA 


4500 


TGCCAGGGAA 


CCCATTGACC 


GTTACTTCTC 


CAATGTATTT 


CATCGTTTTA 


TTGCCACGAT 


4560 


CGCGCATACG 


AGGAAATTCA 


AATAATTCAG 


GTGTATTATC 


AATAAAGGTT 


GTTTTGGctT 


4620


CGCCAGATTG 


AAACGCTGGA 


TAGCTCACAA 


CATTTTGTAA 


AAACGGAATA 


TTTGTTTTCA 


4680


CGCCACGAAT 


CCGAAATTCT 


TTTAAGCAGC 


GTTGCATTTT 


ACTGATTGCC 


TGTTCAAAAG 


4740 


AGAAACCATG 


CGTACAGACT 


TTAACCAATA 


AAGAATCAAA 


GTAGGGAGTC 


ACAGCGTAgC 


4800 


CAGAATAAGC 


ATTCCCTACA 


TCTAAACGCA 


CACCAAAACC 


ACCTGGTGAA 


CGATACGTAT 


4860


CGATTTTCCC 


TGTGTCTGGC 


ATAAACTGGT 


TCAGGGGATC 


TTCAGTCGTA 


ATCCGACATT 


4920 


GAATAGCCGC 


GCCTTTTAAT 


GTCAATTCGT 


TTTGTTTTGG 


CAAATGCATA 


TCTTTATGCA 


4980


AATCAAGCCC 


TTGCGCAATT 


TGTAATTGAG 


AGATCACAAT 


ATCAATATCT 


GTAATCATTT 


5040


CTGTGATAGT 


ATGTTCTACC 


TGAACACGAG 


GATTCACTTC 


AATAAAATAA 


AACTGATCCC 


5100 


CTTCTACTAA 


AAACTCAACC 


GTTCCCGCAT 


TCACGTAGCC 


GACATGTGCC 


ATTAACTGCA 


5160 


CAGCAGCCGA 


ACAAATGGCT 


GCTCGTTGTT


CTTCATTCAT


TGATACACAT 


GGTGCGACTT 


5220 


CTACCACTTT 


TTGGTGGCGC 


CGTTGAACGG 


AACAATCACG 


TTCAAATAAA 


TGCAAGACGT 


5280


TCCCATGATG 


ATCGCCTAAA 


ATTTGTACTT 


CGATATGTTT 


AGGATTAGAA 


ATATACTTTT 


5340


CAACATAAAC 


CTCGTCAGAA 


CCAAAGGCTG 


CTTTCGCTTC 


ACTTTTTGCT 


CTTTCGTAAC 


5400


CTTCTCGTGC 


TTCTTTGGCA 


TCGTGGGCAA 


CGCGCATCCC 


GCGACCGCCG 


CCACCTAAAG 


5460 


CAGCTTTAAT 


CATGATAGGA 


AAGCCATGTG 


TTTCACCAAA 


AGCTACAACC 


TCTTCCACCG 


5520 


TTGCTACTGG 


CCCGTCTGAG


CCTGGAATCG 


AAGCAATCCC 


CGCAGCTACA 


GCTGCTTCTT 


5580 


TCGCTTTAAT 


TTTATCGCCA 


AAAATATCTA 


AATGATGTGT 


TTTAGGTCCG 


ACAAAAATGA 


5640 


TTCCTTCTTC 


TTCACAACGT 


TCGGCGAAcG 


TAAGTTCTCT 


GATAAAAAGC 


CATAACCAGG 


5700 




ATGAA7GGCA 7CTGC7CCCS ATTTTTTGGC AA777GGATA A7AT77TCAA TGTCTAAATA 5760 

GGCTTCAATT GGTTT7TTCC CCTTACCAAC 7AAA7ACGC7 7CATC7GCTT TGAAACGATG 5820 

AACAGAATAC TCATCTTCTG CAGCA7AAAT C3CTAC7GTA CGGA7A7CTA ATTCTGTACA 5880 

AGCTCTAAAA A7TCGAA7C3 CAATT7CGCC ACGAT7GGCA AC7AA.TACCT TTTTCATAAC 5940 

GTCACTCCTA CCA7CAATTT CCTTCAAA7T T.-.TTGGTTCT GT7 CT AAAG A TAAATAGTAC 6000 

TCATTTTCTA ATT7CTGAC3 TGTTTC.-.TCT GCACTGATAT 7CAGGACAAA GGCCACAGCA 6060

ATCGAAATAA 7CAGCAAACT ATTCCCACCT TGACTCAAGA ACGGGAAAGT AATTCCTGTT 6120 

AATGGAATAA 7CCCAGTAAT CCCACCAACA 77AATAAAGA C7TGAA7TAA TAACATTGTA 6180 

CCAATTCCAA 7ACACATTA3 I3AATTAAAA GGTTTCTTCG A3CGTACACC GACTAAAATA 6240 

ATTCGCGCAA 7CA7AAACAT CAACAATrCT AAAATCGCTA GACCGCCGAT AATGCCTAAC 6300 

TCTTCTAAAG 7AA7CGCAAA GATAAAG7CC G7ATGTGCTT C-GGTAAAAA GCCTTTTTTC 6360 

TGAACACTAT 7CCC7AGACC TTTGCCAAAC CAGCCGCCAT 7AC7GA77GC ATAATAAGAA 6420 

TTAGCTAACT GA7GTCCTAA GT7CCGT7CA TCTAAAAATG GATTTTTAAA AACAGCAAAA 6480 

CGATTATAAA 7A7ATTG.--7A ACGCGCTGGG AAAATTTTTC C777AGACAT AATTAATAAC 6540 

TGAATAGCCG TGATACIAC" 7AAAA7GCCC AAGCCACCGA CCAAGCAGGT ATACATATAA 6600 

TTAATCCCAC TTGCTAAAAC CA7CACGAT3 G7AATTAAIG 77AGAATGGC CGCATTCCCG 6660 

AAGTCAGGCT GGATTGCAAC 7AGAGCAATT AATACAAAAA CGAGCA7CAA CGGGCGCCCG 6720 

GCAGCTTGTT 7-- AA77GA7 Z CATGCCTCCA T1AATTGTCT 7TTGTCGCCG TGCTAAAATA 6780 

TAAGACAGAT ACCAGACAA3 CATAATrTTC AAATACTCAG CTGGTTGCAT CGAAAATCCA 6840 

CCAATCTCAA 7CCAACCAG3 73CSCCG7TA ATTTCTTTCC CGATACCAGG AATCCGAACG 6900 

GCTAAAACCA 7AACCG7AA7 7ACCGCAATS GCAAACATGA TAAAACTTCG ATTCTGAAAA 6960 

ACGGATGTTT 7CA.TC77A7A AA7AJ^--AAC A7TGCAACTA ACCCTAC7AC CCAAAAGGCA 7020 

ACC7GG7TAA 7AACGAA7GA G37AGG7GAA AAACC7T7CA 77ACTTG7AA 7GCGGA7GTG 7080 

GAAC7ATATA CCATAA7CAA 77CCACGATA C7TAAAATTA AA7ATGGAAT AAAAATAC7A 7140 

TAGTCTAGCA AATGCCG77T C77TAC77TG T77GGCAAAG C7TAACCCTC CTTGCCAA7C 7200 

TCGTT7TTTT CTTGTG777C 77CATAAACA TCTGTA7ATA A77GATTCAA TTCGCGTTCT 7260 

AAATCACTTA GTAAGCGA7G 7CC7TCATCA 77AGT7AAAA TCCCCAAACG GGTCGCATAG 7320 

GTCAC7TGTC 77GAAAAACC A7ACAT77GT G7ATCCACCA C7TCTTCAAA TGC7TTACAT 7380 

TGAGAAATGC ACAAAC7G7T 77T7TGAITA CGAATAAC-CA T7TTAATGCG TTC7GCA7CG 7440 

TC77CTAATA AT7GAA7GGC AAA77TCTTT GAAAT7CCTT CGTCCATTTG 7ACAGAACCT 7500 

CCC7A7CTTA AT7AAA7AAT 7A7AACA7AA TXTCAG7TA GA7ACA7GAT GAk7TCAGAA 7560 
AAT77CCTT7 AAGAAA777T C7GAAAC7AA GCGATA7A7T AAGTAAGAAA TCTCT7TA7A 7620

7AACAAGC77 AT777A7CAC 7ACAACAACA AATACA7TAA AAGACTT7TT -TC77ATT7TC 7680 
TGCTTTTAAT


TTTAAATAAA 


ATAACAAACT 


ATATCATTTT 


TTTGTCTCTT 


GATATTCTTT 


7740


AATTACTTCT


AAAAAGGCGG 


TGCCATATyT 


ATCTAATTTA 


TTTTGGCCGA 


CCCCTTTTAT 


7800


TkGCAATAAT 


TGAATCGTTG 


TTTGCGGTAA 


TTTTTCACAC 


ATTTCTTTAA 


GTGTACTAIL 


7860


GGAAAAGACT 


ACATATGGCG 


GCACGCCTGC 


CTCTTGAGCC 


ATGTCCATAC 


GTAATTCGCG 


7920


TAATCGTTCA 


AACAAGGCAT 


CATCTACTGC 


TACTTTTCGA 


ACTTTTTGGT 


CTTCTTTTCG 


7980


GAAGACTTTC 


CGTTCCCCTA 


AAAGCACCTG 


AACACCTTCT 


GTCGAGACTG 


ATAACAACGG 


8040


AAATTGCCCA 


TCAGAAGGAA 


TTAAATAACG 


TTCTGCTGTC 


AAATAATCAA 


TCAGTTGTGT 


8100 


GACTTCTTTT 


TGCGTGCGAC 


CTTTCATTAG 


TCCGTATGTT 


GGCAAACGAT 


CAAAATGCCA 


8160 


TTGATCAATT 


TTTTGATCTT 


TTGAGCCAGT 


CAAGACTTTA 


CCAACTAAGC 


CCTTACCGAA 


8220 


ACGTTCTCCC 


ATACGTTTTA 


CACAAGACAA 


AACTTTTTGG 


GCATCGACCG 


TAATATCGAC 


8280 


TAATTCTCGA 


TTATCCAGAC 


AATTCGAGCA 


GCGACCGCAG 


TCCGTCCCTT 


CTTCACCAAA 


8340 


ATAGCGTAAA 


ATATATTTTT 


GTAAACACAT 


TTGTGCATTC 


GCATATTGTG 


ACATTTCCCG 


8400 


TAATTTCAAG 


TACTCTTTCT 


GCTTATAATC 


AATGGTCATT 


TCAGATTGTT 


CAATAAAGTA 


8460


TTGTTGAATT 


TGCAAGTCTT 


GTGGAGCAAA 


CATTAACACA 


GCATCACTAG 


GCAGACCGTC 


8520 


ACGACCAGCT 


CGGCCGGCTT 


CTTGGTAATA 


AGATTCAATA 


TTCCCTGGTA 


CTTGTGCGTG 


8580 


AATGACAAAA 


CGAACATTAC 


TCTTATTAAT 


CCCCATACCA 


AAGGCATTTG 


TTGCTACCAT 


8640


CACTTGCACT 


TGATCATACA 


AGAATGCTTC 


TTGATTTTCA 


CTACGTAACT 


GTTCACTCAT 


8700 


GCCCCCATGA 


TACATCCCAG 


CTGCAATTTT 


TTTACTTTCT 


AATAAATGAT 


AAATTCGTTC 


8760 


AACTTCCTTC 


CGCGTGCTTG 


CATAAATAAT 


ACCTGATTGT 


CCAGTATTCA 


TCTTTAAATA 


8820 


TTCCAAAAGA 


AAAACGTCTC 


TATTTTGATC 


TTTCACGACT 


TGAAAAGATA 


AATTTTCTCG 


8880 


AGCGAATCCT 


GTTTTGATTT 


CATTTTCTGA 


TGGAATTCTT 


AATTGCTTGA 


CAATATCCTC 


8940 


CGCTACTTGT 


GGCGTTGCTG 


TTGCTGTTAG 


TGCAATCACA 


GTTGGTTGCT 


GTTGAAATTG 


yuuu 


ATCGATAATC 


TCTGCTAAAC 


GCAAATAGCT 


TGGACGAAAA 


TCATGCCCCC 


ATTGTGAAAT 


yubu 


ACAGTGTGCC 


TCATCTACTG 


CCAATAAATC 


AATCGGTACA 


TGTGTCAGCA 


TCTGTTGAAA 


y izv 


ATCATAAGAT 


TCTAATCGCT 


CTGGCGCAAC 


ATACAATAGC 


TTGACTTCTT 


TGTTAACCGC 


en on 

y iou 


TAATTGAATA 


CGATGATTCA 


TCTCTTGATA 


GGAAATGGTA 


CTATTGATAT 


ATGTAGCTGG 


no/in 

y/:4 u 


AATCCCCATT 


AAATTCAGGG 


CGTCTACTTG 


ATCTTTCATT 


AAAGAAATTA 


ATGGTGAAAT 


yjuu 


CACCAATGTT 


AAATTATCCA 


GAAGTAAGGC 


TGGCAACTGA 


TAACAAATCG 


ATTTGCCTCC 


G £ A 

y Jou 


ACCTGTTGGC 


ATAATGCCTA 


AGACATTTTC 


TTGTCGCAAT 


ACATGACGAA 


TAATCGTTTC 


9420 


TTGTCCTGGC 


CGAAAATCAT 


CATAGCCAAA 


TGTATCTTTT 


AATAATTCTT 


GTAATGCACT 


9480 


CATTTATTGC 


• GCTCCAATCT 


TTGTTCTACA 


AGTCATTTTA 


ACACGCTAAT 


CTAACAGGAA 


9540 


CAAGGTGTGT 


TAAAACAAGA 


CACAATCGTG 


AAACACCAGT 


TAACGTGAAA 


CATCAAAAAG 


. 9600 


AATGAAAAAT 


TGTGAAAGCT 


TGCTATCAAA 


AGCTTTCAAA 


AATAGCGGCA 


TTTGCTTGTT 


9660 
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GTGGATAACG 
AGTAGTTATC CACACATAAA AAAATAACCT TATCTTAAAA TACGCATTAA 9720
TTCGTTTTTG CCATCAAAAA AACCAACTAT TTTCGTAGTA AAACGAGCAT AGTTGGTTTA 9780
AATGTTTATT TGTCGAAAAA TTATTCAGCT TTTTTCTTTT TCTTGCTTGC TTTTTCGCGT 9840
TCGTTTTTGT TAAGAATTTG TTTTCTTAAA CGGATGCTTT CTGGTGTTAC TTCACAGTAT 9900
TCATCTTCAT TCAAGAATTC TAATGATTCT TCTAACGTTA ATTTTTTCGG TTTTTTAATG 9960
ACTGATGTTT GGTCTTTCGT TGCCGAACGA ACGTTAGTCA TTTGTTTAGC TTTTGTGATG 10020
TTCACAGTTA AATCGTTGTC GGGGTTGTTT TCCCCAACAA TCATTCCTTC GTAAACTTCT 10080
GTCGTTGGTT CAACGAAAAC TGTACCACGT TCTTCAATAC TCATGATTGA GTAAGTCGTT
GCTTTACCAG TATCGATTGA AACTAGTGCA CCTTGATGAC GGCCACCAAT CGTTCCTTGA
ATCATTGGTA AGTATTGATC GAAGGTGTGG TTCATAATAC CATAACCACG AGTCATTGAT
AAGAATTCTG TTGAATAACC AATTAAGCCA CGAGCTGGTG CTAAGAAGAT TAAACGCATT
TGTCCGTTAC CGGCATTAAT CATATCTTGC ATTTCGCCTT TACGTAAACT TAGTGATTCA
ATCACGCTAC CCATGTATTC TTCAGGTGTG TCAATTTGAA CACGTTCAAA TGGTTCACAT 10440
TTAACTCCAT CAATTTCACG TTCAATAACT TCTGGACGAG AAACTTGTAA TTCATAGCCT 10500
TCACGACGCA TGTTTTCAAT TAAAATTGAT AAATGCAATT CGCCACGACC TGATACAGTC
CAAGAATCTG GGCCAATTGG ATCAACACGT AAAGATACGT CTGTTTGTAA CTCAGCCATT
AAACGTTCTT CGATTTTACG AGCGGTGATG TATTTTCCTT CACGTCCCGC AAATGGAGAA
TTGTTAACTA AGAAAGTCAT TTGTAAGGTT GGCTCATCAA TGTGTAGAAT TGGTAATGCT 10740
TCTTGATTGT GAACATCTAC AACTGTTTCC CCAACGAAAA TGTCTTCCAT TCCAGAAACG
GCAATTAAAT CGCCCGCTTT TGCTTCATCA ATTTCCACAC GTTGTAAGCC AAAGAAACCT
AAAATTTTCG TTACACGGAA ATTTTTCACG CTGCCATCTA ATTTCATCAA CGCAACTTGG
TCGCCGACTT TCATTGTGCC ACGGAACACA CGGCCAATCC CAATACGTCC AACGTAATCG
TTGTAGTCTA GTAATGAGAC TTGGAATTGA AGTGGTTCGT CTGAATTGTC AACTGGAGCT
GGCACATGTT CAATAATTTT ATCAAAAATT GGGGCCATTG TTGGCTCTTG ATCTGCTGGA
TCATCTGATT CACTTGAAGT TCCGTTTAAA GCAGAAGCAT AAACAACTGG GAAATCTAAT
TGATCGTCGT CTGCACCTAA TTCGATGAAT AACTCTAAAA CTTCATCTAC TACGTGTTCA
GGACGAGCAG AAGGTTTGTC AATTTTGTTA ACAACCACGA TTGGTGTTAC TTTTTGTTCT
AATGCTTTTT TCAATACGAA ACGTGTTTGA GGCATTGTTC CTTCATACGC ATCGACAACT
AAAACAACAC CGTCTACCAT TTTCATGATA CGTTCTACTT CACCACCGAA GTCCGCGTGT
CCTGGTGTAT CTAGAATGTT GATACGTGTA CCGTTATAAT CTACGGCTGT ATTTTTTGCT
AAGATAGTAA TTCCACGTTC ACTTTCAAGT GCATTGGAAT CCATTGCACG TTCTTGTAAT
TGTGTGTGTC CATCTAAAGT GTCAGATTGT TTTAAAAGTT CATCTACTAA GGTTGTTTTA
CCATGGTCAA CGTGGGCGAT AATTGCCACG TTACGAATAT CATCTCTGTA TTTCAATTTA
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ATTGCTCCTT 
ATCTATATAC GCTTATGTTA CTTAAGTCCA TAAAmGCCGA CTGCTTATTA 11700
TmGCAAACTT TTTTCATTAA AAAAAGcTTT CGCCTGCAAA AATAATGAAA GTTTTAGAAA 11760
TTATTCGGCT TTTTCTGAAA ACAAGGACAT TTTTTtACAA ATTTtyCTCT TTTAAAAGAG 11820
GAAGAGCCTT ACCTTTCAGC AACGCTCTTC TTTGATTATT CCGATTCATT TAAATATCGG 11880
GTAAATACTT CATCATAGGT TTCAGGAGTT GCTGCAATAA AGTATTCACG ACCCGCAAAA 11940
GTTAATGGTT TTCCTGTAAT ACCAGAGTAT TTAAACCCAA ATTCTTCCAA AAGTACTAAG 12000
CCTGCTGCAT AATCCCAAGG ACTTAGATTA GATAAATAGC CATGATGGTT GCCTTTTAAC 12060
ATAGCAATAA TTTCCAAACC TGCACAACCG CTAATACGCA CGCCCATGCT TGCTTGTCCG 12120
ATTTCCGTTG CGTGATGAAT ATTTTTTCCA TGCATGTAAG CATTCATGCC CCATAATCCA 12180
TCTGCCAAGG CTTTCATAGT AGGCGCTTTT AATAATTGAT TATTCCGATA AACGCCAAGC 12240
CCTTTGCCGC CCCAATAAAG TTCTTCTCGC ATCACATCAT AGATAAAACC TAATTTACCA 12300
ATGCCGTCTT CATAAACAGC TAACATAATA CAAAAATTTT CTCTTTCCAT CACGAAATTC 12360
ATTGTGCCAT CAATCGGATC GATGATCCAC ACGCGACCAG CAAAACTTTT TAAAGTATTG 12420
TAGCCTTTTT CTTCCCCTAA AATTTGATCT TCGGGAAAAT TTGTTTGAAT TTTATTCATT 12480
AAAAATTCTT GGGTTTGTTC ATCCATATTT GTCACTAAAT CGGTGCGGCC ATTTTTCTGT 12540
TGAACGGTTA ACTCTGATTC CAAGTTCATT CGAATCACAT CGGCTGCTTC AAAAAGCCAA 12600
CTTTGAATTG TTTGAATAAA TTTTTGCGTT TCAGTCATCT AATCATCCCC CAGCTTTAAT 12660
TTACCGCTTC CCTGTTT i'TT AGCTTGTTGA AGCGTACGAT ATAACGAATA TCCTGAAGCA 12720
TTTTCAAATT CACGACCTAA GCGTTTTTCT TCGCCGATAC TTTTAACGAC CGTTTTAAAT 12780
TGTTGATAAG TTGTTAAAAA TTTGTCAACA GGCAAGCCTG TTTCGTTGGC TTGCTCAACT 12840
GCTGTCCACA TATTAGkGAC AATCACCATT TCCtCTGTCG TCCAATCTAA ATCTAATGGA
TATTGATAGT CTTTCATGTC TTCACCTCAT CTTCTCTAGT TTAGCATATT TTCATAAAAn
AACATCTTAT TTACGTCGAT TTTTTAATGC ATTAAAGAGA GGATATTCTG CTGTTCCTTT
GCGATACAAA CtTTGTTGTG AAGCATAAAT ACCGCTATTT CCTGAACACA TAAAACTGAT
TAAGCAAACC ATGAAAAAAT AGACCGCCGC TTCACTACCA AATAGTTCAA TTCCCATAAT
GAAACAAGCA ATCGGCGTAT TCGTGGCGCC AGAAAAAACA CCAATAAATC CTAAACCTGC
TAAAAAAGGA ATTGATAGAT GTAACAAAGG GGCCAAAGAA CTCCCTAACG TAGCCCCAAT
TTCAAACAAT GGCGTCACTT CTCCTCCTTG GAATCCGGCG CCCAAGGATA ACACCGTAAA
GAATAATTTC CCAACAAAAT CAAAAGGCTG CGCATTACCT GCAAAAGCAT CTTCTAGTAA
AGGTAAACTC AAGCCTAAAT AACGTTGGTT GTTTAAGACG AGCACAAACA GAACGACAAT
GGCAGCGCCA AGAAACGCAC GAAGGTAGAC ATTAGCAAAC CAATTTGCAT AGGTCTTTTT
TAAAAAGACA ATGGAACGAC TAAAAACCCA ACCTGCCAAA CCAAAACAAA TCGATGCCAA
AAATAATTTT ACAAATAGTT CAACCGACCA AGTCGGAATT TTGCCCATCG GATAATGCGT
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ATGTGTTACA CCAAAACTTT CAGTTACAAA A7TAGCAAAT AAGCCTGCAA AAAAACTTGG 
GAAAAGTGCC TCTGCACGAA CTTTTCCAAC CGCCAAAACT TCTAATCCGA AAACCGTTCC 
CGCTAATGGT GTGCCAAAAA CAGAACTAAA ACCTGCACTA ATTCCGCTAA TAATAATTAC 
TTGTCGTTCC AAAGCGGACA GTTTAAAGAC TTTTCCTACA GCATTTGCGA TTGCGCCCCC 
CATTTGCACC GCCGTCCCTT CACGACCAAC AGAACCACCA AATAAGTGCG TAGTAATCGT 
GCCAAACAAT GTTAAAGGAA TTAAGCGCAA GGGAATATCT TCTTCCCCGC CATTGCCTTG 
TTCAATCACT AAGTTATTTC CTCGGCTAGC ATTTTTCCCA AAACGCGTAT AGAAATACGT 
AAAAACGACA CCGCTAATGG GGAGGATAAA GAGTAACCAA GGATTTGCTA AACGAACATC 
TGTTACATAG GTGAGACTCT TTAAGAAAAA GGCGGATAAC GCCCCCATAA ATAAACCAAT 
AATTGCGGTA ATGAGCAACC ATTTTGCCAT ATAACCGCCG ATTGATAGTA GTTGCTTTGT 
TTCTTTTTTC ACTTTACATG CTCCTTTACA ATTTTCTTAT CGCCAATCCT AGCGTACAGA 
AAAAGCCTAC CAAAGGGTAC ACTTCTCCCG TTTGkaGGCG TCATTAATTT ATTCGTCCmA 
TAAATAGTTC GTGGTGAACA CCATCACCAT GTTGCTATTA AACTAATwAa ATwATACCaC 
GCAaCTCTGG AAAGTGCAAT AAAAAaTCaA TTGCGTTCCG TTTTtATCCA CGTGCTATTA 
CCTTTGATAA ACCGATAATA AGCCATCACC ACACTAACGA ATAAAATTAA GTACATGTAG 
GAAATAGTAA AAGGCAAAAT AATCAGCCCC AGTTTTGCAG GTACTTTTAG TTGAAAATTT 
TTCAACTCGC GATAATAGTT GATTGTAAAA AAAATGGCGA AAACTAACGA AATACTGACG 
GCCAAAACAA AACTAATGAC AAGCGATAAC GAAAAGCCTA ACCTATACAA GTGTTGCAGT 
TGAAGCCCAG TTAAGACAAA AACGAGCATC CCCGCTACGA GATTAATAAA GGGTTGCGCT 
AAAAAATAAT AGATTCCACA CTTCTGCCTT AGCGTTAATG ACTTTGATTT AATCACCCTG 
GGTAAATACG CTAGACAATC TAAATTTCCT TGAACCCAAC GACTTCGTTG ACGAATAAAA 
CGTTTCACAT CTCTCAGTGC TTCTTGAGCC ATAAAGGCCT CGTCAATATA AGCAATCGAT 
AACTCTTTCA GCATTAATTT AATGGTTAAC TCATAGTCAT CCAACAAAGC ATTTCCCCAA
GGCTGATAGC CTAAGTGTTC GGTAATCATT GATAACCGAA AAAACTGACC ATTGCCACCA 
AGTCCGATGG TTTTTGTTGC TCGACGCGCT TTTTGAATAA AATTATTAAT CGTAAAAAAC 
TCAACATCTT GTGCAATAAA CATAAAATTT GTGACATCTT TCATTTTGAT CCTTGTTTGC 
GCAscACAGA CTGTTGTATC TGTGAAAACA GCAGTTAGCT CATTGACAAT ATTTTCACTA
AAAAAACCAT CTGCATCTAC CACACCCACT AAAACTTGAT CCCAAGAATA GTGTCCTTTT 
TCTGCATCTG TTAATACAAT TTGGAAAGCG TGGTTCAAAG CGGCCCCTTT GCCTTGTTGC 
GCATTGGGAA ATTGACGAGA CAGAACTTGG ATTGGACCAG CAATACTGTC CATCATTGCT 
TTGGTTTGGT CCGTTGATCC ATCATCAATC GCAATTATTT TTTTCTTAAT TGGTAAAGCA
ACCAGTTTTT TTAATGTTTC TCCGATGACT TTCTCCTCGT TTACACACGG TACAAGAAAG 
TAGCAGAAAA ATTCTTCTGC TACTTCGACT TCTCTTCTTG TTTTTTTCTT GTTCGAAAAC 
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TGAGAAATAA ACTTGTAAAT AAATAGGTTA AGAAGATGAA CATAATTACT TTGTATAACT 
GATACAAATA AATCATGTAA TTTGACCTCT TTTAAAAAAA TAACGGTATA GAATGTAGGC
AGTGACAATC GAAACTAGAA TAACGCCAAT TAAATCAACC ACAAAAAAGG CGCCTGTTGA 
AATTACCCCA AGTTTCGGTA AAGCTGATtG GCTCATtAAA TACCCCTTTC TCTCTATTTG 
TTTTTACCAA CACACCACCA CGATGACCAT CACACACCCC CTATAATAAT ACTTTGGaGG 
ATTATAGTAC ACAGGTGGAT TATAGTACGT AAmTGGcGGA ATATAyCCTG GcCAATATGT 
TGGTGGTGGC GGATTATAAA CCGGTAwTTT TCGGTTTTCT CGAACAATTG cAATTGTTTG
ACCCGCATGA TTTTGCGTCA GACTGACGCG TGTTGGCGTA GTATTTCGTG AATAACCAGT 
CGGCGCTTGA ACTTCTCTAA TTTCATATAC CCGGTTGACA GATAGCCCAC TAATAGTGAC 
TGTACCATCC CAACCTGTTG TTACTCTTTG TGTATAGCCA TTAGTTGTAT CTTTCCATTC 
AAAAACAGCG CCCGCTAAAC GTGCTTTTGT TTCAACATCT TGTTTAATAA TTCTGATAGC 
ACCATAGTTT GCTTTATTGT TTAACACTAA CGTATACGTT TTTGTGCCTT GACCTCGTAA 
ATCAATCCGA TGGACCCTAC TATCCAACGC ATACCCAGTT GGCGCTTGTA TCTCGGTGAT
TTCATAAATA CGATCCGACG ATAATCCTGG AATCAGTACT TCACCATTTC CATTTGCTGT 
AAATGTTGCC GTATAGTTAT TGGTAGTATC GCGAAATCTG AATTTCGCTC CTGCTAAACG 
TTTTCTTGTA TCCGTATCTT GCTTAATAAT TTTGATGGCG CCTTTTTGTG CTTGATTATT 
TAAGACTAAT GTATAGGTTT TACCACCTTG TCCACGTAAA TCTACTTTAT GAACACGTCT 
ATCTATGATA TAACCATTCG GCGCTTTGAT TTCTGTGATC CCATAGATAC GATCACCGGC
TAATCCAGTA ATCAGTACTT CACCGTTCCT ATTCGCTGTA AACGTTGCAG TATAATTATT 
GGTTGTATCT CGGAATTTAA ATTGTGCGCC TGCTAAACGT TGTTTCGTAT ATCCATCTTG 
CTTAACAATT TTGATGGCAC CTTTTTGTGC TTGATTATTT AAGACTAATG TATAGGTTTT 
ACCACCTTGT CCACGTAAAT CTACTTTATG AACACGTCTA TCTATGATAT AGCCATTCGG
TGCTTTTATT TCTGTGACCT CATAGATACG ATCACCGGCT AATCCAGTAA TCAGTACTTC 
ACCGTTCCTA TTCGCTGTAA ACGTTGCAGT ATAATTATTG GTTGTAtCTC GGAATTTAAA 
TTGTGCGCCT GCTAAACGTT GTTTCGTATA TCCATCTTGC TTAACAATTT TGATGGCGCC 
TTTTTGAGCT TGATTATTTT TGGTGAACGT GTAGGTTTTG CCAGCATAAT TCGCAGGTAA 
AAGAATTTTA TGCACCGTTC TGTCAAGTAT ATAACCATTT GGTGCTTTTA TTTCTGTAAT 
TTCATAAATC CGGTTGACGG AAAAATTATA CAAAGTTGCT TCACCATTAG CATTAGCAGT 
TACTACGCCT GTATTCCCAT TCACTGTGTC ACGCCATCTA AATTGTGCGC CTGCTAAACG 
TTGTTTTGaG TAACCATCTT GTTTAATTAA TTTTAAATTG CCACGTAGCG CAACATTTTC 
AATGGTATAA TAACCAATCT TGTCAGCAAA ATTAGTCGTT AGGGTAACTC GGTAAGTCTT
GTTATTTAAG ATATAGCCGT TTGGTGCTTT AATTTCTGTA ATTTCGTATA CACGATTGAC 
TGGCAAATTC GTTATCAGGG CTTGTCCGTT TTTGTCAGCT ACAACTGTTC CTTCTTTTCC 



15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 

17340 

17400 

17460 

17520 

17580 



WO 98/50555 PCT/US98/08985



TGTCACCGTA TCACGCCATT TGAACTCTGC ACCTGCTAGA CGTTTTCTTG AATCTTTGTC 17640
TACTTTGATT ATTGCTAGAC TACCTTTTTG CGCTGTATTT TCAATGGTAT AATAACCAAT 17700
TTTATCwGCA AAATTAGTCG TTAAGATAAC TCGGTAAGTC TTGTTATTTA AGATATAGCC 17760
rTTTGrTGCT TTAATTTCyG TAATTTCGTA TACACGATTG ACTGGCAAAT ctGTTATCAG 17820
kGCTTGwCCr TTTTTGTCAG CTACGACTGT TCCTTCTTTT CCTGTCACCG TATCACGCCA 17880
TTTAAACTCC GCACCTGCTA GACGTTTTCT TGAATCTTTG TCTACTTTGA TTATTGCTAG 17940
ACTACCTTTT TGCGCTGTAT TTTCAATGAT TACTGTCACT ACTTTATTAG CTTGTGCAGT 18000
TGTTAGAGTG ACTTTATGCA CAGTTTTATC TAATACATAC CCCACTGGTG CTTTCGTTTC 18060
TGTGATTTCA TACGTACGAT TGACTGATAA GTTGGGAATT GTGATGGTTC CATCGGTTCC 18120
AACTGTCACT rTTCCTGTTT TTCCTGTCAC TGTATCyTTC CACTGAAATT CTGCGCCTGT 18180
TAAACGTTTT TTrCTkTCTT TGTCTTGTTT AACAATTTTG ATGGAGCCTT TTTGCGCTGT 18240
ATTTTCAATG ATTACTGTCA CTACTTTATT TGCCTGTGCA GTTGTTAAAG TGACTTTATG 18300
CACGGTTTTG TCTAGTACAT AACCTGTGGG TGCTTTTGTT TCTGTAAGTT CATACGTACG 18360
ATTAACTGCT AAATTGGGAA TTGTGACCGT TCCGTCGGTT CCAACTGTCA CTGTTCCTGT 18420
TTTTCCTGTC ACTGTATCCT TCCACTGAAA TTCTGCGCCT GTTAAACGTT TTTTGCTTTC 18480
TTTGTCTTGT TTAACAATTT TGATGGAGCC AATTTGTACT TGCTGCTTAT TCTTCACCCG 18540
ATATGTTAAA GTGTAGTCGG CTTTTCCAGC AAATGAAACA GGATGTTTAG TTTCGTCTAA 18600
TTCATAGCCA TTAGGCGCCT TCGTTTCAAC TAAATAATAT TCTCCATCAG GTGATAAATT 18660
TGAAAATAAC AGCTCGCCAT TTTCATTCGT GACACCTTTT TCTATTAGAT TATTTTTTkG 18720
GTCATACAAC GAAAATTCAG CACCTGcTAA ACGTTTCACC TCATTATTTT CATCTACTTT 18780
TTCyACTTTT AACTGGkTTn CTAAACGCGt ATTTcTTTTt tCAAGCGAAT CGTCTGkGTT 18840
TGTTCTAcTT AATTnCAAAT GGAaCTGGTG TTGCATCTAA CTTGTAACCA GTTGGTGCTT 18900
TTGTTTCAAT GAATGCATAC TGACCAGCTT CCAAACCAGC TAAACGAATT TGACCTGCTT 18960
GATTGGTGGT TAATGTTTCT TGAACAACTT GTTGTGTTTT TGTGTTCACG ACTTTGAAAA 19020
CAGCACCTTC TAAAACTTTA GCAGTTACTG CAcAATTTTT TCyAAAaCGA CACCCCyTCG 19080
ACCTGTTCCA TCACTACTCT CGTCTGCTGT GATGTAGGCA ATATCATCAT CTTCCCAaGC 19140
ATCGAATCCC TCACTAGAAA TACGCATATA CGTGTAATTT GATAGCACGG CATCACTTGG 19200
AATATTTTCA TTTGGAATCG TTTCTAAATA AATATAATAG GCATTTTTAT TGGAAAGATA 19260
ACTGCCCGTG GCTGTTGGTA TTCCTTCAAA TTTCTCTGCT TCATAATTAA ATCCAGATTC 19320
AGAGGCATTT TTTTGGAAAA ATTGACTGGC ATCATAGCTA TATTCGTCAT GTTCCAAATT 19380
CCGAATCATT TCACCATTTG TTTCTAAAAT AGGCTGCTTG GTGAGATATG CAGCAATACT 19440
GTCAATAGCT AATGTTTGGT TTGGACCTAC ATTATCTAAA AATTTAAAGG TTGTTTtTTC 19500
TGAACCTGAA AGACCTGCTT GATTATAGTT AAAATAAATG ACCCAAGCAA TTCTTCCCTC 19560
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TGAGGTAAAT CCTAACGGGA CTTTTCGAAC CAGTTCAGTT GGCGGAACTT TCTCCACGTA 19620 
TTGCCCGATA TTAATTTTGA CAATTTTTTG TGAATCCCCA AAAGTAAAAA CGACTTGATC 19680
TTCAAAATCT TCTGATAAAT TTTGATTATA TCGTAGCTTC AGTTGCAAAT CTGCTGAGTA 19740 
ATCTGTAAGC GTTTGTAAGG CTTTAATCGT CCAAGTACCA TCATCGTTTG GAATCACTTG 19800 
AAGTGCGTCT CCAATTTCAA AAGGTGTCAT TGAATAGGAA TATTTTTCGC TAGCCATCCT 19860 
TAATGTCATC GTCTCTCCGC TTGCTAATTC AGAGATTCCT AATGACAAAG AAACTGTCGC 19920 
CCAATCTTTT GTAGCAACAA TTTCCGTTTT TTCCCCCGGn yATTCAACAG CTATAGACCG 19980 
GATTTGTGTG CCACTTTCAT TATCCATTGC ATAAGAAATA ATACTTGTAT TAGATCCTAT 20040 
AATAGTAAAG ATAATTAAAC AAATCGAAAA TA 20072 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 688 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

TTGAnACAGA TGGAGTnnGC ACTGGCTATA CAGTAAGATA TGATCTTGGT GGAGAAAAAG 60 

AnAAAGACAT AACAGTCAAT TTACCCGGAG CAGaATTACC ACTTAGTTTA AAAGTAAAAG 120 

CACATTACAC AAATGGATTA CCACTTGAAG CGGGAAAAGA AGTACGCTTA AGAAATTTAA 180 

CAGATGGATC AACAGAAGTA mTTAAGAAAC ArGTAGACGC TAACGGAGAA GTTGTATTTA 240 

CTGAGCAAGA TGGTATAAAA AAGGAAGTAA ATTATGGTAT TGAAACAGAT GGAGTAAGAA 300 

ATGGCTATAC AGTAAGATAT GATCTTGGTG GAGAAAAAGA AAAAGACATA ACAGTCAATT 360 

TACCCGGAGC AGAATTACCA CTTAGTTTAA AAGTAAAAGC ACATTACACA AATGGATTAC 420

CACTTGAAGC GGGAAAAGAA GTACGCTTAA GAAATTTAAC AGATGGATCA ACAGAAGTAC 480

TTAAGAAACA AGTAGACGCT AACGGAGAAG TTGTATTTAC TGAGCAAGAT GGTATAAAAA 540

AGGAAGTAAA TTATGGTATT GAAACAGATG GAGTAAGAAA TGGCTATACA GTAAGATATG 600 

ATCTTGGTGG AGAAAAAGAA AAAGACATAA CAGTCAATTT ACCCGGAGCA GAATTACCAC 660 

TTAGTTTAAA AGTAAAAGCA CATTACAC 688 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
nGATAAGGTC CTATAAATAG nCCGGATATn TTTTTATTAT TAAGATCTAG TACTTTCCTC 60 
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TTACTATGAG 


TGTACTAGAT TCTTTTTTTG CCAACTTTAA AGAACTAGAC CAATTTTCGG 120
TTTTTAGATT TTTGTTGACA AGTACCTACT TTTAATTGTA TGATTGTTTA TGTTTCTAGG 180
GAATATATTT ACAAAGGAGG TATATAGACC AATATCGGAT AGCGCCAGAC CTGAACGTTC 240
AGGTGACGAG GAGAGAGCTT ATCGAAGATT CGGCGGGTGG CTCTAGGGAC TGCACTCTAC 300
AGATAACAAA GAAAAACTAA TTGTGAAGTT AGAACAAAGC GGTTATCACG CAGGTAGAAA 360
CATATAATGG GATTTGCTAG TTAAACAGGA CCTAGTAAAT ACAGAATTTC AATTGGAAAG 420
TAGGAATTAG CATTATGTGT GGAATTGTCG GAATGATTGG AAAAGAAAAT GTCACGGAAA 480
GTTTGATTCA AGGATTAACA AAATTGGAAT ATAGAGGTTA TGATTCTGCT GGGATTTGTG 540
TGAATGrTCm AAGCCAACAG CCCCATGTAG TTAArGCAGT TtGGGGgCAT TkCCsATTTa 600
GCAGCAAAAT TAACGCCAGA AATCGATgGA ACAGTTGGTA TTGGTnCATA CTCGTTGGGC 660
CACTCATGGC GAACCAACTG TCGCCAATGC ACATCCCCAT GTTTCAAGCG ATCAACGCTT 720
TGCATTAGTG CATAATGGTG TCATTGAAAA CTTTGAAGAA TTAAAAGAAC AATTTTTAAG 780
TGGTGCTCAC TTAATTGGTG ATACAGaTAC TGAAATCGTC GTTCAATTAA TTGcTCATTT 840
TGCTGAAAAT GGTTTTTCAA CAAAAGATGn CTTTTAAAGC GGCTTTAGCm GAwATTaAAG 900
GTTCATACGC TTTTGCATTA ATGGATAAAA TGGaCCCCAA TATTATCTAT GTAGCGAAAA 960
ATAAAAGCCC GTTACTTATC GGTCTAgGTG mAGGCtTaAT GkTTATtTGT AGTGaTGCGA 1020
TGGcGATGAT TAAAGAAcCA ATCAATTtGT TGAGATtGTT GATGGTGAGA TTGTAACTGT 1080
TACAGCGGAC AAAGTAGTAA TCGAAACGCC AGAAGGACAA GTAATCAACC GAGAACCTTT 1140
TGAAGCGCAA TTAGATTTAA ATGACATTGA AAAAGGAACA TATCCGTTTT ATATGCTCAA 1200
AGAAATTGAT GAACAACCTG CAGTGATGCG CAAAATTGTT CAAGCGTATA CTCAAGGTGA 1260
AAAAGTGACT TTAGATGAGG CGTTACTGGA AAAAATTAAT CAATGTAACC GAATTTACAT 1320
TATTGCTTGT GGGACAAGTT ATAATGCAGG TTGGGTCGGA AAATCTTTAA TTGAAACGAT 1380
TGCCAGTATT CCTGTGGAAG TTCATTTATC AAGCGAATTT GGCTACAACA TGCCTATTTT 1440
ATCTGAAAAA CCGTTCTTCA TTTTCTTAAG TCAAAGTGGT GAAACGGCAG ATAGTCGCCA 1500
AGTACTGGTA CAGGTTAATG AATTAGGGCA TCCATCACTT ACATTAACAA ATGTTGCTGG 1560
CTCAACATTA TCACGTGAAG CGGACGATAC CTTGTTATTA CACGCTGGAC CAGAAATTGC 1620
TGTCGCTTCC ACCAAAGCTT ATACTGCTCA AATCGCTGTC TTAGCTATTT TAGCAAAAGC 1680
AATTGGCGAA GAACGCCAAA CATTAGCAGC CGTCTCATTT GATGTGGCTC ACGAATTGAG 1740
CGTGGTAGCA GCTGTCATGG AAACCTTGAT TGATGAAAAA CAACAAATGA AAGCCTTGGT 1800
GGAAGATTAT CTGACGTATA CTCGAAATGC CTTCTATATC GGGCGAGGAG TCGACTATTA 1860
TGTGTCGATG GAAGCGGCCT TGAAGTTGAA AGAAATTTCC TACATTCAAG CAGAAGGCTT 1920
TGCGGCAGGT GAATTGAAGC ATGGAACGAT TGCCTTAATT GAAGAAGGAA CACCTGTGAT 1980
TGGGATTGTG ACAGATGCCA AAGTCGCTGC CCATACAAGA GGCAATTTAA AGGAAGTCGA 2040
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AAGTCGTAGA 


GCAAGAAATA TTGTGATTGC TTTGGAATCC TTGGCAAAAC CACAGGACCA 2100
ACTGATTATT CCAGATGTTC ATCCGTTACT TTCTGCCTTA GTTAGCATTG TTCCAGGTCA 2160
ATTATTAGCG TACTATGCAA CCTTACAAAG AGGCTACGAT GTTGATAAAC CAAGGAACTT 2220
AGCCAAAAGT GTAcTGTTGA ATAAATCAAG AACCTGAGAG AAAGATGTTC AACGGGTTTT 2280
CATGAGTTTT AATATCTGAG ACAAAAATCA CTTTAGATTT TTGTCTCAGA TATTTTTTAT 2340
AAAAGAAAGA TATTCAGTCA AAAATGAAAG ATTCCAAAGA AATTCTTTTT TTCCGTTGAA 2400
AACTGTTTGC ATTTTAGCAA AATTAGATAT AATTAATAAG AACCGAaaAA TAAGAATTAG 2460
CCAGTGAGGT AACTTTGTTT GAAAGAAAGC AAAAAGTAGG TTAAGCTGAA AAGTGCTAAT 2520
TTATTTAACA AAAAAAGATT GTGAGTTGTT TGATATGGAT CAGTTTAGTA CATTAACTGA 2580
ACAAGAATTT ACTACCTTTG CAATGACACA TCCTGCAGGT AACTTTTTAG AAACACCCGA 2640
AATGAAACAT TTATTGGAAC GACGTGGCTG GCATTGTGAA TACGTTGGCG TCAAACGAGA 2700
AGGCCAACTA ATTGCTGCGT GTATCCTAAG CAAGAAAAAA GTCAAAATTG GTTATGCTTT 2760
CGACATTGAC GGCGGAATTT TGATGGATTA TACAGATAAA AAATGTGTAG AAGCCTTTTT 2820
CACAGGCTTA AAAAAATATG TGAAAAAAAA TGATGGCCTT TATTTAACCT TTACTCCAAA 2880
TAAACAAATT TGTTTACGTG ATTTTAATGG CGGTGAAGTT GAAAAGGTCA ATCAAGAAAC 2940
ATTTGACTAT TTCACATCAA TTGGTTTTGA ACACCAAGGG TTTGATGTGC ACAATTTTGA 3000
TGGTGCGCCA CGTTGGCTGT TTGTCAAAGA TATGGCTGGA TTAACAGAAG AAGATCTTTG 3060
GAAATCTTAT GGAAAAGATG CTAAATATGA CATCAAGAAA ACATGGGAAT ATGGAGTAAC 3120
AATCCGAGAA CTTCgTTACG AAGAGCTACC TCTCTTTAAG AAATTAACGG AAGAAACAAG 3180
TGCACGGCGT AATTTtGAaG ATAaGGATTt AGCTTATTAC CAaGCTGTTT ATGAaGAGTT 3240
TGgCGAaCGA gCGAAtTTAT GGTGcGGACT AAcTTTGCTA CTTATTTaGA AAATCTTCAT 3300
GAGAAATTAC GCAAATTACA AGAGACGCTC AACGAAgTCA ATGAAGCGTT AATTGCCAAT 3360
CCTAAAAGCC GTAAAAAAAA TAACCAGAAA CGTGAATTTG AAGATGAAGT TCGTACCGTG 3420
CGCAAACGGA TTGACGAGGC TAAAGAAATG AAAACATCAG ATGAACCTGA AATTTTAGCA 3480
GGTGCCTTAT TCATTGTTCA TCCACAAGAA GTTGTTTACT TGTTTAGTGG TACGTATGAA 3540
AAATATAAAC AATACTATGC ACCGTATTTA ATTCAACATA AAATGTTAAC TTACACAGTA 3600
GAAAATAACA TTCCCAAATA TAATTTTTAT GGCGTTGACG GTATCTTTGA TGGAAGTGAT 3660
GGGGTTCTTA AATTCAAACA ATCCTTTGGC GGTCATGTCG AAGAATTAAT GGGCAATTTC 3720
CAATGGAAAG CCAAACCAAT GAAATATGCC TTATATCACG CATTAAAAAC AATTAAAGAA 3780
AAAGTCTAAA ATTAAAACCC AATCCTTAAA AAGGAATGGC AAAGTGAGGC GTTTAAATGG 3840
AAAATCAAGT CGAGGTTATG ACGTATGCAC ATTTAAAAGA AATTATGCAG GCGTTAGAAG 3900
CAAACGAAGC GATTACAGAA GATACAAAAG TCTTTATTGA TACAGGATGG GATAGCGTGC 3960
AAGAAGTAGC CCCAGATGCG GTCAGTATTG AAAAAGTTGC TAAATTTACA GTAGCAGATG 4020
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TTT7AACAAA C3AAAr7777 G77GGGTATA GITTAGAAGA AAA£GCGGAG AAGATGAACG 4080 

CTGAGGGCGA CTTAGAAACG GCAATTATTA TTCGTAATCT TTACTAAAAA AflTCCGTTTG 4140
AATAGTTAAA ATTGAGGTCG GGACAGAACC GTTTAGCTCC GAGCAACGCG TAsTGTTGGT 4200
TCTGTTCCCG CTGGTTATCA GGTTTTGAGC CTGGAGCAAA AATCCAAAGT GATTTTTGTC 4260
CCAGGCTCAT CTGCTTTAAA TAAGAAATAA AGGAGGTTAG ATATGAGTAC CATTCmTGrA 4320
TTyCCAAAAA A:TATGAjlCG TTTTATTGCA CAAGGAGAAG AAGCACTTGT AGAACATAAT 4380
CAGATCGCCG CTTTaGAAAA TTTTCAACAA GCCTATCAGC wACCAmCAAc GCCGCCAGTA 4440
AACCAAAAAA TTGTCCAATT ATTGTTGGAA ATGGGtGAaG CTGATGAAGC ACTAGCATTG 4500
GCAGAAGCGT TTCAAGAGTC TTATTTTGAG AACCTTGAAA CAGCGGCAAT TTATATGCAA 4560
ATTTATAGTC AAAGTCGTCG CTTTATTGAA GGGTATATTT TATTGAAGCA ATTACTTCAA 4620
ACTAAAAAAA TAACGCTAGC ACAACAAAAA ACGTTAGAAC AGCAATTAAT GCAAGTGGAA 4680
GAGGCGTATC AACAATTAGA AACGCAGCAA ATTCAAGCAA TTAAAAGGrA CTTGTTAGTA 4740
AGCGATCAAT TC-CCCGTTTA TCAACAATTA GCGAACATTA AAACCAGTTT ATATTTGCCT 4800
AAACCAGTAT TTGTGGAAGT CGCAAAAGAT TTAGTTATGA ACCAAGCACT TAGTTATTTT 4860
GCTCGGGAAT GGTTCATTGA AGAATTAGCA CTTTTGCAAT TTTCAGAACC ACTGACTTTT 4920
CTTTGGTATG ACAATCAT-CC TCAGACAGTC TTACTAGAAG GAAAAACTGG ACCGTTAAAT 4980
ACGCCGATCT ATTCCGAAAT TTGTACTGAA CTTAGGAATC GTTTAGAAAA TGATGATCCA 5040
ATTATGCTGC AC-CATTTGC-A AGAAGAAATT CGATTGCATC TTGCCTATTT GTATCCATTA 5100
GCAGAGACGG TTATTTCAGA TCCAACTATC TGGGTCTTGG GTTATTTGGC AACCTATTAT 5160
CCTGAATATA TTGAJ 1 JAAAGA- ATTGACTGAA GCAAAAGGCC ATCAAATCGA TGCTGTTCAA 5220
AAAGTTCAAC AGGCGATACG AACGGCATTT ACACAAATAA TGCTTTAAAA AATGGCACGT 5280
CCTAAAAGCG AGGACGTGCC ATTTTTTCGG TTTCAGAAAT TTGCTGAATC TGCTCTAATT 5340
GCTTTTCTTG ACTGATTATT GCGGATAATA GAAAAAAGCA AGACAGAAAT AAGTAAAAAG 5400
AATTCCTTGA ATCAAGCAAA TCACAACGAA CAATGGCGTT GCTCCTGGAA AAGAATCAGA 5460
AATCCGTGTA ATTA-AAAAT? CAACTAACCA TAAAATCATT GAAAATAAAA ATAATTTTTT 5520
CTGACGAGTT CTCATAC-TCC GCAACAACCT TTCTGGTGAT TTTCACTTCT CTATCATAAC 5580
ATAGAAAACG AGCTGTTTTT TCTCAAAAAT TTTGTTACTT CATAAGAAAA ATGGGAAATA 5640
AGCAAGTTTC AGGAAGTACT AGCTTTACCG AAAAGAATAC TTAATCCATC TTAATCAAAC 5700
CATACAGTCA CGTTTATTGC AAAAGAAGTG TTTTTGGATG GAACAAATTA TTTCATAAAC 5760
ACAAATATTT CTTTGTTGTT TGTATAAATG TCATAAAAGT GATTACTTAC TACTTTTTGA 5820
TTTTTAGACA TATCCGTATC GGTCAGTGGT GCCATCTCTA ATCCTAAATT CGATGAAGTA 5880
TTGAACCAAA CGTAATTAGG GAAGTATAGA CCGTCATTTG GTGGCACAAT TTTTGAAAGA 5940
ATTGGGTAGT TTCGACTTGT ATTAGCATAA ACAGTTGAAT TTTTGAAAAA CGTATTTAAA 6000
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TAAACTTTTT TTCGTTCGTT 
TGTTACAACA TTTTTTAAAT CATCTGTTAA AACAGCTGCT 6060
TGGCGATTGA ATGCTXCTTT TTGATAGTTT AATGTGGAAG AATACACCAA AACAAACGAT 6120
AAAATATAAT AAGATAACAA GCTGGCCACA AGTAACGAAG TTAGATAAAT TGCTTTCACT 6180
TGTCGATTGT TAAATAAACT TAACATGCTG ATTGTAACGA AAACAGCAAA TCCGTAAATA 6240
TAGCGAGGCG CATCACCTGC GATATTTCTT GAATAGACAA TAAAAATGCC AAAACTTAAG 6300
AGAGAAGCAA GTCCTAAATA CAAGATACAA TAAAGGAAAC TTAAGCCTAA GTTGATTTTG 6360
CTATTTGATA GATGACTAAT AAAAAATAAA ACAAGCAGTA GGAGCAGTAA TAAAAGCCAT 6420
AAACGATTAC TCTGGTTCAA TAGCTCTGAA AAATAATGGT AACTATTTTG ATAAGCCACA 6480
CTGGGAATAT CGTGCAAACT TGGTAAGGCC TGACTGCCCC CAATGTTGTT TGTTGCGGAA 6540
GGTACAAATT TTAACTCGAT AAGGTAGCTG ACTATAGAAA GTAGATAAGC GATAGCAGCA 6600
AGTGCGACTT TTTTGCAAAG AGCAATGAAA TTTTCTCCAG CAAGTAACTG CTTGAAAGTA 6660
AGGAATAAGA CGATAACTAT ATAAATTCCT GACGAGGCTT GGTAAGTATT AAACATAACA 6720
AACAGTGAAA AAACAGAAAC CAAGAAAAAC GTAAAAGAAT TACGCTGCCA CCAATAAAAA 6780
GGAAGAAACG AACAAAAAAT ACTCAATGCC ATATAGGGAC TATCAAACCT GAAACTAAGA 6840
CATTGGAGGA ACCAAGGATT GAGGCCAATT AAAGTAGCTA CTAAAGAGGA ACCCCAGCTA 6900
ATTTTTTGGT CATTGATGAT ATAAATTGCC aGTATGCTTG TGAGAGATAA GATAAGACCT 6960
GTTAGAATCG GTGTGGTCAA GCCCAGATCT GTCAAATGAC GACTTCCTTG AACAAACCAT 7020
GAGAGAAATT CACTGCCCCA ACGTGCGTAA GTCGTACCAA AATCTGTCAT TCCAGCGACT 7080
TGGCGAGCAT TATCATCTAA GTAAGGGAAA TTGATAATGC CGATGGATAA AACTGCCACT 7140
TGATAAATTA AAATGATCAA TAGAAGATAG GCTTTATTTT CAATACACCA ATTCTTTAAA 7200
TCAAATTTTT TTGATTCkGT AGGTAATTGT TTCATCATTG TACTCCACTC CAATAATTAT 7260
TTAmCaATTA AAATATmCCA TAACGAGTGG AATGAAGAAA TTGCTTTTGT TATTTTCTCA 7320
TCTTGACAGA ACACTAAGGG TAGGCAAAAC AAAAAAATCC CAACTCTTTA TTAAAGAGTT 7380
GGGATTTTAC CAAAAAGCCA CCTGCCAGCT TCATAGATAA TTATTCAAGA ATATTGCTGT 7440
TTAATTAAAA ACAATTCTAT ATGTTTTAAT ATATACATCA AGATAACAAT TTAGATATAC 7500
TTTGGGTAAT TTTTGGGTAT TTACAAAATA TCTTTGTATT TGTTTTCATA TTTTTCTTTG 7560
TyCTTTTTTC TATTTGCTTC AGTAGCTGAT GCGTATTCAT ATGACTGTAT TATCTCTTTA 7620
TGTCCTAGCT GTGTTTTGaT ATATATTGGA TCAGCTTCAG CTAGGaTTAA TAAGTCAGAA 7680
CGGGcATGTC TAAAACTATG AGATGAAATT CtTTCAATAT TTGcTAGTTC ACATGTTTCT 7740
TTTATATATT TATTCACAGA TGAATTTGAG ATGGGCCATG AATTTTTTTT ACTAACAAAA 7800
GCAAAATTTT GGGGATTACG TAAATCTCCT CTCAATATTT TTTTCTTTTG TTCCATTTCA 7860
TATTTTTTTA AATAATAGAA AAGAGAAGAA GGGACATCAA TATTTCGAAC ACCTGCTTCT 7920
GATTTTGTAG GGCCAATTCT ATACTGGCCA GCATCATACT TAtAGGCATT TTTTATCTTA 7980
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ATAACTTCGT 
TACTTTCATC AATGCTATCC CAAGTCAACG CAGpAACCTC TTGATAACGA 8040
GCTCCAGTAT ATAGTTGTAC TAAAATGAAG TATTTACTTG TGGXGTTTTC ACGAGATAAT 8100
AAAGCTTCTT TAAATTTAGT TAAATCATCA ATAGATATGG TTTTATTTGA TTTCTTTTCT 8160
GGTACTTGGC CAGTTAACTT TATATTTCTT GTTGGATTTG AATGAATATA TTGCATGTCC 8220
AACGCTTCTA AAAACATAGA TTTTACAATG TTATGTCTGT TGCTCACCGT CTCAACTGCT 8280
AGTCCTTTTT CATTGATATT GCCAGGTTTT GCAAGCCAGT TAATCCATTG CTGATATTTG 8340
ACTTTGGTGA CGCCTTTTAG CAATTGTTTG TCCCCAAAGA AATCATGTAA ATTCTTTCTT 8400
GTTAAAGCTA TTTTTTGCAT ACGGCCAGCA GACACTTTAT TCTTTTTAAA TGTATCAACC 8460
CAAATATCAT AAAAATCTAA AAGTTTAATT GCTTCAGTGT TTTCACCACG CATAATTTTT 8520
GATTTAAACT CTGTTTCAGC ATCTCTACAA GCCTTCTTAG TAGATCTAGT AATCATTTTA 8580
TTTTTCCATT CATTTGTAAT AGAGTCTTTA TATTTAAATC TTAACTGCCA TTTTCCATTA 8640
GCTAATTTCT TTGGTTCTGC CATAAATACC AACCTCTCGG ATTTTTTTAA TTTAAATACA 8700
TTTAAGAACG ATTTTTTATT GAATATGTTA TAAGTAAGTA TCAAGTTATC GTATATCATA 8760
AATACGAATA CGAATATATG TtCGTTTTTT GTGaTAAAAA GAAAAACCCG AAGGTTTCTC 8820
TCTTTGTTAA TAAAATAATA AGAAGATATT AATAATCAAT AATTTCTACA AGTCCGCTAT 8880
CTGCACTTTG TGGAGTTATG TGTATTTCCA CATTAGCTTC TCTTTTTGCT CCATATGCAT 8940
TTACTATAGT AGCACTTACT TTTTGATACC ATTTACCATC TTTTTGCGTC CATGGTTGGg 9000
ATGACGCCGA TAATTGAATG CATTTTGCTT CCTTTATAAG GATACATATC ATTAAATTGA 9060
TGCTTAGCTA AAGTATCTAA GACGGTTTGT TGTTCTAAGG TTGGCTCATC TGCTAGCTTA 9120
GTAATGTCAT TAGTATTTTT ATCTTCGCTC GGTAAAGTAT TAGTAGCTAC ATCAGACTTC 9180
GCTTCAGTAC TTACTTCTTT TTTTTCAGAT ACATCAGATT GTTTATTATC TTGGCTACTA 9240
CTTTTAGAAA CATCAGAGTT TACTGTTTTk TGTTTGCTAG TTTCAGTACT JcGATACTGGa 9300
TTTTTCaTAt ATTCTTGAAA ATCTTTTTCA TTTAAAAATC TAACAACAAA ATCTTTGTCG 9360
GCAGTTTCGA ATGTGAGAGT ATCTCCCTTT TTAAGAGATA CTTTTTTCTT GTTTCCCACT 9420
CCGTATAAGT CATGGTTACT ATTTTCTCCG CCTGATTTGA CGGTAATAAG AACTATGCTC 9480
GATTTATCAT CTAGTTGAGT TAATACAGCA TAGTAATCTC CAGGATCAAT ATCTTTTCCA 9540
aCTGAATATT CCCCTGACGC AACAAAGtCT TTTGAATTCG GTTGTTTAAC TGTGCTGCTA 9600
GATTGACTGG TTGACTTTTC AGTATCTTTA TTATCATTAT TTCCCGrACA AGCAGTAAGT 9660
AGTGAAATAC CACAAACACA AACCAATCCT AAACTAATTA ATTTTTTCAT CATAATTCCT 9720
CCTAAATAAT TTATTCAAAC CATTCATATA CCCCAATTGT ACCATCACAC CTAAAAATTA 9780
TTTGATAATT TTTATAGATC ATTCCATTTT CATATAATTT TGAGTAACTT TTGATTGCGT 9840
CTTGAAGATA TTCAGTAGTT ACTCCTAGGA ACTCTGCACA TTGCCAAACA TATATTAATC 9900
GCTCATGATA ACAATCTATA AAATCTTGTG GAGTTATTAA CAATTTACAT CCACAATCTC 9960






TAGCTCGTTG TTCTTGTTGT CTTTTTTCAT TACTATTTTG CTCTATAATA TCCCCAACAC 10020
TTGTTAAATG ATGACCAATT TCCTCTGCGA GAGTACCCAC CATTTCTTCG TTTGATTGAT 10080
TAGGGkTTAA ATATACTACA TTATTAATAT AGAGACCTTT CTGATTTcTG GGCATCATAG 10140
GTCTATTCTG TAGTTTAGTT CAGGATAATT AGCCATCAAG CTTTCCAATT TATn 10194
(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2053 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:
rTATTGGTTGG TACAGGTATT nGAATTGGTT nGCTCCCTTG GCATCAAAAG GTTTGCmAAT
tCGtTAAwTT mTyCAGATTG GTAATATCTG aTAACTCGkT GtAAGAAcAA TCaATAGAAA 120
aTAGATTTGT ATTTTgGCTG CTAACAtCTA GAACAGTTAA TTGATTGCTC aAACAAACTA 180
AAATGGCtAG ATCAACATTA TTAGTAACAT CTAAAACGGT TAATTGATTG TAAGAACAGT
CTAAAAGATG TAACCTAGTA TTTTGGCTGA CATCTAAAGT AGTTAATTGA TTGCTAGAAC 300
AGTCTAAATT CCTTAAATAG GTGTTCTGAG TAATATCCAA GTCTGTTAGA CGATTTCCAC
TACAAGATAG AGTCTCAAGC ATTCTATTCT GACTAAAATC CAGATTGGTC AAATTATTAA
ATTGACAGCT TAAATCAGTT AATTGTTCTG CATACTTGAT ACCTTTCAAG CTATCTAAAT
TTTTATTAGT GACAGTAATT TTTGTTAAAC TTGACATATC TGCAATAGTC AATTGATCCT 540
TATCTTTCCC AAAGCCTGAC CAGTTATTTC CTTGGCTTTG TACAGTAGAA TCAGTCCTTA
AAAGTGTTGC GAACCTAGCA TCGATATCTG TTCCTTCAAT TAAATTGGTT GTTTTAGGAT
CTAATTCTTT ATTTTttTTG TCCAGTTTTG CCGATTCGTT TTCTTGAGTA GGAGTAGTCT
TTGTAGATAA AGTTGAATTT AATGTGCTCG TTGTTGTTAA TTCCATTGAG TTTTTTGTTC
CTTGTGATTG ATCCGTATTT GCATCTATAG AGGAATCTGT GCCCGAAAGT AAACTTGAAG
ATGATGTTGA TTCAACTTTT TTGGATGAAG TAGTTTTTGC CGATACCATC ACACCCATTA
TTCCTTGTGA AAGGACTAAA CCTAAACACA AAAACAATCT AAAGGATTTA CTCATAGTTA
TTCTCCTTTT CTACATAATT GTCTAAATAG AAACAGCCAA AAACTACTTA TTAAAAACAA
TATGCCAAAC CAGCTTAGCC AAGTTATTCG ATTGTCACCA GTTTTAGGTA ATCTTTTTAT 1080
TTGCGTTCTG CTACGAACGA TTGATAGCGA TTGAAtGGTT GTTTGCGGCA ATACATtCAC 1140
AACTGGATTT TTTTTTGATG GTATGTCTGA AGTTTTTGTA AACCCGATTC CGACTTCTGT 1200
GGTCTCCCTA TTTTCTGCAT AAGTTGTTTG AGGAATATAA ATTAATAGAA CCAaCAaTAA 1260
TAAAAaCkGA AGATTTCyTC yTCTCATTAT GTCACTTCCC CTTCGTTTTT CCACTTCTCT 1320
CCACGACGAA CTAAGTAAGC AATCAATCCA GCAAGTGCTA CTGTCAAAGA TAATGCTACC 1380
CATGGTACCC ATTGTGGGAT CACTACTTGA TACGCTGCCG ATTGATTCAC TTTTTTCGCT 1440






ACCTGTTGCT 
AATGTTAAAA 
TAAAATTTTC 
CTTTGGTTGA 
GCTGGCTTTC 
CATAATCGCA 
CGTACTATTC 
AGGTGGCGTG 
CTTAACGGCC 
GAACTTAnGG 

TGCGCTTCTC CTACTTTAAA TTCTTTTTCC CATTTCCAAG TTCGACCATC 1500
TTTCCAGTAA AGTGATAAGT TCCTGGTTTA AAGGTATTCA ATCCAATAGG 1560
TCGAAATTCG AATTTGGTGC TACAGAGAAT TTCTCCATCT TATGTTTATA 1620
TTTTCCCCTT TTAGACGTAT CTCCCCTTGA ATCGTTAGTT CTTGAAGAAC 1680
TCGTTTTGAA TACGTGCTGC AATCACTTTT TGACCATTCG CTAATTGTAA 1740
TTTAAATGAA GCATCGCTCC CTGATTAAAG GGTTCTTCAT CCTCCGTAAG 1800
ATCACGCGCG CATATTGAGG AGTTAAGCCA GATTTCTTTT GTTTCACTTC 1860
TTTCGAACAA AGCGCAGAGA GCCCAGCTTT ACTCCGGAGA AAGCCGATTC 1920
GATCTCAAAT GTnATCGGTT TCTCTTCTTT CCCCTTGGAG GGGGAATTTC 1980
ACCTTTnATT TTTTACTAAC TCACGGAATG GGGGCTTTCA AACGGGCATC 2040
GGGAACATTG GnA 2053

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4178 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GCAGTCGAGC CTGTTGGTCC ATAAAAATCG GTGTATGAAA GCAATTCTTT TTAGATATTT 60
TTGACTTCCC AATTTGATAT ACTGCGGAAA GTATCTGAGA ATTTATTTAA CACATTTTTT 120
ATTTACTAnA GAAAGTAAAG ATCaGAGCTG TGCaAATAGA AATGATTAAA AACATCCCaA 180
TATCATCATC TAATGTTTCA AAAACTTCAA CAGGTTTCAT AAAATGTTTA ATTATTTTTT 240
TCATTGtATT CaCCTCAAAA AGAGAATAGA ATGAAATATT CTACTTATCC ATTGATATAT 300
CTAACGATTG ACTTACAAAA TTGAAATAAA AGTCTAAAAG TCAGTTGGTG TGATTATTAT 360
TATAATACAA AAAAGACCTC CATTCTTTTA AGAATAAGGT CTTTTTTTTC GTAGTCGCCT 420
TCTAGGCAAA CTAATTCAGT TAGCCGTATT CTAAACTCAA TAAAATGTCA AATACTCATA 480
GGTGTATTTT TATCACAAAA TTTACCACAG ATTTGCGGTA TTATTTACAT ATCTTAGATT 540
ATTAAAAATG ACAAAACAGC TATTTAATGC GTATTGTATG ATTTTTTTGT TCCTTAAATT 600
GTTATGGAAT TTCAGATGGG GAGTGGCAAA TATACGAAGA AAATTAACTA TTTATAGCGG 660
TTTTATTTTA TTAGAGAGTC TCGTTAGTTT TAGTATTGTA TGTGTGATAG CAGGGATATT 720
CTCCCTGACT ATTACACAAT TAATTCAGCA AAATTATCAA CGGGAGCAAG AATTAACGCG 780
CACTCGTCTA GGATACGAAG CAATTTTATT TTTAGAGCAG ACGGGTGATT TACGTTTCAA 840
AGAGCGGATA TATCAAGGAG AAACTTACCG ATTTTCTTTG ATGGAGAACG ATGGAAGAAG 900
GATACTAAAA GTCACAGATT CAAGAGGAGC GATTTTGATT GGTCAATAAA AAATATGCGG 960
GTTTTACGAT GTTAGAGTGT TTAGTAGCAC TAGTCGTTTT AAGTTGTATG TGTCAATTGT 1020
TTCAGTTGAT GATTCAACAA AGTTTTATCG GAAATCAATA TTTGAAAAAC aATGATTCGA 1080
AATCCTGGCa TATTTTTTTA ATTCAATTAG AAAAAGAATG TCAAAAGCTC GTTTTCCAAA 1140
CAGGCTCTGC TCcAAGAGAT TTCTTTCCTC GACTCAGAAA CAAACAAGAC TATTTCAATT 1200
CAAATCaAAG AAGATAAAAT TATTAAGCGA GTCAATGGTA AAGGCTATCA ACCGCTGTTA 1260
ATTGGTATCm AAAATGGACA GTTTAAAAAT GAAGGACAAT CGTTTACTTT GGAAGTTACT 1320
TTTACTTCTG GAAAAACATT TGATAGTTTT TTTCCGATAA AGGGAGAATT TTAATGAGGC 1380
AAAAATATTC AGGAAACTTA TTGTTCACGG CCATGGCCAT TGTTTATTTG ATGAGTTTTC 1440
TCGCCCTTCA GTTACTAGAA GAACGTCAGT TAACACAAAA ATTTACGCAA GCTACCCAGG 1500
AATACTATGC AGGGAAAAGT ATCTTTCATT TATTTCTTGC AGATGTTAAA CAAAATAGAC 1560
GAAAGTTAAA AACAGAAGAA AGGCTCGTAT ACGCGCAAGT GACCCTCGAT TATACATACA 1620
AAAATGAACA ATTAAGAATA ACTGTTTTAT TAAACAAATC TGGTCGAAAA TACCAATATC 1680
AAGAGAGAGT TTCTCATCAA AAAAAAGCGG AAACAATACT GGAATAGCGT TGTTTTTCTA 1740
CAAATTATTA GATTTTAGAA GAAAGACAGT AAAGGCATTG ATTATTCTTT CTTTTTTcGT 1800
TAAAATAAAA CAGGTATGGA AGTTGTTACA TTTTGAATAG AATAGAGGTG CTAATTTGTT 1860
CCCAGAAAAA aTTGAAGAGG CTTTCGAGCC AATGGAACAA GCAATCCAGT TATTGCAACA 1920
GTCATTGGAT ACTTCTTTTT TAGATGCGTA TATTGAAAAT GGTGAAAATA TTTTAGATGA 1980
TTTTCAAGTA CGAGTACTTG ACGGAGTACC TAATCCAGAG ACAGTGAAAC AATTAGAAAC 2040
GCTGTATCAT ACGATTAAAA AAATTGATTT AGCTCCTGAA GATGTGCGAC GCTTGTCCCA 2100
ATTACTTTTG CTAAAAGGAA CGAGAAAAGA GCAGCTTCAA GCAAATCATC AATTGACGCC 2160
AGACGGTATT GGCTTTTTGT TTGTTTATTT AGTAGAACAA TTAACAAATA AATCAGAGCC 2220
ATTAAAAATT TTAGATCCTG CTTCAGGAAT GGGAAATTTA CTGTTAACGG TTCTGCTGAA 2280
CCTTGAAACA GCTGGTTACA AAGTTTCTGG TTATGGTGTG GATATCGATG AAACACTTTT 2340
GGCAGTATCT TCTGTGAACA ATGCTTGGTC ACAAGCCAAT ATTCAGTTGT TCCATCAAGA 2400
CGGATTGCAA GATTTGTTAT TGGATCCAGT TGATTTAGCA TTAAGTGACT TGCCGATTGG 2460
CTATTATCCA AATGATGAGC GAGCAAAGGG ATTTGCTGCT GCTGCGGAGG AAGGCCATAG 2520
TTATGCGCAT CATTTATTAA TGGAACAAGC CATGAAATAC GTGAAGCCAG CAGGGTTTGG 2580
CTTATTCTTA ATCCCGACCA ATATTTTAGA GACTGAGCAA AGTGAATTCT TTAAAAATTG 2640
ATTGACAAAA AATGTCTATT TACAAGGAAT GATACAGTTA CCTGATGAAT TGTTTAAGTC 2700
AGAACAATCA CGTAAAAGTA TCTTGCTTGT TCAAAACAAG GGTGCAGATG CTGAGCAAGT 2760
GAAGGAAGTT TTGTTAGCAA AACTTGCCTC TTTGAAAGAT ATCAATAAAG TAACGGAGTT 2820
CTTTAAACAA TTTGAAGCTT GGAAAGCTTC AAATTTAAAA TAAAATAGTA ACGAAGAGGA 2880
GAAAATTATG TCTAAAACAA TTGCAATTAA TGCAGGTAGT TCAAGTTTAA AATGGCAATT 2940
ATACCAAATG CCAAATGAAG AAGTGATCGC TAAAGGAATC GTTGAACGTA TTGGCTTAAA 3000
AGATTCTATC TTCACAATTA AATATGGTGA AGGACAAAAA TATGAAGTAA TTGTTGATAT 3060
TGATAACCAT GAAGTGGCAG TAAAAATGTT ACTAGACCAA TTAATCGACT TAAACATTTT 3120
AGGTTCTTAT GATGAAATTA CTGGTGTTGG TCACCGTGTA GTTGCTGGTG GAGAAGAATT 3180
TAAAGATTCT GTTGTTATTA CAGACGAAGT ATTAGAAAAA ATTGAAGCTT TATCTGAATT 3240
AGCACCTTTA CACAACCCAG CGAATGCGAT GGGGATTAAA GCATTTAAAC ATATCTTACC 3300
AGAAATCATT AGCGTGGCAG TCTTTGATAC GTCGTTCCAC ACAACAATGC CAGAACACAA 3360
CTATTTATAT AGTGTACCGC GTGAATACTA TGAAAAATTT GCTGCACGTA AATATGGGGC 3420
ACATGGAACA AGTCATCGTT ATGTCTCTCA ACGTGCTGCA GAAATGTTAG GTCGTCCAAT 3480
TGAAGACTTG AAAATTATCA CTTGTCACTT AGGAAATGGT GCTTCAATCA CTGCAGTTGA 3540
TGGTGGTAAA TCAGTCGATA CATCAATGGG CTTCACTCCT CTTGCTGGCG TTACAATGGG 3600
GACTCGTTCA GGGGATGTTG ATGCGTCACT ATTACCTTAT TTAATGACAA AACTTGGCTT 3660
GACTGATGTG CAAGACATGG TTGATATTTT AAACAAAAAA TCTGGTTTAT TAGGTTTAAC 3720
AAATGGTTTA TCAAGCGACA TGCGCGATAT TCAATCAAAT CTAGATAAAC CTGAAGTTCA 3780
AACAGCATAC AACATTTTTA TCGATCGTAT TCGTAAATAC ATTGGTAGCT ATGTCACTGT 3840
TATGAATGGC GTTGATGCGA TTGTCTTTAC TGCTGGAATC GGTGAAAATG CAGTTGGCGT 3900
TCGTAAAGAT ATTATTGATG GCATGACTTG GTTTGGTTGT GAAATCGATG ATGACAAAAA 3960
TAATGTACAC GGTGAAGAAG CAGTTATCTC AACTGATGAT TCAAAAATCA AGCTTCTATT 4020
AGTACCAACT GATGAAGAGT TAATGATTGC CCGTGACGTA GAACGTCTAA AATAATTAAA 4080
CGAACACACC TTGTGAGATG CTCTCACAAG GTnTCTTAAT AGGTAAATAA GAAGAAACTG 4140
AGCCAACTAT CAAAAGTGAT AGTTGGCTCA nTTnCCTT 4178



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

CAAAATCAGC CACCTATGGC CAGTCTATCC TGCCAAAGCG TTTTTTATTC TTCATCGGTT 60 

GCTTTTTCTw GTTCACGATA kTTtcTCaTT TyTTTGCTGA ATAATTCACC GTTACTAGGG 120 

GGCGTATATG TAATAGCATT CGCTCCGGCG ATAATGGTTT CATGAATACT CGATTCAGTT 180 

GGTCCTCCTG TTGCAATAAT CGGTAATTCT GGATACTTGC CACGGAAATA GCGTACGGTG 240 

GCTGCAGTGT CTTTTCCTGC ACTGATATTA ATCATATCCA CGCCAGCCGC AAGTTTTTCa 300 

TCAATTGGTG TATGaATAGA AGTTAcTGTG CTGATAATAG GAATATCGAT TGCTTCATCC 360 

ACCATTTTAA TTGTTTCAAT TGGTGTAGGA CCATTCAAAA CGACACCCAT TGAGCCTTGT 420

GCTTCTGCAA ATAAACTCAT ATAAGCAGAA CGCATGCCTT GCGTTAAACC GCCGCCGACT 480

CCTGAAAAGA CAGGAATATC CGCTGCTTCA ATAATGCCTT TTGTAATCGC TGGATGCGGT 540

GTAAAAGGAT ATACAGCAAT CACTGCCTGT GCATCCGTAT TGCGGATAAT CGCAATATCT 600

GTCGTAAAAA TAATTGATTT TATTTTACGG CCAAAAATTT TAATGCCACT GGCTTGCCGA 660

ATAATCTCTG GCATTTTTAC AATATCTTGT TTCAGCGGCG TCATAATTTC TGGAATCCAC 720

TTTTCGCTCA TTGTCATCAT TCCTCCATGC TTAACGGTCG TATTTATTTA AATTGTATTC 780

TTCAATTACA TGAATGATCT TATCAGCGTA TGTAGGATCC GTTGCGTATC CAGCATCTTG 840

AAGCGCTTGT GCAGCCTCTT TATAATTTTT GGCTAACAAG ACTTTTTCAT ATAAATTTGG 900

ATCCCAATCG ACCCCATCAA CAAATAACTT GGTATGATCT CGCATGGATT CTTCCCAATT 960

GTCATAAftCT CGAAAATCAC CCTGAATCGT AATCCAGACC TCATTGACAT ACTCTTTTGT 1020

TTCTAAATTG AsTTTTGGTT GATTCCCATA TGCTTTAATC CCAAAAAGAT TGTTATATTT 1080

CGATGCTAAT TGACTkTGTC CAAAATTTGA TTCTAAAACT GCTTGCCCCA TAATAATACT 1140

TGGTAATACC CCGTAAGATT CTTGTAATTC ACGTGCATaC GGTGTGaTTT CAGCAAcAAA 1200

TTCTTCTTTT GAGATGTTTT CATTTTGGCG ATTGACTGAC TCGCTTTGTC CCTTATCTGA 1260

ATCaGAAAGG ACAGATAACG AAAAGACAAA GGCCAAACCG ACGACAACTA CGCCTAAGAA 1320

TAACTTTGGC AAATTCAGCT GCTTTTTTCT TTTTTTCTGA TAaGTTGgCT CcACTTCATT 1380

AATTCCCCaC TTCaTCTTTT aTTTTCCgAt AAAATGTTTc aCTAtCTTTc ACTTTcgtGA 1440

tTGatTTCaT TTGtTaCTAt aACCGAATAA TTGTATCATT TTCACAGAAA ATTATCAACT 1500

TTCATCACTG CTTACGCTCG TGATTTCAGT ACTTGATTCA GTGGTAGCAG TTGAAGAAAC 1560

TGTACTACTT TCAGAAATTT CAGTGGATGA TTTCACTGTT GTTTCAGTTG ATGTGTGAAC 1620

AGATTCTTCA ACTGGCTTTT TGCCAAAAAA GACAAAAATT ACCGCACCAA ACATAATAAT 1680

CACAGATAAA ATCCCAGCAA TTGTGTATTT CAAAAGACCA TCCTTCCTAT TCGTTATTCA 1740

CCTACGTAAG TATCCATCAG ATTTTCATTG ACCCAATCTA ATAATTCAAC TTTTGAACCT 1800

TCCAATTGTC GTTGAATCGT AGCTGGTAAC CGGAATGTAA TAATATCATC AAAACCCCAT 1860

TCACCATAGG CCAGCGTCAC TTTCAAATCA ATATAACCAG CAGCAATTTC TTGTTCTTGT 1920

TTATTGTTAG GAATTAATTC TACAACGCCC AAATCAGATT GTAGCCATTT TTTGGCAATT 1980

GTTGTTTTTA ACCGTTTATA TTCCGAAACA GCTAGGTTAC GTTTTTCCGG AAATAAGATG 2040

f^TATTTTTTP. clnl 11 lllo aptpappatp PAPTPATAAT PPCTAAACAA ATTCTTAATT 2100

TTTTGCATGA CTTCATTGTC CAAACCAAAT TGGCCTTCTT TCCAAGCTTG CCAATCGGCT 2160

GCTTGAATTT GATAAGCGGA GTAAAAATTT TCTTCTGTTT GATACTGTTC AATAATTGCG 2220

TCAACAATAA TATCTACATA AACTTGaTGC ATAATCATTC GCCTCCATTT TCTATTTTAC 2280

TGAAATTCTC TAAAAAAAGC ATCTTTTTAT CTCTGGaTTT TATCATATTT ACAAAATTGT 2340

CCTTGTTTTC TAATTTCTCT TGTTTACAGT ATAATAAATA ATGAAGAAGA TTTGAAGAAA 2400

TTGCATAGAT TAGGAGGACT TCTTTTGACA AAAAAATTAT ATCAATGGGG CATCGTAGGA 2460

TTAGGTGGAA TTGCCcACGA ATTTGCTTCA ACGTTTAAAC AAGAAACTAs CCAaCTTGCA 2520 

GCTGTTGCTT CACGGACATT GTCTAAAGCA GAACAATTTG CCGCTGATTA TTCAATCCCT 2580 

AAAGCCTATG GTTCTTATGA AgAAtTGCTA gCaGATGaAA CAATCGATGT TGTCTATATA 2640 

GCAGTGCCAA ATAAACAACA TGCTGAACAC ATTTTAAAAG CGCTCCAGGC CGGAAAACAT 2700 

GTGTTGTGTG AAAAAGCTAT TACGATGAAC GCGGCAGAAT TAAAACAAGC CTTGGCAGTT 2760

GCAGAAAAAA ATGACCTTAT TTTAGCAGAA GCCATGACTA TTTTTAACAT GCCACTATAC 2820

GACGAATTAC GTCGCTTGAT GGATGCTGGC AAATTTGGTC AGTTAAAAAT GATTCAAGCA 2880 

CCATTTGGAA GTTACAAAGA ACCCGATCCT AAAAATCGCT TTTTTAATCC TGACTTAGCA 2940 

GGCGGCGCAT TGTTAGATAT TGGTACTTAT GCGGTTTCGT TTGCTCGTTG GTTTCTCAGC 3000

TCACAGCCAG ACGTTTTAGC TTCTAGCATG GTTCCCTTCT CTACTGGCGT TGATGAACAA 3060 

TCGGTGACAT TGCTTCGGAA TAAAGAAAAT GAGTTGGCTA CTATTTCATT AACATTCCAA 3120 

GCAAAAATGC CAAAGCAAGG AATTGTTGCT TTTGAAGATG CCTATATTAG CATTACAGAT 3180

TATCCACGCG CAGATGAAGC GATTGTTCAC TACAACGACG GTACGGTTGA GACCATCGTT 3240

TCTGGGTACT CAGCTGAAGC CATGAATTAC GAAATCTACA ATATGGTTGA AGCAATCAAA 3300 

GGTACACAAC CCAATCGCTC CTTATTTTTC ACGACAGAAG TCATTGATAT TTTGGACCAG 3360 

ATGCAAAAAT TATGGCAGAA TTAAAAAAAA GAACTTCGGC TTTTGCTGAA GTTCTTTTTT 3420 

TTATTTGTTC TCTATGGATA AATACGGTTT AACATACGTG GGAATGGAAT CGCTTCACGA 3480

ATGTGTTCAT TACCAGTAAT GAACGTAACA GCACGTTCTA AACCTAATCC GAAGCCAGAA 3540

TGTGGCACAG AACCGTATTT ACGTAAATCT AAATACCAGC TGTACTCTTC TTCTGATAAG 3600 

CCAAACTCAG CTACTTTTTC TTTCAAATAA TCATAATCAG TTGCACGCTC TGAACCACCG 3660 

ATAATTTCAC CATAACCTTC TGGTGCAATT AAATCGGCAC AAATTACAAC ATCTTCACGT 3720 

GTCGGATGTG GTTTCATATA GAAGGCTTTA ATTGCTTTTG GATAATTTAA AATGAATGTT 3780 
GGAACGCCAT AGTAATTTGA AATAAAGGTT TCATGTGGGC TACCAAAGTC TTCGCCCCAT 3840

TCAATGTGAT CATAATCATT GTTTGCTTCT TCTTTTTGTA ATAACTCAAT CGCATCATCG 3900 

TAAGTAATAC GTTTGAATGG CTCCGAAACA TATTTTTCCA GTAACGCTGT ATCACGTTCT 3960 

AAAACATCCA ATGGATATTG TTGATTTTTC AATACCGATT TAATTAAATG TTTTACGTAG 4020 

GCTTCTTGAA TATCTAGTGA TTCTTCATGA GTTGTGTATG CCATTTCTGG TTCAATCATC 4080 

CAGAATTCCG TTAAGTGACG ACGAGTTTTG GATTTTTCCG CACGGAAAGT TGGTCCAAAA 4140 

GTAAAGACTT TTCCAAATGC CATCGCACCA GCTTCTGCAT ATAATTGACC TGTTTGTGAT 4200

AAGAATGCTG GTTGTCCGAA GTAATCTGTT TCAAATAATT CTGTTGTGCC TTCTGGTGCT 4260

GAGCCTGTTA AAATTGGGCT ATCAATTTTA ATGAATCCTT GTTCGTTAAA GAACTCGTAA 4320 

GTAGCACGAA TAATTTCGTT ACGAACTAAC ATGATCGCAT GTTGTTTGCT TGAACGTAAC 4380

CATAAATGAC GATGATCCAT TAAGAAATCT GTGCCGTGTT CTTTCGGCGT AATTGGGTAA 4440

TCTTCTGAAG CGCCAACGAC TTCTAAATCA GTGATATCCA TTTCATAGCC GAATTTAGAA 4500

CGGCTATCTT CTTTGATAAC ACCAGTAACC ATGACTGCTG TTTCTTGkGA TAAATGTTTA 4560

ATTTCTTGGA ATTTTTCTGT ACCAGCTTCT TCACCAAAGG CTTCAATGAA ATTTGGTTTG 4620

AACACGACAC CTTGGAAAAA GGCTGTCCCG TCACGTAATT GTAAAAAGGC AATTTTTCCG 4680

CTTGAGCGTT TGTTAGCAAT CCAAGCGCCA ATTTTTACTG TTTCTCCAAC ATGTTTGTTT 4740

GAATCAATAA TTTGAATTTG TTCCACGTTT GTCTCTCCGT TCATTTcTTA GTTTATTCTT 4800

TTAGGCGTTG TGCTGAATTC GTTTCTTTTC TATAAAGTCA TTCATTCGCT CAATCGCTTT 4860

TTCTAACGTC ATTAAATCTG TCGCATAGCT CATCCGCACA TGTTCTGGTG CACCAAAGCC 4920

CTCTCCCGTT ACCAGAGCAA CATGCGCTTC TTGTAATAAA TCTTCCACCC AGTTAGTCAC 4980

GTTATCATAC CCACAAATTT CTAACGTTTT TTTGACATTC GGGAATAGAT AAAAGGCGCC 5040 

TTCAGGTTTT ATTAGCGAAA CGCCTGGCAA AGCTGCTACT TTTGGATAGA TATGATTTAG 5100 

GCGTTTTTCA AAGGCCTGaC GCATGTCTTC AACTGTCGAC TGTTCACCTG TTAAAGCCTC 5160 

AATTGCCGCA TATTGGCTAA CTGCGACTGG ATTACTGGTG GATTGTGAAG CGAGTTGTGT 5220 

CATTGCTTGA ATAATTTTTT CATCCCCGAC TGCAAAACCA ATGCGCCAGC CTGTCATTGC 5280 

ATATGtTTTG GAAACACCAT TAATAATGAT TGTCTGTTGA CGAATtGCTT CTGAGATGGT 5340 

TGCAATTGGT GTGAACCGAT GTCCATTGTA CACCAAACGA CCATAAATAT CATCTGCAAT 5400 

AATGAGGATA TTATGAGCAA CTGCCCATTC ACCAATTTGG CGTAATTCTT CTTCCGTATA 5460

AATAACACCA GTTGGGTTAG ATGGTGAATT TAAAATGATT GCTTTTGTTT TGTCCGTTCG 5520 

TACAGCTTCT AATTGTGCGA CTGACACTTT AAATGATTGC TCTTGCGTGC TTGATACAAA 5580 

TACTGGCTTG CCTTCTGCTA ATTTGACTTG TTCACCATAA CTTACCCAAT AAGGGACAGG 5640

GATAATCAcT TCGTCCCCAA CGTTCAAAAT GGCTTGAAAT AATAGATACA GCGCATATTT 5700 

GGCACCATCT GTCACAATAA CTTGCTTGGG TTGATAACGA AgTTGATATT CTCGTTCCAC 5760 

ATAAGAGACA ATTGCTTGCT TTAATTCAGG AATGCCGCCA GAAGGTGTGT AATAACTTGC 5820 

TCTTCCATCC TCAATTGAAG CAATTGCTGC TTTTTGAATA TTTTTCGGGG TTACAAAATC 5880 

TGGTTCCCCA ACCGTTAAGC TTAAAACATC TACACCTTTT GCTTTTAATG CTTTCGCCTT 5940 

GGCGGcAGCT GcTAATGTAA CAGAaGGAGT CAATTGCTGT GCTCGATTTG ATACGTTCAT 6000 

ACACCTTTCA CTCCCACTTT CATCAAAACC TATCTAAAtT AAATATCTTT GACAACGTTG 6060 

ATCTCTTCGC CTTTATCAAA GCTCATCAGA TAATACGTCA GTACTCCTTG ATCATTTTCG 6120 

GTGACCACTT CCCATGCAGG GCGCTTATCA TAAAGCCCTA ACGTAGCTTT CATCACCGAT 6180 

TGATCGGGAT GGGCTTTAGC AACAATGTCT GTTATTTGTT GTTCAGTTTG ACCATCTTTT 6240

TGATTTAAGA CCGTGACTTT TTCTCCCGAT TTAGGGACAA TCACGACAAT CTCTGTGCCT 6300 
TTATCATTTT TGCCGGTTAA AGTAAAATAC GTATTTTTGC GTGTAAACCA GTAAAATTTA 6360
TCAACTGTTT CTAAATTAGC 
GcCATTGGTT GATTGGtTCG 
TGCGAAGTAA AAAGAACCAA 
TCTGAAATGT AcTGACCCAT 
GGTACATAGA CTCGCGCATT 
ACCTCTGTTA AACCTAACCG 
CCGCTTGTAT AAATAATTCG 
GCTTCTAAGT CATGAATTTC 
ACCCAACGTG GACGCCAATC 
TCATCAAAAA ATTGAAACAA 
AACTGACTCA TAATTGTTTG 
AGCTGGAGCG TTTGACTATG 
GTTAAAAGCA ACTCCTGATG 
TCAAAAAGAA AATCTTGCCA 
ACAAACCGCT GCTCTTCAGA 
TCTTTTTCAA TAAAATGCGT 
GCCATCCGAC TAGCGATGTC 
TCAGGTAATA ACGGTTGTTT 
AACACGTTAC TTTGAGCGAC 
TCATCTTTTT TTGAAAGAAA 
TTTAAATTCG TCAATTGTAG 
AGAACGCCCA TTTGATAAAG 
AGAAGTTTTG GCAATTGCAG TTGCTTCTTT TTTGGCTTGG 6420

TACaAAAAAG ATAATTATTG TTACAAGAAT TGCTAAAAGG 6480 

GACAGTAATC CGTTTCTTCT CTTTCTTTTC TTCTAATTCC 6540

CTCTAAACTT ACTGGGTCTT ATTATAGCAG AAAAATAGCG 6600 

AAAAATAATC GATTATTTGT TTAAAAATTC CTTTAATTCT 6660 

CGCTTCTTTT ACTGAGAGTC CCTTGGGAAA CGCTTTGATT 6720 

AGTGACTAAA CGACGATCCA GAATCAGCAA TGCCCCTCTG 6780

ACGGCCCAGT GCTTGTCGCA AACGTAAGGC TGCTTTCGGT 6840

TTGTATCCCT TCTGCTTCTA AATACGCATA TTTTGCTTGC 6900 

TTCAAAAGGC AAACGCGTGA CTATCAATAA CTGTAAAGCT 6960 

TTCCCAAAAG CTATCGGCTC CTAGCAAAAC ATTTGGTTTT 7020 

AATTTTTTCA CGACTGCCGC TAATCCCTTG GGCGAGGACC 7080 

TCTTGGCTGG ACTTGATGAT ACACGGTGGA CAATAATTCA 7140 

CATCGAACGT TCCTCTTGCG CCATCAATTT TTCAATGGTA 7200 

tTCATAGGmA GAGGCATTTT GAATGGCTAC TGCTTCTTCC 7260 

TTCTGCATAG TTATAAGGCG TTGGCAGGGT TTTGAACGTC 7320 

TTGCGGTAAA TACTTTTTAT TGTTGCCAAA CTTCAATGTG 7380 

TTCATACCGC GGATACCACT TTGTTTCAGG AAGCAAACTA 7440

AATCAAGCCA TGGTCTTCAT TTTGTGGTAA ATATTCTTTT 7500 

ATCAAGATAA ATGGTAAAAC AGTGATGAAT ACGTTCAATT 7560 

GCGAACAAAA ACAATGCGAT CAGCCAAACT AAAGGTTTCC 7620

CTGTAACTTA CTTTGAATAG TTAACAGCTC CTGCAGTAAA 7680

TTTTTCTCCC GCAGCAGATA AATAATCAAT ACGTTCTTTG 7740 

GTCTATCCGC TTCTGTGTTT TACTAAATAA ACGATGCAAT 7800 

TTCATCCACA AAATCCAGTA AACCTTGCTG ATAAATTTTT 7860 

AAACTGTTGA AAAATTTGCT GTAAAGCAAC AAAAAGTTGT 7920 

GACTTGTTTT TTTAGTATCA CACTGTTCAC TCGTTCGCTC 7980

TGGTAAATGA TGTGCTTCAT CAATAATTAG CCACGGACTT 8040

TCTTAGCGAC TCTTGCGCCA AAAATGCATG ATTAACAATC 8100 

TTTTTTATAC AGATGTCTTA AAAAGTCAAC TTCATAAAGT 8160 

ATCCAATCCG CGATGTGTAA CTTCTTTCCA GAATAAATGA 8220 

TTCATCTAAA TCGCCCGTTT CTGTTTCTAG CAACCAGACG 8280 

AGCGTATTGT TTTTGCGGAG TTGGCTGATT CAACGTTGCC 8340

TTAAAGCATT GTAAATCAAG ATAGTGTCGA TGGCTTTTAA TAACAATGCC TCGTAATTTC 8400

CCTTGGCAGA TTTGATTGGC TAATGGTAAA TCTTTTTCCA CTAATTGATT TTGTAAAACA 8460

ATCGAAACTG TTGAAATGAT GACTGGTTTT tCTGGCGTTG CTAAATAGCT CATCGGCAAC 8520 

AGATAGCCTA ATGTTTTTCC CGTTCCAGTC GCCGCTTCAA TAAATAAATC TTTTGTCGTT 8580 

CCTTCTGTAA AATGATCATA GACTAAATTC ATCATCCGAC TCTGTTCCGC ACGATAAGCG 8640

AAACGTTCGC CAAACAGTTT TTCTTTGGCT TTTTTAGTTT TGGGATACGT GCTAGTCTGA 8700 

TAGAACGTTT CTTCAAAAAG AGGAACTTCT TTTTTGCGTA AAGCAATGCC ACTGACCACT 8760

TGATAAGCTG GATTTAArGG TGTGACTTGT TTTTTCATTT GTTCATACGT TTGTTGAATA 8820 

AAGGCACTGG TTTCCCGCGC TGTCTGTTGA CTTAATTCCG CAATTTTTTC CATAGTGACT 8880 

AATGGTAAAG ATTTCATTTT TTCTTGAATT AATAACAAAA GTTCgGCTGT AACCTGCGCA 8940 

TCACTATCTG CTTGATGGGG ATTCTCATGA GACAAACCTA AGCTTTCCGA CAAATCACTT 9000 

AAACGGAAAC TTTTTTCTGT CGGTAAAAAA ATCTGTGCCA ATTCCACTGT ATCAATGGCT 9060 

GGAATAGTAA GTGGCGGCGT GCCACACCGT ACCAGTTCAC GAGCTAAAAA ATTATAATCA 9120 

AAGTGGACGT TATGCGCCAC AAAAATAGTA TCTTCCAGAT AATGATAAAT TGTATGTGCT 9180 

ACATCTTCAA AATAAGGCGC TTTTTGAACT TGCGTATTAC TAATCCCCGT TAAACTTTGA 9240

ATTTGTTTTG GTACTACTTG ATTGGGATTC ACATCTGTCG CAAAGTTCGC AATAATCTTT 9300 

CCATCTTGAA CGAGAACGCA TCCAAACTGA ATGATCCGAT CACTTGTAGG ATCTGTTCCA 9360 

GTCGTTTCAA GGTCAACTAC TGCATACACT TGTTTGTCTT TCAATGACTT CACGCCTTTC 9420

ATTCATCTTT GAAAATTATA CTACAATTTA TAGGATTTTC ACTAGTATTT CAAAAATTCT 9480

CTTGAAATCT TTTCCAGAGA TAATAAGAAA GrGCCTGAAA CAAAAATCCA AAGTGATTTT 9540

TGTTTCAGGC yTAAAAGCTG ATAAATTGAA CGGTTTTGTC TTACCTTCTC CTAAATAAAC 9600 

GATTAATAAC AGTCAAGGAT TTTCTGCATA TACTGGTTTA ATTGCCCATA GCCTTCCAAT 9660 

TCTCTGATTA CTGGGAGCGG ACAACGAACC TTTTTATTAA ATAGTGTCCG CTTTTCAACA 9720 

TATTCTGTAA tTCGTCTAAA AATTTATCTA ATCCTTTTTC TAATTTCATA CAACTTAACT 9780 

GATTGCAAAT TCGCACGTCT CCTGCCTTTT TCTTTTGAAC TTGTTTAAAT GATTGTGAGA 9840

ATGCTTTTGC TTCAAACATT TGTGCTTCTA ACTGTCTTTT TTCAATAGTT ATTTTCGGTA 9900 

CTGCTTGATT TCCATCAGTA TCCTCTAAAT GGCGACTCAT AAAGCGACGA TAAATTTCCT 9960 

CTTCTAATTC GTCCAATTGT CGTTGAAAGT CCGCAGGAAT TGTCCGACTT GATGAATTTT 10020 

CTTTCTCACT TTCCATAGAG ACAGCTTCTT TTTCTGGCTT CTGCTCCTCT TCTTTTTCGG 10080 

CAGACTTATT CACTGAATTT AGCGAGCTTT TTTCTGTTTC TTGTGGAAAA GTAAATAAaG 10140

GTTTTGCCGC GGTACTCTGC TGGTTTTCTT GTTTTTTTTC TGGCAGCGCT TCTTTAATTG 10200 

GCGCTGGTTC ACTTGTTGCT ACAGTTGTTT CACCTTCTTG ATTTTCCCGA TTATCAGGCG 10260

AGAGATTGAC GTCTGGTTGT TCCAGATTAG TATCCTCTGA TACTTTCAGC GATGCCTGTT 10320

CTGCGGCTAT CACAGGAGAA GCTCTTTTCA AATGGATCGT TTTTATTTGT CTAGAAATGA 10380

CGGGATTTCT CGTTTGATTA ACGGCTTCAT TTAAGGAGGC TAAATTTTCT TTTAAAAGTG 10440

TCAACATTTC ACGTAAGACA GCAATTTCAC CAGCTAATTC CGTTAATTGA TTTTCTTGCG 10500

GCACTACTTG CACATTGGTT TCATTCATAC GGACTGTTTT TTTATTTTTA GTTGTCGGTA 10560

TTTCATACAT CCATTGTTTC AGTGCGGATC GTAAGTCACG AGTTGTCGGC TCAAAcTTCC 10620

AAAATAAACA GTGTAGTACA TTAcTCCGTT TCAACATAGA CATAAATCTT TTTAACAATT 10680

AACTCTTCAA ACTGTTCTGA TTCATCCCAA CTTTCGCATA AAGCCTCAAA AAATTGTTCC 10740

CCTTCTGCTA AATTTTGATA CTCTTTCGCT AAATAAAGCT TGCGrATTAA GTAAAAaTTA 10800

TACGGTGTTT CAACATTATC TGCATAAATT TCTCGTCCTA AAAAATCTGG ATTCGTTTCT 10860

TTACAAAGAA AGACCGTTTT CTCTTCTGGT TGTGTTTCAA GTGTTTTAGT CATGAGATTC 10920

ACTTCCTTTT TCTCCGTTAA TAACCATTTT TTAATGATAG ATACCTTTAG TATACAGGAG 10980

AAATTCTTCT GATTTTCTCA TATTTTTAAG AGACAATAGA AATTTTAGAG AAAAATCGAC 11040

AGAAAAAAAA CGTGAGATAA ACAGTCCTAT TGACCATTTA TCTCACGTAA AATACTTAAA 11100

CTGACTATTT TATCCTTGGA CATGTAACCA ATTATTCTGC TGAATCAATT GGCAAAATCG 11160

TGTCTTTTTT CTCGCGACGT GCAATTGCTG GTAAAATCAT TCCTAATAGA ATTAAGACGA 11220

TTGGTGTCAC AATGTTAGAA GCTAATTGGA ACAACCAGCC TTGAGGGTCT GAGGCATACT 11280

CAAATTTAGG CACCATTCCT AGGATACAAG CAAAGGCTGT AAATAGGAAA CACCAAAGAC 11340

CAAACACAAA ACCAATTTTA GGATTTTTAA CAAATTTATA TTCTGAATTA AATTGTTTGT 11400

ATGCTTTGTT TAGAAGCATA TAGGCAAAGA AGACCCACAA GTAACGCATT GGCATAACTA 11460

CAGAATTTAA GTTAGTTAGC CATTTAACCA CTTCGTTCAT GTCCTTCAAC CCAAACATTG 11520

GTAAAGCGAT AATAATACTT ACTAAAATAC CTGTTAATAA GTAACCATTA ATTAACGTAC 11580

CTTTTTTCGT ACGTTTACGT AACCAACTTG GAACAAATTC TGGATCTGCT TCAGCTAAAA 11640

GAATTTTCAG TGGAGCATCA ATTGAAAAGG CTAGCGCAgc CACTTGACCC ACTGTATTCG 11700

TCAAAGCATA AACAATCATT AAGAGATTAC CAACACCATA ATATGCGCCT AATTTTTGGA 11760

AGGCACTGTA GGCACCATTT GCCATTAAAT CTTTAGGAAT ATTGTCACTT GAGAATAACA 11820

TGCCCATTGC AAATGAACCT AAAACAGCCG ATAAACCAAC CATTGCCGCT AGGAAAATCA 11880

TTCCTTTAGG GAACTCTTTG GCTGGATTTT TAGTTGCATT TACGTAAGGA GAAATTTTTT 11940

CAGCGCCACC TACCGCAAAA ACTAACATCG ATACGGTTGT AAAATAACTA AAGTCAAATT 12000

TAGGAATATA GGTGCTGACT TGTGACATGT TTGGCGTCGC AAATTCAACC GTTGAATTAA 12060

TAAATGGTGC GCCTACCGCT AATAAAATGA ATAAAATAGA CATCACAAAC ATGGCGGTTC 12120

CAGCTAAACC ACCAATGACT TTTAAGGTGG TTAACCCTTT TGTTGATAAT AACAAGAAAA 12180

CCAAGAAAAT TATTAATGAA ATAAATGATA CCGTTACGAT TGGCATCGTA TTGACTAGGT 12240

TTCCATTCCC TTGAACTGCC CAACCTAAGG CAATTAAAAT TGCTTGTGGT TTTTGTGCCA 12300

AGTAAGGAAT GTGCACTACC CAATAAGTCC ATGCTGCATA ATAGGCTAGA CGTTTAGTAC 12360

TGGTATTTTG CACCCACGAA CTAACTCCAC CATTACTGTC TTTAAATGTT GAACCTAACT 12420

GTCCTACGAT TAACGCATAG GGAATAAAAT ATAAAACTAA AATCAAAAGC CATGAAGTGA 12480

CCACTGAAAT CCCTTGTTGT GCATAATTAT TCACAACATT TCCAAGACCC CAGACCATGT 12540

TAAAGGCAAT TAACCCGACG GTGAACCATC GTAATTGCTT TTTGTCCATA TAACCAATTC 12600

CTTTCTTATT AAAAAAAATA AAGCTCTTCA TTTATCATAA GCATTTTACT TTTTATGCAC 12660

AAGCTCAACA TAATTAAATA AAGATGGCTT TTTAATAAGT AATTAAAACG CTTCTAGCAA 12720

TGATTTTCTA ATTTATTTGA TGACTAAAAA AGGCCGGTGA ATAAGTTGCT TCACCAGCCT 12780

TTTTCTAAGA GATTTTTCTT TTATAACATA TCAAATAATG ATAATTGGTT TTCATCTGGC 12840

AAGTCTT^TA AGACACCATT TTCATTCATG TATTCAATTA ACGTTTTAGA AACTTTGCCT 12900

CGTGTTGCCA AATCTTCTTT CGATAAGAAC GGTTGCTCTT CTCGCGCAGc AACAATGGCT 12960

TTCGCTACGT TGAGGCCTAA ACTTGGCACT GCTCGGAAAG GTGCAATCAA GGTATCCCCT 13020

TCAATCACAA AATTTTCAGC ATCTGATTTG TACAAATCAA TCATGCCAAA TTTATAGCCC 13080

CGTTCCAACA TTTCATTACA TAATTCTAAA ACCGTTAATA AGTTTTTCTC TTTAGTGGAG 13140

GCGTCCATCC CTTTGtCCGT AATTTCTTTC ATTCGTGCTT TCACAGCTTC TTTTGCTTGT 13200

GACATTGCCA CTAAGTCGAA GTCATCCGCC CGAACGGAGA AAAACGCACA GTAATATAAG 13260

ATTGGGAAAT AAACTTTAAA GTAAGCGACC CGCAACGCCA TTAATACGTA TGCTGCCGCA 13320

TGGGCTTTCG GGAACATATA TTTAATTTTA GAACAAGAAT CAATATACCA GTCAGGGACG 13380

TTGTTTTCTT TCATCGCATT TAAATACGTA TCCCGTAGTT CATCAGGAAT CTTATTCCAT 13440

TGTCCTTTAC GTACCGTTTC CATAATCTTA AAGGCCATCC CACTGTCCAA ACCAGCATGA 13500

ATTAAATAAA CCATGATATC ATCCCGACAA CCAATTACTT CAGCAAGATT GgctTCGCCA 13560

CGACGAATCA ATTCTTCTGC ATTCCCTAGC CACACATCGG TTCCGTGAGA CAGTCCAGAA 13620

ATTTGAAGTA ATTCTGCAAA AGTTGATGGA TGGGTCTGTT CTAACATACC TCGAACGAAA 13680

CGTGTCCCAA ATTCAGGAAT TCCTAAAGTT CCTGTTTTAG AATAAATTTG CTCTGCATCC 13740

ACACCTAGAA CATCAGGTCC TTCAAAAATA CGCATCACTT CTGGATCGTC TGTTGGGATG 13800

GTTTTTGGCT CAATCCCAGA TAAATCTTGT AACATCCGAA TCACGGTCGG ATCATCATGT 13860

CCTAGAATAT CAAGTTTTAA AATATTATCG TGAATGGAAT GGAAATCAAA GTGCGTCGTT 13920

TTCCATTCTG CCTCTTGATC ATCAGCAGGG TATTGAATTG GCGTAAAGTC ATAAACATCC 13980

ATGTAATCAG GAATTACAAT AATACCGCCG GGATGCTGCC CAGTCGTCCG TTTAACCCCT 14040

GTTGAACCTT TGGCTAAGCG ATCAATTTCA GCACCACGCA AATGTAAATT ATGATCTCTT 14100

TCATATCCTT TAACAAAGCC ATACGCTGTT TTGTCGGCTA CTGTACCAAT GGTTCCTGCC 14160

CGGTAGACAT ATTCTTCACC AAAAAGAACT TTCGTGTAGT TATGGGCTTC GGCTTGATAA 14220

TCGCCAGAAA AGTTCAAATC GATATCGGGA ACTTTGTCGC CATGGAACCC TAAGAACGTT 14280

TCAAAAGGAA TATCGTGACC ATCTTTAAAC AATCGAGCGC CACATTTTGG ACAGGCTTTT 14340

TCAGGCATAT CAAACCCTGA ACCATAGGAA CCATCTTCAT AGAACTCAGA ATATTGACAC 14400

TCAGGACAAT AATAATGTGG CGCTAAAGGA TTTACTTCAG TAATCCCTGT CATTGTCGCA 14460

ACAAAACTCG AACCGACAGA GCCCCGTGAA CCAACTAAAT AGCCATCTTC ATTACTTTTA 14520

TGCACCAGTT TTTGAGAAAT TAAATAAATG ACCGAGAAAC CATTCCCGTT AATCGAATTC 14580

AGTTCCTTTT CTAACCGTTT TTCGACAATT TCTGGTAAAG GATCGCCATA CAATTCGCGC 14640

GCACGATTGT AGCTTAAATC AGTGATCTCT TGTTCAGACC CAGGAATTTT CGGCGTATAC 14700

AAATCGTCTT TTACTGGAAT AACTTCTTCA CATAAATCAG CGACTTTGTT GGGATTTTCT 14760

ACTACAATTC GCTTAGCGAT GTCTTGTCCT AAAAATTGAA ATTCCGTTAA CATTTCATCG 14820

GTGGTTCTGA AATGAACCTT TGGTAAACTA TGGCGATTTA ACGGATTAGC GCCGCCCATA 14880

GACCCGACTA AAATTTTCCG ATAGATGGCA TCTTCTTCAT TTAAATAATG AACATTTCCA 14940

GTGGCGACAA CCAGCTTACC AAGTTCGTCA CCGATTTTCA CCAGATTACT AATAATTTCT 15000

TCCAAATCTG CTTCATTTTT CACCAATTCT TGTTCAATTA ATGGCGCGTA AACTGGTTTG 15060

GGCATAACTT CAATATAATC GTAGAATTTG GCACGATTTT TTGCTTCTTC GACACCTTTT 15120

TGCATCATTG CTTCAAAAAT TTCGCCGTTT GAACATGCGG AACCGATTAA AAGTCCTTCT 15180

CTTAACTTCG ATAATTGCGA ACGAGGAATC CGTGGTACTC GGAAAAAATA ATCTACATTG 15240

GACATGGAAA TCAATTTAAA AAGATTTTTT AATCCTGCTT GAGTTGTTGC TAAAATCGTT 15300

GCATGGAAAG GTCGAGCTCG TTTGTATGAG TCTCCTTCGC CAATATGACG ATTCAAATCA 15360

TCATGGAAAT GCATATCATG ATTCTCTTTC GCTTCTTTTA AGAAAATCCA GCACAAATGA 15420

CCTGTTGATT CCGAGTCATA GATGGCGCGG TGATGTTGTT CTAAGTTGAC GCCAAATTTT 15480

TTCGATAAGG TATTTAGACG ATGGCTTTTG AAATGTGGAT ACAAGAATCT GGATAATTCT 15540

AATGTATCGA TGACGGGATT AGCCGCTTCA GGTATTCCAT GTTTGCCATA ACTTGTATTT 15600

AAGAACCCCA TATCAAAAGA AGCGTTGTGG GCCACTAAAA TAGTTCCTTC TGAAAACTCT 15660

TTAAACATGC GTAAAACTTC TTCTTCGGAT TTTGAACCGC GCACCATTTC ATCTGTAATC 15720

CCAGTTAAAT TGATCGTTGT TTGAGATAAT GGATGGCCAG GATCAATAAA TTGCTCAAAA 15780

GTCTCAATGA CATTTCCTTT GTGcATTTTG ACAGCGGCCA ATTCAATAAT TGTATCATAA 15840

ACAGCGGAAA GCCCCGTTGT TTCCACGTCA AAGACCACAT AGGTAGCATC TGTTAATTCA 15900

ATATGTTCTT CGTTGTAAGC AATGGGTACA CCATCATCAA CAATATTGGC TTCCACGCCG 15960

TATAAAATTT TAACGCCTGC TTTTTTACCT GCAGCATGGG CATCAGGAAA CGCTTGGGCG 16020

CCGCCATGAT CCGTAATTGC GATGGCCTTA TGACCCCATT TCCCCGcTTG CGCCACCAAG 16080

TCACCGACTT TATTGGTCGC ATCCATTGTA CTCATGTTAC TATGCATGTG TAATTCTACA 16140

CGTTTTTCAC CTTCTGGCGC GTAGTCTTTT CTTGGCGCAT GGGCGACTTC CATCAAATCT 16200

TGAGCATTCA TGACTAAATC ACGCATGAAG GTATCTTCTT GTACACTACC ACGAACACGA 16260

ATCCAACTTT GCGGTTGAAT CGCATCAAAA ACTTGTTCAT CTTTTTCCCC ATTCGAAAAT 16320

TTCTTCACCA CGAAAGAGGA AGTATAATCA GTAATTTTCA AAATTAAAAT TTTTCGTTTA 16380

GAACGCAATT CACGCACTTC TTTATCAAAA ATAAAGCCTT CAATCGTTAT CCGACGTTCT 16440

TCTTCCAGTA TGTTTCCCAT TGGCATAATG GGTTCATCAT TGGGAATATT CCGACCTAAA 16500

CGAATTGGAC CATCAAACGC AGGCGCTTGT TGTTGTTTTT CTTTTTTCAT TTGTTCATGT 16560

TTAACAAGCG ATTCAGCAGC TTGTTGTTGA AAGGCTGCGG CTTGTTCTAA TTTTTGCTCT 16620

TCAAATTTTT TCAGCACTTC TGCAGCTTGC TGTTGATCCA TTTTTGGTTC AATATGAAAT 16680

TTAGGAAACC CGTAAGAGAA ATAAAGTTCC TCAATAATTG GTAAATATTG TTGCTTCAAA 16740

TAAGGAATAA CTGCTTCATT ATCAACAGGT AAGATAATTT TCCGATCTTC AAATTGTGGT 16800

GTCTGTGTTT TTAAGACTTT CTGAACTAAC GGCGTATTGC ATTGACTATT TAACAAGGCC 16860

AATTGCCAAT AATCCGTTAA TTGTTCCTCT GTAAATGTTG TCTGATTTGT GGTTATTTGA 16920

ACGGAAACCT GCGCAATTTG TTGAAAAGCT AGTTCCAATT GTTGCATAAA TGTTTGATAT 16980

AGCATCACTG GCATCAATTC ATCAAAACTT AAATGAAATT CCCATAAGCG ACTTTGTTGG 17040

TGAACAATGA CCTTGTCAAT CCGGCCATTT TGAATTAAGG GATGTTCTTT CGCTGTGTCA 17100

TCTAACTGAA TTTGCGTCAT CAGTTTATCA AATAAATCGC GTGCTTTTTC TGTCACAAAA 17160

TGAACCTCCT TCTATCATTG AAAAAAGGCC CTCAGACAAA TCTGTCAAGT GCCTTTTTTC 17220

GGTTTCTCTG CATTCGCTTC TCTATTAAGT ATACTCGTTT TTTGTAAAAA AACAATCTTG 17280

TTTCTTTTCT AAAAAGAGCT AAGAAGAGCT TGAAGCAAAA ATCACTTTAG ATTTTTGCTT 17340

CAAGCyTAAA CGCTGATAAG TGGCGGAAAC AGTTTATCCT AATCATTCTG TCCGCCCTTT 17400

TCATTATTAT TCCACTTCTG ATGTTGTGTT CATCArAATA GAAAGTGTAC TTTCTAAttC 17460

TTCTTTGCGT ACTTCCaACA TTTCACCAGT ACGTTTGATT TkGACTTCTA CCACACCATC 17520

GACTGCTTTT TTCCCGACTG TGATACGGAT TGGACAACCA ATTAAATCAG CATCAGCAAA 17580

TTTTACGCCT GCGCGCTCGT TGCGGTCATC AACTAATACT TCGTAGCCCG CTTCGGTCAT 17640

CATTGCTTCA ACTTCTTGTG ATAGTTTTGT TTGGTACTCA TCTTTCACAT TCATTTGGAC 17700

CACGTGTAAA TCAAATGGCG CAATTCCTGT TGGCCAATTA ATACCACGCT CGTCTGCATT 17760

TTGTTCAACG ATTGCTGAAA GTAAACGACT AACGCCGATG CCATAGCAAC CCATGATCAC 17820

TGATTTTTCA CGACCGTTTT CATCTAAAAC TGTTGCGCCC ATAGCATCAC TATAACGCGT 17880

TCCTAATTTA AAGATATGAC CAATTTCGAT TCCTTTAGTA AAGGCTAAGA CACCATTGCC 17940

ATCAGGAGAA GGATCGCCTT CTTGAACAAA GCGTAAATCT TCGTAACTAA TTGGTTGGAA 18000

ATCACGATCT GGATTTACAT TTGTCAAATG ATAGCCATCT TCGTTAGCTC CAACAATGGC 18060

GTTGGCTAAA TCTTGAACAG CTAAATCTGC ATAAATTTTA ACATCTTCGG AAACATTGAC 18120

TGGACCAATT GAACCAAAGC CCGCTCCTAA GACACGGCGA GCATCTTCTT CTGTTGCTTC 18180

ATCTAAGAAA TCAGCGCCTA AGAAGTTTTT CAATTTGACA TCATTGACAT CATGATCGCC 18240
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ACGAACCAAG ACCATTACTG GCTCTTCATC 'GGCAATAAAT AGTACTGATT TAATAATTCG . 18300 

TTGTGGCTCA ACTTCAAAGA AGTTTGCCAC TTCGGCAATT GTACCCACTT CTGGTGTTGC 18360 

AATTTTTTCT AAATCTAATT GTGTTTCATG TGATTTTTTC GGCGTATAAA GACTTGTTGC 18420

CATTTCTAAA TTGGCAGCAT AATCACTTTC TGTTGAATAA CAAATAGTAT CTTCCCCAAT 18480

TTCAGAAATC GCCATAAATT CTTTAGAATC TTTACCACCC ATGGCACCAC CGTCCCCAAT 18540

AATGGCACGG AATTCTAGTC CACAGCGTTC AAAAATGCGT GAGTAAGCTT TTTCGTAATC 18600

ACGGTATGAT TGATCTAAAC TAGCTTCATC GGCATGGAAA GAATAGCCAT CTTTCATGAT 18660

AAATTCACGG CCTCTTAATA AGCCAGAACG TGAGCGTTTT TCATCACGAT ATTTTGTTTG 18720 

AATTTGGTAT AAATTCAAAG GTAATCGTTT GTAAGAATTA ATTTCATCAC GAATTAATTC 18780 

TGTGAAGGTT TCTTCATGCG TTGGGCCTAA GATATAGTCA CGGTCATTGC GATCTTTTAA 18840

GCGATACAAG TTAGGTCCAT AAGTTTCATA ACGGCCTGAT TCTTTCCAAA GTTCTGCGGG 18900 

TAATAACGCA GGCATTAACA TTTCAACTGC ATCAATTtTC TCAAATTCTT CTCGCATAAT 18960

CGTTTTTAAT TTTTCTAAAA CACGATTGGC TAAAGGTAAA TAAGAATAGA TCCCTGCAGC 19020 

GACTTGACGA ATATAGCCTG CGCGCAACAG AATTTGATGA CTTAAGACTT CTGCATCATT 19080

TGGTACTTCT CTTAACGTTG GAATTAACAT TTTTGACTGT TTCATTAAAA CTCTCCTCTA 19140 

TTTTAAAATC AACTATTTTT GATTTCTTTA CACACTTTTT TACTTAAACT TTCTCCTTAA 19200 

AAGAAAAAGC GTTGAATATC GTTCCAAGTA ACTaACACCA TTaACACCAT GACAAACCCA 19260 

AAGCCAATTA ACGTAAT3AT GCCTTCTTTT TCAGGACTAA TTGGTTTTCC ACGTACACCT 19320 

TCAATAATGT TTAAGACAAT TTTCCCGCCA TCTAAAGCTG GGATCGGCAA CAAATTAATA 19380 

ATCCCTAAGT TCATTGACAA CATGGCCATT AAGAATACAA CTGTACTTAC TCCAGCATTG 19440 

GATGCTTCTT CCGATAATTT AAACATCATG ACTGGCCCAC CTAGTTTGTT TAAACTAAAG 19500 

CcTGTGAATA GTGAGCCGAG TGCTTTAAAA ATCTGTGTCG TACTATTCAA AGTATCCTGA 19560 

ATACCGCCCA TCAATTTTGA CGGTAAATCG GTTTTCATAT AAGGATAAAC GCCGACTTTA 19620 

CCAATCGTTT GTTTTTCCAC TTTTTGTTTT TCTGGTGTCA CTGTTAGTTG CTCTTCTTTG 19680 

CCGTTACGCT CAACTACGAA CGTTAACGGC TTTTCGGGGT TCTTCTGCAC AATGGTTGTA 19740 

AAATCTTCGT ATTTTTTGAT TTTTTGATTA TTAATCGATA AGACTTTATC GTTTTCTTTC 19800 

AACCCAGCTT CTGCGGCTGG GCCATTAGGA ATCACTTGTC CAATTTGATT CGTGTTTAAA 19860 

TCAGTGACGC CTCCTTGTAG AAAGACCGCT AACGTAAACA GAATAAACCC TAAGATAAAG 19920 

TTATTCATGG GTCCCGCAAA GTTCGTCAAA ATGCGTTGCG ATAATTTCGC TGATTGAAAT 19980 

TGAACGTCAA GTGGCGCAAT CCGCACCTCG GTTCCATCAC TTTCAATAAT CGTTGCATCA 20040

TGGTCAACTT TATAAACGGT TTCTTCTTCC TCGTTTCCAT TGACATAGCC CTTGATGAAT 20100 

AATTCTTTTT CAAGATCAAA ATCAATGACT TCCATCGGAA TACTATGAGG TAATTGTACT 20160 

TTTTTACTTG TATTAATTTT AACCACATTA CCCACGGCAT TTAACTCAAC AGATAGAGGC 20220

ATACCTGGTG TGATTTCTGT CATGTCTTCG CCCATCCCAG CCATTCGCAC ATAGCCACCA 20280

ATTGGCAATA AGCGAATCGT ATAAGTGGTG CCATCTTTTC CACGATGCGC AAAAATCTTT 20340 

GGTCCCATTC CGATTGCAAA CTCACGAACT AAAATACCCG CTCGTTTTGC AAAATAAAAG 20400 

TGGCCAAATT CATGTACGAG GACAAGAATA CCGAAGACAA TAATGAATGT GATAATTGTT 20460

TTCATAGATG TCCTTCTTCC TTCATTCACT GACGTTCAGT GCTTTTTATT ATTACATGGG 20520 

AAAGTCTCGT AAAACTTGCT CTGTTCCCTT CCCGATAAAC CAGTGCAACT GTTTACTGAG 20580 

AACAGTCGCC ACTGTTAGTG AACAAATCTT TCCTTATGTA AGACAAGTTT TATTTTAGCA 20640

TATTCACTCA TAAATAACCA TTGGACTTGA CCACTTCGCC TAGATATTTT GTGTTTTTTT 20700 

AGAAAACTGT ATAAATGAGA GGmaTTGATT ACTACGACCA GTATTTATTT TAAAGACCTT 20760

CAACAATAGT AGCTAACTTT AGACCATAAA CTTTTCCGAC AGTTTGTCGT ATTTTATCAA 20820 

CACGCTGATT GACATGTGCT ACCATGAAAG TATTAATAGT AATTTTAGGG GGAATGAATC 20880 

AATGTATGGA GAATTACATG GTAAAGTAGC CGTTGTCACT GGTGCAGCTA CGGGTTTAGG 20940

ATTATTTATT AcTCTGCGTT ATATCTTAGA GGGCATGAAT GTTGTTGCTG ACTATGTCGG 21000

TGAGCTGCCT AAAGAATTTG AAGATGTCCA AGCCAAGCAC GCAGATCGCG TAAAATTCGT 21060 

GAAGGCTGAT GTCTCTAATG AAGAAGATAT TAAAGCGTTA gCTGAAACCG CTTTAAAAGA 21120 

GTTTGGTCAT GTTGACATTT GGGTAAATAA CGCCGGTGTG GAGGCTTCTT TTCCGACAAT 21180 

TGATATGCCA TTAAAAGAAT GGCAACGTGT CATTGATGTC AATTTAAACG GTGTCTTTTT 21240 

GGGTTCTACG AC 21252 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

GCATATCAAC AAGTCAAAGA ACGATATGAA ATTTTACAAA AAGAAGAAAA AGCCTTAAAG 60 

CAACAACTGG CTTCTTATCA TGATttCTAT GAAACGTTTT GCTTGGCTGA AGAAGTCTTA 120 

TTGGCGGTTA TCCAAAGAGA ACAAGCACGT CAACATTTGT TAAATGCTTC ATCATTTTTC 180 

ATCTTGCAAA CGTGGATTCC AGTCGAAGAA AAAGCCGAGA TTTTGACAGC GATTGAAGAA 240

AAAGTTCCTA AGGACGAAAT AGCTCTTACT TTTGAAAATC CTACGAAGGC GGAAATTGAG 300 

ACGGATATAC CAGTTAAACT AGCGAATAAT AAATTGGTGC AGCCGTTTGA AATGTTAACA 360 

GAGATGTATA GCTTACCAAA ATATGAAGAA GTTGATCCGA CACCTGCGAT GATGCCGTTT 420 

TATCTAGTCT TTTTTGGTAT GATGGTAGCG GATATTGGCT ATGGCTTACT GATGCTACTC 480

TTGTCTATTA TTGCTTTGAA AGCATTTGTT TTACCGAGAG GCATGAAACG TTTTGCCGAC 540

TTTTTCTTGA TTTTATCATT TCCAACAATT ATTTGGGGCT TCATTTACGG TTCGTTTTTC 600

GGTGCAGCAT TACCGCCAAT AATGTTTGGA ATCAAATCAC CATTTCCGAT TTTATCAACG 660 

ACCGAAGATG TAAATACGAT TCTTATTTTA TCGGTTATTT TTGGGTTTAT CCAATTGGTT 720 

GTCGGTTTAA TGATTAATGG GATTCAGTTA TCAAAACAAA AGCGCTATTT AGATAGTATT 780

AACGAAAGTT ACGCTTGGTT AGGGATTTTG TTTGGTTTAG CATTGTTAGT CGTTGGTAAA 840 

TTAGTGGTTA AAAATGAAGG CTTATTTACA GCAGGTGCGA TTTTAGCTAG CCTTTCTGCC 900 

ATTGCAATTA TTGTGATTCC AATGATTCAA TCGAAGGCCA AATTAAAAGG CTTAGCCAAA 960 

GGACTGTATG GACTATACGG TGTGACAGGT TATGTCGGTG ATTTAGTTAG TTATACACGT 1020 

TTGATGGCGT TAGGAATTGC TGGGGGAAGT ATTGCTTCAG CGTTTAATAT GCTCGTAGAG 1080 

TTCATGCCGC CAGTTGCCCG TTTTAGCGTG GGCATCTTGC TGTTAATTGT TTTACATGCG 1140 

TTAAACATAT TCTTATCGTT ATTAGGTGCT TATGTTCATG GCGCACGTTT GCAATATGTT 1200 

GAGTTCTTTG GAAAATTTTA CACAGGCGGT GGCCGGGCGT TTAATCCGCT AAAAACAAAA 1260

GAAAAGTATG TCAATGTTGA GAAAAAATAG AAAATTATTT ATTGGAGGAA AATAGAGATG 1320 

ATGGATTACT TAATTAATAA TAATGGTGGA ATTGTTTTTG CAGTATTAGG AATGGCAATG 1380 

GCGACAATTT TTGCTGGGAT TGGATCAGCC AAAGGGGTTG GCTTTACTGG GGAAGCAGCA 1440

GCAGCTTTAA CGACGGAGCA GCCAGAAAAA TTTGGTCAAG CGTTAATTTT ACAGTTATTA 1500 

CCAGGTACAC AAGGACTATA TGGTTTTGTT ATTGCTTTCT TAATTTATAT TAACTTAGGA 1560 

AACGATATGA GTATGGTTCA AGGATTGAAC TATTTTGTAG CAGCTTTACC AATTGCTTTT 1620 

GCAGGATTGT TCTCTGGGAT TGCCCAAGGT CGTGTAGCCG CTGCGGGGAT CCAAATTTTA 1680 

GCGAAGAAAC CAGAACATGC AACAAAAGGA ATTATCTATG CTGCCATGGT TGAAACTTAT 1740

GCCATCTTAG GGTTCGTAAT TTCCTTCTTA CTTGTATTGA ACGTCAAATA ATTGAGGAGG 1800 

CAGATGGATG GATGCCATTG AAAAAATCAT CAGTGAAATT AAGCAGCAAG GAAAGCAAGA 1860 

AGTCGAAGCC TATGTCACCA GTGAACAAAC GCGCATCGAT CAAGAATTCC AAGCAGCACA 1920 

ACAAGAAATT TTGCTGAAGC AAGAACATGA AATAGAAAAA CGACAACAAC AATTATTAAA 1980 

AGAATTTAAA CAACGCCAAC AACGGCAAAC ACTAGAAATT CGTCAGGATG CATTAAATAA 2040 

AAAACAaGCA TATCTTAACC AaTTATTTGA CGAaGTGGtC TTAAAGATGA GTGAATGGTC 2100 

AGCGGAAGAa TTTCAACAaT TCATGAAAGC ACAATTAGGT TCACTTGAAT TAACAGGAAA 2160 

AGCAACCATT CTTTTAGGCG AGTATTCTCA AACGAAAGTG ACGCAAGAAT GGCTGACCGC 2220 

CCTTTCTGAT GCTACTGTTC AATGGGAGCT TTCAGAGGAA GTTGTTCCAA AAGAAAGTGG 2280 

CTTTATCGTT GCAAAAGACG GTTTGGATTA TAATTTTCTC TTTTCAGCAT TAGTTGAAGA 2340 

GATTCAAAAA ACAGAAGGCT TTAAAGTAGC AGAAAAACTT TTCAGTTAAG AAGGGAGGAC 2400 

GTAGCATGAG AAAACCAACG TATCATCAAA TTAATCCGCT GATTCGTTTA AAAGAAACGG 2460

AGCTGCTTTC TGAGCAACAG TTTCAACAAC TTTTAGAAGC GGAAACAGTT GAAGACGTTA 2520

GAAATATGTT AAAAAGTACG GTTTATCAAC CTTATTTAAC cgAaactttt GAAGAAAAAT 2580

TTGACTACCA TTCTTCTGAA GCGCTGGGAA GTCTTTATCG CTGGTTATAT GAAATGGCTC 2640

CAGAACCAGC TGTAGTAACG TTGTATACGA TGCGTTTTAC CTTCCATAAC TTGAAAATTT 2700

TGACGAAAGC TGAGCGAACA GGTAAAGATT TTGATTATCT TTATTTAGAA GATGGTCGTT 2760

ATTCTCTAAA CACAGTCAAA AGTGCCATTC ATACAAAAAA CTCTTCCGAA CTTGAGCCAG 2820

CATTATTAGA AGTGATTVGT GATGTTTTTG CCTATCTGGA AGAGGGCGGT TTACCACAAG 2880

CAATTGATAT CCTTTATGAT CGAGCATTTT TACAGCAACA ACGAAAATTG GCGGATGAAT 2940

TGGGCTATGA AGAGTTAACG AAGGAAGTCA TCGCTTTTAT TGATTTAACC AATCTATCAA 3000

CGATGGCTAG AGGGATTGTT CAACATCAAA ACAGCAACTT TTTATCAACC GTACTTTCTA 3060

GCGCTGGCAG TATCACAAAG AAAGAACTGT TGACCTATGC GGAAAAGTCA CTTGTAGAAT 3120

TTACAGCATT TGTCAGAACA ACGAATTACG GACAACTATT AGAAAAGATT ATCAATAAAG 3180

AAACCAtGAa TTAAATTTAT TGGCTTTtGA AAAACTCAAA GATGATTACC TGACCTCAAt 3240

GTATGAaAAC GGTCGAACGG TCGCTTTtGG ACCGCTCCCA TTACTAGCCT TTTTGAATGC 3300

GAAAGAGGTT GAATGGAAAA ATCTACGTTT AATTTTGGTC GGAAAACATA GTGGGTTCTC 3360

AAATGAAAAA ATTCGAGAAA GGATGCGAAA AGTCAATGGG GTATAAAATT GGTGTCATTG 3420

GAGACAAAAA TTCAATCTTA CCATTTAAAC TGTTTGGCTT TGAGGTCCAT CATGCCATTT 3480

CAGAAATGCA AGTTAGAGAA GCCATTGAAA CTATGGCGAA AAATAAGTTT GGGGTCATTT 3540

TCATAACGGA AGAAGCCTCT ACTTTAGCAG AAGAAACGAT TGAACGATAC AAAGAACAAG 3600

TGACACCAGC GATTATT7'TA ATCCCAAGTC ATAATGGTAC AATCGGAATC GGTTTAAGTG 3660

AGATTGAAAA AAATGTTGAA AAAGCAATTG GACAAAATAT TTTATAATGC AAAGGAGAAT 3720

GAGCGTGCAA ATTGGAAAAA TTGTCAAAGT TTCAGGTCCT TTGATTTTAG CTGAAAACAT 3780

GTCAGATGCT AGTATCCAAG ACATTTGTCA TGTAGGAGAT TTAGGCGTTA TCGGAGAGAT 3840

TATTGAAATG CGAGGCGACG TCGCTTCGAT TCAAGTATAT GAAGAAACAA CAGGCATTGG 3900

ACCAGGAGAA CCAGTTATTT CAACAGGAGA ACCATTATCT GTTGAATTAG CCCCAGGTTT 3960

AATTGCCGAA ATGTTTGATG GTATTCAACG ACCATTGGAT ACATTTCAAG AAGTAACCCA 4020

CAGTAACTTT TTAGGCCGTG GCGTTAAAAT TGATGCGTTA GATCGTGAGA AAAAATGGAC 4080

GTTTGAACCA ACTGTGGCAG TTGGTGAAGA AGTGTCGGCA GGTGACATCG TCGGTGTGGT 4140

TCAAGAAACA CCGATTATTC AACATAAAAT TATGGTGCCT TTCGGCGTTT CAGGAACGAT 4200

TGCCGAAATT AAAGCAGGTG ACTTTGCCAT TGATGAAACA GTTTACTCAG TGGAAACGGC 4260

TAAAGGAACG GAAAGTTTTA GCATGATGCA AAAATGGCCC GTTCGGCGGG GACGTCCCAT 4320

TTTAGAAAAA CTAAGTCCCA AAGTACCGAT GGTGACCGGA CAACGCGTAA TTGATACCTT 4380

TTTCCCAATT ACGAAAGGCG GAgcGGCAGC AGTTcCAGGA CCATTTGGCG CTGGAAAAAC 4440

AGTCGTTCAG CACCAAATTG CTAAGTGGGC CGATGTCGAC TTAGTCGTTT ACGTTGGTTG 4500

TGGGGAACGC GGGAATGAAA TGACAGATGT TTTAAATGAA TTTCCAGAAT TAATTGACCC 4560

AACAACTGGT GAGTCTTTGA TGAATCGGAC GATTTTAATT GCGAATACGT CAAATATGCC 4620

GGTAGCGGCA CGGGAAGCCT CGATTTATAC AGGGATTACC ATTGCAGAAT ATTTCCGTGA 4680

TATGGGTTAC TCAGTCGCAA TTATGGCGGA TTCTACTTCT CGTTGGGCAG AAGCGTTACG 4740

AGAAATGAGT GGTCGGTTAG AAGAAATGCC TGGTGATGAA GGCTATCCAG CCTATTTAGG 4800

TAGTCGCTTA GCTGAATATT ATGAACGAGC AGGACAAGTC ATCGCGTTAG GAAAAGATCA 4860

TCGTGAAGGA AGCATTACAG CGATTAGTGC GGTTTCGCCA TCTGGTGGGG ACATATCAGA 4920

ACCTGTCACT CAAAATACGT TACGCGTTGT TAAAGTATTC TGGGGCTTAG ATTCCCAATT 4980

AGCACAAAAA CGTCATTTTC CTTCTATTAA CTGGTTGCAA AGTTATTCTC TTTATTCCAC 5040

AGAAGTAGGG CAATATTTAG ACTTGGAATT GCAAGGAAAC TGGGCCgCTA TGGTAGCTGA 5100

AGGGATGCGG ATTTTACAAG AGGAATCTCA ATTGGAAGAA ATTGTTCGCT TGGTTGGGAT 5160

TGATTCCTTG TCGGATAAAG ACCGTTTAAC gTTGGAAACA GCCAAATCAT TACGGGAAGA 5220

CTATTTGCAA CAAAATGCTT TTGATGACGT GGATACGTTC ACTTCTCGAA CAAAACAAGC 5280

GAAAATGTTG CAATTGATTC TAACTTTTGG TGAAGAAGGT CAAAAAGCCT TAAGTTTAGG 5340

CACTTATTTC TCTGAGTTAA TGGCGGGAAC AGTTGAAATC CGCGATCGCA TTGCTCGTAG 5400

CAAGTATTTA CCAGAAGAAG AATTAGAAAA ATTGGATCGT TTACAAGCAG AAATTAAAAC 5460

AACGATAAAA GAAATCATTG CTGAAGGAGG AATGACGAAT GATTAAAGAA TATCGTACAA 5520

TCAATGAAGT CGTTGGTCCT CTGATGATTG TTGAAAAAGT GGCAGGCGTA AAGTACGAAG 5580

AATTAATTGA AGTACGCATG CAAAATGGCG AAATTCGCCA AGGGCAAGTT TTAGAAATCA 5640

ATGGAGATAA AGCGATGGTC CAAATTTTTG AAGGAACGAG TAACATCAAT ATTCGTGATT 5700

CAAAAGTTCG CTTTCTGGGA CATCCTTTAG AATTAGGGGT TTCGCCAGAT ATGATGGGGC 5760

GCGTTTTTGA CGGCTTAGGT CGCTTAAAAG ATAATGGACC AGAATTATTA CCTGAGAAAA 5820

AATTAGATAT TAACGGCGAA GTTATCAATC CAGTTGCTCG TGATTATCCC GATGAGTTCA 5880

TCCAAACAGG GATTTCAGCG ATTGACCATT TAAATACCTT AGTTCGTGGT CAAAAATTAC 5940

CTGTTTTTTC AGCATCTGGC TTACCTCATA AAGAATTAGC GGCACAAATT GCCAGACAAG 6000

CCAACGTGTT AAATAGTGaA GAAGAATTTG CCGTAGTTTT TGCGGCCATT GGGATTACCT 6060

TTGAAGAAGC GGAATATTTT ATGGAAGATT TCCGTCAAAC AGGCGCAATT GATCGTTCAG 6120

TCTTGTTTAT GAACTTAGCG AATGATCCaG CCATTGAACG GATTGCAACA CCTAGAATGG 6180

CTTTGACAGC CGCTGAATAT TTAGCTTATG AAAAGGGCAT GCATGTCTTA GTTATCATGA 6240

CGGATATGAC AAATTATTGC GAAgCGTTGC GAGAAATTTC AGCAGCACGC CGTGAAGTTC 6300

CAGGACGTCG TGGTTACCCA GGTTATCTTT ACACGAACTT AGCAACGTTG TATGAACGGG 6360

CAGGCCGAAT TCGTGGCTCA AAAGGTTCCG TAACACAGAT TCCTATTTTA ACAATGCCAG 6420

AGGAGGATAA AACACATCCA ATTCCCGATT TAACTGGCTA TATTAGAGAA GGGCAAATTA 6480

TCTTGTCCCG GGAACTATAT AAGAGCGGTA TCCAACCACC AATTGATGTA TTGCCATCAC 6540

TTTCCCGTCT TAAAGACAAA GGAACTGGCG AAGGCAAAAC GCGCGGGGAT CATGCaGCGA 6600 

CGATGAATCA ATTGTTCTCA GCCTATGCAC AAGGGAAACA AGCCAAAGAA TTAGCTGTCA 6660 

TTTTAGGAGA ATCAGCTCTT TCCGATGTCG ATAAAATTTA CGCAGCTTTT GCCCAACGTT 6720 

TTGAAGAAGA ATATGTCAAT CAAGGATTCG ATACAAACCG TTCGATTGAA GAAACGTTAG 6780 

ATCTTGGGTG GGAACTATTA AGTATGCTAC CGAGAACAGA ACTAAAACGA ATTAAAGAGG 6840 

ATATGCTCGA TCAATATTTA ACTGAAGGGA AGTAGAAAAA TGGCTCGATT AAATGTTAAT 6900 

CCAACCCGTA TGGAACTTTC CCGATTGAAA AAACAGTTAA CTACGGCCAC GAGAGGTCAT 6960 

AAGTTGCTGA AAGACAAACA AGATGAATTG ATGCGACGGT TTATTGCGCT AGTCAAAGAA 7020 

AACAATGAGT TACGTATTCA GGTTGAACAA GAAGTGACGG ATGCGTTATC AAATTTTGTA 7080 

TTGGCAAACG CAACGTTGAA CGAAGCCTTT ATTGAAGAGT TAGTGGCAAT TCCAGCTGAA 7140 

AAAGTTGAAT TAGAAATTAT TGAACAGAAC ATCTTAAGTG TGCCAGTTCC TAAAATGATT 7200 

TTTGATTACG ATGAATCGGT TCAAGAAGCA CCCTTAGATT ATGGCTACGT GAATTCAAAC 7260 

AGCGAGCTTG ATCAAGCTTT TGCTAAAATT TCAAGCATTT TACCGAAATT ATTAGCACTT 7320 

GCAAATGTTG AAAAGACGTG CCAATTATTA TCAAAAGAAA TTGAAAAAAC GCGTCGTCGT 7380 

GTCAACGCCT TAGAGTATAT GACGATTCCA CAATTrGArG AAaCAaTTTA CTmCaTTCmA 7440

ATGAAATTwG rAGrAAATGA ACGTGGcGAA ATTACACGTC TAATTAAAAT TAAAAGCATG 7500 

AACAAgAAAA TTAGTGGTTA GGAAGAACGA GTGCGAAcCa GTTGGTAGCT TTCGTGCTCG 7560 

TTTTTAATAA AAAAGGGAGT GGAAAAAATG AAAAATGAAG AACGCCGCAA AGCGATTGCA 7620 

CTGAATTGTC AAAAATACGA AAGTGACTAT GCTCGCTTGG TGGAACCAAT CAATGaGCTA 7680

CTGTTAAACT TGGGAGCGGC TATTTCAGAA GAAGCCgCGA AcAAATCATT CTGAATGTAA 7740

AACGATATCA TCACGGCGTG AAATATCTTC CTGAGTGTCA TTTAGATGAA AGTAACCAAT 7800 

TTATCGAAGA TGGGCTTGAA GCTCTTAAAA AGGGAGATTT AGGCAACGGT GCGTTACAAC 7860

TATTTGGGGC CGGCTTAAAC TTTGCTAGTT TTGCAGCAAA AGCATAAGGA ACCAAAAAAA 7920

TTGATGCCCA CCAAATGTTG GCAGAACGTT TtACGAAGTT GTTGTCTGTA CaAACGGATA 7980

ATGATAaCAA ACAATAAGrA AAATACCCTk GGTTTATTAA GTCGACTTAG TAAACCAAGG 8040

syATTTTTTT AACTGTTATT GATTAGGGTA TTCAGACACC TGTAATTGCC ATAACTTATC 8100 

TGTCACTGAA GCAATAAACG CAGCACTCTT AGCATTATCA GGTCCTTTAG TAAAAATTGC 8160 

TAAAGCAAAG GGGTGAGGTG TTTCTAAAAT CCCCATATCA TGAATAAATT CATCATAGGA 8220 

ACCAATTTTA TGAGCAACTT TTCCTTGAGT TGTGGGAGTT TCCATTCTTT CATGGAAAAC 8280 

ACTTTGTTTC ATAGAATCAT AAATCGCTTG GTAATCTGGT TTTGTTGCTT TTTCAGTATA 8340

TAATTTTTGT AGAATAACAA GCGCATCTTC AGAAGAAAAT TGTGGCTCTT CaATCGAAGG 8400 

TGTCTTGTGC AAATAACGCT GATACATTTC TCGTTTTGCT TTTGCATCAC CACCCAATGT 8460

GTCATAGAGC ATATTTTTAG CGATATTATC AGAATAGGTG ATGGCATATT CTTGTAACGT 8520

TTTTAATGGA TATTCTGGTT GGATATTATA GGCAATGATC CCcAGTTCCT TCTTQATAAT 8580

CTTCTTCGGC GTTATAGGGA ATCAGGTCTG TCCATTTTTT TTGACCAGAA GCAACCGTGT 8640

CGGCAACTAG CATCGTCAGG GGCACTTTAA TAGTACTAGC TGTATAAAAT TCATGTTGGC 8700 

CATTCACGGA ACGCTGTTCG CCAGTT 8726 
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

GTAAAGACTT TTCACAAGTA TCAAAGAAAT CCGTCGTTAA TCTTTACTAT TCGACAAACG 60 

AACACCCTCT TGCTCAGCAA GAGGGTGTTT TTTTGTGTAA TTAAGTGGAA AATCAACATT 120 

GACTTTTTTT GAAGTCACAT AAGTGTGAAA ATTAGGTTTT TGGACAATGA TTCAGAAATC 180 

TTTTGAAAAT TGAAGGTGTG AGGAAATTTA GAAAGATAAA AAGAGGGAGT ACTGAAAACC 240

GAAATTTCAG TTAAGTGCTT GGATCAAGAA ACAAAAACTA CGTTAkTTTG TCAATATCAG 300 

TTGAAATTGG AGAAGAAGCA GAATGGGGTG ATAGTATTGG CAAATTAATT ATTTAAGGTC 360 

ATCGTTACAG GATTTTTGTT TTTTATATyC aATCmAAAAG AAATAAATAC AAAGaACACA 420 

GATAAGTATT TTAGCAA^TA ACCAATAAAT AGCAGGACTG TTGGTGCTAT AAGAATACAA 480 

GAAAATAGTA GTAAACTTAA TTGAGAATCG TTTGCTATTT AAATAGTGTT CTTTTCCTTT 540 

TAATATATAT ACGATAAAAT AAGTATATTA ATTGTAAaAG GACATGATAA TTATGGAAGA 600 

ATTAGAATGG AATwwTTTTT aTGTTTGCTA TAGTTTTGAG GGTAGTGATT CTTTTAATAA 660 

CTGTCAAGAT AGTAGATTAC TTTGnGAATA CTCCTGAAGT ACAAATACAG AGAAAACTTC 720 

CTGAGTTTGA AAAAGTGCCA CCTTATGATC CAAGAGATAA TTCATTTCTG ATACAAATTA 780 

TTCTTTAAGT AATACAACGA CTAAGTTTAT TTACCAAACC ATTACAATAA GAAAAGGTGC 840

AAGTATACCT AATTACTATT GGTATGATGA TGGTCAATTT AGAGGAAATC TAAAACTTCA 900 

ACGTTGGCAA GACTACGGAG AAGATCCAAA CATAAGAGGA TATATCTGTA CTTATGGAGG 960 

CTACGTTACC AATGGTTCAA GCCCGGCGCA AAGAATTGTC GATATTAATG AACAATAGCT 1020 

ATTCTAACTT TATGTAATAG GAGGGAGTAA TCATGAAAAA AATCGGGTAT TTTAGTTGTA 1080 

TTATTTTTTT CATGTTTTTG GTAGGTTGTA GTAATAACAA AAAAGAAAAC GGCAATCTTT 1140

TGAATGCCAG TTCGTTTCCT TTAATACTCA CCACGATTAT TGAAAAAGAA GAAGACCTAA 1200 

CGAAAGGTTC AATTTTTTTC AACAAGGATA AAACCATGAC GCTTGAAAAA GAATATTTAG 1260 

TTAATCCCAA TAATGAAGAC ACAAAAAAAA CAAGTAGAAC AGAAAAAAAG GTATATAAAA 1320 

ATATTAAAAT ACAAGAAAAT AAAGAGAGCT ATGAAATTAT AGGTCAATTG GACAAAAAAA 1380

CGAAAAAAAT AGAGTTTAAA AAAGTTGATG AAGGTAAACG TATATCTGAT GCAGAAGGTA 1440

ATGTGTATGG TGATTTTGGT GGTAAATAGG ATAGTTAAGT CATAAAAAAG AAGGCTAAAA 1500 

ATTTTTATTT TTTTCGTCTC TATACTTTCT GATGTTAGTT TTTTTACCAA CATCAGAAAG 1560 

TATAGAGACG TTTTTGTGTA ATTAAACAAA AAATCAACAT TGACTAAGAC CAGAAATACT 1620 

TCCTTTAGTG CTCCGAGTTG CTCTAATATA AGACACCAAA TTGATAACCA TTTGCTTGAT 1680 

AAAACTCTAT TAATTGTGGC AATGCTTGTT TTGTTAATTC CTTATCTACT GAATCATGCA 1740

TTAAGACTAC TCTAATATTA TAATCCGGAT AAACCTCAAG TGAATGTTGA TGGAAAGCGA 1800

GCATTTCTGC GACTGTAGTA GGCTGTCTGT CTAACGGCTC CGCATCTCCA ACCATCGCAT 1860

TCCAGTCAAT CCAATGGATG CCAAGTTGTT TGAATAATTC ATCAGATTTT TCTGTACCTC 1920

CCCATGACAT ATGACCTCCT GGGTATCGCC AGAGAGTTGT TTGAAATGTT TCGCCCAATA 1980

TTTTTTTAAG AGAGTTCTCC ATGGTTGCTA TTTCCGCTTG TATTTCTTTA GTGTTGACAA 2040

TCCCGTTTGG ATATAACGTT GCATAATCAT GAGTTGAACT ATGAAAGCCT ATTGAATGTC 2100 

CTTCTGCAAC TTCGCGTTTC ACTATCTTTT GATTTTCGGA AGTTAGGGTA TTTCCTACAA 2160 

GAAAAAAAGT TGCGTGGACG TTATACTTTT TTAATGTATC TAGGACTTGT GGGGTtATTT 2220 

GATTATTAAC GCCATCATCA AAGGTTAAGA ATACTAATTT TTCACCTTTG TAAGGTTCTT 2280 

CTCCAGTTAT ATATTTTCTG ATTTTGTCTG CTGAGTAAGC ATATTTTTTT GATGAAATAG 2340 

TCATGTGTGT TGATTCCTTT TCTAACTTGG TCGGGGAATC TTTTTTTGCA GTTGAATTTT 2400

TAGTCGATGT CTGCGTAGTA TCATCATTTT TACTTGTTAT CTTGGGTTGA GTTAATAGGA 2460

ACGTTTGAAT GAAAACAAGA ACAATGAGGA TTAACGCACC TAAATnATAC ATACCCGTnT 2520 

ATTTTGAATT T 2531 
(2) INFORMATION FOR SEQ ID NO: 97:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

ACCGCCGCcG CCAArGAtGa AcGAAAATAT CTcGAATCCA CCAAAACCAC TACTGCTAAA 60 

TCCGCCGCCA CCGAAACCGC CAAAGCCaCC aGCGCCGcCA CCGwAATTAG GATCAGTTCC 120 

TGCATGGCCA TATTGGTCAT AGGCCGCTTT TTTCTGTGGA TCACTTAATA CTTCATACGC 180 

TTCAGAAACT TCTTTAAATT TTTCTTCCGC ATCGGCTTCT TTGTTAATAT CTGGATGGTA 240

CTTTTTGGAA AGTTTGCGGT ATGCTTTTTT AATTTCATCA TCTGAAGCAC CTTTTGCTAA 300

TCCTAGCACT TCATAATAAT CTCGTTTCGT CGCCATTGAC TTCCCTCCAA CTTTCTTCAA 360 

ACTTTACAAT GTGTTTAGAA CCAGCCAAAT TCATCGATTG CTTTCGGTCT CTCTGGTTGG 420

ATTATCTGTC ACACCTTGAA TCAGTGTTAC TAGCTGTGAA TAGGTATTTT CAGCTTGTTT 480

AACATCGTTG TAATCATACC ACAGTTCTTT CAAAACCTAA ATAGTTTCTT GCTAACTTGA 540

TTGTGGTTGA TTTTCTCACA AAAGACAAGG GACTGGGACA TTAGTCATTG TACTAATGAT 600

CCAGTCCCTA AAATCaATTG TTATTGATTA TTTGTCATCA CCATTTACTT CTTCAAAATC 660

AGCATCAACA ACATCATCAG CGCCACCTtG gAGCAGCTTC AGGgATTTTC TTGTGCTTGT 720

TGTTGCGCat TGgTTCGTAA AGTTTTACTG TTAAGTTTTG GACGATTTCA TTTAATGAAT 780

CACGTTTTGC TTTCATTTGT TCGATATCGT TTGCTTCGAT GGCTGCTTTT AATTCATCAC 840

GCGCATCTTC TGCTTTTTTC ACTTCTTCAG CATCTACTTT GCCTTCTAAT TCTTTCAATG 900

TTTTATCAAC AGTGAATAAT AAAGCATCTG CATCGTTACG TAAGTCAACT TCTTCTTTAC 960

GTTGTTTATC TGCTTCAGCG TTTGCTTCAG CATCTTTCAC CATACGTTCG ATTTCATCAT 1020

CAGATAAACC TGAAGAAGAT TTAATCGTGA TTGTTTGTTC TTTTTGAGTG CCTAAGTCTT 1080

TCGCACGAAC ATTTACAATC CCGTTTTTAT CAATGTCAAA GCTTACTTCA ATTTGTGGCA 1140

CACCACGAGG AGCTGCAGGA ATATCTGTTA ATTGGAAACG ACCTAATGTT TTATTATCAG 1200

CGGCCATtGG ACGTTCACCT TGTAAAACGT GAATATCAAC AGCTGGTTGA TtGTCAGCAG 1260

CAGTTGAGAA TACTTGTGAT TTACwTGtTG GAATCGTAGT GTTACGATCa ATTAATTTTG 1320

TAAATAcGCC ACCCaTTGTT TCAATCCCTA ATGACAACGG TGTTACGTCT AATAAgACAA 1380

CGTCTTTAAC ATCACCAGTG ATTACACCAC CTTGGATTGC AGCACCCATT GCTACTcTTC 1440

GTCAGGGTTA ACTGATTTGT TTGGTTCTTT GTTTGTTTCT TTGCGAACAG CTTCAACTAC 1500

AGCTGGAATA CGTGTTGAAC CACCAACTAG GATAACTTCG TCGATTTCAG AAGGATTTAA 1560

TCCAGCATCT TTTAAGGCTT GACGTACTGG TACTTTTGTA CGTTCAACTA AATCACTTGT 1620

TAATTCATCA AATTTTGCAC GAGTTAAGTT CATTTCTAAG TGCAATGGTC CTGcTTCGCC 1680

AGCAGTAATA AATGGCAAGC TGATTTGTGT GCTTGTTACA CCTGATAAGT CTTTTTTCGC 1740

TTTTTCAGCA GCATCTTTCA AACGTTGTAA TGCCATTTTA TCGTTAGCTA AGTCAATGCC 1800

ATTTTCTTTT TTAAATTCTG CAACCATGTA GTCAATGATC TTATTATCAA AGTCATCCCC 1860

ACCTAGGTTG TTATCACCGG CTGTTGATAA TACATCGAAA ACGCCATCGC CTAATTCAAG 1920

GATTGACACG TCGAATGTAC CACCACCAAG GTCAAAGACA AGAATTTTTT CATCTTTGTC 1980

TGTTTTATCT AAGCCATAAG CTAAAGCTGC TGCTGTTGGT TCGTTAACGA TCCGTTCTAC 2040

TTCTAAACCA GCAATTTTAC CAGCATCTTT TGTTGCTTGA CGTTGCGCAT CGTTGAAATA 2100

TGCAGGAACT GTAATAACTG CTTTTTCAAC TTTTTCACCT AAATAGTCTT CAGCAAAACC 2160

TTTTAAATAT TGTAAGATCA TTGCAGAAAC TTCTTGTGGT GTATATGATT TTCCTTCAAC 2220

ATCAACTTTG TAGCCAGCTT CACCCATGTG ACGTTTAATA GATGAAATCG TGTTAGGATT 2280

TGTAACAGCT TGACGTTTTG CTACTTCGCC CACTTgAATT TCACCGTTTT TGAATGAAAC 2340

GACCGAAGgA GTTGTGCGGT TACCTTCTGG GTTTGCAATA ATTTTCgctT CGCCGCCTTC 2400

TAGTACTGCG ACAGCTGAGT TTGTTGTTCC tAAGTCAATA CCAATAATTT TACTCATAGT 2460

GAATATCTCC AATCTGTATT AGTTTTCTAT AATAATATAT TTTTAATTAA CGTTTGTTAT 2520 

TGTGCAACAA TGACCATTGA TGGTCGTAAC ACACsATCTT GTAATTTGTA CCCTTTTTGT 2580 

AGAACTTCAA CAATGGTATC TGCTGGGGTT TCTTCAGAAG CAGGGACTGT TTGAACGGCT 2640 

TGATGTAAAT TTGGATCAAA GGTTCACCCA TTGCTGGGAT TTCTTCAATT CCTTCTTCTT 2700 

TTAAAGCCAC AGTAATA 2717 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



GAATTTAGAA ATGAGGACTA CTGATGATTT TATTACAAGC AAATCAAGTT GCCCGGCATT 60

TTGGCTCGGA AACATTGTTT GAAAACATAC ATTTAGAAAT TGCAACAAAA AGTCGGATTG 120

CCTTAGTTGG TCGTAATGGT GCTGGAAAAT CGACTTTTTT AAAAATCATT GCAGGCATTG 180

ATGCTCCCGA TAGCGGAACC ATTGCCAAAA ATaAAACTGC TACGTTAGGT TATTTAGCTC 240

AAAATACCGG CTTAGAATCA GATAAAACCG TTTGGGAAGA AATGACAAAA GCCTTTGCTG 300

ACATCCTAGA AATGGAACAg CGTATGCGAG aATTAGAAAC TAAAATTAGT GAAATGGAGC 360

CAACCACTTC CGTTTATGAA GGAATTTTAA AAGAGTACGA TCAATTGCAA CATACGTTTT 420

CTGAAAAAAA TGGCTACGGC TATGAAAATG AAATTCGCTC AGTCCTTCAC GGCTTTGGCT 480

TTGATGAATC CTTTTACACG AAAGATATTC AAACCTTATC TGGTGGTCAA AAAACCCGGC 540

TTGCATTAGC GAGAATGCTT TTACAAAAAC CAGACATTTT AATTCTGGAC GAGCCTACAA 600

ACCACTTAGA TATCGAGACG CTTtCTTGGC TGGAATCTTA TTTGCCAAGT TATGCCGGCG 660

CCCTATTAAT TGTTTCCCAC GATCGtTATT TTTTAGATAA GGTAGTTAAT GAAGTTTATG 720

AACTGAGTCG CAAAAAAATG ACTCACTACA AAGGaAACTA TTCCAAATAC TTAGAGTTAA 780

AAGCAGAACA ATTAGCCAGT GAATGGAAAG CGTATGAAAA GCAACAAGAA GAAATCAATA 840

AGTTAGAAGA TTTCGTTGCC AAAAATCTGG TTCGTGCATC TACAACGAAA CGTGCACAAA 900

GTCGCCGAAA AGTATTAGAA AAAATGGACC GTTTAGACCG ACCTCAAGGA GATGAAAAAT 960

CGGCGCATTT TCTTTTCGAT AGTGAAAAAG TCTCGGGAAA TGTTGTTTTA CAAGTCGAAG 1020

ATGCCGCCAT TGGTTACGAC CAAGAACATA TTTTATCCGA ACCTATTCAC TTGGATATTC 1080

GTCGCAAAGA AGCCATTGCC TTAGTCGGAC CGAACGGAAT TGGTAAATCC ACTCTCTTGA 1140

AATCAATTAT TGACCGCATT CCTTTCATTA AAGGAAGTAA AACTTTTGGC ACCAATGTTT 1200

CTGTAGGTTA CTATGACCAA GAGCAAGCCA ATTTACATGG CAATAAAACG GTCTTAGCGG 1260

AATTATGGGA TGAACACCCA ACCACACCTG AAAAAGAGAT TCGAAGTATT TTAGGCGGCT 1320

TTCTCTTCAG TGGAGACGAT GTTGAAAAAA XGATTCCTTT ATTAAGCGGT GGCGAAAAAG 1380

CCCGTGTGGC ATTAGCAAAA CTAGCGATGG ATCGTGACAA TTTCTTGATT CTCGATGAGC 1440

CAACCAATCA CTTGGATATC GATAATAAAG AAGTTTTAGA AAATGCGCTG ATTGATTATG 1500 

AAGGAACCAT CCTCTTCGTT TCCCATGACC GTTACTTTAT CAATCGAATT GCAACAAAAG 1560 

TTGTTGAGCT TTCTGAAAAA GGCAGCAAAC TTTATTTAGG CGACTATGAT TATTATTTAG 1620 

AAAAGAAACA aGAGGAAGAA GAAATCGCTG CCCTCTTAGC TAATGAAGAA GCGGCGAAAA 1680 

AACCCGAACC AGTTACAGCC AAAAATACCT TTTATCAAAA CAAGGAGCAA CAAAAATTAC 1740

TCCGTACTTT GCAAAGAAAA ATAACACAAG TCGAAGAAAA TCTTGCTCAG TTAGATACGA 1800

CAATTGCACA ATTAGAAGCA CAAATGAGTC AACCAGACAT TCTAGAAAAT CATGTTGAAT 1860

TGCTGGCTTT AAACCAACAA TTAGATGAAG TTCGTCAGCA ACAAGATGAA CTACTTGAGC 1920 

AATGGGAAAA TTTCAGTTTA GAATTAGAAG AAATGGAAAA TAACAATTAA CGGAGGGATT 1980 

CACTATGGCT TCCAAAAAAA TCAGTTTAAC GAaGCCCAAA TATCAACAAA TCGCCGTTGA 2040

TGTTGCGGAA AAAATTGCAG AAGGCAAATT ACATGTTGGA GATAAAATCC ATGCCCGTTC 2100 

CACCCTAGCT AACCAATACC AAGTGTCGCC AGAAACAGCT CGCAAAGCCA TTATCGTTTT 2160 

AGTTGATTTA GAAATCGTTA AAGCCAAACA CGGCAGCGGC TTCTATGTGG CATCAAAAGA 2220 

AAAAGCGCAG GACTTTGTGA CACAATATCA AGACGTTCAA ACGATTGCTG AAATTAAAGA 2280 

AGAATTGCTA GATAGCGTAG cGAAACAAAA AGAAGAACTT ACTCATTTTT CTAGCATTCT 2340 

AGATACTCTT GTTGAACAAA CAAAGCGTTT TGATTCTTTC AATCCAATGA ATCCCTATTC 2400 

ATTAGTTTTA ACAGAAGAAG CTGCTTATCT TGAGGCGACG ATTAGCGAAA TGAATTTTTG 2460

GCAAAACACT TCGGCAACGA TTATCGCCAT CAAACACAAA GAAGAGCTTC TGGTCTCACC 2520 

TGGTCCATAT GCAAAGATTT CGTTGAACGA CACACTTTAT TTTGTTGGAC ATGACGAATC 2580 

AACCTTACAA CGTGTCCAAA ATTTCTTTTA TCCTTAAACA AAAAAGTGAC TACTTATCTT 2640

GCGGTAAATA GTCACTTTTT TTATTTATTA TTTCTATTTT TCATAAAAAC CCACTATAAA 2700

ACCTAGTTAA AATACTGTTT TTCTTCATTT TCAGAAAGAT TTTCATCTAA TTTGACAAAG 2760

TGACAACTTT ATAGTAGTCT GATGTAGTCA CTTTTTTTAA TGAGGAGGAA CAATGATTTG 2820 

CCAAAAGTAA AAgTAAATCA CCTTACCAAG ATCTTTGGtA AGAAAACCAA ACCGGCACTT 2880 

GAAATGATTC GTGCCaACAA AAGCAAAACA GAAATTTTAG AAAAAACGGG TGCTACAGTC 2940 

GGTGTTTATG ATGTAAATTT CGAAGTTGAA GAAGGCGAAA TTTTCGTTAT TATGGGGTTA 3000 

TCAGGAAGTG GGAAATCAAC ATTAATTCGT TTATTAAATC GTTTGATTGA ACCAACTTCA 3060 

GGAAATATTT ATATTGATGG ACAAGATATT TCTTCTTTAG ATAAAGAAGG ATTACGTGAA 3120 

GTACGTCGAA ATAAAATGAG CATGGTTTTC CAAAACTTTG GTCTTTTCCC CCATCGAACT 3180 

ATCTTAGAAA ATACGGAATA TGGTTTAGAA ATTCGTGGTG TTCCTAAAGA AGAACGCCAA 3240 

GCAAAAGCTG AAAAAGCCTT AGAAAATTCT AGCTTAATTG CTTTTAAAGA CCAATTACCA 3300

AGTCAATTAT CTGGTGGGAT GCAACAACGT GTCGGCCTTG CTCGTGCCTT GGCTAACGAT 3360

CCAGAAATTT TATTAATGGA CGAAGCGTTC TCTGCTCTTG ATCCGTTAAT TCGTCGTGAA 3420

ATGCAAGACG AACTATTAGA TTTACAAGAA AATGTCAAAA AAACCATTAT CTTTATCACA 3480

CATGATTTGA ATGAAGCTTT ACGAATCGGC GACCGGATTG CTTTGATGAA AGATGGTCAA 3540

ATTATGCAAA TTGGGACTGG AGAAGAAATT TTAACCAACC CAGCCAACGA ATATGTGCGA 3600

ACCTTCGTGG AAGATGTGGA TCGTTCAAAA GTCTTAACTG CCCAAAATAT TATGGTACCT 3660

GCTTTAACTA CTAACATTGA AATTGATGGC CCAACTGTTG CTTTAAAACG GATGCGTCAA 3720

GAAGAAGTGA GTATGTTACT TGCCGTGGAT AAAAAACGCC AACTAAAAGG CGTTGTACGC 3780

GCAGAAAAAG CTTTAGAAGC GCGTAAAAAT GGGACTTCTC TTGTAGAATG TGTGGATCCA 3840

GAAATTCAAA CAATCGACAA AGATATGTTA GTGAATGATA TTTTCCCACT GATTTATGAT 3900

GCACAAACGC CTCTTGCCGT GACTGATAAC GGCAAACTAT TAGGCGTGGT CATCCGTGGT 3960

AGCGTACTTG AAGCACTTGC AGAAACAGAG GTGAACGAAC ATGAATAAGT ATCAATTACC 4020

TGTCGCGTCT TGGGTTGAAA GTTTTACTGA TTGGTTAACT TCAACATTTG CTGGATTATT 4080

CAGCTTTTTA CAAACAATTG GTCAAAGTGT AATGGACAAC ATTACTGCCT TACTAACAGC 4140

CGTACCACCA CTTGTGCTGA TCGTGCTATT AACGATTGCT GCTTTCTTTA TTTCTAATAA 4200

AAAAATTGGC TTAAGTTTAT TCACGTTTAT TGGCTTAATG TTTATCTATA ATCAAAACTT 4260

ATGGAACGAT TTAATGAGTA CCGTTACGTT AGTTTTACTT TCTAGTGTTA TTTCCATTAT 4320

CATTGGTGTG CCACTAGGGA TTTTAATGGC GAAAAGCGAA AAAGCCAAGA GTATCATTAC 4380

ACCGATTTTG GACTTCATGC AAACAATGCC TGGTTTCGTT TACTTGATTC CAGCCGTTGC 4440

ATTCTTCGGA ATTGGTATGG TTCCTGGGGT CTTCGCCTCT GTTATCTTCG CTTTACCACC 4500

AACCGTACGC TTTACGAACT TAGGAATTCG ACAAGTGCCA AATGAACTTG TTGAAGCAGC 4560

TGATTCTTAC GGTAGTACTG GCTGGCAAAA ATTATTCAAA TTAGAATTAC CTTTAGCCAA 4620

AAACACGATT ATGGCTGGTG TTAACCAAAC AACCATGCTA GCCCTATCAA TGGTAGTTAT 4680

TGCTTCAATG ATTGGTGCGC CTGGATTAGG ACGTGGCGTC CTTTCCGCTT TACAACGTGC 4740

ACAAGTCGGG AACGGTTTCG TAAATGGTGT GGCTTTAGTT ATTTTAGCGA TTATCATTGA 4800

CCGTTTTACT CAACATTTAA ATAATAAAAA AGCAGCTCCT AAAGCAGCTG GTGCCACCTC 4860

TAAAAAGAAA AAATACGGGA TTATTGCAGC GGTTGTTGTA ATTGTCGCTG GCTTAGTTGG 4920

TGCTTCAATT TTTACAACAA CAAATGATAA GCAAATTTCC CTTTCTTATG TGGAATGGGA 4980

TACCGAAGTA GCCTCCACAC ATGTGGTTGC CGAAGTCTTA AAAGATATGG GGTATGATGT 5040

TAAAACGACC CCTTTAGATA ACGCAATCAT GTGGGAATCT GTAGCTAAAG GTGAAACAGA 5100

TGCCATGGTT GGTGCGTGGC TGCCAGGAAC TCACGCAGAG CAATATAAAC AATATAAAGA 5160

TAAATTAGAC GACCTTGGTG AAAACCTTAA AGGCGCGAAG TTAGGAATTG TTGTCCCTTC 5220

TTATATGGAT GTTGATTCTA TCGAAGATTT ATCTGATCAA GCTGGGAAAA AGATTACTGG 5280

AATTGAACCT GGGGCTGGTG TTGTTGCAGC AGCAGAGAAA ACTAAAGAAG CTTATCCTAA 5340

TTTGAAAGAT TGGTCTGTTG AAACTTCTTC ATCAGGTGCC ATGACTGTCG CCTTAGGACA 5400

AGCAATTAAA AACAACGAAG ACATCGTTAT TACTGGCTGG TCTCCTCACT GGATGTTTGC 5460

AAAATATGAT TTAAAATATT TAGCTGATCC AAAAGGCACT ATGGGCGGCG AAGAAGCGAT 5520 

TCACACAATG GCTCGCCAAG GCCTAAAAGA AGATCAACCA GAAGCTTACA AAGTGTTAGA 5580 

TAATTTCCAT TGGACCACTA AGGATATGGA ATCTGTCATG TTAGAAATTA ACGAAGGCAA 5640 

AGACCCACAA GAAGCTGCAC GTGATTGGGT TGATTCACAT AAAGACCAAG TAGCAGAATG 5700 

GnAAAAATAA TTCTTAACAA ATGACTTCCC CATCTATTAA AAGrGGGCGT TGAACAAATT 5760

GTTCAACGCC CTCTTTTTTC GCAAAAAAAA CCACTCAGTT GGGTAGACTG AGTGGAAAGG 5820 

AGTTATTTTT ACTTGAGGAG TAAAAATGAA AAGTGTTTAT GTTGGGTGTT TTGTTGGTAT 5880 

GCTTATATAT TAACGAGAAC CTGTGAATAA TTTGTGACGA AAACCTTGGA AATTTGCTAC 5940

GAATTACTTA AAAAAGTTTA AATGTGTCTC ACCCTAGTTA ATCGATACAG TGATTATTTC 6000 

TTTTTAGCAT TTTGGATGGC TTTAAATAAT GTTGAAAATT GTCCTTTGAA AGACATCACA 6060 

TCATCAAAGA TATCAACACC ATCGTTTAAC ATTTTGACGG TAACCCAGCC AAGCGCTTCC 6120 

GTAATTGCTA CCGCAACTGA AGCATTGACC ATGCCACCAG TCACAGTTCC GACAACAGGA 6180 

ATCACTTTTA GAATATTCCC GACTGCACTC TTACCTAAAC TAACCACTAC TAATTCTTTG 6240

GTTAAGCTTT TCCCTAAGCT TTCAGACCAA GACTGACCAA AAATTTTATG CAGCCGTGAC 6300 

ATCATAGTTA ATTGAACAGG TACCAGTAAA AAAGCATCCG AAAACGGAAT TGGCGAAAAC 6360 

CCAATAATTG CTGCTGACAA TGCAGCAAAA TGAATCGTTT GATGACTTTT CTTTTTCACG 6420 

TCAGGTGAAA CACCAGCCAA AAATTCATCA AATACTTTGT CGAGTTGTGT TTGACTGTTC 6480

GTCACTAATT GTTCTGCTTT ACCTTTAGAA GTTGGGAAAG CATTCATTAT AGCCGTAGAC 6540

GTTTTTTGTG GAAATTTTTT CAAGACATCT TGAAAAAAAG AACGAAATTC TGGGTCTTCG 6600 

ATTTCTTCAT TCAATTGCTT AGTTTCTTGT TCTGCTAATT CTTGTTTTTC AATCTCTTCA 6660 

AAAAGTTCTT GTTTCTTTTT GTTTTTATGT TTTTTTCTTC TCATCTCATT AACTCCTTTA 6720 

TCTATCAATA TGTAGAAACA AAGTATAGCA TAGCCCCTGA TTGATTCTCT TTTAGATGAT 6780 

TTGAAAAGCC AGAACTTATC TGTTTTTTAA ACAAATAGAA ATGTTCCAGC TTTTCTCAAA 6840 

AAAATCAGCT ATTTTGTTTG TTCTTCACCG CAATAAACCG TCAAGTGTTT AGGCAAAATC 6900 

TTCAGCGTAA TTGGCAAAGC AGCTCCTTCA TCACCATCCA CATTAGTGGT TAATTCTTCT 6960 

TGGTTTTCTA AGGAGATTGT TCCTTCTTTA AAGGTTAGGT AAACTAAATT ATCAGTGGAC 7020 

TGGTCGACGC CTTTTAACAA ATCTGGGACT GCTTTTACTG CATCCCATAA TGATTGATcT 7080 

TTTAAATAAA CTAAATGAAG TTTGCCATCA TCCACTTGGG CCTCTGGTAA TAATGTTTCA 7140 

AACCCACCGA TTGAATTCGT TAAGCCAACT AAAACGGTAC TACTTTCAAT GGTTTGTTCT 7200 

TTTTGATCCA AGCTTAAATG AAAAGGATAC GTTTGAGCAT TCGCTAAATG TTTGGCACCT 7260 
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GAAATAAAAT AAGCGAGTTT TCCTAACTTC GTTTTTTGTT CAACGTCCAC GTCGTTAATT 7320 

GACTCTGGAA TCGTCCCAAT TGCAACGACA TTCATAAAGT AGTCGTCATT GATTTTCCCC 7380 

ACATCTAAAG CACTGGTTTT TTCTAAATCT AATTGTTGAA TCGCTTCTTC AGGGTCCATC 7440

GGTAAATTCA AGGCTCGCGC TAAATCGTTC ACCGTTCCTA AAGGAAAAAA GCCAAATTTT 7500 

GGGCGGTAAG CTTGTTCTGC CAACCCGCTA ATTCCTTCAT TAACGGTTCC ATCTCCCCCC 7560

ATTACGAAAA CACTGTCGAA GTGACTTTCC GCAGCTTCAC GGGCAAATTG TTCTGCATCG 7620

CCGCCTTTTT CTGTTTGTTT CACTACAACT TCATCAAATA ACTGCTTTAA TTTTTCTTCT 7680

GCTAACGTTT CAAATTCTTT GGCTTTTTCA CCACCAGAAC TGGGGTTAAC AATTAAAACT 7740

GCTTTTTTCA CTGGTACCAC TCCTCATTGA TATCTAAATC GTTTGATTCC TTACAATTAT 7800 

TATAAAAGAT ATTTTTGATT CTGGCGAAAA ATATGCTGAC ACTTGCTTAA CAAACAAGCA 7860 

TTCCTGCTCC TCACTACTTT TCAAAACGTT CTTTAAAGCA ATAGGCAAAA TAATAGAACA 7920 

ATTTCCAAAA ATCTTGCCAA ACTTTTGGAT CCATCCCCGT CAATTCATGA ATTCGTTTCA 7980 

AACGGTAATT CAATGTATTT CGATGGATAT GTAATACTTG TGAGGTTTCA TTGATGTTTA 8040

AGTTATGaTT GGCGAAAAAA ATAACGGTTT CATAGTATTC ATCATAAATC GTCTGGATGG 8100

TTTCAAAAAA ATCCAATTCA ATATGAAAAG GAAAATTAAA AATTCCTTGA ATGAAAAAAT 8160 

CTTCTGCTTG GTAAAACTTA GCCTTTTCTA ATTGGAGAAA CGATAGAAGT AAATACGTGT 8220 

TTTGTGCCTC TTGGACCATT TGTTGAATCG TGGCTTTTTC CGAACTTATC ACAAGGTATT 8280 

CTGTTACTGG CTTTGTTATC TGAGGTGGTT CAGAAGTAAT CACCAGATAA CAATTTTCCC 8340 

ATTCAAATAT TTCATGTCCA GGATAATAAC TGGATAATAA GCGCTTGTCC TTGGTCAGTA 8400 

CACAATGATT CTGGTCATCT AATTGAAGGT TATATAGTTC CAACGCTTCA AGTTTTAAGG 8460

CTTCTGAGTA AGAAGAATCA ACATGCATTA AGTGCTGGAT 7U^ACTGATTT TTTTGTTTCT 8520 

TCTTTTTTTC GCGTTCTTGT TGTGTATTTA ATTCTTCTGT CAGTAAAAGA ACAATCGAGC 8580 

GCACAAGCTT GGTAAATTTG CGCACCTCGT CTGGTTCTCC TGAAATGCCT ACGACACCTA 8640

AAATGTGATT ATTCAAAATA ATCGGTTCAT TGGTGCCTTT TTTCTCTAAA GAAGTATCTT 8700 

TATAAATTTC AATATTTTTT CGTTCATTGA TGGCTCTTTC TGCACCACGA TGGCGCTCCC 8760 

CAATTCGAGC AGAATCTCCA CTCGCAATAA TTTTTCCTCG TTCATCCATA ATATTGATGT 8820 

TATAAGGAAT ATCTGCCATC AACTTTTCAA CAATTTTATT TGCTTGGATT TTGTCTAATT 8880 

TTAACATTGC TATCGCCAAC TTTCTAAAAA GCGAGTaGCG TTTGGGACAT AAGCCGCAAT 8940

GACTTATGTC CCAAACgCTT AATTCTTCAG ATCAACGACT GAACATTTTT ATTTCAGCTT 9000 

TCATTACTGA TGATACCATG GTTAGTTCTT AGTCACAAGG CACCAGCTGT AATGTACGAG 9060 

CAATATTTTC AACGGTTCTT TCTAAATTAA CTGGTCCATT TTTCAATGCT GCTTCCAATG 9120 

GTTCGGCTTT GGCCAAAATT CCAAAAATAG CCGTCATCCC TTCTTCATAG AGGACATCCA 9180 

CATCTTTGCC GATATAGCCA GCACAAGCGA AAACCGGTTT TTGATACTTC TTGGCCAAAC 9240
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GCGAGACGCC ATAAGGGGCT TTGCCGAATT 
TAAAGACATA ATCAGCGGCT TTTATTTTTT 
CAATTCCTGG GCGTAACTGA GCTTTTGTGA 
CAGCTCCTGC GCCTGGCATC ATCTCAACAT 
CATAATGGGC TAAGTTTTTA TCTAATTGAG 
GACCAAAGAC ATGAGACGCC CCTGTTGGCC 
CTAGAACTTC GGTAGCAAAA ATGCGTTGAT 
TATCTAATGC ACCACCGCCA CGTGTCAATT 
CAAGTGCCTG AATCATACCT GCACCCCCAT 
TGATTTT-TTT TGCCCCATGA TTGAGCGCAT 
AAGTGATCAA GGGATTGCGT TCTGCTGCTT 
TTTCAATGAC CACCGTTTCT TGGTCACCTA 
CAGGAAGCGG TCCTGCCACG CGCACTGGTA 
AATCCACCGT TCCTTCCCCG CCGTCTGCCA 
ATACAGCTAA AATCCCTTTT TCCATTGCCA 
TAAATGAGTC TGGAGCCAAA ACAAATTTTT 
AGTTTAAAAT AAAGCCATAT AAAATGGTTG 
TTTCATAAGG AATCACTCGA ATTCTTTCTT 
TATGGAAATA ATTGCCTTGA GGAACACTAT 
CTGCTGCTGA AAGTGAAGAT GTGCCCATAT 
CTAAAATAAC CGCTGTGGAT GTTGAACCAA 
GTGCCAAGAA GGTTCCAGAA ATTCCCATAA 
AATCAGAAAC AGAAATTAAT CCAGCAATTG 
CGGTTACTCT CTTCAAGCCA GAAGATGCAT 
CAATTGTCCC AACAACTGAG GCGAACGGCA 
TGATGAAAGA AATATTTAAA ACGGCGCCAA 
ACACTGGCGC GATCAACGCC TTTTTCAAAG 
GATCTGTTTC TTTGACCATT GATCCTTTGT 
TAATTAAGCC AAATACGGCT GGAATAAAGC 
CAAAGCCATT CGCCGCTGCA ATAGTATTTG 
CGCCACCTGA TAAGGCAATC AATAATGATA
ATAGAGCAAT CGGGGCAACA ATCAACACTG 
TTACCATTGT GGATAATGCC AAAGATAAAA 




TTGTTTGAAA ATCCATACCG 
CTTCTAATTG ATTCACTTCT 
ATGCTAGTAA CCCTGCACCC 
CAATGCCGAT CTCTTTTTTG 
CCACCATTTC TTCTGTCGCA 
CTGTTAGTGG GTTATTCACA 
CAAACTGTGT TAAATCGATC 
CTTGCCCTTC CTTGTCTAAC 
CATTTGTCAC GCTACCACCA 
GTTGAATCAT CTCTCCTGTA 
TAACCAATTC AATTCCGTTG 
ATAAGCCGTA ATAGGTTTCA 
CTCTTGTACC TCCTGTTGCA 
TCGGGACATG TTGAATGTCT 
GACAGGCTTC TTTTGCACTC 
TCGTCATCTT TTCGCGCCTA 
CAACAATCGT CATTACCCCA 
TAATCGTCAT ATTCATACTT 
CAATCACAGT TGCACCAGTA 
CTAAAATCGC TGGTCCAAAA 
CTGCTAAGCC CATTAAAATA 
CAGAAATTAA GTGGACAATT 
CACCTGCACC AATTAAAATT 
AAAAGGCCAG TTGTTTTGTT 
AAATATACAG AGCATCAATA 
TCGGATTAAG TAATAATAAG 
GTAATAATTC TTTTGTTTGT 
TTCTCATGAT ATTTGCAAGA 
CAGCAATCAT AACGTCACTT 
GATTAGGCGA AATAATATTG 
ACTTACTGAT ACCCGTACTT 
CAACAGGAAT GAATACGCCC 
TAGCTTTTTT CTCGCCAAGC 



CCTTCACCAG 9300
ACGACAATAT 9360
AAACCACCAG 9420
ATGATCGCTG 9480

CCTTTTTGCG 9540 
TCACTTGCTA 9600 
TGCGCCAGTT 9660 
AAACGCGCAC 9720 
ATGCCGATAA 9780 
CCATAAGTAG 9840

GCTTTCGCCA 9900 
ACGATTTGAT 9960

TCCACTAAGG 10020 

GCATCTTTAA 10080 

ATACTTTCTT 10140 

CTTTCCAACT 10200 

CCGACACACA 10260 

TGTGCCGTAA 10320 

TGCACCATGA 10380 

GAACCTGTCG 10440

CCAGAAATCG 10500

TGTGTTGATA 10560 

AAAACGGTAT 10620 

TGCCCAATCG 10680 

TTCAAACTTT 10740 

ACAATCGCTA 10800 

TCCTCTACAA 10860

ACAACAGCTG 10920 

AGAGGTAAAT 10980 

CCCGCTTTGC 11040

TTCCCAACAG 11100 

ACTGCTGTAA 11160 

TTTTGTACAA 11220 
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TGGCGTTGGC 
CTGCTAAGAC 
CTGTATCTGA 
AATACACAGG 
CAGCAACCCA 
TTTCATTCTA 
AATGCACAAA 
ATAATGACCA 
TTATATCCAT 
AATGAATATA 
AATCACAGAA 
CTTCATCCGT 
TATATGATGC 
GAGTGTGCCG 
AATCCCTGGT 
ACCATGTGAA 
TTGAATATCT 
CCCGCTACTG 
GCTTTATACA 
AAGCCGCCCC 
AGATACTTTT 
AAGTATTCTT 
TAAAATTGTT 
GTGGAATAAC 
ATAAATGCAT 
CAGAGATTAG 
TCAGCAGCTT 
TTGATTTCAT 
GCCTGACGTA 
AATCCTAACA 
CATAAATCAT 
CCCAACGTGC 
AAAAAGCGTG 



AATTGATTCT 
ACGGATTGCG 
CAGGGAAGCG 
ATTAAATTTT 
AGATATTGTA 
TCAAAGTCCT 
AAAACCGTTC 
CCAATCCAAC 
GAAATAGCkT 
GCGGCTATTA 
GAAGGAATAA 
CGAATGGACT 
ATCAAAAAGA 
CTGAAGGGAC 
ACTGCACACA 
CGAATTCGCT 
GTTTGATATT 
ATTAGTAGTA 
AAGTGTAGCA 
TAAAAGTTAG 
TTTATTTCCG 
GACCTAAGTC 
TATCTTCGGC 
CAATATATTC 
AGGCACCCTC 
AGCCTTCACT 
CACCAGAGAA 
CGGCAACAAT 
ATTGTTGGTC 
CTTCGCGAGC 
CCCAATGTTG 
CCCAGAAGTA 
CATCAATGTT 



GCCGCACCAG 
GTTCCCATAA 
CCTCCAATAA 
TTCAAAATTA 
ATATCTGGCA 
TTCTTTTTAA 
TTAAAGAACA 
GaACAAAAGA 
CGCCTCCACA 
AATTTTTTAA 
TTGGAATCAT 
GGACCCAATG 
TTGCAACCAA 
CAAAACTCAA 
TGATCGCGGT 
CTAATAATTG 
CTAGATACAA 
TTCGGATTGT 
QAGGATTCTT 
ACTAAAAATC 
ATACATCTTA 
TTGGTAAACT 
AACTTCTTTT 
TGCATTTTGT 
TTTATTTTTG 
GGGAATCACA 
AGTTACAGCA 
TGCTTTGACA 
GTTTTTACTG 
GCCATCAATC 
GATCTGACGG 
CGGAACGGAA 
TTCTAAGCCT 




ATTCCATCAT 
TACTTTGTGT 
GACAACCAAC 
ATACTATCGC 
TTCAAAATCA 
AATTGTAAAT 
GTTTTTTGAA 
ACACTCACAA 
TGCGTAAAGT 
TGGCAACTGC 
GTAGCCCACC 
ATTGGTCTCC 
ATTTCCCATA 
TCCTGCAATC 
CAAAAAAACC 
TGTCTTACTG 
ACGGTAACCT 
TTTCCTCTTC 
TGTGCCTCAA 
TAACTTTTAG 
AACTCCAAGA 
TCTAAATGTT 
GGTAATAGTT 
GCCGCATTTT 
GCTGTCTTAG 
TAATGTAGAT 
ACTGCACTTT 
TTGTTCGTTA 
TTTAACGAAT 
AGCATGACAT 
CCGTCAATAA 
AACTTATTTT 
TTTAATTTGC 



CACACCCGCC 
TCCGTTTAAT 
AATTGCACCA 
TACAACTAAT 
GCTCCTTAAA 
TTAACAGGAT 
ATCTTTTTCT 
TTAACAATAC 
ATTGAGGAAG 
ATAGTATCTG 
ATAATCCCAA 
GTTTTTTTAT 
CTATTCCCTA 
ACACCAATTA 
AACAATAGCG 
GCTTTAGGAT 
ACCAGCAGGA 
ATTTTCTTTG 
TTAGAAGCTT 
GGGCGATTCG 
ACAGATCGTT 
TGATAGTTTC 
TTTTAGCTTC 
CTGGTCGTAA 
GCATCACAAT 
GTTCATTGTT 
CTTCATTAGc 
ATCGGTTTAA 
AGCCTAAACT 
TATTTTTTAA 
ATTTATCATT 
TGGGATCAAA 
TGTGATCAAG 



AACATGCCGG 
AGAACAGCGA 
AAAAACAAGG 
CCTAAAATTG 
TTGTTTACGC 
TTATTGTTCA 
ATCTTTCTCA 
TGCCGTTTTA 
GAATTACACC 
CAGCAAAATT 
TTTTTGGATG 
CTAAAAATTT 
AAACGTTAAT 
CTGAGGTCGG 
CTGCTGAAAA 
TCAAAAATAA 
TTAAAATAAG 
CTCCTTTTAT 
GTATGAATAA 
AAAGTTACGA 
ATAAATTCCT 
ATCATCTGGA 
TTTATTTGGT 
CATAAAGTTC 
GTTATCAAAC 
TTCTAGCATT 
CATGTACATT
CTTATCGGTA 
GTTCAAAGAT 
TTCCGGGCGC 
ATAAATAATC 
GGATTGATCT 
TGGTAAAAGC 



11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
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ATCTTCGCTT TCATCATTTT TTGAATCATA TATTCAGAAG GAATGGCAAT ATCATAGGCT . 13260 

GTGCCACCTT GCTGAATTTT TGTATACATA GCTTCATTAG AATCAAAGGT TTCGTAATTG 13320 

ACTTTATAGC CTGTTTCTTT TTCAAATTTC TTAATCAAGG CCGGATCTAT ATAGTCCCCC 13380 

CAATTGTAAA TGGTCAAGGT ATCTGCTCCT GCCATGCCAC TTGCTTTCTC CAATTGGCGC 13440

ACGCCAAAAA ACAAGATGAC AATAATAGCG ATAATTCCAA TAAAAAGTGA CTGTAATTTT 13500 

TTCATTTTAA CTTGGCCACC TCACTTTGTT CCCGACGCAT TTTCTTTAAA CGTTTTGGCG 13560 

TATTGTCTTT ACTGATAAAG TAATAACCGA CAACTAGAAT CATTGAGAAT AGGAATACTA 13620 

ACGCACTTAA GGCATTGATT TCTAAACTAA TTCCCTGACG TGCTCGTGAA TAAATTTCAA 13680 

CAGATAATGT TGAAAAACCA TTGCCTGTCA CAAAGAATGT TACGGCAAAA TCGTCCAACG 13740 

AATAAGTAAA GGCCATAAAG TAGCCAGCAA TAATTCCAGG CGCTAAAAAA GGCAAAATAA 13800 

TATTTTTAAT CACTTGAACA TTATTGGCCC CTAAATCCCG TGCCGCATCT ACCATAGAAT 13860

CGTTCATCTC TTGTAGTTTT GGTAAAACCA TTAAAACAAC AATTGGAATA CTAAAGGCAA 13920 

TATGCGACAA TAATACACTT GTAAAACCTA AACTGATAAA CCCTACTGCC GTGAAGAAAA 13980 

TTAAGAAACT GGCTCCAATA ATAACATCTG GTGAAACGAG TAAAATGTTA TTAAAGCTAA 14040 

GCAACGTGTT CCGGTTTTTA CGACGTTTTG TATAATAAAT CCCCATCGCA CCAAATGTTC 14100 

CAATGATGGT TGCTATTAAA GCCGATAAGA AAGCTAGCAA AAACGTATTC AGAACAATGG 14160 

TTAACAAGCG TGTGTCTTCG AAAACGTTCC GATAATTATC TAACGTAAAA CCAGTAAAcT 14220 

GATTCATCGT TTGCGTGTCG TTAAACGAAT AAAAAAtCAA ATAAAAGATT GGTGCGTACA 14280

GTAGCGCAAA GACAACGAAT AAATACACAT AAGACCATTT AAACTTTTTC TTTTGCATTA 14340

TTTGCCACGC CCTTTCTTCT TCTCACCCGT CAGCAACATC ACAATAAACA TGGCAACAAT 14400

TAAAATCACA CCTATTGTTG AACCCATGCC CCAGTTTTGC GTTACTAAGA AGTGTTCCTC 14460

AATCGCCGTT CCCAACGTAA TCACCCGATT GCCGCCAATT AAACGCGTTA ACATAAAGAG 14520

CGAGAGCGAA GGAATAAAGA CAGCTTGAAC GCCACTTTTC ACACCGTTTA GGGAGAGAGG 14580

AAAAATGACA CGGCGGAACG TCTCCGCACT GCTAGCTCCT AAGTCCCGAC TAGCACTAAT 14640

CAATGAAGGA TTCATCTCTT CTAAGGCATT AAAAATCGGC ATAATCATAA ATGGAATTTC 14700

AATATACGTT GCGACAAATA AAAAACTGAA ATCTGTAAAC AGAATCTGGT GCGTACCCAA 14760

ACCCATAAAA CTCATAAACT GATTGATACT CCCATGAATA CTGAAAATCC CAATAAAGGC 14820

ATAAGCTTTT AATAATAAAT TCACCCACGT TGGTAAAATA ACAAGCATTA ACCACAACTG 14880

TTTATGCTTG AGCTTTGTCA AAAAATAAGC CGTAGGATAA CTTACCAATA ACGTAAAAAT 14940

GGTAATCAGA AAAGCATACC ACACTGAATT AATGGTCATA CTTAAATACG TTCCTGATGT 15000 

AAAATACGTT TGGTAATTCG CCAAGGTAAA CTGGCCATTC ATATCGAAAA ATGATTGATA 15060 

AATAATCATT AGCACTGGCG CAATGACGAA TAAAAGTAAC CACATTACAT AGGGAATCGA 15120 

ATAAATTCGA CGCATTGTTT TCATTTTATT TTCCCCTTTC GTTTACTCGT CGTAACTTTC 15180 
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TAAACGGGCA 


TCAAAGTCTT 


CTTCTGATTC 


ATTGAAGCGC


ATCACGTGAA 


TATCTTCTGG 


15240


CTCAAAAGAG 


AGCCCTACTT 


GAGCGCCTTC 


TTTGGCTTTT 


CGAGTGGAAT 


GCACCATCCA 


15300 


TTCGTTTCCT 


TGTTCATCAA 


AACAAATAAT 


TTCATAATGA 


ACGCCGCGGA 


ATAACTGCGT 


15360 


GTCCACCGTT 


ACGGTTAGCT 


TGCCTTTTTC 


AAGCGTGGTA 


ATCGTTAAGT 


CTTCTGGACG 


15420 


TAAAACGACT 


TCTACAGoTT 


CATTTTTCCG 


CATGCCGCCA 


TCGACACATT 


CAAACTTTTT 


15480 


GCCCACAAAT 


GAAACTAAAT 


TATCTTCCAG 


CATCACGCCA 


TCCACAATAT 


TACTTTCCCC 


15540 


CACAAAGTCA 


GCAACAAAGT 


GATTAATCGG 


TTCATCATAA 


ATATCGACTG 


GTGTGmCACT 


15600 


TTGCACGATA


TGACCTTTAT 


TCATGACAAA 


AATTTCATCA 


CTCATCGCTA 


ATGCTTCTTC 


15660 


TTGATCaTGG 


GTCACAAAAA 


TAAAAGTAAT 


TCCTAAACGT 


TGTTGTAAAT 


CACGTAGTTC 


15720 


ATATTGCATC 


GCTGTGCGTA 


ATTTCAAATC 


TAAAGCGGAT 


AACGGCTCAT 


CTAAAAGCAA 


15780 


TACTTTTGGC 


TCATTGACAA 


TCGCTCGTGC 


AATTGCGACC 


CGCTGACGTT 


GGCCCCcAGa 


15840 


CATTTCACTA 


ATTTCaCGTG 


TTTCGTAGCC 


CGGTAATTGA 


ACCAAACGTA 


ATGCCTCTTT 


15900 


CACTTTTTTT 


TCAATTTCTG 


CTTTGGATAG 


TTTTTTGATT 


TTTAAACCAA 


ACGCAACATT 


15960 


ATCAAAAACA 


TTCATGTGAG 


GAAACAAAGC 


ATAATCCTGA 


AACACGGTAT 


TGACTTGGCG 


16020 


TTTATTGGCA 


GGTACATCAT 


TTATTCTTTT 


ACCATCAAAA 


TAAACATCGC 


CAGAAGTCAC 


16080 


GTCATTGAAT 


CCCGCAATAA 


TACGCAAAAT 


CGTTGTTTTG 


CCACATCCTG 


AAGGTCCTAA 


16140 


TAACGTATAA 


AATTTTCCTT 


CTTCAATTTC 


AAAGCTGACA 


TTTTTTAATA 


CTGGTTCATC 


16200 


GTCAAATTGT 


TTGACAACAT 


TTTCAAATGA 


AATCACATGG 


TTTCTCACTG 


GATAAACTCC 


16260 


TCCATTTCGT 


CCTTTATAAA 


TACGATTCTG 


TCGCCACAAT 


TAACGCACGA 


CTTTTTCCTC 


16320 


GACCATTGTT 


GCTAATTTGG 


TGAGGTTCTG 


TTGCTTGGTA 


ATACATGGTT 


TGACCTTTTT 


16380 


TAGCAATATA 


CGTGGCTTCC 


CCTAATGTCA 


AACGAATGGA 


GCCATCAATC 


ACATAAATAA 


16440 


ATGTTTCTGA 


TAGAGAAGGT 


TCAAACGTTT 


TGTATTCGCC 


ATTTTTATCA 


AAAGTAATAA 


16500 


TGACAGGTTC 


CaTTTCTTTT 


TCATTCGAGG 


CTGGAATTAA 


CCATTTTAGT 


TCATAGCCAT 


16560 


GTTCTTCATC 


ATAATAGAGG 


GTACTTTCTT 


CTTCATTGTA 


GACAATTTTC 


TGTTCAAGCG 


16620 


GTTGCTGACT 


AAAAAACTGC 


TCTGGCGTCA 


CGCCTAGTAC 






16680


TTTCCATTGA 


TGGTGAGCTT 


AAATCCCGCT 


CCAATTGGGA 


AATATAGCCT 


TTGGATAAAT 


16740 


CCGTTCTTTC 


GCCTAATTCT 


TCTTGTGTTA 


AATTTTTCTG 


AATGCGTAAA 


TTTCTCAACT 


16800 


TCTCACCAAT 


TTCCATGGCA 


CACCTCGTTT 


CTATTTTCAG 


CTAATCTTTT 


CAACAAAAGT 


16860 


TTTGTAATAC 












16870 


(2) INFORMATION FOR SEQ ID NO: 99: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3905 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

GAATTATTTC AATTACCTGT TGACTATCAA GATGAACGAT TAACAACTGT CCAAGCAGAG 60 

CGTTTCTTGG TGGAGCAAGC AGATGCTTCA AGAGCCAAAA GAAAAAAAGT GATTGATAAA 120 

CTAGCGGCAG TAATGATTCT ACAAAATTAT TTAGATGCAC ATAGCCGATA ATTATCGAGC 180 

TTTGAATACA TCAnGGGAGA GAAGAAGAAT GACAGAACAT AACCACGATC ATGATCACGA 240

GGACATGAAC ACATTACTTT AGTAGACGAA CAAGGAAATG AAACATTATA CGAGATCTTA 300 

CTAACTGTTG ACGGTCAAGA AGAATTCGGT AAaAATTATG TTTTATTATA TCCAGCAGGA 360 

ATTCCTGAGG ATGAAGATGT TGAATTGCAA GCTTACTCCT ATGTGGAAAA CGCAGAAGGT 420 

ACAGAAGGCG ATTTACAGCA AATCGAAACA GACGCTGAAT GGGATATGAT TGAAGAAGTT 480 

TTCAATACAT TCATGGCTGA AGAAGAAGAG TAAAACTGTA ACGAGAGAAG CGCTACTTCT 540 

CCTATTGAGA AGTAGCTTTC TTAATAATAA TTGTTATAAT GTACATGTTG TTTCTATCAT 600 

AAGATACTAT CACTATCATT GAGTGTGATG GATAAAGAAG GACAGCATTG TCTAGTTTTA 660 

CAAGATGTTG TAAAAACGTA ACATCCTTGT AATGACGTTA ATTTAGTAAC TATCTAGCAT 720 

AATTTATAAT TTGTGTTATA CTAAGTATGA GGTTTTAGTA ATACTGTTAA TTTkAkAaTA 780 
TTAAATAAAA AAATGGCTCC ACCTACTGGC ATAGGTAGAG CCGTCAGAAA TTCAGTTAAA 840

CAAACACGTA ATTATTGGTG TGGTGCATTG CCACGAACTG AAATTCCAGT TAATAAAGCA 900 

AGTAAATCAA ACATATATTT CACCTCCTTT ATTATGATAG CTTCATTATA ACATTATCTG 960

TAAAATGATG GTATGTTTAA TACAAAAAAA ATAGAAAGAT TGGAGTTTTG ACTGTGAGAG 1020 

TAGCGGGAGA AAAAATCAAA ACAGCGCGTA AAAAGAAAAA ATTATCGCmA GCTGAATTGG 1080 

CAAAAGGAaT TTGTACACAA GCmACAATTA GTAACATTGA AAACAAAAAT GTGTGTGACA 1140 

GTTTAGACAT TTTCTCATCT GTATGTCTAC GTTTAGACTT ACAAGTAGAA GAATGTATCG 1200 

AAGGTTCTAG TGAGAAAAAA CTCGAAAGCT TATTAAATAA AGTAGAAGAA TTATGCTTTT 1260 

ACTTTAAGCA CGATGAAGCC TATGATTTGT TGAAAGACTA TCCAGATGAT ATCGAAAGTT 1320 

CAAACAGAAT TTTAGAAACA AAATTTTTCT ATTATAAAGG AATTACGAGT TTATTAGGTA 1380 

AGAAAAATAA TAGCGAACAT TGTTCTATTT ACATCGTGGT TCTGAAATTA GTCGTGATAT 1440

TAATATTTAT AATATTtTGA GTATGAATGC GATCGGTATT TTATATGAAT TAGAAGATGA 1500 

TATTGAAAAA GCAAAAGTTT ATTACGATAA ATCTTTACAA TTGCTAAGCG AATTCAAATT 1560 

AGATTATCCA TTAGAGCAAT GCCGTATTTA TTATAATACC GCAAAATTCT ATTCATTAAT 1620 

TAAAGACTAT GCAAAGAGTA TTGAATTAAG TGATAAAGGA ATCGAAATTA ATCGGACGCA 1680 

CAGTTCTATT TACTCTTTAG ATTGCTTATT ATACGAAAAA GCGTTTAATA AACAAATGCT 1740

AGGACTAGAT GCTGTTGAAG ATTATCGTAT TGCGTACTAC TTTACACGTT TCTTTGAAAA 1800 

TAAAAAATTA CTTACTTATA TTGAAAAAGA TATGCAAGAG TTTAATATCA GTTTTAAATA 1860 
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ATAAATTATT GTTTCAGATT 
TTTTTTCTCT TTTTAATTTG 
TGCGACTCCT TCTTGGAATG 
AATCAATTTT GCAATACCTT 
CCGAGCATCT AAGGCACCAC 
CGGAAAGTCA CTACGACCAG 
AAAAATTTCA GGAACTGGGT 
AATCCACTCA GGTTTTAAAA 
AAGAGCAGTA GCTAAATCAC 
GTGGTGTGGT GGCAGAGCTG 
AATATGTTTT ACGCCAGCAG 
GCCGTTGATT ACCACATGAA 
CAAAGCCGCT AGAACAACAA 
TAGTTCATCG ATTAAACGCT 
GTTAATCCCC CCAAAGGTTG 
TTGTGTATCT AAAACAAGGG 
TTTTCCTTCC ATGACAGGCA 
TGAACCATCA CTAATAACGG 
TTTTTTTTCA TGTATGGCAG 
ATCTGCTTTT GTTTCAATCG 
TGCTTGCTCA ATCGCTAATT 
AATTCACTTA TAGTTAAAAA 
CACCGCCTAA ACGTGTCGCA 
TTAAAATAGC CACATCGCCA 
CCTCAACGGG ATTCATATTC 
AAACGACACT AATTACCACA 
AAGGAATATA TAACATACyT 
ATTTATAAAG CTGTTTTGAT 
AAAAGGCTGC GACAACAATC 
AACCGCCTGT AATAAATAAG 
CAATTGGTGC TTTGTCTTCT
TCAACTGACC TTGTCCAGAA 
AACACATAAT AGCAAAAAAA 




TtAAAAAGAA CAACCTAACT 
TTTCTTTGAC AGCGTTACGG 
GATCGGGAAT AATATTAGTC 
TGGCAGCGGC AATTTGCATT 
GGAAGATACC AGGGAAGGCT 
TTCCGACAAT ATAAGCGCCT 
TAGCCATTGC AAAAATAACT 
CACCTGGTGC CGAAACACCA 
CGGTACGgTG TTcCCGATTG 
TATCTGTTTC GCTCAGAaTC 
CTAAAAATTT TCTTGTGAtC 
TGTCCTCTAT TTTCTTGTTG 
TCGCTGTCCC GTGTTGATCA 
GTTCAATTTC AAAACAACGA 
GAGCCAAAAA TTTCACCGTT 
GAATCGAATC CACACCAGCA 
TCGCTGCTTC AGGACCAATG 
CGACGGTGTT TTTCTTTGTC 
AAGAAACAGC CGCAACACCT 
GAACTTTAGG GCAGACTTCT 
CTTTTACATC TTTTGTTGAC 
AGCATTCTAA AAATAGCAGT 
ACTTGTGCAA ACGGCATTAA 
GTCCCACCCA TACCACTCTG 
ATGAAGCGTG AAACGAAAAA 
ACAAAATATT GCCAGCTAAG 
AAGCCGACCA TTAACGGGAA 
CCGCGTTGGG TTTCTTTCGG 
ATCAAAACAG GACCAGGAAA 
GTACACGCAA TTAAAACACC 
TCTAAAGCAT CACTTAAATC 
AGTTCAGGGC GTTTTTCTCC 
TTACCAATAA TGGTTGCGGG 
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CAAGAGGAGT TAGGTTGTTC 1920 

ACGCTTTCGG CGACGACTTT 1980 

GGCGTGAGTT CATTATCGGG 2040

TCAATCGTAA TCTTTTTCGC 2100 

AACACATTAT TAATTTGATT 2160 

GCCGCTAAAG CTTCATCAGG 2220 

GGCTGCTCAT TCATCTGTTG 2280 

ACAAAGACAT CAGCGCCTTC 2340 

GTTAGcTTCG CAATTTCTGC 2400 

CCTGCACGGT CAACAATGAT 2460

GAAAGACCAG CGGAGCCACC 2520 

ATTAATTTTA AGCTGTTGTA 2580 

TCATGGAAAA CGGGAATATC 2640 

GGAGCGCTGA TATCTTCCAA 2700 

TGAATAATTT CTTCGGTATC 2760

AAGCGTTTAA ACAATGCAGC 2820 

TTACCTAAAC CTAAAACTGC 2880 

GTTAATTCGT AAGCGCGTTC 2940

GGTGTGTAGG CAATTCCTAA 3000 

AATTTCCCCC CAAATTTTTT 3060 

ATATCATTTT CCTCCTAAAT 3120 

CATTGTTATT ACTGTAATGG 3180 

GTTCATTCGA TTGGCTGTAC 3240

GCAAGCGGAG ACAATGGCAG 3300 

TCCTGTAGCA ATAAywGTGA 3360 

CATTCCTACG ACATCTTTTA 3420

CGTAAAATTG CCAGAGATAA 3480 

TACTACGTTT AAGTATTTTA 3540 

GCCTGTTAAG TGTTGCAATA 3600 

AGCGCCCATT AATTTTACAT 3660 

ATCTGAATTG GTAATTTTAA 3720 

TAAACGACTA AGTAAACCAG 3780

AATTAATTGC CCCACTAATT 3840
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GTTCACTTGG CAATCCAGTA ATGGCCACTA TAACCAAGGG ATAAAGGAAG AATGCCTTCA 3900 
CCGAT 3905 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



Taan n zi P.PPP 
1 nrlrinrtvJVjVw.0 


P T T Zi P. 21 ZI T P Zi 


InUUl nun 


iy/iluul ibl 


Zi BTTfTTP A rn 
nAl lbl lunl 


bl 1 1 Itl 1 n A 


en 
bU 


arTTPTIi H ZIP 
H(j 1 1 vj 1 HJ\ti\j 


71T , "PTP7\T ,r P r Pfi 
/il 1 1 VjAl 1 iH 


■ T\ T Zi PTP ZV TfT 
n 1 i bA 1 v_o 


TPTPBPPTZl Zi 
lul Onv-v- 1AA 


1 1 uH 1 nrt 1 L- 1 


r P/**7\rT"P7\ ATT 

1 LACoi AA.1 i 




TZHi£iP.aPTTT 
1 nJ%n\* obi i 1 


Zi P 21 PPTTPTP 
nbftbL 1 lulu 




r* zi r"v t> zi p t p t 
LAL 1 1 rio lul 


T Zi Zi 71 Zi TTPP'? 
lnnnrl.1 1 L-vj 1 




i on 
X o u 


nnnu 1 1 iul i 


ZVTTPPPTPTZ1 


TTTTTPa ZiPZi 
11111 Cclnvjn 


P T TPP ZIPPTZi 
O 1 Iv^vjnvjvjln 


TTPTPZi HTPT 
1 lul unnl v_ 1 


PTPZi Zi ZiPTPZi 


0 AC\ 

£. fi U 


PPTPGTPAGT 

V_r l^Ul \-r/AVj X 


TGTPTTTTP.T 


Zi Zi C. T P T P (7 Zi P. 


PZiTPTZiPGTZi 


TZiZlPTPTTTT 
1 nnO 1^1 1 1 1 


PTTTPTZiPZIP 
1 1 1 vj 1 rivj AC 


ouu 


PTGTTTTPTP 


GTTAPPPAPP 

VJ 1 1 rtV^V^V^nVrfV^ 


APTAPHPTTT 
no 1 nono 111 


TUPPTTPPGA 


GPPTTPZ1PPT 


TPH CHCZ Zi Zi ZI Zi 


fin 

O D \J 


PAATAGPGGT 


TGAAGATTPA 


TP A A AGTTAT 


TGTTPTGAT A 
1 vj 1 1^1 vjn i n 


PPPPGPTTTT 
^t^^ol, 1 111 


P.TTZiPZiTTTT 

Ul 1 i\\3t\ 1111 




PGTATTGGTG 


TGAAAAPTPA 


ATPGTTPTPG 


A A APATPTPG 


APPTTTAAAA 


AT ZiP Zi T Zi n PP. 


4 O U 


TATTACCAGG 


AAPGPPTAAT 
ftA^vj^v.. inn i 


TTAGTAGPTG 
1 1 nu i noo 1 vj 


TPTPTPTT AT 
ivlOl^l inl 


PPPA AAATPG 


TAGGHTGTGP 
1 nuonl vj 1 




AAAGAGTTTP 


PAPTTPTTPG 

OnO i. 1U1 l Co 


GPTAPGAPAP 


PATAAPTATT 
On 1 nnO inl 1 


PTGATAPTGG 
o 1 on i nO 1 uu 


ATZiPUPPT AT 
rt 1 riuriv^u Inl 




TCCCTAAATT 


AGPTPGGGAA 


TPP A A TTTT A 


AATATTGAAT 


fri fp fp fp rp fp rn fp rp 
1111C11111 


APATPPGA AG 


660 


GcrTTTACTAP 


TTPATTA ?T A 

1 X Vrt 1 X F\ J- Irt 


AGATGTTPPP 

nun i vj l l \— ■ O v.* 


AAAPAATTTG 


PTPTGGPGGT 

1 O 1 UO^Uu 1 


GPPGTTTT AT 

V3V*V/V3 ill inl 


720 


TATAAATTCG 


GTAAACAATA 


CGATCTAATG 


ATTTCCCAAG 


AAGTGATTTP 

nnu x \jn x x x v_* 


PPPGAPAPPT 


780 


TTATTTGTGC 


TGTTGCCTGA 


TCGTCTACAA 


CTACGGCATC 


AACATAGAAA 


TAGAGCCCTT 


840 


CCATATAAAT 


AACAGTGTCT 


GGAACAAATA 


TTTGTATATT 


TTCAGGTGTC 


AAACCAACAA 


900 


ACAGCTCAAA 


CGTTGAATAG 


GTATAGTAAT 


TTAATTTAAC 


AGTTAAACTT 


TTAAACCCAT 


960 


CAAATACTTT 


TTCGGAAATA 


AATTTCCATT 


GATTATCTCG 


ATCCTGAGTG 


AAAATTTCTA 


1020 


ACTCCATTAC 


ACACCACCTA 


CCAACGGCCG 


GAAGTCAAGT 


TGTACAATTA 


CGTTACCAAT 


1080 


ACCAGTAGTG 


GCTTTCACTT 


GAAAATAGTT 


ATCACCCTTT 


TCAAGCTGCA 


GAAAGGTAGA 


1140 


ATTAGGGTCT 


CGCAATGGCA 


TAGCGTTAGA 


TTCAACACCG 


TTTGGGTCCG 


TCATGATTGC 


1200 


TTCTTTCTGT 


CCACGAGTAG 


TTACCAGTTT 


AAACTTAGTA 


CCTGCTTCAA 


ATGAACCTTT 


1260 


AAATCGGAAA 


AATTCCTGAG 


TAATTACGTT 


ATACACTTCT 


GGATCGGTAG 


CTTCTGCACC 


1320 


TAAAGAAAAA 


TAGAACGTGC 


CTCCCACTGA 


TACATCACCA 


TCATTGGTTA 


CAGGAACAAT 


1380 


CTCACCACTT 


TTTAGTGTAG 


CAAACTCATA 


CCCTTTCGTT 


ATTTCTAACG 


GCCACATGTG 


1440 


TTTTTTTGTG 


GCAACGGCTA 


GGGGAATTAA 


CGTATTATAT 


TTACTCATAT 


CAGACCAATA 


1500 


CGGATCTAAA 


GCTAGAAAGG 


TACACGTAAA 


AGCCTGTGTA 


ATATTTTTAG


CTGGATCTGG 


1560 
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CAGGTCGGGA GCTTTTACTA CTAACACATC" AATTTGAAAC TCACGATCAA ATACTCTGTA 1620 

AATCAACGTA CCTGGTGTTT TGGGATTTAA CGTTTTTATC ATTTCATGCT GTAAATTGAA 1680 

AAGATCCTTT GTTCCTCTAG CAATTATTTC ACCTTTTATC GTTAAATCGC GCTTATCTAA 1740 

ACGCTCCGAA ACTTTAATAA CACCATCTAG TCCATATTGT TCTTCTGTTA TGATCTTATT 1800 

TTCAACAGCT CCGAAGCCTG TTTTACTTTT AACGGTAAAA GGTGGTTGAA CACCGAAACT 1860 

AATCAAAGAA CCTTCCGAGT TCTTGTATAT AAGTTCGTAC ATCCAACCTA CCTCCTATCT 1920 

TCTCAGTGTT GCAAGGTCTT GCAACTTGTA TTTTGTTTCT CTTGCAATTT CTCTAGGCGT 1980 

TAAAGGCTCG GGACTTGTAA TATATTGTGT AACATTGATA TCTCCGTCTT TTTGTTCAGC 2040 

CATTACACCT CTCACTGCAG TTTCAACATA TCCTAACAGC GTGTCAATCG GTGCGACAGC 2100 

CTCAGCCCCT GCTTCTCCAC CTATCATTGC GTTATTTCCG TTCATGCCAA ATATGGTAGG 2160 

TTGCGTCATG ATACCGCCGT CTTTGTACCA TTCTATGCCC AAACTTGGTA TTTTACCTTT 2220 

CAATAAATCG CCCGCAGACC AACCGGCCGG ACTAATAGAA AAGTGAGGCA GCGGAATTTT 2280 

CGGCCAGCTA ATATTAAAGT TAAAGAAACC TTTAATCGCA TCTACTACAT TTTTCACTAG 2340

ATTTTTAGCC GCATTCATTG GCCCATCAAT AGCATTTTTT ACACCATTAA AGATATTAGA 2400 

AACTGTTCCT GTAATACCAC CCCATACATT TGAAATAGTA TTTTTCACTG CGTTAACTAC 2460

ATTACTAATA GTATTTCTTA CACCATCCCA AATAGATGAA ACGGTATTTT TCATACCGTT 2520 

AAATAAATTC TTAACGGTGT TCAcCGTTGC ACTAACGATA TTAGACACGG TCGATTTAAT 2580 

ACTATTCCAC ACCGTGGATG CTGTATTTCT AATTGCGTTA AAAATGTTTG TAATTGAATT 2640 

TTTTAAAGCA TTAAAAATAT TTATCACTGT GTTTTTAACA GAAGTAACGA TATTGGAAAT 2700 

AGTCGTTTTA ATACTATTCC ATACACTAGA AGCCGTGTTT CTAATGGCAT TAAAAATGTT 2760

TGTTATCGTG TTTTTCATAG TATTAAATAT CGGTGTAACA AAGTTAAGGA TAGTCTGCAC 2820 

AACATTAGAG ATTGTTGTTT TAATAGCATT CCAAACGTTA GTTGCTGTGT TTTTAATTGC 2880 

GTTAAAGACA TTAGTAATCG TCGTTTTAAT AGCATTAAAT ATAGGAGTGA CAAAATTTTT 2940 

AATAGCCGTA GAAACTGAAG TAACTGTATT TTTAATTCCG TTCCAAGCGA TTGTTACAGA 3000 

GTAAACGACA GCATCCCATA TCTGTTGAAA GAAAGTTTTA AACGCATCAA AAATACCCTT 3060 

AATAACTCCA ATGAAACCGT TAATTATTCC TGTGACGGTA TCACCTATTT TTTCCCATAC 3120 

ACCGGTAGCA ATTGCTTTTA TGAGATCCCA TAGATATTGA AAATACGCTT TAAAACCATC 3180 

AAAAATACCT TTCCAGATAT TAATTGTATT AACGATTATC GCCCCTACTA CAGCGTATAT 3240

TACATTCCAC ACTGTTTGTA GAAAATTCTT AATGCCGTTA AAAACGTTTT CTACTGTTTT 3300 

TTTGAAACCA TCAAATGTTT TCTTGCCAGC ATCCGTGAAT TTCTTCCAAA TTTCTGAGAC 3360 

TTTAGCGGTA ATATCATTCC ACGTTTTTAC TAAGCTACCT TTTAACCCAC TAGCTGTTTT 3420 

AACGATTGAA TCCCACGTAT TTTTTATAAA ATCGGCTATA CCATTAAATA TCTCTACTGC 3480

TTTATCTTTA ACGGAGTTCC ATGTATCAAC TAGAAACTTA GTAAAAGATT TCCAAATCTT 3540 
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TTGTCCTGTT TCAGTTTGGG TGAAAAAATA* 
CGAAATAATC AACACAAAAG GGTTGGCGTT 
AGTATAGGCT TTTGTGACAC CTGAAGCTAT 
AATTTTATAG GTTGCTAAGG CGGCTCCTAC 
AATAGCTTTA TTTTTAGTTA TAGCCTTTGT 
TACAATAACC TCACTCACAG CTCCTGATAT
CGCAGGTAAT ACTACTGTAT TTATTGTTTT 
AGGTACTACA GCTGTGAAAA TATCGGCAAA 
CAATTTCGTA AATATCATAG ACACTTGACT 
AGGCAGTTCT TTTTGTACAA TCTGAGCTAC 
AGCCCACTGA AGATTCATCA CAAAACCTGA 
CGATTTTTGG AATCCCCCAA AGTCTGGTAT 
TAAGCCTTTC GTAATGCTTT GAATAATCGA 
TTTTGCTAAA AAGTCCTGCC ACGCTTGTTT 
GGCTTCGCGA GCAGCCTGTC CTACAGCTCC 
TAAACGAGTA GCTTGTTTCG TCGCTTCATC 
TACTCCCTGA GAAATAGCAA ACTGCGCCAT 
TTCACCGGCT TCGTAGTTTC CTTTTAGAAA 
TGAAACATCA TAGAAAGCTG CAGCATCTGC 
CATTGCGGCT TCTGTATCTA GCCCTAAGCC 
CGGCTTTATA CGGTTAGGTA GGATATTCAT 
TGCTAATGGT TCTAACTTAC CAAAAACCTG 
TGCCGCGGCT TCTACGCTTA GCTTCCCAAA 
ACCAATTGCT AGCGCTGCTT TTTTAAAGAA 
CTTAGATTCC GTCTCGTCCA AAGCCTTATT 
AAAAAGCTTG AAAATTTCGC CCATTAAGAC 
TGACTAGCAA AATCTAATGC TTTTTGTTGT 
TTACGTAAAG GTCTGTATTT TTGTTGGTCT 
GTGTGTAACC ACTGATTGTA GATTTTTTCT 
AATTCCATTA AGTCACACAA ACGATAGGTA 
TTGAACAACA AATCTTTAAT TCGGTGTTCG
AAGAGAAAAA ATCAGCTAGT TCTGGTTTCT 
CCTTTAATCC AAGAGCTGAA ATTTCTTGTA 




AATTAACGCC GCTACTAACC 
TAAcACTGCG TTAAAAGCTG 
TTTAGTAGCT GTTAATACAC 
TCCAACTATA ATAGCTTTTA 
AAAATTAGAA AAAGCAGGGA 
GTCCCAAATC ACCTCTACGA 
AAATGCAGAT TTAAAAATTT 
AGTAATTGAC CATGTACTAA 
GCCCGTATCC TCAAAAATCC 
ATTTTTTATA GCATCACTAA 
ATTTTGAATG CCGTTAAATG 
AACTGCAGCT ACCTTATCAC 
AACGACCGTT CCTAAAATAG 
TAAGTTACCC ATAACGTTTT 
GGCTTGTTTA AGCATGTTGC 
TAAATTTTGC CAATCTTTTG 
TTGAGTATCA TTTGCAAAAA 
CGAAGTAAGA GAACTTGAAG 
CGCTAAGGTA GTTGCTTGCT 
TTTAAACATT GAAGTAATTC 
TTTTTTACCC ATGCCATCTA 
GGTAAATTGT GCATTCAAAG 
ATTAACGATT TTTTCTACGG 
CCCCACTAAT TTACTTGTTG 
GGCTTCACCA TTATCGACAC 
ACCTCACCTT CTTTTCGAGG 
TCTTCTTTTG TGGTAGATTT 
TTAAATTCTT TTAAAGACTG 
TCTTGTTCCA CTTCTTGAAC 
CGAAGTAACG ATATCGGATC 
CCGTCTTTTA TTTCAATAAG 
TAAAGAAATC GACTAATAAA 
TAGAAACGCC TGTTAACTCT 



CCACGATTGC 3600 

TTTGAATTGC 3660 

CTTTCATGAT 3720 

AAGAACTAAA 3780 

TGACATTATT 3840

TTTTTTCAAT 3900 

CTATCGCTTT 3960 

ATACTCCTGC 4020

CTTTTATTTC 4080

TCCACTTAAA 4140

CATCCATAGC 4200

CTAAGCCACT 4260

GAGTGCCTAT 4320

CATATCCGTC 4380

TAGCATAATC 4440

TAGAACCTAC 4500

TCCCTATCGC 4560

CATCTTCCAG 4620

CTGATTTCTT 4680

GAGAAAATGT 4740

CCATTTTTTG 4800

CTTGTGCACT 4860

CAAAAGCAGC 4920

TTGATTGTCC 4980

CGATTGTACC 5040 

TTTAACAAAT 5100 

CGCTTTATTT 5160 

CGTCATCTGT 5220 

ATAAAAAATA 5280 

TCCGTACCTT 5340 

CTGGCGATAG 5400 

GTCGTATACT 5460

GCTAAAAGTC 5520 
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CGT7AATA7C ACTC77AA7A TCTTTTAAGT TAAGCATGAC TTTTTGCAAC ATCGctGCGC 5580 

CCATTTCAAT GCCACGCTTA GTAGCTTCTT TGTCAGCTTT TTCTTTGGCT GCATCAATTT 5640

TTTTCTGCTC TGCTTTTGTT GGCTCTTTTG ATTTGTGGTC CAACGGTACA ACTTTTTCAG 5700

CAGATTCA.-.T ATTTTTTTCA AATAAATTGA TAAACTCGTC TTTCACATCA AGCTTTCCTA 5760

TGATGGCAAG TAATGTAAAT AAGTCATCAC CTTTTAGTTC ACGCATTTCT AAAGTCATTT 5820

TSTTATTCCT CCATGTAAAA AAGAACGACT GATAAGCCGT TCTTTAATTA TTTCAGTTCG 5880

TCTAAATTTG TTATTTCo>A ATATTTGATT CCTCAATAAT CTACAAGCTC TCAGTTAGCT 5940

CTATAGGATT TTTCGATGTA CTTGGGTAAT AAATACGGAA TGGGAATTCG TCATTTATTA 6000

GCTGTTCATA AGAAGCGTTA GCTGTGATTT CCTGTTCAAT TACTACCTCT TTGCCATCCT 6060

CTGTTTTAAT TTCAAGTCCG CTTTTCACTA ATCCGTTATC TAACGCAACG ATGATAGGCT 6120

GTTTCGTTCC ATTATGAATA CCCACAACGG CCATGTTAGG GATATAGTCC CCTTCTTCTA 6180

AATAGCGTTT AGGTTTAATA ATTTTATAAC CAGCAGGTGC TTCATCTGTA GTAGCATCTG 6240

TCATCGTAGC ATTTAATGAT CGGCGTAAAT TTTCTGCTGT TAATTCAATC AAGTTGGCTT 6300

TCATCGTAGC TGTGGCTGAT TCAAGCACAT TCAAACCTAC TACGTCCATA ATATAAGCAC 6360

CGTCTACTTC CACTTTACGG TAAGAAAGTT CTGTTTTGAT TTCTACACCA CCAGAAGTTG 6420

CCCCCATGGG AATACCGGTA AACTCCTCTT TCTCTTTATC GTATTTAAAA TCAGTAAATA 6480

CTGTCGCAGA ATCAATGACA AAATTGTCTG CGGTTGTTTT TGTATAACCT GTTTTAGGTA 6540

GTGCCATATT CTTTTTTCCT CCAATCGACC GCTACATAAA AGCGGACATT TCTTCTTTTT 6600

AACAGCTCTT CTTTTGTTGG TATTTTATTT GATCCTTGAA AACTAAAAAT GAGATTTAGT 6660

TCTGGAGTTA GTACCCTTTT GTAACAAAGA GCTGTTTTAA ACTTGTCTTC TAGAACAAGC 6720

AAGTTTAAAA ATGAATTATT TTTGTCAAAA ATATCAATAT CCAGATAAAA ACCATCTTGA 6780

TTTCTACGAA TAGGCTCGGA ATCAAAATCA AAGGTGGCAT ATGGATAGAT GACCTCTTTT 6840

CGAGGATTCG TTTCTAAAAA TGTTTCTGGG TGTATCTCTT GTAAGATTCT TACCAGCTCT 6900

CGTATAAACT CTTCCATCAC TTATCCCCCT TTAAATGTAG CTCCGTATTC TTTCCCTAGA 6960

ATTTCGATGA CTTTTTCTTT ATTCTGCCGA AACGCGTTTC TTAAAAACTT TTGTGGTTTC 7020

TGACCTCTTG TAAAATGCCA TTCTCCATCT GGTCCCTCAA AGAACCATCC ACCTTTACGT 7080

CCTGAGCCAT TTTCAGCAAA TTCTCCGGTT CCGAATTCAA CATAAATGGC GTATTGTTCT 7140

GGCGAGCCGA CAATGCCGAC CACTTTTCCT CCAACAGCCT TAAGTTGATA GTCTATCCGG 7200

TCCCGCAACT GTCCTGTTTC AACTGGCGCA CCAGACTTTG CAGCAGCTTT GATAATCATG 7260

AGGACTTTAG TCATCCCCCG TTCTGCCGTC GCGTCTAACT CTTTTTTTAC CTTGTCTTTG 7320

TAGCTGATAA ATTTAAAGTC ATTCTTCGCC ACTTAAAACA CCTTCAAACT TCAAATAAAT 7380

TTCATTGTGA TGCTTTATGC CAACAGGATT ATCGGAATAC GTAATATCGT AATAACGATT 7440

TGTTTGATCA ACCACTCGCA TATTGTCTGT TATCCCCTCA ATAAATTCGG GAATAATAAG 7500
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AATGTGTGTA GATTCTTCTA 
CAAGTCGATA TATCCAGAAA 
TCCGTCAGGC TTTTGGCCTT 
CCCCACCTCA TTTTTTTATA 
ATATTAnTCA CTAGCGTTCA 
CTTAATGCCT AACTTTGCGC 
TTTAATATCT GCTGGATATT 
AAAAGGCTCT TCAACATCTG 
GCAATCGTTT ACTTTAGAAT 
GTTATCTTTT AACACAATAG 
GTTTTGAAAA TTATTATTGG 
CTGAGTAACC GATGGATCAA 
CTCCTAAGAT GCAGGTGTAC 
TTCAAATGTC GCTTTACCTT 
AGTGATAGTA GCAACACTCT 
TGCGTTTGTG GCATCTACTG 
ACCCGTCCAT GTTTTCTGAC 
TTTTTTACTG CTTTTAAAAT 
CCAGCTCCCT GGATTGCTAC 
ATACCAACGC CCGCAATACC 
AATTTGCTTT CTGGAATTTC 
TCAATGTTAG CACTTGAATT 
TAAACATCAG GGTGAACATA 
AAAGTCTTAT GCGCCGTTGA 
ATCGTTTCAG AGGCTGCATC 
CAAGCAATCG CTTGTTCCTC 
GTGATATTAT CCACTCCTTC 
ATAGATTTAA TTTCTTTACG 
GCTACGTTTT CACCAGTGTT 
ATCAAAAATG TGTCGCTATC 
CGTAAAAAAT AAGATTTTTT 
TAAAATTTTG TAGCCATAAA 
AATGGATCTT TTTCGTTTGG 




CAAATGCGTT CTGTTTCGTT GTTTCATCTG TTCCTGTCAC 7560 

CTTCTTTAAA CAAGCCCCAA TCGTCAATAA GCCCGCCTAA 7620

TGACCTCTTG TAGAAAAAAG GTTTGAGGTG AGTACATTAT 7680

TTTATTTAAA AAACTGAACT TTGCTGCAGG AAAGCCCTCG 7740 

CATCGTAATA CGTGATAGAC ATTCTAGCGA TAGATTCTGA 7800 

CCATCGATTT TTTAAATCGT AAAAGTTCTT CAATACCCAT 7860 

CTACTTTTGT AATAAAAGCA CCAGAAAAAG AGCCATCAAA 7920

TTTCCAGTGT CTTTTCTGTG ATAGCTTCAA CAGTTGTTAG 7980

TACTAATTTG AATCGTATCA CCTACTCGTA AACCAAGAGG 8040

AATTACCCTT AAAAACAACA TCCCTAAAGC GTATATGAAC 8100 

TTAGTTGCCT AACTGTCTGT TCGAACGCGT CTAAGTCACT 8160 
TTTTTTTTGC TTCTTCAATG CTGATAATCA AGTTAGCCAC 8220

CTGTTACTGC AATAGAGGTT GTAAACGCAC CGGAAGTAAA 8280 

CTGCAATAAT TGAGCCATCA AATCCACCGT TTTTATTTTT 8340 

CATTAGTAGA TATCACTGTA ATTGCTGAAA TTACAGCTTG 8400 

GATTAGCTGC AATAGTAAAT GTCTTAGTTG CGCCTACTGC 8460 

TAGGCGTGAT ACCCGTTGCT GGATTAGTAG GCGTTACGCT 8520 

AGCTACTTTA TTTTTTTCAG GGATGTGTTT ACCATATTTT 8580 

TCCGTTAAAA TCTTCGGAGT CAATAGCACG TGTTACAGAA 8640 

TACGCTATCC GCGGAAAATT GAGCCATTTC CCCAGTTTGT 8700 

AACCAATACA AAACCTTTAA ATTTATATAT TGTTTGTTCA 8760 

TTTAGTTGTT GTTGCTAAAC CATTGTCTAC TAAAAAATCA 8820 

AGCAACCCAA GCAAGAGACT TAGAAACTAA ATTATTCACA 8880 

GAACAGCTTA GTTACTCCTT CACTGGTTAA TTCAAACCGA 8940 

AGATAATGCT TTACCTAATA AACCGTCGAT ATACTCAGCC 9000 

TAATCTTTCA GCTACTACTG CGTCAGCGTC aTCGTTTACT 9060 

GTGAATCGCT AACGGAGATT CATAAGGAAC CGTTGTATCA 9120 

TGGCCCAAAA CGATTAGAGT TGCCAGTTCC TGTACCAAAC 9180 

GTAGGGTTGC ATTACCACAG GTGTATCTGA AGTTTTTAGC 9240 

TTTTACAGCA TCCGACGTCT GCAACTCTCC TCCAAACGCA 9300 

CTCTGTAATT TTGGCTAATA AGCCTGCATA TTGTTTCGTG 9360 

AAATTCCTCC AATTACTCGT ATTTTGATAA GATTGCTTCA 9420 

TTCCTTCCCT TTTGGCAGTT TGTTATCAAT TGGTTGATAG 9480 




CCtGCTTTAT TTTCAGTCTT 
TAATCAGGGA TGGAAGCTTT 
TCTTTGTCAA GTTCTAACTC 
GCACTTTGTA ATGCAGCAGA 
AACGCGTCTG CCTTTTCTTT 
GTTTTAGCTT CTAGCGATTT 
TTATATTGGT CTTTTGGTAC 
GCCTCCATAT CCAACGTACC 
TCCATCGTTT TCTACCTCCA 
ATATACCGTC TGCTCGGTAG 
AGACTTATCA ATAAGCAAGT 
ATCTAGCATG CCTAGCAGCT 
ATTTGGCCGC TGTACAACAA 
TTCGTAAAAC TCTTCGTAGT 
TCCCTTAGAT ATAAAAATAG 
ATTATCTGCA TCCGCTTGCC 
ATAATATTCC TGTCTCAACC 
ATATAACTCA TCAAATTCCT 
TACCTTACCC AAGAATTGTC 
AGTTTCTGTT CAGTAGGAAT 
CGCCTTGCGT TCCTTTAGGA 
TAAAATTTAA AAAATCCTCG 
ATACATTTCC GTTCAGAATA 
TATCATGTTT CAATTTAAAT 
GATAAGGTTT TTCTCTTGGA 
ATGCGTCTGA TGTATAATCT 
ATTTATCACT ATCATCTTGC 
GCTTTGGTTT CGATTCATTT 
ACTCCGACAT CTCTTTAGAT 
CTTCAATTGT TGTACATCGG 
CTTTTGCCGA ATGCCCTCTA 
ACTCTTGGTG GGTTCGGCGT 
TGATGCCCAG CTCTTTTGCT 
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TTTTTTCGAC ' TTATCATCGA GTGTATCCTT CTTTTCAAAG 954 0 

CAAGTCTTTC ACTTTACTTT CTAAGTCCTT CACATTCCCA 9600 
ACCTAATTTA AATAAAGCAT AATCAATATC CTTAGCCCCT 9660 
TACTTGGCTA TTGATTTTCA AATCTTTGTT TTGCTTTTCC 9720 
TAAATCAGCT AGTTCTTTTT GAACATCCGG ATTGTCCTTT 9780 
TAAGGTTTTA TTTGCTTCAG CTAACTGGCT TGAAAGATTA 9840 
TGCATTTACA GGAAATTCTT TGTCAATTTC TTTATTAGCA 9900 
ATCTTCTTTC ATATGTTTTG CTAAAATTTG TTTGATCCAT 9960 

TAGCATTTAT ATAGCGGTCG CTGCCGCTTA GAGTGTCAGA 10020 

CGGTGCCCTT TTAACGTCAT GGCTCCTGGA CAAAATAAAA 10080 

CTTAACTCCT CATATCATCT AATACTTGTT GTTTTGCTAT 10140 

GCAAACTTCC ATTTTGACTG TAATAAATAG TGACCTCTCC 10200 

TTGTGGCCTG TTCAAAATCA GGCTCTTGGC TTGATACCCA 10260 

CTACATCTAA ACAAGCGTAA GGATCTTCCA TCTTATGGCC 10320 

CACTCAGCCG CTTAATGGGT GAGTGCTACT ATTCGTTATA 10380 

ATCTATCTAG CAAAGTGGGT TGATTAAGTT TATGTTGTTG 10440 

GCTCACGTTC CTCAGGCGTT TTCGCTGTGT CTAGCAGCTT 10500 

TATCAGTCAT CTCTAAAGCG cTACGCCTAT AATCCACATC 10560 

ACCTCCAATA TATTATCGAC TATCTCCACT GTATACTTCA 10620 

TCACTTTCCT TTTTCATCGA ACTGTTCGTT CCTATATAAG 10680 

ATCAGGAATT TAACTACTCT ACCATCCTTA TTTGCACTGG 10740 

GCAACTTTTT TGTTGATACT AGTAGACTTG AATTCTTTGA 10800 

GCATCGTATT CATTTGCAGA TACACCTCTA TAAGTTATTA 10860 

TTACTAATAG CATCATCTAA ATTTTGAATA ATATTGACAA 10920 

TTATATTTTT CATTTCGGAG TATCTTATTC ATTTTAGCAA 10980 

TTCATAGCAT TAACTTGTTC AGGCTTCAAT CCAGCAACCC 11040 

AGATCAGATT GACCGAAGAA GTCCAGCTCA GTAAAATTCG 11100 

TGTCTGATTC TATCTGCATA CCACTCATCA TAGCTTTTAA 11160 

TCATTATCTT TTCTAAGTTC AGGGCTGATA CCGTCGACAA 11220 

CAATTCACAT CTTCTGATGC TACCCCAAAC ATCCGCGGAC 11280 

ATGGTAAACT CTTCGTCTAC ATCAACCTCT TTACCGTCTA 11340 

GTATGTTTAT CGATCGTGGC AAGCCATTTC TTTTTAATAT 11400 

TCTTCATATC CTTTTTGGGT AGTGACAGAC TGAGTACGTC 11460 
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CTGCTTCTGT 
TCCATCTAGC 
TCACATTTTG 
CTACAGGTGC 
CTATATTCTG 
TTGCAGAATA 
TAATTTCATC 
TTTCTGCATT 
AAACATTAAA 
CCTTCTGCCA 
TTGTCCGTCA 
TCAAGGTCAA 
ACTTGCAAGA 
ACAATATCTG 
TCAAATGCTT 
AGTAAAGCTC 
GCTTTTATCA 
CCAAACTTAT 
ATCGTTACTG 
TTAACATTTT 
GCTTCCGCAA 
ATCAGTGCCT 
AAAGGTATTT 
ACATGTGGCC 
GCTACGAAAA 
TCGCCGTCTT 
TCATTGTATA 
GCATTGGTTC 
TGTAGCACTA 
TTTTCTGTTT 
TCTGGGAAAA 
CCCTCATCGT 
CGAATACCTG 



TCTTGCAATA 
TATTTCAGCA 
GGCTAATTCA 
ATTGACGAGA 
TGACTGCTCT 
GCCTTTGATA 
AGCAACACTA 
TTCTGTATAA 
TAGCTGATTA 
TTTGTTGAGT 
AACCTTGCGT 
ACTGTTCGCA 
TAGATTCAAG 
TTTCATTCAC 
TAGTATAACG 
GTAGTCTGGC 
CTATGTTGGT 
AGATATTTTC 
TTTTAACATC 
GTCTTAATTT 
AATCTTGTAA 
TAATAGACTT 
GTCCGTAGCT 
TTGGATTAAT 
AATAAACGTT 
TCTCGATTTC 
CACCAAACAC 
TTGCATAAAC 
CTTGAAATTC 
CGTATTCAAC 
AGCCGTGCGG 
CTACGTAAAA 
TCTCTGCCTC 



CGTAATGCTT 
TAACTTTTAC 
TCACGGTACT 
TTCATAATAT 
AGCGTATACC 
GTTTTTTCAA 
AAAAGTCTCT 
ACTTTTAGCC 
TCCGTTTCCT 
TGGGACATTT 
ATATTCCGCC 
GATAAGGCGT 
GATAATTTTT 
CATTACTTCT 
ACGATTAATG 
TTCAGTCTTG 
GATATTCCCG 
TTTGTCAATC 
TAAGCCCCCA 
TGAAAGATCG 
ATTGTTTGAT 
TAAATCCGTC 
TCGTTGCAAT 
TGGTTCAGCT 
CTGGTCAGTC 
AGTGATATAA 
GTTTAAACTA 
ATACTCAAAA 
AGAATTGTAA 
AGGATTAGAT 
GATTCGTACA 
GATACGATTG 
TCTTTTTGAG 
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GTTTGTAGCT 
CCTCGAATAA 
TGTATAAACG 
AATCATGGTT 
ACAGTCCATA 
CTTTCGGTGA 
CTACTTCCAA 
GCTTTTTGAT 
GGTAATTGGC 
GGGGTCACCG 
TCTTCTAATG 
AAAACGTTGT 
CTTGTCTCTG 
CTAGTAAACG 
TCATCAATAA 
TTGGCTTTCA 
TCGCCTACTT 
TCCATTTTGG 
TCTGAACCAG 
TCACCTTGGA 
AAAAAACAAT 
GTTTCCTGTT 
AAGCTTTCAT 
TCATCTAATT 
CACACTTCCG 
TGACGGCAAA 
TCAGCCACTT 
CCTTTTTGGC 
TACTCTGCTA 
AAAAGGTATT 
TTCGAGGCGT 
TTCATGATAT 
GTAGATTTTC 



AGCTTCTGTT 
GCCCGTTATG 
CTTTGAGAGC 
AATTAACGGC 
ATATCCCTGT 
GTTTAACTGA 
ACGGGTGGAA 
GTCGATTAAT 
TTTCTGTAGA 
CCGTTTCTTC 
CCTCTTTCAC 
CGTCGTCAAG 
CTTCGGTTTT 
TAAATGAAAC 
CAAGCTTATT 
TGTTTAATAG 
GAGTTGAATC 
TTTTACGCCC 
TACCAACAAC 
AACCTGATAC 
TCATTAAGTC 
TATTATTTGA 
TATCGCTATC 
CATAATCTTT 
CGTGATGGAT 
TACGTTGTAA 
GAAAACACAA 
TTGAACCTTC 
AATACTCTTT 
GAGTTTTTTG 
ATTTGTCTTC 
CATTTTCATG 
GATCTTTGTC 



TCCTCATTAA 
ATATTGTTAG 
CTCTTACCCG 
ATACTTAGTG 
TCAGCTTGTT 
AGAATGGCAT 
AAAGAAAGAC 
GATTGACGAT 
GCCTGTAGCT 
TTCATCGGTG 
GTCTTCCCAG 
GCGAGGTGCT 
CTCATTGTTA 
TTCTGTTGGG 
CATCCATTCT 
TGTATAACGT 
AAAGGCCATT 
TTCAGTTGGA 
CTTTTTAGAT 
AACGTAAATG 
GTAATCATCT 
TAACCGATAA 
AACTGCTAAA 
GTTATCCTCA 
ATCGACTGTC 
TTCATTGTAT 
TCGATCTTCT 
CACAAGCTCT 
AAGTTCTTCG 
GTCAACAATT 
TCGCAGTACC 
GTTATAGTAC 
AATCGCCGCC 



11520 
11580 
11640 
11700 
11760 
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12300 
12360 
12420 
12480 
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12600 
12660 
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12780 
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TTTAAAGCAC 


TAGCGATAAT 


CTTCACGTCT 


TCACTAAGTA 


AAGCTTCCAT 


TTACAGCCTC 


13500 


CTTTCTAGTA 


TAGCCATTTA 


TTTGTGTTCT 


TAATTTCTCT 


AAGCAGACTA 


GCTGCACTGT 


13560 


CTGGTGCGTC 


GTCGTGCTCT 


GCGTTCTCTG 


TATAGTCTAG 


TATCTCGGCT 


ATATATTCTT 


13620 


TGTCTGTGTC 


TTCAAGCCAA 


ATTATTCGGC 


TCCAATACTT 


CCGCAAATAA 


GAAGATATCT 


13680 


TGATAAACTT 


ATTTGTTTTT 


TCGTGATACT 


TTTGCACGTA 


TTGACTTCGT 


TCGATTAGGT 


13740 


GTTTAGCTAA 


ATAACCTTTA 


TCGCCGTTCG 


TTTCAGTGTA 


AAATGTTCCA 


GCTTGGTAGT 


13800 


ATTGATGTAG 


TTGCAATATT 


TCAGGTAGGC 


AATCATCAAC 


GTGTTTCTGC 


CATTTCTTGC 


13860 


CAAACCCGAT 


AATGGTTCCG 


TCCTTTTGCT 


CTTTAAAAAT 


GGTAAATGCT 


GTGCTATCAC 


13920 


CACCTCCGTA 


TGCCGCGTCG 


ATGTGTGCTA 


CACCGTTATA 


AATAAGGTTA 


GTGTTATCGA 


13980 


TATAGGTCGG 


TGCAGTAAAT 


AGCGATTCAC 


TATCAGCGAT 


GTGTTTTAAC 


TCGTAGTTCG 


14040 


CTGCAAAGAG 


CGACGGTGTC 


ATAGACTGCT 


GTAACGCTTT 


ACGTTGTTCT 


TTGTCAATTA 


14100 


ATCCTGTTTC 


GTAACAATCG 


AATTTCTTGA 


CGTTAGGCAT 


TTTAGAAATT 


GCATCTTCCT 


14160 


TATGCCAAGG 


CGTGCCTGTG 


TTAATAAAAC 


GGCCCTCACG 


ATTCTTCACA 


TTCTGCAATT 


14220 


CTTGATACTG 


CAGCTTTGTT 


TTCTCTCGTT 


CAGCACGACT 


TACTCGGTCT 


TTAATGTTAA 


14280 


CAATATCATC 


AGTGATAACG 


ATATCCGCGT 


GTTTACCTGT 


TAATGAAGCA 


TAAATCCCCA 


14340 


TACCAAGTAA 


TTGAGATGTT 


CCCCGGGTAG 


ATGTTTTTAA 


GTTGGTATCT 


ATTTCTGTTG 


14400 


TAGTCTCTTT 


CAAAAGCACC 


AATTCAACAC 


CATATAATGC 


AAATACAAGC 


GTTTTAAAGT 


14460 


ATTTGCTAGA 


TAAAACCTTA 


GCCACTTGTA 


AAATAATCTC 


TACGACGTCT 


GTATCGGTCT 


14520 


TACGTAAGAA 


GATAATATTT 


TTGTTGGGAA 


AAAGAACCAT 


CAACAATGCA 


ATAGCAATCG 


14580 


CCAATGTGGT 


TGTTTTAAAT 


GAGCCACGAT 


GAGCCAATAG 


CGTTTGATCG 


TCTTTCTCGA 


14640 


ACAAAAAAGA 


CTTTAGCCAA 


TCATTGTGCA 


GTTCTGCTAA 


GTCGGTAAAC 


CCAACTAAAT 


14700 


TTCCGAAAAT 


TACCGGGTTC 


GTTTTGATTA 


AGTTCAAGTA 


TTTTCGCTTT 


TTAGGGCTCA 


14760 


TGTAGAATCA 


TCCTCGAAAA 


ATTGCTCGAT 


TTCGACAGCT 


GCATCGGAGA 


TATTAAGCAC 


14820 


GCCTGAAACT 


TCAGTTTCTT 


TTCGATCTCG 


CCATTCGTCC 


GGCTTTCTAT 


TTTTCAACCA 


14880 


GAAAATAGCT 


GCAGTTGGAT 


TAGGAGCTAC 


TTGTTTTTTA 


ACCTTTTTTG 


TAATTTCCAT 


14940 


ACCGTTCTCT 


GTTAGTTCTT 


CTGTAACTTC 


GGTATATTCG 


TAGCCTGTAG 


CACTCTTAAA 


15000 


TAAAGCATTT 


TCAACTTGTC 


TATCAACGAC 


TTCTTTTCCC 


TTTTTTAAGG 


ATGCCGAAAG 


15060 


TGCCGGAAAT 


TTTTTTACCC 


r\x l u 1 V, 1 \jtvi 


A CTTC A AT 21 21 






15120 


TATCTGCTTA 


TCGATGAGGC 


CATCTCGTGC 


CCATCCTTCG 


ATTTTGATTA 


ACCCTTCCTC 


15180 


GGTTAGCCAC 


TCTGTGTAcT 


TCGCCATGAC 


CTCACCGCCT 


TTCTGTATCA 


AAAATAGACA 


15240 


CCTTCCCATT 


TTCTTGGGAA 


GGTGCCTTTT 


CTGCGTAATT 


TTACTGATAA 


CAGTATATCA 


15300 


CGATTTATCA 


ATGGGAATTG 


TCACATCATT 


GTCACTTTTT 


TCGATTTTCT 


TCAAAATCTG 


15360 


GTTATGTTTA 


TTTCTGATAT 


AGCCATAGCT 


ATGCCCAAAT 


TCTCCCGCAA 


TTTGTTTTAA 


15420 
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CGATAGCCCT TCGATATACT TTGCTACTAA 
AACTAGCTTT CGTAAATCGT AAATTGaGTT 
TTCGATAATA TCTTCCAACT TAGAAGCTTT 
ATCCTCTGGG TCACACCATC TTTCTAACTC 
GTAATAGATT TCTTGTTCTA ATTTTTGATA 
ACCTACCTTC TTGTTCATCG CTTTCACTAT 
TTAGAGTTAA TAATGCGGAT AGCAATATAA 
TCGAAATAAT CCACTCCGAC ACTTTGTATA 
ArGATATTAA TTTATCCAAG TTTATCCCCC 
GCCATTAGCG CTGATTGCCC TTGTTCTTTT 
ATAGTTCCTT TCGTCACTAA GTGATGAATA 
AATCTGGCAT TGGCTTGTTG ATAAAACTCC 
ATGTGCCCGC CTTTTTGTAA ATTTAGTCCG 
AAAGAAATTT TTCCCTCGTT CCATTTCTCA 
GCTTGTTTAA ATCGTGCTTG GATTCTTTCT 
ACCGATTGCC CTTGTGCGTC CTCTATAACG 
ATTTCCCTTC CGTCGCCGTT TTCGTCGTAT 
TTTGATAGTG TAGCCGCATT ACTAGCTACA 
TCCCGTTCTA GCTCTTTGTA CTGTTTCCAG 
TTTTCCGTTC TCGGTGGCAG TCGCAAATAA 
CTTATTTTGT TGTAAATCGC TTCTTCTGCT 
TGCCCGTTTT TTTGTGCGGG GACAAAATAT 
AGCCGTTCGC CTTGGTCCAA TAAATACATC 
GAAGGTGTGC CTGTTAGCCC GATAACACGT 
TTAAATCGTT TGGCACTACT TGACTTAAAA 
AAGGGCCAGT TTCGTTGATA GTAATTGACT 
TAGACATCTG CCTTTTTGAA TAAAGCTTCT 
GAAAAAGTGA GGTGCTGCAG ATGGTCCCAT 
GCAACTGATA ATGGGGCAAT TACTAACACA 
TCGTCAATAG CTGTCAAACT GGATAAGGTT 
CAGTAAGGAT GATCAAGAAT AAAATTCTTA 
TTCAAACTCA ATCACTCCTT TTTATGGGAC 
TGCATTGCGT AGATTAACAT ATCCACTCCT 
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GATTTTTTGA TCTAATCCTT TGAAACTATA • 154 80 

CATTTTCCAC GCTAAACGTT TTTCTAAGTC 15540 

GGAATCTTTT GTTAGTGTGT ATCTTCCTAA 15600 

TTTTTTATAC GTTTCTAGCT CCGAGTCTAA 15660 

GCTGACTAAC CATTCGTACA ATGTACTTTC 15720 

TAACCGTAAC TTGTACATTT TTATACAGAA 15780 

ATCTAACATC ATACGGTATC TTAGCTAATA 15840 

TCATCATAGC TAAACCTAAA CATAACCAGA 15900 

CTATACTCCT CAATTTTGGC TTTCACTGCA 15960 

GCTTGTAAAG CTTTGATTAC TTGTTCGTCA 16020 

ATAACAGGCT GTGTTTGTCC TTGGCGGTCT 16080 

AAAGACCATG TAAGCCCAAA CCACACGATA 16140 

TGCCCTGCTG ATTGTGGATG GGCTAAAAGT 16200 

ATATCCCCGT CAGACACATT TAAAGCCTTT 16260 

AGGTCATGTT GGTATTGATA AAAGACTAAA 16320 

CGTTCTAGCG CATTTAACTT TTCTTGGTGT 16380 

ACAGCGCCGT TAGACAGCTG TAAAAGCTTG 16440 

ACGTCTGTTC CTTCGAGTTC TAACACGTAT 16500 

CTTGTCGGGT TCAAGTCTAG CTCGATAATA 16560 

TCTTTTGCTT TCATACTCAC GCATATATCG 16620 

CCTGGGATTA ATTGCCAAGA ATACACGATA 16680 

TTATTACGGT ATTGTGTGAT AGTCTTTCCT 16740 

TGCGCCCACA AATCCATCAA GCTGTTTGGG 16800 

TCCATCTTCG GCCGTACTTT TCGTAATGCT 16860 

CTGGATAACT CGTCTATAAT CACGGTTTTA 16920 

AACCACTCCA CATTTTCACG ATTGATTAAA 16980 

TCACGTTGCT TTGGGTTTCC TAACACTTTT 17040 

TTTTCAATTT CATCGGTCCA AGTTTTCTCG 17100 

TTTTCAATAA TTTCAAACGT GTGTAACAAC 17160 

TTCCCTAGTC CCATATCTAA CATCAAAGCA 17220 

GAGTATTCCT GATAGGGATG TAACGTTGCT 17280 

ATATCAGGCA TAACTCTTGC TATACCGCTT 17340 

TGCTTGCTGT CAATGACAAA CACTGGGATA 17400 
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AGCTGCTCTT 


TGaATTTCTC 


TATAATTTTT 


AACTGGTCTT 


TTCTTGGTTT 


ACCTCCAGGC 


17460 


CTCTTTAATT 


CAACGAAAAA 


CACGTTGCCT 


TGATATAAAA 


TAATCCTGTC 


aGGAACACCC 


17520 


CTAGTTCCTG 


GGGAAGTGAA CTTATAACAT AAAgCTCCtA TGCGTTTTAT CTGCCTAACT 


17580 


AGATACTTTT 


CAATGTCGTT 


CTCAATTTGC 


ATTACTTCGG 


CCTCCCATCT 


GATAAGATAT 


17640 


TGGTTGATGT 


GTTCCATCAG 


ATCTATTTTC 


TTCAACCTCC 


GTTTAAAAAT 


ACCGTGTCAC 


17700 


ATGTAACACA 


AAAATTCTTC 


AACTTATATA 


TATTCCCCAT 


GTACATATAT 


ACATATACGC 


17760 


CCTATATATG 


TATATTAATT 


ATTTCTATAT 


ATGAATAGAA 


AATAGTGTTA 


CAGGTGTTAC 


17820 


GATTGTnCTT 


CATTCTTACT 


CTCCCAAGAT 


GTAATAGCGT 


AACACTCCCT 


GTAACACCAA 


17880 


CGGGAAATAA 


GGTGTTACAG 


GTGTTACAGT 


GAATATTTTA 


GGTGTAACAC 


TCGTAGCTTT 


17940 


TTTAGGTGTT 


ACACCTAACC 


CCGCGTCATT 


ACTACCTTTT 


TGTTTTAAGG 


TGTAACACGT 


18000 


AAATAGGCTA 


CTTGTACTCC 


ATACCCAGGG 


CCAAATCTTA 


AACGTCCTTT 


ATTACCTTTG 


18060 


CTGTTTTTCT 


CCCAACCAGA 


TAAATGGCTT 


AAAACTTGAC 


GAATTTCTGC 


AGCTTTTGCA 


18120 


GGATGTATGT 


TTTTAGAATC 


TCCGTTATAC 


AATTCATTCC 


AAACCTCTGC 


AATACATACT 


18180 


TTGTTTCTAA 


CAATCTCGCC 


TTCTTCCTGA 


ATATCCGTCC 


CCCACCCCTG 


AATATACTCT 


18240 


CGTCTTTCTT 


GCTTACTTCG 


TTTGTACCAA 


TCTTCCGTTA 


TTGGGATTTC 


CAAATATTCA 


18300 


AGGATTTCGC 


CTTCCATGCT 


AGAGGTTTCC 


GTGTGCATTT 


CTTGGGCTTC 


TAGTGCTAGT 


18360 


TTTTCTTGTT 


CGTCTGTTAG 


ATACAAAGGT 


TCTCCTGCTT 


GCCACAACTC 


AACGGCTTCT 


18420 


GcCCATATCT 


GATTGCGCGT 


TTCGTCGGTC 


ATTTCCCACA 


CTTTATTTTT 


AATCGGTTGA 


18480 


ATGCCGACAT 


CTACTGGCCA 


AAAACGCCGA 


TTTCCTGTTT 


TATCTCTTAG 


AAACTCGTTG 


18540 


TCGTTCGTTG 


TTCCCCAAAA 


GACACAACGG 


CGTTTAAAAT 


AGGACTTATG 


CCGACCGTAA 


18600 


GCAACGCGGA 


AAATGTCTTC 


TTGCTTGCTA 


ATAAAATGCT 


TGGTGGCTTC 


AATGTCTGCT 


18660 


TTTTTTGTCG 


CTGACAATTC 


GCCCATTTCC 


ATGATCCAAA 


CACCTTGCAA 


CGCTTCGTAA 


18720 


GAGTCTTTCC 


CCGTCACACC 


TTCTAAGCTG 


TTAGAAAACC 


AGTCTCCCGC 


CAATTTTGCA 


18780 


GGCAATAGCG 


TTTTTCCTAT 


TCCTTGCGGA 


CCGCTAGTCA 


CTAGCATGTA 


GTCAAACTTA 


18840 


ATCCCTGGCA 


CAAAAATACG 


CCCTACGGCA 


GCTGTAAGAA 


ATTTTTTCGT 


AACTACTCGA 


18900 


ATGAACGGTG 


TATCTTCTGC 


GCCTAAGTAG 


TCTATTAGTA 


AAGTTTCTAC 


GCGAGGCACT 


18960 


CCGTCCCAGT 


GCAAGCTCTC 


TAGGTATTCT 


TTCACTGGAT 


CATATGAGTT 


TCTTTCGATT 


19020 


TCTTGGACTA 


GCGCATCATC 


GATTTTGCCA 


CGGTTCACAA 


TACCGTAAAT 


TTTTTCGATA 


19080 


TATACCCGTA 


ACCCCGCATC 


GTCGCTGTCC 


TTCCACATCT 


TGTCTCGATC 


TAACTTCCGC 


19140 


CAAGGCAAGT 


TGTCTTTCAC 


CTCAATACGA 


TTAGAAAAAC 


TGTTCATAAA 


AATTTTCTTT 


19200 


TTGAGGTTCG 


GGTCATTCAG 


CATAATGACT 


TCTAAATTTT 


TCGCGGAAGC 


CATAATCTGA 


19260 


CCGTATTCGT 


CAATTTCGAG 


GTCCAATTTT 


GTAAACCAGT 


CTTTGTCGTC 


CTCTTCCAAT 


19320 


TCGTCTAGCT 


CGCCGTCAAA 


GTCTTCTAGT 


GCTTGTGACA 


GCTTTTCACT 


CTGAATCAAG 


19380 
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GTCTTAACTT 


GCTTGTCTTC 


CATTGCGAAC 


TCTCGCATCG 


CTTTAAACGA 


TGGATAACGG 


19440 


TCAATCCGCG 


TCGTAGGTTT 


GACGTCTTCA 


TCTAAATCGC 


CAAATAGATG 


GATACGGACA 


19500 


AGGTCAAACG 


CGTTCACCAG 


TTGATCGCCT 


ACAGGGTCCG 


TACCGTGATG 


AGAATAAGCG 


19560 


AACTTATCAT 


CATAAATCAC 


TAGCCCACCA 


CTAGTAGAAC 


CTTCTAAAAA 


CGTCCAACGG 


19620 


TCCTCTCTTC 


CTGTGGACCC 


GTAGATGTCA 


GGCAAAAAGG 


TTTCAATCGC 


TGAAATGATG 


19680 


TCATACGTGC 


GACAAAATGC 


CCCGACAATC 


CCTTTTTTCT 


CTAATGGGTC 


GCCTGCTTTT 


19740 


TTTGCTTGTC 


TTTCGCGGAT 


AGAATGCCCA 


CGGGAACTTT 


CTGGCCAAAA 


ACTTGCATCT 


19800 


CGCCAGTCTT 


CGTATTGGCT 


TAGAATTTCG 


TCAGGATCGA 


CCCACGGCAA 


ATCGATGTTA 


19860 


TCGGTGAAAT 


ATTCCCCGTC 


TATCGAATGA 


CTGGGCCAAT 


ACATTAAACG 


TTCTGCCTGA 


19920 


TAGGTCGTAT 


CATCGAAGTT 


ATCCATGCCG 


AAAAACTCCG 


CAAGCTTTCT 


CGCTAGGGGC 


19980 


TCATACTCTT 


CGGCGGTCAC 


TGGGCGACTT 


AACGGAATAA 


TCAAGCGATA 


ACGTGGACCT 


20040 


TTTACTAAAT 


GGCTGTGTGT 


CGTGTAGACA 


GCTGCAGCGT 


GGTCAAATAA 


CAGCTGTACA 


20100 


TCGTCCCAAA 


AGTCTAACGT 


CGTGCTGTCG 


GCATCTAACG 


TAACAAGGcT 


ACGCTGTTGC 


20160 


GTGTTTCCTC 


TTTTCCGTCG 


TCCTTCTTTT 


AACCAACCAC 


CAATAAACGC 


CCCAACGTCT 


20220 


TTAACTTCGC 


CCTGCTTAGA 


CTTTGGCATT 


TTTTTGTAGT 


CTTCGACAGT 


CTCTTGTGTC 


20280 


ACTGTCGGCG 


TTTTCAATCG 


TTGGGTAAAA 


TCCGACCAAG 


ACATCTGCCG 


ATTTTTCCAT 


20340 


TTTTTCTCGG 


TTTTTGAAGC 


ACCCACTGCA 


AGATGGATTT 


CACCATCATA 


GGCTAATTTG 


20400 


ATGTTTTTTT 


CTGGCTGTTC 


CATCTTTCCG 


CTTCCTTTCT 


TTTATTGTTT 


AAGGATGAAA 


20460 


CCTAAACAGT 


CAAGCCATCC 


TTCTTGATAG 


TCACACGTAG 


AGGATTTATG 


ATCATAATTA 


20520 


AACGCAGCTC 


TCATTGCTTT 


AAGATATTTA 


CGCTTACCAA 


CAACGTAGCC 


CAACAAAAAG 


20580 


CAAAGTACTA 


AGTTTATTAG 


TATAAGTATG 


TTATTTCCCA 


TTTATTAACA 


ATCCTTTCCA 


20640 


TATTCGTACT 


CGTAAATTAG 


ATTAGTTAAC 


CAGCTATAAT 


CTAGCTCTTG 


CTCTTCGTCA 


20700 


AACTCCTGTT 


CGTTGACATA 


AGCAGCAAAA 


TCTACGATAT 


TTTCGATATT 


TTCCTCATCA 


20760 


TAAAAACCCC 


AACTCCAAAT 


GAATGCTTTA 


ACACGAACGA 


GTTCCCCAAT 


ATCTTTTTTA 


20820 


AAAACAAAAA 


CGGGCTCATT 


GACCATTTCA 


ACACTCATAC 


AAAATTCACA 


TGTCCCCACT 


20880 


TCTTCTTCGT 


GAGGATTAGT 


TTCAACATCA 


ACTAGTTTTA 


GTTTCATTTT 


CTATCGCTCC 


20940 


TTTTTCTAAT 


TTTTAGTGAT 


TTTCAATTCC 


CAATCATCGC 


TATCAAAAAA 


TACAAATTCT 


21000 


AATTCCTCAA 


AATCGTTTGT 


ATTTTTTATC 


AAATCTTTAA 


ATTTCTTATT 


TGAACAAATA 


21060 


TTTCCAAAGC 


GTTTTAAAAA 


GCGGTGCTTA 


AAATTTTCAT 


AATCACTGTA 


ACTAAATTCA 


21120 


AAAACTTTCC 


CAGTTGAAAA 


AACTCCACCA 


GTTGTCAGTT 


CTAACTTTAC 


ATAATTTAGT 


21180 


GCCATTATCT 


GAACCTCCTA 


ATCTTTCATG 


TAATATTTCG 


TTTCAAATCC 


TTCGGCGTTT 


21240 


AACGGCAAGC 


CTTCCGCCCA 


ATCAGGAACT 


ACAGACATGA 


TTTTGTTCAT 


TTCTTCAATA 


21300 


GATTTTTCCC 


CTTCTGGCAC 


TTCGGCTACT 


GCCTCATCGT 


GCACATGGAA 


AACAATAGGA 


21360 
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CTGCATCGCT 


TCTGCTAAAA 


CGTCTCTTGC 


CGTTGCTTGG 


21420 


ACAA7G7777 


C7ACCAATTT 


ACCGCCGTAG 


GTTTGCTGTT 


CCGTGAAAGC 


GACTTTATCG 


21480 


r^rnrr* {-* ~ ' 

Uuli w 


CA7AAAAAAT 


AGCAGGACCG 


TAGTCACCTT 


CTTTTAAATG 


CGCTTTTGCA 


21540 




T^CG'CCGCT 


TGGTAGCTGA 


ATAAACAAGA 


ATCCTGCTTT 


TTTATAAAAT 


21600 




GAGGGCCTTT 


TTTAATGCCG 


CCGTTTTGTA 


AACAGTCAAT 


CACAGCTTT7 


21660 


7GGGTT7C.-.7 


GCCAAAAGTT 


CTTAATGCGT 


TTATTGGCTG 


TACGCCAACG 


GTCCACAATG 


21720 


7CTTGTAA77 


CG7G7TCTTC 


AATGCCGTTT 


TCTAATGCAC 


CCATTGCTTT 


AAGCGCCCCT 


21780 


GGTCCGC777 


GGTAACCAAG 


CGCCAATGTC 


GCCACTTTTC 


CACG7TGGCG 


CATGTCTTTA 


21840 


CCTTCG7GGC 


T7T7CCAG7C 


GTAGTCCGTC 


AC7tCGCCTA 


AATGGAACAT 


CTGCGCCGCT 


21900 


GTTGCT7CGT 


AAAT7TTGCC 


GTGTGTGCGG 


AATACTTCTA 


ATACCCAATC 


TTGTTTGGCA 


21960 


TACCAAGCAA 


7CAC7CGGGC 


CTCAATGGCT 


GAAAAGTCAG 


ACACGATGAA 


CCGATGCCCT 


22020 


TCTTTGGCAA 


CTAAACCTGT 


TCGGATAAGT 


TGTTTCAATG 


TGTCTGGCAC 


ATCTTCATAC 


22080 


ATTAAT7CGA 


TGCC7TCAAC 


ATCTTTTGCT 


TTCACAAGCT 


GACGGGCAAA 


ATCAATTTCA 


22140 


CTTAAATAG7 


TTC7AGGCAA 


GTTCTGTACT 


TGTAATAATC 


GCCCTGCCCA 


CCGTCCTGTG 


22200 


CGGTTGGCA7 


CG7AAAATTG 


TAAAATGCCA 


TGAATGCGAT 


TATCTGAACA 


ACGTGCATTA 


22260 


7CCATCATCA 


GATA7TTTTT 


CGTGCTTGAA 


TTAGACAAAC 


TAAGGCGCAA 


T7TTAATACC 


22320 


TCCGCCACA7 


TCTCGGGCAA 


GTTTCCTAAT 


GCTAGAGCCT 


TTAACACAAC 


CTCTTTTCCT 


22380 


AACTTC7CGA 


AAGGCGTTCC 


TTGTTCTTCT 


AGCCACTTTT 


TCAGTTGCGC 


TAAGCTGTTA 


22440 


GGATT77CCA 


ATCC7GTTAA 


AGCCTTCATT 


TCGTTTAATC 


CCGCTTCGCT 


TAAATCGGCC 


22500 


ATAATA7CAA 


7AGCAGCCGT 


TGCCAATTCG 


7GGTCAATTT 


CTGCCCCTCT 


GTCGTTAATC 


22560 


CGCTGA7CCA 


AGG7GTAAAG 


CTTCCACTCA 


CTTTCGGGGA 


CAGGGAAGCG 


ATTCAACTTA 


22620 


TTCGCGATAg 


CCATTTCTAC 


ATTGACGTCT 


TGAATACAAT 


ACTCTATAAA 


CGTTTGCCAT 


22680 


7TTTCGGGAG 


CGTG7TCAGG 


TAAATTCCTT 


GTACGCATGC 


CGTTTTTCTT 


GGTCGGCTTA 


22740 


CACGGT7TAG 


AAAAGAAGTT 


AATCAGTTGC 


GTACCTCTTG 


TGTCTTTTTG 


CTGTTCGATG 


22800 


7T7AAATA77 


TCGCACATTG 


CCCTAAAGAA 


GCTGGCAACC 


CTAATTCGTT 


TGCGTGAACC 


22860 


ATCGTACAGT 


GCCATTGTGC 


AGGGTCTAAG 


TAGTACGGAA 


TACCTAAATA 


CTTAGACAAA 


22920 


CAAACGCGCT 


CAAA7TGGGC 


ATTAAATGCA 


ATTTTAAGCA 


CGTTCTTATC 


TGTTAGCGCA 


22980 






I 1 LbL 1 Art. I 1 


oil AAb I CC 1 


CACATTCGAC 


AGGCTGACCA 


23040 


TCAACCGAA7 


AAGCAAAAAG 


TAAAATTTCA 


AAGTTGGGGC 


TATCAGCGTA 


TTTATAAACC 


23100 


CCAACT7TGG 


TTAAGTCTTC 


ATCCGAATAC 


GTTTCAATAT 


CAATATTTAA 


CGTTTTCAT7 


23160 


7GCTATCAGC 


CG777CAGAT 


GGTAACCGCA 


ATTTACTTTT 


ATCAACGGCA 


ATCATCGGAA 


23220 


GCTGTGCTTC 


TTCCTCTTTA 


AAAACATCAA 


CGTTGATCCC 


TAGCTCTTCA 


AGATAAGCCA 


23280 


C7CCTTGTTT 


AGCAGGCACT 


CTTTCTACAT 


TTGCAATAAT 


CGCATTGTAC 


GTTTCTCTTA 


23340 
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CCTCTTGGAT 
ACTTAAGTTT 
GATAAAAGCT 
CGTTTTGGCG 
TTTTCAGCCA 
CTTTTTCCAC 
TCATTTTTTC 
GTATCTCCGA 
TTTTCAAAAT 
TCAATAGGCA 
GCATTTTAGC 
TAGGTCACCA 
CGTTAAAATG 
AAAGTTAATA 
TACAAGCACA 
AAACATTGCG 
TAGtAgTTTT 
AGGCTGTTTT 
TCAACATGCA 
GCACAAAACT 
TAAATTCCTA 
CGAATTAGAA 
CTCTAAAATA 
TGAGTTAGTA 
CACTTTATTC 
ATAACCTTTA 
CCCCACCTTC 
ATCTGTCACC 
CGCATTTAAA 
TGAATGCTCC 
AATTTTTTTC 
TTTAACGCTG 
ATGAAGTCTG 



TTCTTGATCG 
TTCAAGAAAC 
CCATATTTCC 
AATTCTAAGT 
CTCTGATTCA 
TGAATTTTTT 
CTACCCCCTG 
ATTCTTTGAA 
ATATAGAATC 
AATAAATTGG 
CCCTTATTCA 
AAATCGGATT 
TTGTTTAGCC 
GATGCAATCG 
CCATCTTCAC 
TTTTCAAATT 
CAAGCGCTCA 
GATTGCTTCT 
AGAATATTTT 
TAATCTCACT 
CTTTCTTATA 
GCTTGTTCTG 
GATTCTTTAT 
ACTAACGACG 
TCACTGAATA 
CCGCTAAAAA 
TACTTAAATA 
TTAATATTTA 
TCATCAAAAG 
GCTTTCTCCA 
ATAGCCATTA 
GTCGCTTGTC 
CAGCTAATTC 



TAAACAGCTT 
CCTTTCTTAT 
AAACTGTTTT 
GAGCTGAAAT 
ATATGTGCAC 
AAAGTAATCA 
ATCTTCTACT 
TTCCCAACGG 
CAGTTTTCTC 
TCGTTTACCC 
ATTTTTAGCT 
CAGCATTGGC 
CAGCAGTAAC 
CATAAACACC 
GTTTTACGAC 
CTGGGCGTTC 
AATTTAACGC 
TTCATCGCTT 
TTCTCTTGCC 
TGATTTGTAA 
TTTTTTAGTT 
TAAAATACTG 
AGTTTTTTAG 
ATAACGCCGC 
CATAAAGCCC 
TCACAGGCTT 
CATCCAAAGT 
CAGGGCGATT 
GCTGTATGAC 
TTCCCTTTTT 
CTCCACCCCT 
ACTTkCAGGA 
CGCAAATTTT 



CTACTTTCGC 
AGGTTTCAAA 
CTTCTAACAA 
ATCTATGCAA 
GAATAGCCTt 
CATGGTTTCT 
GGTGTATATT 
TGCAATCTGA 
TCTTCAATTA 
ATTTCAATTT 
AAACATATCG 
ACGACCGCCT 
CCCTTTGTTT 
TGAATAGACT 
TTGTGGTTTC 
TTCGGTGTCC 
CTTTTAATTT 
TAAGGGTTTC 
CTTCTTCCAT 
TTACTTTCGT 
AAATAATTCG 
TTTAATGTCC 
GTTCATTGTC 
TTCAATCGTG 
AACATTGCGT 
TTCTTTACAT 
ACCTTCTTGA 
TCCTGTGCTG 
AGTAAATTCA 
TAGAGTTGAA 
TCAAAATCAT 
ACAAGAACAG 
TTCTTGCCGA 



AAGCCCTTTT 
ATCTCTAGCA 
ATCGACAAGT 
TTTCTCAAAT 
CGTCTTTTTC 
TAAACGTGCT 
CTTGTTTGAC 
CGGATTCTAA 
GCTCAAAAAC 
CTCCTTTCAA 
TCTTCTTCGT 
AAGAAGTCCC 
CCTGcAGTAC 
TCGTCTGGAT 
GTTTTACTTG 
ATTTCTTCAT 
ATCCCCTTTT 
CTTATCGTCT 
CGCATGAGGT 
TCCAGTTACT 
TCTTCTTCTA 
GAGATGATTT 
CACGTTTCAT 
TTAAAATATC 
GTATCTTCTC 
GATTTTTGAG 
CGTTCTAAGA 
TTTTCAACGG 
TCCTCAGaTA 
GCTATTGCCT 
TAAGCGCATT 
GCTTTCCTTC 
CAACTTTTTC 



ATTGAGTTAT 
TCCGTATCGC 
TCATTTTTTA 
TCAGGATCAA 
GCTCTTATTT 
TTAGTTGGTT 
ATAAAAATTA 
ACGGCTACCG 
ATCTTGTAAT 
AAGAAAAGGG 
CTTCCCATTC 
CTTTACATAA 
TGTAAGCATA 
CGTCTGTTTT 
ATACGTTGAT 
CGCCATCGCG 
GCGCCTTCAT 
TTTGGAATAA 
TCTAACACAT 
TTTGCCATTT 
AATCATCTAA 
GTTCTTTCAT 
CTACAAGCAA 
CTTTAAATAA 
TTGTGATGTA 
GCATTTTATC 
TAACCACCGT 
TAACAATAAC 
CGTTAATTCC 
CTTGTGTGTC 
TTCTACGCTG 
TGGTTTGATA 
TAATTGCCCA 



23400 

23460 

23520 

23580 

23640 

23700 

23760 

23820 

23880 

23940 

24000 

24060 

24120 

24180 

24240 

24300 

24360 

24420 

24480 

24540 

24600 

24660 

24720 

24780 

24840 

24900 

24960 

25020 

25080 

25140 

25200 

25260 

25320 
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ATGGCTTTCA 
ACCATCAGTA 
CAACCAGGGA 
TCCAGCCATT 
AAAAGCGGTG 
GGCCGTAGTT 
CCTGTACCTT 
AATTCTTCTT 
ATTCGCACGG 
AACATCAGTT 
ATAATTTCAA 
ACCCATTTTG 
ACACGTTCTT 
TACGGATGGT 
TTAAACCGct 
TAAGAGcTGC 
CACCGATGCG 
CTACCAACGT 
GTTTTTCTGC 
TAGCCTTTCT 
TTGCTGATAC 
GTTCGGTGTC 
TCTTTTTTGC 
CTGTATTTAA 
CTTTTAACTC 
CCTCCAGTTA 
TTGCGTGATA 
CAGGAAGTAG 
TCTTGTACGG 
CATTCCCATG 
TTTCCTGTAG 
AATATCTGCT 
ATCAACTTTC 
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GTTCCTGCGG 
GCCCTTCATT 
ACTCTTCGCC 
TTTTTATCTC 
CTTCTTTCAG 
GGGCACGAAC 
CCCAAGCCTG 
TTTCAATTTC 
TTTCAAATTC 
GAGGGTTTAA 
TCGTCTTATC 
AAAAATCTAC 
CTACTAAATC 
CTTGtTCAAA 
TCGCCAAATA 
CGCGATCTTT 
CACTACTCGC 
TTCTAAGTCA 
GTTGTAACGA 
CATCGCTTCT 
TTCTTCTGTT 
TGAGGTTGAG 
AGGTtCTTTC 
AGATTGCGTG 
TTGTGCTGAA 
TTTTAATTGG 
ATGCAACGGT 
CATTCTTTTT 
CGATAACATG 
CTTCAAATGC 
GCCCcTGCTG 
CTTTTTGCTT 
ACCGCCAATT 



TTTTAAAATG 
ATCTGAAATT 
TTCGTCCCTT 
TGGCGCACGT 
TTCGTGCTTA 
TTTTGAAAAC 
CACTGCTCGA 
AAAGGTCGAA 
GTAAATAATG 
ATACGCATCA 
TGATAAGACA 
TTTTTGTTCT 
ACAATAAGCG 
AAATTTCAAT 
CAACTCTGCT 
TACTTTTTCC 
CCCTAATAAG 
GTGATAAATT 
CTAAAGCAGG 
TGCACATCTG 
TCTGAATCAG 
GCCGTAGAAA 
TTCGCTGCTT 
ACTTCTACAA 
TCTGCTTCAA 
TTAATGTCTT 
GGGTCTACAT 
GCTAATTCTT 
AGTTGCTACA 
TTGGATTCCC 
ACCTTTCACA 
CGATGTTAAA 
CCCGATAAAA 



TCTTCATCTT 
TTTCGATTAC 
GCTTTCTGTA 
TCTAAAATTT 
TCAATCAGCT 
TTCACAACGT 
GGCGCCACGT 
ATATTGTCTA 
TCGTACTTAT 
ACAGGAACAC 
ACCACGTCCG 
AATTCGACTG 
GTGACGTATT 
CGTGAGTTTA 
AATTCGTGTG 
TCTAGTCGTG 
GCATGATTTC 
TACCGTAATG 
TCTTTATCAT 
CTTTTGTTGC 
TAGGCGCGAC 
CTTCTTCTTT 
TTTCTACTTC 
CGTTGTCTGT 
TATTTAAATA 
CATCTGTTAC 
TACGTTTTTC 
CTTGTGACAT 
CTTCCTATCT 
TGTTTTCTAA 
ATGATTGGTT 
AAGTCAGGAA 
TCCAGGTACT 



CAAATCCTTC 
TtCGCCCAGC 
AAGCGTAAGT 
CAGCAATTTC 
GGAAATTTCT 
CGTCTGTAAT 
AATTATCAGC 
AACGAGGTTG 
CCACCGCACC 
CTTTGCCGTA 
AAGTTCCAAA 
TGGCATTTTC 
CTTTCATGGA 
CCGCTCTAGC 
CAGTTGTTCC 
CCAAAGGTGG 
CTACTGGCAT 
CTCTTCTTTC 
AGGTCGCTTG 
CCCTAGGTGT 
TTTCTCCCCA 
CTTCTTCGCA 
TTCTGTAGCT 
TGCTAACTGC 
GATGTTTGCC 
ATTGAAAATC 
ATAAACTTTT 
GCCCGCTTTT 
GGTTTAGCTC 
GGCAATTAAC 
TTCCCCATTT 
GTACTATCTT 
TCTTCTTTTA 



GGCTTCTAAA 
AACTACTTTC 
TTCCACATGT 
CTCATTGGTT 
CTCCGCACGT 
CGTCCATTCT 
CCAGTATAAT 
AATAATCGTC 
AAGCGCATAT 
TTTCAAGTCG 
TCCTTCTGGG 
GTATTGATTG 
TTCATCACAA 
GGTTAATAGC 
CTCCTCCGCA 
GCATATTAGC 
GCTTATTCAC 
AAGTCTGAAA 
CCTTTATCCA 
AAATCGGCTG 
GTTTTCGATG 
GTTGCTTTTT 
TTTTTATCCG 
GATAAAATTT 
ATTTTTTATT 
GCTTTAAAAT 
ATCAGCTTCA 
CGTCTAATTT 
TAGGGTTAAA 
AAGCGTTGTT 
ATATGCTTTT 
TTTATCGCTC 
GTTCTGTTTC 



25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
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TGTGAAACGT TGTTTTAGCT TGTCTGATTT TGCTGTACAA ACAAAACGAT TCTTTTCAGT 27360 

GTTGTAAGAT AAATATTGCT GTTTTCCCGT AGCATTTATC GCGTATGGCA AAGGAATTTC 27420 

ATACATTGGT TCTTTTTTCA CTGTGTAACC ATCAAGAATC GCACGTGTAT GCAGTTCTCG 27480 

ATTTTCTTCT ACTTCTTTCG TCCACTCAAG GCCTTTCCCG GTAAGCAACT CTTCATCTGT 27540 

AATAGGATCT TCTAACCAAT GATCCCATCC CATCTTATTA ATTTGCCAAA TACACCAACT 27600 

CTTATAATCA ATATGTGCTT CTTTACTTTT CTGTACCCAT TTATCAAATT CTTTTGAGAC 27660 

TTCAATTGAT TTTTTAGGTG AAGTTGCTGT CCATTCAATA AAATCAGTAA TTCCTAATAT 27720 

ATGCTCTAGA AAAGGAGCAC TAATAATTGA CAATTCTCTA TTCGCTTTAT TTACTTCAAT 27780 

AACTGTTTTT TCTTTATGCC ATTTCCATAA ATCGGTTTTT GTAGGCTCAA GTCCTGTAGC 27840 

CCAGACATAC CCTTGTTCTT CCAAAAATGC CATTAAGTTA TCATATGCTT CTTGTGTTTT 27900 

TACGTGATAA TAGTTTCTCA TTTGCGTTTC TCCTTTTTCG TGCTATTATT TACTTGTATA 27960 

TTTTTTGTTT AGTAGCTTAC TTCGTTGGCG GACGAGGTAG GCTCTTTTTG TTTCAATACA 28020 

GCAATCATCA CCTAACACCC CACTTCGTTG AGCATTTTTT GATAAGCAAC TTCGTAACGT 28080 

TCTAACTCTG CTTGAAAATG CTTTAGCGTG CGGAGGTCCT GCATTGTCGG ATGCTTTGCA 28140 

CTTTCGTGAC GTACTGCATC TCTTAACGTT TCAATCTTTT CTCGTACTGC TTCACGTACT 28200 

AAAAAAGCTT CATTAGCTGT TAACATTTAT TTCACCTCAC GATCTTCAAG CGCTAAATCA 28260 

TAAATTAAAA GCCAAATGAT AAAAACCGCT ATATAAATGT TTTGGATTAA TGGTCCAATA 28320 

TTGCCACCTA CCAATAGACC TAGTCCGAAT ACAACCAACA GTGCCGCTAT ACGTCTTAAT 28380 

TGATATATTT TTTTCACTGT AATCATCCTT TCTTTCCAAT GTAAACATCT GCAAATCTTT 28440 

CATCGTACAC CTCATAAATT GAGATATTTT TTTCTGGATC TGTGCAGCAA TAACGTAAAT 28500 

TAATCtTTCT TTCAAATTCT TTAAATTGAG CGGCATCTAA AAGTTTGTAG TCAATTGATC 28560 

CATAAAATGT TTCATAAAAT TCCACGCTAG CCTTTTGCAA TATTTTGTGT AATTTTTTCA 28620 

TTGCTTTCAC CTCCCTAAAA CCATCACAGA ACCGTTAAAA TTTCGGATtG CTTTTGCTTC 28680 

TGGCAAACGT TCATCGCCGC CTACAATTTC ATTTGTTCGA TGGTTATACA CTCGTTTTTG 28740 

CCCTTTAAAC AACACAACTG AAATGTTGTC GTATGTGCTA CAAGTGTCTT GCACTTTTAA 28800 

ATCTTCGCCT TTGTAGCAAA TAATCATTTT TGTAAAACCT CCTACGGAAT TAATCTGTGC 28860 

ATGCGGTTCT TATCTTCTAA ACGACTTAAA AAATCTAATT CGTTTTGAAT TTTTCCGCTT 28920 

CTCTGTCTGT TAATAAATCA GCGCTTCTAA GAGCTGCCCT ATCTGATTGT TTTTGCTTGC 28980 

GATCTTTTTG AATTTGGCAT AATATCCACG CTTCTTGTTC TGTCGTATAA GCCACGTACT 29040 

ATCCGCCCTT TCTTTTCTTG TAACGGTTTT TATCCATCCA GCGAACAAAC TCATCAAACA 29100 

CATCTAACTC AATTAATACA ATCTTATGTG TTGGATGAAT TACACCAGAC GAAAATTGCG 29160 

GCGTGTCCTG CATTAAAGCC AGCCATTTCT CTAAACTAGA TTTAGTGATT GAGTACATTT 29220 

TTTGTACTAC GGCTTTATTT CCATATTTAA TTGACGCACG ATCTGGATTC TGTTCGACAG 29280 
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CTTCCAGATC 
GTTTAGTTCA 
TTTTTCTGTC 
TTCACGAATT 
TAATGAGCTT 
TTCTTTAAAT 
TAGTGCGGGC 
AATTACCACT 
AAAATCGTAT 
TAGTTTGkAA 
ACCaGTTAAC 
AGACCCTCTT 
TACATAAATG 
CAAAAATAAC 
GAAAAAATAA 
ATATTTCTAA 
TTTCAGAACT 
AGAAAAATGA 
CAAAGCAAGT 
AAATAAAAAA 
GTAGACGAGT 
TGTTTGGGAA 
TCTAAATCAA 
GGTATGATTA 
AAAAGAAAAA 
GAAAATAAGG 
CCAGAATTTC 
AATGAGCTGA 
CAAGCTATTT 
GCCGCCTATG 
TTAGGACGAA 
CTATAATGAA 
TTGCAGGCAT 



AACCTTAACC 
ACTAGCACAG 
CATCGATAAG 
GATGCATCTA 
GATGAAATTT 
AGATCTTTGA 
ATTTCGTGTG 
TGCGTAAATA 
TCCCACAAAG 
CTCGGGCAGG 
TCAGCTAAAC 
GTACAATTCA 
AGTATACAAT 
ATAAAGGAGC 
AAAAATGTCA 
CTATGAGCGA 
TTTTGATGTA 
ATCGATGGCT 
AAAAGAACTA 
AGAAAATTTA 
ACCTTAACTT 
TGAGTGTTAA 
TTAAAATAAG 
TTAAAGATGA 
ATTTCACGAT 
TATTCACCGA 
AAGCGAACAT 
AAAAAGGAAC 
TTATGAGATT 
ATACGGCCAA 
ACTTAGAAAG 
TTTATATTAA 
TGCTCTTATT 



GGTTTTGGCA 
CCACATAATC 
ACACATCTAA 
GATTTTCTTG 
TTTCTCTGCA 
CAATTAAATG 
CATAGTTCAT 
AGAATTACGT 
TTTCAAAATC 
AACGGCCATT 
GTTTTTGAGT 
TATGAACATC 
GTAATTAAGr 
TTTAAAATGA 
CAGCAAGAAC 
GGGGTAATTA 
TCTATAGACT 
AAACAAGTAA 
GAGGATGAGA 
AGTGAATAAT 
TTGCGGTACC 
TAATTATGAG 
GCCCTTTCCA 
TTACGAGACT 
ATCTCATGAA 
CACAAAAGAT 
TGGAGCATCT 
TCCTCCTTAT 
ACTTCAGCAA 
TAAGATAATG 
AAAAATTTTA 
ATCGAAAGAA 
TTTGCAGCTC 



TACCCCTTAC 
GGTAACAAAA 
CTGCAGTAGT 
TGCTTTCCAA 
TCGATTAATT 
CAGTCTTGAT 
TTTCTAAGCT 
TTTATAATAA 
TACATTAAGT 
CTCTATATTA 
TAATGACTTT 
TCCTCTCATT 
ATTACGCAAG 
CCTCATTTGC 
TTGCAACTTT 
CCAATGTATC 
ATTTGCTAGG 
TGATGCGCAT 
TGGAACGTTT 
TACAAGGTGG 
ATTAACGAAT 
CATAGATATA 
TTTGAAAAAA 
ACGCTAGCTT 
CTTATTCACG 
ACTTTATCCT 
TCTATACTAC 
TTTATCTCAA 
ATGCAGGCTA 
AACGGCAACT 
TACAGCAACC 
GGATGATAGA 
TTCTCTCTGC 



CCCCTCTCAT 
GAATAACGGT 
TGATGAAGTA 
TAACGATTTG 
CTGTTGTTCG 
TTAACAATGT 
CCTTTTGTTA 
AAAAAGAATT 
AATTTTGCTA 
CGAATAGTTG 
TTGATTCTCG 
TAATTTACGT 
AGGTTTATTT 
GCTGAGATTA 
TTTAGGTATG 
TAGTGATACA 
AAAATCAGAT 
GGATACAGAT 
CTTTTCATGG 
TTTTCAGGAT 
TTATTTCAGC 
TTTGGGATGA 
CAGCACGTAG 
ATAATTCTAA 
CCATGTATCA 
ATTCACTGGC 
TACCCGAACC 
ACCGTTACGG 
GTTTTGAAGC 
CTAAAAATTT 
CTTTTTATGA 
AATGAAAAAA 
CTGTAGCAAC 



GCTTTAGCTC 
CCTGATGTAG 
AAAAACTTTT 
TTCCATCGCA 
TTTTAAAAAC 
CTTCTGCCGT 
AAAAAATAGT 
AAGAATTACG 
ACATAATTGA 
TTTCAGTAAC 
CTGCTCGCAA 
AATTGCCAAT 
GCTATTTTTA 
AAGGAATTAA 
AAAAGAGAAA 
TTAGAAAAGA 
ACATCTAATC 
GGACTTTCTT 
AGAATAGAAG 
GACAGTAGAA 
ACACATGTTA 
AATATTAACA 
ATCTATATCA 
CATGGGCGAA 
TTTAGATAGC 
AGATATCTTA 
TGTTCTTATT 
TATATCTGAA 
ATCTTACGTT 
GGCAATTGAA 
AGCAATTACG 
ATGATTAAAT 
GCAAAAAATA 



29340 

29400 

29460 

29520 

29580 

29640 

29700 

29760 

29820 

29880 

29940 

30000 

30060 

30120 

30180 

30240 

30300 

30360 

30420 

30480 

30540 

30600 

30660 

30720 

30780 

30840 

30900 

30960 

31020 

31080 

31140 

31200 

31260
ATACACAAAA GAAAGCCGAA ACTGCTGCCC AGTCAAGCAC
ACGAAAACGA GCCTAATACA GAAAACATAA CCCAAGCAGT 
TTAACTCTGA CGAGAAATTA GTAAAAATAG ATGTTAAAAA 
CAGATAACCC TCACGCTGTC ATTACGGTTA AGGTAATTAA 
TGGAAGAAAT GCAGACTGCG ATAGATTCCA ACTCAGGTAC 
TATACGGAAT TCAATTAAAT GTTGAAGAAG TAGCCAAAAC 

TTATTTCTTT CATCACACCT TACACGAATG GGAACGACAG 

AAAATGAAAA TATTATTCCG TTAGTAAAAT AAAAAAGTGC 

TGGATGCACC TTATCAGAAA ATAAACCAAA GGCTTATTTA 

AAGCCTCATT ACGAGGTTTT TATACACCTC CAAAAGAACG 

GTTGAATATG TGGGTAGAAA AGAAAAAGGA TAAAAAAGGA 

CGAAAGATAC ACCGACCCTC GGACCGGTAA AACACAAAAA 

AAATACAAAA CAAGCTGAGA aTGCAGCACG TCGTGAGTTA 

ACTGACCGAC TACGAAGAAG AGAAACGTCT AGAAAATTTA 

ACCAAATGAT AAAACGTTAG AAGATATGAT AAAAGAATGG 

AAAATGAAAA GAACGTAAAA AAGGTTCCAC GCTCTCGGGA 

ACTCTTAAAT GAGTTCTTAG TAATAGAAAA AGATAAACGA 

TGATATACAA GCATTTTATG ACCGTATGAT TTTTGACTTT 

AAAAAGATTC CGTGCGATTA TCCGTGGTGC ATTTACTCTC 

TAAAAATCTT GATGCCATTA ATCTTTCTAA AATAGTAACG 
TCTTATTAAT GAGCAAGTAC CTAGATACTT AGAAAAAGCC 

AGTTTTAAAT GGATACAACC CACTTTATAC GCAAATACTC 
GATGCGTTTT GGTGAATTGA TTGCTTTAGA ATCAAAAAAC 
AGATATTCAT GGAACTTACG ATCATGCAAC ACGTAGTCAC 
GCCAAAAACA AGAAAATCAT TTCGTACTAT TGAGTTAGGT 
AAAAGACC 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7964 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
AGTGAGCGGG TCACTCTAGA GAAAAGGAGT AGAAAAAAAT GGCGGATTTA AAACGACTGT 



TATTGAAGCT 
TAAACAGTTA 
TAATGTTAAA 
TGATGAAGCA 
AGAGGCACAA 
ATTAGAAAAT 
AACCATAGCA 
ATCCCCCGCC 
ATGCTATTAT 
TAAGTTCCCA 
AATATCACAT 
GTCACCACGA 
GAAATAAAAA 
ATCAAAAACG 
TTTGATTATA 
TATAGCTATG 
ATCCAGGATT 
GAATATCAAA 
GCCTATAAGA 
CCAGCAAAAA 
GAGGCTGAAC 
GAACTACAAT 
TTTCACGACG 
ACCCGAGGAC 
GATGCTGCAG 



TCAGACAGTA 
GAAGAAAAAT 
GATGACACAT 
AAAAAAAATA 
AAGACTGCCA 
GATAACGATG 
AAATCAACTA 
GGCAAGCTAG 
ATCAAAAAAT 
TAAAAGGAGA 
ACCAATATCG 
CTTTAGCAAA 
TCCAAAAGAT 
AGGGCGTAAT 
TTCAGGATCC 
GCATTGATTC 
TAACTTTTGA 
AGTCCTATAT 
AAAGATACAT 
CGGTGGaTGA 
AAATATTTCA 
TTCACACTGG 
GATTTTTGTT 
AAAAAGTCCC 
TACAGATTAT 



31320 

31380 

31440 

31500 

31560 

31620 

31680 

31740 

31800 

31860 

31920 

31980 

32040 

32100 

32160 

32220 

32280 

32340 

32400 

32460 

32520 

32580 

32640 

32700 

32760 

32768 



60
TCATTGGGAA


ACCATTGAAA


TCAGCCGAAA ATGATGAACA TAAATTAAGC


AGATTTGCAG 


120 


CGCTTGCGTT 


GTTATCTTCT 


GATGCACTTT 


CATCCATTGC GTATGGTACA


GAACAAATAG


180 


TGGTGGTTCT 


GGTTACTTTG 


TCGGCGGCTG 


CGATTTGGTA TTCATTGCCA


ATTGCTGCAT 


240 


TTGTTATTAT 


TTTACTTATT 


TCTTTAACAT 


TGTCGTACCG ACAAATTATT 


CATGCGTATC 


300 


CGCATGGCGG 


CGGCGCATAC 


GTTGTTAGTA 


GTGAAAACCT CGGCAGAAAT 


GCAGGTTTAT 


360 


TGGCTGGTGG 


TTCGTTGTTA 


GTCGATTATA 


TGCTAACAGT AGCCGTTTCA 


GTTTCAGCGG 


420 


GTGCAGAGGC 


GATTATATCG 


GCGATTCCAG 


CGTTATAtGG TCATCAAGTA 


GCAATTTCTA 


480 


TTGGGATTGT 


GATATTGATT 


ACCTTAATGA 


ACTTGAGGGG CCTGAGAGAA 


TCAGCTTCGT 


540 


TCTTAATGTT 


GCCAGTGTAT 


AGTTTTATCG 


CAATTATTAC CTTACTGATT 


GTTGTTGGGC 


600 


TATTTAAAAT 


TGTGACTGGC 


GCACAGCCAT 


TGAATGCGAC AgcTTTACCA 


GGAGCCGTGG 


660 


TTCCAGGAAT 


TTCTATTGCC 


TTGGTTTTAC 


GGGCTTTTTC ATCTGGTTCT 


TCTTCCTTAA 


720 


CAGGGGTCGA 


AGCAATCAGT 


AATGCAGTGC 


CATTCTTTAA GAAACCACGA 


GCAAAAAATG 


780 


CTGCTGGGAC 


GTTAGCGTTA 


ATGGCAACGA 


TTTTAGGGTT CTTCTTCGTG 


GGGATTACGT 


840 


TTATTAACTA 


TTGGTATGGG 


ATTGTGCCAA 


AAGAAGAAGT GACAGTTCTT 


TCACAAATTG 


900 


GGAAAGCGGT 


CTTTGGTCAA 


AATATTTTAT 


ATTATTTATT ACAGTTTGCG 


ACAGCCTTAA 


960 


TTTTAGCGGT 


TGCCGCTAAC 


ACAGGATTCT 


CAGCCTTTCC AGTTTTGGCT 


TATAACTTAG 


1020 


CCAAAGATAA 


GTTTATGCCG 


CATATGTATA 


TGGATCGTGG CGACCGTTTG 


GGCTATTCAA 


1080 


ATGGTATTTT 


AACTTTAGCT 


GCTGGTTCCA 


TTGTTTTATT ATTGATTTTC 


CAAGGTTCCA 


1140 


CAGAACGTTT 


AATCCCATTG 


TATTCCATTG 


GTGTGTTTAT TCCGTTTGCT 


TTGTCTCAAT 


1200 


cGGGGATGGT 


TGTAAAaTGG 


CGAAAAGAAA 


CTAAAAATTG GTTGCCAAAA 


TCAATTGCTA 


1260 


ATATTGTCGG 


CGCCTTTATC 


TCTTTCGCGA 


TTATCGCCAT TTTGTTTATT 


TATCGATTAG 


1320 


GGGACATCTG 


GCCGTTCTTT 


ATTATTATGC


CAGTGTTGAT TTATGCATTT 


TATCGGGTCA 


1380 


ATACTCACTA 


TAAAAATGTG 


GCGGAACAGC 


TACGTTTAGA AGATGGGGCA 


CAGTTACATG 


1440 


AGTTTGACGG 


GAATACAGTT 


ATTGTTTTAG 


TAGGGAACGT AACCAAGGCA 


AATGTCGGCG 


1500 


CCTTGAATTA 


TGCGCGTTCC 


ATTGGCGATT 


ATGTCGTTGC CATGCATGTC 


TCAATGGATG 


1560 


AAAATGTTGA 


GAAGGAAAAA 


GAAATTCAGG 


AAGAGTTTAA AAAACACTTC 


CCGGACGTTC 


1620 


GTCTATCAAT 


TGTCCATTCT 


TCGTATCGTT 


CACTGCAAAA TCCTATCTTA 


CGTTATGTTG 


1680 


ATTTGGTTAG 


TAAAAATGCC 


ACGAAGCATA 


ACTACAGTAC AACGGTTTTA 


GTGCCACAAT 


1740 


TTGTACCAAA 


TAAACGTTGG 


CAAAATATTT 


TGCATAACCA AACAAGTTTA 


CGCTTGCGGA 


1800 


TTCGTCTAGC 


GTGGCGGGAA 


AATATCATCG 


TGGCTACCTA TAGTTATCAC 


TTGAAAAAAT 


1860 


AAATAAAAAA 


ATCTGCCATC


TTTTGATGGC 


AGATTTTTTT ATTGACGTAA 


TTGTTTTTGA 


1920 


TAAATTCTAG 


CGCAAGACTG 


CTCTTCGGTG 


GTTACCATTG TCAAAAATTC 


CCAGTCATAT 


1980 


CTTTCATAAA 


AAGTAGTATG 


GTCTGTGAGC


AAATAAAGCT TTTTAAATCC


AGAATTAGCT


2040
AGAAAATCAC
TCGACATAGA 
ATTACGCCCA 
ACGTGGTTTT 
TGCCACTTTT 
TCGATAATAG 
AAGAAACAAT 
CTAAACGCCG 
GTAAGGCACC 
TATCCCCTAA 
CTAACGCATT 
AAAGAAGTTG 
CTTGAATCAT 
TTACAACCAC 
AAGCTGGTAC 
TACGTGAAAT 
TCGAAATTaA 
AGTAGATAAA 
ATTTtCTTTT 
ATAGTTCAAT 
TTTAATTTCC 
CGGGAAGAAG 
TGTTGAGTAA 
AATTTGCATT 
ACCGTCGCCT 
TTTCCCTAAA 
GAATACATAA 
AAATTCTGAT 
CGTAGTAGTC 
CATTGCTTGG 
TGTGTAATGC 
CCCACCTGTA 
ACCTGCCAAT 



CAACTGTTGT 
ATGCACAAAG 
CGCCTGCAAC 
CGCTAGTTAG 
GGCTAAACCA 
TATAGTTAGA 
GTTGGATCTG 
TTCAATAATC 
GACTAAAATG 
ACCACCAGCA 
TCGAATACCT 
TACTTTTGTC 
GCCAGTGTAC 
CGTGTTGACG 
TGTTTGGATA 
CATAATCCCA 
AAAGTGACGT 
TTGTTGGAAT 
AAGAAaCGTT 
TCTTGCATTT 
GGCGTTTCTT 
CGCTTGTCAT 
CCTAAGATGA 
GGTAAAATAG 
TCACGTGTGA 
TCACTGACTG 
GTATTTTCAA 
TGAACTAGAT 
ACGTCAGTAC 
TGATTGATGG 
TCCACCATAT
ATGGCAACAG 
AAAAGTAAAC 



TAAAAGAATT 
GTTTGGCGTA 
GATGTCCTGC 
CATTTCCTGC 
TTGAGCAGCT 
CATTATTGAT 
CTCTGTATTT 
CCTAGAACCC 
ATAGAAGCAC 
CCAACGAAAG 
GCCATGATTA 
ATACCCATTC 
GTATTCTTGA 
CCTAAGCCCA 
ATGTTCGCTA 
ATAGGaATCC 
AAAAATTGTT 
AAATcATATT 
CGGCTACGAC 
TTTTGGTAGA 
TTAATAATTT 
CTTTTAACAT 
CGTCCATTTT 
ATTTAAATTG 
TCCACGCAGT 
TTTTTAAATT 
AACCATATGA 
TGAACGCTTT 
CAGTATAACG 
TAGTAGTGGC 
CCGCTACAAT 
TGTTATCATC 
TGCTTAAGAA 



CCTGCTAAGC 
AGGTCTGCAC
TGTTTATTCA 
ATACTTGCTT 
GACGGAATTA 
CTCCTAGATA 
TTTATTAATT 
AATCTGTGAT 
CATCTGTGGC 
CACCAATCGC 
CTGAAACAGA 
CTTTACCAAC 
TGATTGGTAG 
TTCCCAACAT 
AACGGATAAT 
CGaCAATTGC 
GaAAGACGTA 
ATTCAtCTGk 
AGCAGGTTCA 
AATTTTACCA 
GTCACTAACC 
GACTAAATCG 
TCCAGCGTTG 
GAAGTTGTAT 
ATCGACACCA 
GTATTTCTCA 
AGGGAACCAA 
TTTAGGGTCT 
AGCTGCAGAA 
TAAGTTGTTA 
GCTCCCTAAA 
AGAAGTTGAA 
GAGTAAGCTT
CTT*GGTGACG AAATGTTGGT 2100

GAT-TGTCGAAr -GTCATTATCA 2160 

AGCAGACCCA CCAATGTGGT 2220 

GATACGCTTC AGCGGGGATT 2280 

AGTGTGGTTG TTGGTTTAAA 2340 

AAAAAACAGA TCCCATTGAA 2400 

GGAATGTTTG CTAGCTGGGT 2460

AATCGCCATA AAGGCCGTTG 2520 

ATTGGTTCCA CGAATAATAA 2580 

AGTAATCCCG ATTGCGACGA 2640

TAAAGGTAAC TCGACCATAT 2700 

ATCTAAAATA TTGCGATCCA 2760

GAGAGAATAA AGGAAGACCG 2820 

TAAAATCGAA AGCATTGCCA 2880 

CCAGTTGGCT AAGCTACGTT 2940 

AGCAAAGATT ACGCCATAAA 3000 

GCCGCCATTT tGTTGGaAAT 3060 

TACTCCCCCT CAAAATAGTT 3120 

ATTAAATCAT TATCTGCTTG 3180 

TCTAAGCGAT TTAAGACATC 3240

ACAGGAGCGG CATCATAAGG 3300 

TAACTACCAA TCCGGCCATC 3360 

ACTGCATCGT ACACCAAGCC 3420 

TCCTTTTTAA AGCCTTCATA 3480 

GCAACTAATT TGTCTGCGAC 3540 

GCCGTTTCTT TTGTAACTAA 3600 

GTTTGTTGAT AACGTTTTTC 3660 

TTTTCCGCTG GTAAGCCAAG 3720 

ATTGAGGCAT CTCCGTTTAA 3780 

ATCACATTTG TTTTTTTATC 3840

ATCTGCGCTT CTGAAGTAAT 3900 

GCGAGACCAG GTAAAGAACA 3960 

AACTTGAGTT TCTTTTTCAT 4020
CA1 1A1 Ibbb


LvloLLl inn 


APPTTTTGGC 

n\Jw 1111 UVJ*-' 


GTTAAACGAT 


TXTCTAATAA 


TCCAAGAAGA 


4080 


AC A I L I bb i A 


TAAAPPPPTAA 
AAAbbbb 1AA 


a atapttapp 

nri 1 nU J. 1 n\_< 


GGAATAGTCC 


CGCCAAAGAT 


TAATTCTGGC 


4140


TGATAGTTAT 


fnrpAB tpptapt 
1 1AA1 bbAbl 


PZ1 A A ATPAT A 


TPAPCAAGAC 


CACCTGCACC 


AATATAAGAA


4200 


GCCAGTGTAG 


CCC AAbb AA 1 


p&patapapa 

bAbA 1 non^n 


PPAGPTAAAP 


GAATGCCGGC 


CATAATTGTT 


4260 


GGCATGGCAA 


TGGGTAAi lb 


7\7\P7APTTATA7A7A 

AAb AC I AAAA 


AT anzvfTf^TA 

n 1 nonV— luin 


AATTGGTCAT 


GCCCATACCT 


4320 


TTAGCAACAT 


CTCGGTAATT 


TTCA 1 CbAbA 


TTTTTPATTP 
11111 ^-ni 1 ^ 


CGATATACGT 


GTTTCTTAAA 

VJ X X X N-» X X 1WI 


4380 


ATAGGTAGAA 


GGGAATAGAT 


bAAl Abbbbb 


a t a a t a r:rn p, 

Al AA1 rto^uo 


O 1 O 1 1 1 X 


GACACCAAAA 


4440 


ATTGGAATCA 


TCAAAGCTAA 


T AAAb C C AAC 


rn apr A APPP 




APTTGTTAAA 


4500 


CCAATGACAA 


TATTGGCTAC 


TTTbbbbAAA 


PPTPTTAAPP 


A A APPtPPAAP 


GGGCACAGCG 


4560 


ATAGCGACCC 


fry y\ 7* rn TV TA TV "TV /"* 

CTAATAAAAG 


TbCAAAbAA 1 


P7A7ABTATAAQ 
b AAA 1 A 1 nnn 


TATPTTPAAT 


APTTTTTGAA 


4620 


ATCATTTCTG 


ATCCATATTC 


TTG AAAb AAb 


b 1 Ab 1 b A i lu 


vTaPPPPTPP 


TPTTTATCAG 

x x x x *i x ^.riAj 


4680 


TTTCGGACTC 


GCTTTGTTTT 


GCT IC 1 ACAb 


PTTP 7APT A AT 
bl lbAblAnl 


PP.TTP,TTTPA 
v_»o llulll 


TPGPPCCAGA 


4740 


TAACATCATA 


AACAATGTCA 


AC AAG b G AAb 


CACb 1 b 1 1 AA 


P ATPPPPAPP 


APTPPTTTTT 

rt\> 1 »wVJ X X X X X 


4800 


GTTCATCAAC 


GACTGGCACA 


TATTTTAAGC 


CGCbl I I 1AA 


PATTPTTTPT 
bAl lbl 1 1 O 1 


AAAPPATPAP 
rtnnVJ^n 1 V-n^ 


4860 


GTAAAAGAGC 


GGTTTTTTGA 


ft n TV TV TV /"* TV TV TV 

ACAAAGAAGA 


tv rn A"* rn rn rp T\ 

CATC 1 1 I AI 1 


P & TAP AT ATP A 
b AAb A 1 A 1 A 


PPPAPAPTAG 


4920 


AAGCTTTGCC 


ACGTTGCTGA 


TCTAATGTTT 


CAACA1 CbAl 


a Zi APPPTTT A 
AAAbbU 1 1 in 


APPAPATGGG 


4980 


AATTATCAAC 


AAGTAATAAG 


GTATCAACCC 


b 1 1 1 i 1 bAbb 


PA TP A ATTT A 
Cn 1 V^nn. 1 1 1 n 


ATPPPTTPTT 
n 1 L.vjvj x i vi x 


5040 


GTAAAGATTT 


TTCCGGCGTG 


ATAGTAATAb 


r'Tr'TATATTTTAIi 

b 1 b AA 111 AA 


P ATP APT TP A 


TPAACAGTGG 


5100 


TGAAATCAGG 


TTTGGCTTGT 


AATAbbCbAx 


PTTPTPPPTiT 
b 1 1 b 1 bboA 1 


PAPTTPTTPT 
urtu 1 i \^ 1 1 V- 1 


ACAAATTCAT 


5160 


TGGCTGGATG 


ACGTAAGATA 


TTAlCibblb 


TPTP7A 7A TATTP. 
lbl b AAA 1 lu 


A ATT APTTTP 
nn 1 1 n>— 1 1 1 0 


PPTTCGCTCA 


5220 


TAATGGCAAT 


TTTATTTGCT 


AA1 I ICAACb 


PTTPZ1TPP AT 
b i 1 bA 1 b^- A 1 


ATPATPPP,TA 


ACAAAGACAA 


5280 


TAGTTTTTCC 


m tv ta tv rn rn rn 

TAAACGTTCT 


rn /"»/■"« TV T\ T\ Tr , T r P 

IbbAAAlbl 1 


TP APT A A ATP 
1 b Ab 1 Arirt 1 ^ 


TTPT A ATP A A 
1 1 u 1 nn 1 Unn 


TPACGGGTGA 


5340 


TAGGATCTAA 


TA ^ TV /*"* TV TV T\ TV 

AG C AC C AAAA 


bb I 1 CA1 CCA 


TT AP A AT A AT 
1 1 rturlri 1 /xrA 1 


ATPTTPATTP 


GCAGCTAAAG 


5400 


CCCGCACAAC 


GCCGATTCTT


Ibl lbl lblC 


PPPPAP A A AP. 


TTPATTPPPA 
1 l\>ni 1 00 on 


TAGCGGTCCA 


5460 


ACATTTCTCG 


TGGTAAC I CA 


nr'Pn 7\ 7ATP TA 7A 

AC C AAA 1 bAA 


TPTATTTTTTP 
1 ^.M 1 1 1 1 1 JIL 


PPPTATPTTA 
oo^ini^x xn 


TTGCGTTCTT 


5520 


CCACAGGTAC 


TTTTAATAAT


TTAbbCAC 1 A 


TA a ZiPP. AT ATT 
AAAboAlrtl 1 


TTPAPPAATG 


GTCATGTGAG 


5580 


GCATCAAbCC 


bAl ATI I IbA 


t\ rn/~> zv/^TaTTAPP 
A 1 b Ab A 1 Abb 


PA ATTTP,P,PP, 
bAAl 1 1 uuLu 


APPTAGPTCT 


ACAGGATTGA 


5640 


TTTTTTGAAT 


ATCT Kbbbba 


TP a TATTA A A A 
IbaAl 1AAAA 


TTTTTPPTTT 
11111 V^\_. Ill 


TGAAGGGTCG 


GTCATGCGGT 


5700 


TAAGCATCCG 


CATTGATGTT 


GTTTTTCCAC 


TACCACTTGT 


TCCGATGAAG 


CAGATAAATT 


5760


CACCTTTATC 


GAAAGACAAA 


TTAATGTCAT 


CAACGGCGAC 


TTTTCCGCCT 


TTATAAAATT 


5820 


TTGATACATG 


CTGAAATTCT


ATCAACTTAC 


TCACCTCAAC 


TAAATATTTT 


GAAAATGTAT 


5880 


CTTATTACTA 


GTATGCTATA 


AAAATGCGAT 


TTCGACAAAT 


GTAATCTATT 


TTTAATGGGA 


5940 


AAAAGAAGGC 


GGTAAAATTT 


TTTTAATTCG 


TTCTAAAAAA 


AAGAAAAGAT


AACTATTGTT


6000
TAGTAAAAGT


TATCTGAACG 


AAACAAGATG 


TTAAAAAGAG 


TATTCTGAGA 


AGGAAGAAAA 




GCCGCTATAA 


CAAAGAAAAG 


TGTCCTTTGT 


AGCAAGTTGA 


CGGGAGCTGA 


GTTTTTCATT 


6120


AGAATGGAAA 


GAGAACAAGG 


GAATAAAATA 


AAAAAAGGCG 


CTTCATGATT 


TAGCGGCAAC 


6180


TAAATCACGA 


AGCCTTGAAT 


AGTCACCTGT 


GCAAGTGGAC 


CATCAAGTTG 


CCCGTTCAAT 


6240 


CAATGGAAAC 


CATTGATCAC 


TTTTATTTTA 


CTGAATTTTT 


GAAAAAATTA 


CCAGTTTTTT 


6300 


TAAGAAGTAA 


GAAGTGGAAG 


AACGAAGGAT 


GCTTTTTATG 


GAATGTGCAA 


TGACGTTTAG 


6360 


AAAAATCTGA 


AGGTCATTTT 


TTATTCTCTT 


GTTCAAACAT 


TCAAACATAG 


GGGAAGAGAG 


6420 


ACGGACGTAA 


TTTTGTCACC GATTATTACG 


TCCGTCTTCC 


GTTGTTTTCT 


AAAGCGTTGG 


6480


TTACATGTCT 


TTGTAGAGGT 


TGGCTTCAAT 


TGCAGTCCGT 


TTTGGTTCTA 


AAAAAGGCGG 


6540


TAAAGCCAAC 


GTTTCCCCTA 


AATGTTCCAb 


AGGCTCATCC 


AATGTAAAAC 


CAGGAGTCAA 


6600


AGTAGCAAAT 


TCAACCCGAT 


TCCCGCCTGG 


TTCACGAATG 


TATAAACTTT 


TAAAATATCC 


6660


ACGACTTACA 


TACATTTCAA 


TTTCCCAGCC 


TTGTGCTAAA 


GCGCGCGTGT 


GTAGTTCTTG 


6720


TAATGCTTCA 


TCATCAGCCA 


CAGAAAAAGC 


AATATGATCC 


ATACTGCCAC 


GCCCCATTCG 


6780


GCTTGTTTTA 


TCCGACGTTG 


TTTGATGAAC 


AGAAAAGCTT 


TCCGTCTCAC 


TTAAAATAAT 


6840 


CGTATTGCCA 


ACTGCTTGCC 


AAC b AAb AAA 


TGTTTCAAAA 


AAGGCCAAGG 


TTTTTTCAGA 


6900 


ATCTGGCACA 


TGAAATTCAG 


TAGAGGCCAA 


ACCGATAATT 


TGAACCGCTG 


CTGGAATGTC 


6960 


GTTAGAAACA 


Ti /"■/■"■TV m m rp rp 

ACGAGTGTTT 


CI r I 1 AAIbb 


ATTTTCAGGA 


ACTTCTGTCA 


GGATAATATG 


7020 


CGTATCATCT 


bGA I CAL i. bA 


AA 1 b 1 Abbb 1 


CTTTTCTTTT 


TTGTTAAATG 


GAATTTTGTT 


7080


TTCTGTTAAA 


Lb 1 J. b 1 i ttt 




GCTGTTTTGG 


GGAATTTTTA 


AGCCAATCGT 


7140


GGCTAAATAA 


rp rri tt% rp ^i rri rp rp 
1 1 I i 1 i 1 1 


^ri 1 rtO^vjVj JL 1 


GCCTAATAAG 


GGCACGATAA 


AAAAAGTTAC


7200


TAAGCTTCCC 


GGCGTGCCTT 


GATAATCGCC 


ATAAAAATAA 


TGAGGCATGC 


GATTGTTGTC 




TTGATTTACG 


GTATTTTTGA 


CAAAACGTAA 


GCCTAACAAT 


TTTGTATAGA 


ATGATTGGTT 


7320


TTCTTGACCA 


TGACGCGCTA 


ATAGTGAAAC 


ATGATGAAAT 


TGAGCATGCA 


TTTTGAGTTC 


7380


CTCCTCTAAA 


TAACTATGTA 


TACTTACTTA 


AAGTAAGTAT 


AGTGCTTCTG 


ACATTTTTTG 


7440


ACAACTATTT 


AGTTTGATTC 


AGAAGGCTGA 


CATAGCCAGA 


AACmAGAAAA 


AGTGTTAGAA




TAACCAAGAT 


wAAAACTAGT 


AGAAAGAGTG 


ACGGAAATGC 


TAAGTACaGA 


AGATTTTbAl 


7560


TTTGATTTAC 


CAGAAGAATT 


AATTGCTCAA 


ACCCCACTAA 


AAGATmGCGC 


TAGTTCACGA


7620


TTATTAGTTG 


TAAATAAAGA 


AaCAGGGGAC 


ATGGAGGATA 


TvTv^»7vrnrn4-/^/^7V 

AACATTtCCA 


TbAi AI lb 1 1 


7680


GATGAATTAC 


AGCCTGGGGA 


TGCATTAGTC 


ATGAATAATA 


CACGGGTACT 


GCCGGCCCGT 


7740 


TTATATGGAG 


AAAAACCTGA 


AACAGGCGGT 


CATTTGGAAG 


TTTTACTTTT 


AACAAATACA 


7800 


GAAGGCGATA


CATGGGAAAC


CTTGATTAAA 


CCCGCGAAAC 


GTGCAAAAGT 


CGGCACGGAA 


7860 


ATTCAATTTG 


GCGATGGTCG 


CCTAAAAGCG 


GTGGTTAAAG 


AAGAGTTGGA 


ACATGGTGGT 


7920 


CGAATCATTG 


AATTTAAATA 


TGATGGTATT 


TTCTTAGAAA 


TTTT 




7964
(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 



AGAAGTGAAA 


AGAGCTTTTG 


ACTTACGCTn 


nTATTATGAG 


GAACGAACCT 


TTCAGGCTGG 


60 


GAAAATAAGG 


AGGTATCAAA 


AGAATGAATT 


TGTTCAAaAA 


GArACAATCA 


AAGArACAGA 


120 


CGTTATCTAG 


TAACGTTGAA 


AAAGTAGATA 


AAAAAGTAAA 


AAAACGTGTA 


CAGCAACAAG 


180 


CAAATCCTAC 


CGTACAGGAG 


AGTATTTTTT 


ATACGAGTCA 


GTTTGAAGAA 


GnGCTCATGC 


240 


ACATTGTAGA 


GGATGAGTTT 


TCTAAATGCT 


ATAAACTtGG 


AGAaGTGGAC 


TACGAAATTG


300 


CTGTAGAAGA 


AGaTCmATTA 


AACaCGGTAA 


TGGGGTATGC 


AGAAGGGTTA 


AATACACTAG 


360 


ACAAAAAATC 


TCGGTATCaG 


CTGCTGGTTG 


TTAATAAAAG 


AGAAGAAGAG 


TCGTTGATGG 


420 


AAGACGTTTT 


ATTGGCGTAT 


CaGGGAGATC 


ACTTAGATAA 


cTACCGTCAA 


GAAATCAATG 


480 


AAATTATAAC 


GAAGCAGTAT


GAACAAGACG 


AACGAAACTT 


TAAGATTGAA 


AAATATGCTA 


540 


TTTTCACCAC 


AAAAAGTTCG 


TCCATGAAAC 


AAGCTAACAA 


AGAGTTAGAA 


GGAATCATGC 


600 


AGAACTTTGC 


CAATCGTTTT 


GAAGCCAACC 


AAATTAAATT 


AGAGGTAACC 


CCAATGACTG 


660 


GCTTAGAGCG 


ATTGTCTTTG 


TTCAGCTATT 


TTTTGCGCCA 


CGGACAATAT 


TTTGGCACAA 


720 


GCTATCAAGA 


TATTGCCGTA 


TCAGGGCTAA 


CGTCTAAAGC 


GTTTATTGTT 


CCAAGTAAAA 


780 


TCAAGTTTCC 


AAACAATAAA 


GCTTATTTTC 


GGTTAGGAGA 


AAACTACGCA 


GCTGTTTTAA 


840 


CCATTCAACA 


ATACCCAAAA 


TACCTAGAAG 


ATAAGTTAAT 


CAAAGAATTA 


TGCGCTACAG 


900 


GAAGAGAGTT 


AGCGATTAGC 


ATTCATGCCA 


GACCTTATGA 


CATGATGGAA 


GCCAGAAAGG 


960 


CAATACAGGG 


AACCAAGACG 


TTAAATGATG 


TTGCCATTCA 


AAAGCAACAG 


AAGCAAAACT 


1020 


TCAGAAGTGG 


TGTTTCAGAG 


GATGCGATTT 


CAGGAGAAGC 


AAAGGAAATC 


CAAGAAACAA 


1080 


CGAACGCGTT 


AATGGATGAA 


ATAAAAAATA 


ATGGCCAAAA 


GTTATTTAAT 


GGCCTATTTT 


1140 


CAGTTTTCCT 


AATTGCAGAA 


ACAGAAAAGG 


AACTTTTAGA 


AGCTGTTGAA 


TCGGTGAAAA 


1200


ACGTAGGAGC 


AACCTGGCAA 


GTTGTGTTTG 


ATACGGTAGA 


CAATTACAAA 


GAAGAAGCAC 


1260 


TCAATACTAT 


CTTACCGATT 


GGAAAACCTT 


ACTTAGACGT 


TGAAATGAGT 


TATATGCGtG 


1320 


ATCTAACmAC 


mACmAATATt 


GCTACACAaG 


TACCTTTtAC 


CAATGTGGrA 


TTACAAgCCC 


1380 


AcAGGTCAGT 


TTtATGGCCG 


GAaTCmGTTG 


ACGAAGAaCA 


TGATtACTAT 


tGrCCGAAAA 


1440 


AAaGwTTAAT 


TwCACCAgTG 


GGCTTATTTc 


GGAcGcyGGT 


tCTGGTAAAG 


GAtGGCGACC 


1500 


AAATGGGAAa 


TTCTTtCAaC 


GCGTTtAAAA 


TtTCCGAAGa 


TCGCTTTTTA 


ATTGTTGATC 


1560 


CAGAAAGTGA 


ATACTTACCG 


ATTGGGAAAG 


AATTGGATGC 


AGAAATACTC 


GATATCTCaA 


1620 


CTGGTACGAA 


AAACCACTTG 


AATATTTTAG 


GAATGGTAGA 


CAAGCAGTTA 


TTGGATGAAG 


1680
AGGACCAGGA ACTTGATTTA GTTAAAGAAA AATCTAATTT GTTATCCACT ATGTTTGAAT 1740

CGCTACTAAA AAGCTATACG GACGTGGATG CGGGACTTGT TGATCGTGTG ACGTTGTTGA 1800 

CGTATGAACG CTTTGAAAAC AAAGAGACAG AGCCAACATT AGTCGATTGG TTTAGAATCT 1860

TAAAAGAACA AACGGATGAA GGTGCCAAAG AATTAGCGAA TAAAGTGGAA ACCTATTGTG 1920 

TGGGTTCACA AGATATCTTT GCGCACCAAA CGAATATCGA CTTAACGGCA CCCTTTATTG 1980 

TCTTTAATAC AAAAAAATTA GATGATCGCT TGaAAAAGTT TGCCATGAAA GTGATTTTAG 2040 

ATCAGTTGTG GAAACAAGTA GTATCCGGAC AAGGAAAAGT AACGTCaCGT TTGTATTTTG 2100 

ATGAATTACA AGTCAATTTC ACAACGGaAG AAGAAGCAGA ATGGTTCTTA AATTTATGGG 2160 

CGCGTATTCG TAAGTATGGC GCCGTGACAA CAGGTATCAC GCAAAACCCT ACGACGTTAT 2220 

TAGACAGTAA AGCGGGGCAA AAAATGATTG CCAATAGTGA ATTTTTGATT TTATTACGGC 2280 

AAAAaCCAAT TGATTCACAA CGATTAGTAG rAATTTTAAG TCTGACACCT GCATTgAAGA 2340 

AGTATaTCmA TGATCGAGCC CCACAAGGrA CGGGGcTAAT TtCTGCAGGT GGGGTAGTTG 2400

TTCCGTTTGA AAATCCTATT CCAAAAGAAA CACATCTATT TGAAATTATG AATACCGACG 2460

CTTAGGATGT GAGGACATGG TAGAAAGTAA GCGTACTTGG AAAGAAAAAG TACATGTTGA 2520 

ATCTCAAGTA GAAGAGCGTC TTAGAAAAAA TTTGACCTTA AAATCAAGAG AAGGGACGAA 2580 

tTGCAAAAAA AAGCATTAAA GACGACTAAA AAGGAGTTTA AACTTTCTAA GAAAAATTAC 2640

AAGCAATCAC TAAAAAAACA TAAACTTACC TTGAAAACAG GGAAGGAACC ATTAAGTATT 2700 

AACCATTTAC TGGATAAAAA GAAATTTGTA GAGCAAAAGA AACGCAAAAA AGCACAGAAA 2760

ATTGCTTACA AACGAACGAA AAAAGTAGAT GAAACGCGTG TCGTTAACCA AGTGAAAAGG 2820 

GAAACCAAGG AAGGGTTGAA AAGAGAAGCG ACGCAAAAAG TACGAACCAC GTTAACGCAG 2880 

GAAGATACAC TGAATGAAGC AATGACCCTT TATGAAAAAA CACAACAAGC TAAATTTAAC 2940 

ATGCGTACAG CTTTGAAAAC AGGAAAAACG GTTAAGAATC TTTCGGTGAA AACTGCAAAA 3000 

GATACGTACG GATTGGGGAA TCGCTTATTT AACTTTTCAA GAGGACGTGG GTTTCAGCGC 3060 

ACACCAAAAG ACTTTACGTT AAAAAAACAA CTTATGAAAC AACTACGGAA TCGCGCTATG 3120 

CGCTTTAAAG CAGCAAAAGA AGCGAAGAAA GCCGAACAAG GTATTGGCTT GATTCGTTCT 3180 

TTTTTTAATG GTCAGAAGAC ATTAGGGAAA GTCGCTGCAT TGATTCTAAA AAATCCAATT 3240 

TCTTGGGTTG TTTTATTGGT TCTTTTTCTT GTTTTTTTAT TGAGTGGCGT TGCAAGTAGT 3300 

ACACAAAAAC CAGCGATTGT TCAAGAAGAG GAAGATTTAA CCGCTTCATG GACGTATTTT 3360 

ACTAAATTAG ATGCGCAACA TACAGACGAT AACAACCTGT TTTATAGCAA TATCGATGAT 3420

GTTCTTTTCT ACATGAATTA TCGGTACGAT GATTTTAAAT TATTAGATAT GGATTCCACT 3480 

GGTACAAAGA ATTTTGAAAC GATTCTATCT GAACTTTGGA CGGCACTAAA CGGAAAGAAA 3540 

CCTGATTATC AGTTGAAAAC AATGCAAAGT TTAGAGACAG ATAAGAAATC GAGTTATTTT 3600 

ATCGAaGAGG AACAAGCAAA GCATTATCAA GAGaTAAAAA AAGAGTTAGG CTACCAAACA 3660
TTGGATGATC


TCCTTTCTTT 


TCCAGTAAAA 


ACTGACGCTT 


T^ATTGTCaA TaAGCGTTAT 


3720 


GGCTATGACA 


AtCmAAAGAG 


aAGCTCACCT 


TATATAAAGg 


CATTGATGTA TTGATTGAAG 


3780 


ATAATCAGCC 


GTTTCATTCA 


CCAATGAaCG 


GCCAAaTCGT 


TTCAGTGCCT GATACaGAAA 


3840 


CGCTGGTTAT 


CGAAAAaGAG 


AAAGTAGCTA 


GGTTAACGAT 


TCGTGGTGTG AaTACCCTTC 


3900 


GACTAACAAA 


AGGTATrGAT 


GTTGAAGAAG 


GAACATTTCT 


TGGGAATACA AAGAATTCAA 


3960 


CAGTTACGTT 


CCAATATGAA 


AAATATAAAA 


AAGAAACGAA 


AGACTGGTTT TTTGTAAATC 


4020 


CAGCCTTTTA 


TTTTCCACGT 


GTAACCTACA 


CGCAAACGAC 


TCTTTTAGGA AGTGCAGAAT 


4080 


TTTCTCCTGG 


AGCAAGTGTC 


GAAAAAAGAG 


CGCAAGCTGT 


TTATGATTAT TTATCAAAAA


4140 


AGGGATACAC 


CAAAGAGGGA 


ATTAGCGCGA 


TTTTAGGCAA 


TTTTTCGGTA GAAAGTGGCA 


4200 


TTAATCCAAA 


ACGTGCAGAA 


GGGGATTATT 


TGAATCCGCC 


TGTAGGTGCG CATGGGAATT 


4260 


CATGGGACGA ACCGAGTTGG


CTTGCTATGG 


GAGGACCGCA 


AATTTATGGA GGACGTTTTC 


4320 


CAAATATTCT 


CCATCGAGGA 


CTAGGTTTAG 


GTCAGTGGAC 


GGATACAGCT GATGGTGGTC 


4380 


GTAGGCATAC 


TTTGTTATTA 


GACTATGCCA 


AAGGTAAAAA 


TAAAAAATGG TACGATCTGC 


4440 


AATTGCAACT 


GGACTTTATT 


TTTGACGGCG 


ATGCACCAGG 


TTCACGAACA GCAGCAGACA 


4500 


ATGTAGCAAG 


AAGTAAGGTG 


GCTGCAACTA 


TTCCAGAACT 


TACTACTTAT TATCTAACTG 


4560 


TTTGGGAAGG 


AAATCCCGGA 


GACAAGTTAG 


GAGAACGAAT 


TCAAGCTGCA CAAAATTGGT 


4620 


TTACATTCTT 


TTCTCGATCG 


GGTACACCAA 


TCGGTGGTTC 


TGGGAGTGAG GTCTTTGCGC 


4680 


AGTATAAAGA 


AAAAATGCAA 


CCATTACCCA 


CAGACAGAGA AmCAAAaGCA GGACaAGGCT 


4740 


GGCCAGGCmA 


TGCGTATGCG 


CCAGGGAATT 


GTACTTGGTA 


TGTGTTTAAT CGTTTgCACA 


4800 


GTTAGGAAAA 


AGCATTTACC 


CTACTATGGG 


AAACGCCAAT 


CAATGGGTGC ATAACTACAG 


4860 


TCmAACGCCC 


GGAGCsACAC 


TTGAATCGGC 


CCCTAAAAAA 


GGGGATGCTG TTATTTTTAC 


4920 


AAATGGTGTA 


GCTGGTTCTA 


GTACGCAATA 


TGGACACGTG 


GCTTATGTGG AGCATGTAAA 


4980 


CTCTGATGGT 


AGTTTTGTCA 


TTTCTGAAAT 


GAATGTCAGT 


GGTGAGTATT CTATGAATTG 


5040 


GCGTGTCTTA 


AAAAAAGAAG 


CAGGCGAATA 


CTTTATGCGC 


CTAAACTAAA AAGAAAGGAA 


5100 


CCAATAAGAA 


CAATGTATAA 


TATGGAAAAA 


TTAAGTAGAT 


ACAAGAAAAT TGGCTTTATT 


5160 


GGTGGCAGCT 


TACTGCTTAT 


TGTGCTTATT 


TTTTTGATTG 


GGCTTAGTGT TGGGCAAAGA 


5220 


AAACAAGTAA 


ACACCAATGA 


AAAACAAGTA AAAGTCGAGA AAAAAGAGGA ATTAACAACA 


5280 


AGTACGGTAA 


AAAAATTTTT 


GATTGCGTAT 


TATACAAAAA 


AAGATCTTGG TGAAAATAGA 


5340 


AATCGCTATG 


AACCTTTGGT 


GACAAGCGCC 


ATGTATAACG 


AATTAGTGAA TGTAGAAAAA 


5400 


CAGCCTGTTA 


ATCAAGCGTA 


TAAAGGCTAT 


GTAGTCAATC 


AAGTACTTGA TACGTATAAA 


5460 


ATCTATATTG 


ATACTGAAAA 


CAATGAAGTC 


ATTGTGGACG 


TCACGTATAA GAATACACAA 


5520 


CGAACCAAAC 


GAAATAATGA 


TGAAGGAGCG 


TTGAAAAATC 


AGAGCAATCA AGAAGCGATG 


5580 


AAATTAACCT 


TTGTAAAACA 


gGCGCTAATT 


TTTTAGTGGA


TAAAtGGCAC CAGTAACACT


5640
TACCAATGAA


TTmCAAGAGG 


AGCCAAATTC


TTATAATACa 


CATGTAGTAA 


CGACAGAAGA 


5700 


ATCAACAAAG 


GAGAGTGCAA 


ACAGTGGGGA 


AAAGTAAAGT 


AAGTGAATTA 


GAAAAAAAGA 


5760 


AGGATCGAGT 


GGAGAAAGAG 


CTTAATAAAG 


CCAATGAACG 


TCTCGATACA 


TTGAAACAGT 


5820 


CGATTGCTAA 


AAAAGAGAAT 


GAATTGAAAC 


AAATAGAAGC 


GGAAATTGTC 


AGTGCTCTAT 


5880 


TAACAGAAAA 


TAATTTATCT 


TTTAGTGAAT 


TGACTGCGTT 


GTTGAGTGAA 


AAAGAaTCAG 


5940 


GTGGTGAAGC 


mAGTGTTTAT 


AGGGAaTAAA 


ACAACCTATC 


GCTATGAACT 


TCTTCATTCA 


6000 


AAAAATATTA 


AAGGCGAGGC 


GATCGACTCT 


ACCGCTACAC 


TTGTTCGTTA 


TGTGATTGGC 


6060 


GTAAmCAaCG 


ATCTTATTAA 


GAAGGAAGAA 


GAAGGATGGC 


TTTATGTCGA 


AAAAATAAAA 


6120 


GAAGGAAAAC 


AaGAGGACCA 


AATCCTTTAT 


GGCAAGCGTA 


TAGATTTACC 


AATTTTAGAA 


6180 


GACAATCCTT ATTTTCATGA 


TCTTTTAGAT 


CCGTTTGATA 


CAAAAAAAGT 


CATTCCTTTT 


6240 


GTAGAGCTAT 


TGCCTTCAGC 


TGTAGAAAAT 


CAGGTAAAGC 


CAGAAGAAAA 


AATTAAAGAA 


6300 


CCACAGGATA 


TTGTAGAGTT 


AGAGAAAAAT 


AACAAAACCA 


CTATTACTAC 


TGAAGAAAAG 


6360 


GAAATCACTG 


AATTAAAAGA 


GCAACCAGAT 


GAAATGGCCG 


ATAAAGAAAC 


ACAGAAGAAA 


6420 


ATTCTGGAAC 


AAAAGATTCA 


GTTTAAAAAA 


GACGAGTTAG 


CCACATTGAC 


CGAGGAGATT 


6480 


GCGACTAAGC 


AGGAAGAAAT 


CAATGACTTA 


GAACAAAAAT 


TAAGTACATT 


TTCAAAGAAG 


6540 


CCAGAGGGAC 


TTCATTTATA 


AAAAAGAGTT 


TTTAATCAGC 


TAGTAAAAAA GCyTCTTCTA 


6600


GCTAAGAAAy AAGTATTATC 


ACTTtmCCCT 


TACAGTTGTT 


TATGGCCTAT 


CAmAAAAGTA 


6660 


GAGAAATAAA 


AAAGACTaGC 


AAAAGTTGCT 


AGTATTTTTT 


TTGTAAAGGA 


GGAGAATTTT 


6720 


GGTGGAACAA 


AGAGAAATTG 


TACACCGCTA 


TTATGTTTTA 


GGTACTAGTA 


CTTTAGATAA 


6780 


AGTATTAAAA 


GTGCTGCTTA 


ATTTATCCAG 


TGATGGTGTA 


TTGGCGCTAA 


AAGACAAGTT 


6840 


TCTTCCTTTA


aaaggggaaa


ATAATTTGTA 


TAAGCTAATG 


AATCGTGAGG 


ATCCACTGAC 


6900 


TTCGGCGCAA 


CTTCATGAAA 


AAGTGAATTT 


AAATAAGTTA 


AAAGAACAGT 


TGGAAGCCCA 


6960 


AGGGCTTCCT 


TTTGCGTTTA 


AAGAAACCAA 


AGAAGGGACT 


AACTTCTATT 


TTCGAGTAAA 


7020 


AGATACGGAA 


CTCGCAAAGA 


AAGCCTTAGA 


ACGCGTATTA 


ACGGATATTA 


AGAAGAACCC 


7080 


ACAAATGATT 


TTGCGTAACC 


AAATACCATG 


ACTTTTGATG 


AACGGTTAGC 


CTACACTGCA 


7140 


GCCAATAAAA 


AATACGTGGG 


tAAAAtCGaT 


CAAACCAAAA 


GCcAtCcCcA 


AAGGGAAGGA


7200 


AACTGTTGAT 


TGCnG 










7215 


(2) INFORMATION FOR SEQ ID NO: 103: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5504 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:
TGTTCGCCTA


AGAGATTCAC 


CATAACCACT 


GATTGAAGTA 


ATTGGGGTAA 


CCCAATGCCC 


60 


CAATTGCTAA 


TCCCTCGAAT 


ATGCGCATCA 


AATTGACTAA 


ATGAGCAAGC 


TTCAATAGAG 


120 


TAATGACCTG 


AGTTATGAGG


CCGTGGCGCT 


AATTCATTGA 


CGTACAACTC 


TTCTGAATCC 


180 


GTTAAAAACA 


TTTCGATAGC 


TAAAACCCCA 


GATAGATTTA 


ATTCCTCAGC 


AATGTATTGC 


240 


GCCAATTCCG 


CTGCTTTTTG 


TTGCACATTC 


TCAGAAATTC 


GTGCTGGCAC 


AATTGATTCA 


300 


TGCAAGATAT 


TCTTTCGATG 


AATGTTTTCA 


GATACAGGAA 


AGCTTTGGTA 


ACCATGCTTA 


360 


TTCCCTGCAA 


CGATGATTGA 


AAGTTCTTTA 


GTGAAAGGCA 


CCCACTTTTC 


TAATTCACAA 


420 


GTGGCGTTAG 


cTAATAATTT 


AGTCGCTTGG 


GAGAAATCCT 


CTTCGCATTT 


TAAGACGACT 


480 


TGTCCTTTTC 


CATCGTATCC 


ACCTTGAATA 


GTTTTTAAGA 


CACATGGAAA 


ACCAATCTTT 


540 


TCTATTGCTT 


GTGGCAGCTC 


TTTCTGGCTT 


TTTACTTCCG 


CAAAAGGGGC 


AATCTTACAG 


600 


CCAATCGATT 


GAAGAAACGT 


TTTTTCAAGC 


AATCGATTTT 


GAGTAATTTT 


TAACAATTCT 


660 


GTTCCTTGTG 


GCAACTTTTC 


TAGTGGAATG 


GCCTTTTCTA 


GGGAGGCCAC 


TTCAATATTT 


720 


TCAAATTCAT 


AGGTGATCAC 


ATCGCTTAAC 


TCTGCTAGTT 


CAGTTAAAGC 


AGATATCTGA 


780


TCAAAGGCTG 


CCACAATTTG 


ACGGTCAGCT 


ACTTGACCAG 


CTGGACTATT 


GAGCGTTGGG 


840 


TCCAACACAA 


TCATTTGGAA 


ACCCATTTTT 


TTACCACTCA 


TGACTAGCAT 


TCGCCCTAAC 


900 


TGTCCCCCAC 


CTATAATGCC 


AATAGTTGCT 


CCTACTTTTA 


AAGGTTTATT 


CAAGTTGTTC 


960 


CCCACTTTCC 


ATAACCGCTT 


TTTCCAATGA 


TTGTCGTCTT 


GCTTGGATTC 


TTTCTGCTAA 


1020 


CTGTAAATCA 


TCAATACTTA 


GTATTTCTGC 


AGCTAATAAC 


CCAGCATTGG TAGCACCTGC


1080 


TGAGCCAATC 


GCAGTTGTTG 


CTACAGGAAC 


TCCCCCTGGC 


ATCTGCACAA 


TAGACAACAG 


1140 


TGAATCAATA 


CCGTTTAACG 


TTCGTGTTTT 


AACTGGAACA 


CCAATGACAG 


GTAAGGTTGT 


1200 


CTTTGCAGCT 


ACCATCCCTG 


GTAAATGGGC 


CGCTCCACCA 


GCGCCAGcgA 


nTAATAACCT 


1260 


TCAATCCTTT 


TGTTCGTGAT 


TCCTCTGCAA 


AACGAAACAT 


CAAGTCAGGG 


GTCCTATGTG 


1320 


CTGATATCAC 


TCGTTTTTCA 


TAAGGAATTT 


TCAACTCTTC 


CAACACAAGA 


CAAGCAGCTT 


1380 


TCATCGTTGC 


CCaGTCTGAG 


GTACTTCCCa 


TAATCACAGC 


GACTAATGGT 


TtCATGAATG 


1440 


TAATAACTCC 


TTCCGCCTTT 


TTTCACTGTT 


ATTCTAACAA 


TGAGAAAAAG 


AGATGTCAAr 


1500 


GAAAAaCCGA 


AATATTATAG 


ATTAAAATTA 


TATAATGTTC 


GTGTTTATCA 


AAAAAAGAGT 


1560 


CATCGGAAAG 


ATgcTTTTTT 


GCtaTTTTTA 


GCGTTGCTTC 


CTCTTTAAGA AGAAACGTCG 


1620 


CCTGTATTAA 


TAACTGGTTG 


TGTGCCCTTA 


kCAGTAATGA 


TTGAGACTAG 


TAAATTATTA 


1680 


ATAAGTTGTA 


CTTTTCGTTC 


ATCGGTAAAA 


TTAATTTCTT 


GCCCTTCTTC 


AATCTGTTCT 


1740 


AAGGCCATTT 


GTGTCATGCT 


TACCGCACCT 


TCAACAATTG 


TTTGTCGAGC 


CGCTAAGATT 


1800 


GGTTTTGCTT 


GCTGTCTTTG 


CAACATGGAG 


CTAGCTATTT 


CAGTTGCATA 


AGCTAAATGA 


1860 


TTCaACCGTG 


TTTCGATAAC 


CTCTACACCA 


GCCACTGCTA 


AGCGTTCTTG 


CAATTCTTTG 


1920 


GTTAACTCCT 


CGGAAATTTG 


TTCGGTATTT 


CCTCGCAGAG 


TTACATCATT 


ATCTGAAAAA 


1980
GTGTCATAAG


GATATTGTGT 


TGCTACATGC 


CGGATTGCCG 


TTTCACTTTG 


AATTTCCACA 


2040 


AAGTCTTGaT 


AATAATCGAC 


ATTAAATAAC 


GCCTTAGCTG 


tATCGACyAC 


CCGAAAAACG 


2100 


ATGACCGCAG 


AGATTTCGAT 


TGGATTCCCG 


TCAGAGTCAT 


TGACTTTTAA 


TAGAGAACTA 


2160 


TTGAAATTTC 


GCACTTTCAA 


AGAAATGTTC 


ATTTTTTGCG 


TAAATGGAAT 


GGTAATAAAC 


2220 


AACCCATTTT 


CTTTAATCGT 


ACCTAAGTAG 


CGACCAAAAA 


ATAAAATAGC 


TTTCGCCTGG 


2280 


TTGGGACTAA 


CAATGGTTAA 


GGAACTTAAA 


AATAAAATAG 


CCCCAACTAA 


CAAAATAATA 


2340 


CCTAAAACAA 


CAAGAACACC 


ATTGGTATGA 


CTCGCTCCAT 


AAATAAATAA 


ACCAATTCCC 


2400 


GCAATTAAAA 


TAACTAATAA 


GCTCGCAATT 


CCTACGTAGC 


CACTCATATA 


AAATGTTTTT 


2460 


TTCTCTTCCA 


TCTGTTTCAC 


TCTCCTAACT 


CATTTATCTG 


GTCCTTATTA 


TACACCTCAT 


2520 


TGCCTTCAAT 


AGAGTAACTT 


TTAACATTAA 


ATAAACTGAT 


TTTTTCGGGG 


CTCTTGTAGT 


2580


ATACTGAGAA 


CAAAGAAATC 


CCAAGAAAAG 


GAGTGATAGA 


ATGCGCTATT 


TAAGCAAGGA 


2640 


CATTTTAGAG 


GAAGTCATCA 


CTCAACGCCC 


CTCTGACTCT 


TATAAAAGTA 


ATTTTGGGCG 


2700 


TGTTGTATTA 


ATCGGAGGAA 


ACCGACAATA 


TGGCGGAGCa 


TCATCATGAG


TACCGAAGCG 


2760 


TGTATCAATA 


GTGGCGCTGG 


TCTCACCACT 


GTGATTACTG 


ATGTTAAAAA 


TCACGGGCCT 


2820 


TTACATGCAA 


GATGCCCAGA 


AGCGATGGTC 


GTGGGCTTTG 


AAGAAACAGT 


CCTTCTGACG 


2880 


AACGTCGTAG 


AGCAAGCGGA 


TGTTATTTTA 


ATTGGTCCAG 


GTTTAGGCTT 


AGATGCTACT 


2940 


GCACAACAAA 


TATTAAAGAT 


GGTACTCGCC 


CAACATCAAA 


AACAACAATG 


GTTAATTATC 


3000 


GATGGCTCAG 


CAATTACCCT 


GTTTAGCCAA 


GGCAATTTTT 


CCCTTACCTA 


TCCTGAAAAA 


3060 


GTTGTTTTCA 


CACCTCATCA 


AATGGAATGG 


CAACGTTTGA 


GTCACTTACC 


TATCGAGCAA 


3120 


CAAACACTGG 


CAAATAACCA 


GCGCCAACAA 


GCAAAATTAG 


GCAGTACGAT 


TGTTTTAAAA 


3180 


AGTCATCGCA 


CCACAATTTT 


CCACGCAGGA 


GAACCTTTTC 


AAAATACAGG 


TGGCAACCCT


3240


GGGATGGCTA 


CTGGCGGAAC 


TGGGGATACC 


TTGGCTGGCA 


TCATTGCTGG 


TTTTTTAGCT 


3300 


CAGTTTAAGC 


CGACAATTGA 


AACCATTGCT 


GGCGCCGTCT 


ACTTACACAG


TCTCATTGGA 


3360 


GATGATTTGG 


CCAAAACTGA 


CTATGTTGTT 


TTACCAACGA 


AGATTAGTCA 


AGCCTTGCCG 


3420 


ACATATATGA


AAAAATATGC 


CCAGCCCCAC 


ACTGCACCTG 


ATTCTGAACT 


ATTAGAACAG 


3480 


AAGCGTTCGA 


GATAAAACCC 


GAAAAGGCTA 


AATGGTGACA 


AATTTTGGGT 


TAAAAAGCAG 


3540 


GATTTAAGCC 


TGGGCCGAAA 


ATCAAAAGCG 


ATTTTCAGCC 


CAGGCTCTCT 


TTTTATTTTT 


3600 


CCTTATAAAC 


ATTTTTTAAC 


GTTAATGGAG 


CGCCAAAGGT 


ATGAAAATTC 


AAGCCTTTGA 


3660 


CTTTATCATT 


TTGCAGGACT 


GTTTGTTCAT 


TTtGGCTAAG 


TGGAATCATG 


CCAGCAGTCG 


3720 


TTTCAATCAC 


TTCTTTTTCA 


GCTGCAATCA 


GTGTCGCCCA 


TCTTTTTTCT 


GGCTCTAAGG 


3780 


CATAGGTTGT 


GGCTGCTTCA 


TCTAATAATT 


TGTCATAGAC 


AGGGTTCTGA 


TAATTGCGAT 


3840 


CATTGCCCTT 


GTATAAAGTC 


ATCAGGGTAG 


AAATAGGGTC 


TTGATAGTCT 


GGTGTCCAGT 


3900 


AAATTAAGAA 


TAAATCATAG 


TCACTTTCTC 


GCCCAAAGTT 


CAATGCAGCT 


TCAGTCGGCA 


3960 
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ATGCGGTTAG CTCTATTGTC AAACCAGGAA ACAATTCTTG TAGCGAGCCT TGCAAACTTT . 4.020 

CACCAATTTT TTTGTAAGAA CCATCATCTG TTACCATCAA TTCAACGTTA ACCTTTTCTC 4080 

CTAATTCTGC TTGTGCTTTT TTCCAATAAC TTTGCGCTTT TtCTTTGTTA TAAACCATTA 4140 

AATTACCTGC TTCTTGACGA AAATCGAGAC CCGTTGTGGG ATTCGCCACA AAGCCTTCCG 4200

TAATCGCACC ATGTAGCGCT TTAGAACCAT CTGCAATAAT ATTATtGACT AGATTTTCTT 4260

TATCTATTCC TAAAGCCAAT GCTTTACGCA GGTTTTCGTT TGCAAGCGGC GTkGCTTGCC 4320

CTTyCCGTTT TTGATTTAAG CGCAAATAGT TCATTGTCGC TGTTGGATAC GAATGATACA 4380

ACGTATTATT TTTATTCTGT TGCGCCAGTT CACCACTTAG TGTAGCCACA TCTAATTGTC 4440

CATCTTCAAA AAGATTCAAT GCGGTAGATG TTTCTTTGAT AACTGTATAA TTGATAATGT 4500 

CTGAGCGCAC GTTCTGGTGA TCCCAATAGC GATTATTTTT TGCTAGTTGC CAGTCCATCT 4560

TTGTTTGCTG CCAATCTTTT ACCACGAACG GACCATTATA GACGACTTTA TCACtAGCAG 4620

TTCCATAGTC CGCACCAAAT TGTTCGACTA CTTTTkGATT TtGCGGGaAA AATGTCGGAA 4680 

AaGCTAACAA GGACGTAAAG TACGGTTTTG GCTCTTTGAG CGTCACCTTT AATGTATAAT 4740 

CATCCACAGC TGTGACACCT AGTTCATTGG GTGCTAATTT CCCCGCTGAG ATTTCTGCAC 4800

CATTTTGAAT TGTTTCAACG ATGAGGAAGC TATAAACAAA GCCGTTTTTA GGATCAATCA 4860

TTTTTTTCCA AGCATATTCA AAATCATGTG CTGTGACAGG ATCATCGTTA CTCCAAACCG 4920

CTTCTTTTCT CAAAGAAATC GTGTAGGtTT TTCCATCTTC TGaAATCATC GGCAATGCTT 4980

TTGCTACGGC TGGTACCAAT TGGTCTTGTT CATCTAAACT ATATAACCCT TCAAACGCTG 5040 

CAGTTTGGAC AATAGCATCT GGAAAATCCA ATAATACAGA GGTGTTGAGC GTTGTTAATT 5100 

CCGAGGGTGA CATCAGTTGA ACGGCTTCTT TGGTATTTTT CTGTTTCTTT TTATTTTCAT 5160 

TTTTATTGGT ACACCCAGCT AAAAATCCTA CCAATAAAAT ACAAACCATT GTGATTTTTA 5220 

AATACTTTTT CATCTTCTTT CTCCTATAAC TCTTCATTTT TTTATGCTCT TACTTAGAGA 5280

AAGACAGCTC CCCTTTTATT TTGTCCTTTC CACAAGCATC CAGACAACGA CCGCTGGCTC 5340 

CAACCCATTA TTATAATTAA TGTGCCCCTC ACCAGCTTCA ATTAATTGGG CTTGTCCAGC 5400 

TTTTCCAACT AAGTCTTGAC CATCTTCTAA AATCGTATGT GTcGCCAGAA ATCACATAAG 5460

AATATTCATC CTGTTCATGG GGAGCATATC CAGCTAAAGG CCGT 5504 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GTTGCGACTA AACTATAGCC AATTGCTGAA AAACGCTTAG CAATTTCTAA CGCCTCTTCT 60
1 1 iu 11 1 Lnl


LftuLnn 1 Co 1 


PAAAAPTAPA 
CAnAAvj 1 nL.rt 


GCCCCATAAC 


TTGGCAAATG 


CAAGCCAGAT 


120 


(jL 1 1 CAAAbC 


C i 1 1 Al ACAA 


obl 1111 1 CI 


AAATAATAAT. .CACTACCCAT_ 


AACTTCCCCA 


180 


Gl iCAl 1 1 CA 


1 1 1CACCCCC 


TA APT A PPT A 
1 AAC 1 ACo 1 A 


TCTACTTTTT 


GTAGTTTCGT 


AAAGGAGAAT 


240 


ACAGGGGCTT 


I C AC A 1 G AAC 


i lol 1 1 AC 1 1 


TCAGGATATA 


ATCCATCTTG 


ATATCCTAGG 


300 


TCAGTCAGTT 


1 CTCACL A AC 


7\ 7\ TPPPTTTT 
AA1 CCC 1111 


GTAGCAACTT 


GGGCCATCGG 




360 


ATTTTACTTA 


AGAAAGGAAC 


lull CCAC 1 C 


GCACGAGGAT 


TCACTTCAAT 


pap AT A A AP A 


420 


CGATTTTCAT 


/"■* TV 7V >TI TV TV TV 7\ 7\ 

GAATAACAAA 


1 1 C AA 1 A i 1 C 


ATCATACCTA 


CACAGTTTAA 


nprn AP.T^PP 


480 


AGTTTTTTTG 


m tv m tv ti n>r , ^ , /^r i 

TATAATCGGC 


AA 1 1 C 1 i CC 1 


TGAATTTCTT 


GAGAAAGATA 


P T P. P P. P. T P, ft A 


540 


TAAACCGCCA 


T T G AAT C AC C 


Ti/^T\7\TPPJ\PP 

AGAAI GCACG 


CCAGCTCGTT 


CAATATGTTC 


PATA ATPPPT 


600 


GGAATTAGTA 


CTGTTTCACC 


AT C AC AAA 1 C 


GCATCAACTT 


CACATTCTTG 


TCCr A ATA A A 
1 V_ v_v_..f-\n i nnn 


660 


TAGCTATCAA 


CCAAGACAGG 


TV ipplTlTPTPPTl 

ATC1 1 CI CCA 


GAAGCTTTGA 


CTGCATGTCG 


PAT ATAATPT 


720 


TCCAAATCCT 


TTTGATTTTC 


/~i T\ /~i TV T\ rp T""P P 

GACAATT 1 CC 


ATTGCACGAC 


CACCTAAAAC 


AT A APTTP.P.A 
n 1 nnC 1 1 oon 


780 


CGAACTAAAA 


CGGGATAGCC 


CA I ACC A I C A 


GCAATCACGA 


CTGCTTCTTC 


TPPAPTTPTP 
1 oUnvj i lulu 


840 


GCCGTATCTC 


CTGGTGGTTG 


AGGAATCGCT 


AACTCTTGCA 


AAGCTTGTTC 


A A AT A A ATP A 
nnn 1 nnn 1 on 


900 


CGATTTTCCG 


CACGATCTAA 


tv m /-* m m tv tv <n/"* 

ATCTTCAATC 


GTTGTTCCAA 


GAATTTTAAC 


PPprprpPTTTT 


960 


GTGAGCGGCT 


CAGCTAAATT 


7v t\ rn f~* r** rn /~» rp rn 

AATCGCTGTT 


TGTCCACCAA 


ATTGGACGAT 


P APPPPA ATP 


1020 


GGATTTTCTA 


tv t\ m tv t\ m 

AAT C AAT C AC 


AT T C AT G AC G 


TCTTCTAAAG 


TTAATGGCTC 


A A APT AT AAT 
nnnu X n 1 nn 1 


1080 


T TAT C AG AAA 


TV TV /"* TV TA ^ Ml 

CAGAGAAniC 


rppmrn/"* tv TV TipW 

TGTTGAAACT 


GTTTCAGGAT 


TACTATTCAT 


A AT A ATTPtPT 
nn l nn x j.uoi 


1140 


TCATAACCAG 


CCGCTTGGAT 


CCC 1 1 1CACA 


GAATGAACCG 


TTGCATAATC 


AAAPTCGACC 


1200 


CCTTGACCAA 


TGCGAATGGC 


ACCACAACC 1 


AATACTAGTA 


CAGAAGGCTT 


TTTCGAAACA 


1260 


TTGCTTTCGT 


TTTCCACTTC 


Al ACC 1 CC 1 A 


TAAAAATAAG 


GCGTGTGTGA 


TTCAAACTCT 


1320 


GCAGCACACG 


TATCCACCA1 


II I AI ACACC 


GGCACAATTT 


GATTGGCTCT 


t^PflftAAGTCA 


1380 


GCGATGGCTT 


GTTCTGTTTG 


I CCCC AAAC 1 


GCTGCGATTT 


TCCGATCAGA 


AAAAPPATTT 


1440 


TGTTTGGCTT 


pltimfpp7\ TV TV TV /"*■ 

CTTTCAAAAC 


ACC i ACC 11C 


TCCACGTGAC 


TTTCAAGGGC 


TPiTTTPAATT 

j>vji i x v^nn x x 


1500 


TCGATAATGT 


/"•IT) TV 7A TV TV /~rn rf>rn 

GTAAAAGT 1 I 


T\ rn f»p tv Tv p T\ n p 

A 1 CCAACAAC 


AATAAATCAA 


TTTTCGTTAA 


AfTTTG^TAAT 

nL> X X X o X nn x 


1560 


TCI ICGAICC 


rATArCClCi 


/Tr* 7Y7\ TPPPT 

CCCAA 1 vjUL. 1 


TCTGATAAGT 


AG AAC AAT CG 


ATCATCTTGC 


1620 


CCA1CCACCA 


1 I 1 1 C 1 ICC 1 


T 7A 7i T T r* P Zl 7i Zi 
1 AAI 1 v^lvnnrt 


i C AC 1 uhLR 1 


P 7A p "T T B CZ TTP 
onL 1 1 /vj 1 Ivv 


TGCTAACTCG 


1680 


TTGTGATAAG 


CACCAAI IlC 


1 AAAC AAC C A 


ACTGCTTTTA 


ATAAGGATTC 


TTCAATATTG 

x x V— » nn x *x i x w 


1740 


CGGCCAATTG CCATGACTTC CCCGGTTGCT TTCATTTGCG TTCCTAGTTC TCGAGCACCT 1800

TTTTCAAATT TGTCAAAAGG CCATCGAGGA ATTTTGGAAA CAACATAGTC CAAAGCTGGT 1860

TCAAATTCGG CATAAGTGGT TCCAGTAACT GGATTTTTCA TTTCATCTAA TGTTAAACCA 1920

ACAGCAATTT TTGCAGCTAA CTTCGCAATG GGATACCCTG TGGCTTTACT AGCAAGTGCA 1980

GATGAGCGAG AAACACGAGG ATTCACTTCA ATcACaTAAT AwTTAAAGcT ATGCGGGtCC 2040

AArby i AAI K


TA TV f** t . rp 7V f** — i 

bAAbWl I Aba 


bbbALL 1 J. LA 


ATTTTTAATP 
Ai«l 1 1 lArilLt 




TTTPAArGAP 


2100 


GCATCTCGCA 


TV /**• TV rn rn rrt /■> /■*• m TV 

ALAI ITbba A 


rp rp (~+ TV rp T\ T\ /■> rp 

I 1 OA 1 AA 1 L 1 


C A T 7A A PCT T T 

LA 1 AALL 1 J. 1 


vjrtiw 1 vyOunuv^ 


AAAAACAATC 


2160 


GAATCCCCAG 


TGTGAATGCC 


bAb AbbA I bA 


A Ar"*T r rT r PP , P , 7A 
AAL1 I 1 1L-LA 




PAPAATPflPA 


2220 


TTATCAGCCG 


AATCACGCAT 


LAL 1 1 LA1A1 


m 7\ a tttptt 

I LAA1 1 lul 1 




A AT APTTTTT 




TCAATTAAAC 


ATTGTGTAAC 


m rp tv rp t\ tv rp 

TGGTGATAAT 


1 ILAbALLb 1 




TTC^nPr^CAPT 
1 1 LioL,ov3AL7 1 




TCTTCTTCCG 


TATCACACAT 


GCCGCCGCcT 


/■•mippp A P" T A 

b 1 1 LLALL 1 A 


AAvj 1 AAA 1 vjL 


PPP APPA APP 
Li Li L7 AL* L7 AAL o 




ATAATTGGAT 


AGCCAATTCG 


rn fT> m ^ /"■*/"■* TV TV TV TV 

TTTGGCAAAA 


b^LAHjvjtl 1 


rTTrr a pppt 


ATTPAPAATT 
nil onvyftn 1 1 


9 4 fin 


TCACTTTCTG 


GAATCGGTTG 


TTCTAATTCT 


I LLAILAA1 1 


^mmrpTvnaTAA 

vjl 1 lAAAXArt 


ATPAPPP.TPT 
ril L/A^wOVj 1 Lf 1 




TCCGCTTGGT 


CAATCGCACT 


GAGCTTTbTT 


LL 1 ALLAA 1 1 


PT APATTT AP 


PTPATPTAA A 
L> 1 \*r\ 1^1 rtftn 


9Sftn 


ATTCCTGATT 


CC GAT AATTC 


CATCGCCATA 


m m m tv tv T 1 7\ 

TTTAAILbAb 


rn m rp *^ yi CH 7A /** 

111 LALLALL 


T A ATPTTPPT 
1 nn lull LjLj 1 




AGAAGAGCAT 


CAGGACGTTC 


TTTTCGCAAA 


7\ m monn rp /"> TV 7\ TV 

AT TL 1 IbAAA 


LAAAl 1LIAA 


APT A ATPPPT 
AVj 1 AA 1 L«LiLi i 




TCGATATAAA 


CATGATCCGC 


GATTTCCTTA 


TCGGTCAI bA 


1 Ablbbblbb 


ATTTPAPTTT 
Al 1 1 LAL? Ill 




ACTAAGACAA 


CTTCATACCC 


TTCTTCTTTT 


AGCGCTAAAL 




PPPTPP AT AP 
uL.L 1 LjL A 1 rtu 


9R90 


TCAAATTCTG 


CTGCTTGTCC 


AATAATGATT 


/■*> /-» fn /-■ /"• m /*" tv 7v 

GGTCCTGAAL 


b AA 1 bAbb A J. 


LA 1111L11L 


9flftf) 


ATGTCCGTTC 


GTTTTGGCAT 


TAGTTTTGCT 


/"« 111 m y"t /*■ TV rp/"'/*' 

CCTTCCATbb 


7\rn/^/^T\rnrpTi 7vm 
AlbbAl 1 AA1 


TPPAT A A ATT 
1LLA1AAA1 1 


9Q40 


CGTCAAATAA 


ATGAAGACCA 


TCGTGCGGTC 


CTGGAGCCGC 


TV rn/-«rp/-<r->/— <rp/— j\ 

ATb I bbb IbA 


1A1 1LLALAL 


JUWU 


TAAAAGCAGG 


ATAATCTCGA 


TGACGCACGC 


CTTCAACCGI 


I bbAlbAi 1 b 


APTTPPAPPT 
AL 1 1 LLALL 1 


JUOU 


GGGTCACAAG 


TAATTTTTCT 


GGATCAATTG 


TTGCTTCATC 


JL AboLiLA 1 A 1 


L,LrU 1 ViLi 1111 


3120 


GTGAAGTGAA 


ATCAATGCGT 


r^i m ^ rn rn ^ <^ tv t\ 

CCTGTTGCAA 


mrnrn/" , rn/" , /""rp7\/*» 

TTTClbb 1 AL 


tv ff* TV TT A r P r Pf 

Abb A 1 b A i 1L 


A A APPPPP. AT 
AAAL L oL ori. 1 


3180 


GTCCAAACTT 


CATCTTATAC 


GTATCCGCAC 


/"'TV rp rp f~* rn TV TV 

CAI ILbbiAA 


JL LirtAHA 1 HAL. 


TP A TP A PPT A 


3240 


AGCAGATGCC 


GAAAATCGGA 


ACTTTTCCTT 


GGA1 I bb lib 


AA 1 L. A 111 L. A 


AT APPTTPAP 


3300 


GCACATCTTT 


GGGATCTCCA 


GGCCCATTTG 


1 bAAL A 1 bAb 


ALL.A1 ULnLioA 


PTT A A TTPT A 


3360 


AAATCTCTTC 


GGCCGTTGTA 


miTI7vm7V TV/* , /"'/~tTV 

T T AT AAGGC A 


AbAbbbi 1 AA 


rT^rr a ttpp 
\j 1 1 oLAl 1 uL 


PPTTTTP AT A 


3420 


ATTCTCGCAA 


AATACTATGT 


mm/™* tv tv /-'/"■' r~* Tv tv 

TTlAAClLAA 


AA1 bAAUtjAL 


P APP APATT A 
LAL.LAL.A 1 in 


PPPPP A AT AP 


3480 


CTGGACTCGG 


ATAAGGCTTA 


/"Tn/"""/"*rn/" , ^* , TV TV 

GTGGTGGAGA 


CT I b 1 bbbAb 


1 1 bn 1 1 1 1 1 L/ 


PPP ATP APTP, 


3540 


TCGCTTTTAA 


TTGATCAAAC 


GCATGCGGCA 


TV JVTPTVPrT'JiP 

AA 1 b A 1 bb Ab 


PPP ATP A ATfS 
LiLiL.r\ 1 L.riM i va 


ATPPTPPPTT 

r\l 1 VyVyVy 1 I 


3600 


TCAkTGTTCC 


AGCTGATCGT 


tv i\ /rnrn tv /""" /*• Tv 

AAC T T AC GAG 


1 b AbbbbAbb 


PPT A TP A ATT 
Lv3 1 A 1 L,AA 1 1 


PPTPAAATTP 
VyLlunnni iv^ 


3660 


CCGGAATCCC 


TTTGCGCTTT 


TVTVTV TV 7V rp cry r-* y\ rn 

AAAAATTLAT 


T\ TV/^/^rn/^T\rp 

bb AAbb 1 b A 1 


1 1 Li 1 1 IjALuL 


PAATTTPAAP 

v_//-\_rt ill OALrlO 


3720 


CAACCCGCGC ATGCTCTTTA ACAATGACAC CTTTACAGGT TGGTGCaATG GATTCATaAT 3780

CaTCTCGGTT GACCCCGTAA TTCCCaACTA ATGGATACGT AAAAGTAATC ATTTGTCCAT 3840

TAAAACTTTG GTCAGTAATT GCTTCTTGGT AACCTGTCAT GCCAGTAGTA AAAACAACTT 3900

CTCCTACTAC ATTGCCtTCT GCaCCAAACG CTTTTCCTTC AAATACTGTT CCGTCTTCTA 3960

AAATCAATAA CCTTTTCAAA CCTACTGCGC CTCCTCCGAC CAAGCTAATG CTCCGTCAAC 4020

A A A TP, TP A T A


AP,TP»TPTTTP 


PTTTP,APTP,T 


CCACCCTGTA AAJGGTGTAT 


TGACAGCCAT 


4080 


T P A T T P A A A T 


P.PTTPTP.PAT 


p a a t t p. p,t p,p 


AGCTGTTGCT 


AAGTCAAAAA 


CTGCAATATC 


4140 


ibb i bb IbbA 


ppn A. T T o TT 7i 


PTPTTPPTP-P 
b 1 b 1 1 bb 1 


GTTTAATCCA 


AAAATTTCTG 


CCGGTTTCAC 


4200 


I bbbAl bbAA 


1 bAA 1 Ln^ 1 1 


PTTPT A ATPT 
bl l^lnnlvjl 


GAAAATGCCT 


GTTTCGACAA 


AGTTCGTATA 

aw x x N-^* \j x n x ** 


4260 


AArbAAI lbb 


a Ar , r , p/ r " , P r PT r P 


PAPT APPP AP 
bAb 1 n^lvOnL 


AATACCAAAA 


GGTGCGTTTA 


AAAAT GAT T G 


4320 


mmp rr»rn rprri rp/rp 

TTbi i I l Ibi 


TPT A & APPAT 


P.P,PPPPP7iTP 


GTCGGTAGCA 


ATACAGTCAA 


TCGTGCCATC 


4380 


TAACAAbbbb 


TP A ATP A ATP. 


PTTPTPPATP 


TGCTAAACCA 


CGTAATGGCG 


GGTTCATTTT 

NkJ \J X 1 vHA XXX 


4440 


CbAAAAbbb 1 


TPPTP APP AP 
1 bb 1 LnuLnu 


P. A ATATPPTP 


ATCGACTAAA 


ATTAAATGGT 


GTGGCGAAAC 


4500 


TTCGbblbX 1 


Abb 1 bnii 1 Lt 


b X bb 1 1 1 1 1 1 


AGCATCTCGA 


ATAACGCGCA 


PGPTTTCTTC 

\w \J \^ XXX X X >^ 


4560 


rp /Tp rp /-» Tv TV TV P A 

TGTTGAAACA 


rp T\ /*■» TV TV TVpnfp 

T G Ab AAAb A I 


PZiTAATPAAP 
b A 1 HA 1 bnnL 


GCCTGTTTCT 


TTCGCTAAAG 


TAATATCCCG 


4620 


tv y"»^ tv TV rp rp rn rp 

AGCAATTTG i 


bAAbb I 1 Cbb 


TTTPJi r*TT 7A 7A 

1 1 bbAb 1 1 nn 


AATCCCTGGT 


AGGCCTAATT 


TPT T AGAAAC 


4680 


TTCTCCTTCA 


TGCATbACAb 


P" A rT"7\ A ATA A 

b Abb AAA 1 AA 


TAAAGATTCA 


TCTTCCGTGT 


P,PGPAAPAAG 


4740 


TGCCATATTT 


AATGCAGbbb 


rri ffi crt m ^* tv rp 

bT rbi i ibAi 


GGCTAAGTAC 


ATCGTTCCTG 


PPGTTTGGAP 


4800 


TV TV TV /*"< TV 7\ 

ACCAACACCA 


TCATTCGTAA 


Al bbAAAbbb 


GCCCGCTTCT 


TTTAAGGCCT 


TPTGATTGGT 
1 Vh> 1 on l l vjvj i 


4860 


TAAAACTTCA 


CTCCGTAGCT 


CTTCTGTAAT 


GGGGGCATAT 


TGTAAAACTT 


TPAPP,APTGP 
1 \ynvyV3rtU 1 OVrf 


4 920 


ATCTTTTTGA 


TV rp/Ti TV TV /"*rp /■»/"* *P 

ATTAAGTCGT 


AAA r*PTP APT 

AAAbb TbAb 1 


TAATTTTTCA 


GCCGTATCTG 


GTAPTGGATT 


4980 


CAAATTGGGC 


TV CP rp/'-'r-' /-■/-« TV 7\ TV 

ATI bbbbAAA 


b.bb 1 Ab J. AAA 


ACCACCGCGA 


GCAGCAGCTT 


TAPTACCCGT 


5040 


TTTAATCGTT 


TCTTTATACG 


1 AAAbbb 1 bb 


TTCTCTAAAG 


TGAACATGCA 


CATC C AC C AA 


5100 


GCCTGGCGTG 


aTTAGTTGCC 


bbl 1 bbbAl b 


AAACACTTGC 


TCAAAAtCTG 


CTTCGTCAAA 


5160 


CGATTCGCCG 


ATTGCGTGAA 


IbAbAbbAl 1 


TTCTAGCCAA 


AGTGCTGCCT 


CAATCAATTG 


5220 


ATTCTCTTTT 


TTGATGATTT 


TTPPIiTTPTT 
1 I bbA 1 1 b 1 1 


AATTAACGTC 


TTCATTTCTC 


CACCTAGGCC 


5280 


TTTCCATGTA AAATTGCTTC 


p a zi z\HTT(^rr 

bAAAAl lotvv 


ATGCGCATAA 


AGACTCCGTT 


ACTCATTTGC 

llv— » x un X X X VJ\y 


5340 


GCGACGATTC 


TTGATTGCAG 


Z1PTTTPP APT 
Av— 111 l-.V-.Ak-. 1 


AATTCATCTG 


CCAGTTCAAC 


ATCCCGATTG 


5400 


ACTGGCGCTG 


GATGCATAAT 


A A T T P.PP. T P T 
AA 1 I bbb 1 b 1 


TTCTGTAAAC 


GAGTTGCCCG 


TTCGTTGGTT 


5460 


AAACCATATT 


CCAAATGATA 


TPPTTPTTTT 
oLVrf 1 1L1 111 


GAAAAACTTT 


CCTTACCATC 


ATGACGTTCG 


5520 


TGCTGCACGC 


GAAcAACATC 


A TP APATPP.A 


CTTTTTCAAC 


AATTTCATCT 


AGCGGCACAT 


5580 


ATTGTCCATA 


GACATCAAAC 


TP,ATCATPP,T 
1 1 V3rl 1 1 


ACCACTCTTC 


TGGACCTGAA 


AAATAAATTT 


5640 


CAGCGCCTAA 


ACGGTTTAAT 


A APTPP ATAT 
rlrtV^ 1 ijvn 1 A 1 


TTfZATTTACP 
1 1 un 111 rivjVw 


AAPCPfiAGAA 


TGTGTAATAT 


5700 


CTCCCACAAT CGCCACTTTC AAGCCTTCAA ATCCACCAAA TTCTTCATAA ATAGTCATCA 5760

GATCCAATAA ACATTGTGTA GGGTGTTGTC CACTCCCGTC CCCACCGTTA ATAATAGAAC 5820

ATTGAATCGT TTTACTTTGA ATCAATTCAT CGTAATAATT TTCTTTACCA TGACGAATGA 5880

CTGCTACATC TACACCAATT GCCGACATCG TTAGTACTGT GTCGTATAAC GTTTCTCCTT 5940

TTTGAACAGA ACTTCTACTT GCTTCAAACT CAATCACTTC TAAGCCTAAT TTTTTCTCAG 6000

CTACCTCAAA ACTTTTATGT GTTCGAGTAC TATTTTCAAA AA^CAAGTTT GTTGCAAAAT 6060

JVTTGACGTTC TTCTGGGTGC CATTTTGCCC CCTGTTTAAA TTCTCCTGCG CGACGGATTA 6120

ACCCCATGAC TTCACGGTCG GTTAAAGCCT CAGCAGTTAA AAGATGTTTT AAACTAATTC 6180

GTTCAGATGT AATAATCATT CGTCCAGTCC TCCTACTGTT CTTCGCTACG TGCGTTTTCC 6240

GGTAAAATTA AATTCAAAAC AATTCCTAAG ACAGTTGCTA ATGCCATTGA AGACAATTCA 6300

AAAGTTCCTA CTTTGAACAC TAAGCCACCA ATGCCGATGA CTAAAATTAC TGAAGCAATC 6360

AATAAATTCT TTTTCTTATC AAAATTAATC TTATTATCGA TTAAGATTTT CAACCACTGG 6420

GGnAACAAAC CGA 6433

(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5855 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:

ATAGAAACCG CCTCTCTCTC GACACGCAAC TTGCGTGTTA TCTGTACGAT TTGTTTACAT 60

GTTGCATTGT AAACAGTTTT TAGGCAATCG TCAAGCAGAA AATAATTAAC gTTGCAtGAA 120

GAAACtAATA ATTGCAAAAA TAGCAATGTG AGCTGGATAG AAAATATAAA AGAAATTCCG 180

CATGCCACGG CCTTTTTCAC CATTATAAAG AACAATTGGA ATCGCTGCTA AAATCATCAT 240

CCATTGGTGG TTCACC-JoA ATAGTTCTTG GAAATTATAG CTGGCTGCTG AAATCAACGC 300

AACAACGCCA ATTGCTGAAA TTTGTAACCA CTTTTTGCCA TGGAATAAAT AAAAAGCTAC 360

GGCCAAGAGA ACAAATAAAA ACCCACCTTC TGTCATCATT AGTGATGGCA CAATCAGATT 420

GAAAATTTGG AAACCAATTA AGATAGCAGG GTTGCTTAAT ATGTCTGGAG AACTAAAAAC 480

AACGAGAGGT AGGGCACTGA TTAATAAAGG CACAATAAAC CAAAGAAGCC CTTTACCAAT 540

TTGTTTTTGT TTAAAATAGT CCATAGACTG CATGTAGACC GTACCTAAAA ATAACGTCCC 600

AAAAATATTA TTGATAATTA AGCCACCTGT TGAAAAATAC GCATTTAAAA AGCCGTTCAG 660

AATCCCCATA ATCCAGAAGC CTAACAATAA CCGGAACATG TATTTTCGCC GATTGCTTGT 720

GTGAATGAAG CCTTCAGAAC TTTCAAATAA GAAAATAGGT GCAGCAATTC GTCCAATCCA 780

TCCGAACCAA CCAGGCACGC CTAAAAAACT AAAAAATTGC AATAGATGGT CAAAGACCAT 840

TGAAATAACC CCAATGACTT TCAAATGAAA GCCAGTTAAT GTTTTACCCA TAGTATAAAC 900

GCCTCTCTTT CATAAAGAAC AATATAACAA AGTCCAGAAA AACCAACCAT TTATTTGCCT 960

AACAGTTATC TTACAAAATT GTTTGATTTC TTCAGCTTTT GATGAAAGAA AAAAGTGCCA 1020

AGACTAGTTT TTATAATTAT TCTATGCTAT AGTAAAAGTA CATTTTAATA AAAAACATTT 1080

GGGGTGCTGT TATGGCTGAG ATGATACCCA TTGAACCTGA TGCAGTTAGT ACTGTCGCAG 1140

GGAAATGCCG ATTTACTTTG ATTTTTCACG CACTCATTCT GTCCTAGGAT GAGTGTGTTT 1200

TCTTTTATCA GGGTTTATTT AAGTCACTCC TTTTGCTTTT TGTTAAGAGA AAGGAGTTTT 1260

TTTATTGAAA AAATGGTCAT TGCAACAAGT CGTCTTACTC GCTTTTTTAG CCTTTTTGTT 1320

TGGCGGTGTT TTTATGGGGG CAGGATTCCT CTATGCGTTA TTGAAtGCTG CGCTGTTACC 1380

ATTAGGACTG AGTCCATTTG CCAATGAGCT TTTATTTGGA ATGTGGACCA TGGTGGCACC 1440

AATTGCAGCG ATGCTGATTC CACGAGCGGG AAGTGCTGTT TTGGCGGAAG TTTTAGCCGC 1500

TTTGGCAGAA ATGCTTTATG GTTCGTATTT CGGTCCTAGT GTATTAATTT CTGGTGTGAT 1560

TCAAGGTCTA GGTAGTGAAA GTGGCTTTTT GGTGACACGC TACAAACGGT ACGATACACT 1620

AACGTTATTT TATAGTGCGA TTGGGACGAC TATTTTTAGT TATGTGTATG AATACTTTAA 1680

GTTTGGTTAT GGCAATTACG GTCTGGGGAT GAACATTGCT TTAATTAGCG TTCGGTTCGT 1740

TTCTATTTGT TTCTTTGGTA TTTTCTTAAC GAAAGTTATT TTACGAATGT ATCAGTCTGC 1800

GCAAGGGTTA GCTGTGAAGG CAAAATGAAT AGTATCCAAT TAAAAGCTGT GTCCTTTTCA 1860

CGAGAACAAC CCTTGTTTGA AAAGACAAAT TTAGCAATTC CTCAAGGAAC GTTTTCCTTA 1920

TTAATTGGCG ATAGTGGTAG TGGAAAATCA ACGCTCTTAC GATTGATTGC CGGATTTGCT 1980

CCGCTTGATT ACCAAGGAGA AATTTTGATT GAAGGAATGG AACGTCGACA ATTATCAACT 2040

CGTGAAAAGG CGCAAAAAAT TGGCATGCTG TTTCAAAACC CCAGTCAACA ATTTACAATG 2100

AAAACCTTGG AACGAGAACT GATTTTTGCT TTAGAAAATT TAGGAATCCC GCCTGAAGAA 2160

ATGAACCGGA AAATCCAAAC AGCGCTTCAA CTTGTTCAGA CTCAAACCTT ATTCACACGA 2220

GAATTAGCGA CGTTATCTGG TGGTGAAAAA CAAAAAGCGG CGTTGACAGT TCTGCTAGCA 2280

ATGAACCCAG ATATTCTTTT ATTAGATGAA CCTTTTGCAA GTATTGATCC AACCTCTCGA 2340

AAACAATTTA TTCAAATATT AGCAAGACTC CATCAAGCAG GCAAAACGAT TTTGGTGTGC 2400

GACCACGATT TCAGCGATTA CGCTGATGTT GTGGACCAAG TCGTCACGTT GAAAAATGGC 2460

CAGTTTGAAA AGCAACCGTT GACGTTTATC AAAACAAAAC CACAGACATT TCAGCTTACA 2520

ACTTCTGTGG TAAAACAACC AATGCTCCAA TTGAAAAATT TTCGTTTGTC TCAGGGAAAA 2580

CGAGTGCTAC TTGAAGAAAA AGAaGCGCTT TTGTTTAAAG GAATTACCAC GTTGACGGGG 2640

CCAAATGGGG CGGGAAAATC CACCTTATTA AAAGCCATCG TTCAGAGACA AAAATATCAA 2700

GGGAAAATGT TCTTGGCAGG CCGCCGCTTG CGGGCTTCAA AAAAATTGTA TCAACACATG 2760

ACATTAGCGG TGCAACAAGC CAATCGTCAG TTTGTCACGT TGACTTTACG AGAAGAGCTT 2820

TTGTTTGGTC AGAACATGAC AGCAGAAAAA AGAAGAAAAC AAGAGGAAGC CTTAACTTTT 2880

TTAGGTTTAA AAGAAAAACT AGAGCACAGT GTTTTTCAAC TAAGTGAAGG GCAAAAAAAG 2940

ATGGTGCAAT TAATTAGTTT GCTGAGCTTA GATTTGGACT GTTTACTTTT AGATGAACCT 3000

TTTGCTGGAT TAGATGAGCG AGCATGCAAT TATTTTGTTG AATGGATCAA AGAAAAAAGT 3060

GCGCAACAAG ATTTTCTAAT CGTCACGCAT CGGCTAGAAC CTCTCTCTGG CGTGAGCAAT 3120

TACCGAATAG AATTGCTGCA GCAACAACTG ATTATCTGGC AGGAGGGAAC CACATGCAAA 3180

TAAAGCAAAC AAATGCAGCA ATTTATGCTT CTTTGATTCT TATTTTGACC TTTGAATTAT 3240

CTTTTTCTCA GTCAATTTTG GCAAATTTGG CGGTTTTTAT GGGTTGTGTC ATTTTTTTAA 3300

TAGGCCAGCG AAAATCCCGC TTACTGCTTT GGTTATTTTT CTTGCCACTT TTACCAGCAA 3360

TCGGGACGTT TTGGTCTATT TATTTGCATG GTACGAGTAG TCAACAAGCC TGGCTTTTGT 3420

TTAGCCGAAC GTATGCCTTT GCCGGTTTAG GACTAGCTTT TGCTGTAGGT GTTGATTTTG 3480

AAGAATTATT GTTGTTATTG GAACAAAAGG GGCTAGCACC GAACTTTGTT TATGGAATTT 3540

TAGTGGTGGT TCACGCCTTA CCGGAAGTGA AAAGAGAAAT TAATGATTTA AAAGAAGCAA 3600

GCCTACTCCG TGGCAAAACG TTTCATTTTT GGTCGCCGAT GCTTTATGTT AAAACACTGC 3660

TCGTCGCTGT TTCTTGGCGA GATAAATATA CTGAAGCAAT GTCTGCCCAT GGCTATGAAG 3720

AAGGAGCCAC CAGAAGTCGA AAAGAGCATT TTGTGTcAGC AAAAGTCAGT CTGGGGCTTG 3780

CTATTTTGAT CATCATTTTT ACAAATTTGT TTATTTTTCT AACGTAAAGG AGAAACCTAT 3840

GACATTTACA GAACAAGCCA AAGATCaAGC GCAAGCTTCT TGGCAAGGGA GTTTTCAGCA 3900

TCCTTTTATC ACAGAATTAC ATGAAGGAAC CTTGAGTCCA ACGATTTTTC GCTACTATTT 3960

AATTCAAGAT CATTACTATT TAAAACACTT TAGTCAACTT TATCGCTTGA TTGCGGCACA 4020

AACGCAACAA CCTCGATTGA AAAAATTACT CTTAACAAAT GCGGAAAATC TCGCGTTGGG 4080

CGAATTAGCT ATTCGTGAAA CTTTTTTTGA AGAACTAGCG ATTACGGAAG AAGAAGTTGC 4140

AGCGACACCG ATTGCACCAA CCGCGTATCA CTATGTTTCT CATATGTATC GCCAACTTAT 4200

TGAAGGAACG CCTAAGACGG CCGCAGCAAG TATGTTGCCT TGTTCTTGGC TCTATCAAGA 4260

AATTGGTGCA CAGTTAGTCA AACAACACTC ACCAGAGCCA CTGTACCAAC GTTGGATTGA 4320

AACCTATGCT GGGGAGGAAG CGTACCAACA CGTCCAAGAG GAACGCCAAC TGTTAGACCA 4380

ATTATATGAA GAAAGTTCAC CACAAGAACA AGCAGCTATG ATCaCCGCTT TTGTTATCAG 4440

CaGTGAGATG GAATACGCTT TTTGGGAAAT GGCCTATACA CACGAAACCT GGATTGGTTA 4500

ATACAAAAGC AAGTCGTTTA GTGCTCGTAA CTAACGACTT GTTTTTGTAT TAGAAAGCAG 4560

TAATTGTCAG TTGCTTTTTT GACAGGGCGT GATAAGATAT TCAGTGACGC AAAATCTGTT 4620

GTAACTCGCT CAGTAATAAC AGAAACGCCT TGCTAAGAAA GGACTGTAGA AGCCTCATGA 4680

ATTATCCAGA ACCGATTGCA AAATTGATTG AAAGTTATAT GAAATTACCA GGAATTGGTC 4740

AAAAAACAGC GACACGGCTT GCTTTTTATA CAATTGATAT GAAAGAAGAA GATGCTAATG 4800

CTTTTGCAAA AGCGCTAATT AGTgTCAAGA GAGaTCtCCA TTTCTGTAGT aTTTGCGGAA 4860

ATATTACTGA AGAAGATCCT TGTGAAATTT GCCAAGATAA AAATCGTGAC CGTAGTATTA 4920

TTTTAGTAGT GGAAGAACCA AAAGACGTTA TGTCAATGGA AAAAATGCGT GAATATCAAG 4980

GCTTGTATCA CGTACTCCAC GGGGTGCTTT CACCGATGGA AGGAACAGGA CCAGAAGATA 5040

TTAATATTGC CTCTTTGATC AAACGATTGC ACGATGATGA AGTGAAAGAG GTCATTATAG 5100

CAACCAACGC TACCACTGAA GGAGAAGCGA CGGCGATGTA TTTATCACGG CTCATTAAAC 5160

CAGCTGGAAT CACCGTTACT CGTTTGGCAC ACGGCTTGTC TGTCGGCAGC GATATTGAAT 5220

ATGCaGACgA AATAACCTTG CTAAAAgCAG TCGAAGGACG TCGAGAAATT TAAAAGAAAC 5280

ATCGTTGGAA AAAGAGGTTG AAATCGAAGT AGGTCGTTTC TGTGAGAAAA TGTCGCAGTT 5340

CAAGCTTTwC CGTACTTCTT TTTCAGTTTC TTTTTTTCAG CGGATGCATC ACTTTAAATA 5400

GGACATAAGG ATTTCAATTA GACTGATAGA GCAAGATGAA GTATAATAGA TACAAGTAAA 5460

AATTGATTGG AGTTTTCGCA CGTGCAGGgT ATTTTTATCA CATTTGAAGG TCCAGATGGT 5520

GCTGGtAAAA CCAGTGTTTT GAAGGAAGTT TCTGAAnATT AGCTAAAGAG TCAAAACGAA 5580

AAATTGTCAC TACAAGAGAA CCAGGCGGCA TACCGATTGC CGAAAAGATT CGAACAGTTA 5640

TTTTAGATCC AAGAnATGAC AGAATGGATG AGCGAACAGA AGCTTTACTT TACGCAGCTG 5700

cTCGTCGTCA ACATTTAGTG GAAAAAATAT TGCCAGcTTT GGAAGCGGGT CATTTAGTCC 5760

TATGTGATCG TTTTGTTGAC AGCTCATTAG CCTACCAAGG CGCTGGACGT nGAATTGGGG 5820

ATGGCCCCCA TGCCTCAATT AATGCGGTTG CGGCA 5855
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:



GTTTCATAAC TGTACCAATT GGAAAAACTG ACATGGAACG GCGTAACTCT TTTTCTCTCA 60

TCTCCATCCT CCTCTCTTAG TTATTTCTTA GAATAACAAT AGAATGACAA ATTGTCAAGA 120

ACTTATGTTA GATTTTCTAA CATAAAACAA AAAACCTGAT AAATATGGAC TTTTCCATCT 180

TTTCACGGCT GATTTTCTCC TACTAATCAA AAAAATAGAA ATGATTTGTT TTTACAAATC 240

ATTTCTAATC AACTTACCAT TTTTTATTCT TTTTCGGATT CAGCCAACCA TTTAGACTTT 300

TTTTTCGCAA AACCTTTCAC GATATACCGA TTTTCATTTT CAGCATATTC TTCTGAAATC 360

ATTAGTGTCA TCCGGCGAAG TTCACTTAAT AGTTGTCCGT CAGTTGGGCT AATTTCTAAT 420

TGATAAGGCT CGAGTAATTC CATCATTTGG GCACGGATAG CTTGAACCAG TCGTTCTTTA 480

TCTTCAGAAA TTTTAGTTGA GATCAATACA TTGGGAAATA AGGTTGGGAC AAATTCTTTA 540

CTGTCCACTT GATCTCTTTT ATTATAAACC GTCAAGCACG GAATATTTTC CAACGCTAAT 600

TCTTTCATCA ATGTTTGAAC TGTGCGTTCA TGTTGTAAAC GATCTGGCGC ACTTGCATCC 660

ACCACATGGA GCAGTAAATC CATCGTCCGA CTTTCTTCCA ATGTCGACTG AAAGGCTTCA 720

ATCAGCTGTG TTGGTAAATC TTGAATGAAG CCAACCGTAT CGGTCAGCGT AACTTCCATG 780

CCTTGAGGCA ATTGCCATTT TTTGGTTAAA GGATCTAACG TTGCAAACAA TTGATCCTCT 840

GAATAAGTTC CTGCTGTTGT CAACATGTTG AGAATAGTTG ATTTGCCAGC ATTGGTATAA 900

CCAATCAAGC


/~» tv tv rrv m rn tv 71 tv 

CAATTTGAAA 


rp TV /Tprp/Tn/ r ">T\ 7\ 

TAGTTCTGAA 


T.TTTGTCTTT 


Jl b X bbbbbb 1 


1 t-b i 1 UAbbA 


Q£0 

7DU 


TGAGCAGTGA 


CTTCTTTTAA 


CTCACGACGG 


TV rn O /™""P 7\ 7\ /"> Tv 

AlbCC 1 AAbA 


ill 1 Al 1 bbb 


AA 1 b 1 bbbbb 


J.UtU 


CGGTCCGACT 


CTAATTTCGT 


TTCACCAGGC 


CCGbbTblbb 


C AA 1 Abb 1 bb 


I bL-AAb I bbb 


i nan 


GACAACTGTT 


TTCCTTGGCC 


AGCCAAGCGA 


f~* f {~* T\ Tv rp TV T\ T\ rn 

GGCAATAAAI 


AAlbbAAl lb 


ibbl AAI lb A 


1 1 4 n 

i X H U 


ACTTGTAACT 


TTCCTTCTTT 


GGAGCGCGCA 


p/sp7i TV TV O TV TV 

CGCAAAbCAA 


7\/->TvrpTvrp/-</~t7v tv 

AGATATCCAA 


AATTAATTGT 




ACACGATCAA 


TTACTCGTAC 


TV ^« /"i m TV TV /'/'ffTl 

AGCTAACGCT 


rn i'"' 7\ /"*■ TV. rn n TV 7\ 

T C AG AT AAC A 


/"•rprpz-'Tv +.rprprp/-^ 

GTTGAtTl lb 


Abbbbb Tblb 




AGCTCATGAT 


TAAAAATCAC 


AAGATCCGCT 


TCATAGGCAT 


CTGTTAAtTG 


AATTAATTCT 


1 oon 


TCCAATTTCC 


CCTTACCGAT 


TACGGTTTGT 


CGATCGACTT 


GTGGTCGTTT 


TTbCbT TAAA 


ijtJU 


ACAAACACCA 


CTTCTCCATT 


AGCGGTTTCC 


/~* m m t\ t\ tv m rn /— ^ /™* 

GTTAAATTCG 


f~\ tv rrv rn rn rp r - * 

CCAGTTCTTC 


CATTGATbCA 


1 /i ^ n 


GAAAATGTCT 


GATAATTTTn 


TTCCGTTTCA 


ACT CC TACT A 


Tv ti rp tv piprnm 

G AAT C AC T T T 


TTCAT bAblb 


lOUU 


GTCATCGCCT 


ACTCACTCTC 


CTTCTGTTGT 


tv /^i tv rn m j* - * 

AGCCATTCCG 


CTACTTCTTT 


rp rp /~» Tv Tv /"^ rp rp 

TTCCAACTTG 


i ten 


GGcAAATCTG 


TTGGcTGCTG 


AACCAAATTC 


CACCAATGCG 


CAGCCATTCG 


ivmrnmpmTi tv tv O 

ATTTCTAAAC 


1 con 

X DZ u 


CAAGTCAATT 


GACGTTTGGC 


ATATCGCCGC 


GATTGCTGTT 


TGACTGTTTC 


tv tv r+r+f+r*r+ tv rn 

AACCGCCATC 


i con 


TCTAATGATT 


GTTCACCAGA 


AAAATACGGG 


TV TV TV TV t> m rn rn m 

AAAAATTCTT 


m tv m tv /~>/™»/^7v t\ m 

TATAGCCAAT 


7v / ^/-irprp/ , ~»7v/" , /-'rp 

ACCTTGAGCT 


1 Tin 
1 /4U 


GCTTGTGCAT 


GTGGTTGCTG 


AAACATCTGT 


TTTGCTTCTT 


^ ^1 TV TV TV /"^ 

CCAACAGCCC 


TTCTGTCATC 


XoUU 


ATTTGATCCA 


CCCGCTGATT 


AATACGCTCA 


TACAATAAAG 


CGCGATCTGT 


mmr>/*>Ti tv tv t~+ rn 

TTCCAAACCT 


10 bU 


AACAAATAAT 


AATCATAGAG 


ACGAGCTGGT 


TTTTCCTTCG 


/-* fry TV TV TV tv m 

GGGTCAAAAT 


TV /^rtl TV m TV 7\ y"l /"»/"' 

ACTATAACCG 


xy^u 


GTTTTATCAA 


AGACCTCTAA 


CGCACGAATC 


ACTTTTTTGC 


i~>Tl mm'AmrnTV tv tv 

GAT TAT T AAA 


tv JT'/ -, / , ^T\ rp/^/^rp^ 

ATGGATGCTG 


x you 


TTAGCCGCTA 


GAGGATCCTT 


TTGTTGTAAT 


TV TV TV ^ TV m TV 

AACAGCCATA 


tv rrPrnmr'Ti mm 

AGGCTTGATT 


CCCTTTTTCT 


9 n a n 


TCCGCAAATA 


AATTGTACGT 


TTCTCTAATT 


TCAGGCGAGT 


OTvnP/~"T'Tvrp/^ , rpO 

CATCTATCTC 


ACGAC 1 1CL 1 


*? i n n 
z xu u 


AGTTGAAAAT 


CATAAAGGAG 


TGATTGAATA 


rp 7V /"* TV tv T\ /^Orn^ 

TACAAAbb I b 


I XCCICCAAC 


bAl AA1 1 bbl 


91 fin 

Z X DVJ 


AGTTTCCCTT 


TTTCAGTAAT 


TTCTTTGATT 


TTTTGACGAC 


CCTCTTTCTG 


AAAAl CAbbb 


9 99n 

zzz u 


GCAGAATATG 


TTTCGCTGAC 


TTCACGGCAA 


T C AAT T AAAT 


TV/~ > ITl/^«T\(^ > / r ~ , A TV rp 

AGTGAGGAA1 


rn ^+ fjy rn rn rn rn rp 

1CC1 Ibi 1 1 1 


9 9Qn 
z z 0 u 


TCTGTTTCTG 


TGACTTTAGC 


GGTGCCAATG 


TCTAACbAbb 


y-< 7\ rn tv yv TV (~ifTl rp/-' 

GA1 AAAL1 lb 


P7A m PP A P m P TA 

bA 1 bbAb 1 bA 


9^40 


CCGCTAATAA 


TTTCACCATT 


AAATTTTTTT 


/— > /-1 rp TV 7> TV TV TV 

G C T AAAGC AA 


TACTTAATbC 


rn rp rn rn rp ^*/~* ^ 

1 bi 1 1 ibbbb 


9 4 nn 


ACCGCAGTGG 


rrtAAm tv ^> tv tv m 

GTCCTACAAT 


tv tv m t\ tv m tv /"•• rn 

AACTAATACT 


TTCTCCAATT 


ATGTTTCTCC 


ilbiibil bb 


9 a fin 


CAATTTTGAC 


GAGCAAATTT 


TGCTTCCTCA 


i^i /-» TV TV TV TV T> /-* TV /"^ 

GGAAAATCAG 


r PA r P/^ , T\ tv/~»tv 

TAT G AAC AC C 


PPPP7\ TV T mmr T 

CbbCAAl 111 


9^^n 


AGCTGAAAAC 


AATTTTCCAT 


TTCTTCCATC 


/*»/■-» Tv rp rp tv Tv Tv /"■> 

C GAT T AAC AG 


rp^/iTv orropr 1 

TCCACGGCCb 


P7\ PPPPTTTP 

bAbbbbi lib 


9 ^Rn 

4JOU 


CCAATTGTGA TAAGTTCTTT TTCATGAGCC AACACCCAAG ACATTTTTGG ATGTAAACTA 2640

TCGACAAGCG CGAATGCCGG TCCCGCTTGA CTTAAATCCT CTGCAGATTC ATAAATAAAG 2700

GCAATTTCTG TCTTAGGATC AGCTTTTTTT AATTTAACTA AACTTGGAAA ATAAAAACTG 2760

GAATATAAAT AACGAAACGG CCAGTTTTTT TGCTTCATTT GTTGGACGAT TTTTTTCTCA 2820

ATTCCTTCAT AATGGATAAT ATCGGTTTTA AGTTCAATAT TCAAAAAACC ATTGAATTGT 2880

TCTTCTACAA GACAATTTAA AACATCTTCC AATGTTGGAA C&CATTGAAC AGAAGGATTT 2940

TTCTGAAACC AGCTTCCAGC ATCTAGTTGT TTTAATTCGT TTAATGTTAA CTGTTGGATT 3000

TCACCATGAC CATCTGTCGT ACGATCCACT GTTTCATCAT GCATAACGAT TAAATAACCA 3060 

TCTTTAGATA GGTGAACGTC TAATTCAATT CCTTCAGCAC CAACACGAAC AGCCTCTCTG 3120 

AAAGCAATAC ACGTATTTTC AGGATGGGTG CCTTTACTTC CTCGATGTGC AATAATATCT 3180 

GTCATGCTCT CACTCCAATC TTGTTTATTG TACCATTGGA GTGAGGTGGG TTCAAATAAT 3240

GTAGAAAAAA GAAGCCTCTC ACAAATGAGG GGCTTCTTTT TAAGATATAA CAACTAACGC 3300 

ATTCCTGGAG CATTTTTTTC TGTAATATCA AAATTTACCG AATCAATAAC TTTGTTAGTA 3360 

TTACTATCAA TAATCACAAA GTTCAAACCT TTTTTATTTC TAGACATTTC TGTATCATTT 3420

ATTTTTACAG AAGCAAATGT TTGATTTTCA TAATTAGTAT AACTTGAGAA TACATCATAA 3480

TTTAAAGCTT CAATTTTACC AATTAGTCTC GATAATTCTG CATTTGTTTC AAATGTTTTA 3540 

TTATCTTTTG AAATCACTGC ACTTAAAGTC CACGCATTGT TACCtTCGAA TTTTACGTCT 3600 

ACCCCTAAGT CTTTTAAAAG CGACTCTTCC GCTTGTGAAA GATAGACTTG ATTACCATAA 3660 

CCACTTAATA AAATAGTATA TTGCGTATCA ATTAATTTTT GTAAATAATC ATTAAGATTA 3720 

AAAATATTCA TCAACTCTGG GTTACCATTT CTATATATAT TGTCCCAATC TTTTTCTGAA 3780 

ACACTTCGCA CAGACTCCAA GCCGTAATAT TTTGCAACAG AACGATTAAC TTGTGCATCT 3840

GAATTAGTTT TTACATGCGA TAAATGATTA GAATAAGCTG AATATTTTGT TGTATTACTA 3900

TATGATATAT CTGGAAACAC AGGATTGATA TTTCCTTTTT TCTCTTGCTC AACTAAATAA 3960 

CTATATTGTT TTGTCAAATC TAATTTTGAC AAGAATACAT CACTTACACC AACCATTACG 4020 

TTCATGAAGA ATGTACATAC CAAAATCACA TACAAAACAC TGTAAATTGG AGAAAACCTG 4080 

ATATTTACTT TATAATTTGG CAGTGACATA GAAAAAGCGA TAATCAAGAA TAATACGGCT 4140 

CCATAATATG AACGTCCGTA ATCTAATGCT GCTGGAGAGA TGGCTAAAAC TAGCATAGTA 4200

GCCATTCCAG AAATAAAATA TACAACACTT AGGTATACTC TTTTTTTATC TTGACTAATA 4260 

TAAGCATATA AAACAAGCAA CAAGATCAGT ATCAATAAAA ATAATGATAA GTGTTCTTTC 4320 

ATTTGCTCAA ATATTGTAAT AATTCCTGTC ATTAATTTTC TTGGTAAAGA CCAAGTACTC 4380 

CTAGCAAAAT AAGTAGCGCG AATCTTATTT CCTGGTGCTA AAGCCATTAA CGCTAAACCT 4440

ATAGTATTAC CAGCAACACC CGTAAACATC CATAAAGAAA GTTTTCGTTT TGTATAAAAC 4500 

TGCCAGCCAA GATACCCTAA AACGATAAGC AAAGCTCCTC CaGAGGTATT TTCATTACAC 4560 

CAACCAGCCA AAATTCCTAA AGCAAACATC AAAAAAATCA TTAATTTTGT TTTCGTAAAG 4620

ACGTGTTCTT CAATGATTTC TCTATGATAC ACATAAAGGA ACGAAAGAAT AATTAAACTT 4680

CCCCAAAGAT AATTAGCTGA GCCTGTCTCC CACAAAAACA CTTCTCCAAA GGCTGGGGTA 4740 

AACAACCAAA ATAGCACTTG AATAATAATA AATTTAAATG AATTATACTT TTTATCTTGC 4800

TTTTGTAAAG TAAGTCTATA AACTAAATAT GTTACTATTG TGTATGCCAA AGCATTAAAA 4860
ACATTAAAAA TCCCTTTTGG TAACAATAAA AATATTCTTG QAATTACATG GACCACTGAA 4920

CGTCCTGTCC ATGTCATGTA TTGATTGTAT TCATCTATAA GTATCGTCAT CCAACTTTTC 4980

GTTTTATAAA GGTATTCATA GTCATCTGCA AGTAATGGTG TTAAATTATT TAAAGTAAGC 5040

ATCACCAAAA ATGTAATAAT AATTGAGCAT ATAAATATCT TCTTTTTATT CAATTTAACT 5100

TtCATCTCTG CTTACCTTCC GCTTTTAGTT CATACTTAAC ATATTTATCC AAACCAGATT 5160

GAATACTATA TTTCGGTTCA TAACCAATGG CACGTAATTT rGATATATCC GCTAAGGAAT 5220

CTTkGATGTC CCCAGCTCGC GCTTCTTTGT ATTCAACTGG CAAAGTGACT TTCATAATAT 5280

CATTTAAAGA AGAAATTAAC TCATTTAAAT CGGTTGCTTT CCCAGTCCCG ACATTATACA 5340

CTTCCCCAAG TGATTGGTCA GAATGAGCGA CAAGATTCAA TGCTTGAACC ACGTCCTCAA 5400

CAAAGACAAA ATCACGAGAT TGTTTTCCAT CGCCAAAGAT ATTGAATGTC ACTTCTTGAT 5460

TGGCTAATAA GCGTTTGTAA GAGTCCATCA CAATCGAAAT AACGCCAGAA TAAGGCGAAT 5520

TAGGGTTTTG GTTTGGCCCA TAAACGTTAA AGAAGCGAAC AGCGCTTGTT GGAACATCAT 5580

ATAAATGACA ATAGTTTAAC ACATATTTTT CAGAAGCAAA TTTATCCACA GCATAAGGCG 5640

TTAACGGACG AATTACTGAT TCTTCCTGTT TTGGTAAAGT TGGCTCATCG CCGTAAACCG 5700

CTGCAGAAGA AGCAAACACC AAACGTTTTA AATCTTTCTG GTATTTCCGA ATTAATTCCA 5760

GTAATTGTAA CACACTCTCA AAATTCACTT GGTGTGTTTC CACTGGACGT GCTACTGAGT 5820

CTGCTACACT AGCAATCGCG GCTAAGTGAA AAATGTAATC GAACTGATAC TCTTGTAATA 5880

CTTTTTCCAT TAATTGTTGA TCAGTCACGC TACCTTCAAT GAACGTGATG TTTTCAGATG 5940

CGTTTAAATT TTCAGTTTGG CCCATTGATA AATCATCAAT GACCACAACT TGATTGTCTT 6000

TACTATaATa ATTTGCTAAG GTTGAGCCGA TAAAGCCGGC GCCGCCTGTG ATTAAAAAGT 6060

TCCTCAAGTT TCCCACTCCA ATTAAAATAA TACATTAATT TTACCATTTT TAGGTGTCAA 6120

AGACTACAGT TTGTTTTTTC AATAAAAAAG GAGAGAAAAT AACTTTCTCA ACCTTTTACT 6180

AAATATTTTA AATTCTTTGC GTTATTTACA AAGCTCTTAT GAATGAATAA GAAATAGATA 6240

ATATACACAA TTGCATAAGA AATGAAACTA TAAATACTAC CGCCAAACCA AAGATTACTA 6300

AAGATAAACA ATAATGTAAC CAACGTTTCC AATGCAGTGC TATTTCCTAT TTTAATATTC 6360

ATATCTTTGC ATAGAAAAAT TTCCGCTAAA TTACAACGAA ATGCTAAACT TACTAATATC 6420

AATCCAACAG TTAAATTCAA GTTCCCAACA ACGAAAATCA CAAACAACGA TAAGATTAAC 6480

GACAAGGCAA [illegible] GtTAACAAAT AAAATTGTTT TTTCTTTTCT CAACGTTTTT 6540

AAATACGTGT TAATTAGTAA AGACATCCGG CCTTCATAAA TAACTATAGG AAATAGAATT 6600

CCCATAAATT TCAAACTCTC GCTATATTCG GGTAACCACA TACCTAAAAC AAATTTGACG 6660

GGTACATATA ACAGCAGAAT TGCATACGTC AACGGAACAA AAACACCTCT TAATGTGACA 6720

AATAAAGACG GcAATCGCTC TTTATTTGTC CGTCTTAACA ACGGAAACAT TACAATTCCA 6780

ACAGCATTAA TAAAAGTTAA AAACATATTT GATAAaCTTA AAGTAAAgGm CAaTTTACCA 6840

AAGGTTTCGA TTGTCCAACG C7GT7GAACA AAAAAGCGAA 7GGTTCCCA7 7ATTnGCnTA 6900

CTTGCTATAC TACTGAGCAT 7AnT7nACTA CC7AT3TTAA TAITATCAaC 7AXTTCAGGA 6960

GTAACTTCCT TCAAACTCA7 CATG7TCACA CGCaACATGT CTTTAACTCT AAACATTCCC 7020

CAAACCGTAA TAATCAGTTT A3AAAGTATA TCCATAATGA 7TAGCCAATA AAAATCTCGT 7080

CCTCCTAATG CAAAGTAAAT 7GCAATGAAC AGGACATATA AATAGCGATC ATTTCTGGAT 7140

AATTGAGCAT ATTCCTTAAT ACGATTTGTT GA.CTGAAAA CATACAGAA7 AAACGT7TTT 7200

GCAATTGTAA TTACTGAAAC AATGGCCGTC AA7ATCAAAA TAATCCCTTG ATATTCTTTC 7260

GGCATGAAAA AGTAAGCCCA AAGAATGACA AAAGCAGAAA TACAACATTC AAAAATCGCT 7320

AAATACCAGA ATTGAGAACC GAGAT77CTT TTATC7AAAT TTTCGTATTC 7TCTCCCCCA 7380

ATTTTTAAAT AGATACCATC AATCCAGCCT AA3TGGAGAA ATCCAACATA 7GAAGAATAA 7440

AAAACATAAA GTTGCCAATA ACTGTATTCT GTAACGCCTA ACAACTTAGG GACAAATAAA 7500

TTTAAGAGAA CCGAAATACC CAACGTTGCA AAATT7GCAG CCACAGTATA ATAAAGGTTT 7560

TTAGCCACCA TTTTTGCTTT 7GAA77CATA GCATTATGCT CACTTTCTAA TTTTATATTT 7620

CATGATATGG TAGATATCTG 7GCTAAAATA AAACT7GTGA 7GATACTTAA AAAAGGCACA 7680

TAGAATCGGG ATTATTTTTT T7CTTTCTAA TAAAGAAACT CGCATATTCA ACTCTTTAAT 7740

CAAATTTGAA AGAACTTTGT C7TTA7ATTC AAATTCTTCA ACATGTGGT7 GATTATTTAA 7800

ATAATTCTTT GCAGCCTCTG TACGC7GTAA 7C7CCGGCGT AAAGTATTCA ATCTTTCCTC 7860

AAAGTGATTT TGTTTCTTAG C7AAA7CAAT 7TCTT7TTTA AAGGAACTTT GTCCATGTTT 7920

ACGCCATCTA ATTAATTCTT CCCTAACAAT ATAGAGCCCA TCACTTAATA AAGAACTTCC 7980

CCACATCGCC ATATCATGTA CAGGAT7TTC CATTTCAGAT GCGTATAGAT TAACATTGGC 8040

AATGAAGCTC TTTCTAATTG CATA7ACACA TCCCGGGCGA CGTAAAAATA AATTGTTCTC 8100

ATTAAATATA ATCTGTTCTG CTTT77GATT AGGCAATTTA ACTGAATCAA TTTTTTTTAA 8160

TTCCTCTGCT AATCCCCCAG GTTCTACTAA T7CAG7ATAA 7CACTTACTA AAACTTTAAC 8220

ATCAGTGTTT GCAAAAGCTT CACACA7CAC TTCAA7TTTA GATGCATCCC AAATATCGTC 8280

TTGATCACAC GTAAAAACAA TTTCTTTAGT AGCAGCATTC AATAGATTAA AGAAATTTTT 8340

GCGCCAACCT AAATTTTCCG kGTTGATTtG CAAaTGCcAC GTTtCACCTA AwTTAwTTTC 8400

AGAAaTAAAG TtCTCAATtA TTTtAACTGT rGAaTCyGTA GrGCAGTCAT CCCAAATTAA 8460

TACTTCATCA ATTT7TTTCG ACTGA7TTTT TAAGGAATCC AACTGTTCTA AGATATTTT 8519

(2) INFORMATION FOR SEQ ID NO: 107:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1508 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:
TTTTGGGGGG ATAATCAAAT TCCCCTGnGT TCCGCCACAA GCCACACAAT GATGCCcATT 60

CTGGTCGTAC ACCAAACCGT TCTAACAATT GAATTTCAAA AATATTTGTA ATGATTTCCg 120 

GATCTTTTTG CTCaTTTAAA GCGGATAATG CCATGTGGGT AAATTGAAAT AAGTTCGGAT 180 

CATAGTGCTG ATCTTCAATT GCTGCATCGA CTAAGTTTAA AATATATGTT CCATATGCGT 240 

TAATAAATAT GTCTTGTTGA ATGTTTCGAA AAGGTTGAAC TTCTTTGCTG CTATTGAGAA 300 

AAGATAAGCC TTCTTCTCTA AAGTTTCCGA TGTAGACGGC TTCTGTAAAT GGCAGAATCG 360 

CTGGCAAAAG AGGATTATTT TTTCGATGCG CACCTTTCAC AAAAAACATC AATTTTCCAT 420 

AGGATTCTGT AAAAATTTTG ACGAGTTTAT CCTTCTCTTT AAAATCCTTC GTAAATAAAA 480 

TAATGCCTTT GGTCTCCCCT AATGCCATTC AATTCACTTC CATTTCTTAT GTAAATTAGT 540 

GTAGTGTTTG GAAAGTTCGT TTATACACTC GCTTCCCAAA CACTACTTTC TTAATATTCT 600

TCTTTGCGAT AACCAAAATC TTGTAAATAG ATTTTTTTGT CACGCCAGTC TTTTTGGACT 660 

TTTACCCACA GCTCCAAATA CACTTTGTCA TCTAATAAAT ATTCAATATC TTGACGCGCC 720 

TTGGTACCAA TTTGTTTCAG CATTTTTCCG CCTTTGCCAA TAATGATACC TTTTTGACTG 780 

TCCCGTTCCA CAATAATGGT TGCTTGGACA TGGACTTTAT CGTTTTCATC CCGTTTCATT 840 

GAGTCCACAA CAACAGCGAC AGAATGAGGa ATCTCATCTC TTGTTAAAAG TAACACCTTT 900 

TCACGAATTA GTTCAGAAAC GATAAAATAT tCTGGATGAT CCGTAATTTG GTCaTCaGGG 960 

AAATATTGTG GTCCTTCTGG CATTTGCtCA ACGAGAACAT CCATCAAATG CTsCAAAATT 1020 

ATTTCCTTCA GTGGCAGAAA TGGGAATAAT TTCTTTAAAG TCCATTTGTT TCGAATAGTC 1080 

TTCAATAATA CTCAATAAAT CATCTGGATG GACTTTGTCA ATTTTATTAA TAATCAAGTA 1140 

GACTGGACTA TTATTATTTT TCAATCGTTC AATAATAAAG TCGTCACCTT TCCCGCGTTT 1200 

TTGATCAGCA CTTACCATAA ATAAAGTAGC ATCTACTTCA CGCATCGCAT TGTAGGCACT 1260 

TTCCACCATG AAATCGCCTA AGCGGTGTTT AGGTTTGTGA ATCCCTGGAG TATCGATAAA 1320 

AACAATTTGT GCTTCAGGAA TTGTATAAAT TCCTTGAATC TTATTACGAG TTGTTTGTGC 1380 

TTTGTCACTC AtAATTGCAA TTTTTTGGCC GACAATTCGA TTTAACAATG TTGATTTnCC 1440 

AACAnTTGGT CTGCCGACAA TGGGCTACGA AACCAGATTT AAGGTTCAGT TGTCATGGAA 1500 

TnCGCCGC 1508 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
ATGAAGTATT CAAAAGGACG GTACGTCCTT GTAGTCTTAG TAATGGTCTT GATTGCTTGG 60

TTAATTTTTA TTTTGTCAGG TTTAGCGAAC GGTTTAGCTC AAGGAAATCG TTTAGCCGTA 120

GATCAATGGC AAGCGAATCA AGTGGTTTTA TCAAAAGAGG CCAATAGTAA TTTAAATGTA 180

TCAGTGTTAG ATGAAAACGT GAAAGAAACG ATTTCAGGAG GCAAAATTGC ACCGATTGGT 240

CAACAATCTT TAGCCATCCG CCCAGCAGAT GATAAAAAGG CTGAATTAAC GAATGTTAGC 300

TTATTTGGGA TTGAAAAAGA AAGTTTTTTA ATGCCAAAAG TGATTGAAGG AAACGCATTT 360

ACTGATAAAA ATCAAGTGAT TGCTTCaGAA ACATTGAAAA ATCAAGGATT TAAAATTGGC 420

GACAAATTAA CTGCAGGGAA ATATGATGAG CAATTAGAAA TTGTCGGCTT TATTTCTAAA 480

AGTAGCTATA ACATTGTGCC AGTTATTTAC ACTTCTCTGG ATACTTGGCG GTCAATTAAA 540

TATGGTGACA ATCCAGCAAT GGCCAAAATG GTTAATGGTT TCATTGTTCG CAGCAAGGAC 600

AACACGGAAG TTAAAACGAC TAATAAAGAC AGCCAAGTTC TTTCAATTTC AGATTTTATT 660

GAAAAACTGC CAGGATACAG CGCTCAAAAC TTGACCTTGG ATGGCATGAT TTATTTCTTG 720

ATTGTGATTG CAGCGTTCAT TATCGGTATT TTTATCTTTG TGATGACTTT ACAGAAAACA 780

GCAATGTTCG GCGTCTTAAA AGTTCAAGGG GTGCCAACTA GTTTCTTAGC AAAGGCAGTC 840

ATGCTACAAA CCGCTTTATT GGCTGTTCTA GGAGTGGCGA TTGGTCTTGC TTTAACGGGA 900

ATTACCGTGC TCTTTTTACC AGAAGCGATG CCTTATGCAA CTAATGGACC ACGCATGATT 960

TTGTTTAGTG TACTATTGAT TTTATCTGCA TTAATTGGCG GGGCATTTTC AATTCGAACG 1020

ATTGCTAAAA TTGATCCGTT AATCGCGATT GGAGGTTAGA GACATGGCAA ATGTTTTAGA 1080

AATGAAAAAT ATTTATAAAA AATATGGTGA AAAACATACA GAAGTGATCG CGTTAAAAGA 1140

ATTATCTTTT GCGGTTCAGC CAGGTGAATT TGTCGCAGTA ATTGGTCCTT CTGGTTCTGG 1200

TAAAAGTACC TTTTTAACGA TTGCAGCTGG TTTACAGGCA CCGACAAGTG GCGAAGTCAT 1260

TGTTGGCGGG CAATCACTAA ATAAGTTAAC GAAGAAGCAA CGTTTGGCGC AACGTTTTCA 1320

AAAAATCGGC TTTATTTTAC AAAGCTCTAA TTTGGTGCCA TTTTTAACAG TGGAAGATCA 1380

ATTTCACTTA ATTGAAAAAG TTGATAAGTC ACGTAAAAAT AGTGAATTAA AAGAGCAATT 1440

GTTAGAGACG TTGGGTTTAA AAGAATTACG AAATAGTTAT CCTCGTGATC TCTCTGGTGG 1500

GGaAAGACAA CGGGTAGCCA TTGCGTGTGC ATTGTATCaT GAGCCAGACG TAATTTTGGC 1560

AGATGAACCA ACGGCTAGTT TAGATACAGA AAAGGCGTTT GATGTTGTCA AATTACTAGC 1620

GAAAGAGGCC AAAGAGAAAG ATAAAGGGAT TATCATGGTG ACACATGATG AGCGTTTGTT 1680

GAAATACTGT GATCGAGTGG TTCGTATTCG TGATGGTGAA TTGACAGAGT AACAAAACAA 1740

ATAAAAGaGA ACATGATCGG ATTGCTGGAG CAATCCGATC ATGTTCTCTT ATTTCTTTTC 1800

AATACAGAAA AGAACAGGTG GCTGATTTTT TTGATTGATA AACTCATAAC GTAAGACACT 1860

GTAAACATCT TGTGGTAAGG CTTGCGTGTA ATTCGTGACC AAATCAAGTT CCTCTGAACC 1920

ACCTTCGTGG CCATAATAAA CGACTAAAAT AATGCGACTT CCTTTTACCA AATGAGGCAG 1980

TAACCCATCT AACGCCTGTT TAGTCGTAGT GGGTTTTGTA ATAATTTCCT TATTACTTTT 2040

AGGTAAATAG CCTAAGTTAA AAATAGCGGC AGTGATTTCT GTTTCTTCAT CTAGGACAGC 2100

ATGAAGATGT TCATGTCCTA ATGGAAATAA CGTAGTTTGT GGGAGTAAAT TAAGTTCTGT 2160

TAACTTTTGT TCTGTATTGA TCAACGCTTC TTTTTGGATA TCAAAGGCGA AAACTTCACC 2220 

AGAAGGGCCA ACCAGCTCTG CTAAGAATGC TGTATCATGA CCATTTCCCA TTGTTGCATC 2280 

AATGACGAAG tCGCCAGGTT CCACCACTTC TTTTAAAAGT TGGTGACTAA AATGTAGGGC 2340 

TGTTTGTAGC ATTAATTGAA AACTTCCTTT CCTGAAATGA CAGTATATTT TCCTTGATAA 2400

CTGTTGCGGC GTTTCATTTC CGCATCAATC GCATTGAGAA CTTCCCATTT TTTCAAGCTC 2460

CACATCGGTC CGATAATTGT TTCAAAGGGA GCATCGCCTG TCAAGCGATG AATAACAATT 2520 

TCGGGTGGAA TCATTTCTAA CTGATCACAA ATCACTGAAA CATAGGCTTC TTTGCTCATC 2580 

AATTGTAAGC GTCCTTCATT ATAATCCCGC ATCATTTTGG TATTTGTCAT TAAATGAAGT 2640 

AAGTGCAGTT TAATCCCTTG GATATCTGAA TCCTGAATAG TTCGCCGCAC ATTTTCGCGC 2700 

ATCATGGCAG GTGTTTCCCC AGGTAGCCCG TTAATTAAAT GTGTACAGAC CCGAATCCCG 2760

TGTTTCCGTA ATTTAGCCAC ACCATCTAGA TATGTTTGAT AATCATGAGC ACGATTAATG 2820 

GCAGCGCTAG TTTCCTCAAA CGTGGTTTGC AAACCCAACT CTACCCATAA ATAAAAGCGT 2880 

TCATTTAATT CGGCTAAATA GTTCACTACC TCATCAGGCA AACAATCAGG ACGGGTACCA 2940

ATGGATAAAC CAACGACACC TTTTTCATTG ACTACTTGTT CAAACCGATG ACGAATAACG 3000 

TCAACGGGCG CATGTGTATT GGTGAAATTT TGGAAGTAAA CAATGTATTG ATCGACAGTT 3060 

GGCCATTTTT GATGCATCAA TTGGATTTCT TTTTGAAATT GAAGCGGCAG GGGATCGCTA 3120 

GGTGCAACAA TCATGTCGCC TGAGCCAGAA ACACTACAGA ACGTACAACC ACCTTTGGCA 3180 

ACCGTCCCAT CACGATTGGG GCAATCAAAG CCACCATCAA TTGGCACTTT AAAAATCTTT 3240

TCACCAAACT GTTGACGAAG TGCATAGTTC CAAGTGTGAT AGCGTTTATT TGGATCATCT 3300 

GAATAGGGAA AAATTTGCAT AATCAAGACT TCTTTCTAGC GTTGTTTGCT TTGAAATCGC 3360 

CAAACAAAGC GTTTTTCTAA ATAAATAGGG TAACTTAATA ACAACCAAGC TAAACCAAGA 3420 

CAAAAACCAC CTAAGATATC GCTAGGGAAA TGGACACCTA AATAGATTCG ACTAACCCCA 3480

ATCATGAAAA TCCCAATACC TAGTAAAATT TGCACACAAA GGCGGACCGT TTTATTTTTA 3540

ATGAAAATTG GCAGAATCAA TAAAATAGTG CCGTAGAACA AGGTACTTCC TACAGCATGT 3600 

CCGCTAGGAA AACTATACGT ATGTTCAGTT ACTAAATGCT CCAAAGTAGG ACGTTGGCGC 3660 

ATAAAGAATA ATTTTAATAA AGAATTACCT ACGCCAGCAA TTAAGGCAGT ATTAATAAAG 3720 

AGCCATAATG CTTCAGCATA ATATTTCCAA ACAATAAACA GAGCAATGAC AACGATTGCT 3780 

AAGATGATAA TCGTCAATGA ATTGGCAAAT TTGGTGTACC AAATAAAAAA GTTATTTAGC 3840 

TGTGGAAAAG GACGACGAAC AAAAGAAGTA ATCGTTTGAT CAAAGCCTTT GAGCCAGGTT 3900

GGATAAAAAC GAACAACATA GCCTAAAAAC ATAAATACTA AAAGAAAACA ACTACCTGCA 3960 

AATTGATAAT AAAGTTTGTT TTTCATAAAA AATCCTTTCA AAATTTAATC GCAACAATCC 4020 
A tp a a a appp

Al CAAAAubb 


lA/WiAAl In 


aaaf^ATAATT 


TCATTGCCTG 


ATTTP»GAP,r.P. 
mix uunouo 


4 U O U 


PATAPAP.ATA 
bn 1 At-Ao A 1 A 


TTAAaaaaP+- 
1 lAAAAAAb t . 


TPTPPPappa 
J. C J. bbbACLA 


pa aTPTPTPA 


TTGTGGTGPP 


APmAPAP,aTP 


414U 


1111 rt/A^w 1 uo 


PAPATPTTTB 
^ AL. A lul 1 1 A 


ttpbttbtpa 

1 1 uA 1 1 Al L.A 


TP ATPAfrGAG 


TATCAAPPPT 


A AT ATPTPPP. 


1 U VJ 


AA1 l CbCl Ab 


B a RPPTTa R R 
AAAbb I 1 ArtH 


PTPTPTaTPP 
C 1 L 1 b 1 Al LL 


ppap^app at 


TPT T AT AAPiA 


TPTTPPPTPT 
loll V^OO 1 1 


& 9 fid 


TT^TTTR A A R 

1 1 b 1 1 1 AAAA 


PTTPa aPaTP 
ul 1 b AALA 1 L 


PPPTaaTTTT 
U^lnni 111 


(^ATTPPPPTT 


TRATT^IAAAT 

x \3r\ 1 x Ortort 1 


P.TTAPf;PAAA 


4320 


T\rprprn/~>rp/->/~'7\P 

AT X 1 C 1 LtAb 


r A r P rnr nAPA r nA 
AA 111 A 1 A 


a aTaTPTPpa 

Hn 1 A 1 L 1 LL A 


TPPAPAAfZAP, 


Pf^APA^TTAA 


TTTTTT APPP 
11111 Inuyv 


4 380 


a p Ti p p r p p p A 


tptptrb apa 

1 L 1 L 1 AAAb A 


aa a aTPappa 


TTTTPAPTTP^ 

1111 \*rW* 1 x \2 


AAAAPrTTAHP 


TTPTPPTTTP 
x l ^ x v^\-> x x i o 


4440 


AAATC A I Ab A 


LbL 1 AAi AIL. 


P P P A T P T P T P 

bLCAlCiblC 


prpTPTTPPTT 
ol lull LL 1 i 


PPATPTTTPP 
LbA lul J. 1 LO 


APP APP ATTT 


4 500 


TGCAL 1 CGAA 


1 A 1 CACC AG A 


rptaptpp'pr 
Ab 1 Abl Cb 1 A 


TTTTTPIiTTT 
i i 1 1 1 o A ill 


p a tpp a a a t a 

Vrf A 1 Lbnrt/i 1 n 


PPPTPa ATPP 




AACGTTATAr 


CGCCG 1 LAG i 


GG 111 C AAC 1 


f~* rp rp ^ rp rn rp rp 

Gl 1GL1G 1 1 1 


rp a r^r'nms. apt 
1 ALLbbAAL 1 


a TrrrrTr r r 

A 1 LLbL 1 bAA 


4 G9f) 


AGGTAGA1 AC 


LA I L I b AAb A 


Gl 1AAGCG1C 


A T 1 T* T 1 rp rp rp rp /"> 
Alllllll LL 


rtTTT R R PRPP 
C 1 1 1 AALALL 


ppaTapapTa 

LbAl AbAL 1 A 


4 fiflD 

4 DOU 


t\ 7v TO TA T\ TV 7V TO 

acataagaat 


TTGGtAGATT 


7\ /~Tprp/-«rprnTv tv rp 

AGTTGI 1 AAI 


1LG1C1ACG 1 


lAAAAbLAl 1 


/"•AATfAAAr^T 

L AA 1 bAAAL 1 


/IT 4 C\ 


GGTGAATGGG 


TTGAGLLA xC 


AA 1 C AC 1 AAC 


TO TO TO rn z*'* TV 7A *^ 

1111 1LALA1 


1 Al lAbbAAl 


PTPPAPaPTP 
L 1 bbALAL 1 L 


4 ft on 


ACGTTCGGTG 


rn s~> -n A ^ rp /""» Tv pni 

T G AAG T C AC T 


AAAAA1 ACCA 


ATA TT" A A A r" A 

A 1 A 1 1 AAAG A 


nTmPB R rnrn/-> 

A 1 bAL AA 1 1 L 


PP a TTPPTTT 
LbAl ibbl 1 1 




TGTGTAbGAG 


TGGTATTGAT 


GG1TGC1G1 1 


T\ tojTO fj> TO 7V T TO 

Al 1 1 1 Al 111 


TTTCTTCTTT 


pRPATPTaap 

bAL A 1 LI AAo 


4 Q9fJ 


GTGCCI I CAG 


CI 1 1 ibAAl A 


AAAGGCA1 1A 


1 1 LLo 1 LA 1 U 


PTa appppaT 

L i AALLLLA 1 


TTTPTTaPTa 
1 1 lul 1 nL 1 A 


4 QRD 


TCAGTTTTAT 


Tv tv Tirp^-z-rpA rn A 

AAATGG x A 1 A 


AGG TCC 1 1 1 G 


PPTTTTA APP 
LL ill 1 AALG 


aTa aPTPanp 
A 1 AAL 1 b AAb 


T"T' P I 1 T 1 f~* P P P T 
1 1L1 1LLLL1 


1 

^040 

J \J H \J 


GCTTGTTTAT 


TTTTTAGTGT 


7\ TO 7\ T\ ^ 

A1AAC1 1 1 C 1 


1 1 InvjLbulo 


rn /*> a P'TI A rp rp rp 

1 urtL 1 on 111 


TTPPP.PTTP.P 
1 1 b^bu 1 1 oU 


~J ±\J\J 


Tib 1 AA1 AA1 


AbACbbt LjCC 


AG A 1 CCACCA 


ATT a pp a TPP 
Al 1ALLA1LL 


pTaapAPapp 

L 1 AAb AL Ab L 


aaTPPaTaap 

AA 1 bon 1 AAo 


ST fiO 


AGAA I 1 b 


TTGTTTTTTT 


Ca 1 1 AGAl Al 


pappTPi^Taa 

L ALL. 1 LLr 1 AA 


aapappa a TP. 

AALAbLAAi kd 


TTPPaTTPPa 


S??0 


A A T» A ti /"p f»rpm 


A A APAPATPT 1 

MAoALA 1 b 1 


LL 1 o/iftAn 1 1 


TTaaaaTTPP 

1 1 n/Wil loo 


tap a a ap APT 

1 nunnhvv/iu 1 


a APP A APP A A 


5280 


TGAI IbAlbC 


ACCAAA 1 AAA 


nnbnvjHL 1 LM. 


TapaTaaTTP 

1 ALA 1 AA 1 1 L 


pa a a a app a a 

LAAAAAbbAA 


PPPPP AT TTP 
oCLLLAl 1 lb 


S^40 


f~»rp r-p /"•/'■"•/"* rp j\ * m 

C 1 1 Gbb I AA1 


AAC AC 1 A 1 AA 


TV rp _ r>r" A rp a A R 

Aid LL A 1 AAA 


ttpptp a a a a 

1 1 bo 1 uAAAA 


Taa aPTTPPa 

lAAAbl loLA 


PPTPPPPPA A 


C 1400 

J *J U \J 


1 bbluALAALL 


uALu 1 nCA 1 1 


a apppa aTpp 

AALLLAA 1 LL 


pTaaa at a ap 


PPAAAAPATP 


A arir AAAA AT 


54 60 


tt a a a p a p a a 

1 1 AAALAuAA 


a aTappa app 

AA 1 AL L AAbb 


aPTTPPaaaa 

AL 1 1 oLAAAA 


ATPPP.P.TTAP. 


TP.PPPP.ATP.P 


TPTPPTPPTT 

1 O 1 OO 1 V^O 1 1 


5520 


PATAPTATP.P. 
on x no 1 n 1 uo 


APaaTpaTT a 

nonn 1 k^n i in 


TpaTap.Taaa 


ATTPAHATTP 
nil \^nun l lv/ 


TTPTTPATAP. 


P.P.ATP,ATPAT 
oon l on l ^n l 


5580 


aap.papp,tp.t 

nnuv^nuu lul 


AP.PPPPTP.PA 


aTTTPTTf^PP 


A^PPATTATT 
nvjvyVvnl xrxx x 


TTP A ATTTTT 

1 IVyfUll 1111 


PP1TTPPP1TPA 
vso l l v^>b>o x L^n 


564 0 


ATTCAATGCC TTGTTCTTTC AAAATTTCTT CTGCGATAAC CCGTGGTTTC CCCAATGATT 5700

TTGCCACTTG TTCTTCTGTT TCGCCTTCCG CAACACGAGC ATCAAATAAC GCCTTGTATT 5760

TCGCAAGAAT AATAGCTTGA TCTTTTGGAT TTAATGCTTT TAAATAAATT TTCAGCTCAA 5820

TAATAAAATG TTCTTTATTC ATTCCGCTCC CTCTTTCTAC ACGATTCATT TCTCATCTTA 5880

AACTATAATT TCTTTTTGAG CGAATAACAA GAGGAATT 5918



(2) INFORMATION FOR SEQ ID NO: 109: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:

CATACCCGCT CAATCATGAG GAGACTGGCT AAAAGTAATC CTAGCGTTTG CATCTTTTTA 60 

TTTTTATTAA AGAAGTGCCC TGCTAACCAA CCTTGTCCGC CATGAATCAA TAACGAAAAA 120 

ACGAACCATT GTGGATAGCC TGACAATAAA TCAATTAAGC CACCGCTTAA AGCTCCGACT 180 

GTCAAGCCAC CGATCGGACC AAAAAGTAAA CTAGCAATGT AAATCCCTGC ATCCGCCAAT 240 

GTAACAAAGC CATTTGTTGC TGGGACGGGG ATAATAAAGA CCATTGACAC AGCCACTGTT 300 

AAAGCGGTAA TTACTGCATA TAGACTAATT TTTCTTGTCA TTATCCGCCA CACCTCCTGC 360 

TAGTAATCCT TCATGCCACA CACTTCCAAA AGCTGCATTT AGCTGGACCC CAGAAACAAT 420 

GGCTTCATGA ACGTATGTTT TTGCTAAATC AACTGCTTCA ATTAACGGAT ATCCTTTAAC 480

TAAATTCGCA CCAATTGCAG AGGCAAAGGA ACAACCAGCG CCATCAATGG TGTGGGCCAT 540

TACTTTTTCT TTGACCAATT GAATATGTTG TTGTCCATCA TAAAAAAAGT CACGTGCTTC 600 

TTGACCAAGC CAACGATTAC CACCTTTAAT CACCACCGTT GCCGCTCCTA AAGAGTTTAA 660 

CTCTTCCGCC GCTTGTTGGA CTTCTTCGAC GGTTGTTAAT TTTCTGCCAA TGAGTAATTC 720 

TGCTTCTTTT AAATTAGGCG TTATAATCGT TGCGAATGGA AAAAGTTTTT CAATCAATAA 780

CTGCATGTAT CTTTTTTCAT AAACTGAAGT CGTTTCTTTA AAAGCCAAGA CAGGATCCAA 840

CACCACCGGC AGACCTGCGT GCTTTTTCAA AAAGGTTACA ATTTGTAACA TACTTTCTTC 900 

ACTGTGAATC AAGCCAATCT TGATGCCGTC TAAACTCCCG GcAGGTATGG AAGCGAGTTG 960 

CTCTTTccAC CAAAGAAgTC TCaAGGTCAT GAATCTGAAA ATGATTCTTG GGGGCTGTAG 1020 

CAAGGCAAGT TAAAGTTGAT AAGCCAAAGA CTTGATGATT TTCAAACGTT TTTAAATCTG 1080 

TTTGAATCCC GCCACCTGAA AAAGGATCGG AACCGCCAAT TGTTAAAATT ATTTTTGTCA 1140 

TTTGTTTCGC TCCTTTATAC ACTCAGTTAT CATACAAAGT TTTCTAAAAA GAAAAAACAG 1200 

CCAATTGGCA GATTTTTAGC TCAGCCACTT TTTAAGTCTT TCGTTATCTT TTTCCTTTCC 1260 

AAAAAAAGAG TGACCTCATC CGTCAATGAA GTCACTCTTT CTTTGGATTT AACACTTATT 1320 

CGTCAATGGC TGTTACATCA ATGTTTAATT CAAACAATTG TAATGGAGAA ACTAAACTTG 1380 

GTGCGTTGTT CATTGGGTCA ATTGCTTTGC CGTTTTTAGG GAAGGCAATC ACTTCACGAA 1440

TATTTTCTTC CCCAGCCAAT AGCATTGCTA AGCGGTCTAG CCCTAAAGCA ATCCCTCCAT 1500 

GTGGCGGGAA GCCGTAATCT AATGCGTCTA GCAAGAAACC AAATTGGTCT TGTGCTTCTT 1560 

CTTTTGTAAA ACCTAACGTT TCAAACATTT TTTCTTGTAA TTCTCGTGTA TGGATCCGTA 1620 

ATGAGCCGCC GCCTAATTCG TAGCCGTTTA AGACAACATC GTAAGCTTCA GCATACACAG 1680 

AGGCTGGGTC TGTTGCTAAG CGGTCCACAT CCTCTGCTTT TGGTTGTGTA AATGGATGAT 1740 
GAGCAGAAAC ATAGCGACCA GCTTCTTCAT CATATTCAAA CAACGGCCAA TCCACAATCC 1800

ACAAGAAATT GAATTTCGAT TCATCAATTA AACCTAATTC TTTACCTAAA CGTGTACGTA 1860 

CCGCACCTAA AGCAGCGGCT ACGATTTCAG GTTTATCGGC ACCAAACATT AAAATATCAC 1920 

CGACTTCTGC ATTGGTAGCA GCAATTAATT CATCACTAAC TTCTGTTAAG AATTTAGCAA 1980 

TCGGTCCTTT TAAGCCATCT TCTTCAACTT TTAACCAAGC TAAGCCTTTT GCGCCAAATT 2040 

GACTCACATA CTTACCTAAG TTATCCATAT CTTTACGAGA ATACTTATCT GCGGCGCyTT 2100 

TTGCATTCAA AGCCTTCACA TGTCCACCAT TTTCCAACGC TGCTTGGaAA ACTTTGAAGT 2160 

CTACGTCTtT CACTACCTCT GCAACATCAA TTAATTCCAT AGCAAAACGA GTGTCGGGTT 2220 

TATCGyTACC ATAGCGTGCC ATTGCTTCGT CATAGCTCAT TCGTGGAAAC GGAACAGArA 2280 

TCTCAATTCC TTTTGTCTCT TTCATCACTT CAGCCAACAT ATTTTCTGTA TATGTTTGAA 2340

TTTCTTCTGG 2350 
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

TGAATTGGCC TCGATGACAG GTCTTCTACG AGGATACGTG CAACAATTAG ACGTCCCTGA 60 

ACAGCGTGCA CTAACAGAAG AACTAGCCAA AATTTGCGAG TTAATCTACC ATGTCAATCC 120 

AACGACCCGA ACAAAATTAA CAGTCACAGA GGACGAAATA GCCTGGCTTT TGGAACGTGT 180 

CAATGCTATG AATGAACTGA CATACGAAGA AAATCGTCCT TTTGTTTTAC CGATGGGCAC 240

AATCTGTTCC TCTTATGCAC ACATTTTAAG AGCCAAAGCC AAAGATATTG TTCGTTTACT 300 

TTATCGCATG GACTATGGTG GTAAAAAAAT TGATCCGCAG CTGTATGATG TAGTCAACTT 360 

ACTCTCAGGT TACTTTTTCA TGTTAGCCTT GTACCTAAAT CAGCTCGAAA ATGGTGAAGA 420 

AGTTCCCTTT GTTTCTCGCA ACTATTCTAT TTAAAAAGTA CACCGACCGC TAACGACGAA 480 

TTGCTAGCGG TCGGTGTACT TTTATACAGG TAAGTACGCT TCTCCTTTTG CAAATAGTTT 540

AGCAGTACCA CCAATAATGA CTCGATTTTC TTTCACTTCA CAAGTTAAAA AGCCGCCTCT 600 

TGGGGACACT TGATAGGCCG AAAGGGTCGT TTGATTGAGC CGTTTTCCCC AATAAGGAAT 660 

TAAATTGGCG TGCGCAGAAC CACAAACTGG GTCTTCGTTA ATTCTTAGTT TCGGGAAAAA 720 

AGTTCGTGAA ACAAAATCCA CCGAGTCTCC TGATGCCGTT ACAATGACTC CTACTCCTAA 780 

ATCCAATGCT TTTAATGCAG AAAAATCTGG TGTGATTTTT GCCACTGTCT CTTCGTCTTT 840

TAAGACAAAA AATAAATCGC GACCCAAATA AGCTTCATAA ATTTTCGTGC CTAAAGCGGC 900 

TTCATACTCT GGCAAAATTG GAATCCTTTC CGGTAGGATA TAAGGAAAAT CTAGATAATA 960 
GTATTCCTCT TTTTTGGTCA CTGCGAGCGG TCCACTTTGA CTAGTAAAAT GGAGTGTTTC 1020

TTCCGCAACG GAATAATAGT TAAACAACAC AAAAGCCGTC GCTAATGTAG CATGTCCACA 1080

GAGATCGATT TCTCGTTCCG GCGTAAACCA GCGCAGGGCG TAACTTTGTC CCTCCTTGAC 1140

CGTAAAAGCA GTCTCTGACA AATTATTTTC AATCGCAATG TTTTGCATCA CCGCTTCTGG 1200

CAACCATTTT TCCAAAACAT AAACAGCGGC GGGATTGCCT TTAAAAACTT CCTCCGCGAA 1260

AGCATCGACA ATATAATAGG GATAACTCAT TTTAATTCTC TCCTTGTATT TTTTGATAAA 1320

TGTATTCCGC GCAACACAAA TCCTCAACTG CTAAGCCAAC CGCATCAAAA ATCGTGCGGC 1380

CTTTCTGCTC CGCCTTTAGT TTAACTTTGC CAGTGACTAA TTCGCCGAGA GAACCTACAA 1440

TTTTTTCAGC TGAAAAGGCA CCTTCTGAAA TCGGAATTAA AAGATCCCCG CTTTCTTTTA 1500

ATGCAGCCGC ATACTCATCT ACAAACACTT GACTGTTTTT AACAAGGTCG CTTGTTAATT 1560

CTCTGGAAGT CGGCGTGAAT GTACCAATTG CATTGATATG GGCATTCGGC AAAACGTCTT 1620

TTTCTTCTAA AAATGCGTCC TGACTGGAAG TTAGTGTACA GATAATTTCC GCTTCCTGGA 1680

CGGCTTCTTG GACAGAAGAA CAACCAATAA AGGTACATTC GGGGTAATTG TGCTGTTGCT 1740

TAGCGATTAA TTTTCGTGCA CGTTCTGCTG TGCGGTCATA TATAAAAACC TTTTCAATCG 1800

CCCGAACGCT TTGTAGCGCT GCGAGATGAG ATGAAGCTTG TTGACCTGCA CCAATCAAAG 1860

CGATACTTTT TGCCTTTGGA GCTGCTAAAT AATCGGTTGC CAAAGCAGAA ACAGCTGCCG 1920

TGCGAAsCCA CGTAATCGCA TTGGCGTCCA TCATCGCGAC TGGTCTTCCA GTAGAACTAT 1980

CAAACAATAA AATTTGACCA AGATGCGAGG CCAAATTTTT TTGTGGATTT TCTGGAAAAG 2040

CTGTAATGAT TTTTGCCCCA AAAAAGCGAT TTTTTCCTAA GTAAGCAGGC ATTAATGCAA 2100

AAAGATTTTG ATTTTTCCCA TCTGGTAAAA CCTCTACATG GCGCTCACTT TGCGAAATTT 2160

TTCCTTTTTG CAACTCTGCA AAGCAACGTT TCATCACTTG AATTGCTTCG GAAAACGACA 2220

ATTGTTGAAT AACCCTCTCA TAAGAAATAA TGTCCATCTG AATTCCTCCC TTTGCGCTAT 2280

CATAAACCAG ACAAACAATA CACTCAATGC TATAATTGTA TGGAATACAA TTTTATAGAA 2340

AGGAAGTTCC CCTCATGCCG ATTAATTCCT ATGAGAACTA TGTGTTGTCA TGGCGCCCTG 2400

TTAAAGAACG TTTAACAAGA CCTTATTACC AATCATTGGT TCAACAATTA GAAGCGGATA 2460

TCTTATCTGG TGCGCTTCAA AAAAATGTGA AATTGCCTTC GCAAAGAGAA TTAGCTGATT 2520

ACTTGGATCT GAATTTCACG ACGATTGGTC AAGCCTATAA ACACGCTATG GAAAAAGGCT 2580

TACTTTATAC CAACATCGGT [illegible] [illegible] CACTCAATTA 2640

CAATCTCTAC CAACCAAGTA GCGGACCATC TAATTGATTT AGGGCTGGTT AGTTCCTTTG 2700

ATATGTGTAA TCAGCGGATT TTACCTTTTA TTGAATCCGT TAGTAAAAAT GCGGCCCTTA 2760

ACAGTTTGCT AAATTATCGC GATCCTTTAG GTACGCACTT TCAACGAGCA ACCGCTGCCG 2820

AATGGCTTCA GACACAAGGC GTTCGGACCA ATGCCGAAGA AGTTGCCATT GTATCTGGTG 2880

TCCAGAATGG ACTGGCCGTG ACGTTAGCCG CCGCTTTTTC TCCAGGTCAG CGGATTGCCG 2940

TAGATCGATA CACGTATTCA AATTTTATTG AACTCGCCCA GCTTTATCAT [illegible] 3000

TCCCGATTGG CTATGATTCC GAGGGGATGG ACCCAGAGCA TTTACTGCAA [illegible] 3060

AGAAAAAAAT ACATGGTATT TTCTTAATGC CCGCATGTAA TAACCCGATT [illegible] 3120

TGTCTAGCGC CCGCCGAATG ACTTTAGCTG AAATCATTCA GCAAGAACAC [illegible] 3180

TTGAAGATGA CATTCATTCT TTTTTAACAA CTTATGCACA GCAAGCAGTT [illegible] 3240

TTCAAGAACT CTTACCTCAA CAAACGATTT ACTTAGCGGG GATGACAAAA [illegible] 3300

CAGGcTTACG CATTGCGTAC CTTGTTTTCC CTCCGCTTCT TCGCCAAGAA [illegible] 3360

CGATTTTTAA CATTAACGTT AAAACTTCTG GCTTTGATGC TGAAATTGTC [illegible] 3420

TACGCTCACC AGTTGCAAAA GAATTAGTGA TAGAAAAATT GGCTTTAACA [illegible] 3480

ATGACTTATT TGATACCATT TTTGGGTTAG CTAGACCTAG CAATCCCTTG [illegible] 3540

GAACTATTCC CCTTTCTACC GAAAAAACGG rncrGCAAAT TGAACAAGAA [illegible] 3600

ATGGGGTACG TCTTTTTCAC TCGAGCCGCT TTAPTGTGCA AAATCAGCCA GATGCTTTTT 3660

TACGAATTTC CTTGGCCTCC AATCAGCTTG AGGTATTGGC GAAAGGATTA ACCATTATCC 3720

AAGAACTCTT ACCGACATTA AACGAAAAGA AAGGACATTC CCTATGAATT [illegible] 3780

ATTTGAAAAA GCAGTGGGCA CGATTACCCT GCTTTCCGAA GAAAAACAAA [illegible] 3840

TGAAGGCTTT TTATGTCAAG TTAATGAATC GPAACAACTT ATCCGCAGTC [illegible] 3900

AAAAACGCCG AAAAAAGAAG GCTATTTTGT AGCCTTCTGG GAAAAAAATC [illegible] 3960

GAACGAAGCC TTTGATGCAA CGGAAGCCCC TGAAATGTTA GCAATTGTCA [illegible] 4020

AGAAAAGCAA GGTCTCTTTC TTCTGCCAAA AGAATGTTTA ATCCAACAAA [illegible] 4080

AACGCATCAA CAAAAAGGCA AAATGGCGGC CCGTTTTTAT CCTTCTTGGT [illegible] 4140

TAATCAAACC GCCAAAAAAA CACAAAAATG GCAACTAACT TACTTTACTG [illegible] 4200

ATACTAAAAA CGTCAGCTGG CTTCAGAAGC CAGCTGACGT TTGTTTCTTC [illegible] 4260

CCAATTGGTG GCGTACCTTC TGCGTTCCCA GCAATCGCTT CGATTTGAAT [illegible] 4320

TTTGGTAAAT CCGTCACACC AATCACTCTT CTTGCAGGCG TGCCTTCAGG [illegible] 4380

TGGTAGACGT CATCTACTGC CGCCAGCTCT TCAATCTCTT TCACAAAAAT [illegible] 4440

ACCAGATCAG CTAAGCTGTG ATCCACACTT TCAATAATCG CTTTAATATT [illegible] 4500

TGCGTCGTTT GTTCCTTGAT TCCGCCAGCG ACTAACGCAT TTGTTTTAGG [illegible] 4560

AGCTGAGCTG ATAGATGGTT GTAATGAGAA AAAGCAACAG [illegible] [illegible] 4620

TTCGGTGCGT GCTCTGTATT ATTTGCTTCA ATAATTAATC CATGTCGATC TTCAATCGCT 4680

TGTGGTGGCG TACCATCCCC ATGAGAAACC ACCGCTTCTA CTTGCACCGC CGCGCCCATT 4740

GGTAAATCAG CAACAGCAAT CACTGTTCGT GCTGGCATAT AATTGACCGT GCGCGCAATT 4800

CCTGAATCTG GGAAAAAAGC GGCATGTACT TGATTGACAG CTTCCAGCTT AGATAAATCT 4860

TTTAAATAAA TGTTGATCTT CACAATATCA TCAAAAGGCA CATCAATACT GGTTAAAATG 4920

GCTTTTATGT TTTTCAAACA TTGCTTTGTT TGCGTTTTAA CACAGCCAGC CACAACTCTG 4980

CCCGTTTGAG GATCTATAGG CAATTGTGCA GACAGATTGT TATAGTGAGA AAACGCGACC 5040

GTTTGACTGG CTAGAGCATC AATTGGTGCC TGAACAGTCT GATTTGTTAA TTTAATTAAA 5100

TCCCCAGCTT GAGGTGCATT AGGGATAGTT CCTTCCCCAT TTGAAACAAG TGCCTCCATT 5160

TGAACAGAAG CgCcTAACGG CAAATCTGCG ACGGCGACCA CTGTTTTAGC TGGATAATAA 5220

TGGGTAAAAA AGGCCGCATA GACCTCCTCA ACTGCGGCAG CATCCCGCAT ATCTTTGACA 5280

AAAATGGTCA TTCTGACAAT GTCACTCATC ACATGGTTGA TACTTTCAAC AACCGCTTGG 5340

ATATTTTTCA AACATTGTGT CGCTTGGGCC CGAATGCCAC CTTCTACCAA TTGTCCAGAG 5400

GTTGGCTCAA TAGGTAATTG TGCCGACAAA TGATTATAGT GGGAAAATGC GACTGTTTGG 5460

GCTGCGAGAT GATGCTTTGG TGCACGCTCT GTATTTCTTG CTAAGACGAT ATTTTCATTA 5520

CTCATAACTC TACTCCCTTT TTTAGCATTC CAACAAAAAT GATTCCGCTT ACATTTTTAA 5580

GCATTAAACT TCCAACAAGA TCCTTTTTAT TTTCCTGCTT TTCTATTAAT GGTGATAGCT 5640

TTCCACCACC ACTTCTTTCA GTCTACAATA AGAACAGGAA CATTTCAATA AGGCATTCCT 5700

AAATTGATTG TTGGCCCGCA AACAAATCTT CTTCAGTTAA AGAAGACTGG CACGAATCAA 5760

GCTTGATTCG TGCCAGTCTT TATAGaTAAC TCGTTAGCTT TAACTAGTTT TATttAaCCA 5820

CCATATACTC ACGATAATTT TCCGAGAACG TATCCATTAG CGTATAAATT CGTTCAAAAT 5880

CATAGTCACG ACGAGGGTCA TCATACCAAA CACATTGAAT ACTTCCAATA ATTGGAAGAT 5940

TCGGGCTACC TGCTGGCACA TCTGTAAATG CTTTTTTCGA CATATTTGCT AAAGCATCAT 6000

CATATTGATA AATGCCAGAA TCAACATTGC CAGCGACCCA ATACCACGCA TCATTGGTAT 6060

TAAAAATTTT GTGTCCTTTT TGAACAAAGT ACTCAGGTTT GGCTACGTCA TATCCGCTCC 6120

AACCGGCTGT CCAA1AAGAA ATAATAATCT CTGGGTCAAA TGTACCGAAA GAATCGTCGC 6180

TGTTGTAATA [illegible] TTGAAGCTCA TTGGCTGCAT GCCCGCATCT TTAATTATTT 6240

TAGCTAAAiC [illegible] GCGACAAAAT CTTTGTAGCG CCCAGAAGAT TGCAGTTTCG 6300

CCCAACCGCC TGTATCGACA TCATTtGCAT ACTCATCGCC GCCAAAATTG AAAATTTCAC 6360

TATGCGCGGA AAAATAAGCC ACGTACTTAC TGATAATCGC CTTTGTGAAA TTCACTGCCT 6420

TTTGGTTCCC TAAATCTACT GTTCGTTTTG AGCCATCAAA AGCGGGATTT TTAATCGCTA 6480

GTTTTTCCAT GGCCACTAAC AAGGCATCCA TATGTCCTGG ACTGTTAATC ACGGGAATGA 6540

TGTTGATGTT GCGCGCTTTC GCAAAAGCCA ACAACCGATC CATTTCTTTT TGTGTTAACG 6600

CGTTGCCATT AGGATCATTG TAATATGCGT TGTTGCCTCG TTGGATTGCT TTTGAAACCC 6660

GGTTGTGGTT GTATTTTTTA CCATTCACAT TGACCGACAT ATCATCCAAG ATAAACCGTA 6720

AGCCATCATT TCCTAAAATT AATTGGACAT CTGTGTACCC ATTTTGACTA GCTTTGGCCA 6780

CTAATTCTTC CAGTTGCTCC ACGGAAAAAT ATTTTCTTCC CGCATCAATG GAAAAGACAC 6840

TTTTTAAAGG TTGTGTTTGT TTCTCTTTTA AATTATCCAA AGCTGTTTGG ACCTCCGTTT 6900

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GCCAAGTAGT CACTTGTGCG GCTGTATACG T.TTTTCCCGC AAGTTCTGTG GCGACTGTCT 6960 

CTTTTGTTGT TTGTAGTGCT TGAATCGTTT CTGGTGTATA GCTTTCT.GCCL TTTXCAATGG 7020 

CTGTAACTGC TTGATCCAAC GCACGATATA AAGTGGTTAA GTCAATGGCT AACAGGCCTT 7080 

TTTCTAATTC AGCATCATAG GTGTTTGAAA TCCCTTGGCC ATAATCGCCA CCTAAAATAG 7140

CTTTATTCAC ATCATAAAGC GTTGTTGCTT GTGTCACTGC TTCTGCGGCT TTATTTTGTG 7200 

CTTCAAGGTC TGTGTTGGCG TAgCTAATTC GTTTTAAATA GTGTTGCGCA GCcGCTTTCA 7260 

TCTCTGCAAG AGAAACGCCT TTACTCTCTG CAATGGCCGT TTTTGTCCAA AGTAAATTAG 7320 

AAGGTTTTAT CTGATTTAAA TCGTCTTCAT TATAGGTGCG ACCATCGCGA TCTAAGGCAT 7380 

AGAGGAACAT GCCCCCTAAA TTATTTTCAT AAGAATAACG AGCTACTTTA TACATGTTAC 7440

TTTCCATATA CGGCTCTTTT GTGTCATACC AGCGGTTGCG ATCTTGCTCT TCTGGGAAAG 7500 

TTAATCCTGG AACAAATCGT TCCCCATTGA GAACCGGACT TAAATTATTT AATGCTCGTT 7560 

GCGTGCGTTG GTCATCGCTG CCATACTGTT GATACGCGAG AAAATCGAAA CAGTCACTGA 7620

CATCTTGTAA AGGTGGTAAA TATTCTGCAT TGGTATCATA CAAAAATGGA CGATCCGTTC 7680

CCGACTTCGG TCCAATGTAT TTTGATAATG CACGAATGAC ACCATTAGAT AGAACAATAT 7740 

CTTTTTCACT TGGACGAGTT TCCATGTCAA TATCTAACCC ATCAATTCCT AAATCATCGA 7800 

CAAATTTGGT TAACAACTCT TTCGCATAAG CATCAAATTC TGCTTCTGTA GGCGTTGTTC 7860 

CTGCATAAGG AACTTTTAAT AGCTCGCTGT AATCAATCCC ACGAACTAAA CGAACACCTC 7920

GTGCATGCAG TGctGGCGCA TACTCATTTT TTAACGTATC ATAAAATGGC TGTGCAAGTG 7980

CTTCTTGTCC TTTTGGCACA TAACTAAAGA CATTCACAAT GTCAATACCA TAAGGAATAT 8040 
CGTGCATCGT TAGCCAATTT TCATCTGGCA ATGTTGTATT AACTCCTTGC ATCGTTTTGT 8100 
CACGCCAAGC CCGATAATAA ACCATAAAAT GTTTCTCTTG GGTCGGCTCA TTGGCTTGGC 8160 
TGATCATTGG TTGATGAAAC AGACTAAATA AGATAAACGC CAAGCCAACC AATAACGAAA 8220 
GACCCAGCCA ACTGACAATT GTTTCTCTGT TTTCTCCTTT TCTTGTCGAT AAATTGTTCC 8280 
CCACTTTGAA CACCATTCCT TTCTGCACTC CGTTCATTTT TGTGTATCCA CCTAAAAAGA 8340

AAGCGCATAC ACAACCATCA ATAGAATAAC AAAAAATAAC GAATTTGCAA AATAGAAACT 8400

GGAAAAATGA GTATTTTCTA TAAAAATCTT TTCCCGCCTA AAATCACTTT TGTTAGACGG 8460

CGTTTAAATG GTAAATTTTG AGACTTGCAA GAAAAACAGG CCTATTTTTC TTGCAAGTCc 8520 
CCTTTACTTA GTCGTGATCA ACCAATAAAT AATCACTGTT TGAATTTTTT GGGATCTACT 8580 
CGTTTGAAAT TTATCTCATC ATCAATAACT GCTCTATGCT TTAAATcTAA ATGATATCGC 8640

ATATCATGAT TTATATTTGA ATTTTTTTTC AGATTATCAT TATATGTTTC TAAAAATAAT 8700 
TTTAAAACTT TTTCAGGTTG CCTTATACCT AATACTTTTC TTGATATTTC TAAACCATTT 8760

ACCTTTTTTA TAAACCATTT TTGATTGTAT TTCATTATAG ATGACAAATT AAAATAAGCT 8820 
TGGAATACAG CTCCCGCATC TTTAATTTGA GTATTTTTTT TCAATTCATT GTATTCATTC 8880 
AATAAATTCT TTAGCTCTTC ATAATTGTAA AAACTGAACA TTCCATTCTT AACTTCTGTC 8940
ATCATTTTAT TCTGCCTAGT CCAAGATCCC ATGCATACAG AAACTAACTC AACTAAGGGA 9000

ATTTTGTTAT TCTCTATATG TAAATCTTTT AACTTTTTGT ATTCTTCATT TCCTAAAATT 9060 

ACGTATTTAT TAATATAGTC TATTACCCGC CACTTACATG CTGTATTATT TATTTTAATA 9120 

ATGTCATTCA TTTCTTTGTT TACGGAAACA TAGTATGGCA CGGGTAAGCC ATATTTTCTT 9180 

GCAATACTAT ATCTATGTTG CCCATCGATG ATCTGCATTG TGGAGTTTAC TATAATGGGT 9240

CGAATAATTC CTTTTTGTAG AATAGAATTT TCCAGTTTCT TGTTCAGTGT AACTTCACGA 9300 

TTACTTCTTA AAAAGCTGAA CATTTCATAT TCTTCCGTTA TATAAATATC ACTCATTTCT 9360 

CTATCCATTT TTTATCTCCT TTATTAAAAA CATACTTTGA ATATCTCTTA TTACTGGCTT 9420 

CCCATCAATT TCGATTTCAT ATTTAATTGC TTTGTTTGAA TTTCTATGCA ATTTATAATT 9480

ATATGCTTTC AAAAACTCTT CCAAAACGAC TCCTTGGTGA AAAATTCCTT CTATAACATT 9540 

TTTAGTTTTT ATTTTAAATC CATTAATTAA CCTATAATAA TCGAAATCTT TTACAATAAA 9600 

TAATTCTACA TAAGCAAAAA ATGTTTGCTG TGAACTTCTC AAGTTCAACT CTTCACAAAA 9660 

GTTTCCGTAA GAATCTAATA ATCTTATAAA TTCCTTGTAG TTATAAAATG AAAATTGACC 9720 
TGTTTTTACT AATTCTTGTG AATTACCATC ATAGTGTTTA TTTCCCATAG CCAAAGATAT 9780 
TAAAGAAGAG GGTGAGATAT TTTTATACTT CTTTACAATA AACTGTAGTT TTTTATATTC 9840
GTTCATACCA TCCATCGAAT ATTTCGTATG AAATCTCCAA TTTCCAGTCT TAATGTCGAA 9900
TTAAGCTCAA TTACATCATC TAAGTTCAGA CCTTTCGAAA TAATATAAGG AATGGACTTA 9960

TTCAATTTTC TGGCTATCAA AAAACGAGTT TGTCCATCAA TAATTTCAAA TTTCTCATTT 10020 

ACATCTATAG GAACTAAAAT ACCTGATTTT TCTATAGATT CTATTAAACT ACGCTTTTCT 10080

TTAATTGCTC TGTTTCCCTT TAAAAATTTA AATTtTGAAT AATCCTTAGT TTCATACAAT 10140 

TTACCAATCA ACTTTCCCAT ATTCTTCACT CCTTATAAAA TATATTTTTA ATTTATATAT 10200 

TTTATAATAT CGTCTAAAAA AAAGAAATGC AATTCTTTTT TAATCAATAT AACACCAACC 10260 

ACACATTAAA ATAACATTTT GGTTAACCAT CCATGTTTAT TAATTCGGCG ATTTTCACAA 10320

CAATTTTCCC TTGCGCTCGT CCTGTCAATG AATAATCTAC CGCTTCTTGA ATTTGGCTAA 10380 

AAGGAACGAC TCGGTCAATT ATCGGCTGTA ATTTTCCCTG TTCAATAAAC TCAGTGAGTA 10440

AAGCTAATTG CTCGCCATCT GGACGCATAA ACAAAAAATG ATAACTAACA TCTGTGGCTT 10500 

GCTCCAATCG ATGAATCTTG CGGGTAGCTA TTTTAAAGGC CCATTGTTTC CAAAGCGGCA 10560 

AGCCATACTC TTTAGCAAAA CGTTCGTTGG GAATGCCTGA CAATGTAACA ACTTTTCCCT 10620 

GAGGTTTAAC CACTGAGAAA GCTTTTTCTA AGATTGTCCC CCCCATTGTA TCAAACACAT 10680 

AATCATAGTC GGATAAAACT TCTTCAAAAT TTTGTGTCCG ATAGTCAATC ACTTCATCTG 10740

CTCCCAACGC TTGAACCCAT TCTTTATTTT TACTACTCGT TGTGGTGGCA ACGTAAGCGC 10800 

CTGCTAGTTT TGCTAATTGA ATCGCAATGG TTCCAATCCC TCCTGAACCT GCTTGAATCA 10860
GGACTTTCTG GCCTGGCTGT ACATTCATAA TATCATGTAG CGQTTGATAA CTTGTCAAAC 10920

CGACTAACGG AATGGCCGCA GCTTCTTCAA ACGTTAAATT TTTCGGCTTC ATAGCTACAG 10980

CCGCTTGATC CACTGCAATA TATTCAGCAA AAGTTCCCAC GCGATTTTTG GGTACTCGCC 11040

CGTAAACTGC ATCGCCCAGC CTAAAATTTT GAACATTTTT TCCAACAGAG ACAACAATCC 11100

CAGCAAAATC ACTGCCTAAA ATCAGTGGCA TTTGATAATC TAAAAGCATC TTTACTTTGC 11160

CATCTTTTGT TTTTAAATCA ATCGGGTTGA TACTGGCAGC GATAATTTTG ACCAAAACAT 11220

CATTGTCATG AATGGTGGGT AAnGGCACCT CCTCGATTGC TAATTCtTTT TGTCCGTATT 11280

TATGAATTAA AGCGGCTTTC ATCATTGAAT CTTTCCTCTC TTTCTATCCC TTTTATCGGT 11340

TTAAAAGTTT AAACCTTTAA TCAATTAGTA TCTTAACGAA GAATGGGATC CACGTCAAGT 11400

TTCAGCCCTC AGCGTATTTT ATTGACAAAA AACACGAAAA ATGGTTTAAT CGTTTAAAAA 11460

TACGAACCAA AGAGGCGAAA GAATATGAAA GAATTAGAAG AAATTCAAAC AGAATTTACT 11520

GCGCTTAGTG ATTTTTTAAT CGCATTGGGT GATGAAAAAC GCCAAGCAAT TATTATTGCA 11580

CTGTTAGAAG ACCAAGCCTG TGTTGGCCGT CGTGTCACTG AGTTAACAGA GGCGACACAG 11640

CTCTCACGCC CCGCTGTTTC TCACCACTTA AAAATTTTGA AACAAGCCAA ATTAATTGAT 11700

TGTCGCAGTG AAGGAACTAA AAATTATTAC TCACTTTCTC ACGATACCAC CAAAATTGAG 11760

CAACTCCAAC AACTATTGAC CCACATTACG TCGATTATGC CAGGGAGGAA GCAAGCATGA 11820

AAAAAATTCT CGTGGTCCTC ACAAACGTCT CCCGTTATCA CGGCACCGAA GAACCAACGG 11880

GCCTCTGGCT AGGTGAAGCC ACGGAGTTCG TGGAAGAAGT AACCAAAGCA GGATTTTCAG 11940

TTGACTATGT TAGTCCCCAA GGAGGCTATG TCCCTTTGGA TCCCCGGAGT ATGAAATATG 12000

TAGATTCATC CATTATGGCA GTTTATGAAA GTGCTGATTT TCAAGAACGA GCGCTAGCAC 12060

ATTCCCTCTC TCTTGAAGAA ATTGAGCCAG ATGAATACGC AGCAATTTAC TTTACAGGAG 12120

GCCATGGCGT TATGTGGGAC TTTCCTGATA ATCCTCAGTT GCAAGCAATC ACTCGTAGCC 12180

TTTACGAACA GGGCAAATAC CTAACTTCTG TTTGTCACGG AATTGCTGGC CTTTTAAATG 12240

TCAAAGATGC GTCAGGAAAT TATTTGATTG CAGATAAAAC CATTACTGGC TTCACTACTG 12300

CAGAAGAAAT TTTAGCCATG AAAAAAGCAG TTGTTCCTTT CTTAAACGAA ACAATTGCCA 12360

AAGAACATGG TGCTACTTTT GCGAAAAAGC GTCCCTATAA AGAATTTGCG TATCAAGATG 12420

ACCGTATAAT CACTGGGCAA AATCCTTTTT CTGTCCGCGC TGTCGCAAGA TTGTTAATTC 12480

AACAACTATC AAAATGAAAC AACAGCGACT ACTTTTAGGT AGTCGCTGTT GTTTCATTGC 12540

TTCTTCTTTT TTCCGAAAGC CGAAGCCCTA ACATAAAAAA TTTTTGATTA GCTAAAGAGT 12600

AACATCGCGC CAATTAAGAT AATTAGGTTA ATTAAGCCTA AGACAATCAT TGGTTTGGCC 12660

ATGAATTTTG TCCAAGTGGC ATATTGAACT TTGGAAATTT CCAAACCACC CATTACGACA 12720

CCAGATGTTG GCGTGATCAA ATTAATCAAG CCACAACCTG AAGCAAAAAT CATAACCATC 12780

ACTTCTGGAC TTAAACCAAT GTTTTTAGcT AACGCACCCA TTACAGGAAT AGAAACGTAC 12840
GCTAAACCAG AAGTTGAAGG AATTAGGAAA GATAAACCTA AGTATAAAAC AAAGGCACCG 12900

ATGACAAATA AGACTGGGGA GAGTCCTTGT AATAAGCCTG CCGCTTTATC AAGGATAAAT 12960

GAATCTAAAT ACGTTTTAGA CATCAAAACA GATGCCCCGC GAGCAACTAC GATAATTAAA 13020

ACAACGGATA ACATATCTTT GGCACCGTCG ATAAATGCTT CAATGGTTTC TTTTTCAGTA 13080

AAGCGACCAA TCAAGGCAGC AATGATCCCT AAAAGGAAGA ACCACATCGC TAAGTCGCCA 13140

AAGTACCAGT CACCGAAACT GTTGCCTGTT AAAACAGACG TCCAGCCTTT GAAAATCGTC 13200

ACTCCAAAGT CGTTCCATGG AATTAAGGAA ACAATCATAA TGACAAAACA GAAAGCGAAT 13260

ACCATTAAGA TTTTTTTATG ACGATTCGTA AATGGCATTT CTTTCGACGC TGCTTGTCCA 13320

TACGTTTTTT CCATATCCTC TTGTTCTTGT AAAGATAAAA TTGTTGAACC TTTGTCAGCC 13380

TTCACTTTTT TCGCATAACG CATCACAATG AAAATAGAAT AGCTAGTAGA GGCCGCCCAC 13440

AAAATAGCAC CGACAATTAA AATAATTCCT GTATTCGGCT GAATGCCAAT ACCTCGCAAC 13500

GCATCCATCG CTACCCCTGT GGAGAAAGGG TTAACCGTTG AACCGATAAC ACCTGATCCT 13560

GCACCCAATA GCACAGTACC CACTGCGACA AACGTATCAA ACCCAGCGGC GACCATGGTA 13620

GCACTCAATA AACCATAAAA AGGAATTGTT TCTTCTGCCA TTCCATAAGT CGACCCACCA 13680

ATAGAAAATA AAATCATTAA AATCGGAATG ATTTTCAATT CGTTCCCTTT TAACTTTTTA 13740

ACGACTGATT GAATCCCTGC TTCAAGAGCC CCAGTTTGCG TCACAATATT TAAAAAGCCA 13800

CCGAGTACTA AAATAAAAAT ATTAATTTCA ATGGCATCTT TAAACCCATT GAATGGAGCC 13860

ATGATAATAT CCGATAATTT GGCCCCTACC ACGTGGTCAA CCATTTTGCC ATCAATTTCT 13920

CTTGGGGTAA ATGGCTGGCC ATTTAGCAAG ACCGTTACGA TACCTAAAAC AATTAAAATA 13980

ATAAATAAAA TACTAAAAGA CGAAAGCTGT TTTTTGTTTT TGTTCTTTTT TTCTGCCATA 14040

TCCTTTTATA CAGGAACCTA CTTTCCTGTA TCTCCCTTCT CTATGATTAA TTACTCAATC 14100

GATCAATAAA GAATGGCGTA TCTAGAAAAG TTAAGAATTT ATGACTGTAG CTAATACGGT 14160

TCATTTGGAC TAATTTTTTC ATAACACTGC TTGCTGTTTC CCGCGTAGTT CCACTCATTC 14220

GTGCAATGTC ATTAATGGTA ATTGGACAAG TAATTGCAAC GCGTTCTCTG TTGTCAATTC 14280

GTTCTCCTAA ATCATTTAAT AACATCGCCA GTGATGTCAC AATCCGTTCA CATGCATCAT 14340

TGGTTGTTCC TTTTTGGATT TTGATCATAC TTTCTTTTAA AAGGACCGAT TGTTCTTGAA 14400

TCCAATGAAT CAATTGCTGG GGATTATTTT GGAGAAAACG CTCAAACAGT TTAACTGGCA 14460

CAAACACAAC CTCAATATCC GTATAGGCAA TCGCGGAGTC GTAATGAACC TCATCTTGAA 14520

ACAGTCCGAT TCTCGGAAAA AGAGCCTGCT TTTTCATAAA ATGCAGATAG GAAAAATTAC 14580

CACTGCGATC ATTTTTTTCA ATTTTAATGA CACCTGAAAC CAATAAAATT AAATGATTTC 14640

TCGGATCACC TGGATAAAAA AGGACTTGGC CTTTTTTATA GGAGCGAACG AAACTCTCGT 14700

TGCGCAATTC TGTTAGTTCT GATTCAGAAA AGTATTGAAA ATCTTGATTA TTTTTAGTGA 14760

TTATTTTCTC TATCATCATC TTAATTCTCC TAAATACTAT ACTTTATTTT GTCACAACCG 14820
TTCCTACGAT TTCATCGCCG CTCATACTTC GTAAGTTTTC CAAAGAAGTG ATAATTGCTT 14880

GTTTATTGGG TTGGCTTTCT ACAAACTGAA TGGCGGCTTC AATTTTCGGT AACATACTAC 14940

CTGGTGCAAA ATGACCCGCT TGTTTGTATT CTTCCAACTC TGCTACAGTG ACATTGGTTA 15000

ATTGTTTTTC ATCTGGTTTG CCATAATTAA TACACACATA ATCCACGCCA GTTAAAATAA 15060

CTAGTGCATC CGCATCCACT AATTCTGCCA GTTTTTCAGA AGCAAAATCT TTATCGATTA 15120

CTGCTTCGAC ACCTTTTAAT TCTTGTCCTA CGACAGGGAT GCCGCCACCA CCACATGAAA 15180

TGGTAATTAT ATCATTTTTT ATTAAGGTAT TAATAGTCTC AGCCTCGTGG ATGTCAATTG 15240

GCTTAGGACT TGGAACGACT TTGCGCCAGC CACGTCCTGC ATCTTCTTTA AAAATAGCAC 15300

CTGCTTGCAT TGCTTCTTTG GCTTCAGCTT CTGTTAAAAA TGGACCGATC GGTTTTGTTG 15360

GATTTTTGAA TGCCTCATCT GCTGGATCTA CGACCACCTG TGTTAAAACA GTAGCCACTT 15420

GTTTCTTGAT TCCTGCTTTG TTTAATTCTT GATTAAGCGC ATTGGACAAC CAGTAGCCGA 15480

TACTTCCTTG TGTCATAGCA ACACACGTAT CTAGCGGCAT CGCTGGATTT TTTTCAGAAT 15540

CAGCTGCTTG TTGTTGCAAT AATAGATTTC CAACTTGTGG ACCATTGCCA TGTGAAACAA 15600

TCAACCGATG CCCTTGTTTA ATTAAATGAA CTAAATAGGC AGATGTTTGA ACTAATGCTT 15660

GCTGTTGTGC ATGTGCGCTA GCATCATTTG AAAGGATGGC ATTTCCGCCT AGTGCAACGA 15720

CCATTTTTTT CCCCATTACA AATCCTTCTT TCCTATCATA ATAAAGAATA GGCCGCCCTT 15780

CAAAAGAAAC TCTTAAAGGG CGAAACAACT ATTCTTTTTT CAATGTTTTT TCAGTTGCTT 15840

CCTACACACG AGGAATGAAT AAGTTGCCTA AAGTAGCTGC CATGATCGCT TTAATAGAAT 15900

GCATTCTGTT TTCTGCTTCT TCAAATTGAC GACCATATTT ACTGCGGAAG ACTTCATCCG 15960

TAATTTCCAT TTCAGTAATG CCAAATTTTT CAGCAACCAT TTCACCGTAT TGTGTTTTCG 16020

TATCATGAAA GGCTGGCAAA CAGTGTAATA CGATTAAATC GCCATCCATG TTTTCGGTTT 16080

TTTCTAACAT AGCCATATTA ATTTGATAAG GTTTCAACAG ATTGACACGT TCTTCAAATT 16140

TATCTTCTTC ACCCATGGAC ACCCAAACGT CTGTATATAA CACGTTGGCT CCTTTAACGC 16200

CTTTAGCAAC GTCGTCTGTA ATCATTAATT CTGCACCAGA TTCTTTCGCA AATTCTTTTG 16260

CATAATTAAC GACTTCATCA GAAGGGAACA ATTCTTTTGG TGCACAAATG CGAACATTTA 16320

CGCCTAAAAT AGCACCTGTC ACTAACAAGC TGTTTGCCAT GTTGTTACGG CCATCGCCGA 16380

CATATACTAA AGTGATTCCT TCTAAACGAC CAAAGTTTTC TTTCACTGTC ATAAAGTCCG 16440

CAATCATTTG TGTTGGATGC CATTGATCCG TTAAACCATT CCAAACAGGA ACACCAGAAT 16500

ATTTCGCTAA ATCTTCCACT ACTTCTTGAC TAAACCCACG GAACTCAATC CCGTCAAACA 16560

TACTACCTAA AACAATCGCT GTATCTTCAA CAGATTCTTT TTTACCTAAT TGAATATCGT 16620

TGGCGCCTAA ATACTCAGGA TGTGCCCCTA AATCAATCGC TGCTGTAGTA AATGCAGAGC 16680

GTGTCCGTGT CGAAGTTTTT TCAAATAAAA GAGCGATATT TTTTCCTTCT AAATAGTGAT 16740

GTGGAATCCC TTTTTTCTTC AATTCTTTTA AGTGAATACT GAAGTCTACC AAATACTCTA 16800
ATTCTGCTCT TGAAAAATCT TTTTCTGCTA ATAAACTACG TCCTTGAAAT ACTGAATTCA 16860

TGATGATTCC TCCTATTTTT GGGTGTTTTT TTAAAGATCT TCACGGTAAA GTGGCATACT 16920

CATACAACGA GGGCCACCAC GGCCACGTAC TAGTTCACTT CCTGGAATGT [illegible] 16980

CACGCCTGCT TCTTCTAGCG CTTTATTCGT AATCGTATTC CGATCGTAAA LrtALAALliG 17040

CCCAGGGGCA ATTGCTAATG TATTTGAACC gTCGTTCCAT TGTTCACGGG [illegible] 17100

TAAATTTCCA GCGCCGCAAC GGATTAGTTG AACATTGTCT AAATGCAAGT AlIIGGA1AA 17160

AATGTTATCT AATGTATCTT TTTCTTTTGT AATTTGGATG TCTCCATCTG L1iJ.11GAG1 17220

GATCGAGTAA ACAACCAAAC CACCTTCGAT TTCTGGATGA ATCGTAAATT TATCATAGTC 17280

AATCATGGTA AAAACGGTAT CTAAATGCAT GAATTTACGG TGTTCACCGA TATCAAAAGC 17340

CAAGATATTT TTAAACCCTA ATTTTTGTTC AAAAATATTT CTCGCAATTT TTTCAATTGA 17400

CGCGGCATCC GTTCTTTGAG AAATCCCAAT GGCCACTACT TCTTTTGAAA GAATTAATTC 17460

ATCGCCACCT TCAATTCTGG TTGATTCTGA ACGATCATAG ACTCTAGGAA CCTCTTTTCC 17520

AGCAAAACGA GGATGATAAT CAAAAATGTA TTGCCCAAAA ATGGTTTCCC GTTGTCGAG1 17580

TACTGAATAC ATATGATTTA AAGAAATCCC GTGCCCCATT GTCGCAAAGT TATCACGCGT 17640

GAAGTATAAG TTAGGCATTG GATCAATAAT GAATGGATAA TCCGCTTCAA CCATATCTGT 17700

TAAAAATTCT TGCTCATATT TAGGTAATTC TTGTTTTTGA ATGCCTGCAA TCGCTTTTTG 17760

AATCAGTTCT TCGTTGTCGT CAATTTCTAA CATTAACTCT CTAACTTTTT CTTTTGCTGA 17820

TTCGCTGCGA ATATTGGCTT CTTCTAAGAA TTGGTCAATA AATTGTCGGC GGACCTCTIG 17880

ATTAATCAAC GCTTCAGCAG CTAAGTCCTC TAAATAGACC ACTTCGATAT CTTT1GA1GG 17940

TAACAACTCT GCAAATGCAT CATGTTCTGC TTGTGCTTTT TCTAAAAACG GAATAIGAiG 18000

AAACAACAGT CTCTCCAGAT AATCTGGCAT TAAATTTTCT AATTCCTTAC CTGGAGGA1G 18060

TAACATCACT GTTTTCAATT TCCCGATTTC AGAAAATACA TTAATTGGAT GAL1OA1GiG 18120

AAATAACCTC CTCAACTTAA TTACAAACAC AGAATAACAG ACCTGATTAA LL.1ALjoAHj1 18180

GAGCCATTTG CCAAACAAAA TACCAAATAT ACAACGGTAT TCTTTCTTAT GGA1A111G1 18240

AGCCTAAAAG AATGCGCTTT CATAGATATT CTAAAAAAAT AATGCATAAT IAAA1AAAAA 18300

AAAATGCATT TATTCAAAAA AAATTATTTC TCCACACACT TCATAAAGTT A1AA1G11A1 18360

AGAGGkAAAA GGAAGTGAGC CAGAAGTCAA AACGACTTCT GGCTCACTTC CTTTTATGCG 18420

TTACTTTTTG ACAAAGGTGT TGGCTGAATA AATTGTTGAA AATACTGCTC TAGTTCTTGT 18480

GCTTTTTCTT CGGATGATGA AACAATAATA ACAGTGTCGA AGcCAGCGAC AGAACATTTC 18540

TTTTCACACA AAGACACTTC ATCAAGTACA GCTGAAAAcA ATGTTGCGTT GTCTGGTAAT 18600

GTATGAACCA CTAGTAAAAA GGCGACACGT TCAACTTTGG TCACTACCTC ACGAATTGCT 18660

TGTATCAACT GTTCTTTTTT TCGTTCTTCT AAAGAAGTGA ATTGTTCAAA AATCGTTAAC 18720

CGCATCTCGC CAGACTCATC TAATGCTTTG ACGACTTTTA ATTCCCATAA ATCCCGTGAA 18780
ATCGTTGCTT GTGCGACATT TATCTCGCGC TTTTTTAATT [illegible] L.111Ii 18840

CTATGAATTA CGGTTGTTTG AATGATTTCT TTAATCAAAC TA1GGUG1GI 11kjK,i1Al1U 18900

ACAAATAACA CCCTTCCTCC AGACATTTAC AAAGCAATAC TAGCAGAA1L 1C1AA11A 18960

GAATAGAAAA AGGGTCGTTT AGCTAAAAGG TTCTCATTGA TTTTGTCGGA TATTTCGCAT 19020

TTAAAAAAGT GAGACTTGCA CAATTTAGTA AAACGTTGTA CAGTGATTTT aGGGAATTAC 19080

TGGCACCGTT TTAAATAACA GCAATGATGT TTTTTTGCAT GATTAACTGA ACTGGAGGAA 19140

AGTAATGAAC CGATTAAAAC GCCAAACGAT GATTAAGCGA TTGGTCACAG AAAATGAAAT 19200

CCATACACAA AGTGAATTAC TAGCATTATT GGAAAAAGAA AATATTCACA CCACACAAGC 19260

AACTATTTCT CGTGACATTC GTGAGCTGAA TTTAAACAAA GTAGACAATG GTCAAGGAGA 19320

AGCCTATTAT CGTATCTTGA ATAATTCTGC CCTTCGTCCA AAATATAAAA CAGATGAAGA 19380

ACGTCTTGTT GACATTATTA TTGAAACTGG TGTTTCTTTA ACACAAATAG AGTTCACAAA 19440

TCTTTTAACC GTCTTACCTG GCAACGGGCA AGCCATTGGG GTTTTAATTG ATCGCATACG 19500

TATCCACAAC CCTCATATCT TGGGCTGCAT CGCTGGGGAT GATACCATCT TAATTCTTTC 19560

AAAAAATAAA GAAGATGCCC TTGAAGTTAA CAACTATTTC CAGCAGTATT TGTACCACCC 19620

ATAATTCATC ACTTAGTAAA ATCGACTATT TTTTGACATC ATtGCTATTT TTATTAAAGA 19680

TAAATAAAAA AAGGCGTCTC AGTGaAAGCT GAGGCGTTTT CTCATGAGTT TTTCTTTTGA 19740

ACACCGTCTT TTTTTTGTAT AATAATCAAA AAGCAAAGAA AgGGkTTCTT ATGACTGAAC 19800

AATTCATACT CACTTCAAAC GATCAACAAA CTCAGCTCAA TGTACGTCAT TGGCCCTGTC 19860

CTTCACCCAA AGCCGTAGTT CAATTGATTC ACGGCATGGC AGAGCATATC CAACGTTATG 19920

ATGAATTTGC TCGTTTTTTA AATCAATTAG GATTTACCGT AATTGGTCAT GATCATCTAG 19980

GTCATGGCGA ATCTGTAsaA CCAAACGCAC CGATCTATGG TTTTTTTGGA GAACAAGGGL 20040

CTGAAAATGT CGTTACAGAT ATTCATCAAG TGAAACAATG GGCTGTGAAT CGATAxCLAU 20100

AATTGCCCTA TTTTATGATG GGTCACAGCA TGGGGTCTTT CGCCTTGCGC AACTATClbC 20160

AAGACTATCC TGTGACTGTG CAAGGAGTCA TTTTCATGGG CACTGGAACA AGTCCG11AC 20220

CTTTAACCGC AGCATTGCCT TTTATTAAAA AAATGGCCGA GAAACAGCCG AAAAAACCAG 20280

CTCCGTTCAT TGATAAGCTG GCATTTGGTT CTTTTAGCAA AAAATTTCCT GAAGCAAGTT 20340

CCTTTAATTG GCTTTCTAAA AATCAAGCCA ATGTGGCTGA CTATGAAAAT GACCCATTAA 20400

TGGGCTTCGT ATTTACCAAC AATGGTTTTG CTACCCTTTT TTCGCTTGTT AAACGAGCUA 20460

ATCAAAGGAA TTGGTACCAA GCGATCCCAA AAGAATTACC TATTTTAATC ATTAGTGGCG 20520

CTGAAGACCC AGTTGGTGAT TTCAGCAAGG GGCCAGCGAA AATTCAAAAG CAATTAAAGC 20580

ATGCAGGTTT TCAGCACGTG ACGTTACGAC TATTTCCCAC ATTGCGTCAT GAAATTTTAT 20640

TAGAAACGGA AAAAGCTACA GTTTTTCAAG AAATTGGGCA TTGGTTAACG GATTTAACGA 20700

ATTAAAAAAG AAGGTTGATG TTGTGACATC AACCTTCTTT TTTAATCTAT TTTCGTTAAA 20760

CCGCCCATAT AAGGACGTAG AACTTCTGGA ATCGTTACGG TGCCATCTTC ATTTTGATAA 20820

TTTTCCAAAA TGGCAGCCAC TGTTCGACCA ACAGCAAGTC CTGACCCATT AAGAGTATGG 20880

GCATATTGAA CATGACCCGT TTTATCACGA TAACGAATCA ACGCCCGACG TGCTTGGAAG 20940 

TCTTCACAGT TTGAACAAGA ACTAATTTCT CGATACGTTT TCTGTGCCGG AATCCAAACT 21000 

TCTAAATCAT AGGTTTTCGC TGCCGAGAAG CCCATGTCAC CTGTCGAAAG AGTAATCACT 21060 

CGATAAGGGA GGCCCAATTT TTCTAAAATG TCTCCAGCAT TGTTGGTCAT TTTTTCTAAT 21120 

TCTTCATAGG AGTGTTCTGC ATCACTAAAT TTAACCATTT CTACTTTATG AAACTGGTGT 21180 

AAACGAATTA ATCCTCGGGT ATCACGACCT GCACTACCCG CTTCCGAGCG AAAAGAAGGA 21240 

CTTAAAGCAG TAAAATAAAT CGGTAACTCT TGCGCATCTA AAATTTCATT ATTATAGTAG 21300

TTAGTTAACG GGACTTCCGC TGTGGGGATC AGCGTCAAAT CACTTTCTGC AACTTGGAAA 21360 

ACATCTTCTT TGAACTTAGG AAACTGACCG GTGCCAAACA TTGCTTTGCT ATTGACCAAA 21420 

TAAGGAGGAA TCACTTCGGT ATAACCATGT TCATTAACAT GTTGATCTAA CATAAAATTA 21480 

TAGACCGCTC GTTCTAAACG AGCCCCTAAG CCTTTGTAAT AAAGAAAACG GCTACCAGAT 21540

ACTTTCGCGC CTCTTTCAAA ATCTAAAATT CCTAAGGCGG GGATCCTCTA G 21591 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

CAATTGCCCC GATTGGAATA AATGAAGGGC CTTGCGCAAC TGGTAATCTC ACACAAAAAT 60 

GAGATTGAAT AATCGTGGCA AGTCCTGCTG CTATAAATGT TGCTTGGATT AAAGAAGACG 120 

TTTGTGCTGT GCTAATCCCA ATAATTGAAG CAATGATCAC AGGTACGACG TACACGTCCA 180 

TTGCTAAGAC ATGCTGAATC CCTAACATAG TGGCCTGCCC AAAAGATACT GAATCTTCTG 240 

GCCCCACTGT TAAATGAAGA TTATTTTGTT TCTTTTCACT CATTTTTTAT ACTCCTTTAG 300 

TTGCTGACAT TAAATTTTTA TCATGTACTT TTTCACCTTG TACCCAGACT TCTCGAATAT 360 

TTTCTGGTCG TGCTAAATAT AAGATTTTTT GGAAAATATC AAGCAATGAT TCCCCTGGAT 420 

TTTTAGGAAT TCGTGCTTGT GACAAGCTTG TATTAATAAT TTGAACATCC CAAGCATACC 480

CTTTTTCTAA TTTTCCAATT GGTAAACTTA GCGCTTGACC ACCACCAGCA GTAGCTAAAT 540 

AAAACGCTTC ATTTACGGTA ATCCGTGCTG TTTTAACACC TCGTACTTCT GGTTTTTTTG 600 

TAGCATCGAC GCCATCTTCT AACATTCTAG AAGAAATAAC CGCTTGCTTT AAATTGTCAA 660 

AAAGACTTGG TGAAAAACCA CCTGAAAGAT CACTACCTAA ACCAACTTCT ACTTGATGTT 720 

GATGTACTAA TTTTGCAATC GGTGTTACTG CATTACCAAA ATAAGCATTA GAGATTGGGC 780 

AGTGCGCCAC AGCCGTCCCT GTTTCATGGA ATAAATTCAT ATCTGCTTCT TCTAAAAAGC 840
CTGCATGTGC CATCACTGcT TTTTCTGtCA ATAAACCAAA ATCATTCAAT GCtTGAGCAT 900

CATTTTTTTC AAAACGTTCT TGaACAAATT GATGTTCCCA ATCACTCTCA CTACAATGAC 960

TTTGAACAkG GACTTGATAT TTCGCTGCTA ATTCTCCTAA TCCTTTAAsT GCTTCCTCTs 1020

TACAACTTGG AaTGAAAcGC GGaGTTACAa CGGGrTACAC GCCTTGTTTT GTTTGCTGCG 1080

CTAATGTTTG AATTTCTTGA ATAAACCGTT CCGTTTCTTC TAATGCTTGT TGCGTTGTTT 1140

GATCACGATA GTATTCTGGA TTCTCATTAA GATCATCCAT TACAACTTTA CCAATTAAGC 1200

CACGCTGCCC TTTTTCCGCA CAAATCTCTG CCAAACGACG GCTAGCTTCA TAATGAATGG 1260

TTGCAAAATA AAGGGCTGTT GTTGTCCCAT TAGCTAACAA TTGGGTTACC AATTCATCAT 1320

AGACTTGTTG GGCAAATGCT AAATCTGCAA ACTTGGACTC CAAAGGAAAC GTGTACGTAT 1380

TCAGCCATTC ATACAATGGA ATATCCAAGG CGGTCCCTGA TTGTGCCCAC TGAGGTGCAT 1440

GGACATGTAA ATCAACAAAG CCGGGTAAAA GATAAGAATC TTGGGGCAAT TCTTGAAAAC 1500

GTTGTGCCTC TTTTTGTTCT GCTAGCACAG TTTCATACGC TTCAGCATCT GGTTCAATTA 1560

TTTCCTGAAT TACTCCGGCC TCATCTATAC AAAAAAGTGT CTCTTCATAA ACTTTAATCT 1620

CTTGCAAATT TTTACTAGAA AAACCTGTTC CTTTTATTAC AAAATGAAAC ATTTAATACT 1680

CCTCCTTACT ATCACATGAA GATTATACAT AAACTTTCGC TTAATAACTA TAAAAATGTG 1740

ATTTTTTTAA AAAAAATCCG TGTTAAAACA AGTAAAAACC GAACATTCTA AGaATGrTTT 1800

ATTTTTTCTT TGGATAAACA AGAAAACTTT CTATTCTTAA GATTCTTCGG TATACTTGAA 1860

GAGAATCGTT AGAACTTTAG GAGGTCCTGA GATGATACAG ATCGaACAAA TTAGAACAGG 1920

TGTTATTCAA GAAAACTGTT ATCTTGTTTA TAATGAAGAA GCGTTATTAA TTATTGATCC 1980

AGGTGCAGAA GGAGCAAAAC TAATTCAAGA AATTACCCGT TTAGGGAAGA AACCAGCCGC 2040

TATTTTATTA ACACATACTC ACTATGATCA TATTGGTGCG GTAGAAGAAT TGCGCCATCA 2100

TTATCAAATC CCTGTTTATG TTAGTCCATT AGAACAAGCG TGGCTTTCCA ATCCTATTTT 2160

GAATCTTTCA GGATTAGGCA GACATGATGA TATAGCAAAT ATTATTGTTC AACCCGCCGA 2220

ACATGAATTT AAACTAACCG ATTATGAAAT TGGCGGCATG AAATTTTCCG TAGTCCCAAC 2280

ACCTGGTCAT TCAATTGGTA GCGTCAgcTT TGTCTTTGAT GATTTTGTTG TTGTTGGGGA 2340

TGCATTGTTT AAAGGTAGTA TTGGTCGTAC CGATTTACAC ACTGGTGATA TGCAACAATT 2400

ATTACACAGC ATTACGACCT ATCTATTCAC TTTACCAGAA GAATTTCCTG TTTATCCAGG 2460

CCATGGTGAT GCCACGACCA TTCAACATGA AAAAGCCACA AACCCATTTT TTAATTAGAG 2520

ATTTATTCAC TATTTAAAGA AACTTAGGAG GAGTAGCAAT GACTGAAGCA ACGACACTTT 2580

ATATTATTCG TCACGGAAAA ACGATGTTTA ATACGATTGG CCGTACCCAA GGCTGGTCAG 2640

ATACCCCCTT AACCAAACAA GGAGAAGAAG GCATTTATCA TTTAGGCTTA GGTCTTAGGG 2700

ACATAGACTT TAAGGAAGCC TATAGTAGTG ATAGTGGTCG TGCCATGCAA ACCGCTCAGA 2760

TTATTTTACA AGAACATCAA AACCACCAAA AGATTCCTTA CCTAACCGAT AAACGGATTC 2820

GTGAGTGGTG TTTTGGTTCT TTAGATGGTG GCTATGATGG TGAACTTTGG GGTGTGGTGC 2880

CACGCATTTT AGCTTTTAAA AGTTATGAGG ACATGATGAC CACGAAGATT ACCTATCGTG 2940

AGCTAGCAAA TGCTATTATT GAAGCCGATA CGGCTGACTG GGCGGAGCnT TATGAAGTTA 3000 

TCCGTGATCG TGTTTGGTCT GGTTTCGAAG ATATCGCTCA TCATCGAGAA AAAAATGGTG 3060 

GCGGCAAAGT CATGGTGGTT TCCCATGGAC TAACAATTtC CTTnTTACTT tCCTTAATgA 3120 

TGCAAtTTGC CTATGCAGAT GGcGCTGGAG AATGGTAGCG TGACGACCTT AACTTACGAA 3180 

AAAGGAACAT TTACCATnCA AGGAATTAAT GATATTTCTT ATATTGAAAA GGGAAAAAAG 3240

ATCGCTGAAA AGCgCCGTTT GTTGTAAAAA AAGAGAAGGT TATCTGTCAA AATGACAGAT 3300 

AACCTTCTCT TTTTATTTAT TCTTTTAAAC GTTCTACGTT TTTTCTTGAA ATTTGAATTT 3360 

TTTCTAAGGT TTCCTTATAG TTTTTTGCTG CATAAGCAAA AAATAAAACA TCGACTACAT 3420 

ACAATTGTGC CACCAAGGAA ACAGTAGCCG CACTTCTCAA TGGCACGTCT TCCCCACTAG 3480 

AGGTTTGGAG GACGATATCA CTTTTTTCAC CAATCACTGA TTTTTCATCT TGTGTGATAC 3540

TAACAATAGG AATCCCTTGc TGctTCGCTA AATCAGCTAA AGTACTTGCT TCTTTATTCG 3600 

TTCCAGAATT AGAAATAACA ATAAAAACAC TAGGCTTTTC TGTTGAGCCT AACATGGAAG 3660 

CAAATAAATG ATGATCCAAT GTTGTAAATA CCGTTCGGCC TAATCGCGTA AATTTTTGAT 3720 

AAATATCTTG AGCAACCAAC GAAGAGGCGC CCAACCCATA GACAAAAATT ACTTCTGTTT 3780 

TTTCCAACAA ATCAACGGCT TCATTCACTG CGACATCCTC TAATGTCTGc aCgctTCGCT 3840 

CAACCACATG AACAAAGCGC GCCTGTAACT TTTGTTTAAT GTCAGCTGTT GATTCCCCTT 3900 

TAGTAACTTC AGTATACATT TGCTGCTTCA TTTGGCCAAG ATTTGCAGAT AATAAAAGTT 3960 

TCAACTCTGT GAAGCCATTC ACATCCaTAG AATGACAAAA GCGAATAATT GCCGCTGGGC 4020

TTGATCCTGC TTGTTTCGCT AGCTCTTGCG CATTCATTGA AATAACTGTT TCTGTATTTT 4080 

TTAAAATATA TTCTGCGATT TTTCGTTCTG ATTTTGGTAA TTGTTGTAGA CGATCTTGGA 4140

TAGATAAGAC AATATTTTGT TGCATAAACA ACCCCTCCTG ATACACCATT TCCTACGCAA 4200 

TTAAATTTTA ACAAAGAAAA AGGGAGAAGC CAAGTTCTCC TTTTTCTATT TATTATTTAG 4260 

TTGTTTGCAT ATCTTTAGGA ATCCCAAAGA ATAACGTTGC TACAAAACCA CCTGCGTAAG 4320

CAGCCAATAA ACCTAATACA TATGCAAGCC ATTGGTTGTT GGCAATTAAw GGAATTAAAG 4380

CCACCCCACT TGGTCCAATA GCAATCGCAC CAACATTACC GAAgGCCCCA ATCACCGCAC 4440

CACCAATACC ACCGCCGATA CAAGCAGTAA TGAATGGACG TCCTAATGGT AAAGTAACTC 4500

CGTAGATTAA TGGTTCACCA ATACCTAAAA TCCCAACTGG CAAGGCACCT TTAATCATTT 4560

CAACTAATTT TTTATCTGAT TTACAACGAA TCCATAATGC TAAAGCTGCA CCGACTTGTC 4620

CCGCACCAGC CATGGCCAGA ATTGGTAATA ACAAAGTCAT CCCTGTTTGA TTAATCATTT 4680

CAATGTGGAT TGGCGTCAAA ATTTGATGTA AACCAAACAT TACCATTGGC AAsAAGGTTA 4740

AACCAAGAGT AAAGCCAGCA ACCATGCCAC CTTTTTCTAA GACAACATTG ATAATACCAA 4800 
CTAAGCCATT AGAAATAGCT CCGGCTACAG GCATGATTAA GAAAATAGTT GCTAAGCCAA 4860

TCACTAAAAG TGAAATGGTT GGCGTTACGA TAATATCAAT TGATTCTGGG ACAATTTTAT 4920

GCAATTGTTT TTCTAACAAT GATAATAACC ATACGGCAAA AATAACACCG ATAATTCCGC 4980

CTTGTCCTGC AGATAATGTT CCCCCAGTGA AAATATTACT GATTGGAGCA TCGGGATTCA 5040

TTCCTGTTAA CATTGTGACT GCCCCAATAA CACCACCCAA CGCTGGCGTT GCACCAAAAA 5100 

CGCTGGCACT GTTAATCCCA GTGTAAAGAG CTAAATAAGC AAAAATCCCA TTTTTAATGA 5160 

TATTTAAGAC ATCAATATAT TGTTGCCATG AAGCTGAAAT ATCACCAGCA ACAACTAAGT 5220 

TAGACATAAC AGCTGCAATC CCGCCAATAA TACCAGCCCC AACAAAGGCA GGAATCATCG 5280 

GAACAAAGAT ACTAGAAATG GCTTTTAACA CTTTTTTAAA TGGTGAAGTG TTATTTTGTT 5340

TCGCTTTTTG TTGTGCTTTC ATTTGAGCGG CTTTTTCTTC TACTAAATCT TTTCCAGATT 5400

TGCCTGCTGA AGCATCAGTT GTGGTGCCAT GATTAAATGG TTCTCCCAAT TTGACACCTG 5460

CTTGATCTAC CATTTCTTGT GCAACCTTGT TAACAGTCCC TGGACCAACA ACTACTTGTA 5520 

ACGTATCATC TTCTACGACG CCCATAACGC CATCAATTTT TTTCAATCCT TCAATATCCA 5580 

CTTTGTCATA ATCGCGGATA TCCATACGAA CACGAGTCAT ACAATGAATG ACTTTGTCCA 5640 

CGTTTTCTTG ACCACCAACT TCTTTATAAA TCCCATCAGC AATGCGTTTA ATTCTTTCTT 5700 

CTGCCATGTT AAAACACCCT CCTAATTCCT TAATTTAATG TATTCTTGAT AAAtTGaTTC 5760 

CCGTCAATTA ATTTTTGTGT TGCTTCTTCT TTAGTACTAT CAGTTAAAAT CATTACAATT 5820 

GCTAACTTCA CATTTTCTTC TGCTTCTACA AATTTTAGTT CGGCAACTTC ATAACTACAA 5880 

CCCGTTGCTT CCATAATAAT CCGTTTGGAA CGCTCCACTA ATTTTTCATT GGTTGGTTTT 5940

ACATCAACCA TTAAGTTATT ATAGACTTTC CCAATACCAA TCATGGAAAT AGTTGATAAC 6000 

ATATTTAAAA TTAATTTTTG AGCTGTCCCA GATTTCAAGC GAGTCGAACC TGTTAAAAAT 6060 

TCAGGTCCTG CATCTACTTC AATCGGCATT TGCGCATATT TACTAATTTC GGCATTTTTA 6120 

TTACATGCCA CAGTTGCGGT AGCTGCTCCC ACCGTTGTAG CATATTCTAA ACCGCCAATT 6180 

ACATAAGGCG TCCGACCACT GGCTGCAATC CCAACAACAA TGTCATTTGC TGATAATTTT 6240

AGATCAACTA AATCTTGACG ACCTAGTTCT TTTGAATCTT CGGCACCCTC TACTGCGACA 6300 

GTCATTGCTT TTTGACCACC AGCGATGAGG CCTTGAACCA TCTCTGGCTC GACGCCAAAA 6360 

GTTGGTACGC ATTCGGCTGC ATCTAAGACA CCTAACCGAC CACTCGTACC AGCGCCCATG 6420

TAGATCAAGC GGCCGCCTTG AwTGAAGCyT TTGaTGAkGG CTTCTACCAC TGGTTCAATG 6480

GCTGCTAATT CTTGACCCAC TGCCaTTGCA ACTTTTTGGT CTTCTTGATT CATTTTTTGT 6540

AAAGCTTCTT TGACACTCAT CTCATCTAAG CCCATTGTaT TTTCATTTCT 6590 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3660 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
GACATCTGTC AAAACGACCG ATGTCCTACT CTCTAACGAT TAAGATAGTC CGCTAAATGC 60 

GTTTGTTTTA GTCTTTTATA AAAGTTAAAT ACTGAGAATT TATTGTGTAA TTCGTGTACC 120 

ACTTTCTCCT TTGATTGCTT CCGAAGCTAA TTCTAAAAGC CCGATAATCG CCTGGCGGTT 180 

TGGTTTACTT TCTGCAAAAC TGATGGCCGC TTCCACCTTA GGTAACATAC TACCTGGCGC 240 

AAATTGATTT TCTTCTATAT ATTGTTGCAT TGCCGTAACC GTAACATCGG TCAAAGCTTG 300 

TTGATTTGCT TGACCAAAGT TAACAAAAAC ATGTTCCACT GCAGTTAATA CCACAAAAAC 360 

ATCTGCTTCA ATGAGTTCAG CCATTTTTGC TGCCGCAAAG TCTTTATCAA TGACTGCATC 420 

TACACCATGA TAGACTGGaC CATCATAGGc AaTAGGrACC CCGCCACCAC CACAAGCAAT 480

TACGATTTGT CCTGgCATCA ACAAGTGTTC GCAGCCTAAC TTTTTCGTAA ATATCAATGG 540 

TTTTGGnTGA AGGAACTACC CGTCGcCAGC CTCGaCCCGc ATCTTCyTGa TAGACGTTCC 600 

CAGTTTCCGc CATCmACTGt TTGGckGTTT CyTCGtCATA ATAACCACCA ATTGGTTTTG 660 

TTGGTTTTGT AAAcGCTGGG tCTTTTGGaT CCcACCACGA CTTGTGtAAT TAACGTAACA 720 

ACTGGTTGTT CCCCTAAATT TCTGGCTACT AGTTCTTCAT CAATTGCTTG TTGTAAATGA 780 

TAGCCAATAT AGCCTTGGCt CATTGCTGTG CATTCaGGAA ACGGCACGAC ATCTTCTGGA 840 

GATTGACTCG TTTCCTCAAA AGCTAAGCGA ATTTTGCCAA CTTGTGGCCC GTTGCCATGA 900 

GCAATCACCA CTTGATGGCC TTGTTCAATT AAATCAACAA TGGATTTTGC AGCGGTTTGT 960 

GCTTTTGCCA ATTGTTCTTT CGAAGAATTT CCTAACGCAT TTCCACCTAA TGCAATCACT 1020 

ATTTTTGACA TCTGTTTCCC TCCTAGCGCT TTGGTTCTTG TTGTGTGATA CAGTGAATAT 1080 

TGCCGCCCCC ATAAACAACT TCTACCGTAT TAACGCCGAC AATTTTTTTA TCTGGGAACA 1140 

TCGTTTGAAC TTGCTCTAAA GCCAAATGAT CATTTTCATC ACCATACTGT GGCACAATCA 1200 

CGCCATCATT TGTAATTAAA AAGTTCATAT AGGATGCGAT ACAAATATCG CCGTCTTCTC 1260 

GTGGCATGGT TCCTTCCACG AAATCAATTT TGAAGGAGCC TTTAATGGTT ACATTTTTGA 1320 

CAGGACAGCA TAATTTGTGG ACTTTTAATT GGCGTCCTTT AGCATCAGTC ATTTTAAGCA 1380 

AACGTTGATA CGCATCTTGT GCCGCTTCAT AAAATGGGCT ATTTTGATCT TCAGTGTAAA 1440

TACAAGCCAC TTCACCAGGT GCAATAAAAC AArCTACATC ATCCACATGA CCATTGGTTT 1500 

CTTCTGGGTC AATTCCATCA CCTAGCCAAr GAaCTTTTTC AACATTTAAA TAGTCGCATA 1560 

ATTTTtGTTC GATTGCTTCT TTTGAAAGTT GTGGATTTCG ACCTTCACTT AATAAGCACA 1620 

TTTCTGTAGT CAAGACTGTT CCTTGTCCAT CTACGTGGAA TGAACCGCCT TCTAAAACAA 1680

AATCATCTGT ACGATAAGAA TCTACATGCT CAATTTCACA AATTTTTTGT GCAACCAAAT 1740 

CGTCTTGGTC CCAAGGGAAA TATAAACCAT CTACTAATCC CCCCCAAGCA TTGAATGTCC 1800 
AATCCACACC ACGGATTTCT CCATGGTCGT TAATGACAAA TGATGGTCCA CAATCACGAA 1860

CCCAGGCATC ATTGTTTGAC ATTTCGTAAA CCGTAATTTC TGGCGGCAAT TGGCGACGGC 1920 

AATTTTGAAA TTGTTGTTGA GAAACCACCA CATTCATCGG TGTAAATTGA CTAATTGCTT 1980 

TGGCCACATT AGTAAAAGCT TCTTGGACAG GTTTACCACC ATCACGCCAA TTATCCGGTC 2040 

TTTCTGGCCA AATCATCCAA ACTTTCTCTT GTGGTTCAAA TTCTCCTGGC ATTCTAAAGC 2100 

CATCTTGtTT AGGGGTACTT CCTACAATTC GTTTAGCCAT TTTCTTTCGT TCCCTTCTTA 2160 

ATTTTTTTCA CGTAAATAAT TACTTCACCA AACAGAATAA AAATGATTGC CCCAATTGTG 2220

ATTGGTAATT TTTCATTTAA TGCTGCGGGA CTAAAATCTA AAGGcACTGC TGTGAAAATC 2280 

AGTGAAATAA TAATCATAAT CATTGGCAAA ACCACTAAAA CTTTTAAAAA TCCTGGTTTT 2340 

CCGTTAACTT TAAACGGACG TGGTGTATCG GGATCAATTT TTCTAAGCTT ATAAAAAGCT 2400 

GGGAAAACTG GGATATACGA TAACAAGAAT AGGACGACAT TCAAGGAGAA AAAGGCCCAG 2460

AATAAATCTT GATTAGGTAA AATTGGTGCT AATACAATGA CCACCGATGC CACAATGCCA 2520 

TTCATCAAAG CAGCCCCAAT TGGCATGTCA TTTTTCTTAC TACGTTTTTC AAAAAATTTA 2580 

GGCATGTCAC CATTTTCAGC AGCATAACAA GCTGTATTAT TGACTCCTAA TGACCAAGAA 2640 

ATCATGTTTC CGAACAACGT AAGAAGGAAT AGGAAAGCCA TCAACATAAT GAACCAACCT 2700 

TCTGTCGAGC CTGTCAATAG TTTAAAGCTA TCCATCATAC CACTGCCGGT ACTAATTTGA 2760

TCGGTGGGAA TCGCTACCCC AATTCCAAAG GCTGAAAAAA TATAAATGGC CGCAATGACT 2820 

AACCCCGCTG CTACAATAGC TTGTGGAATT TGCTTCTTAG GATTTTCCAT ATCATCTGCA 2880 

AACGTACAGA TGACTTCAAA CCCTAACAAA TTAAAAATGA TTACAGAAAT AAAAGATAAA 2940 

CTCCTTAAGT CAAACGTTGG CAGCATTGAT TTCAAGGTAA ATTCATTGGC AACACCTTTC 3000 

GTCAAGGCTA CATACAATCC TAGCCCACCA ATTAAGACAG CTAAAACCAT CTTGATTACG 3060 

GCAGCTCCAT TTAGAATCCA GATACTGTCG CTGACTGGAT AAAAACTAAT CCAAACCACA 3120 

ATCCAAGTAA AAATTAATTC AACAATAATC ATCATCGGTG TTGAAAACTT AATACCTGTA 3180 

ATTGTCGTTA ATAGTTCTGG CGTCATAACG GCTAATGAAG CTAGCCACAA TGGATAATTA 3240

ATCCAATAAT ACCAAGAAAC TCTGGCACCC CAACGATGAC CATATGCTTT SGTCACCCAA 3300 

TCATAGATGC CCCCTTCACC AATATAAGTT GTTCCTAATT CTGATGAAAT TAACCCATAT 3360 

GGCAATAGGA ATGCAATTAA TAAGAAAATC CACCAAAAAA ACTGAGAATT ACCAATTGCA 3420 

GCAACAGGCG CCGCTGCTTC AGCAACGAAA ACCACACAGA TCACTGATAA AATGGCACTC 3480 

GTTAAACTAA ATTTCTTTTT CCCCTCCATG AAGACTCTTC CTCTCTTTAT AAACTGTCCA 3540

CTGATTCATT TTTTCAATCA GTGGACAGTT GGCCGTTAAA TGCTTTGAGT GTCTAAGAAA 3600 

ACTTCTAGTT CTTTTTTCGC TTCTGcTTGC TTGATTAAAT CATAAGGATT TTTgCTTCAT 3660

(2) INFORMATION FOR SEQ ID NO: 113:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



AATGACAACA TTTACAAAAA AGTTTGAGGC AAAAATTAAA ACGGCGGACA wTAAaCTTaA 60

AGTAAAAaTT TTGTTTTTAA AAaTTGCGAA CTTAATAAGT GGGCTTTTTC TTTTTCGTTC 120

AACCATTATA AATAATCCTA ATGCGATTAC TGCAATAATG AACAaTAAaT AGGATTGTAA 180

CGAGCCaAAC CCGCTCTCTT GTCCGAGGAA AATCCCGCCG AAGAACGTCA TAATCGCAAT 240

GGCAATACAA GCAAATCCTG AAAAATCGAT TTTTTCTTTT GTTTTTGTAA TATCTTTTGG 300

TAGGAACTTT TCGCCAATTA AAATTGTAAC TAGCCCCACG GGAACATTAA TCCAAAAAAT 360

ATACGACCAA GAAAAGTTTG ATAAGATCAA TCCACCAATC CCAGGACCAG CAATCGCGCC 420

TAAAGAGACA AAGGCACCTA CTGCACCAAG CGCCCGCCCT CTTTCATTTA AAGGGAATAC 480

TTCCGTAATA ATTCCTGAAT TGGTCGCCAT GGTCATGCTA GCACCAATAC TTTGAACGAC 540

TCGCGCAAAT AATAGAAAGC TCAGCGATTG ATTAAAGCCA CAAAGTAACG AACCAATCGT 600

AAAAATAACC GTGCCAATTC GATAAACTTT TATTTTTCCA AAACTGTCGC CAATTTTTCC 660

AAATAATAAT AAACAAGCAC AGACAACCAT TAAATAAATA GATACGACCC ATTCCGCTTG 720

ATTCATGGGT ACGGACATTT CTTtTGAAAT AGTCGGTAAa GCAATATTTA CAATGCTGGA 780

ATCTAATGTA GACATGAAGG TAAACATAGC CACAGAAACT AAGATCC 827



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

AAGGCATGAA GTAATnGGGA TGGATTTATT AATCGrATGA AGyCAtATCa GGAAGAAGAA 60 

GCAGTTTGCT TTACAGATCG TTTGAAAGAG CAGGTGGAGA AAAATGGCTA GAAAAAGTCG 120 

TATAAAAACA CCGATTGcGG AACAAGCACC TACTATCAGA AAGAATAATT TTGAAGAGGT 180 

TTGCTTAGGC TATACATTAG CAGAAGGGCA AGAAGAAGCG ATTCGCTGTT TACAATGTAA 240

AAATGCACCT TGTATTGCTA AATGCCCCGT GATGATTGAT ATTCCTGGAT TTATTTTAGC 300 

TATTCGTGAA GGAAACATGG AACAAGCAGC CGATATTTTA AGTAAATATA CGAATTTACC 360 

TGCTATTTGT GGACGCGTTT GTCCGCAAGA GAAACAATGT GAACAAGTTT GTAAATTAGG 420 

GAAAGCCAAA AACTTTGAAC CAGTGGCAAT TGGCAAATTA GAACGTTTAG TTGCCGATTG 480

GGCATTAGAA AATCACAAGT TTCCTAAGAA AACAGCGGAA CCAACAAAAG GGAAAATCGC 540 

AGTGGTCGGT TCTGGTCCTT CTGGGTTAAC AGTTGCGGGG GATTTGTCAA AGCTAGGCTA 600 
CGAAGTAATT ATTTTTGAAG CGCTTCATGA AGCTGGTGGC GXATTAACCT ATGGCATACC 660

TGAATTTCGA TTACCGAAAA AAATTGTTAA ACAAGAAATT GCGAGTATTG AAGCCTTAGG 720 

TGTAACAATA GAAACCAATG TGGTAGTAGG TAAAACGATT ACGATGGAAG AAATAATGTC 780 

AGAGTTTGAT GCCTGTTATC TATCGGTTGG TGCAGGGGCA CCAAATTTTA TGGGAATCCC 840 

AGGAACTAGT TTAAATGGGG TTTATTCTTC TAGTGAATTT TTAACACGAA TCAATTTAAT 900 

GCATAGCTAT GAGTTTCCAG AATATGATAC ACCGATTAAA CGTGCTAAAA ATGTTGTTGT 960 

GATTGGCGGC GGGAATGTTG CAATGGATGC CGCTCGTTCA GCCAAACGAT TAGGCGCTGA 1020 

AAATGTGAAC ATTGTGTATC GTCGTTCGTT AGAAGAATTG CCAGCAAGAA TTGAAGAATA 1080 

TCACCATTCC CTTGAAGAAG GAATTAACTA TTATTGGTTA ACGAATCCAA TTGCTTATTT 1140 

AGATGACCAA CAAGGAAATC TAGCCGGTGT TGAATGTGTC AAAATGGTTT TAGGTGAACC 1200 

AGATGCGTCT GGAAGACGAC GGCCAGAACC AATTCCAAAC AGCACGTTTA CCATTCCTGC 1260 

TGATGCAGTA ATCGAGGCAA TTGGCCAAGG GGCAAATCGA GTCTTGTTAT CAACATATCC 1320 

AGAAATAGAA CTAAACCAAT GGGGTTACAT TCAAGCTGAT CCTAAAACAG GAGCAACCTC 1380 

AATACCTGGC GTTTTTGCTG GGGGCGATAT AGTCACAGGT GCTGCCACCG TTATTTTAGC 1440

GATGGGCGCT GGGAAAATCG CAGCTAATGC AATTGATCAG TATGTCAAAA CACAAAAAAC 1500 

aACAGTCACA AGCAATGTTT AGGAAAGATA AAAGGTGAAA AAATGAAAAA AACAATGGAT 1560 

GGAAACACCG CAGCGGCTTA TATTTCATAT GCGTTTACAG AAGTTGCCGC TATTTATCCT 1620 

ATCACACCAA GTTCAACAAT GGCTGAGTTG GTAGATGAAT GGGCGGAAAG TGGTTTAAAA 1680 

AATATTTATG GTCAAAAAGT TCAAGTTATT GAAATGCAAT CAGAAGCTGG GGCAGCGGGA 1740 

GTAGTTCATG GTTCGTTAAA AACAGGTGCG TTAACCACTA CTTATACTGC TTCACAAGGT 1800 
TTACTGTTGA TGATTCCCAA TATGTATAAA ATCGCAGGAG AATTGCTGCC TTCTGTGTTC 1860

CACGTGGCGG CTCGGGCAGT AACGACAAGT GCGTTGAGTA TTTTTGGTGA TCATGGCGAT 1920 

GTgATGGCTA CCCGCCAAAC AGGTTTTTGT ATGTTAGCAG AATCGAGTGT TCAAGAAGTG 1980 

ATGGATTTGT CGGCAGTTGC TCATTTAGCT AGCTTAGAAG GTAGTTTACC TTTTGTAAAC 2040 

TTTTTTGATG GTTTTAGAAC GAGCCATGAA CTACAAAAAA TCGAAGTGAT TGATTATGAT 2100 

GACCTGAAAG AAATGGTGAA TCAGGAAGCT GTAACTCGTT TTAGAATGGG AGGCATGAAT 2160 

CCTAATCATC CCACTGTTTC AGGGACCGCT CAAAATCCTG ACATTCATTT TCAACAAAGA 2220 

GAAACAGTCA ATAAAAATTA CGAAGAAATG CCAAAAATTG TTCAAAAATA CATGAAAAAA 2280 

ATTAACAATC TTCGTGGAAC AACGTATGAT TTAACCGATT ATTACGGGGT TGAAGATGCA 2340 

ACAGAAGTCA TTATTTCAAT GGGGtCGgct TCGCCAGTAA TCAAACAAAC AGTCGACTAT 2400 

CTCAATCAAC AAGGACGCAA AGTTGGTTTT ATCAATATTC ATTTATATCG TCCATTTCCT 2460

ACCGAAAACC TTCTTGAAAA ATTACCAGCT ACAGTAGAAA AGGTTGCGGT TTTAGATCGA 2520 

ACAAAAGAAT CGGGAGCCGA AGGGGAACCG TTATTGCTAG ATGTTCAAAG CGCTCTCTAT 2580 
CAACATGAAA ATCGACCAAT TGTGATTGGT GGACGTTATG GGXTAGGCTC TAAAGACGTA 2640

GCGCCTAATC AAATTAAAGC GATCTATGAT CATTTGTTAT TGCCATTTAA AGAATTAAAA 2700 

CAACGTTTTA CGGTAGGGAT CGTTGATGAT GTAACCTATC AATCATTGCC AGAAGAACCT 2760

ACTTTAGATT TAACGCCAGA TACTACCTTC CAAGCAAAAT TTTGGGGCTT TGGTTCTGAC 2820

GGAACTGTGG GAGCAAACAA GCAAGCGATT AAAATTATCG GAGATCATAC CGACCTTTAT 2880

GCACAAGCTT ACTTTAGTTA TGATTCTAAA AAATCAGGCG GTTTAACCGT TTCGCATCTG 2940

CGTTTTGGTA AAGAGCCAAT TACTTCCACG TATTTAATTG AACAAGCAGA TTTCATCGCT 3000 

TGTCATAATG CTTCCTATTT GCATAAATAC GACCTGTTAA AAGGTTTAAA AGATGGCGGA 3060 

ACGTTTTTAC TGAATACTAT TTGCGATCAA GAAAAGATTC ATCGTTTATT ACCGGCTAAG 3120 

TTAAAAAAAT ATATTGGGGA GCACCAAATC AAATTTTATA TTATCAATGC GGTTGAGTTA 3180 

GCTCGCAAAG TAGGCTTAGG CCGTCGGATC AATACAGTGA TGTCTACTGC CTTTTTTGAA 3240 

GTAACAGATA TTTTAACGAA AGAGCAATAT TTACCGTTAC TTAAAGAAGA AGTGAAAAAG 3300 

ACTTATGGAA AAAAATCAAT GGAAATTGTT GAAAAAAATT ACCAAGCGAT TGATGCTACC 3360 

TTTGAATCGC TGCAAGAAAT TGTAGTTCCA GAAGATTGGG CAACCATTGT TTTAGAGCCA 3420 

GAAAATCAAG AGTCGGAaGC TCCAGCTTTT GTGACAAATA TTTTAGAACC AATCAATCGT 3480 

CaAGAAGGGG ATCAGCTGAC CGTTTCAGAT TTGATTGAAA ACGGAATGAA GGGCGGTGCC 3540 

ATTCCTATGG GCACTGCGGC TTATGAAAAA CGTGGCATTG CGTTAGAAGT GCCTGAGTGG 3600 

CAAATGGATA AGTGTACGAT GTGTAATGAA TGTGCCTTTG TTTGTCCACA TGCGGCTATT 3660 

CGACCGTTTT TAGCGGATGA AGAAGAACTG AAAGAAGCAC CTGCAGGTTT TGCCATGCGC 3720 

GAAATGCGTG GCACAGATGG GTTGATGTAT CGTATCCAAG TTTCTTTAGA AGACTGTACA 3780 

GGTTGTGGTC TTTGTGTTCA GGCATGTCCC GTGAAAGACA AGGCGATTTT AATGAAACCT 3840 

TATGAAACTC AGAAAGAGCA AGCGATGAAT TGGGCCTTTG CGATGACTCT TAAACAAAAA 3900 

GCCAACCCAG TCAAAAAACT TTCTGTTAAA GGTTCCCAAT TTAACCAGCC ATTGATGGAA 3960 

TTTTCAGGTG CTTGTGAAGG CTGTGGCGAA ACACCTTATA TTAAGTTGTT GACGCAATTA 4020

TTTGGTGATC GAATGATGAT TGCTAACGCA ACAGGCTGTT CATCAATTTG GGGTGGTTCG 4080 

TCACCGGTGA CGCCTTATAC AACGAACGAA TGTGGCCAAG GACCAGCTTG GAGTAATTCG 4140 

CTGTTTGAAG ATAATGCGGA ATATGGTTAT GGTATGTATA TTGCCAATCG AACGAAACGT 4200 

CAACATTTAG CTAGTCTGGT TGAAGAAAGT CTTGCTAAAA ACGTCGGATC AGATTCACTT 4260 

CAAGCCTTGT TAAATGATTG GTTGGAACAC ATGGCTGAAG GGGAAGGAAC ACAACAGCGA 4320

GCCACCAAAC TAGCGGCAGC ATTAAGTGAA GAAGCCGATG AAGATCCACT TTTAACAAAA 4380 

ATTTACGAAC AAAAAGATTT ACTAGTGAAA ACGAGTCAAT GGATTGTCGG CGGCGATGGT 4440

TGGGCCTATG ATATCGGGTT TAGCGGCATT GATCATGTTC TTGCCAGTGG GGAAGATGTT 4500

AATATTTTTG TTATGGATAA CGAAGTGTAT GCCAATACAG GCGGACAAAC ATCTAAAGCT 4560

ACACCAGCTT CAGCCATTGC CAAATTCTCA GCGGGTGGGA AACATACCTC TAAAAAAGAT 4620

TTAGGGATGA TGGCAATCAC -T.T&TGGGAAT GTCTATGTTG CACAAGTGGC TTTAGGCGCA 4680

AACTCAATGC AAACGATTAA AGCAATTGAC GAAGCGGAGC GGTACCCAGG CCCATCAATT 4740

ATCATCGGTT ATACCCCATG TATTAATCAC GGGGTCAAAG GCGGAATGGT TCGGACGTTA 4800

GATCAAGCCA AAGAAGCCGT TGAATCTGGC TATTGGCCAC TTTATCGCTA TAATCCAGAG 4860

TTAGTTAAGA AAGGCAAAGA TCCGATGGTC ATTGACTACA AGAAAACAGA TTTTGACAAA 4920

ATGCGTGATT TCCTTGAAAA ACAAACACGT TACTCTGCGT TGCATACAAT TAAACAGAAT 4980

CAAGAAGTAG TCGAACATTT ACTTGATAAA ACTAAAGAAG ATGCAATTGA ACGCTCTGAA 5040

AGTTATAACA AACTAGTCAC ACACATGAAA AAATAAAAAA ACAAAGCCTC TTACTTGAGA 5100

GGCTTTGTTT TTTTATCCAA TAATAATTTT TTCTGAAGGG TATTCGTAGC CTAAGTCACG 5160

TGTTTCTTTT GGTACAAAAA TCATTAAGGT ATAAAGCATA CCAATACGTC CAATAAACAT 5220

TAATAAGGCA ATCATTAATT TACCTACTGT TGTTAAATCG TCTGTAATAC CCAAAGAAAG 5280

ACCGGTTGTC CCAAAGGCAG AAGCGACTTC CACAATGATT GAAATCAAGG GCAGATTTTC 5340

AGTGGCTGAC AAAAAGACGA CAGCGAAAAA GCACATAATC AAAGATAACA TGAAGACGAC 5400

AATCGATTTT TTGACATCAT CGTCATCTAT TTTACGGCTA AAGACGCTAA TATCATCTTC 5460

ACTTTTTAAA AAGGCTAACA AATACAAGCC AATAATCGCA ACAGTCGTTG TTCGAACACC 5520

ACCACCAACA GAGCTAGGAC TACAACCAAT AAACATTAAC ACAGAGAAAA TAATCAACGT 5580

TGTAATTTGG AAGTCCCCTA AATCATTAAT TTGGAGGCCA GCGTTTCGTG TAGTCATCGA 5640

ATAAAACATC GAACTTACCC AACGGCCACT TTCATTCATT GTCAAAAATA GGTGATCTTT 5700

TTCTAATAAG TAAATCAAAA TAGTCCCACC AATAAACAGA ACAATAAACG CTAGCAAAGC 5760

AATTTTACTA AACAATGAGA AGCGGAAAGG CAAGCCTCGT TTCTTCTTAC GAAAGTTCTT 5820

TTTCTTAAAA TGTAACCACT CATGAATTTC CAGTAAAACG GGAAATCCAA TCCCGCCGAT 5880

AAAAATCAAG AACATAATTG CCAGTAAAAA GAGATAATCA TGAGCAAAAG GAATAATTGA 5940

ATCACCTGAA ATATCAAAGC CTGAATTAGT CACTGCAGAA ATAGCTTGAT AAAATCCATA 6000

AAATATCGCA TCTCGCCAGT TCGTATAATA ACCACTTATA TAAAAATAAA GCGAAAAGAG 6060

CGAGCCGAAG ATTACTTGGA ACCAAATTAA AATCAAAAAA GTGGTGCGGA TCAGTCGAAC 6120

AATTCCGCTC AGTCGCGGCT GATTCATGTC AGTCATAATT AAcTGACGTT GCTTTAATGT 6180

GATTCGCCGT TTTGATAAAA TAAAAAAGAC CGTTGAAATC ATCATAATCC CTAAGCCACC 6240

AACTTGGAAA AGAACCTCCA ATAAGACGAT CCCACGATCA TTAAAAACCG AATTAATATC 6300

GACTGTGGTG AGTCCaGTAA CACTGACTGT ACTGATTGCT AAGAAAAATA AATCAATAAA 6360

AGGAACATGC GAGTCAGGTT CTCTGAAAAT AGGGAGGCAA AACAATAAAT AAGAAATAAC 6420

CGTCATTAAT ATGTAATAAG CGACGATGAT TTGAATAGAT GAAAAATTTT TGCGAATAAA 6480

TCGATCGCTt cTctTTCGCT GTGCCAGCCA CAGCAAACGT GTATGGTTCA CTAGATaACy 6540

TCCCaATTCT AATTAAACTA mGTATACCAA AACTTTTCGC CTCAAACGAA TAATTCTGCC 6600

AGAAAATCCT TGAAATGACA AATAAAAATA AGAAATTCGT .TAGAA&TMT. TGGAAAACAT 6660

CCACAAACAA GCGAGAACAT GATATTTTAT GAAAP.AATAG AAAAATAGGT TGTTCCCCAA 6720

ACAACTTATT AATAATTGGT CAGTTTGTCT AAPGATAPAA PTTTAAAAAG TAGTGGTAGC 6780

TTTTTTATTT CGGTTGATAT GTGAATGTGT TPAPTTTATG TAAPAAACTG TTATTTTTAT 6840

TCAAATGAAA AATAAGAAAT TGTGTTGGTA GAAATPAATA GTTTATAAGG TGGGAAAACA 6900

ATGAAAAAAG TGTTAATGGG AGTTCTTTCC PTAGGGTTAT TATTAGGTGP AGCAACAGGT 6960

TGTACAAGCG ATCAAGaAaA AGCCGCTGGT AAAAPAAAAG PGTPTTPTGA AaAAACAGAA 7020


GCAACATCTG 


GTGCCTCTGC 


TAATGGTTAT 


APAGATPPAA 


f^TP-A APT A A A 


AGACAGTTAT 


7080 


GATGTTGTTA 


TTGTTGGTTC 


AGGTGGTGCG 


PPPATP31PPP 


PTPPATT APA 


AGCCAAAGAA 


7140 


GCTGGCATGA 


ATCCAGTTAT 


TTTAGAAAAA 


Al X O 1 J. K3 


LAblj 1 (jLjvxAA 


TACAATTAAA 


7200 


TCGTCTTCAG 


GAATGAATGC 


TTCACAAACA 


AAA I ICUAAb 


AAAAAb AA Kj \j 


CATTAAAGAT 


7260 


AGCAACGATA 


AATTCTTTGA 


AGAAACACTA 


AAAbbUljCj 1 A 




CGATCAAGAA 


7320 


TTACTACGTT 


ATTTTGTTGA 


TCATTCAGCT 


pn nPPB. HTTP 
bAAlaUAAx io 


BTTPPTTZiPH 
Al ibbl IAoA 


TACAAAAGGr 


7380 


ATTACATTGA 


GCAATTTAAC 


GATTACAGGT 




A A A A APnPAP 
AAAAA^vjUrlL, 


ACACCGACCT 


7440 


GCTGACGGAT 


CAGCAATTGG 


GGGCTATCTA 


PTTP APPPTT 
X X bAUub 1 i 


TAPT APPTA A 
1 A« J. AUta i AA 


CGTTCGCGAA 


7500 


GAAAAAATTC 


CATTGTTTGT 


TGATGCTGAC 


PTPAPAGAPT 


TAP.TTP.A AGa 


AAACGGTCaA 


7560 


ATCGATGGCG 


TAAAAGTGAA 


AATGAAAGAT 


P.ATAAAP,AAA 


A A APAP.TTA A 


AGCAAAAGCA 


7620 


GTCGTTGTCA 


CAACAGGTGG 


TTTCGGTGCC 


Ar-GAAAAATT 


A ATT AP APA A 


TACAAACCAG 


7680 


AATTGAAAAA CTATGTTACA ACGAACCAAG AAGGPAPAAP AGGTGAPGGT ATCCAAATGA 7740

TTCAAAAAGT TGGcGGTGCG TTagTGGATA TGAAAGAAAT TPAAATTPAP CCAACTGTTC 7800

AACAAAGCGA TGCTTTCTTA ATTGGTGAAG PAGTTPGTGG AGAAGGPGPA ATCTTGGCTT 7860

CTCAAAAAGG AGAACGTTTT GTTAATGAGT TAGATAPAPG PGAPAAAGTT TCTGCAGCAA 7920

TCAATGCCTT ACCAGAAAAA TCAGCTTATT TAGTPTTTGA PPAAGGPGTT CGCGACCGTG 7980

CAAAAGCCAT TGATTTCTAT GATCAAAAAG gtttpgtgga AAAAGGTGAA ACGATTGAAG 8040

AATTAGCTGA AAAAATTGGC ATGCCTGCCG ATAPATTGAA AGPAAPGATT GATACTTGGA 8100

ACCAAGATGT TAATGCCAAA GATGACAAAC AATTTGGCCG GACAACTGGA ATGGAAGCTG 8160

ACTTAAGCAC AGCCCCTTAC TACGCAATTA AAATTGCACC AGGGATTCAC CACACAATGG 8220

GTGGCGTGAA AATCAACACG AAAACAGAAG TTTTACGTGA AGATGGCACA CCAATTAAAG 8280

GTCTATATGC TGCTGGTGAA TTAACAGGTG GCTTACATGG TCaAAACCGT ATCGGAGGCA 8340

ATGCAATCGC CGATATTATC ATTTATGGTC GTCAAGcAGG TACGCAATCA GCAGAATTTG 8400

CCTCTGCTCA AAAATnACTA GCATTAAAAA GTCTGAAATT GTAGTCAAAT AACTACAATT 8460

TCGGGCTTTT TTATTGCATA CTAGCAGGAT CATTAGTTGG AGTATAAATA GCCCTACTAA 8520

AATGACTTTG GCGTAGATTC TGTTTTTTTC AGAAAAATGC ACGACCATTT TAGCTAAAAA 8580

TACTGCTTTT TTGCTGAAAA TATAGTAAAA TACGTAAGGA ACTAACGGAA AGGTGTAGTG 8640

AACATGGCTC AATTATTTTT CAAGTATGGC GCAATGAACA GTGGTAAAAC CATTGAAATC 8700

TTAAAAGTAG CACACAATTA CGAAGAACAA GACAAACCAG TTGTTTTGAT GACCAGTGGA 8760

CTAGATACAC GTGATGGTGT TGGGAAAGTA TCCAGTCGCA TCGGATTACG CAGAGATGCT 8820

ATTCCAATTT TTGAAGAAAC CAATGTCTTT GACTTGATTA ATGATTTGTC TTACAAACCT 8880

TTTTGCGTAT TAGTCGATGA ATGTCAATTT TTAAACAAGC ATCATGTCAT TGAATTTGCA 8940

CGAATTGTTG ACGAATTAGA TATTCCAGTA AtGGCATTTG GGcTAAAGAA TGATTTTAGA 9000

AATGAATTAT TTGAAGGCTC AAAGTATTTA CTTTTATATG CAGACAAATT AGAaGAaTTA 9060

AAAACGATTT GCTGGTTCTG TCATAAAAAA GCAACTATGA ACTTGCACTA TATTGATGGC 9120

AACCCCGTTT ATGAAGGGGA TCAAGTCCAA ATTGGTGGGA ATGAAGCCTA CTATCCTGTT 9180

TGTCGCAAAC ATTATTTTCA TCCAAAAACA GTGAATGAAG AACAGTAAGG AGAGACACTA 9240

TGTACGATCA ATTwCAATCT ATTGAAGATC GCTATGAGGA ATTGGGCGAA TTATTAAGTG 9300

ACCCAGCAGT CATTAGTGAT ACAAAACGTT TCATGGAATT ATCGAAGGAA GAAGCTAATA 9360

CACGTGAGAC GGTTGAGGTT TATCGCCGAT ATAAACAAGT TGTTGAAGGG ATTAGCGATG 9420

CCGAAGAACT TTTAAGTGAA AACTTAGATG CGGAAATGGC CGAAATGGCG AAAGAAGAAC 9480

TTTCTGATTT GAAAAAAGAA AAAGAAGTGT TAGAAGACCG CATTAAAATT TTATTATTAC 9540

CAAAAGATCC AAATGATGAT AAAAACATCA TCATGGAAAT CCGTGGTGCA GCCGGTGGCG 9600

ATGAAGCCGC ACTTTTTGCT GGCGACTTAT TTAATATGTA CCAAAAATAT GCAGAAGCAC 9660

AAGGGTGGAA AGCGGAAGTT TTAGAAGCGA ACGTCACTGG AATCGGTGGA TATAAAGAAG 9720

TAATTATGAT GATTTCAGGA GATAACGTCT TTTCAAAATT AAAATATGAA AGTGGCGCTC 9780

ACCGCGTGCA ACGAGTTCCT TCAACAGAAT CACmAGGACG GATTCATACA TCGACAGCAA 9840

CCGTTGTAGT TATGCCGGAA GCGGAAGAAG TCGAAATCGA ATTAGCTGAT AAAGATATCC 9900

GTGTCGATAT TTATCATGCA AGTGGCGCTG GTGGACAGCA CGTCAATAAA ACCGCTTCGG 9960

CTGTTCGATT AACCCATTTA CCAACAGGAA TCGTAGTCGC AATGCAAGAT GAACGTTCAC 10020

AGCTAAAAAA CCGTGAAAAA GCAATGAAAG TTTTGCGTGC TCGTGTGTAT GACCAAATTC 10080

AACAAGAAGC ACAAAGTGAA TATGATGCTA ACCGTAAATC GGCGGTGGGG ACCGGAGATC 10140

GTTCAGAACG AATCAGAACG TATAACTTCC CGCAAAATCG AGTAACGGAT CATCGTATCG 10200

GTTTAACCAT CCAAAAACTA GACCAGATTC TTGCTGGCAA ATTAGATGAA ATCATTGATG 10260

CATTAGTCTT ATATGATCAA ACATCGAAAT TAGAAGAGAT GCAAAATGGC TAAACGTTAC 10320

TTTGAAGTCC TTAACTGGGC TTCTTCTTTT TTAGAGGCGC AAGGCAAAGA AGGCTACGCT 10380

ATTCACTATG TTTTTTTAGA ACGTAAAGGT TGGGATAAAA CGCAGTGgCT CCTGCATATG 10440

CAAGAAGAAA TGCCCCAAGA AGAAGAGGAA CAATTAAAAA CGGATTTAGC CCAATTATTG 10500

ACAGATTACC CAGCACAATA TTTACTAGGA CAAGCTGAAT TTTATGGACA TTCCTTTATT 10560

GTTAATGAAC ATACTTTGAT TCCTAGGCCA GAAACAGAAG AATTAGTCGA GCGTTGCTTA 10620

AAAGCCAATC CAGATACTCC TTTGACGGTC GTGGATGTCG GAACAGGGAC AGGTGCCATT 10680

GCGGTTAGCT TGAAACTGGC ACGGCCCAAC TGGCGAGTTA TTGCCATTGA TCTTTCAGAG 10740

GAAGCGTTGA CTGTCGCTAA ACAAAACGCG CAAGCATTAG GTGCTGGCAT CGAATTTTAT 10800

CATGGGAATG GCTTACAGCC AGTTGCTTCT GAAAAAATTG ATCTGCTTAT TTCTAATCCG 10860

CCCTATATTA GTGAACAAGA ATGGTATTTA ATGGATGCAA GTGTCCGAAC GTATGAACCT 10920

AAAACCGCTT TATTCGCTGA AAATAACGGT TTGGCTTTAT ACCAACAATT AATTCATGAA 10980

AGTCAAACGA TGCTTAAAGC AGACGGTAAA ATTTATTTTG AAATAGGGTT TCAACAAGGC 11040

GCGGCCCTCC AAGAATTATT GAGCGCCGCC TATCCACAAA AAACAATTAA AATTGAAAAA 11100

GATTTATCTG GCAACGATCG CCTCGCGATA GCCGAGTAAC TAAGGTTCTT CGTCGCCACT 11160

TTTCTAGAAG GAGCGATTTG CAATGG A/VAC AAAAAAATTA ACCAAACAAG AAATAGCCCA 11220

AGCAGCAGCA GCCATTCAAG CAGGCGAGCT AGTGGCTTTT CCAACAGAAA CTGTTTATGG 11280

CTTAGGTGCC GATGCCTTAA ATGAAGCGGC AGTAAAAAAA GTTTACCAAG TAAAAGGCCG 11340

CCCCAGTGAT AATCCATTGA TTGTTCATGT GAACAACGTT GAGATGGTCA AAAAATACGT 11400

GGCGACCTTA CCAGAAGTCG CTACAAAATT GGTGTCTGCC TTTTGGCCGG GACCATTAAC 11460

TTTAATTTTT AATGTGCCTA GCGATACATT TTCTAAAACA GTAACAGGTG GTTTAGAGAC 11520

AGTCGCTTTT CGTATGCCAG ATAACCAAGC GACCTTAGCT TTGATTGAAC AAGCCAATGT 11580

TCCGTTGGTC GGACCAAGTG CGAACACTTC TGGCAAGCCT AGTCCAACAT CTGCAGAGCA 11640

TGTTTATCAT GATTTGCAGG GGAAAATTGC TGCCATATTA GATGATGGCC CAACGCAGAT 11700

TGGGGTGGAA TCAACGGTCT TGGATTTAAC GGCGGAAGAT GGTGTGCCAG TTATTTTACG 11760

ACCAGGAGCA ATTACCAAAG AGCAGTTGGA AAGCATTATT CCCCAAGTGA AAATCGATAC 11820

ACATTTGATT AGTGAAACTG CAGCACCCAA AGCGCCAGGG ATGAAATACA AACACTATTC 11880

TCCTGATGCG GAAGTTTGGA TTATAAGCGG TCAAACAGCA GAATGGCAAG CGGCGATTCA 11940

GCAAGCAAAA GAACAACAGG AAAAAATCGG CTTATTTTTA AGTGATGAAC AGGCAGCACA 12000

ACTAGATACA GAAAATGCGT TTGTTTATTC TTATGGTGCA GAAACGGTTG AAAATGCCAC 12060

AAAAGAACTA TTTGCAGGGT TAAGAGCGTT AGATGAGCAA GGAGCTACCA CGATTTTTGC 12120

GCAAgGATTT GCAGAAACTG GTTTAGGCAC TGCTTACATG AATCGCTTGA AAAAaTCAGC 12180

CAATCAAAAA TTTTTTGAAA AATAAGAGGT ATTTGTAACG AAGCTGACAC AGAAGTATGA 12240

TACAATCAGA CAAAAGAAAA CAAAATGGAG TGTTGTTTGA ATGGmTTACA AAACGTATGA 12300

CCCAGATTTA TGGAATGCAA TTGCAAGAGA AGAAGAGCGC CAAGAAAATA ACTTGGAACT 12360

AATCGCATCT GAGAATGTCG TGTCAAAAGC AGTTATGGCT GCCCAAGGAA GTATTTTAAC 12420

GAATAAATAC GCAGAAGGTT ACCCTGGCAA ACGGTACTAT GGTGGTTGTG AATTTATCGA 12480

TATCGTAGAA AATTTAGCTA TCGATCGTGC CAAAGAATTA TTTGGTGCAA AATTCGCGAA 12540

TGTACAAGCC CATTCAGGTT CTCAAGCCAA TACAGCGGCA TACCTTTCAT TGGTTGAACC 12600

AGGTGATACC ATTTTGGGGA TGGATTTATC AGCTGGTGGT CACTTAACAC ATGGTTCGCC 12660

CGTTAACTTT AGTGGAAAAA CCTATAATTT TGTCAGTTAT GGAGTGGATC CTTCAACAGA 12720

AGTAATCGAT TACGATGTCG TGCGAATTTT AGCAAGAGAA CATCGTCCAA AACTAATCGT 12780

TGCAGGCGCA AGTGCGTATT CACGAACGAT TGACTTCAAA CGTTTCCGTG AAATCGCTGA 12840

TGAAGTAGAT GCCAAGTTAA TGGTTGATAT GGCGCATATT GCAGGCTTGG TAGCTTCaGG 12900

GTTGCaCCCA AATCCaGTTC CgTATGCTGA TATCgTAACA AGTACgACCC ATAAAAcATT 12960

GCGTGGTCCc TCGTGGCGGT TTAATTTTGA CAAATAGCGA AGAATTGGCG AAAAAAGTAA 13020

ATAGTAGTAT TTTCCCAGGC ATTCAAGGTG GCCCATTGGA ACATGTGATT GCCGGAAAAG 13080

CCGCAGCTTT TAAAGAAGCA TTGGATCCAA GTTTTGCTGA ATATAGTCAG CAAGTGATTG 13140

CAAATGCGCA AGCGATGACG AAAGTTTTCA ATCAAGCGCC AGAAGCTCGT TTAATTAGTG 13200

GGGCAACAGA TAACCACTTA TTACTGATTG AAGTGACAGG TTTTGGTTTA AATGGAAAAG 13260

AAGCAGAAGC TATTTTAGAT AGTGTAAATA TTACTGTCAA TAAAAATTCA ATTCCGTTTG 13320

AACAATTAAG TCCATTTAAA ACAAGTGGTA TCCGTATTGG AACGCCTGCA ATTACTTCAC 13380

GTGGTTTTAA AGAAGAAGAT GCGGTAGAAG TTGCTAAATT AATCGTTCAA GTCTTAAAAG 13440

ATCCAGAAAA TACAGCAGTT CATGATGAAG TCAAAGCAGC TGTAGCAGCG CTAACTAAAA 13500

AATATCCATT ATATAACTAA CATCAGGGGA TAGCTGAAGC GCCGATTCTC TGGTACAAAT 13560

AAGAGTCGGA GCTT 13574



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

ATTTACAGCG TTACTGAnAC AAGCAAAAGA CGCTTTAAGC AAATAAGCTT AAACAATCTT 60 

TCACACCCTA CTCAAGGAGT AGGGTGTTTT TTTAATAACT CCGTTGGTAA CTTTCATCTA 120

TAAATGAAAA AACTTGTGAG AGCGGAATTT CTTCAGAAAG ACTTAACCAA TGCTCTTTAT 180 

TCATATGATA GGCAGGAAAA TAGCCAGGTT GGTTCTTTAA TAAGGCAATC AATTCAGGCG 240

CAAGTTTAAC GGTTAAAATA GTGAGCTGCT TATTCTCGGT AGCGCCGAGC AGTTGTGCAT 300 

CCAGTGGCAT AATTAAAGCA AACCATTTTG CATTTCTAGG ATGACGAAAG ACGAGATAGT 360 

CTGGGTGTTT TTTAAAAGGA AAATCGGGCG TTATCTGATA ACGTTCTTGG ATATAGGCGC 420 

TAATAGTTTC TTGAAAAGTC ATAGGTTTCT TCCTTTCGTT AAGGCTAGGC ATATCTATTT 480

ACATCATAGC AAAAATTACA TAAAATTACT TCGTTCAAAG GAATGGGAGG AGGGAGCAGA 540 

TGACATTAAT TTCAACAATT TTAGTAACCT TGGTGGCTGT AGAATTTTTC TACATTATGT 600

ATTTAGAAAC GATTGTACCA ACATCTGAGA CAACAAGCCG TGTTTTTAAA ATGGATAGGC 660

AAGAGTTACA AAGArAATCA GTCACTACAT TGTTTAAAAa TCAaGGAATC TATAATGGAC 720

TAATCGGTGC AGGACTTATT TATAGTGTCT ATTTTGCGCA AgCAACGATG GAGATGACAA 780

AATTTTTATT GATTTATATT ATTTTAGTGG CGGCGTACGG TAGCTTAACT AGTGATAAAA 840

AAATCATTTT AACACAAGGC GGTTTAGCTA TCGTGGCTTT GATTAGTTTA CTTGTTTAGC 900

CATCAGAGTT AATAGAGAGA GGCAGTCAAA TCGACAGAAA AAACACGGTT GATTTGACTA 960

TTTTTTATAG AATTGAGAGA GACAAAATGT GGTATACTTT GTTTATTAGA TAGGGAGGAA 1020

TTAAAAATGC TGAACCAAAA AGAAGTAAAA AAAACCGATT ATACTCAATT AATTTCTCTA 1080

TGGCGTTCTT CAGTAGAAGC TACCCATTTG TTTTTGAGTC AGGCAGATAT TGATAAAATC 1140

GAAGTGGTTT TACCTGATTA TTTTCAACAA GTTCAATTAA GTATGTGGTT AAATGAAGAA 1200

CAGAAaTGTG TGGGCTTTAG TGGAACAAAT CAACAAaCGT TGGAAATGTT ATTTATTGAC 1260

CCAGTCTATT TTCGTAAAGG CTATGGTGGa GAAATTATCC mAAAGTTAAT AGAGCAAGAA 1320

TCAATTATCT TTGTTGATGC TAATAAACAA AATGAAGGGG CAGTGAAGTT CTATCAATCT 1380

CAAGGGTTTC AAGTAATTGG TGAATCAAAG GAAGATCCTC AAGGAAATCC GTTTCCTATT 1440

TTACATATGA AACGGATATA GAAAAGAAGG GAGAAGCCAT GCGAGTAATC GTTACAGAAT 1500

ACCAACCAGC CTGGGTGGAA CAGTTTGAAG AAGAAGCCCA AGCGTTGAAA CAGATTCTAA 1560

AGGAAAATTG TCTTAAAGTT GAACATATCG GTAGCACATC TGTGCcTAAt TTAGCAGCCA 1620

AACCAATTAT TGATTTTTTG GTCATTGTTG AAGAAATTGA AAAAGTAGAC CTGTTACAAT 1680

GGGAATTTGA AAGAATCGGT TATGAATATA TGGGGGAATT TGGGCTATCA GGACGTCGTT 1740

ATTTACGAAA AGGCCCGATA AAAAGAACAC ATCATGTGCA TATTTATCAG TTTGACAATA 1800

CGCAAGAAAT TTTGCGCCAT CTTGCTTTTC GAAATTATTT ACGAGAAAAT CCGGCAATTG 1860

CTACAACCTA TGGTACATTA AAAAAGCAAT TGGCCCAAGC ACACCCCGAT AGCATTGATA 1920

AATATATGGA TGGCAAAGAT GCGTTCATTA AGAAAATAGA AAAAGAAGCG TTGAAGAAAT 1980

ATTGGGAAAA ATAAAAAAAG TTGCTAAGGC TAGCAACTTT TTTTTGGTTC TTTTTTGAAA 2040

AAATGATAAA GAAGACCACT TAAAAGAACA CCTAGAATAA TTCCAATCGC TAAATTATGC 2100

GTATAGACGA TCACACCCAC TGTCACGAGT AAAATAACAA TTTCAGTAAT TTCAAAAGTT 2160

CGGAAAAGCT GCAAACTTTC CCAATCAAAG GTGTCGACTG CCACTGTCAT CATGATCCCA 2220

ATCAATGCAG CTGTAGGAAT GGCTACCATC ACGGATTTAA AAACAAAAAT GAATAGAAGG 2280

AGGGTAATAC CTGAAATCAA AGTAGAAAGC CGTTTTCGTC CTCCAGATTT AACATTAATG 2340

ACCGCTTGGC CAATCATCGC ACAGCCAGCT TGGCCACCAA AAAAGCCGGT AATTATATTA 2400

GCGAATCCTT GTGCTTTAAC TTCTCGTTGG CTATCACTTT GGCTCGCTGT CATTTTGTCC 2460

ACAATGGGGA TCGTCAATAA AGATTCAATT AACCCGACCA TTGCTAAGGC AACAGAGCTA 2520

GGTAAAATAA TCCACAACGT TTCTAAGTTG ACGGGTACAT TAGGCCAACG GATCGCTGGA 2580

TGAAACTGAC TCATTTCACC AAGATCACCT ACCGTTTGCA AATGACCTTT TAAAAAATAG 2640

GACAGGATAG TCATGAATAA AATAACAATT AAAGCAGGAG GCACAATTGT AATAAACTTA 2700

GGTAATAGAT AAATAAGTAA GATACTACAG ATAATCATAC CGTAGGACCA CAGTGTTTGA 2760

TGGGGCAATT GTTGTACTTG CGCCATGAAA ATTAAAATGG tAACGCATTG ACGAATCCGT 2820

ACATCACTGT CTTGGGAATA AAGCGCATCA ATTTGTGGAT ACCTAAATAA CCTAAAATAA 2880

ATTGTAGAAA GCCTGTTAAA ATAGTCGCAG CAAGCATGTA TGACAAACCA TGTGCTTGAA 2940

TTAAGCCTGC aAGGACTAAG GCCATTGAGC CAGCAGACGC CGAAACCATT GCCGgACGGC 3000

CGCCCGTAAA AGTAATCACA AGTAAAGTAA cAGctGAAGC AAACACTGCA GAAAGCGGGT 3060

GAACACCAGC AATAATCGCA AACCCGATTA CTTCTGGTAA AATGGCGACA GAAGAAACGA 3120

GACCGGCAAA GAGATCATTT CGTTCATTGC CTAACCATTC GCTTTTTTTA ATACTAAATA 3180

ACATCGAAAA ACTCCTTTAA GTTAATAAAA CCTTATTCAT AATAGCATGA ATAAGCTGAA 3240

ATGAGCGTGG GACAAAAATC ACTTTGGATT TTTGCTCCaC GCTCaAAAAC TGATAAACGG 3300

CGGGAACAAA AGCAACTCCT TCGGAAATAA GCCGGAATTC TCCAAAAATT ACAGAACAAT 3360

TTTCGGAAAT TCCTTCTTAT TTCTCGGAGT TAAACGCTTC TGTCCCGACC TCATTAGAAC 3420

TGTAGCAGTG CCAACAACAA AATGGCTAGG ACGACTTGAA CAGTTCCTAC TGATAAACCA 3480

TAGATTAAGA AACGTTTGCC TTCTTGGAAA AACTTTTTAA AATCGAGTCG TAACCCGATT 3540

GCTGCCAAGG CAGTAATTTC AAACCAAGAA CTAAAGAAAT GAGCAGTCTC ACTGATCACG 3600

ACGGGGAAAT GAATCAAACT ATTAAAGACA CAGGCAATGA AAAAGCCAAC TACATACCAA 3660

GGTAGGGCGC TGCTTTTTTT GGTGACTTCT ACCAACTCAG CCTCTGATTC TGCCGTCTTA 3720

CTTTGCTTGA AACGTCCAAA TAAATAGACA ACTGCTACGA GTAGGACAAT CCGCATAATT 3780

TTAAAGAGCA TCGCAAGTTG AAcTGCGTTC TCATTGACCA TATTGGCACT GGCGACAACC 3840

TGACCAACTG ATTGTAAAGT CCCTCCGATT AGTGCGCTAC GAGCCAACAA ATTGGTACCA 3900

TAGAGAATCC CACTTAAAAT GGGTAAGGTC AACATTAAAA CGGTTCCTAA TAAGTTTACT 3960

AAAGTAATGA TTTGGCCTTT TTCTTCCTCA TCTGCTTGAA TTGCTGGAGC AATCGAAGCA 4020

ATAGCAGAAG ACCCACAAAC GGCATTACCA CCAGCCATCA ATAAAGACAT ATTATCAGAA 4080

AAAGCCAGTT TTTTACCGAT CAAATAGGCA AAAATAATGG TCAGCGACAT TTGAATGAGA 4140

ATAAATGCGA CACCTTGTAA GCCAATTTGG GCAATCGTTT GAAAGGTCAC GGTGGTTCCC 4200

AATAAAACAA CAGAAAATTC TAGTAATTTA CTTTCGGCTA CTTTGGTTCC GCGATTTAAA 4260

TTGGCACCGC GAACAAAGGT ATTTCCCAAA AAGATTCCTA GTAAAATAGC TATCGTAGCG 4320

GCGCCTAAGC TAGGTAACCA GATGGCAAGA AACTTGCTAA GGCAGGCGAC AAGGAAAGCT 4380

GTTAGCAAGC CGGGAAGAAT TTGGAATAAA GATTGGATAT AATCATTTTC TGAATTTTTC 4440

ATTGAAAAAA TCTCTCTTTC TTGTTCATTA CTTCCAGTAT AACGAAATTT TTTACATTTG 4500



WO 98/50555 PCT/US98/08985 

AGAnATAAAT AGTTAAnATA ATATTTAT 4528
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 



GAAGTCAGTG AAGCGAAATT AAAAGAAGCA GCCAATCTTA TGCCAGAAGA GCCGTTAATT 60

CAATTTGCTT TGGGAGAACT ATACTTTACA AATGGTCAAT TTGTCGAGGC TATTACTCGT 120

TATCAAAGCA TTGTAGAAAG TGGGACAGCG CAAATCTCTG CTATTTCCTT GAATGAACGC 180

TTAGGAAGTA GTTACAGCAT GTTAGGTGAT TTTGAAGAAG CGGTTCCTTA TTTGGAAGCC 240

GCAGTCAAAG AAGAACAAAC AGACGATCGT TTATTCCAGT TAGCTTTCAC TTATTTACAG 300

TTACATGAGA ATCAAAAAGC CATTGCGTTG TTCCAGCAAT TAAAAACACT AAACCCACAT 360

TATCAATCAC TTTATCTATA TTTAGCGGAA GCGCTCCAAG AAGAAGAGCA ATTAGAGGAA 420

GCAAAAGCGG TGATCGAAGA AGGAATTGCG GAAAATCCCT TCCAAGTAGA GTTATACCAA 480

TTTGCCTCTG AAAATGCCTA TCGCTTACAC GATATTGCTG GTGCTGAAAA CTGGTTGTTA 540

AAAGCTTTGG AATTAGGTGA GAAAACAGAT GAAAGTCGTT TGACCTTAAG TAATCTCTAC 600

ATCACGGAAG AGCGTTTTGA GGAAGTAATT ACTATCTTAA ATGATTTAGA GGAAACGGAT 660

CATCCTTATG CTGAATGGAA CTTAGCAAAA GCGTACAACG AATTAGAAGA CTTTGCCGTC 720

GCTAAAGTTC ATTATGAACA AGCAGCGCAA GAATTGTCTC ACGAACCAGA ATTTCTAAAA 780

GAATACGCAT TGTTCTTACG AGAAGAAGGC GAACTTGAAA AAGCAAAACA CTTGTTATAC 840

CACTATTTGG AACATGAACC TGGTGATGTC GACATGCAAT CTGTGCTAGC AGATATGGAA 900

GAAAGATAGG AAACGACGAT GTTTGTTAGT TTATCAGAAA AGAAAAAATT TTTAACTTGG 960

TTAGTGAATA CCGCGCCTTT TGGAAGAAGA GAAGTACTGT GGATTTTAAA TTACTTATTA 1020

ACGCACGATG CGATTTTAAA TAATGTTCAT TTTGTTGAGA ACGTAGAAAA AACCGATCGG 1080

GGGATTCGTG TGGTAGCGGA TGGTTTAGGT AAGGAGCCGT TATTGCTTTT TATCCAAGCA 1140

CAAGAATTTA CAGATCCGGA ACAAATTTTT CACGAGATTC GGATGAATTG GCGGAAAGCT 1200

CTTTATTTAG AATGTGTATT TCCAGAGGCT TGGCAAACGA GCCAGTATTT ATCTGTTTTA 1260

GAAGATAATC CGTTTGCGCC ATGGAATGAA CAAGTGGATC AGGAAGTTGC CAGAGCGATT 1320

GATCAATATT TTAAGCAAGA AGAACAAACG CAGCGAATGG cATTACTTAA AGCTCAAATT 1380

GATGATGCTT TAGAAACAGG CAACAAAGAA GCGTTTTTAG AATTATCCGA TGAATTAAAT 1440


CGTTTGAAAC 


AACAATAAAT 


AAAACCCAGC 


ATTTATTGAA 


AATGCTGGGT 


TTTATTTATT 


1500 


GTTTTAATTT 


AAAGCGCCGT 


TGTTTGCTGG 


GTTTTGCATA 


AGTAAAGCCT 


TCGCCAATTG 


1560 
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CTTCATGTAC GTTAATGATG GACACAAAGG CTTTTTCATC TATTTCATTA GCAATCCGTT 1620 

TGACTTCCAT TATTTCTCGA GGGCTGACTA CCGTGTAGAG AACCCGTTTT TCCACTTGTG 1680 

AATAAGCACC TTCACCATTT AAATATGTTA CCCCACGTTC TAAAGAAGCC ATTAAACTTT 1740 

GACCGATTTC TTCGGAATGA TTGGAAACAA TTAAAATTCC TTTCGCAGCA TAGGCTCCAT 1800 

CGAGTACAGA ATCAACAACA CGACTGAAAA CAAAGGACAC AATTAAAGTA TACATCATTC 1860 

GTTTGACATC AATATAGGTC AACGAGAGGA GTAAGACTGC AACATCAAAA AGTAACAACG 1920

AGCGGCCCAT GCTAATCCCA AAGTATTTTT CGAAAATGCG GGCAATGACG TCGGTCCCAC 1980 

CAGTGGTTCC GCCTACGCGG TAAACAAGGC CACTACCGAA TCCTGCGGCC AAACCAGCTA 2040 

AGAGAGCAGC AATCAATAAA TCATGATCTA AATTAATTTC AACAGGAAAT TTTTGCCAAA 2100 

ACCACAAAAA TACCGATAAC GACACGGTAC CAATTAGCGT ATAATAGAGG GAACGTTTGC 2160 

CTAAAATTTT CCCACCAATT AATAAGAGCG GAATATTGAT TAATAATGTC GTATATGCTG 2220 

GGTTAAAATG AAATAAGGCT CTTAAAATCA AAGTAATCCC CGTGACACCA CcTTCGGCCA 2280

GGTCATTGGC AATATTGAAA GTAACCAAAC CAAAGGCATA GATACAGCTT CCTAATACAA 2340 

TAAATAACAC ATCTTTAAAA TAAAATCTTT GTTCTTCCAC TTAATCACGT CCTTTTACAG 2400 

TATAAACTGT ATCATTCTCT ATATAAAACA ACAAGTAGAA CCTTTCCAGA TAGCCAAGTT 2460

TTTGATAAGA TGTAGGAGGT AGAGTAAGAA AGGTTGGGGT AAAATGTCAG AATCAGAACA 2520 

ACGTAGTTTG CAATCGATGC AAACAGAAGT TGATGATTAT ATTCAACAGT TTAAAACAGG 2580 

CTATTTTTCA CCATTAGGAC AAATGGCTCG TTTGACGGAA GAGGTCGGAG AATTAGCCCG 2640 

TGAAGTCAAT CATTATTATG GTGAAAAACA GAAAAAAGCA GACGAAAAAC CAAAGACAGT 2700 

ATCAGAGGAA CTTGGAGATG TCTTTTTTGT ATTGATTAGC ATGGCAAATT CTTTAGAGAT 2760 

TGATTTAACC GAAGTTTTTG AAGAAAATAT GGCCAAATTT AACAAGCGGG ATCGCTATCG 2820 

CTTTGAAAGA AAGGATGGGA AAACACATGA TTAAAATAAT TGTCGCTGGG TTTAAAGGAC 2880 

GGATGGGCAG TACTGCTACA CAAATGGTCT TAGAAACCGC AGACTTTGAA TTAGTAGGTG 2940 

TGTATGATCC TCATGAAGCA CAAGAAACTG TTTCTTTCAA TGATGAAACA GCAATTCCCG 3000 

TTTTTCAGCG CTTAGAAGAA GTCCTAGCTG TCAAACCAGA TGTTTGGATT GATTTTACGG 3060 

TTCCTGAAGC CGCCTATCCA AATACGCGTT TTGCGTTGGA ACACGGTATG GCGCCTGTTG 3120 

TTGGGACGAC AGGCTTTACG GAGGAACAAA TCAACGAATT GACAAACCTA TCTCGAGAAA 3180 

AAGCCATCGG CGGCTTGATT GCGCCAAATT TTGCAATTGG GGcAGTTTTA ATGATGCAAT 3240 

TTGCCCAAAA AGCGGCGCAG TATTTTCCAG ATGTGGAAAT CATTGAATTA CATCATGATA 3300 

ACAAATTGGA TGCACCGAGC GGAACGGCTA TAAAAACTGC AGAAATGATT CAAGAAGTTC 3360 

GGCCAGCAAa AAAACAAGGG AAcCGCAAGA GGTAGAATCA ATACCAGGGG CACGCGGGGC 3420

TGATTTCGAG GGTCTGCGAA TCCATAGTGT CCGCTTACCT GGTTTAGTGG CACACCAACA 3480

AGTCCAATTT GGCAGTGTAG GTGAAGGGCT CACTATTCGG CATGATTCTT ATGATCGGCG 3540

TTCATTCATG ACAGGAGTAG CGTTAGCTTG tCGGCAAGTG GTACAGAGAA CAGAACTACT 3600

TTACGGATTG GAACAGATGC TATGAAATTG ACAACTATTC CAAACGAATT TAAAGAAGCC 3660

GCTCCCGTTA TTCGTGAAAT CAATGCACAG GGCTTTGAAG CGTATTTTGT CGGGGGCAGC 3720

GTCCGTGATG CTTTATTAAA TAAACCAATC CATGATGTCG ATATTGCGAC AAGTGCCTAT 3780 

CCAGAAGAAA TTAAGCAAAT TTTTAAACGA ACCGTTGATG TGGGCATCGA ACATGGCACA 3840 

GTCCTTGTAT TGATGG7V>GA CCAACAGTAT GAAGTAACAA CCTTTCGTAC CGAGTCAACT 3900 

TATCAAGACT TTCGCCGACC AGATGAAGTC ACCTTTGTTC GCTCCTTGAA AGAAGATTTA 3960 

AAACGGCGAG ATTTTACGAT TAATGCCTTG GCGCTCGACA GCACAGGTGA AATCATTGAT 4020 

TTATTTGATG GTATCGAAGA TTTAACTAAT CAAACCATTC GTGCAGTCGG TAATCCCCaT 4080

GAGCGGTTTC ATGAAGATGC TCTGCGAATG ATGCGGGGcT TACGCTTTGC TAGCCAATTA 4140 

GATTTTAAAA TTGAAGAAAA AACCTTAGCC gCTATCGCTG AATTTCATCC TTTATTGGAA 4200 

AAAATTTCAG TAGAACGCAT TACAATTGAA TTTGTAAAAA TGCTTTTAGG GGTTAATCGA 4260 

CAAGGTGGCT TAGCTCCGTT TATTGAAACA GAATGCTATC AATACTGTCC AAAATTACGA 4320 

GAACAAGGGG CTGGACTTTT TCGTTTAATG GACTTGCCTG CTCGTCAAAT TGAAACAGAA 4380

GCAGAAGCGT GGACGTTACT TATCCAATCG TTGAATTTAC CAGAAGCAGA AATTCGCTCC 4440

TTCTTAAAGG CTTGGAAACT TTCAAATCAA TTAATCCAAA ACGTTTCACA ATTAGTACGC 4500

GGTTTGCGTT TTCGCTTGAG TAACGACTGG CAGCCAATGA TGCTTTATGA ATTAGGCGAA 4560

GAATCAGCAG TACTTGTTGA AAGGCTTTTA TATTATTACC AACAAGAGAG TCAAGTACAA 4620

GTAACAAAAG AGTTAGTTAA GGCATTACCC ATCCATCAAA GACATGAATT GGCCATAACA 4680

GGAAAAGACC TTTTAGCCGT CTTAGAAGAA ACACCAGGCA AATGGTTAGG GGAACTGATT 4740

GCTGAAATCG AGCAACACGT TGTTGAAGGT AGCTTAGAGA ATAAACAAGA AGTTCTGCTT 4800

TCGTTTGCCA AAAAACAACG TTCAAAAGGA GAGAAAGCAT AATGCCAACA GAAATGTTTA 4860

CAAAAGAATT CTTGGTGAAA GAATCACAAA CGGCTArAAT GCTCGGTTCT GGTGACTTAG 4920

AAGTTTTAGG AACCCCTGCT TTAGCTGCCA TGATTGAACA AaCAGCCAAA GAaGCTGTGA 4980

AGGACCAwTT ATCAGTTGGG GAAaCAaCTG TCGGAaCGGT GTTAGAATtG CGCCACTTAT 5040

TTCCATCCGC GGTAGGGGCG ACAATCGTTG TTACGATGAC CAGCATTGAA CAGACTGCAC 5100 

ACAAAATACG CTATGAATTT GTTGCTTATG AAGGAGAACG ACAAATT 5147 
(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
AAGAATTTAG ACAACTATTA GCCGCAAAAG GCATTGAATT GACAGACCAA CAGATGACAC 60

AATTTGACCA ATATTTCCAT TTACTAGTAG AATGGAATGA AAAAATGAAT TTAACAGCCA 120

TTACAGAAGA AAAAGAAGTT TATTTGAAAC ATTTCTATGA TTCTGTTTCG TTGGCaTTCT 180

TTGAAGACTT TGCCAGTGAC AAAGCCATTT GTGATGTTGG CGCAGGCGCA GGATTTCCAA 240

GTATTCCTTT GAAAATTGTT TTTCCTTCAT TAAATGTCAC AATTGTTGAC TCTTTAAATA 300

AACGAATTAC TTTCCTAACA GAGTTGGTGA ATCAACTAGG TTTGGCTAAT GTTTCTTTGT 360

ATCATGATCG TGCAGAAACA TTTGGTCAAA AAGCAGAATT TCGTGGCGCG TTTGATTACG 420

TTACAGCGCG AGCAGTTGCA CGATTAAATG TTTTAAGTGA ACTGTGTTTA CCCTTAGTTA 480

AAAAAGAAGG CTATTTTTTA GCCTTAAAAG CATCGAAAAG TGAAGAAGAA ATTAATGAAG 540

CAAAACCTGC AATCGCTACA TTAGGCGGTC AATTTCAAAA AGAAGTTGGT TTTACTTTGC 600

CGATTACGGC AGACGAACGC CACATTGTCG TGATTCAAAA GAAAAAAGAA ACCCCTAAAA 660

AATATCCCAG AAAACCTGGA TTACCAAATA AACAACCGAT TAAATAAGAT GGAGGCACAC 720

ACATGACACG AATTATTTCT GTAGCGAACC AAAAAGGCGG CGTTGGTAAA ACAACGACCA 780

CTGTAAATTT AGGCGCATGT TTAGCAAACT TAGGCAAAAA AGTCTTACTG ATCGACATTG 840

ATGCACAAGG AAATGCTACC AGTGGCATGG GTGTTCCTAA ACCAGATGTC GCACACGATG 900

TGTATGATGT CCTTGTAAAT GAAGAACCAA TCACTAGCGT GGTTCAACAT ACCAGCCGTG 960

AAAATTTAGA TATTGTCCCT GCAACTATTC AATTAGCAGG CGCTGAAATT GAATTAACTT 1020

CAATGATGGC TCGAGAATCG CGCTTGAAAT TAGCAATTGA TGAAGTAAGA GACATGTATG 1080

ATTTCGTTTT AATTGATTGC CCGCCTTCTC TAGGGCATTT AACAATCAAT GCTTTTACAG 1140

CAAGTGATTC GATTCTAATT CCTGTTCAAT GTGAATATTA CGCATTAGAA GGCTTGAGTC 1200

AATTGCTAAA TACGATTCGC TTGGTTCAAA AACATTTTAA CCCTGAATTG AAAATTGAAG 1260

GGGTCTTGTT AACCATGTAC GATGCTCGGA CCAATTTAGG TGCCGAAGTC GTTGAAGAAG 1320

TCCGTAAATA TTTCCGTGAA AAAGTCTATG ATACGATTAT TCCAAGAAAT GTCCGTTTAT 1380

CCGAAGCACC AAGTCATGGT TTACCAATCA TTGACTATGA TATTCGCTCA AAAGGTGCCG 1440

AAGTGTATCA AGCACTAGCA AAGGAAGTGT TGGAGAATGA ATAAAGGCAA AGGTTTAGGc 1500

AGAGGCATCG ATGCCTTGTT TCAAGATATC GCAAaGCTAG AAGATGTCGA CGTTAAGAaC 1560

GAACAAGTGA CAGAAATTCT ATTAAACGAA CTTCGTCCCA ATCCTTACCA ACCGCGGAAA 1620

ACATTTGATG AAACCTCTTT ACAAGAACTT GCAAACTCGA TTCTACATTC CGGTGTGTTT 1680








TS. 3i A (Z fZ P T & T C 




AGGTGAGCGT 


1740 


CGGTTCCGGG CTTCTAAATT AGCCGGCAAA GAAAAAATTC CCGCAATTAT TCGCGAATTT 1800

GATGAAGAAT CCATGATGCA AGTTGCTGTT TTGGAAAACT TACAAAGGGA AGATTTAAAT 1860

CCGCTAGAAG AAGCAGAAGC CTATGAAATG CTAATGAAAA ATTTAAAATT AACACAAGCG 1920

GAAGTCGCAG AACGTTTGGG CAAAAGTCGC CCGTATATTG CCAATTATTT ACGGTTGCTA 1980

ACTTTACCAG ATGCGGTCAA AGCAATGGTT CAAAAACAAA GCATGTCAAT GGGTCAAGCA 2040

CGGACGTTAT TAGGCTTGAA GAACAAAGAA CAGCTATTGC CATTAGCTAA TCGTTGTATT 2100

AAAGACAATT TAACGGTTCG ACAATTAGAA CAACTTGTCG CTGAATTAAA CGAAACACAA 2160 

GGCAAAAAGG GCAAAAAAGC GAAAAAAGCC ATCAAGGAAA AACCCATTTA TATTCGAGAA 2220 

AGTGAAGACC GCTTAATGGA TAAATTTGGG ACAACCGTAG CGATTCAAGA AAAAGAAGGC 2280 

AAAGGAAAAA TTGAAATTGA GTACTTGTCC TCTTCGGATT TGGCTAGAAT TTTAGATATT 2340 

TTAGATATTC ATTTTGACGA GGAATAAAAA GCAAAGTGAG AAAGTAAGAC AGAAAGTCAC 2400 

AAATCCTTTT GCTAGTTTCG GTAAAAAATG GTATTGTAAT CATGGTATTT TTATATAAGT 2460

AAGAGAAGAA AGAAGGCTTG GAACATGTAT GATTTAGGCG ATATTGTCGA AATGAAAAAG 2520 

CCGCATGCTT GCCAAGCAAA TCGTTGGCAA ATCATTCGCA TGGGCGCTGA TATCAAAATC 2580 

AAGTGCACGA ATTGCGGGCA TATTGTCATG ATGCCGCGTC GTGATTTTAC TAAAAAATTA 2640 

AAAAAAGTAA TCGAGAAAAA AGCAGACTAA TGCTTTGATG CCTCTAATAG AAAAGAGAAT 2700 

AGACTGAGAG AAAGAGTGGA AAACAAATGG CACTAACAGC TGGAATTGTC GGCTTACCAA 2760

ACGTTGGTAA ATCGACCCTA TTTAACGCAA TTACAAAAGC AGGAGCAGAA GCTGCAAATT 2820 

ATCCCTTTGC AACGATTGAT CCGAACGTCG GCATGGTTGA AGTACCAGAT GCCCGTTTAC 2880 

AACGCTTGAC AGAACTAGTA AAACCGAAAA AAACAGTTCC CACAACGTTT GAATTTACAG 2940 

ACATCGCCGG AATTGTTAAA GGTGCCAGTA AAGGAGAAGG GCTAGGAAAT CAGTTTTTAA 3000 

GTCACATTCG TCAAGTAGAT GCAATCTGTC ACGTGGTTCG TTGTTTTGAT GACGAAAATA 3060 

TCATGCGTGA ACAAAATCGG GATGCGGATT TCGTTGATCC TTTAGCAGAC ATTGATACAA 3120 

TTAATTTAGA GCTAATTTTA GCAGATTTAG ATTCAATCAA TAAGCGCTAT ACTCGCGTGG 3180 

CTAAAATGGC GAAAGCAAAA GATAAAGAAG CGGTGGAAGA ATTAGCTGTT TTAGACAAAA 3240 

TTAAACCAGT TTTAGAAGAA GGTTTATCTG CTCGTACGAT TGAATTTACG CCAGAAGAAG 3300 

AGAAAATTGT TAAATCTCTT TTCTTATTAA CCACAAAGCC GGTCCTTTAT GTAGCAAATG 3360 

TTTCAGAAGA TGAAGTGGCG GATCCAGACA ACAACGAGTA TGTACAACAA GTTCGTAATT 3420

TTGCAACCAG TGAAAATGCG GAAGTAATCG TAGTTAGTGC GCGAGCAGAA GAAGAAATTG 3480

CTGAATTGGA TTCTGAAGAA GACAAGGCTG AATTTTTAGA AGCAATGGGC ATTGAACAAT 3540

CTGGCTTAGA TCAATTGATT CGTGCGGCTT ACGATTTGTT AGGCTTAGCT ACCTACTTTA 3600 

CAGCTGGTGA ACAAGAAGTT CGTGCATGGA CGTTCCGTAA GGGAATCAAG GCACCTCAAG 3660 

CAGCTGGGAT TATCCACAGT GACTTTGAAC GAGGCTTCAT TCGTGCGGAA ACAGTTTCTT 3720 

ATGAAGACCT AGATAAATAT GGCAATATGC AAGCTGCCAA AGAAGCGGGC CGTGTTCGTT 3780 

TAGAAGGAAA AGACTACGTC GTTCAAGATG GCGATGTTAT GTTGTTCCGT TTTAACGTTT 3840 

AAAAAAATGT GCCCAACAAG CAATTTGTTT TGGGATAAGC CCACATATTT AAGGCCATTC 3900 

AGCCGTTTAG AAAGAGGTTC CCTCTACATG AGGGTGAGGA GGATACAACT ATGGAAGCAG 3960 

AAGCATTACG AGCAATCGTG GCTGAGAATC GTCAATTAGA ACAGAATTTG ACTAAAAGAA 4020

ATbAACAATA CATTTTTGAC TTAAAAAAAT CATTAAAAGC GGCTAATTTA TCTGAAGAAG 4080

AGCTAGCATT GGCTCTACAC GGTATTTTGC CAGAATTAGT AGCCGGTCAA AAAACAGGTA 4140

AAACAGCCCG TCAGTTATTT GGCACTGTTT CAGAACGGAC GGAAGCAATT TTAAATAAAC 4200

CAGCAGAAGT CAAAGAACCA GCGGGCTGGA TGATTTGGCT CGACAATACC TTACTATTAC 4260

TTGGTTTATT AACAATCATG TTAGCGGCGA TGTCTCTATT TTCAAAAGGA ACCGCACAAC 4320

CTCTCGGATT AACCACAIAT ATCTTAGGTG CGATGGCTGG GGGTTATGTT TTCTATCTAA 4380

TGCACAAATA TGTTTACCGT TTTGATCGTC AAGGTGGCGA CAAAAGCAAA CGTCCTGGTT 4440

GGTTAAAAAC GACCGCCATT CTATTTGGTG GGATGTTCTT GTGGATTGCA GTTTTCGCTG 4500

GTTCAGCGAT GCTACCACCA GTGATCAATC CCATCTTAGA TCCGATGATT GCTTTAGTAA 4560

TCGGTGCGTT AgcCTTTGTA GCACGCTATT TCTTCAAGAA AAAATACAAT ATTCAAGGTA 4620

GTTTCATGAC ACGTCAATAA ACGGCGCGCG CTCAATTGTG AATGCGCTCG GcTGACGCAG 4680

AAACTGCAAC AAGCAAAGTC GCTCTCTCAA TAGAGCGTTT TAACAGTTAA AGCCTGGGGG 4740

ATTAGTCGAA AAAACTAGTG TCCCAGGCTT CTGTTTTCAG AAGTTTGCTA AAGAAAGGAG 4800

CCTCTTATGG GACAACAAGT GAAGTCTTGG TGCCGTCAAC ACGACCGGCT ACTCATTTTT 4860

ATTAGTTTTC TCTTATTAAG TGGACTTTCT CTCTATTTTG TCTATTTTCA AGAGAATTTT 4920

TTACGAGCAA GTGATTTCCG CTTTCATCAA AATCGGGTGG AAGGTCTCGC ATTAGCAATT 4980

AAAAACAACG ACTGGTTTCC TAAAATCAAT TATTTTTTCT TGGGAGGCTA TGGCTATGCC 5040

TCTAGTTTGT TTTATCCAGA TGCGTATCTT TATTTACCTG CGCTTTTGCG TGTTCTGGGG 5100

ATTTCCTTTG TCGCTAGTAT GGCGATTTTT GTTTTTGCGG TAAACTTGGC AACATTTAGT 5160

CTTACTTATT ATGCAGGCCG ATTAATGGCG CTCTCTAAAA AAAGAAGCTA TCTCTTTGCC 5220

ATTTTGTATG GCTTATCTAT TTACCGCATG CAAGACTTGT TTAATCGCCA AGCGTTAGGA 5280

GAATTTTTGG CGCTAAGTTT TTTTCCGTTG GTCTTAGCGA GCCTGTTTTT ATTACGAAAG 5340

GGAACCACGA AATGTTGGCC GTTGTTAACG CTGGCGATGA CTGGGATTGG GTTGGCTCAC 5400

TTTATTTCTA TTGAAATGGT TTCTATTTGG ATTGGGTTAT ACATTCTGTT TTATTGGCAG 5460

CAGTTTTTTA AAAAGGAGGT CCTGTTGGCA TTAGCGAAGG CTGCAGGTTT GACGCTGCTT 5520

TGGCTGGCCT TTTATCTTTT ACCAGTTGCT GAACAAATGA AAAATCAAGT GTTTAAAGTT 5580

ACGTCCAATC CACTCACTTA TATTTCCGAA AGAAGTTATC CTATTGTTTC TCTTTTCATT 5640

AATAGTTTAA AGAGCAGTGT TTTTCACGCG AAAACAGCGA ATCTAGGAAC CCTTTTGTTT 5700

GTGGGGCTTG TTGTAGCAGT GGTTTCGCTT GCCAGCAAAA AAATTCAGAA CAAACGGTTT 5760

ATTGGTTTGA CGCTGGTACT CTTATTGATG GTTACCACGC TTTTTCCTTG GCACTGGCTG 5820

AACCACACGC CTTTAAATAC AATTCAGTTT CCTTGGCGGT TTTTAGGGAT TCTTTCTGTA 5880

ATGCTCGCTT TTTTCATTGC ACAAGACGAA TGGGGCGTTT TCCGTAAATC TTGGACCGTG 5940

GCTTTGTTGG TTTTCCTTGC CATCAGCAAT TTGGGCATTT ATCAATATCA AAGTATTCAG 6000

TCACAACAAG GCCGCCTGTT AACAAAAGCT GAGTATGAGC AGCCAGCGCC TTTTTATATC 6060

GGTGCGGGCC ATGAATATTT GCCAGATGAA ATCAATTATC AGGAATTATT GAAACAGAAA 6120

AAACGACCAC TTGATTATTC GGCAGAACAA GTAACCATTA CAAATGTTAG AaTGCCTTAT 6180

GGGAAAATTT CATTTGACTA CCAAGTGGTG AtCAATCAGC GAAGGTGACC GTf CCTTTTA 6240

TTTATTACTT AGGCTATCAA GCCACGATTC AAATGAAAAA CCAAACGGGC GCTAAAAAAA 6300

TGAGCCTAAC CAATCAAGGA GGCTTAACCG CTCTTTCTTT ATCTGGTACA GGACATGTCG 6360

ACATCCGTTA CCAAAGAACA AAAGTGCAAA AAATAGGGAC AATGATTACC TTACTTTCTG 6420

TTGGTGGTTT TGGATTTAGC CGATTTTTAC AACAAAAAAA GAAACACAAG ATAAAAGAAC 6480

AACGATGAAA GTAGGACCTT AATTAGGAAG AGTTGAATAA AGGACAAACC GATTTTGGCG 6540

TTTgTCCTTT ATTCCTCATT GATTATTTAA ACAAATCTAG TTGACTTCCT TAGTATTATC 6600

TGTTATTTTC ATATATAAAA TTAAGTCGAG GAGTGGACAA AAAAATGTCA AAyTGGGAAA 6660

CAAAATTTGC GAAAAAAGGT CTTACATTTG ATGATGTGCT ATTAATCCCT GCAGAAAGCC 6720

ATGTCTTACC AAATGATGTA GACATGAGTG TACAATTAGC TAAAAATATC AAGTTAAATA 6780

TTCCATTGAT GAGTGCCAGT ATGGATACTG TTACAGATAG TAATATGGCA ATTGCGATGG 6840

CTCGTCAAGG TGGACTAGGT GTCGTACATA AAAATATGAC TGTCGCCCAA CAAGCAGATG 6900

AAGTACGTAA AGTCAAACGT TCTGAAAGCG GCGTTATCAT TGATCCATTC TTTCTAACAC 6960

CAACAAACTT AGTAGCGGAT GCAGAAGAAT TAATGAGTCG TTACCGTATT AGCGGTGTGC 7020

CAATTGTAGA AACAATGGAA AACCGTAAAT TAGTCGGAAT TATTACAAAC CGTGATATGC 7080

GTTTTGTAAC GGACTACCAA ATCAAAATCG AAGAAGTCAT GACAAAAGAT CATTTAGTCA 7140

CTGCGCCAGT TGGCACATCT TTAAAAGATG CAGAAAAAAT CTTACAAAAA CATAAAATCG 7200

AAAAATTACC AATCGTTGAT GAAGCAGGTC GCTTAAGTGG CTTAATTACA ATTAAAGACA 7260

TTGAAAAAGT GATTGAATTC CCAAATGCTG CCAAAGATGA ACACGGTCGT TTACTTGTGG 7320

CGGCAGCAGT CGGTGTCACT AGTGATACGT TCGAACGTGC GGAAGCATTA TTAGAAGCAG 7380

GTGCGGATGC GATTGTCATC GATACTGCAC ATGGCCATAG TGCCGGCGTT ATTCGCAAAA 7440

TTAAAGAAAT TCGTGAAACG TTCCCTGAAG CAACATTAAT TGCTGGTAAT GTTGCCACTG 7500

CTGAAGCAAC CAAAGCATTG TATGATGTGG GCGTTGACGT TGTCAAAGTT GGTATTGGAC 7560

CTGGTTCAAT TTGTACAACA CGTGTCGTTG CTGGTGTTGG TGTTCCTCAA TTAACAGCGA 7620

TTTATGATGC TGCTTCTGTA GCTAGAGAAT ATGGTAAAGC AATTATCGCT GATGGCCjGTA 7680

TTAAATATTC TGGTGATATC GTGAAAGCCT TAGCTGCTGG TGGACATGCT GTAATGCTAG 7740

GAAGTATGTT AGCAGGTACA GATGAATCTC CAGGCGAATT TGAAATTTAT CAAGGCCGTC 7800

GTTTCAAAAC TTACCGTGGC ATGGGCTCAT TAGGTGCAAT GGAAAAAGGA TCAAGTGATC 7860

GTTACTTCCA AGGAAGTGTC AATGAAGCAA ACAAATTAGT GCCAGAAGGA ATCGAAGGAC 7920

GTGTTGCTTA CAAAGGAAGC GTTTCTGATA TTGTCTTCCA ATTAATCGGT GGCTTGAAAT 7980

CAGGGATGGG TTACGTCGGT GCAGCAGACC TGAAAGCGTT GCGTGAAGAA GCACAATTTG 8040

TTCAAATGAG TGGTAACGGA TTGAAAGAAT CACATCCACA TGACGTTCAA ATCACAAAAG 8100

AAGCACCAAA CTATTCTGTT CAATAAAATA AAAGCTGATT TTTTTGAAAT TTAAAAGAGG 8160

AAGACGTCAT TTCGACGTCT TCCTCTTTTT CTGTAGAATG AAGGATTTGC TTCTGTTTCC 8220

GCCGTTTATC AATTTTTAnG CCTGTAACAA AAaTCCAAAG TGATTTTTGT TACAGGCtCC 8280

TTTTCCTATA GAATAAAAGC CGTTATCTGA TATCAGACAA CGACCTTCAT CTTCCCTTTA 8340

TTTAATAACT GTTAGGTTAC CCATGTAAGG AACTAGGACT TCTGGTACAG TTACAGAGCC 8400

ATCTTCGTTT TGGTAGTTTT CTAAAATAGC AGCGACTGTC CGTCCTACGG CTAAACCAGA 8460

ACCATTGAGT GTATGCGCAT ATTGGACTTT ATCATTTTCA TCACGGTAAC GAATCATCGC 8520

ACGGCGCGCT TGGAAATCTT CACAGTTTGA GCATGAACTG ATTTCACGGT ACGTCTCTTG 8580

AGCAGGAATC CAAACTTCCA AGTCATAAGT TTTCGCAGCT GAGAAGCCCA TGTCACCTGT 8640

TGAAAGAGCC ATGACACGGT AAGGTAATCC CAGTTTTTGC AGAATTTCTT CGGCATTATT 8700

CGTCATTTTT TCTAATTCCT CATAAGAATG CTCTGCATCA CTAAATTTAA CCATTTCGAC 8760

TTTATTAAAT TGGTGTAAAC GAATTAATCC TCGTGTGTCA CGACCAGCGC TACCAGCTTC 8820

AGAACGGAAA GAAGGGCTCA aGGCCGTAAa GTAAATCGGT AAaTCCTTGC CATCTAAAaT 8880

TTCATTGTTG TAGTAATTGG TTAAAGGGAG TTCCGCAGTT GGAATTAACG TTAAATCCGT 8940

ATCTTGTAAT TGGAAGACAT CTTCTTTAAA TTTAGGAAAT TGGCCAGTCC CGAACATGGC 9000

GGTGTCATTA ACGATATAAG GCGTAATCAT TTCTGTATAG CCATGTTCAT AAACATGTAA 9060

ATCTAACATG AAGTTGTATA ACGCACGTTC TAAGCGTGCA CCTAAGCCTT TGTAATAAAC 9120

AAAACGGCTA CCAGCTACTT TGGCACCACG TTCAAAATCT AGAATGCCTA AGTTTTCAGC 9180

GACTTCCCAG TGTGGTTTTG GTTCAAAAGC AAAAGTTCTT GGTTCACTCC AACGACGTAC 9240

TTCTACATTT TCTTCTTCAC CAGCACCAAC AGGCACAGAA TCATCTGGCA AATTCGGTAA 9300

AGTAGTTGTG ATCCCACGTA GCTCTTCGTC AATAGCGTTA ATTTCTGCAT CTAATGCTTT 9360

AATATTGCCG CCGACTTCTT TCATTTCCGC GATTTTGGCC GTTGCATCTT CTTTGTTTCG 9420

TTTTAATTGA GCAATTTCAG CAGACACATC ATTCCGGTAT TTTTTCATTT CTTCGACTTT 9480

GACTAATAAA TCACGGCGAC TTTCATCTAA ACGCAGAAAT TCCACTAAGA TTTCTTCTTT 9540

CACGCCACGG GTTTGTAGCT TTGCTTTTAC TTCGTCAAAA TTTTGACGCA TCATTTTTAC 9600




TTTTTTCCTC 


U 1 lrt/W\l 1 i\ 




rlM 1 r\r\ 1 r\M^ 1 


O 1 1 u 1 rturtrtrt 


9660 


ACAACAAAAA AGCCATCTCA CCCCAAAAAT AAATATTCTT GGGACGAAAT GACTGGAAAT 9720

CTCGCGGTAC CACCCAAGTT CAGTAATGTT ACTGCCCTTG ATTTGCAGAT AACGGCTGCA 9780

GACCGTTCCA ACTTATTTCG CTGGACACCT GGAAGAGACG GACTTCTTTA TAGATGAATG 9840

TTTGCTCGCA CCCCCGCAAA CTCTCTGGAA TTCAGTGCCA TAAATACTTT TTCTCTTAGT 9900

TTTCTCTTTT CTATAGAATA CCGAAAAAGT TCAGGGATGT CAATTAACTC TCTGATTTCT 9960

TGTTGGTACG CTGTAAGGGT AGATGAATAA TAAAGGAAGT CCACTTTTGA TCAGACTTCG 10020

CATAAATATA TCCACCATGT AAAGCGACAA TACTTTGTGC AATAGCCAGA CCTAAACCAG 10080

TACCCCCCGT TTCTTGTGAT CGGGATTCTT CGACACGATA GAAGCGGTCA AACAACTGAT 10140

CTAAAGAATG TTTAGGGATA GCTGGTCCAT CATTGCGAAC AGCAATAATT GCTTCGGTGC 10200

CGACTTTGTC TACTTCCATT ACAATATGAT GGCCACCTTT GCCATATTTC AAAGCGTTTG 10260

AAAGAAGGTT GTTAAAGACT CGGACTAGTT TTTCCGTATC GCCTTCCATC ATTAAAGAAG 10320

CTGGATTGGC TTTTACTTGG ATTTGCATAT TAATCTTTTT CGCTTCCAAT TCAAAATCGG 10380

CAGCAAGCTG TTCAATTAAT TGTGCCATAT CAAAAGTCGT CGTATGAATC GGTACACTTG 10440

GTTGACGGAC TTTTGTATAT TCAAAAAGAT CATCAACTAA TAATTTCATT TGTTTCGCTT 10500

TGACGTAAGC GGTATGCGTA TATTTCAGTA AATCTTCTTG GCTATGAAAT TGACGATCTT 10560

CAATCAATCC CAGATAACCA ATAATAGAAG TTAACGGTGT CCGAATATCA TGACTGACAT 10620

TCGTAATTAA TTCATCTTTT GATTTTTCAA TTCGCCGTTC ATCTTCAATG GCTGCGACAG 10680

TACTATCCAC CAGACCGTTA ATACTCGTCA CGACTTTTGC TAAATCACCA CTTAATTCAA 10740

ACGGAATGCG GTGATTGTAA TTACCATCAG CGATGTAATG TAACTCACTG ATAATATGAC 10800

GCAATTGCAT TTGATGGTAG CGACGAATTA AGCGCCAATA GAGAACAGCT GCATCTACTA 10860

AAAAGAAAAT GGGTAAGACA AATTTATTCC AGCTCCAAAA CAAATCAGTA TTTAATTCTT 10920

TAGCGACCGC ATCCTTAGTG GCCCAGATGA CACCAGCTAA GGGCGAGTTT TCACTAAGTA 10980

GTTGTCCTAA AAGGAACATA ATTGAGACAT TTAATAGTAG TAATAAAACA ATTGTCACAA 11040

TTGCTTCTGC TAATAACTCG CTGATTTCTT TAGAAGTTAA GGTAATTCGT TTTTGTTTTT 11100

CAATCTTTTT AGGTTTAACG AGCATCGATC TTATATCCAA CTCCCCACAC TGTTTGAATG 11160

ACTTTTTCTC CACCAGTTGC TTCTTCAATC TTATCTCTTA AATGACTGAC GTGAACCATC 11220

ACGGTTTTAG CAGAAACAAT ACTTTCTTGT TGCCACACAC GCTCAAAAAT TTCATCCGCA 11280

CTAAACACAC GATTTGGATG ACTAGCTAAT AGATAGAGGA TACCAAACTC AAGAGCGGTT 11340

AATTGAATTT CTTTTCCTTC TATTGTTTTG ACTTCGTGAG AGTCCTTATT AATCATTAAT 11400

GGGCCGACTT CTAATTCATC GGGTTCTTCT TTGGTTAATT GCATTTGGCT TCTACGTAAA 11460

ATAGACTTTA CCCGAGCCAT TACTTCTAAA GGATTAAAAG GCTTGGTGAC ATAATCGTCT 11520

GCACCAGCAA CTAAGCCTTT GATTTTATCC ATATCGGTTG TTTTTGCTGT TAACATAATA 11580


/111 kjoriri ill 






1 1 y~*r\\*i l n\* x i 


CCATTrCATC 


CATGATTGGC 


11640 


ATCATAATGT CCAAGATTAA TAAATCGATA TCTGGTGTTG TGTGCAATTT GGATAACGCT 11700

TCTTTTCCAT CATAAGCTTT CACGACTTCG TATCCTTCAT TATGAATATA AATACTCAAT 11760

AGTTCTACAA TTTCTTTATC ATCATCTGCA ACTAAAATTT TCATAAGCGT TCATTTCCTC 11820

CATATATAAA AAATCTCTTT TCCATATTTT AACAAATAAT TGCTAAAAAG GACAAAATCC 11880

TTGATATATT ACGATaAATT TAGAAGATTT TGCTAAAAGT TAATAAACTA TAGCCAAAAA 11940

AGGATGAAAG GGGGTAAAAA ACGAACGAAA ATACCTATAC CAATCATAAT AATCGGTATT 12000
GAAAAAAAGT TTAAAAAAAG TG 12022 
(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 556 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

TAAAATAATC GCAATGACTG CATCTGTAAA TGCTTCTACA CGTGTTTTCG GCATCATTTC 60 

CCCCTACTTT CATAAAAATT AAATAACTTA CCCAAAGTAA GTATAACGCA TTTCTGTAAA 120 

AATTCAATTA TTTATGAAGT CTCTCGGTTG TTGAAGTTCC TAAAAACGAG TGCAAGAGCC 180 

CCACTTAAAA GTAAGCCTCT TGCACTCGTT ATAGAACAAA TCAGTTATTG GAAGACGCCT 240

GCCTGATAAG GTGCTAACTG CATCCGTTTC TTCACCGTAA AATCTTGATT CCCTAATAAT 300 

TGTTGCCACT TTCTGCCTAG TAATTTATTC GGCAAGTCAA ATTGGCACTT TTCTCCACTT 360 

AAATTGACAA TCACTAACGC TGTTGCCTCA GTTGTTTTTC TTAAAAAAGC AAAAACAGAC 420 

GGATGATTGG GcGCTAATAG TTCAAACTGT CCAtCTGTAA AGACCGGATT GTCTTtCCGC 480

AACGCAATTA ACTTTTtATA ATAATcAATA CGGATgCGCA TcCGTTtCTT GGGCAGCCAC 540 

ATTTATnnCT TGAAAT 556 
(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5998 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

TGTTAAAAAA ACCATTTTTA CTATTTTTTT CACTTTTAGG AGCTATTTTT ATTTTAGCTA 60 

GTTGTGGCAT AGGAAAaGaT GCTGTCACAG ATACTAAGTA CAAAGTTAGT TTGCAGCAAG 120 

CTGCTGAAAT CTATGAAAAA GAAGCTGGCA ACAGCAAACC ATTAGTAAAT GTCCAATTTG 180 

ATACAGAACC AGCAAGTGAC TACAGCTATA TCTTTACTAA CGATACAGAA ACACTTTACG 240

TGAATCCTGA AACAGGAAAA GTCACCAAAA ATACTGAAGC AAATCAACTT GGCGAAAACG 300 

AGACAGCCTT TTCAGCTGCT GAAGTCAAAG AATTAGGCGC TGTTAACGAC GTTTTAGCCA 360 

AAGCAAAAAA AGAAGTTGGA GGACTTTCTC CACGTATTTT GACTTGGAAG TTAACCAAAA 420

ATAACAATAA ACTTGTTTAT ACAGTAGATG TTAAAACGAC TACGGCAGAT GAAAAAGTTA 480

CCTTAGATGC CAATAAATAA GCACAAGCGT ATAACTTATC ACGAAATAAC CAAATCTTAA 540

AATACTCTTT GGGACAGAAA CTACGCATTA GGAAACCTTA ACATTTTCCG AAGCCGATCG 600

TTCTGTCCCA ATTTCATTCT TTTAAGTAAT AATAATTGTT CGTTACCCCT GCTACCGTTA 660

TACTTAGTTT AAAAGAAAAG ATGGGAGTAT CAATTCATGG TTTATACAGT TGATTTTAAA 720

GAAGTAGCCA CTATTGGTTT AGAAAGTTCA CCAGTGGCAC CTGCCCTTGC TGGACTAAGA 780

GCGAATGAAG CTCGCTATTT TTGGAATAAA TACAAACATC GCTTTACGAC CGTCCCTGCC 840

TCTGAAGCTC CAGAAACATT GGCCTGGATT GAAAAAATTT TATTAGAACG AGAATTACAT 900

TTCCCTTACA AGCCACTTGA AATTTCTTCT TTCACTTTAG ATGGCATAAA AATGGCCTAT 960

GTATTTTATG AAAACGGCTT AGCAGTTAAC GTGATGTACA CATTAGCTGA GGAAGGCAAA 1020

CGCGCCGTTG GGTTTAAACT TTCAGATGGT ATGGCAATCC CTGCAGAATT TGAAGGAAAA 1080

TTCAAATTTG CTCGTCAAAA ATCAAAATTA GCTGGAACCA TTCGCGGCTC TTATTTTGTC 1140

ATTAAAGGAA eCTATTAAAA AAATGCTTTT GCCCTTATTA TTAAAGGCAA AAGCATTTTT 1200

TTATTTTAAT TCTTCAATAT AGTTAAATAC TTGTTGTCCA GCAATACCGC CTTCACCGAC 1260

CGCTGTCGTG ATTTGACGCA ATGTTTTTTC ACGGACATCA CCAATGGCAT AAACACCAGG 1320

AATTTTTGTT CTCATTTCTT GGTCTGTTTC AATCCAACCA GCCTCATTCG TAATACCAGC 1380

TTTTTTAAAT GGTTCTGTTA ATGGATCTAG CCCCACGTAA ATGAACACGC CGTTTGCTTC 1440

AATTTCAGIAA ACTTCATCTG TTTTGACATT CCGTGCTTTC ACACCTGTCA CGACCATTTC 1500

ATTGCCAACA ATTTCTTCGA CAACTGTATC CCAAACAAAG GAAATCTTTT CATTTGCAAA 1560

GGCACGATCT TGAATGATTT TTTGCGCACG CAATTCATCA CGACGGTGAA CAATCACTAC 1620

TTCAGAAGCA AATTGCGTTA AATAGATTGC TTCTTCAACT GCAGAATCGC CGCCGCCAAC 1680

GACTACTAAG CGTTTATTTT TAAAGAAAGC ACCGTCACAA ACAGCACAAT AAGAAACACC 1740

ACGGCCAGCA AATTCTTCTT CGCCTTTTAC ACCCAATTTT CGGTGTTCAC AGCCTGTTGC 1800

AATAATTACC GCTTTAGCTT CATAAGATTT GTCATCACAA ATCACTTCTT TATAGGATCC 1860

GTGGTCTTCA ATGCCCATCA CAATACCATA CGCATTTTCC GTACCAAATT TTTCTACATT 1920

TTCATACATT TTATAGGCCA ATTCTGGACC CATAATAGAA TCAAAGCCTG GGTAATTCTC 1980

TACCTCAGCA GTGTTGTTCA TTTGCCCACC AGGCGCACCG CGTTCAATCA TTAATACAGA 2040

AAGATTGGAC CGTGATGCAT AAAGTGCTGC CGTCATCCCA GCAGGACCGG CACCAATGAT 2100

AATTACATCA TACATTTTAT TTCCCCCAAA ATTTTAAGTA AGTTTACTTT ACTCCTTAAA 2160

CAATTTCTTG TCTACTTTTC TGCCCTTTTT TGACATTCGG AGTAGGTGTG TCTATTACGT 2220

AAGTTATTGT ACCACGGTTT TGTGTTTTCT AGGAGGGGGA AAAGAAAAGT TTAAAAATAA 2280

CGATCACAAC GAATCATTGA TTTTTAGAAG CCGTTGTACT GCCGCATCAG AGAATTCTTC 2340

TTTATCATTT TTTATGCATT CTTGTTGATT TTGCCCTTGT GGAACAAAAC GAAATACTGT 2400

AAATGATGCA GATAACGTAT CAGGATTACT AATCCGATGA AGACTTTTAT GCATCCGAAT 2460

AAGTTCTCCT TCTGCACAAA TTCTTGAAAT AATTTGCTGA TTTTCTAGCT CTTCTATTTT 2520

CACTTTTCCC GACTGCACAA GTAAAACTTC TTCGATTTCG TGATGCTTAT GCCAACCTTG 2580

CACAGTTTTT CCTGGCAAGC TATTTATATG TATTTCAAAC TCTGGAAATA GAAAATAATT 2640

TACTTCTGTT CCATCTGGCT TTTTACGAAA GATGACCTCG TCTCTTGTTA AAATTTCTAT 2700 

ATATTTCTGT AACATGGCAG GCTCCTTTTA ACTTTTTTTA TAGCCGAAAG CTATTCTTTC 2760

AAACTAAGTT TGAAGCTACT TTCTGAAAAT TCGAGTTCCT TTAAATATAC TTTTATTACC 2820 

ATTTGAAAAT CACTTTTTTT AAAACAATTG ACTGTTCAGT ATCAATCTTc ACTAATCCAT 2880 

CTGTTCACCA GAAAAAATAT AGTATTTCCG ATTATTCACG AACTTATTAT TCAACTTCAA 2940

AGGAAAATCA GTCAACATAC AGCCCACATA TCTAACATCA GAAACACTTC TACTTTTCTC 3000 

AAGTACTGCT ATGGTATACA TTGACCTTAC TAGTAACATT ATAGCATCAA GTTTTAATCA 3060 

TATGTAGATT TAAAAAAAAT GaTAGcACCC TAACAGaTGT TTTCGTTGCT CATTTAATTT 3120 

ACATGATTAT TTTTTAGATG CTAATTAAAT GTTTGTCAGT AACCTGAATT TTTGATATTT 3180 

TTATCCAGAT TACTTATGTC GGTTTATCAG TTGTGCTAAC AAAAAATGAT TGTTTTTAAA 3240

GATAATTTAA TTCATAATAT ATATAGAATG AAAAAAAGGA GGTTTTATTG TTGAAAGTTG 3300 

CTATTTGTGA TGACAATCCA ACGTTaACCG AAAAAATAAA CACGATGCTA TTTAATTATA 3360 

ACCCAAATAT ATTTGAAACC TATACCTACT ACAATCCTCT CAAGTTAGTC AATCAACTTG 3420 

ACCAAGAAAA TTTTGACTTT TTTATTTTAG ATATAGAAAT GAGTGAAATG AGCGGAATAG 3480 

ACTTGGCAAA AAAAATCCGA GAACGTGATA TTTTATCACC TATCGTTTTT TTGACGAGTT 3540 

ACAAAGAATA CATGGAAGAA GTCTTCCAAG TACAAACTTT TGATTATTTA TTGAAACCGC 3600 

tACGGAAGAT CGGTTGCACC AAGTTTTGGA AAAACTGAGA CAACAGCTAG AAAAAAAAAG 3660 

AAATTACTTT GTATTTTCTA GTAACAAAGT TACTTATAAA ATACCAACCA AAGATATCAT 3720 

TTATTTCGAA AAAGATAAAA GACAAGTATT GATTCATACA GTTGGAGAAa TCTACAAACC 3780 

CtATATGTCA ACGAATCAGA TAAACGAACA ACTTGATACG AATTTTGTGC AAGTCCACTC 3840

TTCTTTCATA ATAAATTGCG CTTATATAAA AGAGTTAGGA AAAAATTTCC TATTAATGGA 3900 

TTCTAAAGAA AAATGTATTG AAATACCAAT AAGTAGGCGC TTCAAAGCCG CCGCTCATAA 3960 

AAGTATAGTT ATGTCCATGA GAGGAAAGAT ATAATGCTTG ATATAACATT ACGAAGCCTA 4020 

AGTTTGTTAG TTTATTTATT TCTTATCTCT TATTTTTTTT CTAATTGGTC AACAGCTATT 4080

AAAAATCATA GAGAATTTTA TAATGTCATC TGGATTGTTA TAATCATATC AGCCATGGTT 4140

ACGATTATAT TATCCAATGC TTACATACCG AACATTTATT TACCGTATCT TAACTTATTA 4200 

CTAAGTTTCA TTTTAACTTA CACTATTGCT ATGCTACACT CGGTCCCTTA TCTCGACTCG 4260 

GCCGTATGGA CAATGACACT TATATCAATA AATTTAATCT GTGAAGTATT GTCGTTACAC 4320 

TTTACGAAAA TAATTTTAAA TGCAGAATTA ACATCCTATG GTAGCCCTAC TTTTTTCATA 4380 

ACTTCAATAA CAATTACAAC TTTGATTGGG GTAGCCTGGA TAGTCATACT TAAATTTTCA 4440

ATAGTTAAAG AAGCAAAGTC TGAATTATCG ACAAACTTAT CATTTATTGT GATTTTATTA 4500

CCTATTCCCA TACTATCAAT AATCATTTTA TTTGGGCTTT TAATAGGAAA CAACAATGAC 4560

AAAATTTCTG AGATTACTAT TACGATTAGT GTCATATTTT TAAACATCTG TGTAATTTTT 4620

CTTTATAAAA TCACTATAGA ATATCAAAAA AATCAATATA ATCTTACTTT GAGAAAAAAA 4680

AATATCGAAG TGGAGTATAG AATTTTAAAT GAAATAAAGA GAAACAGAAC TAACGTTTTA 4740

AAATTAAAAC ATGATTTGAA GAATCAATAC TTAACAATCC TCGGACTTAT CGAAAATGAA 4800

GAAGTTAATG AAGCTATCGA TTATATTAAG AGTAGTTTTG ATATTTTAGA ACCACCAACA 4860

AAAACTTATG CTGCCGATGG AGTATTAAAC TACCTATTGA ATGAAAAGCT TGCTGAAGCA 4920

AGAAAGAATC AAATTAATGT TGATCATCAG ATATTTGTCT CTAAAAATAT AAAAATTAAT 4980

AATGACGTTC TAACTATTGT AATTGGAAAT ATAATTGATA ACGCCATACA AGCATCGAAG 5040

AGAATCAAGC CGATTGACAG ATATGTCAAT ATAATTATCA AGCAAGTCAA TAATGATTTA 5100

TTTATTGAAG TATCtAATAA CTATAATTCt GaAGAnATTT TTaCaAGAAA ACaTAGaAAA 5160

AATaAAGGtT TAGGaATGaA AAACaTTGaT GATCTCTTAC AACAATTAGG TGGTATTCAA 5220

CGTCACTGGA CGAAAGAACG TAGATATTTT GTAACTATAG TTATATTTAA TGTATACAAA 5280

GGAAACAAAT GCGGTGATAA CTAATCAAGT AAATCACCGC ATTTTTTAGT ATACTTCCAT 5340

GTCGCATAAT TTATCAGAAA ATTTCAAAAA TAACCAAACA AGCATCAATC CCATAATCGT 5400

TGAAAATAGG AACGTTATAG TTGGAAAATT CATAAGACCC ATAGCAGAAA AATTCGAATA 5460

TATTAGCATT AGTACCACTG ACGTTATTAT TGAAGCAATT TCCGATTGTT TCCAAATTGA 5520

AATTATTTCT GACAATATCA AAATTACGAA TAGTAGACTA CAACTTGTAA CTATGGTAAT 5580

AAGACCAAGG TAAATTTCAA AAAATAAATT TGAACTATGA AACTTCAAAA TTAAGTTAAG 5640

CAAAACCTCA GTCACAAATG AAATAAGAAG CGCACTTCCA AAGCTAGATA AGATTATCAC 5700

GTAAAAAGAA CTTGTTTTTT TTACGAGAAG CTCTTTTCGA TCAATCGGAA AAAGAAACGT 5760

TCTATTCCGG TGATTGCCAA CATAAAATCG TACAAGCTCT TTCCTTAGTA ATACAGCTCC 5820

ATAAATTGAT ATGCCGCACA TAGAAACAGT GTTCGATAAT GCTATAATTC CTTGAAATGA 5880


CGAGATCGTA 


TCAGCACTTT 


CATCAAATAT 


TCCCACAAAT 


GAGAAAAmCA 


TAAAAAATAA 


5940 


CATTTGGATT 


ACCAGTGTAA 


TTAAGAAAAC 


TTTAAACCAA 


CATTnTAATG 


GnnCTAAT 


5998 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1641 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

AAATCCGTAG CAACATTTAT GGAGACAATT ACAGAAAAAT GCGGTGAAGC ACATCAAGAT 60 

TGGGCAAAAA ACTTTCAGGG CGGnCTTTGC CAATACTTTA CTAACAACAG TGAAACGTGC 120 

AAGAAGATGG TACTACCTTT TTATTAACAG GCGATATTCC AGCAATGTGG TTGAGAGACT 180

CAACAGCACA AGTTCGTCCC TATTTAGTGA TTGCnAAAGA AGACGAAGAC CTTGCTCAAA 240

TGATTGCTGG TTTGGTTAAA CGCCaTTTCG tTATATTTGC ATTGATCCTT ATGCCAATGC 300

GTTTAATGAA ACAGATAATC ATGCAGGCCA TCAAACTGAT AAGACAGAAA TGAATGGCTG 360

GATTTGGGAA CGGAAATATG AAATTGATTC ATTATGTTAC CCGGTCCAAT TAGCCTATTT 420

ACTTTATAAA AATACTGGGA TGACCGAACA ATTTAATTCA GATTTTGTGG AAGGCGTCAA 480

AAAAATTCTC AACGTTTTTA CAACAGAACA AGATCACGCA CAGTCACCTT ATTTATTTGA 540

ACGAGATACG TGGCGTCAAG AAGATACGTT GGTTGAAGCA GGGAAAGGAA CACCAGTTGG 600

CAAAACAGGG ATGACATGGT CAGGCTTCCG TCCTAGTGaT GATGCCTGCC AATATGGTTA 660

CTTAGTTCCT TCTAATATGT TTGCAGTCGT AATTTTAGGC TATATTCAAG AAATTTTCTC 720

TGATGTTTTA GAAGATGCTA CGATTGGGGC GATTGCTAAA AAATTACAAG AAGAGATTGA 780

AGAAGGCATC CAAACATTTG GACGAACAAA AAATCAAAAT AACGAAACGA TTTACGCGTA 840

TGAAGTGGaT GGCTTAGGCA ATGCAAGCGT GATGGaTGAC AGTAATGTTC CCAATTTAGT 900

AGCAGCCCCT TATCTGGGTT ATTGTTCAAC AGAGGATGAA CAATATTTAA CAACGCGTCA 960

AACCTTATTA AGTAAAGAAA ATCCTTACTT CTATGAAGGC AAATACGCGA AAGGGATTGG 1020

TAGCTCGCAT ACACCAGAAA ATTACGTGTG GCCGATTGCT TTAGCAATGG AAGGAATGAC 1080

AACCAAAGAT AAAGCAGaAA AAGAACGAAT TTTAGATTTA TTAGTTGCCA CAGATGCGGG 1140

GACTCATTTG ATGCATGAAG GTTTTGACGT AGATAATCCT GAAAATTATA CAAGAGAATG 1200

GTTTTCATGG GCTAAT7-TGA TGTTTTGTGA ATTAGTTATG GATTATTTTG ACATCCGAGT 1260

GGAAAAATAG GAGGAAAAGT AATGACAAAG AAAAAAGTCT ATATTGTTTC CCATAGTCAT 1320

TGGGATCGTG AATGGTATCT GCCTTATGAA GrACATCATA TGCGCTTAAT TGAATTAGTG 1380

GATAATGTAT TAGATTTAAT TGAAAATGAT CCAGAGTTTA ACAGTTTCCA TTTAGATGGA 1440

CAAACGATTA TCTTAGACGA CTATTTACAA GTTCGCCCAG AGAAAAAAGA AGCCGTGAAA 1500

AAAGCTGTCC AAGCAGGCnA ACTAAAAATC GGACCATTTA GTATTTTACA AGATGCTTTT 1560

nAATTAGCTC TGAATCCAAT GTCGAACATG CTGATTGGGC TTAGAAAGCn AAATnGGGGC 1620

ACAGTCAATT AGGTACTCCC G 1641



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8033 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

AGAGAAGGGA TCATATGTCA GCAAAACATA ATCATGAACA ACACCAGGAA AAACAGCCAA 60 

CAATGTCCCA CGATAAAATG AATCACATGA ATCATGATAT GGCGCATGAA CAAATGGCTA 120 

TGTCACACCA ACATGAAGGA ATGAAGGGAA TGGATCATTC AATGCATATG GGGAATTTTA 180

AACAAAAATT CTGGCTGTCA TTGATTTTAG CTATTCCGAT TATTGTTCTT TCACCAATGA 240

TGGGTTTTCA ATTACCTTTT CAATTTACTT TTCCAGGTTC TGATTGGTTG GTCTTAATTT 300

TAGCAACTGT GCTATTTTTT TATGGTGGGC AACCATTTTT AAGTGGCGCT AGAATGGAAT 360

TACAGCAAAA GAGTCCAGCA ATGATGACGT TAATTGCGAT GGGGATTTCG GTATCATATT 420

TTTACAGCTT ATATGCGTTT TACATGAATC ATTTTACGAA TCAGGCCCAC GTGATGGACT 480

TTTTCTGGGA ATTAGCCACT CTGATTGTTA TTATGCTTTT AGGCCATTGG ATTGAAATGA 540

ATGCTATTTC TAATGCTGGT GATGCCTTGA AAAAAATGGC TGAACTGTTG CCTGACACAG 600

TCAAGCGAAT GACTGAACAT GGGGAAGAAG AAATTCCTTT GCAAGATGTC CAAGAAGGTG 660

ATCGCTTGAT TGTTCGTTCG GGAGATAAAA TTCCGACAGA TGGTAAAATT TTGAAAGGAA 720

GCACAACAGT CGACGAATCG ATGGTTACAG GGGAATCAAA AACAGTAGAA AAAAACATTG 780

GTGACTCCGT GATTGGTGGA GCTGTCAATG GCAACGGAAT GATTGaAATT TCGGtAACTG 840

GCACGGGTGA AAATAGTTAC CTGTCAAAAG TTATGGAGAT GGTTAAACmA GcGCAATCGG 900

AAAAATCaAA ACTAGaATCC aTTTCAGACC gTGTAGCCaA GTGGTTATTC TATATTGCCt 960

TATTCGTAGG TGTGTTAGCA TTTATCGGTT GGTTAATGGC GACGAAAGAT TTGTCGCTAG 1020

CATTTGAACG AATGGTTACA GTTTTTATTA TTGCCTGTCC CCATGCGTTA GGCTTAGCAA 1080

TTCCTTTAGT GATTGCTAGA AGTACCTCCA TTGCTGCTAA AAATGGCTTA CTTTTAAAAA 1140

ATCGGAATGC ATTAGAACAA GCCAATAAGG TTGAATATGT GTTGCTGGAT AAAACTGGAA 1200

CTTTAACAGA AGGACAATTT ACAGTTACTG GCTTAGAATT AATGAGTAAG CAGTTTACTA 1260

GAGAAGAAGC ACTTAAATAT ATTGGGGCAT TGGAAAAAAA TGCAAATCAT CCGTTGGCAA 1320

TTGGGATCAT GAACTATTTA AtAAACAAGC CgTTCAACCT TATGAAGCCC ATAATCTGCA 1380

AGCTTTATCA GGCGTAGGAC TGGTTGCTAC TGTTCAAAAC CAAGAAGTCA AAATAGTCAA 1440

TGAAAAAGAA GTTGCACGGT TACAGTTAAC CTTCGATGAA ACAATTAAAA CAAACTATCA 1500

AGAGCAAGGA AATACCTTGA GCTATTTAAT CATCGCTGGT CAATTAGTCG CACTGCTTGC 1560

TTTAGGAGAT AAAGTCAAAC CAGAAGCTAA AACATTTATT GCAGAGTTAC AAGCGCAAGG 1620

AATTACGCCA GTCATGTTAA CAGGGGATAA TCAAACAGCC GCCAGTGCAG TTGCTAATTA 1680

TTTAGGAATG AAGGAATACT ACGCAGAGCT CTTACCAGAA GATAAAGAAA AAATTGTGCA 1740

ACAATATCTT ACTGAGGGAC ATCAGGTCAT GATGGTAGGA GACGGTATTA ATGATGCACC 1800

AAGTTTAGCC CGTGCTTCAA TCGGCATTGC GATAGGCGCT GGTACAGATG TAGCGATTGA 1860

TTCGGCAGAT GTTGTACTGA CGGATAGTGA TCCTAAAGAT ATTTTAAGGT TCTTAGATTT 1920

AGCCAAACAA ACACGACGTA AAATGATCCA AAATCTTTGG TGGGGAGCTG GGTACAATAT 1980

TGTCGCCATT CCTTTAGCGG CTGGGGTTCT TGCACCAATT GGGATTGTTT TAAATCCAGC 2040

TGTAGGTGCT GTCTTGATGT CATTAAGTAC AATTATTGTA GCAGCAAATG CTATGACACT 2100

GCaTATTTcA AAAAAATAAT TCGATaTAAT TGTaGAGAGC GGAAAGTAAT CAATACTTTT 2160

TGTTCTCTAT TTTTTTCTTC TTAAAAAAAT TTTTGCCTTA TTTGACGTAT TCTTATwACa 2220

AGGGAGTGaA AGGGGACGAT ATATGGaTTT CaAGGaATTA TTGgTCTTTk GAmCATAAAA 2280

ATATGTTGGA ATTGTTGGGA TTTACTGCCA ACAAGAGCAG ACATTTTCTG TTTTGGATGG 2340

CTGCCAGAAA ACTAAAGGTG AGCGACAAAA CCTCCATTGT TACATTCAAC GTTTTGAAAA 2400

AAAGCAGGAA ACGTTTAAGA TAGAAGGGCT TTGTAAGATT TATACAACGA CCAAAGGAAT 2460

TGTAAAATAT CGTGAATTAT GCGTTTGTGG CTTGGCTCGA TTTAGACAAA AATTTTGTTA 2520

TTTTATTCCC GAATTTTATA TTTTACGTTG TCTAATTGAA GAGCGAATCA ATTATGCACA 2580

ATTGACCCAG GTATTGGAAT TACAGGAATC AACTCTTCGG AAGAAATTTT CTAATATTCG 2640

ACGTTGGCTA GTTAATTTTG ATATTATTGT ACGTCAAAAA CATTACGATT TAACTGGTAA 2700

TGAATGGCAA ATTCGACAGT TAATCCTTTG TTTTTATTTG TTTTTCCAAG AAAGCTGTCT 2760

GGAAGAAAAC AGAGAAATGA CTCGGAAAAT CATTACCTTT TTTGAGTTGG ATTTAAATGT 2820

AGCACAACAG AATCATCTTA GTTGGCTAAT TTATATTTGG GAGAAACGAT ATAGAGGCGG 2880

GCATGAAATT TCAGTGCCAA ATGCTAATTT ATTTCAACAA ACAAGTGCTT TTTTTTATTT 2940

ATTTCGTGTG GAAGTGTTGA GCACTTCTTT TATGTCTCTT AAAGAACAAA AAGCGTTATT 3000

TGTGATTCTT GAAGCACATT TTGGTGGTTG TTTTGGCAAG AGAGCGCGAA AGTATTTTAT 3060

TCATGAGCAA ATGAAAATAG AAAGTTTATG CTTAAAAACA GCTATTTTTA TTATGAAAGA 3120

AATTAGGAGA AATTTTACAC AACATCATTT TAATTATCAA GAAATTCATT TATGTCGATT 3180

TTTAAGTACG CATATGAATA GTTTATTAGA CGGACAAGCA TGGTTGCCAG CTCATAAACA 3240

GGAACAAACA CTCGCTGCTC GTTATCAACA AACCTGGCAC AGATTACAAA AACTTATTAG 3300

GCTTTTAAAA CGCCTATATC CAGTCTTTAC TTCAGTAAAA GAACGGGAAT TGACGAGTTG 3360

CTATTTTTAT CACATCCTAG ATTTATTTAA CCCGATTTTA TATGAAAAAA AATATATTAT 3420

TTGCCTATTG ACGGACTTTC CTCCAGAAAA GGAACAAGCG TTGGGGCAAT CTATAAAAAG 3480

CTATTTCAGT GAAAAAAAGA ATATTACAAT TATTCATGGA AAACCAACGT ATCAACTTCA 3540

CCAAGTACAC CTCTTAATTG TCAATCATTT GTTTCAGATG AACGTAGCGC TTTCTTCTAA 3600

GACAGTGGTT TATTTGCCAG AGGAATTATC TCCTGCTTTT TTTGAAAAAG TCGAAGCGAA 3660

TCTATCGTAA GAAGTCCAAT TTTGTCTAAA TAGTGTTAAA ATGAAGAAGT AAGAGGAATG 3720

GATGAGGAGG ATATTAAAAT GAAACAACTG CAATTTGGTA CAAGTGACGA AACAGTTTCT 3780

TCGGTTATCT TAGGATGTAT GCGCTTAAAT GGTGCTGAAA ATCCTCAACA 3840

ACTGCTTATG ATCATGGCAT TACATTTTTT GATCACGCCG ACATTTATGG TGGCGGCGAC 3900

TGCGAAACCA TTTTTGGTAA GGCCTTAAAA GAAAGTACCA TTCGTCGTGA AGATATTTTT 3960

ATACAAACGA AATGTGGGAT TCGTCAGGGC TTTTTTGACT TTTCAAAAGC ACATATTTTA 4020

GAAGCGGTCG AAGGGAGTTT ACAGCGTTTA GGTGTAGATT CAGTCGATGC ATTATTGTTG 4080

CATCGTCCAG ATACTTTAGT GGAACCAGAA GAAGTAGCTG AAGCTTTTCA TTTATTAGAA 4140

AAACAAGGGA AAGTTCGCTA TTTTGGTGTA AGTAACCAAA CGCCAGGACA AATTGAACTT 4200

TTAAAAACAG CTGTTAAACA ACCGTTATTA GCCAATCAAT TACAATTTGG CATCAAGCAT 4260 

ACTGGTATGG TAGATCAAGG CTTACAAACA AATATGGAaA TATCAGGAAG CATTGATTAT 4320

GATCATGGTA TTTTAGATTA TTCACGGTTG AAACAAATGA CTATTCAAGC ATGGTCACCG 4380

TACCAATATG GTTATTTTGA AGGTGTCTTT ATTGGAAATG AAAAATTCCC TGAGTTAAAT 4440

CAAAAATTAA GCGAATTGGC TGAAAAATAT CAAACAACGC CAACTGGTTT AGCCAGCAGC 4500 

TGGATTCTGC GCCACCCTGC TAATATGCAA GTCATCGCAG GAAGTATGAA TCTTGGTAGA 4560 

ATTGAAGAAA TTGCTAAAGC AGCAGACATT GTGATTAGTC GCGAAGATTG GTACGATATT 4620

TATCGTGCAG CGGGAAATGT TTTGCCGTAA AAAGATTCTC ATACTTGTGA AGATCCAGCG 4680

TGTATTGCTG ATTGATTTTG GTATTTGATA TAGTTGTCAC TAAGGAACTA TGTCAAATAC 4740

CTTTTTCTTT CATTCTGCAA TCGCCGTCTA CTGCAAATTC TGTAGTTTTT TGTTAAAATA 4800 

GAAAGGAAAC TAGTTTGCTA AAGATGAGGG GAAAAAAATG ACCAAAAAAA AATTATTACT 4860

CGTTGATGGA AACAGTGTAG CCTTTCGTGC ATTTTTTGCG CTACATAACT CATTAGAACG 4920

ATTCAAAAAT AAGAACGGCT TGCATACAAA TGCTATTTAT GCATTTAACA ATATGTTTGa 4980

AAATGTAATG cAAAAAGAAA TGCCTACTCA TGTTTTAGTC GCTTTTGATG nCAGGGAAAn 5040

CCACATTCAG AACaGrATTT TATTCaGAAT ATAAAGCCGG ACGTTCAAAA ACACCGGGGG 5100

AATTTAAAGA ACAAATGCCG TATATTCGTG AACTTTTAGA AGGTTTAGGC GTAAAATATT 5160 

ATGAACTACC TAACTAToAA GCAGATGATA TTATTGGTAC ATTAGnCCAA CAAAGTTGAT 5220 

AAAGATGAAT TTGATGTTGT CGTGTTATCA GGGGACCGAG ATTTGACGCA ATTAGCGACG 5280 

GACACTGTCA AAGTAGATAT CACTGTTAAA GGAGTTAGCG ATATTGAATC ATATACGCCT 5340

GAGCATATTG CTGAAAAATA TGATGGGTTA ACGCCTAAGC AAATTATTGA TATGAAAGGG 5400

TTGGCTGGCG ACACGTCGGA TAACATTCCA GGTGTAACGA AAATCGGTGA GAAAACCGCG 5460

ATTAAGCTGT TAAAACAATA TGGCTCAGTT GAAGGGGTTT ATGAGAATAT CGATGAAATG 5520 

AAAAAAAGCA AAATGAAAGA AAACTTGATT AATGATAAAG AGCAAGCCTT TCTTTCTAAA 5580 

CGTTTAGCGA CAATTGAAGT CAATGCACCA GTCGAAGTCA ATGTGGaAGA CTTGGcATAT 5640

GAAGGGAAAA ACTTAGaAAA ATTGGTTCCT TTTTATAAAG AAATGGACTT TAAACAATTT 5700 

TTAGCTAAGT TGGACATTAC AGAAGAACCT GTTGAAATGG AAGATATTTT ATTTGAAGTA 5760

GTCGAAGATC AGTTAACCAA TGAAATGTTT ACAGACGATA TGGCTCTTTA TGTTGAGATG 5820 

ATGGAAGATA ACTATCATAC GTCACCGATT GTCGGCCTTG CTTGGGGCAA CAACAAGAAG 5880

ATTTATACCA CGAATAATTT AGCTGTTTTT GAAAGTCAGC CATTTATTGA CTGGTTGATG 5940 

GATGAGACAC GTAAAAAAAA TGTCTACGAT GCaAAACGGA CGTaCGTTGC ACTAAATCGC 6000 

TATGTAGGAA AAATGACAGG GATTGCCtTT GATGTTTTAT TGGCTGCTTA TTTACTTGAT 6060 

ACCAATGACA ATAACGCTGA TATTGAAGGT GTAGCACAAC ATTACGGATA TGATGCGATT 6120 
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CAATCTGATG 


AAGCCATTTA 


TGGAAAAGGG 


GCCAAAAAAG 


GCTTGCCAGA 


GGACGAAGAA 


UlOy 


GTTTTCTTTG 


GTCATTTAGC 


ACGTAAGATT 


AAAGCAATTC 


AATTTTTAAC 


AAGCAAATTA 


£0 A n 


GATAGCGAAT 


TAACAGAAAA 


AAATCAAGCT 


GATTTATTCT 


ATAAAATGGA ATTACCACTT 


DJUU 


TCTCGTATTT 


TAGGCGATAT 


GGAAATAACT 


GGTATCCGTG 


TTGATGCCAC 


ACGCTTAAAA 




gAAATGCAAG 


TAGAATTTTC 


AGAACGCTTG 


AAAGAAATCG 


AAGAAAAAAT 


CTATGCAGAA 


64 20 


GCTGGCGAAG 


AATTTAATTT 

**** X X X *1XX XXX 


GAATTCACCT 


AAACAATTAG 


GCGTTATTTT 


ATTTGAAAAA 


o4 80 


ATGGGCTTAC 


CTGTGATCAA 

^— ' X W/ X \ X VjiUi 


AAAGACGAAA 


APTGGPTATT 


CAACAGCGGT 


AGATGTTTTA 


qDh 0 


GAACAACTAA 


AAGAACAAGC 


ACCAATTGTG 


GCCGATATTT 


TAACTTATCG 


TCAAATTGCC 


boUU 


AAAATTCAAT 


CGACCTATGT 


TGAAGGACTT 


TTAAAAGTAA 

i i nnnnw i rui 


TTCAGCCAGA 


TAATAAAATT 


O OfaU 


CATACTCGGT 


ATGTACAAAC 


ACTGACaPAA 


APAGGAPGAT 


TAAGCTCAGT 


AGATCCCAAT 


C7 o n 


TTACAAAACA 


TTCCTATCCG 


T T T AG AAG AA 


GGPPGAAAAA 


TTAGAGAAGC 


GTTCGTACCA 


D/oU 


AGAGAGGATA 


ATTGGTTGAT 


TTTPTPTTPT 


GATTAPTPTP 


AAATTGAGTT 


ACGTGTGCTC 




GCACATATTT 


CTAATGATGA 


GCATTTAAAA 


GAAGPATTTG 


TGGAAGGACA 


AGATATCCAT 


conn 


GCCAGCACTG 


CAATGCGAGT 


TTTPGGTGTT 

J- -1 JL v^OO lull 


GA AAA AGP an 


AAGATGTCAC 


ACCTAACATG 




CGTCGTCAAG 


CTAAAGCGGT 


GAATTTTGGA 


ATTGTPTATG 
ni lul^lniu 


GTATCAGTGA 


TTATGGCTTG 


7 no n 


TCTCAAAACT 


TAGGCATTAC 


TAGAAAGPAA 


GPGP A APHHT 


ATATTGACAC 


CTATTTTGAA 


/ Uo U 


AAATATCCAG 


GCGTGAAACA 


GTAPATGGAA 

Vj 1 nun 1 VJVJArV 


GAGATTGTPP 
Onon 1 lul 


GCGATGCCAA 


AGACAAAGGC 


1 14 U 


TATGTGGAAA 


CATTGTATCA 


TCGPPGTPGT 


T ATTT A AGTG 
x n ill nnu J. u 


ATATTAATTC 


AAGAAACTTT 


7 o nn 


AATCTACGTT 


CGTTCGCTGA 


ACGTAPAGPG 


ATTAATAPAP 


CGATTCAGGG 


AAGTGCCGCA 


"7 o 

/ i DU 


GATATTTTAA 


AAATTGCGAT 


GATTGAAATG 


GCGCGTPGTT 


TAAAAGAAGA 


AAAATTACAA 


7 ^ on 


GCGACAATGT 


TATTACAAGT 


GCACGATGAA 


TTAGTTTTTG 

1 1 ix\JJ 1 J. 1 A 1 VJ 


AAGTACCAGA 


ATCAGAATTA 


/ jOU 


GAACAATTAA 


ATCAACTCGT 


CAAAGAAGTG 


ATGGAGPAPG 


CAGTTTCCTT 


ACATGTGCCA 


7 A a n 

/ fi *i U 


CTTATCACCG 


ATAGCAGTTG 


GGGAAAAACA 


TGGTATGAAG 

X W W X *i J* VilTlW 


CAAAATAATA 


GCTTACCCAT 




TAAGAACGAT 


GAATAAAATA 


AGGATAAGTA 


GGATCTGGAC 


GTTGATATCT 


AAGCGTTCAG 


f J V V 


ACCTCTTTAT 


ATAAAGGAGC 


AAAAATAATG 


CCAGAATTAC 


C AG AAG TAG A 


AACAGTGCGA 


7620 


AAAGGACTTG 


AAAAACTTGT 


TGTAGGAAAA 


ACCATTCAAG 


AAGTCATTGT 


TTTTTGGCCA 


7680 


AGAATTATTG 


AATCACCAGA 


AGTTGATGTA 


TTTCAAGGTC 


AATTAGCTGG 


TCAAACGATT 


774 0 


GAAGGGATTG 


AGCGAAGAGG 


AAAATTTCTG 


ATTTTTAAAT 


TGTCTGATAA 


TGATATGATT 


7800 


TCTCACTTAC 


GGATGGAAGG 


AAAATATGAG 


TTTCATCAGG 


CTGATGATGA 


AATTGCCAAA 


7860 


CATACCCATG 


TAATGTTTAC 


TTTTACAGAT 


GGCACGCAAT 


TACGTTATTT 


AGATGTACGG 


7920 


AAATTTGGGC 


GTATGACATT 


GGTTCCTAAG 


AATCAAGGTC 


ATCAATATAA 


AGGGATTTTA 


7980 


GCTTTGGGCC 


CAGAGCCAAC 


ACCAGACGTT 


TTCCAATTAG 


CTACTTTTCA 


ACA 


8033 


(2) INFORMATION FOR SEQ ID NO: 122:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9813 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 



GTTTGGATAA TCGGCGTTAC TTTAGCAATC AGTGCTTTTC TTTTAGCTAG CTGTGGGCAA 60

ACCACACAAT CGCCAAAAGA GAAGAAAGAG CTCACGGTAA TGACTACTTT TTATCCGATG 120

TACGACTTCA CAAAACAAGT CGTTGGTGAT GAGGGAGAGG TTGAGCTGTT GATTCCAGCT 180

GGAACGGaAC CACACGATTA TGAACCAAGC GCAAAAGACT TAGCGAAAAT TACTGATGCG 240

GATGTCTTTG TTTATaATAG TAAAGAGCTA GAAACTTGGG TTCCGAATGT TATTGAAAAC 300

TTGGACACGA AAAAAGTTTC AATCGTGGAA GCCAGTCAAT CCATTCAATT GATGCAAGGA 360

nragaagagg AAGAATCTGG CGaAGAAGGT CATGAAGGCC ATAACCATTC CCACGAACTG 420

r: at rrnr at G TCTGGTTAGA TCCCGTGTTA GCGCAAAAAG AAGTAACAGC GATTCGGGAT 480

AAAAATATCC TGAGAAAAAA GCCGTATTTG AAAAAAATAC CGTTGCTTAT 540

TTGGAAAAAT TGACTGCGTT AGATAAGGAA TACCAAGCTG CGTTTGCAGG CGCTAAAAAT 600

PGGACCTTTG TGACGCAACA TGCTGCATTT GGCTATTTGG CGAAACAATA TGGCTTGACC 660

CAAGAACCGA TTGCTGGGAT TTCTCCAGAC CAAGAACCTT CGCCAAGTCG TTTAGCCGAA 720

TTAAAAAAAT ATATTAAAAC AAACAATGTT TCTGTGATCT ATTTTGAAGC ATCGGCTTCT 780

ACAAAAGTAG CCAAAACATT GGCAGACGAG ACAGGCGTCG AATTAGCCGT ATTGAATCCT 840

TTAGAGAGTT TAACTCAAAA AGAACAAGAA GCAGGCGAGA ACTACGTGTC TGTCATGAAA 900

GAAAACCTCG CTGCTTTACA AAAAAGCATT CATTAATCTG GCAACAGAAA GGAGCCAAC I 960

TATGCATTAT ATTGAAGTCG AAAATTTAAC CTTCTATTAT GATGATGAAC CTGTTTTAGA 1020

AGATGTTTCT TATTATGTTG ATCCAGGGGA ATTCGTTATT TTAACAGGAG AAAA 1 GGCGC 1080

TGCCAAGTCA ACGCTGATCA AAAGTACCTT AGGGTTATTA AAACCAACAA GTGGCAAAAT 1140

TACGGTCGCT AAAAAAAATA GTGCTGGTGA AAAAATCAGT ATTGGCTACA TTCCACAACA 1200

AGTCGCTTCT TTCAATGCGG GCTTTCCTAG CACCGTCATT GAATTAGTAC GTTCAGGTCG 1260

CTTTCCCAGA AATCGTTGGT TCAAACCGTT GACGAAAAAA GATCATCTTC ATGTGGAAAA 1320

AGCTTTGAAA TCAGTGGACA TGTGGGAAAT GCGCCACAAA CGAATTGGCG AGCTTTCTGG 1380

AGGGCAAAAG CAACGAATTA GTTTAGCACG AGTGTTTGCG ACCGATCCAG ATTTATTTAT 1440

TTTAGATGAG CCAACAACAG GTATGGATGA ACAATCGCGA AACGAATTTT ATCAATTGTT 1500

GCAACACAGT GCGCATGAAC ATGGAAAAGC TATTTTGATG ATTACGCATG ATCATGAAGA 1560

TATCAAAACC TATGTGGATC GTCAAATTCG TCTTGTCCGC AAAGAAGATT CGAAATGGCG 1620

TTGTTTCCAT ATGAGTGAAG AATCGTATAC GTAAATTTTC CAATAAAACG TTATCTGGCT 1680

CGATCATAAC GAAAGCCTAG GAACGGCTTA AAGAAATGTG GGAAAAATTT AAAAAATAAG 1740

CAGGAATTAA AAAAATAATT CGCTTAAACG TCGCTACAAT GGTAGTGTTC ATTTTAGCGG 1800

TAAGGAGTGA ATCGTATGGC AGAAATGCTT TCTTATGCAT TTATGCAAAA GGCCTTTTTA 1860

GCAGCACTGT TTATCTCAGT GATTGCCCCA ATGCTCGGCG TCTTTCTAGT TATTCGCCGA 1920

CAATCTTTAA TGGCAGATAC CCTTTCACAT GTGTCATTAG CCGGTGTGGC ACTAGGCTTC 1980

TTTTTTAATT GGAATCCTAA TTTAATGACC TTAATTGTCG TGATTGTGGC TGCAATCATT 2040

CTAGAATATT TACGAAT-j AT TTATAGCACC TATTCAGAAA TTTCGATTGC TATTTTAATG 2100

TCAGGCGGTT TGGCTTTGGC GTTAGTTTTG ATGAATTTAA CAGGAGGCAA TTCAGCTGCT 2160

AGTATTCAAT CGTATTTATT TGGTTCCATC GTCACGATTA CGTGGGATCA AGTGGTTATG 2220

TTGGCAATTT TATTCGTAGT TTTAGTTCTA TTGTTTATGT TATTTAAACG TCCAATGTAT 2280

GTTTTAACAT TTGATGAAGA TACTGCTCAT GTTGATGGGC TACCTATTCA TTGGATGTCG 2340

ATGCTTTTTA ATGTAATTAC TGGTGTGGCG ATTGCTGTGA TGATTCCGAT CGCGGGAGCC 2400

TTGTTAATTT CAGCAATTAT GGTCTTACCA GCTGCAATAG GTATGCGAAT TGGTAAAGGC 2460

TTTAACACGG TGATTATTAT CAGTGTGTTT ATGGGCTTGA TTGGCATGCT AACAGGGTTG 2520

ACTAGCTCGT ATTATTTGGA AACACCACCG AGTGCAAGTA TTACCCTAAT TTTTATTGGT 2580

TTATTCTTAT TAGTCAATAT TTATCGCCGA GTGGTTGTCA TGGTCCAACG AAAACAAAAA 2640

ATGCAAAGAA ACTAAGGAGA AAAACGAAAG TTTTTCTCCt TTATTTCTTT TTTTTAAAAG 2700

AAATGGCGAA CTATTTTCAG TTATTTGAAA AAAATCCTCG AATTTTTCGG TTAAACCCCT 2760

TATAATAAGG AACGATAGAA TGAAAAGGAG TGACGAATTT GAAAATTCGG AGAAGTGAAC 2820

GATTAATCGA TATGACCCAA TATCTTTTAG ATCATCCCCA TACATTGGTT TCTTTGACAT 2880

ATTTTGCTGA CCGATACCAA TCTGCCAAGT CTTCCATTAG TGAAGATTTA GCAATTGTTA 2940

AAAAAACATT TAAAGAACGT GGAACAGGTA TTTTAGAAAC GATTCCAGGC GCTGCAGgcG 3000

GTgTCCgCTT TATCCCTGAA ATTCCTTATG AGGAAGCGGA CAGTTGATTA TGGATTTATG 3060

TGATCGTTTG TCAGAACAAG ATCGCTTATT ACCTGGGGGC TATGTTTATC TTTCAGATTT 3120

ACTTGGGGAA CCGAATTTAT TAAGACAAGT GGGCCGGATT ATTGCTTCTA AATATTTAGG 3180

CAAACAAATT GATGCGGTAA TGACCGTTGC GACTAAAGGA GTGCCAATTG CGCAAgCTGT 3240

GTCTTATTAC TTAAATGTTC CGTTTGTGAT TGTTCGTCGC GATTCTAAGA TTACTGAAGG 3300

TTCAACGGTG AGCGTGAACT ATGTATCAGG GTCTTCAGAA CGTATCGAAA AAATGGAACT 3360

TTCAAAACGC AGCTTAAAAC GCGGCTCAAA AGTTTTAGTC GTGGATGACT TCATGAAAGG 3420

CGGCGGCACC GTCAATGGTA TGAAGAGTAT GATTGAAGAA TTTGAAGCAG AACTTGTGGG 3480

CATTACCGTT TTTGCTGAGT CAAAATTCAA TGGCCGTCGT GCAATCGATG ACTATACTTC 3540

GCTATTGTAC GTGGAAGATG TGGATACACA AACAAAAACG ATTAGCGTAG TCCCAGGAAA 3600

CTATTTCACA GAGAAATAAT TTGAAAAAGA ACCTGAAAAT GGTCCATTCC ATTTTCAGGT 3660

TCTTTTTTTC TGTTTTATTC CTTTAAAAAA AGATGAGAAA ATAAATTGGT CTACCTTTTA 3720

AATGCCTTAT TTTGTTGAAA AAGACAAAAG AGTAAGATAA ACTAGAAAAG GTATAATGAG 3780

GACGATTTAG AAGAAAATCA AACAATTTAG TTAATTTGAG GTGGCAAACG TGGAAAATAG 3840

ATACGCAATT ATTTTAGCAG CTGGTAAAGG AACACGCATG AAATCTAAAC TTTATAAAGT 3900

ATTGCATCCA GTTGCTGGTA AACCAATGGT CGAACATATT TTAGATCAAG TAGAACAAAC 3960

AGAACCAACA GAAATCGTGA CAATCGTTGG ACATGGGGCG GAAATGATTA AAAGCCATTT 4020

AGGCGAACGT AGTCAATATG CCTTACAAGC TGAACAATTG GGAACTGGGC ATGCAGTCAT 4080

GCAAGCACAA GAGTTATTAG GTGGTAAACA AGGAACAACA TTAGTTATTA CAGGGGATAC 4140

GCCGTTATTA ACTGCGGAAA CCTTGAAAAA TTTATTCGAT TACCATCAAG GTAAGAATGC 4200

AAGTGCGACT ATTTTAACAG CGCATGCGGa AGACCCAACA GGCTATGGTC GAATTATTCG 4260

TGATCATGTG GGCATTGTTG AACGAATCGT GGAACAAAAA GATGCCAGTG AAGAAGAAGC 4320

ACGTGTTCAA GAAATTAATA CAGGAACCTT CTGTTTTGAT AATGAATCAT TGTTTGAAGC 4380

GTTAgcGnAA ACAGATACAA ACAATACACA AGGGGAATAC TATTTAACAG ATATCATTGA 4440

AATTTTGAAA AAAGAAGGCA AAGCTGTCGC TGCTTACCAA ATGGCAGACT TTGACGAAGC 4500

AATGGGTGTA AATGATCGTG TTGCTTTATC CACTGCTAAT AAAATTATGC ATCGTCGTTT 4560

AAATGAAATG CATATGCGAA ATGGTGTTAC ATTTATTGAT CCAGACACAA CGTATATTGA 4620

TGAAGGCGTG GTAATTGGTT CAGACACAGT CATTGAAGCG GGAGTCACTA TCAAAGGAAA 4680

AACAGTGATT GGCGAAGATT GCTTGATTGG CGCACATTCA GAAATCGTTG ATAGTCACAT 4740

CGGCAATCAA GTGGTTGTTA AACAGTCTGT GATTGAAGAA AGTGTGGTTC ACGAGGGGGC 4800

CGATGTGGGT CCGTATGCAC ATTTACGTCC TAAAGCAGAT GTGGGGGCAA ACGTACACAT 4860

TGGTAACTTC GTGGAAGTAA AAAATGCAAC AATCGATGAA GGCACAAAAG TGGGCCATTT 4920

AACATACGTT GGTGATGCAA CATTAGGCAA AGATATTAAT GTCGGTTGCG GCGTTGTTTT 4980

TGTTAATTAT GATGGCAAAA ATAAACACCA AACAATCGTG GGTGATCACG CTTTTATTGG 5040

CTCTGCAACG AACATTGTTG CGCCAGTCAC GATTGGTGAT CATGCGGTGA CTGCTGCTGG 5100

TTCAACCATC ACAGAAGATG TCCCTTCAGA AGATTTGGCG ATTGCCCGGG CACGTCAAGT 5160

GAATAAAGAA GGCTATGCTA AAAAGTTACC TTATATGAAG GATTAATCTC AAATAAAAAA 5220

GAACGCTCTT TTGCTTAAGC AAAAGAGCGT TCTTTTTCCT CCTTAAAGAC TAGACCAATT 5280

GCTGCTAGTA TTTGTTTAGA GACGTTCAAG GACTAGACCA ATGCTGTTTA TTCGGTATTT 5340

TTAGTTATTT TTTTCAAAAA AATAGTTTTT GTTCCGCTTT TCCAATATAA AAAGACTTTC 5400

TTTTTATGTA AAGTTCGTTT TTTGGTCAAA ATTACTATTT TTTTCACAAA CAAATCCGCT 5460

ATATTGTCCA AAAATAAAAA CAATTTACAG GAAAAACATC TTATTTTAAC GGAATTTACT 5520

GTTTTTTATT TAACGGATAT ATTATGATAG GAAAAGAAGA GCAGCGTGAT GGGGGAAGGC 5580

TCTTTTTGTT GTTTGTTACT TTGACAATTG TTTGTTGCAA ATGGGTTGAA ACACAAGGAG 5640

GAAATTTGTG AAAAAGTTGA GCTTTAAAAA AGTGAAGTGG GQCATGCATT TTTTAATGGC 5700

TGTTGCGTTG ATAGCGCCAA GTGTTACTAG TACGGCATAT GCAGTAGAAA CAACGAGTCA 5760

ACAAAGTTCA GAAGCAGTAA CAAGTACCAC CGATTCAAGT AGAAAACAAG AACCAGTCAT 5820 

TACACAGGAA ACAACAGACA TCAAACAAGA AGCACCAAAT CAGGCTACGA GTGACAGTGT 5880 

CAAGCAGTCA CAAGAAACCA CAGCACCAAC AGAGACGACG AATTTAGAAA CGTCAATCGC 5940 

TGAAAAAGAA GAAACGAGCA CGCCGCAAAA AATAACAATT TTAGGTACGT CAGATGTTCA 6000 

TGGTCAATTA TGGAATTGGT CTTATGAAGA TGATAAAGAA CTACCAGTTG GTTTGTCCCA 6060 

AGTAAGTACA GTCGTTAACC AAGTCCGGGC ACAAAACCCA GCAGGCACCG TTTTAATTGA 6120 

TAATGGCGAC AATATTCAAG GCACTATTTT AACAGATGAC TTGTATAATA AAGCGCCTTT 6180 

AGTGAATGAA AAGACCCATC CAATGATCAC CGCCATGAAT GTGATGAAGT ATGATGCAAT 6240 

GGTTTTGGGA AATCATGAGT TTAATTTTGG TTTACCGTTA ATCAAAAAAA TTCAACAAGA 6300 

AGCCACTTTT CCAATCTTGT CTGCGAATAC CTACAATAAG GAAGATGGTC TTCGTTTTGT 6360 

TGAAGGGACT ACCACGAAGG AACTTGATTT TAATCAAGAT GGGCAGCCAG ATTTAAAAGT 6420 

TGGGATTATC GGCTTAACAA TTCCGCACAT TCCTTTGTGG GATGGCCCTC GTGTTACTTC 6480 

GCTTAATTTT TTACCTTTGA AAGAAGAAGC AGAAAAAGCA GTTACTGAGT TGAAAGCTAA 6540 

CGATCAGGCT GACATTATTG TTGCCTCGAT TCATGCGGGA CAACAAAATA GTGATCCGGC 6600 

TGCCAGTGCC GACCAAGTAA TTGAAAATGT GGCGGGGATT GATGCGTATA TTCTGGGTCA 6660 

TGACCACCTT TCTTTTACCA AGCAAGGAGC AGCGCCGAAT GGAAAAACTG TACCGGTAGG 6720 

GGGACCGAAA GATACGGGGA CAGAAGTTGT CAAAATTGAT CTTTCAGTTG CTAAAAATGC 6780 

CGATAAGTGG GAAGTGCAAG AAGGTACAGC AACGATTGTA CCAACAACGA ATGTTCCAGC 6840

AGATGAAGCA GTTAAGGCAG CGACAAAAGA ATACCaTGAA AAAACGCGAG CGTTTATTCA 6900 

GGAGGAGATC GGCACAGCAA CAGCTGATTT TTTACCAAAA CAAGAAATTA AAGGAATTCC 6960 

CGAAGCACAA TTACAACCAA CAGCGATGAT TTCTTTAATT AATAACGTTC AAAAAGAAGT 7020 

AACGGGCGCA CAATTAAGTG CGGCAGCGCT GTTTAAATAC GACAGTAAAT TACCTGCGGG 7080 

GAAGATTTCC TATGCCACGA TTTTTGATAT CTACAAATAC CCGAATACCT TAGTGAGTGT 7140 

TCCCATTAAC GGTGAAAACT TACTGAAGTA TTTAGAAAAA CAAGGGGCGT ACTATAACCA 7200 

AACACAGCCA GATGATTTGA CCATTAGTTT TAATCCAAAC ATTCGTGTAT ATAACTATGA 7260 

CATGATTTCT GGAGTGGACT ACAAGATTGA CATTTCAAAA CCAGTGGGTG AACGAATTGT 7320 

AGATGCGAAA ATTGACGGCC AACCGCTGGA TCCTGCCAAA GAATATACGA TTGCTATGAA 7380 

TAATTATCGT TACGGCGGTT TAGCTAGCCA AGGGATTCAA GTAGGGGAAC CTATTAAAAA 7440

TTCTGATCCA GAAACCTTAC GAGGAATGAT TGTTGATTAT ATTAAGAAAA AAGGAACTCT 7500

TGATCCAGAA CAAGAAATCG AACGAAATTG GTCAATTATT GGGACAAATT TTGATGAAAA 7560 

ATGGCGTGCC AAAGCAATCG AATTAGTGAA TGACGGCACT CTTCAAATTC CGACTTCTCC 7620

TGATGGACGT ACACCAAACG CCGCCGCTAT TACGAAACAA GATGTCCGTA ATGCGGGCTT 7680

TGATTTAGAT AATGCATATA CCATTATGCA CACAAATGAC GTTCATGGCC GACTAGAAGC 7740

AGGGAAAGgC GAATTAGGTA TGGCGCGTCT AAAAACCTTT AAAGACCAAG AAAACCCAAC 7800 

CTTGATGGTG GATGCAGGGG ATGTTTTCCA AGGATTACCA ATCTCCAATT TCTCCAAAGG 7860 

CGCGGATATG GCCAAAGCAA TGAATGAAGT TGGTTATGAT GCCATGGCGG TGGGAAATCA 7920

CGAGTTTGAT TTTGGTTTAG AGATTGCACT AGGTTATAAA GACCAACTGA ATTTTCCGAT 7980

TTTATCTAGT AATACGTVTT ACAAAGATGG CAGTGGACGG GTTTTTGATC CGTATACAAT 8040

CGTAGAAAAA TCCGGGAAAA AGTTTGCCAT TGTAGGTGTG ACGACCCCAG AAACAGCAAC 8100 

GAAAACACAC CCGAAAAACG TAGAGAAGGT GACATTTAAA GACCCGATTC CAGAAGTAGA 8160 

AGCAGTGATT AAGGAAATTA AAGAGAAGTA CGCGGATATk CAAGCTTTCG TGGTTACTGG 8220 

GCATTTAGGC GTAGATGAAA CGACGCCGCA TATCTGGCGT GGTGATACGC TAGCAGAAAC 8280 

CCTTAGTCAA ACATATCCTG AGTTAGATAT CACTGTGATT GATGGACATT CGCATACAGC 8340 

CGTCGAAAGT GGCAAACGTT ATGGCAAAGT GATCTATGCT CAAACAGGTA ATTATTTAAA 8400

TAATGTTGGG ATCGTCACAG CACCAGAGAG TGAACCAACT AAGAAAACAA CAAAATTGAt 8460

TTCagcAGCA GAGCTGCTAG AATTGCCAGA AAACCCGGCA GTTAAAGCCA TCGTTGATGA 8520 

AGCACGTACG AATTTTAACG CTGAAAATGA AAAAGTAATT GTCGATTATA TTCCATTCAC 8580 

ATTGGATGGA CAACGAGAAA ATGTGCGCAC ACGAGAGACC AACTTAGGGA ATTTGATTGG 8640

TGATGCGATT ATGTCATATG GCCAAGACGC GTTTAGCCAA CdTGCTGATT TTGCAGTAAC 8700 

TAATGGTGGC GGCATTCGCG CTGATATTAA ACAAGGGCCA ATTAAAGTTG GGGATGTCAT 8760

TGCTGTGTTA CCTTTTGoCA ATAGCATTGC GCAAATTCAA GTAACCGGCG CCCAAGTTAA 8820 

AGAAATGTTT GAAATGTCTG TTCGTTCGAT TCCACAAAAA GATGAGAATG GCACAATTTT 8880 

ACTAGATGAT GCTGGCCAAC CAAAACTTGG CGCAAATGGT GGTTTCCTAC ATGTTTCAAG 8940

CTCCATTCGT ATCCACTATG ATTCCACAAA ACCAGGTACT CGCTTGGCTA GTGACGAAGG 9000 

CAATGAAACA GGACAAACGA TTGTCGGTAG TCGCGTATTA GGAATAGAAA TTAAAAATCG 9060 

GCAAACACAA AAGTTTGAAC CATTGGATGA GAAGAAACAA TACCGGATGG CTACCAATGA 9120 

TTTCTTAGCT GCTGGTGGTG ATGGTTACGA TATGCTAGGT GGTGAACGAG AAGAAGGGAT 9180 

TTCACTAGAT TCTGTCTTAA TTGAATACTT GAAAAGTGCA ACCAGCTTGC GGTTGTATCG 9240

TGCAGCAACG ACGATTGATT TAGCACAATA TAAAGAACCA TTCCCAGGCG AACGAATTGT 9300 

TTCTATTTCG GAAGAAGCTT ACAAAGAGTT AATCGGTGGA GGAGAGACGC CAAAACCAGA 9360 

TCCAAAACCA GACCCGAAAC CAACACCAGA AACACCAGTA GCAACCAATA AACAAAACCA 9420

AGCGGGAGCA AGACAGAGCA ATCCATCCGT AACAGAGAAG AAAAAGTATG GCGGCTTTTT 9480

ACCTAAAACG GGTACAGAAA CAGAAACGCT TGCATTATAT GGTTTACTGT TCGTTGGACT 9540

TTCTTCTTCT GGCTGGTATA TTTATAAACG ACGTAACAAA GCTAGTTAGT CATTTTTTAA 9600

GAACCGTCcT TCGGGATTGG TTCTTTTTTT ATTTTAATGC TTGCCATTTG TTTTTAATTT 9660

TAGTATTATA AGAATATtCT AAAAATTAAA AAATGAAGAG TTTTTTCTGT GAAGGGAGAA 9720

GGGCTGAAAG TAAACGTATA AAGGACTTAT CGGGTTAAGT TTACAACGCT GTTAGTTTAG 9780

CTGCCTGTGG TGGTGATGGG GGACCTGGCG CCG 9813

(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32768 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
CATTGGGAAA AGTGGAAAGC CTTAAACCTC GCAATCGCCC AAGGTTTACC AGCTGAACTC 60 

GGCATCACTA AGCCgAAAAT TGAAAGCTGG AcCAACGGCT GGAATTTACG CAgTCATTTC 120 

TGGGCTGCGT ATCGCAGTGA GCAACGTCAA GCGGAAAATG CTTGTTTGGC GTTGTTATTA 180 

AATAAAAAGC AATTTCAAGT GTATTTGATG TTTCAACATT ACAAAAGTGA GGAGCGGACA 240 

GGCTCAGTCG TCGCTTACAA TCAATTGCTT AATCGTTTAG AAGCTTGGAG TCAAACCATC 300 

GACTGCGAAG GCTATTATAT TTGGCCGCAG gAGGAGCATG AACTAGTGGA CCATTTGCCT 360 

TTAAAAGATT ATTTAGCAGA TGAAAAACAA CGGAAAATTC TAGAAGAGCG GATAGGCAAT 420 

CGCTCATTTC AGATCGGTAA GTTGTGGTTT GCGACGGAAA AGCTTGAAAC GATTGAAGAA 480 

AAAAGTTGTC AGGCGCTTCA AGAATTAGCG CCGTTGTATA ACGCATTATC TGAAAAATGA 540 

ACGAAACAAC GTCTCTGACC AATTACCTGG TCAGAGACGT TGTTTGTGTT TTTATTGATT 600 

TAAATCTGTT AGTAGTAGCT CGTTAGAAAC AGCAAAAGTA GTAACTTCTG TTAAAGCTGG 660 

ATCCAAAGTG TATTTTGTAA ATGAGGTAGT ACTTTCTGGT TGGACATACA TGGTTAAATT 720 

AGTTACATCA TATCCTGACC AATAATGTGT GAAATCAGAC AGAGAATTAC TTTGACCTAA 780 

ATGGCCAATT CCCCGAGGAA TCATCACTGT TTGGAAGGCG CGATAGAGTT GCATAATCCC 840 

TTCAGAGCTA TCAAATTCTT GGAGATTGTC AGAATAGTAT AACGCACGAA TGAAGCGTGA 900 

CGGCGAAGTG AAGTCTCCTG GGATTCCTAA TAGTCCGTAA GAAGTCCCAC CTTCAATTGG 960 

GTCAAGTGTT GCGCCTGATT TTAAAGTGTA AGGTTCCTTA ATAGCATAGT TGCGCATCCC 1020 

gATATAGTTT CGTGCATTGG TTAAATGATA ATTGTATTCT GGACTGTTTG TGACTACGCC 1080 

AATCGAATCA TAAATGACGA ACGCTCCGTT ATCTGTTGGT TGCAAAACGA TACTATTATT 1140 

ATTTTTATCA TTAATGGTCC AGTGTAAGGT AAAATGAACT TCCCCCAATG AATCAAGTTG 1200 

GAATTTTTGA TCTAACAGGC CGATTTTTTC AAAAGCTACT TTTACTTCGT CAACGCTTTC 1260 

AAAATTACTC AAAATATAAG CAAGTACTTC TTCTGCAATA TGAGGTGTTA AACCACGATC 1320 

TGCTAAACTT TCAGCAGGTG CTGTTGAAGA TTCCATTAAA TATTGAATAT CACCAGATAA 1380 

GCCGGCTTCA TTGATACCAT CTGTGATAGC AAGGCTGACG GTTTTACCAT CGTTCGCTGG 1440 
TTGATCTGTG CCTGACATTG CTAGTCCCAT AAAGGCATAT TTTGCCGTCC AAGGATTTAG 1500 

TTGGCTGTTT AGAACTGTAC CTTTTGGAAA TTGAGCAATT AGAGTAGGGA TTTTTGGTGC 1560 

AATTGGATCT TCGTTGCCAA AAAAGTCAAA CGTAAAATCC ATCGTACGTC CATAAAAAAT 1620 

ATCATTTGTT TTGGAAATAA TCTTGATGCC TGTACACATA ATAGTTTCCC CAAATTAATG 1680 

TAAGTTTGTG AAGCCTGTTT ACTAGTTAAG CATATCATAC TCTAGGAAAC AGCGGTGGCT 1740 

AAATTCCTAA AATATTAGAC AATTTTGTCT AAATGAGGTG AAAAAAATTT TTTAGCGATT 1800 

TCCTTCACAA AACACCTTAT TATCTTTCCG AAAATATGAT ATCTTTAGAA TGGAAATTTC 1860 

ATGAAAAGAG GATGCGAATG TTTAAAAAGT ACACACAAAT GATAAGGGAT GAATTTAAAG 1920 

GGTATAATCA AAAAAAAATG AAAAAAGATT TCTTgGCaGt TTGACTkTCG CAGCCGTCGC 1980 

GTTGCCATTA GCGTTAGCGT TTGGGGTCTC TAGCGGTGCT ACTGCAGCAG CTGGCATGAT 2040 

CACGGCGATT ATCGCTGGAA TTGTGATTGG GAGCCTTTCG GGTGGGTTTT ATCAAATTTC 2100 

AGGACCAACA GGTGCGATGG CAGCTATTTT AATGTCAATT GCGGCAGTGC ATGGCATGCA 2160 

AGGGATTTTG TTGGCCACCT TTTTGGCGGG GATTTTCTTG CTTTTGGCAG GGATTTTGCG 2220 

CTTAGGGAGC TTGACTTCTT TTATTCCGTC ACCGGTCATT ACCGGGTTTA CTTCAGGAAT 2280 

TGCTATTATT ATTGCGTTGG GGCAGCTTGA TAATTTCTTT GGGGTTCgTT CAGAAGGACA 2340 

CAATTTAATT GAGAAAATCA GCAGTTATCG TTCATTGGGC TTTGATTTAA GTATACCAAC 2400 

GACGGTTATG TCGATTTTAG TGGTGCTTGG ATTAGTGTTC TTTCCGAAAA AGTGGAATGC 2460 

GATGATACCA GCTTCATTGG TGATGATTAT TTTAGCTACT TTAGCAACAG TGGTTTTTCA 2520 

ACTGCCAGTG GCCACTGTAG GGAAAATTCC GACAAGTATT TTTAGTGATA CACGTTTGGA 2580 

ACTTGGTTCA CTTCAATTAA GTGCGTTTCA GGATATTTTG GTGCCTTCGA TTAGCATTGC 2640 

TATGTTAGGT ATGATTGAAA GTTTGCTTTG TGGCGCTTCG GCTGGTCGCA TGACAGGGAA 2700 

ACCATTAGAC AGTAATCAGG AACTAGTGGC ACAAGGAATT GGCAACTTAA TTTTGCCATT 2760 

TTTCGGTGGC GTGCCTGCGA CGGCAGCGAT TGCCCGGACG AGCGTAGTGA TTAAATCTGG 2820 

TGCGCAAACA CGTTTAGCGG GTATTTTTCA TGCAGTCTTT TTATTGTTAT CGATGTTGCT 2880 

TTTATCGCCA GTCATGTCTA AAATTCCCAT GCCTGCGTTA CTGGTGTTTT aGTGGTGACA 2940 

GCGTGGCGAA TGAATGAATG GTCGGTGATT AAAGAATTCT TCCAGAAACG TTTCCACACA 3000 

GCTTTATGTT TATtCTTCTT AACGATGGTA GCGACCGTTA TTTTTGACTT GAGTTTAGCG 3060 

ATTGTtTTAG GGGTCTTAGC GGGTTGCCTG TTTTyCATTG CGAAAAGTGC CGTGATTACG 3120 

ATTAATGTCG AGGAAGTTGA TTGGTCAAGA ATGGGGCTAG TACCGACCGA TTGTACGGAA 3180 

AATTGGGTGG TAGTTTATTT GAGTGGCCCG TTATTTTTCA TGTCTTCTGA ACGATTGAAA 3240 

CAAACGTTAC TGCAATTGGA AACAAAAAGT GGTGTAGTAT TTTCAATGCG AGGAGTTCCT 3300 

AGCATTGATA GTACGGCGTT AGATATTTTA GAAGAATTTT GTCTCGCAGC AGAAGGTCGT 3360 

GGACAACAAG TTCAATTTGC CGCTCTGCAA CCAGAAGTTG AAAAAATGAT TCGAACCATT 3420 
CAAGGAAATC AAGAGAAAGA ATACCATTTC TCAGTCGCAG AATTTTTACA AACGGTGCAT 3480 

GTTGCAGAAC AATAATTCAT TTTCAAATGA AATGTTCTAT CTTCTATCGC GGGCTTAATG 3540 

GGTGAAGAAA TAAGAAAAAT AAGTGGGGAC TAAAATGAGT TTCTAGAGGA GTTCCTAAAA 3600 

AATAATAAAG AGAAAACTTC TCAGTATGAA GTTTTCTCTT TATTATTTTT TTGGTTATTT 3660 

ATTGAATTAC TTCTAGCATA ACGTCTTTTT GATCATCCGT AACATTGCCA GCGTAGTTAG 3720 

GAGTGATTAA AGATATCGAA CCATTTTTAT TGTTGAATAA GTATAAAGTG CCCGATCTAT 3780 

TTTGTTCATC GTTTGAAGAA ATATTAGTTC CTAAAATTAA TTCTGTATTT ACTTTAACGG 3840 

TGCGAATACT GTTATCGGCC GCTGAAAATA TTCTAATCTC TTTTGTAGGA ATTGTATTTA 3900 

TAGATAGCTC ATATTTATTA ATGGCTTCTT TCACTTGACC ATTTTCAGCC CCAAACAATT 3960 

TAGTTCGGAA GGTAGCAGTT GGTGTTGAAT TTAAGTTCTC TAACGTAATT GAAGTTGGCA 4020 

CATTCATTCC TTTAAAATTA AATACGAGTG GATTGTTTAA GCTACTAAGA TCGACAGCAT 4080 

AAGGTGTTGC TTCTAAAGAA TTTGCTTCTT CGACATTTGT AGATACTTCA TTTTTCGACT 4140 

TTGACGAAGT TTGAGTGGTG CTGGCACTTT CTGAAGTTTC TTTTAGTTTA TTATTAGAGC 4200 

TTGTTTCTAT GCTAGAACTG TAATGATTAG TACTGCTTTT TATTGATGTT GTATGATTGG 4260 

TAGAATTTTC TTGTTTTTTA GGTTCATTCG CTTTTTGAGA ACTACAACCT GTTAAAAAAA 4320 

TAACTAAAAT AACACAAACT AATTTTTTCA TACTATTCCT CCTATAGACG CTATAACTAA 4380 

TTTTATCTTA CTTGTTAATA TAACAAAAGA AATAATTATT TTATTTTCAC GAATGCTCAA 4440 

ATTGTTTAAA TCTGTAGTTT TTATACTTAT TTTTGTTGCA TTTGCAACAA AAATAAGTTA 4500 

AAGTAATGAC AATCATAAAA TATTAGCGAA AGGAAGTGTC CTAGTGACAG ATATTTTAAG 4560 

AGAAATTGGC ATGATTGCGC GGGCCTTAGA TTCAATCAGT AATATTGAAT TTAAGGAATT 4620 

ATCTTTAACA CGAGGACAAT ATCTTTATTT GGTCAGGGTC TGTGAAAATC CTGGGATTAT 4680 

CCAAGAAAAG ATCGCCGAAC TGATTAAAGT TGATCGAACC ACAGCGGCGC GGGCGATTAA 4740 

GCGTTTGGAA GAACAAGGCT TTATTTATCG TCAAGAAGAT GCAAGTAACA AAAAAATTAA 4800 

GCGGATTTAT GCCACAGAAA AAGGGAAAAA CGTTTATCCA ATCATTGTGC GCGAAAATCA 4860 

ACACTCGAAT CAAGTTGCTT TGCAAGGCTT GTCAGAAGTG GAAATTTCAC AATTGGCAGA 4920 

TTACTTAGTC CGGATGCGTA AAAATGTCAG CGAAGATTGG GAATTTGTCA AAAAAGGGAA 4980 

CACTCGAAAT TATTAAGGAA AGGACGACTG ATTGATGGGA GAAATAAAAC AATTAACAAT 5040 

AGAAGATGCA CCAGCGTTAC AAGAAATCAG CATTGAAACT TATACAGATA CTTTCGGACC 5100 

ATATAATACC CCAGAAAATA TGCAAGCCTA TTTAGTAGAA GCCTACAATT TAGAAAAATT 5160 

ACAAAAGGAA TTGGCTAATC CCCACTCTGA CTTTTATTTT ATCTTTGTCG ATAATCAATT 5220 

AGCTGGCTAC ATGAAATTAA ATCGTGAGAC GGCACAAACG GAACCGATGG GGCCAGAAAA 5280 

GCTCGAGGTT GAGCGATTGT ATATTTTGCC TGCGTTTAAA GGaAAGAAGC TGGGGACACA 5340 

ATTATTAGAA CTGGCGGAAG AGAAAGCTCG TGAATTTGGT AAAAAAGCAC TGTGGCTAGG 5400 
TGTCTGGGAA CACAATGATG CGGCGCGGGC ATTTTATAAA ACGATTGGCT ACCGTTATTA 5460 

TAGTCAACAT TCGTTTTTTA TGGGAGACGA TGAACAGACG GATTTTATTT TGTTAAAGGA 5520 

ATTGTAGCcA AGAAAATTTA GAAACAAAGA TTTCTGATAA AATCAGAAAT CTTTGTTTAT 5580 

TTTTTTGACT AAAATTAAGC ATTTTTTTAT TTTCAAAATG TTATTTTTAA AAACGTAAAA 5640 

ATTGAAAAAG AAGATTTTGA ATCTGTGAAA AGTTGTCGAT CTTGTGAGAA AACTCTTTTG 5700 

TCAGTGTCTT TTTTCAGGAT GGGTCTCTTC CTGAAAaTCC ACTTCAGTTT ATCTGTTTTG 5760 

TGTGTGTTTG ATTAAAAAAT GTTGTGTTGC ATTTCATTTA ACTGTATAAT TTaTTTATGT 5820 

CATTTTTAAT AATAAGCGTA TTTTAGCATT TGamAAGAGA ATTTGTACAT AATATGTGAA 5880 

TAAAAAGATT TGTTTCTTAC TTTCTAGAGT GAGAGGACGA TATAAAAAAG GTTGCTATTT 5940 

TTTCGTTCAA AGTGAACAAA TAATCGTTTG TTATTTAGTT GGGGTAACTA ATGATTATTG 6000 

TTATAAAAAG CGCTTACAAT TCGTGTGATG AGAGTAGTAC GCATGTTTGT GCAAAAGTGA 6060 

CAAAAAAGAG AAGGCAGGGT GTTTTAATAG AAAAAATTTT TTAATAAAAA GGAGAGTTTG 6120 

TCGTGGAGAA AAATGTGTCT CAAGTAACAG CTCAAGAAGA AACAGCGCTG AAAGCACGTG 6180 

GCTACAATGA GGATTTGTTG CCAAGTAGCC CGAAACAACG GACGATGGGC GCGCGCAATT 6240 

TTTTTACATT ATGGATGGGT TCGATTCATA ATATCCCGAA TTATGCAGCA GTGGGCGGCT 6300 

TTATTTTCTT AGGTTTATCG CCGCTGCAAG TTATGCTGGC AGTTGTGTTA AGCTCATTCA 6360 

TTGTAGCAAC TTTCATGAAT TTGAATGGCG TGGCTGGCTC TAAATATGGC ATTCCTTTTG 6420 

CGATGCACTT ACAGTCTACT TACGGCAGTC TCGGGGCAAA ACTGCCAGGG TTCTTAAGAG 6480 

GTTGTGTGGC CGCGATTGCT TGGTTTGGTC TGCAAACATT TACAGGATCG CTGGCTTTAT 6540 

TAATTATTCT CGGGAAATTT TGGCCGAACT TTTTAGAAAT TGGTGGATCA TTCCAGTTTT 6600 

TTGGTCTCAG GCTACCTGAA TTAATGGCGT TTACTCTTTT CTGGCTCTTA AATGTGGCAA 6660 

TAGGTTTCGG TGGTAGCAAA ATTTTGAACC GGTTTACCGC CATTTTAAGT CCGTTGATTT 6720 

ATGTCGTTAT TATTGGATTA ACGATTTGGG CCATTCGAGC TGGGGGCGGT TTAACACCAA 6780 

TTTTGTCCTA TCAAGTTTCT GGTGCGATTC GTTCAGTCAA TCCACTGGTG GCGTATCTGA 6840 

TTATTTTTAA CTCGGTTGTG GCCGTTTGGT CAGCGCCTGG TGCGAGTGTG GCGGATTTTA 6900 

CTAAGAATGC GCGCTCGACA CGTGCTCAGG TTGTGGGGCA AACAACAGGT TTAGTTGTTG 6960 

GTTATGGGAT TTTCGCTTTT TCCAGTGTGG TTATTCTGTT AGGTGGTTCG CTGTATTTTG 7020 

GGATTCAAGA ATGGAATATT TTAAACATTA TTGATCGTTT AGATAATGTG GCAGTGGTGG 7080 

TCTTAGCCAT GTCGGTTTTC TTATTAACGA CTATTTCAAC GAATGCAACG GGGAACATCA 7140 

TTCCGGCAGG TTATCAATTG GCGGCGCTAT TTCCTAAAAA AATGACGTAT AAAAAAGGCG 7200 

TCATGATTGC CAGTGTTATC AGCTTTCTCA TTATGCCTTG GAAACTGATG GAAAACGCCG 7260 

ATAGTATTTT TATTTTTCTG AACGCAATTG GGGCTGTTTT AGGACCAGTC GCTGGTGTGA 7320 

TGATTGCGAA TTATTATTTT GTTCAAAAGC AACAAATTGA TTTAAATGCT TTATACGTAG 7380 
ATAAACACAA AAAAGAGGAA GCCAATCCTT TTTACGGATT AAACAAGCCG GCGTATGTTG 7440 

CTACTATTCT GGCGTTGGTT CTTTCCTTAA GTGGTCAATT TATTCCACAG GTTAAAATCA 7500 

TTGCTGATAT TTCGTGGTTT GTGGGCTTTG CCACAGGTTT TGTGCTGTAC TTAGTGTTAA 7560 

AAAAATGGAC TTGGGATTCA AAAAAAGTGA AAGAAACAGC TTATCAGGAG GGAAAATAAA 7620 

TGCAAGCAGA GTTAGTGATT AAAAACGGGT TAGTTATTTT AGAAACTGGA GAAGTCATTA 7680 

CGGATGTCGC CGTTCAAGGA GGAAAAATCG TAGCGATTGG GCAAGACTTG TCAGGAGAAC 7740 

GTGTGATTGA TGCGACGGGT TTAGTCGTTA GTCCTGGGAT GGTGGATGCT CATGTTCATA 7800 

TTACCGATCC TGGTGGTGGT TACCGCGATG AATGGGAAGG GTATGTCACA GGAACGGCCG 7860 

CTTGTGCCAA AGGGGGCGTG ACGACCTTTA TGGAAATGCC TTTAAATCAG ATTCCAGCAA 7920 

CTGTGGATAA GACCTCTCTT GAGATCAAAT ACAAGGCAGG AGAAAATAAA CTAAAAGTAG 7980 

ATGTAGGGTC ATTTGGGGGC GTGGTCCCGA CCAACTTGGC TGATGGGATT CAGGAACTTG 8040 

ATGAAGGAGG CGTTTCTGGC TATAAATGCT TTTTGGGAAC GTGCGGCGAT CGTTCAATTG 8100 

AAGGAGATTT TCAAAATGTC GACGATTATT CATTATATGA AGGAATGAAA CAAGTCGCCA 8160 

AAACTGGCAA AGTGCTAGCA ATCCATGCAG AAAATGCGCC AATTACGGAT AAATTAGGCG 8220 

CGGTAGCTTA TCAAAATGGC GAAACGACTT TAGCGGCATA TGTAGCTACA CGTCCTGTGT 8280 

TTACCGAAGT TGAAGCGATT CAAAAAGCCA TTTTATTTGC CAAAGAAACA GGTTGTCGGA 8340 

TTCATATTTG CCATGTGGCT TGTCAAGAAG GCGTGGAAGA AGTTTTGAAA GCCCAAGCAG 8400 

AAGGGGTAGA TGTGACTTGT GAAACATGTA CGCATTATTT ATATTTTACC ACGGATGAAC 8460 

TTGATGCCAT TGGTCCAGTT GTCAAATGTT CACCGCCGAT TCGTGACGCG GACCAACAAG 8520 

CCGCCTTATG GAACCACGTG CAAACAGGCG GCATCGCGTT TGTAACCTCT GACCATTCTC 8580 

CTTGTACGCC AGACTTAAAA GATACAACCA ATGCTTTTGA AGCTTGGGGC GGCATTTCTG 8640 

GTGTTCAAAA TAACGTCGAT GTGTTATTCG ATGAAGCCGT CCAAAAACGT GGTTTATCCT 8700 

TGAAACAATT CGCGGATATG ATTGCAGCAA ACCCAGCTGA CCGTTATCAC TTAGCACAAA 8760 

AGGGCCGCAT CAGCATTGGT AAGGATGCCG ATTTTGTACT AATTAAACCT AATGCGCCGT 8820 

ATATCTTAAA AGCAGAAGAT TTAGAATACC GAAATAAAAT TAGTCCTTAC ATTGGTCGTG 8880 

AAATTGGGGC CCAAGTGATT CAAACAATTT TAAGAGGCGA AACAATTTAC GCACAAGAAA 8940 

CTGGGGTAAC CGAGGCGTTT AACGGCGTAT TTATCAAAAA TTAAACACCT ATTTTAGAAA 9000 

TGAAAAAGGA CAGAAGCGTG ATTGAATTAG CGCCTCTGTC CACGAGTTAG AGGATGTTTG 9060 

TTGGTTGTTT TGCACAAAAA AAGGCTTAAT TTTGTTTGAT AAGTACAATG AAGATGGCTA 9120 

GGGAAACGTT TACAATCAGA GTGTAATCAG TCAGGAGGTT TTGTGATGGA CTTGGAAAAA 9180 

GTTTTAAAAC AAAGAATCGA AGAGTTGTCA GCAATTGGGA GTGATCCTGC AGGAGGGATG 9240 

ACACGCTTAC TTTATACAGA TTCGTGGTTG GCAGCACAAA AGTATGTGCA AAGCCAAACT 9300 

GAGGCGTTTG GACTGGAAAC GCAATTTGAC GAAGTCGGGa ATCTGTTTTG TCGAGTGGCA 9360 






GGAACGGAAT TTCCTCAGGA AACGATTTTA ACTGGGTCAC ATATCGACAC AGTAGTTAAT 9420 

GGTGGGACCT TAGATGGTCA ATATGGTGTC ATTGCTTCCA TGACAGCTGT TCAATATTTA 9480 

TTAGACAAAT ATGGGAAACC ACGGCGTTCT TTAGAAGTGC TTTCCATGGC TGAAGAAGAA 9540 

GGGAGTCGTT TTCCAACGGT TTTCTGGGGA AGCAAAAATG TCGTTGGTGA AGCCAAACGT 9600 

GAAGAAGTAG TCGATATCTC TGACAACGAA GGTAAAAAGT TTGTTGATGA GATGCATCGC 9660 

CAAGGCTTTG ATTTTCGGCA AGAGCAAACT AGACGTGAGG ACATTGAAGC ATTTGTCGAA 9720 

ATTCATATTG AACAAGGCAA TATTTTAGAG AACGAAAAGC TCCAAGTCGG TGTTGTTCAT 9780 

AGTATTGTAG GGCAACGTCG CTACACGGTT AACTTAAAAG GGCAAGCCAA TCATGCAGGT 9840 

ACTACGCCAA TGAGTTACCG TCATGATGCG GTGTACGGCT TTGCTAAAAT TTGTGCAGAA 9900 

GCCATTGATC GTGCGAACGA AATTGGGGAT CCGTTAGTTT TAACATTCGG CAAAGTCATT 9960 

CCGAAACCCA ATACAGTCAA CGTGGTGCCT GGAGAAGTGG AATTTACAAT TGACTGCCGT 10020 

CATACCGACG CTGCTTTTTT ACGACATTTT ACAGGTGAAT TAGAAGAACG GATGAAGACA 10080 

ATTGCTCAAG AATTAGGCTT GACGATTGCG ATTGATCGCT GGATGGATGA AGCCCCTGTA 10140 

CCGATGAATC AAGCAATTGT AGAAGTTATT GAACAAAAAG CCAAAGAAGG CAAGTATCAG 10200 

TATCGTATGA TGCATAGTGG TGCTGGGCAT GATTCGCAAA TTATTGCGCC TCATTACCCG 10260 

ACAGCGATGA TATTTGTGCC AAGTATTGGT GGGATTAGTC ATAATCCTGC AGAAGCAACG 10320 

GCATTTCCAG ATTTGGTTGA AGGTGTCAAG CTTTTAGCAG ATACCTTATA TGAATTAGCG 10380 

TATAAATAAG ATAGAAAAGG AAGAATTGTG ATGGGTTATA AAAATAATCG AGTTGGTTAC 10440 

CAAAAAGAAT TGTTAACATC AAGAGCTGTG ATAAAAAAAG ACAATTACGC CATTATTCCC 10500 

CATGATGGTT TGGTGCAAAA TGCAGTACCT GGTTTTGAAA ATGTCGATAT TTCCATTTTG 10560 

GGATCACCTA AATTAGGTGC GACATTTGTG GATTACATTG CTACGTTTCA TAAAAATGGT 10620 

CAACAAACAA CTGGTTTTGG TGGCGACGGG ATCCAAACGT TGGTTTATGT CATTGACGGT 10680 

CGTTTGCGCG TTTCAGATGG TCAAGAAACT CATGAACTTG AAGCCGGGGG ctACGCCTAC 10740 

TTTACGCCTG AAATGAAAAT GTATTTAGCG AACGCACAGG AAGCGGACAC AGAAGTTTTC 10800 

TTATATAAAA AACGCTATCA GCCGTTGGCA GGCCATCAAC CTTATAAAGT GGTTGGGTCA 10860 

ATTCATGACC AACAACCCGA AGAATATGAA GGAATGACAG ACGTGTTACT TTGGAGTTTA 10920 

TTGCCgAAAG AGTTTGACTT TGATATGAAT ATGCATATCT TATCTTTTGA ACCAGGTGCT 10980 

AGTCATGCCT ACATTGAAAC GCATGTCCAA GAGCATGGCG CGTATTTAAT TAGTGGCCAA 11040 

GGGATGTACA ATTTGGACAA CGAATGGTAT CCAGTGGAAA AAGGGGACTA TATTTTCATG 11100 

AGTGCCTATG TGCCACAAGC AGCCTATGCG GTTGGTCGTG AAGAACCGCT AATGTACGTT 11160 

TATTCCAAAG ATGCCAATCG TGAACCAGAA TTATAAAGAA AAGCCGAATG TAAAGTAATG 11220 

ACCTCGGAAC AATAGGATTT GAACACGCAG tGgTGAACGG TAGAGTCCTT CTTCTTGGTA 11280 

GACGTTACTT TTATTCGGCT ATTTTCCTGT TAATTATTTT TCTGAAAGTT CTAACGCTAC 11340 
CCGTAAGAGT AAACTCGTCT CAGGCTCTTG GATAGAGGTG CCTAATAATT TTTCACATTG 11400 

ATTCATGCGA TATTTAATGG TGTTTCTGTG AAGGTAAAGA GCGTTGGCCG TTTTAGTAAT 11460 

TTCGCAATTG AAATCTAAAA AAACTTTCAA GGTTTTTCGT AATTCTTGTA ACGTCGGTTC 11520 

CGTCGGATAG GCCAACTCTT TCAATTGTTG CTGGCAAAAG TATTCAACAT CTTCAGTGCC 11580 

AATCTTTTCA AAAAGTCCAG CTAGGCCTTT GGGATGGAAC AATTGCACAG TCGCTGGTTT 11640 

TTGAGCGTGT AAAGAAGCCT CTAATGTCGA GGAAGCTTCA ATATAAGAAT TAGGCAAATC 11700 

TTCTAGGTTT TCATAAGCAT TGCCTAAAGC AAAGCGAATG GTAATAGGCA AAGCCTGTTG 11760 

CAAACGTTCA GCGAGGTTTT GCAAAATCAT CAAATGGTCA TTTTTTTTAG ATTGAAAAAT 11820 

CAAGATACTT TTATTTTGAT TTTTTAATTT AAACAAGGCG ACATCTGGCA GGATTTCCGG 11880 

CAATTGTTCT TTGAGCCATT GGAAAATTAG TTGCCCTTCC GCTTGCTGAT AACGAATATG 11940 

GGTTTCATTT TCTGGTTTAG TCACGCCGTA GACAATCGCT AATTGATAAT AATCACTGTT 12000 

GATTAAACGA TAATTTTTCC AAAGTTCCAA CCAATTTTGA TGTTTACTCA GCGCTTCTTG 12060 

ATGTGTATCA AGCAAGCGGT CCAAAAAGTC TGTTTTGAGA TGCTCAAAAG ATTCAGCAAT 12120 

TTTCTGGTTT TTAAATAACA TAAAAGTCAA AACTAAAATG GCCTGATCAA TGGCAAATTC 12180 

AGAAATAGGA TAAGGAATCT GTTCAGGAGA TAAGACCAAT AGATAATAGG GAAAATAATC 12240 

GTTCACACGA ATTGGAAAGC CAGCTACTTG AATGTTTTCT TGTCGCTCAT CCTGTAAACG 12300 

AAAGATTTTC TTTTCTTGTG CCAATTGTTG AAAGTGGTCT TTTGATAATT GTTCGATGAA 12360 

AAACTCCGCC GATTTTTGAT TGCCATAAAA ATAATTCGAG TGGGCAATCA CCTGTTGCCA 12420 

AGGACTTAAT AAAATAATTG GCGCATTTAA AATTTTCGCA AATTCAGCGA TGAAGCGCGT 12480 

AGCATCAACA TCTTGCATTA ATAATGTTGA AAAACGCTTT TGTATGTCAA ATGCTACAGA 12540 

CATCTGTTCG GTTTTGCTAT CACGAAGATA GCCTACAATT TCATGCAACA ACCCGCCTAG 12600 

TGGCTGTGTG CTAGGAATTT TAATCAGGGG CAAATCAACG GCACTGGCAT AGGCAACAAT 12660 

TTCTGGTGAA ATTTCGTCAA GGAACCGCCC AACTTTGATT CCTAAAGCCG TACACTCTGC 12720 

TTGTTTTAGC GAGTCAATAA AAGGTTTCAA TTTTTCTTGA TCATCTTTAT AAATCATTGC 12780 

CGTTGTTAAA ATCATGACAT TTTTAGGGAT AAAATCAGCC ACATCAGGTG TTTCCGTGAT 12840 

TTCTACACTT TCCAAGGGCT GTGTTAAGTT TGAGTGAGAC GATAATAATT GTAAGTCAGA 12900 

AAATCTTGGT AAAGCCAAGA CCTCAGCTAA TATTTTCATC ACAACAACCT CCTTTTTTTA 12960 

TAGTATAGCG AAGAATAAAA AGGAGTACAA AAGATAGTTA TTGAATATGT CTAAAAAACA 13020 

AAGTCAAAAT AGTGCAAATT GAACAATGAA TTTCTGGGTT TTTCCAAAGA TAATAAAAGA 13080 

GAAGAATTAA AAATGAAGTG AGGGATTACT ATGTATCAAC AACCAAATTT TGTACGGCGA 13140 

ACAATCATGA CGCCGGGACC AGTCGAAGCC CATCCATCGG TACTTCGTCA AATGGGTCAA 13200 

CCAATTTTAG GGCAATTTGA TCCAGAATTT CTACAGATTA TGGACGAAGT TCGGGAAATG 13260 

ATTAAAGTAC CTTTTGCCAC AAAGAATCAA CAAGCCTTTG CCATTGATGG CACGTCACGT 13320 
TCAGGCTTGG AAGCGGGGTT ATTTGCGTTG ATTGAGCCAG GCGACAAAGT GCTTGTGCCT 13380 

ACGTATGGCC GGTTTGCTTA TTTATTAGGT GAAATTTGCG AACGCGCCCG GGCCGAAGTC 13440 

ATTTATTTAG AAAAGGACTG GCTCGCACCG TTTGAACAAA CAACGGTGAT TGAAGCGATT 13500 

AAAGAACATC AACCAAAAAT TGTTGCGATG GTCCATGGCG AAACGGCGAA TGCACAAATG 13560 

CAACCACTGG ATCAAATTGG CGCCTTTTGT CGGGAAAATG AAATTTTCTT TGTAGTAGAT 13620 

ATGGTGGCGA CTTACGGCGG TGTTGAAACA AAGGTCGATG ATTGGAAGAT TGATATTGCG 13680 

GTAGCAGGCA CTCAAAAATG TGTCAGCGTA CCTTCAGGCC TTTCTTTAAT TACCTACAAC 13740 

CAACGTGTAG CCGATTATCT GGCGGGCCGT TACCAAAAAG AGTTAGGGCT TGGCGCGGAT 13800 

GCGCGGAATG AACGGTTCAT TCAAAGTAAC TACTTGGATT TATCGCAATT AGAAAAATAT 13860 

TGGGGAGCAG AGCGCTTAAA TCACCATACA GAAGCGACTA CAATGATTTA TGGGCTACAT 13920 

GAAGGGCTGC GCTTGTTACT TCAAGAAGGC ATGGAAAATG TCTATGCACG CCATCGGAAA 13980 

AACAATCGTA TTTTAGTTGA AAGTCTGCAA AAAATGGGcT TAGAAATTTT CGGGAAGTTA 14040 

GACACAAAAA CACCAACAGT CATTCCTGTG GTCATTCCAG AAGGAATTGA TGGTGAAAAA 14100 

GTGCGGAGCT TATTACTGGA TCATTTTAAA GTTGAAATTG CCTCTTCcTT CGGTGATTTA 14160 

AAAGGAAAAA TTTGGCGGGT CGGTAATATG GGTTACAGCA GTCGCGAAGA CAATGTGTTG 14220 

CATTTCTTAA GTGCCTTTGA GACTGTATTG AAACACCAAG GCTATCAATT TGAAGGCGGT 14280 

AGCGGTAGTA CTCATGCATT GGCTGAGTAT TTAAACTAAA AAAGAAGGTA GGACAAAGGT 14340 

CACTACGTAA AAAGCGTAGT GCTTTTGTTC CTTGTGAAAC GTTAACCAAA AAAATGATTG 14400 

GTGGAGGAAA AGAAAATGAC AGACAAGACA AAAAGTTCGT TGGCTTATTG GAAAAAAATC 14460 

GTTATTTTAC TATGTATGGG CTGGGTAACA ATCTGGATTT ATCGTTCCGC GCTTTCTCCT 14520 

GTTTATCCAC AAATTCGTGC GTCTCTAAAT GGTAACGCTT CCGATACGGC CCTAGGGTCC 14580 

ATTTCTAGCT TTTATTTTCT CGGTTACGTA GTCATGCAGA TTCCTGCAGG CTTCTTGGTA 14640 

GACAAAATTG GGAAAAAACG TGTATTAATT CCAGGCTTTA TCTTGTTTGC GCTGGCAGCA 14700 

TTACTGATTG CCCAAGCCCA AACGATTGGA ACAATTTACT TAGGTAGCTT ATTGGCAGGC 14760 

TTAGGATGTG GTTCTTTCTA TGGCTCGGCT TACTCGTTGA CGTCGCAAAA CATTCCTCAA 14820 

GCCAAAAAAA GTTTTGCAAC AGCTATTGTC AACAGTGGTT CAGCCGTTGG TTCAGGTTTA 14880 

GGGATGATTT TATCTAGTTA TTTAGTTGCC CAAAAAGGAT TACCTTGGCA AGTGATGATG 14940 

TACATTTCCG CCTTTATGAT TATTTGTATG TTAGTGGCTT TTCAAGTGAT TATTCGTAAT 15000 

CATAAAGAAG ACATGGCCTT GATTCAGCCA ACCGTTGTTG ATGAAAAAGA TAACGGAGCG 15060 

GTAAAAGAAA AAGTGCCCTT TAAAAAACTG TTTGCGCCAC ATATGCTATT TGCGTATATT 15120 

TTATATTTTG GCACTTGTTA TGCATATTAT ATGATTGTGA CTTGGTTACC TAATTTTTTA 15180 

AGTACCGAAC GTGGTTTCCA AGGAGCAGCA ATAGGGTTAT CCTCTTCACT GGTCGCATTT 15240 

GCCTCTATTC CAGGAGCGTT ATTTTTTAGT CGGTTGGCGG ATAAGTACAT GCATAAAAAA 15300 
GTTCAATTCA TCGTTGTTTT AGAATTTTTA GCAACAGCCA TGCTCTTGTT AACTGTACAA 15360 

GTAAGTAATG CGACGTTGTT ATTAATTGCT TTAATTATGT ATC-33TTCTT AGGAAAATTA 15420 

GCTGTAGAAC CGATTATTAT TTCTTGGCTA GGTGAAAATG CACCAAAAAT TGGAATTGGC 15480 

ACTACTTTGG GCGTGTTTAA CTTTTTCGGC ATGATGTCAT CTGT3ATTGC CCGAGCTTTA 15540 

ACTGGGAAAA TTTCTGACAC AACGGGTTCG AAAGTCATGG GCTTTTATAT CGCCGTCGTG 15600 

ATTTTACTAA TTGGCACGAT TCTTTTTATG CTTGTCAATT TAAAAAAACG CAGGGCAGAT 15660 

TCGTCTAATA AATAAAAAAA CAACGCAAAC TTATCTTTCG CTGAATAAGT TTGCGTtGTT 15720 

TTAgcTTTTT GACAATCATA TTAATCGCTT GTGCTAATCG CTCTTCTTGT tCTTTAGGAT 15780 

CTTTTTCAGG ATTTTCAAAG CAATGCTTTA AATTCTCCGC AACTATCATC CCCATTACGC 15840 

GATCAATACT AGAACGGACC GCACTCAACT GAGTGATAAC ATCGATACAT TCTTTTTCTT 15900 

CATCAATCAT TTTTTGAATC CCGCGAATTT GaCCTTCTGT 5CGSCGTAAA CGATTCAGCA 15960 

CTTTCTTTTT TTCTTCAGTC ATTTTGCCCC TCCTTAAGAG TAGTGATTGG TTTAAAATAT 16020 

ACCTACACGA GGTATTTTTG TCAACTATCT TTTCTTTGAA TCTAAAAATG GCAGAAATCT 16080 

CTTCAGTTGA CAAGtCTAaC TTTTtCATGT tAAATATmCC CCC-'AAGgTA TATAAGAGAA 16140 

GGAGAAAAaG rTGTTATTTC AATTGTTTAA AAAAACGCCA AGTATTTCrA CAAAGGAATT 16200 

GCAGTCACGT TTATCTAAAG AGATAACGTT ATTAGATGTC CGGACACCTA GCGAATACCG 16260 

CGCAGGGCAT ATTCCACAAG CGATTAATGT CCCACTTAAC AAAATTCCCG CCTACAACAA 16320 

ATCCGCTAAC GAAGTCTATG TGATTTGCCA GTCCGGCATG CGC.-.gCAAAA ATGCGGCCAA 16380 

AATTTTAGCG AGAAAAAACT ATCACGTGAT TAATGTTCGT GGCSGGATGT CCCAATGGTC 16440 

TGGTCAAATT AAAGGAGGAA AATAAATGAA AATTGTAATT ATCGGAGGCG TGGCTGGTGG 16500 

TATGTCAGCA GCGACACGTT TACGTCGTTT AATGGAAGAT 3CTGAAATTG TTGTCATGGA 16560 

AAAAGGCCCT TTTGTTTCAT TTGCAAACTG TGGTTTGCCT TACTACGTTT CTGGCGAAAT 16620 

CGCAGAAAGA GAGCAACTGC TTGTTCAAAC ACCCGAAGCG TTAAAGGCAC GGTTTAATTT 16680 

AGATGTTCGT CCTCACCATG AAGTCGTGGC TATTGATCCA ATAGAAAAAG TGATCACAGT 16740 

GAAACATGAA ACAGAAATTT TAACAGAACA CTATGACAAA TTAATTTTAT CACCAGGGGC 16800 

GAAACCTTTT GTGCCACCAA TTACAGGATT GGCAGAAGCC AAAAATGTTT TTTCATTAAG 16860 

AAATGTTCCT GATTTAGATC AAATTATGAC AGCCTTGACA CCAGAAACAA AACGAGCCGT 16920 

CGTGATTGGC GCAGGCTTCA TTGGCTTGGA AATGGCAGAA AACTTGCAAA AACGCGGATT 16980 

AGAAGTCACT CTCGTGGAAA AAGCGCCTCA TGTTTTACCG CCATTAGACG AAGAAATGGC 17040 

CGCTTTTGTC AAAGCTGAAT TAAGCAAAAA CAATGTTCAA GTAATTACGG GACAATCTGC 17100 

GGTTGCTTTT GAAGAAGAAG GGCAAGTGAT TCGCTTAGAA GACGGTCAAA CATTAGCTTC 17160 

TGATTTAACC ATTTTGTCGG TGGGGGTCCA ACCAGAAAAT ACCTTAGCAG TTGAAGCAGG 17220 

TGTAGCAACT GGTTTACGTG GCGGTATTGT TGTTGATGAA CACTATCAAA CGAATCAACC 17280 
CGATATTTAT GCGGTTGGGG ATGCTATTGT AGTGAAACAA CAAATCACTC AAGAAGATGC 17340 

GCTGATTTCT TTAGCTTCTC CCGCCAATCG CCAAGGACGT CAAGTAGCGG ATGTGATTGC 17400 

TGGGTTAGAG AGAAAAAATC AAGGAAGCAT TGGGACCGCC ATTGTGCGAG TCTTTGATTT 17460 

AACCGCTGCT TCAACTGGTT TAAGCGAACG GGCTGCTAAA GCTGCTGGAC TAACAACAGC 17520 

TGTTGTGCAT ATCAGTGGAA AAGACCATGC GGGGTATTAT CCTGGCGCAA CAGATCTTCA 17580 

GTTAAAATTA GTTTTCCATC CTACGACAGG GGAAATTTAT GGCGCACAAG GAATTGGGGC 17640 

AAAGGGCGTA GATAAGCGGA TTGATATTCT TGCGACCGCT ATTAAAGGAC AGTTAACTAT 17700 

TTTTGATTTG CCTGAATTAG AGTTTACCTA TGCGCCGCCG TTTGGTTCAG CGAAAGATCC 17760 

TGTGAACATG TTAGGCTATG CAGCGATGAA CCTTGTAGAA GGATTGAGTG AAAACGTTCA 17820 

ATGGTATGAG TTATCCAACG AATTAGCTAA TGGAGCTGTT TTATTAGATG TCCGTAATCC 17880 

CGCCGAACGA GCCAATGGTC AATTTAAAAA TGCTGTGTCT ATTCCTTTAA ATGAGTTAAG 17940 

AGAACGTTTG GAGGAATTAG ACAAGTCAAC GGAGTACATT GTTAGTTGTC ACAGTGGTTT 18000 

GCGTAGTTAT ATTGCAGAAC GGATGCTAAA ACAAGCAGGC ATCTCAGCCA AAAATTTAGA 18060 

TGGTGCTTTT GCGCTATATC GAATGGTAAA ACCGGAGGAA CTAGAAAATG TATAACACGA 18120 

TTGGCATGCC TGAATTTTAT CaGGAAGCAA AACGAAAAAA ATTACCGATT ATTGATGTAC 18180 

GAGAAGAAAT GGAATACCAA CTGGGGCATG TCCCTCAGGC CATCAACTTA CCTTTAAGTA 18240 

CTTTAACCGA AAATTATCAG CAATTAGATA AAAACCAACC CTATTATCTG ATTTGTCAAA 18300 

TGGGGTCGCG CTCCGCACAA GCGTGTGCGT TTCTCAGTCA ACAAGGGTAT CAAGTAACCA 18360 

ATGTTCTAGG TGGTACATCT GGCTGGTTGG GTCAATTAGA ACAATAAAAT AACAAAAAAC 18420 

GTTTATTCTT TTACACAGGA TAAACGTTTT TTTGTTAGAT AAAGATGTGT CAAAACTGTA 18480 

AGAGGAATGA TAGGAAGATG AAGGGACCAG TTTTTTTATT CATTATATAC TAAATAAAAT 18540 

GACAAACTCG TCAACAGAAA TCAAGGAGGA ACAAATAAAT GAAAAAGTCA GGCCAGAAGA 18600 

AGCGAAATAA AAAAGTGTGG TTCGGTATTG GTGCCGCAGT GGTCGTCGTC GGATTTATCG 18660 

GAGCAAAAAC AGTTTTTTCT TCAAAAGAAG TTGAACCTGA ATACACAACG TATACCATTA 18720 

CAGAAATGGC CTCATTAAAA TTAGATGGTC AAGTGAGTTT TCTAGATACT CGGGATATTT 18780 

TCTTTGATCC TTCTTTAGGA AAAATTGCCG AAATAAATGT TGAAAATGGT AAAGAAGTCA 18840 

AGAAAGATAG TCCATTGTTA ACGTACAATA ACTCAGACAT CCAAGCCACA GAAACAGAAC 18900 

AAGCGAATGC TGTTAATCGG AACAATCTTC AAGTTCAACA AGCACAAGAG AACGTTAACT 18960 

TAGCCACTCA AAAATACAAT GAAGCGTTAA ACAAAGTTGC TGCAGCCAAG CAAAAATTAA 19020 

ACACCGCAAA AGAGGCTGAA GAGAAAGAAA CACTAAATGC GGAAATTCAG CAATTGAATG 19080 

AAGCCGTTAG TGCTGCGAAT AGTGAAGTCG CCCAAGCTAA CCAAGCCCTT CAATTAGCCA 19140 

ACAGTGATGC CGTCGGAGCA GCAAACACGC TTGAACAAAC ACGTGGCAAA GTCAACACAG 19200 

TCGTTACAGC ACCAATTGAT GGCCAAGTCA CTGTTGATGC TAGTGCCATG AGTAGTACCG 19260 
ACAAACCAGT CATTAAAATT GCAACACAAA AGAAAAATAT TCAAGGAAAA GTAACCGAGT 19320 

ATGATTACGA TAAATTACAG ACAGGGGAAG AAGTAACAGT GACGACCGTT GGTTCAGGTA 19380 

AATCAGCACC AGGCAAAATT GTTTCCATTG CGCAAACCCC AATAGCTAAA AATGAAGGCA 19440 

ATCCAGTAGT GAGTTATCAA TTTACAGTAG AAGGTGATTT TCCTTGGGCA GAAGGCTTGT 19500 

CTACTAGTAT CGCTGTGCCA CAAAAGCAAA TGATTATACC GACAGCAGCA ATCCAAAAAG 19560 

AAGGCCAAAA AGAATTTvrVC TATGTTTACA AAGCGGGCAA AGCGAAAAAA ACACCGATTG 19620 

AAACAGAAAC AAATTTAGGT CGTAAAGTTG TCAAAAGTGG CTTGAATTGG AAAGACCAAG 19680 

TGATTGCCAA TCCCAATAAA GAATTAAAAG ATAATCAAGA TGTTCAGGTA GCTGCCAATG 19740 

ATTAAGCTAG CCCATATTCA TAAATATTAT TATTCCGAAG AAGAAACACT GCATGTGTTA 19800 

GATGATATTA ACCTCCAAGT AGACGCTGGA GAATTTTTAG CGATTATGGG TCCTTCTGGT 19860 

TCTGGGAAAT CGACTTTGAT TAATTTGCTA GGCTTTATTG ATAAAAAGTT TGAAGGAACG 19920 

TATTTATTTG AAGACCGTGA AATTGGCGAC TTTTCTGATA AGGAACTATC CCGAATTAGA 19980 

AATGAAGCAG TCGGCTTTGT CTTTCAGAAT TTTAGTTTAA TTGAAACACT AACAGTTGAA 20040 

GAAAACATTG aATTACCTCT TTTATATAGT GGGTTAACCC CAAAAGAAGC CAAAGATCGT 20100 

GTCCACGAAG TTCTGACAAA AGTCGGCTTG CCAGATAAAG GCAAGAAGCA TCCAAAACAA 20160 

CTTTCGGGCG GACAACAACA ACGTGTTGCC ATTGCGCGTG CGATTGTCAA TCGGCCAAGT 20220 

TTTATTATTG CAGATGAACC AACTGGGGCT TTAGATAGTA AAACATCGGA AGAAATTTTA 20280 

ACGCTCTTTC AGCAATTAAA CAATGAAGGT GTCACCATTA TTTTAGTGAC ACATGATGAA 20340 

GAAACAATTG AATACTGCAA TCGTTTGATT AAAGTTCGTG ATGGAAAGAT TTTAGAGGAG 20400 

GTGCTGACAT GAAACGTTCA ATTATTTGGA AAACCGCCTT CCGTTCCATT TTGAAAAATA 20460 

AGCGACGCAG CTTGTTAACG ATGCTAGGGA TTGTCATCGG GATCGCTTCG GTGATTACCA 20520 

TCGTGGCTAT CGGTAACGGC TTTAAAGAAG ACATGGTCGA TAAATTATCT GCTGAAAAAC 20580 

AAAAAGAAAA TGTGAAAAAG ATTTCTTTTT CGGCCTACAA CACTTCGGAT ATGTTTAGTG 20640 

ACCAAGCGAT GTTTACCGAC AATGATTTAG GGGTCGTGCG CATGGTTCCT GGCGTAGAAA 20700 

AAATTGATTT TGATCAAAGA GAAGTTGATG GCGCCCAAAA AGGTACACTG AATTTTCAAG 20760 

CAGGACCGAA AAATCTTTCT GTAACCTATG AACTTGCCAA CAAAACGACC AAAGAATTGT 20820 

TGGCGGGACG CCAACTGAAT GATGGAGACA ATGCAAAACT GGATAAAACA GTGGTGGTGG 20880 

ATGAACTGGT TGCCAAAGAA CTCTACCAAG ACTCGACTGC AGCAGTTAAT CGAACTTTTC 20940 

CATATAAAGA GCAACTTTTT ACCATTGTTG GCGTGACAAC CAATACCAGC GGTGCCATTG 21000 

GTCCAGGTAA TGATGACTCA TTGCTTTATT TTCCCAAAAA GACCTATGAA CATTATTTCG 21060 

GCAAGCTAAA AGATACATCT ACGTTGAAAC TAACAGTAGC ACCTGGCTAT CAACCAGATC 21120 

AAGTATTGAA AGAAACAATA AAAACTCTCT CTCAACAAGG AACCATGAAA AACAGTGGGA 21180 

CGTATCAAGA ATATAATGTT AAAGATACCA TCAAAGAAAT GGGCTCTTTA TTAAATAATT 21240 






TAACCTTGTT 


TATTTCCGCA ATTGCGGCTA. TTTCACTGGT 


AATTGCTGGG 


GTCGGCGTTA 


21300 


TGAATATGAT 


GTACATTTCG 


GTCTCAGAAC 


GGATTAAGGA 


AATTGGTATC 


CGACGTGCAT 


21360 


TGGGCGGGAC 


AGCCAGTGAT 


ATTAAAAAGC 


AATTCTTAAC 


AGAAGGGATT 


GCCTTGACCT 


21420 


TAATCGGCGG 


AATTACTGGT 


TATCTTTTAG 


GAATGATTAT 


TGCCTACTTA 


GCCTCAATGG 


21480 


CACTACCGTT 


TTCTGTTCGA 


CCAGATTTAA 


TGACCGTTTC 


AT TAG CC ATT 


GGGATTTCTG 


21540 


TCTTTATCGG 


GGTTGTATTT 


AGCTATTTCC 


CTGCCTCGGC 


AGCTTCAAAA 


AAAGACTTGA 


21600 


TCGATATTAT 


GAAATAAATA 


TAAACAGCAA 


AGAAACAGCC 


ATTTTTGGCT 


GTTTTTTTTA 


21660 


GTGAAAGTAG 


AAAAAAATCC 


AAAGATTTCA 


AGAGAAAGTA 


GCTAACAAAC 


GTCAAAAGGT 


21720 


TGATATTAGC 


TACCAACCTT 


TTGACGTTTG 


TCAGTTAAAT 


AAAAAAGAAA 


AAAATTAAAA 


21780 


AAATAGCACG 


TTGAGGATTA 


AATAACAAAC 


ATTTTTTAAA 


TAAGACTATT 


GGAAATCCTT- 


21840 


TATAAATCGG 


GGATTACAGG 


AATGACTGTT 


AATGACTGAT 


TTGCTTGTAA 


ATTTCTAAAT 


21900 


TTTAATTCGA 


AAAATTTCCA 


GTATACTAAA 


TATATCAAAA 


CAAAACACGA 


AAATTGAGAA 


21960 


AATTTTGAAC 


CAAAAAAGAA 


TTAAGGAGGA 


ATCAAGAATG 


AAAATAATGG 


T CATC AC AAA 


22020 


AAATATTTTA 


TCAGAAACTG 


CTTTTCAAAA 


CCGCTTACAG 


CATTTAGATC 


ACCAAGTTTT 


22080 


TTGTACGACG 


TGTCTGATTG 


GTGAAATTAA 


GGAAAGTCAT 


TTGTTTTATC 


GAAAATTTGA 


22140 


TGCCATCATT 


CTCAGCGAAA 


CGATTCCTCG 


CAAAGAACAA 


TATTTATATC 


TAAAGGAATT 


22200 


AGCCAGAGTT 


AAAACGCCGT 


TGTTTCTTCG 


CTCGGAAGAC 


ATTCCTACCT 


CGGAAGAATT 


22260 


GAGCATTAGC 


TATCAATATG 


GCATTCTCAA 


TTGGTTACGA 


TTGGATGCGA 


CTTTGGATAC 


22320 


GTTAAGAGAA 


AATCTCCTTT 


TGGCAGAATT 


AAATAAACCC 


ATCCAACGAG 


AAGATGAAAA 


22380 


GTTAGATGTA 


GAGGGAATAA 


AGCTAACAGC 


CGCAGAACGT 


GCACTACTGG 


CTGTCTTGTA 


22440 


TAGTACGCCA 


AAAGAACACT 


TACCAATGGA 


GCGCCTGTGT 


AAATTAATAT 


ATGACACAGA


22500 


AGAAACAGCC 


AATAATCGAA 


AAAAAATGAC 


CAAACAGCTG 


ACAAAATTAA 


ATAAAAAAAT 


22560 


TCAAGTAGGG 


GGCTTTAAAA 


AACAAGGTAT 


CCAAAATACA 


CGGTATAAAG 


GCTACAAACT 


22620 


GCATCCGCAg 


TTGCTTCCTT 


ACATTGGCGA 


ATTATTTACT 


TAAATTAGAG 


AAGCACTTGA 


22680 


CGAATGAAAA 


TCACCATCAC 


AGGATGACCT 


CTAAAAACCA 


TGCTATACTA 


ATACGGAATA 


22740 


GTATAGGAGG 


TGGGTCTGTA 


TGAAAGACGG 


ACTATATTGG 


GTACAACAAT 


TAAAAGAACA 


22800 


GAAAGTAAGT 


CGCATAGAAC 


TCTTAAAGGA 


TATTGAGAAA


AAAGTCCAAA 


AGTTAAATCC 


22860 


AAAGATCAAT 


GGCTTTGTAA 


CCTTTGATAG 


CCAAGAAGCA 


GAAGCAACTT 


ATCAAAAAAA 


22920 


TAAACAGCAG 


GACACACTAT 


TTGCTGGTTT 


GCCATTTCCA 


TTGAAAATGT 


TAGGCCAAGA 


22980 


AAAAGCGGGG 


TGGTCAGCAA 


CGGCCGGCTC 


GCAACTTTTT 


AAAAATAGTA 


CTGCAACAAC 


23040 


GACCTCTAAC 


TTTGTCAAAC 


AAGCAGAAAC 


GATTGGCTTG 


ATGCCTTTAG 


GACAAACCAA 


23100 


CGCTCCAGAG 


TTTGGCTTTA 


AAAATATTAC 


GGACCCTGCG 


CTTTACGGGC 


CAGCCCGCAA 


23160 


TCCTTGGAAC 


CTGGAACATT 


CGCCAGGCGG 


TTCCAGTGGC 


GGCGCTGCCG 


CAGTTGTCGC 


23220


TAGCGGGATT


GTTCCTATAG 


CCGGCGCTAG


CGATGGCGGT 


GGfcTCCATCC 


GAATTCCCGC 


23280 


CTCTTTTAGT 


GGCTTAATTG 


GCTTAAAACC 


TAGCCGTGGC 


TCCATGCCTG 


TCGGCCCAGA 


23340 


AGGCTGGCGA 


GGCTGGCAAG 


GTGCCTCGAT 


TGACTTTGCG 


TTGACTATTT 


CTATGAGAGA 


23400 


CACCAAAGCG 


TTATTTTATG 


GCCTACGTGG 


TAGCCACAGT 


GGCGCCCCCT 


ACCAAGCGCC 


23460 


CTTGGCAGAA 


TGGCAAACAC 


ACCCTAAAAA 


ACAACGGCTA 


AAAATTGCTC 


TGTGTAcCGc 


23520 


TTCgcCTATC 


GGCTCACTGA 


TTTCGCCAGA 


AGCAACACAA 


GCGGCGAAAC 


AAGCGGCTGA 


23580 


TTTTCTGGCA 


GCCGCTGGCC 


ATGAAATCAT 


TGAGATTCCT 


TATCCAGTCG 


ATGGTGCAGC 


23640 


GCTGATTCGC 


TCCTATTATC 


AAATGAACGG 


CGCAGAAACT 


ACAGCGATGA 


TGAACAGCAT 


23700 


TCAACAGGGG 


CTCGGGCGCC 


CAATTCGTAA 


AGAAGAAATC 


GAATCTTTTT 


CATGGACCAT


23760 


GCATCAATTT


GGCCAAAAGA 


TTCCAGCAGC 


CACCTATGTT 


CATTCATTAC


AGTTGTGGGA 


23820 


TCAAGCAGCA 


GTAACGATGG 


AAGAACTTTT 


TCAAAAATTT 


GATCTGTTTT 


TATCACCAAC 


23880 


AACGGCTTTT 


TCAGCACCAA 


AAATTAACGA 


AGATTTGCAA 


AGTGAGCACA 


TTCGTCAACG 


23940 


AATGGCGCAA 


GCAGCAGAAT 


TAACAGAAGC 


AGAATTAGCT 


GAATTAATCT 


ATGATTATTT 


24000 


TGATAAAAGT 


TTACAGCTAA 


CACCGTATAC 


TCAACTAGCT 


AACTTGACAG 


GACAACCAGC 


24060 


CATCAGCTTG 


CCCACACATG 


TTACGGCAAC 


GGGGTTACCC 


TTAGGCATTC 


AATTGCTGGC 


24120 


TGCTCGTGGA 


CGTGAAGATC 


TCTTATTCCA AGTAGGCGAA 


CAGTTTGAGC 


AAGAAGGCAA 


24180 


ATTTAAATTA 


CCAGAAAGTT 


ACCGCTAAAT 


AGGGTGTTAT 


TGTGACCAAA 


TTTCAGATAA 


24240 


AATAGACACA 


AGACAAAAAA 


CTGAGGTGGA 


AAAATGCTGT 


ACGTAGTGAG 


ACATGGTGAA 


24300 


ACAGATTATA 


ATGTGGCCCG 


TAGAATTTGT 


GGTCATGCGG 


AAGCGCAATT 


GACCGAAAAA 


24360 


GGCTATCAAC 


AAGCGGAATT 


GGTAGCTGAA 


AAAATTGCCA 


AACAAGGAAT 


TCAAATCGAC 


24420 


CGTTTACTGG 


CGTCGCCTTT 


AAAAAGGGCG 


CAAGAAACAG 


CCCGTAAAAT 


TGCAGAACGA 


24480 


AATCAATTGA 


CGATTGAAAC 


AGAACCTCGT 


TTAATTGAGA 


TGAATTTTGG 


CATTTATGAT 


24540 


GGTGAACCGA 


TTGAAACAAC 


GGCTTTCCAA 


GAAAATCGTT 


CTGAGATTTC 


CTTGCCGTTT 


24600 


CCAGAAGGGG 


AATCAGTCCT 


AGATGTCGCG 


GGTCGAATTT 


ATCCCCTGAT 


TGAGGAAGCC 


24660 


TTGGCTTCAG 


AGGAAACTTA 


CCTATTTGTT 


TGCCACAATG 


CGGTCATGCG 


AGTCATTGAT 


24720 


AACTACTTCA 


ATGGTAAAAA AATCCAAGAT TTCTTGGACT 


TTCATTGTGA 


AAATACACAA 


24780 


CTTGTACAAT 


ACGGAGAATA 


AAAATCAAGT 


CTTTAATCTT 


TTGGTTAAAG 


ACTTGATTTT 


24840 


TTTTATGGTA 


AATAAGAAAC 


CGCTTCCGTT 


TTAATGTATA 


ATAAATTAAC 


TTTTGGATCA 


24900 


AAAAAGAAAT 


GGGGAAAAGA 


CTATGAGAAT 


TGATAATATT 


CATCAAGCGT 


TAAAACAGTT 


24960 


TCACGTACAA 


AATCCAAGTA 


GCAAAGAAAC 


CTTGTTGCCC 


GCAACGGcTT 


CGCAATTAAG 


25020 


TCAGGCCTCT 


GGCTATACCC 


GAACCGCTGT 


TAGTGAAGAG 


CTTTCAGAAT 


TAGTGAGAAA 


25080 


AGACCAAGTC 


GTTAAAGTTA 


AATCAAGACC 


GGTGTTATTT 


TTTGATAAAG 


ACTACTTAGA 


25140 


GGAACAGTTT 


CCTTTTTTTA 


AAAGTAGCAC 


GTGCTATGAT 


TCCCTCGcTT 


CGCTGCAAGA 


25200


GGACATTTAT


CAACAGACAG 


AACAGATACA 


TAAGAAAAAG 


CAGCTACTGA AAATGAATCC 


25260 


ATTTGCGAAC 


GTAATTGGTT 


TTGATGGGAG 


CTTAGCAGAC 


GCCATCAAGA 


AGTTGAAAGC 


25320 


CGCTATTTTA 


TATCCGCCTA 


ATGGCTTGAA 


TATTTTACTT 


ACTGGAGAAT 


CTGGGGTGGG 


25380 


GAAAACCTTA 


ATTGCAGAAC 


AATTACACCA 


ATTTTATCAA


GTTAAAATGA 


ACGAAGAAGT 


25440 


TCCGTTTATT 


TATTTTAATT 


GTGCAGAATA 


TTTCAACAAT 


CCAGAATTAT 


TAACTTCGCA 


25500 


TCTGTTTGGC 


TATAAAAAAG 


GCAGTTTCAC 


TGGTGCCGTC 


AATGACCAAA 


AAGGTTTAGT 


25560 


GGAGTTGGCA 


GATGGCGGTT 


TTTTATTTTT 


AGATGAAGTC 


CATCGATTAA 


CGAGCGAAGG 


25620 


CCAAGAAAAA 


TTGTTTACAG 


TTTTGGATAA 


AGGGTATTTT 


ACACGGATGG 


GCGArCgAAA 


25680 


AACAAAGAAA 


TGTATCGATA 


AAATTTATTT 


TTGCAACAAC 


CGAAGCAGTC 


AATCAACATT 


25740 


TTTTGAAAAC 


GTTTTTAAGA 


AGGATCCCTG 


TGACATTAGC 


CATTCCAGCT 


TTAGCCGAGC 


25800 


GCTCTTTGAA 


AGAAAAAATT 


CAATTGATTT 


TGTCCTTTTT 


CTTAGAAGAA 


AGTCAACGGA 


25860 


TTCAAAAGAA 


TATCCATATT 


CGTTATGAGG 


CGTTGAAACA 


ATTAACATTT 


ACTCATTTTG 


25920 


AAGCCAACGT 


AGGCGAATTG 


AAAGCGGCGA 


TTCAATTTAT 


GAGCGCACAA 


GCCTATTTAG 


25980 


AGGTGTTCGA 


TCAATTAACG 


GAAACTATTT 


GCATTGAAGC 


AAACGAGGGG 


ACAGTCAAAG 


26040 


AGCAACCTGA 


TGTAGAAAGT 


CTGCfGGCTT 


TTAATGAGTG 


GATACCTAAC 


GATGGCCTCA 


26100 


CGATTCAATA 


CAACCATTTT 


TATTTAAGCA 


GTCAAGAGTT 


ACTTAAATCA 


GAAAGTCTGA 


26160 


AAAATGATCG 


TTTTTATAAT 


TTTCTATTGA 


AAGAATATGC 


GGATTTGAAA 


AAATTAACTC 


26220 


TTTCGACAGA 


GGATGCGCTA 


GCAATCATGC 


AAAACAAAGT 


CGAGCAACTT 


TATGATATGG 


26280 


ATATTTTTGC 


TCAAGAACCG 


CTAGCCATAG 


AGGCCATTGA 


CGAAACATTG 


CAGCCAAAAA 


26340 


TTCAAGAAGC 


CATTGTCTTT 


TTTGAACAAT 


TAATCCAAGA 


AAAATTGACA 


GAGGATTTCC 


26400 


AGAATATTTT 


AATTGTCCAC 


CTTTATTCTG 


CTGTTTTTTA 


TTCTGCACAA 


ATGAATGAAA 


26460 


CATTTTATTC 


AGATACCAAA 


AATATCTATT 


CTGGCAAAGT 


CACCCAGTAT 


CAAGCTGCTA 


26520 


AGACCATGGT 


GGACTTTCTA 


CAAACATTGT 


TTGGGATTTC 


TTTTGCTGAT 


ACGGAAATCC 


26580 


TCTTTTTCGA TCTATTTATT 


CGTAAAGTCA 


CAGCTAAAAA 


AAGAGCGCAC 


TCAGATGAAG 


26640 


AAAACTGTGG 


CGTTATCGTC 


GTTGCACATG 


GTGCCGCGAC 


GGCCAGTAGC 


ATGGTTGAGT 


26700 


ACACCAACCT 


TCTGTTTTCA 


ACAACGGTCA 


TGCGTGCAGT 


TGATATGCCG 


ATTAATCAAT 


26760 


CAGTGGAAGA 


AACATTAGAA 


AAAGTCCGTA 


CAATTGTAAA 


GCAAAATCAG 


TATAAAAAAA 


26820 


TTATTTTAAT 


GGTGGACATC 


GGTTCCCTGG 


TTTACTTTGG 


TAATGCTATC 


AGTCAAAAGT 


26880 


TTCAAAGGGA 


AGTGCTGCTA 


ATTAGCAATA 


TCAATCTATT 


AACATTGCTA 


GAAGTGGCCC 


26940 


GAGAAGTCAT 


TTATGAATCG 


ACGGCGTTTG 


AGTATCTATT 


ACCAGTCTTA 


AAAGAAAAAA 


27000 


ATCATCGCGC 


CTTGATTTGC 


AGTAAAGGTC 


GCTTCAACGA 


GGCAAAAGTT 


CTCATTATTT 


27060 


CCTGTCTAAC 


TGGCATGGGG 


ACTGCTATTA 


AAATTGAACA 


GTTGTTACTA 


AAAACTTTTC 


27120 


CCAATGAATT 


ACTAACGAAT 


GTACGCATCA 


TTACTTTAGA


GAAAAGAGAA


GTGGAAGACA


27180


TTGATCGCTT


GCATTCTTTT 


GTTCGGAAAG 


AAGAGAAAAT 


CGTTGGTATC 


ATTGGCACCG 


27240 


TAAAGACAGA 


AGTGCCAGAT 


ATTCCGTTTA 


TTTCTTTACA 


AGATTTACTA 


TCGAAGCGGG 


27300 


GCATTCAGCT 


AGTTTTCGAA 


TTATTAGGCC 


ATCATTTAGA 


AGAAAAACAA 


AACCAAGAGT 


27360 


TACTAAAAAA 


AGTTTCAGCT 


AAATATATTC 


AAGGATTATC 


AATCGAAGCT 


ATTACAGACT 


27420 


TACTAACGGT 


TTTGAATCCG 


ACAACCATTT 


CAATCGAATT 


AGCTAAGATT 


TATTCAGACA 


27480 


TTTGCTTGCA 


AACTAAAATA 


AAAAGTGACG 


AAAAGATTTT 


ACTTCGTTTT 


ATCATTCATT 


27540 


GTGCTTGTAT 


GTTAGAACGG 


CAATTGTTGA 


ATCCAGAATA 


TGACCCGACA 


GCTTATGaAG 


27600 


TGTCaTATCG 


AGAGTTGCCA 


GAAGCTGTAT 


CTGTCATAAA 


AATGGCGTTT 


CGACCGCTTG 


27660 


AAGTGTCCTA 


TAATCTGTTG 


ATTCCACCTT 


TGGAAGTGCG 


CTATATTTAT 


GAATTGCTCT 


27720 


TTGAAAAGGC 


TTAATTCAAA 


CGGAAATGAA 


AGTTGGCACA 


CTTATTGCTT 


TTAATAAAAT 


27780 


GATTAATTAG 


CAAAGGAGTT 


TGTTTATGGA AAATGAACCA ATTTTACTTT 


TAACACACAA 


27840 


TGGTTGGGGC 


TTGGAATTAG 


TCAAAAGCGT 


CCAGATGATT 


GTTGGAGAAG 


TCACAAATGT 


27900 


ACATGAAGTT 


GCATTACAAG 


CAGAGGATTC 


ACTGGGGGAT 


TATTTAGAAC 


GAGTCACAGA 


27960 


AAAAATTGAC 


CAATTAACGT 


GGCACAACCA 


GTTGCTAATT 


TTAACAGATG 


TTAAAGGCGG 


28020 


CACACCTAGT 


AACGTTGCAT 


TACGTTTATC 


AAAAGATTAT 


GACCTCCTGA 


TTGTGTCAGG 


28080 


ATTATGCACA 


TCGTTATTAT 


TAGAATCCGT 


CATGAAACAA 


AGTGGTCCTG 


GATTTCGTTT 


28140 


AGAAGATGCA GAGATGATTC 


AACAAGCAGC 


AGTAGACAGT 


TGTCAAATTT 


TAGAAATTCC 


28200 


AAAATAAGAA 


AGTACAGGTG 


AACAGAATGT 


CAAAAATCGT 


ATTAACAAGA 


GTCGATAGCC 


28260 


GTTTAATCCA 


TGGACAAGTA 


GTGACTAAAT 


GGTTACAACA 


ATCAGGTGCA 


AATGAAATTT 


28320 


TGGTAGTGAG 


TGATGAACTA 


GAGCAAGATG 


AATTTTTACA 


AAGTATTTAT 


TTAATGGCGG 


28380 


CGCCTCCAGG 


GGTTTCTGTA 


GTAATTAAAG 


GGATTGAAAG 


TGCCAAAGAA 


TATTGGGAAT 


28440 


CGCAAGAAAA 


GGAAGAGAAA 


AAGGTGTTAT 


TTTTAGTGCC 


AGATTTAACG 


ACCTTAAAAG 


28500 


CGATGGTGGA 


CCAAGAAATT 


ATTAAAGAGG 


AGATTCAAGT 


TGGTGGCTTA 


GGTGGCGGAC 


28560 


CAAATCGGAA 


AAATGTTTTA 


AAAAATATCA ATTTATCTGA 


AGAAGATGTT 


GCAATTTTGA 


28620 


CTGAACTTTT 


AACGAAGGGC 


ATCCATGTTT 


TCTTCCAAGC 


GATTCCCGAA 


GAAAATCCCC 


28680 


TACCAATTCA 


AAAATTAATT 


GAAAAATATC 


AATCACTATA 


ATCAGAGAAA


AAGGAGAGAA 


28740


TCATATGGGC 


GTTTTAGGTG 


TTTCAATTAT 


TATGGGACTT 


TATTATTGGT 


TTGCACGTTT 


28800 


GCGTTTTGGG 


TACACATTTT 


CAGGCATGCT 


GTCTCAGCCA 


TTAGGCGCAg 


cGTTGGTGGC 


28860 


TGGTTTAGCA


ACAGGAAAAC 


TTAGTGAAGC 


AATGGTTGTG 


GGGGCGGGGA 


TTCAACTTGT 


28920 


TTATCTCGGA 


GTCACCTCCA 


CACCGGGCGG 


CAATGTGCCT 


AGTGATCCGG 


CTTTAGCTTC 


28980 


AGCAATCGCG 


ATTCCGATTG 


CTTTAGGTGT 


GGGAATGAAT 


GCCGAGGCGG 


CTGTTGCACT 


29040 


AGCTGTTCCT 


TTTGGTGTGT 


TAGGCGTCTT 


TATGGATCAA 


TTACGGCGCA 


CCATCAATGC 


29100 


GACGTGGGTG 


CATATGGCTG 


ATAAATATGC 


GGAAGAGTTA 


AATTATGGGG


GGATTTACCG 


29160


AGCTGCGTTT


GTTTATCCAG 


CTTTAGCGGG 


ATTTTTAATT 


CGCTTTCCGT 


TGGTTTTTGC 


29220 


GGCGAACTTT 


TTTGGTCAAT 


CTGTGGTGAA 


TAAAGTCTTA 


GAAGTTATCC 


CAGCATGGTT 


29280 


GATGCACGGC 


TTTGAAGTAA 


TGGGTGGTAT 


TTTACCAGCG 


TTAGGCTTCG 


CCTTAACAAT 


29340 


TATGGTAATT 


GGCAAGAAAA 


ATTTGATTCC 


TTATTTTTTA 


ATTGGTTTTG 


TTGCAGTAGT 


29400 


ATATTTTGGC 


GCAGAAGTTA 


TGGCCGTCGC 


TATTTTAGGA 


ATCTGTATTG 


CCTTTTTAGT 


29460 


TAGAAACAAG 


GCATTGAGTG 


AGGAGGCAGC 


ATAATGGAAC 


AAATGACACA 


AGCAAATGAA 


29520 


CAACAATTGG 


AACAGAAAAA 


AATTACTAAA 


AAAGAATTAT 


CCAAAGCCTT 


TTGGATTTAT 


29580 


CAACTAGGCT 


GTGAATTATC 


AAATTCTTAT 


GAACGTCTCC 


AAAGTTTAGT 


TTTTTGTGCA 


29640 


TCGATGATTC 


CTGCAATTAA 


AAAACTTTAC 


GCAGACGATG 


AAGAACAACA


AAGAGAAGCT 


29700 


CTGAAACGTC 


ATTTGAATTT 


TTTTAATACT 


GAAGGAACAG 


TAGGGGCTTC 


CATTCAAGGG


29760 


ATTGCAATTG 


CTATGGAGGA 


AGAAAAATCC 


AATGGCGCAg 


cGATCTCTGA 


TACGGCGATC 


29820 


ACATCGATTA 


AAACTGGGCT 


GATGGGTCCA 


CTGGCTGGGA 


TTGGCGATTC 


GATTATTTGG 


29880 


GCAGCATTAA 


TGCCGTTAAT 


CATTTCCATT 


TTTATTCCAA 


TGGCTAAAGG 


AGGTAACGTG 


29940 


ATTGGAAGTA 


TCGGTCCGTT 


GGTTCTGTAC 


ACAGCCATTA 


CATTATACAT


CAGTTGGACG 


30000 


TTAGTGAATA 


AGTCCTATAC 


ATTAGGGCGT 


AATTCAATCT 


TAAGTTTGTT 


AAAAGACGGC 


30060 


AAAATTAAGC 


AAGTCATTTA 


TTCAGCAAAT 


GTTTTAGGAA 


TGATGATGAT


GGGCGCTTTA 


30120 


AGTGCGAGTT 


ATGTTAAAAT 


TGCTAGCCCG 


ATGACGTTTA 


AAGTTACTGG 


CGGTGCCACG 


30180 


ATTGTTTTAC 


AAGATATCCT 


AGATCAAATT 


ATGAAAGGGT 


TACTTCCTTT 


AGCAGCAGTC 


30240 


ATGGCTATTT 


ATTTCTTCAT 


GGTAAAAAAA 


GGCCCACGTT 


ACGGTATCAT 


TATCGGCACG 


30300 


ATTGTCTTGG 


TTAGTTTGGT 


CACTTCATTC 


TTTGGTTTGT 


TATAAAGGAG 


AAAATAATGA 


30360 


ATAGTTACGA 


AAAATTCCAC 


TTAAAAGAAG 


TCATTAATGC 


TTCTGGCAAA 


ATGACCATTC 


30420 


TAGGTGTCTC 


CAAAGTTTCG 


GAGGCAGTCC 


TAGCAGCTCA 


ACGATTCGGC 


GGCGAACATT 


30480 


TCTTTGAAAT 


GAGTGAACTT 


AGTGTGCAAA 


CAGGGGCCTT 


TTTAGCCAAC 


TTGTTGAAGG 


30540 


TAGAAGATGC 


CCAAATCGTT 


TCCTCGGCTT 


CGGCGGGGAT 


TGCCCAATCG 


GTGGCTGCCT 


30600 


TGATTGGAAA 


AGGCAGCTTG 


TACCACGCGT 


ATCATCCCTA 


TACAGAAAAA 


ATCGAACAGC 


30660 


GGGAGATTGT 


TTTACCTAAA 


GGGCACAACG 


TGGACTACGG 


CACGCCTGTA 


GAAGTCATGG 


30720 


TAGCGCAAGG 


TGGCGGCCAA GTGGTGGAAG CTGgcTACGC CAACATGTGC 


AGTCCAGAAC 


30780 


ACGTGGAAAT 


GATGATTTCT 


GAAAAAACAG 


CGGCCATTCT 


ATACATCAAA 


AGCCACCATA 


30840 


CCGTGCAAAA 


AAGTATGCTG 


ACGGTGGCCG 


AAGCGGCCAA 


AGTGGCGCAA 


CGCCACAAGG 


30900 


TGCCGCTAAT 


TGTTGACGCG 


GCAGCAGAAG 


AAGATCTTTT 


CAAATACACT 


GAAGCAGGCG 


30960 


CTGATTTGGT 


GATTTACAGC 


GGCGCCAAAG 


CAATTGAAGG 


GCCAAGTGCC 


GGTTTAGTGG 


31020 


TCGGGAAAAA 


AGAATACATT 


GACTGGGTGC 


GCCTACAAGG 


CAAAGGCATT 


GGTCGAGCAA 


31080 


TGAAAATTGG 


CAAAGACAAT 


ATCTTGGGCT 


TTACGCAAGC 


AGTGGAAGAA 


TACTTGGCAC 


31140


ATGGCAGTGA


ATCAGGGGCT 


TCGATGCAAG AGCGCTTAAA ACCGTTTGTT


GAAGCCATCA 


31200 


ACAACTTATC 


AGATTTAACC 


GCCAAAATCG 


TCCAAGATGG 


CGCTGGCCGA 


GACATTTATC 


31260 


GCGCCAGTGT 


CAAAGTGGAC 


GGCAGAAAAA 


CCGCTAAGGA 


AGTCATCCAA 


GCATTAAAAG 


31320 


CAGAAAGTCC 


AGCCATTTAT 


ACGCGAGAAT 


ATCAAGCCAA 


CAATGGCATC 


ATTGAATTTG 


31380 


ACATTCGTTC 


TGTCAATCAA 


GAAGAAATGA 


ACAAAATCGT 


GCAACGATTA 


CAAGAAATTA 


31440 


TGGACAAAAA 


GGAGAAATAA 


GTCATGTCAT 


TAAAACCTAA 


TTATTTAGAA 


GAACGCATTT 


31500 


GTTTAAACGT 


CTTAGCCAAT 


TCAGTAGAAA 


ATGCCCAAGC 


CTGTTATGAA 


GCAGCAGAAG 


31560 


GGCATGTTGT 


GTTAGGCGTC 


CTTTCAAAAA 


ACTATGAAAC 


CGATGAAGCA 


GCTATTGACG 


31620 


ACATGAAAAA 


ATACCAAGCA 


GCAACCAACA 


ACGCTTTATC 


TGTCGGCTTA 


GGGGCTGGCG 


31680 


ATCCTAATCA 


AAGTCAAATG 


GTGGCTCGCT 


TATCAGAAGT 


TCTACAACCG 


CAACACGTGA 


31740


ACCAAGTCTT 


CACAGGTGTC 


GGcgCtTCGC 


GGGCCTTACT 


TCGTCAAGAC 


GAAACAGTGA 


31800 


TTAATGGCTT 


GGTTTCACCC 


ACAGGCAAAG 


TTGGCTATGT 


CAACATTGCG 


ACAGGTCCGT 


31860


TAAGCTCTGG 


TGCGCCAGCG 


GCGGAAGTGC 


CTATCGAAAC 


CGCCATCAAA 


TTGTTGAAAG 


31920 


ACATGGGGGG 


CAGTTCCATT 


AAATATTTCC 


CGATGAAAGG 


CTTGGCTCAC 


AAAGAAGAAT 


31980 


ACCAAGCAGT 


AGCAGCAGCT 


TGTGCCAAGT 


ATGACTTTTA 


TTTAGAGCCA 


ACAGGAGGCA 


32040 


TTGATTTAGA 


AAACTTTGAA 


GAAATTGTTC 


AGATTGCGGT 


GGATGCCGGC 


GTTAAAAAAA 


32100 


TCATTCCTCA 


TGTCTACAGC 


TCAATTATCG 


ATCAAGAAAC 


AGGGGACACT 


AGAACAGAAG 


32160 


ACGTGAAAAC 


GTTACTAACC 


ATGATGAAAA 


ACACATTGAA 


TAAGTAGGTG 


GCAAAGATGA 


32220 


AAATTGCTGC 


GTTTGGTGAA 


GTCATGCTtC 


GTTTCACTCC 


GCCAGAGTAT 


CTAATGTTAG 


32280 


AACAAACAGA 


GCAGTTAAGA 


ATGAACTTTG 


TAGGGACTGG 


CGTAAATCTC 


TTAGCAAATT 


32340 


TGGCTCATTT 


TCAACTAGAA 


ACAGCATTGA 


TTACGAAATT 


ACCTGCCAAT 


CGTCTTGGTG 


32400


AAGCTGGCAA 


AGCAGCGCTA 


AGGAAGTTAG 


GAATTTCAGA 


TCAATGGGTA 


GGAGAAAAGG 


32460 


GCGATCATAT 


TGGTAGTTTT 


TTTGCAGAAA 


TGGGTTATGG 


AATCCGCCCC 


ACACAAGTAA 


32520 


CTTATCAAAA 


TCGCCATCAG 


AGTGCTTTTG 


GCATCAGCGA 


GGCGAAGGAC 


TACGATTTTG 


32580 


AGGCCTTTTT 


AGCGGAAGTA 


GATATGGTGC 


ATATTTGTGG 


TATTTCGTTA 


AGTTTAACTG 


32640 


AAAAAACTCG 


TGATGCTGCT 


TTGATACTTG 


CTCAAAAGGC 


GCATGCCTAT 


CAAAAGAAAG 


32700 


TTTGTTTTGA 


CTTTAATTAT 


CGACCTAGTT 


TGAATACAGC 


AAATAGCGCA 


CTCTTTATGC 


32760 


GTCAACAG 












32768 


(2) INFORMATION FOR SEQ ID NO: 124: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:



TTTATTTACA 


CCTGCATTGG


AAGTCGAAGA 


TGCCGAAAAG


AGTAGTTGGA CTTATCCCGT 


60 


TTATGGTTAT 


TTAGATAGCG 


AAAATCCTTG 


GGAAAAAATT 


GCGGCTTTAT TAAATAAACG 


120 


AAPArAAGGC 


ACACCACGCT 


TTGCTTTAGA 


AAAACAAGCT 


CTTTCTTTAG CTCGTTTCGA 


180 


TPA ATTftA A A 


ACCTATTTCC 


CAGCTAGCGA 


TTTTTCGCAT 


GATGTTACGC CTTTAATTGA 


240 


2i Zi A A PTTP A A 


TTAATTAAAA 


CAGAACCTGA 


AATTCAACGT 


TTACTAGAAG CTGGTAGTTG


300 


GGCTGATGTC 


GCTTTTGAGA 


TTGGCTTCAA 


AGCAATTCAA 


GCAGGCGTTG CGGAACAAGA 


360 


AATTGTTGCA 


GAAATCGAGT 


ATCAATTAAA 


ACGTCAAGGT 


ATTCGCTCAA TGAGTTTTGA 


420 


TACGTTAGTC 


TTAACAGGAA 


AAAACGGAGC 


TAGTCCTCAC 


GGGGTACCTG GCGAAACAAA 


480 


AATTGAGCCC 


CATGACTTAG 


TGTTATTTGA 


TTTAGGTGTC 


GTTCATAATG GCTATTGTAG 


540 


TGATGCCACA 


CGAACAGTCA 


GCTACTTGGA 


ACCTTCTGAT 


TTCCAAAAAG AAATTTATGG 


600 


AATTGTCTTA GAAgCACAAT 


TAGCAGCAAC 


TGAAGCGGTG 


AAACCTGGTG TTACAGCAGG 


660 


cGAATTAGAT


GATATTGCCC 


GAGGTGTAAT 


TACmAAAGCT 


GGTTATGGTG AATACTTTAA 


720 


CCATCGCTTA 


GGCCATGGCA 


TTGGAACCAC 


TGTCCATGAA 


TACCCATCGC TAGTTCATGG 


780 


CAATGATTTA 


GTCATTGAAG 


AAGGTATGTG 


TTTCTCTATT 


GAGCCCGGTA TTTACATTCC 


840 


TGGAAAAGTA 


GGCGTTCGTA 


TTGAAGATTG 


TTTACACGTA 


ACGAAAACAG GCTCCGAACC 


900 


ATTCACTAAG


ALAAUUAAHo 


AATTGCAAAT 


TATTCAATAG 


CATAAAAATA AAATGAGGCC 


960 


ATCAGTCAAC 


CGGACTGATG 


GCCTCATTTC 


TAAATTTTAA 


GGTAAGCCGC TGAACAAAAG 


1020 


CAACACGATC 


CGTTGCTAAC 


TTTCATAAGC 


TAAACGCTTG 


TTACCCTTTA AATTCTTTTT 


1080 


TTACGTCTTT 


CGCTGCATCT 


TTAACATCTT 


CTTTTGCTTC 


TGCAGCTTTT TCAGGAACAT 


1140 


CTTTGGTGGT 


TTCTkTTGCT 


TCATCGACAC 


CAGCAGAAAC 


AGTTTCTTTG GCCGCTGCCG 


1200 


CAGAATCTTT 


AACGTCAATA 


TAGATATCTT 


CTGCTGCATC 


TTCGGCAATT TCACCTAAAT 


1260 


CTTCTGCGTG 


ATCTTGAGCA 


TCGTCAGCTG 


CTTTTTTAAA 


TTGATCAGAT AAATCACCTG 


1320 


TTTGTTTTTT 


AAAGTTATCA 


AGCATGTCGC 


CAGAAACACC 


TTGTGCTTTA TCCAATGAAT 


1380 


CTTTTGTTTT 


TTCTTTGACA 


GAACCTGCCA 


AATCAGAGGC 


TTGATCTGAT AAAACGCCGG 


1440 


CTTTTTGTTT 


TGCGATTTCT 


GTTAATTCTG 


TTCCTTTTTG 


AACAGCATAA TCTGTGTAAT 


1500 


CTTGCGCTTT 


GTTTTTTAAA 


TCATCTGTTT 


GATTTGATAA 


ATCATCACGT AATTCTTTAC 


1560 


CTGATTTTGG 


TGCAAGTAAT 


AATGCGGCAA 


CGGCTGCTGC 


TfiTTrrAffA ATTACTGCvC 


1620 


CTAAGAAAAA 


TCCGCCTTTT 


TTAGCCATAA AAATCTCCyC 


CAATTAGTTA TTkGTTTTTT 


1680 


TATTTTTTGA 


ACGCATTGCT 


CTAAAAGCAG 


AACCGCCTAC 


TTTACTCACT nAAACCAGcT 


1740 


TTCGCTGtGC 


CTTTyCCTAC


GGTTCCTACC 


TTTGTAATCA 


AATGTTTACT TGAGGTGTTC 


1800 


AATTCAGAGA 


CACTTTCGCT 


TAAATCAGCA 


ACGGCCGTAA


ACAACGGATC AATCGTTGCC


1860 


ACTTTTTGAT 


TCACATCTGT 


CAATAGTTCA 


TTACTTTTAA


CAAGCAAGCC TTCCACTTGG


1920


CGAC I GAAT A


CATTCACATG 


GG 1 AG I GAGb 


a ptt t n a t t 

/iU III orV JL 1 o 


TTGTATTTGT


TTCATCAATT 


1980 


GTATCTTCAA 


TTTTTGAAAC 


AAG1 1GAGAG 




ACACAAGTAC TAAAAAGATG 


2040 


ACTAAAACAA 


CAAATGcTAC 


CGCTGGAA1 I 


a ar'^r*a^r , a a 


TTTCCCCACC 


TGACATATCA 


2100 


TTACTCCCCT 


CTGAAATTTT 


ATTTATAATC 


1. 1 AGGA1 AAG 


ACCAACTTAT 


ATTTTTCACA 


2160


ACTCATAAAG 


CTAAGAATGA 


CAATTTATTT 


1 Al AAAA1 1 G 


TCATGCCCTT 


TTCTTTAAGA 


2220


AAAAAAGCGT 


AAAATTTCCT 


TCTAACGAAT 


rti Tv Tv TV 'P^P 7\ 


CGCtTATTTT 


TATTTATTTT 


2280


TCTTTTGGTT 


GCTCTTTTTT 


CGCTAATTGA 


GG AGG 1 AAA 1 


TAATAATGTA 


TTCTCGTAAA 


2340


TTATCTTTTA 


CTTCTGGATG 


AGTCAAACCA 


TAT 1 GAA1GG 


TTGTTTTCAT 


AAACCCAAAT 


2400


TTATCGCCAA 


CGTCGTAACG 


TTCTCCTTTA 


AACTCAGGAG 


CAAAAACGCG 


TTGTGTTTTA 


2460


TTTAAAGTGT 


CAATCGCATC 


TGTTAACTGA 


ATTTGGT1AG 


CAGCGCCTGG 


TTTTTGATTT 
2520




TCAAGTACAT 


CAAAAATTTC 


AGGGGTTAAT 


AAA 1 AAGGGG 


CGATAATCGC 


TAAATCACTT 


2580 


GGTGCTTCCT 


CGGGTTTTGG 


TTTTTCAACA 


AAATTCTCGA 


CATTGTATAG 


TCCTTTTTCG 


2640


ATTTCTTTTT 


CTGGGTTGAT 


AATACCATAT 


TTTGAAGTAT 


CTTCATGGGG 


TACTTTCATT 


2700 


ACCGCAATTG 


TCGAAGCATG 


TGTTTGTTCG 


TAATCATCCA 


TTAATTGTTT 


CGTTAATGGC 


2760


ACTTTATCTT 


CCATTAAATC 


GTCGCCTAAC 


ATCACGACAA 


AAGGTTCATT 


GCCAACAAAT 


2820


GCTTTCGCTT 


GTAAAACAGC 


GTGTCCTAAG 


CCTTTCGGAT 


GGGATTGACG 


AATAAAATGT 
2880




AAATTAACAT 


CCGTTGTTTC 


TTCGACTAAT 


TTCAATAAAT 


CGGTTTTATT 


TTTTTCTTTT 


2940


AAGTTATTTT 


CTAATTCAAC 


ATTTGAATCA 


AAATGATC 1 1 


CAATCGGACG 


TTTGGCTTTG 


3000 


CCTGTCACAA 


TTAAAATATC 


TTCGATTCCT 


GAAGGTAG 1 G 


CTTCTTCTAC 


AATAAATTGA 


3060 


ATCGTTGGTT 


TATCAACAAT 


CGGTAACATT 


TCTTTTGCCA 


TCGCTTTGGT 


AGCCGGCAAG 
3120




AAACGTGTTC 


CTAACCCAGC 


TGCTGGAATA 


ACTGCGT1 1 I 


TAACTTTCAT 


CTTTTTTCCT 


3180 


CCTAAAAATC 


AAGTGTAAAT 


TCGTTTTCGG 


111 1GGGA1 G 


GCGCAACATG 


ATTTCTTTTG 


3240 


CTGCTTGTTG 


TACATCTTCG 


TCATTGTACA 


AG AGA I 1 G 1 A 


GATTGTTTCT 


GTGATTGGCA 


3300 


TTTCGACTTC 


TAGTTGTTGT 


GGGAATTGAI 


AGGG 1GG 111 


AGTTGTTGAA 


ACACCTTCCA 


3360 


CAATCATGCC 


CATGTTTTCT 


AAGACTTCAT 


C 1 AAG 1 1 A 1 G 


GCCTTTTCCT 


AATAAATTAC


3420 


CCGCACGCCA 


ATTTCGAGAA 


TGGACACTTG 


TAG AAG 1 AAG 


AATTAAATCC 


CCAACACCAC 


3480 


TTAAACCAAT 


AAATGTTAAT 


GGGTTTGCTC 


CCAi GGGAAG 


CCCTAAAGGA 


Gl AA1 1 1G 1G 


3540 


CTAAACCACG 


AGTCATAATT 


GCCGCTTTGG 


CA1 1 A1GAGG 


AAAGCCTAAG 


CCATGAATCG 


3600 


CTCCGGCGCC 


CAAAGCAATA 


ATATTTTTTA 


AGGCCGCCCC 


TGTTTCTACA 


CCAATCACAT 


3660 


CATCATTCGT 


ATAAATTCTA 


AAGTAATCAT 


TCATAAACAA 


CTCTTGCACA 


TACACAGCGT 


3720 


CTGCTAAATT 


TTCGCTTGCT 


GCTGTAATGG 


TTGTAATGTC 


ATGAACAGCG 


ACTTCTTCTG 


3780 


CATGGCTTGG 


ACCAGATAAA 


ACCACAATCC 


CTTGCCGTTT 


TTCTGCTGGA 


ATCTCTTCCG 


3840 


CTATCACTTC 


TGAAATCCGC 


TTATGTGTAC 


CTTGTTCGAG 


CCCTTTACTT 


GCATGAATAA 


3900


TAATTGGTTT TGTTTTTAAA TGTTGGGCCA CTTCTTGTGC TACTGAACGA


ATGGCTTTAG 


3960 


TTGGTACCAC 


AAACAGTACC 


GCATCAACAT 


CTACTAAAGC 


AGTGGCTAAA 


TCAGTGGTTG 


4020 


CCTGAATTGA 


AGTAGGTAAG 


ATTAATTCTG 


GTAAATAATG 


TTTGTTGGTG 


TGTTTGTATT 


4080 


TATTTCATCG 


ATTTGCTCTG 


GTTTATTCCC 


CAAATACAGA 


CCTCGTGTCC 


ATTTcGGCTA 


4140 


ATACTgAGCT 


AAAGCTGTcC CCCAGGAACC TGGACCTAAA cAGCGACTTT TgTTTCATCG 


4200 


GTGAAACTCT 


CCTTTCGCTA 


AACAACGTTA 


CTTCTTTAGT 


TTACCATATT 


CTTCAATCAA 


4260 


GTTGGATTAA 


GAAGTTGTTA 


TTTTTTTCTT 


TTGTTTCTCC 


CGATTGTAAA 


ATTTCAAATT 


4320 


GGGATTTCTT 


CTGCGAA .:GA 


TCACAATAAC 


AATTGCTGCC 


ACGAACATCA 


CTAAAGACAA 


4380 


CAATTGTGAG 


ACACGAATGT 


TACTAAACGC 


ATACAAACTG 


TCCATCCGTA 


AGCCTTCGAT 


4440 


GAAGAAACGG 


CCGAAAGAAT 


ACCATATTAT 


ATAACCTAAA 


AAGACTTCGC 


CTTCTTTTAA 


4500 


AAAGTGTGGC 


TTTTTGCGTA 


ATAACACTAA 


AACGATAAAG 


CCTAAAACAT 


TCCAGACAGA 


4560 


TTCATATAAA 


AATGTTGGTT 


GATGATATGT 


TCCGTTGATA 


TTCATGTTAT 


CAATGATAAA 


4620 


CGTTGGTAAA 


TGGAGATTTT 


CTAAAAATTG 


TCTGGTGGTC 


GCTGGCCCAT 


AGGCTTCATG 


4680 


ATTCATAAAG 


TTGCCCCAGC 


GACCAATTGC 


TTGCGCTAAA 


ATCACACTTG 


GTGCGGCAAT 


4740


ATCTAAAAAA 


GTCCATGTAG 


AAATGAAACG 


ATGACGCGTA 


AAGAAAAACA 


GCGCCaGTCC 


4800 


CCCACCAATA 


AGACCACCAT 


AAATCGCTAA 


GCCACCATTT 


CGAGTAAAGA 


AAATTTCAAT 


4860 


CGGATTGTCT 


ACATAATCTT 


GCCATTGAAA 


AGCGACGTAA 


TACAAGCGCG 


CGCCAACAAT 


4920 


TGCTGCAGGT 


AATCCCCAGA 


GCATAAAGTC 


AAAGACATCA 


TCTTCTTTTA 


AACCAACACG 


4980 


AACGGCTTCA 


CGACTACTTA 


GCCAAACAGC 


TAAAGCAATC 


CCTGAAACAA 


TAATAATTGC 


5040 


ATACCAATAG 


ACGGGAATAC 


CGAAAAGGCG 


AAATGCAATT 


GAATTTACTT 


GGGCTAACAT 


5100 


CATCACATCT 


TCTCCTTCTA 


TTCTCCTGAA 


TTTTCTTCAA 


TCAAACGGGT 


TAAGCGCTCT 


5160 


TCGAACGTTT 


TTGTTGCATC 


ATAGCCCATT 


GTTTTCGCGC 


GGAAGTTCAT 


TGCTGCTACT 


5220 


TCAATGATAA 


TGGCCACATT 


TCGGCCGGTC 


TTAACAGGAA 


TACGAATCTG 


AGGGACATCA 


5280 


ACGTTAGCTA 


TTTCAACCAT 


TGCATCATCA 


CTACCAAGAC 


GATCATATTT 


CTTGTCTTTT 


5340 


TCCCATGCCT 


CTAAATAAAC 


GACCAATTGA 


ACTTGCATAA 


AGCCACGGAC 


CGCACTAGCT 


5400 


CCAAACAGGT 


TCATGACGTC 


AATAATTCCG 


ATACCACGAA 


TTTCAATTAA 


ATGCTGCAAA 


5460 


ATTTTTGGTG 


GTTCTCCAAC 


CACGGTTAAT 


TCATCTTGCT 


GATAAACATC 


GACGCGATCG 


5520 


TCTGCGATTA 


GCCGATGTCC 


ACGTTTAATA 


AGCTCTAAAG 


CTGTTTCACT 


TTTACCAATA 


5580 


CCGCTATCTC 


CTTGAATCAA AACACCAAGT 


CCATAAACAT 


CAACTAAAAC 


TCCGTGGACA 


5640 


CTTGTCCGAA 


CAGCTAAACG 


GCCATCTAAA 


TAACTGGATA 


GTTCCCCTAG 


TAAACGGGAA 


5700 


GTTGAAATCG 


GTGAACGTAA


TACAGAAACG 


CCATTTTCTT 


TTGCTGCTGT 


AATTAATTCT 


5760 


TCGGGGATTT 


CTAGTCCTCT 


GGATACTATA


AAGGCTGGAG 


TATCTTTCGC 


ACATAAACGG 


5820 


CGCATCACTA 


ATAAACGCTC 


TTCAGGCATC 


ATTCGTTCCG 


CAAAGGTAAT 


TTCTTtGCTC 


5880


CCAAATAATT GTAAACGGTC ATGGGAATAA T.AATtArAAT AQCCaGTTAA TTCTAAACCT 5940

GGCCGGGAAA TTTCACCAGT TTTAATTGTT CTATTTAAGC TTtCTtCATC TCCGTATACT 6000 
ACTTCTAAAG AnAGATTTTC TACCAGT 6027 
(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 



CAACAGTCCC 


AGATGATAAG 


GTTCTGCTAC 


CATTACGAAA 


CTCTACATCA 


GTATATAGTT 


60 


TATCTGACAC 


TCGCATAATA 


ATTTGGTGCG 


GATTTTCAGT 


AAAGTAATTA 


TCTGAATTTG 


120 


ATACCAATTT 


TGTAACATAT 


GATTTGTTAG 


CAGTTATATA ACCTTTGGCA GkCyTTAATC 


180 


yAGGTAATCC GTTTTCGGTA 


TAATCAACAC 


TaAAACTTgA 


TTAATGTATC 


TTTAGAAACA 


240 


AACTCACtTT 


TTTACTGAAA 


TTTACATCTT 


GATAATAATA 


ATCATTTGTT 


TTCATAACTA 


300 


CATACTCTGG 


ATAATCATAT 


AGATAATCTT 


TGATGTTTGG 


CACAACTTTT 


TGTGCATAAC 


360 


TTTTATTAGC 


CGTTAAATAA 


CCATTTTGGG 


TAATCAAACG 


AGGAACTTGA 


TTTTCACGGT 


420 


ACTCGATTCC 


CACAACAGGA 


ACTAAACTTC 


CCTTGGAAAC 


AGCTGCTCCT 


TTATATAAGA 


480


AGTCTGAATC 


CCCATAAAAA 


TTATCACTAT 


TTTTCATAAT 


AATTTGCCCT 


GGATTAACCG 


540 


TATGATAGTT 


AGCAATCCAA 


GATCCAGTAG 


CTACTACATA 


GTTCTTGTTA 


GCCGTGAAAT 


600 


AATACCCATT 


TGCCATCTTC 


AATCTAGGAA 


TTCCTGATTT 


TGTTTTCTCA 


ATAGCAGTTA 


660 


CTTTTAAAAC 


ACTGCCTGCA 


GATACTTTAT 


TTGCTTTTGT 


CTTGAATTCT 


ACATCTTTGT 


720 


AATAGGTATC 


TTCATTTCTA 


GTAATTACTT 


GTCCAGGATT 


AGATGTGTAG 


TAAGTATCTT 


780 


CTAATCCTTT 


ATATTGTTCA 


ACTATATTTT 


TATTTGCAGT 


CAAATATCCT 


TTCGCTGTTT 


840 


TCAATCGAAA 


TATACCGCTT 


TGAGTATAAG 


CTAAATCTTC 


AACTTCTACT 


ATTGTTCCCT 


900 


TCGAAACCTT 


AGACAACCTT 


TGCTTAAATT 


CTGGATCAGC 


ATAGAAATAA 


TCATCGCTTT 


960 


TCAAACGAAC 


TTTTTTAGGA 


TTTGTCGTAA 


AATACAAATC 


AATATTGCTT 


TGTGCCGCTA 


1020 


ATACATAATC 


TTTATTTGCT 


GTTAAATAAC 


CTTGGTCTGT 


GACCAATCGA 


GGAATTCCTG 


1080 


CTGAACTATA 


TTCAATCCCC 


TTAATGGTAA 


CTAGTGTATT 


TTTTTTAACT 


CGCCAACCAG 


1140 


GAGTTCTAAA 


CGCTACATCT 


TGGTAAAAAG 


TATCGTCATT 


TTTCATGATC 


ACTCGACCTG 


1200 


GaTTAGTTGT 


GTAATACTTA 


CTTAAGTcAG 


GCCCACTACT 


ATTACCCATA 


TTTAaaAAAT 


1260 


TACTATAGGT 


CwGGAAtATC 


AAACGGACGC 


CCTTcATAAT 


tAGCTAAACC 


AGGgAAATAC 


1320 


ATTTGCGAGA CCATTGCCAA GCAccTGgTC ATTATyCCmC 


TGCATGGAAC 


TATCTGGaGT 


1380 


ATATGGATAT 


tGCGCTACCC 


aTACTCTGTC 


TCTTCCgAAG 


GCGGAGGTGT 


CGATATAGCC 


1440 


ATTJcGTTAAc 


CACCATTTGC 


CAACATACAA 


CGCATCATTT 


TTATACCCTA 


ATGCCTTTAA 


1500


TTGCTGGTTA


AACGCAACAG 


AATTAGcATG 


cGTTTCGCCC 


ATTGTTTGTA


AGCGTTGGAT 


1560 


CTTCCGCATC 


ATTAAACATT 


ATGGTATTTT 


TATCTAAGCC 


AGAGTTAGCC 


GCCGCCTGAG 


1620 


CAAAATATCT 


TGCTTCATCT 


TGTGtGTCGC 


AGCAGAGACA 


TACATACTAT 


AGTGATATGC 


1680 


TGAAACTTTT 


AAGCCCGCAG 


CTTTGGCATT 


TCTAATTTGG 


CCCGCAGCAT 


ATGGATTAAC 


1740 


ATACCAAGTT 


CCTTCTGTCA ATTTAACCGA AACACCTGTT 


ACACCATAGC 


TTTTAATTTT 


1800 


TTGATATTCA 


GCTACTGAAA 


TGTCTCCATT 


CCAACTAGAA 


ACGTCAATAA 


AATTCTTGCC 


1860 


AGGCAAATTA 


GGATCATTCG 


CATAGACGTT 


CGCAACTGTT 


GCTCGCGCTA 


CCCTAGAAGA 


1920 


AAGGCCGTTT 


TTCATAGCAT 


ATGTTCCTTT 


TTTATGGTTC 


GTTCCTTCAC 


GTGCTGCTTC 


1980 


TTGATAATCC 


GCTTCATCTT 


GGTCAATGGT 


CTTTTTTTCT 


ACTTCAGAGC 


TTTGTGATGC 


2040 


TTCTGTTTTA 


CTATCTGTTG 


TGTTTGTTTC 


TGAAACAGAA 


GAATGTTCTT 


CCGTTGTACT 


2100


ATTTGTTGAA 


GAATCAAGAG 


ACTCTTCTTT 


TGTGCTTGAA 


GTAGAACTAT 


CTCTTTCAGA 


2160 


GGTATTTTCT 


TCTGTAACAG 


AAGATGTATC 


CTCCGTTGAA 


TTAGTCGTTG 


TTTCTGACGT 


2220 


AGCTTGAGTG 


ACCAAGCCAG 


TcTCCCCCGT 


TGTTTCTGTT 


TTTACAAGCT 


CTTCTCCTGA 


2280 


TGCAATCACA 


CTAAAACTAT 


TTAATAATAC 


AGTACAAATT 


AACATACCTG 


AAATGATTTT 


2340 


TTCATGAAAC 


TCTCCTTATA 


TATTTAATAA 


TATGTATCTT 


TCTTAAACAC 


TTGCCATTCG 


2400 


CTCTCGAAAA 


ATGTGTATCC 


TATAACCTTC 


AACTAATTGT 


TTTCACATTT 


TAAAACTAAC 


2460


ACCAATTATT 


ATCTCATACC 


TTTATTTCAA 


ATTCAATTAC 


GCTTTTTTAA 


CATTCCTATA 


2520 


TCAATTTTTT 


ACTTTTAAAT 


GGTAAAAAGA 


TGAAAACAGA 


TCTTTAATCG 


ATTGTTTTCA 


2580 


TCTTTTATTT 


ATTTTTTAAG 


ATATTCTGTG 


TACGATTTTT 


TAGTATCAAC 


AATTAGCGCT 


2640 


TCATGATGGG 


GTTGGCGATC 


TTCCTCTGGT 


GGCAATTGCA 


TATAATCCCC 


AAAAACCTCT 


2700 


GTTAAGTAGC 


GCTTGTAGCC 


CACTGGTATT 


GGCATTTTTG 


TATCTTCAAA 


TGGAACAAAA 


2760 


ATAGCAGATT 


TAAAATCTTC 


CAAATGATAA 


ATATTACCCA 


TGTAGCGTGG 


TCCGACACAT 


2820 


AATTCTGTAA 


CATAAGGAGT 


GGAGCCGAAG 


GAATACTTAG 


TCATTCTTTT 


TTCAGCAAAG 


2880 


CGCCAAATAT 


GATAGCGAAC 


CTTTTTCGAT 


GGAAAGATAT 


TCAATAATAC 


CCGACTCCCC 


2940 


GTTGCTAAAA 


TGCCACCATG 


TTTTTCTGGT 


ACCACTTGTG 


AACAGAATAA 


AGCATAAATT 


3000 


AATGCCCAAA 


TTTTTTGTTT 


CTTCCTTTGA 


AAATTTCCTT 


TTGGCGCACC 


ATCTAAAGGA 


3060 


AAAATATCGA 


TAGGAATACC 


ATGAGGAATC 


GCTAATCCAT 


CTTGATAGGT 


TTTAATAAAC 


3120 


GTTGTCTGTG 


CATCCCTAAT 


TGTCATAAAT 


GAATTATGAT 


CATTATAAGT 


TTTTGAAGCA 


3180 


ACTAACAATG 


GATAACATTC 


TGTATCTGCA 


TATTTAGGCC 


AAAGCACTTT 


CaATTTTTCa 


3240 


TAATCTGGTC 


TTGGCATAAA 


GAAATCTAAG 


TCATCATCCC 


aTGGAATAAA 


ACCTTGACTG 


3300 


CGGATTGCTC 


CGATACAACC 


GCCTCCACAA 


AAATAACAGG 


TTAGATTATG 


TTCTTCACAA 


3360 


AAAGCCACAA 


AATATTTTGA 


CATGCGTAAA 


ACGACTTGTT 


GAATTTCTAC 


TAATTGCGTC 


3420 


TGTGTTGTTT 


GTTCCACTTT 


TCCTTTTCCT 


CCCTATTCAT 


TCTTTGCATA 


TTTAAATGTT 


3480 
GTATAGAATA


AAAAACGCAT 


TAAGTAGAAT


GAACTTTTAA 


TAACTGAAAA 


CTGTTCATTT 


3540 


TGACGCAAAA 


TTTTCCAAAC 


CCAACTGATT 


GTATTCCATT 


TGTTAGAAGA 


TGTAGAGGTG 


3600 


GTTAGCTTAC 


GATAAATGGC 


TAATTTTTCG 


TTAATAAAAT 


AGACCGTATT 


TATTTCTTTC 


3660 


AAAATGTTTA 


ACCAAGTAGC 


ATAGTCTTCA 


TGTTTTAAAT 


CCGGCATTAA 


ATGGCGTTTA 


3720 


AGCAGATTAG 


ATTCGACTAC 


GACTGTTAAG 


CAGCCGATAG 


TATTTGTTTT 


TAAAAGATCT 


3780 


TGATAGGTTT 


GTGGTCCCTC 


ACTAAAATTA 


CGCTCATGAG 


TAATGTTGCC 


ATTTTCATTA 


3840 


CATACAAAAT 


AAGAACTACA 


TGTAAATAAA 


AGTTTTTTTT 


CTTCCATCCA 


CTTGACTTGA 


3900 


ACTTCTAATT 


TATTAGGAAG 


CCATAAATCA 


TCACTATCTA 


AAAAGGCAAT 


ATATTGTCCC 


3960 


TTTGAAGCTT 


CAATTCCTTT 


ATTTCTGGTC 


TTTGCAATGC 


CCATGTTCTG 


TTCATTTTTC 


4020 


AGTAAAAAGA 


TTCTTGAATC 


TTTTTCCATA AAGGCAAGyG 


CAATATCAAT 


AGAATCATCT 


4080 


TTACTTCCAT 


CATCAATGAG 


TAATAATTCC 


CAATTCTGAT 


AGGTTTGTTC 


CAGAACGCTT 


4140 


TCAATAGATT 


TTGATAAAAA 


TTTGCCAGCA 


TTGTACATGG 


GCATAATAAT 


TGAAACAAGT 


4200 


GCTGTTTCCA 


TkTGTTCACT 


CCTCGTTGCA 


TTTTATGTAA 


GAATGTAACA 


GTTTATATTA 


4260 


TCGCATAGCT 


CTTTTATTAT 


TTCAACGTTC 


CTTGTTTCCA 


AAAAAAGTTT 


GAGACATAAG 


4320 


CTTAAATAAC 


TTATGTCTCA 


AACTTTTAAC 


GGTAAATTCT 


AATAGCATTC 


TCTTTACAAT 


4380 


AATCCATCGT 


CAATTCTTCA 


CGTTCTTCAG 


ACTCATCCAA 


TCCTCTtGAT 


GATACTTTAT 


4440 


CAAAGAGAAT 


TTTGATGGTT 


tgAAGCAGAA 


TTTTCATATC 


CaATAGCAwG 


aATAATTCTT 


4500 


GATGTACAGT 


AAATCAAAGT 


TCAATTTACT 


ATTGTAGTCA 


GAAGCATACT 


TCCCATAAAC 


4560 


TTGGGCATAG 


CCAGTAATCC 


CCGCACGTAC 


ATTGTGACGT 


AAATAGTAAT 


TCGGGTTTTC 


4620


CTGATTAAAT 


TGATCAACAA 


AGAATGGTCG 


TTCAGGACGT 


GGTCCGACAA 


TGGACATATC 


4680 


CCCGCGAATA 


ACATTGATTA 


GTTGTGGCAA 


CTCATCAATA 


CGTAAAGCGC 


GCAAGTATTT 


4740 


ACCAACTGTA 


GTGACGCGAC 


TGTCGTTACT 


TGTAGATAAG 


ACTGGGCCTG 


ATTTTGCTTC 


4800 


TGCTGTTGCC 


GACATCGTTC 


TAAATTTTAA 


AATCGAGAAC 


TCACGTTGAT 


CTAATGTAAT 


4860 


CCGAGTTTGT 


TTGTAAATCA 


CTGGTCCCGG 


TGAGGTTGCT 


TTGACTAAAA 


TGCGC 


4915 


(2) INFORMATION FOR SEQ ID NO: 126:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3149 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
ATTAATGCAA TAGCTGCCCC TACAAGCGCT ACTCCTAATA AAGGTATTTT CATATAAACA
GCCATTACAA AACCAATAAC AAGATAACTA AAATATTCTT TTGCAGGTAA ATATCGCAGT 
AGTAACCCTA TACCGACAGC TGGTAGTAAA CCTCCAGCAA CAGTTAATCC ATTGGTTAGC 
\*e\ 1 1 LI uuno


TATAPGATAG 
X n 1 on. 1 n\j 


AATTGTATTA 
nn JL luini in 


ATTAATGATG 


GAQCGAAAAT 


TACTGCCAAT 


240 


PP T A PTPP,T a 


TP.PP TfZTT P. T 


TPiPTPiPTGTT 


AAAAACACGC 


CAAACAT.HTG. .C&HTJLTGGTA 


300 


aTPaTAPTAT 
A 1 L.rV X rHw X n 1 


AATPAPPTTT 


TTTPAPAPCT 


GCTTCAGCTT 


TGTGAACCAT 


CCAGATACCT 


360 


Al oy 1 A 1 1 lb 


TP. AT A APATP 


TAPTTGGAPP 
XnuX lUUfivVw 


ATCAATAAAG 


CTACTGGTAT 


ACCAATAGAG 


420 


ATTCb i AbAb 


Al bu i 1 1 i iu 


l_V— V^nu 1 M.O 1 n 


ATTGTAATAA 


AAGTTGCTAC 


GATACTTGCT 


480 


GTTTGATAbi 


LAublAl IbA 


Abb 1 bb 1 lyOC 


AAAATTACTT 


ATTCCCaAAG 


ATAGTAACTG 


540 


TAAAGTACCa 


CbGAT AAA 1 A 


B a PPTP t- T/t 7A 
AAbb lot i b A 


TATATPmPPf 


AGGATTAATC 


CTGTAACAAA 


600 


ACCCGCTGnT 


m t\ f~* r** rp n"i tv 

TACCGGT IbA 


1 AAb 1 I bbAA 


QPaTTP.TTP,A 
HLo Hull On 


GTTTTTTTCA 


AAGTTAATAA 


660 


AAAATCCATA 


TV /** P TV /""• T\ T\ rp m 

CAGCACAA1 I 


AAlAblAAl 1 


paaaAATaPT 


TGCTTCCATT 


AATTTTCCTC 


720 


CTTAATTACA 


1 b AAA 1 AA 1 1 


1 1 bAAAAl 1 1 


aPPTTTTTPA 


CTTGTTTTGA 


TAAAAATCGG 


780 


GATTTGTTGA 


Al b I b I bb 1 i 


bbAb 1 AbbAb 


TPTTTTTPP A 
X u X X X X x U^n 


CCATCAAAGT 


TTTCTCTTGT 


840 


CCATAGGTTT 


b 1 bb AAb 111 


PAPPTTTTPP 

bAbb 1111 bb 


TZa a ATA A APT 


TCTCGACTAG 


TCTCTTTATA 


900 


ATTTTCAATA 


bbbbbAAbAA 


c a a Tpppa TP 
bAA 1 bbbA 1 \* 


TPPAAATA A A 


TAGGTATCTT 


CAATATTCCA 


960 


AGCATTClbA 


IbAl b 1 bbAl 


aTTpaTaaa a 

Al IbAlAAAA 


TA ATP,TTPTP 


ATAAGTGGAT 


CACCAAACTC 


1020 


ATGGGCCTGT 


b 1 bAl bAAb 1 


1 Ab 1 AA i A 1 A 


AP.PPTTPAAT 
noo W x x^nnx 


TCTTCTCGTA 


tTTTTATATA 


1080 


TTTCGtTAGT 


ATl 1 bl I bbA 


PTT*PTTTJVPP 

b 1 1 b 1 1 1 Abb 


aTL APZiPP A A 
n i nnvjnv^Vw.nn 


ATTTCATTTG 


GAGCTCCGGT 


1140 


AACCATTGAT 


CCTCCTCCA1 


bA 1 b Ab A 1 AA 


TPPTTTf^PTP. 
I 1 X X X 


TGTGGTAAGC 


GATCACCATG 


1200 


^ * n>m /"* /""* fTl TV T\ 7V 

CATT CGT AAA 


A 1 AbbbbAAA 


aTPTTPpaTa 

A 1 b 1 1 bbA 1 A 


TTP.PA APPAf^ 


CGGACTAGTA 


ATTCTCTAAA 


1260 


CTCAGGATbl 


1 1 AbbAA bib 


prpppaTpaaa 

b 1 bbA 1 bAAA 


TPPTPPAATA 
X v^v_» X ^V/rtn X n 


TCTGTTGTCC 


ACCAAGGTAA 


1320 


CCCAGCCAAC 


CCAACbTllA 


^TPPBPTBTT 

Al bbAb 1 Al 1 


TaPTT^aTTa 

I X 1 on X X n 


CGAAAAGCTT 


CAAAGCTAGA 


1380 


ATCAATATCT 


CCTGAbbA 1 A 


tv r*Ti a a pt^tpp 
AbAAAbb 1 bb 


aTap.PP.TTf^T 

ninoUJi x Vj X 


GCCCCTGCCC 


ATCCACCTCT 


1440 


TACAAGTGTA 


AC AAb TbAi 1 


bAl 1A1 1 1 bb 


Ta aappTTPA 

X nnnV^L x x Ln 


GAAATCATTG 


ATAAATAATC 


1500 


GATTGGATAC 


TV TV TV rp rp z^ 1 7\ T\ 

AAA1 I bbbAA 


rprrim^mTV TV Am/"* 
111b 1 AAA1 b 


nPTTPPTTTT 

UO X X X X X X 


T TAT AT C GAT 


AATTATCGAA 


1560 


AT CAT A I AC A 


/-TV TV T 7\ r^CC T\. P 

bAA 1 AbbbAb 


PPTP&P.PT AP 
bb 1 bAob 1 nvy 


ATPTAGCCAA 


AAATATTTAA 


CTCCTTTGTT 


1620 


AAbAlAAi lb 


rp rp rp m rp TV TV rTi/^Tv 

I I 1 1 1 /in 1 urt 


ftPTTPPAAAP 


ATATTCTCGT 

f \ X n A A %rf A V^* W A 


GCTTTCTCAT 


TAGTCATATC 


1680 


a apa a a sbtt 


PTaTTTPPTT 


f5AATTTP,PAT 
unn 1 JL X own i 


TGTTAATCTA 


ACGCCACGAT 


TTACTTTCAC 


1740 


1 AAb 1 AAbbA 


1 1 1 1 b 1 nunl 


a T TP ATT A A A 


ATTCATTGCA 


TCAGTTTGTA 

A wAv AAA nJ X n 


CAGTAGGCCA 


1800 


TA 1 bbAAA 1 A 


All bbb 1 b 1 A 


P ATT AT ATTP 


ATPTPTTAAT 

n A u A \-r A A n^l A 


GTcTTAAAAA 


TTGTTCAGGT 


1860 


TGAGGCCAAT 


ATnCTcATCA 


AATCGATATT 


CACCTTGTTT 


AGGCCAATGG 


AAATAGTCAA 


1920


TAGCTATAGT 


TGAAAGTTTT 


ATTCCCAAAG 


CATAGTATTT 


TTTTACAACA 


TCCATTACTT 


1980 


CATTTGGAGA 


TCGATAGCGT


AGTTTACTTT 


GCCATAATCC 


CAATAAGTTT 


TCAGGCATTT 


2040 


TAGGAACTTT 


ACCAGTTAAA 


TCACTATAAT 


TTCGaAGTAT 


CTCTTTTGGG 


GTATCACCTG 


2100 


CAATTACTAC 


ATAGTCAATA 


TAATTAGTAG


AAGCCATTTT


CCATCGAGTC 


ATGTTTTTAG


2160 
CAAATGATAC CTCTCCTATA CCTGGATTAT TCCATAAAAA TCCATATCCT AAACTCGATA 2220

CATAGAATGG TATAGATACT TGTGAGTTTC TTTGAGCTAA TTCTAGAACT GTATTCTTCA 2280

AATCAAGATA AGATTGTTGA TATTGTCCCA TTCCAAAAAT CTTCTCACTT GATTCAGATT 2340

CAAATCTAGT TGTAACCTGA AAACCACCTT GTAAATTAGA GTGGTATTCT CTAGACTTCA 2400 

ACTTCAGCGT TGAATTAAAA TCTTTAGTTA TTTCAATTGT ACCTGCATCT TCACTTCCGT 2460

CATCATGTTT GACAGCTCTC AATCTGATAA ACTCTTTTAA TAGCACTTCA TTTTTTTCAT 2520 

TATAAAAAGT CAATCGGTCC CGATGGTCTA ACACAACTCT AATTTTCCCG TTTATTAATT 2580 

CTGTCTTATT TTCGTCTTGA TTTATTATGA CTGTACCTAA AGCATCAGGC TCACCAATTA 2640 

AAGAATTAAG TCTATTTTCA AAGTTCATAT CTACAAACGA TCTGACTCTA ACGGCATTTT 2700 

CACCCCAAGC TTCTACAATC AATAATTCAT CATCAAAAGT TCTCcGAAAA AGATTTTGGT 2760

GTAACTTCAA TCACATCTTC CAACCCACTT TCAATTTCTT CTTAATTAGT TTGGTATATG 2820 

ATAACAATAC TAAGTAATCG CAACAATACA AACGCTTTCC ATACGCACGC CTTTTGCTCT 2880 

CTCTGTGTcm ACAAAAGGTT ATAGAtTAAT TTGTTTTCmC GAtAAACTAA TTAACTAAAA 2940

AAATAAACAT AAGATAATAA GCATCAATGA CTTGAAATTT ATTCGTTAGC ACGTTAAACT 3000 

AATTAATAAA TAGACTATAA AACAGGCTCA GTAAAATGTC AACTGTAAAC GGCTATATTT 3060 

TTTTTGGATA AATTAATTGG CTTTAGGCAA TAGGGAATCT TAACCTTAAT TnCACTAATT 3120 

TATTCnAATT TAGGGGGGAA ATGGTTTCC 3149 
(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7019 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

ATTcCCCJcGC tTCyTTTyCC ATAAAAATAC yCTTTtCTAG ATATTTTAAa GCATTTTaTr 60 

GAAACGTTtC ACCAAAATAT GAACATCTTT TTCTTAAAAA TAATTCTGTT AAATAACAGA 120 

AAAAAGTCTG GTCAATCCGA CCAGACTTTT TAAATATAGA TAGTAAAATG TGTAATTTCA 180 

GGGATCACCC CAAAACGGGC ATGAATGGCG GTCGTTCCTA AGCCTGTATT AACATATAGT 240

TGCGTTTCTk GGTCCAACTG ATACAATCCT CTGGTATACT TTTCTGCTAA AACATTTGTG 300 

ATTGGATAAA AAGGAATCCA AATTTGTCCG CCATGGCTAT GTCCTGCTAA AACTAACTGT 360 

GTACCCGTAC CAACAAATGC ATCCGCGACA TCTGGTTCAT GGGTCATTAA TAACGCATAG 420

TCATACGTTT CTCGATAGGA TAATGTTTCT GAAACGGAAG GATTTCCCAA TAACGAGTCA 480

TCTAAACCAC CAATAAAAAG ACGGCGTCCA TCTGCTAAGG TTACTGTTTC TCCTTGGTTT 540

TTCATAACGG TAAAGCCACT GGCCTCCATG aCTTCrnCAT ACACCGCAGA AGCACCGCCA 600 

CCATAATCAT GATTTCCCCA AACTGCATAT TTTCCAATTG TCGCTTGTAA CTGTTTTAAC 660 
TTTTCAATCA


TCGGTTCTTT 


TTGTTGCGGA TTTTTGGCAT


AATTGTCAAA TAAATCACCT 


720 


GTAAAAACGA 


TAATATCTGG 


TTTTTCACGA 


TTGACTTTTT 


TTATGACTTT ATCTAACCGC 


780 


TTCGTCTCAT 


AAAATTCTGA 


AACTTGTATA 


TCCGAAAGTT 


GCACAACATT CAATGGTTCT 


840 


CGTTGATTTT 


TCTTTCCCAA AGTCACCTGg 


ATTAACGTGa 


ACTAAATTCG GCTCAATCTT 


900 


CCACGcATAG 


ATTGGGaGTG 


CGaTAATTAA 


GaAAAAAAAT 


AAGAGCATTA TACTTAAACC 


960 


AATTTTTTTA 


ACCACATCAC 


TCGCTCCTTT 


CTTTTCTTTG 


ATTTCCGAAG ACTTAGTATA 


1020 


GAGAAAAATC 


TTCTTAACCA 


CAATCTAAAA 


CAAAGGAAAT 


TAAGGCTTTG TTAAACAACC 


1080 


TAAAGAATTT 


CCACAATCTC 


TTATCCTTTT 


CTTAGCTTTT 


TTATAAAAAA AGCTAGATAA 


1140 


TTTCATCTGT 


CGATGAAATT 


ATCTAGCTAG 


AACATTCTTC 


TCTTAATGAT GATGTTTTCG 


1200 


AAACGTTTGA 


TAACCAATTA 


AGCAAAATAA 


AATCCCCAAT 


AAAATCATTC CAGTTACTGA 


1260 


AAATAAGCCT 


GCTAACATCT 


TTCGCATCCT 


CTCTGTTGCT 


AATCTTCCTT ATTTATAGTC 


1320 


ACTATTTCTT 


GTTGTAAAAT 


TGGTTGAAAT 


AGACGACGTT 


CTGCTAATTT TCTTGcTAAA 


1380 


ATTGCTTCAC 


TTTTTTTCGT 


AAAGATACCT 


AAGTTATGGC 


GCTTGCCTTT CAAAGAGATA 


1440 


TAAGCCAGCC 


ATTTCCCAGA 


AGCGGTCTGG 


CTCACCCCTT 


TGTGTCCACT GCTATTATTT 


1500 


TTGGGGCGTT 


TACTTTGTAA 


CTTCCCTGGA 


ATTACGCCAG 


CAACTCTTTG TCGGTAATCC 


1560 


TTCTTTTTAC 


GATGACCGCA 


ACTTTTGGTA 


TGGCCATTCA 


ACAAATTTTC ACAGCGAACC 


1620


CAGCAAGATT 


GTCCGCACGA 


ACACAGACAA 


TACGCTTTCA 


TTGTCTGTTG AATACTTTCA 


1680 


ATCCGTACCA 


CGACTAATTC 


ATGATAGTTT 


TTTTTCAACA 


TTGTTTGATA GCGGTGGTGA 


1740 


GCACATTTCG 


TTCGATACCC 


ACGAATTAGT 


TGTCTTCTTG 


AAGCTTCTGC ATAGCTCCCA 


1800 


CAAGAACAGC 


GACAGGCCCA 


AACAATCAGC 


CCTTCTTTTC 


TTTTTGCCGT TGGTGCTACT 


1860 


ACTGTCAACT 


CACCAAACTT 


TTGGCCAGTT 


AAATCTTTAT


AGTTACCTGC TTGTCTCAAT


1920 


TGACTATCCT 


TCGTATTCTT 


GTTCTCGCCA 


CATCTCGATT 


GGGTCTCTCT CATAGTGCTC 


1980 


ATTTTTCTGA 


GTCTCTCCCT 


TTCTTAGTTC 


TCTGACCACC 


TCAGCAAATA CTTGCCAATC 


2040 


TTTTGTTTCC 


GGCAAATGAC 


GCCATGTCAA 


TTGACGACAC 


CGTACTTTCG AACTATAAAA 


2100 


ATCGGACAAA 


TACAAATCGG 


TTTCTAAGGT 


TAATTTGTGA 


TCAATCATGA CATCCATATA 


2160 


ATTGAAAGAA 


GCAATAATTT 


TTGCAATAAA 


GCGGTTATAA 


TCCTCCCCAC CAGAAAAATC 


2220 


AACAAATACA 


TGAATCGTTT 


TTTCAATTGC 


GCAAAAAGAC 


CGATCATTTA ACAAACAAAA 


2280 


CATAAAATCA 


TAGTATAGAT 


GCACGCGTTC 


ATACATTAAA


AATCGTTTTT GATACAGCGA 


2340 


CTCTGTCTTT 


TGTATAAAAC 


CATCTAAGGC 


GCGATGAATT 


TGAGGAAACC GCTCTTCAAA 


2400 


GAAACTGCTC TGCTTGGCGG 


GAATAAATGC 


AGCCACCGAA 


AAGTATAGGC GGTCCCAGCG 


2460 


ATAACAAATC 


AACTTTAATT 


TTTCTTTTGT 


TTCCACCGTA 


ATTCGCTTTT CAAAGAAAGG 


2520 


AATAACACTA 


TTCATATGAT 


CAATAAATTC 


GTCTAACATG


GTTAACTTAT CTGGAAAAAC 


2580 


AATCGGACTA 


TCAATGTATT 


CATTTAAACT 


TAAAAAAAGA


AAAATATATG AAGTCTCTTT


2640 
GCCAAAATCC


TCTTTAAAAA 


GAGGGGTTTC 


GCTTGTATAA 


ATTTTTTCAA 


ACGCATGCTG 


2700 


TAAACAGCTA 


GGCACATGAA 


CTTCCGTAAA 


AGAGTCCACT 


ACATGATCAG 


AAAATACACG 


2760 


AAAACGTTGG 


ATCGTTAAAA 


GAATAAATAG 


TTGTGCGCTT 


TGATTAAGAC 


TAAACGTTCT 


2820 


TTGATACATC 


AGTTGTAAAT 


CGTTTATAAA 


TCGTTTAATG 


CCAGGACTAT 


TTTCCAGCAA 


2880 


AATGTTAGGC 


AGCCTTTCTT 


GATTACTTCC 


ATAAAAATAA 


TACAAGCATT 


GAAAACAAAA 


2940 


ACTTCGGATA 


TTTTTCTCTT 


CTCCAACCAA 


GCCTGTTGAC 


TTTAATTTAA 


TATGCCAATT 


3000 


AGCTAAAATC 


AGATTTAGTT 


TATACGAGAA 


GGCATAAATT 


TTGCTTTTGC 


TCATAAAATG 


3060 


AATTTGAGAA 


AATTCTTGAA 


TCGTGTGCTT 


TTGATAGAGG 


CCCACATCAA 


GAAGCAACTT 


3120 


GGCCATTAGT 


GATTGATCAA 


AATAATATTT 


TCTTAAGGCT 


TGATAATCAA 


TTTGCGATAA 


3180 


TTTTTGCGCA 


CTAATCAGTC 


CTTTAGCTGG 


AAAAAGAATT 


TGACTCCTTG 


AAGAAATCTG 


3240 


TGTACATTCA 


TAATTTATTT 


GACGGCACAA 


CGATTCTAAC 


TTTTGAGTGG 


ATAAATCAAA 


3300 


ATATCCATAA 


AATTCTTCTT 


TTAATAATGG 


ACCTTCACTA 


TTCATAAAAG 


TTTGATAAAT 


3360 


AAGTAATTTA 


TCTCGGAATT 


CTTGGTCTAA 


TAATTCATCT 


AACAAAGTAG 


TCACATTTTC 


3420 


AACTCCAAAC 


TATCACGCCT 


TAAGTATCTA 


TTTTTTTAAC 


AAGACGTGAG 


TGTCAATCTT 


3480 


GCTTATTTTT 


TCCACAAATT 


TAAAAATGAT 


TCTTTCAATA 


TTTAGGAATC 


TTCATAGAAA 


3540 


AGTATTATTT 


TAACCAAAAA 


AAGCCCCCTT 


GCGTCACCAA 


GGAGGCTTTT 


TTTGTTATTT 


3600 


TTTTACTTTA 


AATTTAAAGA 


AAAGCTGATT 


TGTTTGACTA 


ATTTTCTGTT 


TCTTTTCTAG 


3660 


CCTATTCGTC 


AAAAATGATT 


GAAACTTTTG 


GTTCGTCTTC 


TATACTATCT 


TCCTATAAAG 


3720 


TTCTTACTCT 


TTTAAGTATC 


AGCTTCTGTC 


AATAAAAGTC 


CTCATTTGGC 


ATTGCTAATC 


3780 


TGAATACAGT 


ATGACTACTT 


GTTGCCGCTA 


TTTTTTGCTG 


TTATTTTACT 


CTTTAATTGA 


3840 


AGCGCTAGAA 


ACGCTGTCAA 


CTGTTGGCTT 


TTCGGTCGCT 


GACTCTTCTT 


CATGGACTGA 


3900 


TGCACTTGAG 


ACACTATCTG 


CCGTTTCTGG 


CGCTTCTTTT 


TTGGAGGCAC 


CACTGACTGC 


3960 


ATCGACGTTT 


TTTCCACCGA 


CTAATTCTTG 


CCCAGTTTGT 


TTTAAATACC 


AATCAACAAT 


4020 


TGGTTTAAAG 


TCTAAAATAT 


ACTCATTAAT 


TCCCGCGTAA 


TTTCCATTAT 


CATCATACAT 


4080 


CGCTTGATAG 


TTATGCACTA 


CATAGGTATC 


TGGTCCGTGC 


ATTGGAACAT 


GAACGCGAAT 


4140


AGCATCTGTT 


TTTCCTGACC 


GTAATTGTTG 


AATCACCCAT 


TCTACATTTT 


TCAATGCTTT 


4200 


TGGCGGATGG 


CAATTCGCTA 


ATGGATTACC 


AACTTGTCCT 


GGTTGCCGTT 


TAGCAAACAT 


4260 


TTCTTCAGCA 


GCCATTTTTT 


TATTGTAATA 


TAAAAATTGA 


TTGTTACTGT 


CTGCATAAGT 


4320 


CAATTCCATT 


GGCATTGATT 


TTAAGAAATA 


ATTTAGCTGA 


TCTACGGTTA 


AAATGCCTCG 


4380 


ATCTAATTTT 


ACATATgTGt 


CGCCTTCCAC 


AGCCTGAAcC 


GCtTCTGCTG 


CTTGtTCTAC 


4440 


CCAATCATCT 


GCTGCCATAT 


CAACCCCTTC 


GATCGTTGTT 


TCAATGGTTC 


CTTTTGCATG 


4500 


TAAATCCTCA 


GGTAAGTCTT 


TTGGCTTAGC 


TCCTTCTAAT 


TTTGAAATCA 


CATCAATGAA


4560 


TTCCaCGAAT 


TTTTGTAATG 


TGCTTTCTAA 


AAATTGACGT 


GTTCCCtCCG 


CAATTAAGTT


4620 
GCCGTTTTCA TCAAATGCTT CTTTAGCTTT CCCTAATAAA AACTCGTTCC CaGGCATCaC 4680
GATTGCATTG ACACCAGGTG CATCAAGAAT TTGACGGAGA TGTAATTGCG CACGCGAGGT 4740
TCCTTGATCA TAATAAGAGC AACCAACAAT CATTACTGGT TTATTTTCCA GCGGATGAAT 4800
TTTGAATGAT AACCATTCTA ACACACTTTT CAAACCGGCA GGAATGGTAT GGTTATGTTC 4860
GGGCGTCGCA ATAATTACGC CGTCTGCTTG TAAAATTTTG CGATTCATAT TCTGAATCAA 4920
GACACTGTTC GTTTGATCCT TACTTTGATT AAACATTGGA ATATCTTTAA TTTCTAAAAT 4980
TTCCAAGTCG AACATTTTAT AAAACTCTTT GCCAATATAT TGCAGTAGTA ACCGATTATA 5040
AGAAGAATCA GCGTTTGATC CGACGATCCC AATTAATTTC ATTTTATTCA CTCCATTTCG 5100
TTAAATCGTT TCCCATGAGA AATTTTCTGC TTCTTTTTTA TTTTGCGCAT GCGCATGTTT 5160
CAGCTGTTCT GTAATTTCAA TAAATACACA AAAATCTGCA AACAGACCAG CTAGTTTATC 5220
TATTTGTTCT GAATCTTTTA ACGCATTGTT TTCATCAAAA GCTTGTAAGG AATGCGCTAA 5280
TAGAAATTCA GAGCTTGGCA TAATTCGTGC TTTCAATTCA GGAGCATCTA AAATTTGACG 5340
CAGATGCGCT TGTGCTCTAG AGGAACCTAA CGTGCCATAG GATGCACCAG TAATCATAAC 5400
TGGTTTATCA ACAAACGGAA AAATCCCGTA AGATAACCAA CTTAGGGCAT TCATCAAAGA 5460
AGCCGGCACC GCATGGTCAT ATTCAGGCGT ACTGATAATG ACGCCATCTG CTGCTTCAAT 5520
TTTTTCAGCA ACCGTTTTAA CTATTTCTGG CAAAACTTTG TCTTCTGGCT TGTTAAACAT 5580
AGGAAAGTCA ACAATTTCAA CTAGTTCAAT ATCAGCCTCC TGCGCAAAAT AGTGTTGCAT 5640
AAATTGCAAA AGTTGACGAT TGGTTGATTG CTCTGAGTTC GTTCCTACTA AGCCGATAAT 5700
TTTTTTCATA ATGAGCTGTC CTTTCTTTTT CGGTTCTATG AATGGATAAT GTGAAGAGTA 5760
TCTGCTAACC CGCTGCTATA TAAAATTTGC TGGTCTTGTG TAATGATGAC TCCTTCAATC 5820
CCATCTAGTT GATTGAGGAT TTCAAGCGCT TCTGGAATTG GATAACCAAA TAAACGCGTT 5880
GTCCAAATCT CACCATCCAC TGAAGCATCC GAGATAATTG TGAGGCTGGC CATTTCAGTT 5940
TCCAATGGGT AACCGGTTGT CGGATCCAAC AGATGGTGAT AGACTCGCCC AGCTTCGGTT 6000
AAACTTCTTT CATAAATGCC CGACGTCACA ACCGATTGAT TAGCCACTTC GACAAGTAAG 6060
CTGATTGTTT CTCTGGATTC TTGCGGATTG CGAATACCAA TTCGCCATTT TTTGTTTTGA 6120
TGCGTAGAGG GcCCCAACGT CACAATATTG CCGCCCAGAT TAATCAAAGC AGACGTCACA 6180
TGGACTTCTT TCAAATAGGc AATTAGTAAA TCAGCGATAT AGCCTTTTGc TAAAGCCCCC 6240
AGATCTAATT TCATTCCtTC TTCTTCCAAA AAAACCGTTT GTTTTAAGGG ATTtAAGTGA 6300
ATCTTTTCtG GATTAATTTT TTTCAAACAG GCTTTAATTT CCTCTTCTGA CGGGACACGG 6360
GCATCTTTAA AACCGATTCG CCATGTTTGA ACTAACGGGC CAATGGTAAC ATTCAAATGA 6420
CTAGCTGGAT CACAACTATG TTTTTTCCCT AATGCAATTA ATTGATACAA CTCTGGATGA 6480
ACGCTGACTG GTTTTTGCCC AGCTTGTTGA TTGACAGCCA TCAGTTCCGA AGTTGAATCA 6540
TTTGCACTAA AGCGTTGTTC ATAAGTGATC AATCGTTGAT GCACTTCTTC TAGAATTTTT 6600
TCAGATTCTT CATGGTCTAC AAATACATCT ATTACTGTGC CCATCAAATA AATAGTTTGA 6660
CTTTGTTGCA CCTCAAGACT TCCTCTCCAC GAAAGATTGT GTTTTATTTC ACATTTATTC 6720
ACAAATGAAT CATACGCCTT TATTTGATGC CAGTCAAGTC GTTTATATAT TTTTTTATAT 6780
AAAATAACGA CCTTAAAATT TTCATTTAAG GTCGTAACTT GTCTTGAACT CTTTTAAAAA 6840
ACGCTTTTTT TATTATATTT TGAAAATAAA TATATAATTT TCTATATAAA CTAATTTCCC 6900
TTAGATAAAA TTGTTTCTTT CAATGTTTGA ATTGCTTCCA ACAAGATGCG ATTTTTATTG 6960
ATTTTTGCAT ACGCGCGGGC ATCTTGTAGT AGTTCTAAAA TTTGCTCTTG CGGCTTTTC 7019
(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32768 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:
TATTTATTCT TTTAACGAAA CACACAATGA AGGTAAAGAG aCTCTtGGTG GTAAAGGGGC 60
AAACTTGGCA GAAATGACAC GCTTAGGTTT GCCGATTCCA CAAGGATTCA CGATTACAAC 120
CCGTTGCTGT ATGGACTATT TGGCAGATGC TACTTTCTTT GAAGAACACT TACAATCAGA 180
AATTTTGAAA GCGGTTAAAA ATCTAGAAAC TGAAACAGGC AAATCTTTTA CCGCGGACAA 240
TGAGATTCTC TTAGTTTCCG TTCGCAGCGG TGCTGccTtC TCGATGCCTG GTATGATGGA 300
TACTATTTTA AATTTAGGTT TAAATGATCA ACGTGTGAAA AAATTTGCGA CCTTAACCTC 360
TCCTGGTTTT GCTTTTGACT GTTATCGCCG ACTTATTCAA ATGTTTGGAG ACGTCGTCTA 420
TCACATTCCC AAAGAATTAT TTGACCAACA AAAAGAGCGC TTGGAACAAG AGCTCAACAA 480
AAAAATTACC GCTTTTCAAG AAGAAGATCA TTTTGCCTTG ATTGTCCGCT ACCAAGAAGT 540
TTTTGAACAA CATCAGGTGG TCTTCCCGCA AGACCCTGTC GCACAACTTT TTGAAGCAAT 600
CAAAGCTGTT TTTGATTCAT GGAATAATCA ACGTGCGGTG GTTTATCGAA ACTTACATCA 660
TATTGCACAT GATTTAGGTA CAGCTGTCAA CATTCAAGAA ATGGTTTTCG GCAATCGTGG 720
ACTAGACAGT GGCACTGGCG TTGTTTTCAC TAGAAATCCG GTCACAGGGG AAAATCAACT 780
GTTTGGAGAG TTTCTTTTGA ACGCGCAAGG AGAAGATGTA GTGGCTGGTA TTCGGACGCC 840
AGAACCCATT CGTCGTTTAC GTTTGACTAT GCCAAAAGTT TATCAAGATT TTCGCCATTA 900
CGCAGAACTT TTAGAATATC ATTATCGTGA CATGCAAGAT ATTGAATTTA CCATTGAAAA 960
TGAAAAATTA TATATTTTAC AAACAAGAAA CGGGAAACGA ACCGCCGCAG CTACGGTTAA 1020
AATTGCTTTA GACTTAGCCA AAGAAAATCG CATTACTAAA CAAGAAGCAC TGTTACGTGT 1080
GACACCTGAT ACGATTGATC AATTAATTCA TCCCGTGTTT GATCAAGAGA AACGACAACA 1140
CATGGAACGT TTAGCTATGG GACTTCCTGC TAGTCCTGGT GCAGCAAGTG GCCAAATTGT 1200
3120


ACTGGAATTG 


AACCAGCGAC 


CTACCGCTTA 


GGAGGCGGTk 


GCTCTATCCT 


ACTGaGCTAC 
GCCAACTGCG AAGAATGAAG CTACAGAAAC TGTACCGGCT GC/yVTGCCGT CTTTTcCATA 7200

AGATTTAACT AAATTGTGTG CTACTAAAAA J3GCTACTAAA ACGGCAAAGA TTGCGAATGT 7260

ACCGTTCCAA ATGTTACCAC CAAAGCTTTT CCAAATTGGT GCATCGTTAT ACATCATTGG 7320 

GAAAATTTTG TTTAAAAAGT CCTCATATCC TGGAATTGGA ATATTATTAA TTAATACCGC 7380 

CAAAGCCCCT AAAATCATTA ATGGCATCGT TACGACAAAC GCGTCACGAA TTGCTACCAA 7440

GTGCCGTTGC GCTGCGATAA CTGACGCTTT CGGCATAAAG TGTTTCTCCA TCCATTGAAC 7500

AAATCCGTCC ATTGTTATTC CTCCTAATTA ACGCTTTCTC AAGCGCGTTT TTCTTATAAG 7560

ACGCATTGAC AACGAGATTA CTCGTTATCA ATACGGCGAT AAAGATCGAT AATTTCTTTT 7620

GCTAAATCTG TAAATGCAAT CGCAGTCATT AAATGATCTT GACTGTGCAC TGTTAACAAT 7680

GTTACTTGCA TGTTATTGCC TTTTGCTTCT TCTGTTAACA TTCCTGTTTG CGAATGATGT 7740 

GCATGTACAA GTGATTCTTC TGCATCTGCA ATTTTTTTGT CTGCTAATTC AAAATCCCCC 7800 

GATTTTGCTG CGGCAATCGC TTCCATTGCA TCGCTTTTAG CATTTCCGCC ATACATGATT 7860

AAACCCATGA TTGCTTCTAA GTTCTGTTGT CCGTCCATTA CCTCTACCCC ATTTTCTGCT 7920

CTAGTTTGAT GATCCCATTA ATGAAATTGC TTGATCTAAA ACTTTTTCGC CATTCATCAT 7980

GCCATAGTCC GCCATGTTAA TTACATCTAA AGGAATCCCT TTTGGTTGTA ATTTTTGTTC 8040 

AAATTGCCCT TTCATGAAAC GAACTTGTGG GCCTAATAAT AAAACATTCA CCTCTTTATT 8100 

TTCCAAGTTT GTATCTGCTT CAGAAGCCGA TACTGCAAAG ATGTCTGCTT CCATACCACG 8160 

ATCTTCTGCT GCTTTTTGCA TTTTTGTTAC TAATAAACTT GTGCTCATTC tGCGGAACAA 8220 

ACTAACATAA TTGTTTTTTT TGCCATTTTC CTTCATCTCC TAAATTCTCT TTGTTTTGCT 8280

TTACACCTTT ATATAAGCAA GAAGCGTGCC AAGTTTTAAC ACATTAAAAA CCTTGATATG 8340
ACAACGTTTC TTGATATAGA AAGCGTGTAC ACACTAAAAA TAAAACGATT ACACACAGCT 8400
TTTTTGTGTG CGTTTCCAAA ATAATAGGGC AACAAAAGAA AGTGAGCGAT AACCCGAAAC 8460

GACTCATCGC TCACTTTCTA AAGCTGAAGA GAATCAACTG GACAAAAGCC GCTTTTTCCC 8520 
TTTAAGCAGT TTCCATCATT TTAAGCATTT TATAAAACTG TTTCTTTTTA AAATAACTTA 8580 
AAAAGATACC TAATACTGCA TCATTCAGTT TTTGTAACGC CCCATAAGAA ACCATTGTTT 8640

CTGTATAACG CAGTTCACAG CTTTTTTCAT CAATTGGCTG TAGCTCATAC CGCACCGAGA 8700
AATCATTTTT AGTTGTGGAT GTTCGAAATT GATAGGTCTT ATTTTCTACA GCTTCTTCAA 8760 
TAACTATTTT CGCGCGACTA TTTTTACTAT ATTCTTTGAT ATATTCAAAC TTACGTAACT 8820 
GCTTTCTTGT AAGAGTTTTG CCTGTTTGCT TACGGATATC AAAAATAACA GAATTCATTA 8880 
ATTGCGTGTA AAATTCATCT GCTGAAATAG TTAATGTTTT CGTAATTTCC ATTACCGTTT 8940

CACCTCATCC TTTGTCGCCT CTTCTTCAGC AACTTTTTTT CCTTTAAAAA ACAGGTAACC 9000
ACCAAGTCCA ATGAGAAAAA TCCCCGTTGT CAGACTTAAT AAATCATAGT TTAAGGAGCT 9060 
GGGATTCAAC GAATACAAAA CAATCAAGAA GCCAATACTA CATAAAAATA ATCCATTTCT 9120 
AATCATTTTC ATCCATCCTA TCTTTTTAGA CTACCGTTGC TTQGAAGTAT AAAACGCCAC 9180

CCCAaGGTTG TTCCTTAATC AGCTGtTTTG CAACTTGTGC AGCAGTTTCA GGGICTCCTX 9240

CAACTTCAAA AAATAACTTA ATACCATTTT TTTCTTTAAA AGTAAATGTT TCATTTTCCC 9300 

CTTGATATGC CTCTAATAAA GTAATAATTT CCTCTTGCTT CATTGCACTT GCAATCGAAA 9360 

TCATGCCGCT CGCTCCTTTC TATACAAGTA ATTGCTGTTT TTATTGCTTA TTGCTCTCTT 9420

TCGCCTAATG CAGGATCCAT TTTGTTAGCA GCGATAACAA ACGGTGTCCA AATCAAGAAA 9480

GTGACTACCA TACATACCAA CGCTACGATT GGTGCGCGCC AGTCGGCACC TGTCGCTAAG 9540 

AAAGATAATA GCAACGGCGG TGTCACCCAT GCGACTTGTT GAGAAACTGG GGCGAGTAGC 9600 

CCAAGATACG TTGCAACCCA GCCAATTGTA GTAGCAACTA ATGGCGCAAT TACAAATGGA 9660 

ATAAACATAA TTGGATtCAG AACAACTGGT AAACCAAACA TAATCGGTTC ATTGATATTA 9720 

AAGATCCCAG GCCCTAATGA TAATTTACCA ACTGTTAAAT AATCTGCTCG TTTGGAGAAT 9780 

AACAAGATTG CAATAATTAG GACGATTGTA CCGCCTGAAC CACCAAACCA AGCAAAAGCA 9840

TCAAACGAAC CACGGACCCA CATGTAGGCT TTACCGTCGT CAATAGCGGC TAAAACCGCA 9900 

GCTGTTCCTG TTTTTCCTTC ATAACCTTTT TGGAAAATAT CAATATTGAT TAATTGTGCT 9960

TGTCCCCAAA TTCCTTCAAG AACTGGTGCC AAAACGTTTG GGCCATGAAT ACCAAAGAAC 10020 

CAGAAAATTT GAACAAAAAC CGTTACGATT AACACAGCGC CTAGTCCTTG TGATAACCCA 10080 

AGGAACGGTT CCGCGATATA TTTTTGAATT AAGTCAATTA ATAATTGACC GCCTGAAAGT 10140

TTTGACACCG TAAAGTTAAT AATTGCTACG ACATACAATG CGATTGTTGC TGGTAAAATT 10200 

GCCGCAAACG CTTTTGAAAC TGCTGGTGGT ACAGAATCAG GCATTTTTAT TGTTAAATCC 10260 

GCTTGCATTA ATTTACAAAA AATAATGGTT GATAAAGCGC CCATAATCAT CACCGTAAAG 10320 

AAAGCGTTAC CATTTAGATG GTTGAGATTT AACCAACCCC ATGGATTGAC AGTCAATTGA 10380 

TTATTAGCAA TTGACGCACC AATTTCTGCC CCATTAATTG CGTTGGCAAT CTTTTCTGGC 10440

ACTTTTACAG CTAATTCAGC TGTGTTATTT GGTGCAAAAG CAACACCTGA AATTAAGGTA 10500 

GCTAAACTGA CAATCCCACC AGCTAGCTCA TTCACATTGT ATGCTTTCGC TAAGTTATAG 10560 

CCCCAAGAAA AAGCGAAAAT CAAACCAGCG ATAsCcAATG TACCATTCCA AACAAAACCA 10620 

TTAATCCCAA TGATGACATT CAATACTGCA AAAATACCTT TGTCTGCTTC CAAACTTGCC 10680 

GCATAAGTTG AGACAAACTC TTGGGGAAGA TCTCGAATAA TTGCGTTAAT CATTACGGCG 10740

ACTGATCCAG CCATTGTCGC AGGTAATGTA CCAATAAAAG CATCCCTTAG TGCCACCAGA 10800 

TGTTTCTGTG cACCAATTTT TGCGGCAGTT GGTAAGATAT ATTTTTCCAT CCATGCCATT 10860

AATCCGTTCA TTTTTATTCC TCCTGTATCC ATTTATATTT ATTAAAAAGG CAGTAGCCTC 10920 

ATTAAACGAA TGAATCGAGC GTACAAAACA AAACCTTTGG TAACATGCAC TATCCTTATT 10980

TGCTATTCAC TTTTCCTAAA CGTTTTAATT CCTTATCCCA TTAATGAGAT TGCTTGATCT 11040 

AAAACTTTTT CGCCATTCAT CATGCCATAA TCTGCCATGT TAATTACATC TAAAGGAATC 11100 
AGCGACTTTT TCTyCGATTT CaTTGTAAAC TTGCATGTCT CTAGCTGGAT CTTGTGACCA
AACAATTTTG TTGGTGCCTG GCTGGAAAGA TAAATAATTA CTTTTACTAT CGACAAATTG 
ATCCAAGCGC TCAGGTACTT CTTTAATTTT TAATAATCCT TTTTGGACAG GTAACGACAA 
ATCCCGTTTA CGGATAACCA AATGTTCTAA ATTGTGCGTC CGATAATAAA GGAATGACTT 
AGCACAAACT AATTTTAAGT CCCGCTTCAC CTGACCAATA TTTCCTTCTG CATCGTATAA 
CATAAAAGCC AAAACTGCTT CGCGTTCGAT ATCAATTTGT TGGCCCAAAC GTGTTGCCTC 
TTGCTGAATA AATAACGAAA TGATGGCATA ACGCTCATCT AGCGTCCGTG CATCTAAAGA 
TGGTAATGTA ATCGACATCG GAATCCGCCG ATTAAAAGTG GTTAAAAAGT TTTCTGAGGA 
TTCCGTTGTC GCACCAATAA TTTGAACAGA GGCTTCATAG GTTTGACCAC TTTCTCCTAA 
TGGACGGTAC ACGCCTTTAT CAATAAACGT AAAGAGCATT TCTTGACCTT CTGGCGGTAA 
GCGATGAATT TCATCTAAAA AGAGAATCCC GCCATCTGCT TTTGCCATTA AGCCTGGACT 
ATCTTCATTC GCACCAGTAT AGGCACCTTT TTTTACGCCA AAGATATGAC CAAATAATAA 
TTGTGGGTTT TGTGCATAAT CAGCACAATT AAAAGAGACA AAAGGGGCAT CTTCATCTAA 
GGTTTTTGAA TCAATGGCAA AACGGTACAT ACATTCAGCA AAAAGAGATT TCCCTGTTCC 
TGTTTTCCCA AAAATAATCG TATGCAAACC TCTTGGTGGA TATAAAATAG CCGCCTTAGC 
CTGCTGAATA CTCACTTTTA AAGAATCATT GGCACCTACT AATGTGTCAA AACTAACAGA 
GGTGGACTCA TCTACATGGA CTTGTTCTCC TTCATTGATT GTTGGCACTG AATAGATAAC 
AGGACGTCCA GTACTTCTAG TAATTTTCTT TTCTTTGAAT AATTCATTTA AATAACGGCT 
GGCATTACTG CGATCAATCA TTAACTGTTC TGCTACCGCT CCTGCCGTTA AGCCAGTTTC 
ATTTTTTTGA AGTAACGCTA AGATTTCTTC CTTCCGAGAT TTCATGGAAT CCTCCTCGAT 
ACAACGTTCT GCTTGArGCT CGTAAAGCGT TATTTCTTCG CTTTACGAGC yCCTTTGTTT 
TTCGTGTTTT TCTTTTTGTT TTTCTTCTTT yTGCGTGTTT CTTGTGGTTC TTTGACAACA 
GAATTTACTG TTTCTGGTTT TCTTTCTTTC GTACTCTTTT TTTCTGAATA CGTATCTTTG 
ACTACGTTCT TTTTTTCTAG TTCATCTTTG GCTTTTGGCT GTTCATGAAG TGCACCATCA 
TATAAAAAGA TTTCCTGATA AGGAATGGCC AATTGTTTCA TCAAACGTTG ATAATCCCTT 
TTGGTGCCAT CGTTGATAAA CGTAATCACT GCCCCTTGTG CACCCATCCG GCCAACGCGA 
CCAGAACGAT GAATATAGCT CTCTTCGGTT AAAGGAACAT CTACGTTTAC GACGTAAGGG 
ACCCCAGTGA AATCTAAGCC CCGTGCCGCA ATATCCGTGG TTAATAACAA CTTCACTCGT 
TCTGCTTTAA ATTGATCGAT CGCCAATTTA CGCAACTGTT TACTTTGATC TGAAGCCAGA 
CCAATAGCCG GAACATTCTC ATAAACTAAT TTTTCTTCTG CTGCACCTAA ATCAGCAACT 
TGATTGAAAA AGACCATGCT TCTGAATGCT TCTGTATGGG CCAAACGACG TAAGTAATCA
GCTTTTTTTC TTGGTGCTAA CCGAATAAAG TAGTGGGCGA CTTGGCCTGC TGAAGTATCT 
TCTTCCGTCA CATCAATCAC CTGTAATTTT TGGCTTAAAG ATTGCGCTTG ATTCACGACA 
CGATCAGCAG TCGCTGAATA AAAAACAAGT TGGTATTCGG TfGGTGTGTG TGTCAAAATT
TGCT-TGGT1A AACTTAGCTC TTCCTCTTGG AAAAGTTGAT CAACTTCATC CATTACGATT 
GTTTTTAACA ACTGAGCTTT GACTTTTTTA TTCTTCATTA ACTCTAAAAT TCGACCTGGT 
GTTCCGATTA ACACTTCGGG ACGTTTTTTT AACTTGTCAA TCTGGCGTGA AACATTTGCC 
CCGCCGATTA ACGTTTGAAC CTTTAATTGA AGTGGCTTAG CCCAAGTCCG AGCAACTTCA 
GCAATTTGCA TAGCTAATTC TTGAGACGGA GCAATAATCA ACAATTGATT TCCCTGTCCT 
TTTTCAACAG TTAAGAGTAA AGGCmGCmTG TAAGCAAGTG TTTTTcCmGT TCcTGTCGGC 
GAAATTCCTA AAACATTTTC TCCTtCGCGA AGGGGCGTAA AACTTTTTTC TTGAATTGTT 
GATGGTTCGT TAAAGCCAGA AGCTTGCCAC TGTTCTTGCC AAGCCTCTGG AAGTTGTTCT 
AAAAATCTCA TATTGTTCCT TTCCTATCGC TTAATTCTTA TTTATCAGCA TCAAAAATAA 
TGCCGGCACT ATGACGTAAA TCAGTGACTA CTTTATTGAC ATTTCGAGCA AGCTCAACCC 
ATTCTTCATA CTTGGTGCCC CAATACTTAT CTGTTGGGTT ATTTAAAACT GCTGCAAAAT 
CACGGGCTTC TTCAATCATT GGATTATCGG CTACTTGAAT ATCTAAGACA TCACGTTCTT 
CATGTGCGCG ATCATGGTAC TCAGCCGAAG AAATAGCATT GACACTATCT AAAATCAAGG 
TCCCGTTTTC AAAGTAAATT TCGGATGGCG CAAAAGAATC ACTAATTTTC CCAGTGTTTA 
ACGTCACGTC AAACTGATCA TATCGTAAAA TAATTGTTCC TAAGCCATCG ACACCTGTCG 
CAATTTTACT AGCAAAATAA TGGCTTTCGT TTGGTACGCC AAACCAGCCT AAGGCTGCAT 
AAACTAAATA aACGCCTAAA TCCGCTAAGG CTCCCCCAGA AAAATGAGgA rAAAAtATTT 
GGCTCTTTGC CTTCTAATAC TTGATCATAG CGAGAAGAAT ATTTCATATA CGTAAAATTC 
GCACCTAAAA TTTGGTTTTT TAATGGCAAT AATTCAGCAA TTTTTTGGAA ACTTTGCTCA 
TGAATATTAC GGGCCGCTTC AAAGAAATAC ACACGATTTT TATTCGCTAA TTCAATAATT 
TCTGCCATCT CATCAGGCGT TGAAAAAGCT GGTTTTTCAA CAATTACATT TTTTTTCGCT 
AAAATTGCTT GTTTCGCTTG TTCAAAATGT AATGAATTTG GTGAGGCAAT ATAAACCGTA 
TCCATATGTG CAATACCAAA AAACGTCTCT AAATCAATTG CATATTCCAC ATCCCCATAG 
CGAGAACCAA ATTCTTGTGC CGTTGCTAGT TTCCGAGAAT AAACAGCCGT TAAATCATAC 
GCACCCGTTT CTAAAGCTGC TGCAACAAAT TGATGACTAA TCCAATTGGT ACCAATTACG 
CCTAAATGTA TCATAATCAA AAACCTCCAT TTTTTGTTTA TTTACTCTCT TTATAATAGT 
AATCTCTCGT ACAAAAGTCA CGTAATTACC CTGCCTTAAA GCGCTAGGAG TTCTTCAAGT 
TATTTATTAG GATCATCAAA TGCCTGTAAT AAATCAAAGG TTACCTGTGA CTTAAGTTCA 
ATCGTTGCTG GTGTTACGTG GCAATTGTAA GCAAGGACTT GCACACCTTG ATCAATTGCA 
GCACCCACAT TTTCAGCAAA GGCGGGTTGC ATCTCTTGGT GAAGCGTTGC TTGCTTAATT
TCTTCAAATT GGACCACAAA CAAAATGTAA CAGCGGTAAC CCGCTTTAGT TGCAGCCATT 
AGTTCAGTGA CATGCTTCAA GCCACGAAGT GTTGGCGCGT CAGGAAAAGC ACCGATGCCT 
TTATTTTCTA AAGTCATGCC TTTCACTTCG ACAAAAGCTT GTXCGTCCGC ATCCGTTTCC
ACTAGAAAAT CAAATTTTGA ATGTGCAAAT CGT.TGCTCAC GTTTAACAGT TTGGATCGTA 
CCAACTAAAC CAGGAAGCAC AATTTGACCA TTTTTTAACG CTTCATTGAC TAATGTATTC 
GGTACTTGGC TATCAATATT AATCCAAAAA GAACCTTTTT TTACTGCAAT CAAATCGTAA 
TCTGTTTTTC TTTTAGGCGA TGGTTGATAA CTAAGCGCCA CAACAGCACC TGGTAAAAAA 
ACTTCTTTCC CTCTGCCAGT ATTTTTGACA TGTGTGATTA CTTCTTCATT CGTTTCCATT 
AAACGACAGT GTGCAATAAA CCGATTGGGT CGTTCGATAA AATAAGCAAG TTGGACATTC 
GGGTACTTCA TACACAGACT TCCTTTCTTA CACACTATAC CATAAAAAAA CAAGGAAAAG
TATAGACCAA AACCAACAAT TTTACTATCT TGTGTCAATC ATTAAATATT TGTCAATTCT 
CAGCGATTTA GTCGAATATT TTAGACAAGT TCTATATAAT AAAGGTAACG AAGACGAATA 
AATGAAAATG AAGGTTAGGA GGAATTTCTT TGAAAAAACA TTGGACTTTC TTGGTTGGTG 
CATCTTTACT TACTGGATTA GCAGGCTCTA TCCTCTGGAA AAAAAATAAA CAGCTCCCTT 
CCCCACAGGA GCTTCAACAC AACTATACTG GGCATTGGTG GTTTGTGGAC GTGAAGAAAG 
CCACGCAACA CACCTTAGAA ATCACCTCAT CTTTCAATGT TTTCATTGAT GGCAAACAAA 
CGCATGCCTT GCTTGTTGAA TTAACCCCTA AACGCTTAGT TATCTTAGAC AAGTTAGGTT 
ACCATTTAAT CATTCAATGT GAAAATGGTG TCCCTGCGAC GCTTTACGAT GAAGCCGACG 
ACGAAACCTA CACACTAGAA CGGGCCATTT ACAATTAATA AATCACCCAC GACACATAAA 
ATGAAGTATG TGCGGTGGGT GATTTATTTT TTTACATCCC TTGCAAAAGC ATTCAGAAAG 
TTTTCTTACA ACTTGAATAT TATTAAATAG TCAAATGCGC CTTTTTCTGA TAAACTTAGG 
ATGATGTTTT TTTGAAAGGA TGAGCCCCTA TGACACCAAA TCGCGAAGAC TATTTAAAAT 
TAATTTTTGA ATTAGGTGGC GACGAAGTTA AAGTGAATAA TAAACAAATT GTTTCTGGAC 
TCGATGTTTC GGCAGCTTCG GTTAGTGAGA TGATTTCAAA GTTAGTAAAA GAAGATTTGG 
TTGAGCATTC TCCTTATCaA GGGGTACAAT TAACTGAAAA AGGCTTAAAA AAAGCGAGTA 
CGTTAATTCG CAAACACCGA ATCTGGGAAG TCTTTTTAGT AGAGCACTTA AATTACACTT 
GGAATGATGT GCACGAAGAG GCAGAAGTTT TAGAACATGT TACTTCACAG ACGCTTGTGA 
ACCGTTTAGC GGATTATTTA AATCATCCAG AATTTTGTCC ACACGGTGGT GTTATTCCCG 
AAGATAATCA ACCCATTCAT GAGGAGAAAC GCCAAACGTT AACAGACTAC CCTGTTGGCA 
CAAAAATTCG GATTGCACGT GTCTTAGACG AAAAAGAATT ACTGGATTAT TTAGTTTCCA
TTGATTTAAA TATTCAAGAA GAATATACGA TTAAAGAAAT TGCTGCATAT GAAGGACCGA 
TCACCATTTA TAATGAAAAC AAAGAATTAT CCGTCAGCTT TAAAGCAGCA AACACAATTT 
TTGTTGAGCC GTTGATTAGA GAAAGTGAGG AAAACTAATG AATGAAAAAG TAGCATTATT
TACTATTTTC GGGGGGACAG GTGATCTTGC CAAACGTAAA TTGTATCCTT CCCTGTTCCG 
CTTATATAAA AAAGGATTAT TAGCTGAGCG TTTTGCGGTG ATTGGTACCG CACGTCGGGA 
ATGGACAGAC GAATACTATC GTGAAATCGT CCGTGAAACC ATJCAAGACT TAAATCCCAC 19080

GGCCGAAGAG GCAACTGAAT TTTCTAGTCA TTTCTATTAC CAATCACATA AXGTGAATGA 19140

TACGGAACAT TATAATACCT TAAAAGAATT ATCTGATCGC CTAAATGAAC AATACCACTT 19200 

AGAAGGCAAT CATGTTTACT ATTTAGCTAT GGCCCCTCAA TTCTTTGGAA CCATTGTTAA 19260

TCACTTGAAA TCACAACATA TTATTTCGGA AGAAGGCTTT GATCGTCTAA TTATCGAAAA 19320 

GCCATTTGGT TCTGATTATG AATCTGCCTA CGAATTAAAT GAAGAAATTC GTGCTGCATT 19380 

TCCTGAACAA GATATTTTCC GAATTGACCA TTACTTAGGC AAAGAAATGA TCCAAAATAT 19440 

TTCAGCCATT CGTTTTGCCA ATAATATTTT TGAATCACAA TGGAACAATC GTTACATTGA 19500 

TAATGTCCAA ATCaCCTTCG CTGAAAGCTT AGGTGTCGAG GATCGTGGCG GCTATTATGA 19560 

CCACAGTGGC GCACTAAAAG ATATGGTTCA AAATCATATT TTACAAGTCG TTGCATTGCT 19620 

TTCTATGGAG CCTCCTGTTG CCTTTTCTGA AAAAGAAATT CGGACTGAAA AAGTAAAAGC 19680 

TTTACGTGCC ATCCGCCTTT ATTCTGAAGA AGAAGCCCTT CAAAATTTCG TGCGTGGACA 19740 

ATACGGCGAA GGTCAGCTAG CTGATCAAAC ATTTGCAGCC TATCGTGATG AACCAAATGT 19800 

GGCAGAAACT TCATCCACTG AAACTTTTGT TGCTGGTAAA TTTTTAATTG ATAATTTCCG 19860

TTGGTCTGGC GTCCCTTTCT ATGTTCGCAC AGGTAAACGA TTAACTGAAA AAGGCACACG 19920 

CATTAACATC GTCTTTAAAC AAGTACCTAT TAATGTCTTC AAAGATGATA CGTGTGAAGA 19980 

ATGCGATAAA ACCGATTTAC CACCGAATGT TTTAACTATC TATATCCAAC CAACTGAAGG 20040

ATTTTCATTA ACCTTAAATG GAAAAGAAAT CGGTCAAGGG TTCAATACAA CTCCAGTGAA 20100 

ACTTGATTTC CGTCAAAGTG CTGAAATGAC TGAAAATAGT CCAGAAGCAT ACGAAAAACT 20160 

TCTATTAGAC TGCCTAAATG GCGATAGTAC AAACTTCTCT CATTGGGACG AAGTTGCCCA 20220 

ATCTTGGCGC ATTGTTGACA TTATTCGACA TGCTTGGGAT AAAACAGACG TTTCTTTCCC 20280 

TAATTATGCC GCTGGCACAA TGGGCCCGCA AGCTGCCTTC GACTTATTAG AAAAAGATGG 20340 

CTTTACTTGG GAATGGCAAC CAGATAACTG GTACCGTGAC CGCGGTCAAC TTGATCAATA 20400

AATAAGCCAA AAAAGATCCG CTAGTTAACG ACTAGCGGAT CTTTTTTCTT GTTTATTCGA 20460

ATCCTTCAGC AACCGCTTCT GGATAATTTT CTTCAACAAT GTGTGCACAA CGGCCaCAAA 20520 

GTGTTGGTAA TTTTTCATCT GAGCCGACAT CTTGACGAAC GGAACGACAA CGGTCACACG 20580 

TTTCGCCTTC TGCTTTTTCA ACTAAAATAG CCATGTCTTC AAATTGAACA GCTTCACTAG 20640

GTGCTACTTC TTTTGATACT TCAAAGTCGG AAACAATTAG TAATTGAGCA ATATCTGCAT 20700 

CAACAGCTGT CATTAATTGA CGAATTTGTT CATTTGGATA AACGGTCACT TTGGCTTCTA 20760

ACGATTTACC GATTAATTTA GAATGACGAG CTTCTTCCAA CGCTTTTAAG ACATTGTCAC 20820 

GGAAATCCAT AAAGGCTGCC CATGTATCCA TCAATTCTTC TTCATTAGTA AACGTTTCGT 20880

AACCTGGGAA TTCAGCTAAT TGCACATACT CTTCTTCTTC TTGTAAGAAA CTCCAAATTT 20940 

CTTCCGCTGT ATGTGGAATA ATTGGTGTCA ATAATTTTGT TAACGAAACC AATGTTTGAT 21000 
AGAAAACAGT CTGCATACAA CGACGTTGAT AATCGTTTTC AGpTTCAATA TAAACGACAT 21060

CTTTCGCAAA GTCTAAATAG AAAGAAGATA AATCTACAGT TAAGAAGTTC ATAACTGTCC 21120 

GATAAATATG CATGAAATTA TATTTTTCAT AACCATTTTC ACGGATTTCT TGAATGACTT 21180 

GATTTAAACG AACCGTCATG TATTTATCTA CAGAACGTAA ATCGGCATAA GCTACAGTAT 21240

GCTCAGCTGG ATTAAAATCA CTCGTATTCG CTAACAAGAA ACGCATAGTA TTACGGATTT 21300 

TCCGGTACAC TTCAGATACT TGGTTTAAGA TATCCATTGA GACACGCACA TCGGCTTCCG 21360 

CGTCAACACT ACTTACCCAC AAGCGTAAGA TATCTGCCCC CATTTGATTG ATAACTTTTT 21420 

CAGGTAAGAT AGTATTGCCT AACGATTTAC TCATCTTACG GCCTTCGCCA TCGAGAACCA 21480

TCCCTTGAGA AATAATTGAT TTATAAGGCG CTACTCCATT GATAGCGACA CTTGTTGTAA 21540

TACTTGAGTT AAACCAACCA CGATATTGGT CAGAACCTTC TAAATACATA TCTGCTGGGA 21600 

AAGTTAACTC TTCTCTTTCT CGTAAAACTC CTTCATGTGA AGAACCTGAG TCAAACCAAA 21660 

CGTCCATGAT ATCTGTTTCT TTGGTAAACT CGCCATTTGG TGAACCTGGA TGTGTAAAGC 21720 

CTGCTGGTAG TAACTCTTTT GcTTCACGCA TAAACCAGAT ATTTGATCCA TGTTCTGCAA 21780 

ATAAATTCGC TACATGTTCA ATTGTTTCTG GCGTAATGAT GGCTTCCCCA TTTTCAGCAT 21840 

AGAAGATCGG TAATGGTACA CCCCACGCTC TTTGGCGAGA GATTACCCAA TCACCACGGT 21900 

CACGAATCAT ATTATATAAA CGTGTTTTTC CCCAAGGAAT TAGCCAGTCA ACTTTCTCAA 21960 

CTTCATCTAA AATATCTTGA CGGAATTTAG AGATTGAAGC AAACCATTGT GGCGTTGCAC 22020 

GGTAGATAAC TGGTTTTTTG GTACGCCAAT CATGTGGATA ACTATGCGTA AAGAAATCTA 22080 

ATTTTAATAA CGCGCCTTTT TCTTCTAATA ACTCAGTAAT CATTGGATTC GCTTTGTCAT 22140 

AGAAAATTCC TTCAAAACCA GGTGCTTCAT CTGTAAAGAC CCCACGGCTG TCAACAGGAG 22200 

AAATAACAGG TAAATCATAT TTACGGCTAA CGATATAGTC ATCTTCACCA TGTCCTGGTG 22260 

CGGTATGAAC TAAGCCAGTC CCGGCATCTA GCGTCACGTG GTCGCCTAgC ATAACAAGTG 22320

aTGTTCGATC ATAGAATGGA TGTTGAGCTG TCATACGATC TAATTTTTCA CCAGAAAATT 22380 

CTCGCAAGAC TTCTACTTCT TsCCAACCAA TCGCTTCTTT AACTGTTGTT AATAAATCTT 22440

TAGCAATTAC AAATTaCGAC CGTCTGCCTT CACTTCAACA TACGTAAAAT CAGGATTAAC 22500 

TGAAATACCT AAGTTGGCTG GTAATGTCCA AGGGGTTGTC GTCCAGATGA CGAAGGCTGT 22560 
CTCGTTATCT AAAAGACCTT TACCATCGGC CACATTAAAA GCTACGTAAA TAGAAGGTGA 22620

TTTTACATCT TTGTATTCAA TTTCTGCTTC TGCTAAAGAA GATTCACTTG AAGGAGACCA 22680 

ATAAATTGGT TTTAAACCTT TATAAATATA GCCTTTTTCT GCCATCTTAC CAAAAACACG 22740

AATTTCTGCT GCTTCATACT CTGGATCTAA CGTAATGTAT GGATGTTCCC AATCGCCTGA 22800 

CACACCTAAA CGTTTAAAAT CGTTACGTTG TTTATCGACT TGTGATAAGG CATACTCTTT 22860

ACATTTTTCG CGATACTCAG CGACAGTCAT TTCTTTACGT TTAACCCCTT TATTGGtTAA 22920 

CACTTGCTCa ATTGGTAAAC CATGAGtATC CCAACCAGGc ACATAAGGAG AACGGAAACC 22980
AGAPATTGAT


TTTGaACGaA 


TAATAATATC 


TTTACTGATT 


TTA.TTCAAAG . 


AATGTCCTAA 


23040 


A TP. A AT ATTT 
M 1 \jr\r\ 1 rt 1 J. i 


PPGTTTGPAT 


APGGAGGGCC 


ATCATGTAAA 


ACAAAGGTTG 


GTTTTCCTTC 


23100 


rTTT A A TTTT 
bill rxrt 11 11 


TP.APP.TTP.TT 
l UnV/O l lul 1 


PATATAAGPP 


TTTTTCTTCC 


CAGTCTTTTT 


GCCaTTCTGC 


23160 


1 1 bAbbb 111 


P.P.PA AP.TTPP 


PAPP.PATTGG 


AAAAGCTGTT 


TTTCCTAATT 


GCAATGTTTC 


23220 


T T I bAi 1 1 1 b 


7\ T ft. T P P A T T T 


PPPTPP.PTTT 

bbb 1 VOL 111 


CATTTTATCG 


ATTTAATTGT 


GTTAGTTCAT 


23280 


ACTAAAbA I b 


AAbb A 1 not 1 


PATAAAAP.TA 
brt 1 rt/vvrlO l n 


GATaAaAAAA 


GAGTACTCCa 


TCCCaAGGGA 


23340 


CGAAGT AC lb 




PPA A ATTPn A 


AAGTAGAGTT 


GCCCCTACTT 


TCTCTGAAAT 


23400 


TGATAAbbbi 


AAi 1 Al bbbb 


P APTTPPTAP 
bAb 1 1 bb 1 riv^ 


TrAAGTGTTC 


AGAACTGCAT 


CTAGGATAGA 


23460 


GGATbTbl 1 1 


AAI bbbAb 1 b 


TPTP.TP A AAP 
Iblbl brt/W— 


TTTCACTATC 


ATTGACTCGC 


TGCTAAGAAT 


23520 


TV T\ A /"* A HT 1 A A A 

AAALA I 1AAA 


bb 1 Iblbl lb 


T ATPT A TTTT 
lAlbiAllll 


GATTCTTCTT 


CAATTGTTTC 


TTCAGTGACT 


23580 


m/"< TV TV rp rp T\ /"» 7\ 

tcaattalacj 


bb lilllllb 


T E. PT B. P A PP A 
1 Ab 1 rtbrlbbrt 


TCTGTrTTCT 

1 V 1 VJ 1 v.. V A ^— ' A 


CTTCAGAGTT 


TACAACAGTT 


23640 


TbAl 1 l Ibbl 


lb 1 bAAbb 1 b 


TTPTTPATP A 
1 1 b 1 l brt i brt 


AGAATTTCCT 


TAACTGCTGT 


ATGTTTGTCT 


23700 


bbb Ab A 1 AAb 


^ n A^•nA.A.AAp^ ,, 
1 Ab AAAAAbb 


PTTTAAAATP 

bill /TdTXrVrt 1 ^ 


TfTTCCCACT 


CTTCACTCTT 


TACTTGcTCT 


23760 


AAT I GAG 111 


bAAbbAl 1 AA 


PPTT A AP.PPT 
bb 1 1 rirVbbb 1 


TGaTGGAAGA 


CACGTGTTTT 


CTTCTTCAAG 


23820 


TC TTCTGTT I 


bbbCAbb 1 AA 


TTPLPPPPPS 

1 IbAbbbbbA 


PGTTPAATTG 


CTTCTGCTAA 

\mf X X V> X W X *w* 


GATTTGTGTT 


23880 


m rp rp rp rp 

GAT Trbbbl 1 


b Abb A 1 b 1 bb 


A 7\ TPTATTPP A 
AAlbAl IbbA 


TTTGATTTAC 

ill un 1X1 nv* 


GTTCAGCTTC 


TACCAAGGTT 


23940 


TCTTTGGGI I 


bbT 1 Alb ibb 


PP ZiAPTPATA 
bbAAb 1 brt 1 r\ 


ATPATTTCTG 

^\ X x 1 Iviu 


ATTCTTTATT 

*V X X \m* XXX *i J- X 


GGCACTTGAT 


24000 


TTCACTTTAT 


bAbbbbl Alb 


Tt- PTPP A AP A 
1 Lb 1 bbrtrtbrt 


AT AAT T GAT T 

x nn x x un x *■ 


GATTCAATGC 


ATCTTTCAAT 


24060 


IbAl Lb AAb I 


Al ub 1 AAI 1 1 


TTPT-t- PTP.PG 
1 1 b 1 L b i b^-\J 


TGTTTTAAtG 


AwTTCTCAAG 


TTCACGATTT 


24120 


rrtrpmrp/— rpjv TV H""/-* 

1111b 1 AAlb 


P A *- PT TP A T A 
LnLL 1 IbAlA 


ATP APGTGTG 
r\ 1 brtv> ijiuivj 


ACTTGATCTA 


AAAAGTCATC 


TACATCGTCT 


24180 


1 bb 1 Inl AbL 


IjILI Cnl 111 


Tf^TTPAGAAG 
i b l i brtvsrtfAvj 


tCTTTGTTTT 


GAATAwCTAA 


tGGAGTTAAT 


24240 


bbbA 1A1 1 bb 


ttpb-PPTPTT 


T(^P, ATT ATTT 
1 b b.n i x / i i i j. 


TTGTAAAACC 


CCTTATGAAC 


ACATTCTCAT 


24300 


Ab 1 AA1 bb 1 b 


A A ATAPPA A A 
AAA 1 Abb AAA 


T ATPTT APTT 

1 i"\ 1 b 1 1 rtVw 1 1 


TTTTTATTAT 


ACACAACTTC 


GTCTGAATGA 


24360 


AfftPPPARft-B. 


Ibbll 1 1 b 1 1 


A A APAAAAGT 


TC GAG AT TAT 


TTTCTCAATA 


CCCCTAAGAG 


24420 


bAb Abb AAA 1 


TTTTCTTTCT 


TP.GTTTTTPP 
1 bb iiiii v ^- v -- 


TTCTAATTCT 


TGAATTTGAA 


GGCGCCCAAA 


24480 


1 vvtLl^vj 1 Al b 


PDAAPAATAT 
b AAAb AA 1 A 1 


PTAAT A AP.TP 


TAAAGCAAAA 


TCTGGTCGTG 


TCGTTTCTGT 


24540 


PP Tk A TTP RPT 


111 Ab 111 Ab 


PAPATTPGAT 
bribrll luoni 


CAATTGTTTt 


GATCGCTGCC 


TTGAAATATT 


24600 


AAA 1 Abbbbb 


PD.AATPAPAP 
bAAA 1 bAb Ab 


TATPPAGPPG 


CAAAGAGCTC 


ATAGTCGTTC 


GCTCTTCCTG 


24660 


CCAACCATCC 


TTCGGTATCA 


AAATTTGTGT 


ATAATCCCGT 


TCTTCTAAGC 


/""•A Tl/^A/TPA AT 1 

GAACAGTAAi 




TTTACCAATT 


TTGTCGACTT 


GGTTAACAAC 


GAAGCCAGCT 


ACTTCTTGTG 


CTATAAAAAC 


24780 


CTGCCACTTA 


TCACCATCGG 


AAATAATATC 


TCCAAAGGCT 


TCCcTACGAA 


, CCCCAACATT 


24840 


CATCAaGGTG 


CCTAAAATTT 


TCCCaTGaGA 


. TAaTGTCGCA 


. AaTTTAGATG 


; GATAAATAaT 


24900 


kTCAAATAAT 


ACAACACCAA 


. AATCTGTCTC 


TTCTGGTATG 


; TAATACGCTG 


1 GATAAATCAA 


24960 
GCAGbb 1111


bb 1 1 b 1 I 1 1 


pptpataapp jxppataaaaa GTAAACGATA AATCACTGTT 


25020 


TTGlbbAAlb 


2\ 7APPTTTPPA 
AAbbl 1 l^V^A 


AAATATAPP.P PTGTCTAGGA TCAAGAAAGT CTGTCAAATA 


25080


AGGCGCG1A1 


1 bbA 1 \~. 1 bb A 


PTTPPTPT AP* PPAATPPPPG ACAGTGTCAA TAAAAGGATG 


25140 


TTCATCT lib 


bbb AAn loll 


P,ATAPAPATT TGPGTTPATA GATAAATCAT CCTTTCTAAA 


25200 


AGTTATTAAT 


AAAbAA 1 Abb 


TPPTAAAATP, APTGPTAAGP CACCCGCTGC TAGGTTAAGG 


25260 


ACAATAATCC 


CTAbbAI lAb 


ptta A ATPPA APPATTPPAA TPPTTAAATT CAAACGGTCA 


25320 


AACAAACTTA 


AG lAlob 1 lb 


APAAATTPTA flA^AGPAAAP GCCCGAAAGC AGAATCATAT 


25380 


GCTCCTGGAA 


ACCATbAAAb 


TnrrrrrTna atpaptap,TA ATPPTHAATA GATTTGAACG 


25440 


CCTTTGTAT-A 


ATAGAGTAA1 


r t T\n r vr r r r rmr HTAATAAPAP PTPPPTAAGG CTTAGAAATC 

K^t\\J loll OV^V3 A 1 Art J. rtrt^rt^ ^ i V^V^O i. t\T\\J\J inunnniv 


25500 


AAAGAATTGT 


TTGTTGGCTA 


7\np7iTTPTPP TPTPHPAPTA TPAATTTCCA TATTCGCTGG 


25560 


TGTGCATAAG 


AAGATTTCAT 


fprr^TiTirrrr r"TPn iiTZiTPT PPPTPTAAAH PATACACAGT 

1 bbb AAbbbb O 1 unn IrtlLl L^jl^ tnnnu vnin^nvjnvji 


25620 


ACCAGTTAAG 


AAATCAACGA 


TrrrfprTrr TTrrTPTTrT TPT7APTAATP T A A A A T T A AC 
Ibbblbblbb llbblblH-1 J, U J. AV^ Irirt 1 o xrv\\^ 


25680 


TAGCACAGCT 


TCTTCAGCAA 


T\ T> A A A rn/-»rprprn CfT* A 7\ TrrTf 7A TTPPT TPTf^ AATAPAPAPG 

ATAAAlbiil bbbAAlb^lC Al IbL J. l^lc rtmi-rw-rtwinv-xj 


25740 


TGGTTCAATA 


ATAGTAATTT 


fp a rpr-rp^rpmrprp TrriiTTrrrr PTP.PTTTPTT f^TGPTPGTCT 
TAlblblill IbLnl lubLb olutJ. 1 1^1 1 uiu^i^uj-^i 


25800 


TGTATTCGTA 


GTCGCTGATT 


rnTTi\r7inrT n PP P D. T a P Zi TA 2\PP&PTTTTT TPTCTGTGTA 
bAllAbAAbl AbbbAlAoAA AV^-^AV^ i 1 1 i i. i^i^iuiuin 


25860 


AGCAGCTTGT 


TGTGTGTCTC 


TTTpnp r P(~7ATP PPPTTPZVP.PA P.A APP. AT AP.P GTGCAGAAGG 


25920 


TTTTTCAACT 


AC AG bb 1 bAb 


tttpttptpt ATAAPPTP^P Tf^APTGTAAC TTTCCTGTTG 


25980 


TTGAACAGGG 


/-»/"* rp m /""" rp /-"/Tprp 

CGTTGTbbl 1 


ttapppppqpp TPPAPP,APP,P P.GTGGTTCAT TAACTGCTTT 


26040 


GCGCTCTTCA 


TAT 1 b 1 1 bb 1 


nPTTaTPATA ATAATPATPT ' TPTTPGCCAG ATAAGCCAAA 


26100 


AAAGCTCGAT 


AACGCATCCT 


rprrnmA A7\ AAT TP QP AT A A AT TPPTPTPPTT TACGCATCTC 
I \j 1 I AAAAA X 1 VjAV^A I nnn I l\^.v».l^l^v^ii a n^uvn iVrfAV 


26160 


TGAATAACGC 


TGTTCCbAi 1 


rrTaTiiTifinr ttppappttp TTPAAPTGPG ATTGGGAAAT 


26220 


CGTtACTCAT 


TCCCATGCTA 


2\pttptpt7ap A APP APP A A A AP*ATAAATGT TTCTCATTTA 

Ab i IbibiA^ rtnuonu^nnn nun 1 rvrt-fA lu i i a ^ i * * 


26280 


CGGTCATTTG 


CAGCTCTTTT 


t\ ATf ptpts. a liTPTTPPBTP TAATAPTTPT TCACTTGCCC 


26340 


CAAAAGGTGC 


CATAGTCATT 


npnpppappn TTTf^A ATTTT PTPPAAATCA GCAAGTTCTT 

AbAbbbAbuA 111 orlA. 1111 ol v^^in-rtri ± x j. \* *. * 


26400 


GAATGAAAGG 


TAAbAbb 1 b 1 


TPTTTiiPPAn APPPPTPPTT PPTTTCTTCC CCAGTTATAT 


26460 


TCACTTCAAC 


AAA A /""* 7\ T\ f 7\ 

AAAAb AAb b A 


at a aptttpp TTP,PTPTTTT TTGAATTTCT TTGGCTAGAC 


26520 


TCAGACTATb 


l AA1 bbAl bA 


AAATAATPTA TPTPATTAAT AATTAATTTT ACCTTCCTAC 

1 /in 1^1 n 1 1 ^-n l l r\r\ i i i nn i i i x n v* ^ * ^ w * * *-w 


26580 


GTTGTAAATT 


7\ f**f^7\ A r P r P A. A A 

AbbAAl l AAA 


TPPPAPTPAA TATPATTAAA TTTTTCCAAT CCAGCTTTCT 


26640 


TTTCTAGTAA 


CTTATCTACT 


CGGTTTTCAG CTAAATTCTG GACACCAAGC TGAACAAGCT 




CTGCTGCTTC 


CTCAATTCCT 


ACTGATTTAG TGACGCCAAT CATTGTCACA TCAGCAACAG 


26760


AGCGCTGTGA 


GCTTTTGCAT 


GCTAGCTGAA GTTGATTATT AATATTTGTT AAGTTTTCAG 


26820 


CTATCATTTC 


TAATCGATGC 


TCCTTATCGT TTTCTGCGGA AAAATGGTGG CGTACTTAAT 


26880 


TCGTCATCAT 


TGTGATTTGG 


1 TTCTTCACGG TGGAATGTGT CAAATTCTTT TTTGTCTACT 


26940 
TGTTCTAATG AAGATTCATC AACTTTTGGA CGTGTATTTT GTTCTCTGCG AATATCCCAA 27000

TCGCCAAAAG CACTTGCTTC TTCTTGTGAT TTTGGTTGAT CCATTTCCAC AGATTGAGTT 27060

GTTTGTTGCA TTGGTTGAAC CGCTTGTCTT GTTTGACGGT GCGGTTTTCT ATCCTTTTTT 27120 

GATTCATCAA TACCTGTAGC GATTACTGTT ACACGAATTT CGTCGCCTAA ATCTTCGTTA 27180 

ATTGATGTAC CTAGGATGAT GTTCACATCA CCTGAAGCAG CATTTGTAAC GATATCTGAA 27240 

GCATCTTGCG CTTCAAACAA GGTCATATCT AAGCCACCAG TAATGTTTAA TAAGACTTGT 27300

TCCGCACCAT CGATTGATGT TTCTAACAAT GGTGAAGAAA TAGCTTTTTT CGTTGCTTCA 27360

ATTACACGTT CTTCGCCGCT CGCTACACCA ATTCCCATTA AAGCTGTTCC TTGGTTTTCC 27420

ATCACTGTTT TCACGTCAGC AAAGTCCAAG TTTACGTAAC CTGGTGCAGT GaTTAAATCT 27480

GAAATCCCTT GAACACCTTG aCGTAATACA TTATCAGCTT CTCTAAATGC TTCAAGCATT 27540 

GGCGTTTTCT TGTCAACGAC TTCTAATAAG CGGTTATTTG AGATAATTAA TAGTGTATCA 27600

ACGTTTTCTT TTAATAAGGC AATTCCTTCA GCanGCAAAc GACCACGTTT TGGACCTTCA 27660

AAACTAAATG GACGAGTTAC TACACCAACT GTTAAAGCGC CTAATTCTTT AGCGATTTTT 27720 

GCAACTACTG GCGCAGCACC TGTACCAGTT CCGCCACCCA TACCAGCAGT AATGAAAATC 27780 

ATATCCGCGC CTTGTAATGA TTCTGAAATC ACTTGTTCAC TTTCTTCTGc AGCTTTTTGG 27840 

CCAACTTCAG GTTGTGAACC GGCACCTAAA CCACGAGTGT ATTTAGGGCC TAATTGAATC 27900 

ACTGTTTCTG CTTTTGAATG TTTTAATGCT TGAACGTCTG TATTGGCTGT GATAAATTCC 27960

ACGCCTTTAA CGTTTTCTTC GATCATACGG TTTACAGCGT TTCCGCCGCC GCCACCTACA 28020 

CCGATAACTT TAATGACTGC ACCGTTGTTA ATGTTATTGT CTAATGAAAA TTCCATGATA 28080 

TCGTATCCTC CTGTTTGTGT GATTAGTCGA AAATGTTCGA GAAGAAGTCT TTGATTTTGC 28140

CTGTCACTTT TTCACCAGAC TCACGTTCGT TGAATTCTTC GTATTCTTCT TGAGGCTGTT 28200 

CAGCATATGT GTCATAGCGA ACTTCTTGTT GGACAGCGAC TGATTGTGCT GGTTTCGATT 28260 

TTTCACCAGG GATTGCATAT TTGGCAATGT GATAAATATC GTTTAGTTGC GCTGAATATT 28320 

CCACAATGCT AATGACATTA GCAAAGACTG GGTTACGTAA GCCCATGTGA TTTGGTACGT 28380 

ATAGTTTTAC GTTGGCTTCA AAAATCTCTT GCGCCAAATC GACAATACCT GGCATACTAG 28440

CGGCGCCACC AGtTAACACA ACGCCTCCAG GTAATTCmAA CGCATCAATT TCATCCAGAA 28500 

CTTCTTTTGA TTTTCTCAAA ATTTGTTCCA CACGCGCTTC AATAATTTCT GAAAGGTAAC 28560 

GTTCGTCCAC TCTTACAGGC TCTGACTTAC CGATAACATC AACAGGAAAC TCTTCGTTTG 28620

CTGAAGTTCT TTCTGGATAA GCATCCCCAT AATTAATTTT TAAGGCTTCG GCGTTGTTAA 28680

AAGAAGTATT CAATACGATT GAGATATCTT TCGTGATGAA CTCGCCGCCT TCTTGGTTAA 28740

CATGAGTAAA TTTCAATTGT TTGTCGTGGg ATAACAGATG TCGTTGtTTG aCCGCCACCC 28800

aTATCaATGa CAATkGtTCC aAAGTCTTTT TCGcCATCTG tTaAAATTGT TTCCGTTAAA 28860 

GCAAGCGGTG tAATCACTAA TTCGtTAATT CCTAAgCCAG CTTTTTCAAC ACATTTACGG 28920 



WO 98/50555 



PCT/US98/08985 



ATGTTATGGA TGaTTGTTTT TGGtCCAGTA TAGACAACGC CA^ACATCTC CATGCGAACG 28980

CCAAGCATTC CACGTGGGTC TTTGATGCCT TCAAAGCCGT CGACGGTAAA GTCTTGCGGT 29040

aAGATTGCAA CAATTTGACG TTCAGGAGGA GTTGAACGAA CCAAAGCTGC AGATGCGACG 29100

TTGCGCACAT CCTCATCTGT AATTTCTTTT GATTCACTGC TAACAGCTAT CATTCCTTGA 29160

CAGCTTTCTA CTTCTAATAA ATTCGCTGGT AATCCAACAT TTACGCTTTT AATTTGAATG 29220

CCGGCTTTTT CTTCGGCTTG TCGAACGGCC CGTTGGATTG CTTGAACTGT TTGATCAATA 29280

TCGACGACAA TCCCTCGATT CAAACCGTCT GATTTTGCAT TTCCAACGCC GATGATGTTC 29340

ATTTGACCTT CAATATATTC AGCGACAACT ACTTTGACAG AAGTTGTACC AATATCGAGG 29400

CCTACATACA TTCCTGTTTT TGCCATGAAC GGAATTCCTC CTCCTATTAA TTCCAATATC 29460

TAAGCGGACA AAGGCGTCCG TTTAACTATC TCTTTTTTAG TATTATTCAC TTCTTCCAAT 29520

TTTACCATAA AAATGGGGGT TTTAGAAAGC AGATTATACG AATTTGTTCG TTAATTTTTG 29580

AAACATTTTT GTAAAAAACT TATTCTGCTT GTTGCACTTC TTGATTTTCT CCATTTGTCG 29640

GCGTACCGTT ATTCGCTTCA TCCGTTGCTA CGCCTGCATT TGGGTCCACA ACTTCTTGAT 29700

TTTCAACCGC TGAATCTTCG GAAACTTCAC TACCCGTTTC TTCACTTTCT TTTTCATACG 29760

GATAAGAAAA AATGCCGACT TCCATATCAA CGATTCCAGG TTTGTCCATT TGCTCGGCTA 29820

CTTGTGAATA ATAAGCCATT TTTTCACTTA AATCAGAAAT ATTAACAATC ACTCGATTTC 29880

CATCATTCAT ATAGACATTA ATCAAGTCTT TATTGGTTTT GGATGGTGCA TATTTAATTT 29940

CTGAAATTCC TTGTTTAATT TCTTGAGGTA ATTTGTTGTA AGAGGCCATT AATTCAGGGA 30000

TTAATTTATC CTCTTTGAAA TTCTCAAAAA TTGGTTTGCC ACTTTCAGCA GCTTTGGTGG 30060

TTTCCGCCAA AGTTTTACCA TTCTCTAAAA TCGGATGATA GCCGCCTTTA GTCGCAGCAA 30120

GAGCGACAAT TTGATACTCT TGAATATCAA TTTTAAATGA ATTAATACCA TTTAACTTAA 30180

TGTTCGCCTT TTTTATAATT GGAAACTTTT TCTGAATATT TGCAGAAAAA TAATTTCTAT 30240

TTGAATACTG CTCCCATAAG CCACTGCCCG TTTCAAGTTT TGACTGCTGA ATAATCGCTT 30300

GGCTCTCCAC ACTTTTGTTA CCTGAAACTG TTACTTCACT CAATCGACTA AGAGGCGAAA 30360

CAAAATACAA TGCCACTAAA ATGGCTGTCC CTAAACAAGT TAAAATAAAG GCTAAACGCC 30420

TATATAAGCG CTTGTTCCGT GTTTTCTTAA CGTTCGGTAA ACGATCTGCA AAATGCTCCT 30480

CTTCTTCCAC TGTTTCTTGT GGAAGATGTT GTTGAGGCTC TTCTTGACTT TGTTCTTGTA 30540

AAGTGATGGT GACTTCTGCT TGTTCGTTTT CACCTTTTGA AGCAGCTTCT TCCGCTTGCT 30600

TCTTCAAATA TTCTTGGTTT TGTTTTTGCC AAGGTGTCAG TTCAGGTTGA GACTCTTCCG 30660

ATTTTTTTGG AGGTAATGGT TTTCTTTTTT TGAATATATC GTTCTCGTTA CTAATCTTCC 30720

ACACCTCCTA TCTCAGACTA ACGTTTTCAC AACTTGATAC AAACGATCAC TTGCATCTGG 30780

GATGCCTTCC CCTTTGGACG CCGTTGCCAT CTGTTGTCTT TTTTCATTGT TTAATAAAAT 30840

ATCATCTATT GCCGCAACTA AACGAGCGCC GGTCAATTCT GCATCTGGAA TCATTTCAAC 30900
AGCACCAACT


TTTACTAAGC 


TTTGCGCGTT 


T.TTCGTTTGA 


TGATCATibb 


1 bALb I AAbu 


jU7 DU 


ACTCGGAATT 


AAAATTGCTG 


GTAATCCTAA 


TGCGGTGAAC 


I LbbbAAl i b 


nou 1 1 bb 1 bb 


1 1 fi?fl 


TGCACGTCCC 


ACCATTAAAT 


CGGTATTAGC 


CATCACTTCA 


AC CAT T T TAT 


b AA 1 A 1 Abbb 


jlUOU 


TTGCACACTT 


ATGTTTGTCA 


ATTTCTTTTC 


AGAAAACTTT 


AGACTTTLI 1 


bAAb 1 1 bb 1 b 


J 1 1 4 u 


ATAATATCTT 


TCTCCTGAAG 


CGTATAAAAC 


TTGATACTCT 


CTCTCTTCAA 


TV V** TV TV 71 TV TV TV 

ACAAAGGAAA 




GGCCTGTTCA 


AACGCTTGAT 


TGATTTTCAA 


GGCGCCACGA 


CTTCCGCCAA 


TV fty TV Tl TV TV ^ TV 

ATAAAACCAC 


31260 


CGTTTTCTTT 


GCTGGATCCA 


AGCCAAACTC 


TGATAAAATA 


GCTGACTTTT 


CCACCGTTAC 


31320 


GACTTCTTGT 


CCCCGCGGGT 


TCCCTGTTAA 


AATTGTTTTT 


TCTTTCGGGA 


AAAAAC T AG C 


t i ion 


GACATCCGGA 


AAACAAATCG 


CAATTTTATC 


CACATAACGA 


CTTAAAAATT 


TATTTGTCAT 


*3 1 A A n 


TCCGGGGATA 


CTATTTTGTT 


CATGAATAAT 


TGTCGGTATT 


TTTAATTGGT 


GbGbbbbAI A 


ji,DUU 


AACCACTGCA 


CCAGAAACAT 


AACCGCCTGT 


ACCAATGACC 


ACATCTGGTT 


TV TV TV m ftl TA 

GAAATTCACG 


J IDOU 


AATAATTTTT 


TTTGCTTTAT 


TAATGCTTGT 


TAAAAATAAA 


TAAATTGTTT 


m tv tv tv tv fT\rnfr)fTi^ 

TAAAATTTTG 


■ji con 


AGGACTTAGC 


GAACGCTTGA 


ATCCTTGAAT 


TTTAATCGTT 


TTAAATGGAA 


TTTTCGCCTT 


JlboU 


TGGCACAATT 


TGACTTTCTA 


GACCATTtTC 


TGTTCCCACA 


TACAAGAATT 


CTGTTGCAGG 


31740 


TGCTTCTTTT 


TTGACGTGTT 


CGACGAAGGA 


TAACGCAGGA 


TAGATATGCC 


CACCTGTGCC 


j loUU 


ACCACCAGTT 


ACTAGTATCT 


TCATTTTCTC 


ACATTTCACT 


TTCTTTTAAT 


TGTTGAACAG 


J lo oU 


CTTGCATAAA 


GGCTTCCCCG 


CGTACTTCAA 


AATTCGGGTA 


TTGGTCCCAA 


CTTGCGCAAG 


o ± y^u 


CAGGTGATAG 


TAAAATAGTA 


TCATCTTTTT 


CCGAATAATC 


AAAGGCAATG 


GTAACCGbbb 


*3 1 Qfl n 


TTTGAACATT 


TTCAGCAAAT 


AAAATTGTTT 


CAATGTTCGC 


TTTTTTAGCA 


GCTTbbbbbA 




ATTTTTCTTT 


GGTTTCTCCA 


AACAAAACAA 


TTGCTTTGAG 


TCCCAACAGG 


GCAGbAAbbA 


•301 nn 


ATTCATCAAA 


TGAGTTACCG 


CGATCCAAGC 


CACCCGCAAG 


TAAAAGTAGC 


T i 1 lbb 1 1 bl 




CAAACCCACT 


TAACGCCATC 


TCTGTAGCTA 


AAATATTGGT 


TGCTTTTGAA 


TCbTTAI AAA 


VJ 


AACGTCTTTG 


CTGAACTTCG 


CCAACAAATT 


GCGTTCGATG 


AGGAACGCCT 


T\ T\ 7v 7\ /** n"> <T* rn /*■> 

G AAAAb 1 I I b 


"399 ft n 


TCAAAGTTTG 


TCTAATCTGA 


GCATTCGATA 


CATTTTTTAA 


TTTAGCTACA 


CAAAIbbb lb 


-a 9 -a 4 n 


CGAGTGCATT 


TTCAATATTG 


TGACTGCCAG 


GAATCCCTAG 


CTCATCGGCG 


GGCATAAI A I 




ATTCTTCATT 


GAAATATAAT 


TTCCCATCTA 


AAAGATAAGC 


CCCTTCTACT 


• rn m rn rn rn /~» 

GCTTCTTTbb 


"5 9 A £ n 
J Z ft o u 


TTGAAAAAGG 


CAATACGTTG 


GCAGCTGTGG 


TTTTGGCTAA 


CGTTTGAAGC 


TCTACTTGAT 




TCCAATTTAA 


AATCAAGGTG 


TCCTCTGLGb 


I b A i b 1 1 i 1 1 


1 IbAHlbbbb 




32580 


CAACATATTC 


TTTCCGCGAA 


CCATGATAAT 


CCAAGTGTGC 


CTCAAAAATA 


TTTGTAATTA 


32640 


CTGCAATTTG 


TGGGTGAAAC 


GTCTCAATTC 


CCATTAACTG 


AAAACTAGAA 


AGTTCCATCA 


32700 


CAAGATTATC 


• CTTGGCCGTT 


GCTTCTTGAG 


CCACCGTACT 


CGCTGGAAAA 


CCAATATTTC 


32760 


CCGCCAAA 












32768 


(2) INFORMATION FOR SEQ ID NO: 129: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6259 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 



GGCTGGATTT TCCTAGTAAG AGCTTGTTGG AmCAGCTTAA TAAAGACACA AGCCTTTTTT 60

GACGTCGTCG CTCAGGATTT TGCTAAAATG GCCCAGTTTA CGTATCATTT ACAAGAAGCA 120

AAAAAAGTCA CAGAGTATAC AAATTCTTTA AGATTAAAGG ATTTACAAGG AAAATAAGCT 180

GAAAACAACA TAAAAATAGT GTATAATTGA AAAAATTAAT CTGCGAGAGA AAGGGAACCT 240

TTTTTGATTG TAGATCACCG CAGTTGTTTG TTGCCTAGTC TCTAAGAAAT AAACCTAAAA 300

TTATTGATAT AAAGGATGAA CAAATGAAAA AAGAAGAAAT GCAAATGCGT AATACACGTC 360

GTCAAAAATC AGGAAAAAAT AATAAAAAGA AAGTAATTAT TACTTCTTTG GTTGGACTAG 420

CTCTGGTTGC TGGGGGCAGT TATGTTTATT TTCAAAGTCA CTTTTTmCCA ACCACAAAAG 480

TAAATGGAGT TTCTGTAGGC TGGTTAAATG TAAATGCTGC AGAAGAAAAA TTAGCGCAAG 540

TTAATCAAAC CGAAGAAGTT GTGGTTCAAA CGGGGACAAA AGAAGAAAAA ATTCAACTTC 600

CTAAAAAATA CCAATTGGAT CAAAAATTTT TAAAAGACCA TTTACACAGT AGCAAGGTGA 660

AGCTACCGTT AAACGAGGCA TTCAAAAAAG AACTAGAAGC CAAATTAGCA ACTTTGAGTT 720

TTCCAGAGGG GAAACCAAGC AAAAATGCGA GTATCCGTCG AGGCAATGGC ACTTTTGAAA 780

TTGTTCCCGA AGAACAAGGC ACAGTAGTGG ACACACAGCG CTTAAACCAG CAGATTATTG 840

CGGATGTTGA AGCGGGAAAA GGCAACTATC AATATAATGC CAAAGATTTT TATAAAGCCC 900

CTGAAATTAC AAAAGAGGAT CAAACGTTAA AGGCAACATT GACAACGCTC AATAACAAGT 960

TAAATAAAAC AATTACAGTT GATATTAATG GTGAAAAAGT AGCCTTTGAT AAAACACAAA 1020

TTCAAAACGT GCTGAATGAT GATGGCACAA TCAACAAAGA AAAACTAACT ACTTGGGTGA 1080

CACAATTAGA AACAACATAT GGTTCTGCTA ATCAACCAGT TTTATTTACA GATGTTCACG 1140




TfCJTTTTAAA 

x v\j ± x .L ± ruin 


AACAACGGAA 


GTTATGGCTG 


GTCGATTGAT 


GGGGCCAAAA 


1200 


CGCAAGAACT ACTAGTAAAC GCGCTGAATA GCCAAGAACA AACGAATGCA ATCACTGCTC 1260

CGTTGGTTGG TGATACCAAA GAAAATAGTA AAATTGCCAA TAATTACATT GAAATTGATT 1320

TAAAAGATCA AAAAATGTAT TGTTTCATTG ATGGCAAAAA AATAGTCACC ACAGATGTCA 1380

TTACTGGCAG ATATAACAAA GGAACCGCAA CAGTACCAGG ATTCCATACA ATTTTATATC 1440

GGACAACCGA TGTGAATTTA GAAGGTCAAA TGCTTGATGG TTCTCGATAC AGTGTGCCAG 1500

TAAAATATTG GATGCCGTTA TTAAGTCAAG GGGGCGTTGT CACACAAATC GGGATTCATG 1560

ACTCCGACCA TAAATTGGAT AAGTATGGCG ATAAAGAAGC CTTTAAAACC GATGCTGGTA 1620

GTAATGGCTG TATCAATACG CCAGGAACAG AAGTTTCAAA AATCTTTGAT GTATCCTATG 1680

ACGGAATGCC GGTAATTATT TATGGACATA TCTATGATGA TGCACCAGGT GAATTTGATA 1740






AACCTGTAGA TTACGGCGAA GAAGTATAAA AAGAGATATT AGGcAAGACC GTACTTTGGT 1800

CCAGCTGTGT ATCTGAAAAG TAAAAATTGG AGACGTCAGC GTAACGACTG ACGTCCCCAA 1860

TT7TTTATTG ACAAGCG7CA TAACCCGCTC TATGATACAA ACAGAACAAG CAATTTAAAT 1920

TGGGGGAACG AAGATGCCAT TTGTACATGT AGAATTAATT GAAGGCCGCA CAGAAGAACA 1980

GTTAACTAAT ATGGTCAAAG ATATTACAGA AGCTGTGTCA AAAAACGCTG GTGCACCAAA 2040

AGAGAATATC CaTGTGATTG TCAATSaATT GAAAAAgACC GCTATGCACA AGGCGGCGAA 2100

TGGAAAAAAT AAGTTTTCAG AGAATTTCTG TTGAATCACT TGCTTTTTCT GAAAATAAGT 2160

GGTTAAATAA GTGCATAAAT GATTGATGAG AAAGACAACT GAACTTGCCC GCGATCTATT 2220

AGAGAGGAAA ATCTTGGCTG AAAATTTTCC AGATTGGCAC AAGGGATGAC CACTTTCGAA 2280

ACCTTTGTTG AAACAAAGGC GGTCGCACCG TTAACACGCT AGAGGAACAC AGATTATTTG 2340

TG7TCAAAAT TAGGTGGAAC CACGATATTT TCGTCCTAGT ATTCGTTAGA ATACTAGGAC 2400

TTTTTGTTTT TTAACAAGGG GAACAGGTGG CTGTTTCGTT TCATTGTGAA AAACAGGAAC 2460

ACAATGAAAC GCTTCAAA7C AAAAAAATTA AAAGGAGAGA ATTTTATTAT GTCTATTCAC 2520

ATTACTTTTC CAGATGGCGC TGTTAAACCG TTTGATTCTG GAATTACAAC ATTTGATGTT 2580

GCTAAAAGTA TTAGCAACAG TTTAGCCAAA AAAGCTTTAG CTGGTAAATT CAATGGTGTT 2640

TTAATCGATT TAGATCGTCC TATCGTAGAA GATGGTTCGC TTGAAATCGT GACACCTGAT 2700

CATGAAGATG CTTTAGGAAT TTTACGTCAT TCATCAGCTC ATTTAATGGC TAATGCCTTA 2760

CGCCGTCTTT TCCCTAACAT TAAATTTGGC GTAGGTCCTG CGATTGATTC TGGGTTCTAT 2820

TA7GATACAG ATAATGGAGA ATCCCCTGTG ACAGCGGAAG ATTTACCTGC AATTGAAGCC 2880

GAAATGATGA AAATTGTGAA GGAAAATAAC CCAATCGTTC GTAAAGAAAT CTCACGTGCA 2940

GAAGCGTTAG AATTATTTGC TGATGATCCT TACAAAGTTG AATTAATTAC AGATTTGCCA 3000

GAAGATGAAA TCATCACTGT CTATGATCAA GGCGATTTTG TTGATTTATG TCGTGGTGTT 3060

CACGTCCCTT CAACAGGACG GATTCAAGTC TTTAAATTAC TTTCAGTAGC TGGTGCTTAT 3120

TGGCGCGGAA ACTCTGACAA TCATATGATG CAACGGATTT ATGGCACTGC CTTTTTTGAT 3180

AAAAAAGATT TAAAAGAGTT TATCAAAATG CGCGAGGAAG CCAAAGAACG TGACCACCGT 3240

AAATTAGGaA AgAATTAGAT TTATTTATGG TTTCACAAGA AGTTGGTTCA GGGTTACCTT 3300

TCTGGTTACC AAAAGGCGCA ACCATTCGTC GTACAATTGA ACGTTATATT GTGGACAAAG 3360

AAATTAGCTT AGGTTACCAA CATGTGTATA CACCAATTAT GGCAGATGTG GAATTATACA 3420

AAACATCTGG TCACTGGGAT CATTACCATG AAGATATGTT CCCACCAATG GATATGGGTG 3480

ATGGCGAAAT GCTGGTATTA CGTCCAATGA ACTGTCCACA CCATATGATG GTTTATAAAA 3540

ATGACATTCA TAGTTACCGC GAATTGCCAA TTCGAATCGC TGAATTAGGG ATGATGCACC 3600

GCTATGAAAA ATCTGGCGCA TTATCAGGGT TACAACGTGT TCGTGAAATG ACTTTAAACG 3660

ATGGCCATAC TTTTGTTCGT CCTGACCAAA TTAAAGACGA ATTTAAACGT ACTTTGGAGT 3720
3720 
TAATGGTGGC AGTCTATGCT GACTTTAAGA TTACGGATTA TCGTTTCCGC TTAAGCTATC 3780

GTGATCCAAA TAATACAGAC AAATATTTTG ATGATGATGC GATGTGGGAA AAAGCGCAAG 3840

CGATGTTAAA AGCTGCCATG GATGAATTAG AATTAGATTA CTTTGAAGCA GAAGGCGAAG 3900

CTGCCTTTTA CGGTCCGAAG TTAGATGTTC AAGTAAAAAC AGCTTTAGGA ATGGAAGAAA 3960

CATTATCAAC CATCCAATTA GACTTCTTAT TACCAGAACG TTTTGACTTA ACTTATGTTG 4020

GCGAAGATGG TGAAAATACA CATCGCCCAG TTGTTATCCA CCGTGGTATT GTCTCAACAA 4080

TGGAACGATT TGTGGCTTAC TTAACAGAAG TTTACAAAGG CGCTTTCCCT ACTTGGTTAG 4140

CACCAATTCA AGCAACTATT ATCCCAGTTT CTGTAGAAGC GCATTCTGAG TATGCTTATG 4200

AAATCAAAGA ACGTTTACAA GCACAAGGCT TACGTGTTGA AGTCGATGAT CGTAACGAAA 4260

AAATGGGCTA CAAAATTCGG GCCTCTCAAA CACAAAAAGT ACCTTATCAA TTAGTGGTCG 4320

GGGACAAAGA AATGGAAGAC GCAACGGTGA ACGTCCGTCG TTATGGAAGC AAAGAAACGT 4380

CTGTCGAAGA TTTATCAATT TTCATTGACA GCATGGCTGC TGAAGTTCAC AATTACAGCC 4440

GTTAAAAAAA GAGAnAAAGT CAAAAGGACT TCTGAATTCA GAAGTCCTTT TGACTTTTTT 4500

CAACAACAAA ATAATTAAAT ATTTCCTTAG AGCTGGGAAT AATCCTGATT TTTTGACTAT 4560

TTTTTGTTAC AATGAGAATG CGAAACTTTT TAGTGAAGGG ACTAGAGGTA TGAAGAAAAA 4620

CTCATTTATA GnTrAaATAA AAAAAaCAGA TGCCGCAGCA AAGAGTCAAC GAGGTAGAAA 4680

GCCACATATT CCTTTTCGAT TGAACTTATT ATTTTTTGTG ATATTCACTT TATTTGTTTC 4740

ATTAATTGTC CGTTTAGGGT ATTTACAGAT TGTAGAAGGA GAAGAGTTTA ACAAAAAAAT 4800

CACTGCCAAT TCTTCACTAC AAATCACGAC ACCATCCCCT CGAGGACAAA TTTATGATTC 4860

ACAAGGAAAA GTCTTGGTTT CTAACAAAGC CAATTTAGCA ATCACATACA CACGAGGAAA 4920

AAATATTGAA GGAAAAGATA TTTTACCGAT TGCGAATAAA GTTAACGAAC TTATTAACGT 4980

TCCAGTTGAT CCAAACTTAA CGGATCGTGA CAAAAAAGAT TATTGGTTAG CGAATCCTGA 5040

AAATTTAAAA GCAGCCCAAT CCCGTTTAAC CGATCAAGAT AAAGAAGATG AAAAAGGCAA 5100

CAAAATTACG GACGAAGGGA CATTATATGC CAAAGCAGTT GAAAAAGTCA CCCCTGAAGA 5160

AATTGCGTTT GATGACCGAA CCTTACAAGC AGTAACGATT TTTAAACGAA TGAATGCAGC 5220

ATCACAAATG AACACGGTCT TCATCAAGAA TGAAGGGGTA ACAGAAGGTG AAATTGCGAC 5280

AATTGGTGAA CACACAGCGG AAATATCTGG TGTTTCAACA GGGACTGATT GGACGCGTGA 5340

TTATTCTCAA AGTGGCGCAT TACGTAGCCT GTTGGGCACc GTTTCAACTG AAAAACAAGG 5400

ATTgCCAGCC GAAGAaGTCG ATGanTATTt GAAAAAaGGC tATGCaCGAA aTGACCGTGT 5460

CGGGACTAGC TATTTAGAAA AACAATATGA AGACGTTCTA CAAGGTAAAA AAGCAAAATC 5520

AGAAGTCGTT TTAGACAATA ACGGAAAGAT TGTTTCACAA ACCCCAATTT CCAAaGGGGA 5580

AAAAGGCTCC AACTTAAAAT TAACGATTGA TTCAAATTTC CAAAACAAAG TTGATGAAAT 5640

TTTACAACGA AACTATTCAC AAATAGTCAA AACCATTGGA CCATACTCAG AAAATGCATA 5700
5700 
TGTCGTTGCT ATGAACCCTC AAACAGGAGC TATTTTAGCG ATGTCCGGAT TTCATCATGA 5760

TTTAGCAACT GGAGAAGTCA CACCTAATCC TTTAGCACCT ATTTTGAATT CTGAAGTTCC 5820 

TGGATCAGTT GTGAAAGCAG GTACTTTAAC AGCTGGTTAT GAAACAGGCG TCATTAAAGG 5880 

CAATGACGTT TTAACAGATG AAGCTATTTT ACTAGCAGGT AGTAATCCAA AAGCTTCTTG 5940

GTGGAATGCT TCTGGCGGAA CAACCATGCA ATTAACTGCG GAgCAAGCGT TAGAATATTC 6000 

TTCGAATGCT TATATGATGA AATTAGTCTT TAAAATGATG GGTGTGAATT ATTATCCCAA 6060 

TATGATTTTC CCTTATGAAG TAGGAGATGA CACTGTCTTT AAAGAATTAA GaAAAGCTTT 6120 

TGCAGAATAT GGTATGGGAA CGAAAACTGG AATTGATATA CCAGGAGAAA CAACAGGAAT 6180 

TCAArACAAG GACTTTAAGG ATTCGTCATC tGCGCCaCAA GGCGGTAATC TTCTTGACCT 6240 

ATCTTTTGGA CAATATGAT 6259 
(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9767 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 



CGCTGACTCA ATTAGCGTAA TGGAGTTTGT TTTAGAACTT GAAGATGAAT TTGGAACAGA 60

GATTTCAGAT GAAGATGCAG AAAAAATTGA AACAGTTGGT GCCGCTGTAG ATTATATTGT 120

GAGTAATTCA TAAAAAATAA TAAGACAAAG TCGATTAACA TTCCTGTTAA TCGACTTTTT 180

TTGCGTACAT AATGGTTATT TTATTAAGCA ATAAGGCTAT TTTTACACAT ATTTTTGTAA 240

AAAAAATCAT ACATGTAAAA AAATAACTGC GAAAAAAGTT ATTTTATCTT GCTTTTTATG 300

GAATAATTCT ATAAAATTAT GAAAAAACAA TGAGCTGTAA ATTAACATTG ATAAAAATTA 360

GAAAGTGCTA TCATTAAATA TTAGTTAATT TATATAATGT TTTTGAAAGG ACGTAAGGGG 420

TGCGTATGGA TGCAAGGAAA TGAACAATTA CTAGAGGTTT CAGGATTAGA AACAGCATTT 480

AGAATCAAAG ATGATTATTA TAATGCCGTC GATGATGTTT CATTTGAATT GAAGAAAAAT 540

GAGATTCTGG CAATCGTCGG AGAATCTGGT TGTGGAAAAA GTACCTTAGC AACAACGATT 600

ATGGGCTTAC ATGATCCCAA TCATACCAAA ATTACTGGGG AAATTTTATA CAACAATTTA 660

AACTTAACTA CGTTTAATGA AACATTGTAC AACAAAGTCA GAGGAAACGA TATTGGGATG 720

ATTTTCCAAG ATCCGTTATC TGCACTGAAT CCGCTCATGC GAATTGAAGA CCAAATTAAA 780

GAAAGTCTCA CGTATCATAC CAAAATGACC GCACAACAAA AACAAGCACG GGCCATTGAA 840

CTGTTAGATC AAGTGGGGAT TCCCAACCCA GAGCGTGTCG GGAAACAGTA TCCTCaTGaA 900

TTGTCTGGcG GcATGCGCcA ACGGGTGaTT aTTGcCATTG CAATCGCGTG tAAACCACCG 960

aTTTTaATTG cmGrTGaACC AACAACGGCC TTAGACGTTA CCATTCAGGC ACAAATTCTC 1020
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GATTTGTTAA AAGATTTACm AGaAGAAAcA GGGAGcGGGA TTATTTTAAt TACCCATGAC 1080 

TTAGGTGTAG TTGCGGAAAT GGCTGATCGT GTAGCGGTTA TGTATGCGGG TCAGTTTGTA 1140 

GAAGTAGCCA CTGCTGAAGA ATTGTTTAGT CATCCGAAAC ATCCATATAC ACGTTCCTTA 1200 

CTTCAGTCCA TTCCCCAAGA GAATTCAGAT GAAGAAGAAC TTCATGTGAT TGAAGGGATT 1260 

GTTCCTTCGT TAACCAAATT ACCTAGAAAA GGTTGTCGGT TTGCCCCACG AATTCCTTGG 1320 

ATTGATGCAT CGGAACATGA AGAAAATCCA ACATTACATG AAGTTGCACC GAATCATTAT 1380 

GTTCGTTGTA CATGTTATAA ACATTTTCAT TTTAGAGACG GGGAAGGGGA AGCGTAATGA 1440

CAGAAATTAT TCAAATTAAA GATTTAAAAG TTCATTATCC AATTCGCAGC GGCTTCTTCA 1500 

ATCGAATCAC TGATCATGTT TTGGCGGTTG ATGGTGTGGA TTTCATGATT GAACAAGGTA 1560 

AAACGTATGG GTTAGTTGGT GAATCGGGTT CTGGAAAATC AACGACAGGA AAAGCAATTA 1620 

TTGGTTTAGA AAAAATTACA AATGGTGAAA TTATTTATCA AGGACAAGAT GTAACCAAGC 1680 

CGCGAAGTCG AAAAGCGATT GGATACAACA AAGATGTCCA AATGATTTTC CAAGATTCGA 1740

TGTCAAGTTT AAACCCTAAA AAACGAGTAC TAGATATTAT CGCCGAACCA ATTCGAAATT 1800 

TTGAACGGTT AAGTGACCAA GAAGAAAAGA AAAAAGTCAA AAGTTTGTTG GATATTGTGG 1860 

GGATGCCAGA AGATGCGTTG TACAAATATC CTCATGAATT TTCTGGTGGA CAACGCCAAC 1920 

GTTTAGGCGT CGCCCGAGCG GTGGCTACCA GCCCCAAATT AATTATTGCA GATGAACCTG 1980 

TTTCAGCACT GGACTTATCT GTGCAAGCAC AGGTATTGAA TTTTATGAAA AACATTCAAC 2040

AGGAATATGG GTTAAGTTAC TTATTTATTT CCCATGATTT GGGCGTAGTT AAACATATGT 2100 

GTGACAACAT TGCCATTATG TACAAAGGCC GTTTTGTTGA AATCGGTACA CGCCAAGATA 2160 

TTTACACTAA TCCACAACAC ATTTATACCA AACGCTTACT TTCAGCAATT CCTAAAATTG 2220 

ACGTAGCGAA CCGAGAAGCT CACAAAGAAG AACGAAGAAG AGTTGAGCAA GAGTATCGGG 2280 

AAAATCATAA AGACTATTAT GATGAAAATG GGCGGGTTTA TAACTTACAT GCGATTAGTC 2340

CAACCCACCA AGTAGCGTTA AAAAATGGAG GTGCTGAATA ATGTGGAAAA CAATTTTACG 2400

CCGTTTGTTA TTGATGATTC CGCAAGTGAT TATTTTAAGT GTCTTGATTT TCCTGTTGGC 2460

TAAAATGATG CCTGGTGATC CGTTTACAGG ATTGATTAAT CCGAACCAAG ACCCGAAAGT 2520 

GATTGAAGCC ATGCGCGAAg cTGCCGGGTT AAATGATCCT TGGTACGAAC AATATTTCCG 2580 

TTGGATTGGC AATGCCTTAC ATGGAGATTT TGGGCATAGT TTTATCTTTA AATTACCTGT 2640

GTCCACGCTC ATTGCTGGTC GGGTGGGGAA TACGATTGCA TTGGCCGCTG TGTCAGTTAT 2700

AATCACGTAT TTAATCGCCA TTCCTTTCGG TTTAATTGCT GGCCGTTACC AAAATTCATG 2760

GTTTGACAAA ATGGTGGTTA TCTACAACTT CTTCAGTTTT GCCGTGCCAT TATTCATCTT 2820

TGCCTTGATT ATGCTCTTTA TTTTTGGGTA TCGCTTAGAC TGGTTCCCAA CGAGTGGAAC 2880

CGTGACTGTT GGTTTGGCTG AGGGAACGTG GCCGTATTAT TTAGATAAAC TGAAGCATTT 2940 

AATTTTACCA GGGGTCACTC AAGCGTTGCT AGGGA'CAGCG GTCACTATTC AATATTTGCG 3000 
CAGTGAAGTG ATTGATGTTA AAAACATGGA CTTTGTTCGT ACTGCTCGTT CAAAAGGAGT 3060

GCCAACGAAT AAAATTTTTA ATCGACACAT TTTTAGAAAT GCAGCATTAC CAATTGCCTC 3120

TCAATTAGGT TATGAAATTA CTGCTTTGAT TGCCGGATCA GTCGTTATTG AAAAGATTTT 3180

TGCTTTCCCA GGAATTGGAA AATTATTTAT TGATAGTATC ATTCAACGTG ACTATTCCGT 3240

TATTACCGCT CTCGTGTTGA TTTTGGGATT GGCAACCCTT ATCGGAACCT TAATTTCTGA 3300

TATTGTAATG AGTATTGTTG ATCCAAGAAT TCGAATTCAG TAGAAAAAAC ACGAAGAAAG 3360

GGTGATAAAA ATGGAATTGA CAGAAGAAAA AAGAGAAGAA GTCTTACAAG AAAGTATTCC 3420

ACCAATGGGC TTTCGGATGA TTGCGCGAGA ATTCGTCAAA GAAAAAATGG CGATGTTTTC 3480

TTTAATCCTT TTGGTCATTA TCTTATTAGC CGTTTTTATC GGTTCGTTAG TGTTAGACCA 3540

AAGTGCCGTT ATGCATGTAA GTATTTTAGA TAAATATGCG GAACCAGGTA CCGTTACAAT 3600

GAATGGCACC AAGTTCATTT TAGGTGCCGA TGAAGGGGGT CGTGATGTTT TAGGACAGTT 3660

GATTATCGGG GCTAGGAACT CGATTTTAAT CGGTTTTGCC ATTACGATTA TTACGTCTAT 3720

TATCGGGGTC GGTTTAGGGA TTATTTCCGG TTATTATGGA GGCATGATTG ATAACATTTT 3780

GATGCGTATC GTGGATTTTA TTATGATTTT GCCAATTATG TTAATTATTA TTGTGTTCGT 3840

TTCCGTGATT CCTAAATATA GTATCTGGTC CTTTATTTGG ATTATGTGTG CGTTTTATTG 3900

GGTTGGAAAA GCGCGGTTGT TCCGCAGTAA AACGTTATCT GAAGGGCGCC GAGATTACGT 3960

TAGTGCCTCT AAGACAATGG GAACAAGCGA TTTTAAAATT ATGTTTCGAG AAATTATGCC 4020

GAATTTAAGT TCATTAATTA TTACGAACTT AACGATTAAT TTCGCGGCCA ATATCGGGAT 4080

CGAAACAACG TTAACTTTCT TAGGCTTCGG GTTACCACAA AGTGTCCCAA GTTTAGGAAC 4140

CTTAAtTGgc TACGCCAGCA GTGGGGATGT ATTAGTCAAT AAACAATGGG TTTGGTTACC 4200

TGCATCAATT CTAATTTTAG TCTTGATGTT AAGTATAAAC TATGTTGGAC AAGCATTTAA 4260

GCGCTCAGCA GATGCACGAC AACGATTAGG CTAATTAGCC AAAAAATGAG GGAGGAAAAG 4320

AGATGAACAA GAAACGGATT TTAGGTGCAA TCACGTTAGC TTCTGTGTTA GTATTCGGGT 4380

TAGCTGCATG TGGTGGCGGC AATAAAGGCG GGGGCAATAA AGCAACGGAA ACAGAAGACA 4440

TTTCAAAAAT GCCAATCGCT GTTAAAAATG ATAAAAAAGC AATTGATGGC GGTACATTAG 4500

ATGTCGCTGT AGTTATGGAT ACACAATTCC AAGGACTTTT CCAGCAAGAA TTTTATCAAG 4560

ACAACTATGA TGCACAATAC ATGCTTCCAA CGGTACAGCC ATTATTTAAC AATGATGCAG 4620

ACTTTAAGAT TGTCGATGGG GGTCcTGCGG ATCTGAAATT AGATGAAGAT GCCAATACAG 4680

CAACCATTAA ATTACGTGAC AATTTGAAAT GGTCTGACGG TAAAGATGTG ACAGCCGATG 4740

ACGTGATTTT CTCTTATGAA GTCATTGGTC ATAAAGACTA TACAGGGATT CGTTATGATG 4800

ATAACTTTAC GAATATTGTT GGCATGGAAG ACTACCATGA TGGTAAATCG CCAACCATTT 4860

CTGGCATAGA AAAAGTCAAT GATAAAGAAG TTAAAATCAC TTATAAAGAA GTTCACCCAG 4920

GAATGCAACA ATTAGGTGGC GGTGTTTGGG GCTCAGTTTT ACCAAAACAT GCCTTTGAAG 4980






GAATTGCTGT 


TAAAGACATG 


GAATCAAGCG 


ATGCAGTTCG 


n> TV Tt T\ TV T\ r*» o /~* 

TAAAAACCCT 


GTGACTATTG 


U 


GACCATACXA CAI.GAGTAAT 


ATTGTGACAG 


GTGAATCTGT 


m ^ TV TV fTl TV m TV 

TGAATACCTA 


CCAAATGAGC 




ATTACTACGG 


TGGTAAACCT 


AAATTAGATA 


AATTAGTGTT 


/"* TV TV TA ^1 V™* fT\ rrv m 

CAAATCTGTT 


CCT. TCTGCbA 


CI 


GCATTGTAGA 


AGCGATGAAA 


GCGAAACAAT 


ACGATATTGC 


tv m fn tv m /™* iv tv rn 

ATTATCAATG 


TV TV TV TV fTl TV 

CCAACAGATA 


5220 


CGTATCCAAC 


ATACAAAGAT 


ACTGAAGGGT 


ATCAAATCTT 


AGGACGTCCC 


T\ TV /** TV TV ^ rtl 

GAACAAGCCT 


5280 


APAPGTATAT 


TGGCTTTAAA 


ATGGGTACGT 


TTGACAAAGA 


AACAAATACA 


GTGAAATACA 


5340 


ATPPAAAAGP 


TAAAATGGCA 


GATAAAAGCT 


TACGTCAAGC 


CATGGGCTAT 


GCAATTGACA 


5400 


ATPATPPAPT 


CGGCCAAAAA 


TTCTACAACG 


GCTTACGAAC 


AGGGGCAACA 


ACGTTAATCC 


5460 


PAPPAPTPTT 


PAAGAGPTTG 


CATGATAGCG AAgcGaAAGG 


CTATACGCTT 


GATTTAGACA


5520 


AAPPPAAAAA 


ATTATTAGAC 


p a t pp TP PT T 


ATZ\ AAPAPPT 


AGACGGCGAT 


GGCATTCGCG 


5580 


A APAPAAAPA 


AGGPAAAPPA 


PT7APAZ\ AT PA 


AGTTTPPTTP 
rto ill i i 


AATGTCAGGC 


GGCGAAACTG 


5640 


PAPAAPPAPT 


TGPTGATTAC 


TATPTPPA AP 


AATPPAAAPA 


AATTGGCTTA 


AACGTAACGT 


5700 


ATAPAACAGG 


ACGCTTAATT 


PATTTPPAAP 


PATTPTATGA 


TAAATTGAAA 


AATGATGACC 


5760 


PAGAAGTAGA 


TATCTATCAA 

X *i X X ** X V-*A*J* 




PPAPAPPTTP 


AGATCCTTCA 


CCAACCGGCT 


5820 


TATATGGTCC 


AAACTCAGCC 


TTT A APT AT A 

J. 1 J. / i-T^V^ J. n a n 


PAPGTTTTGA 


GTCAGAAGAA 


AATACTAAAT 


5880 


TACTTGATGC 

X *V*w X X VJi* X VJ V^ 


GATTGATTCA 


AAAGCATCAT 


TTGATGAAGA 


AAAACGTAAA 


AAAGCCTTCT 


5940 


APGATTGGPA 


AGAGTATGCC 


ATTGATGAAG 


CGTTTGTAAT 


CCCAACGCTT 


TACAGAAATG 


6000 


AAGTCTTGCC 


TGTCAACoAC 


CGTGTAGTTG 

V> X X **VJ X A Vv 


ACTTTACTTG 


GGCAGTTGAT 


ACGAAAGATA 




ATPPATGGGP 


AACGGTGGGT 


GTCACAGCAG 


ACTCACGGAA 


ATAAAAAAAC 


AGATTCTAGA 


6120 


AGACATAAAG 


CTCTTCTAGA 


ATCTGTTTTT 


TTGGATAGGA 


TAGTTTCTTA 


GGCCATCTTA 


6180 


TTTTATflTfiA 

J. A A A ri 1 (J 1 VJ^i 


AAATTTTTGT 

liJii*. X X X X A A 


TACTTCAAGC 


AATTACAATT 


TATATGTATG 


CGTTATTTTT 


6240 


TTGTTATAGA 

x x vj x x -** x n\jn 


TGAATTGTTT 


TTTATGCAGT 


ATGGTTTTTG 


ATGGGAGGTT 


CTGTTTTTTT 


6300 


ATTTTATTTA 


GTGTGTATAT 


CTACATGTTT 


GTTATGAAAT 


GTTATTTTTT 


TTATTATATA


6360 


TATAATACTC 


CCTTTACGTA 


TAAAGAACGA 


TAAGTTACAA 


TTAGATTAAG 


TTTTAGTTAG 


6420


TTTTTGAAGG 


TGTGTTTGGT 


ATTGTTTATA 


ACTAAAACTT 


ATATTTAAAG 


TV TV TV rf" TV fTl ^ 7\ 

AAAGGAGTGA 


C A Q Ci 
K>H O V 


ATGGATATGC 


CACTTTATAT 


GGAGAATTAT 


TTGCTTGCAT 


fr> /■* /•*• tv tv m tv tv t\ 

TGCAATCAAA 


AT AT a CALAA 




GAAATAACTG 


TAAAATGGGT 


TTGGGCAGTT 


GTAATATTAC 


TTTCAGTGAT 


TGGCTGTATT 


DDUU 


GGTTATGCAG 


TTTACTGTAG 


TTGGGTAGGT 


GGAACATTTG 


CTGGGAGTAT 


TAAAAT 1 GCa X 


DO DU 


GTTCCCGACT 


TGGTACACGT 


TACCTTCAAT 


TGTAAACGTT 


GATATTTAAG 


GAGGAAGGAG


6720 


CCTCTATTTT 


ATGGAGGCTT 


CTTGTTAAGA 


AACTTATGAT 


AAGAATTGAA 


AATTTAACAA 


6780 


AAAAATTTGA 


TAGGAAAGTT


TTAGATAATA 


TAACTATCTC 


ATTGCCTAAA 


AACCGAGTTA 


6840 


GTGTGAnTGT 


AGGTATAAAT 


GGAAGTGGGA 


AAACTACTTT 


ATTAGATTGT 


AATGTAGGAT 


6900 


TGAAAGATGC 


AACGTCTGGA 


AAGGTATTTA 


TTGAGAGCTA 


TTCTAATGAT 


TCTGAAAAGT 


6960 
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TTAAAGAA tg 


TATATTCTAT 


ATACCATGGG 


AAi I I i AJ. G 1 


7APPT71 ATT AT 


ATflAPCnfSTA 


7020 


AAGAG I AC 1 T 


T\ A TVprtimrnomm 

AAAL 1 1 TG 1 I 


1 1 AI GAAGAI 


A IIGGG 1 .G 1 HG. - 


TP 7A T A T Af^ A A 


pr;^ATAr;ATn 


7080 


ATTTTTTAGA 


ACTATTTGAT 


TTAAAATTTG 


CTGGAACTAA 


TTTAATAGAA 


TCGTACTCAT 


7140 


TTGGTATGAA 


GAAAAAAATA 


CAAATAGTtG 


CAGCAGCACT 


AGCTAACACT 


GATTATATAT 


7200 


TAGGTGATGA 


AATATTTAAT 


GGACTTGACT 


TTGAGACAAC 


ATTGCTTACG 


TTAGAGCTGT 




TTGAAAATTT 


ATCTAGAGAA 


GTAGGGATAG 


TGATTATTTC 


TCATAATAAA 


TTAATTATTG 


n tod 


AAAGGTTTTC 


AGAGAATATT 


TTATTGATGT CCAATGGCAA TTTAACTCCA TTTTTAGGGG 


Toon 


CGTCAGAGAA 


TTTAGAAAAA 


ftAAfiTGATAA 


GTACGGAGAA 


AATTCATGAA 


AAAATTAAAT 


1 A A C\ 


ACATCAAAGG 


ATATCATCCT 


ATTAATCGAG 


TTATTTATTG 

X X XXX X X w 


ATAATGCTTT 


ACGAGGTTTG 


/ jUv 


CTCAAGCACT 


CTATTTTGTC 


AAAAAAGAGT 


ACTCGCATCG 


TAATAGGTGT 


GTGTGTATTT 


1 S f>0 
/ JOU 


TTTATCTATT 


TTGCATACTT 


TTTTTTT1 RT 
1 1 1 X 1 1 X rvri X 


ATGAGTGAAC 

4* X W"VJ X VJ*xilV 


TAGCTAGAAT 


TGTACCAGAT 


1 £9fl 
/ DZU 


TCAGAAAAAA 


TTAGTCATAT


TTTGATAGAA 


CAAGGTCGAA 


AAATAACTTT 


TTATAGTTAT 


/ DO VJ 


TTTAGTAATA 


CATTTGTTTT 


AGGTATAATA 


GCATATATTT 


TAGTTGaTAA 


TACTGTTGCa 


11 AD 


TTAGATAAAA 


ACTCCTTATT 


TTTTf^THA A A 


ACATTGCCtT 


TTAGAAAGAA 


AGACATCAGC 


/ 0 UU 


TTATCTTTTA 


TGCTATTCAA 


ATTTHT A ATT 

r\ i X X U X xVr\ x X 


ATGATATTGC 


TTTATGAATT 


AGTTATGATC 


/ 0 OVJ 


ATTTCTACGC 


CAGCTATTAA 


ACTTGTGACG 


ACTGTTCCTA 


TTGAATATGT 


CATTTTTTTT 


Toon 


ATTGTTCAGC 


ATCTTTTTTA 


TCTAGTGGTA 


ATTGGGGTGA 


TTGAGTTTAT 


ACATTGTTTG 




TTTACATTCT 


TTTTGAAACG 


AAGAGTGAAA 


CAAATTAATA 


GTCTCAAAGT 


GATAGGTGAT 




AGTTTATTAA 


TGATTTTTGC 


TACTTTTTAT 


TTTTTTGATT 


TTCGTTATGG 


TTTAGAATTA 




TTTCTAGCAA 


ACCAATTGTG 


GAGTATCTCC 


TATATGATTC 


CTATCACTTT 


TTTTCTAATA 


ft! fin 


GTGGTATGTT 


TAATGCTTAT 


TAGTATAGGC 


CTATTAAGAC 


TAATTAGTAT 


TCTTGAAAAT 


ft 990 


CAAATGTCTC 


AACATTCAAA 


ATATATATAC


ATTCCATTCT 


TAAATAAAGT 


tATATTTCGT 


ft 9fl fi 


TATAAAGCGA 


ATTGGTATTT 


TATTAGTTTT 


ACTACAGTTG 


TTATGTTGGT 


TATTTTTTTC 




CAAAGTGGTT 


TAAGAACAAT 


GTTATTCATT 


TTGACAACTG 


TTATGGCCTT 


TTCAGGCGTT 


ft 4 on 


TTATTATTAA 


GCTATGGAGA 


TATTACTGCT 


GATTTCAGAA 


AACAATATGA 


TTTATTGAGA 


8460


ATAAAAATTA 


GAAATGAATG 


GTTGAGTCmA 


CTATTGCkAG 


TGATAATGTT 


GGCAATGCCA 


8520 


TtGtTACTAC 


TTgtTTTTGG 


GGATTtGGGG 


AGTTAGCTCA 


ACTAATTACA 


GCGTTGTCGT 


8580 


TATCATTAGT


AGCTATAATT 


TTGGGATATG 


TATTTCCAAA 


ATCACAAGGA 


AGTTTAAATG 


8640


AAACGACtTC 


ATTATTGTTA 


TTATTTATTG 


TATTTGTTCT 


AGTTAGTTTA 


TTGACAAATC 


8700 


GCTCTTTTGG 


TTGGTTGATT 


CTAATTGTTT 


TAGTAnTCTT 


GCATTTGTTA 


GTAATTAAGA 


8760 


AGGTGAGAAA 


TGAAAAAGAA


TAGTAAATAT 


TTAGGAAGTA 


TAACAGTTAT 


CTATTTGTTA 


8820 


GTTCTTGTGA 


TCGAAACATT 


ATGGGAGCTA 


TTCCATACCA 


GTCGCACAGA 


ATCAACGAAA 


8880 


GTCACTACCT 


TATTAGGAAT 


TACGATTGAT 


AATCGTATAA 


GTAAACATGA 


AATTTCAACT 


8940 
ACTTTTGGTT TGACAATTAA GGTATTAGTA CTCTATTTGT TGTTATTGTT AGTTGTCTAT 9000

ATTTTGACTG GGTTATTTGG AAAAAAGAAA ATATGATTAT TAAGTTATCA ATATTGTATG 9060

TGTTGTCAAT ATATGTCGTG TTACTTGTAC ATGAGTGGTT ACATTTTATT TTAGCCmAAT 9120 

TCTTCAAATA CAATGcTTAT ATAAAGAGAG TTGGTCTATA CCCATTTAAA GTGGTGTATA 9180 

CGAACAGAAA TAATCCtCTA GATAATTTAT TGATTTCAGC GATATCTCCG CTTTTCTTGG 9240

TTATAGTAGG AATAATTTTG CCATTAAATT ATTATACTGT GATTTTAAAA GTATCTTGTA 9300 

TTAGTAATAT TTTTAATCTC TTACCGTTTA CTGCAGATGG AGAAATAATT TTGCTTTCAA 9360 

TTTTTCAAAT TTTTAGGAGG AGAAATAAAT GAAAAAAAGA AAAGTGTTCA TTATTTTTCT 9420 

TGTTAGTTTG ATTATATTTG TTTTTGGTTA TTACTTAGCG TTTAATCAAA ATAAGTATGT 9480 

ATCATATAAT GACGAGACAA AACAGTTTGT ACATACTGAT ACTTCAAAAA TAATGGAAAA 9540

TCTGGTGAAC AGACAACAAG GAATTTATTA TTTTGGCTTT CCGACTTGTC CTTGGTGTCT 9600 

AGAATTATTA CCAATACTAG ATAAAGAATT AGAGAAGGAA AGTATGAATG CTTATGCTGT 9660 

AAATACTCGT GGTGATGACT ATACaGAAAA TGATGAkGAA TTATTACaAA AATTTkATCa 9720 

AAAATATACT GGGGATGAGT CGCTTTCTGT ACCTTTTATT GTTGCTA 9767
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7900 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



AAAATCGTTT TGGATTTTTT CTCGGnTnCT TTCTATTTAA GTTATTTTAG TTGGATATCG 60

TTATTTAATA AGGCTATTAT ATGGTATAAT GATAATTAAC TGAAGTGTGG AGGAAATAAA 120

ATGTTATCAA TTGTTGTCCC TTGTTATAAT GAGGAAGCAG CGATTCCGCg TTTTTTgAAG 180

AAGTAGAAAA AATTAGCCAG AAACTGTCTC ATTCAGTAGA ATATATATTT GTGAATGATG 240

GCTCAAAAGA CAATACGTTA GCTGTTTTAA GACAGCTTTA TCGCGAACAT CCAGAGAAGG 300

TCCGTTATCT TTCTTTTTCA CGAAATTTTG GGAAAGAAGC GGGCTTATAC GCTGGTTTAA 360

AAGAAGCAAC AGGTGACTTA GTGACAGTCA TGGACGTTGA TTTACAAGAC CCGCCTGAGT 420

TATTACCTCA AATGATTGAG ATGATTGAAA CCAGTACAGA TTTAGATTGT GTTGGCACAC 480

GACGCATAAC AAGAGACGGC GAACCGCCCA TTCGTAGCTT TTTTGCACGG ATGTTCTATA 540

AACTCATTAA TCGAATTGCT GAGACGGAAA TGGTGGATGG TGCCCGTGAC TTTCGTGTCA 600

TGACCCGGCA AATGGTCGAT GCGATTTTAG AGTTATCTGA GTATAATCGT TTCTCAAAAG 660

GGATTTTTAG TTGGGTCGGC TTTAAAACAG AATATATTGA ATTTAAAAAT CGAGAACGAA 720

TTGCTGGAGA AACCTCTTGG TCTTTCTGGA GTCTATTAAG TTATTCAATT GACGGGATTG 780

TCAATTTTTC AGAAACACCA TTGAATATTG CTTCATACGT TGGCGCCTTT TCGTGTATTG 840

GTTCTGCACT AGCAATGTTA GTGATTATGT TTCGAACATT AGTTAATGGT GATCCAACGA 900

GCGGCTGGCC TTCAATGGTT TGTATTGTAC TTTTTGTAGG TGGTTTGCAA CTATTATGTT 960 

TAGGAATTAT TGGCAAATAT ATCGGTAAAA TTTTCTTAGA AACGAAAAAA CGACCAATTT 1020 

ATATTGTAAA GGAATCAGAA AAAGATTCTA AAAAATAACC AAAAAAGCAG AGAAAAATTC 1080 

TCTGTTTTTT TGTTGTGCAT TAAAAAAAGT GTGAATAGAC TTTTCAGAAA AACAACATTT 1140

TTTACATAAA AAATACATTA ACTTTACCTA TTTTTATCTT AATGAAACAG CTTTGCTCTC 1200 

GCGGTTGCAA GAAATAAATG TTAAAATCAA AAGGTATGCT ATTAAATGTT TTTTTGGAGG 1260 

TTGGGTcATG ATAGAAGTTA TCGACTTGAA AAAAGTATTC GATAATGGGT TTGAAGCGTT 1320 

AAAATCGGTC AATTTTACGA TTGAGCAAGG GGACTTGGTT TGTTTGTTAG GACCAAGTGG 1380 

GTGTGGGAAA TCCACAATTT TAAACTTAAT TGCAGGTTTA TTGCATCCAA GTGATGGAGA 1440

TATTCAGTTT CGACAGCAAT CGGTTGTCAA AACAGCACCA AAAGATCGGA ACATCGGGTT 1500 

CGTTTTTCAA AACTATGCCT TATATCCACA TATGACTGTT CTAGAAAACG TAATGTTTCC 1560 

TTTGACGGTA GGAAGTAAGA AAGTTCCCAA GGCAGAGGCA CAAGCTATTG CCGAAGAATA 1620 

TATGAAACTA ACGAATATTG AAGAGTTAAG CCATAAAAAG CCAGGCACAC TTTCTGGTGG 1680 

TCAACAACAG CGTGTTGCGA TTACCCGGGC GTTAGTTCAA AAACCAGATG TTTTATTATT 1740

AGATGAGCCT TTAAGTAACT TGGATGCACG TTTGCGTTTG AAAATTCGTG AAGAAATCCG 1800

CCGCTTGGTG AAAGAAGTAG GGATTACAAC AATCTTTGTA ACGCACGACC AAGAGGAAGC 1860

GCTATCAATC AGTGATAAAA TTATTTTATT AAACGAAGGG GTTATTCAAC AAAATGATGA 1920 

ACCTCAAAAC CTTTACTTAG AACCAAATAA CTTATTTGTT GCTCAGTTCA TAGGTAACCC 1980 

AATTATTAAT TTATTGTCTG TTGAAGTGAA AGACGGCAAA ATGTACCACA AAAGTTTTGA 2040 

AATTCCGCTT GAGCGTTTTG AGCAAGCACG. CTTTAAAATG CCAATGACTG ATGGGAAATA 2100 

TACGTTTGCT TCTCGTCCgG AAGATGTGGT ACCAGCTGAA ACAGGTCTCT TTACCACAAC 2160 

GACAGATTTA GTGGAATTGA TTGGTCGAGA ACGTATTTTA CGATTTACAT TAGGAAATGA 2220 

ACAAGTGAAA TCAATTGTAA GTGTAGAAGA AGCGATTGAA GAAGGAGACA CTTTATCTTT 2280 

TGATTTTTCA TATAAAAAAG TATTTATCTT TAACGAAGCG GGAGACCGGG TTTACTAATG 2340 

AAAAAATACA ATCCAGAAAA TCAACCAAAA GCATGGCTCT TCCTTCTGCC ATCATTAGGA 2400

ATCATTTTAC TGTTTAGCGT TTATCCACTG TTTCGTTCTT TGTGGATGAG TTTCCAAAAA 2460

GGTTCCTTAA TTAACCAACG cTACGCTGGT TTAGAAAATT ATCAACGTGT ATTGAATGAT 2520 

CCAATCTTCT ATAAAGCGTT AAAAAATACA GCGCTCTATG CATTTGCTGT GGTACCGATT 2580 

GCCTTGATAA TTTCCCTAGC AATTGCTTGG ATTATCTTTG AAAAAGTTAA ACATAAGAGT 2640

TTCTTTGAAA CGATTTTCTT TATGCCTTAT GTAACGAGTA CGATTGCCAT TGGGATTGTC 2700

TTCCGTTACT TCTTTAATGG CGATTATGGA ATTGTCAATT ACGTCTTAGG CTTTTTCGGC 2760

ATTCCTTCTG TCAACTGGCT AGATAATGTT CAAATGAGTA TGCCAACATT GATTATTTTT 2820 
GGGGTTTGGA CGAGTTTAGC ATTTAATATT ATTATTTTGT TGGCTGGGTT GAGAAATATT 2880

GATGAAGAAC ATTTTAAAAT TGCAAAAATG TTTGGCGCCT CAGACGGCGA AATTTTCCGA 2940

CGTATTACGT TTCCGCAGCT AGTCCCGACC ATTGCCTTTT TATTAACGGT CAACTTAATT 3000

GGCGCCTTTA AAGTTTATAC ACAAGTTTAT GCTTTATTCG GTGGCCGTGC GGGGATTGCC 3060

AATAGTGCCA CAACCGCAGT GTACTATATT TATGACAAAT TCCACATTGC AGGACGTCCC 3120

GGAATTGCGA TGGCCGCCAC AGTGATTTTA TTTGTGATTA TTTTAGTGGT CACTTTCTTA 3180

CAAAATAAAC TCTTAAAGAA AGTGGGGCAA TAAGCCAATG AAAAAAGTTT TAACGATTAT 3240

CGCATTTGTT TTCTTAGGCA TTTTAGCCGT TATTACGTTA TTTCCATTTG TCTATATGAT 3300

TTTAGCTGGT TTAATGAGTT ACTCTGAAGC GACCAGCATG CCGCCAACCA TGTTTCCGAA 3360

ACAACCACAA TGGCAAAACT ACACAGAGGT CTTTCAAAAA GCACCCTTTC TTCGGTATTT 3420

CTTGAACACA GTCTTTGTTT CAGGCGTTAC AACGATTGCT ACGGTAGTCA CCGCCGTGTT 3480

AGCTTCATTC GCATTAACGA GTTTGAAGTT CCGTTTTAAA AATGTGGTCA TTGCTTTGAT 3540

GATTTCGTTG TTAATGGTTC CGTATGAATC TATTATTTTT ACCAATTATC AAACGATTGC 3600

GCAATTAGGT TTGTTAAATA CTTACAGCGC CTTAATTATT CCATTTTTAA CTAGTATTTT 3660

TTATATTTAC TATTTAAACG GTTATTTAAA AGGTATTCCT GATACTTTTT ACAAGGCTGC 3720

CAAAATTGAT GGCGCTAGTG ATTTAGAATA TATTTGGCGT ATTTTAGTAC CAATGTCAAA 3780

ACCAGCGTTG GTAACAGTAG GGATTTTAAC ATTTATTTCT AGTTGGAATT CTTTCTTGTG 3840

GCCATTGTTG GTGACGAACG AGAAAAAATA CCGTCTATTG AATAATGGAC TATCGGCCTT 3900

TGCGACAGAG AGCGGTAGTG ACGTACATTT ACAAATGGCT GCTGCAACAT TAACCGTTAT 3960

TCCAATTTTA ATTATTTACT TGATTTTCAG AAAAGAAATT ATCAGAGGAG TTGCAAAAAA 4020

TGGAATCAAA GGCTAAAACA ACTGTGACTT TTCACAGTGG CATTTTAACA ATTGGAGGAA 4080

CCGTGATTGA AGTGGCGTAT AAAGATGCCC ACATCTTCTT TGATTTTGGT ACAGAATTTC 4140

GACCAGAATT AGATTTGCCT GATGATCACA TTGAAACCTT AATTAATAAT CGTTTAGTGC 4200

CAGAATTAAA AGACTTATAT GATCCACGTT TAGGTTATGA ATATCATGGT GCAGAAGACA 4260

AAGACTATCA ACACACCGCT GTTTTCTTAT CTCATGCCCA TTTAGATCAT TCACGCATGA 4320

TTAATTATTT AGACCCAGCT GTTCCGTTGT ACACCCTAAA AGAAACGAAA ATGATTCTAA 4380

ATAGTTTAAA TCGAAAAGGT GATTTTTTGA TTCCATCAGC GTTTGAAGAA AAGAATTTTA 4440

CCCGAGAAAT GATTGGTCTT AATAAAAACG ATGTGATTAA AGTTGGCGAA ATTTCAGTGG 4500

AAATCGTGCC TGTTGATCAT GATGCGTACG GTGCTTCGGC ATTACTGATT CGCACGCCTG 4560

ACCATTTCAT CACATACACA GGTGATTTGC GCTTACATGG CCATAATCGT GAAGAGACTT 4620

TAGCTTTTTG TGAAAAAGCC AAACATACTG AATTATTAAT GATGGAAGGC GTAAGCATTA 4680

GCTTCCCAGA ACGTGAACCA GATCCAGCCC AAATAGCGGT TGTCAGTGAA GAAGATCTTG 4740

TTCAGCACTT GGTTCGTTTA GAACTAGAAA ACCCGAATCG ACAAATTACC TTTAATGGTT 4800

ATCCAGCAAA CGTGGaACGT TTTGCTAAGA TTATTGAAAA GTCACCACGT ACAGTCGTTT 4860

TAGAAGCAAA TATGGcTGCG TTGTTACTTG AAGTATTTGG AATAGAAGTT CGTTATTATT 4920

ATGCTGAATC TGGTAAAATA CCAGAATTGA ATCCAGCGTT AGAAATCCCg TATGACACGT 4980

TACTAAAGGA CAAAACAGAC TATTTGTGGc AAGTTGTGAA CCAGTTTGAC AACCTCCAAG 5040

AAGGTAGTTT ATACATTCAC AGTGATGCAC AACCGTTAGG GGACTTTGAT CCACAGTATC 5100

GCGTGTTTTT AGATTTGTTG GCTAAAAAAG ACATTACTTT TGTCCGCTTA GCTTGTTCAG 5160

GACATGCAAT TCCAGAAGAT CTGGATAAAA TTATTGCATT GATTGAACCT CAAGTATTGG 5220

TTCCAATCCA TACGTTAAAA CCAGAAAAAC TGGAAAACCC GTATGGTGAA AGAATATTGC 5280

CAGAACGTGG CGAGCAAATT GTTTTATAAA TTTTTTTAAG GGAGAGAAAA AAATGAAGTT 5340

CAAAACTCTA GCAACAACAG TGTTAGCAAC CGCAGCTATT TTCGCATTGG GGGCTTGTGG 5400

TAACGGTAAT GGGGCCAAAG AATCAAACGA TATTGTGAAA GAAGTGAAGG AAGATACGAC 5460

AATCACTTTC TGGCATGCAA TGAATGGGGT TCAAGAAGAA GCGTTAACAA AATTAACGAA 5520

AGACTTCATG AAAGAAAATC CAAAAATTAA AGTGGAATTA CAAAATCAAT CTGCTTACCC 5580

TGATTTACAA GCCAAAATCA ATTCGACTTT AACTTCACCA AAAGATTTAC CAACAATTAC 5640

GCAAGCGTAC CCAGGCTGGT TATGGAATGC TGCACAAGAT GAAATGTTAG TGGACTTAAA 5700

ACCATATATG GATGATGACA CAATCGGCTG GAAAGATGCA GAGCCAATTC GTGAAGTATT 5760

GTTAGACGGC GCCAAAATCG ACGGCAAACA ATACGGCATT CCATTTAATA AATCGACAGA 5820

AATGTTATTC TATAATGCTG ATTTGTTGAA AGAATATGGT GTTGAAGTAC CGAAAACATT 5880

AGAGGAATTA AAAGAAGCTT CTAAAACAAT TTACGAAAAA TCCAACAAAG AAGTCGTTGG 5940

TGCTGGTTTT GACTCGTTAA ATAACTATTA CGCAATTGGA ATGAAAAACA AAGGCGTTGA 6000

TTTTAATAAA GACTTAGATT TAACAAGCAA AGATTCACAA GAAGTCGTGG ACTATTACCG 6060

TGATGGTATC GAAGCAGGTT ACTTCCGCAC AGCTGGTTCA GATAAATATT TATCTGGCCC 6120

ATTTGCAAAC AAAAAGGTAG CAATGTTTGT CGGTAGTATT GCTGGTGCTG GTTTTGTTCA 6180

AAAAGATGCT GAAGCTGGTG GCTATGAATA CGGTGTTGCA CCACGTCCTG AAAAAATCAA 6240

CTTACAACAA GGAACAGATA TTTATATGTT CGATAGTGcT ACGCCAGAAC AACGGACAGC 6300

GGCATTTGAA TTCATGAAAT TCTTAGCTAC TCCTGATTCA CAATTGTACT GGGCACAACA 6360

AACAGGTTAT ATGcCAATTT TAGAATCTGT TTTACACAGT GATGAGTACA AAAATTCTAA 6420

GACAACCAAA GTACCTGCAC AACTTGAAAA CGCAGTAAAA GATTTATTCG CTATCCCAGT 6480

AGAAGAAAAT GCTGATTCAG CCTATAATGA AATGCGGACA ATTATGGAAA GTATTTTTGC 6540

TTCATCAAAT AAAGACACGA GAAAATTATT GAAAGATGCA ACATCACAAT TTGAACAAGC 6600

ATGGAACCAA TAATCAATAA ATAAACAGAC GTGGGAGGTT GGGATCTGAG TCGTTATGGC 6660

TCGTGTTCTA ACCTTCCTGT TCTTTTGGGC GGAAGAAGGA GAGAAACAAA ATGCAAATAA 6720

AAATTTTAGC AACAAGTGAT ATGCATGGGT ACATTATGCC AACAAGTTAT AGTGAAAAAA 6780

AGATGGATTT ACCTTTTGGA ACCGCAAAAG CAGCAACCAT GCTGAAAAAG TTACGGGCGT 6840

CTGCCAAAGG ACCAGTTTTT CAAATTGAAA ATGGGGATTT TATTCAAGGT TCGCCGCTTA 6900

GTTACTACGT AAGAAAAGCA GAAACACATT CAGTGGCTGC CATCACTAAA ATCATTAATC 6960

AAATGAATTA TGATGTTAGC ATTTTAGGTA ACCATGAATT TAATTACGGT TTAGATTATT 7020

TAAAGGAAAC GATTGCCAGT TATCAACAGC CAGTTTTAGC TGCCAATATT CTTGGTAAAG 7080

ATGGACAACC TTATTTCGGT CAGCCGTATG TCATCATAGA AAAACAAGGG GTTAAAGTAG 7140

CCATCCTTGG CGTGACGACT CAATATATTC CTCATTGGGA ACAGCCGGCA ACAGTCAAAG 7200

ATTTAACGTT TAAAAGTGTC GTTGAAACAG CGGCGGAATA CGTGCCAAAA CTGCGGGAAG 7260

AAGCGGATCT AGTTGTTGTC GCTTATCATG GCGGTTTTGA AAAAGATTTA GAAACAGGGG 7320

AACCCACGGA ATTACTAACA GGAGAAAATG AAGGCTACGA TTTATTGGAA AAAGTTCCTG 7380

GGATTGATGC CTTAGTCACT GGTCACCAAC ATCGAGAAAT TGCTACAAAA CTGAATGGAA 7440

TACCTGTGAT TCAACCAGGT TTTCGAGGAG CTTTCGTTGG AGAAATTACT TTAGAAATTG 7500

AACCTATGGC TAAAGGCTAT CACGTTATTG GAAGTGACGC AGCTATTCAT CCTGTTGGAA 7560

ACGAACAACC AGATACAGAA GTTTTAGCTT TGACAACAGC GTTACATGAC GAGGTAGAAG 7620

AGTGGcTTGA TCAACCAGTG GGAAATGTGG AAGGGGATAT GACGATTCAA AATCCGAATG 7680

CCGTACGCCT CAAAGAACAT CCATATATAG aATTTATTAA TAACGTCCAA ATGGCTTCAA 7740

GCGGAACGGA TATTTCAGGG ACTGCACTTT TTAACAATGA AGGCAAAGGG TTTAATAACC 7800

AAATTACGAT GCGCGATATC ATTACCAACT ACATTTATCC AAATACCTTG GCTGTATTGC 7860

GTGTCACTGG ACAAGACCTC CGTGAAGCGT TAGAACAAAG 7900



(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5480 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
TCCTTACATT TCATGTTATA AAATGTAACA TAGAAATTAA GAACAAAAGA TTAAGATCAG 60 

TTGAAGTCCG AAACAGCAAA AGTGGCGAAA ATATGTTCGG TATGTTAAAC TAATAAGTGG 120 

ACATTCTGCC AGAATGGGCC GATAATTTAA GAGAAATTAG ACACTTTGAG GGGGAGTTAT 180 

AAATGATCGA AAGAGAAACT TCCAATACCT TTCATTTGGC TTCTAAATAT GAGCCTGCCG 240 

GAGATCAGCC AGCCGCTATT GCTGAACTAG TTGACGGTGT CAAAGGTGGA GAAAAAGCTC 300 

AAATTTTACT TGGAGCGACC GGAACAGGTA AAACATTTAC AATCTCTAAT GTGATTCAAG 360

AAGTGAACAA ACCAACATTA GTGATTGCAC ATAATAAAAC ACTAGCTGGA CAATTATACG 420

GGGAATTTAA AGAATTTTTC CCAGATAATG CGGTCGAGTA TTTTGTCAGT TACTATGATT 480
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ATTATCAACC AGAAGCATAT 


GTACCCTCAA 


GTGATACATA 


TATTGAAAAA 


GATTCCACTA 


540 


TTAATGATGA 


AATTGATAAA 


CTACGCCATT 


CAGCGACAAG 


TTCTTTATTG 


GAACGA&ACG 


600


ATGTCATCGT 


GGTAGCATCA 


GTTTCTTGTA 


TTTTTGGTTT 


AGGGGATCCG 


CGGGAATACA 


660 


GTCAACAAGT 


TGTTTCATTA 


CGTGTGGGAA 


TGGAAATGGA 


TCGTAATGAG 


TTACTAAAAA 


720 


GCTTAGTCGA 


TATTCAATTC 


GAACGAAATG 


ATATTGATTT 


TCAACGGGGG 


CGTTTTCGTG 


780 




TGTGGTTGAG 


ATATTCCCAG 


CTTCACGGGA 


TGAGCATGCG 


CTACGTGTTG 


840 


AATTTTTCGG 


AGACGAAATT 


GATCGAATTC 


GTGAAGTAGA 


TGCGTTAACT 


GGTGAAATTG 


900 


TrGGCGAAAC 


AGAGCATGTC 


GCTATTTTTC 


CGGCAACACA 


CTTCGTGACG 


AACGAAGAAC 


960


ATATGGAACA 


CGCCATATCG 


CAGATCCAAG 


AAGAATTAGA 


GGCACGTTTG 


AAAGTGCTTC 


1020 


GPAGTGAAAA 


TAAATTATTG 

x ruirt x X ** x x \m* 


GAAGCTCAAC 


GTTTGGAACA 


GCGAACCAAC 


TATGATATTG 


1080 


AHATGATGAG 


AGAGATGGGC 


TATACGTCAG 


GGATTGAGAA 


TTACTCTCGA 


CATATGGATG 


1140 


GTCGCCAAGA 


AGGAGAGCCG 


CCATATACGC 


TGTTGGACTT 


TTTCCCAGAT 


GACTTCTTAC 


1200 


TTGTAATCGA 


TGAGTCTCAT 


GTAACAATGC 


CGCAAATTAG 


AGGAATGTAT 


AATGGCGACC 


1260 


GAGPACGGAA 


ACAAATGTTA 


GTGGACTATG 


GTTTCCGGTT 


ACCAAGTGCA 


CTTGATAACC 


1320 




TTTAGAAGAA 


TTTGAACAAC 


ATGTAAACCA 


AATTGTTTAT 


GTATCGGCAA 


1380 


CACCAGGCCC 


TTATGAAATG 


GAACAAACAG 


AGACTGTTGT 


TCAACAAATC 


ATTCGACCAA 


1440 


CAGGTTTACT 


AGATCCAGAA 

r*wii X v** vrlWiui 


GTAGAAATCC 


GCCCAATTAT 


GGGTCAAATt 


GATGACTTGG 


1500 


TGGGCGAAAT 


TCATGAaCGG 


ATCGAAAAAG 


ATCAGCGGGT 


GTTTGTCACA 


ACCTTAACGA 


1560 


AGAAAATGGC 


CGAAGATTTA 


ACCGACTATT 


TCAAAGAATT 


AGGCTTAAAA 


GTTAAATATT 


1620 


TGCATAGTGA 


TATAAAAACA 


CTGGAAAGAA 


CGGAGATTAT 


TCGCGATTTA 


CGATTAGGCG 


1680 


AATTTGATAT 


TCTAATAGGT 


ATTAACTTAT 


TACGTGAAGG 


AATCGATGTG 


CCAGAAGTTT 


1740 


CCTTAATTGC 


GATTCTAGAT 


GCGGATAAAG 


AAGGCTTCTT 


ACGTAGTGAA 


CGCTCGTTAG 


1800 


TGCAAACGAT 


GGGACGGGCT 


GCCCGGAACG 


CTGAAGGAAA 


AGTCATTATG 


TATGCAGATA 


1860


AAATCACCGA 


TTCGATGCAA 


CGAGCTATGG 


ATGAAACGGC 


ACGACGTCGG 


GCAATCCAAG 


1920 


AAGCATATAA 


TGAAGAACAT 


GGTATTGAAC 


CAAAAACAAT 


TATTAAAGAA 


ATTCGTGATT 


1980


TGATTTCTAT 


TTCCAAAACA 


GCTGATAAAG 


ATGAAACAGT 


GGTTCAATTG 


GATAAATCAT 




ATAAAGATTT 


ATCAAGACAA 


GAAAAAGCTG 


ATTTATTAAT 


GAAACTTGAA 


AGAGAAATGA 


2100 


AGGATGCGGC 


CAAAGCATTA 


GACTTCGAAA 


CTGCAGCCAC 


ATTACGAGAT 


ACCATTCTTG 


2160


AATTGAAAGC 


TGCCAAATAA 


GTCAACCAAC 


GGATTAAGTA 


AAAGATACGA 


TGAACCATCG 


2220 


TATCTTTTAC 


TAATCTTTTC 


TCTATAGAAT 


GACTTGAAAT 


GATTTGACGT 


AAGAGATTTA 


2280 


GTAAAAGGAG 


AACATATATG 


GCAAATGATA 


AAATTGTGAT 


TCATGGTGCA 


CGCGCCCATA 


2340 


ACTTAAAAAA 


TATTGATGTC 


ACGATTCCTC 


GTGACAAAAT 


GGTTGTCGTA 


ACTGGACTAT 


2400 


CTGGTTCTGG 


CAAAAGTTCA 


TTAGCGTTTG 


ATACGCTTTA 


CGCAGAAGGG 


CAACGACGTT 


2460 
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AT G TAG AG AG 


TCTCTCTGCT 


TATGCGCGAC 


AATTTTTGGG 


ACAGATGGAT


AAACCAGATG 


2520 


TTGACAGCAT 


CGATGGTTTA 


AGTCCAGCTA 


TTTCAATTGA 


TCAAAAGACG 


ACAAGTAAAA 


2580 


ACCCACGCTC 


AACGGTGGGG 


ACGGTCACAG 


AAATCAATGA 


TTATTTACGA 


TTGTTATTTG 


2640 


CTCGGGTAGG 


TCATCCAATT 


TGTCCTAACG 


ATCATATTGA 


AATTACCAGT 


CAGTCTGTTG 


2700 


AACAGATGGT 


TGATAAAGTC 


TTAGAGCTAC 


CAGAGCGAAC 


AAAAATTCAA 


ATTTTGGCCC 


2760 


CTGTTGTGGT 


CAAGAAAAAA 


GGCCAACATA 


AAAAAGTGTT 


TGAAATGATC 


CAACGTGAAG 


2820 


GGTATGTCAG 


AATGCGGGTG 


GATGGTGAAA 


CCTATGATGT 


TAGTGAGGCA 


CCAGAACTTG 


2880 


AAAAAAATAA 


AAAACATGAT 


ATTGCGATTG 


TGATTGACCG 


TATCGTTGTG 


AAAGAAGGCA 


2940 


TTCGCTCTCG 


CTTGTTTGAT 


TCGTTTGAAG 


CGGCTTTGCG 


TTTAGCTGAA 


GGTTACGCAA 


3000 


TTGTTGATGT 


GATTGGTCAA 


GAAGAAATGT 


TGTTTAGTGA 


ACATTATGCT 


TGTCCTTATT 


3060 


GTGGGTTCAC 


AGTCGGTGAA 


TTGGAACCGC 


GCTTGTTTTC 


ATTTAATGCA 


CCATTTGGCG 


3120 


CTTGTCCCGA 


TTGTGATGGT 


TTAGGTGTTA 


AATTAGAAGT 


GGACAAAGAC 


TTAGTGATTC 


3180 


CAGATCCCAC


TAAAACCTTG 


AGAGAAGGGG 


CCATTGTTCC 


GTGGAACCCT 


ATTAGTTCCC 


3240 


AATATTATCC 


ACAAATGTTG 


GAGCAAGCAG 


CCACCAGTTT 


TGGGATTGAT 


ATGGATACGC 


3300 


CGTTTGAAGA 


ATTACCTGCA 


GACCAACAAG 


AAATTATTTT 


AAATGGTTCT 


GGCGAGAAAA 


3360 


ACTTTCATTT 


TCATTATGAA 


AATGACTTTG 


GTGGTGTTCG 


TGATGTGGAA 


GTGCCATTTG 


3420 


AAGGGATTTT 


AAAAAATATT 


AAACGACGTT 


ATCATGAAAC 


GAATAGCGAT 


TTTACACGAG 


3480 


ATCAAATGCG 


GTTATACATG 


ACAGAATTGA 


CTTGTCGAAG 


CTGTCAAGGG 


TATCGTTTGA 


3540 


ATCCGCAAGC 


TTTAGCTGTA 


AAAATcAATG 


GCACGCACAT 


TGGTGAAGTC 


AGTGAATTAG 


3600 


CAATAAAAAA 


TGCGGTCCAA 


TTTTTTGAAG 


GTGTGTCTTT 


ATCTGAACAA 


GAAACAACGA 


3660 


TTGCTCGGCC 


AATTTTAAAA 


GAAGTCGAAG 


ATCGGTTAAC 


CTTCTTAAAA 


AATGTTGGGT 


3720 


TAGATTACTT 


AACCTTAAGT 


CGAGCTGCTG 


GTACACTTTC 


TGGCGGAGAA 


GCACAAAGAA 


3780 


TTCGCTTAGC 


AACACAAATT 


GGATCAAACT 


TATCAGGCGT 


TCTTTACATT 


TTAGATGAAC 


3840 


CATCAATTGG 


TTTGCATCAA 


CGAGATAACG 


ACCGCTTGAT 


TGACTCATTG 


AAAAAAATGC 


3900 


GGGACTTAGG 


CAATACATTA 


ATTGTGGTGG 


AACATGATGA 


AGATACAATG 


ATGGCTTCGG 


3960 


ATTACTTGAT 


TGATGTGGGA 


CCTGGCGCTG 


GACATCTGGG 


CGGTGAAATT 


GTAGCCGCTG 


4020 


GTACGCCGGA 


AGAGGTTGCT 


AAAAATCCGC 


ATTCATTGAC 


CGGGCAATAT 


CTTTCTGGGA 


4080 


AAAAAGTGAT 


TCCTGTACCA 


AAAGAACGTC 


GCAAAGGTAA 


TGGCAAAGCG 


ATCAAAGTCA 


4140 


CTGGTGCCAG 


TGAAAATAAT 


TTAAAAAATG 


TCAGTGTTGA 


ATTTCCGCTG 


GGTGAATTTG 


4200 


TTGCTGTCAC 


AGGCGTTTCA 


GGTTCAGGGA 


AAAGTACTTT 


AGTCAATCAA 


ATTTTAAAAA 


4260 


AAGCGCTGGC 


CCAAAAATTA


AATCGTAATT 


CCAATAAACC 


CGGGAAACAC 


AAAAGCATCA 


4320 


CTGGTTATGA 


AGcCATTGAA 


AAGATTGTTG 


ACATTGACCA 


AAGTCCAATT


GGTCGAACAC 


4380 


CAAGAAGTAA 


TCCAGCGACT 


TATACTAGTG 


TTTTTGATGA 


CATTCGCGAT 


TTGTTTGCCC 


4440
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AAACAAATGA AGCCAAGGTT CSTGGOTATA AAAAA3GTCG CTTTAGTTTT AACGTCAAAG 4500 

GTGGCCSTTG TGAAGCTTGT CGCGGGGA7G 3TATCATCAA AATTGAAATG CACTTTTTAC 4560 

CTGATGTCTA TGTGCCTTGC 3AAGTTTGTC ATG3CAAACG TTATAATTCT GAGACGTTAG 4620 

AAG77C.-.TTA TAAAGGCAAA AACAT7TCTG ATATTTTAGA TATGACGGTG GAAGATGCCG 4 680 

TAGAArrTTT CAAACATATT CCTAAAA7TC ATC3TAAATT GCAAACAATT GTGGATGTAG 4740 

CCTTAGGCTA TGTGACATTG 3GSCAGCC3G CAACTACTTT ATCTGGTGGG GAAGCACAGC 4800 

CGATGAAGCT AGCGAGTGAA CTTCATAAAA ATTCTAATGG GAAAAACTTT TATATTCTGG 4860 

ATGAACCAAC AACAGGCTTG CATACCGATG ATATTGCGCG ATTGCTTCTA GTATTAGAGC 4920 

GTTTAGTGGA AGCGGGCAAT ACCGTITTAG 7GATTGAACA TAACTTGGAT GTCATCAAGT 4980 

CTGCGGATCA TGTGATTGAT CTAGGGCCAG AAGGTG3CGA CGGTGGTGGG ACGATCGTGG 5040 

CGACCGGAAC ACCAGAAGAA GTTGCTAAAG TCAAAGACAG TTACACAGGA TATTATTTAA 5100 

AACGTGCTCT GAAATAAACA TTAATGATTA GATAAAGAAC CTTGTTTTTA TGGGAAACAA 5160 

GGTT7TTTAT TATAAAATAA CAAATATTTT TTAATAAAAA TTAAAAATAA ATGTTGACTT 5220 

AATGTCTAAT TAAAAGATAT ACTATGTTTG 7TAACAAAAT AACAATGAAT ACGAAGGGAA 5280 

GTCTTAAATG AAAAACAAAA AAATAACAAC AAGTTTATTA ACAGCAGGAA TGGTCTTAGC 5340 

TATCGGAATC ACACCTTTTG TACATAGTAA CACATGGAAG CAGCGACCTT AATTTCACAA 5400 

TCAGACCATG AAAAGTAATG AGTTT7ACCA GAAAGACTAA AAGAAGCACG GGCATCAATT 5460 

GGGCCGGCGC AGTCCAGCGT 5480 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10391 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

CAGGCTTTTT AATGACGAAT GTCGTTCAAC AATCCATGCG CGATTTGCGT CGCGATATTG 60 

AAGAaAAAAT CAATCGTTTG CCAGTTTCTT ATTTTGATAA GAACCAACAA GGAAATATTT 120 

TGTCACGGGT GACGAACGAT GTGGATGCTG TCAGCAATGC GATGCAACAA AGTTTTATCA 180 

ATATTGTTTC AGCAGTCTTA GGTATTGTGA 7GGCGGTAGT GATGATGTTC TTAATCAATC 240 

CGCTGA7GGC GATTTTTTCA GTGAT7ATGA TTCCGTTGTC TCTGATTATT TCCAGAACAA 300 

TTGTTAAAAT CTCCCAGAAA TATTTCCAAG GAATGCAAAA TTCTTTAGGA GACTTAAATG 360 

GTTATGTCCA A3AAAATATG ACTGGGT7CA GTGTCTTAAA ACTATATGGT CGGGAAAAAG 420 

AAACCCTTGA AGGCTTTAAA CAAGTCAATC ATCGT7TAAA TGGTTTTGGT TTCAAAGCAT 480 

CCTT7ATCTC A3GATTAATG TTGCCATTGG TTCAGATGAC CGCTTATGGG ACCTATATCG 540 

5GGTAGCTGT CCTTGGTAGT TACTATGTGG TTGCTGGTGT GATCGTAGTG GGGCAATTAC 600 
AAGCGTTTAT TCAATATATT TGGCAAATTA GCCAACCAAT GGGGAATATT ACGCAGTTGT 660 

CTGCAGCTTT ACAAAGCGCT TCAGCTTCGA CCATGCGGAT TTTTGAAATC CTAGATGAAC 720 

CAGAAGAAGA ACTTAACGAA CAAGATGTTC CTTTGCCAGA ACCTATTTTA GGCTCTGTTG 780 

AATTTGAAAA TGTCAGCTTT AGTTATGACC CAGAAAAACC GTTAATTCGT AATTTGAACT 840 

TTAAAGTTGA TGCGGGCCAA ATGGTTGCGA TTGTGGGACC AACTGGCGCT GGGAAAACAA 900 

CCTTAATCAA CTTACTGATG CGTTTTTATG ATGTAACAGA AGGCGCCATT AAAATTGATG 960 

GCATTGACAC GAAAAAAATG AACCGTAGTG ATGTCCGATC TGTATTTGGA ATGGTATTGC 1020 

AAGATGCTTG GTTGTATAAA GGTACCATTG CAGATAACAT TCGTTTTGGG aAGTTAGATG 1080 

CCACGGaTTA TGaAGTTGTC GATGCaGCGA AAACGGcCAa TGTGGATCAC TTCaTTCGGA 1140 

CAATGCCAGA CGGGTATGaA ATGGAaATCa ATTCTGAGGG AGATAACGTT TCCCTTGGtC 1200 

aAAAACAATT GTTGACCATT GCCCgAGCGG TAATTTCTGA TCCGAAAATT TTGATTTTAG 1260 

ATGAGGCGAC TAGTTCAGTC GATACACGCT TGGAAGCCTT AATTCAAAAA GCAATGGATC 1320 

GTGTTATGGA AGGACGAACG AGTTTCGTTA TTGCCCACCG CCTATCAACT ATTCGTGAAG 1380 

CTGATTTAAT TCTTGTTATG AAACAAGGAG AAATCATTGA AAAAGGTACG CATCATGAGT 1440 

TGCTGGAACA AGGTGGCTTC TATGAAAAAC TATACAATAG TCAATTTGCT GAAGAAGGCG 1500 

ACTATGAGGA ATAAAAAGAG GTCGGGACAG AAGCGTTTAA CTCCGAGAAA TAAGAAGGAA 1560 

TTTCCGAAAA TTGTTCTGTA ATTTTTGGAG AATTCCGGCT TATTTCCGAA GGAGTTGCTT 1620 

TTGTTCCCGC CGTTTATCAG TTTTTGAGCG TGGAGCAAAA ATCCAAAGTG ATTTTTGTCC 1680 

CACGCTCTTT TCTTGTCTTT TTTACTATGA AAGTGAGGAA TTCTTGTCTA TACTTAATGA 1740 

AAAGAGGTGA CAGAGTTTGA AAAAAGAATA TTTTATTGCT GCTAGTGCAG ATGTCTATGG 1800 

AAACGTACAA CTAGCAAAGG ATAGTAATAT TTGGTTTCAG TCGGTCTTAA GAGGTGACAG 1860 

TAACACAATT ACCATTGGTG AAGGTAGTAA TATTCAAGAT GGCACGATTA TTCATGTTGA 1920 

TGAAGATGCC CCGACAATTG TAGGCAAGTA TGTAACGGTG GGCCATCAAT GCATGTTACA 1980 

CGGTTGTAAA ATCGGTGATG GTGCTTTGAT TGGCATGGGT TCAACCATTC TAAATAATGC 2040 

TATTATTGGA GAAAACAGCC TGATTGGTGC GGGCTCTTTA GTGACAGAAG GCAAAATTAT 2100 

CCCGCCAAAT GTTTTAGCAT TTGGTCGACC CGCGAAGGTT ATCCGTCCGT TAACGGCAGC 2160 

AGAAATAGCC AGTAATCATA CGAACGCCTT ACATTATATT CAACGAGCAG AAGAATTTCG 2220 

CCAACAAAAA TACCCGAAAC TAACATAGAA TTAGGAGGAA AATGATGAAG AAAATTAAAG 2280 

TAATGACGGT TTTTGGAACA CGGCCAGAAG CAATCAAAAT GGCTCCCTTA ATCAAAGTTT 2340 

TAGAAGAGCA AAGTCAAGGA TTTGACTCTG TAGTAGTCGT AACAGCGCAA CATCGCCAAA 2400 

TGTTGGATCA AGTGTTAGAA GATTTTCAAA TTACACCGAA TCATGATTTA AATATCATGA 2460 

AAGATGGGCA AACCTTAACA GATATTACAA GTCGTGTTTT GAATTTATTA ACTGAGGTTC 2520 

TTGTGACAGA ACAGCCGGAT ATTGTCTTGG TTCATGGCGA TACAACCACG TCATTTGCGG 2580 
CTGCATTGGC TGCTTTTTAT CAACAGATTC CAGTGGGACA TGTGGAAGCA GGATTGCGGA 2640 

CCTGGCAAAA ATATTCTCCG TTTCCGGAAG AAATGAATCG TCAACTGGTA GACGTGTTAA 2700 

CTGATATTTA TTTTGCCCCA ACCACTCAAA GTAAGGACAA TTTAATCAAA GAAAATCATC 2760 

CAGAAGAACA TATTTATGTG ACTGGCAATA CAGCAATTGA CGCGATGGCG TAGACCGTTG 2820 

ATGCCCACTA TCAAAAT~AT TTATTGGAAA AAATCCCTAC AGACCAACGC ATCGTGCTAA 2880 

TTACAATGCA TCGTCGTGAA AATTTAGGCC TGCCAATGGC AAATGTGTTT AAAGCTGTTC 2940 

GTCGACTTGT AATGGAACAT CCAGAAATTG AAGTCATTTT CCCTATGCAC AAAAATCCCA 3000 

AAGTCCGTGA AATAGTAGCG GAACACTTAG GTGAATTAGC TCGTGTGCAT TTAATTGAAC 3060 

CATTGGATGT CAAAGATTTT CAAAATTTTG CAGCGAAAAG TGCTTTAATT TTAACTGATT 3120 

CTGGCGGTGT GCAAGAAGAA GCTCCTTCCT TAGGTGTGCC AGTTTTAGTT TTACGAGATA 3180 

CAACGGAACG TCCAGAAGGC GTAgCGGCAG GAACATTGAA ATTGGTTGGT ACAGACGAAC 3240 

AAGTAGTCTA TCAAGAGGCG AAAACATTAT TGACTGATTC TGATGCTTAC CACGCGATGG 3300 

CGCATGCACA AAATCCGTAT GGGGATGGTC AAGCTAGCCG CCGAATCGTT GAAGCCATTG 3360 

CCTATGAAAT GCAGAAATCA GATAAAAAAC CAGATACGTT TACAGCAAAA TAAATAAAAA 3420 

GGCAGAGAAG CGCTGAAAAA TGTGCTTCTT TTGCGTATCT TATTTTGAAG AATCAATTAG 3480 

ACAAAAAATG CTAGAAAGTA GTGAAGTGAA GATGTATCAA ACGATTTTAT TTGATTTAGA 3540 

TGGCACAATT ACGGATTCTG GTAGCGGCAT TATGCGTTCT ATTTTATACG CAACAGAACA 3600 

ATTAGGCTGG CCAGCTCJTA GCGAGGAGAC ATTACGTTCT TTCATAGGGC CGCCTTTATA 3660 

TGAGTCGTTT TTGCACATGG CGCCTTCAGC TGAAGCAGCG CAACAAGCAG TGGGTCATTA 3720 

TCGAGCGTAT TATCAGAGAA AAGGTATGTT TGAAAATCAT GTGTATCCAG GGATCCCAGA 3780 

GGTACTAATA AGGTTAAAGG AAGCGGGAGC AAAGTTATAC ATTGCGACTT CCAAGCCAGA 3840 

AGAATTTGCT AAAAAGATTA TTACTCATTT TGATTTAGAT CGTTATTTTA CAGGCATATA 3900 

TGGAGCGAGT ATGGACGGTC ATCGTTCTAA AAAAGCAGAT GTGATTCAGT ATGCATTAAC 3960 

GGAAGCTCAA CTAGACCCAA CCAAAGAAGC AATTATCATG GTTGGTGATC GTAATCATGA 4020 

CATTCTTGGT GCTCAACAAA ATGGCTTAGA TAGTATAGGT GTCTTGTATG GATTTGGTGA 4080 

AGAAACAGAG TTACAAGAGG CGGGAGCGAC ATTCCTAGTA CACTCGCCTA AAGACTTAGG 4140 

TGCTATTTTG CTACGAAATT CTTAATAGAA AATGGATTTT CGATAAATTT AGGAGGATTT 4200 

TAAGATTTAT TATCGTTTCA ATAGAAAAGC GCCATCATAG GCTTTAAACG TTATCAATGA 4260 

TGAATTTTTT ATCGGGGAAT GACGGTCTTT TTATTAAAAA CATTTAATTT TAAAGAGGTT 4320 

TCATGTATAA TAGTTTCAGT CATAAACGAT AGAATAGTAA GGAGGGTTAG TTTTGGCTAA 4380 

TGATAACCAA AATAACCAAG ATCCCAAATC TTCGCTTAGA GATCAAGTAA CTGGGTCTTT 4440 

AAATGGTAGA AATGATGGCG ATCAACCTGA TTCCTCTGAA AAAAATGATA GGTCACCCCA 4500 

ACCTTCATCT GATGAATCAC AAGAAACAGC GAGCCGCACA ACACAAACAC GTGCAGGTTC 4560 
TAGAGCTGCT CGAAGACGTG GGAAAGACAA GACAACACAG ACAGTCGAAG AGCCAACCCC 4620 

GATTGAAACA GACGAAAAGC CAACAAATAC AAAAAAACAA ACACGTAAAA AAGAAGATCG 4680 

TTTAGTCGGT CGTATCGTGT TAATTGTAGT TTCTGTATTA GTTTTAATGA TGGCAATCTT 4740 

TGGGTTTACT TTTTATAAAT ATGTAGATGC TGGGCTACAG CCACTAGACA AGAACAATAA 4800 

GAAGCTTGTT CAAGTGCATA TCCCAGAAGG ATCTTCTAAT AAGCAAATTG CTGCGGTTTT 4860 

AGAAGAAAGC AACGTCATTA AAAGCGGAAT GGTTTTTAAC TACTACGTAA AATTTAAAAA 4920 

CTTAACAGAT TTTCAAGCGG GCTATTATCA AATGTCTCCA AGCATGACAC TCGATGAAAT 4980 

TGGTGAAATG CTAAAAGAAG GAGGTACGCC AGAACCAACT AAAATTGCAG ATGGCAAAGT 5040 

AACGATTCCT GAAGGCTACG ATATAGATAA AATTGGCGAA GCTATTGAAA AAAATACAGA 5100 

TTTCAAGAAA GCAGATTTCA TTGCATTAAT GAAAAACGAA GACTTCTTCA ATCAAATGAA 5160 

AGCTAAGTAT CCTGATTTAC TTGAAAGTGC TGCAACTGCA GAAGGTGTTC GTTATCGCTT 5220 

AGAAGGCTAT TTGTTCCCGG CAACTTATGA TTACTACAAA AAAGCAACAC TTCCTGAATT 5280 

TGTCGAACAA ATGATTGCTA AAATGAACAC AGTAATGGAA CAATATACGC CGACAATTCA 5340 

TGCAAAGAAT TTAACCAATC AGCAAGTCTT AACGTTAGCT TCATTGGTTG AAAAAGAAGG 5400 

AGTCAAAGAA GCAGACCGCA AACAAATTGC GCAAGTATTC TTTAACCGCT TGGCTGCCGA 5460 

TATGCCAATC CAATCAGATA TTTCTATTTT GTATGCTCTA GGAGAACACA AAGAAACTGT 5520 

AACGTATGCT GACTTGGAAG TGGACTCGTC TTATAACTTG TATAAAAATA CTGGTTATGG 5580 

ACCAGGACCA TTAGATAGCC CAAGTGAGGA GTCAATTAAA GCAGTCTTGA ATCCGACACC 5640 

AAGTGACTAT TTATACTTTg TAGCAGATAT TTCTACAGGA AAAGTCTACT TCTCAAAAAC 5700 

ATATGAAGAA CATCAAGTAT TAGTAGATCA GTATGTCAAT AATTCAAGTA GTGAATAAAA 5760 

TGAAAACATA AAAGAGCTTG GTGAATCAAT TTTGTATGAT TCACCAAGCT TTTAGATACT 5820 

CAAAAAGTAA TCTGTTTAAC AGAGGGAAAG AGTAGCAACT AGAAATTTAT ATAAATAAAA 5880 

TTTAGCAAAA TCACAAAAAT TCAATAATTA TATTTACAAT TTAGAAAAAG CGTGGTACTC 5940 

TTTTGTGGAT TGAAAGCGGA ATTTTTTTCA ATAACCAACC AAGAACTACG TATCGTTAAG 6000 

CGAAAAAGGA GAATTAACAT GGTAGAAAAA GTATATCCAA TGACTCTTGA AGGAAAAGAA 6060 

AAATTAGAAC AAGAGCTAGA AGAATTAAAA ACAATTAAAC GTAAAGAAAT CGTAGAGCGT 6120 

ATTAAAATCG CTCGTAGCTT TGGGGATCTT TCAGAGAACT CAGAGTATGA ATCAGCTAAA 6180 

GATGAACAAG CTTTCGTTGA AGGACGTATT ACAACGTTAG AAAATATGAT TCGTTTTGCA 6240 

GAAATCATCG ATAATGATGG CGTGGACCAA GACGAAnTTT CCATCGGTAA AACAGTTACT 6300 

TTCCAAGAAT TGCCTGATGG AGAAGAAGAG GAATACACAA TCGTAGGAAG TGCCGAAGCC 6360 

GATCCATTTT CTGGTAAAAT TTCTAACGAT TCACCGATTG CGCAACCnTT AATTGGTAAA 6420 

CAATTAGGCG ACCAAGTGAT TATTGCAACA CCTGGTGGCG ACATGACAGT TAAAATTACA 6480 

AAAGTTGAAG CAAACTAAAT AATTGAGTAG GGCAGGCAGC TTCCAAGGCC tGTCCTTTCG 6540 
CAATCTATGA CATTTAGGGC TAAAGTCCTA TGGTAACCTG aGGCGCAAAC CACTATGATT 6600 

AAATAAAATA AACGAGTAAA ATAAATGACA GAGGGAGGAG GCAAATCTAT GAATAACCCT 6660 

TGGCGCTTTT TTATCGTCGC AGAAGCATTA CTTTTTATTC TGGCGTTATG GCAAATTGTA 6720 

CATAATCCTG GaTTAGCTGT TTtATTAACA ATTGGCGTTT TACTTGTGGC CTACGTTTCC 6780 

AGGAAAGCAT CTAAAACACA TTTTAACAAC TTTCAATTCG TCCTCGGCGT TGTTTTTATT 6840 

GTCATTGGTG CAATGAATAG CACGGCTGTT TGGTTTATGT TGATTTTTGG CGTACTCTTT 6900 

ATCGGCTTAA AAGGCTTTGA GATTTCAGGC GTGGATATAG CTGAGCGAGC ACCTTGGCGA 6960 

AAAAAACAAA TGATTATGGT GGAGACGGCG GCAAAAGAAC CTAAAAATGG CAAACGGTTT 7020 

AAACGCCGCT GGTTTGCCAA CGAACGCATT GGTAACAATA TCTATGAATG GGACGATATC 7080 

AATATTGATT TAATCTCTGG GGACACCATT ATTGATTTAG GTAATACGCT ACTACCGAAA 7140 

GAAGACAATA TTATTATTAT TCGTAAAGGT TTTGGCCGCA CGCGAATTCT AGTGCCGTTA 7200 

GgTGTAGCTA TTTTGTTAGA ACATTGAACT TTTTACGGAA CGGTACGTTT TGAAGAAGAA 7260 

AAATATCAAT TGAAAAACGA ATCATTAAAA ATTTACAGCA ATGATTATGA TACCAATCTT 7320 

CGTCGTTTGA AAATTATGAC GAACACTTTA GTAGGAGATG TTGAGGTGAT CCGTGTATGA 7380 

CCGATCGGAT TTCAAGACGC ATGATTTCAT TATATGCGTC CCTTAGCACC TTTATTGTTA 7440 

TCTTAATTAC ATTGTTTTCA TATTTTCATT CGATTAAACA AAACCGGTGG TTATTAGAGC 7500 

TTCTTCAGAG AAAAGTCTTT TATTTACCAC TAATTGTGCA CATTGTTCTC ATATCCTTAG 7560 

TAATAGGCTT ATTGACCTTT TTACTGATTT CATTGGTTCA AAAAGGGCAA TATGGACGGA 7620 

TTGAAGAAAA ACTTCGGTTA TTGGCCAACG GTAATTATGA AAGTCCAGTC TTAAACAAAC 7680 

CAACGACCAG TGAAAATCAA GACCATTATC TAACCGAAGT CGAACAAGAT ATTTGGTCGA 7740 

TTAAAAATAA ATTATTAGAG ATGTCTAAAG AATTGCAATT ATTAAACAGT CGACCGCAAT 7800 

TAATGGATGG GCAAACAAAA GAAGAAATTT TAGAGAACGA GCGGCATCGT TTGGCGCGGG 7860 

AGTTGCATGA TTCAGTCAGT CAACAACTTT TTGCAGCCAT GATGATGTTG TCTGCATTAA 7920 

ATGAACAAGC ACAACGAACA GAAACCCCGG AACCATATCG TAAACAACTA GCCATGGTGG 7980 

CAGAAATCAT TAATGCCTCC CAATCGGAAA TGCGCGCGCT ACTATTGCAC TTGCGTCCTA 8040 

TCAGTCTAGA AGGAAAAAGT TTGCGTAAAG GTATTGAACA ATTACTGAAA GAACTACAAA 8100 

CAAAAATTAA AATTGAATTG ATTTGGGATG TTGAAGATGT TCATTTAAAT AGCAGCATTG 8160 

AGGATCATCT TTTCCGAATT GTGCAGGAGT TACTTTCAAA TACCTTAAGA CATGCCAAAG 8220 

CAAAGGAATT AGAGGTATAC TTACACCAAG TCGATAAAAA CGTGTTATTG CGTATTGTTG 8280 

ATGATGGTGT CGGCTTTGAT ATGAAGGAAC AAAGTAATAA AGCCGGTAGT TATGGCTTAA 8340 

ATAATATTCG AGAACGTGTT GTCGGCATGG GCGGTACAGT TAAAATTATT AGTTTTAAAG 8400 

GGCAGGGAAC CAGCGTTGAA ATTAAAGTTC CTGTCATAAA GGAGGAAACT GCAAGTGATC 8460 

AAAGTAATGT TAGTGGATGA CCATGAAATG GTCCGTTTAG GCGTTTCATC ATATTTATCT 8520 



ATTCAAGAGG ATATAGAAGT CGTAGGCGAA gcAGAAAACG GTAAGATTGG CTATGAAAAA 8580 

GCATTGGAAC TACGTCCAGA TGTTATTTTG ATGGATTTGG TAATGGAAGA AATGGACGGC 8640 

ATTGATTCAA CAAAAGCGAT CTTGAAAGAT TGGCCAGAAG CCAAGATTAT TATTGTGACG 8700 

AGTTTTATTG ATGATGAAAA AGTGTATCCG GCGATTGAAG CTGGTGCAGC GGGCTACCTA 8760 

TTAAAGACAT CAACAGCACA TGAGATTGCT GATGCAATTC GGGCGACTTA TCGCGGAGAG 8820 

CGTGTGTTGG AACCTGAAGT GACGCATAAG ATGATGGAAC GGTTAACAAA AAAACAAGAG 8880 

CCGGTGTTGC ACGAAGATTT GACAAACCGG GAACACGAAA TTTTAATGTT GATTGCACAA 8940 

GGTAAAAGTA ATCAGGAAAT AGCTGATGAA CTCTTTATCA CTTTGAAAAC AGTTAAAACA 9000 

CATGTTTCAA ACATTTTAGC AAAACTAGAT GTGGATGATC GGACCCAAGC GGCGATTTAT 9060 

GCTTTTGAAC ATGGTTTAGC CAAATAAATA TCAGATAAAT CCATAAATAA ATTCTAGGAA 9120 

ATCCTTCTAG GAATTTATTT ATGGATTTGT TATACTAATT TTAACGAACA GACAGGAGCG 9180 

GTCAGAAGAA ATGAAACAAA ATTTTGCAAT TATTGGTTTA GGACGTTTTG GCGGCAGCAT 9240 

ATGCCAAACT TTGGTGGAAG CTGGTCAAGA AGTTTTGGCC ATTGACAGCA GTGAAGACCG 9300 

TGTCAATGAA TATATGAATA TTGCAACGCA TGCAGTGGTG gcTAACGCCC AAGATGAAAT 9360 

GACGTTGCGT TCTTTAGGGA TTCGGAATTT TGATCACGTA GTGkTGGCAA TTGGTGAAGA 9420 

TATTCAAGCG AGTATTTTAG TGACACTAAT GGTTAAAGAG ATGGGGGTTC CCAATGTCTT 9480 

GGCCAAAGCA GTCAATGAAT ATCATGCCCG TGTCTTAGAT AAAATTGGGG CAGACATGGT 9540 

CGTTCACCCA GAGCGGGATA TGGGCATTCG AATTGCTCAC AAACTTGTTT CACGCAATAT 9600 

TTTAGATTAT ATTGAATTAT CTAGCGAATT TTCATTAGCT GAGGTTCGGG TCACCAATCC 9660 

TAAATTTTAC AATAAAACCT TAGCAGAATT AAATTTCCGT CAGCGATTTG GTTTAACTGT 9720 

GGTAGCGATT CGCCGCTCGA AAACAGAAGT AATTGCTTCG CCAGATGCTT CTGAAATTGT 9780 

TCGAGAAAAC GACAATTTAT TAGTTATCGG TGATACCGCA GACGTGGATT TATTAGATGA 9840 

AAAAATGAAT GGCTAAGCAC TTTATTCTCT AAAAGCACTG CGCTGTGCTA CAATAAATGG 9900 

CGACAGAGAT GTGAAGGAGG AACAAAGAAT GAAATTAGAA GTGACACCGA AAGCCCAACA 9960 

ATGGTTTAAA GAAGAAACCG GGGTTCAGCC TGAGACAGGC ATTCGTTTTT TCGGAAAAAT 10020 

TTACGGAAAA ACACCTGTTC ATGATGGTTT TTCCATTGCG ATGTCTGTAG AAGCGCCAGA 10080 

TGAGCCAATG ATTAAAGAGA ACCTAGATGG GATTACCTAT TTTATTGaAG AAACAGATGA 10140 

TTGGTTCTTT AAAGGGTATG ATTTGATTGT TGATTATGAT GAAAAGAATG ATGAACCGCG 10200 

TTATCmATTT GAAGCAAACC mAGrAGaTTT AACmAAATAA TTTTAGTGCT AAACAAAAGA 10260 

GGCTGAAGTA GTGCGTAGGA CTATTTCTGT TGCTTTCGTT TAACCGCTAG TCAnnGTGAG 10320 

AAAAAATCGT TTTGGATTTT TTCTCAGGTC TTTCTATTTA AGTATTTTAG TTGGATATCG 10380 

CTATTTAATA A 10391 



(2) INFORMATION FOR SEQ ID NO: 134: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

AATGTTAAAA GGAnAAATAA TAGAGTTAAA GTTGAAAAAG tGgAGTTAGA TTATGTAGGA 60 

AGTATTAmTG TkGATAkGGA TTTATTAGAA GcAGsAGGaA TATATGAATA TGAaAnAGTA 120 

CAAATAGTTG ATGTTAATAA TGGAAATAGA TTTGAAACTT ATACAATAGC TGGAGAGAGA 180 

GGAACAGGGA TGATTTGCTT GAATGGAGCT GCAGCTAGAT GTGTATCAAC TGGAGATAAA 240 

ATAATTTTAA TGGCATATTG TGAATTAAAT GAAAATGAGG TTAAGGATCA TAGACCTAAG 300 

GTTGTTTTTG TCGATGATAA TAATAAAGTT GAAAGAGTAA CATCATATGA AAAACATGGA 360 

AGGTTATCAG aTATAATTCT TTAGGTTTTG TAATGTTATT TATATATAGC TAGACTATTA 420 

TTGAAATATT TATTGATTAC TAATTGGATA GATGTCAATA AAGTAGACAC AAAAAGAGAA 480 

GATACTTAAA GATTTTCAAA GTAAAGAGCC TCCATTTCAT TTGGTGtAAG CATATCTCAT 540 

GTTCCATGAG GACGTTTGGT GTTATAGAAG CCTTCAATGT AGGCAAAACA AGCCAACCGT 600 

ACTTCTTCAA TTGAATGGAA TGAACGGCGA TTTAATTCTT CTTTTTTCAT ATATTAAAAA 660 

AAAGCTTCGG TAACGGCGTT ATCCCACGGA TAACCTGGTT TTGAAAGTGA GTGGACAATC 720 

GGATGATCCT CTATAAATTT TCTAAATTCA AAAGAGGTAT ATTGAGACCC TTGATCTGTA 780 

TGAAAAAGAA CAGAACTCGT TAAtGcgAAT GGCAACAGTT TTTTTGACAA ATTTTATAAG 840 

GTGCAGAACT TCTTTCCGTA TACTATTCCG ATTGTCCTAG ACAATGAGCC TCCTCGGATG 900 

TGACGATCTT CGGCAGGAGC TAACAGTTAA AGTTAGACTG CCTCGCTAGC GTTAgcACAT 960 

CCGAGCTCAT TATCAAGGAT TCGcTGCGCC AATCTCTGGT TACGGTAACC AACCGGTGTC 1020 

TCGTAGCCAA GTGTACCGTG CAACCGAAGt GGnTTCCACC AATTGACATA GTCAAATAAC 1080 

TCCAAATCCA ATTGTTGTAA GGTTTCAAAT GTGTATTGAT AGACAAATTC TACTTTCAAC 1140 

GACTTATAAG TTGATTCAGC TACGGCATTA TCAAAAGGAC AGCCTTTATG ACTCAATGAT 1200 

CGATTGATGT CAAAAGTCGT TAATAATTCA TCAATAGCTT GGTTATCAAA CTCTTTTCCA 1260 

CGATCAGTAT GAAAAATCTC AACCTCTGTC AGAGGTTGTC TGATACGGCT AAATGCTTTT 1320 

TTTACTAGAA CGGCATCTTT ATGTTCTCCA CAAGAATAGC CGAGAATTTC TCGATTGAAC 1380 

AGATCCAAAA TGAAACAGAC ATAATTCCAT TTTTTCCCGA CTCGTACATA AGTCAAGTCT 1440 

GTTACGATCG CTTCTAATGG GTTGTCTCTT AAGAATTTAC GATTCAATAC GTTTGTCGTT 1500 

TTGGCTTCAT TGCAAGTAGA ATGATGTACT TTAAAATAAG CAACAGTATA GCTCGATTTT 1560 

AATCCTCTAT TTTTCATGAT TCTACTAATT TTTCGTCGGC TGATCTGAAT GCCTCGTTTT 1620 

GATAAGGCTT TTTTTATTTT TCTTGAGCCG TAGgCtTTCG GCTGAGGATA AATTCTTCAG 1680 
CGACTACTTC 


TTCAAGTTCT 


GATTCGTCTT 


TCTTTGGTTT 


1 GATTGATAA 


TAATAGGACT 


1740 


GACGTGATAG 


ACCTAATATT 


CTGCACATCG 


CTGATAlAbb 


b i AAAbA I bL 


rp rp tv rp rn /"* yv rp 


1800 


CGATTACTTG 


TCTCTTCGTC 


CGAATATCAG 


CGCTGCTTbb 


TTTAAAATAT 


CATTTTCCAT 


1860 


TTCTAATTGC 


TGGTTTCTTT 


TACGTAGTTC 


TAACAATTCT 


TTTTGTTCAG 


GCGTAAGATT 


1920 


ATCTTTTTCT 


TTGAATGAAC 


CACTCGTTTT 


AGATTGCTTT 


ACCCATTTGT 


CAAATGCTGA 


1980 


AGCCGTTAGT 


TCATATTCTC 


GAATGATTTC 


TACACGTGGC 


TTTCCAGCTA 


AGTAAATATT 


2040 


GACTATTTGT 


TGCTTGAATT 


CTTGTGAAAA 


AGTTCTTCGT 


GTTCTCTTAG 


ATATAAAAAT 


2100 


TCCTCCTGGT 


ATGtTTTCTT 


CTAGTCTACA 


CACCTTAATT 


TTTCTGTCTA 


GTTAATTGTA 


2160 


GCCTATCCAG 


TGCTTCTGCC 


GGAGAAAATC 


AGATCCGAGG 


ATGCTCATTG 


TCAAGGGCAA 


2220 


TCGGAGCCAT 


AACACCGCAG 


CATTCACAAC 


ACCTTATAAT 


TTTTGTCAAA 


AAAAGTGTTG 


2280 


CCATTCCnAT 


AGATGAAATT 


CTTCAATTGA 


GAGTGTAAAA 


TATTTTGTGT 


AAATAGAAAA 


2340 


AAGGAAGTCC 


CTTCTGTAGA 


AT AG AG TT AC 


CACAACACAT 


TCACAGAAAA 


GAGGACTTCC 


2400 


ATATGAACGA 


TTTTACTACA 


GAAATTCTAA 


AGACyCTAGC 


GAACnAAAGG 


CGATTTGaTG 


2460 


AATTATTAGT 


TGCTTTAATT 


ATTTATAAAT 


AACTATAATG 


CTTTTATTAT 


AGTATCGTAA 


2520 


ATAGTTTATA 


TGAATACATT 


AATTGTAAAA 


ATGGTGGATG 


GCTCTGAACG 


ATTAACAATA 


2580 


TTTTTTTGTT 


TTGACTATCA 


TTTTATTGTT 


CAGAAATAAT 


TTATTGCACT 


TTTTGAGGTA 


2640 


AATAGTTTAG 


AATTATTTTA 


ATATAAGTAC 


TTCAGATTTA 


ATAAAGGAGA 


AATATCTAAT 


2700 


GAAAAAAATT 


GATTTAATAA 


ATATGATAGG 


TATGTTAATA 


GGGATATTAG 


TAAATATAGT 


2760 


AATTTTCACA 


GATTGGCTCG 


GGGTGTTATT 


TTCTAATCTT 


ATTCCCATAC 


TTATTATAGG 


2820 


CATTTGTGGA 


ATTATACTTT 


CAATATTAGA 


ACTTTTTGAA 


AGCAGAAATA 


CAATGAATAG 


2880 


AATATTTGCt 


TGTATTATAT 


TAATTGTTAA 


TCTATTGCCT 


ATGGTCTATT 


TTACTTTTTT 


2940 


ATATTTTGCA 


CTGGGATGAA 


ATAAATTAAC 


CAAAAGTTTC 


TTTTTCTTAT 


AATGTGTATC 


3000 


AATATTAAAA 


CTGAAAAAGA 


TATGGATAAT 


AGGATCAAAG 


CCCTCATTGG 


AACACTCAAT 


3060 


ATATTATCAG 


ATTTAAATTT 


CAAATAGATA 


ATTTCATTTT 


TTAGTAATAT 


CGAAAGATAC 


3120 


TTTTTTAATG 


ATGAGAGTGT 


AATGTAAACC 


GTGTAAGTAA 


CCATTAATCC 


TAGGCAGG I b 


3180 


GTTGcTTCGC 


GGTTTTTCTT 


TTTCTACAAT 


AAGATCAAGA 


GGTGATTACA 


CGATbbbi Ab 


3240 


AAAGAAAAGA 


AATCCTGATG 


CCGAAAAGTT 


AGCTGAATCC 


ATTCTGAATG 


ppfpTi rn /-^ tv Tv CC* 

CC lAibAAbb 


3300 


TGAATCTGTC 


GATGACATGC 


AAGATGCTTT 


GAAAGATGTG 


TTTGGGCCCC 


TTTTTGAAAA 


3360 

AATGCTTCAA 


GG AG AATTG A 


ATAATCATTT 


AGGTTATGAT 


GCCCATTCTA 


AAGAGCCTAA 


3420 


GGAACACGAT 


AACCGTCGAA 


ATGGCTATGG 


AACTAAAACG 


CTTAAAACCA 


GTTTTGGTGA 


3480 


AGTAGCTATT 


GATGTTCCTA 


GAGACCGGGA 


AGCTTCCTTT 


GAACCAGAGT 


TAATTCCTAA 


3540 


GAGAAAACGA 


GATGTCTCCG 


ACATCGAAGG 


GAAGGTTCTT 


TCCATGTATG 


CACGAGGAAT 


3600 


GAGTCAACGA 


GATATTGCCG 


CAACCGTCGA 


AGCTATCTAT 


GGCTTTGATA 


TTTCCCATGA 


3660 
AATGATTTCA GATATCACTG ACGCTGTCCT TCCTGAATTG GA&GAATGGC AAGCCCGCCC 3720 

CCTAGCAAAG TGTTATGCCT TTCTATTCGT TGATTGTATG TATGTTACTT TACGAGAAAA 3780 

TTATGAAGCC AAAGAATATG CTGTATACAC CATTCTTGGC TATGATCTCA AAGGAATAAG 3840 

GAGATTTTAG GA 3852 
(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 



ATTGGCTAAG 


CGTTTAAACA 


AATGCGTCTT 


CTAAATAAya 


GGTTTTAATG 


ACATGACTTA 


60 


nTTCyCCTTT 


TtGtCCATAA 


TTTCTTGTAA 


tCGTgCACGA 


TTTTGTTCAT 


TTCTTCTTGA 


120 


TTGACAGAAC 


GAATGTCAAA 


TTCAATGATG 


CCaTTGTTGG 


CTTGATATTC 


TCGCGTATAA 


180 


ATGGCTGGAC 


TTTCTGCTTT 


TAATGCTTGG 


ATGACTTCCT 


TAGCGGTTTT 


TCTGCCGTCC 


240 


ACTTTGACAC 


TGGCGCGATA 


AATGTCTCGG 


CCAGCGCCAT 


CTTGGACGAT 


TTTGGCGGTT 


300 


AAATCTGATA 


AGTTGTTGAT 


GGCTTCAACA 


AACGGTTTTA 


AGCGCTCTTG 


CATCGAAGCC 


360 


CCTGATTCAC 


TGCCATGTGC 


CAAGTATTCT 


TCCACTGCTT 


GCGTAAAGCC 


CAAGATATTG 


420 


TCTTTGCCAA 


TTTTCATTGC 


TCGACCAATG 


CCTTTGCCTT 


GTAGGCGCAC 


CCAGTCAATG 


480 


TATTCTTTTT 


TCCCGACCAC 


TAAACCGGCA 


CTTGGCCCTT 


CAATTGCTTT 


GGCGCCGCTG 


540 


TAAATCACCA 


AATCAGCGCC 


TGCTTCAGTG 


TATTTGAAAA 


GATCTTCTTC 


TGcTGCCGCG 


600 


TCAACAATTA 


GCGGCACCTT 


GTGGCGTTGC 


GCCACTTTGG 


CCGCTTCGGC 


CACCGTCAGC 


660 


ATACTTTTTT 


GCACGGTATG 


GTGGCTTTTG 


ATGTATAGAA 


TGGCCGCTGT 


TTTTTCAGAA 


720 


ATCATCATTT 


CCACGTGTTC 


TGGACTGCAC 


ATGTTGgCGT 


AGCCAGCTTC 


CACCACTTGG 


780 


CCGCCACCTT 


GCGCTACCAT 


GACTTCTACA 


GGCGTGCCGT 


AGTCCACGTT 


GTGCCCTTTA 


840 


GGTAAAACAA 


TCTCCCGCTG 


TTCGATTTTT 


TCTGTATAGG 


GATGATACGC 


GTGGTACAAG 


900 


CTGCCTTTTC 


CAATCAAGGC 


AGCCACCGAT 


TGGGCAATCC 


CCGCCGAAGC 


CGAGGAAACG 


960 


ATTTGGGCAT 


CTTCTACCTT 


CAACAAGTTG 


GCTAAAAAGG 


CCCCTGTTTG 


CACACTAAGT 


1020 


TCACTCATTT 


CAAAGAAATG 


TTCGCCGCCG 


AATCGTTGAG 


CTGCTAGGAC 


TGCCTCCGAA 


1080 


ACTTTGGAGA 


CACCTAGAAT 


GGTCATTTTG 


CCAGAAGCAT 


TAATGACTTC 


TTTTAAGTGG 


1140 


AATTTTtCGT 


AACTAATTGT 


CATAGATTTG 


TCCTCCAATA 


ATTGTTTTGA 


TTGGGCGAAT 


1200 


TTGCTCTTTG 


GCTACTCGTG 


TCAATCCATT 


AGAATCAGTC 


AATGTTTTTT 


CTTCTGCTTG 


1260 


AATCGTGAAA 


ATCGTTAAAT 


CAGCATCTTT 


ACCTATTTCC 


AACGTGCCTT 


TTTGCGTTAA 


1320 


ATGGAAATTC 


TCAGCAGGAG 


CCTTGGTTAC 


CTTTTCAATA 


ATTTCTGGCC 


AATCATAGCC 


1380 


GACCACACGG 


AGTTTTTCCA 


TCGTTGTGGC 


TAAATCATAA 


ACAGGACCAT 


TTTCTCGGTT 


1440 
ACGGATGTAG 


ATATCTGTGC 


TAATTGATGC 


AGCTTTCATT 


CCCTCGCGCA 


ACGCTGTTTC 


1500 


AGCTACGTGA 


AAATTGAAAC 


TATCCGTACC 


ATGCCCGATA 


TCAAATACGA 


CTCCTTTGTT 


1560 


GTACGCTTGC 


CAAGCGAAAT 


CTTTTATTTT 


ATCCGTTGCT 


TGGTCTAAGA 


TACCATTTTC 


1620 


TTTCCCATTA 


AAACAATGGG 


TTAGGACATC 


CCCTTTTTCC 


ATCAAGGCTA 


AAATTTCATC 


1680 


TAAATGAGGC 


GGTGCTGATC 


CAATGTGAAC 


CATTAAGGGA 


ATTTCTTGGT 


TCTCTTGCTG 


1740 


AATCTGTTTG 


GCCAATTCTA 


AAGGCGTAAT 


GCCGTTATCG 


CCAATAACGG 


TCCTACTCAT 


1800 


ACGGGCTTTA 


ATCCCTACAA 


CAAAATCTGG 


TAATTCTTGA 


ATCGCTTTTT 


TGACTAAACT 


1860 


CGCTTGTACT 


TTACTTAAAT 


CTGCGAGTTC 


GTCCTGAGCA 


ACGATGCCCC 


ATTTAGAAAT 


1920 


ATTGACTAAT 


CCAAAAACAT 


TTGTTTTTGC 


TTGcTGCGCT 


AAGTCATAAA ATTCATGAAT 


1980 


GTTTTCAGCA 


CCTGTTGTCC 


CAGCATCAAT 


CACTGTCGTA 


ACACCCTTTT 


TGACCCCAAT 


2040 


TTCATCTGGA 


TAATCATAAT 


AAAGAGCCAT 


TTTTTCAAAA 


CAATGAACGT 


GATCATCTAT 


2100 


CCAGCCTGCG 


GATACATAAG 


TACCTGGTTC 


TAAGTGGATA 


GTTTCTTTTG 


CAGAACCTGA 


2160 


AATAGTTGCT 


GCAACAGCAG 


CTATTTTTTT 


CTCTTTAATT 


GCAATTTCAA 


CAGGCATACC 


2220 


ATTTACTGTC 


TGTCCATTTT 


TTATAAGTAA 


ATCGTAGTCC 


ATATTCTTCC 


CTACTTTTCT 


2280 


ATAAATCTAA 


ATTTTGTCCA 


AGGCTTAATG 


AATTCTAATA 


ATACAAGTTC 


TTCTTCTATA 


2340 


ATACGCCCTA 


CTCGGTTTTT 


GCGTGAGTCT 


TGATGTGGTT 


CTAAAACGAC 


TTGTAATTCA 


2400 


TTTTTGTATT 


TTCCAAATGC 


ATCATTTCCC 


ACAACAACAT 


CTCCAGGTTG 


TAGCATTGCT 


2460 


TGATTATCAT 


GTGGGGGATT 


AACTTCATTT 


TTATATTTTT 


TACGCACTTC 


TGTGGAACGA 


2520 


ATCATCTGCT 


GAGTGATATC 


TCCACGGCGA 


AAATGTTGAT 


TTTTTGTTAC 


GATTTCTTTT 


2580 


TCAATTTCAG 


AAGTCGCTTC 


TTCAAAAACA 


ATGGTCAATT 


CTGTTTGATA 


ACGATTCAAC 


2640 


TGACCCAATG 


CTTCTAATTC 


TTCGTTGGAA 


GCATAAGCAT 


TTCCAATAAC 


AACATCATCA 


2700 


ATTAAATTTG 


TTGCAAATAA 


ATGTTTAGCT 


TGAACAGTGA 


CTGGCAACTG 


TCTATGCATT 


2760 


TCTAAAGTAG 


GTAGCCCATC 


ATTCACATCC 


CAAGGACCAA 


TTTCACCTAC 


CTGTGATGAA 


2820 


ATAAAAGCAG 


CTGTCCGAAT 


ACCTTCCTTT 


TTAAAACGGA 


TGCTACATTT 


TTCAAAGAAA 


2880 


TCATACGGCA 


ATGCAGTTCC 


CGCTTGTGGA 


TAAAAGTTAT 


GACAACCATA 


TAAGAAAGGT 


2940 


CGATTCGCTT 


CATACGTCAA 


AATGTTATCA 


AGATAAGCGA 


CATCATTACT 


CATATTCAAC 


3000 


TCAATAGCCA 


CGCCGAATGG 


ATTAAACGTC 


AACATGGCTT 


CTTTATTTCC 


ATCAAACCCT 


3060 


GTATCCAACC 


GAATCCCATC 


AGCTCCTAAT 


TCGCTGAAAA 


AAGTCAAATC 


ATCATATGAA 


3120 


ATTTCCAATT 


CATCAAAGAT 


ACTTGGAACG 


ACATCTAAAA 


TTGTTTCAAA 


TCCTAAATTT 


3180 


TTCGCATAAC 


TAATTAATTC 


TTTAAATTTT 


TTCTGAACCG 


CTTCTTTGCC 


ATCGGTCACT 


3240 


TCTAACATAC 


TCATAAAAAT 


TCGAGTAAAC 


CCACATTCGC 


TGGCTTTTTT 


TAAATACGCT 


3300 


TTATCTTGAT 


TGATATCACT 


ATGATCTGGA 


TAAACTGATA 


CACCTAATGC 


TCTTTTCATC 


3360 


TTCGTTCTCA 


CACTCCGCTA 


TTTTCAAATT 


TATTTCGAAA 


TTTCCAAGCA 


ATCACCGAGC 


3420 
TACACAAGAT 


TGTTGTAGAG 


CCAACGAGCA 


ATACTGGTTG 


ATTTAGAAAT 


CCGCCAATTA 


3480 


TCATGGAAAG 


AGTAATACAA 


ACTGCCGTAA 


AAACCAACGT 


CTTTACCAAA 


CCAAATTGAT 


3540 


AAACAACAGT 


AGGCAACTGC 


CCTTTTTCTA 


TAATAATTTG 


ACTAACGATA 


AAAAAGAAAA 


3600 


TGAACCAAAT 


AATCATCTTA 


TACGCTCCTG 


ACTTTCTTCT 


CTGACTTAGA 


AGGGGCAGGA 


3660 


TATTCATATC 


AGAACACCCG 


CCCTAACATT 


TTTTAATTTT 


TTGCTGCTTG 


TTCAGCTGCT 


3720 


AATTCTTCTG 


CTTCTTTATC 


CAATTGTTGT 


TTTTCATAAA 


CTTTGAAAAA 


TGGATAGTAG 


3780 


ATAGCCATTG 


TAATTAAGAA 


GTTAACAATC 


ACTAATACTG 


CGGCCGAGAA 


ACTCCAATTG 


3840 


GTACTCATCC 


AAGCAGCAAT 


TGGTGCTGGA 


ATTGCAAACG 


CTAAGCGTGC 


AGCCATCATC 


3900 


GGAATTATAT 


TAGCGACTGT 


TAAGAAATAA 


GATAACGTTG 


TGGTAATTAA 


CGGTGCCACG 


3960 


ATAAACGGAA 


TACCTAAAAT 


AGGATTCATT 


ACGATAGGCG 


TTCCAAAAAT 


AACTGGTTCA 


4020 


TTAATATTAA 


ACAAGCCTGG 


TAAsAACsAT 


AAACGGCCAA 


GACTTTTCAG 


ATACGTTGAT 


4080 


TTAGAAAACA 


TCATTAAAAC 


AACTAAAGAA 


AGCGTAGTTC 


CAGCGCCACC 


AATCCAGATA 


4140 


AACCATTGTA 


AAAACTGTTC 


TGTAAAAATA 


TTAGGTAACT 


GATGAACGTT 


TGCGCCATTG 


4200 


GTAAAAGCTT 


CAAGGTTTTC 


TGCAATCGAC 


ATATCCCAGA 


AAGGACGAAT 


AACTGGGCCC 


4260 


ATAATCGCTG 


GACCATGAAT 


CCCTAAAACC 


CAGAAGAAAC 


AGATTAAGAA 


GACTGTTAGT 


4320 


AAGCCGCCAA 


ATAAACTGTT 


ACCAGCAAGG 


ATACCTTTAA 


GTGGCATTAA 


TAACGTACTC 


4380 


AAGAAACCAT 


TCAAATCAAA 


ACCAATTACG 


TGGCGAATAA 


CCCAGAATAA 


TAAAAGAATA 


4440 


ACTGCTCCTG 


GAATTAACGC 


AATAAATGAA 


TTTGACACTT 


CAGGCGGTAC 


GCCATCTGGC 


4500 


ATCTTAATCA 


TAATATCTTT 


TTCAATAAAG 


AAGCGATAAA 


TTTCAACAGA 


TAACAGCGCT 


4560 


GTTACAATCG 


CACCAAATAA 


AGAAGCTGAA 


CCTAAATTTG 


CTAAATTAAT 


ATAGCGCCCT 


4620 


GCTGTAATGA 


CGTTGTCAAC 


ATCTTCAAAG 


ACACGTGTTG 


GTGGTGCCGC 


TGTTACTAAA 


4680 


AAGGCCATTA 


ATGCTAAAAT 


TCCGCACGTC 


AAAGAGTCAA 


GCTTATAACT 


TTTGGCCAAT 


4740 


GATGAAGCAA 


TCCCAAATGT 


TGCATAAAGT 


GCAAGTAACC 


CCATCGTATA 


GCGGGAAGGA 


4800 


ATATCTAAAA 


TTGCTTGATA 


AGGTTCAATC 


AATTTTGTGT 


ATGCCTCAAT 


TGGAATATTT 


4860 


TGAAAAATTG 


TAAAGAATGA 


GCCCACAATT 


GTTAAAGGCA 


TCGTTGCAAT 


TAATCCTTTT 


4920 


CTAATCGCTG 


TCATATGGCG 


TTGCGAACCA 


AATTTATTCG 


CAATCGGCAT 


TAACTTAACT 


4980 


TCCATCCACT 


GTACAAAACC 


ATTCATTTGT 


TAATTCCTCC 


GTTCAACTAA 


TAAArATAAT 


5040 


AATTATTTCG 


CATTTATAAT 


AACATAGTAA 


cATTATAATG 


TAAaCGCTTA 


TTCTTATTTG 


5100 


TGACAGAGAA 


AGTGCAAAAC 


AAAAGGCTTT 


TTCTTATTAT 


AAAGAAACTG 


GGaATATTGC 


5160 


ACCAAAAATC 


ATTGCGCCTA 


AAATTGCGCC 


CCCAGCGATT 


GGTTTTTTCC 


AAATGTAAAA 


5220 


ACCTAAAGCT 


CCTAATGTAG 


CACCAATCCC 


AATAGGAATC 


GAAGCACCCG 


CCGCACTCAA 


5280 


AATAATTAAC 


GGGCCTAAGA 


AGCGTCCTGA 


GGCATTACCA 


GCCCCCATCA 


TCACATCCGC 


5340 


ACCAAAAGTG 


GAATTTCCTT 


GATTGATAGT 


AAATTTCCGA 


ATCAACACAA 


TTACGCCACC 


5400 
AACTGCTAAC 


CCTAGTAGCG CACCTGTTAA AAGTGCTAAG CCAAAGTTTT CTACAGGTGC 


5460 


TTTAATACCC 


ATGCTGAGGA AAATTGCAGG TACGCCAATC CCGATCCCCG TTAAAATTGA 


5520 


GCCACCTAAA 


TCTAAAATCC CGACTAAAGA ACCTTCTAAA ATTCGTGCAA ACAGGAAACT 


5580 


TGCACCAAAT 


GCTGCGGCAG cGCCATaAGA ACCACCATCT AAACCAGCTT TTAGCATCGC 


5640 


AACAATTGAA 


ACTTCATTAA AGGCGCCCAC ACCATAAACT ACATACATAT GTGTTCCTGC 


5700 


AAAAATTGCC 


GCTGACATAA TCCCGACTAA AATGGGAAAG CCCCAATCAG CATACCAAAA 


5760 


ACTTTTGCCT 


TTTAAACCTT CTTGAATTTC TTGTTCTTGT TGTGCCCATT CATTTTTTTC 


5820 


TTGCTGTTCC 


ATCTATGAGA CCTCCTTTTT TCTATTTAAC GGTGAAGACA TTGTGTAAAT 


5880 


TATTTAACCA 


TTGAGGAATA GACAAGCGGA AACTTTCGAT CATTTCTAAG TCGAACCCTC 


5940 


GGAAGAAGCC 


ACTTAAAATA AATAATAAGA TAATGACACC AAGCATGCCG CGTGTCATTC 


6000 


GGGTCCACCC 


TAAATCATCG ACACCTTTAC CGATTAAGAT CCCTAATACA ACTCCTGGGA 


6060 


CAGCATTGCC 


CATGACTAAT TGAGATAATC CGCCAAATAA TGTTCCCCAG AAACCCGTCC 


6120 


GTTTTCCTGC 


ATCTAAAGCC GCTAGCCAGA ATAAAACAGG CATCACAGTA TTGATTAAAA 


6180 


TCGTTGCTGC 


AGGTACCAAC ACCGCAACCG CTGTTACTTG TAGCGATTCT GGAATTGCTG 


6240 


CAGCTGTACT 


ATTTAAGAAA GCCACAGTCG CCATTCCTAC AAAGACCCCG ACAATCCCCA 


6300 


TTTTTTTGGG 


ATTATGCATC GTTTTAGTGA CATCTTTATC CTTTACCAAT AAAATTGCTG 


6360 


CAGCCCAGTT 


AGGCACAATC CGGTGGGTCA CATCTTGTGT AAAGGCCCCT GCTCCGACGG 


6420 


CCGAAGCCCA 


AGCATTAAAG AAAAAACCTA AACCAAAAGA AAAGTGTGAT GCTGGATCTC 


6480 


CTTGTGCCGC 


GTTCATTTCT CCTAATGTTC TAAATGCCCC TAACCCTTGC GTTTGTGGTG 


6540 


CATGAAACAT 


CCGAGCAGCG CCTGCGCCAG CTGCAAAACC TAGTAAACCA CCAATAAGCA 


6600 


ATGACTTCAG 


CAAAATGATT ACAAATTCCA TTCCTTCTTC CTCCCTTTAT ACGCCTCTTT 


6660 


TTGAAAGAAA 


CGGAACTACT ACTGCGTTCG GTTCTTTAAT TGTTTCCGTT AAAAAATCAA 


6720 


CTTCTTCGAC 


TGCAATGGCT TGGATTTCCA CTTCTACAGC TAGTTCGACA TGATAAATCA 


6780 


CTCTCTTTCT 


AGGAAAGAAG AAGAACAGAA ATTTTTCTGT ATAAACTTGT TCTTTTGCCT 


6840 


GACGAATAGT 


TACTTCTTTC GGTTCAATCC GGATGAGGAC CTCTGTTGTT TCTTTTAATA 


6900 


CTTTTTTTGA 


AACATCATTC AATGCACTTG CAAAAGCTTC ATTCTTCTTA TTACCTTTTC 


6960 


CAGAAACAAT 


CACGGTTTGT GTTGACCGTT TTGTGGCATC CATAATTACC GCTCCTTTCT 


7020 


AAGTTACTAC 


TTTCTATGCT CCGTGTTTCT TAATAAATGC TTCAGTTACT CGTTTACCAA 


7080 


GTTCTTCCTG 


ATCCATAAAA CCAAATCCAA GAACAGTTTT CCCATCGTTA ATTGCTGTTA 


7140 


CCCCTTCTTC 


GATTGATCGC ATGCCATGTC TTTCGGGATA CCCGTATTTA GTAGCAGCCG 


7200 


TTAACGCTCC 


TGCACCACCA CTTCCACAGA AAGAAATGCC TAAATCAGCC TTTTCTTGTT 


7260 


GCATCACATC 


TCCTAAACGC ATATCAGCGC CCATTCCTGG AACAACAACC GCGCGACCAC 


7320 


CGGCTTGCTC 


: CACTCCTTTT GCAACATTTT GTCCTTTTCC TAAACGATCT GCGATAACTA 


7380 
CTGTAATCAT AACTTTTCCT CCTATTAATT GTTTTGTTTA GCAACTTCAA AATGAACAGC 7440

TAAAACATAC CATTCATCTT CTGATAACTG ACCAATTGCT TCTGTTGTTT CTTCTGCAAT 7500

TTTCAACGTT TCTGGCGTTA GTTCCGTAAA TAACGTCCGA TCAACACCAG CTAATGTCGT 7560

TCCTTCTCTT GAACGATTCA GCATCTCATT GAGATGGTTA ATCAAAATCG TCGACTGTAA 7620

TTCCGTTGGT TCAATTTGTG CTTCTGCTAA TCGCGCACTG GTTTGTTTGA TAATCGATTG 7680

TAGCTTTTCT GGTTCAGCAC TTTGTCGAAT GATCTCCTGT GCTTCCATTG CTAGTTCCAT 7740

CCTCTTCCTC CTTTGGTTGT TCTAAATAAT TAAGTAATGC TGAAATTTCA GCATGATTGA 7800

CAGGTAACTC ATTTTTAGCA AATAAACATT CAATTTGCCG AATTTCTTCT TGCCTTTCAT 7860

GAGATACTTC TCTATTACAA GGAATCTCTT CGTATTGTTT ATTAAACGTA ATCCGATGTG 7920

TCATTAATAA AACGTGAAGT AAAAAGGCTT CTTGATATTC GCTCAGTAAC TTTGTTTGAA 7980

ACATTCGTTT TAACTCTTCA TAAATAATAT ACGCTTTTAC TAATAAATCT TGGCTTTGTA 8040

TCCGTTCATT TTTTTCAATT ACATCAGATT TCGTGGAGCG AGCGGTACCT TTCCCTTGAC 8100

GTTTCCCTAA TAAAATTTTA TCGACCTCTT CCCGAATCAA ACTTAAGTCT GTGTCCGTCA 8160

ACAATGGACT AACTTTAATA AATGGGTATG TCATTTCTTC AATTGGGAAA ATACTCAATA 8220

CCAAATCGGG CTTTTCTGCT TGAATCACTT CAGTTGCATT TAAAACAGAA GCAAATCCTG 8280

CGAGTTCAAC GTTTGGTACT TCCTCTAAAA TTTTTTGTTC AATTAAACTG GTCACACCTA 8340

AGCCCGTCGA GCAAACATAC ACAATCCTAA CAACTTTAGC TTCCTGCTGT TGTCGTTCCT 8400

GAGCTGCCAA AAAATGCAAC GCAATATATG CGCAGAAAGA ATCGTTGACT AACAAAACAG 8460

AACGTTGAAT TAATTGTTTA CAGGCACTTT GAATAGCTAA AAACAACGCT GGGTGTTTTC 8520

CTTTGATTTC TTCAGAAAAC GGATTGTATT CATTAATAAA TAAATGTTTC TTTGTAAAAC 8580

GTAAGGACAG GTGTGCAAAC AAGTTCGTAA ATAATTGGCG ATCATTTACA AAAGGATAAT 8640

TCAGCTCTTT TGAAACGCTT TCAATCACTT GCTTTGTTAA TTGAACCATA TCTTGCCGTT 8700

GATGGGTTCC GAATAAATCA CTATAAATAT AAAAATATTC ATCCGCTAAT AAAGGCAAGT 8760

CATACCGTTC AAACACCTCT TGTAAAAGTA AAAGTGGTAt TCGTT 8805


(2) INFORMATION FOR SEQ ID NO: 136: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:

TATTTTTTGC TTAATATAAT TAATGACATC TTTCTTTTGT TCTTTAGCCT CTGATATATA 60

TTCCTTTAGT TCTTTATCCG TAATTGTTGA ACTGTTAATT CCTTGTTCAT ACAATTGTCG 120 

TCTAGTTTTT GAAACATCAT CTGTTGGATT CAATTCTCTA TTCTTTAAAT TCTCGGTTGT 180 




TTCAATTGTT ATAGATGATG TAGAAGTTGG TGTTGTGTCC GGATTACTAC TATCTTTTTT 240

AGAAGACTGG GTAGTTTGCT GCTTAGTTGA TACTTCAGTA GATGAAGCGT CGCTATTTTT 300

TCCCCCAGTA TTATTACTAC ATGCCGTTAA AAATAAGGCT AACAATAGTA CAATAAGTTT 360

TTTTTTCATT ATATCCTCCT ATAAAATAAC GAAACATGGT CGGCCTTCTG TCCAATCAAT 420

TGGATCATTT GACGGGGTAT TCCAAAGTGC TTGTGTTGAC ACCCAACCTG AGTAAAACTG 480

GCGACCATAC GGATCATAGG TTAAAGTGTA ACCATTTTTG TACTCATGAA CGACAATAGC 540

ATGTCCAATC CCAACATTTT GATATAAAAT AACGGGATAT CCTTCAGAAA GATAGGCGTT 600

TAATGTAGCT ACATCATTAA TAACTTTGAT CGAACGGCCA TATGATTGAG CTGTTATCAC 660

TAAATCAGTT GCAGAAGCGC CAAAGTATCT TTGATTTAAC CCGCCATAAG ATGCCATTCT 720

TGTTGCAACA CTGACAGGAT TTACATTATA ACCGTAAGAG CCTCTTAAAA TCATAGATAA 780

ACTTGTTGGA ACACATCCTG cTGCGCCAAT GGTATAGTTT CCGATATAAG TATTAATCCA 840

TTGTGGGTCA CGTTGAGAAT AATAATAAGG TTTTTTCTTT AATCTTTCTG CACGTAATCt 900

GTTTGCTATC TCTGCTGGTG TGcGAGCTTC ATCGCCAACT AAAAAGGmTT TTCCgATcCT 960

TTAGGAGCAG TTTCrCCyTT TCTAACTAyT TTGATTTGrA TrGCTTCyAA ATGTTTAGAk 1020

GCAGCTTGAC TTCCAGCTGr GGCrCCATTA GCTGCCCAAT TyAACCAACC TTTATCTTGG 1080

ATATGGACAC GGTAATAGAC ATCATAATAT CTTGAAACAT CACCGGmCAG TTTAATTTTA 1140

ATGGCTTCTA ATTGTAAATT TTTTCCAGTT GTGCCAGAyA ATGCCCCATT rGCTTTATAA 1200

kCTTGCCATC CGATATTTTG GATGTGCGTA GAGTATTCTA TrTTTCCAGC TAGAGCAGCA 1260

TCACTCAGGT TGATTTTGAT TGCTTCCAGT TGTAAATTTT TCCCACTTGT ACCAGCTATT 1320

TCrCCATTCT TGACAACCCC TTGCCAACCA ATATTTTGCA CGTGTGTTTG ATArTTGACA 1380

kTTGGTTTGA TTTCATCTGG ACGTTTTGCT TCATTTCCAA CTAAAAAGGA TTTTCCaTTT 1440

CCTTTAGGAG CAGTTTCGCC CTTTCTAACT ACTTTGATTT GGATGGCTTC TAAATGTTTA 1500

GATGCAGCTT GACTTCCAGC TGGGGCGCCA TTAGCTGCCC AATTTAACCA ACCTTTATCT 1560

TGGATATGGA CACGGTAATA GACATCaTAA TATsTkGAAA CATCACCGGA CAGTTTAATT 1620

TTAATGGCTT CTAATTGTAA ATTTTTTCCA GTTGTGCCAG ATAATGCCCC ATTGGCTTTA 1680

TAATCTTGCC ATcCGATATT TTGGATGTGC GTAGAGTATT CTATGTTTCC AGCTAGAGCA 1740

GCATCACTCa GGTTGATTTT GATTGCTTCC AGTTGTAAAT TTTTCCCACT TGTACCAGCT 1800

ATTTCGCCAT tCTTGACAAC CCCTTGCCAA CCAATATTTT GCACGTGTGT TTGATAAwTG 1860

ACAGkTGGTT tGATtTCATC TGGACGTTTT GCTTCATTTC CAATTAAAAA GGCTTTTCCA 1920

TTTCCTTCAG GTGCCGcTTC GCCCTTCTTC ACTAATTTAA TTTGAATGGC TTCTAATCTT 1980

TTAGCGGCGC TTTGACTACt GCGGACTCAC TGTTTTTTGC CCAATTCAAC CAACCTTTAT 2040

CTTCTATATG CACACGATAA TAAACATCAT ATGCGTTAGc tATTTCTCCA GTTAAGCGAA 2100

TTTGAATCGC TTCGAGCCGC AAAGACTTTC CGCTTGTACC AGACAATTGA TTATCTTTCA 2160
CATATCCTTG CCACCCAATT TCTTGGACAT GTGTTCTATA TTCJACAGAT CCAGcCAAAT 2220

CTGAATTTGA AATATTCATT TTAATTCCTT CAAGACGTAA AGAGCGTCTG CTAgkcCCAG 2280

AAATTTCCCc GTTTTTGaCA AcCCCTtGGC cAwCCGATGT CTTGGGATAT GTGtCtGATA 2340

ATTAATtCAG gAcTTTTyCT cTGCtTTGCC gGTGCTATAG GTGTGtCCTC ATTACTTkGT 2400

GATGTTyCCA AAGTAGTTtC AGTAcGTCGC TACTATyTTC TGTATCATTA ACAGTTGTTA 2460

CTTCTGATGT AGAAACTTCA GTGGTGGAAT CGGAGGTTGA GGAAGTGGAC GTTGTTTCTG 2520 

TTTCCTCTTG AGAATTTATG ATGCTTGGGT TCATTTCTTC AGTGACACCG CTACTCATTT 2580 

GTCGCTCCGT TAGAGATGTT TCTGAAGCCA CCGTTTCTTC AGTAGTTCCC TTAGAGCGTT 2640 

CCAAAGCGTA AGCAGAAATA GGGGTAACGA GAAATCCCGA CAACAAAACC GCTAACATAG 2700 

AACTAGTTAT AACACTTTTT TTCATTAATT TACTCCTT 2738 
(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

GATTGGACGA AAGTAACTAA TAAATCTGTT TTTATAACTG GTGCTTCAGG CATGATTGGG 60

ACCTTTTTAA TTGATGTGCT TATGTACCAA AATAAAACGA GAAATGCGAA TATTTCGATC 120 

TGGGCGATGG GACGAACATT GTCGCGTTTA GAAGAGCGAT TTACTAGTTA TCTAGAAGAT 180 

CCGTTATTCC ATATAGTTAT CGGAGATGTC ACAGAAGAAA TTCAGATAGC AGAAGCTTGT 240

GACTATGTGT TGCATTGTGC TAGTAATACA CATCCTAAAT CATATGCCAG TGATCCAATT 300 

GGAACGATTA TGACCAATAT TGCTGGTACG CAGCATATTT TAGATTATGC GGTCAAAGCG 360 

AATAGTGAAA AAGTGTTATT TTTATCAACT GTTGAAATCT ATGGAGAAAA TCGTGGAGAT 420 

CTTGATAAAT TTACGGAAGA CTATTGCGGC TATATTGACT GTAACACGCT ACGGGCCGGT 480

TATCCCGAAG GAAAAAGAGC AAGTGAATCA TTATGTCAAG CCTATATTCA GAAATATGGT 540

ATTGATGTTG TTATTCCAAG AATTAGTCGT ACTTTCGGGC CTACGATGTt ACTTTCGGAT 600 

AGTAAAGcTT CTTCTCAATT TATTATGAAT GCAGTCCATA AAGAAAACAT TGTGTTAAAA 660 

AGTGCAGGGA CCCAACTATA TTCTTATGCA TATGTTGCAG ATATTGTATC TGCTCTTCTG 720 

TTTTTATTAG TTAAAGGGCA AAAAGGGGAA GCTTACAATG TTTCAAACGA ACATTGCGAT 780 

ATTACGTTAC GTACTTTTGC GGAAACATTA GCCAATGTGG CTGGAACAAA AGTGATTCAT 840 

GGAGAAGCCA CCGTCCAAGA AAAACAAGGG TTTTCTAAAG CAACAAAAGC ACTACTAGAC 900

AATCAAAAAA TCTACGCATT AGGTTGGCGG CCACTGTACG ACAGTATGGA AGAACCATTA 960

AAACATACAA TTAAAATTTT AGAAGGTAGC GGAAATTGAT ATGCCCAAAA TTAGTATTAT 1020 

TGTTCCTGTA TACAATGTAG AAAAATATTT AGAAAAATGT GTACGCTCTA TTTTAGCTCA 1080 
AACGTTTACT GACTTTGAAT TAATTCTGGT GGACGATGGC TCTCCAGACA GTTCTGGAGC 1140

AATGTGTGAT CAGTTTGCTG AACAAGATCA ACGGGTAAAA GTTATCCATA AAGAAAATGG 1200

TGGGCTAAGC GATGCTCGTA ATGCTGGAAT TGAAATAGCA ACAGGTGAGT ATTTAGGTTT 1260

CGTAGATAGT GATGATTACA TTGCAGATGA TATGTATGAA CTATTATATA CAAATATCGT 1320

AAAAGAAGAT GCCGATTTAT CAATCTGTGG TATTTATGAT GTCTATGAGG GGAAAGAGCC 1380

AATTGTAAAA AGTTTAATAC AGGGAACTTT TTCCAGAGAA GAAGCATTAT TGTTAATTTT 1440

ACAAGGAAAT ATTATCTCCG TACATGCTGT AAATAAGTTA TATAAACGAA AACTCTTTGC 1500

GGATTTAAGA TATCCTAAAG GAAAGTATCA TGAAGATTCA TTTATTATAG TAGATTTGTT 1560

GTCTGAATGT CAGAAAGTCT CTATTGATTC AACACAAAAA TATTACTATT ATCATCGAAT 1620

GGGCAGCATT AATACAGAAA CATTTTCAGA TAAACAATTT GAGTTTATAG AAGCTTGGGA 1680

GAAGAATGAA CTTAAATTGA AAGGTAAGGG AGCAGTTATA GAAGAAGCTG CCCATCAAAG 1740

AGTTTGTTTT GCGAATTTTT TAGTATTAGA TAAAATTTTG ATTAGTAATG CTCCGAAGAA 1800

AAAAGAAACG AAACAAATTG TTCGCTATCT TCGGGAAAAC TTTATTTTTA TTATGAAAAA 1860

TAAAGTTTTT ACAAAAAGTC GTAAACTATC AATGATTTTA CTTATGTTTG GTCTTCCTTT 1920

TTATAaAATA CCAATTAAAT TGAAAAGAAA ATAGTCAAGT CCAGACTCCT GTGTAAAATG 1980

CTATACAATG TTTTTACCAT TTCTACTTAT CAAAATTGAT GTATTTTCTT GAAGAATAAA 2040

TCCATTCATC ATGTAGGTCC ATAAGAACGG CTCCAATTAA GCGATTGGCT GATGTTTGAT 2100

TGGGGAAGAT GCGAATAATC TTTTCTCTTC TGCGTACTTC TTGATTCAGT CGTTCAATTA 2160

GATTGGTACT CTTTAGTCGA TTGTGGGAAT TTCCTTGTAC GGTATATTGA AAGGCGTCTT 2220

CGAATCCATC ATCCAATGAT GCGCAAgCTT TTGAATATTT TGGTTGATCG ATATAATCAT 2280

GAATCAATCG ATTTTTAGCC TCACGCGCTA AGTTAATATC TGTGAACTTA AAAATTCCTT 2340

TAACAGCTTC TCTGAAAGAT TTTGAATTTT TTTTAGGAAT GGTGGTAAAG ATATTTCTTA 2400

GGAAGTGAAC TTGGCATCTT TGCCAACTTA CGTTGGTGAA GGATTTTCTA ATGGCAGAGA 2460

CTAATCCTTT GTGCGCATCA GAAATAACGA GTTCCGTACC TTGTAAACCG CGTTCTTTTA 2520

GGTATTCAAA AAATGTTGTC CAGGTCTCTT CGCTTTCGCC ACTTTGAATC ATGAAGCCGA 2580

TAATTTCACG GTCGCCATCT TTGGTTATTC CAATCGCTAT ATGACAGCTT TTTGAGAGTA 2640

CTCGATTTTC TTCTCGTACT TTTATATAGA GTACATCGGT CATTAAGTAA GGATAATTTT 2700

TTTCTGATAA TAAACGATTC TGCCACTCGT TAACCATAGG TTCTAGCTGT TCTGTTAAGC 2760

TAGAAACGAA GGACTTAGAG ACGGATTTAC CACAAAGTTC TTCCACAATT TTTGATACTT 2820

TACGAGTTGA AACGCCTGAT ACATACATTT CCAACATTGA AGCCATGAGG GCTTTTTCGT 2880

TTCGTTGATA ACGTTCAAAC ACTGTGGGTG AAAAATGGCC ATCACGTGTT CTGGGTACTT 2940

TTAATTCTAG CGTGCCTACA CGTGTCGTAA AGcTGCGCTC ATAATAGCCA TTTCGTTGAC 3000

TTTGTCGGTT TTCTGTTCGT TCATATTCTT TTGCTTGAAT ATATTCTGTT CGTTGATTTT 3060
CCATTAGTTG ATTAAATACC GTTGTTAAAA TATTTTTAGA AACGTCATCC TTTACAGAAT 3120

ATTCAATAAT GCTTTGAATC TCTTCGCTTT TCAGTGTAAA ATGTACTTGG GTCATGTAAA 3180

AGTCCTCCTG GGTATGTTTT TGTCGTTAAA AACATTGTAC CGTAAAAGGA CTGTTATATG 3240

GCCTTTTTAC TTTTACACAA TTATACGGAC TTTATCGACA AACTGATAAA TTAAAAGATT 3300

CAGTAAATAA TTCCATCGAA GATATTATCA ATGAAAAGAG TTTAACTTTA AGTAATCAAG 3360

TAAAAGAATT AGAAAAAGAA TATGAAAGCA AAATACAAGA AGCAGAAACA ATAAAAGATG 3420

AGAGTATATT AAAAGCAGCA ACTCACCACT ATGAAAAACA GAAACAAAAA ATTGAGGAGT 3480

CTTTTACAGA AGAAATTAAC AAAGAAGTTG AAAAAACGAT CAATACTGTT GTGGAAGAAC 3540

AAATAGAAAA AGTGGAAGAA AAGAAAAAGA AGACAACAGA AGATGATGTT CGAGATCATT 3600

TACGAGGATT TGCCCGTACA ATTCCTGCAT TCTTAATGGC ATACGGAGTT GAAGATACAA 3660

GATTAGAAAA TTTTGAGGTT AATATTGATG AAGCCACATT CGAAGATTTG ACGAGTATTA 3720

CAATAGAGGA ATTCAAGAGG CTCCGTGATG GATTTGAGTA TCCAGATAAT AATGGGGAAA 3780

CTAAAGTTAT CCCAGGATTA TTCAATGAAG TTGTATTTAA TGCTAGTATC CAGGAATTTT 3840

TGGAGACAAA AAACCGTTTG GCAAATTATT TTGATGATAG CTTGCAAGAG GACATTTTTG 3900

ATTATATCCC ACCACAGAAA ACAAATCAGA TATTTACACC AAGACGTGTT GTAAAATCGA 3960

TGGTAGATTC CTTAGAAAAA AGAATTCCAG GAATTTTTTC GGATAAAACA AGGAAATTTG 4020

CAGACTTATA TGTTAAAAGC GGTATGTACA TTACAGAAAT AGTCAAGAGA TTAAATCATG 4080

GATTAAAAGA TCAGATTCCA AATCAACAGG AACGTATTAA ATGGATTCTT GAAAATCAAG 4140

TTTATGCATG TGCTCCAAGT AATATTATTT ATAATATTGC CAAGAACTAC ATATTGGGAG 4200

AAATGGAGCA TGTTTCTAAT ACAAATATTA TTGAACTTGA TACACTGGCT TTGGCAAAGG 4260

AAGCAAAGTT ATCAGAATGT ATTTATAAGG AATTCGGAGA TGTTAAGATG AAATTTGATG 4320

TAATTATAGG AAATCCTCCA TATCAAGAAG AAAATACTGA AAGGAATAGA GATGACGCTA 4380

TCTATCACTT ATTCTTAGAT GAGGCGTACA AGATCGCAGA GAAGGTTGTT ATGATTACGC 4440

CAGCAAGATT TTTATTTAAT GTTGGATCGA CTCCAAAAAT TTGGAATGAA AAAATGCTAA 4500

ATGATAAACA CTTATCTGTT GAAATGtATG AACGGAAAAG TGaAAAAATC TTTCCTAATA 4560

CaTCgATTAT TGGTGGAATC GCAGTCACTT ATCgTGATGC CTCAAAAATC CTAGGACCTA 4620

TTGGAACTTT TACACCGTAT CAAGAATTAA ATTCAATATT GAGTAAAGTA AAAGAAATTA 4680

ACCAAGAAAG TAACTTTAGT TCACTGATGT TCGTACAAAA TAAATTTAAT TTAGATAAGC 4740

TATATCGTGA TTTTCCAGAT TTTGAAACAC GTCTTGGAGG AAATGGACGT GAACGCCGTT 4800

TAACATCTTC AATATTTAAT GTATTACCTG AAATTTTTTC AGATAAAAAA ACAAAAGAAC 4860

ATAATCTGGC AATTTACGGT AGAGAAAaTA ATAAGCGAAC aTATAAATAT GTCTCTAAAG 4920

AATATATTGA GGAACATCCA AATATGTATA AATGGAAGGT TTTTGTTCCA GCAGCAAATG 4980

GTGCTTCGGG TACAATAGGA GATAGTCCAG CAAGAATTAT TAGTTTCCCG GTCTTAGGTG 5040
AACCTAATTC TGGTCATACC CAAACCTTTA TAAGTTTAGG AGCTTTTGAT ACGAAGTTTG 5100

AAGGTGAAGC GTTGTTAAAA TACATCAAGT CAAAATTCGC CCGTGCGATG CTAGGTGTAT 5160 

TAAAAGTAAC GCAAAATAAT AAGACTCAAG AAACATGGTC AAAAGTGCCT GTATTAGATT 5220 

TCACAAAAGA TTCTGATATC GATTGGGAGA AACCCTTGCC TGAAATTGAT CAGAGTATAT 5280 

ATAAAAAATT TAGTTTGAAT CAAAATGAGA TTGATTTTAT AGAAAACCAT ATAACGTACC 5340

TTGATGAAGA AAGTTTAAAT AATTTATAAG TCAAGATAGA TACTTGTGAT TAAAAAAGTA 5400

ATTGTATTCA ACTGGCATCT ATTTTTGGTT CAAATTGGTT CAAGTAATAT CAATATCAAT 5460

CTATTTCCAT ACATTTTTAT CTATATAACA TTTCCCAAAA CGTTGATATA AATGGATTAG 5520 

GATGTATTCA ACACACGTCG TATTTTTATA 5550 
(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

AGAAGAAAAT AnCATCAAAT ACGCTTTCTC TAGATGGGTT AAAATGCTTG GAGTACACAT 60 

TACGATACGA TTGTTGCTTC CATTGTGATT CCTTTTGAAG AAAAAGATGG CTnTGTGACA 120 

GAACTATCTG CAGGCCAAGC CTTTAACCGC ATGATGGGAG ATCCTGACAA AGAGGTTCGT 180 

CAACGTTTGT TTACAGCCTG GGAGAAAGCT TGGAGTGGCA AAGCTTCAAT TTTAGCAGAC 240

ACCCTAAATC ACTTGGATGG GTTCCGTTTA TCTGATTACA AATTGCATGG CACTACAGAT 300 

TATTTAGAAA ATCCCTTAAA CTATAATCGA ATGAGTCAAG AAACATTAAC GGTTATGTGG 360 

GaAACCATCC AAAAAAATAA ACAACCATTC GTCGACTTTC TAACGCGCAA AGcGCAAtTA 420

TTTGGCAAAG AAAAAATGGA TTGGCAAGAC CAAGATGCGC CAATTATTTT AGGTGATTTA 480

GAAGAAAAAA CGTATTCCTT TGATCAAGCC GCTGCCTTTA TTTTAGAAAA TTTCCGTAAA 540

TTCAGTCCAA AAATGGCTGA TTTTGCCCAG ATGGCTTTTG AAAAAAGTTG GATTGAAGCT 600 

GAAGATCGTC CAGGCAAACG TCCTGGGGGT TATTGTACAG AATTACCTGA AACACAAGAG 660 

TCGCGTATTT TCATGACCTT TGGCAATTCA ATCAACGAAG TAGCTACTTT AGCCCATGAA 720 

TTAGGTCACG CTTTCCATAG TAGCGTGATG TGGGATTTAC CTTCATTGAA TCGAGAGTAT 780 

GCCATGAACG TAGCAGAAAC GGCTAGTACC TTTGCAGAAT TGATTGTTGC TGACGCCACC 840

TTAAAAGAGG CCAAATCAGA TGTTGAAAAG ATTAACCTTT TAGACACAAA AATGCaAAAT 900 

GCCaTTGcCA TGTTTATGAA CATTCaTGCG CGtTTTATCT TTGAAAATAA TTTTTaTCaA 960

GCGCGCCAAA AAGGTTTAGT AGCGGAAGAA GATATTACTG AATTAATGTT AGCGGCGCAA 1020 

AAAGAAAGCT ACAAAGATAG TCTTGGCAGT TATCATCCTC ATTTCTGGGC GAGCAAATTG 1080
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71 /-* rp rn rn rp rp tv 

CACTTb I i in 


I 1 b A 1 b A I b 1 


bbb 1 i ILirtl 


h ATTTPPPAT 


ATACTTTTGn 


PTATPTATTP 


1140 


Abb A JLbbbiA. 


I-CuA 1 bUo 1 A 


Trrra atp a a 

1 bbb AA 1 ^nn 


PAAPPPAGTT 


CTTTTGAAGA 


PP AATAPATT 


1200 


GCGTTATTAA 


bAbAb AC I bb 


TTPPATPAPB 

1 1 bbAl bAbA 


appp, a ap a at 


T AftfflAAAAA 


APATTTAAAT 


1260 


GTTGACTTAA 


C AAAAC C T b A 


111 b IbbbAA 


pppppp attp 




A A A APAT^TP 


1320 


GAACAATTCA 


TGACTTTAAC 


TGAAAAb 1 Al 


PT 1 A A APT A AP 
b 1 nftftL. 1 AAb 




bb Ab 1 AAbbA 


1 JUU 


ACATCAGGTT 


ACTGCTTCTC 


TTTTTTTCAA 


Ai Ab 111 bbA 


1 rVbbbrifibb 1 


AbbbbAbAAb 


1 A A C\ 


CATTAGTCCA 


AACGTAACAA 


ATTCAACTAC 


TbbbAIbbb 1 


A A CC TV 7A SPB^ 

AAbbrtAAbMb 


r-or-mm t. tv if/^O 

bbb 1 iAAlbb 


1 ^fift 
1 DUU 


TAGTAATTTC 


ACCAAAATAA 


ATAAAACTGG 


s~% tv /"^ T\ Tv rp rp TV tv 71 

CACAATTAAA 


TV TV /T> TV T\ f T\ 

AAbAAbbbAb 


bbAAAbAbbA 


1 jOU 


AATCATTAGC 


GCCATTTTCG 


GTTTACGAAC 


rTI /"•/"»/"• TV TV TV TV 

TGCCGAAAAb 


rp tv Tv rp Tv TV TV rp/~" /~» 
1 AA I AAA 1 bb 


PPBTR7AR7ATT 

bbAlAAAAl 1 


J. UtU 


CAATCCTGTA 


AACAAGAAGG 


AAATCGCATA 


TAAGCGbAlb 


bblbibblAb 


bAAI 1 Ibbbb 


lOOv 


CAACTGGTCG 


TTCCCTTCGC 


TATTAAAGAC 


rn Tv tv m TV nrntri 

CTCAATAATT 


rp rp /~i /■*> O /*• rri t\ tv 

1 bTbbbb 1 AA 


AbAb 1 1 b Ab 1 




GATTAAAAAG 


AACAGGAAGC 


CGGCGACCAA 


m Tv 0 rp tv tv rp tv 

GTAbGTAAl A 


A ArT^A 7\ rp rp 

AAbb 1 AAb 1 I 


1 AAb AAAI 1 1 




ACGCAACACC 


TTCGTTTCCC 


CTAAACCATA 


GTATCGGCTG 


TV^rnTv tv rn/** r*rprr\ 

Abl AAi bb 1 1 


bbbbbbb lib 


i r fin 


CCCAATCCCT 


GTAAAAATAG 


CAATCACAAT 


tv TvrnTV rp rp /™" TV Tv 

AATATTGAbA 


rp rp TV Tv r^* Tv TV 

1 IbbbAAbAA 


1 bbb A 1 AAbb 


A. 3 ^ \J 


TGAAATCGCC 


ACATGACCAA 


CTAAGTTTAA 


/~* Tv TV TV TV O TV TV rp Tv 

C AAAAC AA I A 


1 I AAAI AAAA 


TipTv'TIf TV r> TV TV TV 

AbA 1 b AbAAA 


i qro 

X ^ O VJ 


GGCCGAAGAA 


AATTCATTTA 


AAAAGGAGGA 


TV 7\ TV T\ fr™" A TV T^"* 

AAAAbCAAl b 


/-> T\ /-• tv rp tv 7\ rri rp rp 


b i bb AAb A 1 b 


2040 

i 


CTGAAATTTC 


ACTTTAAACT 


TTTCAAAACG 


TAAAACTCGT 


1 bbbbbbAbb 


Abl iAlbbAA 


9 1 n n 


CGTAATCAGG 


ATTAAGCTTG 


TTAACGGTGA 


/- » tv rp K> 7\ /"» /"^ rp rp 

CATCACCG 1 1 


bbAA 1 bbbbb 


bAbb 1 bbbAl 


2160 


ACCCCACCCA 


AATTGATAAA 


TAAAGACATA 


TV rp /-»/"■» 7V TV TV A rp TV 

ATbbAAAA 1 A 


A 1 b 1 Jl b Ab AA 


TTPPPPPBPT 
1 1 bbbbbAv- 1 


2220 


GACAAAAGCA 


ATCGTTGCCA 


AACGAGGATT 


ty rn m j\ rn rrt rn 

ATGGTCATTl 


b i 1 AAAAAAb 


1 AA 1 bAbbb A 


2280 


ATTGTTTAAA 


ATAAATAATA 


ATGAACAGCT 


Tv Tv HJ* TV T\ TV m 

AAGGT AAA 1 b 


7\ rn tv ^ Tv Tv m TA ^* m 

AiAIAAl Ai 1 


PPTTTPPP&fl 
bb 111 bbbAA 


2340 


ACCAAACAAG 


ACACCGGTCC 


CCTGAAGCCC 


bbbAA 1 AA I 1 


rnr^ tv rr\r+r* C 7A 7A 7A 

1 bA 1 bbbAAA 


AAA 1 Lnnu^ 1 


2400 


CACTAAAGTA 


AAGAGACTCC 


CTATAACGAA 


tv T\ Z™ 1 T 1 TV 7\ P* A 

AAb 1 AAbbbA 


Albbi lAAAb 


I AAAA 1 Ab 1 o 


24 60 


ATTGGCTTTT 


TTGATTTCTT 


TTTGCGCAAC 


CG ibb 1 bbAA 


AAb AAbb 1 Ab 


bbbbbbbAAV^ 


2520 


GCCAAACATC 


CAACCTAATC 


t> rn rp t\ Tvm)\ j\ T 

CATTAATAA1 


ATTAAPPAPT 

Ai 1 AAbb Abl 


^PT7A ZiPQPZi 7A 
bb 1 AAbAbAA 


TPTT A AP,PPP 


2580 


CGCCAACGCC 


TCGGTGCCGA 


TCCCATTI bb 


A A TP A A Ti A TAT 


PTPTPPPPPA 


AAATAAAAAT 


2640 


TGATTGCGCC 


GCCATACTTA 


71 7\ 7\ rn rp /"•/■'•rn 71 tv 

AAAI IbbiAA 


fTAf'Iirir'IiTTT 
tnunnLnl 1 1 


TTATA A A APA 


APPPAPTAAA 


2700 


ATGGTTtTGC 


TGTGtTTGCT 


/~% /-> t\ m TV TV TV 

CCATACCCAA 


Alblbbl ibi 


AAbAAAAAAA 


PPAPTPA+-TP 


27 60 


TTTTTGCTTC AAGGTTACGC AAAAATCAGA ATCCTTCAAT TATTCAACCC AGCGATTAAA 2820

TAATCTGTCT ATTTATTGAT ACTTCTAGGG TAGCATTCTC TCTTCGAAAA AACAACCATC 2880

CTTTTATTTT TCTCATTTTA AACTTTCTTA AGCAGTTGCA GTTGTTTTTG AACCACTTGC 2940

CAATCTTCTT TCACGACTTC TTTTAAACGA TAATTATAAA AGACAGCCGC TGGATGGAAG 3000

AGTGGAATGA TTTGATAGGT TTTCTGCGTC CAATGATAGT TTTGTGATTG CTCATCAAAT 3060
GTTTGAATGG GCGACTGAAT GACCGTTCCG TGCACTGCAG TGATTGTTGG CTTTGGTCCT 3120

AAAAGTCGTT GCAACCCAAT ATTCCCCAGT GTCAAAATTA GTTCAGGTTG GCAAACAGCG 3180

ATTTCCCAAT CAAGCAAAGG TGCAAAAATT TTGATTTCTT CTTTGGTTGG CGTTCGATTA 3240

GGAAATTTGG TTACTCTTTC TCCCGTTTTT GCAATCTGCT TCGTTTTTAC AGAAAAAGGC 3300

CGACTCCTAA CCGCACTGGT GATATACACC GTTTCACGAC TTAAGCCAAC TTGTTGTAAT 3360

TGTTCCATCA ACAATTGACC TGAACGCCCT ATAAATGGTA CGTTTTCTTT AATTTCTTCT 3420

TTTCCAGGGG CTTCCCCAAT CAGCATCACT TTTGGCTGTT GCGGACCTTG ACCTTCCACA 3480

AAACCTTCTA ATGGTAAGTC TTTCGCTTTA GCTAAAACTG CTTTTTTCAG ATGTTCAGGA 3540

TAGTGCATTT ACCATCACGC CTCTCTTCTG GTGTTCTTCC TTCGCTATAT TGATTATAAA 3600

TAAGGCCCAA AAGAAACTCA AAAAAAGGAG CCTATTCATT TAAGCGCTGG TTTTTCTACA 3660

CACAAAGCTT TGTCTTTAGC TAAATAAAAA GGCCGTAAGT AACCAACTAA ATTAGACATT 3720

GGTAAAACCT CCGATGGTAA ATCGAGCTTT AATGTCTCCT TTAAATAGGC TTTGCGTGTC 3780

GCAATCCGTT GCCACATGTC TGGATAAGTT TGCTGAATCT CTTTTTGCAG TGTTTCATCA 3840

GCCAAAGCCA CACATTCTTC TGCACTCACC CCTGTATAAC CAGGTACAGA AGGAATAATA 3900

TCCACTTGAA AGAGCATGCC ACTTTCCAAG CGAATGGCTG AATCTGGATA AATAGGCGAC 3960

GACATCCATT CATCATCCGA AACTAAATGC CCAGGATTCA AATGCCAGTG ATAGATTTCT 4020

TTTGGTAACT GTTCTTCAAT CGCTTGGTAC ATTTCTCGCC CTAACAGTCC GATTCGAATG 4080

GTTTCAAGCC AGTGAACGAC AGCTTGAAAA TAGGGTTTTG ctACGCGCTC TAAATAATCA 4140

CGTTGCGCTT CTGGCAATTG CTGCTCGTTT TCAATAACAA AACCTGTTCG GCTAGACAAG 4200

CCGCCTTTGT AGCCTGTAGT TAATGATAAG GCATCACCTA ACTGAATTTC TTTGGCAGTC 4260

GGATACATGT TTGCATATTC AAAACGTTGA CCAGTTGCCG CAATGGTTAC TACCGTCGGC 4320

GTTTGACCTT CATCATTTAA CAAAGCCCCA ATGTCCGTTT CACGCTGTCC TATTTCAATT 4380

GCATTTAAGG CCTTTAACAT ACTGCGAGAA GCCAAATTCG CCCCATATTC ATAGTGCGCC 4440

AATTCGTTAG CATTATTTGT TGCGCGAGCC CCCTTTGCTC CAATCAATAA GTGGGTCCCA 4500

TTTAGTAAAC GAGCTTCTTT CGGCAGCGCT TTTTTCAGTG CTTCGACAAT AAAATGTGGC 4560

ACATCAAACA TAGTAGCAGG TTCTTGCTGT TGTGTCGTAA ACATTTTCCA CCCAACGATA 4620

CCTATTTTTT GAGCAGTTTC ATCGAGAAGC GTTTCAAAAA TCTGCGACAA CTTTTGTTCC 4680

CCAGCCATTG GTTGATTAGG TAGTGAAAAA GCTAGATAAT GAATTAAATC GGCTGAGATG 4740

CGGGCATGTT GGCACAGTTT TAAATTTTCA TTACCTAAAA TAAGTGTCGC CGCACCATCC 4800

TTATTTAACA CAAGAAGTCC TTCTTCAAAC GTCGGAATAA AGCCAGTTAA GTATTCAAAA 4860

TTACTGCCAT GCTCTTTGTC TGCATAGAAG ACCAATTGAT CGAATTTTTC TGTTTCCATT 4920

CGTTGTAATA CTTTCTGCTT GCGCGCCATC ATCGTTTCAT TCGTCAAATA CGTTGGAAAA 4980

ACATTCTCAA AAACAGTAGG TTCAGCAATA TGTGTATAAT TAATTTTTCG CTTCATTATT 5040
TCCAAACGTT CGCTTGTGTT AAACCAGTTG CTTGTTCAGA CCAAACAGGG AAGTTATCCG 7080

CATATAATGG GAAGCTATCT TGTAAGTTTT TCACGATATC TGTATCTTCA AAGAATAAAA 7140 

TTGTGCCCGC ACCAGCTGCA AAACCTTGTA ATTTAGCTTG TGTGTTTGGA AATGCTTCTG 7200 

CTGGAGTTAA TGGTTTTAAG GCGTCTTCAG TCATTGCCCA TAATTTTTTG TGAGCATCGC 7260 

CGAATAAGAA CACAACACGT TGTGTTTGAG AGTTAAAAGC TGTTGGACTT TCTCTAACAA 7320 

CTTCTTTTAC TAAAGCAGTT ACTTCTTCTT TTGAAATTGG TAAGTTTTCT CCTAAAGCAT 7380 

AGATTGAACG ACGGTTTTTC ATCATTTCAG TAAATTTtGA CATAATAtGT TCaTCCTTtC 7440

GAATTtCCTC AGTAAATyCA TTaATTnTGT kTACATGCTT ACTATACTAC CTTACTAAAA 7500

GTAAGTAAAy CTTTTTGTCT TACTTTTTGT AAGTTTTTTT TATTTTTTAG TGAAAAaCCT 7560

TATTCTACCA AAGAAAAAAC AACCAGCTTT TTTCTAAGAA AAAGAAAAAT CTGGTTGTCG 7620

ATTAATAAAT GTAGAAGTCA ATTGAATCGT TTTCGGCCGT TTTTCTTGAA ATAAATGTGT 7680

GGCTTTTTCC CCTAATTCCA TTAATGGATG ACAGATTGTT GAAAAATTTA AAATTTCACT 7740

TAATAAAGTT TGTTCCTGTC CCACTAAAAA GAGTGGTCTT TCAATAACGG CTTGAATCCC 7800 

TGCGGCAATT TCATCACCGT TGGTCACAAC CGCTTGGAGC GTTGGATCTT GCCCACAGTG 7860 

ACGTCCTACT TCACACCCAT CTTCATAGGT TAAACAGTCC TTAAATAACC GATTTTCAGG 7920 

AATCGCTCCA TAAATTTCCT GATATGCGTC AACTAAATAG CGTGCGCTTG TACTTTTTTC 7980 

TGGTGCACGG TTAATCGTAA TTGCCACTTC ACTACATCCT TGTTGTTcAA TGCTnGGAAG 8040 

ACCCGCCGAT AAACGGTTAA ACGATCGATA TAGATACTAG CATAACTAGG GTCTTCTATT 8100 

TTnTCACAAA AAACAACTTG CTCTTTATAT GGAGCAAACa CaCTTAArGG ATTTGCCTTT 8160 

GaCAAAAcAA TCATGCCaTC aAACGCACCC nGTGGCCcAT TTTTTCCmAG tAACTTTTTT 8220 

nCAACTTCTT TATCaTAATC CgTTGGTAAT AAACTGACCA TCATCCCTTG TTGAAATGCT 8280 

TGGTGGATAA TCCCACTTAA AATAGCATCA AAATAAGGTT GATTCGTATA AGGAACCATG 8340 

ACACCTAATG TATTACTTTT CCCAAAGCTA AGATTTCTCG CTAATATATk GGGCTTATAG 8400

CCCaACTCAT CGATAATTGC TTGGATTyTC AGTCGTTTTT CTTcAGAAAC ATAAGGATGa 8460

TGATTAATCA CACGCGAAAC AGTCGTAACT GAAGATCCTG AAAGGCGCGC AATTTCTCGA 8520

ATTGTTGTCA TACTTTCACC TCATTACTTT ACAAAATTGA GTTAAATAAG GCACAATCCC 8580

TGTCCCGTAA TCAACAAAAC GATAAAAGGT CTCCTGCCAA GCAAAAGAAA TTTCCTGAAT 8640 

ATCCTCCTGC TGTGTCAAGT GcCAtCCGTT ATCTGATGGT nACGGrATTT CTTGATGCCA 8700 

AATCTTATAG TATTTTTTGG GACTATTTAA CTGCAGACTT TTGGCCGTCG TAGTCACsGr 8760 

GACAAAATAG TAATCCATAC CAAGTCTATC GCTTATCTCC ACTATAACAT TCATTTTCAT 8820 

GCCCTCCAAT ATTTTGATAC ACCTAGCTTA CCTTATGAAA CGCGTACCAG AGCAAGAGGA 8880

AAAATAAAAA AAGTGAGACC ATGTTATACT AGATGTATCA TATAAAAGCA AAGGATGAGT 8940

GCAATGGAAA CTGTTACAGT AGCAACAAAA AGCGACTTAC TAACAAATTT GAACAAGCAA 9000 
ACAGAT3AAA TCATTATCTC GGGGGCACTT GCACAAGAAA TTTACCACTT AAAAAAACAA 9060

CAA7TAACCG AAACGGAAGA AATGGGCTTT TTAGTGGGCA GCCAAGGTGC TGGCTCATTT 9120

TTAACC7A77 TAATCGAGAA GTTATCCTAT ATAAATTCGG CGCTATCGCC AGAAGAACAA 9180

AAAATT3AAC ATGCGATTTT GCGTTTGTAT GTTATTAAAA AATTAACTCC AGAAACTACT 9240

TTA7TACG7C 7AAAACAGTT AGATTATTAA AATTTTATTT AATCAAGATA GACCAATTCC 9300

CCGCGTACAA CGAGTCCCTT TCATTAATCA CTTGATATAA TTACCTTTAT TCAAGTTCCC 9360

ACCATTGTCA TTATCGATTA ACCAATGGTA TATTAGAAGT AACAGATAAG TCATTTTATT 9420

TGAAAAAG7G TTTGGGAAAT GTTTGGGGAG GAATGTTTAT GTGTCAACCG CAGGAAAAAA 9480

AGAAAGAAA7 7TCTGTTGCT AAGTTACGTT ATCGTTTTGA AGTACCAATG ATTATCATTG 9540

GTT7TG7T7T ATTGTTTTTC GCATTGGGGT TAGTTGCCTT ATTGATATTG AGTGATTTGC 9600

CGA7TCCTGA CTGGTTATTA ATTGTCTTAA TTGCACCAAT TGCGCCAGTC ATTGCTTTTT 9660

TTACCA7TCG TTATATGTAC ATGGATACTG CTTCTAACGG AGTGGaAATT ACAGAAAATC 9720

AGT7ACCAGA AATTTATTCC TTGTATATTG ATTTGGCAAA AGAAATGGGC TTTGGCACTA 9780

AAAAAATTAA GGATGCCACG TCTTTATTTA ATCAACGGGA ATGGTGTCTT AAACGCTTTT 9840

GCGGCAAAAT GCTCTTTACA CAGACGCTAT GTTGTCATTC ACAGTGATTT GTTGGATATC 9900

GCG7ACAA7A CAGGCGATTT TTCGCTGATT CGCTTTATTT TGGCACACGA GCTAGGACAC 9960

CATAAA7G7S GGCACACTAA TTTATGGCGA CTAATGTTAT CCATTATTTT GAAACCAGTT 10020

GCG7TAGA7A AAAGTTTTAC AAGAACGCAA GAATATACCG CTGATCGCGC AGGTTTATAC 10080

TACGCCGAAG AAGGTGCTTT AAGTATGATT TATTTATTTT CAGGAAAATA CATGGGCAGT 10140

CGTGTGGA7T TAGAAGAATA CTTTCACAGT ATTGATTTAC ATGATGATAC CATCTGGCTT 10200

AAA7TGAGTA ACTTCTTATC CGATCATCCT GTTGGCTTCC GTCGAATGCA AACATTGAAA 10260

AAAGCCAAAG ATACTGGCAA TTGGGACGTT CATGGGAAAT TTTTCTAGCA AGAGCCAACC 10320

ATCATTAAAT AAAnGTGCTG GGACAAAAGA TTCTTTAACT TTTGTCCCAG CACGTTTATT 10380

AATCCACCAA TAAATGAACA AATTCTTCTT TGGTGATATT TACAAAGTAG TCTTCAAAGT 10440

TAAACGCTGC TAATGCTTGC CAAATTGATG AATACTGATA GTCCACACCT AATAAAGCTG 10500

CAACGATTTC CGTTGTTTCT TTTTTGGCAA AATAATCACC ATAAATCGTT AACTCAATTA 10560

TTT7ACCTTT TTCAACAGTC AAACGTGCAT CCACGATGCC GCCTTTAAAC TTTTCTTCTT 10620

TTTTGATCGT AAATTTAGGC GCTTCACCAA AAATCCAAGC TTCATTGCCA TACACTTCAG 10680

CGACCAATTT CCTAATTTCC TGTTGATCTG CTTTGGTTAG GACATATTCT TTCTCCGCAA 10740

TTTCAGTTAG AGATTCGACA TCAAAGAGCT CCATCAGTAA ACGATTGCGA AATTCCTCAA 10800

TGG7TAAT7G TTGATATTTT TCGTCTAAGT ACGGCCGTAA ATTGGTTACG CGTCCCCGAA 10860

CAGATTTtGT CCCTTTTGAT TCAATTTTTT TCTTAGAAAC AGTTAAAACA CGTTGAACTT 10920

CTGCCAAA7C AACATCATAC ATCAAGGTAC CATGAGTATA CATCTTCCCC TTTTTTGTAT 10980
ACATGGCAT- GCCACAAAAT 77TT7CCCA7 CAATTAATAA ATCATTGCGA CCGCTAATCT 11040

CGGCCCCCGT TGCGCCCATT 77ATGTAA7G CTTCAATAAt TGGTTTAGTG AACGCTTTAA 11100

AATCCCCAAA AGCTTGATGT CCTTCCTGAA CAACAAAACT AAAGCTGAQA TTCCCGAGAT 11160

CATCATAAAC ASCACCACCG CCCGATAAAC GGCGAGTAAT AACGATTCCC TTCTCTCTAG 11220

CATAAGCTAA G7CAATTTCT TCATAAGCAT TTTGATTTCG GCCTAAAATA ACACAGGGTT 11280

CCTGAA7ATA AAACAACACT AAGGGCTCTT CAAAAGAGAG AGTGTTTAAT AGATAATCTT 11340

CCGTTGCTAA ATTT7CACGA ATATCTCCTG ACTGCATTAT GACATAACGC ATAAGGTCGC 11400

CCCACTTCTT T7TCTTCTAG TGTATCACTT TCTTTCAAAA TTGGTTCACT AAACGTCTTC 11460

TCTGGCCATT C7CAAAAGCG GATGATCAAC AAAAAAACGA ACGTGGGACA ACAATCACTT 11520

TGGATGTTTG TTCCACsCTC AAAATCTAAT AAACAGCAAT TGCAGAACTG TTTAGCTTTT 11580

TAATCTAACC AAGTTACTAC TTCTGAAACA TCTCGTCGGA CTGTTTCAAA GCCGGGTGTT 11640

TGAGCTTTGC CAATCGGAAT AATCAGTCCG TTCACTAAAT GATCCTCAAT ATTAAATAAC 11700

TTGCGAACAG CTTCTGTTTC AAAACCAGAC ATCGCCACAG TATCATAGCC TCGATCTTTT 11760

GCAGAAAGCA TCAGCTGCAT CGCAACTAAG CCTTGGTCTA ACATTAGCCA CTCTCTTTTT 11820

TGTTCT7CGC TAACATCATA G7ACCAAGAA ACATTTTCAT TAATTCCTTG ACGAATTTCT 11880

TCCGTCATCG TTCC7TGTGC GACAGCTTTA GCACTGATTT CATCTGCATT GGCCACTGTA 11940

TAAGCAGTCA AATCACCTAA TAAAATAATC ACCGCCGCTG CCCCTTTAAC TGCTGGTTGT 12000

TGCATCGAAA CTTCGTAAAG GGCCTCT7TT TGTTCTTTTT CTGTGACAAT TAAAAAACGC 12060

CATGAA7TTA AATTGGTACT A7TTGGCGCG AACATCGCTT CCTCAATTAT GGCCATTAAT 12120

TCCTCTTTAG GAATCTCAAC ATCTGGCATA AAATCCCGAA CTGAGCGGCG GCTGCGAACA 12180

ACATCT7GAT ACATCATATT TCTTCTTCCT TCCTCTTGCC TTACTTGACA TAGTTATTAT 12240

TACTCACTAT GATGGAGAAA AAAAGTACGC ACTTTTTATT CACCTAGGAA GAAAAAAGTA 12300

ACCATTACGT GTCACAAGGA GAAACTA7GA CAACTGATAA ACAAACCAGT ATCAACTTAG 12360

CCCTTTCCAC CATTAACGGC AAATGGAAAT TATCATTAAT GGATGAATTA TTCCAAGGAA 12420

CCAAgCGCAA CGGTGAATTG ATGCGTGCGC TCGACGGAAT CACTCAGCGC GTCCTGACAG 12480

ATCGCTTGAG AGAAATGGAA AAAGATGGCC TAGTCCATCG AGAATCCTTT AACGAACTAC 12540

CCCCGCGTGT GGAATATACG CTCACTCCGG AAGGATATGC TCTTTACGAT GCCCTCTCTT 12600

CACTTTGCCA TTGGGGCGAA ACATTTGCAC AAAAAAAGGC GAGACTTAAC AAATAAGTCT 12660

CGTCTT7TTT TATCTATTTT GGGTATTATG GGTAAACTTA AGATATTTTT GAAGTCCTTC 12720

TTGAAT7GAA TAAACGGGCG AAAAACC7AA GCCTTTCAAA GCAGAACTAT CTGAAAGAGA 12780

A7ATTT7ATA TCGCCATCTC G7TCCTT7TG ATAAGAAACT GGCAAAGAAA GTCCCACTAA 12840

TTGGTTCATT TCTTCAATTA A7TGATTAAG GGAAATGGCA ACTTCGGTAC CCACATTATA 12900

AACATG7CCC AACGCTTCAC T7TTGGT7GC TACTAAATTT AATGCAGTTA GGACATCCTC 12960








aagtgapgtg 


TTTGCTGACC 


A T CTCCAAAT 


ATfTGAAATT 


GGCTCACTTG 


13020 


PPPTTP.PTPT 


AATTGGATAT 


AACGATCCAT 


CAJXJU^TGGAC_ 


AGGACACCAG 


AATAGGGTGA


13080


a gp a cc a t t t 


TGATTGAGAP 


GATAGACGTT 


AAAGAAGCGA 


ACTGCGCTCG 


TGGGTACCCC 


13140 


P m p m a T\ n f r; 
G 1 G 1 AArt 1 


TATTPATTGA 


GAACATAACG 


TTCAGCAGCA 


AACTTATCAA 


TAGCATAGGG 


13200 


AG 1 CMftouw 




AAGTTTCTCT 


TTTAGGCAAC 


GTTGGCTCTG 


CACCGTAAAC 


13260 


GGG I GG I oAL 


GAAP.PAAAAA 


PAAAACGTTT 

v^ruLriii^ \-7 x x x 


GAGCTTTTTT 


TGATATTTTT 


TTATTGATTC 


13320 


CAG 1 Al 1 1 1 G 


a aTAP APT'PA 


AAAAATTGAC 


CTCATGGGTC 


TCTAGAGGCT 


GTGCCACAGA 


13380 


AGTTGGGAGG 


PTTPPPftPGG 


rrcic a ag atg 


AAAGATATAC 


TCAAACGAAT 


GCTTTGAAAG 


13440 


TACTTCG1GG 


AATfiWTTPPT 

AA 1 AA 1 1GG 1 


paTPPGTPAP 


AfTTrCTTTA 


ATAAAAACCA 


ACTGCTCAGT 


13500 


TTGTTGGAAA 


TTTPSPBPTP 
111 GAG AL. 1 L 


P.PPPPATGGA 


PAAATPATCA 


ATCACAAAAA 


CTTGATACTG 


13560 


TTGAC1 Al AA 


AAA1 1AGL.L-H. 


aaTTAPaapp 

Mr\l 1 


AATAAAGPTA 


GCTCCTCCAG 


TAATTAATAG 


13620 


TTTC I 1 GAGG 


1 G 1 1 IoL 1 1>- 


PTPTPTTTTT 


TTTTA ATTTA 
llll nn l i i 


ATTPCACTTT 

n x x v_> \_> xxx 


TTTCGTGTTA 

XXX Vrf \J X 1 w X ^ * ^ 


13680 


I I I I G J G I 1 1 


ppaaTaaaaa 


aaTPPAATAT 


PPAATGAACA 


CATAACTTAC 


AATAGCCACA 


13740 


GG 1 AA 1 AA^jA 


PTTP A a a AP.T 


AGAPAAAATA 


TTTAAPAGAC 

ill nrvv/ nvjfi\y 


TCGTTAGGCC 


TACCTTTTCT 


13800 


GCAAA 1 AAAG 


bill AAC 111 


GAPTAATGPT 


GTrGCAGAAA 


TTACTAATGG 

X X nV* X X V-J \J 


GAATGTAAAG 


13860 


GCAGGAI AAG 


I 1GGA1 AGAA 


TflTPAPTTGP 
lul L-rtt- 1 1 oV- 


ATAATTTTAT


ATAGATTCAC 


TACTACAATC 


13920 


CC Al AAA 1 AG 


All GAGA 1 AA 


aaTTAGPAAG 


AAAAPAAPTA 


ATGGCATATA 


CAAATGACCA 


13980 


/-"•/ r ~'/'~'A , T"TT , AAA 

CGGAI 1 GAAA 


1 A 1 AGGLHajv- 


TAAAPATAAP 


GAACCTGGCG 


CTGTCATAAT 

\— x \j x v— r\ x nn x 


CGTCAATAAA 


14040 


GGAATGG1 GG 


ATTPRPPP&T 

Al IGAGGGAi 


TTPTTTPG AP 


TTPGT AATTP 


TPTTGATTAA 

1 V» 1 1 X X *V-i^ 


AATTGGTAAC 


14100 


AAGAG 1 AAA 1 


RfianpniiPiiTi 
AAAnbAHbnA 


PGPT APPPA A 


AAAACAATTT 


TTCCCAACTC 


ATTGATAAAC 


14160 


AA 1 J. G AGAAG 


1A1 IHouHrtl 


aapGPPAATG 


prGACAAACG 


TAATAAACCA 


GCTTGGATAA 


14220 


Al Al G 1 1GGA 


oil LuHl 111 


TGTTGGAAAA 


ATATGCACCG 


CTACAAAATA 


AATCATTAAA 


14280 


Al A3AA1G 1 A 




p,ppAATrrAC 


AACACGTTAA 


TCCAAGCAGC 


ATTGGTAAAT 


14340 


t\ 1 lwln 


AGaaaaPAPA 


HATAACCATT 


AAAGCCATTG 


TAAATGTTGG 


TGATACCGAA 


14400 


g c a a t a a t t g 


GATPATGTAA 

\jr\ 1 1 1— 1 An 


GGTGCTTAAT 


GCATGTTTCA 


TTGTAAAAAT 


AATTTTTAAA 


14460 


ATAAPTaaaa 


TP ATT a aG AG 
i \^r\ 1 1 nnonu 


APTGrrAAf^T 


ACACAATAAA 


TGGTCCCAAT 


TGTTGCATAT 


14520 


pppaanpaAT 


AAAGTAAATT


P CC C AAAG AA 


ACCATCCCTA 


AAATTAGTCC 


GCAAATGGGA 


14580 


aTpp.nc aptt 


TPPGPA AGA A 




CTTTCTACCA 


TTATGTAGTC 


ACACTCCGTC 


14640 


TTTATTTATT 


ATTATTGATA 


TTAATCAAAA 


GTATGTTATT 


ATACTAAATG 


AAATATTTAT 


14700


ATTTGTAAAA 


TAAATATTTG 


CAATACCAAC 


TATTAAGATA 


GTAAATAGAA 


AGAGGCGATT 


14760 


TTATGTTCGA 


ATGGATCAAA


ACATTTATTG 


CTGTGTATGA 


AaCCAAAAAT 


TTTAGTACGG 


14820 


CGGCCAAACA 


ATTGTATATT 


TCTCAGCCAA 


CTGTTTCTTT 


GCAAATCAAG 


AAATTAGAGC 


14880 


AACACTTTTC 


GATCAAGCTG 


TTCTATCGTA 


ATGGAAAACA 


ATCGGTTATT 


CCTACCAAAG 


14940 
AAbblbAI TI


TT1 ATATCCA 


AAAA 1 bb lb 1 


PP ATT ATTP. A 
iuUnl lrtl 1 \jr\ 


AAPPTTAAPT 


APTTPTTTTH 


15000 


CGCAAbbTGC 


AGAAAAAGAG 


AAI I 1 1 AAAb 


A LPAPTPPAT 


T ATAPPPTPT 


TPAAATAPAH 




TTGCTATTTA 


TTTACTGCCA 


7\ m TV rp T* 7\ rn f~° 

GATATlAIbb 


Abnb 111 1 


T APPPPTTTT 
1 n^^ou 1111 


PP APTP ATP A 


15120


ACTTTTCGAT 


TCAACTGATG 


AAT T b AAAb b 


7\ nr^TPPTPP A 
AAb 1 Co 1 uurt 


T ATPPTTP A A 


A AT A A TP A TP 
AA 1 AA I b A 1 b 


15180


CCCACATTGG 


ACTAATTGAA 


AAACCCATTG 




1 b 1 Ab AbAAA 


b AAA 1 1 b 1 11 


15240


ATGAAGATCA 


ACTAGTTCTA 


GCAGGAGATG 


bbbbl lb AAA 


Al 1 1 ibbb lb 


1 1 AAb AbAAA 


15300


AGAACTCTGG 


CTTACGTTTT 


TTCAATGAGC 


TTTAlblbAA 


/T* 7\ 7\ P"?* P"*7\ P"P" 

bbAAbAbAbb 


ATpn APTT AP 

A 1 b AAb 1 1 Ab 


15360


CAATTATCGA 


ATTAAACAAC 


AACGAAGTCC 


TGI lAbAAl 1 


(T"V AA A7\ A AT 

bb 1 AAAAAA1 


AAbAl 1 bbbb 


15420


AATCAATTGT 


TTCTCGATTA 


TCTATTTCCG 


AlbAbAl bbb 


TTPPP A APP A 
1 ibbbAAbbA 


ATTP A TP APT 
Ail b A 1 V-/\Vj 1 




CCTTTGCTTC 


TCGAAAAATT 


TTCATTGTCC 


71 p1\ /-*/""»/'"' A "P P" A 

AGAbbbAi bA 


bbA ibbb AAI 


7A PTTPPTTPT 
Ab i 1 bb 1 i b 1 




CTGAAGTTTA 


TAACCGAATC 


GTTGAAAAAA 


TTbbi bAAbb 


Tipn AA prTA A 

Ab AAAbb 1 AA 


AAAAbAbbAb 


15600


AACGACTTAG 


CTTTCGCTAA 


ATCGTTTTCC 


TGTTTTTTAT 


rn /"* t\ rp/^ tv ^ rn rp rp 

TbAlbAbl 1 1 


bb 1 b 1 AbbAA 


15660


TTCAAATTCA 


AAAGGTTCTA 


ACTCAAATAG 


TTGGTGTAAA 


ATTTGCTbbb 


I T 1 b i IbbAA 




CACTTGTTGT 


GGCTCTTTAC 


ACAAAAGAGT 


GCTTTGAATG 


GTCTGTTTTT 


rn rn 7\ rp rp 7\ T\ 7\ 

TTGATTGAAA 




AAGAATTCTC 


GTTTCCTCTT 


CCTTAAATAC 


tv tv /"» rn rp /"^ tv TA /"> tv 

AACTTGATGA 


GTTcCCTTTC 


7171 apa r*r*r* t a 
AAAbAbbb 1 A 


15840


TATTCTTCGA 


ACATTTCCAT 


AACTAAtTTT 


TGACGAAACC 


CTTCTTTGAG 


TGCTAAi AAA 


15900


GTTACTAAAT 


CTGCAACGGt 


TAGCGkTTCT 


AACGCAAnTG 


kTTCaTCyTC 


TAPATPA 7\ f~* r~i 

T AbA 1 b AAb C 


15960


AAGATAATnC 


GtACACTTCA 


CATtCGGACA 


^ /"i /*> tv m rn /"* tv r^* 

GGCATTGCAG 


CCGbbAI 1AA 


1 b AAb i b 1 bA 




GGTAGTTGTT 


AyTTTCACCT 


GTTTCATCTT 


tv tv m /™* ta rn ^* rn rp 

AATCCTTbll 


n^r^f^T TV rp T 1 rp rp 

IbblAbl 111 


r , P"7A/* , 7A A fi A AP 
bbAb AAAAAb 


16080


ACAAGGGGTC 


TCCTCAAAAA 


TATTCAATAA 


TV TA m TV TA /"* TV T" rp rp 

AATATGA1 1 1 


APTTPTTPATi 

Ab 1 1 b 1 1 b AA 


JiPAPTTPTTP 
Ab Ab 1 1 b 1 lb 


16140


GTGATTCCGG 


CATTGTTTTT 


CCATTACTAA 


AbbbAbA 1 bA 


1 IbAbAAbAA 


A Z1PTPA A PPT 
AAb 1 bAAbb 1 


16200 


CGTCGCTTGT 


TGGGAAACAT 


rrt TV fTl TV ^ TA TV /■■^ 

GTATACAACG 


^''P'PP rp rp rn tv 

bl 1 bbb I 1 1 A 


AbAAAAbAl 1 


PTrTTTCTTP 
blbl 1 Ibl 1^ 


16260


TTCAATGAAT 


GGTTCTTTTT 


GATAGCCATA 


TiPPTP TV 7\ P" P" P"* 

TGbTbAAbbb 


Ab Ab A 1 1 bbA 


PP ATTTPPT A 
bbAl 1 l\*uln 


16320 


GATTTCTATT 


TTTGGAAGTT 


/•N TV m TV m TV TA 

CATAGCTACA 


Gb 1 AbbbAl A 


Ab 1 Ab 1 1 bAA 


A APPPTTPPP 
AAbbb 1 1 ^-uo 


16380 


CTGTTTTTCA 


TTATTGCTTG 


CATAATACAC 


m^** rp rp r™* rn /"* rp TA 

TGTTGlbbl 1 


b 1 bAl 111 bb 


PTPPTTATPT 
L> 1 \*\* 1 1 ri 1 ^ 1 


16440


CCTCTATTTT 


CCTACTCTTA 


TCATAACAAA 


• AAAAGbAbbb 


TT'r , /* , P"r'PTi t ,p p 
1 1 bbb 1 1 Al 1 


T A A ATTPTTA 
1 nt\t\ 1 1 vj 1 1 


16500 


TV rp iv * i» y\ rP T\ 

ATAAAGACTA 


TTACAGATTG 


m tv rp ta tv f*r+ TV 

GTATAACCAC 


7\ 7A A M\ TAP 

Abb 1 AAAAAb 


TTPPTPflTBT 
1 1 bb 1 bAl Al 


A ATTP APT A A 


16560 


GAAAGAAATA 


TTACAGAAAG 


AAGAGATGAA 


r+ ta rn /"* TV TA TV. rp 

CGATGAAAGT 


AT 1 1 AAb AAA 


A A ArTpmi A 
AAAb 1 bbbAA 




TTATTGGTAC 


TGGTTTTGTT 


GGCACAAGTA 


TTGCCTATTC 


CATGATCAAC 


CAAGGGATTG 


16680 


CGAATGAATT 


AATCTTAGTT 


GATATTGACA 


AAGCCAAATC 


TGAAGGCGAn 


CAATTGACTT 


16740 


ATTAGATGGT 


GTGkCyTgGG


GkCmAGAAAA


TGTAAACGTC 


TGGGCTGGCG 


ACTATCAAGA 


16800 


CTGCCAAGAT 


GCCGATATCG 


TCGTGATTAC 


AGCTGGCGCT 


AaTCAAAAAC 


CTGGGCAAAG 


16860 


TCGTCTAGAT 


TTGGTTTCAA 


TCAATGCAGa 


AATTATGAAA 


ACAATTGTTA 


ACAATATCAT


16920 
GAAATCTGGT TTTGATGGAA TTTTAGTGAT TGCCTCAAAT CCTGTCGATG TACTGACTTA 16980

TGTGGCCTGG CAAGCTTCTG GTTTACCTGT TTCAAGAGTA ATTGGAACTG GTACAACTTT 17040 

GGACACAACT CGTTTCCGCA AAGAACTGTC TCAACGTTTA GCGATTGATC CACGCAATGT 17100 

TCACGGCTAT ATTATTGGCG AACACGGGGA TTCTGAAGTG GCTGTCTGGT CTCATACCAT 17160 

GATTGGTACC AAACCTATTT TAGAAATTGT GGATACGACA GAGCGCTTAA CTAGTGACGA 17220 

TTTACCAATC ATTTCTGATA AAGTGAAAAA TACAGCTTAT GAAATTATCG ATCGCAAACA 17280 

AGCGACCTAT TATGGGATTG GTATGAGTAC TGCACGCATT GTTAAAGCCA TTTTAAATAA 17340 

TGAACAAGCT ATTTTACCTG TCTCAGCTTA CTTAGATGGG CAATATGGGC AACAAGATGT 17400

ATTTACAGGG ATTCCTGCAG TCGTTGGCAA TCAAGGTGTG ACTGACATTA TTGAATTGAA 17460

TCTGAATGCC GCTGAAAAAG AACTCTTCCA AAAATCAGTG ACACAATTAA AACAAGTGAT 17520 

GGCATCGTTA CAACCGAACG CTTAGTAATT TTTAACTAAA AATAACACTT CAACTAAATT 17580 

ATTCTCTACC AAAATAGATT CTAATTCCCC TTTATCGTTC TTTTTGGTAC CATGAAAGAA 17640

GAACTTCAAC TAAAGCAGAA ATTAGGAGGA CAAAAAATGG TAATCCAAGG AGATACGTTA 17700 

GAAAATAGCG CACGTCGTTT ATTGCAAGAA CGTGGCGTAA CAATGAATGA TTTAGCTGAA 17760 

CTTGTGTTAT TTTTACAAAA AGATTATATT GACAATTTAA CATTAGATAC TTGTTTAGAA 17820

AGTGTTGATG CCGTGTTAAC AAAGCGTGAA GTACACAACG CGATTATTAC AGGTGTCCAA 17880 

CTGGATATTC TGGCAGAAGA AAACAAGCTA CTCTCACCGT TACAAGAAAT TTTAACGGAA 17940

GATGAAGGGC TTTATGGCAT TGATGAAATT ATGGCGCTCT CCATTGTAAA TGTTTATGGA 18000 

TCTATCGGTT TTACGAACTA TGGGTATATC GACAAGGTAA AACCAGGCAT TTTAAAAGAG 18060 

TTAAATGCGC ATGATGGCGA ACATGTCCAC ACATTCTTGG ATGATATCGT GGGCGCAATT 18120 

GCGGCTGCGG CTGCCAGTCG GTTGGCCCAC TCACAACCAG ATTTAAGTGA TATTACTAAA 18180 

TAAAAAACTT GCCTCTTCAT TTTTATGAGA AGAGGCAAGT TTTTTATTTG CTGGACTTCT 18240

CACAAGCTTT CTACTTAACA GTAATTATTT GCCGTGTTAT GTTACAGTCA AGAATAAACA 18300 

GCTAAAGGAG GTTATCTAGA AATGAAAAAG TTTTTACGCG TATTTTTAAT TTTTATTGGG 18360

GTATTCTTTC TTGCAAGTTG CGGTTCAAAA ATAGAGACAA AAACCTTTGT AGGTTCGCCA 18420

CAAGATGGGA TTGATTCCAC CTTAACCTAT ACGTACCAAG GAGATAAAGT CTTAACGCAA 18480 

ACGGCCAAAA ACATTGTTTC TTATGACAAA TTGGGGATTA CAAAAGAAGA GGCAAAAACA 18540 

GCGTTGGAAC CCGTTTCTAA ACAATATGAG GACATCAAAG GGCTTGATTA CAAACTGACC 18600

TATGAGGACA AACAAGCCAT CGAAAAATTG ACGATTAATT ATGAAAAACT TGATTATGAT 18660 

AAAGCCAAAA AAGTGGATGG TATCCAGATT GATGGAGATT CTTCTAAGGG AATCAGCATG 18720 

AAGAAAAGCC AAGAATTGGT GGAGTCACAA GGTTATACAG AACAAAAATA ATAAAAGAAA 18780

GCGTGCGACA ATGGGCATTG TCGCACGCTT TCTTTTTATT AATTACATTG ATGTTGCCAT 18840

CTTCACTGCT TGAATTGGGT CAAAATGATG GACAGAGTAA ACGACTGTAC CGGCTTGCCA 18900

AACTACTGAG TTTCGATTAT AGTCAGTGGT TCCTGCCACG TTCAGTGTGA TTTGACCAAC 18960

GGTATTCGGT GCTGGTAATG TTTCTTGTTC CAAAATGTTG ACAACATTTT TCGCTAAATC 19020 

ATCAGGCGAT TCACCATTGA TATTTGAGGC CCGTACGACT AAACTCCAAT TGCCTTCTTG 19080 

CCAAGATAAG TAACTAGAAC CTGCCGCCCC TTGTTTATAC CCAGTAATAT TGTAACCTAA 19140

ATCTACTGGT TGACCGCCAT TGTCAATGAT TTGGTTCACT GGATTTTTTG CTTCGGCATC 19200 

AGAGCCATAT GTTTTTTTCT CAAAAGAACT AATTGGCGTT AACTGATTGA CACGTGCATC 19260 

ATTCACTGGA ATCGCTTCTT TTTCTGCATA ATATAAAACA CGGAAATTGT TTTGATCCGC 19320 

TTGTGAAGTT GTCGCTGCAG TTAAAAAGTT ATTCGTTTGA CTCGTTAAAA TTGCCTGCGG 19380 

TAAGCCTTGA CCAGGAAACA TGTTTGCTAG CTGGTTAAGC ACTTGGTTGG CTTGGTCATC 19440 

TGGATTCGCA TTTTTCGCTG GTGATTGCTT CTCTGCTGAG CCATTATTTG CCGCTTCCTT 19500 

CGTAGAAGAA GCTGTTGCTT TCGAATCAGC TGTTGTTGTT CCTGACGGTT TCGCATCACT 19560 

AGATTTTGTT GTTGCTTTGC TACTTGATTC TTTTGTAGCA GATGCTTTTG TACTCTCGAC 19620 

TGTGGTACTT TCCGCAGCAC TGGAACGACT ATCCGTATTT TCTGTACTTT TTCCGCCGCC 19680 

GCATGCTGCT AAAGCAAGGG TTAAAAGAAA AGCACTAGAA ATAAGCCCTA TCTTTTTCAT 19740 

AAGAATTCCT CCTTGTTTTC CCTACCCTTA TTGTACCTTA ATCATGTGTA TCCGTCACTT 19800 

GTTTGTGTTT CGTTCTAAGT TGGTTTAAAT AACAGAAAGT GGACGGTAGG ATTCGAACCT 19860

ACACCTCAAC ATTGAGCAAG AATAAACTAC TGTTAGTCTC TATGAGTTTA ACACCACGTT 19920 

CATCCACCTA TTTGATTGAT CCTTTGTATC ATAACTTAGA ATAATTAAAT TGCCTTTAAG 19980 

GAGCTTCAAA GATTATATGA AGAATTGTAA AAAATCTGTG ACGAAAAAAC GATTGAACCA 20040 

ATCAGTAGAT ACACTGAGGT TCAATCGTAA CCAATTGTTT AATTATCTGA AAAAACTCAC 20100 

AATTCAACGT TAGTCCCTGG TGCTTTTTTG TTTTCCCCTT GTTGTTTTTT TAAGAATATC 20160

CCCAAAATGA CGGTGATAAT GGGACAAAGC AATGGGAAAA AGGCATATGG TAAGTATTCT 20220 

AGTGCACCAA CTTTTAACGT TCCCATAATG AAGGTACCAC TAACTCCCCA AGGAATTAAC 20280 

GAGTTGACTG CCGCCCCAGC ATCTGCCAAA GTACGAGTTA AATATTTTTT ATCAATACCT 20340

AAACGAGTAA ATGAGGATTT AAATGTTTCC CCTGGTAAAA TAATTGACAA ATATTGTTCG 20400 

CCAACTAGTA AGTTAATTCC CACAGAACTT AAGGCAGTTA AAGCGATTAA TTTGGCTGGA 20460

TTATTCACAT AGCCTTTAAT TTTATCAATT AAAGTAGCCA CGATGTTAAA CTTAATCAAC 20520 
AAGCCGCCCA AACCTAATGC CAAAATAATC AACGCAGCAG AACCTAACAT GCTTTCAATC 20580 
CCACCACGAG ATAAAAGTGT ATCAATACTT TGATCCCCTG TATCTGCCAC ATAACCACTC 20640
ATTAAGATGG TAGAAACTTT CGCTAAACTT AGGTGATGAT CATTTATAAA AGCTAAGATG 20700 
ACCGCCACTG TAGAGCCGAC TAGTAACGTT GGAATAGCCG GTACCTTTTT CCAGGCGAAT 20760
AAGAATAAAA CCGCTACTGG CAACAATGTA ATCGGTGAAa TCCAGAATCC TTGATGTAAC 20820 
GTTTGGACCA TTGCATCAAC CGATTGTAAA TCCGCTGAAC CAGATTGATG CCCCAAGAAG 20880 
ATATAGCCGA


CAATCGAAAT 


AATAAACGCC 


r*r* a a *p a a p a n 
bb AA 1 AAb Ao 


TPTAPATPAT P.TTTAP.TATA 




TGTTCAAAAA


GLG_T_TCACGCC 


ACCAATTCCA 


rpRrfTIl APT 
bbAbb 1 rt/ilj 1 


TTP.TTP.TATP AHATAAPPPT 


21000 


GAAATGTTGT 


TCCCTAAAAA 


GGCACCTGAA 


ALAAb 1 oUftL. 


PAPPTPTPAT PPPATTPTPP 


21060 


AATCCAAGAA 


TATGTCCAAT 


ACCTAGAAAG 


r*r*7\ aT'PPPPA 
bLAAl bbUCrt 


TAPTTP.AAAP APTPPTAAAP, 

1 rtO 1 1 <jrtrtrt^j rtO 1 v-/ 1 rtrtrtU 


21120 


GAGCTCCCTA 


CCGTCACACC 


AACTAGTGCA 


C Ab At L At Art 


aaarrrTT^r: taap.aap.aap 

ftAntou 1 1 bo 1 rtxiVjrtrtortrt^ 


21180 


TTCACTGAGA 


TAATTTTAAA 


ACCATAGACC 


ATAAi lb 1 1 b 


pbatpptppp apataapatp 


21240


CAAGTAGCAA 


CTAAGACACC 


TATCAGCAAG 


AAAAJ. AAl AA 


TPPPAATPAT PPPTPPTTTP 


21300 


ATCCCTTCCA 


CAATGCCCTC 


GTGGATCTCA 


TCCCAAbAAA 


aTPPTTTPPA TTTP.PPATAA 


21360 


AGCAACAACA 


ACATGAATAC 


AAAAAGAATC 


GGTATGTGAG 


r/TTTTVR'PTT 1 T'T'P aPP a a m Q 
bbbllAAlll llbAbbAAlA 


21420


ATTAAGTAPC 


CGAGAATCGC 


TAACAAGGCA 


ATCAAAATGC 


CTAAAC 1 1 I b AbbbAALbAA 


21480


ACAGTGGGTT 


TCGTTGATTT 


CATATGAAAA 


CCTCCTTTTA 


TGGAAAbbbA AAAL 1 bAAbb 




AATAATCGAA 


CTTGCTTCAG 


TTnCCAAACG 


ACAATTTACT 


ATAG1LA1LA bblbAAAAbb 


21600


CACTCAGCAA 


AATAATTGCT 


AGCCTTTTTA 


TTCACCTAAA 


m/^rprp/^tT'rprprp A /-»rpm/~«/^/^rnrp^t/^' 

TCTTC1111A bllLLblibb 




CCAACTTGCT 


TTTTTAAAGC 


GATTAmTGAT 


TAACGGAATC 


a P A A A P A T* a A f'A !\TTiPTPaT 

ACAAAtAlAA LAAlAblbAi 


21720


TAATGCAACT 


AAGACTGCAT 


CAGTCACGGT 


TGTTGTACCT 


AATGTbbbbA AAAlAAbAAi 


21780


GATAACACTA 


AAAATTAACA 


TTGCTGCCAC 


TAAATAAGCT 


AAGCCAj. IAL LI 11 i 1 1 Abb 




AATTCGGTAA 


GGACGTAGTG 


TATCTGGTTG 


AGATTTACGT 


A A rprpm a A A C^C^CTK A a TPPP 

AATT J.AA1 1A bbbAAAlbbb 




AATCAATACA 


TACACAATAC 


TATAAATAAT 


TGTTGTTGCA 


TTTGTlAAAb 1 lAAbAAlAb 


21960


AGCGTTGACA 


TCATCCATTA 


CCCCATAAAG 


TAAAGCGAAT 


AATbAbAiAA bbAbAbAl lb 




TGtTAACACA 


ACGTTACGAG 


AAACGCCATA 


TTTATTTTCG 


L GAT bb AAAL LAAA1 1 1 Ibb 


22080 


TGGCAATAAG 


CCATCACGAG 


CAACTTGAAT 


AATTGTTTTT 


c a r T i r , f~'*T>t~'r*' a p faapPPAAPP 
bAlbbltbAb IrtHtLLftHuL 


22140 


AGATAATTGT 


AATAACACAC 


CTAAGAAGAC 


7* m/" 1 ti Tv T\ rn Tv 

CATGAAACTA 


AAGATATlAb bAAlAAl Ibi 


22200


TGGTAAACCT 


AAAACGTCAC 


AATAAAGCAA 


AATTGGTTGT 


r ,f TA7\n , rT , T , Rp apa aTTPPAT 
blAAlbl iAb AbAAl lbbAl 


22260


TTTGCCATTT 


GGAACAATAT 


TAGCAACAAA 


CATTGCATTA 


annaappTTTA AT&.AAAPTAA 
AlAAbblilA hlivirtrtUlftn 


22320 


ACCAATTAAT 


GAAATGAAAA 


CGCCTTTTGT 


tv TT\ TV T\ rn rp rp rn rn 

AT AAl 1 I I 1 1 


PTTPP ATPTT TP A A APPAPfl 


22380 


AATATAAACA 


GAAGACATTT 


CAATCCCAAC 


AAAG A I AAA 1 


ppa aTPPPTP PT AP.ATATTT 

VjbArt 1 kfV.,^ J. \j <w 1 rtort X r\ x i i 


22440 


AAATGTGTCC 


ATATTTTCTA 


AGTTTGGTAA 


AAC1 111 bAL 


pa apfip A APP P APPT A A AT A 
bAAbAbAAbu ^rt^^lrtrtrtirt 


22500 


GCCGCCAGGT 


GTTAAGCCTA 


CTTTAATCAT 


rp A A A^7iJ\rB 

TGAAAbAAbA 


rrfaaa&raa APATAAPTAA 


22560 


TACTGGGATG 


TAAACACCTA 


GCCAAACACC 


AATATTTCCG 


CCAACTTTCG CCATGTCAAA 


22620 


TTTCAAGTTC 


AAGATTGTAA 


TAATCCAGTA 


AATCACTAAG


ATACAACCTA ATACGAACCA 


22680 


ATGATTTTCA 


CCTAAATGAA


CATTaCCAAA 


TGTATTaCCT


AGCAAAGGGC CTAATGTAGA 


22740 


AGCAACCATT 


ACCATTCCTG 


GGAACATTTG 


CACCCATAGT


AACCAAGCGA CAACAAAACC 


22800 


CCATTTGCTA 


CCTAAGGCAG


TTTTAACCCA AAGTTGAGGG


CCACCTTCTT GAGGAAGCAT


22860 
CGTAGATAAT TCACCAGATA TCAATGCGAT TGGACCAGCA AAQAaTAAGA CCGCGAAAAT 22920

TGTGTAAGAA ATTAATGTCC AACCAACAGC AGCAAGTGTT GGGATACTAC GTACAGTTGC 22980

ACACAATGCC ATTGTTAAAC CGATAAAAGT ACCTAATTCA ATTTTTTTAA CTTGTTGTTT 23040

ATCCAATTTT GTACACTCCT TATAAGATTT TGGATAAGGA GCATGAATGA CTGTTATACG 23100

CTGAGTAAAA CAGTCATTCG ACTGCCTCTT ACCCAGACAA AGAATTGGTT AAGAGGTTAG 23160

CCATCTGATT GATTTATCTC TTAAATAAAT CAATCAGACG AACGTTCCCT CATTCGGTTA 23220

CCATTTCTTT TTTCCGTTTA AGATTCAATT ATTTTACGTC GTAAATTTGT TCTAATTTTT 23280

CTTGTAAAGC AGCTTGAATC TTAGGAGCGT AAACATCAAA TTCTTCTTTA TCGTTCATGT 23340

ATGGAGTCAT TACAGCAGCA CGTAACACAG TTACTTTACC TGCACGGTTC CATTCTTCGT 23400

CACTAAATCC CAAGCTGTTT ACAAATTTTA ATGGGcTGTT GCCATAATCT GGAATAGCAA 23460

AGTCAGTATG TGAAGTAATG AATTCGTTGT TGTAAATATT TCCTTTAACG TATGATGCAT 23520

AGTCATATAC ATCATGGTTT AATTTGTTCA TTGCTACTAA ATCATCGTTG CCTTTTTCTT 23580

TAAACACATA GTCAACCATG TTGAAGTCTG GATGAGTTAA TGTGTGGACT TCGATTTCTT 23640

TGTCGCCAAC TTTGAATGTT AAATCATTCA AGAAGTTATA GAAATGATGT GATCCTTCAA 23700

TTGAAGCACC AATTAATTTA CCATAACCTG CAACGTTTAA AGGTAAAACG TGATGTGCAG 23760

CCCATACAGA AGCAGCTGTT GCACCAGCTT TTGAACCTTC AAGGATATAA GCACCTAGTA 23820

ACGCAGGAAT ATCGGCACCT TTTTCAAATA CGTAAGTTGC GAAGTAAGAA ATAACGTCGC 23880

GCATACGGAT ATCTTGGATA ACGATACCAC CAGCTGAATA AGGGATATAA CCCATTTTAT 23940

GAGGGTCAAT TGTTACTGAT TCTGCTAATT CGATTGCTTT AtATGCATCA TACACTTCTC 24000

TTGAAATGTG TTCTTTTTTC TCTTTGAAGA CACCGTATTC TTCGTGAACA TCTTGTAAAT 24060

CTTCGTAAGG GATGAAGTTG TTGTCTTCGT CTAAGAAGAT GGCACGTCCA TAACCACCAT 24120

AAGCAGCATC AACGTGTACA TAATAGTAAA TACCGTCTTT CATTAATTCA TCGCGTAAAG 24180

CAATAATTTT ATCGATTGAG TCAACGGCAC CTTCTTCAGT AGAACCAACT ACACCGACAA 24240

CACCTAGCAC TGGAATTTGT TCTTCTGCTA AGCCACGAAC GATTTTTTCT AATTCGTTGA 24300

TATCCATACG GTAGTTATGG TCAACTGGTA CTGGGATAAC TTGATCTAAA CCGATACCGA 24360

TAATATCAGC AGCTTTTAAC CAAGAATAGT GTTTAGTTTG TGGAACTAAC CATTTACCGA 24420

TTGCTTGTAA ATGTTTACCT GAACGAGCTG AATGAGCTTT GATTTCATCA ATTTCATCTT 24480

CAGCTGATTC TAATAAGTCC ATAATTTCTT TTGTTGGCAT GTTCAATAGT TCCCAATCTG 24540

ATTTGCCAGC AACTAATTCT GGTTTTACTT CTTTCATAGC AAATGGTAAT GATTTAATGT 24600

TACGGGCATA CCATAAGCCT TCTAAGTTAG CTAAAGAACC ATCAGCAACG ATGTGTCCCC 24660

AACCATTTTT GTAGCTCATT AAGTGAGCAA ATTCATGTCC TACTTCTTCT TCCATTTGAG 24720

AAGTTGCTGG AGAAGATTCA TAGGCAACGT TGTTCCCGTT CCATAGCATT GCAAAGTTGT 24780

AAGCTAATAG AGAAGGCATT AATGTTTCTG AGTTCATATG TCCCCAATAT CTACCTGCTG 24840

T A T PP P A T P P
1 A 1 u^Cn 1 bb 


a aptp.a aTfT 
AAb 1 b AA Ibl 


b X Abbb A X bb 


HAnAAGAGAT 


T^CATTCAAT 


ACATCTTTCA 


24900 


1 A 1 bb I 1 bAb 


Hb 1X1111 bb 


T A P P T T T P A ("1 
X Abb XXX bAb 


rt 1 w A VJVj-U J. X \w- 


TTGAGAAGAG 


ATAACTGGCA 


24960


X b X b 1 XbX bb 


b 1 Ao 111 


T P A PPP P TA T P 
X bAbbbbAX b 




ATCTACTAAG 


TCGATCAACA 


25020 


tl s*+ rp /"* rp rp rp TV rp 7\ 

AGTbX X XAX A 


rn tv t\ ttptppp 


XXX Xbbbb X X 


i U i VwV^Vw V_/T>rt X 


GAATAGTGCA 


TTAAGGTTCA 


25080 


TTTCGCCTTT 


rp /** rp tv T\ T* HP rp rp 

1 bb X AAX III 


Xbb X i X KX bA 


T APPZ\ A A ATT 
1 rtUurtrtrVrt i i 


PPTCrTTACA 


AATTTTAATG 

nn x x x x nn x 


25140 


AATTAGATTT 


l GbAl I bAb A 


AbX XAbAbbb 


a A APCCPT A A 
MArtUVjoVy 1 rirt 


AGTHTAAAGA 

rt\j 1 O 1 nAnun 


APAaAAATAA 
rtwrtartArt X /v\ 


25200 


CAATGGtTAT 


bAI Abb I b 1 1 


rp tv t\ rn rp rp rp TV T A 

X AAX X X X Al A 


ATTT'TTTPAP 
A 1 1 1 1 1 1 v^nl 


AAGGTGTPTT 


PTP^Pa APP A 

VJJ 1 VwCLOClrtV->\^rt 


25260 


TTATATaAAA 


T\ TV T\ rn TV Tv Z"* TV rp 

AAAT AAG AG T 


bbba x x gx b X 


P+- A PA 7\ TPPP 
b L rtbrtrt. X bbU 


ATPTT ATTAP, 
nlLl 1 rt 1 1 rtvj 


PTPTATTAAP 
OlOlnl x rtrtU 


25320 


GGTTTCACCA 


CTAbCTbTAA 


bAb n AAA X Ab 


A ATTTT A APT 
rtii 1 X X X rlrtb 1 


TTP.ATPAAAT 
1 X vjrt 1 \»,rtrtrt 1 


AATPAPAPGT 

rtrt X Wrt*wrt^\J X 


25380 


CTTCTCTTCG 


TGAAATTATT 


Tv rp/->> tv T\ TTTfT 

ATbAATT. XbX 


A A TPP ATT TAP 
AAX bbAX X Ab 


TTTTPPTAAA 
111 luvvl AAA 


AAPTAATTTT 

/ArtVj I rtrt 1111 


25440 


TCTtCCmTTT 


/— . ,r-i rp iiriji t\ Tv 

CCGTACAACA 


7\ / ^/^7\Tvrnrprpi_/-' 

AbGAATi X tb 


PA TPP A A ATP 
brt X bbrti\rt 1 b 


TTPTP-T TP. P. A 
1 lului i. Own 


TPGAPTGPAA 


25500 


AATCTAAATC 


GGTAACTCGG 


TbAbbGTTAA 


TAl^TT AT APP 
X AbX XAXHbb 


TPPTPPP.PTP 
1 bb 1 bbbU 1 U 


APATPTTPPP 

AV_* A 1L1 1 v/UL 


25560 


GAGCTTGACG 


rnrrt«THT>/™*T\ rp /"» rp 

TTTTGATGGT 


TbGAT i X b X b 


TP TP A APT A A 
XbX brtAb X rtrt 


PPA ATPT APP 


ATPTTTTTAP 

rtlUl 1 1 1 l rtO 


25620 


CATCTTTGGT 


AATTTCAAbA 


r~* rp rn /~» TV /—» TV rp 

Wl X bbbAbAX 


PTTrT A A PPP 

bX XbXArtbbb 


TTPPTPf^ AT T 
1 luLlLunl X 


TPPTPAATPP 


25680 


TTAAGTCTTT 


AATGTTGCCT 


TV T\ TV TV TV TV /"* 

GAGAAAAbAb 


/^TT/"*T P A AAT 

b X X b X bAAA X 


TTTTTPPPPT 
X X X X X bbbb 1 


TPTTPTAATP 
1L1 l^lAAlO 


25740


CTGCATCATC 


GTGAACAAAT 


/«• rn m /■» *f\ ^ rp rp 

CGTGTCACTT 


CT X bAbb X AA 


/"•PPTPTTTPT 
bbb 1 b 1 1 1 b 1 


PPTTP APPTT 
utl l^nVvUl 1 


25800 


TACCTGGTTC 


TTTTTCAACT 


TTTTcAGTAA 


bbbAXbbrtl X 


TPTTPTTT7AT 
XbX Ibl 1 1AX 


PTA APA A APT 
Vw 1 AAlJ AArtO 1 


25860 


AAAGAATTTC 


AAGTATTTGA 


rp TV 7\ rp /"*■ TV /""•/"* 

TGACATCAbb 


ATPATPTTPP 

AXbAX bX X bb 


TTT A APPAPA 
111 AAUV^AvjA 


ATTPPTAPAA 

rt 1 1 VOVJ 1 rtOrtrt 


25920 


TTCAAATGGT 


/*• t\ /— i m rp rp rp rn 

GAGGTTTTCT 


X Abb A XbX AA 


PPAP AT APPP 
bbrtbrt 1 noto 


UVvrtv^Uv^UL x O 


TTTTPPPAAA 

11X1 v^UVvfuVi 


25980 


TTTTGTACCA 


TCTGCTTTTA 


r*r* ATTAAAPP 

bbAX XAAAbb 


A ATPPTTA AP 
Art 1 vju 1 1 rirtU 


PPP AAT APTT 

UVvOnrt i rtv^< 1 X 


TGGCTTCTGG 


26040 


TCCTTCTTTT 


T T 7\ r"' /"* Tv ATf^ 

T T Abb AA I b A 


A A TPT A A TPP 
AA X b X Art Xbb 


TP,PAP,TP,ATA 


TTPPPPPATT 

x l UvjVjVw.Ln x x 


GATCAGCGCC 

\j n x v>fxvjvf\jvvVv 


26100 


ACCAATTTGC 


AATTGAATAT


rp 7\ rp ^* rp rp rp rp rp rp 

T Al b X X X X X X 


ATPPA ATP.TCI 
A 1 bbAA lulb 


T A A A APTPA A 
1 AAAAU 1 v/ttn 


TTGATTGAAG 
i i ort i i ortrt vj 


26160 


AATTTGGTAT 


GTGAATTbTb 


rp tv tv nrpRBBT 
X AAAbbAAA X 


PPP DPTTTPT 
bbbAb 1 1 1 b 1 


A A APP APT AP 


PPAPAATGTC 

V_*vjrt\^rtrt lUlVv 


26220 


TTTTGCCAAC 


ATCGTGTTGA 


T\ rp rp Tv tv A TTT 

bAI X AAAb X X 


TTTPPPP.T A A 
X 1 IbbbblrtA 


TPTPTTAAAA 

IVylUl 1 AArtrt 


AATPTAATAA 


26280 


T\r" , T\T7\r' , TT/" , T 

AGATAGTTCT 


G AT AAbbAA i 


bb 1 Ab X X b 1 X 


P APPATTP.TT 
Lnu^n Hull 


APPTPAPPAT 

rtv^U 1 ^wrt.VJV_,rt 1 


PTTTACCAAA 


26340 


rt* 7\ 7\ rp rp rp *"P /"** 

1 AA I X X X X X b 


TV TTT/~ > / , " , TTTP 

AX X X bb 1 X X b 


AAA P 7APTPTP 
AAAb Ab lulu 


PAP ATT ATP,T 


TPPAPAPPTT 


PPATCGTTTG 


26400 


1 AAbAbAbb 1 


rn/if* rp rp rp rp /~> 

X bbb X XbX X b 


PTPP APTTPP 
b 1 bbAb 1 1 bb 


P.TPAPP A ATT 


PTTPPAPTTP 


PGPCACCAAT 


26460 


rp7\T\7\7AT^TAT 

TAAAATb 1 AI 


bbb X bA X bAb 


PTPPT A A TTf 
b X bb XHHX X b 


P A ATPf^TTTP 


A TP ATP AT A A 
A 1 Urt 1 v^n 1 rtrt 


AAPGAATTAA 


26520 


ATGTCCAATA 


TGCATGCTAT 


CTCCAGTTGG 


ATCGACACCG 


CAATATAGCG 


AAATGCTCGT 


26580 


ATTTTCTGTA 


AGTTCTCTTA 


GTCCTTCTTC 


GTTTGTTTGT 


TGATTGATTG 


CATCACGCCA 


26640 


TGCTAGCTCG 


TCAATGATAT


TCATGTTATC 


ACCCTCCATA 


TTTATAATTA 


TCTCTTTATA 


26700 


GGAAATCGTT 


CATTTAACGA 


AACAAACGAT 


TAACACACTT 


GTTTTTACAC 


GATATCCCGT 


26760 


GTTTATTTCT 


ATGTATACAC 


GGGTTCACGA 


AATTTTCTTT 


GGTCTGCCAC 


GTATTCAAAA 


26820 
7\7\7V.r*7A7AT7Af*7A

AAAuAn 1 ALA 


a A a A ATPPPT 


a aPTPPTPPA 


AACGACTTAG 


GGACGAATTT 


CTCGTGGTAC 


26880 


f A f~* A 7\ 7\ TT 

CAbbbAAAl 1 


T T AT7A/" , aTaPT 

1 A 1 Ab A J. Ab i 


TT aPTTT ATT 
1 i nu 11 1 ri A 1 


TAAAATACTA 


TATACCTCTT 


AACAAATAAC 


26940 


GCTb 1 1 Abbb 


b l bbbb 1 b lrt 


ppp a p a a pp T 

bvj^ttvjrtnvjv-. i 


CAAGAATGTA 


ATTCGATAAA 


TAACGTGTAC 


27000 


TAGTTTbbAb 


bbbbAb 1 Abb 




AGTGGTTATT 


TCTCTACTGC 


TTTTCTGTCA 

X X X X v X \J X V-*l * 


27060 


TTGCTTTGTA 


TTTAACTTGT 


aP7ATPT APPA 
ALA 1 b 1 ALjv^rt 


T ATTPPTTTT 


PPiTTTTTTTT 


oTCaACACTC 


27120 


TTkGCTTAAt 


TTTTkTTACT


1 1 i A 1 AAAAo 


ATP.TPPPTTT 


TTPAPAAAAA 


TAAAATAACG 


27180 


AATTCAAAAT 


AATATTTATT


lul 1 1 1 nnftft 


P.PAAPTAPAft 


ATAAAAAAAT 


AACGAAAAAA 


27240 


TTTAAAAAAT 


ATTCTTTTAA 


TV rprprprprprprp/^TV 

A I 1 1 1 1 1 luH 


VwlllllllOfv. 


PATTTTPT AT 


APTTTTATAA 

# X X X X XV X 


27300 


AATTCCCGAr 


j^i TV TV TV T\ TV *Tl /""* TV Z - ** 

GAAAAATGAC 


GAACT A bbbA 


TP a CI A APT TP 
1 LnUnno 1 ivy 


PTPa TTTTPT 

U 1 La 1 1 1 lOl 


PAAAaaCAGT 


27360 


TTTAAAGAAT 


rp <"->/"•• rp «—• TV Tv Tv tv rp 

TGGTGAaAAT 


TV A T^f" T A. A A 

AAlbb 1 bAAA 


TTTlTlTTnTTT . 
lllullulil * 


AAAAATPAAP 


PATTTTGACA 


27420 


TGTCGCGAAC 


tv tv r~< rp rn rn /"*» /*"* 7\ 

AACTTCTGGA 


GTbAI Ibl lb 


TaPaTTTTTT 
1 r\\->t\ 111111 


TPPATPTTPT 


AAGAAAATTG


27480 


CTTCTAATTG 


t\ m ^ tv t\ rn ^ TV fpfri 

ATCAATGATT 


TTTT L AlbAl 


aTA APAPAPP 


ATTTP.PTTPA 


AAPTTTAATT 

rtrlu ill nn i i 


27540 


TGTAACTTCG 


GATATCTTGA 


m rp rp /-» t\ t\ /*■» 

TTGGCTbAAb 


LAAL I b 1 bbA 


PAPTTPATPA 
bAb 1 X bA 1 \~<r\ 


TPPATPtATCA 


27600 


TTGTTTTCGC 


ATGTAAGAAA 


CCACCGTTAI 


7\7\ TATTnnaaT 
AAA1 1AAAA1 


TTPQ fiT&TTT 
1 1 bAA inl 1 1 


TPTTTPATPA 

1\>1 1 1 /\ 1 


27660 


ATAAGCGGCC 


GTAATACTGT 


GTTGCTCGAT 


7\ A ATA T\ 7\/~*f^r* 

AAA 1 AAAbbb 


aTPPTPPPPP 
A 1 bb 1 bbbbb 


TTATPTPPAA 
1 inl L 1 Vjijnn 


27720 


TCATAATTTT 


AACATCTACT 


CCTGAAGCCG 


pnpp]\7\ ppiPT 
CAbbAAbb 1 1 


t ta a a cc a ppp 
1 AAAbbAbbb 


AaPAPAPTAT 


27780 


CATCAGGAAC 


TAAATACGGC 


GTTTGAATCC 


Ab Ab I b 1 1 1 1 


PTTPPP AP A A 
bl IbbbAbAM 


PT A ATP A ATT 

Ulrtrtl^/Wl 1 


27840 


TAATAAAGGC 


CAATTTAATT 


TGTTCACGbl 


PRPTPTTliP^ 
bAb 1 b 1 1 Abo 


APP APT PPP A 


ATPATCTGAA 


27900 


TGGATGTATT 


TGCCTCTGGC 


AbbAAi bb 1 1 


r*aaPTTTaaa 

bAAbl 1 i Afii-i 


PA AATAATPP 


AATTGATACG 

riri x x un x w 


27960 


CCACACGATT 


TTTCTCGGGG 


GAAbAbAbb I 


TPPaATPPAT 


TAAAAAAPPP 


ATTTGCAGTA 


28020 


ATGAGGCTGC 


TGGCCCTTGA 


Albbbl AAA1 


PTPTATPPPP 


PP A AT AGPP A 


AAPTTTTTAG 

ruiv X X X X * *^fcs-r 


28080 


TCGTTCCGGC 


tv rp tv mrp/^Tl rp /"» /- 1 

ATATTGATCG 


/■"•OA 7, PATTA & 

GLAAbAl 1AA 


APPPPPP A AT 


PTAPPPTACT 


TTTCCGTCAA 


28140 


TAACAACAAT 


TTTCCGGTGA 


TTftTPATB TAT 

TCAI bAI AA1 


tp a a a pp. a a a 

1 bAAAbbAArt 


PTTPA A A APT 


PCTTTTTGAG 

vj Vw x x x x x unvj 


28200 


TV TV m TV TV rp T\ 7\ Tv 

AAGTAATAAA 


CGTTTGGAbA 


AAAbbAbbb J. 


TTTTTTTPa a 

1111111 Vyrtrt 


TTPATTPAAA 
i i i v^ruin 


TGATGAACCT 


28260 


TGGTGCCCTT 


m /*• TV ^ TV TV TV ^* 

TGAGCCAAAG 


bCA 1 bb 1 AAA 


PP A APPPPAP 
b b AAbb b ^rV\- 


TTPAAPPPPT 


TCAGCCGCCT 


28320 


TTTCTTCTAA 


rp »\ Tv /— rp rp T\ A A 

TAAGTTbAAA 


AT III bb lbb 


PPATATPATP 
bbA 1 H 1 b A 1 


PPTTAPAAAG 


GCATAATATT 


28380 


j*** tv m tv ^ tv ti rrt 

CAATATGAAT 


GGAGTGCTGA 


GCTTTTTTTA 


1 Al bbbbbAl 


PA APPPATTP 
bAAbbbAl lo 


AATTTTTCCG 

rtrt 11111 V^VwV? 


28440 


t a pp a tpppt 


a a a a atttpa 




TCTTAGTCAG 


AGGCATCCTA 


TTTAACGAGG 


28500 


AAAAAAAGTC 


TGTAAACTGT 


TGTTGCTCCT 


CCGTCGGCAT 


CTTTTGACTA 


n% Tv TV Ti ffi m ^* TV TV 

TAAAGTTCAA 


28560


AACTTTCTTC 


TTGGAAATTC 


TGGAAATTCT 


CTAATTCCCG 


TAAATCACTT 


TGTTGCAAAT 


28620 


AAAATTTCTT 


TTkGTCGGkT


AAACCTCGAC 


CAAAGrATAA 


ATATAAGAAA


AAACCAAAAA


28680 


CGGGTAGTGG 












28690 


(2) INFORMATION FOR SEQ ID NO: 139: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6146 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



AAATTATTGA AAAATCACCG AAGCAAACTC ATACAAAATA TACAAAATGG CAAACAGAGT 60

CCATTTATAA ACAAAAaATA AAAAGTCGTT TAGACTTTTT ATTAGAACAA AGTTCTTCAA 120

TTAATGACTT CCTAGAAAAA GCAGAAGCTT TGAATTTATC GGCAGACTTT TCTAATAAGT 180

GGACGACCTA TCGATTGCTT GACGAACCAC AAATAAAAAA TACTCGTAGT CGAAGCTTGT 240

CAAAAAGTGA CCCCACTAGA TATAnTTATG AAAAGATTAT TGAACGATTG AAAGAAAATA 300

AAAACGTTCT AACAGTGGAA GAATGTGTCA AAAGtACGAg AAAAAAATGA ACAAGAAAAA 360

AATAATTTTG ACTATCAaTT TACmATAGAA CCaTGGCAAC TGTCTCACAA AACAGAaAGA 420

GGATATTATA TAAATATCGA TTATGGCTAT GGCAATTCTG GAAAGCTCTT TGTAGGAGGG 480

TATAAGGTTG ATCCATTAGC CAACGGTAAT TACAATATCT ATATTAAAAG AAATGAGTAT 540

TTTTATTTTA TGAGCGACAA GGATGCTACT AAAAGTAAAT ACTTAACAGG GGCTTCGTTG 600

ATTAAACAAT TACGACTGTA TAATGGACAA ACACCACTGA AAAGAGAACC TGTGATGCAA 660

ACCATTGATG AATTAGTTAG TGCCATTAAC TTTTTAGCGG CGAATGAAAT CGAGGACACA 720

AGACAACTAA AATTATTAGA AGAAAAATTA GAAGTTGCTT TTCTTGAAGC AGAAAAAACT 780

TTAGAGACTC TTGACGAGAA GATGTTAGAG CTGCACCAAT TAAGTAACCT CTTATTAGAA 840

AATGAACTGC AGGGAGATCC AGAGTTAATT CAGAAAAAAT TGAAAACATT GTTACCGGAC 900

GCAACGCTTG CGGAATTTTC TTATGAAGAT GTTCGAGGAG AAATAGAAGC AATTAAAACC 960

AGTCAAACTT TGTTAGAGAG TAAACTGGAA CGAACTCGAA ATGAAATTAA TCAGCTTCAT 1020

GAAATTCAAG CCGTACAGGA AAAAGAGCCA GAGAAGCAAA CCGACATTAA GCCCAAACTA 1080

TAAACTATCT ATTGGTGAAC GGAGAGGAGA ATTTTATGTT AGTTTTTAAA CGTATTTTGT 1140

CTGTAATACT ATTTTTACCT TTTTTTATTC TTCATTTACT TTTCCGATAT ATTGGAATTT 1200

TAACATTGCT TACTAGTGGA TTTACCACCA CTTTATTATT GATTATTGCC CGAGTTTTGG 1260

CTTtTCTTTT TTTAATAACA CCAATTGTGA TCTTAGCAAC AGAAGAACTT AGAATCGAAT 1320

GGGCAAGTTG GGTTGGTTGG CTTTTATTTG GATTACTTGT TCTGCTTTTT CATTATCTAG 1380

CTGCCTTGAA TGAAGAATTA GGAGGCTGGT TTTTAGATTT TTCTGATTGG TTAAGCAAGG 1440

TTCCTAGCAA ACTGTGGGCT GAATAATTGC TCCTATGCTA AAACGCCTTA ACTTAAAAAA 1500

GCCAAGGCGT TTTATGGTAT AATATACGAG ATAGTAAAGG TTTATGACGG TGGCTAATCT 1560

CTGAAAAAGG GGTGATTGCC TATGTTGATT TTTGGCTATA CTCCTAGAAA GGACTAGCTT 1620

TGACTATTTT TGAAGCGTTA ATGTTTGCGA TTACTTTTGC GACATTAATT CTAAAAATTA 1680

GTGATAAAAA CAACAaAAAA TAACCGTCAA CTTTAGCCGA GTTTGGTTAT TTTTtGTAAA 1740

AATATTTAAC TATCGGTCAC CGTCTTAAAC GGTGCTATCA TGAGGGCGTA TGACTAGTAC 1800

GTCCTCTTTA CATGTTTATT ATAACACATT CTCTACAAGA AGAAAGGCGG AAGCCTTTCT 1860

TTTTTTTCAG CGCGCCTTAA CGGCGGTGGC GGTGGGAGTT TGGcGCGCTG AaAAAaACGC 1920

TCtTTTCTTG ATTTCTAATC AAGAAGATAC TCGCTAAAAG CGAGTATCAG GTTTGTCTTA 1980

TATTATGATG ATAAGGCGTG CTGTAAATTC TCTTGCCATT GTCTGACTGC TTGTTTTATC 2040

TTTTCTGGTA CTTsCTGACG GTTACATwtn CCTTGTATTT GTATAACCTG TTGATTTCGA 2100

TATTCTAATG TGAAGTAAGG CATATCTGGC TTTTCTTTTC TTCGGATAAA AACAATCGTT 2160

GTTTTTCCTT GCGTATGCCG TTCGATATAG TGTTGACTGC CCACACAATG GCGTAGAATA 2220

CTTCCTTCTT GAATGATTTC TTGTAATGAA TGTGGTGTGA GAAAAAGCAA ATCGTCAATT 2280

TCAGCTTCTA ACGCTTTAAT TTGGTTTTTC CGTTCTTGAT ACTGCTTTTC TTCAATCTCA 2340

CTTTTCAAAA GGTTTAGCGT GTTCATAGCG TTATCGTGAG CTACTTGTAA GTTTTCAGGA 2400

TACAATACAC TATCGTTATT AAGTGGGGTG TTCAACTCTT CTAACATGTG TAAATAGTCC 2460

ATATAATAGT CAAAACGCTC CCCTTTTCGT ATAAACCATT TTTGAAACTT TGTTAAGTTC 2520

ACTTCTTTCG GTAGTTGCTT AATTTGCGAA TAATGCACGT ACTTTTCTAT TTCGTTTATC 2580

ATAGGCGCAT TTCGCTCTTT TAACGTTTTT TCTAACATGA ATTCATGAAA GGTTGGATTT 2640

CTTGTTTTTA AtTTTTGtTT ATtCGCTTTT AGCCATTTCT TAGTAATTAC TCTCATATCT 2700

ACTTTATGAA AATACAGTCC AAAAACGTTG CGATAGTCGT CAAAAATGAT TTCATTCGCT 2760

AAGGTTGTGG CACCTATCTT TTGTAAAAAC TCAATTTCGT TTCGGTATTT GTAAATATGA 2820

GCTAGTTCTT GACGTTCTAT TTCAGGTAAC TGTAAGTACT TTAATTCCGA TACAGTGGTT 2880

AGTTTTTCTT GCCAGTTATT TGGATAGAAT ACTGTATTTG TATAAGCACT ACTCATAAAA 2940

CCATAATTTG CTAACAAGCC CTCATGCCAA TGTCCGTTCT CATAATGGGC TTTAACATGG 3000

GTATCGTTTG AAAACCGTTC AAGGTTTGAC GGCTCATAGA TAAGGTTTTC TATTCCATCT 3060

GTAATCGTCT GCCAATAACT ATGGGTTTGA ATTTCAATTC GTTTTGCACG AACGAGAATG 3120

ATTCCAAAGG AGTAAAATTT CGTTGGAAAC GATAATCGTG AGTATTTTGT TAATCGTTTT 3180

TTTATGATAG GGcAATTCTT GCGAGAACTC GCAAGAATTG TCTCCTTTTT GTTTTGCCAT 3240

TCATACGTGG GTATTTGTGC AGTACACCAG TTAAAAAAGG CTTTGGGTGG GGTTAATGGT 3300

TTTGGTGTTT TTTTATTCAT ACGCACTCGA ACAAATCAAT TTGTTCGTCC ACCTCTTTTC 3360

TTTCTGATTT TGTTTCTGAT TTTTTGGTTG GTTTCTTTTT TGTTGTGGTC GTTACTTTAC 3420

CAACTGCTTT GGTTTCTGGT AATTTCTCTA ATAAGAAATA CTCTTTTATC CAACTGAAAA 3480

CCACTTCGTC CGCCACCATG GCGCTTGATT GGTTTTTGGC TTGTTCTTGG GCTTTTCCTA 3540

TGCAGTAAGT CATAGCACCT TTTAACGTTC TATCTTyTTT GAGAATACCT AGAAAAAGCG 3600

CTTCGTCTGT TTGTCGGCAT AACCAGTTAT GAACAACTTC TAGTGTACTG TCATGGTCGG 3660

TTTCTAGTTC TTTTArCATT TTCTCAAAAG CTTGTTCTTG CGTGCTTTCT TGAGCTGTAA 3720

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AGAAAAATCG TTCGTAAGAA wkTTTGrCGC 7AA7CACTAG CCAyTkrTAk TTrTTTCCTT 3760 

GGTCTATyGT TTCGTCAATy GTTAGAAAmk" STGAyGArTA GTkwCCGTtG GggTCTTTwA 3840 

TCGyAACrAT yGCTTGTTCw GGTAAT7TTT C.---.GG7AAGy CArTAACGTT TCGACTGGAT 3900 

AAAGAGCGAC CTCATAATTA GSA7AAAAAC -VTAAAAACTG AACGTTTTTT CCGTCTTTTT 3960 

CAATGTGTTC TTTTCTGAAA GCGACACTGA 7A7AGCCTTG TATGTCGTCA AAAATTGTCG 4020 

TTGCTAGTGT TGGTTTGTAT TGT7TGAGTG C3737GT7AA TTCATTTACA GTTTGTATAT 4080 

TTTGAAAGTC CATAACGTTC CCTCGTTTCT 777AAGCTTC CCCTCTTACA TATGTAAGAG 4140 

GGGAGCAAAT TTTCGCTTGT CAAAGG7CAG TCGGTACGTT ATACTTGAAC TACGGTTAGT 4200 

AATTCAAGTA AAGTCCTATT AGAAATCTTT 7S7CGTTAGC AAAAGGTTTC TTTTTTTGTC 4260 

TTTTTTTCGT GGAAATACTT TGATAATGGT TCAAAATCTT TCATATCTCC ATTGCAATTC 4320 

ATAGTACTTA CTAGCCACTC TCCAGACTGA CSTTGTGCAT ATAAATAGCC GTTGTTATCA 4380

GAGCCACGAT AAGGTGTAAA GGTTACTTTC CCTTCAACGA GTTCCGTCCA ACCGTCAATT 4440

CGGTCAAGGT CAGTTAGCGT GAGGTATTGT T3TTCTCTTA TTTCTTCTAA TGATTGTGCA 4500 

AGATACATTT CTGTTTCTGG TCGAACATAT A3ACAAAAC7 CGTGCCAACC ACAACAAGTT 4560 

AGTTCCCGAC AGCCGACATT GTCGTA7TCG 7AAAGTACGT TTTCTTTCAC ACCTAAACAG 4620

GACAGGCCTT TTTTCTGATT GATAACTTTT TSAATTTTTT CAGGGTCTTG GTAATGTTCA 4680 

TATAAAACAG CACCTACTCC CTCTATATAA CCG7CCCAGT GACAATATAT GCCTTGATAC 4740 

GTTCCGTCTG CTTGTTCTTT AAAAATTGCT GTTCGTGTTC CCATAAAAGC TCTCCTCTTT 4800

CGTTGTTTTA AATGATAAAC TCTGTTGCTT C3TGATTATC TTCTGTCAAG ATAATTTCTT 4860 

GGTCGTAATA CATGTTAGAA ACCAATTCAT GTGCCTGTTC TTCGCTTTCA GCAAATACAG 4920

AAACAGCTAC CTCTAAGGTT TCAACAATTG TTACACAGTA TTTTTTCTTT TTTTCTGCCA 4980 

TATTTATACG TCCTTCCATT CAATGTGATT TAAAAAGAAT GCTAACTGAT TCAAACGATT 5040

ATTGGcATTT TTCAGAcAAC CATAGT7TAG TAACAATG7A GGCG7ATCAT T7TCAAGAAT 5100 

GGCTTTA77T AACTTCTGAT ATTCT77ACG 7GC7TGCTCT AGCTGATTTT TTAGCTGTGG 5160 

GTCGTTCATT TTTACTACTT CC7T7C7GGT A7TCTATTAT T7TTGGTAAT CTCTATCAGT 5220 

GT7TTTTGCG CCTCTAACGC AGTTAAAATT GGTAAGCAAT CTCCTAAAGT CATACCGTTA 5280 

CCGACAATTA CAAAAGGTTG AAGTAACTGC CC7ACCGTCA TTT7TTCG7C TGAATGAAAA 5340 

AGAGTAAATT CTTTTTCtAA AATAT77TCG CCT7GTAACA AACATTCAAT ATACACTTTG 5400 

CTAAAG7CGT AAAAACTTTC 7ATTGAAAAT AATTTTTGAA GATAATATTC TACAGTTTCA 5460

AAATCAATGC AATCAAAATA CAAA77A7TT 7CA7CAATCA AAAAACATTG AGAGCGTAAT 5520 

GTGTTAAATA AATCTAAACA 77CTTCTAAA TCTAGCGATT TTAAATCG7T A7TTAAATAA 5580 

CGATTAATGG ATAAGCTATT AATGAAA7TT TCT7T7TTGT TCATTGGTTC TCTCCTTTAT 5640 

TTT7TTGTTT TTGAGCAACC A7T777GTC7 G jCGTGAGGG AGGGAAGCTG T7CGCTTCTT 5700

TGGCTCATGC TAGGACGTGG GACACTATCC TCCTTTCGTT TTTAGATTTG ATTGTTCATT 5760

TTCGCCAGTA TGTTTCGGTC GGTTGCTGTG CCGAGAAAGA ATCATAGACA CCAAGAAAAG 5820 

ACAAGGGCGA ACGGAGTGAG TTCATATACC CTTGTCTTTT CGACTGGCTA TGATACTGTA 5880

ACAAACAGCG ACTGACCGGG ACATACGACA ATGAGCGGTC ATAAAACGAT TGAGAGGACA 5940

CACGTCCGTA AAAgCATGAG CCAAAGAGGC GAAATTTTAA TGCACTTAAA AGAACTAGAC 6000 

GCAAAGCGGC TAGTTCTTTC TTTTGTTTTG CCTTGCTTTA GCAgGCATTT CTTTCGAGCG 6060 

GCGGTGGCGG TTGGAGCGwG ATAGwAATAA ACACTAAAcA aAGTAATnAG GTATGAAAAA 6120 

TGGAAAGACG ATTTTTCAGT GAATAG 6146 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
TTTTTGTCAT GATTTGTCAA TACTATGCTT CAGTCGCTTC GCAAGGTTTT GGGACGGAGC 60 
TAAGAAATCA ATTAATGAAG AAAATTAATC AGCTTTCACA CAAAGAATTG AATAGTTTTG 120 


GTACAGATAC CCTCATCACC CGGATCACAA ACGATATCAA CCAGCTTCAA TTAGCTTTAG 180 

CGATGTTTAT TCGGTTGGTC ATTCGGACAC CTTTTTTAAG TATCGGTTCT GTGGTGATGG 240 

CTTTTTACAT TGACGTGCAG ATGGGCTTTC TTTTCCTATT ACTTTTACCA ATTTTTAGCC 300 

TTATTCTCTT TATTATCATT AAAGTGACTG TGCCTTTATA TCAAAAAGTC CAAGAATATT 360 

TGGATCGGTT AAACCGTCAA ATCAGTCAAA ACTTAAGCGG TGTCCGTGTG ATCCGTGCGT 420 

TTGCTAGAAA GGAAACAGAG CAACGACATG TTGATAAAGC TTCAGATGAT TTAGGCGATA 480

TTTACATTCG TGTATCGAAT GTsTCTGCTT TATTAACGCC TTTAACCACT TTGATTATGA 540 

ATGTTGGAAT TTTATTTTTA CTTTATTTTA GTGGCTTAAA AGTTTCTTTT GGTTCCTTAC 600 

AACAAGGGGA AGTTTTAGCA TTGATCAATT ATATGAATCA AATGATGCTC GCTTAA 656
(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

CGGAAGATTG ACAAAAGATC CTGACTTACG TTACACCGCT AGTGGTTCAG CTGTTGCGAC 60 

CTTTACACTT GCTGTAAACC GTAACTTTAC GAATCAAAAT GGTGATCGTG AAGCAGACTT 120 

TATCAACTGT GTGATTTGGC GTAAACCCGC AGAAACAATG GCTAATTATG CCCGTAAAGG 180

TACATTATTA GGTGTTGTCG GAAGAATTCA AACTCGTAAC TACGAGAACC AACAAGGTCA 240

ACGTGTCTAC GTAACTGAAG TGGTTTGTGA AAACTTCCAA TTGTTAGAAT CTCGTTCTGC 300

TTCAGAACAA AGAGGAACTG GCGGCGGTAG CTTTAATAAC AACGAAAATG GTTATCAATC 360

ACAAAATCGT AGCTTTGGTA ATAACAATGC CAGTTCTGGA TTTAATAACA ACAACAATAG 420

TTTCAATCCA TCATCTTCTC AGTCGCAAAA CAATAACGGT ATGCCTGATT TCgATAAAGA 480

TTCTGATCCA TTTGGTGGCT CAGGTTCATC TATCGACATT TCAGATGATG ATTTACCATT 540

CTAAACGAAT TTGATAGGAG GGAACAAGAA TGGCACAACA AAGAAGAGGC GGACGTAAAC 600

GTCGGAAAGT CGACTATATC GCAGCAAACC ACATCGAATA CATCGACTAT AAAGATACTG 660

AATTGTTAAA ACGATTCATT TCAGAACGTG GCAAAATTTT ACCACGTCGT GTTACAGGTA 720

CTGGCGCTAA AAACCAACGT AAATTAACGA TTGCTATTAA ACGCGCACGT ATCATGGGAT 780

TATTACCATT CGTTAGTGAC GAACAATAAT TTTATGACAA TTTCCnGAAC CAAAGAGCTT 840

GCTCTTTGGT TCTTTTTTAT CTTGACTAGT GTTTCACGTG AAACATAACA AGGTTCGCAG 900

AATGAAGTGT TTTGTGATAA AATAAGGGAT GAAATAACTA GTGAATCACC AGAGCAATAT 960

GCTCGTTGGA ACAAGTCGCC TCAAAGGTCT GCAGAAAGAT TGGAGGCAAA AAAATGCAAA 1020

AGAAGAGAAT TCAAAAAAAC GGTTTCTTAA TTGTTGTGGG TCTTCTCTTA GTAGAATTTC 1080

TCCTCTATTT CTTACTAACA AATAAATGGC TGCTATTGGC GGTAATTATC GCATTAGATA 1140

TCTTTCTCTT AGTGGTTATT CGGCTGTTGA TTAGAGATGT AGAAATTACG AACGTAGAAA 1200

AGATTCAAGA AGCAAGTTCC ATTGCTGAAC AATCGTTGGA TTATGTTGTA AATGAAGTAC 1260

CTGTGGGAAT TATTACGTAT AACGGGGAAA CACGCGCGGT AGAATGGCTT AATCCTTATG 1320

CTGCTTCTAT TTTTAATAAA GACAATCAGC TAACGTTAAC CGCTAGCCAA GTTACGTCTT 1380

ATTTAGAATT AGCAGAACGA AACCAAGATA TTTTTACGAT TGACGAAAAT ACCTATCGCT 1440

TTAGCGTCAA TAAAGAACAA CATACAATTA CTTTTGAAGA TATCACTAAA GAAAGTAATT 1500

TGTATCAAGA AAAAGTCGAA ATGCAAACGG CTATTGGCAT TGTGTCTGTC GATAATTATG 1560

ATGATGTCAC CGATACAATG GATGAGAAAG AAATTTCTTA TTTGAATAGT TTCATTACGA 1620

CGATGGTTTC TGATTGGATG GACCAATACA AAGTTTTTTA TAAGCGAATC AACGCAGAAC 1680

GTTATTTTTT CATTGCCCAA TGGGAAGATA TTCAAAAAAT GATGGACGAA AAATTTTCTA 1740

TTTTGGATAC GATCCGTAAG GAATCAGCTA ACCATGAAGT AGCCATTACG TTAAGTATGG 1800

GGATTGCTTA TGGGGGCCCA ACCTTAGATC AAACCGGGAC CACGGCTCAA ACAAACCTAG 1860

ATACAGCTTT AGTTCGTGGT GGTGATCAAG TGGTTGTAAA AGAAGCCAAA GATGAAGCGA 1920

AgCCGTTATT TTTTGGTGGA AAAACGGCAG TAACGACGAA ACGTTCCCAA GTACGTTCTC 1980

GCGCAATGAG CATGGCAATT AAGGGAATTA TTGCGGAATC TGCTGACATC TATATTATGG 2040

GCCATCGTTA TCCAGATATG GATGCGTTAG GTTCAGCATT TGGTGTTGCT CGTTTAGCCT 2100

CGTTTAATAA TCGAAAAGCG TGGATTGTTT TAGATGAAAA TGAAATCATT CCCGATGTCA 2160

AAAGAGTGTT AGAGGCGATT AAAGAGTACC CAGAATTAGA AGAGCGCATT ATTAGTCCTA 2220

AAGAGGCCAT GAAGCGCAAG AAAGAAAGTA GCTTATTAGT TATGGTAGAT TACCATAAAC 2280 

CGTCTCTATC GATCTCACAA GAGCTCTATG AGCGTTTTGA TAAAGTAGTA ATCATTGATC 2340

ACCATCGACG AGGAGACGAA TTTCCAGCAA AACCCTTGCT TTCTTATATT GAATCTTCTG 2400

CCTCTTCTGC TTCAGAATTA GTCACAGAAT TGATCGAATA TCAAAGTAAT AGCGCAAATA 2460

AACTGCAGGC CTTTGAAGCA ACCATGATGT TGGCGGGAAT TGTGGTTGAT ACGAAAAGTT 2520

TCAATACACG AACGACGGCG CGAACATTTG ATGTGGCTAG TTATTTACGA ACTTGTGGAG 2580

CAGACTCATC TTTAGTACAA TATCTATTAA GTTCTGATCT TACAAGCTAT CTGGAAATGA 2640

ACAATTTAAT CTCTAAAAGC GAATATGTCA CAAAAGATAC CGTCGTTGTT GCAGGGAGTG 2700

AAGACAAAGA ATATGATAGT GTCACAGCTG CCAAAACAGC GGACACATTA CTTTCTATGG 2760

CAGGGATTAA TGCAGC 2776 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



TCCATTCCAG CTATTTCTTT CTACATCTGT AATATGTCTA ATGTTATCTT GATCATGCGC 60

ATCAAACTCA GATTTAGTTG CCTGCTTTTG ATTTAGTACA TTTCCtwAAC CaACCTGakT 120

AGCTGTTACC GCatGaGGAT TACTTATATT TGATTTATGG GCATTAAAAT CTTCAATATG 180

TGTAGCACTA TCTTGCAATT CTTTTAAAGC TCCGTGGGTT CTGCTAAAGA ACCAATTAAA 240

ATAATCAGCT GGTGGTTTTT GTGAAGCTTT CCAACCATCG CTAGTTAAGC TTTCGGGTGG 300

TTTAATACCT GGTGCTAACC AAACAGGCAA TTCCTTTGTA AATTTCATTT GCATTCTCCT 360

CTCTTTCTAC AGCGGCAGTT TATAATCATT TTCTGGAATA AAGACTCCTC CCAAAGTTCC 420

TCCGATATTA CCATCAATAT CAGCGAATCC TTCTTTACTA ATCTCTATAT CAGTCGTACT 480

TGAAAATGAA AAAGTTCCCT CCAAATTAAC GTAGGCTACT CTAATGCCTC CAGCACTGAT 540

ACTCTCGACA ATTTGTAAAA ATTGACTAAT ACTTAGACCC GAACTATTCA AGTAGTCAAG 600

AGGAGCTTTT TTTATGATTA CACAAGCAGG TTCTTTTTCA TCTACCGTCT CATTAGCTGA 660

TATAATATGG ATATCACTAG GATGACAATT TAAAGATGTC GCAATTGCAT GTAACATTTT 720

GTCGATTGAT CCATCGCTTG TAtTTCTAGC TACCTTTCCA CGAATCAATA CTCTATATAT 780

TTCATCAGTC GTTTTtCCTC TCGCTTGTCC AACATTAGCA CCTAATTCAT CTAGTCCTTT 840

CCCTCGTGCT TCATCTATCG ATCGCCAGTT TGCTACTTTA TTCAGCAAAT CAGTCAAGTC 900

TTTAATTTCA AACTCAATAA TTTCAAGAAG TTTTGAAATA TTTGAATTTT CGCGATTGAA 960
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rp A /—A TPf PPT 
1 AoAJ b T-bb i 


A A A T* A A TP TP 
AAAi AA1 bib 


TZIIiPPTTTTT 
1 AAbb 11111 


P.f2TP APT APT 


AATCACCACA 


TTCTCTGTTT 

XX\rfXV>>XUX X X 


1020 


tapattpapp 
1 AbAl IbAbb , 


rn/-« TV rp 7\ f prT»rp 
IbAb 1 bAl I 1 


PPATTPAP.AT 
bbAl IbAoAl 


T A ATPTPTTT 
1 rvrt 1 UiVyl 1 1 


Onv/ 1 1 v^uVJ i r\ 


GTTTCAGTAG 

\J X X X N-'/k W X OVJ 


1080 


A T A A TPP 7i BT 
A J. AA 1 ^CHrt 1 


TTTTAPATP7A 
1111 Ab A 1 \_ A 


p.pp. zi p zi a pa a 




TTPtATAAATT 


AATGGaTAAA 


1140 


1 A 1 AAbA8AA 


TPTP APTAPT 
lbl bAb I ftL 1 


TPAPPPaTAP, 
1 V_<nL<^L*a l no 




AATGTAATTA 


TTTACTATAA 


1200 


CI i 1 1 Al 1 


TTPTTPTPPP. 
1 1 b 1 1 b 1 LLu 


Ir-TTTTTTPAA 


ATTTTGAATC 


TACTTGAATA 


GAAATATTCA 


1260 


CAAAAA 1 Abb 


AAbAbAl ill 


P.PATA ATP A A 


ATTTAACAAT 


ATGACTAAAT 


CCTCCTAAGT 


1320 


m m rp rp t\ pmnir' 

CTTTTAb lib 


7\ A PTTPPTP Ti 
AAb 1 1 bb 1 bA 


TTAPPAAPAP, 
1 InVwL^nn^nO 


THTrAATrrc 

lul ^rul 1 ^* ^ 


AGPGGCAACA 


CTTTTAAAAA 


1380 


TTGCCTGCCC 


AATAx bbl b 1 


TTP APTPPZVP 
1 1 b Ab 1 bbAb 


PTAAAATATPi 


PAPATHTAPP 


GACTTGGGTG 

vjnvi x x vjvju x vj 


14 40 


GATTACCATA 


/■"•/■■•Tl tv on /""* t\ r*"* rn rn 

CGAATbAbl 1 


lbbAI lb 1 1 1 


T ATTTTPA AP 
1A1 1 1 1 AAV- 


A APAPTAP.PP, 


\J 1 V— V_>0 1 flVvVvVv 


1500 


CACTAACTTC 


m TV ^ ^ TV TV TV ^ TV 

TAGCAAAGCA 


f rn rn tv tv tv ttp 

GTTAbAAl lb 


PPTTT A TTPP 
bb 11 I Al 1 bb 


PPPTPP AP.CA 


T TPPP APP. A A 


1560 


CAGAAATACC 


T\ A rnrp /*■» rp <*■• /■*• 

AAT 1 bbbl bb 


bbAlAbbbl 1 


TATPTP.TTTP 
1 Alb lbl i 1 


APP.TTPTP,PP 


PPAPPPTPAG 


1620 


CCCTAGCAGG 


m m tv rn rn tv t\ tv 

GTTATTAACA 


jr-^ tv rn 7\ rn 71 T 1 rnm 

GAI bAl Al 1 1 


PTTP A PT A PP 
b 1 1 b Ab 1 Abb 


TTPPAPTTPT 
1 1 bbAb 1 lul 


APTATAATAP, 
lnl nil i. no 


1680 


TATTAGCTGG 


Ti Tv /■"» rri rri rp tv TV 

TACGTTTbAA 


b 1 AbbAl 1 b 1 


PTTPT A A AP-A 
b 1 1 b 1 AAAo A 


A ATTP.PP,PP,P, 


PPTTTPCCAA 
vyLi i i VrfV^v^nn 


1740 


ATCCATTATC 


ti m tv tv rn m rn m /** 

ATCAATTTTG 


A/- , T , AT , AT/" , A A 
AC 1 A i A 1 b AA 


TPZ1TPTP. A A A 
1 b A lbl 0 AAA 


PATAAPTTTA 


TTTTPTP,TTP» 

X X X ivlul J.O 


1800 


AAAAACGTAC 


GCCTTCATTA 


A 1 b A 1 b i AAb 


P APPTTTTPP 
bAbb 1111 


TPAP.A APTPP 


AP,PTPAAPPA 


I860 


TTGCAACTGT 


tv /■» /"• rr» /"•/"• ti mm rp 

AGCTGGATTT 


CTA 1 AbA 1 Ab 


/*" APT AT TPPP 
bAblAi lloL 


APPT A A APC2A 
nLL 1 AAAWVjA 


TPPAATGAAA 


1920 


CCCCTGTAGC 


m m Tv /TprnAfT*/"* 

TTGACTTATb 


AAbbbAb 1 Al 


A A T Z\ Zi APTPT 
AA i RnHL 1 1 


TTPAP,TTAPT 
l l V— tvj l i no l 


TPATGAGATA 
x v—i*. i onon i n 


1980 


TAGATAAAAA 


CCAi GbAAbb 


A 1 AbbAAl AA 


TP AT APPPAP. 
1 VjA 1 AV-VwV_rtvj 


A APAP.AATHT 


PPTPAAAPAT 


2040 


TAACATCCbA 


A r~T>f\ A A A A P 

AbbAAAbAAb 


PPTTTTPPPT 
ob 1111 UV^V_. 1 


TTTPAP^APAT 


ATPATAGAGT 


ATTTCATCAT 

n xxx x w x 


2100 


ATGTTTTTCT 


I T 1 AAA 1 bbA 


rn rp rp rp (~* 7\ T 1 P A A 
I 1 1 1 bA 1 b AA 


PPAPPT ATT A 


ppRppTrrAf 


TTCATCACCT 
x x N^n i v^nv^v.i' x 


2160 


TV 7\ rp /*■» rp Tl /"* TV T* TV 

AATGTTGATA 


bbAl b 1 bbAb 


iul Art Ini 


A APTPTPT AT 


TGTTTPTTAP 
xox x i vv i x 


AATTTCTATA 

f Ul XXX X 4* X * * 


2220 


mm rn m /"* 7\ tv rp tv o 

TTTTCAAl AC 


rn tv rp rn TV TV > prp /"" rn" 

I A I I AA 1 1 b 1 


■ APPTTPTTP7A 
Abb 1 1 b 1 1 bA 


TPTAA A ATP^ 


PTTPTP.TA AT 
l l v i. U l nn l 


ATPTTGTTTC 

fi XOX XOl X IV* 


2280 


tv t\ /"** Ti tv tv rn m rn 

AAGTAATCTT 


tv ti rp /**" rrv tv ATT 

CAT I b I AA1 1 


TTTTPP A A A P 
111 IbbAAAb 


ATATTATPAP 


TTTPTA A APP 


AAPGGATGTA 
nn^oon jl o x n 


2340 


TCTAATTTAA 


T\rpmr^mrp/^rT>AP 

ATTb f 1 b I Ab 


TPPTATTPAT 

1 bb 1 Al 1 bAl 


A AP ATP ATP. A 
AAVjAI V^rt.1 on 


ATAPAPTTTP, 


TGPAAGTTCT 

X O VvXlIiVJ X X V>r X 


24 00 


Tv TV ATPTrTTT 1 

AAA1 b Ibb i. 1 


PA A PT A PT A A 

bAAb 1 Ab 1AA 


AATTPPATAP, 
AA1 iUbAlAu 


TPA ATAA ATP, 


ATAAGTPTCC 


AT TG ACT ATT 

x x wriw x ** x x 


2460 


TTPB A A TPPP 


IbAl iAAbbA 


ATPAPPPPPA 


TAATAAPAP,P 


GTCATTTTGA 

o X x i x x \jn 


CTATGCATTC 


2520 


T A TP A P A TP A 
1 A 1 b Ab A 1 un 


T Ti A TP A A T A Ti 
1 AA 1 b AA 1 AA 


TP A CP AC A AP 


TTPTATAATT 
1 l u ini. rvft i i 


ATPTATGTCA 


CGATCACAAA 


2580 


A A ZVPT APT AP 
AAAb 1 Ab 1 nL 


TAP A APPPPT 
1 AbAAbbbb 1 


PPPT^TP.AP,A 


TATrAfrAPTf^ 


APAATGCTTA 


GTGATCAACG 


2640 


CGTTTAGTAT 


TAATGCTCTT 


TTGCCTCCAT 


CTGATTTTAA 


TGCCATTGGC 


TrA ATAfPAP 

1 bAAb A 1 b Ab 


oi fin 


CTCTTGCACC TTTTACAGTG ACAACTCGAC ATAGTTGCAT AACATTAATT TCTTTTAAAA 2760

TTCTATTTTT AAATGATCGA AAAAAAGCTA AGTCAGTTTC TTTCATTAGG TTACCACCTC 2820

ACACGTTGTA ACAAATCGAC TACCATCATA GGAGTGCTGG cCATTTTTTA CATAAAAATT 2880

TCCyTTTACA TTTTTtGAAT CGATATAGAC TGCTGTACCT GTAGTGATTT TATGTTGCAA 2940
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AAbAbAb ILft 


AbAbAbbAAb 


PATTTITA ATP 

b Ab 1 A 1 AA 1 b 


ATGACTTTCA 


ACTTTTGTTG 


GTTGGTTAAT 


3000 


AAGTCCCGTT 


TCAGGACTTA 


7A T P T P T 7\ T P T 


CTCtTTATTT._CCATCTCGAA 


AATTTTTAAT 


3060 


TACAAGTTGA 


CCTCTTTTAA 


AA 1 A 1 Ab I bA 


TGTGTCACAA 


GCTTTGGAAA 


TCTCTTCTAA 


3120 


AACCATCATT 


GCTTGGCCAT 


bAbb 1 b 1 b 1 A 


GCCAGACCCA 


TATATTTTAT 


TATTTTTTAA 

x **> x x x x x x fui 


3180 


CTTAATTTCT 


GACAAAGGAA 


n> j\ rp rp rp /"* rp rp f~* 

TATT Ibb lib 


CCTAGCAACT 


CGATTtATAA 


TAGTAkGrGC 

X OVJ X rlA.\JX VV 


3240 


ATCCGTyCCA TTATTAAAAG 


TAATAT1 iAb 


TTCTTTTkGT 


yCCGAGTAAc 


TTTACCT vCT 

xxx x y x 


3300 


a AA AAAftTAA 


ATGTTGTTGC 


TbTArbTAbC 


CGTTTAGAAC 


AGAGGGTTTT 


ACTTCTGCTA 


3360 


ILol 1 t 1 vn 


PGTAATCACT 


CCAT 1 T bAAb 


TACCTGCGTA 


TCCTGCGTGG 


ATATAGACTG 

r\ x n x nwnv x w 


3420 


on 1 1 1 ill 


PTTAATAAAA 


m "A rp tv /*> rp rp rp 

TCAATACTT1 


TTTTATTTAA 


ATTGTATATT 


GTPArACTCA 

o x tn^ri^ l ^*r\ 


3480 


til ^nun 1 kJO 


ATTAGGTGAG 


TCAGAAAAAb 


GTGCTGTAAA 


GTGAATTTCT 


AAPPGATCCA 


3540 


nU\j 1 i \jn 


ATCAGCTCTT 


AGTAGAAbTC 


GATTCTTCCC 


ATTCTTGTCG 


THTATTTCAA 
i \j i rt i i i ^ftn 


3600 


T T T fZ T A A T A A 
1 1 1 O 1 nr\ 1 Art 


TPGTTGCCAT 


IGbb 1 Al lib 


CCATTTAATT 


ATCTTCCCTT 


TPPAAACTTG 


3660 


PA APATTAT A 


ATTCGGAGGT 


AAA1 b A i b 1 A 


TATAAAGGAA 


AACCTGAATA 


PPAAAATTTT 


3720 


PC A A Af^TTAT 


TTPTTTTGAA 


GCTCbbbAbl 


CATCCATCGG 


CACCAAATCT 


nrTHAA^TA 


3780 


AAr^A^TATr 1 


TAPAATGTPT 


TGCbAAAb Tb 


TTTCATCGAT 


TACCATACGC 


TPAPPA AT ACI 


3840 


PTATPf^flTf^T 
V 1 ri 1 tOv3 1 o 1 


ATGATPTATG 


rp TV rp ?v rp 7\ TV rp 

TCATATAbA 1 


CTACAGTAAA 


AAATTTTTCT 


P»TTTf^ ATT AT 


3900 


APTPPAPPTP 


AAAAATGTAA 


m rp fp /"* t\ rp rp t\ 

TTTbTAT 1 Ab 


CTAAAGGTAT 


TTCGAATTTT 


TPAfiGTAATG 


3960 


AATATTTATC 


AATAGGAATA 


m t\ rp rp rp rp t\ 

TATGCTbl 1 A 


AAGACATTTA 


GTCACCTCAT 


TTPAPAPGPG 


4020 


CACGAGCGCC 


TATGGGTATA 


T\ tv rp /"i rp t\ rp /■*< rp/""' 

AATCTA1 bib 


GCCACTTGTT 


CCAATCTCTT 


AATTGTTGAA 


4080 


TAGACGTACC 


ATATTGTTGC 


CACCAAbbbb 


AATAAGTATT 


TCCTGGCTGA 


APTHTPAPAT 


4140 


ATACCGCATT 


ACTTGGTTTC 


GAAGGTTb 1 1 


TTGCAACTGG 


CGGTTGTGGA 


ATPTTPTPPP 


4200 


AAATAGTCTT 


AGCTACACGT 


7\ rp /"* r~* rp rp rp tv 

ATGGGTTb 1 A 


ACGTAATTGT 


TATAGTGAAA 


PP ATTTTCCA 


4260 


CTGTGTCATA 


CCCTTTACTG 


Al Al b 1 1 b 1 A 


TAACTGCATT 


TTTAAAATAT 


GATCTGCCTC 

Oil 1 VI VJ W X ■ 


4320 


GATAAACAAT 


CCAACGAACA 


Tbl 1 1 1 lbbb 


ATGCAACTAA 


TGTATTGTAA 


GCTTGCTCTG 

VJv.^ X X X x w 


4380 


CAGCGTtACC 


ATTTTTAGCG 


BP l\ B.TPP AP.P 
Ab AA 1 OLnuL 


CACTAATTGT 


GACTGGACCG 


CCTGTATACA 


4440 


TCATATTGTC 


AGTGATTGGT 


C P T P P T P a T T 
bb 1 bb 1 bo 1 1 


CAACAGGATA 


TTGAGATACA 


TTAGCAGCAC 


4500 


TGGTTACAGT 


CTCACTGaCG 


111 AbAHl t 1 


PfinpTTT A P. A 
b AAb ill AbA 


TTTAPPAPTP 


TGaATGTATC 


4560 


CCATTAAACA 


ACCTCTCCTG 


1 1 bb 1 AbAAl 


ATTAAGTAAT 


TTTGCATACT 


PATCTTCTAA 

vri X V X X V i on 


4620 


TGCTTGTCTT ACAGCCTCTT TAATATCACC AACAACAGAT TTATCAGAAT TATTCCCAAC 4680

ATTGATAGTA ATATTAGGAC TAAAATTAAC AGATGGAGAT CCTTGATTAA AGAGCGCTTT 4740

AGTTTTTTCA TGAGGATGTA CAGTTCCAGC TGTATCTGCT TCGAACAATT CTGGTCCATT 4800

CTCTCCTACT AAAACTGTTT CACCTTTTGA AGGGCGGCCT CCCTTTGCAT AAGCCCTTAA 4860

TCTATGACCT GTTGGTCCCC ATCCAGAATG TCCATACGGC AAATCTGTTC TCCAATTAGA 4920

ATTATTGAAA AATGCCATTA GTTGATGGAA ACCATTCATT AT&T TTT CAT ATCCACGTAC 4980

TTTATATGCA TCAAAAGTTT GAGGAATGTA CTGAAGCAAT CCTCGAGCTG GGTTTCCACT 5040

AGCTGTATTT ACATCCCAAA CAGCAGAACT TTGAATAATA CTTTGGTTAC CACTAGACTC 5100

CCTCTGTATT TGAGCTAAAA TGCCGTTTAC TTCTGCATCA CTTACTTGTT GATTCATTTG 5160

TTTTGCAGCT TTTCTAACTT GAGGACCCCA TCCTGCAGCA CCTACCGCCA CACCACCAGC 5220

AGCTCCGCCA TAAGTATCAG GATTTACGTG TTGGCCATTT GGTCCGCCTT TTCTAAGTtC 5280

ATAATGAACA TGAGGACCAG TTGACCAACC AGrAGrACCT ACATCACCGA CGATTTGACC 5340

AGCTTTAACT TTATCGCCCA TCTTAGCTCT TATGCGGCTC ATATGTCCAT ACATAGcCCA 5400

TAGaTTATCm GcAACTTTaA TTCCTACGTG CTCACCTAAT CCAaTGGnAA GAAGATTGAA 5460

CCCaATCAAC CAAACCaGGA TA^TGAGCTG GaATAGGCGT cCAGTTGGAG CAGCATAATC 5520

AATACCGTGT GAAAATCCCC ATAGAGGCCT GGTCGCTTAC CATAATCAGA AGTACGAACA 5580

AATGGTGATC CAAAATGTGG AGCAAAAGCA CCTGAGCCAA AAGAGACACC ATCTTCCGAA 5640

TCAAAAAAAC CTTTGAAAGC TTCAAGTTTA TTTTTCACCC ATTCGGCACT GGTGTCTTTT 5700

AGTTTGTTCA TGACACCAtG TCCCAAACCT tGGATATTTT TACCCTTTTT ATAAGkGTTA 5760

TGcTTATTAA ATAAGcCAGT CACTTTTCCa ATCGGGTCTG ATAGCCAATC TTTTGcTGTT 5820

TCAGCTATAT CTTTAGTTTT TTCTATAGCT TTCGTTCCTA CTTCTTTTGC TTTATCTACT 5880

GCATTAGATC CAAAGTCTTT TAATTTATCA ACAGTATTTG CACCGAAATC CTTAGCACTA 5940

TCAACAAAAT CAGAAAGCCC ATTTCCTTTA CCTTTATGGA AACCTGGTAA AACTGTTCCT 6000

TGCCCCATAC CACCATTTAA TACAGCTTTT GTATCTGAAT GGTTTAAAAT CCGTTCACCC 6060

GATGTTACAT TAGTAACTTC TGGACCATTT GATCCTAATA AACGCATTGA AGAAGTGCTC 6120

TTGTTATATG CTAATTCAAC ACCTTCTTCA CCAACTAGAG CTTGACCACT GTATGATGCA 6180

CCACGAGATC CTGTGCTAAA AGAGCGACCA TGCTTGTTTT TAGTAGAATT AGGTGTATTC 6240

GGTTTCCATT CTGGTATTTT AGGAATACTA AAGAAATCTA ATACGACATT AATACCACCT 6300

GTAACAGCAT TTACTTTATC GGCGAGTCCT GTTTTAAAAT TATCCCAAGT TTCTAATGCT 6360

CCAGACACAA CTTTATCCAT AGCTTCACCA AAGGCCTTAC CAAAACTTTT AGCGCCTTCA 6420

ACTGTGCCGT CCCAAATTTC TTTAAATTTC TTTTTGGAAT CCTTTTTAAA GACTTCCCAT 6480

TTAGATAATG TTTGTCCAGT TTCCCAGTCC ACTTGTTCAA GATGACCTTC CGCTTGCtTT 6540

TTAGCCTGCG TTACAACATC TTCATGTTGT TTTTCTGCTT CTTTGACTAC ACCGTCTCTT 6600

TGTTCTTGGG CTTTTTTTAA TGCATCATCA TATTGTTGTT GAGTAATCGT ACCATTGACA 6660

TATCTTTCTT CATCTAAAAT GCGCTTTGTT TCTTCATATT TTTCATTTGC AGATTTAATT 6720

GTACCTTCTT TTGCTTTATA TGAAGCTGCT ACAGCATCAG CAGCCTGTTG CGCGCTAATC 6780

TCACCTGATG CATTTTTCAA ATTACCCATA ATGATTTTTT GCTCTTTAGC TGAATCTGAT 6840

AATGCAACAA CAGCTTGATT TTTCATATCT TCATGAATCG ACTTATTAGC AGCAGCATGT 6900

TTTTCTTCAA CAGCACGTAC TGCAGCAGCT GAGGTTTTTT CTAACGACTC AATTTCTTTT 6960

TTATCGTTTT CTGATAGTTC TCGATTTTCT AATCTTGCTT TTTCTTTGAT TTCATTGATA 7020

CGATTTGTGT AATACTCTGT AGCAGTAATA GATTCATCAT ATTCTTGTTT TTCTAATTTC 7080

TTGAAATCTT GAACTTTTTC AGAGAACATA TTTGTTCGTA CTTTTGCTAA TTCATCAGCA 7140

GCTGCCTTGG CTCCTTGTAC ATCCTTTtCA TTAAGAATTC CCATACCAGC TAATTTATCA 7200 

AGATTGCCAT TAGATTTATC TkTCTkTTTA TkCAGCTGTT TAGTAACTTG ATTCTGCATG 7260 

TCTTCTAAAG CTTTATATCG TTTAGTAACT TCTTCCATTG GTAAGAACTC GTCCTTCAAT 7320 

TTTACCTGTA GTGTAnT 7337 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1160 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

ATAAGGAGGA AATCTGATAT ATGAAAACAA AACGCTTTCA GCGCACGGCG GCGATGCTGC 60 

TGTGCATTTT TATCTGCGCG TGGACAATGT TCCCCCTGAT GGGTGCGGCC TTTGCCkCsG 120 

AGAATGGGCT GCATGACACC GCTGAACCGG AAATCACCTG CTTCTGTGAT ATTCACTGCA 180 

CGGCGGATGC CCTCAATGCA GACTGTCCGG TTTGTACACA GGATGTGAAC GCCTGCACCG 240 

GCACGGAACC GGAAGCGCCC ACCGAAACAG AAACACCCAC CGAGCCGGAA ATCACCTGCT 300 

CCTGCAATAT CCGCTGCACG GCAGATACCC TCAACGCAGA CTGCCCGGTT TGTACGCAGG 360 

ATGTAGCTGC CTGCACCGGC GCAGAGCCGG AAATTGCCTG TTTCTGCGAT ATTCGCTGCA 420 

CGGCGGACAC CCTCAACGCA GACTGCCCCA TCTGTGTGCA GGAGGCAGCC GCCTGCATTG 480

GCGCGGACCG GAACCCGTCT GTACTTGTGA AACGCGGTGT GCGGCAGACG CACCTGACAC 540 

GGATTGTCCC ATCTGTACGC AGGATGTAAC CGGCTGTACC GGCAAAGAGC CGGAACCGCC 600 

TGCCAAACTG GTCTGTATCT GCGCCGCCCA CTGTGAACCC GGCGCGGTCA ACGCCGACTG 660 

CCCCGCCTGC CTGTTGAACC TGGCAGACTG TATTGTGGAA GCCCCCACGC CCGTCCCGGT 720 

CTGCGTCTGC GAAAACCGCT GTGAACTTGG TGCGGTCAAC GCCGACTGCC CGATcTGCAG 780 

AACCGACCTG GCCGGATGCA CCGGcAAAGC CCCCGcGCCA ATyCCTGCAT CCAATCTCTC 840

CATCAAAATT ACACCGCCCT CCGGCTGGGC AACCGGCAGT GCTGCCGCCG AGTTTCGCAT 900 

TACCGATGAA GCCGGGAACG GCTTTGCGCT GGTGCAGGTG AAAATCGAGA AAAACGGGCA 960 

ATGGCGGGAT GTGACGGACA GCTTGAAGCA GAACGAAAAC CGCTGGTATG GGGAAATCGA 1020 

CCTTTCGGAG AACTGCACCG TCTATGTCTG CGCcAACGGC CATGATGGGA AgTGTATGAG 1080 

AAtCCCGCTA TATTGAGTGT TTCGACCGmA CggcCTGCTT ACGCGcCGGC ATkGAcGGcG 1140

GtTsTcCGTG CGGAACAAGC 1160
(2) INFORMATION FOR SEQ ID NO: 144:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 



TTTACnTGAA CCAATTTCCA ATGTGGATGG GATGATTAnG CCAAACGTTC TGGCCATTnC 60

CCTGGCCATT TTTCAGGACC TTCTACCACA TATTGTGGAT ATTTTGCCAT ATATTCCGCT 120

GCTCCGGGTC TATTTCTTAC ACGCATATTT AAAATTCCTT TCTTTTAAAA ACAGACTGGT 180

GAAATTCGCC AGCCTGTGTG GATGTTAAAT GAATTAGTTT AAATACTAAA CATTCGTTTT 240

AATTGTAAAA TTTCTGCAGT CATTGCTCTT CTTTCGCCCT CTTGGTGATG ACGAACAATC 300

TCTTGTAAAA AGCTAAATAA ACCATACCAA TAAATTCGTT GAACCGTACT TTCATTCGGG 360

ATTAAACCAT AAGATAGTAA CCATTGATTC CAACCGGAAC GAGGGACATA ATGACTCAAA 420

ATCATTCCTA AATCTACCGC TGGGTCCGCG ACCATCACAG AATCCCAATC TACTAAATAG 480

AGATAATTAT TCGAGACTAA CCAATTCCGA TGATTCACAT CGCCGTGAAC AGCTACATAA 540

CTAGTTTCTG GATATTCTGG TAAATGATTA TATAAATAAG TGATTACTTC TTTTAAATAA 600

CTGTTTCGTT TTAATTCTTT AGGTAGTTGA CGCTTATACT CTTTCAACAT TGTTAGCGGG 660

GTACAAACTT CGCCACCAAT TTTTTTCAAC ATGTCTTTCA GCAAATTGGA ATGATGCAAT 720

TGGTATAACA CGTCAATCAC GTCATTGCGT TTGCCAATTT CGTCCGCATG TAATAAACGT 780

CCTTCCAGCC ATTCTTGGGC TGTTAGTACG TCGCCATTCG CTGTCCGCTT TGTCCAAATC 840

ATCTTCGGGG CTAACCCTTC TTTAGACAAT GCTGCAAGAA GTGGAGAAGT GTTTCGTTTA 900

ATGAACACTT yCTCGGTTTC TTkTATTCCC ATGTATGTTT GACCAGTTGC GCCCTTTATC 960

GGCTGCAGAC GCCAGTCTTT ATCCAACTGA AATTCCATGG AAATTATCTC CTATCTCAAC 1020

ACAATTTTAT TGCAATTTTT ACAGCAATTA CCATTTTAGT CGCCAAACCT TGCTTCGTCA 1080

AGCTTCTATG TGGTTCTTTT GTAAATTTTG CTGGGTTTcT CATCTTTATA ACTTATTCAA 1140

GCGCATTGGA ACATACATTT TAGTGAAAAT CACAACTTCT ACAATTAAGG CACCCAGAAT 1200

GATCAAGCTA TTTTGCAAAC TGCCTAAACG AAGGCAGACA ATCCCACCAA AAATAATGGC 1260

CGCAACAGAT AATGCGCCGC TAATTAACCG GCGAATCGCT GCTTGTTTTT TTTCTATTGA 1320

AATTGGATAT AGCTGTGTCA TCACCATATA ATCAAATTGC GTGTACATGG GAATCAACTG 1380

GAACCCGATT AAATAGACAA ACAATAAAGC AACTACCAAC GCAATCCAGA ATTCTTGTAA 1440

AAAGAAGAGT AGCACGCCAC CAACCAAAAT CAAACGAAAC AATAAGCCGC TATATTCGCT 1500

ACCTCGTAAA AAGCTCCGAA TGAATAAGTA AGCATACGTG TTCTCAGATG TTTTTTTGAC 1560

TACTCCGAGG AGAGGATCTA AGTATTTACG GCGTTTCACT GTACTACTCA CTTCAGGAAT 1620

ATCTGTAAAC AGCTGAATGA ATTTATAAAT TCTATGCATT CTGTTTTGTT CTTTTTGGAT 1680

CATTTTTTCC CAATCCAAGA AACTCTGTTT CATCTTATTT TGCCAAATAC TCGATAAACA 1740

AACGGCCAGA ATGATGCCAC CAATCACACC AACCCAAGGC GTCCAGTAAA TAGCTCCCGT 1800

CATAATCAAG AGACTACTGA TCAACCAAAC AAACCACCAA CTACGGTAAG TGGTTGACGA 1860

AAGATGATAG AGCCCATAGC TTTGTAAACT AAGATGTGTA TAAATCATAC AAACTAGCAT 1920

GACCAAAAAT AAAAAGAAGG TTTGAAACGA CCAACCTGTT GTGACTACAA TCAACGGCAT 1980

CAATAGGCCA CTTCCTAoAA AGCTGACTAC AATCGGTAAA AGTAAAGAGT AGCGCAgCGC 2040

TCGTTTCAGA TAAGCAGCGA ATTGTTTTTC TTTTGGTAAT AAAAACACCT TGTCTGGCTC 2100

TTCTGTCAGA GTAGCCAACT TACCAACTTG AATGAGTAGC AGCCAGCCCA ATAGAATTAA 2160

CGGTCTGCCC CAAACAAAAT CACGAGGTAA TTCTTTTAAC AACTGGGAAT AGTACAGACC 2220

TAATCCGCCT AATAAAAACA TACAGACTAA AACAAAATGA TCATTTAAAA TATAGCGCAA 2280

GTACTTGGAC ATTTTTTTGA AATGCCGTGA CAAACGTTGA CTAAAAAATT CGCCCATGTT 2340

ACCCCACCTT TTCTTCCTTC GTCAAGGCAA GATAAATATC ATCTAAAGAA GACTCTGGTA 2400

AATTAAATTC TGCACGTAAT TCAGCCATTG TACCGTTTGC CCGTAATTTC CCTTCGTGTA 2460

GCACCACAAA ACGATCACAA TATTTTTCAG CTGTTGCTAA AATATGCGTG GACATTAAAA 2520

TCGCTGCCCC TTGCTTACGC ATCGTATCCA TTAATTCTAA TAAAGCATGA ATTGCTAAAG 2580

GATCTAAGCC TAGAAAAGGT TCATCAATAA TATATAAACT CGGTTCAATT AAGAATGCAC 2640

AAAGTACCAT GACTTTTTGT TTCATGCCTT TTGAAAAATT AGCAGGAAAC CATTCCAATT 2700

TATTGTCTAA ACGGAACGTT TTTAGTAATG GTTCTGCTCG TTTGAACGCT TCTTCTAATG 2760

GAATATCATA AGCCAAGGCC GTTACTTCGA TATGCTCTtT CAATGTTAAT TCTTCATATA 2820

AAGAAGGTGT TTCCGGAATA TACCCGATTT TTTTACGATA TTCTTCAGGG GCTTGTTGCA 2880

ATGTTTCCCC GTCAATCATA ATTTTGCCTT TTTGCGGCGT AAGTAACCCA ATAATATTTT 2940

TGATGGTGGT ACTTTTCCCC GCACCATTCA AGCCAATTAA ACCAACCATT TCACCAGACT 3000

TGACGTCAAA ATTAATATCT TTTAAGACAG GAATATGGCC GTAACCTCCT GTTAAATGTT 3060

CAATTGTTAA GCTCATTTCT TTTGCCTCCA ACCTTTTCTC TCTACCATTA TTAAACCCTA 3120

AGCGAACCAA GAATTCAAAA ATTACTCCCT TAATTAAATG GATTTTTTTC TTTCTCTTAT 3180

TATACACGAA ATGAAACGAA CTTTTTAGCG AACTCACGTA CTTTTTCATC CaCAGAGAAA 3240

TCAACTTGTG GTAAAGTAAA GGAAAGAACC TAGAGAAAAG AGGAATTTTT CATGAATGAT 3300

TGTATTTTTT GTAAAATTAT CAACGGCGAA ATCCCAAGTT ATAAAGTCTA TGAAGACGAG 3360

AAAGTCTACG CGTTTTTAGA CATTACACAA GTGACCAAAG GGCATACCTT AATGATTCCA 3420

AAACAACATG TTGCTGATAT TTTTGAATAT AATGACGTTT TAGCAAGCGA CGTCTTTGCT 3480

CGTATTCCTA AAGTAGCACG AGCATTAGAA AAAGCTTTTC CAGAAATGGA AGGATTAAAC 3540

ATTTTAAATA ACAATAAGGA AGTGGCTTAT CAATCCGTTT TCCATTCCCA CGTTCATTTG 3600

ATTCCCCGTT ATTCAAAAGA AGATGATTTC TCTATTCATT TTQGCAACCA TCAAGAGGAC 3660

TATTCCGCCG AAGCGATGCA AGAAATTGCT GAGACCATTG . CAAAGCAGGT GAACTAAATG 3720 

AAAAAATTTA TTAAAGGTCT CTTTTTTGGC GCAgCTGCTG GTACCATTGG CGGCTTGTTG 3780 

GCTGCTCCAA GAAGCGGGAA AGAAACACGC CAACATTTAA TCAATGAGCT AGAGGACTAT 3840 

CGTTCACTGA AAAATCAAGT CACGAATGAC TGGGATCAGG TGCAACGAAA TTTGGCAGTT 3900 

GTCGAAGAAA ACGTTCCACT GGCAACTGAA TTTTCAAAAG ACCTGCAACA AGAAATCACT 3960 

GATTTCAAAT TCCAAGCGGA ACCTCGGATT GCTCAAATTA AAGAACAAAT TGCTAAAATC 4 020 

ACGGCTGAAT TACCTGACAC ACAAACCAAC AAGCAAAAGT AAAGTCTTTT ATGTTAATTA 4080 

TTTCTTTTGT TATTAAAAAA ATAAAAAGTG TGACGTAAAT ACGAATTTTG TTTGAAACTA 414 0 

TGCCTGAGAC TCTTTCTTTT TCAAAATGAG GTAtGGTATA GTTATAACAG AwATAAAACT 4200 

ArAAAAAACA GGAGTGCATA AGAGAATGAa GAaAAAACTA ATCTTAGCTG CAGCGGGCGC 4260 

AATGGCCGTT TTTAGTTTAG CAGCGTGTTC AAGCGGTTCA AAAGATATCG CAACAATGAA 4 320 

AGGTTCAACA ATTACTGTTG ATGATTTTTA TAACCAAATT AAAGAACAAA GCACTAGCCA 4 380 

ACAAGCGTTT AGCCAAATGG TTATTTATAA AGTCTTTGAA GAAAAATATG GCGACAAAGT 4 4 40 

AACTGACAAA GAyaTTCAAA AAAACTTTGA CGAAGCCAAA GAACAAGTAG AAGCACAAGG 4 500 

CGGAAAGTTC TCTGATGCAT TAAAACAAGC TGGTTTAACT GAAAAAACAT TCAAGAAACA 4 560 

GTTAAAACAA AGAGCAGCCT aTGATGCAGG TCTAAAAGCC CACTTAAAAA TTACAGATGA 4 620 

AGACTTAAAA ACAGCTTGGG CAAGTTTCCA TCCAGAAGTA GAAGCACAAA TTATCCaAGT 4 680 

TGCTTCAGAA GAT GAT G CCA AAGCTGTCAA GAAAGAAATC ACTGACGGCG GCGATTTCAC 474 0 

AAAAATTGCT AAAGAAAAAT CAACAGATAC TGCTACGAAA AAAGATGGCG GTAAAATTAA 4 800 

ATTTGATTCA CAAGCAACAA CTGTTCCTGC CGAAGTTAAA GAAGCTGCCT TCAAATTAAA 4 8 60 

AGATGGCGAA GTGTCAGAAC CAATTGCTGC AACAAATATG CAAACCTACC AAACAACCTA 4 920 

CTATGTAGTG AAAATGACGA AAAACAAAGC AAAAGGCAAT GACATGAAAC CTTATGAAAA 4 980 

AGAGATCAAG AAAATTGCTG AAGAAACAAA ATTAGCCGAT CAAACATTTG TTTCGAAAGT 5040 

CATTAGTGAC GAATTAAAAG CGGCCAATGT GAAAATTAAA GATGATGCCT TCAAGAACGC 5100 

TTTAGCAGGC TACATGCAAA CTGAATCTTC AAGCGCTTCT TCAGAGAAAA AAGAATCAAA 5160 

ATCAAGTGAT TCTAAAACAA GCGATACCAA AACAAGCGAC TCTGAAAAAG CAACAGATTC 5220 

TTCAAGCAAA ACAACAGAAT CTTCTTCTAA ATAAGCATAA AAAAGAGAGT GGTCGATTGA 5280 

CCACTCCCTT TTTTTATTTA TCTTTTGGAA CATAGAAACT GCGGTTATCT AAACCAAAGA 534 0 

TTCGTTCCGT ATACTCCCCa GGCTGTGTTT GACGAATTGC CCCTAGCATt CATTTGAATG 5400 

GACGCATCCA • TGGTATCAAT TTGATGTAAG ATTTCCGCTT CCATAATTCT TGGTCGCACA 54 60 

GGTGAGCCAT ATTCTAACAA TCCATGGTGG GCTAAGACCA TGTGCCGAAG AACGACAACA 5520 
TCTTCTTCTC GATCATCAAT TTTCAAAGCC AAACACGCCT TGGTAATTTC CTCATCCACC 5580 
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AAAACTAAAT GACCAATTAA GTTCCCTGCC ACCGTATATT CTQTCGTAAT AGCACCAGAT 5640 

AGTTCTAAAA CTTTTCCTAA GTCATGTAAA ATAATCCCCG CGTATAATAG TGACGTGTTA 5700

ATTTCAGTAT ATTCCTTAGC AATTGCTTTG CCTAGGCGCA ACATTGAGAC AGTGTGGTAA 5760 

GCTAAGCCAT TCGCAAAAGC ATGATGGTTC CGTTTGGCTG CAGGAAATTC ATAAAATTCT 5820 

TGTTGGTATT TCGTCATGAG ATAGCGCACA ATACGATTCC AATGAGCATT GGTAATTTCA 5880 

AAAATCACTT GATTGATTTC CTCGACCATC TCTTCTTTTT TCAAAGGCGC TCGTTCCATA 5940 

TATAAACTTG GCTGGCTTGG TTCCTCTGCT GTCGCTAAAC GCATGTGTAA AATTTTTACT 6000 

TGGGGTTTAT TTTGATAAAC TTCTCGtTTC CCATTCAGTA AAACAACGTT TCCCGCAGTA 6060 

AATCGTGTAA TTTCATCCTC AGAGGCGTCC CAATATTTtC CGTCAATCGT GCCAGATGTG 6120 

TCTTGAAAGG TAAAGGCAAT AAACTTTTTC CCATTTTTAG CTACACGTAC ATCGGCATTT 6180 

TTAATTAGAA CAAATGATTC AAAAAGTTCT TCTACTGTTA ATTCACGTAA TTTTTTCACT 6240 

TGACGCTTCC TCCTTCTAAC GCAATTACTC TTTGTTGTAA GTCCTGATAA TAGCTTACCA 6300 
TTTCTTGGTC AGAAGAAAAA CAAATGACTT GCTGTTTTTG ACCAAATTTA GTAAATAATT 6360
CAGCTAATTG GGCTTTTCTT TGATGATCAT AGTGTAACCA GCCATCATCA ATGATAATTG 6420 
GACAAATCAG TTGCTCGCCT TGGACTGCTA AAAAGGCAAA ACGAACAGCC ATCATCAATT 6480 
GATCTTTAGT GCCAGTTGAT AATTCATGTA AATAAAACAT TTCTTGTTCT TTCGTCACCG 6540 
CAACTAGTTG ACCTTCGGCT ACTTGGATTG ATTGATAACG CTGATTTGTT AAAAGTGCAA 6600 
AGTAGCTTGA TGCATACTGT AATAAGCTGG GCAATTGTTG TTCCGAAAGC TCTGTTAATA 6660 
GGTCCATTAG CAGTTGTCCC GCTAGTTGAT ACCCAGACCA ACGTTGAGCT AGTTCTTTGA 6720 
TTTCCGCTTT TAAAATTGCT TGTCGTTGAT ACAACTCATC TAAGGTACCA TCCGTCATTA 6780 
ATTGTTGTTT TTCATAAATC AGCGCTTGAT ACTGTTCTTG TATTTTATCT AACTGGCTTT 6840 
CTTCCAATGC TTGTTGctGC GCCAGTTGTA AAGAACGGTC TTTTAACGTT GCTTCTGTTA 6900 
CCGCTTCTGG AAACAGATTA CCTAATTGCG TCGCTAATTC TTGCCCACGA CCAACTTGGG 6960 
CTTGCTGAGT GGTCAGCTGT TGAATCCTAC TTGGAACATC AGCGATTGAT AGAATCCGAT 7020 
AACGAATTAG TAAAGGCTGA ATTCGTGTTA ATGCTTCTTG CTGTTTAGTT TTCAACTCTC 7080 
GTAACTGCTG TTTGAGATAG CCATCTGCTT GATACTCTTG AGCAAAGCGA ACTTTCTCCA 7140 
TTTCTTCAGC AAACCCTTCT AAAAAGTGTA ACTTTTCGGA AAGGCTCTTT CCTTGCAAGG 7200 
GTAAATGCGT CATCAATGGG AAGATTTGTT CTTCAAATAC TTGGCGTTTT TGTTGATTTT 7260 
CCTTCAATTG ACTCATGAGT TCTTCATTTG TTTGAACAAG CAATAAATAG CGCGTAATCA 7320 
ATTCCCGCTG ATTCATTAAA GTATCAATTT GTTGCATTTT TCCTAATTGA TGGACTTGGG 7380 
CTTGCTCCGC TACTGCTTGT TCAAGTTGTT CCACTTGCAG TTGTGCCTCA CGTTCTTGAT 7440

TCAGCGCCTG TTGGAACTGT TCATTTAAAT AATCTAGTTG CGAAAGTTTT GTCTGCCACT 7500 
CTTCTTTCAC TTCATCAGAG GAAGCTTCTT TTCTAGTGGC GTAgcTAAGC ACTTCTTTCG 7560 
CGATAAAACT AACTCCCACG AGAACTAGGC CCCATTTTAA AGQGCTTGGC AAAAAGAAAC 7620
CGAGTAATAC AAGGATACTT CCTAGCAATA ACCAAAGTGG TTGACTTGTT TTCTGTGCTT 7680
TTTGGGCACG CGTCGCTTCA CCAAAAAATG CGGCATGCGC TTTTTCAAAA TCTGTTAACT 7740
GGTCTTCCAG ATTTTCAATT TGAGAAGCCA AGAATGCTTT CTGTTCTTGT CCTTTTTGAA 7800
GAGCTTGCGT ATACGCAACT TGTTGTTGGC GCCACTTTTG AACGGTTTCG TTGTCATAAA 7860
AAAGCTGTGG TGGTCGTTCA GGCGACCATG ACCATTCGTT CTCTAAAAGG ACCATTTCTT 7920
GACGATTTTG CTCAAAGGTT TGGGTCATCC ATTCCGATTC CGTCATCAAT TGTTGCATAT 7980
CATAGCGTTG ATTCAACAAT TGTTGAATCA ATGCTTCTTG TTCTAGGTAA AAAGCATATC 8040
CTTCTGGAAC ATCGAGCGCT TCTGACTGAA GAGATAACTC TTTGGTCAAA CGTTGTCGTT 8100
CTTTGTCTAG GTACATATAT TCTTGAAAgG TTTCTTCTAA TTGaACTTGG TCTTCTTTTA 8160
CTAAAACAAC GTTCTCTGTT TCTTcAGGTA GCGACTGCCA TTCTTCGTAC aAGGGAAAGT 8220
GaCGTTgcTg TTCCgCAACC TGTAACAAGT cCTGTTGCGT TTCTTTTAAT TGCTGTTGCA 8280
GTGtCACTCG CTGTTGTTCT GTTTCGCGAA TCGTTTCTTC TAATTGTTGA AAAGTTTGTT 8340
GcTGCGCTTC TTTCTCTTGA ATTTGCTCTT TTAGTTCTTG ATAAGCCGCC AACTTCTGAT 8400
TGAGTAATGG CTGCTGTCCT TTACCTTTAA AGATTTTTTG TGCTTCTTTA AAGTACGCAT 8460
TTCGGTAAGT CAAAAGTTGT TGACTACCAG AAACACCAAT GGCTAACAAG GATGTTTGCA 8520
GTTCCTCTTC TGTCAACTTG TCACTCGTAA TTAATTGTTC TTGTTGGAAA GTAAAAACCT 8580
CTTGAAATAA CTTTTTCGTT AGCGGATGTA ACATTTGCTG TAAGGTTTTT TCATCCCCCA 8640
TTTGTTGTTG ATAATAAATG ATTGCTTGCC CTTTATTTTT TTCTTTAAAA CGCTCGACTT 8700
GAACTTCCCC AAACACAGGA TGTGCCAACC ACAAACGACC ACCATACGAG CCACCATTTT 8760
TAGGTGCGTA ATCGCGTTTT CTTTTGCCCT TTGTGGGAAA CCCAAACAGC ATTGCTTGAA 8820
TAAATTGATA CAATGTGGAC TTGCCAACTT CATTTGCGCC GTACAGCAAT TGATTACCAG 8880
ATTCAAATGT AATTTTCTGT TGTTGCCATT TTCCAAAGCC CACAATCTCT ACTGCGAGTA 8940
ATTTCATAGT GGGTCCTCCT GAATAGTAAA GTCTTCATTA ATTTGCTCAT GTGCTAGGCG 9000
CAAGCTCTCt AATCGCCACT GCTCATCCAT TGAAAAAATT CCAGCAAATT CTGGATAACG 9060
TAACAATTCC TTGGTGGTAT CTTCAAAAAT AACTGGATCT AAGTATGTTT TTTCTAATGC 9120
CGTTAATAAG TTGGGACTAG CGGCTAAATA GATCGGTTCT TTCGTTGGTT CCTCTGCTTG 9180
CAGTATTAGG CGATAAACAA AAAAATCACC TTGCGTTTTT TGTAAAAGCT GCTGCTGTAA 9240
GTAGCTTAAT AGTTCACCAT TGACAATGGC TTGCACCACG GTCACATCCA AATGTTGCGT 9300
TTCTTTCAGT GTCAACGCCA TCAATTGCTG TTGCTGATGC GCCTGCCACT TCGTCAGCAA 9360
ACTGGTTTCA AGATAAGACA AAACGCTTTG GGTGTCACGG CATTGCGCAA GAGATACTTC 9420
TTCATTCGTC CAATCTATTT CTGCGACTTT TTCAAATTGA ACAGTCGCTT GATTTTGGGA 9480
AAGTGTGACC ACTGCAACCC CTTGTAAAGA CTGTTCTTTT TTCGTGTGTC CTTGAGGTGT 9540
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TCCAGGATAA ACAATTAGCG GTTGTTCACT AACAATTTGC GGQTGATGAA TGTGGCCCAA 
TGCCCAATAA TTATAGCCAG TGGCTTTTAA ATCCTTCCAT GTAAACGGAG CATAAXTTTG
TTGCGCTTTC CGACTGGTGT CTCCATGATA AATCCCAATA TGAATATCAG ATTCTGCTTG 
TTTGATAGGA AATTCTAGTG CTTTATTTTC ATCAATCCAG GGATGCTCAT AACTAAACCC 
TGAAACAGCC ACTTGTTCGC CCGCTTTCGT TTCAAAATAC AGTGTCTCAA CCATTTCTTT 
TTCAAATAAA AAGACATTGT CTGGAAAAGA AAACCAATAG CGGTCTTTTT TATAGTAATC 
ATGATTCCCA AATGTTAGAA TCACAGGGAT TTCTGCTTGT TTCAACTGTT CTAATGCAGC 
CATCAATTGG GCTTGCATTG AAATAGAGGT CTGACTTTGA TGAAACGTAT CTCCTGCAAA 
AATAACCAAG TCCACCGCTT CTCGAATGGC AACTGTTACG ATATTCATTA ACAATTGTTG 
ATTAGCTTCT CTTAaGCGcT TCGCTAAGGC TGTCGGGATC CCAGTTAACC CTTCAAAAGA 
CCGATCCATG TGTAAATCAG CTGTATGCAA TAGTTTCATC GTTTCCCCTC CTTTTCTCCT 
TCCTTTATCA TATCATTTTC TGCCAGCTTG GTCAGTAAAA ACATACGAAA ATTTGTTCGT 
TTTTTTATTT TCAGTAAAAG AAAAACTCTG ACAAATTGTC TTATAGACAA CTTGTCAGAG 
TTCAATTACG TAAATTAATC GTTATACATA TCACGCACAG GAGTCATGAT AATTCGGTTT
AAATCATTGA CAACTAAGCT AAACGCTTGT TCTTTTTGCA TTAATTCGTT GATAACTGCT 
GTTTCTTGCA CTTTCGTTGC AATGCTTTGT GCTTTTTCAG CATCTTCATC CGTGAATTCT
TCGCCGCGCA TTTGTTTTTC TTGTAGTCCT TGTTGGAATC CTTGGAATTC TTTAAACAAT 
GTGTAAGCTG CTTCGTCTGC TTTTACTTCT TCATACGCTG CTTGTAATGC TTTAAATTCG 
TCTAACTCAC GTAGTTCACG TTCAATCTGA TTGGCTGTAT CATAAATGTT ACTCAATGGA 
ATCCCTCTTT TCTCAATAGT TTCATCTTTA TTGTACCATT TCTCTTAGGT TTTCCCAACT 
AATTTCCTAA TAAGCCTCCG AAAATTCCTT CAATCTTGTC TTTGACTTTT TTCGCGCCGT 
CTTTAATTTG TCCCCCAACT TCATTCAATG TATCTCCGAA ATTGCTAGCG CCGTCTTGAA 
CAGAATTTCC AAAGTCTTTC AAGTTATCCC GCCAATTAGA ATTGGCACCA CCAGTTAAAC 
TATCGTTGCT ATCAACAACT TTCCCACCTG TGGCATAAGC ATCCGCGACA GGAAATTGTG 
CTTGTTTGAC TTGTGGCAAG ATGCCACTGG CTTGACTGTT GAACACCTGT GAGGCATACG
TAGCGCTACT GCCTTCCAAA TAATGGGTTT TACTGGTTTC TTGGAAGCCT AACCAAGTAG 
CAATGACAAC TTCTGGTGTA TAACCAATCA CCCACTGATC GTTCGTTTTA CTACTATCGA 
AGTTGGTTTC TGTCGTACCT GTTTTACCAG CCATTACGTA GCCTGCTGGT TGTGCGTTGA 
CCCCGCTCCC AGAACTAAAG ACCCCTAGTA ACATACTTGT AATCCCGTCT GCAACATCTT 
TGGTAATGAC TTGTTCTTTC TTGACTTTAG TATTATCAAC TACAATTGCG CCAGTTGAGT 
CGACGATTTT TGTAATTAAA TGCGTTTCAG TTTTATAGCC TTCATTGGCA AAGGCACTAT
AAGCTCCCGC CATAGTTAGT GGCGAAACAC CCGTTTGTAA TCCTCCAAGG GCTAAACCAT 
AGTACCGATC TTTTTCTGAT AACGGAATGC CAAATTTCTC TGTTTTTTCA TAGCCTTTAT 
CTAAACCAAT TTGATGTAAT AACCATACCG CTGGTAAATT TAAGCTTTCT CCTAAGGCTT 11580
GATACATAGG AACTTCICCA CTGTACGTGC GGCTATAGTT TTGCGCTGGA TAATAATCTT 11640
GTGGTTTATC TTCTAAAACA GAGTCTGGTT TATAGCCAGC CTCTAAAGCT GGTGTATAGA 11700
CAGAAATTGG TTTTAAGGAC GAGCCTGGTG AACGTTTTGT TTGCGTTGCA AAGTTAAAGC 11760
CTCGATAAAC GTGTTCCCCT CGACCGCCCA CCAGTGCTTG CACGCCGCCG GTTTTCGGAT 11820
CAATGGCTAC AGAGCCACTT TGAACCATGG CACCGTCTTC TGCATTGGGT GGAAATAGTG 11880
CATCATTTTT ATAAGTTGCA TCCATCGCAT CTTGATACCC CTGATTTAAA GAAGTGTAAA 11940
TCTTGTATCC TTTATTTAAA ATATCTTCTT CTTTAAATTT ATAACGATTG ACCGCTTCAT 12000
CAATTACCGC ATCAAAATAA TATGGATATT TGTAACTATT TTCGTCGCCA ACATAGGTGT 12060
CATTTAATAG ACTCGCTAAG TTAACACTCG CTTCTTGATT GGCTTCTTCT TGAGATAGTT 12120
TTTTATTGTC GACCATTAGT TGTAAAACGG TATTCCGTCG CGCTGTGGCA TTGTCAATGT 12180
AATCAATTGG GTTATAAATT CCTGGCCCTT TTAACATGCC CGCTAGCGTT GCGGCTTCCC 12240
CCACTGTCAC TTGGGACGCA TCCACGCCAA AGTATTTACG AGCTGCATCT TGCACGCCCC 12300
AGACACCATT ACCAAAATAA GAATTATTTA AATACATTGC CAAGATATCT TTTTTCGAAT 12360
ATTTCTTTTC GATTTCAATC GCTAAGAATA ATTCTCTGGC TTTCCTATCT AACGTTTGTT 12420
CTTGCGTTAA GTAAGCATTT TTGGCTAATT GCTGCGTAAT TGTACTTCCG CCACCACCGC 12480
CACCGATTTT CCCGAAGGTC AATTTTCCAA CAACCGCACG GGCAATCCCt TTGATGGAAT 12540
ACCCTTTATG TTGATAAAAC CCGCGATCTT CAGTTGAAAT AACCGCGTCT TGAATATAAG 12600
GAGAAATATT ATCCAGTTCC ACAAAAGTCC CTTTTTGCCC GAATAACTTT CCGACTTCTT 12660
TTCCTGATTC ATCATAAACG CGTGTTGATT CACTTAGCCC TGATTTTAGG GTCTCAACGT 12720
TTGCGGACTT GGCTAAGTAA AATAAATAAA TACTGGCACC TAAAGCGACT ACCAAACCAA 12780
TTAACAAGAA AATTTTATTG ATTTGATACT TTTTCCAAAT CCGCTTTCTG GCTTGATGGA 12840
ATTGTATTAA ATAAGGCTTT 12860

(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
AACGCTAAAA CCTTTAATAA ATAAAGGTTT TAGCnTTTTA ATAAAGAGCC ACCTGCCGGG 60
CTTGAACCGG CGACCCCCAC CTTACCATGG TGGTGCTCTA CCAACTGAGC TAAGGCGGCA 120
CTTTCAAAGG CTAAACTAAC TCGAGGATTG CAACGAATGA AACCCATAGA GTTCATTTTC 180
AAAGGCTACA CTGCGACGAG TAATCGTCAA AGTTCCTTTT GAAAAATCGT ACAAATCTCG 240
TGCACAAGAA ATAGAATAAC GTATAGACGA AAGTTTGTCA ATTGCTCTTT CGTAAAAAAA 300
GAAAAATGAT TTTTTTCTAA GAAAAACTGT TGACAATCTC TTTTTTCTCT AGTAAAGTGT 360
AAATCAACAT CACAGATAAC TATTCATTCG TTGAAGAGAA GAGTAAATTT TGATGACCTT 420
CAAAAGAGAG TCCTGATTGG TGAmAAGGGA CAAGGTTCAG AAAAGTTGAm GATGGCCTCT 480
GAGAATAAAT TGCTGATAAG TAAGCAATTT CGGCTAAGCA CCCGTTACCA TGTGCACCAA 540
GTTAATAAAG ACTTGGATAA GAGATGAGAT ATAAAGAATC AGTTATATTT CAAAAAAAGG 600
TGGTACCGCG AATCATTCGC CCTTTACTAG CAAAACGCTA GTGAAGGGCT TTTTGCTGTG 660
AATCACTACG ACTGGCGCGA AGACAGAGTA GATGATGAAC AGTACAAGAT GACCGAAGGC 720
AAGTTGTGTC ATTTGCACGA ATCACTACGA CTGGCGCGAA gACAGAGTAG ATGATGAACA 780
GTACAAAATG ACCGAAGGCA GGTTGTATCA TTTGCACGAA TCACTACGAC TGGCGCGAAG 840
ACAGAGCAGA TGATGTCTGC GATTACTCGA CGCAGGATGG CCTTTGTGAG TGATGAACAG 900
TACAAGGTGA CCGAAGGCAA GTTGTGTCAT TTGCACGAAA GTGTTCCTAC GATTCTCGTC 960
ACAGTCAAAT CCTGTTGAGC ACTACGACTG ATGCAAAGAG AGAGAGTAGC TATTATCTGC 1020
GATTACTCGA TGCTTGAAGG GGTAGTAACT CTAGTCTGTT TGATTTTGAA AATAAACGAA 1080
AATCAGTTTA AAATGAATAG AAATAAAAAA GCAGTGATAT AGTAAGTACA AGACTTGATT 1140
TGTCTAACAG AGAGCCTAGG TAGCTGAGAA TAGGTGACAA AGACGTTTTG GAAAATGGCT 1200
ATTGAGCAAG CGCTCCGAGT GTTAAACACA AGGCGGCGAC GGGGGCTCCC GTTaTCGAGC 1260
TAGGGTATCG AACATATTGT TCCGTACTTG AAAAGGGAAT TTGTGTGAGC AAATTCTGAA 1320
CATGAGGTGG TAACACGCAA AAAAGsGTsC TCAACGATAC AACTCGAGTT GTGTCGTTGA 1380
GGATGCTTTT TTCTTTTTTA TGAAAAAATG CAGGTTGGTA AGAATGAAGG GGGATAAAGA 1440
ATGATTGAAC TAAAAAATAT CAGTGTGACA TTTCAACAAA AAAAACAAGA AATTCAGGCT 1500
GTCCAAGATG TTTCTTTGAC AATTGATAAA GGGGATATCT ACGGAATCGT TGGTTATTCC 1560
GGTGCTGGAA AAAGTACGCT AGTACGGGTA ATTAATTTAT TGCAACGACC AACAGCGGGC 1620
ACCGTTATTA TTAACAAAGA AAATATTTTA ACTTTTTCTA AAAAGGAATT ACGACAACAA 1680
CGAAAAAAGA TTGGAATGAT TTTTCAACAT TTTAATTTGA TGAAAGAGCG GACTATATTT 1740
AGTAATATCG ATTTTTCATT AAAATATTCT GGGCTAAGTA AAAGTGAACG GCGTCAAAAA 1800
ATCAGCCATT TATTAGAGCT AGTTGGACTA AGTGAAAAAC GGGACGCTTA CCCCAGTCAA 1860
CTATCAGGAG GGCAAAAACA GCGGGTTGCG ATTGCCCGCG CTTTAGCAAA TGATCCTGAA 1920
ATTTTACTTT GTGATGAAGC GACGAGTGCG TTAGATCCAA AAACAACGGG ACAGATTTTA 1980
GCCTTGTTAA AAAAGTTAAA TCAAGAACTG AACTTGACGA TTGTCTTAAT CACCCATGAA 2040
ATGCAAGTGG TTAAAGAAAT TTGTAATAAA GTCGCGGTGA TGGAAAATGG CTGCGTTGTC 2100
GAATCAAACG ATATTGTTTC GATTTTTAGT CAACCCCAAC AACCTTTAAC CAAAGATTTT 2160
ATCCGGACAG CAACTCATAT TGACCAAGCA TTAACAACAA TTCTTGAACA TCCTAAATTA 2220
GCTGATTTAG ATAAAAATCA AGAATTAATT GAATTTTCTT ATGTGGGAGA TCAAACGAAT 2280
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GAGCCATTAA TTGCACAATT GTATAGCCAA TATCAGGTGT ATAGGAATAT CCTTTACGGC 2340 

AATGTCGAAA TTGTTCAAAA TGTTCCTATT GGCCATTTAA TTGTGGTGCT TTCTGGTGAT 2400 

GAGGCCCAGC GTCAACAAGC CCTTACTTAT TTAGCGAAAC AAGGCGTGCG AACCAATGTT 2460

TTGAAAACCT ATCAGCAAAC GAAGCAAAAG CAGAATTTAC AGGTGATTTA AAAAAGGAGT 2520 

GAACTGAGAA TGTATCAATT ATTTGAGAAA TATTTTCCGA ATGTTGTCCA ATTAAAACAA 2580 

GAGTTTCTTC AAAGTACATG GGAAACATTA TACATGGTTT TTTGGACAGC ATTGATTGCT 2640

GGCGTGTTAG GAGTCTTGTT GGGTGTCGTG CTTGTTAGTA CTGGCCCCAG TGGTGTTTTG 2700 

AAAAATCCAC CCCTGTACAG TGTCTTAGAA AAAATTATTA ATGTTTGCCG CTCTATTCCT 2760

TTCATTATTA TGCTCGCACT GATTCAACCA TTAACACGAA TTTTGGCAGG AACGACAATT 2820 

GGTACAACCG CAGCGTTAGT CCCATTAGTT ATTGGCGTAA TCCCGTTCTT CGCGCGCCAA 2880 

ATTGAAAATG CGTTATTAGA AGTGGATCCT GGCGTTATTG AAGCGGCAGA AGCCATGGGG 2940

ACGAGTCCCT TAGGGATTAT TTTTAGGGTT TATCTAATTG AAGGGTTACC AAGTATTATT 3000

CGTGTTTCAG CGGTGACAAT TATTAATTTG ATTGGATTAA CAGCCATGGC AGGAGCGATT 3060 

GGAGCCGGTG GTCTGGGCAA CTTAGCGATT ACTCGAGGAT ACAATCGGTT TCAAACCGAT 3120 

GTGACATTTA TGGCCACGTT AATTATTTTA ATTATGGTAT TTATCAGTCA AGCCATTAGT 3180 

AATCAATTAA TCAAAAAAAC ATCACATTAG AAAAAAGGAG AAATGACAAA TGAAAAAATT 3240

TAGTAAATTA ATTGGACTTA TTGGGGTATT AGCTTTTACG ATTGCAGGTT GTGCATCGGG 3300 

GTCTGTGAAG GATACTAAGA CAGAAACCGT TAAACTAGGG GTTGTAGGAA CAAAAAATGA 3360 

TGAATGGGAA TCGGTCAAAG ACCGTTTGAA AAAGAAAAAT ATTGATTTAC AATTGGTAGA 3420

ATTTACAGAC TATACGCAAC CAAACGCAGC ATTAGCAGAA AAAGAAATTG ATTTAAATGC 3480

CTTTCAGCAT CAAATCTTTT TAGACAATTA CAATAAAGAG CATGGAACGA AATTAGTATC 3540

aATTGGCaAT ACAGTCAATG CaCCATTGGG AATTTACGCT AATAAATTGA AAGATATCAC 3600 

GAAAATTAAA GACGGCGGAG AAATTGCTAT TCCTAATGAC CCAACGAATG GCGGGCGGGC 3660 

GTTAATTTTA TTACAAACTG CAGGACTGAT AAAAGTAGAT CCTGCGAAAC AGCAACTACC 3720 

GACTGTCAGT GATATTACTG AAAATAAACG CCAATTGAAA ATAACTGAAT TAGATGcTAC 3780 

GCAAACAGCG CGCGCTTTAC AAGATGTCGA TGCTTCAGTG ATTAATAGCG GCATGGCTGT 3840

CGATGCTGGG TATACACCAG ATAAAGATGC TATTTTCTTA GAACCTGTAA ACGAAAAAGC 3900 
GAAACCTTAT GTGAACATTG TCGTGGCCCG AGAAGAAGAT CAAGAGAATA AACTTTATCA 3960 
AAAAGTTGTA GAAGAATATC AACAAGAAGA AACGAAAAAG GTCATTGCAG AAACATCAAA 4020

AGGCGCCAAT GTTCCAGCCT GGGAAACATT TGGTAAAAAA TAAAGGAGGC ATTTATAATG 4080

AGTACAACAA CGATTCAAAC AATCCAAGAA GCTATTGCTA CAGAAAAGGA ATGGATAATC 4140

CATTTAAGAC GTCATTTTCA TCAATATCCT GAAGCAAGTT TAAAAGAATA TGAAACGATT 4200

AAGCGAATTA AAGAAGAACT ACTAGCCTTA GCTATTCCTT TTGTAGAAGT AGGGGAAACG 4260
GGTGTTTTAG CAACCATTGA AGGAGGTCTT GGCGCTGGCA AAACGATTTT GTTACGTGCA 4320
GATATTGATG CGTTGGAATT GCCAGATGCA ACAGGTGCTG CCTATGCTTC TAAAAATCCA 4380
GGACTCAATC ATGCTTGTGG ACACGATGGT CATGCGGCAG CATTGCTAGG TGCAGCTAAA 4440
GTGCTCAAAA AACATCAGGA TACCTTTTCA GGAACGATTA AACTGGCCTT CCAGCCAGCT 4500
GAAGAAATTG GTGCAGGCGC TCGACAATTT GTGGAAGGAA ATTATTTAGA AGCAATCGAC 4560
CAAGTGTTTG GGaTTCATTT AGATTCCAGT GTGCCGGTCG GAAAATTAGT CGCTACCAAA 4620
GGCGCCACCA ACGCCTCTTG TGATATTTTT AAAAtTGAAG TCAGTGGTCA AAGTAGTCAC 4680
GTCGCCCAAC CACAGAATGG CCGGGATGCy stTTTAGCAG CGGCCaGTAT CGTTGTGGAA 4740
TTACAAAAAA TTGTAGCTCG CGAGATTGAT CCTTTAGATT CTGTCGTAGT AGGAATTGGC 4800
GTTTTACAAG CAGGAACACG CTATAATATT GTAGCAAACC AGGCAACCAT TGAAGGTACT 4860
GTTCGAACAT TTAGTCAGGA AACGCGCCAA TTTGTTTTAC AACGAGTCGA AGAAATTGCC 4920
CATGAAATTG CCCAGTCTAA TCGCACAGAA ATTGCTGATT TTTCTGTTTA TGCGGCAGCG 4980
AATCCATTAA TTAATGAAGA ACAAGCAACC AACCGTGCAC AGCAGGTAGC CAGTGAAATT 5040
GTTGGTTTTG AAAATGTTGT GACCGATCAT CCTAAAAGTT TAGGGGCGGA TGATTTTGCT 5100
GATTACTTAG CGGTAATTCC TGGTATCTAT GGACGGGTCG GTTCACGAAA CCCTGAAAAT 5160
CCTGCTACTC ATTTTGGACA CCATCATGAA CAATTTGATA TTGATGAACG TGCCTTACTT 5220
TTAGCAGCAG AGTACCACGT ACGCTATGCA TTAAATTATT TATCAGAATA ACAGGGATAG 5280
GGAGCCGACA AAATGAT "GT TTGTGGGCTC CTTTCGTTAT AAACCAACGT GGCTTGCTTT 5340
CTTGTCATAC TATGGTAGAA TACAAGCATA AGAAAGGTGA TTCACATGTT TTTTGTTTTA 5400
ACTGATAAAT TAACAAGAAA AGTAGCTCCC GATAGTTTAA GAACAGATGA AGCACTGTTG 5460
TGGAGCCGAG CAATTAATTT CTTCATCGCT TGATAGGACA TTTTTTGACT AGAAAAGCTA 5520
GTCTTATTTG AGTGTACGTA CAAGCGGTTG TCGGGATTTC TGTCAGGATG GACGACGTGC 5580
AGACGTGGTT CTTTTTTTGT TGTCCCAAAA TAATCAGTTA ACAAGAAAGA AGTGACGAAA 5640
ATGAGTATTT TAACTATTGA ACATTTAACG CATCGATTTG GCGAAAAGGT CTTGTATGAA 5700
GAGGCTTCAT TGCAAGTGAA TAAAGGGGAT CATTTAGGCT TAACTGGCCA AAATGGGGTC 5760
GGCAAATCCA CCTTAATTAA AATTTTAACG GGGGAAGTAT TGCCaGACGA AGGAACGATT 5820
CAGTGGCAAA AAAATTGCAA GATTGGGTAC TTGGATCAGC ACGTTTCTGT AGAACAATCA 5880
CTAACGATGG TTGATTTTTT GAAACAAGCC TTTCAAGAAC TTTTTGATAA AGAAGCGAAA 5940
CTGACAAAGC TTTATGAAGA ATACAGTCAA ACGGCTTCGG AAAAACTTTT AGAACAAGCA 6000
GGCAAGTTAC AAACAGATTT AGATGAAAGT AATTTTTACC AAATCGACAC GATTATTCAG 6060
GATTTAGCCA ATGGGTTAGG ACTACAAGCA ATTGGTTTGG ATAAAAAGTT AGGGGAGCTA 6120
AGCGGTGGTC AACGTTCAAA AGTGATTTTA GCAAAATTAT TGTTAGAGGC CCCTGATGTG 6180
TTACTTTTAG ATGAACCCAC CAACTATTTA GATGATACAC ATATTCAATG GCTTGTTCGC 6240
TATCTGAATA ATTTTGAAGG AAGCTTTCTA TTAGTCTCCC ATGATTATCA ATTTTTAAAT 6300
GAAGTGACGA ATTGCATCGC AGACATTGAG TTTGGCAAGT taaccaaata TACTGGAAAT 6360
GTAGAAAAAT CTTTTGCACA AAAAGAGCAG AATAAACAAA CCTATTTGAA ACAGTATCAG 6420
GCCCAACAAG AAAAAATTGA AAAAATGGAA GCCTATATTC GTAAATACAA AGCTGGAAAT 6480
CGAGCAACGA TGGCTAAAAG TCGACAAAAA CAATTGGACC GGTTGGAACG ATTGACTCCG 6540
CCTGGTTCCT TGACTAAGCC AGCGATTGAA TTTCCTTATC AAGGGTTAGT TGCAACGCAA 6600
GCACTAACGA CCCAGAAGTT AGTTGTCGGC TATCGGGAAC CTTTGTTAGA ACCGTTAGAT 6660
TTAATGGTTC ATGTCGGTGA AAAAGTCGCA TTGAAAGGCT TTAATGGGAT TGGCAAATCA 6720
ACATTAATTA AAACGTTGAC GAAAGTGATT CCTTCATTAG ATGGAGAATT TCATTATCCG 6780
CTGAATACAA AAATTGCTTA TTTTACCCAA GACTTAGCGT GGCCTAATGA GCAGTTAACG 6840
CCACTAGACT ATTTATCAGA TCGTTTTCCA GATACAACGA TTAAAGAGCG AAGAAGTCAT 6900
TTGGCTAGAG CAGGTTTACC AGATAAGTTA GCAATGCAGT CGCTAGCCCT GTTAAGTGGT 6960
GGTGAGCAAA CAAAaGTAAA ACTAGCGGAA CTAATGATGC AAACAAGTAA TTTGTTATTT 7020
TTAGATGAGC CAACCAATCA TATTGATGAA GCGGCCAAAA AAAGTTTACA AGAAGCAATT 7080
CACGTTTACC CAGGAACGGT TTTTCTGGTT TCCCATGAAG CAGATTTTTA TGAGGAAATT 7140
GTGGATCGAG TAATTGATAT TGAGGAATTA GTTAAATAAA GAAGAAAGAA CCAAGTTGAG 7200
GGGGAGTCAA CTTGGTTCTT TCTTCTTTAT TATTTATTTA CTCATTAAGC CATCATGGAT 7260
TTTTGTTAAA TTCCAGTTCA TCATGCTGTA GTACGTATCG CCTTCTGTTC CTTCTTTGGC 7320
AAGAGAGTCT GTGAAAAGTG TATCGTAAAT TGGTCGTTTC ACTTCTTTTG AGACCCGTTC 7380
CATACTACGT TTATCGACAC TGGTTTCAAC AAATAACACA GGTGCTTTTG ATTTCTTAAT 7440
GGTATCAATA ATCGTGGTCA TTTGTTCAGG TGTyCCTTGA CTTTCTGTGT TAATTTCCCA 7500
AATATAAGCG GCATTTAAAT CATAAGCTTT GGAGAAATAT TTAAAGGCAC CTTCACTTGT 7560
AACTAATAAT TTTTTATCAT CAGGAATATC AGCAAATTTA GCTTTGGCTT CCTCATGTAG 7620
TTTGCTAAGT TTTTCGGTAT AATTTTTCGC GTTTTCTGTA TAGAAATCTT TATTTTTTGG 7680
ATCTTTTTCT ACTAACACGT CACGAATGTT TTCTACATAT TTAATGCCaT TTTCAATGTC 7740
TAACCAAGCA TGTGGATCTT CTGTTTGTTC tTGACCGGCA CTTGTTAAAT ATTGTGGCGT 7800
AACATTTTTG CTTGTAGAAA AGTAATCTTT ATTCTCAACT TTTTTGGCCG TTTTCATTAA 7860
TTTGTTAAAC CAGCCATTTC CGCCTGTTTC TAAGTTCAAG CCGTTAAAGA ATAAAATGTC 7920
CGCTTCAGAA GCTTTCGCAA TGTCTTCTGG TAACGGTTCA TATTCGTGAG GGTCTGTCCC 7980
AATTGGCACA ATACTATGCA GCTCAATTTT GTCTTGCCCA ACATTTTTCA CTAAATCAGA 8040
TAGGATCGAG TTCGTTGTCA CAATTGCTAA TTTTTCTTTC TTTTCAGCGG CTTGATTCCC 8100
GCAAGCAGCT AACGTTAACC CTGCTAAAAG TGTTAAAAAA AATAAACTAA ATTTTTTCAT 8160
TCGTTGATTC CTCCATTTCT TTCTCACGGT TTACAAATAC TAGGCCTTTC TTTGGTGAGA 8220
AAAAGAAAGC CAATAAAAAG AATAAGGCGG CGGTTAAAAC AATCGTCGCA CCAGAAGCTA 8280
AGTTATAAGA ATAACTGAAG AATAGGCCAA TCACTGAACT TAAAATCCCA AaaGTaGAGG 8340
CTAaGCCAaT CATtGTCGGT AAGTGATTTG TTAaTwGATA gCTGTTGCTG CTGGCGTAAT 8400
CAGCtGGCAT GACTAAAATA GTTCCGACGG TTTGTAAmGA AGAAACAGCG ACTAGTGTTA 8460
GTAAAaACAT CAATGCATAG TGGAAAAATT GAaTATTTAA ACCGTAGGCC TGAGCCATTG 8520
TGGGATCAAA AGAAGTGAGT TGCAACTCTT TATAAAACAA TGCCACAAAA ATCAAAACAA 8580
TCACACCGAC GACACAAGTA ATTAAAATAT CGGTATCTGC GACCGCTAAT ACATTTCCAA 8640
ACAGAATATG ATATAAATCG GTGGAACTTT TTGCAAAAGA GATAAAAATA ATCCCTAAAG 8700
CAAAAAAAGA ACTAAAGACA ACACCAATGG CAGTATCATT TTTTAGTGGA CTTTTTTGGG 8760
TAATAAAGCC GATCGATAAT GCAGCTAATA AGCCAAAAAT AGAAGCGCCA AAAATATAAT 8820
TGAAACCAAA CATmTAAGAA ACAGCGACCC CTGGTAAAAC AGCATGTGAA ATCGCATCAC 8880
CCATCAAAGA CATCCCACGT AAAATAATGA ATGAACCAAT TACCCCTGAA ATGAGTCCGA 8940
CAATTATAGA CGTCAAAAGG GCATTTTGTA AAAATTGATA ACGGAATAAA CCATCAATAA 9000
ATGCAGCAAT CATTTTGCAA CCCCCTGTAT TAATAAATCA CCTAACAAAT CACCGTATGC 9060
TTCTTGGAGC GTTTCTGCAG TAAATG 9086



(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 



TGCGCTTGCA ATAGTGCAAT GTACTAATCG TTTATTTTCC ATGTTCATCT GATAATCAGC 60
CCCCTAACAT TTTGGACACG CTTTCATTAT TACATTAAAG GGGCGTTCCA AAATAAATCT 120
TTTATTTTTG GAACGtCCAT CTCTAAATCA TTCTCTTAAA AACTGTAATA GCTTACGACG 180
ACAAGTCAGA ATTATTCATC CTCATCATCT ACTGCAACAT TTAGCAGCAA TTCGTTGACT 240
TTCACAATAC TTTCTGCTGA TTCCTTAACA TAGTCACCTT CGATCGCTAA TAAACTTGCA 300
TTTACATAGC TGATAACCAT CGGTAAATTC ATCCCAGTAT ATAAGTCAAA CGAAGCACCT 360
TGCATAATTT TTTTGGATAA AATATTACAA GGTGTCCCCC CTAGTAGATC AGCAAAGACA 420
ACAAAATCTT CTAGCGATTC CGTAATCGCT TCAAATTTTT CTAAAAAGTC AGCTTCACCT 480
tCTTCTGGTA ATAAAGCAAC AGTGGnAATT GTTTCTTGCG GTCCCATAAT CATCTCCGCA 540
CTTTTTTTCA ATTCCTCGCA AAAACGGCCA TGACTTACTA AAACTAAATG TTCACTCATC 600
GATTTTCyCC TGTCTTACTT ATTTTGAAwT AACACCTAAC GCAGACAaTG CGACACAGAC 660
AaTTAATACG ATAAAAATGG CTTTAGTAGA CGTCATATTC TTTTTACCTA GCATCCAATA 720
AACGCCACCG ACAATTCCTG CAGGAAGTAG ACGTGGTAAA ATCATATCCA AGTTATTTTG 780
CATGTTAAGT GTTACATCCC CAATACTTGG TGCCCATGAT AGTTTGACAT TAACCATCGT 840
TGCCACTAAA GCGCCAACCA TGAAGACCCC TAGTAAAGTT GCAGCGTCTG TTAAGGCAGT 900
TAAGCGGTGT TGCATAGTTG TTACCAAAGA AATTCCTTCG CGATACGCAA ATTCCAATTG 960
TTTCCAACGG AAGACCATCA CGGCAATTTG AGCAACAATC CAGATAAAGA TACCAGTTGG 1020
GTTACCATTA ATCGCCATGT TGGCaGCTAG CGCCCCAAAA ATAGTTGGAA TTAAAGCTGC 1080
GAAAATTGAA TCCCCAATsG cTGcAAACGG TCCCATTAAA CCAGTTTTCa AACCTGaAAC 1140
TGTTTGtTTT CCAGCAATGc CTTCTTTTTC TTCaATGGcC AAATCGATCC CTGTAAtAAT 1200
TGTATTAAAG AAGTTTGAAG TATTGAAAAA TTGCGTGTGT GTTTTCATTA CTTCTTTTAA 1260
TTCAGGCGTA TCATCCCCAT AAATTTTACG CAATTGCGGC AAAATCGTAT ACAAGTAACC 1320
TGATCCTTGC ATTCTTTCGT AGTTCCAACC TAATTGGAAA GTAAACAAGC TTCTGCGATT 1380
AATTTGTTTA AAATCTTCTT TCGTCAATTT ATAATTAGAT TTCGTCATCT TCAATCTCTC 1440
CTTCGTCTGA ATCATTCGCT GCATGTTCTT TTTTGGCTGC TGGCCCACTC GGAATCATTT 1500
GACCATTTTT GTAGCTAATT GCGGCTAAAG CGAAACCAAT TAAAGCGACT GCTAACATTG 1560
GTAGTGCGTT AAATACACCA CTGAAGTCTT TCACAACGCT CGCAACAGAT GTTCCTAAAA 1620
GTTGCATGTT TGTAAAGATT GTTCCTAGTA AAGCAGTTAC AGTAAAACCA AGAATTAAAT 1680
AAGGGAAATG TTTTTTCACT GGTAAGTAAC GTAGTAAAAT TGCAAAACCG ACCGCTGGTA 1740
AGACCGCCCC AGCAACAGAC AAGCCATCAC CTAACCATTT CAAATCGCCA TTTAGGACAG 1800
ATACAACTTT TTGTACTAAA CCACCACCAA AAGCAAGTGC TAAGAAGACT GGAATCATTC 1860
GAGAAAGAGA CCAAGGTAAG GCACCCATTA AGAAGTTACG TTCAATTCCT TTGTAGTTCA 1920
TATCTTCGAC CATTTTATCA ATCCGATGTG CAAAGTAGGT ATTCGCAAAA CGAGCCAAAA 1980
TGTCTAATTG AATCATTAAA CTAGCAACTG GTACTGCAAT AGCAGCGATT GCTTGTTCAG 2040
GGTTCATTCC TAAAGACACA GAAAATGCTG TGGCAAGAAT CGTGCCTGAG TTAGCATCAA 2100
TTTTGGAGGC ACCCCCGAAA GTCCCAACTC CTAATACAGT TAATTGCATG CTTCCACCAA 2160
TAATAAGTCC TGCTTTAATA TCCCCCATCA CTAATCCTGC AAAAAGACCT GCAAACACTG 2220
GCGCACTTAA TGAAGAATAA ATTTGTAATT CATCTAAAAT TTGGTATCCT GCGTATAATG 2280
TTAATAATAA GATCTGCCAC CATAAGATAT TCATTTTTTG TTCCCTCCTA TTTTGTTATC 2340
AAACTCATAA AGTCTTCCGC TTTGTCACTT GGCACCATTT GAGCAATCAG ATGGACACCT 2400
TTAGCATTTA ACTCTTTAAA AACCTCAATA TCTTGATCGA CCACATTGAT TGACTTAGTA 2460
ATCGACCGTG TTTCGTTTGT TTGAGACATG TTCCCAACAT TGATTTCTTT AATCGGCACC 2520
CCAAGTTCTA CTAGTTTTAA TAAACGATCT GGTTTCCGTA CCACAATTAA TAAACGTTGT 2580
GAGTCGTATT TTCCTGCTAA AATATTTTGA GCAGCTTTTT CAACTGGTAA CACGCTTAGT 2640
TTCACGCCAG CTGGTGTTGC TAATTTCAAG CCACTTTTTT CAATGGCATT CCCTGCAACT 2700
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TCATCATCAA CGACCATAAT TCTGGAAATA TTCAATTTTG TCGCCCATAA ATTTGCCACT 2760 

YGTCCATGAA TCAAACGTCC ATCAATTCTT ACGCCAATAA TACTCATTGT AATCCCTCTT 2820 

TCATTTCTTT ATATAGTGGT TCTTTTACTA ACTGAATTTC TGGTTGGTAC GTTCCTTCCG 2880 

TTTCAAAAAT GACGATCTCA TTTTGCCCTT CTTTTAATAA TCCTTTGGGA ATATATAAGG 2940 

AAAGCGTCGG ACCAACATTC CAGAAACGAC CAAGATTGGT TTGGTTAACA AAAACAATTC 3000 

CTTTGCCAAA TTTAGAAACA TCAATAAATG TATCTTTCAC TTCTGCTAAC TCCACATGAT 3060 

ACTGGTAAAA GCTTGGTTGA TCAGGCTGCC ATTCTCTTGA ATAGTCTACC TGTTCACAGC 3120 

TTGTCATAGG TAAACAATAT TGTTGCCATT GCGTCATAAA ATGTAAATCT GCCATGACAC 3180 

CCGTTCGAAT CCCTTTTTTC TGAGTATCTG CAAATAGTTT ATGCCCGTAA TTGACACGAC 3240

CCATATTTTC CATTAAAATA TCGATTTGAT TATTCTCTTG CGGCAGTGTA ACGTAAATAT 3300 

CTTCGCCAAT TTCCGTTTGA TATTGCGTTG CTTGATGTAC CTGATTGACA AATAATTGCA 3360 

GACGATCACG ACCGTCAATC ACTCTTAGCT TTTCTTCTGC AGCATCTTTT TCAATGCTTG 3420 

tTCGGTAAAG AAGATAGCCg GTATTTTGTC CTAATTGTTC CATGGTTTGC GGATAAACGC 3480 

TGACCACTGG CTGACTAATC GTTTCTAAAG TCGCAAACAA ACTCACTTTG TTGGTTAAAG 3540 

GAATTGCTGT TTGCGCAAAA GATTCTTTTA CTAATGGTTC AGCTTGTGGT AACGCAGGGT 3600 

ACTCTTCGTG AAGCATTTTT TGTAACGCAA AATACTTTTC GGTTGGATTC CCTTGTTCAT 3660 

CAAGTGGCGC ATCATAATCA TAGGAAGTGA TTTGTGGTAA ATCAATGGTT CCTCGTGCGG 3720 

AGCAGCCATT CATAAACCCA AAATTGGTGC CGCCATGAAA CATATATAAG TTTATACTGC 3780 

CTAATGCTAA TGCTTCTCTA ACTGATTCAG CCAATTCTTG CGGATCTCGT TTAATAATTG 3840

GCTCTTTCCA ACGATTAAAC CAGCCATCCC AAAATTCCAT ACACATTAGC GGCCATTTTT 3900 

TACCATGTTC TTCAAAAAAG GCTTGCATCA TACCGAAATT TTCTTTGGCT TTTGAGCCAA 3960 

AATTTCCGGT TACTAAAATA TCATCTTCAA TCATGCTGCC TGCCCGCAAT GTTGCGCGCC 4020

ATGGCCCGTC AGAAGTAAAA AATGGTGCTG TCACGCCGCG TGCAATCATC AAATCGCGAA 4080

TCGCCCGTAA ATACGCTTTT TCTTCGCCAA ATGAACCATA TTCGTTTTCG ATTTGAATCA 4140 

TTAAAATATT GCCACCGTTA GCAAGTTGAT GCGGAACAAT TTTTTCCATT AAAACATCAT 4200

AATATTCCGC AACATGCTTC AGATAGGTTG GATTATTCGA ACGCATCCGT CCTGGTTCAT 4260

TTAATAGCCA GGCTGGAAAA CCGCCAAACT CCCATTCTGC ACAAATATAA GGAGACGGCC 4320

GCACAATCGC ATATAGTCCA AGTTCTTGTG CTAATTTTAA AAAGCGCTCC AAATCTAATA 4380

TTCCCTCAAA ATGAAATGTT CCTTTTTGCG GTTCATGTAA ATTCCATGGT ACATATGTCT 4440

CAACTGTATT GAAGCCTAAA GCTTTCAAGT TATAAAGGGA ATGATGCCAA TCTGAGGGAT 4500

CTACACGAAA ATAATGAATG GCGCCTGATA AAATTTTAAA TGATTGACCA TTTAAAAGAA 4560

ACTCCTCTTT AATTTCAAAT CTGTCCACAT TGTCACCCTT TCTAGTAAAC GCTTTCAATT 4620

GCTGGATAAT TGCTAAATAA AGAAAGCTCT TTATTTAGCA ATTTCGACTA CTTGGATGTT 4680
CATAAGATGT TTCTTCACGT TCTCTACGTC TACATGCCCA GTCATTCTTT CTTGGGCATT 4740
AGCCATTCCA GCCGCCATGC CCCATTTTAG CAGTTCGGCA GCTGGGGCAT CCTTGGCTAG 4800
TCCATATGCA AGACCAGCAA TGGTTGCATC TCCTGAACCA ACAGGATTTT TwGcTtGAaT 4860
CGTTGGAATT TTTACCACGA TAGAATTGAT CGTGATGTTT GGCAATTGCA CCATCTTTTC 4920
CTAAAGATAT GACAATCCAT TCAATTCCCG CAAACATTGG TTTTGTTAAA GCTGTTTGTA 4980
CAGCTGCTAA TGGATTTTCT GAAAAATCTT GCCCTAGCAA TCCTTCTAGT TCCTCTAAAT 5040
TAGGTTTAAT TAAATACGGT TTCCATGGTC CTTGAAGAAC TTGCCTTAAA CTATCACCAG 5100
AAGTATCTAG TAGCACTTTA ACCTCTTGTG CGTGAnCTTT TTGAACTAAT TCTTGATAAA 5160
AATCTGATGG TAATCCTTTC GCTAAACTGC CAGAAATTGT GACAATTTCT GCTTGCTTAA 5220
TTAATTGATC GAAATTTTCA AGGAAGTTAC TGATTTCTTC TGGGGAAACA GTTGGTCCAG 5280
CTTCTAAAAT TTCAGTTTGA TTCCCCTCGT GCAAAATCGC AATCGAATCA CGTGTTTCTT 5340
CTTTGATTGA CGTAAATGCT TGGGGGATGT TTGCTTTCTT TAACTCATTT GCAATAAACG 5400
CCCCATGAAA ACCACCAAGA ACGCCGGTAG CGATAACATC TCCACCTAAA TCATGAATAA 5460
CGCGAGTGAC ATTTAGTCCT TTGCCACCAG GGGTTTTGGT CACCTGACTG GTCCGATTAA 5520
CGGTATCAAG CTTTAGATGA TCTAATAGGT AGGAGATGTC AATTGACGGA TTCATTGTTA 5580
CTGTTACTAT CACAGCTATG CCTCCCAATT ACCCTTCGTT TACTTGAACT TTCGCATGCC 5640
AAGAGCTGGC GGTTGCTGCG ATGACTTCAT TTAAACTTTC AATGTTTTCT CGGCCTTCTG 5700
TTTTTAACCA ATCACAAGCC GCAGTTTCTC CTGCTTCAAC GAAAGGTTTC ACGCCATTTT 5760
TCCACGTAGC ACGGCCACAT AAAACACCAT TAAATGTTGA ACCAGCTTCT TTAGCGAAAC 5820
GTAGTGTTTC TTGGAATAGC TCTGTTGAGA CACCGGCACT TAAAAAGATA AATGGTAAAT 5880
CAGTGGCTTG ACTTTGTTCT AAGAAATAAT TCGCCGCTTC CTCTTTTGTA TACGCTGTTT 5940
CAGCCGGCGC AAAACCTTCT ACAAAATTCA TATTAACTGG AACTTCCACT TTCAATACAT 6000
CTACTTTATA TTGGGGTTTA GAAAATTCTT TCATCATTTC ATTCACTTTA TGCGGTTTAA 6060
CTTTTGCATA TTCTAAAGAA GTCGCATCTG CTATTTGTGC ATCATAAGAA ACTAATTCTA 6120
AGTAAAATGG TAGGTCTTCT TCTGCACATT CACTTCCTAA GCGTTCAATG AATACATGTT 6180
TTTGATGATT GATTTCTGGA TCCTCATCCA CATCATAGTA TAGTAAAAAT TTAATCGCAT 6240
CTGCGCCTTC TTCTTTCAAG CGTAAAACAG ACCAATCTGC TAGTAAATCA GGCAATCGTC 6300
CCGGAGTTGT TGCATCATAG CCCGTTTTTT CATAAGCGAG TAACAATCCA GCCTCGGTAT 6360
CGCGTGCTTT TGCAGCAGGC AATCCATATT CTGGaTCCAA TAAAATGGcA GACGCATAAG 6420
GTGTCAATTC TTTCGAGACT AATTCTTTAA ACGTTTCAAT TTGTGCATCA GTTGGCTCTA 6480
CATCTAGCGC TTTGATCATT TTTTTCAAGG CACCTCGTTG ATCAATGGCT AATGCGCTAA 6540
TGATTCCTTC TTGGGTTGAC AAACGATCCA TTGCCGCTTT TTTTCCAGCT GTTAACGTAA 6600
GCATAGTTCC TCCTCCAAAT GTTGTATTAT TTATAGTCAT AAATCGTTAC GCCTTTAACC 6660
ACACGATTCA CTGTACCTGT CGGTGAAGGG GTATCCGGTT TATTTCCAAC TTTAATAGAT 6720

GTTAAAAGTG CGACTGTTTG GGCTACCATG ATATCTGCnA ATGCnAAATA GCCATCAGGT 6780 

AATAGAGCAG TTTCTGCGGa ATATTCAATm GyTGAACCtG AGAAATTACG TTTGCCAGGT 684 0 

TGAGCAATCG CAAAAACACC CGCCGCAATT TCGTCACCGT GAACTTCTTC TAAGATATCC 6900 

AAGTCATAAT CTCTTGTGTA TGGTGTATTA TTaACAAAAC CAAACATTAT TGTTTTTTCA 6960

TTAATAAATG ATTTTGGTCC ATGACGGAAG CCCATAGAAG AGTCAAAAAT TGTGGCAACT 7020 

TTGCCGGCTG TTAGTTCTAA AATTTTCAAT TGCGCTTCTC TTGTCAAACC AGCTAAACTT 708 0 

CCAGAACCAA CATAAACAAT GCGTTCAAAA TCAAGGTTAA CAATTTCATT TAATTCTTCT 714 0 

TCACGGGATA GCACTTCTTC ACCTAATTTA ATTAGAGCGG TTACATAATC TTGTTTTTGC 7200 

TCAAAGGCTG TTTGATCAAA GAGTAGTAAT GTACCCAAAG TCATGCAAGA AAAGCTACCA 7260 

GTCATCGCAA AACCACCATC ATTGGAACGT GTTGGCATTA AGAATAAAAA GCTGTTTTCT 7320 

TCTTTTTGAC TTATTTGTGC TAGTTGTCCT TCTTCGGCAC AAGTAATCGC TAAATGATGA 7380 

ATTGTTTCGA CCACTTGATT AGCTACTTCA ACTGCCGCTA AACTTTCAGG ACTATTTCCA 74 4 0 

CTTCTTGCAA ACGAAACTAA CAATGTTGGT TCGTCTTTAA TCAAATAATC TTCTGGTTTG 7500 

GCCACAATAT CCGTTGTCCC AATACTTTCG AATGAAAAGG CTTGTGTGTC GCCGTGTGCT 7560 

CGGAGATAAG GTACAACCGT ATCACCGACA TATTGCGAAG TCCCTGCTCC TGTAAAAATT 7620 

ACTCTTGTGC GTTTTCCTTG CGCTTTGGCC TGGACTTCTT TTAAAAAGTT TTCCAAAGCT 7680 

GTTTGATTTT CATGATAAAG TGTCACGGTT TCTTGCCAAA GCTCAGGTTG CTGACGAATT 774 0 

TCACGTGTAG TGATTTCCGC TCCTAGTTGT TCTAGTTCTT CTTTTTCTGC TGTAAACATA 7800 

CTTAACTCTC CTAACTATTC CGAATATGGC GTATCTGATA ATGAAATTGa TCTGCTCGCG 78 60 

CCACACTCAA CGTAAATTCA ATAATTTCAT TTTTCATGTT ATATGTCTGG CGTGCTAAAT 7920 

GAAGGACTGG CGCTCCTTCC GGaATCaTCA ATAATTTGGc ATCTTCTTTT GACGCAATGC 7 980 

TAGCGTATAA TTCTTCATCA GCTAAACGAA CGGTTTGCTT AAAATCTTCC GAAAATAGAT 804 0 

CATATAATGG TTTGCTACGT AGTAATTGAT CCGTGAGTGA TAAGAAAAAT TTCACAGGCA 8100 

AGTATGTGTC TTCAACCATT AATGGCTCGT TATCAGCAAT CCGCAACCGA CTTAATTTAA 8160 

AAACTGCCTC TCCTAACGAA AGATTCAAAT GCTGACAAAT AAATTTATCC GCTTCCAATT 8220 

TTTCGAAGGA TAATATACGG GTATGTGGTT TTCGACCCAG CCCTTTCATT TGTTCTGTAA 8280 

AACTATATGC GCCAGCTAAA TCAGCGGCCT CTTTTTTTAT ATCTGAAACA AATGTTCCTT 834 0 

TGCCATGACG ACGATAGATG GAACCTCTAT TTTCTAGTTC TTGTAGCGCT AACCGGACCG 8 400 

TCGTCCGACT CACTCCATAT TGTGCAGTCA ATTCACGTTC AGAGGGCAAC TTGTCATGAG 84 60 

GAATCATGCA TGTTTCGATT CGTTCCTGCA GCAAATCAAC AAGTTGATGA TACAAAGCTT 8520 

TGTTTTTAAA GGCATTTTCC ATAACT 8546
(2) INFORMATION FOR SEQ ID NO: 147: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3493 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

CGACACATTG TGGGACATTG CTGCACGTTT TGGAACGACA GTTGATAATT TAATGTCACT 60 

AAATGGATTA ACATTGGATT CTGTTTTATC GGTGGGTCAA ACCATCCAAA TCGGTTAATT 120 

TTGCAATTAT TTAGTkGAAG AAAGACCTwA CAAAAGCAGT AGTTTTTGTA AGkTCTTTTA 180 

TCGTGAAAAT AGGTAGGGCT AGCAGCAATT ATTTGATGAG CCTAATCCGT TTTTTTATTT 240 

AATTTTTCAC AAACTAAAAT TCGTTTTTCT GGTAGAAATC GTTTTTTTGA GCTATGCTTA 300 

GAGAAAGAAT TATTATATTG TTTGTAAAAT TGGAGGATGT AGGATTATGA CTAAAGGTTA 360

TGTAAAATCA GTAACAGAGT TAATTGGTCA AAGCCCAGTT GTGAAATTAA AACGGATGGT 420 

ACCAGAAGGA GCTGCAGATG TTTTTGTTAA ACTAGAGTTC TTTAATCCTG GTGGCAGTGT 480 

GAAAGATCGA ATTGCTTTAA GCATGATCCA ACAAGCCGAA GCAGATGGTC GCCTAAAACC 540 

TGGACAAACG ATTATTGAGC CGACTAGTGG TAATACAGGC ATTGGGTTAG CAATGGTAGG 600 

TGCTGCGTTG GGATATCCAG TTAAGATTGT GATGCCAGAT ACTATGAGTA TTGAACGCCG 660 

CAAATTAATG CAAGCATATG GGGCTGATCT ATTATTAACC CCTGGTGCCG AAGGAATGAA 720 

AGGGGCAATT GCAAAAGCTA CAGCATTAGC AGAAGAACAC GGGTACTTCA TGCCTTTGCA 780 

ATTTAATAAT CCAGCTAATC CAATGGTACA TGAACAAAAA ACAGGAAAAG AAATTGTTGA 84 0 

TGTCTTTGGT AAACGTGGCT TAGATGCGTT TGTTTCTGGT GTTGGCACTG GAGGAACCGT 900 

TACAGGAGTT GGCCATGAAT TGAAACGGAT TTTTCCAGAT ATTGAAATTG TTGCAGTAGA 960 

ACCAACAGAG TCGCCTGTTT TAGAAGGTGG CGAACCAGGT CCACATAAAA TCCAAGGAAT 1020 

AGGCGCTGGT TTTGTCCCAG AAGTTTTAGA CACCACCGTT TATCAAAAAG TTGCCGCTGT 1080 

TTCTAGTGAA GACGCATTAG AAACTGCTCG TTTAATGGGG CCAAAAGAAG GTATCCTTGT 1140 

GGGGATTTCA GCAGGGGCAG CAATTAAAGC TGCCATTGAT TTGGCAGTCG AATTAGGCGC 1200 

AGGCAAACGT GTCTTAGCGC TGGTTCCGGA TAACGGTGAA CGTTATCTTT CGACAGCTCT 1260 

TTACGAATTT CCAGAATAAA AAACACGCCA AAAAACGAGC ATTTCTCTTA GTTTAGTAAA 1320 

ATGCTCGTTT TTTCCACGAG AATCCTACAA TAATTCTTGT CAATCGTAAT AAGAAATAGT 1380 

ATTATTTAAG AAAGATAGAC AGGGGAGGGA TACAAAGATG GACAACGTAT TGGTTAAAAA 14 4 0 

TGCACTTGCT GAATTAAAAG AAGCCAATAT CCGAATTACT CCGCAACGTT ATGCTATCTT 1500 

GGAATATTTA ATCGAAAATC ATACACACCC AACAGCTGAT GAAATTTATC GCGCACTAGA 1560

AGATCATTTT CCAAATATGA GTGTAGCAAC GGTTTACAAC AATCTACGTT TATTTACTGa 1620 

AATCGGTTTC GTTCAAGaAA TGAGTTATGG CGATGcATCT AGTCGTTTTG ATTTTAGTTC 1680 
GAAGAAACAT


TATCACGTGA 


TTTGTCAAAA 


ATGTGGTAAA 


AlCu i i oAl 1 


T t r* a t t a t pp 


1740 


AGGGTTAGAG 


GACGTTGAAA 


TGGCCGCTAG 


m tv tv TV TT 7\ J\ 7A 

TAAA1 lAnLM 


Ijov^ 1 1 ^ortrVrl 


TTAATflAACA 


1800 


TCGTTTAGAA 


TTATATGGAT 


TATGTCCAGA 


T T GC C AAC AA 




Aft: A ATPTftTA 


I860 


AAAAATTACA 


TACTGTTAAA 


AATTTATCAA 


AAGATAAG I C 


AC 1 1 G 1 C 1 Au 


rp ft ft ft TTTP A. 
uCRnnl 1 l^A 


1 


CAAGCGTAAG 


AGTTTTCACT 


CTTATGCTTG 


TTTTTTTGTG 


1GG 1 1 Ai iGA 


/"* rp <t* rp rp ji ft 7\ rprp 

till 1 AAA1 1 




CTTAAAAACA 


AAAAATTAAC 


TAGCTATTAT 


TAAGAATTAG 


CAACAGT lib 


1GAAI 1 1 1 J. 1 


ZU H U 


TATATGCGAT 


TCACAAGCAT 


AGAACTCGAA 


TAAGCTTGAT 


m ^> Ti ^ 7A 

TCCTCTGATA 


rp tv rnmmmji j\ Tv /"» 

TATT i 1 AAAC 


ZiUU 


AGACAGAGGG 


AGACGTCACC 


TCTGCAAAAC 


AAAGATTGTG 


TV TV TV T\ TV /"* T\ 

AAAGAAGGGA 


T 1 1 AC AAC A 1 


o i fid 


GAAAGTCGTA 


GTCGTAGGAT 


GTACCCATGC 


TGGAACTTCA 


/"»/*» TV Ifl m TV TV A TV 

GCAGTTAAAA 


GTATT1 I AGC 




AAATCACCCG 


GAGGCGGAAG 


TCACTGTATA 


TGAAAGAAAT 


GACAATATTT 


CcTTCCTAIC 




TTGCGGAATT 


GCCTTGTACG 


TTGGAGGCGT 


AGTTAAAAAT 


GCTGCTGATT 


TATTTTATTC 




AAATCCAGAA 


GAACTGGCTT 


CTTTAGGAGC 


AACAGTGAAG 


ATGGAACACA 


tv m r-» m m T\ tv TV 

ATGTTGAAGA 


o /i fin 


AATCAACGTG 


GACGACAAAA 


CTGTTACAGC 


GAAAAACTTA 


CAAACGGGCG 


/—» 7\ TV/^fn/^TV/^TV/** 

C AAC T GAG AC 




CGTTTCTTAT 


GATAAATTAG 


TAATGACAAC 


TGGTTCTTGG 


CCAATTATTC 


/"< TV /"•/""' TV TV rprp 

CACCAAriCC 




AGGAATTGAT 


GCTGAAAATA 


TTTTATTATG 


TAAAAACTAT 


TCACAAGCCA 


tv /"*/*■* rp /"* — rp /■> t\ rp 

ACGTGalCAI 




CGAAAAAGCG 


AArGATGCCA 


AACGTGTCGT 


TGTCGTTGGT 


GGCGGTTATA 


rn m /*»/"• rp 71 rp rp /—« tv 

TTGG 1 Al I GA 




ATTAGTAGAA 


GCTTTTGTAG 


AATCAGGTAA 


ACAAGTTACT 


T.TAGTTGACG 


rp rp TV TV r"^ 

GGT T AG ACCG 


9 *7 fifi 


CATTTTAAAT 


AAATATTTAG 


ATAAGCCATT 


CACAGATGTT 


CTAGAAAAAG 


TV TV rfirpT\/" , rprp/ , "*TY 

AAT 1 AG I 1 G A 


9T £fi 

^ / ou 


TCGTGGCGTT 


AATTTAGCTT 


TAGGTGAAAA 


TGTCCAACAA 


TTCGTAGCAG 


ATGAGCAAGG 


9 p 9n 


AaAAGTAGCA AAAGTCATTA CACCAAGTCA AGAATTCGAA GCAgATATGG 


m^"»Tv rp m Tv rp r-» rp 

TGATTATG1G 


Z O 0 u 


TGTTGGTTTC 


CGTCCAAATA 


CAGAATTATT 


AAAAGATAAA 


GTTGACATGT 


TACCAAATGG 




TGCAATCGAA 


GTCAATGAGT 


AT AT GC AAAC 


AAGCAATCCT 


GATATTTTTG 


CTGCGGGTGA 


"}fififi 


TAGTGCAGTT 


GTTCACTACA 


ACCCAAGTCA 


AACAAAAAAT 


TATATCCCAT 


rp tv /— • /~i Tv jv ^> «-« TV TV 

TAGCAACGAA 


jUDU 


TGCTGTTCGT 


CAAGGTATGC 


TAGTAGGTCG 


TAATTTGACT 


GAACAAAAAT 


1 GGC i 1 Al CC 




TGGCACACAG 


GGAACTTCTG 


GTTTATATTT 


ATTCGGTTGG 


AAAATTGGTT 




3180 


GACAAAAGAA 


AGTGCCAAAT 


TAAACGGACT 


AGATGTCGAA 


GCGACTGTTT 


TOGAAGATAA 


3240 


CTATCGTCCA 


GAGTTTATGC 


CAACAACTGA 


AAAAGTTCTG 


ATGGAATTAG 


TrTATGAAAA 


3300 


AGGAACACAA 


CGCATTGTTG 


GTGGTCAATT 


GATGTCTAAA 


TAT GAT AT CA 


CACAATCTGC 


3360 


CAATACTTtA 


TCATTAGCGG 


TCCAAAATAA AATGACgTCG AAgATTaGCG 


ATTTCTGATT 


3420 


TCTTCTTCCA 


ACCACATTTT 


GATCGCCCTg 


GGAcTACTTA AACTTATagc 


CCAAGCCGCT 


3480 


TTAGAAAATA 


• TGT 










3493 


(2) INFORMATION FOR SEQ ID NO: 148: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1299 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 



CTTCTTTGGA GATTAGGACT TTCTCAATAT CTTTTTCTAT CATAGTGCGC TCCTTTCAGC 60

ATTACTCCTT TGTCTTAAAA AGGAGTCTGT AGTGTATTTT ATCAGTTTCT AAAGGAATAC 120

TCAAATAGGA ATTTGCAAAT TTTGGTACCC AAATGATTTC CTCTTGCTCA TCCGTAATGA 180

CCCAAGCACT CTCTCTCGCT TCATTTGGCA CTTTTTGATC AATAAAAACT CTGCTTAATT 240

TTTTCGTAAA CCCTGGTTTT AATGTAATCC GATCACCACT TTGGCGATGA CGAACGGTCA 300

AAGGTGTGGC TGTGGTCAAC GGGATCGCTA ACAACTGACC CGTCCACTGA TTAATCTCTT 360

CAGGAAGAGG AAACGGTTTG TCTGTCGCAA TTAATCCTAG CCACTCGGTT TCCGATAGAT 420

AAATCCCCGC ACCAGGTACT AATAAATGTG TAACTTCTTC TCGCAGTGCA GCCTGTTTTT 480

GTTCTAAGCA TAGCCAATCA TAACTTTTAG CTAGTTGCCA TTGCTGTTCT AACATAACCG 540

AGCCTTGCGG TTGCCGTTCA TCATTCAAAA CTTTTAAAAT TTGTGTCATC TGGCGTTGGT 600

TTAACGAAAC CCCTAATGGC ACTAATGTTT TTTGGaAAAA CGCGGTTAAA AAGAAGTGCT 660

GGTACGCAGG CGTTTCTTTT AATAATCTCT TCAAAGGAAT GCGCCAGCCT TGTTTGGTTG 720

GTTCTACGGC TTCAGATAAT TGAGGCTCAA TTTGTTCTTG AATAAATTCT TGGGCAAAAC 780

GTTGTTCATT ACTAAAAGAA GCAATCTGAT CTAAGAATTG CGGATTTTCC TGCTTTAATA 840

GCGGCAAAAC CTGATTACGG AGCCGATTTC TCAAATATTC ATTGGTTTGG TTCGTTTCGT 900

CCTCAAAATA AACGAATGCT TGGCGCTGTG CAAATTGATA CAATTCTTCT TTTGGATAAA 960

TGAGTAACGG ACGAATCAAT TCCCCTGTCG CAAAAGGACG TCTCTCTTTA ATGCCTGCGG 1020

AATGGGAAAA ATTAGTTCCT CGAATCAGCT TCATTAAAAT CGTTTCCGCT TGATCGTCTA 1080

GATGATGTGC CGTCATTAAG ACCGCCGCAC CTTCAATTTC CATGACTTCT TtAAAAAATt 1140

CATAACGAAA TGTGCGCGCT TTGCTTCTAA ATTtCGTTGC TTTcAgGGTC TcCCAACGTG 1200

TTcATAATAC GTaGcgCTCG CTGCTGACAA TAGGTAGCGA GATATTGGCG CTTCCTGTGC 1260

CGACGCTTCT CGCAATTGAA GGATTGACAT GTGGCAAnA 1299



(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:

TTACTAGTCT GTTGGAATAC ATTATGATAG TGTAAGATAA ACGGACTATT GGAAAGTCAC 60

AGCAGAGGAG TTTGAACATG TACACATTTA AAATCGGTAT TCCGGCAGAA GTACACGATA 120

CATTTGTTAA AAATCACCCA TTATGTAATT TACTACAGTC ATCTTCATGG GCAAAAGTCA 180

AGGATAACTG GGGTTCTGAA ATT6TTGGGG TTTATGAAAA GGATACATTA GTTGCATCAA 240 

GTTTAGTTTT AATCAAACCA TTACCAGCCG GTTTTACGAT GTTATATACA CCAAGAGGTC 300 

CAGTAATGGA CTATACAAAT GAACGTTTGG TCAGCTACTT TATGGCTGAA TTGAAAAAAT 360 

TTGGCAAGAA AAAACGAGCT TTATTTATAA AAATGGATCC AGCGGTTCAT TATCAAGATT 420

TTCATTTGGG AGAAGAACAC CAACCGCATG CTGAAGCGAC ATCTATTATT GAGACCTTGA 480

AAGAAGCAGG TGCCAAATAT CAAGGACTGA CAATGGATAT GGGGGCCACC ATCCAGCCTC 540

GCTTCCAAGC GAATATTTAT CGTGAAGATT TTAGTGAGGA ACAATTGTCA AAAAGTACCA 600 

AGAAAATGAT TAAAACAGCT GAAAAAAAAG GGGTCGTTGT GCAGCAAGGG CATGTTGACT 660

TTGTTGATGA ATTTGAAAAA GTCATTCAAT CAACCATGGA ACGCCAACAT ATTTCTTTGC 720 

GGAATAGTGA TTACTTTAAG AAGCTGTTGA ACATTTACCC AGAAGACTCC TTTATTATGC 780 

TGGCGCAAGT CAATTTAAAG GAACGCTTAG ATAGCACGCG ACAACGCTAT GACAAAAATC 840 

AAAAAGACTT GAGTAATTTG AAAGAAAATC AAGTGAAAAA GCGCCATAAT TTAGAAGAAT 900 

TAGATGCTTC CTTGACTCGG GAACTAAAAG AATTAGAAGA AAATATCGCA GAAGCTGGTG 960 

AAATCGTCAC GGTTGCTGGC GCTTTGGCTG TGACATTTGG TCCAACAAGT GAAATCCTTT 1020 

ATGCTGGCTT AGATGATCGT TATAAACGCT ACATGCCAAC ATATGTGACT TGGCGAGATG 1080 

CCATTCAAGA ATGTTTTAAC CGTGGcTGCG AAAcTGTAAT ATGGGCGGCT TAGAAGGTAG 1140 

TTTAAATGAT GGCyTAATTA AGTTTAAAGC GAACTTCAAC CCmACGATTA ATGAATTTAT 1200 

TGGGGAATTC GATTTACCGG TTAATAAATT ATTATTTAAA GCCAGTGAAT ATGCCTATAA 1260 

ATTACGAAAA CAAAAAAAAT AAATAAAAAA CACAGGGACT AAAAATCAAA CTTGATTTTT 1320 

AGTCCCTGTG TTTTTTTCTA TAGTTTTTGC GTTGCATTCC GTTTCATTAA GCGAACAAAA 1380 

CGCTAGCAAT TGCTGAAGCA GCTACACCAG CAACAAGAGC TTCTGGCACA CCAGAACTGA 1440

AAATCGTCGC AATAATAGCT AGGTAAACAG CAGTTCCAGC TTTACCAATA ATTTCAGCAT 1500 

AAGCTGAATT AAAAAGAAAA TAAATCATGT TCATGACTAA AATGGTATTC GTTGCTGAGC 1560 

CTAAAACACC AGCCACAAAT AAGGAGACAG GTTGCGCTTT TTCCTTCGTT AATTTATTTA 1620 

GCCATTTGTA AGCGAAATAA GGAACGACCC CAATTAAAAT CCGCGGAATG AGCACAATTA 1680 

AAAGTGCCTT CCAACTACCA TGATCGGTAC CGATCACAGG AATAAATGGA GAAAAAACAA 1740

ACGACAAAGG TGTTGGTGCT GTCGTATTTT TAATTAAGCT AATGATCCCA AACGTGCCGC 1800 

CCAACATACT ACCGATTTTT GGCCCTAAAA CAATGGAACC AATAATTACA GGAATATGCA 1860

TCGTGGTCGA ATTTAAAGGA CCTAATGTGA TAAATCCTAG TGGGGTTAAA GCCAGTAAAA 1920 

GTAAGATAGC TAAGAACATT GCTGTTAACG TAAATTTTTT CGTGTTATTC ATAAATTACT 1980 

CCTTGTTATC AGATGCTAAA ATTTTCATCA CTGTTTGAAG AATATCTTCA ACAGTGGCAA 2040 

GTGCGCCACG ACCATAATCG CCACAAGCTA ATAGAGCTTC ACGTGGGATT ATTTCTTGAT 2100

AGCCGACTTC TTTTAAAATG CCAATATTTC GTTGGACAAT GGGGTTTTGG TACATATACG 2160

TATTCATCGC GGGCGCAATA ATTTTTGGTG TTTCAGGATA AAGTGCCAAA GCAACTGTTG 2220

ATAATAAGTC GTCGGCAATC CCATGGGCTA ATTTCCCAAT TGTATTGGCC GAAGCAGGTG 2280

CTACTAAAAA TAAATCCGCT TTTTTAGCTA ACTCAATGTG ATTAATTTTA CTGGGATCAA 2340

TTTCCATCAT GACATCCGTA TGCACTGGAT TCTTAGAAAG TGATTGTAAG GTCAGCGGAG 2400

TGATGaACGC TGTGCTACTT TTAGTCATTA AAATTTCCAC ATTGTAGCCA AGCTTAGCTA 2460

GTTGACTTGT AATATCTGCT GCTTTATATG CAGAAATGCT GCCAGAGACG CCTAATAAAA 2520

TAGTTTTCAT TGTTGTTCTC TCCATTTCTT TTCAACATTT TTAACGATCA TTGAACCAAT 2580

CCCTTGTTTT GAATAAGCTT CTTGGACAAT GCCCGTTTCG TTAATTAATA ATCCTTTGTG 2640

TTCTGTTTCA TGAACGTTCA TCAAGTCATT GGCTAAGACA AAGTCACACC GATTTTTAAC 2700

CAAAGCGGCT TGTGCAACTT GGACAAGTTC TTCTTGTGAA ACGTTGACAA GTAATTTAAA 2760

GCCCACTAAA ACAGTTTGGG GTTGCTTGTC ACGAATCATT TGGATGACTT TAGGATTTTT 2820

CTTTAGAAAA ATTAATAATC GGTCCGTATC AGATGAAATT TTTGTTGCTG TTTGAGGAAT 2880

ATCAGCAATC TGATTCAATG TGTTTTGAAC GATAGTTACC AAGGCTTCTT TGGTCTTTGG 2940

CAATGTTTGT TCGCTTAATT GTTGAGCAAA aGAATCAATA AATTGTTCTT CCGTTTGCGC 3000

GGTTTCCGTC GTAAAATCAC TGACTGCCAT TGCGTGAACA ATCGCATCAA ATTGCTTTTC 3060

AGCAAAGAGT TGTTCTAACG TAGTGGCTAA TTCAACCGTT GTTTCTATTT CTTTAATTGA 3120

TAAGTCCAAT TGTTGGGTTG GACGTAATGC ATGTTTGGTT GTTACGTAGG TTACGGTGTG 3180

TCCAGCAGCT AAAAaACTTT CAGCAATTGC TTTTCCTAGT CCGCCAGAAG AATGATTGGT 3240

AATCGAACGA ACATTATCAA TCGGTTCTGA CGTTCCGCCA GCAGTAACTA AAACATCCAT 3300

AATCAAAACG CCTCCTTAAG AAATTCGGAC AATGGTCCGT CCAATATGTT GGCCTGCTTG 3360

TAATGCTTCA AAAACAGGGG GAAGTTCTTC TAAAGTAATC TCTTGATACA CAGCTTTTTC 3420

AGTGATATCT AAATCAGTGG CAAGACGTTG CCAGACTATT TTTCGTTTGT CCATACCAAC 3480

ATTaACCGAA TCAATGCCCA ATwAATGAAC GCCCCGTAAA ATAAAGGGTA AAACGGTTGT 3540

CTCTAATGTA ATTCCAGCAG CGCGGCCACA AATGGCTGAA CGGCCATCAT AACGAAGTTG 3600

TGGTAAAACA GCAGAAGTAA TTTCCCCACC AGTGGTGTCA ATTGCAAAAG CAAAACGCTG 3660

TTTCATCAAT GCTTTAACAG GAGTTGCTAA GAAATCTTCT ACTAATAAGC ATTCACTAAC 3720

ACCTAACTCT TTTAAAAGCT CAAAAGAATC TGGCTTTCTA GTTAATGCAG TGATTGAAGT 3780

GTAGCCTAAT TGATGcAACA TCGCAwTAGA CaATGTCGCA ACaCCGCCTG TCGCACCAGT 3840

TACAATAATG GGGGCTTCTT TGTCGTCGCG TAAGCCaTCG TCTTCTAAAG CTGTGACAGA 3900

TAATCCGGCC GTAAAGCCAG CAGTTCCGAA GATCATTGCT TCTTTGAGAC TCATTTTTTC 3960

AGGAAGTGGA ACTACCCAAT CGCCAGGAAC ACGCACATAT TCAGAATAAC CACCAGTATG 4020

ACTAACCCCT AATTGATAGC CTGTTACGAT CACTTTATCG CCTTCTTTGA AaCGAGGATC 4080

TTCCGATGTG ACAACAGTGC CACTGACATC AATTCCTGGA ATCATTGGAT AATTGCGAAT 4140

CACACCGCCG TCTTTTTTTG AAGCTAAAGA GTCTTTAAAG TTAACTGAGG AGTAGGCTGC 4200

TTTGATTACT ACAGTGCCTT CTGATAAATC GTCTAAAGTA ATGGTTTCTA GTTGACCAAT 4260

AACAGATGTA TCTTGTTCTC TTAACGTAAA TGCTTGAAAT GAATTCATAT AAAAACTCTC 4320

CTTCCAATTA TGTAAAAAAG CAACTCGTCT TATTTTATCA TATTCTGATT TTTTCGTATC 4380

ATTTTTCTTC TTATAAACAG TAAGTTATAC AGAAAAGAAA GATTGTGTGA CAATTAGACT 4440

AGAAAAATAA AGAAAAATGG TTGAAATTTT TCAAAAAGTT CGTATACTAA AAAAGGTAAA 4500

ACT 4503
(2) INFORMATION FOR SEQ ID NO: 150:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11739 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



GCGTTAGGTC 


AAGTAAGTCT 


TGGmTCGTTT 


TTGATGAcGT 


CTTTTTAtGG 


TGTTTTTmaG 


60 


TAPTmGTTTA 

■L iW*s X it 1 ill 


TPPTATTTTP 

1>jU1 n X X X X V-> 


TATTAPP.PAA 


Z\A ATPZXATPP 


rtrtl X 1 r\ 1 X X x 


Inl Inl 1 on x 




GATTGGCATG 


ATTTTAGGTA 


CTTTTTTCAG 


TAG TAT TAG T 


ACGTTTTTAC 


AAGTGATTAT 


' 180 


GGATCCTAAC 


GAATACGACT 


TACTCCAAGG 


AAAGTTGTTT 


GCCAGTTTTG 


GTAACGTCAA 


240 


TAGTCAGTAT 


CTCATTTTCG 


TTGGTGTGGC 


GGTACTCATA 


ATTTGTGGGA 


TACTTTGGCG 


300 


GGAAAGTTAC 


CGTTTAGACG 


TTTTACATTT 


AGGGAACCAT 


CaAGCACAGA 


ATTTGGGGAT 


360 


TGCTGTGAGT 


CAGTTTCAAT 


TAGTGTTGTT 


GTGTTTGATC 


AGTAGTCTGG 


TCGGCTTATC 


420 


AACAGCATTA 


GTGGGACCGA 


TTACATTTTT 


AGGTTTTATC 


GTGGCCAATA 


TGAGTTATCA 


480 


ATTCATGAAA 


ACCTATCGCC 


ACCGAGAACT 


GTTTGTGGCA 


GGGAGTTTAA 


TAGCAGTCTT 


.540 


TTTACTGGTG 


TTTGGTCAAT 


TGATGGTTGA 


ACAGGTCTTT 


CATTTGAATA 


CCCCGCTCAG 


600 


TGTGGTGATT 


CAATTTGTTG 


GGGGCGTTTA 


TTTTATATGG 


AAAATTATTG 


CAGAAAGGAA 


660 


GCAACGCACA 


TGATCCAGAT 


GAGTGATGTA 


TCAAAAAAAT 


ATGGCGATAA 


ATTTGTCGTT 


720 


TCAGACATTG 


ATTTACCTAT 


TTCTGAAGGC 


AAATTAACGG 


CCTTTATTGG 


ACCAAATGGC 


780 


GCTGGGAAAA 


GCACGCTGCT 


AGCCATGATG 


AGTCGCTTAA 


TTCCCAAAGA 


TACCGGGGAA 


840 


ATTTATCTGG 


ACAAGCAAGA 


AGTAAAAACC 


TGGAAGCAAT 


CGGCTTTTTC 


ACAAAAAATT 


900 


GCCCTTTTAA 


AACAAGCCAA 


CGGTGTACAA 


CTTAAATTAA 


CTGTGCGAGA 


ACTGGTCAAT 


960 


TTTGGTCGTT 


TTCCTTATTC 


CAAAGGTCGC 


TTAAAAAGCG 


CTGACCACGA 


AAAAGTAACA 


1020 


GAAGCACTGG 


AACAATTGGG 


CTTAACTGAA 


ATGGCGGAAG 


AATACATTGA 


TACCCTTTCT 


1080 


GGGGGCCAAT 


TACAACGGGC 


GTATATTGCC 


ATGATTTTAG 


CCCAAGATAC 


CGATTATATT 


1140 


TTATTGGATG 


AGCCCTTAAA 


TAACCTCGAC 


ATGAATTATG 


CGGTTCAAAT 


GAT GC AG ATT 


1200 
CTTAAACGCT TAGTGGATGA ACTAGGAAAA ACGATTCTGA TTGTTCTACA TGATATTAAT 1260

TTTGCCGCAA GCTATGCAGA TGAAATTGTC GCCATGAAAG GCGGAAAGCT CTATGCGCAT 1320

GGCGCAACAG AAGAAGTTAT TCAAACAAGT ATTTTAAATG ATCTTTATGA AATGAATATC 1380

CGGATTTGTG AAATTGAAGG AAAACGCTTT TGTCTTTATC ACTAAATCGG GCAATTTAAA 1440

AATAACGTTT TTGGAGGAAA AGAATGAAAA AGAAATTTTT AGCAATGATG GCAGTTTCAA 1500

TGATGGGACT GTTAATGTTA AGTGCTTGTC AAACAAATAA AAAAACAGCA GATTCTGCAA 1560

CAACAGAAAC AACAGCTAAA ACGGAAGTCA CAGTCAAAGA CACCAATGGT CAATTAACCG 1620

TTCCCAAAAA TCCTAAGAAA GTCGTTGTTT TTGATAATGG TTCCTTGGAT ACAATGGATG 1680

CACTAGGTGT CGGTGACCGC GTGGTAGGTG CGCCAACTAA AAATATCCCT GCGTATTTGA 1740

AAAAATACCA AAAAGTTGAA TCAGCAGGCG GCATTAAAGA ACCAGATTTA GAAAAAATCA 1800

ATCAACTAAA ACCAGACTTA ATTATTATTT CTGGTCGTCA ACAAGATTAT CAAGAACAAT 1860

TAAAAGCCAT TGCGCCAACC ATTTACTTAG CTGTAGATGC CAAAAATCCT TGGGCATCAA 1920

CGAAACAAAA TATCGAAACG TTAGGCACTA TTTTTGATAA AGAAGAGGTA GCTAAAGAAA 1980

AAATAACTGG CTTAGAAAAA GAAATTGCTG ACGTGAAAAA ACAAGCAGAA GCTAGCGCGA 2040

ATAATGCGCT TGTTGTGTTA GTTAACGAAG GACAACTTTC CGCTTACGGA AAAGGCTCTC 2100

GTTTCGGTTT AATTCATGAT ACATTTGGCT TCAAAGCAGC AGACGATAAG ATTGAAGCTT 2160

CCACTCATGG GCAAAGTGTT TCTTACGAAT ATGTTTTAGA AAAAAATCCT GGGATTCTCT 2220

TTGTGGTAGA TCGCACCAAA GCAATTGGTG GCGACGATTC AAAAGATAAC GTCGCTGCAA 2280

ACGAATTGAT TCAAAAAACC GATGCTGGTA AAAATGATAA AGTCATTATG CTTCAACCAG 2340

ATGTTTGGTA TCTAAGCGGT GGTGGATTAG AATCAATGCA TTTGATGATA GAAGATGTTA 2400

AAAAAGGATT AGAGTAAAAA AArGCCTAGG ACAAAAATCA CTTTGGATTT TTGTCCTAGG 2460

CTTAAAAGCT GATAAACGGC GGGAACAAAA GCAACTCCTT CGGAAATAAG CTGAAATTCT 2520

CCAAAAATTA AAGAACAATT TTCGGAAATT TCTTCTTATT TCTCGGAGCT AAACACTTCT 2580

GTCCCGACCT TCTTGATGAA AAACAAAAAA GTTTTAAATT AAAAATGTGG TTGACGGCCT 2640

CGAAAATGTT TGCTAAACTA TCCGTAAATC AGTATGATTG ATGAAACAAA TAAAAGAACA 2700

AAATGTGATG AGAAGGAAAG TACTTTTTAA AATAGCTTGC AGAGAGCCTC GTTTGGTGAG 2760

AAGAGGTAGT GACGTTAAAG GGGAAAATGG CCTTTGAACA GGTTGGTTGA AGCCGTTTGC 2820

TTAAACCAAT TCAGGTGCGC CTGTTATAGC GCTACAGTAT GATTGTACTG GAAACGAGCG 2880

GATCTTTTTT AGCGAAGAGA TCAACAAAGG TGGTACCGCG ATTGCATAAG CATTCGTCCT 2940

TTTTAGGACG AATGCTTTTT TTATTTGTTT AAAAAATATG AGGGGGCAAC AACATGAAAT 3000

TGGGGAAAAA AGTAGTAGGT TTGATTGCAA CAGGGTTTCT TTTAGCCGCA TGTGGCGGAA 3060

CCAAAGAAGC GGCAGAGAAA GTAGATTCGG GAAATTTAGC AGCTGAACAA AAAATCAGTA 3120

TTAGTTCACC TGCACCAATC TCAACATTGG ATACAACACA AACAACAGAT AAAAATACCT 3180

TTACAATGGC ACAACATTTA TTTGAAGGCC TTTATCGGTT TGATGATGAT AGTGCCACGG 3240

TGCCAGCTCT AGCTAAAGAT GTCAAGATTA GTGACGATGG GCGCAAGTAC CACTTTACCT 3300

TGCGGGAGGG GATTAAGTGG AGCAACGGCG AGCCAATCAC GGCCCAAGAT TTTGTTTATT 3360

CTTGGAAAAA ACTGGTGACA CCAGCGACGA TTGGACCGAA TGCCTATTTA CTAGACAGTG 3420

TTAAAAATAG TTTTGAAATA CGCAACGGTG AAAAGTCAGT CGATGAATTA GGGATTTCAG 3480

CCCCGAATGA CAAAGAATTC ATTGTTGAAT TAAAACAGGC CCAACCTTCC TTCTTAGCAG 3540

TCGTTTCGAT TGCTTGGTTA GCGCCACAAA ATCAAAAATT TGTCGAAGCG CAAGGCAAAG 3600

ATTACGCCTT GGATAGTGAA CATTTACTTT ATAGCGGGCC ATTTACGCTA GCCAATTGGG 3660

ATGCGACTTC AGATACTTGG ACATTGAAAA AAAATCCAGA ATACTATGAT GCGGATCAAG 3720

TGAAACTGGA AGAAGTTGCG GTTAGCACAA TCAAAGAAGA TAATACTGGG ATTAACTTAT 3780

ATCAAGTGAA TGAACTAGAC TTAGTTCGCA TTAACGGACA ATATGTTCAA CAATATCAAG 3840

ATGATCCAGG CTATGTCAGT CATCCAGATG TGGCCAACTA CTTCTTAGAT TTCAACAAAA 3900

AAGAAGGAAC GCCATTAGCG AATGTTCATT TACGAAAAGC GATTGGCCAA GCAATTGATA 3960

AAGAAGCCTT AACACAAAGT GTCTTAAACG ATGGGTCAAA ACCCCTTAAC GGATTGATTC 4020

CAAGTAAACT TTATGCGAAT CCAGAAACGG ATGAAGATTT CCGAGCTTAC AGTGGCGAAT 4080

ATTTGAAAAA TGACGTCAAA AAAGCTCAAG CTGAATGGAC GAAAGCCCAA GCGGATGTCG 4140

GTAAAAAAGT GAAACTTTCA TTGCTGgCGG CAGACACAGA TCAAGGAAAA CGAATTGCTG 4200

AATATGTTCA AAGTCAGTTG CAAGAAAATC TGCCAGGTTT AGAAATTACC ATTTCATCGC 4260

AACCAAGTAA TAATGTGAAC CAATCGCGAC GTGAAAAAAA TTATGAGTTG TCTCTTTCAG 4320

GATGGATTGC CGGCAGTAGT GAATTAGACT CTTACTTTAA CTTATATGCA GGAGAATCAA 4380

GTTACAATTA CGGCAATTAT CATAATGCCA AATACGACCA ATTGGTAGAA GAGGCACGAA 4440

CGATTAATGC CAATAATCCA GAGAAACAGT TTGCAGAATA CAAAGAAGCG GAAGACATCT 4500

TGTTGAACCA AGATGCTGCC CAAGTACCGC TGTATCAAAG TGCCTCAAAT TATCTAATCA 4560

ATCCTAAATT GAAAGGCaTT AGTTATCACT TGTATGGGGA TTATTTCCAC TTGCGCAATG 4620

CCTATTTAAC AGAATGATTC AGGAAGAAGG AGTAAAAAAT GGAAAAATTA GTCGaACGAT 4680

TTTTAACTTA CGTGAAAGTT AACACCCGTT CAGATGCCAA CAGTCAAACC ACACCGACGA 4740

CGGTCGGTCA AGTGGTTTTA GCCAAAATGA TTGAAACAGA ACTTCATGAA TTAGGCTTGG 4800

CCGATGTGCA TTATAACGAA CAAAACGGCT TTTTAACGGC ACGGTTGCCT GGTAACCAAC 4860

CTGCAGCGAA GAGTATCGGC TTGATTGCAC ATCTGGATAC GGCGGATTTT TCAGCAGAAA 4920

ACATCCGTCC CCAAGTCATC ACTAATTATG ATGGTCAAGA TGTCCTCTTG AATCAAGAAC 4980

AAGGCATCGT TTTATCAGTT GCTGAATTTC CCAATTTAAA AGAGTATCAA GGGGAAACGT 5040

TGATTACTAC AGACGGAACC ACACTTTTAG GAGCAGATGA CAAAGCGGGA ATTGTCGAGA 5100

TATTAGCTGC TGTTGAGTAT TTTTTGGCAC ATCCAGAAGT AGCGCGGGGG GATGTTTGGC 5160
TCGCTTCGGT


CCAGATGAAG 


AGATTGGACG 


CGGGGCCGAT 


CAATTTGATG 


CGCCGAACTT 


5220 


CCCAGTCGCT 


TTTGCGTATA 


CGATTGACAG 


TGGCCGCGTG 


GGGCATTTCG 


AGTATGAAAC 


5280 


GTTTAACGCT 


GCCCAAGCTG 


TGATAACAAT 


TGAGGGAACC 


AGTGTCCATC 


CGGGAACGGC 


5340 


CTATGGTTCG 


TTAGTAAATG 


CCATTAAATT 


AGGCGAACAA 


CTTGATCAAT 


CACTACCGCA 


5400 


AAAGGAAGTG 


CCAGAACAAA 


CACGTGGAAA 


TGAAGGATTT 


TATTTACTCA 


ATAAATTTAC 


5460 


AGGAAGTATT 


GAAAAAGCAG 


AACTTGTCTA 


TATCATTCGG 


GAT CAT GAT C 


AAGAAAACTT 


5520 


TCAAGCGAGA 


AAGCAATTTT 


TGGAAAAACA 


AGTCCAACGT 


TTAAATGCGC 


TGGCGGATAA 


5580 


ACCCAGACTA 


ACAATTACTT 


TCCAAGACCA 


ATACTATAAT 


ATGAAGGAAA 


TCATCGAAAA 


5640 


AGATTGGACT 


CCTGTAGAAT 


TGGCCGTCCA 


AGCAATGGAA 


AGTTGTGACA 


TTGAACCAAT 


5700 


CATCACACCA 


TTTCGTGGCG 


GGACAGATGG 


TTCTAAGATT 


TCTTTCATGG 


GCATTCCCAC 


5760 


GCCTAATCTA 


TTTACAGGTG 


GCGAGAATTT 


TCATGGGCAA 


TATGAATTCA 


TTACCGTTGA . 


5820 


ATCAATGGCC 


AAAGCTGTTC 


AGACGATCAT 


TGCCATCATT 


CGCTTAAACG 


CTAATTAAAG 


5880 


AAACACGCAA 


GGGAAATGCC 


GATTTTCCTT 


GCGTGTTTCG 


CTGTTTTTCG 


GTAAAGTATG 


5940 


TAGAAAAGTA 


GGGAGTTTTG 


CTCATGAATT 


ATCCaTTGAT 


TTTTGCAGAA 


AACCAACGAA 


6000 


TCGAAACCGA 


ACGCTTGATT 


TTAAGGCCCG 


TCACATTAGC 


CGATGCAGAG 


GATATGTACG 


6060 


AATATGCTTC 


AGATGAGGAA 


ACGGTCCGCT 


ATGTGTTTTT 


GAAAAATCAA 


ACAATCGCTG 


6120 


AAACAAGACA 


AAACATTGCT 


AAATTTTTTA 


TGGGGGAACC 


TTTAGGAAAA 


TATGGGATTG 


6180 


AAGTGAAAGA 


AACGGGCAAA 


ATGATTGGCA 


CGATTGATTT 


GCGGGTCAAT 


GAAACCAATA 


6240 


ATATTGGAGA 


ATTAGGCTAT 


GTCTTGAACC 


GTACCTTTTG 


GGGCAATGGC 


TACATGCCTG 


6300 


AAGCCGCGAC 


GGCGTTAGTT 


GAACTGGGTT 


TTGCAAAAAT 


GAAACTCATG 


AGAATCTTTG 


6360 


CTTTACATGA 


TCAAGACAAT 


CCAGCTTCTG 


GTCGTGTCAT 


GGAGAAAATT 


GGTTTTACCT 


6420 


ACGAAGGAAC 


ACTACCCAAT 


GCTCGAATTA 


GTAAAGGGAA 


AATTGTCACA 


GATGTTTATC 


6480 


GTGGGATGAC 


ACTGGAAACA 


TGGCAAAATC 


GACAAAAATA 


GAAGCGGAAG 


ACAAGTTAAG 


6540 


AGAGGAATTA 


GAGAAATGTT 


• GATGTTTACA 


GAAAAAGAAT 


TCGCAGCCTT 


TGAAGTTGCT 


6600 


GGATTAGATG 


AACGAATGGC 


GGTTATCCGT 


GCGCAAATTC 


AACCTATTTT 


TCAAGAATTA 


6660 


GACACGTATT 


TTGCCGAACA 


ACTAGCGCCT 


GAACTAGGAA 


CGGAGTTGTT 


TGTCCATATT 


6720 


GCGCAACATC 


GCCGTCGCAC 


CGTGTATCCG 


CCAGAAAATA 


CTTGGTCAGC 


ACTTAGTCCG 


6780 


AATAAGPGPC; 


Hf^TATAAAAT 


GCAGCCGCAT 


TTTCAATTAG 


GCATTTGGGG 


CGACTATGTG 


6840 


TTTATGTGGT 


f GTCATTCAT 


TGATAATCCT 


AAAAATGAAA 


AGCAGATTGC 


GCAACCTTTT 


6900 


TAGAGAATCA 


GCAATTATTC 


CAAGCATTAC 


CAGAAGATAC 


GTATGTyTCC 


TTGGATCATA 


6960 


CGGTTCCTCA 


AATTACTCCG 


CTGCCAGAAA 


CGGATTTAGA 


GAAAGCCTTA 


ACTCGTTTTA 


7020 


GAGATGTAAA 


AAAAGGTGAG 


TTTGAAATTG 


GGCGCATCAT 


TCCCAAAGAT 


AGTGACTTGT 


7080 


GGCAAAACCC 


TGAAAAAGCA 


CGAGCTTATA 


TGTTGGCAAC 


CTATCAACAA 


CTCCTGCCAT 


7140 
TGTATCAATT AGCTGTTGCC CAATAAGAGA AACAGACAGT TATTTTGTGT TTTAGTGTTA 7200

TTTTATCGAA TTATCCAATC GTCAAAAATG AGAaAAGCTC ATAATGTTAG TTTTAATTAC 7260

TAACATTGTG AGCTTTTCTG TAGTTTTTTT GAAAAAAATT AAGGGATGAC TGGCTGATTT 7320

CTATGGAAAA GTACGTTTTG ATAGCCTTTT TTGCTTAAAT AACAAGAGTA GCAACCCTTG 7380

TTTAATAAGG GATTTTCGCG AATAGATGTT GTCAAATGAA AAAAAATGAT GTTTTGAACA 7440

AAGAGTTCTT TGCTATTATA AAGGTATGAA AAAGGTTGTT TACGCATTCT TTGTATAGAA 7500

GAATGTGAAT GAGTTAAAAG GAGGTTTTTA AATGAAAAAA AAGCGTTATT TAATAATTGC 7560

GTGTTTACTA TTTTCCCCTA GTTTTTTTAT AAATGTTGAA GCATCTGAGG GTGGTTCTAG 7620

TTCGGTGGGA ATTGAATTTT ACCAAAATCC GGCAACACCC GCTCCTAAAG ATGCCCCACC 7680

GAAAACAGAT GAGCCAGCTG CGGATCCCAA GGAACCAGCT GGTCCTCTGC AAGGAGATCA 7740

ACGAAgTGGT GGTTCGACAC AGACCACCAC AGCTGGCTCG CAGCTCCCTC GTACAGGAAG 7800

CAAGAGTCAG GCAAACCTGA GCATTCTTGG TCTTGTCTTG ATTGGTCTTG TCGGAATGGT 7860

CCAGAGAAAG AAGGGACGAC ATGAAGCAAA CTAAGTGGCA ACGATTAGCA ACCATTGGCT 7920

TGTGTAGTTC TTTAGTGATT AACGCCTTTT CTGGCGCGAC CGCAGTTGCG GAAACAATCA 7980

CGATTGAAAG TAGTCCGACA GTAGCAAGTA GCGCTAAGGA AGCAACATCA GCAAGTAGCG 8040

CAACGCCAGA AAGTACGGAA AGCAGTCAAG AAACGACTGA AACGAGTCGG GAAGAAGTGA 8100

CACAGGAAAC AGTGGAACAA GAAGATACAA CGGAAATAGC TCAAGAAGAA AATCTGCTGG 8160

AACCATCTGG TATTCAAGCG AAAATTATGC CTAGAGCATT TGTCTGGGCA GCTACAGGAA 8220

CCACGAATAG AACCACGGTT CGTCTTAACG ATGTAGTTGA TTATCGAATG AAACTGGAAA 8280

ATATTACAAC AGATGATACA AATTCAGATA TTGTTGGTCC GATTCGAATT GAATCAAAAT 8340

TGGCAGCTGG AATGACTAGA CCAACGTCAG TTATTATCAA AGTTGGCCCC GAGGAACAAA 8400

ATGTAGTTAT AGGAGAAGGC TCTCATAATG CCAATAGTGG TGGCGAATAT TTTGTTTGGA 8460

CAGAGTCGAC TAGGACGGTC ACTGCATTTA TCAATCGTCT ACACGGGCGA ACTTCTGGAA 8520

ATACTGGGTA TATCAAAACA TTGTATTTTC AGACAAGAAT TACTTCGGGT ACACACAACG 8580

AAAAAAAATA TATCGATTCC ACGATTCGAT TTAATGGATT TGGCGCAGTG AATAGACGAA 8640

ATGAAATGAC GTATGTGGAT AGATTTTCTG GTAACTTAGG TTTGAAAGGG AATTTAAAAT 8700

TTTACGATGA GGATGGAGTG GCTAGTAAGG AACAAATGAG TCTTTCCATA AACCCAGTAT 8760

TATTTAAATC AACCAATTCG AGAGTAGAAG GGACGCTGTT AAGATATCCT TTAACTATGA 8820

CAAATGCTGG TAGTGGGCTT TATAGGTTTG ATACAGGACT AAAAAATATA GGTACTAGTA 8880

ACGGGGGTTC GTGGTTTTTC TTTAGTAACG ATATTGCGAT TAAwTTTGCT ACAGCTACCA 8940

CCAATAGAAT TGATAAAATT GTGTTAGCAC CACCTAAAGC AACACTTCCA GACAGCCGAA 9000

CAGGTGTGAT GACTTCCAGA GCGTACACAT CAACAACTTA TACGAGAAGT AACTATGATA 9060

TTTCaCaTAT AaCTTTACTA AAGCGACTGG TAAGACATGG CGTATAACGA ATCGTAGTTC 9120
TAATAGCGGA


GTAAGCACCA 


ATATTATGTT 


TAGTAACAAT 


CTTGTAGGAA 


CAACAGGACA 


9180 


GTTCAATAAA 


GGGAATTCTG 


GTGCTACTAT 


TAATTATTAT 


ATTTATTACA 


AAAAGCTTTA 


9240 


TGAGAATTTT 


GTTAATAGTA 


ACGGACAAAA 


AATTACACCA 


CCATCAGGAT 


TCACACAAGG 


9300 


AAAGCGAACG 


GTTATTAATA 


GTGAAGCGTA 


CACTTTTAAA 


CAAAGTGGCA 


CCTTGCCAGA 


9360 


TACCTATCAA 


GCAGACGGCA 


AAACCTATAA 


GTTCAAAGGT 


TGGTACAAAG 


GCAAAACCAA 


9420 


GCCAAACACC 


TTGACCACCA 


CCAAAGCACC 


AAGCTATGCG 


GTGACTTATA 


ACAATGATGA 


9480 


TGATTTGACG 


GTGGTGTATG 


AGGAGATTGA 


GTTTTTTGAC 


TTCCCAGCGC 


TGACCTATCA 


9540 


ATTCGGGTTC 


GTGGACGAGT 


CTGGCAAACG 


AGTAGATGCT 


TCAACAATTG 


GGTTGACTTA 


9600 


TGACAAATGG 


CGTGGGGAGT 


TGTTAACAAA 


TAATGTCAAT 


GACTGGAAAA 


CAGTCAGTCT 


9660 


TGAAAAAGGT CAAGTTGCAC CAACCAAAAA CAATTTGAAG GAAATTGCTT ATCCTGCGCA 9720

ATCTCTTGAA ATACTGAGTG ATCGAAGTAC ACAATATAGT GCGGCTAATT TAACCTTTAC 9780

ACTGCCCAAA TACTATGGGA AAATCAGTGT TTATAACAAG AGTGGTACTT TTGATGCAGG 9840

CTACCTGTTT CCAAATATCC AAAATAATAA TAGTTCAGAA CCATCTGTTG AAGGCTTACA 9900

CAAAAATCTT TCTAGCTGGT TTGAACTAAA GAAAAGTGGG AACCAATCGT TTGTTTTTAA 9960

CAAACCGATT CCAGCAGCAC CAATCGATGT TCAGGTGCCT GCTTACTTGC GAGAGATTGT 10020

CTATAATCCC GATCTGATAT CAAATACAGC CGTCTATTAT GTAATAGATA AACCTGTCTA 10080

CTACTATTTA ACCAACCGCA AAGTCACCGA AAACTTCGTA GACACAAGCG GTGCCAAAAT 10140

TACACCACCA ACAGGTTTCA CCCAAGGAAA ACAAACAGCG ATTACCAGTG ACCCATACAC 10200

CTTCAAACAA GCAGGTACCT TACCTGATAC CTACACAACA GGCGGTAAGA CCTACAAGTT 10260

CAAAGGCTGG TACAAAGGGA AAACCAAAAC CGAGCCTTTA ACCACGACGA AAGCACCAAG 10320

TTATGGGGTA ACCTATAATG ATAATGATGA TTTGAATGTG GTGTATGAAG AAGTAACAGC 10380

TATTTATCCA GGGTTCTATG CGCAATTTGT TGATGAACAA GGCAAAGCTT TTACTAATTC 10440

GCTAACTCTT TCTGAAAATT ATACTGAGTT TCTCAGAAAG ACAGGCACTA CTGTTTTTGA 10500

AACGAAGGGC GCCTTTTACC CAATGGTAGG TAAGCGCGAT GCAACAGTGA CAAATAAGTA 10560

TAAGATTGAA ACAACGAGTA GTGTTGCAGT ACCAAATAAT TATTGGACAG CCATGCCTGC 10620

TGACATTGCT AATAGAGGGT ACGCATTAAA TAACTTGGAT ATTACCAATC AAATGAAATA 10680

TGTTGACAAG ATAGAAACTA CTGACACGCA CACTACAATG TACGATTATA CTGGAGTTTC 10740

GGACATTGGG GGCAAACCAA ACAATGCTTC TCAGACAGAA TTAAATCCTA TTGATAAAAC 10800

ATTTGTTAAG CAAACAGATA ACACTTTTGC GATAAAATCA ATTTGGGGAA ATGGaACGTT 10860

TTCGCATATA GTTCCCAAAA TGTTATTGTT AGCAGAAAGT GGCAGTTTTT CAAGATTATT 10920

CGGGTTTGAT GGTACTCCAG ACTACAAGCA AACCGTAAAC TACAAAGTCA CCCGTAAACA 10980

AGTCACCGAA AACTTTGTTG ATGCCAACGG TGCCAAAATC ACCGCACCAA CAGGCTTCAC 11040

CCAAGGCAAC AAAATACCCA TGACTAGCAA CACCTTCAAG TACACAGCCG CAAAAGCCTT 11100

GCCAGCAACT TATACCGCAG GTGGCAAGAC CTATACTTTC CAAGGCTGGT ACAAAGGCAA 11160

AACGAAGCCT AATACTTTAA CAACCAGCAC TACGCCAACA TATAACACAA CTTTTGATGG 11220 

CAATGACGAC ATGACGGCGA TTTATAAGGA AGAAGTGCCC AAAGCAAGTG TCGCTTTAAC 11280 

GCGCACTACC GCTGAAACCG TGACTAGCGG CGGCAATGTC ACCTGGCGCG CCACAATTAC 11340 

CAATACAAGC CAAGCACCAC TAACCACGGC GACCATCAAG AAATCAACCG CTTGGACAAC 11400 

TGGCTTAGCA GCACCAACCG CCATGATTGT CACGCCAGCT GGCGGAACCG CAAAAACAGT 11460

TCCCGTCACA GCGACTACGT GGACCAACGG TGTTAGCTTA GGCACTGACA TTCCCGTAGG 11520 

AAAATCTGCC ACGGTTCAGT TCACTACGAA AGCGACGGGG ACAGCGGGGC AAGTCTTGCG 11580 

CGCTGGGATT ACCACCAGTG GGAATTACAG TGGTGTTTCT ACTTCGGCCA CCGTACGTGT 11640 

GAAAGACAAT GATCAAGCAA TTGTAACACC AACGGCTGAA GGCTTCATCA GTGTTCCAAC 11700 

CTTCAATTTT GGTCAAGTTG GGGTAGCGGG AAGCACGCA 11739
(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4045 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151:



GAATTTCTTC AGTATCCTAA AACAAGAAGT GTATTATGGA AAAATCTATC AAaGTCAAaT 60

GAATTGrTaG AAGCAaTTGA GAACTACATT TATTATTACA ATCATCACCG AATCAAGGAA 120

AAACTTAACT GGAAAAGCCC AGTAGAATTT CGACAATTCA ATCAAAAAAC TGCATAAAAA 180

TAGAGTGGAA AAATCCACTC TATAAAAAGT CTAACTTTTT GGGGTCGCTA CAACAATACG 240

TTTGTTTTTT AATCCTTCAG TTTATAACAT TATAAGAGAA AAGCGAGAAG AATCATCCTC 300

CTTCTCGCTT TCTTGTTACC CTAAACCAAC ACGTTCAAAA ATCGTGTCAA CATTTTTTAA 360

ATGATAATGG TAATCAAAGG CATCATCTAG CTCTTCTTTT GATAAGACAG ATGTAATTTT 420

TTCATCCGCA TCTAGCAACG GACGGAACGC CGTTTGATGA TCCCATGCGT AAGCTGTTTT 480

TGGtTGGACC AGATCATACG CTTCTTCACG TGTCATGCCA TGATCAATCA ATTTTAACAA 540

GACACGTTGG CTGTAAATTA GACCATAGGT CGCATCCATG TTACGTTTCA TATTTTCTGG 600

GAAGACCGTT AAGTTTTTGA CAATATTACC AAAACGATTC AACATGTAAT TTAGTAAAAT 660

CGTCGTATCT GGAATAATAA TCCGTTCAGC AGACGAGTGA GAAATATCTC GTTCATGCCA 720

TAAACTCACA TTTTCAAAGG CAGTAATGAC ATGACCACGA ATCACACGTG CCAAACCAGT 780

CATATTTTCA GAACCAATTG GATTCCGTTT ATGCGGCATT GCCGAAGATC CTTTTTGTCC 840

TTTTGCAAAA AATTCTTCTA CTTCACGTGT TTCTGATTTT TGTAGACCAC GAATTTCTGT 900

TGCAAATTTT TCAATACTTG TCGCAATTAA TGCCATGGcA GAAACATATT CTGCATGTAA 960

ATCACGTGGT AAAACTTGTG TCGAAATCTC TTGAGCACGG ATCCCCAAAT GTTCGCAAAC 1020

ATATTGTTGA A6AAATGGTG rAATATTGGc AAAGGTACCT ACGGCTCCAC TGATTTTACC 1080

CGCTTCGACA CCTTTGGCTG CATGTTCAAA ACGTTCAATG TTGCGTTTCA TTTCAGAATA 1140

CCACAACGCT AATTTCAAAC CAAATGTAGT CGGTTCTGCA TGAACGCCAT GCGTTCTTCC 1200

CATCATGAmC GTATGTTTAT ACTCTTTGGC TTTGGCTCCG ACAATCTCCG TAAATGTTTG 1260

TAAATCTTTC CGTAAAATAT CATTCGCCTG TTTCAATAGA TAACCATAGG CAGTATCCAC 1320

AACATCTGTG CTGGTTAACC CATAGTGTAC CCATTTTCGC TCTTCACCAA GCGTTTCTGA 1380

AACCGCACGA GTAAAAGCGA CAACATCATG ACGTGTTTCT GCTTCAATTT CTAAAATACG 1440

TGCAACATCA AATGATGCAT TTTCTCGGAT TTTTTGTACA TCTTCTTTAG GAATTTCGCC 1500

AAGTTCTGCC CAAGCTTCAT CCGCCAAAAT TTCAACTTCT AGCCATGCCT TGTAGCGGTT 1560

CTTATCTGTC CAAATTGCGC CCATTTCGGG TCTTGTATAA CGATCAATCA TTGTGCCTTA 1620

TCCCCTCTTT ACTCCTCAAT TTTTGAAAAA TAATTACGAC CAAATCTTTG TTTCTTTAAT 1680

TTCTTCCAGT GTTTCATCGA TTTCTTTGGt TAATATAGTA ATATGcCCCA TCGCTCGTTG 1740

CGCTTTTGCT TCwCTGTGGC CATAATAGTG GAAATGCCAA TCTGGATGTT CATTAATAAT 1800

GTCCATCGTT TCATAGGCTT CATTTCCAAG CACATTGACC GTGACTGCTT CAGACAACAG 1860

TTGAATCGTT GGCATTGGCC AATTACATAA ACCACGAATA TGCGTATCAT ATTGGCTAAA 1920

AGAACAAGCA TCTATTGAAA AATTCCCAGC ATCTTCTGGA CGAGGTAAAA TTTTATTTAC 1980

ATAAATCCCG CCTGTTTCTG TATAAAACAT TTCAATTGAA AGGGTGCCGA CTAAACCAAC 2040

TGCTTTAGCA ATCACCTTAG CAATTCGTTG CAACTCACCA CTGAcTtCGT CATCAATTCG 2100

GGCTGGTGCA ATCGTTTCAT GTAGcTTACG CTCTTTTTTA ATTGTTTCCA CAACTGGAAA 2160

TGTCGCAAAT TCGCCACGAC CATTACCAGA AATCAGAACA GATAATTCTT TTTCGTAAGG 2220

AATCCAAGCT TCCAAGACAC AGGTTCCTTC TCTCAGAAGA TTCATGGATG GCGCTAAGTC 2280

TGAAGTACTA TGTAGTACAT AACGACCTTC CCCTCGCGTA TTCTTCAAGA TACACGGATA 2340

ACCAATGCCA TCAATTGCTT CTTGAATATC AGTCGGACTA ACAATCGTTG CATAAGGTGC 2400

AATCACAATA TTATTATCTT CCAAGAAACT TTTTTCCATT AAGCGATCTT GCGTAATCGC 2460

TAAAATGTCT GTTCCTTGCG GTATATTCGC CAAATGTTGA ATCGCTGTTA ACGCATCAAC 2520

ATCTGCATGT TCAAATTCAT ACGTAATCAC TTGGGCACGT TTTGCCAATT TCTCTAACGC 2580

TTCAGGATCA TTATATGCTC CTAATAATTG CCAATCGGCA ACTTGTGCAG CAGGACAATC 2640

AATTTGTGGA TCTAGCACAC CCACTCGAAA CCCCATTTCT CTTGCACTAA GGGCCATCAT 2700

TTTTGCTAAA TGTCCGCCAC CAATGATACC AATCACTTGA CCAGGTAATA GTGGTTTATC 2760

CAAGTTCATC ACAACTTTCC ATCACAGTTT TATCCTTAAA ATCCTTCTGC GGTTTCTCGT 2820

CGCAGTCTAG CCTTTAAGAG TGGCCCTnTT TACATGaTAC mAGGrAAAAG GcTCAACTTC 2880

AAGTGCCAGC AGTGTTTCCA TTAGGTTTTC TCATCAGAAT TTATTAACAT ACTTTAGGaT 2940

t\ t\ t\ t\ r** 7\ r**

AAAACAGCGG 


Tv t\ fry ft\ ta m ta 

TAACTTACTA 


CTCGTCTCTT 


TGTAACCAAG 




TTTTTPTPTT 

1111 1 V— 1 1 1 


3000 


CAAATTGATG 


TV TV TA TV ^"l TV TT% TV TV 

AAAACATTAA 


ATTTTTCTAA ACGAT.GATGA . 


PPTPA A A A A A 


^:tzaTP,aaaga 


3060 


AAAGCCACCT 


GACCTTATCT 


ATCATACTTT 


TTTGAAACTA 


1 1 \_o 1 1 1 1 


TPTP,AGGTTT 

1 1 ijnUU 111 


3120 


CTTTTAAACC 


ATTGAATAAA 


AGATTTAAAA 

nun x x x ruuu* 


TAATTGCGGT 


Art 1 1 oL» J. C 


21 TP APA ATTP 


3180 


CATTACCAGT 


AAACATGCTA 


APAGTTTCTG 


GTACTTTATT 




rrraTB at at 




TAAAGCCTAA 


ACCAAAGCCA 


ATPGAGATGG 


CAATAATTAA 


TAAATTTTTG 


1 UAl 1 l orto i 


JjUU 


AGTCAACTTT 


TGACAACATG 


PGCATTCCTT 


GGACAGCCAC 




STB APP A CP A 


jJOU 


TTCCGCCGCC 


TAAAACCGGC 


TCTGGGATAA 


TTTGGGCCAT 


CbLALCAAx I 


1 1 lublnnin 


J H £. \J 


AGCCTAAAAT 


AATTAAGAAA 


AAAHPAGAAA 


AATAAATTGG 




TTP atpppah; 


34 80 


ATAATTGAAC 


TAACCCGACA 


TTTTGAGAGA 

1111 


AACCTGTATA 


AbuAAALu 1 A 


TTl\ A a A ATPP 


3540 


CCCCTAAAAT 


GACGGCTAAA 


PPTTPTGPAT 
ll^l o^ri i 


GATACCCACG 


Atbi AAA I CA 


TTTTrTP^a A 
iLl 1 ^,lj^^Art 


JUUU 


TTTTCTTCCC 


TGTAATATCT 


CCTAAGGCAA 


AATAAACACC 


TGTTGATTCA 


n/T' = t nPT A a 
A^.t~ a i a^ i am 


JOUV 


CAATtGaAAT 


AATAATCATC 


AACACAATAG 


AAGAAAGATC 


AAACGTTCabl 


1 IvjU^AAAHl 




AAAATGGCTG 


TgGAAAaTGa 


aACCACGTAG 


CTTGACCAAC 


AuoAUAL,A>\a 


TPPAPTA ATP 


3780 


CCAGAAACGC AGCTAAAATT GTGCCACCGA CTAGGCCGAT TAACACGGCA ATTGACTTGA 3840

TAAAGCCACG TCCCCAAACT TGCACAACAA TGATTAAGGC AATTGTTACA AATGCGAGGA 3900

GCAAGTTTTT AGGATCACCA AAACTTTTGT CCGTTGCTAA GCCGCCACCC ATTTTTTCAA 3960

CAGCAACAGG AATTAAGGTC AAACCAATAA CTGTAATGAC TGTGCCGGTG ACTAAAGGCG 4020

GAAATAGACG TTTAATTTTT GAGAA 4045



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

nAnnAATATA CTATAAAGTC CAAAGAAAGC GCTTTAATTC GATGGAGAAG GTGGTTTAGT 60 

GAAAAGATTT TCATTTTTTT TACTAATTTT ACTTGCTTTA ACAGGTTGTA AATCCGGTGA 120 

AAAAGAATTT GATGAAGAAT CTCTTCAAAA TCTAAAGGAA ACGkCACAGT CTTAyTCAGA 180 

AACAGAATTA CAAAATGGTG ACGTTCGTTT AAATGAATAT ATTTCTTTGA AAGGGGAGAT 240

TGTTGAGAGT GACAGTCGTT CCAGTTTAAT AAAAAAAGGT GATCGTTTTA TTTTGAAAAG 300 

TGGTTCTAGT AAATATCAAG TTTyTAATGA GCAAAAGAAA AAATTGAAGA TTGGTGACGA 360 

AGTGACAGTT TACGGAGAAT ATTACGGCTT TTTGAAAGGG ACATTAATTG AAAGTGAGGA 420

GAATCATGAT TCAGCCACGA ATTAGACAGA ATCCTTTTGA AACATACGAA GAAGTTGCAG 480

AAGGATTTGA AGATTTATTA GCGCCACTGG AATTGTTTTT TGATCGAGAG TATCAAGGTC 540




ATCTGGATTT AGGGACACAC GGAACGGTTT ATTCTAAAGG GACACGTGAT GCGGAGGCAT 600 

TTTTACGACC ATTATGGGGA CTAGGTCCTT ATGTAACGCA AAATGAAAGT GAGTATTTGA 660 

ATGACTTTCT AACGGGAATT ATCGAAGGAA CCGATCCTGA GAGTTCATCT TATTGGGGaA 720 

AAACAAAAGA TTATGATCAA TTGATTGTTG AGATGGcAGC ATTAAGTACA TTTCTTTTGt 780 

TAAACAAAGA AAAAACGTGG GACCAATTAa CAAAAGAACA GCAAAACAAT TTACACAGTT 840

GGCTGATACA AGCCAATGAA AACATTATTC CGCCAAATAA TTGGCATTTC TTTAGAGTGT 900 

TGCTGAtTac GtGAAcATgT GAAgCCGTAT cnAAAGAaCA GAtCGCTGTT GATTTGGCTG 960 

TAATnGATCG TTTTTAtGtC GGTAATGGTt GGTATTATGA CGGAGTCGAG ACACAAGTTG 1020 

ATTATTATGT TTCTTtCGCT ATTCATTATT ATAGTTTATT GTATTGTCGT TTTGCCCCGG 1080 

AAGATACCGC AAGAGTGGCT ATTATGAAGG AACGAGCAAC TTTGTTCGCG CAAGAGTTTA 1140 

AATATTGGTT TACTCAGCCA GGAGAAGCGA TTCCCTTTGG ACGTAGCTTA ACGTATCGTT 1200 

TTGCTCAAGT GTCATTTTTT AGTGCTTTAG TTTTTGCAGA TGTTGAAGCA TTGCCGTGGG 1260

GTGAGATTAA AGGTCTAATC AGCCGACATC TTCATCAATG GATGAACAAA GATATTTTTA 1320 

CAACAGATGG ATTATTGTCG GTAGGTTATG ATTATCAAAA TATGGTCTTT GCCGAAGnTA 1380 

TaATGGTCCT GGTTCACCTT ATTGGGCCTT tAAAACATTC ATTTtGTTAG CGGTTCCAAA 1440 

AGATCATCCT TATTGGCAgC AGAAaCACaG CCtATTAGCT TtCCaGAAAA ACACTTACCT 1500

cACCgAAAGT CGGAACTACT ATCAaGTAAA TGATGCAGGG ACACATGGTT TAATGTTTCC 1560 

AGCGGGACAA TTTATTAATT ATCAAGCTCa TGCGCATGAT AAATATAGTA AATTTGTTTA 1620 

TTCTTCCCAT TTTGGTTTCA GTACTATAAA GAGTGATTAT TGGTATTACG AAGGTGCTTA 1680

TGATAATTGC TTGGCTTTAG CTGAAGACGA CCACTATTTC CGCACGAAAG GATTAGATGA 1740 

TCAGTACGAA ATTCTTGATG ATCGTATCAT TCATCAATGG CATCCTTGGT CCGATGTAGC 1800 

AATCAAAACG ACAATTGTAC CATTAGAAGG CCAACATTTA CGGATCCACG AAATTGAGAC 1860

ACAACGTGCA TTAGTGGCTT ATGAAGGTGG CTTCTCAATC CCGCTCTTCG ATGAAAAGGT 1920 

CACTTGTGTT TCTGATCAAA TGGCTGAGGT AAAAAATGCA AAAGGTGTGT CAAAAGTAGA 1980 

AAATATCAaC GGCTTTTCTG AAGCyGCGAT TATTCGAACA GAACCGAATA CAAATTTACT 2040

TTATCCGTTA ACGGAGTTGC CATATTTGAA AGCAAACTTG TCGAAAGGGA AACATCTTTT 2100 

AATTTCATTG GTGACAGGCG TACTACCGAA TGAACAAATT GAGCCAGTTA AAGTTAGGTT 2160 

AAAAGAAAAT CAGCTTTTAG TTGAAGAAAA AGTTGTAATT TTGGGGAATT AGGGGGAGAA 2220 

ATACACAATG ACAGAAAATG GACAGATTGA GTGGTTGCCG AAGCAAATCA GTTTTGCTTT 2280 

AAACAAAGTT AGAGGAAATT TAGAGACATT TAAAGAATTA GTTCCACCCG CGGCAAGTTT 2340

AAATCAAATT TATTATCCTG AAGAAAaTAT TGACTGGACG GCCAGCTTTT GGCCAGGAAT 2400

GTTATTTTTA GCAAAGGAAT TAACAAATTC AACAGAGTTC GATGAGGTGA TTGCCACACA 2460

GATGGCTTCT TTTCAACATC GCTTGGATGA ACAAATTGAG CTGGAAACAC ATGATATAGG 2520 

CTTTTTGTAT ATTTTAACAG CCATTGCTGA TTATCAAGTC AATGGTCATG AGGCCTCAAA 2580

AGAAATGGCC TTGCAAGCTG CGGACTTATT AATGAAACGT TATTCACCAA AAGCCAAGAT 2640

TATTCAAGCT TGGGGGAATT TAGACGATCC AGAGCAACGA GGTAGAATGA TTATTGATTG 2700

TTTAATGAAC TTACCATTAT TATACTTTGC AGCAAAAATG ACTGGCAAGC AAGAGTATTA 2760

TGAAGCTGCC TACAATCATG CGAAACAAAC ACAAAAATAT ATTGTGCGAG AAAATCACAC 2820

TACGTTCCAT ACGTATTYTT TTGATACGGA AACAGGCGAA CTTGTACGGA AAGACACAGC 2880

AAGGATACTC AGATGATTCT TGTTGGGCTC GCGGCCAAGC CTGGGGAATT TACGGCTTTA 2940

CGTTAAGTTA TTTATATACA GGAGATAGTA GTTTTCTGGA AACAGCAAAA AATGTTGCTG 3000

ATTACTTTAT TCmAGAATTA CCAGAGGrCa AAATCTGTTA CTGGGATTTA ATTTTCAATG 3060

AAGGGAGTGA AGAAGAACGT GATAGCTCAT CTGCTGCAAT TGCCGCATGT GGCTTACTAG 3120

AATTATCTCG TCAGCTTCCA TTAAATGATG AAAAACATGG CTATTATGAA AAGGTTGCGC 3180

TAGAACTATT ACAAGCATTA GCGGAAAAGT ATACAACGGT TTTACAACCA GAGTCGAACG 3240

GCCTATTGCT TCATGGTGTC TATGATAAAA AAACAAATAC AGGAGTCGAT GAATGCATGA 3300

TTTGGGGCGA CTACTTTTAT TTGGAAGCGT TAACTCGTTT AGCGAAAAGC TGGTATTCTT 3360

TTTGGTAATT CATGGAGGAA AACTCCTAAA AAATAATAAA TAGCGGAGGT AGTACAAATG 3420

AAATTTTGGA AAAAAGGCTT AACAGCGGCA GCGCTGTTAG CAGTGGCGGC AGTAACTTTA 3480

ACAGCATGTG GTGGTTCAAG TGAAAAGAAA GCAACTGAAA AGAGTGAAGA TGGCAAAACA 3540

AAATTAACAG TAACTACTTG GAATTATGAC ACGACCCCAG AATTTGAGAA ATTATTCAGA 3600

GCTTTTGAAG CG GAAAi-.TCC TGATATCACT ATTGAACCGG TGGACATTGC TTCAGATGAT 3660

TATGACACAA AAGTAACAAC GATGCTTTCA TCAGGAGATA CGACGGATAT TTTAACCATG 3720

AAAAACTTAC TTTCATATTC TAATTAcgCG CTACGCAATC AATTGGTGGA TTTAACCGAT 3780

CACGTTAAAG ATTTAGATAT CGAACCTGCC AAAGCAAGTT ACGAGATGTA TGAAATCGAT 3840

GGTAAAACCT ATGCTCAGCC TTACCGTACA GATTTCTGGG TATTGTATTA CAATAAAAAA 3900

ATGTTTGATG AAGCCGGAAT TGCCTATCCC GATAACTTAA CTTGGGATGA ATATGAAGCG 3960

TTAgCGAAAA AATTATCTAA ACCAGAAGAA CAAGTATATG GTGCCTATCA ACATACTTGG 4020

CGCTCAACCG TTCAAGCGAT TGCTGCTGCT CAAAACAATG CCAATTTGAT TGAACCAAAA 4080

TACAATTATA TGGAAACTTA TTATGATCGC GCATTGAGAA TGCAAAAAGA TCAATCACAA 4140

ATGGATTTTG GAACAGCAAA ATCAACAAAA GTAACGTATC AATCACAATT TGAAAATTCA 4200

AAAGCGGCGA TGATGTACAT GGGTAGCTGG TACATGGGGA CTTTATTAAC AAACATTGAT 4260

GATGGCAAAA CAAATGTCGA ATGGGGGATT GCCGAAATAC CACAACAAGA AAAAGGCAAA 4320

GCAACTACCT TTGGCTCACC GACAAGTTTT GCAATTAATA AAAACAGTAA AAAACAAAAA 4380

GCTGCTCAAA AATTCTTAGA CTTTGCTTCA GGTAAAGAAG GTGCAAAACT TTTAGCAGAA 4440

GTAGGGGTGG TTCCTTCTTA TAAAACAGAT GAAATTGATA AAATCTACTT TGCAAGAAAA 4500

GGAaTGCCTT CAGACGAGTC TCACAAAAAG CCTTTAACCC AGATACAATT AATTTAGAAT 4560

TCCCATCTGA TAAAAACGGT GCCGCAATTG ATAAAGTATT ACAAGAmGmA CATGATTTAA 4620

TTATGGTCGG CGmCGAAAAA CCAAAAGATG GTATTGCTAA CATGGAAAAA CGTGTCAAAG 4680

CAGAGATAGA CTAATTATTA TTTTCTGCAA AAACACAAAA GATTGATGAA AAAAATAAGC 4740

TGAAATGAGT AACCTACAAT GAATTTCAGC TTATTTTTTT AAATAAAGAA TGTAGCTTGA 4800

GTTTGGAAGG AGGGCCTGTT AAGTTCACTT AACAGAATAC AATTATGGAA ATTACTTCAA 4860

AAACAAAAAC AGCAAATAAA TTGAGACGAA AAAATACGTG GACTGCGTTA TCTTTTATCG 4920

CACCAAACTT TATTGGCTTT TTCCTCTTCA CTTTAATCCC GGTTATCTGT TCACTGATTT 4980

TAGCTTTTAT GAGTTGGGAT TCTTTTTCAA CCCCTGAATT TGTcGGAATG AAAAATTTCA 5040

CCAAAATGAT TCATGATGAT ACGTTTTGGA TTTCGTTAAA ACAAACGTTT ATCTATACGA 5100

TTGGTGTTGT TCCGTTAACA TTGATTTGCT CATTAGGACT AGCGATTTTG TTGAATCGTA 5160

AAATTAGAGG GATGAAATTT TTTAGAACGG CTTTTTTCTT TCCTTATGTA ACGTCATTAG 5220

TTGCGATTGC GGTCGTTTGG AATATGCTAT TTCATCCAAC GATGGGACCA ATTAATCAAT 5280

TTTTGAAATT ATTTATTGAA AATCCTCCTG GCTGGACATC TAGTTCTACT TGGGCTTTAC 5340

CAGCAATTAT CATCGTTAGT GTGTGGCGCT TTATGGGGTA TTATATGATT TTGTATTTAG 5400

CTGGTTTACA AAGTGTGCCA AGAGAATTAT ATGAAGCAGC GTCAATGGAT GGCGCGGGTA 5460

AATGGAATCA ATTTTTAAAT GTTACCTTAC CTTCACTACG ACCAACGACC TTTTTCGTGA 5520

CTATCATGCT TGTAATTAAC TGTTTCAAAG TGTTTGACTT AGTTCAAGTT ATGACTGGTG 5580

GTGGACCAGG ACGTGCAACG AATGTGTTAG TTTACGAAAT TTACAATGAA GCCTTTGTCA 5640

AATTTAATTT TGGTTATGCA TCAGCCATTG CGATGGTCTT ATTCATCATC GTTCTAGCGA 5700

TTACTGTGGC CCAATTTAAA TGGAATCAAC ATCAAGAGAA AGTTTAGGAG GAGAAAAGAT 5760

GGAAGCAACT TTTTCAAAAA AAGAAAATAC TCGAAAAGCA AAAACGaAAA AAGCAGAAAA 5820

AGTCACACCT GGTGGGATCC TTGTAATCAT TCTCTTAACT TTACTTGCGT TAATTACTTt 5880

GGTGCCTTTC ATTTGGATGG TTTCAGCATC ATTTAAAnCn AACAATGAAG TTTTCACCAT 5940

TCCAATTCAA TGGATTCCTA AAAGTTGGCA TCCAGAAAAC TATTCCGTTA TTTGGGAACG 6000

CATTCCGTTG TTAACTTTCT TTAAAAATAC ATTATTTTTA AGTATTGTGA TTACGATTAT 6060

TCAACTATTT ACTTCAAGTT TTGCAGCATA TGGCTTTTCA AAAATGAATT TTAGAGGAAG 6120

AGATACACTT TTTATTACGT ACATCGCAAC AATTGCTATT CCTTGGCAAT CGTATATGAT 6180

TCCACAATTT ATTATGATGC GTCAATTAGG TTTGACTGaC ACGTTATGGT CACTTGTTTT 6240

aTTACAAGCA TTTAATGCAT TTGGGGTTTT TTTACTGAAA CAATATTACA GCAGTATTCC 6300

AGATTCACTA TGTGAAAGCG CGCGGATTGA TGGGTTAAGT GAGTGGGGAA TTTATTGGAA 6360

AATTATTTTA CCATTAACTA AACCAGCTTT AGCTaGTCTC ACGATTATTA CTTTkGTGAA 6420

TACATGGAAT GACtATATGG GaCCTTTCAT TTATTTATCT TCTACTGAGA ATAAAACGAT 6480

TCAGTTAGGT TTGAAAATGT TCGTCGGTTT ATTTGATGCT GAATATGCCT
CGCATCAGTG GTTTCTATTT TACCAGTAGG GATTGTCTTT TTAGCAATGC 
TGTTGAAGGT ATTGCTACAT CTGGAATGAA GAATTAGGAG GAAGTTTGAA 
GCAAACCTTT GATAATAATA TCTATATGAA AATTTTTCGC TGGCTTTACA 
CGGTAATCTA GGGCTTCTCT TAGTGAACAT TCCTTTTTTC ATAGCGGTGA 
CATTGATCCA CGAAACCTCC CACTTTTTGT AGTAACGTTG TTGCCGATGG 
GATTGCACTT TTAGGTTTAA TCGATACTTT TAAAGAAGAA AAGGAGCTGG 
AACTTTTTTC CAAAAGTTTC GTCAGTTTGG GTTAAGAGGC TTTCTTATCA 
ATTAGGAAGT AGCATCATCA GTGTGACAGA TATATTCTTT TTTGCCAAAA 
TAAATGGTTT ATTCCATTGA TGGTCCTTCT GCTAATTTTT GGGTTAGCTA 
TGCTTGGTAT TTTCAAGTAC GAAACCCGCA AGCTTCTTTT AAAGATGTGT 
TGTTTACTAT GGACTGCGGA AATGGTATGT CAGCTGTCTA AATGTTTTTT 
AATGTTTGCA ATGATGTTTT TGAAGCCACA GTTTGGGTTT GTTGTAACCC 
CCTAGGGATT ATTTATTTAA ATACTGGGAA ATTGCATGAA AAGCAAAAAA 
ATTTTTACAT AATAAGTAAT CTCGATTTTC AGGAAAATGA ATGAAGGTGG 
TTTGTTCTTG CCTTGTTTAT TAAAGTGGAG GGAGCGAACG AATGAAGAAA 
TTTTAAGTAG CTTTTTGATA GGTGCTATTT TTGTGCTCGT TATTTTTACT 
AAAGTATGAT TAATAACgAa GCraATCCAAG TCgArGAATT AATTGTTwAT 
ATATGAATCG TATA 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1229 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



TAATCATGGC 

AAAAATACTT 

TGTTTACTAA 

TTTTATTTAT 

TTAGCTTAGA 

GGACAGGGAT 

ATCCATTTAA 

GCTTGCTTGG 

CAACGATTGG 

TTATGCTGAA 

TTCGGATTTC 

TACTATTCTC 

CTGTCCTGTT 

AGAATAAATA 

GGTTGCTTAT 

AAAATAATTA 

ACCAGCCAGC 

GGCGATAAAA 



6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7574 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 
TGTnTAATGT TCGTGTTTTA TnTTTTAATA ACCTTTTTCG AAAAAATATA CGAACGTTCG 60

GGTTTAAAAA CGAACAAAAA GATGAAAATG TGTTGAAATA TACGTTTTCC TTGCGTATAC 120

TGATGAAGTG AAAAAACGAA CGAGGAGTGT ACGAATGAAC AAGAAAGTTC CTTATTTAAT 180

TGTGGCACCG GGGCTAGTAC TATTATTATT TTTCTTAATG ATTCCATTAA TTACTAGCAT 240

TTTGCCAACC ATTTTTACTG ATCATGGCTT GACACTAAAT CAGTATGTTA CTTTTTTTAA 300

AGATGACTAT AATGTTTCAA TTTTCTGGCG AACGATAAGA GTTTCGTTAA TTGTGACGGG 360

TATTTCAATC GTGTTGGGCA TTCCAACGGC ATACTTTATT GCTGGAGTTT CCAAAAAATG 420

GCGTGGCTTT TTAATGGCTA TGACCTTGTT TCCATTATTA ACCAATTCAG TTATTCGAAG 480

TTTTGCTTGG ATTAACATTT TAGGAAAAAA CGGTGTAGTG AATACGTTGT TATTAAAAAC 540

AGGCTTAATT GAGCAACCGC TGAATTTATT GTATACAGAA TTTGCAATTA TTATCGGTTC 600

AGTGTATTTA TTTTTACCAA CAATGATTAT GACCTTAGTT GGGGTTATGG AAAACATTGA 660 

AGGAGAAATG CTAGAAGCGG CTGAGACGCT AGGCGCCAAT CCAATGACGG CTTTTCGGAA 720 

AATTGTTTTA CCTTTATCAA TTCCAGGCAC CATTGTAGGA AGTATATTAG TGTTTACAGG 780

AACTTTAACG GCTTATACGA CGCCACAGCT GTTAGGTGGT AACCAAAAAA TGATGATGTC 840 

GACCTTCTTA TATCAAAAGG CTAATACTTT AGGTGATTGG CAATCGGCGA GTGTTCTTGC 900 

ATTTATTATG ATTTTAACGA CATTAATTGT GATGAAAGGC CTAGACATGG TCGCTAAGAA 960 

AGTAGATCGG AGAGAAGCAA ATsATGCGTA AsAAAAAGGA CTATCTGTTA TTGCAATTCT 1020 

AGTTTTT-sTG TTTTtATTTT TACCGTTATG TTTGATTGTG GTGcnTCTTT tGGAaCAGCC 1080 

GCAGCGATTC AATTTCCGAT TAAAGGATTA CTTTAGATTG GTACGCCAAA GCATTACAAT 1140

CGGAAACGTT TATGGACAGT TTAAGTTAAG CTTGTTGAAT GGGGTATAGC GACTGTTTTA 1200

GCCATTAATA GTAGGGATCC GGCCTCCnA 1229 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9749 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

CTTGGTTATC TGGTATGGCC ATCCTCTGTG TGAGCAAAAA TTATGGAGAT CTATTnGTAA 60 

TATTATTTGT GTGACTCCGT ACCCTCTCCT TGCTGTGGCT GATAAGGGCA TCTATGTGGT 120 

TGAAGCACGA AAGrAGGgtT mCGTGTATTg rTaATCCTGT AGcgTATTaA CaATwACCCa 180 

cATTTGaTTa CCaATTATTT AATTTAAACA ATTATCGTGT ACTTTCAACT AGAACTCCaA 240

AAAATAATTT TTCAGATCAA aTAGAGTTAG ATATTTATAG TAGAGGAATG GGTGGTATTG 300 

GGTTGCCAGG AGATTTATCA TCAGTATCTA GATTTGTAAA AGCAACTTTT ACTAAGTTAA 360 

ATTCTGTATC AAGAAGTTCA GAATATGAAA GTATTAGCCA ATTTTTTCAT ATTTTAAGTT 420 

CTGTCGAACA ACAAAAAGGA TTGTGTGATG TTGGTGATGA AAAATATGAG TATACGATTT 480

ATTCTTCATG TTGTAACCTG GAAAAGGGAA TTTATTACTA TCGTACGTAT GACAATAGTC 540

AAATTACTGC TGTGGATATG AATAAGGAAA ATTTAGAGAA GGATAGCTTA ATTGTTTATC 600

CAATGGTGGA AACACAACAA ATTAACTATG CTAATTAATT TGTTGTAATT CTTGTTTTTA 660 

TAAGTATTTT CGAAATGAGT TTAAATAAAC GGGaATAATA GAaGAGTAGA TTTaAGTTAT 720

TGATAACTAC TAGGTAATTG ATGATAAATT AAmCAAATTA GATGCTTTAA GGAAAGATGA 780 

CAAAAAAGTA CAGTATTACA TGAAAAAGTG GATTTTTGAG AAAAAAAGGG TTTAACAATT 840 

AATATCAAAA ACAACTATTC TTATAAAACG AATAGTTGTT TTTTTGTATT CATATTTTCT 900

ATAAAAAAAT GAATAGTAAA CACTTGTAAT TTTTTAATAA GAAgAATAGC TTGTTATAAG 960


AGTAACTACG 


ATTGTCCGCA 


TAATTTTTTt 


PAAPrrTGnTA 


AAAAAAAtAT AGrGkTATAA 




ATGtGaTAGG 


kGtATTAGtA 


tTTATTTAaa 


AGrPtAAAAA 


AtGtGaAGGA GkTTGaATCg 


lUOU 


AAaTGaAAAA 


AGTATGtGCT 


ATTTTAATGA 


1 X lUlUvul^ 


APTTTTAGGA ATGGTGCCCA 


1 1 A Pi 
X 1 4 VJ 


GTAATGTACC 


AATTTTTACA 


GCAGAAATAA 


ATTATPPPPA 


TPAAAATGAA TTACAAATGA 


izuu 


ATCCGTTTGT 


TATAGCGTTT 


GTCGATAAAG 


tpaapppttp 

1 unnOOU X X \* 


TGTTGCAAAG TTTGAAAATC 


1 9 £ ft 


AAACGCGCCC 


TCTCTGAACT 


CCAAATATCT 


TAPPPTPPTA 


TTPPPATTAA CCTTGATTTC 

X X W V# Vh« X X 1*4* X X A i A 


J. VJ 


AGTAGACACC 


GAAAATCCGA 


AGAGCGTTCC 


rtttpttppp 


TTPTTTTTAT ATATTCCTCG 

X X X X X X X ** X 4* X i* X X V/ w x 


1 JO VJ 


AATGGTCTCC 


ATGCCCTTAA 


TCGTGGAAGA 


nf^PTPTAPPP 


APAPTTTPAT AAAATTTATT 


1 <* ^ u 


CCGTCGTTTA 


ATAGGTCGAT 


GGTCTTGTTC 


TATTAAATTP 


TTAAPATAPT TCACAGTTCG 


i t;nn 


GTGCTCTGTC 


TTAGTATATA 


AACCCACACT 


V_ x Vj 1 ill 


PTAAAPPPPP APPAAGaGAA 


1 ElCft 

IjOU 


GGTGCTTTAT 


CGGTCACAAT 


TGCTTGCGTA 


l tLLb X X 1 V_ 1 


TTPPTtt APP A PATATPTAAG 




GTTAAGCCGT 


CCGCATCAAT 


TGCACGATAA 




AAPPTPPPTT AATTTTPATA 


1 CQ ft 

1 bo U 


TAGGTTTCGT 


CCATTTTCCA 


TGAATAGAAG 


/"•TV T 1 T* (*"* T 1 7\ T 1 


TTTTPTTfTT PPAAAPATPA 


1 T A ft 

1 / 4 U 


TAGAGGACTT 


TGCTGTACTC 


TTGCACCCAA 




TAPTATPAPA AAPATTTATT 
1 rtVj Inl unLrt nn^n x x x r\x i 


1 Qftft 

loUU 


CCACGATCAT 


ATAACAATTC 


CTGaACTTCA 


pr^n Ta^rTT A 


PATTPTAAPP P^PPTAGTAA 


looU 


CCAACAGCGA 


CAATAATGAC 


GTCTTTTTTG 


H-rt 1 lul I 1 


PTTTAAAATP ATTCATTACT 


1 QOft 


CTGTCCTCTC 


TGTCTTTTTT 


CTCAATTTTA 


rspTJi A A AT A 


PATTTTTTGG AAAACTTTGC 




AACAGAACCT 


TTTTTTGCAA 


TAGAATTACT 


AATPPPAAAA 


PAAATTAATG CGATAACCAG 




ACCGATAGCA 


ATTTCGTGAA 


AAATAATTGT 


TATTAAPAAT 


AAAACTATAC TAGATATGAA 


91 no 


TAAACTTAAA 


ACATGTATTT 


TATCTTCTTT 


PATAGPATPP 


TCCCGTTATT GTCATACAAT 


9 1 fin 


AGTTGTTTGA 


TTTATATTTT 


AAATTATCAA 


TAAAAPAAAA 


GCTAGGAAAA AAGCATTCAA 




ACGAAAACCC 


AGTTAATACA 


GATACAGGCG 


TTPTPATTPT 

1 lul 1 1 \D X 


CGCAGTTGAA GATAGTAAAC 


99 RO 


CGATTATTCA 


AT T AG C AG AC 


GGAACAACAA 


APAAAPTTPA 


AGCTAAAGAA ATTGGAGCAG 


9 ^4 n 


ACGTTCAAAA 


AGACGGAACA 


GTAACGGTTA 


A APPTTPAPA 


CGGAAGAATG AAAGTTTTAC 


94 nn 


CAAAAACAGG 


AGAAACAGAA 


AATATTGCAT 


TGTCTGTCCT 


AGGTTCATTA ATGGTATTAG 


94 fif) 


GATCAGCATT 


CATTTTTAAA 


AAGArAaTCT 


AATTAAGTAT 


TTATAAAAGC ATGGTCGCAA 


9 S 90 


GTTTTACACT CTCCTACTTG AAGTtTCCGA ATGAAAACTC TATAATGAGA AATGTGAAAA 2580

TGTTCACTTA TTATAAATTA AGTtTGGAGG CTTTTATTTA TGAAAATTGC AATTGCAGGG 2640

GCCGGCGCGA TGGGTTCCCG GTTCGGACTG ATGTTGAAAC AAGGTGGTAA CGATGTTTTG 2700

CTAATCGACG GCTGGCAGGA GCATATCAAT GCCATCAAAG AAAACGGATT GAAGGCTAAT 2760

TATAATGGCG AAGAAATTAC TGTTAAAGTC TCAATTGTTA ATCAGAATGA GGTGCCCACT 2820

bb 1 bAbbAA i


1 I b A ill bA I 


T ATPTTATTP 


ACnAAHGCAA 


TGCAGTTGGA 


AAAAATGCTG 


2880 


PAPPATPTTa 

LAbbAi bl 1A 


a aPPa TT A AT 
AALLA1 1 AA1 


TPPTPATPAP 


APAGAAGTTT 


TGTGTCTTTT 


AAACGGTATC 


2940 


GGTCATGAAb 


ALb I AA 1 LbA 


a A A ATTPPT A 
AAAA1 X^OXn 


PPnATf^fiAAA 


AAATPTTTAT 


PPpPAAPACC 


3000 


ATGTGGACTG 


/"*tm rn /-* m A z"' A 

CTGGTCTAbA 


Abb 1 LLLbbb 


A AP^ZPTA A AT 


TATTTPPPAP 


PPPPTPTPTT 


3060 


GAATTGCAAA 


tv m rp tv /"* z - • it 1 A T* 

ATCTAGbl Al 


1 bbbL/uibnrt 


PA ATP AfZPf^ A 


AAAAATTPPP 


PPAAAPTTTG 


3120 


TCCGCGTCTG 


GTTTAAATbL 


/"> tv 7\ A ""P a TT PP 

L AAA 1 A 1 1 LL 


P APA AP ATTP 


ATTATTPA AT 


TTATPGTAAA 


3180 


GCTTGTGTCA 


ta rn m tv /**/^ T\ rp 

ATGGTACGAi 


bnAlboLI In 


TPPAPPATTT 


TAPAPPTPAA 


TATPGCGGGG 


3240 


CTTGGTGCAA 


CCAAACCTGL 


I LAI bALAl b 


PT APTPAPTA 
b 1 rtb 1 bML 1 ri 


TTPTTAAPPA 


ATTTPPAGCT 


3300 


GTTGCTGCCA 


Tv TV TV T\ T\ TV /"» fTl 

AAGAAAACGT 


L AAL I 1 AbAL 


HI ILLLbrVMb 


T A ATPPAAP A 


TPTAPAAAPT 


3360 


TGTTTTGACC 


CGACTACGAT 


rn rn Tv rn A rp 

TGGTATGLAT 


1 ILLLAlLbA 


TPPAPPAAPA 
1 bLAbbrlrtbrl 


PTTP AT A AAA 


3420 


AATAACCGTT 


TGACCGAGAT 


TGACTACATL 


AALbbAbLLA 


1 1 1 L 1 Lb inn 


APPTAAAAAA 


3480 


TATGGAGTAG 


TAACTCCTTA 


CTGTGATTTC 


mmr. A r , /~T' A A O 

1 lAALbLAAL 


TP PTTP AT AP 
1 bb 1 1 brVl Aka 


TAAAPAAPAA 


3540 


ATTCTAGGAG 


CGAAATAGTA 


tv m/^Tx rn rn Tt /"* A A 

ATCATTAGAA 


AbAbbA 1 bAb 


AAAA 1 bbnnb 


A AAAATTAAP 
n/T-Mn/\ X X nnU 


3600 


TCCAAAAATA 


TTTTTAAATA 


TV TV rn TV cnfri 7v 

AAGTATTAGC 


GGG1 ALLbL A 


BPPPPP ATP A 
ALbbbbA 1 LM 


TPPT APPPTT 
1 k^o X nbuo x X 


3660 


GATTCCTAAC 


GCGGTATTGG 


/■■-* tv /"* tv tv rn on rp rp 

CAGCAATTTT 


/** TV TV A rn rp TV rp rp rp 

GAAA1 1A1 1 1 


prqif a a A A T A 
bb 1 bnnHn 1 rV 


PPTT APP APT 
\*,\* X 1 nvav^nvj X 


3720 


AACGATCACA 


CAAATGGCAG 


TTATTTTCCA 


AL I AbLl ALi 


HC APT ATTP A 
LL AL 1 A 1 1 bA 


TTPPPPPTTT 

X X OvjUVJ\rf XXX 


3780 


GATTGCGGTG 


CAGTTTGGCT 


m TV 71 TV /*» 7\ T\ rp 

TCAAACLAA1 


bLAAA I b 1 1 H. 


PT APTTPPPP 
b 1 Ab 1 1 b^-^-o 


PTPPAPCTTT 

\j X UUnuU 1X1 


3840 


CGTTGGCTCC 


GGTGTTGTGA 


tv TV rn. rp tv tv rp r" 1 

AATTCAATll 


1 bALALAbL 1 


APTTAT ATPP 
ALi lAlAlbb 


PTPPTPPTAP 
uiuui uo x n\-> 


3900 


CGGAGACATC 


7\ P/l/™" TV TV m 7\ /*»/"» TV 

ATCAATACCA 


TbArl Ablbb 


7\ TP A ATPPPT 
A 1 LAn 1 bbb 1 


PTPPPPATPA 


TPTTPTTGAT 

x o x l v i i van x 


3960 


CGGCGAAAAA 


TTTGGGTCCG 


fPPPPTV ATTTT 1 

IbbbAAl lb 1 


TPPAAPAPPA 


ATPPTTPTTP 


PTATCGGCGC 


4020 


TGGTCTAATC 


GGCTATTATT 


rp TV rp Tv fTl z^ 1 rn rp tv 

1 Al Al LL 1 1 A 


PPT A APPA AP 


ATT APTPPPP 


PTATCGGTGA 


4080 


CTTAATTAAT 


tv m m m tv /-»rp t\ 

ACCTTTACTA 


CTTTbLAALL 


A ATTTTP ATP 
An 1111 \Jl\ 1 b 


TP A ATPTTPA 


TTPPATGTTC 


4140 


CTTTGCTTTC 


TTGATTATCT 


p p p p p a t p t r 1 
LbLLbA 1 L 1 L 


TAPAPTTPPP 
1 Hbrvo 1 1 u^v- 


ATTPPTATPP 


CAATTCAATT 

Vlili X X viul J. X 


4200 


GAACGGTGTT 


TCTGCTGGTG 


LAbLLbL 1 Al 


PPPTPTTPPP 
bbb 1 b 1 1 bbb 


PPA APAAPAP 


TTPTPTTPGT 

X 1 U 1 Li 1 X \J\J X 


4260 


tv ^ fry m tv tv ^* rn m 

AGTTAACTCT 


TGGAAAGTCA 


A f" A A A t~* A /**■ 

ALAAALLAbb 


PPTT APTTTP 
bb 1 InLl 1 lu 


PPPATPPP AT 


TPPPTGCTAT 


4320 


GAAAATGATG 


ATGCCAAAL I 


1 bl 1 1 AbbAA 


APPA ATT A TP 
/iLLriril lnlL 


T TPPTPPP AT 


PTPTATTCAC 


4380 


LbLTATCAlL 


TCAGCAA 1 1 L 


LAb i AbLb 1 1 


ATTPTPAPTP 


TPAPPPAPTP 


PTGCTTCTGC 


4440 


TGGTTTCGGT 


rn rn /"■» rp t\ /"» /"*■ T* 

T IbbT. Abbb 1 


1 bb 1 Abb 1 LL 


PTTPPPTTP A 


TTPPAPPPPP 


PTTTGAGTAT 

^ x x vjrvvj x ** x 


4500 


TATTCTACTA TTAATCAGCT GGTTTGTTGT ACCAATCGTA GCTGCTTTCG TAGGTCAAAT 4560

CTTATTTGAA AAAATCTTGA AATTGTATGA TCGTAAAGAT GTTTTCGAAT TTTTAGGATA 4620

GTCAACTAGA AAAAGTGTGA GCCTTGTGCT CACGCTTTTT TGTGCAATTG AAGTTGGTGA 4680

ATCTCAGCGA AAAGAGAAAG GTACCTTAAA TAGTGTGTTT CTGGTCAATA GCACAATGAA 4740

ATATCTAAAT ACATTATGAA CTGCTTTTCG TTTTATGAAG AGCAGTTTGT GCTATTTGAG 4800

AAATTGCTCA


ATTTAATCAT 


TAAGTAGTTT 


CATCAATGGT 


TV in T\ rnf*^ rn TV 

GACTATGCTA 


CTGCAATTCT 




TGTTGTACCA 


ACCGGGAGCG 


TCAATAATTT 


m rp m t\ tv j\ rn t\ 

TGTGTAAATA 


ACTCGTLCi I 


T> r" 7\ TV Ti Tv rp Tv 

LIGLAAAAI A 


h y^u 


ATGGGTTACT 


/**» ■* rn TV T\ TV /"l TV m 

CAGTAAACAT 


TGAAGCTAAT 


GTATCCGTCA 


CTTb 1 i bbAA 


bL-Cl I iAlbG 


/I QOA 

h you 


CTTCTATTTA 


AGAATTTTTG 


ATTGTATGTA 


TCAAAGATGG 


TV rn tv m TV r - * TV TV 

AT AC T AG G AA 


ALGCTCTAGT 


c n ^ n 
oU4 U 


GATTCTTCAT 


TTTGAAACTG 


CTCTTTTCTG 


CGGCTGTATT 


TCTTGATTTG 


rpryirnTirnm/^Ti Tv Tv 

1TTATTGAAA 


CI Aft 

blUU 


GATTCGATTA 


AATTAGTTGA 


GTAAATGCTT 


CGGCGGATAC 


CAGGCGGAAA 


f** rn tv m tv tv tv tv tv 

CTCATAAAAA 


5160 


GTCAATAAAT 


CTTGATTTTT 


TAtyCAnJcGC 


TGCGTCACTT 


rn tv j^* t\ Ti tv tv *P 

TAGGATAAGT 


TTTLTGCCAC 




TTCTCAATCA 


TACTCCCTAA 


AAAGGTATTT 


GCCTCTTCCT 


TTGAGCTAGC 


mm/^ TV rn TV TV TV /"* TA 

TTGATAAACA 


COOA 


GCCTTAAAAT CATCACAGAT TTCTTTTCGA TCTTTGaCAC GTACTTTATG AGCAATATTA 5340

CGGTAAACGT GAAcACAGCA ATGTTGGTAT TTAGCtTTAG GATAAATCTG GTGAATCGTA 5400

TCTTTCATGC CTTTTAAGCC ATCGGTAATA AACAGTAAGA CATCATGAAC CCCTCTGGAG 5460




TAAATGTCCT GTAACAACTC ATTCCAAACG TAGGTCGATT CAGTCGGAGC AATCGCATAA 5520

CTTAGTACTT CTTTAGTGCC GTCTTCTCGT ATACCAATGG CAATATAGAT TGCTTCTTTG 5580

GAGACGGTTT GCCGTTTTAA AGAAATATAA GTAGCATCCA TGAAAATAGC GACATACTTA 5640

TCATTTAGAG TTCTAACTTT AAATGCATTT ACTTCTTCAG TCTGCGATTT CCGACATGGT 5700

AATACCTTTT CGGAAGAGGT GAATGACGGT CTCCTCTAAG GTATCATTCG TCCGTCTATA 5760


AGCAGGAACA 


GTCTGTTGCT 


TGAACTCGCC 


GTTGCGGTCG 


CGCGGAATTC 


TGTGAATGTG 




TTGTGGTAAC 


GGTTCTGTTG 


CAAAGTTTTA 


AATCTACTAT 


TV TV Tl m TV TV ITT 

CAAATAAGGT 


TV /"* TV 7\ Ol T\ Tv rp T\ r"* 

AGAATAATAG 


coon 


AAAAAGATAG 


CAGGAGGAAT 


GACGGTGAAT 


CATTTTAAAG 


'a tv tv /■* tv tv m m 

GAAAGCAATT 


m tv rp Tv /"^ 7\ rp 

TCAGTAGGA1 


t; Q A A 


GTGATTATTG 


TAGCCGTGGG 


CTACTATCTT 


CGTTATAACC 


TTAGCTATCG 


rri /*"■ TV TV /*■" m rp /"t TV T\ 

TGAAGTTLAA 


buuu 


GAAATCTTAT 


ATGATCGTGG 


CATTAACGTT 


TCTCATACGA 


CGATTTATCG 


TTGGGTGCAA 


C A C A 


GAATATGGCA 


AACTACTCTA 


TCAAATTTGG 


AAAAAGAAAA 


ATAAAmAATC 


CTTTTATTCA 


C 1 O A 

0 J. ^ u 


TGGAAAATGG 


ATGAAACGTA 


CATCAAAATT 


AAAGGaAAAT 


GGCATTATTT 


GTATCGAGCC 


£ 1 O A 


ATCGATGCAG 


ATGGTTTaAC 


CTTGGaTATT 


TGGTTACGTA 


AAAAACGGGA 


f-y T\ T\ TV TV TV 

CACACAAGCA 


b^4 U 


GCCTATGCTT 


TTCTTAAGCG 


GTTAGTGAAG 


CAGTTTGATG 


AACCGAAGtT 


x_ Tv rp TV i*""* TV 

GtAGTCACAG 


CI A A 


ATAAAGCCCC 


CTCTATTACA 


AGTGCCTTTA 


AGAAACTAAA 


TV TV m TV ^ ^ 

AGaATACGGC 


m rp rp rp 7\ rp T\ TV r - ^ 

TTTTATLAAU 


DjDU 


GGACAGAACA 


TCGTACCATT 


tv tv tv m t\ f% m tv 

AAATACCTGA 


7i rn ta tv fri m rn tv iti 

ATAATTTGAT 


rp f TV TV /"< 7\ 7\ /"■< TV /"* 

TGAACAAGAO 


1 Co 1 LLALi 




TAAAGAGACG 


CAATAAATTC 


TATCGAAGTT 


TACGCACTGC 


CTCTACCACG 


7\ rp rp tv Tv TV f~° A 

ATI AAAGbL,A 


D fl 0 U 


TGGAAGCCAT TCGAGGATTA TATAAGAAAA CCCGAAAAGA AGGCACTCTC TTCGGGTTTT 6540

CGGTCTGTAC TGAAATCAAG GTATTATTGG GAATCCCAGC TTAAATCATA GATACCGTAA 6600

GGgATTTTAT TCTTTATTTA AAACTTTGCA ACAGAACCAA TAATTTGATC TACTCATTTC 6660

TAATTCTTGG GAGAATTTTT CTTTTGATTT GGTACCTCTC ATCTTTTTTA AGCTTTCTCT 6720

AAATGCTAGT TCCAAGTTTC TCCCTCCTGT TTATGTAAAT ATTTTGCACG TTTTCCATTT 6780
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A 7\ TT A T B TPf 
AA1 IrtlAl Lb 


1 1 ACACC 1 1G 


TAAATCACAA 


GTGATTAATC 


APAAATPAPT 


TPTGATTAGT 

X Vj l VJrt x x n\j x 


6840 


GAl.lbl.l A IX - 


^ rp /"^ rp rp r> ppfn 

CiGlijI 1GCCT 


AGAACGATTC 


TAAGCACATG 


TTTAATTTTA 
111 nn 1 1 1 Jn 


AAPTPPPPAA 


6900 


TTATCTTTAA 


AA1G 1 X AC 


AATGATTCAA 


TAAAAGCATG 


PPTP,PTAPTP 


AAAPTTP.PPA 


6960 


CCATGCTTTT 


AAAAAT ACT T 


AATGGGGTCC 


CGAGCGCTTT 


APPTPPTTPP 


A APPTPTPAP 
nnuL lul v.nu 


7020 


TAGTATACCy 


AATAATTTAT 


CTACATtCCC yTTAGTAACG 


TP ]r A APTTTP 
1 b K AAb 1 1 


L/uuil 1 InLa 


/ u o u 


AAAGCGACTC 


AtAGmmTtAT 


TTPPTCCCGT 


TAAATAATAG 


ATAAPTATTA 


A A A AT APAPA 


/ 14v 


ATACTTGCTC 


ATAAGTAACG 


GTAPTTAAAT 

W X nvr X X ruTil i. 


TGTyyACTTT 


1 b 1 1 1 C 


All bC 1 1 bA 1 


/ £. VJU 


GAAACTGATT 


TTTAGTAAAC 


APTTGAPGAT 
no x l ori^urv x 


ATTCTCGATT 


GACCCA1 111 


/"» 7\ n APA)\ APT 


*7 9 


ACGTATATAG 


CTTCCAATAT 


TTATPTGGAA 


CATCTGTGGT 


A 1 bbbbbb 1 A 


APTTTT ATT A 

Ab 1 1 1 Inlln 


/ «5 £ U 


AGACACTGTT 


TACTTTTGGT 


TTAPPATHAA 


AGCATTCCGC 


TGGCAGC 1 1 A 


AbCAAl IbCl 


7 ^fln 

1 JOU 


GAATCGAGAC 


TTGAGTGTGC 


aap^appaapp 


CTAGTGTTCG 


G rGAAl A ICC 


AAbb 1 ACbb 1 


7 A A n 


TGTAGAATCC 


TTCTTCAACA 


ATP Afl AT AP A 
r\ X Lnun 1 Hurt 


TGTCAGACGC 


Al GbC 111 CA 


A A A ArTAPTT 


7 son 


TTTTAATAAT 


TTGTGTGCTT 


AAATGGTAAG 


GAATATTCCC 


AACAA1 1 1 1A 


i ACC 1 C 1 b 1 1 




TGTTAGGGAA 


TTGAAACTGT 


AGAATATPTT 


GGTGAATTAA 


AG 1 GACACGA 


A 1 b 1 1 C Ab x x 


7 620 


TTAATTTTTC 


TGACGATAAG 


TTGAATAGAT 

X x Vj/ui x xivjn x 


GACTGTCTAA 


4_ rp /—! T\ t\ rp tv /""■ -> /"*• 

t rcAAT. Abac 


b i 1 ACC 1 b 1 1 


7 fiPfl 


TACTTATTTT 


AGCCAGTTTC 


PTPPTTAAAT 


GCCCTTTACC 


TGTTCCAATT 


i Cb 1 AAACbb 


7 7 4 0 


TAwCGGTTTC 


TTTTAAATTC 


A ATTPTTTTA 


TTATTTGGTT 

X X *i XXX VJ W A A 


GAGTACTTTT 


rp T\ fTp /"> /"^ rp A 

TCAClCbl 1 A 


7 Ann 


AAAAGTTTTG 


AGAATATTTT 


ATATTTTTGT 

-T* X a* ^ X X X ^ w A 


TCcATGTAAT 


CAC 1 CC 1 bAA 


b 1 bA i 1 Ab A 1 


7fl fin 


CTATAAATAA 


ATACAGAAGT 


TAAACGATTT 


GTTTGTAATT 


TTAGTTATCT 


/Tprprp j\ A A TV Zi P 
bill AAAAAb 


7920 


TCATAAGATT 


AGTCACTGGT 


AGGAATTAAT 


CTAACGTATT 


1 Al 1 i Al C1G 


PPTB A TP APT 
Cb 1 AA1 bAb 1 


7980 


GTTTTTAGGG 


TTCTGTTGCA 


AAGTTTTAAA 


TAAAGAATAA 


A A TPrPTTAP 

AA1CCC1 1AC 


pp<pj\ TPTBTP 
bblAIClAXb 


8040 


ATTTAAGCTG 


GGATTCCCAA 


TAATACCTTG 


ATTTCAGTAC 


AbACCbAAAA 


rr*PP A A P A P A 
C C C b AAb Ab A 


R 1 00 


GTGCCTTCTT 


TTCGGGTTTT 


CTTATATAAT 


CCTCGAATGG 


C 1 1 CCA1 bCC 


TTTA ATPP.TP, 
ill AA 1 bb 1 Vji 


8160 


GTAGAGGCAG 


TGCGTAAACT 


TCGATAGAAT 


TTATTGCGTC 


1 C 1 X X Ab 1 bb 


APP ATPPTPT 


8220 


TGTTCAATCA 


AATTATTCAG 


GTATTTAATG 


GTACGATGTT 


PTPTPPPTTP 
b 1 b 1 bbb lib 


A T A A A APPPP, 


8280 


TATTCTTTTA 


GTTTCTTAAA 


GGCACTTGTA 


ATAGAGGGGG 


LI 1 inlL lb 1 


PAPTAPAAPn 


8340 


TTCGGTTCAT 


7\ TV TV trt Z* 1 /"* fP CT* 

CAAACTGCTT 


CACTAACCGC 


TTAAGAAAAG 


PAT APPPTPP 
Ln 1 Abbb 1 bb 


TTPTPTP^TPP 


8400 


CGTTTTTTAC 


G T AAC C AAAT 


ATCCAAGGTT 


AAACCATCTG 


P ATPPATPPP 
L-n 1 CbA 1 bbb 


TPP.ATAPAAA 


8460 


TAATGCCATT TTCCTTTAAT TTTGATGTAC GTTTCATCCA TTTTCCATGA ATAAAAGGAT 8520

TkTTTATTTT TCTTTTTCCA AATTTGATAG AGTAGTTTGC CATATTCTTG CACCCAACGA 8580

TAAATCGTCG TATGAGAAAC GTTAATGCCA CGATCATATA AGATTTCTTG AACTTCACGA 8640

TAGCTAAGGT TATAACGAAG ATAGTAGCCC ACGGCTACAA TAATCACATC CTGCTGAAAT 8700

TGCTTTCCTT TAAAATGATT CATCGTCATT CCTCCTGCTA TCTTTTTCTA TTATTCTACC 8760
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1 Ini 1 i Ijnln 


P T & P A T T T A A 
blAbAl 1 1AM 


fintl I ioV-drt 


pa^aapphtt 


PATGGGTATA 


GCATGAATTG 


8820 


111 MAt lult 


b i b 1 rtobrtl 1 






TPTTATTTTG 


TTAATTGTCT 


8880 


1 b 1 AAb 1 1 ol 


til IHiHILo 


P A PT A & P.TT T 


a a a a a ap,ta a 


A AAflATTTAT 


CTCTCAGATA 


8940 


All bb I Al Al 


bAAbb 1 AAb 1 




a a apta att a 


TPPTAPAAGT 


PTPAPAAPAA 


9000 


m tv t\ 7\ it* rp 7\ rn 

ATAAAGT1 Al 


/"* TV 7\ rp^-»/-< 7\ 7\ rp tv 

b AA 1 bbAA 1 A 


Alllllil 


1 no UlLul 1 1 


P,PTPATTPT A 
oV-> x \*-rv x X L> 1 Xx 




9060 


TCTTATTTAA 


TV /Tp rp TV 7\ T 1 T 1 T\ 

AL I 1 AAb 1 1 A 


Lnnnl 1 AAl A 




TTAAAATAAA 


AATAAAPAAA 


9120 


AATGCAAAAA 


TTTGCGCTTT 


rp rp/-> Tv T> /— ■ r. T\ TV TV 

1 IbAlbAAAA 


AAA1 1 1A1 1A 


PATAATAAAA 
A 1 AA 1 rirxrxrV 


A A A PP A AAA A 


ai or* 


ATCATTCCAC 


GAATGCACTT 


TTTTbbbbb 1 


bbbAAbbAb 1 


ATTTPA A A AT 
All IbAAAAl 


PTTTPTTT AP 




GAACTATTCA 


■A TV /"^ TV T\ /"^ TA /"* v™» 

AAGAACAGCG 


7\ tv t\ TV r""* rp /"»rp t\ rn 

AAAAbTb 1 Al 


AA 1111 Ab 1 b 


AAbAAbAAl 1 


TTAT^AAP.P,A 


9300 


ATATTCAAAA 


AAAGAGCTGC 


CTCTTCTTTT 


bAbb 1 AbAbA 


AbAbbbAi bA 


TPTP A A APTT 


9360 


AAAGATTTAC 


CTGTTTTATC 


AGATAGAAbb 


A i b A 1 b Ab 1 A 


TPTTRriiB TAT 

1 b 1 1 Ab AAA 1 


A RTTPATTAT 
AAl ILnl 1A1 


9420 


GCTAAAGAAG 


AAl 1 1 Al lib 


1 bbu i MbbAb 


ijnnljnL 1 1 AA 


ATTPATTPTT 
All ^n, X X \j 1 x 


THATATATTC 

X on X t\X n 1 1 v^ 


9480 


CAGAATAATA CCGAAAAAGA AAATAAAGAC TACATTTACA AATTATACAA AAAGTCTATA 9540

GAGCTAAAAG AACATTCTAT AATATATTGG AATCTTTACT TAATAATAAA AATCCAATGT 9600

TCTGAATACG ATTCAAGAAT TGTCCCTACA GACTCACAAG ATTTATCTGA GTTAAAAAAA 9660

ATGaTTTTAT CAAaACaAAA GTTTACTCTT TACGACTATA AAATTGTTAC TAATCTATCT 9720

CTTGTATTTT CTTACAAAGA ATTGCAACC 9749



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

AAAATCAGAT ATCTTATAAn nGnAATCAAG CACGTATTCG CGATAAGGGT AAAAAATTAC 60 

GTAAGAATCG GCACAGCGCT TCTTTAAAAA GTAAATTAAA CTTTGACAAC TTTTTCGAAT 120 

AATGGTACTG TTAAGAAGTA AATTTGTGAA TAGAGGAGGT CGAAGAACAT GGCAACAAAG 180 

aAAGCAGCTC TTGCTTGTTC CGTTTGTGGC TCTCGCAACT ATTCTAAATC CGTTAGTGAA 240

GGTAAACGTG GCGAACGCTT AGAAATCAAT AAATTCTGTA AATATTGTAA TCAATACACG 300 

TTACATAAAG AAACGAAATA GATTTAGAGG AGGGTATCAA CATGAAATTT TTCCGCAGCG 360 

TTGCTGATGA GATGAAGCAA GTAACTTGGC CAACAAAAAA ACAATTGCGT AAAGATACAT 420

TAGTTGTAAT TGAAACATCT ATTTTATTTG CAGCATTGTT CTTTATCATG GaTACTGTTA 480

TCCAAACGGC ATTTGGcTGG ATTTTAAAAT AATTAATTCT AGTATTTCCC GACGAGGGGT 540

GCTATAATAG TATTCGAGAA AAAGCTTCGG GAAACTGAGC TTTTTTTATT TGCTCGAAAA 600 

AAATAAACTA AAACATTTAA AGGAGCCGAA TCAGTAATGG AAACTTTTGA AAGAAATTGG 660 
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TAcGTATTGC 


ACACCTATTC 


AGGCTATGAA AATAAAGTAA AwGCAAACAT TGAATyACGT


720 


GCACAAAGCA TGGGGATGGG CGATTATATT TTTCGTGTCG TTGTACCAGA AGAAACAGAA 780

AAAGAAGTCA AAAATGGTAA AGAAAAAGAA ATCGTTCATA AAACTTTCCC tGGTTATGTG 840

TTAGTGGAAA TGATTATGAC CGATGATTCT TGGTATATTG TTCGTAACAC GCCAGGCGTA 900

ACTGGTTTTG TAGGCTCACA TGGTGCTGGT AGTAAACCCG CTCCTTTATT ACAAGAAGAA 960

ATTAACCACA TTTTACGTTC AATTGGCATG AGCACCCGTC AATCAGACTT AGAAGTAGCG 1020

CTAGGCGATA CAGTGAAAAT TATCGAAGGT GCCTTTTCTG GTCTTGAAGG TGTTGTGACA 1080

GAAATCGATG AAGAAAAACA AAAATTAAAA GTAAACATCG ATATGTTTGG TCGTGAAACA 1140

AGTACAGAAT TAGACTTTGA ACAAGTCGAT AACATTGATT AATAAACAAG CTGACAAACT 1200

ATTGTTCAGC TGATTGATAC AGATTGTAGG GTCTGGAAAA GCAGTCAACT GACTTCTTTT 1260

CCAAGCCCTT TTTTAGTTCA TCGCAGAAAG GAtGrAAAAA AATGaACATG CCCAAAAATA 1320

TCnGTtATTT TTCTTTGCTA ATGGGTCTtG tTCTATTATT AAGTGCTTGc CAAATTGGGG 1380

CaACTACGAA GGATGACAAC CAAGCCGCCa CAAAAGAAGC AACTGTTGAG TTAAACCGCA 1440

CAACAACACC AACGCTTTTT TTTCaTGGTT ACGCAGGAAC TAAAAATTCG TTTGGCTCGT 1500

TACTGCATCG CTTGGAGAAA CAAGGTGCCA CAACTCAAGA ATTAGTGCTA CTCGTTAAAC 1560

CTGATGGGAC CGTGGTTAAA GAGCGAGGAG CTTTAAGTGG CAAAGCGACG AATCCCAGTG 1620

TTCAAGTTCT ATTTGAAGAT AATAAAAACA ATGAATGGAA TCAAACAGAA TGGATAAAAA 1680


ACACATTACT CTATTTACAA AAAAATTATC AAGTGAACAA AGCCAATATT GTCGGGCACT 1740

CTATGGGTGG TGTTAGTGGT TTACGTTATT TAGGAACCTA TGGGCAAGAT ACATCGTTAC 1800

CTAAAATTGA AAAATTCGTC AGCATTGGAG CACCTTTCAA TGATTTTATT GATACGAGTC 1860

AACAGCAAAC CATCGAAACG GAACTAGAAA ACGGCCCCAC AGAAAAAAGT AGCCGCTATT 1920

TGGATTATCA AGAGATGATT AATGTTGTTC CAGAAAAACT GCCCATTTTA TTAATTGGTG 1980

GTCAATTAAG TCCAACAGAT TTAAGTGATG GAACGGTGCC GTTATCTAGT GCCTTAGCAG 2040

TCAACGCCTT GCTAAGACAG CGAGGAACTC AAGTCACTAG CCAGATTATT AAAGGAGAAA 2100

ATGCACAACA TAGTCAATTA CATGAAAATC CTGAAGTAGA TCAATTGCTA ATCGAATTTC 2160

TATGGCCGAG TAAAAAaTAG CAaAAAGGCA CATGTTCCAT AAAAAATGGA ACATGTGCCy 2220

TTTTCTATAC TTTGGCTAGC GAAATAGCTT TTTTCAGCTA TAATAAATAA AATTATTTAT 2280

TTTTATGAGA TATTGGAATG GAATCCTCAT GATTTAAATA AGCACAAATC AAGGCTTCAA 2340

AGGTGATTAA TGTTTGGCCA AAAAACATTT GTGGCTGAGC GACACTTGTT GAAAACGGGC 2400

TGTAAGAGTC AGTTGAAAAA ACGAGATCTG CCAGTCGACC AAGAGTTGAA TTTTTGGCGC 2460

CAACAAAAGA AATGATAGGA TGTTTTTTCT CTTTCGCAAG CAGAGCTGTT TCTTTTAAGT 2520

ATTTTTCTTC ACCAGAATGT GAAAGGATGA TAAAAAGAAA ATTCTGATTA TTTTGACTAT 2580

TTATCGAATG AGTATAGGCT GTTGTGATGC TGGGGATGCC ATGAAAATTG AAAACTTCAC 2640
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TAATGTAAGC 


CGCCAAATGT 


CGTGAAAAGT 


CATGACCTAA 


AATAACAAAT 


AAATGACTCT 


2700 


TGTGTTTGGC AAGCAGTCGA CAAAACTCGT TGGTTGTTTC AAATGAAGGT GCTGTTTCAA 2760

AAGGGATTGG CTGAGAAAAA TGGGCTTCTT TTAATTTGTA AATAAGTTCA TTATAACCTG 2820

ATAAATTTAA TTTTTTGGCT AAACGAATGA TGGAACTAGG CGAAGAAAAG CAGTGTTTTG 2880

CAACTTCCCG AATACTCAAA TCGGCCGCAT TCGTACTTTG TTGATCCAAA TAATACAAAA 2940

TCTGaCTTTC TGTTTTCGTT AAATGaTACG TTTCTTTTAA GTAATCTAAG TTCATGAGCA 3000

AACCTCCTTT ATCAGATTAT AAATAATTTT GTGGGAAAAA GGTATCAAAA AATAAAAAAA 3060

TGAGGTGTTT TTATGAAAAG GGCAACAAAG CAAAGGCTGT CTTTGGCAGC AATCATGGTT 3120

CTACTTCTCT CGGGCTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA 3180

TTACGGGTCG GGATTGATTC GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT 3240

ACCGCAGCAG ATGTAATGAG CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 3300

GAAGCGAAAC CAGCATTGGC AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT 3360

ACTTTTACGA TTCGAAAAGA TGCAAAATGG AGTAACGGCG AGCCAATCAC AGCAAATGAT 3420

TTTGAATACT CTTGGAAGCG CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC 3480

TTTGAAGGGT TAAAAAATTA TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA 3540

GGGGTAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT 3600

TTTCAACAAT TATTGGCGGT ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA 3660


ACGGGCAAAA ACTATGGTAC ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA 3720

GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT 3780

CAAGCGAATG TTTCGCTAGA TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG 3840

AAAAATCTTT TCGAAGGGAA AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA 3900

CAAGAACAAG GCAATGCAGC TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA 3960

TTAAATACGC AAAAAGATCT TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA 4020

TTGAATTCTG AGCGTTTAGC TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC 4080

TTCGTGCCAA CAGGTTTCAC TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 4140

GATTTAAATC CTAGTGAACC AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 4200

TTAGGAATTG AAAAAGCGGA GcTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA 4260

ATCAGTGAGT ATGTTCAaGG AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT 4320

TCACCAGTTC CTTTTAATAA TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG 4380

GTTGGTGGCT GGACGCCaGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 4440

AAAAATTCCA ATAATTTTGG TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA 4500

GCAAACGTAA CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 4560

CAATTGGTTG CGGAAGAAGC CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA 4620
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GTGGCCGATT 


CTGTCCAAAA 


TTTAGTCTAT 


GGTCCATTAG 


GTTCAGGCTA 


TTACAAATCA 


4680 


GTCTCTATCG GCGACAAGTA AAGGTTGGGA AAAAATGAAA AAAAGTCAAG CTGTTCAATT 4740

AATTCAGGAA CTTTCAAATG CTAATGGTGT TTCAGGCTTT GAAACAGAAG TCGTTAGAAT 4800

CCTTCAACAC GCAACTGCGG ATTTTACTAT TCAACGGCTA GATTCCATTA AAAATTTATA 4860

TCTTGAGAAG AAAAATAATC TAAGTGAAGG TCCAGTAGTG CTTTTTGATG CTCATAGTGA 4920

CGAAGTGGGT TTTATGATTC AAGCAATTAA AGAAAACGGC TTGTTGCGTT TTTTTCCGTT 4980

AGGTGGCTGG GTCCCTAATA CGATTTCTGC ACAAAAAGTT CGGATTCGTA ACCGGGAAGG 5040

TACCTATCTG CCAGGCGTAG TTACTAGTCG GCCCCCTCAT TTTATGACAC CAGAGGAACG 5100

TCAGCGACCG TTAACCATTG CTGATTTGAC GATCGACATT GGTGCTACCT CAAAAGAAGA 5160

AGTAATTGAA ACTTACAAAA TCGATCTTGG TGCACCGGTC ATTCCAGATG TGACGTGCTG 5220

TTACAACGAA CAGACAGATC TTTTTTTAGG AAAAGCTTTT GATTGTCGCA TTGGCTGTGC 5280

TTGCTTAGTG GATGTCATGG AGGAGTTGAA AGAAGAGACG CTTCCATTCA AGTTAGTTGC 5340

GACAGTAACT GCACAAGAAG AAGTTGGCGA AAGAGGAGCA CTTATTGCCG CAAAACAAGT 5400

CAATCCTGAT TTAGCTATTG TTTTCGAAGG CTGCCCGGCT GATGATACGG CAGAAACGCC 5460

AGAGATGATT CAATCAGCAA TGGGAAAAGG ACCGATGTTA CGTTATTTTG ATGTTTCTAT 5520

GATTACGAAT CCGGAATTTC AAGAGTATGC ACTAGAGATT GCTAAAATAC ACAAAATTCC 5580

TGTTCAAGTT TCTGTTCGAA GTGGTGGTGG AACCAATGGC ATGGCGATTA CACAAGTTCA 5640


AGGAGCCCCG ACGATTGTTG TAGGAATTCC TGTCCGTTAC GCTCACACAC CCCATTGCTA 5700

TGTAGATTTT CAAGATTACC AAGCGGCGAA AGAATTAGTC ATTCAATTAA TCAAAAATTT 5760

AGATGCTGAC AAAATTCAAG CACTGGTTCA GCCATTGTCA AAGGAGTGGA ACAAATGAAG 5820

ATTTATGTAT CATGCGATAT TGAAGGATTA GCAGGTATTG CGACATTTGA TATGGAAAAA 5880

GAAGACACCG TTTTATTCCG AGAATTATAT CACCAACATG TGGCTTGGTT AATTGAAGGG 5940

ATTCAAAAGA GTGCCAAAAA TGAACAAATT ACAGAAATTA CTATTGCCGA TTCGCATAGC 6000

CGTGGATTGA ATTTAGCTTA TGCGCGCTTG GCAGAAATGG ATGAACGAAT TTCCTTAGTG 6060

AGCGGCTTTC CTCGCATGGA TTATATGATG AGTGGCTTGG ATAGTAGCTA TGATGTGGTC 6120

TTTTTCTTAG GTTATCATGC CGGTATTGGT AAACAAAAAG GGAATATGGA TCATGGGTAT 6180

AGCGCTAGTG TTGCTTATGA TTTAAAAATA AACGACTTAG CGATGAATGA AACGACAATT 6240

AATGCAGcTT ACGCTAGCGA ATTAGGTGTC CCTGTTGGTT TAATTATCGG TGAATCAGGC 6300

TTAGAGGAAC AACTTTTTCA AGAAAAAATG ATGCCCGAAG TTCCCTTTGT TTCCACCAAA 6360

GAATCTCTGG GACGTTATGC AATCAAAAAT AGACCCATGC AACAAGTTCG GGAAGCTATA 6420

GTAGCAACTA CAAGTCAGGT GTTAACTTCC TTTGCGTTGT CTGAATTACC CCGATATGCC 6480

TTGCAAACAC CTGCAACTGt AAAATTGCAA TGTGTGACGA CAGCACAAGC AGATCGTATT 6540

GAAATGTTAC CAATGATTAA ACGCATTGAT GGTAGAACTG TTTCATTTGT AGGTGAAACG 6600
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ATGAAAGATG 


TCATGAATGG 


AATTGTCGCT 


GTGGTTGGTT 


TAGGCGGAAC 


AAGTTATTAA 


.6660 


AGTAAGAAGC TTACGGAAAA AACGAATGGA CATTTTTTTC TGTAAGCTTC TTGCTATAGT 6720

TATTTCTATT TTTGTTTCTC TAGAATTTCT TTGCTTAAAC GTTGTGCAGT TGGTGTCACC 6780

GCAAGACcGC CTTCAGCGGT TTCCTTGAAA ATTTGTGGCA TTTGGCGTCC CACTTGGTAC 6840

ATTGCGTGGA TGACTTCATC TGGCGGAATG ACACTTCGAA TCCCCGCTAA AGCCATGTCA 6900

GCTGAGATGA AGGCTTGAGA AGAACCTAAT GCGTTTCGCT TCACGCAAGG GACTTCAACA 6960

AGTCCCGCAA CAGGATCACA AATGAGGCCC ATCATATTTT TTATCGTAAT GGCCACAGCT 7020

TGAGCAGATT GATCAGgCGT ACCGCCAGAA GCCGCAACTA AAGCGGCACT TGCCATTGCA 7080

CTAGCAGAAC CGACTTCAGC TTGACAACCA CTTCGGCACC ACTGATTGAG GCATTATTGG 7140

CAATCACTAA GCCAAAAGCC CCAGCTGTAA ATAGAAAATC CAATTGTTGT TGATGCGTTA 7200

ACTTTAAGCG ATCAATAACT GCCAACAAGA CGCCAGAAAC TACTCCGGCA CTGCCTGCGG 7260

TTGGTGTAGC ACAAATCAAG CCCATTTTTG CATTAACCTC ATTAACAGCA ATCGCATTTC 7320

GCACGGCTTG TAAAATGGTC TCACCGCTTA AAAAGTTGCC TGAATGTATG TAATCATTTA 7380

AGCGGGTTGC ATCGCCACCA GTTAGCCCTG TTACTGAGGT AACACCAGCG ATTCCTTCAG 7440

CGACGGATTG TTTCATAACG GCTAGATTTT TTTCCATTGT TTCAATAATT CGTTCACGAG 7500

AATACCCGCC ATGAGCCATT TCTGTTGGAA TCATTAATTC GCTCACAGAA GGGTAATCTT 7560

TCGCTTGAGC TACCAAATCT TCAATTGAAT AAAACATCGT ATTACTCCTT TACGTTCATT 7620


TAGTCAAAAT AGTTAACGCT ATATATATGA GGGATTTCAG CTAATTGCAT GACAATGTCA 7680

CCGACTTCGG CTTGATCAAC TTCAATAATC ATAATCGCTT TTTCCCCTTT AGATTCTCGG 7740

GTAACGGTCA TTGTACTAAT ATTGATATCG CTAGCTGAAA GGATGTTGGT CACCTTAGCA 7800

ATCATGCCAG GAACATCTTG ATGAACAACG ATAAATGTTG GCGTGCCCAT ACTTAAAGAC 7860

AATTTGAAGC CATTTAATTC AGAAATTTGA ATGTTTCCAC CACCAATTGA AATTCCTGTC 7920

ACAGATAGTT TTCGGTCGCC AGAAGAAACC ACAATTTTAA CGGAATTAGG GTGGTCTGCT 7980

TTTTCACTTT TAGGAATAAA ACAAACTTCC ATTCCTTGTT CATAGGCAAT TTCCAAAGAA 8040

TTGGATAGTT GCTCATCATC TGGTTCCATG CCTAAAAGGC CGCCAACTAA GGCGATATCT 8100

GTTCCGTGAC CACGATAAGT TTTGGCAAAA GACTCATATA AATAAATGTC AACGCTATCA 8160

GGTTGCTGTC CAAAAATACT GCGAACCACT TTTCCAATTC TTGCCGCTCC AGCTGTATGA 8220

GAACTGCTTG GACCAATCAT AACAGGTCCG ATAATATCAA AAACACTATT AAAGCGTAAC 8280

TCTTCCATTT CAGCACCTTC TTCATTAGAA TATGTTTTCA TTGGAAGTCT AACACAAAAA 8340

ATGGCGGGGA ACTAGGCAAT CGCTTGTGTT TTAGCGAATA ATTTAAATTT TTATCAGAAT 8400

GAAAGCAAGG GGCCTTCGTT GACAAATCAG TGAAGAGATG CTAAATTCTA GATAATAAAA 8460

GACGAAAAGA GGGATTCGTG CGTTGAAAAT TTAAGTTAAT TTTTTAGAGA AACTGTATAA 8520

CTTTGCCTGA ATATAACGGG TGGAGTGTGC GGATTTTCGG AAATTAGCTT AAAGAATTCA 8580
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ACCACGGGTC TTTTTGTATA CCTTGGATTG ATGCCGCTGA TTTCCGTAAA GCTGCATCAA 8640 

TTTTGCTTTA TGGAGGTATA TTTATGTCGA AAATTGAACT AAAACAACTA TCTTTTGCCT 8700 

ATGaTAATCA AGAAGCGTTG CTTTTTGaTC AGGCAAATAT CACGaTGGAT ACCAATTGGA 8760

AATTAGGaTk GATTGGCCGC AATGGCcGTG GGAAAACAAC CTTATTAAGA TgTTACAAAA 8820 

GCAGTTGGaT TACCAAGGAG AGATTCyTCa TCaAGTCGAT TTCGTCTATT TTCCACAAAC 8880 

AGTTGCAGAA GAACAACAGC TCACTTATTA TGTCTTACAA GAGGTGACTT CTTTTGAACA 8940 

GTGGnAATTA GAACGAGAAT TAACGCTTTT AAACGTTGAT CCTGAnGTTT ATGGCGTCCC 9000 

TTTCTTCTTn ATCAGGCGGC G 9021

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6491 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

ACAATAAACG GGAGGGATCG TTTATGAATA TTGAAAAAAT GACAACCACG CTACAAsAGG 60 

CGATTGCTGA AGCTCAAAAA GTTGCGGTGA CACGTCAACA TCAAGAAATT GATATCGCAC 120

ATTTATGGAA AATTTTTCTT CAACCAAACC ATTTTGGACG TAATTTTTAC ACAGATGCTG 180

GTTTAGATGT TGACGCATTT GAACGAGAAG TAGATAACGC GTTAGATGAA TATCCAAGTG 240

TGGCAGGCGG TAATGTGCAA TATGGTCAAA ATTTAAGTCA AAATTTGTTC CATTTATTGC 300

AAGAAGCTGa TTCACTTCGA GAAGAtTCCA AGATGaATTT TTATCaaCCG rAATTGtACT 360

TTTAGCTTTA ATGrAATTGa AAAATTATCG TTTaACmAAA TATTTAATGc raACmAGGcAT 420

TACGGaAAAA GAGTTwAGrA AAAATATTGA AGaGATGAGA GGAGGAGATC GTGTGACTTC 480

TCAAAACCAG GAAGAACAAT ATAAAGCACT AGAAAAATAC GGTGTTGACT TAGTACAACA 540

AGTAAAAGCA GGAAAACAAG ATCCAATCAT TGGTCGTGAC GAAGAAATTC GTGACGTTAT 600

TCGGATTTTA TCAAGAAAAA CTAAAAATAA TCCAGTCTTA ATTGGTGAAC CAGGTGTTGG 660

TAAAACAGCG ATTGTTGAAG GATTAGCACA ACGAATCGTT CGTAAAGATG TTCCCGAAAA 720

CTTAAAAGAT AAAACCATTT TTTCTTTAGA TATGGGCGCC TTAATTGCGG GAGCAAAATT 780

CCGTGGCGAA TTTGaAGAAC GGTTAAAAGC TGTTTTAAAA GAAGTGAAAA AAAGTGATGG 840

CAAAATCATT TTATTCATTG ATGAAATTCA TAATATCGTC GGAGCTGGTA AAACAGAAGG 900

CAGTATGGAC GCCGGAAATT TATTAAAACC GATGCTAGCA CGTGGTGAAT TGCATTTAAT 960

CGGTGCCACA ACGCTTGATG AATATCGCCA ATATATGGAA AAAGATAAAG CCTTAGAACG 1020

TCGTTTCCAA AAAGTATTAG TCAAAGAACC AACAGTAGAA GATACAATTT CTATTTTGCG 1080

TGGCTTAAAA GAACGATTTG AAATTCACCA CGGCGTGAAT ATTCATGACA ATGCTTTAGT 1140
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TGCTGCCGCT 


ACTTTATCCG 


ATCGCTATAT TACTGATCGT


TTCTTACCAG 


ATAAAGCGAT 


X A, VJ VJ 


TGACTTAGTC 


GATGAAGCAA 


GCGCAACGAT 


TCGAGTGGAA 


ATGAATTCGA 


TGCCAAGAGA 


i ?fin 


GCTCGATCAA 


GTAACACGTC 


GGTTAATGCm ACyAGaAAtT 


GAAGAAGCGG 


CCTTGAAAAA 


i *39n 

1 JtU 


AGAATCAGAT 


GATGCAAGTA 


AAAAACGGTT 


AGCCAACTTG 


CAAGAAGAAT 


TAGPAGATTT 


1 Jo U 


GCGTGAAGAA 


GCCAACTCAA 


TGAAAATGCA 


ATGGGAAACG 


GAAAAAGAAG 


A AGTTA ATPP 


1 H H U 


GGTTTCCAAT 


AAACGTGCAG 


AGATTGATAA 


AGCAAAACAC 


GAATTAGAAG 




i c A n 

1500 


TAATTATGAT 


TTAGAACGAG 


CTGCTGTTTT 


GCGTCATGGA 


ACAATTPPGP 




1 OoO 


TGAATTGAAA 


GAATTGGAAG 


AAAAGAACGC 


TAAAGATAAC 


GTCAAAATGG 


T AP A A P. A ZiTP 


1 OZU 


GGTTACTGAA 


AACGAAATTG 


CGCAAGTGGT 


CGGTCGTTTA 


ACTGGCATTr 


PTGTGAPA A 21 


i con 

1 DO u 


ATTAGTTGAA 


GGCGAACGAG 


AAAAATTAAT 


GAAACTGAAT 


GAAACATTAP 


APAA APGTGT 


1 / 4 U 


GATTGGTCAA 


GATGAAGCCG 


TCGATGCTGT 


TAGTGATGCG 


GTGATTGGTT 

vsxonx x vol X 


PAAGAGPAGG 
v.* rtrt Vj rYVj LrtVa Vj 


i q nn 


CTTACAAGAT 


CCAAATCGcC 


CACTCGGTTC 


GTTCCTTTTC 


CTAGGAPPAA 


PTP.P.TP.TTPP 
w J. ovj lull VJO 1 


l q £n 


TAAAACAGaA 


CTTGCTAAAG 


CTTTAGCTGA 


AGATTTGTTT 


GATTPTP.A AP. 


ATP AT Zi TPPT 


i nnn 

i y^u 


ACGGATTGAC 


ATGAGTGAAT 


ACATGGAAAA 


ACATGCCGTG 


1 1 x oo 




1 QOfl 


TCCaGGCTAT 


GkTGGTTwTG 


AAGAAGGTGG 


CCaGTTAACG 


G A AGPTG+- TP 




^U<1 u 


tTATACAATT 


GTCtTATTAG 


ACGAAATTGa 


AAAAGCGCAC 


PPAGATGTPT 


TT2\ B T &Tr*t- T 


9 1 nn 

£. 1UU 


ATTACAAGTT 


T TAG AT GAT G 


GTCGTTTGAC 


AGATTCTAAA 


x uuuo x \»>Vj 


1 1 I I 1 1 AH 


i ^n 


AAATACAGTT 


TTAATTATGA 


CAAGTAACAT 


CGGCTCTCaG 


w x x x x n i iuu 


AAGGTGTGAP 


9 ^ 9"n 


ACCTGAAGGA 


ACAATTCCAG 


AAGAAGTTGA 


AAATCAAGTC 


ATGAATATTP 

*i x wrin x t\ x x 


TAAA AGGAPA 


99on 


CTTTAAACCA 


GAGTTCTTAA 


ACAGAATTGA 


TGATACCATT 


TTATTTAPAP 

X XrtX 1 1 rtwrtw 


P ATTAAGPTT 


9 n 


GGaTAATGTG 


AAAGGAATCa 


TTGGTAAAAT 


GACGGCACAA 


PTTGPTPATP 


GTTT AGA^PA 
bill rvwirlo WrV 


9 ^i nn 


ACAAGAAATT 


GTGTTGGAAA 


TCACTGATGA 


AGCTAAAACA 


TGGATTGPTG 

X OVJrt 1 X \J\^ X o 


A A A ATPPPTB 




TGAGCCAGCT 


TATGGAGCTC 


GTCCGTTAAA 


ACGTTTCATT 


ACTCGPGAAG 


TPG A A APAPP 


9 ^9n 


ATTAGCAAAA 


GAAATTGTTT 


CTGGACGAGT 


AATGCCAAAA 


APAAAAGTPA 


PG ATT AGTPT 


9 ^fl n 


ATTAGACAAT 


CAATTAGTTT 


TTGAAAATGA 


ACCGATAGAA 


GAAGTGTAAA 


AAAGAAPGAG 


? fid f) 


CGAATTCGCT 


CGTTCTTTTT 


TTATTATGAP 
x xnx iriion^ 


AATPGTTGPT 


TAATTTPTTP 

x nn xx x V/ x X V_r 


PAAPAAATPP 




TCTTCTGAAG AAACAACCCG ACCATTCAAT TTAATCAAAC CAACTGTGTA AAGGTTTAAA 2760


TAATGGAATT 


GATTTTCAGC 


GATTTCGTTC 


AAAGCATCAA 


GCTTTTGGTG 

"Jw X X X X \JU X V) 


ATTATPAGPA 
nl inl wrtUwrt 


O VJ 


L,l~ 1 I b 1 1 bGC 


GGCTGTCTGT 


GTAAAGGGCA 


ACCACCGGGA 


TTCCTTTGGC 


ATAAGCAACA 


2880 


CCAATTTCTG AAGcTACGCC CGCATCAATG GTTGGTCCGT CTAATAAAGC AACCAATAAG 2940

TCGCTAGCAA GGACGTTTTC TGTATCTGCT AAGGCAATCA TTTTGCTATC GGCATAAGCA 3000

GATTTGTCGT TAATCGCCGC ATTTTCTTGT GGTAAATAAA GATCGATTGT TTTATCCAAT 3060

TGTCGAATTT GTTCGACTAA ATAGGCATTA TAACGTAAGT CAGCTTGCGA AAATAAAGGT 3120
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CCTGCAAAAT 


AAATTTTTGT 


CATTTGTCTG TCTCCAATCC


AAGGGTTTAG 


AAATAATCAC 


3180 


TGAAATAAGC TTCTTGGTCA GGAAAAATCC GTTCCAAAGA ATTTAAAGTT TCTTTATCTG 3240

TATCAATACG TTCAATTCGA TGTAAGTACG AAGGGCGTCG GTAATTCATG ACTAAACAAG 3300

TTAGTGTTTG AATATCTAAG TGAACGGCGG TGCCTAGAGG TTCATCCGTA ATGGTCACTT 3360

GATCATTTTC ATCCCAAATT AAGCCAAAAA TACCATTATT CCACTCAGCG ACAGGATCTT 3420

TAACGACAAA ATGGAAGGGC TTAGCTGTAC TTTCAAAAGG AAAGTTTTCT AAAAAGGCTT 3480

TGACATCAAC AATTCGCGCC ATATAATAGG GCTCAATGCT TTCTTTAATT TGGCTGTCTT 3540

CTAAAAGAAA GGCTAATGGT TCGTTTTTAT AGATATCGCC CTTGACCCAA TAAACCATCG 3600

AAAAATGCGC AGTAATAAAA TTCCaCAAGC CATTTCGGGC TTCTTGGTTC AAGTAAAACA 3660

TTTCCTTAAT ATGAAATACC TCATCGGCcA CCCAATAGAA AAGTACTCCT AAAGGTTCTT 3720

GATTGGCACC ATAATAGACA GCAGCAGTCC GTTCTTCCTC GTTTTCGAAA CGCCAATATT 3780

CTTCCCAGTT GAAGGCACTA CGAATGAGTG CGCCGTGATT TTGACGAGCG AATCGCGCAT 3840

AGACATCAAA GACATCTGGG TGATCGACAG CTAACCGTTC AATCATTCCT GGAACAGGAA 3900

CGGTTTTTGG TAATTGCGTA TCCCGAATTT TAAAAGATAA CTTGTCGGAC ATAATTTCCP 3960

AACCTTTACG TCGGTAATAA GGGATGTTGT AAGGGAAAAG ATAAGAAATC CACTGTTTAT 4020


CTTGGCGCAT TTCTTCTAAT GCCGTTTGAA TCAAGTCTTT CATTAAACCG TGGTTGGCAT 4080

ATTCAGGATA GGTACCGACA CCAGTAACGC CGCCCATTTT ATATAAAGCG CCATGGATAT 4140

TGACTTCGCA TGGATAGATA GCAATTTGTG AAATTAATTG ATTTTCGTGA AACCAGCCAA 4200

ATACTTTTGA AAGTTCTAAA ATCGGTTGTT TTGATTTGAT AAATGCGCGT TTGTTTTGAA 4260

AACCACTTTC TTCAATATCT GCTTCAGTCA CTTGAAAAAC ATAGGAAAGA AGTTCATTAA 4320

ATTGATCAAT GTGTTCyTCC yCaACAGGCT TTAAnGGTCa GTTGCTTtcC rAAATtCyTG 4380

gTTCcATCCc ATTTTTcCAT CTCCAATAAA TTTAcTGTTT TTAgTATACC AAATTTTTCG 4440

AAACTCTGTT AGCCTTTCGA AACCAAAAGA ATAAAAGGGA AGTGCTTTAA ATCTTCTTPA 4500

AAAATTGGTA TAATGAAAAG TGCTGATTAT CCTAGACCTT TAAACATGAT AATTTGATAA 4560

AAAAGGCAAC TAAAATATAA CAATTGATTA ATGAAGCTAG CAGGGAGCTA GCTTTTCTTA 4620

AAAGAAATGA GGCAAAAAAT GAACAATAAA GAAATGAAAG CAAGACAAGA GAAAATTCGT 4680

AATTTCTCGA TCATTGCCCA CATTGACCAT GGGAAGTCAA CTTTAGCCGA CCGGATTTTG 4740

GAAAAAACAA ATACAGTTAG CAGTCGAGAA ATGCAAGATC AATTACTTGA TTCAATGGAT 4800


T TAG AG AG AG 


AACGCGGCAT 


TACTATCAAA 


TTAAACGCAA 


1 KjAAI 1 AAA 


CTATACAGCC 


4860 


AAAGATGGTG AAATCTATAC TTTCCATTTG ATTGACACAC CAGGGCACGT CGATTTCACC 4920

TACGAaGTTT CTCGTAGCTT GGCAGCTTGT GAAGGGGCTG TTCTAGTTGT TGaTGCGGCG 4980

CAAGGaATTG AAGCGCAAAC GCTAGCaAAT GTCTATTTGG CATTGGATAA TGACTTAGAA 5040

ATTTTACCTG TTATTAATAA AATTGATTTA CCCGCCGCTG ATCCAGAGCG TGTTCGGACA 5100




GAGATTGAAG ACGTAATTGG AATTGATGCA TCGGAAGCTG tTTTAGCAAG TGCAAAAGCA 5160

GGGATTGGGA TTGAAGATAT TTTAGAACAA GTGGTGGAGT ATGTACCAGC TCCATCAGGC 5220 

GATATTGAGG CTCCTTTAAA GGCTTTGATT TTTGACTCTA TTTACGATAG TTATCGGGGG 5280 

GTCGTTTTAA ACATCCGTGT AATTGACGGT GTCGTTCGTC CTGGGGATAA AATCCAAATG 5340

ATGAGTAACG GTAAAACGTT TGATGTAACA GAAGTCGGCG TTTTTTCACC GAAACCGATT 5400

GCTCGTGATT ATTTAATGGT TGGTGATGTG GGCTATATCA CCGCTAGCAT TAAAACGGTT 5460

CAAGATACAC GGGTCGGGGA TACAGTGACT TTGGCCGACA ATCCAGCAGC AGAAGCACTA 5520 

CCAGGCTACC GCAAAATGAA TCCAATGGTT TATTGTGGTT TATATCCAAT TGATACGTCG 5580 

CGCTACAACG ATTTACGGGA AGCATTAGAA AAATTACAAT TAAATGATGC GGCGTTACAA 5640 

TTTGAACCGG AAACATCGCA AGCTTTAGGG TTTGGTTTCC GTTGTGGTTT CTTAGGTTTG 5700 

CTGCACATGG ATGTTGTTCA GGAACGTTTG GAACGAGAAT TTAATTTAGA GTTAATTACA 5760 

ACAGCACCGT CTGTAATCTA TCACGTTAAT AAAACTGACG GAACAACCGT TGTTGTTGAC 5820 

AACCCAGCTG AATTTCCAGA ACCAGTAACG ATTGAATCTG TGGAAGAACC TTATGTTAAA 5880 

GCGCAAATCA TGGTGCCAAA CGATTATGTA GGAGCAGTAA TGGAATTATC ACAACGTAAA 5940 

CGTGGCGAAT TCATTACAAT GGATTACTTA GACGATTATC GTGTAAACGT AGTTTATGAA 6000 

ATTCCGTTAT CTGAAATCGT GTTTGACTTT TTCGATAAAT TGAAATCAAG TACAAAAGGC 6060 

TATGCATCCT TAGATTACGA AATGGCTGGC TATCGTACCA GCCGCCTAGT GAAAATGGAT 6120 

ATTCTATTAA ATGCTGAAAA AGTGGATGCT TTAAGCTTTA TTGTTCACCG AGATTTCGCA 6180 

TTTGAGCGTG GTAAAGCGAT TGTTGAGAAA CTGAAAAAAC TAATTCCACG TCAACAGTTT 6240

GAAGTCCCAG TTCAAGCGGC GATTGGTCAA AAAATTGTGG CTCGTTCAGA TATTAAAGCC 6300 

TTACGCAAAA ACGTACTGGC TAAATGCTAT GGTGGCGATG TTTCTCGTAA ACGTAAATTG 6360 

TTAGAGAAAC AAAAAGAAGG GAAGAAACGG ATGAAAcAAA TTGGATCCGT GGAAGTTCCT 6420 

CAAGAAGCCT TTATGGCGGT TCTGAAAATG GACGACCAAG ATAACGCGAA ATAGCTTGTT 6480

TCCAACGGAA A 6491 
(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1014 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CCTACGAAAA ATATGGCACA AAGAAAGTTG GAAAAATGAC AAACTTGTCG ATTTTCCAAC 60 

TTTCTTCATC TATTCGTGGT TGGCTTGCTG AACTGCATCC ATCACGGCAT AACGGAAGCC 120 

ATTTTTTTCT AGAGCTACGA CACCGCGAAT CGTACTTCCT CCTGGAGAAG TCACTTCATC 180 

TTTCAATGCT CCTGGGTGTT TTTTCGTGTC TAAGGCTAAG GTCGCCGTTC CTTTTATCAT 240
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TTGAGCGACA 


ATTTCATAAG 


ATAAGTCTCT 


TGGTAACCCT 


TCTAAGACTG 


CTGCATCTCC 


300 


CAATGCTTCC ATAAAGATAT CAACAAAAGC TGGACTGCAA CCAGCCACAG TACCAAAAGT 360

TTCCAGTTGC GCTTCACGAA TTTCTTTGAC TAACCCTAAA GTAGCAAGAA GTTCCATTGC 420

CAGGTCCTTC GTTTGGCCTT CAATTGTTGG TGCCAAGGCT ACGCCAATCA CGCCTTGATT 480

CACGCTAACT GGTGTATTAG GAATGGCATG AACAACTTTG GCTTCTTTAT TGCCAAGAGC 540

CATTTGTGCT TCTTGTATGG TATGACCTGT TGCAACAGAA ATCAATAGAC TTTCGTTGGA 600

AAAATGAGGG gctaattl-t GAAGAACAGG TAACACGACC TTTGCTCCTG TCGCAATAAA 660

AATAACCTTA CATTCTTGGA AAAGTGCATA ATCTGTGACA AGTTGAAAGC CCCACTCTTT 720

TTGAAGTGCT TCTGCGGTTC CACTTGATCC GCCTTTCACA TATAAATGTT CCTTCTTGAC 780

TGCCTCAGCT TTCAGTAACC CTTTAATCAT GGCACCGCCC ATACTGCCAG CACCGATAAA 840

GCCAATTTTC ATAGCATTCC CTCCTTAGTT AAATAArGTG TTtGTGkCAA AAGTCAATTA 900

GCTTTtGACA CnAACACTAA CAGTCGkACt TTTTcCGtAC aGCtGkTTCG TTAaTTaATT 960

gGTAAtcCAT wACGaCkGCT CaAAcGCgCA ATTTTTCaGA AGtCyGaACC AACT 1014


(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 






TTATAAAAGA CCCTTCCTTA AAAAGCAATG TTTCTTCCAA TCAAAAAAAC GAATTATCTA 60 

AAAAATGCAA CTTATTTGCT CAAAAAAGCA AGCGATGATT GACAAACGGT CGTTTTTAAC 120 

TGATAATGAG AATCGTTATC AACTAAAAAT AAAGGAGTTA CACAATGAAA CTTTTAAAAA 180 

AGACGGTCCT AATTGGTACA ACCCTTCTTC TTGGTTCATT CTTACTCGCA GCTTGTGGTA 240 

ATACGAATAA AGAAGCCAAC AACGCTGACA AAACACATGA AGTAACAGAT ACCTTAGGCA 300 

ATAAAGTAAC CGTCCCCGCG AAACCCAAAC GGATTATTGC GAGTTATTTA GAAGATTATC 360 

TAGTTGCATT AGGAGAAAAA CCAGTGGCAC AATGGACAGT TGGACAAGGC AGCATTCAAG 420 

ATTATTTAGC GAAAGAATTG AAAGATGTCC CCACTATTTC CTATGACTTG CCATATGAAG 480 

CGGTTCTAAA ATTTGAACCT GACTTATTAT TAATCAGTTC ATCTGCTCTA GTTGAAGGCG 540 

GTAAATACAA AGAATACAGT AAAATTGCGC CAACTTATGT AGTCAAAAAC GGCGAAAATG 600 

TCACCTGGCG TGATCAATTG GaAGATATTG CCACTGTTTT AGATAAAAAA GAACAAGCGA 660 

AAAAAGTGTT AGAAGATTAT GATACCTTAA CCAAAGGCGT CCAAGAATAT CTTGGCAAAA 720 

AAGATGCTGG CAAATCTGCG GCAGTCTTAT GGGTAACCAA CAACCAAGTC TTTATGGTTA 780 

GCGATAATCG CTCAAGCGGA ACCGTGCTCT ATCAGGACTT AGGCCTCCAA GTTCCAAAAT 840 
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TAGTGGAAGA 


AATTTCTAAA AACGCTACTp 


CGGATTGGAA, TCAAGTTTCT 


TTAGAAAAAT 




TAGCTGAGCT 


TGACGCAGAC 


CACATTTTCC 


TTGTAAACAG 


CGATGAAXCA 


GCACCTCTTT 




TCCAAGAAGC 


AATTTGGAAG 


AACTTACCTG 


CTGTGAAAAA 


TAACCAAGTT 


PATAPPTATft 


iUZU 


ATAAAAAAAG 


TAGTTGGTTA 


TACAACGGAC 


CTATTGCGAA 


TACTCAAATT 


a t t (z a a n a t p 


1080 


TAAAAAAaGC 


GCTCTTAAAT 


TAAAAAAATG 


CtTGGATGAA AATAATCATC 


L.rlriwL.Al III 


1140 


TTTTAATTAG 


CAATTGCCGT 


TTCTAAACCA 


ACTTCAATCA 


TTTCATTAAA 


^.ul^ui I iub 


1200 


CGTTCTTCTG 


CCGTTGTTTC 


TTCGCCTGTC 


ACCAAGCTAT 


CACTCACCGT 


p a t a a Tprrr 


1260 


AATGCTTCTA 


CATCAAATTT 


TGCTGCTAAA 


TAATATAACA 


TTGCCGCTTC 


111 AA 1 C 


1320 


GCTAATACCC 


CGTATCGACC 


TAAACGAAAG 


ACATCATCTA 


AACTATCTTT 




lJoU 


TCATCTGACA 


ATACGTTACC 


AACATGGGTA 


GTAAAGCCTT 


TTGCTTTAGC 


A A T T T PP T A A 


i a a r\ 
14 4 U 


GATTTGAGTA 


AAAGATCAAA 


GCTCGCAATT 


TGTGGAAAAT 


CATATTTAGG 


A A A ATPATTP 


lDUU 


CGAATCGCGG 


AAGAAGGCGT 


CGCTGCTGCT 


TGTGCGATAA 


CTAATTCACG 


AAUtj I 1 uAL i 


i c el r\ 
Id bU 


TTTTCTGAAA 


TAGAACCACA 


CGTACCAACA 


CGAATTAACT 


TCTTCACATC 


O 1 AAlj Ao i 1 A 


i c o r\ 
1 bZQ 


ATTAACTCAT 


GGGCATAAAT 


ACCTGCTGAA 


GGCATCCCCA 


TCCCTGTTCC 




lOoO 


ACACGCTGTC 


CCTTATACGT 


TCCTGTGTAA 


CCTAACATCC 


CACGAACTTG 


A 1 I A 1 AtjCACa 


1/4 0 


ACAGGATTCT 


CTAAAAAAGT 


TTCCGCAATA 


TATTTTGCTC 


GGAGGGGATC 


UCL-itjbi AGT 


1 O A /I 

1800 


AAAATTTTAT 


CGGCAATTTC 


ACCTGGTTTT 


GCTTCGATAT 


GCACACTCAT 


1 C I CjU ittac 


i 

1 O C f\ 

1860 


TCCTTTTTTA 


TAAAGCTGCT 


AGTGTTGCTT 


TCACTAACTC 


TTTGAATTGT 


ill AACbC 




GTTCAGTTGT 


TTCAACTACT 


TCTGCATGAT 


TTAAGCTACT 


TTGCATACCA 


fZPTCPTA AAT 
1 uL 1 AAA i 


i y o u 


TCGTAATACA 


AGAGATGCCT 


AGTACTTTCA 


TGCCACTATG 


CACAGCTACA 






AAACAGTTGA 


CATGCCAACT 


GCATCCGCAC 


CCATCGTCCG 


TGACATTCGA 


ATTTPTPP^AP 


O 1 PtPt 


GCGTTTCATA 


TGTCGGGCCA 


GAAAAGCCCA 


TATACACGCC 


TTCTTTTAAA 


TPAATATTTT 


<£. 1 OU 


GTTCCGCAGC 


CACTTTTTTT 


GCCACTTCCC 


GATATTCTTG 


CGTGTAAGCA 


Tf^ A PTP A T A T 


ooon 


CTGGAAAACG 


CGGCCCGATT 


TCTTCATCAT 


TTTPAPPA AT 


1 AAL-VJoAi 1A 


TPf^PPTPTA A 




AATTAATGTG 


GTCATTGATT 


AACATCAAAT 


TTCCCGGTGT 


ATATGTTTCA 




o *3 a n 


CAGCTGCGTT 


GGTTACAATC 


ATTGAATGAA 


TGCCTAAAGC 


AGCCATCACA 


PPSA APH(^(7AT 


^ fi uu 


AAGTGACCGT 


TTGCATCGAA 


TGTCCTTCAT 


AATAATGAAA 


ACGCCCTTGC 


ATPHPPAAAA 


? 4 fin 


CTTTTTTTCC 


AGAAAGCGTT 


CCGTAAACTA 


ATTGTCCAGC 


ATGTCCAACG 


APTP,TTC:aa A 


0 ^9Pl 


PZ\^tZ\Z\fi.B7if , P ,, 
^rtuAAAAA 1 o 


GGGAATTTCA 


GAGAAAGGAA 


TAGCAATCGC 


ATCGGTTATT 


TCATTAGCTA 


2580 


ATTCACCTAA CCCAGAACCT AAGATTAGAC CGAAGTCTGC TTGTTGCACC CCTTTTTCTT 2640 

TCAAGAaCTC AGTTGTTTCG TTTAAATGAT TCGTTAACGT TGTCATGTGT TGCCTCCTCG 2700 

AATTATTGTA ATTGATTTAA AAAGCTTTGA CCGTTTTCCG TTGCAGGAAC TTCAAAGTTT 2760 

TCAGCAATCG TTGCTGAAAT ATCTGAATAA AATCCTTGTG GCAAGCTGCC TTGTTTTGTC 2820 

ATTTTTTTAC TATAAGCCAA TAAAGGCACA TATTCACGTG TGTGGTCTGT TCCTGGGAAA 2880 

GTTGGATCAT TCCCGTGATC TGCTGTAATT AAAAGCAAAT CATCTTCGGC CATAGCTGCC 2940 

ATAATCTCTG GCAAACGACC ATCAAAATCT TCGATGGCTT TTGCATAGCC CTTTACATCA 3000 


en a p GGT G a p 


PATACAGPGP 


ATCGAAATCC 


ACTAAGTTAG 


TAAAACTTAA 


ACCTGTGAAC 


3060 


TPTTTTTTPIi 


TGAPPGTTAA 
X\3n\*\^\3X Xnn 


TAATTGATCC 


ACCCCATCCA 


TATTGCTCTT 


CGTGCGGATA 


3120 


kjL. 1 1 rlv3 1 rtrt 


TTPPTTGTPP 


ATTAAAGATG 


TCGTTAATTT 


TCCCAACAGC 


GATCACATCA 


3180 


1 1 utLb 1111 


PTTTPaaPGA 


ATPTAAAACA 


GTATGGCCAA 


AAGGATCTAG 


AGCATAGTCA 


3240 


1 urtCou 1 1 nK* 


TGGTTPGaGT 
1 oo 1 1 \^\jr\\j x 


GAAATTAPPT 


GGTTCACCAA 


CATACGGGCG 


AGCGATAATC 


3300 


L-oAL»v_»H/il 


TaTaaGGTTP 


GTPTTTPGTA 


ATGTCACGAA 


CATATTGACA 


AATTTTATAC 


3360 


rlHl 1LJ. 1 \- Iri 


aTGGAATaaT 
n 1 uunn 1 fin 1 


GTPTTPATGP 

1 V* X X url X v3V_r 


GCCGCAATTT 


GTAAAACAGG 


GTCTGCTGAA 


3420 




tt a paTPPPP 


aGTTTTPATT 


TGATGTTCGC 


CATAATCATC 


AATGACTGCT 


3480 


(j 1 UL,^vjiU IrVl 




apa aapaaPT 

n^nnn\^nn\*t 1 


TTTPGCCPAG 


AAAAATCTTC 


AATTTGTTTC 


3540 


7\ z^ 1 T 1 A 7\ T* T* f** T* T 1 
ftb 1 ni\ 1 1U1 1 


pappa a appp 


GTTTGGGAAT 


ACCCGAAAAG 


GTTTTTTGAT 

\J X X X X X X wfx X 


GTTCAAGCCC 


3600 


aTa QT?Trrr 
1KM.1 1 1 i^^u 


aPTP aPPP.PT 


PATAGTATPT 
\^n 1 n\D iniv^i 


TTTCCAACAG 


AAATTTCTTC 


TAATTTGGTA 


3660 


1 ftu^Lft 1 


PPTPPTPTHP 


PaPTGPTTTG 
k^/v— 1 1 l 1 vj 


APACCTGTTA 

n^/AWv x o x x rt 


AAGGAGCAAT 


AGTTCCCAAT 


3720 


L\-AA(j 1 1 1 1 i 


PTaaaTPapp 

C 1 AAA 1 UAvjIj 


a aTP.PTTaGT 

nn I Lju X x nu 1 


PPPGPTTPTT 


TGGCAATATG 

x o u^nri x *i x w 


ACCCaGTGTA 

Xk\<* \* W \ J X \J X A* 


3780 


1 LHU 1 UL-L-uA 


paTPTPPaaa 


TTTTTPAGPA 
1111 l^no^n 


TPTGGTGCTT 

X W X X \JVy X X 


CACCAATACC 


GACTGAATCt 


3840 


a p t a pp a tt a 


aaTPaapapp 


TTTAAAPAPA 
111 nnn\*,n\~, n 


PTTGPTTCCT 

Vu> X X X X V*Vr X 


CCTCTAAGGT 


CGTAAATACG 


3900 


TpR Aft aaTP. A 


k-rtril irtl 111 


APGPTAGPCT 


TTTAAATAGT 

XXX rMiii x xiv x. 


CTCGAGAATC 


AAACAGAATA 


3960 




TPTPAP.PTPT 


TPAATTATAP 
1 y^nn x xnx rtv. 


CATGAATCCG 


CTATTATTGC 


CTGAAAGAGC 


4020 


j\T"Pfi B & &TTT 


TPAPTP,aTTP 
1 ^r\v*- X \?n 1 1^ 


PTTTGATTTT 

1 1 x Un 11X1 


AGCAGGGCTT 


AAAAATTGAA 


AGAAAAAAAG 


4080 


TPTPPPaPPG 


TAATTAGGAA 
1 nn 1 1 nounn 


APTTTTTTAT 
i x xx x x n x 


TGTTTTTTTC 

X w X X X X X X X \^ 


GAATAGCCTG 


CTCCTTAAAA 


4140 


aGaapAPGap 

nn\* n \* o n \+ 


TPTTTa Tppa 


ATPGAAACTT 

n x vuruvi^ x x 


TTAACACTCC 


AAGCGGTCTC 


CCTCTTTTGG 


4200 


papaTTTAAa 

\^r\\^t\ 111 nnn 


TaPPAGATAA 
1 n\* \^n\jn 1 nr\ 


PPGPTTPTGA 


TAAAATCACA 


CCAATGCCAA 


TGGGCGGATA 


4260 


aaaaTAAATa 

nnnn 1 nnn 1 rt 


PPGaTATATP 
\*\^\3nXn X nx \* 


PGATTGCGAA 


AAGACAAAGA 


TGAATGATTT 


TTGTTGCCTT 


4320 


APTATAAGAT 


TTPGATTTTP 

x x x x a x vj. 


TTAACTTAGC 


TGCACCTTGT 


GGATCTTTCA 


CTACATTGTC 


4380 


TTPATAAATG 
i i vn i nnn l o 


aTPPTTGTT A 


GCAAGAAGTA 


GGCAAAATCC 


CaAACCGTAT 


ACACAACAAG 


4440 


ATAAAAATAA APAGGTGPPP GPATCCACCA AAATTCACTC ACCCAAGCAG TTGCTACCGG 4500 

AATGAAGGCC ATGGAGAATA ACCAAAAATT ATTTGCCCAA AACGCTTTTT TAGAAAACCA 4560 

ATGAGTTACG GACATTATGT AATGATGATT GTACCATGCG ATACAGATAA AAATAAAACT 4620 

GACAATAAAC GATAAAAAAT ATGGGATATT TTCTATCAAA CCTGCCAAAA CACCCACTTG 4680 

AGGAGTTTTA ATTTCTAAAA TCATGATGGT CATGATGATG GCAATGACTG CATCACTAAA 4740 

TGCTTCTACG CGTGTTTTAT TCATTGAACG TCCTCCACTT TTCCTATGCT TACTACAAAC 4800 






AAAAGTATAG 


CAGAAAAACC 


AACGATTTTT 


GAAAAAAAGC , TGCTAGACCT 


Ol 1 1 OO 1111 




TTTCAAAATT 


TCAACACTTC 


CTTCQAATTG 


TAAAGAACAG 


CTAAAAGAAA 


AmA'P^AAAAflT 


a q o n 
*i y z u 


tACCTTTATC 


AACTCGCTTT 


TTTACAGTGA 


GGCAGTGCCT 


AATGCCGCTT 


f^T^TTPA APT 


d q q n 


GATTATCCTA 


AGAAAAGAAA 


TGGCCCCCCG 


TCGCTTTGAC 


GGAGGGCCTA 


1 HL1 1 1 l^L 


OU4 U 


TATTTCGTTT 


TGACGTAATT 


TTTCCCGTTC 


GCTTTTGGTC 


CCGATGATTT 


Tpppaapa ap 

i bUbAMbAAb 




ACAACTAGGA 


CTAAAATGGT 


TAATACGTAT 


GGTGCTGCTT 


GTAGATAAAC 


111 lUOnnl 1 


CI rrt 

0 1 bU 


TCTGAAATAA 


TTGGTAGGTT 


TTCCCCAGCA 


ATACTTAAGT 


TTTGTGCAAA 


P P P h. 7A A (2 a B T 




AAGGCTGCGC 


CCATTGCCCC 


AAGTGGATTC 


CATTTACCAA 


AGATCATTGC 


PPPPBTTPB a. 


coon 


ATAAACCCTT 


GTCCTGCAAT 


TGTGGTAATC 


CCAAAACGAC 


CCGCAATACT 


I 1 bAbbAAAb 


0 .34 0 


ACAGcTCCGC 


CCATACCGCC 


TAAGAAGCCG 


GATAGTAAGA 


CACCAGAATA 


AbbbAl lAAb 


c a r\r\ 
DH U(J 


TAGACATTAA 


TTCCCAATGT 


ATCTGCCGCT 


TGTGGATGTT 


CCCCCACAGA 


ALvj 1 AAbbb 1 


3 4 bU 


AAGCCAAAAC 


GTGTTTTATA 


AATGACAAAC 


CAAGCTAAAA 


TCGCAATCAA 


MI 1 bb i Abb 


C C O A 


AAAGCTGGCA 


AAGAGGTTTT 


TGCAAAGAAT 


AGTTCACCAA 


TCACTGGAAT 


J. J. 1 IbAbAAI 


5580 


CCTGGGAAAG 


AAAAATAACC 


AAATGATTGT 


GTAATCATAT 


CTGTTTGTCC 


fo trt n\ fw 7\ m » *\ 

1 I I I IbATAA 


5640 


ATAACTTTTA 


CTAGGAAAAC 


CCCTAGCGCC 


GGTGCCATCA 


AGTTAATGAC 


A ^"* r P A /""/""A f^Hlf 

Ab I AbbAbTb 


5700 


ATAATATGAT 


CTGCTCGTAA 


GTTAATAGTT 


GCAACGGCAT 


GTAAGAGAGA 


AAAb AbAA 1 b 


5760 


CCGACAACGC 


CTCCTACTAA 


ACAAGCAATC 


CATGGTGTCG 


CCGCGCCAAA 




i 

c.q on 
Oo<£U 


AACGTCAAGT 


TAAAAACGAC 


CGAACTGAAA 


GCACCCATTA 


CCATAATACC 


1 1 ^trwiLtn 


coon 
Do o U 


ACGTTAACCA 


CACCACTTCG 


TTCTGAGAAC 


GTTCCGCCTA 


ATGCTGTCAA 


p a t p a cere a 




GTTGCATACA 


CTAATGTTTG 


CGTGATGATT 


GAGGAAATTG 


TTGCTAAATC 


L/M 1 1 HbA 1 b 1 


bUUU 


CTCCCCCCTC 


TTGATTGTTT 


TTGTCTGGCG 


CCGCTTCAAT 


TGTTTCTACA 






CTTTTTTACT 


ACCAGACGCT 


TTTGCTAACA 


AGAAACGAAT 


TAAATAACTA 




bl <iU 


AGAAAATAAT 


CGCTGCAATA 


ACAACATCAA 


CTAATTCGGT 


TGGTACACCA 


ot 1 A i bAAbb 


bloU 


GCATCCCTTG 


TCCACCAAGT 


TTTAACATCC 


CGAACAGTAA 


TGCGGAAAGT 


a a zi a ^rrr^r^ a 
AAAA 1 bbbA 


b^ 4 U 


TTGAGCTACC 


TGCTCCTAAT 


AGAGACACGG 


CCATACCATC 


GAACCCAATG 


ptpa appbtp 




TGTTTTGAAT 


AAAGAAGTTT 


TGGAAAGTTC 


CCAATCCTTG 


GACCGCACCA 


ppta apppap 

k-\- 1 rt/VjbbAb 


b JbU 


CTAGCGTTCC 


TGAAATCATC 


ATGGATAAAA 


TAATCGTTCG 


CTTGCTGCTC 


attppappat 


fid 0(\ 


ATTCAGAAGC 


AAATGGGTTT 


AACCCCACCG 


ACCGAATTTC 


AAAACCAAGC 


ui 1U1 Hill 


fid P.0 


TPATPaapaa 


pp a r* a r a a f n 
l~ fib Ab AAb A 


7\ T 1 7\ 71 /"* A 7\ A 7\ 

A I AAbb AAAA 


TGAAGGCCAA 


GAAAATGCCG 


ATGTTCAAAC 


6540 


GAGAGCCATC 


ACTGATACTT 


GTTAAAAACG 


GTGAGCGTAA 


TGATGCATTC 


GCACCAATCA 


6600 


TTTTGGTGAT- 


CCCTTTGTTC 


GCCATAATAC 


TTGGTTTCAT 


GACATTATTA 


gcaatatgat 


6660 


TTCCTGTGTA 


TAATAAAATA 


TAATTCAACA 


TAATCGTAAC 


AATAACTTCA 


CTCGTCCCAA 


6720 


AATAGGCACG 


CAAGAAACCT 


GGAATGGCTG 


CTGCTAAAGC 


TCCTGCAACT 


GCCCCAACAA 


6780 
TAATCGCCAC CGGCAACGCA ATCATCCGGG GTGCATCAGG QATGGATAAA GCGACCCAAA 6840 

TACTAGCCAC CCAACCACAT AAGGCTTGTC CAGAAAGCCC GATATTAAAG AATCCAGCCG 6900 

AATTAGCCAC CGAGAATCCT AAAGCGGTAA AAATCAACGG TGCTGCGGTG ACAAAAATCT 6960 

CACCAATACT TTTCTTTCCT TGGAAAGCTG TTTCAAACAT CGCTTGATAC CCAGCCACTG 7020 

GGTCATAGCC AAAGATAACC ATAATAATTG CCCCTAAGAG GAAACCCATA ATAACAGATA 7080 

ACACAGGGAC TAAAATATTT CTTATTTTTT CAGAACGATT ATTCAACAGT TTCACCTGCC 7140 

TCTACTTTTG ATAACTCAGC ACGGGCTTCT TCCAATGAGT AGCCAGCCAT TAACAAGCCC 7200 

AATTCATTTT CTGTCGTTTC TTTCGGATCA ACAATTCCGA CAATTTCCCC CGCATGAATA 7260 

ACAGCGATAC GATCAGAAAC GTTTAAAATT TCATCTAATT CAAAACTTAC TAATAAAACA 7320 

GCTTTATTTT TATCTCTTTG TTCAATTAGT CGACGATGAA TGTATTCAAT AGCTCCGACA 7380 

TCTAAACCGC GTGTTGGTTG AGAAACAATC AACAAATCTG GATCGCGGTC CACTTCTCGG 7440 

GCAATAATCG CTTTTTGCTG ATTCCCACCT GAAAGCGCCC GAGCTGGTAC TAATTCATTA 7500 

GGAGTCCGTA CATCATATTC TTCAATTAAT TCACGGGCAT GTTTGTTAAT CATTCGATAG 7560 

TTTAAAATAC CTGTTGAACT GTATGGTTTT TGGTAATACG TTTGTAGGCC AATATTTTCA 7620 

GCCAATGTCA TCTCTAATAC CAAGCCATAT TTATGTCGGT CTTCAGGAAC GTGGGCGACA 7680 

CCCATTTCCG TAATTTCCCG AGGTTTGCGA TTAGTAATCT CTTTTTCATT CAAATGAATG 7740 

GACCCGTTTT CGACTTTGGT TAATCCTGCT AAGGCTTGGA TTAATTCTGT TTGTCCATTG 7800 

CCATCAATCC CCGCAATTCC AACAACTTCC CCTGCTCGGA CATCTAAGCT AAGATTTTTC 7860 

ACTGCATCTA AGCCGCGGCT TTCTTTCACT ACTAAATCTT TAATTGATAA AATTGTTTCT 7920 

TTCGGTTGCG CTGGTTTTTT CTCTGTTTTG AAGGAAACAG AACGACCCAC CATCATATCG 7980 

GCTAACTGTT GCGAAGAAAC GTCTTTCACA TTGACGGTGC CAATACTTTG CCCCCGGCGA 8040 

ATAACCGTAC AACGGTCCGC CACTTTTTTA ATTTCATCTA ACTTATGTGT AATTAAAATG 8100 

ATCGATTTGC CTTCATCGGT TAGTTCTTTC AAAATAATCA TTAATTCATC AATTTCTTGT 8160 

GGCGTTAATA CTGCGGTTGG TTCGTCAAAA ATTAAAACGT TCGCACCACG ATATAAGGTC 8220 

TTTAAAATTT CAACCCGTTG TTGCATACCC ACAGAAATAT CGCGAATATA GGCATTGGGA 8280 

TTTACATCCA ACCCGTATTG CTTAGACACT TCTTCAATCT CTTTTTTCGC TTTTTTCCGG 8340 

TCTAAAATAC CGCCATGGGT AGGTTCACTA CCTAAGATAA TATTTTCAGT CACAGTAAAG 8400 

GCTTCAACTA ACATAAAATG CTGATGGACC ATTCCGATTC CTAAACGATT GGCTGCAGTA 8460 

GGACTTGTAA TGTTGACTTT TTCGCCATTC ATATAAATCT CACCTGAGGT CGGTTCCAAT 8520 

AGCCCTGATA AGATATTCAT CAAGGTCGAC TTGCCTGcAC CATTTTCACC TAGCAGTGCG 8580 

TGGaTTTCAC CAGGTTTTAC TTGTAAATCA ATTTGGTCGT TGGCTTTGAA CGTCCCAAAT 8640 

TCTTTGGTGA TTTTACGCAT CTCAATTACG TTTTCTTCTT GAGACAAGGT CTATTCACTT 8700 

CCTAACAAAA TTATATTGGC AATACTTTAG AGGGTAATTT TGACCCTTAA CTAAGTAGTT 8760 
GAAGAACTTC AACTATTTAG TTAAAAATCC AAACATCATA AAGGGTTCCC TTGGCAGCCG 8820 

TCGCTGCCAA GgACCACTT.T A1TT.TTAGAA GGAACAATAA TGAGCCCTTC TTTTTTCATT 8880 

TAGGAGCTTA AATTATTCTG GTTTTTCTGG GACTTTTACG TCACCTGAGA TTACTTTATC 8940 

TTTTGCTGTT TTAACAGCTT CTTTTGTTTT GTCGTTTAAA TAGCCGTCTG TTAAGTCAAC 9000 

GCCACCATCT TTTAATCCAT AAACTAAATG TTCGCCACCA GGGAATTTGT CTTCTAACGC 9060 

ACGGTTGGCA ATATCTTGAA CCGCTGTGCC GACACCTTTA AGCGTTGAAG TTAACGTGAA 9120 

GTTGTCTTCT TTGCCGTCTT TTGTTTTGTA CTTGCCATCA GCATCTTGAT CGCGGTCAAC 9180 

GCCGATTACC CAAACTTTGT CGCCAGAACC TGATTCATTC AAGTCTTTTG CTTCTTGGAA 9240 

GACCCCTTGT CCAGTCGCAC CAGAAGCATG GAAGATGATA TCAACGCCGT TTTGGTACAT 9300 

TGCAGCAGCT AAAGCTTTCC CTTTGGCAGG ATCAGCAAAT GAAGCCGCAT ATTTCGTATC 9360 

AACAGTAATT TCTTTACCTA ATTCTTTCGC AGCATCAGCC ACACCTTTTT CAAAACCAGC 9420 

TTGGAAACGG TCAATTACGA CCCCTTCTTC ACCACCAACA AAACCGACTT TGTTCGTTTT 9480 

TGTTTCATTT GCAGCAGCAA CACCGGCTAA GTAAGCTGCT TCATTATCTC TAAATGTTGC 9540 

AGAAACGACA TTCTTTTTGC CATCGATTTG ATCATCGATT AAAACAAAGT TTGTATCAGG 9600 

GTTGGCATCT GCTGCAGAAG AAATTGCATC TTTTAGCAAG TAGCCAATAC CAAAGATTGT 9660 

GTTGAATTTA CTTGATACCG CTTGGTCAAT ATTGGTTGTA TAGTCAGCTG CATCATTCGA 9720 

TTGAATATAT GCATACCCTT TTGAACCTTC TGGTAAATCA TGTTCTTTAC CCCAAGCTTG 9780 

CAATCCTTCC CAAGAAGATT GGTTGAACGA CTTGTCATCC ACGCCGCCTG TATCTGTAAT 9840 

GATTACAGCA CTATGCGCTG CATCCCCTTT GCCACCGCCG CTTTCAGCGG TTTTGCCTTT 9900 

GCCACCACCA CATGCTGCAA GTGAAACTGA TAAACCTAAT GCAATCAAAC TAAAACCGAA 9960 

TAATTTTGCT TTTTTCATTT CTGTAAAGCC TCCTAAAAAT AGTTTAATGT ATTTAAACAA 10020 

CTAAGTTGTT TCTAAGGTTG GTCTGGGACT TTTACGTCAC CAGAGATGAT TTGTTCTTTG 10080 

GCTTCTTTAA CTGCCGTTTT GGCTTGATCG CTTAGGGCTT CTGTTGTTAA ATCAACACCG 10140 

CCATCTTTCA ATCCATAAGT TAAATGTTCG CCGCCAGGGA ATTTATCTTC TAATGCAAGA 10200 

TCAGAGATTT TTTTCACAGC AATATTTACA CCTTTAATGG TTGAAGCTAA CGTTAAGTTA 10260 

TCATCCTTGC CATCTTTGGT TGTATATTTA CCGTCTTCGT TTTGGTCACG GTCAACACCA 10320 

ATCACCCAAA CTTTATCTTT TGAACCTGTT TCATTTAGCG CTTTGGCTTC TTGGAAGATT 10380 

CCTTGTCCTG TAGTCGCCGC TGCGTGATAA ATAATGTCGG CACCTGCTTG ATACATAGAA 10440 

GAAGCTAAGG CCCGTCCTTT TGAAGCGTCT GCAAATGTCC CTGCATAAGT AGACGTAATT 10500 

TGAATATCTT TGCCTAATTT TTTGCCAGCG TCTGCTACAC CTTTTTCAAA ACCAGCTTGG 10560 

AAACGGCCAA TAACAGGTCC TTCAACCCCG CCGATAAATC CTACTTTGTT GGTTTTCGTA 10620 

GTGTTAGCTG CTGCCACACC TGCTAAATAC GCAGATTCAT TATCTCTAAA TGTTGCAGAA 10680 

GCGACGTTGT TTTTCCCATT AACTGTATCG TCGATTAGAA CGAAATTCGT TTCTGGGTTG 10740 
GCGTCCGCTG CATCTACTAC TGCATTTTTT AGTAAATAGC CAATGCCAAA AATCGTTTTA 10800 

AATTGACTTG AAATGGCTTG GTCAATGTTT GATGTATAAT CAGATGCTTC ATTTGATTGA 10860 

ATATACGCGT ACCCTTGAGG TCCTTCTGGA AGTTTGTGCT CTTTGCCCCA CTCTTGCATT 10920 

CCTTCCCACG CAGATTGGTT AAAAGATTTG TCATCAATGC CGGCTGTATC TGTGACCATT 10980 

ACGACACTAT GTTGTGGATC ACCTTTTGTG TTGGCGTTCT TTTTGCTATT GCCGCCACAT 11040 

GCAGCTAACC CAATGGTTAA GGCAACTGCT GCCAATCCTA GACCTAAAAA TTTTGCTTGT 11100 

TTCACTTTCC GAAAAGCCCC CTACTATTTT ATCTATTATT TCATTCGCAT TTATGCTATT 11160 

GAAATCTACT TTGAGCCCAA CGAAACCTTA TGTCCTTTTT TTAAAAATCT TTATCTGTAA 11220 

ATGAATAGGG CAATAATGCC TCAACTGTCA TTGTTTTAGT CGCGCCATCT TTACCAACTA 11280 

GCGTGACCGG CATTTGTGGA TCAAAGAATT CAGCCATTAC TTGACGACAA GCGCCACATG 11340 

GAGAAATGGG TGCCTCTGTA TTACCTGCTA CAACTAAATG TTGGAAGGTC CGTTCATCTT 11400 

CTGAAATGGC TTTAAAAATC GCCGTTCTTT CTGCACAATT TGTTAAACCA AAGGAGGCAT 11460 

TTTCAATGTT GACTCCTTGA TACATTTTCC CGTCTGCAGT CACTAAGCAA GCGCCCACAG 11520 

GAAAATTAGA ATACGGTACA TAGGCGTGTG CCAATGCTTC TACGGCACAA TCAAGCCATT 11580 

CTTGTTTTAC TGTCATCTTT TTCGCCACAC CTTCGTTTAA TTTGAAATAC GAATTAATAC 11640 

CCTTCGCCAG TTGCGCCAGA AACAATTGCT ACACCCGTAC TTGCACCGAT GCGTGTTGCC 11700 

CCCGCTTCAA TCATTGcTAA CGCTTCTTCT GCATTATGCA CGCCACCAGA AGCCTTAACG 11760 

CCCATGTCAG GACCAACTGT CTCACGCATT AAACGAATGT CTGCTACTTT AGCGCCGCCT 11820 

GTTGAGAAGC CAGTAGATGT TTTAACGAAA TCGGCACCTG CTTCTTTGGC TAATTCACAA 11880 

GCTTTAACTT TTTCTTCGTC TGTTAATAAT GCTGTTTCAA TAATCACTTT GACTAACGCT 11940 

TTTCCTTTGG CTGCATCTAC TACGCCTTGA ATGTCTTGAC GTACGTAATC GTATTGtTGT 12000 

GATTTTAATG CACCAA'xATT AATCACCATA TCCACTTCAT TCGCACCATT TTTAATCGCA 12060 

TCAGCTGCTT CGTATGCTTT AACTTCTGGT GTATTTGCCC CCAAAGGAAA GCCAATAACG 12120 

GTACACACAG CGACGTCGGT ATCTGCTAAT TGCTCGCTAG CAAAAGCGAC CCAACAAGGA 12180 

TTGATACATA CAGAGAAAAA GTTGTACTCC TTAGCTTCGT CAATAATTTT TTGGACAGCG 12240 

GCTTCTGTCG CTTCTGGTTT CAAAATAGTG TGGTCAATCA TGCGGTTTAA TTCCATGATG 12300 

AGAATTCCTC TTTTCTTTTT GTTTTATTCC GTAATAATAT CGTGAATCAA AATAGGCTCT 12360 

TCGCCGGTTG GTCCAATTTC AATGTTTTTA TAAAGCAATT GTTCGACTTC TGTCACGTCT 12420 

TCACGATTGG CATAAATGGT TAGTAATGAT TCGCCTTTTT CAACTGGGCT CCCTACTTTT 12480 

TTGTGAAGTT TCAAGCCAAC TGCAAAATCA ATCGTATCTT CTTTCGTAGC ACGACCAGCG 12540 

CCCAGCAACA TTGCGGCAAC GCCAATTTCA TTGGCAACGA TTTTTTGTAC CACGCCTGTT 12600 

TCTTTGGCAG GTAAATCAAA TTGATAAGTT GCGGTTAATA ATTTTTCTGG CTGGTCAACA 12660 

ACGGAACTGT CGCCGCCTTG ATTTTCAATC ATTTCTTTAA ATTTTGCGAG CGCTCGACCG 12720 
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CTTGTTAATG 


CTTCTTCTAA 


TAAAGCCCGT .GCTTCGTCTA ACGTCTCGGC 


TTTTTTAGCT 

X X X X A X f\\J\*' X 


12780 


AAAACGACCA 


TCTGACTTCC 


TAAGACATAA 


CACATCTCCA 


CTAAATCTTC 


TnGfTCATTT 

iVjU^v^vnl x x 


12840 


CCTTGCAAGG 


TCTCGATTGC 


TTCAACGATT 


TCTAAACTAT 


TTCCGATGGC 


CTCCCC? AAA 


12900 


GGTTGTGACA 


TGTCTGAAAT 


TACTGCCATC 


GTTTGGCGAT 


TTGCCAGTTT 


TrraaTnr^A 

I VyLMn X O^-Vjrt 


1 ?960 


ACCATTGTTT 


CCGCTAATCG 


ACGAGCACTT 


GCTAGGTCTT 


TCATAAAGGC 


1 1 tiACLrt 




GTAGTCACAT 


CTAAGACAAT 


CGCATCTGCC 


CnTGCGGCAA 


TTTTTTTACT 


C ft T ft A TP P ft ft 


IjUOU 


CTTGCAATTA 


ATGGAATTGA 


GTCAACAGTG 


GCGGTCACAT 


CGCGTAACGC 


ftTft ft ftPTTTT 
r\ 1 AAAVj 1111 


1 Jilv 


TTGTCTGCTG 


GTGCTAGATC 


ACCTGACTGA 


CCAATTACTG 


CCACTTTTGT 


A i bo i 1 




AATTGAATAA 


AACGTTCTTC 


TGTCAATTCA 


ACTTTAAAAC 


CAGGAATCGC 


TTfTB ftTTT A 
1 1L1 Art 1 1 x r\ 


13260 


TCCAAcGTAC 


CGCCCGTGTA 


GCCTAAGCCA 


CGACCAGACA 


TTTTTGCTAC 


attp,appppp, 


13320 


ACACTAGCAA 


CTAACGGCGC 


TAAAATAAGC 


GTCGTAGTGT 


CGCCTACGCC 


nPPT^TA^AA 
MV^Vrf 1 1 /varu-i 


13380 


TGCTTGTCGA 


CTTTGATTCC 


TTCTATAGAA 


GAAAGATCCA 


CAATTTCGCC 


1 oAAl obntL 


13440 


ATTGCTAACG 


TCATTTGTGT 


AATTTCTTCG 

X X x w X X 


TCTGTCATAT 


CTTGGTAAAA 


AALAbtLnl 1 


1 J JUv 


AAAAAGGcAC 


TTACCTGaTA 


RTrTfifiaATT 


gAACCmTCTG 


tGtATCCGTT 


b A 1 AA 1 AAA 1 


1JJOU 


TGAATTTCTT 


CGCTTGTTAG 


X 1 Ort 1 O^V^\->VJ 


TCaCGTTTCT 

X U \«* W X X X w X 


TTTCAATAAT 


b 1 b I Abb A 1 1 




CTCATGAATT 


CTTTCCCTCC 


nATTTfCCGa 

uni J- i Vrf^v^uct 


TAATGTATTA 


TACAACAAAA 


1 bb 1 1 bAbb i 


I JuDU 


TTAAACCATT 


TAGTAAACCA 


f^TTTAPTATT 

U X X X *■ x x 


ATTTTTTTCT 


TGTGAAAACG 


pz\ni\pfiAPAP 


13740 


GAACATTCTA 


TTTCTAAGCT 


nTTTTTTAAA 

\3 X X X X X X firm 


TCGGTTTTGT 


GCTTTTTCTA 


7\ ft T ft A A Pftl 


13800 


TCGCATCGAA 


AACTTTATTT 


GGAACTTTCC 


CTGTTGGATC 


TGCAATAGCT 


TPPAPP AA A A 


13860 


ATTTTGCTGC 


ATAATAGCCA 


ATATCAAAAC 


TCGGCTGATA 


GACAGTTGTT 


ft ft APTPftPiPA 


13920 


CCAAATATTC 


GGAAATTTCC 


AAGCCATCAA 


AACCAACCAC 


CGAAATATCG 


TPftPH A ATTT 


13980 


TCTTGCCCGC 


TTCTTCAATG 


GCTTGATAGG 


CCCCCATCGC 


CATTTCATCA 


TTCPPAPA A A 


14040 


AAATTGCTGT 


GATTTCTTCA 


TTTTTCAAAA 


CTTCTTTCGC 


CGCAGCATAG 


PPf^PPATPiPA 


14100 


AGTTCAATTC 


CCCAGCACAA 


ACATAATGCG 


GACGAAAAGG 


TAAATTATTG 


tppt^taapci 


14160 


CATGCTGATA 


CGCCGTCACT 


CGTTCCGTCA 


AACGATAATA 


GCCGGCACTT 


TPAPP,PAAPA 


14220 


TGCCAATATG 


CTGATGCCCT 


TGTTGAATCA 


AATGATGGAC 


CGCTTGATAA 


r^PAPPTTPP,T 


14280 


ACTCTTTAAC 


GATTAATCGT 


CCACTATCTC 


GTGGATTAAT 


CCCACGATCG 


AP A A(*SA ATf^A 


14340 


CAGGAATCGG 


ATGTTCTCGT 


CGGTCTTTTA 


AAATATGAI i 


TTCTGGCAAA 


APATTPP,P,PH 


14400 


TGGCAAAAAT AATGCCATCC ACCGACCGAT GAATCAACTC GCTAACATAT TGCATCGCTT 14460 

GCTCGGAATC ATGCTTTGAA TTGCATAACA AAATCATGTA GCCTAACGAA TTTAAATATG 14520 

TTTCTACACC TTCAATCACT TTTGAGAAAA AGAAATCCGT CACATCTGGC ACAATCATAC 14580 

CAATGGTTTT GGAATGACGG TTGATTACAT TCGAAGCAAA GTAATCTGGT TTGTAATGAT 14640 

TTTCAGCGAC TACCGCCAAC ACCTTGTTGC GGGTTTTTTC ACTGAAACGA CTCCCTTTAT 14700 

TATTCAGAAT TTGTGAGACG GTTGTGACAG AAACACCAGC CATATCAGCA ATTTCTTTAA 14760 

TCGTTAGTTT CATCGCGCCC CTCCCAACGT CTCCTTTTGC ATTGACAGAC ATTTGTAGAC 14820 

GTGTTACAGT CTTCAAAATA CTTATCGAAT CTTTTACATT TTAACAAAAA AAAGACAAAA 14880 

GAAATAGACT AAAAAGATTA AAAAACAAAA GAAAATGGAG GCTATTTTTT TCATGGAAGA 14940 

ATCCCGTGTC CGTCAATTAC CAAAAATCGA GCTGCACTGT CACTTAGATG GGTCGATTCG 15000 

ACCAACCACT TTACGCACCA TTGCTGAAAA ACAAAACATT CCATTACCCC AAGACGAACA 15060 

AGCCTTGAAA GAGTTAGTGG TTGCCCCTGA AAAATGCACC GATTTGAATG ATTATTTAAC 15120 

TCGGTTCGAC TTCGTGCTCA CCTGTCTACA AACAGCTGAG GCTTTGCAAG CAGCGGCGTA 15180 

TGACGTGATT AGCCAAGCTG CCGAAGATGG CGTGGCTTAC ATAGAGGTGC GTTTTGCGCC 15240 

TTCCCAACAT ACAGAAAAAG GCTTACGGTT ACCTGAAATC GTGACTGCTG TTTTAACTGG 15300 

CTTAAAACAA GGCGAAGAAG ATTTTGGTGT AAAAAGCAAT GCACTGTTAT GCGGCATGCG 15360 

GCATGACCAA CAACAAGCCA TAGAAAAAAT CGTTCACTTA GCGCATGATT TCAGAGAGAC 15420 

AGGCGTTGTC GGCTTTGATT TAGCTGGCAA TGAAGTCGAT TTCCCTCCAT ACACATTTGA 15480 

AGACGTATTA GCTTTAGCCA ACCAATTGTC AATTCCATTA ACACTCCACG CTGGGGAATG 15540 

CGGCTGTGGC AAAAATGTCG CTGATGCTGT CACTTTAGGT GCTACTCGGA TTGGTCACGG 15600 

TATTGCTTTG AAAGATACAC CCGAATATTT GGCGCTCTTA AAAGAAAAAA AGGTTCTTTT 15660 

AGAAATGTGT CCAACTAGCA ACTTCCAAAC AGGAACCGTC AAAACATTGG CTGAATACCC 15720 

TTTTCAACAA TTTATAGAGG CTGGCTTAGC CGTCTGCATC AATACCGATA ATCGAACTGT 15780 

TTCAGATACT ACGTTAACGA AAGAATTCAT GAAATTGGCC ACTTGGTATC AATTAAGTTA 15840 

TGACGAAATG AAGCAACTCA CTAAAAATGC GCTTGCAGGT GCTTTTTTAT CGCCAGACGA 15900 

AAAAAAACTG TTGAACCAAA AAATTGATCA AGCGTATCTA TTCTAAAAAA GAACGCCACT 15960 

AAAGTGGCGG TTCTTTTTTA CTCTGCTTCT TTAACAtTCG TAAAATATAA TAGCCTTTGT 16020 

CTTTTGTCAC AATTTCAACA TTGCCAAATA CTTCAGCCAT TTTCTTTTCT GCACTAGGTG 16080 

CACCTTGTTT CTTTTGAATA ACCACCGTTA GTGTGCCACC AACTTTCAAC AACGGAAGAG 16140 

CCCCTGTTAA AATGCCATGG ACAACTTTTT TGCCTGCACG AATGGGCGGG TTGCTAACAA 16200 

TCGCCGCGTA GGTCGTTTCG TTTAAGGTTT CATACACATT GGAAGAATGG ATATCCACAG 16260 

TAGTAATTTG GTTACGTTGG GCGTTCATTT GAGCTAGCCC AACTGCCCGT TGGTTGACAT 16320 

CCACCATTTC CACCAGTCGA CCAGTCGCCG CTGCTAAGGA CAACCCAATT GGTCCATAGC 16380 

CACAACCAAC ATCTAACAAA CGGCCTGCTG GTAAATTTTC CCATTCAAAG GCATCAATCA 16440 

ACACACGAGA ACCATAATCC ACTGTTCCTC TTGAGAAGAC CCnG 16484 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 



CCGAGTAGTA CCAGAGAGGm AnTCCATGTT wAAAAcATTA CTTCTTTmCA ATCTAGTTAC 60 

AGTAATATGa AATTTTGTTG AAGTGTTTTT AATTCGCCAG AAAAAAAGGT ATACTTGTTA 120 

TAATrACAAA ACTAAAAAAT TACATTTACA TATAAGTTAA AAAAGGAAAG TTGGGGACGT 180 

ATCAATGAAG AAAGAATCAA TGTCACGTAT CGAAAGAAGG AAAGCACAAC AAAGAAAGAA 240 

AACGCCAGTA CAATGGAAGA AGAGCACTAC TTTATTCAGC TCGGCGTTAA TTGTTTCATC 300 

TGTAGGAACG CCCGTTGCGT TACTACCAGT GACTGCTGAG GCAACAGAAG AGCAGCCAAC 360 

AAATGCGGAA GTTGCCCAAG CACCTACTAC GGAAACTGGC TTAGTAGAGA CACCAACAAC 420 

AGAAAcTACG CCAGGAACTA CGGAACAACC GACAACGGAT TCGTCAACAA CGACTGAATC 480 

GACAACTGAA TCATCAAAAG AAACACCAAC AACACCAAGT ACCGAGCAAC CAACAGCTGA 540 

TTCAACTACA CCTGTGGaAT CAGGaACGAC TGATTCTTCA GTAGCAGAAA TTACGCCAgT 600 

AGCTCCTTCA GCAACCGAGT CTGAAGCAGC GCCTGCGGTT ACACCCGATG ATGAAGTAAA 660 

AGTACCAGaA GCTAGAGTAG CTTCTGCGCA AACTTTTTCA GCGTTATCAC CGACGCAAAG 720 

TCCTTCAGAA TTTATTGCCG AGTTAGCTCG TTGTGCACAA CCTATTGCGC AagCCAATGA 780 

TTTATATGCA TCAGTGATGA TGGCTCAAGC AATCGTTGAA AGTGGTTGGG GAGCAAGTAC 840 

GCTATCTAAG GCACCAAACT ATAACTTATT TGGGATTAAA GGCAGCTACA ATGGACAATC 900 

TGTCTATATG GATACATGGG AATATTTAAA CGGCAAATGG TTAGTGAAAA AAGAACCTTT 960 

CCGTAAATAT CCTTCTTACA TGGAATCATT CCAAGATAAT GCGCACGTGC TAAAAACAAC 1020 

TTCTTTCCAA GCGGGCGTTT ACTATTATGC TGGGGCTTGG AAAAGCAATA CAAGCTCGTA 1080 

CCGCGATGCA ACTGCTTGGT TAACAGGTCG TTATGCGACA GATCCTAGCT ACAATGCTAA 1140 

ATTAAATAAT GTCATTACCG CATATAACTT AACTCAATAT GATACACCAT CTTCTGGTGG 1200 

AAATACTGGG GGCGGAACAG TTAATCCAGG AACAGGCGGC TCGAACAATc AATCAGGAAC 1260 

GAACACGTAC TATACTGTAA AATCAGGaGA TACCTTGAAT AAAATTGCCG CsCAATATGG 1320 

yGTGAGCGTT GCTAATTTAC GCTCATGGAA yGGCATCTCT GGCGATTTAA TTTTCGTTGG 1380 

TCAAAAACTy ATCGTGAAAA AAGGTGCTTC AGGTAACACT GGTGGCTCAG GCAGCGGTGG 1440 

tTCTAACAAT AATCAATCAG GAACGAAyAC GTACTAyACt tAAAtCAGGs GATACCTTGA 1500 

AyAAAATTGC CGCCCAATAT GGCGTGAGyG TTGCTAATTT ACGCTCATGG AATGGCATyT 1560 

CTGGCGATTT AATyTTCGyT GGTCAAAAAa TtATyGTGAA AAAAGGTACT TCAGGTAACA 1620 

CCGGTGGCTC AAGCAATGGT GGTTCTAACA ATAATCAATC AGGAACGAAT ACGTACTACA 1680 

CGATTAAATC GGGCGATACC TTGAACAAAA TTTCTGCACA ATTCGGTGTT AGTGTGGCTA 1740 

ACTTACAAGC CTGGAATAAC ATCAGCGGTA GTTTGATTTT TGCTGGTCAA AAGATTATCG 1800 

TGAAAAAAGG CGCCAACTCA GGTTCAACGA ATACGAACAA GCCTACGAAT AATGGTGGCG 1860 
GTGCGACAAC ATCCTACACG ATTAAATCAG GTGATACGCT GAATAAAATT TCTGCACAGT 1920 

TTGGCGTGAG TGTTGCTAAT CTACGTTCAT GGAACGGGAT CAAAGGCGAT TTAATTTTTG 1980 

CTGGTCAAAC AATCATCGTG AAAAAAGGCG CTTCTGCAGG TGGCAATGCT TCTTCAACAA 2040 

ATAGTGCATC AGGCAAACGC CATACAGTTA AAAGCGGTGA TTCACTTTGG GGCTTATCAA 2100 

TGCAATACGG AATCAGCATC CAAAAAATCA AACAATTAAA TGGCTTAAGC GGGGATACAA 2160 

TTTATATTGG TCAAACTTTA AAAGTTGGTT AATTTAAGAT TGAAAAAAAT AGCTATCTTT 2220 

GGTAACATGA GTTTTATAAT AGAAAAAAAC AATGAGAAGG AATAGTAGAA AGAACCATGT 2280 

GATAGAGAGC GTATGGCTGG TGGAAATACG TACAGAAGCT TTTGAACTCG CCTTTAAGTT 2340 

ACTTTTTTGA ACAAACCAAG TAGGAAAAGT CGGTGACGAT CGTTAAGACG TTTGAGGTTA 2400 

AGGTGACGGT GACTGACCGT TTCTTTGACG AATTTAGGTG GTACCACGTT GCATTTGTAT 2460 

GTTCACGTCC TATAGGAATT TTTCCTATAG GACTTTTTTT TATCACTAAG TGGCTAGTTT 2520 

AAGTCAGAAT AGTGACAATG GCTGCAGGTG GCAAATTTTA AGAATGAAAA ATTTTATATT 2580 

ACTAGGAGGA ATAACATGAG CTACAATCAC AAAGAGATTG AGAAAAAATG GCAAAAATAT 2640 

TGGGCTAAGA ACAATTGTTT CAATACATTG GACGACCCAA ATAAAGAAAA ATTTTATGCA 2700 

CTAGATATGT TTCCCTATCC ATCTGGACAA GGCTTACACG TAGGTCACCC GGAAGGCTAT 2760 

ACAGCAACCG ATATTCTTTC ACGTATGAAA CGTGCGCAAG GCTATAATGT GTTGCATCCA 2820 

ATGGGCTGGG ATGCGTTTGG TTTGCCAGCA GAGCAATATG CGTTAGATAC AGGAAATGAC 2880 

CCAGCTGAAT TTACTAAGAA AAATATCGAA ACATTCCGTC GCCAAATTAA TTCACTAGGA 2940 

TTCAGCTATG ATTGGAATCG TGAAATTAAT ACCACTGATC CTGAATATTA CAAATGGACA 3000 

CAATGGATAT TTACAAAATT ATATGAAAAA GGGTTAGCTT ATGAAGCAGA AGTTGCGGTT 3060 

AACTGGGTCC CTGAATTAGG AACTGTTATT TCAAATGAAG AAGTCATTGA TGGAAAAAGT 3120 

GAACGTGGCG GTTATGATGT GGTTCGCCGA CCAATGCGTC AATGGATGCT GAAAATTACT 3180 

GCTTATGCAG ATCGCTTATT AGAAGATTTA GAGCTTGTTG ATTGGCCAGA GAGTATTAAA 3240 

GATATGCAAC GAAATTGGAT TGGACGTTCT GAAGGAGCCA ATGTGACCTT TAAAGTCGCT 3300 

GGCACAGAAG AAAGTTTCAC GGTGTTTACA ACCCGTCCTG ATACCTTGTT TGGTGCAACC 3360 

TATACTGTTC TAGCTCCTGA ACTAGAACTA GTGAAAAAAA TTACGACACC TGAACAAACA 3420 

GCAGCTGTAG AAGCATATAT TGAAGAAACC TCAAAAAAAT CTGATTTAAA TAGAACGGAT 3480 

TTAGCAAAAG AAAAAACAGG TGTTTTCACA GGTGCGTATG CTATAAATCC AGTCAATGGC 3540 

CAAGAAATTC CAATTTGGAT TGGCGATTAT GTTTTAGCAA GCTATGGCAC AGGCGCAATC 3600 

ATGGCGGTCC CAGCACATGA TGAACGGGAT TACGAATTTG CGAAAACATT TGGCATTGAT 3660 

ATCCTACCAG TAATCGCAGG TGGCGACATT ACAACAGAAG CCTATACAGG GGATGGACCG 3720 

CATATCAATT CTGATTTCTT AAATGGATTA AACAAAGCAG AAGCCATCGC TAAAATGAAT 3780 

GAGTGGCTAG AAGAAAATCA CGTAGGGAAA AAAGAAGTAT CTTATCGTTT ACGTGACTGG 3840 
TTATTCTCTC GTCAACGCTA CTGGGGTGAA CCAATTCCTG TGATCCATTG GGAAGATGGA 3900 

ACAACCACAA CGGTTCCTGA ATCTGAGTTA CCTCTACGTT TACCAGTAAC ATCGGATATT 3960 

CGCCCAAGTG GAACTGGGGA ATCGCCATTA GCAAACATTG ATGAATGGGT CAATGTCGTC 4020 

GACCCTGAAA CTGGCATGAA GGGAAAACGT GAAACGAATA CTATGCCACA ATGGGCTGGA 4080 

AGCTCTTGGT ATTACTTACG ATTCATTGAT CCTCATAATA AAAATGAAAT TGCTGATTTT 4140 

GAAAAATTAA AACGTTGGTT ACCAGTTGAT ATCTATATTG GTGGTGCCGA ACATGCGGTG 4200 

CTGCATTTAC TTTATGCTCG TTTTTGGCAT AAATTCTTAT ATGATATTGG TGTGGTTCCT 4260 

ACCAAAGAAC CTTTCCAAAA ATTATACAAC CAAGGTATGA TTTTAGGAGA AAACAACGAA 4320 

AAAATGTCTA AATCACGTGG CAATGTTGTA AATCCCGATG ATGTGGTGGC TAAATATGGT 4380 

GCGGATACGT TACGTCTTTA TGAAATGTTC ATGGGCCCAT TAGATGCTTC CATTGCTTGG 4440 

AATGAAAATG GATTAGAAGG AAGTCGTAAA TTCTTAGATC GCGTTTGGCG TCTGATTGTT 4500 

GATGAAGAAG GCAAAATGCG TGACCGAATT ACCACAATTA ATGATGGCCG TTTAACGAAA 4560 

GTTTATCATC AAACGGTTAA AAAAGTGACA GAAGATATGG CAAACTTGCA CTTTAATACA 4620 

GCGATTTCTC AATTAATGGT TTTTGTGAAT GAAGCCAATA AAGTGGATGC CTTACCTTAT 4680 

GAATATGTGG AAGGATTTGT CCAATTACTT GCGCCAATCG CGCCACATAT TGGTGAAGAA 4740 

CTATGGCAAA TTTTAGGTAA CGAGGAAAGT TTAACTTATG TCCCTTGGCC AACCTATGAT 4800 

GAAACGGCCT TAGTAGAAGA TGAAGTGGAA GTAGTTTTCC AAGTGAACGG AAAATTACGT 4860 

GGCAAACAAA ATGTCGCTCG TGGGTTAAGC AAAGAAGAAT TAGAACAAAT TGCAATGAAC 4920 

CATGAAGCTG TTAAAGAATT TATTGAAGGA AAAACAGTGC GCAAAGTGAT TGCTGTTCCA 4980 

GATAAATTAG TAAATATTGT TGCAAATTAA GTTTACATGT TTTTATTAAA AAAAGAGCCG 5040 

GACGTATCAA AACGTCAGGC TCTTTTTAAG GTTGTTTTGT TTTACCTTTT TGATTCATAT 5100 

GTTCTTTACA TGTGATTGTT ATGACTGAAA GCAAGGCGAT GAAGAGAAAA AAGTAACCAA 5160 

GCGGCGTACA GAAAAAGGCG GGTTCTTTTA AAAGTCCATC GCTAGATAAT GTTGATCCAA 5220 

CTAAAATAGA GATCATAAAT AATAAAAGAC TAATCAAGCC AAGAAATAAG CTAATACGAT 5280 

ATTTTTTCAA CTTCAGTTCC TCCTTTAATG AACTATCACT AAGATAAATC AAAAAGGTTA 5340 

ACTTTTTTTT AGGCAACTGT TAAAACTTAG ATAAGAAATC TATTTTCTAA GTTTGTCGTT 5400 

GTTGGTAATT GGTTAAGATC ATACGTGAAA AGGGACCGAC AATCAGCAAT TGATAAGGAA 5460 


GGGCAACAAT 


AAAGTTCAGG 


CCCCAAGCTT 


GGAAATACGC 


ACT TAG TG AC 


AACGTCTGGC 


5520 


CTTCCATTAA 


TAAGCCAAAT 


AAAGACATAC 


AAGTCACCAT 


GCTGATAATC 


ATGAGACTCG 


5580 


AAATGGTCAG 


AATAAGTAAT 


AGCCGTTTTT 


CTTTATTAAA 


GGGTAAAGAA 


AAAGCGATTT 


5640 


TCTTAGCTAA 


GACACCAACG 


ATGAAAACAT 


CTAAAATAAA 


AGCAACGATA 


AATCCAGGAA 


5700 


GCAAACCTTT 


TGATAAGGCG 


TACAATGAAA 


ATTCTCCATG 


CAATAATAAA 


TTATAGGTGC 


5760 


TCATTCCGAA AACCATCAGA AAGCACATTA ACGTAgTAAA AAGCATTCCT TCTTTTTTAT 5820
TAGTAGGCAT TTTTTTCTCC TTTCATTTAT TGAGAGTAAT GATTGTTCTA GGCAGAAAaG 5880
GACAAATTCT CTCGACTTTT ACTAGTGTAG CAAAAATAAA AGTTCTTCAT AAGTTAGAAT 5940
TTGCTAAGAA TTTGATTCAA ATAAAAAGGT GTACTAAGTC AGCAGTACAC CTTTTTATTC 6000
GTTATTTAGT TTATTTTTGT GTTATTTGTA ACGTCCTCTC CAAACCATTT TTGAGAAATT 6060
TTACTAAGGG TGCCATCTTT TCGTAACGTT TCAAAGGCAG TATTGATTTT TTGGACTAAT 6120
TGATTGTCTG ATTTGCGGAC GCCCACAGCA AAATCTTCAT TGTCATAGCC TACATGAGAA 6180
ATAGTATAGT TTTTTAAATT ATCTTCGTGG GAAAGATAGT AGTTGGCGTA AACGCGATCG 6240
ATTAGGAGTC CGTCAATTCG ACCAGATTTT AAATCTAAGA AAGCTTCATT AAAGCCGTCA 6300
TATAAAATAG GTGTTTGGTC TTTAACAAAT TTTTTCAAAA CGTCAGGCTG ACTTTCGAAG 6360
CCATCATAGC CAGAAGAGCC GTTTTGAACC CCTAAAATTT TGCCTTGCAT GTCGCTCGCT 6420
GTTGCAATGT TTTTTTCTTT TAAAGAAACA AGTACTTGGT CGTTCGTCAT GTAAGGTTGT 6480
GTGAATTGAA CTTTTTCGGC CCGCTCGCTC GTTTTAGTGT AGCCGTTCCA AATAAGATCA 6540
ATGGTTTGAT TTTGTAATTC TGTTTCTTTC ATAGACCAAT CAATCGGTTG GAAGTCAACG 6600
GAAATGCCAT AAAGTTTAAA AACCGCTTTG GCTAAGTCGA CATCAAAGCC GACAATTTTG 6660
CCTGATTTAT CTTGAAAACC CATGGGCACA AAGGAGTCAT CTAAGCCAAT AATAATCCGT 6720
TTTTCTTCGT TAATCCGTGT CCATTGATCT TCGTTGCTTT TTCTTTTACC ACAACCTGCA 6780
AATAGGAGTA AACTACAACA GATAACCAGC AAGGCTAAAG AATATTTCTT TTTCATCATA 6840
AACTCCTTTC TATTGGACGG GGGCAACCGT CAAAAGATTG TCTGCAATTT TTTCAGCGAA 6900
AGCCATATCG TGGGTCACGA CAATCTGTGT CATGCCTTGT TTCCGTAAAT CTAAAATAAC 6960
TTCTTCCACT TGTTGGCGTA AAGCTGGATC TAGCGCGCTC GTTGGTTCAT CGTACCCAAG 7020
AACTTTTGGC TTCATTGCTA AGGCACGAGC AAGTGCAACT CGTTGTTTTT GTCCACCGGA 7080
AAGTTGGTAC GGATAGAGTT TTTCTTTACC AGCCAAGCCC AATTTTTCCA GTAATTCTAA 7140
AGCTTCTTGC TGACTTTTTG CTTTTTCTTC TTTTAGTACT AATGTAGGTG CTAAGGTAAT 7200
GTTTTCTAAA ACTGATAAGT GAGGGAAAAG TTGGAAATCC TGGAAAACGA CCCCAACCAC 7260
TTGATCGGCA TTGTCCATTT CaGCAGGATT GAAAGGAACA CCATCCAACA AAAGCTCACC 7320
AGAATCAATG GTTTCTAAAC CAGCCAAACA ACGAAGTAAA GTGGTTTTTC CGCCGCCTGA 7380
AGGACCAACA ATCGTTAAAA TTTTACCATC TGGAATTTCT AAATTTAAGT GATCAATAAT 7440
GGTCCGGTTG TCAAAATGTT TGGTTAAATT TTTAATAGTT AACATGGTAG AGTCCTCCTA 7500
TTTATAATAT TGATAGAATT TTTCAATTTT CTTAGAAGCA ACAGTCAAAA TTGCCGTAAA 7560
GAGTAAATAA ATCACTCCCA CTAAAGCTAA TGGCACTAAT GTAACATCGC GGCTCATCGC 7620
AATTTTACCA GCACGTAAAA GATCGCCTAA GCCAAGAACA TAAATAAGAG AAGTATCCTT 7680
GACCAAATTG ATAATTTCAT TTCnTACGGA AGGCAGCACG ATTTTAACAA CTTGTGGCAA 7740


AATAATCCGC GTGACTGTTT GAAAACGTGT TAAACGTAAA ACCTTAGCAG CTTCATATTG 7800
ACCTTGGGGA ATTGCTTGAA TCCCCCCGCG GAAAATTTCA GCAAAGTACG CAGCATAATT 7860
TAAAATAAAT GCGAATAAGG CCGCATCGTA ACGTTCAAAG ACAATACCAA TTAATGGTAA 7920
GCCGTAAAAA ACAAAAATTA ATTGTAACAA TAGCGGAGTG CCGCGCATTA ACCAAATGTA 7980
GAGGTTAATT AAGTAACGTA AAGGTTTAAA CGAACTTTGT AAAATAAAGG CAATGATGAT 8040
CCCCAGTGGA ATCGAGCCAA GGAGCGTAAA GAAAAAGAmT TTCAGTGTCA TGCCAGTCCG 8100
CTAACTAAAG CAGGTAAAAT TTCTAAAATA TAATCCATCA TTTTTTCCTC CAACAATTTT 8160
(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

CAnCCCAAAT TTnTAGCTTG ATTCTGCAGC TTTTGATAAC aTTTGGTCAA TTTTTGGTAA 60 

TCCATTAAAT CAACTAACTG ACCTTGTTTT AACATGkkrs tAAAGTCTAA ATCCCAAAGC 120 

ATTTCTTTAG TCGGCTGTTG CAATAAATAA TACATAGAAT GACTACTAAT AATTTCAGTG 180 

GTCGGTTCTT CAATTGTTTC ATAAAAAGTC ACAAAATATT CTCGCTCAAT CACAGTAATA 240

ATTTCTTCTT CTGAAAGCTC ACAATCTTCT AAATTTTGAT AGAATATCAA TAATTTAATC 300 

CGTTCATCTA TTTTATTCAG CGAancGACt AATTCAGGAA AAAGAACCGA CTGACAATTT 360 

TCTACTAATT CTCTTGCCTT ATTTTGTTCA TTTTTATAAA AGTTGTATAA CTTTTTCCGC 420

TGTTTTAACG AGTTTAGTAA GTTATCGGGT ATATTCCAGT AATATTGTTT CTTCATAGCA 480 

TTCCCACTCT TCTTTTTTAT GTACAATTTT ACAAAATTGG TCTTAGCCAA TTTCGTCTCT 540

AGTTATTCTA TCAAAAAAGT GATGCTTTTT TACTGACaAT AATTACTCGT TTTACGCCCA 600

AAAAGTGAAA ATCTAGCTGT TAAGTAATTA TTGGGGCTTG TTyCCCTAGG gCCGAaAAAA 660 

CTTTTTCCTT TTAAAAGTCC AGAAAAAGAC CTACACTAGA GATAGACAAA CGTTGGAGGG 720 

ATTTAAATGG ATTTACACTT AACGAATAAA TTAGCATTAA TTACTGGTTC AACCAAAGGA 780 

ATTGGCAAAG CAATTGCGAT TGAAATGGCT CGTGAAGGGA CCGATGTCAT TATCAATGGG 840

CGCAATGAAG CCGAAGTAAC TAAAGTTGTT AAAGAAATAC AAACAATGTT TCCAGACACT 900 

CATCCTCAAG CAGGGACTGC CGATATTTCC ATTGAAAGTC AACGAGCTAC TTTGCTTGAA 960 

AAATTCCCTA AAGTCGACAT TTTAGTGAAC AATATGGGGA TTTTTGAGCC AATGGAATAC 1020 

TGGGACATCG ATGACGCCAC TTGGGAAAAA TTTTTTACTG TGAACGTGTT GTCAGGCAAT 1080 

GCATTAGCAA AAGCTTATCT ACCTAAAATG CTTGCACAAG ATTTTGGTCG CATTATTTTC 1140

ATCGCTAGCG AAGAAGCGGT GATGCCTTCT GGCGaAATGC CCCAATATAG CATGACAAAA 1200 

ACGATGAATC TTTCCTTAGC TAAAAGTTTA TCCAACTTAA CTGTTGGCAC ACATGTCACC 1260
GTTAACACGG TTATGCCTGG CTCAACCCTT ACCGAAGtGT AGAAAAAATG TTGGAAGATA 1320
TGTACGCTGA TTCAGACATT CCCAAAGAGG ATTGGGAAAA AGATTTCATG AAAAATCATC 1380
GTTCTCGTTC ACAAATCCAA CGGCTCATTC GACCAGAAGA AATTGGTCGT TTTGTTACCT 1440
TTGTGGcCAG CCCAGATTCT TCTTCTTTCT CAGGAGAAGC CTTAAGAATC GATGGCGGCT 1500
TAGTTCCAAC AATCTTCTAA AATAAACAAA AAAGACAGCC AGCAAAAATT CACATTTTTn 1560
GCTGGCTGTC nTnTTArGCA TATTAATTGA TTACTTmCCA AAAGTAAGTT ATATTTATCT 1620
CGAATTCGAA wTAAAAAGAG GTGAAmCAAT GAACCAACAG CAAGAAGCTT TAAAAGCCTA 1680
TATCGGTTTA TTAAGAACCA GCCATCGACT AGAGCAACTT GCCAAGCAAG ATGTCACTTG 1740
TTATGACTTA AACATTACAG AATTTTCAGT ATTAGAGCTG TTACTCCATA AAGGTCCTCA 1800
GACCATCCAA AAAATTAAGG AGAAAATTTT AATCGCTAGC AGTAGCACCA CTTATGTTAT 1860
TGACCAATTA CATAAAAAAG GCTATGTAAC GCGCACCCCC AGTGAAAAAG ACCGACGCAT 1920
TACTTACGTT GAATTAACAG AAGCTGGAAA AACATTAATT GAAGAAATTT TCCCGACGCA 1980
TGCAAAGCGA ATTGCAGAAG CATTTGAACA ACTCTCTTCC GAAGAATTAA CACTTCTTCA 2040
AAAAACTTTA CGAAAAATAA CAAATGAaAC GAAATGAGGA AATAATGATG AAAAAAGAAG 2100
ATCAATTATT AGGAATCCAC CACGTTACAG CTATGACAAG TGATGCAGAA AAAAACTATC 2160
ACTTCTTTAC AGATGTTTTA GGGATGCGTT TAGTCAAAAA AACAGTTAAT CAAGATGATA 2220
TCTATACTTA CCATACCTAT TTTGCTGATG aTTTGGGTAC ACCAGGTACA ACCaTGACCT 2280
TTTTCGATTT TCCCAATAAC CCTAAAGGAT TAAAAGGAAC CAATACAATT TCAAGAACAG 2340
GGTTCCGGGT TCCTTCAGAT GCAGCTTTGA CTTATTATGA AAATCGCTTC AaTGAATTTG 2400
CTGTCAAACA CACAGGCATT TCTGAAGAAT TCGGGAAAAA AGTCCTTCGC TTTTGGGATT 2460
TTGATGATCA AGCGTATCAA TTAATCTCTG ATGAATTAAA TCAGGGCGTT GCAGCGGGCA 2520
CCCCTTGGAA AAAAGGACCT GTTCCAACAG AATTTGCGAT TTATGGCTTA GGACCTGTCG 2580
AAATAGCTAT TTCCTATTTT CATGAATTCA AAGAAGTCTT TGAAGAAATT CTAGGCTTTC 2640
ACGTAGTGGC ACAAGAAGGC AATCGCTATT TACTAGAAGT TGGACAAGGT GGCAATGGTG 2700
CCCAAGTCGT TTTGGTAGAC GATGATACTA GCTCACAAGC GCAACAAGGA TATGGTGAAG 2760
TACATCACGT TGCATTCCGC CTAGCGGATC GTAAATCACT TGGGACTTGG CAAGCGCTCi 2820
TTGATCATTT AGGCTTACAA AACTCTGGCT ATGTCGATCG TTATTATTTT GAATCAT1G1 2880
ATGTTCGCAT TGGGCATATT TTAGTCGAAT TAGCCACCGA TGAACCAGGA TTTATGGGGG 2940
ATGAACCTTA CGAAACATTA GGAGAAAAGT TATCTCTTGC GCCATTTTTA GAAAACCGTC 3000
GTGAATATAT TGAGAGTGTG ATCAAGCCTT TCAATACAAA ACGAGCCTAA GGAGGAAAAA 3060
ACATGCATTC AaTTTTTAAA AAAGGACATC CTGAAGCACC TGTCTTTGTG CTACTTCACG 3120
GTACAGGTGG TGATGAAACA TCTCTCCTAC CAATTGCCCA AGAACTAAAT AAACAAGCTA 3180


CTGTGCTAAG TATTCGTGGT GATGTTTCAG AAAATGGAAT GAATCGTTAT TTTAAGCGCC 3240
TAGCGGAAGG TCATTATGAC TTAGAAGATC TAGAAAAACG CGQCGAGGCG CTTCATAAGT 3300
TTATTCAACA AGCCGCTAAC GAGCATCAAT TTTCATTGGA TAAAATTATT TTTATTGGCT 3360
ATTCAAATGG GGCCAATATC GCTATTCAAT TATTGCTTAC TCATCCCGAC AGCTACCATC 3420
AAGCTGTCCT CTATCATCCC ATGTTTCCTG TTGAATTGAC CAATCAACCA GACTTGACCG 3480
ACACTTCTGT TTTATTATCT CTAGGAGAGC ATGACCCGAT TGTCCCACTT CCTGAGAGTA 3540
TGCGTGTGAT TCAACTATTT CAGAATCATG GAGCAACCGT ACAAGAGGTT TGGACACAAA 3600
GTCATCAATT AACTTATCAA GAAATTAAGG AAACACAAAC TTGGTTGGCA CATCTGTCCT 3660
CTTAACAAAA AAGTGGGTCC TAAGTCATTT CGACTTAGGA CCCACTTTTT CTATTTTAAG 3720
ATAATTCACT GTCTGAGCTT CTTTTTATTC TGCCATTTGA ACAACTATTT TACCAACTGC 3780
ATGGTGCGTT TCACTGAGTG CATGTGCATC ATAAATCCCT TGTCTAGAAA AAGGGAAAAC 3840
TTCACCAATA ATTGATTTAA CTTTTCCAGC TGCCATCAAA TCAGCAATTT TTTGTAATTG 3900
TTCTCCATTT GGTTGAAGCC AAATACTTTC AGCAGAAATA TTTTTTTCGG CTGCCAATTG 3960
TTTATCTTCA ATTCCCACAA TTGAAACAAG ACGACCGGTG TTTGGTTTTA AGACGGCAAA 4020
ACTATTTTTT TGAACCTCAC CACCCATTGT ATCAAAGACT AAATCAACAT CAGTTAATAC 4080
CTCTGCAAAA TTCGTTGTAT GATAATCAAT CACTTCATCT GCACCAATTT TTTTCAGTAA 4140
GGTATGATTT TTTGCGCTAC GGTcGTGATG ACATGTGCGC CCGCTTCTTT TGCTAGTTGA 4200
ATCGCATAAG TACCGACCCC ACCTGCACCC GCATGAATTA AAACAATTTC TCCTTCTTTA 4260
AGGTGACCAT GATCAAACAA TGCTTGCCAA GCGGTTAAAC CAGCCAACGG AACGGCAGCC 4320
GCTTCTTCAA AGCTAATTGT TTCAGGGATT TTtGCTAATA AATGATCATC CACAATCGTT 4380
ACTTCTGCAT AGGTACCAAA ACGAGTAGTT TCAGGACGAG AAAAAACTTT ATCGcCAACT 4440
TGCCAATCCG TTACTTGACT CCCcACCTCT GTAATCACAC CAGCGACATC CCAACCAAGA 4500
ATAATCGGAA AGGACCAATC AAACATCTGT TTtAAATATC CTTCACGCAA TTTCCAATCA 4560
ATCGGATTAA TTGATGTCGC GTATTCTTTT ACCAACACTT GATGTTCCGA TAATTCTGGC 4620
AGTGTTACTT CTGCTTCTTC AAGTACCTCT TTACTTCCGT ACTGATTAAT CACAACAGCT 4680
TTCATAAACC AACTTCCTCT CATTTGTTTC ATTTTTCACA CTTTTAGTAT ACGnCCTATT 4740
GCCTTCTTTT TATATAAGAA TATGCTTTAG TTAAATTGAG AGAAAGTAGC TGTTlIIACj1 4800
CTATTATTCA TAaCGTAAAG ATTCAATTGG ATCTAATTTT GgCTGtCTTL 4860
AGTTCCTGCT AAAAAAGCaA TAAACATAAT AACTAAGATA ATCGTTAGTG AAGATGGCAA 4920
TGAAAATTGA ATTAATTTAA ATCCTGTTAA AGCCTTTAAA AAAGAATCCG TTGCCAAACG 4980
ATTAACTAAA TTTCCAACCC CAACAGCTCC TAAAATGCCT AATATTGAGC CAAAAAAGCC 5040
AATTAAAGCC GCCTCaACAC TAAAAATTGT AAAgACtTTC CCATTGCTAA GACCCATGGC 5100
TTTCATTAAA CCAATTTCTC GCGTTCTTTC TTGGACAGAC ATATACAGCG TGTTAATAAT 5160


GCCAAAGcTT GCAGCTAACA AGGCAATTGC ACCAAACATG GTTAAGACAC CAGTGATTGC 5220
ATTGATTATA TTACGAATCA TGCCAATTTC ATCTTCGACA GTAGTCGCTA AATAGCCAGC 5280
TTTGTCTAAA TCTTTTTTGA TGTCCTTAAT CTGCTCTGGC GTGCTGTCTT TTTTAACTTC 5340
AGCAATAATC ATCGCGTATT GATTTTTTAA ATGTTCTGGA AGATCTGCTT GATTGATTGA 5400
GACAACTTTA TCAATCAATG CTTTATTCAT CAGCGAGAGC CCATTTTGAA TGACACTGGC 5460
ATTTCTAACG CCAACAATTT TTGCTTCAAT AACTTGCTCT TGTCCTTTCA AGGAACTTGA 5520
GATACCTAAT TGAACTGTTT CCCCCACGGC AGCTTTACTA GATGTATAGC CGAGCGCTTT 5580
TACATATTCT GGTGACAAAT TAATTTCGAA GTCCTGGCTC GTTTGCGAAA CTTTACGGCC 5640
AGCCGCTAAA TCAATGGTCA TCTCATCTAG CGCAGAAGTT GCTGAAAATA CATATTTATG 5700
CTTATCTGCT CCTTTTATAT AATCGATAGC AACGGATTTC ATTGGTTCCA CTGATGTAAC 5760
ATCCGAGATT TTTTTAATCT TCTCAATATC TTTTTCAGCA AGCATACTTT GTTGTTGAAT 5820
CGTACTGGTT TTTTTCTCTG GATTATATTT ACTTGGTTCT GTTCCGTTAC CAACATTCAT 5880
CTCCATTTTT GGCTGAATAA ATAACTGATT TGCACCACCG ACACTGCCGA CTTGTTTGTC 5940
AATATAATCA TTTACACCAA TATTGACTCC TGTAGTTAAA GCTATGGTAA ATGCTCCAAT 6000
AAAGATTGCA ATAATGGTCA ATACGGTTCG ACCTTTATTA CGCATTAAAT TGGTACTAGC 6060
TGATTTTAAA ATATCTCTAA ATTTCATTTA CTCATCTCCT CCTACAATCA AGcCGTCTCG 6120
AACATGAACT TGCCGATCAC AACGTGCTGC TAAGTCTGGA TCATGTGTCA CAATAATtAa 6180
GGTAATcCTT tATTTTTATT TAAAyCAAaT AACAATTCTT CAATTTTCTT CCCAGTGGCA 6240
GAATCTAAAT TCC 6253

(2) INFORMATION FOR SEQ ID NO: 161:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3312 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161:

TGTGGCAACT TAAAAAAGAA CAAAATGTTC CCATTTATCG CAGTATTATG GAGATGATTG 60
TTCAAAAGAT TCAAACAGGC GAATTATTAC CAGGCGAAAA AATCCCTTCG GAACGTAAAC 120
TGGCAGAGTA CTTGTCTGnC AATCGTTCAA CCGTTGTTCA TGCATTAGAC GAACTGGTGG 180
ACTTAGGCTG GATTGTGCGA AAAAGAGGGA GTGGAnGTnT GTCAACGAAG GAAAATGGGG 240
GGTAATGATG ACGCCACGAA CGGATTGGCA TCGTTATTTA GAACAAAATG TGTTCAAACA 300
AGTTCATCCC TTGGTGACAG AGATTGAGTT ACGTGCTAAA CAAAATTTGC CGACTGACTT 360
AGATATGTAT ACAGGCGAGT TGCCGTTAGA CTTAATTCCC AGTTTCGATT TTCCAGCATT 420
AAATTGGAAA CAATTTTTAA AAGAGGAAGC ACAAGATGAA TTAGGTTATT TGCCGTTAAG 480
AAAAGAAATC CAAGCCATTA CAGCAACGGA ATATCAGCTT CATTTACCAA GTGAAAGTTT 540
ATTGCTTACT TCAGGTGCGC AACAAGCACT TTTTTTAGTT TTACAAGTTT TATTGAGACA 600
AGGAGATAGT GTTGCCGTAG AAGATCCATC TTTCTTCTAT GCCTTACCAA TTTTCCAAGC 660


TGCTGGTGTG CGCTTATTTG GTGTGCCCAT GACAGAGGAA GGGATTGATC TTGAAGCGCT 720
AGAACAAACA ATTCGACAAC ATCGCATCAA AATGGTTATG GTTAATCCTT CCTTTCAAAA 780
TCCGACTGGG ACAGTGATGC CTTTGAGAAA AAGAGAACAG CTTGTGAAAG TCTGTCAAAC 840
GTATCAAGTT CCTATTTTAG AAGATGATGT TTTTGGGCAG TTAAGTTTTA TCCCGAAAAC 900
AGAAATTCCT CCTTTAAAAA AATTAGATCC AGATAATGTC TTGTATATCG GTTCGTTGTC 960
AAAAATTTTA GGTTCGACCA CGAAAATTGG TTGGTTAAGT GCGCCCGCCT CCGTGACGAA 1020
ACAAATTGCA GAAGCCAGAA AAATGATGGA TGTTTCGTTG AGTATTTTTC CACAGATGCT 1080
CGCTAAAATG GCGATTGAAG ATCCTTCTTT CTCTGAAAAA ATTACGTTAT TAAATAAACA 1140
AGTGGAACAA CGAGCAACAG CTGTATATCA AGTATTTAAA TCGTTGTCAG AATGGGAAGT 1200
TTCGCCAGTA AAAGGCGGCT TTTATTTATG GGCACATTGG CGTCAAGGGG CACTTAAGCC 1260
AGAAGATTGG CAAGTGTTTT TACGAGAAGG TGTTTTGGTA GCACCAAGTG TAGCTTTTAG 1320
TGAAAAACGT GGAAGTATTC GCTTGAACTG TTCCCGAATT TCACCAGAAG AAATGCCTCT 1380
TTTTTGTGAG CGAATGGTAC GGATTACACG ACAGCTTTCT GAAAATAGAC AAACAAAAAT 1440
AGAACAGTAA GTAAGATTTT TCTTATAAAA ATCATTTGCT GAGAGAAAAC GCTACAACTT 1500
CCCTTTTCTA AACGAATATC CAATGCTATA CTTAAAAAGT AAAAACAAAA AAACAAATGA 1560
GGTGTTTTCC ATGTCTTGGG AACAAGTTTA CCAACAATGG TTAGATGAAG AAAATATTCC 1620
AGAAAATTTA AAAAATGAAT TAAAGGACTT AAATACAGAC CCTGAAAAAT GTGAAGATGC 1680
ATTTTACGCA CCATTAGAAT TTGGGACTGC CGGGATGCGT GGGATTTTAG GCGCCGGTAT 1740
TAATCGGATG AATATCTTTA CGGTTCGTCA AGCAACAGAA GGACTTGCAC GTTTTATGGA 1800
TACGCAAGAT CCTGAAACAA AACGTCGCGG TGTAGCTATT GCCTATGATT CACGTCATAT 1860
GTCTCCAGAA TTTGCAATGG AAGCAGCTAA AACATTAGCT AAACATGATA TTCCTTCTTT 1920
TGTGTTTGAA AGCTTACGAC CAACACCAGA ATTATCTTTT GCGGTCCGCT ACTTTAAAGC 1980
TTTTGCGGGA ATCATGATTA CGGCTTCTCA CAATCCAGCT GCCTATAATG GCTACAAAGT 2040
CTATGGTGAA GACGGTGGTC AAATGCCGCC AGCTGATGCA GATGCATTAA CTAAATATGT 2100
TCGTAGTATC GAAAATCCTT TGAAAATTGA TGTCTTATCA GATGAAGAAG TTGCCCACAG 2160
CGGTTTAATT AACATTGTTG GTGAAGAAGT CGATAACGCG TATTTGAAAG AAATTAAAAC 2220
AGTTAGGATT AATCAAGAAT TAATTAACGA AATGGGTAAA GAATTAAAAT TAGTTTACAC 2280
TCCCTTACAT GGAACTGGGA AAATGTTAGG AGAAAAGGCA TTAAAACAAG CCGGTTTTGA 2340
AAAATTTGTT TTAGTACCTG AGCAAGCCGT TGCTGATCCT GACTTTACTA CAGTTAAGTC 2400
ACCAAACCCT GAAGAACATT CAGCTTTTGA ATACGCGATT CGTTTAGGGG AAAAAGAAGG 2460
CGCTGATTTA TTGATTGCAA CAGATCCAGA TGCTGATCGT TTAGGTGCTG CTGTGCGGAT 2520


GCCAAACGGA GACTACCAAG TATTAACAGG AAATCAATTA GGTTCAATTA TGATTCATTA 2580
TATTTTAGAA GCCCATCAAC AAGCAGGCAC GTTACCTCAA AATGCTGCGG TTCTAAAATC 2640
AATCGTTTCT AGTGAACTTG CTACAGCGAT TGCTGAGAAA TGTAATACGA AGATGTTTAA 2700
TGTTTTAACA GGTTTCAAAT TTATTGCTGA AAAAATTCAA CAATACGAAG AAGACCACAG 2760
TCAAACGTTT ATGTTTGGTT TTGAAGAAAG TTATGGTTAC TTAGTGAAAC CTTTTGTTCG 2820
TGATAAAGAT GCAATTCAAG CGTTAGTTTT ATTAGCAGAA GTAGCGGCTT TCTATAAAAA 2880
ACAAGGAAAA ACATTATACG ATGGTTTACA AGATATCTTT GAAGAATTTG GTTACTTTGA 2940
AGAAAAAACT ATTTCCGTAA CAATGAGCGG AATTGAAGGC AGCGGTAAAA TCAAAGCTTT 3000
AATGGCAAAA TGTCGTGAAC AAGCGCCAAC TGAGTTTGCC GGGATTCAAG TTGCTCAAAC 3060
GGAAGATTTT AAAGAATTAA CACGCACTTT TGCAGATGGT CAAACAGrAC AATTACAAAC 3120
GCCTCCTTCT GATGTGTTGA AATATCATTT AGAAGATGGC AGTTGGATTG CTATCCGACC 3180
ATCAGGAACA GAACCAAAAA TTAAATTCTA TCTAGCGACA AAAGCAACAA GTAGTTCAGA 3240
AGCTTCTGAA AAAATTGCTG CGTTTGAAGC nGTCGTTAAT GAATTAACAA AATAAAATAA 3300
ACGAAACAAG CT 3312



(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 



TATCTTCGTG AAATTCAAGA AGACTTAGCA ACCCTTCCTG AAAATACCTT AGAGATTCAG 60
ATCCATCACG GAAAAATTTC ATTACATACG ACTTCCAATG ATTTTCTTTC CCACTATTTT 120
ktAtTATTTA AtTAtTaCTG JcGtCCaTyCT ACTGACTTCa TyCTTTTTCm AGcAATTATC 180
AAAAAAGrAA ACmATTCTGT CTGGAAAGTG AGTCmAGrAA CTAATTTTAG CTCATCTTAC 240
ATTTACAAcG GTTAAaTAAC ATTAATAAAC TATTTGGTTT GTATGGAATT TCCGTCCATT 300
TTTCCCCCTC CGCGAGTAAA GTGATTACAG GTTCTGAATT TCAAGTCCTA TATGCTTTTC 360
TTGATGTCTA CTGGACCATC TTTTTAAATA CCTTAGCCTT ACCACGTTAT ACCTCTACTT 420
ATTTTGAAGA TATACTGACG CAGCATATTA AACCAGAAAT TCTTGAACGC TTGGAACACG 480
CTTCTCTACT AAAATTAAAT TTACTTTTAC ACATCGCAAA ATATCGCTTT CCATATACTT 540
GTGAAGATGA TCTCCAAAAA GAATTTGAAG AACATCCTTA TTTAAGGCTT TTCTCAGACT 600
CAAAACAAAC TATTTTTTGT TCCCAATCAA ATTTTTCAGA AGATCAACAG CGGGTTGTAG 660
ATATCATTGC ACCCATTGCC ATTGATAATT TTGATTCAGC AAAACGAGCC AAAGATTTAC 720
TGGCTGCTCT AAATGaAATT TCTTGCCCAG AATATTTCTA CGTTGATTCT TTATTAACCA 780


ATTTTTGCGA aGTTTTTCAA ATTCCAAAAA TGCGTGAATT TGATCGTCAT TACTTCTTAC 840
TTATTTTATT TAAAAATATT ATCTATAACC AATTATTTAT TTCACAGCAA CCAAATACCG 900
CAATTCTAGA TTTTTTAGTC AATAAAAATA GTACTTATCA AAAAGATTTA CAGAAAAAAA 960
TCTATGCTTT TTATCAAAAC TTTTCAGCGC ATTCCCCTTA TCAGTTAAGA TTTAAGGATC 1020
AGGAGAACCC AATTTGGACA TGTGAATTGT TGTTACGAAT TTATAAGCAC TATAGTCCCC 1080
CCAAAAAAAT TAAAATTGGT GTTGCATATT CTCGCGATTT TTACATTGCT AGCTTTATTA 1140
TGATGAAAAT CAAGCAAOCA TTTAGTGAGG ATATTATTTT TGAACGAACC GATTTTTCTG 1200
GTTGTGATAT TGTGATTACT GACTATCCCt TATTTGATTT ACCAAAAGAA ATAGACCGAA 1260
TTTACATTTT AGAAGAAGAA TTAACCCGAG AAGATTGGGA ACTAATTTTC AGTAAAATAA 1320
GTATCGCCAT TTTTGATTTG CAATATTAAA ACAAACATCA AGCGCCAACC TGTCTCACCA 1380
GCCATAATGA AAACAAGCGA CCTACAAACC ACTTCTGCCA GTTTTAGTCT GACAGAAAAT 1440
AGTTTGTAGG TCGCTTGTAT ATTACTGTTC TTTTCTATGT TACCTTCAAA GACAATCTAG 1500
TAACGTACTT CAGTCCTTAA AGTGCTACTG TTCCACCGCT GATCAGTTTT TAAGCCTGAG 1560
ACGArGATCC AAAGTGTTTT TTGTCTCAGG CTCTTACATT TTTTCCAACA AAGGTATTGC 1620
TaGGCGCTAC TTGAAGCTGA AAACTAGTTT CTCTATCGAG ATTCCTCATC AGTAGACTCA 1680
AATTTAAAAC TTTTTTCTAT AAAAGTGaCT TCTTTTGCGT ATGTTTTTAC AAAAGAAGAA 1740
TAGTCTTTGG GATCTTTGAC CTTTTCCAAT TGGTCAACAG AGACTTCTAA TTCAAATCCT 1800
GTCCCATCTT TAAAGAGATA ATATTTTGAG ATGTCTGTAT ATCCTTGATT CCGTTTTAAA 1860
ATATAGCCGG TTTGTACATC AGCAATTTTA TCAATTGGGT ATGCTTTGAT AAACTCTTTA 1920
CTTGTATCCA AATAATCAGA AAGACTGTTG ACAAGATTGC CAGAAAGGTA CATACCTGCC 1980
GATGTATAAT AGTCTCTTTT TTCATAAAAA CTGATATTGC CTAGATATTT TGTATCTGGA 2040
TACTCCTCTG TAGAAGGTAA AATTAAAAAG TTCTTTCCGC CACTGGTTTC AATATCGGCA 2100
TCGCCGATAA TTTCTATTTG GTAGCCTTGA TTAAATAAAA GTTTATTTTC TTCTATTGTG 2160
ACTTTTAGTG GTTTTTCTAA ATCAACAGGT CCAGTTGTGG TAAGTCCAGC TTCTTTTAAA 2220
GGATAAACAT CTTCTGATGA AGCGGAGTAA TCCCAATAAT CATCTTTCtC CGCAAGATCA 2280
ATATATACTA TAAAGCCAAT TGCTACAGCA ATGCCTATCC AAATAATTTT TTTGTTCTTT 2340
TTTTGTTTTT TCTTTTTTTT CGTCTGTGGT TTTACCACTT TTTTATTGGG GTTAGGAAGT 2400
TGCTTTGCTG TATCTGTTCT TGTGACAGGA AAAAACGACC ATTTTGGCTT TTGAAAAAGC 2460
GCATCATATG kATCATAAAA ATTCTTTTGT TTAGCCATTT TTTCTCTCCA CTAATTTTTT 2520
TGATAAGGCA ATTTCCTCAG ATGCTTCGGA GACTAATCTT TCATAATCAT CTGTGATACT 2580
TTCCGACAAC ACTCGTATGG TACTTAAGAT ACCATTGGTT GCAAGTGTTA TTTCwTCTGT 2640
TGACAATTTA TCTTCTTTAA TGCTTAAAAA CCGTTcTGAT GCTCGTAAAA TCTCTGGTAC 2700
CTTATGATAT AAAAAATCAC TAAAAtGAGT AATTGcTTCA GGATACTTCA CTAAGTGTTT 2760


AAAGATGCCT TtAGCGGAAG CAACACCCAG TTTTACTTGT TCAATTTTTT TTAAATTTCG 2820
TGACTTATTc GTTAGTTCGT TTAAATAAAT AATTTGTTTT TTJAATTCGT CCATAATTCT 2880
TTGGTACATC TCTAGGTCAC TGTCCGTTAG ATGGTGCgTC GTTTGATATT TTTTTAGATT 2940
TTTTTTATAa CGGAAATACG TACGACCATT TCTATAAGTA ACCCAACCAA TGTACCCAGT 3000
CCCCACTAAC AAAACCATTA TPPPATTCAA ATCATGTACT CCTTTTTAAA TTAGTATATA 3060
TTACTGAATA ACACACTGTC CAGPTTAAAA TTGACAAATA AGTATGAAAT GTTACTCCCC 3120
CAATTACTTG TGCACTTTTA TTGTATCATA TAGGAAAATT GTTCTGTTCC GCATTTGGAC 3180
AAATTGTCTA AATGTCTAAG GGAATCTTTT TTTGTTTAAT ATGTAGCAAA AAACACTTGC 3240
AAGAATTGAT TACTTCTCAC AAGPGTTTTT TATTGTCTAA AACAACTATT TACATTTCAT 3300
TTTATTAAAT AGTGAATAAA TAAATTAPTG CAGCTAATGA AGCaCCTAAG ATAGGTCCCA 3360
CAATTGGcAC CCAAGAATAA p:pppaatccc AATCACCTTT TGTTTTCACA GGAATAATTT 3420
GGTGCGCTAA ACGAGGTCCT Zi2i2iTPAPPTG PAGGGTTAAT TGCATAACCT GTAGGTCCGC 3480
CTAATGATAG ACCTAGTCCT Z\APAPPAAAA TCPPAACAAG CATTGGGTTA ATGCCAGCCG 3540
CTAAGTCGTT TTGAGAAAAG PPTZaPTAAAP PAAAAAPPAA GACAAATGTG CCGATAATTT 3600
CTGTTAAAAC ATTGGCTGGA TZiPTTAPGGA PAGPTGGACC AGTGGCGAAG GTTCCTAAAA 3660
TAGCACCTTT GTCTTCTGTA ATPTTPPAAT GGGGTAAATA AGCTAACCAG ACAACTAAAC 3720
CACCAATAAA ACCGCCTAGG PGATATAAGG TAAAACCATG CCCCATTCAA 3780
AGTTTCCAGT AATTGCCATC GPAAPTGTTA CAGCAGGATT TAAATGCGCT GGACTCATAT 3840
ACCCAGACAT GTAAACAGCT AATGTAAPAG CAGCGCCCCA ACCTAAAGCA ATAACGACCC 3900
AACCAGAAGC AAAGGCTTTG PTTTTPTTPA AGTTAACTGC GGCACAGACG CCATCCCCTA 3960
GTAAAACTAA AATCATCGTT CCGAAAAATT CACCGAATAA TTGTGTCATC ATCGAAGTTC 4020
CCATTTTTTC TTCACCCTAC TTTPAAATTT TTTAAATCTG ATTCTTCAAT TACTTTTTCT 4080
AATGTTTCAA TGTGTCGTGC TPTTTPTTCC GCTGTCCACT GATAATGCTG TGCCATTTCT 4140


TCAATGACTC CCGCTTTCAC TTGGTCCAAA CGATCACGCA TAAATAATAA GTGGTTGGTC 4200
CGTCGCAATA AAAAGTCGAC GGGTGTTAAA GCCATTTCTT CTTCCATCGC ATAGTTTAAA 4260
GAGACAGTTT CTGTTAAGGT TAAACCAGGA ATTACTTTGG CATCATCAAT CATTTCAAAA 4320
ACAGTTGGTA CGTTGGAGCC ATATAAATGA GCTAAGTATA GTGCATCCTT TTCACTTAAC 4380
CCTTTAGACA CGCCATGTTT TGCCAACGCT TCTAATTCTT CGTCTACCGT TGCAGGGTTC 4440
AATTTTCCGC CTGAAACTGG ATAGTTTTTA GAATCAATTA AGGTGAATTC TTTCTGGTAT 4500
TCTTCTGCTA AGATGGTTTG AATCATTTTC ATCGCACCTT CTGCCATTTT ACGATAATCT 4560
GTTAATTTCC CACCAGCCAA AGTGAGCAAG CCATCCGCTG ATCTTTCCAA TGAGCTGCCT 4620
CGAGAGACAG CTGAAGGATT GACTTTATTT TCCACCAAAC TGTCTTCCAG ATGGTTTAGG 4680
ACTTCTTCAA TTTCTCGTTT ATCTGTTTGA TTTTTTTGGT ATCTGTCAAC GACATCAATG 4740
ACTTCATCAA TACTTTTATC TGAAAGTTTG CCATTGTTTC CCCCATTATA ATCTGAGCCA 4800






CCGTTAGCAG 


TV TV TV rn m T\ TV T\ /"» /"* 

AAATTAAAGG 


TCGTAAGCCA 


GCbbAGbTbb 


b l.L bAAl Alb 


A X b AA 1 I b 1 b 




AATTGTACCT 


CTGGATAACG 


ATTATTbACT 


ACTTbbAAl A 


bb I AA 1 bbAb 


A 1 bRb 1 i ibl 


4 Q90 


TCTACTGTTG 


*?v m r** rn m /■* T> TV TV 

GATGTTGAAA 


GTCTCCATGG 


TAATCTbTAT 


bbbibb 1 bbb 


TA 7A fl TAT Tl'TPT'T 
AAAA1 Al bl 1 




TTTTCTTCTC 


TTGGAACAAC 


GAAGACCATG 


CGGCCATCTT 


GT ITAbb i b 1 


A 1 b AAAA I Ab 




GTTGGTTGAG 


GAACATTTAA 


ACGGCTTTTG 


m ta v** tv ta rri tv 

TCGACAACTA 


AATGAACCCC 


TTTbbllbbA 


CI rtA 

oiuu 


CGCATTTGTG 


GTGTAAATGA 


ATCGTTTTTG 


TCTAAGCCTC 


TTAATTTATC 


TGACbAAbbA 


0 1 ou 


CCAGTAGTAT 


TAATCACAAC 


ATCTGCGTGA 


t\ m ^ m /"» tv tv tv s**m 

ATCTCAAACT 


GTTCGTCAGT 


TAGTAbbibb 




TTTGCTTGAA 


TGCCAGTAAT 


TTTCCCTTCA 


TCATTGTATA 


AAAAGCCCAC 


CGCTTGAAbb 


coon 


CGACTCACAG 


CATGCGCCCC 


GTCAGCTACT 


GCTTGTTTAA 


TATTTTCAAT 


TV TV /"•iTi A TV T\ r*r* T\ 

AAC T AAAb b A 


DO** U 


GCATCATTAT TTCGGTAATC TAAATAAACA CCGCCACCTT GTAAACCTTC ACTTTTTAGT 5400
TGTGGCTCAC GTGCCAAAAC CTCGTCTTTT GTTAACAGAT AGTTGGCGTA CTTGGTACCT 5460
GTTACATCAG CTAACTGATC ATATAAATCC ATTGCTATTT TGACAGAAAA TAGATTAAAT 5520
GTTGCATTTG GCTCATCATA AATCGGCAAT AACATGGGAT CCGCTTTAGG GATGTGTGGC 5580
GCAATGGATT GAACGACGGC TCGTTCTTTC ACTGTATCTG CTACCACCTc CACGTCAAAC 5640
GTTTTCAAAT AGCGGATACC ACCATGAACT AATTTGGTTG AtCGTGAAgA GGTGCCCTCT 5700
GCGAAATCCT GCATTTCAAT TAACCCTGTC TTCATTTTAG CGGCACTGGC TTGCAGGGCA 5760
ACCCCTGCGC CAGTAATGCC ACCACCAATA ATTAAGACAT CTAAGTGTTG TGTTTTTAAA 5820
GCGTCGATTG ATTGTCGGCG TGTTTCTATT GAGAAAGTCA TAGAAAATTC CCCTTTTCCA 5880
TTTACTTATT TTTACGTTTG AATTGTTGGG TAGCTGCTAC TGCTTGTTGC CAACCTTCAT 5940
ATAAATCTTC TCGTTCTTCT TCCGCCATAA TTGGTTCAAA TTGTTGTCCT TCTTCTTGGA 6000
ATGCTTTGAT TTCTTCTAAG TCTTTCCAGA AGCCAACCGC TAAGCCCGCC AAGAAAGcTG 6060
CGCCTAAGGC CGTTGTTTCT AAGTTATGGG CTCTTTGAAC AGCAGTGTTT AAAATATCTG 6120
CTTGGAATTG CATCAAGAAA TCATTGTTTG CTGCTCCACC GTCTACTTTT AAGACGGGAA 6180
TATCTATGCC TGTATCTTCT TTCATCGTGT CAATAATGTC ACGAACTTGA TACGCAACTG 6240
CTTGAAGTGT CGCTTTGACA AAATCTTCTC TCGTGGTACC GCGTGTTAAT CCAAAGACAG 6300
CCCCACGCGC TTGTGAATCC CAATATGGTG CGCCCAAACC AGTGAATGCC GGAACCACAT 6360
AAACTTCATT GTGCCCTGTT GACGCTTTGG CTACTGCTTC AGATTCAGCA GCCGTTTGCA 6420
ACATTTTTAA GCCATCTCGT AACCATTGAA TCGCTGAACC AGCCACGAAG ATACTTCCTT 6480
CCAAGGCATA ATATACTTTG CCGTTAATGC CATAGCCAAT AGTGGTAAGC AGATTATTTT 6540
TAGAAAGTTG AGGTTCTTCG CCAGTGTTCA TGACAATAAA TGAGCCTGTT CCATAGGTAT 6600
TTTTAACCAT CCCAGGTTCA AAGGCCATTT GACCAAATAA GGCGGCTTGT TGGTCACCAG 6660
CCATCCCAGC GATAGGAACT TCACTTCCGT AAAAATGGTA ATTTTTTGTT AATCCATACA 6720
CTTCAGAGTT CGAAACGACT TTCGGTAACA TCACTCGTGG GATATTTAAA AGATCCAAAA 6780
TTTCTTGATC


CCAATCTAAA 


TCATGGATAT 


XAAAb AAb A 1 


PPTPPPZIPTT 
bb,l V^l^VjR^ I 1 


bbAl 1 lb AAA 


6840


AATCAGTGAC 


ATGAGTATCT CCCGTTAATT


TCCAAAbbAb 


PPH APTPTP7A 


ZiTTPTPPPZiZi 
/\1 lulVvVjVwfin 




ACATCAATTC 


TCCATTTTCT 


GCACGTTCTT 


GbbbALb lib 


I n\j\3 I un 1 U 1 


AAbA 1 bb AAb 




GAACCTTCGT 


TGCTGAGAAA 


TAAGCATCAA 


TGATTAAbbb 


Au 111111 LA 


1 bAAl bA 111 


7020


CGCTGTAGCC 


ATCTTCTTTT 


AGTTGATCAG 


^ TV TV m 7\ /"* rp f** T"* 

CAATAGGTG1 


Lb 111 oijLb 1 


bAl lbbbAAA 


7080


CAATTGCGTT 


ATAAATGGGT 


AACCCAGTTG 


/*i m fn m rn v™* /"* TV 

CTTTG1 LCCA 


AALAALbb 1 1 


bill bbbb 1 1 




GGTTTGTAAT 


CCCAATTCCG 


GCAATATCTG 


TAGGTTTCAC 


ibb 1 (jAL 1CA 


ATTAAAf-'AAO 

Al lAAAbAAb 




CTGCAATAAC 


GGATTGAACA 


GAATTCCAAA 


TTTCATTGGC 


TV rrl T> TV <T» fr\ rn 

ATTATGTTCC 


AbbbAbbb 1 b 




CATTAGGAAA 


ATATTGAGTA 

n x n, x x \jnvj x n 


AATTCCTTTT 

nn x x v^v»* x x x x 


GTGAACTACC 


AATTTTGTTT 


bb l 1111 1 Al 




CAAAAATAAT 


TGCACGAGAA 


CTAGTCGTTC 


CTTGATCAAT 


CGCCATAATG 


rp tv m rp m rp rp ^ rnrn 

TATTTT lb 1 1 


7380


CTGCCATGAT 


AATTCCTCCT 

fin 1 X \*t\*r X >— ' ^ J. 


AAACGTTTTT 

nnriv vj x x x x x 


GAAATCGTTT 


TCTTTATGTC 


CCTATCTTAb 




AAATAAAAGA 


ATAAPTAAAP 

ri X r\n\^ 1 rtrt-rW-* 


ATGTCAAAAA 


CTTTTTTTAT 


fflTimTi Ti Ti Ti TV TV TV 

TTTAAAAAAA 


fnipfTl Tv APPJ\T 

TTGAAAbbA 1 




TTCTACTAAA 


APPAAPAAACI 


ACAACGCTTT 

nvnn\j\jVrf x x x 


CAATAATACT 


TTTATTATCT 


CTAATTTTTC 


7560


AATAAAAAGA 


AAPATTTTTG 
no^m x x x i\j 


TTTAAAAAAG 


CCTAATTTTC 


GACTTTTAAT 


m rn tv 7V TV tv tv O /"^ 

TGTAAAAAbb 


7620


CTGGTGCGAG 


GATCAGACAA 


TTTCCGTCAC 


ACCAACCTTT 


TGTTTATTTC 


TCCCGCAAbb 


7680


TGCGTATTTT 


PTTTTTTAAT 


TGTTGCACAT 


CATAGGTCGA 


ACTTAATTCT 


/~» T\ TA TV 7\ TiPTATI 

G AC AAAAbA I 


7740


AGACTTCCGG 


TPrTTPPTTGA 


TAGCTCGTTT 


GCTGATAATT 


GGTCACAATT 


A A nTPATTlTT 

AAA 1 b A 1 A 1 b 


7800


TTTGTTCAGG 


CTGATACCTC 


TCCACCAGCG 


TGCCATTCAT 


ATTTTTAAGA 


CI AAbbAl 1A 


7860


ATACTTGTGT 


TAACGCCGCT 


TGATAAAGAG 


CATCCATTGC 


rn T\ TV tv m Tv jvmm 

TAAATGAAT1 


bbAAi 111 1A 




TTTCACCAAT 


PATTTTAAAT 

V^ii X X X X -TTJT-Ti X 


TCAATTAATG 


CCAACACGCT 


TAAAl A J. lib 


Ab b Ab Ab AAA 




GTTCATGCAA 


ACTATCTCCT 


GGTTGATACG 


TTTCATCAAA 


AC AAbL I AAA 


bb lilllblA 




ATAACGTTGT 


TGAAAAAGAA 


GCTAATTCCC 


GACTAGATAA 


TTGCTGTTCT 


Til IbbbAbA 


8100


TATGGGCCCG 


ATCAAAGACT 


TCCAATTCTC 


m m rn 7V TV TV TV TV T\ 

CTTTAAAAAA 


ATACAGTTI 1 


bbAl bAAl 1 1 


8160


GTGAAAAATA 


GTAAGCCATG 


GTTTTCTCAA 


GAGTAGGTTC 


AAA1 T 1 A I bb 


bbbbbAl AAi 


8220


ACAAGACCAC 


ATTTTCCAAG 

*i X X X x wnnv 


ATAAACGTAT 


/-> ti •»"»/■■» rrt Y\ TV TV /*** TV 

CAGCTAAAGA 


AACAGbbbAA 


bbAbb 1 bbbb 


8280


GAATTAAGGA 


ATATTCATCA 


AAAGCTGATT 


CTGGTAAGAC 


CbAbAl 1 b 1 b 


A'P'PA^AAAAA 

All Ab AAAAA 


8340


CAAAATGAAT 


CATGCTTTCT 


CCTTCGTCTA 


ta mrn /"*rn TV A m /** TV 

ATTCTAATGA 


ATAbbb Ab I A 


i" A A A AA HCC 7i 

b AAAAAb b b A 


8400 


AAAACAATTG 


TCGGACTTGC 


TTAAATAACG 


GATCTTCTTT 


TV Ti Ti TltPPTV Tt 

AAAAl b AA 1 b 


Ailliblllb 


8460


TCAATTGTTG 


ATATTTTTTT 


GAAGTGAcCG 


CTAGGCGCTT 


TTTCGTAATA 


CTCAACCAAA 


8520 


GGCTAACTTT 


TAACGCATTA 


TGTTCATTAA 


AAGTAAAACC 


TAAGCCAGTC 


TCAATCCCTT 


8580 


CAATTATCCG 


TTTATTTAAC


GGCGACATCG 


TTTTTTTCTG 


GTGTATTTCA 


TAGGGAGTAA 


8640 


TATACCAATA 


CAACTGAAAG 


TAAAAGTAAC 


GAATCTGCAA 


TTCTTCTCCC 


TGTAGTTGTC 


8700 


CATTTTTTAT 


TTGCAAATCA 


AAGACGGCTA 


ACAATTGATT 


GAGTTCCTTA 


ATCTTTCGAA 


8760 
ATAAAGAAGA


CTCACTAAtC 


aTAAATTTAG 


TCGtTAATTG 


t> ti r** tv tv Tk tv tv 

AACAATAGAA 


AACTCTTTAT 




TACGATATAA 


ATAATCtAAA 


AGTTGAAATT 


TAAGAGACTC 


TTTTACAAAT 


GCTTCAATTA 


8880


AATGATTCAA 


CGAAAAAAAA 


TCCGATAAGG 


TGATTGATAG 


CTGTTGGCCA 


m TA H 4 ! TV TV ^ TV TV 

TCATAAGTAA 


8940


GCTGACAATC 


CTGTCCAAAA 


GGTTTCAAAA 


AATAATGTAA 


ATCCTCTAAG 


TCTTTTTCAA 


9000


AGGAGGcTTT 


TGTGACCGCT 


AAAAATGCTG 


tTAAATCAGA 


GACTGAGCAG 


GTGCCGCCTG 


9060


CTAAGATGGT 


TTGTTGTACC 


GTTAATAATT 


GTCGCATCGC 


TTTCTTTTCt 


AATAATGTCT 


9120 


CAACTTTCAT 


CGTTCTCTCT 


CCAATAAGCC 


GAAGCACCCA 


TCAAATTTGT 


AATATCATAA 




ATATTGGTTA 


AGGAAGGTTG 


ATAAAAATAA 


TCTTTTTGTA 


ATAGGTCTGT 


CAGTGTCTGA 


9240


CCTGTTTGGA 


TACTTAATGC 


TAAAGTATTT 


ATTTTTTCTA 


AACAATTGTT 


IT1 m fTl TV TV ^ TV TV 

TTTTGAACAA 


9300


AGCTGTGCAC 


CAAGGACCCG 


TTGCGTCACT 


TTATCATAGA


TTAATTTGCC 


TAAAATTTCT 


9360


GTCCCATGTT 


GTAGAGGCGG 


CGCTGGCTGT 


CTAACAATTA 


TGGAAGCAAG 


TGTTTGCGGA 


9420


AAAAATAAAC 


CTTCCGTTTC 


GGTTAGACCT 


GTACTAGCAA 


GGTAATAGTC 


CCCCACTTTT 


9480


GTTCCCATTG 


TTCGCAAAGA 


GCCAATGAAT 


CGGTGTGTTT 


TTTCTTCTAA 


ATTGTTGGCG 


9540


ACTACTAAGC 


CAGTTCGTAC 


CGCATTGTTC 


ACTAACGGAG 


CATAAAAAGT 


TTCCGCGACT 


9600 


GGTTCATTCA 


TAACTGAAAT 


GCAATCACCG 


ATGGCAAAAA 


CATTCGGCAC 


TGAAGTTTGT 


9660 


AAATAAGCAT 


CCACCGCGAT 


TGTCTGATCC 


AGATTCCGTT 


GAATTTTTTT 


ATCCAAATAG 


9720


GCTAATTGAG 


GATGTAAGTT 


TAAAGCAAAA 


ATCCCACTGT 


CACAAGAAAT 


TTCCTGCTCG 


9780


CTTGTTTCAA 


GAACAATACC 


ATTTGCTGTT 


TCTTCAATTC 


CTAGGACCGT 


fnm/^rr*m ^ TV TV TV TV 

TTCTTCAAAA 


9840


TGGAAAATTA 


CTGCTTGTTT 


TTCCAAAGAT 


TTTTGAACTT 


CTGCGACCAT 


TTCTTTATCA 


9900


AAATATTTAG 


GCAATAGATT 


TTCCAAGCTT 


TCAAAGACAT 


GGACCGTTTT 


CTTCATTTTC 


9960


ACAAGAAAAT 


CAATAGCTTC 


CATTCCAATG 


GGTCCTGCAC 


CAATGACGGC 


TACTGTLIbb 


10020


CTATTTTCCA 


GTAATGGAAC 


AGCCGCTAAA 


GCACCTGATA 


AAAATTTATA 


m mm/^ TA m TV /"* FTl 

TTTGAGTAG1 


10080


TTTTCTGTTT 


GACTACCACG 


AATTTGCGTG 


GAAAACTGGC 


TCGCGCCTGT 


CGCTAAAA1C 


10140


AATTTATCAT 


AAGAATACCA 


CTGTTGCTCT 


TCCTTTCGTG 


TCCAAGCAAT 


CAATTGGTTC 




TCAACATCCA 


TTGCCACAAC 


TTCTCTATTT 


AAAAGCAACT 


GAATCTTTTG 


ACGACGTAAI 




TCCTCTTCCG 


TTATGTAACG 


TGCTTCGTGT 


AATTCATTGA 


TGGTATGATT 


t\ rv tv tv m tv r^* r** T\ 

AAAATACGCA 


10320


CTTAAGCCAC 


CAGATAAATA 


CCCCACTGTT 


GCTTGTTTAT 


CAATTAAAGA 


AATTTCTGCT 




TGGGGATATT 


TTTTCCTTGA 


AGCAATTGCT 


GCGGAAATAC 


CTGCGAATGA 


m ^ /"*• tv tv tv 7\ 

TGCACCAATA 


10440


ATCACAATTT 


TCATTTGTgT 


CCTCCTCTGT 


CAGGACCTAc 


TATACTTCTC 


TCAGTTTATC 


10500 


ATGATTAAAA AGAaTGCTAA AgCAACAACt 


ATAGtTTTCC 


CTTAAAATAA 


TTATtTTTGC 


10560 


TGAGCGCATT 


GCTTTTCAAG


CACTTTCATT 


TTCATTTATA 


ATAAATGTTA 


CTACTGTTTG 


10620 


AGGAGGAGTT 


TTCATGTCTG 


CAACAAAAGA 


GCAATTGGaA 


CTATTTTTAT 


TTTATTTATC 


10680 


TGAAACCCAT 


ACAAAAAGTT 


TATCTTTACA 


TGACTTAGTT 


ACAAGTCGAC 


CACGCCCAGA 


10740 
AGAAGLGCGT


ATTTTATTAA 


ATATTAACGA 


GGTTTTCACT 


TACTATCATA 


GPGPTPGTGT 


10800 


TCTCTACACA 


AGTGTTCCTG 


CTCTTGAAAA 


CAACAAGTCT 


PAP.PPTTTTT 


TTPAAGPCTT 

x J. LnnUV<\* X X 


10860 


CGAAAArTTT 


TATTTTGAAT 


TAAAACAACA 


TTTTTTCAAT 


P.AAGAAGATG 


AAAPPAATCA 


10920 


ATTGAATGAA 


CGTCTAGAGG 


AAATGAAAAT 


TGCCTTTGAA 


PAATTGAPGG 


ATGAPTAPAA 


10980 


TGTTCTATAA 


TAGAAGAGAA CAATTCGTTG AGAAGGGGTC 


PTTATP.AAP.P 


AAPA AP.A APT 


11040 


GATTGAACGT 


TTAAAAGAAG 


AACTAAATTT 


GCCATTTTTT 


AATPPTATP.P 


TAPAAPAAAA 


11100 


GAACTATTCT 


GAAGCAGATT 


ATCAACAAAT 


CAAGAAAGAG 


1 1 e\i\ 1 1 I 


APTTTGATGA 


11160


TTATGTGCGA 


AATGTTGAAA 


ATTAACAAGC 


TAAAACAACC 


111 ort 1 1 1 in 


ATAAAATPAA 


11220 


AGGTTGTTTT 


ATTATTAAAT 

X* X X X X ****** X 


AAAAAAAATA 


ACAAAAAAAC 


LALLGGLAG 1 


TTTP.PPP.ATP 


11280 


GAAAAGGAGT 


ATTTAPTPAA 


TAAGAGTAAA 


ATGAAAAATA 


Ti Z\ 74 BrrTTPT 
/\rtAn.bo 1 1 0 1 




11340


AATTAATAAT 


APAATAPACT 


TTCTCTTTTG 


TCCATAAAAA 


AGAL A 1 oAAA 


AAAATATTAA 


11400


ATTCGTTAGT 


TATTTTTATT 

X *T> X X X X X ** X X 


AGTTTTTTAT 


TTTTTCAAAA 


GAAiG 1 1 oC I 


C ATPPP.PP.TA 


11460


ATGGCAAAGG 


TTAATTCATC 

X X *ui X X V/il X V/ 


CGTTGCTTGA 


TCCTGCTTTT 




ATTAAATAAA


11520 


TCTGCATCAA 


TArrTGTTTC 

X **\—* W X \J XXX V*^ 


TGGATCTGTT 


GCTAAAATCA 


1 i lLnl lu^n 


TPPTPPTAPA 


11580


TATTCTGTAT 


AAGPATGTTG 


AAATTTTTTG 


TGAATGCCAA 


G I. AL 1 1 G 1 GG 


Lo 1 Lbu 1 1 1 i 


11640


AATGTTCCGA 


rTTTTTTGAG 

\^ X X X X X X VJ 


CATTAATTCA 


TATTTTTCAG 


I ALU 1 1L1 i J, 


A A ATTTTCPt - 
t\r\t\ 1 1 1 luH. 


11700 


TGAATTTCTG 


PAATTPGTtC 

^nn x x V-/ v_j x l. v> 


GGCTGTTAAA 


TCATTCACTT 


tnl 1 L 1 Inl^ 


PATPPPTTPA 


11760


CGAACTTCTT 


PGTAATATGG 


ATyCATGACT 


TCGCCAATGy 




TTTAAPPATA 


11820 


TCATtAATTG 


TTkGAACGTA 


AAATCTTAAA 


TTTGGTCTCA 


1 AALLA 1 LLt 


1L1 1 1 X V^V^v^ 1 


11880 


TTTCTCTtTC 


ACTCCATTTT 


ACACTTTTTC 


TATTCCATCT 


LALAAo luoL 


APTTTTTPAP 


11940 


TCTTCATTCT 


TTTCATTCTT 


AACAGAGAAA 


ACATAGTGAG 


LCM 1 1111 


PPPAPPPTAT 


12000 


GTTTTAATGT 


ATCGATCCCT 


AACAAGCATC 


TGGTTTCGAA 


InoLoHtnl 1 


PATA^APPPT 


12060 


AAGTTCGTAT 


CACGAATCGT 


TGAAATAACT 


TCTGGAAAAT 


1 1 AA i ALA 1 U 


P A ATf^Pf^T AP 


12120 


TGTTTTGTCG 


CTGCGGCAGC 


TTCAATCGCA 


TACLL1 I IG 1 


GLLA1 1 L 1 1 1 


PAPAPATAAA 


12180 


TGATAACCAA 


TTTCTGGATA 


GCTCTTACCT 


TTTACTAATT 


PATTPPTP.AP, 


APPAPAPTCG 


12240 


CCGATGACTT 


CGCCTGTTCG 


CTTCTTCTCT 


AACGCCCATA 


AP.PPP.TAP.PP 


GTATTCTTGA 


12300 


TAC30AL X G 1 A 


AGTTCCACTT 


TAGCCAGTTT 


GCGACCTCTT 


PATPAGAAAA 


AGCGTGTTCA 


12360 


I AGGLAi ALA 


TCACATTTTC 


ATCTTGCAAA 


AATTTTTTCA 


ATTPPAPTTP 


ATCTTCCGAC 


12420 


TGCCATTCTC 


TGACAATAAG 


TCGTTGTGTT 


TCGAAAATAA 


TTTTCCGTTT 


CAAGCCCTTT 


12480 


CACTCCTTAC 


ATAAAGAAAA 


ATTGACTGGT 


TTAAGTACCA 


GCCAATTTTC 


TTTTCTTATT 


12540 


TATTAAGCAC 


CTTTACCTGC


TTTACGTTGG 


AAGAACTTCA 


CAATTTCCAC 


AATTGGGATG 


12600 


ATAGCAAAAG 


AAGTACCTGC 


TACAATTGCC 


CATTGATATG 


CATCTAAATG 


TGTGACGCTG 


12660 


AATAAATCAT 


TAAATCCTGG 


AATAACAATC 


GTTGCAGCTA 


ATAGCAAGAA 


CGAAACAAGA 


12720 
ATCCCATAGT


TAAATGATTT 


GTTGCGbAAl 


A APPP A APTT 
AAbH-AAL. 1 1 


T P. A A A A T T p. A 


TTP ATA A ATP 


12780


GACTTCACAT 


TAAAGGCATG 


/"•TV 7\ /"* T\ TV rp Op TV 

GAACAAT I bA 


HI lAAlbbbA 


A AP.T AP.PP.A A 


PPPP ATP.P.TT 
bbbL-A I OO 1 1 




AAAGCATCGC 


CGTGTTGTAA 


AT LA 1 Ab AAb 


bCi Ivj^IjL/Lu 


ATA AHTTTP.T 


PTTAP.PTP.Pn 
Ol 1/toI^IO^O 


12900


GTATGTGCTG 


GGAACATAAT 


TGACAI 1 1 1 b 


T A P A P A A TP A 


A AP.TPAATP.P 


PPPTTP.PP.TA 
v^VvL 1 1 O^O In 


12960 


ATCCCTTGAT 


AAACAACACT 


/■"•/-» fTI rp T\ TV TV TV f+f* 

GCTTAAAAbb 


fpr PP A P A P. A 


AP.A AP.TTPP A 


TTTTTT APP A 
111111 r\\^\^ti 


13020 


CGAGGTTCAT 


GGCTCATCAC 


ATCALbi 101 




PA PPT A APPP 


A ATPP.PTnnPI 
r\r\ 1 Lu^ 1 OOO 


13080 


AAAGTATCCG 


TTACTAAGTT 


AATCCACAAT 


A nfTPft A.PAP 
AAb 1 bAAb Ab 


PT A AP A A PPT 
Kj 1 AAx-rtAoo 1 


ATPPPA ATTP* 


13140


AGCATTGTTG 


CAATAAACAA 


GGTTAATACT 


TbCbbl AAb I 


T A PPTP 7A 7V 7\ P 

1 Abb 1 bAAAb 


rp a AATATTP.A 
iAAAlAl lun 


13200


ATCGTTTTTT 


GGATATTTGA 


GAAGACTTTA 


CGTCCTTCTT 


bl Ab 1 bbbAL 


A ATA A TTPTT 

AA1 AAi toil 


13260


GAAAAGTTAT 


CATCGGCAAG 


GACCATATCT 


GAAGCACCTT 


TTGATACTTC 


TGI IbbbblA 


13320


ATTCCCATAC 


CAATCCCGAT 


GTCAGCAGCT 


TTTAAGGCTG 


GGGCATCATT 


rp t\ /~ , /^/~«/^Tvrn/~»Tv 

TACGCCATCA 


13380


CCAGTCATCG 


CAACCACTTT 


TCCTTCTTGT 


TGCCACGCTT 


rn tv TV f'"* A TVITP/* 

TAACAATCCG 


AAbCTTbib I 


13440


TCTGGTGAAA 


CACGAGCATA 


GACAGAGTAG 


TGTCCCACGA 


CTTGTGCGAA 


rn rp rp rn m t\ ^r^T 1 

TT1 iTLAlb 1 


13500


GACAATTCAT 


TTAATTCAGC 


GCCTGTAATG 


ACTGCATCGT 


CGTLbbbl 1 L 


111 bA 1 1 O 




CCTAAACGTG 


CCGCAATTGC 


TTCTGCTGTA 


TCACGGTGGT 


GACCCGTAAT 


/"•ATA ATTPPA 


13620


CGAATACCCG 


CTTCTTTCGC 


GACTTTTACA 


GCATCTGCAG 


CTTCTTTTCT 


X lb 1 bbb 1 LA 


13680


ATCATACCGA 


CTAAACCAGC 


AAATGTTAAA 


TCTTTTTCCA 


CAAGTTbAoA 


AblbAi 1 ibb 




GCTGGAATCG 


TCTCAACATA 


TTTATAAGCC 


n m m j*"> m tv tv tv tv 

ATTCCTAAAA 


bAbb 1 MbOt 


1 1 b 1 1 lbbb 1 


13800


AATGAGGTGT 


TGGTTTTCAA 


AATTTCTTGA 


CGTTTTGTTT 


CA1L 1 AAI bb 


/ib 1 1 1^1 


13860


CCATTTGATA 


AGATTTCCGT 


ACAACGTTTC 


tv Ti m Tv T\rnm/™»7vm 

AATAATTCAr 


C I Go 1 bbAbb 


TTTPHPSPAA 
111 brtb f\\3t\t\ 




ACTAGGAAGC 


CACCTGTTTT 


TAATTCATGA 


TV TV /■"■mTV/TpO'TV. 

ACAGTAGTCA 


rp rp t\ tv rnrprn Tv iT"* 

TTAATTTAbb 


AfPAPAPTPB 

A 1 bAbrtb 1 b A 


13980


AAAGGAATCT 


CTGCCACACG 


AGGTTCAGCT 


GCTACTTTTT 


CGGTTACGTT 


tv tv 7\r™ , ^OA r P/*^A 

AAAbbbA I b A 




TCTAAACCAA 


ATTGTACTAA 


GGCAGTTTCT 


GTTGGGTCAC 


CAATTAATGA 


ACCATCTTGA 


14100 


GCGATTTTTG 


TATCATTCGT 


AAAGTTCATG 


ATTTTTAAAG 


CCATATTATC 


GGCAGGGATT 


14160 


TCTGTGGAAG 


CAGATAAAAC 


TTGACCATCT 


GTATACAACG 


CTTCAACGGT 


CATTTGATTT 


14220 


AAGGTTAACG 


TACCTGTTTT 


ATCTGAACAG 


ATAATGTCTG 


TACTTCCTAA 


TGTTTCAACG 


14280 


GCTGGT 












14286 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
CAATGCTCAT TTCCACTCnT CCTAAATCTG TTATTGCCAG TAGTTTTGCT CTAACTTTTC 60 
TAAATaATGk


GaATAGTAtC 


CATGCTGTCG 


aACCTCAAAT 


TCTTTTAAGT AATCtAATGG 


120 


AATACTCyTA 


CGtTCTAAGT 


TTTTTTGCAT 


ATTTAAATAA 


TCATCAACAT CCaCTAAAAA 


180 


TACTTGATGA 


CTTTTGTTGA 


ATTGAATTAA 


TAAAAATGCT 


GTTCCACCGC TTAATTTAAA 


240 


AnCCTTTAAA 


AAATTTTTTT 


GGTGATCTTT 


TAACATTGGT 


TTATTTCCAA CATAAAATGG 


300 


AAATGCAGTT 


TTATTTTCAG 


TAGATTTGCA 


ATCGAAGGCG 


ATTGGCCGTC CCTTCAAATG 


360 


TCCGATAAAA 


TCACATCCTG 


TCTTGTTAGT 


TGGAATCACT 


ACGGGTTTTC CACCAACCCT 


420 


TATAGTTTTA 


GTTCCATTAG 


GTATTTTTGC 


TACTGTGCCT 


TTTCTGTTAC GACAATACCA 


480 


CTCGTTGGTT 


tGTTCAATCA 


TTTTTTCAAA 


CTGACTCCAC 


GCTTTCATCA CAGCACCTCT 


540 


CTCTTAATTT 


CCCTCTGGAT 


TTAACCAGTT 


ATGCATTGTA 


ACTTTTGGTA TTGAATCGAC 


600 


AGAAATCTCC 


TCTTGATCAT 


TTCGCATATT 


GTATTCACTA 


ATTTTTTTTA TAGCTTGATT 


660


TTTTGTCTTT 


ATTCCCTTAC 


GTTCCCAGCT 


TAACAATATG 


CGGTCCATAT ACTTTAGACT 


720 


GAATGCCTGA 


TTAAGAACTG 


CCTCTTTAAG 


GGCTAATTGA 


ATAAGGTCAT CTGGATAACC 


780


ATCTTCTGTT 


TGCCACTGTT 


TAATCATTTC 


CATTTCGaTA 


GGCGAAAGTG GTCtCCCaAA 


840 


ATTTTGTTCg 


ACGGcAGCGT 


ATATATTTAA 


tTTaTTTTTA 


TCTGTTTCTG GTGAAAAAGA 


900 


ATTATTTGTG 


TTTATGTTTT 


GTTTATGTTT 


AATTAATGTG 


CCACTGTTCG TTGACTGCGT 


960 


TGTACTCTTT 


AGTGTCACTT 


TATCGTTACT 


CTGTTGTGTA 


CTCTCTTGCG CACTATTTAA 


1020 


CGTATAAAGT 


ACGCATACCT 


TGTAAGAAGT 


AGCTTTTCGA 


CCATTACTTT TAAAATCAAT 


1080 


CAAACCTAGT 


TGCTTTAACG 


CGnTTCTATT 


TTTATTAATT 


CCTGAGCGTG ACAAACCTGA 


1140 


CAAAGATTCT 


AATGTTGCAT 


TAGCTGCTGT 


AAACCATGTA 


GCCCATCCTG CTTTGTTGTT 


1200 


TATGGACATT 


AATGCACGCC 


ATAAAGCAAT 


CTGACCTGAT 


GAAAGCTTTT GTTTATAAAG 


1260 


CAAATAATCG 


TCAAACGCAA 


GAATCTGTTG 


TAAATAATTC 


AATGTCACAC CTCCACTCTC 


1320 


TATGCAATAA 


TTATtTTtTT 


ATTGGTTATC 


TtTTCTATtT 


CTtCTTTAAA TTTTTCTGGa 


1380 


TCACTATTCT 


TGTCGCTTAA 


ATGGATGAGA 


TATATTTCTT 


CTGTTTTTGT TAAATCAGTT 


1440 


GACTGAAAAA 


ACTTTTTGCA 


GGCATCTATG 


CTCATATGCG 


TTCTTAAAAT TCGATCTTGA 


1500 


ACACTTTTTG 


GCAGTTTGCT 


CTGTCTTACT 


AACTTGATAT 


CATGATTACA CTCAACGAGC 


1560 


CAATGAGTGA 


CGTCTTTAAA 


CGTTTTAGGT 


AAATAGTTAG 


TATCTGTAGC AAATACTATT 


1620 


TTCTTTCCAC 


TAGGTGAAAG 


AATAAGAAAG 


CCCAGTGGTT 


CTCTCGCCCG TGCCTTTTTG 


1680 


TCATCGTGAA 


TAGTGGCAAA 


AGGTTTAACT 


AACCAATCAC 


CAATTTTCTG TTGCTGGTTA 


1740 


GCTTTTAAAA 


TATGTGACCG 


TCTATTTATA 


CCTAATCCTT 


CTAAAGTTCC TCGTGAAGCC 


1800 


CACACATCAA 


ATCTTCCAGC 


GAGTAAAATA 


TCATTGATAT


ACTTTGAGTG GTCACCATGT 


1860 


TCATGTGTTA 


CAAGTAAACC 


TTGAATATTT 


GAAAAATTAA 


TACCTTGTTT CATAATATCT


1920 


TTAGGCTTTA 


ATCCTGCTTC 


TAACATTAAG 


GAAGAGTTTC


CATCTGCAAG TAAATAATTA


1980 


TTACCTGCAG 


ATGAAGAACC 


TTGTATATTA ATTTCAATCA TTAAAAGCCA CGTCCATCTG 


2040 
TTTCGCTAGC


AATATTTGTG 


GTTAATTCAT 


CAAAAAACGC 


TGTCTGGGTT 


GGTTCGTTTT 


2100 


CTATTTCTTG 


ATTCGTTGTT 


GCTTTTTGAA 


TCGGCTCTTC 


TGGTTCGATG 


ATAGTCTCAT 


2160 


CTGCTGCAGT 


TTCTTTATCG 


AGTTTTACTG 


CTTCTTGTTT 


GACTTCTGTT 


GGTTCTTCAT 


2220 


CAAAATCGAA 


CACTTCTTGT 


GCTGTTTGCT 


CAGTGACATC 


TTTTCTAACA 


GATTTAACCA 


2280 


CTTCATCATC 


TTGCGACTGA 


TACTGCATTG 


CGACATAGGC 


GTTTTCAAAA 


TTTTTAGGAA 


2340 


TTTTCTTAAC 


AATGkTGTTT 


CTCATTTTAC 


GAACAATCAT 


GGATTCTCTA 


CTTTGTGGAG 


2400 


ATTTCCATGC 


GGGACTAATA 


TAAGTCTGGT 


ACTCTTCGCT 


ATCCAAGATG 


TCATCTAAAG 


2460 


ACATTGTTTT 


CAAATTGTTC 


ATGATTTCTT 


GCTTTTTATT 


TTCGATTTCT 


GTTCTCTGAC 


2520 


TCTGAGTTGC 


TTTGAATTTA 


TTTTCAGCTA 


GTCCAAACGT 


CTCATTCATC 


AAATTTTGGT 


2580 


TGATGTGGGC 


AATTAAGTTT 


TTAACAACGT 


CTGCACGCTC 


TGAAATATGG 


AACTCAACAG 


2640 


ATCCGTCTGT 


CATTTCAACT 


GGATAAACAA 


CTCTAACCAC 


TTTTCCTTTT 


CCAGTGGGTC 


2700 


CCCACTCTGG 


ATCAGTCACA 


GACAGCCCTT 


TGTATCCAGG 


GTACGAAAAA 


TGGTCCTCTT 


2760 


CCCGAACTTC 


CCAATGTCGA 


TGAACATGCG 


CCACATTTCG 


TCCGAACTTT 


GAAAGAATGG 


2820 


CGTCATTACC 


ATCTCCTTCA 


ATTCCCATTT 


CAATCTGTTT 


AATCCAAACA 


TCTTTCGGCT 


2880 


TGCCCGTGTT 


TGGATCATTC 


ACTTTTCTTT 


TAACATTTCG 


GGTTTGAAAA 


TATaCTTCGC 


2940 


GAGGAATCGC 


CGATGCATTC 


ACTTGTAATG 


CTGCGATGGT 


CATTAGCGTT 


TCTGTTATAT 


3000 


TCGTTGAATC 


TACATCATTA ATTGACAAAC


TGGCATTTGT 


aAGCATTGTA 


TTAAtTGCTT 


3060 


GAAwTGAGCT 


GATTACGCAT 


TGTTTTTGAT 


ATTCATTCAT 


ATTAATTCCG 


TTGCTTAcAA 


3120 


GCTGCGAttC 


AACTTGTGGT 


AAAAACGTAT 


CGTTGATTTT 


AGTTAATCTA 


TTCTCGAAAA 


3180 


TTTTATTTTT 


TGCAATTTCA 


TTGGTCATCA 


ATATCTATTC 


CTTTCTCTGT 


CAAAGCAGCC 


3240 


ATCCATACTT 


TTTTAATTTC 


GTCGGGACAA 


TGACTCATAG 


CATCTTTCCA 


AGTTGGCCAT 


3300 


CTTCCGTTTT 


TGTCATAGAA 


TTTATATTGA 


TATGTTAAAC 


TTTGCTGATT 


ATGTGGTTGC 


3360 


TCTTCATCAT 


GCtTAACTGC 


ACAAATTTCG 


CAGGTTCCTT 


CCGGAACGTT 


TCCGATCATC 


3420 


ATAAACCCAG 


TTTCGTCTTT 


TAGAAATCCC 


ATCTATTTTG 


CACCTGCCTT 


TTCAATTGaA 


3480 


TGCTGTTCAA 


TTCTAATTTT 


TTTAyCTTTT 


TCGCTTACAT 


ACATCGCAAT 


GACTTGTGTA 


3540 


TCTACTTGAA 


CAGAGTCTCT 


ATTGTGATTA 


GTCAAACCTT 


CAGCGTTATC 


GATAAAAATT 


3600 


GGAACGATAT 


AGCCTTCCTG 


TTTCATTAGA 


GTGTTGGAGA 


CATCTAATCC 


AGCTTGCATA 


3660 


CGGCTACCGT 


TATTTAGAGA 


ACTGAACGGT 


ACTCCGTCAA 


TCATCGGTTC 


ACATACTGCT 


3720 


TCATTGAGTC 


CTCCATCTTC 


AAAGAAATCG 


AACAATTTCC 


ATTTAACAAC 


TGAGAAATGA 


3780 


CTATTTATAA 


TTTCTTGCAG 


CATGTTCTGT 


TTTGTTATAA 


AGAATTCTTC 


AAATAGTACT 


3840 


AATTTTTGTA 


ATACTTCACC 


TTTCTTATGA 


GATAATTGGC 


GCTCTTGTTC 


ATTGAAATCT 


3900 


TCAATAATCG 


ATAGCTGTCT 


TTCATACTCT 


TTTAGCAAGG 


CTAATTTTTC 


ATCAATCATT 


3960 


GCGATTTCTT 


TATCTATTTC 


TGTAATCTCA 


CTTGTTTTAG 


CGGAAGTTTG 


TTCTAGAATA 


4020 
GCTTCGTTAC


TTTGAGTAAT 


GTACTCTTGC 


AATTTTTTTA 


TTTCTTTAGT 


GATAGTTGAA 


4080 


TATTTTTCAG 


TTGCTTCAAA 


TGGTATTTTG 


TCTAATTTTA 


ATGAGGAAAT 


CTGCTGCTCT 


4140 


ACATCAGCTA 


AGTTTTCCTT 


CACATCTTCC 


AAGTGTTTTT 


TAGCAATGTT 


ATATGCTTCT 


4200 


GTTTTAATCA 


ATAGTTGTTC 


TTTTAACTCG 


CCGATCCCTT 


TTTTTAGGgC 


TTCGCGATCT 


4260 


TTGTTATTTT 


GAATGCCTTT 


TTCTCGaATT 


TCAGAGAGCT 


TGATTTGCTT 


ATCCGCTTCA 


4320 


AACTGGGCTT 


TTATTTCTTT 


ATTTGTTAAC 


TCAATTTCTT 


CTGCACGTTT 


TTGCTCTTCT 


4380 


TCCTCATGAT 


GTCGTTTCAT 


TTCATCTTGa 


TCCTTGACAT 


CATATGGACG 


ATTACAGTGC 


4440 


TGACAAACTA 


AAAGATTTTC 


ATTGAAAGAT 


AATTTGGTAT 


AAACTAGACC 


GCCTGTAAAT 


4500 


TCTTCAGCTT 


CTACTTCATC 


ATATTTGTCG 


TACAATTCTT 


CATGCTTTTT 


ATTTAATGCA 


4560 


ATCAGTTCAT 


TATCTTTTAT 


AGAAaCCAAA 


CGTTCTGTTA 


CATTAAGACT 


TGATTCTTCA 


4620 


TCTGCATACG 


TCTTCTGGGC 


TTTATTGAGA 


TCAGCAAAGA 


GTTTTGACTT 


ACCTTGCTCA 


4680 


ATGCCATTAA 


TGCGTGCATT 


CTGTGCATTA 


TCATGCTTCA 


ATTTAGCAGC 


AGTTAATTCT 


4740 


TCTTGTTTTG 


TATTAAGACT 


TGCAATTAAT 


TCTGAAATAT 


TACCACCATT 


TCTAATAGTA 


4800 


ACGAGCTGGT 


TTTCGATGTC 


ATTTTTCTTT 


AATGTCAACT 


CGTTGCGAGT 


GGTTAATAGT 


4860


TGCTCTTTGT 


TGATATTTTC 


AATATCTGGc 


AATGCTGCTT 


GAATACCTTC 


AATTTTTACA 


4920 


GGAATATTTT 


TCAGCGTTTC 


GTTGATcGCT 


TTTGTTCTTG 


AAGCACACGC 


TCACGAGCTG 


4980


TTTTAATGTC 


ATCATTACCA 


ATAATTtCTT 


TTAATTGGTG 


AATTGATGGC 


GATTCGTTGA 


5040 


TAATCTCTkC 


ATCTGTCTTG 


CTGCCAAAGT 


AtTCAAAAAG 


CTTTTGACGT 


CGTtCATCTG 


5100 


CCACTAACTG 


CTCACAAAAA 


TAAGTCACAC 


TAGTCAGATT 


CTTAAATGTG 


TCTTGATCTA 


5160 


ATACTTTTTC 


TACTTCATCG 


TCAAATGCTT 


TCTTAGTTGT 


GGTCTCTAAT 


CCGTCTACGA 


5220 


GGTATTTTGT 


AAACATCTCA 


TAAGACTTGT 


GTTCCGAATT 


TCGTTTGATA 


ACCTCTTTGT 


5280 


CTCCTCGGAC 


CTTTTCAAAC


TCCTTTGCCT 


GACCATTGAT 


AGTCAAAACG 


ACAGTTACTG 


5340 


ATGTCTGTTT 


ACCGCGAATT 


GGCTCGCTAT 


TTTCATCAAG 


TGGACGCCAC 


TGAATCTTAG 


5400 


TACGTTCTTT 


TGAATCTTTG 


TTGAACAAGC 


ACCAGAGAAA


aGCATCaTAG 


ATAGTTGTTT 


5460 


kGCCTGCATC 


GTTGTCTCCA 


AAAATATCAA 


TACTTTTACC 


ATTTGGTTCA 


ATCATCAGGT 


5520 


CCGATATACC 


TTTAAAATTA 


TGAATACGGA 


TTGACTCCAA 


GCTAATGTTT 


TTCATACAAT 


5580 


TAATCTCCTT 


CAATTTCGAT 


ATTGTCTTCT 


AAGATTGCTC 


GTATAACTGT 


TTCTTTGCGT 


5640 


TTGTTACAAA 


TTTCAGTAGC 


GACATCATAC


CAAGCACTAC 


TATGATTTTG 


ATTCAGCTCT 


5700 


GTTGTTAAAT 


CATATCTATA 


ACTTTTACTA 


ATTATTGGAT 


CAACTAACCA 


ATCCATACGT 


5760 


GCAATTAAGG 


CTGTAGCGTA 


TAATTCTCGT 


TCTGCGCAAT 


TTTCAGTCGC 


GTTTTTAAAC 


5820 


CATTCTTCAA 


ACCGTTTTGA 


CATGGTTAAT 


TTATTTGACA 


CATTATCCAT 


TTCATTTCCT 


5880 


CCAGATTTTG 


TGTTATAATT 


CTCATGTATA 


ATTTTTGTAT 


GGGACTTAAT 


CGTTTGTGGA 


5940 


CGAGTGAGTC 


TCTTTTTTTG 


TGTAAGCTAA 


ATATTTTGCA 


TCGTCGTATT


TCATGAACCA 


6000 
AACAACTGCA ATTGGTCCAA TAATCAAAAG TAGATAGcTC GCTGGAACAC TATTTTTAAT 6060
TaCTAACCCT AATATAAAGA CAAGAATAAA TGATCCTAAA ATTCTAGCTt CGTATAAGCT 6120
TTr.TGGTTTT CTTTCcTTCA TTTATTTTCC ACTCCTATTT ACCCATAGCA CTCTGATAAA 6180
CATTAAAAAA AGTGCTAATA TAATCGCTGT TAAAAATTGT GATTTAATCA AACATAAAAT 6240
AAACACAAAG AATAAAATTC CTACGGAAAG TGTCATCATT GTTTTAATTG CAAGTTCCTT 6300
ATTCTTCACT AGAGACCCTC C7A.TATAAAG TTTCTTTCAT ACCATTCTTC TAAATCATCG 6360
ACTTCTATTC TTACCAATCC ACCATTTCTT TTAGCAGGTA ATGGATCAAT TTCTCGTTTC 6420
ATCCATCTTG TAACAG7AGC AGCTGAAACA TTTTTTCTTT TTGgCTACTT CTATTGCTTT 6480
CAAAGGTGGn AATTTcTTTG aATTAACTTT TCGaTCkTTT TTgaTTTAAA ATCAATAACT 6540
TCTAGTGcCA TTkGwATyCT CCTTCCTCAT GtATCCTAGT IcGTTCCCAAT ATGGAAAACG 6600
TTGCTCACTT AAATAACGAA TATCGATGAA AGCAAGATCA CACAAGCTAC TAAGTAGTGT 6660
GATTCAACAA TTACTCAnCA AATAnCCATT GCGnAGCACT ATTTAGAAAT ACTGTCAGAA 6720
AAA 6723

(2) INFORMATION FOR SEQ ID NO: 164:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27360 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:
TAAGCGAATT GGACATCATA TGTCAATAAC AAATGTTGAT AAAAAAGGTC AATTAAACGC 60
TATTTTGAAA AATAAAATCA GCTTTTAGAT AATTTTGCCA AAAATTTCGC TTAATGAGTA 120
TTAAAAATAT TTTTTTATTT TATGTGCTCA AATATTTTTC AAAAAGAAAA AATTGTATTA 180
AGGTAATAGT AGAAAGGGAG GAGATTTAAT GACAGAACCA AACTTAAAAG ACCCAAGAAA 240
GTTATTTTAT AGTGATGGAT TTCCGCAACA AGATCAAGAA ACTCCCGCAT TACAGGACAA 300
AATGGTCCCT AAACCAGATT GTGGGGAAGA CAGCTATGTC GGAAATCACA AATTAGAAAA 360
TCGTCGGGTA TTAATTACAG GCGGTGATTC TGGGATTGGA CGAGCTGCGG CGATTGCTTT 420
TGCTCGTGAA GGCGCAGATA TCGCCTTACA TTTTTTCCCA GGTGAAGAAA AAGACGCCGA 480
AGAAGTGGCG CACTATATAC GAGAAGCTGG CCGAAAAGTC GTTCTTTTGC CAGCAGATTT 540
AAGAGATAAA CAGGCACCTG AAGAATTGGT TGCACAAGCT CATGAAGCAT TAGGCGGCTT 600
GGATACCCTC GTTTTAAATG CAGCCCAACA AATTTCCTGT GCCGCTATTG AAGAATTACC 660
AATGGAACAA GTGATAGACA CTTTCCATGT CAATATTATT GCGATGTTTG GCATTGTCAA 720
AGCAGCAGTG CCTCACTTGC CTGCCGGAAG TAGTATTGTG ACAACGACTT CCGTCCAAGC 780
CTTTAATCCC AGTGAGCATT TGTTGGATTA TGCAGCAACG AAGGCATCAA TTGCCAACTT 840
TACCGTTGGT TTAGCAAAAC AACTGGCACC AAAAGGCATT CGGGTAAATG GCGTGGCTCC 900

AGGACCAATC TGGACACCGT TACAGTTAGA TCATGGACAA CCAATTGAAG AGTTGCCAGA 960 

ATTTGGGCAA CATTCTTTAC TGGAACGTGC GGGTCAGCCT GCTGAATTAG CCCCTGTTTA 1020 

TGTTTTCTTG GCGTCCAATG ATGCAAGTTA TGTTACGGCT CAAGTTTATG GCGTAACTGG 1080 

TGGAGAAGCA ATTAATCTTT AGGAGGGATG TTCCGTGAAT AAAGTTGTTA AAATACTTTT 1140

GCTTCTAGGT TCACTGATTT TAATGGCATT ATTTGGCGCG GTGGCCATCA CCACTTGGTC 1200 

AGCCGATACA GCTTGGCAAC CACCAGATAA TGTCCAATGG TTCTTGCAAG ACAACGAGTA 1260 

CGCACGTCAA GCAATCTTTT GGGTAGCGCT GGTTTTACTA GGAATTATGC TCTTTCTCTT 1320 

TTTATTCATT CTTTTATTCC CAAGAAGAAA AGGCACGTTT GAATTAAAAG AAGAGCAAGG 1380 

AAAGTTAGTG TTACACCGTA AAGCCATCGA AGGATTTGTA CGCTCTAGCT TGCAAGACAG 1440

TGACTTTCTG GAAGCACCAA CCATTCGTGT AAAGGCGACA AAACGGCGCA TTAAGGTCAA 1500 

TGTTAAAGGC ACACTGAAAC GAACGGCAGA TTTAATTGGG CAAACAGAAG CTTGGTCAAA 1560 

AACAATGGAA GAACGGCTTG CTCGTTTGGT GGGAACTGGT CACAAGATTA CGATTGATGT 1620 

TCATTTTGAA GATATTCAAT CAATGAAACG ACCACAACAA CAGCAAACAC AAGCACGTGT 1680 

GGAGTAGGAG GGATGTACAA TGGTTGAATG GATAAAACAA TATGCCGTTC CGTTAATTGG 1740 

TGGCCTAGTC GGCTTGGTTT TTGCAATCTT GTTTCTGTCA ATCGGCTTCT TTAAAACATT 1800 

GTTAGTGGTA CTCCTGACAG GACTAGGTAT TTGGCTAGGT TTGTATTTGG AACAGACAGG 1860

CATTGTCAAA AATTATTTTC ATCATAAATA GGAGGTTTTG GCTATGGAAA ATGCAAATGT 1920 

ATCGATTGTT AAAGGTGAGT TGACGTTTGA GGATAAAGTA ATCCAAAAAA TTATTGGCAT 1980 

TGCTTTGGAA GAAGTCGACG GTTTGTTAAC TGTGGaCGGT GGTTTTTTCT CGAATTTACG 2040

AGATAAAATG ATTAACAGTG AAGATGTTAC AACAGGCATT CACACCGAGG TCGGCAAAAA 2100 

ACAAGTCGCA GTGGATATGG ACATTGTAGC CGAATACGGG AAAGATATCG AGACCATTTA 2160 

TGATCAAATG AAAGAGGTTA TTACTCGCGA AGTCCAACAA ATGACTCACT TGGAAGTAAT 2220 

CGAAGTCAAT GTCAACGTAG TAGATATCAA AACACAAGCA GAGTACCAAG AAGAAAGTGA 2280 

AACGrTCCAA GATAAACTTG GCAACGCCGC TGAAGCGACA GGTAGTTTTA TGTCAAAACA 2340 

AACCAACAAA GCCAAACATG CGGTAACTAA AGGCAAGACA AAAGTCAAAG AAAACACGGC 2400

ACCTCGTGTT CAATAATTGG TTATTATTTT ATTAAAAATA AAGCAATAAT AGAAGAAACT 2460

AGGAGGAATG AACCATGAGT AACGAAAAAT TCAATAATGT ACCAAAACCA GCAGGAAACG 2520 

GACCACACAC ACCAGAAGAT CATGCAATCA AAGGTGAATT AACATTTGAA GATAAAGTCG 2580 

TTCAAAAAAT TATCGGTTTA GCCTTAGAAA ATGTCGATGG TCTATTAACA GTCGATGGCG 2640 

GCTTTTTCTC AAATATTGCT GAAAAATTAG TAAACACAGA TAACGTCACA GCGGGTATTG 2700

ATACCGAAGT TGGTAAAAAA CAAGTCGCTG TTGATATGGA TATCGTGGTA GAATACGGCA 2760

AAGATATTCA AGACATTTAC GAAAAAATGA AAGAATTAAT CAGTCGTGAA GTGAAAAAAA 2820 
TGACCCACTT


AGATGTGATT 


GAAGTGAATG 


TTAACGTAGT 


TGATATCAAA 


TCAAAAGAAG 


2880 


AATACGAAGA 


AGATAGCGAA 


ACGGTTCAAG 


ATAAAGTCAC 


AGGCGCTGCC 


AAAAGTACCG 


2940 


GTGAATTTGC 


TTCTGAACAA 


ACAGAAAAAG 


CAAAAAAAGC 


TGTGAATAAA 


GGAACAGAAC 


3000 


AAGTCAAAGA 


AAATATGGAA 


CCACGTGTAG 


AATAAACAGT 


TAAAGGGAGG 


AAACAATCAT 


3060 


GGGCTTTATT 


TGGGCATTAA 


TTGTCGGCGG 


GGTCATTGGG 


GCAATCGCTG 


GAGCAATTAC 


3120 


TAAAAAAGGA 


TCATCGATGG 


GCATTATTGC 


CAATATCATT 


GCAGGGTTAG 


TTGGTTCAAC 


3180 


AATTGGTCAA 


GCCATTTTAG 


GCACATGGGG 


ACCAAGCTTA 


GCTGGGaTGG 


CTATTGTGCC 


3240 


ATCGATTATC 


GGGGCAGTGA 


TTTTAGTTGC 


CATCGTTTCA 


TGGATTTTAG 


GACGAAAATA 


3300 


GAGAAAGTTT 


AAAAAAATAG 


GGCATAAGTC 


GTTTCGGCTT 


ATGTCCTATT 


TTTCTTTTTC 


3360 


TATGTATGAG 


GAAGTTTCGT 


ATGAAAAGTA 


AATTTTGAGC 


AATTAGCCAA 


GAGCTTATTT 


3420 


TAATTAAAAT 


CATTAGTCAT 


TTTGGCTTTT 


CAACAAAAAT 


CAACAACATT 


TGTTGAAGTT 


3480 


ATTTGTAGAT 


AACGTTTATC 


ATTCACAAAT 


TTACATTTTT 


TTGTAACGGA 


TATCTCGTAT 


3540 


AGTAATAACT 


GCAAATGAAT 


TTCAGAAAAT 


TTTTTGATAA 


GTCATCAAAG


TGAGGGGATA 


3600 


GCGGTCTTTT 


ATTGAAAAAT 


AGCAACACAA 


AATAAACAGA 


ACAAAAGAGA 


GAGGAATTCA 


3660 


AAGAATTATG 


ACAGAACAAA 


CACCAACAAA 


GCTTTTGAAT 


AAAGGCTTTA 


TCAGTATTAC 


3720 


AGTGATTAAT 


TTCATTGTCT 


ATCTTGTTTA 


TTATTTATTA 


ATGGTTATTA 


TTGCGGTGAT 


3780 


TGCGCAAGAT 


AGCTTACACG 


CTACTTTAGG 


TCAAGCGGGT 


TTGGCCTCAG 


GAATTTACAT 


3840 


CATCGGGACA 


TTGCTTGCAC 


GACTTTTCAT 


GGGGAAACTC 


CTTGAACTGA 


TTGGGCGCAA 


3900 


ACAAGTCTTA 


CGTTATGGGG 


CTTTGTTTTA 


TCTTTTAACG 


ACTGTTGCGT 


ATATGTATAT 


3960 


GCCAAGTATG 


GGCATTCTTT 


ATCTGGTCCG 


TTTTTTAAAT 


GGGTTTGGTT 


ACGGAACCGT 


4020 


TTCGACAGCG 


ACCAACGCAA 


TTGTTACCGC 


CTATATCCCT 


AAAAACAAAA 


AAGGGGAAGG 


4080 


GATCAATTAT 


TATGGTTTGA 


GTACTAGTTT 


AGCAGCCGGC 


ATTGGGCCAT 


TTATTGGGAT 


4140 


GTTATTGTTA 


AATGTTTCCA 


ACTTCCATGT 


CATTATTAAC 


TTTTCAATTA 


TTTTAATTTT 


4200 


GTTAACAACC 


ATTGCTTGCT 


TCATTTTCCC 


AGTGAAAAAT 


ATTGAGTTAA 


CACCTGAACA 


4260 


TCGTGAAGCT 


TTATCTAAAT 


GGAATTTTGA 


TAGTTTTGTT 


GAAAAGAAAG 


TTTTATTTAT 


4320 


TACATTTATC 


GCCTTTTTAA 


TGGGTCTTGC 


TTATTCAAGT 


GTCTTATCTT 


TCTTATCTTC 


4380 


TTATGTCAAA 


GTAATCGATT 


TAGTTGACGT 


CAGCACCTTC 


TTCTTTATCG 


TCTATGCGGT 


4440


CGTGATTACC 


TTAACACGTC 


CATCGACAGG 


TCGGATTTTC 


GATGTCAAAG 


GTGAACGCTA 


4500 


CGTCATGTAT 


CCGAGCTACA 


TTTTCTTAAC 


ACTAGGCTTA 


TTCTTATTAA 


GTATGACTAC 


4560 


GTCTGGTTGG 


ATGTTATTAG 


TTTCAGGTGG 


GTTAATTGGC 


TTAGGTTATG 


GTACTTTCAT 


4620 


GTCAAATGGC 


CAAGCGGTTT


GTCTTCAAGA 


AAGCCCAAGT 


CCTCACCGTA 


TCGGGATTGC 


4680 


TTTATCGACT 


TATTTCATCG 


GTTTAGACCT 


TGGATTAGGG 


GTTGGTCCGT 


ACGTTTTAGG 


4740 


TGAATTACGT 


AACTTTATGT 


CATTCCAACA 


AATGTACTTC 


TTAGCAGGCT 


GTATCCCAAT 


4800 
CGTTTGTACC


ATTTTATATA 


TGGTCTTCCA 


GAAAGCTAAA 


AATGATGCCA 


AAGATTTGTC 


4860


TCTTGAAACG 


ATTGAAGAGA 


TTGAACACGG


CTCAGAACTA 


TAAATGGACA 


AGGAGTGATT 


4920


TCAATGCCTA 


ACGATGAACG 


GATTTTAAAC 


GTTTTACAAC 


AAACACTAAC 


GGAAATTCAA 


4980


CGCATGAACA 


CGCGTTTAGC 


TAAAATTGAA 


ACAGAGTTAG 


CCGATGTCGA 


TCAACATCAC 


5040 


GCGACTTTGG 


ATCATTACTT 


GCAAGAGCTA 


GAATTGGTGA 


CAGAAAATAA 


CAACGTCATG 


5100 


TAAAGTGAGT 


AGTTAGTAAA 


TGAATGTTCT 


ACGTTTCGGT 


ATCACTAGAG 


TGGAACATTG 


5160 


AAAATTTTCG 


TAAAAAACGT 


CAAAAGGTAG 


CAGTTTTTCT 


GCTACCTTTT 


GGCGTTTTTT 


5220 


GTAATTTTTA 


TCACTTAAAA 


TGTTTCACGT 


GAAAACTTTT 


AAAAGTGGCG 


TGGACTTTGT 


5280 


TTGGTGACAA 


TTTGTGTATT 


TAAATTTTCT 


AAGCTTTGCG 


CTAGTTCACC 


AATGCTAGCA 


5340 


GAAAAATCAA 


CATAATAAGT 


GGGAACCGTT 


GATTTGACAC 


TAACTTCTGA 


AAAGATATCT 


5400 


GTAATAATTA 


CATCGAAAGG 


CTTTTCTGGT 


GAAGCGACAA 


AAATTAATTT 


TTTATCCACG 


5460 


GCTTTACTTA 


GCTTTTCATA 


AAGGGTATCT 


AAAATTAAAT 


TACTTTGATT 


GGTCCTTACT 


5520 


AAGACACGAA 


TCGGTGGTAA 


AGATTTCGTT 


TTCAATGTAC 


AAAAAAGAAT 


AAACAAGCAG 


5580 


TCAACTAACA 


ATTTTTCATT 


GCCAATCAAT 


GTCGGAAGCA 


TTTCAATTGT 


AAGGCGCTTC 


5640 


ACATCTTTGT 


AGAGACTTGG 


ATTTTTTGAA 


GTGCTTCTTT 


TAAAAAGCTT 


TCTTGCACAA 


5700 


ATGGATTGGC 


AATCGTGACA 


AAGGTCTTAA 


AAAAGACACT 


TAAATTTTGG 


ATAGAAGTAT 


5760 


ATAGATAAAG 


ATACTCGTGA 


AATGTCAATT 


TAATTTGATA 


AAAAGACGTT 


AGTTGATAAA 


5820 


TGATCTCTTT 


AATGGGTGAG 


CCTAATAGCT 


GGACCATATG 


CATCCGTGCT 


GCAGGAATTT 


5880 


CCAAGACATC 


TTGTGTGGCG 


TAgcTGaCAC 


TAATCGTCAA 


TAAGAAAAAG 


GCAAAAAGGC 


5940 


CTTCTTTTTG 


TATAAATTTT 


GTTGGAACTT 


GGTGTAAGGC 


TTCTTCTTTA 


AAAAACAATT 


6000 


CAAATTCACT 


CAGATTAACA 


AAGGGATTAA 


CAATCGCTTC 


TTCCTCCACG 


AAAGTTTCTT 


6060 


TGATAAAAAA 


GCCCTCTTTC 


CAACGCGCTA 


AAATCACCGC 


AGAAAAAATC 


AAGTCTTTTG 


6120 


TAAAATCTTG 


TGTTGTATTG 


AAAGAAACAG 


AGTGTTTTAA 


ACGTCGAAAG 


AGACTATGGA 


6180 


CTTCTTTTTT 


AGTAATCGTG 


TGCAACGGCC 


AATCCCCTTC 


AGAGTAATTG 


TTCCAAAAAA 


6240 


GCATCATAAA 


GAAAATCCGG 


ATATCTTTTT 


CTTTGCCCAC 


CAACACATTG 


GTACGTTTTA 


6300 


AATTGATTTC 


AATCTGATAT 


TTTTGGAGCC 


ATTGGTTTAA 


TTGTCCAATT 


TTTCGTGTAA 


6360 


AGGTTGATGA 


AGAAATCTCT 


AGGACTTTTT 


CAAAATGTTT 


GCTATCAATT 


GCATTATGGT 


6420 


TAAAGGCATA 


GGTCAGTACT 


TGATACGCGA 


AACTTTTTTT 


GACAAAGTAG 


GGCATCAAAA 


6480 


ACGACAGTTG 


GAAATAGGGT 


GCAAACCAAA 


ATGTGATTTC 


TTTGGCATTT 


GCTGATAAAG 


6540 


AAAATAAGTT 


CTCCCAGCCG 


TTTTTTTCAA 


TTTCTGTTTG 


GAGCAAAGCT 


ACAGATTGTA 


6600 


GAATGGTATT 


TTGTTGCATT 


TTTAATTGAG 


TCGCTAAAGT 


AGAAATCGCC 


AATGTTTGTT 


6660 


CTTCTGCTTC 


ACGACGTTCG 


AGTAACTCTT 


CTAAAAGAGC 


AAAACTTTTA 


CGATAAGGTA 


6720 


CATCAAGTAA 


CCCTGTTAAC 


ATCCGATTCC 


CTCCAAACTG 


TTTTCCCCAC 


GCAATTTAGC 


6780 
AAAATAAATT


TTGCGAACTA 


CTTGTTTTAG 


GTTATAGTAT 


TCGTCTAAAT 


AAGGAACGGG 


6840 


ATGCCAATAA 


AAAGTTTTAC 


TGGCGTGAAA 


AAGCGGATTA 


TTTTTGTCAG TAATGACTAA 


6900 


GTCAGGCACT 


TCTTGAATAT 


GATTAACAAA 


TTTCAAAGGA 


ACATCGCAAT


A ATT AT TP. AT 


6960 


TTCCTTCGCT 


AACCAATTTT 


CTTGTATCTG 


ATTAATAGAT 


GAATPAAGTA 


AAATTTTAAP 


7020 


AGGAATTTGT 


TTATTTAATA 


AAACACTCCG 


AATAATTAAA 


AAAATP.P.TTT 


PTTTAAATAA 


7080 


CGCATCTAGT 


GGATGTTGTT 


TGTCAAAAGG 


CAAACGCTTT 


AAP.AAATAP.T 


TP.PTTPTTPP. 


7140 


TGAAACATTT 


GGGTATTGTT 


TTTCATACAA 


AGAGACATTA 


0^«nnV-^> i \*r\ X 


APPTPAATTP 


7200 


CTCTTCCACA 


GGGAAATGAA 


TCAGTCGTTC 


ACTGGCATTG 


pata aattaa 



PP A APAA AAA 


7260 


GACCACTTCT 


ATTGCGCTTA 


AATCCAACTG 


TAATTGCGTA 


P A *T ,r P r P r P f PP A & 


A A ATPPAAPT 


7320 


AGCGTGTGGA 


CTGTTATCCA 


TTCCCATGAT 


CGCTAAGTTG 


otnnno 1 1 on 


TrrTTTPPAT 


7380 


TTGGACCAGA 


GAGTACATCG 


TTAATGTGCT 


GAAGAAATCA 


ta&taaaaat 


p a a Tr tp a p.p 


7440 


TTGCCAAGTG 


TCAGGCTCTT 


TTTGTTCATC 


AATAAAACAA 


a t Tr a pt a 



a T &P TT^ A A 


7500 


TTCTGAATAA 


GGAAAGGCCA 


AATTAATAGC 


AGCAACTTCT 


PP T\ T TV a TTCT 


TPATTPTAAA 


7560 


ACCGGATTGA 


TAACGGAGTA 


AACCTATGGA 


CATAATTAAT 


I 1 iniil 1 lot 


P T Zl 71 A T P A P. T 


7620 


ATATGTAAAG 


TAAGGGTAAC 


GGTTTTTGAT 


GATTTCAATG 


nnol iwly^n 


ATTPTPPTTP 


7680 


ATTAGTCTTA 


GATAGAAATG 


GTACTAGAGA 


AGTTCCTGAA 


ATPAAP.AATA 


APPA AT A A A A 


7740 


GAAACTGCGA 


AAATCTTTCT 


CGTTACCACA 


AATTTTTTCT 


L-o 1L1 1 1 1 In 


AP.TTTAAATA 


7800 


ACAATTAAAT 


CTAGCTAGAT 


GATTATTTAA 


ACGGTCACTG 


APTTPTTTAA 


A AAT AP A AAA 


7860 


GTTAATAAAG 


TGCTCTTCAC 


AAAAATCTTT 


GGCTGAAGTG 


T A P.P P. TTTT A 


A A APT A A ATP 


7920 


CAGGGCTAAC 


ATATAGGTTA 


GTGACCCGCG 


GACAACAGCT 


TPTTTPATTA 


APTPAATPPiPj 


7980 


GATATTTTCA 


TCGGCGACTA 


ATTCTATGGA 


CTGTTGATTA 


TT AAT A ATAP 


PAATATPTAP 


8040 


ATTTTCATGT 


GTATTGACTC 


GAGAAATTAA 


CAAAGTTAAT 


1 on 1 1 v^nn 1 n 


AATTPTTPAT 


8100 


ATTGTAATGG 


GAAAATTCAT 


ATTTTCGTCG 


GAGTACTTGC 


atppppaptt 



TTTPTTTAPT 


8160 


GTCATACAAG 


TATTCAATTA 


GGGAAAGTAA 


CTGAATTTCC 


1 O 1 l 1 1 1 n 


ATAAGTTATA 


8220 


CATAATAACC 


ACCTTTCTAA 


TTGGTTATGA 


TTGTTTGATT 


TTTPAPIATAA 


TACAGATAGG 


8280 


GCATTCTTTT 


CTATATGACA 


TTGACGATTG 


CCAGTGAAAG 


AAPTAPTTPT 



TCTTATGCTA 


8340 


CTGTATCTAT 


TCTTAAAAAT 


CAAGAAGAAC 


TGAATGTGTC 


AAATGGAAGA 


AAACATGGAC 


8400 


AAAAAATGAT 


AGAGCAAAAA 


ATTATTAAGA 


AAAACAACCA 


GTTTATATAA 


AACATTTAAA 


8460 


TAACAAAAAT 


GATAATTTGA 


GTCTATTGAT 


TTTTTAGCTT 


AAATAGCTTA 


TACAAATAGG 


8520 


TTTTTTTGTC 


TGGTTAGTTC 


TTATGAAAAA 


AATTAAAAAA 


AACGGTATAG 


TGGAAGAAGG 


8580 


AATAGTAAAA 


ATAAGAAAGG


ATAAGGACGT 


GTTTTCATAA 


AGATGAAAAG 


GGTTTCTGGG 


8640 


ATTTCATCAA 


CTTTTTCTAT 


TTGGAGGAGA 


AGTTGTTGAT 


AATTTTTGCT 


GAAAAATCTC 


8700 


GTTATACTCT 


TCTTATTCTT 


ATAGAAGAAT 


CTATAGGAAA 


AAGCAGAAAA 


TGAATCCGCA 


8760 
TTCGAAGTAA


GAGTGCGTTT 


TGAGGAGATA 


AAAAGAAAAG 


GTGTGTTGGC 


TGTCTGTTTT 


8820 


AGCAAGGAAG 


AAAGACAACA 


GAAGAGCACT 


CACTTATTTG 


GTTTAAAATA 


AAAAATTGGT 


8880 


ACGAAGTGAA 


CGTTCTCTTC 


TATGTGTCGT 


TAGTAGAGGA


AGGATGAAAG 


AAATGAGAAA 


8940 


GAATGGTCCA 


ATGGTAAACC 


GTTGGCTCTA 


CGGGTTGATG 


TGTTTGTTAC 


TTGTTCTAAA 


9000 


TTATGGCACA 


CCACTCATGG 


CTTTGGCGGA 


AGAGGTTAAC 


AGCGATGGCC 


AGTTAACGTT 


9060 


AGGAGAAGTG 


AAGCAAACCA 


GCCAGCAAGA 


AATGACCTTA 


GCGCTTCAAG 


GAAAAGCACA 


9120 


ACCAGTAACA 


CAAGAGGTTG 


TAGTGCATTA 


TAGTGCCAAT 


GTGTCAATCA 


AAGCTGCACA 


9180 


TTGGGCAGCG 


CCCAATAATA 


CGCGCAAGAT 


TCAAGTGGAT 


GACCAGAAGA 


AACAGATTCA 


9240 


AATTGAATTG 


AATCAGCAAG 


CGTTAgcAGA 


TACGTTAGTC 


TTAACGTTGA 


ACCCTACAGC 


9300 


TACAGAAGAT 


GTGACGTTTT 


CTTATGGACA 


ACAGCAACGA 


GCGTTGACGT 


TAAAGACTGG 


9360 


TACTGATCCG 


ACAGAATCAA 


CGGCAATCAC 


GAGTTCGCCA 


GCCGCATCAG 


CGAATGAAGG 


9420 


TTCAACAGAA 


GAAGCATCTA 


CAAACTCCTC 


TGTTCCTCGT 


TCGTCCGAAG 


AAACTGTCGC 


9480 


CAGCACGACA 


AAAGCGATAG 


AAAGTAAAAC 


AACTGAATCG 


ACGACTGTCA 


AACCGCGCGT 


9540 


AGCAGGACCA 


ACAGATATCA


GTGATTATTT 


TACAGGTGAT 


GAAACAACGA 


TTATCGATAA 


9600 


TTTTGAAGAT 


CCGATTTATT 


TAAATCCTGA 


TGGAACACCA 


GCAACACCGC 


CGTATAAAGA 


9660 


AGATGTGACC 


ATTCATTGGA 


ACTTTAACTG 


GTCGATTCCA 


GAAGATGTGC 


GAGAACAAAT 


9720 


GAAAGCAGGC 


GATTACTTCG 


AGTTTCAATT 


ACCTGGCAAT 


TTGAAACCTA 


ATAAACCAGG 


9780 


TTCAGGTGAT 


TTAGTTGATG 


CAGAAGGCAA 


TGTCTATGGA 


ACCTACACAA 


TTAGTGAAGA 


9840 


TGGTACGGTT 


CGTTTTACCT 


TTAATGAGCG 


AATCACGTCT 


GAAAGTGACA 


TTCACGGGGA 


9900 


CTTTTCTTTA 


GATACTCATT 


TGAATGATTC 


AGATGGGCGG 


GGCCCAGGAG 


ATTGGGTGAT 


9960 


TGATATTCCT 


ACACAAGAAG 


ATTTGCCGCC 


TGTAGTGATT 


CCAATTGTCC 


CAGATACCGA 


10020 


ACAACAAATT 


GATAAACAAG 


GCCATTTTGA 


TCGAACGCCC 


AATCCTAGTG 


CGATTACTTG 


10080 


GACGGTAGAT 


ATCAATCAAG 


CGATGAAAGA 


TCAAACAAAT 


CCAACTGTGA 


CGGAAACATG 


10140 


GCCAACAGGG 


AATACCTTTA 


AGTCCGTGAA 


AGTCTATGAG 


TTAGTGATGA 


ATCTTGATGG 


10200 


AACAATTAAA 


GAAGTGGGTC 


GCGAACTTAG 


TCCAGATGAA 


TATACCGTTG 


ATAAAAATGG 


10260 


CAATGTGACG 


ATTAAAGGTG 


ACACCAACAA 


AGCGTATCGT 


CTTGAGTACC 


AAACGACGAT 


10320 


TGACGAGGCG 


GTTATTCCAG 


ATGGCGGCGG 


CGATGTGCCT 


TTTAAAAATC 


ACGCGACGTT 


10380 


AACAAGTGAT 


AATAATCCAA 


ATGGGTTAGA 


TGCTGAAGCA 


ACTGTTACCG 


CCACATATGG 


10440 


CAAAATGTTA 


GACAAGCGCA 


ATATAGATTA 


CGACGAAGCC 


AATCAAGAAT 


TCACTTGGGA 


10500 


AATTAACTAC 


AACTATGGTG 


AACAAACCAT 


TCCAAAAGAC 


CAAGCAGTCA 


TTACAGACAC 


10560 


AATGGGGGAT 


AATTTAACGT


TTGAACCAGA 


TTCTTTACAT 


TTATATTCAG 


TGACATTTGA 


10620 


TGACAAAGGA 


AATGAAGTCG 


TTGGAGCAGA 


ACTTGTGGAA 


GGAAAAGATT 


ACAAAGTGGT 


10680 


AATCAACGGA 


GACGGTTCCT 


TTGCAATTGA 


CTTTTTACAT 


GATGTGACTG 


GCGCAGTCAA 


10740 
GATTGATTAT


AAAACCAAAG 


TTGATGGAAT 


TGTCGAAGGC 


GATGTTGCCG 


TGAATAATCG 


10800 


TGTGGATGTT 


GGCACTGGTC 


AGCATTCAGA 


AGATGATGGC 


ACAGCCAGTC 


AACAAAATAT 


10860 


TATTAAAAAC 


ACTGGTGCAG 


TTGATTATCA 


AAATTCAACG 


ATTGGTTGGA 


CGTTAGCTGT 


10920 


GAATCAAAAT 


AATTATTTGA 


TGGAAAATGC 


CGTGATTACG 


GATACGTACG 


AACCAGTTCC 


10980 


TGGCTTAACT 


ATGGTACCCA 


ATTCGTTGGT 


TGTCAAAGAT 


ACAACCACTG 


GTGCTCAGTT 


11040 


GACGTTAGGC 


AAGGATTTCA 


TGGTAGAAAT 


AACTCGTAAT 


GCAGATGGTG 


AAACAGGCTT 


11100 


TAAGGTAAGT 


TTTATAGGGG 


CGTATGCCAA 


AACAAGTGAT 


GCCTTCCACA 


TAACTTATAC 


11160 


TACCTTTTTC 


GATGTTACCG 


AGTTAGACGC 


TAACAATCCT 


GCGTTGGACC 


ATTATCGAAA 


11220 


TACCGCTGCC 


ATTGATTGGA 


CGGATGAAGC 


AGGAAACAAT 


CATCATTCAG 


AAGATAGTAA 


11280 


ACCGTTTAAA 


CCTTTACCTG 


CTTTTGATTT 


AAATGCGCAA 


AAAAGCGGTG 


TTTACAATGC 


11340 


CGTCACCAAA 


GAAATCACTT 


GGACGATTGC 


GGTTAATTTA 


AGTAATAATC 


GTTTAGTCGA 


11400 


CGCCTTTTTG 


ACGGATCCAA 


TTTTAACCAA 


TCAAACCTAT 


TTGGCTGGGA 


GCTTGAAAGT 


11460 


CTATGAAGGC 


AATACAAAGC 


CAGATGGTTC 


GGTTGAAAAA 


GTGAAACCAA 


CGCAACCGTT 


11520 


GACGGATATC 


ACAATGGAAG 


AACCAAGCGA 


GAAAAACCAA 


AATACTTGGC 


GTGTTGATTT 


11580 


TCCTAATGAT 


AGTCGTACGT 


ATGTGATTGA 


ATTTAAGACG 


TCTGTTGATG 


AAAAAGTTAT 


11640 


CGAAGGTTCG 


GCTAGTTATG 


ACAATACCGC 


ATCTTATACA 


AACCAAGGTT 


CTTCACGTGA 


11700 


TGTGACAGGA 


AAAGTTTCTA 


TTCAACATGG 


TGGCGAATCA 


GTGAAAAAAG 


GTGGCGAATA 


11760 


CCACAAAGAT 


GATCCAGATC 


ATGTGTACTG 


GCATGTAATG 


ATCAATGGCG 


CCCAATCGGT 


11820 


TTTAGACGAT 


GTGGTTATTA 


CTGATACACC 


CTCACCAAAC 


CAAGTGCTAG 


ATCCCGAGTC 


11880 


ATTGGTGATT 


TACGGTACCA 


ACGTAACAGA 


AGACGGAACT 


ATTACGCCAG 


ATAAATCTGT 


11940 


TATTTTAGAA 


GAAGGAAAAG 


ATTACACACT 


GGaAGTTACC 


ACCGATAATG 


aAACAGGACA 


12000 


ACAAAAAATT 


GTCGTTAAAA 


TGGCCCATAT 


TGAAGCACCT 


TATTATATGG 


AATATCGTAG 


12060 


TTTAGTGACT 


TCTTCAGCGG 


CGGGGAGTAC 


AGACACGGTA 


TCCAACCAAG 


TGTCAATTAC 


12120 


TGGAAATGGT 


TCAGAAGTCG 


TTCATGGGGA 


TGACAATGGC 


GATGTGGTCG 


TTGACATTGA 


12180 


TCACAGTGGC 


GGGCATGCCA 


CAGGGACTAA 


AGGCAAAATT 


CAGCTGAAGA 


AAACAGCCAT 


12240 


GGATGAGACG 


ACTATTTTAG 


CAGGCGCCCA 


TTTCCAAATT 


TGGGACCAAG 


CTAAAACACA 


12300 


AGTCCTACGT 


GAAGGTACAG 


TAGATGCCAC 


CGGGGTTATC 


ACATTTGGTG 


GGTTGCCACA 




AGGGCAATAC 


ATTTTGGTGG 


AGACAAAAGC 


ACCAGAAGGC 


TATACAGTTT 


CGGACGAATT 




AGCTAAAGGC 


CGAGTCATTA 


CTATTGATGA 


AGAAACTTCA 


GCCGAAGGAG 


CACAACCAAC 


12480 


CATTATTAAA 


AACGATGTCA 


ATAAAGTATT 


TTTAGAAAAA 


ATGGATGAGA 


AGGGTAAAAA 


12540 


GTTAGTCAAT 


GCTCGCTTTA


AATTAGAGCA 


TGCCGTAACC 


ACGCCGTTTA 


CTCATTGGGA 


12600 


AGAAGTTCCC 


CTTGCGCCGG 


ATCGAACCAA 


CGCGAATGGC 


CAGTTAGAGG 


TGGATAGTTT 


12660 


AAAACCAGGG 


CTTTATCAGT 


TCACAGAAAT 


CGAAGCACCG 


ACAGGCTATC 


TTTTAGACAC 


12720 
GACCCCCAAA


CGATTCATCG 



TGACACAAAA 


TACGAGCGGA CA£ATTCGTG 


A T TP A T CI T 


12780 


CAAAAXGCTT


AATTACCAAG 



GTTCTGCTbA 


ACTAATTAAA 


AAAGACCAAG 


C A CCC A A TCC 


12840 


ATTAGCAGGT 


GCTGAATTTT 


CAGTCCTTGA 


CACCACAGGA 


CAAGCAGTTC 




12900 


AGTTTCGGAT 


GCAAACGGAA 


AAGTCACAGT 


GACGGATTTA 


GCCCCAGGAA 




12960


TGTGGAAACC 


AAAGCGCCAG 


CAGGGTACCT 


TTTAAACACT 


GAACCAAGTG 


LI 1 1 Unhurt I 


13020


TGCAGCAAGC 


GATCGGGGCA 


AACCAGCAAC 


AGTTATAGCA 


ACGGCTAACT 


TTb 1 1 AAb 1 A 


13080


TCAAGGCACG 


GCTAAATTAA 


TCAAAAAAGA 


TGTGAATGGA 


CACTTATTAA 


GTbb X bbbAb 


13140


ATTTAAAGTG 


CTTGATGCGA 


AGGGAGAAAC


GATTCAAACA 


GGCTTGACGA 



LAAA 1 AA i LA 


13200


AGGGGAAATT 


GTTGCAGAGC 


ACTTAGCCCC 


AGGAAAATAT 


CGCTTTGTAG 


AAACLAAAbb 


13260


GCCAACAGGC 


TATTTATTAA 


ATACCACGCC 


AGTCCCATTT 


GAAATTGCTG 


AG AAAAA 1 bb 


13320


TGGTAAACCA 


GCGGTCGTGG 


TTGCTAGTGA 


CAACTTTGTG 


AGTTACAAAG 


GGGCTTTCCA 


13380


AATCGTGAAA 


ACGAATAGCG 


CAGACCAACC 


ATTAGCAGGT


GCTGTTTTTG 


AAl 1 Al AlbA 


13440


TCACAATAAA 


CAATCATTAG 


GGATTACAGC 


AACGAGTGGC 


AAAGATGGcA 



AAAT 1 Albl 1 


13500


TAGAGACTTG 


GCGCCAGGTA 


CCTATTATTA 


CAAAGAAATC 


AAAGCACCAA 


AAI lALbAbA 


13560


TGGCGCAGAT 


TATATTATTT 


ATCCTGAATT 


AGTAAAAGTA 


GAAATTCGTG 



GTbAl 1 1 bAA 


13620


AGGTGATCCG 


GAGATTTTCC 


AATTAGGGGC 


CTTCGCCAAT 


TTCAAAGGAC 


GCbbbblbl 1 


13680


TAAGAAAATT 


GATGCCAATG 


CGAACCCACT 


TCCAGGAACG 


ATTTTTAAAT 



Tbi AlbbAAl 


13740


CGAAAACGGG 


GAAAAAATCT 


TTGAAAGAGA 


AGTAACTGCT 


GAAAAAGATG 


b 1 I LAI 1 bbL 


13800


TATGGAGGAT 


TTAGGTGCTG 


GTAGCTATGA 


ATTAGATGAA 


CTGGATGCAA 


bbbA i bbb 1 A 


13860


TATCGTCAAT 


AAACAACCCA 


TTTATTTTGT 


AGTGAAGAAG 


AATTCAAATG 


jvrnjv TV 7A J'" 7A A /T" 




ACTAGATGAG 


TTAGAGTTTG 


TAAATTATCA 


AGCAGAAGTA 


ATGGGACGTA 


AAb 1 L AALoA 


13980


GCAAGGTCAA 


ACCTTAGCGG 


GTGCAGTTTT 


TGCAATTTAC 


AATGCCGATG 


AbbAb AA 1 LA 


14040


GCCCCAAGGT 


TCACCGATAA 


CATTCTTGAA 


TCGTGCAGGA 


GAAAAAGTTT 


L 1 b AAA 1 AAL 


14100


AACGGATAAG 


ACTGGCGAAA 


TTTACGCTAA 


AGGGCTAAAT 


GAAGGGCATT 


ALo 111 Inul 


14160 


GGAAACGAAA 


GCACCAACAG 


GCTATCTGTT 


AGACACAACG 


CTACATCCAT 


TTC ATP.TAAP 


14220 


CGCCCAATTA 


GGAAAAGAGC 


AGCCAATTGC 


TTTAGGCGAT 


CTTATCAATT 


A 1 LnnburtnL 


14280 


TGCTCAATTA 


ACCAAAGAAA 


ACGAAACAGG 


TGAAGCATTG 


GCAGGTGCGG 


1 o 1 1 1 Anoy l 


14340


CATTGATGAA 


ACAGGGCAAA 


CCGTAGATGG 



ACAAACCAAT 


CTGATGTClb 


apaappa app. 


14400


CAAAGTCATT 


GCGAAAAaCT 


TAGCACCGGG 


AACGTATCGT 


TTTGTGGAGA 


CaCAAGCGCC 


14460 


AACTAGCTAT 


CTTCTTAATG 


AAACGCCAAG 


CGCAAGCTTT 


ACGATTGCCA 


AAGACAACCA 


14520 


AGGCAAACCA 


GCCACTGTGG


TACTTAAAGC 


ACCTTTTATT 


AATTACCAAG 


GTGCTGCCAA 


14580 


GCTGGTGAAA 


ATTGATCAGC 


AAAAGAATGC 


CTTAGCAGGT 


GCTGAATTTA 


AAGTGACAGA


14640 


TGCAGAGACA 


GGGCAAACTG 


TCGCTCGTTC 


ATTACGTTCT 


GACAACCAAG 


GGTTAGTTCA 


14700 



WO 98/50555 

AGTGAACCAC TTACAACCAG GAAAATATAC GTTTGTGGAA AGAAAAGCAC CGGATGGTTA 14760

CCAACTGTCT AAGCAAGCTG TCGCATTCAC TATXGCGGCA ACAGCGAAAG ACAAACCTGA 14820

ACTCGTGAAT GCGGGCACGT TTGTTAACGA GAAACAACCT GTATCCAAAA AAACAAAACC 14880

AAATCAGCCA ACAACGAAAC AAGCAGCTAG AGAGACAGGT TGGCTTGGTT TACCGAAAAC 14940

CAACACACAA GTCAATTACT TCTTTGTCTT TATCGGCCTC ATGTTGGTCG GTTTGGCAAG 15000

TTGGCTCTTC TATAAAAAGA GCAAGAAATA AGCAAAAGCT ATAAAAAAAC AGAAAGrGCC 15060

TGGAACAAAA ATCACTTTGG ATTTTTGTTC CAGGCyCAAA ACCTTATAAA CGGCGGTAAC 15120

AGAACCAACT CCTTCGGAAA TAAGCCGAAA TTCTCCAAAA ATTAAAGAGC AATTTTCGGA 15180

AATTTCTTCT TATTTCTCGG AGCTAAACGG TTCTGTACCG ACCTCTTTTT TCATCTTAAG 15240

CTAATCAACT GAACAAAAGC AGCTCCTTCT TATTGACATT CTCAGGTAAG CCTTCATCTC 15300

AGTTTACCCT TTGAACAGAA CTTACTGGGT TCTTGGAGAT AACCGCTTTT GCCTCAGCCT 15360

TTGTTTTTCG ATGCTATCTT TAGGTATTTT GGCTAAAATA TTGTACACTT TACATGTGCA 15420

AGAAATTTAG TTAGAGATAT GAGGATAGAA TGATGAAAAA GATGATGGTT TTATACAACG 15480

AAACTTCAGG AAGTAGTGAA AGTAAAGAGA TTGCAGAACG CTTTAAAAAA GCTGCAGAAG 15540

CGAGAGGGGA AGCGGTTATT TTGCAACCTT CCAATCCAGA CATTGATCCT GAAGAAATGC 15600

GGAAAAATGC CAAAGAAAAT CAAGTGGGTG TCTTGGTCGT CATTGGTGGT GACGGGACCA 15660

TTCATCATGC CGTTCAAAAT TTTAAAGACA CCATTCGTGA CTATCAAATA GGGATTATTC 15720

CAGGTGGAAC AGTCAATAAT TTTGCACGTG TGTTAAGTAT TCCTTTGAAA GAAGAAGACG 15780

CCTTTGAGAC CATTTTGGCA GGACAGACCA CACCAGTTGA CTTTGGGATG GTAAATCAAG 15840

ATGTGATGAT TTCGACCTTA ACGATTGGAC TATTAGCAGA TACAGCAGCA AATGTGACCC 15900

AACAGGAGAA ACAAAAATAT GGTCCTTTAG CCTTTACAAA GCAATTTTTC CGCTTATTAA 15960

TGAAAAAGAA AAAATACAAA CTAAAAATTG ATGGCGATGA AAGACGTTGG CATGGCAAAG 16020

CCCAATTATT AACGATGACC ATGACGAATT CTGCTGGTGG CTTTACCAAT TTCGATGCCA 16080

ATGctACGCC AGATGACGGC GAAGTCCACA TTATCATTTT ACCTAAGTTA GTTTTTTACA 16140

AGTTTGTCTA TTATTTACCT AAAATTATCA GAGGACAATT AAATGAAATT CCTGGTGTCG 16200

TGTATTTCTC TGGGAGTCAA TTTAAAATTA GTGGCGAAAA AGACAAGAAA GTCCAAACAC 16260

GAACAGATGG CGACCCTACT GATGATTTAC CGATTAAAGT AACGGTGGAA GCTGGGGGAC 16320

TCAACGTGTT TGTTCCTGAA ACGTCCACAT CTACGAAAGA ATAAAAGAAA CAACTGTTAT 16380

TTTTTCGTGG AATAGATTTC TCATTTTGTG TACAATAGAA CAGAAGAGAG GCAGTTGAGT 16440

GAACAGTGAA ATGGGATGAA TATGTTGATA TATTCGATAC AGATGTCGTT TCTAAAATTA 16500

TTATTTTAAG AAACTTGCCA ATTAATGAAG GCTGGATTGA CATCAATGAG TTAGCAGAAG 16560

AATTACAATT AAACAAAAAA TCAGTCAAAA AATACGTAGA ACTTCTAAAA GATGACATTA 16620

CGTATTATGT TGTGGATAAT GATGCTTCTT TGGCTTTTGA AAAGGGGCAT GGCTATCGCT 16680
TGAACTTGGC GTCTAGCCAA GCGTTCCAGA AACTTTATAC AGCAATTTTT CGGGATAGTT 16740

TAACTTATCA AGTAATGGAA GATTTATTCT GGGGTCGATT TAATGGAATI GTGCCAGCGA 16800

CAGCAAAGTA TCATACTAGT GATGCCACGA TTCGGCGCTT GCTCAAAGGG TTTACAGAAA 16860

TTTTAGCACC ATTGGGTTTA GTAATTAATA AAAGTACTTG GAAAATTGAA GGCAATGAAG 16920

CGTATTTGCG CCATTTTGCT TACACTATTT TTATGGACTT AATCCGTGAT ACTTGGCCTT 16980

TTGACTATAT ACAAGAA^CG GAAGTCGCAC AAACCGTCCA AAAAATGATG GCTTTTTTCA 17040

ACGTAGAAGT TGCGGAAACG GTGACACGCC GAGTTAAATA TATGGTCGCT ATCTCAAAAG 17100

TCCGTAGTGC GCAAGGCGCC GAATATCGAC CTTCAGCCGA GCAAGAAACT TATTTAGAAG 17160

GTAATAAACA TTACCAAGCC TTTATAGAAG AGATGACAGC TACAACCCTT TTCCTTAACA 17220

AAAACGATTT AACTTTTGTG TACTTTTTCT TGTTAGCCAA TGACCAATTT TATCAAGATG 17280

AAGCCACTGC ACAAGCAATT TTAGCGCATT TTGATGAGAC TGAAGCACCT GTTTACATGT 17340

TGACTCATTT AGTCCAACAT TTCTTATTAG AAGAAATACA CATGACGCCG CAATTGGAGA 17400

AAATTCGTCC GCAAATTTTT TATTATTTGT TTGCTACACA TTTATTTGCC GAACTTTACT 17460

ATGTCCAAAA TGTGAAAATG TACAATGTCT TTTGGAATAA AATCAAAGAA AAATATCCGG 17520

TTCTTTTAGC AGAGTTAATT CAAGGTTTAA ATCAACTTTA CGAACAAACA GGCAATGAAT 17580

TATTTACAAA TAAAGAATTT TTGGCTTTGC GTTATTCTAG TGTTTTGGCG CTTTCCAATG 17640

ATTTAATTCA TCGCGAACAA AAAATTGTTT TAGGGTTAAA ATCAGGCTTG CCAGTACTGG 17700

AAGAAGAACG GGTGAAACTT TCGATTGAAA ATAATTTTCG TCGGTTTTAT GCATTGGAAG 17760

TTTTACGATA CAGTGAAGCA AGCAAAGAAG AATGGCAAAA TATTGATTTA CTATTAGTGA 17820

CTTCTTTGGC AGATGTGGAA ACGGGCCATC CAGAGATCTT TTTATTTTCA AGTGAGTTAA 17880

CGTTTGAGCA ATATAATCGT TTGAACCAAA AAATAGTAAA AATTAACGAG AAAAGAACCA 17940

CGTCACTTTA ATAAAAAATA GGCTATTTTG ATGCACATAT CTGATAAATT GTAAAAAAAT 18000

TTTTCTAAAA AGTAAAAAAA GACAAGAAGA AAACGTCAAT AAGTAGGTGT TGATTACCTA 18060

CTTATTGACG TTTTCTCGTT GTTTTTTAAT AAATACTGCG TCAATTCTTC TGAAAATTAT 18120

GTCAAAAAGG TACTTTTTTT GCTTTTTTTA TCGTAAATAA TAGATGATTA GCTATTGATA 18180

GCAAAGGAAC TACTACTTTT TTTAACAGTC AGTTAAAAAA TATTGAGGTT TCTCACAAGA 18240

ATTTTTCTAT ATAATAGAAA GAGCAAGGGA AGGAAGGCGG GATAAGATGA AAAAACAGAT 18300

GGAACAAATT GAAGAAATCT TAAGTTGTGA AGAAAATAGT GCAGGCATCC GATTAAAGGA 18360

ATTGGTTGAA GCTTTAGAAT TGGAAGTCAC AAATCAAAAC TTGTTAAAGG TAACAAGTAT 18420

CTTACATATG AATCCAAAAT TCAAAAAAAT TTATGCATAC GAAGACAGTC GTGTTATTAC 18480

GCTGTATCAA CTTTTACAAA ATAAGCCGTT GGAAGTAACG GAATGAAAAG AGGGAAACAC 18540

ATGAAGAAAA CAATGCTCAT TCTTCTATTT GGTTTATTGA CAGTGGTTGG TCTGCCAATG 18600

GTGACCGAAG CGGTTGAACC AGTTAACGCA ACAGATACAA CGATTTTTGG TGCGCAAGCG 18660
ATGGTGCCTA


ATAGCACAGA 


AGATCAAACA 


GATAAGTTAC 


AGACGTTGCT 


GAGTCAGACA 


18720 


GCAAAGGAAG 


GACGGGCATT 


ATTTTTACCA 


CAGGGAAGTT 


ATGCTCTGTC 


AAAAGATATT 


18780 


GCGATTTCTT 


CAAATTATCA 


ATTAATTGGC 


GATACAACAG 


GAGCCACTAT 


TTTACATAAT 


18840 


GCTACAGGTG 


CACCGATTCA 


GCTAACAGAC 


ACAACGTACG 


GGACGAAAAC 


CAATGTACGA 


18900 


CTTCAAAATA 


TTGCTTTTGA 


CGGCATCAAT 


GTGACCTTAA 


AATTGACGAA 


TCAATTAACG 


18960 


CTAGCAAACA 


ATATCTTTTA 


TAATCCATTA 


AAAGGATTCG 


TGGTCAATTT 


AAATGCAGAC 


19020 


ATTGGCGTTA 


AGATTTCAGG 


CAATATTTTC 


ATGCGAGACA 


CGGCACATAT 


GCAATCGGGA 


19080 


GGTGATTTTA 


ACCGAGCGAT 


TTACATTGGT 


GGTTATTCAA 


CGCCAAGTCG 


CTTCCAGTAT 


19140 


ATGTCAGATG 


TAGACATCGT 


GGACAATCTA 


TTTGGTTTAA 


AAGTGACAGA 


ATTAGATGCG 


19200 


ATTAAAAGTA 


CCAGTCGTTC 


AGACTTAGCA 


GCGACTATTA 


CACGTTTGCA 


AACAGCCATT 


19260 


GAAGCCGGTG 


CTATTTCAGT 


ACCTAATGAA 


CAAAATTATC 


TATCCACCGG 


GGTTAATTCT 


19320 


TTTAATATGC 


TGAAGGATGT 


AACTGTTCAA 


CATAATTTTT 


TCTATTCACC 


CTATGATAAT 


19380 


GAAAACTTGA 


ATGGTCTAGT 


GGGAGATCAT 


GCAATTTATT 


TCCGTGGCGC 


CCAAAACATC 


19440 


ACGGTAGTAG 


GGAATCATTT 


ACGCGGACTA 


CAAAATGGTC 


CAGCTGGCGG 


ATTTAAGTTT 


19500 


AAATCAGGCC 


GTAATATTAC 


GATTATGAAT 


AATTATTTGC 


GAAATACAGG 


TCTCATTATG 


19560 


TATGGAACAC 


CAGAGATCGG 


TTTGGCAGAA 


ACACAAGCAG 


AGGGGGCTAT 


TTCAGAACTC 


19620 


TCTAACTGGT 


TAGTGGCTAA 


TAACATTTTT 


GATTGGAAGT 


ATTGGGACAA 


TCAATATGCA 


19680 


ATTGGGATGG 


AATACAATCG 


ACATACAGGA 


AATAACAATG 


TGTTCAATGG 


CGTATTTATC 


19740 


AACAATCAGT 


TTGTTAACTA 


CCATAATATT 


CCGCAAAACC 


GACGTCGGGA 


ATTGTTAATC 


19800 


GCTAGTGGCG 


GTGGGTTCCG 


ACCAGAAACG 


AGTTTTGTGA 


AAGACAATAC 


TCGAGATGAT 


19860 


GGCCTTAAAA 


ACGGCCAATT 


ATTGGTAGAA 


AATTGGACAG 


AGGCGGACTA 


TCGCTTAATG 


19920 


CCAGCAACAT 


GGGAATCATT 


AATTTCACCA 


ACACTTTATG 


AGCAATATAA 


AAATACGCCA 


19980 


ATTCCGGTTA 


GAAATACTTT 


AGCAACCCCT 


GTGGCTACAA 


CGATTGTGCA 


AGGACAATCA 


20040 


ATCGATCCGC 


AGCAACTGGT 


TGCTAATACA 


AATGACGCGG 


ATGAAGCAGT 


CCCAGCCGCT 


20100 


AAAATCGTCA 


ATCCAGAGGT 


CTTAAATGAA 


ATCGGCCAAC 


AAAAAGTGAC 


TGTTCAATTA 


20160 


ACTTATGAAA 


CAGGGTCATT 


GGTAACAGTG 


AACGTACCGG 


TGACGGTAGA 


GGCCCCTGCA 


20220 


AAAAAGCTTG 


ATTTGTCCCA 


GCTACAAACC 


GTTTATGCTT 


CGATTGGTGA 


AGCAAACCAA 


20280 


TATACAGTTT 


ACTCTTGGCA 


GCTATTCACT 


GCCATTGGGC 


CTAAAACAAT 


TGTTCCTTCC 


20340 


TATTATCAGC 


AGGCGACACA 


ATTGTTAGCA 


GAAGGTCAGG 


AGAGTCAGGA 


CAAAACCCAA 


20400 


GAGCAAGTGG 


ATCAGTTAAC 


AAGTAATTTG 


CAATCTGCAA 


TGAAAGTATT 


AGTCAAAAAA 


20460 


GCCGATATCA 


CGCTAGAGCG


TACAGAAGCA 


GAAAATGAAC 


TGGCAAGCGT 


TCATAAATTA 


20520 


GACGAATCTG 


TATATACCAA 


AGACTCTTGG 


CAAGCCATGC 


AAGAAGCCTT 


AATAGATACA 


20580 


ACAACTGGCG 


AAGGAAGCTC 


TAAGCAGTTA 


CAACAACTGC 


TTGCTTGGTC 


AGATGAAGAA 


20640 
PCT/US98/08985

TTATTAGAGC CAACACTTGG TGGCTTTAAA ACGCCTGCAG ATGCACAAAa GCGTATTAAT 20700

CAATTAACAC AAACCATAAA AACGGCGTTA CTTCTGTTAG TTGAAAAATC AACAGAGACA 20760

ACTAGTAACA CAAGCGAAAG CAGCACCTCA AGCACAACCA GTGAAACCAG TAATACAAGC 20820

GAAAGCAGCA CCTCAAGTAC AACAAGTGAA ACAAGCAATA CAAATGAAAG CAATACACCA 20880

AGTACAACCA GTAAAACCAG TAATACAAGC GAAAGCAGCA CCTCAAGCAC GACAAGTGAA 20940

ACAAGCAATA CAAATGAAAG CAACACACCA AGTACAACTA GCGAAACCAG TAATACAAGC 21000

GAAAGTAGTA CCTCAAGCAC GACAaGTGAA AGCAGTAGTA CCAGCGAAAG CAGCACGCCA 21060

aGCACGACAA GTGAGAGCAG TAGTACCAGC GAAAGCAGCA CyTCAAGCAC GACAAGTGAA 21120

ACCAGTAATA CAAGCGAAAG CAGCACGCCA AGCACGACAA GTGAGAGCAG TAGTACGAGT 21180

GAAAGCAACA CGCCAAGCAC AACCAGCGAA ACCAGTAGTA CCAGCGAAAG TAGTACCTCA 21240

AGTACGACAA GTGAAACAAG CAATACAAAT GAAAGCAACA CACCAAGTAC AACCAGTAAA 21300

ACAAGTAGTA CCAGCGAAAG TAGTGCTTCA AGTACAACCA GCGCAACCAA CAATACAAGT 21360

GAGAGTAGCA CCCCAAGCAC TGTAAACGAA AGTAGCCAAA GCAAAGGCCA GAATTCTGTC 21420

ATTTACGCGG TAGAGTCTAA TCAAGATCCT AATGACGCTC AAAGTAATTC AAAGCCCTCT 21480

GCCAAGGCTA GTCAAACAAA GGAATCAGTA GCAGAAAACC AAGCAACTAA ACAGATACAG 21540

ACAAATCAAG AATCGTCAGG AACAGTCAAA AAAGCGGATA ATACGACCAA AATTGCCAAG 21600

AAAAAATTCC CCAAAACAGG GGAACAATCT TCAGCAGTTG GGAGCTTCCT AGGCCTGAGC 21660

TTTTTATCGT TAGCCATAGC AACCTATTGT TTTAAAGTGA AAAGATAACT CTTTAAAAAC 21720

GGATAAGTAA ACATCCTGAA ACAAGAAATA GGACTTGTTT CAGGATGTTT TCATATAAAA 21780

TTAAAAGAAC TACAGTTTTT AGTGAAAAAA CACACAAAAG TACTTGAAAT TCTGTGATAT 21840

AAGGGTAAAG TAAAGGAGGA CAAAAAACAA AGGAAGTGAA ATGTTTGTAT ACACCAGATG 21900

AAATTTTAGG TATTAGTATT GAAAATGGTC AGAAAAAAAT TCAAAAGCCT TTGGTAGCAA 21960

AACTCATTTT AGGATTTATC GGCGGTGCCA TGATTTCCCT CGGCTATTTG GCTTATGTGC 22020

GTGTTTCCGC CTCTATTCCA GCAGATTTAG CCAGTGTTCA AGCTTTAGTG GGAGCTGCGG 22080

TTTTCCCAAT TGGTCTAATT GTTATTTTAA TGGCTGGTGG TGAGTTAATC ACAGGAAATA 22140

TGATGGCGGT GAGTACGGCT TGGTTTGCTA AAAAGGTCTC TTTCAGAGAA TTACTCGTGA 22200

ACTGGGTCAC AATTACATTG GCAAATATGG TTGGCGCCCT ATTTGTTGCC TATTTTTTTG 22260

GCCATTATTT AGGTTTAACC AGTAGCGGCG CCTATTTAAA TCAGGTTCTT TCTCTTGCAG 22320

AACATAAAGT TGCAGCAACT TGGCTTCAAG CAATTATTTC AGGTATTGGC TGTAACTGGT 22380

TCGTAGGTCT GGCCTTATGG CTTTGTTATG GTGCGAAAGA TGCAGCGGGG AAACTATTAG 22440

GCATCTGGTT TCCAGTAATG ATTTTTGTTG CAATTGGCTT TCAGCATAGT GTGGCGAATG 22500

CATTCATCAT TCCAGCCGCA ATTTTTGAAG GCGGCGTTCA GTGGGGAGAC TTTTTCCGTA 22560

ATTTTATTTT TGTTTATTTA GGTAATATCA TTGGTGGTGC CATTTTTGTT TCTGGCTTTT 22620
ACTATTTAAG CTATCGCAAA CATTAAATCC ATTTAGCAGA TAAAAACCTA CACAAAGGCT 22680

AAATGCCTJT GTGCAGGTTT TTTCTTTTTA AACAAAAAGG AAAACGGTTG AAAAATTATT 22740

GCTAGATTTT TAAAACAATG TTACATTAAT GTTACAAAAA GATTTCTGGA GGTAACGAAA 22800 

AATGAAAAAA ATTGCTTTGT TCAGTATGTT AACGTTCAGT GTATTGTCTT TAAGTCTAGC 22860

AGGATGTGGA AACAAAAAAA CAGCAaGCAC AAATGATTCT AAGCCAAAGC AAGAAACAAA 22920 

GAAAGCCACG CAGAAATCCT CTAGCCAACA AGAAATGAAA AGTAGTCATT CGTCTGTCAC 22980 

GGGTCAAAAT TCTAATGTGA CAGGGGAAAA TCCGTCAGAA AATGCCACGC AGCCTTCTGC 23040

AGGAACTGAT GAAACGAATG AAGTCCCTCA AAACCAAGCA CCTGATACAA ACATTACAAT 23100 

TACCAATGTT GTTTTCAATC CTGAAAGAAA TGAAATTAAT GGTACTACAT TACCTAATGC 23160 

AACCATTACA GCAACGGTAG TCGGTGATGC TTCTGCACAA GCAGGTGTTT TTTATGCGGA 23220 

TGCCAATGGC AATTTTACAG TAATTAGTCC CAGAGCGGGA GCGACTACTC AATTAATCGC 23280 

AACCGTTGAT CAACGGAATA GTGCACCTGT CCAAATTGAT ATTCCAAGTT CAGGACAAGA 23340

AGCAGCGCTT TCTTTTAGCA ATATTACGAT TGATCCGAAA CAAGGGACAA TTTCTGGTAA 23400

AACAGCACCG AATGCAACTA TTTTAGTGTC ACGTGCAGAT GATGCGCGGG TGATTTTAGC 23460

AAGTTTTACT GCGGATGCCC AAGGGAATTT CACAGCCAGT AATTTAGTTC CCGGCACAAA 23520 

AAATCGCTTA GATGTTACGT TAAATGGAGA AATAGGGACA CCTTACTTGT TTGATTTACC 23580 

AAATTAAAAT AGAAGAGAAG TGATCAAAGC CACCAACGCT AGTTAGCGTT GGTGGCTTTG 23640 

ATGGTTTAAG CTTCTTCTTT TTTATGATTG CGGATATGTA AATAGATGGT ATTTTTTGAA 23700 

ATACCTAATC GATGGGCAAT CTTTATGACA GAATCTTTGA GATTAAAAAT TCCTTGATCA 23760

TATAAACGAA TAACAATTTC TTTATTTTGA TTCAAATGTG ATACATTTGG ATCGCTTTCT 23820 

ACTGCTTGTT TTACTTTTGT CAGTGAAGTA GTGATTAATT CGTCAATATT GTCAGAAAAA 23880 

TGTTCAGAAT TTTTAAGCTC TTGGGACATA TCATTTTGTT GTCCTGAAGG TAATAAGTCT 23940 

TGCAAGTAAT CAATTAATGG CATTTCTAAA TGCATATTGA TACAGATCAA TCCAATAATA 24000 

CGATCATTTT CACCGATAAC TGGAATGGTG GAAGATTTTA AAATAACCCC TTTTTGATTT 24060 

TTATTGTAGT AGGGTTTCAC AGCGTAATCA TGTTCCGACT CTACTTGACT TAACATTTTT 24120 

AACGCCATAT CAGTAATGGG GGCACCTTCt TGCGGTTACT GAAATGGCCA TTGATGACTT 24180 

TGATAGCCGA ACTTTCCAAA TTTTCTAAAC TATGAAGAAT GACTTCATGG GTACTACCAA 24240 

AGAAGTTCCC TAATCCATCA AGCATTAAGG CATGTGAATG TAAGATTTGT TTATCTACTT 24300

CACTTAAACG TAAATATGAC TCCaAGTGAT ATCCTCCCCT ACACTGAAAT TAATTTTAGT 24360

GTAATACTTT TTTAATGGAA AAGCAATAAG ATTCTTTACG TtAAAAAAAc nCaTtGACTA 24420

AAcTTTkGTT TAGTCGTATA CyCkGAATGT GGAATGAAAT TTTTGTTACT CATTGTACTT 24480

TAGTTCATCG GAATGCTCGC TCATTCTTTC ACTTGTTGaG CaATAAAAAt TTTaTTCAGG 24540

AGTACCATCC GTTATAAAAA AATATAGTTG GAGGCAAAGA AATGAAAACG TTATCAGGAA 24600
TCTTGATGTT AGTCTTAGTT TTATGTGGCA TGTCCCTATT CTCTTTTAAA GCACCGCGAG 24660

GGAAAAAATC TTTAAGTGCT TTGTCTGGTG CACCTXGJGC CACCTTTCTA CCAGAGGCGT 24720

TTTTGCGGTA TGCAGTAGGC GGCGTTTTTC ATTTAGATTA TGTGTCGCAA ATTGGTGAAA 24780

CTATGGGTAG TCTAGGTGGT TTAGCAGCTG GAGCTCTTGT GCCGCTTGCT TTCGGGATTA 24840

GTCCAGTTTT TTCAATTATT TTAGGTCTGT CATTGATTAA AGTCAGCTTG TTACCAGCCT 24900

TTATTACCGC GTATCTTTTA TCTTTTGTTG TTGAGCAGAT GCAAAAACGT ATTCCAGAAG 24960

GGATTGACTT ATTAGTCGTT ATTTTAGTCA TTCCAGTCGC TGCTTCGTTA GTTGCTGGTT 25020

TTATTCAACC CGTTGTATTA GGTGTCCTAG AAGTTATTGG GGGCACGATT TTAAGTGCAG 25080

TAGATGGCAA TCCATATGTG ATGGGCGCAA TTTTAGGTGC TATTATCCCG ATTGTTGGTA 25140

TGACACCGTT AAGTTCAATG GTTTTAACTG CCTTGATTGG TTTGACTGGT GTACCAATGG 25200

CTGTCGGTGC GTTGACTTGT TACGGCAGTT CCATTGTCAA TGCGGCGCTA TTTAAAAAGT 25260

TAAAACTAGG CACAGCTTCA ACCCCGTTAG CTGTGGCAAT TGAGCCATTA ACACAAGTCG 25320

ATATCATCAG TTCCAATCCA ATTCCTATTT ACGCAACGAA TTTATTTTCA GGAATGGTTA 25380

GTGGCATTGT AGTGACCTTC TTTGGCTTAA AAGTACCTGT GACAGGAATG GCAACACCGT 25440

GGGCTGGTTT ACTTGTGACA TTAGGTAACA ATGCCATTCA AACGACTTTA CTAGCAGTAG 25500

CGATTATCAC AGTGGTCAGT CTGATGTTTG GATTTTTAGG TGCGCTGGTC TTTAAACGTT 25560

ACAGAATTGA AGCGGTGGAT AATACACCAG AATTTGAAGG aACGACGGAA GTAACCAGTG 25620

AAAAAGAAAA AGAAGCAGTA AAAACCAAAG GAGGAGTTAC GTATGAACAC AGCTATGGCC 25680

ATCAATAAAC CTAAAAAAGT GAACAAAGTA ATGAACTACA AAAGTTGTTT TGACATTATC 25740

GGTCCTATTA TGATTGGTCC TTCTAGTTCG CATACAGCTG GGGCCTTAGC GATTGGCTTA 25800

GCGGCAAGAA AGTTATTCGG CGGAACGCAG AAAAAATTGT GATTAAATAC TATGAATCTT 25860

TTGCAGACAC GCATAAAGGA CATGGGACTG ATTTTGCAAT TATTGGCGGA ATTCTTGGAC 25920

TAGCAGCAGA TGATGCCAAC GTAACACAAT CCATTGAACT TGCGGAAGAA CAAGGAATTG 25980

AGATTCAATT TTTAGAAATG AGTGAAGAAA GTCCAGTGAA GCATGCTAAT ACAGCATGTG 26040

TGACACTCAG TGATGCAACG CACGAAATTC ATTTAACAGG AATTTCTGTA GGTGGCGGCA 26100

CTATCGAGGT GAAATACATT GAATTAGACG GTTTCAATGT CCAACTACAT GGTCCTTTAC 26160

CTATTTTGTT AGTCATTAAT CAAGAAGAGC AGGCGATGCA AGCTTTTAAA GATACGTTAC 26220

AAAAAAATAA TATTCAAGTG AATGCGGTGT CTCGTTATGT GGAAGGGAAT CAAATTTTAT 26280

TCATCTTTGA CTTAGATTCG GCACCAATTT CCTCGGTGAA AGAACAACTG TTTTCACTAG 26340

ATGACACATC GAAAATAATC TTATTATGAT AGGAGTTTGA CATATGTATC TTTCGATTGA 26400

AGAATTTATC GATAAAGCTG AAAAAACAGG AAAAAAAATC TCTGATTTAA TGATTGAGCA 26460

AGAAATGGAA CGTAGCCAAA AAAGCTATGA AGAAATTTGG ACACAGATGG GTCAAAATTT 26520

GGATGTCATG GATGCTGCTG TCAAACGCTC GCAAGAAGGA GCAGGCGTTT TTTCTCCTAC 26580
TGGGTTAACG


GGCGGTGATG 


>"* ^* rp rp rp rp 

CGGCTCbl I 1 


AAAAAAATAf^ 

AAAAAAAl AU 




r*f a a ta arrTT 
bL/WiAOu 1 J. 




ATCAGGTGAT 


TTAATGATGT 


CTGCAGTTCA 


AGCCGLACI A 


(jo 1 AL 1 AA I lj 


AAL) 1 LAM 1 y^v 


26700


AGCAATGGGC 


GTTGTTTGCG 


CAACGCCAAC 



CGCTGG I bLA 


Avj 1 obLALL 1 


1 ACLAVabbL* J. 


26760


TTTAACAGCG 


ATTAAAAATA 


CCTTGAATTT 


AACGCGIGAl 


7\ 7\ C T\ A ATT/"" 

L AALAAA 1 1 U 


^ rp rp rn /** rp rp ^ rp rp 


26820


TACCAGTTCT TTATTTGGAA TGGTCACAGC CAACAATGCG ATGATTGCTG GTGCTGTTGG 26880

TGGATGTCAA GCCGAAGTAG GCAGTGCTTC AGCGATGGCC GCCGCTGCAG CTGTTGAAGC 26940

GGCTGGCGGA ACACCACGCC AATCTTCTGA AGCTTTTGCT ATGGCTTTAG GGAATTTATT 27000

AGGTTTAGTT TGTGACCCAG TGGCTGGGTT AGTCGAAATT CCTTGTGTCA AACGAAATAC 27060

AGTAGGTGCA GGCAATGCGT TAATCGCTGC AGATATGGCT TTAGCTGGAA TTGTAAATAA 27120

AATACCAGCA GATGAAGTGG TGGAAGCCAT GAATAAAGTC GGCCGTCAAT TACCACGAGA 27180

ATTGAGAGAA ACGGGTTTAG GTGGCTTAGC TGGCACAGCG ACTGGCCAAC GAATGAAGAA 27240

TGAAATTTTT GaAAAAGTAA AATTCACCAT TGnTTAAAAT TTAATCATGA AGAGGaAtAa 27300

yGCTGGTTGA AAAATTCGCA ATGTGAGTGT CAGAAAGTAA ACCGGTnGGC nAGAGTTAnC 27360



(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 



CAGAAACTGA ACAAAGTGAA TTCCCTTTTG CGATTCACnA AAGTCACATT TTCAcGcAGG 60

tTTaAatCTt CCTGAAaaTT TtAAaGGTaT TTTTttGCTt GAaGAaaGTT ATTATaCTTT 120

AAATGGAAAA GAACGATTCA AATCAGATAT CTTTCTCTTT GAAGTAAATT CGTCAAGTAA 180

CGTTAAATTA AGTTCCATAA CTGTTCCTAA AAACTATGCT AATAAGAAAT ATGAAGAAAT 240

TGAATCTATT CCTTTTTCTG AAATGACCAT TTCAGAAAAA TTTGTTCCGA TTGTTTACAA 300

AAAAGAGAAT GAAACTTTTA TTGGTCAATC TGAAAGTAAA TTTACAGAGA AGAATCATTT 360

CATATTAAAA CAAACGATAT CTGAAAAAAA ATTGATAATC AAAGAAGAGA TAATGAGAGG 420

AAATAAACGG ATATTCGGAT TCGATAAACC TATCGAATAC GTAAGAGTTA AAGAAAATTA 480

AAAAAGCTTT TAAAAGAATA TAATTTAACA GTGAAGTAAT TTCTTTGTAA AATAGTAATT 540

ACTTCACTGT TATCATATTA ATAAAGATTT TCATCATTTC TGAAAATATC TTCTCTTTCC 600

AAGAAGAATG CTAATTCAGA TGTGTGTGGT TAAATTGATA GTCTGCAACA TTAAACATTT 660

GACTGATTCA CTTTCCAAAA TTCTATTATA CCAGTTAAAA TCGCTTAAAT AGAAATACAC 720

TTCTAGCAAC ATGTTCCATA GTAAAAGTAG CACTTGTTAA AAATAACTAA TCTGCAACTT 780

ACTATCCTTG AAAATAAAAT TCTCAAACAC TTTACAAACA ATTATTTTTT AGGAAGATGG 840

TTTGTTATTC TTATATTTCG AGAAAAATAA AAATAGCAAA AGATACATTA CGATACAACT 900
ATACTATCAA TACAAAATTA GATACCTC.VT CTGCCCTTTG TTCAGTGATA AAAGCAATTG 960

AGTTAGGAAT AATAAATGTt CACCTTCAAT AGACTTATTT ATTCAGTCAG CTAAACGCTG 1020

TTAGATACAG CTATAGAAAT CTAATTATAG ACATAGTTAG ATAAACAACT ATCCTTTATA 1080

AAAATTGGTT CCATGTCTTT AGTACA------A GAGTACTATC ATTATTTAAT TTAAGTAAAT 1140

AACGGATTTC AGAACTCATC GTTGATTCSA CGAGCACTCT TATTCAATTG GATAAGCGTT 1200

ATCTCCTCTT TTTTAAAGTA ATTTGTATTC AGCAACCTAG AATATTGATA GTGCCGATTA 1260

TTTAGACAGC TATCCGGATA ACTTGGAATA ACTGAAACTA GAGTAAAAAA TCTAATAAGG 1320

TGCTTTTAAA ATATCATCAT TTAAGATATG AAATCGCATA CCCCGAAAAA TATCTGCTCT 1380

CTTTAGTTCT ACAATTTCTG ATATCCAAAT AGTTCAATTT GAGTTAACGC TGGAAATGGT 1440

GATTCATCTT CTGCTTTGAT CAGCTGTTTC AATGTAGCAG TCGTTACTGT TCGTGGAGAG 1500

AAGGTGAAGT ATTGCGTTGC TATCGCATCA CTCGTAGTGA ATATTTCTTC TGAACCATCG 1560

GAAAAGGCTA TGGTTACTTG TTTCCAATAA CTATCATGGG GAAAATCAGC ACGTAATACC 1620

AATCCAAGCT TATCGAGTTC AACTTCTCGG CCAAAATCAA TTGTTATTTC AGCATCTTTT 1680

TGCTGATTGA TCCCCCAAGA TTGATAGGGG TACGATCCAT GATCTCTGTT GGCCAGAATT 1740

CCATCAATTG CATTTCTAGC AAAAAAAGTC GCATCGTTTC GTGTTTCAAC ATTTGCATAA 1800

GCGTGGGGAT AAACATTGGC CACATCTTTT TGATCATGAG GATTGAGCGC CCAGTTACGA 1860

TAACTAGTCT GTTCTTCAGC AAAAGCCTTT CTGACAAAAA AGTAATGCTT CTCGTCAAGA 1920

AAAGCCGCTG GTGGATAGGC TTCTCGCAAT TTTTCCGTCG TAGGAATTTC ATAGACCCAC 1980

TCGCCTTTTG ATAAATACAC TAAAGAGGAT CCTAAAGCAT TGTCTAGTTG AACATAAACT 2040

AATTGTTTTT CTGCGGTTAC AACTTTAAAT TTATCTCCTG CTTGAAAAGC ATACTTTTTC 2100

GTTGCCAAAA CTGCTTCATC CACTCCACTG CTGCACATAA CTACTTCATT TTCTTGATTA 2160

ATAATTGATA ATTCTATTTG CATACTTTCT TCGCTCCTTT TACTTTTTCT TTAAAAACTT 2220

ACCCGCAATA AATAAAAGGA GAGGCGGCCT TAACCGTTTC CTCCCCTCTT TATTCTCTTA 2280

AGGTTGTTTA CCAATATAAG CTAAAATTCC GCCATCAACA TACAGAACAT GACCATTTAC 2340

AAAGTCAGAG GCCTGTGAGG CAAGAAAGAC TGATGGACCT GCAAGATCGA CCGGATCACC 2400

CCAGCGTGCT GCTGGCGTAC GACCGACGAT AAATTGGTCA AATGGATGAC GTTCCCCGTT 2460

TTCTTGCAGC TCTCTCAGCG GCGCTGTTTG CGGTGTCGCA ATATAACCAG GACCGATACC 2520

GTTGCACTGG ATATTGAATT GACCATATTC AGAAGCAATA TTTTTTGTTA GCATCTTCAA 2580

ACCACCCTTT GCCGCGGCAT AAGCACTAAC AGTCTCACGA CCTAATTCAC TCATCATTGA 2640

GCAAATATTA ATAATTTTAC CCCCGCCTTT CTCGATCATC TCAGGAATAA CCGCTTTTGA 2700

CATAATAAAC GGCGCATTCA AATCAACATC GATGACTTGT CGGAAATCTG TCGCTGACAT 2760

ATCTGTCATT GGAATTCGTT TTATGATTCC AGCATTATTA ACTAAAATAT CGATTGAACC 2820

AACCTCTTCT TTGATTTGCT TGACCATTGC TTGAACTGCT TTTTCATCCG TTACGTCACA 2880
GACATAGCCT TTTGCTTCAA TGCCTTCTGC TTTGTAATTT GCCGATCCTT CATCGACTGA 2940

TTCTTGAGTC AAATTATTGA AAACGATCGT TGCTCCTGCT GCCGCTAATG ATTTGGCGAT 3000

TTCAAAGCCA ATTCCATAAA CTGCGCCCGT AATCAATGCG ACTTTCCCAT CTAAGCGAAA 3060

CATATCCATA TTAAAATCCA TCTATTCTCC CACTTTCATT CGTTCTATTT TAGTTCATCC 3120

ATTGGAACCA TATCCATGTC GGTATAAGTG ATATTCTCTC CACACATCGC CCAGATAAAT 3180

GAATAGTTGC TTGTTCCTAC ACCTGAATGG ATCGACCAAC TCGGTGAAAT CACGGCTTGT 3240

TCATTCCCCA TTACCAGATG CTTTGTCTCA GCTGGGTCAC CCATCATATG GAAAATTTTT 3300

GTATCTTCCT CCATATCAAA GTAAACATAC GCTTCCATTC GACGTTCATG CGTATGACAC 3360

GGCATGGTAT TCCAAGAACT GCCCGGCTCT AAAATCGTAT AGCCCATTTG CAGCTGACAG 3420

CTTTCACAAA TATTGGGATG AATATATTGA TAAATTTTAC GTTCATTTAA TGTCAATGCT 3480

TCCCCAGTTT CCATTGGCTT AATCTCATCA ATACTGATTT TTACATTTGG GTACTTGTGA 3540

TGCGCCGGAG CAGAACTAAT GTAAAATTTC GCAGGATCAG CTGCATCTTC TGAAGAGAAA 3600

ATCACTTGTC GAGTTTCTTT CCCGACATAA TAGCCATCTT GCTTCTTCAT CGCTTCTTTT 3660

CGACCGTCGA TTTCAATAAA ACCCGGACCG CCAATATTAA TGACTCCTAA TTCACGCCGT 3720

TCCAAGAAAT AATCGACACC TAATTCTTTG TTCAAAATAA TCTCTAACGA CTCAGTCGTC 3780

GGTGTTACAC CGCCAAAGAT CATGCGATCA TTGTGGGTAT ACGTCAAACT GACTTCATTA 3840

GGTACAAATA CTTTTTCAAC TAAAAATTCT TTCCTCAATT CTTCTGTTGA ATAATGACTA 3900

ATATCTTTTG GACTGTGCGT ATATCTCGTA TCCATATTCT GCATGTTTAT TTCCCTCTTT 3960

TACTTAATAT TTAACGAACG ATTCGACCAG AACCTGCTGT CAAAAATTGT TCGATCTCTA 4020

TTTCTGAAAA TTGATTACAA TCCCCGTGAA CAGTATGCTT CAATGCTGAT GCAGCCGTAG 4080

CAAATTCAAC GGTTCTTTGC CCGTCAAAAT CTGAGATCAA TCCATGCAAG ATCCCCGCCG 4140

CAAAAGCATC ACCTCCGCCA ACTCGATCAA CGATTGGATT TATTTTATGA ACTTCAGACT 4200

CATAATACGT ATTCTCCAGC CATAAAGTCC CTACCAACTG ATTGTTGCTA GCCGAAGAAA 4260

CGATCCTTTT CGTTGAATAG AAGCATTTAA TATTGGGAAA CTGCTGCTGC ATTTGTTGAT 4320

AATAATAAAC GAATCCTGAA TTCATCCGGT CTTCTGGAAT CTCCAATAAA TAACGCGCAT 4380

CCAATTCTCC AGCTGAACAA TAATCGACAT AAGGCAAGAT TTCTTTTAAT GCTTTCCCAG 4440

CTGCTGCCTG ACTCCATAGT TTCCCACGAT AATTAATATC AAAGCTGACT AACACACCAA 4500

ATTTCTTTGC GGCTTTCACT AATTCAACCG TCCATTGACA CCATTGCACT GACAAAGCTG 4560

CTGTAATGCC TGAAATATGA AATAGCTCAA TCCCAGAAAA AAGCTTTTCA TAAGACCACG 4620

GAAACTCTGT TACTTCCGCA AAGCTTGACC CCGCTCGATC ATAAGTTACA CTAGCAGCAC 4680

GTTCACCGAT CCCCGTTTCC ATGTAATAAG TTCCTAAACG CGcTCCGCCA AGTTGTACAA 4740

AATCACTCAT CACACCGTTT CGACGCAAAT GTTTCACTGC CGCTTGACCT AGTGAGTTAT 4800

CAGGCACTTT ACTTGCAAAA GAAACCTGAT GACCAAAATT GGCTAATGAA ATAGCTACAT 4860
TTGCCTCGCC GCCGCCATAG TGACAGATCA ATTGCTGTGT TTCGTTAAGA CGAACACCTA 4920

TATCCGTTGA AAGTCGCAAC ATAATTTCGC CTAAAGTTAG TATGTTTGCC ATAATTAGTT 4980

TTCCTTTATT TCTCTAAATT TGGCCATATA TTGTTTTGCC AAAGCCGTTA CTTTACCAAA 5040

CTCACCAGTT TTAGCGTGAG CTAATAAGTT TCCGCCAACA CCCACTGCCG TCACACCCGC 5100

CTCAAACCAT GACGCCATAT TATCCAGACT AACACCGCCT GTGGGCATAA TATTCAAGTA 5160

CGGCAAAGGT GCTTGGAATG CAGAGATAAT ACTCGGACCA TAGGTACTTC CCGGAAATAA 5220

CTTCACGATA TCTACTCCGC TCTTCAAAGC TGTCTGCATC TCAGTAATTG TCATACATCC 5280

TGGCAAGTAA GGAACTTGAT ACAAGTTACA AATTTCAGCT GTCTCCCGAT TAAAGCTAGG 5340

GCTTACAATA AATTCGGCTC CGGCCATTAT AGCTAGGCGA GCCGTCACAG CATCTAACAC 5400

CGTGCCCGCG CCAATCACGA TATCTTTCTC GTCATGATAT GTTTCAACTA ATGACCGAAT 5460

GACTTCATCT GCTTGAGGGA CTGTAAAAGT CAATTCTAAT CCAATCATTC CGCCCTCTAC 5520

TACTGCATGA CTAATTTTCA AAGCTTCCTC TTTAGAATCT GCTCGCAATA CTGCGATAAC 5580

TCCCGCATTT TCTAAGCGTT TCAATACTTC AACTTTTTTC ATTCGTCTAC GCCTTTCTTA 5640

TAAAGCTTTA TGGAATCAAA TATACGAGCA CCAGATTGAC TATACCGCGC TACTTCTTTT 5700

TAAATTCCTT CTTCTGGCAG TTCTTCTGTT tCATTGTTTT TTTCAAAAaG CmTAATATtG 5760

CTCTTTCCAG CCTGTACGAC TGTTTCAGCA ATTTCAAATG AATCCGTGAC GCTATTTCGC 5820

ATCATCAAGC CTTCTAATAG TATTCCTAAG TTTGTGCCAC TCACAACAAA AACTTGATCT 5880

TTTTCCTCAG AGAATTCAAC ACAGGTTTTA AATGGGGTAC CACCCAACAA ATCTGTGAAA 5940

CAAATAGCTT GATCAAACAT TTGAATTTCT GCCGCTACTT TTTGGCGATA CTCATCTACC 6000

GTCTCCTCGG CTAAAAAGGG AATAATGATC ACATCTTCTA GTTCTCCAGC AATCATCGCT 6060

AAAGCACTCT GCATCCCAAT CGAAAATTCA CTATGCCCAG TAATTAATAA TTTCATCTCA 6120

TTGCTCCTTT ATTTAATCCG AATCTTTGCT TCCTTTTGAT CAATCAAAAC AAGCCGACCA 6180

TAAAAAAATT GGCCATATTC ACTCACTAAT AGTTTATTCC CTTTGACAAT ATCTTCCCGC 6240

ATAACAAAGA GATCAAATTT TTCCGATCCG GCGATTATCC GAAATCCCTT AGCCTGACAG 6300

AATTGCTCAT TTTCTCCTGT TTGACTCACT TTTATTGGCG TAATTTGGAT ATCCTCTAAC 6360

GGAGAAATGA CTGTTGCTTG AATTTCTTTT CCCGTTTTGT AACAAAACTT GTTTGATAAG 6420

CGTGGGTGCT CATTCAGCTG GTTATAAATC TCTGAACCTT TTGCCACTGA TTGCTGTGTT 6480

TGGCCTCCAG CAAAAAGGAG TGTGTACTTA TGCTTGTTCG TAGTTAAAGC AAACCGATGC 6540

GCTTCTTTTT GACAATTTAT CGACGGTGCC AAATTATAGG TACTCGTAAT TTCAGTCTCT 6600

TTCTGTCCTG CAAAGCTATC AATAATAACT ACGGAGTTGA TCGACTTTAA ATAGATGAAG 6660

cTGCGCTCAA AAATCATTGG ATTCTGATCC GCCTTATCCA GCCAGCCGCA TTCTGCAAAA 6720

AAACCGACAG AAAGTTCTTT TATTTGCTGA AATAAGGGTG TCGGTAATTT GTCATAACCC 6780

CACGTATCGG AAACTAAAGT ATGGGGATTT TCTGCGATAA ACATCGTATT GTGCGAAgCG 6840
CACTCTTTTA GCTGAAGTCG CTCTGATTTG TTGACATAGC TGTAACGACC ACTATCGGAA 6900

AATAAGTCAT CCCCTTGTAG TTGTAGTGTA AATCCACCTG TAGAAGCATG ACCATGTGCG 6960

CTCCCATGCA GACCGTTAAA AAGTGTAAAA TAGATATCCT CTGCTTTGTA CGCCATCAGC 7020

CCACTTGATT CGCCACGAAA AAGTTCTTTT GGCTTCATTG TTTCCCAGAT TCTTTCTTCA 7080

TAAAGATCCC CCGTCCAAAG CCTTGCCATA TTCGCAGTCA TGGAGGGTTC AAATATGAAA 7140

CCTAATTTGC GATAGATATC ATATACGTAA TGAAAGTTGA CATGATCACT ATCATTGATC 7200

GGATTTAGGA TATCTTGGTT ATCCGCCAAA TAATGGGTAG AGAAAATAGG TGTTTTTAAT 7260

TTCATGCGAA GATCTAATGG TAACTGGACT TCAAGATATT CAGAAATCTG CAATAGATAC 7320

ACGAATGTCA TCAAAACTTC GTGCTGGTAC AGCGGGCTCT GCTCCCAATG AATTCCATCT 7380

GAATAGAATT GTAGATCAAG CTGTTCAGCT AAACGAGACC ATATTAGATC CCTCTGTTTA 7440

CTGGTCACCA GTTCTGGAAG AAATAAATCA ATAGCTGCCA TTCCACCAAT TGCCAACACA 7500

CCCCAATTAC TGAGCCTGTA TTTATCGATA TAGGACCGCT CCAAATAGTC CAGATGGATC 7560

AGCAAGGCGT TGTTCAACAC ATCATCAATT CCTAATAGTC TGAAATCAGC GATTGGAATA 7620

TACGTCAAGC TTTTCATCCA GTTTGTTACT CGAATCCCAA CATCTAACGG ACGCCAAACA 7680

TCCCTATTCG TCGAATTTGG CTCACCCTCA TCGTTAATAA AATCAATAAG CAAGCTGTGC 7740

CATTTCTGTA AGTAACGTTC TTTTTTAGTA AGTGCATATG CTTGTGCCAG ATCTACAAGA 7800

AAGCTTTGAC GACTCAACAT AAAGAGCCAT TCTGGATCAT CATCAGGATA CCGATTCCAC 7860

GCATACTCTT TTAAAGAATA AGGAATGGAG CAAGCTTCCA TATCCATTGC ATCATTGTAA 7920

ATAATCTGAT CTTTCAGCAA ATAAGAAGTA CGTTCAAAAA GTGGCTTCAA AAATTGTGAC 7980

TGTTGCTCCA AGTACTCTGA AAGCCAATCT TGTTGAAAAA ATTTTAGTTG GTAGTCTTCG 8040

ATTCTTTGAA GATTCATCGT GTTCCCGCTT TCAAATTTTT ATTCGCCTCA GGCTTAAAAT 8100

TACAGTATAC TTCCTCATTC ACTCCGAAAA GTGACAAATA CGGAGACAAG AATAATTTTC 8160

CAAAAATTAA AAATCTTGTT TCTGATTACC CTCGTTTACT ATGCTAAAAT TCCTAGGACA 8220

GACGCTGCTA TCCCAATAGC AAATAGTAAG ATCAATAGTT TATACGTGTT CCAATTTCTC 8280

TTCTTGATCA AATAAAACAC TAAGATAGTT AAAATTACTG GAAGCATTTT AGGCATAATC 8340

TTATCTAAAA TATCCTGAAT AGCGATTACT TGTTTTTCAC CGGATTCTAC GTGATGCGAA 8400

TACTTTATGG CGATGTTGGC CTTAACAAAT GATGTTGCTA ATGCCGCTAT AACAGTTACT 8460

CCGACAATAT TAGCAGCCGA AGAAATTTTC CCGATCTTAT CACTCAATGT TTCGATGACA 8520

CTCGTTCCAA GTTTAAAGCC TAAATAGCCC ATTAATAACT TGATAATCAA CATCGAGATT 8580

AGCATTGAAA ACCAAAAGCC CATCGGTGCA AAACCCAAAC CATCCATTGC TAACCCGGCA 8640

AAGATTGTTG AGAATAAAGG AGCAATCCCA AATTGAGCGA TTGAGTCTCC AATTCCCGAT 8700

AATGGTCCCA TTAGCGCCAT TTTGATGCTG CGCGTATCCG TGACACTTTG TCCATTGTCA 8760

TACATTGCCA ACTGCATACT AGTAATGAAT GGCAGCATTT GAGGGTTGGT ATTATAAAAC 8820
TCTAAGTTTG CACTCGCCGT TTCTTTCAGC TTCTCTGGTT CATCTTTATA GATTTTCTTC 8880

AGCGCTGGCA TGATCGTGTA TAAATAACTT ACCCCTTGAT AAGATGTATA ATTAAAAGCG 8940

TTCTGTAACA AATACGACCG TAGTGCTGTC TTCATGTAGT CTTTTTGGCT TAATTTTGGT 9000

GTCCTAGATT CCATTACTAA AATCCTCCTC TTCCATCAAG GTGGTTTGGC TTTCGTCTCT 9060

TATGCTCTTT TCTCCAACAA TCGTTTTATC TTTTCCATAA TATTGAATCA ACGCAAAGAC 9120

AGTTCCTACC ATTGCGACAG CCATCGTAGG CAGCTCTAAA TAAGCTGCAC AAATATAGCC 9180

TAGTAACACA TAAGGTGTCA GACTCTTTTC GAGCATCACA GACATGATCA TTGCAAAACC 9240

TATCGCTGGA ATAAGCCCGC CAGCAACAGA AAAGCCATTG ATTAGCCAAT TTGGTAACGC 9300

TTCAACAAAT GCCTGCAATC CAGGACGACT CAAGGCCGCT AATAGTCCAA TCATAAATCC 9360

TGCAACGGCA AAAGCTATAT ATGTTGAATT AGCAATCAGA TTAAAACGTG TATATTTCCC 9420

TTGCTCAATT ACATTCTTCG ACCATTTAAC ATTCCCAGAA AATATAGTAT AGATCGCCGT 9480

TGTTACCAGT TGGAACAATA CTGCAAAGGG AAATGATAGT GCTAACGCGG ATTCAGGTGT 9540

TACCCCTTGA TGTTTAAGCG TAATGGCCAT AATCGTACCA ACGATTCCGG GTCCAACTGG 9600

ATTGGGTGGT ACGGTTCCGC CTGCTCCAAC ACCAAATCCC ATATAAGCCA ACTCGGCTAA 9660

AGCACCAAAT GTTAAAGCCG TTGGAATGTC ACCCAAAACA ACACCTACAA AAAAGGACAT 9720

AACCAGTGCA CGGTTTGTGT AAATTCCTAA TAGCTGCCCC GaATAACAAA AAGCTGTGAT 9780

CAATCCGATC AATATACCTT GAACTAATGA AATCTCCATC TGCTTCTCCT TCTCCTTCTC 9840

CTTCTAATTT AATTAGATAT TTTTTCCATC AATACATCTA AGTACTGTGA GCCATCATTT 9900

GAAAGTGGTG AAGTCTTGGT GTTGAAAGTT ACACCATAGT TATCTCGCAT CAAGCAGAGA 9960

GCTTGCTTAT CTTCTTTACC TAAATAGATA AATTGAGAGA TTTTCTCCTT GCCTTCGTCA 10020

GCATGAATAT TCCCAATATT TAACTGCTCC ATTGGAAAGC CCAATTTGCA TAGTTCTAAG 10080

GCATCATGCA AGTTTCCAAC GACTACAAAA ATTGTTTGAC TAGGGGCTGC TTTCCAAATA 10140

ACTTTTGCCG TACGTTCGAT CGTCCAAAAA CGAATATTTG TTTCTTTCGG GACTACCGTT 10200

TTCATCAGCG TCTGCTGTAA GCTATCCTCT GCCGCTTTAT CATTGGCACA AATAACTAAG 10260

TTCACTCCTA AACTCTTGAT CCATAGTTGA CCTTGTCCAT GTATCAGACG TTCATCTACA 10320

CGCACCATTT TTACATTGGG TTTTTGCATT TTCTTTCCTC CTGTTTTTAC CAGTACATTT 10380

TCCAATTTTT TTTGAATCGC ACTAAAGCTT CTAGATAGAA ATAATCACCC CAAATATTAC 10440

CTTCATTTAC GCCTTTACCA CTATGCCAAG AATAAACACC CTCATTCAGT AAAGCTGTAA 10500

TTCCGTCTGT TTTGGATTCA GTATAGTTTT CAGATAAGCT TCTTAGCATG GAATGctGCG 10560

CACACTTGTA AACCAACTTA TGACTGTTCG TCTCAGGTAA AAACGCGTCC ATTAGACTCA 10620

TACCACAAAC AGCAATCGcT GTCGCAGAAG TATCTCGGGA CTGTTCGGAA CCATCCATAA 10680

AAATCAGATC CCAATAACTA ACAAAATCCT TCGGcAAGCG ATTCAAGaAA TAATTCGTAA 10740

TCGCCtCAAA AATTTCTGTA TTCTCTTCCT CTGGACAGTA GGAGCGGAAC AATGCCAAAC 10800
CATaAATCAA CCACGATTGT CCCCTCGCCC AGCTTGAACT ATCGGAATAT CCTTGGCGGG 10860

TTTTTcCATA CAAAGGATCG CCCGTTTCAG CATCGAaGAA AAACGTATGG TAAGCAGATG 10920 

AATCTTCTCT AATCGCATTA TCGATTGTCG TATGATAATG TTTAACGGCT ACTTCACGAT 10980 

AACGTTTATC TTGAGTCACT GTTGACGCCC AAAATAATAA TGGAATATTA AGTAAGCAGT 11040 

CTACGATCAA ACGATAGTTA TCTTGATTGC CCAATTCTCC CCAAGCTTGG ATAAATTCTC 11100 

CCTTAGATTG GTAGCGCTGC AATAATTTTT CTGCTGCACT TATCGCGGCA TTCTTAGCAG 11160 

CTTCATTTCC AGTCAACTTA TACGCACTTA CACAGGATAA GCTGTATAAG AATCCCAAAT 11220 

CATGATGCTC GACTTCAATT TCATTTTCAA TTCGGTAAAT AAAACTCTTG ACATTTTGTT 11280 

CTGCAACTTC ACGGTAATGC TTTTCCCCAG AAACTTCATA GCACAGCCAT AATATTCCTG 11340 

TCCAAAAACC GTTTGTCCAT TCTGTATTAT CCATAGCTAt ATACGTAwTG TCCGTTGkTG 11400 

ckGGTGTTGG GAATTaTCTC CCAGTTT 11427 
(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

CCCTGnGACT ATTTTTCTTA TCTCTTTTAT AAATAGAAGA GAAAGTATTC TGATTTTGTG 60 

TTTTAAGCTT TTTATGCTAG AATAACGCCA GAACATAAGG AAAGGGACGA GwCAwTTGGA 120 

ACCCATTTAT TTAGATCATG CGGCAACAAC ACCGCTTCAT CCAACGGTCA TTCAAGCGAT 180 

GACTGAAAAT ATGCAAACGA CTTTTGGCAA TCCATCAAGT ATTCATCAGT TTGGACGAAA 240 

GGCACACGGT CTTCTGGAAG AAGTGCGTCA AACGATTGCC GAGAGCTTAC AAGCCAAACC 300 

CCACGAAATT ATTTTTAATA GTGGTGGCAC AGAAGGGGAT AACACAGCCA TCTTGGCAGT 360 

TGCTTTTTCT CGTCAAAAGG AAGGAAAACA TATTATTACT ACGGCAATTG AACATCCCGC 420

TGTGTTACGA ACGATGGAAT ACTTAGAAAC GTTAGGCTTT GAAGTGACTT ACTTGCCTGT 480

GAATGaAAAT GGTCAGATTT CAATGGATCA ATTCAAAAAG TCTTTACGCG AAGAAACGAT 540 

CTTGGTTTCA ATGATGTATG GCAATAATGA AATTGGAAAT CGATTACCGA TTGCTGAAGT 600 

TGGTGCAATT CTTAAAAATC ATTCGGCGAT TTTTCATACA GATGCCGTCC AAGCTTATGG 660 

AAGTGAAGTC ATTTTACCTC ATGAATTAGG GATCGACTTA TTAAGTATTT CCGCTCATAA 720 

AATCAATGGT CCAAAAGGCG TAGGTTTTTT ATTTAAAAGC GATGCAATCC AATTACCACC 780 

TCTTTTACAT GGTGGGGAAC AAGAAGAAAA ACGACGTGCG GGCACGGAAA ACTTAGCTGG 840

AATTATTGGC ATGGGTACTG CCGTTTCATT ACTAACTTCT GCAGAAAAGC AAGCAAGAAA 900 

AACAGCCTAT CAAAGTTTTC AAACGATTAT TTTAAAAGCG TTAGAAGAAG CCAATATTGA 960 
TTTTtCmATT AATGGkGaAC CAACCAATCG TTTAGCGCAT GTCTTAAayC TTCATTTAAA 1020

nGGAATcCCA GTGATTTGCT CTTAATGCAT TTGGACTTAA GAGGTATTGC GATTTCAACA 1080

GGCTCGGCTT GTACAGCGGG AACTGTGGAT CCTTCGCACG TGTTGACAGC GATGTATGGA 1140

GAAAATTCCT CAGCTATTAA AGAATCAATC CGGATTAGTT TTGGCTACGG GAACACACCA 1200

GAAGAAATTG CGACTTTTTC TGAAGTACTA GTTGCAGTGA TTCAACAATT GAAAAAATAA 1260

TCTGATTGCG GCAGCTACAG GTTTTCGCTA TAATGAAGAT AACAATGAAA ATAATGAGGT 1320

GGAACCCATG GCATTTACAA CAACGGCTGC TGTTGAAGGC TCAACAGTTT TTTACAAAGT 1380

ACATCCTAGT GCAAAACGCT ATACATTAAA AGATAACGGC TTTACTGAAA CAAAATCAGG 1440

GAACTTTCAA TTGATTCGTT CCTTAGATCC AACGCCTCAG AGAAATGAAG GCTTCAAATT 1500

GAAAATTACG ATCACTGCTG ATCTAAAAGA GTTGAAAATG TCCATTACAA CGGCAAATGG 1560

TTTAAAACCA ATGAATATTT TcAAGAATGA ACAACATGAA ATGAGTAAAG AAAAATACTT 1620

TTTCTTAATG GATGGCTTAA TTAGCCGTGG TGTATTAGAG AAAGTAGAA 1669

(2) INFORMATION FOR SEQ ID NO: 167:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19031 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:

GGTGAGTGTT CTAGCAGTGC TGTTTGTGGC AGCTGCTTGT TCCTCAAAAC CAGAAGCGAA 60

AATTAATACA GAACCATACT CTGACCGTCA AACAATGCTT GGTACCTACG TTCAAATTCG 120

CATCTACGAT GATGGCAAAG AAGACGTCTT ACCGAAAGCC TTTGCCCGTG TCAAAGAACT 180

AGGCGATAAA ATCACCGTCA ACCAACCAGG TTCAGAAATT GATGAArTCA ACCAAGAAGC 240

AGGCGTGAAA CCAGTCAAAG TCTCAGATGA CTTGTATCCT TTGTTGAAAA AAGCCTATGA 300

ATACAGCAAA GACTCCCGCG GCGGCTTTGA CATGGCAATT GGCCCAATCA CCTCTATGTG 360

GCACATTGGC TTCGATGATG CCCGCAAACC AAGCCAAGCC GAAATTGATC AAGCCTTGAA 420

ACTTGTCGAT TACACGAAAG TCAAATTCAA CGACAAAGAG CAAACCGTTT ATCTTGAAGA 480

AAAAGGCATG CAACTAGATT TAGGCGCAAT TGCCAAAGGC TTCATCACCG ATGAAGTCGT 540

GAAAGTTCTG AAAGACAACG GCGTTACCAC CGCTATTGTC GATTTAGGTG GGAATGTTTA 600

CGTCTTAGGA CACAGCCCGC GTGGCAAAGA CATGGATTGG ACAGTCGGGA TTCAAGATCC 660

GAATAAAGCC CGCAACACAG TCTTAGGTCA AGTCAAAGAA AGCAACAAAA CCTTAGTCAC 720

ATCAGGAATT TATGAACGCT ACTTAAAAGT CGATGGCAAA ACCTACCACC ATCTCTTTGA 780

TCGTGAAACT GGCTATCCGT TTGATAACGA CATCGCGGGT GTCACCATTA TCACCGACAA 840

ATCCATCGAT GGTGATGGTC TCTCAACAGC CGTCTTCTCA ATGGGCGTCA AAAAAGGACT 900

AGAATATGTC GAAAGTCTGA AAGGAACCGA TGCCATTTTC GTCACAAAAG ATGACAAAGT 960
CTACATCAGC AAAGATATCG AAGGCAACTT TGAAATCGGC AAAGACTCTG GCTATACGAT 1020

GGGGAATCGG GCCGATTTGA AATAAGGAGC GACTAAAAAT GTCTTTAAAT GTGTTTTTAA 1080

AAGTAGTGGA AATTCAGACG AAGCTTGCCA GTCTCTTTCC CTTCATTATC GGGGTCTTGT 1140

TTTCAATGAC CTATTTCGGG GAAGTTCAGT GGGGCAACAC GCTGGTTTTC TTCATTGGCA 1200

TGGTGGTCTT TGATATGGCC ACTACCGCCA TTAATAATTA CATGGACTTC AAAAAAGCCA 1260

AATCAGACGT CTATAAATAT GAAGAAAATA TTATCGGACA ATCAGGCGTT TCACCACAAC 1320

TGGTCCGCAA CATGATTTTC GCCATGATTG CCTTCACTGC CGTTGTGGGT GCGTACTTGA 1380

CGGTTCAAAC CGGCTGGCTC TTCCTTGTTT TAGGCGGGAT TTGCTGCTTC ATCGGGATTT 1440

TCTACACGTA TGGACCGATT CCGTTATCTC GCATGCCCTT GGGGGAAATT TTCAGTGGCT 1500

TCACGATGGG CTTAGGCATC TTCGTCTTAA CCGTTTACTT GAACGTCGTC ACCAATCCGC 1560

CTTTTTACTT AACGGTCGAT TTTGCTAGCG GCGCCTTTCG CTTAGACGGC AATCTCTGGG 1620

CGGTGCTTGC GATTGTCGTG GCGTCCTTGC CCTTGGTCTG TTCGATTGCT AACATCATGC 1680

TCGCCAACAA TTTACGGGAC TTAGACACGG ATATTGAGAA CCACCGTTAT ACATTAGTCT 1740

ATTATATTGG GCGCCCATCG GGGATTGTGC TGTTTCAAGT GTTAGCGCTA sctGGGTATC 1800

TGGTGATTTT ATTCGGCTTC ATTAGTGGTA TTTATCAATG GCCTGTGTTA ATTACATTTG 1860

CCACTTTACC TGTCATTTGG AAAAATATTC AAACGTTTAA ACAAGAATTG CCGCAGCCGA 1920

AAAGTTTTCG CCATTCCATT AAGAACTTAA TGGTTTTCAA TGGCACCTAT GCTTTAGGGC 1980

TATTCATTAG CGCCTTACTT GGTTAGTCTT GCGAAAAAAT AGCGGGCTGG GCGAAAAAAA 2040

TACTGTTTTT TTTCAGAGGC CTGCGTCAAA ACAATCAAAC AAAGCTCAAA ATGTTAGTAT 2100

TCGTTGCTAA CATTTTGAGC TTTTCTTGTT AAAAAGAGCG GAATCTTAAA ATTTTGTCAG 2160

GGGATTTTGG TAAAAATTTG CCGAAATCGC CCTTTTTTTG AGCCGATTAT TAAATAACAG 2220

CTGTTGTAAC TCTTGTCAGA ATAGGAACCT TCAAAATTTT CGCATGGGGT GCGAAAAAAT 2280

CGTCAATTTC TGCTGGGGGT TTCTTTGTTA TTATAGAGGA GTAACAAAAG GTTGTTTTGT 2340

CTTTCTTGTG TAAAAGGGCA AGAAAGGCTA GCGAGTTAAA AGGAGGTTTT TCAATGAAAA 2400

AAAAGCGTTA TTTAATGATT GTGTGTCTAC TATCTTCTCC TAGTTTTTTT ATAAATGTTG 2460

AAGCGTCTGA TGGTGGTTCT AGTTCGGTGG GGATTGAATT TTACCAAAAT CCGAGAACAC 2520

CCGCTCCTAA AGATCCCCCA CCGAAAACAG ATGCGCCAGC TGCTGATCCC AAGGAACCAG 2580

CTGGTCCTCC GCAAGGAGAT CAACGAAgTG GTGGTTCGAC ACAGACCACC ACAACTGGCT 2640

CAACGCTCCC TCGTACAGGG AGCAAGAGTC AGGCAAATTT GAGCATTCtC nGnTTCGCCT 2700

TAATCGGTTT GGCGGGAATC GTACATAGAA AGAAGGGACG ACATGAAGCA AACTAAGTGG 2760

CAACGATTAG CAACCATTGG CTTGTGTAGT TCTTTAGTAA TTAACGCCTT TTCTGGTGTG 2820

ACGGCAGTTG CGGAAACCGT GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG 2880

GAAGAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT 2940
CAAGTAACAA CTGACACGAG TCAGGAAGAA GCAACGAAAG AAGCGGAGAA AGAAGAACCG 3000

CAAGCAGAAG TGGAACAAGC AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG 3060 

AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT 3120 

CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAGCT ATGCCAAAGG ATCATGGAAG 3180 

ACAGATGGTT ATAATCGAAA GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA 3240

AACTTAAAGA ATGTAATAAT GCCAGCAACT TCTGTAGTTA TGCCACCAGG ACCGTCATAT 3300 

GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA 3360 

GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA 3420

CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAGTGC ATTGGCGCTG 3480

ACCAGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA 3540 

GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA AGGTAATAAC 3600 

ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG CCGTGTCACC 3660 

GAAAACTTCG TAGATACAAG TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGGG 3720 

AAACAAACGG TCATTAACAG TGATCCTTAC ACGTTCCAAC AAAGTGGTTT TTTACCCGAG 3780 

ACCTACAAAG TTGGCACGAA ATCTTACCGA TTCAAAGGCT GGTACAAAGG GAAAACCAAA 3840

ACCGAGCCTT TGGCCACCAC TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT 3900

GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA 3960 

TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAAGT GCAGATGAAG 4020 

TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG 4080 

ACAGATACAG CGACACTGAA AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC 4140 

GTCAATACGT TTTATGGCGC GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA 4200

ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT 4260 

AATATAGATC AAGTAGAAAT GTCACACATG CCTGTGACCA CTTATAACAA GTTGAAACAG 4320 

CTGTCGGGCC AAACGTTTGG CTTTAATGCT TTAGCCGATC AACCTGAATT TTATACGAAA 4380

ACGTTATTTG GGACAGAGTC TGGCATCGAT GACCCAGTCA ATTATTATAC AATGAGTGGC 4440

CCTGTTTACT ATTATTTAGA AAACCGCAAA GTCACCGAGA aCTTCGTAGA CACCAACGGC 4500

GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCGAC 4560

GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC 4620

TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA 4680

GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGTGGT GTATGAAGAA 4740

GAAACAGTTA CGACAGTGTA TCCATCAGTC GATATGAACT TTGTGAATGA AAAAGGCGGG 4800 

GCTTTCACAC CGGCGTTAAC TTTTAGTGGT AAGTACTATG CGCAAAGTAC GAGTGCGTAC 4860 

TTAAGAACCG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA 4920
AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA 4980

CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATTAGC CATCGACCAA 5040

CAACTAAAAT ATGTTGACAG CATTCAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC 5100

TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA 5160

GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT 5220

TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA 5280

GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAAc CTGCGGATAA GTCAAAACTT 5340

GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG TGGATGTCAA CGGTGCCAAA 5400

ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC 5460

AAGTACACAG CGGCAAAAGC TTTACCAGCG ACGTATACTA CAGGTGGCAA AGTCTATACG 5520

TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA 5580

ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA 5640

CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA 5700

ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA 5760

AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA 5820

GGAGAAACGA CAAAaTCAAT CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA 5880

CCAAATGCCG TTCcTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG 5940

AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA 6000

GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA 6060

GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA 6120

XAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG 6180

CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA 6240

GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC 6300

TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT 6360

AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT 6420

AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA 6480

GGTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AGGTCCTTAA 6540

AATAGTTAAG TTATCATGAG AAAAGCTCAA AATGTTAGTG GTTATCACTA ACATTTTGAG 6600

CyTTTTGGTT TAAATATTAA GGAAAAAACA GTTGGAGTTC ATCAAAAGAG CGTTTAAATC 6660

CTTGATTTTC GCTAAAATAA CACTTATTTT CAAAGGGAAC AGAGGCGCTC AGCCTTGAAA 6720

GGATAGGAAT CACTAAAAAA CGGCACTCAA TATGTCAAAA TTTGAAATTT CAAGCTGTGT 6780

GTTCTTTGGT AAAATAwATA wAAAAATGCT AGTTATCAGT ATCGATAATA ACAGGATACT 6840

GATTAAGAAA GGACTTTATA GAGACTATAG ATTGAATTTT TACATAGAAA GAAGGAGCAA 6900
GATGAAGCGA GTAAATTGGA AAAGATGGCT AGTTGTTGGG TTAAGTTGTT CTTTGTTCAT 6960

GGATTCAGTG GTTGGTGTGA CTGTGTTAGC GGAAACGATT ACTGGGGCGA CGGAGCAAGG 7020 

AGTAGCAACA TCTCAGTCGA GTGACGAAgC GAGCCAGACG aCGCAAACAA CCGaAGAGTC 7080 

ACAGGCAACG GTCGCTAGTG AAGCGAAAAC AGTACCGCCA CAGGAAACGG CAAGAATTGC 7140 

TTCTCGAGCG ATTGGTTATT CTTCTGTGGA AGGGCGCGAG ATTCCCTTTT TCTTTGTGGA 7200 

GGAAGACGGG ACGTTGTTTG ATCCCGACCG AATTACGATG GCGGTCAATC TTTCCACGTT 7260 

TTCGTTTTAT GAAGAGAAAT TACAACGAAC CCCCCTTGAG CCCACCACTG TGAATGGCGG 7320 

AAAGTTACTG TCTATTCCAA CGTCACCAGC TTTTAAATAT GATACAAATA ACCAGAATCC 7380 

AAGTAATATT TATGGCGTTT CTGAAGTGTC GTTTACTATT CCTAAGGAGT ATCAAAGCCT 7440

GGACATTCGA CCAAGTACGT TTTATACAGG AGACACTACG CAATATCCAG TGCCAACGGT 7500 

TTTTGCGAAC GTTGGGGGCA AAGTGACGAA CTATGTGGGC GCCAATGCGG AGACGGAATT 7560 

AGAGTTAACC AATGAAAAAA TGCCCAATAA GCTGACGTTT GGTCCTAAAA AGACGTTTAA 7620

ATATACGGTA GCTACGGCAC CAGGAGGCGT TACGTATGCG CTGACCTATT TTTATGGAGA 7680

TGTCGGCGGT CCAACTAGTT CGCACCAAAG ACGAGGAACA GCGGGTCCTG TGTATTATTA 7740

TTTAACAAAG CGGCGTGTCA CGGAAAAATT TGAGAATCCC GCAGGCGGGG CGATTCCTGC 7800 

GCCAGAAGGT TATACGCAGG ATAAGAAAAC CATTGTAACA GGGGAGGATT TTACTTTTAC 7860

CCAAGAAGGC ACCTTGCCTG AACGTTACAC AGGCAGTGAT GGGAAGACGT ATTTATTTAA 7920 

AGGTTGGTAC AAAGGGAATG CGAAACCTAG CACGTTGGAA ACCACCAAAA CGCCTAGTTA 7980 

TGCGGTGACC TATGATGACA ATGACGATTT GCATGTGGTC TATGAAGAAG CAGTGATGAA 8040 

AACCTATACG TTGCCAGCGA GAGAAGCTTT GTTCGGCTAT GTTGATGAGC AAGGAAACTT 8100 

GATTAATCCC GCCAAGTTTA AGCTAAGTGC GACCATGGGT GAAAGTGACG GAGCCACAGG 8160 

GGAAATGACG ACTTTTCCCA CAATTGATGG AATCGATATG CCAGCAAGTC AATTAAAGAA 8220 

ATTAGCCATC CCGCAAAAAG TCTACACACG CCCAGACGAT GGGACAATCG TAACTTATGG 8280 

CCCGCAAGAA GTGAGTGTTG AAATTCCTAA GTATTACCAG ACGATTTCGA TTTCACCAAC 8340 

TACTGCGTAT ACAGGGGATA AAACCAAGTA TCCAGTACCA AATGAAGTGC GCCGTGGCAT 8400 

CGAAAACCCC GACAACATTG TTAGTAGTTT aGTGGGAAmC mCTGCGTATA ACTTGACCCA 8460

AAAAAGTGCC ACACGCTATA CTGCCCGCCG TTCTTACTGG AmGTGGGGCC CCACGAAGAC 8520 

ACTTTACTCA ATGAGTATCT ATTCAGGAAC TGCTGGGGGC AACTATAATT TATCGACCCC 8580 

TGATGGCACC ATTTATTATT ACTTAGAAAA TCGGCGGGTC ACTGAACATT TTGTAGACGA 8640

AAGTGGCGCA AAAATCACGC CACCAACTGG CTTTACACAA GGAAATCAGC TAGTGGTGGA 8700 

CAGTGAAAAC TATGTCTACA CTGTCGCAAA AGCTTTGCCG AAGATCTACC AAGCTGGTGA 8760

AAAAACCTAT ATCTTCCAAG GCTGGTTTAA AGGCAAAACC AAGCCAGCAA CATTAAAGAC 8820 

GACAACGACC CCAAGTTTTA CACCAACTTT TAATGATGAG GACGACATGA CCGCTGTGTA 8880 

CCAAGAAGCG ATTCCCACCG CGGaAcTAAC GTTAACAGGT GCCGTTGACA TAATCGAAAA 8940

TGGCGCCACA ATGGATTACT GGGAGGCGCT ACTGAAGAAC ACAGGCGAAg cGCCGTTAAC 9000 

CACCATTAAA ATCAAGCCAA CGGCAACTTG GGCGGCTGGC ATCGGCGCAC CCAACACGAT 9060 

ATTTGTACAA GGAACGGGTC AAAACACCAA AGCTTTTCCT GTCACCAAAG AACAATGGAC 9120 

GACCGGTGCA GGAGTGTCCA TCACGTTGGA TCAGCCTTTA CCAGCTGGCG GTCAATTAAA 9180 

AATGAACTTA TTAGGAACCG CCGTTACAGG AAATCCTGGT CAAGTTTTAA CCGCTGATGT 9240

TGAAGTAACG GGCAACTTTG GCAGTTTAAC TGCCAAAGAT ACGGTCCGTA TTAAAGACTT 9300 

AGATCAAGAA ATTACGAGTC CTGACGGCGA CGGCTTTATT AGTACCCCGA CATTTGATTT 9360 

TGGTAAACTA GCAATTTCAG GAAGTAAGCA ACAATATGGT TTGAAGAAGG CCGCAGATTA 9420 

CTACGGCAAT GGCACTCGCA ACCCTTATTT ACGCCTGAAT ACTAGCCAAG CCAATTGGAG 9480 

TTTAACGGCC CAGcTATCGC AACCAAAATC AGCCACAGAC AGCTTGCCAA CAACGACCCG 9540

CTTGTTGCTA GGAACGGCCG CTGCTGCCAG CTTTACCGAT TACAACCAAC CAACAGAAAC 9600 

CAGGACACCA CTTGGCAAGA CCAGCACCGT GACTTTAACC GCCGACAATA CCGCAACAGC 9660 

GGTGGTCGCA AACCAACAGT TCACAGGCAG TGACGTCTAT CAGTTGGACT TCACGTTTGC 9720 

TAACATCAAA CTAGAAGTGC CAGCCAACCA AGGTATGGCT GGCCAACAAT ACCAAGCCGC 9780 

CGTCACGTGG AATTTAGTGA CTGGCCCCTA AAATAAAAAT GCCCGAGCCT TAGAAAAATA 9840

AGAGGCTCGG GCATTTTTAT TGATGGTCTT TTGATGGTCG TTGTGCTATA ATAGCATTGA 9900 

AAAGAGAGAT ATGCTGTAAC ATACCTCTCT AGTGTAGAGC CGTTTAAGAC GGTGACCGAT 9960 

TTTGTTACAA AAAATAACCG TACTCAGTCA AAGTAAACGG TTATTTTTTC TTGTCTTCTT 10020 

TAACAACCTT TAGGATTCCG AAGATTAAAG TCGCGATAAA AGAACCGAAA CTAATCATCA 10080 

GAGCTAAAGC TTCTGCGATA GACAAAAGGC CTCTCCTTTC ATAGATTTTA GTACTTACAT 10140 

TCATAAGCAC CACCACCTTT CAGATTGGAT AGCCACCGCC ATAACTTCTC TACACTTTTA 10200 

TTATACCATT TAGAACTAAA AAATAATTAG TTTTCAATAT ATTTTTTTAG GAAATTGTGC 10260 

TATAATAGCA ATGAAAAGAG AGATATGCTT CAACATACCT CTCTGATGTA GAGCCGTTTA 10320 

AGACGGTGAC CTTTTTAGTT ACAAAAAATA ACCGTACTCA GTCAAAGTTG ACGGTTATTT 10380 

TTTATTGTCA TTTTTAAGCT TTAAAAGATT TTCCTGTTCC CTAATTGTTA AGAATAGAGA 10440

AGAAAAGCTC AAAATGTTAG TTCTGATTAC TAACATTTTG AGTTTTTTTC AATAAATATC 10500 

AAGAAAAATC AGCGGAAATT AGAGAGAAAA TGGTGAATTC CACTGATTTC CAGGCTGTTT 10560 

TTTACCAAAA AACAGTGGCT CCTCCTTGGT ATAAAAGGAA TCATTGGAAA TATCAAGTTG 10620 

TCGTATCAAA ATTTAAAATT TCAGTGGGGC CTTTTTTTGA TAAAATGAAA ATCATACAAA 10680 

AAATCATACA TAGGACAATA AAAATAACAA TAAAAAGTTA CATAAAAAAG GAGAATGGCA 10740 

TGAATCAAAA AAGAAACCGT TTATTTGCAA TCGCTTTGTG TGGTTCTTTA CTAATCAGTT 10800 

CATTAAATGG CATCCCAGCA ATTGCTGAAA CCATGGCGGC CCCAGATGTT CAAGGGGAGG 10860 
ACCGCCAAAC AGTAGTCCCG TCAGAGGGGG TTCAAGGACG GGCGCCCGTT TCTTTTCCAG 10920

AGGCAGATAC GGAAGTnAGc GTACCGCCAG CTGATGAAGA GCTACCAGAA CAGACTGAAC 10980

AAGTAGTCGG GCAAGTTGAA GCGTCAGTGA CGAGTCCAAT AATAGTTAAT ACACCAGAAT 11040

CACTGCGTGC GCAgCGAGCA GTGCAGACAT TCGATATTCC AGAGAAGACG TACAAAATGT 11100

CGTTTTTAGA TGAAAGTGGT CGTTTTATTG ATCCAGCGAA AGTTTCGATT ACTGGGGATA 11160

TTATGAAGTT TACTATGCAA GGCGCTTTGA CGAAAGTCGG GGCCGTGGCC ACTAATAATG 11220

TGGGAACCAC GAAGCAATTG ACGGTGCCGA AAGTAGCTTT GTCTGGGGAT ACAGCCATAA 11280

AGAGTTATGA TTTTTCCGGC ATCCAAAATG AAAAGATTAC TTTGCCGCGG TTCTATAATG 11340

TTCCGAAAAT CACGCCCAGT GCGAATTACA CTGGACAAAT CTATCCAATG GCAGTGACGA 11400

AACAGGTGAA TGTTACAGGA GAAGTATCGA ATGAGGAAAC CACAGGCGCG AGGCTGAATA 11460

TGTTTGTATC AGGTATCCCT AACCAGTTTG TCTTTTCTCG TGCTAGTTGG GACAATGATC 11520

CAGCGGGAAA TATGTTTGCG CGTGGCTTTA ACATGGCTAC TGCAGGCGGC ACACCAGCTT 11580

ATCTGACTAC GGATGATCAC ATTTACTATT ATGTGCCTAA CAAACGGGTT AACATTTATT 11640

ATAATGACAT GACCCCGAAT GGCTTACCAG CGTATCCTCC AGGGTATAAT GCCAGTCTTT 11700

CAAAAACAGT AGTTGATTCA GAAAATTTTC ACTATAAAGC GCCAAAAGCT TTTCCAGAAT 11760

TCTTTAGTTC TGGGAATCGT CACTTCCAGT TTAAAGGCTG GTATAAAGGG CCTGCAAGAC 11820

CAGCGGATCC AAAAGCCGTT AAGTTAGAAA CCTCATTGAC GCCAGAGTTC GATGTCACTT 11880

ACGATGGTGC CGATAATTTA TATGTTTTTT ATGATGAAGT GAAAGAAATG ACCACTACTA 11940

TTCCAGAAGT TACCTATAAG TTTGGCTTTG TTGATGAAAA AGGGGCGCTT GTTGCGCCAA 12000

CGAATGTAGA CATTCAGGCG AACTTAACAA CTGTCGAGGA TAAGGTGGTC CAGAAGGTGG 12060

GGGCTATTAC AGGCGCTAAT GTAGGCAATT TAAAACAGTT AACCGTTCCT AGCCAAACAC 12120

TAACCTATGT ACCTAAAATT AAAGTTGCTT CTTCTGGGGT AACCGATTTT CGGTTGACTA 12180

TTCCCAAGCG CTATAAAATT CCAACAGTCA CACCTGGTTC ATTCTACACT GGCTCAACGA 12240

CCGCCTATCC TTTGGCGACA ACGCTGTTAA GGCATATTAG TGGTCAGGCA GATGAACGGA 12300

TAACAACGGA TGGCACTCGC TATAGTTTAT ATAATTTACC GACGCCTCAT AGCTATCGTA 12360

TGTACAGAAG TAGTTGGCAA CCTGACGTTA CCAACACGGG ATTTAGTGCA ACTTTAAACA 12420

TGGCGACGAC CGAGATTTAT CCAGCTTATT TCACGACCGA TAACACGATG TACTATTTCT 12480

TAGAAAACCG CCGCGTCACT GAACACTATG TTAATGAAGC AGGTGCTGAG GTACCCjAIvjC 12540

CTACAGGCTT CACGCAAGGG AATCAAACGG TGATTGACAG TGATACTTAT CATTTTAAAT 12600

TGGCAAAGGA ATTGCCGTTT AGTTATTTTT TAAACAATAA AGCATACCGA TTTAAAGGCT 12660

GGTATAAAGG CAAAACCAAA CCTAAGGTAT TGGAAACGAG TCGAACGCCT GAGTATGATA 12720

CAACTTTCGA TGATAACGAT GATTTAACCG TTGTTTACGA AGAAATTAAA TACGGCGGAA 12780

ATGCGGTGAC ATTTGGTTTT GTTGGGGAAG ATGGGCAATT GCTTCAGCCG ACAGGGTTTC 12840

AGGTGACTAC GGATATTGTT GAAACAATCG ATGGGCTGAG TACGGTTTTA AGTAATGTTT 12900

CAGCTGTCGA CAGTACAGGC AACGTTAAAA CTTTGACGAT CCCGCAAAAG GAATATGTAT 12960

TGACTCCCCA GATGAATTTC TATGGAACAA AAAATTCATT GATTACGATT CCAAGACAAT 13020

ACCAAGAAAT TTCTTTTACC AAACCTGCCA ACTACCAAGG ACTAGATTAT CCAGTTGCAG 13080

GGCGTGTTGA GAATATGTTT AGTGGTAATC TGTATGTTGA GGGGGCACAA CCACCTTACT 13140

TTTCTGCTTC GAAAGTCAAA GCCAATCAAT ACAAAATAAT TGAGTTTCAT TGGAAAGAAG 13200

AGCCTACACA AAGTTTCTCT AGTTTGTACA AAGCTACGTT AATTCGAAGT AGCGTGGCAC 13260

CAACAGGCGT AGAAATGTAT CCTAATAAGC CAATCTATTA CTATGCGACC AATCGCCGAG 13320

TGACTGAAAA CTTTGTAGAT AAATCTGGCG CAAAAATCAC GCCACCTCAA GGATTTACGC 13380

AAGGGAATCA AGTCACAGTC ACCAGTGATC CTTTTACTTA TACTGCTTCA AAAGCTTTAC 13440

CAAGCGTTTA TGCGGCTGGG GCAAAAACCT ATAAGTTTGT TGGTTGGTAT AAAGGAACCA 13500

CCAAGCCAAC GACCTTAAAA ACAACGGCTA CTCCTAATTA TTTGGTGGAT TTTGATGATA 13560

ATGACGATAT GACGGCGGTC TATGAAGAAG AAACGCCGAC TGCTGCTTTA ACTTTGACCA 13620

GCACCAACCG TGTCGTGAAT AATGACGATA GTGTAGATTG GGTGGCAACA CTGAAAAATA 13680

CTAGCCTGGC ACCACTTAAA ACGCTGACAG TGAAACCAGC GACTACTTGG CCTGCAGGAA 13740

TTGGTACGCC GACGTCACTT TCTGTCCAAT TGGACGGACA AGCACCGAAA ATTTATCCAG 13800

TAACGGCGAC TACTTGGTCT GAAGGAATCA GCTTaACAGG TCTTGAAATA CCGGCAGGTC 13860

AAACAGCCAA CGTGACGCTT GTTGGCACCA AAATTTCAGG AACATCAGAC CAACGCTTGA 13920

CGGCAACCCT TGATGTTACA GGAAATTTTG CGACAGTGAG CGCCGCGGAT GCGGTACGTC 13980

TGACAGATAC AACCCAAGGA ACAATTACAC CAACAGAAGG CTTTATCAGC GTGCCAACCT 14040

TTGATTTTGG AAAAATGAAT ATTGCTAGTA AAACGCAACA ATCAGGCTTG AAAAAAGCGG 14100

CCGATTACTA CGAAAATGGT ACACGCAATC CTTACTTGCG AATTAAGAAA AATCAACCGA 14160

ATTGGCAGAT GACAGCGCAA CTATCTCAAC CGAAAGCAAC CACAGATAGT CTGCCAACAG 14220

CGACTCGTTT ATTGTTAGGT CCAGCGAATG TTTCAAGTTT TACAAATTAT AACGAAGCAA 14280

CCGAACAAAT CAAAGCAGTT GGTAAGACTA GTTCTTTGAG CTTAACGGCG AATAATGTGG 14340

CAACCAGTGT TGTCGCAAAC CAACAGTTCA CAGGTAGTGA CGTCTACCAA CTGGATTTTA 14400

CGTTTGAGAA CATTAAATTA GAAGTACCAG CCAACCAAGG AACCAAAGGA CAACAGTACA 14460

J. 1 VJOrtrt 1 GTCCCTAAAA TAAAAATGCC CGAGCCTTGA 14520

AATAGGAGGC TCAGGCATTT TTTATTGATG GTCTTTTTAG GGTTGTTGTG CTATAATGAT 14580

AAAGAAAAGA GAGATATGCT TCCAACATAT CTCTCTGATG TAGAGCCGTT TAAGACGGTG 14640

ACCAATTTTA TTATTTAAAA ATAACCGTGC TTGGTCAAAG TAGACGGTTA TTTTTTCTTG 14700

TCATTTTTAA GCAATTTCAC AATCAGCGCA ATCAAAGCAA TGGTAAACAT ACCAAACCCA 14760

AGAATTGTCT GAATTGTTTC ATATGCTGAC AAAAGGCCTC TCCTTTCTGT GAATTTCGGA 14820

AATACGTGCA TACGCATCAC CTCTCTTTCA GATTGGATAG CCACCGCCAT AACTTCTCTA 14880

CACATACATT ATACCATGTG GGGCTAAAAA ATAATTAGTT TTCAATATAT TTTTTTAGGT 14940

AAGTGTGCTA TAATAGCAAT GAAAAGAGAG GTATGCGCCA ACATACCTCT CTAGTGTAGA 15000 

GCGGTTTAAG ACGGTGACCT TTTGGATTAT TTAAAAATAA CCGTACTTGG TCAAAGTAGA 15060 

CGGTTATTTT TTCTTGTCTT CTTTAACAGC TTTTAGGATT CCGAAGATTA ACGTTGCGAT 15120 

TCTTGTTGCA AAGCGCATGT TCCTGTGCGC TTGCCGCCGC CATAACTTCT CTACGCTTAG 15180 

AGTATACTAT GTAAAACAAA AAATAATTAG TTTCCAATAT ATTTTTTAGG CAATTGTACT 15240

ATAATGGTAA TGAAAAGAGA GCTATGCGCT AACATACCTC TCTGATGTAG AGCCGTTACT 15300 

AACGGTGACA AAAATTAATA TTTTTGAAAT GACCGTCTAA CTAGGCAAAG TTAAGCAGCG 15360 

CCCCTTTCAT TAAGGGGCGC TGCTTTTTAG TCGTTATTGC GATTGTTCAG TGTCTTTTGT 15420 

AGAGAGGTTC ATTTGTTCTT TGATGGCATT TTCGACACGT TGGCGTTCAC TCTCTTCCAC 15480

AAGTTGATAA GAAATACCTT CAATCCCTGT CACGCCATCG CCAGTGTAAT CGTAGCCTTT 15540 

TAATTGATCG GTTTCAATCG TTTTTAGTGC GGGGGTGTAT TTTTTTAGCA TTGCCATCAT 15600 

TTGGTCAAAG GATAAGTCGG TTTCGCCATG TTCGCCGATT ACGTCCAGAA TTTCTTGATA 15660

TTTTAGCAAG CTGTTAAAAG AAAGTAGCTT GTTGGTTAAT TCGATTAAAA CTTCTCGTTG 15720 

GCGGCGTTGA CGACCATAGT CGCCTTCGGG GTCTTCGTAA CGCATGCGCG CGTATTGTAA 15780 

AGCTTCCCAG CCACCTAAAT GCTGTTTGCC TTTCGGATAA TGAATCCCTT CAGCGTCAAA 15840 

AGCAAAGGCA TTGTCAACAG TGACGCCGCC CACCGCATCG CCAAGGTCCT TCATAGCATC 15900

CATATCAATT GAAAAGTAAT GATTGATTGG GATGTGCAAG ACATTTTCAA CCGTTTTAAT 15960 

TTGCATATCG GCACCACCGA AAACGTGGGC CGCATTGATT TTATTGTATT CGCCATTGGT 16020 

GCCAGCCATT TCCACATAGT AGTCACGCGG GATGCTAGTG ATGGTAGTTT TTTGTAATTG 16080 

ATTATTCACT GTAACCACCA TTAAGGCGTT GGAGCGGTAA TCGGTTTTGC GTTTTGAGTC 16140

ATTCGCAATC CCCATAATTA AGAAACTAAC CGGTTCCCCT TTGGCTAAAT CGGGACTCTT 16200 

CGTTGAAGAT TTTCTTGGTT TATACATGGT GTTCGCCGTT TTTTGCAAAT TGCGATACAT 16260

GCTGACGCCG TAAGCCGAAA CAGAAAGTGT ACCTAAAAAT AAAAGGGTTA AACTGAAGAC 16320 

CACGAATAAA CTTTTTCTTT TACGAGAATG GCGGCGCCGG CGTTGTGTCG ATACTTCTGC 16380 

TTCCTCTGGT TCAGGTGCCG GCAAATAACG TTGGACAATT TGAAATGAAA AGTTGATCCA 16440

GCTATAACTT AGGAGCATCC CAGCAATGAC ATCACTAGGA TAATGAACGC GTAAGATTAA 16500 

GCGACCTAAC ACAATCAAAA CTAAATAAGT GAGTAAGAAA ATTTTGATGC CTTTGGTTTT 16560 

TTCGACATGA AGATAGGTCA AAATGATCAA AATCGTACAG ACTAGCGCCA TTGCTAATAA 16620 

TGAATGTCCG CTGGGAAAGC TAGCAGAGCT CCGGTCAGCT AAAGCGCCTA GGTTTGGTCG 16680 

AGCCCGACCC ACCACGTGTT TAAAAACGCT CCCTAAAAAG CCGACAGCTA ATAGGTTACT 16740 

GACAGCCCAA AAGGCTAGAA TTTTTTCTTT TTTGCGTAAA AGGTACACCA TCAATGATAA 16800 

GAAGAATAGC AGCATTGGTC CAATCGTGGC AGTTTTTGCT AACAGATTGA CCGCTGTTAA 16860

AAGTGGGGCA TTTAATTGCC ATTTGATGTG ATAGATAGCT TCGTCCATTT GTTGAAACCA 16920 

ATTTGGACTA GCCGCCACTA ACAAAGTGAG GAAAGTAAAA CCAAGCGTTG TTAGGGCATA 16980 

AAGTGGAAAT TTTTTATTCG ATGTTTGCAT GAGAATCCCT TTCTTTTTCC CTTGAAAGAG 17040

TAGACTAAGC CAGCATCGAT CGTAAATAAG CGCTTGCGAC TAAACAGGAC ACAGCAAGCC 17100 

AATCGGTTCT CATCGTCTGA GCTTTAGCAC TATCCAACGT AAAGATTTCT CTTAGCTATT 17160 

TTTAGAAGAG CCTTAATTCG ACCATTCCTG TTTTACACAT TGGTGTCTTT CGACATTTCT 17220 

GCGCAATAGC CTATGTTTGG ATAGGAGCCT CACCTAACAA GGTTATTTTA TCTAACGCTA 17280 

GAAGTATAGA ATTTATGCAG CGGTTCTCAC AATGAAGTTT AAGAAAATTT ATTGTTTTTT 17340 

AAATAAATTT AAAGGTGTGG GAGAAAAGCG TTATTTCGGT AAATTTATTT TAAAAACGGA 17400 

AAAACGTTTT ACAGTATCGG TCTTTAACGA TATAATAAAA GAATAGATGA AAAAACGAGG 17460

AGGTGAACAG CATGCTAGTT AATTTTACAG TCAAAAATCT TCTATCTTTT AAAGAGGAAC 17520 

AAACACTTTC CCTACAAACA GGTGCGTATT TACGAAAGTA TGTCAATAAT CGTTTgcGAA 17580 

GTGAAACCTA AAAATTACCC AGAAAAACTA AGGTTACTGA AGAGTGCCAT TGTTTTTGGG 17640

GGAAATGGTT CAGGAAAATC AAATTTAGTG AGTGGGTTAA ATGCATTAAA AACATTAGTT 17700 

TTAAGAGAAC AAGCTTCGAT TAATGATGCT TTATTATATG AACCTTTTGC GTTAGACCCT 17760 

CTCACTAAAC AAAGCCCATC TGTTATGACC ATTGAATTTA TTAATGACAG TGTCTTGTAT 17820 

CGCTATTCGT TAGCTAACAC AGCGCGCGAA GTAGTTAAGG AATCTTTAGA GATTTATGAC 17880 

AAAACGCAGG ATGATTTTGT TTATTATTTT AAAAGAGACG GTGCGGAATC TTCCGTTTTT 17940 

CCAGAAAAAT ATCAAGAATA CCAAACACAA ATTAAGTCTA ATTCGTTAGT TATCCATACG 18000 

TTGGAATCAA AAAATGATCA GCACGCTACG AATGTCGTTC GTTGGTTTGC CAATCAGTTA 18060 

GTTATTTTTG ATGGGTCTTT ACGGAATATT GAACGGCTGA GCAATGAAAA AGACAAAGAG 18120 

AAATTCTTGA ACTTTTTGAA ACTTGCGGAT ATGAACATGG TTGACCTGGT TCCTGTGAAA 18180 

GACAAAGAGG AAGATTACTC GAAACAAGAA AAATTAGCCG CTGTTTTAGA ACGAATTATC 18240 

CAAGAAAAAG ACCCGACCAT TGAAATTGGT CGAGTCAAGA AAAGTCTGTA TTCGCTGTAC 18300 

TCTGTTTATA ACCAATATGA TGAAATGAAT CAAGTGGTCG GACAAGAACG TATTCATTAT 18360 

GATTTGGAAT CTTCTGGGAC GAAAAAATTA ATTGGCTTAG CGCTAAATAT TTTATTTAAT 18420 

CCAGAAGAAA AAGTTTTTGT TTTTGATGAA TTTGATGATG CATTTCATCA AGAGTTGTCA 18480 

GGAACATTGT TAGAAGCGTT CAATGCCATG GaGACGAATA CACAATTTAT CTTGACCTCC 18540

CATGAACTTC ATTTGATGGa TAATCAATTG AGAAAGGATC AAATTTACTT CACAGATAAG 18600

AATTATCGTG GCGCCAgcGA ATTATACGCT TTGTATGACT TTGAAGCAGA CCCAAAAAAA 18660 

GGTCGCGGAG ATATCACATA TTATCGCCGT TATTTAAATG GTCTGTTCGG TGGTGTTCCT 18720 

CATATTGATC GAAGTGAAAT CGTTCAGGCA GTGAAGGTAA AGGAGTAGAA GTTGGTATGC 18780 

CAAAAATAAA AAAGGGTAAA AAAAGGaAAGA AwTTAAAACC AAtGTCATGA TTTTTTGCGA 18840

gGAGaAACAG AAGAaGCTTA CCTTAGATTG ATTwAACGM AaTATTCTGC AGTCmATATT 18900

wAAAGTAAaC TCmAAGTaAA cmTGtAGaCG TCmAGGrGgG CGTTGGTCGA ATACGCCGTT 18960

TCCTGTATTC GGAGTATGGG GAAAAGCGGA CGnGGATAAT TATGATTTGT TTTATGTTAT 19020

GTATGACAAG G 19031

(2) INFORMATION FOR SEQ ID NO: 168:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1563 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

GAAAATTTAT CTTTGCCATT AATACGCAAA ATGATTTTTT GGATGATTCT CGTTTCTGCT 60

TTAATGGGGA TTGGGTTGTC ATTTATTGTC GGCTGGCCGT TATTAATggG TCTTTATTGT 120

TACTTAATTG GTATTTTTTA TTCATCAGGA CCGAAACCTT TATCTAGTCT ACCATTAGGG 180

GAAGTTTTCT CAGGATTTAC AATGGGGTTC ATGATTACCC TTATTTGTGT CTACATTAAT 240

ACGTTTGAAG TTTTCAATTG GTCAGCAGCT AATTTATGGG GGATTTTCTT AGTCGCCTTA 300

CCTAACACTT GTTACATTGC TAACTTAATG TTAGCAAATA ATATCTGTGA CTTAGAGGAA 360

GATGAAAATA ACAAGAGATA TACATTAGTT CATTACCTTG GCAAAGCAAG TTCTTTGAAA 420

TTATTCGTCG GATTAAATAC AATAGCAATG TTAGCCATTT TGTTAGCTGT CGGTTTAGGA 480

TTAACCCCGC CAACAA'rGTT ACTCATGCTT TTAACTTTAC CTTTTGtACG GAAACAAACC 540

CAAGCCTTAC TGAAGGAACA AGTGAAGAGT AAAACATTTG TTTGTGCTGT GAAGATCTTA 600

GCTGTGGGAG CAACTGCACA AGTGCTATTT TTTGCCATAG GTTTGTGGTG GCTGTAAATT 660

TTTTATAGAG TGGAAGGGAA TCAAAATGAA TAAGAAACAT GTTGTTATTT TAGGTGCTGG 720

TTACGCAGGC TTAAAAACAT TACGCGAATT ACAAAAAGGC GCCAAAGACG TCGAAATTAC 780

TTTAGTGGaT CGTAACAACT ATCACTACGA AGCAACGGAT TTGCATGAAG TTGCTGCTGG 840

AACACAACCA GCTGAAAAAA TCACATACAA TATTATGGAT GTTGTCGATG AAAAAATGAC 900

CACCTTTATC CAAGGAACAG TTAAAACAAT TGATGCTGCA ACACAAACAG TTGCTTTAGA 960

AGATGGCCAA ACAATCAATT ATGATTACTT AGTTGTTTCT CTTGGGTTTG AATCAGAATC 1020

ATTTGGTATC CCTGGTGTGC AAGAACATGC TCTACAAATG GTAGATGTGA AAACGGCTTT 1080

AAACGTGTAT GAACATATCC AAGAACAAAT GCGTCAATAT AAAGCAACTC AAAACGAAGA 1140

ATTCTTAAAA ATTGTTGTCT GCGGAgcAGG CTTTACAGGG ATTGAATTAT TAGGTTCTCT 1200

TGTTGAAAAC AAACCAAAAT TTGCTGAAAT CGCAGGTGTC TCAGCAGATC AAATCCAAAT 1260

TTATTGTGTC GAAGCAGTTA CTCGCCTATT GCCGATGTTT AACGAAAAAC TTGCGAACTA 1320

TGGTGTGCM TTATTAAAAG ATTCAGCTAT TCATTTATTA TTAGGTAAAC CGATTAAAGA 1380

AACAAACCAG GCGCAGTTGT TTATCAAGAT AACGAAGCAG GCGACTTGGC AGAACTTTCT 1440

GCTAAAACAA TTATTTGGAC AACGGGTGTT AGTGGGAGCC ACGTTGTAGG CGATTCTGGC 1500 

TTTGAAGCTC GTCGTGGTCG TGTCATGGTT CAACCAGATT TAACAGATGC GAACCATAGC 1560 

AAC 1563 
(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 



AAGCGTTGAC ACTCGTGATT TTATTCAAAA CAACTACACA GAGTACAAAG GCGACGACAG 60

TTTCTTAGAA CCAATCGCTC CGAGCACTGA CAAATTATGG ACAAAACTAC AAGAATTATT 120

TGAAATCCAA CATGAAAAAA ATGGCGTTTA TGACATGGAT AGCGATATCC CAGCAACTAT 180

TACGTCACAT GAACCTGGTT ACTTAATCAA AGAAGAAGAA AAAATTGTCG GTTTACAAAC 240

AGATGTACCA TTAAAACAAG CATTTATGCC ATTCGGTGGT ATTAAAATGG CGAACAATGC 300

CTTAGTTTCA AATGGTTATG AAACTGACGA AGAAATGACA AAAATCTTCA CAGAATACAG 360

AAAAACACAT AACCAAGGTG TTTTTGATGC CTACACAGCT GAAATGCGTT TAGCACGTAA 420

AAACAAAATC ATTACTGGTC TTCCAGATGC TTATGGCCGT GGCCGTATTA TCGGTGACTA 480

TCGTCGTATC GCATTATATG GTATCGACTA CTTAATGGAA CAAAAGAAAA AAGATCATGA 540

TAACACTGGT AACAAAGAAA TGACAGACGA TGTTATTCGC TTACGTGAAG AAATTTCTGA 600

ACAATACCGT GCGTTAAATG ACTTAAAACA AATGGCTGCT TCTTATGGTT TCGATATTTC 660

TCGTCCTGCT GCAAATGCGC AAGAAGCAAT CCAATGGTTA TACTTTGGTT ACCTAGGTGC 720

CATTAAATCT CAAAATGGTG CGGCAATGTC AATTGGTCGT ATCTCAGCAT TCTTAGATAT 780

CTATATCCAA CGTGACTTAG AAGCAGGTTT AATCACTGAA TTTGAAGCAC AAGAAATGAT 840

TGACCATTTA ATCATGAAAT TACGTATGGT TAAATTTGCA CGTACACCAG AATACAACCA 900

ATTATTCTCT GGTTATCCAA TTTGGGCAAC ATTATCTATT GCTGGTATGG GTATTGACGG 960

TCGTTCATTA GTTACCAAAA ATGACTTCCG TATCTTACAC ACCTTAACAA ACATGGGACC 1020

ATCTCCAGAA CCTAACTTAA CTGTATTGTA TTCTTCTCAC TTACCAGAAG GTTTCAGAAC 1080

ATATGCAGCT AAAATCGCGA AAGAAAGCTC ATCTATCCAA TTTGAAAACG ATGACTTATT 1140

ACGTGAAAAC TGGGGCTCTG ACGACTGTGC AATCGCATGT TGTGTTTCTG CAACAGTTAT 1200

GGGTAAAGAT ATGCAATTCT TCGGCGCTCG TGCGAACTTA GCGAAAGCTG TTCTTTATGC 1260

AATTAACGGC GGGGTCGACG AAAAAACTAA AATGCAAGTA GCACCAAAAT ATCGTCCAAT 1320

GACTGGTGAC AAATTAGATT ACCATGAATT CATGGaACGC TATAAAGACA TCTTAGACTG 1380

GTTAGCTGAA TTGTACGTTA ACACATTAAA CATTATTCAT TACATGCATG ATAAATATGC 1440

ATATGAAGCA CCACAATTAG CTTTAATGGA TACTGATTTA CAACGGACAT TCGCAACTGG 1500

TATTGCCGGa ATCTCTCATG CGACTGATAG TATCATGGCA ATTAAACACG GTGAAGTAGA 1560

AGTAATCCGT GACGAAGATG GTATGGCAAT TGATTATGTA CCAACAAAAG AATTCCCAAC 1620

ATACGGAAAC GATAACGAAG AAGCCGATGC AATGGCTAAC TGGATTTTAG ATTACTTTAT 1680

GACACAAATT AAACGTCAAC ACACATACCG TAACTCTAAA CCAACGACTT CATTATTAAC 1740

AATCACTTCA AACGTTGTTT ATGGTAAAGC AACAGGGAAC ACGCCTGACG GACGTCGTGC 1800

TGGCAAACCT CTTGCACCAG GTGCGAACCC AAGTTATCAA GACGGTAAAT TCTTAGGTGA 1860

GAAAAATGGC TTATTAGCTT CATTAAACTC AACAGCTCGT TTAGAATATA CAATCGCTTT 1920

AGATGGTATT TCTAATACAC AAACCATCAA CCCTAATGGT TTAGGTAAAG ACGACGACAC 1980

AAGAATTAAC AACTTACGTA ACGTATTAGA TGGCTACTTC GATAAAGGTG GTTACCACTT 2040

AAACGTAAAC GTATTTACAA ACGAATTATT ATTAGATGCA CAAGCACATC CAGAAAAATA 2100

TCCAAACTTA ACTATCCGTG TATCTGGATA TGCCGTGAAA TTCCGTGACT TAACTCCTGA 2160

ACAACAAGCA GACGTTATTT CAAGAACTTC ACACGACAGA CTATAAAAAA CATTTACATT 2220

ATTTAACTGA AAGGGTGACA TTTAACGTGT CATCCTTTTC GTTTGAAAGA AGAGGGATCA 2280

TTATGAGAAC ACCAGTTACA GGTAGAATTC ATTCAACAGA AAATTTTGGT ACCGTTGATG 2340

GCCCAGGTGT CCGTTTTATC GTATTTACAC AAGGGTGTCG CATGAGATGT CAATTCTGTC 2400

ATAATCCAGA TACTTGGAAA ATTGGTTCCG GTGGTCGTGT GGTGACGACT GACGAAGTAT 2460

TAGAGGAAGC ATTACGTTTT CGCTCTTATT GGGGCGAAAA GGGTGGTATC ACTGTCAGTG 2520

GTGGTGAGCC GCTATTGCAA ATGGACTTTT TGATCGATTT ATTTAAAAAA GCCAAAGCAC 2580

AAGGCATTCA TACAACCTTA GATACGTGTG GCAAGCCTTT TACTCGGGAA GAACCCTTCA 2640

TTAGTCAATT TGATGAATTG ATGAAGTATA CAGATTTACT CTTATTTGAC ATCAAACATA 2700

TTGATAACGA GCAACATAAA CTGTTAACAA CGCAATCAAA TGATAATATT TTAGAGATGG 2760

CTACCTATTT ATCTGAAATT GACAAACCTG TTTGGATTCG CCACGTATTG GTGCCTCAAC 2820

GTAGTGATTA CGACGAATAT TTAATCCGTC TGGATGCCTT CATAAAAACA TTAAACAATG 2880

TAGATAAAGT TGAAGTGCTT CCTTACCATA CGATGGGCAA GTACAAATGG GAAGAACTAG 2940

GTATTCCTTA TCCATTAGAA GGAATTGAGC CTCCTAAAAA TGATCGTGTT GAAAATGCGA 3000

AAAAACTACT ACATGTCGAG GACTATCAAG GTTATCTAGC ACGTTAACAA ACGAAAAAAT 3060

AGAAAAGAAG ACAATTTTTT TGTTGTTTCT TTTCTATTTT TTGTTATCAT TAAGAGTGAA 3120

TCAAAACCAA GGAGTGTGTA TTATGTCAAA AATTTTAGTT TTTGGACA 3168



(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 



ACCATCTTAA TTTGACAATG AATGACACTG GTTGAGAAGT CGATTCCAAT ATGGAAGGCT 60

CTGTCAGCTA GTTTTTCACT CTACCAACAA AATAATCAAC CAAAATAACA ACGGAAAGCC 120

AAACAAGAAA CATACTAATC ATGCTTCTtC AAAGAAGCAC AAATAAAAAA GTGATGACTC 180

GAGAAGAAAA TCTCTATGGA GTCATCGCTT TTTCTTATAT TTTAGTTAAA AAATTATTTT 240

ACTAAAGGTT GAAGTAACTT GTTCCAGAAA CTAATAAGTG GGCCAGCACC AAATGCACAA 300

ATAATTGTAC CAATCCCAAT TGGACCACCA GCAATAAAGC CAATTAATAG GACCAACAAA 360

TCTTGTGCTT TTCGAACAGA AGCGTAAGGT TTGTTGGTTT GTTGACAGAT GGTTGGCGCA 420

ATCGCATCAT AGGGGGACTC GCCTTGATCT GTAGCCATGT ATAAGGCGCA gcCAAATGTA 480

AAAATCAGGA CTCCGACGAA TAAAGAAAGT AGTTTGCCAA TTGTTGAGCC TCCTAAACTA 540

TTGGCATAAG GTTGAAGCCA AGTAGAAATC TGTTGAATCA AAATTCCTAC TAAAAGCATA 600

TTTAAAATAG TTCCTAGACC AATGGCTTTT TTGTTTTTTA AATAGACATA GAATAAAATA 660

AACGCATTGA CAATAATTTG AAAAATGCCT AATTGCATAC CAAACATCGA AGAAAGTCCG 720

ATATTCATAG CAGTGAAAGG ATCAACTCCT AAACCTGCCA CTCTTAAAGT GGATGCGCCA 780

ATCCCGATAA TAACTAAACC AATTAAAATT AAAAGAACAT TTTTGACTAG TTTCATGGTA 840

AGCCTCCTGA ATAGATTTTG TGTGTGATAG TACAAAGTCT GCAACAAAAA TTCATAGAGA 900

ATTTTGTTGC AGACTACTTT TTAACCAGCT CCGTGTTGGA ACGAAGGATA TAGAGACATC 960

CCGCCATCTA CAAAGAGTGT TGTTCCGGTT ACGTATGAGG ATTCATCAGA AGCTAACCAA 1020

GCGGCCGCAG CAGCTACTTG ATTTGGTTTA CCAATAATAC CCATAGGAAC CATGCTTGTT 1080

GTTTGTTTTA ACTGTTCGGG ATCGGAAAAT TTTTCAGCAT TAATAGGTGT GTTAATCGCT 1140

CCAGGCGCAA TACAGTTGGC ACGAATACCA TATTCCGCAT ATTCCAAAGC GATTGTTTTC 1200

GTGAACATTT CGGTTGCGCC TTTACTTGCC GCATAGTGGG CGAACGTTGG CCAAGGAATC 1260

TTTGGTGTAC GGAGACATAT TA 1282



(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11871 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171:

nnAAATAAAG AAATCATAAA ATAACTTTAA AAGTCCTCTT TTAAAGGAAA ATGTCTGTAA 60 

AATAGAGAGA AATGGTTTTC TTTCTCGTCA AGTAATGATA AAATAAAACG GATAGATTGT 120 

GCTTCTTGTA CAAAAAGGAT ACAAAAATAA AAAGAGGTAA TTGAAAGGAA CACAGATACC 180 

AATGGCAAAT TTTTTGAAAA AAATGATCGA AAACGATAAA AAAGAATTAA GACGATTAGA 240

AAAAATCnCC GATAAAATTG ATGCACATgC AAGTGCAATG GAACAACTAA GCGATGAACA 300 

ATTACGTGAA AAAACAGATG AATTTAAAGC CCGTTATCAA AAAGGGGAAA CCTTAGACGA 360 

ACTATTACCA GAAGCGTTTG CGGTTGTCCG TGAAGCTGCT AAACGAGTAC TAGGTTTATT 420

CCCGTACCGT GTCCAATTAA TGGGTGGGAT TGTGCTTCAT GACGGGAACA TTCCTGAAAT 480

GAGAACAGGT GAAGGGAAAA CCTTGACTGC CACAATGCCT GTATACTTGA ATGCACTATC 540
TGGCGAAGGA GTACACGTTG TTACAGTTAA TGAATACTTA GCAACCCGTG ACTCTAACGA 600 
AATGGGAGAA TTATATAATT TCTTAGGCTT AAGCGTTGGC TTAAACATTA ACTCTAAATC 660 
TTCTGATGAA AAACGTGAAG CGTATAACTG TGATATCACT TATAGTACAA ATAACGAATT 720 
AGGATTTGAC TATTTGCGTG ATAACATGGT TGTTTATCGC AGCCAAATGG TTCAACGTCC 780 
ATTAAACTAT GcAATTGTCG ATGAAGTGGA TTCAATCTTA ATTGATGAAG CAAGAACCCC 840

ATTAATTATT TCTGGACAAG CTGAAAAATC GACAGCACTT TATACGCGTG CCGATAATTT 900 
TGTCAAACGC TTAAAAGAAG ATGAAGATTA CAAAATTGAT ATTCAGTCTA AAACAATTGG 960 

TTTAACAGAA GCAGGTATTG AAAAAGCTGA ACAAACTTTT GGATTGGATA ACCTTTACGA 1020 

TATTGAAAAT ACAGCATTAA CGCACCATTT AGATCAAGCG TTACGGGCAA ACTATATCAT 1080 

GTTACTAGAC ATTGATTACG TGGTTCAAGA CAATAAAGTC TTAATTGTTG ACCAATTTAC 1140 

TGGTCGTATC ATGGATGGTC GTCGTTATTC TGATGGCTTG CATCAAGCGA TTGAAGCAAA 1200 

AGAAGGCGTC GAAATTGAAG ATGAAACAAA GACAATGGCG ACGATTACGT TCCAAAACTA 1260 

CTTCCGGATG TACAAAAAAC TAGCCGGGAT GACTGGTACT GCTAAAACAG AAGAAGAAGA 1320 

ATTCCGTGAA ATTTACAATA TTCAAGTCAT TCAAATCCCA ACCAATCGTC CAATTATCCG 1380 

TGACGATCGT CCAGATTTAC TTTATCCAAC CTTAGAAAGT AAATTTAATG CAGTGGTGGA 1440

AGATATTAAA GAACGTTACC ACAAAGGGCA ACCAGTGTTG GTGGGGaCTG TTGCGGTAGA 1500 

AaCGTCAGAA TTACTTTCTG ATAAATTAAA CGCAGCTAAA ATTCCTCATG AAGTATTAAA 1560 

TGCGAAAAAT CACTTTAAAG AAGCCGAAAT CATTATGAAT GCCGGTCAAA AAGGCGCAGT 1620 

TACAATTGCA ACAAATATGG CTGGTCGTGG AACGGATATT AAACTTGGTT TAGGTGTACT 1680 

TGAACTAGGT GGTCTAGCAG TTATCGGGAC AGAACGTCAT GAGTCACGTC GTATTGATAA 1740

TCAGTTGCGT GGACGTGCAG GACGTCAAGG GGATCCAGGG GTTTCACAAT TTTATTTATC 1800 

TTTAGAAGAT GACTTAATGA AACGCTTTGG TTCAGAACGA ATTAAAACAT TCTTAGAACG 1860

AATGAATGTG CAAGAAGAAG ATGCAGTTAT TCAAAGTAAA ATGTTTACTC GCCAAGTAGA 1920 

GTCTGCTCAA AAACGTGTGG AAGGAAATAA CTATGACACA CGTAAAAATG TCTTACAATA 1980 

CGATGACGTG ATGCGCGAGC AACGTGAAGT TATTTATGCG CAACGACAAG AAGTAATTAT 2040

GGAAGAAAAT GACTTGTCTG ATGTATTGAT GGGCATGGTA AAACGGACGA TTGGCCGAGT 2100 

GGTGGATAGC CATACACAAT TAGAAAAAGA AGAATGGAAT TTAGATGGGA TTGTCGATTT 2160 

TGCAGCCTCA ACACTTGTCC ATGAAGATAC GATTTCTAAA AAAGATTTAG AAAATAAATC 2220

AGCTGAAGAA ATCAAAGACT ACTTAGTGGC ACGTGCGCAA GAAGTATTTG AAGAAAAATC 2280

TCAACAATTG AATGGCCAAG AACAATTGCT AGAATTTGAA AAAGTAGTCA TCTTGCGTGT 2340

TGTCGATACT AAATGGACAG ATCACATTGA TGCAATGGAT CAATTACGCC AATCTGTCGG 2400

GTTACGTGCG TACGGTCAAA ACAACCCATT AGTAGAATAT CAAACAGAAG GTTATTCTAT 2460

GTATAACAAT ATGGTGGGTT CAATCGAATA TGAAGTAACA CGTTTATTCa TGAAATCGGA 2520

AATTCGCCAA AATGTCCAAC GTGAGCAAGT AGCTCAAGGA CAAGCGGAAC ATCCAGAAAC 2580

AGAACAAGAT GCAGCTGCAC AAAGCAACAC AAGTGCGAAg CGTcAACCTG TGCGTGTAGA 2640

TAAAAAAGTG GGCCGTAATG ATTTATGCCC ATGCGGAAGT GGCAAAAAAT TCAAAAATTG 2700

TCATGGAAGA AACGCTTAAT TAAAGAAAAA ACGCAGGGTG GGGCAATTTG TCCCGCCTTT 2760

GCTCTTTTTA CAATAAAAAG AAAGAAGGAA TCATTTTGGA AAACCCAGAA ATTCGTACAT 2820

TATTAGACGA AATGACACAA AAAATTACTA GCTTCAGGAG GTCTCTTTGA CTTAGACCAG 2880

TTAGAAGAAG ACATTGCCGA GGCGGAAAAT CGGATGGGAG AACCTGGTTT TTGGGACAAC 2940

ACAGAAACGG CTCAAAAGCT AATTAATGAA ACCAATAGTT TGAAAGAAAA ATATCAACAA 3000

TTTCATCAGC TCTCTGAGGA AATTGACGAA TTAACGGTGA TGTTTGAATT ACAACAAGAA 3060

GAATATGATG CAGAGATTCA AGAAGAATTG GAAGAAAGAA TTCATCTCTT GCAAGAACGT 3120

TTAACAACGT ATGAGCTATC CTTACTACTA AATGAGCCAT ATGATCATAA TAACGCCCTA 3180

ATTGAATTAC ACCCTGGTGC CGGTGGTACT GAATCACAAG ATTGGGGAAG TATGCTTTTA 3240

CGAATGTACA CACGTTGGGC AGAAAGTCAC GGTTTCCAAG TTGAAACGTT GGATTACCAA 3300

GCTGGAGATG AAGCAGGGAT TAAAAGTGTC ACCTTATTAA TCAAAGGTTA CAATGCTTAT 3360

GGTTACCTAA AATCTGAAAA AGGTGTCCAT CGTCTTGTGC GGATTTCGCC CTTTGATTCC 3420

GCTAAACGCC GTCATACTTC GTTTTGTTCA GTTGATGTAA TGCCTGAATT AGACGACACA 3480

ATAGAAATTG CGATCAATCC AGATGACTTA AAAATTGATA CCTATCGAGC AAGTGGTGCC 3540

GGTGGTCAGC ATATTAATAA AACAGAATCG GCTGTTCGAA TCACACATAT TCCAACAGGT 3600

ACCGTTGTCG CAAGCCAAGC ACAACGTTCT CAATTAAAAA ACCGTGAGCA AGCAATGAGT 3660

ATGTTAAAAG CCAAATTATA TCAATTGGAA ATGGAAAAGA AAGCCCAAGA AGCAGCAAGT 3720

TTACGTGGTG AACAAATGGA AATTGGCTGG GGCTCACAAA TCCGTTCTTA CGTGTTCCAT 3780

CCGTATTCAA TGGTCAAAGA CCATCGGACA AATTACGAAA CCGGCAATGT ACAAGCTGTG 3840

ATGGATGGAG ACCTTGATGG TTTCATTGAT GCCTACTTAA AACAGCGCTT AAGTTAAGCT 3900

CTTCATTAAA TTGAAATAAA AATGTAACTT AATGCAAGAA GTTTGCAATA TTGACATGTT 3960

ATAATGAGTA GGATTGAAGA AAAAAGAGAT GGAGAGAACG CCATGATTGA AATGAAAGAT 4020

GTAATGAAGA AATATTCGAA TGGTACAACA GCGATTCGGA ATATTTCAGT AGAGATTGAC 4080

CAAGGCGAGT TCGTCTATGT GGTTGGTCCC TCAGGTGCAG GGAAATCAAC CTTTATTAGA 4140

TTAATGTATC GTGAAGAAAA AGCAACAAAA GGTAATTTGA CAGTTGCAGG TCATGATTTA 4200

ATGAAAATAA AAAATAAAGA AGTTCCTTAT TTACGTCGTG AAATCGGCAT CGTTTTCCAA 4260

GACTATAAAC TATTACCAAA GAAAACTGTG TATGAGAACG TGGCCTATGC GATGCAAGTT 4320

ATCGGACGGC GTCCTCGCGA AATCAAAAAA CGCGTGATGG AAGTTTTAGA TTTAGTCGGT 4380

CTAAAACATA AGGTACGTGT TTTTCCAAGC GAACTTTCTG GTGGGGAACA GCAACGGGTA 4440

TCCATTGCAC GAGCAATCGT AAATACACCG AAAGTTTTAA TTGCCGATGA ACCAACTGGG 4500

AACTTAGATC CAGAGAATTC ATGGGAAATC ATGAAATTGT TGGACCGGAT TAATGCACAA 4560

GGAACAACAG TCGTGATGGC AACGCATAAT AGTACAATTG TAAATACGAT TCGTCACCGT 4620

GTTATTGCCA TTGAAAATGG TCGCATTATC CGTGACCAAG TGGAAGGAGA ATACGGCTAC 4680

GATGATTAGA ACATTCTTTT CTCATTTATT TGAAAGTATC AAAAGTTTAA AACGGAACGG 4740

TTGGATGACT GTTGCCTCAG CAAGTGCGGT TACAATTACA TTAGTTTTAG TTGGTATTTT 4800

TATGGGCGTT ATTTTTAATG CAACGAAACT TGCTGATGAT ATCGAAAAAA ACGTGACAGT 4860

GTCTGTCTTT GTCGATATTG GTACAAAACA AAATGAAATG AAAACATTAG AAAAACAACT 4920

TAAAGGATTA GACAATGTTG AAGATATTTC TTATTCAAAC AAAGATCAAC AATTGAAAAA 4980

AATCCAAGAA CAAATGGGGG ATGCCTGGAA TCTTTTCGAA GGCGACAGTA ACCCATTATA 5040 

CGACGTGTAT TATGTCCGTG CAAAAACACC AGAAGACACG AAAGATATTG CGAAACAAGC 5100 

AGCTAAATTC CCAAGTGTTT TTAAAGCCGA TTACGGTGGT GTAAACTCAG ACAAAATCTT 5160 

TAAAATCGCT CAAACCGTTC GAACTTGGGG CTTAGCAGCT GCTGCATTAC TTTTATTCGT 5220 

TGCAGTGTTC TTAATTTCCA ACACTATTCG GATTACGATT TTATCTCGTC AAAAAGAAAT 5280 

CCAAATTATG CGTCTTGTAG GAGCCAAAAA CAGTTTCATT CGTTGGCCGT TCTTCTTAGA 5340 

AGGTGcTTGG aTtGGGtTGA TTGGcGCAAT CGTGCCAGTC ATTATTAkGA CGTTAGGCTA 5400 

CCATCAAGTT TATAACATGT TCAACCCGCA GTTATTGCGT TCGAATTATT CATTAATTCG 5460

TCCAGAAGAT TTTATCTGGA AAGTCAATCT ATTAATGATT GCAACAGGAA TGATTATTGG 5520 

ATCACTAGGT TCAGTTATTT CAATGCGTCG TTTCTTGAAA ATTTAAACGA AAAAAAGCTC 5580 

TCCATTTTGG AGAGCTTTTT TTGTTTTAGT TTTAAAATAC TTGTTATTTA ATTTAAGAGT 5640 

TTTCTCAGTT GTACCAACAA ATTGATAACT TATAATTTTC AAGTCCTACA TATAATGGTA 5700 

AAATAGAATG GATTGAAATT AATTGGAGGA ATAATGAATC GATGAAAAAA AGATTGCTAT 5760

TATTTATTGG TTTGGCAAGT ATACTTACTT TGACAGGATG TGCAAAATGG ATTGATCGTG 5820 

GTGAATCCAT CACAGCGGTA GGCTCATCAG CTTTACAACC ATTAGTAGAG ACAGCGAGTG 5880 

AGGAATATCA AAGCCAAAAT CCGGGAAGAT TTATTAATGT CCAAGGTGGC GGAAGCGGAA 5940 

CAGGTCTGAG TCAAGTCCAA TCTGGCGCGG TAGACATTGG TAATTCTGAT TTATTTGCAG 6000 

AAGAGAAAAA GGGCATCAAA GCGGAAGACT TAATTGATCA TAAAGTTGCT GTCGTTGGGA 6060 

TTACACCAAT CGTTAACAAA AATGTCGGTG TCAAAGATAT CTCAATGGAA AATTTAAAGA 6120 
AAATCTTTTT AGGTGAAGTA ACAAACTGGA AAGAACTTGG CGGGAAAGAC CAAAAAATTG 6180

TTATTTTGAA TAGAGCGGCC GGTAGTGGTA CGCGTGCGAC TTTTGAAAAG TGGGTCTTGG 6240

GAGATAAAAC AGCCATTCGT GCGCAAGAAC AAGATTCCAG CGGCATGGTT CGTTCCATTG 6300

TTTCTGATAC ACCAGGAGCG ATTAGTTATA CCGCATTTTC ATATGTTACT GATGAAGTAG 6360

CTACGTTAAG TATTGATGGT GTTCAGCCAA CAGATGAAAA TGTAATGAAC AATAAATGGA 6420

TTATTTGGTC TTATGAACAC ATGTACACTC GTAAAAATCC AAGTGATTTA ACCAAAGAGT 6480

TTTTAGACTT TATGTTGTCa GATGATATCC AAGAACGTGT GATTGGTCAA TTAGGGTATA 6540

TTCCTGTTTC GAAAATGGAA ATTGAACGGG ATTGGCAAGG AAATGTCATT AAATAATTCT 6600

CTTAACACAA AATTTACAAA CTTAACATAA AATTTACATG TTATTAAAAA CAAATTGATG 6660

TAAAAAAGGT ACACTATTTT TATCTGTAAA AGTTGGATTG TTTGAGGAGG AAATATTCTT 6720

GGAAGATGTA CAGAAAAAAT TATTAACAAA ATCGAAAAGA GCCAAGATGG AGCAACGAGG 6780

GAAATTTATT AGTTTCCTAT GTATTGCCTT AATTGTAGCA GTGGTTGTTT CAATTTTCTA 6840

TTTTGTTGCA AGTAAAGGAC TTGCTACATT CTTTAAAGAT AAGATTAATG TGTTTGATTT 6900

CTTATTTGGA ACAAACTGGA ACCCAAGTGA TATCGGTGCA GACGGTAAAC CCATGGTTGG 6960

GGCTTTACCG ATGATTGCGG GTTCCTTTAT TGTAACGTTT TTGTCAGCGA TTGTTGCGAC 7020

GCCATTTGCG ATTGGAGCGG CTGTTTTTAT GACAGAAATT TCACCTAAAA AAGGAGCAAA 7080

ATTTTTACAA CCAGTTATTG AATTACTTGT AGGGATTCCC TCTGTTGTTT ATGGATTTAT 7140

TGGTTTATCT GtCATTGTTC CTTTTGTCCG TTCAATCTTT GGCGGAACTG GCTTCGGTAT 7200

TTTAGCTGGT ACGTTTGTGC TATTTGTCAT GATTTTACCA ACGGTCACTA CGATGACCGT 7260

AGATGCATTA AAAGCCGTGC CTAGACATTA TCGTGAAGCG TCGTTGGCGC TTGGTGCTAC 7320

GCGTTGGCAA ACGATTTATA AAGTTGTCTT GCGGGCAGCT GTGCCTGGTA TTTTAACTGC 7380

GGTTGTCTTT GGTATGGCTC GTGCGTTTGG TGAAGCCTTA GCAATCCAAA TGGTAATCGG 7440

AAATGCAGCT GTGATGCCAA CTAGTTTGAC AACACCAGCT TCGACACTAA CAAGTATTTT 7500

GACAATGGGG ATTGGGAATA CAATTATGGG AACAGTTGAA AACAACGTTC TTTGGTCTTT 7560

AGCCTTGATT CTATTGTTGA TGTCTTTACT ATTCAACATT GTTATCCGAA TTATCGGTAA 7620

GAGAGGAGCC TTAAAATAAT GAATGCAAAA AAAGCAGATA AAATCGCAAC AGGTATCTTA 7680

TATGCCGTAT CTGGCGTCAT TGTTTTAATT TTGGCGGCCT TACTGATTTA CATTTTAGCT 7740

AGAGGAATTC CACATATTTC GTGGGAATTT TTAACACAAC CATCCAGAGC CTATCAAGTT 7800

GGTGGTGGGA TTGGTATTCA GCTGTTTAAC TCACTATATT TACTATTGAT TACAATGATT 7860

ATTAGTATTC CAATTTCATT AGGTGCTGGA ATTTACTTAT CGGAATATGC GAAAAAGAAT 7920

TGGTTAACTG ATTTAATTCG TACCTCGATT GAAATTTTAA GTTCATTGCC TTCCGTTGTT 7980

GTCGGGCTGT TCGGCTTCTT GATTTTCGTT GTACAAATCG GATATGGCTT TTCAATCTTA 8040

TCAGGTGCGT TAgcATTAAC ATTTTTTAAC TTACCATTAT TGACTAGAAA TGTTGAGGAA 8100






TCTTTGAAAG CCATTCATTA TACACAGCGT GAAGCGGGGT TAGCCTTAGG TCTTTCACGT 8160

TGGGAAACAG TTAAAAAAGT TATTTTGCCA GAAGCATTAC CAGGAATTTT GACAGGTGTT 8220

ATTTTGAGTT CTGGTCGGAT TTTCGGTGAA GCCGCTGCGT TAATTTATAC AGCAGGACAA 8280

AGCGCACCTG CCTTAGATTT CAGTAATTGG AATCCATTGA GCGTTTCAAG TCGAATTAGC 8340

ATTTTCCGTC AAGCGGAAAC GTTGGCTGTG CATATTTGGA AAATTAATAC CGAAGgAACG 8400

ATGCCAGATG GCGCTGCGGT CTCAGCAGGT GCCTCCGCTG TGTTAATTTT AGTAGTTTTA 8460

CTTTTCAATT TCGGCGCTCG TTTTATCGGC AACCGTCTTT ATAAGAGAAT GACATCAGCT 8520

TAATGTTAGG GGAATTAGCA TGAAAGAATA CAATTTAAAT GATACCCATT TACTTCAACT 8580

AGATAGCCAA AAAGACCCGA TTGCGTTACA CACGGAAGAC TTACATGTCT TTTACGGGGA 8640

CAATGAGGCT ATCAAAGGTG TTGATTTGCA ATTTGAAAAG AACAAAATAA CCGCCTTAAT 8700

TGGACCTTCT GGTTGCGGTA AATCCACGTA TTTACGTTCG CTAAATCGTA TGAACGATGG 8760

CATTGCCAAT TCACGTGTTA CTGGAAAAAT CATGTATAAA GATGTCGATG TCAATACGAA 8820

AGAAGTTGAC GTCTACGAAA TGAGAAAACG CATTGGCATG GTTTTTCAGC GGCCCAATCC 8880

ATTTAGTAAA TCTATCTATG AAAATATCAC ATTCGCATTG AAACAGCATG GTGAAAAGGA 8940

TAAGAAAAAA TTAGATGAAA TCGTTGAAAC AAGTTTGAAG CAAGCCGCAC TTTGGGATCA 9000

AGTGAAAGAT AACTTAAACA AAAGCGCCTT AGCTCTGTCT GGTGGGCAAC AACAACGTTT 9060

GTGTATCGCT CGGGCAATTG CGATGAAACC AGATATTTTA TTGTTAGATG AACCGGCAAG 9120

CGCGTTAGAT CCGATTTCAA CAGGAACGGT TGAAGAAACG TTAGTGAATT TAAAAGATGA 9180

TTACACGATC ATCATTGTGA CACATAACAT GCAACAAGCG GCGCGGATTA GTGACTATAC 9240

AGCTTTCTTC TATATGGGTA AAGTCATTGA ATACGACCAT ACAAGAAAAA TCTTTACACG 9300

ACCAAAAATT CAAGCAACGG AAGAcTACGT TTCAGGACAT TTTGGTTAGG AGGAGCAAGA 9360

TGGGCAAAGA AATTATTTCA TCAAAAGATT TACATTTATA TTACGGGAAA AAAGAAGCAT 9420

TGAAAGGCAT TGATTTAACT TTTAATCAAG GAGAATTAAC TGCCATGATT GGTCCTTCTG 9480

GTTGCGGGAA GTCAACGTAT TTACGTTGCT TGAATCGTAT GAATGATTTA ATCCCAGACG 9540

TAACTATTAC TGGAAGTGTC GTCTATAAAG GAAAAGATAT TTATGGACCG AAAACAGATA 9600

ACGTTGAATT ACGTAAAGAA ATTGGCATGG TTTTCCAACA ACCTAATCCA TTTCCGTTTT 9660

CAGTTTATGA AAATGTGATT TAaTGgCTTG CGCCTAAAAG GAGTCAAAGA CAAGCAAGTC 9720

TTAGATGAAG CAGTTGAAAC CAGTTTGAAA GCTGCTGCCG TTTGGGAGGA TGTCAAAGAT 9780

AAACTTCACA AAAGTGCACT TTCACTTTCT GGTGGACAAC AACAGCGGGT TTGTATCGCT 9840

CGTGTTTTAG CTGTGGAACC GGATATTATT TTATTAGATG AACCAACTAG CGCCTTGGAC 9900

CCAGTTTCTA GTGGTAAAAT TGAAAACATG CTGTTAACAT TAAAAGAGAA ATATACCATG 9960

ATTATGGTGA CACATAATAT GTCACAAGCA TCACGGATCT CTGATAAAAC GGCCTTCTTT 10020

TTACAAGGTG ATTTAATTGA ATTTAATGAC ACGAAAAAAG TATTTTTAAA TCCAAAAGAA 10080
AAACAAACAG AAGATTACAT TTCTGGTAAA TTTGGTTAAA AGGGGGAAAA AGAATGTTAC 10140

GGACACAATT TGAAGAAGAA CTATTGAATC TGCACAATCA ATTTTATGAA ATGGGTATGA 10200 

TGGTTAGTAG TGCCGTCCAT AAATCCgTGA AGGCATATAT TAATCATGAT AAAAAATTGG 10260 

CTCAAGAAGT GATTGATCGC GATATTGAAA TTAATGATAT GGAAGTCAAA CTTGAGAAAA 10320 

AGAGTTTTGA AATGATTGCG TTACAACAAC CAGTGACAAC AGACTTACGT ATGATTATTA 10380 

CTGTTATGAA AGCTAGTTCG GATTTAGAAA GAATGGCGGA CCATGCCGTT TCAATTGCTA 10440

AATCAACGAT TCGCTTAAAA GGAGAAACAC GTATTCCTGA AaTCGAAAAA GAAATTTCAG 10500 

ATATGTCTGA TTATGTGAAA AAAATGGTGG ATAATGTTTT AATCGCCTAT GTTAAAACGG 10560 

ATCAAAAAGA TGCACGAATG ATTGCACAAA TGGATGATCG TACTAATGAA TATTTCGATA 10620 

GTATTTACAA aCATGCCGTA GAAGCnATGA AaGCTAaTCC aGAAcGGTCA TTAGCGGrAC 10680 

TGrTTATcTA CATGTGGCAC AATACTTGGA AAGAATTGGC GACTATGTCA CAAATATTTG 10740

TGAGTGGATT GTTTACTTGG CAACTGGTAA AATTACCGAA TTGAATAGTA ATCGTGACGA 10800 

ACAATTTTAA GAAAAACAAA TAAAAAAAAC TAGGAGACGG CCTTTTTTAG ACCAATTCCT 10860 

AGTTTTTTTA GCGTAAAATG AACTGATTTT ATAAAACTAG TCCTTACTAA TGAGAAGAAA 10920 

TCGTTCTTAA GTCCTATGAC AAATGATCAA AAACAAGGCA TAATAATGCA TGTAAGGAAA 10980 

ACACTATTGA GTGAAAAATT TGCTAAGGAG GCAATATCCA TGAAAGAAAG AGAACGCGTA 11040 

TTAGAATTAG TGAAAAAAGG TATTCTAACG TCAGAAGAAG CGTTAATTTT ATTAGAAAAT 11100 

ATGGCAACTG AAAAAGATGA AAAACAAATC GAAAAAGCTG CTGAAAAAGT TGATACACAA 11160 

AATATTGGAA CAACAAATAA AGAAGATCAA GTCGCAGATT TAATGAATGC ATTAGAAAAA 11220 

GGCGAATCAG AAGGACCTAC TGTTGATTCG TTTGAAGAAA ATACACAGGA CAGTGCAGAA 11280 

AAAGATCGTG AAAACTTAGA AAGAATTCTT GATGAGTTAG CAACAAAAGC CAATCGTGCT 11340 

TCGGCTGAAT TGGACGAAGT CAACGCAGAA ATTGCCGGCA TCAAAGAAGA AATTAAAGAA 11400

GTCGCAGAAG AAATTGGAAC ATTAGATACA AAAGAAGAAT TAGATGCATT AACAGAAGAT 11460

GAACAAGTTC AACGAAAAGA CTTGCACGTT TTACTTGCAC AATTAGAAGA AAAATTAGCG 11520 

ACTCAAAGTA CTGAAAAAAC AGCACTGGAA GAAGAACTAA AAAACATTCG CAAAGaACAA 11580 

TGGAAgGTCA ATGGAtGtAC AAAAGAAAAA GTTtCTTCTC AATTCTCTGA AGAGTGGAAA 11640 

GATCAAGCCA CAGACACCTT TAACCAAGTC GGCGgCAAAG TTGCCGAAGT TGGTGGTCAA 11700 

GTGGGAGAAT TCTTGAAAAA AACATTTAAT TCTTTCAGTG ATACCATGAA TGATAATGTG 11760 

GAATGGAAAG ACATTAAAAT GAAAGTTCCT GGTGTGGCAA CAACTAAGTT TGAACATGAG 11820 

TTTAACTATC CAAATCCACA AGCAAGTTTA ATTGATGTCA AGGTAGCAAA T 11871 
(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8084 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



AATGGGTAAC AGATTTAAGA TGGAAGATGG TCTTTGAGCA GAGTAAGTGA GCGAACAGTG 60

AACTTACTAC GGGGCGCCCG TTACAGCCAA CGGTATGCGA AGAATCAAGT ACCGTATGAG 120

GTTATTATTG AGAAATAATA GCGAATTAAG GTGGTATCAC GAAATGACAA ACTTTCGTCC 180

TTTTTGCTGT AATAGCAAAA GGATGGAAGT TTTTTTGTTT TTATTAAGCA AATTCGTTTC 240

ATAAAAAGTG CGAAACCTTC AAAATAAGAA AGTAGAACAG AAAGAAGAGG AGAACAAAAT 300

GGCATTAATT GAATTACGTC ATGTAAAAAA AGAGTTTTCC GGTAAAGCGG GTAAAGTCAC 360

GGCTTTAAAA GATATTGATT TAACAGTTGA ATCAGGTGAT ATTTATGGCA TTATTGGcTA 420

TTCAGGTGCA GGAAAAAGTA CGTTGGTTCG CCTATTGAAT GGACTTGAAA CACCTACAGA 480

AGGTGAAGTA GAAATTCAAG GACaAGATAT TGCATTGTTG CCAAATAAAG AACTACGAAA 540

CTTCCGTAAA AAAATTGGGA TGATCTTTCA ACATTTTAAT TTATTATGGT CACGGACGGT 600

CCTAGAAAAT ATCATGCTCC CATTGGAAAT TGCTGGCGTT CCAAAACAAA ATCGGAAAAG 660

CCGCGCAGAA GAGTTAATCA AGTTGGTTGG TTTAGAAGGA CGGGAAACCG CCTACCCAAG 720

TCAACTCTCT GGTGGGCAAA AGCAACGAGT GGGGATTGCC CGTGCCTTAG CTAACAATCC 780

TGACATTTTG CTTTGTGATG AAGCAACGAG TGCTTTGGAT CCGCAGACAA CTGATGAAGT 840

GCTAGAATTA CTACTAAAAA TCAACCAAGA ATTAAATTTA ACGGTTGTAT TAATTACCCA 900

CGAAATGCAC GTGATTCGTA AAATTTGTAA TCGCGTAgcT GTGATGGAAT ATGGTGAAAT 960

TGTTGAAGAA GGTAAAGTGA TCGATATTTT CaAAAAGCCT CAAACAGAAA TTGCGAAACG 1020

CTTTATCCAA CAAGAAGCGG ATAAAAACAT TGAAGAAACG GAACTGGTTG TTGAAGAAAT 1080

GTTGGAACAA TATCCGAATG GAAAAATTGT TCGCCTGCTT TTCCACGGTG AACAAGCGAA 1140

ATTGCCAATT ATCTCGCATA TTGTCCAAGA ATATCAAGTA GAAGTTAGTA TTATTCAAGG 1200

GAACATTCAG CAAACAAAGC AGGGTGCAGT GGGTTCTCTT TATATTCAAT TATTAGGCGA 1260

AGAGCAAAAT ATTCTAGCAG CCATTGAAGG ATTACGTAAA CTTCGTGTAG AAACAGAGGT 1320

GATTGGAAAT GAATAATCGT GGCTTAGCAA GTTTCTTTGA TTTTTCCCAA ATTGATATAA 1380

GTAAACTACA AGAAAAAACG ATTGAAACAC TTTATATGAC AGGGATTTCA ATCATAGTGG 1440

TCGCCATTTT AGGGCTGATT TTAGGACTAC TACTTTTTGA AACGTCAGGC AGAAAAGGGA 1500

TTTTGAGCCG ATTACTGTAT TGGCTAGTGG CCATTTTTGT AAATATTTTC CGTTCGATTC 1560

CGTTTATTAT CTTAATCGTT CTGTTAATTC CTTTTACAAA AATTATTTTA GGGAGTATGT 1620

CTGGTGTCAA AGGCTCCTTA CCAGCATTGA TTATTTCCGC AGCACCTTTT TATGCTCGCA 1680

TGGTCGAAAT CGCTTTTCGT GAAGTCGATA AAGGCGTGAT TGAAGCAGCC AAAGCAATGG 1740

GGGCCAATCG TTTCCAAATA ATTTGGAAAG TTCTATTACC AGAAAGTCTG CCAGCGCTTG 1800

TTTCTGGCTT AACCGTTACA ACCATTTCTT TAGTGGGCTA CACTGCAATG GCAGCTGCTA 1860

TTGGAGCGGG TGGTTTAGGA AGTTTGGCGT ATCAAGATGG CTTCCAACGT GGTCAAAATA 1920 

CAGTTACCTT AGTAGCAACG ATTTGTATCT TAATTATTGT CTTTGCGATT CAATGGCTTG 1980 

GGGATACAGT CGCTAAAAAA ATAGATAAAC GATAAATGGA GGGAATAAGA TGAAAAAACG 2040 

TACATTATGG TCAGTAATTA CTGTAGCAGT AGCTGTCTTA GTTTTAGGGG CTTGCGGCAA 2100 

TAAAAAGAGT GATGACTCGG TCTTGAAAGT TGGAGCTTCA CCAGTTCCAC ATGCAGAGAT 2160

TTTAGAACAT GTAAAACCTT TATTAGAAAA AGAAGGCGTA AAATTAGAAG TGACGACTTA 2220

TACAGATTAC GTGCTACCTA ACAAGGCGTT GGAAAGTGGC GATATCGATG CCAACTATTT 2280

CCAAcATGTG CCGTTCTTTA ATGAAGCGGT TAAAGAAAAT GATTATGACT TTGTGAATGC 2340

AGGTGCGATT CATTTAGAAC CAGTTGGGCT TTACTCGAAA AAATACAAAT CGTTACAAGA 2400

AATTCCTGAT GGTTCAACGA TTTACGTTAG CTCTTCCGTT TCAGATTGGC CACGCGTATT 2460

AACTATCTTA GAAGATGCTG GTTTAATCAC GCTGAAAGAA GGGGTAGACC GGACAACTGC 2520 

TACTTTCGAT GATATTGATA AAAATACTAA AAAGTTGAAA TTCAATCATG AAAGTGATCC 2580 

AGCAATCATG ACCACTCTTT ATGACAATGA AGAAGGGGCT GCGGTTTTAA TTAACTCAAA 2640

CTTTGCCGTG GATCAAGGAT TAAATCCGAA AAAAGATGCG ATTGCCTTAG AAAAAGAAAG 2700 

TTCACCTTAT GCCAATATTA TTGCGGTTCG TAAAGAAGAC GAAAACAACG AAAATGTAAA 2760

AAAATTAGTC AAAGTGTTAC GTAGCAAAGA AGTCCAAGAT TGGATTACGA AAAAATGGAA 2820 

CGGCGCTATT GTTCCAGTCA ATGAATAAAG AGAAAAAGAC AATAGAATAA AAACTTGAGG 2880 

TCGGGACAGG CGTGTTAAGC TTCAAGAAAT AAGAAGAAAG TTCTGAAAGA TGCTTTTTGT 2940

CTCTCAAGGA CGTTCAGCTT ATTTCCGAAG AAGCTGCTCC TGTCCCGTTG TTTATCTAAA 3000 

ATTTTAGGGC TAGGCAGATA CATCATGTTG ATTTACTTGC CAAGTCTAAG TTTTTTTCTT 3060 

TTAAAAAAGG ATTATAAATG ATAAAGTTAC AAGGAATTTC TTAATTATCA GGAAAATTAA 3120 

GGGACTATGC ATAGGCATTT GCATTAGTTA TTGTTATAAT AATTAAGATA GATTTTTTGG 3180 

AATGGAGGGA AAAAGATGTT ACGTTTTTTT CCGCAAAAAA ATAAATATGT AGAAGAAGCA 3240

AGCCAGTACG CCTTTGATAA AGAAGGCCAA ATTCCGCAGC ACATTGCCAT TATTATGGAC 3300 

GGAAATGGCC GGTGGGCACA GAATCGAAGA CTGCCGAGAA TTGCTGGACA TAAAGAAGGC 3360 

ATGGATACGG TCAAAAAAAT AACGAAGCAT GCTAGTCATT TAGGCGTTAA AGTTTTAACA 3420

CTTTATGCAT TTTCAACGGA AAATTGGAAG AGACCAACAG ATGAAGTCAA TTTCTTAATG 3480 

CAATTGCCTG TTGATTTTTT TGATACATTC GTGCCTGAAT TAATTAAGGA AAATGTTAAA 3540

GTCAATGTAA TGGGCTACCA AGAGTTTCTA CCAAGCCATA CGCAAGATGC TGTTAAGCGC 3600 

GCCATTGAGC AAACCAAGGA CAATACGGGC ATGGTTTTGA ACTTTGCTTT GAATTATGGG 3660

GCGCGTGCGG AGcTACTAAC TGCCATGAAA CAGATTGCTG CAGAGGTTTC AGAAAAAGCA 3720 

TATACGGCAG ATGAGATTAC AGAAGAAACG ATCGCAGATC ACTTAATGAC TGGCTTTTTA 3780 
CCCACGGAAt TCGTGATCCA GAATTGTTGA TTCGAACAAg TGGGGAASAG AGAATTAGTA 3840

ATITC.T.TGCI.ATG5CAAATT GCTtATAGCG AGTTGTTTTT TACAAAAScA TawGGCCtGA 3900 

TTTTTCAGGA GACACTTTAG AAACCGCGAT TGCtTCtTTC CAAawTCC-TA ATCGTCGTTT 3960 

TGGTGGTCtA AAAGaAACTm CAGAmACAGA AGGGAGCGAT CCGCAG7GAG ACAACGTGTC 4020 

ATCACAGCTA TCGTCGCATT AATCATCTTT ATTCCAATTA 7CTGGTATC-G TGGTTTTGCA 4080 

ATTGAGATTG CCGCTGTTGC TTTGGCAGTA GTCGGCGTGT ACGAGTTATT CCGTATGAAA 4140 

GGATTAACGT TATTAAGTTT TGAAGGCGTT TTATCAGCAT TAGGGGCCGT TTTTTTAGTT 4200 

ATCCCGACAA TTAAATGGTT AAACTTTTTG CCGGAAATGC AAACAAA.YTT CATTTTATTT 4260 

TATGTCACTG TAATGTTGTT GTTAGGTGCT TCGGTAAT7T CTAAAAATAC ATATACGATT 4320 

GAAGAAGCTG GCTTTCCAGT TATTGTCAGT TTATATGTAG GGATGGGGTT CCAGAACTTT 4380 

ATTAGTGCCC GTTCCGCAGG TTTTGAAGTC TTGTTATTTG CTTTATTCAT TGTCTGGGCA 4440 

ACGGATATTG GTGCATATCT ATTTGGACGC CGATTTGGTC GTCATAAATT GATGCCAGAT 4500 

GTTTCACCCA ACAAGACAAT AGAAGGTGCA CTTGGTGGTA TTTTATCAGC CGTCGTTGTA 4560 

GCAGCGTTGT TTTTAGTTTT CACAGCTAAT AAAGGGCTAT TTCCTTATCC AATGCCGGTG 4620

ATGCTCGTAT TGACGGTTCT GTTTTCTATA GTAGGGCAAT TCGGTGATTt GGTTGAATCT 4680 

TCTATCAAAC GTCATTATGG CGTGAAAGAC TCTGGTAACA TCTTGCCAGG ACATGGTGGT 4740 

ATTtTAGATC GTTtTGATAG TTTATTATTT GTTTTCCCAA TTATGCAC7T ATGTGGATTA 4800 

TTTTAAATTG ATTAACAAAA TTTAAAAAAG TATCGTCAAA TAAGCnAAAA TCCAGAAGAG 4860 

AGAcTCTTCT GGAtTTTTGC yGTTTrTTGG GTTAGTTTAG GTACCT7GTG AGTTATTTAr 4920 

TTTATAAGAG TAATTGTATA AAGATGGCTA AAGTTTCTAA AAAATTAAAT AGGTGATCGG 4980 

ATGTTAAATA TCTGAACATT CCaGTGTTAT TCGATTGTTA AAAATTAAAT TATmACAmTC 5040 

GAATAACACT GGaATccyTa TTTCmATaAA GCCaTTGACG TTTrGCATAG ATAAATTTAT 5100 

ACTTAAAAGA AGAAAAGCAA AATAATGTAT TTTACATGCT TGTTCaATCG TGGTGAGAGT 5160 

TATTCATAAA TTTAAGCAAG TAAAGAGAAG ATAATTTAGA GATTTGTGAA CGCGTTCTTC 5220 

ATACAAAATT ATAGATTAGG ATTATATTTT TGATAATCTA ATGTAAAGGA TTAATTCAAT 5280 

GAAAAAATTT TTAGAGAATA GTATACATCA AGCCATAAAA GCTTGTTTTT TCTTTTTAGG 5340 

TAAACTGCCT AAAAAGAAGT TATTTATTTT TGAAAGTTTT CATGGAAAAC AGTATAGTGA 5400

TAATCCACGT GCAATTTTTG AGTACATAAG AGATAACTGT CCTGAATATC AATGTATTTG 5460 

GGCAGTCAAA AAAGGATATG AGATGCCTTT TGTGGAAGAG AATGTTCCTT TCGTCAAACG 5520 

ATTAAGTTGG AGATGGCTAT GGTTGATGCC AAGAGCTCAG TATTGGGTTT TTAACACACG 5580 

GATGCCAGCC TGGATGTATA AGAATCAAAC AACAATCTAT ATACAGACGT GGCATGGGAC 5640

CCCTCTAAAG AAACTGGGCT TAGATATTGA AACAGTCAAA ATGCCAGAGC ATGAAACAGA 5700 

GAGCTATAAA CTAGAAGTCA TTCAAGAAGC AAAGCGTT3G GATTA7CTCA TCTCACCTAA 57 60 
TGCATACAGT ACAAGAGTGT TTCGACAAGC CTTTCATTAT GAAGGGGAAA TATTAGAAAT 5820

AGGTTATCCA AGAAATGATT TGTTAGTTTG. TGAAAATMG. J3ACAAAATAG CTTTTTCTGT 5880 

TCGCAATCAG CTAGGTCTTG CAGCTAACAA AAAAATTATT CTCTATGCGC CAACCTGGCG 5940

CGATGATGAG TCTATCCGAA AAGGAGCCTA TCGTTTTACC AATCATTTGC CAGTTGAACA 6000 

ACTTTTAGAA ATTGATTCTT CTATTGTAAT ACTTACACGA TTACATTATT TAATTGCAGA 6060 

AAGTTTTGAT ATAAGTGCAT TTGGTTCACG TGTAATCGAT GTTTCTTCTT ATCCAGATAT 6120 

CAATCAACTT TATTTAATCG CTGACTTATT AATTACAGAT TATTCATCGG TTATGTTTGA 6180 

TTTTGcATTA ACAAAAAAAC CGATGCTATT TTTTATGTAT GATAAGGaAA AGTATCAAAA 6240

TAGTACAAGG GGATTCTATT TTGACCCAAC AGCAATTTTG CCTGGGAAAA TTGTGACAGA 6300 

TACGGATAGC CTAGTAACGT CTGTAAAAAA CGTATTGTTT TCTAAAACGG AACAGGAACA 6360 

ACGTTTATAT GTGCAGTTTT TTCGTGAATT TTGCCTTGAA AATCAGGATG CCACTAGACA 6420 

GCTGATTTCA GAAATTTTAA AGAGATAAAA AGCCGGAGGG TGTAGGTTGG AAAAACAAAA 6480 

AGAAGTCCAA AAAGAATATG AAGAGTATCA ATATATGTTA GATACTGAAA CGTGGAAAAA 6540

GGAGTTAAGT CAATTTAAGC TGTCCTCTTC AGaAAAAATA TCAAAAAATC yTAAAGTCTC 6600 

aATCATTATT GCTAATwACc AATAATGCca CCATATCTTA AAAAGATGAT GGATTyTCTT 6660 

GTTCAACAAA CCATTGGGAT TGAGCAATTG CaAGTGATGT TTATTGATGA TTGTTCCACA 6720 

GATCATTCGT TaAAAGTAAT AGAGCCATAT CTTGAAAGAT ATCCGAATAT TGAAATTTAT 6780 

CAGCTGCTTG AAAACACTGG AGGAGCTCAC GGCCCAAGAA ACGTAGGGAT TGTAAATGCA 6840

CGTGGTGAGT ATAGTGTCTT CTTAGATGCA GATGATTGGT ATGATTTAAA TGCATTGAAA 6900

TATTTATCAG ACCTACTAGA TGACTCTCAC GATGATTTTG CTGTTTCTGG kCTAATACAG 6960

AGTACGGACG GCCACTTAAT GTTAAAAAGT AAGCCATATT ATGTGGATGG GTCATTTAAA 7020 

AATCGGTCTA TCCAAGAATT GCCAGCTGAA TTTTATGGTT GGTTAGGACC GCAAGCTATC 7080 

ATGTTAAGAA CTTCACTGAT ACACAATAAT AATTTACACT TTGTnAATCA ACGGGTGGCT 7140 

GGATGATGTG CTGTTCTTTA TGAAGCCAGC GATTTTCAAA ACCATTACAC AAGGGGAAAG 7200 

GkTaACGACT TATTTAAATC GAGATGCAGA TaATGAAAGT TTAAGCmAAT CAtTAATCGG 7260

ACATTaTGCT TyCTgGTTGC GTGCGTTAAG TTATaTCAAT CaACAGTTyC CAGATGATTk 7320

GTCtAAAGrA CgGATGTTAG CACGCCGTTT AGAATGGCTA ATTTATGACT TTTGTATTCG 7380

GAGGAATACT GGGTATCCAT TCAGTAAAAG GCGATTACAA GATTTTAAAG TACAGATGGA 7440

TCAGTATTTA GGTGAATTAA ATTTTGATCC TAGCCCGTAT TTTAGAAGTG ATGTGCGCCA 7500

AATCATTTGG ACATTTTTAC AAACTAACGA TATTGATGGT TTGTATTGGT TTATTCAATG 7560

TCAGTCCATT CGTTGGGTGT TAGTCAATCG CTTTGGTTAT GTTTCAAAAA CAGAGAGAAG 7620

CTATTATTAT CCCAGATTAT TGAAACGATT CCCAGCAGTT CGCATGAATG CTTATGCAGA 7680

AGCAACGCAA CGTGAAGGCA ATACGCTTTC TTTAAATGTG TATACACACC AAAAAATTAC 7740
AGGATTTGAA ATAAAAGAGC TGAAAAATCC TTTTGAAACA AGACAGAGGC TGGCGTTTAA 7800

AAAAATTAGT GAAACTAGTT ATCAGGTAGA GTTGCCAGAA AAATTTTCaA ATAAAGAAAG 7860

TAGATTTACG ATAGTTTTTG ATAATTATTT AGAAATTGGG GTTAAAGATT TTAGTAAGTT 7920

TTttGCTTAA ATTTtGTCCG TTtAAATCAG GATATTTTGA GTATACmACy AwAAAtGATt 7980

AGTGrGGGrA ATTkGTTTak GGAAAATGTG AAAGTGTCAG TCATTGTACC CGTCTATAAC 8040

GTAGAAACCT ATTTAGAAGA GGCATTAATG AGTTTGAAAn ACCA 8084
(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12438 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



TTnCnCCCAA AATGGwTTGk CCGrAGtGCy TAmCTGGaGG AGrATGCvTC CTTtCvTATC 60

CGGAATTTTA TTtGGTCCAc CATGGACCTA TCGTGCTTCa CGTCGGGTGA CGGATAATTT 120

aGCgACTGGG GAAGAGGAAA AAAATAGGAA ACAGCACTGT CATAAGCATA GAGGAAGAAA 180

AGGCCATGCT GCCTATGCAG AACATCTTTA TCAAATAATT GATCATTCGA AATTAACCAT 240

TGAAACGCCA AAAGAATTCA CATGGTTCAT GCAGACGGCG ATGCATGCCG TTTTCTCAAC 300

AATTGTTGAT GGGTATCGTC AACAAAGAGA AAATACTGCC TATGATTCAA CTGAAGCAGT 360

CAAACAATTA AAAATGAAAT TAAGTTGGTT GAAAAATGGC GCTTATAAGT AGCAAAAAGG 420

AGGAGAAGTT AGTTGATTAA ACTAGTCAAA CGGATGTCAT TATGGTCGGT CTTTGCAGCG 480

GTTGTATTTA TGATCGTACA AATTATCGCA GATTTATATC TGCCCACATT AACATCTAAT 540

ATCATTAATG ACGGTGTCGC TAAAGGAGAT ATTAACTATA TTTGGAGTAC AGGAGTTGTC 600

ATGTTAGGCT TTTCTTTGAT TAGCATTGTC GCTTCTATTG GAAATACCTT TTTTGCCACC 660

AAAGAATCAC AAAGTTTAGG AAAAAAATTA CGGACTGATA TTTATCGAAA AGTTGAAAAT 720

TTTTCTAATA ATGAATTTGA TAAATTTGGG ACAGCTTCGC TAATTACACG AACAACCAAT 780

GATGTCAATC AGATTCAAAT GGTTATGCAA ATGTTTTTAC GTTTAATGAT TAACGCACCT 840

TTAACGTTAA TTGGTGCAAG TTTTATGGCG TATAACAAGG ACCCACAATT AACGAAAATT 900

TTTCTATATG TTTTACCAAT TATGGCTGTT TTAGTGGGAG GTATCATGTT TTTAGCGGTT 960

CCGTTATTTA AAAGCATGCA AAAGAAAACA GACCGTTTAA ATTTAGTTTT TCGTGAAGGA 1020

CTGACAGGAG TTCGTGTGAT TCGGGCCTTT GGCAAAGCAA ACTATGAAGA ACAACGTTTT 1080

GATGAAGCCA ATAAGGATTA TACACAAACG GCGATTAAAG TGAATACGAT TGTCGCTTTG 1140

ATGATTCCTT TAATGACACT CATTATGAGT GGTACCAATA TTGCAATCAC CTGGTTTGGC 1200

GGTCATTATA TTGCGGAAAT GCAACTAGAG GTAGGTAACT TAATTGCCTT TATGACTTAT 1260

GCCATGCAAA TTCTGATGAG TTTCATGATG TTATCCATGA TTTTTGTCAT GGTACCGCGG 1320

GCGCAAGCTT CTGCTGATCG GATTAATGAG GTGCTAAATA CTGATTCAGA AATCAAAGAT 1380

GTACCCAATC CAGAACTACT CTCTCTAAAA GGCGACAAAG CAACATTGGC GTTTGAACAT 1440

GTTAATTATC GTTACCAGCA TGCTGAAAAC TTAGCACTGG AAGATATAGA TTTTTCAGCA 1500

AAATCTGGGG AGACGGTCGC GATTATTGGT GGGACTGGTT CTGGGAAAAC AAGGCTGGTT 1560

AATCTTTTAC CACGCTTTTA TGATGTTGAA TCTGGCAAGA TTTTGCTAAA TGGTAAAAAC 1620

ATTAAAGATA CGTCACAGCA TAATTTACGA GAAATGATTG GCTTTGTACC GCAAAAGGCT 1680

GTTTTATTTA CAGGAACGAT TCGTGAAAAT ATGCAATACG GCGCCCCAAA TGCGACCGAT 1740

GAAGAAATTT GGCAAGCCTT AGAAATTGCG CAGGCGAAGG CGTTTGTTTC AGAATTGGCA 1800

GAAGGTTTGG ATAGTCATGT CGAACAAGGC GGCGGTAATT TCTCTGGCGG ACAACGCCAG 1860

CGGTTGGCTA TTGCTCGTGC TTTAGTAAAA CCAGCAGATG TCTATGTTTT TGATGATTCC 1920

TTTTCTGCAT TGGATTTCAA AACAGATGCT AATTTAAGAA AAGCCTTAAA AGAGCAGATG 1980

ACGGATGCAA TTGTCGTTTT AGTAGCCCAA CGTGTAAGTA CGGTTATGGA AGCATCGACA 2040

ATTTTAGTAT TAGACGAAGG GAAATTAGTT GGTAAAGGAA CGCATGAAGA GTTATTAGCG 2100

AATAACCAAA CGTATCAAGA AATTGTACAT TCTCAATTGA GAGAGGAGGA CCTTGCATGA 2160

GTAGTCGACC AAAAGGCGGA CCAATGGGCG GTGGACCAGG ACGCAACATT GGCGCAAAAG 2220

GGCCAAAACC AAAAAATTTC TGGAAAACAG TGAAACGTTT ATTTCGATAC ATGTCTAAAC 2280

GAATGCTGTC AATTATTGCT GTCTTAGTAT TAGCAATTGC AGCCGTCGTT TTCCAAATTC 2340

AAACACCAAA AGTATTAGGA CAGGCAACAA CTGAAATTTT TAAAGGCGTC ATGAAAGGCG 2400

CTGCTGAAAT GAAACAAGGC TTAAAAATAA CATCTTTTCC AATTGATTTT GACAAAATTG 2460

GTCAAATTTT ATTAATTGTT ATTGCCATGT ACCTTATTTC TGCTGTCTTT AATTTCTTGC 2520

AGCAAGTAAT TATGACACGT GTTTCACAGC GAACAGTTTA TGAATTACGT CAAGAATTAG 2580

AAGCGAAAAT GAACAAAGTT CCAATTTCCT ATTATGATAC TCATAGTAAT GGGGACATTA 2640

TGTCACGGGC AATTAACGAC ATGGATAATA TTGCGAGTAC TTTACAGCAA AACTTAACCC 2700

AATTAATTAC CAGTATCGTG ACATTTGTGG GTGTACTTTG GATGATGCTG ACCATTAGCT 2760

GGCAGTTAAC ACTAATTGCC TTAGCAACAG TGCCATTAAG TTTAATTGTT GTGATGGTTG 2820

TGGCGCCTCG CTCGCAAAAA CATTTTGCGG CTCAACAAAA AAGCTTAGGA TTATTGAATA 2880

ACCAAGTCGA AGAAACtACG GTGGTCACGT AGTAGTAAAA AGTTTCAACC ACGAAGAAAG 2940

TGATCAAGAA GTATTTGAAA AAGAAAATGA AAAATTATAT CATGCTGGTC GCAAAGCACA 3000

ATTTATTTCA GCGATCATTA TGCCTTTAAT GAACTTTATC AAAAATCTAG GCTACGTGTT 3060

TGTTGCAGTC CTTGGTGGCG TAAAAGTAGC GAATGGTATG ATGGATTTAG GGGATGTCCA 3120

AGCATTTCTT CAATATACCA ATCAATTTTC ACAACCGATT ACTCAAATCG CTAATTTAAT 3180

GAATACAATC CAAGCTACGG TGGCTTCGGC AGAACGTGTC TTTGAAGTAT TAGATGAAGA 3240

AGAAATGGTG GATGAACCTT CTGGCATACC AGTGGAAACA GATAGTCCTT ATCGTGTTTC 3300

TTTTGAACAT GTTGCTTTTG GCTATTCACC AGAGAAATTA TTAATGAAAG ATTTCAATTT 3360

AAATGTTAAA CCTGGGGAAA TGGTCGCAAT CGTGGGCCCA ACAGGTGCTG GGAAAACAAC 3420

CCTAATTAAC TTATTGGAAC GCTTCTATGA TATTAGTAGC GGCAGCATCA AATATGATGG 3480

CGTAGATACG CGCGATTTAT CTCGCGAAGA GTTGCGAGCA CACTTTTCAA TGGTTCTTCA 3540

AGATACTTGG TTGTTCACTG GAAGTATCTA TGACAATATT CATTATGGTA ATGAGCAAGC 3600

GAGCGAAGAA GAAGTGATCC GTGCGGCGAA AGCAGCCCAT GTGGATGATT TTGTCAGAAA 3660

ATTACCAGAA GGCTATCAAA CGATTCTAAA TGAAGAAGCC AGCAATATTT CTCAAGGTCA 3720

ACGACAATTA ATTACAATTG CTCGAGCATT CTTAGCAAAT CCAGACGTTT TaATTTTGGA 3780

TGAAGCCACC TCAAGTGTGG ACACTCGAAC AGAAATACTG ATTCAAGCAG CAATGAATCG 3840

TTTATTGGAA AATCGGACCA GCTTTGTAGT CGCTCATCGC TTGTCGACTA TCCGTGATGC 3900

CGATACGATT ATCGTTATGG CAGAAGGTTC AATTGTAGAA ACAGGAACCC ACGATGAATT 3960

AATGGCGAAA AACGGTTTTT ATGCTGACTT ATATAACAGT CAGTTTTCAG AAGAAGTAGC 4020

CTAAAAGGCG AAgcAAAAAA ATAGTAATTC ATGATGrGGC TTCAGGAACC ATAGTCGTAA 4080

AGACTGGTTC CTGAAGCCTT ATTTTTTTAA CAAAAAGTTA GCGAAATTGA AAAATTTCTA 4140

TTTTCTTTTG AGAAAAACAG GATATTACTT GATGAAATAG CAAGAGATGT ATAAAATGGG 4200

AGAGAAGAAA ATGGAGGTAA CAGAAAAAAT GACTGAATTT CAATTAGAAA AAGCTGCGAC 4260

ATTAAAAACA tTGCCATTCG TGCGAAAAAA GTTGGCGAAA TGGTCGACtT TTATAAAAAA 4320

GTAaTGGGTT TTGTTTTAAA GTCAGAAGAA AATAACCTGT CTATCTGGGG TACACGTGAA 4380

GCAGGAACGC AATTATTAAT TTTAGAAGAA ACTAGAAAAG CCGAAGATTT TCATAATGAA 4440

GAAAAGCAAA TGGCTTATTT CTCAATTAAA GTTCCTACGG AAAAAGAATT TTTACAAATT 4500

GCTCAACGAG TGCTTGAGCA AGACTACCCA ATTGATGAAA GTTTCCAAAT TGGCACACGA 4560

CAAAGCATGT TCATTACTGA TCTTGAAGGT AATCAATTTG AGATTTTTCA CGATGAAGCA 4620

ACAGCAAATT CGACTTCTGA AAAGCAACCA ATTGTGCTTA AAGATCTTAT CAGTGAGGAC 4680

TTAGAGCCAC ATCAAGGGTT AGCCGCTGGG TCATATTTAG CGCATGTGCA ATTAAAAACA 4740

AACAATCAAA AAGAAATAAA AGCATACTAT GAGGAAGTTC TGGGTTTAAA ACGCAATGAA 4800

AAAGATCAAT TTGTTTTAGA AGATGGCAAA GCAACTATTG GATTCCAAAA ACCTGAAACA 4860

AGTGAAGTGG ATCAATTACC TGATCCACAC TTAGGTTTAG ATTTCTTTAC CATTAAGCTC 4920


TCTGATCAAG 


AAPATATTTT 




PAAPAATTAA 


mCZC A A A A A A 


TP A AfZAPTTP 




TTTATTGACC AGAAAAAAGC GATTGTCACT GTGTTTGATC CAATTGGCTT GGAATGGTGG 5040

TTTGTCTTAA AATAAACGCT TAGACAGACA AAATAAGCTT GTTTTAGCAA GTCTGGAATA 5100

AAGTCGGAAT AAAAGTTTAA CGCTACTTTT GTTCGGAACT TATTCTAGAC TTTTCCTTTA 5160

ATATTAAAAA TAGAAGGTGA AAAAATGTAC GAAAAAACAA AGTGGAAAAT TCAAAAAATG 5220

ATGAAAAAAG GTGTTTTTTC GGGGGCTAGT TTTTGTTTCA TTGAAGGTGA AAAAGAGGAG 5280

AGCAATTGTT GGGGACTGGC ACAAGTAAAA CCCACCAAAG AACAATTAAC CTCGGCCATG 5340

CTGTTTGATG TGGCTTCTTT AACCAAAGTA GTCGGCACCA CCACCGTTAT TTTGCAATTA 5400

GTGGAAGAAG GAAAAATTAT TCTAGATCAA TCATTACAAA CTTATTATCC CTCTTTTCAA 5460

GATTCTAATA TTACAATCCG GCATTTGTTG ACACATACGG CAGATTTACA AGGCTATATT 5520

CCTAATCGTG ATCAGCTGAA TGCCCAAGAA TTAAAAGACG CCTACAATCA TTCATTTCAC 5580

GCAGGCAAAG CAATAGGTAA AAAAGTCGTC TATACAGATG CAGGGACTAT TTTATTAGGC 5640

TTTATGTTAG AAGAAATGTT CCAACAATCA ATGATAGAAA TTTTATCTGA GCGAGTATTG 5700

CTTCCTTTAG GGATGAACGA AAGTACGTTT CTACCAAAGa ATCCTTTAAA CTGTGTGCCA 5760

ACAGAATTAC ATGAACAAAG AGGGTTGATT CGAGGTGCAA CCCATGATCC TAAGGCATTC 5820

GTTTTACGGG AACATGCAGG GAATGCTGGC TTGTTTAGCA ATGTCTATGA TTTAACAAAA 5880

TTTGTTCGAA TGTATTTAAA TCGAGGCAGT TATCACAATC ACCAATTCTT GAAAAAAGAG 5940

ACAATTGATT TGTTACTAGT TAACCAAGTG CCAGCTGCAG ATAAACCACG TTCTCTAGGC 6000

TGGGATTTTA AATATGATGT AGcTACGCAA CGGCCATTAT TATTCCATAC AGGCTATACA 6060

GGCACTTTTT TGCTAATTGA TGTGCAACAA CAATCAGCGC TTATCTTTTT ATCTAATCGA 6120

GTCCATCCAG AGGACCACCG GAACACTTAT ATCGAAGAAA GAGATCAGTT GTTGGCAACC 6180

TATTTAAAAG AGAAATCATC AGTTTCAGAT GAAATGACTA GTTTTTAGTG ATATAATGAG 6240

AGCGAATTCA ATTTGGGAGG TCACAGTCAA TGCAAGAACC ATTATTTTTA CAACCTGTTT 6300

TTCAAGAAAA AATTTGGGGC GGCAATCGTC TACACACGCT ATTTGGTTTC GATTTACCGA 6360

GCGATAAAAT TGGTGAAGAT TGGGCAATCA GTGCACATCC ACATGGCGTT AGTACTGTTT 6420

TAAATGGCGA ATTTAAAGGA AAAAAATTAG ATGAATTGTG GGCAGACCAT CAAGAGTTAT 6480

TCGGCCATGC AGGTGGCGCT GTCTTTCCAT TATTAACGAA AATTTTAGAT GCTGAAGATG 6540

ATTTATCCGT GCAAGTTCAT CCTGATGATG CATATGGCCA AAAACATGAA GGCGAGCTAG 6600

GCAAAACTGA ATGTTGGTAC ATTATTGATG CAGAACCTGG CGCTGAAATC ATTTATGGAC 6660

ATCACGCGAA AACAAAAGAA GAACTTGCAG AAATGATTGA AGAAGGTCGT TGGGACAATC 6720

TTTTAAGAAA AGTACCAGTG AAAAAAGGCG ATTTTTATTA TGTACCAAGC GGCACTATTC 6780

ATGCCATCGG CTCGGGTATT TTAATTTTAG AAACGCAACA AAGCAGTGAT ACTACGTATC 6840

GGGTTTATGA TTATGATCGA ACAGATGATC AAGGAAAGAC CCGGGAATTG CATATTCAAC 6900

AATCCATTGA TGTGACAACC GTTCCGGCGA AGGATCCTGA CCTCTCTATT CAACAAGAAA 6960

ATCAAGGACA ATCAAGCATT GTTACTTATG TAAAAACAGA TTTCTTTAAT GTCTATGAGT 7020

GGCGTGTCAG TGGGATTCTA AAAGTCAAAA AACAAGCACC TTATACATTA GCAACAGTTA 7080

TTGAAGGTGT GGGTCGATTG ATTACAGAAG ATgCTgCAAA AGCAGATGTG GCTACTTTTG 7140

ATTTGAAAAA aGGCGATAgC TTTATCTTTC CGACAGATAT TCCGAGCTGG CGTTTTGAAG 7200

GTGATCTAAC GATTATAGCT TCAGAACCTG GCAGTCGCTA AAAAAAGAGG AGATCATCAA 7260

CAGATGATTT CCTCTTTTTT AGCATTAAAA CAGTTTATTT TGTTATAAAG GAACGTTTAG 7320

GTAAGGCAGC GATAACAGCT GTGACGATGA TGACTGATAG CAGACGATCC CCGTAATCTG 7380

TTATCAATTG AACCAACAGG ACGCTTGCTA ACTGATTGAA GCCTAAACCA TATAAAAATT 7440

GGACAATTAT GCTGGAGCCA GCAGAAGTAA TTCCACCAAA TAAACAAACT GTTATTAAGG 7500

AAGAAATCAC TGTTCCTGGT ATCGTTAACC ACAACGTTTT AAATGGTAAT TTGGTTACTT 7560

TTAACTGCTG ATGAAGGAGC AAACCTGCTA AAATTCCTGT GCAGAGTTGG ACAGGTGAAT 7620

AATACAGAGA AAAGATATCC GTTGTCAATC CAGTTAGTAG ACTTGTTAAA ATACCTGTCA 7680

GTGCGCCAAA GAGTGGGCCT AATAACGCAC CTGTAAAAAt AGTTCCCGCA CTATCAAGAT 7740

AAATTGGTAA ACGAAGGAGT AACGCAATGG TACTGCCAAG GTAATTTAAA GCAACACTAA 7800

AAGCGATAAT TGTAATGATT CGTGGTGTTA TTTTTTTCAT GGTTATCCCA TCCTTAAATT 7860

TTTGAAATCT CTTTCGATCT CATCTATTTG TACATTTAAA AGAATGGATA GAAACTTCAC 7920

GAAAAACTGT TGACTATCAA CCGTAGTCAA GACATGACAA TTAGCTGGTT GCTGCTGGAA 7980

ATGATATTGA TCGATCATTG ATTGCCCCAT AGCAATGCCG TCAGGAACAA TTTGTCCGTA 8040

ACTGGAAAAG CCTTGGCAGA GCGTAGGATC AATAAAATAA GCAACCGCTA AGGGATCGTT 8100

GATAACACAC CCTAAAATAC GTTCATATTT CCAATGAAAA TCAAAATAAA ATTGAGTAAT 8160

CTCAGCGAGG TAATTCCCCA TTTTGGGGTT GAGTCGTTGA CAATAATTGA GAATTGTTGG 8220

TGTAAAAACA ATTTCTCGTG TGACATCTAA GCCGACCATC TCGATTTTTG CACCTAATTT 8280

CTCAAAAACC CAAGCTGCTG CATCAGGATC ACACCAATAA TTAAATTCTG CAACAGGAGA 8340

GCAATTGCCG TGGCTTTTAA AGGTACCACC CATTGAGACA AAACGTTGAA GATTTCGCCC 8400

CAGAAGAGGT TCTTTTTTTA AACCCAAAGC AATATTTGTT AAAGGACCAA GGGCAATGAT 8460

TGAGCAATCG GTAGCTTGTT GAAAGGTTTG TGCTAAATAG TCAGCAGCGG AAAGCGCTTG 8520

TGCTTGGATT TTGCTAGTTC TGGGGAAATA TGTTTCACCT AAACCATCCA TGCCATGTGT 8580

ATCTTGTGCA CTAACAAATG GCCGTTTTAA AGGGCTAGCA GCCCCTTGGT AGACAGGAAT 8640

ATCTAAGCGA TTAACTTTTT CCAAACATTT AAAAACATTT TCCA^^CCAA TCTCCACAGG 8700 

GACATTTCCA CAAACAACGG TAATCGCTTG GACATCTAGC TCTGGTGAAG CAAGTGCATA 8760

CAAGAGAGCA AGCGTGTCAT CAATTCCAGG ATCACAATCA ATAATTACTT TTTGCATTTT 8820 

TTCCAATCCT CCTTTTTCAT TAACTGCAAA TGATAACTCT TGTTAAAAAA GCTATGACTT 8880 

GAGTTAGTAT TGAGAGATAC TTAGTTTTTT ATTCTGGGAA GTTTCCGAAC CAGTCCGAGA 8940

AATTCTCGAT TTTAAATGCA CCTCTTTATC TTACTATATT TTGTAAAGAT GTAAATAGTT 9000 

GTCGGATAAA TGAAAAAGAA TCTAGCCCTA TATGTAGGAA CTAGATTCTT CTTTTATTTA 9060 

TAAAGCGTTT CAGGAATAGC GATCGGTTCA CCATATTTTT TCTCTAAAGC GGTCCGAATT 9120 

AACCGAATCC CTAGATTTTT GTTGCGTGCT AAGATTGGAC CGTGGAAGTA GGAGCCAAAC 9180 

ACATTTTTAT AGACAACGCC TTCGCCATTA TCTTGACCGT TATTGCCTTT GCCTTGTTTG 9240
ACTGTTCCTA AAGGTCGTTC CCCTTCGCCT AAAAAAGTCA TGCCGTTATG ATTCTCAAAA 9300

CCATAATACG TTTCATCAAA TTCTTCATTA TAAATTTCAA TATCACCAAT GAAACGATTG 9360

TTGTCTTGGC TCAAAGTGTA ATGATCtAAT GCACCGATTC CTTTAATTTT TTCGCCTTGT 9420

GCGCCGACAT AGTAATGTCC TAGAAGTTGA TAACCACCAC AGATAGCTAA CAAGACGCCG 9480

TCATTTTCAA TATAAGAAGT CAAAGCTTCT TTTTTATTCT GAATGTCTTC TGAGATGATC 9540

AGTTGCTCGA AGTCTTGACC GCCACCAACA AAGACTAAAT CATATTTTTC GGGATCAAAC 9600

GGTTCGTAGA TACTAATAAT TTCTGATCGG AAGGTAACGC CCATTTTTTC GGTTAAATAT 9660

TTAAGCATCA ATAAATTTCC GTTATCTCCG TAAGTATTTA ATAAGTTGCC ATATAAATGA 9720

CAAACGACTA ATTCTTTAGA CACCATTCAT TCCCCCTTGA ATATACCCTT GGGCTGTTAA 9780

TGATTTTCTT AATTGCAAAA CAGCAGTATA GGTTGCCAAA ATATAGACAT GATCTGTCGG 9840

CAACGTTTTA ATTTCTGAAA TTACTTGCTC TAAATCAGCA ATTTCTAGTA ATTTTTCTTC 9900

AGGAATACCC GCAACTTTTA AACGCAAGGC CATATCTTCA TGGCGATCAC CGCCAGCAAT 9960

TACTTTTGGA ATGTCCATAT CAGCAAAGGC TTCATGATTA CCATCCCAAA TCCAACTGAC 10020

ATCAATTCCA TCTGCATAAT TAGCATTTAA TAATGAAACC AGCGAGAAAG AGTAAGGGGC 10080

TAAGCCCATC ATATCAATGA CCTGATTTAA ACCAACGGGA TTTTTTACTA AAACTAACGT 10140

ACATTCTTTA TCGCCAACTT TAATGGTTTC TTGACGCCCA AAAACTTTTT CATCATAAGC 10200

TAAGCCTGCT CGGATTTTAT CTGGTGCCAC CTGATAATAT TCTGCTACCG CTGTGGCCGC 10260

TAGCGCATTA TAAACATTAT ACATCCCACC AACAGCAATG CCGTATTCTT CGCCATCAAT 10320

CACAAAATCA GCAGATGTAT TATCCATCCG GACCATTTCC GTTAACTGCA CATCCAATTC 10380

AGGGCGTTTG AAGCCACAAT TTGGGCAATA GTAATCACCT AAATTAGCAT ACGTAATCAT 10440

TTTATAATGA AGAATATGAT TACATTTTGG ACATAAAACC CCATCTGTAT TGTAATGAGC 10500

CAATTGCTCT TTCGGTGGTA AATGATTAAA TCCATAGTAT TTGCGAGGAT TTACAGTTTC 10560

TTTTGAATTA AAAATTGGCG AATCACCATT GCATAAAATA GTTGCTTCAG GAGCAGCAGC 10620

GGCGCCTTCC ATAATTAAGC GATACGTCGT ATAAATTTCA CCATAGCGAT CCATTTGGTC 10680

GCGGAAAATA TTCGTAAAAA CAAATAATTT TGGTTGAATA TATTTAGTGA CACGGCTTAA 10740

ACTTGCTTCA TCAATTTCTA
TAAAAAAGTA GAGACAATTC
ATCAAATTCT TGGCGTAAAA
TCCAGTGACT ACGACAATTT
TTTTAGGGCT AATTTCCCTG
TTGCGAaGTT TTTCCTGCAG
AATTTCTATA TCTTTAATTT
TAAAATAAAA AATTAATTCC

ACACCGCAAA TTTCTTTTGT CCTTTTTTCG CCTTGGCATT 10800 

CTTGCTCCAT GTTAGCACCA GTTGGATTTG TTAATACCTC 108 60 

TGTTAACAGT TAAAGCAGTT GTCAATGTTT TGCCGTTCGT 10920 

CATAGTCTTT TGCCAATGTA TCTAAAATTT TAG GAT CG AT 10980 

GGTAGCTACT GCCGCCTTTA AAAAATGTTT GTAAAACCCA 11040 

CAATTGCTAA ATGACTTCGA ATCCCCaTTT GTTGTCCCCC 11100 

TTCGTTATTT TAAATTAAAC TCAATCTATC ATACCATAAC 11160 

tTTTACTTTT TCTTACTATT ATTTAAAAAG TCTTTTAGAA 11220 
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AAAGTAGAAT AGTGAAGGAA AAAAGAATAG GGGCTGGGAC AAAAATCACT TTGGATTTTT - 11280 

GTCccAkGyw TAAAAAAcTG aTaAACGGCG GGAACAGAAG CAACACTCCG CGTTGCTTCG 11340 

ATCTCAACAA CTTCTAAGGT GTTTTCTCGA CATGCGTCTC GTCACAGTAC AGCCTTCAAA 11400

AGAGCTACAG CTCTAAGAGT TGCGATGTTT CACTGTGrAA CATGAACAGA TGAAATCATC 11460

GTACCTACTT GGTGAAGGTC CTTCGGAAAT AAGCCGAAAT TGTTGAGAAT CACGAAGAAC 11520 

GTAGTGrTTC GATGATGAAC AATACCTACT TGGTCTCCAA AAATTAAAGA ACACCGGGTC 11580 

CTTCAATTCT AGCATGTTCC ATGCTTAGAA ATTGATGCGA ACGAAACGGA GTGTAGTGTA 11640 

TCATGCAACA TCAACTCGTA CCTCGTTGTG TTTTATGACA ATTTTCGGAA ATTCCTTCTT 11700 

ATTGCTGAGA AACACTATTT GCAAAGCAAA TAGATGTTGA ACAGTACCTA CGTGGTTCTT 11760

GGAGTTAAAC ACTTCTGTCC CGACCTCCAT CGATTTCCAT TTTTATTTCG CATCAATATA 11820 

ACTGTATAGC GTATATTaGA AATATTGAAG AaGTTAGAAA CTTTGTCCCC AGATTTAGTG 11880 

ACTAAGAAGG CGCCTTTATT GTTTAAAAAT TGAATACACT TGATTTTATC GTCCyTCGTC 11940 

ATTAATGGGA CCGGCTTACC AACTAATTTG ACTGATTCAT GAATTAAATC GTCTAACAAC 12000 

TCATTAACAT CCTGAGGAAT GTAATCAGGG TCTTTATCTt CTTCAGGTTC TGCAGTGGAA 12060 

ATCAAAGATT TTAAGTTTGA CTCTGCAGCA ATCAATGTAG TGATGTCGTA GTTAATAGCA 12120 

AAAATACCAT CGAGTTGACC ATTCTCATCT TTAAAATACA TTGTACTAGA TTTTAAAATA 12180 

CGACCATCaT GGGTACGGGT TAGATAGTTA ATATGATCAT TCAACTCACT AGGGTCTTTT 12240 

TTTAAGGCTT CTAAGACTAC TTGAGAAGGT CCATCGCCCA GATGACGAGA AGAGACATGA 12300 

CaTTTTCAAT GGAAACAATA GAGTTGTTAA TATTGTCTTC ATTGATTTGG TGGATAACGA 12360 

tTCGCAGTTT TCCCCAAATT GACCAGCTAA ACCTTTAGCA ATTTGTGTTA AAAAAGTAAG 12420 

TTTTTCGTCG GATAACAT 12438
(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

GATACATCTC ATCCCCTAAC AATGAACAGC CAATTGCTTC AAAATGTACC CGAATTTGAT 60 

GTGTTCGTCC TGTATGGAGC TGAATGCGCA CAGCTGCTGC TwGGGCTCCT CTTTTTGCGA 120 

GCCAATATTC TGTACTAGCA AATTGACCTG TTTCGATAAC TTGGCGTTTT AATAAAGAGG 180 

ACAAGTCGCG CCCAATTGGC TTTTCAATCA ATTGATGCTC GCTTAACTGC TCAATCGCTC 240

CACCAACAAG GGCATAATAA ATTTTTGTCA CCTTTTTTTC TCGTAATTCT TGGTCAATTA 300 

AAGCATGAGC AAAGCCGTGT TTCGCAAATA ACATCAGCCC AGAAGTATCT CGATCTAAAC 360 




GAGTCACGAC ATGAATAACT TGATTCACGT AATTTTGACG CTTGTAATAG CCCTTAACAC 420

GATTCGCCAT CGTTCCTGAA GGATGATACT GAGCAGGTAT CGAGGCAATG CCAGCAGGCT 480

TATTAACTAC TAAAAAATGA TCATCTTCAA ACACAATTTC AATCGGTTCA TCATCTGTyA 540

ATAATGTCTC ATGTTCCGCT TCATCTGGAA TCGTCATTTT AACGACATCG CCAGTTTGTA 600

AAGGATAGAT GGCATTTTGA ACGGTCCCAT TCACTTGAAT TTTACCACCC TGAAATTTCA 660

CCTTTGCTAA TAACCCTTTT GAAACACCTT GTTCCTTTAA AAAATATTTC ACTTGTTGTG 720

TTGTTTCTTT GTCATATGTC CATTGAAATT CCATCGTTTA ATTAATCCTC ACCAATAAAG 780

GCATCTTTCA CTCGATGCCA AAAATGCATG TGCCGATAAG AGGCAAAATG AATCCGTTCG 840

TCTGCAATTC GATAGCAGAC AGAAGCGATA TTTTCTTGAT AAATATCTAA TTGGTCCACC 900

GTGACAAAAT AATCATCACT TTCCTGTAAT TTAATCTCCA GCCACTCCGT ATGTGCGATT 960

ACAATCGGTG ACCCCAATGT TCGAAAGACA CGATTGTTTA AGGAGGCGAT TTCGGTTAAC 1020

TGAAACGCAT TAATGCTAGG ATGCAAAACA GCCCCACCAA TACTTTTATT ATACGCAGTA 1080

GAGCCGGTTG GGGTCGAAAT CGACAACCCA TCACCACGAA ATCGTTCAAA CAGTTCATCT 1140

TTAATAAACA CATCGCCCAC CATCGTCCGG TTTCCTCGTT TAATTGTTGA TTCATTTAAT 1200

GCTAAAAAAT GTTTATCTGG TTTCCCATCT CTAAACCGAA TCCGCACATC TAATAATGGA 1260

TAACTCGTAC TTTTTTCACG ATGAATACAC AAACTCTCAA CTAGTTCTTT TAATTCATAA 1320

TCACGCCAAT CTGTATAGAA ACCCAAGTGT CCGGTATGTA CACCTAAGAA ACTTACTTCG 1380

TTTAACAGAT GATTAAATCG ATGAAAAGCG GAAAGAAGCG TACCATCTCC GCCGACTGAA 1440

ATCACAAGTT CAGGTTGACG ATTGTCTATT TGAATTTGGT TTTGCTCTAA TAGTGTTGTC 1500

AGTTGTTTTG TGACTTGTTT CGATTTTTCT TCCGAGTTAT GAACAATCGC CACTTTCATA 1560

AGCAGACTCC TTTACTGATT TTCTGGTGTT TCGTTTTTAA TTTGCTTATA GTATTCGTTT 1620

TCTAGCAATC CACGGCCATG AGAGAAGTAA TGTTGTGCTT CTTGAATTTC TTCACGAATA 1680

GATGACATTT CTTCATCTAA TAAATAGGCT GCCTCTGCCG CTCGCTGAAG GCGTTCTTTC 1740

ATTTCTTCAG GAAAGACACC TTGATATTTA TAATTCAAAG AATGTTCAAT CGTTGCCCAA 1800

AAATTCATGG AGAGTGTCCG AATTTGGATT TCGGCCAAAA TAATCTTTTC CCCGGTGACT 1860

AATTGCACAG GATATTCCAC TACCACATGG TACGAACGAT AGCCACTTTC TTTTTTATTT 1920

TCAATGTAGT CCCGTTCTTC CACAATTTTT AAGTCTTTAC GATTTCTTAA AAGTGCGACT 1980

ACCTGGTAAA TGTCTTCCAC AAATTGGCAC ATAATCCGCA AACCAGCAAT GTCTTGCATT 2040

TCTTCTTCAA TCCGATTTAA AGGAATATGC CGAATAGCCG TCTTTGTCAA AATACTATCT 2100

ACTGGTTTCA CACGTCCTGT AACGAATTCA ATTGGTATGT GTTTATCTTG CTCCCGAAAT 2160

TGTTTACGAA TACCTCGTAG TTTTACTTTT AATTCGCTGA CCGCCTGTTC ATATGGCGCT 2220

AAAAACAGCT CCCACTCTCG TTCCATGGTA GATCCCTCAA TTTCCTTGAA TTTCATTCAC 2280

TGATTATTTT ACCACAGTCC TCCCTAAAAT AGGGTGTTAT TCGCTATTTG ATTTCCAAAC 2340

GCCTTTCATT AGAAAAATTT TTCACAGAGA AGACCACTTT TCTTGACGGA AACCTATATA 2400

ATATGTGAAA ATTGGAGTGA TTTAAGAAAG AGAGAGAACC CCATGAGTGA GCACTTAGAA 2460

ATTGAATTTA AAACATTAGT TAGTCCGCAA GATTTCAAAC GATTAATCGA CCATTTTGCT 2520

ATTCAAAAAA CGGACTTTTT TACGCAAACA AATCATTATT TTGATACGGA CGATTTTCAA 2580

TTAAAAGCAC AACGAATGGG CTTAAGAATT CGCGTATTAG CAGATCGTGG TGAGTTAACC 2640

TTAAAAGTTC CTGCGCCAGA AGGATTATTA GAAATCAaTG ACCCTTTATC GCTTGAGACG 2700

GCGAACCACT TTATCAAGAG GAATCACCTC CCTACTGAGG GGGCTGTGGC GAAGAAATTA 2760

CAAGAATTAG GTATTGAAAT CGCCTCGATC CATTTAATTG GTTCATTAAA AACAGCACGT 2820

GCAGAAAAAC AAATTCCTCA AGGACTACTT GCGTTAGACG AAAGTTGGTA TAACCAACAG 2880

CACGATTTTG AATTAGAGTT AGAAGTTACT GAAGCTGAAA GCGGcAAGCA AGCGTTTCAA 2940

ACATTAATGG CTGATTTGAA CATTCCCATT ACGCTTGCAC CTAACAAGAT TCAACGCATG 3000

ATGCGGACCA CTGCCCCTTA GTATTCGAAG CACTGACGTT CTGGCACCTG TATTTGAAAA 3060

AGCATGGCAT TGGAGGAAGT ATGAATGGTT GAAATTTATT TGTTCGTCAA TCCTTTAGGC 3120

GGCGTCTGTT TAGAAATTGA AAAAGAAATT ATACAATTGT CTGTTAACGA CAAGAAGAAA 3180

ATACAATTGC GCTTTATTCC TTTATTAAAT ATGAAAACGA TCAATGAATT TTTGAGTCGT 3240

CAACATATTC CGATTAATGA CATTAAACGA CGGAATCGAA TTTTTGAAGA TTTATACTCA 3300

GCCGCTCTTG ACTATAAAGC AGCACAGTTA CAAGGCCGAA AAAAAGGCCG ACAATTGTTG 3360

ATTGGCTTAC AAAAAGCAGT AGCTGAAGAC GGTCTAGCTT ACTCTCCTGA GTTATCAGAA 3420

GAACTACTTC TAGCGGCTGG CGGGGATATT GATATGTATC GTAAAGACCG ACAATCCGAT 3480

TTnGTCAAAG AAnCTT 3496



(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

TTTCACAAAA GAGCAAAACC GCATGTTTTT GCGGTTTAGT GATAATTTTC ACAAACAAGA 60 

AAACGGTTTC TGTTCCACAC TTCTTCTGTA CTATTTTTCT CCCAACTGTT ATATTCTATT 120 

TTGAAAATAT TATATAACAG TTCTATTAAT TTGAGCCCCC TTATCCCCGA ACTGGCTAGT 180 

AATCTGATTT CCGAACTTTG CGTTTTCATG AAAATTATCT TGAAAAAACC AAAAAATAAA 240

AACCGTCAGC CGAGTAACCT ACTCAGCATG CGGTTTTTTA TTTTTTAGAG AACAAAGTTT 300 

TTTTCGTTAA TTTTTATTTT TTTTGGCTCG TTTTTTTGCC AAATACTTTA CCTTAGGTTT 360 

TATCACGAAC ATTCATTGCT TAAAATGACC AAATTTTCGT AAAACGAGTA ATACAGCTTA 420 

GCAAATTCTT TTCTCTTTCA GTAGATTTTA CTAAATCCAT TCCTTGGTAA TTTTAATATA 480

TAAACGTTTA GCTACTTCAA CAGTTAACAT GTACGCAATG ATGATTCCAA AGAACCATGG 540

CCAGTAGTTA GCCGGAAGTT TCACAAAGTC AAAAACCTCT CTTATAGGCG TTGCTACAAT 600

CAAGAAACCG AGTAAGATTG CGfTTAAACT ACTTAGCATC a r r* a or: a p t 660

TTGAATAAAT GGTATTTTCC GCGTCCGGAC CATATGAACA ACTAACGTTT GACTAACAAG 720

TCCCACCATA AACCAACCTG TTTGGAATAA ATGTTGAGTT GCCAAACTAT TGGCACTAAA 780

CACAAACCAC ATGACAAGAT ACGTTAGAAT ATCGAAAATA CTACTTACCG GGCCAATACA 840

TACCGTAAAT TTCGCTAGTC CGTTCGTTTC CCACCGAACA GGGCTTAATA ACTCTTCTTC 900

AT 902



(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



TCTGGTGAAG TCCGTGTGAA TGAATCCTTG TTAACCGGGG AATCTGATGA GATTACCAAA 60

GTAGCAGATA GCCAACTTTT GTCAGGTAGC TTTATTGTGT CAGGTTCTTG TTATGCAAGA 120

ATTGATAAAA TTGGCGCTGA GGCCTATGTT CATCAATTAA CGCTTGAAGC GAAATCAATT 180

AAAAAAGGGG AACAATCAGA GATGGTCGGT TCCATTAACC GTTTGGTGAA ATGGGTAGGG 240

ATTATTATTA TTCCAATCGG CTGTGTTCTA TTTTTCCAAA GTTATTTCAT CAACCATCAA 300

GGGCTTCATG ACAGCATTGT CTCCATGGAA GCTGCCTTGA TTGGGATGAT TCCGGAAGGg 360

TTGTACCTTT TAACAACCGT TGCATTAGCT GCCAGCACAA TGCGCTTAGC GAACCnGGGC 420

GTTTTATTGC ACAACATGAA AAGTATCGAA TCCTTAGCAC GTGTCGATGT GTTATGTGTA 480

GATAAAACAG GAACAATTAC TGAAAACACG ATGGAAGTCC AAGAGTTGGT TCCTGTGGTG 540

AGTATCGAAA ATGATGGTAC AGACTTAACC AATGTCGAAA AATTAATTGG CGATTTTTGC 600

CGAACAATGT CAGCTGACAA CGATACGATG AAAGCGATGA AAGAATTTTT TGTTACTAAT 660

AATCAGAGGG AAGCCGTTAG TTATACCTCG TTTTCGTCTG TCGAAAAATT TAGTAGCGTG 720

ACTTTCCCGG AAGCGACGTA TATTCTTGGG GCGCCAGAGA TGATTTTGAG AGACAATTAT 780

GAAATGTATC AATCTGAAGT GGAACATTAT ACAAGTCAAG GCTATCGCTT ATTGGTGTTC 840

GGAAAGTATT TAGGAGAATT TCAGGAAACG TTGGCAGCAG AGGTTCAACC TCTAGGCTAC 900

ATTCTTCTTT GGAATCCGAT TCGTAAAGAA GCCAAAGCAA CTTTTGAATA TTTCGCTGAA 960

CAAAATGTGG CTATTAAGGT CATTTCTGGC GATAATCCTT TAACAGTATC CAATGTAGCA 1020

CAAGCGGCAG GCATTATTGG TGCTGAAAAT TATGTGGACG CGCGAACATT AACTACAATG 1080

GAAGCCCAGA CAGAgCTTTA GAAAAATACA CCGTTTTTGG TCGTGTCACG CCAGAACAAA 1140

AGAAACAATT TGTATTATTG TTGAAAAAAT TAGATCATAC AGTGGCGATG ACAGGGGATG 1200

GAGTCAATGA TATTTTAGCC ATGAAAGAAG CAGATTGTAG TATTGCCATG GCTTCAGGAA 1260

ATCAAGCTAC AGCACAAGCC TCGCAAGTTG TGTTGTTGGA TTCTGATTTT TCAACGATGC 1320

CGGAGgTTGT CTTTGAAGGC CGCCaAGTAG TTAATAACAT TGAACGCTCT TCAAGTTTAT 1380

TCCTTGTAAA AAATATTTTC TCCTTGCTGA TGTCTGTCTT TGCTATGATT TTTGCCGTGA 1440

CATATCCTTT ACAACCGACA CAAGTCACAT TGATTAGTTT ATTTACAATT GGGATTCCTT 1500

CGACATTTCT AGCTTTGGAA CCAAATCATC GTCGCATTGA AGGGAAATTT CTTTTCAATG 1560

TGTTATCAAA AGCAATTCCT GGTGGGTTAA CGGATATGTT AGTAGTCGGT GCCTTACTAA 1620

TTTGTGGTGA TATTCTTGCA CTTCAAAAAA CGGACATTTC AACGACTGCA ACATTGTTGC 1680

TAGTTTCAGT CGGCTTTATG GTTCTTTATA AAATTAGTTC ACCAATGAAT CGTTACCGCA 1740

AGCGCGTAAT GATTGGCTGT TTAATCGGAA TGGTTATTAC AAGTATTTCC ATGAAAAACT 1800

TGTTCTCGTT AACTTCGGTA TCGCCAACAG CCTTGTTACT TTTAGCCATT CTTTTCTTTG 1860

CAGCAGACTC AACTTTCCAA CATTTATCAA CCATTTCTGA AAAAGTTCAA TTATGGTTTT 1920

ATAAGAAAAG ACATTAGGAG CAAAACACCG AGCCAAAATT ATTTTGGTTC GGTGTTTTTT 1980

GTTTTTTATC GTTGGGAAAA ATAAAAACAC TTGCACCTAG TTAATCCTTG ATAGTATAAA 2040

CCTAACCAAT TATTTTAGAT AGGAAGGATT GaATAAAAAT GaAAAAAAGA AGTTTTGTCT 2100

ATTGGGGaAT GTTTTTCATc TTTTGTCGAC GCTGATAACG CCTTTTCTGA AACTTGAAAC 2160

AGGATATGCG CaAACGGraC CAACTAGTAC CAGTGrAACC AATCAAATTT nCTGtACGCC 2220

AAATGTAGTG CCGCGTAAAC AAGTGGGCAA CATTGTGACA GCGATTCAAC TAACGGACAA 2280

AGAGGGAAAC CCACTTGGGA CCATTAATCA ATATACAGAC ATCTACCTCC GAATTGAGTT 2340

CAACTTACCG GATAATACCG TTAATAGTGG GGATACGAGT GTGATTACTT TACCCGAAGA 2400

GCTACGTCTC GAGAAAAATA TGACCTTTAA TGTAGTAGAT GATACAGGTA CTGTGGTTGC 2460

AATCGCACAG ACCGATGTAG CAAATAAAAC GGTCACGTTA ACGTATACAG ATTACGTAGA 2520

AAACCATGCC AATATTAGTG GGTCTTTATA TTTTACAAGT TTGATTGATT TCGAAAATGT 2580

CGAAAATGAA TCCAAGATTC CTATTTACGT CACGGTTGAA GGGGAAAAAA TATTTGCTGG 2640

CGATCTTGAT TACCAAGGTG AAGGCGATGA TGTTAACGAA AAATTTAGCA AGTATTCGTG 2700

GTTTATCGAA GACGATCCAA CCGAAATTTA CAATGTACTA CGGATTAATC CAACAGGGCA 2760

AACGTATACT GATTTAGAAG TGGAAGATGT GCTAAAAACA GAAAGTTTAA GTTATATGAA 2820

AGACACTATG AAAATTGAAC GAGGACAATG GACACTGGAT GGAAaTGCTA TTTGGCAaTT 2880

CACGCCAGAG GAaGATATAA CTGATCAGTT GGCTGTCCAG TATGGTCCTG ATGACCGTAA 2940

TTTTTCGGTG CATTTTGGCA ATATTGGGAC TAACGAnTAT CGTATCACCT ATAAAACCAA 3000

AATCGATCAT CTTCCCGAAA AAGgTGAAAC CTTCACAAAT TATGCAAAGT TAACAGAAAA 3060

TCAAACGGTG GTGGAAGAAG TCGAAGTGAG TCGTGTCTCA CAAACAGGTG GTGGAGAAGC 3120

TAATGGTGAA CAATATGTTG TTGAAATTCA TAAAGAAGAT GAAGCAGGCC AGCGTTTAGC 3180

AGGTGCAGAA TTTGAACTCA TTCGAAACTC AACCAATCAA ACAGTGGCTA AAATCACAAC 3240

AGATCAAAAC GGAACAGCGA TTGTTAAAGG CTTGTTAAAA GATAACTATA CTTTAGTAGA 3300 

AACCAAAGCA CTACCGGTTA CCAACTTTCG CAAAACAAGA TTCCAATAAC ACCTGAAGAC 3360 

TTTGGTAAAA ACTTcAGTTG CACTGAAAAC TGTTGTGAAT CATAAGATTT CATATCAaCC 3420

AGTCGCTGCt TCATTTTTAG CTGGAAAAGT TCTTTTAGGA AAACCACTGA AGGATGCTGA 3480 

ATTTCAATTT GAATTGTTAG ATGAGAAAGG CACAGTACTA GAAACGGTTA GTAACGATAC 3540

TTTAGGGAAA ATTCAATTTT CACCATTA 3568 
(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

CATGTTTATC TAAAAGTTTT GCCCAGAACA CTTCTACATT TTTTAACATC CTTTGCTCTC 60 

CTTTTTATTT TTACTTCACG CAACCTTCTT TCACTAATTA TACATAACTG ATTGCTTTTC 120 

TCCTATACTA TTTTTACAAA ATGAAAACTA AAATAATGTC AGTAGTATTT GGACAGAAAA 180

AAAAAGAGAG CGTGGGACAA AAATCACTTT GGATTTTTGC TCCACGCTCA AAAACTTATA 240 

AACGGCGGGA ACAGAAGCAA ATCCTTCGGA AATAAGCCGA AATTCTCCAA AAATTACAGA 300 

ACAATTTTCG GAAATTCCTT CTTATTTCTC GGAGTTAAAC GCTTCTGTCC CGACCTCATC 360 

TTTTTATTTT GCTTGTCGTT CTTTGATTTT CGTGAGTAAA GATTGAACAT GCTGGCGTGC 420

CGAAGCTTCA TCTTCTTGAA GCATCTTTTC CGATAAAATA AAGGAGCCGC CCACTGCTAA 480

TACTTCTTTG GCTTCCACAT ACTCTATAAA GTTTGTTTCA TCAATCCCAC CAGTTGGTAA 540

AAATTGAACA TCATAAAACG GACCGCTTAA ACTTTTAATG GCTTTTAACC CGCCGTAAAT 600 

ATCGGCTGGG AAAAATTTAA CCACTCGTAA GCCATACGCC AGCGCTCGCT GAATCTCCCC 660 

AGGCGTAgcT GTTCCTGGAA AAATAGGCAC CTCATTTGCC AGACAAAAAT CAATGACTTC 720 

TGGAACAATC GCTGGGGAAA CAATGAACTG TGCACCGTTC GCTACTGCCT CTTGGGCTTC 780 

TTTCAAGGTT CTAACAGTTC CTGCGCCTAC TATTAAATTC CCTGAAGCAG CAAGTTGTTT 840 

GATGGCAGGC AAGGCCAGCT GACTCCGAAA AGTGACTTCA ATAAACGGTA ATTCGTTTTC 900 

TACTAAGACT GATTCTACTT GTGGCAAATA GCGTAAATCT GTTGCAGTAT ATAGTGGTAA 960 

AAGTTTACAT GTCGCTAATT GTTGATACAG TTGATTTTTT TCAACTTTTG TTTCCATTCT 1020 

GTCACTTCCT ATTTTGTTTG GCTTTGGTTG CTTCGTACAG TCCTTGAATA TAGGTCAACC 1080 

CCATTGCTCG ATCATATAAA CCATAACCAG GCATCGCTTT TTCATCCCAA ATTGCCCGCC 1140

CATGGTCAGG ACGAATAACG CCTTCATACC CTACGTCTAC CAGTGCTTGC ATTAATTCCG 1200

CCATATCTAA CGAACCAGCG ACACTAGGAT GCGCCGTTTC TTCAAAACGA TGCTCACCTA 1260

AATACTTCAC ATTGCGGAAA TGAACAAAAT TAATTCTGTG ACCTATTTCA CGAATCATAG 1320

TAGGCAAATC ATTCGTCGGG TCCGCACCGA GGGAACCTGT ACAAAATGTA ATCCCATTTG 1380

CTGGTGAGTC CACCAAGGAA AGGATCCGTT TTAAATCCGC TAAATTTTTA GTGATTCTTG 1440

GCAAACCAAA AATTTCCCAA GGTGGATCAT CGGGGTGAAT CCCCATTTTG ATATTTTCTT 1500

CTTCACAGAC AGGAATCACA CGTTCTAAAA AGTAGCGCAA ATTTTCGACC AAATCTTCTT 1560

CCGTTACGCC GGCATACATG GCTTTCAATT CTTGAAATTG TTGCAAACGC TCCTCTTCCC 1620

AACCTGGTAA CCGGAATCCT TTTGACTGAC TGTGAATTAA TTGATACATG TCTTCTGGCT 1680

GCATGTTCTC TACCACAGCT TGATCAAATA ACAACGATAA GCTACCATCT TCATTCTCAT 1740

ACGCCAAGTC CGTTTTAGCC CAACCAAAAA TAGGTTTGAA ACTGTAGCAC ACTAAGGAAA 1800

TTCCACATTT TCCCAAGTTA CGTAACGTTT GGCGATAATT ATCAATATAG TGGTCTCGTT 1860

GATCCGTTCC CGCTTTAATP, f^PATPATftAA A AA AfZTf^PTA 1920

ATCCTTCTTG TTCCACCGAT TGTTTAAGTG CCTGAATTTC GGCTACTGTC CAAACATCAC 1980

CTGGTAGTTT ATTTAACAAG GTACCGACAA CACCAGTAAT TCCTGGAATC TGTCGAATAT 2040

GTTTTAACGG AATGGCATCG CCTGCCGCTC CGTACCATCT AAATCCCCAT TTCATTGGCT 2100

TTCACCCTTT ACTAACATTC TTAAAAATCA TCGTCCTCAT CCTCTTCGTC CTCAAAGGAA 2160

GTCATAGAAA TATCCCATGC nTGCACACTT TCTTTGCCTT GCGCAA 2206



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1887 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

CCATCCATTA ACCAGATTTT TGCCACGAnC TTGnCTGTTA AACGCTCAAG TCGCmtTTTC 60

CCgaATGTGG GTAAAAtGTG TTTTTTCaTA CGTCGTTCAG TAGCTATGAA TGTGGATTCC 120 

CTAACTGTTT TTTTGTATTC GTCCAACCAC ATATAATAAA CTTCTTCAAA AGTGGTTAAA 180 

CGAGTATGCT CGTTAGCTAG ATTTCCATTA TCAAAATCTA ATTTTTTTTG ATTAAGCTTG 240 

AGCTGTGCTT CTTTTTTTGT ATTACAGtTT CTGATAGTGA CATTAATTTG TTTTCCAGTT 300 

AAATAATCTA CGCCTAAATA GGCAGTTACT TTCCAGTATT TTTTCCCTTT TTTTGTATAT 360

TGTTTAAAAG TTGCCATTGC TTATCCTTTC CACTTGGGCA AGCGAATAGA AGGAATGACA 420

AATTTCTAGC ACCTCCTTAT TAGATTTTAA AGCCCCTAGC ATGAATCGAA CACGCTAGAA 480

CTCACCAGAG AGGGGGATAA TAAAGCATAT TTTTTTGTTA AGCAGTGCGC GTTACTTTTT 540

ATTCTCTAAT TTTCTAATAG CTTTTTCTAA ATCATTTATT TTTTTTATTA GCATATCTAT 600 
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TTTTTCATCT TTTTCATCAT TGTTATTAGA TTCATTACTT TTATTAAAAT ATTCAGTAAT 

AGTAGAAGTT . AAXATGCCAA. .CAAACCCAAT TCCCACAATC ATCAAAATGA TTGCAGCAAT 

TCTTCCTAAT GGAGTAG7TG GaGAAATATC ACCaTACCCG aCAGTTGTTG TTGTTACCAA 

AGCCCACCAA AACGCATCAA TGTATGGAAC GTTTTCTGCA TATGAGTAAA TCATTGCTGA 

AATAACAATG AGAACCGAGC TTAAATAAAT CACGTTTAAA AATCCGTTAG TATTTAAAAA 

TGATTTAGTG TTTCTTGCTA ATTTGCCAAC CACACCTATC GCTCTTGTTA GCTTTGCTAG 

TCTAGCTATT TTAGCTATTC GAAACAACCT AGCGATTCTA AAGAAAGAAA AAATAGCATC 

AAAAGGAATT ATCGCGATCA GATCAAAAAT ATTTTCTTTA AAAAATTTGA TTTTATTTTT 

TGAAATAAAA AATCTAACAA TGTAATCAAT TGTAAATGtG aTTAAGaTAA TACTATCAAT 

AACGTTAAAT GGTGGATTAC TAATATTAAT AACATTTGAn AAATCAAGAA TAACTAACGC 

GATTGAGATT AATGCTAAnA CAACAATAGA GTAATTATAG AATTTTTGAT TTATTTTCAA 

TGAAGTTCTC TCACTCTCTA AATGrGGyTT AraGTCCGTGT TCCCAGCACG GACTTTTTTA 

TTTCAAATAA ACTTCTTGTC CCATTTTTAA GTtGTAATGA GCTATAACAT TTGAGTAATT 

GTATTGTCCT TCATATTTTT CGATTAAGCT TCTAAACATA TATTGTTCTG CTTCAGCTTC 

CATCTTAGAA CGAAAAACAG GAATTTTATA CAATGCCATT ATATCCACAT GTTCTTTTAC 

ATGCTTTAAT TCGTGATATA TTGCTTCTTC TTGTTCTGAT GGTGTTAAAT TTTGATTTAC 

AAATATGATA CCGTAGGTAG GGTCGAAACA TGCGCGTTTG TTCAAAGTAG TAAAAACTAA 

CTCCACATTA TATTCTTCTA CCAACTCTTT GATACTTTTC ATATAAGCAC AACCTTTGAC 

TTATTTCCCG AATCTACCCT TTAAATATGC ACGGATAACT TCTCTGTCAT GATCATCAAG 

CGGTTCACCG TCAAAACTCA TGACGTTATC CAGTACATCA TCTAAATCGT CAGATGGTTT 

AGCACCAGCT TGGTTTGGAT TTTCAGTACG ACCTAATAAA TAATCAACAG AAACATTAAA 

ATAATkGGCT ACAGCTTCAA GTTTATC 

(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 19024 base pairs 
(B) TYPE: nucleic" acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 
TGTAAAAAGG nACnACAGCC ATTGGTACCT TAAACTACCT GGTTAGTTCA 
CCATAGTTTA AAACATCCAG GATACATGCG tTTCGTJcGCG CATTCCaGgC 
ATATCGTAGT TTGGATCAGT AGGTTGCGCA TTGACGCCGC GCTTAATTGT 
TTTGGGAAAA TAGCTGTTCG TTTTTCAATC CCTAAACCAT TAATTCGGAT 
GCCCGTTGAA TTTCTCGCTC AAACCAGTTG GTACCTAAAC CAAATCCCAG 
GGGGTTTGTC CATTGGAAGT AAATAAGGTA TTAATTTCAT ACTCAAGACT 



ACGATTTTAT 
CAGCTGTTTA 
GAAAATCAGT 
TTGTAAAATC 
AGAAGTAAAA 
TTGCATGGCA 



660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1887 



60 
120 
180 
240 
300 
360 
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TCAAAAATAT CTTTTTTCGT 
ATCCATTCTT GCGCATCTGC 
AATTCATCAA TCCGATCGGC 
ATTTGAGAAA TTTGTGCAGT 
CCAATTTTAA AGCCATTGTT 
GGAGTATATG GATGATAATC 
GCCACGTGTG AAGGCAACAT 
TCACGTTGCG TATTAAAAAC 
TCTTTATTAA TTAATTTGCC 
TCACGTTGGG TTCTGTAATG 
AGAGTATGTT CCACAATATT 
GCAATTTCTC GATTGACATC 
ACTTGG'TCAT GTATTTTTTG 
AAAGTCACTA AACGACCATC 
GCTTTTTCAT CAGTAATCTT 
CCTAATACAA AGGTCGGCTA 
TAGACAAACA CTATATATTG 
AAAGGATTGA GCAAATTTTT 
AAGAGTTTTT TGCCAAATTG 
TTGTTTGCGG ATTTATTCAC 
GATAAAACGA TGTTCATTCA 
GAGCGAGAAA AATGAAAAAA 
TAAATAATAA AACAGAAGTA 
TGGGGATTAT GGGTCCCAGT 
TTATTTTGCC AACCGCAGGA 
ATAATCAATT AAGTGATTTT 
TAATTGATAC GTTAAATGTG 
CATTAGAAGA AATGGATAAA 
TTTTATCTGC TTATCCACCA 
GTGCTTTGAT TACGAACCCT 
CAAAATCAGC AACAGAATTG 
CAATTTTGAT GGTGACACAT 
TTAAAGACGG TGCGATTTTC 
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TTTCTTCCGA GCGAATGCTT TTTGGCGTTC TTCTCCTTCA 4 20 

TAAATGTTTT TCATAATTAC GTTCCGCAAA AGGCGCTAAT 480 

AGAGCAACCA CCATATTGAC TGGAAGCTAC ATTGGCAATA 54 0 

GGCCGTTTGA ATTGATTTTG GTGATTCGAC ATCAGCGTTC 600 

TAGCATGCCT TTAAAATCAA TTAAACAGCA GTTTGTCATG 660 

TAAATCGTGG TAATGAATGT CGCCTTTTTG ATGGGCATTA 720 

CTTTAAACCA ATTGATTTTC CAACAATCCC AGCGGTTAGA 780 

GTCACTGTCT TTGTTGGCAT TTTCGTTCAC GACAGTCTGA 8 40 

GATAGATACA TTAATATCAG TTGCTTTACT ACGCTCAAAA 900 

AATGTATTCT TCGGCTAAGG CATATTCGTT GTTGGATAGC 960 

TTGAATTTCA TAAATTTTTA CGTTATTGGT AAAACGTTCG 1020 

AGCAACAATC GATTGGACTT TTTGATGTGT CAAAGGAGTG 1080 

CTCAGCTTTG ATCAAAGCGT CGTATATTTT TTGATCATCG 1140 

TCGTTTGACG ACTTTAATTG TTGAGAGATT AACATCCTCG 1200 

CATCATTTTA TCCAGCTCCT TATATCTCGT TAGTAACTAT 1260 

AATTCAATCG ATATATTGCG TATTTTTAGT AAAATATCAC 1320 

TGCTTGCGCT GTTTTGAAAA AATATGAAAA CAAATGAAGA 1380 

TTGCGTGAAA AAAATTCTTT GTCCAAATTA AAAAACAGCA 1440 

CTAAAAGCAA CACTAAAAAA TAACAATTAT TAAATAATAA 1500 

GAAATGCCAA TTATTCTTTG ACTACCGAAC GATATTGTAA 1560 

GATTTTTTCT TGGTAAAATA AGAGTAATAA CAAAACGTTG 1620 

ATTGTAGAAG TAAACCATGT CAGCAAGACA TATGGTCGTT 1680 

TTAGATAATA TTTCGTTTAC CGTTGATGAA GGCGAATTTG 17 4 0 

GGTGCTGGGa AATCGACACT GTTGAATGTC TTATCCTCGA 1800 

ATCGTGCGCA TCGCCGGACA AGATATTTTA AAAATGCGAG 1860 

CGAAGAAACG AAATGGGGTT TATTTTTCAA TCGTTTAACC 1920 

AAGGACAATA TTCTGTTGCC TTTAGCTGTA GAAAAAGTTT 1980 

CGTTTGTTAC ATGTGACTTC GATTTTGGGG ATTCAGGAGC 204 0 

GAGATATCTG TGGGACAAAA ACAACGGGTA GCAGCTGCTC 2100 

AAAATTATTT TTGCAGATGA ACCAACTGGC TCGCTGGATT 2160 

TTGAAATATT TAGCTGAAAT TAATCTACAT GATGATGCCA 2220 

GACCCGTACA CGGCCAGTTT TTGCAATCGC ATCTTATTCA 2280 

TCAGAAGTTG TTCGTCAAGG ATCACGTAAG GAATTTTTTA 234 0 
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ATCGTGTGAT 


1 AGATATGCAA 


. GCAACGATTG 


GTGGAGGCGG 


GCGCGCAAAT 


GATTTTTAAA 


2400 


CTATCATTAA 


. AAAATTTAAA 


. AGCTCATGTA 


. CCTAATTATC 


TGGTTTATTA 


CATTAGTATG 


24 60 


ACTTTTGCAG 


CGGTCGTTTA 


TTATTGTTTC 


CGTGCGATTG 


CTTACAACCA 


GCCTCTTGTC 


2520 


GCAGGTGCAG 


GCCGTGATAT 


CGAAATTAAG 


CGATCTTTAG 


GACTCGGTAG 


TACCTTAGTC 


2580 


ACCATTATTA 


TTTTGGGCTT 


TATGTTGGCA 


GCAAATCGTT 


TTTTTATAGG 


AAAACGTAGC 


2640 


CAAGAAATTG 


GCTTGTATCG 


ATTAATTGGC 


TTGAGAAAGA 


GCCAAGTTTC 


ACTAATTTTT 


2700 


TTAGTAGAAA 


ATTTAGTTTT 


AGGCTTTGTT 


TCACTAATTA 


ATGGAATTAT 


TTTAGGAGTG 


2760 


ATTTTCACCA 


AGTkGTTTtC 


AATGATTTTA 


GCCAAAGTTA 


TGTTTTTAGA 


AGTCCCCAGT 


2820 


TTCTTCTATA 


TTTCTTGGCG 


TTCCGTTATT 


GAAACCGGTG 


TAGCTTTTGC ATTTATGATT 


2880 


kTGATTGTCT 


GCTTAAGAAG 


TATTTGGATG 


ATTTATCGTT 


ATCCATTGTC 


GCAGCTTATG 


2940 


CATCAAGAAA 


AAATTGGTCA 


AATTAATTAT 


AGTCGTTTAA 


CGAAACGGCG 


GCAGCTTCTA 


3000 


GGTATTTTAG 


GACCAATTTG- 


TTTATTAGTA 


GGTTATTTTA 


TCGCGTTTGA 


TTTTCGTAAT 


3060 


TTAGCGACAG 


GTATTGTGTA 


TAAACAATTA 


AATGATTCAT 


TGGATATTTC 


GATTCCAATG 


3120 


ATTATTACGT 


TGACGCCTTT 


TATCATTTTG 


GGTCTATGTG 


TGGTGGGAAC 


TTATTTATTT 


3180 


TTCCGTTATA 


CCATGTATTT 'TATCATTCGC 


TTATTTAATC 


ATAGAAAAAA 


ATGGTATTAC 


3240 


CAAGATTTAC 


GCATGATTAC 


TTTAGGCAAT 


GCCAAACGCT 


ACTTATTTAA 


AAGTAGCAAA 


3300 


ACACTCACTG 


GTTTGACCTT 


ATTAATTGCG 


ACAGCCTTGA 


GTGGGATCGG 


CGTGATGGTT 


3360 


TTTGTGTATA 


CGATTAGTAT 


GCACACGGTC 


AACAGTGATG 


GCCCCGTGGA 


CTTTTTAGTT 


3420 


TCACAGGAAA 


GCTATCCTAA 


ATTGAAGAAA 


GTTCTCGATG 


AAGCGAAGGA 


CACCAAAATA 


3480 


AAAAATGAAG 


TGACACTTTC 


TTATAAAATT 


ACTGGAATTG 


AACGCAGTTT 


GCGAATTGGT 


3540 


CAAGCGCAAG 


AGGAACAAGA 


AACGATTGAA 


GCAGTTAATG 


TTTTATCCTT 


CTCCAACTAC 


3600 


CGCAACTATC 


AAAAAATTAA 


TCCTTATTTA 


AACGATTTAC 


ATTTAAAAAA 


TGACCAAAGT 


3660 


GTTATCTATA 


TGGATAGCTT 


TACCAATATT 


TTAAGTGGTA 


TGTCTCGTTA 


TGAATCGAAA 


3720 


CTGCAATTTG 


TTGAAGGGGA 


GAAGGCACAA 


CTACAACAAG 


TGCTGCCTAA 


CTATCTAGGC 


3780 


AATTTTTTAT 


TGCAATATTC 


CTTACCAACG 


ATTGTCGTTT 


CCGATGCCCT 


TTATCAGAAA 


3840 


GTGACAAAAA 


ATTCCATTCA 


GTATCAGATT 


AACGCCGTAA 


ATGTTGACAC 


GAAAAGCCGT 


3900 


GAAAAACTTT 


ATGAAGCCGT 


AAATAAACAG 


ATTGCCACAG 


AATGGCAAGC 


GCCAATTTTG 


3960 


TATAAGTATG 


AAAAAAATAA 


CCAGCAGTTG 


AGTGGTTTTG 


CCAAACAAAT 


GCCTAACGAA 


4 020 


GAGGTAAAGA 


ATACGCAAGA 


TGATGTCACA 


GAAACTTGGC 


GGCTAAATAT 


GAATGATCGc 


4080 


TACGCCTCTT 


TACGCTATGA 


ACGAAAAATT 


AACGGCTTGA 


CTTTATATGT 


CTCCATGTTT 


4140 


GTGGCAATTT 


TAGCGTTGTT 


TATTACGGGC 


AGTATTTTGA 


TGTTACGACA 


ACTGTTTTCA 


4200 


GCTGTAAGTG . 


AGCGTCAAGA 


TTACGTGACT 


TTAAAACGGA 


TGGGTGTTTC 


TAGTAAAGCG 


4260 


ATAACTAAGC . 


AAATTTATCG 


ACAGAATAGT 


TTGATTTTCT 


TTCCGCCGAT 


GGTCATTGGG 


4320 
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ATTTTGCATA CCACCTTTGC 


: TATTTATTTG TTAAGCCAAT 


TTATTAAGAG 


TCCAGGATAT 


4380 


TTGTTAGTAT ATCTTTCTTG 


TGGAATCTTA ATAATTGTTT 


ATCTAATTTT 


TTATATTTTA 


4440 


ACGTCTGCCA TTTATGCGCG 


GATGATTCGA CACATCCATT 


TTTAAAAAAC 


TGTAAAAGCC 


4500 


GTTGTGAAGC GGTTTTTACA 


GTTTCTTTAT GCGAAAAAAT 


TATGAGAGTT 


CCGTTTATGA 


4560 


AAATCATGAA AATCTAAGAA 


GTGACGAAAT TGTTAGTTTT 


TTTATTAAAT 


TGTAAGAAAT 


4620 


TTCAATGGAC TTTGCTGAAA 


TATAGCATAA AATAATGGAT 


AGCGATTAAC 


CTTAGGAGGT 


4680 


GAATGATAAT GAAAACAACT 


TTTCAGAAAA TTCTCAATAA 


CGAAAGTCTT 


AAATTCATCG 


4740 


GTAAAACGTT ATTTTATTCT 


TTCATTGTCA TTGCATTAAT 


TTATCTTTAT 


CATTACAAGA 


4800 


ATATCAATGG AGGAACCTTC 


ATTTATAACG AGTTTTAAGG 


AGGCGGCTAT 


ACATGGAAAA 


4860 


AGTAATTAAT ATGATTCAAA 


CGATTGATGA ATGGGCACGT 


AAAGAACCCC 


AACGTCCTGT 


4920 


TTATTTAACC GAAGAAAAAG 


TCTCTACGTA TGGAGAGTTA 


AAAGAAAAGT 


CAGACAATCT 


4980 


AGCCGCTTAT TTAGCTGAGC 


TGAAAACGGA CAAGTCAGCG 


ATTGTCGTAT 


ACGGAGAATT 


5040 


AGATTTTGAA ATGATTGTGA 


GTTTTTTAGG CGCATCGAAA 


GCTGGTTTTA 


GCTATATCCC 


5100 


CATTGATGCA CATACACCAA 


AAGAACGCAT TGAACTAATT 


CTAAACGTAG 


CCAAACCAAC 


5160 


CGCAGTGATT GCTGTGCATG 


AATGGCCTGA ATTAGCCACC 


GAAGTGCCAG 


TAATTACGGC 


5220 


GGAAGAATTA ACAGAAATGA 


TGATGCATGC ACCCCGGCAT 


GCGCCTGCGT 


TGATGCCTGT 


5280 


CACAGGAGCA AGTAATTACT 


ATATCATTTT CACTTCAGGG 


ACAACTGGCG 


TACCAAAGGG 


5340 


TGTGCAAATC AGCCATGATA 


ATTTAGTTAG CTTTACGAAT 


TGGTTGTTGC 


AAGATTTTGG 


5400 


TTTAGAAGAA GGGGCCCGCT 


TTTTAGCACA GGCACCTTAT 


TCGTTTGATT 


TATCGGTGAT 


5460 


GTCTATTTAT CCCGCATTGG 


CGTTAGGTGG CTCATTAACT 


CCGTTACCCA 


ATGAAATCAT 


5520 


TAACGATTTT AAACAATTAT 


TTACGCGTTT GCCACAATTA 


ACAATCGACG 


TCTGGGTTTC 


5580 


AACACCGTCC TTTATTGAAT 


TGTGTCTAAT GGAACCaAGC 


TTCGATGGTG 


AACATCTACC 


5640 


AGCACTGCGC ACGTTTTTAT 


TCTGCGGAGA AGAATTACCA 


AAACCAACGG 


CTGAGAAATT 


5700 


AGCCGCTCGT TTCCCAACTG 


CGCATATTTA TAATACGTAT 


GGGCCGACAG 


AAGCGACTGT 


5760 


AGCGATTAGT GCCATTGAAA 


TCACTCAAGA AGTTTTAAAA 


AGTGTCCAAC 


GTTTACCAAT 


5820 


TGGGTATGTG AAAGAGGATA 


CGCAAATTTA TATTATGGAA 


GGCATGTCAA 


AATTACCTGC 


5880 


AGGTGAAATT GGTGAAATCG 


TCATTGCTGG ACCAAGTGTT 


TCCAAAGGCT 


ATCTAAATAA 


5940 


TCCTGCGAAA ACAGCTGAAr 


CTTTTTTCCa ATTAGATGGC 


GTACCAGCTT 


ATCGCACAGG 


6000 


CGATGCTGGA AAGTTAGTCG 


ATAACCTATT GCAGTATGAG 


GGACGCTTAG 


ATTTTCAAAT 


6060 


TAAATTGCAT GGCTACCGCA 


TTGAGTTAGA AGAAGTGGAT 


CATCACTTAA 


CGAATGTTTC 


6120 


CTACGTGAAA CAAGCAGTCG 


TTGTGCCGAA ATATCAAGGA 


AATAAAGTCC 


AACAATTAAT 


6180 


TGCCTATGTC GTTCCGCAAG 


CACATGAATT TTCAAGTGAT 


TTTCAATTAA 


CTAAGGCAAT 


6240 


TAAACAAGAA TTAGCCACGT 


TGACAATGGA CTATATGATT 


CCGCAAAAAT 


TTGTATATGT 


6300 
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TGAACAATTA 


> CCTTTGACAA 


. GTAATGGTAA 


. AATTGATCGG 


AAAGGGTtGA 


TGAATGAGGT 


6360 


GAAtGCCACa 


tGATGAATTn 


TCCTCATATG ATyCCCTATA AtGCACCGTA TtACTTTGTT 


6420 


CtGCTAAtTG 


CGGCCcTTGT 


TGCCgATGAT 


TTTAACTTTA 


GCCaTTAAAG 


GGACACGTTG 


6480 


GCCTTGGTAC 


CAAACACTCG 


TGACACTGGT 


CTTTCTATAT 


ATCAGTTTTG 


GTGGCGAATT 


6540 


TTGGCaACAA 


GGAGTCGCTC 


TGATTGTATA 


TGTTATTTAT 


CAAACGTTAC 


TAACGTGGGG 


6600 


CTATGCGGCT 


TATCGAAAAA 


ATAAAAATGC 


AGGTTGGGTC 


TTTTATCTGG 


CTGTCTTTTT 


6660 


AG C TAT ATT A 


CCGCTTGTTT 


GGGTGAAAGT 


TTCTCCCTTC 


ATGACAGGCA 


AAACAACCTT 


6720 


ACTTGGCTTT 


TTAGGGATTT 


CTTATTTAAC 


CTTTAAAGCG 


GTCCAAGTGG 


TCATGGATTT 


6780 


ACGGGATGGC 


GTAATGAAAG 


AATATCATCC 


GTTTCGTTAC 


ATTCAGTTTC 


TCTTGTTTTT 


6840 


CCCGACTATT 


TCTTCAGGAC 


CCATCGATCG 


TTATCGGCGC 


TTTGAAAAAG 


ATTTGAAAAA


6900 


CCCGCCGAGT 


GCGGAGAAAT 


ATTTAGACTT 


TTTAGAAAAA 


GGAATTTTTT 


ATTTATTTCT


6960 


AGGATTTTTA 


TATAAGTTTA 


TTATCAGTCA 


TTATCTTGGT 


GGTGTCTTTT 


TACCACATGT 


7020 


GGAAAAAATG 


GCCTTGGCAC 


AAGGTGGTTT 


ATCCTGGTGG 


ACAGTCGGCT 


ATATGTATAA 


7080 


TTACAGTTTG 


TATCTATTCT 


TTGATTTCGC 


AGGCTATAGT 


TTACTGGCCG 


TTGGCACAAG 


7140 


TTATTTAATG 


GGCTATGATA 


CACCAATGAA 


CTTTAACAAA 


CCGTTCCTAA 


GTTGGAATAT 


7200 


TAAAGAATTC 


TGGAATCGTT 


GGCACATGAC 


CTTATCTTTT 


TGGTTCCGTG 


ATTACATTTA 


7260 


CATGCGCTTA 


ATGTTTTTCT 


TAATGAAAAA 


GAAAGTCTTT 


AAGAGTCGGA 


TTGTGACTTC 


7320 


TAATATTGGT 


TACTTTGCGT 


TGTTCTTAAT 


TATGGGCATT 


TGGCATGGAT 


TAACGTGGTT 


7380 


CTACATCGCT 


TATGGGCTCT 


ACCATGCAAC 


ACTAATTTGT 


GTAACAGATG 


CTTGGCTACG 


7440 


TTTTAAGAAA 


AAACATAAAG 


ACAAGATACC 


ATCAAACAAA 


TTTACACATG 


CATTTGCAGT 


7500 


CTTTTTAACT 


TTCCAAGCGG 


TTTGTGTCAG 


CTTCCTGATT 


TTCTCAGGAT 


TTTTAGACAA


7560 


ACTATGGTTT 


AAATAAATAA 


AAATAAAAAT 


ATAAAGGGGC 


AATTTATAAT 


GAACATTCAA 


7620 


GAAACAGTAT 


TAAACATTTT 


AGAAGATATT 


ACAGGAACAG 


ACGAAtTGTG 


AACAATCAAG 


7680 


ACATTCAATT 


ATTTGAAGAA 


GGATTATTGG 


ACTCTTTAGC 


GACGGTTCAA 


TTATTAGTAG 


7740 


AAATTGAAGG 


ACAATTAGGA 


ATCCAAGTTC 


CTGTATCAGA 


CTTTGACCGT 


GAAGTATGGG 


7800 


GAACACCAAA ACAAATCATT 


CAACAAGTGG 


AAGCTTTACA 


ATAATGTCTA 


GAAATAAACG 


7860 


ACTAATCAGA 


ATCTTAGGTC 


CAGTGCTTTG 


TTGAGCGGTC 


TTGGTTGCAG 


TGTTCTTTTT


7920 


TGCACCGTTC 


AGAATCAATT 


TAACTTCCGA 


AAAAACGTTG 


AAGGAAGCAT 


CAACGTCAAT 


7980


GGCACCGAAC 


GTTTTAAAAG 


GCAATGTGAT 


TAAAAACAAA 


GCAGTGGCTT 


CTGGTAAATA 


8040 


TGTTCCTTTT 


TTTGGCTCTT 


CTGAATTAAG 


TCGATTTAGT 


GCTTTCCACC 


CTAGTGTGCT 


8100 


CTCTGAAAAG 


TATCAACGGA 


ACTACCGACC 


GTTTTTACTA 


GGAGAAGCGG 


GCACACAGTC 


8160 


ATTAACCCAA 


GCAATGGTCA 


TTCATTCGAT 


GGGcGATGCG 


ATTGcCAATA 


AGAAAGCGGT 


8220 


CTTTATTTTA 


TCTCCGCAAT 


GGTTTGTCAA 


AAAGGGTGTA 


CCTAATGATA 


GTTTTGGTGC 


8280 
ACACTATTCA CAATTACAAA CCTATCAATG GTTAGCTAAT TTAACCGAAT


TAACCAGCGG 


8340 


TGACCAATAT CTAGCACAAC GTTTGACTAA ATTTCCAGTT GTTCAAAAAG 


ACAAAGTCTT 


8400 


AATGGAAACC TTGGCAAATC TGCAGGCAGG GCAACTTCCC CAACGCTCTC 


AACGAGATTA 


8460 


TTTTATAATG AATCTCCGTT TCTTGAATCG TGAAGATGAA TTGTTTAGCC 


AAATTGGCAT 


8520 


GGTTAGCAGA GAACCAATTG TTGAAAAAGA TATGAAACAA TTACCTGCTA 


CTTATAATTT 


8580 


TAATGAATTA GATCAGCTAG CTGGAAAAAT TGCGGCAAAA GCCATTAATA 


ATAATAAATT 


8640 


TGAAATTAGT AACGGTTTTT ATCGCCAACG GATTAAACCA GTTTTACCGA 


AATTAGCTCA 


8700 


TTCACAAAAG AAATGGGATT ACCGTTTTTC TCCAGAATAT GGTGATTTCC 


AAGCTGCGTT 


8760 


AGAACAGTTA GCAGAAAAAA ATGTCGATGT TTTATTTGTA ATCCCACCTG 


TTAACAAACG 


8820 


TTGGTCGGAC TATACGGGCT TGTCACAAGA TATGTTGCAA CAAGTAGCAC 


GCAAGCTGAA 


8880 


ATATCAACTT CAAGAGCAAG GGTTTACTAA CATTGCTGAC TTCTCAACAT 


GTAGTAATGA 


8940 


ACGCTACTTT ATGGCGGACA CGATTCATTT AGGTTGGCGT GGTTGGCTCG 


CTGTTGATCG 


9000 


TCAAGTAGAT GAATTTATGA AACAACCTGC GTCTAAAAAG TTAGCGTATC 


AAATTGATGA 


9060 


TCGTTTTTAT CAGACAGATT GGCAACAACA AAATCCATTG GTCTTACCTC 


AATTTTAAGA 


9120 


AGGCAAGATG GATCAAGTAC CAATTATCTG AAGGGGACTA GAAACGCTTC 


GTAACAATGA 


9180 


AAATAGCCTG TACCTGCCGT TGAATAACGA GCGTGCAGGC TGTTTTTTCG 


TTTATTTTTA 


9240


GTGGAGAACG TTGCTCAATT TATTTTTTTG TAAGTATTTT TTTAGTTCGT 


GGTATAATGG 


9300 


ATAAAATTGG TTTTTGAGGG GAGTTATTTT TATGGATGCA AAAGAAGAAA 


AAATGTTAAT 


9360 


TGATTTAACA AGTGCTAAAG GGATTGCTGG GAATGAAGAC GAAGTACGTT 


CGTTATTTAA 


9420 


AAAGTATGCA GCACCGTATG CGGATAAATT TTTATATGAT GGCCTTGGCA 


GTATTATTGC 


9480 


AAAAAATGTT GGTGATAAAG AAGGACCAAA AGTCTACATT TCTGGGCATA 


TGGATGAAGT 


9540 


CGGTTTCATG GTTACCCAAA TCACAGAAAA AGGCTTTTTA AAATTTCAAA 


CCGTTGGTGG 


9600 


TTGGTGGGGG CAAGTGATGT TGGCCCAACA AGTTCAGATT AAAACCACGA 


CTGGTAAGGT 


9660 


TTACCATGGT GTCATTGGGT CTAAACCACC GCATGTGTTA ACAGCGGAAG 


TTCGTAACAA 


9720 


ACCCTATGAG ATTGCAGATA TGTTTATTGA TATTGGTGCA AGTAACGATC 


AACAAGTTGC 


9780 


AGAATGGGGG ATTCATCCAG GGGATATGGT GACACCATAT ATTGACTATC 


GTCGCATGAA 


9840 


TGATACAAAA TTTTTATTGG CAAAAGCGTG GGATAACCGA ATTGGTACGG 


CCGTTTCATT 


9900 


AAAAGTTTTA GAAAATTTAG CGACAGAAGG GCACCCCAAT ATTTTATTTG 


CAGGCAGTGA 


9960 


TGTGCAAGAA GAAGTCGGGc tACgCGGTGC GCAAACAAGT ACGCATCTTG 


TGAATCCAGA 


10020 


CATAGCGTTT GCTTTAGACA CAGGAACAGC TGGTGACACA CCAGGAATGA 


CACCTAAAGA 


10080 


AGCCGACTCT GTTTTAGGTG AAGGACCACA AATTTTGATT TATGATGCAT 


CGATGATTCC 


10140 


GCATAAAAAA TTACGTGATT TTGTTATTCA AGTGGCGGAA GAAAATAACA 


TTCCCTTCCA 


10200 


ATATACAGTG ATTACAGGGG GAGGTACGGA TGCTGGTCGC ATGCATTTAA


CGCGCAACGG 


10260 
GATTCCATCA


TTAGCCATCA 


CGGTGCCTGT 


TAGATATCTG 


CATTCACATA 


CCTCAGTTAT 


10320 


TCATGAAGAT 


GATTATTTCA 


ACACAGTCAA 


ATTAGTCACA 


GAAGTGGTGA


aacgtctaga


10380 


TAAAGAAACA 


GTCGCTAAAT 


TGACTAGTTA 


TTAGTTTCAG 


AAAGGATGGA 


ATGAATGAAG 


10440 


AGTTTAGTAA 


CAAATTCGCA 


ACATCGTGCC 


AATGCTCGCA GAgCTCTTGA TGGGCAATGG 


10500 


GGGATCATGG 


CTTGGTTAAC 


TTTCTTAGGA 


GTGGTTTTAC 


AGAGCTTCTT 


AACATCGATT 


10560 


GTGCAAAATT 


TATTTTCTGG


TGAAAATCAA 


GTTTTTCAAA 


GTAGTTTTGC 


CGCAGTATTA 


10620 


CTGCAAATTT 


TTGTTTTATT 


CGCGTTAAGT 


TATGCGCTAT 


ATTACGCAGC 


CTTGAAGGTG 


10680 


TTACGTGGTG 


AAAAGGTGCG 


GGTCAATATT 


TTAATGAGTG 


TTTTTCAAGG 


AAAATATTAT 


10740 


GGTCCGCTAT 


TTGTGGTCAA 


CCTACTTCAA 


AAAGTACTTG 


AACGTGTAAT 


TGGTTTGTTA 


10800 


TTTTTGTTAC 


CAATTTTATT 


TGGGGCAGGG 


ACAACCTTGT 


ATTTCAAATT 


GATGTTTAAT 


10860 


ACAGTTACAC 


CAGAGGAAAT 


CCAAGCCTTT 


TTCTTAAATG 


ATTTTTCCTT 


TTTCCTAGCA 


10920 


TTTGGCGTTG 


CCACTATCTT 


GATCTTTTTA 


ATTGGTTTAT 


TTGTGAGTGG 


CGTTTTTCAA 


10980 


TTTGCTGTGT 


GGCTACGTTT 


TGATAATCCA 


GAACTGCCAA 


TTATGATGGC 


ATTGAAACAG 


11040 


GCGTTATATT 


TAATGAAAAA 


TCGTTTTTGG 


CAATATTTAT 


GTTTACAATT 


TTGTTTTATT 


11100 


GGTTGGTATA 


TTGTGGGCTT 


TTTAGCTATT 


GGTATCGGTC 


TATTATGGGT 


GATTCCTTAT 


11160 


AACTATGTAG 


CGTTACTAGT 


TTTTATCAAA 


CTGCCTTAGA 


AGAAAAAGGT 


TTAGCCGAAG 


11220 


AATAAAAAAG 


AAGCTTGAAT 


CCTTCGTTAT 


CTCGACAAAG 


GGTTCAGGTT 


TTTTCTTTGG 


11280 


TCTTTTTGTG 


AAAATTTAAG 


CGTTTTAACC 


TCGTTTTtCC 


GTTATTTTAT 


GGTAACATGG 


11340 


AAGTATCGTA 


TTGAAGGAGG 


TTGTCCACAC 


ATGAAATTGA 


CTGTCTTAGG 


CTGTTTAGGT 


11400 


GCTTATCCAT 


ACAAAGGAGA AGGAACGAGT 


TCCTATTTGT 


TAGAATCTGA 


AGGCTTTCAT 


11460 


TTGCTTATTG 


ATGCAGGCAG 


TACGACATTA 


GTGCGTTTAG 


AAAACTATCT 


AGATCCATTG 


11520 


AAATTAGATG 


CTGTCATTCT 


GTCGCATTAC 


CATCATGACC 


ATATCGCAGA 


TTTAGGTGTT 


11580 


TTACAATATT 


ATCGTCAGCT 


TTATCCAACG 


CTGGAACCGA 


CGCCTATTTT 


GCCAATATAT 


11640 


GGCCATACAC 


AAGATCAAAT 


GCATTTCAAA 


GAACTAACGC 


TTCCTAACGT 


TTCCGAAGGG 


11700 


AAAGAGTATT 


TTGAAGCAGA 


AGAGTTGGCA 


GTAGGACCAT 


TTTTAATTAC 


TTTCATGGAA 


11760 


ACGATTCATC


CTGTCACTTG 


TTACGCGATG 


CGGATTGTTG 


AAAAAGCGAC 


TGGCAAAGTC 


11820 


TTTGTCTTTA 


CAGGAGACTC 


TGGTTACTTA 


GCAAGTTTTG 


AAGACTTTGC 


AAAAGACGCC 


11880 


GATTTATTCT 


TAGCGGATAC 


GTACTTATTT 


GAAGGTAATG 


AACGACATCA 


TGCACATTTT 


11940


ACGTCAAGAG 


AAGCGGGTGA 


AATTGCCAAA 


GCAGCACAAG 


TCAAACAATT 


GATGTTGACC 


12000 


CATTTACCAC 


AGCATGGGTC 


ATTGGAACAA 


CTACGGGAAG 


AAGCGCAAAA 


CTATGCTGGA 


12060 


ACGGAAATTC 


CAGTGACATT 


AGCCCAACCA 


GATTTAACAG 


TTGAGATTTA 


AAAAAGTAAC 


12120 


TATACCAAGG 


GAGAGAACCG 


ATTGTGATTT 


TTGTACCGAA 


TGAAAATAAT 


GACCCACGAG 


12180 


TGAACTTAGC 


AATTGAAACA 


TATTTGTTAA 


CAGAAATGCC 


TTTAGATGAG 


CCCATTCTTT 


12240 
TATTTTACAT


TAACGAGCCG 


TCAATCATTA TTGGCCGTAA CCAAAATACA ATTGAAGAAA 


12300 


TTAACAAAGA 


ATATGTAGAT 


GAGCACGGCA 


TTCATGTTGT 


TCGTCGCTTA 


AGTGGCGGCG 


12360 


GCGCCGTTTA 


TCATGATCAT 


GGTAATTTGA 


ATTTTAGTTT 


TATTATGCCA


GATGACGGCA 


12420 


ACTCTTTCCG 


TGATTTTGCG 


AAAGTTACCC 


AACCAATTAT 


TCAAGCATTG 


CACGACTTAG 


12480 


GCGTTGAAGG 


TGCCGAATTA 


AAAGGGCGCA 


ATGACCTAGT 


GaTTAATGAT 


ATGAAGTTTT 


12540 


CAGGAAATGC 


GATGTACGCT 


ACCAACGGCC 


GCATGTTTGC 


ACATGGAACC 


TTAATGTTTG 


12600 


ATAGTGATAT 


TGATGAAGTT 


GTAAATACAT 


TAAAAGTACG 


GAAAGATAAA 


ATCGAATCTA 


12660 


AAGGTATCAA 


GTCTGTTCGT 


TCaCGGGTAA 


CAAACATTAA 


ACCATTTTTA 


TCTGAAGATA 


12720 


AACAAGAGAT 


GACAACGGAA 


GAATTCCgCC 


aAGAAATTCT 


TTTGAAAATT 


TTCGGCGTAG 


12780 


ATTCGATCGA 


CCAAGTTAAA 


ACGTATGAAT 


TAACTGATCA 


AGACTGGGCG 


GCTATTAACA 


12840 


AAATTTCAGA 


GCAATACTAT 


CGCAATTGGG 


ATTGGAATTA 


TGGTAAATCA 


CCTGCATTTA 


12900 


ACTTAGAACG 


TCGCCATCGT 


TTTCCAATTG 


GTTCAATTGA 


AATGAAAATG 


AACGTGGCGG 


12960 


ATGGTGCCAT 


TCAAGAAATT 


AAAATCTTTG 


GGGACTTCTT 


TGGGCTTGGT 


GAAATTAAAG 


13020 


ATGTTGAAGA 


TATTTTAACC 


GGCGTGAAAT 


ATGATAAAGC 


TTCTTTAGAA 


GAAGCCATTG 


13080 


ATCAAATTGA 


CGTTAAAAAA 


TACTTTGGCA 


ATATCGAAAA 


AGAAGATTTG 


CTTGGGTTAA 


13140 


TTTATTAAAA 


AAAAAAGACA 


GCCGTTTCAA 


ACGGTTGTCT 


TTTTTCTAGC 


TAAAAAATAA 


13200 


TTTTTAGCTT 


TTAGAAAGGT 


AGTGCTATGT 


GCAAAATCGT 


CGAAACGATT 


TTTATCaGGG 


13260 


AACTATCAAA 


TAGCTCGCCt 


GCATATGTAC 


TTGAACCAAG 


TCAAGTCCGG 


GAATTTGGTT 


13320 


ATGTGCTTCA 


CTATAAATCA 


CTATCTGTAA 


AGAGTATACA 


AGATTTCCTT 


TCTTTTGTGA 


13380 


TATCAAATTT 


TTACAGACAA 


AGTrGTAAAA 


TTTGACGTTT 


CTTATTTTrG 


CAAGGTTATA 


13440 


GGCTTGTTTT 


GGGGCTTGAA 


nTGGCAAAAA 


TTAATAGTGA 


TTTAnGGGAA 


ATATTnACAT 


13500 


ATGTAGAAGT 


ATTTTTATAA 


GAATAACGAG 


AAAAAATCAA 


TAGGTATGTT 


CTTGAGGGCT 


13560 


ACTTTTTGAC 


GTTTTCTAAT 


TAAAmCTTAw 


TAmaaAATAG 


CGGAAACaAA 


GnAAAAATGT 


13620 


GCTTACAAGA 


AGTAATCCGC 


ACTTAGCTAG 


GAAGTTAAGA 


ATGTGCATGT 


TATTTTCAAA 


13680 


CAAATAATAA 


GAATCCTATT 


TCAATCCTTT 


ACAGAGGAAC 


AAGGACTTCG 


TATAATAAAG 


13740 


AAACGAAAAG 


GAGCGAAAGT 


GATGAGAAAC 


GAAATGATGC 


ATCGACCAGT 


CGTTAAACCA 


13800 


GAATTGGTTG 


AGTATATGCG 


TACAAAACAA 


AAGAAACTTC 


CCGGAGAATT 


AGGTGCAGTG 


13860 


GAGGCGGAAG 


CCAATGAAGC 


AGGGGTGCCG 


ATTATTCCGC 


ACGAAACAGT 


AGTTTTTCTA


13920

CAATTTTTGC 


TAGGACAACT 


AAAGCCCAAA 


AATATTTTAG 


AAATCGGGGC 


AGCAATTGGT 


13980 


TTTTCTTCAA 


GCTTGATGGC 


CCAATTTGTT 


GGTGAAGATG 


GTCACGTGAC 


CACGATTGAT 


14040 


CGTTTTGACG 


TAATGATAAA 


AAAAGCGAAG 


AAAACCTATC 


AACGCCTAGA 


GTTAGAAGAA 


14100 


AAGGTCACTC 


TTTTAGAAGG 


ACAAGCAGCT 


GAGATTCTAC 


CAACGTTGGA 


AGGACCTTAT 


14160 


GACTTTATTT 


TTATGGATAG 


TGCCAAATCA 


AAATACATTG 


AATTTTTACC 


TGAATGTTTA 


14220 
CGGTTGCTAC


CAGTTGGCGG 


CGTTTTAATG 


GTGGATGATG 


TATTTCAAGC 


TGGGACAATT 


14280


TTAGACCCTG 


CTGAGGAAGT 


ACCGAAAAAA 


AATCGAGCGA 


TTCATCGTAA 


ATTAAwCCAA 


14340 


TTTTTAGATG 


TAGTCATGGC 


TCACCCTGAT 


TTAACTTCTA 


CTTTAGTTCC 


TCTTGGTGAT 


14400 


GGAGTTATTT 


TAATTACCAA 


AGAGAAAGAA 


ACGATTATTT 


TAGATAGTTA 


AAGAATGTCT 


14460 


TCACAAAAAG 


AAAGCGAATC 


ATTAAAAAGC 


AAGTTCTTTT 


AATGATTCGC 


TTTCTTTTTT 


14520 


ATGTTATTTA 


AAAAGTTATC 


TCGCTATTTC 


TAATTGAGAA 


TGATAATCAG 


AAAATTCTCA 


14580 


ATAGGACAGT 


ATTATGCGTT 


AATAAAACCT 


CGTTTGTTTA 


TAATTATTAT 


ATTGACGGAA 


14640 


TAATTATAAA 


CTGTTATCCT 


TAAAGAGTAA 


AAAAACAGAG 


GAGGAACGAC 


AATGAATTTA 


14700 


ATTAATCaAA AACTTTTTGA CTTTGAATGT 


GATGCCTATC 


ACGATGGTGA 


ATTTACACGA 


14760 


GTATCAACAG 


AAGATATTTT 


AGGAAAATGG 


AGTATTTTCT 


TCTTTTATCC 


AGCTGATTTT 


14820 


TCTTTCGTGT 


GTCCTACGGA 


ATTAGGGGAT 


ATGCAAGAGC 


ATTATGCGCA 


TTTACAGGAA 


14880 


TTAAATTGTG 


AGGTTTACTC 


TGTTTCAGAA 


GATAGTCATT 


ATGTGCATAA 


AGCATGGGCT 


14940 


GACGCTACAG 


AAACGATTGG 


AAAAATTAAA 


TATCCAATGT 


TAGCCGATCC 


CAACGGTCAA 


15000 


CTTGCTCGTT 


TCTTTGGCGT 


TTTAGATGAA 


GCTTCTGGAA 


TGGCTTACCG 


TGCTTCATTT 


15060 


ATTGTGAGTC 


CAGAAGGGGA 


CATCAAATCT 


TATGAAATCA 


ACGACATGGG 


TATTGGTCGT 


15120 


AATGCTGAAG 


AACTTGTTCG 


TAAATTAGAA 


GCTTCTCAAT 


TTGTAGCAGA 


ACATGGTGAT 


15180 


AAAGTTTGTC 


CAGCCAATTG 


GCAACCAGGA 


GAAGAAACAA 


TTGCGCCAAG 


CTTAGATTTA 


15240 


GTTGGTAAAA 


TTTAAATTCG 


CTTTGTAAAT 


AGAGGTCAAA 


GAGTTGGATA 


AAACAGAGAA 


15300 


AATGTTTTGT 


TCAGCTCTTT 


TTCTGAAACA 


CAGAAGAAAG 


AGGAGAAGCA 


AGTGACAGAA 


15360 


GAAATCTACG 


ATTTGATTAT 


TATTGGTGGC 


GGCTCAGCTG 


CTTTATCTGC 


AGGTATCTAT 


15420 


GCAGGTCGCG 


CAATGATGGA 


TACCTTAATT 


ATTGAAAAAG 


ACAAAATTGG 


CGGACAGGTA 


15480 


ACGACGACTT 


CTGAAATTGT 


GAACTATCCA 


GCGATTCGTC 


ACACAACAGG 


TCCAGAACTA 


15540 


ATGGAAGAAA 


TGCGCATTCA 


AGCACAAGAC 


TTCGGTGTAG 


CTTTTACAAA 


CGACGAGATT 


15600 


ATTGATGTTG 


ATTTTAGTCA 


AACGATAAAA 


ACTGTTCAAT 


CGGCGAGTCA 


AACCTATCAA 


15660 


GCATATGCAG 


TATTGATTGC 


GACAGGGGCC 


TCTGCTCGTA 


AAATTGGTTT 


CCCTGGTGAA 


15720 


AGTGAATTTA 


CAGGACGCGG 


CGTGGCCTnA 


TTgTTCTACT 


TGTGATGGGG 


AGTTTTTCCA 


15780 


AGGCCTAGAT 


ATTTTTGTGA 


TTGGTGGCGG 


TTATGCAGCA 


GCGGAAGArG 


CCGTTTATTT 


15840 


GACACGTTAC 


GGTAAATCAG 


TAACGATGAT 


TATCCGAGAA 


CCCGATTTTA 


CATGTGCAAA 


15900 


ATTAACTGCG 


GAGGCGGCTA 


AAAATCACCC 


GAAAATTAAG 


ATTGTTTATA 


ACATAGAAGT 


15960 


GAAAGAAATT 


ACGGGAGATG 


ATTTTGTCCG 


AAAAGCTGTG 


TTTGTCAATA 


ACCAGACGGG 


16020 


AGAAGAAACG 


GTTTATGAAG 


CGCCAAAAGA 


CAGTACATTT 


GGTCTGTTTG 


TTTTTGCTGG 


16080 


AAATAAACCA 


AGTACAGAAA 


TTTTTGAAGG 


AAAAATTGCC 


TTAGACCGTG 


GGTATGTGCC 


16140 


AACAACTGAG 


AATATGGAAA 


CCAATATTCC 


TGGAGTTTAT 


GCAGCAGGAG 


ATTTACGAAT 


16200 
AAAAGAATTA


CGTCAAATTG 


TGACGGCGGT 


TGCAGACGGT 


GCAATTGCCG 


CCACTCATGC 


16260


GCAACGTTAC 


GTGACAGAAC 


AAAAAACACA 


GGCGGGCCAA 


TCGATTGTAA 


CGAAGCGAAT 


16320 


GACTGAACGA 


TTAGCTAATC 


AATCTGCCCC 


TGAAACAAGC 


AGTCAGCAAC 


CAAAAGAAAA 


16380 


ACAGCCAGCG 


AAAGTTACTG 


GTAAGCATCA 


GTGGTTCCCA 


GAAAGTATGC 


GCCAACAATT 


16440 


AAGCGGTATT 


TTCGCTAAAC 


TAACCAAAAA 


AGTGACATTG 


CTTCAGTTTT 


TGGATGCAAG 


16500 


TGATGAAAAA 


TCATTGGAAC 


TTCAGTCATT 


CTTAACAGAA 


TTTGCAAGTC 


TGGATCAGAA 


16560 


GATTACTTTA 


GAAACGATTT 


TGAAAGATAC 


TGAGCCAGCC 


AAAGAATTGC 


TGTATGGGAT 


16620 


TGAAAAAATG 


CCTAGCGTTG 


TGCTCTTAGA 


TGCGGCAGGA 


AACTATACAG 


GAATTAAATT 


16680 


TAGTGGTATT 


CCTAGTGGCC 


ATGAAGTAAA 


CTCGTTGGTT 


TTGGCGGTTT 


ATAATGTCGG 


16740 


CAGCGAGGGG 


CAGCCGTTAG 


AAGCAAGTCT 


GCAAAAGAAC 


ATACTTGCAT 


TGCCAAAAAG 


16800 


AAAAATAGAA 


ATCTTTGTTT 


CACTAACATG 


TCATTTCTGT 


CCCGATGTCG 


TGGCGGCTTG 


16860 


CCAACGTATT 


GCATCAATTA 


ATCCTCATGT 


GGaAGCAGAA 


ATGGTGGACA 


TTAGTCTATT 


16920 


TCCAGAGTTG 


AAAAAAGAAA 


AGAAAATTAT 


GAGCGTTCCa 


GCAATGCTGA 


TTGATGGCGA 


16980 


ACAAATGATT 


TTTGGTTCAA 


AAACAATGAC 


CGAAATTATT 


GAAGCCTTGG 


CTTAAAAGAG 


17040 


GCGGGAAAAG 


TAATTAGCCT 


AAAGAACCTA 


AGTCATTCAA 


AAAAGTGAGA 


AGTACGTCGA 


17100 


GATGACGTAT 


TTCTCACTTT 


TTTCGCATAG 


TTAGTCAAGT 


CTCTCTTTTG 


ATTAGGCGAT 


17160 


TGAGTTTCCT 


AAAAAGAATT 


AAAAAGACAC 


AACAGCAAAT 


GAACAATAGG 


CTTCCTGTTA 


17220 


GTGGTAAAAA 


GATACCTTGC 


ATGTAGCTAT 


CGATATGCAT 


TAAACCAGGT 


ACCTCGCTGA 


17280 


AAAAATCATG 


GATAAACTGT 


TGAATGAGCT 


TATTGGCTAG 


AAAAGCAGCT 


AAAGGCAAAA 


17340 


TAAAAATCAA 


ATAATAGATG 


GGTTGTAAGG 


CATAGTTAAT 


TAAaAATTTG 


TAATTTTGTC 


17400 


GTACCTAAAA 


GAGCTAAGAT 


CAGTAAGTCG 


TTTTTTTTGC 


TCTAAAAAAG 


TTAGGAAGAA 


17460 


AGTTCGGCAA 


ATATAACTAA 


TCAGGAAGAA 


AAAGAGTGTT 


CCAGCGctGA 


TTCCAGCCAA 


17520 


ACCTCTGAAA 


ATTTTCAAGG 


GAGGAATCAC 


TGCATAATTA 


TCAAGAAGAA 


TCATTTGTGA 


17580 


AAGCCATCGA 


GAAAACAACA 


AAGTCAATAA 


TAGAGCAGAA 


AAAGTCAGAA 


AACTAATAAT 


17640 


AAAGATAAGA 


ATCAACGTAC 


TAAATTGGTT 


TTTTGAGCGC 


CACGCATTTT 


TGAGAGAGTC 


17700 


TGTGAATAAG 


TAAAGAATCG 


AATACATCAT 


CGCGTCCTTT 


CTGAAAAATT 


TTTGTTTTAG 


17760 


GGGAAAAGTT 


ATGTAAAAGA 


AAAGACGTTA 


TCTCCTGCCA 


ACTAGCCACT


GAAAGAGATA 


17820 


ACGCCTTCGC 


TGGAAACAAG 


AACTGGATTA 


TTTCACCAAT 


ACTAATTTTT 


TTAAAGGTTC 


17880 


AACCCCAAAT 


CGTTGTTCTT 


GCTGTAAAAG 


AATTTCGGCA 


CAAGCTCGGC 


TATCTTCTAA 


17940 


AGCATCATGa 


TGATTATTTA 


AAGTGATATT 


GAGGTTTTCA 


CAAACgGTAT 


TTAATTTATG 


18000 


ATTAGGAAAT 


TCAGGAAACA 


GTTTGCGACT 


CGTTTTGACT 


GTACATAAAG 


AAAGGTAATT 


18060 


AGGTTGTTCT 


AAGCCATAGT 


AATCTAAACA 


GCCAGCCAAA 


ACACTACAAT 


CAAAACTTGC 


18120 


ATTGTGAGCG 


ACCACTAACC 


GATTTGGTTG 


AAAACAATGC 


TGAATTTGTT 


GCCAGACTTC 


18180 
AGGGAATTTG


GGCGCATTGA 


CAACATCTGA 


TTCATGAATG 


CCGTGAATTT 


GCGTATTTCG 


18240 


CCAGAAAAAA 


GGTGTTTCTG 


GTTTGATTAA 


CGTGTAAAAT 


TCATCCACAA 


TCCGACTATT 


18300 


TTCTACTTTA 


ACAATTGCAA 


TGGAACAGGC 


ACTATGTTTA 


TCATAACTAG 


CTGTTTCAAA 


18360 


GTCAAAAGCG 


TAAAAGTTCA 


TTAAAAAAAA 


CTCCTTTnCA 


ACGCGTTGAG 


TAmCACTTAA 


18420 


AAATCAAGTA 


CAACTTCCAC 


TGGACAATGA 


TCACTACCAA 


GAATATCTGT 


ATGGATTTTA 


18480 


GCACTGACTA 


AACGTTCATT 


CAAGGATTCT 


GACACAACGA 


AATAGTCAAT 


CCGCCAACcT 


18540 


GsGTTATTTT 


TTCGTGCATT 


AAAGCGATAA 


CTCCACCATG 


AGTAGATGCC 


CTCAGCTTCT 


18600 


GGATAAAAAT 


AACGGAATGT 


ATCAGTGAAA 


CCACTGTCTA 


ATAACTCAGT 


AAATTTTTGC 


18660 


CGTTCTTCAG 


GCGTGAACCC 


CGCATTTTTT 


TGATTGGTTT 


TCCAATTTTT 


TAAATCAATA 


18720 


TT I 1 (jAI tirtVj 


TTAf^TTTAA 


GTCACCGCAA 


AGAATGACGG 


GTTTTTCTTG 


ATTTAATTCG 


18780 


TTTAAATAGG 


CACGAAACGC 


ATCTTCCCAT 


GTCATTCGAT 


AAGCTAAACG 


CTTTAGTTCA 


18840 


GCTTGTGAGT 


TTGGTGTGTA 


GCAAGTGACC 


ATGAAAAATT 


CTGGATATTC 


TAAAGTGATC 


18900 


ACGCGGCCTT 


CTTGATCGTG 


AGCCTCGATA 


CCTAAGCCAT 


AGCGAACACT 


TAACGCTTCT 


18960 


TCTTTAGGCG 


AAAAATTGCT 


GTGCCAGAAT 


AGCCTTTTTT 


nTCGGCGAAC 


TCCAnTACTG 


19020


AnGA


19024

(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 



GTGACAAtGA 


TTTATTTTGA 


TAATAGTGCA CTACACCGAT 


TTACCCCCAG 


GCTTTAGATA 


60 


CATATGTAAA 


AACCAGTCAG 


CGGATTATTG GCAATCCGTC 


TAGCCTACAT 


GACTTAGGCA 


120 


GCCAAGCGAA 


TCGGCTCTTA 


CAACAAGCAC GGAAGCAGAT 


TGCGGAgcTA 


ATCAATGTTT 


180 


CAGCAAATGA 


AATCTATTTT 


ACAAGTGGCG GAACTGAAGG


AGATAACTGG 


GTTTTAAAAG 


240 


GAACGGCAAT 


CGAAAAACGA 


GAATTTGGTA ATCATATTAT 


TATTTCTGCA 


GTGGAACATC 


300 


CAGCAGTTAC 


AGAAACAGCG 


GAACAATTAG TCGAACTAGG 


TTTTGAACTT 


TCTTATGCGC 


360 


CAGTGGATAA 


AGAAGGTCGT 


GTAAAAGTCG AGGAACTACA 


GAAATTAATT 


AGAAAAGAAA 


420 


CGATTTTGGT 


TTCTGTTATG 


GCAGTGAACA ATGAAGTGGG 


AACGATTCAA 


CCAATTAAAG 


480 


AAATCAGCGA 


GGTTTTAGCG 


GAGTTTCCTA AAATTCATTT 


TCATGTAGAC 


GCGGTTCAAG 


540 


CAATTGGGAA 


AGTTAATTAC


TCAGAGTGGT TAACGGATCG 


GGTGGATTTT 


GCAACGTTTT 


600 


CTGCACATAA 


ATTTCATGGA 


CCACGAGGCA TTGGTTTTAT 


GTATTGGAAA 


CAAGGAAAAC 


660 


GTTTAGCGCC 


GCTTTTAACA 


GGAGGCGGGC AAGAAAACAA 


CCAACGCAGT 


GgGACAGAAA 


720 
ACGTACCAGC AATTGTGGCA ATGGCCAAAG CGTTGCGTTT GCATTTAGAG AACGAACAAC 780

AACGGCCGCA ACATGTGGCG ACATTACGGA gTTATTTATT AAAGGCGCIA GAAGAATTTC 840

AAAATGTGAC TGTTTTTTCA CAAGATAATG AACATTTTGC GCCACACATT TTATGTTTTG 900 

CTTTGAAAGG GATCCGCGGC GAAGTGCTAG TCCATGCATT GGAAGAAAAA CAGATTTATA 960 

TTTCGACCAC GAGTGCTTGT TCTAGCCGGA AAAAAATGGC TAGCAGCACC TTATATGCCA 1020 

TGCATGTGCC AGGGGAATTA GCGACTTCCG CTGTGCGCAT TAGTTTAGAT GAAAGCAATA 1080 

CAATGGCAGA AATTGAACAA TTTATGATTG TTTTCAACCA ACTTTATCAA AAATTTTCTC 1140 

GTGTAAATTA ATAaAAAATA AGTTGGAGCC TAGAAATAGG CTCCAACTTA TTTTTGCTCT 1200 

CTTTATCGAT TGAAGCGGTA GCCaCTTCCC CAGAGCGTTT CAATTGGAGA GTCAGAAAGA 1260 

TTGGCTTTTT TTAATTTTTC CCGAATCCGT TTAATGTGAA CAACGACAGT CGAAACCTCA 1320 

GTATCTAAGG CATCAAACCC CCAGACACTT TCAAAAAGTT CTTCTTTATT CCACACACGG 1380 

TTAGGATGTT CCATCAAGAA AACCAGCAAT TTGTATTCTT TGCTTGTAAA AATAACTTCA 1440 

TTTTCTAAAA CAAACACTTT GTGGGCGGCA GTGTCGACAG CGATTTCATT GATTTTTAGT 1500 

AATTGTGGTG GTTGTTCTGC TTTGCTTAAC AATTGATAAC GGTTCATATG GGCTTTTGCT 1560 

CGAGCGACTA ATTCATTGGG ACTAAACGGC TTGATTATAT AATCATCCgC ACCTAAgCCA 1620 

AGCCCACGaA TTTTATCAAT ATCTTCTTTT TTAGCCGAAA CAATCATGAT CGGCGTTTGC 1680 

TTTTTTTTTC GAATTGCACG ACAGATTTCA AAACCATCCA TAGTTGGCAG CATCACATCT 1740 

AAAATAATTA AATCAAAGGG CTCATTTAAA GCTGCTTCCA ATCCTTTTTT TCCATCATGT 1800 

TCAATTGTGA CAGTCATATC GTTAATTTCT AAATAATCTT TTTGCAAATC AGCAATGCTT 1860 

GGATCATCTT CAATAATTAA TACATGTGGT TTCATTTTTC ATCCTCCAAC AACGGCAGCG 1920 

TAATTATCAC GTTGGTGCCG TCTCCTTTTT TACTTGTGAC AGTGATTGTT CCCTGATGAT 1980 

AGTCAATAAT TTGTTTGACA ATACTTAATC CCAAGCCGCT TCCTTTAACT GTTGGGGTTC 2040 
GAGATTTGTC TACACGATAG AAGCGTTCGA ATAAATAAGG TAATTCTTTT TTATCGATAC 2100 
CAATGCCGTT ATCTGTAATG GTTAAAACTA AGTCTGTTTG ATTATGCGTT AAAGAAATCG 2160 
TAAAGGCTAG TTGTTCTTTA GTGGGATCCG CAAATTTAAT GCTATTTTGA ATTAGATTGG 2220 
TAATCACGCG ATTCATCTGC GTGGGATCCA TTTGAACGTA TAAAGCTTCG GTGGGAAGTA 2280 
CAGAAGTGAT TACGAGTTCT TGTTCCAAAC GGTATTCTTC AAGAATATGT GCAATAAAAC 2340

GAGTGAAATT GGTCTTTTCC ATTGTAAAGA CAGCACGGTC TAGGTCTAAT TTAGAATACA 2400
AAAATAATTC TTCAATTAAA TCATTGAGGC CTAAACTTTT TTCATGAATG ACCGTTAAAT 2460

AGCGCTGTTT TTTTTCTTCA GTATTGGCAA CGCCATCCAT CAAACCTTCA ACATAACCAA 2520
TAATAGAAGT AATTGGTGTT TTTAAGTCAT GAGAAATATT TGCAATTAAT TCTTTCCGAT 2580
TAGCTTCATA TTTTTCGCGT TCCGCATTGG CTTCTTCTAA ATCTTGCCAC ATTTGTTCAA 2640

AACTAAGTTG GAGCTGTTTG ACTTCTGTTG AGACGGTTTG GTGGCTAGTT GGCGTAAATG 2700 
GATTTTCTTT TTGGGCATCC TTACCCAATG TTTTAGTCGC TTTTTCTAAT GCTTCTAATG 2760

GCTCGATAGT GGTTTTGGTT AAGCGCTTAT TAATGAGCCA AGCTGCCGCA ATTGCTACAA 2820

GGATAATTAA CAAAATCATC CAGATAATCC AGCGTGTGAA GAATTCAAAA AGGCTGCTTT 2880

CGCGTTTCAA GACAATAAAG CTCCCGTTAG AACCATCTAA ATAATGAAAA TCGGATTTGA 2940

CATAATGATA TAGACGCCCT GCATTATCCA GTGTGCCAGT TGGCATAATA TTATTCATTT 3000

CGTAGTTTGG AATATGTACA GACAAAGATT TTTCTACTAA ATTATCCGAA TAATAAGGAA 3060

ACCGTGCATT CTTTCGAATA ATCACACTTA ATCCTTTTGC TTCAATTGTT TGAATCGTTt 3120

CCTTTAATTC TTTACTCAAA GGTGTATCTA GTAATTTCGG TGATTTTTTT AGCAGTTGAT 3180

CTAAGGCCAA ATAACTTTCT TCTTCGTTAG CGGTCAATGG TCGTTGCTTA TTCATCATTC 3240

GGTAAGCTTG GGGCAAACTG GGGACAGTTC CTAATGTGAT GTAAGAAGCT AAGGATAAAA 3300

CAGCTAAAAC AGAGGCTAAA GTAATAATGA TGGCACTAAT ATAAGAAATA AAAAAGCGGC 3360

GTTTAATGGT CAAAAGCGTC CCTCCTAGTG GTTTTAATCT ATTTCTTACT TATAGTGTAT 3420

TACAAGTGAA CAAAAAAAAG CTAGTATTGT TATTAACTTT TTTTCATTTG TTTATAATTT 3480

GTTCATTTTT CGGCAAGTAG GAGTTTATAA GGCTTTGCTA TTCTAAATAT ATCAAAAAGA 3540

AAAAAGGGGA TTACCAACCA TGAAAAAGAA AAAAGTTTTT AGTGCGCTTA CCTTATTAAC 3600

CTTTAGTACG TTGTTGATTG CAGGCTGTGC TGGCGGAGCc AACTCTGcAA CAGATAAATC 3660

AAGTGCAGCT AGCTCAAGCA CTGCAGTCTC TAGTTCAGCA GAAGCAGCTA AAGAGCAATC 3720

AAAAGGACAA GAATTAACAG AAATTTTATC CAGTACTGAT TGGCAAGGCA CAAAAGTTTA 3780

CGACAAAAAT mATaATAATT TaACAGCAGA AAATGCTAAT TTTATTGGTT TAGCAAAATA 3840

TGATGGTGAA ACAGGTTTTT ATGAATTTTT CGACAAAGAA ACAGGTGAAA CCCGTGGCGA 3900

TGAAGGCACA TTCTTTGTGA CAGACGATGG CGAAAAGCGT ATCTTAATTT CGGATACACA 3960

AAACTATCAA GCGGTGGTCG ATTTAACGGA AGTGACGAAA GATAAATTTA CCTATAAGCG 4020

AATGGGTAAA GATAAAGACG GGAAAGATGT AGAAGTCTTT GTAGAACATA TCCCTTATTC 4080

TGACGAGAAA TTAACCTTTA CGAACGGCCG TAAAGATTTA GAAACAGAAA CTGGCAAGAT 4140

TGTTACCAAT GAACCTGGGG ATGACATTTT AGGGGCCACA TTATGGAATG GCACGAAAGT 4200

TTTAGATGAA GACGGTAACG ATGTTACTGA AGCAAATAAA ATGTTTATTA GTTTAGCGAA 4260

ATTTGATAAT AAAACAAGTA AATATGAATT CTTTGATTTA GAAACGGGTA AAACACGTGG 4320

AGATTTTGGT TACTTCCAAG TAATTGATAA TAACAAAATC CGTGCTCACG TTTCAATTGG 4380

TGACAATAAA TATGGAGCTG CATTAGAATT AACAGAATTA AATGATAAAC GTTTTACGTA 4440

TACACGAATG GGTAAAGACA ACAATGGCAA AGAAATTAAA GTCTTTGTAG AACATGAACC 4500

ATATGAAGGA GACTTTACGC CAGACTTCAC GTTCTAATTA GTCAATCACG AGTGGATGAA 4560

GCGGTCAAAA GGAAGTAAAG GATTCTTTTG ACCGCTTCAT TTTTAACAAA AAAATAGTTT 4620

ATAGTTGTTT TTAATCGTTT ATAAATTGTT CATTTTTCTG ACAGTGGTAG TTTAGAATCG 4680
TTTGTTAAGA TAGGTTTATC AAAGAAAAGG A£CGATGCTT TATGAAAAAG AAAGTATTAA 4740
GTTCGATTAC TTTAGTAACA TTAAGTACGT TACTTATAGC AGGTTATGCA AGTCCAGCAT 4800

TTGCAGATCA TGCAGCCAAT CCAAATAGTG CTACAGCAAA TTTAGGCAAA CATCAAAACA 4860

ATGGCCAAAC AAGAGGCGAC AAGGCGACTA AGATTTTATC TGGCACGGAC TGGCAAGGAA 4920

CCCGTGTTTA TGATGCTGCT GGTAATGATT TAACGGCAGA AAATGCTAAT TTTATTGGTT 4980

TAGCAAAATA TGATGGTGAA ACCGGTTTTT ACGAGTTTTT CGACAAAAAT ACTGGGGAAA 5040 

CCCGTGGTGA CGAAGGAACA TTTTTTGTGA CAGGTGATGG CACAAAACGA ATTTTAATTT 5100 

CGCGGACACA AAATTATCaA GCCGTAGTGG ATTTAACCGA AGTGaGTAAA GACmAATTTA 5160 

CTTACAAGCG TTTAGGGaAA GATAAACTGG GGaATGATGT TGAAGTTTAC GTGGAACACA 5220 

TcCCtTATCA TGGGAAAAAA TTAGCTTTTA CAAATGGACG TGAAGCATTA ACCAATCAAA 5280 

CTGGCAAAAT TGTGACAAAT AAATCAGGGG ATAAAATTTT AGGAACAACC TTGTGGAATG 5340 

GCACAAAAGT CGTAGATAAA AACGGTAATG ATGTGACAGC GGCCAATCAA AATTTCATTA 5400 

GTTTAGCGAA ATTTGATCCA AACACAAGTA AATATGAATT TTTCAATTTA CAAACAGGTG 5460

AAACCCGCGG CGACTTTGGG TACTTCCAAG TGGTGGACAA TAACAAGATT CGGGCCCATG 5520 

TATCTATTGG TACGAATCGT TACGGCGCGG CGCTAGAATT AACGGAACTA AACAATGATC 5580 

GATTTACGTA TACTCGAATG GGTAAAGATA ATGCTGGTAA TGATATTCAA GTGTTCGTGG 5640 

AACATGAACC TTACCAAGGC ACATATCATC CAGCCTTTAC TTTCTAAAAG TCTCTTGTCT 5700 

TCTCTCTTAT TTAACAAAAG AACGTCCCCC AACGTTCTTT TGTTTTCTCT TCCTCTATAA 5760

ATAAATTGTT ATTAGTTATA ATTAAATATT ATTATTAAGA TAATTATGAT GATTATTATT 5820

TAATAATTTG TTCATATTGT GGACAGGTGG AGTTAATAAC TACTTGCTAT CCTTAAATTA 5880 

TCAAAAGGAA AGGGAGTAGT CGTTTATGAA GAAAAAAATC GTTGGTACAA TTACGTTGTT 5940 

GGCTTTAAGT GCGTTATTAG TTGGTGGAGC AGGAGGGGCT TTGACGGCAG AAGCATACGT 6000 

TCCTCAAAGC GTAGACAATC CCAATAATTT AGGGGATTTA CCTGAGTATT TACGTTCAGT 6060 

TGGTATTAGA CAAGATGAAG GATTATCAGA AAAAGATTGG GCTGGAACAC GCGTTTATGA 6120 

TCGAAATGGG AATGACTTAA CAGATGAAAA TCAAAACCTA TTACATGCAA TCAAATTTGA 6180 

TGCAACCACT AGTTTCTATG AATTTTTTGA TAAAGAGACT GGAGAATCAA CAGGAGATGA 6240 

AGGAACCTTC TTTATGACCG CTGGTATTAC AGATGTTTCC CGTCTTGTAA TTATTTCTGA 6300 

AACCAAAAAT TATCAAGGTG TATACCCACT TAGAACTTTA TACCAAGATA CTTTTACGTA 6360 

TAGACAGATG GGGAAAGATA AAAACGGAAA TGATATTGAA GTTTTCGTAG AAAACAAAGC 6420 

AACCTCAGGA CCAGTTTATG GTCGTCCGCA GCCATACCCC AATAATCGTC CCAGAACACT 6480

AGAATTCACG AATGGACGCC GTGCCATGAC AGAACAAACA GGCCAGATTG ATGTAAATCG 6540

ACAAGGGGAT GAAATTATTG GTAAAACTTC CTTTGATGGG ACACCGCAAC TTCTTTGGAA 6600 

TGGCACAAAA GTAGTGGATA AAGATGGCAA TGACGTAACT TCGGCCAACC AAAACTTTAT 6660 
CAGCTTAGCG AAATTTGACC AAGATAGCAG CAAATATGAA TTTTTCAATT TACAAACTGG 6720
TGAAACTCGT GGCGACTATG GCTACTTTAA AGTAGGAAAT CAAAATAAAT TCCGTGCCCA 6780

TGTTTCCATT GGAACCAATC GCTATGGCGC TGTCTTAGAG TTAACAGAAT TGAATGATAA 6840 

TCGTTTTACG TACACACGAA TGGGTAAAGA TAACGAAGGA AACGATATCC AAGTCTATGT 6900 

GGAACATGAA CCATACCAAG GAACTTTTAA TCCTGAATTT ACCTTTTAAG AACAAAACAT 6960 

ACGAATGGAA ACCAAAAAAA GTTTCGACTT CTTTTGGTTT CTCTCTTTTT AAAATAGAAA 7020 

AACAGAGAAA GCGTTTTAAT CGACAAATAA AAAAAGATTC GTTAGATTAG AACCACTGGA 7080 

GAAAAATCTC ATATTTCTCT CGAGGAAAGG AAGTTGAGCA CAATGAACAA AAAAATTTTA 7140

ATGGGGCTAT TAAGTGTCGT GACCATTCCA TTACTTGCTG CGTGTCAAGG AGGAGAAACA 7200

CCTTCCGCAG CGTCAAAAAA TAGTCAAACG GTGACTACTC AAAGTAGTGC AAAAACTGAA 7260 

AGCACCAGTA CAACCCGTTC GGTAGCTCAA ACAACATCAA AAGAGGAAGT GAAAGAACCG 7320 

ATGAAGACCT ATGAAGTGGG TGCGCTTTTA GAAGCAGCCA ATCAACGAGA TACGAAGAAG 7380 

GTCAAGGAAA TTTTACAAGA TACTACTTAT CAAGTGGATG AAGTCGACAC AGAAGGCAAC 7440

ACACCGCTCA ATATCGCTGT TCACAATAAT GACATTGAGA TTGCAAAAGC GTTGATTGAT 7500

CGGGGTGCCG ATATTAATCT GCAAAACAGC ATTAGTGATA GTCCCTATCT TTATGCGGGA 7560

GCGCAAGGAC GTACGGAGAT TTTAGCGTAT ATGTTAAAAC ATGCGACCCC AGATTTAAAT 7620

AAGCATAACC GTTACGGTGG CAATGCGTTA ATTCCGGCAG CTGAAAAAGG ACATATTGAC 7680

AATGTGAAGC TCTTGTTAGA AGATGGACGA GAAGACATAG ATTTCCAAAA TGACTTTGGC 7740

TATACAGCAT TGATTGAGGC AGTGGGGTTA CGTGAAGGGA ACCAACTTTA CCAAGATATT 7800 

GTAAAATTGT TAATGGAAAA TGGTGCGGAT CAATCCATTA AAGACAATTC TGGTCGAACA 7860

GCAATGGACT ATGCCAATCA AAAAGGTTAT ACGGAAATTA GTAAAATTTT AGCACAGTAC 7920

AACTAAGTAT TGTGTCCTTT CGGGAAGGAA TAAATACAGG TAGGAGTGAA GGAAAGCATG 7980

AATTATTGGT TAAAAAATGT TCGAATTGAA ACAGAGTATC AAAAAGAAGG TGACTGGTTA 8040

TCTGGGACAG TAACCAAAGA AGTGGCAATT GAAATAACAG ATGGAGTATT TACGAAAATC 8100

ATCCCCAATG AAGAATTGGT AGAAGAACCT GGGTTGACAA TTATTGATGG TCACCATCAA 8160 

TTGATTTTAC CAGGTTTGAT TGAAAAACAT TGTCACTTTG ATAAAAGTAA ACTAGGTGTT 8220 

CCGTGGTATC CGGTGACGCC AGCGAAGTCA ATTGTGGAAC GTGTCGAAGC AGAAATCCCT 8280 

TATTTAGATA GTTTAGAACT TTCTTTAACA GAACGCGCCA ACCATTTAAT TGACTTAGAA 8340 

TTACCACATG GCGCGACAgc CTTCAGAACA CATGTCGACG TGGAACCGAT GACAGACTTA 8400

CGTTATTTTG ATGAAGTCCA AGCGTTGGCC CAAACGAAAC CATTTGCGGT AGAAATTGTC 8460

GTCTTTCCAC AGCACGGCTT ACTGCGTTCA GATTCCGTTG AATTAGTTGA TCAAGCATTG 8520

GCAAAAGGCG CCGATTTTAT TGGTGGCGTG GATCCTTATT CACTAGATGG TGATTACAAA 8580 

AAATCATTGG CTGAAACATT CCGCTTAGCA GATAAACATG GTGTCGGTGT GGATATTCAT 8640
TTACATGACC GTCACGAAGC TGGGACAACA ACGATTAAAG AAATTATCCG TTTAACGAAA 8700

GAATATGGCC TACAAGACAA AGXATTXAXC AGCCATGCCT TCGGGTTAAA TGATTTCATT 8760

GGTGAAGAAC GTGCCGAAGT TTATGATGCT TTAGCAGCAG AAAAAATTCA TATTAATTCC 8820 

AGCGTGCCAA TCACACCTAA TACGATTCCG CCAATTATGG AATTGTTACG TCATGGAGTC 8880 

AATGTCCATC TTGGTTGTGA CAATATCTAC GATTGCTGGT CTCCCTATGG TGATGGTTCA 8940 

TTACAAGAAA AATTGGCTCG TTTAGGTGAA CTTTTTAACG TGAAAGATCA AGATGCTTTA 9000 

ACCCAATTAT TAGGCTTAGT GACAGATGGT GTCACAACTT TAGACGAACA AGGGCAAGTA 9060 

AATTGGCCGC AAGTTGATGC TGAAGCAACT TACTTACTCA CTGAAGCAGA ATGTGCGGCG 9120 

GCTTTTGTCG CTCGTCAGAC ACCGGTGGAA ATCAGTGTGG TAAAAGGAAC CACACTTTAT 9180

CAAAAATAAC AAAAAGAAGC GAGATTTCTC GCTTCTTTTT TTGTTATTAA ATTTATTTGG 9240 

ATTAGACGCT TAATTTTACA ATTGAAAACG ACATCCATCT TATATTTCAT TGATGAAATT 9300 

AGTGAGAAGA AGTATGATTT AATTTAGCTC TGATTACTTT TGATACATCG TTTTTCAACT 9360

GATCAAAATC TGATCTATAA GTAGCAATAA AAACCAATAA ATCAACGCCT TCTTGCGTAT 9420

CTAATTGGAA ATCATAACCG TTTAATTGTA AAAATAAGTC GGTTGCCATT ACAGCTGTCC 9480 

GTTTATTTGC ATTGTAAAAA CAATGCTTTG TTGCAATGTT AATCATTAAA ATAGCTGCTT 9540 

TTTCTTCAAT TGTTGGGTAC AGGTCAACGT CAAAGACGAA TTGTGCTGGT TGGTTGCTAG 9600 

CCATTTGTAA TGCATTACGA TCTTTGATGC CAATTTGTTC TGCGGGAGAA TACGTTTGGA 9660 

TCAAGAAAGT ATTCATTTGA AATAGATCAT CCGCTGATAA ATAATAAATA CTTTTCATTT 9720 

AGTAACCAAT TCTTTGTACA GCTCATTATG TTTTTGTTTT GAGTTCTCCA TCATTTGTTG 9780 

TAATTTTTTT TGATGTTCTG ACATATTTTT TTTAACAATA ATATCTTTTA ATTTTTCyTC 9840 

GTCTACATAA ACGGTGTCTG TCGCAGTCGC ACCAATGCTT TCTAGAAACT CTTTTGACAA 9900 

TGTCATAACA ACaGAATTCC CAACTTTTCT AAGTTTTCTT TCTTTGATTT CCACTAGTAA 9960 

CCCaCCTTTC GTATATATTG TATATACAGT ATATCATGGA TCATAAAAAA GTTTCTATCA 10020

CAAGTCTAAT ACTTTAATAA AATTTCTTGA TTGCTTATCG TTCCTTCAAC CAGCTATCCA 10080 

ATTGCTCAAA TAAAGTGATG ACCGAAGGAT CTTTGGTTTG TTTTAAAAAA TCATGTTCTA 10140 

AATAGTAAAC CGCTTTAAAG GTACTCTCAG GAATGGTCCG TCCGATTTTT TTGCTATAGC 10200 

GGAACGGCAC TTCTTCGTCT GAGCTGCTAG CTGTACTGAA ACAAGGTGGA AAAGTTTTTA 10260 

ATGTTTCGTC TGACAGTGCG TAAGCAGACC AGTCCCCGTT TTCTGGTAAT CCGTAAAAGT 10320 

GCGGCAATAG CGCTTGCTGA ATACTGTAAT GATACAGCAG ATAGCGAGAA AGAAAAGGAT 10380 

CATCCCAAAC AGGCTTCGTT TGGTCGATAG CAGCTATTTC TTTAGCGGAA ATGGCTTGCT 10440

TTAATAATTT CCGTGGCTCT TTAATAAATT CTAAATCTGT ATAGCCGTAA AAATTGACTA 10500

AAAATTGTGG CGTCAAATTT AGTGTTTGTA ATTGTTTGGT TAACTGTAAC ATTAAATAAC 10560 

CGCCAGCGGA ACGCCCGCAC AAACCAAAGG ACTGGTTTTG AATAATCTCT TCGTTTAACA 10620 






ATTGGAAGGT 


TTCCGTTAGC 


GTCCTTAAGA 


TGTGATCGAT 


TTTTGTATTT 


GGAGCCAACA 


i n con 


AATAGTCCAG 


TGCTAAAACG 


GTATAGCCAT 


TGCTTGTAAA 


CAGCT.CCTTC. AGTTC£T£TG 


± U / ft u 


GTAAATCACT 


TTTGGTTCCA 


TAAATCATCC 


CGCCACCGTG 


TAAATAGACG 


ACATAGTTCG 


1 O O 


TTGGTTCAGT 


CGTAGTGGGG 


TAGATAGTAA 


CTGTTGCGCC 


ATTAf^rGAGT 


ftTTTGATTGT 


lOo oU 


TTTTCAAGTG 


CACGTCTCCT 


CATCTTTATA ATGTATGTGG 


CAATTTCATA 


ATTTAACAAT 

n xxx nnv< nn a 


10920 


TGCAAGGGAT 


TGGCTAAGTC 


AATATCAAGA AGTTGTTCCA 


V_«illOlll /Art 


APHATAACGA 


10980 


ATCGTTTTTG 


AATGAAGGAA 


CATCGCTTCA 


GAbbl L I b 1 1 


TATAHTTflPP, 


ATTATTTTGG 


11040 


AAAAAACTGT 


ACAAGGTTTC 


AAACAAATCG 


TAGTTATTTT 




ATAAACITGCG 


1 1100 


CGAGGAATTA 


ATTCGTCTAA 


ATTTTCCAAT 


TGGTCTtCGC 


UUnlnnndl 1 


TTTAAATACC 

111 rv\n 1 *Av- ^ 


11160 


CCAATATCGG 


CTAAGGTGAC 


AATTGGTCCG 


TTATAAAAAG 


ill \*r\ 1 1 rtrtrt 


P,PHP ACAATG 


11220 


TCCAGACATT 


CAATCAGTAG 


CTCTTTAATC 


CCTTCTCGTG 


ATTTTAAAPT 
Mill! rt/Artv^ i 


ftPTCIACGGCA 


11280 


ACGGTAAGCG 


TGTCGTTTTC 


TTCTAAAAGT 


fft^i TV /■* rn TV TV rn ta 

TCAGCTAATA 


A i\P(^APTT A Zi 


TTP,PAP,PTTT 
1 1 O LnOv«- 111 


11340 


GTAATTTTTC 


CATCACTTTC 


ATTGAAGTTA 


TV TV TA TV 7\ /~1 ^ 

AAAAGGACCG 


PPPZlZiTAZiTT 


TAP,PTP,ATPA 


114 00 


AAGAAGATGG 


ATTTCACACG 


CAGGGTACGA 


AGTAAGGCAA 


TAATTPf^TTT 


111 1 \3t\\3 l 


114 60 


TGTGTATTGG 


TGTCTTTGGT 


AGAAAAAGCA 


ATCbCTTbbl 


ZlZiP & ATPTAI^ 
rtrVb AH 1 1 


PTPTTHPATT 
1 1 lo\_rti i 


11520 


TGCACTTCAC 


GAAGCAAACT 


GTTTAGCTCG 


TCAGGATTT f 


PPP.P.TP.T ATT 


TTP,TAAAATA 
1 i ul nr\nr\ l rt 


11580 


GCATCCGCTA 


AGTTGTTCAA 


ACGAGTATAC 


CGTTCTTTTT 


TTAHPZiAZiTT 


TTPfinTATTG 

1 ILOUlnl IvJ 


11640 


AATTTTTCTT 


GTAAGACATC 


AATCGCATTT 


TCGATAAI bA 


rTAAATPf^T 


TTPTTTAACC 


11700 


TTTTCATCTT 


GTAAATAAAC 


» TA TV /"* in 71 TV Tl 

CAGAAGTAAA 


CAAi 1 Ablbb 


A AT A APT ATT 


H AAAATAGAA 


117 60 


ACTTCTAGTG 


CCAACTGTTT 


TTTGGGGTTb 


bbAl bAbttb 1 


A A AP.T(^TTAA 
rt/Arto loll r\r\ 


AAGAGAGTAA 


llo^U 


TCGTTTTTAG 


TGAATTCACT 


GTTTTTTAGT 


'ptttptpttp 


AP,Af^PAPAAT 


rnrATCATGG 


1 1 O Q <\ 


GGTAAGTCGC 


CATCGCTAAT 


TTCbAbAl b I 


T t n T P P 21 n P a 


TTTTTAAAGC 

11111 rtrtrtVJV* 


GAAGGGTAAT 

\Jiiii\J \J >~J +■ *** * * 


l iy 4 u 


TTAATGATGT 


CATAAAATTC 


TTGCATAATC 


GTCTCAACCG 


TTHCftTAfiTG 

1 1 UVJVJ 1 rtO 1 VJ 


GCGCTCTAAT 


t i n A 


TTGGTGAAAC 


GTTGGCGCAC 


ATCGTAATAC 


GTACGTAAGA 


TATGCGACTG 


ATAATTCAAC 


lzU dU 


AAAGGCTCGT 


AAATCGTAAA 


TAGAATTTTT 


TCATAGCTAA 


TnTPTTPiTf^T 

± Vj 1 *w 1 X U 1 O X 


GATTTTAATT 


lzlzU 


AAAGGAATTT 


CATAATTGAA 


ACATAATTCG 


ATAAACCACT 


PTHHAATCAT 


TTTAATGAAT 




CGATCTGTTT TCAAAATTAA GCCGCTGATG CCAATTCGGC GCATCTTTAC GAAAAAATTT 12240

TCCAACTCTT TTTCGGGTAG ATCATGCAAT GCGTAGAAAG AAGTCAAAAT TAACTGATTC 12300

TTTCTACTCC AGTTCTCAAT GTCAATCGCT TCTAAGACCA TTGCCGAGTC AACTTCATTG 12360

TCTAAACCAA TCTCTTTCGT CAAAATTTTA CCTTTTTTTA AGTCACCTAT CGATAATAAC 12420

TCTCTTAATT TCATCATTTT CTCCTTTTTT CCTCTTATGG ATAAAAATTA TACCATTTTT 12480

TTAGAATTAT GTCCGATGAA AACACTTTCT CGTTTCGATA CAATAGCATT GTAAGGAATA 12540

GATAGGAGGA TGAACAATGT TACACACAGT AATTTTGAAA AACAATTACC AAGACTCCAT 12600






CAACTTAATG TTATTGACCA ATAAAATCAA TGCATTAGAT GGTGTAACCA TGAGTCAAAT 12660

CATGATGGGA ACAGATGCCA ACAAAGACAT CCTTAACAAC ACGAACTTGT TAACGGATGA 12720

GGCCAATAGT GCATCTGCAA ACGACATGAT GATTGTGGTT GACAGTGAAA AAGAAAACAT 12780

CATGGAAGAA GTTATGCCAG CGATTGATGA ATTCTTAGAT GATTTATCAG CAAAGGGAAC 12840

TAGTGAAGAA GCACAAGCAG CAACTTCTTG GTCAGAAGCG TTTGATCTTT TACCAGAGGC 12900

GAATGTAGCG CTATTCAGTA TTCCTGGTGA ATATGGTGCG CCAGAAATGG AACGTGCCTT 12960

AAAAAATGAT TTACACGTGT TCTCTTTCAC GGATAATGTG TCAATTGAAG ATGAAGTACG 13020

TTTGAAAAAA TTAGCGCATG AAAAAGGCTT ACTAATGATG GGGCCTGACT GTGGAACTGG 13080

TATTATCTCA AGTATTCCAA TCGCCTTTAC AAATGTTGTT TCTCCAGGAA ACATTGGCGT 13140

TGTTGGTGCT TCAGGAACAG GGATTCAAGA AGTTACAACG ATTATTGACC GCTTAGGTGG 13200

CGGAGTTGTT CACGCCATTG GTACTGGTGG TCGTGACCTA AGCGACAAAG TAGGCGCTAT 13260

CACAGTGAAA GATGCTATTG TGGCCTTAGA AAATCACGAA CCAACCGATG TGATTACAGT 13320

TATTTCTAAA CCACCTGCAA AAGAAGTTCG TGATGAAGTG GTTGAATTAT TACAAAGCAT 13380

TTCAAAACCA GTCGTTGCGA TTTTCTTAGG AGAAAAACCA ACAAGTCACG AAGGCAAAGT 13440

TTATCTAGCT CATACATTAG AAGAAACAGC TAAAATTGCT GTTGATTTAG CAAATGATGT 13500

GGCCGTGAAG AAAAATTACT TTGAAGCATT AGCAAAACCA GCTGTACCAA CATTACCAGA 13560

AGATAaAGTT GTAAAAGGCT TATATTCAGG TGGTACATTA GCTTCTGAAG CAGGTATGTT 13620

AATTTCAGAA GCTTTAGATC TTGGCGGTTT AGTGAAAGCA GAAGGTTATG TCTTGAAATC 13680

TCACGGTTAT GAAGTCATTG ACTTAGGCGA TGATATGTAT ACACAAGGTC GTCCGCACCC 13740

AATGATTGAC CCAGATGTTC GTATCGAAAA AATTCGTGAA TACGCACAGG ATGAAAAAAC 13800

GGGGATTATC TTATTCGATG TGGTGTTAGG CTATGGTGCT CATGAAGACA TGGTTGGTGC 13860

ATTGTTACCA GCAATTGAAG AAGCGCGTGC CACAGCCAAA GAAGCAGGAC GTGATCTTTA 13920

CTTTGTTGCA ACTGTCTGTG GGACGACGAA AGATCCGCAA AATTACCAAT CGTCTGTTGA 13980

TCGTTTGAAA GAAGGCGGCG TTTTAGTAGC AGAAAGCAAT ACTAAAGCTG TTCAATTAGC 14040

TTTATTACTA AAAGGAATCG AAaTTTCAGA AGACGATAAA GAAGTGGTTG CTTACAACGG 14100

ACCmACTGTC GATGTrCCrA AACCAGGCGA AAAaGTAATG GAACTATTaA CAACCAAACC 14160

AAGAATCATT AACGTTGGGT TGCAAAGCTT TACAGAATCA ATCGTTGATT ACGGTGGCGA 14220

AACTGTTCAA TTTAACTGGC GTCCACGTGC TAACGGCAAT AAAAAAATGA TTAAAATTTT 14280

GGATGCTTTG GAAGATTACA GTGAACAAAT CGAAGCTGAA AATCACAAAG TAACAGACAA 14340

AATTAAAAAT GCACAACCAT TCTTAGTAGA TGTGGTACCT GCGAAATCAG TGATTCCAGA 14400

ATTAAATGAT GACGCACAAA AAACATTACT TCATGCTGGA CCACCAATCC AATGGTCTGA 14460

AATGACTGGA CCAATGAAAG GTGCTTGTAT
CCATGTAAAA GCAGTAGGAC CAATGGGCGG TATTACTTCT GGTAATATGC CAGTCTTTGT 14640

TGTGGAAAAT CGCTTAGAAG GTAACGAAGC ATACTGTATC TTGAATGAAG GTATCGGTAA 14700

AGTATTGCGT TTTGGTGCTT ATTCACAAGA AGTGGTTGAT CGTTTAGATT GGATTAAAGA 14760

TGTTTTAGGA CCAACTATTG CGAAAGCCTT GAAACTTTCT GAAGAAGGCT TGAACTTAAA 14820 

CGTATTGATT GCGCGCTCTA TTACAATGGG TGACGAGTTC CACCAAAGAA ATATTGCCGC 14880 

ATCTTTGAAC TTCTTAAAAG AATTGTCTCC ATTAATTATC AAAACAGATA TTCCAGAAGA 14940 

TCAAAAATAC GArtAATCAA ATTCTTAGCT GACACAGATC AATTCTTCTT AAATGTCATG 15000 

ATGGCAACTG GAAAAGCAAT TGTTGATGGT GCCCGTAAAG ATACCAACGG AACAATTGTC 15060 

ACAACGATGA CTCGTAACGG CGTGAACTTT GGGGTCCGGA TTGCAGAAAC AGGCGATCAA 15120 

TGGCATACCG CACCAGTAAA TACACCAAAA GGTCTTTACT TCACTGGTTT CAGCGAAGAA 15180 

GACGGAAATC CTGATGTCGG GGATAGTGCA ATTACTGAAA CAGTAGGTGT TGGAGCAATG 15240 

GCTATGGTAG CTGCACCTGG CGTAACACGG TTTGTTGGTG CAGGTGGCTT TGAAGATGCG 15300 

TTAgCAACCT CAAATGAAAT GGCAAAAATC TGTGAAGGAA ACAACCCAAC ATTCTCAATT 15360 

CCAACTTGGG ATTTCCAAGG AACTTGTTTA GGAATTGATA TCCGTAAAGT GGTGGAATTA 15420

GGAATTACAC CAATCATCAA TACAGGGATT GCACACAAAA ATGCCGGTGT TGGTCAAATT 15480 

GGTGCAGGGA CAGTTAGAGC ACCTCTAGGC TGTTTTGAAA AAGCCCTAGA AGCGTATGCT 15540 

GAGAAACTAG GCATTACGGT TGAATAACCA ACTAGTCATT AGTGACTACA TTTTTCCTAT 15600 

CCACCAATAT GGCAAGATGG GAACCATTCA CAGTGTTTTT GAACGTTCAT TTAATCTGAA 15660 

AGTCCAGGAT CAATTAATCA ATGTTGCAAA CTTCCATAAT TATTTATCAA GCTTTGGTAT 15720 

GTTTTTGCCA GATCAATTGT TTCAAGAAAT CTTTCCGTAT GTACAACAAG GTAATAAAGT 15780 

CAAAATCACA GAAAATCAGT TGACCTTTTA CAGTACAGTC GGCGTGAAAA CGATCCAACT 15840 

CACACCTGCT GAGGTCGTTT CTTTAAACAT TACTCATTTC AAGTTAGAAA AAGACCAATT 15900 

ACGCTTGTTG CGAGACCGCT TATTAGCGAG AAACTTAGAA CGTAGAATTG GCCTGCCTTT 15960 

AGACGAACGA GCGAAgcATA TTTTCAAAAA AATGAGCCAA AAACAAAAAG TTTGGAGACT 16020 

ATCGGAATGG CAAGAAGTAA CGAATTATCT GATTGGTCGG GGCAAAGGGT TAACCCCGAG 16080 

TGGTGATGAC ATTTTAGTGG CGTATCAAAC AATTCTGTTC ATACTTGCTG ATGAAAGAGC 16140 

TGCGGCGTTA GCCgcTAcTC TTTCGGCAGC AAATTTATCC ACAACAGATG TTAGTAAAGG 16200 

CTATATTGCA AGCTCGGCTA AAGGATATGT AAATTCATTG TTGTATCAGT TACTCTTGGA 16260 

TTTGGAGAAC CATCGGGATA ATAGAGTTGA CGAAAATATT GACAGAATTA TACAAATCGG 16320 

ACATTCCTCT GGTAAAGATA TGTCATTTGG AATGTTACTA GCTTTACAAT CTGTAGAGTT 16380 

AGATGAGAAG TGAGGAGAAG ACATGGAGAA TAGTAAAAAA AATCAAGCAG TGCAAGTGAG 16440

CAATAGTTAC TGGATTAAAG TCGTAGCCAT TTTCTTCGTA GGCTGGATTT TAATGTACGC 16500 

AACACGAACT ATTTTTAATC CAGTGATGGG GGTCATTGGA GAACAATTTG GTCTAAGTAA 16560 
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AAAAAGGAGC 


GAaTCAATCA 
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ACGTAATTGC 


CCTAGGTGGG 
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TAGATACGGA 


TCCAACAGAT 


GAAGGCCAAA 
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TTGCAGAATT 


TGTTGCTAAA 


GGGGAACAAG 


TCATTGTCTG 
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TCACGGAAAC 
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GGACCACAAG 


TAGGGAATTT 


GCTTTTACAA 


CAAAAAGCAG 


GCGAAAGTGA 
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GCGTTGAAAT 


TGGATACTTG 
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CAAAAGAAGA 
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GAAGATGCTG 


GCCGTGGCTA 


CCGAAAAGTT 
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CAATGCCTAA 


AGAAATTGTT 


18300 


GAAAAAGAGG 


CGGTTCGTGC 


TT TAG TAG AG 


GCGGACGTTT 


TAACTATCTG 


TTCTGGTGGC 


18360 


GGGGGCATTC 


• CGGTCGTAGC 


AGAAGACGGA 


CAATACGTAG 


GTGTGGAAGC 


GGTTAACGAC 


18420 


AAAGATTTCT 


CTGCACGTGT 


TTTAGCTGAA 


AACGTTGATG 


CAGACCGTTT 


GATTATTTTG 


18480 


ACAGGCGTTG 


ATAACATCTA 


CATTAATTAC 


AATCAACCAG 


ATCAAAAAGC 


ATTAGAACAA 


18540 
ATCAGTGTAG CCGAAGCAGA AGAATACATT AAAGAAGGTC ACTTTGCTGC TGGTAGTATG 18600

CTACCAAAAA TCGAAGCTGC CTTAGACTIC GTAAAAGGTG ATGACAAACG GAAAGCAATC 18660

ATTACTTCTA TCGAAAACTT AGAAAACATC GATAAAGAAG CAGGAACTGT GATCAGTCAA 18720 

AAGGGGTAGT TTACCATGAA AAAACCGTCC ATGATCACCC AAATTGCAAT TGCTGTTGTC 18780 

GTAGGAATCC TCGTAGGACT CTTGATTCCT GCTAGCGGCA ATTACTTGAA AATAGTTGGA 18840 

GATGTCTTTT TGCGACTGAT GCAAATGGCT ATTCCTATCT TGATCTTAGG GCAAATTGTC 18900 

CAAGCAGTTG GCAGTATTAA TCCAAAAGAA TTAACGAGCC TTGGGGGTAG AACAATCGCT 18960 

GTTTTCGGGA TTTCCTCCTT AGCTGCCGCT TTATGGGGCG TCCTAATGGC AGTCATTTTT 19020 

AATCCAGGAT ACGGTGTTAA AATGACTGGC TTCCAAGATG CTTCTATTAA AGCGCAAGAA 19080 

ATCTCAATCA CTGATACGAT TTTAAATTTC GTTCCGAAAA ATATTTTTGA TTCACTGACT 19140

CAAGGATCGA TTATTCAAAT TATTGTCTTT GCATTATTTT TTGGTTTAGC ACTGAATAAA 19200 

TACTTACAAA GCCATCCAGA GACACAACTC TTCCAAATAA TTGTGGACTT TAATGAAGTT 19260

ATCATTACAG TCATTCGATA CGTCATGTAT CTAGCACCTC TGGGGATTTT TGCTTTAATT 19320 

GCTTCAACAA TTAGTCACTT AGGCCTTCAA ATTATTTTAC CGTTAGTGAA GTATCTATTG 19380 

GTCTATGGAT TAGGAACTAT TTTATTTTTG GGCATCTGGA TTTTGGTTAT CACATTGTAC 19440 

TGTAAAGTAA GTCCACTACG TTTAATTACA AACATGAAAA ATATGTCTGT GATGGCCTTG 19500 

GCGACAACTT CTTCAGCAAT TACTTTACCA GTAGCTTTGG AAGAAACAGA AGCAAAATTA 19560 

GGTTTAAGTA AGCGAATTAC TAACTTAGTG TTGCCGkTAG GTATGTCATT AAACAGTAAC 19620 

GGTTCAGCGA TGCATATGGC TTTTACTGTA ATGACTATCG CtCAGATGTA TCAGCTAGAG 19680 

TTTGATATTA CGAAAATGAT TTACTTAGCA ATTACAGCAA CGTTTGTCTC ACTAGCAAAC 19740 

GCAGTTGTTC CCGGTGCGGG CTTAGTCTCA CTGGCAGTGA TTGTyCCGCA AtGGGTTTAC 19800 

CCATTGAGAG TATtGCAaTT TTgCaGGCGT CGAATGGTTT GTTGGCATGT TACGAACAAT 19860 

ttTAAATGTC AATTCAGATG TTTACTCTGC TATTCTAGTT GCAAAATCAG TGGATGAAAT 19920 

TGACTATACT GTCTTTAACA GTAGCAATAA GTAGTGAAAA AAGGGGAGGA AATGCAAGGA 19980 

CGAAGAAGAA AATCACTTGT TTTCTTTCCC TTTTTTAAGA TTAATGATAA AGGAGCAAGC 20040 

GCAATGGAAG AAATGGTTGT CGTTACACCG GAAAAATTAG AAAAGCTGAT GAAACAAAAA 20100 

TTAGAAGCTG CTGGTTTACA TTCTGAACAT GCCGAAGAAG TTGCTCGACA TTTAACTTTT 20160 

GCAGATGCTA GCGGTATTCA TTCACACGGG GCGGTTCGTA TGGATTATTA TGCGGAACGG 20220 

ATTGCAAAAG GCGGGATCAC GATTGACCCC GAATTATCAT TTGAAAAAAC TGGACCGTCC 20280 

ACTGCCATTT TTCATGGCGA TAATGGCGTG GGTCAATTTG TTTGTAACGA AGCATTAGCT 20340 

GCTGCAGTTG ACCTAGCCAA AGAATCAGGG GTAGCATATG TTGGCGTTTC TCAAACGAGT 20400

CATAGCGGGA CGTTATCGTA TTATGTTAAA AAAGCAGCGG AACAAGGCAT GGTTGCTCTT 20460

TCGATGTGTC AATCTGATCC AATGGTGGTA CCGTTTGGTG GTACAAGCAA TTATTTTGGG 20520 
ACAAATCCAA TCGC7TTTGC AC-CACCTAGA GCAGGTCATG AACCAGTCGT GTTTGATATG
GCTACGACAG TACAC-SCG7G GGGGAAAATT TTAGATGCAC GTTCAAGAGA. TGTTGAAAIT 
CCCGACACAT GGGCASTAGA CGAAAAAGGC CAGCCAACGA CTGATCCTTA TGAAGTACGA 
GGTTTATTGC CGA7TTCAGG ACCAAAAGGA TATGGACTAA TGATGATGGT CGATGTCTTA 
GCTGGTTCGT TGTTASGCTT ACCTTTTGGC AAACATGTTA GCTCCATGTA TGCGGATTTA 
ACGGAGAAAC GGAATCTGGG "CAGATGTTT TTAGTCATTG ATCCAAGCCG CTTTACAGAT 
TCTGAACGAT TTAAAGAAAA TATTAATCAA ATGGTGGAAG AACTTCATTG TGTGCCTACA 
GCAGAAGGGT TTCAACAAGT TrACTATCCA GGAGAAATTG GTCAGCTAAA CTATCAAAAA 
GCCATGTCAG AAGGCATTGA AATTCCACAA AGTATTTATG ATTATTTAGT TAGTGAGATT 
GTACATTACG ATAAATATGG CGGGCAAGGT GCCTTTGCCA AATAATGAGG AGCGCAGAGA 
GTATTCGCTA ctGCGTTGCT CTCGCCTTCT TTCTATTGGA GGAAAGTAAG ATGAAAAAAT 
CGTCAAAAGT AGCACTTATT TTTGCATTCT TAGTTAGTAT TTTTGTGCCA GTTGCGGCCT 
TTGCTGATGA GATTGGGGAT GTAGTTGCCC CGACAGGAAT CATGGCATTA GTTGTGATTA 
TTCCATTGAT TGTTGTTTTG G7TCTATTGT TTATGAAAGT CGATATGATT ATCGCTGGTT 
TAATTGGTGG CGTCTTAGCG A7GCTGATTG GTGGTATTGG ACTAGAAGAA GCGAATAAAC 
AGATGTTGGA AGCTATCCCG ATGATGTTAG GAATTACCGT TCCCATCATT AATTCAGCTG 
TTGCGATGGC GGTCTTTAAA GCAGGTAGTT ATTCAGCTGC ATTAACTTTA GCCAAACGAG 
GCACAAAAGG AAAAGTCGAG TATGTTTCAG CTTTTATTGT TATCTTATTA GCCGCAGCCA 
CATATATGTC AGGAATTGGT GGCGGAAGTG CGATGGTTAT TGCGCCATTG GCCTTTGCTG 
CAGTTGGCGT TGTTCCTGAA TTAATTGCAG CAATGTCTTT AGCTGCTGCG GTaTCATTTA 
CAACATCACC CGCTTCATTA GAATCAAGTA TTGTTTCAAA ATTAGGCGAT ATCAGTGTCG 
GTTCATATGT TTCAACCATG CGACCTTACT GGTTATTCTT CGTAGCATTA GCGATTGTGT 
TAGCCTTTTG GGGAACGAAA CATAGAAATG TTGGCTTTAA AGAGTCGGCA GATGATGAAT 
TTGATAAAAA GAGCAATGGT GAACTATTTA AAATTACCTT GCCAGCCATT TTCTTATTGT 
' TTGCTGTAAT TTTTGGACCA ATTGTGAATG ATTTAATTGG TTTTGCCTTC TTTACGCCAC 
TTGTATACAT GATTTTAACA TTAGTTTTAA TTTTCTTATG TACAGATTTC TCAATGAACA 
AATCTGTCGA AGCGATGGTA GATGGGTCTA CATATATTTT GACTCGTTTA TTCCAAGTAG 
GGATTTTCTT AGCCTTTATT AATGTTATTG CACAAACAGG AACGTTTGCT GTAATCGCAG 
GTGTTGCTGA AAATGCACCA GCCTTTATTG TCGTGCCAGT TGCCATTTTA ACAGGGATTT 
TAATTGGGAT TCCAGCTGGG GCCtACGTAG GTTCTGTTTT AACTTTAGTG CTACCAGTTG 
CTGTATCGTT GAACTTCCCG CCGTTAGCAT TAGGTTTTGT AGCAATGGGT GTTGGTTTAG 
GTAGTCAAAT GAGTTTCGTT AATATTACTA TGCAAGCTTT ATCTTCAGGC TTCCAAATTC 
CAATTTTGGA TGTTGTTAAA GGAAACGTGA AATGGTTAAG TATTGCCTCT GTACTATTAT 
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TAGTTATCGG 


TTTAATCTTT 


GGATAAAAAA 


CAATT TAG GG 


AGTGGAAAAC 


ATGGATTTAT 
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TAATTAAACA 


AGTTCGTCTT 
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AGATGTAGCT 


ATTAAAGAAG 


ZZ DZU 


GTAAAATCAT 


TGAAATCGCC 


CCAGTTATTA 
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ZZ OOU 
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ACATTTGGAC 
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O O "7 A A 
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TCATGGGACC 


acggcgttaa' 
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CCAGCGCAAG 


22920 


GTTTTACTGG 


CTTCAAAACA 


ATTATGCGTT 


TGAAAGAAGA 


ATACAAAGAG 


TTAATTGACA 


22980 


TGCAAGTAGT 


TGCGTTTCCG 


CAAGAAGGAA 


TTTTTAAAGC 


CCCAGGAACA 


GAACAAATGA 


23040 


TGTATGAAGT 


AATGGATATG 


GGCGCAGACG 


TTGTCGGCGG 


TATTCCTTAT 


AATGATGCAC 


23100 


CAGCGAAAGA 


ACATATTGAT 


TTAGTGTTTG 


AAATTGCTAA 


AAAATATGAT 


AAAGATGTAG 


23160 


ATTTACACCA 


AGATTTTGCC 


GATGAAGCTG 


AT GAT AT GAG 


TATTGAATAT 


TTATGTGAAA 


23220 


AAACCATCGC 


TGAAGGCTAC 


CAAGGCCGTG 


TTTCAGTGGG 


CCACTTAACG 


GCATTGCATG 


23280 


CCTTACCTGA 


AGCAGAATTG 


GCGCCTATTA 


TTGCAAAAAT 


GGCGGAAGCA 


AATATTAGCG 


23340 


TGATGGCTTT 


ACCAGCAACG 


GAT TT AC ATT 


TAGGGGCTCG 


TGGTGATCAG 


TACAATGTTC 


23400 


GACGTGCGGT 


TACGCCAATC 


CGTAAATTAC 


GCGATGGTGG 


CGTGAATGTC 


TGTATTGCTA 


23460 


CGAATAATAT 


TCGCAATGCT 


TTTACACCAT 


ATGGCACAGG 


GGATATTGTT 


CAAACAGCGA 


23520 


TGTTAGCCAT 


TCCTGTTGGG 


CACCTAGGTG 


GAG C GG AT G A 


CTTAAAAACA 


GTGCTACCAA 




TGGTAACAAC 


AAGTCCTGCT 


CGTGCGATGG 


GACTAGAAGG 


ATACGGTGTC 
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TTTAGTCTTA 
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AAGTGGTTGA 


ACATGCGATT 


ATTGATATTC 


Z J /UU 


CAGAACGGTT 


ATATGTTATT 


AAAAACGGCC 


GTGTTACAGT 
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AAAAAAGTCG 


Z3 1 oU 


AAATCATTCG 


TTAAGAAAAA 


AAACAAAAAG 


CTAGGAAGGT 


GAAAATATGA 


TTAATTTATC 


Z JdzU 


CTGTTTTAGC 


TATGATTCTC 


TATTGAAAAG 


TTtATTTTTA 


GTTTATCTTT 


GGTCGATTTT 




CCTTGATATA 


TATATAATCT 
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o a n o r\ 
Z 4 UUU 
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ACAAGTGAAT 


GATCCAAAGT 
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TACCACAAGA 


Z h UbU 


CAAGACTGTT 


GCTGAAAATG 


GTAAGTGATC 


ATTCTGCTAA 


GTTTAGTGTC 


TTAATGTTTA 


Z hLZU 


GTTAACTAAG 


AATTGTTGGA 


TTGTTCTTTA 


GAAAGAAGGG 


ACAATATGAA 


GCGAAGTAAA 


Z 4 loU 


TGGAAAGAAT 


TGATAGTAAC 


GGGCATCTGC 


CATATATTAG 


TATTCCCCAT 


ACTAATACAG 


24240 


ACAACTGTTT 


TTGCAGAAAC 


ATTACCAAGT 


ACAAAACAAG 


TAAGAGAAGG 


AACCAATCAT 


24300 


TCATTAACAG 


■ CAGAAAAAGC 


CGAAAGTGAA 


CAACCACAGA 


CAAAGGATAA 


ACTACATGAT 


24360 


GAAGAAACAC 


TGGCATTGTC 


AAAAAGTGAG 


TTAATCGATA 


ATGAGGCTAA 


, TGTTACAAGT 


24420 


CAAACGATTA 


GAGAAAGAAT 


TGAGACGCCT 


AACCTAACTT 


ATCGTTATGG 


ATTTATTAAT 


24480 
GAAGAGGGGC AGCCAGTAAA CGCCAATGAG ATCCTTCTAC AGTATCATAG TTGGCAAGGC 24540

AATTCCCCAG ATGGCATAAA TGTGTGGGAA GGTGAAAGTC AACCAGTGAC AGCATCTACA 24600

GTGGCTAATT TAAAAGAAGT GGTAATTCCA AGTGAGAAAG TAGCCGTCTA TTCCGACATG 24660

TCAACGGTGC TTGCAGCGAG TAATCAAACA TTTTTTTTAC CAAGATATTA TACTTCTTTA 24720 

AGCTTATACA ATAAGAAAGG GGAAATTGAT CCCAATTATC CGCTGCCAAG TATTTCCGAC 24780 

GCATCAGGAA ACCAATATCC AACAACAATT TCGCAATTTG AATTGGAAAA AATGTCTGCA 24840

CAACAATATA GTCAGAAAAC AGGAGTAACG TTTAACATTA GCGAGAGTCA AAAACTAATC 24900 

GTTCCTTTGT ACAACCAAGT GAAGGTTGAT TCATCGAATC AATCTGGGCT ATTGAATTAC 24960

TTTAAATTTT CAGGGCCGGT TTATTATCAT GTTACCAATC GCAAAGTGAC AGAACATTTT 25020 

GTGGATACTC AAGGGAAACC AATCCCTCCA CCACCGGGGT TTAGACAAGG AAAGCAAACA 25080 

CTTATTGAGC GTGACCCTTA CACCTTTAAA CAGAAAGATC TTTTGCCAAG TAGCTATGAA 25140 

ATTGACTCAA AAACGTATCA ATTTCAAGGA TGGTATAAAG GGAAAACGAA ACCTGAAAAT 25200 

TTAGAAAAAA GCGTAACGCC CAGTTATGAT ATTACCTATG ACGACAATGA TGATTTAACT 25260 

GTTGTCTATA AGGAGATACC TCAAAAAAAT TATACATTTG AGGATGTCAA TGGTGTTGAA 25320 

ATTGCACCAC CATCTGATTT TATTCAGGAT CACCAACAAC CAATAACTAC GGATGGCTTT 25380 

CGCTATTTAG CTGGAAAAAA ACTGCCACAA CAATACAGCG TTAACGGTAA AACTTATTTA 25440 

TATCAAGGTT GGTATCAAGA TAAAACmAAA CAAGAGAGCT TAGAAAAAAC gAAGCGACCC 25500 

ATAAACTCCC CTGTTTTTAA TGAAATGAAC GCTATTACAG CAGTGTATAA GGAAATAACT 25560 

GCAAAAGCTG AAATGCAAAT AGAAGGACTA GTCAAAGTCA TGCCAAGTGG TTATATACAA 25620 

ATTTGGCAGA TTATGCTTAC AAATGTGGGA GAAGTACCGT TAAAAAAAAT AAACTTAAAG 25680 

CCAGCAAGTG GTTGGTCACC AGGTCTAGCT CGGCCAATCC AAGTCACGAT TCGTGTTGGA 25740

TCTGAACCAA ACAAAATTGT TCCTATTACT GATGAAAATT GGCGAGTTGG CATTACTTTA 25800 

AATACGGAAG TGCCTATTGG TCAGACAGCA ACTATTATGA TGACAACAAT TGCTACAGGT 25860 

GAACCAGATC AAGTGTTACA AGCGGCTGTT GAAATGAATG GAAATTTTTC TGCTGTTCAC 25920 

GCAGCTGATA CTGTCAGAAT CCAACCTAAA AATCAAGAAA TTGTGGCACC AGATGAGGAA 25980 

GGTTTTATCA GCACACCAAC TTTTGATTTT GGCAAAGTCG CCATTTCTAG CAACACGCAG 26040 

CAACATGGTT TAAAGCAGGC AGCAGATTAT TATGAAAATG GTCAGGAAAA TCCATATTTA 26100 

CGTTTGAAAA AATCACAACC CAATTGGGCA CTAACTGCAG AACTATCCCC CTTTGAAGGA 26160 

AGAGTGGATC AACTATCATC AATGACAAAG TTATTGTTAG GAACAACCAA TGTTTCAGGT 26220 

TTTATTCAGT ACAATCAACC AACGGAAACT AAAGTTGCTC TTGGCAAAAC AACCGCTATT 26280 

CAATTAGTTG CCAACGGTGT AGCTAGCCAT ATTGTTGCCA ATGGTCAGTT TGACGAAAGT 26340

GATGTTTATC AATTTGATTT TTCTTTTGAT CAAATCAAAT TAGAAATTCC AGCAAATCAA 26400 

GGTAGAAAAG ATCAAACTTA TCAAGCAATG GTGACTTGGA ATTTAGTGAG AGGCCCATAA 26460

GTCCATAAAA AATAAGTTGA ATAGTTTGAT CAAAAACATT GCATAGAAAT AATAAAAGGC 26520

GGCTCAACGA AGCGGCCTTT TTCTTGATTC GTTACCAAAA TATAACGAAT TTTGTAACAA 26580

AAACGTTTAA TAAAAAGAAG GAAAAATGAT TAATTTTACG AAAAAAAGCA TATATCGACA 26640 

AAAAAGAAGC AAAATTTGAA TAAAAAAAGA AAAAATATGA AAAAAAATCG AAGAATTGTT 26700 

TAACAATTGA AAACACGTGA AAACTAGTCG TAACAAGGTG TAAAAGCCTT ACAAAAGGTA 26760

AGTTTACCTA TTATGGAGAA AGAAGGAGTG ATCTGAAATG TTTTTGTAAC ATTTCTATTG 26820 

CTTAATTGAA ATATGGGTGT GGTAGAATGT TTTTGTCGTA AGAAAAGATT AAGATATTTT 26880 

AAATAAACTT TAGAAaTTAA TTCGGAGGAA TGACAAGTGA AGAAACGTCT ATTTGCATCA 26940 

GTATTACTAT GTTCATTAAC GCTATCAGCA ATTGCTACCC CAAGCATCGC TTTGGCGGAC 27000

AATGTTGATA AAAAAATTGA AGAAAAAAAT CAAGAAATTT CATCATTAAA AGCAAAACAA 27060

GGGGATTTAG CTTCACAAGT ATCTTCTTTA GAAGCAGAAG TATCTTCAGT ATTTGATGAA 27120 

AGCATGGCTT TACGTGAACA AAAGCAAACA CTAAAAGCAA AATCAGAACA ATTACAACAA 27180 

GAAATTACAA ACTTGAATCA ACGTATTGAA AAACGTAACG AAGCAATCAA AAATCAAGCA 27240 

CGTGATGTTC AAGTTAATGG ACAAAGCACA ACAATGCTAG ATGCAGTTTT AGATGCGGAC 27300 

TCAGTTGCAG ATGCAATCAG CCGTGTTCAA GCTGTTTCAA CAATCGTAAG TGCCAACAAC 27360 

GACTTAATGC AACAACAAAA AGAAGACAAA CAAGCCGTTG TTGATAAAAA AGCTGAAAAC 27420 

GAGAAAAAAG TGAAACAACT TGAAGCAACA GAAGCTGAAT TAGAAACAAA ACGTCAAGAT 27480 

TTACTTTCTA AACAATCTGA ATTAAACGTA ATGAAAGCTT CATTAGCATT AGAACAATCA 27540 

TCAGCTGAAA GTTCTAAAGC TGGCTTAGAA AAACAAAAAG CAGCTGCTGA AGCAGAGCAA 27600 

GCACGCTTAG CTGCTGAACA AAAAGCTGCA GCTGAAAAAG CCAAACAAGC TGCTGCAAAA 27660

CCAGCTAAAG CTGAAGTGAA AGCAGAAGCA CCAGTTGCCT CTTCATCAAC AACAGAAGCA 27720 

CAAGCACCAG CAAGCTCAAG CTCAGCAACT GAATCAAGCA CGCAACAAAC AACTGAAACA 27780 

ACTACACCAA GTACAGATAA TAGTGCAACA GAAAATACTG GCTCTTCTTC ATCAGAACAA 27840 

CCAGTACAAC CTACAACACC AAGCGATAAT GGAAATAATG GTGGCCAAAC TGGTGGTGGA 27900

ACAGTTACAC CAACACCAGA ACCAACACCA GCGCCTTCTG CTGATCCAAC AATCAATGCA 27960

TTGAACGTTC TACGTCAATC ATTAGGTTTA CGTCCAGTAG TATGGGATGC AGGTTTGGCA 28020 

GCTTCTGCAA CTGCTCGTGC AGCACAAGTT GAAGCAGGTG GCATTCCAAA TGATCACTGG 28080 

TCTCGTGGAG ATGAAGTTAT CGCAATTATG TGGGCGCCAG GTAACTCAGT AATCATGGCG 28140

TGGTACAATG AAACAAACAT GGTAACAGCT TCAGGAAGCG GTCACCGTGA TTGGGAAATT 28200 

AACCCAGGTA TTACGCGTGT CGGTTTTGGT TACTCAGGTA GCACAATCGT AGGACACTCA 28260 

GCCTAATCAG CAATGATAAA AATAGACGTA AGCAGCTTCT GAAAAGAAGC TGCTTTTTTC 28320

ATTACTATTT CATCCGTCTC CGTTGACAAA TTTGCTAAAC GTGGTATTCT AATTATAATC 28380 
TAATAATTTC CGTTGAATAG GAAAGTAAGT ACAAGACAGC TGGTAGAGAG ACTTTGGTTG 28440 
GTGAAAAAAG TTAGATGTCT GTATCGAAAA ATGGCCTAGG AGGAATTTTT GTCGAAACGT 28500


AGGCAAAAAC 


GTAATGTCTT 


TCGTTAACAA 


GACCGCTGAT 


TATGATTGTC 
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AAAGAGTCAT 
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AAAGGTGGTA 


CCGCGATAAT 
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TTACTAGTTA 
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CGGCTAGTAA 


AAGGGcGTTT 


TTTTATAAAT 


AAAAATAGGA 


AAGAGTGAGG 


CAAATGAGTA 


28680 


GTCAGATTCC 


TTTTACAGTT 


GATCATGTGG 


GAAGTTTTTT 


ACGTCCGGAA 


APAATTAAGA 


28740 


AAGPACGGAA 


AAAGGTCCAA 


GCAGGTGCTT 


TATCTCAAGC 


AGAGCTACGT 


PTTATTGAAG 


28800 


ATGAGGAAAT 


CAAAGCATTA 


GTAGCAAAAC 


AAAAAGCTGG 


CGGCTTACGG 


GGAATCACAG 


28860 


ATGGTGAATT 


TCGTCGCGGT 


TTTTGGCATA 


TTGATTTCTT 


AGAAGGATTA 


AATGGTGTGA 


28920 


PAGGCTATAT 


CCCAGAAACG 


GGGTATAACC 


AAACCTTTCA 


TGGAAAAGCA 


GPTPPGGCCT 


28980 


ATAATATPPG 


CGTAGTGGAT 


AAAATATCTT 


TTCCTACGCA 


ACATCCATTT 


TTAGAGGAPT 


29040 


TTTODTTTTT 
1 l l tx X x i x i 


AAAAGAGGTG 


GTGGAAGTTG 


GTGATGGCAC 


GGTAGCCAAA 


PPZiaPG ATPP 


29100 


PTAGTPPGAC 


CATGATTTTA 

\^ 4i X \J4k X X X X X* 


CGTCAAGAAA 


TTTTATCTAA 


CGATGGCACC 


TPGPGPATTP 


29160 


A A APAATTTA 
rtrtn^rtrtx x x n 


TCPAGAGATT 


GGTGCTTTTT 


ATAAGGATTT 


AGCACAAACA 


TZiTlVa AGAPG 


29220 


PAATTGPGGP 


CTTTTACGAG 


AGAGGTTGCC 


GTTATTTGCA 


ATTTGATGAT 




29280 


PGTTTPTAGP 

v_?o x x x \w« x n\jV( 


GGATCAGACA 


AAACAAGAAG 


AATTACGAGC 


AAAAGGAATA 


PZif^PPAGAGG 


29340 


AAATTGPAPA 


AGTGTGCACC 


ACGATTATTA 


ATGAAGCCTT 


AGCTGGCAAG 


PP ZH^Z\ AG ATT 


29400 


TAAPGGTAAP 


PACACACATT 


TGCCGTGGGA 


ACCATGCTTC 


GTCTTGGTTA 


TTTTPAGGTG 
1111 ^nuo 1 o 


29460 


GTTATGAAPC 


GATAGCAAAA 


GAATTATTTG 


CTACGAATTA 


TGATGGTTTC 


TTPTTGGAAT 


29520 


ATGACTCAGA 


TCGTGCAGGG 


GACTTTTCAC 


CATTAAGACA 


TTGGCAAAAT 


Za&TGGPAGPA 


29580 


AAGTCGTTTT 


AGGTTTAGTT 


ACTTCTAAGT 


TTCCTGAATT 


AGAAGAGACT 


GAACAAGTGA 


29640 


AAGCACGAAT 


TGAAGAAGCC 


CAACAGTTTG 


TTCCTTTAGA 


AAACTTAAGT 


ATTAGTCCGC 

.ri X X AO X w^VJv 


29700 


AATGCGGTTT 

nn x >J \j x x x 


TGCTTCAACG 


GAAGAAGGAA 


ATCGTTTGAC 


GGAGAAAGAA 


PAATGGPGCA 


29760 


AAGTTCAATT 


ACTTCAAGAA 


ATTGCTAGAG 


AAGTATGGTA 


AAAAAACT GG 


GAPATAAGTC 


29820 


TAAACGGCTT 


ATGTCCCAGT 


TTTTTCATTG 


TATTATTTTA 


AATATTGATC 


TTTAGTAAAA 

XXI tVvP X 


29880 


TTCGTGTGTT 


CATACTCGCC 


GTTGTCACCA 


TAAAAGCGGT 


TATAGCGTTG 


TTTGAATTTG 

XXX vjrm x x x \j 


29940 


CGGAATAAAT 


AAATGACTGT 


CACTTTAACT 


AGGCAATACA 


TTGGAATCCC 


AAAAATAACG 


30000 


CCCATTAAAC 


CAAACAGTTC 


ACCCATAACT 


AACAAGACAA 


GTAAAATAGT 


AATPGGATGG 


30060 


ACTTTTAATT 


TATCGCCCAT 


CACACGAoCjA 






PAATTGGATG 


30120 


GCAAACCAGA 


CGATAACGAA 


TTTTACTACC 


ATAAATGTGG 


AATCCTGAAA 


GGCAATAATC 


30180 


AATCCTGGAA 


TAAAGGCAAT 


AAATGGCCCG 


ATATAAGGAA 


TAATACATAA 


AATACCCGCA 


30240 


GCTAAAGCTA 


. AAATGCTACC 


AAATTGTAAA 


CCAATTaATA 


GAAAAGCTGG 


cCAATACATA 


30300 


ATCCCTAAAA 


CCAACGAAGC 


AATGACTTGA 


CCTTTTAGAT 


AATCGCCAAT 


TTGAATGTTG 


30360 


GCAATTTGGC 


TATATTCGTC 


AAAATCAGGA 


CGAAAAGCTG 


GGGGAATGAT 


GCCTTTTACA 


30420 
AAGCGATaAA ACTTTTCTTT GTCTTTTAAA AGGAAAAAGG CGATGATAGG TCCTGTAAAC 30480

AAGGTTAGGA AAGTAGAAGA GACTGCAGAA AAGACGCTTC CCAGACCTTG TGCGCCTGTT 30540

TGTAGGTAAT CCCCTGCAAA GTCTTTAAAG GAGTTTGCAA GTTTGCCCCA GAGCTGATTA 30600 

ATCGACTCAT TAGCAGATTC TAAATCTTTT GCAAAAGGTG TTTTAGCCAT AAATTCTTGT 30660 

ACTGTTTGGA GCGTGCTTTT CCAAAAACTT GGGAATTGTT TGATTAATTC TTCCGTTTGG 30720 

TCTTGGATAA TTGGGaAAAG TTGCAGTCCA GCTAACACTA GTAATGCAAT AATTAAGACA 30780 

TAGAGAATGc GCAATCGCCC aAACACGGGG AATTTTTTTC TCAAGGCGTT TCACCATTGG 30840 

ATTAAGCAAA TAAAwTAGAA TAATACCAAA AAGTACAGGG CCTAAGACGG CACTAAAAAT 30900 

AACAAAAATT GGTTTAAAGA TAAATGAAAC AGTGTTGAAT AAAAAGATAG TTACAGCAAT 30960 

TAGACAAAGG ATAGTTAATG TGTAAAGTAA ATTTTTCCCA CCTAAAAAGC GGATCCACTT 31020 

GGTACTTTCT TTATCCATAA GTTGCTCCTT TCGTTAAGCT TTATTTCTAA AATGACCAAT 31080 

AATTTCAATG ACACCCATGA AAATCAGGAT AAATCCAAAG ACTTGCATCA AAACTAAGAG 31140

ACTTTTGAAG GGATTAAAAA CTAAAACGGC TCCAGCGATT AAAACCAGAA TACTGTAGAT 31200 

AAGCGCGCCT GTTTGACGAG TATTTAAAGC AAAAGTsAAT TGAACGAGGC CGTTTATAAC 31260 

AATGCTAATC CCTAATAAAA ACGGCAGAAT CGAAACGATC ACTGGTGCAA AAAATAAAAA 31320 

GGCTACTGCT AAGACGACAA AGATTAAACC TGTTACGAAA CCAAGTCCCC AGCTACCCGT 31380 

TTGTTTTTTT ATTTTGTAGT CATCATAAAT ATTAATTAAC CCTAAAAGCA TCAAATAACC 31440

AGCAATTAAA TAACCGACGA ACTGGAAGAA AGCACCGGGA TTAAAGACAA TGGCACTTCC 31500 

GATAACGATA aAGACGG 31517 
(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

AGGTAATAGA CAGTGCAAAA ACAATGATTA GAAAGGTAAA TTCCATCCTA TCATTCCCTT 60 

CTATATTAAA TTTTTATTGA CTTACCGATC AAAACATTTA AGTAATTTTT TCTTTTTAAG 120 

AATAATTTTC TTTTCAACTT CTTCTAACTC AGAACTGCGT TCGCAAAGTT CCAATTGAGC 180 

GCGTCgTGCT TCTAAACGAA GAAGTTCTCC TTTAATTAAT TCAATTTTTT TGTAGCGTAA 240

GAAATTTTTm TTTTGCATAC GTTCTGCAAA GTGTTGTCGT TTTTCTTCAG GGATTGCTTG 300 

ATATTTCnTT TTCAAGTCAG CGACACACTT TTCAAATTCT TCACTCACCG AAACAACTTC 360 

CTTCCmAAAT TTAATCkGcT AwTTATwACA aATTTTACCA AAAGTACAAG CAAAATTGCT 420 

TTCTTTTTTc AAAAGAAATT CAGACAAAAC TtAACgCGGG AAAAGCTGAT TTATAGGAGC 480

TGAACATGGT TAATCTTCAT TAACAATAAT AcAATTTCTG CCcTTCATTC ATCCGTGTTG 540

AAAGAAAAAG AACCATTTCT TCTAGTGGCG CAGAAAAGCG TAAAACCACC CCTCTTCTGC 600

CTTTTTTTAA ATACCCTAAA CGAATACCGT TCGGCTTTTG AACAAGGACA ATCTTTTCGA 660 

TATCTTTAAA GGCCGTGCCC CGATTGTCAA AAGGATGCGT AATAATACAG TCCTCCGCAA 720 

AACCTCTTGC ATCGTAAAGA AAACTGAAGA TAATCAAAAT ATATAACACA CTTCGAATCA 780 

TTTCAATTTT ATCTCCTGCC GAAAATGCAT TGGCCAGAAT CATTAAACTA AAAAAAGCAA 840 

CTAAATATTT TGTTGTAGGa TGTTTTGCAG CCGGaATTAC AATACTTTTA CGTACGAAGT 900 

AAAAGTAACC ATACATAACG ACGACAAATA AAAATAGCAA TAACGTAAAT CCATCGATAA 960 

TCACAAACGA CTCCTTTATT TCTTtTTCTT TTGAATATCT GTTAATATAT CATTTATTTC 1020 

TTTAATTtCT TTTTTkGATT CGTTAATCTT TTkGATAACC GCCTCACGGT TTTTTCCAGG 1080 

CGGTAAGTCA ATTGaTTCTT GGTAGTAGTT TGCTAAAGCT TGAACTTTTT TCATTTTTTC 1140 

GCTTTGCAAC AAGGCTATAT TTCTTTTCAT TTAACCGCCT CCTCTTCCTT AGTATACATT 1200

ACTTTGTAAG GAATTGCTTA ATTTGCAAGA TATTTTCTAA TAAAAAGGAG TAATTTTACA 1260

TTTTCTTATA AACAAAAAGT GTTCGGCTGA GTAACACCAG CGAACGATAA AATAATAGAA 1320 

TAAAAGATGA GACAGAAGTA GTTTACTTTT GTCTCATCTT TTATTATTGA GCTGAAATCG 1380 

CTAAACCAAT TGCTTTTTCC CCTAGGTGGG TCCCGATAAC TGGACCAAAA TGACCAATTT 1440

CAATTTCTGC ATTTGGGTAT AGCTTTTGTA ATTTTGCTTG TTCTTTTTCA GCAACGATGC 1500 

GGTTATTGGC ATGAATCACA TACAGTTTAA CTGGTGCTTC AATCCCTGCG TTTCGTTCAC 1560 

CAATAATCTT TTCTGCACGA GCAAAAGCTT TCTTTGTTGA ACGGATTTTT TCAAATAATA 1620 

CAATCTTTCC ATCTTCAAAA GTCAAGATAG GTTTAATCTT CAATAGTCCA GCGATCAAGg 1680 

nTGCGCCATT CGTTAAGCGA CCGCCACGAA CTAAGTTGTT CAGATCTTCT ACAATTAGAT 1740 

ATGCATACGT ATTGTCACGA ATTAAATCTA ATTTGGCAAA AATTTCTTCT AAGCTGGCTT 1800 

TTTCTTCTGT TAAATCTAAA GCAGCTTCTA CCATGTGTCC CATTGGCATA CTTGTAATTT 1860

TTGAGTCATA TGGATACAAA GCAACGCCTG GGATTTCATC GGTAAGTCCG TGCAATGTAT 1920 

GAaCAAaaCC mGAGATtCCT GaAGAAAGAT GAaTGCTGAT GATGGTGTCG TACCCTkGTT 1980 

CAGCGATTGA TTTGTAAAGC TCTAACACTT CTCCTAAAGC AGGTTGTGAA GTCGTCGGAA 2040

ATTCTTTACT ATTATTTAGC AATGCATAAT ATTCATCTGC TTCAATGTCA ATGCCTTCGT 2100 

TGTATATTTT TCCATCTAAA ATGACTGGGA TGGGAATTAC AAAAAGATTC GGATGATCTT 2160 

TAATGCGCTC GGGTAAATAA GCTGTACTAT CTGTCACAAT AGCAATTTTC ATTTTGTAAC 2220 

CTCGTTTCAC TAATTTTCCA CGTTGTTTCT AATATAGCAT AAAAAAGGAT ATTTTAAAAk 2280 

GCTTTCCTTA TTCGTTCAGG AATTTTTCGA TGGCGTCAGc ATTTTTTTGT TCATCTACTT 2340 

CTAAAATACT GCCTGCATAA GAGGATTCAC CAAAATTCCA AGAATCTGGT ACCGGAACCG 2400 

ATAATGACTC AACCCCTGCT TTTCCTTTAA TCAACAATGA TGGTCCATTC GTTAACATGA 2460

AACTAACAGG CACATCTGTC GACATATAGC CGACTAATTT CCCAAGTGAT TCTGGGGTTC 2520 
TTAATAAAGT CATTGGGTTT TTCATTTGGC TCMTACAGC tgacataact TGTTGTTGGC 2580
GACGAACCCG ACCAAAGTCG CCTTCTTCAT CCATCCTAAA gcgagcgtaY TGAAGTAAGA 2640
CATGTCCATC CATGACCTGT TGCCCTTTTT CAATATCAAC accatctaaa TTCAGTGATT 2700
TTTCTGCATC GATTTTGACA CCTTTTGGAA ACATAGAGTC AACAATTTTT TCAAATGATT 2760
GGAAATCTAC CTTAGCATAA TATTTCGTAT CTAAATTAAA GTTTTGTTTT AACGTTTCAC 2820
GAACCAATTC AGCACCGCCA TAAGCGTATG CGGCATTAAT TTTATTCGGC CCGACACCAG 2880
GAATGTCCAC GAATGTATCA CGCATAAATG AAATTAATTT CGGTTTTTTT GATGGTCCAT 2940
TTAGTTGGAG AACCATTATT GTGTCGGCTC GTCCAGCATC TTCCCCACGT GTATCGCTGC 3000
CAAGAATTAA AATATTCTTA GCCCCGTTGC TACTTTTGAC ACCATTAAAT GTTTCTACTT 3060
TTTCTTGAGG CAACGAGTCA TCATGTTCTG CTGCAGATTT TCCTTTTAAA AACATGACTG 3120
TACTATAGGC AAATAACAGA ATCAATAGGA TCACTAGCCA TTTGAAAAAG CGTTTTGTTT 3180
TTGATTTTTT ACGTTTCTTT TTTGGTTTCT TGTCTTTTTT TCCGCGGGGT GGTTGGTCAG 3240
AACCGTTGCC ACCACCTGAA TCTGTTTGAT GTCGCCTCTC ACTTCTGCGA CTGTCCTCAG 3300
TATAGCTGTC TTCGTAAACT GGCkCTTGAA GGCTTTCAGC TAGTTCTTCT GGTTCTTCTT 3360
CTCTATGTTC ACCCATTGAT TTTCGGGGAT TAAAGGTCTT TTTATGCTCT GCTGGTCTCG 3420
ATTTTkCATG AATATGTTTA TAACGATCTA CACGACTCAT CGTGTCACTC CTTTACCTCT 3480
GTGTTTTATC ACCTTTCTTA GGATACCTCA TTTCACAGGA AATTCCTATT ACAAATATGT 3540
AATTTAATAA AAAATGAAAC TCACTTTATC TAATAAGTAA GCTTCATTTT CATTTTTAAC 3600
TCTTTAAAGT GTGGACAAAA CATAGATCGG GTAATCCCCT AAAATTTTTA TTTCTGCACC 3660
AATGAGTTCC AGTTCTGTTA AGGCTGCTTC AATTAATTTT TCTGGTTGAT CTTTCACTAA 3720
GTCCATTAAA AAGAAGTACT CTCCTAGCTT TGTTTTCAAC GGCCGCGATT CTATTTTGCT 3780
AAGATTAATC CCTCGCCAAC TAAACACGCT TAATACTTTG TGTAAAGAGC CAGGAACATT 3840
ACTTGGCATC GTAATCGCCA GTGTTATTTT TTTCTCAGAT AGCGGGAAAG AAATCGCTAA 3900
ATTTTCAGAA CCAAGAACCC AAAATCGGGT TTGATTTACC GATAAATCTT GTATATTTTT 3960
TTCAACAATG GTCAAATCAT ACATTTCCGC AGAAAGTTTT GGTGCAATAG CTGCAAAAGG 4020
TAATTCTGGA TGTTCTGCAA TGTATTTGGC GGCGTAAnCT GTTGAAGGTG TTGCTTCTAA 4080
AATCGCTTCT GGAAAGTTTT TCTCTAGAAA CATCTGCGAT TGAGCTAATG CTTGCGGATG 4140
TGATAAAATT TTCTGACTTT GTTGCCAGAT CGCTTGATTC TCTTTTGCCA CCATTAATTG 4200
TTGTTGAATC GGTAAAACTA ACTCTGCTTG GACAGGTAAC TGCGCTTGAT GATACAAATA 4260
ATCTATCGAT GCGTTAACAG TTCCTTCAAT CGTGTTTTCT ATTGGGATAA TGCTCCATGC 4320
CACTTCTTGC TGTTCAATTG CTTTCAAGCA AGCTGGAATC GATGCGTACG GCATCAACGT 4380
AGCTTCAGGA AAAGCAGCCA ACGTTGCACT GTACGTAAAG GAACCAATCG GACCTAAATA 4440
ACCAACTTTC ATCGTTTCTC TACTCCTTTA ATTGTTGCAG AATTTCATTG ATAATTTCTT 4500
CTGGCGAGCG ATTGGTCGTA TCAATTGTCA TCTTCGCACA TTCTTCATAA AAATTTTTAC 4560

GAGACTCAAA CAAAGTAATG ATTTCTTTTG AAGAACGTTC TATAGCTAAG GGCCGTTGGT 4620

TTTCAGTATC TTCTGTGATT CTTTTTAACA GCTCTTCTGG TGTCGCATGT AAATAAATCA 4680

CTTGCTGAAA AGATTTTAAT AAGCTACGAT TTTCTGGTCC GACAACAATC CCGCCCCCAG 4740 

TTGAAAGGAC GGCTGTATTT TTTGACAGCT CCTTTAAAAG TTGGGTTTCC TGTTCTCGGA 4800

AAGCTGCTTC ACCATATTTT TCGAAATAGT CAGGAATTGA GCGTCCTATT TTTTCAATTA 4860 

ACGCTGTATC TAAATCAAGA TGAGGCATCT TCAGTTTATT GGCCAAACTT TGGCCGATAG 4920

TTGTTTTACC CGCACCCATG AAAcCAATTA AAACAATGCT TTCCATGGTT CGTTATCCTC 4980 

CTTCGTTAAC AACTTAAACG TTCTAAGTCG TCAAAAAATG CTGGATAAGA AACTGAAACT 5040

GCTTCAGCCT TATCTAATTC AACAGTGCCT TCTTTTACAA GTAATGCAGC AATTTGTAAC 5100 

ATCATCCCGA TACGATGATC CCCATAACTG GTAACTCTTC CACCATGTAA AGAAGTTGGT 5160 

CCATGTATAA TTAAGCCATC ATCAGTAGGC GTGATGTCGG CGCCTAAAAT TGTTAATTCT 5220 

TTCGCTACTG CATCAATCCG ATTGGTTTCT TTGACTTTCA ATTCTTCTGC ATCTCGAATG 5280 

ATTGTCGTGC CAGTAGCCTG AGTAGCTAAC AAAGCAATAA TCGGTAACTC ATCAATTAAA 5340

CGTGGGATAA TAGCGCCACC AATCTCTGTA GCTGTTAATT GACTCGTTTT TACAAGTAAA 5400 
TCGCCAGAAT GATTGGCCTC ATCTTCATTT AAAATAGTGA CGGAACCGCC CATGTTTTTA 5460 
ATCACATCTA AAATACCTGT CCGCGTTTGA TTTAAGCCAA CATTTTTCAG AAGTATCTCG 5520 
CTATCTGGGA CTACTAAACC CGCAACTAAA AAGAAAGCTG cAGAAGAGAT ATCACCAGGa 5580 
ACTACCACAT TTTGACCTGT TAATTGTTGC GGTCCAGTTA ACATAATTTT TTTACCGTCT 5640 
ACTTCAAGTG TCCCACCAAA TTGTCGAATC ATCTCTTCTG TATGATCACG TGTCTTTTCT 5700 
TTCTCAACCA CTACAGAAGT GCCCTCGGCT TGCAAAGCGG CGAATAAAAT AGCCGATTTA 5760 
ACTTGAGCAC TTGCAACAGG CATTGTGTAG TCAATCGGTT GCAAATTTTG AGTCCCGCGA 5820 
ATAGAAATTG GCGGAAACTC CGTTTGCTGA ACCCCTTGAC ATTCCGCTCC CATTTGGTTT 5880 
AAAGGAAGCA TTACACGATT CATTGGTCGT TTGGCAATAG ACGCATCACC AGCTAGGCGC 5940

GTTTCAAAGG GACAGCCAGC TAAAATGCCC AGCATCAGAC GAATTGTTGT CCCTGAATTT 6000 
CCAACATCAA TTGTATTCTT CGCCTTTTTT AAGCCTGCAA ATCCTCGCCC CTCAACGGTG 6060 
ATTGTCGTCC CGTCATCTTC AATGTTCACA CCTAAAGAAC GAAACGCCGC TAAGGTACTT 6120 
AAACAATCTT CGCCTCTTAG AAAATTTGTA ATCGTCGTTT TTCCAGAAGA AATCGCTCCA 6180 
AACATAATAC TTCTATGGGA AATCGATTTG TCGCTAGGAA CCATCAGAGT CCCTTGTAAA 6240

TGCTTCACAT TTGTACGTAG TTGCACCCTC ATCACCTCTT ATTTTTCGTA ACACGTATAG 6300 
TTCGTTTTTT TAGATAAAAT TTGTTTTGCA GCTTGGCAGT CTGGTTGATT TTTAAAAGAC 6360 
AATTGAAGAA TCCCATAGAT TTCTTCTCTC GTTTCTAAAA TTTTAATATT TGTAAGAGAA 6420

AGGTCCGCTT CGCCTAAAAT TTGCGTAATT TCAGCAATGA TTCCTGGTTG ATCTGGTACA 6480
TCAACAAACA GATCATAGAA AGCTGGGATT GCGCCTTCTT TATGAATAGG CAATTGAGCA 6540
CGTGTTTCTT TGGCCTTATC AAAAAAATTA CGAATAGCTG GGGCATTTTT TTCTGTTAAC 6600
CATTGGCAAA CAGTAGTCAT CTCTTTTTGC CAATTTTCTA GTAAGTCCAA TAATACTAAG 6660
CGATTGCTTA ATAAAATATC CGTCCACATC GTTGCATCAG AGGAAGCAAT TCGAGTAATA 6720
TCTCTGAATC CTCCAGCCGC TAGCTGCTGC GCTCTAGGGT AAGTGGTATT CAGTTGCTGA 6780
CTTTCGTTCA CTAACGCTGC GGCAACAATA TGTGGCAAGT GACTTAGAGC ACCAGTAATT 6840
TCATCATGTT CCTGTGCAGG CATCGTAATA AACTTCGCAT GCGTTCCTTT TAGTAACGTT 6900
TGTAACTCCT GAATCTGTTT GTTTTTTTCG CCATGGTCAT CGGTAAAAAT ATAGTAGGCA 6960
TTTTCAAACA AACGTTCATC AGCGGCTGTT ACGCCGGATT TATGTGAACC AGCCATTGGA 7020
TGACCACCAA TAAAATTTTT CATGTTAAGC TTTGTTGCTA ACTGATTAAT TTCCACCTTG 7080
GTACTACCCA CATCTGTGAT CAGAGCAGTT TCTAGTGATA ATTGGTTTAA TTCTTCTAGT 7140
TGTACTAAAG TTGCTTTAAC TGGGGAACAA AGAAAAATAA TCTCTGCTCG TCTTGCCCCA 7200
CTTGTTAAAG ATTCAGCTAT CTCATCAATT AGACCCGTTT TCTTAGCAAA TTCAGTTGCC 7260
TCCGCTTGAT TATCGAAACC GATAATCTCA CTGTTTGGAT GCCCTTTTTT GATACACAAA 7320
GCCAGTGAAC TCCCGATTAG CCCTAATCCT ACGATTAAAA TACGTTTCTT CATTTTTTCC 7380
CTCCATATAC TTGCGACTTT CTTCCATTAA AAGTTTTGAG TATATAGACG ATAACGTTTC 7440
ACTGCTTCTT TCATTTGTTC AAATGAGTCA CTATCAAATT TTTCGAGCAT AGCCTTTGCT 7500
ACTTCCGTTG CAACAACGGC TTCACAAACA ACGCTAGCGG CCGGTACCGC CGTGCTATCA 7560
GAGCGCTCAA CACTGGCCTT ATAAGGCTCT TTTGTATCAA TATTCACGCT TTGTAATGGT 7620
TTATAAAGGG TTGGAATAGG TTTCATGACA CCACGAACGA TGATTGGCAT TCCGTTGGTC 7680
ATTCCTCCTT CAAAACCGCC TAAATTGTTG GAAGTTCTGG TATAACCAGT ACTTTCGTCC 7740
CAAACAATCT CGTCCATCAC TTGACTACCA GGGCGTTGTG CCATTTCAAA TCCAATGCCA 7800
AACTCAGCAC CTTTAAAAGC GTTGATGCTT ACAACTGCTT GCGCAATTTT GGCATCTAGT 7860
TTACGATCCC ATTGGACATA GCTACCTAAG CCAGCTGGAA CGCCACCCAC AAGTACTTCT 7920
ACTACCCCAC CAATTGTATC GCCATTTTTC TTGGTTTGGT CAATTAGTTC TTTCATTTTT 7980
TCTTCTACGG AAGGATCTAA TACGCGAACG GCAGATTGTT CAGATCGTTC TTGAATTTCA 8040
CGAATCGTTA AATTTTCAGG GATCGTAGCT TCAATCCCAC CTAATACCGC GACATGCCCA 8100
GCGACTTGGA TATCCAGTTC AGCTAAGAGT TTTTTAGCAA CCGCACCAAT CGCCACACGC 8160
ATCGTTGTTT CTCGTGCCGA AGACCGTTCT AAAACATTCC GTAAATCATC ATGTTGATAT 8220
TTCATGCCAC CGACTAAATC AGCATGTCCT GGACGTGGTT TGCTGACGCG GCGGATTTTC 8280
TTTTGTTTTT CAGGAACTGG CTCTACTGAC ATCACGGAGG TCCAATTTTT CCAGTCTTTG 8340
TTTTCGACAA TCAACGTTAC TGGTGAGCCA AGTGTTTTAC CATGCCGAAT ACCCGAAGTA 8400
ATACGTACTT GGTCTTTTTC AATTTTCATC CGTCCCCCAC GACCGTAACC GCCTTGACGT 8460
CTTGCTAATT CTCGGTTAAT CTCTTCGCTA CTTAAAGGCA AGCCGGCTGG TAAGCCTTCA 8520
ATAATAGCAG TTAATTCAGG TCCATGTGAT TCGCCTGCTG TAATAAAGCG CATCTTAAAT 8580
TCCTCCTTCT CTGTTTAAAA ATGCTCGAAT TTCTTCCGTT GGTAAAGAGA CAATTTTCGC 8640
TTGACCAATT TTTTCTAAGA CAATGATTTT AATCTGTCCC CCACGTGTTT TTTTATCATG 8700
TGTAATTGCT TGATAGAGAC GTTCTTCTGA CCAACGATCT GTGGTTACTG GTAAATGGAA 8760
CTTTTCTAGC ATGGTGACCA ACTCTTGGGT AGTCCCAAGT GGGGAAAGCC CTTGTTGTTC 8820
TGCGACTTGA GTTATTTGAA TCATTCCTAA AGAAACGCCT TCACCGTGAG CAATCACCCC 8880
GTAACCAGCT GTATTTTCTA ATGCGTGGCC GATCGTATGC CCAAAATTCA GAATCAAACG 8940
TAAGCCCAAA TCTAATTCAT CTTCTTCGAC GACATCACGT TTAATCTTGC AACAGGCCGT 9000
GATAATTTCT TCTGCATGTG CCACTAAATC TTGTTCATTT TCCAAAGAGG ATAAACGGTG 9060
CCACAATTCA ACATCAGCGA TAGCTGCTGA TTTTACAATT TCTGCAATTC CTTCACGCAC 9120
ACGTCTAGGT TCTAATGTTT TAAGTGTATT AGGATCAATT AAAACCCCAT CTGGTTGGGC 9180
AAAAGTACCG ACAAGATTTT TGGCTTTTTT AGTATTAACC GCTGTTTTAC CTCCAATGCT 9240
ACTATCTACT TGTGCCAGTA AGGTTGTTGG TACTTGCAAA AAGTGAATAC CGCGCATATA 9300
GGTTGAAGCG ACAAATCCAG CTAGATCTCC CACAACGCCT CCACCTAAAG CAATTAGACC 9360
ATCACTTCGA GTCAATTGCT GTTCCGCTAA AAAATCATAA AGCAGTTGAG CTATTTCGAG 9420
GCTCTTACTT TGTTCACCTG CCGCTACAGC AATCAATGAT GTTTCAAAAC CAGCAGCTTG 9480
CAATTCTTTC TCAACAGcTG CGCCATATAA TTTATTCACC GTTTCATCGG TAATAATCGC 9540
TACCCGTTGT GGCTGCCACA GGCTACGTAC CCAGGTGCCA ATTTTATCTA AGGCACCTGT 9600
TTCGATGGTT AAATCATATG AATGTGTAGG TAACGTTACG GTTAATTTCA TGTCCTCTAC 9660
TCCTCTGTCT TTACTAATCT TCTAAAGCAT TAATTTCTTT CATTGCTTTT TCGATGATAT 9720
GAACTTCTTT CATCATACGT AGGTAAGTTT TTTCATTCAA GGATTGTGGC CCATCTGACC 9780
ACGCATTCGC TGGATCTGGA TGAATTTCTA CAATCAAGCC GTCCGCACCT GAAGCAACAC 9840
CTGCTCGGGC CATTGGCGGT ACTAAATCCC AGATACCAAC ACCATGACTC GGATCAACAA 9900
TAATTGGAAA ATGACTTAAT TTTTTAATTA AAGGCACCGC ACTTAAATCA AGTGTATTGC 9960
GCGTAGcggT TTCGTACGTA CGAATCCCAC GTTCAATGAA GATCACATTG AAATTTCCTT 10020
GCGCACAATG TATTCAGCTG CGTTTAGCCA TTCATCAATC GTACCAGCAA TCCCGCGTTT 10080
TAAGCCAATA GGTTTTCCAG TTTTACCAAC CGCTTGTAAT AACTTGAAAT TTTGCATGTT 10140
GCGCGCACCG ATTTGTAAAA TGTCACTGTA TTTTGCGACA AGTTCTAAAT TCGCTTCATC 10200
CATCACTTCT GTAATGACTT GCATATCTAA TTCATCAGCC GCTTGGCGAA TGTATTTTAG 10260
TCCTTCTTCT TCTAGTCCTT GGAAAGCGTA TGGCGACGTT CTAGGTTTGA ATGCACCACC 10320
TCGTAAAATT GTTGCACCTC CAGCTTTAGC AATTCGCGCA CATTCGCGAA TCTGATCTAA 10380
GCTTTCGATT GAACAAGGGC CCGCCATCAT GGTCATACTG CCGTCACCGA TTTTAACACC 10440
GTTCACATCA ACCACTGTAT TCTCTGGATG AAATAAACGA CTCGTTAGTT TGTAAGACAA 10500
CGAAATGCGG ACACTTTTCT CCACGCCTTC TAATGCTGTT AACTCTGCTT CAACTAATTT 10560
TTCTGTATCA CCAATCAAGC CTATCACTGT TTTTCCATTA TCTTGACTTG TTTGAGTAGT 10620
TAAGCCTGCA CCTGTTACTA AATCAATGAC TTGTTTCATT TGCTTTTCGG TTGCATTTTC 10680
TTTCATAATT ACGATCATTA TATTCTCTCC TCATGCCTTT AAACGGCGCT CTTTTTATCT 10740
ATTTTCAATT TTACGTTTTA CGACTGATAC AGGCATTTTT TGTCCTGTCC ATTGTTCAAA 10800
AGCAGCAGCG CCTTGATAAA GTAGCATGTC CaAGCCGTTG CTTGTTTCTG CACCACGTAA 10860
ACGGgCTTCT TTTAATAACT GTGkTTCyCT GGGATTATAA ATAGCATCAT ACmCAAATAA 10920
CTTCGGTTGA ATCATTGCAT AATTTTCTAT AGGACtACTA TGCGCATGTG GATGCATACC 10980
CACACTGGTT GCATTAACTA ACAAGACGcT TTCAGCAACA TCTTTTGCTA GTTTTTCAGT 11040
ATCCGCTAAA TCGTTTAAAT GAATTACACA GTTGGTTCGT TCAGTGAAAT TTGCCAGTTT 11100
TTTTTGACTG TCGTTAAAGC CCGCGCTTTT CCTGTTGTAA ACGGCGATTT CTTTCACGCC 11160
ATCTAAAGCA GCTTGGGCAA TGATTGATAA GGCTGCACCA CCTGTTCCTA AGATGGTCAT 11220
TTTATTCTGA AAAACGTCAA CCTTTTTCTC TTTCAAACTC CACATAAAAC CAGTACCATC 11280
CGTGCTGTCT CCGTAAAGTT TTCCATCTTT GTTGACAATC GTATTAATTG CGCCAACCAG 11340
TTGAGCGGTA GGACTTAGTT CATCCATATA ACTTACAGCC GCCACTTTAT TGGGCATTGA 11400
TAAGTTGGCC CCTAACATGT GAAACGTTCG AATCGCTTCA ATTGCTTGTG GCAATGTCTC 11460
TTGTCCAACG TCAAACGCCA GATACCGAGC ATCAACTCCT AAATTTTGAA ATGCTGTATT 11520
ATGAATCAAC GGTGAAATAC TGTGTTGGCT GGGTTTCGCG AATAGCCCAG CTAAACGAGT 11580
GGCTCCAGTT ATTTCTTTCA TCCTGTGTGC TCCTTTACGA AATTTTTTTA GTAAGTTTCT 11640
GAAATCCTAA TTTTTGGCCA AACAAAAAGG ATATTCACAG TAAATTTTAA GAAAAATCTA 11700
CAGTGAATAC CcTACAAAAA AGCAATATAA AAGCCACTAT AGATTTTCTT AAATCTACAT 11760
GGCTGAAAGT CATCTTAAAT TTCCCTAGCC ATTATAGATA CAAACGTTTG TTGTCTGTAC 11820
CCATAATGTT ATGAATACAA ACCTATCTAA AATAGCTAAA GTAAAAGTTA TTCAGTTGT 11879


(2) INFORMATION FOR SEQ ID NO: 182: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 
CATGGCAAAT TTTAGTATAT AGTCCTAAGG ACATTTTACA AAAAGAATTn AAAACGAAAC 60
nCATGaCGGG gTaTGACCCm ATTGAAGTAG aCGrATTTTT AGACAaTGTT ATTAAAGACT 120
ACGAAGCTTA TAATAAAGAA TTATTAAGCT TGCAAGAAGA AAACAGCCGC TTAATGGCAA 180
AATTAGACCA ATTATCaAAA GCCCAACCAA CGCCTCGTGT TGCTCaAGAA GcACCAAAAA 240

GTGCAGCAGT GACCAACTTT GATATTTTAA AACGTTTATC AAAC7TAG.AG_ CGTGAAGTTT 300 

TTGGTAAAAA GTTAGATGAA ACACCTTCAA CACCTGTAAC GCCTTCTGCA CCGTCAATGC 360 

CTGCAGAACC AGCGAATCAT GATGTTGATA ATGCACAAAC ACGTC.--ATTT TAACAAGACA 420 

ATTTAAGTCA AGTTTTCACG AGGaACAAAA AGCATTTATT AGAATAGCGG ACACTTGTTG 480 

TAGTGTGTAA ATTTTTGCTG TTCGTCTAAA CAGATTTGTw ATTTTCGGGT GATCGCGGTT 540 

TGCTTTTGCA AGCTGAGGAA AGTCCATGCT CGCACAAGCT GAGATC-CTTG TAGTGTTCGT 600 

GCTTAGCGAA ATCATAAGCT AAGGTACTCT TTTAGAGTAA CGGCASGAAA AATGACTAAG 660 

GTTTCGACTA TGTCAAAGTA TCCTTGAAAG TGCCACAGTG ACGAA3CGAT GTGGGAAACT 720 

CCATCGGTGG AACGCGGTAA ACCCCTCGAG CGAGCAACCC AAACA.VTGGT AGGGGCGCTC 780 

TTCTAAAGGA AATGAACGAG TAGAAGAGGC AGAGTTTACT CTGCASATAG ATGATGACCG 840 

TCACCAATTT TTCCTGAAGA ATTGGTGATA CAGAACATGG CTTATAGAAA ATTACAAGTA 900 
ATTCAGGAAG ACCTTCCAAC GTAGTTGTGA AGGCTTTTTT TACGAnAAGA ATAAAAAAAT 960 

GTtAAAATAA AGAAACTGAG wTAAAaGTAG TAAgCTGTCT GTTCT.--AAGA GGCCGGAACA 1020 

GGACCGTTtA GTcAGAGAAA GCGGACAATT wTAAAAAmCA aGCgC-.-.ccAG TAGTCsGCAG 1080 

GACTGTTAGC TCCAGACmaT wAAAAAGrgT CmAGAAGGAG TTTTargaCG AAAGAAAAAtti 1140 

CaTyCAACyT GTTGCaCGGC TGCTAGTGGC TTAGAAGCAT AGTTGGn 1187
(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4178 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:
AAATAACTAA AACAGTTAGA AATAATGAAT TTCCTCGTTG CTGTTGGCGT TTTGCtAAAC 60 
GaGAACCACC ACCATTTTGA TCAGGTTCAT AAATTGATTG TTCCC.-.TGGC TCACGAGCTT 120
GGTTTTTCTT TTTGTCCTTT TtACTCACTG CTAATTTCCT CCTAAACTTC TCTCGCTTGT 180
TATTATATCA TAGCTTTTGC CAACTGCCGA CTAAAAAAAT GAATTAATCA CGTTCTTTAA 240
ATAATTTTAG AAATTTAGAA CGCCAACCTT CTTCATTTTG ATTGC7GGTC TTGTTCGATT 300
TTACTATTGA ATCTTGGTAA ATGTCAAAGA ATAACGCACA ATTATCACAA CAAGTTTCCG 360
GTGATTGTTT GACAATCGGT TCTTGAAAAT AAGTTTGAAT CAACGTTCTT CGACAGGTTG 420
TGGTTTCAAT ATACGTCAGC ATCCCTCTTA ATTGTGCT7G GCGATCTTTT TCTTTCTCCT 480
CTAAACGATG CTCTAGCTCT TCAAAAGAAT AATCCTTGGC TAAA7AGCCT TGGACCCATT 540
TTTGTTGTAA CTCGGTGGCA TTTTCTAGCA AACCTGCTTG TTCTTCTCCT TCAATCAGCG 600
ATTTTAAGGT CGCTCGATTC GCTCTTGTCT CTTCTAAG«A AAACC3ATGA ATAAATTCAT 660






CCCCTTTTTT ATACAATAAT AGCGCCCAAC TTTTTTGACC ATCACGTCCA GCTCGCCCGA 720
TTTCTTGCAG GTAATTTTCT AAGCTATCAG GCAAATCAAA ATGAATCACT CCACGAACAT 780
CAGGTTTGTC GATTCCCATC CCAAAAGCAT TCGTTGCACA AAGAATATCG ATTTGATTAT 840
CAATAAATTG TTCTTGCAAG CGACTACGTT CACTAGCCTC CAAGCCCCCA TGATAATAGG 900
CTACCTTAAA ACTGGTTTTT GCCTGAATAA GTTGACTGAT TTGTTCTGTT TTGTTGCGCG 960
TGGCGCAATA GATAATGATT TTACCAGCCG TTTTAGATAA GTAATCTAGT AACACTTCTT 1020
CTTTTTCTGA CGTTTCCTTC ACGAACAAGC CAATATTTTT TCGATTCACT GAAGAAAGAA 1080
CTTCTTGATA GCTGCCATGA GAAAATAATT GTTTTATAAT TGCGTGCTGT ACAACGGGTG 1140
TCGCTGTGGC AGTTAACGCC AAGGTCAAAG GAAAGTCTAG CTGTTTTTGG ACCAGATCTA 1200
ATTTACTATA TTCAGGGCGA AAATCGACGC CCCACTGATA AACACAATGG GCTTCATCCA 1260
CCACAAATAA TGCAATTTTT TGGCGTTGTA ACACACTAAG TACTTCTTGC TGCAAAAGCA 1320
TTTCTGGGCT TAAAAACAGA TAATCGTATT GGAACATTTT CGCTAAAATA TAACGCTTTT 1380
CTGAATACTG TAACGTACTG TTTAAGGCAA CCGCACCTTT AATTCCTTGT CTTTGTAATT 1440
GACGCATCTG ATCTTCCATT AACGAAAGTA AGGGAGAAAC AATTACCGTT AGTCCCTCTA 1500
ACTTCTGGCC CACAAATTGA TAACACAATG ATTTTCCAGT CCCTGTTGGC AAAACAGCCA 1560
ATGTATTCAT ACCGTCAAGT AAGTTTGTAA TCACTTCTTT TTGACCAGGC TTAAACGCTG 1620
CGTAACCAAA TTGAGTGAAT AATTCTTGTT CTAATGTCAA CAACACTCAC TCCTTTTCTT 1680
GCCAAATTTG TTGGATTTGA TATAACCGAA AATCTAAATA AGAAAGCGTC GCTTCTTGCG 1740
CTCGCCAATC CCGATACACC CACAGCCGCG GATCTTGTTC TGTTTGAGCC AAGTTTTGGA 1800
GCATTGCCTC CTTTTCAAGG TAAACAGACG CCTGCAGCTG CCCTTGAATA GCCAACTCTA 1860
ATAGATGGTC GTGAATGGTA CTCGGTTTTA TTTTTCGCTG CTCAGCAATT TCTTGAAGTG 1920
TCTTACCTTC CGTTAACAAT CGTTGGGTTT CTTGCATACT TTGGTTTAGA TTTTTTTCTA 1980
GATACGGCCA TACAAGCGTT TGTAAAGCAG GATAATTTCC TCCGTACATT ATCAACTGCA 2040
GTAAATGATG AACCGCTTGC GTGTGCCACA AACGCTGTTC AAGAGCTGTC TTATGGCTTG 2100
TTAATTGCTG AGGAACTTTG CCAGTTTGCT GATAACCAGA AAGTTGCGCA ACCAAATAGT 2160
CGCTTTCTGC TTCAGGTAGT AACTCAAACC CTCGCAACAG TTCTTCTTTG ATTGATTGAA 2220
CGCGCTGCTC TTTCTTGTCT TGTTGTAACC ATCGTTTCAA ATATAGTTGC GGAATCGGTG 2280
TTGATAAAAG TGGAATATAC TGTTTTTCCT CAAATGATAA ATATGAAGTC ACTTGCACGG 2340
CAAATTGAAT GAGTTGCCAA CTTTCTCGAT CACTTCTGCC AAAACGATAA AGATCAATTG 2400
CAGGATAATC CAGCAACGAA AAATGGTGCT CCTTTAAAAA TAGTTGCCCT TCTTTTGTTA 2460
GTTGAATCTC ATTTGTCTCT TCGTGATACT GAATTTGTTT CGCCTTCGAA AGTTTTTCAA 2520
GAAATTGAAA ATAGGCTTGT TCGGAAATCG TTGGCCACCA GCCAATGAAC CGTAAACAAT 2580
CATATAAAAA ACCATAAATC AAAACAGAGC TAGTCCGTTT GCCTTTTAAA AGATGATACA 2640
AAGTTGATGC ACGTACCTTG TATCCATGAG ATAATAACGC TAAAATAAAA GTTGCTTCCA 2700
TAGTAAACCT CTTTTAGATA GGATGTGTCA CTGTTTATGC AATCACGTTT AGTTCCTGAA 2760
AAATGTATTG CTTGTGGACT TTGTCAAGTT TATGCGCCTA ATATATTTGA TTATGATGAT 2820
CATGGGATTG TCCTTTTTAA AGACGAGCCG CATGCTCGAC AACAATTTAT TCGAGAAAAC 2880
GAACGCGAGA ATGTGTTAAA AGCGTACCGC AAATGTCCCA CGCGTGCAAT AGAAATAAGC 2940
AAAAATGAGT GCTAAATAAT TTAGCACTCA TTTTTGCTTA GAGTATCTTT TCGAGCCAAG 3000
CAAGTGTTTG TTCATCTATC GAAACAAACT GTAATTGTCC TTCTTTTAAC GGTTCTTTGA 3060
CTAAATGAAT TTGATCATTT AGGCCATTGA TCCaTTCATA ACGAAGACCA TTTTCTTTTG 3120
CTTTAGAATT TTTGGCTTCT AAATAATAAT TGACATGGTA ATTTTCTTTA CCATCCAATT 3180
GTAGCGTATT AATGGTCACT TCAAAATCTG GTGAACCATC GGTCGATTTC TTTTCACGTA 3240
ATCGTTCAAG TGCTTCTATA AAAGAAGGAA ATTGAGGATT ATTTTCCCCA CCAACGAGAT 3300
ATTTTTTAGC AAATTCTGTA CTATCCGTAT GACTTTTCAC TTGTTTGGCA AATGCTTGAT 3360
AAAAGGTTTC TAGCGCTTTG CGGGCATCAT AGTCGCTAGC TAACGGAAAA TTCAATTGTA 3420
GCGTTGGATT ATCGGAGAGC GCTTCGACAA AATCATTTTC ACCAATCGTT TCTTTTGATG 3480
TTTGAATTTC TTCTCCATTA ATCGTCTTTT TAAGTTGAAT CGTCATCCCT TTGGACCAGA 3540
ACAAAGGGCC AATCGTTTTG CTTCCATTCG TTAGCGTACC AcTTTCCGAT CATTAATATA 3600
AATATCTGCG TCTGGCAGAT TGCTTTCCAC AACAAAATTC ATTCGTTTAA AATCTAAAGG 3660
AATACTAGAA ATATAATTTT CCTCTTCTAA TAAACGGTAC TCTTCTGTGA TAACAGGTTC 3720
GCCGGTGCTA TCTTTCACTG TGGCTGTAAA AGTATATTGT CCTGGCGCTA AGGGGCCCAA 3780
TTCCTTTTGA TAAGTGGTTG AGTCGGAAGT GCCCACTTCC CGCCCGTTCA TTGCCAGCGT 3840
CACACCGCGC TGATTCGTCG TTAACTGAAC AGAAACAGGC GTTAAAATGA ATTGATAACG 3900
GTGAAAGAAA AATGCTGTTT GACTTGTTtC ACGAATAGTT AAACCATGTA ACTTTTTATC 3960
CTTTAATAGC GCTTTTTCTA ATTTTTTTAG CTCAGTTGGA TTTTTTTGAT AATAAGTGAT 4020
TAAAGGCTTA ATGCTTTGCG GATTCACTTG AAATTCAGAC TCGTTCGTCC TCATTTTTTT 4080
ACTTAATTCT TGGGCATCAC CATTCTCAAT CGCTTCAATA AAATAAGAAA XTTGTTGTTC 4140
CTTAGAAAAA TAATGCGTCC CGAGAAAATA GCTACCCG 4178


(2) INFORMATION FOR SEQ ID NO: 184: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1014 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 
GATTTTAATG GGCGGTATCC TTTTAAGTGC TTCCGCGGTT TTACTTACTG CCTGTGGTGG 60
CGCAGAAAAA ACGAAAGCTG AAAATGAAAG CGGTTCAGCA AATAAAGCAG CTGGTGACGT 120

AAAATTATGG GTTGATACTG AATATATGGG CGTCTTTAAA AAAGTTGTCG CCGATTTTGA 180

AAAAGAAAAC CCCGATATTA AAGTTAACTT AACTGCTGGA AATTCGGCAG ATGCTAAAAA 240 

AGACATTGCA AAAGATCCTA AAGCTGCCGC AGaTGTTTTC ATGATGCCAC ATGATCAAAT 300 

TGGTCAAATG GCAGAAGCTG GTCTGATTTA CCCTAACACA AAATATGAAA AAGAAGTCAA 360 

AGAAAACAAT ATCGACTCGG CAGTCGCAGG AGTCACTTGG AAAGACAAAG TCTATGCCTA 420 

CCCTTATGCC GTAGAGTCAC AAGTTCTTTA TTATAACmAA GATACGTATT CTCCAGAGGA 480 

AATTAAAACT TGGAAGTCTT TAACTGGAAA AGGcAAAATC GGTACCAATT TTGGCGAAGA 540

TGGCGCAAAT TATATCTTCG GTCCTCTATT TATGAGTAAC GGCGATTACT TATATGGTGA 600 

AAATGGCGAA GATCCGAAAG GTACCAACTT TAACAACCAA CAAGGTATAG AAGTATTACA 660 

ATGGAtTGCC GATCAAAAAA ATAATCCTGG TGTGATCCAA TCAAATACAG AAGCATTGTC 720 

TAATCTAGGT TCTGGCAAAA CGGATGCCTT CCTTTCTGGT CCTTGGTCTA AAAACGATGT 780 

AGAAAAAGCT CTTGGCGACA AAATGGGGGC CGCTGCCTAC CCAACCATTG ATTTTGGCAA 840 

TGGAGAAAAA CAAATGAAAG CTTTCTTGGG CGTTCGTAGT TTTGCTGTCA ACCAACAAAC 900 

ACAAGCACCA TTGGCCGCAA TGACATTAGC AAACTATTTA ACAAGTGAAA AAGCACAAAT 960 

GACTTACTTC mAAGAAATCg GTTTTGTCCC tCCAACCAAA AATTACCAAn CAAC 1014 
(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3614 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

CGTTCCATGG CCGTTGTTAA TTTATCCATA TCGGACGCAT AACTAATTCG GACGTATCCT 60 

TCTGCTTCTG GGCCAAAAGA AATACCTGGA ATCAACGCTA CTGCTTCTTT TTGCGCTAAA 120 

TCAATACAAA ATTTCATAGA ATCTTGTTCA TAACCAGTAG GAATCTTAGC AAAAATATAA 180 

AAGGCCCCAT TTGGACGCGC AACCTCAAAC CCCAATTGAG TCATCTGTTC ATAAACAAAA 240 

TCGCGCCGTT TCCGATACTC TTCTTTCATA ATTTCAGCAT CATGCATTCC TTCGATTAGG 300 

GCACGAACAG CCGCTTTTTG AGACACAGTT GCAGCCGCAG TAACTAAATA CTGATGAACT 360 

TTAATGATCT CCGCTGTTAA AATCGCTGGT GCAAAAATAA AGCCAATCCG CCAGCCTGTC 420

ATGGcATGTG aTTTCGACAA ACCGkTAATT AAAATCGTTT GGTCACGTAA CATTTCAGCA 480

ATCGAAACAT GACTCTCATC ATAAGTTAAC TCACTATAAA TTTCATCACT AATCACAAAA 540

ATGTCGTATT TTTTTAGAAC ATCAGCAATT GCTTGCAACT CTTGGCGGTT ATAGGTGACA 600 

CCAGTTGGGT TGCTAGGATA ATTTAAAATA ATTGCTTTGA CTTGTTCCCC ATGCTCGGCC 660 

ATTGCCGCTT CAATCATTTC AGGGGTCAAG ACAAAATCGT TGTCTCGTGT ATCCAAATAT 720 
ACCGGTTCCG CCCCCGCCAA TGTAATAATT GGTTCATACC CTGGGTAAAT AGGTGCTGGC 780
AATAGCACTT TGTCACCTTG TTCTAAAATT GCTAATAACG AAGCTGAAAT CGCTTCTGTk 840
GCTCCAACTG TTACTAAAAT TTCTGTCGCA GGCTGGtAAG TCAAATGGtA CTTTtCTTTC 900
ATAAAATAAG CGGCTGCTTC ACGGaCATCT AATAAACCAG ACATCCCTGT AtAATGAGAA 960
AAATTCGCAT CAATTGCTTC TTTGGCTGCT TTTTTGACAT GCTCAGGTGT GTTAAAATCC 1020
GGCTCTCCTA GCGTTAATTT TAAAATACCT TCAATTCCTG AGACTTGTTC GTCAAATTGA 1080
CGAATTAAAG AAACAGCAAT TTTATTTACT TGACGATTAA AACGATtGGT TAAATCCATT 1140
TAGCTTCACT CCTTCAGTTG ACGTTATAGT AGCACAAATT GACCGTAAAA TGAATGATTT 1200
TCTCATTGTT TCGCTAGATT TTTATTATAA ATATCTCAAA AAAACGACAG AAAAGAAAAT 1260
AGGAGAACTT TGTCCTTTTC AACAAAGTTC TCCTATTTAT TTATTTTTCG AAACGCCATG 1320
TCCGAATTTC AAACGGACGT AAtGGTGCAT CAACTTCTAT CGCTGGTGTC AACGGTTCTT 1380
CTAATACATT TGTTAAACAA GTCGCTGGTT GTGTCTCTTT GTGAAGTTCA CAAGGAATAT 1440
TTTGCGCTGT TAGATTAAAG CCACGAACGA CTAAATGATT TGTTTGCTCT TGACGTTTAA 1500
CGGCTGTCCA ACCAATTGGT CCTTCTGCCG TTGTCAAGTA GACATGGTTC GGTGCTAGTT 1560
TACCTTCATG ACGCGCAGTA GTAGCTGCAG TAAAAGGAAT TTGTGCACTA TAGGCTTGTT 1620
GATACGTAGC AAAGCGTTCT TCTGGTTTTC CATGCAATGA CAAACCATAT TTAAAGGTAT 1680
ATTCTCCTTG GCATTGTGCT TCTGGTGTCG CAAAATAGCC CCAATCACCT AGTTCGCCAA 1740
CACAACGAAT TAATGTGACC GCAATGGTTC CATCTGCTAA GACTTCATAC TCATTTAAGC 1800
CTTCATTGAA GATAGTCACG CCTTTGTTTT GGTCATGAAC ATTCACAAAC GCTTGTTGGT 1860
GCTGCGGGTT TGTTGGATTT TCCCATGTGT CGCTTACTTG ATTTGGTCGT GTCACGACTT 1920
CATAAATACT ATCCGCTTCG TGTGTTTCTG TGACCATGCC AGTTGGGAAA AGCACGCGTA 1980
AACGATGATC TTTGATTTGA TTATTAATGG TTGTTTCAAA GAACAATTGG TTACTTTCTT 2040
kGATCATTGT AACCATTGTC GTTATTTCGA ACGGGCGAAg CGTCGTTGAA CGTTGTGCAA 2100
GGCGTTCTCT AAACTCACGA ACAGCCTTTT GTTCTTCTTC TAATCGTTCA TCTGCCGCAA 2160
CTGGTAACAT GACGGTTTGT TTAATTTGAT AACTTGCTTT TACAGGGGAA TTTTCTTTAT 2220
TCGTGATTTC AGCCGTTACA TTTTCGGTTG TAATTCCTTG GTCTTCGGTC GGTTTGAAAA 2280
AGATATATTC ATTCCCAATG TCGCCAGTAT CTTCAAATGT TAACAATTTT GAAAATGTTT 2340
CTCCTGTTTT TTTGTCAGCA ATGGTTAAGC TACCATTGTT TTCAATTTTC ACTTGAATAA 2400
ATTCGTTTTC CATTTCATTT GTTGCTGGTT GAGCCAAAAG ACTTCCTTCA AAAGCTTTCG 2460
TTTCTCCTTG AATCAATGCA AAGCTATCCC AAGAGAATGG CGCCATTTCT TTGACAGATA 2520
ATTCGACCGT TAAGTATTTG GCCATGTACG GTTGACGGAA AGCAtCTTTA GGTAAATCGT 2580
AACCAAAACG AACATCTTCT TTGACAATAC GGCTTGCAAC GGCTTTTCCA GTTgGATCAA 2640
TCACTTGATA AtCTGGGGTG GCTTTTtGTT TTAATTCATG ATAAAGATCT TCAGGTTTcC 2700
TGTATAAAAT GGCAATCGTt CAATCTCCAC TTCCACAGTG ACAACCTCCG TTTTCTGATA 2760
GCCACTAGTA TTAAATAAAA CAAACGGGTG CGCTTTTTCT GGGAAAACAC TTGTATCAAT 2820
TGCTTCTGTT AATTGTCGTG TTGCTTCATC TGCTAAGAAA TGCCCGACAT CATTGGCATT 2880
TTCAAAGCGA GTCATCATGC CTCGGTGAAC CTCGTCGACA GAACAGCCAC AAATACTATC 2940
GTGCGGATGG TTTTGAAGCA ATGTTTTCCA AGCATAATCA AATTGGTCAT GTGGGTAGTC 3000
GCCAGTTACT TCGTAAGCCA TTGCTGCTAA AGGTTCTGCG ATATTTTCTA ATTGGCGTTG 3060
GACTTTTGTG TTCCACTGTT TTAAATAGAC GCGAGCAGAA GAAGTATTGG CTAATGTGTA 3120
CCAGCCATCC GTTTCTTGAC TCGTTAATTC ACCTGTTACT GTTCCTAAAT CTTCTGGTAA 3180
CTCCTCTTGA ACAGCTTTCA AATAGTCATC GAAATTGGAA TGAATAAATT CATATTCTGG 3240
GAATAGTTCA TTGGCTAAAG CGATGGCCTT AGTAATATCT CTTTGTACAG GTTGGTGGTC 3300
GACCCCATTC ATCATCAATA AATGGTTTGT GGAAGCGTAA CGTTCCACAT CAGCCAATTT 3360
TTGTTTCCAA AAAGCGATTG CCGCTTCTTT TTCTGAAGGA ATTTCGTTCC CATTACTATA 3420
CCAGTTAGCA AAAAGCAAGC CGAAAATTTT CGTTTGATCA GGACCTTCCC ACCACATTTC 3480
aGAATATTGT GAGCTATAGT CGCTTTCTAA GACTTGGTTA TCAAAGCCAA TCGGTTTCAC 3540
CCCGCGTCCA AAGGCAGCCG CAGGaGATTT GCTAATTGCA TCATTTGAGG CGTTTGACCC 3600
ATATTCCCAA ATGn 3614



(2) INFORMATION FOR SEQ ID NO: 186:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:



CTTACATTCC ACTaAAACgT CAAACCgTAT CCAAAGAAGC GATTTATATT GCCATTGGTA 60
TACGAGAAGA CGGCACTAAA GAAGTACTGA GTTATGCGAT TGCTCCAACT GAATCAACAT 120
ACGTTTGGAA TGAGCTGCTA CAGGATATTA ACTCCAGAGG AGTTCAAGAA GTCTTGCTTT 180
TTATTACGGA CGGCTTAAAA GGCATGAAAG ATACTATCCA TCAAATTTAT CCTAAAGCAA 240
AATATCAGCA TTGTTGTATC CATGTATCTC GTAATATCGC TCATAAAGTA CGTGTCAAAG 300
ACCGAAAAGA AATCTGTGAT GACTTTAAGG CTGTTTATCA AGCTAACTCA AAAGAAGAAG 360
CGAATACCTT CTTATCCGGC ATGATTGAGA AATGGAAGAA AAACTATCCT AAAGTGACGC 420
AGTCAcTCAT AGAAAACCAA GACTTATTAA CTTTTTATGA TTTTCCACCT AGCATTCGTA 480
GAACCATTTA CTCAACCAAT CTAATCGAGT CTTTCAATAA GCAAATTAAA AGATACAGCC 540
GTAGAAAAGA GCAGTTTCAA AATGAAGAAT CACTAGAACG CTTTCTAGTC AGCATTTTTG 600
ATACATACAA TCAAAAATTT CTAAACAGAA GCCATAAAGG TTTTCAACAG GTAACCGATA 660






CATTAGTTTC 


AATGTTTACT 


GAGT AAL 1 AA 


TTATt- TTGPA 
1 1 M 1 U 1 1 o^rt 


GGAGGAPAAT 


TTATTTACAC 


720 


AAAATTATTG 


ACGCTCCCAA 


t\ 7\ rp rn rp T Op rn 

GAATbi 1 1 n 


aaataatat,?* 

MMM 1 MM lttid 


AATAATAGTG 


TTTTTPAAAG 


780 


TTTTGTTAAT 


TTt tCAI i bb 


(jM 1 bbb AA 1 


TP A AG A A ATP 
1 VjMMVjMMM 1 ^> 


APGTGGTATA 


PTTTTAAGAA 


840 


GArACAATGA 


/^TV /*>/^T\ TV TV m TV 

CACGAAAGTA 


GCCA1 IbbAA 


a A APPAGATT 




AAAAAAAPAT 


900 


ACATCCTCTT 


ATGAAGTTGC 


CTAb I A 1 bAb 


bbboril 1 1 in 


ptggtgpaat 


PAAP,ATAPPG 


960 


GCTTTGCTAG 


CGGTTGTCAT 


m tv T\ t\ o rp tv rn /~i 71 

T AAAb I A I b A 


PAAPAAPAAA 


PAG A ATT APT 


AGGAAGAGAT 


1020 


GCGGCTTATG 


TCGCTCAATT 


TGGTLl Agbb 


1 bbb 1 1 U/\ 


PPAATTATGA 


AATAGAGATT 


1080 


CACCGCTTAC 


CAAAAGTTGG 


TGAAAAAb I A 


bbMMl iMbrtM 


PPP A AGPT AT 


GAGTTATAAC 


1140 


AAATATTTTT 


GTTATCGTAA 


CTTTTGGGTT 


C ATGA 1 bAAb 


aarrrzi a Ar* a 

MMbbbMMMoM 


ATPTGTPTTT 


1200 


GTGAAATCAA 


CCTTTGTCTT 


AATGGATCAA 


Tv TV TV TV. TV m /** /"* r** 7V 

AAAAATCbbA 


AAA I bAb i Ab 


TPT^TT APPA 


1260 


GAAATTATTG 


CACCTTTTGA 


TTCTGAAAAA 


fv m TV /"* fTI TV TV T\ TV 

ATCACTAAAA 


TTTATCbbbA 


•Pr'AAAAAATT 




GAAAAGGTCA 


CAGAGGGGAA 


CTTTTTACCT 


TATCGTGTGC 


GTTTTTTTGA 


1 rt i 1 brt 1 ubt 


± J O \J 


AATCAACACG 


TTAATAATGC 


GATTTACTTT 


AATTGGTTAT 


TbbAlblbl i 


Mbbb Iftl urtL 


14 40 


TTTTTAACAA 


CGCATCAACC 


GAAAAAAATT 


CTTGlbAAAI 


1 1 b A 1 MMMbM 


APT Af^ A AT AT 
MVj 1 Mo MM 1 M X 


1500 


GGCCAAGAAG 


TCGAAAGTCA 


TTACGAAATT 


GTCGAALAAb 


AAAAlbAbl I 


PA A A APAPGP 
oMMMMx^MVw VJV_ 


1560 


CATGAGATTC 


GGATTGATGG 


TCAAACCTAT 


TGTGAAbbbA 


A 1 A i bbA lib 


P APPA APT A A 


1620 

i 


GTTGAGATGA 


CGGTTTAGAA 


AAATAAAAAG 


TTAbbbAbAb 


bAbbArtbbMb 


ATGATTPGPT 


1680 


TAGGATTAAC 


ATCTTTTAGT 


GAACACGACT 


tv mm m T\ 7\/Tn/'">/~> 

ATTTAAL 1 bb 


AAAAAAAbbb 


TPP APGPTTT 
1 LLML»o\^ ill 


1740 


ATGAATATGC 


CAGTCATTTA 


CCGCTAGTTG 


Tv TV Tv rn f T\ ^ TV 

AAATGbAbAb 


bbbblAl l/\b 


PPPATTPPTP 


1800 


CCAAAGAACG 


AGTAGCTGAA 


TGGGTCAAAG 


CAGTlbbbbA 


MALI 1 ibbU 


TTTGTPATGA 


1860 


AAGTTTATAG 


TGGAATCAGT 


TGTCAAGb lb 


Ab i outnurtt 


PTATT ATf^PA 


AGTGAAGAAG 


1920 


AAATGATCAC 


CGCCTTTTTA 


bAAAb 1 Al bb 


CCCC ATT A AT 


TGAAAGTAAA 


AAATTATTTG 


1980 


CCTTTCTGGT ACAATTTTCa GGAACATTTG GtTGTACrAA AGAAAATGTC GCTTACTgCA 2040
AAAAATTCGT CACTGGTTAA AGACTACCAT CGCCATTGAA TACAAATAAT CCAGGTATCA 2100
ACCAAACTTG TAAACAATTT GCAATTATAA AAAATCATTC ATGTATGTGA TAACGAAATC 2160
TCATCAnCCT TACTATTACA TCTACCnGTC CCGTCnnGA 2199


(2) INFORMATION FOR SEQ ID NO: 187: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 
TnCTCAGGGT TTTTCTTCCC TTCTTGTGAA TGCGGGATTG TACCAATTGT GCATCAATTT 60

GTAAAAAAAG ATGTACCAAC ACATACCGCT TTTGCTTTTA TGTTAACGGC ACCAATCATT 120

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AATCCAATTG TGCTATTCTC AACATACGTA GCTTTTAGTG GCGCAACAAA ATTTGTGATT 180 

TGGCGAGTTT TAGGTAGTAT GATTGTCGCG TGGGTTATCG GCATTTGGTT AGCTTATTTT 240

CGAAAAGAAT CAATTTTAAC AGAAAAAGAG TTGGCGCAGA TTAAAGAAGA GCAGGTTGCT 300 

CATACGCATG AACACGTTGA TGAAGTTCGT TCATTTTGGA AAAACGGCTG GTGAGCGTTA 360 

ACCCATAGTA TTGATGAATT TTTTGATACA GGTCGCTATT tAATTTTTGG TtCTTTGTTG 420

GCAGCAATCA TGCAAACGTA TATCCCaACT GGTGCACTtA TGCAATTAGG CCATTCAAAG 480 

ATTTtAGCGA TTCTAGTCAT GCTAGTCATC GCTGCCACTC TATCACTTTG TTCAGAAGCG 540 

GATGCATTTA TTGGTTCTTC TCTTTTGAGT TTATTTGGCA CAGGACCAAT TGTCGCCTTC 600 

TTAGTTTTTG GTCCAATGGT AGATATTAAA AATTTATTGA TGATGAAACG TTATnTTAAG 660

ATGTCATCCA TTGTACAATT TGTTTTGATA GTGGCGCTkG TCGTAmCwAT TTATGCGGCA 720 

GTAGTTTAGA AAGGAGGCCT ACAATCGATG ATACGATTTA TTATTTTAAT TGGATATATG 780 

GGGTTGATGA TGTATCTTCA AATTTCAGGT GAACTGAACC AATATATCAA TATTCATTAT 840 

AATTACTTAG CTGTTTTATC AATGGTTTTA GCTTTTATTA TGGCGATTGT CCAATTGATT 900 

TTGTGGAACC AAGCAGATCC ATCAATACAA AAAAAACATG AAGACATGCA CCAACATGGG 960 

GGACATGAAC ATCATGGCCA TCAGCATAAT TTGGAAAAAC CCAGCCAAAG AGGGCTTGCC 1020 

TATTTACTAC TTTCTTTACC GTTGATTGTT GGCTTATTAT TTCCGACAGT TAGTTTAGAC 1080

ACAACAATTG TTGAAGCAAA AGGATTTAAC TTTCCAGTAA GTAAAGAATC AGTAGGGGAT 1140

CCTGATATGC AAACGCAATA TTTGAAACCT GATACAAGTA TGTATTTCAA TAAGACGGAT 1200 

TATGACAAGC AGATGGCAAA AGCCATGAAA CAATATGATG GCCAATCTGT GATTTCAATT 1260 

ACGGATGAAA ACTATTTGGA AATTATGGgA ACTAATCTAT AATTATCCAr GTCAATTTGC 1320 

aGGGAAAAGG aTTTCTTATA AAGGCTTTGT CTATAATTCC AAAACGAGAA GAAAGTGTTG 1380

ACCAGTTkGT CtTCCGTTTT GGGaTTATyC ATkGTGtCGC TGATTCtGGt GtTTTTGGCy 1440

TGttGGtTCA TTTyCCTGaA CATACTCAAT TTCAAAACAm TGATTGGGTA ACGATTACCG 1500 

GCACGtTGAG TTGTCCTATT ATCCACCGTT TAAACGACAA ATACCAACTG TGCAAGTGGA 1560 

GAAAGTCAAA GCAGATcAAG CACCAAAAAA TCAATATGTC TATCGTTCCT TTTAAAAAAA 1620 

GTTGGTTGCT TCCTTATCAA ATGGAAGCAA CCAACTTTTT TGCTGTCTAT TTTTTAATTT 1680 

TGTTGCGTTT ACGACGAATA AAACCCAAAG ATTCTCTTTT TTCTCGAAAG CTGCGGATAT 1740

TCTGATTGAA AACCTCAGAA GCGACTAAGC CTAACGCAAT TGCCCCAGCA ATCATTACAG 1800 

CTTGcACGGT AAAATAGGCA CCTTTTTCAT AATTGCCAAT TACGATATTC CGAACAGCTT 1860 

GATAAGCTAA GCCACCAGGT ACCAATGGAA CAATACCAGG rATATTAAAA ATCGTGACAG 1920 

GcATTTTCAA TATTTTTGAG AATAGGTAGC TTACTGCCGC AACACCGACA GCACCGATGA 1980 

GGGAACCTAA TGCTGCATTT CCACTAAAAA CTTCAACGAT TACCCAATAA AGCATCCAGC 2040

TCACACTACC AGTTAGACCA CAAGCCACCA GTGATCGTCG AGGAACATTA ATAATAATCG 2100 
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CAAACGCCGC AGAAGCTAAA AAACTAAAGA TAAATTGTAC CAAAACGATT CCTAGCAAGT 2160 

AAGTCACTCC TTTTAATAGA AAAGTTGAAA GACAAAAGCA ATGCCAAAAC CAATCATTCC 2220 

TGAAGTCATT AATGCTTCAG TCCCACGAGA AACACCCGAT AAGTAGTGTC CAGCTAGCAA 2280 

ATCGCGAAGA GCATTGGTAA TTTGCACACC AGGCACTAAA GGCATGACAC AACCAATGAT 2340

GATTAAATCT TGTGATTGTC CGAGTCCCAA TTTGACTGAA AGAATAGCGG CTGCACCAAT 2400

GAACAACGAA GCTAAAAATT CAGATAAAAA TTTAATCTTG AGAAACTTTA AACTrAAATA 2460

ATAAAGACTA TAGCCGCAAC CACCAATCAG AGAAGTCAAG AATAAATCGG ACCAGACACC 2520 

ACCAAAAAGA ATCATAATTG TAGCGCTGAC AACGGCCGCA CTAACAATGC GTGCGCTAAT 2580 

TGGAAAATGT CGGT 2594

(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

AATCCTTTCG CAAGGAGGAC TTACTCAGCA AAATCCGTTC CAATCGAATG TACCAAGGCT 60

TTAACGTTTC AGCAGCCAAT AAAAATGAAG GATGGGGATC AGACCGATGA CAAGAAAAGC 120 

CAAAACGGCA GAAGAGATTA AACATATTGC AGAAGAGGAA AACGTACGTT TTTTACGACT 180 

AATGTTTACC GACATTATGG GAACAATTAA AAATGTTGAA GTTCCTGTGA GTCAGCTAGA 240

TAAAGTATTA GATAATAAAA TGATGTTTGA TGGGTCATCC ATTGAAGGAT TCGTTCGTAT 300 

CGAAGAAAGT GACATGTATT TATACCCTGA CCTATCTACA TGGATGATTT TTCCGTGGGa 360 

AAGCGACCAT GGAAAAGTtG CGCGCTTAAT TTGTGATATT TACAATCCTG ATGGTACCCC 420 

TTTTgCCGGA GATCCCCGCG GTAATTTAAA ACGCGCCTTA GCAGATATGA AAGAATTAGG 480 

CTTTACTTCT TTTAACTTAG GACCAGAACC AGAATTTTTC CTATTTAAAT TAGATGAGAA 540

TGGTGAAATT ACCACTGACT TGAATGATAA AGGTGGCTAT TTTGATTTTG CACCAACCGA 600 

TTTAGGTGAA AATTGTCGCC GTGATATTGT TTTAGAGTTA GAAAGTTTAG GCTTTGAAGT 660 

GGAAGCTTCA CATCATGAAG TAGCACCTGG TCAACATGAA ATTGACTTTA AATATGCTGA 720 

TGTAGTTGAT GCTTGTGACA ATATTCAAAC GTTCAAATTA GTTGTGAAAA CTATTGCGCG 780 

TAAACATGGC TTGCATGCAA CATTTATGCC AAAACCTTTG TTTGGAATTA ATGGTTCTGG 840

TATGCACTGC AATATGTCAT TATTTAATGA AGAAGGTAAT GTTTTTTATG ACGAGTCAGG 900

AGAGATGGGc TTAAGCCAAA CTGCGTATCA TTTCTTAGGT GGTTTATTAA AACATGCCCG 960

TGCCTATACT GCGGTATGTA ACCCAACAGT GAACTCTTAT AAACGCTTGG TACCAGGTTA 1020 

TGAAGCACCT GTTTATGTAG CTTGGAGCGG CAGAAATCGT TCACCACTCa TTCGAGTTCC 1080 
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TGAATCTCGT 


GGJcTTATCGA 


CTCGTTTAGA . 


ATTACGTTCT ■ 


GTCGATCCTT CAGCGAATCC 


1140 


GTATTTAACA ATGGCTGTCT TATTACAAGC AGGTTTGGAT GGTATTCGTA ATGAACTTAC 1200

ACCACCGCCA GCAGTTGATC GTAACATTTA TGTGATGAAT GAGGAAGAAC GTCAACATGC 1260

ACAAATTGaA GATTTGcCAT CAACCTTACA CAATGcCATC AAAGAATTAC GTAAAGATCA 1320

AGTAATGATT GATGCATTAG GTCGTCATAT TTTTGCCAAT TTTGTAGAAG CGAAACGAAT 1380

GGAATGGGCA GCCTTCCGTC AAACCGTTTC TGAATGGGAA AGAGAACAAT ATTTGGAGTT 1440

GTACTAAAAG CCGTTACTTT TAATAATCTT AATGTTTAGA AAGAGGAAGA TAAGTCGAAA 1500

CGCTTATCTT CCTCTTTCGT TCTTTAGTTT GTGAATGAAT AGATAATTTC TATTTCACAA 1560

AATAAAGTAA TTTTAATTCA GTGAAATAGT TTGTTTATAG GGGAAAAAGG GGGTTTTTCT 1620

CAATTTCTCT TTTATTTTTA AGCAAAAAAA TTGACAAAAT CTTTCAAAAT TTGTGAAATT 1680

CCGCTATAAT AAACAAGaAT ATGCGTGCAA CTATTCTATT TTTTAGATAT AAAGGATGAG 1740

AAAAACATGC GGAAAATGAA AACAATGGAT GGGAATGCAG CAGCaGCTTA TATTTCCtAT 1800

GCTTTTAcAG AATTAGCAGC CaTTTATCCA ATAAcACCAA GTTCAACGAT GGCTGAACTT 1860

GTTGATCAGT GGTCGGCCGA AGtAAGAAAA ATATCTTTGG TCAACCTGTG AAAGTAGTCG 1920

AAATGCAATC GGAAGCTGGT GCTGCCGGCG TGGTCCATGG CTCTTTAAAA AcAGGCGCGC 1980

TTACCACGAC CTATACGGCT TCACAAGGGT TACTTTTAAT GATTCCTAAT ATGTACAAAA 2040


TAGCTGGAGA ACTATTACCA TCTGTTTTTC ATGTGGCGAG CCGAGCTTTA ACTACGAATG 2100

CTTTAAATAT TTTTGGTGAT CAAGGAGACG TAATGGCCGC ACGCCAAACA GGCTTTGCTA 2160

TGTTGTCAGA AAGCAGTGTG CAAGAAGTGA TGGATTTGGC ACCAGTGGCT CACTTGGCTT 2220

CGATTGAAGC GAGTGTTCCA TTTATGAACT TTTTTGATGG CTTTAGAACA AGTCATGAAA 2280

TTCAAAAAGT GGCAGTTCTT GACTATGAAG AATTAGCACC CTTAGTAAAT CAAGAGAAAT 2340

TAGCCGAATT CCATCGTCGG AGCATGAATC CGAATCACCC TTCCGTTAGT GGAATGAACC 2400

AAAATCCAGA TATTCACTTT CAACAACGAG AAACAATAAA TCCTTATTAT GAAAAATTAC 2460

CTGGTATTGT TCAAAAATAT ATGACAGAAA TTAACCGGTT ACGGGGAACC AATTATGATT 2520

TAGTGACGTA TTATGGCGCA GAAGATGCAG AAGAAGTGAT CGTTACAATG GGTTCAGTTG 2580

CGCAAACAAT TGAACAGACT GTTGATTATT TACAACAACA AGGAAGAAAA GTTGGCTTTC 2640

TTAATGTCCA TCTCTATCGA CCATTTCCAA TTGAAACATT TCTCGAAAAA GTACCTCAAT 2700

CAGTTAAAGC AATTGCTGTC TTGGATCGGA CGAAAGAGCC CGGTGCTGGT GGTGAACCGT 2760

TATTATTGGA TGTCCAAAGT GCGATGTATG AAGCGGACAT TCGCCCAACA ATTATTGGTG 2820

GGCGCTATGG GTTAGGTTCT AAAGATGTAC TGCCAAATCA AATTGTTGCG GTTTTTGATG 2880

AATTAATGAA AGAACGTTCT GCCATGAAGA AGCGCTTTAC AATTGGGATT GATGATGATT 2940

TAACTTATAC TTCATTAGAA GTTGGCAAAC CACTTGATTT GACGAATCCG AAAACCTATC 3000

AAGCTAAATT TTGGGGCTTT GGTTCTGATG GGACAGTAGG GGCGAACAAG TCAGCGATTA 3060
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AGATTATCGG 


TGATCATACA 


GACAAGTATG 


CTCAAGGCTT 


TTTCTATTAT 


GATTCAAAAA 


3120 


AATCAGGCGG GTTGACCGTT TCTCATTTAC GTTTTGGCGA GACACCCATT CGTTCAACGT 3180

ACCTCATTGA ACATTCTGAT TTTGTGGCCT GTCATACAGC CGCTTATTTG CATACCTATG 3240

ATTTAGTAAA AGGGTTAAAA AAGGGCGGTA CTTTCCTCTT AAATACCATT TGGAATGATG 3300

AACAACTAGC ACGATTTTTA CCAAATCAAT TAAAACGGTA TCTAGCTGAA AATGAGATTC 3360

AGTTTTATAC AATCAATGCA GTGAAATTAG CTAGTGAAGT CGGACTAGGT GGGCGAATTA 3420

ATACCGCGAT GGAAACAGCT TTCTTCAAAT TAGCGGAAAT TATGCCTTTT GAGCAAGTGT 3480

TGCCAATTTT AAAAGAAGAA GCACTGAAAA GTTATGGTCA TAAGTCCATG AAAGTTGTCG 3540

AAAAAAATAT TCAAGCAATC GATAAGACCG TTGAATTGTT GCATCAGGTA CCTGTTCCAG 3600

CAGAGTGGAG AACGCTTGAA GTTCAACCGC GAAAACGTTC AGAAAATGTT AGTGATTTTG 3660

TTCATGAAAT TGTTGAACCG ATTAATCGTC AAGAAGGGAA TGCGTTATCT GTTGCAACAC 3720

TAGCGAAAAA TGGGATGACA GATGGGCGGA TGCCTTTGGG AACTGCGGCA GTTGAAAAAC 3780

GTGGAGTTGC ATTAGAAGTA CCTGAGTGGA TTAGCGATCG ATGTACTATG TGTAATGAGT 3840

GTGCCTTTGT ATGTCCTCAT GCGGCGATAC GACCATTCTT AGCCGATGAA GAAGAAATGA 3900

CTGAAGCGCC AGAAGGATTC ATTGTGCGTG ATTTACGTGG TGCCGACGGC TTGAAGTATC 3960

GTATCCAAGT CTCAGTCAAA GATTGTACTG GCTGTGGTCT TTGTGTTGAA GCCTGTCCAG 4020

CTAAAGGGAA AGCTTTAGTA ATGAAACCTT ATGAAGAAGA AAAAGAGCAG GCTATGAACT 4080

GGGCGTTTGC TATGACGTTA CGTCAAAAGG AAAACCCAGC AAAACCGAAT ACTGTTTTAG 4140

TTCGCAATTT AATAAACCTT ATTAGAGTTT TCTGGT 4176



(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

CTATATGATA TATCATTTAA AACAAAGGAG CnACTTTCAT GAATATTCAA GCGATTGATA 60 

CGCGACACGG AACTGCCAAC CAACATAGTT TTTCAAATGG AAATTGTCTA CCCTATACTG 120 

GCGTGCCATT TGGGATGAAC TTTTACGCCC CGCAAACAAC TGACCAAAAA GGCAGCTGGT 180 

GGTTTCATCC AGAAGATCGA ACATTTCAAG GCTATCGAGT AACGCATCAA CCAAGTCCTT 240 

GGATGGGTGA TTTTAGTCAT TTATTAATGA CCCCTGTCAG TGGTTCATTG TCGGAACTTT 300 

CTTTATTCCA TGCCCAAAGT TCTTATCGTC CAGAAGAAAG CCTTTTTTCT CCTGTCGAAA 360 

TTAACTTAAC TCAATTACGC TATCAAATAA CTTCTCAATT AATCCCAAGC ATGTATGGTG 420

GTATCTTAAC CATTGATTAT CAACAAAAAG ATAATCACTT ACTACTAACT CTGCCTGGAC 480

GTTATCAAGT AAAACAATTA GATGACCATC AAGTGGCTGT AAAAGTCATC AACTATTCTG 540
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GTTGTGAAGA 


TCCAGATTTT 


AGTTTTTATT 


TTGTGCTTCA 


TTTTGAACAG 


CCTTTAACCA 


600 


AATGGTTTGC CCCTTCTTCA GGTGAAGATG GCAAAATCCT TCTTTCTTTT GGAAATATTG 660

CTCAACAAGT TGTTCATTTT AGTAGTTCAT TTATTTCTGA AAAACAAGCG CAGTTAAATT 720

TAGCGCGAGA AATCTCTTTG CGTTCTACTG AGATGCTGCA ACAAGGAATC GCAGATTGGC 780

ATAATTATTT CGATCGCCTT AAAGTAACGC ATGAAAACCC TGAACATACT AAAACGTTTT 840

ATCATACACT GTATCGAACA TTCCTATTTC CGCAAACCTT TTATGAATTG GACGAAAATC 900

AGCAGCCGAT TCATTATGAT ACGTTCTCGC AAACGGTTCG TCCTGGTGTA CTATATACCA 960

ATAATGGCTT TTGGGATACT TATAAAACGG TCTACCCATT GTTCTCACTA ATTGCTCAAG 1020

AAAAATACGA AGAGATGCTA GAAGGCTTTT TAAATAGTTA CAACGAAACA GGTTTCTTGC 1080

CAAAATGGTT ATCTCCAGAT GAGCGTGGTT TAATGCCCGG CACCTTGATT GACGCTGTCA 1140

TTGCTGATGC CGCCGTGAAA AAGATTCGTC CTGACTTGAT GCCTCAGTTC CTAGAAGCGA 1200

TGAAAAAAGG CGCTACACAA CAAAGCGAAC GAGAAAATTA TGGACGTCAA GGAACTCTGG 1260

ATTATCTAAA ATATGGCTAT GTTCCTTCAA CCTATCATGA ATCTGTCAAT CATACCTTAG 1320


ATTATGCGTA CAGCGATTTC TGTATTtCGC AAGTTGCAAA AACATTGAaT GaTTCAGAGA 1380

CCGCAACGTT TTATCGTCAA CAAGCCTTGA ATTATCAACA ACTTTTTAAT CCTGAAACTG 1440

GCTTTATGCA AGCAAaGGAT ACTGAAGGCA ATTTCCGTCC TGACTTTTTA GATATTCGTT 1500

GGGGCAAAGA TTACGCCGAA GGAAGTGCTT GGCAATCAAG TTTTGCGGTT TATCAAGATT 1560

TTGCAGGTTT GATTAAGCTA TATGGTAGCG AGTTAGCTTT CGAAAAAAAA TTAATTCAGC 1620

TATGCAATCA AGCGCCTAAC TTTAACGTTG AAGGCTATGG TTTTGAAATT CATGAAATGA 1680

GTGAAATGGC GGCAATTGAT TTTGGTCAAT TAGCCATCTC AAATCAACCA AGTTTCCATT 1740

ATCCGTTTCT ATTCAGCTAC ATCGGCAAAC CAGAGATGGC TCAACCACTT TTAAAACAAC 1800

TAATGCAAAC ATTTGATGCT TCACCAACTG GCTATCCAGG TGATGAAGAC AATGGCAGCA 1860

TGTCTGCTTG GTATATTTTT AATAGCTTAG GCTTTTATCC CGTCACACCA GGAACTGGTG 1920

AATATGTAAT CGGGATGCCT TTAGTCCAGA CGGCGGAAGT AAAACTATCA AATGGTAAAC 1980

AGTTAACCAT TCAGACATCA CCAAACAAAG TCCAACAACA ATTTATTCAT GAAATTCAGT 2040

TAAACCAAGA AAAACATACT GCGCCTTATT TTACTCATCA AGAACTATTA AATGGCGGGA 2100

CTTTAGATTA TCAATTAGGG ATCGTCCCGA ATCCACAAAA CACAGCAGAA CGACCGTTTT 2160




nr;AA a a at a a 


T A A A AA G AAA 


GATGTTCATA 


AAAATTGAAC 


ATCTTTCTTT 


2220 


TTATTCTTCG ACACCTGCGT AATTTGATTG GTATTGCTCT CGAAATTCTT TTGGCGTAAT 2280

TCCATTTAAT TTTTTAAACA TTTTTGAAAA ATAGTTGGTA TTGTTAAATC CTACTTTTTC 2340

TGAAATTTCG TTGATTGTAT CATCCGTGTA CAATAATAAT TTCTGGGCAT GCTTGGTACG 2400

CACTTGaTTC AAATATTGAG aAAAACTCCG CTTGGTTTCT TTTTTAAAAA TTTGGCCTAA 2460

ATAACTAGGG TTCAAATGAA GCTCATCCGC CACCGTTTTT AAGTtTAATT CTTCTAAATA 2520
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GCCTTGTTCA ATTCTTTCTA CAGCTTGGCT AACACTTTCT GAATATTTTT TTATTyTAGG 2580 

TTTATTTTTC ACTTCTGCCA AGACTTGCTG CATTAACGCA TACAGTGTAT TAAGATTGTT 2640

GCTGGCTCGA ATTTCTTCCA CGATTTTTTC ATAGTTTTCT TCTAACACAG TTGGATACTG 2700 

ACGGGAAATA TCTGAAAAAA GTAAGAAGAC TACATAGCGT ACATATTCTG GCGTAGCGTG 2760

TTGTTGACTT AACTGATTGA AAATAGCATC TAATTCCCGT TGAATGGTCT GCATATCCCC 2820 

AATCATCAAG GCTTGGTTAA ACGCTAGGAA AGTAAAATCT TCTGCCGAAC GATTCaTTTG 2880 

TTCTTTTTGA TTATTTGGCA GAAATTCTTG ATAAAACGTT TCAATTGCTT GCAACTGTTT 2940

TACTTTCTCA TAACTTTCAT AAACCGTTTC CCATTCATTG ACAGATTCCC CTACAAAAAT 3000 

CTGACCTTGA CCAGCTAGAA TCGCTTCAAT TTCACGAATA AATAAGAACG CTTCTTGTCT 3060 

GGCCCCTTTA TAGATAAGTA ACAATTGGTG GTACGTACTC ATCCAACTTT CAATGAATAA 3120 

CAGTTGGCCT TTTTTCATAA ATAGTTCCTT GATTGTTTCT AAAAATGACA TCTCAGCTTC 3180 

AACCAATAAA ACGGTAAAAT CACCTGGTTT TATAGTCGCA AATTGCTGCA TTAACTCTTC 3240

GTATTCCCCT TCGCTGATTT CATCATTAAG CCAACGCATC AAGCCATTTT CTAGATATAG 3300

ACGCACGCGG GCATGGGATT TTTTTTGTTG GTCCAATTCT ATTTTAATTC TCTGGATATT 3360 

GGCTAATAGT TCTTGTTTAT TGACTGGTTT CAAAAGATAA TCTTGGACCT GCAAAGCCAT 3420

GCCTTTTTTG ACGTAATCAA ACTCTTGATA ACcTGATAAA ATTAAACTAA TAAATTGAwG 3480 

TCCtTTTkTC TTtGCTtGCT CaATTAATTC gATCCCGCTC ATTcTGGa 3528 
(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

TGGATCCTAC nAAAGGCCTT TACnTACCCG nTTTCCCAAA TTCCCTTAAC CCATATCnTG 60 

ATAGGCTCTA GAGGAAGTAT ATCCCCCAGC CGAGAAAaGt AGtGTTAGTA GAAAGATTCT 120

CGCTAATATA TTTTAAAATT TGATATACAG AACTTACCTC TTGTTCTGGT GTCTTTCCAG 180 

TAGTTAAAAT AAGTCTGCCA TAGCTTTCAG ATACTTGCTT AATTTCATCT TCTAGATTTG 240 

GATATTCTAC AAAAACATCT TGAATATACT CATAAATAAC AGGAGTGCTA TCTATATATT 300 

CAACAAACAT TTTAATGATA GAATTAACTT CACTGTAATG AGCATTAAGA ACTCTAGAAG 360 

CTAATTGTCT AAATTTCCTA CTTGCCTTTT TTAAATCTTT TTTATTGATA TTAATCTCCA 420

TCTTGCATCT CCCCACAGAA CGGACAATAT CCGCCTTCCA TTTTTAAGCT TGGACTAATT 480

TTTGCTTCTT TATGACAGCA GTCATAAATT TGAAGTTCCA AGTTATCTAG TTTTGAAATA 540

ATCGGATCAA CAGGATCTTT CTTAATTTTT GAACCTGATT TGAATTTTAC GCTTCCATTC 600 
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TTTGATGATT TACTCAAATT CTTACCAAAA TCATTAAGTA AATCAGATAC ATGATTATTT 660 

ACAATTTTCA TTGCTAATTC TATTACTTCA TCAGTAAACA AACTATCTGG AGTTAGACCA 720

CACTTAGGAC ACCAAACATG CAATTGAGAT TCATCTTCCA AGTCCTTAGG CGTTAGCTTA 780 

AAATACTCAC CACACAAAGT GCATTGTAAG AAAATAAATC CATCTGAATC TGATGGGATT 840 

GAAATATTAA AAGTAGTGTC AGCTGTCATA GATACATCTC CTTTTGGGAA TACTTAATTT 900 

ACCTAATTAC TATAGTAATT TTATCACAAA AAGAAGGCGA AAATCTTTTT TATGTTATAT 960 

TTCTTTAATA TAAATTAGAT ATTTATATTA TTTTGCACAA TGGTTTTGGT AGATTCCAAA 1020 

AAATCATACC CCTTCGGAAC TAAAGGCGTG TAAAACTCAC TAACAACACC CCCACCTGTA 1080 

TCTACATAGC CTCGCCCCTT GaTTTGTTTC ATAAAAAAAA TTTTATCAAC CTCACCAAAC 1140 

ATCATAGAAT ACCCCAGTTC ACTCATACGA CCCAAAGCCA CACGGAAGTT GAATTGATCC 1200 

CGTATTCCGT CTCCTAAATA TTTCGCATCC GGTCTTTGAC AAGCCAGAAT CAGAAAGAAA 1260 

CCGGACTGCC GACCTAACAT GACGATTTGT TTGAGCTTAT TCAAAATTAC CGCGCTTTCC 1320 

TTAGTCGTTA ACATCTCCAT ATAAGCCACA TATTCATCAA AAACTAAAAA ATTTGGTGAA 1380 

AGACCAAGAT ACGCATAATT CTCTCCTGTT TTGTAGTTTG GCATTTCTTT CATCAACCTA 1440 

CTACGAaCCA TCATACGTTC ATAGAAATCT TCTACACAAG CGGAAATTTG TTCTTTTTGG 1500 

GaATATACGT GAGTCATTAT TGTGCCTAAA TCTGCTAGAT CCGCATTTTT GGGaTCAAGG 1560 

ATAAATAGTT CTGCATCAGA TTTCAGTAAA GCCTCGATGA TGGTGAGTAA AAAGTAAGTC 1620 

TTCCCACCAC CTGTACCACC TGCGATTAAC ATATGAGGCA AGAAATCATA CGCCCATACT 1680 

TGATTTTTCA TCAATCGCAA AGCTCCATTT TCTACCACTG CTTCTTCTAT CCCAATTCGA 1740 

TTGGCAATCA TATCGTAAAG CAAGGTATAA TCCACATAGG AGTCTTTGAG TTCCTTTTCT 1800 

ACTAACTCAC AATACAGTCC GCTTTCTAAC TTCTTTTCTA GCTTTAATAG TTGATCTTGa 1860

TATTTTCCCA GTGAAATTgA ACGCGAATAG ArAGTAAGCC ATCCTTCATc GATAATAAAT 1920 

TTgGGGrAGT AACTGATACT TTCTTTGGTC CGACTACCAT cAAATCTTTG AAAAAcCTTC 1980 

ACTTTGAGTT TGTTTCACTT CATACCAATG GTTTTCTAGA ATCATCCGAG CTAGCTTTTG 2040 

TCGATGCCAT AAACTGCGCC AATGATCCTG AAAAAAGTGA ATATAAAGCA ATATCATACC 2100

TCCTGAAAGA GCAAAACTTC CTAGTAAACA GATTAAATCT TTCGTGGACC AATCAATTTC 2160 

ACTAGTGGAG AATGATAGCT TAGAAATCTC AAATTGTCTA AGCTCTTGAA AATAAAAGAC 2220 

GCTCATACAA CCAATGAACA AAGCAAGTAG CCAGCCAAGA ACAAAGTGAT AAATTAAATG 2280 

CTGATTTCTT GCGCGTATGC GATGTCCTTT GTACATCTGA TGGTTTCCAC CTCCAATCAA 2340 

AAATTGCCCA ATAGGATTTA TCGGGCAATT TCAAAAAGTT ATGCTAGTCA TTTACTACTA 2400 

CGGGAAGCTG TTTCGTTTCA ACCAATTGTT TTAAAAAGCT TTCAATGACA TCCGGCATCT 2460
CCACACCGAT TTCTTCTAAA ATAGACTCAG CTTTTCGACA TAAGTTTTCA TTTATTTCCA 2520 
CAATCATAAA GAAGGATTCC TCTTCCCCTA ACCAATCCAG AACATTGAAC TCTTCTTCAC 2580 
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TATCGGTATT TCCCCCAAGT AGCCCCTGAT GCATAATCAG AAACATTTTC TCATCATCGG 2640 

TCACCCCTTT AGTGAGTTGC TTTTCGGTTA GCTCTCCTTT TCCATAGCGA TCTAGACAAT 2700

CAAAAATATG TTCAACATCA ATCTCCATAC TCTTCACCTA TTTCTTTCCT TCATTTTGAT 2760

GGGTATTTCC TTGATTCGAT TGCCGTTTCA AGACTAAATC ATCCGCCTTA ATAAACCAAT 2820 

TGACGGTAGT TCCTTCTCGA AAAGCATAAT TTGCGACCGT GTCCACGATT GGATTGACCA 2880 

ACTCTACTTC TGTATCGTAG GCAAACTCTC GTAATGGCAC ATTTGCTGGA ATACTGACTT 2940

GAATCATTCG ACCTTGTGCT GAGGATTTCA AATTATAGGT TCGTTCTTTG ACCTCATTAC 3000 

TAACCGTTCC ATCCTCATTT TGTTTCCGAA CTTCTCGGCG TAATGCAGAA AAACGCAACT 3060 

TGCCAAACGT TAGTCCTGAA TCTACTACAA TACCTTCTGT CAATCTCATA TTTTTATCCT 3120 

CCTATGCTTT CACTAAATCA TCGACATACA ACACATAATC GGTAAAACCG CGGTTTTCAA 3180 

TCGCATACCC TTTGGCAACT AAATGGGGAT TCACAGcTTT CAGCGGTTGA TCTTGGTCAA 3240 

AaTCTTTTTC GCCGGCTTCC GCTGCTACCA CCACTTCAAT ATCATCTGCT CGTTGTTTAC 3300 

TGGAATATAG ATGATAGCTA CGACCAATCA CCACAGTATT tCGGCCATAT CCTTCTGTTA 3360 

AAATTtCTCC TTCACCAGCA TAACCTAACT TTCCAAAGGT TTCGGCCATA TCCGGCACGA 3420 

CAAATTTCAA TTCCATATTC TTTTCCTCTT TTCATTGTTA GTTTTGGATA CAAAAATAGC 3480

GCGACCACTT TTTTAGGAGG CGGTCGCACA AGTCGGTTCC TACTATTCAC TTGTTTCATC 3540 

TCATCACCTT AATTGCGGCG ATTCCAGAAA TAATACCCAG CAGTCGAAAA AATCAAGATA 3600 

AAGCCAACTA ACAGTAGAAC GTTGGAATTT TTCTCCCCAG TTTGCGGGAA TGTCTTCACT 3660 

GTGGGACTAG ATTCTTGAGA ATTACTTGGA ACAGCTGGTG TTTCCGGTTG TTTTGGCGTA 3720 

CTCGGTATGG TTGGCACTTC TTTTGGTGTT AAGGTTTGAG ATTTTTCTTT CAAATCTTCA 3780 

TTGTGTTTTC CATTCACGTT TCCATCTTTT TCGTAATTGA TTTCCGTAAA AGTAAACTTA 3840

GTTCCTTCTG GATACTTTCC GGTATCTACT TTTTCCGCAA GTACGGTTTT GGTAAATTCT 3900 

TTATCATTCA CTTCATGCTC AATTTTGCCA GATTTCCAAA TTTCTTTGTT CGTACCATCT 3960 

GGTAGTAAAG CATACAGAAT TGTTTCGAAA GCTTCTTTTG AGCCATCCAG TACATCATGG 4020 

GTAACCGACA CATCATCAAA CATATCCATC ACGTCACCAT GAGTAAAAGT TTGCGAACCA 4080

TCTTCTAGGT GGGCTTTTGT TTGGATGGAA ACATGGCGTT CAATCGTACA ATTGACGGTT 4140

TGTGCTTGAT TGTTCAAGCT AGCATCCTTC GCTACTGGCT CATTGCCTGC TTCAAAGGCT 4200 

ACCTTGTTTT CATACACATA ATTGAACAAG ACGATGCTTT TATCCAAAAC TTGTTCAGCG 4260

GTTAATTTAT GCAGAATTTT CCAAGTTCCA GTTTTCTCTT TGTTCGCCAA CGTTGCGGTT 4320

GTTTCGGCAA TCACCACCGG TTTGGCTTTT TCGTCTTTTT CTTGGGCAGC TTTTGTGGCT 4380

TCCACATCAA TGGCTTGAGC TACGACATAC CAGTCTTCTT TGATTTCATG CAAGTTATAT 4440

CTCAATTTAT CAACTAGCTC GGTAAAATCA AGGGTATTCA ATTTTTTATT TCCGTCTTTC 4500

CAAGTCGCAA GAGAAGTCAA ACTATCTTCT TTCTCGGGCA AATCATAAAG CATCAAACGG 4560
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TTCAGACTAA 


CAGAAAACTC 


GTTGTTAGTT 


AGTTTCTCGT 


AAGGAACGGT 


CACCATCTTA 


4620 


ATTGGTTGTT TTTGTCCTTC TTCGGTAATG GTAAAGACAT ACTCACTCTT CGCATAGTCG 4680

TCTTTGTTTT CCTTAAATGT AGAACGGATT TCTAGTGGTG TAATCTTTTG AAATCCTTCT 4740

GGAGCCTCTA TTTCTTCAAG TAAATAATCC CCATAAGGAA GATTTTCAAA CTTACCATAG 4800

CCATCAAAAC CTAATTGCTC GTTACAAGCT GTGGTCGCTT TATCTTCAGC ACCTGTGATT 4860

TCmTTGGTCC CTTCCaATGG CGACACTTTA AAAGATAAGT CGTTAAATCC aGTTTCGGCA 4920

GTGCCATCAG CCGATCCAGC AAATTTAAAG AAATCAAAGC CAAAGCGAAT AACTTGTTCT 4980

TTTGCCGTAA CATCTCGAGT AATTACGGCA TTTTTTTCGT TATTATCAAC TTTTTTGATG 5040

GATACAGGAT ACTtCGTTTC ATrCAAAGTA TATCCTTcAG GTGCTTTGGT TTCTTGCCAG 5100


AAATACTCGT 


TAATTGCTAG 


nTHTTTAACG 

U 1 u I 1 1 rtrt^VJ 


GCAACTTGGT 


TCTTTTCATC 


TAAAGCCAAA 


5 160 


GTCACTGTTT 


CATCAGAAGC 


TTT CCTTCCC 

111 \^t\J 1 i. Vv>V<^ 


TTCACTAATT 


CTGTTTTAAA 


AGCTTCACTC 


5220 


CATTTAACAG 


CTTGACCATC 


111 1 oLriu 1 r\ 


AAGAGAGTAT 


ATTCAGCTCC 


TTTAAACTCA 


5280 


GCTTTCCCTT 


GACTCTCATT 




TTGTCTTCTT 


TTGTCAAAGT 


GGTTTCCCCA 


^4 0 

J J *i \J 


GTAATTTCTT 


GGTTTTGCCC 


111 InOVJl i. n 


CTGGTAACAA 


GAGCCACGGT 


TTGATTGGCA 


JIUU 


TATTTTAACT 


CGACTTTTGT 


1 UvJ 111 V3rortV3 


GTATTCACGA AACCATTACT 


AGATTTAGTT 


^4 60 


TCTGTCACGT 


AATAGGTTCC 


PA^TTrrAAA 
v^rto i i ^v//inrv 


GCATTAGCAA 


GCTCTTTTGG 


TGTTTCCGCA 


ss?o 

1 


CGACCTTTTT 


CATCCGTTGT 


1 1 1 lk^.1 10V3 


ACAATTTCAC 


CAGCTGGGCT 


GTCTTTACGA 


5580 


ATGGCAAATG TATTTCCAGC TAGAGAATAA TTGTCATTCC AAAGATCaGT ACCTGTTTCT 5640

ACCCCAGTCT TCTCTAAAAG AATTTGACCT TTTTGTCGCA TATTTTTCGA AGTCATGGAA 5700

ATGGTCTCGC CCGCTTTAAT GGTGGCAGCC ATGGGTGTGG TATCAATCAT ATAAGGATCT 5760

GGCACCGATT TTTCAGTAAT AGTTACCTTT GTACCATGGG GAATTCCATC CAAAATAGAA 5820

ATCCCATCTT TATCTGTTGT CACATCTTTT GAAGGTAAAG CTTTCCCAAA ATCTAAATGG 5880

AAAACCGTTT CTGGTACAAT ATCACCTGAT TCTTTATCGA TTTTTTTGAT CTTTAAAGAA 5940

CCCTTAGTTT CCACATTAAT TTTAATAGCG TAGGTATTGG GCTTATCAAG CGCACCAGCC 6000

ATCACAGTTT GAAGTCCTGC TTTTTTATAA GCGACTGGAG TTCCAGTACC AGCTGATTTT 6060

TtCAATGTTA ATGTTCCTGA TTTGGAATTA GAGTTTGGAG TAAGCACTAA TTGATTCCCA 6120


ATTACACGGT 


AATCTATATT 


CGCCGTATTT 


TGGACGACTT 


TATCAAACTC 


AG A I AAA ill 


6180 


AATTCATTTT TATCTATTAA AGTTGTCGAT TGACCTAAAA TTGTTTTTAC AGTGGTATTA 6240

TGAAAACTTG GTTTTTTTTG ATACTCCTCA ATTGCCTTAT TAATTTTTCC TTCAATAGAT 6300

TTTATATCAA CTGAAGCACC ACCTAATCTT TTTATGGAAT GGAGTTTATA ACCGTTCACT 6360

TCTTCCCAAA TCATCTTTTG TGCAACCATA TTTGTATCAA TATCTGTTCC AGCCTTTTCC 6420

CAAAGAACCG ATACTAGTTT CGCTTTATCA GACATTGATG GCAAAGGGTT TTTCTGATAA 6480

CCGTGCGTCA CTTCTGTTGG AATACTAACA CCTGGTTCAA TACAAAATAC TGTTTGTTTT 6540
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GCACCATTGT ATTCTGCATA TAATGACGTA GTACGTTCTG AGAGCGTGCT GCCATCGCTA 6600 

AAAGTTCCGT CAACTTCATA CAGTTTACTT ACGTCATAAT GAATTGTTAC CTCTTGAGGG 6660 

TGAGTAATTT CATCGGCAAA AGCAATCGTA GGACCAATCG TTCCGCCAAT TGTTTGACTT 6720 

AATAGAGCCA AAGTCGCAAA TAACGACCAA TTTTTAAATG TTGTTTTTTT CATATAATTA 6780 

ATCCCCTTTC GTTTTTGAGT AATAAAAAAG AGACATTCAC CTCTGaATGT CTCAAATGTT 6840

TAAAACTATT TTTTTCAAAG ATTATCAGAT ACCATATATC AAAAACTCCA AAGAGACTTT 6900

TCCTATATGG GCATACTGAA TAATGCGAGC CGTATTCGGA ATATCTTTTC CGtCTTCGTA 6960 

TTTATCCaTC TTACATTTAT TCATAAAGCC CGCCTTTCTT TTTGAACATA AAAAAAGACC 7020 

CTCAATTTTT GACGGTCTCA TCGTAAAAAT TTACGTTATT CATTCACCCA AGCTTCCACT 7080 

AAATCCaCAG TATCACTTTG TGGACGTAAA GTTTTATCAC TTATAAAATC AACATTCACA 7140

ATCATTGGTA AGGACAAAGC TGTCCCAGTG ATAATACAGA TACCTTGTGA TAAACCTGGA 7200 

AGCATTGATT TAGAAACTCT ATCTAGAGTT GATAATGAAT TATCAATTAT TTGAAGATCT 7260 

CGCTCATTCA CTAGCTTGTG TAGAAAGAAA TTATGAACTT GTGAGAGAAG TGTGGGTGAA 7320 

ATATCCGCTG GTCTTTGACT TGATAAAGTA AGGAAAAAAC CAAATTTCCG ACCTTCTTTA 7380 

ATGATTTCCT CAAAAGTAGA AAGTCTATAA TCCTTCCATG ACGTATTGTC GTTTCTTGAT 7440

TGATCAGACA AAATATTATG TGCTTCATCG ATAATAAGAT GGAAACTAAC CTTATCTTTA 7500 

CTCGTCTTCT GCTCATCGAA AAACATTTTT GAAGTTAACA TCGCTACAAT TTGTTTAGCT 7560 

TCTTGATTCA AATCACGTAA AGAAATTATC TCAAGAAATT TGACATTGTC GATCTGTCCC 7620

ACTATCTCTA TATACTTCTG GAAATCTTCA ATACGGCTAT CGATTCGCTT TAGAAGCGGG 7680

TCTATATGAT CCGATATGAC ATTTCGATAT AGTAAGTCTT TGACTAATTG TAATTTACAT 7740

CTAAGTTCAA ACTCTTTGAA TATATTTAAT TTTTCCGTTA GAATCACATT TTCAACTTGT 7800

TCAAGATTAA CTGCCTGCAT ATGTTTAGAT TCCCAATTTC CATCAAAAAA AGTACCGTTT 7860

ACAATAAATT TTGCAGGGTC ATCCtTACTG TATATCTGTG CCTGTTTGAT TGGCTCCAAA 7920 

GCTTCACTAG AATCATAAAA TTCACCCAGG ATACTTACAA ATCTATTTTT TAAATCCATA 7980 

TTGGGAAAAG TCGTAAATAT CTCTCTAAGT ATAGAACTTA TCCATCTTTG CAGTGACTCT 8040 

TCACCAAAAC CAAACTTATT TATCCCTTTA AGAACACGTT CCAAAAAAGG TTTTTGAGTT 8100 

kGTkGAGTCG CTGrAAAAAG rATwGaCmAG aTCTCAGAAC TAAAAAAGAT ATCCCTTTTA 8160 

ATCTGTAACC TCTTATTTGA AACGGTTCTA GTGGATAATT CATATATATT TTTTTCGCTC 8220 

TTAGGTACAC CAAATGAGTT ATCATGCACA TACTCACCGT TAAAGTCTAT GACAACAAAT 8280 

CTACTTTTCT CTCTTAGCAA TGGGAACGAT TGTTTAAATA GCTCAAAATA CAATTTATGA 8340 

AGTGTATTTG ACTTGCCACT ACCAGTATTT CCAAAAATAC CAATATGTGA CGCAAAAAAA 8400

GAACTTATTG GTAAGTTAAC CTCAATTTGC TCATTTAAGC TTTTCCCGAT TTTAATAGTT 8460

TGATCTCCAT TAAAATGATC GAAGCTATTT ACATAGATAG TTTGTGTCTC TTCAGTTGTC 8520 
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GGTATTGAAC 


AAATATTTCC 


AATCATTGGA 


ATATATGTTG 


TACCAGTAGT 


GAAACAGCCT 


8580 


TCTGAAATAT ATCCAATAAT TTGGATATCA AGAATTCTTT TTATGGTATT CTTACTGAAT 8640

CTGTTATCCA ACTGATAGTT CTGCTTATTA TTTTGAGTAT CCCAAATTGA TTCAGAATTG 8700

ATTCTGCCTA TTATTTTAAT AAAGTTCTGA CGAATAACTA CAAAACTATT TACACTTAAG 8760

TTTTTTATCA CTTCACCATT ATTAATATAT GTCGCTTGGT TCATATCATC AAAAGTGATA 8820

GCTTTACACC TGATTCCGTT CACTTCAACA ACTAAAGCAA CCTCTATATT AGAATTATTC 8880

GTCATCATCT ACTACCTCCG TATATTGACT ATCAATATAC TTTTCAGAAA GATTTCCTTC 8940

GATAATCCTG TTAAAAGCTT CAAAATCTAT AACCGCATCT TTATTATCGT AAACTGATAC 9000

TTCCTCAGAA TAAGTGACAA TAGTACTATC ATTTGATCTA ATCkGATTTG TTTCTAACGG 9060

TAGGGGCGTC TCTTCTTCGT TTTCAGCTTG TTCCATAATA AAATCTGAAG GTGTTATAAA 9120

CACAATATTG GATGGAACTG CATTGGCTCC AAAACTAAAT TGATTTATTA TTTCACCTTT 9180

AGTACCATCT TTATAACAGA AAATGAATAC CAATAAATTT GGATTATTTA AAGATCTTTG 9240

GACAATATCT GTTATATGCT CATCTAAAAA TGAAAAACCA AATGCTATTA AAACCGATTG 9300

TTCCTTCTCA AGCTCGTAAG ACAAGAATCT AAGCATATTA AAATACTGCT CTTCAATTAA 9360

GGTAGACTGA AATTTCCTTT TTGTGGGTAA AACAATCTGT ATTTCACTAA TTTTCTCACT 9420

TATGCCTTGT AATTGATTAT AATACTTTTT ACCAAAATAA TTGATTACTT CAGTAGGACT 9480


ATCAATAAAT CCATCAACAG GTTTGCCTAG TATGACATCT TTCAATATCT CAATGGTAAA 9540

TGACCGTTGA AAATTTGTAT CATGAACTAT ATCGTTTTGG AGTTCTGACC AAACTTCTTC 9600

TGCTTTTTCA GCAGTTTTAA TAGTGTCCAA AAGGCTGTTA TCAACAAGCA CTCTATCTCT 9660

CCCGATAAAG TTTTGCTCAT TCAGCCaATT CACAGATCCA TGACATTTCA CCAAATTAAT 9720

TGTTGGTAAT TCGTTGGCAT AATTATCAAA GGCCCCGGCA TACAACATTG TCTTATTAAA 9780

ATTTTCAGTG CTGAGATATC TTTTCATATA ACCATTCGTC CCGTCATTAA AGAAAATTTT 9840

GGGATTTGAT TCTATATTTT TTTCgACAGC TATTTCaAGA AACAAATCAT AATTCGTAGT 9900

AAAGATATTA GTTCTTCTTG GGGAAACACG ACTATTTCTC AGTTTTAGGA ACTCCACGCA 9960

ATTAAAAATA AATCTAGAAT ATTGATTCAT CACAGAAGAC CCAGAATCCA AGTGGTTTTC 10020

CGATTCACCT TTTTTTATAT ATTCAACCAA TATTGAATTA TCTTTGATTG CTATATAATA 10080

AAAAGCAAAC ATAACTAGTC TTTTACCTTC TTCAGTTATA TCTTTGCTAG TCATATATTC 10140

CTCAATTTTT CCTAAGCTTG AAAAAAATGG TACCGAAGCA CCTGCTCCAA ATaAGAATGT 10200

CGTTCtTTTC TCTgAAAGAA CAGCTCTAAG AGCTTTTACC TTTGAAATTG TCATAATAGT 10260

TCCCTCCAAA ATTGGAATTC ACTGTTATTT TACCACAATT TAATAACAAA CGATGATTCT 10320

CCTCAAAATT GTTTTTAAGA TATTTTTTTT TAAACTAATT AACGCCATAT ATCAAAAAAT 10380

TCAAAGAGAC TTTTCCTATA TAGGCATACT GAATAATACG ATCCGTATTT GGGATATCTT 10440

TTCCGTTCTC GTATTTGTCC ATTGACCATT TTGTAATAGG CGGTGAAAAA AACTTTCCAA 10500
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AGTCTTCTCT 


TTCTAAGCGA 


TAATCCCTTC 


GAATCTTCCG 


AAGTCTTAGC 


CCCATTCGGA 


10560 


AGGAATCTAA CTTTTCAAAC TTGCTAATCC TACTTCCCTT CTTCAACATC TTACTAAAAG 10620

TCTCTCCATA CAAAATATAA GGGACGGCCA CTTTCCCAAG ATAGGCAACA TTCATTAATC 10680

TTTCGATATC AGGAATATTC GTACCCTTTT CCCATCTATT CACAACATTC TTACCCGCTG 10740

GTCGAGTAAA GTTCTTACCA AATTTTTCCA GTGTCTCACC TCGATTTTTC CGAATCTCTT 10800

GGATTCGTTT TCCCATCGCT ATTTTATCGA TGATGGGTAA TTTAAACGAA CCATCCTCTT 10860

GTTTTATATC CATCAAAATT TCTTTACCCT CGTTAGACAA TAACTGAGTG ATTTTCACAA 10920

TTGCATTTTT TGATAATGGT CGACTTTGAT TTAAGAATTT ATTTAGAAGA GATATTTCGA 10980

TTCCTAACTT CTTAGATAAA TCCTCACGAG AAAAAATATT TTTAGTTTGA TATTTTTCTA 11040

AATCAATCAA GACTTGTTGT TTTCTTTGAA TCATATCTTT AATAAATGAT TCAGCGCCGA 11100

GAACGTCACT CAAAATTCTA TCGtTCTTTT TAGATATGCT GCGTAATCCA TTTTCATATT 11160

TTACGATTGT AGCTTGTGAT ATATTTGATA ACCTTGCCAG TTCTTCTTGA GTTAATTGcT 11220

TAGATAACCT AGCTTCTTTT AAATGATTTC CCATTTTGaT TAACTCATCT TTaTTTaTTT 11280

GCATGAAAAA AGCACCTCCy TATGaGAGrT GcTTTcATCA TACGCTGACA AGTAAAAATC 11340

CTCAAGGAAA TTATTCCTTT TTTGTCCTTT TTTGTTAAGT TGTGACAACC GTATGACAGT 11400

TGAAACTTTT 11410



(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

AATTAAATTT AGATTGTTAT CGATCATTCT AAACTCATTG TTTTCTAATG TCAATAAAAT 60 

AATGAGATTT AGATTAAAAT AAAATGAATT TCGAATTTTA TGAGCTTTTT TAATAAATCA 120 

GTAAATTCCA TGACATGTGA GATACTTCTC ATTTACAAAC ATTCTTAAAA AAGGTATGAT 180 

AGTAAAGAGG CAACTTGCAA AGGAGTAGAA GACGAATGAG TGAACAACCT CAACAACGTT 240

CAACCGAAGA AGTTGAAGCA ATTAAAGAAG ATATTTTAAC TGCATTAGAA ACAGTGATTG 300 

ACCCTGAATT AGGCATCGAT ATTGTTAATT TAGGACTAAT TTATGAAGTT GAATTTGCAG 360 

AAGAAACTGG TGATACTGTC ATCAAAATGA CCTTAACCAC AATGGGTTGT CCGTTAGCAG 420

ATGTTTTAAC aGAACAAATC CATGGTGTTT TAAAAGATAT TCCTGAAGTG AATAATATCG 480

AaGTAAAATT AGTCTGGTAC CCTGCTTGGA CAACGGATAA AATGAGCCGT TACGCACGAA 540

TTGCTTTGGG TATTCGCTAG TTATTTCGAG ACTAATGATA AAAAACATAT GAAAGAACCG 600 

ATTGGTCCGT TCTTTCATAT GTTTTTTTGT TTGTTTAATT TACTTATTTG AAAAAAACTA 660 

CCTGATGCCA TCAATACTTG GCTTTTTCCA AATTTTTGCT ACTGGATGGA AGTCTAATAA 720 
AAACTGTAGC TGCGTTCCAC CGAAGCCTGA CGCTCCAGCA AACATGATAT GAAAAGTGAG 780

CCCTCTATCC ATAATCAGAT ACCCTGTATC CAGCACGCCA TTTCTTTCAT AAAACTGATA 840 

ACGTTTTcTC TTTGGGCTTC ATcGCTGCAT TTTTGcCAAT TTCTTCACTC TCCAAAAGAA 900 

CCCCCTTAGG aTAACGTGTC TTcAATAACG cTAACACGGg TGAACCTAAa TCGnTTGtTC 960 

GGTGGTGCTG ATCAATTGCA AAGAAAAaGA CATAGGnTGC tTCAGGkGCA AtTGGACTAT 1020 

TTAACAAAAT CCCCACGTTC TCATCCGCCT CyTGaATAAC mAACmAATGA AaTTTtCCTG 1080 

sCTtGAGAAG ACATCAATAA TTGATCAAAC GGCAGTCTCT CTTCTTCTGG AAAAGCTTCT 1140

ATGTAGGTTG CTTCCCAATA GGGCTTGATT TTCTTGATTT AATAATGTGA ATTGCATAAA 1200 

AGCTCCTTTC TAAAATTGAT GCTCGTACTA ATAACTTTTG GTACTTAGAT AAGACCTAAA 1260

AGTATTAAAC ATGCGACTGG GnTATCCTAT ACGCAATTAT nTTTGGATAC TAAATC 1316

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2001 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192:



TTATGGnGGA 


TnAAATAGGT 


TCTTACTGGT 


TATGGCTAAT 


ATATAATAAT 


TTTACTTTAC 


60 


TAATTTTCAA 


ATTTCAACAn 


AAAACGACCA 


CCTTATCGGG 


AAAAGGTAGT 


CTGCATGAAA 


120 


AATAAGAATT 


TTAATATGTA 


TCAATATTAC 


AATATCTGTT 


TTkTGAGTTC 


ATTAATTTTT 


180 


ATGTAAAAAG 


ATGCCGCCTC 


ATTGGGGAAA 


GGCGGCAAGA 


TGTACTAAAA 


ATGAAAAATA 


240 


AAAATTAATT 


GGTAAAGACA 


TTTACCGCCT 


TCTTTATTAA 


AATACAAGTT 


TTCATTTTAC 


300 


AAAATAATTG 


TATTTTATTT 


AAATTCAAAT 


ATACAATTAT 


ATTAAAATAC 


TCCTATCTTG 


360 


GACTTGGGAT 


AAGATAGGAG 


CnCTCTTCTC 


GAAAGAAGAT 


CATATATGAG 


TGTATTAAAA 


420 


AGGTAACTAT 


ATTTTAAATC 


ATCTTTTTTT 


ATATTTCAAC 


TATTTTTATA 


AAAAATCCCT 


480 


ACTyCTCATT 


TCATTGCGAG 


AGGTAGGGAT 


TTACTAGTAC 


rfiTaATGTAGC 


TAAAGTTTGA 


540 


TAAGTCATAC 


TAATTTAAAC 


AGATAGTTCT 


AGGGTAACCA 


GATGaAACAT 


TTACCCATAG 


600 


ATTAATTTCC 


CAACACTTTC 


TGTGTACAAC 


CAAATAAATT 


CATCTGGGAA 


TTGAACATGC 


660 


GAGATTGTTT 


TCCCATTTAA 


AGAGTATTGT 


TCGGTAATGA 


TCCCTACAGA 


AGATGGTTGA 


720 


AAAGCAAAAT 


AGTCTCCGAC 


TTTTAATTCT 


TGATCTGGCA 


TATTCTCTCC 


TGTTGCCGGA 


780 


TCGATCTTAT 


CAATATAAGC 


AACATTAATT 


CCGTTTTCTG 


TCCAATCAAA 


GTCCGGCTGC 


840 


CCCAATACAT


CATTTCTTAC 


TTGCCAAATA 


CCGTTTACAA 


ATTGTAAATC 


GTCTACTCTA 


900 


TATCCGTAAC 


GGCGCTTAGG 


TTTAGGAGGA 


ACTGGAGGAG 


CGGGTGTTGG 


ATTACTTTTG 


960 


GGATTAGCGT 


ATTTGTCCCA 


CATGCTTCTA 


TCACCAAAGA 


ATACGTTTAA 


ATCTAATCGA 


1020 
CCATTCCAGC


CAGGCAATTG 


ACCATTTGAC 


GTATATTGAT 


ACATTGAAAT 


AAAATTCCAA 


1080 


TAAGGGACTG 


GTGGCGGAGT 


AGGTTGGGAA 


TATCCTTGTG 


GATTATTTGC 


TCC.G.TAATC& 


1140 


GCTAACCACA 


AACCATAATC 


AGCTTTTGCA 


ATAGAGCTAA 


AATTATATGA 


TTGTAAAACA 


1200 


TTGGTATAAG 


TATAAAACAT 


AGGTTTTACC 


CCTGTTTTAC 


CTTGAACATA 


ATCTAACCAA 


1260 


CGTTTAGCCC 


AAGCAACGTC 


TCCTTTATTA 


GTACTTTCCC 


AGTCTAAAAC 


CAAGACGGCC 


1320 


TTACCGATAT 


AACCTTTAAT 


ATTTTTTAAA 


AAGAACTCAG 


CTTCTTGTTC 


CGCTGTCCCT 


1380 


TCAAGTCCTA 


CCTCATTCGC 


AAAATGATAT 


ACAGCTACTT 


TTTTCCCACT 


GCTAATGGCT 


1440 


TGTTGGAATG 


CTCTATCACA 


GTCAGGATTT 


ATATATCCTG 


TTCCTCCTGT 


AGCTTTAACT 


1500 


ATAACAAAAT 


CTGCAGGGAC 


ACCTTCTTTA 


CCTACGTTAA 


TATTTGATTG 


CCAACTAGAA 


1560 


ATATCAATTC 


CGTTCAATGT 


CATTTTTATC 


ATCTCCTATA 


AAATTTTTAA 


AAGATTGATG 


1620 


TAAGCCAGTG 


CTTGCTAATC 


CACTTAAAGC 


CCCATATATT 


GTATTTtCAG 


GTGTGATTTT 


1680 


ACCTTGTATC 


CACCATGAGA 


TTAATATACC 


TAAAATGAAT 


ACAATTAAAG 


GAATAAACTG 


1740 


ATTATATTTA 


TCTTTTAATA 


TATTAGTAwt 


TTyAATAACA AACCCGACAG 


CTAAACACCC 


1800 


TAGCACCACT 


AATGGCATCA 


ATTCAATTTT 


AGAAAAGTCC 


ATTATTTGCC 


CCTCCGTTCT 


1860 


AAATCTCGTA 


TTCTTTCGTT 


ATTTTCTTTT 


AAATTGTTCC 


ATAAAGATTT 


TATTTGTTCA 


1920 


TTTAATTnTA 


TTACTTCATC 


CTCTACTAAG 


CCTTGTTTTT 


CnTTAGTTTG 


ATCAATTAAT 


1980 


AAATTAAGCT 


TATnTACTGT 


G 








2001 



(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

AATGATTGTA CGCTTCTTTA AAGGTTACTG TTGTGTCTTC TTTTCGATAA AGGATATATT 60 

CGTCATCTAG CATATCCATT TGCAGCTCCT GTTTAAATCC ATTTCTGGCA ACAATAGCAG 120 

AAACTAGATG AGCAACAGTC AAGCCATCTA ACGGAATATC CACTCCATTC ATTGCTAATG 180 

TGTAGACGGT ATCTTCCACT AAACCACAGG CGTTGCAACC ACCACTGGTT AAATCTACTG 240 

TACTTGTTAT AATTGCCTCT TGCATCGTTT CTTCCCCCAT ATTGTTTTTT AAAGTTATTC 300 

TTTCATTTGT TATTTACGAA CGTAATTCAC ATCATAATCG TCAACTTCAA TATAGATATA 360 

GATTTTTTTA TCAGGCATTT TACTTCGAAT GTCTCGTTCA ATCGCATTCA CAACATCAGG 420 

TGCATCTTCT TCCATTGGTT CTACCACATC AATTTTTGCC GCTACTAAAA TTTCAGTTGG 480

ACCTAAATGA ACGGTCTTTA CATCAATTAG TTTTTCAACA TCCGCCCGAT CAAATGCGGT 540

TTTAATTTTT GCTAAATCTG TAGCGGTCAC ACTTTCTCCT ATAATCAAAC TATAAAATTC 600 
TTTCGCTAAG AAAATGGCTG CACACATTAA TAGTAAACCA ATCAGTAAAC CACTGATAGC 660
ATCAAaTGCA


GCAATCCCAG 


TTACCATTGT 


TAAAAGGGTT 


CCCACCAAAG 


CTAAACACAA 


720 


ACCAATAACC 


GCACAAAAGT 


CTTCCGTAAA 


AATAATTAAA 


ATTTCACTGT 


GACGACTTTC 


780 


TCGCAAAAAT 


TTTAAGAGCG 


ATAACTTCTC 


TGTATTCAAT 


TCTTTGATTT 


CTTTCATCGC 


840 


TACACGCAAC 


GAGCTACCTT 


CCACAAGCAA 


ACCAAAAATT 


AAAATACCAA 


TGACAATAGC 


900 


CACATTTTCT 


ACTTCATGAG 


AGGGATGCAG 


TAACTTCTGA 


ATAGCCTCCA 


TTACGCCTAA 


960 


CGCGCCCCCA 


CCAAAAAAGA 


GCATCGTTGC 


AACAATTGTA 


CTAAAAAAGT 


ATTTTGCACG 


1020 


GCCTTCTCCA 


AATTGATGTA 


ACGCACTTTG 


TCCACGAGAA 


GCGCGCTTAT 


TCCCAAAAAG 


1080 


TAATAACACC 


TGATTACTAC


AGTCCACGAC 


ACTGTGGATA 


CTTTCATTCA 


TCATAGCGGT 


1140 


ACTACCACTT 


AAAGCAAAAC 


CAACAAATTT 


ACTAATCGCT 


ACTAAAATAT 


TCGCAATCAG 


1200 


TGCGGCAATA 


ACTGCAAACA 


TGCCACTCGA 


TTTTTTTGTC 


TCCATAATTA 


CCTCCTATAT 


1260 


GTAACGTAAA 


ACTAAGATAA 


ACCACCCGTT 


TATTCATCCA 


TCATTTTACC 


ACTTCTGAGG 


1320 


TATTTGTAAA 


AAAGAGTTCT 


CTTAAAATTC 


AAAAAGAAAT 


GTACTGACTG 


TTTTAGTTGG 


1380 


CCCTCATTTA 


TCGGTGTCCA 


TCCCACTCTT 


GATAAAATTC 


CTCCAGATAA 


AGTTCCATAA 


1440 


ATTGATGACG 


CTTTTCCGCC 


AATAAGCGAC 


CATAATTTGT 


ATTCATTTGT 


TCCTTTAACT 


1500 


TTAAAAGTTT 


CTCATAAAAA 


TGCATAATAG 


CAGTATCTTC 


CCCATTCCGG 


TATTCTTCTT 


1560 


GGGTTAACTG 


TTCACGAGGT 


TGTTTGTGAG 


GATCGTGAAT 


TAAACGGCCT 


TTATTTCCCG 


1620 


AATAAGCCAT 


CACCCGTGCA 


ATTCCAATCG 


CACCGATGGC 


ATCTAATCGA 


TCCGCATCCT 


1680 


GGACCACTTT 


TCCTTCTAAT 


GTAGTGACTG 


TTTTCCCAGT 


ATTTCCTCCT 


TTAAAGGACA 


1740 


TATTAGCAAT 


AATGGAAAGG 


ATGTGATTCG 


TTTGCTCTGT 


TGATACGTTA 


TTTGCTTCCA 


1800 


ACCATTGTTG 


CACCTTTAGC 


AAgCCAGCCT 


CTTCACTTTC 


GTTAAATTTT 


TCATCGCCAA 


1860 


TATCATGTAA CAATGCTGTT AATATACATA AAAATAAATT 


TGCTTGTTCT 


TTCTCTGCAA 


1920 


TTGTTTTTGC 


TAATTGTGTC 


ACCCGATAAA 


TATGCCACCA 


GTCATGGCCA 


CTTGCTTCGT 


1980 


TGGCTAATTC 


TTGTTGAACA 


TAACGCTCTG 


CTGCTTGAAT 


AATTGTTTGT 


TCTTCAAAAC 


2040 


TAAGAGATTC 


ATTTTCAAAC 


ATCCTAACAC 


TCCTTTATTC 


TGGATTTAAA 


ATTTCCACTT 


2100 


GGATAATCCC 


GCTAGCTTTC 


AGAAACATAT 


TGGTGACAAT 


CGGACCTACA 


AAAGAAAATC 


2160 


CTCTTTTTTT 


TAAGTCTTTT 


GCTACGTTAT 


CTGATAAAAG 


CTGTGTGCGT 


GGGACTTGGT 


2220 


ATGCTTCTTC 


ATAAATAGAT 


AAAACGGGGA 


CGTTTTTTAC 


AAATTGCCAT 


AGATAATTGG 


2280 


CAAAACTGCC 


AAATTCTTCT 


TGTACTTTTA 


AAATTGCTTG 


CGCATTTTTA 


ATGGTTGCTT 


2340 


CAATTTTGCG 


CCGATTGCGA 


ATCATATCTG 


GATCAGCTAC 


TATTCGATCG 


ACTTCGTCAG 


2400 


GCAACAGCGC 


GGCAACTTTT 


TCCGGCTTCA 


TTTGGCAAAA 


ATTTTTCATG 


AATGCTACTT 


2460 


TTTTCCCTGC 


TGCTGCTTTC 


CAACTTAAAC 


CAACTTGAAA 


AACACCCACC 


GTCAACAATA 


2520 


AAAACAGCAA 


ATCATCCTCT 


CTTGTAGGTT 


TTCCCCATTG 


TTCTGTATAC 


CATTGATAGT 


2580 


TGTTCATCGC 


CGCCTCCTTT 


TAAGACTGTT 


CAAATGTTAA 


TAATGCTTGT 


TTCATTGCTA 


2640 
AACCACCACG


GTAACCGGTT 


AATTGTCCCT 


TGCTGCCGAT 


GACTCGATGG 


CAAGGAATGA 


2700 


CCATTAATAA 


AGGATTTTTC 


CCAATCGCTG 


TGCCAACTGC 


TCGAACCGCA 


CGTGGACGCC 


2760 


CAATTTTTTC 


TGAAATTTCC 


TGATAGGTCA 


TTGTCACACC 


ATAAGGGATT 


TCTTGCAACG 


2820 


CTTGCCAAAC 


TTCTTTTTGA 


AAGGGCGTTC 


CTTCGATGTC 


TATGGGGCAA 


GCAAACGTTG 


2880 


TCGAGTGTTG 


GTTAAGATAA 


CTTATTAGTG 


CTTCTTTATA 


AGGCGCAGTT 


TTCTTTTCAG 


2940 


AAGCTTCTAA 


TTGTGCTTGA 


GGGTACCAGT 


TTTTCAGCTC 


TTCTATCGAT 


TGATTGGCTG 


3000 


AACCTACAAA 


AGCTAGTCCA 


GCATCTGTCA 


CACCAAGATA 


ATAGTGATCG 


TTCTGCCATT 


3060 


CTTGTTGCGC 


ATAATAAATT 


TTTTTTGCCA 


TATTTCTACA 


CTTCTTTCTC 


GTATTTTCTT 


3120 


ATCTTCATTA 


TAATTCTTTT 


GGCTTTCAAA 


ATCTTGCGCT 


CCCATCACTT 


TTTACTTGTT 


3180 


TTTATTAGGG 


ATTCTTTTAC 


AATGAAGAAA 


ATGGGAGGTG 


TGCTTACAAT 


GAAAATAACG 


3240 


TATCGACTTA 


CTCAGAAACG 


TTGGGAAGCC 


ATTCAAAATA 


ATAATACGCA 


ATTTGATGGT 


3300 


GATTTTTTTT 


ATGGTGTGAC 


CACAACTAAA 


ATTTTTTGTC 


GTCCTTCGTG 


CCCTTCCCGT 


3360 


GTGCCCAATA 


AGAACCACGT 


TCTGATTTTC 


AAAACGGCTC 


AGGAAGCTTT 


AACACTTGGT 


3420 


TTTCGTCCAT 


GTAAAAGATG 


TCAACCAACC 


GGGCAAGTCG 


TTCCTAACGA 


GGAATGGGTT 


3480 


GCTCAAATTA 


AGCAATTTAT 


TGATCGGCAT 


TCAGCGGAAC 


CATTAAGTTT 


AGACTATTTA


3540 


GCGCAAgcCT 


GCCACGGTAG 


TCCCTTTCAT 


TTGCAGCGAA 


CCTTTAAGGA 


ACAGACAAAA 


3600


CAAACCCCGT 


CAGCTTATTT 


AACAAGTGTC 


CGAATTAGGC 


AAGCCCAAGA 


ATTGCTTCTC 


3660 


CATTCACCCC 


TTACGATTCA 


ACAAATCGCG 


AAAAAAGTTG 


GCTTCCAAAC 


CGCAGCATAT 


3720 


TTTGCCACCA 


CTTTTAAAAA 


AGAAACAGGC 


GTAACCCCCT 


CTGTTTATCG 


CAGTAAAAAT 


3780 


GAGCCGAAAA 


AATAACATTT 


CGGCTCATCG 


GATTATTTAT


TGGGGCGTGA 


TTGTAGCCAT 


3840 


GCTACAATCA 


ATTGACTGAT 


TTCTGTTTGC 


TGTTGCTCAT 


TAGAAATTGT 


CGCTTTGCTA 


3900 


TCTCCTTTTT 


GGACACCGTA 


ATTACCAAAC 


CCTGCATGAT 


TCCCACCTTC 


GATAGATACA 


3960 


TACTGCGTAT 


CTTTTGGCAA 


AAACTGTTTG 


GCTTTTTGGT 


AGCTAGTCCA 


ATTTAAAACA 


4020 


CCATCGTTTG 


TGGCTGTTAT 


TGACAGAACC 


GGCAGATTTT 


TCTTGTTTAG 


TGCACCTTTT 


4080 


TCGTCAGGAT 


AGCTAGCTAA 


ATAAATCATG 


CCTTTGATTC 


GTTGCTCATT 


TTCCGCTACA 


4140 


AAACGGCTAG 


CCATCACACC 


ACCCAGAGAA 


TGTCCAGCCA 


AAACGAAGTT 


TTGTTGTGGA 


4200 


TGATCCTTTA 


TAATTTGTTC 


TGCAGCATTT 


GGTGCAAGAA 


CCGCTAAATT 


CAAAGGAAAA


4260 


TGAACAACAT 


AGACATCATA 


ACCTTTCGTA 


GCAACTTCTT 


GCGCCCATTC 


ACTATAACTA 


4320 


GCGGGCGCAA 


CCAATGCACC 


TGGATAAAAA 


ATCAGTGCTG 


TATCATTTTT 


TTGCCCGTTT 


4380 


GTATAATAAT 


CATAGTTTTT 


TTCTTGCGTG 


GCACTTTGGC 


TTATTTTTAA 


GGCTGAATCG 


4440 


GTTGCTTGAT 


ACGTACTATT 


TTTAATAAAA 


ATAGTCAGAC 


CGATGACTAA 


TACCGCTAAA 


4500 


AAAAGCCCGA 


TTCCTAGGAC 


AATTTTTTTC 


CAACGTGATT 


TkGTtCTATT 


TTTCATCTCA 


4560 


TTTCCTCCCA 


ATGAAAAAAT 


TCTTCTTTCT 


GATTCTACCT 


TTAATAAAAA 


AAGrACACAA 


4620 
CAATaGGaGT yGGTGAAGCa ACCCGTCTGT TGCTCCACCA ACTCCTAACT AGACCGTTTC 4680
TCG 4683 
(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9235 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 



CGCATTGAAA 


TTTGTTGCTC 


TCTTGATCGT 


GATnGGTnGC 


CTTAATAGTG 


ACGGCTTTTA 


60 


CTGTCGGGCT 


TGGTTTGACA 


TCAATGCTAG 


AAACACTAGG 


AACAACCTTT 


GTGAATAATC 


120 


GAGGCATGGC 


TATCTTTATT 


ATTATyATGT 


TAGCAACAGG 


AACGTTAGAA 


AGAAATGGCT 


180 


TGAAAGAGTC 


AGCAGCTACG 


TTGATTAAAC 


GATTCAAAAA 


GGTATCTGCT 


GGTGTGATTA 


240 


TTGACATTTA 


TGGTGTCTTT 


CGGATTATCT 


TCGCCGCTTT 


TAACGTAAGT 


TTCGGTGGGG 


300 


TAGCAGGTTT 


TGTGCGTCCG 


ATTATTTTGC 


CAATGGCGTT 


AGGAACAATT 


GAATCAAAAA 


360 


ATTTACCAAT 


GGTGCCAGAG 


TATGAAGAAG 


AACTAAAAGG 


GATGGCTTCA 


GCCATGGAAA 


420 


ATATCTGTTG 


GTTCTTTGGT 


CAAGTTTTGT 


TTGTTGGTGG 


GGCTGGGGCA 


TTGCTTGTTC 


480 


AATCAACATT 


GAAAGATTTA 


GGCTATGAAG 


TGACATTAGG 


TAAACTGGCT 


TTAGTGGAAG 


540 


TACCAGTCGC 


CTTAGTTGCC 


TTAATTTCGG 


CAAGTATTTA 


CTTTACTTTA 


AAAGAAAAAA 


600 


GATTACGTAA 


AAAATATTAT 


GGACAGGAGG 


GACTGAAATG 


AGTTTTTTCA 


CAGATCCAAG 


660 


TATTGATTTG 


AGTGCAAAGT 


TATTGGAAGT 


CCTTTACATT 


TTTATGGGGA 


TTATCTTGTT 


720 


ATATACGAGT 


TTTCGTAGTT 


TTCGTGATAA 


AACCAATCAA 


CATGCGTATG 


GGACTAGTTT 


780 


ATTCTGGGGA 


ATTTTAGGAA 


TCTTAATCGG 


TTTTGGTCGG 


TTTATTCCAC 


CAGTTGCCTG 


840 


TGGCGTACTT 


ATTTTTGTGA 


TGACAATTCC 


GGCAATTCTT 


CAAAAAGTGA 


CAAAAGGAGA 


900 


AAGTCGCTTA 


CCTTCGAAAG 


AATACATGGA 


AAAAATGTCT 


CAAAAATTAG 


GAATGAAGAT 


960 


TTTTATTCCC 


GCACTAAGTA 


TCGGGGTTTT 


TGCAATTCTT 


tTTGCCACAT 


TGACTACTTT 


1020 


AGGAGCTTTA 


GTCGGTGTCG 


GTGTTGGTGT 


CTTTGTAGCG 


ATTTTCATCA 


TGATGTTCTT 


1080 


TTCTAAAAAT 


AATACGCCGA 


AAGTATTTTT 


TGATGATGCG 


GCGGAAATGT 


TAGAAACAGT 


1140 


CGGTCCTCTT 


AGTACGTTGC 


CAATGCTGTT 


AGCTTGTTTA 


GGTGCTGTTT 


TTACTAGCGC 


1200 


TGGTGTTGGT 


GAGATTATTT 


CAAAAGGGGT 


TAGTGCCGTA 


GTACCAGCAG 


GAAATGTCAA 


1260 


TGTAGGGATT 


ATTATTTATG


CTGTGGGAAT 


GGtTATTTTT 


ACGGCTATCA 


TGGGCAATGC 


1320 


CTTTGCAGCA 


ATAACCGTGA 


TGACAGTCGG 


AATTGGTGTG 


CCGTTTGTCT 


TTTCTTTAGG 


1380 


TGCAGATCCC 


GCATTAGTTG 


GGATGGTTGC 


CTTAACATGT 


GGTTTTTGTG 


GTACACTGTT 


1440 


AACACCGATG 


GCTGCGAACT 


TTAATATCGT 


ACCTGTAGCC 


ATGTTGGAAA 


TGAAAGATAA 


1500 
GTACGGCGTC


ATTAAAAATC 


AATTGCCGAT 


TGCACTTTTC 


ATGTTAATTT 


TCCAAATTGT 


1560 


TTATATGATT 


CTATTCAAAT 


AAGAAAGGAT 


GATCGAAATG 


AAAGTTGTCG 


TTACTGGCTT 


1620


TGATCCATTT 


GGAGGAGAAG 


CAATCAATCC 


TGCCTTTGAA 


GCAGTCAAAA 


AATTACCTGC 


1680 


TGAAATTGCA 


GGAGCTGAAA 


TTATTAAAGT 


TGAGGTGCCA 


ACTGTTTTTG 


GCACTAGTGG 


1740 


TGAAAAAGTG 


GCTGAAGCAA 


TTGAAACCCA 


TCAACCAGAT 


ATGGTGATTT 


GTGTTGGACA 


1800 


AGCAGGCGGC 


CGCGAAACAG 


TCACTGTTGA 


AAAAGTGGCG 


ATTAATTTAG 


CGGAAcGCGT 


1860 


ATCCCGGATA 


ATGCAGGACA 


ACAACCAAGC 


GATGTGCCGT 


TGGTTGAAGA 


TGGAGCAACA


1920 


GCCTATTTTA 


CAAATTTACC 


GATTAAAGCG 


ATGGTAAAAA 


ATTGTCATGA 


CCATCAATTA 


1980 


CCTGCmGCGA 


TTTCTTATAC 


AGCAGGAACC TTTGTATGTA


ATGATATTAT 


GTATCACTTA 


2040 


CTACACTTAA 


TTAACACTAA 


ATATCCAACT 


ATTCGTGGCG 


GATTTATTCA 


TGTTCCGTTC 


2100 


CTACCAGAAC 


AAACAATTGA 


TAAACCAACG 


TTTGCTTCTA 


TGTCATTAGA 


AGCAATCACA 


2160 


GATTCTTTGT 


TCTATATGAT 


TGAAGCAGCA 


GTAAAAACGC 


AAGAAGATAT 


TCAACTACAA 


2220 


GGTGGCACAA 


CCCATTaATT 


AACAGGaGCG 


TGGAACAAAA 


ATCACTTTGG 


ATTTTTGTTG 


2280 


CACGCTCAAA 


AACTGATAAA 


CGGCGGGAAC 


AGAAGCAACT 


CcTTCGGAAA 


TAAGCCGAAA 


2340 


TTCTCCAAAA 


ATTAAAGAAC 


AATTTTCGGA 


AATTCCTTCT 


TATTTCTCGG 


AGTTAAACGC 


2400 


TGCTGTCCCG 


ACCTCTTTCT 


ATTTTTACAA 


AAAGAAAAAG 


TAATTTTTGT 


TGGCGTGTGA 


2460 



AAAAAATTTT 


TTATACTATT 


TTCGTTGACA 


AGTCGGTTTC 


TTCTTGGTAT 


ATTGAGATTC 


2520 


ATAAATTATT 


AGCCCaGTAG 


GAAATAGATG 


TTTTGTTCAG 


AAAGTCGATG 


GTTGCTGCGA 


2580 


ATCGATCAAG 


TCTATTTGTG 


AATTACACTA 


ATAAGTAGTA 


ACAATTACAT 


AATACATAAG 


2640 


CGAGAAGCAA 


GTAAGCTTCT 


AATTAAGGTG 


GTACCGCGGA 


GAGATTCGTC 


CTTATTCTTT 


2700 


AAGGATGAAT 


CTCTCTTTTT 


ATGTAGCTTT 


AGGAGGCGTT 


TTCAATGAGT 


TGTGGAGGGG 


2760 


TTATTCGTTA 


TCATAAGATC 


CTTTAATGTC 


AGAGTATTCA 


GAAAAAAGAT 


TAGAGGAGAG 


2820 


TAACGAAATG 


AAAGAAAAAC 


AGAAAAAGAA 


GACCGTGAGT 


GTTCGAGAAG 


CAAGTATCGT 


2880 


ATTATTAATT 


ATTGTGGCAG 


CTATTGCAAC 


AGGAGTAATT 


GGCTTAAAGA 


TATCGCCAAA 


2940 


TATTACTATT 


TTGTTTGTGA 


TTGCATTGAT 


TTTAGGCTAC 


GCAATGTTTC 


GTAAAGTACC 


3000 


ATTTGATCAA 


ATGCATAAAG 


GGATTGTTGA 


TGGCTTGAAG 


CCAGGAATTA 


TTCCAATTTT 


3060 


TATTTTCATT 


TTAGTGGGGG 


CATTGATTGC 


CGTTTGGATT 


CAAGCTGGAA 


TTATTCCAAC 


3120 


GATTATGGTT 


TTTGGTTTTA 


AAATGATTAG 


TGTTAAATGG 


TTTGTGCCGT 


CAGTTTTTAT 


3180 


TGTCTGTGCG 


AtTGTCGGCA 


GTGCAGTAGG 


TAGTGCTTTT 


ACAGTGATGT 


CAACAATTGG 


3240 


AATTGCCTTT 


TTCGGTATTG 


GCAGTACCTT 


AGGGTTAAAT 


CCTGCTTTAG 


TTGTGGGTGC 


3300 


GATTGTTTCT 


GGTTCAGTTT


TTGGTGATAA 


AATGTCACCT 


TTATCAGAAT 


CAACAAACTT 


3360 


AGCCGCAGCG 


ATTGTTGAAG 


CGGATTTGTT 


TAAACATATA 


AAAAACTTGA 


TGTGGTCGAC 


3420 


TGTACCAGCA 


TTTATCGTCT 


CTTTAATCTT 


GTTTATGATT 


TTAGGACAAA 


CGTACGCGAA 


3480 
TACAAGTCTG


ACAGAAGTAG 


TTCAAGTTAT 


TGAAGTATTG 


GAAGACCACT 


TTACCATTTC 


3540 


AATTTGGTCA 


TTGTTACCAT 


TAGCCTTAAT 


GTTAATCTGT 


GCCTGGCGCA 


AAATTCCAGC 


3600 


GATTATCACG 


ATTTTGTTAA 


ACATTATTGT 


GGCAGTAATT 


ATGATTATTA 


TTCAAAATCC 


3660 


GAAAGTATCG 


TTGCAGGCAT 


TAGGAAACAC 


GTTGGAAAAT 


GGGTTTGTTT 


CTCAAACGGG 


3720 


GAACGCACAA 


ATTGATCAAT 


TATTAAGTCG 


TGGCGGGATT 


ATGAGTATGA 


TGCCGACTGT 


3780 


GGCACTTATT 


ATTCTAACGT 


TGTCTCTAGG 


TGGCTTATTA 


ATGGAATTAG 


GATTGATTTC 


3840 


TGCGGTCATG 


GAAGTTGTTT 


CGCAGAAAAT 


GAGCAGTACA 


CCGAAATTGA 


TTTTATCAAC 


3900 


GTTATTAACA 


GGAATTGGCG 


TAAATATTTT 


TATTGGTGAA 


CAATTCCTAT 


CGGTTATTTT 


3960 


ACCAGGGAAT 


GCCTTTAAAG 


AAGTTTACAA 


AAAAGAAGGC 


TTGGATCCGA 


CTGTTTTAGG 


4020 


CAGAACCTTG 


GAGGATGGTG 


GGACGGTTAT 


TAACTACCTC 


ATTCCTTGGG 


GGATTGCAGG 


4080 


TAGCTTCGTA 


GCGGGGACCT 


TTGGCGTACC 


AACGTTGACT 


TACTTACCAT 


TTGTTTTCTT 


4140 


TAGCTTATTA 


TCGCCTGTCT 


TCTCAATGGT 


CAGTGCATTT 


ACTGGTTTAG 


GAATTAAGCG 


4200 


GCTATCAGCA 


GAACCTGTTA 


CCGAAAAATA 


GATTGATTAA 


AATAGAACAG 


TTGATGGGGA 


4260 


ATACTGTAGT 


TTCTCCCTGT 


CAATTGTTCT 


ATTTTTCTTG 


TAAAAAAGGT 


TGCATTTTTA 


4320 


GAAAGAAACA 


GTTATAGTTG 


ACTAGTCAAT 


TGTTGACGAG 


TCATCTTTTT 


AAGGAGGAAA 


4380 


CTATGTCGTT 


ATCATTACGG 


GAAGTCAGCC 


AATTATTATG 


TCAGTTAAAG 


GTATTAGATC 


4440 


AAAAAATTAC 


CAAAGTTTTT 


GAAGAGCAAG 


TTGGCTTGAG 


TTTGACTCGT 


TATGAGTTGC 


4500 


TGATGATTTT 


AAAAGAACGT 


CAGCCGTGTC 


TTCAAACGGA 


GATTCAAGAA 


CACTTAAAAA 


4560 


TCGATAGTGG 


TGCGGTCACT 


CGCCATTTGA 


AAATTTTAGA 


AGAGAAACAG 


TATGTGACAC 


4620 


GACAGCGAAA 


TCCAGAAAAT 


AATCGGGAAG 


TGCTAGTGCA 


TTTAACAGAA 


AAAGCACAAC 


4680


AAGAACTACA 


GCAATGTACA 


GCCAAGCAAC 


AAACGGTAAC 


AGAAATTATT 


CCCAGTACAT 


4740 


TTACAAAGGA 


CGACTGTCGC 


CAGTTAAAAG 


AACTTTTGAC 


AAAACTAGAT 


CAAAGCATTA 


4800 


CAACGAAAGA 


GGTATGATAG 


AATGACAACA 


TATACAACGA 


ATGATTTTTC 


AGAGATTGTT 


4860 


TTTGGCCGCA 


AATCGGTTCG 


GGTGTACGAT 


GAAACCCATA 


AAATTTCTCA 


TGAAGAAATG 


4920


TTAACAATGA 


TTCAAGAAGC 


GACCACCGCC 


CCTTCTTCTG 


TTAACATGCA 


GCCATGGCGT 


4980 


TTTGTCGTAG 


TGGAAAGTGA 


AGCAGGCAAA 


GAAAAATTAA 


AACCCTTAAT 


TCGCTTTAAT 


5040 


ACGCGTCAAA 


ATGAAACATC 


TTCTGCAATG 


CTCTTAATTT 


TTGGTGATTT 


AAACTGCCAT 


5100 


GAGCGAGGAG 


AGGAGATTTA 


CAATCAAGCC 


TATGCTAGTG 


GTAAAATGCC 


TAAGGAAGTG 


5160 


CGGGACCAAC 


AATTAGCAGC 


CATTATTCCC 


CATTATGAAA 


GTTTAAGTAG 


AGAACAAATG 


5220 


AATGACATCG 


TAAAAATTGA 


TGCGAGTTTA 


GCAGCTATGC 


AATTTATGTT 


AGTGGCCCGA 


5280 


GCTCACGGAT 


ATGAAACAAA


TCCAATCGGT 


GGTTTTGAAG 


CGGAAAAATT 


AGCCGAAACA 


5340 


TTTGGTTTAG 


ATCAAGAGCG 


CTATGTCCCA 


GTAATGATTT 


TATCTGTGGG 


TAAGGGCATG 


5400 


GAAACTGGTT 


ATGAATCTGT 


TCGTTTAGCA 


CCAGAAAAAA 


TCACGACATT 


TGAATAGGCA 


5460 
ATAAAAAATG


ACGGCGTACG 


CTtGCATTAA 


CCAACTTTCT 


TAAAGAAAGT 


TGGTTAATGC 


5520 


mACAGCGATG 


CCGTCATTGT 


CGTTGGATAA 


AGTGACTTCC 


GTTGCGATGC 


TTTTAAGCTC 


5580 


GTCAACGGCG 


TTTTCCATAG 


CAATCCCCGT 


GCCGGCCCAT 


TCCAACATAG 


TATAGTCGTT 


5640 


GTGACCATCA 


CCAAAAGCAA 


TTACTTCTTC 


AGCAGCTATA 


CCTAACTGTA 


GGGTTAGTTT 


5700 


TTCTAAAGAA 


CGCGCTTTAT 


CAATATTTTT 


AGCGGTAAAT 


TCAAAATAGA 


AAGGAGCGGA 


5760 


AAAAGCAGCG 


GTGACGGTTT 


CTTTGAAAGG 


CGCGTAGATG 


GCCTCATGAT 


ATTTTTGTAA 


5820 


ATAGGCGGGT 


TCACCAGCAA 


TTAAAATTTT 


GTTCAAGGGA 


AAGTTTAGTC 


TGGCGGCTAA 


5880 


ATCACGCCAC 


TCACATAATT 


GAAAGTTGCC 


ACCCCGTGAT 


TCATATTCAA 


TAATATTAAA 


5940 


ATCACCAGTC 


TCTAAATGTA 


AGGTGTTATG 


AAAAACATCA 


TTGACATACA 


TAAATGTTTC 


6000 


ATCATTAATC 


ATTGGAATTA 


CGTCAAATTG 


TTTCAAATGC 


TCTAAAATTT 


GTTGACTCAA 


6060 


TGACTTTGAA 


ATTGTCTGAT 


TAAATAGTTG 


TTGCTGACTG 


CCGTGATGGG 


TGACACAAGC 


6120 


GCCGTTATAG 


GAAAGTAAGT 


GTCCATTATA 


ATGAGCCATC 


TCCAATTCAT 


CTGCTAAGGG 


6180 


CCGCATACCG 


TTAGTCGGGC 


GGCCAGAAGC 


CAAAATCAGT 


GATAGGCCTT 


GTTTTTGTGC 


6240 


AGCTACTAAT 


GCTTGTCGCG 


TTTTTGGTGA 


AATCTTTTTT 


TCTGAAGTTA 


ACAAGGTGCC 


6300 


ATCAATATCC 


ATTACAATTG 


CTTTGATTGC 


CATAATTGGC 


CTCCAATCCG 


AAGAGTTTCT 


6360 


TTCTTTTAGT 


ATAAAAGTTT 


TTTCTATTAA 


TGTAAACGGA 


TTCTCTTGTG 


AGGTGTATAA 


6420 


AAGAATTATT 


TTACAAAAAT 


CTATACTATT 


TTTTTGTGAA 


ATTTCATATA 


ATAGAAGTAT 


6480 


TCTAATTATC 


TACATAGAGA 


GCGAGGGACA 


AGGAATATGA 


AGGAAAAAGA 


AATGCATTCG 


6540 


CTCTTTTTTA 


AACATAAGTT 


TGTGAAAGTA 


ACTCCCTATT 


TACGTCGTTT 


TGGTCATCGT 


6600 


TTGAGTGGGA 


TGATTATGCC 


AAATTTGAGT 


ATTTTTATTG 


CGTGGAGCTT 


ATTGTCTTTG 


6660 


GTGGCTGGCT 


ATACGACTGG 


GAATCTACGG 


CTAGCTCTTT 


CTGAAGTCGA 


AACGATAATG 


6720 


ATTCGAGTTG 


TTTTACCGAT 


TCTAATTGGT 


TTTACAGGCG 


GAAAAATGTT 


CGAGGAACAA 


6780 


CGTGGCGGCG 


TTGTTGCTGC 


TATTGCGACA 


GTGGGCGTGA 


TTGTTTCCAC 


AGATGTTCCA 


6840 


CAGTTGTTTG 


GTGCTATGTT 


TATTGGCCCT 


TTAGCAGGAT 


ATACTTTCGC 


CAAAATTGAA 


6900 


CAAATTCTCT 


TACCGAAAGT 


TAAAGAAGGC 


TACGAGATGC 


TGACTAAAAA 


CTTTTTAGCA 


6960 


GGAATTGTGG 


GAGGACTGCT 


GTGCTGTTTT 


GGTATTCTGG 


TTGTAGCTCC 


GGCTGTTGAA 


7020 


AGCGCTAGTT 


TTTGGCTGTA 


TCAATTTTCT 


TCTTGGTTAA 


TTGAAGCCAA 


TCTTTTACCA 


7080 


TTGGTTCACG 


TTTTCTTAGA 


GCCCTTAAAA 


GTGTTATTTT 


TTAATAATGC 


GATTAACCAT 


7140 


GGCTTATTAA 


CGCCTCTAGG 


TTTAGAAGGT 


GCTAGTCAAA 


CAGGTCAGTC 


CATTTTATTT 


7200 


CTATTGGAAA 


CAAACCCTGG 


ACCAGGCGTG 


GGCGTTTTGG 


TTGCTTTTCT 


GCTGTTTGGG 


7260 


CCTGTAGGAC 


AACGAAAAAC


AGCAGGAGGT 


GCCACCATGA 


TTCAACTGAT 


TGGGGGCATT 


7320 


CATGAAATTT 


ATTTTCCGTT 


TGTTTTGATG 


GACCCGCGCT 


TATTTTTAGC 


AGTAATTGCT 


7380 


GGAGGAATGA 


GTGGTACGCT 


TGTTTTTCAA 


ATATTTAATG 


TGGGTCTAAG 


TGCTCCAGct 


7440
TCGCCAGGTT


CATTGGTTGC 


GATTTTAGCC 


AATGCCCCGA 


CTGA1 bbbAb 


CP TPP. PP. P. TT 


7500


TTTAGCGGAA, TTTTTG1TAG 


CTTTCTGTGC 


TCTTTTGCAA 


TAbbAAbbl 1 


PTTATTAAAA 


7560


r^TPAACGAG 

V^u x vriA^vjrivj 


GAATTGAACC 


AGTTTCAATG 


7VT , 7\TvT\/~ > T\T</'"'A 

ATAAAGATGA 


a /"Pflrra zip a 
AbbAbbAAbA 


PP AAP.TP.PAA 


7620


APAP.TPAPAP 


CTAACTATCA 


GCAAATTTTA 


TTTGTTTGTG 


A I bb Abb AA 1 


PPPPTP A APT 


7680 


ppp a t p p. p, p. c 

ub\-rtl UUOU'J 


PTAGTTTGCT 

\w. L 1 X J. w V^ X 


AAGCCGACAA 


l 1 AAAAbb X b 


TP A A PTTPP A 
1 bAAb 1 1 bbrt 


P ATPPPTP.TP. 
on 1 *o v^v^, lulu 


7740 


Ab 1 1 Abbrtb 1 


PPPTTPATPA 
VwV^u 1 1 V^rt l 


GATGAAGTGG 


CAGbbTAAbA 


("•ATTAPTPPT 

bAl I Ab I bb 1 


PATTP A APP A 


7800 


/~> tv ATTrH Q ZiP 

bAA 1 i b Artrtb 


aptt ap^papa 


AAAGTACGTC 


/—i /"I TV ^ TV Tv TV TV /"*/~* 

C C AG AAAAG G 


ATA T^/""*TP A P 

AlAIbbl bAb 


TPTTPAAAAT 
lull \^I\f\t\r\l 


7860 


TTI 1 iAbAAA 


TT A A ATPPTA 
1 1 rtrtrt itL in 


TTACCCGCAA 


GTTTTAGCCA 


Tv Tv /~""T*/~* TV rp /*"» /"* 

AACTGAC i bb 


1 1 b 1 1 b 1 b Art. 


7920


bAbbAAl b 1 1 


P APTTPPTTP 


AGAGTCTACT 


GAAACGAACT 


/"•/"AOA A A A ^ A 

CGACAAAAbA 


A ATAPAPAAP 

AA 1 Ab AbAAb 




rri m /** rp T 1 T 1 T 1 T T 

bllbllllll 


TAT ATPPPPA 


GAATGTTCGA 


GGATCGCAAA 


T\ TV ftl f* TV 7\ fP 

CAATGGGAAT 


f-> f> Tv TV rnmivmrnp 

bbAA I I A I I b 




r> r~" c t\ ~r\ (~* i\ a p 


PPPPPA A APA 


AGGAGTCGCG 


ATTGAAGTAT 


CTAAAGAGCb 


A/^«T , /* , ^*AAAPB 

Ab 1 bbAAAb A 


8100


b I b 1 I 1 i 1 1 A 


PPAZiPP.AP.AP 


AACCTACGTA 


GTGACTCGTG 


AACTGGCGCA 


A rPPTI\ TPAT 

Abbb 1 Al bAl 


8160


TTAbAl b 1 AA 


p p p a a p a a a a 


TTTATACGTA 


GTTACTAGTT 


mmmmPH t\ tv 

TTTTGAATAA 


b AAAb Ab 1 A 1 


8220


bAAbAA i obt 


t p p. zi a p. p, a p, P, 


AGCTGATAGA 


TGTTTTTAAC 


/-^ Tv TV TV TV /^T\'A 

G AC AAG AG AA 


Ab IbJ. lblAi 




TAAbbbAAI 1 


APTPAATAPT 


CCTACCCCCG 


TTTCAGTAAA 


CCGCATGATG 


AAI I 1 b 1 1 AA 


8340


Abb 1 I/iuLLu 


ppppapppta 


TATCGTGAAT 


TAGAAAATTT 


tv tv Tv Tv z"' rp 

AGAAACCTCG 


b I Abb A 1 bb A 


8400



1 bbbbb^urt^ 


ATTPPAAAA A 


GTAGCTCGTG 


GGCGCTTTTC 


fV m m TV TV /"* TV 

GATTCAAGCA 


/""AT^A A/^PAP 

G A 1 bAAbb Ab 


8460


bbA 1 bAbuurt 


A A TTPAPP.PT 


GCTATTTTAG 


GAGAAGAGAC 


tv /~*t TV Tv T\ 

ACAGGAACTi 


1 b 1 Ab 111 uu 




rr nrrc a a P a 

bbbbbbArt\-rt 


TP. PP. A TTTT A 


TTGACATTAC 


TGCAAACAAA 


AGAGCCAGlb 


1 b 1 Al 1 1 


8580


Al 1 I ILlbbrt 


AAPPTATTPT 


ATTAGTAATA 


CGACTTTTTA 


TGCCGATAT 1 


AAAb AA 1 1 Au 


8640


a a apapppat 


TPPTPPHATT 


CCGTTGACGA 


TTGCTCGGAA 


TCAAGGTTA1 


/^•AAPTPBPTP 
bAAb 1 b/ib 1 u 


8700 


ul 1L1 urtrvrvrt 


ATATPGCTGT 


CTTCTAATGG 


CGAATATTCT 


Tv Tv rp tv m/™ 1 A A 

AAGT A I bbAA 


A TT A A TP APT 
All Art 1 Unb 1 


8760


APPA ATTTTT 


TCATTTTACC 


GAATTAACGA 


TV TV TA 7\ /*l TA 

CAAACGACCA 


TT rT 1 1 b 1 I 1 


PZiPTTT ATTP 
bAb 1 1 1A1 1^- 


8820 


ATGCAGAACA 


TTTAGCATTT 


GCACAAAAAA 


TTGTTCAGCC 


7V A TV /** tr P A A A 

AGAAbTAbAA 


APTTTA TTTP 

Ab lllAlilb 


8880 


CAGCATTAAG 


TGATCGCAAA 


TTACAACATT 


TGATTTTAAT 


pmrpTv TV /~ ' TV 7\TT 

bT I AAbAA 1 I 


PPP ATPPATP 
u^urt i bun 1 Vrf- 


8940


GGGTCACTGC 


AGGTTTTTAT 


TTAbb TbAJL b 


AP A PPT AT BP 
Ab Abb 1 A 1 rtV- 


T P.PP.P A P. A T P, 


AATAAGGCCT 

/ul i nnu u *^ w i 


9000 


TTCTAAATTG 


TTCTAAACGA 


CTTTTTTCAA 


AAGTCGCTGC 


TGAAACGAAA 


CAACTATACG 


9060 


CTGTCAGTGA 


GATTGTTTTT 


TTTGCTAGCT 


TGTTAAGTGA 


TTTTTCTAAT 


TCnTTTGATG 


9120 


AAGATTTTTT 


tGATGAACAT 


TTTGACACTC 


AGCTGGcTTA 


TGAAGTGAAA 


CAACTGATTG 


9180 


AAGCTGTAAG 


TCAAGAAACG 


GAAGTTnATT 


TTTTCGAAGA 


TACCAATTTG


TCTAA 


9235 


(2) INFORMATION FOR SEQ ID NO: 195: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 



CCAGCTCAAA 


ACGCATTTAA 


TCACAAAAGA 


GGTAAACCGG 


CTGAAAAAAG 


AGAAGAACGC 


60 


CcTGCACTAT 


ATCGGCATTG 


CCGCAGACGA 


AGCGAAACGG 


TGCAAGGATG 


CCCCAAACCA 


120 


CCGCTATCCC 


CTGGTGGAAT 


GGGGCGTCAC 


CGAGGCGCAG 


GCTtTACAGA 


TCTGCTATGA 


180 


CCGGGGCTTC 


GATTTTGGCG 


GGCTGTATAA 


AATCTACCAC 


CGCGCTTCCT 


GCTGGTGCTG 


240 


CCCCTTACAG 


CGGATTGGTG 


AATTACGCAA 


GCTGCGTCAG 


CACCACCCTG 


AACTGTGGGC 


300 


GCGGCTGTGG 


GATATGGATA 


ACCGGGCGCG 


GGCGCAGTTT 


GGCGCAGGCC 


CCCTTGGACA 


360 


GTTTAAGGAG 


CGTTGGAGCA 


TTGAACAGTT 


AGACGCACGG 


TTTTCCCGCG 


AAGAAAAGGA 


420 


GGAACGCGCA 


TGAAACACCT 


GctCCGCCGC 


CTGCTGGTTC 


CGCCTTAGTT 


GCTCCCaGGC 


480 


AGCCAATGAA 


AAAAACGCAA 


AACAGAAAGA 


GGTGAGAACA 


ATCGAATTTC 


AAGCGAAACA 


540 


ACTGCTGATG 


GAAAAGCTGG 


ATAAAAACCT 


GCTGGATCAT 


GCAAAAAGCC 


TGACAGACGG


600 


GAACGGGCCG 


CAGATTGTGG 


ATGTGTATGG 


GCTGCAACAA 


CTGGCCGAGG 


TTCACTGTTA 


660 


CTTGAAAACA ACCCACCAGT 


TTGACCCTGC 


TGAAGTGGAA 


GCACTGCTGC 


AATTTGCTGA 


720 


CCCGCTGGAA 


GCTGCCTGCC 


ATTGTTGGGA 


AATGAACACC 


CATGAATACA 


GTTTCCCGAT 


780 


CTGCGACCTG 


CTGCGGGAAA 


CCAGGGCGCG 


GGAACTCTTT 


GAACTGGCAA 


AGCCGGAACC 


840


CCCGGCAAAA 


TTGTCGGTGC 


GGGAGCAGct 


TCGCGCCGCT 


GTGAAAGAAG 


CCCGCCAGAA 


900 


TCTGGCACCA 


GAGGAACGGA 


CAAAAGGCGG 


CGAAGCCCGG 


TGAAACAACC 


TGCCGCTTGC 


960 


AGGAAAGGAG 


GTGAACCCCT 


ATGAACTATG 


ACGCAAAGCA 


GCCCCCGGCA 


AACCTGGCCG 


1020 


ATTCCATCCA 


GCAGAAAGAA 


ACCGTCCAGG 


CATTATACGA 


CCTGTTGGAT 


AAAAATATGG 


1080 


CCGACTTCGA 


TGCAAGTCTG 


GCTGGCATGG 


ATAAAGCTGA 


AACCGCTGAC 


CTGAAAGAAG 


1140 


TTTCTGATAC 


GCGGGAAATA 


TACGACTTCA 


TTCGATACCA 


TTATGGCGAA 


TTTGGACAGC 


1200 


GTGAAGCAGA 


ACTTTTGCTC 


CGCATGGAAA 


ATCCCCTGAA 


TTTCATAGTG 


GAAAACTGGC 


1260 


CCTGCCATAA 


TCTTGAAACG 


CTGGCTATGA 


ATTATCTGGC 


GGATAAAACG 


ATTAAGGAAC 


1320 


CGAACAAAAA 


GGAAATGCCG 


CAGTCCGGCC 


ATGACACGAA 


AGATGCTGAA 


AAACCCTCTA 


1380 


TTTTGAAACA 


GATTAAAACC 


AACCAGGGAC 


AGATCAAGGC 


AAACAGACAG 


GAAACCCGCC 


1440 


AGCAATCTAT 


TACGGCAGAG 


CGACCCAGGG 


ACGGTGAGGC 


CCGCTGACGC 


TTTACGAGAA 


1500 


AGGAGGTAAC 


GCACCATGCA 


GGACGAAGTA 


CGCGAAAAAT 


CAGTCGCGCT 


CACCATCAAG 


1560 


GTCGGCAGGA 


CTGGCGGCAG 


GCTGACCGCT 


GAACTGCTGA 


AATGGGCCAT 


CCGGAAATAC 


1620 


CTTTCCCAAT 


CCCAAAACCC 


GAAAATCCAC 


CACGGCAAAC 


AGACGGTGAA 


ACAGCTTGTG 


1680 


GGCCAGGGCG 


CGGGTGTGCA 


GAATATTGAG 


ATTACCGGAA 


AAAATATCAA 


ATCCTTTGAG 


1740 


CGCGTGGCCC 


GGAAATACGG 


CGTGGATTTC 


GCTCTGAAAA 


AAGACCCAGC 


ACAGGGAAAA 


1800 


TACCTTGTCT 


TTTTCAAAGC 


ACGGGACGCG 


GATGCACTCA 


ATGCCGCCTT 


TTACGAGTAT


1860 
GCCGGAAAGA GCCTGAACCg


CtCCGCGCAA 


AAGAAGCCGT 


CGCTGTTAGG 


CCAGCTCTCG 


1920 


CATTTCAAGG 


AGATCGCAAA 


GAATCTTACA 


CGGGACACCG 


CTAAAAACCG 


GGAGAAAGGG 


1980


GGCGTTGAAC 


TTTGAAATTT 


GATGTAAAGA 


AGATTCTGCT 


TCCAAACCTG 


CCTTATCTGT 


2040 


TCATCTTCTG 


GTTTTTCAAC 


CGTGTGGCGG 


AGGGCTACCG 


GCTGGCGGAG 


GGGGCGGATA 


2100 


TGGTGACAAA 


GGCAATGGGC 


GCAGTATCCG 


GCCTGGGGGC 


GCTGATTTCC 


AAAAACCCGC 


2160 


TGCCCAGCTT 


CTACCCGCGC 


GACCTGCTGC 


TCGGCGCTGT 


CGTCGCGGTC 


ATTATCCGGG 


2220 


CCGTAGTTTA 


TTTCAAGGCG 


AAAAACGCCA 


AGAAATACCG 


CCACGGAGTT 


GAGTATGGTT 


2280 


CGGCTCGTTG 


GGGAAATTCC 


GACGATATTA 


AGCCTTTCAT 


CAATCCCCAA 


TTCGACCAGA 


2340 


ACATTCTCCT 


GACGCAGACA 


GAGCGTGTCA 


TGGTGGGGAG 


AAACAAAATC 


CCCAAGTACA 


2400 


ATATCAACAA


GAATGTGCTG 


GTAATCGGCG 


GTTCCGGCAG 


CGGCAAGACC 


CGCTTCCACA 


2460 


TCAAGCCGAA 


CCTCATGCAA 


ATGAACGCCA 


GCTATATTAT 


TACTGATCCG


AAAGGAACAG 


2520 


TCGTTCTCGA 


ATGTGGGAAG 


ATGCTCCAGC 


GCGGCGGATA 


TGAGATAAAA 


ATACTGAATA 


2580 


CGATTGATTT 


CAAGCAGTCT 


ATGAAGTATA 


ACCCGTTCCG 


CTATATCTAC 


TGTGAGAACG 


2640 


ACATCCTGAA 


GCTGGTCAAC 


TGCATCATGG 


AAAATACCAA 


AGGCGAGGAC 


AGCAAGGGCG 


2700 


GCGAAGATTT 


CTGGGCGAAA 


GCCGAAGCCC 


TCTACTATCA 


GGCGCTGATC 


GCCTATATCT 


2760 


GGTACGAAGC 


GCCGGAGGAA 


GAAAAGAACA 


TGACCACCCT 


GCTGGAAATG 


CTCAATGCCT 


2820 


CCGAAGTGCG 


GGAGGATGAC 


GAAAATTTCA 


AAAATGCCGT 


TGACCTGATG 


TTTGACGCGC 


2880 


TGGAAAAACG 


CGATCCACAA 


CATTTTGCGG 


TGCGCCAATA 


TAAGAAATAC 


AAAATGGCGG 


2940 


CTGGCAAGAC 


GGCAAAATCC 


ATCCTGATTA 


GCTGCGGCGC 


TCGTATGGCT 


CCCTTTGACA 


3000 


TCCGGGAGGT 


GCGGGAACTG 


ATGGAGGGCG 


ACGAACTGGA 


ACTTGAAAAG 


ATTGGCGACC 


3060 


GCAAGACGGC 


GCTGTTCTGT 


ATCGTATCTG 


ACACGGATAT 


GACATTCAAT 


TTCATTTCTG 


3120 


CGATGGTTTA 


TACGCAGATG 


TTCAACGTCC 


TGTGCGACAA 


GGCGCTGGAA 


AATGGAGGCG 


3180 


CACTGAAAAC 


CCATGTTACC 


TGCCTGCTGG 


ACGAGTTTGC 


AAACCAGAAA 


ATCCCCAACT 


3240 


TCCAGCACCT 


GATTTCCGTT 


ATCCGAAGCC 


GGGAAATTTC 


CGCGCATATC 


GTGGTGCAGA 


3300 


CGCAGAGCCA 


GTTAAAGGCT 


GTCTACAAAG 


ACCATGCGGA 


AACTATCATC 


GGCAACTGCT 


3360 


CCTGCGTCCT 


GTTTTTGGGC 


GGCAAGGAGC 


GCAgCACCCT 


GAAAGAGATT 


TCGGAAACGC 


3420 


TGGGAAAGGA 


AACCATCGAC 


CTTTTCAATA 


CCTCCGACAC 


GCGGGGAAGC 


CAGCGCAgcA 


3480 


TGGGGGTAAA 


TTACCAGAAA 


CTCGGAAAAG 


AACTGATGTC 


AACCGATGAA 


TTGGCGGTGA 


3540 


TGGACGGCGG 


CAAGTGCATC 


TTGCAGGTGC 


AGGGCGTAAG 


GCCGTTTTTC 


AGCGACAAAT 


3600 


TTGACATTAC 


CAAACATCCG 


CAATATAAAT 


ACCTGCTGGA 


TTCCAGCAAG 


AAGAACGCTT 


3660 


TTGATATTGC 


AAAATATCTG 


AGCCGGAGAT 


TGACCGTCCG 


ACCGGACGAC 


GTTTTTGAAA 


3720 


GCTGTGAACT 


CGAAGCGCCG 


CCTGCGTCTG 


ATGCACCCGC 


CGATTGACTT 


TTCAACCGGC 


3780 


GGTTTTTTTG 


TTCCTATCCA 


ACCAATCACA 


ATAAATTTAT 


GGAGGTAATT 


TATGGAATTT 


3840 






TTTGCAAGCG 


CTATTGAAAC 


CCTGAAAGTC 


CTTGTAATCG 


CGCTGGGCGC 


TGGCCTTGGC 


3900 


GTGTGGGGAG 


TGATCAATCT 


GCTGGAGGGA 


TACGGCAATG 


ATAACCGTGC 


GACACGTTCG 


3960 


TAAGTGAAAA 


GcTTGCGCAC 


AAAGTCGAAG 


GGCAATTGTA 


GCCCGAAACT 


TTGGAGAACT 


4020 


GATATTCCGA 


TGGGTGAAAT 


GAGGGGGTAA 


CGTCCCGAAG 


CGCCACCCTA 


ATACTCCGAA 


4080 


TGGCGGGGAG 


CATGCAACGG 


CTCTCTCCAA 


AAGTTTATAG 


AGCTCGGTGA 


AGTCGGCAGA 


4140 


GAGCAACCGT 


AAGACTGGTT 


TTCAGACCAG 


ACACGAAAGC 


TGTCCAGAGG 


TGGATGGTGT 


4200 


TGCCTATATG 


CCGGGGGTCG 


GATATCCATG 


GTGAGAATGT 


ATGAAAATAC 


TGACGAAATT 


4260 


CTGAATGTAC 


GAGTCTAAAC 


GTTGATGCTG 


TCGAAAGGCA 


GTACACCTTT 


CATGGTGGTA 


4320 


TCTGTGGGCG 


AGTAAAATTG 


GTTGTTATGA 


AAGTCCATGC 


ATCGTTACAG 


GCGGTGCGGT 


4380 


TATGCATGAC 


CAGATACAGG 


CTTAGAGGAA 


GAACCTAAAG 


ATATCCAATG 


ATAGGAATAT 


4440 


CGGAACGTGG 


AAAGCTGGGA 


ACACGGAGGC 


GGTTGCAAGC 


CAGTGATGTG 


GTGGATGGGA 


4500 


ATACCTTCGC 


GAGAGGGCAA 


TAACCATTCT 


TTATTCCAGT 


GAGAGTAGTG 


GCACAGTACC 


4560 


TGTGAAGCCG 


TGAATAAGTA 


AGCGGTGGAG 


GGATAGCCAC 


AAGTCGAATT 


CAGCAAAGAA 


4620 


AATGAAAACA 


CAACAAACAC 


AGCTTCGAGT 


ATGACAAAGA 


AAATCCAAAC 


CGAAGGgAGA 


4680 


GGATGCCTGT 


GTTGACTTCG 


GGGCAGCAAG 


AAAGGAAAAG 


CAAACAGAGA 


AAAATCCGCA 


4740 


ACTCAGAGTA 


TTACGACATG 


GAAGGTACTT 


TTGACAGGCT 


GTATGCAGAA 


AGTAAGAAAG 


4800 


ATAAAACCTT 


CAACCATCTG 


ATGGAAATCA 


TCGAGAGTGA 


AGAAAATATC 


AAGCTTGCAT 


4860 


ACAGAACGAT 


AAAGAAAAAT 


ACAGGAAGTG 


ATACTTCGGG 


AGTGGACAAA 


AGGACGATAG 


4920 


CCGACCTTGC 


AAAGTTAAGC 


GAGGAAGAAT 


ACGTCCGTCT 


CATACGGAAA 


CAATTCAGTA 


4980 


ATTATCATCC 


AGGACCCGTA 


AGGCGAGTGG 


AGATACCAAA 


GCCCAATGGA 


AAGACCAGAC 


5040 


CTCTGGGAAT 


TCCAACTATC 


GTGGATCGGA 


TCGTACAGCA 


ATGTATTTTG 


CAGGTCATGG 


5100 


AGCCGATATG 


TGAAGCAAAA 


TTCAGTGAGA 


ACTCCAACGG 


GTTCCGCCCG 


AACCGTTCCG 


5160 


CTGAGACTGC 


TATTGCACAA 


TGCATGAGGC 


TGATACAGGT 


ACAGCATCTG


TACCATGTGG 


5220 


TTGACCTCGA 


TATTAAAGGC 


TTCTTTGACA 


ATATCAGCCA 


TACAAAGCTG 


ATTCGTCAGA 


5280 


TATGGGCATT 


GGGAATCCGG 


GACAAAAAGC 


TGTTATGCAT 


TATAAAAGAG 


ATGCTGAAAG 


5340 


CACCTGTTGT 


TCTGCCAAAT 


GGAGAGAAAA 


CATATCCCGC 


ACGGGGTACC 


CCGCAAGGCG 


5400 


GTATTTTGTC 


GCCATTGCTT 


GCAAACATCG 


TATTGAATGA 


ACTGGATTGG 


TGGATTGCCT 


5460 


CACAATGGGA 


GGAGATGCCA 


ACAAAAACAA 


AATTCAAAAC 


AAGGAGCAAT 


GCGCAGGGAA 


5520 


CAGAGATTAA 


GAGCCACGCC 


TACAGGGCTC 


TGCGCCGGAG 


CAGATTAAAA 


GAGATGCATG 


5580 


CAGTTCGGTA 


CGCAGATGAT 


TTTAAAATTT 


TCTGCGCAAC 


ACATGAGGAT 


GCTGTCAGAG 


5640 


CATATAAGGC 


AACAGAGTTA 


TGGCTAAAGG 


ACAGGCTGGG 


GCTGGAAATC 


AGCCCTGACA 


5700 


AATCCAAAGT 


AGTTAATCTC 


AAAAGACAGT 


ATTCTGATTT 


TCTCGGGTTC 


AAGCTGAAAG 


5760 


TCCGTAAAAA 


GGGTAAGAAA 


TATGTGGTTC 


GGTCTCATAT 


GAGCGATAAA 


GCATACAAAA


5820 
AGGCGCATGA AAAAGTCTCG GAAGAAGTAA AGAAACTGGC TTATTCGTCA GATGATAATG 5880

CTCAGTTCAT GCAGCTTCAG AAATATAACT CTGTAGTTGC CGGACTTCAT GAATACTATT 5940

GTATTTCTAC AGAAGTATCT CATGACTTCA GCAGACTTGC CTTCAGTATC AACAAACAGC 6000

TCAGAAACAG ACTGAAAGGC GACTTATCCA AAAAGGGACA GCTCAGAAAT GGGTTCATTA 6060

AAGAAAAGTA TGGAGCAAGC AGACAGATGA GGTTCCTTCA TGGACGTCCg GTGGTTCCAC 6120

TGGGATATGT CCAGTCAAAG AATGCCCAGC ATAAAAGGAA ATCCATCAAT AAATATACGG 6180

TCAAAGGTCG GGAACAGATT CATAAGAATC TTGCAATCGA CACAGCTACA ATGTTATGGC 6240

TGATGAGAAA TCCTGTAAAG GGCAGAACCA TTGAATATGC GGATAACCGT ATTTCTTTGT 6300

ATGCGGCACA GTATGGTAAA TGTGCAGTGA CAGGAATTCC GATGAATTCC CACGATATTC 6360

ACTGCCACCA TAAAGTACCG GTAAGCAATG GCGGTTCGGA TGAATACGCA AATGTTATTC 6420

TTGTCAGCAA AGCGGTTCAT ATCCTTATCC ATGCTTCTTC AGAACCTACG ATTGAGAAGT 6480

ACCTGAAATC CTTAAATCTT GACAATAAAC AAATAGAAAA GCTTAACAAA CTTCGTTCTA 6540

TGGCAGAAAT GCCACCTATT ATTCTTTGAA TTTAATGCTG TAACGAAGTG TGTAAGTTGT 6600

GTTTAAAAGC CCTGGACGGC TGAATTCGAT GGAGCGCCGT GTGCGGTGAA AGTCGCATGC 6660

ACGGTGTGGA GCGGGGGAAA ATCCGGAGAT AACTTCAAAG GATTACCTAT CGCTATCTGG 6720

CGCGAAATCG CAGGGAATGA AACAACTTAT GGCCGGCGGC GGAGTTGCCC TGATTGGGAT 6780

GCAGCTTATC CCTCTGCTGG CAAACCTGTT CGGTTAAGGC GGGATAGCCT ATGCTGAACA 6840

TCCTCGACAT AATCACAGAC TGGATCAAGG AAATCCTGCG GGACTGCATC ATGGGCAACC 6900

TGGACGGGAT GTTTGACCAG ATTAACAATG AAGTCGGGGA GGTTGCGGCG AATGTGGGGA 6960

CGACCCCGGC GGCATGGAAC GCCGGGGTCT TTTCCATGAT CCGCAATCTG AGCGATACAG 7020

TCGTAGTGCC GGTTGCGGGG ATAATCTTAA CCTTTGTCCT GTGCTACGAA CTGATTTCCA 7080

TGCTGATGGA AAAGAACAAC TTCCACGACT TTGAAACTTT TGCACTGTAT AAATGGATAT 7140

TGAAGGTGTT TATCGCAGTC TATCTGGTCA CACACACCTT TGACATCACA ATGGCGATTT 7200

TTGAACTGGC GCAGAACGTA GTACAGCAGA GCGCCGGTGT GATTACCGGC TCCACCACCA 7260

TAGATTTTGC GGCGGCGCTG GGCGATGTGG CCGCCCAACT GGAAACAATG GAAATCGGGG 7320

AACTTTTCGG CCTGCTGGTG GAAACCATGC TGCTGAAAAT CACCATGCCG GTGCTGTCGT 7380

TCTGCGTCAT GCTGGTTCTG GTGGGGCGCA TGATTGAAAT TTATATCTAC TGCTCGGTGG 7440

GCGCAATCCC CTTTGCGACG ATGGCAAACC GGGATTGGGG GCAGATGGGA AATAACTATC 7500

TGCGTGGTCT TGTGGCTCTT GGGCTGCAAG GCTTTTTCAT TATGATCTGT GTTTCGATTT 7560

ACGCTGTGCT GGTGGGGCAG ATTGGGAGCG CCGACAACAT CCACATAGCC ATCTGGCAGT 7620

GTGCGGGCTA TACCGTTCTG CTGTGCTTCT CCCTGTTCAA AACCGGCGCG GTCAGTAAAT 7680

CAATTTTTAA CGCCCATTAA GGAGGTGGAA CCCATTTGAA ACAATATGAA ATCATTCTGG 7740

CCGACCCTCC GTGGAGATAC TCACAAAAGG GGGGTGCAGG GAGCGGCAGA ACGTCACTAC 7800
GCCACCATGA


GCATGGAGGA 


TTTATGCTCA 


CTGCCTATCA 


GCCGTTTTGc 


tGCGCCGGAC 


7860 


TGCGCGCTTT 


TTCTTTGGAG 


TACATTCCCA 


CAGCTTCCAG 


AAGCACTGCA 


TCTGATTCGT 


7920 


GCATGGGGAT 


TCACCTATAA 


AACAGTGGCT 


TTTGTCTGGT 


TAAAGCAAAA 


CCGCAAATCC 


7980 


CTTAGCTGGT 


TCTATGGGCT 


TGGATTCTGG 


ACGCGCGGCA 


ATGCGGAAGT 


GTGCCTGCTG 


8040 


GCTACCAAAG 


GGCATCCCAA ACGCCGCCGG 


GCCGACATCC 


ATCAATTTAT 


TATCTCACCG 


8100 


ATTGAAGCCC 


ATAGTAAAAA 


GCCCGATGAA 


GCACGGGAAA 


AAATCGTTGC 


TCTGATGGGC 


8160 


GACCTGCCCC 


GCGTCGAACT 


GTTCGCCCGC 


CAGAGAGTCC 


CCGGCTGGGA 


CGCATGGGGC 


8220 


AACGAGGTAG 


AACCCAGCCC 


CGGACTGGCG 


GAGGGGCTGG 


CCCCTGAAAA 


TCCAACGTAA 


8280 


AGGAGGGAGC 


GAATTTATAT 


GGACACTGAA 


AAACTGCGAC 


GGCAGCTTAT 


TGACCGCATC 


8340 


ACGGACAATT 


ACAGCGAATA 


CCGCGCCAGC 


CTGCTGTTAA 


AAGACCGGCA 


GGAATTGATT 


8400 


GACGGCGCAG 


GCAGGATTGC 


CGATACCGCC 


ACCGTATTCC 


AGCACCTGGC 


TGACCGAAGC 


8460 


TACAAACCGG 


AAGAACTGGA 


ATACCTTTTG 


AAATTCCAAA 


ACCCCCTGGA 


GGTGGTCACG 


8520 


GAGCATTGGG 


CAGATTATGA 


ACTGATGCCG 


GATGTGCTGG 


ATGCGCTGAT 


TGCAGACACG 


8580 


GTTGACCGGC 


AGGACGATCT 


GACCCTGTTC 


CCGCTGGCAA 


CGGACGCACC 


GGCACAGAAG 


8640 


CCGGAAAGCC 


TGCGGAAGTT 


TCTGAATGTA 


GACCTGGAAA 


CCGTCCTGCC 


CCAGATCATG 


8700 


AAGCAGAAAA 


CCGCCTTTTA 


CCAGACAGAC 


TTAAACTATG 


CGCTGGATGC 


CATGAGGGAG 


8760 


GGCGCAACGT 


CCGGCGACCC 


GGCAAAACAG 


AACTTTGTCC 


TGATTTTCCG 


GCAGAGCGGC 


8820 


GTGGAGTGCT 


TAAATGAGCG 


GGATATGTTC 


ATCGGCGGCA 


CACGCAGCTT 


TAACACACTG 


8880 


CAATTCTACC 


ACACACAGAC 


AAGGGAGCCG 


GTTCTGACCT 


ATTCGGTGGA 


ACTGACGGGG 


8940 


GACGGCAAAG 


ACGGCcTGCG 


GGGAAATCTG 


TACCAGCAGG 


ACAACCACCG 


CCTTGCGGAA 


9000 


tACGCAGGcG 


TACCGCyTCC 


CCCTATACAG 


ATGTAACGGT 


TACATTTACA 


GGAGGGAAAG 


9060 


AAGTACGGCT 


GCCGGAAAAG 


GAATTTTACT 


ATGAAAAAGC 


CGCAAGCCTG 


AAATATCTGC 


9120 


ACGGCAGCAT 


TAAGGAGGTT 


CGCAATGAAG 


CGGAGGACGA 


AACCGTGGTG 


CAGGGGGCGG 


9180 


TCAAACGGGA 


ACACGACCGC 


CGGCGGGAAA 


TGCCCAGGGG 


CTATTTTGCC 


GTCCACCTGC 


9240 


AAGCGCTGGA 


TGAGCAGCGG 


ATTCAGGCCG 


AAGCAGATCG 


GCTATCGTCC 


GCGCTCACGG 


9300 


ACTTAAAACA 


GCCCAACTCG 


CCGGACAAGG 


AGAAGTTTAT 


GGTGGAAATA 


TCCGTGTATT 


9360 


TCCTGCCGCT 


GGCTTCCCAG 


CACGATATGA 


ACCGGCTATT 


TGAGAAAGTG 


CGCGACCAGG 


9420 


TAGGGCAGAC 


GGTGTATATC 


ACTCCGCCGG 


ACGATACCCA 


CAAGCGGTAT 


TTTGTCATGG 


9480 


GGCGGGArGA 


ACGCGCACAG 


GAGCAGCCAG 


AACGCAAGCC 


CTCTATCAAG 


GCCCAGCTTT 


9540 


CTGCAAAGCC 


CGTCCCCGGC 


GATAAGCCCG 


CCACAAAAAC 


CAAAGACCGG 


GAGGTGAGAT 


9600 


AAATGCCCTT 


TGTACCCGTA 


CCCAAAGACC 


TTGCCAAAGT 


AAAAACAAAA 


GTGGCGTTCA 


9660 


ACCTGaCAAA 


GCGCCAGATC 


GTCTGCTTTG 


GCGGGGGCGG 


GCTGGTGGGA 


GTCCCCGCCT


9720 


ACATCTTTAC 


ACGCAGCAGC 


ATCGGCAACG 


AGCCGGCGGC 


TCTCCTGATG 


ATCGGCCTGA


9780 
TGCTGCCCTT


TTTCTTATTT GGCATCTATG AAAAAGACGG TCAGCCGCTG GAAGCAGTCC 


9840 


TGCGGCATAT 


GATACGCGCA AAATTCCTTT GTGCAGGGAC GCGCCCGTAC CAGACCGACA 


9900 


ATCTGTACGC 


CGCCCTTTCG CGGCAGACTG AGGAGGTATC GACGATTGAA AACAAAACAG 


9960 


CAGGCAAAGC 


AGGCCAAAAA CCTGCAAGCG GCAAAAAAAG AACTGAAAAA AGAAAAGTCT 


10020 


GACTTAAAGG 


CCACCCTGCG CCTGACCCCG GACGCAAAAC TCAACAGCCG GGACAAGCGG 


10080 


CGGCTTGCGG 


CATCGGTCAG CAAGGCAAAG CGGGACGGCA GGCTTCCCAA GACCGCCCAG 


10140 


CAGACCATCC 


GCTACCAGGA AATGTGCCGG GACGGTATCT GTATCGTGGG CGACCGCTAT 


10200 


TTCACCAAGC 


AAATCCAGTT TTATGATGTG AACTACCAGC TTGCACAGAA CGAGGACAAG 


10260 


AACCTGATTT 


TTGAGAACTA CTGCGATTTC CTCAACTATT TTGACAGCTC CATCAAGGTG 


10320 


CAGCTTTCTT 


TCCTCAACCA GCGGGCCGAC ATGGAGGAAT ACCAGAAGTC CATCCATATC 


10380 


CCGGAGCAGA 


CGGACGCATT TAACGGTATC CGGCGGGAGT ATTCGGGAAT GCTCAAAGGC 


10440 


CAGCTTGCCA 


GGGGGAACAA CGGCCTGACA AAGACCAAGT ACATCACCTT TGGCATTGAA 


10500 


GCAGACAGCC 


TGAAAGAAGC AAAGCCCCGC CTGGAACGGA TTGAAGCGGA TGTGCTGGCG 


10560 


AATTTCAAGG 


TGCTGGGCGT CCGCGCCCAT TCGCTGGACG GCTATGAACG TCTGGCAATC 


10620 


CTGCACCGGA 


TGTTTCACCC GGCGGACAAT CAGAAATTTA AGTTTGCCTG GGATGCCATC 


10680 


TGGAAAACCG 


GGCTTTCCAG CAAGGATTTC ATCGCGCCGG ACAGCTTCAC CTTCAAGTCC 


10740 


GGCCAGTATT 


TCCAGATTGG CAGGACTTAC GGCGCGGTGT CCTTTCTGCA AATCCTCGCC 


10800 


CCGGAACTGA 


CCGACCGGAT GCTGGCGGAT TTCCTCGATC TGGAAAACAG CATGGTGGTG 


10860 


ACACTGCATA 


TCCAGTCCAT CGACCAGAGC GCGGCAATCA AGAACATCAA GCGCAAGATT 


10920 


ACCGACCTCG 


ACCGCATGAA AATTGAGGAA CAGAAAAAAG CAGTCCGGGC AGGGTACGAC 


10980 


ATGGATATAA 


TCCCCTCCGA CCTTGCCACA TTCGGCAGCG AAgcAAAGAC GCTTCTGGAA 


11040 


GATTTGCAGA 


GCCGCAACGA GCGGATGTTC CTTGTGACTT TGCTTGTGAT GAACCATGCC 


11100 


AGGACGCGGC 


AGAAGCTGGA AAACAACATC TTCCAGGCGG CGGGTATCGC GCAGAAATAC 


11160 


AACTGTGCGC 


TGCGCCGTCT GGACTACCAG CAGGAGCAGG GCCTGATGTC CTCCCTGCCC 


11220 


CTGGGCTTAA 


ACCAGATCGC CATCCAGCGG GGGCTTACCA CCTCCAGCAC TGCCATTTTC 


11280 


GTGCCGTTTA 


CCACCCAGGA GCTTTTTATG AACGGTGAAG CGCTGTACTA CGGCTTAAAC 


11340 


GCCCTCTCCA 


ACAACCTGAT CATGGCTGAC CGGAAAAAGc TGAAGAACCC CAACGGCCTG 


11400 


ATTTTAGGCA 


CCCCCGGCAG CGGCAAgTCC TTTTCCGCCA AGCGGGAAAT CACCAACGCT 


11460 


TTCCTGATTA 


CCCAGGACGA CATCGCCATT ATCGACCCGG AGCAGGAATA TTCGCCCCTT 


11520 


GTGCAGCGGC 


TGGGCGGGCA GGTGATTGAC ATTTCCCCCG CCTCCGACCA GTACATCAAC 


11580 


CGGATGGACC 


TGACCCTCAA TTACAGCGAG GACGACAACC CCyTGaCyTT AAAAAGCGAC 


11640 


kTTATCCTCT 


CCCTGATGGA GCTGATTGTG GGCGGCAAGG CCGGCCTGGA GCCGGTGGAA 


11700 


AAGACAGTGG 


TTGACCGCTG CGTCCACATG GTCTACCGGG ACTACCTGCA AGACCCGAAG 


11760 
CCGGAGAAAA


TGCCGGTGCT 


GGGCGACCTT 


TACCGCCTTT 


TGCGGGAACA GCCGGAgCGG 


11820 


AAgCACAGCG 


GCTTGCTACC 


GCGCTGGAAA 


TCTACGTTAC 


TGGCAGCTTA 


AATGTGTTCA 


11880 


ACCACCAGAC 


CAACGTGGAA 


ATCAAGAGCC 


GGATTGTCTG 


CTATGTGATT 


AAAAACCTGG 


11940 


GCAAGCAGCT 


TAAAAAATTC 


GGGATGCAGG 


TGGTTCAGGA 


TCAGGTGTGG 


GGACGGGTCA 


12000 


GCGAAAACCG 


CGAAgcCCAT 


AAATCCACCC 


GGCTCTACAT 


TGACGAGATG 


CACCTGCTTC 


12060 


TCCGGGAGGA 


ACAGACCGCC 


GCCTATACCG 


TTGAGATTTG 


GAAACGGTTC 


AGAAAATGGG 


12120 


GAGGAATCCC 


GACCGGAATT 


ACCCAAAATG 


TCAAGGATTT 


GCTGGCAAGC 


CGGGAAATTG 


12180 


CGAACATTTT 


TGAGAACAGC 


GACTTTATTT 


ATATGCTCAA 


CCAGGCGGGC 


GGCGACCGGC 


12240 


AGATTTTAGC 


CAAACAGCTT 


AATATTTCCC 


CGCACCAGCT 


TGGATATGTG 


ACCAACTCCA 


12300 


ACGCCGGCGA 


GGGGCTTTTG 


TTCTATGGCA 


ATGTGATTAT 


CCCATTCGTT 


GACCATTTCC 


12360 


CCAAGGACAC 


GGAACTCTAT 


TCCATCATGA 


CAACCCGCCC 


GGAGGATCTG 


GCGGAACAGT 


12420 


AAAGGAGGGT 


AAGATTTGGA 


ACACAATCAG 


GAAATTATGG 


AGCAGTTTTT 


GAAGCTGCTT 


12480 


TCGGAAAACG 


GGcGGGCGGG 


ACAGTCAGAG 


GATCTGTCCC 


GCCTGCTGTT 


CTATATGGAC 


12540 


GGCATGAACC 


GCCAGCTTGA 


CGCGGTGCGC 


CAGGAATTGC 


AGACAGTCAA 


GGTACAGCTT 


12600 


GCACAGGCCC 


AGGACACGCC 


GCAGAAAACC 


GTCTTGCAGG 


GCATGGTGGA 


CGGTCTGCAA 


12660 


AGCAAAGTCC 


GGCAGGCGCA 


GGAAAAGCTG 


GACAGCCTGC 


GGGAAAAAAT 


CATGCAGTGC 


12720 


GCCGCCAACG 


CGGTAGAGAG 


TTTCAAGCGC 


GTGGGCGTGA 


CTGCGCTGGA 


TAAGGCGGTT 


12780 


TCCGCTATGG 


GCATCCACAA 


GACGCTGGAA 


GCGGTACAGC


AGAATATCAG 


CGGTTCCCTG 


12840 


GACGATGCCA 


GAAAGAGCAT 


TAAAAAGGTG 


GAAACCCTTG 


GCCACGAACT


CCGCAGCGTG 


12900 


GGCGGGCATC 


TGAAAAACGC 


CGGGCGCGCC 


GTGGCCGGCA 


GGGAAACCCA 


GGCCGTGGAC 


12960 


GGCGGGCAGG 


AAGGACGCTT 


CCAGGCTGCT 


TTCCTTGCCC 


CCATGCGGAC 


TGCCTATAAG 


13020 


ATGCTTTCCA 


GCATGAACAA 


TACCACCCTT 


GCCGCTATTG 


GCAACGTGGA 


ACGACTGGAA


13080 


ACAGCGGCGG AAGCgTGCGG 


GAAGCCAAAA 


AGCCCCCTGT 


CCGGCAAACG 


GAAACGCAGG 


13140 


AAAAGGAGCA 


GGTAAAGGAG 


CTGGAACATA 


AGGAAGAACC 


CACTGAAACC 


CAGCGCCGTG 


13200 


AGAAGCCCTC 


TATCCGACAG 


ACGCTGACAG 


AGAAAAGTGA 


GCCTATGGAC 


GTAAGCGATA 


13260 


TTTTGAGCGA 


CCCTGCCAAT 


ATCCAGATTT 


TTGATACGGT 


TTTAGAAGAA 


TCCTGCCGTC 


13320 


AATGGTGCGA 


CTGTATTGGC 


GATGCGCCGG 


AACGTGCCAG 


CGGCAAAGGC 


TTTGCGTCCT 


13380 


TTTTCTATGA 


GGTTTTTGAA 


CAAAAAGAGC 


AGGAATACAA 


AGCCGAGTTA 


AAGGCAGACA 


13440 


CACGCTCTGA 


AAAACCATCC 


ATCCGAAAAG 


CGCTGAACGA 


GAAAAAGGCC 


GAAGCCGCCG 


13500 


CACGGCCTTC 


CCCAGAGCCG 


GAACGTAAAG 


CGCCGGAGmC 


GCACGGTGAG 


CCGGGAACGG 


13560 


TTTCAGGCAC 


GGGAGAAAAA 


GGTTCAGAAG 


ATGGGCCGGG 


ACGGTCTGGT 


GGAACAGAAC 


13620 


CGCGCCACCG 


GCGAAGAAAA 


gCGCGTCAcC 


AGCGCACGGC 


AGACACCGCC 


TTTGACCGCG 


13680 


CCCGCCCTAC 


GGAACGGGAG 


GACATCCGCC 


GCCCGGCAGC


ACGGGGCGCA 


GACACCGGCA


13740 
AAAAGCGGAA ACAGCCCCGC ACTGCGCCGG AGATGGCGGC AGAACAGACG GAACCGCTGc 13800

TTACGCCGGA ATATCTGGCA GACAGTACTG CCGAACCGCC GCCCGATGCC CCGGCATCCA 13860 

TGCGCGGCGC GGTGGATATG CCGTTACTGG ATGCGCCCAT AGCCGAAGAA CCCACTCCCG 13920 

CGCCGGTGAG GAAACAGCCC CAAAAGCGGG GGAAAAAGGG CAAGAAGAAG CAAAGCACGA 13980 

AATTCACGGA AGACGCTGCC CGACCGGGTA GTGAGGAAAA AACGGCTATG CGCGGTGTGG 14040

AGGGCGCTCC TGCCCCAAAG CGGCGCACTT CTGAGGGCGG CTCCCGCCTG CAATTTGACA 14100 

AGGAAGAAGA ACCGGCAGAC CCCGCAGCCG ACCGCGCAgc AATCAGAAAG CGGCAGACAG 14160 

CAAAGCTGGC CGAACACGCC GCCAAACCGG AGGATaAGCT AAAATCCGAT GGCAATCCAA 14220

AGGCCGGGGa AACACGCCTG AAATTTGAGG ACACTcCCAA AGATACGGAC ATTTCCGGCG 14280

ATACTGCGCC CGATATGCCC ACCCGCCAAC AAAAGAAATA TAACAAGGCC GCCTGCCGCA 14340

CGGAACGCGC AGACCGCCGC GTGGAGCAGG CACAAATGAA GCTGCCTGCC AAACGCCGCC 14400

TGCACATGGA TCACCAGCCT GACGAAGCCA GCGGGAAGAT AAAGCGGCGG CTCCGTTTTG 14460

AGGATGAAAT CCTGCCGGAG TACCAAAAGC CCTCCCTGTC CTCCCGTGCC GGAAATGCGG 14520

CAAAAACGGC GGCAGTCTTA AAGCTGCATG GCAAACTCCG TGAATATGAG CGTGACAACG 14580 

TGGCTCTGGA ATCTGCCCAC AAAATCGAAC TGGTTGCCGA ACAGGGCGCG GGGCGCGTCC 14640

TGCGCTGGGA ACGGAACCGC CGCCGCTcCA AACCCTACCG CACTTTGCGG AAAGCCCAGC 14700

AGAAAGCGGC GCGGGCGCAC ACTGACCTTG GATGGCAGAC CGCCnTGCGG GATAACCCCG 14760

AACTGCAACG CAAAAACGCG CTGGCAAAAT GGGTGCAGAA GCAGAAAATC AAGCGGAAAT 14820 

ATGCCCAGGC CGCCCATGAA GCCAAACAGA GCGCGAAATT TACCCAAAAC GTCCTGACCG 14880 

CCACCGgCAA GATTGCCCGC GCCATCGCCC AGTATGCGGC GGCACATAAG GCGGTCTTTC 14940

TGGCGGTTGC aAtGcTGGCG cTGGTGGTGA TGTTCTTCGC CACGGGCCTG ACCTCCTGCA 15000 

cCGCCATGCT GTCAGGCTTc CAGTCCTCGT ACATTTCCGC GTCTTACATG GCTAATGAGC 15060 

AGGAAATCTG CCAGAGCGAC CTTTATTACA CGGAAAAGGA AACGGATTTG CAGATTGATA 15120 

TTGACAAAAC TGAGGAAAAT TACCCCGGCT ATGATGAATA CCGCTACAAC ATCGGTGAAA 15180 

TCAGCCATAA CCCTTATGAA CTGTTAGGCT ACCTCTCCAC GGCGTTCAAT GCTTTTACCT 15240

TTGCAGAAGT CCAGCCCGAA ATTGACCGCA TTTTTTCCCG GCAGTACACC CTCACCCGTG 15300 

AGGTGATCGT GGAAACCCGG TATGACGATG ACGGCGACCC TTATGACTGG TATGTTTTGC 15360 

AGACCACATT GGCCGTGCGC CCGCTGTCCT CTGTGTTACA GAGCAGCCTC ACCCCCGGCG 15420

AACAGACCGA CCGCTACGGC GTGTATATGC AGACtACGGC AACCGGCAGG CGTTTGGAAA 15480 

TCCCTTTGGT TTTTCCTGGC TTGGATATGT GAGCAGCGGC TACGGCTGGC GGGTACACCC 15540 

TGTAAACGGC GAAAAGAGCC TGCACCGGGG CATTGACCTC GCTGTGGCGC AGGGGACTCC 15600 

GATTCTGGCG GCGCAGGACG GGCGTGTGGT ATCGGCCGGC GATGCAGGCA GCTATGGGCT 15660 

GTGCGTGGTG ATCGAGGACG ATAAAGGCTA CCAGTCCCGG TATGCCCACT GTTCCAGCTT 15720 
AAACGTCAGC GCCGGCCAGG AAGTAAAACG CGGGGATGTG ATTGCTGCTG TCGGCAGCAC 15780

CGGCAACAGC ACGGGGCCGC ATCTCCATCT GGAAGTCATG TTAAACGGCG AATACTTGAA 15840

CCCTTATTTC TTCGTGGACA ATGGCGACGA TGGAACCGGC GCAATCCCCg GCACCCCCGG 15900

CGGGCCGGCT aTCCCgGACT ATTCCgGCGA AcCGATGGGC GATGGCAGyT TTGAAaGCCA 15960

TGCTGCGGGA ArCGGAGAAa TACCTGGGCT ATCCCTATGT GTGGGGCGGc TcCCATCCAT 16020

CCACTTCCTT TGACTGTTCC GGTTATGTTT CGTGGGTCAT TAACCAGTCC GGCGTGGGCA 16080

GCGTCGGGCG GCAGACGGCA CAGGGCCTTT ACAACCTGTG TACGCCGGTA TCTGCGGTGA 16140

ACGCACAGCC CGGCGACCTG ATTTTCTTTA CCGGGACATA CAGCTCGCCG GGGCCGGTCA 16200

GCCACGTTGG GATTTATGTG GGAGGCGGGC GGTTTATTCA CTGCGGCGAC CCGATTTCCT 16260

ACGCCAATAC GGGAAGCCCC TATTGGTCTG CCCATCTGTA CGGGTATGGC AGAATACGCT 16320

GAAACACTTT TTTCAGGCAT TTTTGCCAGA AACGGAGGTA TGATTTGAAA ATTGGACTGA 16380

TTGATGTGGA TTCCCACAAC TGGCCGAACC TGTGCCTGAT GAAGCTGTCC GCATACCATA 16440

AGGCGCGGGG CGGCCTGGTT GAGTGGTGGA AyCCCAAAAA CCACTACGAC CTTGTTTATA 16500

AAAGCCGGGT ATTCACAGAT ACCTACTCGA AAGATACGAT TGCCGTTACC AATGCCGATG 16560

CCGTCATTTT AGGCGGCACG GGCTACGACC TTGGGGAAAA CTTACCGGAT GCGGTGGAAC 16620

ACACCCGCCC GGACGCTGCA CTGTATCCGC AGTTTTCTGA TACCGCTTAC GGCTTTTTGA 16680

GCCGTGGCTG TCCGCGAAAC TGCGGATTCT GTATTGTCAG CGGCAAGGAG GGAcGGCGAA 16740

GCCGAAAGGT GGCCGACCTT TCCGAGTTTT GGGACGGGCA GAAAGAAATC AAGCTGCT 16798

(2) INFORMATION FOR SEQ ID NO: 196:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1409 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196:

ATAAAATCTA GCCGTCAATA TCATAGGAAA TTATTAGAAA ATcCCAATTC TGcACCAAGT 60

aTGTGGTTTA TcAATcAAAA ATATGGTTCA TGGaAAAACT TATTAGTTAG TCTTGGGTGT 120

GATAATGGAG AATATGGGAA ATGGGCAAAA ATATCAGAAA AAGATTTATT GAAAATAGTA 180

GAGTCTTTTA TCACGGTTGA GAAAATAACT TCTCAGCGTA TGTATGAAAA AAGATCAGTA 240

GGAAAAGACG TTCCTTCATT GAGTACATTA AAAAAGAGAT TTGGAGATAT AAGGTATCTA 300

TTTAGAAAGA ATACAGAGaA GTCTTCTTTT ACTGATTTTG aACTAATGAT TGAACTAAGA 360

AaTGAAATAG TCCGATTAAA ATTACAGGAT GATTTATCAA TGACGAAATT TCGAAAACTA 420

GTTCAATCAC CAAAGTTACC ATCAGTTGAT ACAATAATGA AAAGAACAAA TAAAAATTGG 480

GAAGAGTTGA TGACAGAAAT AGGGTTTGAC TATCGAAAAA TCAAAATCAA TAAGCAGAGA 540
AATAACTTAT


CTAAAAAGAA 


GAAAACTAAA 


TAGTAAAACT 


CTAGACTAGA 


CGTTCAAACT 


600


AAAAAGTTGA


ACGTCTTTT-TL 


TATTACCTAA


AAACGAAAGG 


GGATTTTATT 


TTATGATAAA 


660


GCCGATATTT 


AAAAAATGGC 


GATTGTTTGC 


GGCTTTAGCT 


TTATTAGGTC 


AAACAATTGT 


720


AGGGGCGATT 


GATCCTATGA 


TTGTCTTTGC 


TGATGAAATT 


GCTCATCCAC 


AAACAGTGAC 


780


GGTTGAATTA 


GATTTAGCGC 


ATCAATATGT 


TGTTGAAGGA 


ACTTTCAGTG 


ACGGTCGTCC 


840


AATGTCAGAA 


GTCACTGTTC 


CACATTATGC 


TGTGTACAAC 


GGCGTGAAAC 


AAGATGTTTT 


900 


CTGTATTGAG 


CCAGGAGTCC 


CAATTGACAA 


TGAGTTCACG 


CCGGGTTATG 


AGAAAAATCC 


960 


ATTGCCTGAT 


ATGCCAGAGA 


AAGCGAAACT 


AGTTTCCGTT 


CTATGGAAAA 


AAGCCGGTAC 


1020 


AGATGTAGAT 


ACTCATATTG 


TGGCACAGAA 


GATGATCTGG 


CAAGAAGTTA 


ATGGGTATAC 


1080 


CCTTCACTCA 


ATCAAACGAT 


TGAATGGTAG 


TGCCGTAAAT 


ATCGCAGCCA 


TTGAGGCAAA 


1140 


AATCAATCAA 


GCCATTGCCG 


ATTATCAGAA 


GAAACCAAGC 


TTCCATAATA 


GTACGGCTAA 


1200 


AATAGTATTA 


GGGCAATCGA 


CTACTATGAC 


GGATACGAAT 


AATCTGAATT 


TGTCAGAGTT 


1260 


TGATGAAGTG 


GTGGAAAATA 


CCGCAAACAT 


TGATTATCGT 


GTGAATGGCA 


ATCAGTTGAT 


1320 


AATCACCCCA 


AATGCCAGTT 


CAAAAGAAAG 


TGGTGTGctG 


ACGCTTAAGA 


AATCTGCTGG 


1380 


CACCGGAACA 


CcTGTCsCCT 


ATaAAAAGG 








1409 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9619 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 



AAATGGGAAA 


AGTCGCTCGA 


GCAATCGCTT 


CGGGTCAACA 


AAAGAGTCAA 


CTTTTCTTAA 


60 


AGGATATTGA 


AGAGGATACG 


AAAAAATTAG 


TCCGTGAAAT 


TAAACAAAGC 


GAGAAAAAAT 


120


ATCAAGATTT 


TTCCATTACG 


CAAAAAAAAT 


GTCCAGATTG 


TGGCTCAAAC 


TTACGGGAAA 


180 


AAAATACGAA 


AGATGGGAAG 


ATTTACGTTT 


GTACAAACCA 


AGAATGTTCT 


TATCGTCGGC 


240 


GAAAAGATCC 


TAAAGTATCC 


AATCACCGTT 


GCCCACAATG 


CCATAAAAAA 


ATGGTCATCA 


300 


TAGAAGGAAA 


AAATGGTCGT 


TCTTTCAAAT 


GTAAATTCTG 


TTCGATTACT 


GAAAAAATTC 


360 


CCGATAAAAA 


AGAACGAAAA 


CAAAAAATGA 


CTAAACATGA 


AGAACGTCGT 


TTGATGAAAA 


420 


AATATTCACA 


ACCAGACGAA 


CCAGAAGAAA 


GTGCTTTAGC 


GCAGGCCTTA 


AAAGCCGCTA 


480 


TGAATCAAGA 


GTAGCGACTT 


GACGAATGTT 


GGCTCTAAAT 


AATCAAGTCA 


TCCTTGTTGG 


540 


TCTTTAGTAA 


ACTTTCAAAT 


AGAAAGAGAG 


GGACACGGAT 


GCTCGCAAAT 


TTAAATCAAG 


600 


CAATGGATTA 


TATTGAAGAA 


CATTTAACGG 


AAGAAGTATC 


CTTCGATGAA 


TTGGCGAAAA 


660 


AGACAGGGAT 


TTCTGTTTAC 


CATTTTAAAC 


GAACTTTTTC 


GTTTATTACG 


GGGATGAGTC 


720 


TGGCTGAGTA 


TATTAAAAAG 


AGACCTTTGG 


CGGAAGCAAA 


TTTAGCATTA 


CTTGCTGGTG 


780 
AAAAAGTGAC


AGATGTAGCT 


TTTAAATATG 


GTTATCAAAG 


TATTGAAGGA 


TTCTCAAGAG 


840 


CTTTTCGCGA 


TTGGTCAGGA 


CAAGCCCCTT 


CTGAAGTAAT 


GAAAACACAA 


ATCCAAAAAA 


900 


CGTTTCCTAA ATTCTCTTyC 


TATATTGATA 


TTAAAGGAGG 


ACAATCAATG 


GATGTTAAAT 


960 


TGATTGAAAA 


ACCaGCGTTT 


CAAwTAGTCG 


GAGTTAGTCA 


GAAGGTTCCC 


TTGCAGTATC 


1020 


aAGGGGAAAA 


TCAAGCAATC 


ATGGAATTAG 


CACAGCGCAT 


TACACCACAA 


CAACGAGCGG 


1080 


AAATGCATAC 


ATTTGATGAC 


ATATATCCTC 


ATCAAGTAGT 


AAATGCTTCT 


TTTGACTTTC 


1140 


AAGAGGGACG 


AACAACAGAA 


GGTGGAGAAA 


TGACACACAT 


GATTGGCTTT 


GCCACTTCGC 


1200 


AAGAAAATAC 


CTATGAAGAT 


TTGGAGCAAC 


TAAGCGTACC 


TGCTCATACA 


TGGGCGGTTT 


1260 


TTCCAAATGA 


AGGTCCTTTC 


CCACAAACTT 


TACAAGAAAC 


CTGGGCAAAG 


ATATTCTCTG 


1320 


AATGGTTGCC 


TTCATCTGGT 


TACCAAGTCG 


TTGCAGCACC 


AGAAATTTCG 


TTTACGCAAT 


1380 


ATCAAGGACC 


AGCAGAAGCT 


AAGTATAGTG 


AAATCTGGCT 


TGCTGTTACA 


GCTACTAAAT 


1440 


AAAGAAAACC 


CACCATTGAA 


AAATGGTGGG 


TTTTTCCGCC 


AAGAAGGAAA 


AAGTTkGGTA 


1500 


TAATAAACGT 


GAAGAAAAAA 


ATTAGACCTT 


CTAAACTGAA 


ATCTAGCTAT 


GGATAAGTGA 


1560 


TGCGAATACG 


GAATCATGGA 


GAAAAAATAA 


TTCTCCGAGG 


TTTTAGAGTC 


ATGTTGTTTA 


1620 


GAATGGTACC 


AAAACAACAA 


ACCCTTGTTC 


AGCATAATGA 


CTATAGTTTT 


AGAGTCATGT 


1680 


TGTTTGGAAT 


GGTACCAAAA 


CAAACTTTTT 


TGATTTGGCT 


TTTTCTCCCC 


TGTTTTATAG 


1740 


CCATGTTGTT 


AAAAAACAAA 


CTATCACCTC 


AAGCAATCCG 


TAATTTTCAT 


CACAAGATTA 


1800 


GAATTTTTAT 


CATCAAAGAC 


GAGCTGCTTC 


AATTTTCGAA 


AACTAATCAA 


GTGATAAGTC 


1860 


AAAATGTTGA 


TAGATTGAGA 


TTAGTTCTTC 


CTTTTTTTAG 


GAAGGGCTTT 


TTTTATTGAT 


1920 


AGAAGAACGG 


GAAGAAATCT 


TGGTAGCACC 


TAAAAAAATT 


GTGAGAAATG 


CTTCTGTATT 


1980 


GTTTGGTTTC 


TGCCTCAGAA 


AGTTGTATAC 


TTAGATTGGA 


ATTATTCTAT 


ATTACATGCA 


2040 


TTTTCACACT 


TTTTGGAAAA 


GTTCACTTGA 


ATTTTCTTTT 


AGTTTCGTAG 


ATAAAAGGAG 


2100 


TCATCGCTAA 


TGGAAATGAA 


CAATTCAGGT 


AAGCTTGTTT 


CTTTGTGTGG 


AGGCAAATCA 


2160 


GGGAGGAAAT 


AATATGTTTG 


ATATTGTAAC 


ATTGGCGAGA 


ATCCAATTTG 


CGATGACAAC 


2220 


AGTTTTCCAC 


TTTTTCTTCG 


TACCGTTTTC 


AATAGGATTA 


GCACTTGTTG 


TAGCAATTAT 


2280 


GGAAACAATG 


TACGTTGTGA 


AAAAAGAGGA 


ACGTTATCGT 


AAAATGGCCA 


AGTTTTGGGG 


2340 


AAATATATTC 


CTATTAAgcT 


TCGCTGTTGG 


GGTTGTAACC 


GGGATTATTC 


AAGAATTTCA 


2400 


GTTTGGGATG 


AACTGGTCTG 


ACTATTCTCG 


ATTCGTCGGA 


GATATTTTTG 


GGGCACCGTT 


2460 


GGCGGTAGAA 


GCGCTCTTAG 


CTTTCTTCTT 


AGAGTCAACA 


TTCTTAGGTT 


TGTGGATCTT 


2520 


TACTTGGGAT 


AAAATGAATC 


CAAAaGTtCA 


CGTAACCTTT 


ATCTGGTTAG 


TCGTATTCGG 


2580 


TTCAATGATG 


TCTGCTTTCT 


GGATTTTAGT 


TGCGAATAGT 


TTTATGCAAC 


ATCCAGTAGG 


2640 


TTATGTCATC 


AATAATGGCC 


GTGCTGAAAT 


GGTGGACTTT 


GTAGCTGTCA 


TCAGCAATCC 


2700 


AAAAGTTTGG 


TATGAATTTT 


CACATGTTAT 


CATGGGCGCC 


TTTACATTGG 


GCGGCATGGC 


2760 
AGTAGCTGGt


ATGGCTGCTT TCCAAyTGTT GAAAAAACGG GATATTTCaT TCCACAAAGC 


2820 


ATCTATGCGC 


ATTGGCTTAT GGGTTACTTT GTTCGGTTCA ATCGGCGTTT TATTAGCTGG 


2880 


CGACTTGCAA 


ATGAAAGCTC TAATTGAAGG ACAACCAATG AAATTTGCGG CGATGGAAGG 


2940 


GGCCTATGAA 


GACTCTGATG ATCCTGCTGC TTGGACATTG ATTGCTTGGG CAGATGAACG 


3000 


GGAACATAAA 


CAAGTTTTTG GTATCGATAT TCCGTATATG CTAAGTATTT TATCGTATAA 


3060 


CAAATTATCA 


GGTTCGGTTG ATGGAATGGA TACAGTCAAC AAGCGTTTGG TAGAAAAAtA 


3120 


TGGGGATGAT 


AAAAATTATT ATCCACCAGT TAACACTTTA TTCTGGAGCT TCCGTATCAT 


3180 


GGCTGGTTTT 


GGTGCCTTAA TGTTATTGGT TGCTGCCTTA GGACTATTCT TTACAAGGAA 


3240 


GAAAAAACCA 


TCGCTGTACG AGAAGAAATG GATGCTTTGG ATTGTTGCTT TATGTACGTT 


3300 


CGCACCATTC 


CTTGCCAATA CAACAGGTTG GTTAGTCACT GAATTAGGTC GTTATCCGTG 


3360 


GACTGTCTAT 


GGATTATTTA CGATTGAACA AAGTGTTTCA CCAAATGTAT CGGTTGCTTC 


3420 


ATTAATTACG 


TCAAATGTCA TTTATTTCTT ATTATTTGCT GGTTTAGGTA GTGTCATGGT 


3480 


TTACTTGGTT 


ATTTTAGAAT TACGTAAAGG ACCAGACTAT GAAGCGAAAA AATTAGCGAA 


3540 


AGAAAATGAA 


CCAGCGTTGG ATCCATTTGA TAAGGGGGTA TTTGGCGAAT GAGTACACTA 


3600 


CAATTATTAT 


GGTTTGTCCT TATCGGCGTC TTGTTCTCAG GTTTCTTCTT CTTAGAAGGA 


3660 


TTTGACTTTG 


GTGTTGGAAT GGCCGTTCAA ACGTTGGCAC ACAATGACGA CGAAAAAGAT 


3720


CAAGTCGTTG 


CAACAATTGG ACCTGTTTGG GACGGAAACG AAgTTTGGCT ATTAACAGCG 


3780 


GGCGGCGCAA 


TGTTTGCCTC TTTCCCATAT TGGTATGCTT CATTATTTAG TGGCTACTAT 


3840 


TTAATTCTAT 


TTACGATTTT ATTTGGCTTA ATTATTCGCG GAGTTTCCTT TGAATTCCGC 


3900 


CATAACATGC 


CGGAAGGTAA ACGTCGTCGG ATGTGGAACT GGACACTATC AATTGGTAGT 


3960 


TTCCTTGTAC 


CGTTCTTCTT TGGTATTTTG TTTATCAGCT TAGTCCAAGG GATGCCTTTA 


4020 


GATGCAAACG 


GCAATATGCA TGCGCAATTT ACGGATTATT TTAATCTGTT TTCAATTGTT 


4080 


GGTGGAGTAG 


CCTTAACATT ACTTTGCTAC TTACACGGCA TGAACTATAT TGCTTTAAAA 


4140 


ACAGAAGGCC 


CaATTCGTGA ACGTGCTCGA AATTATGCAG AAATCTTATA CGGCGTTTTA 


4200 


TATATCGGTT 


TAGTTGTTTT TGCAGTGTTG ATGTACTTCA AAACGGACTT TTATGAAAAG 


4260 


AACTTTGCGG 


TCACTTTAAT TTTAACGTTA GCAATTGTGG TCTTAACGGT CATTGCTAAT 


4320 


GTTGGGGTGT 


TTAAACGCAA AGAGATGTTA GCTTTTCTAG CGAGTGGTTT AACATTAGTT 


4380 


GTACTAGTTG 


CTCTCTTGTT CAGTGGCTTG TTCCCACGTG TTATGATTGG TAGTGAAGGT 


4440 


TTTGATTTGT 


TGATTAAAGA TGCAACAAGC ACACCATATA CACTAAAAAT TATGACGTGG 


4500 


ATTTCTTTAA 


GCATTTTACC ATTCGTGTTA GCTTATACTG CTTGGTCCTA TTATATTTTC 


4560 


AGAAAACGTA 


TTTCACAGAC AGCCGTACCA GAGGGGTATT AAATCTATGA TTGATAAAGA 


4620 


CATCTTACAG 


ATGCCTAAAA TCAAAAAAAT TCTTGTCTTG CTTGCAGGTT TTTCTTTCCT 


4680 


GCAAGCAGTC 


TTTATAATTG GGCAAGCTTT TTTTCTTTCA AAAGCCATTG TCGGCCTTTG


4740 
GAGTGGCGGG


CATCTTAACC 


AACAACTACA 


GAGTATTTTA 


CTCTTTTTTA 


TTTTTTATCT 


4800 


TGGTCGACAT 


GTTATTACGT 


ATTTTCGTGA 


GAGAATGCTC 


GATACATTTA 


GTTATGAACG 


4860 


CTCGAAAGAA 


TTACGCGAAC 


AATTATTAAC 


CAAAATTTTC 


CGTTTGGGGC 


CAAATGTTGT 


4920 


TCAAAAAAAT 


GGAACTGGAA 


ATATGGTTAC 


CATGGCTTTA 


GAAGGCATCA 


GTGAAACCGA 


4980 


AAATTATTTA 


CAATTGATTT 


TGACAAAAAT 


GATGAATATG 


ATGATTATTC 


CTTGGATTAT 


5040 


TTTGGCCTTT 


GTTTTTACAC 


AAGATATTCG 


TTCAGGCATA 


ACATTGCTGA 


TTGTTTTTCC 


5100 


GATTATTATT 


ATTTTTATGA 


TTGTGTTAGG 


CTATGCAGCA 


CAAAGTAAAG 


CGGACAAGCA 


5160 


ATATGCGGCG 


TTTCAAATGC 


TTTCCAATCA 


TTTTATTGAT 


TCCTTGCGAG 


GAATTGACAC 


5220 


GCTGAAACTG 


TTTGGGTTGA 


GCAAAAAATA 


TGCTGGGAGT 


ATTTACCATA 


CAAGTGAACG 


5280 


ATTCAGAAAA 


GCGACCATGA 


GTACATTGAA


AATTGCTATT


TTATCAACCT 


TTGCGTTAGA 


5340 


CTTTTTTACG 


ACATTATCGG 


TAGCTGTTGT 


GGCAGTTTTC 


CTTGGATTAA 


GTTTATTAAA 


5400 


CGGCACGATT 


TTATTATTTC 


CTGCCTTGGT 


TACTTTAATT 


TTAGCACCTG 


AGTTCTTTTT 


5460 


ACCGATTCGT 


GATTTTTCAA 


GTGATTATCA 


CGCAACCTTA 


GATGGAAAAA 


ACTCTTTTCA 


5520 


AGCGATCCAA 


GCGGTATTAG 


CATTGCCAGA 


AGCAGAACAG 


ACCGATGTGC 


TCACGTTAGA 


5580 


TGATTGGCAT 


CAAGAAAGTC 


AGTTGTCAGT 


CACAGCGTTA 


AATTTTTCCT 


ATGAAGATGC 


5640 


AACGCAAGAA 


GCGATTCAAT 


CGTTGCAATT 


CTCTGTTGAT 


GGTCGCAAAA 


AGATTGGAAT 


5700 


TATCGGGGCG 


AGTGGTTCTG 


GTAAGTCAAC 


GTTAATTAAT 


TTATTAAGCG 


GCTTTTTATT 


5760 


ACCAACAGAA 


GAAACGAGTC 


AATTGGCGAT 


AAATGGCCAA 


ACGATTCCAC 


ATTTTTTACA 


5820 


AAAAGACTGG 


CAAAAACAAA 


TCCTTTATAT 


TCCGCAAGCA 


CCATATATTT 


TCCAAGATAC 


5880 


TTTGGCAAAC 


AATATTCGCT 


TTTATACACC 


TGAAGCAACA 


GATGAGGCGA 


TTCAGCAAGC 


5940 


GATTCAGTTA 


GTTGGCTTAG 


ACGAGCTGGT 


TAAAGACCTT 


CCCGAAGGAA 


TTCATACCTT 


6000 


GATTGGGGAA AGTGGTCGAA TGTTAAGTGG GGGACAAGCG CAACGAGTAG CGTTGGCCCG 6060

CGCCTTTGTC GATCAAAAAC GCCACGTCTT GCTGTTTGAT GAGCCGACAG CGCATTTGGA 6120

TATTGAAACA GAAGTTGAAA TGAAAGAGCG CATGTTGCCA TTAATGAACA ATCATCTTGT 6180

CTTTTTTGCG ACACATCGTC TGCATTGGAT GGAAGAAATG GATTACATTT TAGTGATGGA 6240

AAAAGGTCAA TTAGTGGAAC AAGGAACGCT AGCAGAGTTG ATTGAAAAAG ATGGCTACTA 6300

TGTTCAATTA ATGAAGCAAA TGAGAGGAGG ACGCCAATGA TGGAAAAACA GCCGACAACG 6360

AAAGACGTTT GGAAAAACGA TCAATGGGTT CGTCCTTTCT TAAAaCGCTA TAAAAAAACC 6420

TTGTATTTTG CCTTACTTTT AGGCTTCTTG ACGTTCTTTA GCGCAGGGGC ATTAATGTTT 6480

ACGTCTGGTT ACTTAATCAG TCGTGCGGCT TCCTTACCAG AGAATATTTT ATTAATTTAT 6540

ATTCCCATTG TTTTAACAAG AGCATTCGGG ATTGGACGTC CTGTTTTTCG TTATGTGGAG 6600

CGTCTAACGA GTCATAACTG GGTGTTAAAA ATGACCTCTG ATCTGCGCTT GAAACTTTAT 6660

AATGTACTAG AAAAAGATGC GATTTTCTTT AAGACAAAAT ATCGCACTGG GGATATCTTA 6720

GGTTTGCTTT CGGAGGATAT TAATCATATC CAAAACTTGT ATTTACGAAC GATTTTCCCA 6780

ACCGTTATTG CTTGGATTTT GTATATCTTT TTAGTGATTG CATTAGGTTT CTTTTCTTGG 6840

TGGTTTGCTT TATGCATGTT ACTGATGTTA GGAGTGGTTG TTTTTCTATT ACCACTTGTT 6900

TCAGTTCTAG TGAATGGTGC GCGTCAAGAA AAACATAAAT ATGCCAAAAA TGAGTTATAT 6960

CAAACACTAA CCGACAATAT TCTAGGTGTG TCTGATTGGG TATTTAGTCA ACGAGGCTCT 7020

GAATTCGTTG CTCGTTATGA AACAGATGAA GCAAACGTTC GTGCATTAGA TGAAAAAATG 7080

AAGCAATTTA ACCGTGGCCG AGACTTTGTT TTACAACTTC TGTTTGGTGT GATTGCGATT 7140

GCTGTCTTAG CTTGGACAAG TGTGCGGTTT CCAGGTAATC ATGGGGGCGC AgCcAATTGG 7200


ATTGGTGCCT TTGTTTTAAC CGTTTTTCCG TTAATTGATG CCTTTGCTCC GCTACCTGCT 7260

GCAGCACAGG AAACAACGAT TTATAAGGAT TCCATTCGTC GTTTTAATGA GCTTCCTGAA 7320

GGGGAAGATG ACTCAACAGA AGCGCCTGTT CAGCCAAATG GCACCGGTTT ATCGATAGAA 7380

CATTTGTCTT TtGCATATGA AAATCAAGCG AAAAAGGTAT TAAATGACTT ATCGTTAACA 7440

ATTCCTGAAA AACAAAmATT AGCGATTTTa GGGCGCAGTG GTTCCGGAAA GAGTACGTTG 7500

GCTTCGCTCA TTCGTGGTGA TTTACGGCCT ACTTCGGGAG AAATCCTTTT AGGAAATATC 7560

CCAACTGAGG CATTTGGTGA GACGATGACT GAATATATTG GGGTTATGCA TCAAGCACCG 7620

TACCTATTTC GAACAACCAT TTTAAATAAT ATTCGGATTG GTAGAGAAGA AGCAAGCGAA 7680

CAGaAGTTTG GGCGGTTTTA GAAAAAGTTG GCTTAAAAGA AATGGTTGCC CAATTACCAG 7740

AAGGTTTACA AACAATGGTG GATGAAGCAG GTTTGCGTTT TTCTGGTGGC GAACGACATC 7800

GCTTGGCGCT AGCGCGGATT CTTTTGCAAG ATACACCGAT TGTGTTATTA GACgAmCCAA 7860

CTACTGGTTT AGATCCAATT ACAGAACAGC AATTGCTGGA AACCTTTTTT GAGGCATTAA 7920

AAGATAAAAC CGTCATCTGG ATTACCCACC ATTTGCAAGG CGTAACTTTG ATGGATCAAG 7980

TGATCTTTAT TGAAGATGGC CAATTAGAGA TGAGTGGTAC GCCAGAAGAA TTACTTGCAA 8040

CCAATGCGCA TTATCAGAAA CTTTACCGAA TTGATCGTGG CATTAGTTCT TTCGAGGAAT 8100

AAAAAAACGA CTGGTTGTTG ATTAAAGCAA ACAATCAGTC GTTTTTCTAT TTCAATTAAT 8160

GGCAATTCTT CTATAAAACG TTTAAAATGA ACGATGTAAT AAGAGCTCAG TTAAGGAAAG 8220

TAAGGTTTCT TTTGCTGTGC CATCAgGCAA TTGTTGAATA TCAGTGATTG CTTTGTTGGy 8280

TACTTTTTTA GCAAAAGCTT GTGCTTCTGT AACGCCATGA TaGTCATTGA CTAGCGCCGC 8340

AACTTCAGCT GCCTCTTCTA AGGTAATGGC GCGTCCTTTA TCCAAATAAG GGCTGAATAC 8400

GTCAGGTGCT GCTTGATAAG CGAAGAGAAG AGGTAACGTA TAAACACCTT GTGACAAGTC 8460

TTCTAAAGCG GGCTTTTTCA AGGTTTCCGT GTCGGCAGTA TAATCTAAAA TaTCGTCATA 8520

GaCTTGAAAA GCAATGcCAA TATGACGGCC AATTCGTTTA GCTAAACGTT GGACTTCTTT 8580

TGAACTATGA CCGAAATAGG CCCCTTCAAG ACAGCTTAGT GAAAAAAGTT CGGCTGTTTT 8640

GcCGTTGACA CTTCTTAAAT AAGCAGGAAT CGACATGCGG CGGTCAAATC GATGACTCAT 8700
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TTGATCGAGT TCACCAAGTA ATAACCGTTT CATTCCTTGT GCATTGATTT TCATAAATTC 8760 

AGAGCCATTC ATGGCATCTG CAATCAAAGT AAAGAATTCA GTGAATAGTA AATCACCTGT 8820 

GTAAACAGCC ACGTCTTTAC CATATTGTGA TTGAATAGTC ACAGCGCCTC GGCGCAATGG 8880 

AGAATCGTCA ATAATGTCAT CGTGAATCAA CGTAGCCATA TGTAGGATTT CCAAAGATGC 8940

AGCTATTTTT AATAGTTGTT GTTTCTCTTG TTTTTCTTCA TCACCCAGTT GAGCAAAAAG 9000 

AAAGAAAAAA GCGGGGCGTA ATAATTTGCC GCCAGAACGA GCCAAAGTTA CTAAGGCTTC 9060 

TTCAATGTCT TCGTTGCGTG TATGCAACGT TTGTTCAATT AATTCACATG TTTCCGTTAA 9120 

AGAACGTTCA ACGTCAGGaA ACGACTTCCA AAATGTGTTC ATAAATATAC TACGATCCTT 9180 

TCAGTGGTTC AATACTATTT GATTGTATAC GAATTGGCGA CAGAAATACA GTAATTAAGT 9240 

TTTAAAAGTA ACAAAGTTTG TAAAAAAGTA GCGGAGGGAT AGTTTGAATC GAGAAATATT 9300 

TTTTGAATTA GTAGAATTAA AAGCAAAAAC AGCTAGTGTT TTGCCATTTT TGTTAgGAAT 9360

TkGTTTTAGT TGGTACCAAT ATGGGCGGCT TCATGTTGGT TATGTTTTAA TTTTTTTCAT 9420 

TGCGATGTTT ATTTTTAATA TGGCGGTCGA TATTnTAGAT AATTATAATG ATTACCATCA 9480 

TGCCACAGAA GTTCATGATT ATAAGGGAAA AAACGGAATA TTATCGGTCG AGAAAATTTA 9540 

TCTTTGCCAT TAATACGCAA AATGATnTTT TGGGATGATC CCGGTTTCTG CTTTAATGGG 9600 

GATTGGGGTT GCCATTnAA 9619 
(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1521 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:

GCTCGCTTCC AATAGAATAT TTAAAAAAAT TTAAAAAGTC TACATAATTC CAAACAAGAT 60 

TACCCTTTCA TTTTACACGT GATAAAATAT TTTTGCAAGA ATTGTCCTCA TTCTGAAGCA 120 

AAAAAATTCT CTTCTCTCAA ACTAATGTAT CCGCTATCAC CAAGAATAAT ATGATCCAAC 180 

AATTGAATCC CCATCATTTC GCCACATTCT TCCATTCGTT TGGTAAATTG GATATCTTGT 240 

GGAGACGGTG TCGGATTGCC AGATGGATGA TTATGAGCTA ATAGTATGCG AGCAGAAGAA 300 

TATTTAACCG CTTCCCGAAA AATTTCGCGT GGATGAGCAA TACTTTGATT CAGTGATCCT 360 

TTGAAAATCG TTTTCTGTTG GATAATGTCA TTCTTGGTAT TCAAGTAAAT ACAAATTAAA 420 

TGTTCTTGTT GCAGCCCCTT CATCTCTTGG AGTAATCGTT GAGCGACTTG TTGACTGGAT 480

GTAACTTTGC CAAATTTGAT TTGACTACTT TGaTAAATTC GACACCCTAA TTCAATGGCC 540

GCACGTAATT CAATGGCTTT CACTTGGCCA ATGCCTTTAA TAGCCATTAA CTCGTTCAAC 600 

GTGGCTTGCT GCAAATAAGA AAGTTGCTTA AACTGTTTTA ACAGCGTTGC GGCAACAGTC 660

ATCACGTCTG CTTCTTTTGA GCCAGTACGT AACAGAATCG CCAACAACTC TTGATTCGAT 720

AATGCTTTTT CTCCTATCGC CAAGAGCCGT TCTCGTGGTA AACAATCACT CGGCATCTCG 780

CGAATAAACA GATCCGATAC TTGCATAAAA AAACCTCCTT TGCTAGTCAT TAATAAACTA 840

CGCAAAGGAG TGTTGTTTTT ATTATTTTAA ATAATGTGGC GCTTGATGTA AACTCTCCAA 900

GACATGTAAT GTTTTCTCTT GGATAACTTC AGTTGGTTGC AATAAATCAG GTACCATGAT 960

CACTGGAATG CCCGCACTAT GAGCCGCACT TACCCCATGA AATGAGTCTT CGAAAATCAA 1020

GGTTTTCGGG GCTTCTGTTC CTAAAAGTTG ACGAGCTTTT TGAAAAATTT CTGGGTCCGG 1080

TTTTGCTCGT TTAACATCCT CTGCTGAAAC AATACCTACA AAACGATCCT GAATTCCAGC 1140

ATGGCTTAAT AACATTTCAA TTGCTGGACG GACGTTACTT GAAGCAACTA ATCGAGGAAT 1200

TTTTTGATCA TCTAAAAAGT CGAGGAATTC CACCACCCCT GGTTTTAAAG GGACGTTACC 1260

AGATCGAAAC TCTTGCAAGG TATCATCATA AGAACGGCGG ATAAATTCTT CCACTGTGTC 1320

ATGACCATAG GATGCATAAA TACGTCGGTA ATTCTCTTGT ACTTCTTCAT CGGAAATCCC 1380

AACATAGTCT AAATAGACTT CTTTACTATA TGGTAACCCC ATTGCATCTG CTACTTTTTG 1440

AGTAGATGTA TAGTAAATCA ACTCCGTATC AAAAAGCAAC CCATCCATAT CAAAAATTAC 1500

GCCAnCAATT nCTnCATCAT T 1521



(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 



TCACCTAACT TATTACAAAG AAATAATAGT CTTTTAAGTA ACTCTTTGTC AAGACATAAG 60

CAGAAGTCAA AACAGTTCTA TTTACAAATG AATGCGACGA ACCATTTCTT CTGTTATTTG 120

TTCTACAATA TCAATGGTTG TTTTTGGCGA ACGTTGATTT CTGGGGATAT CGATTGAGTA 180

AATTTTGGCA ATACGACCAG GATTTGCATC CATTAATACA ACCCGATCAG AAAGTTGAAC 240

TGCTTGTTGA ACATCATGCG TGATCATCAC AATTGATTTA GTCGCAATTT GATTGTTTAG 300

CCAAATTTCT AATAAATCGT TAGATAATTG GCTGGCTGTA ATGATATCTA AAGCTGAAAA 360

AGGTTCGTCT AAGAGCAATA AGCTAGGTTC GATGGCTAAA GCACGAGCAA AGCCTACACG 420

CTGTTTCATT CCACCAGATA ACTCTCTAGG ATAAGCTTTT TCAAAACCAG TTAAACCAAT 480

CATTTCAATG AGATTACTGA CTCTTTTATT TATTTGTTCA CGGGAAATTT CTTTACGGTT 540

TTCTAAGCCA AAGCTGATAT TTTTTTCAAC TGTCAACCAA GGAAAAAGGG CAAAATGTTG 600

AAAAACCATG CCGATATTGG CATCAGGCTC ATTAATGATT TTATCTTCAA AAGTAATTTC 660

ACCTCTAGTT GTACGAAGAA GTCCAGCAAT ACTTCTGAGC AACGTTGATT TTCCTGTTCC 720

TGATTGTCCG ATAATACTTA AAAATTCATT CGTTTTAACT GTTAAATCAA TATTTTCAAG 780




GACTTTTAAA TGTTCTGAAT GTCCTTCAAT TTCAAATTCT TTATATACTG ATTTTATGCT 840 

TAATAATTCT TTATTTTCCA TCGTCTTATT AGATAGGTCA AGGGAAATCA CCTATCGTTC 900 

TCCTCTCTAG TTTATCTTAA AAATTTAGTC GAGTTGAGTC GTCCAGCTAT TAATAGTGAA 960 

ATTTTTGTTT TGCTGCTCGA TATAAAGGTT GCCAAACAAA GGCGATACAT AAACCAACAA 1020 

CGAAACACAT CGCTAAGGTT CCTAAGGCAG ATTTATCTTT TACACCATCG TTTGCCGCAA 1080 

TATATTGTCC TAGGCCAGTA ACATCTATTG ATTTTTTGCC CCATTGAATG CTTTCTGCTG 1140

CGATATCAGC GTTCCAAGCA GCACCAGCAG CGTTAATAAT TGCTGTAATG AGGTAAGGTA 1200 

AAATAGAAGG GATCAAAAAT TTTGTCCACC ATTGCCATTT AGATAATTTA AAAAGTTTGG 1260 

TCACGTCACG AATATCATCA GGGATGGCTA AGTAGCCAGC AATTACATTA AAGAAAAAAT 1320 

ACCATTGGCA ACCAACCATG ATGAGTGGTA AAATCCACCA TTCTAATTTG TTGaAGCCCA 1380 

TACTAATAAT CAACACAATG AATGGAGTAT AAACATTTGA CGGAATGGAG CCTAAAATTT 1440

GTCCAATCGG TTGAACGATA CTTAAGCGCC GATTATCATT CGCAACCCAG ATTGCTAAGG 1500 

GAGTAAAGAT AATGCCAGCA ACGACCATAG CCAAGGTAAC CCGTAGCGTA GTATAAAGGG 1560 

TTAATCTAGG AATCATGCTG AAATCTTGAT GGGGCAAGAA ATGATAGAGC TTAATAGAAC 1620 

AATAAACCCC AATAATGGTT AAGCTGACAT ACCATAATGT TTTGAAGAAA TGGCGTAATT 1680 

TTGCTAGTAA TCGATAGAGA TAGTCTAGCT TGAAAAAGTA CCAAACTTTT GGTAAATCGT 1740 

AAATCCAAAA ATGACTGAAT TTCACAAAAG CTTTTGAAAA GTATTTCGTT AAAGTTGCTT 1800

TTTTTAATAA ATGATAAATG ATGGATTTAG GTGGCGTTGT ACTTTCGACA TCTGTTTCAT 1860

ATTTAAAATA ATAAGTGGCT CGAACTAAAG GTTGAAAAAC TAAAAAGTTT AAAATCACCA 1920 

CATTGATTAA TAAGGCAAAA ACAGCCCATA AACAGGATTT AAAGTCGGCT CGATTTAATG 1980 

CTACTTGGAT ATAAGAACCA ATTCCTGGTA AATAAAGGTT GGTTTCTTTA GGAAAGGCAA 2040 

CAGAAGCTTG CTCGCTGGCC ACCAAGGCAA ACCAAGCCGC AGTTTGAGAA ATAATAACGT 2100 

TCCAAAGTAA GCCAGGAATT GAGTAAACAA ATTCAAGCCG CCAAAATTTT TCCCAAGCAT 2160 

TGTACTTGAA CTGATCGGTC ACATTGACTA GATCTGAAGG AATAACTTCC ATTATTTGAT 2220 

AAAGAGTCAA CATGATGTTC CAAGCTTGAC CTGTGAAGAT GGCAAAAATA GCAACCGCTT 2280 

CTGCGCCCAT CACATTTCCT GGAAATAAAC CAAGTAACCA AGCAGTTGTA AAGGTCAAAA 2340

AACCTAGCAA TGGAACAGAT TCAAGGAAAT TAACTAATGG TAAAATGACT CTTCGTGCCG 2400 

TTTTATATTT TACGGCTAAA ATACCAAAAA CAAAGGAAAA CAGGAAGGAC CAAGCCATAC 2460

CCACAATCAT TCGAAACATG GTTCGAATTG AAAAGTAAAA AAGATTTGAT GGAGATAAGA 2520 

AATCATCTGT TTTACTTGGT GTTGCGAGGA ATTCTTCTGA GGGAAGGGTG TGGAGAGGGC 2580 

GATTCATTTT TAGCCAAGAG AAGACGATGA TAAAAATGAA AATGCCTAAA ATGATGAGCA 2640

AGGTAAGATT TCCGATACTT TTAGCAAGTA GGGTATTTTT CTTAGATGAT GCAGTGACAG 2700 

CTTTCAATTT AAAAACCTCC AATTAATCAT ATTAAAGAGG TCGCCGAATA AAAGAAGAAG 2760
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ATTATAGGGT AATCACAATA ATTGAGCTTT GGCTTTCTGA GACGTAAGAA AAAAATTCTA 2820 

GTTATCTTAG GAAAGCCTTA ATTCGACAAT CCCTGCTTTA CTCGTTACTG TCTCTCGACA 2880 

TTTCCGAGCG ATAACCTGTG TTCTTCAGAG TAACCTCACC TAATATGATT CTATTTGATT 2940

TACATGCTTA TCATACTCGT GACTAAAAAT AAAACAATTT CTAGCTGGTT ATACCTGAAT 3000 

TACTGTTTGT TGTTATTCTA TTCaATGATT CTTAAAAAGA TTGCCCaCAT TCTGTTTTTT 3060 

TAGGGAGAAA TTATCAGATG AGTATGTTAT GTAACGCAAG GTGAATAGTA TAGGATGTTG 3120 

TATTTTTAGT TTGTCTCGTT TACTGTTAAA GATGTAAGTA GTAACTATAA GTGAATGGCT 3180 

AAAAAAATTA AAAGTAATCT AGCACCTAAA TTaAACAACA AAGAAATTTG TTCAGAAGAA 3240 

AGGAAATTAT GATGATTATT TGGTTAAACG GTGCGTTTGG AGCCGGAAAA ACGACCATTG 3300

CACATGAGTT ACAACAAAAA CTTCCCAATG CAATCATTTA TGATCCAGAA ATTATTGGTA 3360 

GTGCATTGAT GGAGTTAGTT CCCGAAGAAA TGAAAGAAAA TGATTTTCAG GAGTATCAGG 3420 

AATGGCGTTG CTGGAATGCT CATTTATTAA AACGAATGTC CAAAGAGTCT GGACGGCCGA 3480 

TTATTGTTCC AATGACCCTT TATAAAAATG AATATGAAGA AGAGCTGATT GGATATCTGA 3540

GACGAGCAGG CATTGACGTT TATCATTTTT TGTTAGCGGT CGAGAAAGAA GAAATTTTAC 3600 

AGAGATTATT GAAAAGAAAT GATGGAACTT TTGAATGGGG GAAAAATAAG TTGCCCGAAG 3660 

TACTTGaAGG TTTTCGTCAG ATTCAGTTTA CCGAGATTTT TAGAAATCAT TCAGCTGACA 3720 

CGACAGAAAT TGTAGCAACC ATTCTTAATA GAATAACCGA ATAAAAATAA AATAAGGGAC 3780 
CTAAGCCGAA ATGACTTAGG TCTCTTATTT TCTACTTCTT AAGATAATCA ACTACCAGCT 3840 
ACTCTGATTT CTTCAAACTT TAAGACCGCA CATTTACGAC AATCTATTTT TACACTGTCA 3900 
AAATTTTCTT TCAAGTCTTG CATTTTGTTA TTTTTAGCAG TATATTTTAG GTGTACCTAA 3960 
AATATATTAG AGGTTAACCT TCTAACAGGA AGGGCCTTTT AAAATTATTG CAAAACGAAA 4020 
GAAGGAATTG AATTGAAAAA TTCAGAAGAA CATGAACCAA AGCAAAGACA TCATTTGATT 4080 
GAATATGCAA ATGGTCCtTC GCTTGAAGAA ATTAATGGCA CCATCGACGT GCCTAAAAAT 4140

ATGAGTTTTT GGAAAACGTT ATTTGCTTAC TCAGGTCCAG GAGCATTGGT AGCAGTGGGG 4200

TATATGGATC CAGGAAACTG GTCTACTTCA ATTACTGGGG GACAAAATTT TCAATATTTA 4260

TTGATGTCGA TTATTTTAAT TTCCAGTTTG ATTGCAATGT TGCTCCAATA TATGGCTGCT 4320

AAATTAGGCA TTGTTTCACA GATGGATTTA GCACAAGCGA TTCGTGCTAG AACTAGTAAG 4380

ACGTTAGGTA TTGTATTATG GATTTTAACA GAGTTAGCAA TTATGGCTAC AGATATCGCT 4440

GAAGTTATCG GGGGCGCAAT TGCCTTATAT TTATTATTTC ATATTCCTTT AGGTCTGGCT 4500

GTCTTCATtA CGGTATTTGA TGTTTTACTT TTGTTGTTAT TGACAAAAAT TGGTTTTAGA 4560

AAAATTGAAG CTATTGTTGT TGCTTTAATT GTTGTTATTT TTGTGATTTT TGCTTATCAA 4620

GTGGCATTGT CAAATCCAGT ATGGGGAGAT GTAATTAAAG GGCTGGTTCC TAGCGCAGAA 4680

GCTTTTTCTA CATCACATGC GGTGAATGGG CAAACGCCAC TGACAGGTGC ATTGGGGATT 4740

ATTGGTGCAA CAGTGATGCC TCATAATTTG TATTTACATT CTTCTGTTGT GCAAAGTCGT 4800

AAAATTGATC GTAAAGATAA AACCGATATT CAACGCGCTT TACGCTTTTC AACtTGGGAT 4860

TCTAATATTC AATTAACGAT GGCATTTTTT GTTAACTCTT TATTATTGAT TATGGGTGTG 4920

GCCGTTTTTA AATCCGGCAG TGTCAAAGAT CCTTCATTCT TTGGTTTGTT TGATGCTTTA 4980

TCAAATCCAG CGGTTATGAG TAATTCTATT TTAGCGCATA TTGCAGGTTC TGGAATTTTA 5040 

TCAATTTTAT TTGCGGTAGC CTTATTGGCA TCAGGACAAA ATTCAACAAT TACAGGAACG 5100 

TTAACTGGTC AAATCATTAT GGAAGGGTTT ATTCATATGC GCGTACCAAT TTGGTTACGT 5160 

CGGATGGTCA CACGTTTGTT ATCTGTTATT CCTGTCTTAA TCTGTGTTTT AATGACCAGC 5220 

GGAAAAAGTA CAGTGGAGGA GCATATTGCG ATTAATAATT TAATGAACAA TTCACAAGTT 5280 

TTTCTAGCAT TTGCTTTGCC ATTTTCGATG TTGCCTTTAC TGATGTTTAC AGATAGCCGT 5340 

GTTGAAATGG GTGAACATTT TAAAAACTCG TGGTTAATTA AATTGTTAGG CTGGGTTTCT 5400 

GTCATTGGCT TAATTTACCT CAATATGAAA GGTTTACCTG ATCAAATTGA AGGATTCTTC 5460

GGTGATAATC CGACAGCGAG TCAAATTACG TTAGCCGATA ATATTGCCTA TGTCATCATA 5520 

GCACTCGTCA TCCTCTTGTT AGTTTGGACG GTTGTGgAAT TATATAAAGG CGATAAACGA 5580 

TATGCACAGC AGCTTGcAGC TATGGaGCAA CAAGTAGAGG AGGTTAAATA ATGAAATTTT 5640 

CAAAAATTaT GGnTtGGTGT TGaAGAGTCG CCAGATGcGT TaAAAGCATT TCCATTATGG 5700

CTATCCCAAA AGCAAAAGAA GAACAAGCTG GATTGGGG 5738 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

AAGCCATCAG CTGTTTGnTT ACATGGTGAA TTTTGAGACA AGTTAATTGT AAAAGTCCCT 60 

AAATTATCGT TGTTCTTCGC AGCACTAGAT TGTCCTTGGG CATCTGTTTG TACTTCAATA 120

TACGAATTAT CTTTTGAAAA AGATACCATA TCGCCATCTT GAAAGCCGAA TTGGACGACA 180 

TTGTCAATAA TAGTAGATAC TTCTTTGGCA TCATAAGTTG TCATGCTTTG CATTAGTTAG 240 

TtCCCCCtTT AGACTTCAAT TGTTCCAGTA ACATCAACAG TGTGAATTGC TCCAGAACGT 300 

TTGTATTTAA ATGATAATCC TTTATAGTTT CGTGCTGCAA TATCATCTGG ATTTAAATCT 360 

TGACGTCCCA AAGCAGTCAC ACTATAATTT CCAACACCAG TTTCATCTAC AATATCTACA 420

ATTCCGTTAT TAAATGCAGT TTCCAAAACA TTTGCAACAG TTGTATCCAA TAAAGCAATC 480

CCATTAGAAT CAAAAGTTAA TTTATCTGTC GTTGATAACA AGCGCTGAAC ATTCGTTTCA 540

ATGTTTGATT TAACCCAGTG ATCCCCATGC AGGGCATCGA TAAATTCGCC ACTCATTGTT 600 
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TTTCCTTCTG ATGTTTGGGG GATTCCCGCT TTTGATACAT AAGCAATAGC ATTAGCTTTC 660 

TCAATCGCTT GTAGTTGAGA GGTAGTTAAT GTGTTAGGTG TAATCCCTAC TAAATTATGT 720 

CGGAATTTCC AAGTAACACT TCCGACTGTT AAGCTTGCAG TATTACCAAT TAATGCAGCA 780 

TCTAAAAATT CTTCTAACGG ATGAACCAAG CCGATAGTTA GATTGTTCCC TGTAAAAACT 840

GTTAATTCGT CAACTGTAGC TGTCTGAATT ACTAAAAACT TAAATTCGTT TTCTTCAATT 900 

AAATTTGATA GTGCCAAAGC ATCTGCCTCA ACAAAGTTGG CTAATAATGC AAAATGCCAA 960 

TCATTATAAA AGTAACTTGT AGCTGCGGCG ATAATTCCAC TTCCAGCAAT TAAGCTATCT 1020

GGCTGTTCTG GACTAGTAGG GGTATCTGCT ACAAAAGTTA CTACTGgCAA TAGTTTGTGG 1080 

TTTATTTTCC TGTTTCCAAA TTATTTCTGC TTTTTTATAA ACAGTAGTTG TTGTAGCAAA 1140 

ATCTTTTGCT AATGTTTCTA AACTTGTATA TTCTTTATAG TTTTGAGTAG ATCCTTGAAC 1200 

AAAAATTGCT GGATTTCCTA ATCCAACAAT GGGTTGTGGA TGCATTATGT CAATTTTTAC 1260 

ATTAACATCT GTAATTTTTT CAATCATTTG TTTTCCTCCT GTATGTCTAT ATTTTCTATA 1320 

ATCACAGCAT TGTCAACGTA GCTGTCTTGT ACTCTAAAAC GAGCATCAAA GCCAGCCAGA 1380 

CGTTCATAAT CAATACTAAT GAAATTATCT CGTTTTTGAG ATTGTGTTGT TGAAACTAAG 1440

GCTATTTTAG AGTTTTGAAG GTCTACTTTT ACAGAAAAAC CGTTTAAATG CTTTCTTAAC 1500 

TGCTCTGACA AATTAAGTGC CTGAATACTA GAATCTGTAT GACATTTAAT AGACACTACT 1560 

AATTCAAATA CTTCATTATC AGTGACATCT ATAGTCACTG GAATATATGG CGAGATGACC 1620 

TCATAAGAAA AAAACGGTTT ATCTGGCTGT GGACCAGCTG TGCTACTTTC AATAAGTTGA 1680 

AGTCCAGTTG ATTTCTTAAC AATTGCTATT AGCTCATCAG CTAAAAGCTC ATAACTAAAA 1740 

GCCCCATTAA TTTGCGGCAT TATTGGTCAC CGCCTTTAAT CTGTAAAGGT TCACATCTGC 1800 

ATAGTGTTCC GCATATGGCT CTTCGCTTTC TACTTTGTAC TCTTTTCCCA TGTCAATAAT 1860

TTTAGAACCT AAAGGAATCG ATCCAACGTA AGCAAGTTGC CTGTCGCTAG ATGTGAGTGT 1920 

TCCACCACTT TGATAAATCG TCCGATTGTC GTATGGGATA ATCGCCCCAG AAACGGTTTT 1980 

GGGTTCTTCG TTAGTAGGAA ACCATTCGCC GTGAATGTAT TTACCAGCAC CTTCAACTGA 2040

AGGTAACACT AACTGAAAGT CAACCGCAAA AGTTTCTATA AGACTAGAAA AGTTCATTCT 2100 

TTGCATTACC CTTTCACCAC CTTATAAGTC ACTGACTGTC TTAATTTTCC TGTGTCAATT 2160 

AATGGATTAC TTGACTTCTT ATTATTGACT GTTATTGGAG AATTTGGCGG ATCAGATAAA 2220 

TTTCTGATAG TTCGTTGAAT ATCCCTTTGC ATACGAGCAC CAACTTTATT TGTCAATTCT 2280 
TCTGCTGTCA TTTTTCCAGC AATAAGTTTT TTCACAAGGT TTAAGGCGTA ATCCGACCAA 2340

GCCTTTGCTT GCTCGTCAAA TGTAGATCTA ATAAAGGAAC GTTCTGGAAT AGTTATTTGC 2400

TCCGCTAACA TGTATGCGAA TTCTAACTGA TCTTTACCTT TATTTCTTAC TAAAAACTTG 2460

TGATTTTCTT TAGTTTGCAT AAAAAACAAA TCAAATTGAC GTGGGCTTTT ACCTCTATAC 2520 
TTTTTCATAA GTGGTATAAC AAGAAAACGA CCTTTGGGAC GAATAGTCAC ACCAAATTCA 2580

TGAACTTGGG CCAACATCGC AATAAAAGAA TCATCTTCCC CAAATATTCC TATCTGCAAA 2640

GAATATTTAT TAAGCTGATT TAGTTCATCA ATCAGCTTAA TAATCCTGTT ATTCTCAGTA 2700 

ACTTTCATCA AATCACAACC AaaCTCAATG AACCTTTTTT TGCGTATTCG TTGTAAAGAC 2760

GTAAATACTC TTGACCATAA ACGGTTCTTT TAAGGTCGGT AAAAGTTGAG TGAAAACCCG 2820 

AATACTCTTT TTTTAGCGAG CCTACTTGCT CGGATTTAGT ATTTTGGTTG TTCAAAACGG 2880 

CTAGATGGCA AGCGAGATAG CGACACGCTT TCTCCTTAAC CTCTTCTTTA AATGGCAATG 2940

CATCTACTTC TAGCCAAGCA TCATCAATAA ACAACTTAAT AGAATCATTG TTAACCCCTA 3000 

CTAGTTCTGC AGCTGTTAAC CTAACATTTT CAACTGTGCT TTTTGGCATA GTACCACTTC 3060 

CTATTCTGGA TTAACTACGC TGTCTTCATC AGGGTTTTTA ATATCATCAA TTTGATTTTC 3120 

GATTGCTTTG ACTACAGTTG TGCGATTTTT GTTTGCTTGC TCTTCTTCCA ACCATTTTTC 3180 

CAACAATTCT AAATCGAACG TATCAGCAAT GGATTCTACT GCTTTATTGG CACTCAATTC 3240

AGTAAAACCA ACTGCTTTTT TTGATTTTCC TTTTGTTATA TGGTCCAAAA TCTCAATTTC 3300 

TCCTAATCTT TCCAACGATT TATTTAATGG TAATTCCATA CCTTTAATAA ATTCTTCAGC 3360 

ATCTGAATTA TCTAAATCAT TTACACCTGG AATTAATCGA ACATTTCCAA TATGTCTAAT 3420

ATATGAACCT TTATTGTGTw CTAACATAAT CTTTACCTCC TTATTTTTAA ATACCATCCA 3480

CACGAACAAT TGCATATGGC GTTCTAATTA ACGCACCACC ACAACGTTCT ACAAATGGTA 3540

CTTTCCAATT TGGAAAAGAC CATTCAACCT CTAAACGCAC GATGTCTTCT GGAAGTAAAA 3600 

TTTCACAAGT TGATGGTTTT GTATCCATGA TAATAAATGA ATCAGAATTA TCAGTACCTA 3660 

CACCTTTTAA ATCATAAACT TGCTCAATAG ATGAAAACCA GCCATTTTCC TGAACAaCTT 3720 

TCATAATTGA TCGGGCATCA TATTCACCAT AACGACGATT CAATTCTTCA TATTGTTCTG 3780

GAGCAACCAT TAATTTTAGA CTAGATCCTT TAAATCCTGG AATAATAGTA ATTTTAGCTC 3840

GAGATGTACG TAATTGTTCA ACAATCTCTT CACTAGTCAT TTCTTTCCAC TTCTTAGGTG 3900 

AATTAATAAC TTGAATACCT TCAGCATTCG CAACACCTTT ATGGTTTACT TTAGGATCTC 3960 

CCACAAAAAT AAAGCTATTT TCTTTTTCAG CAATAGTACG TCGTGCCACT TCTGCTTTTG 4020 

TTGCATCAAT TGAAGTTCCC ATCATTTGAG CTTGACGAAT TTCTTGTCGA CTATAACGAA 4080 

TACCAGCAGC AATTGTAAAA ATTGGTGaTT GATAACGtTC aTaTCAAtGk CAACTAAAGG 4140

AAGATCATCT GCACCATTTG CGATAATTTT TGCAGCGCCA CTTCTAGTCA TAACATTATA 4200

TGCGTATGTT TCCGCTCCTG GATTGATAtC TGTTTTAACA TTAAACATGG TTCGCGCCAC 4260

AAGTTCCTCC TGTGGTGCTT GATAAATGAC CTTATCAATT TCTTGTAGGT CACGTGCTTC 4320

TAAAGTTGCT GTTACATCAT TTCCCATTTA GTTGTCCTCC TAGTTTTTTA AGGTAAGTTA 4380

ATTTGTAAAA CTGCTAAATT ACCAGCTGAT GCTGTTGTTT TAAATGTACC AATCACGGTA 4440

TCTGATGGTG TCGTAGCTGG ATCAGAACTA ATAGTTGCTT TTCCAAAATT TCCAGTTGAT 4500

AAAGCTTTTG CATTTTCACC TGCTAAAACA TCTTCGTCAA CTTTCACCCA AATAGCCCCT 4560

TTACGTAAGA TCGGTACCAT TTCGTGTTCT TTATACTTAC CAACTTTTTC CACATCTCCG 4620

TAAGGGAT.XT- CTTCTACATA ATTTTTAGCA TATGAAATAC CATAAAATTG CCCATCTTTA 4680

TATGTGGTAG CAACGTCTTC TGTTACTTGA ACGCCGACAC CGAAACyTAA GCCACCGACA 4740

CCAACGACAA GACTATCTGC TTGTACACCT TGATAATTAG CTAATTTCCC AATACCAAGT 4800

TCAGGCTTCA TATACTTTTC TGGATAAGGA ATAGTCATAG TTATTTATCC TCCTTTTTGT 4860

TCATGTTCAA ACGGTTATTT TTTTGTTTTT CAATATCTTC CGCCGCTTCT TTATCTTTAT 4920

CTTTTGCATC GTTGAACGCT GCTGGATTTG TAAATCCTTT TTTCTCCACA TTTGCAACTG 4980

CTGAATCAAA GTATGCTGCA ATATAATCGT CAGATTTTCC ATCACCTTTA AAGTCAGGTG 5040 

ATGTTTTGGC AATTACTGAT TCTTTAATTT CACGATCAGA TTTGCCAGTG AAATCAACTG 5100 

AATCACCAAG AAACTTTTGT GCTTTATTAA TTAGATCAAC ACGATCTTGA AGACGCTTAT 5160 

CCAAAGCATC TGCAGTTACT TCTTTTTTCT TTGCTtCGTC AATTyCCTTA TTCAACTTAT 5220 

CAATTTGAGC TTTTAAAGCA TCTCGCTCAC CTTCTAATTT CTCAACGTTT GCTTTACGTT 5280 

GTTCGGCTGC ATCTAATTTT GCTTCTAAAG ATTCAAATTT TGCTTTTACA ATTGAATCCA 5340 

CTTCAAATTC CTTTGAATCA ATAATTAATT TAGACATTCC ATTTCCTCCT GTTTGTTTAT 5400 

CTTTAGTATC AATCATGAAA GCGACTGAAT CTCCCCGAAT AGCAACTTCG GGACCAGCTC 5460

TCCCCTCATC TACTATAGCA ATATGATTAA TTTGCATATT CCTTTGCACA GAATCATATT 5520 

GCATGCCATT ATACACGCCA GTTTCTTTTG AAACATCTGC TTGAAAACCG ATGCTCAGTT 5580 

CACGTTTTCC ATCGTTAATT TTCTTTATkG TCTCTGCATC TGTAATtGTG AACGAAACCA 5640 

ATAGCTTATT ATCTAAAACG TGAGCATCAT TATGAGTCAT GCCTTTTGAA TATTTATTGT 5700 

AATTAGCTGC TGTTACTGGT TCGGTCGGAT GATCATCAGT CATTGGCTTA GCATTTGCTG 5760

AAAGTACAGT TGCCTTAGAA AATAATTCAT CAGGCAATTT TGCCTCCATT GATAATCCAC 5820 

CATCAGTTCG GCGGTAAGGG AAAACACCTG GTCGTGTAAT TGGACAAGCT GTGATTGTCA 5880

AATAACCTTC ATCAGTTTCT TTAAAATCCT TAATAAAGGC TTTGTCATAT CTAATTACCA 5940 

TGTCTTCACC CCATTATTCG ACTAATAAGC GAATGcCTTC GGTTACTTGA TTACTCGATT 6000 

CTAACTCATT AAGATCCAGT AATTTTCGTA GGCTCATATG ATAGCGTGTC GCAATTTCAC 6060 

TAGCAGTTTC TCCCTCAGAA ACAGTATGAA TCGTTTTCTG CGTAGTTTTT GACAACTCTT 6120 

TTTTGTTTTC TTCTTTACTA GAAGCTTTTT yCTCTTTAGC CATTTTGATC ACCTCCTTCa 6180

GkTACATCaA ACCTGATATT tCTCCACTtT TTATAAGCAT CGAAATATAG TTCATTTTTA 6240 

TCACCGTTGA AGGTAAyTCG TAATACATAC CATCAAATAG ATTGGTACTT AATAATGCTT 6300 

TATTATTTTG CAGGGTTTTC GCTAACCAGA CCACGTAAAT ATCAGACTCG CTTATTTTTT 6360 

TTTGATCTGT GTAATCCAAT TGACTGTTAG CAAAATCAAC TACATGCTTT TTACATAAAT 6420

CAATAAATTT TTGACTATCC ATTAATGTCC CTCCTGTTTT TTGAGTATAA AAAATAGCCC 6480

CAACCTATAA GGTCGGTGCT ACTCTTCTTT TTCTTTTAAA GATTTTTTCA ATTGCTGCAT 6540
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AAAATCTTTA 


TCTTTCTTTT 


TCGATGTAlb 


1 Al 1 ILL1 1 1 


1 i lol tn 111 


rATPTCJGnTA 
L»n iti vjou j. n 


6600 


ACCTAGACTT 


TTTGATTTGG 


GAACGTCTTC 


LLA1 L 1 Abb 1 


lHjlllllLi 


PTiTTTTTATT 
tnl ill i. t\ *. i 


6 6 60 


CATGGTACTC 


CTCCAGTACT 


ATAAATGGTT 


TTbbAlbbaA 


TTATAZiTTATPTP 
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TAAACTTGGc 


ATCACGATTA 


AACAGAACTT 
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CTTTTTbbbA 
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CAACATCATT 


T AAATT. b 1 Ab 


T T P TT Zi PP A A 


6900 


ACCTCACTAG 


GTCATCACTG 


CTATCAAAAA 


TV ^ T\ m /** TV TV f** 

ACATGGAACG 


rn rp m 7V r-> rp m TV 

TGTTACTTCA 


bbA 1 b A 1 rlu 1 


6960 


TGGTCATTTT 
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TTGTCTAAAT 
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TGTCTATCAA 
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CTCTTCTTGA 
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C AAG AT C AAT 


bCTTTCTl LA 
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AATTATATTC 
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7200 


TATCAGCATA 


AAAGTAAATC 


TTCCCGTTAC 


GTTCTCGATG 


AGAATCCCI A 
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7260 


CTCCACTATC 


ACTCCATCTA 


AAAGCTTGAA 


ATCCCGCTCG 


TTTTTGGCGT 


rp/TvPTTATTPZi 
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7320 


TCTGACCTAA 


AATAGTACCT 


GTCTGATCAC 


GAGCAATAAA 


CGCTGCTTTG 


rp/^rp/^TvrrPTiPTi 

ibl b A 1 bntn 


7 IflO 


TACCAGCTTG 


ATGAACAAGC 


TCTTCTCTTA 


TCTCGTTTGA 
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AGATTTGCCT 
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"7440 

f H H \J 


CGCGATAAAT 


AACCTGTTCA 


AACTTTTTTG 


AG T AAT CATC 


ACGAATATTA 


GTGAbAl AAb 


7 son 

1 


TGATGTTTTC 


CGCTATTTTA 


GCTTGAACAT 


AGCTGTCTAA 
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CCATTTTTCA 
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7560 


GGTTTATTCC 


TCTAGCGCTA 


AGTTGAGAGT 


TC AC AT TAG A 


m m ^ rp m tv tv tv rn 

CTTGTTAAA1 


PPPTTPAPAP 
vjbb 1 1 ununL 


7620 


TATTAATATG 


CTTACGAACT 


ATTTTTTGTG 


CGGTTCTGTT 


TTGAAbAAl A 


PPT Till A A APT 
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7680 


ATGTTTGCGC 


ATTTCTGATT 


AGTTTCGATG 


CGGCATCbAA 
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TTA ATTAAPtT 
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7740 


TGGAaTCATT 


AACAATATTT 


TTATCATCTA 
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TP A AATTPAT 


7800 


ATAATGAAAC 


TTTTTCTATT 


TCTTTTACGG 


CCTTCTGAAT 


b TT I 1 1 AbbA 


TliPPTTTPTT 
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7860 


CTAAACGTAA 


CGGATATCTC 


GTTTTTGGAT 


TATTTTTCAT 


1 ATb 1 b 1 b 1 b 


PTPTTTP.TA A 
Uioi 1 lol nn 


7920 


TTTTCATAAA 


CTACAGCAGC 


CATCTTATCT 


AACTCATCCC 


TGbAbAAbb 1 


7ATPPPPPTTA 


7980 


AATTTAGAGG 


TTTCTGTTAT 


GCCAAAACGA 


/«* TV TV TV ^> ^ TV 

CCAAAGCGAG 


CTTCb 1 bAAb 


7ATP2\TP APPA 


8040 


TCAGATACAC 


CTGCCTCAAT 


GTAAATCTTA 


m rn o rnrn rp tv 

TCTGTTTCAb 


CAb 1 b Ab ill 


TPT A ATPTPT 


8100 


GCATCTGTCT 
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ACATCATATT 
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• CGCCTGCTAA 
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8400 
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TGCTTCTGTT 
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CTGAATTGAT 


AATCCAyCTT 


AGTTTCAATT 


TCTAGTTTGT 


CTTGTGGTGT 


1 AAGACTTCTA 
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ACATCTGCTG 
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ATTATATATT 
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TTATCGGATC 


10260 


ATCGATAATT 


AAAAGCCTTG 


CTCCACGTCC 
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AAGTACTGTA 
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AAGCGAGCCT 


■ AGATGACCTT 


CAACTCCCCA 
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CAGAACTTAT 


10380 


TTGTAAGCCG 


AATAATTGGT 


CCGAAAACAA 


GCGGAACTTT 


TCGCGGTTCT 
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10440 


CTTTTGGAAT 


AATTCTTTTG 


AATAAGAAAC 


AACCATCACC 


AGACTATCTG 


i GATGCCTCAT 
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TAAATAATAA 
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TCTCTGTAAT 
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TTTCTGATGT 
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ATTTCGTCTA 
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TCGCCTCAAT 


TCTTCTTCCG 
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TAAAGbbAbb 


rpPr—PTPTTA A 7A 

Ibbb J. LlAAft 




10860 


AAATAGATTT 


CTTTAGACTC 


TGTAATTTTT 


CCCATTbbT 1 


pppt\ tp a ar ta 
bbbAI bAAbA 


T AAPH AP.ATT 


10920 


GATGCTTTGA 


CAATTCATTT 


AGTAACACAA 


CTCTTTCAAG 


T\ rprp^T\/-'rprpyr-«7> 

ATT bAb I IbA 


TA 7A 7A T BTTPTT 


1 0Q80 
j. \j ? 0 \j 


TTTTTATGGA 


GTCAATCTGC 


TCTAAAAGAC 


TTCTTTTGTG 


r^mrprp/"i/"» r "»7\'PT 

GnTTCGbAl 1 


iblllblll^ 


11040 

X X W H W 


GACCATCAAT 


AGCATCTGCT 


GCCTTTTGTA 


CTTTTTCTAA 


GTCGTTATTA 


bbb 1 X AAbbM. 


J. X X U vj 


TTTCGAGCTC 


TAAATCGCTA 


AGTTCAGACT 


TTATTTTATC 


AATTGCTATC 


TLAbb I 1 bbl 


1 1 1 fiO 


TGTGATCTTG 


TATTTTCCCT 


TCAGTTTGCA 


TTGCTATATC 


rp t\ tv t\ TV rp rp 

TAACAGCATT 


/-^ tv A7A7\Ar^ r P T iP 
b AAnnnL 1 r\b 


1 1 770 


GCAATCCCTC 


GTCCTTCATA 


CGATCGCGTA 


GTCTTTTCAT 


TTCCAATTGA 


T* m rp m ffi m rp rp rn 

AT Illllbll 


1 1 9 RO 

X X Z. O VJ 


TAATGTAAGC 


ATTTGTTAAT 


AGCTTGTGTC 


CTTGTTGTCT 


AGCTGTCTTT 






CCACTTTAAT 


TGCTGATTGT 


GTTGCATTAA 


AAGTTTGCAG 


TV m TV TV f*^ TA m /"* TV 

ATAACAGTCA 


Ab AAAbAHb i 


114 00 


CATATCTCTC 


TTTTGTTTTA 


TTTGATGGCT 


CTTTTTCACT 


TCTTGCCATT 


bAbbbb Abb I 


114 fiO 


CCCCTAACTT 


CTAAGTATTT 


TATCAGCTTC 


AATAAGAGTT 


TT a AbAJ. bAL 


rn TV TV PTP TA Af^T 

I AAb J. unno 1 


11520 


TAGTTTGATT 


TCGCCATTTT 


TAAGATTTTT 


TAACCATTGG 


GbbAibbibb 


PTPTT A TP AT 


11580 


TTTTCGGTAT 


TGTTCATAAT 


CTTCGGCATC 


TTCAAACTCT 


rn m /*> rp rn 7\ 7X rp rp 

TTCTbTAAl 1 


PP7A 7A 7ATPTA A 
bbAAA MM 


11640 


AATATCAATT 


TTTTCATGTT 


CTTCTTTTGT 


TCCCATTGTT 


TV TV 7V T\ Z** 1 T\ ^** r"' /"* r™" 

AAAAb Ab b b b 


PP7ATTTPAT A 
bbAl 1 ibnln 


11700 


AAATGCTAAA 


AGACACAGAG 


GGTGTCGAAA 


ATCCACGCGT 


GGGAATlblb 


TPTPTPTTPf^ 
iblbibl 1 y^yj 


11760 


GGGTGTTTTT 


TGCGTCCTGA 


GGAATTAGTC 


tv m ^ mm TV /"* 

GAGTGTTAGC 


TGbAbTbbbL 


i 1 J. i 1 1AM1 I 


11820 


TATTCTACAA 


GTTCTGCAAT 


GATTCCTGTT 


^ tv m m m m m t\ t\ 

GATTTTTCAA 


tv /*"■ rp rp/**rp 71 rp 

AGCGTTCTAI 


P7ATTZ1PATPA 
bA I 1 AbA 1 ^M 


11880 


CAGAAGAGTG 


GATCTAATTC 


AAGCGTATAA 


^ tv /-^ -tl rp Tv r"* /~* TV rp 

CAGATACGAT 


rnrn 71 Tv PTPTTP 

T 1 AAL 1 b 1 1 b 


a PA APTPATT 
AbAAb 1 v»M 1 1 


11940 


AATGTACTTC 


CTGAACCACC 


AAATAGATCT 


TV TV TV TV /"» TV TV m TV rp 

AAAACAATAT 


CTTGTCTTTT 


BP AnPT ATTT 
AbAbbiAl JL 1 


12000 


CTAACTGGTA 


TCGCAATAAG 


TGATAACGGc 


TTTTGTGTAG 


b A 1 b A I Ab 1 A 


PPTTP.PT AC A 


12060 


TCATCTCTTG 


GTACTTTCCA 


AATAGTAGCT 


GGTAAATbTT 


C I AA 1 AbA 1 b 


A TPTT^PPAA 


12120 


ATTGTAGTTT 


GCTTTCTGTC 


TCCGTACCAC 


GAAGGTGbCr 


r*** *P ^P rp *p rp r* 1 rp 

bll Ibilbll 


PTPP.PPATAA 


12180 


AAAACTGGTT CATGTTGCCA TCTGTACTGG CTCCAACCAA ACGTAGCATT ATTTTTTACC 12240

CATATACATT GTGAACGGAC AACTATACCA GCAGCATTCA TACTATTTTC AAATTCACGT 12300

TGATAAGATG ATCCGtGAAA CACATAAATC GCTGAGTCAT CTCTCAtTGC GTTAGAATAG 12360

TTTTGAAATA CTGACATCAA GAAnTGGTCG AACTCTTCAT CACTCcATGT CATCGTTCAT 12420

AATTTTTTtC CCGACCAGAT TCGTTTAATt CTTTATTGTC AGATTTTACT GCTACATTAT 12480






AAGGTGGATC 


AGTCACAACT 


AGATTTGCTT 


TTTyTCCTkG 


TAATAACTTT 


TCaACATCCG 


1254 0 


AAAGCTTTGT 


CGCGTCGCCA 


CACAATAGAT 


AATGATTACC 


AAGTTTCCAT 


AATTGGCCCA 


12600 


ATTTAGTTTT 


AGCTTCATGA 


TGATTTTCTA 


TGAACTCATT 


AACTTGAAAA 


TCATCTTCAA 


12660 


TAATCGGTTT 


CTCAATATCT 


TCTTCATAAT 


TGaAAGAaGC 


GAGCArGCTA 


TCTACTTCCT 


12720 


CAGTATCAAA 


mCCAGTCAGA 


TTAACAGCTT 


CATCATcTAA 


TTCATTTAAT 


AAAATAGAGA 


12780 


GTTTTTCTTC 


ATCCCcACCG 


ACCAGAAATC 


TTATTaAGAG 


CCACGTTGAG 


CGCTTTTTCT 


12840 


TTATCAAGAG 


GTAAATCTAC 


CACGGATACC 


TCTATTTCAT 


TGAACAAGCC 


CAGTTCTTTG 


12900 


GCAACAGCGA 


CACGCTGATG 


TCCACCAACT 


AGATTGCCTG 


TTTGAATATT 


GTAAATAGGC 


12960 


GGATCTACAA 


ATCCAAATTC 


TAGGATAGAT 


TGTTTTAACT 


TCTCATACTC 


TTGCATTCCT 


13020 


GGTTTTAAAT 


CAACTCTTGG 


ATTGTAGTCA 


GCTGCTTTCA 


GATCAGATAA 


TTTCATTTTT 


13080 


TCTATTTGCA 


TGTTTTTTCT 


CCTTTAAACA 


AATAAAAAAG 


CCTAATTAGG 


CTTTTTTAGA 


13140 


TAATAGTCAA 


TAAGTATTCT 


TGTTCAAGAC 


AGTTTTTTAT 


TTCAATCGCA 


GCTATTTCTG 


13200 


CTTCTTTATA 


TCTTTTTAAT 


AATTCTTGAT 


ACTCATCTAT 


AGCCAGCTCC 


GTATATGCTT 


13260 


TAGGGTTAAT 


AGATAAAATA 


TAAGTATTTT 


CTTGATTCGG 


CATATTAACA 


AAATTAATCG 


13320 


ATACCGAATC 


ATTATTTTCT 


AAAATTACGT 


TAATTTCTTG 


TTCGTTAATC 


ACTACTGTAT 


13380 


AATTCTCATT 


TCCAAATTGT 


TTCATCTTAT 


TCCCTCCAAA 


AATGTAATAA 


CCCCTCATCT 


13440 


AGGTGTGTCT 


ATATAACCAT 


ACTATTAGCT 


ACTTGTACAG 


TCATCATAGA


ACATTTATCA 


13500 


ACTTAATTAT 


GGTATATGTG 


ATTTGAAATT 


ATTCTGAATG 


TGATAATCCT 


TAAAAAAGCC 


13560 


ATATCCAATG 


TAAATTAAAT 


CTAGCTTTCT 


AGACATTTTT 


TTACTAACAG 


GTCGTATACA 


13620 


TTCAACAATT 


GsAAAGTCAC 


GATTTAATTT 


TACTTCTCGA 


GTTATTTTTG 


GkGGGTAATT 


13680 


TTTACTTAAA 


CTTATGTwCC 


AGTAATTCAT 


AATGTaTCCC 


CTATTTAAAT 


TTGTGTGGAA 


13740 


CGCCTAGCCG 


AATCCACACk 


GAATAACAAT 


ACTAGGTTGC 


TCGCCTTTTC 


CTGTTTCCGC 


13800 


AGGCAGGCAC 


TTTGAAAGGA 


AGGAGTGAAA 


ATCGTGTGTC 


ACAGTAGTAG 


AAACATTTAT 


13860 


TGACA 












13865 



(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5154 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

GGTATCTAAC GCCCAACCTG TAATGGTTAA AATTATTGGT GGTTTGGTCG CGGATnGATA 60 

CTATCCAAGG ATCCACnTTA ATGTCTTCaT TCAGnGCaTT CtCCtTAaGA TGTTTCGGAT 120 

TTCGGTAGGG ATCcAACTAA AagCTTCCGC GGCCCaTTTT TCCGcAAGTc TTTATAAGAT 180 

CCTTTTTTAT CATGGTTTTG CATTTTTTCG TCTGCTTTCA AAATTGCATA GGACACATAT 240






ACATTTCGTT 


TACCAGCGCT 


AGCAAATTGT 


AACGATAAAT 


CGACACCCAT 


CCATTCTTTT 


300 


CCTAAGCTTT 


CGTTAATTCC 


TTCTAAATTT 


AGGTAAAGTT 


GTTTGTTGAA 


TAGTAAGCAG 


360 


TCTTCAGTTG 


CTGCTTGAAT 


ATTTCTTGGT 


AAAGCAATTC 


TGTAATAGTA 


ACCTAAGCTT 


420 


TTACCAGGGG 


TGCCTTTTTC 


AGGATAAAGT 


AATCGACCTT 


TGTCTGTATC 


AAGTGATACT 


480 


CCCACTGTTT 


CCACGCGATA 


GCGTGmtCaA 


CTACGCGGCC 


AGTCACTAAA


CCAGAGGTCT 


540 


CTTGTTGACC 


AATGTTCATT 


AACTCTTTTA 


ACCAATCATT 


TTTCGTTGGT 


ACCAATCCAG 


600 


CATTTAATAA 


AGCCACATAT 


TCTCCCTTAG 


CAGCTTGAAT 


CATGCCATGA 


CGAGTTTCTG 


660 


TTGAAACATA 


ACGAAGACGG 


TCATTTTGGA 


TATTTATTTG 


ATTCTCGAAG 


CGTGCAGGCA 


720 


ATAGAATTTC 


ATAATTTGTA 


TAGGCAGTTT 


TTTCTAATAA 


TTGACGTAAG 


TAGCTACTCA 


780 


TATTCTO-TGT 


ATCATTCGTA 


ATAATCAATG 


ATACCATCGG 


CACATGATCA 


TAAAGATAGT 


840 


TAATTTTATA 


GGAACCATAA 


AATTCGGCAA 


TCTCTACTTG 


TGCTTTTAAG 


cCACGGCGTT 


900 


CCGTCGCCGC 


TTGAACCGCT 


TTTTGACCAG 


CCACATAGGC 


GTAACCTTTG 


CTTTCAGGGT 


960 


TTAGAGCAGT 


CGATGATTCA 


ATGGCACGCC 


AATGATACAT 


CATACCAGGA ATGTGCTyGA 


1020 


TTTTCGTTGC 


TTGTTCCGTT 


GCTCGCAATA 


CGAAATCATA 


GTCTTGAGCG 


CCGTTATATG 


1080 


CTGAGTTCAA 


TCCGCCTACT 


TTTTCTAATA 


AATCACGTTT 


CACAACAACA 


AAGTGTGTAA 


1140 


TGTAATTATG 


ATTTAAAATC 


AGTTCTGGAT 


TCCAGTCCGA 


TTTGTAAAAC 


GCGTTGAAGC 


1200 


GACGACCATT 


TTCTGTAATT 


TTATCTTCAT 


CAGTATAAAG 


AAAATCAATC 


GTTGGGTCTG 


1260 


TATTTAAAGC 


TTTGACCACT 


TCATATAATG 


CTTGTGGCGC 


TAATTCATCA 


TCGTTATCCA 


1320 


TGAAACCAAT 


AAAATCGCCA 


GTAGCAATCG 


ACAAAGCTGA 


GTTAGTTGCT 


TCTGAAATAT 


1380 


GTCCGTTTTC 


TTCGCGATAA 


ATAACTTTGA 


TTCGTTGATC 


TAGTTCTTTA 


TATTTTTCTA 


1440 


GCATTGGCTT 


AATATGTTCG 


CTTGGCGATG 


CATCATCTGC 


TAAACATAAC 


TCCCAATTTT 


1500 


CATAGTACTG 


ATTTTGTAAA 


GAAGAGACGC 


ACGCTGCAAG 


CCATTTTTCT 


TCGACATTAT 


1560 


AAACAGGCAC 


AGCGACAGAA 


ATTTTAGGCT 


GAAATTGGAA 


CGTTTTAATT 


TCCGCTAAAA 


1620 


CTGCCTCCAC 


GTCAACAGGG 


GCTTCTGCTT 


TTTCTTGTCG 


TTTAAAGTAC 


CACTTCACTG 


1680 


AGCCCATCAA 


ACCACGTGTC 


TGAATCGAAC 


GCCATGCTCG 


ATCAAATTTC 


GTGCCTGGTT 


1740 


GAACCATGTT 


TGGACGCTCC 


ACCACGTATT 


GAACCTTCTG 


GCCTGATTTG 


GTGTCAATGT 


1800 


TTAACGTGTA 


GGCTGGTAAA 


TCTTCTGGAT 


CTTTAATCGT 


AATTACAAAA 


CCATAATCTT 


1860 


GATCGCCTTC 


TAATTCAAAC 


ATATCAATAA 


CATCACGACG 


GTATTCTGTT 


TCCACCGTAT 


1920 


AAGTGACTGA 


TTCACTAGCT 


TCTGCAATGC 


TGAATCTTAG 


CGGATAATTT 


GTTTGTTTGT 


1980 


CATACGCCCA 


CCCTCTGATA 


ACAGAAACGT 


TATTTTCTAC 


ATCGCGAGCA 


ATATCATCGA 


2040 


TGGTTAAATT 


AATCATTTTA 


TCAAATTTAG 


TTAACGTGAT 


TCGTTTATAT 


TTTCCAGTAG 


2100 


GCATAGGGGC 


AGTCTCCGTC 


TGTTTATTCA 


TTTCTTCTAA 


AAATTGATTT 


CTTTCATCTA 


2160 


ACAGTGCATG 


AATAAAGGTA 


TTTTCACGTT 


CTGAATAGTT 


GCCTTCAAAG 


AAATAATCAA 


2220 
TGTCTACTGT


CAGGTCTTTG 


CCTGCCATTT 


GTGCATCGGT 


TACTTCGATT


ACAGGTACTT


2280


GCGTATCTAA 


GAAATAATAG 


ATATTCTCTT 


GTACCAGAAA 


AGCATTCGTG


GCTGCSACTS


2340


GGACTGTCTC 


GCCGTCTATC 


GTCATGGATA 


ATTTAACAAT 


TGCCCCCGTT


AAACTTGGGA


2400 


ATAATTTTAT 


CAGTTGGACG 


TCTTCTGGTA 


TCTTCAGTTG 


GAACTGTTTT


GTTTCAGGTT


2460 


TTTGTAAATC 


AACTGGTTGA 


TCCAACTCAG 


TCTCTTTTTC 


TTCGCCGCAA 


TTAATTAAAA


2520 


TATGAACGTT 


TTTTTCATTG 


TATGAATTTA 


CTGGTGTCAC 


TCGCTCTGGC 


TGTTCTACTG


2580 


GTTCAGCAGT 


CAAGGCAAAA 


AAGTACTGAT 


ACACTTCGCC 


AAAGGGACGC


GCTTTAAAAG


2640 


cTGCTTGCAC 


TTCTACAGGT 


AACGCTTCAT 


ACGTCGTAGG 


AATTTCgTTG


TAGCCAACTT


2700 


GATTAAATGT 


AAAATCTTCT 


TTGGCAATGT 


ATAAACCAAC 


TTCTGCAAAA 


ACTTTTTCAA


2760 


ATCCTTGTTG 


TAAATAGAAT 


GATTTATGCG 


TAGCATCCAA 


CAGCCCAGTT


TCATTCCACG 


2820


TTAAACGATT 


ATTAAATAAA 


TCAATTAAGA 


CCGAATTATG 


CGCCAAATTA


GGGAAAGTGA 


2880 


TTAAGATTTG 


ACCGCCTGGT 


TTTAAAAACG 


GTTTGACTTT 


TGCTAAGGCG 


CTTTGAGGAT


2940 


TCATTAAATG 


TTCCAACACA 


TCTGCAAAAA 


CGATATAATC 


AAACGTCTTA


CCAGCAAAGT


3000 


ATTCGACCCA 


ATCATTTTCA 


TCAATATTGC 


CATAAAAGGC 


ATCTGTTGAA 


AATTGACTGA


3060 


CATGATCATA 


AAGTTCTTTG 


TCTAATTCAA 


CAATTGAGAC 


CTGGcACTTT


TTTTCTTCCA


3120 


TCAAATAGCT 


AGTCATTCGC 


CCATTACCTG 


GaCCAAATTC 


CaAGP-CATCA 


CT" 1 7 ^ ^ 


3180 


TGATTTGTGC 


TACGATTTTC 


CCAACACTGG 


TACTCTCGTC 


TGTTTCCATT


TCAAAATCAT 


3240 


ACTTCATCTT 


TTGTCGTATC 


CTTCCATTTA 


TGATCTAAGA 


TGACAATTCC 


TTCTCCGGCA 


3300 


TAACGACCGG 


TAATAAACAT 


CGTCACGTAT 


TTTGTTTTGT 


AGACTAATGG


GACTGTTGCA 


3360 


TGTTCCTCAA 


AGAAAGCAAT 


ATCAAAATAA 


TATTCACCTG 


TTAATAATGA


CATTTTTTCA


3420 


TATTCTAAAT 


AAAAAACATT 


CCGTCCTTTT 


TTCCAAGGAA 


TTTGCACGCC


ATCTAACAAC


3480 


GTATTTAAGC 


CACAGACATA 


ATGATTGTCC 


ACTGTTCGAA 


TAGCGACCCC 


ACAAACAGGC 


3540 


TTCTTAATTT 


GGTCATTTTT 


GACAGTATAT 


TCAATTTTTA 


CAATCACTTT


TTCATCTTGC


3600 


GTTACCATTT 


CTAATGGTTC 


ATAATCCGCA 


TTTAAAAGTT 


CCGCATGATC 


CACTTCAATA


3660 


ATATCTGGTG 


TTGGGGTTTC 


ATCTTTCATG 


GGTCtTCGCG 


GTCAACCGTT 


TTAATTGATT


3720 


TCTTTTTCAA 


AAAGTTTTCG 


TAGTTGGTCG 


TCACGTCCAT 


CGTATCTCCA 


TATTCAACGA 


3780 


CTTTCCCGTG 


TTGCAACCAA 


AAAGTTCTGT 


CACAAAAACG 


ACGAATGGAG 


TTTACATCGT


3840


GAGAAACAAA 


TAAAATTGTT 


TTACCAGAAT 


TTTTAATTTC 


CGTAAATTTT


TCCATACATT


3900 


TTAATTGAAA 


CTCTAAATCC 


CCGACAGCCA 


ACGCTTCATC 


GACAATTAAA 


ATATCGGGAT 


3960 


CCACGTTAAT 


GGcTACCGCA 


AAAGCCAAGC 


GGACAAACAT 


ACCACTAGAG


TACGTTTTAA


4020 


CTGGTTGATA 


CAAATGATCG 


CCGATGTCAG 


CAAAATCAAT 


GACATCTTGG


ACACGCTCTT


4080 


CCATCTCTTC 


TCGGGAAAAA 


CCCAAGACCA 


TTCCATTTAA 


GAAAATATTT


TCATATCCTG


4140 


AATATTCAGG 


GTTGAAACCT 


GAACCTAATT 


CAAGCmACGC 


TGAAATTTTC


CCATTAATTA


4200 
CCATGGACCC


GGAGGTTGGT 


GTTAGTACTC 


CTGTAATAAT 


TTTTAACATG 


GTTGATTTCC 


4260 


CAGaGCCATT 


TTcGCCGACA 


AAmCCAaTCA 


TCTCGCCtTT 


TTTAATTTGc 


ATCGTGrCmT 


4320 


CGTCTAACGC 


ATAAAACAAA 


TCGTGGTAGG 


ATTTTTTGGT 


TGGACTAAGC 


GCTTCTTTAA 


4380 


ACCGATCGGA 


CGGCTTTTTA 


TACATGTTGT 


ATGTTTTGGT 


AATATGTTGG 


ATATCGATTG 


4440 


CATATTCTGA 


CATGTCGTTC 


TCTCCTTATA 


GTACATCTGA 


AAAATGTGGT 


TTTAAGCGTC 


4500 


TAAACACGGT 


TGAGCCAATT 


AATAGTAAAA 


CAATAGTAAC 


TCCCCAGAAA 


TATAAGCTAT 


4560 


ATTGCCAATG 


TTCCCAGAAC 


CATGCACCGC 


CTAAATAAGA 


TTcGCGGTAT 


CCTTGAACAA 


4620 


TATAATATAA 


TGGATTGATT 


TTTAAGATTG 


TCGCAATCAT 


TCCAGTTGGT 


TGCCATAAAA 


4680 


TTGGCAATGT 


CCACATAACT 


GTTTGCATAA 


TGATATTGAT 


AAATTGCATA 


ATATCAGGTA 


4740 


AAAATGGTTG 


TGTTGAAGCA 


GTAATCCACG 


TAATTGCTGT 


TAAAAAGACT 


AATAAACAAG 


4800 


CTAAGTAGTA 


AATTAATTGC 


AAACTGTACA 


GACTTGGGTA 


ATAACCGTTA 


GCTGTAGTAA 


4860 


TTACAAAACC 


AATTAGAATG 


AAAAACAAGT 


GGGTGTACAA 


GTTCGATAAA 


ATTTTAGATG 


4920 


TTGGTAAAAT 


TTGTACATTA 


AACACGACTT 


TTTTAACTAA 


ATAACTATAT 


TCACGAAACA 


4980 


CATTCGTGGC 


TGATAATAAC 


GAATCTGAAA 


AGAAGAACCA 


AGGTACCATC 


CCTGTTACTA 


5040 


AATAGACAAT 


GAATGGCATA 


TTGTCGCCAG 


CTCTAGAACG 


AAGGCCTACA 


GAAAAGACAA 


5100 


ACCAATACGT 


CAAAACAGTT 


ACTAATGGCG 


TTAAAAACGC 


CCAGACAATT 


CCTA 


5154 



(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 

ACTGCCGCTA AACAAGAATT TAAAGTAGTA GTTCAACAAG AAATGCCTTC TGCAGATTTA 60 

TCTTTAGGAA CAGATACAAT TAGTTTTACG GCACTGAATA ATGCGTATGA AGGAATTTAT 120 

CGTTTAGATG ATAAAAGCAA GCCGCAACCA GCAGGGGCTA AAGAAAAAGT GCAAGTCAGT 180 

GATGATGGTC TGACGTACAC AGTGAAACTA AGAGAAGAAG CAAAATGGTC CAATGGTGAT 240 

CCTGTCACAG CTGCCGATTA TGTTTTTAGT TGGCAACGTA CAGCAGATCC TCAGACTGGC 300 

GCTGAGTATG CTTATTTTCT AGAAATGATT GAAAACGGTG CAGACATTGT AGCTGGTAAG 360 

AAACCAGTCT CTGAATTAGG CATTAAAGCA AACGGAGATT ACGAGCTAGA AATTAAACTG 420 

GCAAAACCAA CACCATATTT TGATTATTTA CTGGCCTTTC CACTATTTTT CCCGCAACAT 480

CAAGCAACCG TAGAGAAATA TGGCAAAGAT TACGCTGCTT CCAGTGAAAA AGCAGTCTAT 540

AATGGTCCGT TTGTCTTAGC TAACTTTGAA GGGGCTGGCT CAGATACCAA TTGGACTTTG 600 

GAAAAAAATG AAAACTATTG GGATAAAGAC AATGTCAAAT TAGATAAAAT CAATTTTGAT 660 
GTTGTCAAGG AGGCGCCAAC CGCCTTGAAC TTATTCCAAG ATGGCCAGGC AGACGATGTT 720

ATTTTAAGCG GCGAATTGGC CCAGCAAATG GCGAAGGATC CAGAATTGGT TATCGAAAAA 780 

GAAGCACGAA CCAGCTATTT GGAATTTAAT CAGCGAGATA AAAATTCACC GTATAATAAT 840

GTGAATTTAC GCAAAGCAAT TTCAGCAGCG ATTGATCGCA ACGCATTAGT AGATAAAATT 900 

TTAGGCGACG GCTCAGTAGT CGCTACCGGC TTAATTCCAG AAGGGATGTC GTATTCGCCG 960 

ACAGATGACA CAGATTTTGC TGATGAAAAT AAAAAAATTG TTGAATATAG CCCAGAAAAA 1020 

GCAAAAGAAT ATTGGGCAAA AGCGAAGAAA GAATTGGGCA TAACAACACT CAAAATGGAT 1080 

ATTGTTGCCG ATGATGTCGA TTCTACGAAA AAGTTAGCGG AATATATTCA AGGTACGTTA 1140 

AAGGATACTT TAGAGGGCAT TGATGTAACA GTGAGCCCAG TGCCGTTTTC AGTTCGAATT 1200 

GATCGAGGTA GTCGCGGCGA TTTTGAGACA ATCTTAGGTG GTTGGGCTGC GGATTACGCA 1260 

GATCCCAGTA GTTTCTTAGA TCTTTTTGTT ACTGGTAATA ACTATAATCG CGGGCGTTTT 1320 

TCAAGTAAAG CCtACGATGA GTTGATTGAG GCTTCAGCTA CAAGAGATGC TTCAGATCCT 1380 

GAAAAACGTT GGGAAGACAT GGTTAAGGCT GAGAAGTTAT TGATTGGTGA AGAGACCGCT 1440

TTAGCACCAC TTTATCAAAA GGCGACTGCC CATCTACGCA GTAAAGAAGT GAAAGGCGTC 1500 

GTTGCCCATG GTGCTGGCGC ACAATATGAT TATAAGTGGA CATATGTGAC AGAAGAGTAG 1560 

TTAAAAAGAG ACACAGAATA GTCAAATCGA CTGTTCTGTG TCTCTTTTAA TCAAAATATT 1620 

TCTGCTGTTC TTCTTTTAAT TTTTCTGGGG TTACATCTTC ACCAGTAGGA TCATAAATAG 1680 

TGACGCCCAT TAATGTTTTA TTGAATTGTT CACGAATGGT CCGCAAATAT TCTTGTCGCA 1740

AATCGACACG AAGAGCTTGT TCTTCAGATG TTAAACCTTG TTCACGCTCT TTTTGTGCAA 1800 

GCGTATTAAT TCTTTTTAAA ACCTCTAACA TTTCTCATTT CCTTTCTATA ATAATTCTTT 1860

TGCTAGGCGT TCAAACGATT GAAAGCGCGT TTGTTTGTCA TAACCTGGGG TGATTAGCAT 1920 

TGCTTCTGTC AACCCTAGTt CtTCtTGTTG GTGGAAAAAC TCCGCAACTT CTTGGCTACT 1980 

GCCAATGACA AAGCGTTGTT TATTTTGTTC TAAAACCAGT TTGTCCATTT CACTGTAAGG 2040 

GAAAGCTAGG GCTTCTTCTG GCGATAAAAA TTTATTTGGA AGCCCATGTT CGGTGGCAAA 2100 

GATATAATGC TCGAAACTAG CCGCATGGTA TTCTGCTTCT TCTTTCGTGT CGGCGGCAAA 2160 

AGTGAAATAC GCAGCGCTCA CTTGCGGCTC TTGTTGTTCA GTTACTGGTC GGAAATGCTT 2220 

GTAATATGCG GACAATACTT CTTTGGGCAA ATGTCCATGA ATAAACTGAG CAAAGGAAAA 2280 

ATTAATGCCA TATTCACCAG CTGAACGTGC GCTTTCTTTG CTAGCACCTA GTAACCAAGG 2340 

TTGTGGTAAT AGAACTTCCC GCGGAACAGC TGAAATTTTA GAATATAAAT GATCTTCAGG 2400 

AAATTCCTCT CTTAAAAAGA GTAAAATTTC AGGTAATTTA TCAAATTGGT CTAGCTGTGG 2460

TGGCTGACCA TTATTTAACG CCAACGTAGT TAAAGGGCTG CCGCCAGGCG AGCGGCCCAT 2520

GCCTAAATCG ATTCGATCAG GATATAAAGC TGCTAACGTT TTAAACGTTT CTGCGACTTT 2580

AAATGGCGCA TAATGCATGG TCATGATGCC ACCCGCGCCA ACACGAATTT TTTCGGTGGC 2640
ACTAGCAATA


TGTGCAACTA 


AAATTTCTGG 


CGTTGCACTG 


GCAAAGGCGT 


CTGAGCCGTG 


2700


ATGTTCGGCA 


TACCAAATTC 


TTGTGTAACC 


AAGTTTTTCC 


GCTAATTGAG 


CGAGCTCTTT 


2760


GGCTTCTTGT 


AATGTGTCGC 


TTGCTGATTG 


ATTTTTAGGA 


ACCGTTAATT 


GATCTAAGAT 




ACTTAATTTC 


ATGTATGTAG


TCTCCTTCTT 


GGTTATTTTT 


TTGAAGCTAC 


CGCTATTATC 


2880


ACGCAAGCAT 


GTCAAAAAAA 


ACAGTACGCA 


CTTTTTAATT 


CCCTAGTGAG 


CGAAAAGTAA 


2940


CAAATAGATG 


TAGCCAAGAT 


CGATGATAAA 


AAATTCTCCG 


CTATGGCTTT 


TTGATGCTTT 


3000


GTGTTAAAAT 


GGAACATATG 


TACGCAATCA 


GCAATTGGAA 


AGGTGACAAC 


ATGTACGAAT 




ATATTATTGG 


AAAAGTAACG 


TTTGTTAGCC 


CATACTATAT 


TGTCGTTGAA 


ACAAATGGCA 


3120


TTGGCTATCA 


GATTTCAGTC 


GATAACCCGT 


ATCGCTATTC 


AGGAAAAATG 


GATACGGATA 


3180 


TTAAACTATA 


TCTTCACCAA 


GTCGTGCGGG 


AAGATGCACA 


ACTTTTATTT 


GGTTTTGGCA 


3240


GTTTAGAAGA 


AAAACAATTG 


TTCTTAAAAT 


TAATCAGTGT 


CTCTGGAATT 


GGACCTAAAA 


3300 


GTGGCTTAGC 


TATCATGGCT 


TCTGTGGCTG 


ATCATGGTGG 


TTTAATCAAT 


GCAATTGAAG 


3360 


GGGAAGATGT 


TACATATTTA


ACGAAtTCCC 


AGGTGTTGGC 


AAGAAAACAG 


CCCAACAAAT 


3420


GATTTTAGAT 


TTAAAAGGAA 


AATTAGGCGA 


ATTAGAGTCT 


TCCGAAGCAG 


CGGTTGCGGC 


3480 


CATGACCGCA 


ACCGAAGCGG 


TGACAACAAG 


CAATCAAGCA 


TTGGCAGAAG 


CGTTAGAAGC 


3540 


ATTAAGTGCC 


CTAGGCTACA 


GTGACCGAGA 


GATTAAGCGT 


ATCACTAAGC 


AATTGGAAGC 


3600 


GTTAGGTGAG 


ACAACGACAG 


ATGTTTATTT 


GAGCAATGCG 


TTAAAATTCA 


TGATGAAACG 


3660 


CTAAGGAGGA 


ATAACGATGA 


CAGAAGAAGA 


ACGGTTGCTT 


TCAGCTGCAA 


GTCAAGAAGC 


3720 


AGAAGCTTCA 


ATTGAAAAAT 


CGTTACGTCC 


ACAATTTTTA 


GCACAATATA 


TTGGTCAAGA 


3780 


TAAAGTAAAA 


CAAGAACTAA 


CGATTTATAT 


TGAAGCGGCG 


AAAAATCGGA 


ATGAAGCGTT 


3840 


GGATCATACC 


CTTCTTTATG 


GTCCGCCAGG 


TTTAGGGAAA 


ACGACAATGG 


CGATGGTCAT 


3900 


CGCCAATGAA 


ATGAATGTGA 


ACATTCGTAC 


CACAAGTGGA 


CCAGCCATCG 


AACGGGCGGG 


3960 


AGATTTAGTC


GCTATTTTGA 


ACGAATTAGA 


GCCTGGTGAT 


GTGTTATTTA 


TCGATGAAAT 


4020 


TCATCGTTTG 


CCGCGAGTTG 


TGGAAGAAAT 


GCTGTATTCC 


GCAATGGAAG 


ATTTTTACAT 


4080


TGATATCATG


GTCGGTCAAG 


GAACCACGGC 


ACATCCCGTT 


CATTTTCCGT 


TACCGCCTTT 




TACTTTAGTG 


GGCGCTACCA 


CACGGGCAGG 


GATGCTTTCA 


GCGCCATTAC 


GAGATCGTTT 


4200


TGGGATTATT 


TCTCACATGG 


AGTATTATCA


AGAGCAGGAT 


TTAAAAGAAA 


TCGTTCTCCG 


4260


TTCCGCGGaT 


ATTTTTCAAA 


CGGAGATTTT 


TGAGGAAGGT 


GCCTTTGAAA 


TTrCGCGCCG 




CTCAAGArGA 


ACCCCCCGAA 


TTGCTAACCG 


TTTGTTAAAA 


CGTGTCCGTG 


ATTTTGCCCA 


4380 


AGTCCAATCA 


GATGGCAAAA 


TCGACCGAGC 


GATTGCAGAC 


AAAGCGTTAA 


CCTTACTGCA 


4440 


AGTGGATCAT 


CAAGGCTTGG


ATTATGTCGA 


TCAAAAATTG 


TTGAAAACAA 


TGATTGATTT 


4500 


ATACGGCGGC 


GGTCCGGTTG 


GTTTAAGTAC 


CTTGTCTGTA 


AATATCGGTG 


AAGAAACCGA 


4560 


AACGGTGGAA 


GATATGTATG 


AACCCTATCT 


TATTCAAAAA 


GGTTTCATTA 


AACGAACACC 


4620 
ACGAGGACGA


ATTGCAACGC 


CATTTGCGTA 


TGCACATTTT 


GGGTATGATT 


ATTTAGAGGG 


4680 


TCGTAAAAAT 


TAAAGGGTTA 


AATAAGAGAG 


AAGGGCCTTC 


AACAGGATAG 


AGTATGTTGA 


4740 


AGGTCTTATT 


TTTCATTTTT 


TCAAAGTTTA 


TGTAACGGAA 


TAAATATTCT 


TTTTTTGATA 


4800 


GACCACTATA 


AGTATATTAA 


CGAATAATTT 


TTAAAAGTTT 


CATCTCTTGA 


TAATATAAAT 


4860 


CTTTGTAAGA 


TAAATCAGAC 


AGTTTGCGAG 


TAACTGTTTT 


TGATATCGGT 


TTGTTTTCAA 


4920 


AACCATTTAA 


AGCGGACAAA 


ATATATAACT 


TAGTTGTGCA 


GGTATTTTTA 


AAATGATTTC 


4980 


GGATGTAGCC 


AAGAAGAGCG 


ATTGCTGAGG 


GTGTTTCAAT 


ATTTTGATTA 


TCTGACGGTA 


5040 


GAGAACACAT 


AGAGCAAACC 


TGATAATTCA 


ATGTTTTTAA 


ATAGTTGCTG 


TATGAAAATT 


5100 


TACCTGGTGG 


ATCACCTTCA 


AAATCATTTA 


TAAAAAAGAA 


GAACGCTTTA 


TTTAATAGAT 


5160 


TGCCAGTGGG 


CTTGCTCAGT 


CCATTTAAAG 


TTATACTGCC 


TGTAGATAAA 


GGTTGTTCGC 


5220 


TAGCTAAGTA 


GATGTATTCT 


GTCAATTTTA 


TCACCTCTTT 


CTTATCGATA 


TGAATTATCG 


5280 


TATCATGATA


TAATCAAATC 


CGCTATATTT 


TAAGGTGAAA 


TGCTAAATGG 


AAACAAAAGA 


5340 


AAGCAAGCCT 


TAACCGCTTG 


CTTTCTTTTG 


TTTCACGTGA 


AACGTTTGTT 


CTTATTTAGA 


5400 


AGAGCGGCAT 


ATTTCAGCTG 


TTGTAGTTAT 


ACTGTGTTTG 


TAATCGCTTT 


TAAATTGGAG 


5460 


GGATCAATGT 


TGAATAAAGA 


AATCAAAGGG 


GCATTAATTG 


TTTCTTGCCA 


AGCGTTGGCA 


5520 


GACGAACCGT 


TGCATAGTTC 


TTTTATCATG 


TCCAAGATGG 


CTTTGGCCGC 


TAAACAAGGA 


5580 


GGGGCTAAAG 


GTATTCGAGC 


AAATTCTGTG 


GTAGACATTC 


GAGCAATCCG 


GAAAGAGGTT 


5640 


GATTTACCAA 


TCATTGGGAT 


TATTAAACGA 


GATTATCCTG 


ACGCAGAAGT 


ATATATTACT 


5700 


GCAACTATGC 


GTGAGGTAGA 


TGAATTAATG 


GCTGTTCAGC 


CAGAAATTAT 


CGCTCTCGAT 


5760 


GCAACCCATT 


CGAAACGACC 


AAATGGCGAA 


ACGTTAGCCA 


CATTTTTCAA 


AAGAATAAAA 


5820 


GAAAAGTATC 


CTAACCAAGA 


ATGGATGGCG 


GATTGTTCCA 


CGCTGGAAGA 


GATGATTGAA 


5880 


GCAGCGGAGT 


TAGGTTTTGA 


TTATATTGGC 


ACAACGTTAG 


TCGGATATAC 


GCCGCAAAGT 


5940 


GAAAATCAGC 


GGATTGAACA 


AAATGATTTT 


GAACTGTTAA 


GAGAAGCAAA 


GAAGCATTTT 


6000 


ACTACACCAA 


TCATTGCAGA 


AGGAATGATT 


GATACGCCAG 


CAAAAGTCCA 


ACGTGTCCTG 


6060 


GAGCTAGGTA 


CTTATAGCGT


GGTGGTGGGA 


TCTGTTATTA 


CACGACCGCA 


AAAAATTACT 


6120 


GAAAAATTTG 


TGGAAGCTAC 


GAAAATAAAA 


CAGAATCAAG 


CATAAACTAA 


ATACACTATC 


6180 


GATACACTCG 


TTATTTTTTG 


ATCAATACAC 


TTCGATACGG 


TCGCTTTTTT 


TCTAGAGAAA 


6240 


GTTGAATCTT 


TCAATAATAA 


AAAGGGATAC 


ACTCCATTTG 


GCATAGTCCT 


TGCTGATAAT 


6300 


AAATCAGTGT 


ATAAAGCGCT 


ATCATTTTAT 


AGGAGGGGTT 


TTATGAAGGG 


TTTATCAAAA 


6360 


AAGAAACGGG 


TGTCTACTTG 


GTTAGCGTTA 


GGAATCACCG 


TAGTCAGCTG 


TTTTGCGTTA 


6420 


AGCAGGGAAG 


TGCAAGCAAG


TGTTGAAAGA 


ACAAAAGTTG 


ATGAATTTGC 


AAATGTTTTA


6480 


GATGTGAGTG 


CATCACCAAC 


CGAACGGACG 


AATGGCGTAT 


ACGATACCAA 


TTATTTTAAT 


6540 


AATTTTTCTG 


ATTTAGGTGC 


ATGGCATGGC 


TACTATTTAC 


CTGAAAAAAG 


CAATAAAGAG 


6600 
CTACTGGGTG


GTTTTGCGGG 


GCCATTGATT 


ATTGCGGAAG 


AATATCCAGT 


AAACTTGGCG 


6660 


GCAAGTTTAA 


ACAAATTAAC 


GGTCAAAAAT 


AA/W^AAACGG _ 


GAGAAACCTA 


TGATTTAAGC 


6720 


CAAAGCAACC 


GCATGGACCT 


GTCTTATTAT 


CCTGGGCGCC 


TAGAGCAAAC 


CTATGAATTA 


6780 


GACGATTTAA 


CGATTCATTT 


AGCTTTAATT 


TTTGTCAGCA 


ATCGAACGGC 


GCTTATCCAA 


6840 


ACGACACTTG 


AAAACACTGG 


TGAAGAGCCC 


TTGTCACTTG 


GAGCAAGCTG 


GACAGGTGCG 


6900 


GTCTTTGACA 


AAATTCAAGA 


GGGAACGGAA 


ACCTTAGATA 


TTGGCACTCG 


TTTAACTGCT 


6960


AAAGACAATG 


ACATTCAAGT 


GAATTTTGGT 


GAAGTCAGAG 


AAACGTGGAA 


TTATTTTGCT 


7020 


ACGAAAGACA 


CAAAATATAC 


GATTCATCAT 


GCGGATAAAG 


TTTCAACAAA 


AATTGATAAT 


7080 


CGGAATTATA 


CAGCAACCGC 


TGAACCAATT 


GAATTGAAGC 


CTAAACAAAC 


GTACAACACC 


7140 


TATACGACAG 


AAAGCTATAC 


TTTTACAAAA 


GAAGAAGAGG 


CAAAGGAACA 


ACAACAAGCA 


7200 


CCCGAATATA 


CCAAAAATGC 


GGCGCGCTAT 


TTCAAAGAGA 


ACAAGCAAAG 


ATGGCAAGGA 


7260 


TATCTAGATA 


AAACGTTTGA 


TCAAAAGAAA 


ACAGCAGAAT 


TTCCTGAATA 


TCAAAATGCG 


7320 


CTAGTCAAAT 


CGATTGAAAC 


GATTAATACC 


AATTGGCGAA 


GTGCGGCAGG 


TGCCTTTAAG 


7380 


CATGACGGGA 


TTGTTCCGTC 


CATGTCTTAT 


AAATGGTTTA 


TTGGTATGTG 


GGCTTGGGAT 


7440 


TCGTGGAAAG 


CGGATGTAGC 


AACGGCTGAT 


TTTAATCCTG 


AGTTAGCTAA 


AAATAATATG 


7500 


CGGGCCTTGT 


TTGATTATCA 


AATTCAAAAA 


GATGATACCG 


TACGTCCACA 


AGATGCAGGA 


7560 


GCGATCATTG 


ATGCTGTCTT 


TTACAATCAA 


GACAGTGCGC 


GTGGTGGTGA 


AGGTGGCAAC 


7620 


TGGAATGAAC 


GAAATTCTAA 


ACCACCATTG 


GCTGCATGGG 


CAGTTTGGCA 


TATTTATCAA 


7680 


GAAACCAAAG 


ATAAGGAATT 


TTTAAAAGAA 


ATGTATCCCA 


AACTTGTGGC 


TTATCATAAT 


7740 


TGGTGGTATA 


CCAACAGAGA 


CCACAATAAA 


AATGGGATaG 


CAGAATATGG 


AAGCATGGTC 


7800


AGTGATGCTC 


ACTGGCAAAA 


AGACGACAAG 


GATCAAATCA 


TTAAAGATAA 


AAATGGCCAC 


7860


CTAAAGTGGA 


TGATGATGCT 


GTTATTGAAG 


CAGCCGCGTG


GGAAAGTGGC 


ATGGATAACG 




CTACACGGTT 


TGACAAAGAA 


GGTGTGGGCA 


AAGGCGACGT 


TGGAGTTAAA 


GTTTTTGAAA 


7980 


ACAAAAATAA 


AGGAAAAGTA 


GTGGGGTATT 


CGATTAATCA 


AGAATCAGTG 


GATTTGAATG 


8040


CGTATCTCTA 


CGCTGAAAAG 


GGCTATTTAG 


CTTCGATAGC 


AGAAGAACTA 


GGCAAAAAAG 


8100 


AGGACTATAA 


GAACTATCAA 


AAAGAAGCGA 


AGAAACTAAA 


AAAATATATT 


CAAGAAAATA 


8160 


TGTTTGATGA 


AAAAACAGGC 


TTCTTCTATG 


ACTTACAAAT 


AAATGAAGAC 


GGCTCGAAAA 


8220 


CAAAATTGTT 


AGTCAATCGT 


GGCAAAGGAA 


CAGAAGGCTG 


GTTGCCGCTC 


TGGGCAAAAG 


8280 


TAGCGACCAA 


AGAACAGGCG 


GCGGCAGTTA 


AGAAAAACAT 


GATGAATCAA 


GAAATGTTCA 


8340 


ATACCTTTAT 


GCCATTCCCA 


ACGGCATCCA 


AAGACAATGA 


AAAATTTGCT 


GCAACGAAAT 


8400 


ATTGGCGGGG 


ACCAGTTTGG


CTTGATCAAG 


CCTTGTTTGG 


TGTAGAAGCT 


CTTCAAAACT 


8460 


ATGACTATAC 


AAAAGAAGCG 


AAAGAAATGA 


CACAAAAACT 


CTTCTTACAT 


GCCGAAGGAT 


8520 


TGATGGGAGA 


AGGCCCTATT 


CATGAAAATT 


ATGATCCACT


GACAGGGAAG 


GGCTTAAGTA 


8580 
CGAAAAACTT


TAGTTGGTCA 


GCAGCAGbb 1 


Tt /~* fjy tv rp rn fji tv rp rp 

AC* I A 1 1 1 A 1 1 


PTBTaaaaaP 
b 1 A 1 AAi-VAAV-* 




8640


GTAATAACCC 


CACAACGCAA 


ACTGCTTTTG 


AbAI AAAA1A 


r~'7\.7A7Aa7AP7A fT 

b AAAAAb Au b 




8700


ATATCTCTCA 


AGAGCTGTTT 


TTTTCTTCAC 


TTGCTACAAT 


b 1 b 1 l 1 bbb 1 


P T P a m T T a a T 
b 1 bAl 1 lrtnl 


8760


TAAATGTAAT 


TTATTTGTTT 


AAATAAATAT 


ATTTCATATA 


ATTAAATAAA 


a t a THTn a a a 


8820


ATAAACGATA 


TGTACTTTTA 


TTAAAAATGT 


TATTTAATAA 


AAATGAAATA 




8880


GAAATTGATT 


GTACAATGGT 


TAATGAAGAG 


TGGGTTTAAC 


GGAATAGTGT 


/\/i 1 rtnucnn 1 




AAGCTGATAT 


CAAAGCCAAA 


CATATGCAAA 


ACCTCCTTCT 


CTGTTAGCTA 


1 .tt 1 1 b Ab 1 i*- 




TATTATAGAA


CATTTCTTTT 


GGTTTTTTCA 


AGGGAAGTTA AGTATATAAG 


1X11 ini. t3/\ 1 


9060


TATCAAAATA 


TTACAATATT 


ATAATTACGG 


AAATAATAAA 


AATGAATATT 


I i gAAAl Al i 


9120


TATTAATGTT 


TTCGCTGATT 


TTGTTTAATT 


TGTACAGGTA 


TTGTTGGATC 


rp rp m 71 71 f*> rp rp 7\ A 

ill AAo I 1 HA 




TAAAAAAATA 


ATATTTAATA 


TTATTTTTAT 


TATmAAAGGG 


GGATATTATG


bAlblbb 1 Ab 


9240


GTGCTCTAAA 


ATATTTTAGA 


AAAAAGAAAA 


AAATTGCTCA 


GAAAGATGTT 


TTGCCAAAAA 




AGGGGAAACA 


AACCTATCGA 


AGAATCGAAA 


TTGGTGAAGC 


CAAGCTAAGT 


TACGAGGAT 1 




TAATGACGGC 


ATTACAGTCA 


TTGGGGATTA 


fTTTTAATGA 


ATTTTTTTTT 


ATGGTATCTG 


9420


ATACGAGAAT 


CATGGCTTCT 


TCAGAAGTCA 


AACATCAAAT 


TGAATGTTGT 


CAAATGGGGT 


9480


TGAACAATAC 


AAGTGAAAAA 


AAGAATCTGA 


TTCATTATTT 


TTATCAATTA 


GAbbbbAAl b 


9540


CTCATAAAAA 


TGCGTTGGAA 


ATGTCTATTT 


ACACAGATAT 


TAAATTAACG 


rpmrn t\ r>m 7\ A rpP 

TT I Abl AA1 b 


9600


ATTGGGAGGA 


AATTCCAGAA 


TTTGATGAAA 


CCGATCGAAT 


GGTAATTCTC 


bbbb I 1 Aib 1 




CTAGCAAAAG 


CTACTATACA 


TACTACGATT


ATCAAATGGT 


GACTAATCCT 


bbbbbb 1 Ibi 


9720


TTTCTGAAAA 


TGAAGTCCTT 


CAGATTCTTG 


AGCAGATGTT 


TCCTGTCAAA 


bAl bb 1 bAAl 


9780


TACGTGATAC 


CCAAACACTA 


AATGTTGCCT 


ATGGTTTTTA 


CCTAAATATC 


Al 1 Ab lbbbb 




AACTTTATAA 


GAAAAATTAT 


GCGAAAGCTC 


GGGAATATTT 


AGCGCTAGTG 


Ab 1 b I 1 AbbA 




CGATTCCTGC 


TGAAATCTAC 


TATATTCATT 


TTAACTTACG 


TTACTTAAAA 


AAbb 1 1 Av^Lj 1 




ATTATCTGTA 


CACTGGCAAA 


ATGAGATATT 


ATAAAGAAGT 


AATCGCCGTT 


Al IbAlAibA 




TTGAAAGCTT 


TGGGGATGCA 


CGTTTGGCGG 


AAGGCATGAA 


AAAAGAAATG 


b 1 1 bAbb 1 


10080


CGGCTGGTCG 


GACATTTAAT 


TTAGAAAAAG 


GTCAGTTTCC 


ATTGAATATT 


b 1 b Ab 1 bAAA 


10140


AATAAAAAGC 


CCCCCcCCCC 


CGCAAGGTCA 


AACTTGCGGG 


GGGTGCTGAT 


PTTaaapaaT 

bl lAAAbrtrVl 


10200


7\ >\ mom rn Ti fr»^*m 

AATGTTATCT 


CTAATCATAT 


CTAAATCTTT 


AATCAAAATT 


TTGCGGTTAC 


b 1 Ab 1 1 b A/i 1 


10260


TGCCCCTAAC 


TCTTTTAGTT 


GCTGCATCAT 


CCGGTTAACG 


CTACTAGCAG 


AAGTTATCCC 


10320 


ACAAAAATGC 


GCAATTTCTT 


CATTGGTCAC 


CACGAAATCA 


ATTAAAATAC 


CTTCCTCTGT 


10380 


TTCTACGCCA 


AATGAATCGT 


ACATTTCATA 


GATTTGTGTA 


CAAACGGCCC 


CAAATTTGCC 


10440 


ATTCATCAAC 


ATTTGCTGCA 


TCTTTTTCAT 


GGAATGCATT 


AAACGGTCCC 


GATAGTATTC 


10500 


TTTGACATAC 


ATTTGCAAGT 


CTTTACTATT 


ATTGATATCC 


CGCCAAAATT 


GAACACGGTC 


10560 
GATTTGGTAA AGTTCTGCTT GTGGAGATTC GATGCGAATA TTAAAAGGTG CATCAATAAA 10620

TTCTGAATAT TCATCACGTA GCAATGAAAC AATTTCCAAA TTGTTGATAT AACGTAAATT 10680 

AAATTCACGA CCGTCTTTGG AAATAACGCT AGTTTTGATG ATGCCACTTT TTAAAATATA 10740 

GGCATAACGA TCTTGCAGTC CTTCATAAGT TAAATAATTT TTCTTCTTTT TTACAACGAC 10800 

TCGAAAGGAA TGTTCATCTA AGTAGTCTTG TAAAACATTT CTCATTGGCG GTAACTCCTC 10860

AATTATCTCT AAATATTTTT GTTTTTGCCT GTTGGGCTTC TTTCTTATTT AAGCGAATTG 10920 

GaCATCATAT GTCAATAACa AATGTTGAtA AAAAAgGTCa ATTAAACgCT ATTTTGAAAA 10980 

ATAAAATCAG CnTTTA 10996 
(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 



CGGCCCAnTT 


TTTTAAATGC 


nTnCATTCTT 


TTAAGGCTTT 


CCAATCCCAT 


CATTATCATT 


60 


TGnATCTGTG 


ACCATCGTGG 


CCACTTTTTG 


AATGGAgGAC 


TCGCATTGCC 


CaTTGCGACG 


120 


CCAATCCCTG 


CATGGCTTAA 


CATTTCTCGG 


TCATTTTCGC 


CGTCCCCAAA 


GGCAATGATA 


180


TTTTTTGCAT 


CAATACCCAC 


ACGTTTAGCT 


AAATACGTTA 


ACGTAGCTGA 


TTTTGAGCCA 


240 


TCTTTAGGCA 


CAACATCTAC 


GCTATGTTGA 


TGCCAGCGAA 


TAAAGCGAAA 


TTCTGGGTAT 


300 


TCTTCTTCAA 


ACGTACACTG 


ATCTTCTGCG 


TCATAAAAAG 


CCAGCGCTTG 


GTAAATATCA 


360 


TTTTGACGAT 


AAAAGTGTTC 


ATCATAAGCA 


GGTGAATGAA 


AATGAAAAGA 


GTTCATCGCA 


420 


ATTTCCATTT 


TTTCAGCACG 


GTGCTGGkTS 


CTCTTCTTAA 


TATCATCTAA 


GCCCACACAA 


480 


GCAAATCCTA 


AGCCGCGTTG 


ATCACTGTCT 


TGTGCCAAAC 


GTTCCAACGC 


TTCAGCGTGC 


540 


AATAAATTTT 


GATAATATTG 


CTCATGATCC 


ACAAAAGCCG 


CGGCGCCATT 


ACATAATACG 


600 


TAATTGGTAA 


AATCTAAATC 


CCAAATCACC 


TCTTGCGCCA 


TATAACGACT 


TCTGCCAGTT 


660 


GCCAGCGTAA 


CTAAGTGTCC 


TTGCTCCCGC 


AATTTTGCTA 


GCGCTTCTCT 


CGTGCTATCC 


720 


AATGCTTTCT 


TCTCCGAATT 


TAACAACGTC 


CCATCAATAT 


CAAATGCAAA 


AAGTTTTCTT 


780 


TCCATCATTT 


CTCCtTATTC 


GCTCGCTTTC 


GCTAAATATT 


TACTGCCTTC 


TCGTTGTGCT 


840 


AAAGGACTCA 


ACACTAATTC 


TTTCATCAAT 


TCCTCGACCC 


ATTGCGGATT 


CGTTTTACTA 


900 


TATTGGGCGT 


AATGGACCAG 


CCAATCGCTT 


TTTGAATGAA 


GACTCCTCTG 


nAGTCnA 


957 



(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204:

AGTCATTnGT AATCTTTTAA ACTATCGATT AGTCGGGATT GTGTTAGGTA GTTTACTTGG 60 

TTTTGCCGGC GCTTATGTTT TCACAACTGC GCTACGTTTC TATAATCGGG CCTACTCAAT 120 

TGATGAAACC TTTGAAAATC AACAAGAAGA AAGAGGCGTT TTATGAAAAT AACTTATCCC 180 

TTGCCACCAA ATCTACCAGA ACAACTTCCT CTATTAACGA ACTGTCAATT AGAAGATGAG 240

GCAATCTTAG AAAATCATTT GTATCAGCaG ATTGATTTAC CGAAtCAAGA AGTTCGTAAC 300 

TTAGTCTTTC GGGATGCTGT TTTCGATCAT CTTTCATTAG CAAATGGCCA GTTTGCTTCT 360 

TTTGATTGTA GTAATGTGCG TTTTGAAGCT TGCGATTTTT CTAACGTCGA ATGGCTGTCT 420

GGCAGTTTTC ATCGAGTGAC TTTCCTTCGT TGCAATTTGA CAGGAACGAA TTTTGCAGAT 480 

AGCTATTTAA AAGATTGTCT ATTTGAGGAT TGCAAAGCCG ATTATGCTTC TTTTCGTTTT 540

GCAAATTTTA ATCTTGTTCA CTTTAATCAG ACTCGTTTAG TCGAAAGTGA ATTTTTCGAG 600 

GTCACCTGGA AGAAACTCCT TCTAGAAGCT TGCGATTTAA CCGAAAGTAA TTGGCTAAAT 660 

ACTTCTTTAA AAGGATTGGA TTTTTCCCAA AACACCTTTG AACGCTTGAC TTTCTCACCA 720 

AACTATTTAT CAGGTCTTAA AGTCACCCCT GAACAAGCCA TTTATTTGGC AAGTGCCCTT 780 

GGTTTGGTGA TTACCTAAGA AAATTTTAAA AAAATAGCCG AATAACTGGT TCTGTCGGTT 840

CTTTCTTATT AAATTGTTAT TAATTTTTTG GAAAAACCAA CGGAACCATT TTACTTGTGC 900 

CTTATTTTTA GGkAAACTAT CyTAGGTTAA GCAATCTCTT TTTTTGGAAC AAGAGAAACT 960 

TTTGGAATCA TTTCCTAfAA ATCATGAAGA GATTATGACA AACGTTCTTC TAATCCAAGA 1020 

ACGTCTTTTG TCCTATACTC GATTCTACTC AAAAGGAATG TCAGTGAATG AATATAAAAA 1080 

AACAAGGCCT CGGTCAAGCG ACGGGAAAAA TCATTTTAAT GGGAGAACAC GCCGTTGTTT 1140 

ACGGCGAACC AGCAATCGCC TTTCCTTTTC AAGCGACAGA AATCACAGCC GTCTTTACCC 1200 

TAGCAAAAAC TATGCAGATT GATTGTGCAT ATTTTACAGG ATTGCTTGAA GACGTGCCCC 1260 

AAGAGCTAGC AAATATCAAG GAAGTTGTTC AGCAAACCTT ACACTTTTTA AAGGAAGATA 1320 

CGTTTAAAGG CACTTTGACC TTAACAAGTA CGATTCCCGC TGAACGAGGA ATGGGCTCAA 1380 

GCGCAsAACC GCTGTGGCCA TCGTTCGAAG CCTTTTTGAT TATTTTGATT ACGCTTATAC 1440

ATATCAAGAA TTGTTTGAGC TTGTTTCCTT AAGTGAGAAA ATTGCTCATG GCAATCCTAG 1500 

TGGTATCGAT GCCGCAGCAA CAAGCGGCGC TGATCCCTTA TTTTTTACTA GAGGATTTCC 1560 

GCCCACACAT TTCTCGATGA ATTTATCTAA TGCCTACTTA GTAGTAGCTG ATACGGGAAT 1620 

TAAAGGTCAA ACACGTGAAG CCGTGAAAGA CATTGCGCAg cTAGCTCAAA ATAATCCCAC 1680 

CGCAATCGCT GAAACAATGA AACAATTAGG TTCTTTTACT AAAGAAGCAA AGCAAGCGAT 1740

TTTACAAGAT GATAAACAAA AATTAGGTCA GCTAATGACG TTAGCGCAAG AGCAACTCCA 1800 

GCAATTATCC GTCAGCAACG ATATGCTGGA TCGACTAGTG GCTCTCTCTT TAGAACATGG 1860 
CGCTCTAGGA


GCAAAATTAA 


CCGGCGGCGG 


TGGCGGTGGC 


TGTATGATTG 


CCTTAACAGA 


1920 


TAATA&AAAG. 


ACCGCAC^AA 


CCATTGCACA 


GACTTTAGAA 


GAAAATGGAG 


CTGTTGCTAC 


1980 


ATGGATTCAA 


TCATTAGAGG 


TGAAAAAGTA 


ATGCTTTCAG 


GAAAAGCACG 


AGCGCATACA 


2040 


AATATTGCTC 


TGATTAAATA 


TTGGGGAAAA 


GCCAATGAAG 


AATACATTTT 


ACCAATGAAT 


2100 


AGTAGTTTAT 


CATTAACATT 


AGATGCCTTT 


TACACAGAAA 


CAACTGTGAT 


ATTTGATGCC 


2160 


CATTATTCAG 


AAGATGTATT 


TATTTTAGAT 


GGTATCTTGC 


AAAACGAAAA 


ACAAACAAAA 


2220 


AAAGTCAAAG 


AATTTTTGAA 


CCTTGTTCGT 


CAACAAGCCG 


ATTGTACTTG 


GTTtGCAAAA


2280 


GTGGAAAGTC 


AAAATTTTGT 


GCCTACTGCA 


GCTGGTTTGG 


CTTCTTCAGC 


GAGTGGTCTA 


2340 


GCTGCTTTAG 


CAGGGGCCTG 


TAACGTAGCC 


TTAGGATTAA 


ATCTTTCAGC 


AAAAGACTTA 


2400 


TCACGTTTAG 


CGCGACGTGG 


TTCAGGTTCT 


GCTTGTCGCA 


GCATTTTTGG 


TGGTTTTGCT 


2460 


CAATGGAACA 


AAGGCCACTC 


TGATGAAACG 


TCGTTTGCTG 


AAAATATTCC 


AGCCAATAAT 


2520 


TGGGAAAACG 


AATTGGCCaT 


GCTCTTTATC 


TTAATTAATG 


ATGGCGAAAA 


AGATGTTTCC 


2580 


AGCCGTGATG 


GAATGAAACG 


AACAGTAGAA 


ACTTCTAGCT 


TTTATCAAGG 


TTGGTTGGAC 


2640 


AATGTGGAAA 


AAGATTTATC 


CCAAGTTCAT 


GAAGCAATTA 


AAACAAAAGA 


CTTCCCTCGT 


2700 


TTAGGAGAAA 


TCATTGAAGC 


CAATGGGTTA 


AGGATGCATG 


GAACAACCTT 


AGGTGCTGTC 


2760 


CCTCCATTTA 


CTTACTGGTC 


CCCAGGCAGT 


TTACAAGCGA 


TGGCTTTAGT 


TCGCCAAGCA 


2820 


CGGGCCAAAG 


GAATTCCTTG 


TTACTTTACA 


ATGGATGCAG 


GTCCGAATGT 


CAAGGTCTTA 


2880 


GTCGAAAAGA 


AAAACTTAGA 


AGCATTAAAA 


ACATTTTTAA 


GTGAACATTT 


TTCAAAAGAG 


2940 


CAGTTAGTCC 


CAGCTTTTGC 


TGGTCCCGGA 


ATTGAATTGT 


TTGAAACGAA 


AGGAATGGAT 


3000 


AAATAAGCAT 


GATTGAAGTT 


ACTACGCCAG 


GAAAGTTATT 


TATTGCAGGA 


GAATATGCCG 


3060 


TTGTTGAACC 


TGGCCACCCT 


GCCATTATCG 


TTGCTGTGGA 


TCAATTCGTA 


ACTGTAACTG 


3120 


TCGAAGAAAC 


AACAGATGAA 


GGCAGTATTC 


AATCTGCACA 


ATACAGCTCT 


TTACCTATTC 


3180 


GTTGGACACG 


CCGAAATGGT 


GAGCTTGTAT 


TAGATATTCG 


CGAAAATCCT 


TTTCATTATG 


3240 


TTTTAGCGGC 


GATTCATCTA 


ACTGAAAAAT 


ATGCGCAAGA 


GCAAAACAAA 


GAATTGTCAT 


3300 


TTTATCATTT 


AAAAGTGACG 


AGTGAATTAG 


ATAGTTCAAA 


TGGACGAAAA 


TATGGTCTTG 


3360 


GTTCAAGCGG 


TGCGGTAACC 


GTTGGAACTG 


TCAAAGCCTT 


GAATATTTTT 


TATGACTTAG 


3420 


GTTTGGAAAA 


TGAGGAAATT 


TTCAAATTAT 


CAGCATTAGC 


TCACTTAGCC 


GTTCAAGGAA 


3480 


ATGGTTCTTG 


CGGAGATATC 


GCCGcCAGCT 


GTTACGGGGG 


CTGGATTGCC 


TTTTCAACCT 


3540 


TCGATCATGA 


TTGGGtCAAT 


CAAAAAGTAG 


CCACTGAAAC 


ATTAACTGAT 


TTGTTAGCAA 


3600 


TGGACTGGCC 


TGAATTAATG 


ATTTTTCCGT 


TAAAAGTACC 


GAAACAaCTA 


CGTTTACTAA 


3660 


TTGGTTGGAC 


AGGTAGTCCT 


GCGTCCACTT 


CAGACTTAGT 


TGATCGAGTC 


CATCAATCAA 


3720 


AAGAAGAAAA 


ACAAGCGGCT 


TATGAGCAGT 


TCTTAATGAA 


AAGTCGGCTT 


TGTGTCGAAA 


3780 


CAATGATTAA 


TGGCTTTAAC 


ACAGGAAAAA 


TTTCTGTTAT 


TCAAAAACAA 


ATTACTAAAA 


3840 
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ATCGCCAATT GCTCGCCGAA TTATCTTCAC TGaCTGGTGT GGtAATCGAA ACAGAAGCCT 3900 

TGaAAAATCT TTGTGaTTTG GcTGaATCTT ATACAGGAGC TGCGAAATCT TCTGGCGCTG 3960

GCGGGGGCGA TTGTGGGATT GTAATTTTCC GCCAAAAATC TGGGATTTTA CCATTAATGA 4020 

CTGCTTGGGA AAAAGACGGA ATTACCCCAC TGCCACTTCA CGTCTATACC TATGGTCAAA 4080 

AGGAGTGTAA GGAGAAGCAT GAATCGAAAA GATGAACATC TATCATTAGC TAAAGCGTTC 4140

CACAAAGAAA AAAGTAATGA CTTTGATCGT GTGCGTTTTG TTCACCAATC GTTTGCTGAA 4200

TCCGCTGTTA ACGAAGTGGA TATTTCCACT TCGTTTCTTT CTTTTCAGCT TCCCCAACCT 4260

TTTTATGTCA ATGCAATGAC AGGTGGTAGT CAGCGTGCAA AAGAAATTAA TCAGCAATTA 4320

GGCATTATTg CCAAAGAAAC TGGCCTTTTA GTTGCGACAG GATCTGTCTC GGCAGCGTTA 4380

AAAGATGCTA GTTTAGCGGA TACGTATCAA ATTATGCGAA AAGAAAACCC AGATGGACTC 4440

ATTTTTGCCA ATATTGGTGC AGGCTTGGGT GTGGAAGAAG CAAAGCGAGC GCTTGATTTA 4500

TTTCAAGCGA ATGCCTTACA AATCCATGTA AATGTGCCCC AAGAATTGGT CATGCCTGAA 4560

GGAGATCGTG ATTTCACTAA TTGGCTAACC AAGATTGAAG CTATCGTACA GGCCGTAGAA 4620

GTGCCTGTCA TTGTCAAAGA GGTTGGCTTT GGCATGAGCC ArGAAACCTT AGAAAAACTT 4680

ACCTCTATCG GCGTTCAAGC AGCGGATGTG AGCGGCCAAG GCGGAACGAG TTTTACACAA 4740

ATTGaAAATG CCCGGCGGAA GAAACGAGAA CTTTCTTTCT TaGATGaTTG GGGGcAATCa 4800

ACGGTCATCT CTCTTCTGGA ATCACAAAAT TGGCAAAAGA AACTAACTAT TCTCGGCTCT 4860

GGCGGTGTGC GTAACTCTCT TGATATTGTC AAAGGACTCG CTTTAGGTGC CaAAAGCATG 4920

GGAGTTGCTG GGACTATCTT AGCTTCCCTT ATGAGTAAAA ATGGTTTAGA AAATACCTTA 4980

GCCCTTGTAC AGCAATGGCA AGAAGAAGTG AAAATGCTTT ATACTCTTTT AGGAAAAAAG 5040 

ACGACAGAAG AATTGACGAG TACCGCACTT GTCCTCGATC CAGTTTTAGT TAATTGGTGT 5100 

CATAACCGTG GTATCGACAG CACTGTTTTC GCAAAACGTT AAAGAGACCA GAACATTAAT 5160 

GTTCTGGTCy CyTTATTTGA GTTCTTTTAT CCTACATAAG CATCTAAGAT AATTTGTTTT 5220 

AGTTCACTAA TTAATGGTTC TTTCGGATTT GCAGTTGTAC ATTGGTCTTC ATATGCAAGT 5280 

TCTGCCATAC GATCAACCGT TGTGTCTAAT GTTTCTTGTG AGACACCTTG GGCTTTCAAG 5340

TTCATGTCAA TTCCGACTGA TTTACCTAAA TCTGCTACTG CTGTTGCTAA AGCTTCTACT 5400 

AGTTCTGCGG TAGTATTTCC TTTTAAGCCT AAGAATTTTG CAATGTCTGC ATAATCAGTA 5460

TCGGCGCGGA AGTAATCATA TTTAGGGAAC ATTGCGTGTT TTGATGGATC TTTCGCATTG 5520 

TAACGAATAA TATGTGGTAA TAAAATCGCA TTTGTACGAC CATGTGGAAT GCCATATTCC 5580 

CCACCAATTT TATGAGCAAT AGAGTGACAA ATACCTAAGA ATGCATTGGC AAATGCCATC 5640

CCTGCCATGG TTGATGCATT ATGCATTTTT TCGCGAGATT CCATATCTGG ACGTTTCACT 5700

GAATTTTCTA AATGATCGAA AACTAACTTG ATTGCTTGTA AACTTAAACC ACGTGTGTAA 5760

TCTGAAGCCA TCACGGAAAC ATAAGATTCA ATAGCATGTG TTAAAACATC CATACCTGTA 5820 
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tptppaptp a 

Itl otw X b A 


b I bAAbb 1 bb 




APGAAPTGTG 


GATPAAPAAT 


PGPTAPATCT 


5880 


bb lol 1 Hrtty 


bAi Ablblbb 


rp7V j\ppp a T AT 
1 AALbbAl Al 


TTP^PGTGTG 


TTTPAPTATP 


AGTAATAACG 


5940 


pp a a QT/tCT^ 

utnHAl bb 1 b 


1 MHL 1 ILl bA 


aPPAPTTPPT 
AbbAb 1 1 I 


GAAGTAGTTG 


GGATAPAAAP 


AAATTGTGTT 

rwi x i u i u i x 


6000 


rp rp rp rp /~i pp /"» rp rp 
1 i i 1L1 bb J. 1 


iubLAAl ill 


ATATPTPPPT 


TTGPGGATAT 


CTAAGAATTT 


TTGTTTTGPA 

X X VJ X X X X v3^*i 


6060 


bLAAAbAAL i 




blul 1L.A1AO 


AAPATPPAPA 


TTPPTTTPGP 


PGCATCCATT 


6120 


GCAGAAbbbb 


LALLbAAbbb 


AA 1 b Ab lu 1 A 


TPTPP.TTP,P,A 


AATPAAPPAT 


PATTTTGGTA 
x x x Xwoxn 


6180 


CCTGCATAAA 


GAG 1 A 1 X I G 1 


1 bAlbuu 111 


P^TTPfl A PAT 
X 1 V^AA^ A J. 


PAflAAAATAP 


TTPPtATTTTP 
X lv^unl x x x 


6240 


AbATLAi 111 


rp 7\ T 1 T 1 T 1 T 1 T 1 f 

1 Abb nnio 


Tar.rTPTTTa 

1 nbL iti 1 in 


PGTAPGATAT 


PTGPATAGPP 


AAATTGAACC 


6300 


ATACCTGGGT 


C ACAAAG 1 AA 


GA 1 GAL AUG 1 


1 b AAb A 1111 


btnl 1 U 1 lb 


TaP AT ATrrT A 


6360 


AAGAATTTTT 


m Tl 7\ T\ T\ 7\ T\ 7\ 

TTCAAAGAAA 


ATTTT1 GGAG 


biAAl 1 1AAA 


c p a t 1- p p a. t a 


TTATTTPTPP 


6420 


GTTTCGCTAC 


CGTTTTAATA 


i 1 AA 1 1 AAG 1 


rp 7\ TVO^PP^TP a 

1 AA1 bbC Ibd 


1 Abo 1 1 1 1 1 A 


PA A APAP.AP,T 


6480 


TTTTCCCgTA 


m ^ TV TV f** ^ a 7V 

TGgAACCaCA 


AGGAAG 1 G 1 1 


7\ 7\ppnTPPB A 

AAGGA 1 GGAA 


1 bAl 1 1 bb 1 1 


aTA A ATPTT A 
Hlnnnlb 1 In 


654 0 


CCGATACCGC CTTCTGCTGA TGGAGTATTC ACTAAAATAC GACAAGCTTT CATACGTAAA 6600

CCAAAAGCAA CTTGTAACTC TTCATCTTCT GTATGAATTA CTGCTGTATG TCCTAAACCA 6660

CCTAAGTTTA ACATCGCTTC ACATAGTTCA AAAGCATGCT CTGCATTGTT TGATTTCATC 6720

ATTGCTAAAA CTGGCGATAA TTTTTCTCTT GATAATGGAT ATTCTGGACC TGCACCTTCT 6780

AATTCAGCGA CTAAAATTTT TGTGCCTTTT GGTACACTAA TTCCTGCTAA TTCAG 6835



(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

AGTACCAAGC ACTTGACCGC TTTTTTCCTT AGGTGTGCCT TGTAGCAACT nACGCAGTCG 60 

CATTACTGAT nTAACCAGAA AAAACACCTT GTAATAAGCG TAAGCCAATC AATTGATAAA 120 

CACTTGTGAc TGCGCCCATT AAACTAATGA CGATAGCCAT CCCTAATGAA GCGCGTAATA 180 

ACATTAACTT GCGTCCTTTT CGGTCGGCTA AACGGCCCCA CCAAGGTGAA ACAATGGTTG 240

TGACTAAAAA AGTGGAAGAG AACGTGATTC CGCTCCAAAA ATTCAATTGT TGGTGGGTAA 300 

AAGTACCTAA TGTGTTGATA TAAAGAGGCA TAAATGGCAT CACTAAGCTA AAGCCAATAC 360 

CAGCCATAAA TGTGCCAAAC CATAAAACGA GCAGATTTTT TTCCCAAGGT TGTCTTGGAC 420

GGAAAATTTT ACTTTTAATT AATTCATACA TTCTATTTCC TCACTTTCTA GTAAAGAAAT 480

TCAAATTTGG CTTTGGCGAG TAGTTGATCA TTCAATGCTT GATGCAATTC ATTCGCAAAA 540 

AAGCCTTCTT TTAAATTTAG TGGATTTTTT GTAGTGTACA CATATAACGT ATAGGTGTGC 600 

ACACCAGAGC GAGGGCTTAA CCCCAGATAA CGTTCGGTTA ACTCAGGTAA CGTTTCAGTG 660

GCAAGCGCAG GAATCATCTG TTTCCCCAAA GGAGAAGCAA GACTATTTTT TCCTTGAATG 720

GTAGGCTTGA GTGCTGTTCG ACTAAAGTCT TCAGGAATCT CATTGGTTAC AGGAACATTG 780

GCAGCTAACC AATGAATAAA AGGGAAACCG ATTAGAGGAA TCGTGTCATA ATCAATTAAT 840

TCCCAGGATA AATAACGGGC TTCTTTAGGA ATGGCCGCTA AAGAAATAGG AAAAGATAAA 900

ATAGGACGTT CGTGTAATAA AAATTCTGCA CTGGTTCGTT TGCCATAACG ATCGGCCAAA 960

TGAGAACGGT TGGTAATATG CATGCCAAAC AGCCTTCTTT CAAAAATGAT GTTTACAAAT 1020

TATTATTATA AACTATAATT TATAGTTTAA AAAGAGTCAA TAGATATGAT AAACTAATAC 1080

TTGTCCAAAA TTTAAGAATT GGAGGCGTCA GAAAGATGGC AAAAAAACGG AATTTAACGA 1140

AAGCCAAAAT CCTTGAAAGT GCCCGCAATC TAGCGGAAGA ATTGGGCGTT CAACAATTAA 1200

CCTTTCAAAA TTTAGCCGTT GATTTAGGGA TCAAGTATCC TTCACTCTAT AATCACTTTA 1260

AAAATATTGC TGAAGTAAAA AATGCTCTAG TGGACTTGTT AATTCAAGAA TTAAATGATG 1320

CATTGCGAAG AGCCTTAGTG GGTAAGAGTG GCGCAGAAGC AATTCGGaTA TATGCkGAAA 1380

CGTACCAACA ATTTGCyTTT GAAAATTCTG CAGTCTATGA ATTACTTATC AGTGTGCCGA 1440

AAACGCAAAA CCAACAATTA ATTGAAGGCA TCCATGAAAC CAATCAAATT ATCTTACAAC 1500

TTTTAGCTTT TTATCCATTT AATAATGAAG AACGACTGCA TAAAAGTCGA GAATTACGTA 1560

GCTTAATCCA TGGTTATATT ACCTTACGTT TTTtAGGTTA TTTTCAACGC GAAGAAGCCA 1620

CGCCGGAAGa AAGTTATCGT CGCATGATTG aAGaTTTTAT TGTTTCTTTA gAAGTGGcAG 1680

GaTAAAATAA ATTTCAAAAA TTTCTCAGTC CGAAACCAAA TCCTTGcAAC TTTTTcaGAA 1740

AGTGaCTATG ATACCTACAG TAGCaAAAAA AgGAGCGGTT CAATGTATCm AGtAATTACA 1800

ATGtATGGcG aCmATGAACC TTGGTGGTTT TTTGATGATT GGCAAGAGGA CATTGTTCAA 1860

GAAAAAACCT TCGATAACTT GACCGATGCA GAGACATATT ATGTGGAAGT TTGGCAAAAA 1920

CTTTCTCAGA ATTATACATA CGTCAACACA AAGCCTAATT ATTTAAQGGC TTTCTGGAAT 1980

GATGGGGAAG AACGCTGGTG TGAAGAATGC GATGAAGATT TACAACAATA CTTCGGTTTA 2040

GCATTACTAG AAAATCATCA TGCGGTTACT TTTGAAAATA AAGCTGAGTT TACCACAATA 2100

TCCAATCATT TTGGCAAAAC AAAATTCTGC AAACGGAAAG TTATTTGAAA CGCATGAGTT 2160

TAAAGAAAAA CCTCAGGGAA TTTTCTTTAA ACTCTTTTTC TATTCTCGGA ATTTAGAGAA 2220

AATCGAACCA ATATTTGGTA TAATACGAAA AGATAGCACT CAAAAAGGAG CGAAACTAGT 2280

GGTTGCAAAA CAATTAATTA TTGCTGAAAA ACCTAGCGTA gCAAAAGATT TAAGCCGCGT 2340

TTTAGGAGCA AATCAAAAAA ATAAGAATTA TTATGAAGGA CCCAACGTGA TTGTCACATG 2400

GGCTTTAGGC CATCTCTTAG GACTAAAAAT GCCAGAAGAT TTAAACAAAG AATGGCAAAC 2460

GTGGCAAATG GAAACTTTAC CGATGATTCC TAAGAATTTA GGGATTAAGC CGTTACCGAA 2520

AACGGGACAT CAATTAAAAG CGATTAAACA ATTAGCCAAT CGTAAAGATG TTTCTGAAGC 2580

CGTGATTGCC ACTGATGCTG GTCGTGAAGG TGAATTGGTG GCCCGCTGGA TTTTAGAATA 2640

TGTCCGTTTT AACAAACCAG TCAAACGCTT ATGGATTTCA TCTCAAACAG ACAAAGCCAT 2700

TAAAGACGGG TTTAAGAAAA TTCGCCCAGC GAAAGACTAC GATAATTTGT ATTACTCTGC 2760

CTTGGCGCGC GCAAAAGCTG ACTGGTTAGT GGGGCTAAAC GTCACACGGG CCTTGACAGT 2820

GAAATATCAA GATAACTTAT CCGCAGGTCG TGTTCAAACA CCAACTTTAG CAATGGTACG 2880

CCAACAAGAA AAAACGATTG AACAATTTAA GCCTCAAACT TATTTTACTA TCTCGTTATC 2940

TGTTGAATCA GAAAAAGCAA AGATGACCCA AAAAAATCCA TACGCTTTAA AAGAACGGCA 3000

GGAAGCAGAA CAATTAGTCA AaGAACTCTC AAAACAAAAG GGCTTAGTTA CCGATATCCA 3060

AGAAAAAGTT AAAaCAGAAA ACGCGCCGTT ACCGTATGAT TTAACCGAAA TCCAGCGTGA 3120

AgCCAACCAG CGATTCCAAT TTTCTGCAAA GAaACGTTGT CTCTTGTACA AAGTTTATAC 3180

GAAACCCATA AAATTGTTTC CTATCCTCGA ACTGACAGCA AATATCTAAC GACGGATATG 3240

AAAGGAACGA TGAAAGAGCG CCTTCAAGCT ATTGCTGATT TTTCACCAGA AGTGAAAGGC 3300

TACCTTAAAA ATGGGGCTGT TGTTAAACAA CAAAAAGTTT TCCAAAATGC AAAAGTCACG 3360

GATCACCATG GGTTGATTCC AACTGAACAA CGTCCTCGTT ATGAAAAGCT AAGTAATGAT 3420

GAACAAAAAA TCTATCAAAT GATTGTTCAA CGATTCTTAG GTTTGTTTGC TGAGCCAAAT 3480

CAAACCAAAC AAACAAAAGT TACAGTCGCT TTTGGAAAAG AAACATTTGT CTTTCATCAA 3540

AATAAAGTAG TTGTTGCTGG CTGGAAAACA ACAGCCGAAC AGCCGCTATC CACTGTCCAA 3600

TGGCAAAAAG GCATGACCGT TGCGCCAAAC TTTACCATTA ACAAAGAATT AACGTCGCCG 3660

CCAAAACCAT TAACCGAAGG AACGTTGTTA GGAAAAATGG AGAAACACAG TTTAGGCACA 3720

CCGGCAACGC GAGCTGAAAT CATTGAAAAA CTGATTAAAT CAGAATTGAT GGaACGAACC 3780

AATAGCGGTT TAAGTGTCTC TGCCAAAGGa AAGCmACTAT TAGACTTGGT GaATCCATCA 3840

CTTGTGaCAC CTGGAACTtA ACGGGAAAAA TGGGGaAAAG TcGcTcGaaG CaATcGCTTT 3900

CGGGGTTCCA ACAAAAGAGT TCAACTTTTT CnTTTAAAGG GATATTTGGA AGAGGGnTAC 3960

GGAAAAATTT AGTTCCG 3977



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

nnTATTTTTC AATGTATCCG CGTAATTCTT CTTCAGTGAC ACTTTCGTCA AAGATATACT 60 

CACCAATGAT GATCGTTGGT ACAAATTGAA TGTGCGCCGC ATTGGCTTCA GCAATCACTG 120

CGGAAACAAG CGTTGCATCT TTTTGTTCTT TTAAGCCCAG ATTTTTTTCA GCATACGTCG 180 

CTACTTCTTC TAAGGTTAAG TTTCCCCATT CATCTTGCGT ACAAACATTT TATGCAATGC 240

TGACAGCGCT TGTTCAGGCG CTGAATAGTC AATGTAGTGA TGCATCACAT TGCCGCGTTG 300

TAAACTTTCT TTTTCTTTAT CAAACAACTT AATGATACGT TCAACTTTAC CACTTTTGAC 360

AGATTGTGCT AACAGTTCCT CAGACTCTTC AAACCATTTT CTGCAATAAG GACAGCGAAC 420

ATTGATAAAT TCTATCATTT TGACAGGGGC ATTACTCTCA CCAATGTGAA GCCCTGTTTC 480

TGTGTTAACT TTTGTTGCAT CAATTACTGA AATATCCATA AAAAACCTCA TCCTTTCAAA 540

CCTTTATTTC TCTTTTAGTA TACCGTTTTT CAATAAAAAA ATCATAAGAC ACGAATTCAA 600

ATTTTCTGAA TTCGTGCCTC TGTTCACTTT TTTAACGATT AATCGAACGA GCCATTCTAG 660

CAACAGTATG tCTTTTCCCA CGTATCGCAT CAGAAAGCTC TTTAATAwTT TCAATACCAA 720

CTAAGCCCAT ATtGGCATCG CCAaTATGTT GAATATCTAC GCCTAAAATT TTATTTCTAA 780

TGCCGATTGC TTGGgATAAC TTCAGGTTGA GCACTTTCTT GTGAAGTTCC AATGGCACTC 840

aTAACTAATT TTCCTTTTGA ATGTGCAAAA TCCACAACTT CTTtGACTTG TTGATCATCA 900

ATGCCCCAAA CCGTGCCGAC TGCTGGGACT AAGACAATAT CAGCGCCCGC TTCAATAAAT 960

TGTTCAGCTG ATTTTAAACT GACCACTGGT TCATCTACGC CTGaACTGTG CATTTTTCCG 1020

GCAATAATGA TGCCGTCAAA ATGTTTTTTG GCTAAGGCCA CGTTCTTAGC AATCAAGTCG 1080

TTGGTCACGC CAGTTCCTGG ATTACCTGTC AATAAGATAA AATCCAAACC CAGTTCATTG 1140

GCTTTTTTGA AAGTAGCTGC TGATGCTTTT CTACCTGGTT CAATCGAAAC TCTCGTTGAA 1200

GCTAAGTCTA AATTTTCATC AACAGGTTCG ACATTCATGC CaATCGGTAA ACCGACTAGT 1260

TCTTTCAAAC GATGAACAAC CTCATCATTA GGGACACATG TTTCCGCAGT ATATTGCCCT 1320

TCATAAGCGG CTACTACAAT GGGTTGATTA ACATCGAAAA CATTCAATAA AATCATGTCT 1380

GCGCCATATG CTTTTGACAT TTCCGCACAA GTGATATCAC CTGCTACGGA TGGATGAACT 1440

ACCACATTTT CCGATAATAT TACGCGCCCT TCGCTCGCTT TAATACTCTG AAACAGCTCT 1500

TCTTTTGACA TCCGTTTGAT TTCTGATGCA TTTGCACTAA TTAATCGTkG AACCATTTTG 1560

ACCCCTCCAA TAGTTTTTAA TGTTTTAGTT AAAAACAGCA ATAAGTAATC GTTTGCATTT 1620

TATTACATGC AACATTCGTG cCAACGTCGT TTCTTTAAAc CACAGGTCAA CACTACATTT 1680

CAGtGCTACA CACTTTTTGc ACACTTTAAT TTTTTTTACA CGCTTAAAGA AAAAGTGAAG 1740

GATAGCACTT CAAACGGAAA TGTTATTCTT CACTTCTGGA GCAGAAAAGT TACGCACTTT 1800

TTTCTCGTAA TTTTTCTGCA AATTCATTAA TTTTTCGAAT ACGATGATTA ATACCTGACT 1860

TAGAAATAGC TCCTGAAGGA ATCATTTCCC CTAGTTCTTT GAGACTAACT TCCGGATACT 1920

CTAACCGTAA TTCTGCGATT TCCTGAAGTT TCTCCGGTAA TGACGTCAGC CCTACTGTAC 1980

TTTCAATAAA TTGAATATTT TCAATTTGTT TTGAAGCCGC ATCAATGGTT TTATTTAAAT 2040

TCGCTGTTTC GCAATTCACT AAACGATTTA CTGAATTCCG CATATCACGG ACAATCCGGA 2100

CATCTTCAAA TTTCAACATA GAGTTCGTGG CACCAATTAA TGTTAAAAAG TCGGCAATTT 2160

TTTCAGCACC TTTTAAATAA CAAATATACC CATTTCGGCG TCCCAATGTT CGAGCGTTTA 2220

AGTCATAGTA ATTCAACATC TTACAAATAT CATTATTGTG TTCTTCATAA ATTGAAAAAA 2280

TTTCTAAATG ATAGCGACTT GTTTCCGGAT TATTAACTGA GCCAG^GCC ATAAAGGCCC 2340

CGCGTAAATA GGAACGCATT TTTTGCGCGT TACCCATGAT TTCATTTGAC ACATTGCCAT 2400

TAAAAACGAC CCCATCCATA ATATCCAAAT CAGCTAGAAT TTTTTGTGTA TCTTGTTTTA 2460

GACGCACAAT GTAAACATTA TTTTTCTTTA GTTTCATTTT TTTACGAACT AATAATTCTG 2520 

CTTGTGCATG GTAATGATCT TTCAGCAATG AATACATTCG ACGAGCAATT GCAGCATTTT 2580 

CCGTTTGTAC ATTCAAGACA AACTGTTGAT TAACTAAGCT AAGCGAGCCG TTCATCCGAA 2640 

TTAATGCTGC TAATTCTGcT TTCGCGTGTT CTCGATGGAC TTCTAAACCA GTTAACTCTT 2700 

TTTTGACATC CGAGGCAAAA GACATCTAAG CTCCTCCTCT CTTGATTCTT TTGATTAATT 2760

AATATTTCGT TCCAAATACG ATACGGAAAA GTTCTTCAAC AACTTTTTCC CCATCATGAA 2820 

AAACTCCGCC ATCTCTTAAT TCTAAAAAGT CTGTCGAAAT GACTCGGCAA CCTTCTTCAC 2880 

GTAATCCTTG AAAATCATGC TTTACTTGAA CTAAATATTC ATCATAAATT TCAGGATCCA 2940

TATAATTTTC TGGTACTTTT TCTGTGTTAA CCAGCACCGT ATCAACAAAC TGCGCTTGTA 3000

GATGCTCGTT TAAGACACGA ACATGATCTG CATCCGTAAA ATGTTCCGTT TCCCCTTTTT 3060 

GGGTCATGAT GTTACAAATA TAGACTACTT CGGCTGCCGT TTGTTTGATG GCTTCACCAA 3120 

TTTCTGTAAT AACTAAATTA GGCAAAATAC TTGTAAAAAG ACTCCCTGGC CCTAACACGA 3180 

CCATGTCTGC TTCTTCAATT GCTTTGACGA CTTTACGCGC AgcTCTTGGC TGCTCTTCTC 3240

CGTGTGTATT TGTTACAAAC ACATGGTCAA TCGTCTTGCG ATCCAGTGCA ATTTTTGATT 3300 

CTCCCACCGC CACTGAGCCA TCTTTAAATA CAGCATGTAA CGTTAATGGC CGTTCTGATG 3360 

ATGGATAAAT CCGTCCATCA ACATGCATCA TTTTTGAAAG CAATTGAATC GCTTCATACG 3420 

TACTTCCTCT CATTTCTGAG ACAGCAGCAA TGATTAAATT ACCGATTGCA TGATTGGCAA 3480

AATGACTATC TGATTTATCA AAACGATATT GAAAAATATC TTCATATAAT TGTGGCATAT 3540

CTGACAATGC CACTAAAACA TTTCTTAAAT CTCCCGGAGG TGTCATGTTA TTAATAGAGG 3600 

AACGAAGCTC GCCACTACTA CCGCCATCAT CAGCGACCGT TACGACGGCA GTAATATCAA 3660 

CACTTTGATT TCTTAAACTT TTCAAAATTA CTGGCAAGCC TGTTCCACCA CCAACAACCA 3720 

CGATCTTCGG TTTACGAATT CGATAGGTTT TCATTAAGAG CGATTAACCG TTTCTTTTCT 3780 

TTTTAAGCGG TCGCGGTGGG TAATATTTAC ATGGTAATCT TCCTCTTTTA ATTCAGCGCC 3840 

AACACGTTCT GTCAATGCAA CAGAGCGATG TTGTCCGCCA GTACAGCCAA TTGCAATGGT 3900 

GACGCTACTT TTACCTTCTT TTTTATATCC AGGTAAGACT GTTCGCAGTA AATCGATAAA 3960 

TTTTGTATAA AATTCATCTG TTTCTGGGAA GCCCATCACA TAATCATAAA CAGGTTGATC 4020

CATTCCAGTT AACGGTCGTA GTTCATCAAT ATAATGGGGG TTAGGTAAAA AGCGAACATC 4080

CATAACGATA TCTGCATCAA TCGGCAAGCC ATATTTAAAG CCGAAGGACA CCATTTCCAC 4140

TCGAAATTCG TGTGTCTCTC TCGTTGCTAG TTCTTTATTC AAACGTTCCC GCAATTGACG 4200

AGGGGACAAG TCCGTAGTGT CAATAACCAA CTGAGCATCC GCCTTGATTT CTTCCAGCAT 4260

TGCACGTTCT TTACGAATAC CTTCTGTCAC TAAACCATCC ATCGCCATTG GATGCGCACG 4320

CCGTGTCTCT TTATAACGAG AAACCAACTC TTCATCGGTG GCATCTAAAA ATAAAATCGT 4380

TGTATCAATA AAATTAGTAT TTTCAAGTTC GACTAACATA TCCTGAATTT CTCTAAAGAA 4440

GGTCCGTGAA CGTAAATCAA TCACTAACGC AATCTTGGTA ACTTTTCCAG ATTCTTTAAT 4500

TAATTCCCAA AATTTTGGGA TTAAGCTTGG TGGTAAATTG TCGATACAGA AATAGCCCAT 4560

GTCTTCAAAG CTTTGAACAG CGACTGTTTT TCCAGCGCCA CTCATTCCTG TAATAATAAC 4620

CAGTTGTAAA TTTTCTGGCA TATTCAACAC TCCCTTCACT TGCATCACTA TATTATAACA 4680

CCCCGGGCGT ATCAAAGAAA GCTTTTCAAT CATTCTTATA AAGATTTCTA TTCTTTCATA 4740

GAAAAACAGC ATTGACATTT TTGTCAATGC TGTTTTAATT GCTGAATTAT ACTATATAAC 4800

TCTGTGTTAT TTTAATACAA ACTCAATTTC TGTTTTATCA GTAAAATTAG CTTCAAACTC 4860

AAATTGTTTT ACTTGTTCAT TCACGGCTAA CTTTTTCGTA ATGTGTTCTT TAATTTGTTC 4920

ATTTGTAGCT GTTTTGATTG ACAATCCATC TAAAAGAGTC ATGATTTTGT CTTGTGCTGC 4980

CTTACCTTCT AATTCTAAAT CAGCAGATTC ATCTTCATAC GTTGCTTCAA TTGTTCAGTT 5040 

CGATTTTATT TCATAGGTGA ACTCAATTTC ATTTTCGTCG TATTCAATTT CTACGTCAAT 5100 

CTCTTGAAGA CTGGCAACAG TCGGTGTCGG TGTGGTTAGT TCATTGTCTG CTGTTTCATG 5160

GTCCCCACCA ACTTCAGGAG TTACAGTTTC CCACTTAAAG GTACCAATTT TAGCTGTTGC 5220 

TAGATAGGGA TTATTAATGG TTATGACTTT CAATTGAAGG TCTGAATGGT TGGCGATATC 5280 

TGTTAAAATC CCCAGAGCCT GATATGTATT TTGTGTAAAA CGTGTCGCAA TATTTTGCGC 5340 

ATCAATTGTG GAATAGCCTA ATGTTTCCCA GGTGATTTGT TGCTTTAATT CCTGATGAAC 5400 

GGCTGTACTG TCGCCGGCTC CAATTGAATG CCCGTTGcTt CTtTTAAGnC TTCCTGGnAA 5460

AACTCATTAC TTTTTCATGG GCTGATTGTG AAATTAAnGT CGCTGCTCCA TGGGGTACnA 5520 

ATGACAAAAG 5530 
(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:

TTGTTAGGGG ATGAGATGTA TCATGGAAAG ATGGATCAAG GAATTGAGCG GCAAGCGTTG 60

CATTGCATGG AACTAATTTT TACTCATCCT TTTACCAAAG AAACGGTTCA TCTGATTTCC 120 

CCACTnGCGG AGGATATGAA AAGTGTAGAT GATACGTTAT AATAATAAAG AGGCAGGAGG 180 

GGAAAGTGTA AGATGAATGA AGGCCAAGAA ATGGAAGAGC AATTTGCCCT TCTTTTAGAA 240

ACGCTAAAAA ATCAACAAAT GAACGAATTT CGTGAGCTAT TTTTAGCGTT ACACATTTAT 300

GAACAAGGAC AATTTTACCA ATCG7TAGAT GAAAAAGAC3 GACAACATTT ATACAATTAC 360

CTATCTCCGA AAGAACTAGC GGA7ATGTTT GATGT7ATCG AAGAAGATAA TGAGAATATG 420 

AAAGATTATT 7AGCTGAAA7 GCGCCCaAGT TATGCAGCCG ATATGTTAGC TGAGATGTAT 480 

ACCGATAATG CGGTGGATT7 ACTAAATATG TTAGATAAGA GTCAAAAGGQ GAAATATCTT 540 

AGTTTGTTAA GTAGTGAAGA A3CAGGCGAA ATTAAAGAG7 TACTTCATTA CGAAGATGAA 600 

ACTGCTGGGG CGATTAfGAC CACCGAATTT GTTTCGATTG TGGCGAATCA AACAGTGCGT 660 

TCTGCGATGT ACGTC77AAA GAA7CAAGCG GACATGGCGG AAACAATCTA TTACGTTTAC 720 

GTAGTCGACC AAGAAAATCA 77TA3TCGGC GTTATTTCG7 TGCGGGATTT AATTGTTAAT 780 

GATGATGATA CCTTGA7TGC TSATATTTTG AATGAACGGG TTA7TTCAGT TCATGTTGGC 840 

GATGACCAAG AAGATG7CGC GCAAACAATT CGTGACTAT3 ACTTCTTAGC TGTGCCAGTT 900 

ACAGATTATG ATGATCATTT G7TGGGGATT GTAACAGTTG ATGATATTAT TGATGTTATC 960

GATGATGAAG CCGCTA3CGA CTAC7CTGGT TTAGCCGGGG TCGACGTTGA AGAAGTTAGT 1020 

GAGAACCCTT 7AAAAG7CGC 7TCTAAACGA TTACCTTGGT TAATTACGCT GCTGTTTTTA 1080 

GGGATGTCTA CGGCC7CGTT AATCAGTAAT TATGAGTCG7 TAG7TAGCGA AgCGAGTATT 1140 

TTAGCCGTTT 7TATTTCTT7 AATTACCGGA ACAGCCGGCA ATGCCGGGAC ACAATCATTG 1200 

GCGGTCGCGG 7ACGCCGTT7 G3CTATGAAA GATGAAAAGG ATAGTAATTT TGGTCGCTTA 1260 

ATCTTAAGTG AAGTGC7CAC GGGACTAGTG ACGGGGGCTG TGACAGGTTT AACCATTATG 1320

ATTGTGGTTG GCGTG7GGCA ACATAATCTT CCGCtTGGA? TTG7GATTGG GATGGCAATG 1380 

CTCTGTGCAA 7TACTG7GGC GAAT7TAGCA GGGAGTTTAA TCCCAATGTT AATGGATAAA 1440

TTAGGGTTTG ATCCCGCAG7 7SCCAGTGGC CCGTTTATTA CTACCTTAAG TGATTTAACC 1500 

AGTGTATTAA 7TTAT7TCAA 7ATTSCCAGC ATGTTTATGC GTTATTTTGT TTAAAAAATT 1560 

GCACTTATTT ATCATGAGAC 7GAG3TAAAA GGATTCGT7G GTT7GCTTGA AATCAAAAAA 1620 

GTGAGATAGC CGTTAATACG ACGGCTrCCT CACTTTTT7A GATACAAGAA ATTAAACAAT 1680 

TTTTGTGCCA 7GGACTTCTG TGACxTTTAA AgTGTCCATC ACAGTTTGAA CGTTTTCTGT 1740

GGAAATACCG CCACCTGGkA AAATAAGAAT CCGkTGaTCT GCGTAGACaA TTAAwTCTTT 1800 

TAAAkGATCA AAGTwGtCTy CGATAGGGGk ACCAGCyGGc CCGCCATGAG TTAAAATACG 1860 

sGkCACACCA CGCTCAGCTA ACCaGTCGAT CGCCT7AAAT TGaTTTTCTT TCGaCAAGGC 1920 

aTCAAAaGCC aTATGGAAAG 7GA77TGCaA GCCTTCaGCC gTTtCAATGA ATAAGTcCAA 1980 

AgCTTCTTCa 7CTAACCaAC CGTCCTCAGT TAAAaCAGCC cAATAACcAA TGCCCGTCGG 2040 

TTCCCTAATT TCTTTGCCTC GATTAAATCG GTATGCATAA TTTTTAATTC AATATCATTA 2100 

TAGACAAAAT TGCCGCCACG AGGACGAATG ATTGTCAtCA CTGGTACAGA ATGTtCGCCa 2160 

GCGtATGCTA AGaCTTCTTC AATA.-.CACCA GTACTTGGCG TCG7TCCACC CACGGCTAAG 2220 

TTATCGCATA ATTCGATGCG ATTC3CACCT TTTTGAATGG CTT3GGGAAT CTTAGTGAAA 2280

TTTTCGGCAC AGAATTCTTT AATCATTTCT TATCGCTCCT TGCTTTTTCT TCACAGGCAT 2340

TGTACCATAA TTCTGCTTAC TACCATAGCC CTGAAAAAAA GAGTGCAACT TTTTATAAGA 2400 

GAAAATGTTC ATTTAATGAA CGTAAGGAGT TTTAAATGTT AAAGAAAATT GAACGAGCAC 2460 

AACAATTATT GAAAAAAGAA GCAGCCGTTT TTCGCTGTCC AACTTGTCAT GAACCAATGC 2520 

ATTTAGAAGG GGTTGGTCTG ATTTGTCAAC AGAGGCATCA ATTTGATTTA T 2571 
(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2988 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

AAAGTTGTTG ACAAATAACA AACGAGATGn TAAGATATAG AAGTTGTCAC nACAGACAAC 60 

AACAACCAAG CAAAACAACT TAAAAAAACT TTTTyAAAAA AgTTCTTGAC ATTCAAACGr 120 

AAGTTTGTTm TGATATAAAA GTTGCTGCGA GGTAACGTAG ACCTTTGAAA ACTGAACAAA 180 

GAATAGACGA ACCAAATGTG TAGGGCGTCT TGATTCAATT CAAGACAACA AACATTTTTA 240 

ACAAGCAAGC AATATGCTAG CAAACAAATT GAGCTTAACA ATCGTAAGAT TGTTCTAACT 300 

TTTATTATGA GAGTTTGATC CTGGCTCAGG ACGAACGCTG GCGGCGTGCC TAATACATGC 360 

AAGTCGAACG CTTCTTTCCT CCCGAGTGCT TGCACTCAAT TGGAAAGAGG AGTGGCGGGG 420

CGATGCCAAA AAGCAAACGC TTGTTAAAGT AACACTGAAA GGGCAATTAA AAGAAGTCAC 480

GTTTCCAGAA ACAATTAAGG TCTTTAATAA TtGTAAAACT GGGACATTGT TTGGCGATTG 540

GGCTGACGTA AAACCTTTTT TAGAAGCAAA CAAAGAAAAA ATTGAGGATT ATGTTGTAGA 600

AAACGATGCA CGTAACTCAG CAATTCCTTT TCTTGATCTG AAAGACATTA ATGCTCGGAT 660 

TGAACCAGGT GCTTTGATTC GCGAAAAAGT TGAAATTGGC GATCAAGCGG TGATTATGAT 720 

GGGGGCTATT CTAAATATTG GCGCCGTCGT TGGTGCAGGG ACTATGATTG ATATGGGCGC 780 

TGTTTTGGGC GGTCGTGCAA CTGTAGGGAA GCATTGTCAT ATTGGCGCAG GAACGGTTTT 840 

AGCTGGTGTC ATTGAACCAC CAAGTGCCGC ACCTGTGGTT ATTGAAAACG AGGTTGTCAT 900 

TGGAGCGAAT GCGGTCGTAC TAGAAGGGGT TCGTGTTGGT GAAGGTGCCG TTGTGGCTGC 960 

TGGGGCGGTA GTTGTGGAAG ACGTTCCAGC GCATACCGTT GTTGCAGGCG TGCCAGCAAA 1020 

AGTAATTAAA CAAATTGATG ATAAAACCAA GAGTAAAACA GAGATTTTAG AAGAATTACG 1080 

TAAACTCTAA AAATAAATTA GTCAGTCGCG ATGAAGAAAC TTCTTCATGG CGGCTTTCGA 1140 

TTACCCAAAA AATGAAGGAG TGGCTTTTGT GGAGCAAGAA GAATTAATTG CGATTAGACG 1200

GCAACTACAT CAAATACCtG AAATCGGTTT AGaAGAAAAG GAAACACAAG CCTTTTTACT 1260

TAATGAAATT GATAAAATGA AGCAACCTTA TCTGCAAGTT CGAACATGGC AAACGGGGAT 1320 
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TTTAGTCTTC 


ATAGAAGGCA AAAATCCTCA GAAAACCATC 


PPPTPPPPTP 


PTGATATTGA 


1380 


TGGTTTACCT 


ATTCAAGAGG 


AAGTGGTTTC 


TGCTTTTCAA 


TPAAAAPPAP 


PAGGGTTTAT 


1440 


GCATGCCTGT 


GGGCACGATT 


TTCACATGAC 


AATCGGTTTA 


PPPPTTTTPA 


AAPAATTAAP 

nnOnrt 1 l nnu 


1500 


TCAACAGCAA 


CCAGACAATA 


ACTTTTTATT 


TTTATTTCAA 


PPTPPTPAAP 


AAAATPAAPP 


1560 


AGGGGGGATG 


TTGATGTATG 


AAGACCATGC 


GTTTGGTGAG 


TPPTTAPPTP 


ATPAATTTTA 

n l unn jl x a x n 


1620 


TGCGCTTCAT 


GTTAATCCTG 


ATTTGCCAGT 


CGGAACTATT 


APTAPPAPAP 


TAPPGAPATT 


1680 


GTTTGCTGCC 


ACTTGTGAAG 


TGAACATTAC 


CTTAAAAGGA 


AAAPPTPPAP 


ATPPGGPTTT 


1740 


TCCGCATCAA 


GCGAATGATA 


TGGTCTTAGC 


GGCCACAAAT 


TTAATTPAPP 


AAPPGPAAAC 


1800 


GATCGTTAGT 


CGAAATGTTG 


ATCCTGTGGT 


TGGGGCAGTC 


PTTAPPTTTP 
oil r\\* LI 1 lu 


PA APTTTTPA 


1860 


TGCAGGGACT 


GCCTGcAATG 


TGATTGCCGA 


AGAAGCAACT 


TT ATPTPPT A 


PA ATTPPPAP 


1920 


ATTGACAGCA 


GAAACGAATG 


AACAAACGCA 


GCGAAGAATT 




PTPA APPTAT 


1980 


TGCGCAATCT 


TTTCAATGTG 


AAGTAACTGT 


TCATCTTGAT 




BTTT APPTPT 


2040 


AGTGAATGAG 


CCAGCATGTA 


CCACGAATTT 


TATTGAATAC 


7i T P a P P Zl A A P 


A APPAAP APT 


2100 


TCAATTTCAA 


CAAGCACCGG 


TTGCAATGAC 


AGGTGAAGAT 






2160 


AGTCCCAGGC 


ACGATGTTTT 


GGTTAGGCGT 


GGCAAGTCCC 


I rt^HLiCL. i it 


ATTPTPPT A A 


2220 


GTTTGAACCG 


AACGAAGAGG 


CGTTATTGTT 


TGGAGTGGAG 


PPAPTT2iPTP 


PPTTTTT AAA 
uv, l iii 


2280 


ATCATTGGAC 


AATTAGAAAA 


ACAGCCACCA 


GATAACTAAT 


n n RT1\ TPTPP 


TPPPTPTTTT 

1 OvJ v*- 1 O 1 1 1 1 



2340 


TTATTTTGCA 


ATAATGGGCG 


TATTGGGAAT 


CGTAAAACCG 


PPPTPTPTTA 


ATPPTTTGAT 


2400 


ATATTGGGCT 


AGAAATTCTT 


CTTTGACCGC 


AAATTGTT.TG 


PPPTTTAPTA 

Vj 111 C\KJ 1 f\ 


PATAPATGGT 


2460 


CGTTCGCACC 


GCAAAGTTGC 


CATTGCCTAA 


ATCGACCATa 


PPAAAAATAT 


PTPPPTPAPT 


2520 


TTGAATACTT 


TCGATATACT 


TTTCTTTTAA 


TGTTTCATTG 


APTTPTPTA A 


TPAPTTPACA 


2580 


AATTTTTTCA 


TAACCTTCTT 


CAGGATTGAT 


GCGGACATCA 


ATTAAPAPTT 
n 1 1 Mrttrit 1 1 


PP AT ATTGGC 


2640 


ACGAGACAGG 


TTACTGATTG 


TGGTGATGTT 


TCGGTTTGGA 


ATAAA ATP A A 


PAPTTCCATC 


2700 


7\ t\ /-*/" , /T , T"T 1, T , 7\ 

AALCbL I 1 1 A 


ATTTGAGTGG 


TTCGTAATCC 


GACAGAAGTA 


ACAGTTPPTT 


CAATATTTAA 


2760 


CGCCAATAAG CGAATATAAT CGCCTACATC CATTTGTTGT TCCAAAATAA TGAAAAATCC 2820

GGTAATTAAG TCATTCATGA ATCCTTGAGC CCCTAAACCA ATGGCAACAC CGGCAATTCC 2880

AGCTCCGGCT AATAATGAGC CAACAGGCAC ACCGACAATG GTCAAAAGAG AATAGATGAA 2940

GAnAAAGAAC AGCGTATATG GAAATGCATT AATGGnTTAn AnTGGGTA 2988


(2) INFORMATION FOR SEQ ID NO: 209:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1510 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:

CnTATAGACA TATTCTCnCC AACTTATTTT TTGGTGGGAG TTCCCGGTTA TTCnTCCATA 60

TTGTTTTCnG GTAATTCGTT GGTAACTAAC AAATGTTTCC ATATTGGTCA TCTGTCAATC 120

CAAGTAAGCC GTATTCAGCT TTAATATCTG CATATTTAGG ATAATCTAAA ATAACTTTGC 180

CGATTAGTGG TATATAAGAC AATGCCCAAG AAATAACCGC AAAGTTCAAC ATTACATTCC 240

CAACAAAAGA AGTTGTTCTA TTATTTGTTA GGCCATTTAA ATCATCAGGA TAATGAACTA 300

ATTTGCCATT GCGTAAACCC CAGTTCAAAT AATCTTTTGG CTCTTCATTT ACTTCAATTT 360

CAATATAGTT TTTCTTAGCA ATAAACGAAT AGTCCGTAAG CATTTGATTT TCGTCCATTG 420

TAATCCATAT TTTTATCTTA TCCATGTTGC ACTCCCTCCA AATTCTACAT CATCATTTTG 480

AAGTCCATAA ACTCGAATCA TAATTCGACC AGATGTTTCT ATTTGTATTG CTGCATATTT 540

ATTTGTTTGA ATCGTATTGC CTATATTTAA GGTTAACGGT ATTTTAATAA TCATATTTTG 600

AGGTCTATAG CTAGCTTGAA TATAGGTATC CACACCATGC CAACCATTGG CTTTtCCTTT 660

TTTAAAAATT CCTTCAACAT TACATGTAAT AATTGTATTT GATCTTTGAA TGTGAATAGT 720

AATTCCATTG TAAGAAATAT GTTGACCAAC AGCATTAAAT GCCGTTTTGT CAATCTTTTT 780

ATTTAAATCA TTTGTATACT GTGTTTTAGT AACTAAATTG CTTATATCAG GCGTATCACC 840

TTTAGGTCCT TGCTCTCCTT TTGCGCCTGG AACACCATCG TTTCCTTTTT CACCTTTTGG 900

ACCTTGTGGA CCAGTATCCC CCTTGTCACC TTTAGGTCCT TTTAAACCAA TCGGACCTTG 960

CGAGCCTGTT TCTCCTTTTT CTCCTTGTGG TCCAATGTCT CCTTGTAAAC CTTCTGGGCC 1020

TCTAGGGCCT ATATCCCCCT TATCTCCTTT TGGACCTTGT GGACCAATAT CGCCTTTGTC 1080

TCCTTTTCGA CCTTTAAAAT CTGTAAAACT TTTTTTATCA AAGGATAAAA CATGAGCACT 1140

ATTGTTTGGC AATATATACG CAACACATTC TTCAGCAGCT TCTATAGCTG GTAGTTCATC 1200

TACACGTTCA AATGAACAAA TACTTACATC TGATTCACAT GGACTAAATC CTATATTGCC 1260

ACAATCAGAA CAAAAGCTTC TAACCCTCAT CATCAATAGC CCCTTTCATC ATATCGCTAA 1320

AGCAAGAAGC ACATTCGGTA ATGGGTACTC CTGTAAATTG AGTGATTTTT TTGATGAAAT 1380

AAGCATTTTT TTCAATACAT ATTTGATAAA GTAAACGATC GTTATCACTG GcTTGCCAAG 1440

CTTCAAAAGC TGTATACATT GCCATTGCAG CGTGTTtAAC TATACACCAT TGTCCCTTAT 1500

CACCTTTACC 1510


(2) INFORMATION FOR SEQ ID NO: 210:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7287 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210:

ATTGAAGTTG CTGCAAATGC yTCGkTGATT TCATACAGAT CAATTTCTTC CGTAGTAAGT 60
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TGATTGlGlG 


CTAACAGi 111 


1 1 LrtHl LLL 1 


TTaATCGGCG AAATTCCCAT ATAGGCTGGA 


120 


TCAATALLGA 


L Tl LLALAL 1 


rTPTrT 1 IV & T 21 
b 1 V- 1 l*«nn i H 


ATAGCTAAAT AAGGAAGACC GTGTGCTTCG 


180 


GCA1 A I 1 L 1 1 


b 1 bAAGLAA 1 


A A TP a a A P.P A 


GAAGCCCCAT CATTAATGGT TGATGCATTC 


240 


CCTGCTGTTA 


CAGTAlLGIL 


1 1 L 1 1 LHx\nJ\ 


ACTGTTTTAA GCGTTCCTAG CTTCTCAACG 


300 


CTCGAATTAG 


GGCGAA1LLL 


1 i, LAI LI 1 1L 


TCCACAAGCG TTCCTGATAC TTCTAATGGG 


360 


GCTATTTCGT 


CAGCGAATAT 


LLL 1 1L1 oL 1 


Tr.TP,PTTP,AG rTGCTTTTAA TTGTGAATGT 


420 


ACAGAAAATT 


GATCTTGCTC 


rp ip /™^n^ rp tv /** fp rp 

TTCTblAbl 1 


bpatpatapt TTTPHGPPAP ATTTTCAGCA 


480 


GTTAAGCCCA 


TTGCCTGACC 


ACTAAAbbbA 


TPPPTTAATP PATPATAPAT PATAfTAGAA 


540 


AAAGGCGCAT 


CGTAGCTTTC 


^ rp rn m ^ 7A 7\ 

TGTTTCGTAA 


(jimftj TIB PPT T PTAATTTAP.P. TPiPTTGGGAC 


600 


ATATTCTCAA 


TCCCGCCAGC 


TV TV m fYl TV TV TV TV ^P 

AATTAAAACT 


TPP^PTTPTP PTAATTP,AAT PA ATTftTTTC 
1LLQL11LIL L 1 rortl 1 Vjrlrt 1 vvnttiiOliiv 


660 


GCCAAAATAA 


CGGCCTTCAT 


TCCTGATLLb 


PTiPTiPPTP AT TAAPPCTPAT TP^PP^Pif^AATT 
Lr\LrtLLiLMl l/inuL<\JlUnl lo^uuvjnni i 


720 


TCATGAGACA 


AACCGCTGTT 


rp TV Hi rT* *T* TV rp *P 

TATTbLl Al 1 


r pr r rrTTnmCL P.ATTTTnnPP ATTTPCAGCT 


780 


TGTAAAACAT 


TTCCAAAGAT 


TACTTGA1 LA 


RTTTPTTPQC A A AT AP,TP,P, A ATP^TPTTTTT 


840 


AAAAGTTGTG 


TTGTAACATG 


TGTTCC1 AAb 


TPTAPPPPAP TTAPTT^IAPT TAAGCTGCCT 


900 


TTATATTTTC 


CAATTGGTGT 


TCGTAATGlA 


mpn ATZi ZiT A A PTAPTP.TTTT PAAPTAAATG 
ILnnlnnlHn Ll/iLloilll OnnL-lnnniu 


960 


CACCTCCATC 


TTGaAATATA 


rp /*• tv ro 7\ Tv A Tv rn r"* 

TCATAAAA1L 


ppiiPnBTPAP AAPTP.PPAAT TAPTGAATGT 
LLALArtlLriL ri/iL 1 oLLrtrtl ln^iunniUJ. 


1020 


TAACAGACTT 


ATAAGATATT 


TTTTAATAAT 


PTRTPfl APTT AATAPAAAAA TTPPPPATTT 


1080 


TGGTTCATTT 


CGTGCTATTT 


TATATAATGA 


RTrTflTTT A T PPTAAACf^AP, TTAAAGAAAT 
AIL lnl 1 lnl Lulnnnuunu 1 irtrtnunnni 


1140 


GACAATTGGG 


ATTGATAAAA 


TTAGTTTTTT 


TPTPPPPPPT TATTATATTP, ATATGACGGC 
1 b 1 uLLLLL 1 Inl lnlrtl i« niniunvvj^ 


1200 


ACTGGCTGaA 


GCCAGAAATG 


T AGALLL 1 Lb 


B.AAATTTPAT ATTP,HTATTfl GGCAAGACCA 


1260 


AATGGCGGTG 


AACCCAATCA 


GCCAAGATAl 


tptPAPATTT P.PAP.PPAATP, PPGCAGAAGC 


1320 


GATCTTGACC 


AAAGAAGATA 


AAGAGGbL A 1 


TPATATPP.Tr; ATTP,TPf^P,P*A PTGAGTCCAG 


1380 


TATCGATGAG 


TCAAAAGCGG 


LLbLAbl lul 


PTTAPATPHT TTAATGGGGA TTCAACCTTT 


1440 


CGCTCGCTCT 


TTCGAAATCA 


7\ /->/*> 7\ Tv rp rp /~> 

AbbAAbLl lb 


TTAPP,P,AP,PA APAHPAGGCT TACAGTTAGC 


1500 


TAAGAATCAC 


GTAGClTTAL 


a TrrrirnT a a 
AlLLAuAlnn 


AAAAftTPTTG GTCGTAGCGG CAGATATTGC 


1560 


TV TV TV TV m TV m 

AAAATATGGC 


TTAAATTC1G 


p p p c* t p b rr p 
bLbb IbribLL 


TAPAPAAHP,A GPTGGGGCGG TTGCAATGTT 


1620 


AGTTGCTAGT 


GAAClGlGlA 


1111 o«L ill 


AAAAnAGHAT AATGTGATGC TGACGCAAGA 


1680 


m tv rp fry t\ m/"* t\ z"' 

TATCTATGAC 


TTTTGGLblL 


LArtLMooLLri 


PPPP,TATPPT ATGGTCGATG GTCCTTTGTC 


1740 


AAACGAAACC 


TACATCCAAT 


CTTTTGCCCA 


AGTCTGGGAT GAACATAAAA AACGAACCGG 


1800


TCTTGATTTT 


GCAGATTATG 


ATGCTTTAGC 


GTTCCATATT CCTTACACAA AAATGGGCAA 


1860 


AAAAGCCTTA 


TTAGCAAAAA


TCTCCGACCA 


AACTGAAGCA GAACAGGAmC GAATTTTAGC


1920 


CCGTTATGAA 


GAAAGTATCG


TCTATAgTCG 


TCGCGTAGGa AACTTGTATA CgGGTTCaCT


1980 


TTATCTGGGA 


CTCaTTTCCC


tTTTAGAAAA 


TGCAACGACT TTAACCGCAG GCAATCAAAT


2040 
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TGGTTTATTC 


AGTTATGGTT 


CTGGTGctGT 


CGGTGAA1 1 1 


1 1 ^ A^. 1 (jo i o 


M>\ J. 1 M.o i AvjL, 


2100


TGGTTATCAA 


AATCATTTAC 


T\T\T\TVTV/" , TAT\TV<*"» 

AAAAAGAAAC 


TCATT 1AGGA 


rTPrTrfiiTfl 

k*. I UV^ 1 tjoA 1 A 


A 1 L-oVjAL-AoA 


2160


ACTTTCTATC 


rpnirT* 7\ m tv rp y*« 

GCTGAATATG 


AAGCCATGTT 


TrPAP?\ I\ APT 
1 GGAGAAAL. 1 


TTnpziPZiPZiP. 


zi r* zi t tc tp a 

At- A J. lunlvyA 


2220


AACGTTAGAA 


/"* tv rp t\ nmmi\ tv 

GATGAATTAA 


AATATAGTAT 


Tl G IGG 1 A I 1 


AA I AA 1 ALUb 


1 1 v^u 1 1 L* 1 1 A 




TCGAAACTAA 


GTATTTAAAT 


CCCCCGA1 iG 


a a tt a ft p ta a ii 
AA I G AAL. AAA 


nr ZiPPT Zi P Zi P 


ZiTTPZlZiPPZia 


2340


AAACTTGAAT 


GTCTAGCTCT 


TTTTATGAAA 


GCCAAAG I 1 G 


1 1 IGAGGGGG 




2400


CGATACCATC 


TGTCAAAGCT 


ta tv tv m rn rri /■* 

AAACGTTGTG 


T\TVrpTVTV/" , TVT\7\ 

AATAAGAAAC 


GTAGCGATAA 


TAAAGA i. G AA 


2460


CACCGCCGCC 


ATTGTTCCAA 


*-> fj+ tv *v tv ^ m /■* tv 

GTAACAGTGA 


TTTTAACATT 


TGGAGTAATC 


GAi I I 1G1AG 


2520


CTGGTTGGGT 


CGTCGGGCGA 


TAACTAGTCG 


TATCATAACT 


m TV TA TV TV rn TV m 

TAAAATAGTG 


CCCGAAGCTG 


2580


CACTTATTTC 


GGCTTTTCCT 


GTGTAAATTT 


CGCCATTCGT 


ATTCGTCTGC 


GTGACCGGIG 


2640


CATTCGCTTC 


AATGATTAAA 


CCCGCTTGCT 


TTTGATCTCC 


ryi rri tv rn TV TV 

TCGTGGATAA 


TAAAAAAGAi 


2700


ACGTGGTTGC 


CCATTTAGCA 


AATTCATCTG 


GAATTGGTTT 


tv t\ m ta m 

AAGTGATCCC 


/— « mrp i^-t rp/^/^» /*• 

GTTGGTGCCG 


2760


GACTCGACTT 


CGTTTGCTTT 


TCAACCGCGA 


CTTCTTCTTG 


TCTTTTTAAG 


CCTTGAG I AI 


2820


TTTGATTATT 


TAGTTTCTCA 


ACCGTTTCTT 


m tv tv /*i tv m /-* r** tv 

TAACGATCGA 


tx rr* tv nmr , jv/T , ni 

ATAATGAGGT 


TTCCCTAAA1 


2880


CGCTAACAGG 


GATCGTAGCT 


AATTTTTCTT 


CTGATTCTTT 


ta m ta rr^ m /"* 

ATATTGCCCC 


TCTGCCGTTA 


2940


ACTTTTTAGC 


TTCTGTAATA 


ATGGCATCAT 


ACTTGGTTTG 


GTCTACGGAA 


GAGGATGTGG 


3000


TTTGTCCCGT 


AGCTTCGACT 


TGTTTTGTGA 


CACTGGTTTT 


TTCTGTGGTC 


t\ rp rp tv rp/"» 

ACGTCTGATG 


3060


AATTTCCACA 


AGCACTTACA 


AACATTGAGC 


TAACGATTAA 


m tv tv tv tv m y*/ 1 ^ 

TAAACATCCC 


7V07V TV rp TV 7\ T 1 

ACAATCACAT 


3120


GCTTTTTCAT 


TTTTAAATCC 


CCCATTTGTC 


TATCTTTGTT 


ATTTTTTGTT 


TViTnrpTv TV TArprpTV 

ACTTAAA1 IA 


3180


TCCCATAGAT 


GAACAAAAGA 


AACCATTATT 


TATAAAAAAG 


CTTAGCAAGC 


GCTTGAAGAG


3240


AGGCACTTGC 


TAAACTTTTT 


TCTCACTGTT 


tv rn m tv rn m tv m 

ATTGATTGAT 


TACTTTGACT 


AGT 1 G I I G AA 


3300


ATAATAATGC 


ACTAGATGCT 


GCTCCTGGAT 


/** tv tv m tv m f~+ TV /"^ 

CAATATGACC 


TV TV *ti rp Of^ Z* 1 ''P 

AATTGCTCGT 


TGGGG I AAA 1 


3360


AAGAGGCACG 


TCCTTTTTTC 


GCCACCCAAT 


CTTTTGTTGC 


rp rp /**! t\ /"^ Tv t\ a rp 

TTCAGCAAAT 


rn r** rTi rri tv 7\ T 1 

TGT IGAA1 l 1


3420


TTTCTTCATT 


AAATTGTTCC 


rn m /~i tv tv m rr» tv 

GTTCCAATTA 


m m tv /"* tv tv f> 

CTTCAGCAAC 


T\ f~* rp /"•/"»/■*• TV 

AGGTGCCCAG 


TV TV rp /-> T\ 7\0/™»A 

AG A I GAAGGA 


3480


TCGTTTTATC 


TCCTGGTTCA 


GCTTTGCCAC 


GCATCTGAAT 


^O^T\/~>/~»rpT\/^'7v 

CCCAGG 1 AGA 


rTTTPPTPB A 

GG1 1GG1GAA 


3540


CAATTTCTTT 


TAATTGCTCT 


GGCGAACTGA 


m m m m v** 7\ tv 

TGGTTTCCAA 


ATCTTTTGTC 


UL* 111 Au 1 L-A 


3600


fTA TA m TV n% TV TV TV 

TATTCATAAA 


CGCAGTACCA 


rp TV fT> TV TV TV TV /"> 

TATAAAGGAC 


CAGAAGCTCC 


GCCAAL l TI 1 


P 7\ Ti A TPS li ZIP 

G AAA I G AAAo 


3660


CCATGGATAG 


tv /-^ m m tv ta tv m 

GACTTTAAAT 


GTCTCCGTGA 


TTGTTGTTGG 


TACTTTTTTG 


rripT, T\ A PPP Zi Zi 

i GAAAGGGAA 




TTTTGTATTC 


AGCCATACCA 


CGAGCCATGT 


TGTTACCGTG 


ATCCCCATCA 


CCAATCGGCG 


3780 


TGTCTAATTC 


ACTTAGATAG


TCTTTTTTTT 


CTTCAATAAC 


TTTTGAATAA 


TTGTCTAACC 


3840 


ATGTTTGAAT 


GTCTTTTACT 


GTTAATTCCA 


TCTTTTATCA 


CCCTTACCAA 


GAAATTGTTG 


3900 


TTACATTGCT 


TTCTAGCGCT 


GTTTGCCAGT 


CTTTTGTCGC 


CAAGTCAATC 


ATCGTTAAAG 


3960 


ATAATCCTTG 


CATGTCAATA 


GACGTCATAT 


AGTTACCCAC TTTATGGAAG 


GTCACTTGAA 


4020 
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CACCTTTATC 


TTCTAACAAT 


GTTAGCACGT


CGTTCATAAA


I AbAAA 1 1 b 1 1 b*-A 1 i AA.1 o 


4080 


GCGTGCCACC 


CATGCCATTG 


ACAAGTACCC 


CTACTGTTTG 


IbblibllbL, ILnlrtuLln^ 


4140 


TTAATGTTTT 


TTCAACTAAT 


TCTTTCGCTA 


ACTCTTTCGA 


TGGllbbAi 1 111 ItAtuAt 


4200 


GATAGCCTGG 


TTCTCCGTGA 


ATCCCTACGC 


CAAATTCAAT 


m m /-^ t\ rp rp rn rp T 1 7\ 7\ A /** rp A ft 

TTCATCTTCT bblAAAblAA 


4260


AACCTGGTTT 


GCCAACTTCA 


GGCACAGTGG 


CTGCTTTTAA 


AGCCACTCCA ATTGTTTTCG 


4320


TGGCTGATAC 


AATCTTTTCT 


CCTAAAGAAA 


CGAGCTCTTC 


TAAAGAAGCA CCTTGGCGbb 




CATGATGTCC 


TAAAATTTTA 


TGGACAAGAA 


CAGTCCCGGC 


AACGCCGCGT TTTCCAGCbb 


4440


TATAGGTGCT 


ATCTTCGACA 


GCAATATCAT 


CATCGACAAC 


GACCATTTCT ACATbAAlbi 




CTTCCATATC 


AGCCATATCT 


TTGGCCATTT 


CAAAGTTCAA 


AATATCGCCT GTA1AA1 111 


4560


TTACGATTAA 


CAGAACACCG 


GCCCCTTGGT 


CTGCCGCTTG 


GATCCCTGTT AAAATlIbAl 




CTGGTGTTGG 


TGAAGTAAAG 


ACATCACCTA 


AAACTGCGGC 


ATTTAACATT CCATCACCAA 


4680


CAAACCCAGC 


ATGTGCAGGC 


TCATGACCAC 


TTCCGCCACC 


AGATACTAAC CCAACTTGTT 


4740


TGTTTGAGTC 


GTTGCGAGCA 


ATCACGCGAG 


AATCTTCGAC 


TTGATGAACA AGGTTTCCAT 




AGCTTTTCAC 


AATTCCTTGG 


AGCATTTCTT 


CGACAATTTT 


ACCAGGTTCA TTAATAATTT 


4860


TTTTCATTTA 


GTTCCACCCT 


TTCTTACCAG 


CATTTTGAGC 


TTCTGCTAAA ATTTGCGTCA 




AATCATCTGA 


GACACCTGCA 


GTAATTGctG 


ACGCAAAGGC 


ACCTTCAACC AATGGTGCAT 


4980


CGACTAAATG 


ATAATGTTTC 


TGTTGGTCTT 


CTTCTAGCAT 


ATCAAAAGCT AATTCTGAAC 




TTAAAACGGC 


ACTGCCTAAA 


TCGGCAAAAA 


TTAAAAATTC 


TCGGTCAGAG TCCGCTTCGT 


5100


TAATCGTGTC 


AATGATTTTC 


ATTGGATCTG 


AACCTAATGA 


ACCATCTGAT GTACCACCAA 


5160


GTGAATGAAT 


TGTAATTTCT 


tCACTGGCGT 


TCATCTGTTC 


AATCATCTCT TTAAlAUt-Al 




CTGTAATCAT 


TTTGCTATGT 


GAAACGAGTA 


AAATATCTGC 


TTTCATGTGG ALALllbLii 


5280 


AATCTAATTT 


ATGTCTTGTT 


TTATATTCTT 


GTCCGACTTT 


ATCTGCAACA ATGATlbLAi 


5340


TCGCTACTTC 


TTCAACGACA 


ATTGGGAATG 


GCATTGAATG 


AATTGATTCT ItlbijAftibt 


5400 


AAGCAATTTT 


GGCTACTTCC 


AACGCTTCTT 


CATAAGTGAT 


TTCATCGACa bbAA 1 b 1 b 1 b 


5460


AAAGCGTTAA 


TGGTAAACCT 


ACAGAAGCGC 


TAAAGTCTAA 


AACAGCTTCT AAllbblbii 


5520 


TTGCTGCATC 


TTCTAATACA 


AGTTGTGCAA 


TCGTTGCAAA 


AGCTACTTT1 IbALLAlblA 


5580 


AATAACTGTG 


CGCACCAGGT 


AAGACTGTCA 


TACCGTTATG 


AATCGCATGC GCTGCAbb 1 A 


5640


AACCGCCACT 


TTCAAAACCT 


AAGCCTGAAA 


GTAAGATATT 


TGTTTCGATA ATCTTATTCA 


5700


ATGATTCAGT 


AACCACATTT 


GCATCACAAG 


CTAATTTCGC 


TTTTAACCCA TCTTCCTGGA 


5760 


GGTTTTCCCA 


GCAAAGTTTT 


GCCATGGCCA 


AAGCTGCATA 


TGTGCCGCTA GCAGCAGGCG 


5820 


TTAAGCCTTC 


TCTTGAACCC


ATTGGCAAAC 


TAGCATTAAC 


CCGAGAATAT GAATTGTGTG 


5880 


TTGCGCGTGC 


TTCAAAATAT 


GTGGATAAGG 


CATCCCCCAT 


TCCAGCAACT AAAAATCTAG


5940 


TTGGCGCTTT 


GGCAACCACT 


TTTGTATCAA


CCATAACGAC


GCTTGGACTT TGTTTAAAGT


6000 
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AAGCATAATC 


ATCAAATGAG 


CCATCTTCAT 


GATATAGTAC 


AGGTGAATGC 


P AZiP.TTP.GTP, 
bAtt.b 1 1 bb 1 b 


6060


CATCTTGAGC 


CGCAATCGTT GGTACAATAA TTAAGTTTTC ACCTTCTGCT


APGAPTTTTG 
r\\* \jr\<~> 1 1 1 1 vj 


6120


AAGCATCGAT 


TfzrTTTarrn 

lULl 1 1/-\A_*V_»0 


PP APPTAAAf 


CAATGATACA 


GTCTGCTTGT 


TPTTTTTP.TP 
Iblllllblb 




CAACGTTTTG 


PAAppppTTT 




GCGTTACTTC 


TCCACTAAAG 


bbAbbb ILIA 




CGAAGGTAAT 


rmp AAA fTTf 
b 1 t/WiL J. Jl 1 


TTPP.PTP.TTT 
1 1 bbb lull 1 


PTTCCAATkG 


TGrACGwACA 


PP A.TPAAPPT 
bbA 1 b AAbb 1 


6300 


CAGCAGGATT 


TbbaAl 1 Ab 1 


AAbbb 11111 


mrppp- A A APT 


TGAAACGAAG 


rr T\ A /^ , r* e V RAPT 1 

T AAbb 1 AAb 1 




TTAGTAATTC 


ATCTTCbbbl 


«t>/— AAPATATT 

IbAAbAlAl 1 


TPGTTGGTA A 


AATCAATGCT 


TTTblbAlbl 




TGATAGCCTC 


b 1 Ai Al 111b 


T"P- TiPBTPPZl 
1 1 nAbtt 1 bbrt 


TAGTATAAAA 


GGCTTTCATC 


TAAAAAbbA 1 




TGAACTTTTG 


GTAtbb Illy 


PTTPPA. A A A A 

til bL/Winn 


&TGATAGAAT 


T ATTTTGTGA 

jl t*. x l x x %J x 


AATAATAAAA 


6540


TATTATGATT


rr* rn m 7\ T\ TV rT" T* 

TTTAAALT 1 1 


a p a. ttp a. a rrri 
AbAl IbArtbb 


QTTGPPTTAT 


GACTGTAGAA 


mm7V7\ 7\ 7\7\ J\T7\ 

TTAAAAAA1 A 


6600


TTTTAGATTT 


AGAAAAb 1 bb 


/"AAAAATTPP 
bAAAAAl 1 bb 


Z\ A P. A ATPZXTT 


ggppttaagg 


t\ rn rri rn rn y\ *■> 

ACTbGTTTAb 


6660


CCATTATTTT 


AGTCGATTAT 


n a Arrrrr a p 
AAAbbbbbAb 


PTP.TGHPGAA 


APAT AGTPAA 


GTTCAACCC r 


6720


TTTGCCAACT 


GGTACbTCAb 


irrrrTr a pp 
lbbbb 1 bAbb 


TPTPP.AAQPT 


TTGTGAAAAA 


TGTGATGCTC 


6780


GTGGTGGCTT 


GGAAGbTbl 1 


bbbAb 1 bb 1 b 


APPPTTTT liT 
rt\J^^. 11 1 1 ril 


PTAPGGTTGT 

\* J. nUrfU x x \j x 


CACTTTAACA 


6840


TCGTTGATAT 


bbbbA 1 bbbb 


A T 1 T 21 TT P.T TP. 
AI 1A1 lbl lb 


ATGAPPAGTA 


TGTAGGAGCC 


ATTATGGCTG 


6900


GTGAAGTTTT 


gpt pg a ag at 


GAfTAAGAAG 


AGTTAGAACA 


GGTTTTAACC 


ATGAATGATG 


6960


CCTTCATTCA 


AGAATTTTTA 


GTAACACACC 


AAGAGCTGTA 


TCAACAATAC 


CCTGTTT 1 Ab 


7020


AACTAGCAGA 


CTTAGAAAAG 


AGCGCGACAT 


TAATCGCTGA 


TCTAAGCCAA 


T&TBTTSTTT 
1A1A1 Irtl 1 1 


7080 


CTGAAGCCAT 


CAAAAAAGAT 


TATTTAATCA 


ATACGTACAA 


ACAATCCTTA 


CAGATTAGTT 


7140 


CGCGAAAAGA 


AGAGCTGGAG 


CCATTGAATA 


TGATCCAAAA 


AGATATTCmA 


CAAmCTTTAT 


7200 


TAACTGCTCA 


ATTAACTCCT 


TCAAGkGCCT 


tCCAAmCTAA 


AAwTAAGCAa 


TaCCAcCGGG 


7260 


TTTAGATGnC 


CCTTGGGGAA 


ACCAATn 








7287 


(2) INFORMATION FOR SEQ ID NO: 211: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3756 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

GTATGCCAAT CGTTTTAATT ACTTGTTGGT CACCGCTAAC GATGCCCACA TCAATCAACT 60 

TGCCAACTAA TTTAGGGACG CCAAGGGTTC CTGCGATTTG CAAACAGATG GTTAAAGTTG 120 

TCGCAAGAAC AATCCATTTA TTTTCTTTAA TAAACTCTTT CATTAATTTC ATGAAATTTC 180 

CTCCTTAAAC AATACCGTTT TAGCTGTTAC AAAGAGTCAT TGTACACGTc AAGTAACTTG 240

AATGTCAACA GTAGAAATGC TAAACTTTTT GATGGAGGGA ACAGTATGGA AAAAAATTAC 300 

ACAtAAAAGa TGTTTCTGAG TTGTTAAATA TCCCTAAGTC CACGATTCGT TATTGGGACG 360 
AGCAAGGGTT GATTAGTACA ACACGCAATG AAGAAAACGG ATATCGAACC TTTGATTTAG 420

AAGATTTGTT CAAAATATAT GATATTGATT TTTATCGAAA AATGGGGATT CCTATCAAAG 480

ATATGTTGAA TTTATATAAA AAGACACCTA CCGATCTCTA TGCCACCATG GAAGCAACAG 540

AAGAACAACT GGGGCAAGAA ATAAAAAGGC TTCAACATAA ATACAAAGAA ATGAAGAAAA 600

GAAAATTACA TTTACATGCG TTAATTGATT TAGAAACATT GGTTTTTTCA GATGAACCAA 660

TTCCTTTTGA TAAAATTGTT GTTGTTGATA TTGAGGATCC AGAAGAGATG AAAGTTTACA 720

TGGAACACAT TTCAACTTTA GGTATTTACG GGGATTTAAG TGAATCCAAT GAACTGACGT 780

ATGGGATTTG TTTGGCAAAT CATGTGAATA AAAAAAGTTT AGATAAAGTG ATTTGGCTCA 840

AAGAATCAAG TAGCCAATAC AAAGGCTTTA TTTTAAAAGT CAATACACAA GATACCCGTT 900

TGAATAATAT CGAAGAAGTA AAAAAGCGTT TAGCAAAACA AGGATTTGAA ACAGGCATCG 960

TTGTCGGAGA GCATATCTTG ACACAACTAG AAGAGGAAAA CCAATATATT GAATATTATT 1020

ATGGTTGGAT TGAAATTTTa GGGGAAAaGT AAAACAATTA AaAAAAGGAA CTAGCAATAG 1080

CAAATGTAGC TAAGCTAGTT TCTTTTTTTG GTAAATGTAA TGAAAATTAC ATGAAAAAAG 1140

TTTTAGTTTT TAAAACAATT TAGAACGAAT GTTATTAAAG GGAAAACAAT AAAAAAATTA 1200

CTCTTAAAGC TAGAACTTAC CTATGTTTGT AATTTTTTAG GGAAAGTCAA TATATTTGCT 1260

TCATATAAAA CGTGTTATTT AATTCACGAT TTTTTGTTGA AATATCGGGT TTTTTCTTTT 1320

TTTAGCTAAT ATAATGTGTT ACTTTTTGAT TATAAGGAGA AACGGAAAAT CGTACGTTAA 1380

AAAAAAGTTG TTGTAGCTAA ATAAAGAAAG AATTGAAGGT TGAAAAGTAT TTTATGGAGG 1440

TGGATGTTGA GAAACAATTA GTAATTGTAA AAGTTGTTTG AAGATGATTT GTTTAAGTAA 1500

ATGAAAAGAA CGAAGGAGAA TGTAAATGAA TAAAGCAGTT AAAAATTTTG TGAGTTACCT 1560

TATGATTACC ATGCTGTTTA TTTTAAATTT GTTACCAATG ATGAACGCAT TTGCTCAAGA 1620

AGTAACAAGT GATGCTGAGA AGACTGTTGA AAAAGACGGG CTCAAAGTAA TAGGGAAAAT 1680

AGAAGATACA TCGTCTCAAG AAGATATTAA AACGGTCACT TACGAAtAAC AAATACGCGA 1740

GATGTACCAA TCAAAGATCT CATTTTAAAA CAAAAAAATA CCAATGATAG TCCAATCAAA 1800

TTTGTTTTAG ATACTTTAAG TGAAGAGCGA GGACCTACGT CTTTAGAAGA ACAAGCAAAG 1860

GTTGAAACAA ATGAGAAAGA TCAAACAACC GATATTAAAC TGCTTAACCT TCAACCAAAT 1920

TCGACAAGAA AGATTACCAT AAATGGGCAA ATAACTACAA AAGCGTCGAA CAAGTTATTG 1980

GTCAGCGTAC TAATAGAAGA TAATGAAAAA GGAACACTAG TAATTGATTT ACCTAGTAAA 2040

GATATACTAG CAGATAAAGA GTCGGTTTCG AAGGAAAAAC AGGAAACTTC AGAAACAAAA 2100

GTAGAAAATC AGGCAAACGA AACAGCCTCT TCTACAAATG AAATGACCGC AACTACTAGT 2160

AACGAAACAA AGCCTGAAGC AGGAAAGGCG ATAGAAAGTA TTCAAGAGAC AGCACTAACG 2220

CAGGCTACCG AAAGTCCTGA GCAACCTCCA TTAAAGGCGC AACCTACTGG TCCGTTAGTG 2280

CCACCAACAC CTGGTCGGGG GTTTAATACA CCGATTTATC AAAGCGTTCA TAAAGGAGAA 2340






CTTTTTTCAA CGGGAAATAC CAACTTAAAA ATTGCGAATG AAAATACGGC GGCTGCACAA 2400

ACATTTTTAA ACACACGAGG AGCAAGTAGT GGTTATGCAA TTAACAATTT TCCTTTAGAA 2460

TTTGCAGATG TTGACAATGA TCCAAATACG TATAACTCTA GTCGGGCTTA TATTGATTTA 2520

AATGGTGCAA AAGAGATTGC TTGGGCGGGC TTATTTTGGA GTGCATCTAG ATATAAAGGC 2580

CCTGCTTACG GAACAAATCT TTCTGATGAA GAAATTAGTG CACCAGTTCA ATTTACTACA 2640

CCAAATGGAA CCGTACAGCG TGTTTCGCCC CAAAGGTACC ATCGTATCGA TCAAGATGCA 2700

ACAAACCCAG GACAACGTTT CGGGTACAAT AACACTGGAT tttctaatta TGCAGATGTA 2760

ACTTCAATTT TACAAGGGGA TAAAAGTGCG ACAGGGAGTT ATACGTTGGC AGATATTCCT 2820

ATGACAAGTA GTTTAAATGG TCAATATCAA TATTATAACT TTAGTGGTTG GAGTTTGTTT 2880

GTTGTTACAA AGGATCAGGC AAGTAAGTCA AGAGCTTTTA GTATTTACTA TGGAGCACGT 2940

GGTAATGCTG CTGGAACCAA TAATGAATTT ACTATGAGCA ACTTTTTAAC AGCAAAACAA 3000

GGAAATCTTG ATCCAATTGT GACCTGGTTT ACTGTTCAAG GAGATAAATA CTGGACTGGA 3060

GACAACGCAC AAATTAAAAA TAGCGCAGGA ACTTGGGTAA ATATTTCGAA CACGCTCAAT 3120

CCAGTTAACA ATGCTATGAA CGCAACTGTG ACAGATAACG ATGAACATAT GGTAGACAAG 3180

TATCCAGGGA AATTTGCGCC GGATCATCCT AATTTTTTAG ATATTGATAT CGATCGTATG 3240

GCAATTCCCG AAGGTGTTTT AAATGCTGGA CAAAACCAAA TTAATTTTAG AACAACAAGT 3300

AGTGGTGACG ATTATTCTAC GAATGCGATT GGTTTTGCTG TAAACGCAGA AACGCCTGAA 3360

TTTGAAATTA AGAAAGAAAT TGTGGAACCA AAAGAAACTT ACAAAGTTGG TGAAACAATC 3420

ACGTATCGTG TCTCACTAAA AAATACAAAA GCTGATTCTG AAGCGATAAA TTCAGTCTCG 3480

AAGGATGCAC TAGATGGTCG CTTAAATTAT TTGCCAGGCT CTTTAAAAAT TATTAGTGGA 3540

CCAAATTCTG GTGAAAAAAC GGATGCTTCA GGCGATGATC AAGCTGAATA TGATGAAACC 3600

AATAAACAAA TTATAGTCCG TGTCGGCAAT GGTGCAACTG CTACACAAGG AGGTAGCTAC 3660

AAAGCTGATA CAGCTGAAAC AATTTACGAA TTTAAAGCGC GGATAAATGA ACGAGCAAAA 3720

GCAAATGAAT TAGTGCCTAA TAGCGCACCG TTGAGG 3756


(2) INFORMATION FOR SEQ ID NO: 212: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1046 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

ATAGAAAAGC ATTATGtGTC CCCaGCGGTT TcCaTGAGGC CaCAAAAGGT TCCATACCGA 60

CTTTAATaTG TTGAGGATCA GATAACCATT TAGCAAAAAA CTCATGCCAA ATTCGTTCTC 120 

TTCCTGTAAA TAAATCAACA GTATCCGATT CCGCACATAA GTAAAATAAC AATGGTCCCA 180 
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ATGCAGCTAC 


TCCGACAAAA 


CCAAATAGCC 


AAACTTTCGA 


ACGTTGCAAG 


AAACb 1111b 




GTAAAAGATT 


ATCCACCAGA 


ATAAAACCAA 


CTAAAATGGC 


tv tv TV T\ tv /"* ^ ^ 

AAAAGAGGTT 


TT 1 bAi IvjAC 


300


TGACCCATGT 


ACCTAAAACA 


CCACCAATGT 


ATACAGGTAA 


AAGCAGCAGG 


TTAAAAAUL 1 




TATTTCTATA 


TGCTTTGATC 


AAACTACTGG 


CCATCATTGT 


TGAaAACATC 


AACGTTtjV^AC 


420


CAATCGTATT 


GGTATTAATC 


CAAATACTTT 


CTAGTTTATT 


TGATTCTTTA 


7\ H TV T\ H"l T* T* T 1 

AAAATT l^bl 


480


CTGCTGGAAT 


CAACTGAGGC 


AACTCCACGG 


ACATCCGATA 


CAAAGTCGCA 


CTCAAAGCAA 


540


CTAAAGAGAC 


AATTAATAAA 


AGCGTTTGAT 


CCAAACGATC 


AAAGACACAA 


ACTTTAAAAC 


600


AAAGAACCAA 


CAAAATAATT 


AAAGGAATCA 


TTGTATCTGT 


TTGCATCGTG 


CTATGACGGA 


660


TTGATTCCGT 


AAAGTGAAaC 


AAGAGCCAGG 


TGGCTAGCGC 


CAACGCTACG 


GTCAGCCAGT 


720


CAATAAATTT 


TAATTTCGTA 


CAATTTAATA 


GCAGTACAAG 


TATTACTGCT 


tv rn t\ t\ m TV T\ rn 7\ 

ATTAATAATA 


780


AAATTGAACT 


ATTGGCATAC 


ATATATTTTG


TGACTGGCGC 


ATCAATTCCT 


ACGACCCACA 


840 


TCGTATACGA 


GCCGATCATT 


GTAACTAATA 


AAATAAGTCG 


ACCGATTCTA 


TTCAATACTT 


900 


TTCCCATATT 


CmAACCTCAT 


TCCATGAATG 


AATCACTAGT 


CGTTTATTCG 


ATAAATATAT 


960 


GCCCCGTTCT 


GATCAGGTCC 


ATATAATTTT 


TCAAAATGAA 


TCGTTGGTGT 


GTCTAACGGT 


1020 


TCTAATGGTT 


TATAAACGAC 


CGCATA 








1046 



(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 



TAAGTCGGAT TATTGTCGCC TATAATGACC AAAGAGAGCC GGTGACGGTT AAAGATTTAA 60

ACGTGGCAGG TGCGCTGACT TTATTATTAA AAGACGCCTT AAAACCTAAT TTAGTTCAAA 120

CGATTGAAGG AACTCCAGCT TTTGTTCACG GTGGTCCTTT TGCCAATATC GCTCATGGAT 180

GTAACAGTAT TTTAGCGACA AAAACGGCTT TGCATCTAGG GGATTATGTC GTAACAGAAG 240

CAGGTTTTGG TGGCGACTTA GGTGGCGAAA AATTTTTAGA TATTAAAGTA CCTTCTTTAG 300

GAAAAGCACC CGATGCCGTT GTGATTGTGG CAACGATTCG CGCGTTGAAA ATGCATGGCG 360

GTTTGGCGAA AGATCAGTTA GCAGAAGAAA ATTTACCAGC GTTACAAAAA GGCTTTGCCA 420

ACTTAGAAAA ACATATTCAA AACATGCAAC GTTATGATGT ACCAGTAGTT GTTGCAATTA 480

ATGAGTTTAC ACAGGATACA GAGAAGGAAA TTCAACTGTT GAAAGAAGCC TGCCAAGCGT 540

TAGGCGTGCC CGTCGAGTTA ACAAGTGTTT GGGCGCAAGG TGGCGCAGGT GGCACGAATC 600

TAGCGAAAAC AGTAGTAGCA GAGATTGAAG CCGATACAAA ACAGTTTCAA CCTTTATATA 660

ATCCTCGCCA AACTATTGAA GAAAAAATTC AAGCCATTGT ACAAACGATT TACGGCGGGG 720

AACAGGCGAT TTTTTCTCCG AAAGCGCAAA AACAAATCGC AGACTTTACA AAAAATGGTT 780

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GGGATCAATT ACCAATCTGC ATGGCGAAGA CACAGTATTC ACTTTCTGAC GATCCTcAAA 840 

AATTAGGACG GCCAGAAGGA TTTACGATTA CAATTCGTGA ACTAGTTCCA AAAATCGGTG 900 

CAGGGTTTAT CGTGGCGTTA ACGGGTGACA TTTtAACGAT GCCTGGCTTG CCGAAGgTTC 960 

CTGCTGCGTT AAAtATGGgA tGTGGACGAA ACGGGTCAAG CGAGCGGTCT CTTTTAAAGC 1020 

CGCTCTTTCC TGAGACCTAT TTTTAGTATA TACTTGAAAG TAGCTCAATA AAAAATAATG 1080 

ACGGAAATTT ATTGCGTAGT TTATTATAAG GAGCAACGAC TATGGGGGAA CGAGGAGATA 1140

CATTTTTAGC ACACTTTAAT CGCATTGAAA AGTGGATGAG AGAGAGACTG AACAATCCTG 1200

TCAACATGGG ATTCAGTGAA ATGACAAGAC GTCTTTCCCG AAAAGAAAAA GGACAGATTG 1260 

CTCAATTTGA AGAGGATTTG CTTCAAATGG CTCAGTTAAG AAATGCCATC GTTCATGAGC 1320 

GAATTGCTGA TGATTTTGTA ATTGCAGAGC CAAACGAATG GGCAGTTCAA CGAATTGAAA 1380 

CGATTGAACA AGCTTTAACA AAACCAGAAA AAGTAGTGCC TAAATTTGCG AAACGTGTTA 1440

CAGCTTTTGA AATCACCACT TCTTTAGAAA CGTTATTAAC GATTATTGCA AAAAAACAAT 1500 

ATTCTCAATT TCCGATTTAT GAAAAAGGTG TATTTAAAGG ATTAATCACG GTTCGAGGGA 1560 

TTGGTGTATG GTTTGCCATT GAGAGCACCA AAGGCGAAGT TCACATTGCC AACCGAACGG 1620 

TGCGAGAATT GTTGGCATCA AATTATAAAC GCAGCAACTA TCAATTTGTT TCAATTGAAG 1680 

CAACCGTTTT TGAAGTTGAA CAAATGTTTC GGGAGCAGCC ACGCTTAGAA GCTGTGCTTA 1740

TTAGTAAAAG TGGCCATCCT AATGGTGAGT TAGTAGGCAT TGTTCGACCA AGAGATTTAG 1800 

CAAGTATTCA TAGAGAAAAG GAATGACGCG ATTGTTAGTA GTATATTTTT TAAyCaGTGC 1860

CTTATTAGTA GGGCTTGACC AATGGAGCAA ATATTTGACG GTCCAAAACA TTTCTTTGGG 1920 

tGAAACAAAA GAATTCATTC CTGGATTTTT GTCGCTGACC CATTTAAGAA ATACCGGAGC 1980 

TGCTTGGAGC TTATTAGAAG GAAAGATGAT ATTCTTTTAT GTCATCACAG TAATTGTTAG 2040

TGTCGTGATT ATTTATTTAT TAATTAAGAA TTATAAAAAA AGTATCTGGT ATTCTGTAGG 2100 

TTTATCTTTT GTCTTAGCAG GTGCCATTGG GAATTTTATT GATCGTGTAC GTTTAGGCTA 2160 

CGTAGTGGAC ATGCTCCAAA CGGATTTTAT GAATTTTCCT ATTTTTAATG TTGCAGATAG 2220 

TACGTTAGTG GTCGGCGTTA TTTGTATATT TATTTATTTG ATTTTAGATG AAAAAGCAGC 2280 

AAAGGAAGGC AAAAATGGAA CAAATTAACG TAACGATTCA AACAGAAAAA GGCCGGATTG 2340

ATAAAGTATT AAGTGAACAT TTACCCAATC ATTCCCGCTC GCAAATTCAA CAATGGTTAA 2400

AAGAAAAACA CGTAACAGTT AATGGTGAAA CGGTCCGCGC AAATTACAAA GTTACAGCAG 2460

GCGATGAAAT TGTTGTAACA ATTCCTGAAC CAGAAGTTCT AGATATGGTA GCAGAGGATA 2520 

TTCCTTTGAC CATTGTTTAT GAAGACAATG ATGTGATTGT TGTCAACAAA CCTCAAGGCA 2580 

TGGTCGTTCA TCCTTCGGCA GGGCATCCCA AGGGTACATT AGTGAACGCG TTACTTCATC 2640

ATGCGGATCA CTTATCGGCG ATTAATGATG TGGTGCGACC GGGAATTGTT CATCGAATTG 2700 

ATAAAGATAC TTCGGGGCTA TTGATGGTGG CTAAAAATGA TCAAGCCCAT GAATCATTAG 2760
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CGAAACAGCT AAAAGAAAAA ACATCGTTAC GTAAATACGT TGCCTTGGTC CACGGTGTTA 2820 

TTCCTCACGA TAAGGGAGAG ATAAATGCGC CTATTGGTCG TTCCAAGGTT GACCGCAAAA 2880 

TGCAAGCGGT CATTGAAGGC GGTAAAGAAG CAGTCACGCA TTTTACTGTT TTAGAGCGTT 2940

TTGATGCATT TACCTTAGTT GAATTACAAC TAGAAACAGG CCGGACACAT CAAATCCGTG 3000 

TTCACATGAA ATATATCGGT TATCCATTAG CTGGAGATCC GTTATATGGA CCGAAAAAAA 3060 

CATTGCCTGG CAATGGACAA TTTTTACATG CGAAGTTATT AGGCTTTACA CATCCTACTA 3120 

CAGGTGAACA ATTAGTCTTT GAAGCACCTT TACCAGAAGT TTTTGAAAAA ACTTTACACC 3180 

AATTAAGAAA TAATCATTGA TTTTTTTCTG AAGAAGAGGT AAACTCTTTT TTAAGAAGAA 3240

AAATAACTAA ATAACTGTAA CCTTTAAGTT TAGTCCCGTG AGGCTAAGAA GGAATCATGC 3300 

AAAAGAACAC TAGGTGCGCT TAGTTAGTAA GTTGTGTGGG CTACATGGTA CCCTTGTACC 3360 

TTTTTAAACT GTAACCAACA TCCTCCTATC CTCATACGGG TAGGAGGATT TTTTGTTGGA 3420 

AAAAATAACA GGACAATTTC AATAAAAACC CACTTATTTT TCAGCTGCTC AATTGAAGGA 3480 

AGAATCGCTG ACCCCAAGAC AGCGAGGACG TAGATAAAAA AGAAATGAGG AGGAAATTAT 3540 

TATGCCAAAG AAAGAAGTTG TTGACGCAgT CACAATGAAA CGAGCGCTTA CTCGAATCTC 3600 

GTATGAAATT ATTGAAAGAA ACAAAGGAAT TCAAGACATT GTTTTAGTAG GAATTAAAAC 3660 

GCGTGGGATA TACATTGCAC AACGTCTAGC CGAACGCTTA AAACAATTAG AAGATATTGA 3720 

CGTGCCAGTT GGTGAATTAG ATATTACTTT GTATCGTGAT GATGTGAAAG ATATGGAGGA 3780 

ACCTGaATTA CATTCTTCTG ACGTACCAGT CTCAATTGAA GGTAAAGAAG TTATTTTAGT 3840 

GGATGATGTC TTGTACACGG GCCGGACCAT TCGTGCTGCG ATGGACGCTG TTATGGATTT 3900 

AGGTCGTCCG AGAAAAATCT CTTTAGCAGT CTTAGTAGAT CGTGGACATC GGGAACTACC 3960 

GATTCGTGCC GATTATGTAG GAAAAAATAT TCCAACATCA AAAACTGAGG AAATTATTGT 4020

GGAAATGGAA GAGCGTGACG GCGCAGATCG TATTATGATC AGTAAAGGAA ACGAATAAAA 4080

AGGAGTGAAC AACGTTGTCA GAAAAAGAAT TTCGAAATGA GGATGCAGTC TTAGATATTA 4140

AAGACAGACC GCAAGCTTTT CACTGGGTTG GCCTAAGTTT ACAACATTTA TTTACGATGT 4200 

TTGGTGCAAC GGTTTTAGTA CCAATCTTAG TGGGAATTGA CCCAGGGATT GCTTTAGTCA 4260

GTTCAGGATT AGGAACAATG GTCTATCTGA TTACTACAAA AGGAAAAATC CCCGCTTATC 4320

TTGGCaGTAG CTTTGCTTTT aTTGCAGCGA TGCAGATGTT AATGAAGAGT GATGGTTATC 4380

CAGCCATTGC CCAAGGTGCT ATGACAACCG GCTTGGTATA CCTAATCGTT TCATTAATCA 4440

TTAAAAAAAT TGGTTCCGAC TGGTTGGATA AAATTTTACC GCCTATCGTT GTAGGACCTG 4500

TCGTGATGGT TATCGGTTTA GGCTTGGCGG CTAATGCCGC AAATAACGCA ATGTATAACA 4560

ACAACGTTTA TGATTTTAAA TATATTGCCG TAGCCTTAAT TACTTTAGGC TTAACCATTT 4620

TTTACAACAT GTTTTTCAAA GGCTTCCTAG GATTAATTCC AATTCTTTTG GGAATTGTCA 4680

GTGGGTACCT AGTCGCTTTA GCTTTCGGAA TTATTGATTT AACCCCTATT AAAGAAGCTG 4740
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CGTGGTTTGC TTTACCGAAT TTTGAAGTAC CTTTTGTTCA ATATGAACCG AAATTGTATT 4800 

TAAATGCCAT TACAACAATG GCACCAATTG CTTTTGTAAC AATGACAGAA CATATTGGTC 4860 

ATTTAATGGT CTTGaATAAA TTAACCAAGC GCAACTTTTT CCAAGACCCA GGCTTATCTA 4920

AAACTTTAAT GGGAGATGGC TTAGCGCAAA TTGTAGCAGG TTTTGTTGGT GGACCACCAG 4980

TTACAAGTTA TGGTGAAAAT ATCGGCGTGC TAGCAATTAC AAGAGTACAC AGTGTCTTCG 5040

TGATTGGTGG AGCCGCGGTG TTCGCTGTAG CTTTAGGCTT TGTTGGAAAA TTAAGTGCCT 5100 

TGaTTCTAAG TATCCCAGGA CCAGTTaTTT nCAGGAATTA GTTTCGTCTT GkTTCGGTGT 5160 

GaTTGCCGCA ATgGTTTGaA AATCtTAATC GATAATAAGA nTAATTTGGA TAAGAAAAAG 5220 

AATTnTTGAT TGC 5233 
(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

TnATAGTATA TTTTTTTAGT CAAAAAATCA ATAGTTTTTG CTAAAAAAAA GTAAAAAAAG 60 

CTTTTTCTTG CtTTTTTTTA CTAAAAAGAA ATATACTATA TGCTATAATA GTACTTGAGA 120 

AGGAGGCATT TAGAATGGCA ACTATCACAG ATATCGCAAA AGTCGCGAAT GTTTCAATTT 180 

CAACTGTATC GCGAGTTCTG AACTATGATC CAAACTTATC CGTTACAGAG GAAACCAAGC 240

GAAAAATTTT TGAAGCAGCA GAAACCCTGA ATTATACGAA ATATAAAACT AAAAATAAAA 300 

ACAAACAGCA GGAGCAACTG ACACCTAACC TTTCCGTACA AACAGCTCCG CAAGAGCCTT 360 

CGATTGCCGT CGTTCAGTGG CGTTCAGATG ACGAAGAACT AACGGATATT TACTATATGT 420 

CGATTCGTTT AGGCGCTGAA AAGCGGGCCG AAGAGTTAGG TTACAATATT CTAAAAGTTT 480 

CTCAATTGGA ACAACATAAC CTTCAAGGGA TTGATGGCAT TTTGGCAATT GGCAAATTCA 540

CTCAAAAGAC GTTACAAGAG TTGCAGCAAC TGCATCCCAA TCTTTGTGTA ATTGGCAGTA 600 

ATTTCCCATT GGAAGAATTT GATTCTGTAA ATACTGATTT TTATCAAGCA ACCGAGATTG 660 

CCTTAACCCA TCTTTTGGAA TTGGGCCACG AAAAAATTGC CTTTATTGGT GCCGAAGAAA 720 

GCGAAAATAT GTATGGTTTC CGTCGCTATA AAACACCGAC AACCAACGCT TATCTTGACA 780 

TTATGCAGCA TTATCACTTG TTTAACGAAG ATTATTTTAT CTTAAAAGAA AATGGCATGT 840 

TGGATGTCAA AACGGGGGAA CAATTGACAG AAGAAGCGTT AGAAAAATGG GGAAATGATT 900 

TACCTACTGC GATTCTTGCT TATAATGACG CCTTTGCAAT CGGTGTGATA CATACGTTAG 960

CCGCACACGG TATAAAgGTT CCtGAAGAAA TTAGTGTCAT GGGGATCAAT GATATCTCTA 1020 

TTTCTCAATA TGTCTCACCG CCGCTATCCT CTGTCCATGC CTTTACTGAA GAAATGGGCG 1080 
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AGACAGGCAT 


TAATTTATTG 


CACAAACGAA 


m try /"^ tv tv 7\ m /■* 

TXCAAATGCC 


TAGCATTCGI 


Lb 1 AGAG 1 GG 


1140


TGTTAAACAC 


AGAATTGGTT 


GTCCGCCAAT 


/*■ tv tv tv rr» tv 

CGACAACTAG 


TV /"•/"•/""'/""'TV 

CCCACGCCAG 


A A AT A A A A A A 

AAAi AAAAAA 


1200


TAAGGACCGC 


TCCTATAAAG 


GAACCGTCCT 


TATTTTTTAT 


TTGTTTTGAC 


/"> tv tv ATA tv rnm <-> 

G AAA J. AATTC 


1260


AATTTGTGTT 


AATGCTGGAA 


ATGGTGATGG 


TTCATCTGCT 


*n/">?\ tv rp m i\ t\ r** hp 

TGAATTAAG1 


/* , rn/" , TTA/"^T7VT 
lltjll AG I A I 


1320


CACCCACTCC 


GCTGCCCGTT 


TCGCAAAGGT 


AAAGCTCTGG 


CGTTGCTCAG 


Tr"^T T A TV /~> TV ^ rn 

TCTTAACAGI 


1380


TTTAAAAACT 


TCATGTGTAC 


CGTCGGAAAA 


TTTTAAAGTC 


T\ rp m rp rp r** 

ACTTGTTGGG 


AA1AAJ. 


1440


ATGGGGAAAG 


TCCGCTCGTA 


ATGTTAAGAT 


/-i tv fry m rri TV fT\f*/ a + 

CACTTTATCC 


7\ tv T\ TV 7% rp rp 

AACAAGACTT 


GAGGGGGAAA 


1500


TTCAATTTTT 


AATGCGGCAT 


CTAACTGCTG 


ATTAATCCCC 


CATGATTGAT 


AAGGA1 AGGG 


1560


CCCATGATAA 


AGATTAGCGA 


AGGTACCATC 


AATCGCATTA 


TV T\ /~"» TV TV TV TV TV 

CAAGCAAAAA 


AAG IGGGA1G 


1620


ATTTCGCGTT 


TCCACGTTGG 


CAGATGCATG 


AGGATACGCA 


X* 1 ! TV* fTl TV TV TV TV fTl 

CCAGTAAAAT 


/■^ *t» in rp TV Tv tv rp /■* 

GTTGTAAAIG 


1680


ATGTGGATTT 


AACGCCCAAT 


TACGATAACT 


CTTAATTTCT 


TCTTTAGTGG 


^ TV TV FT* TV TV /** 

CAATACGCAC 


1740


ACTTATATAA 


GGTTTCTTGG 


TTTCAAAACG 


ATACGCTGCT 


CGCGCTTCGA 


TAGCGTTTTC 


1800


ATTTTTTATG 


ATTGGAAACG 


TCCATTTTTG 


TTCTGGAACA 


TAAATCAAAC 


TGCTGTCAAG 


1860


CGTAgcATcT AACTTCACCA 


TTAAATAAGT 


ATTGGGCGTA 


TCTAAACAAA 


CTTGAATCAC 


1920


ATCGCCGTCT 


TGTAATTGAC 


AATCCCGAGC 


AGCTAAACAA 


ACAAAATCTT 


TTCCTGTAAT 


1980


AGTTAACTCT 


TTCAAAGrAT 


CTGAATCATC 


TATCTTAGCT 


TTTCTAACTT 


CTTGtTCTTG 


2040


aTTTAAAATC 


GATAATGrAA 


TACACGTCAT 


ATGCTTCCCT 


CCTAGTTTTA 


CCTnTATTAT 




ATATTATTTT 


AGTAAATTAA 


TATATTTTTT 


ACTAAAATAA 


CGACTAATTA 


AAGCATAAtG


2160


TTTAGATCAA 


AAAAtAAAAA 


GGCTTTTCTT 


AAAAGTCTTC 


TTTTGTTCAA 


tv /~»rn/""» tv rn rp Tv rp 

ACTGATTATG 




TTTAAGATTT 


AAAAAGTGGG 


ACATAAGTTG 


AGCCGACCCC 


TV TV TV TV fTl m TV 

CAAAAGTTAG 


TV m rprp/Tn TV T 

ATTTCTAGGI 




CTAACTTTTT 


GGAGTCGGCT 


CAAGTCGAAA 


TGACTAATGT 


CCCACTTTCA 


Tv rp TV TV A TV TV A TV /"* 

ATAAAAAAAG 




AATAAATTTA 


TGATTTTTTC 


TGATCATCTT 


CATCCATTTT 


CGATTCCGAC 


TV TV r*"* T /"*/"*/"* T 

AACTGGGG I G 


2400


TAAAGCCCGG 


TATATCAACA 


TCGAAAATTA 


AAAAATCGTC 


rp T\ Tv Tv T\ 7\ /~* 7\ f~T^ 

T AAAAAC AG G 


I I G 1GAGAGG 


2460


AATTTGACAG 


TAGTAAATCG 


CAAAAAAAAA 


TTACACTTCA 


tv t\ m rn rn rn tv tv 

AATTTGGTAA 


tv rp rp tv rpr %f pr' r* 
All LAI G1GG 




TAAAGTTTGG 


TTTGTATTAG 


GATACAGATG 


ACCATACTTA 


m /■* tv TV TV rp /"^ 

TCCAAAGTCG 


rp /*— < rp rp TV A Af" A 

1 G X 1 AAGAGA 


2580


GGAATGCCCC 


AAACGCTTTG 


AAACAGCATA 


/-i Tfv m tv rn m rri m 

GATATTGTTG 


TTATGCTGAA 




2640


TACATGAGAA 


TGCCGTAATC 


CATGTAAGTG 


tv tv m m /"» /■* m rn rn 

AATTGGTTTG 


TV TV ^ TV /"•/"top 7\ 

ACACCAGG I A 


rnrprn<^Tv/^/^rpTv tv 


2700


TTTTTTTACT 


CGACCTAAAA AGGTCTTTGG


GGACGGCGGT 


t\ tv m ^* tv try r+ TV tv 

AATCCATCAA 


AGG1 AAAAA.1 


2760


AAAACCTAAG 


TCATCCCCGA 


TCGTTTTTTG 


GGTCTCTTTC 


CATTCCCTCA 


AGTCTACTAT 


2820 


CGTTTGCTTA 


TCTAGAGTAA 


TCGTTCGGAT 


ACTTGAAGCA 


TTTTTTGTAT 


CTGAGAAATC 


2880 


ATAATCTGTT 


CTTGTTCGAA


TATAGAGTGA 


TTTATTCACG 


CAGCAAGTAC 


CTTTATGAAA 


2940 


ATCAATATCT 


GGCCATTGTA 


GCGCAAAAAG 


TTCTTCACTT 


CTTAGTCCAG 


TTACGAACAA 


3000 


AAAGCGCATC 


ATTCGTTTAT 


AAAATGCTTC 


TTGGAAATCA 


CAGCGATATG 


AACAATTATA 


3060 
TACTTTCTTG


AAT I b A 1 bAA 


P Tx p tpp z\ ZV Tx tx 




czt^ttczttcztxc 

vart x v^o l l unvj 


TTPTf^RTTTT 

1 1 U 1 Vjrt 1 X X X 


3120 


CCCAATAGTA 


TCTACTGGAT 


ILiLAlbAAJL 


pbpttppzizizi 


n p p zi t pp. c tx p 


^mmm a A A A AfT 


3180


AATCCGTAAC 


TTATTAAAAA 


TCTGCTTAAT 


a *t» a ATTRTTr 


P&TZiPPPPZiP 
bA 1 Abbbo/iU 


P'T , A 1 r p'p' , p'P^f*p 
ulni Jl 1 iC^ 


3240


TTCTTTATTC 


GTGACAAAGC 


ATTCTTGAAC 


AAb AAA I 1 b I 


n?^ n 7\ t\ z* 1 ^ 
HjAAITILILI 1 


pnaTZvTPZiZiT 


3300


CGGTCTAATT 


TTyTCCATTT 


GCATTTTCCC 


/-» 7\ 7\ 7\ Pp 7\ A/"*- A 

GAAATAAG t A 


Abbbbbbl I i 


1 AAAb iLl irt 


3360


TCAGTTTTTA 


CATAAGTTTT 


CTCAATCGTC 


CCAAGCTTAT 


ACCAAGGGAA 


AAAA J. Ab 1 lb 




TTATAAAATT 


CTCGAAAArT 


AaTTCCTTGT 


AAATAAATCT 


CTTGaGAGTC 


Ab Ab 1 AA lib 


3480


TTTTTTAATT 


CTTCCATCGC 


ATCTTTTGCT 


TCTTTTTGCG 


TTTTAAATCC 


TCTTLI Abi A 




GGTTGTATTC 


TCTTTCCGAA 


ACGATCATAG 


CCCAGAGTTA 


AAGAAAAATA 


CCAAb Xbbbb 




CTCTTTTCAT 


CCTTATACAC 


ATTTGCAATT 


CTTCTCATCA 


GTATTATTCC 


TbbLAAbAAA 


3660


TGTGTAAGAC 


TTCTTCAATG 


ATACTGACGG 


GTATTCGTTT 


TATTTTTTTA 


TT1 1 i b 1 AAA 




AAGAGTACCC 


TTTAGTAATT 


AAAAGTGTCT 


TACATTCCTT 


/■* a A/~«ArnfTtfT«niA 

GAAGATTTTA 


rb Abbb 1 AA i 


3780


ACTGTGAAAA 


ACCCAACTTT 


ATTATATCTT 


GCCGAGTGGC 


TGTCTTCATT 


b i AbLAbb I i 


3840


CTTTTTCAGC 


TGTCAGCTCT 


TATAAAAAAT 


GTATTCATCT 


ACCAACTGCT 


CT To I b AbAA 


3900


AGCACATAGG 


ATTTTCACCT 


CTCCCCAGTA 


ATGGCGAGCC 


AATTTAGTTA 


CGGAAbTAlb 




ATTATTACAT 


TTGTAGATCC 


TGGCTCAAAT 


TTGACAATTC 


TTATTTTAGT 


CATT1 Albbb 


4020


TCTAGTAGCT 


CATCATGGCG 


CAACAAAAAA 


ACGCTCCCTA 


CAAATGGAAA 


GTATlbb i Ab 


4080


ATCATCTATT


CAGTTTTCAA 


TGGTCGTTTT 


CTTTATACTA 


ATAAAGACAA 


TTTGAAACAT 


4140


AAAAAAGCAA 


CTTCGTGTGT 


AAGAAAGTCA 


CTTCTTTCAG 


TAACTGATAT 


CcAAGCTTAC 


4200 


AGCATCTTGC 


CTTTCTTGTT 


TTGAAATATT 


TCCATtGCTC 


AwGGrCAACA 


GATTTGACCA 


4260 


ATGTTACTTG 


ATTCTTCGCA 


TTCTCTCTTG 


CGTAGCTATC 


ACAATAAAGA 


TAGTCTTtGA 


4320 


ATCcGCyACA 


AAGAaCTTCA GTGCTtAGGA AaCTTACGAT TtGGCtAATT AyCGaTGrAG 


4380 


TtACATCTAA 


GTGGTTCATG 


GGGAAGTAGT 


AAAATCGCAG 


AAAACTTCTT 


TTTATTCTGA 


4440 


CGTTTCTCTA 


CTTAAAAAAA 


CGGTCTTCTA 


GGC 






4473 


(2) INFORMATION FOR SEQ ID NO: 215: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

CAATTGTATG TGCCATCTGC ATACGTGCTG CGGCAATTGC CACTCTCAAC GTAATAGATA 60 

ATATTTCGGC ATAATCAAAT TGATTTTGAT CGAGTGGATT AATTAAATTT CGCATTTTTT 120 

TCAATGCACT ACGATAACAA CCTTGAAACA ATGTCCCTGA CCCAAATTTA ATTTTAGGGA 180 

TTTTTTCAGC CCCTTTTGTC TTTGTCACAT AGTTCATAAT TTGGTAAATA TCGTATTCTG 240 

TAAGTTCTTT TTGAGTGATA AATTCCATCA GTAAGCGTAC ATTGCTTTCT TCACCACTGA 300

TTGTAAACCC TTGCCGCGCT TGGCGAATTA AGGTTAAATC ATATGCTTGC AGCAACCGTT 360

CGACCTGTTC CAAATCACTG ACAATTGTGT TTCGACTCAC TAATAATTCA TCTGCTAGCT 420

GCTGTGCCGT TAAGTACCCT TTGACTGACA ATAACTGAAA AATCAATACA TTGACTCGTC 480

TTTTTTGATC AGGATAGGTT TCAAATCGAT CGACATCGAT AATTTCATTT TTTAACAGTA 540

AGCGTTTAGA GTCAGGTAAA TCTAGCCAGA TTCCCTTATT TCGTTTTGAA TAAAGTGTCT 600

CATCGTGCTG TTCAAACCAA GCACGAATAT CTTCTAAATC ATACTTAATT GTTCGAACGC 660

TGACAGTAAA GCTCTCAGCT AATTCTTGCG TGGTAATACT CCCTTCCAAA TCCAAAAGCA 720

TCAATAAAAT ATTTATCGCT CTTTTGGACA ACAACACGGT CATGCCTTGT CACCTCTTTT 780

CTTTTCCCTT TACACTTAAA ACTATAGCGT TATTTTATCT GTTCCACAAC ACAAAAAAGT 840

TGCACTTATA AAAGTGCAAC ACTTCTCTTT TCATACAAAA AAAATCCCTT AACAGTAGTA 900

ACTGTTAAGG GATACCACAC TATTTATTTG CATAAATAGA CAGAAACCTT CTCCACAAAA 960

GTATTCTGTC CGACACGTTA CAAATCCGGT AACTGATAGC CGTACTATCT GGTGTCGTTG 1020

TGTTTGACAA TCACTAGCCG TCAAAACGGA ATAGCCGAAG CTATCTGATT GAGTCAACTC 1080

GTCGCATAAA ACACACTTTA ACACAGAATA GTTTGGATAG CAAGTGATGG TAATTATTCG 1140

TTATCGAACC ATTCTTGATA AAAACTTTCA ACAATTTTAA CGGTTGCTAT CGTTAATTCT 1200

TTCGTCATTA CTGGGCTAGT TAACTTTTTG TCTTGTAAAC ATTGATTCAC ATGATTGATT 1260

TCATACGTAA ATTCACTAGT GAATTGTTCA GACCATTTGA CAGTGTTGCC TTGAGCATCT 1320

GTATAATAAG CACAATCTGT TTTCCAGAAA TTAGGAATAA CGATTTGCCC TTTCGTCCCA 1380

CAAATGGTCA TTTCaCTAGG TATCTTTAAA CCAACATTAA TAAAAATATT GCCCAGCGTC 1440

CCTTCAGCAA ATTTCAAAGC TAAATTACAT TGACTATCCG TCGCCCCTTG CTGATACGTT 1500

GCGGTTCCCG TAACCTCTTG AATCTCTTTC CCTAATACGT ATTGTAAGTA TTGTAATGGA 1560

TAACTTCCTG AGCCATGAAG CGCTCCACCA CCCGCTTCTC TCGAGTAAAA CCAGGGAATA 1620

TGATCCACAT TGGGATAGGC AGTAACTGAC TGAACCCATA AAATCTCTCC CAATCCCCCT 1680

TCTTGAATCG TTGCCTTTAC TTTTTGGGTA ATTGGTAAAA AGACAGACTT CTGAGCTTCC 1740

ATTAGAAAAA CCCCTTGCTC CTGTGCGATT GCAAAGAGTT CTTCTGCTTC AGCAGCATTT 1800

AATGTAAATG GTTTTTCCAG TAGGACGGGT TTTCCTTGTG ATAACGCAAG CTTAGCTGCT 1860

GAGTAATGTC CTTGATTGTA GGTGGGAATA TAAATAATAT CAATTGTTTC ATCCTTACAT 1920

AACTCTTCAT AACTGCCATA AGCCACCGGA ATCGCTAATT CTTTAGCCAT TTTTTGAGCA 1980

TTCTCTAAAC GCCTTGATGC AATCCCGCGC ACTTcAGcTT GCGCGCTTTC TCGAAGCCCT 2040

GCTACAAATC TTGGGACAAT TTGGGCAGTA CTCATAATTC CATAACGAAT TTTATCCATG 2100

TGATCGCCTC CTCTTTCACT ATACGAGAAA AACGAATATT TGGAAACCAT TTCAACAAAA 2160

AAAGAAACCG TTTCCCCAAA ACGATTTCTT TATATGTAAA TTAACGGATA CCAACGCTAA 2220

CTTTTAAAGC TTGATTGACC TGTTCCATCT CTTCACGAGA AAGTTGACAA ACCTTGTGTA 2280

GCATTCGTTC TTTATCCAAT GTTCGAATTT GTTCTAACAA AATAAGTGAT GGCGTCCTGC 2340

ACTCGTTATG GGGAATTTCG ATAAGGACTT GCGTTGGTTG CAGCCGCTTG CTTACGTTAC 2400 

GGGTAATCGG TGCAACAATC AACGTTGGAC TAAATAAGTT ACCTTTGTTA TtTTGAATAA 2460

TCaGGACAGG TCGAATCcCA CCTTGTTCAG AACCAACAAC TGGTGAGAGA TTCGCATAAA 2520 

AAACCTCACC TCTTTTTATC ATTCAGATTC CCTTCCCTCC TTTTCCTTTA AAAAATTATA 2580 

GCATATTGTT TCTTGCAATA TTTTCAACTT TTTAGCTATC CCtTAAAATA TTTGATAAAA 2640

CCACCCTTTA AAAGTCAAAA AAAGCTTAAA CAGTCGACTC TGATATAGAG ACGGCTATTT 2700

AAGCTGACTA TTACATTTTT GTTAAAATAC TATCCACTTT TGTATTAATT AAATCAATAG 2760 

CGACGTGGTT TTCGCCACCT TCTGGCACAA TGATATCTGC ATAACGTTTC GTAGGTTCAA 2820 

TAAATTGGTG ATACATGGGC TTGACGACTG TCAAATATTG TTCAATCACA GAATCCAACG 2880 

TACGGCCACG TTCTTCCATA TCACGCTTGA TTCGACGAAT AATACGGATG TCATCATCTG 2940

TATCCACATA CACTTTAATA TCCATTAAAT CCCTTAAGCG GCGATCTTCT AAAATTAAAA 3000 

TCCCTTCAAG AATAATTACT TCTTTCGGTT CTTGTATCAC CGTTTCTGTA CTTCTTGTAT 3060 

GCGCAACGTA GTCATAAACT GGCTTTTCAA TGGCTTGATA ATTTAACAGT TGTTCAACAT 3120 

GTTGAATCAA TAAATCTGTA TCAAACGCAA AAGGATGATC ATAATTGGTA TTTAAACGTT 3180

CTTCAAAACT CAAATGGCTC TGATCTTTAT AATAAGAATC CTGTTCTAAC ATCATAATTG 3240 

AATGATCGGG AAAATTATTA AAAATCGCAC GGCTAACACT GGTTTTTCCG CTTCCTGATC 3300 

CGCCAGTAAC ACCAATGATA ATTGGCTGAC TATCTTTCAT GTATAAACCT CTTTCTTTTC 3360 

AAAAACTTCA TTGTCTTTTA CTATTATAAT GAAAACTCTG AAGAATGCCA GACCTTTTCC 3420 

AAGACTTTTC ATGTGATTTT TATCTTCTTA TGAAAAATCT GCCCACACTT CTTTCAATTA 3480

CAGTATACTT ATCAATAAAA GGAGAATCCT ACCTATGATT AAAAAAGTCA AAAAAGAAGC 3540 

ACTTACAACG GCGCATTACG CTCkTCcTkT ATGAGGgCAG ATyCCTCAAA AAAAATGGGG 3600 

GGCAGATTAC CATACCCGCG GGnATTTGGC TTTGGACTAT CCGACTGGAA GAACTTTCCA 3660 

nG 3662 
(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216:

TGAnTATGAA CGACGTTATG ATGAGCACCA AGGACTAATT TTGGGTTCAG TGACAACAGC 60

CATTCCTTTA TCAAAAGAAC AACATCAAGC AATGGAAGAA AAAGCTGCCC AATTATTAGG 120

GTATGAACAA GCGCATTTAG TCAATTTAA7 TGACCCGTCA A7T37CGGCG GAGTGGTTAT 180

TGAAGCAAAC CATCAAGTAA TCGA7GGCAG CATCCGCAAA CA--.77AGAAC ,VTATGCAGCA 240

GAAGTTATTA AAATAATCTT TAGGACTACA AAGAGGTGAA T7GAATGGCT ATCAAAGCAG 300

AAGAAATCAG TGCCTTGAT7 AAAGAACAAA TCGAAAACTA TCAACAACAG CTTGCTGTTG 360

AAGAAGTTGG AACAGTCAC7 TACC-7CGGTG ACGGAATTGC CCG3GCCCAC GGTTTAGAAA 420

ATGCCATGAG TGGAGAACT7 GTTGAGTTT7 CAAACGGTTC A7AC3GAA7G GCACAAAACT 480

TAGAAACAAA TGATGTAGGG ATTATTATCC TTGGCGATTT TGAAACCATT CGTGAAGGAG 540

ATAAAGTACA ACGCACAGGC AAAATCATGG AAGTACCCGT TGGGGAAGCT ?TGATTGGCC 600

GTGTTGTTAA TCCATTAGGT CAACCATTA3 ATGGT7TAGG CGAAATTAAA ACAGACAAAA 660

CACGTCCTGT GGAAGCAACA GCTCCCGGTG TTATGCAACG TCAATCTGTT GCTGAACCAA 720

TGCAAACTGG CTTAAAAGCC ATTGATGCCC TCGTACCAAT TGGTCGTGGA CAACGGGAAT 780

TAGTTATTGG AGACCGTAAA ACAGGGAAAA CATCGATTGC AATTGATACA ATTATTAACC 840

AAAAAGGACA AGATGTAATT TGTATTTAT2 TAGCGATTGG TCAAAAAGAA TCTACTGTTC 900

GTAACCAAGT AGAAACACTT CGTAAATTT3 GTGCCTTAGA TTATACAATC GTTGTGACTG 960

CAGGGGCTTC ACAACCAGCA CCA77ACTT7 ACATTGCACC ATATGCAGGA ACTGCAATGG 1020

GTGAAGAATT CATGTATAAC GGCAAACATG TCTTAATTAT TT7TGATGAC TTATCAAAAC 1080

AAGCCGTGGC TTATCGTGAA CTT7CTCTAC TATTACGCCG TCCACCAGGT CGTGAAGCTT 1140

ATCCAGGGGA TGTCTTCTAT TTACACTCAC GTTTA7TAGA ACGTGCAGCT AAATTAAGTG 1200

ATGAATTAGG TGGCGGTTCA ATGACAGCC7 TACCATTCGT TGAAACACAA GCAGGCGATA 1260

TTTCTGCTTA TATTCCAACA AACGTTATT7 CAATCACTGA TGGACAAATT TTCTTAGAAA 1320

GCGACTTGTT CTATGCAGGC ACTCGTCCAG CCGTTGATGC TGGGTTATCT GTTTCTCGGG 1380

TCGGAGGATC AGCGCAAATC AAAGCAATGA AAAAAGTTGC CG3AACGCTT CGTTTAGACT 1440

TAGCTAGTTA CCGTGAGTTA GAAC-CCTTTA CACAA7TTGG CTCTGATTTA GATGCGGCAA 1500

CACAAGCTAA ATTAAACCGT GGTCGTCGTA CGGTAGAAAT TTTAAAACAA AAACTACATG 1560

CGCCACTGCC TGTAGAAAAA CAAGTCTTAA TTTTATATGC GTTAACACAC GGCTTTTTAG 1620

ATAGCGTGAG CGTAGATAAA ATT7TACACT TTGAACAAGA TTTATTCGAT TATTTTGATG 1680

GCAAACATGC AGACTTATTG GAA--CCATTC GTACAACGAA AGACTTACCA GATACTGATG 1740

CATTAGATGC AGCGATTACA GAG7TTTCTS AAATG7TTGC AGCGGCAAAC AATAGCGGAG 1800

ATTCAGCCAA AGAAGCTT7A GAA-AAATCG ATAACGCATA AGAGAGGTGA GCGACATTGG 1860

GTGCTTCATT AAACGAAA7C AAACAACGCA TTGCATCAAC AAAAAAGACA AGCCAAATTA 1920

CCAAAGCGAT GCAAATGG7T TCTC-CAGCCA AATTGACCAA G7CAGAAGGC GCCTCTAAAA 1980

GCTTTCAAGA ATATTCTTCA AAA-.TTCG7A GTGTCGTTAC GCA7TTAGTA GCTGCTCAGT 2040

TAAGTGAATT ACGTGAAACA GAA7AATCI7 CTCT7AGTGA AGGGAACTAT CATGTGATGT 2100

TAGCACAACG ACCTGTCAAA AAAACAGGCT ATATCGTTAT CACTTCTGAC AAAGGGTTAG 2160

TTGGTGGCTA CAATAGCTCA ATCTTAAAAC AAACCATGAG TATGATTCAA GAAGACCATG 2220 

ATTCTAATAA AGAATATGCC TTAATCGCTA TTGGTGGTAC AGGGGCGGAT TTCTTTAAAG 2280 

CACGCGGAAT CGATGTTTCC TATGAGCTAA GAGGCTTAAC GGATCAACCA ACGTTTGAAG 2340 

AAGTCCGTAA GATTGTGACA ACGGCAACGA CTATGTACCA AAACGAAGTG tTnGATGAAT 2400 

TGTATGTTTG TTATAATCAC CACGTTAACT CACTAACGAG TCAGTTTCGT GTGGAAAAAA 2460

TGTTGCCAAT TACTGATTTA GATCCATCAG AGGCGACCTC TTACGAACAA GAATATCTTT 2520 

TAGAGCCTTC ACCTGAAGCA ATCTTGGATC AATTATTACC ACAATATGCA GAAAGCTTAA 2580 

TTTATGGGGC GATCATTGAT GCGAAAACAG CAGAACATGC TGCTGGGATG ACCGCCATGA 2640

AAACAGCCAC TGACAATGCG CAAAATATTA TTAGCGATTT AACAATTTCT TATAACCGTG 2700 

CTCGTCAAGG GGCGATTACC CAAGAAATTA CAGAAATTGT TGCTGGTGCA GCTGCACTAG 2760

AATAATCTAG TCTTACGTTT GAACGTTTAT AAAAATGTTC AAAGCGTGTG ACTTTATGAA 2820 

GACTGGAGGA AAGAAATTAT GAGTTCAGGA AAGATTGTTC AAGTAATCGG TCCCGTTGTT 2880 

GACGTGGAAT TTTCATTAGA TCAATCCTTA CCCGATATTA ACAACGCTTT AGTCGTTTAT 2940 

AAAAATGGCG AACAAAACAA AAAGTAGTAC TTGAAGTCGC TTTAGAACTA GGTGATGGAG 3000 

TGATTCGTTC TATCGCTATG GAATCGACAG ATGGTTTACA ACGTGGAATG GAAGTTATCG 3060 

ATACAGGAAA ATCAATTTCA GTTCCTGTTG GTAAAGATAC ATTAGGTCGT GTGTTTAACG 3120

TTTTAGGAGA CACAATTGAC TTAGAAGCGC CATTCCCTGC AGATGCTGAA CGTAGtGrGa 3180 

TTCATAAAAA AGCGCCAGCA TTTGATGAAT TAAGTACCAG TAATGAAATT TTAGAAACAG 3240

GGATTAAAGT TATTGACTTA TTAGCACCTT ATCTAAAAGG TGGTAAAGTC GGACTTTTCG 3300 

GTGGTGCCGG TGTTGGTAAA ACCGTCTTAA TTCAAGAATT AATTCATAAT ATTGCCCAAG 3360 

AACATGGAGG GATTTCCGTC TTTACTGGTG TTGGTGAACG GACACGTGAA GGGAACGACC 3420 

TGTACTATGA AATGAAAGAT TCAGGCGTTA TTGAAAAAAC AGCCATGGTT TTTGGTCAAA 3480

TGAACGAACC ACCAGGTGCA CGGATGCGTG TGGCCTTAAC TGGGTTAACG ATTGCTGAAT 3540 

ATTTCCGTGA TGTGGAAGGA CAAGACGTGC TATTATTTAT TGATAACATT TTCCGTTTCA 3600

CCCAAGCCGG TTCAGAAGTT TCTGCCCTTT TAGGTCGGAT GCCGTCAGCC GTTGGTTACC 3660 

AACCAACCTT AGCGACTGAA ATGGGACAAT TACAAGAACG GATTACTTCA ACGAAAAAAG 3720

GATCAATTAC CTCTATTCAA GCAATCTATG TTCCAGCCGA TGACTATACC GATCCAGCGC 3780 

CAGCAACAGC GTTTGCCCAT TTGGATGCAA CAACTAACTT GGAACGTAAA TTAACCGAAC 3840

AAGGGATTTA TCCAGCGGTA GATCCGTTAG CTTCATCTTC TAGTGCCTTG GCTCCTGAAA 3900 

TTGTTGGAGA AGAACACTAC GAAkkGGCTA CCGAAGTGCA ACATATTTTA CAACGTTACC 3960

GTGAATTACA AGATATCATC GCTATTTTAG GGATGGACGA ATTATCAGAT GACGAAAAAG 4020

TACTTGTTGG TCGCGCACGC CGTGTTCAAT TCTTCTTATC TCAAAACTTT AACGTTGCTG 4080

AACAATTTAC AGGTCAACCT GGTTCTTATG TACCAGTTGC GGAAACAGTT CGTGGCTTTA 4140

AAGAAATTCT TGAAGGTAAA CATGACAATC TGCCAGAAGA AGCCTTCCGT AGTGTCGGTA 4200 

AAATTGAAGA TGCCATCGAA AAAGCGAAAC AATTAAACTA CTAGAAAGGC GGCAAACACA 4260 

ATGGATAGTC TTACTGTGAA TGTAGTAACG CCTAATGGTC TAGTTTATGA CCACCATGCA 4320

AAAATCGTTG TGGCGAAAAC AACAGATGGT GAAATTGGTA TTTTACCAAA ACATGCGCCA 4380 

ATCATCGTTC CGCTAGCGAT TGATGAAGTC CGCATCAAAC GGACCGATTC AGATACACAT 4440

GTTGACTGGG TAGCTGTAAA TGGTGGTATC ATGGAAGTCC GTGATAATGT CGTTTCTATT 4500

ATTGCCGATA GTGCCGAACG TGAACGAGAT ATCGATGTAC CTCGTGCTGA ACGAGCAAAA 4560

CAACGAGCAG AACGTTTAAT TGAAGAAGCC AAAGCAAAAG ATGACCGCGA TCAACTACGT 4620

CGTGCGACTG TTGCTTTGCA TCGTGCAATC AACCGGATCA ACGTATCGAA ACATGGGTAA 4680

AAAAGGCTGA AGGATTCAGT CACTTTTGTA GAGTTTAAGG CGCTTGCTAT TGAGTCAGTT 4740 

GCCTTAAACT CTATTTCTTC GTATTTTTTA ATAAATAGTG TCAAATTTTG GCATTTCAAG 4800

TGGATTAATT ATCGGTTTTT ATTATTTTGT GATAAAATGT TCCTACGTTA GAAAAAAGGA 4860

GGCTCCTTAT GCAATACTAT GGAGTAGATG CTATTGTTCG AATCGTTAGT CATCTTATGT 4920

TCATTTACAT AAGTTTTTGG GCGTTGCAAT CATTGCGTAT TGAGCAATTT TTCAAAACCC 4980

AATTTACACC GCAAATCAGA ATGTTGATGG TATTTTTCGC CATCGCTATC GGATACACAG 5040

TTAGCTCGTT TGCTTTAGAA TTGATTGCGC TTTGTCGAAA TCTTTTCATT GTGTATTTTC 5100 

CTTAAAAATT GAAAAATAAA AAAAGGTACA ATTTTTAGTT GAATAATGGT ATAATAGTAA 5160 

AGATTTTGAA TAGGAATACC TATTCTCATT TTTTTGGGGG GAACAAAATG GAACAAATTA 5220 

TTGTTCATGG TGGTAATACA AAATTAGAAG GAACTGTAAA AATTGAAGGC GCCAAAAACG 5280 

CTGTTTTACC AATTTTAGCA GCTACTCTTT TAGCTGAAGA AGGCGTTACT ACTTTAAAAA 5340

ATGTACCAAT TTTATCTGAT GTATTTACAA TGAATCAAGT AATTAAGCAT TTGAATGTAG 5400 

CCATTGATTT TGATGAAGAC GCGAATGAAG TAACAATAGA CGCAACACAA CCTTTAGGGA 5460

TCGAAGCAAA CTATGAGTAC GTTAGTAAAA TGCGTGCTTC TATCGTTGTA ATGGGACCTT 5520

TATTGGCTCG TAATGGTCAT GCCAAAGTGG CAATGCCAGG TGGCTGTGCC ATTGGGAAAC 5580

GACCAtTGAT TTACACTTAA AAGGCTTCCA AGCATTAGGT GCAAAAATCA TCCAAAAAAA 5640

TGGTTATATT GAAGCAATTG CGGATGAACT AATCGGTAAC ACTATTTACT TAGACTTCCC 5700 

AAGTGTCGGC GCTACGCAAA ACATTATGAT GGCTGCTGTT CGTGCGAAAG GAACAACTAT 5760 

CATTGAAAAC GTTGCTCGTG AACCTGAAAT CGTTGACTTA GCAAACATTT TAAACAAAAT 5820 

GGGTGCCAAT GTTATCGGTG CTGGGACAGA AACGATGCGT ATCGAAGGCG TCGATAAATT 5880 

ACATGCCGTA GAGCATTCAA TCGTTCAAGA TCGAATTGAA GCAGGTACGT TCATGGTAGC 5940

TGCAGCAATG ACTGAGGGCA ATGTTTTAAT TGAAGAAGCA ATTTCTGAAC ATAATCGTCC 6000 

GCTAATTTCT AAATTAACTG AAATGGGTGC AATCATTgAA GAAGAAGAAA ACGGCATCCG 6060 
TPTAATTGGT

X VJ X r\n X X OU X 


PPAAAACATC 


TAAAACCAAC 


AGATGTTAAG 


ACAATGCCGC 


ATCCTGGTTT 


6120 


pppaapagap 


ATGPAAGPAP 


AAATGACAGC 


GATCCAAATG 


TTTGCTGAAG 


GAACAAGCAT 


6180 


pptpapapaa 


APAPTPTTTG 


AAAATCGTTA 


CCAACACTTA 


GAGGAAATGC 


GTCGAATGAA 


6240


bbb x bAb 1 In 


aaaaTTPAPP 


PPAACATTGC 


CGTTATTAAT 


GGTGGCAACG 


AGTTACAAGG 


6300 


Abb Ab bAb 1 b 


PliZlPPPAPTP. 


ATTTACGGGC 


AGCTGCCGCA 


TTGATTTTAG 


TAGGGTTACG 


6360 


1 bbAAAI but 


aTT AP APPTP 


TCTCTAATTT 


GAAATATTTA 


GATCGTGGCT 


ATTATGAGTT 


6420 


T A 7\ A A A A 

CCA1 AAAAAA 


PTTPaZXAAAT 


TAGGCGCGAA 


CGTTGAACGT 


GTCAATGATG 


AAAAAATTGA 


6480 


AbAAAAAbAA 


PPa APPAPAP 


TAATPTAAAG 


GAGTCAAACG 


TATGAGTTCA 


TTTCGTTATA 


6540 


i I 1 I Aw 1 nAt 


ZVTTB.TTP.ZiZX A 


ATTPTAGTGG 

rv x x \-* x *ivj iuvj 


TCmTTTCGTT 


AGTCATTATC 


TTATTTGTTG 


6600 


TbbbaiUbAAl 


PATTPPTT AT 


PPTTTGATTG 

O vj X ± 1 \Ji \ X X Vj 


GAAATGGGAA 


TCCAATGGaT 


GTCTTTGATG 


6660 


AATvAAATTTP 

AAAAAAI lib 


bAblbAIAJ. 1 


a TP a APTTPT 


TTAAATAAAA 

X X rUui x ruuui 


ACTTATTTTA 


AGGCGGTGAA 


6720 


CAAAAb x bAA 


TTP&PPATTP 
1 1 bAbbrt 1 i kj 


TTPAPCGPCT 


TTTCTTTTTC 

XXXV^X-LXJ-J-W 


CTGAATATCT 


ATTTTATTCT 


6780 


ATbCTAI AA1 


AA 1 Ab AAb i A 


a PAT A ATP A A 


GGAGTGGGAT 


GGAAGATGCA 


AGCAACCGTA 


6840 


AC Ab AAA I Ab 


PPA a AP ATPP 
bbAAAb A x 


AATPPATGAT 


TCAGAAAAAA


TGATTATTCT 


TTTTGGAGAA 


6900 


?V/*'»'T>/*' , f" , n' , ftPPP 

ACTGCTAbbb 


ATAPPTTPA. A 
AlAbbi IbAA 


ppaaPATPPA 


PTT ATTPAAT 

Oil *\ X X Vrfrtrt X 


CATTCCCAGA 


AAAAGACCAA 


6960 


GTTACbl Ibb 


r>7\rA A PP A P A 
bAbAAbbAbA 


TPaTPTP A A A 


ATTGGCGATA 


CAAACTATAC 


AATCACAAAA 


7020 


GTGGGTTCTT 


TTTTPA AT A P 

1 1 bbbAAl. Ab 


TaapTTPPa a 


APTATPGCTC 


ACAGCACTTT 

n\/nuvynv x x * 


AATCTTTGCT 


7080 


b A I bb AbbAA 


PPP a TP. a PPa 
bbbA 1 bAbbrt 


TPATPTTATP 


PGGAATGGTG 

vw \J vjnn x vj vj x vj 


TTTATTTAAC 


GCCCCATCAG 


7140 


i lAbbAAAbA 


TZVZiPTZiTTPP 
lAAbiAl 1 vjo 


AAPAAPAATT 


GATTATTTAG


TAAATGGAGC 


GTGAATCAAT 


7200 


ppA.Tnn.naaa 
bb A 1 AAAAAA 


Pa A APPAPTP 


ATAAACGCCT 


GTCTTGGTTT 

VJ X W* X X N-* W X X i 


TGGCGTTGGT 


TTTTAAATAA 


7260 


TP a a P T 21 P.T a 
1 b AAb 1 Ab 1 r\ 


AP APPPTT AP 


TP ATTGTPTT 

X \JJ xv X X VJ X Vjr X X 


ATTGGTGCTG 


TTAATTATTC 


TAACTTTTAC 


7320 


b AAAb 1 1 1 CI 


TaTTTaTTTA 


AGCCCGTCTG 

nu v_^ w vj x v* x \j 


GCAGTTTTTC 


GGCGTTGTCG 


GCTTACCAGT 


7380 


a a T T Zi TP PPP 
AAx lAlbb^o 


P.P.P aTTTT AT 
boon l x l ini 


ATTATTTATT

X X XXX X X 


AAATCCAATT 


GTCGATTATT 


TAGAGAAAAA 


7440 


APAAATCTCA 

nvruin x v_^ x v*^i 


CGTGTCTGGA 


GTATTATCGG 


CCTATTTATT 


TTAATTGTGG 


CACTACTCAT 


7500 


TTGGGGTGGT 

X X vj \D \J VJ X wVJ X 


ATTGTAATCG 


TTCCTAAAAT 


TCGTGAGCAA 


TCTGTCAGCT 


TTGTGAATCA 


7560 


TTTTCCTCAA 


TATATTGATA


CAATTGATCA 


AAAATCGCAG 


GAAATATi AA 


PTP 7V TPPfi PT 

b 1 b A 1 bb Ab x 




TTTCGCGCAA 


TTTAGAGAAC 


AACTTGAAGC 


GGCAGGCGAT 


AAAGTTGTCA 


GCTCGTTaGG 


7680 


TACCATCATT 


AAAAACGTCT 


CAACGTyCAC 


AGTACmAGGA 


ATTGGTAATT 


tCnTTGGGGC 


7740 


GGTGGCTACG 












7750 


(2) INFORMATION FOR SEQ ID NO: 217: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:



GCCCCCAGAA TGTTCCAAAA GAAATTTTGT ATGCATTTTT AATTGTGCTG GCGATTGATA 60

GGGCTGGTGA CTTACTGAGT TGATATAATC AGCCAAACTT TCCGCTTTCG TTAACAGTTC 120

TCGATTTTGT TCATTCCAAT TATTGCCAAA GTCTTTAAAC GGATAAAGTA AACCTAGCTG 180

AACTTCTGGA TATTCCGTTT TTAATTCCCC AACTACTTCA GCTGTCCAAA GTTCAACACC 240

TAAATTGCCA GAAATCAAAA TCCATTCTAC TCCCGTTTCA ATATAACTGG CTAACTCTTT 300

TTTTAAGACA TTTTTAATAA CTGTGATTTt CGGGTCTTTt CCTtGAAATA TCCCCAATTC 360

AAAGCTGCGA TAACCGGTGA CATAGAGGGT TTTTATCATT TCCATAAGTC ATCCCTTTTC 420

TTTCAGGTCT TTTTTTtGGT ATAATACAGG CTAGGAGTGT GATAAACGTG ACGTTCCATT 480

ATCCTAACGG AATACCTTAT AATCATCATG AAGCACAATC ACCAAAAAAA CAAGTAAAAA 540

AGAAACGCCC CGTTGAATTT GGTAATCGTG GGATGGATTT CGAAGAAGCG ATCAATCAAA 600

GCAATCAATA TTATCTATTG AATAAGCAAG CAGTCGTTCA TAAGAAACCA ACTCCTGTTC 660

AAATTGTTAA AGTAGATTAT CCCAAACGAA GTGCGGgCTG TTATTAAGGA GGCCTACTTT 720

AGACAAGCTT CAACGACTGA TTATAACGGT GTATATCAAG GCCGTTACAT AGATTTTGAA 780

GCGAAAGAAA CGAAAAATAC CACGTCCTTT CCCTTTAAAA ATTTTCATCA ACATCAAATT 840

GATCATATGG AACAATGTAT AGCACAACAA GGGATTTGTT TTGTGCTATT ATGGTTTTCA 900

AGCTTAAAAC GCTGCTTCTT TTTTAGTGGT GAAAATCTAA TAACGCATTG GAAAAACCAA 960

GCAGTTACTG GAAAAAAATC TATGCCGCTT GCACTCATCG AGAAAGAAGG AATTGAATTA 1020

TCTTTTGGAA TCGCACCGCG AATCCCTTAT CTCGACGCAg TGGAACAATA TATTCAGACA 1080

CACTAGATTC ATAAAGTTTT GAAAGTATAA CAAAGGAGTT GCTATTTATG CCAACCGCAA 1140

ATTCAGGCTC GCGAGCTGCT AGACATGCAT CCAACACCTC ATCGAAGTCA AGTTACAAGA 1200

AAAAGACACA ACCGCCGAAA AAGAAAAAAC gGCTTGTTTT AAAAATCTTT CTTGGcTTAC 1260

TTATCGCTGG GaTGGTCGCT TTTTTAGCAG GCGTTGGTCT TTTTTGGtTC TATGCACGAC 1320

AAGCCCCAAA ATTAGAAGAT GACAAACTGA ATGCGACAGT TTCATCGAAG CTTTACGATA 1380

TTAATAATGA AATTTTTGAG GATTTAGGCG CAGAAAAGCG TGAATTAATT CAACCAAATG 1440

ATGTGCCTCA ATTATTAAAA GATGCCATTG TTTCTGTCGA AGACCGTCGT TTCTATAAAC 1500

ACATTGGTGT GGATCCGATT CGGATTATCG GTTCTGCCCT TTCAAATGTC AAAAATGGTG 1560

GATTACAAGG CGGAAGTACC TTAACCCAAC AATTAATTAA ATTGTCCTAC TTCTCTACAA 1620

AAGAATCTGA CCAAACCTTG AAACGGAAAG CACAAGAAGC ATGGATGGCT GTTCGTTTAG 1680

AACGTGAAAA ATCAAAAGAA GAAATTTTAA CTTACTACAT TAACAAAGTC TACATGGCGA 1740

ATGGTTTTTA TGGAATGGAA ACAGCGGCTG AAAATTATTA TGGTAAACAT TTATCTGAAC 1800

TTGATTTACC ACAAACGGCC CTACTGGCAG GAATGCCACA AGCGCCAAAC TCATATGATC 1860

CCTATACAAA GCCAGATACT GCTAAAGAAC GCCGGGATGT TGTACTTTAC ACAATGTACG 1920

ACAATAAAAA AATCTCGAAA GCCGAGTACG AAAAAGCCAA aGCTACCCCA ATCGATGAAG 1980

GCTTAGTGCC ATTAAAAGCC AGTGATGACA ATCGAAAAGT CGTAGATAAT TACGTCAAAG 2040

AAGTAATCAA TGAAGTGAAA GCAAAAACTG GTAAAAATGT TTACACAGAC GGACTAGATA 2100

TCTACACAAA CTTAGATATG AATGCCCAAA AACAACTTTA CGATATTGTC AACAGTGATC 2160

AATACGTTGC GTTTCCAGAT GATAAAATGC AAGTGGCTTC AACCGTTATT GATGTTGCCT 2220

CTGGACAAGT CCGAGCACAA ATTGGTGGTC GTCACATTCC TGATGATGTG CAATTAGGAA 2280

ACAATTTAGC TGTTAACACC CAACGTGATG TTGGTTCAAC CGTGAAACCA ATCATGGATT 2340

ATGGTCCCGC AATTGAGAAT TTAAATTATT CGACAGGTCG CCTCATGGTG GATAAACCAA 2400

CGAAATATCC TGGTACCGAC ATTGATGTCT TCAACTCTGA TTTGACATAC CAAGGCGTCA 2460

TTACTATGCG CCGTGCGATT ATGGGTTCTC GTAACACAAC CGCGGTCCAG ACGTTTGATG 2520

AAGTTGGTAA AGAAAACATC ATGCCTTTCA TTAAAGGACT AGGAATTGAC TATAAAAACT 2580

TAGAAGCGTC AAATGCCATT TCAAGTAACA CAAGCGATGT AGATGGCGAT AAATACGGGA 2640

TTTCCTCATT GAAGTTAGCA GCAGCTTATG CGGCTTTCGC CAATAACGGG ATTTATAACA 2700

AACCGTATTA CGTAAACAAA GTCGTTTTCA ATGATGGAAC TAGCGTTGAT TATCAACCTG 2760

ATGGTAAACG TGCGATGAAA GACTCTACCG CCTATATGAT GACGGATATG TTAAAAGATG 2820

TTTTAAACGG CGGAACAGGA TTTAACGGCG CCATTCCAGG CTTAATCCAA GCGGCCAAAA 2880

CAGGGACTTC TAACTATACT GATGAAGATT TAGCGCGAAT GGgCACAACA GAAAAAGGAA 2940

TTGCCCCGGA TAGTACCTTT GTTGGTTATA CAACACACTA TGCCGTTTCT GTTTGGACTG 3000

GATACAtGAT CGAAtACACC AATCTATCAA GAATACTATG GAATTGCCTC GGATGTTTAT 3060

CGTGAAATAA TGAGTTACTT GTCTCAAAAT GTTTCAAACG ATGACTGGGT GCAGCCAGAC 3120

AGTGTTGTGC GGGTAGGTAA TGAATTATAC GTGAAAGATG CCTATGAGGT ACCAAATGTA 3180

CAAGTTTTAC CAAGTACTAC CTCTTCAGCA CCACAGCCGG AATCTAGCAG TACGGTGGAA 3240

TCCTCTTCCA CAAAAGAAGC GGAAAGTTCA TCAAGTTCTA GTTCAGAAAG TGCGCCTTCT 3300

TCCTCTGAAG CGCCACCATC AACCGAACAA CCGGCAAGCT CTTCCTCAGC AGAGCAACCG 3360

GCAACATCAG AACAACCACC TGAACCTTCA AGTTCTAGTT CACAAGAACC GCCGCAACCA 3420

CCTGAAAGTA GTTCTAAACC AGATGAAAAT AAAGCAGCAT AAAAAGTACG GGGCTCATGA 3480

TGAGCCCCGT ACTTTTTTAG ATTAAAAATG GACGAAGTGA ACTGAATCAC TtCGTCCATT 3540

TTTGCTTATT TACTTTCTTT ATTTGCCAAA GCTGCACCAA CAAATCCTTT GATTAAGCGT 3600

TGTGGGCGAT TTGGACGAGA AATTAATTCT GGGTGGAATT GACAAGCCAC AAAAAATTGT 3660

TTTTCAGGAA TTTCAACAAT CTCAACTAAA CGATTATCTG GTGAAACTCC AGAGAAAACT 3720

AAGCCATTTT CTTCAAATAA TTGACGATAT TTGTTGTTAA ATTCATAACG ATGACGGTGA 3780

CGTTCTTGGA CAACATCTTC ATTTCCGTAT GCTGCCGCAG TTTTCGTTCC TTTTTTCAAT 3840

TTACATGGAT ATAAACCTAA ACGTAGTGTG CCACCTAAAT TCTCAATGTT TTCTTGATCT 3900

GCCATTAAAT CAATAATGTT ATTTGTCACG TCAGGATTAG TTTCTGCAGA ACCAGCATCT 3960

TCTAAGCCCA CAACGTTACG ACCAAATTCT ACACAGGCCA TTTGCATACC TAAGCAAATG 4020

CCAAGGAATG GTACGTCATT TTCGCGAGCA AAACGAATCG CTTCAATTTT TCCTTCAATT 4080

CCACGATCAC CAAAACCACC AGGAACCAAA ATACCATCTG CTGAACCAAT TCTTTCAGCG 4140

ACATTTTCAG CAGTTAATTC TTGTGAATCA ACCCAATCAA TTTCGATATC TGAATCAAAA 4200

TCAAACCCAG CATGTtTTAA TGCTTCTACA ACAGAAATAT AGGCATCTGG TAATTCAACA 4260

TATTTACCAA CCAGCGCAAT TTTTGTTTTC TTTTTCAAGt TArGAaCTTT TTCTTCTAAT 4320

GCACGCCACT CAGTCATGTC TGCAGCCGGT GCGTCCAATT TTAAATGATC ACAAACAATT 4380

TGATCCATGT TTTGTGCTTG AAGTGCTAAT GGAATGGAGT ACAATGTTTC AACATCACGA 4440

GATTCAATTA CCGCTTCTGG ATTAACATCA CAGAATTGTG CTAATTTGTT TTTAGTGTTT 4500

TGAGAAACAG GTAATTCTGT ACGGACCACT AAAATATTTG GTTGAATCCC TAAACTTCTT 4560

AATTCTTTGA CACTATGTTG TGTTGGTTTA GTTTTCATTT CGCCAGCCGC TTTTAAGTAA 4620

GGAATCAAGG TTGTATGGAT GTACATTACG TTGTCACTAC CCATATCAGC TTTCATTTGA 4680

CGCAATGCTT CTAGGAAAGG TAATGATTCG ATGTCGCCGA CAGTACCACC TACTTCTGTG 4740

ATAATCACAT CGGCATCCGT CATTTTCGCT GCACGCATAA TTTTTTCTTT GATTTCATTG 4800

GTAATGTGTG GAATAACTTG CACTGTTGCA CCTAAGTATT CCCCTTTACG TTCTTTACGT 4860

AAGACTTCTG AATAAATTTT CCCAGTGGTT ACGTTTGAGT ATTTATTTAA GTTAATATCA 4920

ATAAAACGTT CATAGTGACC TAAGTCCAAA TCTGTTTCGG CACCATCATC TGTGACAAAA 4980

ACCTCTCCAT GTTGGTAAGG ACTCATTGTC CCCGGATCCA CATTAATATA TGGATCGAAT 5040

TTTTGAATCG TTACTTTTAA TCCACGATTT TTTAATAATC GTCCTAAAGA TGCTGCGACA 5100

ATCCCTTTTC CGATAGAAGA AACCACGCCA CCTGTAACAA AGATGTATTT TGTCATAATA 5160

GTTAAAAACC CTTTCTTTGA TAATATTTTG TAGAGAGAAG TAAAACAACT TTGAGACTCT 5220

TACGTTTCTC CAAATAAAAA AGAAACAAAA ATAGAAAAGC TCCCTATCCA TTACGAATAG 5280

GGAGCTTGTA AGTTCGTTCA AACTTCTGAC CTTCTTAAAG GTGCCCAAGT AGTATAATAC 5340

AAGCAAACGG ACATCAGGTC AAGTACTTCT ATTCAAGAAA ATAATTTTTT CTATTTTCTC 5400

ATTTTTCTTG GACATTCAAC TAAAAAGCTT GACTCCTTCT TAGGAATCAA GCTTTTTGGT 5460

TTCTTACTCT TCAGAATATT CGTCATCAAA ATCGTCTTCA TCTTCAACAT CATCGTCATC 5520

AATGATCGTT AAATCTTCTT CGATTTGCGG AATGTCTTCT TCATCATCGT CTGAATCTGC 5580

ACCAATCTCT TGTAAGTCAG AGTTGTATGC TTTGATTTCT TCATCTTCAT CTTCTTCATC 5640

ATCGTACAAG ATGTCTTCTT CATCATCATT TGTTAATTCA GCATCTTCTG GATCGTCATC 5700

GTTATAGTCA ATTACATCTT CATCGTTTGG ATTAGTGATG AAGGCATTTA CTTTTTTACG 5760

TTTTCTACGA CGTGGTGCAT CTTCTTCGTC TTCTTCCAAG CCATGTGTAA CTTCTTCATC 5820

GATTGAATCG ATTGGATACC ATGAACGTAA GCCCCAACGA TTGTCTCCCA ATGAGATAAA 5880

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GCTACCGTCG 


ATGTTTAAAT 


CTGTATAAAA 


TTGTGCTAAT 


TGGTCACGAA 1 


TTTCACTATC 


5940 


AGACTTGCCA 


AGATAATTTT 


GGATTTGGTT 


AACTAAATCA 


GAAAAATCCA 


TTACATCTGC 


6000 


ATGTTGTTCT 


AAAATAGCGT 


GTGCTACTTC 


AATCATGGAT 


AATTCTTTTT 


TGTTTAACCC 


6060 


TTCAAATACA 


TTAATTTCCA 


AGAGGTACAC 


GTCCTTTCAC 


AGTCTACTCC 


TT AT CAT AC A 


6120 


AAAACATTGA 


AAAAAAATCA 


ATTCTATTCT 


GCAATATTTA 


TGCAGTAAAT 


TCTAATTCTA 


6180 


AATGATATTC 


TCCGATTCGC 


TCTGTTTGAG 


AGTATAAATC 


ATAATCAACT 


AAAATCTTAC 


6240 


CAGACATTGG 


TTGGTCTTTT 


AAACTAACAC 


GCAAATTATG 


TGTAAACGTA 


CTAATCTGTA 


6300 


GCAAGCCATA 


AGGGGTTCGA 


TAACTTGTAT 


CTAATTTTTC 


CTGATAGCCA 


AATCTTAAAC 


6360 


GCATCCGTAA 


TTCACCGGCA 


CGAATCAACT 


GAACATGCCC 


GTCTGGTTCA 


ATTTTAATGG 


6420 


TTACGGGGGT 


TGGCTCTGTA 


TCCTCTAATA 


ATTCTTCTTT 


GTAACGAATG 


TATAATGTAT 


6480 


CACCCATTT.T 


TACTAATTGA 


CCTTCTAAGT 


CAAAGAAAAA 


ATCTTTTTGT 


TCGTTGCCTT 


6540 


GTTGGACAAT 


CGTTTTTAGT 


TGAATCGATA 


CAGGAACGCC 


TGTTGATAAA 


TCCATAATGG 


6600 


TCACTCCTTT 


TCCAAAATAT 


AACTTTCTTA 


TTGTTGACTT 


TTTCAATGTA 


TTTGTCAACT 


6660 


AAAGTCAATT 


TTTCCTTGTT 


TATGTACCTT 


TTCTTATTGT 


ATAAGCTCTT 


TTCACGGGTT 


6720 


GCAAGCATTT 


GTtTCATCTA 


TTTAATAAGA 


AAACCGTTTA 


CTATTTTAGA 


GAATTCCTTC 


6780 


AGAATtCGkT 


TATATAACAt 


GCTTTTTTGC 


GTTTTTtGTA 


TATAATATAA 


GAGAGAGTAA 


'6840 


GAAAATAAAC 


GTATTAAATG 


ACGTAATTAT 


AAATAGACAG 


GACGTTTTTT 


ATGAGCACAA 


6900 


CTTTTCCCCT 


TCACACCGAA 


TCATTTATGC 


TCCTTGATTC 


AGGTTGTTTA 


ACAAACAGTG 


6960 


ACTATTTTTT 


ACCTTTTGCT 


TTGACCGATG 


TTTTAACTAC 


TTTTTCAGGA 


ATGCAGCACC 


7020 


AACCAATCAT 


TCATTTCTGG 


CAATTGGATC 


AAGCAATGAT 


TCTTGGCATG 


AAAGATACCC 


7080 


GTGTACCACA 


TCTAAaAGAA 


GGCATCGCTT 


CTCTGCAAgA AAACGATTAT 


TCTGTAGTCG 


7140 


TTCGCAATGC 


CGGCGGTTTA 


GGGGTGATTG 


CCGATTCGGG 


TATTTTAAAT 


GTTTCACTTA 


7200 


TTTTACCAAA 


TAATTCTGAA 


CACAAATTAA 


GTATCGATGC 


GGCTTATGAA 


TTAATGTGGG 


7260 


CTTGGCTCCG 


TCAGAGTTTT 


CCGGAGAAAG 


AGATTGACGC 


CTTTGAAATT 


ACTACATCCT 


7320 


ATTGTCCCGG 


TACTTATGAT 


TTAAGTATTG 


GCGGTCAAAA 


GTTTGCAGGG 


ATTGCCCAAC 


7380 


GACGCGTCAA 


AGACGGTATT 


TCTGTCATGA 


TTTACATCAG 


CGTTAACGGC 


AATCAACTTG 


7440 


CTCGTGGAGA 


AGTGGTTCGA 


GACTTTTACC 


TTGCTGGGTT 


ACAAGAACAA 


TTTGGTGAAA 


7500 


ATGGCTATCC 


ACCCGTTGAT 


CCAGCAGTGA 


TGGCAAATTT 


AGAAACCCTG 


ATTGAGACTC 


7560 


CATTAACAAT 


AGACGCCGTA 


AAAACCCGTT 


TAATCGAAGC 


ATTACCTCAA 


CAATTTGAAA 


7620 


AATCTATTGA 


TCCAAACTTA 


ACCGAACCAA 


TTATAACCTC 


AGAATGGTTC 


CAAACAAATT 


7680 


TAACTGTCCA 


GCTGGAAAAA 


ATGGCGCAAC 


GAAACGCCTT 


AATTAAAGGA 


GAGATCTTAT


7740 


GACGATTCCG 


TATAAAGAGC 


AACGATTACC 


AATCGAAAAA 


GTTTTTCGTG 


ATCCCGTCCA


7800 


TAATTATATT 


CATGTCCAAC 


ATCAAGTGAT 


TTTAGACCTT


ATTAATTCTG


CTGAAGTGCA


7860 
ACGATTACGT CGCATTAAGC AACTAGGTAC TTCTTCTTTT ACGTTTCACG GTGCCGAACA 7920

TAGTCGTTTT TCCCATTCTT TGGGTGTTTA CGAAATTACA CGTCGGATTT GTGAGATTTT 7980 

CCAACGTAAC TACTCTGTGG AACGATTAGG TGAAAACGGC TGGAATGATG ACGAGCGCCT 8040 

GATTACACTT TGCGCAGCTT TATTACATGA TGTCGGTCAT GGTCCTTATT CACATACATT 8100 

CGAGCATATT TTTGATACAA ACCATGAAGC AATCACGGTC CAAATCATCA CCTCACCAGA 8160 

AACGGAAGTC TATCAAATTT TAAATCGTGT TTCGGCAGAT TTTCCTGAAA AAGTCGCCAG 8220 

CGTTATTACT AAACAATACC CAAATCCTCA AGTGGTGCAA ATGATTTCTA GTCAAATTGA 8280 

TGCCGATCGC ATGGACTATC TGTTACGTGA TGCTTACTTT ACAGGGACGG AATACGGAAC 8340

CTTCGATTTA ACTAGAATCT TACGTGTGAT TCGACCATAC AAAGGAGGCA TTGCC 8395 
(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

AAACTATATC ACTGGAaTCC mAAAGGCGGT ACCAGCACTT GGTCATTGGA TAAGAAAATG 60 

CAAGGAAAAA CGAATTTACA TTTATATGAA TTAACAGATC AAGGGCGTAT TGACAAAGGC 120 

GCAATTGCCA CTACAAATAA CCAAGTGACC ATCCAAGCCG AGGCTAATAC ACCGTATGTC 180 

ATTGCTGAAC CTGACAGTAT TGAACCGATG ACATTTGGAA CAGGAACACC ATTTAAAGAT 240 

CCTGGATTTA ATGAAGCCAA TACCTTAAAA mATAACTGGA AAGTTTTCCG AGGTGATGGA 300 

GAGGTTAAAA AAGATGCCAA TGGTGATTAT GTCTTTAGTT CAGAAAAAGA AAGAACCGAA 360 

ATCAAACAAG ATATCAATCT TCCTAAACCA GGAAAATATA GTTTGTATCT AAACACAGAA 420 

ACACATGATC GTAAAGCCAC AGTAACTGTT AAAATTGGTG GTAAGaAATA TACGCGGACA 480

GTGAATAATT CGGTTGCCCA AAACTACATT CAGGCAGATA TTAACCATAC AAGCAGGAAA 540 

AATCCGCAGT ATATGCAAAA TATGCGAATT GATTTTGAAA TCCCAGATAA TGCCAAAAAA 600 

GGCTCGGTGA CATTAGCGGT TGATAAAGGC AATTCCGTTA CAAAATTTGA TGATTTACGA 660 

ATTGTTGAGC GTCAAACGGA TATCATGAAC CCAGACAAAC AAACAGTTAT TAAGCAAGAT 720 

TTTGAAGACA CACAAGCAGT TGGGTTATAT CCGTTTGTTA AAGGCTCAGC TGGTGGTGTA 780 

GAAGATCCAC GGATTCATTT ATCAGAAAGA AATGAACCTT ACACACAATA TGGTTGGAAT 840

GGAAACCTTG TTTCAGATGT ATTAGAAGGC AACTGGTCCT TGAAAGCCCA TAAACAAGGA 900 

GCAGGATTGA TGCTTCAAAC AATTCCGCAA AATATTAAAT TTGAACCGAA CAAGAAATAT 960

ACGGTCCAAT TTGaTTATCA AACTGATGGT GAAAATGTCT TTACTGCTGG GACCATTAAT 1020 

GGGGAGTTGA AAAATAACAA TGACTTTAAG CCAGTCGGTG AGTTAACTTC GACAGCAGCA 1080 
GATGGTCAAA


CCAAGCATTA 


TGAAGCAGAA 


ATAATTGGGG 


ATGCTTCAGG 


AAACACTACG 


1140 


TTTGGTATTT 


TTACAACAGG 


TGCCGATAAA 


GATTTCATTA 


TGGATAACTT 


TACGGTCACA 


1200 


GTGGAATCAA 


AAAAATAAAT 


TTAGTCTATC 


TAAGCAACTt 


GGTCTACAAG 


TTAAACGTAC 


1260 


TTGTAGCCCA 


AGTTGCTTCT 


TTTATGAAAA 


GAAGCAAGGT 


CTAGTGCCTC 


ATTTTTATAC 


1320 


TGCaGAATAA 


ACCGACACAT 


TTTTTAAAGT 


GTCAGGCTTC 


TTTGTGTATT 


TCATCAAGAA 


1380 


CGAATGGACA 


AACTTGCCTT 


TTGCCACAAA 


TCGTTTaCmA 


TTACmATGAT 


AGGCATAATT 


1440 


TATTTAGAGG 


TGAAGTGATT 


GAAGAAATAC 


AATGTTGATT 


GGAAyTACTG 


GGTTGTCCGC 


1500 


TTTTTATtTG 


TAATGGCGCT 


GATTGTTGGC 


TATTTATTAA 


TCACGAATTA 


TCAGCACTTT 


1560 


GTCCATAGTG 


TTTCGGGTTT 


ATTAGGCATC


TTATCCCCAT 


TCATTACTGG 


GTTTGTGATT 


1620 


GCTTATTTAT 


TGAGTGGAaG 


TCAGAAAAAA 


ATCGAGGGAT 


TACTAGAAAG 


AGTTCCTTTA 


1680 


CCAGTCGTTA 


AAAAAGCGAA 


ACACGGCTTG 


AGCGTACTCC 


TTTTGTACCT 


GATTATTTTG 


1740 


TTTATTTTTG 


TTTTGACATT 


AAACTATATC 


GTTCCGCTAC 


TCATTAGTAA 


TTTAGTAGAT 


1800 


TTAGCAAACT 


CTTTACCCAC 


TTTTTATGAT 


CACATGGTTC 


AGTTTGTAAT 


GAGTTTAGAA 


1860 


GATAAAGGGA 


TTTTAAAAAC 


AGCCGCGATT 


GAAAAATATT 


TAAATAGTGT 


CCTTAAAGAT 


1920 


TTGTCGCCAG 


AACGTTTTTT 


AAATcAATGG 


ACACAAGCGT 


TGTTTTCATT 


AGGAACGTTG 


1980 


ACTAAAAATG 


TTTCATCATT 


CTTTTTAAAT 


GCATTTTTGA 


CGTTGATTAT 


CTCAATATAT 


2040 


GCGCTGGTAT 


TTAAGCAATC 


TATTTTGACA 


TTTGTTGAAA 


AGGCTGCCCA 


CAAATTGCTG 


2100 


TCTGAAAAAG 


TGTACAAGCA 


AACACAAACT 


TGGTTAAATA 


CAACGAATAA 


AATTTTCTAT 


2160 


AAATTTATCA 


GCTGCCAATT 


TCTTGATGCT 


TGTATAATTG 


GCGTTTCATC 


AACGATTTTG 


2220 


TTAAGTATTT 


TGAATGTGAA 


ATTTGCTGTG 


ACTTTGGGGA 


TTTTATTAGG 


TATTTGTAAT 


2280 


ATGATTCCTT 


ATTTTGGTTC 


GATTTTTGCG 


TCAATTGTTG 


CTGGTGTGAT 


TACGCTTTTC 


2340 


ACAGGTGGTG 


TTACTCAAGC 


AATTACGGTG 


TTACTAGTTT 


TATTAATTTT 


ACAACAAATT 


2400 


GACGGCAATA 


TTATTGGTCC 


TCGAATTATG 


GGAGATGCCC 


TAAATGTCAA 


TCCGATTTTA 


2460 


ATTATTGTTT 


CAATCACGAT 


TGGGGGAGCG 


TACTTCGGCG 


TCTTAGGGAT 


GTTTTTAGCA 


2520 


GTTCCTGTTG 


CTGCAATCAT 


tAAAATTATT 


GTCTCGGAGT 


GGCTAAATGA 


ATCAAAAGAA 


2580 


AATGATAAGA 


TTGTAGATTC 


TATTGAATCC 


TAAAGTAAAA 


CTTTTCGTCG 


CGCGTTGACG 


2640 


AAAAGTTTTT 


TAGTGGAAAA 


GAAAGTTTAA 


TTATTTGGAA 


TTTTTGaTAT 


GCTATACTTA 


2700 


AAGTAAATAA 


AAAATCAAAA 


TTGAGGTGAA 


TAGATGCAGT 


TTTTAAAAAA 


ATATGGCGGC 


2760 


TATTTTTTAG 


TTTTAGGGGT 


CTTACTTGAT 


TTTTTTACAC 


CTTATTATGT 


CGGATTCAAA 


2820 


GATCAAGGTT 


ACAATCAATT 


GACAGAAGTA 


ATTAGCTTAT 


TAGGTGATGT 


GAATAGTCCA 


2880 


GTTAGGGAAA 


ACTTTAATCG


GTTAACTATC 


ATTGCGGGAA 


TGTTGATGTT 


AGCAAGTTTA 


2940 


CCTAGGATAT 


ATGCTATTTT 


TTCCAGAAAA 


ACGAAAAAAG 


GTGCTTGGTT 


AGTAGTGGCA 


3000 


ATGATTGGCG 


CTTATGGTTT 


ATTTGATTGT 


ATTTTCAGTG 


GGCTCTTCAG 


CGTAGACACT


3060 
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T P A A P PP PTP 
1 LnnotuL 1 O 


PP AP&P.TTP.P 
obntnv Hut 


TP.PPHPP.PTT 


CATAACGGAG GATCAGCAGT


CGGTTATACA 


3120 


ubt 1 x 1 1 lu 1 


TP.TTATPAP.P, 

lUl 1 n 1 t rtVJU 


PHTPTTAAPP 

tVj 1 t 1 1 nntt 


ATTATCTACA GTAAATACGG 


TTCGCAAAAA 


3180 


a zit a a aanpp 

nn J. nnnnntt 


1 1 1 1 i OUVv 1 1 


T"T T Pi TT 1 T A T T 


TTATGTATGC 


TTGCAGCCGG 


CTTATATGGC 


3240 


T T Zi P P A 21 P. P. Zi 




GCAACAGGTA


AAACCATTTA 


ATTATTTAGG 


CTTATGGCAA 


3300 


AG Ab I b Ab J. i 


P^sTTTTPTAA 
Lai 1 1 1 O 1 nri 




ATGCTGGCAC 


TGTGtTTACA 


AACTAAAACC 


3360 


Trv t\ m /"* T\ /* ' 7\ 7\ AT* 

AAT G AC AAA 1 


TTPZiTTP-A AP 
1 1 oft 1 1 onhL 


ATTTP.TPATT 


GGTTTTTTAC 


TAGTTAAAAT 


TGTCTGGTAA 


3420 


ct> TV TV rp- rn 7A 7\ rn 1^ 

TAACTTAAi 1 


ZiTTPPPTAPf* 


P,TP,ArTTCAC 


CAGTAACAGT 


GGCACCATCT 


TTTTCTAAGT 


3480 


C AGC AGAA 1 b 


tttptzit+* PT 
1 1 1 b lnl T-t 1 


TP.TTT A TP A A 
lUl 1 1 rt 1 tnn 


AAGTAATATT 


GCCATTAACT 


TTGGTACCAT 


3540 


m rrt fi tv TV /""* rn ti TV TV 

TTAAAGTAAA 


/"•/t^atttppt 
bbbAl 1 1 bb 1 


T TP A PPTP A A 
1 1 LrVLL 1 Lnn 


TATCACCTTT 


GACAGTCCCG 


TGAACGATGT 


3600 


m/**«T\ tv T\ mfpmfnr« 

TGAAATT1 It 


Ar'AApAA.nr'P 

Ab AAbAAAL b 


ATTAP.PTT AP, 
/Ail nut 1 1 r\\j 


GAACCGTGAT 


TTCATATTCA 


GCAGTTACTT 


3660 


TTTTATTATC 


ATCTTbbbAA 


'PA A ZiPTPPTA 
1 HAH Li 1 IjL 1 n 


ATTTACGATA 


GACATCGTTG 


GAATCTTTCC 


3720 


/-^ rn m tv rrt 7v rp /"* 

CCTTATCATG 


bAAAb 1 Abb A 


PPA APTP.TTA 
tat HHu 1 Ij 1 J. n 


CCTCTTTATC 


AAAAGTGACA 


TTGTCAGTAG 


3780 


CTGCGACAAT 


bbAAl 1 ibb 1 


TT ATP APTT A 
1 1 n 1 LnL 1 In 


ACGCCATTtC 


GAGCTCTTCA 


GGCTTGGCAC 


3840 


TAATTGAAGC 


ACCAGAGAb i 


A PTTPTPT AP 
At 1 1 b lKj i nU 


TTGAATTAGC 


TGTTTCTTTG 


CTACTAGAAC 


3900 


m tv ^ m tv rn z™ 1 rn z" 1 m 

TACTATCTGT 


TTTTTTATCA 


PA TP A APA AP. 


CACTAAGCAA 


CACAAGAGCA 


GCAGTTGCCG 


3960 


m /"» Ti Tv tv tv Tv rp 

TCAAAACAGT 


GGTbAbl Ab i 


TTZiPQPATTT 
1 Inwitnl 1 1 


TCATGAAAAT 


CCCTCCAAAA 


AATTTATTTA 


4020 


TTTCTTCCTT 


rp t\ rri tv rp tv ti pm 

1 AlbAl AAbl 


P ATTPA A AP.P, 
on 1 1 Lrvririou 


GAAATGAGTA 


ATCATACACT 


CAAAGATTGA 


4080 


GAAGTTl bbb 


1 AA1 bbuHHn 


TT ZiPPPTT AT 


TCTCTCAAGA 


AGGTTCAGGA 


AATATTCAAA


4140 


z" 1 rp 7\ 7\ rr* /*^* tv rn rp rp 

GTAA I bA III 


Li 1 1 1 1 LAHH 


TPTTPPTTTT 
lull tv 1 1 1 X 


CTGTAGTAAG 


CTAGTAATAG 


ACAGAATTTA 


4200 


AGGAGGAbA I 


ill bAlbbb 1 


TPPAATATP.A 


ATGATTATCC 


TACATCAATG 


AAAAATTTAG 


4260 


CACC1 1 IbAI 


aippp A 7\ A A A A 


P.PP AT AfZ AT A 


TTGGCAATGC 


GTTGCTTGCG 


GATGGCTACC 


4320 


z-' rn A rp/— ' Tv riipp 

CTGATGAIbb 


f~*C*7\ T»T PP Zi 

bbbbAl IbbA 


ATTP.PP.ATT A 


GTCAAGCGGA 


GAAATGGTAT 


CAAGAGGCAA 


4380 


GTbCAbLAbA 


P A A A A R PPP Zi 

b AAAAALj uLH 


TTTP.A APAAf^ 
ill onntnno 


AAGCAAATCC 


TACGAAACAA 


GATTCGCATA 


4440


AAL Abb A i AA 


APATPPPPPZi 

AbAI bbbbbA 


AAATTATTAA 
nnn 1 1 n 1 1 nn 


CGGCGGCTGT 


CAACGTTwmA 


TmTAAAGACG 


4500 


ppp zi airrTT 


PPTP ATTTP A 


P,APaP,PnPTP, 


AAAAAGCAaG 


TAATACATTT 


CCCCACAAAC 


4560 


mAPAPPPPAT 


T P Zi Zi P (2 Zi C P P 

1 UnnLUnOtL 


Pa AHAAATTG 


CTCGGAATAA 


GCAAACAAAG 


CTGAAAATCT 


4620 


ATiiAP.PAAP.zi 


TP.P.T APP,TT A 
1 VjLj 1 nto 1 in 


TAAGAMCCA 


AAGAGTATAC 


TGAATAATTT 


TTCGAAACCT 


4680 


TP ZiT A TTPPP. 


TTP.P A ATTP A 
1 IvjOhnl 1 tn 


AGAAAPGGTT 


ATCAAGGTTT 


TTTTGCGAGG 


GAGAAATGGA 


4740 


CAAGTAAGAA


AAAATAACAA 


AAGTTAAGAA 


TGAGAATAAG 


ACTTGCTTTT 


CTAGTTTTTG 




GTGGGTACAA 


TTACACTAAT 


TAAGATAAAA 


AATCAAAATA 


ATTGAGGTGC 


TACGACAATG 


4860 


GAAAAAGCCA 


ATCTTGATTG


GAATAATTTA 


GGATTTTCTT 


ACATTAAAAC 


GCCCTTTCGT 


4920 


TATATTAGTT 


ATTGGCGAGA 


TGGCAAGTGG 


GAAGAAGGCA 


CGCTAACAGA 


TAACAATCAG


4980 


TTAACGATTA 


GTGAAGGTTC 


GCCTGCTCTA 


CATTATGGCC


AACAATGTTT 


TGAAGGATTA 


5040 
AAAGCGTATC AATGTGCAGA TGGTTCTGTC AATTTGTTTC GTCCAGATGA AAATGCCAAA 5100

CGGCTACAAA AAAGTTGTGC GCGTCTACTG ATGCCACAAG TGCCAGTAGA AACCTTTGTT 5160

TCTGCATGCC AAGAAGTGGT GAAAGCCAAC CTTGCTTATT TACCGCCATA TGGAACGGGC 5220 

GGTACGTTGT ATCTTCGCCC TTATATGATT GGTGTCGGCG ATAATATTGG CGTAnACCTG 5280 

CCAAAGAATA CATTTTTTCC AATGATTGCG TGCCGGnTGG TTCGATTTTA AAAATGGGTT 5340

AGCACCAACC AACTTTATCG 5360 
(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 



TGGCCATGCT 


TATCTATTAA 


TTCCCGCTGT 


GTTTATCGGC 


ATTTGCCTTT 


ACTTTATTTT 


60 


TAACTCACGT 


ACGATGCAAC 


GAAAGAAGGC 


AAAAGTGTCA 


CGGCTAAATT 


GTCGGCGGGC 


120 


TTAGGGCTGA 


ACTTATTAAT 


TGCTTTACCA 


GCCTTTCTAT 


ATTTATTATC


aGTTGGCACA 


180 


GCGCAAATAC 


CTTATGTGCT 


TTTTCTGTTC 


TTACTTTTTC 


TATTGTTAAT 


GGATCTTTTG 


240 


CTGACCTTTC 


TATTTGCTGC 


TTACGTCTTG 


TATTCATGGA 


TGTACCAAAT 


GATTCCTTTA 


300 


AAAAAAGCGG 


TTGATTATAT 


CATTGTTTTA 


GGTTCGGGAA 


TTCGTAGCGA 


GGAAGTACCT 


360 


CCACTTTTGA 


AGAGTCGGTT 


AGATAAAGGG 


ATTGAATATT 


ATGAAAAAAA 


TCCCACTGCT 


420 


AAATTTGTAG 


TCAGCGGTGG 


TCAAGGTCCT 


GACGAACCTG 


TGGCCGAAGC 


TTTCGCAATG 


480 


AAAAAATATT 


tGCTTTCACA 


AAATATTCCT 


GCAGAAGCAA 


TTTTGATGGA 


GGACCAATCC 


540 


ACAACCACTT 


ATGAAAATAT 


GgTtGTTTTC 


TAAAGCAATA 


ATTCAGGCAG 


ATTGGCAAAA 


600 


GATGCCgtCt 


GAtTCTAAAc 


AACCCTCTGT 


tATTTTTtCA 


ACAAATAACT 


ATCATGTCTT 


660 


ACGAGGAGCT


ATGTATGCCC 


aTCGTGTCGG 


CTTAAAAGCT 


GAAGGTGTCG 


GGGCGCCAAC 


720 


TGCCTTATAT 


TTTTTACCAA 


CTGCTCTAAT 


CCGAGAATAC 


ATTGCCTTAC 


TGGTTCATGA 


780 


TrAGCGAATT 


GTGCTTTTTG 


TTTTTCTACT 


TGTCACTCTT 


CTTTTAGGAA 


TCAGTATCTT 


840 


ACCCATCTAA 


AAAAACArcG 


ACtCACATTT 


TTGTGAGTCG 


CTGTTTTTAT 


TCTTTTAATA 


900 


TAAAAAGGTT 


TAATGAACGT 


AATTGGCTAT 


CAATGCAATC 


AGCCATAAAT 


GGAGCGGATA 


960 


GAATCCGTAA 


AATAAATATT 


TGAAAAATGG 


TGCTTTGCTC 


CCTCTCTCAC 


CATTGTAAAG 


1020 


CGATATAAAG 


GGTAACACAG 


TGATAAACAT 


AAAATCAGAA 


TTATAAGCTA 


ACATTTCAAT 


1080 


TGTCGTTGGC 


CAATCTCCCA 


ACCATTGAAA 


ACTTGTAACT 


AAGAAAAAGA 


GCGCTAATGC 


1140 


ACCATATAAA 


CAGTTCCGTA 


AGACAAGTCT 


CTTTCTGGcT 


AAATACGTAA 


tCAACATAAA 


1200 


AGGTAACATC 


ACAATGCCAC 


CTTCAGCAAA 


CATTGCTCCT 


ATTCCTAAAA 


TTGCTAGTAT 


1260 


ACTAATCAAT 


AAAACAATTT 


TTAAACTGAT 


TTTTGGcATC 


TCTAGCATCG 


CTTTTGTCAC 


1320 
AATTAmCATA CTGACACCAA GGGCTAAGGT GAAGAAAATA TTGTTGTGAA CAGCGACAGy 1380

GGaTTATTCA CTAAATGATT TAGAAGCGTA TTTCCAACAA ACATGATTGC tGCCCAAATA 1440

TACAAACGAC CATTGTAACG aTAAACATTC CGTGTATAGT TGAAaCCTTC TACAGCCATG 1500

TAACCmAAAA ACACACCGAT ACtnACGAGTA ATTACATGAA AAATCAGTGC CCATTCTGGG 1560

CGGAACAAAA TAACTAATAT GGGCCAGGAC CATTAACCCC ATCATCAATA ACTTTAAGCG 1620

ATTTGCGTCC AGGAATCGCC CCTT 1644
(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

nCCnCGTcAC TArGAAtCAG CTACgTCACG AAGGGAAAGT aCCTGCCATT GTTTACGGTT 60 

ACCAAATTGA AAGTACACCA ATCTATTTTG AAGAAAAAGA CTTATCAAAA ATTTTACGCG 120 

AACATGGTGC TAACACCGTT ATTAAAATGA CAGTTGATGG CAAAAACATC AACACATTAA 180

TGTCGAAAGC ACAGTTACAT ACCTTTACTG GTCAAATGCT TCATGTTGAA TTCTTATCTG 240

TAAATATGAA AGAAACAACA GAAGTCGAAG CAGAAGTTCA ATTAATTGGT GAATCTGCAG 300

GTGTGAAAGC TGGGGGTACC TTAGCTCAAA ACTTGTACAC TGTTTTAGTG GCAGCCACAC 360

CAGACAAATT GCCAGAATCA ATTGAAGTAG ATATTACAAA CTTAGAAATT GGTGATGCTC 420

TTACAATCGC TGATTTACCT GAACACAAAG ACTATGAAAT TTTAACTGAT CCAGAAGAAC 480

AACTAGTTGC TATTGTTGAA GCACAAACTG CTCCTGAAGA AGAAGAAGGA ACAGCTGCTG 540

AAACGACTGA ACCAGAATTA GCCGAATAAA GAGTACACTT TTGTTCAGCT GATTCTCTTA 600

AGTTTTAGAA ATGGGATGTA CGTTGAAACA ACGTACGTCC CATTTTCTTT TTACTGCATG 660

TCGTGATTTC TTCACATAAA TGATGACACC TGCAAAGCTA GGAAGCTACA GAGGAATATT 720

AAATTTTTGC TAATCTACGC AATATTTTCA AACGAAACAA TTTCTTTTTT TTAAATTTGC 780

AGTATACTAA AAGTAAGTAA ACAAAAAGGA GGTCATCCAT ATGTTTGAAC GTTTTGATAG 840

TGACCGTAGC CGCTATGCTT CATTAGGTGT TGTTTCTAGC TTACCTAGTG GCTTAATTGA 900

TAGTATTTGG TTGATTATTG ACTTAAATCT AAAAGGGGTT ATCCCTTTAA ACGACTTACT 960

TCACTTTGAT TTGTTGAACA ATAATGGCAA AGTGACTGTC CATTTTTCTC AAGAAAATAG 1020

TTCAGTAGAA ATGGCCATCG ACTTACCATT TAGTTATTCT ACGGCATATC CGTCGCGAAT 1080

TTTTGCTTTT GATGATGGCC ATCGTGAAAC GATTTTATTA CCTGCAGAAA TGTTAGAATC 1140

TTAAATAATT TCAACAAAGT AGCCTCAAAT TCCGATTAAA CTGGAATTTG AGGCTACTTT 1200

AATTTCTTTT AGTTATTCTA CAATAATTTG TTTACTACTT CCTGCAGCAT TTTCAAAATT 1260

AATAGTAAAG


GTTTCTGTTG 


CATCATCATA 


AGTGATTCCb 


IbALl 1 rbAC 


1 AATTAb 1 b I 


1320


GCTGTAATCT 


TTTGGTATTT 


TTAAAGTCAG 


TGTCTTAbl 1 


bll lb Iblbb 


b Al L AbAr AA 


1380


AGTATACGTT 


TTTTGGTGTT 


GTTGTGTCTT 


TCTCATTAAA 


GAAATAGCTT 


TATTTGAGGT 


1440


AATCCCTGCT 


ATTGTACCGC 


CAGTTTCTGA 


CCAAATGTTA 


ACTCCAAGAT 


AGCCTTCTTC 


1500


TTCTTGCTTA 


ACTGCTTGAA 


TTTCTTCTGT 


ATTTGAGAGA


ACTTCTACTG 


TTTTTTTGGC 


1560 


AGCATATGTT 


TTTAAATCAG 


TTTCATCGAT 


TCCTGGTAAA 


ACTACATAAG 


CATAATGCTC 


1620 


GTTCGTTGGG 


TGTTGACCAT 


GTTGAATCGT 


AAATGTTCGA 


TAATCACCTA 


CATATTCTTT 


1680


ATCACTTGGA 


AATGCTTCAT 


TAATTGCTTT 


ATAGGTTCCT 


TTTCTAGTTT 


CACTTTTAAL 


1740


ATTTACCGTT 


GTTTCTTCTG 


GAAAATAATA 


ACCCATATTA 


GCATTGGAAT 


G ATCCGAC I 1 


1800


TAGTAATAAC 


CATTGTTTTT 


TACTCGTTTC 


ATTTTTCTCA 


TGAATCTCCC 


CAATATTTGA 




TAGCACTTGA 


TACTGATACG 


CATCATTTAA 


CAAACGATTA 


TCAACAACTG 


TCTCAATCGA 




AGCTTCTGTA 


TCCCCTTTAA 


TTCCTGCGCC 


TAATGCGATA 


ATTTGCCCAT 


TTAATACAAA 


1980


CCAAGATTTT 


TTAGCTTGTA 


GGTTCATAGG 


TAAAAGTTTC 


CCATTATTTT 


TGGTCCCGTC 


2040 


TTTATTAAGA 


GCCATCCCTA 


CTACTGCCTG 


ATTGTCTGAC 


GTTACTCCAC 


CAACCCATTG 


2100 


CTCTTTCGAT 


GTGATTGTTG 


TAAATGCTGA 


AACTTCATCT 


GCCAAAGAAA 


TGGTATCTAC 


2160 


CGTTGTTCCT 


GGTAGTCGAT 


AGGGATCTAC 


AGTTGGCCAA 


TAGCTGGAAT 


TAAATTGAAC 


2220


CTCGTCATCA 


TTATATACGT 


ACATCATTCC 


GTCACCAGTG 


TGCCAGCCAC 


GTTTATTTTC 


2280


TTTATTCCCT 


GCTTCAAAAG 


ATGCAGTGCG 


CTTTGAATAC 


AAGCCTAGCC 


CAAACATATA 


2340


ACTAGGTGTG 


CGTTGGACAA 


ATCGATCCAT 


CGAAGCATAA 


AGTTTCGTGC 


CTGTAAAAbb 




AAGTTGCCCT 


CCCGTAATTT 


CTGGATTAGT 


TAAAAGTTGC 


ATAGTCATCT 


m t\ t\ tv m /"* t\ rp rp 

GTAAA1CAI i 




AAAATCACGA 


GCATTTGTCA 


AATAATAATC 


TGGGTTCTCC 


TTCATCCAAT 


tv m m rrl T\ 7\ /^"» 

ATTTAACbbb 




TTCTTGAAAT 


TTCTTTTGAT 


AATTATTAGG 


GGCAAATTTA 


GCCACAATTA 


ATAAAI IAIA 


2580


CATGGTAGTT 


GAGCCGTAGC 


CTGTTTTTCC 


AACTGCTGGG 


GCTCTTGAGA 


TGGAACbbbb 


2640


ATTCACGGTT 


GGTAGCATTT 


CACCTTTATA 


AATCAAAGGA 


AGAAAGGCAC 


/—• t\ ("p/-< T\ TV /"* T\ rn rp 

GATCAACATI 




TTCAACAAAT 


TCTGTCACAA 


GAGTTGCATC 


CATTTGAAAA 


GATGAATCTG 


CTG1 AAI bbb 


2760


CAAAATTTGT 


CCAACCCCTT 


TCACCAAAAC 


ATTGCCATAC 


GAACCTGTAT 


AAbbAA 1 b 1 1 




ATTGTGCTGG 


ATAAAAGAAC 


CATCTTGGTA 


AAAACCATCA 


CCTTTTGTGA 


C I AAI 1 I AAA 




TACATCGACG 


ATACTACTGG 


ATGCTTGGTT 


AATTTTCCCG 


CTATCCTTTT 


V"»IT) T\ T\ TV TV rp rp 

GTAAAAI ibb 




TAACCCCAAA 


ACAGTTAAAG 


CTAAGTCCGT 


TCGGTTCGCT 


CCTGATGTCA 


CAAAATTTGG 


3000 


GATAAAAGCT 


AAATCCACAA 


ACGTTCCTTG 


TGGTTTTGTA 


TACAACTGtT 


TAAACGGATC 


3060 


TGGGACATAA CTGCTCAGCG


CATTTGTATA 


GATTTGTTGC 


TTTTCTGGAG 


AAACTTTATC 


3120 


CTCTAAAATC 


ATTAGAATAT 


TAAGAAACTT 


CTGTGGCACA 


CCAATTTGCC 


AGTCCCACCA 


3180 


ATTTCCATGG 


TACTTTTTCC 


CATCATATCC 


TTTTTGCGTT 


ACCATAAAAT 


CTAAGCCATC 


3240 



AATAATGGTT GCTGCCAATT TGTCATCTTG GTAAAGCGTA CTCCCCTTCG TTCCATAGGn 3300

TAACGCTAAT TTTTGCAATT TGGTAAATTG AGTCGTTAAA TCAGCCGATG GCGTATTGCC 3360

AGGCTCTAGT GGCCATAAAT AGCTGCGGTT CGGCTCTTTA TTCATCGTTT GATCGAGTTC 3420

TGTTGCTTCA TTGGAGATTT TTTGGACGTA GTCTACAAGC GCTTGGTTAG TTGAATCATA 3480

TTCTTTTGAA ACAAGCTGAT TTCGCCATTT tTCTTTTAAT TCTCGGTATT CTGCAGATGA 3540

ATCGATAACA GGAATGCTAC CAACGGGGCC ACTTTTCATA GAAATGATAT TCAGCTTTGA 3600

TAGATTACCA AATAGTCTAA TTCCTACAGC ACCCATCCAG TCATCATTAA TGGACGTACC 3660

ATCAGGATAA TGAACCTCTT CATTTTCATA AAATAATTGA TCATTCAGGA ATGTTTGAAT 3720

TTTTTTCCCA TCATAGCGAA CTAGCAGTTT ATATTGTTGA CCACTTAGTA ATGTTGGTCC 3780

CGGAATATTA GTTAGCCATT TACCGCCTGG TTGTCCTAAC TGCCAGCGAC CGTCACGATT 3840

ATAAGCAAAA GATTGCCATT GACTCGTCTT TTGTTTATCT CCGCGAAAAA CCAATCCGAA 3900

GTTACTTTGC CCTTCATATA AAAAGGTTAG TTCAAGATCT CCTGAAGATT GTTCGCGTGC 3960

ATCAAGATTG ATAGAGACAG ATTCGAGGTT TGTCCCTTGC TTCGTGTTTG CGATCCATAA 4020

ACCAGCAGAA TCATTCTTTT TTTCTGCTGT TCCTACCAAA TCTTGCCAAT GCCCAACTTG 4080

ATTGGGGAAA TCTGAAATGT AAGAGGGTTG CTCAATCGTT TGGCTCGCTT CGTTTTCTTC 4140

TGCAAAAGTA AGGGTGGTTC CCATAAATCC ATTCGTCAAA GCAGTAAACA GTCCGACCAG 4200

CATTACCCAT ATTTTCCTGT TTTTCATTCT TTCTCCTCCT AAAAAATTAT CGTAAAACGC 4260

TTACATCCGA ATTAGAGAAA AAGAGTGAAG AAAAATACTT TCTACACTCT CCGACTCTCT 4320

GAAAATCACT TAATATACTT TTTACGCCAA AATACCAAAG GCACTCAAAA CAATCGATAA 4380

AATTAAAATA ATCCAAATCA CTCGTGTGGA ATTTAGTTTC TTTTGACCTA ACATCCAATA 4440

AGTCAAAAGA ACAAGTAAGA TTGGAACTAA GGAAGGCAAA ATCGCATCTA ACGTATCTTG 4500

AATAACTAAT TCGACCCCGT TCTTTTTATA AACAAAAGGA ACTGTCGCCT TGATAACTGA 4560

AGGAATTAAG GCACCAATCA CAGTcACACC AAGAACAGTT GCTGCATTCG TAAAGGCAGA 4620

TAACTTGTCT TTTAAGGTCG TCACTAGTTT GACCCCTTGC TTATAACCAA GTGGTAAGAG 4680

CGCGGCCCGA CCAAACAATA GTGCAATATT TGCAATAATC CAAATAACAC AGCCAACCAC 4740

CGAACCATTT TGAGCCAGTG TCCCTGCAAC AGAGCCGAAA ATCGTCCCCC AAATAACGTG 4800

GAATAAAGAG TCCCCAACGC CTGcCAAAGA ACCCATCAGA GCTGTTTTTA GCGCAATAAT 4860

TGTTTCTTTC GCTTTATAGC CATCTTGCTC TTCAATCGCA ACATCAATCC CCATAATTAA 4920

ATTCCCAAAA ATAGCATTGG TGTTAAAAAA TTGATTGTGT GTGCGCATCA TATCTTGAAA 4980

TTGTGCTTCA TCATGACCAT AGAATTTTTT TAAAACAGGT AAGATGCCGT ATAAATATCC 5040

TGAACTCATC ATACGTTCAT AGTTCCAGTT TAATTGGCTT CCTAAAATAT AACGCCAGCT 5100

AATTGCATTT AATTCTTTTT TTGTCAGTTT ATGCTTACTC ATCGCCGTCA TATCCTCCTT 5160

CAAAATCACC ATTTCCTGTT GTTGCTTGTG TCGTGCTTTT TGCCCCATCA CGTTTAAATA 5220

CAAGCATCGC TGCTACCATC CCAACTATCG AAACACCTAA CATTGGAaCT TTTAAATAAG 5280

CCGCTAAGAA AAAGCCTAAT AGTAAATAGG GGATATATTG CTTCGTTGGC AAGTAACGTA 5340

ATAAAATCCC TACACCGACA ACAGGTAATA CTCCTCCAGC TGTTTTTAAG CCCGTCATTA 5400

GCCATTCTGG CATGTTATCG GTAATCGTTG TTACTACAGC ATCCCCCGCT AATAACATTA 5460

ATAAAATCGG AATGGCACGA GACAGTCCCC ATAGAACAGA ACCAGATAAA ACTAAACGTG 5520 

GAATCCGTTT TACTTGCATC TTATCAATCG CTTTATCGAC ACTATGAAGT AAAAAGACAT 5580 

TACAAAAACG GGCGACTACA TCTAGTTGCA ACATCAACAG AGmAACAGGA ACAGCTAAAC 5640 

CAATGCCATA CTCCGCTCCT TTTCCAGAGA TTACAGCGAA AGCGGTTCCT AAAACAGCCC 5700 

CTGTGAAATA ATCAGGTACC GATGCTCCAC CAAAAGCAGC AATTCCTAAA CGCATTAATT 5760

GAAGAGTGCC GCCAACCATT AAACCAGTTT TTAAATCTCC CATAATACAT TCCAGAAATC 5820 

ATTCCAGCAA TAGCTGGTTG CGTCAACGTA TTTGAAATAA GTGGTCATTA ATAGCnATAA 5880 

ACGCTTAATA GTAATTAAGA TAATTTGATA CGGCTGCTAA ATGCTT 5926 
(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

GGATATaGAA aCCCAACGGT TnCAAACAGC AAAAAAAATG GCaAAAATAT TACAAGTTCC 60 

GTGGACAATT TTTTTTGATG AAGAAGTACT CGAAACGTAT GATTTTTGAT aGGrGGATAA 120 

AAtGACaACC AAAGAAAAAA TTGAATTTAT CAAACAAGTA ACACyTCATT CAGATTCAGA 180

AGTAGAGAAA ATTATTAAAG GAATGAGCGA TACCAGTATT AACCGCTGGT ATGAAATAGA 240

GAAATATCGC ATTGATCAAG AATTAGAAGA AGCGGTGTTA ACTATCTATT GTTAAAAGAA 300 

TACACCTATA TCAGTCGGAT AGAAATAACA AATTATACGA AATGAGGTAA AAATGACGAA 360 

ATCTATAAAA CTAAACAAAC AATTAAGAAA AGCAGCTTTT AAAAAAGGAT TAAGCCAAAA 420 

ACAAGTGGCC CAATTAGTAT ATTTTGCACA TACTACAACA AATGGTCACT TTAATGGCTA 480 

CCCAGTACCG CCAGAAAGTG CTATAGCCTA TAACGAATTA TTTAACGATA GTGAGTTGGC 540 

ATTTGCATTA GGCCAAGAGT TATTGGGCCT AATAGGGCTA GCAACTGGAG TGAAAGTAAA 600 

AAAAGAGCCA TTAGCGCTTT CGGTACTAAA GGAAAAAGAG GAACGAGAAA GAGAAAAAAT 660 

CGAAGTGGAA AACGAAATCG ATTATTTAAT GGCGATTCCA GTTGAAGAAT TGACCGAAAA 720 

ACAAAAACAG GCAATATTAC AGTATTGTAG CGAATATTCA GATGAATTGT TGTTTGAAGT 780 

TTCTCTTATT TGTAAGCAAT TAGAACTAAT AGGGATGTCC TTTATGGATT TAATGATATT 840

AAGAACACCT TATTGGAAAA ATAAAGAATG GATTGaATAA AGGAGGGAAG TAATTTGGAG 900 

TCAAACATTA GTATGCAAGA TGTGTCTAAG AGACTTTCTA GGAATCTATA TATGATGATG 960 
GGAACTCCTA GAGAGGATAC CAAAAAAATT TGGAAGGTAT CTGAGCTTTC AAAAGCAAGC 1020

GGAGTTACAC CATGTGTTaT TAGTCGGATT AAAAATGACA CTGAAGGAAA AGAAAAACCG 1080 

ACTATTGAAA CAGTTGTTAA GCTTGCTAAA gCATTAAATG TTGATCCTGC AGAATTATTG 1140 

AAATAAGAGG TGAAAATAAA TGGAAAAAGA AAATATCCAA GAAAATTATC ACGAATCATT 1200 

AAAAGAGCGG AAAATCAAAG ACACTTTACA AGAAATTAAA GAGAGATTAA ATGGCGTAAA 1260 

ACTAACCTAA AATGAAAGGA AGTTArAAAT GAATAAGAGG AAACCACGTa GTTTATATGA 1320 

AGCAAGAATT TTAGGGACTT TGTTAGTATT CTTCATACTT GGTCTAGTAA TAAAGAACAG 1380 

TATTCCGTTT AACTACTTAC TTTTAATACT TGGACCTATT ATTGCAATTT GGTTTATGAA 1440

ATATGACGAT GCTAAaTATC rAGAAAaTTT AAATAAAAAT GAAAAGGAAG TGTATTAATT 1500 

GAATAGAAAA ATCAGAAGTC TAATAAAAGA ATTAACAGAG GAATGTGACA AAGAAAAAGT 1560 

TTCTCTTATT TGTACAGCTA ATAACCAAGG TGAGACAGTT AGCGCAATTT GTGGCGGATT 1620 

AGTAGATCTA TCATTTTGTT TAGGAGTCCA AGAAAAAAAG CTAAGTGAAA AATTGCCGAT 1680 

ACATCCAGAA ATTTTGCGCA AATCAGCAGT TGAAGCATTG GAAGAAGTTA AATCCGATAA 1740 

TCATAAACAT ACTTTTGTAA TCGAAAATGC AGAAGATTTA CAAGATATAT TAAACAGAAT 1800 

TGCTTCAGGT GAGTTTGATG AATAAAAAAA TGCCCAATCG TTTGCAGACG ATTTAGGCAC 1860 

TAACAAAAAT ATACTAGGAA AATTATAACA TAAATTTGGA GGGAGTTAAA TGCAAAAAAA 1920 

CAACAATAAG CTAACCATGT CAGCCCGATT CGGTGAATGG TACAAAGAGG CGACTGAAAA 1980 

TTGTGCAGAA CGGGAATTAT ACGCAACGTC GTTAATTGCA CGTATGGATT GGGTAGTTGA 2040 

TCCAATAAGT AGTAAAAGTT ATAGATATGA TTTAACAACA GATCTGAATA AAAATCATAG 2100 

CAGCGCTTGG TATGATGTTG CCACTGAAAT TTGTAACAAA CGCAAAGAAA CAGTTATACG 2160 
AGCAATCCTA GAAGACAATA TCAAAATTGA AAATTTCTAA TAAATGGAAG AGGAGGATTT 2220 
CAATGTATTA TCCAACTACA AAATTTGATC AGTATGGTGC TCGCCAATTA CCTAAAGAAG 2280 
ATGTATCAGA AGAGATAGAC GAAACAGCAG ATTATGACAA GGATAGTATG GGaGAAACTA 2340 
TCGCCCAAGA AGATGGTGTA TTTGAAATAT TGTATACAGT AACTACTGTC ATGAGTAATA 2400 
GATGTCTCAA AATTAAAACA GAGAAAGCGA TTGTTACTCA ACAAGATATT GTAGATTTTA 2460

TTTTAGAAAT AGGTGAAGAA AAAATAACAT CTATAAAATT TGTAGGATTA GGAGAAAAGT 2520 
ATTTAGAAAT TGGAGGAAAT TATAATGGGG AATGAATTAA TAGTAAGCGT TCAAAATAGA 2580 
ATTCAGGAAA TGCAACATGG TGAAGGATTG AGATTACCCA CAGGGTATTC GGTAGGAAAT 2640

GCTTTAAACT CAGCATATCT AATTCTTAGC GATAATTCTA AAGGAAAATC ATTATTGGAA 2700 
AAATGTCACC CTACATCAGT GTCAAAAGCT TTACTAAATA TGGCAATACA AGGACTTAGC 2760

CCAGCAAAGA nTCAATGCTA CTTTGTTCCC TACGGGGATC AATGTACATT GATGAGATCA 2820 
TACTTCGGAT CAGTAAGCAT ATTGGAACGA CTTTCTAATG TAAAAAAAGT ACATGCTGAA 2880 
GTAATCTTCG AAGGAGATGA ATTTGAGATA GGATCGGAGG ATGGTAGAAC CGTCGTTACA 2940
AACTTCAAAC


CTAGTTTTCT 


GAACAGAGAT 


AACCCAATTA 


TTGGGGCATT 


TGCTTGGGTA 


3000 


GAGCAAACAG 


ACGGAATCAA 


AGTTTACACA 


ATCATGACAA 


AAAAAGAAAT 


ATATAAATCA 


3060 


TGGAGTAAAG 


CTAAAACAAA 


AAATGTGCAA 


AATGATTATC 


CGCAAGAGAT 


GGCCAAACGC 


3120 


ACAGTTCTAT 


CAAGAGCAGC 


AAAAATGTTT 


ATCAATTCAA 


GTAGTGACAA 


TGATTTACTC 


3180 


GTAAAGGCTA 


TTAATGAAAC 


AACAGAAGAT 


GAATACGATA 


ATAATCAGCC


ACGCAAAGAT 


3240 


ATTACACCTA 


ATCCACCAAA 


TATTGAAAAG 


CTTGAGAAAT 


CAATTTTTAA 


TCAAGATGAA 


3300 


AATAAAAAAA 


TTGCTCAAGA 


TATGATTGAT 


TCCATTGATT 


TAAATCAAGC 


TGACAAaGAT 


3360 


CTACAAGAAG 


AGCTAAATAT 


TGAATTTCCT 


GATCCTAGCA 


AAAACTATTT 


AGCTACTGGG 


3420 


GAGGTAAATG 


GAGATGTTGA 


AAACGAAGAC 


GGCCCTTACC 


CTTTCTGATA 


AAAATTATTA 


3480 


TTCAACTGAA 


GCTGACTGGC 


ATTATATGTC 


GGTCAGCCAA 


TATAAAAATT 


TTAATGAATG 


3540 


CGAAgCAGCC ACAATCGCTA 


GATTAAAAAA 


AATTTGGTTA 


CCCTGTTCAG 


ATCCAAAAGC 


3600 


ATTGCTTGTT 


GGTAATTATG 


TTCATTCTTA 


CTTTGAATCA 


TCAAAAATTC 


ATGATCTATT 


3660 


CAAAAAGGAA 


AACAAAGACA 


AAATGTTTTC 


TAGTAGAAAA 


CCATATGGGC 


TTCTAAAAGA 


3720 


TTTCCAAATA 


GCTGAACAAA 


TGATTCAACG 


ACTAwAAGAA 


GAACCTGCTT 


TTATGAATTT 


3780 


ATATAAAGGG 


GAAAAAGAAG 


TAATTGTTAC 


AGGGCAAATT 


GAAAATGTAA 


ATTGGAAAGG 


3840 


GAAAATCGAT 


TGTCTAAATT 


TAGAGGATGG 


TTATTTTGTT 


GATATAAAAA 


CAACAAAAGA 


3900


TATTCATGAA 


CGTAAATGGA 


ATGATAGCTA 


CGCGGAGCkT 


TGtACATTCA 


TTGAAAACTA 


3960 


TGGATATGTA 


CTTCAAATGG 


CAGTTTATAA 


AGAATTGCTT 


AGTCAAAAGT 


ATAATAGGGA 


4020 


ATTTGTACCA 


ATTATAGCCG 


CTGTTTCAAA 


GCAGACTCCT 


TCTGAAGTAA 


AGCTAATTAC 


4080 


CCTAGATGAA 


GACAAGATGC 


ATTTTGAAAT 


GATCCAATTA 


AAAGAAAATA 


TTGAACATAT 


4140 


TATTAAAGTA 


AAAAATGGCG 


AAGAAAAACC 


AACACTGTGT 


GGTAGGTGTG 


AATATTGCAG 


4200 


AGGGCAACAA 


CGAATTACTC 


ACTTTACTAA 


TATGAATGAT 


TTATAGGAGG 


GAGGCAGATA 


4260 


TGGCAAGACC 


TTTAAAACAA 


GGAATTGACT 


ACTTTCCGCT 


TTCTGTAGAT 


TTTTTAAGAG 


4320 


ATATAAAGGT 


TCGTAAAATC 


AAACGTGCGT 


GTGGACCTTA 


CACTGTCGAA 


ATACTACTTT 


4380 


GCCTGCTGGG 


TAATATTTAT 


CGGGAAACTG 


GTTATTACAT 


CGGGTGGGAT 


GAAGATACGA 


4440 


TGTTCTTAGT 


TGCTGACGAA 


gTTGGGGCGA AAGAGGGTCT 


AGTTGAAGAG 


ACGGTAAATA 


4500 


AGGCCGTTCA 


AGTTGGATTT 


TTTAATCAAG 


AGAAGTTCAA 


TGAGTATAGA


ATTTTGACTT 


4560 


CTAACGGAAT 


ACAAAAAAGA 


TATCTCGAAG 


CCACTAAAAA 


AAGGAAAGAA 


GTAGTTATTT 


4620 


CCGACATTTA 


CTTAGTTAAT 


GACACCATAA 


AGGATGAAGA 


AACCCTAGTT 


AATGGTGTCG 


4680 


ATAATGAACA 


AAGTAAAGTA 


AATAAAAGTA 


AAGTAGATAA 


AAGTAAAGTA 


AATAAAAACA 


4740 


ATAAAGAAAA 


ACCTGCCGTT 


CAGTTGTCGT 


CTGAAAAGGA 


TTTTTTGGAG 


AATCCTTTAG 


4800 


GAGACAAAAA 


AACAGCGGAG 


CTAATCGCTT 


ATTATTCAAA 


AAATGTTAGT 


CTTGCTACTC 


4860 


CTGTAAATAT 


GACAAATCTT 


GCaTATGATT 


TGAAAGACTT 


TAATGGCGAT 


CTGGATTTAC 


4920 






TAAAAGAAGC GATAAACATT TGTTCAAGCA ATGTAAAAAC ATATGCTTAT TTTGCTGGAA 4980

TATTAAAACA GTGGCGAGCT AACGGAATTA ATACTCATGC AGATTTTTTG AAGAAAGTAA 5040

ATAAAAAATC AAAGCAAAAA AATAAACGAG ATAGTGAACC ACCGCTAAAT GATTATTCAG 5100

GATTGTTTTA AGGAGGAATA GCTAATGATG AATACTAAAG TTGCATTAAA GCAATTAATG 5160

GCTACACATT TTATTACAGT TGATGTCCCT TGTCCTGAAT GCGGAGGAGC AATGACAGCA 5220

TGGAAAGAAC CAACTCCAGA TACTCCACCA AGATGTCCTC CCGTATGTAT GGAATGTGGC 5280

TATCGTTCAG TGAAGaAAAA AGAaGCAaCC ACTGCTAAGA aCCTTTATGr ATCTAGTCTG 5340

ATAAAAAAGG CAAAAGACTA TTTTTTAAAC GGATCTGTAT TAACGGATAG AAAATCACTG 5400

CATAAAAAAA TTAGTAATTA TTATCCAAGC AACCAAGAAT CAACCAAAGC TAAAAATATT 5460

AGTATGGAAT TTTGCGAAAA AACACTGCTG GGAGAAATAC ACCATCTGAT TTTAACAGGG 5520

GGAGTTGGGG TTGGGAAATC ACATCTAGCA ATTGGATGTC TCAACGAAAT ACTTGTTAAA 5580

AGTAATTATT CTAAGAAGGT ACTATTTGTC AGCTATCGAG AGCTTTTAGA ACAGTTAAAA 5640

TTTGCGATGA ATGACGAAGA GGCACGAAAA GCAATTACTG GTGTAGCAAT GTCCGAAATC 5700

AAAGCAGCAG ACGTCGTAGT TCTAGATGAT ATAGGGGCTG AACTAGGCGA CTTTAATAAA 5760

AGAGATGAAA AAGGCAATCT GATTCTTAGT AGAGCATCAA ACTTTGATAT TGATACGCTG 5820

ACAGGAATAT TGGAAAGTCG CATAGATAAG CCAACAATTA TCACCTCTAA CCTAAAATCA 5880

AAAGAAATTA AATATTGTTA CGGCGAAAGA ATAGTCTCAA GAATGGCTAC CCATTCAAAT 5940

GGATTTATGC ATAGGTTTAC AAATACAAAA GATTATCGAT TGAAAACAGC GTAGGAGGTC 6000

ATATGAAAAC TACTATACCA ATTACACCAA AACCACAAAG CCGTCCGAGA TTTGCTAGAC 6060

GTGGAAAGTA TGTACAAGCC TATGAAGATA GCACCATGAA GACGTATAAG CAACmAGTAA 6120

AAAACTATCT TAGAAGAGCA AAACCTAAAT TGATAGAAAA AGGGCCAaTC ATGGCACACG 6180

TGACGTTTTA TGTACAAGCG CCTAAGTCAG CCCTAAGTAA TAAACAGAAG CGGTTAGACG 6240

TAAAATTAGA GCGCAAATAC TGCGATAAAA AGCCAGATTT AGACAACTAT TTCAAAGCAG 6300

TCACGGATGC TGCAGAAGGT ATTCTTTACm AAAATGATGG GCAAATTGCT GTGATAGTTT 6360

GTCAAmAACT ATACAGCTTA AGACCTCGAA CAGAACTCGA AATAATGAAA TTGGAGGTTG 6420

ATTCATGAAA GCGCAAATGG AATCTTGGCA AGATATACAA GAGTATGTAA AGAACAACAA 6480

GAAAAAACAT AAACGTAGAA TGGCTAACGA TGCAATGAAA AACGTCAACG AGTTAGGATT 6540

TCCTTTCACC TATTTCGAAG ATACTTACCA ATTTGCAATT AAAACATCTA TCGGGATGGT 6600

CGACTATTTT GGAACAACTG GCACATGGGT TGTGCGTAAG AATCAAGATC GTGGTAGAGG 6660

GCTAAAAAAA TTAAAGAAGT ATTTGGAACA TCCAGTTCCA ACAAAAGAAA TAAAAAAAAC 6720

AAAGGGTAAT CAGTCATGGA TGAAAAATAA TAGATAGGAA TGGGAAAAAT GAATAGGAAT 6780

AAATGGCAGA GAGTGCCTGC ACCAAACACA CGTGGTGCAA AAATGCATGG ATGGGTTAGA 6840

GAAATGGATG AAGCCTACAG AGAGGTGAAT GGGGaATACG CTGTTCTTGT CCGAACTTTG 6900
6900 
AATACTAAAT TAGGCACAGT TAAGCATGCA GCTATCAGAA ATGCTACAGA AACTGACATT 6960

ACTTGGaGTG AAAAACcAGC GTATTAAAAA TGrAATTTTT gGaGAGGaAA AACAAGCGAT 7020

TGAATTTTTT CCAAAACAAT CaGAATTAAT AGATGAGGCT AATATGTATC ATTTATGGAT 7080

TTTAGAGGAT TGCGAATTGG AATTTGGAAT ATACGAATAA TTTGGAGGTA ATACTTTTGG 7140

GTAAAAAAGG TTCAAAGATT AAAAAGAAAA AACGCCGACT TAAAGAAAAA GCTATTGCAA 7200

ATGGCACATA CAGCAAGCGA GGGAAAAGCG ATGGTATGTA TAAAATGCAA GGGACAGATG 7260

ATTGTTTGGG AAAAAGATAA GTTGAAACAT CAAAAAGGTT TTTATTGTCC GTTATGCAAC 7320

AAAAATGGGC AAAGTGTTGC GAAAAAGTTA GCTGAAATAA AGAAAAAATA AGCGAAAGGA 7380

GTGGAGTTTT GCGGCCGCAT AAAAAGCTTT TTGCTCCTTT GAAATGATGA AAAAAATACT 7440

TGATGCATGC TGTGGTAGCA GAATGTTTTG GTTTGATAAG CAAAACGACC AAGTTTTGTT 7500

TATGGATAAC aGAAAGCATT ACgAAAAATT AGAGAGTGGA CATATTGTTG CTGTTAATCC 7560

TAACaTTGTT GCAGATTTTA GAAAGATGCC GTTTGATGAT AATTCATTCT ATCATGTTGT 7620

ATTTGATCCT CCGCATTTAT TGAGAGCTGG TAGTAACAGC TGGTTGGCCA AAAAATATGG 7680

GAAGTTGAAT GAGCAAACTT GGAAAGAAGA TATCCAAAAA GGTTTTTCTG AGTGTATGAG 7740

GGTCTTAAAA CCTAACGGAA CATTAGTTTT CAAATGGAAT GAGGATCAAA TCAAGCTATC 7800

TGAAATATTA AGCATAATTG ATTACGAACC ACTTTACGGC AATAAAAGAG CGAAAACGCA 7860

CTGGTTAGTG TTTATGAAAG AGAGTGAAGA AAATGAATCG AATGGTTGAA GTTGGATAGG 7920

TTTATAAATT AGTCGATGCA AATCCACCTG CATTTGGAAA AGTAAAAAAA TCTTTGGAGG 7980

CTAATTATGA AATTAATCTA TGTTTTATCT GGAAAAGAAG AAAATAAAAA CTATGTAAAA 8040

AAATTTGTTG GAAATTATTG CAGTTTTGGA CCTAAAGAGG ATGCAAAAGC ATTTACTAGT 8100

GAAGAAGCTG AACAGATGAG AAGACTGTTA GAGAATAGTG TAGGCAATGC GTTTGTTATT 8160

GATGATGACA GAGAGGTAAA AAATGGGTTT CAAGTTTAGC GATTATTATA ATGCTGAAAA 8220

TATTTGTCTG TCAGAAGAAA ACATTAGAGA GTTACTTATT GCTTATTTAG AAAAAGGAGA 8280

GGAAGAAAAT GATTTATATT AAAAATTTTG TACATGATTT TGATTCTTCA ACAATTACCT 8340

TTGAAGTAGA GCGTGATGGG ATCACGAATT ATGTAGGAAC TAGAGATACT GGTTATGGAA 8400

CGACTTCTAC AGATATTAAT GATTTTACTG AAGaTTGGTC AGATTCTGAA TACGATCAAT 8460

TAGAAGAGTT TTTAAACGGT TGTCAAGAAA TAGTACATTC ATTTTACCAT TAATTTTTTA 8520

GGTTGGAGGA ATAACAAATG GCtTATGaAA AATTACaTTT AGTaACAGCG ATAGTAAGTG 8580

GAGATATTTA TGTTGGTAGA GTCAAgACGG CTTGATGGAC ACAAGATATC GTCGAATAAT 8640

TACAGGTGAG GCTATTCAGT CGGTGGCAGA TTGGTTTTAT GTAAATAAAA AAAAGACAGT 8700

CCAATTTAAA GGTATTGATG GTAAAGAGCA TAGCTTGTTT TATACATCTG ATAAAGAAAA 8760

AGCAAAAAAA ATTCTAGCTA TTTTAGAGGA GGAAAAACAA ATGAAATTTT ATAAAATGAA 8820

AGAACCTTAT TTTGCATTAA TCGCTGCTAA AGATGAAAAA CAATGTTTAA AACTTTACAA 8880

8880 
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GGATATTGTT TGCGAAGTAG AAGACGAAAA AGAATTTTTC GATGATATGA AAACAATTGA 8940 

TAAATACGAA GCGTTCAAAA TGCTTGCTAA AAGTCGTATT GAAGATGGTG GAGAATTGGG 9000 

GGCAGAAGAA GCTTTCAATC AATTAGAAAA TCTTGAAGAA AACGGCGAAG TATTGTTAAT 9060 

CGACGGTAgC TTGCTTTAGG AGGrGTAATk GTgCtATCTA TTCTAAGAGG TTATAAAGAA 9120 
G 9121

(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 



AGCCCAAAAT TAGGGAAGGT CTTCTTTGAC ACCTACtTTA ATTTTCCCyG cTTTTTTTAA 60

TTGTTCTACT GTGACCGTTT TGTTTGATGA AGCACTTTTG GAAGTTGAGT CATTGTTGCC 120

GCTACCACAG CCaACAATGA TGCCTGCTAA TACAAGTAAG GAGAGTCCAG CTAGCCATTT 180

ATTTTGTTTT TTCATTGTTC ATTCCTCCAA TTTTTTGTTT TTTTATTCAT TACCCGTTAT 240

GAATAATTTT ACTTAGAAAT TCTTTTGCAC GAGGGTTTTC AGTGCTTGTA AAGAAATGTT 300

CAGGCGTTCC TTGTTCAATA ATCTGTCCTT GGTCCATGAA GATTACACGG TCAGCCACTT 360

GACGAGCAAA GCCCATCTCA TGTGTGACAC AAACCATGGT AATATTTTGT TTGGAAAGAT 420

CGACCATTAC ATCTAAAACT TCTTGAATCA TTTCTGGATC AAGTGCCGAA GTCGGTTCAT 480

CAAAAAGGAT AATCTCAGGA TGCATATTTA AGGCACGAGC AATAGCGACC CGTTGTTGCT 540

GCCCGCCAGA AAGCTGCGAA GGATAAGCAG AAGCTTTATC AGCCAAACCG ACTCGTTCTA 600

AATACTCCAT GCCAGTACGA ACCGCTTCGT CTTTGGAAAC GCCCATCACT TTAATCGGTG 660

CTAACGTTAA ATTATCAATA ATCGTTTTGT GAGCATATAA GTTAAAATTT TGAAAGACCA 720

TGGCAACTTT CTGGCGAACC TTTTGAATCG GTGCTTTCGG TTTGGTGATG TCAATGTTAT 780

CAATTACGAC GGAACCATCA TTGACTTTTT CCAATTGGTT CATACACCGA ATTAATGTAC 840

TTTTGCCAGA GCCAGAGGGT CCGATAATGA CAATTTTTTC TCCTTGCTCT ACATTCAAAG 900

TAATATCTTT TAAAACGTGA TGCTCCCCAA AATACTTATT TACATGTTCT AAAGAAATCA 960

TAGGCAGTCT CCTTTGCAAT CATTTGGTAG AAAAGATAAC AGAGGATTCG GTGAGAAACG 1020

TGTTATTTAA ATTAAATTAA CCAAATGTAT TTCTAATTGT ATTCTCAATG TCAGGAATTA 1080

TAACATACAA AAAAATGAAT TACTAGTTTT TTAATCGAAA AAGATTTCCT TCAAAAAAAG 1140

AAATAAATGA GAGCAAAACA GGAAAAATTG ACAATAGTCC TTAAAAAGTA GAAAATAAAG 1200

GCAGTGAATA AAAGGCAAAG GTAGGATGAG AATGAGCAAT CAAGAAGCCA TTGGATTAAT 1260

TGATTCTGGC GTTGGTGGAT TAACTGTTTT AAAGGAAGCG CTAAAACAAT TACCAAATGA 1320

ACGATTAATT TATTTAGGAG ATACAGCCCG TTGCCCATAT GGTCCACGAC CAGCCGAACA 1380

AGTCGTTCAG TTTACTTGGG AAATGGCCGA TTTTTTATTG AAAAAACGAA TAAAAATGCT 1440 

AGTAATCGCA TGTAATACCG CGACGGCTGT CGCATTAGAA GAAATTAAAG CTGCCTTGCC 1500 

AATTCCAGTT GTCGGTGTTA TTTTACCTGG CGCACGAGCA GCCGTTAAAG TCACaAAAAA 1560 

TAACAAGATT GGTGTCATAG GTACGTTAGG GACAATCAAA AGTGCTTCCT ATGAAATCGC 1620 

CATTAAAAGT AAGGCACCAA CAATTGAGGT GACTAGTTTA GATTGCCCTA AATTTGTCCC 1680 

CATTGTTGAA AGTAATCAAT ATCGTTCTTC CGTAGCAAAA AAAATTGTGG CAGAAACACT 1740

TCAAGCACTA CAATTAAAAG GACTTGATAC GTTGATTTTA GGTTGTACCC ATTACCCGTT 1800 

GTTACGTCCG GTGATTCAAA ATGTGATGGG GAGTCATGTG ACATTAATTG ACTCAGGAGC 1860

CGAAACAGTT GGCGAAGTCA GCATGCTTCT CGATTATTTT GACATTGCCC ACACGCCTGA 1920 

AGCGCCTACA CAGCCCCATG AATTTTATAC AACTGGTTCT GCAAAAATGT TtGAAGAGAT 1980 

TGCAAGCAGT TGGCTkGGwA TAGAGAACTT AAAAGCACAA CAGATTCACT TAGGAGGAAA 2040

CGAAAATGAT TAGACATGAT AGTCGTGCAC CAAAAGAATT GCGAAAGATC ACTATTGAAA 2100 

CCAATGTTTT TAAACATCCA GAAGGTTCGG TGGGTTATTT CGTTTGGCGA TACAAAGTAA 2160 

TTTGCTCAGC CCACTGTCGA AGAAAAGnTT CCCCCATTTT TACGAGGCGG GGGAACAGGT 2220 

TGGGTAGCAG CTGAATATAG CATGCTCCCT CGCGCGACGA ACACAAGAAA TATTCGGGAA 2280 

AGTGCGAAAG GAAAATTAAC AGGGCGAACA ATGGAGATTC AACGTTTGAT TGGGCGGTCA 2340 

TTGCGAGCTG TTGTTGACTT AGAAAAGTTA GGGGAACGGA GCATTGTTGT CGATTGTGAT 2400 

GTGATTCAAG CAGATGGGGG AACCCGTACA GCTAGTATTA CAGGCGCTTT TGTTGCTCTA 2460

CGCTTAGCAG TCAATCATTT ATTGATGACT GGTGCACTAG CGGAAGACCC AATTAAAgAG 2520 

CATTTAGCTG CTGTCAGCGT GGGTATTTTA CCCGATGGTT ATTGTGTGCT GGACTTAGAT 2580 

TACGCAGAGG ACTCGAATGC TGCAGTGGAT ATGAACATCG TGATGACCGA GTCTGGTGAA 2640

TTTGTGGAAA TTCAAGGAAC GGGTGAAGAA GCTACCTTCT CAGGAGATGA GTTAAATGCC 2700 

ATGCTCTTTT ATGGAAAAAC AGGGATTGAG GAATTAATTG CGTATCAGAA AGAAGCGCTT 2760

TATGCTTTAG CCTCTGAAGA AGTCCCTTCG CAAGATTCAG AAGAGAAAGT GATTGTCATT 2820

GCGACTAGAA ATCCTGGTAA AGCGAAAGAG TTTAGTTCAA TTTTTGGTGA AAAAGGATAT 2880 
ACAGTTAAGA CGTTATTAGA TTATCCGAAT TTGCCAGATG TAGAAGAAAC TGGTCGAACA 2940

TTTGaAGAAA ATGCTCGTCT AAAAGCTGAA ACAATTGCAG AAATTTTACA GAAACCAGTA 3000 
CTAGCAGATG ATTCAGGATT AATTGTTGAT GCCTTAGGAG GAATGCCAGG GATTTATTCT 3060 
GCTCGTTTTG CTGGTGAGCC AACTAACGAT GCTTCAAATA ATGCGAAACT ACTACATGAG 3120 
TTAACGGGCG TGCCAAAAGA AAAAAGACAA GCGCGTTTCC ATTGTACATT GGTTTTTGCT 3180 
GAACCAAAGA AAGAAAGTCT GGTGGTCGAA GCTGAATGGC CAGGAGAAGT CGGTACGATT 3240

CCTCGTGGTG AAGGAGGCTT TGGGTACGAT TCACTTTTCT ATGTTCCAGA GCTTGGTAAG 3300 
ACGGCAGCTG AATTGTCTGG AGAAGAGAAA AATAAAGTTA GTCATCGTGG ACAAGCAGTG 3360

GCAAAATTAA AAGAACAATG GGAAGAATGG CTAAAAAAAT AGGAGTTGAA ACGTATGAAA 3420 

ATCTTGGTAG TAAGCGATAA TCATGGCGAC CGTGACTGTT TAGTTGATTT AGTGAATCAT 3480 

TATGAAGGAC AAGTCGATGC ACTCTTTCAC TGTGGTGATT CTGAATTAGA ACGGACAGAT 3540 

GAACTTTGGC AAAAATTCAT TGTAGTCCAA GGAAATTGTG ATTTTTATGA TGAATTTCCC 3600 

AAAACTGTGA CTAAAAAAGT GGGGGATCAA GTTATTTATA TGACACATGG TCATTTGGCC 3660 

AATGTTAGAA TGGGGCTCAC GACACTTGCT TTACAAGCGG AAGAAGCAGG AGCGACAATT 3720 

GCCTTATTTG GTCACACTCA TGTACTCGGC GCAGAGCGAC ACAATAACAT TCTCTTTGTT 3780 

AATCCGGGAA GTATTCGTTT ACCTCGTGGA CCAGTTCAAG AGAAAACATA TGCCATAATT 3840

GAAAGTACAC CTGAACAATA TCTCGTTCAG TATTATAACA AAGAGCACCA ACCGTTGGAA 3900 

CGATTAAAGG CAACGTTTAC GAAGTAAGTG AATGTTAAGA AATAAGTGCT ACTTTTTTAT 3960 

TTATCAGATT TTGAGCGTAG GACAAAAATC TCTTTTTGAT TTTTGTCCTA CGCTCTTTTT 4020 

TTAGTCGACT TGGATAACCT TTAAGCCGGT CTTACGAAGT TCTGTAATTA AAGTTGATGA 4080 

GCAATGTTTG TCTGTAATCA AACAATCAAA AGCGGTAATA TCAGCAATTT TAAATTTAGA 4140 

CGTCAAGCCG ATTTTTCGAT GGTCCGCGAC CAGAATTTTT ACTTTTGTTG TTTGTTTTAT 4200 

CATAAGCTCA TTAATTTTTG CTTCATTGAA AATTTTTGTA GTGACGCCAT GTTCTAAACT 4260 

GACCCCGCTA CAGCCAATTA CGCAAACATC GGCATGCGTT GCGGTGATTG TGTTAATGGC 4320 

AATATCTCCT ACCAGAACTT CTTTGGGAAA ACGTAATTcG CcTCCGGTCA AAaTATAATT 4380

GTATtGAGaA ACATGAGGCT TGGTGGCAAT TTtTAAATTA TTCGTTACGA TGGTCAGATG 4440

GTCgGCCTTT AAATAATGGA GAATATTCaA AGCAGTGGAA CTTGAATTAA TGAAAACGGT 4500

ATCATAATCA CTTAGATAGC TACAAGCAAC ACGAGCAATG GCTTGTTTTA ATAACTCCGG 4560

TCCTGATTGA TCAAAGTCTT TCTGATTTTG GTCGTTGTAA ATACAATACC CATATTTTCG 4620

AATGATGTCA TTCTTAGTTT CTAAGACATT TAAATCACGA CGAATTGTGC TTAAGGAAAC 4680

ATTTAAAGCA TCTGCTAATT TATAGGTAGT CATTTGTTTG TTTTTTTGCA GCAGCGCGAG 4740 

AATGTTTTGT TGCCTTTTTT CAATGGTTTC TAATGAACTT TTCAAGTAAT ACAACCTCCT 4800

TTAGTTATTT TTTTGGACAC GGTCCCCCAT GAAATAGTAT AACTCATAGG AAAACAAAAC 4860

ACGTAAAAGA ATCATTAAAA GTGCAAACAA TCATATAAAA GGGTTTTCAT TGTGATTTTT 4920

TGCGGCTATG ATTTTCTTAT AAAGCATAAC ACAATCATTT GAGAAGAGGA GTGGTAGCAT 4980

GAGCAAATCG ATTCATGAAA TTACAAAAGA AAGTTGGTTG AAGGCGACAT TTCCAGAATG 5040

GGGCACTTAT TTAAACGAAG AAATTAATCA GACAACTGTG CTTCAAGGAA CGGTTGCACT 5100 

CTGGTGGCTT GGTTGTACAG GCATTTGGCT GAAAACGCAT GAGAACACGA ATATTTTATG 5160 

TGATTTATGG TGTGGAACAG GAAAACAAAC GCACGGGAAC GGCTTAATGA AAAATGGCCA 5220 

TCAAATGATG CGCATGAGTG GTTGCCAAAA AATGCAACCT AATTTAAGAA CACAACCTTT 5280 
TGTGATTGAT CCTTTTGAAA TAAAAGAGGT AGATGCTTTA GTTGTCACAC ATATTCATTC 5340

AGACCATTTG GATATTCACA CAGCTGCTGC TGTTCATCAA AATTGTCCTA AGGCTCTATT 5400

TATTGGTCCA AAAGAAGTAG TGAAAACATG GCAGCGTTGG GGTGTTCCGG CAGAAAAAAC 5460

ACGGGTTATA GAGCCTGGTC AAGAAATAAA AGTGAATGAT GTGAATGTAG TGGCCTTGGA 5520

AGCTTTTGAT CGAACCGCGC TAATTACTTC TGATGATCCT GAAAATACAT TTAAAGGTAA 5580

AATGCCTCAG GACATGAATG AACTAGCTGT CAATTATCTA TTTGAAACCT CAGGTGGTAC 5640

GGTATATCAC GCCGGAGACT CTCATTATTC TAACTTGTTT GCGAAACATG GTAATCAATA 5700

TGCAATTGAT GTTTGCTTAG GAGCATATGG CGAAAATCCT CGCGGTATCA CAGATAAAAT 5760

AACTTCTGTT GATATGTTGA gaatggccga ATCTTTAAAT GCGAAAGTAG TCATTCCTGT 5820

TCATTATGAT ATTTGGTCTA ATTTTATGGC GGATCcAAAA GAAATCATGG AAATTTGGAA 5880

ATTTAAAAAA GATCGTTTAC ATTATCAATT TAAACCATAT ATTTGGCAAG TGGGTGGCAA 5940

ATTTGTTTAT CCAGATGACA AAGACAACTT AGAATTTAAC TTTGATCGTG GCTTTGATGA 6000

TGTTTTTGTT AAAGAAAATG ATACACCATT TCCTTCATTC TTGTAAGAAA GGCTGGATGT 6060

ACAATGTTAA CTTATTTTTG GGAGCAAGAG CTTATTCATT ACCCTGATAA AGAACCGGTT 6120

AGTTGGCAAG AGGCGATTCA GGAAAGTTGT TTGATTTTAC TCCAAAAACA CATCATCGAT 6180

CAGAGTTACG TCGATGAGAT TATCCAGTGT GTCGAAACGT TTGGACCTTA TATCATCATT 6240

GCTCCTGAAG TAGCCATGCC ACATTCATCA GAGGAAAGCG CAGGCGTTTT TGGAACGGCC 6300

ATTTCTTTTA CAAAATTTAA GCAAGCAGTC ACTTTTGCTG GCGATCAAGA AGCAAAAACA 6360

GCAACATTAT TTTTTACATT GGCAGCCCAA AATCCCGCCG AACATCTGGA AAATATCCAG 6420

CAATTGATGG ATTTATTGAT GACGGATGGC GTGATTGCTG ACTTGTTAGA AACAAACACG 6480

CCAATGGATT TTAAAGAAGT TATGGAGAAA TATCAATTAT AGGGGGAAAA ATCAATGGGG 6540

GAGTTCTTGT TAAATAGTTG GACTTATTTT GCCACGAACG TTTTAACACA GCCAGCATAT 6600

TTGATTGGTT TCATCGTGTT ATTAGGTTAT CTCTTGTTAC GGAGGCCCTT TTATGAATGT 6660

TTAGCAGGTT TTCTGAAAGC GACAGTTGGT TATTTTATTT TGAGTC3TTGG TTCTGGTGGT 6720

TTGGTTAATA ATTTTCGCCC TATTCTAGTA GGATTGAAAG AGCGATTTAA TTTACAGGCG 6780

ATGGTTACCG ATCCATATTT CGGGCAAAAT GCAGTCGATG CAGGATTGAT GAAAACCTTT 6840

GGACGAACCT TTGGTGATGT GATGATTTTG TTATTGATTG CATTTATTAT GAATATTTTG 6900

TTGGTTCGTT TTCAAAAATA TACAAAATTG AGAGCCGTTT TTACAACAGG GAATGTTCAG 6960

ATTCAACAAG CAGCTACCGC TTTCTGGATT TTACTCTTTT GTTTTCCTAA CTTAGGTAGG 7020

ATCGAAGTAC TAATTTTTAT GGGCTTAATT CTTGGGTGTT ATTGGGCAGT TGCTTCAAAT 7080

TTAACGGTAG GGATTACGCA GGAATTAACA GAAGGTGCTG GTTTTGCGGT AgCCCATCAA 7140

TAAATGTTTG GCATATTTAT TTTTGCAAAA TTAGCAGAAT GGTTGAAAAA ACGTGATCAA 7200

AAGAAACATC GTTCAGAGAA GGTCGATAAA AAATTGGAAG ATATCGAATT ACCAGGATTT 7260

CTATCCATTT TTAATGAAAA TATGGTTGCT ACGTCCATTT TGATGCTCTT TTTCTTTGGC 7320

ATTATCTTGC TTGTTTTGGG ACAAGATTAT TTAATTCAAG CTAAATTTAT GCAAGAAGGG 7380

CAGAGTTTCT TCTTTTATAT CATGACCACT TCATTAAATT TTGCTGTCTA TTTAGCTATC 7440

TTACAATTAG GCGTTAGAAC CTTTGTCGAC GAGTTGACGC AGtCATTCCA AGGAATTTCA 7500

AATACCATTT TACCAGGCGC TGTTCCTGGG ATTGATGTAG CAGCGACTTT CGGGTTTGGC 7560

TCGCCCAACG CAGTAACAAT TGGGTTTCTC TTTGGCGCAT TAGGCCAGTT TTTAATGATT 7620

GGCTTACTGA TTCTATTAAA GTCTCCGACA ATCGTTATTG CGGGTTTTAT CCCATTGTTT 7680

TTTGATAACG CTGTCATAGC TGTTTATGCa AATAATCGTG GCGGAGTAAA AGCTGCTTGT 7740

TTATTCCCCT TCCTTTCAGG TTTAATCCAA GTAGGTGGCT CAGCTTTATT TGCTACATGG 7800

ATTGGATTAT CTCAATATGG CGGTTATTTG GGTATGTTCG ATTGGGCGAC AGCTTGGCCG 7860

TTATTTACAA TCATTATGAA AGTTCTAGGC TATGTTGGCG TAgCAGTGGT GGTGATTGGG 7920

TTACTAATAA TTCCGCAACT TCAGTATCGA AAAAATCCAG AAGGTTACTT TATGATTGTC 7980

AATGATTATG AACAATATGC TAAAAAATTT CAAGAGAATT AAAGGAGGAA TTTGAAATGA 8040

GAATACTTGT TTCCTGTGCC AATGGTTCTG GCACGAGTCT AATGATGATG CGGAGTGTTG 8100

AAAAAGCAAT GAAAGAATTA GGTGTACCGA TTACAAAAAT TCATCATTGC GCCATTTCAG 8160

AAGGAAAAAG CTCTGCTAGC CAATACGACG TTGTTTTTAC TCCAGTCAAT TTTTTACAAA 8220

TGTTTCAGCA AGCAGAAAAA CGCGGAGTCA CAGTGATTGG CATTAAAAAT GTAATGTCTG 8280

CAAAAGAAAT TCAAGAAAAA TATATGGCTT CAGATCGTTA TCAAAAATCT GAATAAAAGG 8340

GTGGGAAAGA TGAAAAGACC AAACTTACAA ATCGCTTTAG ATCATAATAG CTTAGAAGAC 8400

GCTTTAGCAG ATTGTATGAA AGTCGGAGAA ATTGTGGATA TTATCGAAGT TGGCACTATT 8460

TTGTGTTTAC AAGAAGGCCA AAAAGCAATT CGTTGTTTAA AACGCATGTT TCCTAATAAA 8520

ACGATTGTTG CTGATACAAA ATGTGCTGAT GCGGGCGGTA CAGTTGCTCG TAATGTCGCA 8580

CAAGCAGGCG CTGATTTCAT GACGGTCATT TGTTGCGCCA CCTTACCAAC GATGGCTGCA 8640

GCTCAAAAAG AAGTCCGAGA ATTACAAGTA GAACTATATG GAAATTGGAC AATGCAACAA 8700

GCTCGACAAT GGCGAGAATT AGGGATTAAT CAGGTGATTT ATCATCAAAG TAGAGATGCT 8760

CTTTTAGCTG GCGGCAGTTG GGGAGAGAAA GATTTGAATA AAGTTCAAGA ACTAATTGAC 8820

TTAGGCTTTG AAGTGTCCGT CACAGGTGGT TTAACTGTCG AAACGCTTGA ATTGTTTCAA 8880

ACAATGGCAG TTGCAACATT TATTGCGGGC CGTGGGATTA CTGAATCAAA AAATCCTGAA 8940

CAAGCAGCCA AAGATTTTCA AAAAAAAATC GATCAGATTT GGAAGTGATA AAATGGCACG 9000

AATTGGACTG TATGAAAAAG CACTCCCGCA AAATTTAACT TGGGCAGAAC GATTAACATG 9060

GGCCAAAAAG TTAGGGTTTG ATTTTTTAGA AATGTCAATT GATGAATCCG ATGAACGTTT 9120

AGCACGATTG GCTTGGACAC CAAGCCAGTT ACAAGAATTT TCTCAATTAA TGGTGAAAGA 9180

AGATTTTTTT ATCCATTCTT TGTGCTTAAG TGGTCATCGG CGGTTTCCTT TAGGCTCTTT 9240

AAATAAAGAA ACACGTGAAA AAGGGCGGAA AATTTTATCT CAAGCGATTC GTTTGGCTCA 9300

TCAATTAAAC ATACGGGTGA TTCAAATAGC TGGCTATGAC GTCTTTTACG AAGAAAAAAC 9360

TGCCGAAACA AGGGAATTTT TTCTTCAAGG TCTGAAAAAG GGCGTTGAGG AAGCTGCACA 9420

ATACGGTGTC ATTCTGGCTG TTGAGATTAT GGATGACCCT TTTATGAATA GCATACAAAA 9480

ATTTCTAGAA ATTAAGGAAC AAATTCCGTC CCCTTTTTTG CATGTTTATC CTGATTTAGG 9540

TAATTTGTCT GCTTGGCCGG AAAATAATCC GGCGGTTGAG CTTGAAAAGG GAATTGCTGA 9600

AATCGCTGCT ATTCACTTAA AAGATACCTT CGCTGTAACA GACACATTTG AAGGGAAGTT 9660

TCGGGAAGTT ACTTTTGGAG AAGGCTGTGT TGATTTTACA GGCTTATTGA AAACACTCAA 9720

GCGGTTAAAC TATAGTGGTC CTTTTTTAAT TGAAATGTGG AATGAAACGG ATTTGAATTT 9780

TCAAGAAAAA ATTCAAGCCG CGCAgcAGTA TCTCTATCCC AAATTAGCAG AGGTGGGTTA 9840

TTATGAACAA TAAAGCGATC ATCCAAGCAA TGAAACAACG TGTTTATGAA GCGAATCTGG 9900

CTTTGCCAAA ACTGGGCTTA GTCAAATTAA CGTGGGGCAA TGTTAGTGAA ATCAATCGGT 9960

CATTAGGGAT TATCGTAATT AAACCAAGCG GCGTCAAGTA TCAAGAAATG ACCAAGGAAC 10020

AGATGGTAGT GACTGATTTA AAGGGACAAC TATTGGAAAC CAATGCATTA AAGCCGTCTT 10080

CTGATTTACC CACGCATCTT TATCTTTATC AAAAAATGCC AGAAATTGGA GCGATTGCGC 10140

ACACGCACTC CCTGAATAGT GTTACGTGGG CTCAAGCAGG GCGAGCCTTA CCTCCTTACG 10200

GTACCACCCA TGCGGATGCT TTTTATGGCG CTGTACCTTG CACAAGAGCG CTCTCTGAAA 10260

GTGAAATTAA AGAAAACTAT GAAGAAGAGA CTGGCAAAGT CATTGTTGAA ACATTTCATG 10320

AACAAGAATT AGATCCTTTA GCAGTTCCAG GAGTGTTAGT TTATGGACAT GGCCCATTTA 10380

CTTGGGGAAT GACTCCGGAA AAAGCAGTTG AAAACAGTTT GATTTTAGAT GAAATCTGTT 10440

CAATGGCACG TTTGACGGAA ATAATTAATC CAGATGTTGA ACCAATTGAT CATTTTTTAT 10500

TGGATAAACA CTATCTTAGA AAACATGGGA TTTCCGCTTA TTATGGTCAG TGATGGCTGT 10560

GATGAAACGT GTGAAAAAGA AGGATAATTG ACTGATTAAT TGGTCTTTTA TAAAAACTTA 10620

GTTGTCTGTT ATGGGACAAC TAAGTTTTTA TTTTTTTAAG AAAGCGTTCT CTTTTTTGGG 10680

GGAGAATAGA ACAAAACGTT AATTTTCGTT TGTTAAGTTA TGAAGAATTT GCGAATTCCA 10740

AGACTTTTTC CGCAGAATTA GGTACAATAA GTTACGTGTT AAAGAATTAG AAATGGAGGC 10800

AACCCAATCA TGATAGGGAC TGCTGTAAAA GAATTATTAT TAGAAAAACA AGAAACGTTT 10860

TTAGTACCCG CCGAAAATGT CGCCAACGTA ATGTGTTTAA ATCCATTAAG TCACGCGGCA 10920

TTAGTCTTAT CGCAAGTTCG CTATTCCAAG ATTCCTGTGT TAGATAAGGG AGATCGCTTT 10980

GTTGGCTTAA TCGGCTTGAC AGACGTTGTA GATAAAATGT TTGATTTGAC TTCAGTGGAT 11040

TTTGAAAAAT TGAATGAATT TACTGTGGCC GATGTAATGG AAGTCAATGT TCCTGTAATT 11100

GGTGAATCGT GGGATTTAGA AGAAGTGTTA CATTTATTAG TAGATGCTTC TTTTTTACCA 11160

GTGGTTGATG ATAATCAACG CTTTAAAGGT ATCATTACAC GCAAAGAAAT TTTAAAAGCG 11220

GTCAATCATA TGGCGCATGA ATTAGAAAAA CATCATCTTG TTCTGCCTAA ATCAGAAGAA 11280

GAAATGAAAG TAATTTAAAT AAAACGCAAG AAACTCTTTA CTTTTTTCTC ATAAATGCGT 11340 

TATTCTGTAT GATAACACGA AGTAAAGGAG TGCTGTGATT GGACGCATAT GAGATTATTC 11400 

AGTATATTGG CGATGCCAAA AAGCAAACGC TTGTTAAAGT AACACTGAAA GGGCAATTAA 11460

AAGAAGTCAC GTTTCCAGAA ACAATTAAGG TCTTTAATAA TTGTAAAACT GGGACATTGT 11520 

TTGGCGATTG GGCTGACGTA AAACCTTTTT TAGAaGCAAm CAAaGAaAAA TTGAGaTTAT 11580 

GTTGTAGAAA ACGATGC 11597 
(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

TTATTTTTTA ATTCGTAATA GTTTTTTTAA ATCGTTTCAC AGAATTACTT TAATTGTTAT 60 

ACAGTTTTTT TAGTTGTnAA AAnGGGGAAA TnGGCCTTTA GGAGGTAATC ATGATTGAGT 120 

TAGTGAATGT GTCGAAAGTC TATCAAGAGA CACACGCGTT AAAAAATATT CACTTTTCTG 180 

TCCAACCAGG AGAAATCGTT GGGATTGTTG GGAAAAGTGG CTCTGGCAAG TCCACGTTGT 240

TACGTTTGTT GAATTTAATG GAACAACCGA GTGAAGGGGA GATTCGTATT GATGGCAGAA 300 

ATGTTCAAAC CTTCTCTAAA AAAGAAGTTC GCCAGCAGCA ACAACAAATG GGGATGGTGT 360 

TTCAGCATTA CAATTTGTTA GAAAATTTGA AAATTTATGA TAATGTTGCG TTGCCTTTGA 420 

AACTTTTAAA AGAGAAACAG CCTGAAAAAA TTGAGCGTTT ATTAACGTTT GTTGATATGG 480

CCCATAAGGC AGAAGCGTAT CCTGCTCAGC TTTCTGGTGG GGnAAAACAG CGTGTCTCTA 540 

TTGCTCGAGC GTTGAGCCGT AATCCAAAAT GGTTATTATG TGACGAGGCA ACGAGTTCAT 600 

TAGATGarGr AAATwCAGrA AGTGTCgTtC GACTATTACA TAAGACGCAT CAGGAATTCC 660 

GACCGACTAT TTTCTTTGTT AGTCACGAAT TAGAAACCGT GAAGCGCTTA TGTAATCGTA 720 

TCTTAGTGAT GGAAAAAGGT CAGTTAATTG GCGAATTTCT GAATAATCCT CAGCAATATG 780 

AGGAAGAACC GCTTTCCTAT TTAGAAAAAG TGGAAAGGAG TTTGCGTCCA TGACTTTTTC 840 

AGAAACCTTT GCCGAATATT GGCCATCTAT GCTGATTGGT TTAAAAGAAA CAGGGATTAT 900 

GATGTCGATT TCGATGGCTA TCTGTTTGCT TGTCGGCTTA CCTTTAGGCT TAGGACTATT 960 

TTTGGCTAAT CCGAAaGTCA AAGGgCATCA ACCTTGGCTG TACTGGGTTT TAAATTTCAT 1020 

TGTAACTGTT ATTCGCTCGT TTCCTTATTT ATTATTTGTG ATTGCGTTGA TTCCAGTTAC 1080

TCGTTCTGTT TTAGGAAAGG CGTTTGGCCC AATTCCTTCA TCTGTGCCAT TGAGTATTGT 1140 

TGCGATCGCG ATTTTTGCTC GCTTGGTTGA ACAAGTGCTG TTGGATGTTC CAGAAGAAAT 1200 
TTATTTTTTA


ppir]\ 7\ TTf/^T 

bb 1 AA1 I to 1 


T A P P T A P P T P 


pa aaptppaa 

vJrtruivJ l v^ v»_/AxV 


TACATTTGGC 


ACTTTTTATT 


1260 


GGTGGAAbbA 


bb I AbbbbA 1 


TAPTPPTTPP 


TTAPAPAACC 


ACGACAGTGA 


GTATGGTTTC 


1320 


ATATTCCACG 


GTGATGGGTG 


1 1 bl J. bb 1 bo 


VvOULvJUUnl i 


nPiPHATTTTG 


CAGTTCGTGT 


1380 


CGGGTATCAA 


CGGTATbAAi 


7\ CCC T & T T A T 


PTATf^TTPfT 


ATTRTPrATTA 


TGATTGCACT 


1440 


GGTCTTCATT 


ftirn tv TV TV TV rn /■* 

TTACAAATGC 


1 1 bbbAAb Jl 1 


bAI 1 


A A APTAP^APA 


AAAGAAAGTA

****** X *• 


1500 


GGAGGACAAA 


TV r"* TV TV fjy /"* TA TV TV 

AGAATGAAGA 


Ab 1 i 1 1A1 1 1 


nvjl^alnLd 1 Ik/ 


PPTPTT AT TP 


PAAPAGTTAT 


1560 


TTTAGCTGCC 


TGTGGGGGAA 


A 1 AAAbAAbb 




PA APAPA AP^P 


AGATTACCGT


1620 


TGCCGTGCAA 


TTGGAATCTT 


f> tv tv n ATATUT 

bAAAAbAl Al 


b 1 1 bbAbAl I 


PPPA AP A A AP 


AAGCTGAGAA 


1680 


AAAAGGGTAC 


t\ T\ r\ tv rn rn Tv 7\ 7\ 

AAAATTAAbA 


I 1 AlbbAAb 1 




PTTPPPTAPA 


ACGATGCCGT 


1740 


GCAACATGAC 


GAAGCGGATG 


pffljv tv rp rp rp rp /— /~» 

CTAA1 111 bb 




PPPTTPATPP 


AAATGTTTAA 

/^*iX* X w XXX J*i • 


1800 


CAAAGAGAAA 


AAAGCTGATT 


m TV /"'rp/-"/" , /-'rp/-<rp 

TAGTbbbI bl 


bbAAbbbAl i 


TQTT&TTTTP 


PTPPTPPTTT 


1860 


CTATTCAAAA 


TV TV fTl TV /""* Z*" 1 TV TV 

GAATACCAAG 


A I bbbAAAbA 


1 1 i rtbb 1 unn 


A ATPPP AAAP 


TGGPGATTCC 


1920 


TAGCGATCCA 


TV TV TV FT% TV TV /"^ 

ACCAATGAAG 


GTCblbb 111 


AbbAAi 1 1 in 


A ATPPA A APP 


PPPTGATTAA 

\JVq>W X \J+± X X **** 


1980 


ATTAAAAGAA 


GGTGTCGGCT 


m rn tv tv r** /** /■* tv 

TTAACGGbAb 


bb 1 bbbAbAl 


TTPPTPPA A A 


ATPPTAJVAAA 


2040 


CATCACTTTT


GAAAGCATTG 


tv m m m tv rn f* ta tv 

ATTTACTGAA 


1 1 i Abb 1 AAA 


PPPTATP ATP 
bbb lni bn 1 \j 


AAAAAPAPAT 
/Vtrinnvjriv»,ri x 


2100 


CGCTATGGTG 


TTCTGCTACC 


CAGCCTAbl 1 


AbAAbb 1 bb 1 


PfTTT A AP A A 


PPAAAGATGC 


2160 


GATCTTGTTA 


GAAGATAAAG 


AAGCAAGTAA 


AbAl lAbbbA 


TTPPB APTTP 
1 lbbAAbl lb 


TPAPAPPPAA 


2220 


AGGCGAAAAA 


GATAGCGAAA 


Tv T\ T\ f'/"' 7\ TV /— '/TP 

AAATbAAbb 1 


1 1 iAAAAbrtH. 


PPPA TP AP A A 


PAAAAGAAGT 

S^Xfcifc*^** \J*****J *> 


2280 


TGCTGAATAC 


ATCAAGAAAA


ti rp rn TV T\ 7\ /""■ /"* 

ATTC1 AAAbb 


bbbbAAiHl I 


PPTPPPTTTT 

1 vjLb 1X11 


AATGAATAGA 


2340 


AAAAATGAAG 


CCTCTTTTTT 


bAbAAbb lib 


BTTTTTTTPT 


TTTPTTTPAT 


TTATAGTATA


2400 


/■^ fry pft TV TV fT* TV TV 

GTTAATAGGA 


AGGAbb 1 bbb 


bb 1 b 1 bb 1 bb 


A ATT ATTT AT 


TTTAATPATG 

111 / \ i. VJii i VJ 


GAACGGGTGG 


2460 


GCTTAATTAT 


TTTAC1 bbbb 


lilbllll nb 


T^AATPTHPr 


TTATTTTAAG 

l i n x x x x iuvv 


CGTGTTTTAT 


2520 


TAAGTCGTGA 


7\ tv Ti T\ TV rp /~« rp rp 

AAAAA1 b I b 1 


1 b AAAAb lib 


TA^TTP ATTTT 


PATTTTTPGT 

Ort 1 XIX X O w X 


TTATTTGCTA 


2580 


TTATTTCTAA 


TTTTACAGGT 


Tv rp f'-' /-* A TV T\ fp/T" 

Al bbAAAl bb 


/—rp TV ap7\ A TP A 
b i AAb AA i bri 


PATTPTTPPP 


AATAATTTAC

n/i x *^** xxx 


2640 


TAACCTATTT 


Tv Tv /"* tv tv TV rn 

AAGCAGCAAT 


/~ > r"» rp rp /-« TV rn rp >r-> 

Gbi IbbAl lb 


rTTi AT APPPP 
bbAAi Abouo 


TAPPTTAPTA 
1 e\K* oil rxo l rt 


ATTGGCGTTT 

1^ X i VJUWW 1 X X 


2700


CTGGATTAGT 


GGGTGGTCCG 


TV rprp/^rp/~</'" % /— /"•TV. 

ATI b 1 bbbbA 


b 1 bbbb i i bb 


TTT A ATTPPT 
111 nn 1 1 bv< 1 


PPPTTTCATC 

VJ VJ V~r V-^/k X Vw 


2760 


GGGTTATTCA 


AGGCGGGGGG 


f , T\T7\/" , TTTf^T 

b AT Ab 1 1 1 b 1 




TPPTTPTTT A 
1 oL, 1 iVol 1 In 


ATTGTTGGCT 

/i X X V9 X !■ VJVJVi X 


2820 


TAATTGCCGG 


TTTTTTAGGG 


AbbbbbAlbb 


b AAAb LrtnnL 


TGTTTTTCCG 


TCTGCCGGCT 


2880 


TTTCTGCCAT 


TGTCGGAGCG 


TGCATGGAAA 


TGATTCAGAT 


GATTTTTATT 


TTCTTTTTCA 


2940 


GCGGTGATCT 


ATCGGATGGG 


GCCACGTTGG 


TTCGTTTTAT 


TGCGCTGCCG 


ATGATTTTGT 


3000 


TAAATAGCGT 


AGGAACGTTC


ATTTTTATGT 


CGATTCTAAC 


GACGACATTG 


AAACAAGAAG 


3060 


AGCAGGCCAA 


AGCGGTTCAG 


ACGCACGATG 


TTTTGGAATT 


AGCAGCGGAA ACACTGCCAT 


3120 


ATTTTCGAGA 


AGGCCTCAAT 


AAAAATTCCA 


GCAAAAAAGT 


TGCGGAGATT 


ATTAAGCATT 


3180 
ATACAAAAGT


CAGTGCGAT1 


AGCATGACAA ACAGTCATCA AATTTTAGCT 


CATGTTGGCG 


3240 


CAGGAAGCGA 


TCATCATATT 


CCAGAGCTAG 


AAGTCATTAC 


GGAGCTTTCT 


CGGGAAGTGT 


3300 


TACGGACGGG 


CCGAATGACC 


ATCGCGCATG 


CCAAAGAAGA 


AGTTGGTTGT 


TCGGACCCTA 


3360 


ATTGTCCGTT 


GCAAGCGGCG 


ATTGTGATTC 


CATTATTTTC 


CCATCAGCAA 


ATTGTTGGCA 


3420 


CCTTAAAAAT 


GTACTTTACA 


GATCCAGCGC 


AGTTAACGCA 


TGTCGAGGAG 


CAGTTAGCAG 


3480 


AAGGTCTTGG 


GACAATCTTT 


TCTTCACAAA 


TTGAATTAGG 


TGAAGCGGAA 


GTACAATCAA 


3540 


AATTACTTAA 


AGAAGCTGAA 


ATTAAATCCT 


TGCAAGCGCA 


AGTGAACCCG 


CATTTCTTCT 


3600 


TTAATGCCAT 


CAATACTATT 


TCCGCATTAA 


TGCGGAAAGA 


CAGTGAAAAA 


GCGCGGAAAC 


3660 


TATTGCTCCA 


ATTAAGCAAA 


TACTTTCGAG 


GCAACTTGCA 


GGGAGCCGTG 


TAAACTACGA 


3720 


TCCCAGTTTC 


ACAAGAACTA 


GAACAAGTAA 


AAGCCTATTT 


GTCATTAGAA 


PAAGCACGGT 


3780 


TTCCGAATCG 


TTATCAAGTA 


ACATTTGACG 


TGGAAGAAGC 


GGTTGCTTCA 




3840 


CGCCGTATGC 


GATTCAGGTG 


TTAGTGGAAA 


ATACCATCAA 


GCATGCATTT 




3900 


AGGAGAACAA 


TCAAGTCCGG 


GTAGTGGTCA 


AAGCAAAACA 


ACACAAATTA 




3960 


TGTATGACAA 


TGGGCAAGGG 


ATTCCTGAAG 


AACGCCGCTT 


GCTATTAGGA 


AAGAPGAPAG 


4020 


TGACTTCGGA 


AAAAGGCACC 


GGCACGGCAC 


TGGAAAACTT 


GTCCCGCAGA 


rV 1 uOV^urtrtV^'^ 


4080 


TTTATGGATC 


AGAAGGTTGT 


TTTTTAATTG 


AAAATTTAGA 


GACAGGCGGG 


Af^rTATPTTP 


4140 


ATTTAGAAAT 


CCCGATGGAA 


CAGGAGGAAT 


TAGATGCACG 


TATTAATCGT 


1 u/V I ort. x orio 


4200 


CCCTTAGCAC 


GAGAAGAATT 


GAGCTATCTT 


GTTTCCCAAC 


ATCCGCAAGT 




4260 


GCAGAAGCGG 


ATTCTGTGGC 


GGAGGCAATG 


GAAGAGATGA 


TGGATCAAAA 


r;rr a a tttg 


4320 


TTATTTTTAG 


ATATTCATTT 


GACCGATGAA 


AGCGGGTTTG 


ATTTAGCGGA 


AAAATTAACC 


4380 


CATCTTAAAA 


AAGCGCCCTA 


CTTGGTCTTT 


GCAACCGCCT 


ATGATCAATA 


TGCGTTAAAG 


4440 


GCCTTTCAAG 


TGAATGCAAA 


AGATTATATT


TTGAAACCTT 


TTGAAGAAGA 


AAAAATAACA


4500 


CAAGTCATTG 


AAAAAGCATC 


GAAAGAAATG 


GGTCAAGCTG 


TCCCAGAAAA 


PArTGCCGAA 


4560 


AAAGGACCAA 


AATCTGAAGC 


AATACCTATT 


CAAGGGGAAG 


ACCGCATTTA 


TPTTGTGGCG 


4620 


CCAGAAGATA 


TTTACCTCGT 


TTCTGTCGAA 


GAACGGCAAC 


TGAGCATTTT 


TGTAGATCAA 


4680 


/■> TV /~° rn rn TV /"'TV 

CAGGTCTACA 


AG A i GAG AG G 


GACCCTCAAC 


AGTATCGAAC 


AAAAATTACC 


TGPCACGCTC 


4740 


TTTATTAAAA 


CGCATCGAAG 


TTTCATCTTG 


AATCGAACAA 


AAATTCAAGA 


AATTTAGfTG 


4800 


TGGTTTAATA 


Tv TV /*» y*» mm /■"•/""* TV 

ACACGTTGCA 


TV /"*m A 7\ *T"T»mrp T\ 

AG 1 AA 1 1 1 i A 






TfSTTAGCCGT 


4860 


TCATACGTCA 


AAGAATTTAA 


AGAAAAACTT 


GGcTTAAGTT 


AAAAAATCAC 


CACTTGAAAT 


4920 


GTCGAATGCA 


TTTCAAGTGG 


TGATTTTTGC 


ATTTGGTGAA 


GAAATTCTGG 


CATTTCATGA 


4980 


AAAAAAGCCC 


TTAAAAAAAC


GTTTTCTTTA 


TATACTTAAC 


CCATAGAAAA 


CAGACAAAAG 


5040 


GGAGCGAAGA 


AGAATGGAAA 


AGAAAGTATA 


TTCATTTTTA 


CAACAAGCAG 


CAGTTTTTGC 


5100 


CATTATTATG 


TTAATCGCCA 


ACGGCTTGGC 


GTTCATTTCC 


CCAATACCAA 


TGCCTGCTTC


5160 
AGTAATTGGC


ATGGTGTTAT 


1 G 1 ICAi 1 bb 


TT T A tpt a p IV 


AAAPTTP.TTA 


AATTP.P.Af5PA 


5220 


AGTGGAAGGG 


CTAGGTAA1 A 


GG 1 i A L tAAt 


ppttattap a 

oul Iril 1 


TTTTTATTTf^ 
nil inl 1 iu 


TTPPTTPAfiG 


5280 


GATTTCCTTA 


GTTAATTCTT 


TAGGGAI 1 


vj^rlrt^rtri 1 1 1 


fif;PP.TTPAAA 


TTATTTTTPiT 
i inl 1 1 i lui 


5340 


TATTTTAGTA 


GCAACATTAG 


CGI I Ad AGC 


aaPAAPTPPT 


1 VJ O t\\^ \J\J\J^.\3 


PPTTTTT ATT 
*w<k- 1111 Inl 1 


5400 


GCATATGAAA 


GATGCTCGAA 


7\7\/'" , 7\T\T\n7\PB 


ar*ppnrar;pa 


APf^AAP,AA 7\CZ 


PP.AAAP.P.AP.A 


5460


GGTTCGCGCA 


m TV TV /T^ TV 

TGAATCCGTA 


TG 1 I GGG A I 1 


i\ 1 inl 1 1 L-M 1 


1 nuv> X ij inl 1 


TPPPATAP.PT 


5520 


ACTTGGTTGT 


TTAAAAAAAG 


m t\ tv tv rp n -1 hp 
TAAAGG 1 I 1 C 


1 1 G 1 Lnl I In 


PPPP ATT ATT 


PP.TTP.PPATP 
v_/Vjj J. lOvvVjnlu 


5580 


ATTTTAGGCG 


TTGTCGTTTT 


TV T\ TV TV O ffl TV TV TV 

AAAAG T AAC A 


GGAAI 1 AG 1 1 


appappa at a 


PA ATP A APPT 


5640


GGCAAATATA 


TCAGCTTCTT 


TCTAGAGCCT 


GGG AC AG 1 AG 


PPTTTPPP AT 
GG 1 1 Ib^brtJl 


TPPTTT AT AT 
lv^Ll 1 lnlnl 


5700 


AAAAAGAGAG 


ACGTGTTGAA 


AAAATATTGG 


rri rri 7\ 7\ TV T\T' r P'P 

TTAGAAAX 1 1 


1 AAGGGGG 1 1 


a apa aTTPPT 


5760 


TCTTTTGGTT 


CGCTAGTGGC 


TGTTTATTTT 


TV i^* TV TV TV TV /"* 

G C AG G AAAAG 


TGA1 iGGGAI 


GG A X AA I G A 1 




TTAGTGGCCT 


CAATCTTACC 


ACAAGCAGCT 


TA TV 71 Z"^ 1 TV f** f"*^* 7\ 

ACAACAGCGA 


1 1 GGGG 1 AGG 


T , TA r PTTPTPa a 

X AX X X G 1 bftn 


5880 


ACTGTTGGCG 


GCATTGCTTC 


CATTACGGCG 


TTTACTGTAA 


TCI I GAA I GG 


t a r'T 1 a app 
X G X AG X AAGU 




TATGCCTTGG 


GTCGAATTGC 


ATTGAAATGG 


TTTAAGATTA 


T\r">7\ ATrrj\ TV rn 

ACAATGGAAi 


prrp n ta a ppt 
GGGG AAAGG 1 




TTAGCTTTAG 


GTGCGGCCGG 


ACATGCCCTA 


GGTGTGGCAG 


T 1 GGG A 1 AGA 


ta ta TiT'pppa a 
AAl GGGGGArt 


6060


ACAGAGGCTG 


CTATGGCTAG 


TATTTCTGTC 


GTAGTGGTTG 


GGG I AG 1 1AG 


rpprpTTTAPTA 


6120 


GTTCCTTTCT 


TTGCAACGGT 


CATTGGTCTA 


m TV TA TV TV 7V TV Z" 1 

TAAAAAAGIG 


GGG i 1 GG 1 GA 


PTTTTTATPA 
GXilllrtl ort 


6180 


GTAGCTAATT 


TCTATCAGCT 


AACAGTCTCT 


GCTT Al I C 1 G 


1 GAA 1 GG I AG 


TT ATP APT A A 


6240 


TCCGTAGGCT 


CGGAGTTAAA 


TV m /"* TV m m TV TV 

ATGAGTGTAA 


tv tv 7\ TvrT'^TV Tv 

AAAAGCCAAC 


PTTBTPPaT A 
Gl lAlGGAl/\ 


PTAAPPTTPP 
vj 1 rVrAovj 1 1 ovj 


6300 


CTTTTGCGTT 


GTTCACAGGA 


TTTTTTGAAT 


Ti m Tv /" T\ rp y\ r" 1 rp 

AT AG AT AGb I 


GGG AX XGAGG 


aPTPTTTTTA 
rtG 1 1 In 


6360 


TAGAATGGTT 


GCTCCTAAAG 


TTTGTTTAAA 


GTGGCGTAAC


GGGGAX lcbl 


r^TPPTTPTTP 
blt^l lol 


6420 


ATCAAAACTA 


GCAACGGCGC 


GTTCAGGAAT 


CAT1 AAI 1GA 


rp tv CT^C'V 7A 7A TAT 

X AGGG X AAA! 


T ATAPPPATP 
Inl noutn 1 


6480


GACAGCCGTA 


TGCAACACGC 


AAATATCTGT 


TV /"■» TV TV T\ TV /— ■ O TV 

ACAAACACCA 


rp rp TV n TV /"^ TV TV A 

G X 1 AAAG AAA 


PPTPTPT AaT 
GG 1 G 1 o 1 nri 1 


6540 


TCCTCGTTCT 


CGTAATCGAA 


TGTCCAAATC 


CGlGCCACi A 


AAAG G AG AA 1 


APTPPPPTTT 
nu 1 uLCo ill 


6600 


GTCCATCCAA 


TAAATGGTTG 


CCTCATTTTT 


GI GAGC I IGA 


T Tv A TA TS. 7ATP BP 

X AAAAA X L,nu 


PT APPPAPPP 


6660 


AAACAGATTA 


CGCCCGTCAG 


rp rp /■*>/"* rp 7\ 7\ "A O 

TTCC1 AAGAC 


Ai XGIGGGGC 


pppa a a ar^TT 

bGGAHrt/vJ 1 1 


TATTTTPAPPi 

1 r\ 1111 vfiVj vj 


6720 


TV m TV TV ^ TV 

ATGATAGCGA 


TCTAGGGGGT 


CATGCCCAIC 


AA 1 bolMM 


app APA A AaT 


PPPPATTPTC 


6780 


tv tv tv tv fti tv /"* fri ^ 

AAAATAGTGC 


TTTGTTTGTT 


GAACAAGGGC 


GG 111 CAA1 G 


PPTTPPPP AP. 
GG 1 X vjL-L^rio 


PPPP APP APT 


6840 


GGTTAGTTTT 


CCGTCTGTGG 


CGACAAAATC 


ATTCGTATAA 


TCAATGGAAA 


TCAGAGCTTT 


6900 


CATTTGGCAA 


CCTCCTAAAG 


GTTAAAGTCA 


TTTTGTTCAT 


TTTACAGCTT 


TTAAGAAAGC 


6960 


ATTTCAAATG 


ATAAAAGTAC


ATCAAATTCT 


AAAAAAGCTG 


TTTCAAAATA 


AACGCTTTGG 


7020 


CTAAAAAACA 


TGCTATAATT 


TTGCTACAAA 


TTAAaGCAGG 


TGAATAGAGT 


GAaaAGAaAA 


7080 


ATCGTACGTT 


TTATCCTTTG 


GTTGTTAGGA 


ATCATTGTGG


TTATTTGCTT 


AGGCGTGTTT 


7140 
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CTCGCCTtTC AATTATCCCC TAAGCCAGGC GCTTGGGTCA TCAACCAGTT ATTCGCAGGC 7200 

GAAGTTGAAA TAAAAGATTC AGCTTCCTAT GATAAAGCAT TGCCCAATGT CCAATTGCAA 7260 

GAAAaTCmAA CATACCCTyC TyCArGAAAA aaGAATACCT TtGATTTGTA TTATCCCCAA 7320 

ACGTCAAAAC AAGCGGTACC AGTCGTTTTA TGGGTCCATG GTGGTGGCTA TGTGGGAGGA 7380 

GATAAGTCTG GGATGAAAGA GTTTGCGACA CGTTTAGTGG CTGATTCCTC AGTTGCCtTT 7440

ATTTCAATGA ATTATGAATT AGCACCAGAC GCACCTTATC CAAGCCAGCT CCAGCAGGTC 7500 

AATGAACTTG TTCAATTTTT ATTAGAAAAG AaACAAGCGT ATCCCATGTT AGATTTATCT 7560 

AAGCTTTTTA TTGGCGGGGA CAGTGCGGGT GCACAGATTG CTCTACAATA TGCAACAGTT 7620

CAAACCAATG CCAACTATGC CAAAGAACTA GGGATGACTG CAGCATTGCC GGCTAGCCAT 7680

TTAAAAGGCA CGCTCTCGTA TTGTGGTCCG GTGGATTTGA AACAAATGGC CAACCAACAA 7740

AGTGATAATC GTTTTATGAA ATTTTTCGTT AAAACAGTGG CTTGGTCCTT GCTCGGCACT 7800 

AAAGACTGGC AGACGAGTGC AGAACTGCAA GAAGTGAGTT TAGTGGATAA AGTCACCAAA 7860 

GAATTTCCCC CGACGTATCT AACAGATGGC AATAGCTTTT CATTCCAAGA GCAAGGACTA 7920 

GCCTTGGTTC AGCGGTTAAA AACTTTAAAT GTTCCAGTTT CAGCTCTCTT TTTCAAAGAT 7980

AAGAAAGCAr CGATCACCCA CGAATATCAA TTCGACTATC AAACAAAAGA AGCTAAACAA 8040

TGTTACCAAG AAACAGTCCA GTTTGTGAAT ACGTATAAAT AAACAGAGGC CTTAGAAATA 8100 

TTTCTAAGGC CTCTGTTTAT TTATTTTTGT CATTTTTTCA GCTGAGAAAT ATGTATAAAT 8160 

TAAACAAAAA TGTTTATTTA ATGATGTTTA CATGAGAAAA AATCCGTGCT ATACTACGAA 8220 

CTCGTTTGTT AATTTTAATG TTAAAAACAA CGTTTTTGTT ATTTTTGTGA ACAAATTAAC 8280 

ATTTTTATCT TACAAAAACA GCAAAAAATT CGTTTAGTTA ACAAAAAGAA AATTAAAAAA 8340 

TAAATTTTTT TTTTGAAATT AAGAAAATAT CTTTAAAAAA ATCAGAAGAa AAcTTCGTAT 8400 

ACTTGAGCCA TAGGTTGGTT GTGGAAAGTT TACTTCGAGG GAGGAGTGAT GAAGTTAATG 8460

AAAAAGAAAA TCGCTGTTTT ATTGGCAACT TGTTTACTTG TACAACCATT TTTATCTGTT 8520 

TCAGCGTGGG CGATTGAAGG GGAGCAACAA ACAGAACGTA CGCAGACACA GCCAAAAACA 8580 

GCAGAAACCA GTACAAGTGA AgCAACAAAA GAAAGCAGCG CACATTCTAC AGAACAGTCA 8640

TCAATGACTG AATCTAGTGC TATCACAGAA AAAGTAACCA CCTCTTCCAC AGAAACAAAA 8700 

CCAAGCGAAG AGCAGAAGCA AATCACGTTG ACCTTTGAAA CAACGGATCA AGCTTTGTTT 8760

CTAAATGATG CCAAAAGTTA TCAAGTGGTA AAAGAAAAAA ATCAGCCACT TCGGACAGAA 8820 
GAATTACCAA AATGGGCGAA TAGTGAGGAA AATGCGACTT TCGTTGGCTG GTCTTATCAA 8880 
GGCAAAACAT ATACAAATGA AGAACTTCTT CAATTAGAGT TTTCAGAAGA TACACAACTT 8940

ACGGCTGTCT TTAAACAGAA ACTGACACGC ATGGCCAGAG CTGTTTCAAT TGATCAGTCC 9000
ATTGCGGACA CAATTGCTCC TGCAGATAAA ACAAAAGTGA TTGTTGTGCC AAGTCCTGCA 9060 
GGAGATGGAG CAGCGTACAA AGAATATGCA TCGACAGAGG CTGGCTTGAA AGAAGCCTTG 9120 
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TTTGATTTGT ATCAACAGGG GAACAATGGC GATTTCACTC TTTATATAGG AAACAATATT 9180 

ACGACGAGTG CGGCGACCAC ..C.GCTAMGTG .MTCCTGACA CCGTTACGGC AAGTAATATG 9240 

ACCTTTTACG CCTTGCAAGG TAAGGTCAAT CATTTAGTGA TTACTGGAAA CAGCGCGGAT 9300 

CCGATTAGTA TGGATCAAAC GGCGCCAACG GGTTCTAAGA CGTTAGGATT TAATCAAAAT 9360 

ATCTATTTTG GCTCAAACAT CACTTTGAGA AATCTAAACT ATACAGGTAC GAATATGTAT 9420 

TTAAATGGTT ACAGTTTGAA CCTGAATGGC GGCTCTAGTG GAAACGGTCT GACCGTATAC 9480

GGTGGAACGG ATACAGGAGA TGTTTCAGGA AACCCAACTT TAACAGTAAA TAGTACAGGT 9540

ACAGGGACAT GGAATTTCTA TGGCGGAAAC CAAAATGGTG GTAATTTAGC TGGAAATCCT 9600 

ACGATTGTTA TTAACAATAC ACGGAGCGGA CTAAATACAT TAAGTGGGGG CGCCAATATT 9660 

GGGACAGTGA CTGGGAATAC TTCATTGGTT GTCAATGATT CTGGTGGCAG AATTGCTTCA 9720 

ATTTATGGGG GCGGTTATGG GACCAACGCA ACCAACACAG CGAATGTTAC TGGTAACGTC 9780 

TCAACAAAGG TTGCGATTAC GAATGCTGCT ACAGGTTTCC AATTAAGTAC GTATTATGGC 9840

GGTGTTCAAT ACGGAAATAT TGGTGGAAAA GTAACAAACG ATATTTCTGG ATATGGCCGT 9900 

TGGTATACAG CGGGTCAACG TTTTATTGGT GGCTCTTCCC GCGGCGATAT AGGAACGAAT 9960 

CGTGCGACTG ATGGAATTAC TACAAATTTA AATACACAGT TGTATAGTGC GGGTCGTGCA 10020 

GATTTTGAAG GAGGCAACCA ATATTCTGGA ACGATCATCG GAGATATTAC GAATGTGGTT 10080 

ACAGCAGGAA CGAATTCCGC TGGTGGTATT AATGATTTTA ACGGCGGTGC TGGTAACAAT 10140 

GTTTCGAAAT TTAACAAGAG TCAAATTGGG GCCTCAAATG AAGCAACGTA TGATGCTTAT 10200 

ACACCACAAC AAAGAGCAGA ATTAGCCAAA TCAGCAGCAG CTTTTAAAGT TTTTGGAAAT 10260

ATTTCTTCTA AATTAGTCAG CGGTTCGTTT AATAATGGTG CAATTTACTC AACTGCAGCT 10320 

GGCCGCGGCG GTTATATCGA AGGAAATACA ACAATTGAAG TGGGAACAGC GAATGGTGAT 10380 

GGGTCCTTAG GTGGCGATGG TATGGCTTAT TCAGGCGCAA AACCGACAAG CTTAGATTAT 10440 

TCAACGACCG ATAAAAGTCG TGGGTATAGT TCAGGTTGGG ATATTGTTGG TGGCGGCGGC 10500 

TATCCTGCTA GTAATGATAC TTGGGATATC TATATAAAAG GAGATACTAA GACTGTTTTA 10560 

AATAATACCA TTGCTCGCTG GACATATGGT GGCTCTTTTT CAGGTGTCGT TGAAGGCAAT 10620

ACTTCTAACA CATTGAATGG CGGGATTGCT GACACATTAG AAGGAACAGG CTATCAAGCA 10680

GCTCGAGTTT ATGGAAACGG TCAAACAATC GTTAATAATG GTCAAGTCGA TTGGTTTCTT 10740 

TCAGGCGGTG GCTGGAATGA TGCCAAAAAT GTTGGGAATG TCGGTGTAAC GGTTTATGAA 10800 

GGAGTAATTA ATGCTTCGAT GGGTGCTTCT TATGGAGCAA GCGGTGGCCA TACAATTACT 10860

GGTAATTCAG ACAATCGCAT CTACGGAGGA AATTTCTCAG GAACTCCTCG GACAGGCGCG 10920 

AACGGTTTTT CAGGTGGGAT TACCAATGCG GGCTCACTTC TTGGAAATGC CAAACTCTTA 10980

ATTGATTTAA GAAATTACAA CGGAGAGTTT AAGTTGCCTG GTAATACCTA CATTACTGGT 11040

GGTCGGCCCT ATGGTCAAAA TACGACACTA GGAACGGATG AATCAAACAC AATTACTTTA 11100 
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r** m m t\ ^* tv rn 

GGTTAGCTAT 


rp Tv tv m Tv A *"P A A 

T AA X AA TbAA 


1 AC I CC 1 rl i. C 


TT^TP.P.PTaT 


TaTTPTTPAP, 


16620 


cAAAATGAGA 


CAAAGGTTCA 


7\ccT\C7\ r PT"VT\ 
ACCAbAX 1 1 A 


AAA 1 1 AC J. vjvj 


r,pp.TTaaapp 




16680 


TV TV i"' f 1 C C C C 7\ T\ 

AACGCbcGAA 


TV ^^rp^iTV rn/^Ti TV 

AcGTcATCAA 


Ibll XCX 1 XA 


CAAAACCCAC 


aap,pp.p.ppTa 


TTTAAACCAA 


16740 


TTACATTTAA 


TCAACACTGT 


TTCAAAAGGA 


GGCGAAACGC 


TTTACCAATC 


CGATACTGAG 


16800 


GATATGCAAG 


TGGCGCCAAA 


CTCTAACTTT 


AGTTACCCAA 


TTTCTTTAAA 


AGGGGAACGA 


16860 


TTAACGCCAG 


GAAAATATGT


CTTGAAATCA 


ACGGCCTATG 


GTGTAAAAGA 


TGAAAAGGGC 


16920 


ACCTATCAAG 


TCAAAGGCGC 


CAATGGTGAA 


GAACGGTACC 


TGTACAAATG 


GGAATTTACA 


16980 


AAAGAATTTA 


CTATTTCTGG 


GGACGTCGCT 


AAAGAATTAA 


ATGAAAAAGA 


CGTAACCATT 


17040 
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All bb 1 Go 1 1 


GTATCTACTG 


ATTGCATTAA 


TPATTCTAGC 


GCTGCTCTTA 


17100 


. 1 1 b A_l _J. _i 1LI 


lb 1 ibl Albb 


TAAAAAGAAA AAAGAGGAAG


AACAACAATP 


TGAGCAATAA 


17160 


AAAbbbb 1 Al 


b 1 Al i GALAA 


AAAGGGTTTT 


CCTTGTAAAA 


GTAAAGGTAP 

vj i nnnou X 


TTTTACAAGG 


17220 


AA 1 Ibb 1 I bb 


paTannpTap 
b A l AAAb 1 Ab 


AGGTAGAGAG 


AGTGAGAAGC 


CATGAGAAAA 


AAGTATCTTT 


17280 


TTTTAi I lAb 


1 lb 1 1 Ibb 1 A 


ATTTTTACGG 


CTGTCGAAAT 


AGAAPCAATT 


CAAACATTGG 


17340 


CGAbAbbl A I 


1 lAlAAlbbb 


TTAGAACCAA 


CGGAAGAAAT 


TPAArPAAAA 


CCAGAATCAA


17400 


TCAbbbbibA 


ppaaaapTPa 
bb AAAAb 1 bM 


GAACCGGAAC 


AACCTGCGCC 


AACCCCTCCG 


GAAGATCAAA 


17460 


AAACTACGGT 


1 bb I I b lbAA 


GAAAAAAAGC 


CTTCACCCAA 




AAAPrTGTGG 


17520 


agaaaccggc 


AATGTbAAAA 


GAAAAAAAGA 


AACAAACAAA 


7i P 7A a 7A P 7\ 71 C CL 
AbAAAbAAiw.o 


PPTPATPPPT 
vjv_ l k^rtl ^^v— 1 


17580 


TACAAGTGAA 


TAAAGATTTT 


GTCTTTCCTA 


ACGGACTTGG 


1 bbAbbbbbb 


P.ATP.ZiAP,AAT 


17640 


CTGAAAGCTA 


TTTAGCAGCG 


GTTGGGTACT 


TTCTTTCAGG 


AAAlbbAl lu 


TATP.P.P.AAAA 


17700 


aaatggatgt 


GAGAGAAGTC 


TATGATATTG 


ATGATGTTTT 


1 AAA 1 AAAb 1 


pp.aTa A A AP.A 


17760 


aaaatcgcct 


TATTTCAGTT 


ACTGGAAAAT 


GCACCGATGT 


T&TAPZiPTAPTP 
1 AAb Ab AU 1 


A AAAP.AA APP 


17820 


GTAATGCAGG 


AGTTAGAA1 I 


ATCTGAATTT 


ATTGTGAACA 


MArtbbb 1 v^ovv 


TP A ATT A A AT 
1 onnl innni 


17880 


GCCGATTTTA 


rprpoA TV rp rp rp /— > 

TTGAATTTGG 


TTTATCAGAG 


GATTTTCAAA 


TT PTP a PPP. 7A 
1 1 b 1 bAbboA 


TPP.TATTTTP 


17940 


TTAACTCTTA 


ATGAATbAGG 


AACGGAAACT 


TCCGCTACCT 


TPaTTPaTTP. 
1 b A 1 1 b A lib 


PT AT ATT AAA 


18000 


GAGTCGTTAC 


GTTTTGCTAT 


GTTTCAAGAT 


TTCTTTTTTC 


aPPPTTTTP.a 
Abbb 1111 


TTPTGTGAAT 
X ivviui onni 


18060 


GAGTTTGCGT 


rp 7\ r-> 7\ "A /~> T\ ^ rp rp 

TAGAACACTT 


TGTTAGTCAC 


ACATTGGCTT 


aTaap.PAP.TT 
A 1 AAbbAo 1 1 


PA APP, A ATT A 


18120 


AAG AAAAT G L 


rp /"■/-• 7\ TV TV TV pip TV 

1GGAAAA! I A 


TCAAATTACA 


ATTAATAAAG 


j* tv TT TP APT T 


AGTGGGAAAT 


18180 


G AAAb b AAT T 


TAGbbbAAb 1 


AACAACACTG 


GTTTTTTTAC 


AATTATAPPA 


APGTGATTTT 


18240 


TCATTGTACG 


prpTi TvTV TV TV TV r*^ T 1 

GTAAAAAAGT 


TTTAGAGCAG 


ATTCGGAACT 


TTPaPPTATT 
1 IbAbv^lAl 1 


TTTTTPTPtAG 
1 1 1 1 Ivyi 


18300 


71 tv rpmipp T\ rp/"» 

AAGGTTbATC 


rprp t\ 7\ rp f tv f^ff 

TT AAjL G Abbb 


TGCGGGCTAT 


GCACAAGCTA 


aATPTTTTPA 
AA lul 1 i 1 Urt 


TTTTTTGGCG 

X 1 X X X X O Vj\_/VJ3 


18360 


GTTTCTTTAG 


1 AbGAAbbbA 


GCAACAGTTT 


TATGTAGAGA 


A A A a APTAP.A 
AAAAAu 1 n\3r\. 


TTPGPAAGTG 


18420 


1 1 AAAAbAAl 


L AbbbbblbA 


CAATCAGCAG 


AATGTTTTCA 


ATTP.P.TTAP.A 
Mil 1 1 nun 


GAGTCTTCAA 


18480 


A Tbbb AbAAb 


bAbAAbbAAb 


AAACGAAGGC 


AATTATTTAC


TPTT ATTTTT 
Ivvl lnl 1111 


AATTGCTGAA 

rlfi J. i. VJV.I A VJ*"i 


18540 


bnu 1 bbC 1 


AAblAbbi lb 


AGCAGCACTT 


TGCCAAAGAT 


TTGPAGAAAT 


TCAACAATTG 

X blxiiw»Xik A -** ^fc-* 


18600 


AA 1 bb 1 bAb i 


1 La 1 lAAbblA 


rprnrpT\T\7\ t\ 7\rpT\ 

TTTAAAAAI A 


bAA 111 bAbb 




AGTCCTGCAA 


18660 


uu 1111 bAAb 


aaaa aPTfifiP 
AAAAAb 1 AAV., 


AATCGTACAT 


TTTAAATTGT 


TPPATTTTCC 


AATAGAAGCA 


18720 


AACTATAGTT 


ATCTCAAAAT 


GGATATAACC 


TACTTTGCAG 


AAACGTATAC 


AGAGTATTTT 


18780 


TCTGCTAGTC 


GACAATTTAT 


TCAAGAACAA 


AAAACGAATC 


CCGCTGTTTG 


GGATGCCAGA 


18840 


CGGTTTTTAT 


TTTATCGGTA


TCTGTTGTTA 


ATGGTGGATG 


CGTTACCATT 


GAAAAAAGTT 


18900 


TTGGCTCCGA 


TTACATTGTG 


TGTCGACTTT 


TCCTTTGGTG 


AAGCCTATAA 


TCGTATGATA 


18960 


CAAaAaAATA 


TTGAAAAAAT 


TACGGAATGG 


AATATCATTT 


TCAAAGCACA 


TGTAGATGAA 


19020 
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AGTGTACAAT TAATTTTATC GGATATTCAT TTTGTCGATT GGCCTGAGAT TACTCAAATT 19080 

ATtTGcTAGC GCCACCACGT CCTGTTGATT GGXCGCATTT TATTAAAACG CTGAGTGCAC 19140

TAAGACCGCA CTCGGGAGAA GAGAAAAAGA AATCATCGTA GACTAGACGG TGATTTCTTT 19200 

TTCtTTTAGA ATATAAAAAT TAAAAATAAT AAAAAAGTTG AAACTTTTTA TGCTTGTGAT 19260 

TCGTCTATAT AAGTGTACAG GAAAGGAGGT TCCAGAAATC GATGGTTTTC GATGGTTGGA 19320 

AAAGAAAAAA ACAAATTCGC GCGTTAGAAA CGCTTGTGGA TACCAAATAT GAAAAAATGT 19380 

ATCGCATTGC CTTTAACTAT ACACATTCGA AAGAAGATGC CTTGGATGTG ATTCAAGAAA 19440 

GCTTTGAAAA GGCACTGAAG GCCATCCAAA AAGGACCCGA TATTAAGGAT TTTGACGCGT 19500 

GGTTCTTTCG AATTCTGCTT TCTGTAGCGA CAGATTATTG GCGTAAAAAG AAACGAGAAG 19560 

CTGTTCATAT TGAAAAAGAT GAACATGTTT TAGCCCTTTT AAATAATCCC CAACGCAGTT 19620 

TTTTAGAAAT TGAAGAAATT ATTAATCAAT TAGATAGTCC CAATCGTGAA ATTATTCTAT 19680 

TAAAATTTTT TGAAGGATTT ACGCTAAAAG AAATTGCCTT GATCCTGGAT ATTAACGAAA 19740 

ATACGATTAA AACGAAGATG TACCGCAGTT TGAATGAACT CCGAAGTCTA TTGAATTAGT 19800 

AAAAGAAAGG AAGCGAAAAA ATAATGGAAG ATTTTGTAAA AAGTGTTGTT AAAAACTTTT 19860 

CAAAAGAATA CCGCCAACAA CCGTTGCCAA CGGATCTACA AGCGGAAGTC AAAGCGCGTT 19920 

TTCATAAAGA GAAAAAGCGG TACACCTGGC TTCGTTTTAG AAACCGCTCA CTGCAAAGTA 19980 

GTCTAGTAGT GGCTTGTGGC TTTGTCTTAT CAGTCAATTT GTTTCCAGGC TTTAGTGAAG 20040

CAGCCCGAAA CATTCCTGTG TTAGATAAAA TTGTTCAATT AGTCACCATC AAAACGTTGA 20100 

CAGCGAAAAA ACAGGAAAGT GAAGTAAACA TCGATGTCCC TAAAATTCAA ACATCCCAAG 20160 

AATCTTCAGT AGCCGACACG CTCAACAAAA AATACTTGAA AGAAGCCCAA ACAGAATTTC 20220 

AACAAGTCAA AGGGCAATTG TTTGATGGTT CTCGTGTTTC AGTCACTGGA GATTATGAAA 20280

AAGTCGTGGA CGACCGTCGC TTCCTAGTGG TCAAACGAAC CTTCACAGAA ATTAAAGGCA 20340

GCTCTGCGAC GACAACGAAA TATGACACCA TCGACAAACG AGCCAATGTT GTCGTTTCAC 20400 

TCCCCTTAAT TTTTAAAAAT GATTTTTACA TTGATGTCAT TAGTAGTGAA ATTAAACAAC 20460

AAATAGCAGA CCAAATGAAG AACGATAGCA ATAAAATATA CTGGTCCGAA AAAGAGGATG 20520 

CCCCCAGTGA TGTTCAACCC TTCAAAAAAA TTAAAAAGGA CCAACCTTTC TATATCAATG 20580 

AAAAACACCA ATTAGTGATT GTGTTTCCCC AAGGAGAGAT TGCGCCTTAC TATATGGGCA 20640

CTCCTGaATT TGTGATTCCT GATCAAGTCA TTGAAAATGA ATTGGCAGCG CCTAATTATT 20700 

TAAAATAAAC tCAGTAAAAA ACACGCGACA TTTCTTAAAA CTGTCGCGTG TTTTTTGCTG 20760

ACCTAACCAA GTCAGTTGCG GTTTTACTGA GAGAAGCGTA CACTAGAATT AAGCATAACA 20820 

AGAAAGGGCG AATGGCAATG GaAAAATCAG AAAAAATTAG TATTTTACAA GATGTGGTTA 20880

AAATTAAATC TGTAAATGGa AACGAAGAAG AAGTGGcAAT CTATTTACAA AATCTTTTGA 20940

AGAAATATGA GATCCCTTCT GAACTTGTTT CTTATGCACC AAATCGTAGT AGTTTGGTTG 21000 
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CTTATTTAGG 


TGAAAACCGA 


GAAAAAGTCT 


TAGGGTTTAG 


IbGCLAbAIb 


bACGTbbT i i 


21060


CTGAAGGGGA 


CGAAAGTCAA 


TGGACTTTCC 


CACCAT1 I bb 


bbbbbAlAl 1 ., 


bAAbbAAAlA 


21120


AATTGTATGG 


CCGTGGCGCA 


ACGGACATGA 


TV TV TV /"* ^ f> fT* fTt 

AAAGCGGLli 


tv /Tp /*•>/— A ATP 

AG IbbbAAl b 


bill I AbLAA 


oi i on 
Z X 10 u 


TGATTGAATT 


AAAAGAAAAG 


AAAGTCCCAT 


TAAATGGAGC 


GGTTAAATTT 


rp rp TV rrt TV z*> Tv z* 1 

TTAGGTACAG 


Z lZ*i U 


TTGGTGAAGA 


AGTCGGTGAG 


CTAGGTGCTG 


GCCAATTAAC 


TGAAAAAGGC 


TATGCGGAT, b 


21300


ATTTAAGTGC 


CTTAGTCATT 


GGAGAACCAA 


CCAATTATAA 


CTTGATGTAT 


GCGCACATbb 




GTTCAATTAA 


CTACTCCGTT 


GTGTCACACG 


GAAAAGAAGC 


CCACAGTTCG 


TV m TV TV TA 

ATGCCAGAAG 




AAGGCATTAA 


CGCGATTAAT 


AATTTGAACG 


AATTTATTAC 


AGAAGCAAAC 


TfV 7\ TV TV T\ fTl 

C AAC AAAT G b 


o i a fl ri 


CGGAAGTCAC 


AGCAAACTAT 


GAAAATCCTG 


AATTAGGCCG 


AACGATTCAC 


AATGTAACCG 


21540


TCATTAAAGG 


TGGCACACAA 


GTGAACAGTA 


TTCCAGGTCA 


AGCAGCCTTG 


CAAGGGAATA 




TCCGTTCAAT 


TCCAGAATTC 


AGCAACGACC 


AAGTCATTGC 


ATTATTACAA 


AAAATTGTCG 


i. X boU 


ATGAATTGAA 


CAAAAAAGAA 


AAACATCAGT 


TAGAATTAAC 


GATTGATTAC 


AATAAAATTC 


z I /zU 


CTGTAAAAGC 


TGAGAAAGAT 


TCTGCGTTAA 


TTCAAGCCAT 


TCAAGCCCAA 


TTTGATCAAC 


zl /oil 


CATTACCATT 


AGTGACAAGC 


GCTGGGACCA 


CAGACTTAGC 


TGAATTTACT 


AAATCAGACA 


2 lo4U 


ATACATTTGA


TTGCGTTGTC 


TTTGGACCAG 


GTGTGACCAC 


TACTGCACAC 


CAAGTCGACG 


OI OArt 

ziyuu 


AATATGTCGA 


AATTGATAAT 


TATTTAGATA 


TGATTGATAA 


GTATCAAGCG 


ATAGCAAAAA 


ziybu 


GTTATTTAAA 


TTAAAAAAAT 


GTCGCGCCTG 


AATCATAAAA 


ATCAGGCGTG 


ACATTTTTTA 


ZzUzU 


TAAGGTTTAT 


TCCACCACAG 


TCCATTCATT 


TAAGGTTTTG 


CCGTTCTCAT 


CAACTAGCTC 


oonon 
ZzUoU 


TAACCAGCTG 


TTTGTTCCAG 


CAAAATAGAG 


AaACAAGCCG 


ACTTCATTAA 


CGCTTTTTAA 


Z^Z 1 3 u 


GACGATATCT 


TCCGTTGTGA 


CAAAGTCTTT 


AAATTGGCCC 


TTGCGTTCAT 


/*» tv /■>/■"■ Tt tv r™ 1 rp rp rp 

CACGAAGTTT 


zzzuu 


TTCACCAAAT 


CTAGCGGAGA 


AACCGACAGA 


CCCGTCTTTA 


TTTAAAGGGG 


CTTCGGGCA1 


zzzou 


CATTGTCGCG 


CCAGCTTTAA 


TAAGCATTTC 


ATTGCGCTTT 


TGCCAAAAAA 


CAGTTGCTTT 


ZZ JZU 


GCTTTGTCGA 


GTATCAATAT 


AAAAAGGAAT 


TTGGCTAACT 


TCTTTTGTCC 


ATTTGTGGCG 


ZZ JOU 


TGCTTTGGCT 


GGTCTGGCTT 


TCTTCGGCGC 


CGGTTTTTTA 


AACAGAGAGT 


AGCCAAAACG 


zz *i 3 u 


TTCCAAGATA 


AAGGTGACTG 


TTTCTTTATA 


GTTAGTCACT 


GTCGCTACCG 


CACTTTTCbb 


zzouu 


AATTTTGACT 


GTCGTTGGGG 


TTGCTTCCAA 


TGTTAAACCA 


CTTTCTTTCG 


/-*v tv rn 7\ rp /— , rn Tv 

CATCAIb 1 Ab 


00 S fiO 


GAAAGCTTGC 


GCTAAATAGG 


CAAGTTCTGT 


GTCCCAAGTC 


GGTTTCAATA 


AAATCGAbib 


Z Z OZ \J 


ATCAAAAGAA 


ACTTCTTTAT 


ATGTAGCTTT 


TTCTTCAGCA 


AGTCCTAAAG 


C AACT Ab lib 


Z Z DO U 


GTTATCTATT 


AGCAAGAAAA 


GAATCGCTGC 


CTCTTTTTTC 


GGTAATTTAG 


TTGTTTGTTC 


22740 


AATGATTAAG 


TGGGCCTGAT 


CGGCTAAGTC 


AATGAGTGTC 


TTTGTTTTTT 


TTAGATCAAC 


22800 


GGTTAAATGT 


AAGTCCATGA


AGTCTGCTCC 


TTTATCGTTA 


GTTTCTTCAT 


TGTATCATGA 


22860 


AACAGCGTCT 


TCGTCTTTTA 


ACGAGTCAGC 


GACCTTACAA 


TATTGTTTGA


AACAAAAAGA


22920 


AGCTTTCGTG 


GTATACTTAC 


TCACATACAT


AATTTTTAAT 


GAGGAAAGTA GAGGAACAAG 


22980 
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AATGAAAACA 


CTTGGCAATA 


TTTTATGGTT 


CATCTTTGGC 


GGCTTTTTTG 


p.tp.p.pttatp 

\j L vow 1 InlL 


0 -and n 


TTGGCTATTC 


GCAGGCGTGA 


TTTGGTGTAT 


TACAATTATC 


GGTATTCCGA 


TTP.P.TTTAPA 


9n nn 


ATGTTTTAAA 


TTAGCGGGGT 


TGAbC 1 1 1 lb 


GCCGTTTAAA AAGCGAGTCG 


tptatap.tp.a 


9*^1 fin 


TAGCGGCGTA 


TCTTTGGTTG 


TAAACATTAT 


TTGGCTGATT 


ATCAGTGGAC 


t a rr A TT AflP 


£~ O £. £.\J 


AATTGGGCAT 


TGTATTAGTG 


GCTTGTTTCT 


TTGTCTGACA 


ATCATCGGTA 


1 1 ^^*J 1 1 


23280 


GCAACAGTCC 


TTTAAATTGG 


CGAAATTAGC


GTTAATGCCG 


TTTGGGGCCC 


olul 1 0 1 1 iv^ 


23340 


AACAAATGAT 


TTTTATTTTG 


AATAGTCAGA 


GGTTTCTCAT 


TGACAAATAA 


aappaaaPAP. 


23400 


CGTATTCTGT 


AAATGTAACA 


AATAACGTTA 


CATTTAGTTT 


GGAGTGATTA 




23460 


TGTCAACGAA 


ATTAAGTGTT 


GCGATTCATA 


TTTTAAGTTT 


GATTGAAACA 


pptpptpaap. 


C, «J -J £. V 


AACAAGTGAC 


TTCTGAATAC 


ATTGCTTCGA 


GCGTGAATAC 


GAATCCAGTC 




9 1S80 


GACTAATGAG 


CCAGTTAAAA 


AAAGCCGGCT 


TGATTCATTC 


GATGCGTGGA 


bbbrxrt 1 Mrvrxrt. 


9 *3fid n 

i J04U 


ATACCTTATT 


AAAGAAACCA 


GAAGAGATCT 


CACTGTATGA 


AATTTATACA 


bCbb 1 1 LjMA J. 




TGGAACGAGA 


GATTTTCAAT 


ATTCATCAAA 


ATCCTAATCC 


AAACTGTTCC 




9 17 fin 


ATATCCAATC 


GGTACTAGAA 


ACTGAATTTA 


CTAAAGTGCA 


ACAAAAAATG 


bAAb AAbAA 1 


9 IP 9fl 


TGAAAAGTAT 


CACCTTAGCA 


GATGTGATTC 


ATGAAATAGA 


AGTTAAGAGA 


b A 1 AAb 1 AA 1 




CTCTCTACTT 


TTAAAGTATT 


AAATGTAACT 


ATTTTAGTTC 


TATTTAAAAA 


1 vjArtriOUH i O 


9 iQd n 


ATTTACATGA 


AAGTAGCAGT 


ATTAGGAGCA 


ACAGGAAAAG 


AAGGACAATT 


71 /-'rTirpmfT>7\ ppTi 

AC 1111 Abb A 


9 4 nnn 


GAAGCTAAAC 


GTCGCGGGAT 


GGACGTGACA 


GCCATCGTTC 


GTAATGCCTC 


aaaaaTaapa 


24060 


GATGGCACAC 


CGACCATTGA 


7\ 7\ ty 7\ t\ rp rn rp 

AAAAGATGTT


TATCAATTAA 


CAACAGAAGA 


t at tp a a fSPP. 


24120 


TTTGATGTAT 


TAATTAGTGC 


CTTAGGATTC 


CCTGATGTGC 


AAGATTTCCC 


aaaaTPTAPP 


24180 


CAGCATTTGA 


TTGAGATTTT 


AACCAATCAA


AAAACGCGCT 


rp f** rp rp rp /~* rp /"*■ f~** rp 

Tbl I lb ibb 1 




24240 


GGAACCTTGT 


ATGTGGACCC 


AGAACACACA


ACTCAATTAA 


AAGATACACC 


ATPATTPPPA 


24300 


GCAGAAATTC 


AGCCATTAGC 


TTCGGCGATG 


GGCGAAgCTT 


TAAACTTATT 


GAAAbAAbC 1 


941 fin 


CAAAACATCC 


ACTGGACCTA 


TATTAGTCCT 


GCAGCCATGT 


TTGATGCTGA 


TGGTCCTGCG 


24420 


ACAGGCCATT 


ATGAAGTCGC 


TGGCGAAGAA 


TTAACAACCA 


ATAGTCAAGG 


TGACAGCTAC 


24480 


ATTAGTTACG 


CAGACTACGC 


CGTTGCCATG 


TTAGATGAAG 


TGGAAAGTAA 


TGCACACCCA 


24540 


AATCAACGAA 


TTAGTGTGTA 


TCAATAAaAG 


AAmCAAGAnA 


AATGGGGCTC 


CGTTAAACAA 


24600 


C 












24601 



(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 
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CTTCCATTTT 


GGCACACAAA 


TTTTTmCAAa 


GrATTgAGCT 


TCAGGAAaTA 


ATTTTTTTAT 


60 


CAGGAAATTa 


TTAATTATTA 


CGAAAAAAAT 


GTTCATTTTC 


TCACAATwCa 


TTGgTATAAT 


120 


AACGTTATAT 


CGAAAGGAGA 


GAAATGAATG 


AGTGTAGTAT 


TAGTACTTTT 


GGGTCTATTA 


180 


GGGTTACTTG 


TCAGCGGGGG 


ACTGACTATC 


AAAGCATTAA 


TTAAGAAACA 


ACGGAAAAAA 


240 


CAATATGGCT 


ATGCCGCGTT 


AGCAAGTGCC 


GCTCTATTTA 


TTGGTGGGGT 


TGCCACGACA 


300 


GATTCTTCCC 


CGACACTTTC 


ACTTGATGCA 


GAGAGTGTTG 


AAACCGATGC 


CAATGGCTCA 


360 


GCCACCATCA 


AAGGCGAATA 


CGACGGCAAC 


AGCAAACTAA 


CGGCCAATGG 


CAAGGAAGTA 


420 


AAAACGAAAG 


ACAAAACGTT 


TGCTTACAAA 


GTAAAACTGT 


CCGATGAAAA 


AACGCAAAAA 


480 


ATCACTTTTG 


TTGCTGAGAA 


GAAAGATACC 


AAAGTTGAAA 


AATCAGTCGA 


AGTCACACCG 


540 


TCAAAAGAAT 


TTATTGCGAG 


CATCACAGAT 


TCTGAGCAAG 


AATTGGCCAA


AACAGAAAAA 


600 


GCCTTAGCTT 


ACGCTGAAAA 


ACAACCTTCT 


CAGAAAAACT 


ATGATGAAGC 


CGCAACCTTA 


660 


GTATCTTCAT 


TAAGTCAAGA 


ATACGAGGAA 


TACAATGATC 


GCCTTGAAAA 


AATCAAAGAA 


720 


GCAGTACCTG 


TTGATGAAGC 


AGTTACAACT 


GCTGAAAAAT 


CTAAAAGTAA 


AAGCGATTAT 


780 


CAAGCAGCTG 


AAAAATTAGT 


AGCTGCTGCC 


CCAGTTGGCA 


AAGAAGGATT 


TCAACAACGA 


840 


TTAACGACTG 


TTCAAACCGC 


TATCGCTGAA 


AAAGAAAAAA 


ATGAACAATT 


AGTCGCCAGT 


900 


GCAAcTGCCG 


CTGTTGAAAA 


AGCTGAACAA 


GAACCAACCA 


ATGAAGCTTA 


CTATAATGAA 


960


GCGGTCAAAC 


AAATTGACGC 


TTTAAACTCC 


CCAAATCAAG 


CTTTAACAAA 


ACGGGTAGCC 


1020 


GTTGTTAAAA 


CTCAATTAGA 


TGCCCACAAA 


GAAAAACAAC 


GAAAAGAAGC 


AGAAGCTCAG 


1080 


AAACTAGCCG 


CTGAAAAAGC 


GCAAAAAGAA 


CAAGCCGAAG 


CGGCAGCAAA AGCGCAAgCT 


1140 


GAAGCAGAGG 


CGCAACAAGC 


CGCACAGGCG 


CCGGCGGAAG 


TCGAGACGGC 


TGCTGCGGAA 


1200 


GCACCAAGCG 


GTAACGCCTT 


AATTAAAGGG 


TCACGTAATG 


GAATTTATCA 


TGTACCTGGA 


1260 


AGTAGaTATT


ATAATAGAAC 


TACGAATcCA 


GTAGCTTGGT 


TCTCTACTGT 


TGAAGAAGCA 


1320 


GAAGCAGCAG 


GATACCGAGC 


ACCCAAACAG 


TwAATACtTA 


GCAATAATTA ATCAAmCAgn 


1380 


AAATGGGcTC 


CGTTGTTTAg 


GAGGC 








1405 


(2) INFORMATION FOR SEQ ID NO: 225: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6092 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
TATTGGATTA CAACAGTACT TTTTGTTGCA ATTGGTGGAT TTTTACGTTT TTTTACAATA
TTTGATGCTT CAAACGAATT ATATCTAAAC GTCCAGAATT TTTTAGAAGG CTCAGAAAAC 
TCAATTAGTT TTGACCACAA TTTATTGATT ATCTTGTATA TTATAGTTCT AACAGTCATC 
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GCGCTGACTA 


TTGCAAGGGT 


AAATAATAAA 


CGTTGGCTAA 


GACTAGGAGT 


ATAAACGAGA 


240


CCCACTTCGC 


AGATAGGATT 


CATCTACTTA 


TTCGAGACAA 


TAGATTAAAT 


Z^ 1 T\ 7\ 7\ TV 7\ T\ TV 

CAAATAAAA1 


300


ACAACTAGTT 


GAAACTAAAA 


AAGCATCAAT 


TTTGATACGC 


CCCAAACTGA 


TATTTTtCTA 


360


AACTTTGGGG 


CGTGTAATCA 


AAATACGAGC 


TTCTTAGTGT 


GGGAGTGGAT 


ATTTGAAGGG


420


ATTTTTCTTC 


AACTTTTCTT 


CGATAATTTC 


GTCCAAATCA 


AACCCTAAAT 


TATGAGGGA1 


/i q n 


CATGTAGGAG 


TAAATGAGAA 


CATCAGCCAA 


TTCTTCTTTG 


ATTCGTTCTT 


CT. 1G I I 1GA1 


540


GCCTTCTTCT 


GCTGTTTTCC 


ATTGAAAAAG 


TTCTAACAAT 


TCTGCTGCTT 


CTAAACAAAT 


600


TGAGAGCGCC 


AAATCTTTTT 


CATTATGGTG 


GGGGCGCCAA 


TTTCTTTCAT 


CACGAAATTG 


660


ATTAATTTTT 


GCCATTGTAT 


CCATTTGTTC 


CCCTGCTTTC 


TTATTCATTT 


GATGGCAACT 


720


GAAACCCAyC 


GAACGTTTCA 


CCATCATAGT 


ATTTTAATAA 


CTGTTCTGTA 


GACAATAATG 


780 


AGGAGGTTAT 


TTCTTCCTCT 


TCAACAACTG 


CTTGATTCTC 


TTCTCTATAC 


AACAAATAAG 


840


CCTGTAAAAG 


CTGTTGGTAA 


AGTTTTTTTC 


GTGAAGAGGC 


TTCCGCTGAA 


AGAAACGtTA 


900 


ATTCTTTTAA 


GTGTCCGCGA 


TAGACAGGCA 


AATCATCAAC 


GAATTCGATT 


GGTAATTCTA 


C\ s~ t~\ 


CTACAGTTAA 


CTGAAATTCT 


TCAAAGACTT 


sTTCGTACAT 


CTGCCATTCC 


CTCTTTCTGT 


1020


CGATATTGTT 


TTGGGGTCAT 


GCCCATTTCT 


TCTTTAAAAC 


GTTTATAAAA 


AAAGGCTTTG 


1080


TCTTGATAGC 


CGATTCGTTC 


aACAATCGTT 


TGAATCGATA 


AATCtGTTTG 


m fnf* tv TV ^* m tv 

CTTCAAGTAG 


1 1 A f\ 

114U 


TTTTTCGCCA 


ATTGGATTCG 


GTAGGCTAAG 


ACAATTTCCG 


TAAACGTTTT 


GTTTGTTTTT 


1200


TCTTTTAAAA 


AATTCGATAA 


GTAATTCGGA 


TGAAAATGAA 


ATTTAGCGGC 


GACTTCTGCT 


l<£fc)U 


AAAGTGACAT 


GCGCAATCTG 


TTGCTCAATA 


TACGCTAAAA 


TCACCTCGAC 


ACGTTGATGA 


1320


TTCGACAAAT 


AATTATCTTG 


CATCTTTAAA 


GCTTCTCTAG 


GCAGTTGACC 


TAGTTCCACC 


T O O f\ 

1380 


ACTAGCAAAA 


CAAAGGCCGC 


TTTGGTCACT 


TTATTATTAA 


AATAGGCCAT 


TTTCACAATT 


1 A A C\ 

14 4 U 


TGTTTTAGTA 


ATAATAGCGA 


ATAGACGTGA 


ACATCCGAAT 


TTGGCTGGCT 


GGCAAAAACG 


i c f\ r\ 


AAATATCGAG 


GAATGTTTTC 


AAATTCACTG 


CTAGCAATTT 


CTGTAAAAAA 


ACGATACATG 


i ten 


ACTGGTTCTT 


CCGCCAGCTG 


ATTTAAGAGC 


GTATCAAGAA 


AATAGTGCTT 


TTTAAAATAA 


icon 

1 OZU 


AATATAATCA 


TAGACATTCC 


TTGCCCCTGA 


TAGCTAACCG 


AGGTGGCCTC 


j\ m rn T\ TV 7\ rn /—» 

ATTGAGAA i G 


1680


ATGACTTGGC 


CAGCCGAAAA 


CGAACACCTT 


TGTTGATCCA 


AATAAGCATA 


GCCG TG I IGG 




TCTAAACAAT 


AGAGCATCCG 


AATAAATCCT 


TTCGGAACGT 


CAATGGTTTC 


CGAAGA I GGG 


i Ann 


AGCATAAAGC 


CCGCAATTTC 


TCGATGTGGA 


TTTTCCTTAA 


ATTGCGCTAC 


TTCATTCTCA 


i r fin 


AGAGCATTGG 


TAGTGGTTAA 


ATAACAATCT 


AAGGTCTCTA 


AAAACTGATA 


AAATGCTGCA 


1920 


TATTGTTCTT 


CTTTGCTTGC 


CAATTAATCA 


CCTCATAATC 


CAACACTTTT 


TACTTTACTC 


1980 


CCTCTATTTT 


TTTCTTTGGC


GTTTCCTTCT 


ATACTAGATA 


CATGGTAACA 


CGAAAAAAGA 


2040 


GATTCAACAG 


TTTTTAGGAG 


GAATTTTTTA 


TGAAAATTAG 


TTTTATTGGT 


GCTGGAAATA 


2100 


TGGCATCCGC 


TATTGCCAAA 


GGTGCGCTAA 


AAAAGCAATT 


TATTGCCGCA 


GAAAACCTTT 


2160 
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ATTTTTATGA 


TATTCAAGTC 


GAAAAAACCG 


CCGCTTTTGC 


ACAAGAAATT 


GGTGCACACG 


2220 


CGGTTGCTTC 


GCCTCAAGAA 


GCTATTACAG .TAGCXGACTT 


AGTTATTTTA 


GCAGTGAAAC 


2280 


CGCAATTCGT 


TCAAGCGGCT 


TTACTTGAAG


CAAAAGAAGC 


AATTTTAGAC 


AAGCAACCGT 


2340 


TAATTGTTTC 


CATTGCGGCA 


hgtapgacta 


TTGPAGAATT 


GTATCAACTG 


TTTGAGACTA 


2400 


CCCAACCGCT 


TCGTCTCGTT 


pp.tp.tp.atpp 


PPAATATPAA 


TGCGTTGATT 


GGCGCGGGTG 


2460 


CAGCAGCTGT 


CTGTGGTAAC 


P.P T T TT^f A T 

\JKs 1111 Ovrt 1 


PPPPTP.AAGA 


CATCCAGACT 


GTTTTATCGT 


2520 


TGTTTCGAGC 


AGTTGGCCAA 


ppptp,p.p,aat 


TAGAGGAAPA 


GTATTTTTCG 


ATTTTTACTG 


2580 


CTATTGCAGG 


TAGCACTOCT 


P. P P T A TP. PT T 


TPTT ATTTAT 

1 L 1 1 A Jl 1 1 ii 1 


TGATTCGATT 


GCCCGAGCAG 


2640 


CCGTTAAAAA 


TGGTATGAAC 


a a apap.TTP.p 


PPTTAP.AAAT 


TGCCACACAA 


GCTGTACTAG 


2700 


GCAGCGCTCA 


AACTTTGGCA 


a a T H P. P a P A P. 


aPAATPPTTP 
ncnn 1 U<« 1 1 vj> 


GACTTTAATC 


GACCAAGTCA 


2760 


GTTCTCCTGG 


TGGAACCACG 


PTTPPTP.PP.a 


TTPTPP.A ATT 


AGAAAACAAT 


GCGTTTATTT 


2820 


CAACAGTCAT 


CAAAGGGATT 




tppttppppa 

1 H 1 1 lOUbn 


TCAAGAATTA 


GCCAAACAAG 


2880 


ACTGATAAGA 


ATAATTACAT 


PBPPP aaTa a 


aTaaraaazifi 

n x rlrVVjrt/\f\rv\ 


GCGCTGAAAA 


TTCAGCGCTT 


2940 


TTTCTTATTT 


TAGTTGAATG 


apfpn apppaa 


aaTpaTpaPT 

nn 1 uri 1 LAI 1 


GACTATCGGT 


GTTTTCCTAC 


3000 


CATCAAAAAT 


TACTTCATAT 


TTTPPTaPPT 
111 lb inLlj i 


PPP a PTPTTT 
ullnl iol 1 1 


TGGAACAAAT 


GCGTAGTCAA 


3060 


TCCGCAAGGC 


TGCTTCATTT 


TPTTPPPaPP 


LnlUlnl 111 


TTTCTCTACT 


GTCGCTTCAC 


3120 


CGCTTGTCTC 


TTCCGCTACA 




PTTP.aaTPP.P, 
111 unn 1 luu 


CAAGTAACTA 


TTTTCCACTA 


3180 


ATTGATAGCC 


AGTTGTACCA 


bl 1 Ijob 1 lbl 


TTaaaTPPPP 


TAAAATGACT 


AACGGTTGTT 


3240 


GCCCCGTCGC 


TAAGTATTTC 


TPT A ATTPTA 
111 r\e\ x X \J 1 r\ 


PPPATTPATA 


GGCAAAACCT 


GTACAGGGTG 


3300 


TTTCCCACCA 


AGAAAAGTGA 


PPAPAPAPAA 


PAGTAAGPGT 


ACCTTTTGAA 


CTCTCTGTCT 


3360 


CCGCTACTAA 


GATTTTTCTT 


P.TTPGATAAT 


TPGTTPrPiTTP 


TTGGCTTTCT 


GAAATAAGGA 


3420 


GAGCTTCACT 


AGTCAACGGA 


PATTTPGAAA 
ill voruw 


GTAGTGCGTT 


CCCTTCTTCA 


TAAATAGCAT 


3480 


AACCAATGTG 


GCTCATTTCC 


CAACTCCAAT 


AATAGTGTTG 


GCCTTTTTCA 


GCTAGATATT


3540 


GAACGATTAG 


ATATGCAAAA 


TTATPTTPAT 


GGATCGGCGT 


TTGGGTCGCT 


ATGGGACAAA 


3600 


AAGTAGTTAA 


CTGTTCTAAC 


GGPAPTTTCT 


TGGAAGCGAC 


"TTTCTGATTC 


ACTTCTTGTA 


3660 


AGGCAATTAT 


TTCACTTTCA 


PAAP.ATAAAA 


TPAPPTTGGP 


AATCTCCTCT 


AATTTTTTTA 


3720 


ATGGTTCTTC 


TTCTAACCAG 


CTATGAGTAT 


TGATTGTTAA 


AAGATTCATC 


TTTTCCCCTT 


3780 


CTCTTACATG 


TTATTAGGTC 


TGTGAGTGGG 


ACAAAAATCA 


CTTTGGATTT 


TTGCTTCACG 


3840 


CGCAAAACCT 


GATAAACGGC 


GGGAACAGAA 


CCAACTcCTT 


CGGAAATAAG 


CCGAAATTAT 


3900 


CCAAAAATTA 


AAAAGCAATT 


TCCGGAAATT 


CCTTCTTATT 


TCTCGGAGTT 


AAACGGTTCT 


3960 


GTCCCGACCT 


CATACCTAAT


CATTTAAATT 


TCAATCTGTC 


CGATCACAGT 


TCCCGCAGTG 


4020 


GTGGTTCCTA 


GTTGATTAAT 


AACTACTTGT 


TCCACTTTGT 


CGCTGTTTGT 


AAAGGCGACA 


4080 


ATAATCGTTG 


TTTCTTTGCC 


AGCTTGTTCA 


ATCACATCCA 


AATTCATTTT 


AGCTAGTTTC 


4140 
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GTCCCAGCTT 


TCAATGATTG 


TCCTTCTTTA 


T\ f-* TV ^ TV TV /"* 7\ fTl n> 

ACAAAAGATT 


G AAAAG LAG G 


TGTTGCCATT 


42UU 


TCAACTGTAT 


CAATTCCCAT 


ATGGACAAGT 


ACTTCAATAG 


Gl lGi 1GIGI 


rprp/Tp7\ /*"» A"* TV 

I TGTAGCCi^A 




ATCGCATGTT 


TGCTTGGGAA 


AACACTTAGA 


ACTTTCCCCG 


AAATTrGTGC 


CACCACCTCT 


4320


CCTTCTAGAG 


GCTTAACTGC 


AAATCCATCC 


CCCATCATTT 


m m m /■** m /"■* 7V n t\ "a 

TTTGTGAAAA 


GACAGGATCA


/i t o n 


TTGACTTGCG 


CCATTGGAAT 


AACTTCCCCT 


GTCGCTACAG 


CCACGAAATC 


TTTTTTCTTA 


A A A C\ 
H 4 4 U 


CCTAAGAAAT 


TCGTTGTTGG 


CACTCCTGCT 


GTACTTTCTG 


rrt ta m J\ tv ^> rn /"* 

CTATAACTGG 


TTCTGGAATG 


4 0 uu 


TCAACGCCGG 


AAGCTAGTAA 


ATCTTCAATA 


TCGGATTTTA 


AAACATCGGC 


TTTTGGTCCA 


4 o bU 


TAAACGGCTT 


GAACGCCATT 


ATCTTTCACA 


ATCAAGCCCA 


TTGCGCCTGC 


CCGTTTCCAC 


/icon 
4 o2U 


GCCTCTTCCG 


AACCGACTTT 


TTCTCGATCC 


TTTACACTGA 


CACGTAAGCG 


TGTCATACAC 


A CO C\ 

4 bo U 


GCATCAACTT 


CTTTGATATT 


TTGTTTACCA 


CCAAGTAAGT 


AGACAATTTG 


CGCGATTTGC 


A ^ A f\ 
4 f 4 U 


TGATCGACGA 


CGCCTGAACC 


AGCCGCGCCA 


GATGCGATTT 


CTTCGCTGTT 


ATCATTGTCA 


4 oUU 


TAGTTTCCAT 


TTCTTCCAGG 


TGTTGCGTAA 


TTAAATTTTT 


TAATTAAGAA 


ATTCGCTAAG 


4 o bU 


AAATAAGTGA 


CAACACCAAA 


AATGATGACC 


ATTAAAACAA 


AATTGATTAA 


ATCGCCACCT 


4920


AAACCAGCTT 


TTATTGCTAG 


CGGTGTTCTT 


GTTAACAATT 


CAATATTTCC 


AAACGAATGC 


4980


ACACGCAATG 


GTAAAATATC 


AGCCATTGCA 


AAAGCAGCAC 


CTTGGATCAC 


CGCATAAATT 


5040 


ACATAAAGCG 


GCACTGCAGC 


AAACATAAAC 


ATGAATTCTA 


GCGGCTCTGT 


CACACCAGTT 


5100 


AAGAAAACAG 


tAAAGCAGCA 


GAAAAATACA 


TTGATTTATA 


TTTAGCTTTT 


TTATCTGCAT 


5160 


CGACATTACG 


ATACATCGCC 


AAAGCCATCC 


CCATTAAAAT 


CCCTGAAGAa 


CCAATCATTT 


coon 

52*10 


GGCCTACTTT 


GAATCGAGCT 


GGTGTCCAAT 


TTTCTAAGAC 


AAACTGATAC 


TTGCTCATAT 


toon 
□ 2oU 


CACCAGCGCC 


TTTAAGATTC 


ACTAAGTCTG 


TTGCCCAAGC 


CAACCATAAA 


GGATCTTGAC 




CAAAGACTTG 


GGTTCCAGCT 


TGTGCGCCAG 


ATAAAATTTC 


ATACGTTCCA 


CCTAATTGTG 


D4UU 


TATAGTTGAT 


TGGAATCGTC 


AACATATGAT 


GTAACCCAAA 


AGGTAATAAC 


AAACGTTCCA 


3 4 bU 


ACGTTCCATA 


TAAGAATGGC 


GCTAAAATAG 


GTGCACTATC 


TTGAGATTGC 


GCAATCCATA 


r o r\ 


ACCCAAAATT 


ATTAATTCCC 


GCTTGAATAT 


TTGGCCAAAT 


AAGTGCTAAT 


^ t» tv rn m /■■* tv tv 

GCAATTGAGA 


c c q n 


CAATTGTTGA 


CCACAAAATA 


ACGACAAATG 


GGACAAACCG 


TTTACCATTA 


tv tv t\ T\ rn 7A T* A 

AAAAa TG A I A 




GCGCATCTGG 


TAATTTCCGA 


TAATTGTAAT 


ACTTGTTGTA 


AGCCATCGCA 


CCGACAAAAC 


J / uu 


CAGCAATAAT 


CCCTACAAAA 


ACACCCATAT 


TTAATGCCGG 


AGCTTCTAAG 


ACACTTGTAA 


O / DU 


AGAATCCCTT 


CACCATAATC 


TTGGTGCCAA 


ATAAGGTATG 


CGTGAAAGCT 


TGTTCATCAG 


o o 


CCAACATTTC 


ATTTGTTACA 


CCAAAAATGG 


CTCCAGTAAT 


ACGGTTGATT 


AAAACAAAAG 


5880 


AAATCCCCGC 


TGCAAATGCG 


CCACCTGCAC GATCTTTGGC CCAACTGCCA CCTATgTAAC 


5940 


GCAAATAGTA 


AATGTAAGTT


GCCAATAATT 


GCCCAACCGA 


TACTCTCCAG 


AACTCCGCCT 


6000 


GTTGTTACTA 


ACAAGCCTAA 


ATTAGGATCG 


ATTAAAGGAA 


GCGATTTCCC 


AATACTAATC 


6060 


ATTAAACCCG 


CTGCGGGCAT 


TACTGCCACA 


AC 






6092 
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(2) INFORMATION FOR SEQ ID NO: 226:, 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

ATAAACTATC AGCATATTTA TTTAAAGTAC GAAGGCTTAG AAGGAGCCTA GGCGAAAGCT 60 

GAAAATAATG CACCCTCTTA TGAGATATTT CAACAAACGT GCTTTTTGCA CGTTTGTTTT 120 

TTTTTATCAA AATGAAAACG TAGAGGAGGG ATGGAATGGG TGAAGCTAAA GTAACTAAAA 180 

CTTTCCGATT AACAGAATTA GTGGCAAGTG ATTTAAGTTG TAAAGATTTA TTTGATGAAC 240 

TTATTGCCGT TTTAGAAATT TTAGATGTAG AAGTTGTTAA TAAAAAAATA GTTTATTTCT 300 

ATAAAAGAAA CGTGAAAAGT AAACAAGAAA AATTAGAAGT TTGTTTTACT AG AG AG AG T A 360 

TTCTGCACTT AATTGGGATA AGTTATTATG ATATTAATGA AAGAGAAACC TCTAAGTCAA 420

GAATGAAAAG TAAGTATGCT ATTGAATTTT ATAGAGACTT TAAAAGrAAT AAATTGAACT 480

TTTCAAAATG TTGGGTAGAA AGTACAAGAA AAGTAAAAGA TAAGCtcAGG TATTGAAATA 540 

CATTAAAAGC ATAAAGACAG ATGTGGTCAG AATAGGCGCT GACGGACAAT TAAGAACAAT 600 

TCCTATGACG AACACAATCA GTACGCCTAA GATTGGTTTA GGGATTGGCT TATATCATGA 660 

TCATCCAGAA TTCTCAATTC CTCGTTCTTG TTTAAATCTT GCTCAGGATA AAGAAGCGAA 720 

ACAACATACC TCGTTTCGTA ACGCCTGTCG GTGTACAAAA ATTTGGATAT ATGAAAGAAC 780 

AGAACAGGGT ACTTGGAAAT TAGAAGATAG GCAAGAATTT TTTAAAAAAA TACAGGCAGA 840 

AAAAAGTAAG AAAAAAAGAA GAAAAAAGTA AGATTACGTC ATTAACTTAG TTTGTAATTT 900 

GCTATATATC GTCAGCTAAT GATGTCGTAG AATGAAAAAA GGGGTTCTCT TATACTAATT 960 

TTTGTAAAGA ATTAAATTTA TTTTCTATTA CAAGTCCAGT ATAGAACTTA TTACTTACmT 1020 

AACAGTACTT TTAGAAAAAt CAGAAcCAAA TTAAAGAGAG TAGCCCCTTT GnATTTAAAA 1080 

TTTAATAAC 1089 
(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

AGGGTCTTTT ATAaTCAAGG AACAATGAGT GAGAACGATT CTCAATAAaT GTAAAATAAC 60

GACAGAAAAA TTAAAGAGGA ATGGTTATGA ATTATTaGAG GCAGAGGACT TAACTACAGG 120 

ATATGGTCGT CAAGTGATTA GTGAAGCGCT GAACTGTTCA TTTGAAGCGG GAACTATTAC 180 



WO 98/50555 PCT/US98/08985 

1143 



AAGTATTATT 


GGACCAAATG 


GTTGTGGCAA 


m m s> tv rp rp /"* 

ATCGACCT Jib 


1 1AAAAACAC 


1 1 bCACb 1 A 1 


o a n 


TTTGCCAGCT 


GAGAAAGGCA 


AGGTGTTCAT 


TAAGAACCAA 


bAAC 1 1 C AAC 


ATTTTTCTTC 


t n n 


AAAAGAAGTA 


GCGCAAGAAC 


TAGCGATCTT 


TV m f> A r" 1 A TV A P* 

ATCACAAACb 


CCACAAAA 1 A 


CAAlCbAl 1 1 




ATCTGTTTTT 


GACCTCGTCT 


CTTTCGGGCG 


ATATCCTTAT 


CAAlCbbbC I 


bGAAAAb lb 1 


/ on 


CACCACAGAA 


GATCAAGAGT 


ACATACAATG 


GGCATTAbAA 


rp rp 7\ TV /~> T\ r^r^frp 

1 1 AACAbbC 1 


rp a A/^rp/-'A/~>rprp 

1 AAC 1 CAC 1 L 


^1 AO 
H Ou 


AGCTAGAGAG 


TCTGTGGCAA 


TTTTATCAGG 


TGGACAACGb 


OA A P" P" P 1 P" T T 1 1 1 

CAACbbb 111 


CbAl 1 1 Al 


^ 4 n 


GGCGCTAGTG 


CAGGATACAG 


ATATTTTAAT 


TTTAGATbAA 


err" a p* p* 7\ P'P' T 1 
CCbACCACC 1 


Ai 1 lubftlLvv 


QUI) 


AGCGCATCAG 


CTAGAAATTT 


TAAATGTGTT 


AAAAC G AAT T 


AAT CAAbAAL 


Arp/^A^AAAA/*"< 


DuU 


CATCGTGATG 


ACGATTCATG 


ATATTAATTT 


AGCTTCTCGT 


TTTTCTGATC 


bCbibAl ibb 




GATGAAAGAG 


GGAAAGATTA 


TCAATGAGGG 


AACACCGCAA 


GAAGTGATTA 


CTGTTAblbb 


*? q n 


CTTAGCAGAT 


GTTTTCGCTA 


TTCAAGCAGA 


AATTAGTTAT 


CATAGTCAGA 


CACAGCGbCC 


Q A CS 


ACTAGTGCTT 


TCTTATGAGT 


TGATGGAAAT 


GGAGCTGGAT 


GG AT GAAAGG 


T\ rn TV TV TV 

C AT AAG C G G A 


QPlO 

yuu 


TTTTATAGAC 


ACCAATGGCA 


AGTGTTGCTT 


GTCTGTCTTT 


TTTTGCTACT 


t\ TV TV fflP 

GGGGACAATG 


you 


CTTCTGTCAT 


TGATGTTTGG 


TGCGACGTCG 


GTTTCGATTC 


AAACGATTAT 


TGATTCATTT 




ACGCACTTCG 


ATGCGACCAA 


TCAAGCGCAT 


CAAGTCATCC 


AAACGGTTCG 


GTTGCCGCGb 


lUlJU 


ATTTTAGGGG 


CCGCTTTTGT 


GGGGGCAAGT 


TTAGCGGTGA 


GTGGGGCGTT 


ACGCAabbbA 


1 1 

1 


TTACACAAAA 


TCCATTGGCA 


GATTCTGGCT 


TGCTGGGTAT 


CAATGCTGGG 


GCTGGbl Ibb 




GCTTGGCGCT 


TGTTTTTGCT 


TTCCTCCCTC 


AGGCAAGTTA 


TTGGTGGCTT 


TTAGTTGTTT 




CTTTTCTTGG 


TGCCGGCGTG 


AGTGTGGCAC 


TGATTTAwTA 


TCTTTCTAAT 


A rn rp A jlipipr 1 

CAT TCAAl 1C 




GAGGAGCAAG 


CCCCATGCGT 


CTGACCTTAG 


TGGGAGCTGG 


TATTAGTGCC 


f% rp rri mpyifTl rp rp TA rn 

CTT 1111 1 Al 




CATTTAGTCA 


ATTCTTAGCC 


ATTCAGTTCA 


ATTTGAGTCA 


TV /*** T\ m m TV TV r^ 

AGAGTTAACG 


rn rp rn rn^*^*rp rn rp 

TTl 1 bb I 1 1 C 


1 4 4 fi 


TTGGCGGTGT 


CAGCGTTATT 


TCTTGGGCAC 


7\ tv rn rn TV TA TV TV TA rn 

AATTAAAAAT 


AGTCGTCCC 1 


Al CI 1 1 Lluo 


IJUU 


GCGCATTTGC 


ATTAGCAATT 


CTGATTAGTC 


CGTCGGTGAC 


m ta m m m rn tv 

TATTTTACGG 


T 1 IbbCbAi b 


J. jDU 


ATGCAACAAT 


TGGTTTGGGG 


CGAAATCCGC 


AGCGTATTCG 


GTTGTTTGCC 


Ab 1 A 1 AACuA 


J. U £ w 


TTTTGCTTTT 


ATCAGGATTG 


TCTGTGGCGC 


TAGTTGGCTC 


GGTCAGTTTT 


PTTPPTTTPP 

b 1 1 bb 111 bu 


lOOU 


TGGTGCCTCA 


TGTGATGCGA 


ACAGTTTCTG 


^ tv tv t\ tv 7A rn m 7V 

GAGAAAAT T A 


CCGGCGCT1G 


A rprn/^/^i/^fTpm/Tp 

All CCb 1 1 Ci 




CGGCGTTAGG 


TGGTGCGCTC 


CTCGTTTTGG 


m /"•/-»/"* A ^ Tv mmm 

TGGCAGATTT 


AATCGCTCGA 


AT bb i bAA 1 C 


i Ann 


CGCCTTTTGA 


AACACCATTT 


GGGATTATCA 


CGGCCTTAA1 


TbbCATCCCA 


rp rp rp rp rp rp rp m 

11C111C111 


1 ft fifl 


ATTTATTTCG TAAAGGAGGG AAACTAGGAT GACACAGAAA AAAGTGTCTC GTTGGGTGGC 1920

TATCTTAATA GTTGGGATTA TCCTAACTGC TTGTATCAGT TTAACGCAAG GGGAATTATC 1980

TGTTTCCGTC ATGGATTTAA AAGCCCTTAT TTTGGGACAG GGAAGTCAGG CCAAACAACT 2040

GTTAGTTTGG GATTTTCGGT TACCACGGAT TGTGCTTGCG GTCCTAGCGG GCGCAGCTTT 2100

AGGCTTAAGC GGCGCTATTC TTCAAGGGAT TACTCGGAAT CCGTTAGCAG ATACTGGGAT 2160
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TTTAGGTATT 


AATGCGGGTG 


PGGGTTTGGC 

V>>VJvJVJ 111 WVJ" 


AGTGATTTTG 


TTTATTAGTT 


TTGTTGAGAC 


2220 


GACAAGTATCL ACAACGTTTG 


PACTTCCACT 


CGTGGCCTTA 


GTTGGCGGCT 


TGCTAGCGGC 


2280 


TGTTGGGATT 


TTGGTAATCG 


PGTATCATCA 


TCGTTTAGGA 


TTGACGCCAG 


TTCGAATGGT 


2340 


CTTAGCGGGT 


GTCATTTTGA 


GTAPGGGTAT 


TTCAGCACTC 


ACCTTATTAA 


TTACATCAAA 


2400 


AATTGACACT 


GAAAAGTACC 


P.TTTTGTTAP 

U i 11 1 O 1 1 /TkL 


AATGTGGCAA 


GCAGGCAGTA 


TCTGGGGCAG 


2460 


TA ATTGGTTC 
1 /xri 1 1 OO i j. \- 


TTTGTGGGCG 


PTTTPTTGPP 


TTGGCTAGTC 


ATCGGGTTCT 


TTTTTGCGAT 


2520 


P.AAGPGTPAT 
ortrtOLO j. v>n j. 


CGTCTGTTGG 


ZiTATTTTAPA 
nlni 1 i 1 nV<n 


ATTGGGGGAC 


GAAACAGCTA 


TTTCTGTAGG 


2580 


irtll ol luiu 


AAAAAAGAAC 


(jttAltul 1 1 L 


GTTGPTGTTA 
oil ol i o i x n 


GCGGTGATGT 


TGGCGGCATC 


2640 


appgatttpp 

rtOV^Urt ill LL 


GTTACAGGGG 


oriM 1 1 rlrt 111 


TTTAGGTTTA 

111 rlU O 111 ii 


CTTGCGCCAC 


ATATTGCTCG 


2700 


PAAATGTGGT 
Onnn 1 o i oo x 


TTTACGCAAC 


AAAHALH.00 1 


ATTGPPATTG 
rtl 1 u^^n 1 10 


GCGGCGTTGT 


TTGGGAGTTT 


2760 


APTTTTATTG 
r\L ill ini iu 


GTTAGTGATA 

W X X A* VJ X W*» A X 4 


PPQTTPPTPG 


GPTATTGPPT 


GGCAATGGAG 


AAATGCCTGC 


2820 


AGGGATTATT 

nOUOrtl irti i 


ATTGCGATCA 


tppptppppp 

1 Loo 1 uLuLt 


1 1 rt 1 11 l L l l 


TATTTGTTAA 


TCAAAACAAG 


2880 


PGATTCGTTA 


TAAAAAAGAA 


T P 31 21 21 Zl P P T CZ 
1 LAririrloL 1 o 


GPTPAATPGG 

OO 1 L.rtrt 1 LOO 


TCGTTGTGAA 

1 VafO 1 A O X WiTfl 


AGCAACTGTC 


2940 


ATTTGCTCGG 


CTTTTTTATT 


PPPPPZ1TTTT 
ooLooi-Vl 111 


TGAAAAAATA 


GTATGATAAA 

O 1 * \ X Vjn x nnn 


TGCGTAAGAG 


3000 


GATTGAATGG 


AGGATGAAAG 


naTPPa A aap 


i 1 lrirvi 1111 


ATGTACCAAC 


AGACATCCGT 


3060 


TTTGGGAAAG 

111 uOvjrinnu 


ATCGTTTAGA 


PP21 ZITTAPPT 


GPTGTGTTAG 

o L 1 O l o i i nu 


ATCAATTTGG 


CAAAAATGTC 


3120 

i 


TTAGTTGTTT 

1 1 liV 1 1 O i J- A 


ATGGTGGCGG 


T 21 P T 21 TP 21 


PGTAATGGPT 
Lo 1 r\r\ 1 oOL l 


TGTACCAACA 


AAT TACT GAT 


3180 


TTGGCCCAAA 


AAAACGGGCA 


T21PZ1PTPZ1TT 
1 fiLnL 1 on 1 1 


GAGTTGAGPG 

orto 1 l unu\y\j 


GTGTCGAACC 


AAACCCGCGC 


3240 


ATTGAAACCG 


TTAGAAAAGG 


AATTGAGTTG 
rxrt. 1 1 orto 1 1 o 


TGTAAAGAAA 


ACCAAGTCGA 


TGTTATTTTA 


3300 


GCAGTTGGCG GTGGTTCAAC CATTGATTGT TCCAAAGCAA TAGCTGCGGG CTTTTATTCG 3360

GAAGAAGATA TCTGGACAGT TATTGCAGCC AGAAAAGGGT ATACTGGTCC TGCTT1AllA 3420

ATCGTGACCA TCTTGACTTT AGCAGCGACA GGTAGTGAAA TGAACGCAGG TGCTGTCATT 3480

TCTAACTTAG AArCCAATCA ArAATTAGGC TTTGGTGGTC CGAATATGAT TCCAAAAGTT 3540

TCGTyCTGGG ATCCAACGAA TACTTTT 3567


(2) INFORMATION FOR SEQ ID NO: 228:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7893 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 
TTTAGTGGTT TGTTGATGTT TTTAAGTAAA ATTGGCACAT TGTTTGGCTG GCTTTCACTT 60

CTGGCAGCTG TTATTGGCTT TGTTATTTTT TATAAACGAG CAACGACCGC TGAACAGCCA 120

CTAGTTCGTT TGACAATTTT AAAAAATCCC GCCTTCGTTT TATTTTTATG TGGCTTTTTA 180

GTTTGTCAAT TCTTATTGTT AGGTATTTCC TTTGTCTTAC CAAATTTTGT TCAAATCGTC 240

CTTGGAAAAA ATGCGTTTGT AGCTGGGCTA GTCATGCTGC CAGGTGCTAC AGTTGGGGCC 300

ATTTTGGCAC CACTTTCTGG ACGAGTGTTG GaTCAATACG GCGCTAAAAA ACCTATATTA 360

TTTGGCTTAA GTTTGGCGAC AATCGGTTGG CTAGCTTTAA CAATTTTACT AGAGATGCCT 420

GTTTTATTAG GATTCGTCGC TGGACATGTT ACCTATATGA TTGGTCTAGG TTTTGCATAC 480

AGTAATATGA TGACGACTGG TAtGAGCTTG TTAGAGGAGA AAGATTTTGG CGACGGCAAT 540

ACGTTATTTA ATACACTTCA ACAATTTTCT GGCGCTATTG CGACGGCCAT TGTAGCGACG 600

ATTATTAATA TCGCTCAAGA CCACGCAGAT AATTTTGCTC AGGGGACAAC GATGGGCTCC 660

CTCATTTCTC TGATTTTCTT GTTAACCTTA TTAGTAATTG TTTTAATTGC TTGCTGGAAT 720

TATTTCCGCA AAAAAGCTTA AAAAAAAGCA GCTGGTCAGT TCCATAGTGA CCAGCTGCTT 780

TTTTTAGTTT TCTAAAGATA GTTGTTTTTT TGTTCTTCTT GCTTCCGAAT ATAAAGTACG 840

AACAAGACAG CAATTAGCAC GATAAAACCA CCAAAAATCG GTGTATATGT TACTGATAAA 900

TGATCTGTGA CCACCCCACC TGTCATTGAG CCGACGGTAA TCCCAATATT AAAGGCTGCA 960

ATATTAAAAG CTGAGGCCAA AGTGATATCT TCTGGAACAT ACTTTTCCGC TAATTCTACA 1020

ACATATAGTT GTAAACCAGG TACATTCATA AAGGCAAATA GGCCCAACAT CAAACTTGCT 1080

AATAAGCCTA GAAGACTGTT ATCCATTAAA ACAGTAATGA ATAAAAAGAC TAAGGACAGA 1140

ATTAATAAGC TAAACATTTT TACTAACGAA TCTAAAGGTT TTTTATTTGC CCAGTGACCA 1200

CCTACTGTAT TGCCAaTTGC GACCATTACG CCaTAAaCAA CTAAAATAAT CACTACCGCG 1260

TTGGCTGaAA aGCCTaAtTT TCCmwTAACG GTGACAgTAC GTATATGCAG CAAACGTTCC 1320

ACCATAACCA AAAGCAGTAA TTAGAAAAGA GAAAACGAGG GGTTTGTTCG TAAAAATCCG 1380

CGTGATTCCT TTTATGTCAA CCTTCCCTGG TATCGGTAAA TTTTTGGGAA CTAAAAAATA 1440

ACTAGCAATC AGTCCCACCA AGCCAATACT GGCTATAAAA AGAAAGGACA TATGCCAAGC 1500

TGTTTGTTGC CCAATAAACG TCCCTAAAGG TACGCCAGTC ACTGTTGCAa CCGTTAAGCC 1560

CGTAAaCATA ATAGCAATGG CGCTGGCACG ACGTGATGGC GGCACAACAT CTGCGGCGAT 1620

GACAGTCGAA ACAGACATGA AAATGCCATG CGCCAATGCA GAAAGTATCC GCCCTACCAA 1680

TAAAATAATA AACGTTGGCG CAAaCGCAGC AATTAAATTC CCACTAATGA AAAGAAGCAT 1740

AATCCCCAGC ATTAATGAGC GACGATTCCA TTTTCCTGTC AAAATTGTTA ATAACGGCGC 1800

CCCAATCGTC ACACCTAAAG CGTACAAAGA AACAGTCAAA CCTGCTTGGG AAAGACTAAT 1860

ATGAAAACTT TTAACAATCA TTGGTAATAA ACCCACACTA ATAAATTCTG TTGAACCGAT 1920

TGCGAAAGCA TTGATAGCTA ATGCAAAGAG CGTTAAATTT GGTGAAATTT TCTTTTCCAT 1980

TCTCTTCCTC CTAAAAAACA TTTGAATTTC AACAAAAATT ACTATATACT AAGATTGAAA 2040

TTTTGATAAG TACCTACTTT TTTGTATCAT AGGTACCTTT TAGTAACTTT TGGTTATTAT 2100

CAACCTATTA AGGAGGAATG TTTTTTGGAA ACTCGTACAT ATGCGATTGG TGTCGAAGCA 2160


ACGATGGATG TTATTGGCGG TAAATGGAAA CCAATTATTC TGTGTAATTT ACGGCACGGC 2220

GCCATGCGCC CCAGCGATTT AAAACGCGGT ATTACTGGCA TTAGTCAAAA AATGCTGACA 2280

CAACAATTAC GGGAACTCGA AGAAGACAAC ATTATCGAGC GGAAAGTTTA CAACCAAGTT 2340

CCACCAAAAG TCGAATATTA TTTAAGCGAT TATGGTCAAA GCCTCTCTGC TGTCTTAGAT 2400

AACTTATGTA ATTGGGGAGA ATCCCACGTG GCTCATTTGC AAGAACAAGG CCAAGAAATT 2460

TATTTAAAAT GTGAAGATTA GTGAACAAAA CGTTTATAAA CTATATTAAG AGGGAGCTTT 2520

AAGCTATTTA AAGCCTGGGG AAAAATCATT TTAGATTTTT CCCCAGGCTC TTCTTATGCT 2580

TTAATAACTT GAATATCTTG CGTTAATTGG GCCCAATGAT TAGTAGGTCC GTGACCATGT 2640

CCAACAGAAA TTGCTTGGCT AATGGcCCCT TGTATAAACT GTTTGCCAAT CACAATCGCC 2700

GCTTCTAACG GCGTACCTTT TGCTAATTCG GCTGCGATAC ACGCTGAAAA AGTATCTCCT 2760

GTTCCATGAG TATTTTTCGT TACAATTCGT GGTGCACTTA GCCAAAAGCT CGAACCGTCT 2820

TCCATTAACA CATAATCAGC CGCTTGTTGA TTTGTGCTAT GTCCGCCTTT CATAATAATA 2880

TTTTTTGTGC CCAATTGCTG TAATTTTTTT GCTGCGGTCT GCATTTCTTG TTCTGTCTTG 2940

ATGGTCGTCT CTAGTAAAAC TTCCGCTTCT GGTAAATTAG GTGTCACGAC CGTTGCTAAT 3000

GGCAACAGTT GCTCTTTAAT CGTTTGAACT GCCTCGGCTT CTAACAAATG ATGTCCACCT 3060

TTAGCCACCA TTACAGGATC GACAATCAAC GGACCAAAAT CAACTTTCTG CAAATTGCGA 3120

ACAACAGCTT CGACACGCTC ACTATCTGCT AACATCCCCG TTTTCGCTGC CCCAATTTTA 3180

AAATCAGCCG CCAGTGATTG GAATTGTGCA TCAATAAATG AACTAGGAAT CGGTAAGCTA 3240

TCTTGCACGC CATACGTATT TTGTGCCGTC AATGCAATGA CAATATTTAA GCCAAATGCC 3300

TGCCGTGCTT GAAATGTTTT TAAATCTGCT TGCATGCCAG CTCCCCCACC AGAATCAGAA 3360

CCAGCAATCG TGACTACTTG TGGGGTCAAA TTATATTCGG TCATTTTCTC GCCTCCAACA 3420

TTCGCTCCGT TACACGCATC AACCCTTGAA CTTTCTCGGT TATCTGTTCA GCCAGCATAA 3480

TTTCACTGAC CAGTGAAACA CCAGCCACTC CCGTTCCCAT AAGTTGTTCT ATATTTTCTT 3540

CTTTGATTCC ACCAATCGCT ACAACAGGAA TCGTGACTGC TGCGGCAATT TCAGAAAGTG 3600

TCTGCAAAGA AGTTAACGGA CTGTCCTTAG TCGTTGTGGG GAAAATTGCG CCCACGCCTA 3660

AATAATCGGC GCCCTCGTTT TCCGCTTCAA CTCCTCGTGC CACCGTTTTG GCTGAAACAC 3720

CTACGATTTT TGTGGACCCA ACCAGTTTTC GCACAAGAGC AACCGGTAAT TCATCATCCC 3780

CTATATGCAC ACCTGCTGCA TCCACTGCCA AGCAAATATC GACACGATCA TTAATAATCA 3840

ACGGAATCTG ATACGCATCT GTTACCACTT TAACTTTCAC TGCTAATTCG TAAAAGCGCC 3900

GTGTTGACAC TTCCTTTTCG CGCAGTTGGA CCAATGTCAC TCCACTGCGA CAAGCAGTTT 3960

CAATTCGTTT TAAAAATTCA GTATCACTAA AATCATACCG GCCTGTCACC AAATAAACTT 4020

TCAATTTCTC TCTCATAGCA CTCGTCCTTT CACCGCCTCA AACCAATCCT TCTCCTTCAT 4080

TAAAAGAGAT AACTGATTTA ACGTATTTTG GCGAAAATCG GcCAGCCCTT GGCTCTTAGT 4140

CTTGGCTTTT TCTCCGCAAA GATTAAAATA GCTCACTGCG GCTACTGCCG CAGTcATTGG 4200




CGCGTTGCCT TCACCTAATA AAGCGGCAAC TAATGCGCCG ACTAAATCGC CCGTGCCAGT 4260 

GAAACAATCC AACTCAGGAA CGCCATTTTG TAAGACAATC ACTTGCTCTT GGCTAACCAA 4320 

AACATCTTGA ATCCCTGTTG CTAAAAATAC CGTCTGGGGA AACTTTTGCG TTTGTTGACG 4380

TAAAGCCTGA ATCAGTTCCT CAATTGCTTC TTCACTTTGA TCCAAAGGGC TGCCATCCAC 4440

TCCTCTGCCA TGACTCACCA ACTGGCAAAA AGTCCGCATT TCTGAAAGAT TTCCTTTCAC 4500

AACGGTTGGC TGATTGTGCA CCAGCTTTTC CCCGACTTCA TTTCGAATAT CACTGGCGCC 4560

ATAACCGACT AAATCGACTA CGGTCAGTTT ATTCACTTGC CGAGCATAAT CACTAGCTGC 4620

TAACAAGCTT TGCTCTCGTT CTTGTGATAA ATGGCCTAAA TTCAAAACCA AAGCACTGGT 4680

TTGCTGAAAC ATCTGCGGAA ATTCCCGGGG ATCATCTGCC ATAATGGGTT TAGCATCAAT 4740

ATAAAGTAAT GCATTCGCCA TAGACTCACA CGTAATTTCA TTCGTAATGC ACTGAATTAA 4800 

TGGTGCCGTT GTCAGTGGAA AAATCGTTTC AAACTTAACG CTTGTTTTCA TAAAAGTACC 4860 

CTCCTAAAAA TAATTCCTGT GTTTGTTTAA ATACCCGAGT TTCTTTCAGT TTTACTAAGA 4920

CAAGCCAAGC AATCACTGAA CCAATCAGTG TGGCACCAAT AAATCGTGGT GTATAAATGA 4980

ACCAATAAAA ATTCTGCTTA CTGCCAGTAA ACCAAACCAT TACAGGATAG GACAATAACG 5040 

AGCCAATGAT GCCTGTCCCA AAAATTTCCC CTAGCATCGA CCAGCCTAAT TTTTGTCCCC 5100 

ATTGATAAAA TAAACCAGCA AAAAACGCGC CAAAAACGGC ACCTGTTAAC GCTAATGGCG 5160 

GAATACCTAA GAATGTCATA CGGATTAAGC CGCAGACAAG TGCCATGAGC GTTCCGTAAA 5220 

GAGGACCTAA TAAAACTCCG GCAATAATGT TCATGACACT GGACATAGGG GCCATTCCCT 5280 

CAATTCGCAT TAACGGCGAT AATACAACGT CCATCGCAAT CATTAAAGCC AGTAACGTAA 5340

CTCTGTGGAT TCGATTTTTT TCCACGTAAA AAAACTCCTT TCTGCGAGAA GCAAAAGGAG 5400

GCGCGACTGA AAAAGACAGA AAAACCCAAC CACAAATAAA CGTAGTTGGG TGCAATAATC 5460

GGTCAAAATT TCAAGACATT TCCCTGCGCC AGTCCTAACT GTTTCAGGTT CCATGGGTAT 5520 

AATCTCAGCT AATAGCACCC CAAATGTTTC TCTTCAATTG GCTTTAGTAT AGCATAGTTT 5580 

TTTGGAAGCG ACTACTAAAA TTGCCAAAGA GTTTTCCCAC AAAAACAGAA AAAACATCAT 5640 

GCCCAAAAAG TTCGATTTTT TGGGCATGAC GTTTTTCATA CTTTCGTTTG TCTTATTTTT 5700

TAGAATCCTG GCATTCCTTT GGTGTATTTG CCTAAAGTTG CTTCGGTTTC TTTTTCAATT 5760

TTTACTAACG CGTCGTTGAC AGCCATAATG ACTAAATCTT GTAGCATATC AACGTCTTCA 5820 

GGATCAACCA CGTCTTCTTT AATGATCAAA TCTTTCATTG TGCGATCACC AGTAAATGTC 5880 
GCTGTCACAA GTTGATTGGT GGCTTCACCG ATAAATTCTT TTTCATTTAG TGCTTCTTGC 5940

GCTTTGGCCA TTTCTTTTTG CATTTTTTGT ACTTGTTTCA TCATGCCTTG CATGTTGCCC 6000

ATTCCTCGCA TCATAATTCC TCGACTCCTT CATATTAATC ATCTAGTACT TCGACAAGCT 6060

CTTCACCAAA CATAGCGATT GCTTCATCTA CTACTGGATT TTCTTGTGAA TGTTCaTTTG 6120

TTAcAGGCGG TTCGTCCGCT AGACGAGCCa TTTCATTTTC aGGTTCAGAA TGATTCAAAC 6180

TGCCCTGATT TTGGTTAATA AAACTTTGTC TTAGTTTTGG CCAGCTTTCT CTAGTAATGC 6240

AGACCATCTC TGGTGTATAG TCCATCAAAC GGCTTAAATT ATTATTGAAA GCTAACTGCA 6300

TCTCTTCATC GTCTGTTGCA CGAGCACAAA CAATCTCATA GTCAAATGCC ACCACGATGC 6360

CCTTAGGACT CGCTGCGACA GGTTCACTAG CCTTGAGCAT CGCTCGTTGA GTGACAGATA 6420

ATGTTTGCAG TAAGTCTTCC CAGACATTTT TCACATTCAT TAAATGTGTT CTAGTCGCTT 6480

CGTTTAGGAC TTGATATACG CGTTCTGTGG GTACACGGAA CGAACTTTTT GGCGCTTGTG 6540

GTCGTGCTTG TTGACGTGGC GCATCTGCTT CTTTGGCAGC TACACCGTGT TTCTTTAATT 6600

CAGCCAGTTC TTTCTTCAGT TGCCCGATTT GATTTTGTAA ATCTGCTAAT TCTGGATTTC 6660

CCTCTGCACT ACCATCTTGA TTAGCTGTTT CTGGCGTGTT GTGTTTATTG GGTTGCACCG 6720

TTTTAGCTAA TTTGACAGTG GCTACTTCCA GATAGATATT CGCATTGTTG GTAAAGCGGA 6780

TCTCATTTTG TGTGTCACTT AAAATTTGGA TTAGTTGGTA GATTTTTTCA GCAGGCGTTT 6840

GTGTAGCTAG TTCCTTAAAT GCTTCTGTCA AAGTTCCTGC TTTTTCTGCT AATAACTTCG 6900

GTGCTTGCTG ATACATCAGC AAATCTCGAC AATATAATAA CAAATCTTCT AAGAAACGGC 6960

GCGCTTCTTT GCCTTCGCCT AAAATACTTT CCAAGCCTTC TAATGCTCGT TCAACATCAC 7020

CGGCGACACA GCACTGAATA TAATGATCCA TCATTTCATA GGTTAGGCTG CCTGTCACTT 7080

GCATTGCATC TTCCAGTGTT ACTTTTTCAT CACTAAAGGA AATCGTTTGA TCCAAGATAC 7140

TCAAGGCATC TCGCATCCCA CCTTCCGCAG CACGCCCAAT CACATAAAGG GCTTGTTCCT 7200

CATAATCGAG AGCCATTTCC TGCATGATAT GGGCCATATG ATCGACGATA TCTTGCGTAC 7260

TAATTCGCTT AAAATCAAAG CGTTGCGTCC GTGAGATAAT CGTTAACGGA ATCTTGTGTG 7320

GTTCAGTCGT TGCTAAGATA AAAATAACAT TTTGTGGCGG TTCTTCCAAG GTTTTTAAAA 7380

GTGCATTGAA CGCGCCTGTT GATAGCATAT GAACTTCATC AATAATATAA ACTTTGTACT 7440

CGGCTTGTGT CGGTGCATAC TTTGCTTTAT CACGAATATC ACGGATTTCT TCCACGCCAT 7500

TATTACTCGC CGCATCAATT TCAATGACAT CATTTAAACG ACCTTCCGTA ATGGCTACAC 7560

AGGTTTCACA AACATTACAA GGTTCACCAT CTTGACTATG TTTACAGTTA ATCGCTTTAG 7620

CGAAAATTTT TGCGGCACTG GTTTTACCTG TACCACGAGG GCCAGTAAAT AAATAGGCAT 7680

GAGATGTTTT CTTTTGCACT ATCGCATTTT TCAGTGTCTG AGTAATTGCT TTTTGTCCTA 7740

tACGTCATCG AAACGTTGTG AACGCCAGAC CCGATAAAGT GCTTGATAAG CCATTTTTTA 7800

CCCTCCCTTA TTTTTCATTC TTATCTATTA TACGTGATTT TnAGTCATTC GGCATCTTTT 7860

TTCTTTTAAT GAGTCACTGG CGCAACCATT GCT 7893


(2) INFORMATION FOR SEQ ID NO: 229: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:

GCGCAArTTG GCTACCAATC TAAATnACTT TCAGAAGCGT ATTATACGAG CATGATTGTG 60

GTAATTA7CT TAACGACCTT AGTTGCACCG TTCTTATTGA AATACTTTGT TAAAAAACAA 120

GAA7C7GCGT TGCAATAAAA AAGATGAGCA GAAGTTGTTG ACAAAAAAAT AAAATAGCTA 180

7ATSA7GAGT TCAAGATAAA CGACGGCATC AGGACCCGTG GCCCCATGCG TTTTAATGAC 240

7GGCGC77TT AAGCCAAATA AAACCGCTCC CCCATGTTTT GAGTAGTCCA TTTCGTCTTT 300

CATGCCATGC AACGCATTCT TTAATAATAA GGCGCCCATT TTTCCTTTGA CACCTTCAGA 360

AAGAATCGCT GTTTTCAGCA AGCTCATCAT GTTCATAGCT GTGCCTTCGr tGGATTTTAA 420

AACGGCATTT CCTGTGAAGC CATCTGTCAC CACCACGTCA GCTACGCCAT TTAATAATTC 480

7CGTGCT7CA ACGTTTCCGA CAAAATTAAT CGTTTCATCT GCTGCTAACA ATTCGAAGGC 540

7TTTTTCGTT AATTCACTGC CTTTTGTTTC CTCAGTTCCG TTATTTAATA GACCGACACG 600

AGGATTCTGA ACGTTCCGAA CTTTTTCAGC ATAGAATGAA CCTAAAACGG CATACTGAAC 660

CAAATG7TCT GGTTTATTGT CTGCATTGGC TCCTAAATCT AACATGTCAA ACCCTTTGTC 720

7GGTTC7CCC ATGACAGGTA AAGTTGACAT CAAACCGGGA CGTTCCACAT TTTTGATGCG 780

GCCGACAATG AACAGACCCG CTGCTAATAG CGCTCCGGTG TTGCCTGCAG AAAAAATCGC 840

ATCGGCTTCC CCATTTTTAA CTGCTTGCGC TGCTAAAACC ATGGAAGCAG TTTTCTTACG 900

ACGGATTGCT TTAACTGGCT CATCGTCACT AGCAATTTTT TCATCGGTAT GAATAATCGT 960

AATGTTT7TT TCATCGGTGA TGTATTTTTT TATTTCGGCT TCTTTTCCAT ATAATTGAAA 1020

7TCAATGTCT GGGAAGTCTT GTTTGGCTAG CATAACCCCT TCAACAATTG CTTGTGGGGC 1080

ATTATCTCCA CCCATTGCAT CTACAGCAAT TTTCATGAGT GATTCCCTCC TACTTTTTTG 1140

TTTCCTAACA AGTATAACAT ATTTTTTTCA AAGAAATACG CATTGGCTTT TACTTTCTGG 1200

CTTCAAAAAT TACCTCTTCT AATCGAAAAA TTGGGCTGCC TCATCTTGGG GTTTCACTCG 1260

ATTGGCTAAG CCTTGATAAG CAGGATATTG CCACCACTCT TTCACTTTCC ATAGGGCACT 1320

TGCTTCTTGG CGCGCAACTT CTAAAATGTT AAAGTCCGTC ACGATATCTC CTACCGCAAA 1380

TTGAGGCACC CCTGATTGCC GTGCACCGAA CACTTCCCCA GGTCCGCGAA GTTCCAAATC 1440

ACGCTCACTC AAGACAAAAC CATTGGTGGT CTCTGTCATG ATTTTCATAC GCTCAACGCC 1500

CATTTCATTT TTCGGATTGG CCACTAAAAT ACAATAAGAT GCACTGGAGC CACGCCCAAC 1560

ACGGCCCCGT AACTGATGTA ACTGTGCTAA CCCAAACCGG TCCGCATCCA TAATTAACAT 1620

CACCGTGGcA TTTGGTACAT TCACGCCAAC TTCAATAACG GTGGTAGAAA CCAATAATTG 1680

CAGCTGATTG TCTTTGAACT CCTGCATAAT GTCATCTTTT TCTTGGtTTT TCATTTTACC 1740

ATGTAGTAAT CCCACTTGAT ATCTTGGGGA ATAAAAGCTT TGCATATGTT CAAAAATTTC 1800

TGTTGCGTTT TTCACATCTA AGGCTTCTGA TTCTTCAATC AATGGACAAA TAATATAGGC 1860
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1 lbAl b Ibb 1 


ppappptappt 

bbAbbb Abb 1 




CCATTCTAAA 


ACAGTATCTA . 


ACTGCGGCGG 


1920 


PPP71 A TPP A A 

bbbAA 1 LLM 


PP T P TTT P P A 
bb 1 b 1 i. 1 bbA 


TTGGAATPPG 


TCCAGCTGGC 


ATTTCATCAA 


TGATTGATAC 


1980 


Al LtAl I I bb 


PP Zi T Zi PPPTP 
bbA 1 Abbb 1 b 


TAATAGPCAG 


CGTTCGAGGA 


ATCGGAGTGG 


CCGTCATAAA 


2040 


1 AbAAb A 1L1 


P P T T T T A A P. P 
bb ill i AAbb 


PTTTTTPTPT 


TAATATTTTT 

X X *i X X X A X 


CTCTGATTCA 


CACCAAAACG 


2100 


ATGTTGCTbA 


IbbblbAl I A 


PPA AAPPT A A 


TTGATGGAAA 


CTGACATCTT 

\* x wn\^n x w x x 


GTTGTATCAA 


2160 


AGCATGGGTC 


bb i AbbAb AA 


TP TP A ATPTP 


GPrATTGGCC 


AGTTCTTCGA 

rlv X X * x X vwri 


GAATTAGACG 


2220 


ACGTTCTTTC 


GTTTTGGTbb 


Z\ B.PPTPTTA A 
AAbb 1 b 1 1 AA 


TAAfiGrrnTT 

innOUV/vvj i i 


PGAAPTTCTA 


ACGGATCAAA 


2280 


CAACTGCTGT 


AAGCTTTCCA 


1 blbl IbC I b 


PPPT A A A ATT 


TPAPTPGGAA 


CCATCAACGC 


2340 


TCCTTGAAAG 


CCAGCGGTCA 


T/^P'TPPPPT A 

Tbb 1 bbbb 1 A 


TAPPPP A A TP 


GPAGPPAPTA 


CCGTTTTCCC 


2400 


ACTCCCCACG 


TbAbbTTGAA 


r~*r* A A TPPTTP 
bb AA 1 bb lib 


PAT AT PPT TP 


GGAPTTPGPA 


AGTCTCGACA 


2460 


AATTTCATTG 


GTCACTTTTT 


TTT PPPPPPP 

111 bbbbbbb 


TP.TPAATTPA 


AAAGGPAGTC 


CTTGC jTGAA 

X X -J X W*V* 


2520 


CGTTTTTAAG 


CGATCCAbbl 


bA 1 A 1 1 b AA 1 


APPTAAAPPA 


TTTTTTTPPG 

111111 1 Vyv^u 


CTTTTTCTTG 

\m* X X X X X w X X \J 


2580 


TTTCTTCAAT 


bbTTbbAI I I 


rn a APTTPPZi Zl 
1 AAb 1 1 bbAft 


AAP,PAAAAAP 


TPTTPAAAGA 


CTACTCGTCG 


2640 


TTTAGCTTGA 


TGbbI I ib 1 1 


PTPPZiTTAPT 
b 1 bbA 1 1 AC 1 


PPP,AAAATGP 


ATCGPCCACA 


TTGCTTCTTT 


2700 


TCTAGGCATC 


AAGCGGlAbl 


TTTPTD. A T 1 A A 
I 1 1 b 1 AA1 AA 


ATPATTPPPT 


AAAATPTCTT 


CAACCAGTGA 


2760 


ACCATACTCT 


TCAAAAGCAG 


tp* p 1 /"" p 1 a tt a a 
IbbbbAl 1 AA 


PTPA APP A AP 
b 1 bAAbb ririb 


PTPPTTTPPP 


GG ACTTTTTT 


2820 


AT T C AC AT G A 


TAAATGbbbb 


m\ a a tPTTP 
bbAAA 1 b I 1 b 


ATTPTPTPPT 


TTAGAAGCTA 

l l nunnUv/ x n. 


AGATTTTCAT 


2880 


GCCATTTAAT 


GATTTTCTCT 


1 bbbA J. bbbA 


TTTPPPATAA 


APAPPAATTT 


CTTCGGAAAG 


2940 


AACTACTTTG 


TCTTTTAAAA 


AA PPTTPPTT 
AAbb 1 1 bb I 1 


PAAPAAPPAA 


APATTAATGA 


CCGCATGTTC 


3000 


TTGCATCATT 


/""'/'i TV TV t\ z"' rn m A 

CbAAAbb i I A 


ZiPPPPPTPTT 
Abbbbb 1 b 1 1 


PTTAT AAPPH 


TAPPGAPTAA 


CGACTGCTTC 


3060 


CbAAAbbAb I 


Abbbb 1 1 i a A 


PPPT A APTTT 


TTPCTGATCC 


TGAATCTCAC 


TTAAATCCTT 


3120 


TTCTTGGATA 


1 bAI bbl Abb 


PPA A PP. P ATA 


ATAPGTPAAT 


AAGTCTTCAA 


TTGTAGCAAT 


3180 


bbbbAb i 1^1 


tptba a tttt 

1 b 1 AAA1 Ill 


PAP,PAPdTTT 


TGGGCCAACA 


CCGGGCAATA 


CGCTGACTAA 


3240 


AiCTbTGAI b 


TPP A T A A A 7A T 

1 bbA I AAAAl 


PPPPTPPTPA 
bbbb 1 bb I bri 


TTAATATACG 


CAAAAAAATG 


GAT CAT TC AT 


3300 


rp T\ t» a rn A /"* z"' r^T 

T AAAT AG G b T 


Gb X I bAI 1 Ab 


PPPT A APT A A 


TGAAPPAGPP 


TATTPGTTTT 
im x x xxx 


TATTTTTTCA 


3360 


1 bAbbAAAAb 


b 1 Al 1 b 1 bb 1 


PA P. A ATAAP.T 


ATGGATACAC 


AGGTTGTCCA 


CCTTCATGAA 


3420 


bllblbillb 


1 Ab 1 1 bA 1 bb 


TTTTPAP,AAA 


P T AAAAT AG C 


GATTAGCTTT 

un x x \p* x x x 


TCAGCTTCTT 


3480 


bAAbbbAAbb 


7A TPTTPPPPP 

Ai b 1 ibbbbb 


APA AT AATTP 
ntnn 1 Art 1 lu 


TPAPHATTTP 


AGTGTCTTCA 

rtVJ 1 O X V» X X V^rt 


TCGATCATGC 


3540 


GGTTCAACGT GTCTAATGAT GCATTAAAAC GATCTGCTTC AGAAACGACG ATTTTACCAT 3600

CAATCATTCC TAAGAAGTCA TCTTTTTTAA TTTCAACGTC ATCAATGGTT GTGTCACGAA 3660

CAGCTGTGGT TACTTGACCA CTCACAACGC TGTCTAACAT TTCCGTCATA CTTGCTTTGT 3720

TTTCTTCTAA TGATTGTTGA TCATTGAAAG CTAACATTGC TGTCATTCCT TGAGAAATGG 3780

TTTTAGATGG AACAACCGCT ACTGGTACAT CGGCAACTTC TGCTGCTTGA TCAGCTGCCA 3840
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TAAAGATGTT 


TTTGTTGTTT 


GGTAAAATAA TQACTTGCTC 


TGCATTCACT 


TCTTTGACTG 


jyuu 


CTTTTAAAAT 


ATCTTCCGTA 


PTTP,(5P,TTPA 


TTGTTTGGCC 


-» J**! /-> t\ y*^ m tv tv m Tft 

ACCACTAATA 


ATGTAGCTTG 




CACCTAAACT 


TCGGAATAAT 




PTTPAPPGGC 


TGCAATAGCA 


TV m /""i TV TV TV TV 

ATCACAGCAA 


4 UZU 


ACGGTACGCG 


AACTTTTTCA 


ptpazippptt 


PTTPPTCJAGC 


ATCGCGTTCA 


ACTAACGTTT 


4080 


CATGTTGTAA 


ACGCATATTG 


TPTZXPTTTTZJi 


TTTTT APT A A 


CGAACCAAAT 


TTTTGGCCGT 


4140 


AGTTCATCAC 


TTCACCTGGA 


T P.PT P Zi C T P.T 


nPAPPTHAAP 


TTTGATAATT 


TCGTCATCGT 


4200 


TAACGACTAA 


TAGAGAATCG 


1 1 1 \*f\ 1 


TTAAGTAGTT 


TCTGAATGTT 


TCATAATCAA 


4260 


AAGCGCTGTC 


CACGGTTGGG 




ta pr; a APP AT 


AATTTCTGTA 


f*% TV T\ TV ^ TV TV 

CAATAGCCAA 




ATTTAATGTC 


TTCTGTCGCA 




TazxpappApn 


ATGATGTTCA 


GCATTAACCA 


4 380 


TTTCGTCCAT 


TTCGGCAGGA 


f*> fp m /- p m rn r~> TV m 


ZiPPTTTPAl^T 
nLu 111 ^nO 1 


TGGTAAAAAT 


TCACCACTAA 


4 4 40 


GCGCTTCTAA 


AAATCCTTCA 


J. Hon 1 AAA 1 A 


P.TZXP.TPPTTP, 

O 1 rlo 1 \*\— 1 1U 


ACCCCCACTA 


TCTACAACGC 


4 500 


CGACTTCTTT 


TAGGACAGGT 


HA i AAA 1 L. Avj 


ulol 1 1 1 LoL 


TAAGGCACGT 


TTAGCACCTG 


4560 


CTACAACAGC 


TGTCATAACT 


I CCa A 1 AU Ab 1 


L.A1 L 1\J 1 1 


TTTCGCTTTa 


CGTTCaCCAG 


4 620 


AacGCGCCGC 


TTCACGAGAA 


AUAU 1 1 AAvjA 


TTPTTPPt-TP 
1 lu 1 1 LLL 1 \* 


aACGGGTTTC 


aTCaCTGCTT 


4 680 


TATACGCTGT 


TTCTACACCA 


TrTPT A 7\ -\ C f~* 

l(j I (j I AA3vjo 


PTAr'PPPPPZi Zi 
CAbLbbLUMA 


GTCTTTGGCA 


TTTAATGTTG 


4740 


TAACCTCAGG 


AATTTGTTTT 


G Ab AAb b L 1 1- 


bbAA 1 AA 1 lb 


AGATAAAATA 


ACCCCAGAGT 


4800 


TTCCACGAGC 


ACCCATTAAT 


AALLL 111 bo 


HTZiZVPPPflPP 


TGCTAATTCT 


CCTACTTTTT 


4860 


CAGAGCGAGA 


ATCGGCCACA 


bbL, 111 uvnL 


PZIPTP^TPAT 


TGATAAGTTC 


ATATTTGTTC 


A ft O A 

4 920 


CTGTATCACC 


ATCGGGTACA 


p m zi zi 7i zi ppt 


TTAATflAPTT 
1 1 r\r\ 1 unu 1 1 


GACATATTCT 


GCATTTGTGT 


4 9o(J 


GCAGACGAGT 


CGCACCTGCC 


Tn^arraTTT 

lUOnULnl J. J. 


PPTGGAAPTG 


ACCTGCGCTG 


ATTTCTGTTA 


D04U 


CATTCACCTA 


AAAAATCCTC 


PTTPAf^ATTT 


APCGPTAACG 


GCAACACACG 


ACTTCCTTGT 


CI rtfj 

DlUU 


GCAATTGCAT 


CGTTGACAGT 


zvpzvptapptp 


TAATPPGGTA 

1 xi/i 1 O VJ Art 


ATACACGAAC 


fT*l ^+f*fTl m ^ TV tv TV 

TCCTTGAACA 


blbU 


AAAACGTTAA 


CTGAATTCGC 


Af^TPZiPZiPPT 


AATAATGTTT 

nn 1 nn 1 U 1 1 1 


CAAGATTATA 


TTTTACTTTT 


c o O A 


TCTTGGACGT 


TACGAGAAAC 




TTTPTAPPAT 
1 1 Ivjln^^nl 


AACTAACAAT 


CGTGTAGACA 




TCCAGTGCGA 


TTCCGTTGTC 




APAAPftAPAP 


CACGTGAGTA 


ATTCTCACGA 


O 34 U 


CGTAAAATAT 


CGTTTAAGTT 


nlLl 1 I AAi 1 


TP ATTTTTP.P 
lUril 111 lu^ 


TTGCCATTCC 


GACAATTCCG 


C A AA 

o4 UU 


TAAATATCTG 


TCGCAGCTCC 


PPPZ\ ZIPTZiPT 


P.TAPPP.ATPA 


CATCATTGGT 


AATCTCAATG 


54 60 


GTACCTGCGG GTGTTTGAAT TTTTACAGCC ATTGATGAAA GCCTCCTTAT AAGAGCACAT 5520

TCGCCCTATT TnCTTCCATA ACATTTTATC ATATCCAGCC TTTAATTAAA AGGAAGTtCG 5580

TCAGATATkT TCATTCTTCC ACTTTTAAGT GAACATTCTA GCAAGATTTC AATTTACTCT 5640

TTTTGATTGC TAAAGTCAAA TCATAGAAAA AAATCTTCTT GCAAATCAGT GCCTAATATG 5700

ATAAATTAAG TTAGTATGAG CTGAATATTA TCGCTCCAAG ATCAAGGCAA AGGAGGAAAC 5760

TTAAAATGGC AAAAGTATGT TACTTCACTG GTCGTAAGAC TTCAAGTGGT AATAACCGTT 5820
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CACATGCGAT 


GAACTCAACA 


AAACGCACTG 


TPQAAPPTAA 
1 b/irUT.bb 1 ru\ 


PTTAPAAAAA 


PTTPGCGTAT 


5880 


TAATCGATGG TAAACCTAAA AAAGTTTGGG 


1 b L bAAb i bb 


TPPTTTPA A A 
1 XXX onnrt 


TPTPPT A AAA 


5940 


TPPAAPPTPT 


TTAATAAAPT 


TAAATAAGCA 


AAbb 1 bbAbb 


TTPPTTPTTP 
1 1 bu 1 1 b 1 lb 


PPP TTTTTTT 

kjO^»* 1 X X X X X X 


6000 


attpp ata a a 


A AAAPAPPPT 


GGACCTTCAT 


CGCATTbAlb 


7A7\PPTPP7A7\P 

AAbb 1 bbAAb 


PTPTTTTTAP 




TTPTPTTTAP 

1 1 b 1 b 1 11 Ab 


TTTP A ATA ZVP 


PGPAATCATC 


bbl bbAl bbn. 


APPAAAAAPP 


APPPPPTTPT 
nvjvUU^ X X \* X 


6120 


TbbAxAAAl 1 


PATTAPTTPP 
bAl 1 Ab x lu*- 


ATAPGATGTT 

X V* X \J X X 


bbb 1 AAbbAA 


PATPTTPATT 
bA ILi 1 bA 1 1 


PPTTAATAAA 
Owl X rtrt X 


6180 


TAI 1 I AC lbb 


PTPPT APPPT 


PAAGTTTTCT 


AbAbbbb 1 bA 


AAb AbbAb 1 A 


PPPP AP AT AT 


624 0 


TLX 1 lAiLIb 




TAPPATATAT 


GAACCAGbbb 


PPT A AT APTP 

bb 1 AA 1 Ab 1 b 


PATPPT ATTT 
bAJLbL-lril 1 1 


6300 


TG AbbA 1 bbb 


AT APAPPPAT 


TTPAPP.AAGT 

X x unuunnu x - 


ACGCCTTbAA 


ATPPTPPTTP 
Al bb 1 bb lib 


A A APP.PT A AP 


6360 


*~» Tl 7\ 7\ f"» "ft TT P P 

bAAAbAi lbb 


PTAAPAPPTP 
b 1 /lAb Abb 1 o 


PTPAATTPTP 


CCCCCTGTCb 


PJ\PPP7\T7\ AT 

b Abbb A 1 AA I 


TPTPATPTPT 


6420 


bb 1 1 b 1 bbAA 


ATPPTTPT A A 


APPTTPTtPA 


AGCGbTAAri 


PTPT A TPTP-I- 

bl b I Al b lot 


ATPATPTTTT 


6480 


TbAbbAbb J. b 


b 1 1 b AA 1 bAb 


TPTTTPPPTA 


GTTTCTTGGA 


/-1 Ti T\ 7\ t\ rp ('** T A 

CAAAAT. bb 1 A 


TTPTTPTPPT 
1 Ibl 1 b 1 tb 1 


654 0 


GAI AAIbAAl 


bAAAAl bbbb 


PAPPPPT A AT 


TGTAACGGTA 


AA1 bbbbb lb 


T A ATA A ATPP 
i AA1 AAA1 ou 


6600 


AbAbAbbbbb 


P A TP A ATA PP 
bA 1 b AA 1 Abb 


T APPAA ATAA 


TCATACGTAG 


CTGGTTbAAl 


bb 1 IbbbbA-rt 


6660 


TbAb i 1 bbAl 


TPPPPPP APP 
1 bbbbbbAbt 


T APPAPTAAP 


ACACGACTCA 


m ro m o rp ^ rp t\ 

TTTGTCTAGb 


A ptpptptp a 
Ab 1 bb Ibl bA 


\J f £. U 


/—rp nmpp jv rp 

G I bbAl bbAl 


TPPTTPTTPA 
1 bb 1 1 b 1 1 brt 


PPATTTTPAG 

O Un X X X X Wrto 


CATTATAAAT 


GTAGGAGCCT 


PP A APP RBPfl 

bbAAbbAAbM 


0 » 0 u 

1 


r"»TTr"^/""PPAPP 

CT. I/bbbbAbb 


PPPTTPTTTP 
bbb 1 1 b 1 1 lu 


PAPPPTP,PP,f^ 


CAGTTTCAGG 


AACAATbbbb 


PP ATPP APTT 
bbAlbbAbl 1 


ooiu 


bAAl 1 1 


PPTPTATPPT 
bb 1 b 1 Al bo 1 


TTTPTTTPTT 

XIIOXXXV^XI 


TCCATTCTTT 


rp t\ ATTr^APPA 

TAAI I bAbbA 


A TTTT TTPT A 


6900 


b 1 bAb 1111b 


AATAAATPAT 


TPTPPGCCAA 


AGCCTGGATT 


GACAGTbAI I 


APP A AP APTT 
AbbAAbAbl 1 


6960 


bb 1 b 1 bb 1 aft 


ATPT A AT APA 


TTTTPPATPA 

XXXI OVJrtl ^/H 


TCGAAAGTGG 


TGTbbbAbbA 


TTPATPPTTA 
1 1 b A 1 uu 1 in 


7020 


bbbbAbb ill 


pap appppp a 


TTTTTA A TP A 

X X X X 1 / loi. 1 n-'/A 


TTTGTAAGGC 


7\ /^•/~" A T P A A T A 

AbbAl bAAl A 


TPPPPTPTTP 


7080 


PT TfT TP. El T P 


P APPPT A AT A 


ATATPCGCAC 


CGGbTTbbbb 


A A AAPPATTP 

AAAAbbAl lb 


AT ATA A TTTT 
nln lrirt 1 XXX 


7140 


p a p p t t p t a p 


AATPATPA A A 


TGAAPATCCA 


ATGGTAGTTT 


TGTbAb 1 bbA 


PP A ATPPPTP 
bbAAi bbb 1 u 


7200 


AAAPPATATT 


PP, PTPPT AAG 


GTAATGTTGG 


GCAb AAAT Tb 


APPPTPPATP 

Abbb 1 bbAl b 


APATPAAPAT 


7260 


rn AT AT A ATP 


TPPTPPTAAT 


TCTTCTACTA 


AAbbbAiAlb 


T/™*TTTPTA A A 
1 b 1 1 1 b 1 AAA 


TTPPPA A A AT 


7320 


pgp,papttaa 


AATPPATPGT 


GCTAGTTTCA 


TT I bbb xbbb 


TPPPTTTATA 
1 bbb 1 1 inln 


TTTPPT ATGA 


7380 


TTTTTTCTTA 


TAAACAGGTC 


GTCGATTCTC 


bAl 1 ibbAl 1 


A A. A A A PTPT A 
AAAAAb luln 


AATAATTTTP 

rtn X /tjV X X X X 


7440 


GTAACGACTC 


GTCGCAATGG 


TTCCCGCTTC 


CAb I I b Abb i 


TTP APT TPS P 

1 1 b Ab 1 i bAb 


A APPAPPTTP 


7500 


TTTGTGGTGC ATACATTCAC GAAACTTACA GTGAGAGGCC GCGGCAACAA ATTCAGGAAA 7560

TTGCTTTGGT AATTCCACTG CTTCCATCTC CAAGAAATCG ATGGCACTAA ATCCTGGTGT 7620

ATCAGCGACC AAACCGTCGT ATAACGGAAT CAATTCCACG TGACGGGTCG TATGTTTGCC 7680

CCGTCCTAAA GATTGGGAAA TTTCCGCAGT TGCTAGTTGT AATTCTGGTG AAATTTGATT 7740

CAACAATGTT GATTTTCCAG CGCCTGATTG TCCCATAAAA ACAGTCAACC GTTCTGGGAA 7800
AAAGCGTTCT AATTCTTTGG TGGCTTCTAC GTCTTCCGAA GCGATTACCG CATAACCTAA 7860

CGCTTCATAA ATCTGTTTAA TTTCAGTGAC TTGTTGACGT TGCGGCTCAT CTAATAGATC 7920

AACTTTCGTT AAATAAATAA CTGGTTCTAT ATCCTTATAC TCTAAGCTGA CTAAGAAACG 7980

ATCCAACAAA TTAAACGAAA AATTTGGCGA AACCATACTC ATGACAACCA CACCTAGATC 8040

CACATTTGCA ACAGGCGGGC GCACTAATTC ATTTCGTCTT GGTAAAATTT CCAACACATA 8100 

ACCATCTGTT AAATTATCGC TTTCAAAAAG TACTTCATCT CCCACGAGTG GCGTAATTTT 8160 

TCGATTACGA AAATTCCCTC GCGCTCTTGT TTGATATGTT TCTCCATCTG CGTATACATA 8220 

ATAAAAACCG CTTAACGCTT TTCTGATTTG ACCTTTCAGA TAAACCACTC CTAACTACTT 8280 

TCGAATATAC AGGATAGTGG CAGAGAAATC AAGGGAGCCT CTCGATTCTA TTAATTAAAA 8340 

AGAGCCTAGG ACATAAGTCG AAATGACTTA CCTTCCTAGA CCTAAATCTT AAGATAAATG 8400 

GCGAGACAGG AGCAGCTTCT TCTTATTGCT GAGAAACACT ATTTGAAAAG CAAATAGATG 8460

TTAAAcAGTA CCTACTTGGT TCTTGAAGCT AAACACTCCT GTCCCAACCT CTTGTCTTAA 8520 

TTACTCGTAC TACTTTCACT AGTGGTGCTT TTTGAATCAT TTGCTTTTGA ATAATAAACT 8580

GTAACAGAGT CGCCTTTTTT GACTTTGCTG CCAGCGGATG GGCTAGTTCG TTCAACGACT 8640

TGGCCATCGC CAGAGCCAGA TAATCCTTGT TCATTGATTT TAAGGCCAGC GTTATTGATG 8700 

CTGTCTTCTG CAGCTTTTGG CGAATAACCT GAAATATCAG GAACAGTTAC TGAGTCACTG 8760
CCTTTGCTGA CATGTAAAGT AATCGTGTCA TTTTTCGGAT CAACAGCGGT ACCAGAAGCT 8820 
GGTTCTTGAC TAATCACGGT ATCTTTTTCA ACTTTGTCGC TTTCTTCGTC CACTCGTTTA 8880 
ATTTGAGATT CTGGGATACC TAAAGCAATT AAGCGACTTA CCGCATTATC GTAAGAAATT 8940 
CCGCTATAAT CAGAAAGTGT CACTTTGTCG CTACCTTTAC TTACATATAA CGTCACTTGG 9000 
CCATTTTTAG GTGTCACGGT GCCACCTGGT GCGGGGTCTT GCGTAATGAC TAAACCTGGT 9060 
TCTACCGTAT CGCTTGCTTG GTCGACACGC GTAATTTGAG AGTCAGAAAT GCCTAATTGT 9120 
GCCAGTGCAT TTACTGCATT TTCGTAAGAA TAACCGGCGT AACTAGGCAA AGTAACCGTT 9180 
TCTGGTCCTT GACTGACCGT TAAAGTGACG TTGTCTTTTT TCGGATCAAC TTTTTTACCT 9240

GCAGCTGGTT TTTGTTTAAT AATGCTATCC GTAGACACAG AATCACTGTA TTCTTTTTTC 9300 
GTTGTAATTT GGTCTTCTGA AAACCCTAGT TTTTTCAAGG CTTCGACAGC AGATTCATAC 9360 
GATTCCTTTG TATAATCGGC CATCTCAATT TTTTCTGTTC CAGAGCTGAT GTATAAAGTA 9420 
ACAGATCGGC CTTTTTTCAC AGATGATTTT GCTTCGGGGT CTGTTTTGAC CACCTTGCCT 9480

TCTTCAATCT TATCGTCGGG AATTTTTTTG GTTTCACTAT CGACTTTCAG CCCGGCACTT 9540 
TTTAGCGCTT GTGAAGCGTC CGCTTTCGTT TCATTTGTAA CATCAGGAAC TTCAACGTCT 9600 
TTACCGCCCG ACATTGCAAA GGCTAAGCCA CCTATTGCTA AAGAAATTAA TGCCGCAATT 9660 
GCTAAGCCAA TCCACAATTT TTTACGGTTT TTCTTCGGCG GTTGTTCAGC TAAAATGTCA 9720 
TCTGCGATAT CTTCTGGGAC TTCGACAGGC GTTGTCTCTT CCGGTTCAGC GATGTCTTCG 9780 
GGAATCGGAG TTAATACTTT CGTTTCTCCT MTAAAGCCG TTGGTTCCCA CGCAGGTTCG 9840

TTTAAACGAC TGGCTGACAA GGACGTGTAT AAGTCTTCTG CCATCTCATT CGCTGTTTTG 9900 

TAGCGATCGC TTGGGTCTTT TGCGGTTGCA TGACGAACCA CATTTTCCAA TGATTGAGGA 9960 

ATCCCTGGAT CAAACATTTT GACAGAGGGA ATTTCTTCTT GAAAATGTTT TAAGGCAATC 10020 

GTTACGGCTG ATTCACCATC AAAAGGTACA TTCCCTGTTA GCATTTCATA GAGAATAATT 10080 

CCCACAGCGT AAATATCTGA TTGGTTGGTC GCCATGCTTC CGCGCGCTTG TTCTGGCGAT 10140 

AAGTAATGCA CCGAACCTAA CATTGTGTTC GTTTGCGTAA TTGACGTTTC TGACAAGGCA 10200 

ATCGCAATCC CAAAGTCAGT AATTTTGACT GTGCCGTGTT CGTCAATCAG AATGTTTTGC 10260 

GGTTTTAAAT CTCGGTGAAT AATTCTATGT TCATGTGCCA TTGCGACAGC AGATAAAATT 10320 

TGTTGCGTAA TGTCTACAAC TGTGGAATAA GGAATTGGGA AATGCGTTTG GATGTAACGT 10380 

TTTAAGTCCA TTCCTTTCAC ATATTCCATG ACTAAATATT GTAGTCCATC TTCTTCGCCT 10440

ACATCGTACA CACTGACGAT ATTCGGGTGA ACCAGCTCAG TTGCGGCTAG TGCTTCACGC 10500 

TGAAAACGTC GGATGGCGGC TTGATCGTTT TGGAAGTCAA AGCGCAAGAC TTTTACTGCA 10560 

ACGTCTCGGT CTAAAATTAA ATCGTGTGCT AAATAGACGT TGGCCATGCC GCCGCTTCCG 10620 

ATGCTGCCAA TAATGTGATA TCGACCATTC AGCTTCTTGC CGATTTCTAT CATTGGTTTT 10680 

CCTCCTTCTG TTCGTCAAAG TGTATCACTA ACACGGTAAT GTTATCTAAA CCACCCGCTT 10740

CATTTGCTTG AGCAACGAGT TGGCTTAATT TGGATTCTAA AGGATCTGAC GTCTCTAAAA 10800 

TTTCTAAAAT TTTTGTCTCA GGAACCATAT TGGTTAAGCC GTCCGAACAT AACAGTAAAT 10860

AGTCATTAGG CAGCCATTCA TGATTGGTTA CATCGACTTC AACAGTGCCA GGCATTCCCA 10920

AAGAACGTGT TAGGACATTT TTTCGTGGAT GATTTGCAGC CATCTCACGA GTAATCTCCC 10980 

CTGATTTTAC TAGTTCATTC ACTAAAGAAT GATCTTCTGT TAATTGTAAC ATGTGTTGGT 11040 

CACGAACTAA ATAAGCACGA CTGTCCCCCA CATTTGCTAA GACAAATGAA TCCGGTAAAA 11100 

GAATGGCACC AACCACTGTT GTCCCCATTC CTAAATACTC TGGTTTCGAT TGTCCGCGTT 11160 

CATAAATTTT TTCATTTTCC TTTTGAATTG CTTGAATTAA CCACTGTGCA GATTTTTCAG 11220 

CTGAGTCGAC AGACGCATTG GACCAACTCT CACCTAAATT ATTGACTGCC ATTTGGCTAG 11280

CGACATCACC TGCTTGGTGA CCACCCATGC CATCTGCTAA AACAGCAAAG GTTATGTGTT 11340 

GTTGGTTCTC AAAAACGTTG GCATAATCTT GGTTCGTATT TCTTTTGCGA CCGACATCTG 11400 

ATTGAACGTT GATTTCCAAA TTATTTCACC TCATTTGAAC TAACTTTCCG CATACAGCAA 11460

ATAAAGAAGC CATCTGTCAT GTATTGATGG GGATACAGAA CCAGTTCTTG CTCTTTTACT 11520 

ACTGCTTGGA CATTTTCATT TGCAACAATT TCAATTTTTT CAAATTCAGG GTGTTTGGCT 11580 

AAAAAGGCTG CCACCACTTC CTGATTTTCT TCTGGCGTAA TAGTACAAGT GCTATAGACC 11640

ATTATACCCC ATTGTTTTAG TGTTGGGGCA ACACTTTCTA AGATTTCTAA CTGAATTTTA 11700 

GGTAAATTTT GAAAATCATT GGCCGTTTTA TGGTATTTAA TGtCTGGTTT ACGGCGCATT 11760
AGCCCTAAGC CTGAACATGG TGCATCAACT AAAATCCGAT CaAAGGTTTc TGCGGGGAAT 11820






ATCAAGCTTT 


TCTGCTTGGA 


CGACATCCGC 


CACATGGAGG 


11880 


Lb 1 I GCGCA1 


ill CAI 


TAGCTTAATT 


TTGTGCGCAT 


GAATATCTAA 


AGAAGTCACA 


11940 


CGACCACCTG CTTCAGCG IC TAAAAAAGTT GCGATATGAG TCGTTTTACC ACCAGGAGCC 12000

GCGCAGGCAT CCAAAACACG ATGATGCGGT TCAATTTGCA TCGATGGAGC CACGAGCATA 12060

GAGCTTTCAT CTTGAATGGT TAAACAACCG TTAATAAACA GTTCACTGCC AGCTAAGTAG 12120

CCTTTTTCAG CGACAACACC ATAAGGGGAA AGTTGACTTT CCTTCGCGGC AATCCCTGTT 12180

TCTTGTAATT CTTCCAGCGC TTCTTCTCGT GAAATAAACC GTGTGTTTAC GCGGCCACTC 12240

GCATGGCTTT TTTCAAATAA TGACAAACCC AACTGCCGTG TTTCTTCTTC GCCTAATTGG 12300

GCAAGTAATT TTTCTGTTAA CCAACGTGGC ATACTAATTT CCGTTGCTAA ACGATCAACT 12360

GGATCGCTAA TTGCGGCCAA ACTAGGTGCT CCATTTGGTT GGAACGCCCG CAGCACGCCA 12420

TTCACGAATT TACCAATCCC AGGATTGcCA CGACGTTTGC CAATTTCAAC CGCTTCATTA 12480

ATGACTGCGT GATCAGGCAC TTTATCTAAA TAAAGTAATT GATATAACGA TAAAATCAAT 12540

AAATTTTTCA CCCAATTATC GACTTTTTGC GGTTTCTTAA CAAATGGTGT TAAGTAGTAT 12600

TCCAACAATA ATTTGCGACT AATTGTGCCG TAGACAAGTT CTGTAAACAA GCGGCCATCT 12660

TTCTCGCTTA GTTCTGATTT TGTCATCATT TCATTAAGTA AAAGATTGGA GTACGCCCCA 12720

CCTTTATCCA CACGTTCGAT GGTTTCTAAA GCAACGTATC GAACAGAACG TTTTAATTTC 12780

TTGGGGATTT TTTCAGCCAT TTTTGTTAGT CCACCTTATC TTTTTCTGCG ACATTTTGAC 12840

CAACGCCATT TAAAAATTCT TGAATCGTTA ATTGACCTTT ACCTGCTGGT TGCAGGCGAT 12900

CAATTTGAAG GACTGTCCCT TCACCACAAG CAATCCACAG CGCTTTTTTA CTACGTTGGA 12960

TAATCGTTCC TGGGGCGCTG GTTGTTGTTT CAGTCAGTGG TGTCACTGCC CAAATTTTCC 13020

AGTTGGTCCC TTGATAGGTT GTGAACGCTG TTGGCCAAGG ACGCATACCA CGCACTTGGT 13080

TATCAAT I GC CTCAGCAGTT TTTTGCCAGT CAATTCGTTC TTGTTCTCGA GTAATATTTG 13140

GTGAAAAAGT AGCCTCTTCT TCCACTTGTG GAATAGGCGT GATTTCGCCA GCAATTAGCT 13200

TAGGTAACGT CTCTAATAAA AGTTCTTTTC CTAGAATGCT TAATTTTTCA AACATGGTGC 13260

CTACATCATC TTGTTTCGTA ATCGGAATTG CGCGTTGCGA TAAAATCGCG CCAGCATCCA 13320

TTTTCTTCAC CATTTCCATA ATCGTGACGC CTGTTTCTTT TTCTCCTTCA ATAATTGAGT 13380

AATGTACAGG TGCGCCGCCA CGATATTTTG GTAAAAGTGA GGCATGGACA TTAATTGCAC 13440

CTAGTTTGGG CGCTTTTAAG ATTTTCTCTG GTAAAAATTG TCCAAAGGCT GCTGTCACAA 13500

TTAAATCTGG TGCTAAGTCA ATCACTTTTT CCATTTCAGG AGAACCTGAA ATTTTCTCTG 13560

GTTGTAAGAC TAATAAGTTA TGTTTTAAAG CTGCTTCTTT GACAGGCGTT GGGGTAATCA 13620

CTTTTTTTCG GCCAACTGGT CGATCTGGTT GCGTGACAAC CGCTTGAACG TCATAGCCTG 13680

CCTCGATTAA ACTTTCTAAA ATCGGAACAG AAAAGGCTGG TGTGCCCATA AATACAATTT 13740
TAGTCATCAA AATGTTCCTC CATATACGCT TCTAAATCTT CTGGTGCAAT TTTCTCAATC 13800

ATTTTATCGA TAAATAACTC GCCATTCAAA TGATCAATTT CGTGTTGAAA AGCTCGAGCT 13860 

AAATAGCCGT AAGCCGTAAC TTCCATTTCT TCACCTTCAC GGTCAAAATA ACGGACAGTC 13920 

ACTTCATCTG CTCGTTCAAC GGTTCCATAT GTTTCTGGAA TACTTAAACA GCCTTCCACA 13980 

TCAATACTGG TTCCCT7TTT TTCAATAATC ACTGGATTAA TTAGTTCAAA ACGACCAGAT 14040

TCTTCATCGA TTTCAATCAC TGCCATTTGC AAATTTTTAC CAATTTGCGG GGCTGCTAAG 14100 

CCAATCCCAT CATGGGCCAG CATTGTTTCA TACATATCCT CTAGTAACTG CACAATTTCA 14160 

TCGGTAATAA TCGTCACAGG TTGAGCCTTC ATTTTCAAGT GTTCGTTCGG GTGAATCACA 14220 

ACTGGATAGC GCATAAGTGC TCCTTTCAAA GCTGTATTTT TTAAATAAAA TTCATGGGTT 14280 

CTGCATCAAT GGAAAGCTTT AAGCCACGAG CCGTTGCTCG TTGTGTATCG GTTAAAATTT 14340

CTTTCAATAA TGGTTGAAGC ATTGGTTCTT GTTTATACTT GATAATCACT TGATAAAAGT 14400

ATCGATTATT GACACGCATA ATTGCGTTGG GTGTTGGTCC CAACAAAATA GCTTGCGGCG 14460

ATAAGCCTTG TTTTAATTTC GTCGCGATTT GAAACATTTG TTTCGCTGCT TCATTTTCTT 14520 

CAGGATGGCT AGCCGTAATT TGGACCGTAA AATAGTACGG CGGATAATCC CCACGATGAC 14580 

GAATATACAT TTCTTTTGTA TAAAAATCTT CATAATCTTG GGCTTTTGCC AATTGTATCG 14640

CATAATGCTC GGGGTTAAAA GATTGAATGA TGACTTCCCC CGGTTTTTCA GCGCGACCAG 14700

CTCGACCGCT GACCTGTGTC AATAATTGGA AGGTCCGTTC ACTGGAACGA AAATCGGGTA 14760 

AATTCAAGGC GGTATCAGCA TTTAAGACAC CGACTAACGT CACATTTGGA AAGTCCAGTC 14820 

CTTTGGCAAT CATTTGTGTG CCTAATAAGA TATCCGCTTG TCCTTCACCA AATGTTCGCA 14880

AAATCTTTTC ATGTGCACCT TTGCGACGGG TCGTATCAAC ATCCATTCGG AGAATACGAC 14940

TGTCTGGTAA CAACGTTTGT AATTCTTCTT CTACTTTTTG TGTGCCTGTT CCGTAGTAGC 15000 

GAATTTTATC TTGGCCACAG TTAGGACAGC GATAAGGAAT CCGTTCTTCA TGTCCACAAT 15060 

AATGACATTT CATTGTTTTA CTATCCATAT GCAAAGTTAA GGAAATATCA CAATTCGGAC 15120 

ACGGCAACAC ATAGCCACAA TCGCGACACA TCACAAATGA CGAATAGCCA CGGCGATTAA 15180 

GTAGGAGAAC ACTTTGTTCG TTTTTCTCTA AGCGTTCTTG GAGTTTTTCT TGCAACGACA 15240 

TGGAAAAAGA AGAGACATTG CCATTTTCTA CTTCTTGCCG CATATCCACT ACATCAATCG 15300 

TAGGCAAGGT TGCCGCTTGA TTGGCACGCT GTGTTAATCG TAACCGCTGA TAAACATTTT 15360 

TTTGTGCGCG AGCGCGTGAT TCCAAGGATG GCGTGGCACT GCCTAAAACA ACAGGGCAAT 15420

GATGGTACTC CGAACGCCAA ATCGCTAAAT CTCTCGCGTG GTAGCGCGGG GTTTCTTCTT 15480

GTTTATAACT TGCCTCGTGT TCTTCATCAA CAATGATCAC GCCGATGTTT TCGATTGGCG 15540 

CAAAAATGGC TGAACGCGCG CCAACCACCA CTTCCGCTTC TCCTCGTTCA ATCTTCCGCC 15600 

ACTCATCGTA TTTTTCACCT TGCGATAAGC CACTGTGCAT TACAGCAACA TGTTCGCCAA 15660 

AGCGACTTTT AAAGCGTtGC ACCATTTGCG GAGTTAAAGA AATTTCAGGG ACAAGCATAA 15720 
TGGCTGTTTT GCCTTGGTTA AGAACTTCTG CAATGACCTG TAGATAAACT TCTGTTTTCC 15780

CGCTTCCTGT AATACCTTCC AATAAGTAGG TTTGGCTTTG CTGTTCTTGG ACCGATTGTA 15840 

AGATTGTTTC CACAGCCACC TGTTGTTCTG CATTCAAAGA TAACGCAGTC GTTTTTTCAA 15900 

ACGTCTGATT AGCAAACGGA TCACGATACG CTTCTTTTTC AATGAACGTT AACCAGCCGT 15960 

TTTTGGCCGC TTCATTTAAA AGCGCGGTAC TGAAACCTAA TTCTTTCATT TCCTTCACAG 16020 

CGGTGACTTT TTCAGTGCCC AAGCGTTGTA AATAGTAAAG AAGTTGCTCT TTCTTCTTAG 16080 

CTCCCTTGCG TAACCCCAAA CGAATTTCCT CTAGTTGTTC AAATTCCTTA GCGGCTTGAA 16140 

TGAAGCGGAC CATTTTGACT TTATTTCTCG TCGTAAcTTC GTAGCGAATA TCGACTTTTT 16200 

GCTGTTTACG CAACGCCATT AGTTGAGGCA AAAGCCCACG TTCTTGCGCT TGGTCCCACG 16260 

AAATTTCCTC TAAACCATAA AATAACTGAT CTTGTAATTC TTCAGAAAGC TCATCCGTTA 16320 

AATAAATATA TTTTTGGTAA TCGGCACGCA TTACGCTAGG CAACATGGTT TGCAAACAAG 16380 

TGATTTTAAA AGCAAACGTT TTTTCCTTCA TATAATCAGC TAAAGCCAAC ATTTCTGTAT 16440 

TTAAAACAGG TTTTAAATCC AACACCGCCA CTAGTTCTTT CAACTGAACA TTGGTTTCGT 16500 

CAAGAACCGT TGCAGCCACT GGTTCAATGG CTAGAACAAA CCCTTGAACA TGCCGATTAC 16560 

CGTTACCAAA AGGCACTTCT ACACGCATGC CGACAGCCAA TTGCTCATTT aAATTTTCAG 16620

GAACTAAAtA AGTGAATGGT TGATCGGTTT GcATCGTTGG TACATCCACA ATAACTTTGG 16680 

CTACTTTTTG CATGGTGCAT CCTCCTTTTT ATGTAAATTC TCTTTGCCTA TTGTACTAAA 16740 

GATTGACTGA TTTGTCTTGT GTGTCTCGTT GGCACCTTTC AGAATAAACG AAGAGAAAGC 16800 

CTCTGAAATA GACAGACGGC TTTCTCCATT CGTTGACAAT CATTCAATTA ATTTTTTTCG 16860 

TCACGTAAAC GGTTTTCTAA TTCTTTTTGT TCTTGTTCAC GTTGGCGTTT GCGTTCTTCA 16920 

CGTTCAATGC GTAGACGTTC GCGTTTTTCT TCAGGATTTG GATCACTGAT CACTGTTCCT 16980 

GCTTCAATTT CTTCTAACGC ACGACCTACA CTTTTTACAG AATCAAAGGA TTCAACGGTT 17040 

GGTTGAACGC CTTCATCTAA TTCATGCGCA CGTTTGCTTG CT 17082 
(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

CTTTTAACAG AGAACCTGAC TATTAGTGAA AAAAGAAAGC AACAGCTAGC CTTTGAAGTG 60 

TCACCAGCTG GAGTAGATAT CACAGAGGAT CAGATTGAGT GGTCGAGTAG TGATCCTACC 120 

ATTGTGACGG TTGACCAAAC GGGTAATTTA ACAGCAGTTA AGAGTGGGTG AAGCGAAGGT 180 

AACrGTCAAA ATTAAaGGAA CAGAAaTTAG TGATACAATT CCTGTGACTG TTGTAGCAGA 240

AAATAAACAA TATGCGGAAA TGCGAGCAAA ATGGAAAATG CGATTGTTAG GTACAACGCA 300 
gTATGATAAC GATGCAGATG TGCAACAGTa TCGTGCCCAG ATTGCTACGG AGAGTTTAGC 360

ATTATGGCAA ACGCTTAATC AAGCAGCAGA TCGTGAGTAT CTGTGGGAAC GAAAACCATC 420 

GGATACAGTG TCTGCTGATT ACACGACTCA ATTTACCAAT ATTAAAAAAT TAGCGTTAGG 480 

CTACTATGAA CCATCAAGTG AGCTTTTTGA AAAACCAGAA GTTTATGATG CGATTGTTAA 540

AGGCATTGAA TTTATGATTG ATACAAAAAA ATACAATGGA ACGTATTACA CAGGTAACTG 600 

GTGGGATTGG CAAATTGGTT CCGCGCAgCg CTAACAGATA CATTGATTTT ATTACATGAT 660 

GACCTATTGA ATACAGATGC AGAAAAATTA AATAAATTTA CTGCTCCGCT GATGCTGTAT 720 

GCaAAAGATC CAAACATACA ATGGCCAATT TATCGTGCAA CAGGAGCTAA CTTAACAGAT 780 

ATTTCAATCA CCGTTTTAGG TACTGGACTT TTGTTAGAAG ATAATCAACG CCTAGTACAA 840 

GTACAAGAAG CTGTTCCGTC CGTTTTAAAA AGTGTTTCCT CTGGTGATGG CTTATATCCT 900 

GATGGTTCCT TGATTCAACA TGGTTATTTT CCGTACAACG GCAGTTACGG GAATGAGTTG 960 

CTAAAAGGGT TTGGACGAAT TCAGACTATT TTACAAGGTT CCGACTGGGA GATGAATGAC 1020 

CCTAACATTA GTAATTTATT TAATGTTGTG GATAAAGGTT ACTTACAATT GATGGTAAAT 1080 

GGAAAAATGC CATCGATGGT TTCTGGTAGA AGTATTTCCA GAGCGCCAGA AACGAATCCT 1140 

TTTACTACAG AGTTTGAATC GGGTAAAGAA ACAATAGCTA ATTTAACCTT AATTGCAAAA 1200 

TTTGCACCAG AAAATTTAAG AAATGACATT TATACATCTA TCCAAACGTG GCTTCAACAA 1260 

AGTGGGTCAT ACTATCATTT CTTTAAAAAA CCAAGAGATT TTGAAGCGTT AATTGACTTG 1320 

AAAAATGTAG TGAATAGTGC GTCACCTGCC CAAGCGACAC CAATGCAATC TTTAAATGTA 1380 

TATGGTTCGA TGGATCGAGT CCTACAGAAA AATAACGAAT ATGCGGTGGG GATCAGTATG 1440

TATTCACAAC GTGTCGGAAA CTATGAATTT GGGAATACGG AAAATAAAAA AGGCTGGCAT 1500 

ACAGCAGACG GCATGCTTTA TTTATACAAT CAAGACTTTG CTCAGTTTGA TGAAGGATAC 1560 

TGGGCAACGA TCGATCCATA TCGATTACCA GGAACGACAG TTGACACAAG AGAATTGGCA 1620 

AATGGTGCTT ATACAGGGAA ACGCAGTCCC CAGTCATGGG TAGGTGGCTC AAATAATGGA 1680 

CAGGTTGCCT CTATAGGAAT GTTTTTAGAT AAAAGTAATG AAGGAATGAA CTTAGTTGCT 1740

AAAAAATCTT GGTTCTTATT AGATGGTCAA ATCATTAATT TGGGAAGTGG CATTACTGGT 1800 

ACGACAGATG CTTCGATTGA AACAATCCTC GATAATCGGA TGATTCATCC ACAGGAAGTG 1860

AAGCTTAACC AAGGTTCAGA CAAAGATAAT TCTTGGATTA GTTTAAGCGC AgcGarTCCA 1920 

TTGAATAACA TTGGCTATGT TTTTCCTAAT TCmATGAATA CGCTTGATGT TCAAATAGAA 1980 

GAACGCTCTG GTCGCTACGG AGATATTAAC GAATACTTTG TTAATGATAA AACCTATACA 2040

AATACATTTG CTAAAATTAG TAAAAATTAT GGCAAGACTG TTGAAAATGG TACTTACGAA 2100 

TATTTAACAG TGGTTGGGAA AACGAATGAA GAAATCGCAG CTCTTTCTAA AAACAAAGGC 2160 

TATACTGTTC TAGAAAATAC AGCAAACTTA CAAGCCATTG AAGCAGGTAA TTATGTCATG 2220 

ATGAATACAT GGAATAATGA CCAAGAAATT GCAGGACTGT ATGCGTATGA TCCAATGTCG 2280 
GTTATTTCAG AAAAAATTGA TAACGGTGTT TATCGCTTAA CTCTTGCGAA TCCTTTACAA 2340

AATAATGCAT CCGTTTCTAT TGAATTTGAT AAGGGCATTC TTGAAGTAGT CGCAGCGGAC 2400

CCAGAAATTT CTGTTGACCA AAATATTATC ACTTTAAATA GTGCGGGGTT AAATGGCAGC 2460

TCGCGTTCAA TCATTGTTAA AACAACTCCT GAAGTAACGA AAGAAGCGTT AGAAAAATTA 2520

ATTCAGGAAC AAAAAGAACA CCAAGAAAAA GACTACACCG CAAGCAGCTG GAAAGTCTAC 2580

AGCGAAgCAT TGAAACA*VGC ACAAACTGTG GCAGATCAAA CAACAGCAAC GCAAGCAGAA 2640

GTAGACCAAG CAGAAACAGA GTTACGTTCG GCAGTGAAGC AATTGGTAAA AGTGCCAACT 2700

AAAGAAGTAG ATAAAACCAA CTTGTTGAAA ATCATCAAAG AAAACGAGAA ACACCAAGAA 2760

AAAGACTACA CCGCAAGCAG TTGGAAAGTC TACAGTGAAG CATTGAAGCA AGCGCAAACT 2820

gtggcagAtc AAACAACAGC AACGCAAGCA GAAGTAGACC AAGCAGAAGC AAAACTACGT 2880

TCGGCAGTGA AGCGATTAAC ATTGAAAAAT AGTGGGGAAA ATAAAAAGGA GCAAAAAAAT 2940

GGGGGGAATA ATGGACACTT AAATACTAGT ACAGGAGTTG ATCAAACTGG TACGAAACAA 3000

GTTAAGCCAT CAAGCCAAGG TGGTTTCAGA AAAGCTAGCC AATTTTTACC GAGCACAGGA 3060

GAAAAGAAAT CGATCGCGCT TGTGATTATT GGTCTTCTAG TTATCGCCAG TGGGTGTCTT 3120

TTAGTTTTTC GTAAAAGTAA ATCGAAGAAG TAAAAAATTA CGGATTCACT AAAAACATAA 3180

TAAAAAAGCA GACTTCCATA CAAGAAGTCT GCTTTTTCAT TAATTACTCC GCTTCTGCAT 3240

TTTGCAAGTC AGCGACATCA GTCAGATAAG TGGCTTCATC CAACAAATCT TCCGAATGTG 3300

CGACATAAAT CGCAGTGGCG CTATCACCAA CCACGTTTAG GCTAGTAACG AGCATTTCGA 3360

TTGGACGATT AATGGC7AAA ATCAATGAAT ACGTCGCAAT CGCAATCTCA TTGTTATAGT 3420

TTAAACCAGT TAAAATAGTG AAAAGGATAA TCCCGCCAGC GCCAGGTGCC GcCGGTGTAC 3480

CAATGGATGA CATTAAAGCT AAAATGGCAA TCAAGGCAAT ATTAGCTGGT GTCACTTCAT 3540

AAcCAGTACG CCCGCCACAA AAATAGCCGC AATTACTTGC ATAATCGCTG TGCCGTTCAT 3600

ATTGATAGTC ATTCCTAACG GCAAGACAAA GGCGGCAACA TCGCTGTCGA CACCTAACTC 3660

TTCAACCGTT GTTTTTGTAT TTAACGGTAA AGTGGCTGCC GAAGAAGAAG TAGAAAATCC 3720

AAAGACCACA ACTTTCATAA TCTTTTTAAT AAAAGGAACA GGATTTAATT TTGTCGCAAA 3780

TAAGATAAAT AATGGGTAAC CGATCATCAA AAAGGCTAAG AGCGTCGCTG TCGTGGTTAC 3840

GACGTAGACT AACGCTGGTT TTAAATAATC AmTCCCGTAA GAAGCAAACG TCCGAACTAA 3900


CAACATAAAG 


ACGGCAATAb 




zv^t a 21 T T ZIP n 


zv a n. tz t t a a zx a 


ATAACGTGAC 


3960 


CATCGCATTG ACTTCTTCAA TTAATTTTTT GAAAACACGA ATTTTATCCC CTAATGCATT 4020

AATACATAAA CCAGTAATCA CTGCCAACAC AACAACGGCT AAAATTGCAC CATTGTCGCT 4080

AAATGCAGAA CCGATATTTT TCGTGACAGC ATTGACAATC ACCATTAATG GGTTGCCGCT 4140

CGTACTTACT TCGACAGGAG CTAGCGTTTC AGATAACGAC ACATTAAAAG CGCCAGTTAA 4200

ATATACGCTA TAGCCCACAA TTCCAGCCGT TAATAGAGCA ATCAGTGACA TCGATAAAAA 4260
GCCAAAAATT GTTTTATAAG AAATTCGTCC TAATTTTTGC GTATCGGTAA TATGACAAAT 4320

GGCTAAAGTA ATTGAAGTAA AGACCATCGG CACAATCACT AATTGTAGGC CGTTGATAAA 4380

TAGTTGACCA ATAATATAGA AAAGCCCAAT CGCAGATTGT CCTTTTTCAG GATCTGAAAT 4440

ATCTTGGAAA AGAATCTGAT TGATTGTCTG CCACACAGCG CCTTGATTGC TGCGAATAAG 4500

GTGCTCTCTT AAAAATAAAA AGCCGAGCCC AACGATTAAT CCTCCCACCA ATGAAATCAC 4560

CATTTTTTTG GTAAATGAGA TACCTTTCAC TTCA 4594 
(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

TTTCGGTCAC TATCACCATC TCTTGAAAGA AGTTCAATGG CTACATCTCC GTTTAGCTCT 60

GCTTTTGCTT TATCTAAATA CTCTTTGGCT TTTTTGGCGT CTGTTTTGAC AAGAGCGCCT 120 

GCTTCTTGAC GAAAATCTTC ATTCGTTTCT GGGTTATATA CAAATTTTTC TGGAATGTAC 180 

CCATAAATTT CTTTTGAGCC ATCTGCTAAA ATATTATCGA CTAAACTTTT TTTATCCAAA 240

GCATAAGCTA AAGCTTTGCG GACATTCTCA TTTGCAAAAA TGGATGGTTT TTCGTTTCTT 300 

TTTTGGTTTA AACGTAAGGA ATAGACTTTT GACCGTTCGA TTGCTTCATA GTCTGGGTTG 360 

TCTCGATTTT GTTTAGCAAA ATCTCCTGTT AAGACAGCCA CATCTAATTC ACCTGATTCG 420 

TACAAATTAT AAACGGTATT GGTTTCTTTA AGAACTTCAA AATGAATCGT TTCTGATTTT 480

ACTTTTTCTT TATCGTAATA GTAGGGATTA CGCACAAAAT CCCACGACAT CGCTGACTGA 540

TCCCAATTTT TCATTAAAAA CGGACCATTA GATACAATCA TTTCACTAGA AGTGCCATAG 600 

GCTTGTCCTT TTTCTTTCAC GAATGCTTCG TTTTGTGGGA AAAATGGAGA AAATGCCAAT 660 

AACGACGTAA AATATGGTAC AGGCTTTTCT AAAGTAACCT CCAAAGTATA ATCATCAAGC 720 

GCTTTGACAC CCAATTCCTC TGGTGCTTTT TCCTCTTTTG TAATAGCTGT TCCGTTCAGA 780 

ATCGTTCCTT CTAACAAGAA AAAGTAATTG GCTTGGTTTT TGGGATTCGC TAATTTACGC 840 

CAAGCAAAAA CGAAATCATT GGCTGTTACA GCATCACCAT TGGaCCACTT GcCATCTTCT 900 

CTTAGTTTAA TGGTATATCG TTTGCCATCC TCAGAAATTT CTGGCATCTC TTTAGCAGCA 960 

GCAGGTATTA GTTGATCTTT TTCATCCAAT TGATATAACC CTTCAAAGAC ATTGGACTGT 1020 

GCGTTAATAC TGGCTTCATC TTGTGTAAAA ATAGAATCCA TTGAACCGAT TTCCGAGTTC 1080 

TCCATTAAAT TTAAAACTTT TTTCGAGACA CTGGTTTTCT CAGTTGCTTT CTCGTCTATT 1140

TTTGTwTCTG CTGTTGTGCC ACAACCTACC AAGACCGTAC TAACAAATAA CATAATCCCC 1200 

ATCATTTtTA ACTTTTTCAT ACTATTCCCC CTATGTGTCA ATCAAACGGC AACTCTTCCA 1260 
TGGAAAGCCG TTCTTGCGAA ACTATCAAGC ATGCGTAGGG CAAGCCCTCA TTTTCGTACT 1320

CAAATTCTCC CCGCTGGTTG TCAATTGGGT CTGCACGCAA TTGCATTTTT GCCGGCTGAA 1380

AGTCTGCCTG GATCTCACTA TTTGCGACCA TTCTAAAAAC ATCTAAATTG CCGAAATAAT 1440

GATTCCACCG CTGTCTATCC CCGTCTCGGT AAGCACCTCC GCCAAAAGAC AAATCCGCAA 1500

AGAGCCAGCC GTACGGTTTT ATATAAAATT GTGCCCAATC ATGACAGCCT GTATAATGTG 1560

TAGAAACATA TAATCCCGAT TGCCATTTGG CTGGTATCCC TGCAATCCGA CAAAGCGTGA 1620

TAAACAAAAT CGCTTGCACG CCGCAGTCAC CTTTTAAATT AGTTGCAGCA TACTCTGAAA 1680

TATTGGAAAT TGTAAAATAT TCACGCATGA AAGAATAATG AATCTTTGTC GTCACAAAGT 1740

CATAAATCAA GCGTGCCTTT TTTAAGGGaT TACTTTCATC GCCCACAATC TCATCTGCTA 1800

AATTTCTTAA ATAAGGCGTG AATCGAATAT GCGGTAACTG TTCCTCTAAA TCAAAGCTTG 1860

GTTGCATTTT GTCTACGTTA TTTTCTTCTA ACTGTTGGTA AACCATCCGA TTAACATAAC 1920

AATATTCAAC TGTAAACGTT TGATTTTCTT TTAAGATTGT TTCAAAACAC AGTGTTCGTT 1980

GAGGCGCATC TTCTGCCGCA ATATAGGTTG CTTCAGGTGA CGTAGCTAGG ATTTCAATTT 2040

CTGATTGTTG TTGACACGCT TTGGGCAGCG GCAAATGGAC ACGAACACGC TCACCTGGAC 2100

GCTCAAATTC TTTTTTCACC TGAATGGTGG CTTTAATTGT TGTTCTCACT TTTCTACCAC 2160

CGTTTTTTTG CATTTGTTCG ATGTTCTCTT TTAGTTCTTT TTTGCGGAGA ATTTCTAATT 2220

GATTGCTTTC TGCTTCCCGC AAACGTGCAA CATATGCCGC ATTTGTTTTA ATTAAATTGG 2280

CCAAAAATCG AAGTTGGAAA TACGGTTCAC CATCAATATA AACCCAGTCC ACTAATCCCG 2340

TTTCTTTTAA ACGAATTAAT TCCTCTGTTT GATAATCCTG AAAAGTATCT CTCATCATTT 2400

GATCAGCTTT TGcAAAGcTA TGAGGATACT CATtCATACG CATCATTTtA ATGgACAtCT 2460

TTTTCaATTA TCAAACGTTC TTTTaATGCC TGCGGAAGTT CTTTTGTCAA CCAAAAmTCA 2520

ATGGCAATAT CCGCACTTGC AAAATCGCCA AATTCTTTCA CCCTTTGAAT TTCCTCAGGC 2580

AAAGGAACAC TAAGATACTT TAAATCATAC TTCATTTACT TTACTCCTTT TTTAATGAGA 2640

AATGCCTAAT CCAGCTGCTT CTCCTAATTC CAATAATGGT TTTGCTTCTA AGGATACGCC 2700

CCCAGTTACG GGCGCAGTTT CTAAACCGAA TGTCGCATCT AAATCTGCTC GTGTGATATT 2760

CTTTTGAGCA GCAGCTAAAT GGGCAGCAnC CGTGATGCCA ATGGTTGTTT CTTCCGCCAT 2820

ACAACCAATC ATGCATTCGA TGCCTGCTGT TTCACAAATC TGATTAATTT TCAGCnCTTC 2880

ATGAATGCCn CCACACTTCA TTAATTTTAT ATTGATAACA TCGACTGTGC CTTTTTTAcT 2940


AATTCCAACG CATCTTGAGC ATCAAAACAG CTTTCATCAn CCATAATTGT C 2991



(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:



TTGGAPPPCC 


CAAATTCTTC 


CTTCGTCGAA . 


ATTAACCACT 


CCCCCTTGGn ' 


TGATTGGATC 


60 


AAGGTPTGGn 


TTCCTTTACG 


CCCCAGGCCA 


TTGGGGTGTT 


TCCACCAAAn ■ 


CAAACCTTGG 


120 


ATPPPGPTGG 


TTAAATTTCA 

X X ii^ixi XXX ^— 


CATATCACCc 


ATTGATTGGA AATCaAAAGC 


TATCTTcTTT 


180 


MAA x t\r\\2r\ 1 x 


ACTTATAAcG 


AGAATTTATT 


GACAAATTAA 


GACAAGAAAA 


CAAATATACA 


240 


p A A TP A A PT A 


ttpaaACGTG 


GCATAAAATT 


GTAATGAGGA 


TGATTAACGC 


TGCGGTACAC 


300 


B TiTP A A ATP A 


TPPPTAGPAA 


CACGCTAACA 


GGCTTTAAAT 


TTGATTTAAG 


TAATAATGTT 


360 


Lb i 1 bb 1 l<vl 


ptaa aaagga 


ATTACAGCGA 


TTTGTGGCGG 


TTTTAGAAAA 


CGAAGATATT 


420 




TT ZiTATTTTT 
1 Inlnl 1111 


AACTCTGCTA 


AAATCTGGAA 


TGAGGAAAGG 


CGAACTGATG 


480 


bbbb I otu 1 1 


P. P. AP.TP.AT AT 

Uurtu 1 Ort Xr\ X 


TGATTTAACC 


GAAAAATATT 


TCGATATCAA 


TTCTACACGT 


540 


bb i bA i i Abb 


PTPAAAATAA 


APPTAAAACA 


AAAACCAGCA 


TACGTAAAGT 


TTATTTTGAC 


600 


AAbxbbl lAb 


TPAPTTT A AT 
1 b Ab 111 AA i 


A A A A A A AT AP 


AAAAATCATG 


AGAAAGAACG 


GCTTTTCAGA 


660 


GAAbbbA i AA 


TTTT A APPPA 
111 iAAbbbA 


TAAGGAPTAT 


TTTATTTTAA 


GTTCTCGAAA 


TTTACCTATC 


720 


AAALAAI bAA 


P A ATT APP.T A 
o AA 1 1 AL* Uin 


TATGTTTCGC 

X X U X X 


CTGTTATGTG 


AAAAAGCAGA 


AGTTCAAAAC 


780 


A 1 AAbbb J. Ab 


APPPPPT A AG 
AbbbbV_ 1 rxttAJ 


APATAPCCAT 


GCAACGTTTT 


TAATTGAAGC 


AGGAGCAAAC 


840 


Al lAAblAbb 


TTTP A APTPP, 
1 JL lbAAblbb 


GTTAGGACAC 


AAGAATATTA 


ATATAACTTT 


GGATGTTTAT 


900 


APPP ATPTP.P 
AbbbA lul 


TA A A AG A AG A 


AGAAAAAGAA 


ACAGCTGATA 


TGATGGATAA 


ACTTATTGAG 


960 


A APT THAT 


TTGTGGTAAA 

1 1 U 1 uu x nnn 


CCTGTGGtAA 


AAACATTAAA 


AACCATTGAT 


ATATAAGCAC 


1020 


TTT APP APTP 


PPP ATPTGAA 


ATTGTAAAAT 


AAGTTGtTGT 


TTCTTTTCAG 


TCCAAGTAAA 


1080 


A AT AT ATTTP 
nn X r\ x r\ X X XV— 


APTAAAPGGC 


CATTGTTCCC 


TGACGAATGA 


AGAAAACTGA 


GTTTACTTGG 


1140 


P.PP APTP APT 


GTPATTTTTT 


CAGGTACAGC 


TAGTAAAGTC 


CTCCGAGCTG 


ATTGGTTCAA 


1200 


TTT ATP AGP, A 


AATGAAAAGC 


TTTGTTGTTC 


TTCAAAAAAC 


CAAACTTCTT 


GTGTTTCCTC 


1260 


TAAAAPAATT 


GCTTGCTGTT 


GCGCAAAACG 


AAGTTGTTTT 


TCTACCGTAG 


CAAGAAACTG 


1320 


TTPA ATTTTT 
1 iv^rUiii 1 i x i 


AAPGTTTGTT 

rtrt^«v3 1 X 1 <w 1 J. 


GCCAGCGGTG 


AATAGCGAGT 


GTCGGGAGCA 


TAGTCATAAA 


1380 


(ZC A A AP A ATP 


GTTAAAPTAA 
oil r\fv\\* i nn 


CAAGTGTTTC 


CATTAATGTG 


AATCCTGCTG 


ACTTGTTCAT 


1440 


TGPTTTTPTG 


CTGTATATTT 

X VJ X X ** XXX 


ATCTAACTGC 


TCTTTAGTAA 


TGTAGCCTTC 


GTTGACTAAT 


1500 


TCATTTAAGG 


AAGGCGTCTT 


ATTTTTTTCT 


AGTGTGTAGA 


GCTCGATTTG 


TP.ATTPT APA 
1 un ll^l 


1560 


ATTTTTACGA TTGCTTCATT GCCTTTTTTA TCAACTGTTT CTTTATGTTT CGCTAAGTTA 1620

GGGACAAAAA GTAAAATCAA TACGGAAATA ATCAATAAGA CAATCAACAT TTCTAATAAT 1680

GTAAACCCTG CGTATTTTTG tTTCTTTTTC ATTATAAAAC TCCTTCCATA TTGcCATAAw 1740

TAGGCAGCAA CaTTGCCCCA TAAACACCGA TAATTAAACA AGCAATTATG ATAAAAATAA 1800

GTGGTTGAAC GAGAGTAATG ACTCGTTCAA TXTTTTTGAA AAAaCCTTGG AAACAGACAT 1860

TACTATAAAC CATCAaCTCC TTGCCTAATT TTCCTTTTAC TTCCCCTTGT TGAATCATTA 1920
AACCCAATGC TTTCGTAAAA AAAGGATAAG CAGCTAATTG TCTATGTAAG GTTTCACCAA 1980

CTAGCAGTCC TTCACTAATT GCTCCAGCTA ATTCTTTCAT TAAGGGGTAA TTGGTTGTAT 2040

GGCGCATCAA ATCGATGACC TCTTTCATTT CTAATCCTTG TGCAAGCAAC TTCCCCCATT 2100

CTAGTGCAAA AAAAGCACTG ACGTATTCCC GATAAAGGCC ACGGATAATT GGCAGTTTCA 2160

GATATACGAA GGCCCGCTGA ATAGCAGACT GATGCTTCAA ATAACGGTTT AAAATGAAGA 2220

AACTAAATGA GAGGACAAAA ACTAAGGTAA CCAGAATGTA TGGACTGTAA ACAATCAGAA 2280

GAATCCCAAT ATTTTTAGAA TAACTATTGG CATCATTTTC CATTAATTGG GGCAATAAAA 2340

ATTGCCGAAT ACCAACAATG ACGAGAATCA AAAAAACAAT CAGGAGCAGC GGATAACTAA 2400

TTATTTTATA GAACTGTTGC CGATATTTTT CCGTTTGAAA AAAATGTTCG GAAATCGCTT 2460

GTAGTGTCCC CACCAAATCC CCGTGTGCAG TAGCAAACGA CAATTGTGCC GAGACCATTG 2520

TTGAAAATCC TAATTCTTCG ACTAAAGCTT GTAAGGCAAT GCCTGAGGAA AAGCCTTCAA 2580

CTAAAAAAGT TATACTGTGC TGAGGTAAGG CGTTTGATTT TTTCATGAAG GAAAAACTTT 2640

CTTGTATATT AAAACCATTT GCAAGCAAAT CCGCTAATAG TTTCAGAAAA AGCTGCTGTT 2700

GTTTTTTCGT TAATTTATTC GATGGCTTGA AAATACGTTT TTTCTGTAAT AAACCCATAC 2760

GCCCACGCCT TTCTTAGATT CTTTTGCCAA CTACTCTTTA TAACGGCTTC CTTAAAGAAG 2820

AAATAATCAT AAAGAATTCC CGCTACTTGT TCCTTATTAA TCGTGGTTGT CAAAATCTGT 2880

TGATAAATGA TACCTTGCAA GCATTCAGCC AATTCATGCT CTGGGCCACC TAATTCTATC 2940

AAACGCCGTT GCACCCCCAA TAAGTTACGG GCGTGAATTG TCGCAAAAAT AGTATGACCC 3000

GTTAAGGCGG CTCGAATAAC TGCTTGAGCC GTCTCGCCAT CACGAATTTC ACCAATAATC 3060

AAAATATCAG GATGATGGCG TAAGCAAACT TTTATCAAGG CTTCGTACGT TAACTGAATT 3120

TTTTCATTTG TTTGCAACTG TAAAAACTGC TTTTCCTCAA TTTCAACAGG ATCCTCAATC 3180

GCTATCACTT GTTGTTCGCC ACTTCTTTGT TTAGCCACAC GATACATGGT TGTTGTTTTC 3240

CCAGAACCAA CTGGCCCACT GAATAAATAA AGTCCTCGTT TCTGACAAGA TGCTTCAATT 3300

TCATTCAGCT GGTGGGGAAA GAAAAAATGT TCAGCTTGTt CACCaAATCG GTGCAAAAAA 3360

CGAATTACCA AGCTCTCTCG ATTCTGAAAA TCTCCCACAG ATGATAAACG CAAGCGTTGC 3420

TtCGTCTCGC CaATTAGATA AGTTGTCGCG CCTAATTGTG CTTTTCGTCT TTCTCCGACA 3480

TCCcATGCCG CcTATAAATT TAAAGTGGAA AATTAACTTT TCGCCTATTT CTGCGGAAAC 3540

TTGtGTGTAA AGTGTCCGTT GCATCCCTGT CCGAAAAGAG ATGGCTATTT CGGTATCATT 3600

GGGTAAAAAA TAAACGTCTT GCGCCCCATG AAGGACGCCC CATTCAATTA ATTGATGAGC 3660

CAATTTTTTT ACATTkTCAG TCATTCATTT CCTCCTCTCA CTAATAAACT ACGCATTTAT 3720

TTTATGATTT CAGCAAAAAA AAGAACCTTC ACCATTaAGA TGAAGGTTCT TTTTTTGAAG 3780

TTTGCGCTGT CCATCTATCT TCCCCAAAAT AGTGGAACTC CTAGACACTT AGTATTATAC 3840

TAAAAAAATC TTGCGTTTAG TTTTCAAAAA TTTCTCTGAA TCACAAAATT TTTGAGAAAA 3900
ACTATGATAA TCCGATTGGA TCGTTTACGT TTTTGGCTTC GTTTTTTGTT TATTAACTAA 3960

ATAAACTGGA ATTCCTAGAA GCGTAATCCC TATACCAATC AATGCTAGAA TAGGCTGTGT 4020

AATACTCGTT GTAACCAAGA TGAAGATACC ACCTAGACTT GCGATAATTG GAATAATTGG 4080

ATAGCCAGGA ACTTTATACG GTCGTTTTAA TTCAGGTTCT CGTTTTCTTA AAATAAAAAC 4140

AGCTAAGAAA ATTAATAGAT TAAACAGCCA CATAACAAAA ACTAACATAT CCGTCAATAA 4200

ATCGAAGGTT CCGAAAAACA TCATGATAAT GGCTACAGCT AACTGGAAAA TTCCAGCAAC 4260 

GTAAGGGcAC AACAAAGCGT TTAGgACAAC TTTTGAAACT GAACGACTAA AAGGCATCAG 4320

ATTATCTAAG GCTAAGGGCA TAAGGG 4346
(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 



CTTCATTCGA 


TTAATAAATT 


CATCCCAATT 


TACTTCATTG 


CCtTTAAACT 


CAGGTCCATC 


60 


AACACAGGCA 


AATTTCATGG 


TGCCATCAAC 


AGTGACCCGA 


CAACCGCCAC 


ACATGCCTGT 


120 


TCCGTCAATC 


ATAATAGGAT 


TcAAGGAAAC 


ATAAATTGGT 


AATTGATGTT 


TTTCAGCTGT 


180 


CAAGGTACAA 


AATTTCATCA 


TGATGCTGGG 


CCCAATCGCC 


CAACTGCACG 


CTACAGATTC 


240 


ACGTGCAATA 


ACTTCTTCCA 


TCGCTTCTGT 


TACTAAGCCC 


TTTTTACCTA 


ATGAACCATC 


300 


ATCCGTTGTA 


ATGATTAGCT 


CATCTGaATA 


mCGACGGCAT 


TCTTCTTGTA 


ACAGAACTAA 


360 


CTCAGCTGTT 


CGTGCTCCCA 


AAATCGTAAT 


CACTCGGTTG 


CCTGCTTCTT 


TTAAAGCCtT 


420 


GATAATTGGA 


TAAATAGCCG 


CGATACCAAC 


CCCGCCACCG 


ACAACCATCA 


CCGTTCCATA 


480 


TTGCTTCACT 


TCTGTGGGCA 


TACCTAAAGG 


TCCAGTGATA 


TCCAACAAAG 


AGTCTGAAGT 


540 


TTTAAGTTGG 


GAAAGTGCAG 


CTGTTGTCTT 


ACCGATTACT


TGGAAAATAA 


TTCGAATACT 


600 


GCCTGCTTGT 


GGATTTGTCT 


CCGCAATCGT 


TAAGGGGATC 


CGTTCTCCCT 


TTTCGTCAAT 


660 


TCTTAAAATA 


ACAAACTGTC 


CAGCCTGAGC 


ATGTGTCGCT 


ATTCTTGGCG 


CTTCCAGCCA 


720 


AAAATCAAAT 


GTATTACTAG 


ATAATTGAGT 


CGATTgAAGA ATATTAAACA 


TTGAAATACC 


780 


TACCTTCATC 


GTTCCTTTTC 


ATTCTGACTC 


CTACTTTCTT 


AAACCCTTAG 


GCAACGCTAT 


840 


CAGATAAAGT 


TGTGGCGACA 


CTTTTAACAA 


ATGTTTTTAG 


CATTTCGATG 


TCTTCTTCTT 


900 


CTGCATTGTT 


TTCAATTTCA 


ACTGTTTCAG 


CTACTTTTTT 


AGCACCAGCT 


TTTTCAAAAG 


960 


CTCCTACAAA 


ATCATGCGCT 


GATTTACAGA


AAAATTCACC 


ATATTCACGA 


TCACCGGAGC 


1020 


CAACAACACC 


AAATGGCTTA 


CCTGCTAAAT 


CCTTCTCAGT 


TAATTCATCA 


AAGAAATCTT 


1080 


CAAATTCAAA 


AGGCAGTTCC 


CCATCACCAT 


AAGTATACGT 


TGCGATAATA 


CAAGCATCTG 


1140 
CATCATCAAA


GAAGTCTACA 


TCAACTTCTG


AACATTCTTC


ACGTTCTACT


TCAACTCCAG 


1200 


CTTCTTGAwA 


CTGTTCTTCC 


AAAATCTCGG 


AAATTTCTTC 


TGTGTTTCCA


GTCATACTTG 


1260 


CATAAACAAT 


TTTTACTAGT 


GCCATCTTCA 


TTCATTTCCT 


TTCTTTTTCC 


TAAATATTTC 


1320 


AATGCATTCA 


AATGCAATTG 


TAACCCCTTA 


CATTAATCCT 


TAAAAAAATA 


ACTTTTACAA 


1380 


ATGATTTTAT 


TCACAAcTAA 


TCTAACTAAG 


TATGTACTCT 


TCCCCTTTCG 


AGATAGAGTA 


1440 


TTTTTTACTA 


CGTCTTTACT 


ATACCACATT 


CCAAACCTTT 


TTAACAATAT 


TTTTAAGGTA 


1500 


AAAAAAATAT 


TATTTAGGTT 


CTTTTTTATA 


AAACGTTTTT 


CCTTCTTCCA 


ATCATAAAAA 


1560 


TTGCTTCTAA 


AACCGCTCCC 


CCTAATGCAC 


GAGCTTTGTC 


GGAGATCGTC 


AAGCAATCGA 


1620 


CTTTCTCAAC 


TGGTCGAGGA 


TCTACATCTG 


CACACTTCAA 


CCCTTGATAA 


CATGTTACCT 


1680 


TTTCTGAAAT 


CAATCCTCTC 


AAAGTCCCTG 


TCAGAGGCGA 


ATAAACTGGA 


ACCTGATCGA 


1740 


TTAAAAATAA 


TGCTTCCCCT 


TTCAACACGA 


GATCCCCTAT 


GTTTTTAAGG 


TGGGTAACTT 


1800 


GTCCTGAAGC 


AGGGGCATGG 


ACAACTCGTT 


CCGCACTTTT 


TCCGCCAATT 


TCACCTGGAA 


1860 


TACCTGTATT 


AGGTAAAGCT 


GTTCCTTCAA 


AGTATAAACG 


TCCCAACCTA 


TGGCCACGCA 


1920 


TTGTCTCAAT 


CACAACATCG 


ACATCTTGCG 


GTGCTGAAAA 


GCCTGGTCCT 


AAAGCAATCG 


1980 


TAATCGGTGC 


CATTCCTCGA 


TGCGTACCTA 


AATTTTTCTT 


AGCCAAAATC 


GCATCGATAA 


2040 


CAATCAGTGG 


TTTCAATTGT 


TGAATAGCAC 


TTGCTGTTTG 


ATCGACAAAA 


ACAGGCAGCT 


2100


TATTTTGCTG 


CCAACATTTC 


GCGCAATCCT 


TTAATGAAGT 


AATTCGAACA 


GCAACCAAAT 


2160 


CTTCCACTCT 


TTGTTCTTTC 


TGAAAAATAG 


CATCACAAAC 


TGAAACAGTC 


GGCCGAATGG 


2220 


CCAGAGGACA 


TTCTGTTTCT 


AATACTAGTA 


TTTTAAAGCC 


AACGTGCCAT 


AATTTTTGGA 


2280 


TAACACCTGT 


CGCTAGATCG 


CCCCCTCCGC 


GAACAATCAC 


TAATTTTTCT 


TGGATGTTTT 


2340 


GACTGATTTC 


TCCCATAAAA 


TAAGTCCTTT 


CTCTAATTCC 


ATATTGCAAG 


CAATAATCTT 


2400 


GTCTAATTTT 


TCAAAACAGC 


TTTTGGGTAA 


TTGAAGACAG 


ACATTCATCG 


CTTGAACGAA 


2460 


CTCTCTTCGC 


GTATGGGTTT 


GATTTAAACA 


TAAAATTCGT 


TGGCCTTGGC 


TCTCTTTAAA 


2520 


CAATCCTTTT 


GGATGATCAA 


TCAGTTGAGC 


TAACAATGTT 


TCTGTGATTG 


GTTCTTGTAG 


2580 


TTGCGCATCG 


GTTAATTCCA 


AAAAAAGCGG 


CAACCGATGA 


ATGGTCGTTT 


CATCTATCTT 


2640 


TTGACCAATC 


ACAGAAATCG 


GAAGCACACC 


AATTGTGGTG 


GTTGTTTCTT 


TTAATATCAC 


2700 


GGGTTCATAA 


GAGGACCAGC 


CTTTCAAAGG 


TCGTTGTTTG 


GACCCATCCG 


CTTCTAAAAA 


2760 


AACTTTCGTG 


AAGTGCTGAA 


ACAATGTTTG 


AAACGAGGTC 


ATTGCAGGCA 


TCGTtAGTTT 


2820 


TGTTTGGTTG 


TTGATAGCGG 


TTCCTGCTAA 


AGTGATTCCG 


CAGATATTCG 


GCGTTATTTT 


2880 


TTCAAAATGG 


TTTGTATAAA 


AATAATCATA 


CAATTGACGT 


TGGGGAAACC 


CAATTTTTGT 


2940 


AGTTGTACTA 


ACCAGAACAG


CTTCCTTTCG 


ATATGTTTTT 


GCTAGCGCCC 


ACATTAAGCT 


3000 


TGTTTTCCCA 


CCGCTACCAA 


CAATCGACAC 


AATCTCCTTT 


CCTTGTAAAT


CAAAACACTT


3060 


TTTTAAGGAC 


TCCACAGTTG 


GTACCTCCTT 


TCCCCAAGGT 


CATTTTTCAC


AGAAAAAATA


3120 
GAgGCTACGC GTACTTTTGC


TTCTTCTAAA 


AAACATACAC 


TGTGAGTTAT 


TTCCGCTTCA 


3180 


TTAATGAGTA 


AACCGATCCC 


ACAGCCATCC 


GTCAATGATG 


GGGGCAAAGT 


AAAGACGCGA 


3240 


ACAGCCACCT 


TTCTTGCCAA 


CAGTACTTGT 


TCCGCCTGAA 


CAGCATAATG 


TGTATTCGGA 


3300 


AAGGTTAGCA 


AATACTCCAT 


CTTTTCACCT 


TCTTTTTTTG 


TTAAAGCGTA 


ATTACTTTAG 


3360 


CCGCTTGATT 


CATTGTTTCA 


ACAATTTCGT 


ACATATTAGT 


AATAGAGCCG 


ATAGCTAAGG 


3420 


TGTCTGTTAA 


ATGATAAAAA 


TCTAAACAGG 


CCCCACACGT 


TTGAATCGTG 


GTTCCCTTCT 


3480 


CCGCTAATTG 


TTGCAAATCA 


GCTAAGGTAT 


TCGCTGCTTG 


ATTCGTTAAA 


AACGCCCCTC 


3540 


GATTAAAGAA 


TAATAGCTGT 


GTAGGAACAG 


TTTCTGCTTC 


TGATAATGAT 


TGAAGATAAC 


3600 


TTTTCATTAA 


TAGTCGTCCT 


AGCTGCTCGT 


CTCCAGTACC 


TAATTGATCG 


GAGCCAATCG 


3660 


TAATTACTAA 


ATCTGTTGCT 


GACTGACTTG 


TTTGTTGAGT 


CGCTTCTTCA 


GGTACCGTAA 


3720 


TTTTGACAGA 


GAAAACCGTG 


GCTTCTAtnT 


GctTCGCAGT 


CAAAGTGGCC 


TGTTTTTTAG 


3780 


TAGCTAATTT 


TTTTAAATTT 


TCAATGGCTA 


CTTCGTTATC 


GACTAATACC 


TCGATAGTGC 


3840 


CTCCAGCTAA 


ACCTAACTCC 


GCCAAAGCTT 


TTTTTGTTTC 


AATCACAGGT 


ATTGGGCAAG 


3900 


GTTTCCCCAA 


TGCATCTACT 


AATTTCATAT 


TTGGCCCTTC 


TCTCTATAAA 


AGTACTACTG 


3960 


CTTGTTGTTC 


TTTTGGCAGA 


ACCTCGCCAA 


TAATTTTGGC 


AACAGGGTCC 


TCTTTTTGAA 


4020 


TAGCTGCTAA 


ACAGGCTGCC 


GCTTCGTCAG 


CAGCGACACT 


AATTAACAGT 


CCGCCAGAAG 


4080 



TTTGTGGATC 


AAACAAAAGT 


TCTTGCATTT 


CCATAGAAAT 


TTTTGAATAA 


TCGATTGTCG 


4140 


TTCCGATTGT 


TTGGCGGTTT 


CTTTGCCCAG 


CTGCAGTGAC 


TAAACATTCT 


GCGGCGTAGG 


4200 


CCAACGCTCC 


TGTAATGGTT 


GGTAATGCTT 


CTGTATCAAT 


CAGTAATGTT 


ACCTGATTAT 


4260 


CAGCCATTTC 


TTTTGCATGA 


ATTAATAGAC 


CAAACCCAGT 


CACGTCAGTA 


CAAGCATGTA 


4320 


CTGCGTAGGA 


TCGAGCATTC 


GCAGCGGCAT 


ATTTATTTAA 


ACGTTCCATC 


GAATCTTGTG 


4380 


CGATTTTTTG 


TTCCGTCTGc 


TTCGCCAATT 


GCCCCCGAAC 


AGCCGCTTGA 


ATAATTCCTA 


4440 


CACCTAATGC 


TTTAGTTAAA 


ATCAAGACAT 


CCCCCACCTG 


TGGCGTATTA 


TTATGCCACA 


4500 


AATTTTCTGG 


ATGAACTTCT 


CCGGTTACCG 


CTAGACCATA 


CTTGGGTTCA 


TGGTCATAAA 


4560 


TCGAATGGCC 


TCCCGCCAGT 


AAAGCACCCG 


CTTCCTTTAA 


TTTTAAGCTG 


CCACCCAAGA 


4620 


GCATTTGTTC 


CAACCATTCT 


TTTGACATCT 


TTTGTGGAAA 


ACAAACAAGA 


TTTAGCGCAT 


4680 


AAAGGACTGT 


TCCGCCCATG 


GCATAGACAT 


CACTTAAAGC 


ATTAGCGACC 


GCAATTTTTC 


4740 


CAAATGATAA 


CGGATCATTG 


ACCATTGGTG 


AAAAAAAATC 


TACCGTTGAA 


ATCAAAGCTT 


4800 


TTTCAGACGT 


TATTTGGTAA 


ACAGCGGCGT 


CGTCGGCATG 


TTCAAATCCG 


ACGAGTAGCC 


4860 


GCTCATTTGC 


CGGTGTTTCG 


GGTAAGTGTT 


GCAACAATTG 


GTTCAGTACA 


CGGGATTCAA 


4920 


TTTTTGCCCC 


ACAGCCACCT


GATGTACATT


GTGATAAAAA 


ATCCATGCTG 


TCCGCTCCTT 


4980 


CTCAAATTAG 


TAGTTCTTGT 


AAAGCGTGAA 


TTGTCCAAGT 


AATTTCTTCA 


GGCGTTGTCT 


5040 


GCCAACCAAA 


GCTAAAACGT 


AAGGTGCCAG 


TTGCTAAAGT 


CCCAGCTGTT 


TGATGGGCTA 


5100 
ATGGTGCACA
CCACTATTGT 
CGTCCTTGGT 
TTAAGGTTCG 
AACTATTTAA 
AAAAAGAGGG 
TGACCGCTTC 
AGCTTTTATG 
AAAATCCCGC 
GAAAACATTC 
AGGCTTTTGT 
TTTGACAAGC 
TAGCTACTGC 
CCGCTAATGC 
TGGGTGCCTG 
TAAGCAAAGG 
AAGCCTCAAT 
AGTCCTTCAT 
TCTGGCGTGT 
TCTGGAAAGC 
TGTCGAAATT 
ACTACAATTT 
AATGGTTGAT 
CGAATACTGC 
GGTAAAGGAA 
AATTGATTCA 
AATTTATTTT 
GACATGAGTT 
ACTTGGTCGG 
GATCGTTGCT 
AGCATGAACC 
TGCATATTTT 
CATGCCCATT 



PCT/US98/08985 



ATGCAAACCT 
TTCTTCCTGA 
TCCTAAAATT 
TTCATGCTTT 
GCTTAGAATA 
TTGATCAAAG 
AGCCCCTCGC 
TCCAGTAAAG 
CGTTTGAGCC 
TTCAATGGGT 
ATTCGTTCGC 
AACATATGTC 
ATGATGTTCT 
AAGATTTAAG 
AAAAAAGTCT 
GGCATTTCTT 
GGTTGCTGGA 
ACGTTTTTCC 
CGATATCTTT 
GTTCATAAAT 
GATTACCGAA 
TATTTCTCTG 
CGCTAGGTAA 
TACTTTGTCC 
ACGCTGGCTG 
CTAAGGTTAA 
GGCCCATCCG 
GTCTCCTTTT 
AGCTGACCAT 
AACTCACCTA 
AAAATTGTTT
GCTTTAACAT 
GCAATCCCAC 



GCGCGCGTCA 
TTCCACAGTG
GTGACAGGTA 
TGAATCGCTG 
CCTAAACTGT 
GAATTAGAAT 
TCACTAAAAG 
GCTAACACAT 
GCATCCAAAA 
AAAATGGTGC 
CAAGCTCGCT 
ACTGAAATGC 
AACATAGTCG 
GAAGTTGTTG 
GCTAACAATT 
CCGTAATTCA 
AATTTTTTAT 
TCTCTAACTA 
CAAACGGCCA 
CATGCGTCCG 
TAATACTGGA 
ACATTTATCT 
GTATAAATAG 
TTCTTGCCAT 
ATTCTCAGGC 
CGTGTGCTCT 
TTGTGAATTG 
ATCTTTTTTT 
CCAAGGCAAA 
CGCCTTTAAT 
CGATTGGTGG 
AGCCTTCTTC 
CTTCAATTTG 



TAATGCCATA 
TGAXAGAAAC 
ACCCTGATAA 
CTAAACCAAT 
TTAACGTACC 
GGCTACCAGT 
CTAACCCACC 
CAATGGCCAT 
TAGTTAGTAA 
CTAAGACATT 
GAATATCTTC 
CTCGTTCTTT 
TAATCACATG 
CGTTATTCGT 
GACGTGTCTC 
TGAACTGATT 
AGGAAGTCGC
TTGGTTTCTT 
GGAGTCGCTA 
CCTTTTTCAC 
CTACTAGGAC 
AACACGAGTT 
CCAGCTCCCC 
TGATGATTGA 
GAAACAACTA
AAAAATGTTT 
CCCGATGCTA 
CTTTTGATAA 
ATATGCGCCT 
TCCATAAGCT 
TACTTGAGTG 
TAAGGCAAAT 
ACCTTCAGCA 



TTGTTCTGCT 
AACAGGGACA 
ACCATTTAAA 
TTTATTCAAC 
AGCTTCAAAT 
GCCGCCCGCC 
AATACCTGCT 
CTGGGTCATT 
TCCTTTCTGT 
GGATGCGTGT 
TACTTCTAAC 
CTCAAGTAAG 
GTCGCCAGGT 
AAACGTAATC 
TAAAAGAGGA 
TTCTGCTAAA 
TGCATAATTT 
GAATTAATTT 
TTCTCAGCAT 
CAGTCAATGT 
AGCCATCTTT 
GTAGCATTTT 
GAGCTTGTTC 
GAATCAACTG 
AAATCCGTTC 
TTCCTTGATA 
GAATAATGAC 
GGGGTATTTT 
GCTAAAGCTG 
AAACCTTCGT 
GCGTTCACTA 
TTTTCAGTTA 
GCTCGTGGAT 



AGTTGCTGTG 
GTTTGTGCCA 
AAATTCTGCA 
TCTTTAATAG 
TTATCTGGTA 
ATCAGTGGTT 
AATCCATATA 
TTTATTGGTA 
TGAGCCCATT 
GTCATCACCA 
AGGCCAGTTT 
TGAAGAGGTC 
TGTAAAATCC 
TGAGCCGCAC 
AGCCCCTCTC 
TAGGCAGTCA 
AAATAAACCA 
TTGGTACTCT 
CTGAACTGCT 
TAATAGTTCT 
TTGTAATGGC 
AGGTGTTAAC 
GGTTCCTAAA 
AATGTTCTTA 
TAATACGTCT 
TTTTAAGAAC 
ACTAACGAGT 
CTAAAGGCAG 
GTGCGGTAGG 
GAAGATTGTC 
ATCCTAGCGT 
AAGCATACCC 
TCACCACTTG 



5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 
CCCCATATCA
TTCAGCAGCA 
ATCCGTTTTC 
TCCCATATCA 
GAGTGACTGC 
TTCCGCAACA 
GGTTGCCATC 
AACAGAAATA 
GAAACTAGAA 
TTCTTTTAAC 
GACGGCATTT 
TAAATTACTT 
TGGCGGATTA 
ACCTGCCGCA 
AGTATCCGCA 
CGTAAAGTCC 
TAGCAGTACT 
ACTCATGTCT 
TTCCTTGTCT 
CGCACTAAAG 
AAAGGCATGT 
AACACTTCCT 
GaATAGCTGG 
CGTAATCAAC 
CGAGTGCTAC 
AATCGGAAAT 
ACACAGCCAC 
CGCGAGGATA 
CTACATAAAT 
CTAATTTACC 
CTAACTGAAT 
TAGCTTTCTG 
AAATAACCAT 



TACGCAGCCA 
TAACCATAAC 
GCTGAATATT 
AGCTCATACC 
CTCGAAGCCG 
ATCATTTCTG 
CCTTGACCGA 
ATACAACGCC 
ATTCCAGCAA 
GCTGCATTTT 
TTAAAGCGTA 
TCAATTGCAA 
TTAGTATATA 
TGTGTACAAG 
TAAATCACTG 
ATTTCCATGC 
TTTACTGGTT 
TCTTTTCCAC 
AGCCCTAGCA 
AGTAAAATGC 
TCATTAAygG 
CTGCATgCCA 
TGCACCTTCT 
TTCCACTAAA 
TGCATCCCCA 
AAATTCTAAA 
ACAGTCAGGA 
GGCACTCCGC 
CCCTGTTCCT 
ATTTGAATGk 
GGCTTCTATT
AATGTCTTCT 
CCCAGGGATG 



CAAATTTAGC 
CAGCATGACT 
CCCCATAAAA 
GTAATTGCAT 
TTGTCGTTCC 
GTGGTAAATT 
TACAAGCAGC 
CTGTATCAGG 
CTTCCGCTTG 
TCGAGACCAA 
TTTCCCATGG 
AAGCCGTTTG 
CAGCAAACCC 
CCCGTTGCAA 
CTTTCATTGC 
CATGGCGTTT 
TCTTCAATAA 
CAAAACCACC 
TCCGAGCCAC 
CCTCACCTTC 
TACGGAATAA 
CGTACCAAGT 
GCTAAAGCAG 
TTTTTTATCT 
ACATACCGTG 
TGACCAATTT 
TGCGCCAATG 
AAAGCAGATG 
AAGGTTTTCT 
TCTTCTGGAA 
ATTTTCACAT 
TTTGTGGGTT 
CAATACCCGC 
TACTTTTCCT
AACTGGACTT 
TTCTTCGCCT 
TGCCGCTCTT 
TGAATTCGGT 
TAATGTTTCA 
ACTTGTGCGG 
TAGCCCCACA 
TTCATAGACG 
TTGTCCATTC 
TGATAGTCCG 
GCAAACACCA 
TTCCACATCA 
TACCGGTCCT 
AGTTAAATTG 
GGGATGAACC 
CCAAGCGGCT 
ACCAACAAGT 
TTCCCGTTGT 
AGGTTGTGCA 
TGTyCTGTCA 
GTTCaTGaGA 
CTTCACAAGA 
CTGGTAAGGT 
TTATTTCTCC 
TATTATTTCC 
CTTGGGTACT 
CATGaAGCAT 
CCACTGCATC 
TGCTCGCTTC 
ATCCTGTACA 
CGGGTTTCTT
ACTGAACTGC 



TTGTCATCTA 
TTTTTATCAT
TCCAAATCGC 
CTAGTTGCTT 
GTTCTTCTTG 
CAGGCAATTT 
ACATGTATTT 
CCCACGCCAC 
TCTTTAACCG 
GGTAAAGTAT 
ACTTTTTCTG 
AATCCGCGAA 
ATCGTTtGAT 
CCTAAAGAAG 
CCTTCTTCAT 
ATCAAACTTT 
AAGGATGCAT 
TTTGATTGAA 
TCATCATAGA 
ATCGCACATT 
CTACAAACGC 
CAAGATATTG 
AGTAAGCGGT 
TTCTTTTCTT 
TTCTGGAATC 
TGGAACGTCT 
ATCGATTGAT 
TCCTTCTATG 
GACCCGTTGG 
TTCACAAAAC 
ACGACAAATG 
GTTCAATAAT 
ACCCGTTTTT 



GAATAGCTAC 
TAATTAATGG
TTAAAGCACG 
CTCCTGTGAA 
TATCAGGGGc 
GCGTCGTAAC 
TTCCTTCTTC 
TATTTTTAAA 
CTAACAAGGC 
CTCCTGGGGC 
CTAACAAGTT 
ATGCACCTGC 
ACTTGTAGGG
CATAGGCCCC 
CACAGCCGGT 
CTTCACGGCT 
GATGCTGAAC 
CATGAATTTT 
TACTTTGACC 
CGGGCTCCAT 
AGAwTTTTCT 
CCCTTTTCAT 
ATCATTTCTT 
TTTGAAACAA 
ATTACATCCC 
TTAGCTGTGA 
AGAACTTTTG 
TCAATGTCAT 
AAGTCTTCCC 
ATTTTAGCAG 
TTTCCTCGTA 
CCTTGTGCCG 
GCAAACGCAT 



7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 
AGGCATAGAC


ATCTTTTTGC 


CGTTGGCTTA 


AACCTTCGAT 


TGTCGTGACT 


TCTTTACCAG 


9120 


CTAATTTTTC 


TAGAGTAAAC 


AAACATGCTT 


TGGAAGCCTT 


TCCATTTACT 


AACACCGTAC 


9180 


ATGCACCGCA 


AGAGCCTTGG 


TTACAGCCAT 


CTTTTGTTCC 


AGTTAGGCCC 


AGATCCTCCC 


9240 


GCAAAAAGTC 


CATTAATTTT 


TTGTTTGTTT 


CACAGGTTTC 


ACTTTGACCA 


TTTACAATAA 


9300 


TTGAATACAT 


TTCTTGCACC 


TCGATTTCTG 


ATATrGTTTA 


GTGTTCAATC 


ACCkrGATTA 


9360 


TACCACAAAT 


CTCCGTTCTA 


AAAATTATTT 


TTTAGCAAAA 


AAAGTTTTTT 


ATTTAATTTA 


9420 


AAGGTTCGCA 


TTATTGATAG


CGCTGTTGAA 


TTAATTCTCC 


CGCGATGCTA 


ATCGCTAACT 


9480 


CTTCTGGTGT 


TTCGGCTGCA 


ATGTCTAAGC 


CAATTGGCAT 


GAAAATTCTT 


TCAATTTCAG 


9540 


TCATTGTATA 


ACCGACCTCT 


TTTAACCGTC 


GAACTTGTGA 


AGCAATTTTG 


TGTTTGCTCC 


9600 


CCATCACACC 


AATATATTTG 


GCTGGTGTTT 


CCAATAACTG 


TTTGGCAACG 


ATGAAATCAA 


9660 


AATTGTGTCC 


TCTGGTAATT 


ACTATACCAT 


AATCGTCCGC 


CGTTATTTCA 


ATAAACGCAG 


9720 


CTAAATTTTC 


TAAAGGAATC 


ACCCGTAGTT 


CTTCAGCTTG 


TGGAAAAGCT 


TGTGCCGTCA 


9780 


AAAATTCTGG 


ACGATCGTCC 


ACTACAACGC 


AATAAAACGA 


TAAATAGTTT 


AAAATAGGTA 


9840 


CCAATGCGCG 


TGCAACATGA 


CCACTTCCAA 


AAATATACAC 


TTTCCCTACT 


TGGAGGATCG 


9900 


CTTCTGTTAA 


AAAAATTATG 


TCCTCTTTTT 


GCTCACAATG 


GATACCTTTA 


GGTAGCAAAC 


9960 


TTAATTCCCC 


CTCCCAAGAA 


GTGGAACCAA 


TCAATTCGCC 


TGTTTCACTA 


TAAAAGGATA 


10020 


CTTTTCCTAA 


ATTGTTTTGA 


TCTAGTTCAG 


TCACTAACCA 


ACCTTGTTGG 


TTATTTTGAA 


10080 


AATGCGTTTC 


TATGGATTCA 


CACAACTCCT 


TTAAAAATGG 


CGTTTCCCAT 


GAAAGATAAT 


10140 


GGAAAAAAAT 


CGTGACATTC 


CCGCCACAGA 


CCATACCTAA 


ATCAGCAATA 


TTATTCGGGG 


10200 


CTAAAATAAA 


TTGACGTTGT 


GCATTTTCTT 


TCTCAAGAAC 


TAGCTTTTTT 


GCCACTTGTT 


10260 


CAGCCTGAAA 


TTCAACCGCC 


CCACCACCAA 


CAGTTCCGGC 


TATTCTACCT 


TCGGCGGAAA 


10320 


CCAGCATTCT 


AGCCCCTTGA 


CTTcGTGGTG 


TGGAtCCtGa 


ACTTGTACCa 


ATCGTTAATA 


10380 


ACaTCGTGTC 


TTGCTTTTCC 


AAtATTGCCT 


GtTTCAAACG 


CTGAAATACT 


TCTCTCATTT 


10440 


CAGTTCCTCC 


GATACTACAT 


CCTGTTCTTT 


AAATATTTCT 


TCAAAGCATG 


CCTCTACCTG 


10500 


TTGATACACT


TTTTTCTCAA 


ATAAAAGATA 


TTGATCAATC 


AACTCTTTTC 


CTTGA 


10555 


(2) INFORMATION FOR SEQ ID NO: 234: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 
CCCACTCAAC GTACATTTAA AACATACCCT ATTACTATAC AGGATTCCCT TTACTTTTTC 
AAGATGTTTT TATGATTTCA AGACAATTTT TTTATCCGTT ATTTTCTTCA TGAAATGCGC 
ATCATGCTCA ATCACTAGCA TTGCAGGCTT CACAGATAAG ATTAGCGCTT CTAATTGTTG 
ATGATTAAAC ACATCCAAGT AATTAAGGGG TTCATCCCAA ATATAAAGCT CTGCTGATTG 240

AGACAATGAT TTGGCTACTT CGACTTTTTT CCGTTGCCCC ATACTCATTT GTTCAATTCG 300 

ATTAGTGAAA ACGGnCACGC TCCATCCCAA GTTTTCGTAA GTTATTTAAA AATTGGGTGT 360 

AATCTAACTG ATTTTTCTCT GCAAATTCGG aTAAAGTTCC TTGATTGTCT TCATAATCTT 420

GGCGCACATA AGAAATGGTT AATTGGTGaG CCAAAGTGGc TTCGCCTTCT GaATCCCCAG 480

AAAAATTaTC CAATAAAtAC TGaATCaAGC TCGrTTTTCC TGaGCCATTT TTTCCTGTTA 540

TTCCGaCAAT TTCTCCCGCG tTTATTGaAA AAGaAATTGG cGCmAACAGC CAATTTTTCT 600

CGTAGCCTAG ACGAAGCTCT TCCACCGTCA ATAATGTTTT GTGATGCGTT GGCTGATAAT 660

CCATTGACAA AGAATCAATA TACTCAAGAT CTTTTAATAG TnTTTCTTTT TCTGCTAAnn 720

GTGTTTCGGC GCGTTGTTGG AATATGTTTC GA 752 
(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2779 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 



TTGATATCCA 


GTTTGTAATA 


ACCATGGCTA 


AAACAATCGA 


AACAATCCCT 


GGATCAAAGA 


60 


CTGTTAGCAT 


CAAGATCATA 


ACAACTAAGT 


TAGGAATACC 


AGAAAGAACT 


TCTAAAATAC 


120 


GTTGCATTAC 


TGTATCGACA 


CGACCGCCGA 


CTAACCCGGA 


AATCAGTCCA 


TACGTTACCC 


180 


CAATCGTAAT 


ATCTAGAATT 


GCCGCGATAA 


ACGCAATTAA 


TAATGAAATA 


CGTGTTCCCA 


240 


TAAATAAACG 


ACTTAGGACA 


TCACGACCTA 


AACCATCTGT 


TCCTAGATAA 


AAGTTTACGT 


300 


TATCTGGAAC 


GTTTGCTTGT 


GCATATTTAT 


CAACTAATTG 


ACCAGCAACT 


GTTGTTTTAC 


360 


CATTTAAACC 


ATTCACGCTT 


TCCAAGCCAG 


GAATTCGTGG 


TGGTAAGTTA 


ATGTAAGAAA 


420 


CATTTTGTTT 


CGTTGGGTTA 


TGTGGTGAAA 


CCCAAATGGT 


AATAATAGAG 


ATAAAGATAA 


480 


TAACTGCTAA 


GAAACTCATT 


GAGATAACTG 


CAGCTTTATT 


CTTTTTCAAA 


CGACGCCATG 


540 


AGTCTTGTAA 


GAAACTTAAA 


GACGGTGTGG 


CAATACGCTC 


TCTTTCTTTT 


AGAGTTGAAG 


600 


TGTTTAGAGG 


CTGAAACTCA 


TCTGCCGGAA 


TATCTTTAAT 


CGCTTGTGGT 


AGATTATTTT 


660 


TATTGACTGT 


TTCCATTAGC 


CTTTTGCACC 


TCCTGATACG 


CGAATTCTTG 


GATCAATTAA 


720 


ACCATAAAGT 


AAGTCAACAA 


TCAGAATTAC 


AACAACTAAC 


ATTGTTGAAT 


ACAAAATAGT 


780 


TACTGCCATA 


ATTGTTGGAT 


AGTCGTTTGT 


CATAATTGAT 


TTTACAAACT 


GTTCCCCAAT 


840 


CCCAGGGATG


GCAAAAATAT 


TTTCTACTAC 


TAAAGAACCT 


GTgcATTAAA 


CCAACGGCTA 


900 


AAGGTCCTAA 


TAAGGTAATC 


AATGGAATTA 


AACTGTTTCT 


TAATCCGTGG 


CGGAATGCAA 


960 


CTTGCCAGCG 


ACTCAAGCCT 


TTTGCTTTGG 


CTAATTCGAC 


ATAATCACTA 


TGCAAGACCT 


1020 
CTACCATCTC GGTTCGAA7A AACCTASCCG AATCGGCCAT GGGACTCATT GCCAAGGCAA 1080
TAGTTGGGAA GATTGTGTAG GCAMKCM 7CCACATAGC AATTGGTAAG ATTCTTAATT 1140
TCATTGCAAA AATGTATTGT AATAATACCG CGAAAACAAA GTTTGGAATA GAACGACCTA 1200
AAATGGCTAA CAATGTTGCC AACGTATCTA CCCAAGTATT TTGACGCATC GCTGCGATAA 1260
TCCCCAATAA AATACCGACT AACGTACCAA AGATAATCGC TTGTCCTCCC AGTTGAAGGG 1320
ATGGCCCAAT TCGACCAGCT AATAACTTAG CAACAGGTTG GTTTTTAAAT TGGAATGAAA 1380
TACCAAAGTC TCCTGTTACC AGGTTCTTTA AGTAAATTCC GTATTGAACA ATCACTGGTT 1440
TATCTAAGCC AGATTGTTTA TTCAACATAG CGATTGTTTC TGGACTAAGT TTTTCTTGGT 1500
TGGTATAAGG CGTACCTGGT AATAATTGCA TTAAGAAAAA CGTAATCGTT GCGATTAACC 1560
ACAAAGTGAT AATCATGAAA AAGATCCGTT TAAGAAAATA TTTTCCAAAA CTATTCAATA 1620
ATGACACCTC CGAGATTTTT TTGACTTCCT CAGTTATTTT CGTTAAAGTA ATGACTAGAT 1680
AGACCACAAT AATAGATAGG GTGAACACAT GAraACTTTT AAATGGCCAA tTaTTACAGC 1740
CGTTATTTCC AGTATCGGCA TTTTTCTTTA CTTATTAATA AGTAAAGAAC CTATCACTAC 1800
CTCCTCATTA TCAGATACCT yCTTTATCGT TTCGCTTTtC TTTTTAATTA TCGGCATTGC 1860
TTTATGGATT ATATCCTCTG GCTTTTTCGA TAACTTTCAG CGCTCAATGA AAAACGCGTT 1920
TCGTTTTAAA AAGAAAAATG AACCGAAAGA GTTTATCCCG CTTTCAGTCA TTGGCGATGC 1980
ACATCGCTCT TTCTGGCTAA AAACTGGGGG CATCTTACTG ATTCTGTCTT TAGGCTTTCT 2040
ACTAnTTTAC TTGGTCTAAG ACAAGATAGC AAGAGGACCG TTGCCTTTTA TTCCATGGCA 2100
AAGGTCCTTT TTTGTTTGCT ATCGATCTTT TGTCAGGTTA TTCTGAAATG TATGCCCATT 2160
TGTAGTCATA CGTTGCGCCA GCAGGATGAA CTGCGACATC TTTGACTTTT TCAGCACGTA 2220
AATGCGCTTC TGATTTTTGG AATAATGGTA CCACACCCAT GTCTCCCATA ATCGTTTTCT 2280
CTGCATTTAA CATATCATCC CAACGTTTTT CAGGATCTGT CGCATCTGCA CTACTTGCTG 2340
CTTTCACGAA CTTGTCAAAC TCTGCATTGT TATAACGTCC ACGGTTATAT GAGTTATCTG 2400
AAGCAAATAA GTCTAAGAAA CTACTTGGAT CAGCATAGTC AGCACTCCAA CCACCGATTA 2460
CTGCATCAAA GTCGCCTTTG TTTGAACGAT CTAAACGAAC AGAGAATGGA ACCGGACTAA 2520
CCGTTACCTT CACACCATCT AAAGCATCTT GAATTGACCC TTGAACAAAT TCAACTGTTT 2580
TCTTAGAAGA ATCTGCATCT GAAGAAAGAA TATCCATTGT TAAGGTTGAA ATCCCCAATT 2640
CTTTTTTCGC TTTTTCCCAG TATTCTTTAG CTTTTTTCGt ATCATATTCA ATTTGaCTAC 2700
CTGCTTCTTT AGCAAAGTCT TTGCCGCCAC TTGGGtCTTT TGCCATATCT GCTGGTACTA 2760
AACCGTTTGG cTCGATAGA 2779
(2) INFORMATION FOR SEQ ID NO: 236:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10955 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 



GTGGCAGCCC 


AGCAAGTCAA 


AGTGCGCCCG 


CACCTGATGA 


AGTGTTGGCG


ACAAGTATTG 


60 


GGACGTATGA 


AAATCGAACA 


TTGTTGGTTC 


CAGATGGGCG 


TTTAGCCTTA 


ACTGCCTTGC 


120 


ATAAAGGAAA 


AGATTATCAA 


GGCAAACCAA 


TGTTTTATGT 


CCTGTTTGAA 


TTAACCAATA 


180 


CAACCGAAAA 


AACACAAAAT 


ATTCAGCTAA 


TGATTCAAAG 


TTTTATGGAA 


GTTAGTCAAA 


240 


CTGTCCACGG 


AAAAGCCCAA 


AATTTACAAT 


ATGCAGTCTT 


AACAGATTCA 


CCTTTCCAAG 


300 


ACAAATTAGA 


TCGTTTAGCA 


GATGAAATCA 


ATCCTGGTGA 


AACAATCCAA 


GGCGCTTACC 


360 


CGTATGAATT 


TATTAATGAG 


AACAAGCCGG 


TCCATTTTAA 


ATTTAGAGAT 


CGCCTTCTCA 


420 


GTCTGGATGA ACCCATTGCA TCAGAAGAAA TTACCATTAC 


AGAAAAAAGT 


TTGGTCCGCT 


480 


AGCTTAGCTA 


GCATTGGACC 


AAACTTTTTA 


TTTATTATTC 


AGACGGAAAA 


GATTCTCCTT 


540 


CTATAAAACG 


TGTGTAAATG 


TGTTCGCTCT 


CAGCAGCTTC 


TCCCCGCATT 


AAATGAATAA 


600 


TTTTCTCTGC 


AGCCACCATG 


CCCCATCGAT 


GTTTAGAATA


GGCAATTGTT 


GCTAATCGCG 


660 


GTTGCACAAA 


CGCACCTAGT 


TCAGAATTAT 


CAAAACCAAT 


AATCCGAATG 


TCTTTCCCCA 


720


TTTGGTAATT 


TGTTTCTGCA 


ACATATTTAT 


AGACACCAAT 


CGCCATTTCA 


TCATTAAAAG 


780 


CAAAAACATC 


GACTGGTTCT 


GTCTGTGGTT 


GACTCAAAAT 


TTTTTTAGCA 


GCCGCATAGC 


840 


CAGAAGGTTC 


GGTAAAATCG 


CCTTGGATAA 


TTTCATAAGG 


TATCCCAAAA 


CGAGTTAATT 


900 








TATCATAACC 


TTTTTCTGGT 


CCAGAAAGCA 


960 


GAAGAACTTT 


TTTTGAACCC 


ACATTGACAA 


ATTGCTCAAT 


TGCCTGCGTG 


GCACCGCCAc 


1020 


GGTTATCCAA 


GAGAACTTGC 


CGAATATTCC 


TATGCTCAGT 


GGTTCGATCA 


AGGACAACGA 


1080 


TCGAATGGCC 


TCGTTCAGCA 


AACTTTTCAA 


TTTCTTTTGT 


TGGAAATGTC 


CAATCTAAAA 


1140 


TAATTGCCCC 


ATCCACCATT 


TTTTCAGGAA 


TGAAAAGATG 


TGACTTTTTA 


CCGCTGCAGA 


1200 


CAATCATCTC 


ATAATCAAAC 


AGTGCTAAGC 


CTTTCTTAAT 


TCCCTCCAAC 


AATTCACCAT 


1260 


AAAAACTACC 


GCCATAATCA 


GCCAAATAGA 


CACCAATAAT 


ATTGGTTTGA 


CGACGTTTTA 


1320 


ATGTGCGAGC 


GGCCATGTTA 


GGAACATAGT 


TTAGCTCTTC 


AGCAATTGCT 


TGGATGCGCG 


1380 


TTCGTGTTTC 


TTCAGTTACC 


TTTGAACTAC 


CATTCAATGC 


m ft ft ft ft ft /"•/*■' 

G T AAG AAAC ks 




1440 


ATACGCCTGC 


TTTTTTTGCA 


ATATCTTTAA 


TTCCTaCCAT 


GGACTACTCT 


CCTTTCTAAT 


1500 


GGTAAATTTC 


AATAACACTT 


GCTTCATTTA 


ATAATGAAAG 


CACCTCGTCT 


TTTTGAACTT 


1560 


CCATTCCCCC 


TTCTCGATAA 


GGGTTTTCTT 


CAAACTTAAA 


CGGAACCTCT 


TGACCATCGA 


1620 


TAACTATTTT 


CGCTGACTGA 


TTTGGATAAA 


ATTTATATGT 


GACAGGTTTA 


TCAAACAATT 


1680 


GGTAAGTTAG 


TGTTAAACCA 


GACAATTTTT 


CTGGTAACAT 


GGGATCAAAA 


ACTACACTTT


1740 


GGGCCTTTTG 


CCGAATCCCT 


AAAACAGCTG 


TAATTAATTG 


ATTTAAATAA 


ATTCCCGGCC


1800 


CGCTTGAATA 


AATTCGCCAA


CCACCTTTTA


CCCCAATACT


TCCATCTTTT


AGTTTACCAA


1860 
AATTTTCTTG


TGATTCATAC 


CTTGTTTTAA 


AATCGCCATC 


GGAACTACTA


AAATAGGCAT 


1920 


TGGCTTGACG 


TAACTCTGCA 


TTGTGAACCA 


CTTCTTTTAA 


TTGAACTGGA 


TTGATTTvaCA 


1980 


TTAATGCTTG 


CCAGGTTTTG 


TCTCGCTGAC 


CAATTTTCGC 


CATCGCTTCT 


GTATAGCGAA 


2040 


TGTGCGCATG 


AACATACTGT 


AAGCCAATTT 


CGCGACCAAA 


ATTAGCCGCC 


TGTTCTGCTC 


2100 


GTTTAAAATT 


AGTGCTCACA 


CCGCCTCGAT 


AGTTCGCTGG 


TTTGTTCATT 


AAGCGCACGC 


2160 


CATCAGGAAA


CGTTAATTCT 


TTTTCAATAA 


TTTCTAAATG 


ATGACTTACT 


TGTTCCGGTG 


2220 


TTAATAATTC 


AGCTAACaTA 


CTACGAGTCA 


TTGGCAATAA 


TCGATATTGA 


ATCCCTGTTT 


2280 


TTTGGTCATT 


TGGGTGAATC 


ATTTGTTCAA 


AAGTTTCTGG 


CGTTTCCATA 


TAAATAAACC 


2340 


CTGGAATCGT 


TTCATCTGCC 


AACATATATT 


TTTGAAAATC 


GTTCTTTATT 


CCTGCGACTA 


2400 


AAATTGCTAA 


GTGCTGACTA 


TAAGTAGCAT 


CGATTTCCGT 


AATTAAAGCC 


GCAAATTTTT 


2460 


TCAACACTTG 


ATACGTTAAG 


GCAACGGTCC 


AACTACTAGC 


CATTTGCTCT 


TTAAGACGAT 


2520 


TATCATAAGG 


TTGTAACGTG 


TCATCCCAAT 


CGCCGTCGCC 


ATAACAAGAT 


AGAAAAGTTT 


2580 


CTGGTAAAAA 


ATGTTGTTCA 


ATATAATTTA 


TTTCTTTTTT 


CAAATGTTCA 


AATAAAGATG 


2640 


CTGTTTCTTT 


TGTTTTGTGA 


TTATCTTTGC 


GACTTGTATA 


AGGGATTTCC 


GTAGCTAAGA 


2700 


TTGAATAATC 


TGCGGTTTGT 


TCCAAATAAT 


CAGTCACTAC 


TTTCAATGGC 


CAGACGATAA 


2760 


TATCGCCATG 


GCTTTCACTG 


GCTTTTTGTT 


CTTCGTATCG 


GTCAAACATA 


AACCATTGCG 


2820 


GCCAGTTGCC 


ATCGTCCTCA 


AATTGATTTT 


CAAATAGATG 


CTGAATAATG 


GAGGCCACGA 


2880 


TCTTAGGCTG 


TTGCGTCGCA 


AAGAAAAATT 


CTGTCGGTCC 


TTGCGACACA 


TCTCGTGTGC 


2940 


CCCAAGCTGC 


TCCACCATAC 


TGTTCTAAGC 


CATGTGGCGA 


CAAATAATGA 


ACGAGCATAT 


3000 


TATGAGTATA 


CCAACGACTT 


AATTGATTCA 


TTTGCTGCAC 


ACTTGAGTCT 


GTGTGTGTTA 


3060 


ACTCAAAATG 


ATGTAACAAG 


CCATTAATGA 


ATGCCAAATA 


TTGACTTTCT 


GCTTTTTCAA 


3120 


AATCTAATGG 


TGTATTGACT 


AATGATTGGT 


CCTCTATACT 


GCCCGTAATA 


AGTAGTGAAA 


3180 


ATTTTTGGGT 


CTTTTCGATT 


GCCAAGACTA 


ATAATTCTGC 


TTCAGCTCCA 


CCAAATAATG 


3240 


TGCTATCGGT 


TACTTGAAAT 


GATTTATCAA 


CTGTCAATGC 


ATAGGTTAGT 


TCTGGGTAAT 


3300 


AATTTTCTGT 


GGCAGAATGA 


TTTCCCGTCA 


CTCGAAGCGT 


TTTGCCAGTT 


TGTTCTAATT 


3360 


GATAGGTTGG 


TACTTCTTCC 


GGTCCTAATA 


ACAAATGGTT 


AGAAAGAGCC 


CACATATATG 


3420 


CATCCTTTTG 


ACTGGTCACT 


TCCAATTGAA 


TCACTCTTGA 


ATCAGGCGAA 


GTGAAGGTGC 


3480 




T H T TV TP C T T T 


nnPAAPTTAT 

OUvfuiv L, 1 / » i- 


AATACCACGT 


AGCAGAGTTC 


AACCCCATTT 


3540 


CAAAAGCAGA 


AGGCATTGTT 


AGTAAGCGCC 


ATAAGCCATT 


CTCTAAAATG 


TAGATTCGTT 


3600 


GACCAGAGCG 


TTTCATAACG 


TTTAAAGCGT 


TTCGGCTATT 


ACTCATTAAT 


TTGTTCATGG 


3660 


TCGTATTTCC 


TAAAACTACT 


TGTGAATTGA 


AAATGCCATA 


CATATACACT 


GTCGTACTAA 


3720 


GCAAAGGTTC 


GTCAACCGTT 


AAGCCTTTAC 


CACTTAATAA 


AATGTGGCCA


TGCGCTCTTT


3780 


CCATGGCAAT 


TTCTTTTTCT 


TGAAGAACAA 


CATGATGGTA


ATTTTCCGTG


AAAAAGGAAT 


3840 
ACGTCTGACC
CTTCTGTTAA 
TTCCTTCTAA 
AAAAAGGTTC 
CCTTACTTTC 
ATTGATAGAC
TAAATGAACG 
TCCCTTGAAT 
CAAGCTCTTC 
TTGACTGAAC 
AGCCAGTTCC 
CAACTTGGTA 
AAAAATCACT 
TTATCCGCAA 
TCATCATGCC 
TGTCTGCTTG 
CGTGACTGTT 
TGCATCACTG 
AAACTGAACT 
TTTAATTGGG 
TCCAGCGATT 
AAATGTTAAT 
TCCAAAGGGA 
TGGTGCTGAC 
AACCGTTTCT 
TTCTGCTAGC 
AACATCTATC 
ATTTTTCTCA 
GCGACTGGCA 
TGCTTGACTA 
AAAGGGTTCT 
AATGGCTTCT 
CCCTGGGCCG 



TGCTACTTTT TCAATTTGTG TCTGAAGGGG ATAAAGTGCG GCCAATTCTT 3900 

TGGATTTCCT GTGATAGGTT CTCCTAAAAG TTTCGGAGGA TAGTCCTGTG 3960 

TGAATCGAAT GTCACAGTTT CATAAATAGC TTGTAAGGCT TCTTTCGATA 4020

AGCTAATGCT TGCGGATGTG TTTCTGTTAC CGCACCATAA AAAACAAATT 4080

ATTCGTTAAT TGGACGGCTT CCGTTTGTAA AGCGGTGTAA GCAAATTCAT 4140

TTCATTTGCT AAATATCTTT TCCCCAATGC TTCTGGTGTG TTTATTTGTT 4200

TCCGAAAAAT TGATAGCCGT CCGTTGAGTA GCCTACTAAT TTTTGAAAAC 4260

AAGGGCTGGA AACTTCTCAT TTTGCGGTTG ATTTTGTCTT GATAAAACAA 4320

GCCCTCTTGC CAAATGTGAT GATCAAGATA CTGCGACACA TACGCTTCAT 4380

GGCTCCTTTT GTCGCTAAGC CAATATCTTG ACCATAAACA ATATCTACTA 4440

AGTGACATTA ACCCGCCAGA ACCATTGATT GCTTGGCCCC AAACGAAAGT 4500

ACGAACCGCT TCAAATTCAC CCCGCCATGT CAATTGATTA TCTGAAAGAT 4560

TTGACTGTTG GAACCAATCA TTGGTCGATA ATTAATTGTG TCCCCTGTAA 4620

ATAGAGTTGG TTCATACTGC CGTCTAAAGG ATTACCTTCT AATTGATTAA 4680

ACAGCTATTC AATTCTTTTA AATCACCTGT CTCTAGAAAT GTAGCAACTG 4740

CTTAAGCTTC TTTATCTGCA TAACTTATCC TTCCCATCTT ATTCATTTAA 4800

GCGGTTTGTA CATCGCGACT ATTTCCACCA ATCATTACGA TAAATTCACC 4860

GTTACCTGAT TGTCTGAATG AACATAGCGC ACCATATCCT CTGAAAGAAG 4920

TCTTTTTCTT CCTTTGGTTC CAGCCAAACT TTTTGGAAAT CTTTTAACTC 4980

CGAACCACTT GTCCCACCAA ATCACGTACA TAAAATTGCA CGGTTTCCTC 5040

GGACTGTTAT TTCTTACTAA AACAGAGACC GTGACTGGTT GTTCTTTGGT 5100

GCACTTAGTT GAATTGGTTG ATAAGAGAAA TGACTATAAC TTAAGCCAAA 5160

TATTTAGCGT AATTAGAGAC ATCTAAATAT TTAGAAACAT ATTTTTCTTC 5220

TCATAAGGAC GCCCAGTATT GTCGACATTG TAAAAAATTG GCACTTGACC 5280

GGAAACGACA TACTTAAACG ACCACTTGGA TTGTACTGTC CCCACAAAAT 5340

GCATTGCCCC CTTCTGTTCC AGGTTGCCAA GCTTGCACAA TTCCTTTTGC 5400

CCTTGCAAAT CTAAGGGCCG GCCACTGTAG ACCGTGACAA CAACATTGGG 5460

ATAATTTGTG CAACTACCGC TAATTGTTTT TCGGGTAGAC GAATATCACT 5520
GCTTCCCCAC TCATCCAATC TGTTTCGCCT AGTGCCACTA CTACTTTATC 5580
GCTAACGTTA AAGCCTCTTC AATGGCCGCC GCGGTCGGTT CAAAGTAATC 5640

TGGGCAACTA GAAGATTTGT GGTTAATTTT TGGGCCCCAG CCACGAGGGA 5700
TCTTTCTTTC CTTGCCATGA CCAAGCCCCT AAAATATCTT GTGATTGGGC 5760

ACAAGCGCGA TTTTTTCTTG CGGTGTAAAA GGCAAAATAT TTTCATtTTT 5820 
TAAAAGAACC


ATTGATTTTT 


GAGCAATCTT 


TTTAGCTATT 


TCTCGATGTT 


CTTTGCTTAA 


5880 


AATCACTTCT 


GCTTCCGCTT 


GTTCATCCGC 


ACCACGATAG 


GGATTTTCAA 


ATAAGCCTAG 


5940 


CTCATTTTTT 


AGTTTTAAAA 


TTCGCATAAC 


TGCTTCATCA 


ATAAGTGTTT 


CAGCAATCCG 


6000 


GCCTTCTTCA 


AGTAATTCTT 


TTAAATAATC 


TGGATAACAC 


GTGGTCATCA 


TTTCAATATC 


6060 


TACACCAGCT 


TTGATAGCCA 


ATTCTGCCGC 


TTGTTTTTCA 


TCTTTGGCAA 


CGCCATGAGG 


6120 


AATTAATTCT 


TTAATCGCTC 


CCCAGTCTGA 


AATCACAACC 


CCTTCAAATC 


CAAATTCATT 


6180 


TPGTAATACG 


TCTCTAAACA 


GCCATTGGTT 


GCCTGTAGCA 


GGAATCCCAT 


CCaCTGTATT 


6240 


GAAGGACGTC 


ATGACAAGTT 


TTGCACCCGC 


ATCTAATCCA 


GCTTTATAGC 


CAGGTAAATA 


6300 


PATTTCTCTT 


AGCTGCCGTT 


CAGACATATC 


TACCGTATTA 


TAATCCCGAC 


CAGCTAAGGC 


6360 


APCACCATAT 


GCCGCAAAGT 


GCTTCACGCA 


CGCGGCTACT 


CGCCAATTGT 


CTTCTTTTAA 


6420 


ATTTTCTPCT 


TGGTAACCTC 


TAACCATTGC 


AGCAGCTAAG 


CGGCTGTTGA 


GTAATGGATC 


6480 


TTCACCTGTT 


GACTCCATGA 


CACGTCCCCA 


ACGTGGATCG 


CGCACTAAAT 


CAACCATTGG 


6540


TGAAAAGGTT 


ACATGTAATC 


CTGAAACAGC 


AGCTTCTTTT 


GCTGAAATCT 


CCGCCATTTT 


6600 


TTCAACCGCT 


GCTTCATCCC 


AAGTGCTTCC 


TAAAGCCAAA 


GGAATCGGAA 


AGATGGTTCT 


6660 


GAAACCATGA 


ATAATATCCG 


CCATTAAAAG 


TGTTGGAATA 


TTTAATCGAT 


GATTTTGCAT 


6720 


ATATTCTTTT 



TGAACACGTT 


TTGCTTCGTT 


TGCACCGGAA 


ACACCCAGTG 


TTGTCCCCGC 


6780 


CGTATCAATC 


GTTTCTTGAC 


TTAGTCCTAG 


CTCTGACATA 


GGGCCGGTCT 


TTTCTTCTGC 


6840 


TTTATCTGAA 



TAAAACGCTG 


CTGCTAATTG 


TAATAATTGA 


TTAATTTTCT 


CATCAATTGT 


6900 


CATTTCTTCG 



ACTAATTGAA 


TCAGTTGATT 


TTCTTTCATA 


CTCTCACCTT 


TCTTCAATAC


6960 


ATACTTCTGC 


TTCTAAAGGA 


AAATGGTCGG 


AGGGAAAACG 


TCCATCGATA 


CTATCTGTTA 


7020 


AACTTGCGGT 


TTGCTGTACT 


TGCCAACCTT 


TTACATAAAT 


ATAATCAATT 


TCTTCTAATT 


7080 


CCGCCCAAGG 


TTTTGTGTAA 


GTAAAATTTT 


GAAATGTTCC 


TCTTGGGCCA 


TAATGAGTAG 


7140 


CTAAATTTTT 


GCTATCTTGA 


AATTTTTTTT 


GAACAAGTTG 


ATGAACCTCT 


TCCCCCGACT 


7200 


CTGCATTAAA 


ATCGCCCATT 


AATAAAGTTG 


GATACTGTGC 


AATTTTGTCA 


TGTAACTCCT 


7260 


CTAAGATAAC 


GGTCATTCCT 


GCTAAACGAG 


CATGAGCTGA 


AATATGATcC 


AAATGAACAT 


7320 


TGATGACTAA 


AAAAGGCGTA 


TTTTGAGTCG 


TTTCCTTAAA 


TAGTCCCCAC 


AAAGCAATTC 


7380 


TTGGACAGCC 


AGCTTCAGGA 


TGGATACTTG 


GCTGCTGAGG 


TGTTTCAGAT 


AACCAAAAAT 


7440 


ACCCTGTATC 


AATAGCCTGA 


t\ nn ^ *"n 
ALTl 1 X X b X X 






rrTTPTPPTT 


7500 


GACCATCCCC 


TTCTCTTTCC 


GCAGAAAGAC 


AGGTGAATGT 


TGTATACGCC 


TTTAAATCCC 


7560 


TGACTTGATT 


CGGTCGTACT 


TCTTGAATAC 


AACAAAGGCT 


CCAATCATGA 


AAATTGATTA 


7620 


ACTGACAAAC 


AGCCTCTTTA 


CGAAAAGACC 


ATTGCCAATC 


TTGGTCATAC 


TCTGTATCTA 


7680 


CCCGGACATT 


ATACGTAGCT 


ATTTTCATTT 


TTAACCTCCT 


GCAATAGTTG 


TTGAATTTGA 


7740 


CGCATCTCAC 


GACGGATCTG 


GGTTGTTTCA 


CCGCCACCAT


GGCCTTTTTT 


CCAATACACT 


7800 
CTGACTGCTT


TTCTTGCCTT 


TATCTGATGA 


TAAGAAGGGA 


AAAAATCTTT 


CATCGGACAT 


7860 


ATCGGATCGT 


GACTCCCAAC 


AGAAGCAAAG 


ACTGGATTTT 


TAATTTGCGA 


AACAAAGTGA 


7920 


CGACTATCAA 


AATAAGGAAG 


GACCGCGGCA 


ATTTTTTCAT 


GCGCTTGAGG 


ATGCTCTTGT 


7980 


AAGTACTGAT 


GAATAATTTG 


ATACGAACCC 


GTTCCTTCTG 


CTACTCTATT 


TTGTAAACAA 


8040 


GAATGACTTG 


GAACATCTGC 


AAAAGTAGCT 


AAAATCGGGA 


AGTCGACTAA 


ACTGTTCACC 


8100 


ATTAACACTA 



CGCCACCCCC 


TTGGGAACCA 


CCACGAAGAA 


TGATTTGGTC 


CTGATTAATT 


8160 


TCAGTAAAGG


TCCGCACCAC 


CTCTACGCTA 


CGTAAATGGT 


CGGCAAACAA 


GCGGCGTTGG 


8220 


TACCAATCTT 


CTTTGTCTAG 


AAATCCTCGT 


CCCATTGGTG 


CTTCATGCCA 


AATTGTTTGA 


8280 


TATGGCACGC 


GATCTTTGGT 


TAATCCCCCT 


TGACCTCGAT 


TATCAATGAC 


AAAACAAGCA 


8340 


CAACCAATTT 


GTGTCCAATG 


AGCAAATTGC 


CAAGGTTCTT 


GaATATGATT


CATATAGCCT 


8400 


AAACAGTCAA 


TCACTATTGG 


CTGTTTCCCC 


GCCGTTTTAG 


GaAGTAACAA 


ATAGCCAAAG 


8460 


ATGCGCTCAC 


CTAGGAAACT 


TAAAAAGTCG 


ACCTGTTCCA 


CTTGAACATT 


TTCCAATGGA 


8520 


TAGTCAATGA 


CCCGCCGATG 


GATATCTAGC 


GGTATTTTTT 


GAAGTTCTTC 


CTTGGTCTTT 


8580 


TGCCAAAACT 


CAAAATACTC 


ATTCATGTTC 


CACACCTAAC 


ACTTTCTGAA 


AAAACGCAAC 


8640 


TTGAAGAAAA 


GCCGTTGTCC 


CAAATCCTAA 


CGGAATATAT 


AAAAACAAAT 


TAAGCTTATA 


8700 


GAAAAAAACA 


AACGTTAAAA 


GAGGTAAAAT 


AAACAAAATA 


ATCGCATTTG 


CACTATTACT


8760


TAAGACCACA 


TAAACCGTCT 


CACCTAGTGT 


CTTTTTAACG 


GACCAATTAA 


AAACGGAAAA 


8820 


GAGAAAAAGA AAACTCGTCA AAAATAATGC AACAAAACTT AATAGAAAAA TAGCTAAGAA 


8880 


ACCTAAACTA 


GTGGCTATTT 


TAAATAAACT 


TGTTACGACA 


ATACCGCCCA 


GTAAAAAAGC 


8940 


AGCGAACAAG 


ACACCTGTTA 


AAATAAAACC 


TCGCATGATA 


TTATTTTTTA 


ACGCTGTCCA 


9000 


AAACGTTCGC 


AAAACAGGAA 


TATCTTTTTC 


ACGATACAAA 


CGATAGATCG 


TTCGATACAA 


9060 


CGCAATTAAG 


CCGACACCGC 


TAAAGATGCC 


TGTTATACAG 


GTAATTAAAA 


AGAGAAAATT 


9120 


AATAATGATC 


AAGTTGGCAA 


TAAAAGAAAA 


AATCCGATAA 


TACCAACCTT 


CATATGCAAA 


9180 


TAGCTTATTC 


ATATTAAAAG 


AGAAAAGCGG 


ATTTCTCGTT 


GCTTTATCCG 


CTTTTCTCTC 


9240 


CTCCTCTCTT 


ACTTAAGTTT 


GTCTCTGTTT 


TCCGCCATTT 


TCTCAGATTT 


TATTTTTTCA 


9300 


ATTGCATCCC 


ATTTATTTGA 


TTTTAAGAAT 


GCTTTGTAGT 


CTTCTAAAGA 


TTTGTCAAAG 


9360 


GCTTTATCAT 


CTTTCGCACG


AATCATTGAA 


ATAACGGTTG 


TATTTAGTTT 


GGTTTCAATC 


9420 


GCTTCATTCG 


CACGAGCTTC 


CGGCGTTCCT 


GCATCTGGAT 


TAATATTTTC 


AATTACGAAA 




TGTGGGACTA 


ATTTTCCTTT 


GCCCCATTCT 


TGCATTTGCG 


TTAAAGCACT 


TTCCATTGGT 


9540 


ACTTTTAGTT 


TATTGACACG 


GTCGTTGTTA 


AAGTATAGGA 


AACGACTAAT 


GCCATATTTT 


9600 


TTGTTGTAGG 


CATCATTGTC 


TGTTTGTTCT 


AATTTTTTCA 


CTTCTGGTAA 


ATAATCAATT 


9660 


TTTCCTTGAT 


CATTGTACGC 


ATAAGTAACT 


CCTTCAACGC 


CATATTTTGT 


TAAAATTTGT 


9720 


CCCGGTTCAT 


CAATTAAATA 


TGTGAACAGT 


TGAGTGACTT 


TTGCTGGATC 


TTTCGCATCT 


9780 






TTCGTAATGT 


AATTACTTAA 


CCAACCTGAA 


ATGCCGGTTT 


GATTTAATGT 


TGGTTTTCGG 


9840 


CCAGAAGTGC 


TACTTGGTCC 


ATCAATGGCT 


ATATAACGTG 


TGCCAGATTT 


TTTCATAAAT 


9900 


TCTGTGAAGT 


TCCCACCTTG 


ACCACTGGTT 


CCAGCAACGA 


GCATGGTTGC 


ATAATTTCCT 


9960 


TGTTTCACTT 


TTTCATCAAA 


CGTTGCCCCA 


TCATCTGTGA 


AGCTATCATC 


ACTAATATTG 


10020 


CCTGCTCGGT 


AAACATCATT 


AAATGTTTTT 


AACCATTCTA 


AATATTCTTT 


ATCTAAATTT 


10080 


CGATCATAGT 


ATTTACCATT 


TTTATCCTCT 


AAAGGAACAC 


CTAAGAAGTC 


TTGTAATTTG 


10140 


TCTAAAAATG 


GTCCTGCACC 


ATCGCCCACT 


GTGGTGAAGC 


CCATTGGGGT 


CATCTCAGGA 


10200 


TATTTTTCTT 


TAATCTGTTG 


CATGACTTTT 


TCAAAATTTT 


CTGGTGTTGA 


AACGTCTGGC 


10260 


TTGCCTAATG 


CATTATAGAC 


ATCTTCACGA 


ATAACAAAAT 


TATCATTTAC 


TGGGATATTT 


10320 


CCACTTTCAT 


AATCCGCTTT 


TGTATTAGAG


TAATTAGGGT 


AACCATATGT 


TTTTCCATCC 


10380 


TCTAAGGCAT 


ACCATTTAAA


CGTATCTTGG 


TTAACAACTT 


TCATTAAATA 


GGGGTCATAT 


10440 


TTTTTAGCTA 


AATCGTTTAA 


AGAATAGGCC 


CAAGAATCTG 


CTTTCAATGC 


TGCTTGTCCA 


10500 


GTTTTTTCAG 


TTAATGTCAC 


GATATCAGGC 


ATATCTCCAC 


TTGAAATCAT 


GGCATTTAAT 


10560 


TTTGAATCAT 


CGCCGGAAAT 


AAATTCAATA 


TCCACATTTA 


AGTCTTTTTT 


AATCTGCGCA 


10620 


GTGGTCACGT 


CTTTTCCAAA 


AGGCAGCGCC 


GTCCAATCTtj 


ACal TAA1 H I A 




10680


TTGGTCGGTT 


CTTTCTTCTT 


ATCTAACTTC 


CAAGCAGGCG 


TCTTTTCATC 


TAATTCATAC 


10740


CGATCCTTAA 


TTGAGACATC 


TGAGCTAGCT 


TTCTTTTCCC 


CACCGCTACA 


GGCCGCTAAA 


10800 


CTAACGACCG 


CCGTTGCCAC 


CAATAAACCG 


CCTAAAACTT 


TTTTCATATT 


GTTCCCTACT 


10860 


TTCTTTATTC 


TTTTACTGCA 


CCGGCCATCA 


TACCGCCGAT 


AAAGTGTTTT 


TGTAGTAATG 


10920 


GATAAATGAT 


TAAAATTGGT 


ATCGTTGTAA 


TGACA 






10955 



(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237:

GCTAGtCAAG GATTAAATGA ACGTGCTGAG TATTTAGACA TCAAACAATT ATCGTCTATG 60 

TATTCAATGT CTGTTCCTGA AGTTGAACAA AATTTTGTAA AAGACAAACG TATGCAAATG 120 

ATCGAAAAAA GAAAGCCTGG TACTAGCAAG GGAAAAAGAT aCTGGCCtGc TAAAGAAGCt 180 

ATAAAAAtTT GtAATGACAT CmTGAATcAT TGGGATTaAA GGAAAGGrCg ATACAGAAAC 240

AwTGGACAAC TTaGTAATTA TGAAAAACCA ACAAGCAGTA ACAACTAGTT TGCAAGTCGC 300

AGAAACGTTT AATAAAAATC ATCGTGATGT TTTAGCAGCG ATTGATGATT TAmAAGAGGG 360

GGTTGCGGAA AATTACGCAG GGCTATTTTA CGAAGATAGC TATATTCATC CACAAAACAA 420
ACArAAATAT


AGACAAaTCA 


TTATGAACCg 


TGACGGATTC 


mCGCTACtAG 


CaATGGGATT 


480 


CACTGGTCAA 


AAAGCTCTTA 


AGTTTAAACT 


GAAATACATT 


GAAGCTTTTA 


ACCAAATGnA 


540 


GGAACTGCTT 


AAAACTCAAT 


CAAACTTACC 


GATTAATAAC 


ACAGAATTGT 


TATTAGAAGC 


600 


TGCGTTAAAA 


CATGAACGTG 


GATTGACTCT 


TGTAAATCAA 


CGTTTAGATA 


AGCTAGAAAC 


660 


AGAAACTACA 


ATTAATAGAA 


GCCAACAACG 


AAAGATACAA 


GGGCTAGTTT 


CATCAACTGT 


720 


TATCAAAGTA 


TTAGGTGGCA 


AAAAAACATT 


GGCTTATCAG 


GATTCAAGTA 


TTAAGCAATC 


780 


AGCCTTTAGT 


AATTGTTATA 


AACAATTGAA 


AGCATTATTC 


GATGTAGCAT 


CTTATGTAGA 


840 


CATTCCAAAA 


GTTCGATATG 


AAGAGGCTGT 


AGCTCTAATT 


CCTAGATGGA 


AGCCTAACTT 


900 


AGAATTACAA 


GCAAGAATTG 


ATATGGcTAA 


TGaTAATGGa 


GATATGnTTT 


AAAGAAATAG 


960 


GATAGTAATG 


ATAAArGGAT 


AAGCCTTAmC 


TTATCCCsGG 


TTwAAAGTAG 


TGtCAmCAAT 


1020 


AATAGCtTTT 


TGATCCATAT 


CACTTTTGAT 


GGATTCAATA 


ATTTCTAAAA 


CAGAAGATTT 


1080 


ATAGTAATAA 


GTCTTGCTTG 


ATGCGATTAC 


TTCATTTTCT 


TCGGTTTTAA 


TTACAAAATA 


1140 


GTATTTTTTA 


TTAGAAGCTT 


TCTTTATTAC 


AAAAAACATT 


TTGACACCCC 


GCTTTCAACT 


1200 


GAATTATATC 


AAAAAAATTA 


TaGGAGGGTA 


AATATGAnAG 


CAaTwCGTGA 


aGCaCGATTG 


1260 


ATAGcGCtTA 


TTATTGATGA 


TTGCGCTAGG 


TGTGTTATTG 


AAAAGCCACT 


TTTCAGTTCC 


1320 


AGTGTTGGCA 


ACAATAAGTG 


TACCTCTTTT 


TATCCGTTGG 


TTTTTCAACT 


GGGATGAAGC 


1380 


GGAGTATCAG 


CATTTTCAAA 


AAAGACAAAA 


TAAAAAGCCC 


CAATCGTCTG 


CCAACGATTA 


1440 


GGACACATAC 


AAAATGAACT 


AAGGAGAGTA 


TACCAAAATG 


AACGATAAAA 


TTCAAAAATT 


1500 


GATTAAAAAA 


CTAGCAAAAG 


AATGCCAGAA 


AGAAGATGTA 


GCTTTATCTT 


TGGCAGCTAT 


1560 


CGATTTAGAA 


GGAGAGATGG


CAATATCTCA 


AGTTGGAAAA 


GGCACGATAG 


TAGCCATTGC 


1620 


TGCACATAGC 


CAATATACAC 


TAACAAAAGA 


AAAACTGGAA 


CAATCAGATT 


GTGATTGTCC 


1680 


AAAACATCGT 


TTATTAAAAG 


AGATGTACGG 


TACTGAAACA 


GAAACTACGA 


CTAAAAATAC 


1740 


ACATACTTTT 


GTCACAGATG 


ATCCAAACGA 


TTTGATGGAT 


ATATTATCGA 


AGATTTTTCG 


1800 


AGGTGAGTTT 


AAATGACTAG 


AAAAGAAAAG 


TTAAACCAAG 


CAAAAAGATT 


AGCTGATTTA 


1860 


TGGTACAAAC 


AACAAAAAAG 


TCAACTATAC 


ATTGCGCAAC 


AAAAAGAGCA 


CAGAGGGATT 


1920


GCATGATGAA 


AAAAGCGACT 


ACGCCGCCAA 


GCAAATAGTC 


GCATACAAAA 


TTATACTAGA 


1980 


AAAATTGTAA 


CACAGAAAAG 


AGGTTTTGTG 


AATGAATCGT 


AGAGAAGCAA 


ATGCACTAGA 


2040 


CAGATATTTA 


ACAGAGCCGA 


CTGAAAAACC 


ACATAAGGAA 


ACCTATGAAG 


ATGATCCaGT 


2100 


GGATACTACT


GATTATTTTG 


GAAATGaAAT 


TGCTGATGAA 


GACGGTGTGT 


TTGAGATAAC 


2160 


TTTTGCAATG 


AAATGTCTTT 


ATACAGGACA 


ACCAGTACTC 


ACTTGTAAAA 


AAATTGCTAC 


2220 


ACAAGATACA 


ATCGTTGATT


TGATAGAAGA 


ATTAGGCGAA 


GAAAACGTGT 


ATTTAATTGA 


2280 


ATATGTGAGT 


TCAGGAAAAA 


GATATAAGGA 


GGGCTTATTG 


AATGACTGAA 


GCAACCAAAA 


2340 


CAGATTTTTC 


TAAGTTGAAT 


GTTTATCAAA 


AATTAGCGTA 


TGTGAGACAA 


AAAGCACCAT 


2400 



ATA7CCAAAA AAGTAAACGT GGCCAGCAGT ACAGCTATGT GGGGTCAAGC GATGTACTAT 2460

CCGCATTAAA TACAGTCATA AATCAAGTTG GATTAATTTT GAAGCCAGAA ATTGTTGCTC 2520

ATC.-A3TTCG GGAATCACAA GATGAGGTAT GGAAAGCGGA TAAAGTAAAG AAAGAGCCTG 2580

TAGCCAAAAA ACGTACAACG TATTTTACAG AGCTAGAGTT AATGATGACA TGGATTAATA 2640

TTCATAATCC TTCTGAGATT GTTGCTTGTT CATGGTATAG CCAAGGAGTA GATATTGAAG 2700

GAGAA.-JVGGG GGTAGGAAAA GCACTAACGT ATGCGGAAAA ATATTTTTTA TTGAAATTTT 2760

TTAA7ATTGC AACAGATGAT GACGATCCTG ATAAATACCA AAAAGAACAG CTAAAAAATA 2820

CTACAATTAC TGAACGGCAA ATTGATATGT TGAACGCATC AATTAGTAGA GTAGCCGAGC 2880

TGGCASGGCA AGAATTTGAA GCTGTGAAAT CGTTAGCTAT AAATGATTCT GATTTGAATC 2940

CTAAAAAAGC ATTTGAAGAA TATAGTGCTT ATGATTATGG TGTTATTTCT AAATTGCTTG 3000

CGAAGTGGAT AAATTTTTAT GAATCTAGGC AGAAGGTTCA AGAAGATAAG AAGTGATTGA 3060

ATGATTGGAA AAATCATAAA ACACAAAGGA AATAAATTGG CGATTGAGTT TGAGGATGAG 3120

ATCAACTCGA ATTTCCTCAA ACTCCTAGCT AACAACGATG ACAATTTAGT AAAAGTTGAA 3180

CTA7TAGATA ATCGACAAAT GTCTCAAAAA CmGaATGCaC TTTTyCACGT TTTAATaGCk 3240

GATATAGCTC GTTGGAGTTA TGACGAACCA AATGGATAGA GGAAGTTTTA AAGTATTAcT 3300

ACGAA3CTAA aAGTGGCGTG TATTTTGAAC ATAGTAAGGC TACACGACAT GAAGCAACAG 3360

AATGGATTAG TTTTTTAATC GAATTCATTT TGAAAAATGA TGTACCACTA GAAAAGAGAT 3420

ACCAA7ACTT GCTAGAAAAT AACAAATGGT TTTATTACTG CCTTAAATAC CGTAAGTGTT 3480

GTATTTGTGG GAAACATGCC GATGTTTGTC ATATCGAGGT AGTCGGTATG GGGCGAAATC 3540

GTCAAAAGAT TAATCATGAA ACATTTACTT TTTACGCTGG TTGTCGTCAA CATCATCAAG 3600

AAGAACATCA GATAGGTACT AAGAACTTTT TAAACAAGTA TCAAATTAAA CCAGTAAAAC 3660

TAAACGTTGA AGAACGTAAG AAGTTGAATA TCGGAGGTTA ACAGTTTGGT AGAGGAATTA 3720

CTTGAAAAAT ACAGGCAATT AACATCGAGT CAAAAACTAT TTTTTGAACT ATTAGCATTT 3780

GTCTATATCG GTTCAAGAAA TGGTAAAGGA ATAGCTATTG AAGCACAAAC AATAAAAAAA 3840

GTCGTTAACG GAGAAATTAA GCATAAATAT GTTTATACGG TCGTTGTTGA TGAGGAGGAT 3900

AACTAGTGAA TGAACATAGA GGATATTACG CCATTATCCC AGCGATTGTT CGCTATGATA 3960

ACCACTTAAA TGGGAATGCA AAATTATTGT ATGGAGAGCT AACGGCATTA GCAAATGAAA 4020

AAGGCTATTG TTGGGCAACG AATCAATATT TTGCAAATCT GTATAACGTT AGCAAGCGAA 4080

CAATTATTTC ATGGCTGAAA CAGTTAGAAG AAAGAAACTA TATAAAAATA CAAATTTTTT 4140

ATAAGCCAAA CAGCAAAATG GTAGATCGGA GACATATCTA TATCTTACCG TATCCAACTG 4200

ATACAGAATT TTACACCCCT AGTGAAGAAA ATTTCATCAC CTATGGAAAA AATCATCAGG 4260

AGGGGGATGA AGAAAATTTC ACTACCCCTA GTGAAGAAAA CTTCACAGAG AATAATACAT 4320

TAATTAATAA TACmAAGAaT AATACAAAGA ATATATATAG TGTTGAACAA AGTTCAACCA 4380
TGTCTGAATT


ATTTGAAAaG 


GTTtGGAAAa 


CATaTCCGAr GAAAaCCAwT AAGAAAAAaG 


4440 


CCAAaGaGCa 


ATTCTTAAAG 


AAAATTAAGT 


CAGATGAAGA CTTCGAACGG TTTAAAACAG 


4500 


GATACAAAGC 


TTATCTTAAG 


TATATCAAAT 


TAAACGACTG GTATCATCCA CAAGAATTGT 


4560 


TCCGCTGGAT 


ACGTGATGAA 


CGTTTTAACG 


ATGAATACGA CTTGTCTGAA ACGGCAACGC 


4620 


AAGTAAGATA 


TTCAAATAAT 


CCAGTTAGAC 


AAGAGAAGTT GCCAGATTGG GTAAATGAAC 


4680 


CAAAAAAAGA 


AGAGGAGAAA 


CTATCACCAG 


AAAAGCAAGC TGAACTTGAT AGGCAAATAA 


4740 


AAGAATACTT 


GGAGGGGAAA 


TGATGCGAAT 


TATCCTGCCA ATTGAACGAA AACTGCAAAG 


4800 


TCGCCCGAGG 


TTTGCAAGAC 


GTGGGAATTA 


TGTCCAAACC TATGAAGATA GCGCTATGGG 


4860 


TGCCTATAAA 


CAAAAGGCAA 


AACCAGAATT 


GATTGAAAAA GGGGCTATTT TTGCGCATAT 


4920 


TACGTTTTAC 


ATCGCTACCC 


CTAAAATATC 


TATTAAGTTC TAAAAAGAAA CGCTTAGAAG 


4980


TGAAATTAGA 


GCGGAAATAT 


TGCGATAAGA 


AACCTGACTT GGATAATTAT TTCAAAGCAG 


5040 


TCACTGATGC 


TGCCGAAGtA TTTTATATAA AACcGATGgT CAAaTtGCTG TAaTGGTTtG 


5100 


yCAAAAGTTG 


TACAGTATGC 


GACCACGAAC 


AGAACTAGAA ATTACAAGTT TGGAGGAACA 


5160 


AGTGTAGTGG 


GTAAAACGGG 


TAAACAGATC 


AAAGAGAATT ATTCGGTTAG AAAGCGAGTA 


5220 


AAGAAGATGC 


TTAAATTTAA 


aGaATTTAAT 


ATTCAATTAT yCGACGTTCA CATTTATATA 


5280 


AAAmGGCTAA 


GTAGTTTTAA 


AACCAACAAA 


AAATAGTATA AAAAAACTAA CTAGTTTCCG 


5340 


CTAGCTAGTT 


ATAGATACAT 


TTTCATCTCA 


GATAGAGACA ATTAATAATA CATTACTATG


5400 


AAATTTAAGG 


CAACAAAAAA 


AGCCAGCTGA 


CCACTAGCTG ACTTATGTGG TAGAAACAAC 


5460 


TGTTTTCCGC 


CAGTTGTTCT 


TTAGGTGTGA 


GTTAGCACTT TCCCCAAATA AAGTGCTATT 


5520 


AAAAAGCCGA 


TTAAAAATCG 


ACAATAGATC 


TCTTGCAGAG ACGTGTACTA CTATTAACTG 


5580 


TTTACCAGAA 


GTTAATGTAG 


AAAAGGAGGA 


AATTTATTTC ATAAATAAAA TCCCCAAGAA 


5640 


AGTTAAAGTG 


ATTATACCAT 


TGAAAAATAG 


ATTTGAAAAT ACTTTCTATG CCAAAAATAT 


5700 


TTATTAGATG 


GAAAATATCA 


AATGAATAAA 


GATGATGTGA AAGATGACGA ATGGAAGATT 


5760 


GTAAAATATT 


GTAGAATTAT 


TATTGTTTTC 


TTTATAGTCA ACCAATAATT TGTTGACTAC 


5820 


AAACATGTAT 


GAGTGATGAA 


TATTTTGATA 


AATAAATAGC CAGTCGGTTT CCGCCGACTG 


5880 


GCTGAGAAGT 


GAaTAGCTAT TGGAATAGTA TyCTaGTATA ATTTATaTCa TATGGAGTCG 


5940 


CTGATGAGCA 


AAGAATAAGC 


ATTTCTTATA 


TATGTTTGGA TAAATaAAAA AAGCTACTTA 


6000 


GTTTCCGCTA 


AGCAACTCTT 


AAATGATGAT 


ATGTTTATTA TAnACTATTA TACCATAAAA 


6060 


GGAGCGATTT 


CACTTGATTC 


TATTGTTAAA 


AGAAGTAGAT TTTCGACAAA CAAAAGCGAA 


6120 


TGCTAGAAAT 


GTGTTGAAGA 


GTTTTAGACG 


TTTAGAGCGA ATAGCTGGTC GCTCTTTGAT 


6180 


AGATTTAAAA 


TCACCAATTA


TTACAGATAT 


GCCTAAAAGC CAAAGTCACG GGAATAAAGC 


6240 


AGAAGATGCG 


CTAGTACAAT 


TAGTAGATGC 


AGAAGCAGAA AGAGACGCAA TTGTATCTGC 


6300 


GCTTATGGCA 


TTGAGCCTAA 


CTAGCAGACA AATTTTGCAT TATAGTTTCT GTGTGCAGGA 


6360 



TCATTACTCT AATaCAAGtA GCTAGGGAAG TTGGATATTC CGAAAGAAGT ATTCAACGAA 6420
TGAAATCAGA .GGCJTTAATT GAATTTGCCG AAGCGTATCG AAATGGAAAA ATAATTGCAT 6480

ATAAATAAAA TTTTTGGCGG TTTTTGGCGG GAAGTTGGCG GTTTTAACCA TATTAGGATG 6540
TTAATATGGn AGTGGCGAAG TAAGGAGACG AGGTAAGCCT GCATTACCTA CCTAGCTCCG 6600 

TTCC 6604 
(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 



AGTAACTTGT 


GACAAAGTCG 


TGCATGGATG 


CACTAAAAAG 


ACACCCTATT 


GTGGTAGGGT 


60 


GTCTTTTTTT 


GCTGTTTAGA 


CAGCAGCGGT 


TGGAAGCAAC 


TTGTTTATTC 


CTCGTTCTGT 


120 


TTTTCCAACC 


AAATGGTTAC 


AAGTTTCAGG 


ACAATCCCGA 


CAAAAATCGG 


CACAAGAATT 


180 


TTTGTGACAA 


TCTCGTACAT 


GTGATGCACC 


TCCTTTCGAG 


GCAAGTCGCC 


AATTTGAGTA 


240 


TAGCATAAAA 


AGTTGTTCAT 


TTATTATTGA 


AATTATTATA


AATGGAATAA 


GTACGTGTTT 


300 


TCTGGTTTTC 


GTAACTAGAG 


AACACGTGCT 


TTTTTCTTTC 


AACTTTTGGC 


ACAATGGTTT 


360 


GTGCCAAAGT 


ACGACTTTTT 


CCAGATTTGG 


AAAGCGGATA 


CAATTTAGTC 


AACAAGAAAA 


420 


C T T H C T T (2 h CZ 


nunnuuuuftn 


TAAAAATGGA 


ACAAGTACCA 


GAAAAAAATA 


AAGTATCAAT 


480 


TAAAGCAAAA 


GTCCAA/iGAA 


TTGGCAGTAC 


CTTATCTAGC 


ATGGTGATGC 


CCAATATCGG 


540 


GGCGTTGATT 


GCGTGGGGCG 


TTTTGACAGC 


GCTATTTATC 


CCAGATGGAT 


ATCTGCCAAA 


600 


TAAAGCCTTT 


GCCACGATGG 


TTGGCCCAAT 


GTTGACCTAT 


TTAATTCCAT 


TATTAATTGG 


660 


CTACACTGGT 


GGGAAAGTAA 


TAGCAGGTGA 


TCGCGGTGCC 


GTAGTTGGCG 


CAATTGCGAC 


720 


AATGGGCGTG 


ATTGTGGGAA 


CCGATATTCC 


GATGATGCTA 


GGTGCCATGA 


TTATGGGTCC 


780 


ACTTGGTGGC 


TTTACAATTA 


AGAAATTTGA 


TCAGTATTTC 


CAACATCGGA 


TTAAAGCTGG 


840 


TTTTGAAATG 


TTAGTAAACA 


ATTTTTCAGC 


TGGATTGATT 


GGTTTTGCGT 


TACTTTACTT 


900 


GGTTTTTCAG 


CGATTGGCCC 


AATTGTCGAT 


GCGCTGACTG 


CTGCAATGGC 


ACGTGGTGTG 


960 


GAAATCATTT 


TAAATGCTCA 


TTTGATTCCA 


TTAACGAGTA 


TTTTTATTGA 


ACCGGCAAAA 


1020 


GTGCTCTTTT 


TAAACAATGC 


GATTAATCAT 


GGCATTTTAA 


CACCCTTAGG 


GACAGAACAA 


1080 


GTCTTATCGG 


CTGGTAAATC 


AATTCTTTTC 


CTATTAGAAG 


CAAACCCTGG 


TCCTGGGTTA 


1140 


GGTGTCTTAT 


TAGCATTTAT 


GTTTTTTGGA 


AAAGGTGCGG 


CAAAATCTTC 


TGCGCCGGGA 


1200 


GCAATAATTA 


TTCATTTCCT 


TGGTGGTATC 


CACGAAATTT 


ACTTCCCATA 


TGTAATGATG 


1260 


AAACCGTTAT 


TATTCTTATC 


GGTTATTGCA 


GGTGGTGCGA 


CAGGTAGTTT 


GGTTTTCCAA 


1320 


ACGTTAGATG 


CTGGTTTACG 


TGCGCCCGCT 


TCACCAGGCT 


CCaTTATAGC 


TATTTTAGCG 


1380 
ATGACACCAA


TGGGTtCTTA 


CTTACCAGTT 


AwTTTArGtG 


TTTTGGCAGC 


AACTGCTGTT 


1440 


TCCTTTGCAG 


TATCCGCAGT 


CATTTTAAAA 


GCAGATGCGA 


AAGAAGAAAG 


TGATGAATTT 


1500 


GCCCAAAAAG 


TGAAAGAAAC 


CCAArCTGCT 


AAAGCAGCAT 


CAAAAGGATT 


AACTGTCACA 


1560 


AATGGTAGTG 


CTTCATTGTC 


AGGAATTCAA 


CAAATTATTT 


TTGCTTGTGA 


TGGTGGGATG 


1620 


GGCTCTAGTG 


CAATGGGTGC 


GTCTATCTTA 


CGGAAAAAGG 


TCCAAGAGGC 


TGGTTTAGTA 


1680 


CAAACTGTGA 


CAAATCGGGC 


CATTAATAAT 


TTGACCGATG 


AGGGAAATAC 


ATTGATTGTC 


1740 


ACACAAGCGG 


AATTGCAAGA 


ACGGGCCAAA 


CAGAAAGCAC 


CTAATGCAAC 


CTTTGTAGCA 


1800 


GTTGAAAATT 


TCTTGAACTC 


ACCACGTTAT 


GATGAAATTG 


TGGGCTGAAC 


TATCAGGTAC 


1860 


CAAAGAAAAA 


AACGACGCAC 


AACCAACAGT 


CTCCAAAGCA 


ACACCTGCTT


CTGAAATTGA 


1920 


GGGAGATTTA 


GCAAATGTCA 


ATGAGATTCT 


TTTGGTTCAC 


GATGATCGTG 


TCGGGTCAGC 


1980 


AACGATGGGA 


ATGAAAGTCT 


TAGAAGAAAT 


TTTAGATAAA 


GAGAAAATTT 


CAATGCCGAT 


2040 


TCGAAAAATT 


AATATTAATG 


AATTAACTCA 


ACAAACACAG 


GCTTTAATTG 


TCACAAAAGC 


2100 


TGaACTAACG 


GAACAAGCAC 


GTAAAAAAGC 


ACCGAAAGCG 


ACACACTTAT 


CAGTAAAAAG 


2160 


TATGGTTAAT 


CCCCAAAAAT 


ATGAAACAGT 


GGTTTCGCTC 


TTAAAAGAAA 


GTGCCTAGAG 


2220 


AGGAAGAAAA 


CAATGGAAAA 


TCTTACGAAT 


ATTTCAATTG 


AATTAAATCA 


ACAGTTTAAT 


2280 


ACAAAAGAAG 


AAGCTATTCG 


CTTTTGCGGC 


GAAAAACTAG 


TCGAGGCAGG 


CTGTGTTGAA 


2340


CCCGCTTATA 


TCGAAGCAAT 


GATTGAAAGA 


GACCAATTGC 


TATCTGTTTA 


TATGGGGAAT 


2400 


TTTATTGCCA 


TTCCTCATGG 


AACAGAAGAA 


GCCAAAAAAT 


TAGTGAAAAA 


ATCAGGAATC 


2460 


TGTGTAGTGC 


AAGTTCCAGA 


GGGTGTTAAT 


TTTGGCACCG 


AAGAAGATGA 


AAAAATTGCT 


2520 


ACCGTATTAT 


TTGGGATTGC 


CGGAGTCGGT 


GAAGAACATT 


TGCAATTAGT 


CCAACAAATT 


2580 


GCACTTTATT 


GTAGTGATAT 


GGATAACGTG 


GTGCAACTTG 


CCGATGCATT 


AAGTAAAGAA 


2640 


GAAATAACAG 


AAAATTTAGC 


CATTGCTTAA 


AGGAGAGAAT 


AAGAATGAAC 


GCAGTACATT 


2700 


TTGGAGCAGG 


AAATATTGGA 


CGCGGCTTTA 


TCGGCGAAAT 


TTTAGCTAAA 


AACGGGTTTC 


2760 


ATATTACGTT 


TGTGGATGTT 


AATGAAACCA 


TCATTCAAGC 


GTTAAAAGAA 


CGTAAAAGTT 


2820 


ATACAATTGA 


ATTGGCCGAT 


GCCTCACATC 


AACAAATTAA 


CGTTGAAAAT 


GTGACCGGGT 


2880 


TAAATAACAT 


GACAGAACCA 


GAAAAAGTAG 


TAGAAGCAAT 


TGCGGAAGCC 


GATTTAGTCA 


2940 


CGACGGCAAT 


TGGTCCTAAT 


ATTTTACCAA 


GAATTGCTGA 


ATTAATTGCT 


CAAGGAATTG 


3000 


TV *T\r*f*v\f*f*rn/T* 

A I bCACCj I bC 


LbAAbL/W\ 1 




LbUl uunln 1 


1 rt X V_rOW J. 1 \J 1 


GAAAATATGA 


3060 


TTGGTGGTTC 


AACCTTTTTA 


GCAGAAGAAG 


TGGCCAAATA 


TTTGAAAAAC 


CCAGCTTATG 


3120 


CTGAACAATG 


GATTGGTTTT 


CCTGATGCGG 


CGGTTGATCG 


GATTGTTCCA 


TTACAAAAAC 


3180 


ATGAAGACCC 


ACTTTTTGTT 


CAAGTTGAGC 


CTTTTTGTGA 


ATGGGTCATT 


GATGATACCA 


3240 


ACCGAAAAGC 


CAAAGAGATT 


CAGTTAGAAG 


GTGTTCATTA 


CGTTGCTGAT 


TTAGAGCCGT 


3300 


ATATTGAACG 


AAAATTATTT 


AGTGTAAACA 


CTGGCCATGC 


TACAGTTGCC 


TATACAGGGG 


3360 




CGTTGTTAGG CTATCAAACC ATTGACGAAG CGATGCAGGA CGCCTTAGTG GTAGCACAAC 3420 

TCAAATCAGT TTTGCAGGAA ACCGGTAAAC TTTTAGTGGC TAAATGGAAT TTTGATGAAC 3480

AAGAACATGC AGCCTATATT GAAAAAATTA TTCAACGTTT CCAAAATAAA TATATTTCAG 3540

ATGCTATTAC ACGTGTAGCA CGGACACCAA TCAGAAAATT AGGTGCGCAA GAACGGTTTA 3600 

TTCGACCAAT CCGTGAATTA CAGGAACGCA ATCTAGTGTC GCCGCATTTA TTAGCAATGA 3660 

TTGGTATTGT CTTTAATTAT CATGATCCAG AAGATGAACA AAGCCGTCAA TTACAGGAAA 3720 

TGCTTGACCA AGAAAGTGTT GATACAGTGA TCGCTGAAGT AACGGGCATT GAAGATCCAG 3780 

AAACGGTTAA AAATATTAAA CAAAACGTAG AACGCTATGC GCGACCACAA GTAGCATAAT 3840

AACAAAATCC TTCTACCAAG ATACTTCACA TTTCTTAATT AAAGAAAAAA ACAACCGCTC 3900 

ACTGATTAAA GTGAGGGGTT GTTTTTAACA TATTGACTAA GTGGTTTTTG ATAAAGCTCT 3960 

CCTTTGAAGG CACCTGTATT GAAGTATTCA GCTAATTGGT AACCAATGAA TGGGCCAGTA 4020 

GTGAGTCCTG ATGAACCAAG TCCGCTAGCA ACCACAAGGT GGGGCATTTC TGGCAAAGGG 4080

CCAAAAAACG GAGCGAAATC AGATGTATAA GCACGTGTAC CGACGCGGTA ATGCATGGGT 4140 

TGCTTAAAAA GCTGCTCTGC TTCTTTTAGA AATGGTGCAG TCCCAGAAGT TAGTTGTTGA 4200

AAAGCGCTAA CAGTCTCTTC TAAATCCCAG GCTTGTTCAT TTTCGTGGGT GGCACCTAAT 4260 

AAAATTTTTC CTTGATTAAA AGGAATTAAA tCnGctTCGC CATCTAACAT AGCAACAGGC 4320

CATTGCTGAC TGTTTGTAAA AGGGGTCTCG AAAACTAATA ATTGTCCTTT TTGGGGACGA 4380

ATGTCCACAG ATAAATTTAA AGTAGTGAGT AATTCCTTTA AATGAGGACC AGGTGTTAAA 4440

GCGAGAAAAT CAACGCTCTT TTTTTCTGAA TTTGTGACAA TTTCCCACCC TTGATTCAGT 4500 

TGTCGAAAGT GCGCCCGTTG TGAACAAAAA GTGACCCCTT TGGCTTCGGC GCGTTTCTGA 4560 

AGATGGTTTA GATAAGCTGG TCCATCTAGG CGACCGCCGC CAGAAATAAA AAGTGAGGGA 4620

GTTTCCTTAA GTAGCGGTAA GAACTTCGCT GTTTGAACCG CTGACAACAT CGTGATTTCT 4680
CCAATTTCAG GTGCAGTTTG CTTACGCTCT TCTGCCAAAT GGGCTAAGTC TGCCAAAGCG 4740
GCTGCCGGGC GCAAAATGAC TGTCCCTGAT TGACGATAAA TATCCTCCGT CAGCTGAAAA 4800

TCTTTGACTA GCTTTGGAAA TAGGGCAGCG CCCTCTCGCG CTAATTGGTA CCATTTCTTA 4860

TTCCGACGTT TGGATAGCCA AGGGGAAATA ATCCCAGCGC TCGCTTTAGT CGCTTGGCCG 4920

AGCCCTTCAT CATAAACGGT AATGTCAAAT TTTTGTGGAT CTAAATAATT AGCTAACGTC 4980

ATTCCGATAA TGCCGCCGCC AATAATGGCA ATTTTTTTCA TGCTTCTTCT CTCCTTCGAA 5040

CAGTTGTATT GTAACACAAA AAAACCGCCT CTAGACAGAG ACGATTGGGA GATAACTTGT 5100
TAACGGGTCC TTCTGTGTGT GTAAATAGAA TTGCCGCCTT CTAAAACGTG AACAATTTCT 5160 
TGTGCCATTT CTGCCACAGG CACGGGATCA TAGCCAATAA ATAAGTTCTT TAAAAAGGGG 5220 
ATTTTCTTAA AAAAATAAAT ACATTTACTA AAAAGTATGG TGCCGGTACG TTCTTGACCT 5280 
ACAAGTAAAC TAGGATAAAA AATTACTGCT TCGGCGACTT CTTGGTGGAT TAGTTCCTCC 5340
GCTGCCAATT TGGCATCCAT ATATTTTCTC AAAGGGAAAG GAACAGTATT GGCTGAAATA 5400

AATAAAAACT TAGCTGGCTT TTTTTGTTGT GCTAAAAAAG AGCTGATTTT TTTCACTGGC 5460

GTCAGAATTA GCCGTTGATA GGTTTTCTTT TTTCGTGGAT TTTCAAATAA AATACCTACT 5520

GTATCAATGA CCCAGTCAGC ACGCTGGACT TGTTCTTGCC AATTCGTGTC ATGAGTAACA 5580

TCGGAGCATA CCCACTGCAC TTGGTGGCTC CAAGATGCGG AGAGCGTGGC AGGCTTTCCA 5640

TGACGTGAAA CACTGATGAC TTGATGTTGG CGTTGGACTA AAATTTTCGC GATTTGTTGA 5700

CCTAAAAAAC CGCTACCGCC AAAAATAATG ACTTTCATGG CAAGCCCTCA CTTTCAATTG 5760

TTTCTTCTAG TGTACCTTAA ATAAAAAAAG AAGCAAAAAA TCTCGATTGC GCATAGTTTA 5820

GTTATAAATA ACATAAAAAT AACCTTTTTA GATTGTTGAA ACACAAAGAC CCAACAATGG 5880

TTATTTATTG TCGTGAAATA AATAAAAAAA TGAACAAAAA TGAAATAAAT CAGACTAAAG 5940

GCCTAGAGAA ATGCCCTTGT TTCTTGAATT CAAGCATGGT TAAATAGTTA TAAGTGAACG 6000

ACCGAGCAAT TTAGTTTGTT GGGGAAAATA GACAGAATGA GGAGATGAAA CGAAGAAAAT 6060

GAAGAAGAAA ACGATAATTA TATTGGGGGC AGTTGCGGTA ATTGCGGTTG GGGGCATCGT 6120

AACTGTGAAT GCGTTAAATA AAAATGCACA ACAAGTAGCT GTCAAGCAAG CGCCTAAAGA 6180

TGACTGGGGA ATTGACTATT TTGACGTTCC CGACTTGCAA CAAATTTATA TTAAGGGTGT 6240

CATCCAACCG GAACAAATGG AAGCCTTTGC GCGTGATCAA AAAATAACAA AGGATGCAGA 6300

GATTAAGGTG AAAAACGGCG ATGTCGTAGA TGCAGGCACA GAATTATTTA CTTATGAAGA 6360

TGAGGCGGTC ACAAAAGAAA TTGAGGCACA ACAAAATAGC TTAGCCAAAT TAGAAACGAA 6420

GCGGGCGAAT ATCTATAATA AGTGGAATCG GGCCATTGAT AAATTTAATA AAACTAAAGA 6480

AGAAGACCGC ACGATGTCTG GTGATGATTT AAATGAACAA TATCAAACAG AAGTCGATGC 6540

AGTAGATGAA GAGATTACCT TCACCAATGA AACCTTAGCG GATTTAGGAG CGAAgcAAtA 6600

TATTTCCACA AAGGCTAATT TCAAAGGTCG TGTATCAATT CCAGAAGTAA AAGATGCCAA 6660

TTCACCGATT TTACGGTTAA CTTCAGAAGA TCTTTATTTA GCTGGAAAAG TGAATGAAAA 6720

GGACTTGACT AAAATTAGTG TTGGGCAAAA AGCTAAACTA ACTTCTGTTT CCAACAATGT 6780

GGTTGTGGAT GGCTCAATTT CTTACATCGA TGATAATCCT CCTGAAGGCA ACAGCGATGC 6840

CGCGAGTGGC AATCCAGAGG GCGGCACAAC GATGTCTAGT TATAGCGTCA AAATTGCGTT 6900

GGCCAATTTA GACAAAGTCA AAAATGGCTA CCATATGCAA GCAACCATTG ATTTAGGCGA 6960

TTTAGGGGCG ATTGAGTTAC CGAAAAAAGC GATTCAAAAA GAGGGTGAAC AGGCCTACGT 7020

TTTAGTGAAT GATTTTGGAA CCATCATTCG TCGTGATGTC CAAGTCGGGC AAGAAAATGG 7080

CGACAAAATG GCGATTGAAT CTGGCTTAGA ATCAGCCGAC CGAGTGGTTA TTTCTTCAAA 7140

AAAACCAGTA AAAGTCGGTG ATATTGTTGA ATCAGATGCA GCGATTGCTT CTGATGAATC 7200

AGCAACCAAC GAATCAATGA CAGATGCGTC GAAATAGGAG GGCTTGATGG TGGAAGCAAA 7260

CAATACAAAT CAACCCCTCA TTGAATTGCG GGACATTAAC AAGTTCTATC CAGTAGGTAA 7320
GGAAAAACTA


CATGTTTTAA 


AAGAATTGAA 


CTTGACCATC 


CATCAAGGAG 


AGTTCATTTT 


7380


AATCATGGGG 


AAGTCCGGTA 


GCGGTAAGAC 


AACGCTGATG 


AACATTATTG 


GTTTCTTGGA 


7440 


TCGCTTAACA 


GACGGAAGTT 


ATCATTTCTC 


AGGAACAGAT 


GTTTCAAAAT 


TGTCTGAAAA 


7500 


TAAAAAATCA 


GCCTTTCGCA 


ATGAATATCT 


AGGATTTATT 


TTTCAACAAT 


TTTTCTTGAT 


7560 


TAATTCTTTA 


AATGTTAGTC 


AAAATGTTCA 


ATTGCCCTGT 


GTTTATGAAG 


GGAAAAAAAG 


7620 


CCGCGCTGAG 


AAAAAAGCGA 


TTGCTGAAAA 


ATACCTAAAA 


ATTGTCGGTT 


TGGAAACAAA 


7680 


AGCCAAATCA 


AAAGTTACTG 


AACTTTCTGG 


AGGACAGCAA 


CAAAGAGTAG 


CCATTGCTCG 


7740 


TTCCTTGGTG 


AATGATCCAT 


TATTGATTAT 


GGCTGATGAA 


CCGACTGGGG 


CATTGGATAG 


7800 


TGAAACGGGT 


ACAGAAATTA 


TGGAATTATT 


GAAAGAATTG 


AATGAACAGG 


GCAAAACAAT 


7860 


TGTCATGGTG 


ACACATGATG 


AAGATATGAA 


AAAATATGCC 


TCACGAGTGA 


TTCATATGAA 


7920 


AGATGGTCGC 


TTTTTGGAAG 


AAGAGGTGAT 


TAGATGATTG 


CTAATTTATT 


TGTTAGTACT 


7980 


TTTTTAAGCT 


TAAAGGCACA 


TAAGTTGCGG 


GTATTTTTAA 


CGATGGTCGG 


GATTATTATT


8040 


GGGATTACAT 


CAGTCGTAAC 


GATTTCTGCG 


CTAGGTGAAG 


GGATGAAACG 


TCAAGTTGTG 


8100 


AAGGCATCCA 


GTGCAGTGAA 


CGCGGATGTT 


TTGAAGATTC 


ATTATACAAT 


GTCGGATGGT 


8160 


AGCAGTGACA 


ATTTTATGTC 


CTATGAAGAG 


CCAGATTATA


CGTTTAGCCG 


AGTGGACTTG 


8220 


AAAAAATTAC 


AAGACATTCA 


AGGAATTGAA 


AGTATTTATC 


CTCAATATGG 


TGAATCAATG 


8280 


ATGGGGGGCG 


GAGACAATTT 


GTTTGTACCA 


ATGGATTATT 


TTGGGGCGCA 


AGCGAATTTA 


8340 


TCCATTACGT 


CGACTAAAGG 


GCAGAATGAT 


ATTTTATATG 


GTCGTGATTT 


TCAACCGAGC 


8400 


GATGCAAATA 


CAGATGCGAT 


TGTCTTAAAT 


CATGATATGT 


TTGAAGCGCA 


AATTCGGCTT 


8460 


GATGATCCTA 


GCCAACTCAT 


TGGAAAAGCT 


GTTTCAATTG 


GTGGTTATAT 


GTACAAGGTG 


8520 


ATTGGTATTT 


TAGCACCAAA 


AGATTTAGAT 


AGTTTGGGTA 


TGAACGATGA 


TTGGGCAACT 


8580 


GCTATGAGTA 


GTTTTGTTTC 


TCGAGAATCA 


TACAATAAAT 


TAGCGAAAAC 


AAAAGCAATT 


8640 


AGTGGCATCA 


ATATTAAGGT 


TCGTGAGGGA 


GCGGACCGTG 


AAGCAATTTT 


AGGACAGGCA 


8700 


ATCACCATTT 


TAAGTGAAAA 


TCATCCTGAA 


GTGAAAGGAA 


CGTTTAAAGA 


AAACGACCAA 


8760 


GACCAACAAC 


TCCAACAACA 


AATGGAAGAA 


ATGGTCACGG 


GTATGACAAT 


GTTTCTGATG 


8820 


GCAATTACTG 


CAATCTCCCT 


ATTAGTCGGG 


GGAATTGGCG 


TCATGAACAT 


CATGTATGTG 


8880 


TCTGTCACGG 


AACGGAAACG 


AGAAATTGGG 


ATTCGTCGGG 


CCATTGGTGC 


TAAACCACGA 


8940 


f~> fp /•> t\ m «"p T 1 7\ T 1 


1 L*L.M/\ 11111 


t\r\ i \3\3t\r\\3\^t\ 


ww 111 r\ 1 


PATTGATTGG 


TGGTTTGATT 


9000 


GGTGTCGGCT 


GCGGCTACTT 


ATTGGCAACT 


GTGGTCGGTG 


GGTATATTTC 


GATTACGCCA 


9060 


ATTATTACGC 


CGTCCATCTT 


TGCTATTTCA 


ACATTGGTTT 


CGGTCTTCAC 


AGGTATCTTT 


9120 


TTCGGAATTA 


TTCCAGCAAT 


TGGTGCTTCG 


AGGATGGATC 


CGATTAAAGC 


GATTTATAAT 


9180 


TAAAAAGAAG 


AGCTTGGTAG 


TCACATCTTT 


TTTGATGATC 


CTATCAAGCT 


CGCTTTTTTA 


9240 


TTTCATGTCA 


CTAAGAATTT 


TCTGAATCAA 


TTCTTGTTGA


AACTCTTTGG


TATATTGTTC 


9300 
GGTGAGTTTC


GCTTGTTCTT 


CTTGGGACAT 


TGATGGATTC 


GCCATCATAG 


CAGCCATCAT 


9360 


TTTTTTGCCA ACTGTTTCAG 


TCCCTTCTTT 


TTTGATATCA 


GCGGCATGTT 


GTTCCATATA 


9420 


TTCGCCTAAT 


TTTTTGGCTT 


GATCAGCACT 


TAAAACTAGC 


CAGTCACTAG 


CAACGTTGAA 


9480 


GGTGTTTTCT 


TGTTTTGCAA 


ATTGGTTATC 


GTTGTCAGCA 


AGGCCAAATA 


GTGTTTTGGC 


9540 


CCAAGACTCT 


TGATAGGACC 


ATTTCGTCGT 


TTTTTGGACT 


AATGTGGCTT 


TTTGAGCATC 


9600 


TTTTTGAACC 


GTGTTTGTGA 


CGTTTTCCGT 


TCCTGTTTTT 


TGTAGTTTCT 


TTTTATCTGG 


9660 


TGTGTTATAT 


AAATAAACTT 


TTTCGGAACC 


ATCGCCCAAC 


GGCTGGTACA 


ATAACATGTT 


9720 


GGGGCCATCT 


TTGCTTGAAC 


TACTGCTTTT 


TAGTTCAAaC 


GTTTTGtCCT 


GAGTAACTTC 


9780 


GGTCATACCA 


AAATGATCTT 


TATCGTTTAA 


AACGATAAGA 


CTTAACGAAG CTAAGAACCC 


9840 


TAAACCAAAA 


ATAACCATAC 


AGAATTTATG 


CCAGACTTTT 


CGGCCAAATA 


CATTCGTATA 


9900 


GGCAAAGCcT 


AACACGCAAA 


GGATTAAAaG 


GAAAaTAATC 


ATTATTTCAT 


TGCACCTACT 


9960 


TCCTTAACAG 


GCATCGCTTC 


TTTTTTATTT 


AAGAAGAAAG 


TAATCGCCAA 


GCCGAGAACG 


10020 


CCAAAGATTG 


TTGCCACGAA 


GAAGGCTGCG 


TGATAACCGT 


TTAGTGTTGC 


GTTTGTTGCT 


10080 


TGATCGCGAT 


AAGTGAGTGG 


GGCTGTTTTT 


AATAAGTCCG 


ATGCAGGTAA 


GCCGTCTTTG 


10140 


GTAACATTTG 


TTAAAACACT 


AATCAAAACA 


GCTGTTCCAA 


TTGAGCTGGC 


TACTTGTCGG 


10200 


AAGGTGTTGT 


TTACAGCAGT 


CCCGTGACTT 


AATAAATTCA 


TTGGTAAAGC 


ATTCATGCCA 


10260 


GAAGTTGTCA 


CAGGCATCAT 


GACCATCGAA 


ATACCAAACA 


TCCGAATCGC 


ATATAGGACA 


10320 


ATGATATAAA


TAATAGGTGT 


TGCTTTTGTT 


AAGAAAGCAA ACGGTAACGT 


TGCAGCTGTT 


10380 


AAAATAAACA 


TCCCAGTAAT 


CGCAAGACGA 


CGGGCACCAA 


TTTTATCAAA 


AATGCGACCA 


10440 


GTTACTGGCA 


TCATAAGTCC 


CATGACTAAC 


GCTCCTGGTA 


AAAGCATTAA 


CCCTGAATGG 


10500 


AAGGCAGATT 


CGCCACGAAT 


ATTTTGGATA 


TACAAAGGTA 


AAACCATTTC 


GGCGCCAATC 


10560 


ATTGCCATGT 


TGGTTACGCC 


AGATAAAATG 


GCCGCAACAG 


TAAAGACTTT 


TGATTTAAAA 


10620 


ACGCGTAGTT 


CTAAGAAAGG 


TTGTTCTAAA 


TGCAATTGTC 


GATATGTAAA 


AAGACATAAA 


10680 


ACAATCGCAC 


CAACAATTAA 


GAATCCATAG 


ACTTGCGGGC 


TACCCCAACC 


TTTATCACCA 


10740 


ACGCTTGAGA 


AACCGTAAAG 


TAAGCTACCA 


AAGCCGAGAG 


TAGAAAGAAT 


GACCGATAAG 


10800 


ACATCGATAC 


TTGGATTAGA 


TAATTGAATG 


ACATTTTTCA 


TTAAAAATGA 


AGCTAAAATT 


10860 


AGGACTAAAA 


CGACAATTGG 


AATGACCATC 


CCAAAAAGGT 


CACGCCATGT 


ATAAGAATCA 


10920 


ATAATCCAAC 


CAGATAATGT 


1 GbACC i AAA 


GCGGGTGCTA 


ATCCGATAAC 


AATTCCCGTT 


10980 


GTGCCCATGG 


CAGCGCCACG 


TTTTTCAGGC 


GGGAAAATTG 


AAAGCATAAT 


TGTTTGTAAA 


11040 


AGAGGCATAG 


ATACACCGAC


CCcTGCGGCT 


TGAACAAGAC 


GTCCAGTCAA 


TAACATGCCG 


11100 


AAGTCTTGCG 


CTACAAAACA 


GATGATGGTT 


CCGATTAAGA 


ACGTTGACAT 


CGCTGTTATG 


11160 


TATAATTTTT 


TTGAACTGAA 


TTTGTTAATC 


AACCAAGCAC 


TGATGGGAAT 


CATAATGCCG 


11220 


TTCACAAGTA 


GGAATCCAGT 


GGTTAGCCAC 


TGTACACTTG 


AAGCTGAGAT 


ATCAAATTCT 


11280 
TTCATTAATG


TAGGTAAGGC 


CGTCGTTAAA 


AGCGTTTGGT 


TTAAAATCGT


ACAAAATGTT 


11340 


CCGATTAGTA 


AAACACCGAC 


TAGTAACGAC 


CGGTTATAAG 


GTTTTCCGTG


AATATCCACA 


11400 


GGTTGTTGTG 


AAGTCACATA 


TAtCGCTCCT 


TTTTCTGTAA 


GTTTCAAATT 


AAATTGCATT


11460 


AaCTACTTTA 


CmACTAATTA 


GAATCTtTGT 


CAACTTACTT 


GACAAAGGTG 


TAAAAAAAAC 


11520 


TACAATTAAG 


GAAAGGATAA 


GGGAAAGGAA 


GTTAAACTAA 


TGTGGTCAAA 


AATGATTCAG 


11580 


TGTTTGTTAA 


ATGAAGAAAA 


TTTGTTGATA 


GGAATTAGTG 


AACTAAGTGA 


AATTGCTGAA 


11640 


fITTTCGCCAC 


GTCAACTTCG 


TTATTGGGAA 


GAAAAAGGGT 


ATATTTCTTC 


TATTGCGAAA 


11700 


GACGCAAACG 


GTCCACGGAA 


ATATCGGTTA 


CATACAGTCG 


TCAAAGTGCA 


TTGGATCAAG 


11760 


CGTTTTTTAG 


ATGAAGGATA 


TACGTTACAA 


AGTGCGGTGG 


AAAAAGCAGA 


AGAGCAGCAT 


11820 


CGTAATGTGG 


GGATGACGAA 


AAAAATCTTT 


AGTCAAATGT 


TTCATGGTAT 


CTCAGAAGTT 


11880 


TGTCCCAATT 


ACGTAGCGAT 


TGATTTGGGC 


GATTTCGATG 


AAGAACAAAA 


ATTATATGTT 


11940 


GTATTTAATG 


AAACCACAGA 


AGTCAGTGAG 


TATCTTTTAA 


TTCCTAAGGA 


TGCAGATGTT 


12000 


TCTGATGAGT 



TAAAGAAAAG 


AAAATAAGAA 


TGAAATAGGA 


CTGGGAGAAA 


GACCTGAAGT 


12060 


GAAAGTAAAA 


CGTTACTGAA 


TGAATCTTCG 


AGTTCTTTTT 


TTCTTGCAAT 


TTCTTTATTT 


12120 


TTfTATATM 


TTTCTTATAT 


GGAAAAGGAG 


GGTTTCGATG 


AAAGAAACAA 


AACCTTACGG 


12180 


AACAGTCTTG 


TTAAAAGCGG 


CTAAAATCAT 


GGACTGTCTA 


AGTTTACAAC 


CCGATCAAAC 


12240 


CTTACAGGAA 


ATTGCAAAGA 


ATACCGAAAT 


GACGGCATCA 


ACGGCAATCA 


AAATTTTAGA 


12300 


AACCTTAACA 


TTAATTGGCT 


ATGTTCAAAA 


AGACACGAAC 


AAGACGTACC 


GCTTAGGAAC 


12360 


AAAATTGATT 


CGCTATGCGA 


ATCAAAGTGT 


GGAGCAAATC


GATTTAGCTG 


AAATTGCGGT 


12420 


ACCTTATTTA 


GAAAACCTTC 


AAAATACGGT 


CGATGAAACC 


ATTCATTTAG 


GGATTTTAAA 


12480 


TAATAATGAA 


ATTCTGTATA 


TCAATAAATT 


AGAACCGAAA 


AATCAAACGA 


TTCGCATGTC 


12540 


CTCGAAAGTG 


GGAATTACTC 


GACCTTTGTA 


TAATTCGGCC 


ATGGGTAAAG 


CCGTTTTAGC 


12600 


AGAATTTTCG 


GAAGAACAAG 


TGCAACAATA 


TTTAGACACG 


CAGACATTAA 


TTCCCTATAC 


12660 


CGAAAATACG 


ATTACCAATC 


CCTTGCGTTT 


GAAAAAAGAA 


TTAAAACAAG 


TGCA^CAAAC 


12720 


GGGGGTTGCT 


TACGATGATG 


AAGAAATTGA 


ACAAGATATT 


TTCTGTAGCG 


GAGTATCTTT 


12780 


AATGAAAGAT 


GGCGAAATTG 


CGGGTGCTTT 


TAGTGTAAGC 


ATGCCGAAAT 


ATCGTTTGAC 


12840 


AGAGGAAAAT 


AAAACGACGA 


TTAATCAAGC 


CCTTTTGGCA 


ACAAAAGCAG 


CGATTGAAGC 


12900 


GAAATTATAG


TAGAAAAAbA 




Zl A Z\ (Z T C CZ J\ T T 


f^HATTTTTG 

\J O sjxi 1 1 1 1 X v 


TTTGTCTTTC 

XXX VJ X V-* XXX w 


12960 


TTTTTTTATT 


TTTGTAAAAA 


CGTTTGCATT 


TTCTTTTTTT 


ATCTGTATAA 


TGAAGTTGTG 


13020 


AATTTCTTTA 


TAATAAATAT 


ATATTTCTAA 


ATAAGAAATG


AGGGAGTCTG 


TTGAAGCGAG 


13080 


TATCAATTTT 


ACAAAAATTA 


GAAAATGCTG 


GCGTGATTGC 


TGTGGTTCGC 


GGAAAGACAA 


13140 


AAGAGGAAGC 


GCTAAAAGCt 


AGTCAAGCCA 


TTGTAGCAGG


CGGTATGCGG 


GGGATTGaGT 


13200 


TAACCTTTAC 


TGTTCCTCAA 


GCAACAGAAG


TGATTCAGGA


ATTAGTTGCG


TTATACCAAG


13260 
AGAATCCTGA CGTAGTCATT GGTGCGGGGA CAGTTTTGGA TGCTACGACT GCTCGTTTAG
CTATTGTAGC AGGGGCTGAA TATATTGTGA GTCCAAGCTT CGATGAAGAA ACCGCTAAAA 
TTTGCAATTT ATATCAAGTT CCTTACTTGC CAGGCTGTAT GACCATCACA GAAATGAAAG 
AAGCACTGAA AAGCGGAGCA GATATTATTA AACTATTTCC GGGAAGTGTT TATGGACCGA 
GTGTTATTTC AGCATTTAAA GCGCCATTGC CTCAGTTAAA TATTATGCCa ACGGGTGGTG 
TCAGTTTGGA AAATATGGCC GAGTGGTTCG CTGCAGGTGT GACGGCCGTC GGTGTTGGTG 
GCAATCTGTT AGCGCCAGCA GCAACTGGGG ATTTTGAAAA AGTTCGTGAG GTTGCCCAAG 
CCTATATGGA AAAATTCCAA GCAATCAAAG GGGTGTAGGC ACATGGGGAA AGTCGTAACA
TTGGGAGAAA TCATGTTGCG TTTGTCTACG CCTGTCGGAA TACGCGTAgC ACAAACAGAA 
AATTTCGCTG TACATTACGG CGGTGGTGAA GCGAATGTAG CGATTTCGTT AGCGAACTAT
GGCCACCAAG TGACTTTTGC TAGTAAAGTG CCCGATAATT CATTAGGAGA AGCCGCCAAA 
AAGCATTTAA GTCGTTATGG GGTTTCAACG GAATTTGTAC GAACTGGCGG AACACGTCTA 
GGAACGTATT ACTTGGAAAC TGGTACAgGA GAACGAGCAG CCAGTGTTGT CTATGATCGC 
GCCTATTCAA GCTTTGCTCA AATCGAAGAA ATGGAATGGG ACTTACATGA GCTGTTTTCA 
GGAGTAGAGA TCTTTCATAT TTCAGGGATT ACAGCGGCGT TATCAAAAAA TTGGCAAAGT
ATGACGGTTG CGTTAGTTAA AGCAGCGAAg CAAGCAGGGT GCCAAGTAAG TTTTGACATT 
AATTATCGAG GGAAACTTTG GTCCCAGAAA GAAGCGAGCG TGGCATTGAA GGCCATTCTT 
CCATTTGTTG ATATTTGTTC AGCTGGGGTA TTAGATGCTC GTTATTTACT GGAGATTCCA 
GAACCAGAGG AAGAACTTGA AGACGCATTA ACGTGGTACT ACCGAAAAAT GCAAGAACGT 
TATCCCAACA TTCAAGTATT TTATTCAACC AAACGTCAAG TTCATTCAGC GACAGATAAT 
GAATTAATCG GAACGCTATG GTACAAAGGT GCGTATTATA CGTCCAAATG TCATCGATTA 
CAACCGATTG TTGACCGTGT CGGTGCCGGT GATGCGTTTG CTGGTGGGGT TTTACATGGC 
ATATTAGCTG CCTATGATCC ACAAACCTGC ATTGATTTTG CGACAGCGGC TAGTGCGTTA 
AAACATACTG TCCATGGTGA TTGTAATCAA TTTAGTCAAA AAGAAGTCCT TGCTTTTTTG 
GCAACGGGCT CAGGAAAAAT AAATAGATAG GAGATTTTAG TTATGCAAAA TATGGAAACA 
CGTTATACAC ATAGTCCGGC GGATATCCGT CACTATTCAA CAGAACAATT GAGAGATGAA 
TTTTTAGTAG AGAAAGTTTT TATTCCAGGT GCAATCAGCT TAACCTACAC ACATAATGAT 
CGGATGATTT TTGGTGGTGT GACACCGACC ACAGAAGAAT TAGAAATTAT TTTGGACAAA 
GAATTAGGAG TTGATTATTT CTTAGAACGT CGTGAACTAG GAGTTATTAA TATTGGGGGA 
CCAGGCTTTA TTGAAATTGA TGGCGCCAAA GAAACAATGA AAAAACAAGA TGGCTATTAT 
ATTGGGAAAG AAACAAAACA TGTTCGTTTT TCTTCAGAAA ACCCTGATAA TCCAGCGAAA 
TTTTACATTA GTTGTGTGCC AGCACATCAT AAATATCCCA ACGTAAAAAT TAGTATTGAT 
GAAATTACCC CAATGGAAAC GGGTGATCCA TTAACATTGA ACCAACGTAA AATTTACCAA 
TACATTCATC


CGAACGTTTG 


TGAAAGTTGC 


CAGTTACAAA 


TGGGCTACAC CATTTTAGAA 


15300 


CCAGGTAGCG 


CTTGGAATAC 


AATGCCTTGT 


CATACACATG


AACGTCGTAT GGAAGCGTAT 


15360 


GTTTATTTTG 


ATATGGAAGA 


AGATACGCGG 


ATTTTCCACA 


TGATGGGTAA ACCAGACGAA 


15420 


ACGAAACATT 


TAGTAATGAG 


TAATGAACAA 


GCAGCTATTT 


CACCAAGCTG GTGAATTCAT 


15480 


TCTGGGGTTG 


GTACAAGTAA 


TTATTCCTTT 


ATTTGGGCCA 


TGTGCGGAGA AAATATTACG 


15540 


TACACAGACA 


TGGATATGGT 


TGCAATGGAT 


CAATTAAAAT 


AAACCAAAGA AAGAGGAGAA 


15600 


AACATGGAAT 


TTAAAATGAA 


TGATTTTCAA 


TTAGATGGAA 


AAGTAGCGTT AATTACAGGT 


15660 


GCTGTTTATG 


GGATTGGCTT 


TGAGATTGCG 


AAATCCCTAG 


CTGAAGCAGG AGCAACAATT 


15720 


GTTTTTAATA 


ACTTATCTCC 


TGAATCAGTT 


GAACAAGGCT 


TGGAAAATTA TCGGGCCGCA 


15780 


GGGATTGAAG 


CACGTGGTTA 


TGTGTGTGAT 


GTCACTGATG 


AAGAGCAAGT TCAAGCGATG 


15840 


GTTGCTCAGA 


TTAAAGAAGA 


AGTCGGCTCT 


ATTGATATTT 


TAGTCAATAA TGCGGGAATT 


15900 


ATTAAACGAA 


TTCCAATGGT 


GGATATGTCA 


GCAGAAGAGT 


TCCGTCAAGT GATTGATGTT 


15960 


GACTTGAATG 


CGCCATTTAT 


TATGGCAAAA 


GCGGTTATTC 


CAGATATGAT TGAAAAGGGT 


16020 


CATGGCAAAA 


TTATCAATAT 


CTGTTCAATG 


ATGAGTGAAC 


TAGGTCGTGA AACAGTTAGT 


16080 


GCGTACGCGG 


CAGCAAAAGG 


CGGACTAAAA 


ATGTTAACGA 


AAAATATTGC TTCTGAATAT 


16140 


GGACAATATA 


ATATTCAATG 


TAATGGGATT 


GGACCTGGCT 


ACATTGAAAC GCCACAGACG 


16200 


GCGCCATTAA 


GAGAACCAGG 


GCATCCATTT 


AACGAATTCA 


TCTTGGGACG AACACCTGCT 


16260 


AATCGTTGGG 


GCACACCAGC 


AGACTTAGCT 


GGCCCAGCAG 


TTTTCTTAGC ATCTAACGCT 


16320 


TCTGACTTTG 


TTAACGGACA 


TATCTTATAT 


GTCGATGGTG 


GAATTTTAGC TTATATTGGA 


16380 


AAACAACCAT 


AAATAAGTAA 


CAAAAAGCGG 


ACATACCTAA 


AAAGGATGTC CGTTTTTTGT 


16440 


TATTTACTCA


TTGATTGAAT 


TGGTAAAAAC 


CGTATCTAAT 


TTTTAAAAAA AGTCAATTTT 


16500 


TGACAGTAAT 


TCGTCAAGAT 


TAACTATATA 


ATTTGCTAAC 


ATTCCTTGCT AAACTAAAAT 


16560 


TACAAAAAAA 


TGAAAGCGTT 


TCATTTGCGC 


TGGTGGGATG 


GAAAAGGAGT TTTGATTGCC 


16620 


GATGATTATT 


AAACAGGAAC 


AATTCATACC 


AAAAGATACA 


GTAGAAACAA CGATTGACTT 


16680 


ATTAATTAGA 


AATTTAACAA 


CGATTAAAGA 


TAACACAGGT 


GAATTTCTGT TAGATTTTGA 


16740 


TGGTTTAAAA 


GTTGATGACA 


AAAGTTGGAC 


GATTTGGAAC 


TGGCCGCAAG GCGTTGGGTT 


16800 


GTATGGTATA 


TATAAGAATT 


ATCGAAATAC 


TAAAAGTGAA 


AAAGCTTTAC AAGTAGTTAA 


16860 


tuHL 1 o*j 111 




TGCAAGAAGG 


CGCGCCGCCT 


AAGAATGTGA ATACGATGGC 


16920 


TCCGCTTTTA 


ACAATGGCTT 


ATCTATATGA 


AGACACTAAA 


GACTCTAAAT ATATACCTTA 


16980 


CTTAGAGCAA 


TGGGCGGAAT 


GGGTGATGGA 


AGAAATGCCC 


CGAACGAATG AAGGTGGCTT 


17040 


GCAACATGCG 


ACATATGGAC


CAGAAAATAA 


AAATCAATTA


TGGGATGATA CGTTAATGAT


17100 


GACTGTCTTA 


CCTTTGGCAA


AAATTGGTAA 


GTTGTTGAAT


CGACTCGATT ATTTGGAAGA


17160 


AGCAAAGCAT 


CAATTTTTAA 


TCCATATTAA


ATATTTACAA GATAAAAAAA GTGGTCTATG


17220 
GTATCACGGT


TGGACTTTTG


AGGGAAATCA 


TAACTATGCG 


GAAGCACTGT GGGCAAGAGG 


17280 


AAACTGCTGG 


ATTACGATTG 


CTATTCCAGA 


AATTATTGAA 


ATCTTAGAGT TGCCAAAAGG 


17340 


AGATAGTTTA 


CGGGAATTTT 


TATTGAGTAC 


CCTCAATGCG 


CAAGTGGCAG CTTTAGCGAA 


17400 


ATACCAAGAT 


GAATCTGGTT 


TATGGCATAC 


ATTAATTAAT 


GATTCAAATT CGTATTTAGA 


17460 


ATCTTCTGCT 


ACAGCGGGAT 


TCGCTTATGG 


GATTTTAAAA 


GCGGTTCATA AAAAATATAT 


17520 


TTCTTCTGAA 


TATGAAGAAG 


TGGCAAACAA 


AGCAATTGCT 


GGCTTACTAA ATGAGATTGA 


17580 


TGAAACGGGA 


GAAGTACAAC 


ATGTGTCAGT 


TGGTACAGGA 


ATGGGTGATA ATTTAGATTT 


17640 


TTATCGCACA 


ATTGGAATGA 


CAGCGATGCC 


TTATGGTCAA 


TCATTAACAA TCCTATGTTT 


17700 


GACTGAATTG 


CTTGTTTCTT 


ATTGCTAGAA 


AAGTAAGGAG 


AAAAGTTCAT GAAAAAAAGA 


17760 


AAGGTTTTAT 


TTACAGCAGT 


TATGGTATTG 


GCAGGATTAC 


AGTTGCTAAG TGGTTGCGGC 


17820 


AAAACAGAAG 


CTTCGGCAAA 


TGATACGGTA 


GTCTTGCGCT 


ATGCGTATGC TAGTAATAGC 


17880 


CAACCAGTTA 


TCGATTCTAT 


GAAGAAATTC 


GGTGAATTAG 


TAGAGGAAAA AACAGATGGT 


17940 


AAAGTTCAAA 


TTGAATATTT 


TCCAGATGGT 


CAATTAGGAG 


GAGAAACAGA ACTAATTGAA 


18000 


TTAACACAAA 


CAGGTGCAAT 


TGATTTTGCA 


AAGGTCAGTG 


GATCAGCATT AGAAAGTTTT 


18060 


TCTAAAGATT 


ATTCTGTATT 


TGCCATTCCG 


TATATTTTTG 


ATAATGAAAA ACATTTTTTT 


18120 


AAAGTAATGG 


ATAATCAAGC 


GCTAATGCAA 


CCAGTGTATG


ATTCTACAAA AAAATTAGGA 


18180 


TTTGTTGGTT 


TAACTTATTA 


TGACTCTGGT 


CAACGAAGTT 


TTTATATGAG CAAAGGGCCT 


18240 


GTTACATCTC 


CAGATGATTT 


GAAAGGTAAA 


AAAATTCGGG 


TCATGCAAAG TGAAACCGCC


18300 


ATCAAAATGG 


TAGAACTTTT 


AGGGGGTTCG 


CCAGTACCTA 


TGGGTAGTTC GGAAGTATAT 


18360 


ACTTCTCTAC 


AATCTAATCT 


AATCAACGGT 


GCAGAGAATA 


ATGAGTTCGT TTTATATACA 


18420 


GCTGGTCATG 


GTGGTGTGGC 


TAAGTATTAT 


TCTTATGATG 


AGCATACTCG AGTGCCAGAT 


18480 


ATTGTGATTA 


TGAACGAGGG 


AACAAAAGAA 


CGTTTGACAG 


CGAAACAAGA ACAAGCGATT 


18540 


GAAGAAGCAG 


CAAAAGAATC 


GACCGCTTTT 


GAAAAAACGG 


TCTTTAAAGA AGCGGTTGAA 


18600 


GAAGAAAAGA 


AAAAAGCACA


AGCAGAATAT 


GGCGTTGTGT 


TCAATCAAGT AGACAGTGAA 


18660 


CCATTCCAAA 


AACTTGTTCA 


ACCGTTGCAT 


GAATCATTCA 


AAAATAGCTC AGAACATGGC 


18720 


GAACTGTATC 


AGGCTATTCG 


CCAGTTGGCG 


GACTAATTAA 


GGAGGTAATT ATGAAAGGTT 


18780 


TTAGAAAATT 


TTTAAATCGA 


CTTTTAGAAA 


TACTTGGATC 


AATCATTCTT GCTGCAATGG 


18840 


TTTTGGTCGT 


TTTGTACCAA 


GTGTTTGCCC 


GGACAGTGTT 


AAAAAATCCT AGTACCTTAA 


18900 


CAGAAGAATT 


TGTTCGCTTT 


GCATTGGTCT 


GGATGGCGAT 


GTTAGCTTCT GCTTATGTGG 


18960 


TTGGAAAAAA 


GGGACATTTA 


GCAGTAACTT 


TGTTAAGTGA 


AAAATTACAA GGGAAACAAA


19020 


AACGCTTTTT 


AGAATTTATT 


GTTCAATTGT 


TGTTCCTACT 


TTTTGCAGGA GTGATTATGA 


19080 


TATTTGGCGG 


TTGGAATGGT 


GTAGTAATTA 


CGTTAGGACA


AATTTCACCA TCTTTGGCAA


19140 


TTCCTATGGG 


TTATGTCTAT


TTAGCCGTTC


CAGTAGCAGG


TGTTTTGATG TTTATTTATA


19200 
GTTTGATGAA


TTTGCTAGAG 


TTATTCACGC 


AATCAAAAGA 


TTCTTAATGC 


GTTTATAGAA 


19260 


GAAAAAGAAT 


AGGAGTAAAA 


ATATGGCTTT 


AGAAGCAGGC 


ATTATCTTAT 


TTGTAGTTTT 


19320 


TGCAGTCTTA 


TTGGTAGTTG 


GTATGCCAAT 


TGGTATGAGC 


ATTGCGGCAT 


CTTCAATTGC 


19380 


AACATTGTTA 


TTAGTAATTC 


CTTTTGAAGC 


AGCTGCATTT 


ACCTCAGCTC 


AAAAAATGGT 


19440 


CTCTAGTTTA 


AATAGTTTCT 


CTCTCGTAGC 


AATTCCGTTT 


TTTATTTTTT 


CTGGTGTGAT 


19500 


TATGAATAAT 


GGGGGAATTG 


CCCAAAAGTT 


GGTAAATTTT 


TCTATGTTAT 


TTGTTGGTCG 


19560 


TGTTCCCGGC 


GCGCTAGCAC 


ATACTAATGT 


GTTAGGGAAT 


GCATTATTTG 


GTTCGATTTC 


19620 


TAGCTCTGCA 


ATTGCAGCAT 


CTACTGCTAT 


TGGTGGCGTA 


CTAATTCCGC 


AACAAGTAAA 


19680 


TGAGGGCTAT 


GATCGGAAAT 


TTGCAACGGC 


TGTAAATATT 


GCCTCTGCTC 


CTACAGGAAT 


19740 


GGTTATTCCA 


CCAAGTACAG 


CGTTTATTAT 


GTATTCATTG 


GTGGCTGGTG 


GGGCCTCTAT 


19800 


TTCTTCTTTA 


TTTTTAGGAG 


GTTATTTGGT 


AGGTTCTTTG 


TGGGCGTTAG 


GAATTATGGT 


19860 


GGTAGCTTAT 


GTTATTGCAA 


AACGTCATAA 


TTATCCTACT 


GTGGCAAAAG 


CAAAAAAAGG 


19920 


AGAAGTCAGC 


AAGGTTCTGC 


GAGAAGCGGT 


TCCCAGTGTA 


TTGTTAATTG 


TCATTATTAT


19980 


TGGTGGTATT 


TTGACGGGGT 


TATTTACAGC 


GATTGAAGCA 


TCAGCAATTG 


CTGtTGCATA 


20040 


CTCATTATTG 


ATTTCAATGT 


TTTACTACAA 


AACAGTGAAA 


ATAAACGATT 


TACCTAAAAT 


20100 


GTTAAAAGAG 


GCTGTCTTGA 


TGAGTGGAAC 


CATTATGTTC 


TTGTTGGCAA 


CTTCTTCAAT 


20160 


GATGTCTTTT 


GCGATGGCTT 


TTACAGGGAT 


ACCAGAAGCA 


ATCAGTACAG 


CTATTTTAGG 


20220 


AATCACAGAC 


AATAAATACT 


TAATTCTACT 


ATTAGTAAAC 


GTTGTCCTAT 


TGTTTGTGGG 


20280 


AATGTTTATG 


GATGTTGGTC 


CAGCTATTTT 


GATTTTTACG 


CCAATCTTTT 


TACCAATTGT 


20340 


TATGAATGTT 


GGGGTGGATC 


CAGTTCATTT 


TGGTTTATTT 


GCGATTATGA 


ACTTGTGTGT 


20400 


TGGTTCAATT 


ACACCACCTG 


TTGGAACAGG 


ATTATATGTT 


GGTGCTAGTG 


TTGGGGGTGT 


20460 


TAAAGCTGAG 


CAAATGCTGA 


AACCACTAGT 


TCCATTTTAT 


CTGGTAATCT 


TAGCAATTTT 


20520 


ACTTTTAATC


ACATATGTAC


CTGAACTTGT 


CATGTGGTTA 


CCAAATCTAA 


TTTCTTAATT 


20580 


CAATACAAAT 


TGTCTAACAA 


CCCCAATTGA 


GCGAAATATA 


AGTTGTTCAA 


TCAACTAGAT 


20640 


TTGGTAATTT 


ACTATAAAAT 


ATTGAGTTTA 


GGAGGCATAG 


AAATAAAAAA 


TCTCTCTATG 


20700 


ACATTATGTG 


TCATAGAGAG 


ATTTTTTAGA 


AGCCACTTTT 


ACGGTATTGA 


CTAGGAGTTA 


20760 


ATCCAGTTCG 


CTTTTTAAAT 


AATCGACTAA 


AGTAGGCATA 


GTTTTCTAAA 


CCAACATCTA 


20820 




ill un^ i \j ± x 


TGATTAGAAT 


AGCGTAGTAA 


ATCTTTTGAA 


TAAGAAATTT 


20880 


TTTTACTAAT 


TTGATATTTA 


ATTGGCGGTA 


ATCCAATGTA 


TTTGGAGAAC 


TCTTTGTTCA 


20940 


ATTGATATTT 


ATTTAAATGA 


AAGAGTTGAG 


CCAAGTCTTC 


TAAAGAAATT 


GTTTCTTTGA 


21000 


AATGTTGATC 


AATGTAAGCT 


TTCATTTCCA 


AAACCAATGA 


TGGAATCGTT 


TCGTAGGTAA 


21060 


AGTCTTGACG 


ATACTTTTGC 


AACAATAATT 


CGTTCAGTAA 


TTGATGAATT 


AAGACGGAAG 


21120 


ATTGGTAATC 


AGTTTGCGCA 


TGTTGTAAAT 


CTTGTAATTG 


TAATAATTGC 


TGAATGATTA 


21180 
AATGAATAGG
TAAATTCTTG 
AAGGATCTTC 
CACCTCTCTG 
TGAATTTAAT 
TAAACTGTCC 
AAATATGTCG 
GAcCTCCTAA 
ATTTTTTGTT 
AAGGATTGAC 
CGGTTGATAT 
AGATAACAAT 
AGCGTTGGGA 
AATTAGGTAT 
TGGATCAATC 
CAGCTGTAAC 
TTCAAGTTCA 
CTGCAGCCAG 
CAGTCATCGA 
GGGAAAGTGA 
AGCCGGGCAC 
ATGCAACGTT 
GCATTGGCGA 
CGACTGTTAC 
CCCAGAACAC 
CACGGCACCA 
CAGCATTTCA 
TTACTGAATT 
AACTCGCTCG 
TAGCTCAATT 
CCAATAATCG 
GACCTGGGGA 
TTAAAGATAT 



ATTATCTTCT 
GAAAAAAGCA 
GCTGGTTGCT 
TAAGCTGTAT 
TAAGTATGAT 
AATTTCCTGT 
CTTAGAATCG 
TTACCAAGAT 
TATGATGAAT 
AACTATCAAC 
TGGCGCTTCT 
AGAAGAAATT 
TATGGCTAGT 
CAAGGAATGC 
TGGACAACTT 
AAATTTTTCA 
GCCTTTTAAT 
CCAATTGTTG 
AGCAACCAAC 
ATTACTGGAT 
GATTTTAGGC 
GATTACGATA 
TGACTGGGCG 
AACAATTTCT 
AATTCGTTTC 
AAACTACCAA 
ACAATTTATT 
ACAAGCCTAT 
TTGTATTTAT 
AACAGGAATC 
TTTATTGAAC 
AGCAATTGCT 
TCCAGCAGCT 



GGAAGCTGAT TAGTGGAGAA AACTGGTGAA CCATTTTTCA 21240

CCAGAATTTC CTCCTTCAAA ATGAATCCAG TCCATTTCCC 21300 

TGATAGTATT GATAATCCCG ACAGTCAATA AAGAATAAAT 21360 

GTATGATAAC GATAGCGTAA TTCCCCAGTA CCATTTAGCG 21420 

GGTAAATGTG CCCGCTCCGT ATAATAAGGG CGCGAGGCTT 21480 

ATGTAATAAA AAAGTGTGCG ACTTTTGCTG CTAGGTGTCA 21540 

TCTGACCAAG AGTAGTCGGC TTTTTCCCaA TAATCCaTAA 21600 

AGTACAATTA TATCTCAAAA TTGTGAGAGG TGAAAGCGCT 21660 

ACATAAAGTT AGTCTAACGT ACAAAAAATA AAAATAGTGA 21720

AAAGTGAmAA AAGAACAAGT TGAAAAACGA ACCTATTTAG 21780 

AGCGGTCGCA TAATGAAAAG TCAACGCTTA GCGAATGGTC 21840

CATCGTTTTA AAAATGGTTT TCATCAAAAA GATGGGTATC 21900 

TTAGTTCATG AGTTACTGCT TGGTCTGCAA AAAGTTAAAC 21960 

TTTATTGGAA TTGATACTTG GGGTGTTGAC TATTGTCTGC 22020 

TTGGACGAAC CGATTGCTTA TCGAGATGGA CGGACAGAAG 22080 

AAAGGCTATT CATTAGAAAA ATTGTACCAA CAAACGGGGA 22140 

ACCATTTTTC AGTTATTTGT AGAGGAAAAG GAACGTTTGG 22200 
TTAATCCCTG ATTACTTAGG TTATGTCTTT ACTGGAAAAG 22260 
GCTTCCACCA CGCAACTGTT GAATGCTGGA ACGAAACAAT 22320 
TTCCTAGGAA TTGATGAAAC ACTATTTCCA ACATTAGTAG 22380 
GACCTGCAAA CAGCGGCTTT TCCAGACTAT GATTTACCAA 22440
GCTAGTCATG ATACAGCTTC AGCAATTTTA GGTACGCCAG 22500 
TACATTAGCA GTGGTACTTG GTCTTTATTA GGCATTGAAA 22560 
GCTGAAGCTT TCCAAGAAAA TTACACGAAT GAATGGGGCG 22620 
TTAAAAAATA TTATGGGTAT GTGGCTAATT CAAGAAGTGG 22680 
TATAGCTATG CTGAATTGGC CGCCTTAGCT GAAAAAGAAC 22740
GATGTAAATG ATCCTCGTTT TTTAAATCCG GGAAATATGA 22800 
TGCCGTGAAA CACAACAAAC AGTTCCTGAA AGCCCTGGTG 22860
GATAATTTAG CATTGTGTTA CAGCGTAGAA CTAGAAAAAT 22920 
GAACGGAAAA TTACAACTTT ACATGTTGTC GGTGGTGGTT 22980 
CAGCTAACTG CTGATGTTGC TAATGTTACA GTGAATGCGG 23040 
TTAGGTAATC TTTTGATGCA GATGATTGCA ACAGGTGAAC 23100 
CGTACTTGTA TCCAAACATC TTTTCCCACG GAAATCTATC 23160 
AAGCAAATCC


AATTGATTCA 


ACAATAAAAA 


ATCGCTATCA 


AGCATTTATG 


AAAAGGAGCT 


23220 


CATTATGACA 


ACCATTACAC 


AAAAATATGA 


AGAAGCGAAA 


GAAAAATATG 


CCAGTATTGA 


23280 


CGTAGATACC 


GAAGCTGTAT 


TGGAAAAAAT 


GGCTGATGTC 


AAAATTTCAA 


TGCATGTGTG 


23340 


GCAAGGAGAT 


GATGTGAGAG 


GCTTCTTAAG 


TGAGGATGAA 


CTTTCTGGTG 


GTATTTCAGT 


23400 


TACAGGGAAT 


TATCCAGGCG 


TTGCTCGAAG 


TCCCCAGCAA 


CTACGTCAAG 


ATTTAGAAAA


23460 


AGCGTTTTCT 


TTAATTCCAG 


GAAAACACAA 


ATTGAATTTG 


CATGCTATTT 


ATTTAGACAC 


23520 


AGAGGAAAGG 


GTGGATTTAA 


ACGAACTGGA 


ACCAAAGCAT 


TTTGAACCAT 


GGGTGACTTG 


23580 


GGCAAAAGAA 


AATGGCTTAG 


GATTGGACTT 


TAATCCAACC 


TTTTTCTCGC 


ATCGAATGTA 


23640


CCGCGATGGC


TTTACTTTAG 


CTCATCCGAA 


TCCGCAAGTT 


CGAGACTTTT 


GGATTGAGCA 


23700


TGGGAAACGT 


TCACGAAGAA 


TTGCTGAATA 


TTTTGGAAGA 


GAATTAGGAC 


AAGTAGCTGT 


23760 


CAATAACTTT 


TGGGTGCCAG 


ATGGCTTTAA 


AGATAACCCA 


GTTGATCGTT 


TAACGCCGAG 


23820 


AAAGCGGCTA ATGgcTTCGC 


TAGATGAAAT 


TTTTTCAGAG 


GAAATTGATC 


CAGCTTATAC


23880 


AGTCGATGCG 


ATGGAAAGCA 


AGCTCTTTGG 


GATTGGTTCA 


GAAGCCTATA 


CAGTTGGGTC 


23940 


TCATGAATTT 


TATATGGGCT 


ACGGTTTAAC 


GCGGAATAAA 


TTAATTTGTT 


TAGATGCTGG 


24000 


TCATTTTCAT 


CCAACGGAAG 


TGATTTCTAA 


TAAGTTATCT 


TCATTATCCT 


TATTTGGCGA 


24060 


AGGAATGCTA 


TTACATGTCA 


GTCGTCCAGT 


TCGCTGGGAT 


AGTGACCACG 


TTGTCATTAT 


24120 


GGATGATGAA 


TTGCAAGAAA 


TTGCCAAGGA 


GTTAGTTCGC 


AATGATTTAT 


TAGGAAAAAC 


24180 


ACACGTGGGC 


TTAGATTTCT 


TTGATGCCAC 


AATCAATCGT 


GTTGCCGCTT 


GGGTGATTGG 


24240 


CACCCGCAAT 


ACACAAAAAG 


CGTTAATGAA 


AGCGATGCTT 


GAGCCAACAA 


ACGTCTTAAA 


24300 


AGAAGCTGAA 


TTGATAGGAG 


ATTTTACCAC 


ACGTTTAGCC 


TTAACGGAAG 


AGTTAAAAGA 


24360 


CTTTCCATTT 


GCCGATATCT 


GGAACTACTA 


CTGTCAAGAA 


AATCACGTAC 


CTATCGGCTT 


24420 


AGATTGGTTA 


ACGGATGTTC 


AAGAATACGA 


AAAAGTGATT 


TTACCAACTC 


GTCAATTGCC 


24480 


AACAGGTAAA 


GATAGTTGTC 


GTTTTTCATA 


AAATTAACAA 


ACGAGAGGAA 


CTTCTAAAAT 


24540 


GAAAAAAATG 


AACGTGTTAC 


AAGCACCATT 


TGTGGAAGAA 


ATGGTCAAAA 


CGACCAAAAA 


24600 


TTTATATCGT 


TTAGGTTGGG 


ATGAACGAAA 


TGGCGGAAAT 


ATTTCGTATT 


TATTAAAAGA 


24660 


GGAGGAGATT 


TTACCTTTTC 


TAAATCCAAC 


ACAAGTGCTC 


CGAAAGATTC 


CAATGAAATT 


24720 


TGATGCAACG 


AAATTAGCAG 


GAAAATATTT 


TATCGTGACA 


GGCTCAGGAA 


AATATTTTAA 


24780 


AAATGTCTGT 


GATGCATCAA 


GCGAAAACTT 


AGGGATCTTA 


CGAGTTAGTb 






GGAATTAGAA 


CTATTGTGGG 


GCTTAGAAGA 


TGAAGCTGTA 


CCAACGAGTG 


AATTACCTAG 


24900 


CCATTTTATG 


AGCCATATTG 


CTCGTTTAGC 


TGTTGATCCA 


GAAAATCGCA 


TTGTCATGCA 


24960 


TAATCATGCG 


AGTCATTTAT 


TAGCAATGAG 


TTTTACTCAC 


GAATTAGATG 


AAAAAGTCTT 


25020 


TACTAGAACA 


TTATGGCAAA 


TGTGTACCGA 


ATGTTTAGTA 


GTATTTCCAG 


ACGGTGTTGG 


25080 


TATTATTCCG 


TGGTTGGTTC 


CTGGAACGAA 


TGAAATTGGT 


GTAGCAACGG 


CTGAAAAAAT 


25140 
GAAAGAATCT
GGATGAAGTA 
TTGTGCGCAA 
AGCGTTTGGT 
GCTATGATAA 
CGACGTCATG 
AAGTATCAAA 
GATGAAGCTC 
ATGGAAGATA 
GTTTTTGAAC 
AAAACAACTT 
CCTTTGTGGT 
TTATTCAAAA 
AAATCCGGAA 
AAGATATGGT 
GTACGTTAAA 
GGACCGTTTT 
ATTTCGAACG 
GGACAAACTT 
TGGATGTAGC 
AATATGGTGC 
AATTAGTTCT 
TTGAAGCAAT 
TGGAAAAAAA 
GATTTACTTT 
AAGAATATCG 
TTTTGAAATC 
AGAATAATCA 
AGTAAAATGT 
GAATTGGCTA 
TGTTATCCAG 
AATATTTTCG 
CATCCAGATT 



CGTTTGGTTT 
TTTGGTTTAA 
GGAGGCGTTC 
GTGACACCGA 
AAAAAGCCTT 
AGAAGTTATG 
TTTTTTTAAA 
GCTGGGAGCA 
TTATGGAAAC 
TATAATTTAA 
ACTTGAGTGG 
AACGGCTGTT 
AATAGTTAAC 
ATCACAAGTA 
TAGTCCAGAA 
AGGGGTAGCA 
ACTCATTCAT 
GATGTATATT 
GTCTGCTTGT 
AAAATTAGCA 
GGATGTTCGT 
GAAAGAAGAG 
GATTACCTAC 
TTTTTTTGTG 
CCATCAATTT 
CAGTGCTAGT 
CTTTAAAAAA 
GATCATTCAT 
TGACACTTGC 
AAAAAATTGC 
AATATGCAAG 
AAGATGAAAA 
TAGAAGAGCG 



TATGGCCGCA 
TTGAAACTGC 
GCCAAACAAT 
AAGTTGGCTA 
TTGTATGCAA 
GCCAGAAATG 
TCCTGAAACC 
AATAGCACTG 
AAATCCTGAT 
AATAGAACAG 
AGTTGATTTC 
CCAGAAAGTA 
TTCTAAAAAA 
AATAGTATGA 
AAAACCATCT 
ACGTTCTTTT 
TATCATGATT 
TTTTTAACAC 
TTTCAACATT 
AACTACTTGA 
TATGAACAAC 
AATGAGAGCC 
ATTTCTCAAA 
ACTAAGTATT 
GTGCTGAAAA 
GATGTTTACC 
GAGTTTAATA 
TTTGATCATT 
AGTACGTGCC 
TGAAACGAAC 
TGCTGCTGTT 
GATTACGATT 
AGAAAAACGA 




ACATGGTATT 
CGAAAAAGCC 
TTCAGATGCA 
TTTGGAAGAG 
GTATATCCTG 
CGTCAAATGC 
AGTACACTTT 
ACACCAATCA 
TGCTCGCCGA 
TTTGTTATAG 
AAGGGATTGT 
ATAAATTGTA 
AGAGGTGAAT 
TTTCAGATCA 
ATCATTACCA 
TAGATGGACA 
TACATCGAAT 
CCGATTTTTT 
TTGGTCAGCG 
CCCCGTTAGA 
AACTGTTAGA 
AACCAAAACA 
ATTTGGATCA 
ATGTAACGCG 
AGAAACTACT 
TGAAATGTGG 
TGACACCAAA 
ACGAAGAATC 
CATGATGTTG 
ATAACTTCAA 
CTATCTCCAG 
AGCGTCCTAA 
CTGCATCAAT 



TATGGTACTG 
GCTGAAGTGT 
GATTTATGGC 
AAAGTATCAA 
ATCAACATGC 
TGAAAGAACA 
TTGGTTATTT 
ATCAAAAATG 
TGACGGCTGA 
TGAATCGTAT 
TCTGGAACTT 
CCAACATTAG 
CTTGTTGAAG 
ATATGAGATC 
CGATTTTTAT 
TCAATTTGAT 
TATTAAGCAG 
ACAACAACGA 
TCGTAGTAAA 
TCATTCTCCT 
CTTTTTAATT 
AATGATTGAA 
ACCATTAACC 
GGAATTTAAG 
TTACGCTAAA 
ATTTAAATCA 
AGAATTTTTG 
TATCAAGAAA 
GCACCAAAAT 
TTCAATTTGC 
GACTAGGAAG 
GTTGTTATAT 
TTGAAACATA 



GCCGTGATAT 
ACACTTATGT 
GCTTAGCAGA 
AAGGGAGAAA 
CGAATATCAA 
TGGGGCAATT 
AGAAATAGAA 
GTGGAACTAT 
ATTAAAAAAA 
GATTGTACAC 
ACAAAGAAAG 
CTGTAACTGT 
AGTGTAAAAG 
TTTCATGTAA 
GAAGTCCATT 
GTTGAAGCAG 
AGTACTGATG 
TCTAGTAAAC 
GTTATCAAAG 
TTGCCaGAAG 
TAGTTAAATC 
AATGAACGAA 
TTAGAACAGA 
AAACATACAG 
CAGTTATTAA 
TATCGACACT 
GTGCAACATA 
GTGAGGCTTG 
ATCTCCTGTG 
CTTAGGTATT 
CCAATTGAAG 
TAATCTGATT 
CATTCGTTAC 



25200 

25260 

25320 

25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 
GATTCAATGT


TTGGGGCAAA 


AATCGTTGCA 


ACTGAAACAG 


GCAGTATATA 


TGAAACGATT 


27180 


TATTACACTG 


AAGATAATTA 


TACAGAAGAA 


GCCTACGAAG 


AAGTTTTATT 


TTCTGTTAGA 


27240 


AGATTATGTC 


GAGAAGCTGA 


AAAGTATGGG 


ATAATTGTTG 


GGATTGAACC 


AGGAATCAAT 


27300 


CATCCAATTC 


ATACAATTGA 


AAAAATGCAA 


CGATTGATTG 


ACGAGGTTGC 


TTGAACTAAT 


27360 


TTAGGTATTA 


TTTTGGATCC 


TACCAACTTA 


ATTCGAGTAG 


ATATAGATAA 


AACGTATTTG 


27420 


GAAATAGTAG 


AAGAAGCGTT 


TGAGTGTTTT 


GGTGAAAAAA 


TCGTTGCTTT 


TCATTTGAAA 


27480 


GATTTCATTA 


TAAGAAATCA 


GCAGATATTT 


CCGGTAGCCA 


TTGGCGAAGG 


GCAGGTACCT 


27540 


CTTACAGAAA 


CCATTCAATT 


TTTGAATAAA 


CATAAGCCCG 


GACTGTTCAC 


TATTTTTGAG 


27600 


GAAACACCAT 


TTGAAAAAAT 


aGCAAGTGCC 


TaTCAAAAAG 


TTTCGcAATA 


TCATTCCATT 


27660 


TAGATaAAAA 


TGTGCAACAa 


AAAACATCTC 


GATTTTTTGA 


GATGTTTTTG 


TTGCTTAATA 


27720 


AGACAAAATA 


TAAAATACAC 


TTGCCTTAAA 


AACTCACATA 


AAAGAAAGAA 


GCAATCGCCA 


27780 


CAGCAAATGA 


AATAATAGCA 


AACACTAAAC 


TACTCCATAG 


TCCTAACAGA 


ATAAATGGTG 


27840 


AGGTTCTTTT 


ACCGCCCTCT 


TTCTTTAAGT 


AGGAAAATGT 


CCGTCCAGTA 


ACAATGAATT 


27900 


GCCAGATACT 


AATCAATAAA 


AAAATACCGC 


CAATTAAGTA 


ACGCATGTCC 


TCGTCCTCCT 


27960 


TCTATTTAGT 


ATACCACTTG 


ATGaAAACGT 


TATCaAAACG 


CTGTATAAAA 


AAGTTGCCTT 


28020 


TCAAAAAAGA 


GAAAGGCAAC 


TTTTTGACGG 


TTTAACGGAC 


GTCACCCATT 


AAGTTTTGGA 


28080 


TTGGTTTTTT 


CACAATGAAT 


AAGAGAATCC 


CAGCAACGAC 


AGCTACAAGT 


CCGACAATCC 


28140 


CAAAATATGC 


TGACTCTGTT 


CCAGGTGTAT 


AGAAACGAGC 


AATTTGAGCG 


TTAATGGCTT 


28200 


GTGAAGCGGC 


ATCTGCTAAT 


AACCAAATAG 


CAACGGTTTG 


AGCTTCGAAA 


GCTTTCGGTG 


28260 


CTAATTTTGT 


AGTAATCGAT 


AAGCCAACGG 


GAGATAAACA 


CATTTCTCCG 


ACAATCATAA 


28320 


TGAAGAAACT 


AAAGAATAAC 


CAAAGTGGAC 


TCACTCGTCC 


TTCGGTACCA 


TAAAGTAGAC 


28380 


CTGGCAACAT 


TAATAAGACG 


AAGGAAAGAC 


CAGCAAAGAC 


TAGCCCTAGT 


GAAAACTTCA 


28440 


CAACTGTTGA 


TGGTTGACGT 


TTACCTAGTT 


TTGTCCATAA 


TGTAACAAAA 


ATAGGGGTCA 


28500 


AAATAACTAC 


GAATACAGGA 


TTTAATGATT 


GGAACCAGCT 


AGCGGCAATT 


GGGAAGCCAA 


28560 


ACAAACTTGT 


TTGTGTCCGT 


TCATTTGcAA 


ATAGAGCmAG 


AATAGACGAA 


CCTTGTTCTT 


28620 


CCmATGgACC 


AGAAAACmAT 


TGCTGCTAAG AATwATGGcA gTAAGCCAGA ACTTTTGGnT 


28680 


TTTCtTCAGC 


AGTGACATCT 


TTAGACGTTA 


ACATTTTGCT 


GAAtAATAAA 


CTGGTAaCAG 


28740 


GATTCCTAar 




TATTAATAAA 


GAAATCAaTT 


GTTAAATGTC 


CTGTGACATA 


28800 


GGCGCCACCA 


AAAATAAGTA 


AAGCAACGAC 


AATTGCCAAC 


ATAAATGCTT 


TAGCAAATTT 


28860 


CTTTTGCTCT 


TCTTTAGACA 


TTGGATTAGT 


TGGGGCTTGA 


CCAATCCCAG 


CAAGAGATTT 


28920 


AGGTCCTTGG 


AAATAATATT 


GTAGTAGCCC 


GAAGAACATC 


CCTACTGCGG 


CAACGGAAAA 


28980 


GCCTAAATGA 


TAATTATATT 


CTTGTCCTAA 


AGTTCCGACA 


ACAAGGGGAG 


CGATTAAGGC 


29040 


TCCGATATTA 


ATCCCCATGT 


AAAAAATGGA 


GAAGCCTGCA 


TCTCTACGTA 


AGTCAGTTTT 


29100 
TGAATAAAGA TGGCCAACCA TCCCTGAAAC GTTTGGTTTt AACATCCCAG TCCCAAATAC 29160

AATCAACATA ATAGAAACGA ATAAGGCGGG GACACCAAAT GGGGTAGCGA GAACAATATG 29220 

TCCAGCAATG ATTAGCAAAC CACCAATAAA AACAGTTTTC CTTGAACCTA AGACACGGTC 29280 

AGAGACCCAA CCGCCAATGA TACTGGACAT GAAAACCATT GATCCATAAA TGGACATGAT 29340 

GGCTAAGGCA GTTGCTTTGG GGAGACCTAA ACCGCCATTG GCTACGGTAT CGTAAATGTA 29400

ATACATTAGA ATAGCACGCA TGgCCGTAGy AACTGAAACG TTCCCACATT TCTGTGAAGA 29460

ACAGTGTTGC TAGTCCTTTC GGTTGTCCCA GAAAGGATTT GTCTAATTTC TCAAGTTGTT 29520 

CTTTGTCACT CATAGATAAA ACTCCAATTC TTATAATATT CAGAATATTT GATAATAAAA 29580 

TGACTTAAAT GTAGATTCAT ATATTTTATT ACTCATAGAT TGGAAAATAG AAAGAGAAAA 29640

AACTGATTAT GTTTTTAAAT AACAAGCGTT CGAGCGATTG TTTGTTTTTT ATTCGTGAAT 29700 

TATGATAAAA ATGAATATTT ATTCTGCTT 29729
(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7237 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239:



GGTTATGAGA 


GCAGGGATTG 


ATGTTGGGTC 


AACGACAGTA 


AAATTAGTAT 


TTTTAAATAA 


60 


GCAAAATCAG 


TCTATTTTTA 


CAAAATATGA 


ACGCCATTTT 


TCAGATGTCA 


AAGCAGCAAC 


120 


GGAACGCATT 


TTAAAAGAAG 


GGCTAGCAAG 


GATCGGTGCA 


GACCAACCTG 


TGACAATGAG 


180 


CATAACTGGC 


TCTGGTGGTA 


TGGGGCTGGC 


TGAGGTATTA 


GGGATTCCCT 


TTGTTCAAGA 


240 


GGTTATCGCG 


TGTACACGAA 


CTGTTGAAAC 


AATAATACCA 


GAAACGGATG 


TTGCTATTGA 


300 


ACTTGGCGGA 


GAAGATGCAA 


AAATCACTTT 


TTTTGATGGT 


GCTTTAGAGC 


AACGAATGAA 


360 


TGGTAGTTGT 


GCAGGTGGGA 


CAGGCGCCTT 


CATCGATCAA 


ATGGCTGTTT 


TACTTAAAAC 


420 


GGATGCCAAT 


GGAGTGAATG 


AATTGGCGAA 


GAATTATCAA 


ACTATTTATC 


CGATTGCTTC 


480 


CCGCTGTGGG 


GTTTTTGCCA 


AAACGGATGT 


TCAACCTTTA 


ATCAATGAAG 


GAGCAGCGAA 


540 


AGAAGATATC 


GCGGCGAGTA 


TTTTTCAAGC 


GGTAGTTAAT 


CAAACAATCG 


CCGGGTTAGC 


600 


TGCTGGTCGA 


AAAATTAAAG 


GTAAAGTAGC 


ATTTTTAGGG 


GGACCTTTAT 


TTTTCATGTC 


660 


AGAATTGAGA 


AAACGGTTTG 


TTGAAACCTT 


AGCAATTCAG 


ACAGAAGATG 


TTATTTTTCC 


720 


AGAACAACCG 


CAACTGTTTG 


TAGCAATGGG 


TGCAGCACTT 


TATTCAGAAG 


ATGCAGCCGA 


780 


ATGTACATTG


GAAGAATTAA 


TTCAGCGCTT 


AGTGAACAGC 


CAGGCTACTG 


ATTTACAGCC 


840 


GAGCGACACG 


TTACCACCGT 


TATTTAACAG 


CGCAGAAGAT 


TTGGCAGATT 


TTCGAGAAAG 


900 


ACATGCGCAA gCCCAAGCAA 


CGGAGAAACC 


TTTATCTCAA 


CATACTGGCG 


TGACCTTTTT 


960 
AGGGATTGAT GCAGGCTCCA CAACGACAAA AGTCACATTG ATTGATGAAG ATGGTCATTT 1020

GTTGTTTTCT TTTTATGGTA ATAACCAAGG GCAACCTTTA GAA&CCAC£A TGACAGTACT 1080

GAAATCACTT TATCAACAAC TGCCTGAAGA AGTTTTTATC GGAAAAGCGG CTGTTACAGG 1140 

aTATGGAGAA CAGCTAATAA AAAATGCGTT AAAAGTCGAT ATTGGTGAAG TTGAAACGAT 1200 

GGCCCATTAC AAAGCAGCCA ACCATTTTCA GCCAGGTGTG GATTTTATTT TGGATATCGG 1260 

AGGACAAGAT ATGAAAGCGA TGACCATTAA AAATGGCGTG CTTTCCTCGA TACAATTAAA 1320 

TGAAGCTTGT TCTTCTGGGT GCGGTTCCTT TATCGAAACC TTTGCGCAAT CTTTAAAATA 1380 

TAATGTGAAG GATTTtGCTT TGGCGGCTTT AGAATCGCAA GCGCCTGTTG ATTTAGGTTC 1440

TCGGTGTACT GTTTTTATGA ACTCAAAAGT GAAACAGGTT CAAAAAGAAG GGGCCTCCGT 1500 

CGGCGATATT TCTGCTGGAC TATCTTATTC AGTCATCAAA AATGCCATTT ATAAAGTCAT 1560 

TAAGGTTCGC CGACCTGAAg AGTTGGGAGA AAAGATTGTC TGCCAGGGAG GAACtTTTAC 1620 

AATGAnGCTG TCTTGAGAGC TTTTGAnAAA ATTAGCGGTC GTGAAGTGGT TCGTCCTTCT 1680 

ATTGCTGGTT TGATGGGGGC CTATGGAGCA GCGCTGATTG CTTTGGACGC TTATGAAATT 1740

GGGGAAGAAA CCACGCTTCT TTCTTTAGAA GAGTTAGCGG CATTTACCTC TGAAAAGGAA 1800 

TTTACTCATT GCGGCTTGTG TGAAAACAAT TGTCAATTAA CGGTCACAGT GTTTTCAGAT 1860

GGGCGCCAAT TTATTACGGG AAATCGCTGT GAACGTGGTG CACGAATTAA AATTAAACGA 1920 

GAAGAACGCA AAgTAAATTT GGTTGATTAT AAGTACCGCA AGTTATTTAA GTACCGACCT 1980 

TTGCGGGAAA ATAAAGCGTT TCGTGGGCGC TTGGGTATCC CTAGAGTCCT AAACATGTAT 2040

GAAAATTATC CTTTGTGGCA TACATTTTTT ACTGACTTAG GCTTTCGGGT CGAATTATCG 2100 

CCACGTTCTA ATAAGCAAAT CTATGAACAA GGCCTTGAAA CGATTCCCAG TGATACGGTT 2160 

TGTTACCCAG CGAAAATGGC GCATGGACAT ATTCAAGCAT TAATTGATGC GCAAGTGCCA 2220 

ATTATCTTTT ATCCTGGTGT TGTTTTTGAA CAGCAAGAAA CAGTGGAAGC AGATAATCAT 2280 

TTTAATTGTC CAATTGTGCA AAGCTACCCA GATGTTATTC GAAATAATGT GGATGCCATC 2340 

AGAGAGGGCC AAGTGGATTA TCGAAATCCC TATCTTAATT TAGCGAATGA AGCTGCAGTT 2400

GCCAAAGTTT TAGCTGAAAA TTTTGCTGAT TTAGGGATTT CCTTAGAAGA AATTCAAACA 2460

GCGCTGCATC ATGGTTATCA AGAATTAGCA GCTTTTAAAA AAGAAATTCA AGAAAAAGGC 2520 

GAAGAAACGT TAGCCATGTT AACTGAAAAA GGCCAGAGAG GGATTGTCTT ATCGGGGCGA 2580 

CCGTATCACT TGGATCCAGA GATTAATCAC GGCATTGCCG AAGTCATTAC ACAGGAAGGA 2640 

TTCCATGTCT TGACAGAAGA CAGTATCTCG CATCTTGGAG ACGTGCAGAA TTTACGTGTG 2700 

GTGAATCAAT GGGTCTATCA TTCTCGCTTA TATGCGGCGG CTAAAGTAGT GGCTAAGACG 2760 

AAAAATTTAG AACTTGTCCA ACTGAATTCC TTTGGTTGCG GCTTAGATGC TGTGACAACC 2820

GATCAAGTAG AAGAAATCAT GGATCGGTCT GGCAAAATTT ATACAGTCCT CAAAATCGAT 2880 

GAAGGTGCGA ATTTGGGCGC AATTCGTATT CGGTTGCGCT CCTTGAAAGC GGCTGTCAAT 2940 
GAGCGGGAGA


AGCAACAGTT 


TGTTCCGCAT 


ATGCAAATGG 


AAGAGCCAGA 


GAAAATTGTG 


3000 


TTTACAAAAC 


AAATGAAAAA 


AACACATACC 


TTACTGTTAC 


CAATGCTTAG 


CCcGATtCAT 


3060 


CAATCTGGAT 


TAGTCGATGT 


AGCATTACAA 


GCTTCTGGCT 


ATCATGTTGT 


CTGTTTGCCT 


3120 


GCGCAGGATA 


AAGAAGCCGT 


GAATGTCGGG 


TTAAAATATG 


TCAATAATGA 


TGCGTGTTAC 


3180 


CCAGCAATTA 


TTTCTATCGG 


ACAACTAGTC 


GAAGCTTTGG 


AAAGTGGTCA 


ATATGaTTTG 


3240 


GATAATGTCA 


GTGTCTTAAT 


GACGCAAACA 


GGGGGCGGTT 


GTCGCGCAAC 


GAATTACATT 


3300 


CCCTTATTAC 


GAAAAGCCTT 


AAATGATGCT 


GGTTTTTCTC 


AAGTTCCTGT 


AGTTTCAGTT . 


3360 


TCGATGGGCA 


ATAAAGGGGT 


CGAATCCAaT 


CCAGGATTTA 


AATTTACGTT 


GCCAATGATC 


3420 


AAACGCTTAG 


TGGTTGCCTT 


TTtATATGGT 


GaCTTATTCG 


AACGAGTAGT 


TTATCGAACA 


3480 


AGACCTTATG 


aAACAGAAAA 


AGGAATGGTG 


GATCAACTGC 


ACCAAGATTG 


GCTAAAAAGA 


3540 


GTGGAAGCCA 


ATGTCCGGAA 


TGGTTCGCTG 


ACACAATTTA 


ATCACTTCAT 


GAAAAAAATT 


3600 


ATTCGGACGT 


TTGATGAAAT 


TCCTTTACAA 


GAGATAAAAA 


AACCAAAAGT 


CGGTGTGGTC 


3660 


GGGGAAATTT 


TAGTGAAATA 


TTCGCCGACA 


GCGAATAATG 


ATATTGTGCG 


TTTGCTAGAA 


3720 


GAAGAAGGTG 


CTGAAGCAGT 


TGTGCCAGAT 


ATTGTTGGCT 


TTATGAACTA 


TTCTTTGTAC 


3780 


AATCAAATTT 


GGAAATACGA 


AAATATGGGC 


ATGTCAAAAC 


AAAGCAAACG 


TTTAGCAGAA 


3840 


TTTGCGATAA 


AAATCATTGA 


ATTGGTAGAA 


AAACCAATGG 


ATAAGGCATT 


GAGAAAATCG 


3900 

i 


GTCCGTTTTG 


ATGGTATTCA 


TTCCATTTAT 


GATATGGCAG 


CAGACGCAAG 


TAAAATTTTA 


3960 


TCAATTGGCA 


ATCATACGGG 


TGAAGGTTGG 


TTTTTAACAG 


CTGAAATGAT 


TGAGCTCTTA 


4020 


AAACACGAGG 


TGAATAACAT 


CGTTTGTATG 


CAGCCgTTTG 


GATGTTTACC 


AAACCATATT 


4080 


GTTGGAAAAG 


GGGTCGTGAA 


AGAGCTACGA 


CGTCAATATC 


CGCAAGCAAA 


TATTGCGGCG 


4140 


GTGGATTATG 


ATCCAGGGGT 


ATCTTTGGTT 


AATCAGTTGA 


ACCGAATCCG 


TTTGATGATG 


4200 


GCCACAGCGA 


ACAAGTTGTT 


GAAAGAAGAA 


AATGTAAAAA 


GATAATAACT 


AATAGAAATT 


4260 


CGTTTAAAAA 


CATTTGACTT 


GGTCACTTTT 


TACTGACTAA 


GCCAAATGTT 


TTTGTTATAA 


4320 


AAAGAAAGCA 


TGCAACTAGA 


CCAACTTTGA 


TAAAAATTCC 


AATTCCTCTT 


TTATTTATCA 


4380 


TTTTTTGGTT 


GTTTACACGG 


TCTATACCAG 


TTATAATAGA 


AGAGGAGATG 


AACGAAGAGG 


4440 


AGGAAGCGTA 


AATGGTTCAA 


AATATACCAA 


TTTATATTCA 


AATTCACGAT 


AAAATCAAAG 


4500 


AAGATATTGA 


AAAAGGTGTC 


TGGAGTATCG 


GGGATCGTTT 


GCCCTCTGAG 


CGAGAATTAG 


4560 


CCTTGAAATT 


TGATGTTAGT 


CGAATGACCT 


TACGCCAAGC 


CATCCAGACT 


TTAGCAGACG 


4 620 


AAGGGATTTT 


AGAACGAAAA 


ATTGGTTCAG 


GAACGTATGT 


AGCACGTAAG 


AAAGTTCAAG 


4680 


AAACAATGAC 


AGGTACCACT 


AGTTTTACAG 


AAATTACCTT 


GTCTCAAAAT 


CGCGTTCCTT 


4740 


CTAGTCGGAC 


• GGTGTCTTAT 


TTTGTAGCGA 


AACCTAGTTC 


TAGTGAGATG 


GAAAAATTAC 


4800 


AACTAGGTCC 


AGAAGATTCA 


ATTTTACGAA 


TGGAAAGAAT 


TCGTTTTGCG 


GATGACATTC 


4860 


CGATTTGTTT TGAGGTGGCG AGCATTCCTT ATTCATTGGT CTCGCAATAT GGCAAATCTG 4920

AAATCACGAA CTCTTTTTAT AAAACACTTG AAGCCAAATC TGGCCACAAA ATTGGCCATT 4980

CTAACCAAAC AATCTCTGCT GTTCaAGCAT CGGAACAGAT CGCAGAATAT TTAGAGATTA 5040

AACGAGGCGA TGCGATTCtG CGTGTGCGCC AAGTATCCTA TTTTGAAAAC GGACTACCTT 5100

TTGAATATGT TCGTACGCAG TATGCAGGAA GTCGATTTGA GTTTTATTTA GAGAAGTAGA 5160

GAAATGATAA GTATAGAAAA ATGCCAAGAA ATAGGCTGAG GTArGATATT TtAaCCTCCA 5220

AArGTkGTAT CTGAAAAaTA CAATTTTTGG AGGTTATTTT TTATGrACTT ATCAGGTAGA 5280

ATTCAAATTA AAATTAGTGA AAGCATCTCA ATTCAmATAA ATATTGAATA GAGTAGCAAG 5340

TCTGAATACC AAAAAAGTGA GATGGTTTGT CCCGCTATAA ATATTGTTAA AAATAAACTT 5400

ATAATAGCTG CTAGTTTATT TGCTTTTATT CTTTTTTTGG CTGTAGTTCA TTTTTGTCCA 5460

TTTTTTACAT TTCTTTTTTA AACTGTAACC TAATATAAGA TATACCCGTT CAACGCTTTA 5520

ATCAAATAGT TGCAAAATCA GCAGTTGTTT TGCAAATTGT TACATTGAAA TTGTTAGTTA 5580

TTACCAATCA ATAATACTTA TGATTGTCAA AGTTATTATA TAGATATACT GAAAGAGAAA 5640

TTTATTTTCT TTTTCAATAA AAAGTTTAGC GGAAGGAGTG ATGCGGAATG AAAAAAATTA 5700

ACGTACGTAA AGCCGATACA GTTGATGCAA CAGCATGGTG GATTATTATC TAATTTGATA 5760

ACCAAATAAC GGTTAGACGA GATAGAAATA TCTCGTCATT TTTTAATAAG GAGTGAAAAA 5820

AATGAAGTAT AAATATCCCA GAATAAAACC AATCTATCCA TTATATCAAT TAAATAAAGA 5880

GAGATTTAGA GTTGGGGCCC AATTAGGAAT CACTATTGAA TTTGATGATG ATGAAGGTCA 5940

ATTTTGGACA TTATCTAATT TACTCGATGG AGTTAGATCG TTTGATGAAG TTGTCACAGA 6000

AATGAAGAGG AAATATCCGG AATTAACAGT AAAAGATATC GAAGAAGGGA TAGATTTTTT 6060

AAATGATGAA GGTCTAATCG AGGAGACGTT TCCTGGTAGA ATGATTGAAG ATAGATACTT 6120

AGCTAATGTG AATTACTTTA GTAGGTACTG TAAAGCTGAT GATGATAGAT TTGAAATACA 6180

GGAAAAGATA AATAACTTGA AAATACTATT ATTAGGTCTT GGTGGAGGAG GATCAAATAT 6240

TCTAACGGTA CTAGCGGGTC TTGGCCCCAA GATGATtAGA ATGGTGGACT ATGACCGAGT 6300

tGAAGCAAGT AATTtAGGTC GTCAATTACT ATATAGAGAA GCAGATATTG GTGAGAAGAA 6360

AACTGTAGTT GCTAAAAGAG CCATTAATGA AATGAnACTC AAATATTAAC GTTGAAACTG 6420

TAGATAAAAA AATAATAGAT GTGAATGATG TTGTTGAATT GACAGAAGGA ATAGATATCA 6480

TTGTTTGCGC TATTGATGAA CCCCCTTTCC TTATACATAG AATTGtGAaT GAAGCTATTG 6540

TAAAAGTTGG TTTGCCGTGC GTGTTTGGTG CCTCACAGGT TAGCCGAGGA AGAGTATACA 6600

CTGTAATACC CCAGAAAAcA GGATGCTTTG aTtGCATGAA TTTGAACTtC AGTAAAAATG 6660

ACCCTAAATT TGTTGAACAA TTTGTAGGTT TTAGAAATAT TCAATTTGCT CCCCCATCCA 6720

TAGCTTATGG TCCTGGTATC TTTCAATTAA CAGCATCAAT TGTAGATGAG TTAATAAGAG 6780

TGGTAACAAG ATATGCTGAG CCAAAAAGTT TGGGAACGCA ATATGAAATT AACTATGAGG 6840

ACGGAAACTC GTTCACGCAT AAAACTTGGC CAAGGTTTGA GAGTGAATGT CCAACTTGTG 6900

GTAAAGGAGA TGTCTCCCAA TGGGAAATAT TTCAATATTA TCAAGAGAAG AAATAGGCCG 6960

AGACTATACC GAAGCTCGTA CGTTAAAACG TAAGCTAAAC AATGGCGATT GGGTTTATGT 7020 

ATCAGAGGTA CTTAATGAAT TAGTTgAAAA TAAAATTATG ATAAAAAAAG CCGATCAGAG 7080 

AGATGATTAT AAGCCGCTAT TTGTCGAGCC TAAGAATATC TTATTTAAAA GCTCCTCTTT 7140

CCTGATTGAT GTTAACGGTA TTGTTGCGTT TACGATTAGC ATATTTTTAT TCGTAATCAA 7200 

GGGGTCTCCT TTTGGAAGTG CAGTAAATGT TTTGTTT 7237 
(2) INFORMATION FOR SEQ ID NO: 240:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 



GAGGATTTGA TTTGATCnTC TGAGTTCACG TgkCGCTAtC agcTCTGATA cTTAcaCTTT 60

AcGTCGGAcC CAgAAcAGGC TTATTAGAAA TGATTCaAAA gATGTtGgCT GTTCCCgArT 120

GTGTTTGTGT TTtATTTGTA TAATTAAGTT GaCTATTTCC tTGGAAAATw CAGGAGTCAG 180

ATGCTAAATA ATTTTTGCAT CTGACTCCTG TTTTCTTATT TTAAACGATG GATTGGTTCT 240

CCAATTCTTT AACGTATAAA TAAAAAGAGC CAAAGTCTAA GCGATTCATT TGTAAAGATT 300

CATGAATTTT TGTGAACCCT TTACTTTCGA AAAATCGAAT CATTCTATGG TTTGTATGAT 360

TAGTTTGAGC ATACATTTTA CGTCTTCCTT CCCAGACTGC CCGTTTCTCC AATTCATGGA 420

ACAATAAACT GCCATAACCC TTGGCAATAT AATTAGGCGA AGTTGCAAAG CGCTTTAACC 480

AAACAAAGTC TTGTTCCTGC TCCATACAGA ACGTTGCCAT TGAAAAAATC ATTTCTTCGT 540

GCACTAGCAA ATATAATCGT TTTTTTGTAA TGTCTTCTTG GATATCTCCT TGAGAAGGAT 600

AGTCCTTTGT CCACTGTGTA ATGCCCACAA CATCCATATA TGCCTTTACT TCTGTTAAAA 660

ATTGAAGAAT CATCGGTAAC TCTTTCTCAT CAGCCACTCT AATTTTCATT GTTTATTCTC 720

TTaATAAGGT AACAATAGAA TCGACAAGAC AATCACACTA ATAATGCCAA TGATGGTCGG 780

TACACTCGTC CGTTTTAAAA TTCTGATGGG ACTAACTCCT ACTGTTGATG CGACAATCAT 840

GACAACTGCT GCTACAGGAG AAATCGTTCT AGTTAAATTC GCAATCATCT GCATTGGCAA 900

GGAAATTAAG ATGCCATCAA TCCCAGCTTT TTCAGCAATC CCAGGGATTA GTTCGATAAC 960

CGCATAAAAC ATTGCCAGAC CACCACCGCT TAAAATTCCA AAAAGGGTCG TTGCACCGCT 1020

GAAAATTAAG GTCGTGACAA TTCCTGCTGA ACTTGATGCT TCGACAGATG CCATAATACT 1080

ATCAATAATT CCTAACGTTT GAAtCGCAGA CGTGAATAAT GAACCGCCTA CGACAAGCAT 1140

CACAACTTGA CTAAACCCTT GCCCCATTCC TTTAAACATC TCGACAGCTG TATCTTGAAC 1200

CTTTTTAAAA GATTTCAAGC GAACGyCTCA ACCAATACAG CGATGAAGAA AGAAATAAAT 1260

GTAAGAACAA AAATATCCAT CGTTATTCCT TTAATAAACA TTCCTGCGAT TCCCACTCCA 1320

ACAATTAATA ATAGCGGTAA TATTGGCAAT ATTGCATAAA ATTTAGGAAC ATCAAATTCT 1380

TTTTGTTTGC TTAAGCCCTC TTCATTTAAT GAGACATAGG CAGCCTCACC TTCTTTTTTA 1440

TCGCAGTATT TTTGCCAGAA GTACTGTGCA ACCGCAATGA TTAATAACGA TGGCAGCGAA 1500

ATTTTGGCAT TCCATACAAC ATAATTTAAC ACATCATAAC CAAGGGTATT TGCGGCGATT 1560

ACGTTATCCG CGCCTAAAGG TGTTGGCATG ATAGTTGCCG TCATTGCAAT CACACCTGCA 1620

GCAGTTAATG ATGAAATCCC CATGCTGGCT AGCATCGGAT ATAATATAGA CATTAAAATA 1680

ATCGCTAAAC TTGATGCACT AGGAACTACT AGTGACATTA AATTTCCAAT TAAAAAGACC 1740

ACAGGGACAA ATAACGATTT ACGTTTAATC TTCATTAATG GTTTCACCAA AAAGTTGACA 1800

GCCATCTGAT TGGCCCCGAT TGCATTCATA TAGCCTGAAT AACCAAAAAG AATCATAATT 1860

acaattcctg CTGAACTCAC TTGTGCAATA ATTGTATCTT TAAACTTCAA GAAAATATCA 1920

AATAATCCAA AACCGGTTCC TTCTCCTGCA GGATAAATGG GATGCCCTAA CATCCATGCA 1980

CATGCAAGT 1989

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

GCTTCATTAC TAAGAAATAA GGTATGTAAA CAAAAACAAC TAAGATTCGG TATTTTTCCA 60 

GAAAGAGTTA TTATTTAAAT GTTTTTAAAT AAAAAATATG GAGATAAATG GCAGATCGTT 120

TGTTGCTTTT TTTAATyTCG AATTATCAAT TAAGTACAAC ACTTTATCCT GTAAAAACTT 180

CATACGATGT TTTyCATTTT AAGCACTTTT GTGGTCTAGT CTATGCATAA AGGATTGAAT 240

ATAATTATCT GGAATTAAGA TTATATCTTT CTCTTTTGGT AGGTACTCTC TTAATAATCT 300

ATTTGTATTT TCATTTAGTC AAAAACTGGG TAAAGGAGTC TACAAAGATA AACCTAAAAA 360

ATAATTTTGC ACATATGCTT GATGaCCTTC CTGGAGTAGC TGATAATGCT AAAAAAATAT 420

TGTTTGATAT GAAATGkACT ACTGkAATAG ACGATTGTAA TGTtCATTTA CGAGATGTAC 480

TTAATTTATT AGAAGATGAA ATGCACAGCT TTAGaTTATC TTCTATTGaA GCtAATGATT 540

TAGaTAAACT AtATGATACT TTTTATTACT ATGGnAAAAA AGamCAATTA ATAGAATTTC 600

TGCTTGACCA GATGACTAAT GCGATAAATT TTAAACAAGC TGATAAGGAT TTTCTTATTT 660

ATTCATGGCT TGTAGAAAAA CTTACTGAAG AACAAATATG CGAAATGCTC GCGAGAATTA 720

ATGAAAATAC CCAATACCAT TGGAATAACA ACCTTAATTC ATTTGTTGAT GAGTTCGTTG 780

CTTATTATAA AGAAACAAAT GGAATTGATC TAAGAATAAA TTATTTGGGA GCAATTTATA 840

CTAATTTAAA AATTCTCAGC GATGAATATA CTCTCTCTAT CTCAGATTAC GAGCTTATTT 900

TCAAATTTTT TGAGGATATA ATAGAGAGTC AXAGCGAAAC AATATTTGAT TTTCAGCGTA 960

AATATAAAAA GGATATTGAA ACAAATTTTG GTAATAGTTT GGAAAACTTT CCTAAACTGA 1020

ATAAGTTaTT GTTCGAATAG ATTTTGAAAA GCTTACAATA AATAAAAGTA TTAATTTTTG 1080 

TTACATCAAT TAATCTCTAA TTCATGAATT TCTTTTTTGA TTCTATATAT AGTTTGTCTC 1140 

GTAATATTGT AATCTTTCGC AATCTTTGAG ATAGCAACCC CATTCTTAAG ATCCTGAACG 1200 

ATGCTTTTAT AAACAAGGCG ACGTTGGGGA TCTTTTGTGT CTGCACTGTA TAACTTAGGA 1260 

CGACCTTTAT AGACGCCGTT TGCTTTCGCA ATCTTTATCC CTTGTGCTTG GCGTCTTTTA 1320 

GATTCCGTAC GTTCTTGCTC GGCAATCATG GCCAAAATCT GAATGATTAG GTCTTTGATA 1380 

AATTTATCTA GGAGTGGATT TCCGATTGCT TCAGCCATGA TTGGTAAACT GGTAATAATT 1440

AGCCGCACAT TTTTCTTCTT TAGATAATTT ACTGAATCGA TAATCTCGTC ATAATTACGT 1500 

CCAAGTCGAT CGATGGCCTC TACAATAAAA ATATCTTGGG CTCTAACAAA ATCTAGAGCT 1560 

TCCTGAAAAA TCGGTCGATG AGTAATAGTC GCTCCGGATT GTTTCTCGAT AAAAATTTTT 1620 

TCAGCACCAA ACTTGTTCAT TtCCTCAATT tGACGTtGTT CATTTTGATC GATTGATGAA 1680 

ACTCGAATAT ATGCAATTTT CCAAATtTTT ACCTACTTTC TATAATTTTT AAAAtACACC 1740 

CTAATCATAC ATTCAAAAAC CATATTTTTC AAkGTtCTGT TGGTGTCTAT TTAGGGGTCC 1800 

TTAATTAGAC AATC 1814 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242:

TATGGTTCAA GGTCCAGTGT TTaATGAAAG ATATGTTTGG TCAGACTAAT CGAATTGTTG 60 

TTCATGTTAT TTTTGAAGAA TGGTTGGAAC TGTTCAAAAC CGTTGATGAT ACAGAAGCTT 120 

CTAAAATGGA GAACCGCTTT GTTGCATAGG AAGGAAGTGA GAATATGCAG TATCTAATTG 180 

ACAGTTATAG TGCGTATACG GTTGATGGTG GTTTGTTGGA TAAAGGATGG CAAGTAGTGA 240

ATTGGTTCTT TGTAGATATA CCATTCTTTA TCTTACGAAT GTTTGTTTCT TTTTTTCTTT 300

TTTGTGAGAA CGTCTTAAAC CAATCTAGTT TCTTTGAAGG AAAACAAGAA ACAGTATTGA 360

ATATGTCAAA AAATGTTCTA AGTGGCATTG GTGGCAAGGG ATTTAGAGGT GGTTCTTTAC 420

TAGCATTAGC GATTCTTATT AGTGCATATT ATCTGCTTTA TCATTTCTTT TCCAGTCGTA 480 

GAAGCTTTTC AAAAGTATTC CTTCATTATT TAGCAGTTGT TATTTTGTTT GGTTTTTGGT 540 

TTGGCTCTGT ATCAACGTCA ACAGGCACAA CAAGTGGCTC AATGTTCTTG ATTCAATCAA 600 

CTAACGCAAT TGCTAAAGGA GTACAAAGTA CGTTTACTAC AAGTGCCAAT TTAGGAACAG 660 

AAGGTTCTAA AGATGGAGAA GTATACCAAA GTCCTATTTT TGATGCCACA GTAAAACAAA 720

CATTTAATTT TGTAAATAGT GGTTCACTTG ATGGGAAAAT GGAAAATGGT AAAAAGTTAG 780

ATGAAGGTAA ACTACTAGAA AAGCCTAAAT TGTCTAAAAA AGAAAAAGCA AAGTTTGAGG 840

ATGAAAGAAG TACCTATATA AAAAATAATG AGAAAGATAA TCCATACTTC TCACAGGACG 900

GAGCAAAAAC AATGGAAAAA GCTTTTGGCG TTGGTGTGGG TTTAGTCAAT TTAGCAGTCT 960

TAGCAGTGCC TGTCACGTAT ATTAATATTA TGCTAAATGT AATTCAAATC ATTGTTGATT 1020

TACTCATTCT TGTTTTTCCG ATTATTGCGT TGGTGTCTTT CTTCCCTCGG TGTCAAATGA 1080

TGATGTTTAA ATTTTTTAAA TCATTAATTG GCATTTTATT TTTACCTGTA GTTTTCGGTA 1140

TATTTTTGTC TGTGTTATTT TGGATCAATA AGTTAATTGA CCAAGCGTTT TTAGGATTGA 1200

TGGATAAAGT AAGTAGTAGT TTATTAATGG TGATGTCTGG TGGTGTTTTT CTTTTAGGCA 1260

CATTAATTGT TACTGCATTG GTGAAAATTA TTTTGTATCG GAAAATTTGG AAAAGCAGAT 1320

ATAAAATATT ATCATTCTTT TCTGATGGCC AAGTTCAACA ACCTTCTTTT GAATCAAAAA 1380

TGAATGAAAA AACAAAAGAA ACGGTTGAGC GTACAGGTGA AATTGGTGTT GGAGCAGTAA 1440

AAGCAGGAAT AGGTGCTkCC ACTGGAAATT TAGCGTTGGC TGCTGATGGT GCTAGTCATT 1500

TAATGCCAAA GCATGACAAG GCATTGAACT TAGCTAAAGA GCATTTTATT GACGATAATG 1560

GGCAATTTGC TGGTGTTAAA ACAGGATTGA ATTCATTGTT GAATCGTTCT ACTCAAGAAG 1620

AACAACAACC ATTAAAAGAT GATTTATTGC CAGAAGAAGA AATTGTGGAA GTAGAAGAGC 1680

CAACCAATGT TGATAATGAG GAACTAGAAG AAATAGAAGA TAAAGCTTTA ACTGATGAAG 1740

TGGATAATTT TGAAGATTTA ACAGATGATG AAATGGATAA TTCTTTAGAA GAAATTGATG 1800

TTCCTGTATT AGATGAAACT GTATCTGATT CTGATACTTC TTTAGCTGAT GAAAGTAATA 1860

TTGAGTTAGA GCCAATTATT GAAAATGAAC TTGATAGTTT AGATTCAAGT GTAGAATCAA 1920

GTGAAAATGA TAGAGAAGAG CCAATTCATG AAGTTGATGA GCAAGAAATA GCGGACAATT 1980

TGAATGTTAC GGTCGATAAT TTTGATGAGT TGGCATTTGC TCGTGAAGAA CGTGCATTTT 2040

TTGATGGTGG AGAGGATAGC GAGTTAATCA CAAACAATAA TGATTCATTA AATAACGTGT 2100

ATTCATTTTA CCAAACAGAA GAAAACTTTA ATAATTTAGA ACAAACAGAG GAACAATTTT 2160

TTGGTAGTGT GAATACGGAA GAATATAATT TTGATGTTGT AGAGGGCTGA TGAAATGTTT 2220

ATCAGTCTTC TTTTTATTGA AAGGAGAGAT CATGAAGAAA TTAGGCAAGG TTTTAATTGT 2280

TAGTTGTTTT ATTTTTATTC TTCCTTTTTT ATTATTTTTA GGTGTATTTT CTTCTAGTGA 2340

AAGCGGAGAT TCTTCCCAGT TTCAGCCCGL I ACALC i\L Ata (a AAAAALj I Ab 2400

TTCTAACTAC GTGACGTCAC ATGGCGGAAC GTTGCAGTTT GCTTCCGCTT GGATTGGCAA 2460

TATGGAACAT GAAAGTGGAT TAAATCCTGC TAGAATTCAA AGTGATTTAT CGTTTAATTC 2520

AGCGATAGCT TTTAATCCTT CGTTAGGCGG TTATGGAATT GGGTTAGGAC AATGGGATTC 2580

AGGACGAAGA GTTAATTTAT TAAATTTTGC AAAAAGTCAA AAAAAGGAAT GGAAATCAGT 2640


AGCTTTACAA ATGGATTTTG CGTGGAATAA GGATGGTTCT GATAGTGACT TACTTAAAAG 2700

AATGTCTAAA TCAAAAGATG TGAATACACT TGCGGTAGAT AtTTTGAAGC TGTGGGAACG 2760

AGCTGGAACA AAAGATGATC CCGCAGaACA AGTAAAAAGA AAGGCTAGTG CTAATAATTG 2820

GTATAAACGA CTTTCTACAG GTTCCATGGG CGGAGGTTCA GCCAATGTTG GTGGAGGAAA 2880

AATTGATGCC TTGGAAAAAG TGATGGGGCA AACTATTAAT GGTGGTCAAT GTTATGGCTT 2940

ATCTGCTTTT TTTGTTGAAA AACAAGGAGG TCTACAAATG ATGGGTACGG GGCATATGTT 3000

TGCGAGTGAA ATTGGTAATG ATTATCCTTG GAGTTCAATT GGTTGGACAG TCATAAAGAA 3060

TCCAAATTAT TcAGATATTA AAGCAGGAGA TGTCATTAAT TTTGGTCAAG GTGGTGTGGC 3120

TACTAGTATT TATGGGCATA CTGGTGTAGT GGCAAGTGTT GAAGGTAAAA ACAAGTTTAC 3180

TACTTATGAG CAAAACGCTG AACAAGGTCA AATTGTTGCT AAGTATTTTC GGACTTGGGG 3240

ATTAGATTTT CCACATGTGA CCAGCATAGT AAGGAAATAG GAGGAATTAT GAATGACTTT 3300

GAGTAAGAAA AGTGGTTTAA CTAATCGTCA GATAAAAGAA ATCTGTTCGA AATTTAACTG 3360

TAGAAAAAAT GAATTTGCAG TTCAAAAGTA CCAAGATAGT TTTTTATTAG CTGTGAATAA 3420

TAAAGAGTAT CGTGTTAAAT TTTCTGAGGG GATTTTTTCA AAGATTATGT ATATTAAAGA 3480

AGTGCAACGC ATTTCAAAGA AAGTAGGTAG AAAATGAAAA AGTTAAGTTT AGTAGTTTTA 3540

GTATTAGGTG TTGGATTAAT ATTTTCTGCT TGTGGAGGAA ACACCAAAGG TCATGAAACG 3600

GCGGCTTCGT CAACAACTGA ATTGGTGACT ATGAAACCAG TTGATAATAT AAATACTACG 3660

AAAAGAAGTG AAGTAACTGA AAAaAGTAGT ACTAAACCTA TGCAGATTAA AGATAAACTA 3720

GCAGAAGAAA AAGTATCTGA TACTTACCAG TTGGTGGAGG ACTTTGGaAA TACTTATGCm 3780

AATTTTTCTA GTATCAATGA TCGTAACGrA AAATTAAAGr AATTAATGAC TGCTGAATGT 3840

GkAAAGAAAA ATGGTATTGA TGTGAAAACT GGCGCAAAAT TGGAAAGTAA AGGTGAAGTA 3900

AGCGCAATAT ATCAAAATGA CAAAAAGGAA TACGCAGTCT TACTTGATTG TGAACAAAAC 3960

GGAACAAAAA CAAGAGTACT TTTACTGGCA AAAGTGCAAG GAAATAAGGT AGCGGAAATG 4020

ACCTATAATT CAGTGAAACA AGAGTATTAA CCTTGATATA TATGGATTTG TACTTTAGTT 4080

AATGGTATAA TGTATGTATA AGTGAGGAGA TGAGAAAATG AAAAAGGTAG TTTCAATTTT 4140

GTTGATGGTT GTTGCAGTCT TCACATTAAC TGCATGTAAT GGTTCTAAAT TAGATAAAAC 4200

AGGTGAAGAA TTTAAAAATT CTATAATGAA AGATTCTTCA TATGGTGATG AATATTCAGA 4260

AGATGGTTTT AGTTTTTTAA TATATAAAGA TAAAGACACT AATCGTTATT TGGCTGATGT 4320

TTGGGTTCCT GTTAAAGATG AAACTAGCGC ATTGGAGTAT TTTTATTATT ATGATGAAGA 4380

TAAGCGATTA GATAGTACTA AAAGTAAAGT AACCTTTGAT GATATGAAAG CTAGTGGAAA 4440

CTATGAAGTA GTGTATAAAT CaGGGAAATT TAAATAACAT AGCCTGTCAC GTAATTTGTT 4500

ATTTTTAAGT GGCTAAAATT TGGCTAAAAC ACTTTGAAAT GTATGGAAAT GCTGGATTAT 4560

TTGTAAAAAT AATGTTGATA TTAAAGCATT TGTGTGGAAA ATACATTTTA TAATTTACTC 4620


GTAACGCGTA GGTCACAGGT TCGATCCCTG CAGCTGGCaT CTAAAAAAAC AGACAGGcAC 4680

TACCATCTGG TAGTGCCTGT CTGTTTTTTT AgmTAAAATT TTGGaTTAGT GCAAATGaAG 4740

CAATGCCTAC CAGTACAATA AGCAA7AAGT TTTTAGTAAG tATTGCGGCT AAAATGGTAG 4800

GAAATGAAGC GAATAAATaA GGAAGATTGA GGCTTGGTAA ATGCCCAATT CTTTGTTCAA 4860

ATAAGCTACT AAACCATAAG GCGGACATAA TCACAATTGG AACAAAACTT AGRTATTCTA 4920

ATAATTTCTG TGGCaGACTG ATTTTTTTAA AAtAAAAATG GCAGTATTCT TGAAAGCCAG 4980

GTTGCAACGG TAcAGCCTAA AATTGTTAAA AAGAGATACT CaGAAGAAGG CATGTTTTAT 5040

CATCACTCCT ATTCCACACC CGATTAaCGT GmCAaCTArG ACAATTaAAT TACTGGGAAT 5100

AAAAATTAGA CCAATAGACA TCAAaCCAAA AGTAATCAGG ACCATGATGA GTTGTAAAGC 5160

GATTTTCATG GAACGaTCAC TaATTATCTG CAGATAGAGT AAACCAATGA ACATAGCCAC 5220

GACCGCAAAC TCAAGACCTA ATGCTTGAGG GTTTGCAATA AAGCGCCCTA GATAGGCACC 5280

GATGATCGTC GAAAAAACCC AAGTCGCATA GGAAATCAAA TTGGCAGCAT TAAACCACTC 5340

AAAACTTAAA CGTCCTTTGG TGTGGTTTAA TTTATTCATT CCTAGTGCAA AACTTTCATC 5400

CGTCAGTAAG GTTCCCAACC AGAGATTTTT ACCGAGAGAT TCGGCTTTAA AGTAAGGCGC 5460

AATGGTCATT CCCATAAGAA TCATACGCGC ATTTACTAAG AAAGTCGCTA ACACAATAGA 5520

TAAGATAGGG CTGCCGCCAG TCAACATACT AACTGTTACA AACTGAGCAG ATCCTGCAAA 5580

AATAAAAAAC GACATCGCAC TAACAATAAG AGGGGAAAAA CCTGCTGCTT GTCCTACGAT 5640

ACCAAAAGCA ATGCCAATAC CAATA7AGCC AAAAACCGTT GGCAAGGCTT CACGAATGCC 5700

AGTGCGTGCA TCTAATTGGT GGTTCATTCT CTTTTTTTCC TTCTTTCTGT ATAAAATTTA 5760

AACACGTGTG TATGAAAATC TAGACCTTCG ATAATGAGAC ACAACGAGCG AATGAGCGTT 5820

TTCATTATTA ACCACTCCTT TTGAACAACT AAGATATATA TTGTATAATA CAATAAAGAA 5880

TACTTTTGTT CAATATATTA AACTTTTGGA TGATAAAAAG AAACAGAAGG AGCAACTATT 5940

TTGGAGATTA ATACAATTAT TGGCAATAAT TTAAAaAAAA TTAGACAAGA AAAAAAACTT 6000

ACGTTAGATG AACTAGCTGG GATAACCGGA GTGAGTAAAG GGATGCTTTC GCAGATTGAA 6060

AAAGGAACCA CCAATCCTAC CATAAATACA ATTTGGAAAA TAAGTAATGG ACTCAATGTT 6120

CCATATACGT ATTTATTAGA AGATGACTAT GTTAAAGAAA ATCCCGTGGT TAAAAAAACT 6180

GAGACGGTAA CACAACATTC TGAAGATCAC TCCTATCGAG TCTTTAATTA CTACTCGGAC 6240

AACCAGGAAA GAAACTTTGA ACTCTTTCAG TTTGAATTAG ATCCAGGGAA AAAGTATCAA 6300

GCTGTTGGGC ATTCAAAACG TTCGACAGAA TATGTCATGG TTATTCAAGG AGCATTAGCT 6360

CTTGAAGTGG GCGGTCAGAG ATACCTTCTT CCAAAAGACG CCGCTATGTA TTTTGATGCT 6420

AATCAAGTAC ACACGTATAT AAATGACGGA ACAGAGAAAG TCCAAGCAGT CGTCATTAAC 6480

TATTATACGT AACAGTGGAA AATAAACATT AGAAAATTGC TGCATCTGTT ATTTTATGGA 6540

CAAAAAAAGT AAACAATGCT ACCCTAAAAA TAATAGAAAA TGAAAGATAA AAGAAGAGGT 6600

CAAAATGAAT AAAAAAATTG AAAGTCTTGG TAATACAGCA TTAAGTTTAC TAAGTAATAC 6660

GATTATTGTT GTGCCTTATT TACTTTACAT TCAATTGAGT GATGGACACA ACCTTCTAAC 6720


AATTTTACCA TTTGTCTTAT TTTATACATT GCGTATGACA GGCATTTTTC TGGTTCGAGG 6780

AATTAATTTG TCTTTAACTA GTCTAGGTCT ACTAAAGATT TCTCTATTGT TAGGCTTGGC 6840

AGGCTCCTTA ATTGGGATTG CCGGCGCTTA TTCTTTCCCG CTTTATAGTA TTTCAGGAAT 6900

GTTGTTAGGA CTTAGTGGAG CGTTGTTGCC ACCAAGTAAT CAATCAATTC GCTTTTTCTT 6960

AAAAGAAGAA GGTCAAAAAA TCAgTCmTAG CAATCyTTTA ACgACAGcTT AwTGaCCgCT 7020

GtTTTAwTTG GCGCTTTATT TTTTAAAGCG ACTGAGATTG GACTGGCGTT ATTGGTGTAC 7080

GCGCTCTATT TTCTGGTTGC TTTAGTAGCA ATCAAGAGCT ATCCTGTTTC ATTAAACGAG 7140

CTAGAAGAAG AATCAGCAGA AGTTTCACGA AAAGAATTAA TTCTGTTTGT TATCTTTTTT 7200

GTGTTATTAC TTTTATTAAG AAGTGGTCGC TTATTAACGA ATGCGGTGGA ATTTGATTAT 7260

GCGATTGTTG GCGCCTGCTG TTTCTTTGTG GTGGCTGTCC TTTTCGTTAG TCATTATGGC 7320

AAACGCGGTG CGTATAAAGT ATCATTAGAA ATGAAcTTGC GACCTTGATT AACGGGATTG 7380

TGGGGAACTA TTTATTnTAT TGGTTCCATT TATGTTGCAG GGtTaCGGCA GAGmtCATAT 7440

GGGCAtCTAT CTCwATTtAC CTtATGTTTt AGGGAtGAwT TTtGCAAtGA TTGCAGCTAG 7500

TAAAcGAAcA ATGGTCAAAC AaCATACCAa TGATTGGTCT GGCGGGAAGT TTGGGAATAT 7560

TACTtCTTAC TTCGTGGACT ATTTTAGGCC TGTTTCTCTT GaGTTTTTTC AAAAGCACAT 7620

TGAATAGCTT TTTAGCTAGG CGATATTATC AAGAGACAGA TTTGCCAAAA GATAGCCGTA 7680

TCCTTATCAA ATACACAACA CAAACAAAAG GGAGTTTATT CCATCAATTT ATTTTGATGG 7740

CGCTTATGTT AGTCACTGTA GTAGGCAAAG GTGGCACGAC ACAATTTTTA TTAGAATTGA 7800

CTGGCGGCAA AGGCATTTCA ATGCAAACAA GTCAATTTTT AACTTTCATT AAAAATAGTA 7860

ACTCGCTCTT AGTGTTATTG TTTATTGCAG TTTATTTCGT GGTCGTTTTT CGTCAAAAGA 7920

AACGATAATT TACAAATATT AATAAAAAAA TACTTGGAAA AACAGTGATT GTGGCTGATT 7980

TTCCAAGTAT TTTTTTATGA AAAGATGTTT TTTTTTTTCG AAGATTCAAA AAAAATTAAT 8040

CAATTGTTTC ATTTTTAACA AGCTATTATT AGGTTATATG AATTTTATTG AAAGGGGATT 8100

AGTATGAAGA AAAAAACTTT TTCTTTTGTG ATGTTGAGTA TACTTCTCGC ACAAAATTTC 8160

GGGTTTGCCG TAAATGCCTA TGCTGTAACA ACGACAGAAG CACAAACAGA GACCACTGAT 8220

ACAGCAAAAA AAGAGGCAGA GTTATCGAAC TCAACACCAT CTTTACCTTT AGCAACAACG 8280

ACTACTTCAG AAATGAATCA ACCAACTGCA ACAACTGAAT CGCAAACCAC AGAGGCGAGC 8340

ACAACAGCTT CCAGTGATGC TGCTACACCA TCTGAACAAC AAACAACGGA GGACAAGGAC 8400

ACCTCACTTA ATGAAAAAGC CCTGCCAGAT GTTCAAGCGC CAATTACAGA TGAACTACTT 8460

GACAGTATGA GTCTTGCGCC GATTGGTGGA ACAGAATACA GCCAAACAGA GGTTCACCGC 8520

GAATTAAATA CAACACCGGT AACCGCTACG TTCCAATTTG CTGTTGGAAA CACAGGTTAT 8580


GCACCTGGAT CAGTTTATAC AGTTCAATTA CCAGAACATT TAGGTTATTC AACTGTCAGC 8640

GGAGAAGTGA CAGGCATTGG CGCAACTTGG GCAGTCGATG CGGCGACCAA AACATTAAGT 8700

ATTACGTTTA ATCAACGAGT TTCAGATACT TCCTTTAAAG TAGAACTAAA AAGTTATCTA 8760

ACAACAGAGG CGGAACCATT AATCAAAATT GAAACTCCAG GAAAAAATAA AAAAACCTAC 8820

TCGTTTGATT TATATGAACA AGTGGAACCA ATTCAATATA ACGAACGAAC CAGAACGACG 8880

GGGTTAGATG GCGAAATTTT TTATAATTTA GACCGGACGT TAACTGGCAA TCAAACATTA 8940

GAATTATTAA CAACAGAGAC GCCAGGCGCT GTCTTTGGAA AACAAGATAA CTTGGAACCT 9000

CAAGTTTTCA GTTACGATGT CGACATTAAT GGTCAAATTT TACCAGAAAC GCAAACCTTG 9060

TTAACACCTG GCAAAGATTA TACATTAAGC GATAATTCAC TCGGGCGGAT TGCTGTAACT 9120

GTTCCAAACA TGAATCAACA AAAAGCCTAT TCCTTATCGA TTAATCGGAC AATTTATTTA 9180

GAGAGTGCTT CGGACTATAA CTACTTATAT TCGCAGCAGT ATCCAACAAC AAAAATTGGG 9240

TCAATTTCTT TGAAAAGTAC GACAGGAACT AAACAAACAA CCGATTTTAC TGCTAAGACG 9300

AGTCAAACAA GTAAAGTAAT TGCTGATCGT GAAATGCGTA GTATGTCCTA TATCAGTTTT 9360

CAAAGCAAAG GGAAATATTA TGTAACAATT TATGGCACGT TAACAGAAAC AAAAGTGGGT 9420

CAACAAATCG TATTAGAGAG TACAAACGGT CAAGAAATTA AGAATCCTAA ATTTACGGCG 9480

TATGGTCCTT TATATGAAAA TGTAAAATTG GAAGACTATT TTGATATTAA AACTGAAGGT 9540

GGCAAGCTCA CTTTAACGGC CACAAAAGAT AGCTATTTAA GAATAAATAT TTCTGATTTA 9600

ACAATGGATT TTGACAAGAA GGACATTAAT CTATCATTAA GTACACCTGT AATTGGTCCT 9660

AATAAAGCCA TTCAATTAGT ATCCGATCAA TATATTGAAC CAATTAGTGT TGTTAATCCT 9720

TTGAATGCTG AAACTGCTTG GGGTAATTAT GATCAAAATG GTGCCTATTC ATCAAGAACA 9780

ACTGTCTCAG TTATGGGAAG CAAAGAGAAA CCGATTCAAA ATTTAGAAAT TAAAGTAAAG 9840

CATCCTAATT ATCTTTCATT ACGAGCTACA AAAGAAATTT ATTTTTATTA CAAGTTAGGA 9900

ACGGATTATA CAGTAACGCC AACGTCAGAT GGTTCAGTTA TTAAGTTCAC TACGCCAATA 9960

ACCAACGAAA TCCAAATTCC AATTGGTTTT AATTATGTGC CAGATAGTTT GCCAAAAGAT 10020

AAAAGTATCC CAGTCGATAC GATACCGATA ACAATGAGTG CTGAAGGTTT AACTCCAGTT 10080

GATACGACAG TAACTACTAA TAGTAAGCGT GGTTCTGAAC GAACACTTCA AAGTAGTAAA 10140

AATCAATTCC TTGTCAATGC ACGAAATGAT TCTTTTGACT CACTAAGCGT CCGTACAAAA 10200

ATTcCAGcTG GCGCCGATGT TCTTTTTGAC ATTTATGATG TTTCAAACGA TCAGGTAGAT 10260

GGACCGCGGT CAATACTTTG ATAAACCAAT GACGCCAAAC 10320

AGCCCTGGAT ATCCaACGAT TACTTTTGAC GAAAATACCA ATAGTTACAC GTTTGATTTT 10380

GGAAAAACCA ACAAACGTTA CATTATTGAG TATAAAAACG CCAATGGCTG GATCGACGTG 10440

CCAACTCTTT ATATAACAGG GACAGCGAAA GAACCACAAT CGAATAATAA TGAAGGCTCT 10500

GCTTCGGTTT CTGTTCAAAA TGAAGCGTTA GACATTTTGA GTGCAACACA AGCGGCGAAT 10560


CCAACATTAA AAAATGTAAC AAAAACGACA GTAACAACAA AAAATATTGA TAATAAAACA 10620

CATCGTGTGA AAAATCCAAC GATTGAATTA ACACCAAAAG GCACAACCAA TGCTCAAATC 10680

GATTTGAATT CTATTACCGT GAAAGGCGTG CCAGAAGATG CTTATTCATT AGAGAAGACT 10740

ACAAACGGTG CGAAAGTCAT TTTTAAAGAC TATACATTGA CAGAAAACAT TACGATTGAA 10800

TACAATACGG TCTCTGCAAA CGCTGGCCAA ATCTATACAG AAACAACAAT CGACTCTGAA 10860

ACATTGAACC AGATGTCTGC TAGCAAGAAA AAAGTCACCA CTGCGCCAAT CACATTGAAA 10920

TTCTCAGAAG GTGATGCGGA AGGTATTGTT TATTTAGCAA CTGCCACATT CTACACGCAT 10980

AACGTAGAGG ATGAAAACCA AGCAATTGCG aAGGTTTCTT TTGAACTAAT TGATAATGTC 11040

ACGCATACAG CAACCGAATT TACAACAGAT GAAAAAGGTC AATACTCCTT TGATGCCATC 11100

ATGACAGGTG ATTATACTTT GCGAGTAACG AATGTACCGC AGgAATATTC CGTGGATGAA 11160

GAGTATTTGA CAGGAAAAGC CATTAAGCTG GTCAAAGGAG ACAACCAACT AAAAATTCCA 11220

TTAACGAAAA CAATTGATcA CAGTCGTTTA CAAGTCAAAG ATTCAACGAT TTATGTCGGC 11280

GATTCATGGA AACCAGAAGA GAACTTTGTT TCAGCAACAG ATAAAACAGG TCAAGACGTT 11340

CCcTTCGAAA AAATCACTGT TTCAGGTCAA GTTGATAaCa sCAAAGCAGG CGTTTATCCA 11400

ATTATTTACA GTGACGAAGG TAAaGAAGAA ACAGCcTAtG TGACCGTCAA ACCCGACCAA 11460

TCTAAGTTAG AGGTCAAAGA TACAACGATT TATGTTGGTG ATTCGTGGAA ACCAGAAGAT 11520

AATTTCGTTT CAGCGACAGA CAAAACAGGT CAAGACGTyC CGTTTGAAAA AATTGATGTT 11580

CAGGGAACAG TGAATGTTGA TAAAATAGGC GATTATGAAA TTGTCTATAA AAATGGCAmA 11640

AAAGAAGCGA AAGCAATCGT TCATGTCCGT GATGACAGTC AGTTAGAGGT TAAAGATACA 11700

ACGATTTATG TTGGTGATTC GTGGAAACCA GAAGATAATT TCGTTTCAGC AACAGACAAA 11760

ACAGGCCAAG ACGTTCCGTT TGAAAAAATC ACTGTTTCAG GTCAAGTTGA TACTAGCAAA 11820

GCAGGCGTTT ATCCAATCGT TTACAGTTAC GAAGGTAAAG AAGAAACAGC TAATGTGACT 11880

GTCAAACCCG ACCAATCTAA GTTAGAGGTT AAAGATACAA CGATTTATGT GGGCGATAAA 11940

TGGGAACCAG AAGATAATTT CGTTTCAGCA ACAGACAAAA CAGGTCAAGA TGTCCCGTTT 12000

GAAAAAATTG ACGTTCAGGG AACAGTGAAT GTTGATAAAA TAGGCGATTA TGAAATTGTC 12060

TATAAAAATG GCACAAAAGA AGCGAAAGCA ATCGTTCATG TCCGTGATGA CAGTCAGTTA 12120

r* 7ir i /* , Tr i t\ t\ t\ bACab 1 L AAAb A i ACAACAA 1 TTATGTGGGT GATAAATGGG AAGCAGAAGA 12180

TCCGCGACAG ACAAAACAGG TCAAGACGTT CCGTTTGAAA AAATTGATGT TCAGGGAACA 12240

GTGAATGTTG ATAAAATAGG CGATTATGAA ATTGTCTATA AAAATGGCAC AAAAGAAGCG 12300

AAAGCAATCG TTCATGTCCG TGATGATAGT CGTTTACAAG TCAAGGATAC AACGATTTAT 12360

GTCGGCGATT CrTGGArACC AGAAGrGAAC TTTGTTTCAG CrACAGATAA AACAGGTCAA 12420

GATGTCCCAT TCGAAAAAAT CACTG 12445



(2) INFORMATION FOR SEQ ID NO: 243:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2546 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

GAACACAAGA AAATGTTTTT TGAATAAACT CTTCTTGTTT TGAGGTTGGA TAAAGTCTGT 60 

ATTTATAAGC TTTCAGAGCC TTCATCTCAT AACCACCCTA CCAAGATAGT TAGAAAAGAA 120 

GCTGGTTATA AACCAGCTTT TCTTTTCTTC TCTTCAATTT TTTCACGGTA CATTGCAACT 180 

TCGTCTTCAG CACAATAAAC ATAGTGTCCA TTGAGAATTT CACGGAGCTC ACGATCTTTC 240 

CCATCTGCCG CTGCAGGCTG GTAAACGATC CGTTTACGTG TGCGTTCATA GTCTGGATCT 300 

GGTAATGGAA TCGCAGAAAG CAAACTTTCT GTATAAGGAT GAACGCCGAA ATTATAGACA 360 

TCATCACTTG TGCCCATTTC CAATAATTTC CCACTGTGCA TTACGCCAAT TCGGTCACTG 420 

ATATGTTTTA CCATCGATAA GTCATGTGCG ATAAACAAGT AAGTTAAGCC TTGTTCTTTT 480 

TGTAAATCTT GTAATAAGTT TACTACTTGG GCTTGAATCG ACACATCCAA AGCCGAAATT 540

GGCTCATCAC AGATAATGAA ACGTGGATTA ACCGCTAAAG CACGAGCAAT CCCGATCCGT 600 

TGACGTTGCC CACCAGAAAA CTCATGAGGA TAACGCGTAC TGTGACTTGG ATTTAATCCC 660 

ACCGTTTTTA AAAGTTCATT GACTTTTTCT TCCCGCTCTT TTGGTGTTTT AGCTAATCCA 720 

TTGACATCGA TTCCTTCAGC AATAATATCA TTGACTTTCA TTCTTGGATT CAATGAAGCA 780 

TATGGATCTT GGAAGATCAT TTGAACATCG CGACGGAATG CTTGCATGTC CGCTTTTGAT 840

TTAATTTTGC TAATGTCTTT TCCATCGAAA AGAATTTCAC CAGCAGTTGG ATCATATAAA 900 

CGAATGACAC TGCGACCAGT TGTTGATTTT CCACTTCCTG ATTCGCCAAC TAATCCAAAG 960 

GTTTCGCCTT CATAAATATG AAAACTAATG TCATCAATGG CTTTTACTTC GTCTTTACGA 1020 

CCAACATTGA AATACTGTTT CAAACCAGTG ACTTCTAATA GGACTTTTTT CTCTTCTGCC 1080 

ATTAATTACT ACCTCCTGGA TTTTTTTGAC GCTCTGCATA AATTGCCCAA CGACGAACAA 1140 

TCTCTGCTGG AGGCGTAACT TTTGGTGCTT GTGGTGCTAA TAACCACGTT GCTGCTTTGT 1200 

GGGTTGGTGA TACTTCAAAA TAAGGTGGTT CTTTTTCTGT ATCAATTTTT AAAGCAAACT 1260 

CATTCCGTGG ATAGAAGGCA TCTCCAGTTG GTGGATCTAA TAAATCTGGA GGTGAACCTG 1320 

GAATCGCATA AAGTTTGTCT TCTGTTCCTT CCATCGTTGG CATTGACCCT AAAAGTCCCC 1380 

AAGTATATGG ATGTTGTGGA TTATAGAAAA TTTCTTCTGC TGTTCCCACT TCAACCAGAC 1440

GTCCACCGTA CATGACAGCG ACACGGTCAG CCACGTTTGC TACTACCCCT AAATCATGGG 1500 

TAATGAAAAT AATCGAAGTG CTAATTTTTT CTTGTAAATC TTTTAATAAC TCTAAAATTT 1560 

GTGCTTGGAT GGTTACATCC AACGCTGTAG TTGGTTCATC TGCAATTAGA ACTTCGGGAT 1620 

AACAAATTAG AGCAATTGCA ATGACAATCC GTTGACGTTG CCCACCAGAA AATTGGTGTG 1680 

GATAGTTTTT CAAACGTTTT TCAGCATTAG GAATGCCGAC TAATTTTAGT AATTCTAATG 1740

ctTGCGCTAA GCCTTCTTTT TTTGAGACTT TATTATGTTT GATTAATGAT TCCGCCACTT 1800

GTTTGCCAAT TGGCATCGTT GGGTCTAATG AAGTCATOGG ATCTTGGAAA ATCATTGCGA 1860

TTTCTTTACC ACGAATGGCT TGCATTTGTT TTTCTGTTTT ATGAACAATA TCTTGTCCTT 1920 

TAAACAAAAT TTCCCCGTTA TCAATATTGG CATTACTGCT CAACAAACGC ATAATACTGC 1980 

GGGTTGTAAC AGATTTACCA CTCCCTGATT CACCTACGAT GGCTAACGTT TCACCTTTAT 2040 

AAAGATCAAA ACTCACACCA CGAATGGCGT TGACTTTACC TGCAAATGTA TCAAAGGAGA 2100 

TTCTTAAGTC TTTAACTTCT AATACTTTTT CCATAGTCAT TCACCACTCT TACTCTTTCA 2160 

TCTTAGGATC AAATGCGTCA CGTAAACCAT CAGCTAACAA GTTGAAACAA ATCATGATAA 2220 

CTGAAAGTGT TGCCGCTGGA ATCCACATTA AATATGGAAG GAAACGGAAA GTTTTGTATC 2280 

CTTCGTTTAA CAAGGTTCCT AAAGATGCGG TTGGTGGACG TAAGCCTAAC CCAATAAAAC 2340

TTAGGAACGC TTCAAAGAAA ATTGCCGATG GGATACTGAA CATCATTTGG ACAATAATGA 2400 

CACTTGAGAT ATTCGGTAAA ATATGTTTAA TAGCmATTTT CCAACGAGAT TCGCCCAnTG 2460

TTTCTGCTGC CATGACAAAC TCTTGGTCTT TTAGTTTCAt GGTCkGGGcT CGCACAATCc 2520 

TGGCCATkGA tATCCAGTTT GTAATA 2546
(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

AACTTACCAA GAATCGCTTT AGAAGCGCAA GGAAGTCAAG AAAAGTTCTG GCAAATTTTA 60 

AATGAACGTT TAGCGATCGT TAAAGATGCT TTGGTTTATC GGGTTGAACG TGTTAAAGAA 120 

GCTAAACCAG CGAATGCACC GATTTTATAC ATGTATGGTG CATTTGGAAA ACGTTTAGCT 180 

ACACAGGATG CCGTAGATGA ACTGTTTAAA AACAAACGTG CCACGGTTTC ACTTGGCTAT 240

ATTGGCTTGT ATGAAGTGGC TTCTGCCTTT TACGGCGGTG CTTGGGAGGA CAATCAAGAG 300

GCCAAAAACT TTACACTGGA TATTTTAAAA GAACTGAAAA AAAATGCCGA CAACTGGGGC 360 

AACGAATACG GCTATCATTT TAGTGTCTAC TCTACACCGA GCGAAAGCTT GACTGACCGT 420

TTTTGCCGTT TAGATACTGA AAAATTTGGC ATCGTCGAAA ATATTACAGA TAAAGAATAT 480 

TACACAAATA GTTTCCATTA TGATGTACGC AAAAATCCAA CGCCTTTTGA AAAACTTGAT 540 

TTTGAAAAGG ATTATCCAAA ATACTGTTCT GGTGGTTTTA TCCACTACTG TGAATACCCC 600 

ATGTTACAGC AAAATCCCAA AGCACTTGAA GCTGTCTGGG ATTACGCTTA TGACaAAGTT 660 

GGCTATTTAG GAACCAATAC GCCAATTGAT CATTGTTATG CCTGTGGTTT TGAAGGGGAC 720 

TTCCATCCAA CTGAACGTGG CTTTGAATGT CCAGAATGTG GCAATCATGA TCCTAAGACT 780 

TGTGATGTCG TTAAACGGAC TTGTGGTTAT TTAGGTAACC CGCAAGCTCG CCCAATGGTA 840

CATGGTCGTC ACAAAGAAAT TTCCTCTCGT GTGAAACATT TAAAAAATTA AACTCGATTA 900

TTCTATAGAA AGAAGTTTTG CTGATGCGTA ATCCGAAACC ACAAGAATGG AAAACCGAAG 960 

ACTACAGCCA AAAAAAGATT GCCGATTACA AAGCGTTCAA CTTTGTTGAT GGCGAAGGTG 1020 

TACGCAATAG TTTATATGTG AGTGGCTGTC TCTTTGCTTG TGAAGGTTGT TTTAACAAAG 1080 

CTGTCCAAAA TTTCAATTAC GGTACGCCGT TTACAGAATC TTTAATGAAT CAAATTATTG 1140 

AAGATTTGTC CCATGACTAT GTCCAAGGTT TGACTCTTTT GGGTGGCGAA CCTTTTCTGA 1200 

ATACAGATGT TTGTCTTAGT GTGGTCAAAC GTGTGCGGGA AACATTTGGC TCAGCCAAAG 1260 

ATATCTGGTC TTGGTCTGGG TATACCTTTG AAGAACTACT TTTAGAAACG CCTGATAAGC 1320 

TTGAACTACT CCATTCAATT GACATTTTAG TTGATGGGCG CTTTGAACTG GCAAAACGTA 1380 

ATTTAAATTT ACAATTTCGT GGCAGTAGTA ATCAAGGGAT TATTGATGTA CCCAAATCAC 1440

TTGCTGCTGG CAAAGCTGTT ATTTGGGAAA AATGTCATGA CGCCGAAACG TCTTATGAAC 1500 

AAATCAAAAA ATCAATTTAA AAAAGTTAGC CCTTTAGCCA TCTCTTGAGG ATGTGTCTAA 1560 

AGGGCTAACT TTTTTCCCAT AAAGGTAAAA CAATGTTTTC ATACGCTAGT GGATCTAAAT 1620 

TCGTTAAAGT AATGCCTAAT AAACGGATTC CTTGTTCTAA ACCTAAAATA TCTTCCCAAA 1680 

TTAAATTCGC TTGATAGAAA AGTTCTTCTT TTTTAGAAAT ATATTCAGGT AATGTCACAC 1740

GTTTCGTAAT CGTCGTATAG TCAGTATAAC GAACTTTCAG GACTACTGTT TTGCCATGCT 1800 

TTTGCACTCG TTGCAATGCA CGCTCAACCC CTGCTGCTAA TTGTCTCAAT TGAGCCGTAA 1860 

CCTGCTCTTC CGTCGTTAAA GGGGTGCCGT AAGTATGTTC CTTGCCTACC GACTTACGAT 1920 

CTCTAGTCAC TTGAACTGGC GCATCATGGA TACCGCGGAC TTTACGATAC AATGAATAAC 1980 

CCATTTTACC GAATAACCGA ATCAACGTCA TCTCATCTTG CTTGTATAAA TCTGCTCCAG 2040

TAAAAATCCC TAGTTCATGC ATTTTAGGCA CTGTTTTTTT CCCGATTCCG TGGAAATCTT 2100 

CAATCGGTAA TGCTTTCAAA AAATCTTCGG CCTCTTCTGG TAAGACAACC GTCAGGCCCT 2160 

TTGGTTTCTG AAAATCTGAT GCTAACTTGG CTAAAAACTT ATTGTAACTA ACACCTGCCG 2220 

AGCAAGTTAA CTGCAATTCA TGCCAAATAT CATATTGGAT CATTTTCGCA ATTTTAATAG 2280 

CCGACTTACT GTTGACCTTA TTTTCAGTTA CATCTAAATA CGCTTCGTCA ATAGATACCG 2340

GTTCAATCAC ATCTGTATAG CGGCGAAAAA TTTCACGGAT TTCTTGAGAA ATTTCACTAT 2400

ATTTCTGATA ATTACCTGGC TTAAAAATGG CACTTGGACA TAGTTCATAC GCCTTTTGAG 2460

CACTCATAGC CGAGTGAATA CCGTACTGTC GAGCAATATA ATTCGCTGTG GTCACCACTC 2520 

CTTTGCCCCC AGTATCACTT GGATGGCGCG CAATAACTAA CGGATGTCCT ACTAACTCTG 2580 

GATGATCTCG TTCTTCAACT GAAGCAAAAA ACGCGTCCAT ATCTATATGT ATAATTTTTC 2640 

TCGACGTATC TTTTTTTAAG GGAAAACGTA ATTCGCTCAT CATTTTGATT CGCCTCCTCC 2700 

CTCCATTATA TACGAACGTC CGTTCTTTTT CAAATTGAAA CTTCTTTATT TCAACCAGAA 2760

AAAGCTGAAA ATCCGTGAAG TTATAAACAC AGATTTCCAG CTTTTTAATT AAACGACCAA 2820 
fry rp rp rp rp rnrppT

J. lbl i 1 1 Ibl 


TPBTTfi T B PT 
1 b A 1 1 A 1 Ab 1 


a P P PTT T A T 21 
Abbbl I 1A1A 


p.p.p zi zi n a t a a 

bbb AAbA 1 AA 


p.TTP.pa aTra 


ptptv aPpaaT 




Ab 1 1 b 1 1 bb 1 


PBTBPPBaPB 


T7A a TV BP.TPAP 
1 AAAAb 1 b Ab 


PB TA ATPTPA 

b A 1 AA 1 b 1 b A 


TaTTTTATPP 
1 Ai 1 1 1 Al bb 


paTppapapc 

bA 1 bbAb Abb 




nmpTi a CCCCC* 
A 1 LAALbLLt 


PPA n B B BTTB 
bbAAAAAl 1A 


TIP PPT P. Zip TAT 
Abbb 1 bAbA 1 


pTV TpPp A PPT 

bAl bbbAbb 1 


aaapTaapaa 

AAAb 1 AAb AA 


TTPPpapaPT 

1 lbbbAbAbl 


JUUU 


TTTGGCAGAA 


TLAAlbblbb 


rTTT'CTi. TPPP 

bbbbbAl bbb 


PPTTTTPAPA 

bb 1111 bAbA 


P T T T PT TP P A 

b 1 1 1 b 1 1 bbA 


TP A'TTPPT A A 

ibAl ibb 1 AA 


jUoU 


AGAGGCGAGT 


TTPAT7VPTTP 

1 1 GA 1 Ab lib 


p 7\ ppT A A TPP 

b Abb 1 AA 1 bb 


fp TV /"^ rp rp rp ^> rp /■* 

1 Ab 1 1 1 b 1 bb 


aTTappppTT 
A 1 1 Abbbb 1 1 


PPPPTTPPTT 

bbbb 1 1 bb 1 1 




TTGAAAAAbA 


bl Al b 1 AAA1 


TPPPTvTAppA 

1 bbbAl AbbA 


T7A a a pp a att 

lAAAbbAAl 1 


PPPaPPATPP. 
bbbAbbAl bb 


a aTTP.PTP.PP 
AA i 1 bb 1 bbb 


on 
OioU 


w t» mp a rp PPP A 

AATCATCCCA 


PTP7\TJ\PPn TV 

b I b A i Abb AA 


TPATTTPAPA 

1 bA 111 bAbA 


APPAATAAAT 
AbbAA 1 AAA 1 


TTAATAPAAP 
1 1AA1 AbAAb 


papaaaaaaT 

bAbAAAAAAl 


^9 AO 


P A A A A T A PP A 

CAAAATALbA 


RTPPfR HTTP 

Ai bb 1 AAI lb 


PTPTSPTBPB 

b 1 b 1 Ab 1 Ab A 


appa aTTPpa 

AbbAA 1 1 bbA 


aTPaaap.aaa 

A l bAAAb AAA 


Tpaaaa appp 




rn rp rp rr* O rp A T\ T 1 

CTTTTGTAA1 


ibbibbb Ibb 


PTTT T T TTPP 

bllllili bb 


aTTa a bpppt 

Al lAAAbbbl 


pppTTPPaaa 
bbbl 1 bbAAA 


paaTPappaa 

bAAl bAbbrvrt 


J J uu 


7V TV PP A TP7V TP 

AACCATGATb 


A PPPTT A TV TV A 

Abbbl iAAAA 


TAAPPTTA TT — 

1 AAbb 1 1 A 1 1 


PA PTTP A AAA 

bAb 1 1 bAAAA 


Ab A 1 A 1 1 1 1 A 


btb a aTappp 

A 1 AAA 1 Abbb 


O 4 4f. U 


GACTGCGAC1 


A A TTP A A TP A 

AAI 1 bAAlbA 


PTPPTPP A A T 

b 1 bb 1 bbAAl 


PAPA PT A ATA 

b Ab Ab 1 AA 1 A 


7\ 7\ 7\ rp 7\ f** rp rp 

Al AAI Aibi i 


TPPTa a a app 

1 bb 1 AAAAbb 


Id AO 


TAATTTTTCT 


TTAPTAPTA A 

TTAGTAC f AA 


rp a ATTA A TPP 

\L AAI 1 AATbL 


P A r , f* t T\f > /** A PT 

bAb b Abb AC! 


APPATPPPTP 

AbbAi bbb 1 b 


AAAAAAATAA 

AAAAAAA1 AA 


tc: >i ri 

J J T U 


GGATAAATTA 


TTAACTGCTA 


A ATPPATPTP 

AATCCATCTG 


rp/-*i/-'^ppppftp 

TCGGCGCCAC 


CTTTCCTGCT 


APA ATAPPAA 

AbAAT AbbAA 


jDUU 


TCACTTGCTG 


TGCTTGCTGA 


ATTTCTTCGG 


TA TPA TPPPT 

TATCATGCGT 


PAPAPPA APP 

G AC AC G AAbb 


APPPTPAPTP 

Abbbl bAb I b 


J DDU 


CTTGCTCTTT 


GTTTAATTGG 


TTTAACAATT 


PATTPAPA AT 

GATTbACAAT 


CTGTTTGCTT 


TPTTPATPTA 

1 b 1 1 b A 1 b 1 A 


j / iU 


ATCCCGTTGT 


7\ APTTPPTPT 

AACTTCGTCT 


AATAATAAAA 

AAI AAI AAAA 


PATPTPPTAP 

bAl b lbbTAb 


AAAAATAATP 

AAAAA1 AAi b 


TTTPPTAPPa 

111 bb 1 AbbA 


T7ft n 

J 'OU 


TV A PP A 7\PPPr 

AAbbAAbbbb 


TTPTPPPTPA 

TTbibbblLA 


PP A PP A P A P A 

b b Abb Ab Ab A 


PTTPPPPTAT 

bl IbbbblAi 


rp rn rp rp rp rp rp t\ 

1 1 1 b 1 1 1 IbA 


apaTa appa p. 
AbA 1 AAbbM.b 


JOIw 


pppppaattt 
BPTBBTPPPB
PPBPTTTTTP
pTpaTTaaap


PPTTPTTPTP 


3900 


TPaPPPTABA 
1 uRLub 1 AAA 


TPPB B TV TA AP 
A TV A TT TV TP AT

AAA1 lAlbAl 


APAPPPTTTP 
Oil OVJV^ A v3V* -I


3960 
TP.PPPPTATP.
1 bbbbb 1 Al b 


TTTPPATTPP 


PaTTP.TAPTfl 




4020 


PTTPaTABaa 
Li Jl b A I AAAA 


a a t t t pp p pt 

AAI 1 1 bbbb 1 


T P T P. T A (Z P A P 


tta ataatpa 


apaTaaaaTT 


TTTaiXAAATP. 


4080 


T A PTTTTPPP 
1 AL 1111 bbb 


bb 1 Abb 1 b AA 


pp a pp a tap a. a 

bbAbbAAbAA 


TTPTTPTaTT 
1 Ibl IblAl 1 


PPTP.PPTPTT 
bb 1 loll 


TpaaaTP.APA 


4140 


7\ ATC7\ ATA TP 

AA 1 LAA lAlt 


Ab AAAbAA I b 


PT A pp TV pp AT 
b 1 AbbAbbAi 


TTapTTPaTa 

1 1 Ab 1 1 b A 1 A 


apapap^ppT 

AbAbAbbbb 1 


TPTPPTPPP A 


4200 


tat* a a nTTnrT 
A i AAA 1 1 Ab 1 


b A Ibl 1 Iblb 


TPPT TTTTT A 
1 bb 1 1 1 1 1 1 A 


■ aaTP.ca aTca 

AA 1 bbAA 1 b A 


TATTPTTT QT 
1 Al 1 b i 1 1 AI 


ATAPTTPTTP 


4260 


1111111 ALA 


rp rprp/^ 1T\ ^+ T\ 7\ rp rp 

1 1 1 b 1 b AA 1 1 


T A p p p tv p T T p 
lAbbbAbl lb 


TTTPaTTPPP 
111 bAl 1 bbb 


TPTTTT p a ap 
ibl 1 i 1 b AAb 


TPPPTPPPTP 


4320 


rp rp f** rp 7\ rp rp 

1 1 1 1 


AAA A PTTPPE 

AAAAb 1 1 bbA 


P ATPTTPPTT 
bA 1 b 1 1 bb 1 1 


p a can. a a a t a 

bAbbbAAA 1 A 


PP T PPP a TT P. 
bb 1 bbbAl i b 


PTPPTATTP.T 


4380 


L1L1 1 1 blbA 


ppbptta nnpp 
bbAb 1 AAAbb 


PPP T B 7A PT B 7A 
bbb I AAb 1 AA 


Taaapaapap 

1 AAAbAAb Ab 


TPPPPTTTT A 
1 bbbbi 1 1 i A 


PPPTATTTTP 


4440


TGCTAAATAT TCTGTTAGTT CTCCTAAGGT ACCACGCAAA TACTCTTCAT GAAGTTTCGT 4500

TAATTCCCGA CAAATCACAG CGGGGCGTTC TTGACCATAG ACTTCAGCAA ACGTTGCTAC 4560

CGTTTTAGCA ATCCGATAGG GGGACTCATA AAAAATCTGT GTCGGCCGTT CTTCTTTTAA 4620

TGCTGATAAC ACGTCTTTCT GTTCTTTTTT CTTTCTTGGT AAAAAACCAT AAAACGTAAA 4680

TGGCTGTGGT AAAAGACCCG AGGCAATCAA AGCGGTCATT CCTGCCGTAG gTCCTGGCAA 4740

GGCAATCACT GCCAGTTCTT CTTCAAGACA TGCTGTAACC AATTCATGGC CAGGATCACT 4800



WO 98/50555 

AATTGAAGGC ATACCTGCAT CGCTAACTTG GGCAATCGTT TCACCATTCA GCAATCGTGT 4860

AATCAGTTGA GGAATTCGCT CTTTGTAATT ATGTTCGTGT AAACTAATTT GTGGTGTTGT 4920

AATCTCAAAA TGATTTAGCA ATTTTTGTGT GTTTCTTGTG TCCTCACTAG CAATCACCGT 4980

TGCTTCTTTC AATATGTTTA AACAACGAAT ACTCATATCT TCTAAATTGC CGATCGGTGT 5040

TGGGACAAGA TACAATTTTC CTTTTGTTGT TTGCTTATCA AAACTTTTCT GTTTTTGCAT 5100

AGCGCATCCC TCATTTTCCT TTTTCGTTCT ATAAATAGAT AAAGGTTTGA TTTGACAATC 5160

GATTGACTGT CAAACCAAAC CTTTCCTTTT TATTTGCAGT GTCCACTAGA CTTAACGCGT 5220

ACGTTCCCCA TAAATAACAT CAAGACAAAA GGCACATTCT TCATCATTTT CACGTCTTGA 5280

ACCATATAAA ATATTGCAGA CATGGAAGCC CTCTTCATAA AGTTTTTCCA AATTCATACG 5340

TGATTTTGAT AGCTCTTGTT TTTCAGTTTC AGTTGTATTT CCTGCTAACT TATTCAGTTC 5400

TTGGAGATGC TCTCGTAAGC GTTGGTTTTC GATTTCCAAC GTGGTATTTT TTTCTACCAA 5460

CTCATGGAGT GCTGCTTTAA TTTCTCTTAA TTGTGTCACT GAACTGTCTA AATCAGTTTC 5520

CAACGAATTT AATCCATCAT AGAGCGAACG CTTATCCATT TTGATCACCT TTTTCAGCAG 5580

CAACTTTCGC AGTAGCTTCT TTGATTTCTT CGTAATCATA ATCCGCTGCG ATTTCACGAC 5640

CATGCAGACG AACTTTGACA ATCCGACTTA ATAGATTCAA CCCGACGACT TTGCCTTTGC 5700

CATCTGGTGT TATTACTTCT TTTCCATAAT CTGGTAATTC TTTTTTAGCG GCTTCATATT 5760

CGTCGTTTTC GTATTTCAAA CAACACATTA AGCGGCCACA TAATCCCGAA ATTTTGACAG 5820

GATTTAAAGA TAACCCTTGA TCTTTAGCCA TTTTGATTGA AACTGGCATA AAATCTCCTA 5880

GAAATGTTGA GCAACATAGT TGTCTGCCAC AAGGGCCAAT GCCACCTAAT ATTTTCGCTT 5940

CATCTCGGAC ACCAATTTGa CGTAACTCAA TTCGCGTCCG GAAAATAGCC GCTAAGTCTT 6000

TGACTAATTc ACGAAAATCA ATTCGCCCAT CTGCCGTAAA GTAAAAAATC ATTTTGCTAC 6060

GATCGAAGGT ATATTCTACT CGCACTAATT TCATTTTTAA GTCATGAGCT CGAATTTTTT 6120

CATTGGCAAT GCTTTTGGCA GCTTCTGCAT CAGCCAAATT TTTTTGTTCT TTTTCTAAAT 6180

CATTGGCTGT TGCTTTATTT AAAATGGGTT TTAGGTCCTC TGGTAAATCG TCTGAATCGA 6240

CTGTTTTTTT AGGAATAGCA ACAGTAGCTA ATTGTTTTGA CTGTTGAGAT TCAACGAGTA 6300

CTTTCTCATT ATAAATATAC TCAGATTTTC CAGGAGCAAA ATAATAGATA TGACCGGCTT 6360

CACGGAAGCG AACTCCTACT ACTTCTACCA TTTTATTCCT CCTAATCTAG TTCAAGTGAA 6420

CGTCGCTTCA ATCGGTTAAG GAACTTGCTG AAACAACTAA GTTCCTACTA TATTATGAAA 6480

CTGAATGCCA CTTGGCACTT TTTTCCTTTA TGATTTAGGG TGAATCATTT GGATAACTAA 6540

TTGTTCACAA ACATTTTGCC AACTAACATT GGCAGTCCAT TTTTGGCGTG CTTTCAAAAT 6600

TAGCGCCAAC CGTTCCGCCT GCTCTTCCGT TACTTTTTTG GCTTGCTGTG TCGCAACACT 6660

TTCTTCCAAT AATTGACGGT AATAAACCAT GAGCAAGTCA AAGCTAAGCG CTTGTTGTTC 6720

TTTTTCCTTA AATACTTTGA CCATTTTCTT CTGAACGTAG ATAAATGCCT GTAAATCATT 6780






ACTTTTTAGA TAATTAAACC ATTGCAAAAT GATTTCCCTA GCTTCATTAA ACCATTCATC 6840

TTGAGAGATT TCaACTGCTT YcTCAAAACT ATTTGTCAGT TCAGCTAAAA GGGTTGCAGT 6900

CTTTTCACCA ATCCCCTGTT TGATTAAGCG ATCAATTAAT GTTTTTTTGA CTAATGGTTG 6960

AAAATGTAAG GTTTGGCATC GTGATTGAAT CGTTGGTAAA ATTCGAGAAA GCGAAGTGGT 7020 

TTCTAAAATA GCTAAAATTT GTCCTTCTGG TTCTTCTAAA AATTTTAAGA GACTATTaGc 7080

tGCGCCGGTA CTCATTTTAT CTGCTTCTTG AATTAAGAAA ACTTTTTTAG CAGTCTCGAC 7140

CCCACTTTTA GAAAACTCCG CTTTTAATTC ACGGATTTGG TTCACTTTGA TGGTTTGCCC 7200 

ATCTGGCGCA ATTCTTAAAA CATCTGGATG TTCATTTTCA TTAATCCGCA CACAATTATG 7260

GCATTCGTTA CAAGGCTGTT GATTTACTAA ATTCGTACAA AAGACATGTT TCGCCATCCA 7320

TAAGCCAAAT TCTTGTTTTC CAGTTCCTGT ATCTCCTTCA AAAAGATAAG CATGGGCAAG 7380 

ACGACCATGC TGAAAACTTT TTTGGAGTTG CTTGTACAGC AAAGGTTGCA TTTGCTGTAG 7440

CTGTTGTGCT TCATTCATCT TAATATTGAT GGAATCCTTC AACTGGTAAG ACGAAGCAAG 7500

TAGCGCCGCC TACTTCAACT TCCACAGGAT AAGGAATTTG GCCATCCATT GTGATATCTA 7560 

AAGTCACAGG TGTTGAAACA TATTGTTTTC TTGATTGACA TGTTTCTTTA ATTAAAGCTA 7620

ATGTTTCGTC GACACGTTCA TCATCAATCC CAATAATAAA TGTGCTGTTT CCCGCTTTTA 7680 

AGAACCCACC TGTTGAGGAT AATTTTGTAG CACGAATATT GGCATCAATA AATTCGTTGG 7740

CTAATCGGTT ACTATCTTtG TCtTGTACAA TGGCTAAAAT AATCTtCATG GTCTACACCT 7800 

TCCTATAATT AAAAGTTTTC TGGATAACGT TCAATAATCG CCTGATACGT TGCTTCTACG 7860

ACAAGTTCTA AACTCATCCG TGCATCAATC TTTTTGATAC GTTGTGGATT TTCTTCTGcT 7920 

AATTTCAAAT ATTCATGACG AACmCGTTGA TGAAATTCTA ATCCTTCTGA ATCTAAGCGA 7980 

TCAATTTsCT GTGTTCGATT TTCTTGAATC CGCCGTAAGC CAGTATCTGA GTCAACGTCT 8040

AAATAAATTG TAAAATCTGG TGAGACACCT TCTGTCGCAA ACGCATTAAT TGAAGCAATG 8100 
Gn 8102

(2) INFORMATION FOR SEQ ID NO: 245:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1670 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245:

AATATTTATT TTATAGGAGG AGAAAGTTAT GGGGCATCAA TTGAGCACTC ATTTTTTTAA 60

CAAGAAAACA GGGATTATTT CAACTTTTTT AAGTTTGATA CTTTTAGTCG GTGGGTTACT 120

GTTTGCACTA CCTGCATTTG AAGTACAAGC AGCATCAACT GTTACACCTA AAACAGTCAT 180

GTATGTAGAA GTAAATAACC ACGATTTTAA CAATGTGGGa AAATATACTT TAGCTGGAAC 240

CAATCAACCT GCTTTTGATA TGGGGATTAT TTTTGCAGCC AATATTAATT ATGATACGGT 300

CAATAAAAAA CCTTACtTAT ATCtAAATGA ACGGGTCCAA CAAAcTTTAA ACGAAGCTGA 360

AACACAAATT CGTCCaGTTC AAGCACGCGG CACTAAaGTC CTATTGTCTA TTTTAGGCAA 420

CCATGAAGGC GCTGGCTTTG CTAACTTCCC AACTTACGAA AGTGCGGATG CCTTTGCAGC 480

TCAATTAGAA CAAGTCGTTA ATACGTATCA TTTAGATGGT ATTGATTTTG ATGATGAGTA 540

TGCTGAATAC GGAAAAAATG GGACGCCTCA ACCAAATAAT TCTTCGTTCA TTTGGTTACT 600

TCAAGCCTTA AGAAATCGCT TAGGTAATGA TAAATTAATT ACTTTTTATA ATATTGGACC 660

AGCGGCTGCG AACTCTTCAG CTAATCCTCA AATGArTTCT TTAATTGATT ATGCTTGGAA 720

CCCTTATTAT AGTACCTGGA ATCCGCCACA AATTGCGGGG ATGCCTGCTT CTCGTTTAAG 780

TGCTTCTGCC GTTGAAATTG GTGTCAATCA AAATCTAgCT GCGCAATATG CTAAACGTwC 840

AAAAGCGGAC AATATGGTAT TTATCTAATG TATAACCTCC CAGGAAAAGA TTCATCTGCG 900

TATATCTCGG CGGCGACACA AGAACTTTAT GGTCGAAAAA CGAACTATTC ACCAACTGTT 960

CCAACACCTT AATATTATCT CGTGTCTACA AGCGCCAACA TGTAAAGAGT TAAAAGCCAG 1020

CTTACTACAA GCTGGCTTTT TGGTATCTTG AGCGAATAGT TTTAGACAAA AAAAGACCCG 1080

CCCTTTGGGT AAGGACGGGA AAGGAGTTAA AAATGAAAAA GTGTTTTGTG TTGTTAAGGT 1140

TGTTTGTTGA TATGCTTATA TCTTACAAAC TAAATGTGAA GATTTTGTGA TCATTTCATT 1200

CTTATTTTGC TACTTTTATA GGAATTAAAT AGATTGCCAC GAGACTATCT TTAGTTGCCT 1260

CGTGCATCCG TTATTCTGAC TGCTCCTTGT GGTGATACTC TGAAAAAACC AACAGTCCCA 1320

GTCCCTCCAT TTGCAGCCAT CTGTTTCGAT TGGGCTCTAA AAAAGAAGTC ATTTCCATTC 1380

ATACCCAAAA ATCCAAGGTT CGTATCATTA TTTTCTGCAT CAGTCAGTGT CTTTCTAATA 1440

TATGTCTCTG CTTTTTCTTG TGTATCGATG ATTTTTGCCT CTTCATTTTG GTCTTGATTT 1500

TCCTCTTCAG AAGAACGATA GTAATACTCT TCCGCaGGAT AATTTCCACT ATCTTGAGTA 1560

ATTAACAATC GCGGTTTCGT CTTATCCGAT TGGTCGCCGT TAGGATTTTC AACACCAATT 1620

TTATATAATA ATAGCGCAGC ACCAGTAGGC CCCCACCTAA TACTTAAGCT 1670



(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246:

TACCGAgGTG TATCAgCGAT TCtaGTAAtC CACAaGTTCC ACTACAGAAT CAGAAtCTTA 60 

AAAGGTCTGG CGGAAGAAAA AACGCTAAAA GAAAAAATCC CGTCCTTAAA TCAAACAACG 120

ATTAAGAAAG TCACTGATAA TGAGTATAAA CGCATTCTTG TGaTGGAAGA AAAGGCTACA 180 

GGAACGACCT ATAAAAGTAT TTTGATAAAA AAACAAAATC GCTTAAAAAT TGTCAACGAA 240 
AbAALbbA 1 b


AAI 1 bTTAi A 


bAAI bbAAbb 


b 1 A I AAbAAb 


bAAAbbA 1 TC 


AAT AAAb I LA 


*3 A A 


AAAb I AI Abb 


CATTGTTTTC 


AAAbAAAAAl 


1 1 CI 1 1 bbAA 


AbAAl bbTTT 


TTTTATTbAb 


Ten 
J t)U 


AAAAGAATTT CTAGGATATA TAGTTAGTTA CATAAGTAGT TAACCGTTTA AGCGAGGTGT 420

AAAAATGGAA TTTAATTTTT CGGGGGAAAA GCCTTTATTT CAGCAAGTAG CCGATCAGAT 480

TGCCGAAGaA TTTTTAATGG CGCTTACTTA GAAGGCGAAC AAATTCCCTC AACCACAGAA 540

ATATCAAAAA GTTATCAAAT CAATCCAGCC ACCGTTTTAA AAGGAATGAA TTTATTAGTA 600

GAACGACAGT TGATTGAAAA GAAACGGGGG ATTGGCATGT TTGTTTTACC AGGCGCTCAA 660

GAGAGAGTAA GAAGTGCACG GAAAGAAGAA TTTTTAAATA AAGAAGTCTT AGAAGTTGTC 720

GCAGAAGCTA AAAAATTAGG GATTACAGCT GAACAATTAA AACAACTAAT TGAAAGAGGG 780

TATGACGCAT GAGTTTGCGG GTGGAATCAG TAAGTAAAAA ATATCATCAA AAACAAGCAC 840

TAGATAATAT TTCaATTACT TTTGAAAAAG AAACGATTTA TGGGCTCTTA GGCAGAAACG 900

GGGCTGGTAA AAGTACGTTA TTAAATATCA TAAATAATCG AAGTTTTGCC ACTTCAGGTT 960

CCGTTAAATT GGCGGGAGAA ACAGTGACCG ATAATGAGGC TGCTTTGACT CATATTTACT 1020

TAATGAGTGA AGATAATTTG TTTCCTCCTC AGTTAAAAAT TAAAGATATT TTTAAAACGA 1080

CTGAAGGTTT TTATGGATCC TTTGACTGGT CTTTAGCCGA ACAAATGTTA AGCGATTTTG 1140

ATTTAGATGG TAAAAAGACA TTCAAAAAAT TATCAACTGG TTATCGGAGT ATTGCTAAAC 1200

TAATAGTGGC GCTATCCGTC CCCTGTGAGT ATATTTTTTT AGATGAGCCA GTCTTAGGTT 1260

TAGACGCCAA TCATCGGGAA TTATTTTATA CTTATTTAAT TGAGACATAC CAAGAACGTC 1320

CCCGTACGTT TGTGgATTTC TACCCATTTG gATtGGAAGA AATtGCGAAT TtATTAGAAG 1380

ATATTATCay CATTGAkCmA gGGAAAATTG ATCCGCGCAG AATCTATTGA AACCAATTTG 1440

AAAAACCGGT CGC 1453



(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1820 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

TAGGAGTGAC AACAAATGAC AATTGATTGG ACAAAAGAAG TCGAAGCACG TAAAGACGAC 60 

TTATTAGAAG ATTTACAAAA CTTATTACGG ATCAACAGTG AACGTGATGA TGCACAAGCA 120 

ACACCTGAAG CGCCCTTTGG ACCTGGGCCA GTTGCTGGCT TAAAACACAT GTTAGCGTAT 180 

GGTGAACGTG ATGGCTTTAC GGTGAAAAAT GTCGACAATT ACGCTGGACA CATTGAATAT 240

GGTGAAGGCG ACGAAACGTT AGGAATTTTT GGTCATATGG ACGTTGTTCC TGCTGGTGAT 300 

GGTTGGGAAA CAGATCCTTA TGAACCAGTC ATTAAAGATG GCAAAATTTA TGCACGTGGT 360

GCAAGCGATG ATAAAGGCCC AAGTATGGCT GCTTATTACG CAATGAAAAT TATCAAAGAA 420

TTAGGCTTAC CCGTGTCTAA AAAAATTCGT TTCGTAGTAG GAAGTGACGA AGAAAGCGGC 480

TGGGGCGATA TGGATTACTA TTTCCAACAT GAAGAAGCGC CAGACTTTGG TTTCTCACCA 540

GATGCTGAGT TCCCAATTAT CAATGGGGAA AAAGGAAATG TGACTATTCG TTTAACTTTC 600

CGAGGCGGTA ACGGCGCTGA TTATAAATTA GAAAGCTTCA AATCTGGCTT ACGTGAAAAC 660

ATGGTTCCGG GAACAGCAGA TGCTGTTGTA ACGGCTGCTT CAGCAGACGA AGCGGCTTCA 720

CTAGCTGCTA GCTTTGAAAC ATTCATCAAA CAAGAAGCAA AAATTTCAGG TAACGCAGAA 780

CTTTCAGATA AAACAGTGAC TTTCCATGTA GTTGGTAAAG GTGCTCACGG TGCTAGCCCA 840

CAATCTGGAA TTAACGCAGC GACTTTCTTG GCAACTTTCT TAAATGATTA CAGCTTTGCA 900

GAAGGCGCAT ATAGCTTTAT TAATACCATT GCTGAATTTA TCCATGAAGA TTTTTATGGT 960

GAAAAATTAG GCGTGGCTTT TGAAGACGAA AAAATGGGTA AATTGACAAT GAACGCTGGG 1020

ATTGTTAACT TTGATCCAGA AAACCCTGAA AATAGCTTAG TAACATTAAA CTTCCGCTAT 1080

CCAAAAGGAA CTTCAGCAGA AGAATTACAA GCAAAAGTTC AAACAACAGT CGGTGAAACG 1140

GTCACTGCAA CACAAGGAGA TCGCAACCAA GAACCGCATT ATGTACCAGT TGATGATCCA 1200

TtGGTGGCAA CGTTACTACA AGTTTACGAA GACCATACAG GCGAAAAAGG CCAAGAACAA 1260

ATCATTGGTG GCGGTACTTA TGGCCGCTTA TTGAAACGTG GTGTTGCCTA TGGCGCAATG 1320

TTCCCTGGCT ACACAGACAC AATGCACCAA GCCAACGAAT TTATGGAATT AGATGACTTA 1380

TTCCGAGCAG CTGCTATTTA TGCAGATGCG ATTTATCGTT TAGCGAAGTA AAGTAAAAAA 1440

GCyGTTTTCG ATTTTCGAAA ACAGCTTTTT TTTGCAATTT TTGCCTTTTA AAAAcCCaAA 1500

ATCAGAACCG TCAATTTACT GAAAATTTCT TGCAAATCAA CGGATCCCAC GATACGATAA 1560

ATAGGAGAAA ACGCAATGAG TGAAACACTT ATTGTTAACT ATTTAATCAA AGGACGTGTG 1620

AAGATGAACG TAGAAGAATT AGTTCnAAAA GCmAAAGATT TAATTGGAGA AGGCmACGTG 1680

GAACATGCGC AACAATTCAT TGAAGAACAT AAAGATGAaT TaGGCGrACA CTATCmTAAA 1740

ATTACmGCGA TGTTAGCTGA TmCAaCGACC GATGGTTtGT TTGaTAAAAT TAAAGgATtG 1800

TTcAATAGTT GAAAAAAATC 1820



(2) INFORMATION FOR SEQ ID NO: 248:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 931 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:

GACATTTAAT TGAACAATTA GAACAACACT ATCACGGAAT GATAATAGTA CTTTATACAA 60 

AAAATTGTCA ATCATACGCA GCATCCTTAA ATCAAAAAGG CATAAAGACG ATACTTTTTG 120 

ATAAGAAAAC TTTGTTGATT ACAACGGCGA TTTCATACTT GAAGTCAGCG CCGTTAATTT 180 
TTGTAGATAA


TTATTTTPCA 


GAATTGGCAA 


TTTTAAAAAA 


AAAGCAAGTA 


GTTATCCAAT 


240 


T Ik T P P P A T C P 


TAATPPPPPP 


TTTAAAAPr A 


GTTTGGCTGG 


GGTGCTTATC 


ACACGAAACA 




t a n a anT app 


nnuOrt l Ortr^ri 


TPAPPTTTPA 


AAAaGTTtAC 


GmTAAAATGA 


ATTATTTTAT 


J D \J 


Lul lAAJ. 1 A 


PTAAA A ATPP 


AAPPAATTTT 


TAGAAACAAT 


TATCGTCTCG 


AGAATGCTCA 


H £ \) 


7\ rp rp rp rp f~> rp J\ rp 




pppptpttpp 


TTATCTTGAA 


AAATTAGAGG 


TAGGAGAAAC 


a q n 
ft o u 


AAA 1 LAunft 1 


AA 1 AAAv? ill 


TATATPPAPP 


AACTTACCGA GAgGGAATGA ATGAAATGAT 


r a n 


bL 1 1 o 1 oA 1 A 




TA AAPPPTTT 
InnnUuL 111 


TTCTGCTATG 


CCAAATTTTC 


ATTTTTATAT 


ouu 


GAAATTACAT CCATCAATCC AGTTAGATGC CATTGAACTA CCGAGAAATG TTTCTATTTG 660

GGAAAAGCAT ATTTTTGAAA GCTTTAGTGA AATAGGgCTA CTTGATTACT GATTATTCGA 720

GTGTGGTCTT TGAATACATG CATGTTAAAG AGCAACCGAA CATTTaTTTT TgTCCAGATT 780

aGAAAAtACG CATTAAATCC TGGAATkGAG CCTGATTTTT TTGAATATTT ACCAGGACCG 840

TTAGTTGaAA ATGaAAAACA GTgGATgTGG GCATGtCAAA TTTTcCAtAA ATAGTTAtAA 900

rGAkAATaTT ACtrGAtTaA CGAGGAATnG G 931



(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1948 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 



CCTATGTAAA TAATGAAAAT CCCATATTTT CCAGAAGGGA TAAATGTGAG TTTTGTTGAA 60

ATTCTTGGCA AAAATCAACT ATTTGTTCGA ACGTATGAAC GTGGTGTAGG TTTTACAAGT 120

GCTTGTGGGA CAGCTATGTG CGCAAGTAGT TTACTTTATA CGTTATTGAA AGATGGTGTT 180

TTTTATGAAG AAATAACCGT CAAAAATACT GGGGGTATGG TGAAAACAGT CGTCCACGAA 240

ACGTCAGATG GCAGTTATTG GATGGAATTA ATTGGGAATG CCACGATTAC GCATTTGATT 300

GAAGGATCAC TGACTGATTT ACTAAATGGT GCATTTGAGA AAATAACTAT CACTGAAACG 360

AATGAACAGA AACATTATCA AGAATTTCTG CAAACACTTT CTCAAAAATA ACAATTGTTC 420

CTTGATAAAG CGCTCTTTTA TTTTACAATT GTGTTAAGTT CTTGTTCCAA AAAATAAAAA 480

ATATTGCTTG AATAAGCAAA CTATGAGAAA ATAAAGGGCA TGAATTAAGA ATTATTCTTA 540

GATAAACGAG GAAGTGTCTT ATGAAACTAT GGAAAAAAGT AGTACTGGGC TTCTTGAGCG 600

TCTTGCTAGT GGCAGTAGCT GGGGTATGTG CTTACGGGAT TAAGATGTAT TCAGATGCCA 660

ATTCAACAAT TAATGGGATC TATCAATCAG TTAACCGTAA GTCCAATAAA GGAGCCACGG 720

CCAATATTGA TGCGCAAGAA CCATTCTCTG TTCTTTTAAT GGGGATTGAC ACAGGCGATT 780

TAGGTCGAAC AGAACAAGGT CGTTCAGATA CTACAATGGT TGTAACCATC AACCCGAAAG 840

AAAACAAGTC CACAATGATC AGTTTGGATA GAGATATTCT AACAGATATT GTTGGAAATG 900

ACACACAAGA CAAATTGAAC GATGCCTATG CATTTGGTGG CGCCGAAATG GCGATTAATA 960

CAGTCCAAGA ATTACTGGAT ATTCCCATCC ATCACTATGT ATCCATCAAT ATGAAAGGGT 1020

TAAAAGACCT GATTGATGCG GTAGGCGGTA TTGAAGTAGA TAATACGATT GGTGAATTTA 1080

CGTTGGACGG TATTACTGTT CCAGCAGGTA AAATCmAGTT AGATGGCACG ACAGGTTTGG 1140

CTTATGCGCG GATGCtCaCG AAGATCCTGA AGGAGATGTG GGTCGTCAAC GCCGCCAACG 1200

AGAAGTTGTT GAGAAAATTG TGCGCAAAGT CATGAGTTTT GACGGTGTTT CAAAATATCG 1260

AAAAATTCTT GATGCTGTTG AAGCCAATGT AAAAACAGAC TTAACTTGGG ATGACATGAT 1320

GGACATCCAA TCAAAATATT TATCTGCCTT TAAAACAATT GACTCTGAAC AATTACAAGG 1380

ATACAGTGCA ACCATTGATG ATATTTATTA TCAAGTCCTA GATCCAAATA GCTTATACAA 1440

AACACAAACA ACACTCAGAA AACAACTTGG TTTAAAAGAA CATGCTTCTG AACGTGAAAA 1500

AGATTTAGCC TTTTATAATC AGTTTAGCTA TGCTGTTACA GATACTGCAC TGATTGGTGA 1560

ATCTGGTGAT TCAAGTGCCA ATAGCTCTAC TAGTTCGAGT AGCACAGCGG ACACCACAGC 1620

CGGACAAACA GAATACAACG GcAATACGCA ACAAGCAGCA GAAGGAACCT ACGATCCGAA 1680

TACGTACGTT GACCCAAATG CGTACACAGA TCCAAACGCT TATGCTGATC CAAATACGTA 1740

TGTGGATCCC AACGCACCAC AATATTAATC TGTTAAACAA GCTCGAAAAC TTCTATTTCT 1800

GATGGGAAAT AGAAGTTTTT TTCTTTTATT CTAGAAATTT TTTTCAGAAG GTGTTATAAT 1860

TGGACTATAC CAnAATAAAC CGTAAGGAGC GTGTCAAAAT GCAAATTATC CGTGTAGCAA 1920

ATGCAGAAGA AGGCGGCAAA AAAGCGTT 1948



(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

TCAAGCCAGC AAGTAAGGAA rGArGTGTrG ATTTCCyGtC CyGaCTTtCC TACTATCGCA 60 

TAAAACTTCC CAAGTTCAAA TTTTTGATTT ACTCCTGACA AAACTTTTTC TTTAGAATTT 120 

GCATAACTAT ATGCTACATT CTTTATTTCT AATATATCCA TTATTTTCTC CTAACTTATT 180 

TTTGATAAGA TTTCTTTAGG CTTCTTGATT AACATTAATG AAGAAGCAAA TACAACTGAT 240

AAAACAATAA TACTTATTAA TATTAAATAA CTTTGTCCAA GCGTTGTTAT ATTTGACATA 300 

AAACTACTAC TATTTATTnA ACTTCCaCCG GAAATCATTG rATCCTCTGA GTTAATGAAT 360 

CCATCTACAA TTACTTTTAG TAATACATTT CCTAAAAATA AGGAAGATAT TATACTTGGT 420

ATCGAyATGA ATATTAACTC GAATATAAAT TGCATTATAA TTTGTATCTT AGATGTTCCA 480

ATAGATAAAA ATATACCTAT TTCATAAATT CTTTCCCTTA ACCATAGAAT CAAGATTAAT 540 
PAAAP.AAPAA




TaapaTAATA 


p.a at a ap.tp a 


1 1 r\L 1 i 1 ml 




600 




PTPaPTPT a 7A 


APAPTPTTPA 
AbAL 1L1 1 \*r\ 


aapp.pattap. 

HAL vj L^ri 1 1 Hu 


TA TA TPTTTf TP 

HH1 CI 1 1 1 l\y 


AAPAA A AT AT 


660 


TTTT A PTP A T 
ill LjHL* 1 tn 1 


PA UTTTTa at 
LHA1 1 1 lAnu 


V- 1 \^ 1 1 1 UHA 1 


TTPTTTaAPP 




A^ATTrTfiPA 

ivy IvJUn 


720 


/"■tappapaat 


A P A TT A TV n TV rp 

HbHl 1 AAAA1 


TTTATTTPPA 

1 1 1 Al 1 1 b^A 


ATTTPATTAT 
Hill LH1 1 HI 


'PTTPTPTATATT 
1 1 1 b 1 vjHHI 1 


PTTT A ATAPP 


780 


TCTTGGbl lb 


TTf^A ATA A T/"* 

I 1 GAAi AAlb 


ta^aaapa r^r^ 
1 AbAAAbAbb 


A 1 A 1 1 1 1 b Ab 


T 7\A A A TP A PA 

1 AAAA 1 bAb A 


apaTa aTPPT 

HbA 1 AA 1 tt 1 


840 


GTATATGTTT 


Z" 1 rp rp rp f 1 1 rp rp rp 

bb ibl 1 1 I I 1 


TV TV Tv A A A 

Ab b Ab AAAAb 


Al bbb 1 A1AA 


rpmrprpTV TV rn rp rp 
111 1 AAA1 1 1 


JTPnPTTTTT 
A 1 bAb 1 1 1 i 1 


QOfl 


ATTTTTCCAC 


TTTTTTCAAT 


JVTP1l7\Pn'3\ AT 

ATCT AG 1AA1 


rp/— » tv tv <^-> tv rp tv 71 

ibAAGAl bAA 


b 1 1 bA 1 bAbb 


TA ATTTTQPTA 

1AA1 1 1 iAbA 




TTGTTTTGTT 


TAGCAAATTC 


TTCATGAAbA 


TV rp T\ ATTtP A AT 1 

ATAATTGAAT 


TCTTATCATT 


1 lb 1 1 b 1 Al A 


1UZU 


TTTTCACCTT 


CTTTAATTGT 


AAATACTCCA 


O rp tv /—« rp TV 71 7\ rp 71 

b r Abl AAATA 


AAATATTTCT 


111 Ab 1 Al 1H 


1UOU 


TTTGTAGCTT 


CGAGTGAAAC 


t\ tv T\ rp fp rp rp t\ 

AACATTCTTA 


A A rp rp /-»/"• rp/""* 7v f~* 

AATTCGTCAG 


A rp TV TV A T fT^T* f~* 

ATAAATCTTb 


rp rp 7A nn rn t\ r T 1 ^ 

1 b 1 Al 1 1 Al 1 


1 1 A fi 


CTTTGTTCTC 


CGCTAACTAC 


TTTAGCATCT 


TTTAGTTTTG 


rp TV Tv rp /~i /*■• Ti rp iTS 

CTAATbbAib 


ATATTPA RTT 
Al Al 1 bAAl 1 


1 z uu 


ATTTTTTCTT 


CAATTTCTTT 


TAATTTTTCA 


ATATCTTTAA 


ATTGA11 AAb 


ATTAAAATAT 
A i 1 AAAA1 Al 


1 9 fin 


TTACCATCTT 


TTTTTGTTAT 


T 1 TV rp A rp rp ^ t\ TV 

T GAT AT T G AA 


/-» TV A T»rp APTi A 

G AAT T AG AAb 


T f ibAi Al AA 


APPPTTTTPT 
Abbb 1 1 1 1 b 1 


J. 


ATTTCATTGC 


TTGATTTCAT 


TATTGTTAAA 


b AAG AAT AC A 


AG b AAG AAAb 


AAPTATAPTT 

AAb 1 A 1 Ab 1 1 


1 *^R0 


AATATGATAA 


AAATTATTAA 


AGTTCTATTT 


TTCTTTCTTG 


TA A TATATPP 
1 AA1 Al AlGb 


T AT APPnTTT 
1 AlAbbAl 1 1 


X *J H \J 


TTTATCACTT 


m f**^m ta ta m ta 

TCCTAACTAC 


TCCTCTCTAT 


ITibl lAblb 


tv rp r« APTA 7\T 

b A 1 AAb 1 AA 1 


TPP ATPPaPT 
1 bbA 1 LtAt 1 


i ^nn 

X jUw 


GGACAAGCAT 


TTTTACATCG 


TCCACATbTG 


A I Ab Ab 1 bbA 


P A TP A TTPP TA 

bAl bAl IbbH 


AT TP TP a APT 
A 1 1 1 tnnvy 1 


1560 


GAATCTTTAT 


m T\ rp rp rp *-"/■■«/-* 7\ 

TCATTTGGCA 


AACTTTTTTA 


bAl 1 1 IbbAb 


7\7\TTA. ATfiPTA 

AA1 IHHIHbH 


PTT A TPPTT A 
k,l InlLUX In 


1620 


TCCAATCTAT 


TlTPTTiTi TV rp TV T 

ATCT AAATA i 


1 GAAA 1 AG bG 


i lAAAA/iJ. 1 b 


7APT2i2i7AT7AnP 
rtb 1 HHH 1 nut 




1680 


CAT AT AT ATT 


Ta C AAAAAGG 


TCTATATATA 


Al 1 AlAbAAA 


P T TA TA P TA Zl TA P T 
b 1 AALAAnvj 1 


A APT ATT AP, A 


1740 


ji rp /-> TV rp T» ^ H 1 

ATGATTCCTT 


TCbAATAAi A 


T\ A A A nPPT 

b AAb AAAb b 1 


A 1 bb 1 At lib 


1 bH 1 1111 


ATTTAATAAA 


1800 


ACAAGTGGTA 


ATCCTCCTTb 


bAGbAlAbbA 


A 1 bbbAbAAA 


TTATAPTTT&PZl 
1 AAb ill HbA 


A A A AT APP. P. A 
AHHA 1 A\— v^Ort 


1860 


GAACCb rGAb 


/-•rp TV TV A nTATP 

b I AA AA 1 A 1 b 


AAPPAAASAT 

HHbbHAAAAl 


AAAuu Inftl H 


TTaaTP.aaaa. 

1 1 AH 1 \3 AHHA 


ATAPAAHTAA 


1920 


TATTATATAT 


rp nn r** 7v t\ fp hp rp^^ 
1 1 bAA I UlL 


1 bAAAl b 111 


A 1 LAAnt 1 1 H 


AA1 bill bnn 


PTTTTTT A AA 


1980


AAAbbGAA i. b 


1 1 b 1 b 1 AbbA 


7i aTPTTPa 
AAlbl lkjAAl 


P A A rppp T\ A A rp 

LAA 1 Ltnnn 1 


r;cziP2iPa^pP 


ATrPArAAAC 

r\ x VvvnwnruiVki 


2040 


AAA1 L T 1 bbb 


A A TV rp rp ft 

Abb AAT Gb 1 b 


prn ATAAAAAA 

bl Al AAAAAA 


pTvnn a TPPP TA 
bAAHHi LLUA 


Ab 1 A 1 A 1 AA 1 


A A A ATA APT A 


2100 


7\ 'P HP OP T\ TV rp rp 

Al bl 1 Al Al 1 


1 1 bAA ill bb 


TV A tt a ptppp 
AH 1 1 At- 1 oLL 


1 0 1 nu 1 on 1 ^ 


PT2iTTP.P.aPA 


AP.AAPPTTTT 
nOnnLiV/ x x x x 


2160 


GCTCCCGGGc 


ATGAATAACA 


ATTTATACCC 


GGAACACAAA 


TTTTCTTTAA 


ATTCCCCCCG 


2220 


TAGATTTTAC 


CAGTCTTAAA 


CCCTTCaAAG 


TAAGAATTAG


TAATTAAAAA 


CCAAACGGrT 


2280 


TGAAATAAAT 


GCTTAAAACG 


ATCTCTTATT 


TTATATTTAT 


CTTTTTTTGt 


ACTACCATTT 


2340 


ATAATATTtT 


TAATTCTATC 


CaATTCCAAT 


ACACTCCATA 


CAAATTCTTA 


TCGCCTTATC 


2400 


CmAGaCmACy 


TTAGTTTCtC 


CcATTACTAT 


CAACAAAGAA 


AGTCTCTGGC 


ATAGCCTGTA 


2460 


TACCATTCAA 


TCTACCATTA 


AAGTTTGTCT 


TATCTGGCAT 


TAAGAAAGGA 


TATGAAGCCT 


2520 




TTGTCTTTTC ATGTATTAAC TTTGACTTTT CAATTGCTTC CTTGTTTTCA CCGTTATCAT 2580 

CAACAGGATC TGTAACAACT CCAACGATAT TTACACCCTT GCTTTTCATT TCATTCTGAA 2640

CCTCAACCAA ATCAGGAATT TCCTTTACAC AAGCTGTACA CCAAGTTGCA AATACATTTA 2700 

CCATAGTAAG ATCATACTTC TCAAAATCTT TACTTGTAAA ATCCTTGCCA TTTATGTCCT 2760 

TTGTTGATAG TTTACGTAAA TCTTTAACCG ATTCTTTGTT AAATGCTTCT GTATTTTCTA 2820 

AATCAGTTTT TTCTGATAGC ACAAAACCAT TTTTAGGACG TTCTTTTTTA TCTATAATTT 2880 

GAATCTCAGT TCTTTTTAAT TCATCCAGAA GATTACTTTC AGCTCCGCTA TTTGTGCTTA 2940 

AATAGCAATC GTATTTTCCA TCACTTGAAA CTCCTATTTT AGTATGAGTA TCACATTTTA 3000

TAATTTTAGA TTTTTTTCTT CTGATGTATT TTTTTCAAAT ATACCAATAG TCCCaATTCT 3060 

TTCAAGCTCA TTTTGCCAAT TTTTATATCC ATCGCCCATC TTTTCTATAA CgCATyCTTT 3120 

TGgTtCTtCT GTCATTTTTt CAAATGTCAA AATAGCATAC TTTAGTTCTt TAwCyATAGG 3180 

ACTTnGAnCA TCTAACAnCG CTAnCTTTTa TCAGCTATAT AATTTCTAAA TTTGTCTGAG 3240 

AGTTTAAATT TAAGCCCTAA ATATTCATAA ACATATTCTT TCTTtGCCTC CaATGTATAA 3300 

TCTGaTGGCT TAAAAGTTTC ATTTTTGcCA TTTGCATTAA TTGACATGCC TTCCTCTTCA 3360 

GAACTTATGG ATTGTCCTTC TTTTATTTTT ACTTTTTTGA CACCCAACTG CGCTTAAGCT 3420

CAAACATAAT GCAAGTAATA GGCATGCTCC TCTTGGTACT ATTTTTTCAT AAATTATTCC 3480

TCCTTTATTT TTTACACAAT AGCATTTTAT AGCTACTTTC AATAACATTC TACACACCTT 3540 

TTATGAAACG TTGATGAAAT ATAATTTTTT ATTTTTCCAA ACTATGGTAT AACAATATAA 3600 

GTATTATGCT TAAAGTTGTT CTTACACGAT TAGTATATGA TCGATTAATA TATGAGCAAG 3660 

CACAGTAAGT GTACGGACAC TTACGAAAAA AATGATGAGG CAGCCCTTTG GGATCTGCTT 3720 

CTTTTTTTAT CAATTTTTAA CTTGTTAGGG TGTACCCTAA GACCCCGAAA TACATTCAAA 3780 

GGAGGATTAC CTATAGCAAA TAGAATACGA AATGAAAGGC TTGAAATTAA ATTAACAGAA 3840

GAAGAAAAGG CTCTTTTTGA GGAGAAAAGA AAACTTGCGA AATGTAAAAA TATGAGCCAT 3900 

TTCATCCGCA AGTGTGTTTT AGAAAAAGAA ATATATTAAG TGGATTTAGA GCCGTTTAGA 3960 

GATTTACAGG GTTTCCTTTC CAATGCTACC AGCAACATAA ATCAGATTGC AAAGCGTGTG 4020

AATTCCACCG GCATAATCTA TAAGGACGAT ATAAATGATA TGAAAAAGCA GATTGAGTAT 4080 

TTCTCAAAAG AGTTGTGGCA AATACATTCT CTGCTTCTTA ACAGAACTTC CGGAGTGTTA 4140 

AATGAGAGTG TAAAATATTT TGTGTAAATA GAAAAAAGAA AGTCCCTTCT GTAGAATAGA 4200 

GTTACCACAA CACATTCACA GAAGAGAGGA CTTCCCTATG AACGATTTTA CTACAGAAAT 4260

TCTAAAGACT CTAGCGAACA AAGGCGATTT GAATGAATTA TTCCGTGTCC ATTTGGAGAA 4320

AGCAGTCAAT ACGCTTCTCA AAACGGAGTT AACGGCTTTC CTCGATTACG AAAAGTATGA 4380

TCGCATTGGT TTTAACACGG GTAATTCTCG TAACGGCTCC TATGACCGTA CGGTCAAGAC 4440

CGAGTATGGG GAACTTCATC TCCAGATTCC GCGCGACCGC AATGGCGAGT TCAAGCAACA 4500

GACTGTTCCT GCTTATAGAC GGACGAATGA CACGTTAGAG GAGACCGTCA TTCACCTCTT 4560

CCGAAAAGGT ATTACCATGT CGGAAATCGC AGACTTGATT GAGAAAATGT ATGGGcATCA 4620

CTACACGCCC CAAACCATGT CCAATATAAC AAAATCATTT aCAGAAGAGG TAACGGCGTT 4680

TaAaGGGcGG GAcTTTCATG GACCGTTATG GCGCTATTTA TTTGGGCCnC ACGTTTTTCn 4740

CGTAAAAGGG AAAAnCGCCA CGAAG 4765 
(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 



GTGTGTCTAA 


CGGTTCTAAT 


GGTTTATAAA 


CGACCGCATA 


TTTAATCCCA 


TATTCTTTTA 


60 


ATTTTTCTGG 


AGATAAACGA 


GCAmTAATCG 


CATCTTTTTG 


CATCAACTCT 


AAAACAGGGG 


120 


TCTGATTTCC 


TACTTCAACA 


TTGATGTGCG 


CATAGCGATT 


ATAAATAAAT 


GTGTCTTTTC 


180 


CTTTTGGATC 


TAATGGTTTC 


CAAGCCTCTA 


AGTCAGGTGT 


AAAAGCAACA 


CCGTTAAATG 


240 


TATGCACACC 


TAACATTGGC 


AAATAGCCAT 


ACATCAAGCG 


TTCGCCCACC 


CACAATTGGT 


300


TGGGGTCTTT 


TTTGTTAATT 


TCTTCCACTT 


CTACCGCTAA 


TTTTTkACCA 


TAGACAGCAG 


360 


AAACGCCACG 


AGAAATTGGA 


TTTACAAAGG 


CTCCAGAAAC 


CATAATGATT 


CCTAATAAGA 


420 


CAAGGACAAA 


TAACGTGCGC 


CAACGTTTAA 


ATAAAGCAAC 


AATTAAAACG 


GTAGCGACAA 


480 


CTAAGACACC 


AATGATGTCC 


ATTCGTGTAA 


CGTATAACCG 


TAAATTGCCC 


GTATATAAAA 


540 


CAAAGAAATA 


CAGAACAACA 


TTTAATCCTA 


GTGCCCCTAA 


CCAGGCCATC 


ATTGGAATTT 


600 


TTGCTTGGCG 


ATTCCATAAw 


TAATCGATGA 


mCCwTAAaCT 


TAACAACACA 


GCGGCAAaCC 


660 


CGAAGaTAAC 


ATCGCCCGTT 


GTTCTGGCAC 


ATAAGACCAC 


AATGTCAACT 


TAGCAAAAAT 


720 


TGTTGGAAAT 


TGCACGCTCA 


TCCaAACTAA 


GTTAAATAAA 


CAATACACGA 


ATAAGACAAA 


780 


ACCGTAAATG 


tTTTCTTTGA 


TTTTTTTATA 


AAAAATAAAT 


GGTGAGAGCG 


GCAAAATAAC 


840 


AAAAAATAAT 


TGATAAAAAC 


TACTGATTTC 


CGAATTATTG 


GAATATGGAA 


CATCTTGAAA 


900 


CGGCATTTTC 


CAATTTGTTA 


AAAATAACAA 


AATATCGCGT 


TTAGGCATAT 


CGCCACCTAA 


960 


ACTTACCCGT 


TTTCCTGGAT 


AAGCGGTATC 


CATTACTGCT 


TTTAAGGAAT 


CCAATGAGCC 


1020 


ATATAGCGAT 


CCGCCAACAA 


TAATCGCCGT 


CATCAAGAGT 


GCCCCrCCTA 


TCAGGACCCC 


1080 


ATCCCACTTA 


TCCAACTTAA 


TTTTTTTACG 


AAATTCTAAA 


AAGAATAAAA 


TTAAAAATAA 


1140 


TAAAATCAAA


TAACCAAGTG


GTACTTGTAA 


CGCAGGATAA 


ATAACTAAAA 


CAAATCCTGA 


1200 


GGCCATAATA 


ACAGCCGTTA 


TCGCACAAAT 


GACACGACGT 


CGAACACTAC 


TATGATAATA 


1260 


GAAATAATTA 


TAAATTCCCA 


CTAAAAAACc 


TAAGCCAAAA 


AATACTAAAT 


CTCCAACTGG 


1320


CGAGACAAAC


CACCATTGAA 


GGGCCGAAGA 


AAACGGAATC 


CAAACACTTG 


CTAACAAAGC 


1380


TAAATATTTA 


TTCCGCCGTG 


TGACGATCAA 


GCCAATTTCA 


AAAGAAAATA 


AAATTAAGCC 


1440 


AATTAATTTA 


AGATTCCAAT 


ACCATGATAA 


GCCATATTCT 


TTGCCTAATA 


GTACATAGCC 


1500 


CCACGTAAAA 


GGTTTCGCTA 


ACGTGGCCAA 


ATTCCACGCT 


GGTGAATTAT 


AGCCTACCAC 


1560 


CATGTCTTGC 


CCATTTAACG 


TAATCACATC 


ATTATGTGTT 


TTCAAACCTG 


TTTGTGTTTG 


1620 


TGAGAGGCTA 


AATGGGGTTT 


GAACGAGCCA 


TTCATCCGAG 


CGAACACCTC 


TTGGTTCACC 


1680 


AGCAATTAAT 


GAACTTTTTT 


GACCATCCGC 


ATATTCAGAA 


ACATACGCAT


CCCACATGTT 


1740 


TAATGAACTA 


CCATGGACTT 


TAAAGGCGAC 


CAGTAAAATA 


AAAATGACTG 


CCGCTAGCAG 


1800 


ATAGCGCCAs 


GTrAACAAAT 


GATCAACCAA 


GCGAGCACTC 


TGCTCAGCTT 


GGTTGCGTAA 


1860 


TGGTTGTTTT 


TCTTTTCTTG 


TAGTTAGTTG 


TTGTTCCATC 


TTTTCGCTCC 


ATTTCGTCAT


1920 


CCGCAATTGC 


ATCTAGTCAA 


TTCGTAATGA 


AAAGTTGGGA 


ATATCCCTTG 


TCAGATTAGG 


1980 


ATTATAAAAT 


GGATCATTTT 


CAATTAAAGG 


TCCCCATTTT 


TCTTCCATCA 


TCACTTTTTC 


2040 


TTGTTCAAAA 


CGTTTTTTCT 


TTTTGCCTTT 


ATCATCATAT 


CCACGAGTTT 


GAGATTCAAA 


2100 


ATGATAAAGC 


TCTGCTTCAT 


GAAGCCAGAC 


ATTGTCACGT 


CCTAAAGCTT 


GGACTTTCAA 


2160 


GCATAAATCA 


ACATCATTGA 


AAGCTACGGT 


AAACGCTTCT 


TCAAAACCAC 


CAACTGCATC 


2220 


AAAATCTGCT 


TTTTTCATAA 


GTAAGCATGC 


GGCAGTTACT 


GCTGAATAAT 


TAACATTAAT 


2280 


AGCTAACCGA 


CCAAAATAGC 


CTAAGTCACC 


ATGGGGATAA 


CCATAATGAC 


CATGTCCAGC 


2340 


GACACCACCT 


AATCCTAAAA 


TAACGCCTGC 


GTGTTGGACC 


GTATTATTTG 


GaTACAATAA 


2400 


TTTCGCACCT 


ACACAGCCAA 


TCCGTTCTTG 


CTGAGCmAAT 


GAAACCATCA AGGTTAACCA 


2460 


ATTCTCTGTA 


ATCACTTCGG 


TATCATTATT 


TAAAAATAAT 


AAATACTCTC 


CGTGCGCTTT 


2520 


TTTCGCTGCA 


CGATTGTTAA 


TAGTTGAGAA 


ATTGAATGGA 


ATATCGATTG 


ATTCAACGAA 


2580 


AAAGCGACCA 


GGTAATTGTT 


GTTCAAATTC 


TGCATATAAT 


TCGTGCATTT 


TTGGATCTGT 


2640 


ACTTCCATTG 


TCAGCCATAA 


TAATCTCGTA 


ATTTTGGTAG 


GTTGTTTTTT 


CAATAATCGA 


2700 


TGAAACACAA 


CGTTGTACGT 


CTTTATAGCC 


ATTTTTAGTT 


GGAATAATAA 


TTGAGACTAA 


2760 


CTTTTCAGAT 


TCAATGTCAT 


AATAAACATC 


GTATAAGCCA 


TTCGCCGCCC 


CATGCGTTGC 


2820 


ATGAGCATTA


ATTCCTCGAC 


GAACGAGCGC 


ATCTTGCACG 


GCTCTTAGCC 


CCGCTTCAAA 


2880 


AGCATAACCT 


TTGGAGCCTT 


GATCCACTGC 


TGTTGAGGTT 


GGCAGCATCC 


GCCAATAGTA 


2940 


AAGAACTTTT 


GGAATATGTT 


TGATGCGTTC 


TTTTGTCGTT 


TTTTCAGTAA 


ATCTTAAGAC 


3000 


TAAATCGTAG 


TCTTGAGAAC 


CTTCGTAACC 


TTTTCGAAAG 


CCGCCAATTT 


CTTCTAGAAT 


3060 


ACTCCGACGA 


TACACGCCTA 


AATGCGAAAT 


GTAATTCGTT 


CCTAAAAGTA 


AATCAGGTGA 


3120 


CCAATCTGGT 


TTAAAGGCTG


GATCAGAACG 


ATTTCCATCC 


ATGTCAATCT 


TATCTTCATC 


3180 


ACTATAAATT 


AAATCCAGTT 


CTGGGTTTTC 


ATTCAGCACT 


TTAACCACTT 


CATAAAAAGC 


3240 


ATTGATCGCT 


AATTCATCAT 


CGTTGTCTAG 


TAAGGCGACA 


AATTCTCCGG 


TAGCAATCGC


3300


TAAAGCAGAG


TTGGTTGCTT 


CGGAAATATG 


ACCATTTTGT 


TCACGAAAGA 


CGACCCGAAT 


3360 


TCGCTCATCC 


AATTGCTGGT


ACTCTGTTAA


AATCTTTTTG 


ACATTAGGAT 


CTGTTGAGGC 


3420 


GTCATCGGCC 


ATACATAATT 


CCCAGTTAGT 


ATAGACTTGA 


TTCAAAATAG 


AATCAATGCA 


3480 


CAACCGAAGC 


CATTTTTCTT 


CAACATTATA 


CACTGGCATC 


GCAATGGAAA 


TTTTAGGCTG 


3540 


ATAATGAAAT 


GTTGCGATTT 


CTTGCGTCAT 


CGCTTCTATG 


TCTAGTACTT 


CATTTCTAGC 


3600 


TAACCAGTTA 


GGATACGAAG 


CTTGATTTCT 


TAGTTTCTCA 


ATTTTCGCTC 


GTTGAATGGT 


3660 


ATGAGAAATA 


CCATTTTTAG 


CCATATACTT 


AAATCCTTTT 


TTGGCTTTTA 


TCCAAAGACG 


3720 


AGTTACTTTA 


TCTTCTGTCC 


CTGGAATCAC 


TGGATATTTT 


TTATTCAAGG 


GAAAATCCAC 


3780 


CGTTATAACA 


TGTGCTGATG 


ATTGAAAATG 


AAAAGGGAGT 


ACTTTTTTTT 


GTTTAATTCC 


3840 


TTCAAGTGTA 


ACAACAAAGC 


CAGCTTCAAT 


CGCAGGCTCT 


GTTTGATAAA 


TTTGATTAAC 


3900 


ATCTTCACGT 


AAAACGCGCT 


GAATGTTATA 


AGCGGACACT 


TGGTTTtCAT 


TATTAATTGT 


3960 


GAAAGTTGGG 


CTTTCTTTAG 


TGATtGTAtC 


TAACGCCCAA 


CCTGTAAtGG 


TTAAaTtATT 


4020 


tGTTGTTTTG 


TCGCGATAGA 


TACTATCAAA 


GATC 






4054 



(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3074 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 



AGCTCATAGT 


TTTTCATAAA 


AGAGTTCATC 


CGCTTAGCAG 


TAGCTTTAGA 


TACACCATTA 


60 


CCAGATTTAG 


TAGTATTATC


AAACGGCTTT 


TCGAATAAAT 


CCTTTGCTGA 


AGCTTTGATT 


120 


TCTTTTATTC 


CTGATTTGCT 


TTTTTTAACT 


GCTTTATCAA 


TACCTTGAAC 


TAGAGGACCC 


180


AATAAAAAAT 


TATCCTTTGA 


TGATTCCCAT 


AATTCACTAA 


TTTTTTTAGA 


GATATCGGGC 


240 


CATTTTTCTT 


GCAGTTTCTT 


TCCAAATTTA 


CTTCCAAAAA 


TTGATCCTGC 


AGCTGTTCCT 


300 


AAAACGCCAC 


CAATTAGAGT 


ACCTAACGGA 


CCAAATGTTG 


TCCCTGCTGC 


TGCACCCAAT 


360 


TTTGCGCCAG 


CTAATGTTGC 


TGCTTTAGCG 


CCTAATGCTC 


CACCAACGAT 


AGAACCTAAC 


420 


GAATCACCAA 


TTTTTTCAAA 


CTTATTATTT 


TTATTAATTT 


GAGTTAAAGT 


TAAACCTGCC 


480 


GAAATATAAG 


ACAATAAAGG 


AACCGATTTG 


GCAAACCTTC 


TCACTCCGCT 


AAAAAGTTTA 


540 


GTACCTTTTC 


CTATTTTGCC 


AACAGATCCA 


ACCGGTAAGG 


ATTCCTGTAT 


ATTTGGACTA 


600 


GCAGTAGTAT 


GCGTTGGTGC 


GCTAGCTTTT 


TGAGCTGCAG 


AATCAACATC 


TTCTAAAGCC 


660 


TTTCTTGTTT 


TAAAAGCTTC 


TCTCTGAGCA 


GTGTTCCCAA 


GCTTTTCTAG 


TTTACCAATA 


720 


ATACCTAAAA 


TTCCCTTTGA 


GACTCCCAAG 


AGCGGTTTTC 


CTACTAGCTT 


AAACCCTAGC 


780 


AAACCGCCAA 


TACCATACGT 


GGCCCATTTG 


CCAACTTGTT 


TCATAGCAAC


AGGGTTTTCA 


840 


TCAGTAAATT 


CTTTGAGTAT 


ACGAATGCTT 


GGCTTAATAA 


TATATTTAAA 


CCCTTTATCC 


900

ATTACGTTAA ACATATTTTT AGCACCTTCT GAAATGCCAG GTCCTATAGC TTTGCCTACC 960

TTTTTAGCTG TAGGAATTAT TTCATCAACA AAATAATCTT TTGCTTGTCC AAAGTGTTTC 1020 

ATTACCTGTT TTGCTGTTGG CTCAACTTTT GCGCCAACAT TTtCTAACAT ATCTGGTAtT 1080 

TCTTCACCTT TTTTTACAAA ATCTGTCGCT TTGATAATTA GTGGCGTGAA TATTGGTAAC 1140 

AAACCTTGCC CTAACTTCGC CGATGTTTCT TTTATTGATT CTGTAAAAAC ACGGGTGGCA 1200 

TTTGCAGCTT GATCACTTGT TCTTGCAAAA TCGCCTTGGG AGTTCTTAGT TTTAGACATT 1260 

ACGTAGTTAT AACGTAAGCG TACTAATTCT TCCTGACTTA GAGAATCTAA TTTTGCTTGT 1320 

TGAACCTCTT CACCTTTAGA TTCTGCTTCT GTTAATTTTA ACTGCGCATC TCTAGCTTCA 1380 

ATTGAATTTT CACCATGTTT TTTAATGGCT TTATTTAAAC GATCTTGAGC TTTTTCTCTT 1440

GCCATGGtAT TCTTTGCAGC CTTAGAATTA TCTACTGACG ACTGCAACGC TCCACTGGCC 1500 

ATCGCAAACT GTTCCAAGTT GGTTTGAGTC ATAACAATAC CCAAACCTTT TAACGCCTCA 1560 

GTTTCACCAG TGAATACACC ATTTAATGCA GTATTTACAC GGTCAATTCC TATATTTTTA 1620 

AATGAAGCAA GATCTCCTGC CAAGTCAACT AATGAAGTAG ACATTTTTGC AGCTTCTTCT 1680 

GTGCCAATAC CCATTGAAGT TGACATATCT CCATATGTAG CTGCTAAGTC TAACGCTGTA 1740 

CCTTGTGCCA AACCAATATT AGTCAGTGTG GACTTCGACC AATCTTCTAC AACTTTATTG 1800 

TwATCACCAA AAGCTACTTC TACTTTGTTT AAAGCTTCAT TAGTATCAGA TGCATAGTCA 1860

AAGGCCTTTT gCCTGcTCCT GCGATTGCTG CACCAGCTAT CAtTGCACCA TTTCTTATCT 1920 

TTGTAAAAGC ATTCCAAGCA ACATCTGAAG CCTTTGTTGC AACAGTACCT ATATGGCCAA 1980 

TACTTGTTTG TACACCCTTC GCAGCTTTTC CAACTACTTT TAGAGATGTT GCACTAGATT 2040

GAGCGAAAAC ACTCGCAGAA TTTTTACTTT TAwCaAACTG aTTTCCaAcA TCTTTTAcTT 2100 

GtTTTTCaGT TTGTTTAgCa GAGTTkGTTG TCtTTTGCaT AGAATCTTTT GcTCGaTTTC 2160 

CgAACTGGAC AACTTTATCT GTATTTTGkT TTAAAcCTyC ACTATTTCTT TTTAAAgAGG 2220 

TTGTGGCGCC ATCTATGGAT TTTGCTGAAT TTTGATAAGT ACCTTCCATT CGTGCAGCTT 2280

TAGCAAGAAT TTTATCAGTT TCTTCATTGG CACGCTCTAA CGATCTATTA TTTATTTTCC 2340 

AATCGAGTTC GATTACTGAA CTACGTAACG CATCTGCCAT TATTAGATGA CACCTCCTCT 2400 

TGATTTAGTT AGATTTATTT TTTGATATGC tAcTTCGTTC CAAACTGCTA GCTCCTCTGC 2460

CGTGGCAATA TCGACTTCAT CTTTAGTGGC AATGCCTGCT ATTACAGGCA GCCACCTAAA 2520 

CATATCTTTT TGAACTTCTC TTTCCGTAAT TCTCGGATTA GGGCTAGTTA AGCATTCGAC 2580 

CAAGAAAGTT GTCGGCTAAT GCCATAACCT CACGATATCC TTCGTGTTCA TCCCAATAGT 2640

CCCAGTTCAA TTTTGGTGTA ACAATAACTG AGTCCATAAG TTGAGAGTGG TAAGCAACGT 2700 

CAGAAAAAGT ATTTGACGGT CCTTTTGAGT TATCTAAAAT CGCTTGAGCT GCACGTGTTC 2760 

CAGGGAATTG AAATGTATAT TCCACATCTT CAACAGTATG TTTTTCTTGT TTTCCGAACT 2820 

TATTAAATGG TTTTTTTTCT GCTTCTGGAA TGTTATTTTT CACTTCTAAT TCTTTTACTT 2880

CATCyTTTTT CGTCATGATA AATTCCTCCT AAAGTTAAAT AAAAAGCACt TAACGAGATG 2940

TTaAGCGCyT AGtTATATTC GTGTTTGTAA TCTAATGCTT TAATCGTATA AGAACGAGTC 3000 

GGAACGCCTT TACCAAACGA TCCATCAGGT GTTTTTTCAA TATAGGCTTT TGATGCCCAT 3060 



GCTTTTTCAG TTGA 3074
(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

CTTGTGATTG TAGATGAACC GCAGATTCCT ACGAATCCAG TGCCCTTTTA TCCTTATGTA 60 

ACGAATCCTA ATCTCGTGCT CTTCCGGTTT CACGGACGTA ATGCGGCTGG CTGGTTAGCC 120 

AATGATGCAG AGTGGCGAAA AAAAAGAACC CTTTATCATT ATAATACGCA GGAAATCGCT 180 

GATTTAAGCG AAGCTGTTTT AAAAATGTCG CAAGAAGCTA AAGAAGTGGG CGTAATTTTT 240 

AATAATAATT CTGGCGGCGA TGCCGCAGAA AACGCTCTAC AGATGCAAAA AGTCTTGAAT 300 

TTGAGTTACG ATGATTTGAA TCCAAAACAA TTAGATTTAT TTTGAGAAAG AAGGTAGCTG 360

AATGTCTGAA AAGCAATCCC CTTATGAAAT GGCTTGTGCA TTCCACGAGG TTTTTAATCC 420 

ACAACAACCA GAGCAACCAA CTGCTTTTTC GCCAGAGCTA GCCAGTCGAC GAGCAGAGTT 480

TAAAGTCGAA GAACTTGTAG AATTTTTATA TAGTGCTGCT GATAATGACC AAGTGCTTTT 540

TCAACAATTG GTCAACGGAT TAAAAAAATC TGTCGATAAA GCAGAGCAGA AAGTCTTGGC 600 

GAAAGGCAAA CCAGTATCTG ATCCTTTAGT GGAGGAAGTG GATGCACTAG TTGATTTATT 660 

GTATTTTACT TATGGTTCGT TTTGCTTAAT GGGCGTCAAT CCAACAAAAA TTATTGAAAT 720 

CGTTCATCAA GCCAATATGG GGAAATTATT TCCAGATGGG CAGCCACATT ATGATCCAGA 780 

AACAAATAAA GTATTGAAAC CTGAAAATTG GGAACAAGAT TTTGCGCCAG AAGCTAAAAT 840

TAAAGCGGAA ATTCAACGCC AATTGAAAAT TACTGaAACC AATGAAAAAG ACGAATGAGA 900 

AAACTCATTC GTCTTTTTCA TTAATCATGG AAGAACGTCC GATATAAGGC TTTGATGGCT 960 

TTTTCCTCTT GTTCTTTTTT GATACCAAAC ATAATACTTA CTTCCGAAGA TCCTTGGTTA 1020 

ATCATTTCTA AGTTGATTTT TTCTTTGcTA ACGCCATTGT ACTTTCAGCA GTAACCCCAA 1080 

TTCGTTGGCG CATGCCTTCC CCGACAATCA TAATCATAGA AAGATGATGC GTGATGCGTA 1140 

ACTCATCAGG CGCTAAAACT TCTTGGAGTT GAGCCATCAG TCGATTTTCA ATTTCAGTTG 1200 

TTAACTGGTC TTCTCTTAAA ACAATCGTGA TGTCATCAAT TCCTGAGGGC ATATGCTCAT 1260

AGCTCAAGGC TAATTTTTCT AGTACTTcTA AGAGCCGTCG CCCmAAACCt AATTCTCTAT 1320 

TCATTAAATA CTTACTAATG TAAATACTTG CGAmCCCTTG ATCACTAGCA ATTCCTACAA 1380

CAGGAGCATG CTTCACTTTG CGAGAGGTGG TAATTAGGGT GCCAGGATGA TGTGGGTTGT 1440

TGGTATTTTT GATAACGACT GGAATATTTG CGCGATAGGC AGGCATTAAG GCTTCGTCGT 1500 

GGAGGACGGC AAAGCCTGCA TAAGCTAACT CCCGCATTTC TCTGTAGGTC AATTCTTGAA 1560 

TCGTTTGAGG CTGTTCAATA ATTCCAGGAT GGGCGACATA AATTCCGTCC ACATCGGTAA 1620 

AGTTTTCATA CAAATCTGCT TGAACACTGC GGCAACAATG GACCCTGTGA TATCGGAACC 1680 

GCCACGAGAA AAAGTACAAA TTTCGCCGGC TTCTGTATAA CCAAAGAAAC CTGGAATAAC 1740

CAAGATTTGT TGAGTCTCAC GATAGACACT AATTTTCTCT AAAGCTTTTG GTAAAATGCG 1800 

GGCATTGCCT GGCTCAGGTG TCACTAGG 1828 
(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 



GGCCGCCACG 


GTTTTGGTTG 


TTCCCGCCAT 


TGTTATTTTG 


GCTGGTCCGA 


TTTTGATTCG 


60 


TTGTAGAACC 


GTTATTGTTT 


I I A I I bL 




fti m f rp y\ rp rp rp 


bl 1 IluHl A 


120


TACGGTTTTG 


TGTGTTGCTT 


TGGCCGTGAC 


GATTTTGGTT 


GTtGttGTTt 


CTTkGTtGyC 


180 


CATtCGTkGA 


TkGATtGTTt 


GGACGATTTk 


GGTwATTtCT 


TkGTtCACCA 


GGTTTATtGT 


240 


yCATkGGTTy 


CTTATTGGCT 


GGTTTATTTk 


GTGTkGTTkG 


TkGCTTtTGG 


TTGTTTGTTG 


300 


CTTTTGATTG 


AGTATTTACG 


GTACTTCCTC 


CtCCGAATGC 


TTGACGAAGC 


TTTGTTTCTT 


360 


GCTCGGAACT 


AATTGCGCCC 


aTATGGTTTT 


TAACATCCAT 


ACCTAATTGA 


TGCGCTTTAT 


420 


CGACAACGTC 


TTTGCTTGCT 


TTATTCATTT 


CTTTTGCTAG


TTCATAAATT


CTTTTTTTTC 


480 


CCATGCAATC 


ACCTTCCTAA 


CCTTTTAGTA 


ATTCCTTGAT 


TTTGGTGGCA 


AATCCTGCAT 


540 


CAGTTACTCC 


GATAACCATT 


CGTGGTCGCC 


CAATCGCTTG 


GCTGAGCTCG 


AATTGACTGA 


600 


ACAGTTCGCT 


AACAGGAACA 


TTATAATAAG 


AACTTTTGTC 


CTTGATTTTC 


TTTCTAGTAT 


660 


TCTCACTGGC 


ATCTTGCGCA 


ACAAACACAA 


ATTTCGCTTT 


GTTTGCACGA 


ATATCTTTAA 


720 


GGGTTAATTC 


CTCGCCTGTG 


ACTAATTTAC 


CGGCGCGCAT 


TGCTAAACCT 


AAAAGGTTCA 


780 


AAATTTTGGT 


TTTATTTTCC 


ATCGCCAAAC 


AACTCTTTGC 


GGGCTTTTTG 


ATGTGTTACA 


840 


TAATCCAATA 


GTTCTTGATA 


AAATTCATCA 


CTTAATTTTG 


TTTCTAAGAC 


ACGATCAAGA 


900 


ATTTTTTTAT 


CCCAAGCTTT 


TTGGACTTCC 


GCAGGATCCA 


AATCAACGTA 


AGCGCCACGT 


960 


CCAGGTAATT 


TACCAGTTGG 


ATCTAAGGCC 


ACCTCGCCTT 


CTTTGGAGCG 


AGTAATTCTG 


1020 


ACTAATTCTT 


TTTTCGGTTT 


CATTTCGCCT 


GAAACAACCG 


ATTTACGCAT 


TGGAATTTTT 


1080 


CTCTTTTTCA 


TTTTTTCACC 


CTCCAAAAAC 


GTTATTCTTG 


TTCAGGTAAT 


TCTTCTGTAA 


1140 


TTTCTGCTTC 


AACAACTGCT 


TCTTCAGCAG 


CCAAGTTTTC 


GTAATCATCT 


GCCGTCATAT 


1200

CTGAAtGGAT AATTGCTTCA TCCATTTCTT CTGCCACTTC GGCAACTTCT TGATTGGCTT 1260

GTTGTTCATA AAATTCAGCC ATATCAGATT CTGATTTGAT ATCAATTTTA TGGCCCGTTA 1320 

ATTTAGCAGC TAAGCGCGCA TTTTGACCAC GtTTACCAAT TGCTAATGAC AATTGATAAT 1380 

CAGGTACAAC AACTGTACAT GCTTTGCTAw TTtCTGGATC AAAAaTAACA TCTAATACTt 1440

GAGCTGGAtT CaAGGCATTA CTAATATAAA TAGCTGGATC TTCGTTCCAT TCCACAATAT 1500 

CCATATTTTC GCCTTTTAAT TCATTCACAA TCGCTTGGAC ACGTTGGCCT TTTGGACCAA 1560 

CACAGGTACC CACTGGGTCA ATATTAGGAT CTGTTGAACG AACGGCTACT TTTGAACGAT 1620 

CGC 1623 
(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 



ATGCCTGCCG 


GTCGACTCTA 


GAGGACCCCC 


GCATGAACCA 


GACGTGACGG 


CTTTGATTCG 


60 


TCTAAGTGAT 


GTCTATGAAA 


TTCCTTTAGC 


GACCAATATT 


GGCAGTGCTG 


AAATTCTTTT 


120 


GAGAGGTGTC 


GAGGCGGGGT 


TTGCGGACTT 


CCGTGAGGTT 


ATCCaTGAAG 


GGGATCGTCG 


180 


ACCGTTAGCT 


TTTTAGAGAA 


CCTTAATGGT 


TATTTTGACT 


ATTTTGTGGT 


ATATTTTATG 


240 


AGGACTTTTG 


TCCAAATAAA 


ATAACAGAGT 


AGGAGACAAA 


GCGTTAAGTG 


CTAAAAGGAT 


300 


GGGATGTTGC 


CTTTTGGACG 


AAGAACATAA 


ACAGTGTGTT 


GCTAGAAAGC 


AAAGGTTCTT 


360 


ACTTTGCTTA 


TCATCAGAAC 


AACACGTTCA 


GCTTTTTAAT 


GTTCGCGGTT 


TTGTTTTCGC 


420 


ATTCGCTGCA 


TGGTTCGTCA 


TGTAGCAGCT 


CTTAGAGGAG 


ATGCTAAAAA 


TGACAAAGAA 


480 


AAAATTTGGC 


ACAAAATCCA 


TTGCGTTAAT 


GGGCGTACTC 


ATTGCGGTGG 


TCGTTGTCTT 


540 


TTCACGCTTT 


TTTGCGTATG 


AAACAACGTT 


TTTAAAAATC 


AGTTTCACGT 


TTATCCCAGA 


600 


ATCATTGATT 


GGGATGATTT 


TCGGACCTTT 


TTGGGCTGGA 


ATCGGCACGG 


CGGTGGCTGA 


660 


TGTGGTAGGA 


ATGTTGCTTT 


TTCCAAAGGC 


AGGCTATTTT 


CCTGGGTTTA 


CTTTAAATGC 


720 


CTTTTTGGCT 


GGTGCTATTT 


ATGGTTATTT 


CTATTATAAG 


AAAGAAATGA 


CATGGCAACG 


780 


AGTGATTTTG 


GCGACATTGT 


TGGTCACCGT 


TTTAATTAAT 


ATCATTTTAA 


CTCCTTTGTG 


840 


GTTAAGTTTA 


ATGTATGGCG 


TCAATCTAGC 


AAATTTTGCT 


TGGTGGGTAC 


CACGCTTAAT 


900 


TAAAACAGTT 


ATTTTCTTTC 


CTATTCAAGT 


AATTGCCACT 


TATTATTTAG 


GCAATAAAAT 


960 


TCCCTTTAAA 


CGTTTATTTG


GAAAGCCATT 


ATCTGAATTA 


GATCAATAAA 


TTTAAAATTA 


1020 


AAAGAGTCAC 


TTTTCATCAT 


CCCAGAAAAG 


TGACTCTTTT 


TTATTGTGCT 


TTTTGATTAA 


1080 


ACTGTGCTTC 


GTAAAGGTCA 


CGATAATAAC 


CCTGTTCTTT 


TTGCAATAAT 


TCGTCATGGG 


1140


TGCCAATTTC


AACCACTTGA 


CCATTATCCA 


TAACTAGAAT 


TTTTTCAGCA 


CTTTTGATTG 


1200 


TTGCTAAGCG 


ATGGGCAATA AGAAAACTGG


TCCGGCCAGT 


CATCATTTGT 


AAAAAAGCAT 


1260 


CCTGAATTTT 


TTGCTCGGTC 


AATGTATCAA 


CTGAACTGGT 


CGCTTCATCT 


AAAATAAGCA 


1320 


TCGGCGGATT 


ACTAATCATC 


GTGCGTGCAA 


TAGTCATTAA 


TTGTCGTTGT 


CCTTCAGAAA 


1380 


TTTTAACGCC 


TTGGGAACCA 


ATCTCGGTAT 


CTAGTCCTTG 


AGGTAAACGC 


ATGACGAAAT 


1440 


CAAAAATATG 


GGCTTTTTTC 


ATGGCTTCTA 


TAATTTCTTC 


GTCAGAGGCT 


TCAGGATTTC 


1500 


CGTACGTTAA 


ATTTTCCCGA 


ATGGAGCTAT 


CGAATAGCCA 


AGTATCTTGG 


AGAACCATAC 


1560 


CAAAACTTTT 


TCGCAAGCTG 


TCCCGAGTAA 


TTTGAGTAAT 


GTCGTGAGTA 


TCAACCGTAA 


1620 


TTTGGCCACG 


GTCGACCTCG 


TAAAAGCGCA 


TTAACAAATT 


CACTAATGTT 


GATTTACCCG 


1680 


CACCAGTTCG 


CCCAACAATG 


GCAATTGTTT 


CTCCTGGTTC 


AGCAATTAAA 


TTAAAGTTTT 


1740 


CGATTAAAGG 


TCGCGTCGGC 


GTGTAAGAAA 


AATCAACGTC 


TTTAAATTCA 


ACACGGCCTC 


1800 


GCGCTTCTTT 


CAAAACAATT 


GCGTTCGAGG 


CATCAGGACG 


TTCAACTGGT 


TGATCCATCA 


1860 


TATCAAAGGT 


ACGTTCCAAT 


CCAGCTAAGG 


CGGCTTGGAT 


TTGGTTGGTG 


ATTCCTGAGA 


1920 


GTTCAATAAA 


AGGTTTAGAA 


AATTGACTAG 


AGTAAATGGT 


GAAACTGGAA 


ATAACACCAA 


1980 


CTGTGACTAA 


AGGATTGCCA 


CTTAAAACTA 


ACAAGCCaCC 


GACTAATCCA 


ATCGATAAAT 


2040 


AAGATAAATG 


GTCGATAAAT 


CGAGAAAGTG 


GGTTGGTTAA 


GGAGGAAGAA 


AACTGAGCCt 


2100 


TCTGTCCTTT 


TTCATACAAC 


GTTTGATTCA 


ATGCTTCAAA 


TGTTTCTTGC 


GTAGCTTGTT 


2160 


CTTGTTGAAA 


GGCTTTAACA 


ATTTTTTGGT 


TGCCAACCAT 


TTCTGAAACG 


AAACCGGAAA 


2220 


TTTCACCAAC 


AATCGTTTGT 


TGTGCAGCAA 


AGTTTTTTTG 


TGAAGCTTTT 


GCAACAAGCC 


2280 


AGTTTACAAT 


AAAAATAATC 


GGCGTACTGA 


TAAGAACCAC 


TAATGTGAGC 


AGGGGACTTA 


2340 


GCCGAATCAT 


AAAGAGCAAA 


GCAACAACGA 


CGGTGGCCAT 


GCCAGAAAAT 


AATTGGTTGA 


2400 


AGACAGCCGA 


GCAAGCCATT 


GAAATATTGT 


CAATATCGTT 


GGTAAAGCGA 


CTGACGATAT 


2460 


TTCCGTGTGA 


CGTTTGGTCA 


TAATAACTTA 


ACGGTAATTG 


ATTTAAATGG 


GCAAAGGCAT 


2520 


CTTTTCTCAG 


CTGTGTGGTC 


GATAAATAAG 


ACACGCGATT 


ACCGAGACGT 


TGAATCAGCC 


2580 


ATTGACTTAG 


AACGGTTACG 


AGTAAAATCC 


CTGCTAATAA 


ACCTAAAATT 


TTGATGAGTT 


2640 


GCGCAnCATT 


tGACTTGTCC 


TTTACCCACC 


ATTGTATCAA 


CGGATTTACC 


TATATAATAA 


2700 


GTCATGAGGA 


CAGTCGAAAA 


ACCACTGAGA 


ATTCCTAGGA 


TTAAGGCGCC 


AATAATTTCT 


2760 


TTTGGATAGC 


GTAGTAAATA 


CGGCTTAAAA 


CGTGAGAAGG 


CACCAAAATG 


AATTTTTTTA 


2820 


GTTTGCATGG 


GTATCCTCCT 


CCTGTGACGC 


AACGATTTCT 


TGATACTCTT 


TAGAAGAAGT 


2880 


AAGTAATTCT 


TCGTGGGTGC 


CAAGGCCAGC 


GACTTTTCCA 


CTATCTAGGA 


CTAAAATTTG 


2940 


ATTCGCCGTT 


GCAATCGAAC


TCACGCGTTG 


TGAAATTAAA 


ATAACGGTTG 


TCTCTGCTCG 


3000 


TTCTTTTTGT 


AAAGCACGAC 


GCAGATTCAA 


ATCTGTTTGA 


TAGTCGAGGG 


CACTCAAGGA 


3060 


ATCATCTAAA 


ATAAGTAAAT 


GCGGTTTCCT 


AATCAAGGCA 


CGGGCAATGG 


TCAACCGTTG 


3120

TCTTTGTCCA CCAGAAAAGT TTTI7CCGCC TTCGTTAACA GGCGTGTCTA ACCCTTGGTC 3180

TAATTGTTCA ACAAATTCTT TACATTGGGC GATAGCTAAT GCTTCCCAAC AGTCTTCATC 3240

AGTTGCATTT GGTTTGCCCC A7TGTAAGT7 TTCTCGAATA GTCCCTGTAA ATAAGACGGC 3300

AGTTTGTGGA ACACTTGCAA CTTGTTGGCG TAAAGTAAAG AGCGGCCAAT CGCGCACATT 3360 

TACACCATTA ATAAAAAGGT TTCCIGCACT CACATCATAA AATCTGGGAA TGAGTTGGGT 3420 

TAAGGTACTT TTCCCGCTGC CGGTAGGTCC TGTGATGCCG AGAATTGAGC CTTTAGGAAT 3480 

CGTAAAATTG ATATTTTCCA AAGCCAAACC GGCCTCTGGT TCATAGCGAA AATCTACATG 3540 

GTCAAAGGTA ATATCACCAA ATTGTGGAGA TGTTTTTGCT GATTCTGGTG TATCTGTTAA 3600 

C 3601

(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

GGATGATTTT CAAGAGGAGA ATCCTCAATT AGTCTTGATG GATATTAATT TACCTGTTTA 60 

CGATGGGTAC TATTGGTGTC AAAAAATTCG TGAAGTTTCT AAAGTACCAA TTATTTTTAT 120 

TTCGAGTCGT AGTACCAATA TGGATATGAT TATGGCGATG AACATGGGCG CGGATGATTT 180 

TGTCACAAAA CCGTTCCAGA TAGATGTTTT AATTGCTAAA ATCAATGCTT TGTTGCGCCG 240 

CTCTTATAAC TATTCCGATA CGGATAGTGA AGTGCTTTCA CATAATGGCA TTACTTTAAA 300

TGTGGACAAT GGACGAATGG AAATCaGAGG TGAAATGATC GATTTAAGTA AAAATGAATA 360 

CCGATTACTG TATCTTTTAA TGAAAAAGCA CGGAAAAATT TTAACACGTG AAAAACTATT 420 

GCGCGCTTTA TGGGACGATG AACGTTTTGT GGATGACAAT ACGCTGACGG TAAATATCAA 480 

TCGTTTACGA AAAAAAATTG AGCAAGCAGG GATTGCTGGC TATATTGAAA CCAAAGTGGG 540 

CGTCGGTTAT ATGGTGCCCT AGATAGGAGT AGACAATGAC CATTCTAAAG TATTTAAAAG 600

ATCGCTGGTT ATTACTAATT GGTTGGTTAT TTTTTCTGTT CTTAACTTGT TTTATTTTAT 660 

GGTTGGCTCC TAATGTCCGC TTAGATTGGA CAGTGGTTGG CTATATCTTT TTACTGCAAA 720 

GTGTTTTTTT ATCGCTATTT TTAACAATTG ATTATTATTT AAAACGTAAG TGGTGGCTTT 780 

CGTTGGCAAC GGAAAAAGAG CCGCCTTCTT TACAAGAATA TTTGAATACA GCAGAAAAAG 840 

AAGAAGAACT ACTTGTCCAA ACGTATATTA ACGGTTTATT GCAAGAGCAC CAACAAACCA 900 

TGCAGCAAGC AATTAACAAC CAACAGGACC AAAAAGATTA CATTGACTCT TGGGTTCATG 960 

AAATCAAAGT CCCCCTAGCG GCTATCACGT TACTGGTACA ATCTGTCGAA GATGATATTC 1020 

CAGAGAAAAA ATATTATCTA TTGGAAAATG AGTTAGGTAA AATTGATGAA TATGTCGAGC 1080 

AAGTCCTATA TTATGCTAGA CTGGACAGC7 TTTCTCGAGA TTATTTACTC CAAGAATATT 1140
CTTTAAAAGA AATTGTGCAA TCTGTTGTTC GTACGCAAGC AAATTATTTT ATCCAAAAAA
GATTGCAATT CTCAATTGAA GGGGAAGATG AAGCGGTTTT AACCGACCGA AAATGGGTCA 
TTTTTATTTT CCGCCAATTA TTAAGTAATG CAGTCAAGTA TACGCCAGAA GGCGGCACGA 
TTACGGTCCT AATTTCCAAA AATAAACAAG GTATTTATTT ATCTTTAAAA GATTCTGGGA 
TTGGCATTCC GACACAAGAT CAACGACGCA TTTTCGATAA AGGATTTACT GGTGAAAACG 
GGCGAAAAAG CGAACAGCAT TCGACTGGTA TTGGTCTGTA TT7GGCTCAT AGCTTAGCGA 
AAAAGTTAGG TCATGATTTA ACAGTCGAAT CGACAGAAGG GCAAGGTACG ACGATGACAT 
TGTTTTTCCC TTCCCTCAGT TATTATAATG AAGTGAAATA AA.--AGTAGGT CTATCAGCAA 
ACTAAAGAGG TGATAGGCCT TTTTTTATTT ATTAAAACTT TCTTGCTAAC TTCTCTTACT 
CTTCTTAGTT GTTTAGAAAA CGTTTTTATT GTTAGAATGA AACGAAATAA ATTTGTTAGG 
AGAATATTAT GAGTTTACTT ATCGCTTTAG TGCCCATGAT TGCGTGGGGC AGTATTGGTT 
TAGTCAGCGG TAAAATTGGT GGAAGCGCCA ATCAACAAAC GT7AGGTATG ACAATCGGCG 
CATTACTATT TTCAATTGTT GTCTTTTTTG TCATCCAGCy TACGTTGACG ACAGCCACAT 
TAATTGTTGG ATTTATATCT GGTTTATTCT GGAGTTTGGG ACAGAATCAA CAATTTCATA 
GCATGAAATA TATGGGCGTT TCAGTTGGCT TACCAATTTC AACTGGGATG CAGTTAGTTG 
TTAATACCGT TGCAGGGGCT GTTTTTTTCC ATGAATGGAC AAAAACAAAA GACTTTGTTG 
TGGGCTTTAT CGCTTTGGCC TTTTTAGTTT TTGGGGTGTA TT7AACAGCT CGACAAGATG 
ATGATAGTCA ACCGAAAACC TCAAACAGTA TGTTAGATTT TAATAAAGGG ATTCGGGCGT 
TGATTTTCtC AACAGTTGGG TATGGCGTCT ATACCATCAT TA7TAATGCC ACTGGTTTAG 
ATCCGTGGGG AATTATTTTG CCACAAAGTA TTGGTATGTT AGTGGGGGCT AGTTTCTTTG 
CCTTCAAAAA AGTCAAAGTT GATCGTTTTG TTTGGATGAA TA-GACGACG GGTCTGCTTT 
GGGGCTTAGG AAATATTTGT ATGTTATTAA CAATGCGTGA AA7TGGCTTA GCTATTAGCT 
TCTCTTTATC ACAAATGGGG aTTATTATTT CGACTCTTGG GGGCATCTTC CTTTTAGGCG 
AAAGAAAATC AAAAAAAGAA ATGTTTTATG TTaTTTnTGG TTGCATTTTT GTTATTCTAG 
GTGGTATTCT TTTAGGGTAT ATGAAAGCGT AAATAGTTTA TT7ACGCTTA TTTTTTTTAC 
GGGGGTAAaC ACATgTCTAT TTTCAGAAAA AAAGAAATCA CTGTGCCTGT CAATGACAGT 
GGTGGGATGA AAAAAAATCT TAAGACGATG GACTTAATTT TTCTAGGAAT TGGGGCTGTA 
GTAGGAACAG GGATTTTTGT TGTGACAGGT GTCGCAGCGG AACGCTATGC AGGTCCAGGG 
CTCGTCTTAT CATTTCTGGT AGCAGCAGCC GCGATTATTT TATCTGGTTT ATGCTATGCA 
GAATTTGCAT CACGTATTCC GGTCATTGGT GGGCCGTATG CCTACATGTA TGTGGTCTTT 
GGAGAAATTG TGGCTTGGAT GACTGGTTGG ATGATTATTT GTGAGTTTTT CTTGGCGGTT 
TCATCCGTTG CTTCAGGTTG GTCAGGCTAT GTCCATGGCT TTTTGGATAG CTTAGGTTTT 
TCTTTGCCGC AAGCGTTAAG TGGCGCTTAT AATCCAACAA ATGGTACATA 7ATTGATTTG
ATTGCCATGT TGGTCGTTGT AGCCGTAACT TTTTGGGTAT CTTTAGAAGC GAAAACAGCG 3180

TTACGTTTAA ATAACTTAAT GGTTTTCGTG AAGTTTGGGA TTATTTTATT ATTTGTTCTC 3240

GTGGGAATCT TTTATGTGAA ACCAACGAAC TGGCAACCAT TTATTCCTTA TGGTTTTTCA 3300 

GGCGTTTTCA GTGGTGCTGC TTTAGTTTTC TTTGCCTTTC TAGGGTTTGA TGCGGTCAGC 3360 

ATGGCCGCGG AGGAAGTGAA GAATCCGAAA AAAGATATTC CAAAAGGGAT TATTGGTTCC 3420 

ATTATTATTT CAACATTGTT ATATATTGTT GTTACGTTAG TTTTAACTGG GATTGTTCCA 3480

TTTACGGATT TAGGGGTCAA GGATCCGGTG GCTTTTGCGA TGCGCTTCAT TAATCATGGC 3540 

GCAATTGCGA CAATCATTTC AGTTGGTGCT ATTTTAACCT TATTAACAGT GACGATTGCG 3600 

ATGATGTACA GTTTAGCCCG TGTGATTTAT GCGATTAGTA AAGATGGATT ATTGCCGCAG 3660 

TTTATGAGCA AAATCGATGA GAAGCGGCAC ACACCTAAGA ATGCAACCTA TGTAGCTGGA 3720 

TTTTTAGCAA TGGTTTTTGC TGGGATTGTT CCAATGGAGA TGTTGGCTGA ACTTACAAAT 3780 

ATCGTGACAC TCTTTTATCT GATGTTTCTC GCTTTAGGAA TTATCAAATT ACGGACAATG 3840

AAAGAGAACC GCAAGCAGtG rATTtAAAGT ACCACTgTGC CGGTTCTACC ATTAATTTCA 3900 

ATCGTGGTTT GCGCCGCTCT GATGTTCCAA TTGTCTTTAG CTACTTGGCA AGTATTTGGC 3960 

GTAgcgTaGT GATTGGTTTG GCTATTTATT TCTTTTATGG ACGTCACCAT AGTATCGTTC 4020 

GTTTCGAGAC AAAAGAATAA AAATGAACGA AAAAAGAGAG GAATACTTGT CAAATAGACA 4080

GGTGTTTCTC TCTTTTCTTT CTGGCATTTT CTTCATTTTC AAGAGAAAAC CGACAAGAAA 4140

TATAGTTTAA TAATGTCTAA TTATGGTATA CTGTTTAGGA GTAAATCCTG AGGGTTTAGG 4200 

AAAATGACTA AAGGAGGTTT TTTCTGTATG TCACAGAAAG AAACATTTTA TATTACCACC 4260

CCAATTTATT ACCCAAGTGG GAAGTTACAT ATTGGGAATT CATATACAAC AATCGCCTGC 4320

GATGCAATGG CCCGTTACAA GCGTTTAATG GGTTTTGATG TATTTTATTT AACAGGTGTG 4380

GATGAACACG GCCAAAAAAT TGAAAAAAAA GCTGCTGAGT TGAATGTGAC ACCAAAAGAG 4440

TACGTTGATA AAATGGCAGC AGATGTTCAA AAATTATGGA AAACATTAGA TATCAGCTAT 4500

GATAAATTTA TTCGGACAAC GGATGATTAT CACATGGCAG CCGTGCAACA AATTTTTGAT 4560

CGTTTAGTAG AACAAGGAGA TATCTACCTT GGAGAATATG AAGGTTGGTA TTCAGTTTCT 4620

GATGAAGAAT TTTTCACTGA AACACAATTA GCTGAAGTAT ATCGTGACGA TGAAGGCAAT 4680

GTCATTGGTG GGaAAGCACC AAGTGGTCAC GAAGTCGAAT TAGTTAAAGA AGAATCCTAT 4740

TTCTTCCGCA TGAGTAAATA TGCGGATCGC TTGGnACAAT ATTACGAAGA ACATCCAGAA 4800 

TTTATTCAAC CAGAATCTCG TAAAAATGAG ATGCTAAATA ACTTCATCAA ACCAGGATTG 4860 

GAAGATTTAG CCGTTTCAAG AACGACTTTC TCTTGGGGAA TTCCATTAAA AAATGATCCA 4920

AAACATGTGG TTTACGTATG GATTGATGCG TTATCAAACT ATATTACAGC ACTAGGTTAT 4980

GGcTCAGAAG ATGATTCATT ATTCCAAAAA TATTGGCCTG CGAACGTGCA AATGGTTGGG 5040

AAAGAAaTCG TGCGTTTCCA TACGATTTAT TGGCCAATTA TGTTAATGGC CTTGGATTTA 5100 
CCATTACCGA AAAAAGTCTT TGGACATGGT TGGTTATTGA TGAAAGACGG AAAAATGTCT 5160

AAATCTAAAG GAAATGTTGT TTACCCTGaA ATGTTAGTAG AACGCTACGG tCTAGATGCG 5220 

TTACGTTATT ATTTATTACG AGCAATTCCA TTTGGTAGTG ATGGCGTCTT TACACCAGAA 5280 

GATTTCGTTT CTCGCTTAAA TTATGATTTA GCTAATGACT TAGGAAACTT GTTGAATCGA 5340

ACAATTGCGA TGATTAATAA ATATTGTGAT GGCAAAGTAC CAGCCTATGC TTCAAAAGTT 5400

ACCCCATTTG ATAGTGAACT ATCGACAACT GCGGCGAATG TTATTGGAAA ATATCATGAA 5460

GCAATGGAAA AAATGGAATT TAACACGGCC ATTGCAGAAA TCTGGACGCT AGTTTCTCGT 5520 

GCCAATAAAT ATATTGATGA AACAGCGCCT TGGGTATTGG CTAAAGAAGA AGAGAAACGC 5580 

AACGAATTAG AAAGTGTCAT GATCCATTTA GCGGAAAGCT TACGAATTGT GGCAATCTTA 5640 

TTGCAACCAG TCATGACAGA AACACCAGGG AAAATTTTCG AACAATTAGG CTTAGATCCA 5700 

GAAACAATGA ACATGGAAAA TATTCATTTT GGTGAGTTCC CAACAGACGT AACAGTAACA 5760 

AGTAAAGGAA CACCTATTTT CCCACGTTTA GAAATTGAAA CAGAAGTAAC GTATATTCAA 5820 

AAGAAAATGT CCCAAAGCGA AAGTGCTACA GAAGAGGATA TTAAATGGAA TCCAGAAGAA 5880 

ACAACGTTGG TTTCAACGAA AGAAAAACAA ATTAAGTACG ATGATTTCGA TAAAGTGGAA 5940 

TTAAAAGTTG CCGAAGTTAT TGACTGTAAA AAAGTCAAAG GAGCAGATAA ATTATTGCAA 6000 

TTCCGTTTAG ATGCTGGAGA TGAAAACCAT CGTCAAATTC TTTCAGGTAT TGCTGAATTT 6060 

TATCCAGATC CAGCTGCGCT AATTGGCAAA AAAGTAGTCA TTGTGGCTAA CTTAAAACCA 6120 

AGAAAAATGC GTGGTCAAAT TAGCCAAGGA ATGATTCTTT CAGCTGAATC ACCAGAAGGG 6180 

AAACTTCAAA TTGTTGAAGC ACCAAAAGAA ATGCCAAATG GTGCAGGAAT CGCATAAAAG 6240

CATAAGTGGT AAAGATAAAA GGTGAATTGT AACCTTTTAT CTTTACTACT TTTTTTGTAT 6300 

ACTAGAAGAA AAGTGATTTG ATGTGGGGGA AGTTTGATGG CTTATCATAA ATTTAAATTT 6360 

aTGAGCmACG AAAAAATGAy CCTAAATGGC GTCCGTATTA TTTAGaAGCt ATGAGAGTAA 6420

GGGAACAAGC ACGGCTAACA GTACAACGTA GAAAAAGAAG GGCAAGAATT AGAACCTTTT 6480

tAGTGGTTGC GATACTGGTT ATTTTAGTTT TTCTGTTATT GGCTCATTGG ACATATTCAC 6540 

AAACAAATGA ACTCTCTTTC TCCAGTTTTA TCGACGTCCT AAAATCGAAA ATCAACGAAT 6600 

TTTTAAAtTA AAAGAaTCGA AGAAAAGCCT AGAAAAAATC ATTTGCGATT TTTCCTAGGc 6660 

TTTTCTATTT TAGAtTTCAC TTTTTTAGGT ATTCgTkTTT TACCAAAACG GCCTTTCGTT 6720 

AGGCGACTTG GTAAAATGAT CAGTAAGAGT GAAAAAGGGA CACTGATAAT TTGCCACGTG 6780 

AGTAAACGGC CTAAGCTATT ATTTTTATAG AGCCACTCTA ATATAAAATG CAGCCCTAAA 6840 

ATACTTAACA AAGCCAAGCC TGTCAGGAAA TAAGGCGTTG ATACCCAGTT GGTCGTTTTC 6900 

AAGAGACTAA AACCTAAAGG AACTAGATAA AAGTAAGCAC TTGTTAAAGT AATACGCCAA 6960 

CTTTTCATAA CGGTGCCTAA ATTAATTATA AAAAAAAGTA AAGATAGGAC TAAAGCGAAA 7020 

GGTGGAAACA AAGCCACTAA CGCCGAATAG ATGGTTCCCC AGACGAGCAT TAAGGGACCT 7080 
TTAACAATAG CGGCAATAAG TGTAAACAAG CCAATTAGGA GTAACTGCTG TGTCAGTGAG 7140

CCAAAGGTTA GCAAGCCAAA TAAAATAAGC AGTAGCCAGT GCCaAATGGT TTGGTTTTGT 7200 

TGTTTTCGCT CAATAGTTAC ATTTTTTTGG ATAGAGGTTT TAAAAAAATT TTCTAAATCA 7260 

GGTGATTGTT TCACCTTCTC ACGCTCCTCT CAACTTAAGA GACAACAAAC GAGTAGGTCT 7320 

GCTAATTAAA AAGCAGTGCC TACTCGTCTA TTATAAGTGG TTTTTCTAAA AATAAAATCC 7380 

GCCTTAAGTC TCAGTTACTG GGCTGCCAAG TAAGGTGTAA ACCATTCGGA AAGTGTGCCG 7440

GTATAAGGTA AAACGACCCG TTTCATTCCA CGAGAAATAG CAGTATACmG GATTTTTTTA 7500

TCTCGTTTAT TGTCGGAATA ACGATTAACA TTTGGGTTAT GAAGAATTAC ATTATCAAAT 7560

TCTAGACCTT TTGCCATATC AATTGATAAA ACTTGCAGTG ACGGATGCTC TTCCAATGAA 7620

AGAGAAGCAC AGAGTTCTTT TGTTAATTGT TCTTTTTCTG CAAAAGTTTT AGTAATAATG 7680

GCGGTGCTTT CTTTCGAATC AAAGGTTGTA ATGGAAGTGA CGATTGCTTT AAGATAGTCA 7740

GACGCAGAGT CACAAGGAAT AAATGTCGGC TTTTCTCCAT CATGCCGGAT GGGCACAATG 7800 

GTTAATTGCT CTGGTTGAGT GCCTAATGTT TGGAAGAGTC GTGTAATTTC TTTACTTGAA 7860 

CGATAGCTGT TTAATAATTG ATAGGTTCCA ACAGAACGGC TGGTTTCGTT AACCAATAAA 7920 

TCATGGGcTT CAGAAAATTC AATGGCTGTA TTAAAAATGG CTTGATTTTC ATCACCAGCT 7980 

AAAGTAAAAC TAGCTTTAGG GAAAAGTGTC AGCAGTAATA ATAATTGGGC TTCTGTATAA 8040 

TCTTGGACTT CATCGACTAG GATAAAAGCC ATTTGCTGGT TGGCGAGGTT TTCAATAAAT 8100 

GCATGTTTCA CCAAAATAAG GGCCACAATT TCATCGGCTG TGAAAACAGC TGTCTCTTTT 8160 

TGATACGATT TTTCAGTGAA TAAAGAATAA AACGTTTCAA AAAAGGCCCA TTGATCTAGC 8220 

CAAGCATCAT TTTTAATAGC GGCAGTGATA GAGCGATATT TTTTCTTCAA ACGTTTTAAA 8280 

GCAAAATCAG CCAGATGTTT TTCATTATCT TCGGTAATCA CTGTGCCAAA ATAACGAACT 8340

TGTTCAGCCT CTGTTAAATC TTGGaATTGA TCCAACATTT TTTtAGAATG GGCTTGTTTT 8400 

AATAAGTACC GTGTCCAAAG ACTGGTTAAT TTTTCtTTTG TGGCTGAAAT tCGTkGGCGA 8460

nTACtCArGT CaGCTGGTGk yTCyTGaTaA AGTTTaAAAA TgTTTCTGAA GTAAATAAGG 8520 

TCTGCTTyTT AAAAGTtAAA GGmCGGAwAA AATGTGGCTG CACAACCGCT TTTTCAGGAT 8580 

TGCTTAAAAG AAACGTTGCA AATTCTTTTG AACGAATAAC GTGCTCTTGC TCCGTTGCCT 8640 

CAGAGCGAGT AATGCGTTCG AAATAGTCCG TTTCTTCTTC CaTTGGCCAT GATTCTTTGG 8700 

TAGTGAAGCG CAAAAATTGT AAAAGAGTCA AATTCAAGGG ATTTCGTTCG CCTAAATTCG 8760

GTAAAACTTG AGAAATATAA TCAATAAATG TTGAATTTGG TGAAAGTAAT AAAATATTAT 8820 

CCGCTGTAAT TTCTGCTCGA TGTTGATAAA GTAAATAAGC AATCCGTTGC ATAATCGCAG 8880 

ACGTTTTCCC ACTACCAGCA ATCCCGTTGA CTAGTAGGAG TGGCTGCGTT TCGTCACGAA 8940 

TTATTTGATT TTGTTCTTGT TGAATGGTTG TCGTAATGTC CTTCATATAA GGAGAAGCAT 9000 

CTTCTTCTAA AGAATGCAAT AACACATCAT CTTGAATCGC AATGGTTGTG TCAAAATAGT 9060 
TGATTAATTG ATCTTTTTCA ATAATTAACT GTCTTTTCAA ATCAAGTTGT ACAGGAATGG 9120

TTTGTTGATG TGCCGTGTAG GCTGTTTTCC CTAAAACATT GTCATAAAAT AACGAAGCAA 9180 

TCGGTGAGCG CCAATCGTGA ATGCGCGTTT CTTCTTCTAA AGAAGTAAAA TCGTTTACGC 9240 

CAATATAAAA GTTTTCTGGC TCTTCAGGGC TGTCTTCAAA AGTCACGCCA ATTTTGCCAA 9300 

AATAAGGAAC TTGTAATAGg CGTTTCACTT TGTCTAAATT TTTAGCCGTT GCTTCATTTT 9360 

TTAAGTTCCA CTGATCAATT TCACGGTTCT TCATTTCCAA CATGGCAAAC GTTTCTAGAT 9420 

TATCGGCATA ACTATCAAAA TTCAGCTTAG AATCGCGACC AAATTCCTCT AGCATATGTT 9480 

TGCCATCTTT CATTGTGTTC GCCAATTGCT CTTCGTAAAC TTTTTTAGTy CGtTGCAGCT 9540

CATgATAAAC CGTTGTTAAA TGCTGTTGTT CATAAATTTT TTCATCGGT 9589 
(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7992 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 



GGTCCTTATC ACCTTTACCT TCCAAATAAA TCAAAATAGC GTACGcAwAw CTTgGTCGAC 60

GTTGTTCAGT AACTTCAAAC AGTTCkTCTk TCAAATTTTT TAAtACGTCG ATCTGCTCaT 120

TAACCCcGAT TGAAATCATT AGTATCTATT AAACCgTTAT CCATTTCAGA TTGCCTTTTT 180

TCAACCAATG TTTTTAAATG TAGTTCTATA GAAGCTGTTT GAACCATGCT TCTAATTAAA 240

TCTTCTGCTA ACCCTTCACT AGCTTGTTTA TTTTCTGCCA TTGTAGCTAC CCCCTAATTT 300

CTATGTGAAA ACTTGAAAAA TTCTGCAACA GTTGCTTTAA ACACTCCATT GCCATATATT 360

TTTATTTCTT TTAACCCACT CGACAAAATA ACGTTTTGTT GCCATGGCTG TAAATGTGGC 420

AATATACCTA AAAGATTATA ATGCAATAAT CCATCGTCAC TTTTAGGAAA GACACCATTA 480

CCATCTATCC AATTTGATGG ATACTCTAGT TTTATTGCAT CTGTGATATC CACTCCACTG 540

ATTTCCAATC GGAATTCGGT GGCTGTTGAA GGAAAAGTTT GAATGCCAGC ACTCAGACTG 600

TGACTATGAG GATTTGGATT TAATTCATGA GTATGTGGAT TTGGGCTTAA TGAATGGCTA 660

TGGTTATTTG GTGGAATATT CGTGCTACCT ATTTCTAATC TGGTATTGGT TAATTTTAAT 720

GATGCATTTG TAGTTTCTCC ACCTTTAACA GGCATCTCAA ATCCAGAAAT ACGAATTTGA 780

AGCTCCATAA CTTCGTGATA TACAAGGTCC TTACTTACAG GCAAATAAAC AATGGCTGGT 840

GTATTTGAAT CCCCTTGAGC AGTATATTCT TGACCATAGA TTTCAGTGAC CTGATGTCTA 900

CGTTGCATCG TATCTTGTCG ATTTTGTTTA CGATTGTTTT GAGTTCTTTC TGCTACTTTA 960

ATCAATGCAT CATCATAATT CATCATCGTT CATCTGCCTT CCGCTCAATT GATAAATTTT 1020

TACTCAAAGT CAGTCTATTT GTTTCTAAtC CTGTTTCATC AAAATTGTAA TCAATTTTCA 1080
AAATATAGTA CCAATCACTC ATCTTcATGA TtTCTTTTTG ATAATCAGAA CATCCTTCTG 1140

TTATTAATTT ATTGTTATCA TATAGCAAAC GAATTTGATC TCCGACATTA ATGTCAGACG 1200

GTAATyCAGT TGTTGTGATA TCaATGTAGT ATTTTCTTCT TGCTTGTTTT AATCTTTTAA 1260

CCGCTGTTTC ATATGCAGTT CTAGTTGCTT TATTTCTATC TTCATCACTA ATTTTTTTAT 1320

CATTTACTGC AAATGGTGCT AAATCATTAA ACGAATAACT CGCTTCAATA GATATATTAC 1380

TTTCATCTCT TACACTTTGT TCATCAATTA CTGTGTATTC AACaTTATTA TTAGAAGCAA 1440

TCTTTGCAAG ATTGATATAA TCATAACCAC GTTCGTTATT GATACCTTTT CGAAGAACAC 1500

GAACTGGAAA TCCTTTTATC TGTGTATGTG GTTCTAAATA AACTTCACGC AAACTCATAC 1560

TAGACATACC ACTATCTGAT TTTTCTCCAT ATACAGTAGC CATATTTATT ACTTGATCAA 1620

ACTGATAGTC AATTTTAGGT TCGCTAATTA AACGATATGG CGCATTTCTA ACGGATGATA 1680

TCTGAACAGG TTTAGTTTCT CCAAAAGTTC CAAATTCTAA TTTTCTACCA AAATTAAAAC 1740

CAACTCTCCA CCAAATATCA TCTGTTAATT CACATGTTTT TGTTAATGCC TCAAGTTTGT 1800

TTTGTCTACT ATATACATAG TCTATTCTTT TTTGAGAAGA GTTTTGTAAA TAATCTAAAT 1860

GCCATTTATT TGAATATCGA AAATCAAGAG TACTAAAAAT ATCATTAATT GTTCTATTCT 1920

TACATGCTAA ATTGGTGGAT ACTTGTCGGA ATTCCCATTC GCTGATTATA TGATCAAGTT 1980

GTATAGTTAT TGTTTCAAAT TCCTTATCGA CATCATAATC ACTTGTAATC CCATGAAAAA 2040

CTTTATTGTT AATATATATT TTCACTTCTT CTCGACCTTT AAAATATTCA CGATAAGTGA 2100

TTGGCAAAAC GAGTGAAGTT GTAGGCACAT ACATCAACTC ATTACTCCAA GATAATGTAT 2160

TTAAAATCTT ATTACCTCTT GCAACAAGTT TTCCTTTTTC AAAGATTTCG AAGAAAAATT 2220

TTCTATTATC TTCAGCGTAA TTATCTACTG GGGCATTTTC AACTGGTTCA AACATTAAAA 2280

GGTtGTCAAA ATACAACTCC TCTTTTTCTT GCCCGTCAGT TCGAATACCT TTGATAAATA 2340

TCGCTTCaTT TGCTTTCaGC GTAAACGATA ATTCAAATCC TGAcTTTTCG ACATCTTCAr 2400

CATTAGGATA TTTTTCTTTT ATATCATTAC GAATTTTTAT AGGAATGTTA TTGAATTGGT 2460

AAATTAATTT CGCTGTTTCA GCATTATAGA TTTCTATATT TTCCATTGGT AAGTCACTCG 2520

AAAAATGCCA ACCTTTTACA AAAACTTTTC CATTTTTAAT ACCTAATTGA TCAATTTCAC 2580

CAACATTTCG TAATTTTTCC TTAGTATTTT CTGGAGTTGG ATCATCATAA TCCCCTTCTT 2640

TATAATTCAG TAAGAAGTCG AACCATTTTT TTGCAGCTGT TATACGTGCT TGTGTTTTCT 2700

CAACTCCTGC ACGTTCAAAG TTTTTCAGAA ATGCTATTGT TAATTGATCG ACATTTGTAC 2760

CCTTAATAAA ATCTTTTGCA TCAGTCGGAC TAACAACTCC AATCCATTGA CCATGAAACA 2820

TACCATAGTG AATTAATTTA GCTTGTGTTG ACGCCATTTT ATAATCACCA TCAATGCCCG 2880

CTTCTCTAAG TAAACGTTGA ACATAATTCG CTCCGCTTTC CCATGCATTC GGACTTGTCC 2940

ACTGAACTAC TCCATATCCA CCACCGCCGC CAATTTCAGA AATATCGGGG CGCATTCCAC 3000

ATTCAACATC AGCATTACCA CATATAGCAG CAATAGCTTG TATTGTGTAG CCTTCTTTTG 3060
AAAGCACATT TCTAATATCT ACAGCAACTT GTTGCTCTTT ACTTAAAACA CGTTTTGGTG 3120

GAGGGGTGGG ATTATTTCCG CCATCATTTT TTGGATCTGT ATAAATAACA CTTGGCATTC 3180

CACCATTATT TTTTAAAATA AACTGATAAT TATCTATGGA AATTCCGTTA GCTCCGTAAT 3240

TACAATGAAT CACTGACGAA CTGTCAATAA ACATGCCAGT GTGTCCTCCT GCACCATCGC 3300

TAGCTCCACG AACACCCCAA ATAAATATAT CTCCACGTTT GGGATTCGGC GTCTGTTTCC 3360

AACCAATACT TTCTAATGAT CCAAACAAGG TTTCTGTATT GCCCATTGTA CCACTTGGCA 3420

AAAAACCGCC TGCGATCAAT GATAAGTAAA CTGCGCTTGA GCAGTCGTAA CTATTCGGGC 3480

CAAGACGAGC ACTCATAGAG TAAACAACTT TTCCCTTTCT CGCTTCAAAC CACTGAATTG 3540

ATGCATCATG ACTAGGCATT AATTACCACC TTCCTTTGGC CTTTCCTAAT GATTATTAAA 3600

TATAACGAGG TGGTTTAGGT TTCCCATTAT TTTTAAATTC AATTTGTAAA TTGGCTCTAT 3660

CATCAGCAAC AGGGCCATCA TTGAAGAATT CTAAAAATTG TAACCAACCA CTAGAACTAG 3720

AAATCCCCAC ATTTCCATTA TTATTTAATT TTAGTGTTTT ATTAATATTG TCATTAAAGG 3780

ATTTAGTAGT ATCATGTGAT TTTGTATATA TCTCTTGTTC TCCAGCACGA GTTAAATAAA 3840

GGTGTAAAGT AAATGGTACT GACCTAATGT TATTTGTTTT ATATGATAAA TTTTGGATTA 3900

CTACATCACG AACTTGCCAA TCAAAGCTAG CATTTGATGA TGGTCTCATT CCAAAATTAA 3960

CAGTCCCTCT TATTTCTCCT TCTCCTAAAT ATTGGGTCTG TTCAGCGTCT CCCCAATCTT 4020

TCCATCTAAA AATGAAATCA CCATTACCTC GATTAATTGA AACACTCATC CCTTTTGTTG 4080

ATTCTTGAAT CCCATATCTA GTGTTAATTT CCCAATTCTG TGAAGCTATA TTATCGATAT 4140

CATCTTCCAT TTTTGCAATT TTTTTCCAAA TTTTTTCAAT CTGTTCTTCA ATCCAATGAA 4200

TCTTTTTCCA AATTCCACAA ATTGCACAAA TTGTCATCTT CTGTAGATTA TAAAGTGTAG 4260

ATACTAATTT TTTTAAATAT CCTTTCCAAT CACAATCATC ATGAATATCG AGAGCACTGT 4320

CGGCTCTACC TAGTAAACAA TCATTTAAAT CATGCATATC TTGACAATCG TTTTTCTTGC 4380

CATTAAATCC TTTATCATCA CCTAAGCTAG CACATTCTTT GTCGCCAATT TCAAAACAAC 4440

AGTCAACATC TATGCCCCTA TTCTTTAAGT CCTTACAAGA TTTGCAATTT TTAGACATGT 4500

AAATCATCCT TTCTAGTACG TAATTTTATC CTCTTTGATA AATACACAAA CTGGCTCACA 4560

GCAATCGTTT GTTTCTACTA GTAAGCTGTT GTACCCATGC TTTATGGACC ATCCAAAAGT 4620

GTTACAATTC ATCAATTCAA GATTTTCAAT CGGTAACTTA TCAAAATCGC AGCAGTCATC 4680

TTCACGATAG AAAACATCTC CATTTGATTC AATAAGTAAC TCTCCATCGT ATATACCTTT 4740

TAAACGCATT GCGTTACCAT TTAAATGAAT AACTGGATCT TCTACTCTGC CAATAATAGT 4800

CATTGTGAAT ACATCAGTCT CCAAAACGGT ATTACTATAA AACTTTCCAG CAACATAGTT 4860

ATTGCAAAGC TCTTTCTTAC AAATTTTCGT TCCGAGCATC TTATTATATC CCCATAGATG 4920

ACGACCTTTA TTACAGTTAT AAATTATTTG ATAACCACCA CTACAATATT TCCAGTAATC 4980

TTTTTCTATA TCTTTTTGTT TAACACACAG ACTGTCTTCC TTAGTAAGAA AATCACATTC 5040
ACAAGAACAT TTTTGACAAT CTTTATGTTT AATATTATTT TTTTGACAAC TAATACAGCA 5100

TCCGTAGATA TCGTCCTGAC AAGTGTCTAT TTCTTCAAAT TCTAGACAAT CAGCGAAAGA 5160

ACACGCATCG TAAGGTTTCA AAAATGTTTT CTTTTCATCT GCATAGTGCC AAATTCCTTC 5220

ATATAATATA AAGTTCAAAT CGACAGAAAA AGTGTTTTTT TCAATGCTGT AAGGTTCTGA 5280

AAAACTTTCT ACAAAAGCAT ATGTCCAAAT TATTCGACGA TTTTCAATTG CCCATAGCTT 5340

ACCAATCTTA GAAAGCTCAC TATATAAGAA TTCTTTATAA AACTGTTTTT GGTCCATAGA 5400

AAACATAGAA TAATTTATCT TTAGTGTTAA TGCTAACTTT TGTTCAGTTG AATACTGTTG 5460

TATTTGTTTT GCATTAGTAT AACTACCATG ACCAAAACTA AATGCTTGAG TGGTAGTTTT 5520

GAATCCTCCA CTCAAATCTG CATCTTGAAT CATATCGTAT GCACCAATTA CAACATCTCT 5580

GAATTGTAAG TACCTTTTTT GAGGATAAAT TGTTCTGCAA TCATACATCA ATTTACACTC 5640

CTCTCAAAAA TCTATTTGCA CGCAAATTGG CAAAATTAGG ATTTCCAGTC ATACTAATAT 5700

TTTGGTTAAT TGTATTATTG TTATTTCTTG TATTTTTTGT CGTATTATAA TATCTATTCA 5760

TTACATTTGA GACAGGTTGA CTATTTCCAA ATCGATTAGT GATACTAGCT AGTGCTCCAC 5820

TTAGATCTAA GTTATTGATC CGTTCCATAA AGTCAATTCC GAAATGATCT ACTGCTTTTT 5880

TACGTTGAAC ATATTCTCCA GCAGTTAACA TTGCTGGAAC ACGATCTGTA CCTTTGGATT 5940

TAAAAATATG TTTCATAATG CTTCCACCGT TTGCTCGATA CTGAGGTATA ATGCCTCCAT 6000

TTGCACGTGT AACTCTTCGA GTCTTCACAT TTTCTGTTAC ATCAACAGTA AGACTAGGCG 6060

CTGTTAATGA ATTTAGATTG GTTTGAATAG AAGAAATTTT GTTATTAATA GATGTAACAT 6120

AACTATCAAT ATTACTATCT AAACCTTGTA GTGCATTTGT AAATCCTGAT TTCAGACTAT 6180

CCCCAAATCC TTTACCAACA TTATTAAACT CGGTATCTTT ATTTCTTAAA TTAGTTATTA 6240

AATCATCAAT AACTTTTTTG ATTTTAGAAG GAACATCAGC ATTTTtGAAT CCATTAATAA 6300

CTTGCTGTCC ATATGATTTC CCAATTGGTT CAAATTTAGA TTCTAGACCT TGTAACGTAA 6360

TAAGTAAGTT TTTAAAAGAT TCGACTAGCT CGGGTATACT ATCCAGTCCT TTTGCACTTG 6420

GAAATGTGTT CATTAGTTCA ATGATTGATT TCATTCGGTT AACGTTGTTT AATACACTCG 6480

TCCAATCAAA TGGTAATGCG TTTAATACAC TTAATTTATT TGATATCGTT CCTAACTTAC 6540

TAACAGATGA AACAATCATA CCTAACGTTT CGCTGCTTAT AACGCCTTCA GATATCCCTT 6600

GCCACTTACT AATGGCCATC TCTGTTGCGA CATATTTAAT ACTTTTTATA TTATTACTCA 6660

ATTCATCAAC ACCCAATGTT AATTGAGGAA TCTGATTAAT TCTATTTGCT AATGTTATCA 6720

GTCTATAAcT GTAGTGTCGA TTTTATCAAT GACATCTTCA CCAATCATCG TGTTTGAATC 6780

AGGAAAAGTA TCTATCGTTA ACCTATTAAT AACACCTTTA ACAGCGGTAA TTGTTGmATG 6840

GACTTTATCA TTTTCTAGTG TGATACTAGC TAATCCaTTA ACTCTATTTG CTcAATTGTA 6900

TAAAGTTTAT CAAAACTAAA TCCAAAGCTT TTATAAAGTC AGCTTTGACT ATTCCTTCAT 6960

TGTAATCTCG CCAAGAGCTA GCCTGAAACA TTCTTAATGC CCCTTTGGTA CTTGCAATAG 7020
TGGAATTTAT TTTGtCAGTT TCGATATTTA TTCCCgAAAG ACCTGTTAAA CGATTTGCTA 7080

CTTTAGTGAC CCATAATACA SCTACAXCTA ATCCGCTAAG CATTTCCCTA TCAACAAATC 7140

CATCATTAAA ATCACTCCAT GaATCGrTTC TTAACATTTC GAATAGAnCC nTTTGGATAT 7200

TTGGAATAAC AGATGGTGCC TTACTGnTTC CTAATGGTAA AnCGAAAGTT CATTTAATTT 7260

ACTGCCCACT TTATTCAACC AGTAAATTGA TGTGTCTAAC TGGTCCAwTA ATCTTTTACC 7320

AACAATTCCT TCATTAAAAT CGGTCCAACT ATCTGCTTGC AATAATCTTA AAGCACCTTT 7380

GATATTTGCT ATTATTCCTT GTAACTTGCC ACCATCAAAT TCAAAATTAG AAAGTGTTCC 7440

AAATTTTTTA GCTATACTAC TTAACCAATA AACTGCAGTG TCTAGTTTAT CTAATGTTCC 7500 

ACGTTCAACT ATTCCCCCAT CGAAGTCGCC CCAAGATTTA ATTTGCAACA TACGTAATGA 7560 

GGATTTTAGA TTTGCTATCA TGGTTTGAGT TTTTCCTAAG TCAAAAGAAG TTTCTGATAG 7620

TTTAGACAGC TTTTTGTTAA TGCTATTCAA ACCATCCACA CTTTGGCCCG TCATATCTAT 7680 

AAAATCTTTT GGCACAACTC CACCAATGAA ATCACTCCAA CTTTCATTAG TTAGCATACG 7740

TAATGCACCT TTAATATTTT GTAGTACATT TTGCACTGAA CCCGTTTCTA ACTTACTTTC 7800 

TGaTAATGCT GcTAACTTTT TGcTTAATTT ATTTAACCAG AATATAGTAG TATCTGCTTG 7860 

ATCAATAATT TTAGATGAAA CgACTCCaGA TTTCGCAATA TCCCaATCaG CAGGrtCTAA 7920 

CTTtTCyAAw AATATCTGnn ATAACTGTAA CTTTACTTGG TAAATATCAG GATATTGGAA 7980 

AATTTAATnT TC 7992
(2) INFORMATION FOR SEQ ID NO: 258:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
AAGAAAACCT TGAATTTGTT CAAGGGCAAT TGCAACAAAA TTTACCAGGT TTAACAATCA 60 

AATTAAAATC ATTGCCTTnG CAAAATCGTT TGGATTTACA AACAGCAGGT AATTACGATT 120 

TAGCCTTTGG AACATGGACG CCTGATTATG CTGACCCCAT TAATTTTTTA GAATTTTATG 180 

ACTCGAAAAG TGGCTTAAAT ACTTCTGGTT ATAACGATTC GGCGTATGAT GCGGGTCTCC 240

AAAAAGTTAG AAAAGATTAC GCCAATGAAC CTGAAAAACG CTGGAATGAA TTACTAAGTT 300 

TAGAAAAAAC ATTAATTGAA AAAGATGCTG GTGTTTTACC CTTATTCCAA GGAGCCATTG 360 

GTTACTTGAA ATCAGATCGC TTACAAGGCT TGCAAGTATT TTCTTTTGGT CGCACAGTAT 420

CCTATCGTTT AGCTTATGTA GAAGAATAAA TTGACTTCTG AAAAGTCCTA TCTTTTGAAA 480

AAGTCTGCTT TCTGTTTTAT CCTTAAAGAA GGAAAATTAA CTAAAAAAAT CAACTATGAA 540

AGGTAGGTTT TTCAAATGTT TAGCTTTCTA TGGGCATTAA TTGTCGGAGG AATTATTGGC 600 
GCAATTGCTG GCGCAATTTT AGGAAAAGAC GTGCCAGGTG GCATTATTGG AAATATTATC 660

GTGGGCTTTT TAGGAAGTTG GGTAGCATCA CTCTTGTTGC CTTCTTXAGG. ACQ&QTCATC 720 

GGCGGGTTTC CAATTATCTC AGCGTTGCTA GGAGCGATTA TCTGTATCGC TATTTACTCT 780 

TTTATTGTAA ACCGTAAGgC GTAAAATTGT ATCAAAATGA TATCTTGAAA AGCATCATAA 840 

AACTTGTGAA GGTTGATCAA ACCATTACTT TTGATGTCTT TTCTCTCATA TTTTAAGTCT 900 

ATGACAAAAA TCTCTTTGGA TTTTTGTCAT AGACTCTTTT TTTAAAAATA TTTATTATGA 960 

GAATTTCCgC GTTAGACTGC TTATTTTTCA TTTATTATTC ATTTTTGATA AAGGAAGTTT 1020 

TGTTGGATAA AGTGAATAGC TcAAGGCTCA ACAAAAATAA rGArAAAATT AATATGTTAA 1080

GTTCAAGGTG AGTGAAATTC TGCTCTCATA ATACGCCGTT TTATTCATTA AACGATAGTT 1140

GAAAAAAATA AGTGCCCTTG TTATTCTTTG AATTAAACTA ATGGGGGCGA AAAGCATGGA 1200 

ATCGTTAAAA AGTAATCTTT TTTTGACCAG AGAAGCaTAT TTGGAAGCTT TTGAAACAAT 1260 

CcATGACACC TTTAAAAACG GCTATTTTAA CATCAGAAAA ACCTGGCGTG AATTTTGGGT 1320 

CCAGTGGTGC TGTCTATGAT CAGCAACATG CTGAAATGGA AGCGCTCATT CGACCATTAT 1380 

GGGGAATTGG ACCGTATTGG GTTACTTGTA AAGATGATGT TTTAAGAGAT GCTTATGTGG 1440

AAAAATTAAT CGCAGGAACT GATCCGTCAT CCCCAGATTA TTGGGGCGTT ATTGTTGATT 1500 

ATGATCAATA TATTGTGGAA GCAGCAGCAT TATCTTTAAC TTTATTGCTT CATAAGACAT 1560 

ATTTTTGGTC CTGTTTTTCT GAGAAACAGC AAGAAAATAT CATGGTTTGG TTAAATAAAG 1620 

CACTTGGTTG TAAAATTCCT AAAAATAATT GGACGTTTTT CAAAGTGCTC ATTCGTCTGG 1680 

CACTCGAGGG ATGTGGACAA GTGATTAACC AAGCGGAACT AGAGGAAGAA TTGGCGTTGA 1740 

TTGAATGCAT GTATTTAGGT GATGGTTGGT ATATGGATGG TAAAACTACG CAAAGAGATT 1800 

ATTATATTTC GTTTGCTTTT CATTATTATA GTTTGATTTA TGTCAAATTT ATGCGAGAAA 1860

GAGATCCTAT TCGTGCGAAA CGTTTTACAG AACGAGCCGT TATGTTCGCG CAGGACTATC 1920

TGTACTATTT TGATGAGGAG GGCGAAgcGT TACCTTACGG ACGTAGTCAA ACGTATCGAT 1980

TTGCTCAAGG AGCATTTTTC GCAGCTTTGA TATTTGCAGA AGTAGAGGCT TTACCTTGGG 2040 

GACAAATAAA AACGCTGTTA GCAAAACATT TACACTGTTG GATGCAACAG GCTATTTTCA 2100 

CTTATGATGG TCGGTTGTCC ATTGGCTATC ATTACGAAAA TTTAGTAATG GCAGAAGGTT 2160 

ATAACGCACC AGGCTCTCCG TATTGGGCTT TTAAAACATT TTTGTTGCTC GCAGTGGATC 2220 

AAGACCATCC GTTTTGGCGT GCTGAACCGC TGGCGATAAA ACGAGAAGAA AAGCGTTTGA 2280 

TAGAAAAGGG AAACATGTTA TTAATCCATC CTAAAATGAA AACGCATGTC CTCGGCTATC 2340

CTGCAGGACT ATGGCTTGAA AATCAAGCAC ATGCTGCGGC TAAATATAGT AAATTTGTCT 2400

ATTCTAGCCG CTTTGGTTTT AGCGTTCCCA AAGCTGGTGC GCGTTATGAG GAGGGGGCTT 2460

TTGATAACGT ATTGGCTGTT TCTCGTGATG GTGAATATTT TCGTGTTAAA GGGAAAGTTC 2520

ATGATTTTTA TTTAACAGAG GAGCAAGTTT ATTATAAATG GCTACCTTTT CAAGAAGTAG 2580 
AAATTGAAAC AACCATTTAT CCTTTTGGAG AATGGCATGT TAGAGTACAT GAAATAGTGA 2640

CATCGGTTCC ATTAAAAGTG CGCGAAGGAG GATTCAGTCT ACCGTTACTT GGAAAATTTT 2700

CAGAAAAAGA ACTTGGTAAA GAGTGGGCTA GAGTCGATTC AGGAGAATTG TGTTCGCAAA 2760

TAGTTGCAAT TGAAGGGTAT GACCACGCAG GTATTCTTCA ACCTGAACCC AATACATCGT 2820

TGTTTTTCCC AAGAACAAGT TTACCTTATT TGCAAAAAGA GCTACCAATA GGAACGCATC 2880

GCCTGATTTG TTTGGTAGGT GGTGTGACAA GGGAGGAATA AAAAATGATA AAAATCGAAT 2940

TGAAAAATAA AGACAATCAG ATTATTACTG GCCGAATCGA TCACGAGAAG CCAGAAGCGC 3000

CATTCGTGGC GGAGGGAGAC ACGTTAGCGG TATTAGCAAT TAAACAGTAC CTTTATCAAG 3060

TTGGCGATAA AATTGTAGTG ACTTATACAG GGGCTACCCC TTATTTGATG GTTCAATTAG 3120

ATGAAACTTT GGCTCCTTCA TTGATTTATC TAAAAGAAAA AGTTTGGGAA TATCAAATTC 3180

CATTAGAAGA ATCTGCTCGA AAAGCTTTTG TTGATACGGC CTTTCATTCA GCAAGGCACC 3240

AAATAATGGT TCGACAAGCG CACACGTTTG AAAAAGACAA TTATCAAAAT CTTTCTATGA 3300

ATACACATGA CCAAAAAGAA TATTCAGGAG CCTATCCTCA TGCGTATGCA AACGTTGAAA 3360

CGAGAGACGA TGCTGTTTTC TTTGCGAAAA ATGCGATTGA TTCTAAGTTT GGTAACCTTT 3420

CTCATGGCTC GTATCCATTT GCTTCTTGGG GAATTAACCA AAAAAAGGAG GCACAGTTAA 3480

CGATTGACTT TGGCAGACCT GTAGAAACGG ACTGGATCCG CTTGCTATTT CGGGCGGACT 3540

ACCCACATGA TAGCTATTGG GAGGAAGTCA CTTTAGCTTT TTCAGATGGT GAGGAACGAG 3600

TAGTTGCCAC AACGAATACC ACCTGTTTTC AAGAAATAAA GTTCCCAATG AAAACAACAC 3660

AAACTTTGAC ATTAAAATGC TTACGAAAAG CGCTGGACTC TTCTCCATTT CCAGCGTTGA 3720

CCCAAATTGA AGTATTCGGT CGGAATAAAT AACAAGAGGT CGGGACAGAA CCGTTTAGCT 3780

CCGAGAAATA AGAAGAAATT TCCGAAAATT GATCTTTGAT TTTTGGAGAC CAAGTAGGTA 3840

TTGTTCATCA TCGAAtCACT ACGTTCTTCG TGATTCTCAA CAATTTCGGC TTATTTCCGA 3900

AGGACCTTCA ACTCTTAGAG CTTTAGCTCT TAGAAGTTGT TGAGATCGGA GCAACGCGTA 3960

GTGTTGGTTC TGTTCCCGCC GTTTATCAGT TTTTGAGCGT GGAGCAAAAA TCCAAAGTGA 4020

TTTTTGTCCC ACGCTCTTGT TTTTAAGAGA AATAaTCAAG AAAGGTCGCT TTTTATTTTC 4080

TTTTaCGTTA TAATACAATA AGTTAATATG TTAACAAGTT AAGGAGTGAT GGAGATGTGT 4140

AGTCAAGaAA ATAaACAAAG ACTGATCTCT CTAGGAGAAA ATGTCACAGA TGAATGGATT 4200

CAAACGCAAT TAAATGAATG CGTTTCAAAA ATTGCTAAAA ATATGCAGCG TTTCGGAACA 4260

CAGTTTCCTT CAGCTTGTGC TACGAATGGT CGATATCGTC TCAAAGCTAA CGATGATTGG 4320

ACCAATGGCT TTTGGACAGG CATGCTTTGG CTGGCATATG AGTGGTCTCA CCAAGAAAAA 4380

TTTCGTTTAC TGGCAATGGA AAATATTGAC AGCTTTCAAC AACGATTAGA GGAGCATTTT 4440

GTTTTGGACC ACCATGACAT TGGTTTTTTA TATAGTCTAT CGGCGGGTGC TGGGTATAAA 4500

ATTACCGGTG ATGAGCGCTG TAAACAAGAA GTGCTAGCTG CCGCTGAAGT ATTGTCAGCA 4560
CGATTCCAAG AAAATGGCTC CTTCATTCAA GCGTGGGGCC ATTATGGTGA TCCCAAAGAA 4620

TATCGATTAA TTATCGACTC ATTAATGAAT TTACCGCTTT TATTTGAAGC AACTAAATTA 4680

TCTGGAAATA GGGTTTATGA GGAAATTGCC TCACGTCACT ATCATACGTT AATGAAAACG 4740

GTGATTAAAT CCGATGCGAC AACTTATCAT ACCTATTATT TTGATCCAGA AACAGGCAAA 4800

CCAAAACATG GAgcTACGCA TCAAGGACAT AGTGATACTT CCATTTGGGC ACGAGGCCAA 4860

AGTTGGGCAA TTTTAGGGAT TCCCCTAAAC GAAACATTTC TCCATTCCCA ACCATTTCCA 4920

GAAATGTATC CACAAATAGT GGAGGTTTTT CTAGCTCATT TGCCAGCAGA TTTAATTCCC 4980

TATTGGGATT TTGATTTTAA TGACCAAATG CCATCTGACA AAGATAGCTC GGCGCTGGCA 5040

ATTGCGGCAT GTGGTTTATT GGAAGCAGAC AAACTACAGG CTTTTCCTCA AGCGAAAGAG 5100

CTTGCCAAAG GAATGATTTA TCAATTAGGA GAATATTATC GTACGCAAAA TGATTCCGAA 5160

AATGAAGGAC TGCTACTCCA TGGTGTTTAC GCACATGCAG AAGGAAAAGG GATTGATGAA 5220

CCCAATTTAT GGGGCGATTA TTTTTACATG GAAGCCTTAA TGCGTTTAGC AAAACCATCA 5280

TGGCAAAGAT ACTGGTAAAG GGGAAAAATG ATGCAAACGT TTGAGATTAA AGAAGACTTT 5340

TTATTAAATG GACAACCAAT AAAAATTATT AGCGGGGCGA TTCATTACTT TCGAATGACA 5400

CCTAGCCAAT GGGAAGATAG CCTCTATAAT CTGAAAGCTT TAGGAGCTAA TACAGTAGAA 5460

ACCTACATTC CTTGGAATAT TCATGAACCA GAAGAAGGCG TATACGATTT TGAAGGAATG 5520

AAAAACATTG AAGCATTTGT CCGCTTGGCA GAAAAACTGA ATTTACTAGT TATTCTGCGT 5580

CCTTCGGCTT ATATTTGTGC AGAATGGGAG TTTGGTGGCT TACCAGCATG GCTTTTGAAA 5640

GAAAAAGGTG TGCGATTACG TTCGACAGAT CCTATTTTTA TGACTAAAGT TCGTAACTAT 5700

TTTCAAGTAT TATTGCCTAA ACTAGCACCG CTGCAGATAA CACAAGGTGG ACCAGTGATC 5760

ATGATGCAGG TTGAAAATGA ATATGGGTCC TATGGCATGG AGAAAGCGTA CTTGCGACAA 5820

ACCAAGCAGA TAATGGAAGA ACTGGGGATT GAGGTTCCTC TTTTTACATC TGATGGCGCT 5880

TGGGAAGAGG TTTTGGATGC GGGTACGTTA ATTGAAGAGG ATGTTTTTGT GACAGGAAAT 5940

TTTGGTAGTC ATTCGAAAGA AAATGCTGCC GTTTTGAAAA AATTTATGAC TCGTCATGGT 6000

AAAAAATGGC CGTTGATGTG CATGGAATAC TGGGATGGGT GGTTCAATCG TTGGGGCGAA 6060

CCAGTGATTC AGCGGGAAGG TACCGACTTG GCGAAAGAAG TAAAAGATAT GCTGGCAGTG 6120

GGTTCTTTAA ATCTCTATAT GTTTCATGGA GGAACAAATT TCGGCTTTTA TAACGGGTGT 6180

TCGGCGCGAG GTGCAAAAGA TTTACCTCAA GTCACTAGCT ATGATTATGA TGCTTTATTA 6240

ACAGAAGCAG GAGAACCAAC AGAAAAATAT TATGCCGTTC AAAAAGCAAT TAAAGAAGTA 6300

TGTCCAGAAG TTTGGCAAGC ACAACCACGC ACGAAAAAAC TGGGGAATTT AGGAAGCTTT 6360

CCAGTAACTG CTAGCGTATC CTTATTCGCA GTAAAAGATC AAATGATGAC ACCGAAGACC 6420

ACCACGTATC CATTAAGTAT GGAAGAAGCT GGCTCAGGCT ATGGCTACTT GCTTTATTCG 6480

TTTGATTTAA AAAATTATCA TCATGAAAAT AAACTmmAAG TGGTCGAGGC CAGTGATCGT 6540
TTACACATTT ATGTAGATGG AGATTTAGCT GCTACCCAAT ATCAAGAGAC AGTGGGAGAA 6600

GAGCTACTCA TTTCAGGGCA AACTGAAAAA GACACACTTG CGTTAGATAT CCTAGTGGAA 6660

AACTTAGGTC GCGTTAATTA TGGCTTTAAA TTAAATAATC CCACACAAAG TAAAGGAATT 6720 

CGAGGTGGAG TGATGCAAGA CATTCATTTT CACCAAGGCT ATCAACACTA TCCCTTAACT 6780 

TTTTCACAAG AGCAATTAGC CAAGATTGAT TACACAGCTG GAAAAAATCC ATTGCAGCCT 6840 

TCGTTTTATC AAGTAACATT TGAATTAGAA CAGCTAGCAG ACACGTATAT TGACTGTCGA 6900 

GGCTACGGCA AAGGATTCGT TGTGGTCAAT GGCCATCATC TAGGTCGTTA TTGGGaAATA 6960

GGACCCATTC ATTCGCTGTA TTGTCCCAAA GAATTTTTAC AGCAAGGACA AAATGAAGTA 7020 

GTAATTTTTG AAACAGAGGG AATTGAAATT GAGTACTTGA AATTTACGAA TCAAGTAATC 7080 

ATAGATTAAA AAAATAGAAG TAGAAGCTGA AATTTATTCG GCTTCTACTT CTATTTTGGT 7140

TTTGAAATAT TTCCAATGTT TATAACTAAT AATATATTCA GCAACACTAC CATCTGCGAG 7200 

ATAGGAATGT TTAACTTGTT TAACAGCGGG TTCACGAAAA CTTAATTGAA GTAAATTCAA 7260 

TAACTCAGCA TCTTCGGGAA AGACAATTTC ATTTGTTTCT ACAGAAGCCA ACGAGAAGAG 7320 

GTCTACTGAA AAATCTTTAC GGACTCGTTC ATACACACTT GCATAAGCTG TGAGGTCCTT 7380 

TGAGAGCGGC TCTTTAACAA GCTTTTTGGG AAGATGTGTG ATATGTACGA GAAAGGGTGT 7440

ATCTTCAGAA TAGCGTACAC GCTTGATTCT ATAATAAGAA GCGTTAGCTG GTAAGCCTAA 7500 

TTCCTTTAAA ATTTCGGCTT GATTATCTTC TTCAATTGAA AGGACTTTAA CTTTTTCAGA 7560 

GTCTAAAGAA TGCAACTCGA TATCAGAAAA TTTTACATTT TGAGAAACTT TTGATTTCGA 7620

AACAAAGGTC CCTTTTCCTT GGATACGATA CAAGTAACCT GCACTGGTCA GCTCATTCAG 7680

TGCTTTGACA GCTGTGATAG AACTAACAGC ATATTTTCGT TTAATATCGG CTTCAGAATA 7740 

AAATTTATCA CCAGGAATAA AGGTATGATT TTTTATTTCA CGTAAAAGGT CTTGTTTAAT 7800 

TTGTTCATAT TTAGGTAACA TGCGATTGCC TCCATTCTAG TAATCGAATG AATTAAGCAT 7860

AACATGTTAA CTAGCCTTTA GCAAaCACTT TGAAAAAGAG GAAAGACAAT TTTTATTTAA 7920 

gtTCAAAAAA aAATaATATA TTAAGAaTaG GTTTACGGAT AACTTAGTAT GTGTTATAGT 7980 

AAGTTCGTAG AGAACATAAC ATGTTAAGTA GGGTCACCTA CATCTAGGGA GGGAAATTCG 8040

TGGAAAACAT TGTACTAATT AGTCATGGTA GCATGGCTGA AGGCGTCAAA GTTAGCTTAG 8100 

AAATGATCGT TGGTCGGCAA GAACACGTGC ATACAGTCTC ATTACGACCA GATAGCGACA 8160

ATCTTCAATT TGAGAAAGAG CTAAATGAAA AAATGAAAGC CCTTAACGGT ACGACACTGA 8220

TTATTGCTGA TTTACTAGGA GGAACGCCAT GTAACGTGGC AACTAAGAAT TATTTGAACG 8280

TTGACGGAGT GGAAATTATT GCAGGGATGA CGCTCTCTGT TGTGATTGAA GCAGTGGTTA 8340

ATCAACAGGC TTCGATTAAA GAATTAGTTT GTTTAGCACA AGAAAATATT GTTGATGTAA 8400

AGGCTGGAAT GAATCAAGCG GAACaAGAGA TTTCAGAAGC CAGCAAAGAA AAAGAGTTAA 8460

GTAACTATAG TCAATATGCT GGAAAAGAAA ATATTGTAAA CACACGGATT GATGAACGGT 8520

TAATTCATGG ACAAGTAGCG GGAATTTGGT CGACAAGTCT TAGTACACAG CGAATAATTG 8580

TAGCCAATGA TGAGGCGGCT ACAGATCCTT TACAAAAATC ATCTTTGAGA ATGGCAGCGC 8640

CTTCTTCGAT GCGTTTATCT GTTCTTGGAG TAGAGGCTGC CGCTAAAAAT ATTCaATCaG 8700 

GAAAATATGG TAAACaGCGT TTATTTTTAT TATTTAAAAA TCCGAAAGAT GTGCTCCGCT 8760

TTATCGAAGC GCAAGGACCT ATCAAAACTG TAAATGTTGG GAATATGAGT TACAAAGAAG 8820 

GAGCACGCGA AGTAACAAAA AGTATTCAAG TTTTACCAGA AGAAGAACAA ATTTTTGAAA 8880 

CAATCGCTTC AAAAGGCGTC ACTGTGAcAG CCCAATTAGT TCCAAACGAT CCAGTCGTTG 8940

ATTTnATGAA AAAATTACGC GGCTAAAACT GTCAGACTAA GAAAGAATAT n 8991
(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 



ATAnAAGTAn ATAmAATGGC GATTATTGCT GTCTGAAAAG ATAAAAGAAT TGGTCGAACA 60


TCCATTTTCC 


CACCTAOl 1 1 


b I i AAL I I AA 




i iT^ 1 A A A A T AT 

y 1 Lnnnn 1A1 




120 


GTTGACTCTG TAAAAAATTT AAAAAAGCAT CTGCTGATTT TTGTTTCTTA GAGGCGGCAA 180

CTTTACCAAC TGGATAGATA ATTGGCTTTT TCAAAACAGC TTCAGGCATG GCCGCAACAA 240

TCGCTACTTT TGAATTGGTT GCTGCATCTG TCGCATAAAC TAAGCCAGCT TCTGCACTTG 300

CATTAGCTAC CCATTCAAGG ACTTCTGTTA CATTCGTGCC AAAGCTTGCG TGTTTTTCTA 360

CATAAGACCA AGCGCCTAAA GCTTTTAAGC CTTCTTCGGC ATATTGACCA GCTGGAACAC 420

TTGCAGGATC ACCAATTGCT AtCATTTGGG CTTTTTTTAA ATCAGAAAAA TCATGCCACT 480

TTGCTTGATC TTGGTTAGGC ACAATAAGAA CGAGCTGGTT TTCCAATAAA GGAACGACAC 540

TTTTTTTATT AATTAGTTTT TCTGCAACCA ATGCATTCAT TTGTTTTGTC GAAGCTGAGA 600

AAAATACATC GGCTTTTAGG CCTTTTTCAA TTTGCATCTG TAATTTTCCA GAACTATCAT 660

AGGTTCCAGT TACCTGAATA TCTGGATGCT CTTTTTCAAA GGCTGGAATA ATTTTCTTCT 720

CCATGACTGA TTCTAAGCTA GCTGCTGCCG CAAGGGTTAT TTCTTCTTTT TTTGTTGAAA 780

CTGTTTCTTT TTGCGGCTGA TTTGTACATG CAGCAATTAG GGCAACTAAA CTGAATAAAC 840

TAAGCAGGAA CACAGCCTTT TTTTGTTTAG ACATTTGAAC GCCTCTTTTC TTTTATAAAT 900

ACCCGCCATA ACTCAAGTCA ATAATTTCAG ATTTTTTAAT CACTTCTTTG GCCATTAATC 960

TTGGTAACAA CAAATCAAAA ACCGTTTTTT CTGAAAATAA AACACCACCT GGTAAGCCAA 1020

TGATCGTCTG CTCTCCTTTA TAAGCCAGCA AAAACATTGA GCCTGGTAAG ACTGGTGTCC 1080

CATGGGTTAC GATTTCAGCA CCTGTCTGCT TAATTGCTAA CGGTGTCAAA TCATCAGGAT 1140

CTACACTCAT ACCACCTGTA CAAACAATCA GATCAACCCC TTGTGCTAAC ATCTTTTTGA 1200

TCGCAACAGT AATTTTTTGC GGCTGATCAT CGACAATTTC TTGCTTGACA ATCGTAACTA 1260

AAGGATACGC GGAGAACTTG GCTTTTAACA CAGGGTAAAA AGCATCCTCA ACCAATCCTG 1320

TGTAAACTTC ACTGCCAGTA GTTACAATCC CGACCCGTAT TTTTTTGAAA GGTTTCACAG 1380

AGATTAGTGG CTCAGACGTG CTGAGCTGGC GTGCCTTTTC TAATTGGCTT TTAGAAATTG 1440

TTAGTGGAAT CACTCGAAAG GCTGCAACTT TTTCATCTTT CTGAACTTCT ATATTGGTAA 1500


PTTTAGTTGA 

^ x x x n>j x x un 


CAAGGCGAGA 


CCTTCAATTT 


CATTTAAAGC 


AATTAATTTT 


TCTCGATTGA 


1560 


PTTTTAAAAfi 


ACCTCTCGTA 


CTAGCTTTAG 


CAACAATTTT 


TCCTTCATGG 


ACTTCACTCG 


1620 


riPTCAPATfiA 


GTCATCTGTT 


AACAGGTTAT 


ACAAAAATGT 


CGCkGCCTCT 


TCTTCATGTA 


1680 




ATCCAAGGAA 


ACAAACAGAT 


GTTTTTTCCC 


TAATGATAAA 


AGAATTGGAA 


1740 


TUTPTTPTTP 


AACAACTATA 


TGTCCTTTTT 


TAAAACACAC 


TTCCkgGTCT 


TCCCATAGTC 


1800 




AtATCATGAC 


TCAATGGAAG 


GCCAACTGCC 


tCTTCTGTTC 


GAATTGTTTT 


1860 


PZiTPPTTTAT 


TCCTCATCAT 


GTATGATTTT 


TTTATTTGTG 


ATTAAATCCA 


AAGCGTGGGG 


1920 


rnnziapannp 

rtMrirVV- o u \^ 


AAPAAAAAAT 


GCAAATGTTC 


TGTTACACTT 


TTCGGGCTAC 


CAGGCAAATT 


1980 






TTTGAACAGA 


AACGCCTCGA 


CTTAACATAG 


CAAATGGTGT 


2040 


1 1 J. I X \sK*r\r\.\j 


PTBAHTTGAP 


GCAAGCCCTC 


ACTGATCCCA 


GGAATGATTC 


GCTCTGCTAC 


2100 


li p/^t a a rzPT a 

/ikrfO x rtriVjto X g\ 


ftPTTPTGGGG 


TCCGATCCCG 


TTCACTCAAC 


CCTGTGCCGC 


CTGTCGTTAA 


2160 


AATPAAATPA 


CAGGTCTTAG 


ACCACTCTTG 


CAATAGCTCT 


ACTAATTGAT 


TTTCTTCATC 


2220 


TrjPTAAAATG 

X OVJ X fWVrt X o 


GTTTGTTTGG 

Vj XXX xxx 


TTACCTGGTA 


GAGCGCAGGT 


AACTGTTCAC 


GAATCAAGGG 


2280 


PPPPHATTGG 


TCGACAGCAA 


GTCCTTGAAA 


GCAACGATCA 


CTTAATGTTA 


CAATGCCAAC 


2340 


TT T AAAC AAC 


GTCTAACTGG 


TCTCCTTTTT 


TAATATGACC 


TTCTGTCACA 


ACAACTGCAA 


2400 


AftATTCCTTC 

nun x j. x x 


TCTAGGCATG 


ATGCAATCGC 


CCACTCGCGC 


GTAAATCTGA 


CAGTGTTTAT 


2460 


ggpactcttt 


CCCAATTTGA 


GTAACCACCA 


ATTCGGAATC 


GCCTATTTTC 


ATTTTCGTTC 


2520 


PAAPAGGTAA 


TTGACGTAAA 

X X Vj^I^^W X * xi * 


TCAATCCCTG 


AGACGATTAT 


GTTTTCCCCA 


AAAGCACCAT 


2580 


TGPCAACCCG 


AGCTCCTTTT 

nu v x x^Vp* x x x x 


TCATTGAAAG 


CCGTTATTTC 


TTCAAAAGAT 


AACAAACTGA 


2640 


CTTGTCGATG 


CCAATTACCG 


CCGTGCGCAT 


CATTTTCTAA 


TCCGAAACCT 


TTAATTAAGT 


2700 


TGACTTCAGG 


AATTTCCTTT 


TTCTGTGTGC 


CTCTTCGCGG 


GCTAATATTT 


ATTGCAATAA 


2760 


TTTCTCCTGT 


TTTCATACTT 


ACCCTCCTAT 


TTGTGACATA 


AATCGTTGTT 


CTCCTTGTTG 


2820 


TTCCAAGAAA 


AGATGTTTTT 


CAGGCTTACA 


s~< r~~ f' t\ 7\ tv r* 1 r* 7\ 

GGCAATGCCA 


CTCTlbAJL AA 




2880 


TTCTGCTGTT 


TGTTCCATAG 


CTAAAGCTTC 


CTTTAACGAG 


TAGCCGTCTG 


CAGAATGCAG 


2940 


GCACGTTTTT 


AAATGGCCGT 


CTGCAGTAAT 


TcGGATTCGA 


TTGCACGTCT 


CGCAAAAACA 


3000 


ATGCCCTAAT 


GCACTAATAA 


AACCTATTTT 


CCCTTTAAAA 


TCTtTTAAGG 


CGTAATAACT 


3060 


AGCAGGGCCA 


TTGCCCAATG 


GTTTTTGGTA 


TGGTTGTAAC 


GTTCCATAAT 


ATTTGGTTAA 


3120 


AATGGCTTCT 


ACTTCTTCTT 


GTTGCTTACC 


TGGATGCTCC 


TTCCCTAATC 


CAATTGGCAT 


3180 
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CATTTCAATG 


AAACGAAGAT 


GCACAGTCTC 


TTTCTTmGCG 


ATTTCAGCCA 


GTyCTACAAT 


324 0 


yGTTGCTTCA 


CTTAG.CTCTT 


TCCTAGCTAC 


AGTATTAATT 


TTGATATTAG 


GTAATCCCAC 


3300 


ATCAATCGCT 


TGTTCTAAGC 


CCGCCAGAAC 


ATTCCTTAAC 


TGACCAACTC 


TTGTTATTTC 


3360 


ACGAAATTCT 


TCTGGATCCA 


AAGTGTCTAA 


GCTAATATTA 


ATGCCGTCCA 


AGCCCGCCTC 


3420 


TTTTAATCCT 


TGAGCCTCAC 


GAGCTAACTT 


CATGCCATTA 


GTCGTTAAGG 


TTACTTTTTC 


3480 


GATGCCTGAG 


ATTTGCTTTA 


TCCTTTTAAT 


TAAAGAGAGT 


AAATTCGGTC 


TAACAAGCGG 


3540 


TTCACCACCT 


GTTAATTTAA 


CTTTTTTGAT 


GTCTTCTTTC 


GCTAATATCC 


TCAATAAAAA 


3600 


AATAATCTCA 


TCATCTGTTA 


GAAGCTGCTC 


TTTTTTTAAA 


AAGCAAAGAC 


CTGTTGCTGG 


3660 


CATGCAATAC 


GTGCAAgTAA 


GTCACAACGA 


TCTGTTAAAG 


ACAAACGAAC 


ATAATCAATT 


3720 


TCACGGTTAA 


"ATGCATCTTT 


CATGCTGTTC 


ACTCATCACT 


TTCCTTTGAT 


TTAATTGAAG 


3780 


AACGAATGAA 


ATGACCACTA 


CGTCCACCAT 


TTTTCTCTAA 


TAAACCAATG 


TTTGTCATTG 


3840 


TCATTCCTCG 


ATCAATTGCT 


TTACACATGT 


CATAAATTGT 


TAAGAGTGCT 


GCCTGCACAC 


3900 


CTGTTAGCGC 


TTCCATTTCA 


ACGCCAGTTG 


AACCCACTGT 


TTTTGTAAAG 


CAACGTACTT 


3960' 


CTAAACAAGT 


TTGATTGCGC 


CAATGAAACT 


GAATTTCACA 


CTTTGTCAAA 


GCAACCAAAT 


4020 


GACACAAAGG 


AATCCATTCA 


AACGTTCGTT 


TTGCGGCGGT 


AATCCCAGCT 


ACTCTTGCTA 


4080 


CCTGTAATAC 


TTCACCTTTT 


TTAATCATTT 


GGGTCTGGAT 


CGCTTCTGCT 


GTCGCTTCCA 


4140 


ACATGTGAAT 


TTCTCCATAA 


GCGAGGGCTG 


TTCTTTCAGT 


GTCATTTTTT 


TCCGAAATAT 


4200 


CAACCATGTG 


CACTTCTCCt 


TGTGTATTCA 


AGTGCGTAAA 


TTCCT 




4245 



(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4013 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260:

TGAAATCAGC ATGGTACGTT TAAACCAAGA TTGTATCCGC TTTAAAAAAC AAGAATGATT 60 

GACAAAAATT AACTTTACAC TATAATATTT TTAATAGATA ATGTTGAAAA TTGAAAAAGG 120 

CAATGGTAAA AGGGGCTGTT AAAATGTCAA AAAAAACAAT TATGTTAGTT TGCTCCGCAG 180 

aATGAGTACT AGTTTACTTG TTACAAAAAT GCAAAAAGCA GCATTTAATC AAGGCTTAGA 240

TGCCCATATA TTTGCGGCTT CTGCATCGGA AGCAGAAACA AATTTAGAAA ATAATCCAGT 300 

CGATGTGTTA TTGTTAGGTC CACAAGTTCG TTATATGAAG AGTGATTTTG AAAAGCGTCT 360 

ATCACCAAAA GGTATCCCAA CAGATGTCAT TAGCATGTCT GATTACGGCT TAATGAACGG 420 

GGAAAAAGTG CTGGAACAAG CCTTGGGTTT GTTAGCACAA GCAGATAAAA AAGAGTAAAA 480

ACAAGTTGTT TTTTCGAGAA TCTTCGAAAA AGCAACTTGT CTTTTTTTAT GACTTTAGTT 540

AATTGGAATT GAGCTGAAGC AGTCATTCTG GTTTATCAGT TCCATAAATT ATAGAGCCGA 600

AATTAGTCCT GTGAGCTAGA GAAAGACAAC TGXCTTTAAT TTTTTACTTT AATCTAGATA 660

CTAAAAAAAC ATACTTAACA ATCTATTTTC TGAATAATGA ACGCCTTTTT CCGTAGTCTG 720 

AAAGTAGTAA TTCAAAGAAT GATTAACAAT ATTTTATATA CGAGTCATTA TAGTATGGCA 780

TAAGCATTTC GAATAACAGA CTTATTCAGC AAATAAATGA GAGAGAATTG TGTTTTTATT 840

GTTTAAAAAT AATGAGGTTA TTTCATAAGG CATACTTTCT TCATACGTGA TTTCTAAAGG 900 
ACTATCTTTA TCAAAAGAAA TAATTTGTGA GTCAGGCATT GCAAGTAATA TAGGTGAATG 960

AGAGACAATA ATAAACTGAG ACCCCTCATC AATTAGTTTT TTTATAGATA TCATTAAGGC 1020 

TAGTTGGCGT TGAGGTGAGA GAGCCGCTTC AGGTTCATCT AACAGGTACA GACCATTTCC 1080 

TCTAAACCGA TGGTTCACCA GATTCAAAAA ACTTTCGCCG TGTGATTGAT GATGTAGGGA 1140

TTTATCGCCG TATTCAGAAA AATCTAAGTC TATATTTTCT GCATATGTAG CAACATTGTA 1200 

GAAACTTTCA GCTCTTAAAA AGAAACCATC CGTAGCACGC TTTACACTTT TTCCAACAGT 1260 

TAAATAATTA TGTAAATCCA AGTGAGTATC TTTTGTAGAG AAATTAAAAT TAATACTACC 1320 

GCCTTCAGGA TTAAATCCCC AGTTGACTGC GATTGCTTCT AATAAAGTGG gATTTTCCaG 1380 

TTcCATTTtc ACCTGaAAAA AAAGTAATAG GttTTTAAAA TTTAGkGaAT nAAATTTATT 1440

CmGcCCTTCA ATTTTTGAAA AAACAGAATC TTCTTCAAAA GCTTTGAATT CTAGTCTTTT 1500 

TAAAAAAATA TTTTCCATGA TAAAGCTTGC TCCTTTCTAG TCTTTTCAAA TTAAAGCTTG 1560 

GATTACTAGA TATACTTTAA CATTAATTTT TCTTTAAAGG TAGATAAATT TGTTATAAAA 1620 

AATCTGCATA TAGGTAAGTC GAAAGTGGAA GATTCGTGTA AACAATAAAG CGTATCAAGA 1680 

GTTTGTTTAG TAGGTAACGA CTAGATAAGA ATAGAAAGAT AACGGTGGGA GATTTCCTTT 1740

TTCGCAAAAA AGCTAGAGAA ATGCGTATTT TATAATTGAG GTGGTTAACA TGGAAGATTT 1800

GCATGGACGT AAAATAATTA TTGAAGAAAC AGAACGTTCA ATACAGACAA AACTGGTTTA 1860

TTTACCAACA ATGTTAGAGC GTTCCCCCCT AACAATTCAA TGTCAGCGGT GCGGGGAAGT 1920 

CGTATCCAAA AAAGAAAACA GACTGGCGAC AAATGTATAC TACTGTCATG CGTGCATTCA 1980 

ATTAGGCCGT GTGACTTCTT GTCAAAAATT TTGTCATTTA CCAGAAAGGC CCAATAGCCC 2040

AAGAACTGTC TTTTTTGAGT GGTCTGGACA ATTAACAAAG GGACAGCAAG CGATCTCGGT 2100 

TGAACTTTGT GAGACAGCAA AAGCAAGAGA GAATCGCTTA GTGTGGGCAG TTACAGGTGC 2160 

AGGCAAAACA GAAATGtTAT TTGCTGTTTT ACACCAAACC TTACAAGAAG GGGGGCGGAT 2220 

TGCGCTGsTT CGCCGCGTGT CGATGTTTGC TTAGAACTGT TTCCCAGAAT CCAAGCGGTT 2280 

TTTCCTCATG AAGCAATTGC TTTACTCCAC GGAAACAATC AAGAATCGTA TCGTTATACG 2340 

AAACTAGTGA TTTGCACCAC CCATCAATTA TTGAAATTCC ATCAAGCGTT TGATTTACTT 2400

ATCGTTGATG AAGTCGACTC TTTTCCTTTT GTTAATAACG AACATCTTTA TTATGGAGTG 2460

CGTAACGCAC GAAAAAAAAG AAGTAGTTTG GTGTACTTAA CAGCAACGCC TACGAAAGAA 2520

TTGCGGAAGC TTTTACAGCA AAAGAAGTTA GCAGCCAGTA TTTTGCCAGC AAGATATCAT 2580

CGTAGGCCAT TACCTGTGCC ACGCCGATTT TGGGTTCAGT CGTGGCAACG TCTTTCTAGC 2640

AAAAATATTA AAATCATTGC ACATCATTTG AAACGTTTAT TGGTGCGTAA TTCAGTTTTA 2700

CTATTTTGCC CAAGCATTTC ATTAATTCAC CAATTGCATG CATTAATAGC TGAGCGATTT 2760

TCCGACAAAG AAGTTGTCGC CGTTTATGGT AGTGATGAGc TTCGCCTGGA AAAAGTTCAA 2820

GCAATGCGTC AgcAGAAAAC GGATATTTTG ATCACTACAA CCATTTTAGA ACGTGGTGTG 2880

ACTTTTGATG CAGTCTCAGT TATTGTTTAT GGTGCCAATC ATCGAGTCTT CACTTCTTCA 2940

ACACTTGTCC AAATTGCGGG CCGAGTCGAT CGAAGACAAG AATTCAATTA CGGCGAAGTT 3000

TTATTTTTAC ATGATGGGGA AACAAGAGAC ATGAAAGAGG CAATTCGGCA AATAAAGCAA 3060

ATGAATCGCT TGGCTAGTAA ACGaGGGATG TTAGATGGAT tGTAAAAATT GCGCAAgCCC 3120

mTTCGTTTGG AGATAACATT AAAAATGCTC TTACGTTTTA ACAAATGGAT GAGCCCTTCa 3180

CTTTGCCcAG CACTGCCAAG CAAAGTTCCa GAAACTGCCA ATGACAGGCA CTTGTTTTGG 3240

CTGTAGTCGC GTCAGCCAAG AGCGCTATTG TTTTGATTGT CAAAGGTGGC AGCTGCTTTA 3300

TCCAGAGTAT TCTTTTCATA ATGCAGCGTT GTTTCATTAT GATGAAGGAA TGCAAGAGTG 3360

GATGGAACGC TATAAATTTC AAGGAGACTA TCGCTTAAGA ATGTGTTTTA ACGAGGAAAT 3420

CAACTTTTAC TTTCAGCAAC AGTCTGCGGA CTATATTATT CCGGTGCCCT TATCCGAAAA 3480

ACGAATGCAA GAACGAGGCT TCAACCAAGT AATTGGCTTA CTAGAAGCGG CTGATGTTCC 3540

GTACAGCCCT TTTTTAATTA GAAAAGAGGA AAATGTTCCG CAATCTAAGA AAACTAGAAA 3600

GGAAAGGATG CAGTTGCAAC AACCATTTGC CATCCAAAAA GAAAACCAAA AAAAATTAAA 3660

AAAATGTTCT GTTATTTTGG TGGATGACAT CTATACCACT GGACGAACTC TTTTCCACGC 3720

AGCTGCTGTT ATCAATGACT GTTATCCTAA AAGTTTGAAT ACGTTTACTC TCGCGCGATA 3780

AAAAGAAAAG AATTCATAAA TTTGATGACT ATTTTGTTGG CAAAACCCAG ATTATTCGAT 3840

ATAATGAAAT TAAGAAGAAG GGAATTAAAG AGTGGTCCTT TAATCTTTTC TTTACTGGTA 3900

GATGAAAGGG GTAATTGTTA TGTTTAGATA TAATGTACGT GGAGAAAATa TCGAGGTAAC 3960

CGAAGCGATT CGAGACTATG TTgAGAAAAA AGTCGGTAAT TAGAAGATAT TTC 4013



(2) INFORMATION FOR SEQ ID NO: 261:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4951 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:

TCAGTGAAGA TTGTTTTATT TTCGTTATAG ACGATGGTAA AGATGCTTTT tTATCCACCC 60 

ATGATACAGG TTACTTTACG TCAGAAATGT TTGAATATCT TGAAAAGAGC CACTTACTTC 120 

TTAGTATTGT TTCTTTAGAT TGTACTAGTC AAACAAACGA AACTGGCAAT AGTCATATGA 180

ATTGGGAAGA AAATTTAAAA TTAATTGCAG AATTAAAAGA AAGAAAACTC GTCACAGATA 240

AAACGATTTA CGTTGCCAAC CATTTTTCTC ACAATGGCGG CTTGTCTTAT GCGGAGATGG 300

CGGCTTTATC TCAAAAGCAT GAAATCATTA CGAGTTATGA TGGTTTAGAA ATACTAACTT 360

AATCGGATTA TTTGCCTTCT TTTCAGCACT ATTATATGCT TTTAACCGAT AATTACAAGG 420

TACTTTTATC GTAACTTACA CTATCTTAAA AAACAAAAAT ACTCGAGATT TCCCGTGATT 480

CTGGAAATTC CGAGTATTTT TATGGTTCTT TACGCTGCTG GcACAGCGGa TAAACGTTTA 540

TTTTCTTCTT CATTCACTCG ATTAACAATA AAGTTATAAT TCACTTCATT TAGTCTTTGA 600

CCATGATTTC CTGTTTGCTT AATGCCGCCA TTGGAATGAA CACCAACTAC TTCAACCTGA 660

TCATTATAGA TTGGTGAACC aGATTGaCCG CCGGTTGTAT CGATATCATA AAATAGTAAT 720

GGATTTTCTA AGTCTGTAAA GTTAGATGTA AACAAATCAT TTTCATGAGA CCATTGTGTG 780

TGGTTTTTCT CACCTGGATA GCCTGATATT GTGACATGGG TATCTGAAGA TTCAAACTTT 840

TTCAAAACAA ATGGCGTTAA AATTTCTCCC AACTCTGGGC CATCTGGACG ATCGTTTTGT 900

TTGCCGACAG TCACTACCGC AATATCCGCA TTCGGGGAAA AAGCTACATC AATCACTTTG 960

AATTTGCCaA ATGGTGTCGC ACTGCCATCT CGACCTGGAT AAAACCAAGC ATCATCTTTG 1020

GCATTCGGAT TTAATACTTT GGCATTCTTA AAACTTTCAG CCACATGATT ATTGGTGACA 1080

ATTGTATTGG TTCCAACAAC AAAGCCTGTT CCTAAAGAAA TATAGCCTGG TTTACTGGCA 1140

GGGGAAATGA TTCTTCCGAT TGACGCAAAA GGCGCTTCGG TTGTATCTGC CACTTCTTGT 1200

CTTCTGTCCT CAGGGTCCAG TAACGATCTT TTTTGTCGTG AATGACTTTC GGCAGGAACG 1260

ATATACTCTT CTGCATTTGC AGACAAGCTA AAACCGGCGA TCAAAGTTAC TAAAATCAGA 1320

AACAAAAAAC CAGCACTAAT TTTTCGTATG GAGAACTTTT TCATAAGATT ATGCCACTCC 1380

TTATCCATTT TTATTTAATT GGGGAATTTT TTTCATTCAT TGACCAGAAC AGATTCACTT 1440

GGTTGGTTTA CCTGAATGTC CTCTTTAGCT CCGATTCCAG CAGAGTTAAA GGCTGCTGAT 1500

ACCACTGAAG CTGCTTCATC GCCATATTGA ACTTTTGCAG CAGCAAGCAT CGCATCACGA 1560

GCATCACTGA ATTGTGCTTT AGGTGTTAAG TAATTTACTA ACGAGCTGTA GAAAATAGTC 1620

TGTGCTTTTT CAATGCCTAA GTTCTGGATA ATGGTGTAAC CAATCCGATT AATAATTCCA 1680

CTATTATAAT GAACACCGCC TTGATCATAA TAAGGCGTTC CTTTATACCG TGCTCGATCG 1740

TCGTATTGAG CCATGGTTTC TGGTTGTCCG TGTTTACTTG GCGTTTGTAA ATTTCGAATA 1800

CCTGTTTTTC GGTCAACACT CTGAGTATCC GCACCAATTT CTGGATTAGA TGCACCCGAA 1860

ATAATATAAC CCATCAAATC AGAATAAGAT TCATTCAACG CACCTGATTG TCCTAAATAT 1920

TCTAAACCGG CAGTATGTTC CGTTACACCA TGTGTCATTT CATGACCAAC TACATCTAAA 1980

GAGGAAGCAT ACGTTTTTCC TGTTGGTGTA CTTGTTTCAC CATAACGCAT TGCTTTTCCA 2040

TCCCAAAAAG CATTGTCATA AGCATCTGGA TGTTGTTCAT CAACAACTGA CAAGATGGGC 2100

ATCCCTCGAT CATCAATACT GTGTCGTTGG AAATGATCTT CATAATATGT TTTcACAAAT 2160

TTTCCATGCG TATAGGCGTC AACAGCATTC TGATTTAATG CTACTAGTGA TGGTGCTTGA 2220

ATAATAGAAT AATTATTTTT GCCATCAACT ACTGCATGGT AAACCCCTGT GTTATCCGTT 2280 

CCGTGTAAAG CAATTCCCGT ATTGCTTTTT TCAACTGGCA CATTAAACGC TACTTGAAAA 2340

GAGTTTTTTA ACGTTACTTC ACTACCGACA TGTTCCGTTA AATCTTGTTT TTCCATAATT 2400

GTTCCATCTG TAGCGTTCAC TTTATAAACC AATGATACGG GGTCACCAGT TGGCGAAgAA 2460

tTTCTCTnAC TTTATAAAAT AATTGCCCCT CATTATTTAC ACGTTTGTCA ATAGCTAAAA 2520 

AGACAAGATC TyTAGTTTCC GGATTATCAG AAGCAATAAT TTTTTTAACT CCTTCTTCAC 2580 

cGATTGCATC arCCGATTGA ATTTTCACTA CTTGATCAAC TGGATTTCCt GTAACACTTG 2640 

TTACTGCACC ATTTCTATCT AAATGAATAA TCACATTATC TGAATCAACA GTAACGCCTT 2700 

CCGTTTGCTT TTCTAACTTA AGATGCTTCA TCCCCCACGC ATCTTCTTTT CGATCTACAA 2760

CCGTATATTG TTCAGCAGCT GCCTCAGAAA TTGCCTCCTT TTCCaACTTC GATAAAACTG 2820 

TTGAAACTTC ACTTTCTGAA TAAACTTGTT CTTCTGCGGC TACAAAAAGT GAAGAAAATA 2880 

GACATGAACT TCCAACAAAG ATGCCTGTAC CTAAAATGTA TAAAATTTTA TTTCCCTTCA 2940 

TCAAACAATT AACTCCTTCC CCAGTTTCCT TTTATTTCTT TTCCCAACAA AAGTATTGCT 3000 

TTATCCTCCC TAATTTTTcC CGTTAAACGA CCGCCCCTTT CCCTGCGTAA ATATTTGTTG 3060 

CTATTTTCAT TATAGACAAC TACCTAACCG GCACTATCTA AATTCCTTAA TCAGCCGACA 3120 

TTTTTCCCTG TATTGCCCCT CATAACTCAA TACCATAGCA AAAAAGTTGT TAACAAATTC 3180 

ATTCGTTAAC AACTTTTTTA CTGTTATGAT TTTTTTCAAT AATTAAATGT TGAGTGACCG 3240 

CTCCTTCATT GAAGGACGTT TGCAAATCAA ATTGATCTGA AAAATCAACA ATTTCTTTTA 3300 

CATATAACAA TCCCCAACCG TGCTCTTCTG GATTACTTTT AGAACTAAAT TTGGTTTTCT 3360 
TCATAACCTT AAAATCAAAT TCGGCTTGGA CACTATTTGT TATCACAAAC TCTTGTGTTT 3420

CATTTTTATT GAAAATTGAG ATAGCGAACA TTTTCAATTC TGaTTTTGCA CTGTTTTCAA 3480

TCGCGTTATC TAATAGAATA GCCAACAAAC GAATCACAAC CACTAAGTCG ATTGATGTAT 3540 
CAAGAATAAA TACTTCTGGC ACATCAACAA TCACTTTTAG TTTTGCTTGC TTTGCGGTAT 3600 
TAATTTTCAT ACTTATGAGT GCCTTAAGTT CCATATTTTT CAAACGATTT AACTTCATAA 3660 
GTTCAAATTC TTCATTATCA ATAATTTTTT TGGTTGGTGC AATCGTTTCT TCATAAATTC 3720 
TTTTGATTTC CAGAATATCT TsGkATGAAA TAGCAATTTG TAAACTATAC AATAAATTTT 3780 
TATAATCATG ACGAAACATC GCTAGCTCTT CGTTAATGGA TTCTATTTTT TCAGTATAAA 3840 
TCGTTAAATT ATTCAATAAT TGAGTGTTGA TACGTTGTTC TTCCAAATAA GCTTCAATGA 3900 
CTAAGCCAAT CAGAATTAAA AATAAACCTA CACTTATTAA TAACTCCAAT TTAGAACTAT 3960 
TTACCGTGAC AAATCCTCGC ATAAAGTCAG GTGAAATCTC TGGCGAAATC AAAAACACAA 4020 
AAAGTAACAA CAGAATCAAC AATAAAAGTA AACAACCAAA AAAAATTCGG CTATTTTCTA 4080 
ACACACGATT TAATCTTAAA AAAACACTAT TGGGAATTTT TAAAGCAATC CAGATATTTA 4140

TAAGCAACAG AAGAACATTT GAGGcAaTCC ATAATAATTT ATTTTGATAA ATCTGTTGAA 4200

AAAAAACACT TAAAATCGTT TGCTCCGTGT AATTGATAAn CGAAaCCATA AGCAaTGTAA 4260

AAnCAGACAA AAATATCGCT TTAAAAATTG AGAACCTCTT TTTGTGAATG AGTAAAAATC 4320

CTGTAAAAAT AACGACTGTC AGCGCATAAA TCAACCAAGA GTAGCCCAAC ATTAATGCCG 4380

CAAACGAACA AAGATTAATG ACAATAAGCA ATGAAAGTAA CGGTATTTTA CATTCAATCT 4440

TGATCAATGT TACTCGTAAA AAGACAAAAA CGATAAAGCT AGATACAAGC AAAACGTTAG 4500 

TAGCTAATAA CGACAAAATC ATTTTTCAAT ATTCTTTTTA GGTTTTTCAG TTTGTCCCAT 4560

CCATTGTCCA AATATATTGG GCGAATTTTG ATTTAATCTA GGCTTAGTTC CCACACCATC 4620

ACTAACTTTT GCAACTCTTT TTTCAATAAC ATTTTTAATT ATTTTTTTAC CAAATTTCAT 4680

AAAAACACTT CCTTCAATTA AATTTTTTGT TACAGGGAGT ATCATCAGAC CTTGGATGAC 4740

GAGACCGTAG AGTATTACTG AAGCAATCTT TTCTTTGACA AACAGACTAC ATAAAATAAG 4800

TAAACACACC CTTATAAGGC TTTGCTTTTT GAGCACGTTT AAATCAGATG GCTGAACAGG 4860

TTCAATCGCT GTTCCTTGTG GCGCATACCA ATAAAATAAT ACGACTAAGA GGCACATCAA 4920

TAGAATTTTA TAAATAAACG GTAAGCTCAC A 4951 
(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5277 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

TTAATTGATA ATCAAAGTAT AACATTTTTT AATGAATTTA CATAATAATA CAAGTGGAGG 60 

TCTGTTCGTG AAGGTAGCAG GAGACAAAAT AAAACAAGCG AGAAAACTTA AAAAAATCAC 120 

ACAATCAGAG CTGGCTAACG GCATCTGTAC GCAAGCAACA ATTAGTAATA TTGAAAACCG 180 

TAATCTATGT GACAGTTTAG ATATTTTCTC ATCTATTTGC CTCCGATTAG ATTTAGAAGT 240 

AGAAGAATGC ATGATGATTT CTGAAGAAAA GAAAATTGAA AATCTACTAA ACAAAGTCGA 300 

GGATTTATCT TTACGGTTAT TTCACTTAGA GGCCTACAAT TTATTACAAG AAGTGCCTGA 360 

AGGTCTTATC TTATCTAATA ATGAACTAAC GACTAAATTG TTATATTATA AGGGrATTAC 420

TTGTTTATTA GGAAAAAAAG ATAAAGCAGA GGCCTTGTTT TACCTATACC GGGGCGCGGA 480

AATTGACCGT AAAGTCAATA TTTACAACAT TTTAAGCCTG AATGCGCTAG GTACGCTATA 540

CGAACTGGAA AAAGATATGA GAAAAGCGCA AGTGTATTAC GAAAAATCAT TACAAGAATT 600 

GGAACAATTT AAATTAGAAT GTTCCTTGGA GCGTTGTAGA ATTTATTATA ATTCTGCTAA 660

ATTCTACTCG GAAATGAAAG ACTACCAAAA AAGTGTCATT TTAAGCGAAA AAGGGATTCA 720 

GATTTGTCGT GACAAACACT CCATTTATTT GCTAGATTAT CTTTTATATG AAAAAGCCTT 780

TAACAAACAA ATGCTCGGGG AAGACACAGC CGATGACTAT CGCCAAGCCT ATTATTTTAC 840

ACAATTTTTT GGCAATACGG AAGTCCTGCA ATATATTGAG AAAGATATGA AAGCTTTTAA 900 

TATTTCCTAT TAATTTAATC AAAAAGCCGA TAAAAGCTGA AAACTCAGTT TTTACCGGCT 960 

TTTTGAAAAA TATAGGCAAG TTGCTTTTAA AAATCAGCAG TCACGGTTAC GATAAGCAAG 1020 

ACGAAgTATT TAGGAGGATT TAAAAATGAA AAGAGTAATA TGGTTTAGAC GTGATTTACG 1080 

ATTACAGGAT AATAAAGCAT TAGCACACGC GTTACAAAAT TCTGCAGCTG ATGAATTGAT 1140 

TTTATTATTC CAAATGAATC CTCAACAATT TATTCAAGAA AGTGCTAATC ATAACGCTTT 1200 

TTTTGCAAGC TTAGCCTCGT TCAAAGAACG AATCGATCAA GAGGCACATT TACAAATCAT 1260 

GGTCGGCGAA CCATTAGATT TATTTTCACG TTTGAAACGC AAATTACCCG ATTGGCAGGC 1320 

CATTTATTTT AATGAAGATA CTTGTGGCTT TGGGGCAAAG CGGGACCAGC AAGCTATGCG 1380 

CTTTTTTGAA GAAAATAATA TTCAGTCTTT CTCTTTTCAA GATGCCTATT TGCATGGCTC 14 40 

TGAAGAAATT AAGAAGAACG ATGGCAGCAA GTACCAAGTG TTTACGCCCT ATTACAATAA 1500 

ATGGAAAGAG GCGCCTAAAG AAACACCGAT TCCTGTTTCC TATACAGCTG mAAAAATTTT 1560 

TAGTGCGTGT CTTTTTCCAG AAGAGGAAGC AGCTTATCGT GAACAGATTG CGAGGATtCC 1620 

TTTAACACAC TATAGTGTCG GCGAAGAAAC AGCCAGAAGG CGCTTAAATA CTTTTATTGA 1680 

TCAAAAACTT CAATCCTATG AAAATAAGCG TGATTTTCCT TATCAGGATC AAACGAGTCA 1740

TCTGTCTACT TTTTTAAGAA CGGGAGAACT TTCGATTCGC ACCATTTGGC AAGAGCTTGC 1800

ATCTGTGCCT TCTAGCTTAA GTAAAGAAAC CTTCAAAAAA GAATTAGCTT GGCGCGACTT 1860

TTACAATATG ATCTATAGTG CGTTTCCACA ACAAAAAGAG GAAGCTATTC AAGAAAAATT 1920 

TCGTTATATT CAATGGACAA ATGACCCAGA AATGTTTGTC AAGTGGCAAA AAGGGGAGAC 1980 

GGGGTACCCT ATAATTGATG CCGCAATGCG ACAACTGAAT CAAACTGGTT GGATGCACAA 2040 

TCGCTTAAGA ATGATTACTG CCTCTTTTTT AGTTAAAAAT TTACACATCG ATTGGCGTTG 2100 

GGGTGAAAAA TACTTTCAAA AAATGTTGAT TGACTATGAT GCTGCCAATA ATATCGGTGG 2160

CTGGCAATGG GCTGCTTCAA CAGGAACGGA CGCTGTCCCT TATTTTCGGA TTTTTAATCC 2220 

AATTATCCAG TCAAAAAAAT TTGATAATGA CGGCCAGTTC ATCAAAAAAT ATGTTCCAGA 2280 

ACTTAAGCAA GTGCCACAAA AGTATATTCA TCAACCAAAT CTAATGAACG AaGCCTTACA 2340

AACGCAATAT CATGTACATT TAGGAGAAAA TTATCCAAAA CCCATTGTCG ATTATGCATC 2400

AAGTAAAAAA CAAACATTGT TTCTATAksA AGCGAGCAAA GAAATtCATC AAGAAATGAA 2460

CAATCCAAGG TTTCAATAAA CAGTAAACCC AACTAGCTTA GCAAACAACT TGTAAGGTTA 2520 

GTTGGGCAAA TTAATTAGTC GAAAGAGAAG TGCnATTTAT CGGCTAAAGT TTTTTATTCT 2580 

ATTCTAGTTA ACATAATATA CATTATACAA AGTAGAGTAA AAAGCATTGA AAAGCAAACA 2640

AAACCAGTCT TTAGTTTATC TAGACTGGTT TyGTCACGTA CGTTATATAA ATTATGCTTG 2700

TTGCTTGATG GCTTGTGCGA CACGTGCTCC ATATTCTGGA TTtACTCTAG TGAACAAATC 2760

AATTTGACGC GCGATGaTTt CTTGaTTTTt CACTTgACCT AAAgATGCCG CTATATTGTT 2820

AATTAAGTTT TCTTTTTCTT CGCTTGGCAG CAAATTATAC AAAGCGTTTG CTTGTGTAAA 2880

GTGATCTTGA TTATAGCTAT AATTACCAAC ATTTCCTTCA ACTTCAAAGG AACTAATTTT 2940

CGCTGTAGGA TCTTCTTTTG GTGTTTCAGT ATAACTATTT GGTTCATAAT TAATTTCGCT 3000

ATTGCCATTG TTAAAACGCA TATTGCCGTC TTTTTGGTAG TTATTTACAG GCGCTTTTGC 3060

TTGGTTGATT GGCAATTGGT GACTATTAGC GCCCACGCGA TGACGATGTG CGTCCCCATA 3120

AGCGAACAAA CGTCCTTGCA ATAATTtATC CGGAGAAGCT TCAATACCAG GAACGAAATT 3180

CCCTGGTGAA AAGGTTACTT GTkCAACCTC TGCAAAATAA kTCkCTGGAT TTCTATTTAA 3240

AGTCATCGTG CCAACkTCAA TCAGCGGATA CTCTTTTTGA GAAACCGTTT TTGTTACATC 3300

AAAAAGTGTT TCTTTCATTG TTAACGCATC TGCATACGGG aTAATTTGCc ACAGATAATG 3360

TCCAAGAAGG AAATTCtTGA TTTtCAATtG CATTATGCAA ATCTyCAATA TGGrAATCTG 3420

GGtTTtCCCa GCAATTTCyT CTGCTAATTG ACTCTCTAGA TTTTTGATTC CTTGATTCGT 3480

CTTGAAGTGA TATTTAACAA AAAATACTTC TCCAGCAGCA TTGACCCATT TGAATGTATG 3540

GCTACCAAAA CCATGCATAT GACGGAACGA CAACGGAATG CCACGATCAC TCATTAAAAT 3600

TGTTACTTGA TGAAGACTTT CCGGTGAATG AGACCAAAAA TCCCAAACAG CTTCTGGACT 3660

TTTCAAATGC GTCCGCGGAT TTCTTTTTTG GCTATGAATA AAATCTGGAA ACTTAATTGC 3720

ATCACGAATG AAAAAAATGG GCGTATTATT GCCAACTAAG TCGTAATTTC CTTCGTCTGT 3780

ATAGAATTTA AGCGCAAAAC CGCGAGGATC ACGCAATGTA TCTGATGAAC CTAACTCTCC 3840

AGCTACAGTT GAAAAACGAG CGAATAAAGG TGTTTCTTTG CCAACTTCAG ATAAAAAATC 3900

GGCTTTAGTA TATTGTGCCA TTGATTGGCT CACCTTGAAA ATCCCATGAG CACCAGCACC 3960

TTTGGCATGA ACAACCCGTT CTGGTACGCG CTCACGATTA AAATGAGCTA ATTTTTCTAA 4020

TAAATGAACG TCTTGGATTA GGACAGGACC AAATTCTCCT GCAGTTAGCG AATTTTGATT 4080

ATCGCCAACT GGACTTCCTT GTGACGTAGT TAAATGTTGA TTTTTCACCT AAAACACTCC 4140

TTCATCTAAA ATTATACTCA TTGTATAAAA AAACCACTCT GCATAACAAA AATAATGCTT 4200

AAAAACACTC GAAAACATTT GTTATTTAAT AGAATGATTA AAAATAGCAG TAGAATTTAA 4260

ATCAAAAACA AATGTATTAA CTTTAAACTG TTTACACCAC CAAAAATAAC AACCAATAAT 4320

TCAAAGAAAA CTAAAATGCT GTTTTTTTAT TCAAAAAAGA CCTAGCATCC CAATCATGCT 4380

AGGTCTTTCG TTACTTTTTT TGAGCGGCCG TTTTTATTGG TCATTCAGCG TTATTGATTG 4440

ATCGCAACAG TCGTGTCTTC AGTCGATAGA CCTGTCATAG AGGTGCTATC GGAAAATATT 4500

TTTTCAACTT CTTGTTCTGA TAAAATATTy TCTTTCTTAG GCATkGAAAT CGTCGCTTTT 4560

TTGGCTTTTT GTGTAATGCT AATTTTACTA GATAATGATT CAAGTCCCAT ATCAACGTTC 4620

TTGTCTTGCT GCGGTTTCAC ATTAATCAAC ATATTTGTTT TATTTTCTTT TGTATTAACA 4680

GAAACTTTGG CAGAAACATC TTTGACTTTT TCGAAAGGAT CTTGTTCTTT ACCTTTTTCA 4740

GATGTTTCTG TCGTAATTTT GATTAACTTT TGTAATTCTT TTTTTGTAAA CGTATGGGTA 4800

ATGGTGTCCC CTT.TT.TTCIC JGAACGTATCC TTGTCTAATC CTTCTAAAAA CTCTTTGTAT 4860

TCCTTAGCAG ACTGTTTTCG TTCTTTTTCC GATTCTTGAT ACTCTTTGAT TGTATCTTGC 4920

CATTCTTTCG TACTTTCCTC GTTCATGGCT AAAAGATCAA TGTATTTTCC TTTTAGTTTT 4980

TCTGTATCTA ACTGAGAACT ATCGATAGCA CCGTCAGTCA TTGAGTCTGC GACAGCGACA 5040

ATATATTCCA TCATATCTGT TGCTAAATAG AGTTTTGGCT CTTTGCCCTC GTTATCTAAC 5100 

GATCCAACCA ATGAAATCGG TACATCCATT CCCATCGCTT TTAATTTCAT CTCTAAGTTG 5160 

AATGCTTTTT CTTTTTTGGC ATCTACTTGA ATCTTCCCAG AAAGCGATGC GTCTTTGATT 5220 

TGCGTGATGA GCATCCCAAT CATAGGATTC GTTTGTGmAC CATCTTCTTG TGAAAAT 5277 
(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 
TCACAATAAA TAAAGAAAAG TAAGAGGATA AATAACAAAA ATCTCATCAA ATTGAAAAGA 60 

AGAAGTGGTT GTTTTCACAA GTGTGAAGAG GGTTAATTAA ATACAAACAA GATTTCAATT 120 

ACATTACAGA AATTTTAAAA CTCAAAATTT AATACGTTTA TTTCTTGTTT CAATAGGAAA 180 

TTAACGGTAA AATAGAAAAG TGAGACAGTG AGGAGGAAAA ATGAATGAGC GAACAGTTAA 240

CAAGAAGTAG TAAACATATT CAAAAAGCTC TTAAAAAGCA AAAAGCTTAT AAACGTGCTA 300 

CCGCCGTTGC GGGGACTTCA ATGATTTTAG CACCTGTGGT TGGTGCTGCG GTACCTGCCC 360 

AAGCAGAATC AAGCCAGCAA GCATTTATTA ATGAAATCGG AAATTCAGCA GCAGCCGTTG 420

CCAATAGCAA TGATATGTAT GCTTCCGTCA TGATTGCTCA AGCGTTGCTA GAAAGTAGTT 480

ATGGTAGTTC TGGGTTAGCA TCTGCACCGA ACTATAACTT GTTTGGTGTA AAAGGAAGTT 540 

ATAATGGACA ATCTGTTTAT ATGCCAACAA AAGAATATTT AGACGGACAA TGGGTGACCG 600 

TCACAGcwGC ATTTAGAAGT TATAATTCTT ATGCGGAATC GTTCCAAGAT CATGCCAATG 660 

TTATTCGTTC CACTGCTTTT GGGGATACAT ACCATTATTC AGGCGTTTGG AAGAGTAATA 720 

CTAGCAGTTa TCGCGATGCA ACAGCCGCTT TAGCTGGAAG TTATGCAACT GATCCAGGCT 780 

ATGCCGAAAA ATTGAATTGG TTAATTGAAG CATATAATTT AACTCAATAT GATTGGGGAG 840

CACCAGTCGC TCAAACCACT AGTTATTCAG GACTCACCGG GAACCGTTGA T 891
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4030 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264:



AATTCTCTAA 


TTCGGATAGT 


TTGGCTGGCT 


TAAAGTCCAC . 


ATAATCCACC 


TCTGATTATT 


60 


TTTTCTTCGT 


ATTTATCGTT 


CCATTGGCCA 


TTGCTTTCTC 


TTGTAAACGA 


CGCTTTTTCT 


120 


TTTTAATTTT 


TGATTTTTTC 


TTACCCATAA 


ATTTTCCTCT 


TTTCATTAAT 


TAATATCTTA 


180 


TTATTTCTTT 


TTTTCAGTCG 


CAAATCCCAT 


GATGaTTTGa 


TTCTTTAGAT 


ATCGTAATTG 


240 


CTGTCTGGCA 


TATGCTTCCT 


CTCTACAGCA 


AGATAATCTA 


TAGTTAATAT 


CCTTTAAAAC 


300 


CGCTAAATCA 


AAATTGTATT 


TGCCCAGTAA 


TTCATTAATT 


TCTTCTTCAG 


CTGTTTTCAT 


360 


TTAATTCTCC 


TTTAGCGTGA 


TCGGTCTGCC 


AT ACT TT AAA 


ATTTCCCAAG 


CGCCATCTTT 


420 


CATATTGGTT 


TTATTCATAT 


GATTTCTTTC 


ATCACGAGCA 


ATTGTATAAT 


CGAAAAATAA 


480 


ATCAGCTTGC 


TCTGATCCAT 


GCAGGTACTC 


AACATACACG 


CCGTCTACTT 


GACGGCCCAG 


540 


TATATATAAT 


TCTGGATAAC 


TTAGCATTGT 


TGTTCTCCTT 


CAAGAGATAT 


AGTGGCCCCA 


600 


GTGaTTTTTA 


AGCCAATTAG 


TTTACTTAAA 


ACCATAATTG 


CTTCATCCAC 


TAGCAATCCA 


660 


TCTCGTTGAA 


TAGAACTGGC 


ACATGTAAAG 


CGGTCCATGC 


TTACAATCTC 


ATCCACCGTA 


720 


ACAGGATATG 


GAATAGTaAC 


GCCTGCTTTT 


TTGGCAATTT 


GTTCAATAGC 


TCGGACATGC 


780 


GATTTATTCG 


ATGCTAAAAT 


ATACTGACCT 


GTTCTAGCTG 


ATTCTTTAAC 


TAGCTCTGTT 


840 


GTTTTTCCAG 


TCCCTCGATC 


TCTTGCTATA 


ATTTTCATCT 


ATTTGGCCTC 


CTCACTTAAT 


900 


CCCCAAAAAG 


GCTTTACCTT 


GTAAAACTTC 


TCAACTTCTC 


GATTTAAAGA 


TCGCACCATA 


960 


CTTTCTAAGA 


CAGTTGCTCG 


TGTTCCTAAC 


GTGACATTTT 


GCTTTTTCGC 


TTTTTTAACA 


1020 


CTTGTAATAC 


CTAAGTTGTG 


CCTTAGTTCA 


GAAAAAATTA 


GTATTCCCTG 


ATTGCTATAT 


1080 


TTCCATTTTA 


AATCTGGATT 


GGCTTGAATC 


TTTTCCCATA 


ATTCGTTAGT 


GACCACTAAA 


1140 


TAGTTATAAT 


CACCTAAAAA 


CGTCTGTTTC 


GCAGAACTTT 


TCAAGTCTGC 


CATTGTAACT 


1200 


TTAATTTCAT 


AGCATCTAAT 


GGTGTTGTCA 


GTAGAATAAG 


TCATAAAGTC 


GACTCTTTCT 


1260 


TTGCCAAACC 


ACCCAATAGT 


AACTTCAAAA 


CAGCCGAACA 


CGCCCATTTT 


GTTGGTATAG 


1320 


TGCCACAAGC 


ATTTCTCAGC 


TTGTCTGGTT 


AAATCAGTTT 


TCATTCACCA 


CCATCCGCTT 


1380 


TCACAGCAAA 


CGGCCAATAG 


CGCTCATCAA 


TTGCTTTGAT 


TTCTTGTTCT 


GTGAATATTT 


1440 


CTTGAACCAA 


CCCAATCAAA 


TTATCTGACC 


AGTCAACAAT 


ATTATATTTA 


CCATTTTCAT 


1500 


GATCAAAAAC 


CAGTCGCTTA 


TGCACATCAT 


CGTTTTTTAT 


GAAATCAACA 


TAATACAATT 


1560 


GCTTTTTCTC 


TACTTCGTAG 


CCGTTATCAA 


TCGCATTCAA 


TAGTTTTACT 


TTATTTTCTG 


1620 


GATGTAAAAG 


CCAACCCTCA 


ACATCGTCAG 


TTATCAATGT 


ATATTCGTTA 


TAATATAAGC 


1680 


ATTCTAACTG 


• TTGACCAACG 


AATACCATAA 


ATGCAGCACC 


AGAACTTAGT 


TTTTTATCAC 


1740 


TAATTTTCCC 


TTTTAACCCA 


TCAAGCCAAT 


CAGCCACAAA 


TTTCTCTTCA 


. TGCGAAAATA 


1800 


TAACTTTTCC 


CTGTACTTTG 


AGCTCTTTTA 


, TGACGTCAAT 


TGCTGCATTT 


1 ACTCCGTCGT 


1860 
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CATAACsTCT 
TTAAATCTTC 
TAGTCCAGTT 
TGAAATTTCA 
ATTTTCATTA 
AAAGGAAAGA 
TTCTTCCATA 
ATCTAAATAA 
AAGACTTGTA 
CGATAGTCTG 
TTTGTGACTG 
TATGGTCTTT 
TCAACACGAT 
CCCTCCAACA 
AAAATATTAG 
TAAGCAATTT 
ATGTCCCCCT 
ACATAAGGAA 
GTGCCAGGAC 
TCAAACAAAG 
TTTGGAATCA 
GATAATACAT 
TTGCGCATAG 
TATATTGTCA 
TAAGCCTGTT 
ATTGATAAGA 
TTCATCAGGA 
CTTCTTCACT 
TTTGCATCAC 
AAATTGTCTC 
TTACCACAAT 
CACTACcTTC 
GAATTCCCAA 



TGCCCATTCA 
TTTTTCTTGT 
TTTTGATACT 
GTAATATCTG 
ACATAACCCC 
TATTCGCTGT 
ACCTCTAGCG 
TCACTCATTC 
ATTGCCCAGT 
GTTTTATAGA 
TATATTTTTC 
TAAAACCTTT 
CCATCTCAAG 
GTTCTGGGTT 
ATTGACAAAA 
CGTGCATACT 
CAAAAATTTC 
ATTCATCCCA 
TTCCCTTAAT 
GTGTAACATT 
TCTTCTTCAC 
AAATTCCTGT 
TCTACATCGT 
TCATGTTCGA 
GATTGCATGA 
GGCGCTATAA 
TAATACATTA 
CGCTTTCTAA 
AAAACATATT 
TAATTGTTAA 
TGGcACAGGT 
TGATTATTTA 
ATGGCCTTcC 



TCTTCTGCCT 
TTATTCACCG 
TCAAAAATTC 
TATTTTTAAG 
ACGAACCTAG 
TGCAGTCTCT 
CTTCAGCTCT 
CGCTTCCTCC 
TTTTGACATA 
ATTAACTCTC 
TTTCAACCAA 
TACGATGAAT 
TTGACGTTCA 
TTCATGGATG 
TACAAGAtCT 
TCTTCGGTGG 
AACGCCGTTC 
ATTAAAAGTT 
CTGAAAATCC 
TTCTAGCATT 
TCGCTTTCTC 
AATATGATAG 
AAACAATTTC 
CTAAATCCCC 
GAACATATTT 
ATCCTGTATC 
TGTTTTCTTC 
TATTTCCACT 
GTCTGTTCCA 
ATATGaTTGA 
AGAGCTATCT 
ATTAATACAT 
AACTCTTCAA 



1256 
CGCAGCCTCC 
CTGATCCTCC 
AGTAATTATG 
AATTACAGTT 
GTCCATATAT 
GACCACATAT 
TGGTCTAATC 
TGTTCAATGG 
TATCTGTAAC 
AGTCTCCAAA 
TCCAGAACAA 
AAAACATCTT 
ATTTCTTTTA 
TTACCGATGA 
AAAAATTCCC 
GCGTATTTTT 
TTGTCTTTCA 
GTATCATAGC 
GTAATTATGG 
TCTTTCTCGT 
CAACAGTTCT 
GAAGTTGCTA 
GCAAGCTTCA 
CTCAAAAATT 
ATCAATCATT 
ATCATCAATA 
TACTGAATAA 
TCTTCCGCTG 
TCTTCAAACT 
TTAGTAAATt 
TTCTCAAAAT 
AAAATCCATT 
AGGTCCTTTT 



GATATCATAG 
ACTCTAAAAA 
ATCGGTTGTT 
GATGGCAATG 
TTTTTAGGCT 
TTAAACCCTC 
TCCATTATCT 
CCCATCGGCT 
TTCTATAGAC 
ATAATTCTAT 
TCTGCTGATT 
CtGCTGACAT 
TTAGTTCATG 
CTAAATAGCC 
AGAAAAAGCG 
TTGGTGCATC 
AGCCTGTTGA 
TTCCTATCTC 
CTATCATGCT 
AGGTATCCCA 
GGATTTTCGT 
GCATCGTTCT 
AACGTTTCCC 
TCAACGCCGT 
CCCCACATGC 
GTCCATTCTA 
GCTCTAAACT 
AAAGAAAACT 
GGAAAAAATT 
CTCCATCATT 
TCGGATAAGT 
TTTCTTTGCA 
TGTAATTTTG 



AGCGCTTGAA 
GTTCGTCTTC 
TATTACGCCA 
CACCTTGTGC 
TCAAGGAAAA 
TTTTAAAAGC 
TTTTTATTCG 
AAACGCTTGT 
AGGTCTATCA 
AGGTTCAATA 
TTCGTTTAGA 
ATTTCCTTCT 
CATTTAATTT 
ATCGCGATAC 
CCCATCTGCA 
ATAAACTAAA 
TTGCATGATG 
TATTTCAGAC 
ATTCGAGTCA 
CGCTCTAAAT 
GAATATTTCC 
TAAAACAAAA 
ACGTGCCATT 
TCTTGTCTTT 
CATTTTCTAG 
CATTTTTATC 
TCGGAATCAT 
AGCAAGACAA 
AACTATCATA 
AAACAATGCG 
CACAATTTTT 
ACGTCTCCTC 
GAAAGTTGCG 



1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 


2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 
TATCATAGCC CAAATTTCTA AGTGTCACGT ATTCATCTGG TGTTAACTGG TCTAAATCTA 3900

TcgCaCAAGT GGCGGAAAGC GTTCAACCGA CGACTCAAAT ATCACACTAG TTAATTTCAT 3960 

TTTTtCACCC TCAACAATTT AGAATTAATT TTTTACGGCT TTCGTTAAAT CAAAACCAAG 4020

AGCAGTTGGA 4030 
(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

TAATGCAATG CTTGCAGCAG ATTACTGTAT TATTGTTCTT CAAACACAAG AGTTATCTTT 60 

AGACGGTGCT CAAACATATA TCGCTTATAT GCAATACTTA GCTGACACAT ACGATAATGA 120 

TTTACAGGTC TTAGGTATAA TTCCTTGTAT GTTGCGTCCT GGGGGTAGGG TAGACAATAA 180 

AGTATTAGAA CAAGCAAAAG AATTATATGG AGGAAATGTA TTAAACACTG TTGTAAAATA 240

TCAAGAACGT TTAAAAGTCT ACGATGTTGA GGGAATTAAA ATTAGTCATA ATTACACAGG 300

TAAAGCAGAC GGTTGGGACG TTAAAGCGCA TAAGGTATTT ATTGACGTAT TAAATGAATT 360

AGATCAACAT CAACTAGTAC TAGAAGAATT GGAGGGATAA AATATGAGTA ATAGTTtCGA 420

ACCTATTATC AAAaCTAAGA AGAAAAAATC TTCTTTAGAA GAAAAGCATG TTAGTGTTAA 480

AGGAAATGAT AATTTTTCAA GAGATACTAC AAATAATTTT GCTACAAAAC CTACATCAGT 540

ATCTAAAAAA ACAAGCACAG ATAATAGTGA TGTTAAGCCT TCTACAGTAG ATCGACGATA 600 

TAAAGCTACT ATGACTCAAA AAATATCTCC TTCAGTCAAT CTTAAAATAA ACACTTTAAA 660 

ACCCTTTCTT GGTGATTTAG AAAACATGCC CAAAGCCACA GTAAATGATA TTTTGAACAT 720 

CTTGCTAGAT AATTATGTAA ACACAAAATT TTCTACTCGC CAACAAGAAG CATTCAATTC 780 

TATGTATAAC ACACAAATTG AACTTTTAAA GAAGTAGTAA GTTTAACTTA GGTATACCAG 840 

TATACTGGTA TACCTAAAAA AGAGAGTATT TAATATACTC TCTTTTTTTA TATTATAATT 900 

TATGATTCCG CATTATAGAA TTCTTCTTAA TCGTGTATAT TCGTTTACAT AACGAGCATT 960 

TATACTTGTC TAAATCACTC GACACTCTAC TTGCAAGACA GCCACACGAG CACTGCACAT 1020 

AGGCAATACT AGTTGTAGTA GTATCTTCTT TGATTAGTTT TCCTTGGTAA TCATATGTTT 1080 

TCATAAAAAT CTCCATCACT GTAATAGTCG TTAATGAAAG CATCAATTGT TTTACTTTCT 1140

TCTTCGGTTA ATTCTGAAAA TTCTACATCT ATAAAAGGCT CTTCTACAGT TCCTACTGCA 1200 

GACCATTTCA AAAAGTTATG AATTTTAATG TGCCTTATGT CGTAGAAATC GATTTGAAAA 1260 

TCACTAATTA AAACAACATC AAACTCAGCC ATTCCACGGA AAACACCAAA CACATGCGGT 1320 

TTTACACGAT CATATTCATC CAATGAGTTC AACTGGATTT CAAGCACTTT ATTTTGTTTA 1380 

ATTGAGCGTT CTAATAAGTA TTCTATTTGT ACTTGAGATT GCTGTGGCAG TCTTTCAATC 1440
TCTCGTGCAT TAAATTCATC CGTTTCTTTG ATTGCTGCTT GCAACTCTCC TAAAGGGAAA 1500

GCAGTTGGCC ACTTCAAATC AAAAGGGCGG TCTACATAAT CGTTATAAGG TGTAAATTCC 1560 

TTTTTAGTTT TCTTCATTTG TTAACCTCCA AGACCGACGT TGCCACCTGC ATGACCACCA 1620 

ACTAAGCCCG AACGATTCAC AGcTGTCGCC CCtTCAAGAA GTGAGGACGC ATGGATCAAT 1680 

GCTTTAAAAC CAAATTTCTG ACGAATTTTA TCAATTAGAA AGTCTAACTG ATTGTTTAtT 1740 

ATTTGTTCTT CAGGAGGGGA GAAGAGATCT ATTTGAAGTG TAGTATCCCA AACCAGCTTT 1800 

CCAAAGGAGA CTCCTAAATT TCGAATATCC ATGCCAGGAA CATAGTTTTC TCGAAAAAGT 1860 

TTCAAAACAT GCTCGGTTAA CACTTTTGTG TTGTTGCTAC GAGCGATTTT AAGCTGTTTT 1920 

CGCCAATGAG ACCTGCCCAG TCCATCAACT TGTCCTTTAG AATAACCAAC AAAAATTGAA 1980 

ACACATTCAG TTTGACAATT TGAATTTCTC AACCTGGTTG CTACCTGGTC TGAAAGTTCC 2040

TTCAGCACTA ATTCAATTTG ATCCTTTCGA GCATAGTCTC TAGGTAATAC TTGACTATTA 2100 

CCGAATGATT TTTCTTTGGC CACCTGGCTT TTTTCTCCTA AAAAACTACG ATCAATGCCC 2160 

CAAGCATGTG CGAACAGTTG TAATCCCATG ACACCAAAGC GGTGTTTTAA TAGATAAGGA 2220 

TCTGCTTGTG CTAAATCATA AATCGACTCA ATTCCTGACA TCTTTAAACG AGCAGCCATA 2280 

CGCTTGCCAA TCCCGCAAAA TTCAGTGATA GGGGAGATAG ACCAAACTTT TTCTTGTACA 2340 

t 

TCTTCATATC TCCATTCTGC AACAAAACCT GGTGTATTTT f TGCTTCGTT ATCTAGGGCC 2400 

AGCTTTGCTA ACAATGGGTT ATCTCCAATG CCAATCGTGA CGTATAGCCC CATATGGTTA 2460

TAAATGACAC GTTGAATAAT TTTGGCCAAT CGATAAGCAG TATCACAATT AAAATACTTC 2520 

AGAGAAGCAG TTATATCAAT GAAGCTTTCA TCTACACTAA AAACAGAATG GTCTTCATCT 2580 

GCCACAAACC GTTTGAGCAG ATTATTAATT TCACTATTTC GCTTCATATA TAAATTCATT 2640

CTTGGTGGAA CAATAAGCAA TTCTGGGTGA TCAGGCACTT CATCTTTACG AGTCACATTA 2700

GAAATTCCCA GGACTTTTTT TGCCATTGGC GAAgcAGcAA GyACwArTCC GCCTGCATTA 2760

TCCGCACCAC TCATyACTAT AAGCATTGTT TTrAGGGGAT CTAAGCCACG TTCTACACAT 2820

tCTaCGCTCG CGTAAAaGGA TTTtACaTCG aTrCAAggAC ATCTCTCCGg GTTCTTgGAA 2880

TAATCAAtTT CATGCAAAAA CCCGTCCTTT CGAGTTACAT TATACGAACT AACGTTCTTT 2940 

TTTGCAACAA CAAAATTTCA GTTTTCCGAA TTATTTTAAA TTTAAAGTAG CTCAGAAATC 3000 

TACAGTCATT ATTTTTTTAG TTTGTTTTTA ACGCGCAAGT AGATATCAGC CAGCCAGTAA 3060 

ATAATTGTAA TTAATACTAT TACTTTTAGA AAGTTCACCA TGAATACTAG ATTGAATGCG 3120 

TATTTTAACC ACAACAGCCA TAATATAATC GCGATAACTA CTTTTCCAAA GGTATTcaAT 3180 

TTCCtAACCA ATAACCAATG TAAAAAATCT TCTCCTTTAT TTTTAGTAGC TCTATATTCT 3240

TCTAGAATAT TTTTTTTCAT GTTTAAACCt CCTTTTTTCT CrTCTTmTAT TmACATATAT 3300 

ATAATTACAT GTTAATATAm ATAwATGAAA AAGAAAGGAG GTAGTTTTCT TATGAAAAAA 3360 

TTAGCACTAG GTTCATTGCT TTTAGTTGCT CTAGGTGTTA TCGGGACATC TTTTGCTTTA 3420
ACTACTGAAG TAAAACAAGA GGCTGGATAT GGACATCCTC TTATTGATCA CAAAATTTAG 3480
TCTTAAAAAG GACATAGCCT TGAATATTTG TAAAGGTTAT GTCCTTTTTC ATTTTTATCT 3540
CTTATTACTA TTTTCGGGCC GTCGTATAGA TTCCCCCAAC CTAAGAACTT TTCATTTGGT 3600
TGAGCTGACT AAACGAAgGG AAACCAAACT AAAACGTTGA AGGCCAGTTG AGdAAAGCGA 3660
AACTAATTAT tCGTCTTTTC GGGTTtGACC AtGTtCCTGG TAtaAAAAAT TATTCGTAtT 3720
TAAATAGAAA TATTTTTATT ATTTCCGCCT GCAGATCACC ATTTAATCAT GTAAAAGCGA 3780
ACGTTAAAAC ACCTTCATTG CATAACTACC GATCAATATT TGTGAAAATG GCTTAGAATT 3840
GCTCTCTCAA ATCTAAATTC TAGTTTCTTA TTTTACTTTT GGTACTAATT CATTTATTTT 3900
TGAAATATTT AAGGCAATCA TTAACTCACT GATTGTTTGT ACACCTACTT TTGTATCATC 3960
AAATAGTCCT TCATTGTACA TTCCATCTAG TAATACTGGG TCTATTTGTA CATTTCTTTT 4020
CAGTTCGTCA ATTGTCATTC CTTGTTTATC CATGATTTCC TTtAACTTGA TkTCCATTTG 4080
ATTATTCTTC ATAACTTTTT CCTTCTTTCT TTTTTATCCC TTATTACTAT TTTCAAnGTG 4140
GCACATA 4147

(2) INFORMATION FOR SEQ ID NO: 266:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2705 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266:

GTTCTTCTTC GAATTCATCT CCAATTACCT CGACTTCCTC ACCTTnAATA TCTGGTnGGT 60
CAAAACCAAA CTCTGTCATC TCAAAATCTA AmTCTAAATC TGCTAATTCA TCTAACTCCG 120
CATTTAATAA TTCTTCATCC CATGTAGCAA TTTCTCCGAC CTTATTATCA GCTAAACGAA 180
AAGCTTTGAT TTGCTCTGGA GTCAAGTCAT CAGCAATAAT TGTAGGTACT TCTTTTAGTC 240
CCAAATACTT AGCAGCTTTT AATCTTGTAT GCCCATTTAC AATGACATTA GAAGCATCTA 300
CCACAATAGG AACTTTAAAT CCAAACTTTT CGATACTTTT AGCAACTGCA GTGATAGCAC 360
CTTCATTATG TCTGGGATTT TTTTCATAAG GTATCAAATC TTCTGTTTTT TGTACTTTTA 420
TATCCATATA TGTAGAACCT CCTTGTATTT AAGTATCTAG TGGGATTCGA ACCCACGAAT 480
GCTAGTTTTG CAAACTAGAG CGTTGCCACT TCGCCATAGA TACATGAAGC AAACTTATAG 540
AGCAGTATAA GCTTGCTTGA CTAAACGAAA CATTTATTGA CATACATTTT TTATTTTGTG 600
AATCACTAAA ATTCACAATG GACCAGATAG GATTTGAACC TATATTCAGC ATCTGCCTAC 660
TTCCTTTATA TGCACTTAGA TATCATTACG CATAACTGAT CCATAAAATA TGCAACAAAA 720
AAACCGACTG CATAATTTGC AATCGGTTTT GTGTTTTACA TATTTTTCTA TACTAGTATA 780
ATAACATCAT TTTTATGACT TTATCCGCCA TAAAACCGCA AAAAAACCAC AATTTCAGTC 840
AGTCGCACTT TTATAGCATT GAAGCAGACC AGATTCATAT GCTTCAGCAA ATTCAAGAAG 900
TGCGACaCaC TTTAACGTTT CCACGTTGCT AACAGAATAA CCTAGTTCAG AAGCAATCCA 960
AACATTAGTT TTCTTTTCAA CACTGCAGTA AGTATAATAC AATAGTTGTC TATTTATCAA 1020
TGACAATCGG GCCAATGCAT TATCTATTTn TAATTTTTCT TTTTCCGCAT CAATTTTTCT 1080
AATAAAATGC TGTTGAGATC GATGTGTATT TGTGTGCTGA ATAGGCACTT CTTTAATTAT 1140
CGGCGAAGTT ATATTAAATT GAGGTGTTCC TGCAATTCTT TGCAGCCGTC TGTATTTTTT 1200
TAAAAGATTT CGTGCATTCT GTTTTGTTTT TTTGTCATCA ATATCAGGAA ATAGCTGCAT 1260
TTTTTATCCC TCCGTGATAC AATATCTATG CAGCGTTTAT TTCGGGAACG CTAAATCACA 1320
CATCCGAATT ATATTTGAAG TCGGTTTTTG TTTAATATTT TTTCAAGATG CCAACCACTC 1380
GCCGTCTAAC GAGTGGTTGT TTTTTTTATT GTCTATCATC TTTTAATTTT TCCAGTTCTT 1440
CTTCCAATTG TATTTTTTGT TTATATAGCG AACTTACGCA CAACGCTAAC AAAATAATTA 1500
CAAGCCATAA GAATTCATTT AGTGTTATCA TATCTATTCT CCAGTTAATT AAATATTTTt 1560
ATTCCTCCAA TGCCCAGCCC ATTAATTTAA CTATTTCAAA TACTGCATTC CGCCTTACAC 1620
CAGTTGAGAC ACCtAGGTwA TTAAACTTCA AAACATCCCT GTCTAgATTA TTTGTAACTG 1680
TTTCAAAATT CCTAACATCT TCarasTTCA TTGTTCCGCC AGTCGGTAGC CATATAATCa 1740
AACTTTTTGA TTTTTCCATT TAtTGATCCT CCTCACTTAA TCCCTAACAC TACATATCCA 1800
TCTTGTTGGG CATAATCTGT AATGTAAGTT ATTTCTGCAA CATGGACATC ACCTGTATAT 1860
TGTCCGTCTT GATATTCGTT TAAGCGTAAT ATATCGCCGT TTTGATAGTT TCGGTCATTC 1920
TTACGTATTT CAAAACGCTT ATCACCTGAA ACAACCGCTT CAAAGTATTC TGGTAAGATT 1980
TTTAGTTCGT GAATGGTTGG TTCTTTCTCA ATTCGTATCC GTTAGCTAAT GCATCAATCA 2040
ATAAATCTCC ATTTGTCTTA AACCACTTAG AAAATTCAGC AGTTGGCATT GCACTTGAGA 2100
AAGAAACTGc CGAACCGATa ATGTCTACTT TATcAGAACC CATTGaTAAA CCTTCTTTTA 2160
TAAATTCATC GGCACTTCTA GGCAATGCAG CTTTTTTCGA TTCTtTAATT GTTTTGCTAA 2220
GCTAATTGCT CTTTCGTTGG CATAGTCAGC ACCTTTCAAA TAATCAAGGC TATCTGTAGA 2280
AACTTCTAAG CATTCTAACT CTTCAATCAA TTCTTGTTTT TCCATTAGAG TCCACCTTCC 2340
AAAACTTTAA ATGTACTTTT ACGCTCATCG AATAACGCTT TATTTATTTC CTGTTCAGAC 2400
CATGGAACAA TCATTTGTGC TGTGTATTCA ACAACCAGAT CAACAAACTC ATAGTTAATA 2460
CTATTAATTG AAATCACTTC ATACGCCATT TTTTTATCAC TAGAAAATAT GATATTTCCT 2520
ATTTTAGCTA ATTTTGGCTA ATCTTAGCGT GTCATCTTCT TTACTTTGAT GTAACGTTCA 2580
GTGAATGTAA AATTCATTTA TATTATGATA ATCGTTATTC GCCTACTTTT TGGACCAATA 2640
TTTTACTCTA ATCTTACAGG TTACTAAAAT ACAATTTTTT CAATCAGTAT GCGGGATAAC 2700
AATAT 2705

(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1302 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 



Kj(j i AAu 1 Hurt 


TPrrATGGTT GATTCACCTT CCAAAAACnA 


GCACnCATAG 


AAGAGCTTCT 


60 


T AAA ILi 1 It 


rPAPTGTGCA ACATAATTAT AAATAAATTT 


CTTTGGTGTC 


ACCTCGCCAA 


120 


AGGCAAAA 1 U 


rATATPATAG TATGCGGGAA TTATTATCTC 


TTTTTTCGAA 


TTTTCAGGAT 


180 


ATAAAA1A1 1 


TTTPTrrAAA CGCGCCTTGG CAGCTTTAAA 


TTCAACTGAT 


TGTTCTCCTA 


240 


AAACAA1 1 


PTTAft: AftfiAA ATACCGCCAT TATAAAACAT 



AGCTACATCT 


TCCTTATAGC 


300 


GAI 1 lbL-CAi 


GtCAACTwGa GCCAAGCGGG CACTAAGTTC 


TtTCCCCGCA 


TTACGTAATT 


360 


CCC.3 1 AAAAA 


ATACAACaTA ACAATAATAA AAACTATAAT 


TAGCCAAGGC 


TTTAACGACC 


420 


np | rrT"T , 7\ TTPP 

TILL- 1 Al Jt LL, 


TAATAAATAT AACCAAATTT TCTGCAAAAA 


ACTAACCTTA 


GGTAGCGAGG 


480 


i 1111 oo-rt 


CATAGTATCA GTAGTAAGTA GTTCCTTAAT


CCGAGTGGTG 


ACAATATGTA 


540 


1 1 1 bLAl 1 


AAAATCCTTT AATTTATAAA AATCTTTCAC 


GGTACTATCT 


GTTATTTCAT 


600 


Lbo 1 1 A/iRt 1 


ATCTGGGATT ATCTCTTCAA GCCCATAGCC 


TACTTCAAGT 


CTTGCTTGCC 


660 




AGCTACTAAG TAAATCACTC CATTATTATA 


AAGTTTATTT 


CCGATACCTA 


720 


ATTTATTACC TATTTGATTG GTAAAGCTTT CAATCGTTTC GTTTTCAGGA ACTkGCTCAA 780

CTGTGATAAC CATTAGTTGT GGTTTCGTnT CCgTCTTACT AArTTCTTGa TTAATTTTTk 840

GTACATTATT AAkGACTTCy TCGTTCAAAA TTTGTGCATT AyCTGAAACA AAAAGATGAT 900

TTyGaTTAAT TTTTACTACG ACATCGTCTG TTGGAGCTGA ATAAGTCCTT TCAAATCGTT 960

TTAAGAAACT CATTCCCAAA TAGATCCCTA TTAAAAGAAC AGCTAGATAA ACTAAATTCT 1020

TCATTACAAT ACTTTTTACT TTTATTTTTC GATACTCCCC CACGCTCATT TGTAATTCTA 1080

TCCCTACTTT ATTTTTCCCA TACACTGATT TTTTCTCCCT TCCACTCCTA CCGAATGTTC 1140

TAGCTCTGaT TTTCCACAGA GCTAAATAAA TTTTTTTGTA TTAAAAATTT ATTtTCTTAA 1200

TAGTTAGTAT ACCGAATATA GACGTTCAAA TCTCTATCTT TTTAACTAAT TTCTGTTAAC 1260

ACGCATAAAA AGAACGTAAC GCCTAGTGGA TAAGCGGTAC Gn 1302


(2) INFORMATION FOR SEQ ID NO: 268:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 
AAACTTGCAA AAGAGAACGC AAGCAAGATA ATAGAAGACG GTTTTATATA GATAGCTACC 
GnAAAGGGTA CCAGAAACCA AATTAAAAGC CTAGAAACGT TGGAAAATCA ACGTCTAGGC 120

TTTGTTTTTA GACTATTCCC ACTCTGTAGT AGAAAGAATA GTTTGACTTA CTTTTGTTTG 180 

TATATCAAGG TTTAAGTTTG GTTGAATTGC TAAAATATGG CTTAAAATGA TATATACTGT 240 

GATAAAACTG TGATAAACTT TTTTGATGAA AAAATAGATT TTAGTAATTA AAAATTTGTC 300 

TGAATTTATT TATATGCTTA ATTTGAATAA TGTTCTTTTT TAGTTTTATT GATAATTTAA 360 

AATTTACTAT GTGTTTTATt GTAGTAATAA TTtAAATATA CAGAAATTAT GGkTTTGTAT 420

AGkAGTTTTT ATTATAAAAG TATTGAAAAT ACAGGTTTTT AATGTCTTTG TTTGCATAGT 480

AAATAAGTTA AAAAATAACT TTTGTGTTAA CACTTATTGC kTTTTGAATT TATTATTTTA 540

AATTAATCTG GAATGATATt GTTATGTATA AAAAAACTAT GTTGTTTAAA CTTATCGACG 600 

TGGTATAATT AATATAGGTA TTTTGAAAAT AAAAAGAAGG GGTATTTTAT GTTTACTAAC 660 

AATAAAAATA AAAGCTTATA TTTTGAAAGA ATGCAAATGA TTGATATGAT AAGAAATTTG 720

GACAATATTA GTTACAAAGA GTTTCCTAAG TTTATnGGGG AGCTGTACAA TCTTATGGGG 780 

TATAAGGTAG AaCTTAATGA GAGAtGAATA ATATTGATTT ATACGCAAGT AAGTTTTCAG 840 

AAAAGTTAGC AATACAGGCA AAAGCTTACA GCTTAAmCTA TAATAAGTCG AAaGCAATTA 900 

ACmAAAGAAG aGTAGAGTCA TTTGCTTTGC AAGCAAGAGA AAAAAGAAAA AAACCTATAT 960 

ATATAACTAC AGGAGTTTAT ACAAATCCAG CATATATAGA AGCTAAAAAA ATTGGTGTTA 1020 

CTTTATTTGA CAGAAAAAAT ATTTTTGATT TAATTTCAAA GGCTGATCCT AGCTTACTGG 1080 

TCGAGGTTAG TTATTTGGAA TCAGTTGAAA AACACGATTT AAAAAAATGT AAGCATTGTA 1140

AAATTGGGCA TGCTACGAAA GTTTACTCAG AAGATACGAG TAAGTTTTAT TTTATTTGTA 1200 

TGGATTGTAA GAAACATACA TAGGGAGAAG TAAAAGTCTA GTAACGTTTA GTTTCTAGAC 1260 

TTTTTTATTT GAGTAAAATC TCTTTAAGTT GTGTTTGTTG TTGAAAATAG TAACTATTAT 1320 

AAATAAGATT GGCTTGTTTA TAAGATAAGT TGTGGGAATA GATTAATTCG TCAAGAATTC 1380 

TTATCTTGAG AACTATATCG TTTACGTTGA AAAAACTAGC CATTTTTTCA AGATCATCAT 1440

AAGTACTAAA TGCATACTTT AAGGCTTTTT TATTAATAAG TAACATTCCG GCACCCTTAT 1500 

TTGCTCTGAA TTCTTTAGGA TAATCTTCAG GAAGATAGCC CTTTTTTATA AGTAGTGTAT 1560 

CAAATGATTT TTCATTGTTA GGCATATCTG TCGTATAGTC AAAATAATGA TGAATTATTT 1620 

CGTGCATTTT CGTAAAATTT TTAATTTCTA TTTTCAAGTG TTGGGCAGTC GCTACCATAG 1680 

TTACATCACC AATTTTTGTT ACTGATCCCC AGATTAATTT TTTAGaTTGT TTTTTGAATG 1740

GATAATCAAA AAAATCAATG TTTTCGGTTT CTTTTACGTA GGCTTCAATA TCATAACAAT 1800 
TGATGAGTTC TTTTCTTTTA GACATATTGA TTGAAATTTC ATTTATAAGT TTGTCAGTAA 1860

ATGCCTGTGT TTCGAAATAT AATTTTTTCT TATTTAGTAT CATAAAAAAT TACTCACTCT 1920 
TTTTTTTTCT TTCTTTTGCT CGTTTTATAA GAAATTCTTG AAAATCAATC AATTCTTTTT 1980 
CTAATTCTTC TATGTCATCA GAACTTAAAT CATTAGTGTT TAATCTGAAA TGTGAAATAA 2040
TATTTTGTCC AGCTTCTTTT TGATTATTTG AAGGTGTTGT AGGGTTATCA GTGCGTCCTG 2100

TTAAATAATC AATTGAAACA TTAAAAAAAT CAGCAATAGC AGTAAGTTCT TCTTCTCTTA 2160 

ATGGACGTAT TCCAAGTTCT ATTTTATTAT ATACACTAAC ATTAATGTCT AGTGCGTTTG 2220 
AAATTTCTTT TTGAGTGATT TCTCTATTTn CTCTtAATTG TgTaTCTTTT CGTAAGTnTT 2280
CATACTATAT CTGTCCTTAT CTAATGTTTA T 2311

(2) INFORMATION FOR SEQ ID NO: 269:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7989 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 
CAACACCAAT TTTATATAAT AATAGCGCAG CACCAGTAGG CCCCCACCTA ATACTTAAGC 
TTGTAATTGG TCGCGACTCA TCAGTAGGTG AATCCTTCGT AATCATTCCT CCGGTATTTG 
TACTACCATC AGGATGAATG ATCAAACTCT CTCCTTTACC ATTTTTCCAT ATTCCAACTA 
AAGTAGTAAA ATCGCCAGCA TTAATCGCTG TAGTATCTAA ATCAGCTACA GTTAGCTGAC 
TATTGTTTCC ATCAGCTGTC GGCTCTTCAT TTTTCACTGA TTCGTTAGAC TCGACACTGC 
TGCTCGCTTG TTCGTTTGTC TTATCAATTG TTTCACTCGT TTTTtCATTC GTCTCTTCAA 
CTTTAGAGGA TGCCTCCGTT GAAGCGGTTA CTTTCTTTnC TTTTGGTTCC CGACAaGCAG 
TTAaTGATAA AACCGCTAGC ACGACAAAAA aCGCTAACAT TCCCTTTTTC ACGCCTATTC 
ACCCCGTTCA ATGATCTATC TTACTAGACT ATTGTACTAT TAATTCAACT GTTTAGACAA 
ATATTTATGG AACAGACAAA AGTTGAACCT GTCATTGAAT GCTTTGTGTC ACATGGACCT 
ATCAATCAAC TTTCTTTTTT CTATGTATTA TATGAAAAAA AGTGAGCCAT CAGTCAAAAT 
GACCGATGGC TCACTTTCTA AATCTTCAAA TAAACGAACA AATGGATTAT TCAGGTAAAT 
CTACATTATG ATGGACTTTC TGAACATCTT CTAAATCTTC TAAAACGTCT AACATTTTTT 
CAAAGTTCCC TAAATCGTCG CCTTCTAATG TTACTTCGTT TTGTGGAACC ATTTCTAACT 840 
CAGTAACCGT GAATTCTTCA ATGCCTTTTT CTTTTAACGC TTCTTGTGCA TGGTGGAAAT 900 
CTTCTGGTTC TGTGTAAACG ATAATTTGAC CGTCTTCTTC AACAACATCG CGAACATCAA 960 
TGTCTTTTTC CATTAAGTAT TCTAAAATTT CATCCGCATC GTCACCAGCA AAGCCAATGA 1020 
TACCTGTATT ATCAAACATG TAGGAAACAG CACCGCTGAC ACCCATGTTT CCGCCGTTTT 1080 
TACCAAATGC GGCCCGCACA TCTGCTGCTG TTCGGTTCAC GTTGTTTGTT AAAGCATCAA 1140 
CAATGACCAT TGAACCATTT GGCCCAAATC CTTCATAGCG TAATTCAGAA TATTGTTCAT 1200 
CTGCTGAACC TTTTGCTTTT TCAATAgsGC GATCAATAAT ATGTTTTGGT ACATTGTATG 1260 
TTTTGGCGCG TTCGATTACA AACCGTAATT TTTGGTTTGC ATGAGGATCA GGGTCTCCTG 1320 
ACTTCGCTGC TACGTAAATT TCAATCCCGA ATTTTGCATA AACACGGCTA TTGTTTGCGT 1380 
CTTTGGCAGC TTTTTTCTCT TTAATATTTG CCCATTTACG GCCCATATTC TCACTTCGCT 1440
TTCTCTGATT TGTTGTTAGT ATCTTATCCC TTGTTTATTA TACTGGGGAA aCTGATTTTT 1500 
GTTAAGTATT TtTTAGGTAT CTGGCGAAAA TCGTAAGGGA AACCGACCAA AATTTTGTTC 
TTTTAAAGAA GAATCTCCCA TTTGGTATAG TTGATgGCGC GAATATCTAA CGAAAGCATT 
GCGCGGTTTT CyTTAGATAC TTTGGaAATT ACCGGCAATT GAATCTGTGC yTTGGkTTCy 
TTTAGATAAG CTTGTCCCTT TGGCGTAAAT CCTAGCACAT GGAGATAACT GkTTTGcCmA 
ACAGTTTCAA CTTCTTCTTT GGTCATATTT AACAATATAT AGGTTGCTAA TCGTTGCAAT 
CTTGTCCAAG TATAACGCTT CGTTTTCATA CGTTCCATTA ATTCGTGAAA AGAACCTGCT 
GCTTGGATTT GATCTTTTAG TCGATATTCA AGTCCTTCCG TCATTTGGTA AATGGCTTGT 1920 
AACTCTTCTA ACGAGGATTG AACAATTTTA TACTTTAGGA AGGGCCAATA ATTTTCCCAT 
GAAATCATTG TCTGTGTTTG TAAATGTTGT GCGGTTATAG AAGGCACCGT TGGCAGCACT 
TGCGTTATTT CTTGTTGAAA AACACTTTGC CTAATCGCCG TGGCGCTAGC AACTTTCCCA 2100 
GAAATCGTTG CATCATGAAA GCCCGCTTGT TTTCTAGCAA TTGGGTAGAG CGTCATCGGG 2160 
GTTGGATACG TGGCATTTTC CTTTGCATAA CTCATCCCCA AAATATGATT AGGCGAAGAA 
AAATCAAACC TGCTTTCAGG ATATACTTGA CGAAAAACAG CAGTCATTTT TTGTGGATAA 
CTCAATTGTT TGTCTGTCAA TGCGTGGAAC GTTTGATCAA TTAAGGATTG ATTTTCTTGG 
ACAAACTGGC CGAAAGCCGC ATAATCAATT GCGCTTGTAC TATCTGTTCC AAAACATAAG 
CTCTCACATT GCAACGCCTG CAATAACTTT ATACCGCCTT TGGCAAAGTA ATCCGCAGAC 
TGTACCGACC AAGCTGTCGG CAATTCAATG ACTAAATCCA CACCATTTTG CAAAGCTTCT 
TCTGCTCTGG CCCACTTATC TAGTAAGGCT GGTTCTCCTC TTTGTAAAAA ATTTCCACTC 
ATTATAGCAA TCACTACTTC AGCTCCGCTT TGTTGGCGAG CTTGTTGGGC ATGATAGCGA 
TGTCCATTAT GAAAGGGATT ATATTCCACG ATAATGCCAC AGGCTTTCAC TCTAGCCACC 
TCCGTTATAA AAAAATGGAC TTATCAGCAA AGAGACAAAC GTTTTTAAGC GCTGTCCCAA 
GCTCATAAGC CAAATCTTAT TAATCTCTTT GACAAACAAA GAACCAGCGT TCACTTGTTT 2820 
CAGTTGGTGC TTCATCTGTA AAATCTGCAT AAACTTTGAC GTTCATGAAA CCAGCACTTT 2880 
CTAACATTCG TAAATAGTTA TCTAACGAAT AGGTTCGTTC TTGATGCAAT TCATCATACC 2940 
GAACAAAGTG CTCATCTGTT TCCTGTTCTT TCACAAAAAA TGTTAAGAAA TGTTCGATGC 3000 
TGTGTTCTTT TTCACCAGCG TAACTATCCC ATAGAAAAGC AAATTCTTCT GTTTGATAAT 3060 
GATAGCTATA TTCTGGAAAA ACTGAATCAA TTTTGTAAGT AGAATGAACA TCAAAAATGA 3120 
ACGTTCCTTC CTCTTCTAAC GCTTGATAAA CTTCATCGAA CACTTGTTGT ACTTCTTGGC 3180 
GaTTAGCGAT ATAGCACAGT GAATCAGAAA AACAAGTGAT GGCTTGGTAT TGTCCCACTT 3240

CAGATAAATC TAGCATATTT CCCTGAACAA ATTGAACTGG CACTTCTTCT TCAAAGGCCC 3300 
GTTCACTAGC AATCATCAAC ATTTCTTCTG ATAAGTCTAA GGCCGTCACA TCGTAACCAC 3360 
TTTTCGCAAA ATCAACTGCT AAAGCGCCTG TACCACAAGC CATTTCAAGA ATTTGTTGGG 3420

TTCCTTGTGG GAGATGACGA TTGGAAAAAT CCAACCATTT TTGGTATAGG CTTTCGTCCA 3480 

TCACTTCGTC ATAGACGAAA GCAAACGTTT CATAGGCCAT TTTAGTCGAC CCAAGCGCTT 3540 

AGATCAACCA TTGGGGCATC TGACCATAAT TTTTCTAAAT TGTAAAAGGC ACGTTCTGAA 3600 

GATTGGAATA CATGGACAAT AACATCGCCT AAATCGATTA AAATCCATTT TCCGCCTTCT 3660 

TTACCTTCCA CACGTTTGAC TTCAACTTGA TTTTTGGCTT CTTGTTCGTT AATTTCATCA 3720 

ACAATCGCAT TGATTTGACG TTCACTATTG GCTTGGCAAA TTAAAAAATA ATCTGCTAAT 3780 

AATGAAATCT CTCTTACGTC TAATGCGACA ATTTCTTCTG CTCGTTTTGA ATCTGCTGCT 3840 

TTGACAGCAA TTTCTAAAAT TTTCTGGCTA TCTATGATAG TTCCCTCCTA GTAATTCTTT 3900 

CTTCTTTTAT TTTTGATTAA CGACCCATTG ATTGTAGGTC TCTAACGTTT TTGGATAAAT 3960 

TTTATGTTCT TGTTCAATCA AATGTAACAA GGTATGTTTC GTTTCATAAG CAACCGCTGC 4020

ATCTAAGTCA ACAAGGGCAA TTTCACGGGC TTCTTTCACA CCTGGGAAAA TCACGTCCTG 4080 

GTTCAATGGT AATCTGCGAC ATAAATAATT TTATCTAGCA AGCTCATCTT AGCAGCCCCA 4140 

GTTGtATGCA CACGaATTGc CTGCAAGATT TCTTCGTCAT CAATGGCTAA TTCCCGTTGA 4200
ACAATATCCG CCCCAACTAA GCCATGCCAA ATTTCATTGC CATAGTTTAA TAAAGCCAAA 
TCAAAACCAT CTCGGCGAAT AATCAACTCG AATTCATCGT TTGGACGTTC TTTGGCGTAA 

TCATGCGTCA AAGCAGCAAT ACTGGCTTTC GCTTCATCGG CACCATATTT CGCTGCCAAG 4380 

GCCACAGCTG TTTCCTCGAC TCCTAACACA TGGCGAAAAC GGCGTTCACT CATTTGCATT 4440

TGAACTTTTT GCATTAATAA TTCACGTTGT GCAGCATCAT AATTTGTTAT CCAGCTCATC 4500

TTGATACAGT CCCTTCTCTT GAATATAGTT TATCACATTC TCTGGAAGAA GATAGCGTGT 4560
TGAGCAACCA CTTTTGACTT TTTGTCTAAT CAAAGTGGAA CTAATCGCCA TCTGCGGAAC 4620

ATCGACCCAA ATAATTGGAT AGGTAGATTC CGTTGGATAA TTAGGACGCC GAATGCCGAC 4680

AAATTGAACT AAATGGAGTA AGTCATCAAT CCGATGCCAT TTGGGCAAGT ATTCCACCAT 4740
ATCTCCACCA ATTATAAAAT AATAATCAGT ATCTGGATTG GCTTCTTTTA AAGCTTTCAT 4800

CGTATCATAC GTGTAACTTT TGCCTTTGCG AATTAACTCA ATTGGCTCAA TATCTAAGCA 4860

GGGATTATCA GCCACTGCTA GTTCTAACAT CGCTAACCGA TGTTCACTGC TAATTGTTTT 4920

CTTTTCGTCT ACATGTGGTG GTAAGTACGT AGGCATTAAG TAAACTTTGT CCAAACCTAG 4980

TTGATTTTGA ACTTGATCGG CCATGACAAG ATGTGCTAAA TGAACGGGAT TAAAATTACC 5040 
ACCTAACAAG CCGACTTGCT TTCTTGTTTG AAAACGTAAC GGTTCCTCTT GCAAGAGAAC 5100 
GTCCGCTCCT TGACGCGCTT GTCTTTTTTC TCCCATTGTT CATCCCTCCA ATATTAAATC 5160 
TCTTTGACTG CTGCTGAAAT TTTTTGATAT TTTTCTTTAC TTGATGGCTT GAATAAGACC 5220 
AACACGCGAC CAATAATTTG TACCACATCG CAATGAATGG CTGCTGTTAG TTCTTCTGCA 5280 
ACCTCTTCTG CTACTTCATC TGTATTTTGT AATAACGTAA TTTTAATCAG TTCACGTTTT 5340 
TCCAACGCTT CATCAATTTG TGTAATCATT GCGTCATTCA AACCACCTTT GCCTACTTGA 5400
AAAATAGGTT GTAAATGGTG TGCCTGACTA CGTAAATAAC GTTTTTGTTT TCCTCTAAGT 5460 
TTCAAATTGT TTGCCTTCTC TCTAAAATTT AAATTAATGC TTTTCTACGT AAAACATCGA 5520
CACCTTTAGG CGCCCAACCA GCGACTACAC CAGGTTCTGT GACAGTAATC CAACCTAGAC 5580 
CAGCAAACAC AATATCCGTT TTTTCTTTCA CGGAAAATTC AAAGCGCACT AATTCAGGAA 5640 
ACTCAGCGAC TTCATCGGCA CGTGGAGGCT GGAATAAGCC ACCAACATGT TTTTCATAAA 5700 
AAGCATCGGC GGTTGCTGTT TTTGTTCGGT GTAATTTTAA GTCATTGGAA ACGTAGGCAA 5760 
TGAAGGAACT TTTAGCACCT TGGACATAAT CAAAGCGAGC CAAGCCACCT AAAAACAAGG 5820 
TTTGTTCAGC ATTTAATTGG TAAACTTTCG GTTTGATTTC TTTTTGCGGT GCAATAATTT 5880
TCAAGTCTTT TTTaCCTAGG TAATGTGCCA TTTGATGACG GTGAATAATT CCGGGCGTAT 5940
CGATTAAGAA GTGGCCATCG TCTAATGGAA TTTCAATTTT ATCTAGCGTT GTTCCAGGAA 6000
ATTGTGATGT CGTAATCACA TCTTGAACAC CCACTGTTTG TTGAATGATT TGATTGATTA 6060
ATGTTGATTT TCCAACATTG GTCACACCTA CGACATAAAC GTCTCGGCCT TCACGATATT 6120
TTTCGATCGT ATCTAATAAC TCTTGCATTT CTTGTGGTTT CTTGCCACTT GTTAGTAAAA 6180
CATCAACAGG GCGTAATCCT GCTTCATGGG CCCGTTCTCT CATCCATTGA ACCATTTTAG 6240
GTTTTTTTAA TGATTTAGGC AAAATATCCA CTTTATTGCC AACTAATAAC ACAGGATTAT 6300
CTCCGATAAA ACGATGTAGT CCTGGAATAA GAGAGCCATT AAAGTCAAAA ATATyCmcTA 6360
CGTTCACAAT CAGTGCATCT TCTGTTCCTA AACCATTCAA TAAACGTAAA AAGTCATCAT 6420
CTGTTAATTG GACATCTTGA ATGTCATTAT AATGTCTTAA GCGAAAACAA CGTTGGCAAT 6480
AAACTTCCCC TGTTTCCATT CCTTTTTCGA AAGCTGTTTT TGGTGTATAG CCTAATTCAT 6540
GAGGATTTTC CGTTTGGATA ATCGCACCAC AGCCAATACA ATGAATTGCC TCTGTCATTC 6600
TAATCCGCCT CTCCATGTCA TTTCTGGATG TTTTTTTGAT AAGTATTTCA TAATTTTCCG 6660
TTCAAAGAAC CGATTGATTC TAGTATTCCA GCCATCTGTA TCGACAATCG GTTGAACTAA 6720
AACATTGCGT ATACCCGCTG CGTTAgCTCC CCGAATATCT GTCATAATCT GATCGCCGAC 6780
CATTAACATT TCAGATGGCT TCAAGCCTAA TTTTTTTTCT GCTAATTTGA AGCCACGTGC 6840
GGTTGGTTTC AAGGCACGTG CTACATAATC TAAGTCAAAC TTTTCAACAA CGCGTTTAAT 6900
CCGGCTGTCT TTGTTGTTTG AAACAACGAG GACAGTAATT CCTGCATTTT TCATTTCTAA 6960
TAACCAAGTT TTTAATTCTT CTGTTCCATC GGGATTATTC CAAGCAATTA ACGTATTATC 7020
TAAGTCAGTT AGTACTGCCT TAATCCCTAG TTTTTTTAAT TGTGCGGGGG TAATTTTATA 7080
AATCGCATCA ATCATCCATG TCGGTTTATA TTTTGAAAAC ATCGTGACTC CTCTCACTGA 7140
CACTATGTAG TTTCTTCGAT TTTAAGAGAC TGAAACATAA CTAAACAATT ATGCTTCAGT 7200
CTCTACCTAT TGTATTATAC TGCATTTTAC TGAAAATTTC TATAAAGGAC GGTCGTTCCT 7260
TGTAAAAGAA CAAGTAAAAT AAAAGAAAAA ATAAAAAAAA GCCACCCATC AACCCCTGAA 7320




AAGTTCGTTG ATGAGTAGTA ACTTCTAATA TTTGCGAAAA CGTTGGTATC TTTTTTCTAG 7380

TAACGTTTCC GTATCTAATT TAGCTAATTC AGTTAATTTT GAAATGAACG CTTTTTGTAA 7440

CATGCGATTA ATTTTTGCTT GTTCCAGAGG CTGACCGTTC ATAACTTCTG GAATTACTTT 7500

ATCTACAATA GCTAATTCCT TCAGCTCTGT CGCTGTGATT TTCATCAATT CTGCTGCTTC 7560

TTTCGCACGA CTGCCATCTT TCCACAAAAT AGAGGCGAAA CCTTCTGGCG AAAGAACCGC 7620

ATAAATCGTG TGTTCCAACA TCCAAACTTC ATCTGCGACT GCTAAAGCTA AAGCACCGCC 7680

ACTACCGCCC TCACCAATGA TGACTGAAAT GATTGGCACT TTCAAATCAG ACATCTCTAA 7740

CAGGTTTTGC GCAATGGCTT CCCCTTCTCC CCGCTCTTCG GCACCAATCC CACAATAGGC 7800

ACCAGCAGTG TTCACAAAGG TAATGACTGG GCGATTAAAT TTTTCCGCTT GTTTCATTAA 7860

ACGTAACGCT TTGCGATACC CTTCAGGATT AGGTGAACCA AAATTACGTT CAATATTTTC 7920

TGGTAGATTT CGACCTTtTT GGaTACCTAC CACTGTAACT GGGTTTCCCT TGACAAAGTA 7980
GCAACACCA 7989

(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

AAACACAAGG GAAATTTGAT TATTCCTCAA AATATAGCGG TGTGCTTTGT GAAAAACATT 60 

GGCATTTAGA TGAACAGCGC TACCATGCAG ATCCTAGAGC AATTCATTTT ATTCGCTTAT 120 

TTTCGCAAGT CTCCTATGAA AAAGTCCAAA ACATTCAAGT AAAAGAAGAA ACTAAGAAAA 180 

GTATTCGAGA AACGATTGAT ATGTTATACG ATGAATATGT TGGTTTGCAC TTAAAAAGTA 240

AAAAATTTAT TGATCAAATG AAAACGTGGG AAAATACGTT AAAAATTCCG CCACGAAAGA 300 

AGGAAGAAAA ATAAACATwA ACAAGCAGAC GTTTTtCTAA AAAAAATTTG ACAACTTCCC 360 

TCAGACACCG TATTATGATA GAGAAGTTAA ATACGTACGA AGAAAAAGAG AAGTAAAAAG 420 

AACCCTCTGT TAAGCGAATC TGGGAGAGTG GGAGCCAGAA ACACGGAACT TTTGAAAGGC 480

ACTTTGGAGT ACGACAAACG AAGCTGCCGA TGAACACATC GGAAGTAGGG TGGAACCGCG 540

ATAATTATTC GTCCCTATGT CTATTAATAG GCATGGGGAC TTTTTTCTGT GAATCACTAC 600 

GATTGGCGCA AATGCGACAG AGTAGATGAT GAACAGTACA AGGTGACCAA AGGGAACGTT 660 

GTCTCCTCTA TGTTAAAACA AACTGCTTAG CCAAAAACAT GGAGTAGATG ATGAACAGCA 720 

CAAGGTGACC AAAGGGAACG TTGTAACCGA ATAAAAAAAG CAAACAGTTA ACACCAAAAG 780 

AAAAAATAGT AACGTTTATT CAAAAAAGCT CAATTGGCGC AAAGGCGACA GAGTAGATGA 840 

TGAACAGCAT AAAAATTATC CCATTCTACA AAATTGTTTC TCTGTAGCCC TTTTTAAATT 900 
TACATTcCAG AAgCAAAGCC AATCCCTATG TCTATAAAAr ATATTCCAAG GaGGaAAACA
ATGaAAAACa AGTTAACCGT ACaAGAAATG aTTTTAACCt TGCaAAAATT TTGGTCaTCT 
AATGGCTGTA TGTTAATGCA AGCCTATGAT ACAGAAAAGG GAGCCGGTAC CATGAGTCCT 
TACACGTTTT TACGTGCCAT TGGACCTGAA CCATGGAATG CTGCCTACGT AGAGCCTTCT 
CGTCGTCCAG CTGATGGACG CTATGGTGAA AATCCGAACC GTCTATACCA ACATCATCAA
TTCCAAGTTG TAATGAAGCC TTCTCCTGAA AATATCCAAG AACTTTATCT AGAAAGTTTG 
AAATTATTAG GAATTGATCC ATTAGAACAT GACATTCGTT TTGTAGAAGA TAACTGGGAA 
AATCCCTCAA TGGGTTGTGC TGGTTTAGGT TGGGAAGTTT GGTTAGATGG CATGGAAATT 
ACCCAATTTA CGTATTTCCA ACAAGTCGGG GGCTTACAAT GTCATCCTGT CACTTCTGAA 
ATTACTTACG GGTTAGAACG CTTGGCATCA TATATCCAAG AAGTCGAAAG TGTCTATGAT 
TTAGAATGGA CACAAGGCGT TAAATATGGT GAAATTTTTG TTCAACCTGA ATATGAACAT 
TCAAAATATT CCTTTGAAAT TAGTAACCAA GAAATGCTTT TAGAAAACTT TGACAAATTT 
GAAAAAGAAG CTAAACGCTG TATTGAAGAA AGTTTAGTTC ATCCAGCCTA TGATTATATT
TTAAAATGTA GCCATACATT CAATTTATTA GATGCTCGCG GCGCTGTCTC TGTAACGGAA 
CGTGCAGGAT ATTTAGCCCG CATTCGGAAT ATGGCTCGTT CCGTAGCGAA AATTTTTGTG 
GCTGAACGTG AAAAATTAGG GTTCCCATTA TTAAATAAAG ACCAACACGT ATCCAAGGAG 
GCAGAATAAC ATGGCAAAAG ATTTATTATT AGAAATTGGC TTAGAAGAAA TGCCCGCACA 
TGTTGTCACA CCAAGCCGTA TCCAACTTGA AGAAAAAGTA ATCAAATTTT TAGATGAACA 
TCATTTAGAC TATGAAACAG TGCAATCCTT TGCCACACCT CGTCGTTTAG CCGTGAAAGT
AACTGCGATT CCTGAAAAAC AAGCGGATGT TGAAGAAGAA GTAAAAGGTC CAGCGAAAAA 
AATTGCTTTA GATGCTGAAG GAAATTGGTC AAAAGCTGCA CAAGGTTTTG TACGCGGCCA 
AGGAGTTACA ACTGAGGACA TTGTTTTCAA AGAATTAAAT GGTGTCGAAT ATGTCTATGT 
GACGAAGTTT ACCAAAGGCC AAAGTGCCAA AGAGGTATTA ACAAAATTAA ATGACGTGAT 
TACAAGTCTA ACTTTCCCGG TAACAATGCA CTGGGCAAAC TATGATTTCG AATATATTCG 
TCCAATTCAC TGGATTGTGG CACTACTTGA TGATGAAGTA ATTCCTTTCA AAGTGTTAGA 
CGTGACAACA GGGCAAACTT CTCGTGGCCA CCGTTTCTTG GGAGACGATG TGACGTTCCA 
ACATGCTAAT GAATATGAAG CAAAATTAAA AGAACAATTT GTAGTGGTTC AACCGAATGA 
ACGGAAACAA ATGATTGTTG ATCAAGCGAA CGCTCTCGCT GCAGAAAAAA ATTGGCAGTT 
AGCTTTAGAT GAAGAATTAT TGGAAGAAGT TACGAATTTA GTGGAATATC CAACTGCATT 
TGTAGGTAGT TTTGATGAAA AATATTTATC TGTTCCAGAT GAAGTATTAG TGACTTCTAT 
GAAAGAACAC CAACGTTATT TTGACGTTCG TAACGACCAA GGCTTATTAA TGCCGCATTT 
TATCGCTGTT CGCAATGGGG ATAATGTTCA CCTTGAAAAC GTGATTAAAG GAAATGAAAA 
AGTGTTAATT GCTCGTTTAG AAGATGCAGA ATTCTTTTAC AATGAAGATA AAAAATTAAC 
GATTGAAGCA TGTGTTGAAA AACTAAAAAA TGTGACTTTC CATGAAAAGA TTGGCTCAAT
TTATGAAAAA ATGCAAGGGG TTGGTTTAAT TGCACAAATC ATTGGTCGAA AAGTTGGCTT
ATCTGAGGAT GAATTGGAAG ATTTAAAACG TGCCTCTGAA ATTTATAAAT TTGACTTAGT 
AACAAATATG GTTGGTGAAT TCCCAGAATT ACAAGGAATT ATGGGAGAAA AATATGCCTT 
ATTACAAGGC GAAAAACCAG CCGTAGCAAC GGCGATTCGA GAACATTATC TACCAACTTC 
CAGTGAAGGT GAATTGCCAG AAACAGCAAT CGGGGCTGTT TTAGCCTTAG CCGATAAGCT 
TGACAGCGTC TTTTCATTCT TCTCAGTTGG TATGATTCCA ACAGGTTCTA ACGATCCTTA 
TGCAcTTCGT CGTCAAACCT ATGGCGTTAT CCGAATTATT GAAGATAAAG GCTGGACCTT 
CCCATTAGTT CAATTACAAA CAGAAGTTGA TGAAGCAGTG AATCAAGATG TTGAGAAATA 
TGGCGTTTTA CTAAACGAAG GACAAGCAGA AGTCGTTGAA TTTGTGAAAG CTCGCTTACG 
TCAATTATTA ATGACGAAAA ATGTTCGCCA TGACATTATC GATGCAGTGG TGTCTGCTGA 
GCAAGCCGAT TTATCGAAAT TATTTGCTTC AGCGAATATT TTAAAAAGCC GTTTTGAAGA 
TCAAGATTTT AAACCATCAA TGGAAGCCTT AACGCGTGTT ATTAATTTAG CGAAAAAAGG 
TCAAGAACTT TTAGGTGATA CTGAAGAAGG GATTGATCCA AGTCTGTTTG AAAATAAGGC 
AGAAAAAGAA CTGTATCAAG CAGTTAACGA TTTATCTGAA GCATTTGCGA CACGCACGAT 
TGCAGAAAAC TATGAAGCAC TAGTAAATCT ACGTCCGTTA ATTGATGCGT ACTTCAATGA 
AACgATGGTT ATGGTTGAGG ATGaAAAAgT CaAACAAAAC CgATTaAAAC AATTAATGCA 
AATTGCTAAA ATGGCGTTAT CaATTGCTAG CTTAGATTTA TTAATTGTGA AATAAAAAGC 
GCTGTTTTTC GACCGCTAAG GAGGAACCGA CAGTATGCTA GAAACAACAT TTGAACAAAT 
TTTAACACAA TTATCAAAGC CAGCTGTTCG TGCGCTAACG AATGAAAAAA TCGACAGTGT 
GGATGAACTC TACGCACGAG GCCGAAAAGC GTTACTTTCG TTGCATGGCT TTGGTCCAAA 
GTCTATTCGA ACTATTGAAG AGATGACTGG GAAAGAATTG AAGTAAAAAA AGGATCGCCG 
TTTAGAGGCG ATCCTTTTTT TATTTACGTT TTTGTTTTCC AGCATTACGA CCTTTATTGC 
CGTTTGCTTT TGGTTGGTTC GTTTTTTGCG TTACTTTGTT TGTTTCTTTT GGTGTCACAT 
CCTTGATAGG AGTAACCACT TGTTTTGGTG GGTTATTTTT CAATTCTTCT TGTACTTGTT 
TTTTAATTCT TGGACGTAAG AGGATATTTG TGATGGCAGA TTGAATACAT GTGAAAATAC 
CACCAACTAC CCAGTAAAGC GTACGCCTGC AGGTGAGCTA AAGGACATGA AAACAATCAT 
TAATGGTGAA ACAATCAGCA TTGATTTCAT TGTTTTCTTT TGTTCTTCAG GAATACCAAT 
CATTGAAATA TAGCCTTGAG CTAAATAGGC TAAACCAGCT AATGCCACTA AAATCATATT 
TGGTTGCCCT AAATTCATTC CTAAGAAGCT TGCCGTGGCA ATACCTTTTG TATAACGAGC 
AGCAAAGAAC AGAGAAGAGA AAATAGGCAT TTGAATTAAT AAAGGCAAAC AACCAATTCC 
GCCAACCATA CTCACATTGT TTTCTTTATA AATCTTTTGT AGTTCTGCTT GTGCGGCCAT 
TTGTTCTTCA CGCGTACTTG CTTGTTTCAT TTTTGCTTGT GCAACATCCA CTTGGGGTTT 
GATTGCTTGC ATTTTTTCCG TTTGGATAAA GGTTTTTTTC GATTGATGTA AGCCTAACGG 4920

TAAAATAATG ATCCGTACAA TAATGGTGAT AAAAATAATC GCCCAACCAT AGTTCCAATT 4980

AAAGTTATCG ACAAGATAAG TAATTGCATT ACTCATCGGT AAAACAAGGA AGTTATAGAC 5040

AAAACCTTCA CCAGTTGGTT GTCCATCTGC TCCTGTTTTC ACACAGCCTG ATAAAAACAT 5100 

TACTAAAAAT AGTAAGCCAG AGCCGTAAAG CCATAC 5136 
(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

GGCGAAATTG TTGAAAAAAA ACTAGTTACA GCACCTCTGT CAACTGAACA AATTGATCAA 60 

GCAATGGCCG AAATGACAGG TGAAaTCACA CAAATCCCAC CGATGTTTTC TGCGGTGAAA 120 

GTCAATGGTA AACGTTTATA TGAATATGCG CGGAACGGGG AAGAAGTTGA ACGGCCCCAA 180 

CGAAAAGCTA TGATTTATTC ATTTGAACGC ACTAGTGAAC CAATTTTTAA CGAATCCGCT 24 0 

CATACACAAA GTTGGCGGTT TAAAGTGGTC TGTGGCAAAG GAACCTATGT GCGGACACTA 300 

TCAGTGGATA CTGGCAAAAA ATTAGGCTAT CCTGCCCATA TGTCGGATTT AACGAGAACG 360

GCAAGTGGCG GTTTGCAAGc tGCGCAATGC TTAACGTTAG AAGAAGTAGC CAAACAAATG 420 

GCTGCTGAAA CAATCGATCA ATGCCTGTTG CCCATTGAGA CAGCAGTAGA ACAATTCCCA 480 

CGAATTGATT TATCAGATGA ATTGTATCmA AAAGTAAAAA ATGGCATGCG TTTGCACmAA 540

AAAGAATTAG GrATAAAGGC AaTGCCAGAA TCCTTAGTCG CCTTATTTTA CCAAAATCAA 600 

GTGGTTAGTT TATACATGCC ACACCCAACG CATGATAAAT TATTGAAACC AAGTAAAGTT 660 

CTACGGAACA ATTAAATAAA AAAGAAGGAA GTTTTTGCAC ATGCAAGTTA TTCAACTACA 720 

TCATCCCTAT GAACCCAATC AAATTCCTAA TGAAGAAGTC GTGATGGTCC TAGGCTTTTT 780

TGACGGTGTT CACAAAGGCC ACCAAAAAGT AATTGAAACA GGTAAAAAGA TTGCGGAGGA 840

AAAAGGCTTG AAGTTAGCTG TGATGACCTT CAACCAAcAC CCAtCAATCG TTTTTCAAAA 900 

AGTtTTACCA GAAAATATGA AATATTTAAC TAGCTTgGAA CAAAAAGAAC GCCTAATGGC 960 

CAACTTGGGC GTTGATATTT TATATATTGT CGAATTTACT TCAGCGTTTG CTCAATTAAA 1020 

GCCGCAAGAT TTTGTCGATC AGTATATCGT TAATTTAAAT AGTGCAGTGG CAGTCTCTGG 1080 

GTTTGATTAT ACGTATGGTC CTAAAGAAAT TGCTGGAGTC AAACAGTTAC CAACTTATGC 1140 

GCAAGGCCGT TTTGAAGTGG TGACAGTACC AAAAGAAGAA ATGACTGGTG CCAAAATTAG 1200 

CTCAACAAGA ATTCGCGAAA AAATGGAAGC CGGCGAAATG GAGGAGGTCA CGGAGTTGTT 1260 

AGGATATATT TATGAAACAG AAGGAACAGT TGTCCATGGC GATGCTCGTG GCCGTTTATT 1320 

GGGTTTCCCG ACCGCTAATG TCAAAGTGAA AAGCACCGTG CGTTTACCAA GAATTGGCGT 1380
CTATGCTGTG CAAATCGAAG TTGGTGGCAA ATGGTATGTC GGCATGGGCT CAATTGGTCA
TAATGACACC TTTGGAGAAG GACGCGATTT AACCGTGGAA GTCTATATTT TGGACTTCCA
TCAAGACATC TATGGTGAGC AAGTGGTGGT TCGTTGGAAT CATTATTTAC GTGACCAAGT
GAAATTCAAT GGGGCTGAAG CCTTGATCGA CCAATTAAAA CAAGATGAAC GAGATACGGC
TGACTATTTC CAACAATCTG AGGACAAATA ATGAAGATTC GTCAAGAAAA ACCTGCAGAG
TATCAAGCTG TAGAACGTTT AACGTATCAA GCGTTTAAGG AATTAAACTT GACCGAAAAC
TGGCGTCCCA CAGAGCATTT TATCGTGCAT TTATTAAGAG AAAGTGCCGA CTTTATTCCG
GAGTTGTCCT TAGTTTCAGA AaCTGATCAA GGAAAATTAA CGGGTCATAT TATGTCTTCA
AAAGCCAAAC TACGATTACC TGATCATACA GAAAAGGCTG TTTTGACCGT TGGGCCGCTA
AGTGTCCATC CAGAAGCACA GAACACGGGC GTAGGCTCAG CGTTAATTAA GCATTCTGTT
CAGAAGGCCA AAGAATTAGG CATCGGCGGA TTGATTATCC TGGGCCATCC AACTTACTAT
ACGAAGTTCG GTTTTGTTCC AGCGACTACT TTTCAGATTA CGTTACCGGA AAAAGAAACA
TCAGAAGCAT TATTGGCTTT GGAATTACTG CCAGGTTATT TTGGCACAAG TGGTGGGGAG
TGGCATTTTT CAACATGCTT TGCTTATCCA GAAACACACC CTGCAGAATT AGAAGCCTTT
GAAGCAGAAC TCGGAATAAA TGAATAAAGA ATAATAAGAc GCGACAACTA CTAGTTAAGT
ACTGGTGGTT GTCGCGTCTT TACTACTTTT TATTACAAAG TAGTTGACAG AACCGGTACT
CAGTATTATT ATGTAGTGGT ACGTAGTACT AAAGAGTAGT TTAGGGAGGA AAAGCAGTGA
AGCAGACAGA GTTGCTAAAA GGCATTTTAG AAGGTTTAGT TTTAGCCATC ATCCAAAGGA
AAGAAACATA TGGTTATGAA ATTACAAAAA TTTTAAATGA CCAAGGGTTT ACTGAAATTG
TTGAAGGAAC CGTTTATACG ATTTTACTGC GTTTAGAAAA AAATCAATGG GTTATAGCTG
AAAAAAAGCC kTCAGAAAAA GGGCCAATGC GGAAGTTCTA TCGCTTAACC TCATCAGGAG
AAGCAGAACT CGCTGATTTT TGGCAGCGAT GGACGTTGCT TTCAAAACAA GTAAACAAGA
TGAAAAAAAA TGGAGGGATA GAGCATGTCA AATTTTAAAG CGTTGATCAA AAAAGTTGTT
GGGGATAAAA AAGAGTACAA AGAATATAAA CGACGGATTG CAGCATTACC AGTAGAGTAT
CGACAAGTAT TTCAAGAAAT TGAAAAATAT GCTTGGCATT TTTCAGATCA TAGTGGAGCC
AATATGTTCA ATGCGTTGAC TGATTTACTT GATTTGTTTG AAGAAGGAGC TGCTAACGGA
ACACCTATCA AAAATATTAC GGGAGAAAAA GTAGGTGATT TTGCTGAAAC AATTGTCAAT
GAAGTCGCTG GCAAATGGAC TGATAAACAG AAAGAAAAAC TGAATCATCA ATTTTCTAAA
CAATAGTCAA AACAAAAAAA GGAGAATAAA CATGGTTGAA AATTTTTTTA AAGAAGGAAA
GTTGCTGGTT ATTCCCAAGA AAAATGCGAA AAAAGTCCAA GTGTTTCAAC GTGTGGCCAA
CCAATTTGAA TTTGGGAAAG AATATACGGA AAAAGAAGTC AATCAGCGAC TTAGAGAAAT
CTATGAAGAT TATGCACTTT TGAGACGGTA TTTAGTAGAT TATCAGTATT TAGAAAGAGA
TAAGTTTGGA AAAATTTATC AGAAATGTGA GCAACACACA AAAATTATAT AAATGGCTTT
TTTTAAAAGT AGCGAAACTA AGTGAAATGA ACCAAGTCTA GCTACTTTTT GTTATGTAAG
CTTAGTAATA GTTCTATAAA AAACTCTCTA TTTGTTTTCT AACTCAGACT GGTAAGATTG 
ATTTAATTAG AAAAAGATAG AGGTGAGAGA ATGAATCTAC GTACTTTAGG TCTAACAAAA 
ATTCTTTTGA CCTTTATTAT ATTAAGTATT ATTGGCGCGT TAATGATTTA TAGCGCCAGT 
AGCTATGACT TGTTGATGCA AGGCGTGAAG CCCACTGCTG TGTTTATTAA ACAAGGCATT 
ATTATGTGTC TCAGCTGGGT GCTAATGTTC GTGATTTACA AAGTTCGACT AGAAGTGTTA 
TTTAATAAGA AAATTGCGAT TGGACTGCTT TTAGTTTCGG TACTATTACT GTTAATGGTT 
CGACTACCTT TTTTTGGTGT AGCAGTGAAT GGCGCACAAC GTTGGATTTC TCTCTTTGGA 
ATCCAGTTTC AGCCATCAGA ACTTTGTAAT TTTGCGATTA TTTATTACTT GAGCTGTTAT 
TTAGGTGAAA AAGAAAATGG CTTGACAACG AAGCAGTTAA GGAAACAATG GCTGTTTGTT 
TTGGTCGTTG CATTTTTAGT TTTGATTCAA CCCAAAGTTG GCGGGGCCAT TCTTATTTTA 
GTTATTGGCA GTGTCTTGAT TTTCTCAGCA AGTATTCATG CAAAATTTAG TGTAATTGCT 
GCGGGGATTG TCGTTGCTAG CGCGGCGCTT CTTTCTAAAA TAATTATTTT TCTAGGCGAT 
CACCGCTACT TACCACATTT TTTTGCGCAT GTGTATGACC GCTTAGTTGT TTTGAAAAAT 
CCGTTTTTAT CCTTTCATGA TCGAGGATTT CAGCCTTCAA TGGCCTATTT AGCAATGTAT 
AACGGTGGGT TTTGGGGCAC TGGACTAGCC AATGGTATGG TAAAAAAAGG TGGCTTGCCA 
GAAGGTCAAA CCGATTTTAT CTTTGCGGTG ATTGTGGAAG AACTTGGTTT GATTGGCGGC 
TTACTGTTGC TATTTCTTTT GTTATTTTTA GCGGCCTCGA TTTTGCGGAG TAGTTGTGTA 
ATAAAGAATC ATTGTTACGG CTTGTTTTTG TTAGGGGTCG GGACCTTGAT TTTAGCTCAA 
ACGGCGATTA ACATTGGTGG CGTTTTGGGT TTGATTCCAA TGACGGGAAT TCCTTTACCG 
TTTGTTAGTT ACGGTGGGAC AAGTTATTTA ATCTTTTCAG TGGCATTAGG AATCGTCATT 
AAAATTATTG CAAACGAAAG GCGGCAGTTA AATGGTCAAT ACAAAAAAAT TCAACTTACA 
TCTTAATTAT GATTTATTAT TGCCCATTTT TTTACTCACA CTTTTATCTT CTGGTGTGCA 
ATATTGGATT GCCGTTAATG AAGGAAAAGA TGGAACTGTG CCaGCATTAA AACAACTTTT 
CTTTATTTTT GTTGGCTATG CAGGCATGTT TTTAGCTAGT CGGCTGTCAC AGAAATTTAT 
TTGGAAAGTA GCTCCTTTTT TCTATGGGTT TTCATTAATC TTAATGAGTG CTTTGTATTT 
TTCCTATGAC AAAGGAATGT ATTTATTGAC GGGCACAAAG CGCTGGTTAG ATTTAGGGTT 
TATTAAATTT CAGCCATCAG AAATTGCGAA AATTGCGTTT ATTTTAATGT TGGCAAAAAT 
TATTGTCCAA CATGAGCAAC AAGACTGGTC TGATAAATGG CGTTCAGACA AGCAACTTCT 
AAAAAAAATC GTTGCGGTTA GTGTCCCTGT TTTTTTCTTG ATGGCGGTAC AAAAAGATTT 
TGGGACATCC CTTGTTTTCG TAACGATTAT TCTATCTCTT TTAGTTATTT CAGGAATTGA 
TCGCAAAATT TTAATCATTA TTTTCAGTGC TTTAGCAACG TTAGGCGTGG TCTTGATTCT 
GTTAGTCTTT ACAGAGTGGG GGCACAAAGT ACTCTTCTTT TTACATTTTA AACAATATCA
ATTGGATCGA ATTTTAGCAT GGATTCATCC CTATGATTAT GTCGATAAAA TCTCCTATCA
ACAAGTACAA GGCTTATTGG CAATTGGTTC GGGTGGTTTA TTTGGCAAGG GTGTGCATGG
GATTGAAGTG TATGTTCCTG TTCGTGAGTC TGATATGGTT TTTACTTTTA TCGGGGAAGC
CTGGGGCTTT GTGGGCAGTG CCACCGTTGT TTTTCTCTAT TTTTACTTGT TTTATCAAGT
TTTAGTAGCT GGCTTGCGGA gCAATTCGCG TTTTTGTATG TACATCTGTG TCGCCCTCAT
TTTTTCGCTG GTCTTTOAAA CGGTGGAGAA TATCGGTGCG GTGATTGGAC TGTTGCCGTT
AAAAGGTATT CCGCTTCCGT TTTTAAGTCA AGGAGGCACT TCGTTAGTGA TGGCTATCAC
TTCTTTAGGA TTTGTCaAAG GAAGAGAAGC CGCTACCTGA AAACACAAAT AAAAACCAAG
ACTGGAACTC GCTTGCCGAC AAGCTCTAGT CTTGGTTTTT ACTATTGAAA TTGTTTTTGG
CATTCAGCCA AAAGTTGTTG AAGTTGTCGA CTCATGTAAC GTTCTTTTAA ATAATAAACT
GCCATGTTTC GTTTAGGTAG TTCTTGGTAA GTTAGTGTAT GGATCGAATT GTCTAAAAGT
TGAATATTTG TGTATTCGGG GACAATAGCG AAACTATCGG AATGAGACAC CAAGTGAGTA
ATAAGTGAAA GAGTATCCAC TTGAATGACT TTCTCTGGAG AAAAATTAGA AAACGTAtAT
CTAGCCAATT GGTTAATTTG TTTTGCTCGG AAAATTCGGT ATTTAAACGG ACGAAGGGGT
ACATTGTTAG TTCTTTTAAA ATTTCTTCCG TCGTGGCTTT TGCCAAAGCT GGATGATCGT
TTCTTAAAAT GAATTGATAA TTTGCTGTAA ATAAATCAAT TTTACCAATA TGTTCAGGTA
ATGGTAAGTC AGCAAAGGAA ATACCAAAAT CTGCATCCAT TGTTTCCAAG GATTGAAGAA
TATTTTCTGT AATAATAGTG TGGACTTGTA AGTTGGATGA ACGATCCGTT AATGTTGTCA
TTAGTAAGTC ATTCAATTTA TATTCTAAAT CACAGAGATA AGTGGCCACA TAAATATGGT
TGGAAGAATG ATTAATTTCT TTTAGCTCTT CTTCAATTGA GACCAACTCA TTGGTGATGC
GCCAACAwTA TTTTyCTAAA aTTAAwCCaG CTGGCGTTAA GGTGGTAGTT TGGCCTTGGC
GATTAAAAAG TTCCGCGCCT AACGTTGCTT CAAGAATTTT TATTTGCTTA CTTAATGCAG
CTTGAGAAAT TGCTAAATTT TCGGCTGCTT TAGTAAATTG TTTCACTTTG CATAACTCTA
AAAAATAATG TAAACGAAAT AGTTCCATGG GAATTCCTCC TTTTTCTATT CCTAGAATAA
ATAACGTAAT TGTTCGAATA AATAACGGTG TGCTTCACAA TGTCCGTGAT GATTATTCTG
AACTAAATAG ATGGcTCGTT CTGGaCTATC TTCCAGAGGa ATACCAACTA AGTTACGTCT
GTTTAGTTCa TCAGTGaGTG AATAGGGAAG GATGCTACAG GCTTTTTCTC GTTGGACCAT
GTCTAAGACT AAATGAAGAG AAGGAAGTTC GATGACGATT TGCCGATTTT CCCAATTGAT
TTTACGTTCC CAACGTTTGA ACACTTTTTT TTCATATACT TGATCGGAAT ATGCAGCAAA
AGGTAAAGAA CGAATTTCGT CAATCGTCAC GCTATCGGCA TGGGCTAAAG GATGATTTTT
ATTGACACAA AGGAGATAGC TTTCTGTATA AATTAATTCT TTCTCTAATG GAATCGAGGT
ATCCAAAAAT ATCCCAATCC CAATGTCAAT TAAATCTGTT TCCAAGGCTG TTTCAAGATC
GTAATAATAC TTAAAATCAA TACTTACATA CGGATAGGAT TGATTAAATA ACAACAAGAC
TTCTCCAAAT GCTTTATCTA TATCAATTAA GTTTGTTCCA ATCCGAATGG TCGTTGTCGA 7380

TCCTTGGGTC ACTAATTCAT TAATCTCTTC TTGCGCTTCC TTAATGAGTG CAAAAATCAT 7440

TTCTGTTCGC TTGGCGACGA CTTCGCCTGC TGGAGAAAGC CGACAATGTT TCTTGTCTCT 7500

TAAAATTAAT GAAGCACCTA ATTCACGTTC CAACTCCTTG AGCTGGGTAC TTAAATAAGG 7560 

TTGACTCGTA TGGAGCTTTT GAGAAGCTAA AGTAAAACTC GCATACTTTG AAACTGCCAT 7620

AAAATATTCT AGTTGTTGTA ATTTCATTTT CCTCACCGCC GTTTCTTTAA CTATACAAAA 7680

AATAGCAGAA GAAAAGCTTC TCGACAAGGC TATCTGATGG TCTTTAGATT AATTTAACCA 7740 

GTTATGATTC TATGAATAAA TACGAATCAT ACCAGAAAAC CATAGTAAAT AAGCGGATTA 7800 

TTTTTCTTTT TGTTACTGAA TGGACAAAAA GTTATCTACT AAAAACTTTT TCTATATCGT 7860 

TATTTTTTCA TGGAAGAATT ATACTTGCTG TTGTAATTAA ATAAGATTTG TTAGGTGAGG 7920

CTCCTATACA AACATAGGCT ACTGCCCAAA AATGTCGAGA GACGCCAATG GGTAGAACAG 7980 

GAACTGTCGA ATGAAGGCTT TTCTTAAGGT AGCTAAAAGC AATCGCTTTT TACGTTGTAT 8040

AGTGCTAAAG CTCAACGATA AGAACGATTA TTCTTTGACA CGTtGAGTTT TCAGCGTGTC 8100 

TTTTTTTGTG GAAAGGGTGA GGAAAATGGA AGATGAACCG ATCGTCAATT CAAGTCAACC 8160 

ACAGATAAAT GAGGTGAAAG AAAATGGAAA AGAAAATGCG TCACTTGGGC CATGACTATA 8220 

GTTACAAAAA GTTTGCCCAT TCAGAAGTAG AAAAAGTTTT ACAACGTTTT GGCTCAACTT 8280 

GGGATGGCAT TAAACACCAA GATATTGAAG AATTACAAGA ACGTTATGGC CGCAATAAAA 8340

TTATGAATGA AAAAAAAGnT TCGCGGcTGC GACTTTTTAT AAAATCGTAT ATCACGCCGT 8400 

TTACTTTGGT GTTACTAGCA CTAGCCACGA TTTCATTTTT TACAGAATAT GTCTATGCTG 8460

CGCCAGAGGA AAAAGATGTT ACTGGCGTGT TAATTATGTT AGCGATGGTC aTCTTAAGCG 8520 

GCACGATGaG TTTTGTCCaA TCAGTCaAAT CaAGCAATGC aGTTGAAAAA TTACAAAATA 8580 

TGATTAAAGT TACAGCGACA GTCATTCGTG ACAAGAAGCA AATGGAAATT CCTATTGAGG 8640

AAGTTGTGTG TGGCGACTTA GTCCAACTAT CCGCTGGCGA TATGATTCCa GCCGATTTAC 8700 

GTTTAATCCA ATCTAAGGAC TTATTTGTTT CCCAATCATC CTTGACTGGA GAGAGTTTTC 8760

CCATTGAAAA ACACGCGACT CATCAAAAAG ATAGTGATCA AATTGATACA GAATACGACA 8820 

ATCTTTTGTT TTTAGGAACA AATGTAATTA GTGGCACTGG TTTGGGCATT GTCATTAAAG 8880 

TTGGTAATCA AACTTTATTC GGAAGGATGG CTTCTGATAA CGGAGAAGAG CAGCAGCAGA 8940

CGAGTTTTGA AACAGGAATT AATAAAACAA CGTGGGTGCT GATTCGCTTT ATGTTGGTCA 9000 
TTACGCCAAC TGTTTTTCTA ATTAATGGGT TAACAAAAGG TGATTGGGCG GAAGCGTTAA 9060 
TGTTTGCGAT TGCCACAGCT GTCGGCTTGA CACCAGAAAT GTTACCTATG ATTGTAACAA 9120 
CCAATTTAGT CAAAGGTTCT CGAGAAATGG CTAAAGAAGG CACGATTATG AAAAATGTCA 9180 
ATGCCATTCA AAATTTTGGC GGGATGGATG TACTTTGTAC GGATAAAACA GGTACCTTAA 9240 
CACAAGATAA AGTAATTTTA GAATACCACT ATAATATTGG GTGCCAAGAA GATCAAAAAG 9300
TCTTGGATTT AGCATTTCTA AACAGTTATT TTCAAACAGG TTTGCGTAAT TTAATGGATA
ACGCCGTTAT TCAAGCGGCG mCACArGAAA GTGATATACA ATCAGATGAT TTTTATAAAG
TAGATGAAAT TCCTTTCGAT TTCAATCGAC GTCGAATGAG TGTGATTATC AAAGAGTTTA
AAACGCGAGA AACACGCTTA ATTACCAAAG GGGCGGTTGA AGAAATGCTC TTGGTCTGCA
CACAGGTACT TTTAAATGGC GAAATTGTGC CCTTAACAGA GACGCTGCGT CAAAAAATTA
CCAAAGACGT TGAAGCTTTA AATCGAGATG GTTTGCGCGT TCTAGCAATT GCCGATAAAA
AAGTCGAAGC GGCTGAATGG GAGTATACAA CTAAAGATGA ATCAGAGTTG ATTTTACAAG
GCTATCTTGC TTTCTTAGAT CCACCAAAAG AAACAGCGGC GGCAGCTATT CACGCCTTGC
ATCAACATAA TGTAGCTGTG AAAGTATTAA CGGGGGACAA TCAGTATGTG ACGCATTCTG
TTTGTAAAGA AGTTGGTCTG GCTGGCGAAA AAATTATTAC AGGAAATGAA CTTCAACAAT
TAAACCAAGA AGAATTAAGA AAAACCGTTC AAGCGTATAA TATTTTCGCC AAAATTACGC
CTGACCaAAA AGTTCGTATT GTAAATGCGT TAACTGAAAA TGGTCAAACT GTTGGCTTTT
TAGGTGATGG AATAAATGAT GCTGGAGCAA TGCGTGCTGC TGATGTAGGA ATCTCTGTTG
ATACAGCTGT CGATGTGGCG AAAGAATCGG CGGATGTAAT TTTATTGAGA AAAGATTTAA
TGGTTTTGGA AAAAGGGATT TTATCTGGAC GGCGCGTGTT TACGAACACG ATGAAGTACG
TCAAATTAAC AGCAAGTTCT AATTTTGGCA ATGTTTTTTC CGTGATCCCA GCAAGTATCT
TTTTACCATT TTTACCGATT GCTCCGATTC AATCGCTATT ATTAAATCTA ATTTACGATA
CTTCGTGTAT GTCAGTTCCT TGGGACAAAG TAGATGAAGA ATACGTAGAA AAACCAAAAA
AATGGGAACC AAAATCAATT GGTAATTTTA TGCGTTGGTT TGGCCCAACG AGTTCTATTT
TTGATATTGT TACCTATTTA TTTATGTATT TCATTGTGTG TCCGGCTATC CTAGGTGGCT
CATTCTTTGA ATTAAATGGA GCAGATCAAT TGCTCTTTAT CGGTATTTTC CATGCAGGGT
GGTTCATTGA GTCCTTATGG TCCCAAATGC TCGTCTTGCA TTTCCTACGA ACAGAAAAAA
TGCCTTTCCT TCAAAGTAGT GCTTCTGGAA tTATGACGTT AGcACtaCAG CAGGGaTTGT
TCTCGGgAAC TGtTTTACCC TTTACCGCTT TCGGTGcAGA ACTAGGATTT GTCGGTTTAT
CACCAaGCTA TTTCTnGTAT TTAATCCCGA CAATTGTTGC TTATTTAGCA CTGGTTGCgT
TCATTAAAGT TTTATACGTC aAACGTTACG GTCaATTACT GTAACTGATT TGTACTATTA
TATATAGAAA AATTTTTTTA CTTTTCTTTC aAGGTTGAGG AGGTGTCGGT TTTTATGACA
GCTCCTCTTT GTTGTTTTAA GTGACTGTGC GAATTTTTTT ATGGATTTAT TTGTCATTTA
CAGTTAGTAT GGCTAGTAGA GACTTTCCAA AAAGAACGAG CTTTTTGGTA ATAAGGAGAT
AAAACGAGCT AATGACCAAA ACAAACCAAC TTTCAACTTC AACGCCGATG TTCGATCGGT
CGGCCTATAT TCATATTCCA TTTTGTGAGC ATATTTGTTA TTATTGCGAT TTTAACAAGG
TCTTTTTAGA AGGACAACCA GTTGATGAAT ATATTCAGAG TTTATTAAAA GAAATTCAAC
TTACGCAAGc ATTGTATCCA GAACAAGAAA TGAAAACcAT CTATATTGGC GGAGGCACGC

CAACTTCATT ATCAGCGAAA CAGTTAGATG TGCTACTAAA AGGGGTTCGC GAACAGCTAA 11340

CGTTTGATGA CCGAAATGAA TTTACGGTGG AAGCAAACCC TGGTGATTTA ACCCAAGAAA 11400

AATTACAAGT CATGAAAAAC TATGGCGTCA ATCGTCTTTC GATGGGCGTT CAGACATTTG 11460

ATGATCGCTT GCTCAAAAAA ATCGGTCGCA AGCACACCGC CGCAGATGTT TATGAAACAA 11520

TGAAATTTTT AGAGAAAGAA AATTTTACCA ATGTCAGTAT TGACTTAATC TATGCTTTAC 11580

CTGGACAAAC ATTGGAAAGT TTTCGCGATA CTTTGACAAG AGCCTTAGCA TTGGACTTGC 11640

CGCACTATTC GCTTTATtCA TTAATTCTTG AAAATAAAAC AATGTTCATG AACTGGGTTC 11700

GTCAAGGACG GTTACAGCTG CCTGAAGAAG AAATCGAAGC ACAAATGTTT GACGAAACGA 11760

TTGAAGCAAT GGAGAAAAAA GGACGTCACC AATATGAAGT CAGTAATTTT GCTTTAACAG 11820

GAAAAGAAAG CCAGCATAAT CTGGCTTATT GGAATAATGA CCACTACTAT GGATTTGGCG 11880

CAGGTGCCAG CGGCTATCTT GGGCAAACTC GTTACAAAAA TCACGGTCCA ATCCAACATT 11940

ATTTGAAACC CTTGCGAGAA AATCAATTGC CAATTGTGGA GACTGAAGAA TTAACTCGTC 12000

TAAATCAAAT AGAAGAAGAG CTATTTTTAG GCTTACGTAA AAAAGTAGGC ATTTCTAAAC 12060

AAAAATTCCA ACAAAAATTT CAGGAACCAA TTGAAGCAAT TTATGGAGAG GTCATTCAAC 12120

GCTTAATCAA AGAAGAATTA TTAATTGAAG AAGCAGATAT CTTACGTTTA ACGAAAAAAG 12180

GGCTATTTGT GGGGAATAAT GTTTTTGAAG CATTTTTGTT AAGTGAAAAA GAGTAAAAAA 12240

ATAAAAAAAT CATCGAAGTG TGACAAAGCA TTTCGATGAT TTTTTTATTA CAAAAGTTGG 12300

TATTGATGGT TTTTGAAGGT GGTTTGCAGC GATTAGCACT CTTGCGTTAA GAGTGCTAAA 12360

AAGTTGCGTA TTTCCCTTGA CAAAAAGGTA AGGACTTGTT ATATTAGTAT ATGTAGTTAG 12420

CACTTAGATG TTAAGAGTGC TAATGAAGGG TGGTAATAmC GATGATTACA GAAAGACAAC 12480

AAAATATCTT GAGACTTATC ATCCAAAATT ACACGAACAC GGGCTTaCCT GTCGGTTCTa 12540

AGAAATTGAT GGaGGATGGc ATTGCCTCTA GTTCTGCCAC AATTCGTAAT GATATGAAAG 12600

CATTAGAGGA ATACGGCTTA TTAGCTAAGA CCCATTCTTC TTCAGGGCGA ATCCCCTCAA 12660

TGGCTGGTTA TCGTTACTAC GTGGATCATc TTTTACmACC mACACAAGTA GAAGAGAACG 12720

AACTTAGAgA ATCCGTCAAT CTTTTGGAAA AGAATtTCAT GAGATTAATG ATATTATTCG 12780

ACAATCAGCT GAAATTCTAT CAGAATTGAC AAGTTATACT GCTTTTTCAT TAGGACCTGA 12840

AGTGAAAGAA CGGAAATTAA CCGGCTTTCG AATGGTTCCT TTGAATGATC GGCAAGTCCT 12900

TGCTATTATA GTGACAGATA AAGGCAATGT TGAAAACCAA GTATTTGCCA TTCCTGCCGC 12960

TGTTTCTAGC CAAGATCTTG AAAAAATGAC ACAAATCATC AATGATAAGT TGGTTGGACA 13020

ACCTTTGTTG ACCGTTTATC ATCGTTTGCG GACGGAGATT CCGATGATTT TACATCGCTA 13080

TTTCCAAACA CCAGAAGGAA TGATGAACTT ATTCGATGAG ATGTTAGGAC ATGCCTTTGA 13140

AGAAAAAGTC TTTGTCGGTG GTCGCATGAA TTTGTTAGAT TTTGGCATCA AACAAGATAT 13200

TGAACAATTG AAATCTGTCT ATTCCTTTAT GCAAAATTCT GATGAGCTGA CGCATTTATT 13260

AAATGGTTCT GCAACAACAG AAAATCCAAT TGTTTTTCGT ATCGGCTCAG AAATTGGCAA 13320

CAATCTCTTA GAAGATATGA GCATGATCAC TGCTACGTAT GAAGTATCAG GACATGGCAA 13380 

AGGAACAATT GCCCTGCTAG GACCAACCAG TATGCCGTAT TCAAAAATAT TCGGTTTGGT 13440 

AGACACGTTT CGACATGAGC TAGCTTCACA ACTAGGTGAC TATTACCGTT TTTTAGGTAA 13500 

TTAAGCAAAC CTTGTTAGGA TAAGAAGTAG ATGCATAAAG AGAAAGGAAG TTGCTGAAGT 13560 

GAGTAAAAAA GAAGAGAAGC AAGAAGAATT ACAAGAAGAA ATGGAAGCTG TTGATGCTGC 13620 

TGGCGTTTCT GAAGTAGAAG TTGAAGCAAC AGAGATTGAA AATCTTAAAG CAGAGCTTtC 13680 

CGAAATGGAA GACAAGTTCT TGCGGGCAAG AGCCGAAaTT GCTAATATGA GCmATCGAAa 13740
TAAAAcGrAC GAGAATTATt AGTcGTTcCG CTCACAAGAT TwGGTAAAgA TTTACCACTA 
TTGTAATT 

(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3379 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

AATAAAGCAT CGTTACGCTC TTTAATTTTA GCAGCAGAAT TCGCTGTGAT TATTGCTGTA 60 

CTTTCACAAT TTACAATTCC ATTGGGGATT GTCCCCTTAA CTGGACAAAC ATTTGCCATC 120 

GGCTTAACCG CTACCGTTTT GGGCAAGCGA ACGGGGACCT ACGCTATTTT AATTTACTTC 180 

TTATTAGGAT TAATTGGTTT ACCCGTTTAC GCTGGGATGT CTAGTGGGTT CGGTGTACTC 240

TTTGGACCAA CTGGCGGCTA TTTAATCGGA TTTATCTTTA ACGGCTTATT AACTGGTTAT 300 

ATTTTAGAAA AAACAACCTT TAACTATACT TGGGCCATTA TCGCAAATAT TGCCGGCGCC 360 

TTAGTTACCT TAGTTTTCGG TATGATTTGG TTGAAATATT CCGCAAATCT GCCGTGGcCA 420 

AACGCTTTCG CTGGCGGGTT TGCTCCGTTT ATTATTCCTG GTATCATCAA GGCTGTCGCA 480

GCTGCTTATG TTGGTATTTT AATTCGCCAA CGCTTTATGA AACGTTTTTT AGCACACCTT 540

TCTTAAGGGG AAAACTTTTT TCTTTTCCTT GTCTGTAGTA CACTAAAAGG ACAAAGGAGT 600 

GAAGTCCATG TCCTCATTAT TAAAACGATT GGTTCAGTTG GTTTTGTTAG TCGTCGCTGT 660 

CTTGCTGATT CGACACTATG TTTTCTCCCC TGCTGCGGTG AACGGCTCTT CAATGGAACC 720 

AACACTTCAT AACAACGACC GTTTATGGGT GACCTCGATT AAAAAACCAC AGCGCTTTGA 780 

TATTATCGCT TTCCCTAGTC CTCGCAACGG CCAACGAGTA GCCAAACGTT TAATTGGTTT 840

ACCTGGCGAA ACAGTCGAGT ATCGCGATGA TACCCTTTAT ATTAATGGTG TATCACTCAG 900

TGAAGATTAC TTAGCAAGTG CTAAACGAAA TGTCTCTAAA AATGAAAATT ATACCCAAGA 960 

TTTTACGCTA GAGACCTTAG AAGCCACCCA ATCCCTGACC GTTCCAGAAG GCATGTATTT 1020
TGTCTTGGGG GATAATCGCC CGCGCTCAGA CGACAGTCGT TATTTTGGCT TTGTTAAACA 1080
AGCGAGTGTG GAAGGTGTTT TGACTTTTCG TTATTATCCA TTAGATAAAA TTGGCTTTCC 1140 
ATAAAATAGT AAACCGCCCT CAAAAAGTCT AACTTTTTGG GGGCGGTTTA CTATTTTTTT 
AAGGTCCAGT GACTAAGTTC CACGTCACCG TTCCTTGATA TTTCTCATTT ATTTTCCCTT 
GATTGGCAGG GACTTCTAAT TTCACATTGG CAAAATCAAA ATCTAATTGA TAGACATCAT 
TACCTGTAAA TTGGTTATTA AGAATTACTG GTGTAGATGT ACCGTCACTC GTCAAAGTTA 1380 
AGGCTTTCGT AACCCCAACT TTATTTGTTA ACTCTGTGAC TTGATTATAG TTTTCGATTG 1440

CTATAACATC CGCACTTCCT AATAATAAAC GCGTACTAGT TGGCAAACTA tCAGcTGACG 1500 
CTGTTAACTG TGACAATTGT GCTGTTAACT GCCAATTAGG TTGGCTTGTC TTTATTCGTA 1560 
GATATGGATT ACGTGTGCCG CTGCCATAGT AATCAGCTGC TTTTTTCAGT CCGTACTGCT 
TAGTGCTACT GTTAATCGTA ATTTTCCCAA AATCAAACAT TGGTGTACTA ATGAAACCTT 
CTTCTACTGG TGTTGGCGTA ATCGTTTGAT CTTCTCCTTG AATACGTACA AAGTTACTAG 
CTGTCACTGC AGAAAAATTT CCACTAGCTG TCATTTCTGC GGTTAAAAGT TGTCTTGGTT 1800 
CTCCCGTAAT TTTCGCTGTG TTCACTGTCA CTaAAGTTCG CCGTTCACAG GAATCTCTAT 1860 
ATCTAAGTTA ACTCCTGCTT GCCATTGTTC TGCCGTCACT AGAATAATTT TATTTGGTGT 1920 
ATTACCAATG CGAACAGCTA ATTGTGTTGG TGGTGAAATA CCTGCCGCCC ATTTGGCAGT 1980 
TGGGGCTAGT TTAACGTTGG TCAATGGAAC TTCACTGGTG TTTTTTAGTC TAACTGTCCA 2040 
AACCATAGAA GCTTCATTTG GGATCACTTC TAGTAACCGA CTAACCGTCA TTTCTGCTGT 
TGGTTGGACT TCCTCATAGA CCGCTGTCAT GTCATCTTTA TCGTCGTAAG TCACGGCGTA 
ACTTGGTGTT TTAGTCGTTG TCAAGGTGCC TGGTTTGGTT TTACCTTTGT ACCAGCCTCT 2220 
GAACTTATAG AATTTGTCTC CTGTTGTATA AGTGTCTGGT AAGGCTTTCG TACTAGCGTA 2280 
AGTGAAATTA TCACTATCAA TCACTGTTTG TTTTCCTTGC GTGAAGCCTG TTGGCGGTGT 2340

AATTTTAGCG CCGCTTGGAT CCACAAAGTT TTCAGTGACT CGGCGATTTT CTAAATAATA 2400 
GTAGATTGTT CCCTCTGGAG AAGCTAGGTT ATAGTTTTTT CCGACTGTCC CTGAATAAAT 
TCCCATAGCA TACAACGTGT TATTCGGGTC CCATGTCCAA AAAGCTCGTC TGGTTCTAAA 
AAAAGTTCCA GCGTCAGTTA ATTTGAAGGC TTCTGCTCCG TCAGCTTGAG AAACAACCGT 
ATATGGCGCT TCTATATTTT TCCGGATCTC TTTGCTCATA GGATACTTTG TTTTGTCTCC 
CGTATAAGTA GCTAGCGGTG TAATCGACAT TGTCTTATAT TGTTTAGGAA TTGTATGATT 
AACAGACCTT GCACCGTATG TCTTAAGCTT ATCTATCGGG AAATCATATG ACTTGCCAGG 
AATCGATAAT TTTTTTAATT TACTCATCAC TTCATCAGTT CCTTGGATTT TCCCTAATAC 
TGTTGATTCA GTTTCGTCAC TTTCACCTAA TTCGGCTTCT ACAGAAAAGC CTGCTGTATT 2880 
CAATAGGTTA TCTGCCTCAT CAACATAACC AAAGTACACA TCTTTGGCTG GTAAATTATA 2940 
CGTTTTCATC ACGACTTCTT CATAGACAAC CGTCAAATCA TCATCATCAT TATAAGTGAC 3000
TGGATAACTG GGAGCTTTAG TCGTCGTCAA GQTGTCTGGT TTGTTTTTAC CTTTGTACCA 3060

GCCTTTGAAC TGATAGGTTT. .TC.GTG.GCGACL TTTATAGGTA TCTGGCAAAT CTCCGCTTTG 3120 

CTTAAAGGTG TACGAATCAC TAGTAATTAC TGTTTTTTTC CCTTGGGCGA AACcTGACGG 3180 

TGCGGTGATT TTAGTACCAT TGGTATCTAC GAAGTTTTCC GTAACaCGAC GATTTTCTAG 3240 

GTAAAGATAC ATACGTGCAT TTTCATCTAG TGTATAGaCA CTCGGTTCTA CTCCTTTACC 3300 

AGTAATAGTC GCACTTAATA GTGTTGCAAG TCTAACATTT ATAGGCATTG ATGACCAGTA 3360 
TATACCGCTC ATCGTAAAC 3379
(2) INFORMATION FOR SEQ ID NO: 273:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1314 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 

TAGATTGTTT AAAAAAACTA TCGAATAATA GAGATTAAGC TTGCATTTTA GCGGTGTTTC 60 

CTGnAAAGTA AGCTTGTGTG GAAACACACC ATCTATCACT TGGTTTAGCG AGTCACCAAC 120 

GCTTCACTCA GTGATCAGAC GAATAATAGA GAAAGAGGTG GAACAAATGG CAATTTCACA 180 

AGAAAGAAAA AACGAAATCA TTAAAGAATA CGCACGTCAT GAAGGAGATA CTGGTTCTCC 240

AGAAGTACAA ATTGCTGTCT TAACTGAAGA CATTAACCAA TTGAACGAAC ATGCGCGTAC 300 

ACACAAAAAA GATCACCATT CTTACCGTGG TTTAATGAAA AAAATCGGTC ACCGTCGTAA 360 

CTTGTTAGCT TACTTACGTA AAACTGACAT CCAACGTTAC CGCGAATTGA TCCAACGTTT 420

AGGATTACGT CGTTAATCAT TACGTATAAA GCGAGGTTCA ATCAGAATCT CGCTTTTTAT 480

CTTATTGTCA CTATTGAAGG TCCTCTTTGG AAAGTTTTTA GGAACTACTA ACCAAATGAA 540 

AGAGCAGCAA CGACTTGTTA CTTTTTCATT TGTTTAGTAG CCGTCTGACT TTTCCAAAGA 600 

GCAAGAGTAA AGGAGAAGAA TTATGACAGA AAAACAAGTC TTTAAAACAA CTTGGGGCGG 660

CCGCCCACTA GAAGTAGAAA TCGGCCAATT AGCCAAACAA GCCAATGGAG CTGTCTTAGT 720 

ACGTTACGGC GATACGGTCG TTTTAAGTGC TGCTGTTGCC TCTAAAGAAG CAAAAGATGT 780 

AGACTTTTTC CCATTAACTG TAAACTATGA AGAAAAAATG TATGCGGTAG GAAAAATTCC 840

TGGCGGCTTC ATTAAACGTG AAGGTCGTCC AAGTGAACGT GCGACATTAA CCGCTCGTTT 900 

AATCGACCGT CCTATTCGTC CAATGTTCTC AGAAGGCTTC CGTAACGAAG TCCAAATCAC 960 

GAACATCGTT ATGAGTGTGG AGCAAGACTG TACACCAGAA ATGGCTGCAA TGTTTGGGtC 1020 

TTCCTTAGCG TTACAATTTC AGATATTCCG TTTGATGGAC CAATTGCTGG TGTTGACGTT 1080 

GGTCGTATTA ATGGCGAATA TGTTTTAAAT CCAACGGTAG AACAAGCGGA ACAAACAGAT 1140

ATTGAATTAA CTGTTGCTGG AACAAAAGAA GCCATCAATA TGGTTGAAAG TGGTGCCAAA 1200 

GAAGTATCAG AAGAAGATAT GTTAGGTGCC TTACTTTTCG GTTTTGATGC CATTAAAGAA 1260
CTAGTGGCGT tCCAAGAAGA AATCGTTGCC GCTGTAGTAA ACCGGGGATC TTAA 1314
(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

GCCGTTTTGC ATACAGAAGG ACCACTGTTA GTCATGGCTG GTGCTGGTAG TGGAAAAACG 60 

CGGGTTTTAA CACACCGAAT TGCCTATCTA ATTGAGGAAA AAGAAGTAAA TCCATGGAAT 120 

ATTTTGGCGA TTACGTTTAC CAATAAAGCT GCTAAAGAAA TGAAAGAGCG TGTAAATAAA 180 

TTACTGGAAA CGGGTGGCGA AGATGTATGG GTCTCCACTT TTCACTCCAT GTGTGTAAGA 240

ATTTTACGTC GTGATGTGGA TCAGATTGGT TACAACCGTA ATTTTACAAT TATTGACCCA 300 

TCGGAACAAA AAACATTAAT GAAACGCATC TTAAACGACT TAAATATTGA TTCAAAAAAA 360 

TATGATCCTC GCTCAATTTT AGGTACGATT AGCAATGCCA AAAATGAATT ACAAACGCCT 420 

GAAAAAGTTG CTGAAATGCA AGGTTCCTTA TATGAAGAAA TTGTCGCAAA ATGCTATGAA 480

GCCTATCAAA AAGAATTGCG CAAAAACCAA TGTATGGATT TTGATGATTT GATTATGAAT 540

ACGATTCGTT TATTTAATGA ACAACCAGAT ACGTTGGCTT TCTACCAAAA TAAATTTCAC 600 

TATATTCATG TGGATGAATA CCAAGATACC AATCATGCAC AATATACCTT AGTCAATTTA 660 

TTGGCTGCAC GTTTCAAAAA TCTGTGTGTC GTAGGTGATG CCGACCAAAG TATCTATGGT 720 

TGGCGCGGGG CGGATATGCA AAATATCTTG GATTTTGAAA AAGATTATCC AGATGCCAGT 780 

GTAATTTTGC TAGAACAAAA TTATCGATCC ACCAAAAACA TTTTGGCGGC TGCGAATGAT 840 

GTCATCAAAA ATAACCGTAA CCGCCGCGAG AAAGAACTGT GGACAGAAAA TATAGACGGT 900 

GAAAAAATTG TCTATTATCG CGGGGACACA GAACGGGATG AAACACAATT TATTGTGAGC 960 

CAAATCCAAA AAGAAATGCG GGAAAATGAC CGAATTTATG GTGATTTTGC GGTATTATAT 1020 

CGGACCAATG CTCAATCCCG TGTAGTGGaA GAAATGCTCt TGAAATCTAA TATTCCGTAT 1080

ACCATGGTGG GCGGACmTAA ATTCTACGAC CGTAAAgAAA TTAAAGATAT TTTGGCTwAT 1140

TaAGTTTGAT TGCCAATCCA GATGATcGAT TAGTTTTGAA CGAATTGTCA ATGAACCAAA 1200 

ACGAGGGATC nGTAAAAGTC AATTGGAAAA TnAC 

(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6415 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275:
AAAGGACCGC CTAAACCAAC AnCGACCTGT CGGAGCGnAC GATCATTTTA GAyCCAtGCC
CGTTsCCAAA TCACCCGTCG aGAATCAATC TTTAATTCAT TTCCTTGCTG TAAATGATTA 
TCTAGTAACG CAGCTAAGGC ATTATTGGCT GTCGTAATCG CATGCATATC ACCAGTAAAA 
TGCAAATTGA TTTCTTCCAT TGGTAATACT TGTGCGTAAC CGCCACCTGT TGCGCCACCT 
TTAATCCCCA TCACAGGACC TAATGATGGT TCTCTTAGGG CAATAACTGT TTTTTTATTG 
ATTTGATTTA ATGCATCTCC TAAACCAATG GTCACGGTTG ACTTTCCTkC TCCTGCTGGT 
GTTGGATTAA TCGATGTAAC GAGGATTAAT TTACCTTCTG GTTGCGCTTC TAAGGATTGA 
AGCGTTGGAA AATCTATTTT CGCTTTGTAT TTTCCATAAA GTTCGAGCGA ATCTTCTGAA 
AGACCGATTG TTTCGGCAAT CTCTTGAATA GGTTTTAATG TTGCTGCTTG TGCAATGTTA 
ATATCTGTCT TCATAACGGA CCCCTTTTCA ATTCTTTATC GTCTTTGTGA CACATTGTTT 
CTTTTAAATT ATAGCTTGTA TTGAATAGAA TGGCTAGAAT AACCTTTTTT AAACAACCCA 
ACCATTGACG TTTGGTAGCC TTTTCTCGTA TAATGAAAGT GTCTTTACAA ATCTAAAAGG 
AGCGTGAATT GGATGCAAAA AGGAATTGTC AAATGGTTCG ATAACAGAAA AGGTTATGGT 
TTTATCGTTT ACAACGAAGA AGATGAGATT TTTGTCCACT TCACTGCTAT AGAAGGTGAT 
GGATTCAAAT CACTTGATGA GAATCAATCT GTAGAGTTCG AGATAATGGA AGGCAATCGT 
GGGCTACAAG CAGCACATGT CAAAAAAATA GATAACTAAA AAAGCTGAAC CAACAGCGCT 
TAGCTCCAAA AAGTAAGTAA ACGTAGCGTT TGTTTCAGCT GATTATTTTA AGATTTTGAG 
AGTGGGTCAC CAGTCGTTTC GACTTATGAC TCACTCTTGT TTTTTTCTTC TTTTTTATGA 
AGGCCCAAAG AATGATCTTC TGCTACCTTC AATTCAGGCT TTTTTACCTT AAGTGCCTCA 
AGAAATAACG GAATCGTTTT CTTTTTCATT AAAAAGACTT TTGAGCCTTC CTTAAATTCT 
GAGGAAAAAA TTTCGATGCT TTTTTCATTG TAGGTTACTT TTTCTATCTG AGAAAGCGGA 
AGTGCTTTTT TGCGAAACCA CAAAACATCT CTGACAATTA AAGATGTTTC TGTCATATTA 
AAAGACCGCG CAATTCCTAG ATAAGCAAAC ACAAAAAATG GAACCATCAC CAGATTACTA 
ATCAAGTAAG GACCATTATT TTCCAATGCT AAAATTAAAC TAATAAATAA TATACAAAAT 
GTGCAAGACC AGTAAATAAT TGTTGgATGC TAATTCTGGg CTGgCCAATA GgAATTTCTG 
TTTTTTCATA AAGgTGCCCT CACTACTTCT gGATCAACGC CGCTAAATTT AAATATGTTA 
TAATAGTAAC ATACTTTTTT TAAGAACmAA AAGAAATTAG GAGTAATTTT AwGtTAAGaA 1620 
TTTATGtAGA TGCTGCAACA AAAGGCAATC CAGGTGAATC TGGCGGCGGT ATCGTTTACC 1680 
TCACCGACCA ATCGCGGCAA CAATTGCATG TGCCTTTAnG CATTGTTAGT AATCACGAAG 
CCGAATTTAA AgTGCTGATT GAAGCTTTAA AAAAAGCGAT tGCCAATGaA GACmATCAAC 
aAACCGTTCT TCTCCACTCA GATAGTAAAA TTGTTGTCCm AACmATTGAA AAAAACTATG 
CTAAAAATGA AAAGTACCAG CCTTATTTAG CAGAATATCA ACAACTAGAn AAGAATTTTC 
CTTTGCTCTT AATCGAATGG CTACCTGAAA GTCAAAACAA AGCGGCCGAT ATGCTTGCAC
GGCAAGCATT ACAAAAATTT TATCCCAATA AAAAGTAGCA CTGTTTACTT AATGCTTTTC
CTTTATTAAT TTGATAATTA AACACGTGGA GCAAAAATTC CaAGTGATTT TTGCTCCaCg
TTTAAAAACA GATAAAcGGT TCTGTCTCGA CTTCTTCTTA TAGCCaCTTA TTCTTTTGTC
GTTATTyCCG CAAATkGCCC ATtGGTTAGC GAAaGGATTG CTTCAGGCGC TAATTCAATT
TGCATGCCAC GTTTGCCTGC AGAAACAATA ATTGCAGAAT ATTGTTGAGC TTCTTCAGCC
AAATAGGTGG GAAATTGTTT TTTCATCCCA ATTGGTGAAC AGCCCCCACG AATATAGCCT
GTGGTTGCTT CTAAATCTTT TAAGTGAAGC ATTTCTACCT TTTTGTTGCC ACTAGCTTTC
GCTAATTTTT TCAAATCTAA TTCTTGATTA CCAGGAATGA CAGCCACGAC TGGACCTGTT
TTATTACCTA CTGTAACCAA GGTTTTAAAA ATTCGTCCTT TTTCAATACC TAAACTTTCA
GCGACACTTT CTGCACTTAA ATGATCTTCA CTCCAAGCGA ATTCATATTC TTTATATGGT
ACTTTATGTT GTTCCACCAT GCGCATCGCA TTCGTTTTTT GTTGTGTTTT TTTCTTTGCC
ACTTTTTTCA CCTTCTGAAT CACTTGTTTT GTTCGAATGT CTTTAAATAA TTTCTTAATA
CTTGTACGGA TAGTTGGCCT GGTGCAGACG CTTGCTGGGC GGAGCCAAAA GTTAGTGCCG
ATCCGAACAG TTGACCTGTT ACGCGACTAA TCATCCCTAA CTGTCCCATT GACATAGTAA
CGATTGGAAC GGATGCATAA TGGGTGTACA TTTCATTGGT TGCTGAAAGT AACGTAAGTA
CATCTGTCGC ATCTTGTGGC ATCACAGCTA TTTTACAAAT ATCTGCCTGA CGCATTTGcA
TTTGGCGTAa CCGCGCGACA ATCTCCTCTT GCGACGGTGT TTTCTGAAAG TCGTGATTAC
AGAGAACAAT TTTAATCCCC GCCTTTTTAG CTTCGTGAAT TAACGTATCT GCCGCCAGCG
GATTCGCAAA TAATTCAATA TCCAATAAAT CCAACGCACC CTTTTTTACT AATTCATGAT
ACAATGCAAA ATAATTTTCC TCGGAAAAAG CCATTTCGCC GCCTTCTTTT TGTGTACGAA
AAGTTAACAA TAACGGTTTT TGCCCCAACC GTTCCATTAC TTGTTGTGAA AGATTACATA
CATCGGAAAA ATCAGCGACA TTTTCATAAT AGTCTAAACG CCATTCCACC AAATCACAAT
CTAGTGTTTG GCTCGCCGTT GCTTCGGCCA AAATATCCTC GGCTGTTGGA GCAACAATGG
GCACCACAAT TTTGGGATTC CCTTCACCTA TTCGCACATT TTTGACTATA ACTGGCTTCA
CAGTTCCACC TAACTTCCTA GTTTGTTTTT CTCTCTTCAT ATCATACCAT TTCTATTCTT
TTTTGCAATT TAAAGAGCTA CAAAGACACC TAGAATTAGG AATTTCCTAA CACTAAATGC
CTTTGTTTGG GTTATTTAAC TCTTTCTATT GTTACGTTAA ACTTCCGTAC TTTGTAACTG
ATACATATCC GCATAAAnGG CACCTTCTGC CAGTAACGTT CGTGCGTTCC TCGTTCTACA
ATCTGCCCTT TGTCTAAAAC TAAAATCAAG TTTGCGTCAC GAATCGTCGA AAGGCGATGC
GCAATCGCAA TTGTCGTTCT yCCcTTGACG CATTTTGGCT AAtCCTTCTT GAATCaATCC
tTCTGTTTCT GTATCAATAT TAGCGGTTGC TTCATCTAAA ACTAAAATTT TGGGATCAGT
CACAATCGTT CGTGCAAACG AAATTAATTG ACGTTGCCCg CTAGAATAAC TCGCTCCTCG
TTCAATGACT TTCGCATGAT AGGTATTGGG TAACGTGTGA ATAAATTTAT CTGCCTGAAC
AAA7TCCGCA GCCTGTTTAA TTTGCTCATC TGTAATATTG GGATTCAGTA AACGGATATT 4020
TCCAGCAATA TCTCCATAAA ACATGAAAGC GTCCTGTAAT ACTAAGCCCA TCTTTTCCCG 4080 
TA.-.TTCTGTC ATGGGAAAAT CACGAATGTC ACGATCATCA ATTAAAATCT GCCCTTCATA 4140 
AAATTCATAG AAACGCATGA GTACATTAAT AATTGAACTT TTACCACTTC CTGTATGACC 4200 
GAC7AAAGCG ACGGTTTCCC CGGGATTTGC GACAAAGCTA ATATTTTTCA ATACTTCGTT 4260 
TTTGCCGTCA TAAGAAAAAG TAACATTGCG AAATTCAATT TTCCCGCGAA TAATTTCCCC 
ATTCGCACCT ACTGATTGTT GCGGCGTTAA TTCTTCCGTA TCCATAATTT TTAAAATACG 
GCTACCTGCG ACAATCCCAT CTGTAAAGAT ACTTAGGAAG TCCATCATTT GCGTCATCGG 
ATTAAAAAAA GCTTGGACAT ACGTTACAAA AGCATAAATC ATCCCCGCTT CAACAGGTGA
ATGTAAAGCA TCGATGCCAA ACATAGTTAA GGCCAATGCA ATCGCCAACG TATACAACAA 
ATTAATGaTG GGACTTAATA ATAACGaATT CATACGAATC ATTGcTaCTC GTGTTTTTAA 
ATAATCATCA TTGGTTTCCT CAAATTCTTT TTCCAAACGC TTTTCTTGGC GAAATTGTTG 
AATGATTTGC ATACCAGAGA TGTATTCATT TAGTTTGGTG TTTAATTGAC TTAATTTTTC 
CCgCATACTG CGATATAATT TTGAACTAAA TTTTTGGTAA TACCAAATCA CAACTAATAA 
AATCGGCAAG AAAATTAGAC AATAAAATGA GATTTCTGGA CTGAGTTGAA ACATTGCAAA 
AAATGACGAG ATTACGGCAA AAATTCCTGT AATCACCATT AGGAATACAT ACCAGAATTC 
AAACAATGTT TCTGTGTCAT TTGTAACACG AGACACCGTC GAACCCGCTG GTGTTTGATC 
AAAATAACGC ATACCTAAAG TGTGTAATTT TTCAAATAAT TTCACTCGTA TATATTGATA 
CGTTTTTAAA GAAGCCaTCG AATACAAATA CCATTGGAAA AACCAAACAA TACTTTTGAC 
AATGACACCA AAGAAATAAA TAGCAGCAAA AGTTAGAATC ACTTGCATTG TAGCAGTTTT 
AGGCGTTAAA TAATTATCCA TAAAAACTTG aATGACTCTA GGCAAAAGAA TATTGATAAT 
TGATAAGCCA AATGCAAAGA GAATAGCTGT GTAAAACATG CGTCGAAACG GTTTGGCAAA 
ACGAAACAAA CGCTGAACAA TCGTTAATTG TTCTTTCATC GGTAATTCTC TGGACCATTC 
GGATTGATAC TTTTCTTCCA TTAACTCTCA CTCCCTTCAA TTTTCGCTTC TAATTGCTGT 
TTTTCCCACA TCCGCTGATA CCAACCCTTC TGAGCCAGTA ACTCTGGATG CGTCCCCCGT 
TCAATAATTT TCCCTTCATC CAACACTAAG ATTTCTTTGG CATGCATGAC ACTACTTAAA 5520 
CGATGCGCTG TAATAATAGT GGTTTTTTCT TGGCGCGTTT CTTTTAGATT TGATAAAATC 5580 
GCCTCTTCAG TCTTCGCATC GACTGCAGAT AAGGCATCGT CTAAAATTAA CAATTCTGGT 
TCAACAATCA AGGCCCGTGC AATAGAAATC CGtTGtTTTk GTCCTCCAGA AAGCGAAACG 
CCACGTTCCC CAACCATTGt GTCATAGCCT TCAGGGAATG CTTTAATTTC TTGGTTAATA 
AATGCTAATG CTGCCGCTTG TTCAACTGCT GCTTGTTCCA AGTGTGGTTT AGCAAAACGA 
ATGTTATCTC GTACCGTCAT GGAAAAGAGA AAATGGTCCT GTGGCACGTA CCCCATTGCA 
CCCATCAATG CATCTAATGT GTAATTTTTA ATGTTATGTT TTCCAAAGGA AATCGTTCCT
TGGTATTGGT CATATTCACG CATCAACAAT TTCAAAATAG TGGTTTTACC AGCGCCTGTT
TTGCCAACAA TCCCCAAGGT TTCTCCTTCT TGCAACGAAA ATTGGATATT TTCCAACGCC
ACCGTCTCTT CTTTTGGATA AGAAAAAGAA TCAATCTTCA TTGAAATTGT CCCTTGCGCC
ATTGTTTTGA TGGCATCTTT TCGTTCGATA ATATGCGTTT TTTCATGTAA TAATTCGTTC
ACACGATCGT AACTAGCATT CCCACGTTCT AACACATTAA ATAAGCGCCC AATCGCAAAC
ATTGGCCAAA CTAGCATCCC AATATAGGCA ATGAAAGAGA CCAACTGACC TAAACTAATC
GTTCCTTCCA TGATGAAACG ACCACCAaTA ATAATTGkCA AAACATAAGA AaCACCmATA
ATCAGCGkAA TAAaCGGaTC AAATrGgCAT CTAAAAAGTc GTcGTGAATT TTGAC
(2) INFORMATION FOR SEQ ID NO: 276:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 
CCTAGGACTG GCCTAAGATG GGGCCCCTCT TTCTAAACCG AAAGGGTCTT ATnGGAAAGG 
GTAACTCTAT GGTAAATGGA GGTnCnTCTA AATTTCAACC GATTTGGCAA AAAATCCAAT 
AGCTCAATGT TTGACCAGAA TAAATCAAGT TAGGATTTGC TAATCCGTTT AATGCAGCTA 
AGGTTTGATA AGTCGTACCA AGCTTAGCTG CAATACTTGA TAAATTATCA CCGTATTGAA 
CTGTGTAAAT GTTGCTTACT ACTGATCCAT TTACTTTCAA AATTTGGCCA GGGTAAATAA 
GATTTGGATT TGTCAATCCA TTTAATGCAG CCAATCTTTG ATAGTTTGTT CCGTATTGAT 
AAGCAATGCT TGATAATGTT TCGCCGTATT GTACCACATG AGTCGCTTCT GGTTGTTTAT 
CAGGAACAGT TGCCGCATCT GGCAATAATT CAATATTGCC TTTGCTGATC CATGACAAGA 
TACCTTCaAG CAATACTCTG CTcTCAGTTA CTTCTTGTAC TTTATAGCTG TTTCCTTTTA 
CCCATTGCGG AATAGtTCGC CAGTTGCCCA AGCATCAACA CTAAATTTCA CTTTAACAGT 
GTCGCCCACT TTAACATCGG AACTTGGCGT ATTTTCTACT TCTTTACCTT CCTCAATAGC 
TGGTGTGTTT GTTGCTGGyT GGTTATTCTT TGTATAACCA TTATCAGTAA TACCTGTTAA 
ATCAACGTTA CCATCTAGCC CTCCTGCAAC GTAGGTTGAC GTGAACTGAA AAATACCGAT 
ATTTTCAAAT GaAGGAAAAT AATTGTAATT TGGATAAGGT GTCACTTCAT AATCTGGATA 
TTCTGCCATC CATAATTGAT ACTTCTTTGC AATTCGTGAT AAATCATAAG ATGAAGTAAG 
ATACCCTTTG TATCCGTAAA GCATCGGTGT ATATCCAGCC TCTTTGATAT AGTCTAACGC 
CCATAGCGTT ACATCCGTCG ATTGAACGCC GTCTTCCGCA TCTAAGGCGA CAATTGATCC
CTTTGGTGTT TGAACTTTAG GTAAAAAGTA ATCTAATACT TGCTTTGCAT TTTCGTAGGT 
AAGGACGTTT TGCCACCATA CATACGTATG CGCTCGTTTA CCTTGAGCAA TCGTACTAGC 
TACTTGACTA
TGAAATTGAG 
CAAATCAACG 
GACTACCAGT 
TAATCCAGGT 
ATTAACAACA 
AACTGCTACT 
GTTCTTTATT 
TTTTATTTTG 
CTTCCTTTAG 
ATGCGAAAAT 
cATCCTGCAC 
ACAAATTTTT 
TTGTTTTGTA 
ATTCGTTTTC 
CAGTCATTCT 
CCGCATTTTG 
CATTATCAGG 
GCTCTTTTTC 
AATAACCTTG 
ATCGCACTTA 
AAGTCGTTTC 
TAGTATCCAT 
GGCACTAAGA 
CATTGTAAAA 
CCTGCAGcTC 
TATGCTTTGT 
TGTGTTTTAA 
ACACTTCCCC 
CTTTAGACAT 
CTGCTGGACC 
CTCCCTTTTC 
GAAATACCTG 



GAATAAGTAA 
AATTTATCAT 
CCTTGGtCTC 
AACGTTGCCA 
GTTGTGGGAT 
TCTAACAGTT 
ACTTGAATTA 
CTTGATTTCC 
CTCTTCCAAT 
TTCTTTCATG 
AATtCCCCCT 
CTACTTctAC 
GTATATTCTT 
TAACAATTCC 
ATCTTATTCC 
TTGTGTTTGT 
CTTTTGTAAA 
ATCGTATATT 
TACTTCAAGT 
AATATCATCA 
ACGCAAACAT 
CAGTAATTTT 
GTAAAAGGaA 
AAAAGTGATA 
TCCAACCATT 
CGCCATACCA 
CCGCATCTAT 
GATAGACTGG 
TACCTTTTCA 
CAAGCCATTT 
TTGCGGACCA 
ACCTTTCAAT 
TGTTCCTCCT 



CTTGATCATA 
GTGCATAACC 
CTTTTGCAGC 
ATAGTTTCTT 
CTGTCACAAT 
GCTTATTAAT 
ATAATAGAAC 
AATTAATCAT 
CGGCTAATGC 
CTATCTTCTA 
AAGAATGTAA 
TTACAAGAAA 
CTTTTGTTAA 
AATCATACAA 
GCTCCTTTCG 
TGAATAACTT 
AGTTCCAATT 
AAGCTTGTTC 
TCAATTTTAT 
ATCCAAATTT 
TTTACTGCCT 
AGTATCCTTA 
AATAACTCCC 
ACACGCACCG 
CTCACACTGT 
TGCACCTGTC 
TTTTTTTATA 
TTTTCCTTTT 
AACGTAAtTG 
TTTATTGAGG 
ACGTCTCCTT 
ACTTCTGGTT 
TGTTTTACTT 



AATCCCATAG 
AAATGTTGCT 
AAATGTGCTA 
TTTCATTTAT 
ACCTAGAATA 
CATTTCAATT 
TGCTGGAATA 
TTTTATTTCC 
GTATTTCATG 
ATTGAGAAAA 
TCACtGCTAA 
aACCGCTTAG 
GCAATTCATA 
CTGTTTAATA 
TAAGTTCAGC 
GATTCTGCTG 
CCGTAGGTGG 
CGTCCCAGCG 
TCGGTTGTTC 
TCATGTTCTC 
GAACCAGCCG 
ATATATAAAT 
CCAGAACCTG 
TTCAAAGCTC 
GAAAGAGACT 
CACAACGGTT 
TAAGATTCAA 
TCTTTTAATT 
CTGGCAATCC 
TTGCAACGTC 
TATCTCCTTT 
TCCCTTCCAC 
TAACAATATC 



TTGTTTTGCC 
TGATACCCAT 
ATTGGTGATA 
TTATCTCCTT 
GTTAGCACAA 
TGAAATTTAT 
ATCGAAAGCC 
TCCAATCCCT 
GTTATTTAAA 
GACATGATAA 
CCATtGTtCT 
CTTTCGCTAA 
TTCACGTAAT 
TTATCGAAAC 
GACTTGTTTC 
AGAAACTTGC 
CGTTGGCTCT 
ATAATTAAAA 
CATTGTGGAA 
TTTCCTCCTA 
TATCACTAGA 
ATTTCCGTAC 
GATTCTGCGC 
CTTCTTTGGT 
TAGAAGGTAC 
CTTTTTTGGT 
GCCCATCAAT 
GAACAATATC 
ATCTAGCTTt 
TGTCGTTGTT 
TGGACCTTGT 
AGCATTCCAA 
TGTCATTCAA 



CACCAATTTG 
TCCAAATAGA 

CAAAAAATAA 
TCCTATCTGA 
CAAATAATGC 
ATCCAAAAGG 
AAAATTGTTT 
CTAAAGAGGG 
CGTTCAACAG 
AATTTCATTA 
ACTGTTAAGT 
ACGGTTCTCC 
CTACTAAATC 
CTGGAAAAGC 
ATCAATTCGC 
TTCATTAATT 
GGTTCTGGCA 
AAATCTGTTG 
TATCCGTCTA 
GTATTCATAA 
AGAGGCATTT 
TAAATTAGTA 
AtGCTGTTTA 
ATATTCTTGC 
TTGCCCATTT 
AATTTTTTGA 
TGCGGCAGTA 
TGTTGTCATT 
GTTTTATCTt 
GCATTTTGCC 
GGACCTGGGT 
TGTGTTTGAG 
CTTCCCCTAC 
TCTCTCAAAC


GTAATATCAG 


GTATTCTGTC 


AATGGCTTCT TGAACI I 1 1 1 1 


1 UnnLn 1 A 


n sn 

JlOU 


TTGTTGATTC 


ACTCCGCCGC 


CATCGCCACC 


ACCAGTTGCT GAAATAACAC 


OA1 LI iLl UA 




AATAGAAATA 


TTCGCTCCAG 


CAGTATAACC 


TTTCAACTCT TCCAGTTTCG 


7\ T 1 T 1 T 1 A f** T 1 T* 

Ml 1 1 1 1 it 


lino 


AGTGGTGAAA 


TTTTGATCTG 


TTTGCTTTAC 


CGCAGACAAC GTTCCGTCTT 


LIulAAl lit 


1 IfiO 


TAACAGTTGG 


CCAACCTTTA 


TTCCGCCCAG 


TTCATCTGCG GTAGCGATTG 


(jAAu AA ibln 


14 90 


CACGCCTCCC 


TCGCCATTTG 


ACAACCGTTG 


AAACATTTCA GCAGTGATAA 


1 ACL-Cj 1 U 1 vj 1 


14 ft D 


TTCTTCTGTC 


GCATAAGGAA 


GTTCTGTCAG 


TGCATTTTCT AAGCCTAGAT 


C I bL 1 1 1 AU 1 


1*^40 

J Jl V 


GATAATTACT 


GCCCCAGTAT 


ATCCATTAAC 


AGATAATACT TTTGATTGAC 


CCGCAA1 AA1 


i enn 

jDUU 


TTTTTCTAAT 


CCTCGAACAG 


CGGATGCATG 


TGTAATAGGA TAAAACTGAC 


GTTCCACGLC 


jODU 


ATTTTCATCG 


GTTTCCATCA 


TTCGTTTTGC 


TTTAACCACT TATTTCACCC 


ACTTTTTCAA 


1*7 9 n 
j / zu 


ACACATAAGC 


GTTCTGTTTT 


GTATCATCAA 


CTGTTGCGAT AACCAATGCC 


CCATCGATUb 


1*7 AO 


CAGGATAATC 


AACTGTTCCA 


ACAATTTCTG 


TTTCATGATT CAGTGAAAAA 


GCATCATCTT 


JO 1 u 


GTAAAATAAT 


CAAGTCACTT 


ATTTCGCCAT 


ATTCTAACGT ATATAAGCGT 


TTCTCTAATT 


1QOO 


TCTGATACAA 


ATATTCCATA 


TCTGCCAATA 


AACGTTCAGA AATTGAATTA 


TGGCGCACTL 


1G£0 


CTTGAATGTC 


TACACGTGCA 


TCCATTAGCT 


CGGCTAACAT CGTACCGCCA 


GGATCAATCG 


/I A9 O 


TTTTTAAAAT 


ATCTTTGATT 


GATTCGAACC 


ATGaAGTGAA ATCTGTTTTT 


TGCGCATCTC 


yl no 0 

i 


GCCACGCTTC 


GAACTCTTCT 


TTTCTAGCAT 


TCATCCAATC AGTAAAATCA 


CCCTTATTTT 


a i ^ n 

fl .L 1 VJ 


CGTTGATAAA 


AGCGTTCATG 


TCCGCGATTA 


AATCTTCAAT GGACTGCCAA 


TAAGAACCLA 


4 9 on 


TTTCACCTTC 


TGTTTTCGAA 


GCAGCATTCA 


CTACAAAATA AGAAAAGTTT 


TGCG 1 1 (jUAL 


a 9 fin 


CAATCAGGTT 


ATCACCTTTA 


TGAATACTGA 


AATATGCTTC TTGTCTGTGT 


AACoAC- 1 vjCA 


4 19 0 


TAGAATATTC 


ATCAAAGGTA 


TACTGAATAA 


TCCCTTTTTT GGCATTCACA 


Ai 1 i 11 1 u 


4380


AACGTTGAAT 


CGGATATTTA 


TTATCAATAA 


CCGATTCaAA AAAAAACTTC 


bLAALC Ivj 1 1 


4440


AAATCAAGTG 


GCAAAGCATT 


TTCAACTAGT 


ATGGCTTCTA AGACTTCTGT 


GTT ICbAl 1 1 


4500


CCTTGTCGTA 


CATTCTGAAT 


CCCAATGTAA 


TTGTAAGGTT CAGTTGTACT 


TAGCb 1 IbL 1 


4560


TGCCATTTAA 


CCATTGAAAA ATCCTCCTyT 


CGTTATTTTG GTGGAATAAC 


AA 1 Lun 1 1 uM 


4620


ATAGAATTAG 


CAAAATATAA 


TCGGTCATAT 


TTTGCGACAA TTTGyCCTTG 


CTL(jbL.r 11C 


4680


TGTTCTATsG 


TTTGGATACG 


TCCGTTATTT 


AAGCCGTAAA TCACGCCCGT 


1 bnttn Ini 


4740


GTTGGGTCTA 


CTGTCCAACC 


TGTTCCCCAT 


TGGCCACCTC GTCTAATATT 


bALun 1 1 1 


4800


CCTACyACTA 


AATCTTGATA 


CGTTGGATTT 


TGGATTACTC GCCAACCTAC 


CGCATTCCAA 


4860 


TCATATGCTT 


CACCAATATC 


TGCAGCAGAT 


GATGTATCAC CAATTACATG 


TGAAAAGCCA


4920 


TAAATTGTTC 


CTGCACCTAA


ACCACAGCCG 


CCCATAAAAC CAGAATATTC 


GGCTGGAACG


4980 


GCATAACATT 


GCCCATTACC 


AAGCCATTTG 


CCCATTAAGG TCTCCAAATG


TTCTATGCCA


5040 


GCTTTTCCTG 


TTGCAGTAGA


AGCTTTCAAA TCTTTGAATT TGTCATACCA TrCTTGTGCA


5100 
TAGGTTTGTC
GCAGCTGTTT 
CCTAACCATT 
CTATAGTCTT 
CAAGTTGGTG 
GGACCGCCTA 
CCGAGGATAC 
TCCCAAGCAA 
CCAGAACCAC 
TCAACTGTCA 
AAAAACTGTG 
ATTTTTTGGG 
CCATTTTTAT 
CTATTACCAT 
TTCCAAGCTT 
GTTTCAAAAA 
TTAATTCCCC 
TTGTGATTAA 
TGGGTGTTAG 
AAATCAGTAA 
GCTTCTTGAA 
ATTTTTATTG 
TCAATATCAT 
tTAGCAAATT 
ACTTTAGGCT 
AAGCTAGCTA 
TCAGGATACT 
CCGTCTGCTC 
TCAAATTTAA 
CGTCCTCTAC 
TATTGAAGTA 
GTAGTGCTAG 
TTCTGTTCGA 



TTTCTGGATG 
TAGGCGAGCT 
GCCCATTGAA 
GTTTGATACC 
CGCCTACCAA 
TTTGTTCGGT 
CTGCAGCAGC 
ATTTATCTGC 
CACCACCATT 
GAATTGCTTT 
TAGGACCAGT 
GATTATCTGC 
CTGCTAACCC 
TCCAAAACTC 
GTAAAATACC 
TATTAGATTT 
ATGCATTTTT 
TTGCAATCAC 
GTCGATTTTT 
TATAATCTAC 
AGTCTTGACT 
TTTtTTCTGC 
AAAGCTTGTG 
CTTTTGCTTC 
TATCACTTCC 
TATCTTTTAC 
TACTGATATA 
CACTTTGAAG 
TGGTTAATCC 
GCCTTAGCAA 
GAGAACCTTG 
AAGCTGTATT 
TTAATTTCAT 



TGCTGCAGCT 
GACAACTTTA 
CATACACCAA 
TGCAGCTGCG 
TGGATATGCT 
ATCTGGGTTC 
TGCTTTGCTG 
ATCGCTTGTA 
TTGACCAGGG 
ACTTCCATTT 
AATTAAATGT 
GACTAATAGT 
TGGAAAAGGA 
CATCCCTTTT 
GTTTACCATA 
TATTTTGCCA 
AAATGGaCCT 
ATCTTTTGCA 
AGGATACTGA 
AAAATCACTC 
TTGCTGTTTA 
TAAAGCATCA 
GTACACAGTG 
TTCTGAATCT 
TGGATATAAT 
TCCTTGATCT 
TTCGCTTTTA 
CTCTGCAAAT 
TTTTAAATCT 
AGATAATTTA 
TTCCCCTGCA 
TATATCAGAA 
TGCTTCTGCA 



GGACGTTCAA 
AATTCATCAA 
TTAATTAATT 
ATTAAGCGTT 
GAACCGTCCC 
ATACTTGGAC 
TATCCGTTAG 
ACTTCTGATG 
ATAACTTCTT 
GGACCAAAAA 
CCTGTGCCTG 
AATGAATTCC 
TTTCCCTTTG 
TTAGTTAATT 
CGTAAAATAT 
GCTCCTATAA 
TTCCAACCAG 
GTATCCGTGG 
CGTATACTTC 
ATATACTCTT 
TAAAAAGCAA 
TAGACAATTC 
AATGTATCGA 
GTAAGTTTCT 
GTAGTGAAGT 
TCAGTAAACT 
ACTGCATTTC 
GGCAAAACTT 
TTACCATAAC 
AAAGGCTCTC 
ATACAATTCA 
TATAGTTTGA 
CCATTACGAT 



AGTTTAATTC 
CTGTTAATGG 
GAGCTTGGGC 
GTACATATTC 
ATTGAACCCA 
CAACTTCTCC 
CTAATAGGTA 
GATAACCACC 
TGCCGCCCAC 
AGTTAAAATT 
ATTGGTTAGA 
CATCAGAAAC 
TTCCCATCGT 
CCATGATTTT 
CTCCACAGCT 
AGTCCGCATT 
TTCTTGAGAA 
ATTCCGTATC 
CGCCTTCAAC 
TTTTTGTCAA 
CTTGGATATC 
CTGTAACTTT 
ATAAATTATA 
CaATTTCTAA 
ATTTACTTGC 
GAATGTATTG 
CATAAATCCG 
TAGTAACAAT 
GAACAGTTCC 
GTTTTATTTC 
ATACATTTCT 
TATCACAAGG 
TATCAACTGT 



AAACGCATAA 
ACTTACTTGT 
TAATGACGTC 
TCGGCCATTC 
TCCGTAAGCT 
TTGTACATTT 
ACTCCATAAG 
TGTACCAGCT 
AATCAATCGA 
ATTTCCAACA 
CAAACCAATA 
TATTGGATTT 
GCCAACGTGA 
TTTCTTATTA 
ATTAAATGAT 
CAATACTCCA 
TCCAATTCCT 
TAAGTAATAA 
ACCGTTAATC 
TGTTTTTATA 
TCCCGCAGTG 
CGTTTGAATG 
GTTACGCATC 
TTCAATAGAA 
CACTTTATTT 
AGCGTAAACA 
TTGAGAAGTT 
TGATTGCCAA 
AACGTTATCT 
TCCGCCCCAG 
AGCTTCAAAT 
TTCGTCCATG 
TACTAGCCTC 
ACTTGTCTGT TTCCTAGCTT ATAAGTACGA GATTGGGCAT AAATAACAAT GCTATTAGTA 7140

AAAGTATCTT TAAACGTTTG TTTGATCTCA AAAATGTGGT ATTCTTCTAA GTCATTTGGC 7200

TTTGCTTTAA TTTGATAGCC ATTTTCGAAA TAATCACTAA ATCTGCTAAT CGCTGGATAG 7260 

TCCATTTCTA GTTCATATTT TCCGTTTGCT TCTTCAGTGA TTTCGCAACG TGTCGCATCA 7320 

ACAAGACGTC CTAATCCATT TGTTGaAAAA TCTTTTTCTC CAGGTTTAAA AATAACTGGA 7380 

ATCAAACCTT TCGCCTCCAA TTCGGCTGAA CCTTAAACTC TGTTACTTTA CCAGTCCAGC 7440

GAAAATTATT CTCTCCACAT TTTAAAATCG GATAATCTTT AAAAAGTGTT TTATGATCCA 7500

AGATTTCAAA TGCTCCACCT GATTTTCTAT AAGCTTCTTG TTTTTCTGAA TCTATAATGA 7560 

TGTCACCGTT AATTGCTTTT AATGAATATG ATTGATTATT GATAAAAAAA GAAATATCCC 7620

CAGACCCCAA AATCTGAATA ATGGGTTCTG AAGGATATTT TTCTGTATTG ATTAACTGAT 7680

TAGGATTACT TATCCAATAT tGGCCAATAC GATTTTTCTT AAAAGGTCGG ATACTTACAG 7740

TAAATTCAAA AGGAATTAAA aCCCCGCTTT TTCTTGTTCC TGTAAATTTT GGTGGACTCG 7800 

TTaCAATCGC CTGATaAATa TAATGCTCAT CAAAATAGAC AATAAAATCA GAATAGTTTC 7860

CCATATCGAG CCAAAAGGAA ATTTCATCTT CTAAAAAAGA AaCTyCTtGk AAAGtATTtG 7920 

CTTTCGCATA GCATGTAATG GTACGTTCTA CATTTTTATA ATATGCAAAA TCAACGGCTA 7980

TTGAATCATT ACCCATTCGC TCCCTAAGCT CTACCACACG TCCTGCAGAA AGTCGTTCAG 8040

GTCTTTCTCT CATAAATACA TTGAATTCAG AACTATGTTT TCCATTAAGA AAAAACTGTC 8100 

CTCTTTTAAA TTCCACCAAA AGCACCCCCA GTkGCATCAC TATCTCTATT TTTAACAATT 8160 

TGgAATATAC TTAACGAGGT CTTTAGCCAT ATCCaTTAAT tGTTTTTCAT TTAATTTTCC 8220 

CATAGCTTGT ATATTGATAT TGAAAGTATC ACCGCCAATA TTAGTCGTAG CATTACCTTT 8280 

ATTTTTAGCT AAGCTTTCTG TTTGAGCTCC TTGTTGGTTG ATATATCTGC CAGTAGTAGA 8340 

AAAATTTGGT AACTCTGTTG GTAAATCGGT CATTTTTTTC ACTGATTTGT CAAGCGTTCC 8400

TTTTTCTTGG TCAATACCAG CTACAACACC TAATACAATA TTTTTACCAA TCATATCCCG 8460

CATCCATCTT GAAGGTGAAT GAATGCCTAA AGCACCTTTG ATTTTTTCTT TAATATTACC 8520 

AGCAACTTCT TTAATTTTTT TATTCACAGC ACCAATCATT GAACCAATAC CGTTAACTAA 8580

TCCTTGGATA ATATTTTTAC CAATTTCAAA TAAATCGATA TGGCGCATAT CATTAAAGGT 8640 
TTGCTTCACA TTTTCAACTG TATCACTAAC GCTTCTTTTA AGATTATTCC ACGCATTTTT 8700 
AGCACCTTGT ACCAAATTGT TGAAAATATT AACTGTTCCC TGTTTTAAGT TTTCCCAACC 8760

GTTAATGATG CCGTCTTTTA TACCTGTCAC AAGATCAACA ATCCACTGTT TAAAATTATT 8820 
CCAGGTATCT TTTGACCATT GAACAGTTGC GTTAAATGTA TCAACTGTGC CTTGTTTTAA 8880 
GTTATTCCAA CCATCAATTA CACCATTTTT AATGTTTTCT ACTGTTTCAA AGAACCAAGT 8940
TTTCAAACTT TCCCATATTC TAATTGCTTC AAATTTAATA TTTATCCACG TTTCGATGAT 9000 
AGAATATTTA ATTTCAATCC AAACGTTGAT CGCTCCATAT TTAATGTCAA TCCAGAGTAA 9060 
GGTGAAAAAT AACTTCACAT CAATCCAAAT CTTTTTAATT GTCAACATCA ATCCATTAAA
AATAGAAGTG ACTGAATAGG AAATAGGTGT AACAGTGTTA TAAAAGATAT TTTTAATCCC
GAACCAAATA GTCTGAGCAG CTTCAGCAAT ATTATCCCAA ACGGCAATCA TGTTTTCTTT 
TGCCTCTTCC CATCCACCTG TGATCATTGA TGTAATGAAA AGAATTGGAG CTAATAGAAC 
ATTTTTTAGA ATGGTGACGA CATTTTCAGC GATCATTTTG ACATTTTCAA TGTTCGCTTT 
CATAGCGTTA ACAACCATTT TAAACGCATT TTTGATTCCT GTTACATATG GACCAATATA 
TTTCCAGACA AAATCAaATG CTGTTGTGAA AACATCTGAT ATTGATTTTC CAACACCCTT 
AAACCAaTCT TTCACATTAT CAAAGCCATT TTTAAAACTT TCTCCAACAC TTTTAGCACT 
GTCAGCAGCA CTTTGTTTAA TATTTTCCCA TGTATTTTTT GAGCCTcTTT TGkTGaATyC 
CAAAaGTCCA CTGAAAAATT CCTTGGTACC GTTCCACTTA TTTTTAACCC AGTCGGCTGC 
ATTTCCAGGT ACTTCTTTCA TCCATGTGCC AGCATTCGAA AAAGCCTCTT TTGTGCCATC 
CCACATGTTA CTGAAAAATT TCATTGTGGA ATCCCaAGCT TTCACAACTG TTTCTGCAGC 
ACTAGAAATA AATTCTTGTA TATTTTTCCA AATATTTTTA ACAGCATCCC TAAACCCTcG 
TTAGTCTTCC AAAGATAAAT AAATCCTGTA ACTAAACCTA CAACTGCAGC TAAAATAGCG 
ACAAACGGAT TCGCCAACAT AGTTGAATTA AGTATCGtTG CGCAATTGAT AATCCTTCTG 
TTGCTTTTTG CCAAGCCGTG AATGCTGCAC TTACTTTTTT AGCAAGCATC AACGTTCCAA 
TACTGCCAGC TAAACCTGCA AGCAATGGAG CATAAGGTTT TAACGTATCA TACAATGTTT 
TGGCTGTTTT AATCATTGGC GGAATCATCT CGGCAAATTT AGATAAAGCT GCTTCCATTT 
TTGCCCCTTT GTCAGCAATG ATTTCACTAA TACTTCCAAA ACCTGCACTT TTTAAGCCTT
CGTCAATTTT AGTTACAACG TTAGCCACAC CACGAACGAT TGCAGTCTTC ATGTTAGCTA 
AACCTGTTTT AATACCAGCG GTAGAATCTT TAGCAATCTG TTCTAATGAT TTAAGACCGC 
CACCGCCTTC TTTATTTAAT TTGATTAAAG CATCTTGAAA TTCTTCAACT GAAATTGAGC 
CATCAGAAAG CCCTGCTTTC ATCTGTCCAG CTGTTAATCC CATCTGTTTT GCTAAAGCGT 
TTAATGCTGG TCCTAAACCA CTATTAATCA TTGAATTCCA AGTTTCAGCA TCTACTTTAC 
CATTTGAGAA CGATTGGGAC AGCTGGATAA TAGCATTTTC TACCATTTCA GCAGAACCAC 
CAAAACCGAG GATACCGTTA TTTAAAGCTG CGAAAATCTG TTCTGATTTA CCTAAATCAT 
TTGTTGATGA AGCGATTAAT TGAACACCTT TAATAGCGCT ATCTAACGGC GTAGGCAGCC 
CTTGGATACT TTTCTTTAAG CTATCCATTG TTTTAGATGT TTCACCAGCT GAAAAGCCCA 
TATTTTCAAA CACACGATTT GCGTTATTTA ACGTATCTAC ACGATTAATG GCCCCGTCGA 
TGTTGCTGGT AATCAATCCA ATACCTTTAG AGATAATTTT AGTTGcTCCG CTAGCTAAAA 
AGTTACCAAC AAAAGACGTC CATATGCTCC CAAGAGATCG GCCGCCTTTT TGTCCTGTTC
TATCAACTTC AACATCAAAG CCTTGTAACT TTTTTACTGC TGAATTTAAT CCTTGTGAAA 
AGCCAGATTC ATCCAGTATC ATTTTTAAGA CTAAGTCTTC ATTGtTCAAA AAGTACCCCC 
TCCCTCTTAG AACATAGTAT TTTCATCAAG ATATTTGATA TTTTCAAATT CGtCTACAGC
ATCTTTAAAT GCATAAATTT TCmAAAGCTC GtTTAAATCT GTGTTTTCGA TCTCATTTAA
AGTCCACCCA TTCTCAAGAA GCGAACTTTT TAGTTCTGCT TCTCGATATT GTGGCGTGTA 
CTTAAAATGA GGATGATATA AAAGTTCCGT TACTTTTTTT TCTGTTCAGA ATAAATTGCA 
TCATAACCAG AAGTAACAGA ACCTAACAAT TGAGCTGTAA TTTTCAATAA TTCACGAGCA 
TCCATACCGT CAATATATTC TTGTCCAGTA AACTGTTTTT CAAAAATAAC gTCAGCAATA 
AAGTCAtAGC ATTCTCTTAA AATAGGACGA ATTGCTTCCA TATCATTTGT TTTTGkTGcT 
TCyTCTAACC TGATTTGTAA ATCCGTACCT GTATCCatGA CTGAACCTGG TAAAAAtTCT 
GCCGACGTGA ATTGTTTTGT AGTATATTTG CTTCCATCTT TAATCATTAA TTTAATTTTT 
TGTTGAAATT TACTTGCCAT TTTAATTCCc TCCrTATAAA ATaGGACGAC AAGGTCGkCs 
TAAACTGTTA TTTTTAATCT GCTGTTGTTA CATTTAATGT ACATTTTGCG GTAAAGTTAC 
CATCTTCTGT TGTGACTACG ATTTCCGTTG TCCCTTCTCC TACAGCAGTA ACCTTCCCTT 
GAATTGGTGT TACAGTTCCA ATAGCCTcAC TTTCTGaACG GAACTGATAT TTTTTATTTG 
AGGCGTTATC TGGTGTAATT GTCGGTGTTA AGGTTGCTGT TTGGCCAACT TTTAAATTTA 
ACTCTGTTTG GTCTAAAGTT ACACCAGTAA CAGCAGTAGT ATTTTCTTTA CGTGGaTCCA 
TTACCTCAGT AAACCAGTTT TTAATCATCT CTAAGTCAAC ACCTTCATCG TCTTCATCCA 
CGGAATACAT ATAACCCAAC CCTGGAACAT CAACGAAAGA CCCCGTCCAT TCTGGATGGG 
TATAAGATAC TGAACTTCCT TCTAATGTAG ATGTTTCATC AGATGTTAAA GCAAATTTTC 
CTTTATAGAA AATCGTATAG CGATATTTAC CGTTCGATTT TCGGCGACGA TAAGCAAATG 
CGCCATCTGA TGCAATATCA TCCGCAGACC GCAATACGCC ACCCTTTAAT TTTTTTCCCC 
CTGTAATTTC AGCTAAAACT TCATTTtGGT AGCCGTTTGT TTCTAAAGTA ACTTCTGCAC 
CACCAAATGC AACATACTGA TCTTGAACTA CACTATCGCC ATAGTCAGGC GTTGTTTCTG 
TTGTAACATC TGGTTTGATA CTTACAGCAG TACCGATTGT AATTGGCGCT CCGTAAACTG 
GAAAAGCGCC CGTTTCGTCT GTTAGTGGGA ACCACGTTGG CTTCTCTACA GAAATGACAC 
TTACATTTTT CTTTTTTGCC ATCTATTTTT CACTCCATTC AATTAATTGA GGGAACGCAA 
CATTAAAATT GATATGTTGA ATTCCGTCTG TTTTAAACGT TTGATAATCT TCTGGAAACA
ATTCGTTTCC GTCCAAATTC AACACATTAA AAAAAGCCCC ACAGCTTTCT GTTAGGCTTT 
TTACTAATTG TTTATCTTTC TTACTATCAA CCAGTGCAAT ATCAACATTG TATGCTTTAT 
TTTGAACGTT TTGACCTACA TTATCAGTCA TGCTTTCTTC TACGCTCAAA ACGAAATAAA 
ACGGTTCTGA CGATTGCATC ACATTATCAA GATAGATAGG TGCATCCGAG AACTGTTTTA 
TTGTGTCAGT AAGCATTTTT AAAATTTTAT CGTACATATC TATCCTTTCT TAACAATGAT
AATCGCCATT TGTTTAAATC GTTTCGGAAT ATATGTTGCA TTAGCTAATT TATTAGATTT 
TTGCAACATA AATCTGCCTT TAACAAATCC GCCATTTTTT GTTCGGTGGC CATCGTTTAC 
ATATCTAAAA
tCTAGCTTTA 
TACAATTAAC 
AGCAATGGGT 
AAGAJCGCTT 
AATAAAAAGG 
TTCTATCTCC 
AGATATTAAA 
GAGAGAAAGC 
CGCTTyCCAC 
CTGCTAAAAC 
TTTAATGGCA 
ACTAATTGAT 
AGCTTTATAG 
ACAAGTATTG 
TTCAGAAGCG 
AACCATAAAG 
TAGCTTCTTT 
TTTTTGTTTT 
CTTCTTTTTT 
AAATTCAAGA 
CCAATCCGTC 
CGTAATTTTA 
ATTAACGCAA 
AGTTTATCGC 
TCTTGCATAT 
GTTTTAAATA 
TTATTCGTTG 
TCATCAGCCT 
CGAAGAGCTA 
GTGCCAGATA 
TCAGATTCCG 
ATGCCATATT 



TATTTTTCAT 
ATGaTTCGrT 
TGATAAATTT 
GTCATTTTCT 
TTACTCATCA
TTTTTTTGAA 
TTTTTTAACT 
CTTAAACGCT 
ACAGGGTAAC 
ATTTTCAATA 
TTCTGCTTCA 
CTGCTATAGT 
GTATCACCAC 
ATAACCTCAA 
TAAGAATTGA 
TTAGGTAGCA 
CATCATTACC 
ATCGTCTTCT 
AGGATGCATA 
TGCTGTTGCC 
TTGCTCCAGA 
TTGAATCTGT 
CATAATTAGT 
TTCCTGAACC 
CAAATTGCAC 
CAGGTGCTAC 
AGGCATCAaT 
AGCTAATCAA 
TTACACCAAC 
CTTGGcTATT 
CAGAAGTTGa 
CATCTGCGTA 
CTCGCCATTT 



TATTAATCAA 
AACCTCGTCT 
TAGCTAACGA 
TTAAATTTGC 
ATTGAACTTC 
AATAACACAT 
TTTATATTAG 
TCTTTGTTTT 
TTGCCATCAT 
TCACTTATtC 
TTCAATAGAA 
TCTTCATGAG 
GTTTTACACT 
TTATAAGTGG 
TTTCAAAGAT 
TCAATTGAAT 
TAAACTTTAA 
GAAACTATAA 
AAGCCCACAA 
aTTTTTAATT 
ATTAGAAGCT 
TGTTTTTGCT 
ATCAATAGCT 
GTAATTTGAG 
AATTTTTGTT 
TAAACATAAT 
ATCTTTTCTT 
ATTTAAAATT 
AATCAATTTA 
CAATTCATTT 
TTTTCTGAAA 
TTCTCCGCCT 
AATCTCAGTT 



TGCTCCCACG
TAAATCaCCC 
ATCATTCACC 
AATAACTTTA 
CTGATAGCAC 
GTTTTTGACC 
GTTCTAAAAA 
CTATTACAGG 
GAATTGGAAA 
GCTCAATAmC 
AAACACCCCG 
TGATTGTATG 
AGCTACAGAC 
TTCAAACTCG 
AACAAGGTTT 
TTTCTCAACG 
AAGTATCTGC 
ATTCATTATT 
AGTTTTtCTT 
yCCTCCTTkG
TTGTATTCAA 
AATTCTTCCG 
GCGATTGTTC 
ACAATATTTC 
GATTTTTCAT 
TTTTCGCCCA 
GTTACTGCCC 
CCGTTCATCT 
CGATTTAAGT 
CCTACACCAT 
ATTTCAGTAT 
TCTAATTGAG 
GACTTGGCAG 



ATACGACCaC 
GATTTAATtG 
ATTGCAGCAC 
TCAGCATTTG 
TACTTCTATA 
TGTACCTTGA 
AAGTTTTTGT 
TAAGTTTCCC 
ATAAACTTGT 
ACAGGTATCA 
CTATCACAGC 
TCTTTTGATT 
TTATCTATTT 
TCCCAAGCAA 
AAAACAGACA 
ATTTCTTCTT 
AGCACGTTCT 
yTCGTTTGCT 
GTCAAGTACA 
TTATTATATT 
TTGAATACTC 
CACGCCATTC 
CATTAGGTAA 
CAAGTTGCAA 
CCAACTGATC 
TGTATCCTTT 
CCGCAGCTGC 
GACGACCTTT 
CAATTTTCAT 
CTACATTAAT 
AGTTGtTAAA 
CAGATGAATC 
CTTCAAcTTk 



TAGACAGCTT 
GtGTCATAGG 
CTTCTTTTTC 
ATTTCATTCk 
TGGCTTGGAT 
GTAATAGTTA 
TCTTCATAAG 
ATACTTCCTT 
yCAGTAATTC 
AAATAAGTAG 
CTAAAATACG 
CAACTACATA 
CGTTTTTTAA 
TATCAATTTT 
AAATCCTATC 
TTTTTTCATC 
AATATTTCAA 
GTGaTAAATT 
CGATAAGTTA 
TTAGGCTTTC 
ACCAACTAAT 
ACGTAATGGA 
ATTAGGTTCA 
TCCAAAAGTA 
AATCATTAAT 
TTGGAACATT 
TGTTTCGACT 
AGAACCAGAT 
TTCTGTAGCA 
AGCATCTAAT 
CCATGkACGA 
ATCATTATTC 
GCCTGCGCCT 
AACAAATAGC TTAAAAATGG TGTGTTTGGG AACTTGTAAt GCGTTaACTT CCkGTGAAAt 15060

ATCtAAATAC TCAaGATTAT TaAgGAACTT TTTTTCaTTT GTAGTTTCCT CCTAATCGAT 15120 

AAAAGCTTGT AATTTTTGTC CTAATGCTAC CTCTGGATTT TCAAACGATT TTGTTTGATT 15180 

ACCAGTTCCC ATGTTGGTTT GTTGCGATTT ATTACCAAAA GCTTTAGTCA TTTCTACATT 15240 

TTTAATAGCT TCTGCATGCT TATCATTTAT TGCTTCTAAA AGCTCTGTAA AGCCTTCTAC 15300 

AGCCTTCTTA GTAAATTCAG TATCTGAACT AACAAGATTA TTTAACATAA ATTGAGAAAT 15360 

AGAATCTTTC AAATCTCCGT CCCAATCTAA GCCAGCAATT TTTTCTGCAA CAAAAGCCTT 15420

ATTATCACTA GTCACACGTA ATGCCTTTTC AGCTTCAAAT TCAGCCTGTA ATTCTTCTAA 15480

TTTAATTTGT TCAGGAGTTT TATTTTTCTT AGATTCTTCA TACTCCTTGA TTGTTGTTTC 15540 

CTTGATTTTT TCAAGATTAT TTTGTTTCCA AGCTTCTAAT tGTTTATCTG CAGCTGATTG 15600 

TGACTGTGAT TGTACAAATT TTTGTGCTTG CTCATTTGAT TCTACAAATG CCTTAAAATC 15660 

ATCGAAAGTG AAGTTTGTAC CACCGTCTTC TTCAGCAAAC ATTTGTAAAT TCATTGGTAA 15720 

TAGGTTTGGT TTCATTTTGT TTCTCCTTTC GCCCCACGAT TCGACTAATC GCCCCGCATT 15780 

GCTTTAGATT TATTTATTGC GCCCCACCAT TCAATTAAGC CCAGCATTGC GCTAGTTTAA 15840 

CGTCATTTCG GACAAAATAA AAAGCCTAAC TTTCGTCAGA CTTTAATTGC TTTTCTTCTC 15900 

TTAGTAAATG CTCTTCATAA TCTGCATCTA AGTAATCATA GGGgATCCAT CTAATCACTT 15960 

CCAATTCTTA TGGACCAGTT CAGCACCTAA CATTTGATAA TCAGTGACAG CATCTTTTAC 16020 

GTTTTGCAGA GTCCTAGACA CAATCGAAAT AGTTAATTTA CTTTTTTTAC CTGGTAACGA 16080 

ATACAAAATA TCAACGTGAC AATAATTACC ACCCCAAACT GATTTAAGCT CATCTTTGAC 16140 

GATATTACCG TTGCCATCTC TCAAAGTGTG TTTGGTTAAG TATCGTTCGT TTTCTTGTTC 16200 

AAAAGCTTTT TTATAGGCTT TTTCTGTACC ATTGGTAACT TCTAGATTTA ATACTGCTTC 16260 

GAATAATCCT TTCATGATCT CACCTCCAAT TTAGGGTATA AAAATAGCAC TCAAAGTTAT 16320 

CCTTTAAGTG CTTAGTAAAG TCTGCTATGG TAATCTGACG GGACGTGGAC CGATTTATTT 16380 

TCTTTAATAT GCTCATCAAT GAATTTCGCT AATTTTTTTG CACCTAGAAT AGAAAAATCG 16440 

AAGTCATCAC TCTTTGTTAC ATCGATATAT TCATATATTG GAAACTCATC ATGGAAATGT 16500 

TGTTCATACT TTTTAACTGC ATCATCAATA ATAGACCATG CGCCATCTTC GCAACTAAAC 16560 

ACTTATTTCA CCGCCTTTAA AATATCATCT ACTAATTTTT CCCATTGGCT TACTGCTGTC 16620 

GGAAAAATCT CTTTCATTAA TTCTCTTGAC TTATCGTTAG CAACCACTTC TGTCATATGT 16680 

GCAAAAAATT CCGTTTCTGT GCTACCTGTT GTTTGCCAAT AGCGCTTGCC GTGACCAAAA 16740 

CCTAACGGAT AGTCTCCTAA AGAACCTATA GATTCAATCA TATCTGAAAT AGCGGATACT 16800

GAGTTAGGAT TTTCCCTAAC TATTTTTGAT AATCGkTCTC yAGTTTTTtG TAAGTTCTCT 16860

GTATnnnTTT tATAAATTTC CGCCCTTAAG TCATCaTAGG CCTTTTTCTT ATCCGCATTT 16920 

CTAGGTCTAT TACCTAAATC ATTATAGGAT GGCAAATCTC CGTTTACATA TTTCCAAAAA 16980 
TCGCGTTCTA GCGCTTCTTT AATGTTATAC TCAGAAAGTG ACGATGCGTG TGTTATATGC 17040

TC TTPT ATAA AAGTGGTTCG TCTTCCTAGC TTTCTTTTTA TCAGTTTTCC TGTCGGCATC 17100 

GTATTTTTTC CTGTTAATAC TTCAAGCCCA AGATGATCCA AAGCATGTCC TATCTCATGA 17160 

TAAACTGTTT GATAAGGATT AACACCCCTA TCCCCATCAA ATGATTTTTG ACTAAGTTGA 17220 

ATTGATTTGC CTTGTGCAAA GGCTCTTACT TCTTTGATGT CTTTAAACTC CAACTTATGC 17280 

CCTATCACAT TTATTAAGTT TAGCACCCTT TGGTCAGTTA TCCCATCTAA ATGATCTAAA 17340

AAATTAGAAT AATTTTCTTT TCCTAAAGCT TCTTGCATGT TTGTTTTGTC TAATACGGAT 17400

TTTAAATCGA GTTTAGTTGC ACTAACTTTT GGATGATTTA ATTTAATACC CCACTGCTTA 17460

CTTGCGAATT CATCCAATAA ATCACCAAAC ATCTCTTCAT ACAATTTATC TATGTCATCG 17520

CTAATTtCAG GAATAATTGG TATTTCAGTA CmACGGCAAC GTCCATGATA CGGTGGATGC 17580

CAATCATCTT TAATCTCTTT TCCATGACGT CCACCACAAA TAGAACAAAC ACGCTCATCT 17640

TCTGCCGACC AGCTTTGTGT TTGCTTAACA CCTATATCCT TTAGCGATTT TCTTACACCT 17700 

TCTACCGCAA AATGTGAATA TTCCGTTCTA ACAAGATTTT CAATCGAACG ATTAAACTTT 17760

CCTTGTTCCA ACTTAAACAT ACCGCTAATA ACACCATCGT TTTTCATCGT TCTAAGAGCT 17820 

TCCACAACTC CTTCACCACT TGCCAATGAA TTAATAATGG AATTGCTCAA ACGTTGCTCT 17880

AGGGTTGATA TATTGCCCCA TAAACGAGAT GAAAATGTTT TTCCGCTCCA CGGaTAGTTC 17940

ATGATGTTTT CTAGTTCATT CTTAGTTAAA CCAGGTGCTG AACCGCCTAA TAATTGTATC 18000 

AACGCATTAG AATTAGAATT GTAGATTCGT TTTGTGATAT TCTCTAAGTC GTTATTAAAT 18060 

TTACCGTTAA CATCACTAGC TATTGCTTCA CCTGCAATGG TAGAAAAAAT GTCTGCTCGT 18120 

AATTGTAACA AGCGATTAAC TTTCGCATAG TCAAAGGATG GAAAATATTC ATCAATGAAT 18180 

TGTTTATAAA CTTCATCTGA TTCCATCAAC TTTTCATAGT TCTTCTCAAT ATACTTACGG 18240

TACTTCTCTT GGTCTCGTTT GCTAAAATCT TCTAGCATCT CACTTTGAGT AATACCGTGT 18300 

AAATCAGCTT GTGCCAATAA CTGTCTTTGA ATTTTAACTA AAGCACGTTC GAAAACAGAT 18360 

TCTAGTTCAC TAAGAGTTTT CTTTTCCAGT TTCAAGCGTG CTTTGTCTTC TAATTCACGA 18420 

CGTTTTTCCC AGTAGCGTTC ACTAACTGTT GCTTTCTTCT TCGTCATTAT CCGCACCACC 18480

TAGTTTGTAG TCACCACTAG GATAtCTTGA CCTTGTTCTA AATTCATCAA GTCCATTTCA 18540 

TAATCTGGGT CTTTAGCAAA TGGAATCTGA TTAATAATTG TTCGTTTAGA TAGAAACGGC 18600

GAAAGTTTTG GcAATGCATC AGCAAGATAG CCAATATCTG TTGGcAAACT GCGACTGAAT 18660

GTAAACACAA TTTTAGACAC ATCTAAATCA AGCTTATCGT TAAATTTAAT GAAAGCTGCC 18720 

ATCGTTTCAG CGACTTCTTT TAAACCTTCC CTGAAATACT GCTCTTTTGT ATTTGTTTTT 18780 

GCTTCTAAGG CAATAATTTG CCATTTACGA GCTTCACCAG AAGCGTTTGa TTTAAATACC 18840

TCATCATTAA AATCAATTGA TTTTGTGATT GTATAGAACA TTTTTTTTAG CTGGTTTAGA 18900 

TGATACTCGT TGAAATCTTT ATTAATATCT TTCGTTACAT ACCCAACCTT AGCTTGTGGA 18960 
TCTGGTAAGT TAAGGATACC TAATTGTTCC ATCATTCTTT GTGCTTTTTC TTCATCTAAT 19020
ATTGAGCCGC TAATGGCCAT GTAAGCAAGT JTAAACTGTT CAAyTcGTTT TGTTGGTCTG 19080
ATAAGCTTCT ATCAAATGCA TCAGAAAGTT CTTCCGCCAC TTCAAAATCG CAATAACGAT 19140
TCGTATTGTT TTTAAATTCT GATAGGTAGA ACGTTTCTAG CGGGTTTTCT ACTTCCTcAA 19200
TTAATTTAAA TGTTCCAGAT ACACTGACTA AATTAAATTC AATGTATCTG CTATATATGC 19260
GTATTCTTTC TTTAGTAATG aCTTkCATTT CTTCAAAAAA TTTTTTTTGA TGTGTGtCGT 19320
AtTTTTCACG AATAAAGATA tCTGCATTTT CGTATTTTTC AGCTTTCCAT GGTTCGATAT 19380
TGCTCGCCCA TAATTGCCAA CCTTCCTTGG TTTTAACAGG TTCTAACAAA CGAAAAGCAA 19440
CACCACAAGC TCCTTGAAAC CGAGCTGTGT CAGAATCAAG CATGGCAAAC CGCATATCAT 19500
TCACTAACTC TGTCAGCCGT TCGAACTCTT TTGGTGTTTT GGTATTTTTT AATGTATTAT 19560
TCAAAAATAA ATCTTTAGCA CGTTGCATAA TTGATCGTCT CTGCTCTGTA ATATCGTAAT 19620
CCCACTTAAT TGGAATACCT GTGAAATGGT CCGCTGCTTG ATCGACAATA GTATTGTATA 19680
AACCAGCATG AAGTTTATTA TTCACTTTTA TAATCTTTGT GTTTGGTTTA GGTCTTCTAT 19740
CGATCTCATT TTGTTCGCTT GTATAAGCTT TGTATTTGCG CTCTCTATCA TCAAAAAATG 19800
GCTTCATTTC AGTAATAAAA TCATTAGGAT CGAAAACTTC TTCATTAATT TGTGTAGAAT 19860
ATTTTGTTCG TAATCTTTTA TACCGCTTCA AACTTAAATT GTTTTGAAAC ATTTTCACAC 19920
CTCCTAATAT TGGATAAATC TGACTTGGTT ATCATTTGCT AATGTTTCTA CAATGCCTGT 19980
TACAGCATCT GGTGCATCAT CGTGTCTGTT TTTACCTTCC CTTTGGTAGG TCGTCATAGC 20040
TTCATAAAAT TCTGGCCATC TTATTTTCCA ATCTGATGGG AAATACACAT TATTTTCTAC 20100
AAGTGCCGAA TTAGAAAGAA TACGTGATTG TTTATTTGCC GATTGATGAA ACGGCTCGTA 20160
ATAAGCACCA CGATATCCTC GTTCTTTAAC AATTCTTTCT GAATTACGAG AAAAACCACG 20220
CCCACCAGAG TTAGACTCAA TGCGGACATG GTTTACTTGG TTATTTTTAA TTTGTTCAGC 20280
GTGTGCTGTT TCCGTTTTTT CCATTGGTTC TTTTGTAAAT AGAACATCCA ATACTTCTGC 20340
TTTATGATCT TCTGTCTCAC CAAAAACAAT TGAACAAAGA TTATCAGCAC CAGTGTCTGC 20400
TGTATCGGTA TAATTCCATA TTTTAATGTA ATTAGAGCGT GTTTCATATG TTGAAAATTT 20460
TTGGTACAAA CGACCTTTTA AATCAATCGG TTCTTGTTGA TAGTTAGCAG AAGCAATATC 20520
TGCACCCATC GTTTTCTTTT TACGGAAATA TTCTTCTTTA GAAAGGACAG ACTCACAAAG 20580
CATGGTGTCT GTTTCTcGTT ATATGCTTTC ATGCTAATAT GTTTTACTTT ATA 20633

(2) INFORMATION FOR SEQ ID NO: 277:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1663 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 
ACATTGnTGA AAGAATTGTT GACGGAAGAG GATGATCAAT TTAACTTTTT CTCATTTGAT 60

ATGGAGGAAA CACCTTTAAG TGCAGCGATT GCTGAAGCGG AAACAATTCC GTTTTTTGGT 120 

GATTATCGCC TTGTCTTTGT GGAAAATCCT TACTTTTTAA CTGCCGAAAG AAAAACAAAC 180 

GGTGTGGAAC ATGATGTCAA TCTTTTATTG AATTATTTGG AAAGTCCATC ACGAACAACT 240 

GTTCTTGTTT TTATGGCCTC ATATGAAAAA TTAGATGAAC GCAAAAAAAT TACCAAAGCA 300 

TTGAAAAAAA ATGCCACAGT AGTTGACGTC AATCCAATGG ATGAAAAGGC TATTCGTCAG 360 

TACATTTCTC aAACGATTGA GAGCGAaGGG TACACTATTC GCTCCGATGC TTTTGATTTA 420

TTACTTCAAT TGACAGACTT GAACTTATCA AAAATTATGG GaGAATTACA GAAGCTGTAT 480

TTATATGCCC TTGAAACGAA GACAATTACA CGAGGTGCGG TTCAAGAATT AGTCCCTAAG 540 

TCTCTGGAAC ATAATGTTTT TGACTTAACC AACGAAGTAC TTTCTGGAAA CAGTGAAAAA 600 

GCTGTTCAAT XGTATGAAGA TTTACTTTTA CAAGGAGAaG AAACCATTAA GTTAAATGCA 660 

ATCTTATTGA ATCAGATTCG CCTTTTTCTG CAAACAAAAA TTTTGGCTAA GATGGGCTAT 720 

CAACAAGCGA ATATTGCTGA TACATTAAAG GTTCATCCTT ATCGAGTCAA GTTAGCGTTA 780 

CAACAAGTCC GACGATTTGA ATTGTCTCGA TTAGAAAGAC TTTATGATGA ATTAGTTGAA 840 

AATGATTACC GCATGAAAAC GGGCCAAATG GATAAAGAAT TACTATTCCA ATTATTTATT 900

TTAAAATTGA CTGCACAAGC AAAATAAGAA AATGCCAGAG GCAATcGTCA TACAACGATT 960

GTCTCTGGCA CTTTCTTATT ATTTAGCTCC TAATAGTTCT TCTTTTCCGT GAACAAGCTG 1020 

TGTCAACATG GCACAGAAAG GTGTTGCTAC GTTGTATTTT TGTCCTTTGC GCCAAACGGC 1080 

ACCATTAATG TAATCAATTT CTGTGAGACG ATGATTTTTA ATTAAATCTT GATACATAGA 1140

CGGATAGTGC AAGCCAATGC CATTTGGATC ATAAGTTTGA ACAATATGTG TATAGACTTC 1200 

TGCTTGATCT AAATAAATGG CTTCTTTTTC TGCGACTGCT GCAAATTCAC TGATTAATGT 1260

TTTAACAAGG GATTCACTAA CAGGTAGTGC GCCAAATTCG GCAATATTAC AATCAAGGAT 1320 

TGTACACAAA CCGTTTAGCG TGCCATTGAC ACAGGCTTTC CGCCAAATAG AGTAGCGAAC 1380 

ATTGCTACTA TAGCTTGGaT TTAAACCAGC TTTTTGGAAG ACATCAACGA CTTCTAAAGC 1440

AAATTTTTTA CCAGAAGGAT CAATATTTTC TAATTCAATT TCGcCATCAC CTAAAAGTTT 1500 

AACACGTCCC GGTCCTTCTA AACCAGCTGT CCACAkGGTA ATGCCAACCA AGATGTTTtC 1560 

TTTTGGtACA wATTTtCCtA GTACATCTTC ATGACCTAAG CCATTTAATA AACATAGTAC 1620 

ATACGTTTTT TCAGTAATCA TTGGTTGAAT CGCTTTAAAC ATT 1663 

(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1587 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 



A AAAAAPAnP 


GAATCCCTTC


CCAAAATCTT 


CTTTCATTTT 


CAAGAAGCCT 


TTAAATAACT 


60 


r**T A nfZTP ATP 
o 1 rlrVo 1 Ksr\ 1 \s 


TTPTPTPPTw 


AACTTAATGT 


TTTTATCTGA 


GCCTAAAGCA


AAATGCAGTG 


120 


Til bbbb 


ATPPaPP ATP 


ATAGTATTCG 


TGTATGACGA 


TTCAGAAATA 


CCAATATTCT 


180 


CACbbAAlbb 


TTTTTPPT A A 
11111 l rvrv 


GCCCAAGTTA 


ATTTTTCAAA 


TTTATAGGCT 


TGTGGGGTTG 


240 


ACACGCGACG 


I Pit\ 1 b 1 1 1 


PPATTAATAT 


ATTGTCGTGT 


TGTTTCCTCA 


GTTTCTTTTA 


300 


CAAAAATTTG 


1 1 b/i 1 1 /\i/\l 


PPPAAPGAAG 


TCACTGCATT 


TCCGTATTCT 


TGACACTTAA 


360 


CAATTACATC 


Ab A 1 HAnnL 1 


A ATTPATPAA 


CTAATGGACG 


AATCCCGTCA 


TGGATAACAA 


420 


TAGTATCTIb 


ibbbl 1 lurtn 


TPTTPTTTTA 
lOll^XXXi n 


AGAAATTAAC 


CCCATTGTTA 


ATTGATTCTT 


480 


GACCAGAA1 1 


1 bb 1 bb 1 boa 


ATA AT PP ATT 


GTAATTTGTC 


AATATGGtAC 


TCTTTTGCAT 


540 


AGGCATCTAb 


iblllbll bL- 


P A APPTTTTT 
^./txH«*V^ 11X11 


TACACACAAC 


TAAGATTCGA 


TCAATGAGTG 


600 


GATGCTTTTG 


AAAAbAI Ibb 


Z\ATPTPTAPA 

t\J-\ i O 1 O 1 rtOrt 


TAATTATCGG 


TTTTTCCTCT 


ACCATAATAA 


660 


ATTGTTTAGb 


AA 1 b 1 bb 1 bb 


PPP AT TPPTT 


TACCGACCCC 


ACCGGCAATA 


ATTAAAGCTG 


720 


TAATCA1 lib 


Pi. 1 /\Ln/Wi 1 b 


PTTTPTPT A A 


CTAGTCAACC 


ACTAGTTTTA 


TTTCAATTTT 


780 


GTTTwACTTA 


1 A 1 A I 1 b A 1 b 


A APAPAPPP A 


AAACTGCATT 


CAATCTTTTA 


CATTTTCTCA 


840 


TAACTATCAG 


rp 7\ T\ ^rp rp rp z"' A m 
TAAb 111 b Altl 


T A ATTTATPP 


TAGACCTTAA 


TCGAACTGTA 


AATGAGAGTG 


900 


TAAAATATTT 


rp rp /Tp 7\ t\ A T f 

Tb 1 b 1 AAAl b 


Zl Zl A A A ATPP A 


TAPAAAAAAG 


GAAGTCGCTT 


CTGTAGAATA 


960 


AAGTTAACGA 


bAAbbAAl lb 


nr ZIP A A A APA 


PGACTTCCCT 


ATGAATGATT 


TTACTACAGA


1020 


AATTGTGCAA 


Ab 1 b 1 Hb 1 b/^ 


PT A A APPPPA 


TTTAAATGAA 


TTATTCCGTT 


CGCACTTAGA 


1080 


7\ 7\ 7\ A f~*r* ATA 

AAAAbbbA I A 


a&p&r*APTPP 

Ar\br\brib 1 


TAPPPACTGA 

1 rtV^OOrt^ 1 Ort 


ATTAACGGCT 


TTTTTAGATT 


ACGAAAAATA 


1140 


TGATbbbAbl 


obi 1 1 lrlrVl l 


PAPPTAATTP 


GAGAAACGGT 


TCTTACTTTC 


GATCAATCAA 


1200 


A APPPA AT AT 


GGTGAATTAA 


CATTGGAAAT 


ACCTAGAGAT 


CGTAATGGTG 


AGTTTAAACA 


1260 


ACAAACTTTA 


CCAGCCTACA 


AAAGAACAAA 


CGATACATTb 


bAAAbb Ab i r\ 


TTATPPATTT 
1 inl 111 


1320 


ATTCGAAAAA 


GGTGTTACGA 


TGTCTGAAAT 


TGCTGATTTG 


ATCGA7\AAAA 


TGTACGGTCA 


1380 


TCACTATACT 


CCACAAACCA 


TGTCCAACAT 


GACTAAAGTT 


CTGACTGAAG 


AAGTAAATGC 


1440 


CTTTAAATCC 


AGAGCCTTAA 


ATGATAAGTA 


TGTCGCTATT


TTTATGGACG 


CTACTTACAT


1500 


TCCACTAAAA 


cGTCAAACcG


TATCCAnAGA 


AGCGATTTAn


TnTTGCCATT


GGTATACGAG


1560 


TAGACGGCAC 


TAAAGGAGTA 


CTGAGTT








1587 



(2) INFORMATION FOR SEQ ID NO: 279:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 
CATTAAAAAG ACGTTCATCA AAAACGAGCC CAAGACTTAC TTGACCTAAC GCTTGTGCGC 60

CTCCTAAAAG AAAAAATAAC ACCGTCTGCA CAAAGACATA CAAGGAATCC ATCCCTAAAA 120 

TGTTTGGCGT TAGGAAATGA TTTTGCGTGA CTGTTTGAAA ACTAATAGTT GAAAAAGCCG 180 

CAGCGATTCC AACAAAAATA AAAGCCAATA ACTTTTTGCC TCGTAACGCC AAAGCAAAGG 240 

ACCAATTGCC ATACGTGTTA TAAGTTAAwT AmCAAAGGCT CACAATTACT ACGGCAGCTA 300 

AAAGCAAAAG TAATGAAAGC CATTTTTTTA AAGGCCGTTT CATTGTTTGC GCCCCCTTAG 360 

TAATAATAAA ATGAACAAAA TGCTCCCTAA TGTTCCGACA ACCACGCTGA CAGGAATTTC 420

ATAGGGGGCA ATCAATACTC GTGCTAAAAT ATCACAGATT AACAAGAACA AACTCCCGCA 480

AATCCCAGTT AACCACAAGG TATTTTTCAT ATGATCGCCA TAACGTAAAG ACACGAGATT 540 

GGGAATCACA ACCCCTAAAA ACGGAATATT CCCCACCATC AATAAAACGA TGGCACTTGC 600 

TAAAGCAACC AGCGTCAAAC CTAAAAACTG AATTCGCTGG TAATGGACGC CAAGACTATT 660 

TGCAAATGCT TCACCCATAC CAACGACAGT GAAATGATAA GCAAATAAAT AAATAACAAA 720 

CCACAAAGGA ATCATTAAGA ATAACCATTC ATAATTCCCT TTCATAACTG TGGAAAAATT 780 

TcCTTGTAAC CAAGCAGACA TATTTTGAAC CAATTGGTAT TGATAGGCAA AAAATGTGGC 840
AATTGAGCCT AGAATGTTAC CAAACATCAC GCCCACCAAA GGGACCATCA CTTGATTCTT 900 

ACTTGGTAAT AATCGAATGA GCGCTAAGAA TAATAAGGTC CCGCTTAGCG CAAAAATAAA 960

AGCAATTGCC GTTCGATAGA GAAACGGAGC TGTTGGGAAA AATAACATTG CGACTAAAAT 1020 

ACCCAACCTT GCACTATCGG TTGTTCCAGC GACACTAGGA GCGACAAATT TATTTTGTGT 1080 

GAGTTGCTGC ATAATCAATC CAGAAATACT CATCGTACTA CCAGCAATGA TTAAACTCAA 1140

CGTGCGGGGA ACACGGGTGG TAAATAAAAC AAATTGTTGC TGCGCATCTA ATTGAAACAA 1200 

CTGAGAAAGT GCAATGTCCT TCACCCCTAT CACTAATGAC AATAAACTGA AAAGTAATAG 1260

TAGAACAAAC AAACCAATCT TTTTCATGAG AAAACTCCGT TTTTATTTAA TTGAGAATGA 1320 

TTCTCATTCT TAATTACAGT GTATGAGAAA ACCATCACAA ATGCAACCAC TTTTTTTCCT 1380 

TTATTAAAAA CTTCTTCTCA AGCCGACTGA CCAACGACTT ATCTAAGATA TGCTATAATG 1440

GAGGGAAACG TTAGGAAAGG AGTGTTACTG AATGGCGAAT GATGACTTAC AACAACGTCT 1500 

GGATAAAGGA ATGTACGGTA CGCCCCTCGT TAATCCTGAA gAACAACACA AATACATGGG 1560

CACATTTCGT GAACGCTGCC GTCTCTCCAT GACCGTCGCT GAAATGAAAG ATGCACAGAA 1620 

CCAAAAACAT TTATTAGAAG AATTAGCGAA CATCCtGAAG TTACAGTACT TTTAAATGGC 1680 

GAAATCTCTT CTGACCTGCA AAGTACCTAT ATTAAATTAT TAAATCAGCA TGGCGCAAAC 1740

TTTAAGATTG TTAACAATTT TGTGGAGAAC AACCCTGACT CTCTCGGCTT ACTGTTAGCT 1800 
GAAAAACATG CAGTCGATGA ACCTGTGATT GACGTTACAG AAAAATATCC TCAAGCAACA 1860

GAAACGCCAA AAGAAGAGCC AACAGCTAAA AAAAGCTTTT GGCAAAAACT ATTTCACTCA 1920 
TAAAAAGCTC AAGGAAAGTT ATGAGCGCTT CCATTTTAAA TAGAATTATG CTATTATGAA 1980 
AATGAAAAAG AGAGGTATGC GGGTACATAC CTCTCCTTTT ATACCAACAC CAGTTATAAG 2040

AACTGGTGGC TTACTGAGTT AAGTTTTGAA TCTTAACCGT TCGGCCTACG AAGCTTGTGA 2100

ACGGTTATTT TTTATTGTTT TTCGTTGCTT CAAGGATACC GAAAATCAGT AGTGCAACAA 2160

ACGCTGCAAA ACTAATCATC AATTCCAATG CCGCTTCGAT TGACAAAAAA ATGCTCCTTT 2220

CGTTAAAGTG CATCGGATAC ACATAGGCAT CACCTGCCTC TCTAACTCAG AGCCACCGTC 2280

ATACAACTTT CATTGGTAAT AGCTTTAGTA TAACCTATTG GGTGTACCAA GAAAACGTTT 2340

TTATAGGAAT TTTCTATAAA ATGATATAAG TTATGAGCGC TTCCATTTTA AATAGAATTA 2400

TGCTATTATG AAAATGAAAA GAGAGATATG CTACTACATA CCTCTCCTTT ATACCAACAC 2460

CAGTTATAAG AACTGGTGGC TTACTGAGTT AAGTTTTTGT TGTTCTTTAA CCGTTCAGCC 2520

GTCTAAGCTT ATGAACGGTT ATTTTTTATC GTTTTTCGTT GCTTCAAGGA TACCGAAAAT 2580

CAGTAACGCA ACAAGGGTTG CAAAACTAAT CATCAATTCC AATGTCGCTT CGATTGACAA 2640

AAAAAATGCT CCTTTCGTTA AAGTGCATCG GATACACATA GGCATCACCT GCCTCTCTAA 2700

CTCAGAGCCA CCGTCATACA ACTTTCATTG GTAATAGTTT TAGTATAACC TATTGGGTGC 2760

ACCAAGAAAA CGTTTTTATA AAGAATACCG CCAAAAAATC TATTATGAAT TTTTTGGCGG 2820

TATTCTTTTT TTATTGAACC ACTTCTGGGT ATAAAAGTTT TAAAACGTCC ATTGTCAAGC 2880

GTCTACCCTG TCCAGCGTAT GGATACACAT GCATGTGCAT GGCTGGATTA AAGTTCGCTT 2940

CGATAATCCC GTACGTTAAG CTATCACGTG TGCCTTTTAC GTCTTTGTCA GGAATGATTA 3000

AATCAATGCC ACAAATTTTG GCTCCTAAAG CAGCTACGGC CTCAATGGCG ATTTGTTTAT 3060

AACTATCATC AATGACATCG GTCATATCAA TCGAATCCCC GCCCGTGCTA ACATTAGAAT 3120

TTTCTCGCAA GTACACGATT TGCTCTTTTT CTGGCACAGA ATAGATAGTT AAACCTTGTT 3180

CTTTCAACAT TAGTTTTTCT AAATCATTTA ACTGGATTAA CTCTAGTGGT GCACGGTGAT 3240

TGGTCCCCCG CAATGGATCA CTATTTTTAG CGGCCACCAA TTCTTCTACA GTGTGTTTGC 3300

CATCTCCGGT AACATTGGCT GGCACGCGCA ACATGATGGC TTTTACATCA TTATCTAACA 3360

CAAAGAACCG ATATTCTGTT CCAGGTAAAA ATTCTTCAAT TAAAACCGCT GTGTCCTCTT 3420

TAAAAGCAAT CCGTAACGCT TCCGTAAAGT CTTCCAACGA AGCTCCTTCT TTAAAAATCG 3480

TAATTCCTAA ACCGTAATTC GTTGATTTTG GTTTCACGAC AAACGCTTTA TTGGCGTAGC 3540

GTAAATGTGC TTCTTGTGCC TCAATAAAAG ATGAAAATTC TTCACCGCCA GGCACATGGA 3600

AACCTGCTGC AGACAAAATT TTCTTTGTCA CGGTTTTGTT TTCCATAATC AATGGTACCA 3660

CGTAGTTATC TTTGCTAGTC ATGTTGGCAT TTTTGACGTA TTCAATGTGC TCGCCATGTT 3720

GCAGTTTCAA AAATTGTTCT TGTTCATCTA AAATTTCGAT TTCCAAACCT TTTTGAATCG 3780

CATCAAATAA GAAAATTTGT GTGGATAATT CCATCTCACG GAAACCAGCT AATTGATAAG 3840

GGCGTTCAAA GGCCATACTT TGATATTGGT TTCCAAAAAT ATTTCCCAGT TCACTGTTGG 3900

AGTTTTCTTG AATAATCGTC CACATTTTGC CAGAAACGGT GTCTTGTGGA TTCCGCAGTT 3960
GTTGATAATA CTTACCGACA ACTTCTTTGC CTTTGCGAAT GCCTAGAGCA TCTAACATAT 4020

CAATCATTTC TGAAAAAATC CGATCGCCTT CTGCAATTAA CTTAATCGTT TCATGAGGAT 4080

GACCAAGAGC CACTTGTTCA TTAAAAATAT CCCCAGTTTT CACCCATTCA TCCGCTTCTT 4140

CTTTTTCATC TGTCCACAAT AAATACAACA TGAAATAATG TAAGAAATCC ACAGTATCTT 4200

CCACAATGCC TAAACGTGAA AAAGGATTTA AGTCTAAATT ACGCAACTCG ATATAGCGAA 4260

TACCTGTTTT AGGCAGATCA GACATTTGTT TCCCGCCACG TAAGCGCACA GGCGCATAAA 4320

ATTCTTTTTC TTCAGAAAGT AAACCATTTT CCACCATGCG ATGAATATCT TCTAAATAGC 4380

GTTCCAATGA GGCATACGAT ACTTTCACAT TGTCATGATT TCTGTAGCCA TACGTACTAT 4440

TCCGAATACT GCGAACAGGT TCTTGCGGTT GGTCGTCGTA GACCCTAAAG TAGCGGTCTT 4500

CACTAACTGG TGAAGCCCCA AAAAGATACG TAATTAACCA GCGATAACGT AAAAAGTTAC 4560

GGGCAACTTT CATGTACACT TTCGTTTTAA ATTGTTTACA ATCTGTCACT TCGGATTGTT 4620

CATCATATAA TTGCTGAATC AGGGCTTGGT CATATTCAAA ATTAAAATGA ATTCCGCTGA 4680

CCATTTGTTT TCGTTTGCCA TACTCTTTTG CCAAATAACG ACGATATAAC ACCGCATCAT 4740

ATTGATCTAA TTTAGCAATT TTAATCTCTT CATCTTTTGT TGGTAATTGT GGCGGCATAC 4800

TTAATGGCCA CAGCATTTCA TCTTCTGGAA TCGAACGACG AGCCACATCG TGAATGGCAT 4860

CTAAAAAACG AAGCATTTCT GTGCCGCTAT TTGCTACAGG CGTGATTAGT TCTAATTGTG 4920

TTTCACTAAA ATCTGTTTGA ATATATGGAT GATAAGAACG GTTACCAAAA ACCGTGGGAT 4980

GCTCAGTTGT CGCTAAAAGC CCTTCTCGTG TACTACGTTG GTTTTCTTTT TCTAAaCCAA 5040 

AACGAGCCAT CAATACGTAA GGACGAACAT tTTTCTTTTG CATTAATTCT CTATAATTCA 5100 

TCTTGTCGCC TTTCTTTCTA CAAACTATCT GTTTCTCTTA CTATTATATA GGAAAACGAG 5160 

TAAATATGCT GTTTTTTTGA ACGTTCGCTT GGAAATTTTC GAGACACTTT TAGAGAGCGC 5220 

CTTTTTCCGT TATACTGAAA ACAACTAGTT TATAGAAAGT GGGGATTGCT CATGAAAAAT 5280 

CGTGCATTAC TTTTAATTGA CTTTCAAAAA GGCATCGAAA GTCCGACGCA GCAATTGTAC 5340 

CGTCTACCAG CAGTGTTGGA TAAAGTCAAT CAGCGTATTG CTGTCTACCG CCAACACCAT 5400 

GCGCCCATTA TTTTTGTTCA ACATGAaGAA ACCGAATTAC CTTTTGGCTC aGACTCTTGG 5460

CAACTTTTTG AAAAGCTGGA TACACAACCC ACAGATTTTT TCATCCGAAA AACACATGCG 5520 

AATGCCTTTT ACCAAACAAA TTTAAATGAT TTaTTGACGG AACAAGCCGt TCAAACGTTA 5580 

GAAATCGCTG GTGTCCAAAC CGAGTTTTGT GTGGACACTA CAATTCGGAT GGCTCATGGT 5640 

TTAGGCTATA CGTGTTTGAT GACACCGAAA ACCACTTCCA CCTTGGATAA TGGACATTTA 5700 

ACCGCGGCAC AAATTATCCA ACATCATGAA GCTATTTGGG CTGGCCGCTT TTTGACTTTT 5760 

TTGAGCCTGT GACAAAAATC ACTTTGGATT TTTGTTCCGG GCTCAAAACA TGATAAACGG 5820 

CGGGAACAGA ACCAACTCTA CGCGTTGCTC CGATCTCAAC AACTTCTAAG AGCTAAAGCT 5880 

CTAAGAGTTG AAGGTCcTTC GGAAATAAGC CGAAATTGTT GAGAATCACG AAGAACGTAG 5940 
TGATTCGATG ATGAACAATA CCTACTTGGT CTCCAAAAAT TAAAGAACAC CGGGTAGGTA 6000

TCATGCAACA TCAACTCGTA CCTCGTTGTG TTTTATGACA ATTTTCGGAA ATTTCTTCTT 6060

ATTGCTGAGA AACACTATTT GCAAAGCAAA TAGATGTTAA TCAGTACCTA CTTGGTTCTC 6120

GGAGCTAAAC GGTTCTGTCC CGACCTCTTT TCCAAAAAAT TGTTTGTTCA CTATGTTTTC 6180

TTGTCGTATT TCATGTTAGT ATAAGGAAAA AGTGTTCAGT TGATTAGTTG AAGATTTAGA 6240

GTGTGGGACA TAAGTCGATT AGGCTTGTCT CTCATGTTCT GTCTTTTTTT ATGTCATCAT 6300

TCACTAAACT AATAAATAAT TGGAGGTATT TTATGAACAA ATTTTTTACA GCCCAACGTT 6360

TAAAAGACAC CGCCTATGTA ACAGTCGGTG CATTCATTTT GGCCATTTCA ATCAATGCGG 6420

TCTTATTGCC AAATAAACTT GTCGCAGGCG GAgCAATGGC ATTAGTATTG TTATTAATTA 6480

TGTTTTCGGG ATTAGTCCCG CCATTGTTCT TTACGCCATC AATATTCCTC TTTTAGTGCT 6540

TTGTTTCTTA CTTCTTGGGA AAGAGGTCGG TGTCAAAACC ATTTATGGCA GTCTTATTTA 6600

TCCATTTTTT GTAGGAATTA CTTCAGGGAT GCCTGTTTTA ACACATAATA TTTTTCTAGC 6660

AGCATTATTT GGCGGTATTA TTACAGGTGC TGGCCTAGGC TTGGTTTTCC GTGGCAATGC 6720

CTCAACTGGT GGGACAGCAA TCATTTCACA AATTGTCAAT AAATACTTCA AAGTTTCCTT 6780

AGGTATTGCT ATTTTATTCG TCGATGGTTT AGTCATTTTA TCAGCAATGT ACGCTTTTAA 6840

TGCAGATATT GTGTTATTCT CTTTAATTTG TTTATTTACA ATTGGTCGTG TCGTTGACAT 6900

GATTCAAGTT GGTCTGGTTC GCTCGAAGAA TGTGATGATT ATTTCACCAA AATATGTCGC 6960

AATTCAAGAA CGCTTATTAC GTGAACTAGA TAAAGGTGTG ACCTTGGTGC CAATCGAAGG 7020

CGGTTATCGA AACGCGAAAG GCATGCTCTT AATGACCGTT ATTCGTGAAA AAGATTTTCC 7080

CCGCTTAAAA GAAGCCATTC TTGAGATTGA TGAAGAAGCC TTTTTGATTT CAATGAGTGC 7140

GAGTGAAGTC TACGGAAAAG GATTTAGCTT GAAAAAAGTA GCTGATTCTT ATGGCGTTGA 7200

AGCAACTAAT GCCAATAATT TACAATAGTT TTTTGTAGGA AAAGrGCCGC AAAATCTGTA 7260

GAGATTTTGC GGCTCTTTTT ACATCTTCAT CCAGACAGGT TCTAAGCTAT TGCCATCTAG 7320

GTCTTGCACT TCTAAGCCAT ACATTTGCTC TTCTGGTATT CCCATATCAA CTCGGTAGCA 7380

GTTGCCTCCG TTAGCTTTGG CGGTTTCGCC AAACTGTTTT ACGGCTTCGG CACTCTCTAA 7440

ACTAAAGGAA ATTAATGCCC CACTGGTTGT TTGCGCATCA ATAATTTGCT TTtCTTGGAT 7500

AAATTTTCCA TAAAAGTCAT GGTTCAACAA CATAATCCAA AATTGATCAT CCCACATCAT 7560

CGAACTAGCT TCCTCTGTTG AAAATTCCTC GTTCTTTTTG AATCCAAGTT TTTCATAAAA 7620

TGCTGTAGAA CGTTGGACAT CCGCTACTGG GAAGTTAACA AAAACCATTG TGCCCATGTG 7680

ATTCATCCCT TTCTTTTAAT CATCTACGAC AGCTAACTGT GCGTTGCGAC AAAATCAACA 7740

AGCTTGTTTG ATTTATACTA GCATATTCAT AAAAGAAATG TTACTAAAAC GTTTTCTTAA 7800

GGTAATTCTC GATAAGCCAA GTGCTGAATG ATTTCACTAT GTTGTGGATA ATCGTTTAAC 7860

AATTCCGCTT GAGTAAATAA CTCGAAGTCT TCAAACTCAA AGCCTGGTGT AACCACACAA 7920
CTGACTAACG CAAATCCTGT CGCTACTGTA GAGCCAAAGA TTGTGCCGGC AGGAACTGAG 7980

AACGAAAGCT GTTGCCCTTG AACTAAGTCT TTCCCTAACT GAACAGCTTG ATACGTTCCA 8040

TCAGGAAAAA TACAGTGAAC GGTTAACGGC TGACCATCGT GAAAAAACCA TAATTCATCC 8100

GCTTTTAAAC GATGAAAATG TGACGGACTT TCTGTATTTA ATAGAAATAA AATCGCTGTA 8160

TGCAACGCCC GTTCGCCTCG TTTGGTCGGA ATCAATTGTT CACTTTTATT GGTTTGTTTA 8220

AAATAACCGC CCTCAGGATG CGCTTCTAAT CCTAACTTCT CAATCCACAT TTGTTCTGTC 8280

ATCTTCTCCC TCACTCCTTC ATTTTTTCAG AAACGATTTA GGAAGAACTT AGAACATCCG 8340

TCCCAATCCT AAAATAAACG GCGGGACAGG AGTAGCTTCT TCATTCTTGA AGCTAAATAC 8400

TCCTGTCCCA CTTTCTTCCC AAACCTGTTT ATTTTCTACA ACTAAACATC TTTTAATAAG 8460

CCTTTAAATA AGCCCATTGC GAAATCGTTG GCATTGAATG GTTCTAAATC GTTAATGCCT 8520

TCACCTAGAC CAACTAATTT CACTGGCAAG TGTAATTCAT TCCGAATAGC AATGACAATA 8580

CCACCTTTGG CTGTTCCGTC TAGTTTGGTT AAAACTAAGC CAGTGACATC GGTTGTTTCT 8640

TTGAACTGTT TGGCTTGTGT CATCGCATTT TGCCCAGTAG TTGCGTCCAC GACTAAGAGA 8700

ACTTCGTGAG GCGCATCTGG AATTTCTCGT TGAATGACTC GTTTAATTTT TTCTAATTCC 8760

TTCATTAAGT TGACTTTATT TTGTAAACGA CCTGCTGTAT CCACTAGCAA AACATCGGCC 8820

TGTTCTGCTT TGGCACGTTC CACTGCATCG AAAACGACCG CTGCTGGATC GCCGCCAGCA 8880

TTCCCACGTA CAACTTCAAC GCCAGCCCGT TCGCCCCAAA CAACTAATTG ATCGATGGCG 8940

CCCGCTCGGA ACGTATCGGC AGCTGCCATT AAAACTTTCT TTCCTTCCAA CTTGTATTGA 9000

TGGGCTAATT TACCAATGCT GGTTGTTTTA CCGACTCCAT TAACACCGAC AAATAAAATC 9060

ACGGTTAAGC CATTTGGCTG TAAGTTAATC GCATTATTTT CATTAATTCC TGCTTCTTCA 9120

TACAAATCAA CCATTTTTTC AATGATGGTA TTTTGAACTT GAGCTGGTTT TTTCACATTT 9180

CGCAGTTTCA CTTCTTGTCG CAAGGCTTCT GTAATTTTTA AGGATGTATC AAAGCCTACA 9240

TCTGCACCAA TTAACGTTTC TTCTAATTCT TCAAAGAAAT CTTCATCGAC GCTTCGGAAA 9300

TTAGCAAACA ATTCGTTCAA CCGTTGACCA AATGTTTTAC GCGTTTTTTC CAAGCCTTTT 9360

TCATATTTTT CTTGGACCTG CTCTTCTTTT GACTCTTCAA TTTCTTCTGA AACCTGCGTG 9420

TCTGTTGGTT CTGCCAAAGG AGTCACGACG CTACTTTCTT CCTCAGGCGC TTCCACAACA 9480

ACTTCTTGTG TTTCCTCTGT CAGGATTTCT TCTGGCTCTT CTTCTAACGC TTCTTCTACA 9540

TCAACTAATT TTTCCATTTC AGCGTTATCT GTTAAATCGG CTGTTTCTGT GACTTCTTCC 9600

GCTGTCGTTT CTGGCGTTAT TTGCGCCTCT TCTGTTGCGG TTTCTTTTGA TTCAGCAGGC 9660

GCTGATTCAT CACTTGGTGT TTCGTCTACT GTTTCTTCAA CAATCTCTTG TTTTTCTTCT 9720

TCCTTTTTTT CAGCGGAAAA AGCTTTTTTA ATTTTATCAA AAAATCCCAT TCTTATTGGC 9780

CTCCTTCCAA CGGCGTTAAA ATATATTTTT CGACCGCTTT GGCAACGCCA TCTTGGTCAT 9840

TCGTGTCTGT GACGACATCT GCGGCTTCTT TAACTAATGG AATCGCATTT GCCATGGCGA 9900
CACCTAAGCC AGCGTATTCG ATCATCGGTA AATCGTTTTC TTCGTCCCCT AGGGTCATGA 9960

TTTCT7CTGC ACGGATTCCT AAATCTTTGG CTAACAAGGA AATACCATAG GCTTTCGTAA 10020

TCCCT1TTGG CATAAATTCT AATAAGTTAT TTCGCGTTTT AATAATTTCA AAGCGCTCAT 10080

AAAAAGAGGC TGGAATTTCC TTGATGCGTT CATTTAAATA CGCTTCATCA ATCGCAACAA 10140

CAACTTTATT ATAAATTCGA TTCGGGGTTA ACTCTTCTAA TTTGTAGGAC TCAAAGGTTA 10200

ATAATTTGTT TAGAGAACTA TAAATTGAAG GATAATTAGG TGCAGAAGGC AACTGCATCA 10260

CGACCTCATC GGATAAGACA TCAAACGGCA TGTTGAGTGT GGTTGCTAAT TGGTACAGCT 10320

CATGAATCGC TTCCAAGGGC ATCAATGTTT TCTCAATAAT GGCCCCTGTA TCATTTTTTT 10380

GAACCAACCC GCCGTTAAAA GTAATGCTAT AATCGCCTTC ATCTTGTAAA CCTAATTCTT 10440

CTAAGTAAGG ACCAATGGCT GCCAAAGGTC GACCTGTGCA GATAACAACT TTCACGCCTG 10500

CTTGTTTCGC AGCCATCAGT GCTTGTTTAT TCCGTAAAGA AATCTCTTTT TGGCTATTTA 10560

ATAATGTTCC ATCTAAATCA ATAGCAACTA ATTTTATCAA ACGTTCGCTC CTCCTTGCTT 10620

CTGGCTAATT GCTTTTTTCA ATTGCGCCAC CTTCTTTGAC CTCTTCCAAG CGAACTGAAA 10680

CGATTTTTGA AACCCCAGAT TCTTGCATAG TCACGCCATA TAAAACATCT GCGGCTTCCA 10740

TTGTCCCTTT ACGATGCGTC ACCACAATAA ACTGTGTACC ATCTTCAAAT TCGCTTAAAT 10800

AATGGCCAAA ACGAGCAACG TTGGCTTCAT CTAAAGCTGC TTCTACTTCA TCTAAAATAC 10860

AGAACGGAAC TGGTCGAACA CGAATAATTG AAAAGAGTAA CGCAATCGCC GTTAATGCCC 10920

GTTCGCCTCC AGACAACAAG CTTAAATGTT GTAATTTTTT CCCTGGTGGT TGCGCCTCAA 10980

TCTCAATCCC TGTATTCAAG AGATCCTCTG GGTTGGTTAG CACCAGTTCT GCCCGTCCGC 11040

CACCAAACAT ATTAGGAAAT ACCACTTTAA ACTGGCCACG AATTGCTTCA AAGACTTCTT 11100

TAAACCGTTC TTTCACTTCT TGGTCCATTT CATCCATGGT TTCAAATAAT TGTTCTTTCG 11160

CATTTAATAA ATCATCTCGT TGACTGACTA AAAATTGATG GCGCTCATCG ACTTGTTCAA 11220

ATTGCTCGAT GGCACTTAAG TTAACAGGTC CTAACCGCTC AATTTCTTGT TTTAGGCGTT 11280

TCACTGTTTG CTGCGCTTGC GCTAaATCAT CGATTGGGAA ATACGCTTCA TAGGCAGCTT 11340

CAAAGGTTAA GCTGTATTCT TCCTGTAAGT AACTTAACGA GCTATCTAAT TGCATTTCTG 11400

CACGATTTTT TAGGACTTCG ATTTGTGTTT TATCTGCTAA ATACTGTTGT TGGCGTTTGT 11460

TTTCTTCCGC TAATTTTGTA TCTAACTCGT CTACTTCTTC TTGCAACGCT TGTCGTTGAC 11520

TACGTGCCGT TTGTAAAGAG GTTTGCAACG CTGTTTGTTT CTCCGCTAAT TGAGCGACTT 11580

GTGCCGCTAA GCCTTCTTCG GTTAACTGGT GATCGCTAGA ATGGCTACTT AATTGTTGTA 11640

GCTGTTGACG GATGGCCGTT TCTCGAATCA ACMTTCGTC CAATTGTTCT TGTTTGTCTT 11700

GCTTTTGACG AGCAAAATGG GCACATTGTT CCGCCGCTAC TGCTTGTTCT GCTTGAACTG 11760

TCGTCAAGCG TTCTTGCGCT TGGGCTTTAA AGGTTTCCAT TTGGCTAGCT TCTTGTTCCA 11820

CTTGTTTCAT TTCAGCATCT AAACGTTCTT TCGTTGCCGT TAAGTTTGCT TGTTGCTCTG 11880
TCAATGTGGC TTTTTTCGTT TGATATTCGG TTAAGAATTG ATGCAATTCC CGTGATTCAT 11940

ATTCAAACAA GCGTTTTTCT TTTGTTAAAC GAGTAATCGT TTCTGTTTGA TTGGCTAATT 12000

TATTGTCAAT TTCTTGTTGT TTTAAGCGAT TTTGTTCACC AGCAGAGCGC AACATTTCTG 12060

CACGTTCCGT GGCTGTTTTC ACTTCTTGCG AGAGCGCTTG GACTTCTTGT TCTACACTTC 12120

TCAGTTGTGT TTCTAATTGA GTCATTTGTT CAGTAATTGT TTGAAGTTCT TGCGCTTGTG 12180

AAAATAGGCT TCCTTGGTTG CCACGTTTAT TGGCTCCCCC AGTCATCGAA CCGCCAGGAT 12240

TCATCACATC GCCTTCCAAT GAGACAACGC GATATTGGTA ATTGACTAAT TTTGCTAACT 12300

GGTTGGCACT CGTTAAATCT GCCGCTAAAA TCGTGACGCC TAAGAGATTT TGAATAACGG 12360

TTTGTACTTG TTCTGGATAA CGAACTAGTT CACTGGCAAT CCCCACGAAG CCCGGTGCAC 12420

CAGCCAAACG ATTCTGAACC ATCGCCGATA CAGAGCGTGG CTTAATGGTA GTCAATGGTA 12480

AAAAGGTTGC TCGGCCACTG TGTTGTTGTT TTAAGAAGGT GATGCCTGCT CGACCATCTT 12540

TCTCATTTTC CACAACAATA TGTTGCGCCG CACCACCTAG GGCCGTTTCA ATCGCCAACG 12600

TATATTCTTT AGGCACTTCA ATTAACTCAG CCACTGCGCC AACAATCCCA GTTAATTGGT 12660

TTTTGTGACG CAATACAGCT TTCACGCCTT GATAGAAGCC AGCGTAATTT TCTTGAATTT 12720

CTTGCAAACT TTTTTGGCGT GCCTtCGCTT GTTGCACTTG ATTCATCgCT TGGTACATAT 12780

CGTTTTGACG CTGTGCTAAt CGTTCACGCT TAGCTTCAAG CGTTGCTTTC AACGCAGTGT 12840

ATTCTTCCaA CTGTTCTTGT AAGCCTTGTT TTGCAACTTT TTGCTCTTTT TCTAGCGTCT 12900

CTTTCATTGC GAGAGCTTCA ACCATCTGTT CTTCTAGCGC TTCGTGTTTT GCTAACGATT 12960

GTTGATTTTT AGCTGTTTCT TGTTGATATT GTCGTTCTAA ATATTTTAAG TCATTGGCTG 13020

TATTGGCTTG TTCTTGCATG ACTTCCACAT ACTGACTGCG CAATTCTTCC ATTAATTCTT 13080

TAGAAGACTT ACTATATTTT TCCACATCTT TCGTTGCCAA AGCCAATGCT TCTTTTAACG 13140

TCTGACGTTG TGCCGTTTTT TCTGCGATTG CTGTTTCCAG CGTCTGCAAT TCTTCTCGAT 13200

AACGAACAAT TTTTTCGGCC GTTTCTGCCA ATGTCTCTTC GTATTCGCTT GCCGTTTGGG 13260

AGGTGTGTTT GGAACGTTCA ATTAAAACAT TTTTTTGTCC TTCTGCTTGT TTCAACGCTT 13320

CTGTCACTTG TAATAATTGT TGTTGTTCCG TTTCAATTTG TTCATCTAAA CGATTGCGCT 13380

TGCTGCGTAA ACGTACCAAT TTGCCTTCTA AGTCATGGAC TTGCTTACTT GCGCCAGCTA 13440

ACTTTTCTTC AATGGCTGTT AATTCTTGCG TTTTCGTTTC CCAAATAGCT TTTGCTTCTT 13500

GAATTTCGGT CACAGTTAAA TTGACATCAA TTTCGGTAAG TTCTTCTTTC AAAGCTAAAT 13560

ACTTTTTAGC CGCATCTGCT TGGGCGGCTA GAGGAACTAA TTGGTCTTCC AACTCGTAAA 13620

TGATATCTTG GACACGGCTT AAGTTGTCCT CGGTTTCAAA AAGCTTTTGT TCAGCCTTTT 13680

TCTTTCGTTG TTTGTATTTT AAAACGCCCG CTGCTTCTTC AAAAATCCCT CGACGGTCTT 13740

CAGGTTTACT ATTAAAAATC GCTTCAACTT TCCCTTGTGA AATAATTGAA AAAGATTCTT 13800

TGCCTAACCC TGAATCCATA AATAAATCTT GAATGTCTTT TAAACGGCAC GCTTGTTTAT 13860
TAATAAAAAA GTCACTTTCG CCCGTTCGTT TTAAACGACG GGTGACACTA ATCTCACTAT 13920

AATCCAATGC TAAATAATGG TCGCTGTTGT CTAACGTCAC GGTAACTTCC GCAATATTTA 13980

GCGGCTTACG ACCTTCCGAA CCGGCAAAGA TAATGTCGTT CATTTTGCCA CCGCGAAGGT 14040

TTTTCGCAGA TTGTTCACCT AATACCCAGC GAACGGCTTC CGTGATATTA CTTTTCCCAC 14100

TTCCATTCGG ACCAACCACT GCTGTCACAT CATCTTCAAA TTCAATAATG GTTTTATCTG 14160

CAAATGATTT AAATCCTGTA ATTTCAATTC GTTTTAAATA CACGAATATT GCTCCTTCCA 14220

GCCTTACTGA GGAATACTTT TCAGTGCCCG CTCAGCGGCA TCCTGTTCAG CTAACTTCTT 14280

CGATTTTCCT TGGCCTAACC CAATGAGTTC ACCATTCATG TAAACTTCAG TGAAAAAGGT 14340

GCGGTCATGA GCAGGGCCTT CTTCTTTAAT TAAGCGATAT TCAATTGAAA CATCGCCTTT 14400

GCGTTGTAAA ACTTCTTGTA ATTGTGTTTT GTGATCCATC TCATGTGAAA AAGCACCGGC 14460

ATCAATTTTC GGAAAAATAA CGTCTTCAAT AAATTTcTTG cTGCGCCAAC TTTTTGGTCT 14520

AAGTAGAGGG CACCTAAAAA GGCTTCAAAT AAGTCACATA ATAATGATGC ACGTGTTCGT 14580

CCGCCCGATG CTTCTTCTCC TTTACCTAAT AAAATGTAGT TGTCGAAGTG ACATTCTTTC 14640

GCAAATTTGG CTAAACTATC TTCCCGAACG ATGgTGCGCG CATCTTCGTT AATTTTCCTT 14700

CTGGAAGTTC TGGGAATTTT AAATACAAAT ATTGTGAAAC AATTAATTCT AAAACTGCAT 14760

CTCCTAAAAA TTCAAGACGT TCATTATCGG ATAATTTTAA ATAGCGATGC TCATTCACAT 14820

AGGATGAATG AGTAAAAGCT TGCTCTAATA GATTGACATC ATGGAAAACA ATGCCGTAAC 14880

GTTCTTTTAA CTCTGTTGTT AACTGATTGT CCATTCTCCA CATTCCTTTA CATTTATTTA 14940

CACCTTGTTT CATTATACTT TTTTTTTCAC TTTTTTTAAA TGGTTCTATT CAGAAAAAGC 15000

CTGTTATTCA CTGTGTTCTC ATTATTTTTT AGAAAAAATA AATAAATCAC CTCTCCAGCT 15060

ATCAAATCAG CCACTGCTAC CCCATAAAAA AAGAGAAAAC GTAGCAACGT TTTCTCGGAA 15120

TTTTTTGAAT AAACGTACCA GCAAGAGACA ATAAAAAAAG CATTCAGTTA TTCACTTTCA 15180

CTAAATATTA ACATCTCATT TAAAGATTGG GATAATTCTT CTTTTTGTCG TCTTTTTAAT 15240

TTTGTTTTAG TCGCATCATG ATTCTTTACT GTGCCTATAT GAACCTGCAC CTCTTTCTTA 15300

TTATTTATAG CAACAATAAA AGGTACAGAA AGCGACTCAT CCCCAGTATA TTTTTGATAA 15360

AAT 15363

(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3214 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280:

CACTATTTTA TTTATATTAA TGCAACGTtT CTATGTAGAA GGCGTGAcTT CTGGCTCTGT 60

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CAAAGGCTAG AAAAActCGT TTGGTTTATA GCCAAACGAT GTTTTTTATT TTTTGGCGTT 120 

GTTTTCCTTA CACGTTTTTC TTCGTAGCGC TTGATAATTG GACAAAATTT TCATGACCGC 180 

CAAAAAGTGT TGATTGACCT CTTGATAATC CTCTAAGGAA AGAACACTAA GCGTTTCAGA 240

TAGTTGCTGT TTTACTTGAT TTTCTTTTTG CTTAAAGAGC TTCTCGCCTT CTTTTGTCGT 300 

ATAAACATTA ACCACGCGGC GATTTTCTTG ATTATGGCGA CGTTGAACAA TCGCTCGTTC 360 

TTCTAACCCC GCAATAGAGC GACTCACTTG aACTTTCGAC GTTCCTACTT GGTTGGCTAA 420 

CTCCGTCATA CTAATTCCTT CATTTGTTGC CACAATTAAT AAAATTAACA TGTCTAACTT 480

GGTTAATTTT AGATCCTCCA AGTCTAGGTC TTTAAAAAGC AATTGGCTCA TATATGAAAA 540 

AATTTCAGTA AAGACACTAA TATCTAAATT TGATGAATGC TTCTCTTTAT CCATATGTTC 600 

TCCCTCGTTT CCCACCTAGT GTATACTAAA TTCACTTGAT TATCAAATGG TAATGATTGA 660 

TTTTTGTTAA CTATTATGTT ACGTTATACA CTAACTTAAG GTTGTATTGT TGATAAATTG 720 

GAAGAAGGAT GCGAAAAATA TGCTAGAACT CAAAGAGATr AAAAAATATT ATACTGTCGG 780 

CGGAACGACC ACTAAAGCTT TAGATGGCGT CTCTGTCGCT TTTCGAAAAC AAGAATTTGT 840 

AGCCATTCTG GGTCCCAGCG GCTCTGGTAA AACGACGATG TTGAATGTGA TTGGCGGATT 900 

GGACAATTAC GATTCTGGTG ACTTAATCAT TAACGGCAAA TCAACAAAAA ATTTTAAAGA 960 

AGCCGACTGG GATGCTTATC GGAATAACTC AATTGGTTTT ATTTTTCAAA GTTACAACTT 1020

AATTAGTCAT CAAGGGATTT TGGATAATGT CGAGCTTGGT ATGACACTGA GTGGTATTTC 1080 

AAAAGCAGAA AGACGCCAAA AAGCCGAAGA TGCACTCATC CGTGTCGGTT TGAAGGACCA 1140 

TATCCATAAA AAACCTTCCC AATTATCTGG CGGTCAAATG CAACGGGTTG CCaTTGCTCG 1200 

CGCTTTAcTn AATGaTCCCG ATATTTTACT TTGTGaTGAA CCaACTGGTG CATTGGaTTC 1260 

AGAAACCAGT CTGCaAATTA TGGCACTTAT CCAAGAACTT TCGAAAGAAA AATTAGTTAT 1320 

TATGGTGACC CACAATCCTG AATTAGCTCA CGAATACGCT GATCGAATTA TTGAATTTTC 1380 

AGATGGCCAA ATTCGTcATG ACTCGCGTCC TCATATTGAA CATGAAAAAA AAGAACCTTT 1440

CGAATTGAAA CGCACCAAAA TGAAATTCAC GACTGCGCTG CGTTTATCTT TTAATAATAT 1500 

CCGCACTAAA AAAGGGCGAA CTTTTTTAAC CGCCTTTGCT TCAAGTATCG GAATTATCAG 1560 

TATTGCGATT GTCCTTTCTC TTTCAACAGG ATTCCAGAAA CAAATTGATA AAACTCAGTC 1620 

GGAAACGTTA GCGCAATTTC CGATTACCAT TTCTAAAATA GCGACCGATC AAGATCCAGA 1680 

AAGTTTCAAT AATCAGGACA AAAAAGGCAC CTTCCCGAAA GAAAAAGAAG TGACTGCTAA 1740 

AATTAGCGAT GCGGACCGTG CGCAACACAC GAATTTAATT GATCAAAAGT TTATTGACTA 1800 

TGTCAATCAC ATTGATCCTG AATTAAGTAA TAATATCGGC TATACTCGTC TAGTCAATAT 1860

GrACTTATTA AGAGAAATCA ACGGGAAAGC ACAAGCTGTC AAATTTTCAA aCkmyrCACC 1920 

CGACGGcCAA TCAAACGCGA TGGCTTCTAT GATGGCTGCC CAAACTGGTG TCGGCGTTTC 1980 

TGCTTTTCCT AAACAACTAG AAAATGGAAA AAACAATTTC TTAAAAGACA ATTACTCGTT 2040
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GCTTGAAGGA AATTATCCAG AAAAAGAAAC CGATGTCGTT TTAATCGTCG ATAGTAACAA 2100 

TACCACCAAT ATTAATGCTT TAAAAAATCT TGGCTTTGAC GTGAAAGATA ATCAAAAGAT 2160

TGCCTTTTCA GACATTGTTG GCACAAAAAT GAAATTAGCC AATAATAATG CATTTTACAC 2220 

CAAATTACCG ACAGGCAACT TCATTCCTAA CCAAGATTTG CAAGCTGTTT ACGATAACCC 2280 

AGAAAATACT GAATTAACAA TTTCAGGGAT TTTGCGAATT AAAGATTCTT CAACAATGAA 2340

TCTATTAGCA CCTGGAATTG CTTATAGCGA TGCACTTTCA ACTAATATGA TTGCCAAAAA 2400 

CAAAACCTCT GACATTGTTA AAGCGCAAGA AGCCAGCAAC ACTAACGTTA TGACTAACGA 2460

AACATTAGAT GCTTCCGCGA AAGAAAACCT GTTGTCTTAT TTAGGAGCGA ACGAAATCCC 2520 

TTCAAGTATT ATGATTTATC CAAACGATTT CAAATCAAAA GATAAAATTC TTGATTATTT 2580 

AGATGCTTAT AACAAAGGAA AAGATAAAAA AGACCAAATC ATCTATTCTG ATTTAGCTGG 2640

AACAATGACC CGCTTAACAG GTGGTTTAAT GGATGCCATC ACCTATGTGC TGATTGcATT 2700 

TGCTGGGATT TCCCtTGTTA CAAGTATGAT TATGATTGGG ATTATCACCT ATACTTCTGT 2760

CATTGAGCGG ACCAAAGAAA TCGGTATCCT GAAAGCGTTA GGAGCTCGTA AAAAAGATAT 2820 

TACCCGGGTG TTCGATGCCG AAACTGCTAT TTTAGGCGTT GCCTCTGGTA TTCTTGGTGT 2880 

TGTGATTGCC TTTCTCGCAA CATTTCCGAT CAATGCAGTG CTCTATAACC TGACTGATTT 2940 

AGAAAATGTT GCCACCTTAA ATCCAATTCA CGGGATTATT TTGATTGTAA TTAGCACAAT 3000 

CTTAACCATG ATTGGTGGCC ATATCCCTGC CCGAATGGCA GCGAAAAAAG ATGCGGCAGT 3060 

TGCTTTACGT GCAGAATAAA TAACAAAAAA TAATCCTTTA ATTTCTCTTA AAGGATTATT 3120 

TTTTGTTTGT AATTTTTATA AAACCACCGT ATACTAAATG ATTGTGAATG CTTAATAAAA 3180 

AGAAACGAGG TAATTTCATG TTAAAAAAAC CATT 3214 
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 
CGCGGCTTGA TTTAGAGGAA GTCCAACAAG AAATTAACgA TGCGAACGAA tTGCAAATGA 60 
AGCGAAACAA TCGTGGCAAG CAAAAAAAGA ACAACACAAA GAAGCAACAT CGAAAGAAGA 120 
ATAAGGAGCA AAGACTGTTT TCTGAATAAA GGAAAACAGT CTTTTAAAAT ACTAAGAAAT 180 
CACGGTTTTT TATGGTAGTT TGCTAATTTT TTGTTATTTT ATCGTAAAAT ACGGTAAACT 240

AAGAAGGAAA CTTTGAGAAA ATAGATAAAT TATTATTTGC TAAAGGAGCA AACTTCTTCT 300 
AAAATAAAAA GGaAGTTTAA AAGAAAGTAT GGTAGCAGAT GGAAATACAA TTTTTAGGAA 360 
CAGGCGCTGG TGTACCagCA AAACATCGCA ATGTCACAGG aATTGCGTTG aAATTATTAG 420

aTGAACGAAA TGCAGTTTGG CTCTTCGATT GTGGGGAAGG GACTCAATTG CAAATTTTAA 480
AAAGTAGTAT TCGCCCAAGA AAAATTGAAA AGATTTTTAT TACACATTTA CATGGGGATC 540

ATATTTTTGG TTTACCTGGT TTGTTAAGTA GTCGTTCTTT TCAGGGCGGG ACAGAACCTT 600 

TAGAAATCTA TGGACCAGTT GGGATTGCTG ACTTTGTCAA GACTTCTTTA CGGGTCTCGC 660 

AATCGCGGTT GTCCTATCCA CTGAAATTTA TTGAATaACG AAAGAAAATG ACgTTATTTT 720 

TAAAGATAwA CAATTCACTG TTCGCTGTAA TATCTTGGAC CATGGCATCA CGAGCTTTGG 780 

CTATCGAATT GAGGAAGCAG CGCATGAAGG AgAwTTmCAA GTAGAGAAAC TTCAAGCTTT 840

AGGGATTCCT TCTGGGCCGT TGTATGGCAA ACTCAAACGT GGTGAAACGA TTGTTTTTGA 900 

TGGACAGGAA ATTAATGGAC AAGCGTTTGT GGGAGAGCGT AAACCTGGAC GAATCGTGAC 960 

GATTTTAGGA GACACTCGCA AAACCAAAAA TAGTGTAACT TTAGCACGAC GAGCGGATGT 1020

TTTAGTCCAT GAAAGTACGT TTAATAAACA CGAAGCAAAA ATGGCAAAAG CCTACTTCCA 1080 

CTCAACCAGT CAACAAGCAG CTGAAGTTGC AAAAGAAGCG CAAGTAAAAC AACTTATTTT 1140

AACGCATATT AGTGCCCGTT ATTTAACCAA AGAGGCCTAT CAGCTTCAAG AAGAAGCCCA 1200 

AGAAATTTTC CCCAATACAA AAATCGTAAA AGATATGGAT ATCATTGAGA TTCCATTTGC 1260 

CAATGAAGGA GGAGCTTAGA TGGATTTAAC AAATAAAGTC GTTGTCGTAA CTGGAGGTTC 1320 

AGCTGGTTTA GGTGAACAAA TTTGTTATGA AGCAGCTAAA CAAGGGGCGG TCGTCGTTGT 1380 

CTGCGCCCGC AGAATTAATT TGATTGGTAA AGTTCGCGAG CAGTGTGCTG TTTTAAGCGG 1440

ACGAGAAGCA TTTTCTTATC AATTAGATAT TGCCGATCCA GAAAGTGTTG AACGTGTGGT 1500 

TGAAGCTATT TCAGCAGAAG TTGGTCCTAT AGATGTTTTA GTCAACAATG CTGGTTTTGG 1560 

CTTGTTTGAA AATTTTGTTG AAATTGACTT GGCTGTTGCA CGCCAAATGT TTGATGTCAA 1620 

CGTTCTAGGA ATGATGACGT TTACTCAAAA AGTAGCTATT AAAATGATTG AAGCTGGGCA 1680 

AGGACATATT ATCAACGTTG CTTCAATGGC TGGGAAGATG GCTACAGCTA AATCAACCGT 1740

TTATTCTGCA ACGAAATTTG CTGTGTTAGG TTTTTCAAAT GCTTTACGTT TAGAATTGAA 1800 

ACCGTTGGGT GTAGCAGTAA CGACTGTCAA TCCAGGACCA ATCCAAACAG AATTCTTTGA 1860

TAAAGCGGAC CCGACAGGCA CATATCTAGC TGCGGTGGAT AAAATTGTTT TGGACCCTAC 1920 

GAAATTAGCG AAAGAAGTGG TTGGGAGTAT GGGGACTTCT CGACGTGAGA TTAATCGTCC 1980 

ATTTGTCATG GAAGCGGCAG CCCGCTTTTA TACATTGTTT CCACATTTAG GAGATTTCAT 2040

AGCGGGAAAT ATTTTAAATA AAAAATAAAG GAGTCGATCA AGTCATGAAA AATCAATGGC 2100 

GTGTTATTTT GGGCCTTGTG TTGGTCTTAA TTGTTGTTAT TTTTGCTGTT TTAAATAATC 2160 

AAGCTGTTCC AGTAAACTTT GGTTTTACTA AAATTAGTGG ACCGTTAATC TTAATTATTT 2220 

TAGGTTCTGC AATTATTGGC GCACTAGTTG GATTACTAAC CTCCACAACC ACCATTTGGA 2280

ACCAACGTAA GGAATTAAAA GCTGTACAAA AAGAGTTGGA TATTTATAAA AATGATATGG 2340

ATAAATTAGT CAAAGAAGAA ACGGAAAAAG TTCAACGTTC TTTTGATAAT CAATTGGCTG 2400

ATTTGCAAGC CAAACAAGCG GCAGCACCAC AAGTCTCAGA ACCAGTGGTC AACGAACAAA 2460
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CGTbAAAbAC 


apaapTapap 

AbAAb 1 fVbAb, 




t p. t r a p. p. t a p 


TPPTZlTTflZlT 
lV^olrt.1 lunl 


PP, T T Zl TP. TP. Z\ 


2520 


AACCTCGTGT 


b AA 1 bAAbHu 


paapaaaaat 


a app aapapa 


PTTTZ\TPaPT 
^ 1 1 Inl ^rt'w 1 


A Z1PTTTPTTT 
Arto 1 1 1 b 1 1 I 


t joy 


TAAAATAAAA 


Lib! b A 1 A 1 C 


T>f /~"P A TTT A T 


PTAAPAPaTa 


TP ZiP HPTTTT 
1 ^rt^rtu 1 I 1 1 


TTTPTPnr AT 

1 1 1 C 1 C Ab A 1 




TTAAAAAAAT 


f~+ rp rp rp /^» rp rp rp 7\ 

GCTT. TC i 1 1 A 


fPTTTTPPA j\ 


1111 bL 1 


D. T Z\H T T Zl P T T 
A 1 AA L 1 1 1 


PPTTAPPAPI 1 

bb 1 1 AbbAb 1 




GACGTCGTGA 


AA7VAArp/~ , AAA. 

AAAAATCAAA 


PTBTPa aTPP 
C 1 A 1 bA/\ 1 uo 


paPTTapapa 

b Alj 1 1 ftbrtbH 


PZiZiZiP.ZlPZlP.Zl 


&TT APPTPT A 
nl 1 rtb-b 1 b 1 A 




n * rn m rn tv rp tv /"» 

GAATTTATAG 


AACAAT 1 AAA 


AAAAbAAbAA 


Al I AA1 bbAl 


TnaTT PP P P Zl 
1AA1 IbbbbA 


Z1PTTTTZ1TPP 
nb 111 l/\lbb 


0 Q 9 n 
Z 0 *C u 


CACCGCAATA 


m /*• J"* TV Z* 1 TV TV 

TCCGGACAGA 


AbAAGCAl AA 


Cb i AAb i 1 1 C 


TAPATPPP5P 

1 ACA 1 CCbAC 


T 21 TPP a Zl P Zl T 
1 A 1 bbAAbA 1 


9 p ft n 


ATT TAT GAT C 


CGTTCTTAAT 


bCATbAJ. Al b 


bAbAAAbCbb 


i AbCCCbCAl 


^/"•TiPPAAPPP 

bCAbbAAbbb, 




GTCGAAGCGG 


GTGAACAAAT 


CCTTGTCTAT 


GbbbAl I Al b 


ATbCAbAl bb 


r* tv rnf> APT APT 

CA1CAC 1 Ab 1 


1000 


ACCACGGTTA 


TGAAAGAAGC 


tv t\ m /"i a tv m rp a 

AATCGAATTA 


GTCGGTGGCA 


TGGTTCAbTA 


CT iTCTACCb 




AATCGTTTTG 


TTCATGGCTA 


tGGGCCTAAT 


AAAbaCGTAT. 


TTGCTbAACA 


AATTPAAPRB 

AAI ibAACAA 




GGAGtCCAAT 


TAATTGTAAC 


m m /** tv rp tv tv rp 

CGTTGATAAT 


GGTGTAGCsG 


GACATGAAbC 


A A TiT 1 A APTAP 
AAI I AAC 1 AC 


11 ft fl 


GCTATGGCAC 


AAGGAGTAGA 


m m tv tv rn/^/' 1 rp rp 

TGTAATCGTT 


ACTGACCATC 


ATOA A TTPPP 

ATGAA1 TbCC 


TPAAPAATTB 

lb AAC AAI 1A 




CCAGAAGCCT 


ATGCGATTGT 


CCACCCTAGA 


CATCCACAAb 


b 1 bAC 1 A 1 CC 


111 lbbAbAl 


1100 


TTAGCTGGTG 


TTGGCGTGGC 


TTTCAAAGTC 


bC 1 AC lbCb 1 


T 1 APTTPPTPA 

i AC1 lbblbA 


ZITTPPPa &TT 
All bbb AAI 1 


1 1fiO 

J J D U 

) 


GAGTTATTAG 


ATTTGGTAGC 


AAT 1 bbbACb 


Al lbC IbACl 


TPPTTTPATT 

1 bb 111 bA 1 1 


a flPPPZiTPA A 


3420 


AATAGAACGT 


m m rp y"i tv Tv TV 7v rp 

TTGTCAAAAT 


bbbbbl ICAA 


AlbAl 1CAAA 


PTPPTPZ1TPP 
C 1 bb 1 bA 1 bu 


aaTTPPTTTA 


34 80 


GACGTTTTAT 


T AC AAG AAG C 


bbb lb 1 bAAA 


KAbbAAbb 1 b 


TTTPZIPZIZIP.ZI 
111 LfAofirtUri 


a T P H. Zl T T P. C C 


3540 


TTTACCATTG 


/— i TV f~+ rp rp rp rp 

GACCTCGTTT 


AAACbCAJ. I A 


bblCbAi 1 Ab 


bbbAAbbbbL. 


a pp a PPTP.TT 

HLLrtUO lull 


3600 


GAATTGATGA 


CTACAT I J.GA 


1 b AAbAAbAA 




TPPP T H Zl H T Zl 


TATTPtATPAA 


3660 


CAAAACAATG 


TV TV r~"» TV TV TV TV TV 

AACGAAAAGA 


CATTb TAACb 


ACbAl 1 bCCA 


a ap a a r* r*r*c ,r v 
A/VbAAbbbb 1 


PP&TTTAATP 


3720 


GACCCGAATG 


CGCCGGTCCA 


TA1 I 1 1 AbCb 


A A APA APPPT 

AAACAAbbC 1 


CfT* ATPAD PP 

bbb A 1 bAAbb 


PPTPTTAPPP 
b.o 1 b 1 1 nouu 


3780 


ATTGTCGCGG 


« /-i tv rp rp TV rp 

GCCGCATTAT 


GCAAbAAACb 


err- a a)\ppa a 
bbbAAACCAA 


CCA1 1A1 111 


APPAATAP AT 
MoLnn 1 Aon 1 


38 4 0 


GAATCTGGTA 


CGACTGCTAA 


AbbC 1 C 1 bbA 


bbbAbbAl 1A 


PPPPZiTTPZIZi 
bbbbnl loHM 


TPTPT ATP.AP, 


3900 


GCGTTGAACG 


AAgTGCGlbA 


bCAAl ibAC 1 


CAl ill bb 1 b 


PPPZlTPZlTZiT 
bbb A 1 K^r\ Inl 


PPPAPPPPPA 


3960 


A i bAC I I T AC 


C 1 b 1 1 b AAAA 


1 Al 1 ill 


PTTPa Z1PZX HP 


Zl TT TP. P.P Zl P Z\ 


PTTTATTHAA 


4020 


AAAAAT^OAAA 

AAAAATCAAA 


TTGAi Al bbb 


TaaTPPaPAA 
1 H-A 1 ubntnn 


pnpTTPPTPa 

bnb 1 1 ul 1 L.M 


TTZiPTPZlP.aP, 


PTTAHPHf^TT 


4080 


TCGCAGGCCA CGACTACTTT TATTGATCAA TTGCGTATTT TGGCGCCTTT TGGGACCGAT 4140

AATACTGTTC CAACTTTTGT ATTTAAGGAA ATCACGCCTA CGCAAATTCG GCAAATTGGT 4200

GCAGATAATG CACATCTGAA ATTTCAAATG AATCAAGAAG GGGCACAATT GGATGCAATT 4260

GCTTTTCAAA TGGGGCCTCA AGCGGATGAA CTAGCACAAG GAACCGCCGA TGTGGCAGGC 4320

CAACTTTCCA TCAATGAATG GAATGGTCGT AAGAAACCAC AATTAATGGT GACTGATTTT 4380

GCTGTATTGG GAAGACAATT ATTTGATTTT CGTGGCAAAA ATAATCAAAC GAAACCAATT 4440

CCTTCAGAAG CAACAGCTTA TCTTTTGTTT GATGAAAAAA ATCAAAAATT CATTTCAGAT 4500

CCAACCGCTA ATATTATTGT TTGGTCAAAC CAAGAAGAAT TGGTGGAAGC TGTTTCTCAA 4560

AATCAAATCG AACAACTGGT TTTTGTCGAC TGTCCTGTGG AAGCAATCAC GGTTAAAGAA 4620

ATTGTTGAAG CGACTGAGAT ACAACGTATA TACATGATGT TTATTTCACC AGAAGAAGCT 4680

TATTTAAACG GTATGGCTTC ACGTGAACAA TTTGCAACAT TGTATAAATT TATTTTGCAG 4740

CAAAAAGAAG TAAATCTACG TTCCCAGCTA TCGAAGGTGG CAAATTACTT AAATATCCAA 4800

GAAAAATTAT TAATTTTCAT GATACAGGTG TTTTTTGACT TAGGATTTGT TACAATAGAA 4860

AGCGGTGTTT TAAACAGTAT TGAGAAACCC GACAATCGAC CATTGACTGA AAGTCAAGTG 4920

TACCAACAAC GATTGAAAAA AATAAAAACA GAAGAATTTC TACTTTACAG TGATTGTCAA 4980

ACAATACAAC AATGGCTGTG GAATGAGGAG GACAAATAAA AATGGATTTA AGAGATTACA 5040

TTGCAAGCAT TCCAGATTAT CCTGAGAAAG GGATCGTGTT CCGTGACATy TCTCCATTAA 5100

TGGCTAATGG GGATGCCTAT CGTGAAGCAA CGAAACAAAT CGTGGATTAC GCGAAAGAAA 5160

AAAGAATTGA TATGGTTGTA GGACCGGAAG CTCGTGGCTT TATTGTCGGC TGTCCAGTGG 5220

CTTATGAACT AGGGGTTGGC TTTGCTCCGG TTCGTAAAAA AGGAAAACTT CCTCGTGAAA 5280

CCATCGAAGT GACTTATGAC TTAGAATATG GTTCAGATAC ATTAACGTTG CACAAAGATG 5340

CCATCACACC AGGCCAACGT GTTTTGATTT GTGATGACTT GTTAGCAACT GGTGGTACAA 5400

TTAAAGCAAC CATCGAATTA GTCGAACAAT TAGGTGGTAT TGTGGTTGGT TGTGCGTTCC 5460

TAATTGAATT AATGGATTTA CATGGCCGCG ATAAAATTGA TGGCTACGAT ATTGTTACCT 5520

TAATGGAATA CTAATCTAAT TTAATTAGAA AGAGCGAAGG ATTTAATCCT tCGCTCTTTC 5580

TTTGACATAT TTATAAAAAC GGAGCGTAGC TTGGGGGGAA CACACCGTTT TATTATTTAT 5640

TAAGGGAACT TACGATAAAC GTATCTTGCT TGCTGTTGTT TACGTGTACG TGTATAACGC 5700

TTCTCATCCC ATGACATAAA CCAAAGAATA AATAACGGCG TCACGATCAT AAAAGCAAGA 5760

CGTAATCCTA ATGTTCCnAA GAAAATACCT ATAACCAATA CAAAAAATAA AAATGCCAAC 5820

ACGACGAATT 5830



(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1959 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 
TATTTGACTG CTTAAACGTA TAATGAAACG CCGAGTCATT TGTTGGTATC CCTAAACCAT 60
TTTGTAAAAA TTCAATCGCT TCCTGTTTAG TATAAGATTC TGCTTCTAAT TGCGTTCCTA 120
AATGCGGATT TTTTTGATCA ACAAATTGTC TTGCTTTGTC ATTATTAGCA GACTTTTCTA 180
TAACGACTTC CTGTTGCGAA TTCCTACTTC CTGAACAGCC AGTTAATACA AATAAAATCA 240
CCAACAACCC GAAACACTTA CTCAAAGTTT 7CTCCTCCTC TCCCTTTTCA TTAGTGATAC 300
CTGATGATTT CTTTTATATT TTAGCATGCT T3ATATAACG AATGGTGCTC AAAAAAGACA 360
GAATCAACAA GTAAAAtTGT CgACTCACC? ■ TCTCAACTA AtATTAAAAA CCATCTGTCT 420
CAATATTTtC GAATTtCCTG kAAATTACTG ATAAATAAAA aCAGCTCCTT TTGCGtATAC 480
yTATATTATC AGCACCTAAA gGAGGAAmT? TcTTGAAGcA AAAAAAtCTA CTCACTTATC 540
AATCACTTGT AGCCTTGTTG CTaGTATTCA GTCTCTTTAG TTTTGACCCA TCGGTTTCTT 600
TTGCCACCCG TTCTGGAAAA ACGCCTGTT? CAGTTGAACT AGAAATTGGA GGGCTGCCTG 660
GTGATGAATC GGTTGACGAT GCCATTGATC C7GATT7AGA AAATCCTAAC ACTAGTTTTG 720
ACTTGCTCTT TATTCCGAAA GAATTTACTT TTGAAACCCA AGCGATTTCT GGTGATTTAA 780
CGGCAGTTCC TATACGACCT TCTGAAGGCA ATACTCATAC AATGAGACAC TTCGGGATGG 840
GGGATGTTCG CGGAGCcTTA ACTGGTTGGC ATGTCACTGC TGAAATCCCA CACATGCGAA 900
ACGAAGCTCA TTCTTTACTA GGCATCATTA CGTTTCAATT AACAGGCAGC TATGCTCGCT 960
ATGACAAAAC GTTAAGGCGT TTCTTTATGA CAAATACAAT GTATGGACTG GATTTTTCAG 1020
AAGATCCTGC TGCACCCGAC TTCCCGACCA ACCCAATTAT TATCGGTAAC GGTGCGACTT 1080
TAATGAGTAA CGCTGGTGAT AGAAAAGGAC AAGGAA7GTG GAGCGGCCGC ATGACCGATA 1140
TTTCTCTTGC AATCCAAACC CCTGTTTCAC AATTATTTCC TGGCGCATAT ACAGGTTCGA 1200
TTATCTGGAA TTTAATCTCT GGACCTGTAT AAAATAAAAA AACGCTTAGA AAACTAGCGG 1260
AATCACTACT CTTCTAAGCG TTTTTTTATA TCATAAGGAC TATTTGCTCC TTCTATTTGT 1320
CTAAGCATGC TTTACCACTT TTCGTTTAAC GAATCACTTG GCCCCAACGG TAATCAAACT 1380
GATACTCTTG ATTACCATTA TTCAGGTTAA T7TTGTATAA ATAACGCGGG ACTTTAATGA 1440
ATAACGCATC CTCTACTTGA AAAGCTTCAA TAGGTACaTT TTCTTTAGTT AAAAGCAACG 1500
GTGTTTGCCA CTCGTATGTT TTTTTCTGCC AATTGTATAA GCCAACTATC GCTkGATTAC 1560
TTkGTCTTTT CTTGTATGAC AGAACATAAT AATTTGCTTC ATCTTTCTTG CCTAATAAAA 1620
CTGGTGCTAT TAGTTGCTCT TCTAGTTTTC CTTGCACTGT TGTACGCTCT TCCGATGACA 1680
GCTCTTGACC AATTTGACTC TCATTTGAAT CTACCGTCCA TTTAGGCGCT TCTTGTGTTT 1740
GAAAGACTTC TTTGAAAGTT GGGTCTTGTT TTAACTGCCA ATTTTTCAAA TCTAATTGAT 1800
AATATCGATT GTtTAATCCA TAAATATAGA GATAGCGATT ATCCATAAAA CGATAATCAA 1860
CGAAATCAGA GCTTAAATAG TCTTTTAATT CAGGGAATTt CyTCACTAAA TCTGCTtCAG 1920
AAAACTGkTT TTTCCtGTTT GnACnAGnAA GAAATGGCC 1959
(2) INFORMATION FOR SEQ ID NO: 283:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3485 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

GATAACCTGA CGAGCCATAA GATTGATTAT AAAAACCCTC ATAAAGTTAC TTCTGCACAA 60 

GTTGGTTCAT ATAGTAAAAC AGAAACAGAT AACCTATTCA TCAATAAATC AGATGCTGAA 120 

AACGGCTTAT TGGTAAGAAA AAGTATTGAA ATAACAGACT TAAATAATGC TAAAGAACCT 180 

GGCATATACT CGATTCCAGC TACAGGAGTT GAAAACAAGC CGCTACCAAA CTCTGGAAGT 240 

TTGTTCGTTA GTAAAGACCC AGGAGGAGTC AGACAACTAT TTCAAACGGA AAGGACTATC 300 

GTTATTCGTC AATTTGGTGG TATTCCTTCA AAGTGGACGG ATTGGAAAGA AGTAGCATTT 360 

AAACCGAATG TTGTGAATTT AACTGAACCT CAACGTATAG GsGGAACTAA aGAATTTGCG 420

GATATTCCAT TAGTAAATGG TACAGAAtAG CTTTGAAAGA AGATATATTT TTTTATCAAA 480

AGACTGGGCT TGATGAAGTC CaAGCAGAcT ACAAAGCTTC TTTTAGGGAA GAAACAAATA 540 

TGTTTCTTAC ACGAGAAGGA AACAGAGTTG ATGCTTATTT ACGAGTTAAT GTGGTGGATG 600 

TCACCAAACT AAAAACTGCT TTTGTTCCGA TTTTTCAAAT TCCTGATGGA TTTAAAATTG 660 

ATCTTAGCAT GAGGGAGAGT TTCTGGAATG TTCCATTGAC AGTAACTCAA TACACCTACC 720 
CACAAGGTAA CTATGGTGCA TTATATGAGA TGAACGTGAA AGGAATCCGA TTCGGAAGTG 780
ACCGTCTTGG TAATCATTAT CTATACGGAA GTTGGCATAC TAATGATCCA AAACCAGATG 840

CTAAATTTAA GTATATGCTT TATGTGAACT TATATAATTT GTCTGAGGGA AGAGATTTTG 900 

TTTCAGGTTC ATACAAAGGA GAAATTTATT ATGTAGCTGT TGAAATTGAT GGACAGTTAG 960 

GrGAAAAATT AAAAGTATCT GATGGTTTTT ATAAGTATAC GGTAGGAACT AAAATAAATA 1020 

AGGcTTCTCA AAATGCATGG GTGATTGGTT ATGATCAGTC AGGAACAGAA GTAACTAGAA 1080 

GCAAAATCAA AATATTATAG AGAGTAAGGA GAAAACAAAA AATGAAAAAA ATTTGGAAAT 1140 

ATGGACGTAC TGGAGGGCAA GAGCTACAGG TATCTGATGA TTTCCCAATG CAAGTTCCTT 1200 

TTACAGATGT AGTTCCTTTA CCAACCGTCA ATTTAGAAGA CCAATTTTTT ATTCCATCTG 1260 

AGAATAGATG GAAAGAAATT TCTAACCAAT TAGATAAGGA AAATTTGGAT AATTTAAGTA 1320 

TATTATATAA AAACCTTGAA AAGGATAATG AATTATTAAA AGCTAAAGCA GATAATCTTG 1380 

CTCTTCTAAA TTCTAAGCTA ATGCTCAATG ACCTTAATAT CCAAAAAGAA AATACCCTCT 1440 

TGAAAGCTAA AGCAAATGAC CTAGCTGAGA TTGGTGCGAA ATCAATGTTG TCCATTGTAC 1500 

AAATTACTGG GGAAATAGGG AAAATTAATG AGCAACTTAA AGGAGGTGCT AAATAATGTT 1560 

TACTTTTGAT GATGTTAAAT TGATGTATGA TTGGGGTCTT TATACTGATG ATGAAGTAAA 1620 

GCTATTTGTA CCTACATGCA TTACAGAAGA GGAGTTTAAC GAGATTGTAG GGAAAGAAGG 1680 

TTAGTCAGTT GGAGTTAGAG CAAAAAGTAA AAGAACATGA AAAACGTCTT GGTGATCACG 1740

ATAGAGAAAT AGGTCGTTTA GATAGACGAA CGATGACTTT ACAAGAGCAA CTTAATGCAA 1800 

ATTTAGTTAG ATTAGATGAG TCAAATAAAT TCTTACGTGA ACAAAATATG AAGCAAATGG 1860 



AGCAAAATAG TGAAATTCTA AATGCTATTT TGAATAGAAA CAGTGAAGCg GATGAAAGAA 1920

AAGACGAACT AAAAAAACTT AACACTGAAA ATATATGGAA AGTAATACTA GCTATATyTG 1980 

TTTCTAGCGG AGCAATAACT ATTTTATTTA ACTGGTTAAG CACATTTTTA GGAGGCGCTA 2040

AATGAAAATT AATTGGAAAC ATAAATTCAC AAGCAGAAAG TTTTGGGCTG CAGTGACrGG 2100 

rGTAtCaTtG CTTTGkTaGC aGTTTTCAAT GTAGATGATT tAmCaTCTGa AAAAGTGGTC 2160 

ACTTTAGTAG CAGCTATTGG CTTATTAGCT GCATATATTG TTGGCGAAGG ATTTGTTGAT 2220 

TCAAATAGAG ATAATTAAGA AGTCATTCTA AAATGGCTTC TTTTTTTATA TAAAAAATTA 2280 

AGAAAGAGGT TTTAAAATGA AAAAGTTTAG TAAGTTTTTA TTATCATTAG TTGTAGTTAC 2340

AGGATTATTG TTACCAACTG CCGCAGATGC TTATCAAGTG GAACAAGATC CTATCGATTT 2400 

TGGCGGATAT TTTCCAGGTT ATGCGACTAA CGAATTAATT GTCTTGCACG AGTCAGGAAA 2460

TGGGAACAAC GTTGGCCTAA ACAGTCTAGA CAATGAAACG GCATATATGA AACGCAACTG 2520 

GACGAGCGCT TATGTTTCAT ATTTTGTCGG TTCTGGTGGT CGCGTGAAGC AGTTAGCGCC 2580 

AGTTGGCCAG ATTCAATGGG GAGCGGGAGC GACAGCCAAT GCAAAAGCAT ATGCACAGAT 2640

CGAACTTGCT CGAACGAATA ATAAAGAAAC ATTCAAGAAA GACTATGCTG CCTATGTCaA 2700 

TTTGATTCGT GATTTAGCAA CACAAATTGG TGCaACATTT GACTTGGATG ATGGAACAGG 2760

ATATGGAATT GTGACGCATG ATTGGATCAC AAAAAATTGG TGGGGGGaTC aTACaGATCC 2820

ATACGGATAT TTAGCACAGT GGGGaATTaG CaAAGCmCaA CTAGCACAAG ATTTACAAAC 2880 

TGGACTTCCA GAAGATGGTA GCGAAGTTAT TGTAAATCCT GGCAAGCCTA ATAAACCAAA 2940

ATATAAAGTC GGTCAGCACG TTCGCTTCAC AACAATCTAC AAAAATCCAG ATGCGCCAAT 3000 

TTCTCAGcaT ATCAATGCAA ACACATTGTG GACTCAAGTT GGaACCATTA CACAAAAAAT 3060 

AAATGGcCGT AAAAATCTAT ATCGCATCGA AAACAGCGGC AAACTTTTAG GTTATGCAAA 3120 

CGATGGTGAT ATTGCGGAGT TTGGGAAAAC AGCAAACCAA CACCAGCTAA AACATTCACT 3180 

ATTGGTGTAA ATGAAGGAAT TGTGTTGCGT ACTGGATCAC CTAGTTTGTA TGCGCCATTT 3240 

ACGGCGTGTG GCCAAAAGGT GCACAATTTA GATATGATTC GGTTCGTGTG GCCGATGGCT 3300 

ATGTTTGGTT AGGGGGTTCT GATTCAAACG GAACTCGGAT TTGTATCCCA GTTGGCCCAA 3360

ATGATGGGCA ACCCAGACAA TACTGGGGTA CTGGATATTA AGAACAGCTT GAnTTTTCTG 3420

TTAACCCnTA nGTTTAGGAA TAAACTTACA CTTATTTAAA TTTCTCCTGA GTCGCCTTnC 3480
CCCAA 3485

(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 914 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

AGAAGGAAAA AAGAACTGAA AAAGCTTAAA AAAAGCTGGG AAAAAAGTGG AGGAAGAGGT 60 

AAACCACCAA CTTCTCTATC TACGTTACGA TGTGGTCGAA TACTCCAAGA ATATATTGAG 120 

TTTTGCTTAT TTGATATGGA AGAAAATACT CGTTTAGCTA TGTATCAGCC AAAAGAAGGC 180 

ATTTATACAC AAAATGAAAC ACAAATAAAA CGAGTGATTG GGTGGCTTGA ACCGAAACAC 240 

AATGCTAGAG CGGTAAATGA TGTTATTTTT CACATTTGGA AAGAAGCTAA AATAAAGCCT 300 

AAAACGGTCT CAAGATATCT TATACCAGTG AAAAACGGGG TATTTAACCT TAAAACGAAA 360 

CAACTAGAAC CGTTTACGCC CAATTATGTA TTCACATCAA AAATTGCAAC CGCATATGTA 420

GAAAATCCGT CGTTACCTAA ATTTAATGAT TGGGATGTAG AGAGCTGGTT GAACGAAATT 480

GCTTGCGGAG ATGCACAAAT AACTACTCTG TTATGGCAGG TAATAAGTGA TGCGATTAAC 540

GGGAACTATT CAAGGAAAAA ATCAATCTGG CTTATGGGTG ATGGATCAAA CGGAAAAGGA 600

ACGTACCAAC AGTTATTATA TAACCTAATT GGTCCACAGA ATATCGCAAC GTTAAAAATA 660

AATCAGTTTA GTGAGCGTTT TAAACTTGCT TTATTAGTAG AAAAAGTGGC AGTTATTGGG 720 

GATGATGTAG GAGCrGGAAt TTATATTGAT GATAgTTCGG AATTTAATAG TGtAGTGACT 780 

AAkGrGACTA TACTAATTGA AgTTAAAAAT AGAATGCCCT ATAGCGCTCG AATGTATGTG 840 

ACTGTTATTC AGTCTACCAA TGAGATGCCT AAAATTCGTA ATAAATCTAA TGGAACGTAT 900 

CGCCGTTTGC TTAT 914

(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

GGCATAGCCT TGAAGAAGCC TATCAAGCGA TTTTGTCGGC GCCAAGTTTT GATGCTTGGA 60 

TTAAAACCAC ACGCCCTTTG AAATTAGACA ACAATCAATT GTGGCTAGAA GTCCCTTCTG 120 

CAGTTCATCG TGATTACTGG GAAAAAAATC TTTCGGCAAA AATTGTCGAA ACTGGATTTA 180

AATTGACCGG TGCAGAAGTG ATGCCCCATT TTGTTGTTGC CGATGAAAAA GACGCAGCAC 240 

TGGCACAAGA ACTCGAAGAA CCTGCCGAAG AAGAAGTTGT TTTTAGTGAA CAAAGTAAAA 300 

AAGCGATGCT TAATCCTAAG TACACTTTCG ACACCTTCGT TATTGGTAAA GGAAACCAAA 360 

TGGCTCATGC TGCAGCGCTT GTTGTTGCCG AGGATCCAGG TTCGATTTAC AATCCCCTTT 420

TCTTCTATGG TGGTGTAGGG TTAGGGAAAA CCCATTTGAT GCATGCTATT GGTCATCAAA 480

TGTTAGTGAA TCAACCAGAT GCCAAAGTCA AATATGTTAG TAGCGAAACG TTTACGAATG 540

AATTTATCAA CTCAATTCAA ACaAAGACAT CCGAACAATT TCGGAAAGAA TATCGCAATG 600 

TTGACTTATT ATTAGTCGAT GATATTCAAT TTTTGGCGGA AAAAGAAGCA ACATTGGAAG 660

AATTTTTCCA TACCTTCAAC GATCTTTACA ATGAAAATAA ACAAATTGTT TtAACAAGTG 720

ATCGCCCGCC GAATGATATT CCmAAATTAC CTGnAACGAT TAGTTTCTCG CTTTGCTTGG 780 

GGTTTGTCTG TCGATATCAC CCCGCCTGAT TTAGAAACAC GGATTGCAAT TTTGCGCAAA 840

AAAGCAGATG cCGAGCGTTT AGAAATTCCG GaTGATACAC TAAGTTATAT CgCTGGtCaA 900 

ATTGaTTCCA ACATCCGTGA ATTAGAGGGT GCTTTGGTCC GTGGTTCAAG CCCTTTGCCA 960 

CCTATTATGG AGAAGGATAT TACCACTAGT TAACCGGCGG AGCCTTGGAA ATCCCTAAAA 1020 

nCAGTTGGAG TAAAAAAnCA ATnGCCnATT TAC 1053
(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2883 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

CGATTAGTGC CTAAAGACAA GCTCACGCAA TTAACGAAAC AATTCCTTTA CTTTTCGGAG 60 

TCCTTGGGAT TTTATTTATG GTCGGCCAAT AAGAAGCAAG AACGAATAGA ATTACTTTGT 120 

CATATTGAAG AGAGTCAATG TCAACAATTT TATAGAAGAA AACAATCATG GGATTTTTAC 180

GAwAAACrAT TACTAGArAT CTTCCGrCTA CCGACCGCTA ATTTAAATTT ACTGCCAGAT 240 

AGACGGCACA CTGGCCAATT GATGCATGAT TATTATCAAA AATTGACGCA ACAACTTGGT 300 

TACAGAAATG CTCAGTTGcT TAGAACACAA AGCTTTTTAT ACACGAAAGG TTGTCACCTT 360 

CTCCAGTTaC CTCCTTGGTT TTACTATCCT GGATTGAAGC TAATACCTTC TTCTGAAGAA 420 

GATATTCATT TAAAAATGCT TGTTTGGGAA GCACTAAAAA CCGAAGAAGA GCGTTTAGTG 480

ACCAAACAGG AACTATGGCG TATCTTGGAA GCTATTTTTT TGGAAGAAGG GAACTTTGTT 540

TGGCAGCCAC TGCCGAATAT TGGCTTAAAA AAGCTTTTTA AAATTGTGGG AAATAACTTA 600 

TTGAGATGGT TACAAGAATG TTATGTGTTA ATTAAAGTAA ACAATAAGTA CCTAATAACT 660 

AGTATTTTCC TAAGAGAAGA TCCTGAAAAG CTCATTCATT GGCTAAAAAA AGTGAAAAAA 720 

CAACATTTTT TTAGTCACAC GTGTTAAATT ATGATAGTAT AATGAGTAAA ACGTTTCTTA 780

AAGGAGAGTA AACTATGAGC GAAATTAAAC AATTACCAAC ACGTGATGAA GkTCCAACTC 840 

CATTGACTTG GGATTTAACC AAAATTTTTA AAGATGATGC TGCTTTTGAT GTTGCGTATA 900 

ATCAGTTGAT AGAAGAGCTA AATCAAGCAG AATCATTTAA AGGTACCTTA GGGGATGGTG 960 

CAGAAGCTTT TCTAGCTGCG TTAGAATACG TTTTAGATGT CTATCGAAAA GTGGAAACAT 1020 

TATATGTCTA TTCTCATTTA AAAAATGATC AAGACACTAC AAATACAGCT TACCAAGCTT 1080 

TATATGCCAG AGCTAGTTCT TTATATGCAC AAGTTAGTGA AGCTGTTTCT TGGTTTGACC 1140 

CAGAGGTTTT AACATTAAGT GATGAACAAA TCTGGGGATA TTTTGAAGAG CAGCCGAAAT 1200 



TAGCTGTTTA TCGTCACTAT ATTCAAAATA TTTTAGATGA ACGTCCCCAT GTTTTATCGA 1260
TGGAACAAGA AGCTTTACTG GCGGGGGCTA GTGAaATTTT TGGTGCCTCA AGTAATACAT 1320
TTTCAATTTT AAATAATGCA GACTTAGAAT TTCCAACTGT GCAAAATGCT GAAGGCGAAA 1380
CGATTCAACT TTCTCATGGC GTTTATGGTC AGTTAATGGA AAGTGTCGAT CCATCTGTTC 1440
GTGAAGCAGC ATTTAAAGGT TTGTACAAAG TATACAAACA ATTTAGAAAT ACATTAGCCT 1500
CAACTTTAGG TGCACATGTT AAAACACATA ATTATAAAGC AAAAATTAGA AATTATGATT 1560
CTGCTCGAGC AGCCTCTTTA GCAAGTAATC ATATCCCTGA AAGTGTTCAC GAAACATTAG 1620
TAGCTGTAGT AAATAAACAT TTACCTTTGC TACATCGTTA TGTAAAATTA AGAAAAAAAT 1680
TATTAAACGT AGAAGAATTA CACATGTACG ACTTGTATGC GCCTTTGCTA GGTGAAGCAC 1740
CAATTCGTTA CAGCTATGAA GAAGCAAAAG AAAAAGCAAT TGAGGCTTTA AAACCACTAG 1800
GTGAAGACTA TTTATCTATT GTTAAAGAAG CTTTTTCAAG TCGCTGGATT GATGTGATTG 1860
AAAATCAAGG AAAACGAAGC GGCGCATATT CTTCAGGAGC TTATGATACA GCCCCATACA 1920
TTTTAATGAA TTGGCATGAT AGTTTGGATC AACTATTTAC ATTAGTCCAT GAGATGGGCC 1980
ATAGTGTTCA TAGTTACTAT ACAAGAAATA ATCAGCCGTA TGTTTATGGC GACTATTCAA 2040
TTTTCTTAGC TGAGATTGCT TCAACCACAA ATGAAAATAT TTTAACAGAA TATTTATTAC 2100
AAACAGAAAC AGATCCTAAA GTACGTGCGT ATGTCTTAAA TCACTATTTA GACGGCTTTA 2160
AGGGAACCAT TTTCCGTCAA ACGCAATTTG CGGAATTTGA ACATTTTATT CACACGGAAG 2220
ATGCTAAAGG CACGCCATTA ACAAGTGAAT ATTTGAGTGA GTATTATGGC GAGCTCAATG 2280
CTAAATATTA TGGACCAGAA GTAGTTAGAG ACGAAGAAAT CAGTTACGAA TGGGCGAGAA 2340
TTCCACATTT TTATTACAAT TACTATGTTT ATCAATATGC AACTGGCTTC TCAGCCGCTT 2400
CGGCATTGTC TAAACATATT TTAGCTGGAG AAGAGGGAGC TTTAGAGAAT TATCTTAACT 2460
ACTTGAAAGC AGGAAGTAGT GACTTCCCAA TTGAAGTGAT GAAAAAAGCG GGCGTGGATA 2520
TGACACAAGC CGCGTATATC GAAGATGCAA TGAAAGTTTT TGAAGAACGT TTAACGGAAT 2580
TAGAAGCTTT GGTTGAAAAA TTATAGACAA AAAAGTGGGA TATAAGTCCA GATGACTTAT 2640
ATCCCACTTT TTAACTCTTA AGATAATTGG TTAACAGCTT TCGTCGCAGC ATCTTTTTTA 2700
TAATAAATGT AAATGTCCAT TTGGAAAATG ATAATTGGAG CGACGTTTAA ATTCAAAGGT 2760
TGCGTCACCT kGGTTCAGCA ACGGTTCGTT ATGTGGATGA ACACGACAAA GTTGTTTAAT 2820
TTCATCCATG TTTTCAAAAT TTTCGACTAA AATGCCGAAA TCGTTCTCGC AAGTATAGTT 2880
GTA 2883

(2) INFORMATION FOR SEQ ID NO: 287:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3765 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:
CGTTAACCnA GGGTCTAGCT GTCTGnTAAA GCTAGAACCG AAGGACTTAG AGnCGGATTA 60 

CCCCACAAAG TCTTCCCACA AwTTTkGATA CTTTACgAGT kGAAACgCCT GATACATACc 120 

ATTyCCAAaC ATTGAAGCCc ATGAGGGCTT TTTCGTTTCc GTTGATAACG TTCcAAACAC 180 

TGTGGGTGAA AAATGGCCaT CACGTGTTCT GGGTACTTTT AATTCTAGCG TGCCTACACG 240 

TGTCGTAAAr TGCGCTCATA ATAGCCATTT CGTTGACTTT GTCGGTTTTC TGTTCGTTCA 300 

TATTCTTTTG CTTGAATATA TTCTGTTCGT TGATTTTCCA TTAGTTGATT AAATACCGTT 360 

GTTAAAATAT TTTTAGAAAC GTCATCCTTT ACAGAATATT CAATAATGCT TTGAATCTCT 420 

TCGCTTTTCA GTGTAAAATG TACTTGGGTC ATGTAAAAGT CCTCCTGGGT ATGTTTTtGT 480

CGTTAAAAAC ATtGTACCGT AAAAGGACTG TTATATGGCC TTTTTACTTT TACACAATTA 540 

TACGGACTTT ATCtATTTcA GCCTGTATCA TAGCTAAACA AATCGAGTTG TGTGTCCGTT 600 

TTTGGGCGTT CTGCTAGCTT GTTTAAAGTC TCTTGAATGA ATGTATGTTC TAAGTCAAAA 660 
AACTCTGACA GCGCCTTTAT ATAGCTTTCT TTTTCTTCTT TTTTTACTTT AATGATCGAT 720 
AGCAACAATG ATTTAACACT AGCAAGTTGA ATGCCACCAT TTCTTCCTGG TTTAATCTTA 780 
AAGAAAATTT CCTGATTCGC CTTCAGTACC TTCAGCAATT TATCTAATGT CCGCTCAGGA 840 

ATGCCTAGCG CTTCTCTAAT CTCTTTTTTA GTCGTCACCA AATAAGGCTT GTATACATCA 900
CTTTTTTCGC TAATATAAGC CATTAAATCT TCTTTCCATT CTGACAAATG AACACGTTGA 960 

CGTTCACTTC TTTTTTTCTT GAATTTAAAC CACCCTTGAC GGACAAATAA ATCTTTACTG 1020 

GTTAAATCAC TTGATACCCA AGCTTTGCAA AGAATGGTAA TGTATTCCCT ATTAGCCCCT 1080 

TGATAGTTTT CTGAATAGGC ACTTCTAACA AGTTTAATCA CTTCTTTTTC TTCTAAGGGT 1140 

TGATCTAATC GATTATTAAA CTCAAACATA TTATATTCGC ACGTTTCGAT TGAATAGCCT 1200 

GAACTAAAGT AGGCTAAAGA GAGGGTAAAC ATGACGTTAT TACGCCCTAT TAAACCCTTT 1260 

TCTCCTGAAA ATTTCGTTTC GTGCAATAAG AGATTAAACC AGGGTTCATC TACTTGTTTT 1320 

TTGCCTTCTG TACCGCTTAA AACCGTTAGA CTTGAACGAG TAAAGCCCTT ATTATCTGTT 1380 

TGTTTGAAAG ACCAATCTTG CCATTCTTTG AAAGAATAAC GGTAATTGGG ATCAAAAAAT 1440
TCTACATTGT CCGTTCTTGG TATaCGAGCA wTCCCAAAAT GATTgCACGT TAGATCAACT 1500
GGcAAAGACT TyCCAAAATA TTCTCGGATA TTTTGCgAGA TTATTTgGCT GcTTTGaCAG 1560
ATTtAAATTC TGATTTTGAA GTCACATAAC TGGCGTTTCT AAAACAAAAT ATGcTTGaTA 1620 
ACCtTTATCA GATTTGATAA TTAACGTAGG CATAAAACCt AAATCAATAG CTGTTGTTAA 1680 
AATAtCGCTT GCTGAAATAG TTTCTTTTGC CGTGTGAATA TCAAAATCAA TAAAGAAGGT 1740

ATTGATTTGT CTTAAATTGT TTTCAGAATG TCCTTTCGTG TATGAACGGT TTTCGTCTGC 1800 
ATACGTACCA TAACGATAAA CGTTTGGTGT CCAATGCGTA AATGTATCTT GATTTTCGTG 1860
AATCGCTTCT TCGGAAGTCA GAACAACGCC ACGTCCGCCA ATCATGCTTT TTTTTGAGCG 1920 




ATACGCAAAA ATAGCCCCTT TGCTTTTACC TGGCTTGGTA GTGATTGAGC GAATTTTACT 1980 

ATTTTTAAAT TTGTACTTTA ACAAGCCGTC ATGAAGCACA GTTTCTACAA CAAAAGGGAT 2040 

ATTCATTCAG CTGTTCTCCT TTCTtACGAA AATTAATTAG TTAGAAGCTA CGATCAAAGT 2100 

TGAATCACAA CAAAAAAGGC AATCAACTAA GTTTTTCTTA ATTGATTGCC TGGTATCTTC 2160 

TTAAAGACTT GAAATTCOCT CAAAAACCCG ATATAATGGG TTTACAGATA TTTAAGTATC 2220 

TGATTAATAA AGTAATTAAA TACTTTACCA AATTTCGGGT CTCGACTTCT TTAATTGATT 2280 

GGTGGTAATC AATTAAGGCT CGCAACTTAT TTTCTtGGCG GAAAATAAGT TGGTCGTGGC 2340 

TCTTTTTTTG TATTCTTTAT TCAGTTCGTT GTTTCGTTAT ATCTAGTATA CCGCTTTTAA 2400

AAAAAATAAG CAACAATTTC GTTAAATATA TTAACGACAA TCATTGCTTC TCTTCTTCTA 2460

TTTGCAACTC TACTTGTTCT AATTCATTAG GAATTAAGTC AATAACTAAG GTTGAGTTTG 2520

TCAATTCTGC AATTCGTTCC AATGTTTTTA ATGTCGGATC TGATTTTCCT GATTCAATCA 2580 

ATGAATAATT TGGTTCTGTT GCAAAGTTTT AAATCTACTA TCAAATAAGG TAGAATAATA 2640 

GAAAAAGATA GCAGGAGGAA TGACGATGAA TCATTTTAAA GGAAAGCAAT TTCAGCAGGA 2700 

TGTGATTATT GTAGCCGTGG GCTACTATCT TCGTTATAAC CTTAGCTATC GTGAAGTTCA 2760

AGAAATCTTA TATGATCGTG GCATTAACGT TTCTCATACG ACGATTTATC GTTGGGTGCA 2820 

AGAATATGGC AAACTACTCT ATCAAATTTG GAAAAAGAAA AATAAAAAAT CCTTTTATTC 2880 

ATGGAAAATG GATGAAACGT ACATCAAAAT TAAAGGAAAA TGGCATTATT TGTATCGAGC 2940 

CATCGATGCA GATGGTTTAA CCTTGGATAT TTGGTTACGT AAAAAACGGG ACACACAAGC 3000 

AGCCTATGCT TTTCTTAAGC GGTTAGTGAA GCAGTTTGAT GAACCGAAGT TGTtAGTCAC 3060 

AGATAAAGCC CCCTCTATTA CAAGTGCCTT TAAGAAACTA AAAGAATACG GCTTTTATCA 3120 

AGGGACAGAA CATCGTACCA TTAAATACCT GAATAATTTG ATTGAACAAG ACCATCGTCC 3180 

AGTAAAGAGA CGCAATAAAT TCTATCGAAG TTTACGCACT GCCTCTACCA CGATTAAAGG 3240 

CATGGAAGCC ATTCGAGGAT TATATAAGAr AACCCGAAAA GAAGGCACTC TCTTCGGGTT 3300 

TTCGGTCTGT ACTGAAATCA AGGTATTATT GGGAATCCCA GCTTAAATCA TAGATACCGT 3360 

AAGGGATTTT ATTCTTTATT TAAAACTTTG CAACAGAACC ATTGTGTAAA AGTAAAAAGG 3420 

CCATATAACA GTCCTTTtAC GGTACAATGT TTTTAACGAC AAAAACATAC CCAGGAGGAC 3480

TTTTACATGA CCCAAGTACA TTTTACACTG AAAAGCGAAG AGATTCAAAG CATTATTGAA 3540 

TATTCTGTAA AGGATGACGT TTCTAAAAAT ATTTTAACAA CGGTATTTAA TCAACTAATG 3600 

GAAAATCAAC GAACAGAATA TATTCAAGCA AAAGAATATG AACGAACAGA AAACCGACAA 3660 

AGTCAACGAA ATGGCTATTA TGAGCGCAGC TTTACGACAC GTGTAGGCAC GCTAGAATTA 3720 

AAAGTACCCA GAACACGTGA TGGCCATTTT TCACCCACAG TGTTT 3765 
(2) INFORMATION FOR SEQ ID NO: 288: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7043 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

nnATAAATGT TGAATAAAGT TTGATAnTGT TTTATTATGT TTTCGAGAAG GGGATTTGAG 60 

TTTAAAGAAA GGGACAGAAT TTTAATAGTA GATACACGAG GTAAGGAGGA AAACAAAATG 120 

CGTGTCATAC TTGGTTCAGA TTTAGATGGA ATAAAATTAA AAGCAGAAAT GAAACAATAC 180 

TTATTGCAAG AAAAAGTTGA GGTAATCGAT AAAAGTGAAA GTGCTTCGGA AGACTTTATA 240

GAAGCAACGC TTGCTGTTGC TCATGAAGTT TTAAAGGATA CTGAAAGTTT AGGAATCGTT 300 

TTTGATGGTT ATGGGGCAGG TAGTTTTATG ACTGCTGCTA AAATTAAAGG AATGATTGTC 360 

GCTGAGTTGT CAGATGAGCG TTCTGCATAT ATGGCGCGAG AACACAACAA TGCTCGGATG 420 

ATCACGGTGG GCGCAAAAAT TGTTGGAACG GAACTAGCCA AAAACATAAT AAAAGAATTC 480

TTAACAGGTC ATTATGCAGG CGGGCGTCAT CAAATACGTG TAGATATGTT AAATAAAaTG 540

GCTTAaTAAA TAATAGAAAG GAAGCGTTTT TATGAAAATT GCCATTGGTT GTGATCATAT 600 

TGTGACGGAT ACCAAAATAG CAGTTTCTGa TTTTTTGAAA GCAAAGGGAT ATGAAATTTT 660 

AGATGTAGGC ACGTATGATT TTACTCGAAC TCACTATCCT ATTTTCGGAA AAAAAGTGGG 720

AGAAGCCGTA ATGAGCAATC AGGCTGATTT GGGTATATGT ATTTGCGGAA CAGGCGTTGG 780

aATTACCAAT GCGGtTAACA AAGTGCCGGG TATCCGTtCT GCTTTAGTTC GCGATATGAC 840

TACAGCGTTG TATGCGAAAG AGGAATTGrA CGCAAATGTT ATTGGTTTCG GTGGGAAAAT 900

AACAGGAGAA TTCTTGATCT GTGATATCGC TGCTGCCTTT ATTGAAGCAC ACTATCATCC 960

AACAGAAGAC AACGAACAAC TAATTGAAAA AATTAATCAA GTTGAAAAAC AACATCCAGA 1020 

ACAACAAGAT GCGCACTTTT TCGATGAATT TTTAGATAAA TGGCAACGAG GAGTCTATCA 1080

CGATTAGTAT TTAATCAATG AAGGTTACTT AAGAAAGTGG TCTTAAATAA TAATCATTGG 1140

TATTTGAGTT AAATTCCCCT TCATTAATTT TTATTGCTGT TTTAATGTGG CACTGTAAAG 1200

AGCATTATTA AGAAAAAGAA GTAACAAAGG AACAAATTGA AGGCTATACA GGTAAsCATA 1260

AAAAGTACAC GAGTTCTTTT CTCGTGTACT TTTTATGGTT TCGCTGCTTG TTTTATTTTA 1320

GTAATTATTT TTCTATAAAA ATAGTCCCAG AATTATTAAA ATAATCCCGA TAACTAAACT 1380

GCCGATGCCA GAAGCCGTAC CATATTTTTT ACCTAAATTT ATATCAGTAG TATTTTTAGG 1440

AATCAAGTTT GAGAAAAAAT AAAATTTTTT TACCAAGATT AGATATGCAA CAAGTAAAAG 1500

TAATAATCCT AAGAGAaGAT AACTAAGaTT TTCAGCTAAT AAATCtrGAT GAGCTAATAA 1560

CaATATCCCC AAAAAGAGGC CTACCAATCC ATATTTCATC GTTTTTTCTT TTGCTGACAT 1620

ATAAACATCC CTACTTTACT TTTTAATTTA TTAAGAGCAT AAAACCATAG ATTCGATATA 1680

AGTTCATTTT GTTGTATCCA AAAGAGCATG ACCTGGAATG ACCTATATAA TTAGAAAATT 1740

GGAGTGATAT ATAGTTATTT TACCATAGCT TAAATGAGTT GATTAACTCC GAATAATGTA 1800

CAACTATGTA CACCTATTGT GGGTGGAATG TGTGTTGGGA CTTTGCTAAC CTAATAGTTG 1860

AAAAAATGAA TCAATAAATT TAGAAAAGGG ACTATTCAAA AGAAACTTTT TTGGGAATTG 1920

CTATATTATT TATTTTTTTA AGCGTAATCA ACTAAATGTT TTTATGGATA AAGAAAGTCA 1980

TTGTTCCTTG TCATCAAGCT ACAACAGCCA AATTCATTAG AATTTTTTTA GTGAATGACT 2040

TTAATTTTAG AAAGGTACTG TTGTTAAACA TTCATGCTAT CAAAGGAGTG TGAGGCGAAC 2100

GAAGCAAGTA TTTTCaTTTT ATAAAAGTTG TATTATGGAC TCTTAAAAAG AGGTGATTTA 2160

AAAAATTTTG AAGTGGAAAT GTTAAGAAAG GAGAGAGGAA ATGACTTATT TAGTTCTGGT 2220

CAGTCATGGC GATTTTGCTA GAGGGTTAAA AAATTCATTA GGGATGTTTG CTGGCGATGA 2280

GATAGAGAAA GTCAGAGTGT TTGGCTTACA ACCTGGTGAA TCTACCGATA ATTTTGGTAA 2340

GCGGTTTCTT TGTGAAGTTG AAAAATTACC GCAgGAAACG AGGCTAATCG TTTTGGCTGA 2400

TATTGTAGGT GGGAGTCCAC TTGCTACTAT TTGTAAAGTA TTAGATAAGC AAAAAATGCT 2460

AGAAAATTCT CTGATTGTTG GAGGAGTGAA TTTTCCAATG GCTTTAAATG CGCTTCTATT 2520

GAAAGAAACT AGTGAATTAT CTGATTTGAA AAAAATGATA GTAGCAGATG CTTGTCATGG 2580

CATTAGAGAG TTAACTATAA CTGAAGGCGA TGTAACTGAA GAGGATATCT AAAGGAGAAG 2640

ATGAACATGT CAATTTCTTT TGTTCGAATC GATGATCGAA TGATTCATGG CTTAATTACT 2700

TTACGTTGGA CTAAAGAAAT GCCATGTGAT GGCATTATAG CTGTGAATGA CAAAGCAGCA 2760

AGTAACCCCA TTTTAAAGGA AGCATATAAA GCAGCTGCAC AAGATAAAAA AACCTTTATT 2820

TGGACAATGG CGCATTTTTT TGATGTAAAG GATAAAGTAT TAGCGTCTAA GTCAAAATAC 2880

TTTTTGATTA CAAAGAGCCC TTTGGATATG AAAAAAATTT TGGTAGATTG GCATTTTGTT 2940

CCAAGTGAAG TAAAAATGAT TAATGTTGGA CCGGGGAATG ATCGCGAAGG GACAATTAAA 3000

TTAGGGGATA ACCAATCCTT TACAGCAGAA GAAGCAGCGG CTTTTGAAGA AATTGAAAAG 3060

GCTGGTTACA AAGTGGATTT TGCCTTGTTA CCTGATCAAC GAATTGGTTC TTGGGCGCAT 3120

TTTAAAGCTA AATTTGGTTA CTGATTCTAG AAGTTTTGAA AAGAGGAGAG TAGTATTATG 3180

ACAATCAACT GGATACAAGC GGCTATTTTA GGCCTATTTG CTTGTCTGTG TTCAAATTCT 3240

TGTATGGCTG GACAAGCGGT GGGAAATTAT ACGATTGGGC GTCCTTTAGT CGGCGGATTA 3300

GTTTGTGGCA ttATTTTAGG AGATTTAAAA TTAGGAATCG CTTGTGGCGT ACAATGCAAT 3360

TAGTTTACAT TGCTTTGGTC ACACCTGGAG GCACAGTGTC TGCTGATGTG CGGGCAATTT 3420

CATATATCGG GATTCCTTTG GCGATGGTGG CGATTTCGAC CAAAGGGTTA GATCCACTTG 3480

GCGGCTCTGC AGCCGACCTG GCTAAATCGG TCGGCACATT AGTTGGAACG ATCGGAACGG 3540

TATTATTTTA TTCTGTTGCC GCGTTAAATT TAGTTTGGCA ATCGTTTGGT TGGAAAGATA 3600

TAAAAAAAGG CAAGCTTGAT CATTTATATG CCATTAATTT TGGTTGGCCG TGGATTTCTC 3660

ATCTACTTTT TTCCTTTTTG CCAACCGTGA TTTTAACCTA TTTTGGGGCT ACGGCTGTTA 3720

CAGCAATGAG AGATGCATTA CCTATGGATG GTCTTGCAAT GAAAACGCTT TTTACTGTAG 3780

GTGGCATGCT TCCGTGTGTG GGAATTGCAA TTTTGTTACG TCAAATTGTT AATAAGAACA 3840

TTGATTTTGT ACCTTTTTTT GTAGGGTTCA CATTAGCTGC TTCATTAAAA CTCAATTTAG 3900 

TCTCAATAAc GATTATTGCg CTTCTTTTTG CTGTAATTTT TTAwAAAATT GCTTTGATTA 3960 

AAAACAATAC TAAACTGGTT GATTCAACAG GAAgTCTAAC GAATGATGAC GAAGAGGAGG 4020

ACATATGAGA TGGAAGAGTT GAAAGCAAAT AAATTAGACC AAAAAACGTT AAAGCATTCT 4080

TTTCATTTAT GGTTTTGGGG AGCATTAACT TGTTTTTCAC AACAGCATAT GCAAACCTTC 4140

GGCTATCTTT CTTCAATGCT TCCTATtATC AAAAAACTTT ATCCAAAACA TGAAGACCAA 4200

GTGAAAGCCA TTCAAGCATA TACAGCTTTT TTTAACACAA ATCCAATGCT TGGAACGGTC 4260

ATTGTTGGTG TGACTGCTAG TATGGAAGAG GCTAGAGCAA GTAAGAATGA AATTGATGGA 4320

GAAACCATTA ATGATATGAG GGCAGGATTA ATGGGACCGA TTGCAGGGAT TGGCGATTCA 4380

TTAGTTGATG GTACCTTGAT TCCTATCTTG TTAGGAATTT CCTTAGGAAT GTCCACGGGC 4440

GGTTCGCCAA TTGGTGCCAT ATTTTATATT ATTGTTTGGA CGTTAATTTC CTATTTTGGC 4500

CAACGCTTTT TGTATTTTAG AGGCTATCAT TTTGGTGATA AAGCAGTAAG TTTCTTAGTT 4560

GGGAAAGAAG GAGCCGCTGT TAGAGTTGCT ATCGGAGTTA TTGGTAGTAT GGTTGTCGGT 4620

GGGGTCCTTG CATCTTGGGT AAATGTAACT ACTTCTCTTA AATTAATAGG TGCTGACGGA 4680

AAAGTTTTCC TTAAACTGCA AGATAAAATT GACAGTATTT ATCCAGGATT ATTAACGATA 4740

CTTGTTACCT TGTTTTGTTG GTGGTTAATG TCCAAAAAAC ATGTTTCCGC TATTTGGACG 4800

ATGCTTATTT TAGTTGTTAT TTCTCTCGTT GGCGTTTTAT TAGGTGTTTT CAATCCAGGA 4860

TTAACTTACT GAACAATCAC CAAAAAGGAC TCTAGCTGTT AGAGCAAACT GACTTTTGCA 4920

TAAAACCGAT AACGAAAAAG CTAATAAATT AAATTCTTAC TCTCTAAGAC ACAGAAAAAA 4980

TCTTGTGATT AAAATAAAAA TATATAAAAT AGAAGAAATT TAGAGACTAG CTTCTATTTT 5040 

ATATATTTTT TTGTTTTAGA AATTTAGTTG GTTTATTAGA ATTTTAGGAA AAATTTTACG 5100 

AAAAGTGTCG CAAAAATGGC TCCTAAAAAC GGTGCAGTTC CGGGAACAAT CAACCCATAT 5160 

TGCCAATCAT TATTGGTTTT ATTTTTGATT GGTAAAAGTT GATAGGCGAT ACGAGGACCT 5220 

AAATCACGTG CTTGATTCAT TGCGAATCCT GTAGTACCAC CTAGCCCCAT GCCGATTGCC 5280 
CAAACAAGCA AGCCAACAGC AATGGGTAGT TGTGTTTCGT AGGAATGTGC AATCGCTAAA 5340

ATAGATGTTA AAAAAATGAA TGTTGCAAAA GTTTCAACAA AATAGTTTCT TGGTAAATTT 5400

CGTTGGTTCG GATTTGTAGA AAAAATATTA CGAATGGCGA TCGGATCGAC ACTATCGGCG 5460

GATAGCTTGA AATGATCAGC ATACATAATA TAAACAATGA CGGCGCCACA AATGCCGCCT 5520 

AACATTTCAG CGATGACATA GGGAATGAAG TAAGACCATG GAATCATTCC TAAAATAGCT 5580

TGAGCGAGGG CCATTGCTGG ATTAATACAG ACTCCGCCAA AGACGAACAG AACAACAGAG 5640

ATTCCAAAAG CCCAAGTAGT GATTGCAAAC ATGTGACCAG AACCAGCGTA TTTTGTTCGT 5700

TTTAGGACAT CGTCGCAGTG AACACCTACC CCAAATACGA TCATTAATGC AGTTCCCATA 5760

AATTCTGATA AGATATGATG AAACATAAGA TAAAGCTCCT TTAATCAATA GTTTTGTTAT 5820

TTAATGTTTG GACGAGGCGA TAGACTTCCT CGATTGTAAA ATTATATTGT TTAGCAATTC 5880

GTTCACAATC TTGGTATTCA ATAGACTCTT TCAATAGGCC ATGATAATGA TTTTTTTTAA 5940

TATGGACCTT ACCAAAAGGT GTTTCCAAGA TTTTGAAAGA ACGGTCCATC ACACTACGTT 6000

CCATTGTTTG AAAACGCATA CCAATTGTAG AAGTATGTTT AAAGAGTAGT TCTGTAAAAT 6060

ATTCTTTTTC CTCTGTTGTT GTTAAAACCG TTAATAGAAT GGCACTTCGG TTTTTTTTCA 6120

TATGAATGGG CGTGAAAAAA ACATCTAGAG CACCATGCTC TAATAATAGA TCCATTACGT 6180

AACCTAACTG TTCaGGTGTT TGaTTGTCTA tATTTGTTTC aATTTTTAAg ACTTGGTCaT 6240

GGTGTTCTGT CCTCTTTTTT TTAcTATGTG TTTCAGTAAA GATTGATCCT CTGAGGGCAT 6300

TGAACTTGCC TGtTTCCCGT TTGCCaAATC CGTAACCTAT TTTTTCAATT AATCGTTGTT 6360

CTGGTGCGAC AAATAGAGGA CTAAGTTCTT TAAAAATAGC TAAGCCAGTC GGTGTGACAA 6420

GTTCAGTTTT TATCTCAAAG TCTTGTTGAA TGATTAAATT CGTTTCTTTA CGTAATTCCA 6480

TCACAGCGGG AACTGGTATT GGCATGACTC CATGGGCAAT TGTGATAGTG CCACTTCCAT 6540

CTGTAATTGG CGTTGAATAG ACTTGTTCAA TCTCTAACTG TTCCCATAGG ATAAAAAAGC 6600

CAACGATGTC AACAATTGAA TCAATTGCTC CAACTTCATG AAAATGAATA GTGTCAATTG 6660

TTTGTTGATG AACAGCAGCT TCAGCTTTTG CAATGTCATA GAAAACATTT TTACTATGTT 6720

TTTTTACAAA ATTGGACAAG TCGCTTTTTT CAATTAAGGA ACAAACAAAA CGATCATTCA 6780

CGAAAAGAAA AATAATCTGA GTTTTGCACC AAACACAAGG TTTGATCTGC CTATTTTACT 6840

GAAAGATTCG TTTAAACCTG GAGACTATAC CTATACCATT AAGTTGAAAA ACCATGAGGG 6900

GAGCTGGACG TTCTCAAAAG ATTTCACGAT AAACAAAAAG CAAGCTGATC AGTATAATAA 6960

GCGTTCTGTT GATCATAAAG TCAAAAACCT TCACTGGCTT AAGTACCTCA TgGnCTTTTT 7020

AATTCTAGTT ATTTTAGGTA CCG 7043


(2) INFORMATION FOR SEQ ID NO: 289:









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

TAATTATTAT CTCATTTGTA GATTAGTAAA AGGATACCGA AAnACATTTA GnAAAGCAAC 60

CACTTATTTT CAGAAAGTTA GGAAATCATG AAAATATTTG kTCyTCaTTT tCCtTAAAAT 120 

TTTGtTAAAC TGtAAAGaTA TGAGTCsTAA CGAAgGAGTT AAATAGTATG ACAGAATCCA 180 

AAACTGCCAT TATTTTTGAT ATGGACGGTG TCCTAGTAGA TAGCGAAgCT TATTATTATG 240

AGCGACGTAA AGCATTTTTa GCGGAATTTG ATTTGACCAT TGAGGGCCTA ACCTTGCCGG 300

AATTAGTCGG TGCGGATATG CGTTCATTAT GGCAAAAGAT TGAACAAGTC AACAAAAAAG 360

AGCTAGATAT TGCTTTTTTA AATGAACAAT ACATAGCCTA TAAAAAAGCG CATCCGATAG 420

ATTATTTAGC TGTGCTTGAT GAAAATGCGA AACGTGTTTT ACAATTTTTA AAAAGACAAG 480 

GCTATAAAAT CGGATTAGCT TCTTCTTCCA CAAAAGACGC AATTGAAGAG GTTTTGACAG 540

TGGGGCAATT AAGTAGTTAT TTTGATGCGG TCGTGAGTGG GGAAGATTTC GAAGAAAGTA 600

AACCGGCGCC GGATATTTAT CTGCACACGT TACAAGAATT AGCTGTTGCG CCACAGGAAT 660

GTATAGCCAT TGAAGATTCA GAAAAAGGCA TTGCCTCTGC CAAAGAAGCT GGGcTAGAAG 720 

TTtGGGcTAT GCGCGATGAA CACTTTGGGa TGGATCAAAG TCAAGCGGAT GCCTTCTTAA 780 

cACmACTAAg TGATATkTGT AAAAAAaTTA GTGAAAA 817 
(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10397 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

ATGAAGAAAT ATATCTTCAC AGAAAGAAAC GGTATCTACA TCATTGACTT ACAAAAAACA 60 

GTGAAATTAG TAGATGCAGC TTACGATTAC ATGAAAAACG TTGCAGAAGA AGGCGGCGTT 120 

GCTTTATTCG TAGGAACTAA AAAACAAGCA CAAGAAGCAA TCAAAGATGA AGCAATCCGC 180 

GCTGGTCAAT ACTATGTAAA CCATCGTTGG TTAGGTGGTA CGTTAACTAA CTGGGATACA 240

ATCCAAAAAC GTATTGCTCG CTTGAAAAAA ATCAATGCAA TGGAAGAAGA CGGCACTTTC 300

GAAGTTTTAC CTAAGAAAGA AGTTGCTGGC TTAAACAAAG AACGTGAACG TTTAGAAAAA 360

TTCTTAGGTG GTATCGCTGA TATGCCAAGA ATTCCAGATG TAATGTACAT CGTTGACCCG 420

CGTAAAGAAC GCATTGCTGT TCAAGAAGCA CACAAATTAA ACATTCCTAT CGTTGCAATG 480

GTTGATACAA ACTGTGATCC TGATGAGATC GATGTTGTAA TCCCTTCAAA TGATGATGCA 540

ATCCGCGCTG TGAAATTAAT CACTGCTAAA AtGGCTGACG CTTTCaTCGA AGGTAACCAA 600

GGGGAAGACC AAGCAACTGA AGAGTTATTC GTTGAAGAAA CACCAGAAGC AACTTCAATC 660

GAAGAAATCG TTGATGTTGT TGAAGGAAAC AACGAATCAG CTGAATAATA ATGATTACGG 720

AGCTGTCTCA AAGGCTAGGC GACAAAAAGC CTGATTCTCC TTTGAGATAG CTTTTTTAAA 780

AAGAATTGAT AATATCCGAG GAGGACACAA ATAATGGCAG ACGTTACAGC TAAAATGGTA 840

AAAGAATTAC GCGATATGAC TGGCGTTGGA ATGATGGACG CTAAAAAAGC ATTAGTAAAA 900

GTAGAAGGCG ACATGGAAAA AGCCGTAGAC TTTTTACGTG AAAATGGTAT GGCTAAAGCA 960

GCGAAGAAAA ATGACCGCAT TGCGGCTGAA GGTTTAGCAA ACGTAGCAAC TGTGGGTAAC 1020 

GTTGCAGCAA TCGTTGAAGT AAACTCAGAA ACTGACTTCG TTTCTAAAAA CGAAATGTTC 1080

CAAGATTTAG TTAAAGACAT TGCGACTAAA GTTGCTGAAA ACAAACCAGC TACGATGGAA 1140

GAAGCAATGG CTATTAAAAC TGAAAAAGGC ACAATTGAAT CTGATTTAAT CGAAGCAACA 1200

ACTGTTATCG GTGAAAAAAT CAGCTTCCGT CGTTTTGAAG TTGTCGAAAA AGCTGACAAT 1260

GCTGCATTTG GCGCTTACTT ACACATGGGT GGCCGTATTG CTGTATTAAC AGTTATCGAT 1320

GGCACAACTG ACGAAGAAGT AGCGAAAGAT GTTGCTATGC ACATCGCGGC AATCAACCCT 1380

CGCTATGTCA ACGAATCTCA AATTCCTCAA GAAGAATTAG AACACGAAAA AGCTGTGTTA 1440

ACTGAACAAG CATTAAACGA AGGTAAACCA GCAAATATCG TTGAAAAAAT GGTTGTTGGT 1500

CGTCTACAAA AATTCAAAGC AGAAATCGCT TTGGTTGACC AACCATTTGT TAAAGATCCA 1560

GATATGACTG TCGAAAAATT TGTTGCTTCT AAAGGCGGCG AAGTGAAATC ATTCGTACGT 1620

TTTGAAGTTG GCGAAGGTAT CGAAAAACGT GAAGATAACT TTGCGGATGA AGTAATGAGC 1680

CAAATGAAAA ACTAAGAGCA TCAAACGCTC GGTTGAAAAG CCGTGAATCA TAAGTATCTA 1740

AACAGACAGC GATCCCCAAA GGAAGCTGCA TAAATTGATT ACTGCTTGAT TCAAGGCAGT 1800

CTTTCAGCCA TTTGGGAACG CATAACTTTG ATATGCGCTC CCTTTTTTTA GGCAAAACCA 1860

GTGGATCGCA CTGCTTGGTC TAAAAAATCA TTTTGGTGAA AAGTCATCTA AATGAGGAAT 1920

TGAATGATTT TTAGAGAATC TTGGAGGGAA TTATGATGGT TAAACCTAAG TATCAACGTG 1980

TCGTATTAAA GTTAAGTGGA GAAGCGCTAg CCGGAGAAGA CGGTTTTGGC ATTAAACCAC 2040

CAGTCATTAA AGAAATCGTC CAAGAAATTA AAGAGGTCCA TGAATTAGGT ATCGAAATGG 2100

CCATCGTGGT TGGCGGCGGA AACATTTGGC GTGGACAAAT TGGTGCTCAA ATGGGCATGG 2160

AACGTGCACA AGCTGATTAT ATGGGCATGT TAGCTACTGT GATGAATGCC TTAGCATTGC 2220

AAGATACATT AGAAAATCTT GGTGTGCCTA CACGTGTTCA AACATCAATT GAAATGCGTC 2280

AAATTGCAGA ACCATACATT CGTCGTCGTG CAGAACGTCA TTTAGAAAAA GGCCGCGTTG 2340

TTATTTTTGC CGGAGGAACA GGAAATCCTT ACTTTTCGAC AGATACAACG GCTGCGTTAC 2400

GAGCTGCAGA AGTAGATGCG GACGTCATCT TAATGGCGAA GAACAATGTT GATGGTGTTT 2460

ATTCTGCGGA TCCACGTGTG GATGAAACAG CGACAAAATT TGAAGAACTA ACACATTTAG 2520

ATGTTATTTC TAAAGGGCTG CAAGTAATGG ATTCAACAGC AAGCTCATTA AGTATGGATA 2580

ACGATATTCC TTTAGTTGTC TTTAACCTAA ATGAAGCAGG AAATATCCGT CGTGCCATCC 2640

TTGGTGAAAA TATCGGAACA ACTGTAAGGG GGAAATAAAG AATGTCTAAA GAAGTTTTAG 2700

CAACAGCCAA AGAAAAAATG ACAAAAGCGG AAGAAAGCTT ACGCCGTGAA TTAGGTCAAA 2760

TTCGTGCTGG TCGTGCGAAC GCAAGTCTAT TAGACCGTAT TCAAGTAGAA TATTATGGTG 2820

CACCAACACC AGTTAACCAA TTAGCAAGTA TCAAfATCCC AGAAGCACGT GTATTAATGA 2880

TTACACCGTT TGACAAAAAT TCTATCGCTG ACATTGAAAA AGCTATTCAA ATGAGTGATA 2940

TCGGCATTAG CCCAACCAAT GATGGAAATG TCATCCGTTT AGTGATTCCA CAATTAACGG 3000

AAGAACGTCG GAAAGAATTA GCCAAAGACG TGAAAAAAGA AGCGGAGAAC TCAAAAGTTG 3060

CTGTCCGTAA CGTTCGTCGT GATGCAATGG ATGAGTTGAA AAAAGCTCAA AAAAATGGCG 3120

ACATCACTGA AGATGAATTA CGTTCATTTG AAAAAGATGT TCAAAAATTA ACAGACGATA 3180

GCATCAAAAA TATCGATGCG ATTACTGCTG AAAAAGAACA AGAACTTTTA GAAGTATAAA 3240

ACCAATCACT AATAAAAATT ATTGAATCCA GAGAAGCAAA TGCTCTGGAT TTTTTTGTAT 3300 

AGATGAGAAA GGCGTATCTC AACGCTCTCT TTGCAAAAGT CATTCTAAAG TAATTATTTA 3360 

AGAAAACGCT TTCTTTAATA AATTTTTTTA TTCTACCAGA ATTTTTTTCT TTGTGTAGGG 3420 

AGGATCACTT AAAAATATAG AGAAACTTTG AAATATCAGC CTTTTAAAAT TTTTCTTCAC 3480 

AAATTGAATC CTAAAATTTG ATGATAAGAG TTGTTATTTT TAGCAATTGT AATTTGCGAT 3540

TCATTCTCAA TTAGGCTAAA ATGAGTGAGA ACGGAAAAAC GATTAAAACA AGTTGGAGGG 3600 

AATTGTACAT GTCTGTGTTA GAAATCAAAA ATTTACACGT ATCAATTGAA GATAAAAAGA 3660 

TTCTAAAAGG AGTCGATTTA ACAATTAATA CTGGAGAAAT CCATGCAATT ATGGGACCTA 3720 

ATGGAACTGG TAAATCTACT TTATCAGCAG CAATTATGGG AAATCCTAAC TACGAAGTCA 3780

CAGAAGGAGA AATTCTATTT GACGGCGTCA ACGTTTTAGA TTTAGAGGTA GACGAGCGTG 3840

CTCGCTTAGG TTTATTCTTA GCAATGCAAT ACCCAAGTGA AATTCCAGGA ATTACCAATG 3900 

CTGAATTCAT GCGTGCAGCA ATCAATGCAA AACGTGACGA AGACAACAAA ATGTCAGTTA 3960 

TGCAGTTCTT AAAAAAATTA GATAGCAAAA TGGAATTATT AAATATGCCT GAAGAAATGG 4020 

CTGAGCGTTA CTTAAACGAA GGTTTTTCTG GCGGCGAGAA AAAACGAAAT GAAATTTTAC 4080 

AATTATTGAT GTTAGAGCCA ACGTTTGCCA TTTTAGATGA AATTGATTCA GGCTTAGATA 4140 

TCGATGCGTT AAAAGTAGTT TCTAAAGGGG TTAATGAAAT GCGCGGCGAG AACTTTGGTG 4200

CGTTAATTAT TACCCATTAT CAACGCTTAT TGAACTATAT CACGCCAGAT GTTGTTCATA 4260

TCATGATGGA AGGCCGCGTT GTGAAAACCG GTGGGGCTGA TTTGGCAAAA CGTTTAGAAA 4320

CAGAAGGCTA TGCTGGTATC AGCCAAGAAT TAGGTATTGA TTACAAAGAA GAAGCGTAAG 4380

GAGGACAAGA TAAATGAAAA AAGTAAATCT AATGGATTAT CTTGACGAAg TCACCGcTTT 4440

TTCACTAGAA CATGAAGAAC CAGCTTGGAT GACTGAATTG CGAACAACGG CTTTAAGAAA 4500

CGCTGACGAA AGTGAGTTAC CACATATTGA TCGTGTGAAA TTTCATCGTT GGCCTTTATT 4560

AAATGTTCAC ATGGAAAGTT ATGTGCCTTC TGAAGGAAAT GTCGCAAGTT TTGATCAAAT 4620

GAAAGACAAT CCTTTAATTG TTCAACAAGG ATCTTTTCAT GCTTTTGAGC AATTGCCAGC 4680

CTCATTAGCG GAACAAGGGG TAATTTTTAC AGACATTTTT ACTGCTTTAC AAGAGCACCC 4740

AGAATTAGTT AAAGAATACT ATATGACTAA AGCTGTATTA CCAGAAGAAG ATAAGCTGAC 4800

AGCAGCTCAT GCAGCTTTCA TGAATAGTGG CGTTTTCTTA TACGTTCCTA AAAACGTAGT 4860

GATTGAAGAG CCAATTGAAT CTTTGTTTAT TCAGGATTCA GATAGCAACC AACCTTTCTT 4920

TAAACATGTT TTAATCGTTG CAGATAATCA TAGTGAATTC AGTTATTTAG AACGTTTTCA 4980

ATCAACTGGT CATCACGCAG AGAAAGCTTC TGGAAATATC GTTGTGGAAG TGATTGCGAA 5040

AGATGGCGCA AAAGTTAAAT ACTCAGCCGT GGATCAATTA GGGCAAACTG TAACGACTTA 5100

TATGAATCGT CGTGGCTATA TTATGCGGGA TGCCTCTGTT GATTGGGCTC TAGGCGTCAT 5160

GAATGATGGC GATGTTGTGG CTGATTTTGA TTCTGATTTA GTTGGCGAAG GCGCTCATTC 5220 

AGAAGTTAAA GTTGTTGCAA TTAGTGCAGG TAAACAAACA CAAGGCATCG ACACACGAGT 5280 

GACAAACAAA GCGCCACATT CAATTGGtCA TATTTTACAA CATGGAGTTA TTCGTGAACG 5340

TGGTACTTTA ACGTTTAACG GAATTGGCCA CATTTTAAAA GGAGCAAAAG GAGCCGACGC 5400 

ACAGCAAGAA AGTCGTGTTT TAATGCTGTC TGATAAAGCT CGTGGTGATG CCAATCCAAT 5460

TCTTTTAATT GATGAAAATG AAGTGACTGC CGGACACGCA GCCAGCGTTG GTCGAGTAGA 5520 

TCCAGAAGAA ATGTATTATT TAATGAGCCG CGGCTTGCGT AAAGAAGAAG CGGAACGTTT 5580 

GGTGATTCGT GGCTTCCTAG GTTCTGTCAT TACAGCAATT CCAGTTAAAG AAGTACAAAA 5640 

TGAATTTGTA GAAGTAATTG AAGGGAAGTT AAACGCATGA TGGATGCAGC AACCATTCGT 5700

CAATCGTTTC CTATTTTATT TCAAGAAGTT AATGATGAAC CGCTAGTTTA TTTAGACAAT 5760 

GCTGCGACAA CCCAAAAACC AACGGCAGTG TTAGATGTTT TACGACATTA TTATGAAACC 5820 

GATAATGCCA ATGTTCATCG TGGGGTGCAC ACGTTAGCCG AACGAGCAAC GAAGGACTAT 5880 

GAAGCCTCCC GCGAAAAGGT TCGCCAATTT ATCCATGCCA AAGAAACAGC AGAAGTTTTA 5940 

TTTACACGAG GAACAACAAC CAGTTTGAAT TGGATTGCCA AAAGTTATGG TGACTTGGCA 6000 

GTTACAGCCG GGGATGAAAT CGTCATATCT TACATGGAAC ATCATTCAAA CATTATTCCC 6060 

TGGCAACAAT TAGCTCAACG TACAGGGGCC ATTTTGAAGT ATATAGACGT CACTGAAGAC 6120 

GGCTTTTTAG ATATGGCAAG TGCACGTCAA CAAATTACAG AAAAAACGAA AATTGTTTCG 6180 

ATTGCTCATG TTTCAAATGT GTTAGGGGTC ATTAATCCAA TCGAAGAATT AACACAATTA 6240

GCTCATCAAA ATGGCGCTGT GATGGTGGTA GACGGCGCGC AAGCAGTTCC TCATATGCCT 6300 

GTGGACGTTC AAGCCATCGA TGCTGATTTT TATGCATTTA GTGGTCACAA AATGTGTGGA 6360 

CCAACTGGAA TTGGTGTCCT TTATGGCAAA CGTCATTTAT TGGAACAAAT GGAACCTGTG 6420 

GAATTTGGTG GCGAAATGAT TGACTTTGTT CATCTTCAAG AAAGTACTTG GAAAGAGCTT 6480

CCTTGGAAAT TTGAAGCTGG CACACCTAAT ATTGCCGGGG CAATTGCCTT AGGTGCCGCC 6540

ATTGATTATT TAACAGAAAT TGGTTTAGAG GCCATTCATC AACATGAAGC AGCCCTTGTT 6600 

CACTATGTTT TGCCGAAGTT GCAAGCTATC GAGGGCTTGA CGATTTATGG TCCACAGGAT 6660 

CCAAAAGACC ACACAGGTGT GATTGCTTTT AATATTGAAG GTTTACATCC ACATGATGTG 6720 

GCTACGGCGT TAGATATGGA AGGGGTCGCT GTCAGAGCGG GTCACCATTG CGCGCAACCG 6780

TTATTAAACT ATTTGAGCGT GCCAGCAACC GCACGGGCAA GTTTTTATTT ATACAATACC 6840

AAAGAAGACG CAGATCGCTT AGTTGAAGCG ATTAAAGCGA CAAAGGAGTT TTTCCAGCAT 6900

GGCACTTTCT AAATTAGATA ACTTATATCG CCAAGTCATT TTGGACCATT CTAGCCACCC 6960 

ACATCACCAC GGAACGCTAG ATGCGTCTAG CCAAACAATT GAACTGAATA ATCCTACTTG 7020

TGGAGATGTG ATTGAACTGG ATGTGGCCAT TpAAGACGGT GTGATTAAAG ATATCGCCTT 7080

TCAAGGAAGT GGCTGTTCCA TCftGTACAGC CAGTCCTAGT ATGATGACTG ATGCGGTCTT 7140

AGGAAAAACA ATCGCCGAAG CAACGGCCTT AGCAGAGGAT TTCTCTCAAC TTGTTCAAGG 7200 

AAATGAAGTG GCCGAAGATG AAAAATTAGG CGATGCAGCA ATGTTAAGCG GCGTAGCTAA 7260

ATTTCCAGCT CGGATTAAAT GTGCAACCCT GGCTTGGAAA GCCTTaGAGC AGGCTGTTGC 7320 

TAATAACGGC CAAGGCGAAg cCGGTcATTT ACATTGTGAA AAATAAAGAA AGGGTGACGA 7380 

CGAATTGAGT ACAGTACCTG AATTAGAAGA ATATAAATTT GGCTTTCACG ACGATGTGGA 7440

ACCAGTCTTT AGTACCGGAG ACGGCCTAAC CGAAGACGTC GTTCGCGAAA TTTCTCGCGT 7500 

GAAAGGCGAA CCAGAATGGA TGCTTGACTT CCGTTTAAAA TCGTTAGAAC AATTCAATAA 7560 

AATGCCCATG CAAGAATGGG GTCCAGACCT GTCAGATATT GACTTTAGTA AAATTAAATA 7620

TTACCAAAAA CCGAGTGACA AACCCGCACG CGATTGGGAT GATGTACCTG ATAAAATCAA 7680

AGAAACCTTT GAAAAAATTG GGATTCCCGA AGCTGAGCGT GCTTATCTAG CTGGGGCTTC 7740

TGCTCAATAT GAATCAGAAG TTGTTTACCA CAACATGAAA GAAGAATTTG AAAAATTAGG 7800 

CATCATTTTT ACAGATACTG ACTCTGCTTT AAAAGAATAT CCTGACTTAT TCAAAGAGTA 7860 

TTTTTCAAAA CTTGTGCCAC CAACAGATAA TAAACTGGCC GCTTTAAACT CAGCTGTTTG 7920

GTCAGGTGGA ACCTTTATTT ATGTACCAAA AGGTGTTCGA GTTGATGTAC CATTACAAAC 7980

ATATTTCCGA ATCAACGCTG AAAATACCGG CCAATTTGAA CGGACCTTGA TTATTGTCGA 8040 

TGAAGGGGCC AGCGTTCACT ATGTTGAAGG CTGTACGGCA CCCACGTATT CAAGTAACAG 8100 

CTTGCATGCG GCCATCGTAG AAATTTTCAC TAGAAAAGAT GCCTATTGTC GTTATACCAC 8160 

TATTCAAAAC TGGTCAGATA ACGTCTACAA CTTAGTAACC AAACGTGCTA AAGCTTACGA 8220 

AGGTGCCACT GTCGAATGGA TTGATGGGAA CCTAGGAGCT AAAACAACCA TGAAATACCC 8280 

AAGCGTTTAC TTAGATGGCA AAGGTGCCCG TGGCACGATG TTATCGATTG CCTTTGCTGG 8340

CGCTAACCAA ATCCAAGATA CTGGGGCCAA AATGATTCAC AATGCGCCGA ATACCTCTAG 8400 

CTCAATTGTT TCAAAATCAA TTGCTAAAGA CGGCGGCGAA GTTAACTACC GTGGCCAAGT 8460

GACTTTTGGC AAAGACAGCG CAGGTTCGAT TTCCCACATC GAATGTGACA CGATTATCAT 8520 

GGACGAAAAA TCAAAATCAG ATACGATTCC GTTTAACGAA ATTCATAACA GCCAAGTGTC 8580 

CTTAGAACAC GAAGCCAAAG TTTCAAAAAT TTCTGAAGAA CAATTATATT ATCTAATGAG 8640

TCGCGGCTTG TCTGAAGCCG AAGCGACAGA AATGATTGTT ATGGGCTTTG TTGAACCATT 8700 

TACGAAAGAA TTACCGATGG AATATGCGGT TGAGTTGAAT CGGTTGATTA GTTATGAGAT 8760

GGAAGGCAGT GTAGGTTAAA CCTGTTATAG GAGCTAGAAC CCTTGATATA TAAGGATTCT 8820 

AGCTTTTTTA TTTTGCCTTG ATAGTAGAAA TGACCAGCTT TTTACTGATG AGCCATCATA 8880

TCAGCAAACT CTTGACCAGT TTTCTTAGGG GCGCTGTTTG CAATATCCGT ATAGATAGAT 8940

TACGTTACTT TTCTATTTAT GTTGATGAGT CTATTCCAAA TGATAAGCAA GCTATATCAG 9000

AAGCAATTGA ACGTCTGAAA GAATTAATCT AJCTATTCAG AAGAAGGTGG GAGAAGGGAT 9060

TACAGCAAGC TTAAAAGAGC CACTGGAAAC GGAGTACACT CTTACTCTAA TTGATGGACA 9120

mCCAACTCAA GCTAATCATA TCyTCGGTAG AaTGATACAa GTAATTGAAT GAATAGGATA 9180

GAGAACCCCA CCCCGTTGTT ATTTGACCGT ACTAAGGATa CGCTTTAAAT GAGCTACATG 9240

GCTCACATGG GGTATTCTTT TTTGACTTTT GCaAAAGGTT AATAAATTTT GATACaTGAA 9300

ACGAAaTGAA ACACTGCTTT ATAAACATTG ATATAATAGG GTTTCTGTGT TTCGTATGTA 9360

AATGAAACGA AAAATAGTTT TTAGTTGAAA ATACTTAAAA AGTAGTGCAT AGTTGTGGGA 9420

GTTATTGCAG ATGAAACAGA CCTACAAATA TTTCATGTGC AGCAAGTTAA AGAATACTTT 9480

TTTATGGTAT AATTGTAGAT GTCTAAATTC ATGTGGGGAG GAATTGTAAT GAGTAAAAAT 9540

GGTAAAGTGA TTTCAATTAT CAATATGAAG GGTGGAGTTG GTAAAACAGC GTTGAGCGTA 9600

GGAATATCAT CTTTTTTGTC AGAAAAAAAA GATGAAAAAG TACTTTTGAT TGATAGTGAC 9660

CCTCAATTCA ATGCAACTCA AGCGTTCATA GATCCAGAAG ATTATCTGAA AAGTGAAAAA 9720

ACCATTTTTA AGTTATTTAA GCCCCAGACA GAATTACATC AGTCTTTTGT AATGCCAAAA 9780

AGAGAAGAGC TGGTTACTAA AATAAACAAA AACTTAGATA TATTAATGGG TGATTTGAAT 9840

CTAGTATTAG TTAATAAAAG TAGCGATTCA GGTTTAATTA AACGTTTAAA ACGATTTATT 9900

ACAAAAmATA ACTTAAGGAA TTATTATGAT TATATTATAA TTGATTGTCC ACCAACTTTA 9960

ACGTTATACA CAGATAGTGC ACTAGTTAGT TCTGATTATT ATTTAATTCC TAACCGTATA 10020

GACAGATACT CAAATATAGG AATTAGCTCA CTGAATAGAG CTATTGGGGA TTTGATAGAT 10080

CAAGAAGACT TAGAATTAAA GTGTTTAGGA CTAATTTATA CGATGGTTCA AAATCATCTA 10140

TCAGATAAAC AAAAAGAAGT AAAAGCCGAA TTGGAATCTA ACAAAGCGAT GGAAAATATA 10200

TATGTGTTTA AATCTAGTAC TAGCTTAGTG AATGATATGC AAGTGGGAAA ACAGGGACCT 10260

ATTGCTACAA GGTATGCTAA ATCAAATAAA GATATAAGCG ATGTGGTAGA TGAACTGCAA 10320

GCATTACTTA GCAAAAATTC TTCAAAAGAA GGGGTGACTG TAGATGGATA ATAGATTTTT 10380

ATACAGAAGT AGTTTGA 10397

(2) INFORMATION FOR SEQ ID NO: 291:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4956 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291:

CAGTAAATAT TTTAGTTAAG AGATGAAACT TGCATGTGGA TTTGAAACAT GTAAGTTTTT 60

TATTCTGTTT TAATAGTTGA TTATTTTTTG TTTTATACGA TAATGAATCT CGTATGATTG 120

AAATTTTTTT AAAAAAGGTA TGTTTTATAT AGTGACAGAG AAGAGGATTA TAATGGGAAA 180

TCAAGAGAAC AAATTATACC ATTTTGTCGG AATTAAAGGT TCTGGCATGA GCTCATTAGC 240

GTTGGTCTTA CACCAAAAAG GCTACAACGT TCAAGGATCA GACGTAGAAG AGTATTTCTT 300

TACACAACGG GATCTTGAAA AATCAGGTGT CCCTATTTTA CCATTTAATG CAGATAATAT 360

TGATAAAGAC ATGATTGTTA TTGCTGGAAA TGCTTTTCCA GATACCCATG AGGAAATCGC 420

CCGCGCCATC GAATTAGGCG CAGAAGTAAT TCGTTACCAC GATTTCATTG CTGGTTTTAT 480

CGAACCGTAC ACAAGCATTG CTGTAACAGG GTCACATGGG AAAACAAGTA CGACTGGTTT 540

GCTAGCACAT GTATTAAGTG GTATCAATCC AACTAGTTAT TTAATAGGGG ATGGCACTGG 600

CCACGGCGAA CCAnATGCTG ATTTCTTTGC ATTTGAAGCG TGTGAGTATC GCCGTCATTT 660

CTTGGCTTAT TCACCAGATT ATGCGATTAT GACGAATATC GATTTTGATC ATCCAGATTA 720

CTACAAGAGC ATTGAGGACG TTTTTTCAGC GTTCCAAACa ATGGCTCATC nAGTCnAAAA 780

AGGAATTTTT GCTTATGGTG ATGATAAGTA TCTTCGCCAG TTAGAATCAG AAGTGCCaGT 840

TTATTATTAT GGCGTCAGCG AAGAGgAkGA TATCCArGCC CGATkGCTkG ATAATTCAkG 900

CCCTCTTCCA AAaTGAAATC AGTCATCATT TGGAGTGCTG GACCAACACC AAAAATAATC 960

CCTcCAGCTA AACTAAATAA aAcAAACAGT AAATTTAGTT TGaTTACTGT CCAGGAAAGG 1020

TAAAACAACC GTTGAATCCC TGTACTTTCC ATCTTCCTCT TCCTTTCTAT TATGTAAAAG 1080

GCTAAGCCAA AAGCGCTCAG CTAATGCCTC AGCCTCCTGT GATTATTTTT GAGATTGGAT 1140

ATATTCGTCT AATTGTGTTT GCATTTCTTT TTGAACTTTA TCCCAGCCAG CTGTTTTTAG 1200

GTCATCCATT AATTTTGGAA GTGTTTCTTC TGGATCAACA GTTCCTGTAT TTAAGCTTGC 1260

TGCGTAACGG TTCATAACTG TAGCAACGTT AGTGATTTCA GTTTTCACTT TATCATTTAC 1320

AAAAGTAAAA CCAAGAATTG GTgAATCTTT TGCTTCTTCG ATGCTCTTAT CACGTTCTTT 1380

AACCATTTCT TCAGTGACAG ATTCTTCTGG CCAAATGATT AAGTTGTTTC CTGTGTTCCA 1440

AGCACTCAAA TGAGTTGTTG GTGTGTAATC TTTCAACAAT TTCACACGAT CATCGCCAAC 1500

TTTTTCATAT TGTTTGCCTT CTTCACCATA AACAAGTCCG TTTAACAATT CTGGATTGCT 1560

GTTTAATAAA CCTAACAATT CAACAGATTT TTCTTTGTTT TTAGACGTGT TTGCAACAAC 1620

ATAGTTAGCC ATTTGCGCTT GAGCTGTTGT TTTTAATGGT TCTGTTAGTG GACGAGAAAC 1680

AAGTGGTTTG CCAGCAGCTT GTGTTAAGAT TGTATCACCA TAATCCATAG GTCCTTGTGT 1740

TTCTTGACGC ATAAACCAAG TATTGGTATT TAAGTCATAT GGTGTTGTAC TTGTAGCAGC 1800

ATCTGTTGGA ATCAAGCCAT CTTTATACCA TTGATGCAAG ACTTTTAAGT TATTAATCAT 1860

GTCTTTGTCG GCATATTGGT TAATAATTTT TGGTGAGCCA GTATCAGTTG TTTTTACTGC 1920

AAATGGATAT TGGTTACCAA TAGGGAAGTC ATAATTACCT GTTGCAAAGA ATGTTTGGCC 1980

GATAGCAAAA GCAGCAATAT TTGGkTCyTT TTTAwGGaAT TCTTTTAGAA CTTCCGTTGC 2040

ACTTTCATAA CTACCATCGA CTTTACTAAT ATCTAAATTG TATTTATCGA CATATTCTTT 2100

ATTAAAAGTT AAAACTTGTT GACCGTAAGA GTTACCTAAA ATTGGGAACG CATACAGTTT 2160

TCCATTAATC GTATTTCCTT TAATATAGTT ATCTGGCAAT TGATCATAGG CTTCTTTGGC 2220

ATATTTAGGT GCTAAATCAG TTAAATCAGC ATAGGCGCCT TTTTGTGCAT TCGTTGCATA 2280

ATTTTGTGCT AATGAAATAT CATAGCTTTC ACCAGAAGCA ACGATTGTTG ACATTTTTTG 2340

GTCCCAATCG CCCCAACCAA CAAATTCCAT TTTTAATTTT GCCCCAATTT TTTTCTCGAT 2400

AATTTTATTC GCATTATCGA TTAATTGGTC ATAATTATCT GGTTTGTCCC CAACACGATA 2460

CATTAATAAC GTTGGTGTAC TATCATCACC TTTTGACGCT GCATCTTTTG AAGATTTACC 2520

GCACGCGCTT AGTCCTACAG CTAAAGCTGT GCGCCAGCTA CGGCTAATCC TTTTTGCCAT 2580

TTTTTCATTC CTTATTTCCT CCTGTACAAT TTTATTCTTT CACGCCGCCG ATTGTTAAAC 2640

CACCCACGAA ATATTTTTGG AAGAATGGGT AAGTTAACGC AATTGGTAAC GTAGATAAGA 2700

CAACAATCGC CATTCGAGCC GATTCACTTG GTAGAACGCC TAAGCCTTCT TGTGCTGCTG 2760

CACCCATATC CGTACTTCTT GATAAGAACT CTAAGTTGTT TTGAATTTTC ATCAATAAAT 2820

ACTGTAATGG TACCAAGGTA TCTTTTTGAA TGTAAAGTAA CGCATTGAAC CAATCGTTCC 2880

AATAACCCAA AGCCGCAAAC AAACTAATTG TGGCAATCCC TGGAACGGCT AGTGGTAACA 2940

CAATGCTGAC AAAAATGCGC ATTTCACTTG CGCCATCAAT ACGTGCCGAT TCAATAATAC 3000

TGTCAGGCAC GGTTTTCTTG AAAAACGTTC GCATCACTAA AATATTAAAT GGTCCTAAAG 3060

CTAAAGGCAA AATTAaCGCC CAAATTGTAT CTTTCAATTG CAATAAGTTT GTCATTACTA 3120

AATAGTTCGC AACCATCCCT GGTGAAAATA ACATTGTAAT TAACGCAAAC AATGTAAAGA 3180

ATTTTTTAAA TGGAAAGTTG GAACGAGAAA TGACATAGGC ATATAATGAC GTCATTGTTG 3240

CATTGACAAT CGTCCCTGTC ACCGTAATGA AGACCGTTAC CCCTAAGGCA TGAATAATGC 3300

GTTGACTCAT TGCCCCTTGG AAAATGTAAC GATAGGCCTC CAATGAAAAT TCTTTTGGCC 3360

AAAACCGGTA GCCGTTTGCT GCTAATGATC CTTCACTTGT CAATGAAATC ATAATAACAA 3420

rGAAAAACGG AACCACACAG GAAATAGCAA AAATAGCAAT GACAATATTG AAAATCAAAT 3480

TCGCTGTCGG GTTGAAGGAT CGAACGTTAA CGCTAGTGAC TTTTTTCTTT TTCATAATCC 3540

TTCCTCCTAA AATAGTGCTG AATCTGGTTC AATTCGACGA ACAACTAAAT TCGCAATTAA 3600

TAACAAACAT GCCCCAACGA CAGATTGATA TAACGCTGCA GCTGCCGTCA TGCCTACATC 3660

ACCAGTAGCC GTTAAGCCAT TGTAAATATA GGTATCTAAC ACACTTGTCA CGTTGTAAAG 3720

CGCACCTGAG TTTCTTGGTA CGATGTAGAA CAAACCAAAG TCTGCTCTGA AGATACCGCC 3780

GATGTTTAAA ATTAATAAAA CAGACATCAT TGGTAAAATT TGTGGAATCG TAACATTTTT 3840

TATTTGTTGC CATTTGTTGG CACCATCCAC CATCGCTGCT TCATAGTATG TCGGATCAAT 3900

CCCCATCACA GAAGCATAGT AGATAATACT GTTGTAGCCT AGACTCTTCC AAACATTTAG 3960

GAAGACAAAA ATTACTGGCC ACCAACGTGG ATCTGCATAC CAGTTAATCA TTGTGCCGCC 4020

ATTACCAGTA ATCCATTGAT TCAAAATCCC TTTATCTGGA CTTAAAAAGG CATAAACAAA 4080

ATAGTTAATA ACCATCCAAG ATAAGAAATA TGGTAGCAAT GACATCGTGT GATAAACTTT 4140

AAcCAAGCGA CGATTTCTTA ATTCACTCAT AATAATAGCA AAAGCAATCG CGAAGAATAA 4200

GTTAAAGGCT AAAAAGACGA CATTGTACAA AATGGTGTTT CTAGTAATTA ACCAAGCATC 4260

TTTTGACGCA AATAAAAATT TAAAGTTTTC AAAACCAACC CAAGGACTTT CTCTTAGACT 4320

AGCTAAGAAA CCATCTGGTG AAATATGAAA ATCTTTAAAT GCGACCACAT TGGCTAAGAC 4380

TGGTATATAG AAGAAAAATA TAAACCAGAT AAATCCTGGT AACGCCATAA GGATTAAGGC 4440

CTTATAACGC CATACATTTT GAAAAAACCC GTTTTTCTTC TTCATTCGTT TCGCTCCTTT 4500

CATAAATATA ATAGCGTTTT CATAATATCT TTTTAAGGAA CAAATTATAG GATTTGAGAA 4560

ACAAATTATA GATTTTGTGT CATTTTAAGG TAAAAATTTA CCGAAATGAC TGTTCTTTGT 4620

GaAAACGTTT ATCTGATATT TTTTTTACAA CTGTTTAAGA ACCTTAAAAA CAATTGGCTA 4680

AACCTCACTC ATAAAGCGAT TTTGCTAGCG TGTATTTGTT TTATAAAATC TGAGAACTTG 4740

ATAGTTtCAC AAAGGATATC CTCTAATTAT TTTTGAGGAG GACTATTTTT GAATTTTATC 4800

CGTAAAAAAC ACTAGAGCAA TTCGTTTAAT TATAGTTTAT TTTTTTGCAG TTTTAATGCA 4860

AGCTGGTTCT TTATTATTAC TAGCTATTTC ACTACTATTT TTCTATTTGC CTTTATGCAC 4920

ACACAATATG TTGGATTTCT TGTTATAAAT CAAAAA 4956
(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20561 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

TGAAGTAGAA GAAGAAAATT ATCCAGAAAA TAGCTCAGCA CTTGTTGCTC GCTTAGAATT 60 

TCGAATTGTC TTTGACGAAT TTGTTTTATC AGGGGCGATT AGCCAAATTA ATCACATTAT 120 

TGATCGTAAA ATTGAAAAAC AAGAAGATAT TTCGCAAGAA GAAGTCGATG AACTTGTGCG 180 

TCCTTTATTT AGCATCGTCG AACGTTTATC TTATGAAGTA ACTGAAATTG CTCTGGATCG 240

TCCAGGTGTT CAATTGAATT TTCAACAATC AGAAGAAGCT TAAACTTTAA AAACAGCGTT 300 

TATAACGGAT TCCGTTATAA ACGCTGTTTT TATTCTTGTG AAGATTTAGC AACCAAAGCT 360 

TTTTTAAACT GTTCATAATT TTTTGTAGCG ATATCCACAT AAGCAATCGG CTGACCAACT 420

TTGGTGATTT TTTCGGGAAG ATAGGGAGCG AGCATTtCAT TGTACAAAAT GAACGCAGCT 480

TCTGGTACTT CCTCTGAAAG AACATCATAA CTGATAAAGT CmATTTCATA GGTTGGTTGA 540

AAAAAGGCTT TGATGTTTTG CTGCTCTTTA TTTGAAAAGA GCCaAGGaCC TCCATTTGcA 600 

GGGCTCCCCa CAAAcAACGT CGCTCCaTCa CGGTTGACAG AAAGGATCCa ACAAATTTTT 660 

TGCATACGCT TACACCTCCT AGAACTAAGT ATACCACGTC AGCTTAAAAG AAAGAGGACA 720

AAAAACAAAT TATTTTTACA GTGACTAGTA AAACAACGCT TGGTGATAGT TTTAAACTAT 780

CACAACGGTT ATTTTCTATC AAAAAGTAAG CAAATTATCC CACTATTTGC TATAAAATAA 840

GGTACAATAG AACTATCACT GGAAACAGTT TPCTGTTTTC AGGGTATTTA CTGGAGAGTG 900

AGTACCGGGA ACCTTTACTT CCTTTGATGJ ACTTGTCATT AAAAACATAG GGAGGAACTA 960

AAATGACAAC ATTCATCACT GTATTAGTGA TCCTTGCATT TATTGCTGTG TTATTCGGTT 1020 

TCTACCAAAT GCAAAAGAAA CATTTAAAAT TCTCAACACG TGTCTTTTCA GCATTGGGCG 1080 

TGGGGATTGT TTTAGGTGCA ATTATTCAAT TTGCTTTTGG TACAGACAGT AAAATTACTA 1140

CGCAATCAAT GGAATGGATC GGGATTGTAG GTAATGGTTA TGTAGCGTTT CTACAAATGC 1200

TCGTAATTCC GTTAGTATTT GTATCTATTG TAGGTGCATT TACAAAAATG AAAGAATCCA 1260

ATAAATTAGG AAAAATTAGC TTTAATGTGT TAGCCACCTT ACTAGGAACA ACAGCAGTAG 1320 

CTGCCTTGGT GGGGATTGGA ACAACCTTAG CGTTTGGTCT TCAAGGGGCA AAATTCACGC 1380 

AAGGTGCTGC AGAAACGTCA AGAATTGCCG AGTTAGCGAC GCGTCAAGAT GCGATTCAAG 1440

ATTTGACGAT TCCCCAACAA ATCGTTTCAT TCATTCCTAA AAATGTCTTT GCTGACTTTG 1500 

CGGGAACACG ACCAATGAGT ACCATTGGGG TGGTAATCTT TGCTGCGTTT GTTGGTGTCG 1560 

CTTATTTAGG CGTTCGTCGT AAAGCGCCAA AAGAAGCTGA ATTTTTTGCA AATTTAATTG 1620 

ATAGTTTATA TAAAATTACC ATGCGAATTG TTACCTTAGT TTTACGCTTG ACCCCTTATG 1680 

GCGTGTTAGC GTTAATGATT AATGTTGTAG CAACAAGTGA TTTTGTTGCG ATTATTAACT 1740

TAGGTAAATT TGTGTTGGCA TCATATGTGG CCTTAATCAT TGTTTTTGCC ATTCACATGC 1800

TGATTTTAAT TACTTTAAAA GTCAATCCAG TTACGTATCT AAAAAAAGTT TTTCCAGTCT 1860

TGAGCTTTGC TTTTACTTCT CGTTCAAGTG CAGGAGCTTT ACCATTAAAT ATTGAAACAC 1920 

AAACCAAAGC ATTAGGTGTC GATGATGCCA CAGCAAACTT TGCTGGTAGT TTCGGTTTAT 1980 

CTATTGGACA AAATGGTTGT GCCGGCGTTT ATCCAGCTAT GTTAGCAACC ATCGTTGCAC 2040

CAACTGTTGG AATTGATGTC TTTAGTTTAC AATTTATTTT AATGTTAGTA GCAGTTGTCA 2100 

CAATCAGTTC ATTTGGCGTA gcgGtGTCGG CGGCGGAgCA ACCTTTGCTT CTCTAATCGT 2160 

TTTAGGCGCA ATGAACTTAC CAGTTGCAAT TGTTGGTTTA GTAATTTCAG TTGAACCACT 2220 

TATTGACATG GCGCGGACAG CTGTTAACGT TAACGATAGT ATGGTTGCAG GCGTGTTAAC 2280 

AAGTGCTCGT ATTCATGAAT TAGATCGTGA TGTCTTAAAT GATCGCGACG TTGTCTTAGA 2340

TGCAAATATC TAAATCAATA AAACGAAAAG TGTTTGAGCA ATGTCTTGCT CAAACACTTT 2400

TTTTGAATTT TAAGCGAAAG TGCAGTAATT ATTTAAAAGA ACGTATATAA TAAGTTGTTA 2460

ACTAAATTCA AAAGGACGTG AAGGAATGTT TTCAAACCAA GTAAAGATTA TGTTATATGT 2520 

GAATAATGTT GAGGAATCTA GTCAATTTTG GCAAACGTTT GGCTTTGTTG AAAAGGAAAG 2580 

AGAGGAAGTT GACGGCACAT TGGTGGTAGA AATTGCACCA AGTGAATCAG CAGAAGCGAT 2640 

TATTGTTTTA TATGATTTGG CCTTTATTCA AAAACATTCA CCAGAAGTGG CGGGGAATAC 2700

CCCTTCTTTA ATGTTTGCTA GCGATGATAT CATTGGTCTC TATAAAAAAA TGCAAGAGGC 2760

TGGCGTTACT GTTGGTGAAA TGGTTCAATT GCCAACAGGC TTAGTTTTTA ATTTTGCAGA 2820

TAACGATGAT AATTACTTTG CTGTGATGGG GCAAGAAAGT AAATAAAAAA CAGCTTGGCA 2880

CCAGAATCAA TCTAGATTCC GGTGCCAAGC TGTTTTTTAT AAAAGTTGAA AGTTTTATAA 2940

ATTTTACTAA AATCTGTAAT TTTCATGAAA ATATAGCGTA TGATAATGCT GTTAATAGAA 3000 

TAGGCGGAGG TAAGCGGTAT GCGTAAACGA CATGCAAAGA AAAGACATGG AGGAGTGAAT 3060 

TGGCTTTTTA TAGTATGTTT GTTGGTGGTG ATTGGTGGTA GTGGTTATTT AATAAAAACG 3120

TTCTTTTTCA CTAGAGATTC ACAAGTTAGT CAAGAATCGA AAGTGGTCTT GGAAGAAGAT 3180

CGCCGAAGTG ATAATTATGC GAATTTAACG AAAGAAATAG TTGCACCAGA TAGTGGCGAA 3240

CTTGATCAAA AAATTCAAGA AACAAATTAT ATTGGTTCGG CTTTGATCAT TAAAGATGAT 3300 

CAGGTTTTAG TAAATAAAGG ATATGGCTTT GCCAATTTTG AAAAGCAACA AGCCAACACG 3360 

CCAAACACAA GGTTTCAGAT TGGCTCAATT CAAAAATCTT TTACCACAAC CTTGATCTTA 3420

AAAGCAATTG AAGAAGGTAA ACTTACATTA GATACAAAAC TCGCTACGTT TTATCCGCAA 3480

ATTCAAGGTG CTGAGGATAT TACGATTAGC GATATGTTGA ATATGACAAG TGGTTTAAAG 3540

TTATCAGCAA TGCCTAATAA TATCGTTACC GATGAAGAAA TTATTCAATT TGTTAAACAA 3600 

AATACCATTC AAGTCAATAA AGGAAAATAC AATTATTCCC CAGTAAATTT TGTCCTTTTA 3660 

GCAGGAATGT TAGAGAAAAT GTATCAACGT ACCTATCAAG AATTATTTAA TAATCTTTAT 3720 

CACAAAACGG CTGGTTTAAA GAATTTTGGC TTCTATGAAA CCTTATTGGA ACAGCCCAAT 3780 

AATTCAACAA GTTATAAATG GACAGAAGAT AATTCATATA ACCAAGTGCT CTCAATTCCT 3840 

GCAGCTAGTT TTGCCCATGA ATTTGGGACT GGTAATGTGG ATATGACGAC AGGTGATTTG 3900 

TATTGGTACT TACATCAATT AACGAGTGGA CATTTAGTTT CCACCGCACT TTTGCAAAAA 3960 

TTATGGACGT CTTCTCAGCA AAGCTCTTAT CATGGCGGCA TCTATGTTCA TGATAATTAT 4020

TTACGTTTAC ACGGCGTTGA AGCGGGTCAA CAAGCCCTGG TTTTATTTTC AAAAGATATG 4080

AAGACAGGGG TCATATTGCT AACTAACTGT GTGAATCCAG CGAAATACAA AGAATTAATT 4140

GGTTCGTTGT TCCATGATGT AACCAATTTA ACTGTTAAAT TTTAAAGCAG AAGTAACAAC 4200

GATCACTCCC AAAGCCTTTT TAGGTTTCTT CTGAGAGACT TTATATTAAA ATAAAGGGAA 4260

GAACGATCAA AGGGAGTGAC GAAGATGAAA CAATCAAAAA ATAAACAAGT TGTTATTGGT 4320

GAACCAACGA AAAAAAACAT TTATCTTCAA GATATAGAAC AGGAAACGCA AGCTTTTATT 4380

CTCCAAAACA ATCCACAATT GACAGCTGAA TCGATGATTA CATTGGCGGA GTTACTTAAT 4440

TATCGTTTGG ATTACATGAA ACAATTGGTC GCAAATGATT CTGAAAAAAT CCAACAACTG 4500

AATGAATTGG TGGTTCAACA TATCAAAGAA GATAAGCTGA TTTCCAATAA TATGAATGAA 4560

AAGATGGAAA AGAAATTAAC CTTTGGCCAA CGAGCAGCGG ATACCATTGC TAAATTTGGC 4620

GGTAGCTGGG TGTTTATTGG CTTCTTTTGC TTCGTGTTGA TTGCTTGGAT CGTGATTAAC 4680

TCTACCACTT TATTGGGACA ACCATTTGAC AAATATCCTT ATATTTTGCT AAATCTCGCC 4740

TTATCTTGTT TAGCAGCTAT CCAAGCACCG ATTATTATGA TGAGTCAAAA CAGACAAGAA 4800

GCCCGTGACA GAGAACAAGC GAA7AATGA7 7ataaaatta a7t7aaaagc AGAAGTAGAA 4860

ATTAATTTAT TACATGAGAA AAT5GATTA7 attatcaata atcaactaga AAATTTGGTG 4920

AAAATCCAAA ATATTCAAAT TGAACTTT7A 3gtgaattgc aagaacaact CGCCGCGACT 4980

TCTGAAAAGG AGTAGTATAT AGAAGAGG73 gggacagaac cgt77agctc CGAGAACCAA 5040

GTAGGTACTG ATTAACATCT ATTTGCTT73 caaatagtg: tcc7Cagcaa TAAGAAGAAA 5100

TTTCCGAAAA TTGTCATAAA ACACAACGA3 3aacgagttg a7g7tgcatg ATACCTACCC 5160

GGTGTTCTTT AATTTTTGGA GACCAAGTA3 gta7tgttca tca7cgaatc ACTACGTTCT 5220

TCGTGATTCT CAACAATTTC GGCTTATTGT 7ttctcgcca aaggctcgtc ACAGTTTTAT 5280

CCTTGTAAGT TCCGAAGGAC CTTCACCAA3 7aggtacga7 gat77catct GTTCATGCTT 5340

CACAGTGAAA CATCGCAACT CTTAGAGCTG tagctcttt7 gaaggctgta CTGTGACGAG 5400

ACGCATGTCG AGAAAACACC TTAGAAGTT3 ttgagatcgg agcaacgcgt AGTGTTGGTT 5460

CTGTTCCCGC CGTTTATCAG GCTTTTTTTA TTGTGTGTAA A7ACACAAGC AAAATAATCG 5520

CTTTCCCTTT TTTATTTACC GTATAATGG3 ATAGAATGAC TTTATACAAA AGGGGAAAAC 5580

AAGATGAAAT ACCAAGTATT ACT7TATTA3 AAA7ATACAA CAA7TGAAGA TCCAGAAGCT 5640

TTTGCGAAAG AGCATCTAGC TTTTTGCAAA TCA7TAAAC7 7AAAAGGCCG TATTTTAGTA 5700

GCGACAGAAG GGATTAACGG AACGTTATC7 3GTACTGTCG AAGAAACAGA AAAGTATATG 5760

GAAGCAATGC AAGCAGATGA GCGCTTTAAG 3ATACATTC7 TTAAAATTGA TCCAGCAGAA 5820

GAAATGGCCT TCCGCAAAAT GTTTGTTCG3 CCACGTTCTG AATTAGTGGC GTTGAACTTA 5880

GAAGAAGACG TTGATCCATT AGAAACGACG GGGAAATAT7 T3GAACCTGC AGAATTTAAA 5940

GAAGCCTTAT TAGACGAAGA CACTGTTGTA ATCGATGCTC G7AACGATTA TGAATATGAT 6000

TTAGGTCATT TCCGTGGTGC CGTGCGCCCA GATATCCGTA GCTTCCGTGA ATTACCACAA 6060

TGGATTCGCG AGAACAAAGA AAAATTTAT3 GATAAAAAAA TTGTTACCTA TTGTACTGGC 6120

GGGATTCGCT GTGAAAAATT TTCTGGCTGG TTA7TAAAAG AAGGATTTGA AGATGTTGCT 6180

CAATTGCATG GTGGTATCGC CAACTATGGA AAAAATCCAG AAACACGTGG CGAACTTTGG 6240

GACGGCAAAA TGTATGTCTT TGATGACCGA ATCAGTGTCG AAATTAATCA TGTTGATAAA 6300

AAAGTTATTG GGAAAGACTG GTTTGATGGG ACACCTTGCG AACGCTACAT TAACTGTGCA 6360

AACCCAGAAT GTAATCGTCA AATCTTAACT TCAGAAGAAA ATGAACATAA ACATTTAGGT 6420

GGCTGCTCAT TAGAATGTAG CCAGCATCCT GCCAACCGT7 ATGTAAAAAA ACATAATTTA 6480

ACAGAAGCAG AAGTTGCTGA ACGTTTAGCT TTG7TAGAAG CGG7TGAAGT ATAAAAATAA 6540

TAAAAAAATA AACAAAAAAA CTTCGAAAAT TAT7TTTCGA AGTTTTTTTG TTTATTTTTT 6600

TGTCTTTAAA AGTGTTGATT TTAAAGCT77 CTTAGCTAAA ACCATTTTAT TTTTTAAAAG 6660

CGTCTGAAAA AAATAAAAAA ATCTAATAAA TAT7TCTTT7 ATATAATGTT TATTGGCTGG 6720

GCCAGTCAAT GTTGAAAATG GGGAAGGAGG AAT7CAGATG AAAATCATAA AAAGGTTTAG 6780

TTTGGTATGT TTAGGGCTAT TGATCATTGG GttGCmAACA AAAAgCGkTA TGGCTGAAGA 6840


AAATAATTAT 


GAATCAAATG 


prpp aappp^AP, PTTPTATGGT 


ACCTACGTTT 


ATGAGAATGA 


6900 


AAAAGAGTCA 


AATGACGTAG 


rrTTiTarrra APAATPAGAA 


GAACAGGGAA 


GAAACAATTT 


6960 


AGCTGCTTCT 


GGACAAGCAG 


rpmmfpnrT'T A A AAPAftftPGAG 


TCTGAAAATC 


CGCTGTATTC 


7020 


CTTGATAGGA 


GTTAGTTTGT 


TrrrraTfli^T PATTTATTTA 


ATTAATAAAA 


TGAAACGAGA 


7080 


GAAGGAGTTT 


ATTTAATGAA 


7\ 7\ a t\ npTPTT 1 TTAPTTAPAT 
AAAAAb lull l i l i rt v^ri i 


CGGCAACCAT 


ACTTTTAGGC 


7140 


GTAGGGTTTC 


AAGCAATAAA 


prnaTrsrra PPAP.AAAPTT 


TTCCGAAAGA 


ATACAATACA 


7200 


GAAGGCACAA 


TTACTTTTGA 


7\/^o7\r , r ,r PP7i r P ZiATflAAATPA 
AGC Abb 1 bn 1 nn 1 uAm i v-rt 


CACCGCCTGT 


TGATCCAGAA 


7260 


AATCCCGATC 


CTACTGATCC 


ti rmArn rrTR STTf^ATPPTP 
AGTAGAUCUA. Al ibrti^i^ 


CAGGGCCAGG 


AACGGGTGGT 


7320 


GCCTTATCAA 


TTGATTATGG 


GTCAAAATTT AAAl 1 1 win 


PACAAAAGAT 


TTCTACGGCA 


7380 


GATCAAACGT 


ATTATGcTGC 


GCCAGATbAA A 1 LjA/iHUn i ^ 


GTTCAAAAAA 


ACCAACCTAT 


7440 


GTACAAGTAA 


CTGATAAAAG 


AGGaACGTTA GCAGGATGGA 


AAL 1 AAbbL 1 


TTCCCAACCT 


7500 


GAACAATTTA 


AaACAGCTAC 


TGGAGAAGAA CTAGTCGGCG 


CTCAATTGAC 


r*TTTAPAAAf^ 


7 560 


GCAGAAGCTG 


CCTCAATGGT 


GGATGAAAAA TATAAACCTA 


CTGAAGTCAG 


TAGTACTATT 


7620 


TCCTTAACAC 


CTGGTGTAAA 


TAATAATCTA GCGATGAACG 


CTAAAAAAGA 


AACAGGTGTT 


7680 


GGGACTTGGG 


TTTATCGCTT 


TGGTTCAAAT GAAGCAACAA 


ATAAAGAAGC 


TGTCCAACTA 


7740 

i 


TTCGTTCCCG 


GAAAATCTGT 


AAAGCTTGCA CAACAATATT 


CAACGAAGTT 


AGTTTGGGCA 


7800 



TTAGAAGATA 


CTCCCGCTAA 


TAATTAAATA 


AAGGAGAAAA 


GCAATGAAAA 


AACAATTCAC 


7860 


AGGAATCGTT 


CTATGCAGTA 


CATTTGCAGT 


TGCTGGTTGG 


TCCTTACAAG 


CGAAGGCAGA 


7920 


GGAGCCAAAA 


GAGTATCGTT 


CGAATGGTTT 


AGTAGAATTT 


ATTCCGAACG 


TAGATCCGAC 


7980 


AGAGCCGGTT 


GATCCAGAAA 


ATCCCGATCC 


AGAAAAGCCA 


GTCAAACCAA 


TTGATCCGAC 


8040 


AGACCCAGAA 


GGTCCCAATC 


CTGGTACCCA 


AGGACCGCTG 


TCTATTGATT 


ATGCTTCAAG 


8100 


TTTTGACTTT 


GGGAAAAATC 


GTATTTCGAA 


TAAAGACCAA 


GTATATTTTG 


cAAGAGCACA 


8160 


GCAATATCAA GAAAATCAAA AAGAAACACC AAATTTTGTG CAAATTTCTG ATAATAGAGG 8220

AACCAATAGT GGTTGGTCTT TAACGGTTAC GCAAAAGGAG CAATTTAAAG CAACAAAGGC 8280

GACTTTAAAT AGTCAGTTGA CAGGTGCGCA AATTTCTTTA GCTAACCCGA CCGTTAATAG 8340

TAATGCGCAA AATGTTGTGA AACCAGAGGC AACAAATAAG ATTGCCCTTG TTCCAGGAAC 8400

CGCATCGCTA GTGGCAGCTG CGArCAAGGA ACCGGTGCAG GAACCTGGGC TACTTATTGG 8460

GGCAAAGTAG AAGTTGTTGC AGAACGCGAT GAAACCAATA CGGTACACAA TGTTAATGTT 8520

ACAAAAGATG TTGCGCTATC AGTACCAGGG TCAACCCCTA AAGATGCCGT GAAATATCAA 8580

ACAAAATTAT TGTGGACGTT GACGGATGTA CCAGGAATTT AAAAAGTAGA GGTGAAGAGA 8640

GATGCGAAAC AAACAAAGCC TAGTCTTTTT GGCAAGTAGT TTATTCTTAC TAATCCTTGC 8700

GCCTATTACT AGCGAaGCGA TTAGTAATCC TTCGACAAAT ATCAACGAAG TCAATCGGGA 8760

AATAGTGGAG GAATCGGCCA ATGATCAGGC AACTAGCGCT CCTTCTCACT CGGAAACATT 8820

AGAGCAAGAA CCTTCCCTAG AAACGACAAC AACAACCGAT AATCATCCAG CAGAAACAGA 8880

CCAATCAACG GCTACTTTAG AACCTAGCGA ACAACAAAAG GATGACGGTA CCAGTGCAAC 8940

GGAATCGGTG ATAGTTGAAC CACAGCCGCG CATGTTGgCT GCGCCATACG CTACTTCATT 9000

GCCGGATGAT CCTAACATCA TTCCAATCGA CAAGGTGTTT CAAGAGCCAA TTGGAAATGC 9060


AACAAGTATT TTAGAAGGTG GGAAGTTATT ACAACTAAAC CCAGCTGTTA AATCACAAAA 9120

AGGCGCTATT TGGTCAAAAA AACCAATCAG TTTATTGTCG GATTTTACCT TTAAAAGTTA 9180

TTTGTATTTA GGAAACGAGT ATGCAAATGC TGGTGATGGT ATGACCTTTA CCTTAACGAA 9240

TGATCCTAGA ATGTCCACAA CGCCTCAAGA AGTCATTGGT TCACCTGGGA TGGGAATCGG 9300

TGCATACTCC ACGAAAGCTG GTCAACCTTA TGTTAGAAAT GCCTTATCGA TTGAGTTTGA 9360

TACCTATAAA AATACTGGTA GTTCCAACCG AATGGACCGA GAAATTTCAC AAGATAAGGG 9420

AAATGGACAT TTAGCTTTTG TCACTCCGAA AGCTAATAAC AACAACTATA CAGGGGAGCA 9480

TTCAGGCGTC ACTGTTGCGC CAACCTATCT ATCTAATGGA ACCTGGCGCA TGTTGACTGT 9540

TCATTGGAAT GCCGCAACCA AAGCATTAAC ATATGATTTA GAAGGTGTGG GCACGAATAC 9600

GTATGTAGTT TCAGATTTAA ATGCACAATT TGGCGCAACA ACGGTTTATT GGGGCTTTAC 9660

ATCATCCACA GGTGGCAAAT ACCAAGAGAA TGCTTTAGCA ATGACACAAA TTCCTACCAA 9720

CGTTACGTCA CAAGCAGCCC TTAGCGTGAA TGGTCAGGAA TTTAGCTCAG CTGTTGAAGC 9780

AGTGAAAAAT GATCAGGTGA GACTACGGAA TACATTGAAT ATTGACAATG ACTTTATCGA 9840

AGATCGTCAA CCGCAAGTAA GCATTGACTT ACCTGATGAA TTAGCATATG AAGAAAATTC 9900


ATTAACGATT GATGGTAAAA AGGTAGCAGC CAAAGATCTG ACACAAACAG GGAATCACTT 9960

AACAATTGCT TTAAATGACT ACCTAGTTTT AAAGAAGGAC ATGATGATTG AATTAAAAAC 10020

AACCCTTCAA GATAATACGC CCGAAAAAGT TTTAACTATG AACTTTGACT ATTATGAAGA 10080

AGGAACCTTA CTGCAAAAGT CAAATAATGT AACGATTACT ATTCCTAAAC CAACAGAAAA 10140

AACAGTAACC GTTTTTTATA AAGACGCAGA AGGTAAAGAT ATTGCTCCAC CTAAAAGTGT 10200

AACAGGGAAA ATTGGTGCAC CCTATCAAGA AAAACCCCTA GATATTTCTG GCTATGTTTT 10260

TACAAAAGAC TCGGGCAATG CGGAAGGAAC AATGACAGAG GAAACAAAAG ATATTTATTT 10320

TTACTATCGT TTAGGGGAAC TATATTTTAA AGAGGCACCA AAACAGATTA CTTTTGGTAC 10380

GGAAAAGATA CGTAATCAAC CACTAATTAA ACTTGGGCAT CCCACCGAAG gACTCAAAGT 10440

CGTGGATGAA CGTAACGCAA ATAATTGGCG CCTGCAACTA AAGCAAACGC AACCGTTAAC 10500

AAATGATTCA ATTGTAATGC CAGATGTTTT TTCTTTCGTT TCAACTGCTG GAAGCTCTCA 10560

GATTACAAAT GAGGCAATTA CGCTTTTAGA AAGTAATCAG AAAGGTGAAA CAGATTTAAC 10620

AGCATTACTT GATGAATCAA AACAACAAGG AATTCAAATC AATGTTCCTG TAGCTTATCA 10680


AAGAGTTGGA ACATTTAAAG CTCGTTTATC TTGGGCGTTA GAAGATGTGC CAGGAAATTa 10740

AAGAAAAGAA TGGAGAAAAG aAAATGaAAA AAGTAACAAC AGTATGTTTA TCCACATTTA 10800

TTTTAGGGAG CTTAGGCATG ACTTCTCAAG CCTTTGCAGA AGATGGCGGT AGTTATACAT 10860

CTAATGGTAT CGTAGAATTT ACACCAAGTG AAGAGCCAAC TAATCCGGTT GACCCAACGG 10920

ATCCAACTAA TCCAGTTGAT CCTATTGATC CAACAGATCC TGAAGGTCCT AATCCAGGAA 10980

CAAACGGTCC GTTGTCAATT GATTATGCGT CAAGTCTAGA CTTCGGTGTT CAAAAAATCA 11040

CGTCGAAGGA TCAGACTTAT TTTGCTGCTA CCCAAAAATA CAAGACGGTA GGCGCAACCG 11100

ATGAAGTCAA AGAAGGGCCT AACTATGTCC AAGTAACAGA TAACCGTGGA ACCGAAGCTG 11160

GCTGGTCATT AAAAGTGAAA CAAGAAGGAC AATTTAAATC AACTTCTGGT AAAGAATTGA 11220

CTGGGGCAGC TATCACGTTT AAAAACGGAA ATGTAGTGAC TGCTTCAGAT TCAGGCAAAC 11280

CAACAGGCCC AGCCACAATC ACGCTAAATT CAGATGGTAG TCAATCAGAT GTAATGAGTG 11340

CAGCGAAAGG AAACGGTGCA GGAACTTATT TATTTGATTG GGGAACAGAT GCAACTACAG 11400

CCGCTAAAAG TATTGAGTTA ACTGTTCCAG GTTCTACAAC CAAATACGCT GAAAAATACG 11460

CCACAAAATT AACATGGACG TTGACAGACG CTCCAGGAAA TTAAACGAGT TTTAAAAAGA 11520

AGAATAGCGG GGAACATAGT TACCTGCTAT TCACTACATA TAGAAAAGGA GTAAATAATT 11580

TGAACAGAAA AAAACAACGA TTTCTTTTTT TTCCGATACT ACTAATTCTT TTAGCGTTAG 11640

TACCTAGCAA TGGTTTTGCC AGTGAATTTA ATTTTGCGGT TACGCCTATT CCATCAGAAA 11700

AACAAGTAGA TAAAGAAAAA ACTTACTTTG ATTTATTACT AGCACCAAAT GAAGAAACTG 11760

AGTTGAAAGT GAACTTGCGC AATGATACAG ATAAAGAAGT AAAGGTAGGT ATTTCCATCA 11820

ATAGTGCCAT TACCAATTCA AATGTAATCG TGGAGTATGG TGAAAATAAG GGGAAAAAAG 11880

ATCAAAGTTT AGCGTTTGAT ATAAAAGACT ACGTTCAATA CCCTAGCTCA GTTCGTTTAA 11940

AACCGAAAAG TGAACAAACG GTTTCTATTA ACGTCAAAAT GCCTAATACA CCCTTTGATG 12000

GAGTGTTAGC TAGTGGTATC ACTTTTAAAG AAGAAACGTC GGATGAAGGC AAACGCCAAG 12060

ACGATAAGAG TCAGGGGTTA TCCATTAAAA ATGAATATTC GTATGTCGTT GCATTATTGA 12120

TGCAGCAAAA TAAAAAAAAG GTTGAGCCGA ATTTACTTTT GAAAAAAGTC TCACCAGGAC 12180

AGATTAATGC TAGAAATGTG ATTTTAGTCA ATCTACAAAA TGATCAAAAA ACATATATTA 12240

ACCAAGTTGC ATTTTCGGCA GAGATTACCA AAAAGGGACA TGAAGAAGTT TTATATAAAG 12300

AGGAAAAAGC AAACATGCAA ATTGCACCAA ATACCAATTT TTCTGTTCCC ATTGCGCTAA 12360

AAGGGCAACC GCTTAAGCCA GGAGATTACC ATTTGTCTAT GACCGTTGTT GGGAATAAAG 12420

ATGCTGCAGG CTCATTTAAA AAATCAATAA ACAATGAATC GATATCTTTT AGAAATCAAT 12480

GGCAATTTGA AAAAGACTTT ACAATTAATG GTGAAGTTGC CAAAGAATTA AATGAAAAAG 12540

ATGTCACCTT AAAAGAAAAT CATTCGAATT TATATTTGAT GATTGGCTTA CTACTATTAT 12600

TGATAGTGAT TTTAATCATT GCCTGGTTGA TTTGGCGAAA AAAGAAACAG GAAAAAAACG 12660

AAAGAGAAAT ATAAAGAGTA GTTAATTTTT GATTTGATCA TCGGCAACGT TTTTGTTTGT 12720

TTTTGCTTAT CAAACATGCA GTTTATTCAA CAAACCTAAC TCAAAAAAAG AAATAATAAG 12780

AAGGAGTATT GTTAAATATC GTCTGATACC ACTTTACTAG TTATATTTTT TGTTTTGTTA 12840

TGTAGCTAAA GTATTTGGCA ATATATGTGA GAAAAATGGA TAAATACAAA CATTTACTAT 12900 

TATGGAGTAC ATTGTTACTC TCGTTTTTTC AAGTAACCAA TGTGCAAGCA GCAGAAGAAC 12960 

GGAATCCAAT TTATAATCCG ACCAATCCAA CAGAAGAAAT TTATCCGATT GAGAATACTG 13020 

GCACACCAGA AAAAATGGAA GAAACAAATG AAAGTACTAT GACTAGCGAA ACGCAAGCGG 13080 

AAGAAAGTAG CGAAGACGTT AGTAAAAAAG AAGTGACAAC CAATAATCCA AAAACAAATG 13140

AAGAAAGGCC TTCTTCCAAG AAAAGTCAGC ATCAAGTATT TTTCACGGAT GAATTATTTG 13200 

ATGCTAGCCA TATCACCATT GTACCAAATA GCACAGGCGG TGCTGGAACG GAGGGGCAAA 13260 

ACACTTTTTA CTTGAATTTC ATACGGGAAT TAGTAGGAAC GGCTATTTAT GGGAGAAAGA 13320 

ACTGATAAAT GATGAATTAT GAAATTTTTT TAAATAAAGA AGATCAACGA AGTTATTCCT 13380 

TGCTAAGGCA TTTAGAAGAA AGTCCCACGC TATCAGGTAC TTTTATTGCA CTTCGCGAAG 13440 

AATTATCAAT GTCGAATTTT TTAGTCAAAA AAACATTAGA AAAATTAAAA GCAGATATTG 13500 

ACAATCTAAA GTTAAGCGAC AGCTTAACTT TAGTCGTTTC TGATGTAGAT GTTACACTTG 13560 

AAATTGATGG AAACTATTCA AGTAAACTTT TGCTCACGAA ATACTTAACA GAATCGCTCT 13620 

CTTTTAAATT AGTGTTGTCT TACTTTAGAG GAAATTATAG TTTGACAAAG TTTGCTGAAT 13680

GTAATCATGT CAGCACCTCT GTGGCGTATA GTACCTACAG CGGCTAAAAA AAGCCCTTAA 13740 

AGGAGTATCC AGATCTATTT TAGTAAaCGG GAAATTGTCG GCAATCAAAA AGCAATTTCA 13800 

TTTTTTTTGT ATAAATTGTT CACGCTTTCC AATCAACCAA TTTCCGAATT ATATTCGGTA 13860 

AAAGTCTATA ATGAAGCGAA ACGTGTATTA CAGTCTGTTG AGTTAAACTA TACTTTCACC 13920 

ACCTATGAGC GACGCAATTT TTTCCATTAT TTAGCTATTA TGATTAACAA CGAAGGACGT 13980 

GCGGTGGAAG GGATCGATAC GCGTGCACTG AATACATTTT CAGAGGAACT AATAAAAAAA 14040

TCGCAAGTGG CATGGGCATT GGCTTCAAAa TCATTAACGT ATACGATTGt TTTTTTCTTG 14100

TATTTGCATG GCAAACTAGA GAAAAAATAT GTCATTCATG AAGATCCTAC GATTGAAAGC 14160

CTAACTCGTG TTTTCATTGG AAGTTTTGAA AAAGCATTTA ATTGTTTAGA GGAGTCAACA 14220

AGGAATACTC TAGAAGAAGG GTTAGCAATT ATTCATTTCA ATGTTATTTA CTTTCCGATT 14280

AATATGTTTG ACGATTTTGA AATGGATTTA CAGTTTTTTA AGCAAACATA TCCGGAATTT 14340

TATTTTTATT TAATTGAATA TATTCGTTGG TTAACAGCAA AACATAAAAA AATTGCGAAA 14400

GCCAATCACT GTTTGTTTTT TAACTATCTA TTGTTGCTAA TTAATCATGT TCCGGTTCAT 14460

TTAATTGCAG AGCCAGCCAA GGTCCTAATT GATTTTTCTT ATGGAAAAGA GTATAACCAA 14520

TTCATCAAAA aAAATTTATC CGTCTATGTG AATTTAAATG TAGAAATTAT TGACCCGTTA 14580

TCAGATACCT TACCAGATGT GGTGATTACC AATTTAAATA ACTTGTATCA AGAAGAACAG 14640

AGTAAAGTTA TGGTTTGGCT GGATCCACCT CGTTCGATTG ATTGGGTCAA TTTAACGCAA 14700

TCATTATTGA CTATTCAAGA AGAAAAATAT CaACAACAAA AAGAATCGAC AAAGACAAGC 14760

GGTGATCCTA TCGAATAGnA TCATCGCTTG TCTTTGTGTT TAAGTCTTTA CAAAACCACC 14820

CAAGTGTTTT ATAATATGAA TATATATTCA TATGAATAGA AAAACATTAA TGAGGAGGCG 14880

CATAACAATG TCAAAACAAA CAGGTGCACA CACACATGAA GAAAAAGGTA AAAACGTTCC 14940

AGTTATTCTC TTTTTTACAG GATTGGCTCT TTTCTTTATC GGTTTATTTT TAGGGAACAT 15000

GCTCTTAGTG AAAAATATTC TTTTCTCATT AGCTGCTATT TTAGCTGGTT ATCATATTAT 15060

TGGGGAAGGC TTTGGAGATA CGTATCGGGA TACGAAAAAC AATCGAAAGT TTTCGCCTAA 15120

TATTCATTTG TTAATGACCT TAGCTGCAGT TGGTTCAGCA TTAATGGGAA GTTTTGAAGA 15180

ATCAGCATTG TTGATTTTGA TTTTTGCGGC GGCGCATTTC CTTGAAGATT ATGCCCAAGG 15240

CAAAAGTCAA CGTGAAATTA CAAAACTATT AAATTTAAAT CCAACAGAAG CTCGCCTGAT 15300

AACAGACGAC GGCTCAATTC AAACTGTGTC AGTTGAACAA TTGAAGATTG GTGACCGCGT 15360

GCAAGTCCTC AATGGTGCGC AAATTCCGAC AGATGGCGTC GTGATTGAAG GAAGTACCGC 15420

TGTGGATGAA TCATCTATTA ATGGGGAAAG TATTCCCAAA GAAAAAAATA GTGGGGACCC 15480

AGTTTTTGGT AGTACCATGA ACGGTTCAGG CACAATTGTT GTTGAAGTAA CGAAAGATAG 15540


TTCAGAAACA GTTTTTGCAA AAATCGTACA GCTCGTGAAC CAATCACAAG AAAATCAATC 15600

AGAAATTGCA AGCAAAATTA AACGATTTGA ACCGAAATAT GTCACACTAG TTTTAGCAGT 15660

TTTTCCACTA ATTGTTTTAG GTGGCGCATT GCTTTTCCAA TTGACTTGGG CTGAAAGTTT 15720

TTACAGAGGC TTGGTCTTCT TAATTGCTGC CTCCCCTTGT GCATTAGCAG CGAGTGCCGT 15780

TCCGGCAACT TTATCAGGAA TTTCCAATTT AGCAAAACAA GGCGTTTTGT TTAAAGGTGG 15840

CTCATTCTTA TCAAATTTAG CTGAAGTGAA AGCACTCGCT TTTGATAAAA CAGGCACGCT 15900

AACAAAAGGC AAACCTGAGG TCACAGATTA TCTTTTTGTT GATGGCTTAG AAGATCGCCA 15960

AGAAGAATTA GTAGCTGTTT TAACAAATAT GGAGAAAAAA TCCAATCATC CATTGGCGAC 16020

GGCGATTGTT AATCGCTTTG AAGCAGAAAC AACGGCTTTA AATTTAGAAG TAGAAAATAT 16080

TGTCGGGGTG GGTTTAGTAA CGACCATTGC AGAAACAACT TTTAGAATTG GTAAACCATC 16140

TAGTTTCGAA CAAGTGCCAA CAATCATTGA GAAACAAACA ACGAAATTAG CAAGTGAAGG 16200

CAAAACGGTA GTGTACTTTG CCGAAAATGA ACAAGTGATT GGTTTGGTGG CATTGATGGA 16260

TGTTCCTAAT GAAGAAGCAA TGAACGCCAT TCACTATTTC AAGTCACAAA ATATTGAAAC 16320

AACAATGATT ACAGGAGATG CAAAACTTAC GGGTGAAGCA GTTGGTCGCT TAGTAGGCGT 16380

GGACCAAGTT TTTGCTAATG TTTTACCTGA GGAAAAATCA GCGATTGTTG ATCAATTAAA 16440


ACGCGAAgTA GGAATGACAG GGATGGTTGG TGATGGCATT AATGATGCAC CGGCTTTAGT 16500

GAATGCTGAT ATCGGAGTGG CGATGGGGGA TGGTACAGAC ATTGCTATCG ATGTTGCAGA 16560

CGTGGTAGTG ATGAAAAATG ATCTTTCTAA ATTAGGCTAT GCTCACCGTG TCTCAAAACG 16620

CTTGAATAAA ATTGTGCAGC AAAATATTAT TTTTTCAATG TTAGTGGTAG CAACGTTAAT 16680

TATCTTGAAC TTTTTAGGTA TTGCGAATAT CGCATTTAGC GTACTTATTC ATGAAGGAAG 16740

TACCTTAGTG GTCATTTTTA ATGGCTTGCG TTTATTAGTT AATACAAAAT AAGTCAAATA 16800

GTCACTTGCA AGAAAGGAGA GAAAGCGATG AATTCTTTAA AAATGATTGA ACTTTCTCTC 16860

ACCGACGTTG TCATTCGGAT GGCTCTAGCC GTGGTTTTTG GAGCTTTGAT TGGGTTAGAA 16920

AGAAGTATTA AAAGAAAAGG CTTCGGTATC AGTAGTAATG CCATTTTATG TTTGGCCTCC 16980

TGTACGATTT CCATTTTACA GATTCAATCG GTGGATATTT TAGTCGATGT GGTGAAACAA 17040

AATAGCGCCT TGGCGTCCAT TATTTCTATG GATATTACAC GATATGGCGC CCAAGTGATT 17100

AGTGGTGTAG GTTTTCTCGG TGCGGGGATT ATCGTTTTTA GAGAAAGGAA AGTCTCTGGG 17160

TTAACGACAG CAGTAATGAT GTGGAATGTT ACTATTATVTG GTTTAGTCAT TGGGATGGGC 17220

TTTTTAACCA TTGCGTTTAT CAATCTCGTG GCAACTTTGC TAGTATTAGC TCTGGCCCAT 17280

TTTAAACGAA AGACTCCTTT AAAACGATTG CAGCTTATCG TAAAAATGAA GGAACCTATG 17340


ATTAAAGAAA GCCGACTGCA TGAACAACTT CAAGCTTCTT TAGAAAAAGG GGAGCGCTTA 17400

GTAGAATTGA CTTTGGAATC CGAACAAGAG GCAGTGACAG CAAAGATTTC TTATGTAGGC 17460

AAAGAAGATT TCTGGCATTC CAACTTGCAC CATCTCTTAA AAGAAAAAGG GCACGTTCAA 17520

TCTGTCTCTG TCACGGAAAC AATTTAACCA TAAAATAAAG CACTTGGGcg CTAGCGCGCG 17580

CCAAGTGCTT TTGTTTAAAT ATTTAATACT TTATCTAAGA AATCTTTTGT TCGAGGGTTT 17640

TGAGGATTTT CAAAGACTTG CTCTGGCGAA CCATCTTCTA GGAAGTTACC GCCATCAATG 17700

AACATTACTC GATTAGCCAC TTCTTTTGCA AAGCCCATTT CGTGAGTAAC GATGACCATT 17760

GTCATTCCTT GTTTTGCTAA ATCTTTCATT ACACCAAGAA CATCGCCAAC CATTTCTGGA 17820

TCAAGGGCTG ATGTTGGCTC ATCGAAAAGC ATAATATCAG GATTCATCGc tAACGCACGT 17880

GCAATTGCGA CACGTTGTTT TTGACCACCA GATAAAGAAT CTGGATAGGC ATCTTTTTTA 17940

TCTGCCAACC CAACACGTTC TAAAAGTTCT AACGCTTTTT TCTCCGCTTC GTCTTTkGAT 18000

AAGCGACCTA AATCGGTTGG TGCTAAAACA ATATTTyCTA AAATgATAAA TGTGGAAACA 18060

AATTGAAATG TTGGAAAACC ATGCCGATAT GTTGACGGAC TTGGTTGATG TCCGTATTTT 18120

TATCTGTCAA GTTCGCACCA TCAATAATAA TGTCTCCGCT AGAAGCTTCT TCTAATTGAT 18180

TCAAGCAACG TAAAAAAGTA CTTTTACCAG AACCAGAAGG ACCGATGACG CAGaCTACGT 18240


CGCCTTCGTT AATAGAAACA TTAATGTCGT TTAAGACCGT GTTGTCACCA TATTTTTTTA 18300

CTAAATGTTC AACTAAGATT TTTTCAGTCA TATTAGTTCA CCTTCTTTTC AAGGTTTTTC 18360

GCTAATTTAG TTAAGCTAGT AATTAAAATT AAGTAGATTA ACGCAATAAC GAAtAAACCA 18420

TTGTACTTTG TAATGTCCGA GCAACAATAA TTTTTCCTGT TTGTAGTAGT TCGATTAAGC 18480

CAATTGCTGA AAGGATAGTT GTATCTTTCA ATGAAATAAC AAATTGGTTA ATGAATGAAG 18540

GAATCATAAT GCGAATCGCT TGTGGTAAAA TAATTTTTTG CATTGTCCGA TTATAGGAAA 18600

GACCTAAACT TCTTGAAGCT TCCATTTGGC CAACTGGAAC AGCTTTGATT CCGCCGcGCA 18660

CAATTTCAGA AATATAGGCA CTGGCATTTA AAGTTAATGT GATAATCCCA GCAATGAAGA 18720

CCGGGATATT AAAGCCAAGC ACACCAGGTA ATCCGAAATA GATGAAGAAC GCTAAGACCA 18780

TTAAAGGAAT TCCACGAATT AAATCAACAT AAATGGTTGA AAGTACACGT AATGCTTTGA 18840

TTGGTGAAAC GCTGAATAGC CCAAAAATAA CGCCGACAAT TAAAGCAAGT ACGAAGGAAA 18900

TCAACGTTAA TAGAATCGTC ATCCATAAAC CGTGTAACAA TTGTTTCCAG TTGTTTTGCA 18960

TGAAGCCAAC AAACGTAGAT TCGTCAGCTG TTTCTTCTTT GGCTTCGTTG CCATCTTTCA 19020

CGTAAGTGCC GATAATTTTA TCGTATTCAC CTGTACGTTT CATTTCTTTT AAGCCTTCGT 19080

TAAACATTTC AAGTAGTTCA GGATTTTGCC CTTTTTTGAC AGCAAAACCG TAAGAACCAC 19140


CTTTTTCACG AGGAATTGGT GTCGCTAGTG GTTGATTTTG CGCAACCCCG TAGCCAATGA 19200

CAGGATAATC ATCCATCATG GCATCAACTT CGCCAATTTC TAATGCACTG TATAATGCAT 19260

CAGTTGTGTC TAAATACTTA ATTGAGTAAT CATATTTCTT TTTATTTTTT TCTAAGAAAT 19320

CAGCACTTTC AGTACCAATT TTTACGCCGA CTTTTTTGCC TTTTAAATCA TCATACGATT 19380

TAATTTTGTC ATTGCCTTTT TTGACCGCAA TTTGAATACC GCTATCAAAG TAAGGAACAG 19440

AGAAGTCGAA GGCTTTTTTA CGATCATCCG TAATCGTCAT GCCTGCAACC ATCCCATCTG 19500

CTTGACCAGA TTCAACAGCT TGGACAGCAG AACTAAAACC AATAAATTTA AATTCAACAG 19560

TAAAACCTTG CAATTCAGCT GCACGTTTGA CTAAATCTAC ATCAATTCCG ACATAATCGC 19620

CTTGGGCATT TTGAAACTCA AACGGAGCAA ATGTAGAATC ACTAGCAATC ACATATTTCT 19680

CTTTTTTAGG AGTGATTTTT TTCATCTGTT CGCCAGCGTC TTGTGTGTTT GTTTCATCGC 19740

CTGTCGCTAA GTAGTTGTTT AAAATTTTAT CATAAGTACC ATTGTCTTTA AGATTTTTCA 19800

AACCAGCGTT GAACTTTTTG ATTAATTCAG GGTTTTGACC TTTTTTCACT GCGAAACCAT 19860

AAGAACTGCC TGTtTCTTTA TCGCCAACTA ATTGTAATTT TTGGCCATTT TTCACTGCGT 19920

AnCCAATACA GGATAATCAT CAACGATTGC ATCCGcTTCG CCGTTTTCTA AGGCTTTATA 19980

AAGACCCGTT GCATCGTCAA AATTTTTGAT GGTGTAGTCA TATTTCTCTT TATTTTTTTC 20040


AAGGAAATTG GCACTTTCAG TTCCAACTTT AGCCGCAACC GTTTTGCCTT TTAAATCATC 20100

ATATGATTTA ATTTTGTCGT TTCCTTTTTT AACCGCTAAT TGTAAACCGC TATCGAAGTA 20160

TGGATCTGAA AAATCAAAGG ATTTTTTTCG CTCATCCGTA ATACTCATTC CAGCAATCAT 20220

GCCATCAATT TgTTTAGATT GAAtCgyTTG GACTGCACTA TCGAAGCCTA ATGGTTTTAA 20280

ATCCACTTCA AAATCTTGAT CTTTGGCAAT CGCATCTAAT AAATCAACGT CAATACCAAT 20340

GTATTTTCCT TTTGAGTCTT GGAATTCAAA AGGGGCGAAT GTTAAATCCG TTCCAATTGT 20400

ATATTTCTTT CCTTCTGgCA TGTGgCTGgA GTGGCCACTA ATGGAATGTT AGCAGAGTTA 20460

CCATCATCAC AAnGAAAGAA AGTTAAAAGT GTTTCCTTTT CCATAAAATA CCTCCTAGTT 20520

TTGGCATATG GAACCATTTA TATnGTTAGG nnATAGGGAG C 20561


(2) INFORMATION FOR SEQ ID NO: 293:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1836 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

AACTCTGTAT CACCAATTTT TGTATAAAGC ATGGATTCAG CCCCTAATAA TTCAGAAACA 60 

ACGACTTCAG ATTTAATCAC CGCATCTTTC ATTGCTGTTA AAGCCACTTG TTCGCTATGA 120 

ATATCTTCTG GACGAATACC GAAAATTAAT TCTTTGCCTT CATAACCATG TTCAACAAGC 180 

TCTTTGTTTC TTCCTTCAGG AATTGCTAGT TTTAAGCCAT GTTGGTTAGA AATAACCCCA 240 

TTGTTTAAGG TAACTTTGAA GAAATTCATG GCAGGTGAGC CGATGAACCC AGCTACGAAC 300 

ATATTTTTAG GTGTATTGTA AACTTCTTTT GGTGAACCAA TTTGTTGAAT AAAGCCATCT 360 

TTCATAATTA CGATTCGGTC AGCCATGGTC ATCGCTTCGG TTkGGkCATG TGTCACATAA 420 

ATTGTCGTAG TTTCTAAACG TTGGTGTAGT TTAGCGATTT CAGCACGCAT GGCTACACGT 480

AATTTTGCAT CCAAGTTTGA TAAAGGTTCA TCCATTAAAA AGACTTTGGC ATCACGGACA 540 

ATTGCCCGGC CAAGTGCAcA CGTTGGCGTT GTCCACCAGA AAGCGCCGCA GtTTACGTTG 600 

TAAGTACTCT GTTAAGCCCA GAATATCCGC AGCATTTTCC ACCCGTTTTT TAATTTCAGC 660 

TTTATCATAT TTTCTTAGTT TTAAACCAAA GGCCATGTTA TCAAAAACAG TCATATGCGG 720 

GTAAAGCGCA TAGTTTTGGA AAACCATGGC AATATCACGA TCTTTCGGTG CAACGTCATT 780 

CATGACTTTG TCCCCAATTG ATAATTCACC TTCAGTAATA TCTTCTAAGC CAGCAATCAT 840

GCGCAATGTT GTTGATTTAC CGCAACCAGA AGGTCCGACA AAGACGATAA ATTCACGGTC 900 

TGTAATTTCT AAGTTAAAAT CTGTAACAGA ATAATGTTCT GCATTGTCAT ATTTTTTATA 960 

CACATTTCTA AGCGCCATTT CCACCATTTG ATACGTCACT CTTTTCTTTT TGATTTGTTA 1020 

CTCATAGTTT AAATGAAAAC GCTATCTTTC GGCAAGGGAG AGTTGCACAA GAAAAAGAAG 1080 

ACTTTTTCGT CAAGGTGACA AAGGGCAAAC ACTCTCAGCG GATTTATCGA TAAACTTTTG 1140

TTATAATATA ACAAAGACAG GAGTTGGATA ACATGAAAAT CGCTTTAATC GCTCaTGATC 1200 

GTAAAAAAAC ATTAATGATC AAACTAGCAA CCGCTTATAA ACATATTTTA GAAAAACATG 1260 

AATTGTATGC AACGGGAACG ACAGGCATGA AAGTGATGGA AGCAACTGGC TTACCCGTGC 1320 

ATTGCTTTAA ATCGGGGCCA TTAGGTGGGG ATraAACAGAT TGGCGCAATG ATATCAGAAG 1380 

ACAACATTGA TTTAGTTATT TTTTTAAGAG ATCCTCTTTC CGCGCAaCCG CATGAACCAG 1440

ACGTGACGGC TTTGATTCGT CTAAGTGATG TCTATGAAAT TCCTTTAGCG ACCAATATTG 1500 

GCAGTGCTGA AATTCTTTTG AGAGGTGTCG AGGCGGGGTT TGCGGACTTC CGTGAGGTTA 1560 

TCCATGAAGG GGATCGTCGA CCGTTAGCTT TTTAGAGAAC CTTAATGGTT ATTTTGACTA 1620 

TTTTGTGGTA TATTTTATGA GGACTTTTGT CCAAATAAAA TAACAGAGTA GGAGACAAAG 1680 

CGTTAAGTGC TAAAAGGATG GGATGTTGCC TTTTGGACGA AGAACATAAA CAGTGTGTTG 1740 
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CTAGAAAGCA AAGGTTCTTA CTTTGCTTAT CATCAGAACA ACACGnTCAG CTTTTTAATG 1800 
TTCGCGGTTT TGTTTnTCGC ATTCGCTGCA TGGTTC 18 36 

(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 



TATTTTATTG ACAAAGAAGG GGACGAGTAC AAAGAAGTGT TTGATCGATT GCATGTTTTT 60

TCAGCTGAAG AATTTGAAAG TGAATTGAAA CGATTACTAG AAAAAGATAA ATTCATTAAT 120

TGGGAGTGAG TTTCATGAGA AGAATAGAGA AAGAATTCAA CAAGAAGCTA GCCAGTTACG 180

AAAGAGAGTT AAAAAAATTA GGATGTCTGG ATGATGAAAC AGGGCTTATT CCCATAAGCA 240

AAAGACGATG GCATGTCATT TGGTGGCAAC CTGTTACACT TGCAAAAACA ATCGTGAGAT 300

CTTTTCGATT AACGCTAGAT AATGAAAACT TATGTATTTT AGGAGATGTA GAAATCACTA 360

TTTATCATGA TGGAACATAT GGTATTTCTA AAGAAGCTGT CCCAATTTTT ATTAATGATT 420

TATTGTCTTT AAAAAAGTTA ATTACTATTT TCTATGGAAC ACCTTTTAAC TTAAATTTTG 480

AAAAAATTCG CTGCGTTAAT TTTAACAGGT ATTGTGTCTC TATTCCTGAA ATTTATGTCG 540

AGAAATTTGA AGTATTAATA AACTACTTGA TAATTTTAAG TAGCTGCTTA CATGAAGTGA 600

AAAAGCACGT AGAATATGAT TAACAAGCTT TTCTTCTTTC AGAAGAACAC CAATAGATTG 660

GTTGAGTTCG CATACGCTCA CCTAAGAGAG GAAAAGAACG AAGTCAAGCT TCGTTCTTTT 720

TTCTACGTCG GAGAACCGAC GTAGTGATTC AGACGCAAAA ACCATTCGTC TGAATCAGGC 780

CCGTTTTTAG GAAGGGCTAG TGTGTTTGAT TTATCAGTTC TCACAGCCGT GTAATTTCTA 840

AGATTCCCTA ATCGTCTATC AGTCACTCGG CGAGTATAAC CTAAAACATA ACCTGTCGGA 900

GGTCAAGGGC AAGTGGCATC GGCAAGCCGA CCCTTGTCCT CCGACAGAAC ATGTTTTCTA 960

GTCGGTTATG CCGAACGACC GATATACGAA AAGGGAATCT AACCAGAATT TCCCCTCTGA 1020

CCTTTGAAAA CTGAAAAGCG AAATCAAACA AACAGGCCTT CGTCTAAAAA CGAGCCTAAT 1080

CAAACCTTAG GGGTTGCACC CCTAACACCC GGCGCCTCAC TCTCTCAACC GAAAAGGTTG 1140

GTCGGAAAAG CGTTGACAAA AACATAGTTA TTTATTGGTT GTTTGAAAAT AAAAATTGAA 1200


AGAAGGAAAG CAAAATGCTA ACAACAAAAG AAAAAAACAG ATTAAAAAAA ATGGTAGAAG 1260

GAAACAAAAC GTTTCATTAT TCTTATGTCG ATAGACTTAG ACAAGATGTA AGATATTACG 1320

TTAATCAATG TGAATCAGCT GTTAAAGCTA GAGAAAGCAT GGAAATCTTA GAATTTATCT 1380

ACAGTCTATT TTCTGATAAA GAGATACCAG CATGGTACAC AAAAGCAGAC CTTGAAAATG 1440

ATAAAAAATC AATTGAAAAA TTGGAACGCT GGGCAGCTTA GCTGTTCAGC AGCAATCAGA 1500

AAGGGCGATT AGCATGAAAT TTATCAAAGA TGTAACGACA TTAGAAGAAT TGAAACGCGT 1560

ATATAAAAAG CTAGCTTTAA AGTACCACCC AGACATGGGC GGTACTGATA AAGAAATGGC 1620

GCAAATAAAT AACGAGTATG AAGCGTTGTT CGATAAGCTT AAAAATACAC ACAAAAATAA 1680

AGAAGGTGAG TACTACCAAA AAGAAACGAC AGAAACACCG CAAGAATGGC AAGAAATCAT 1740

ACACAAGTTG TTAACACTAA AAATGGAAAA TGTTTCAATT GAAGTGATAG GCGCATTCTT 1800

ATGGGTATCA GGGAATACAA AGCCTTATAA AGAAGAGTTG AAAGCTTTAA ATATGAAATG 1860

GAGCAACAAC AAAAAGTCCT GGTATTTAGC CCCCGATGGA TATAAAAAAC GCAGTAAGAA 1920

AAAATACAAT ATGAGCGATA TTCGCAATAT GTATGGTTCG CAGGTAGTTA AAGAAGAACA 1980

AGCCAAACAA AAGACATTAA CAGCAACAGC CTAAGGAGGA AGTACAATGG AACAACAACA 2040

CCCGACTATT CACACATTAA AAATTGAAAC CGAGTTTTTC AAGGCAGTAA AAGAACGGCG 2100

AAAAACATTT GAGATAAGAA AGAACGATCG TAATTTTCAA GTAGGGGATA TTTTAATTCT 2160

CGAAGAATAT ATGAATGGCA TGTATTTAGA TGATGAATGC GAAGCGGAaG TTATTTATAT 2220

AACGGATTAT GCACAACGTG AAGGTTATGT GGTGCTAGGA ATTGAATTAC ATTAGTAAAA 2280

AATTAGATTA TACGTCCAAA GAGGTTTTAC CTCTTTGGAC yaCCAAATAA AAAGAAAGAA 2340

GGAATCAAGA ATGATTAAAA CAAGTGCAGT AGGAAGATTA ACAAAAAGTT CAGAACTAAG 2400


AGTAACGTCA ACAGGGAAGG CCGTTGCCAA TTTTGCATTA GCTTGCAAAA GAGAAAGACT 2460

TGATAAAGAA GGGAATGCCC AAACGACTTT TATTCAATGT GTAATCTGGG GGAAACCTGC 2520

AGAGGTGTTA GCAAAATATA CGCAAAAAGG TTCATTAATT GCAGTAAATG GTGAATTGCA 2580

ATCACGTTCT TATGATGATC AACAAGGACA AACGCATTAT GTTACAGAAT TAGTCGTAGC 2640

AGGCTTTGAA TTTTTAGAAT CAAAAGAAAC TGTACAAAAT AGAGAACAAA AACAACAAAT 2700

GCCAGAAATG GCGAACGCCG AACCGTTTTA TGGTATGGAT GATATGGAAC CGCCTGCTTA 2760

TCATTAAAAT TTATCTGAAG GAGGGAGAAA ATGAAAAAGT TAATCGGTAA AAAGTGGCTG 2820

CTgCTTACAG CAGTAGCCAC TTTTTTATTA TCAGGATGCG CAAGTCTTGA ACAAAAAGCA 2880

CAGGATAGTG TAAAAGAAGT TACTGAAAAT GTTACTCAAA CTATTTCAAA CGATCAACGT 2940

ATACCAGCTG ATTTTGTTAG GCACGTGGAT GGCGATACCA CAGTATTAAA AATTGACGGA 3000

AAAGAACAAA AAGTTCGGTT TTTATTAATT GACACACCCG AGACTGTGAA ACCGAAAACA 3060

AAAGTTCAGC CGTTCGGATT GGAAGCTAGC AAACGCACAA AAGAGCTTTT GTCTACTGCT 3120

TCAGAAATTA CGTTTGAATA TGATAAGGGC GATAAAACAG ATCGTTACGG ACGAGCGTTG 3180

GGCTACATAT TCGTAGATGG AACATTACTA CAAAAAACGC TTGTAAGTGA AGGATTAGCT 3240

CGTGTTGCCT ATGTAAAAGA GCCTACAACT AAGTATTTGG CAGAACTAGA GCAAGCCCAA 3300

GAACAGGCTA AAAATGAGTC ACTCGGAATC TGGAGCATAC CAGGTTATGT GACACAACGG 3360

GGGTTTAGTA AATAAATAAG AAATCGCTAT GAATTATTGA AAGCTTTTGG CGAGAAATTA 3420

TAACACAATA TGTTGTCGAT AACAACATAT TGTGTTATAA TTATATACAT AAAGGAGGGG 3480

TAAAGGTGGC TACTACAAAA AAGAAACCTA TACATGTAAA TGTTGATGAA AATTTAAAAG 3540

AAGAAGCAGA ACAATTATTT GATGATTTAG GGTTAAATAT GACAAGTGCA ATTACGATTT 3600 

TCTTAAAACA GTCTATTAAT GAGCAAGCAA TTCCTTTTAT GATTAATAAG GGAAACAAAG 3660 

AGACTCTACA AGCATTAAAA GACATTAAAG AAGGAAATGT TCATGGTGGA TTTTCTTCCG 3720 

TGGAGGATTT AATGGAGGAT TTAAATGCTT GAAATATTTT ACACGAACCA ATTCAAAAAA 3780 

GACTTTAAGA AAGCAAAGAA ACAAGGAAAG AATTTAGAGA AGTTAAAAGA AGTATTAGTA 3840 

CTTTTACAAG AACAGCAAAC ATTGCCGCCA AAGTATAAAG ACCACGCACT AACAGGAAAT 3900

TATATAGGTA CAAGAGAGTG CCATATTGAA CCAGATTGGT TACTGATTTA TAAAATTGAC 3960

GGTGATAAAT TGATCCTAAC TTTAGCTCGA ATAGGCTCAC ATAGTGAATT ATTTCGATGA 4020

ATCGGAAAGA TATTACAAAT CAATAAAAAA CACTACTAaA CGACCAGACG AATAGTAGTG 4080

TTTTTTTGAT TATTCGATTG TACTAGTAAT TTTAACACAA TaAAAAATAG TCATGCAAGA 4140

CCAGGGCGCA ACGATCGGTT CTGTTGCAAA GTTTTAAATA AAGaATAAAA TCCCTTACGG 4200

TATCTATGAT TTAAGCTGGG ATTCCCAATA ATACCTTGAT TTCAGTACAG ACCGAAAACC 4260

CGAAGAGAGT ACCTTCTTTT CGGGTTTTCT TATATAATCC TCGAATGGCT TCCATGCCTT 4320

TAATCGTGGT AGAGGCAGTG CGTAAACTTC GATAGAATTT ATTGCGTCTC TTTACTGGAC 4380

GATGGTCTTG TTCAATCAAA TTATTCAGGT ATTTAATGGT ACGATGTTCT GTCCCTTGAT 4440

AAAAGCCGTA TTCTTTTAGT TTCTTAAAGG GCACTTGTAA TAGAGGGGGC TTTATCTGTG 4500

GACTACAACC TTCGGGTCAT CAAACTGCTT CACTAACCGC TTAAGAAAAG CATAGGCTGC 4560

CTGTGTGTCC CGTTTTTTAC GTAACCAAAT ATCCAAGGTn AAACCAT 4607
(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

TTCTCGCGGA TTTATTCGGA TTGAACACAA CAAACCCAAA AGGTCAAATG ATGGTTACGT 60 

ATGATTTTAC TGCCTGTTTT TCAAAAGATT CAATTATTTT TACAAACGAG CAAAGAAAAA 120 

CAAGTTGTCG CAAACCACGA AGATGAAATT AAACAGCTTT ACCAAGGCTT TGAAAAAGAA 180 

TTCGGACGTC CACTTTCACC AATTGAATTA GAAATGATTG GTCAGTGGTT GAACACGGAC 240

CATTATTCCC CTGAaTTAaT CCGGTTAGCG TTGCGAGAaG CTGTTTTAAA TCAGGCGTAT 300 

AGTTTGAAAT ATATTGAtCG TATTTTACTT GCGTGGGAGC GTAAAAATAT TACCACCAAA 360 

GAACAAGTTG CTGCTGATCA GAAAAAaCGA AAAGATTCTA TGATTCAAAA TGAGATTGAA 420

CAACAAGGTC AGACACAAGA ATCGTTGCCA AAGGTTACAT TACACAATTG GTTAAATCCA 480

GAAGACAGTG AATAAAAAGG AGGTGGGCCA TGCTATCAAA AGAGAAAACA ATGGAAGCTA 540

TTGAAATTAT GTATGAGATG TTTCCAAATG CAGAATGTGA ATTAAAACAT AAGAATCCTT 600

TTGAATTATT AATTGCTGTG ATTTTAAGTG CGCAAGCAAC CGATGTATCG GTAAATAAAG 660

CGACTCCTGG TTTATTCGCA GCTTTTCCAA CACCAGAAGC TTTAGCAGct GCGCCGGTGG 720

AGGAGATTAT TGCTAAAATA AAAACCATTG GCTTATATCG CAATAAAGCA AAAAATATTA 780

AAGCGTGCGC ACAACAACTT TTAGAACGTT TCAATGGAGA AGTTCCGCAA ACCAGAGATG 840

AACTGGTCAG TTTACCTGGC GTGGGGAGAA AAACAGCCAA TGTAGTAATG GGAGATGCTT 900

TCGGCGAACC GGCTTTTGCA GTAGATACAC ATGTGGAACG AGTGTCCAAG CGCCTTAGAA 960

TTTGTAAGCT AAATGCGAAT GTTACTGAAG TTGAGCAAAC CTTAATGCGT AAAGTTCCGA 1020 

AAGAACTCTG GGTGAAAACG CACCATACCA TGATTTTCTT TGGGCGTTAT CACTGCCTAG 1080 

CAAGAGCGCC TAAATGTGAA GCCTGTCCGC TACTCTATAT GTGTCAAGAA GGCAAAGAAC 1140

GGATGAAAGG GAAATAACAC ATTTTTTCCT AAAAAAGcGA GCCACAAGTC CAACCGACTT 1200

GTGGCTCGCT TTCTTTTTTT GATTTTTAAG AAACGCAACA AATGATTGCG GTTcTTAAAA 1260

ATTGATTTAG TATAGACAGT GTGGtATATA CyAgAAAATt TATTTTTTTG AAGAAAGATA 1320

GGCGATGAAA TGACAGAAAa TGCACCCAAA TTTAAGCmAA TTtCyGCAGA AaTTGAAAAG 1380

AAGATTCGAG ACGGTCTGTA TGTAAGTGCA CAAAAATTAC CTTCAGAATA TGATTTAGCT 1440

AAAGAATATA ACTGCAGTCG CTTGACCATC CGTAAAGCGA TTGATGATTT GATCCGCAAA 1500 

AATATTTTGG TAAAACGACA TGGTAAAGGT AGTTATGTGA TGTCGCAAGC GAAAATTCAA 1560 

AGTGGTCGCG CTGGCTTACA AGGTTTTACT GAGGCAGCCA AAGCTTACGG GAAAAAAAGC 1620 

CAGACAGAAG TCATTTCCTT TGAAGAAGTA GTACATCCCG CTGAGAAAAT TCGGGAGGCG 1680 

CTCCAAGTAG GCAAAAATGA GGCAATTTAT GAACTGATTC GCCGCCGAAT GTTAGACGGC 1740

GAACCAATGA CAGTTGAAAA AATTTATTTG CCACAGGCAT ACGTACAAGG CCATACGAAG 1800

CAAGACTTCG AGGGCTCTCT TTTCTGCTTA ATCGAGAAGA ACGTCGAGAT TGCTTATTCG 1860

CATCAAGAAA TTGAAGCAAT CTTAGTTGAA GCGGAAATTT CAGAATTATT GAATGTTCCT 1920

GTGGGCCAAC CACTTTTACA AGTCCACTCT ATCACCTATG CGCTTGATGC AACTCCTATT 1980

TTATATGATG TCTCTTTATA TCGAGCAGAT CGGTACACGT TTAAAAACAC ACTGACCCGC 2040

TATAGCCCGT CTGAAAACAA CCAAGTGGAG CTAGGAGGTT CTTGGAACGA ATGAAGATCA 2100 

AAGAAGAAAT AGCCGCTCAA AAAGATTTAT TTTATGAAGA CTTAAACAAA ATTATCGCGA 2160 

TTCGAAGTGT GAAAGGGTCG CCTAAAAAAG AGGCACCTTT TGGCGAAGGA CCGAAAAGAG 2220 

CCTTGGAAGA AACGCTGAAA CTTGCAGAGC GTTATGGTTT TCAAACTGGG ATTGTCAATG 2280 

ACGCAkTggC TATGCGCAAT GGGGAACAGC GGAAGAATAT CTGGGAATTA TTGGTCATTT 234 0 

AGATGTAGTA CCAGAAGGTT CTGGTTGGTC AGTGCCGCCC TTTcAATTAA CGAAAAAAAA 24 00 

TCaACGTTTG TATGGTAGAG GAATTCTAGA TAATAAAGGT CCTATCTTGG CTTGCCTGTA 24 60 

TGGAATGAAA TTACTGAAAG AACTTGGTTA CCAACCAAAG AAAaCCATTC GCTTAATGTT 2 520 
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TGGCACGGAT GAAGAAAGTG GGAGTGGAGA TATCCCCTTA TATTTAGAGA AGGAAAaCGC 2580 

ACCCGTTTTT GGATTTACTC CAGATTGTAA ATATCCAGTA GTTTATGGGG AGCGAGGGAT 264 0 

TGTTAATTAT GAGATCACAA CGACCATCCC AGATGATTCA AGTGAACAAA TTGGTCAGAT 2700 

TATAGGTGAT CAAGCAAAAG ACCACGTACC TGATCAATTA AGTGTGGTGA TTGCGGGAAA 27 60 

AACAACAGCA ATCACGGGAA AACGTGCTCC TTCCAATGCG CCAGAACTAG GCAAGAACGC 2820 

GATTACTTTA TTGGCACAGA AAATTAGCGA GGAACAGTTA GTCAAAGGAA ATTTATTACA 2880 

GTATTTCGAC TGGTTAACCG CTAGTTTTCA CGAAAAGCAC TATGGCGAAG GAGTAGCTCT 294 0 

GGACTTTAAG GATCAGGATA GTGGGCAATT GATTTTAACG CCyTATGyGT TGGAAAAAAG 3000 

AGGACAGCAA TTGGTGTTAT CATTGGCsGT GCGTTATCCT GTTTCTATTA CAGAAAACGA 3060 

AGTAACGACG CAGCTAACGA AGGCACTATT TCCAGAAAGT GAAGTGACCG TCATCCGCCG 3120

CCTCCCTAGT ACGCTGTTTC CAAAAGATGA GCGCAATGTT CAAAAATTAA CCAAGGTTTA 3180 

TGAACAAATT ACTGGCTTAG ATGGGACGCC AGTCACAACT ACAGGTGCTA CGTATGCTCG 3240 

CTTTATGCCG AATATCGTTG CTTTTGGTCC ATCATTTCCT GGTCAAAAAG GCATTGCGCA 3300 

TAACCAAGAT GAATATATGG ATGAAAAAGA TTTACTGCTT AATCTGGAAA TCTATATGCA 3360 

AGCGATGATT GCATTAACAG AAGCATAAAA CCAATAGAAG ATACACGTAT GAGAAGAAGA 3420 

CAATGTGTTT CGTAGAGGTC GCATACGTGT ATCTTCTATT TTTCTGTATA AAATTTCATT 3480 

TTCAGTATAT ACAAAACAGT ATATACTAGT TTATAATGGT GGAGAAATGT AAGCGTTAAC 3540

GAAAGGGCGG ATGGAAAATG ACTTGGGGTG CAATTGCGAC ATGGCGGATG GCACATGATG 3600 

GGTTACTAAA AGCTACAGAA GAATTACAAC AAGGAGGTGC TGCAGGCACG GCCGTGGAAC 3660

AATTAATTAA AGAAGTAGAA GACTATCCTT TTTATAAGTC AGTGGGCTAC GGCGGTTTAC 3720 

CTAATGAGGA AGGGATTTTA GAAATGGATG CTGCCTATAT GGATGGAGAC ACATTTGCAA 3780 

TTGGTGCTGT GGCGGGAATT ACAGATGTTA AAAATCCGAT TTCAGTGGCT AAAGCATTAA 3840

GTAAAGAGAA GTTTAATAGT TTTCGTGTTG GCGCAGGTGC AACGAAATAT TCAATGTTGC 3900 

ACGGTTTTGA AAGAAAGAAT ATGTTGACAG AACGCGCTAA TCAATGGTGG CAAAAGCGTT 3960 

TAAAAGAAAT TCAGGAAAAT CAGTTGAACC CCTATGATGG GCATGATACT GTCGGCGCTA 4020

TTACGTTAGA CCAAACAGGA TCAATGGCAG CTGGCACTTC CAGTTCGGGA CTCTTcATGA 4080

AAAAAGCAGG ACGTGTTGGC GATTCACCAC TGTCAGGTTC TGGTTTTTAT GTTGATAGCG 4140

AAATTGGTGG TGCTGCGGCT ACAGGCTTAG GTGAAGATTT AATGAAAGGC TGTCTTTCTT 4200

ATGAAATCGT TCGTTTAATG GgGGaaGGGC GCTCGCCACA ACAAGcATGT GATCAAGCCG 4260

TCTATGCTTT TCATGAAAAA TTGACCCAAC GTTATGGGAA AGCAGGAGCT TTTTCACTAG 4320

TGGCCATGAA CAAACAGGGA GACTGGGGCG TACTACgAAT GTGGAATTTA CTTTTACGGT 4380

TGGTACTGAT ATGCAGCAAC CAGAAATCTA TATTGCAAAT CCAGGCAAGA ATCATACTAC 4440

AGAGATTCAG CCGATTTCAC AAGAATGGTT AGCGGCCTAT GAAAAAAGAA TCAAAGCACT 4500

AATTGAATAA ATGAGGAGAG TGAAGGAACA TGAGTAAAAA ACAAATTTAT CTTTTTTGTG 4560

ATGCAGGGAT GTCTACAAGT ATTATGGTGA ATAAAATGAT GGAAGTGGTG GAAAAACATC 4620

AAATGCCTTT AATGATTACT GCTTTTCCTA TTGCAAGAGC mCAAGAAGTG GTTGAAGCAG 4680

AGAAGCCTGT TGCtATTTTA TTAGGcCACA AGTCCGTTTT TTTATTAGGn AAAAACCA 4738 
(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 



GGGTGCTGCC CCAATGGCGT ATGCACTTTG GACAAAACAT TTAAAGGTGA ACCCAACAAC 60

TTCTAGAAAC TGGGTTGATC GTGATCGTTT TGTACTTTCA GCAGGCCACG GCTCTGCAAT 120

GTTGTACAGC TTATTACATC TATCAGGATA CAACGTAACA ATTGACGATT TGAAAAATTT 180

CCGTCAATGG GATAGTAAAA CACCAGGGCA TCCAGAAGTG CACCACACAG ATGGTGTTGA 240

AGCAACAACA GGACCTCTAG GACAAGGGAT TGCTATGGCT GTTGGTATGG CAATGGCTGA 300

AGCGCATTTA GCTGCAACTT ATAATCGTGA TAGTTTCCCT ATTATGGATC ATTATACGTA 360

CGCAATTTGT GGTGATGGTG ATTTAATGGA AGGTGTTTCT CAAGAAGCAA GTTCAATGGC 420

TGGTCATATG AAATTAGGTA AATTAATTGT CTTGTACGAT TCAAATGACA TCTCATTAGA 480

TGGACCAACT TCTAAAGCCT TTACTGAAAA TGTTGGGGCT CGTTATGAAG CATACGGATG 540

GCAACATATT TTAGTTAAAG ATGGCAATGA TTTAGATGAA ATAGAGGCTG CGATTGAAGC 600

GGCTAAAGCT GAAACGGATA AACCAACATT GATTGAAGTT AAAACAGTCA TCGGTTACGG 660

AGCACCAAAA GAAGGAACAT CTTCCGTTCA CGGCGCGCCA ATTGGTGAAG AGGGCATTAC 720

AGCGGCTAAA GCTGTTTATG GTTGGGAATA CCCTGACTTT ACTGTGCCAG AGGAAGTCGC 780

TGCTCGTTTT AAAGAAACGA TGATTGGCGA AGGCCAAAAA GCTGAAGAGG CTTGGAATGA 840

GATGTTTAAG AACTATGAAC ATGCGCATCC AGAATTAGCC AAACAATTTA AAGAAGCTTT 900

TGCCAACCAA TTACCTGAAG GTTGGGAACA AGAATTACCT AAATATGAAC TAGGAACAAG 960

TGCCGCAAGT CGTGTAACAA GTAAAGAAAC GATTCAAGCT ATTTCAAAAG TTGTTCCAAG 1020

TTTCTGGGGC GGTTCAGCTG ATTTATCTGC CTCAAATAAT ACAATGGTTG CTGCAGAAAA 1080

AGATTTCGAA CCTGGTCAAT ACGAAGGCCG TAATATTTGG TTTGGTGTTC GCGAATTTGC 1140

AATGGCTGCA GCAATGAACG GGATTCAATT ACATGGTGGT AGCCATGTTT ATGGCGGGAC 1200

ATTCTTCGTC TTTACTGACT ACTTACGTCC AGCAATCCGT TTAGCCGCTT TACAAAAAGT 1260

TCCTGTAACT TACGTCTTAA CGCATGACTC TGTTGCGGTG GGTGAAGATG GGCCAACACA 1320

CGAACCAATT GAGCAATTAG CAAGTGTTCG TTGCATTCCA AATGTTCATG TGATTCGTCC 1380

AGCAGACGGC AATGAAACTG TTGCTGCTTG GAGAATTGCT ATGACTTCTA CGGAAACACC 1440

AACTATTTTA GTTCTAAGTC GTCAAAACTT ACCTGTTTTA GAAGGAACGT TAGAACACGC 1500

TTCTGATTCT GTTCAAAAAG GTGCTTATGT ATTGTCACCA CAAAAAGGTG AACAACCAGC 1560

AGGGATTTTA ATTGCGACTG GTTCTGAAGT AAATCTAGCG GTGGAAGCAC AAGCGAAATT 1620

AGCGGAAGAA GGCATCGATG TATCTGTCGT GTCAATGCCA AGTTTTGATT TATTTGAAAA 1680

ACAATCTGCT GAATATAAAG AAAGTGTGTT ACCTAAAGCT GTGACAAAAC GTGTGGCGAT 1740

TGAAGCTGCA GCAAGCTTTG GTTGGGAACG CTATGTAGGG ACGGAAGGCA AAACaATTAC 1800

AATTGATCAC TTCGGCGCTT CTGnCACCTG GCGGTCTAGT TCTTGAAAAA TTCGGCTTTA 1860

CTCCTGAAAA TGTGGTTAAT ACCTATAAaT CaCTATAAaT AAACGATAAA AAACCTGAAT 1920

GCTTGTGATA GCATTCAGGT TTTTTTGCTT CATTTCTTTT TGGGAGCAAA GAGCATAAAG 1980

TGAAAAAATA ATGCGAACAA TTGGAACCCA ATACCAGTTA AAGTAACACC AGACCAACCA 2040

TAGTTTTGCC ACATTAATGT ACCAATCAAT GAACCCAGTG AACCGCCGAT GAAATAGAAA 2100

AACATAAAGA CGGTATTGTT ACGATTGCTG GCTTCTTCTC CCAAATTTTG GACGCGTGTT 2160

TGATTGGCTA CTTGCCCAAa TTGAGTACCG ATATCTAAAA CAATAATTGC TAAGATTAGT 2220

AAGACCACAT GTGTTCCACC TAGAAAAAGT AAGATAAAAC TACCGAACTG CATCAACAGG 2280

CCAATCAAAA CAATCTTCCG CTCTGGATAA TAATCGGATA GACGACCAAT GATGGGTGCT 2340

ACGAATGCTC CCGATAAACC AAAAATTGCT AAAATACCGA CTTCTTTCGG TCCCCAATAA 2400

TAAGCTGGAC TACTAACAAA GAAAATTAAT GTAGACCAAA AGATTGAAAA GGTTGCAAAC 2460

ATAAAAAAGC CATTGACAGT TGCTTCTCTT AACAAAGGTT GCGATTTGAT TAGCTTAGGA 2520

AGACTTTTCA AAGAACCTAA ATAAGTTAAT CGATTTGAAG TTTGAGTATG TATCACTGTT 2580

TTGGGCAATT TTAATTGAAG TAGAACGACC AATGCTAACA CGAATATTAC GGCGATTAGG 2640

TAGACTGTTC GCCAAGAAGC TGCACTAGCA ATGAAACCCG AGATCGTTCG AGACAATAAA 2700

ATTCCTGTCA ATAGTCCGCC CAGCATTCGA CCTGTTACTT TACCACGGTT CGCTGGTCCC 2760

GCCATGACCG CAGCATAAGG AATAATAATT TGGGCGACAA TAGAGAGCAG CCCGATAAAA 2820

AAAGAAGAGA GTGCAAACAG GAGAAAACTT GGTGCAAAAA ATGCAGCACT TAGAGACAGC 2880

GAGGAGAGCG CGGCTACTCG AATGATTAAT TTGCGACGAT CAACGACATC GCCTAATGGA 2940

ACAAGAAAAA GTAAGCCTAG CGCATAGCCT AATTGGGTCA GCATAGTAAC AAAACCAATA 3000

GAACCAATAC TGACTTGAAA GGCCTGAGCA ATTTGGGTGC CAATTGGTTG AATATAATAC 3060

ATATTGGCGA CCACGACCCC GCAAGTTAAC GCTAATAAAA ACGTCACTTG ATTTGTTAAA 3120

CCCGATTGTT GTTTCTTCAT CTAACCACCT ACATTTCTAT GTAATTCATT TTGTAATACA 3180

ACAACTACTC GGCAATAGTT GTTnTGATTA CAAAAACAGA CTTCT 3225



(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 682 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 



AAGTCCTTGT 


TTATTGGAAT 


TTTACGACAG 


TCGCALLAI I 




rp rn rp rp z** 1 rp /"* 


oU 


CAAATTCATA 


GCTTTTCACA 


CTGCGGTGAT 


TTTTTAAAAA 


TTCCbl IA1 1 


r i pri r 'T ,r pr , 'T i TT\ 

CCCjbTTClTA 


i on 
l^U 


ATGCACCTGT TCCTTTGCCG TGAACAATGG TTACTTGTGG ATAGCCCGCT AGAATCGCAG 180

CATCAATGTA TTGGTCAACT TCTGCCAATG CTTCTTCGTA GCGTTTGCCA CGTAAGTCTA 240

GTTGTGTGCC AACATGACTG CTTTCTGCAG AACGAACCGT TGTAACTCTT GGTTTCGCTT 300

CTTTTTGTGG AGCGACAGGC GTCATATCCT CTTCAGATAC GTTCATTTTT AAAATGCCCA 360

ATTGGACTTG CCATTGTCCT TTGCCGTTGT CTTTTAATAA GGTCCCTCGT TGGCCGTAAG 420

TATTGACAAT TACTTCATCG CCAGCTTTTA ATTTTTTCTG TTCTTtCGCC TTCTTCAGCA 480

CTTTGTTTTT AGCAAGCTTG GTTTCyTCAt GGTGTAAnTG AGAAAGTTGT GkTTTCGCAT 540

CAATTAACTG ATGTTCTTTG ACACCACCTT GTTGGCCACT TTCCAATTGC ATTTTGCGAA 600

TATCAGAAAT AATGGTTTCT GCGTTTTCTT CCGCTTCTGC AATAATTTTA TTTGCTTCTT 660

TCCGTGCTTn TTGTAATTCT GT 682



(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 



GCCAGTTGAG GAnTTGGTGT TGGCGCGGAA TACCnTTTnT CAAAAAATAC CAAGGAGAGA 60

ATTAAAAGGA GTTATCCCCT ATTATATTGT CCGATGCTTT CGCGGAGGAA GTTTTTAAAG 120

GCAATCCCGC CGCTGTTTAT GTTTTGGAAA AATGGTTGCC AATGCTGTAT GAAGAATTTT 180

TTTATACTAT TTTTTTGCCA TATTATAAGG GACGGGTTAA AGAAAGAACT TATAATAATA 240

GACTTAGTAT GATAAAAAAT ACTTTTGCCT ATTTTTTTAA AATGAAATTG AAAAGTATAG 300

CTCCTGTTCA TATACAGAAG TGGCAGAATG AGCTGTTGGA ACAGTATGAA AACACGTATG 360

TAAGAAATAT TTATGGACTG TTTCAGATGT CTTTAGATCG AGCTGTTGTG TTAGGAATGA 420

TTTCTTCAAA TCCAGCAAAG ATCGTTGGTA ATGTGAAAAA AAGTAAGAAA GAAATTGATT 480

TTTGGACAAA AGAAGAATTT GAAAAAGTTG TTAAGACTTT TTATGTAGAA GATTATTACC 540

AAAATTTTTC TTTTATTTGT ATCTGGTTAT TATTTATGAC AGGCATGAGA ATAGGTGAGG 600

CTACTGCTTT AACTTGGAAA GATGTGAATT TAGATAGAAG ATATTTAACA GTAAAAAAAT 660

CACTATATTA TAAAAATGCG CAAACTTATG AATTAGTGTC TCCAAAGACT AGAGCCAGTA 720

TTAGGACGAT TTATTTAGAT GAAGATACAG TTCACTATTT AAGAGATTGG AAAAAAAGGC 780

AAGATGATGT TGGAGGAATT GAATTTATTT TATCTTATAA TTCAGTTCCT ACTCAAAAAC 840

ATACAGTAAG ACATATTATT AAACGTCATG CAAAATTGGC AGAGGTACAT GATATTAGGA 900 
TTCATGCTTT AAGACATTCT CATGCTTCTT TATTAATAAG TATGGGTACT AATGCTTTGT 960 

TGATAAAAGA ACGATTGGGT CATGAAGATG TGCAAACAAC TTTAGGTACT TATGGACACT 1020 

TATATCCAAG TAGCTCCACT GaAATAGCTA ATGAATTGAA GGGAATAGTT AATGTTGAGT 1080 

TTACTAATCA AAATATGGCT TCGGAGGTTA CTAATCAATT TACTAAAGGA GTAAAAAAAT 1140 

AAAGTGTGAT AAAAGTGTGA TAACGCAAAA GAAAACCCTC TATAAATCCT ATTATATAGG 1200 

GGTTTATAGA GGGTTGTTTA TTATTCCCAC TCCACAGTTG CAGGTGGTTT TGAAGTAATA 1260 

TCGTACACGA TACGGTTTAC ATGTGCAACT TCGTTCACGA TGCGCACAGA GATTTTTTGT 1320 

AAAACATCCC AAGGGATACG GGCAAAGTCA GCGGTCATGC CGTCGATAGA AGTCACAGCA 1380 

CGGATACCGA CTGTGTAATC GTAGGTACGG CCATCGCCCA TAACGCCTAC TGAGCGGATG 1440

CCAGGTAAGA CAGTAAAGTA CTGCCAAATG TCACGATCTA AGCCAGCTGC TGCGATTTCT 1500 

TCACGTAGGA TAGCATCTGA ATCACGAACG ATTTGTAACT TATCTTCTGT AATTTCACCA 1560 

AGCACACGAA TACCTAATCC TGGTCCTGGG AATGGTTGGC GCCAAACGAT TGCATCAGGC 1620 

ATGCCAAGTT CAGTTCCTAA GGCACGAACT TCGTCTTTGA ATAATGTATT TAAAGGTTCA 1680 

ATTAATTCAA ATTGCATGTC TTCAGGAAGA CCACCAACGT TGTGATGTGA TTTAATTGTT 1740

TGCGCAGTTT CTGTACCAGA TTCGATGACA TCTGTGTAAA GAGTTCCTTG TGCTAGGAAT 1800 

GAAACGCCTT CTTCCCCAGC AAGTTTAGTT GCTTCGTCAT CAAAAACATA CACAAATTCG 1860 

TTACCGATGA TTTTACCTTT TTGTTCAGGG TCAGAAACGC CTGCTAATTT GCTTAAGAAG 1920 

CGTTCTTTCG CATCTACTTT AATAATGTTT AAGCCAAATT TACCGCCTAA ACTTTCCATT 1980 

ACTTGTTCTG CTTCACCTTT ACGTAATAAT CCATGGTCAA CGAAGATACA AGTTAATTGG 2040

TCGCCAATGG CTTTTTGTAA AAGCACGCCG ACAACACTTG AGTCAACGCC ACCTGATAAA 2100 

CCAAGTAAGA CTTTTTTATC GCCTACTTGT TCACGGATTT TAGCGACTTC CATATCGATG 2160 

AAGTTTTCCA TGCTCCAGTC ACCAGTACAG CCACACACAT CAAAAGCAAA ATGACGTAAT 2220 

AAGTCGTTAC CATATTCTGA ATGACGAACT TCTGGGTGAA ATTGTACGCC GTAGAATTTG 2280 

CGGTTGGCAT CTTCGATAGA AGCGATTGGA CAATCGTTGC TTGTCCCAAC TTTTTCGAAT 2340 

CCAGGAGCGA TTTCTGTTAC TAAATCACCG TGGCTCATCC AAACGGTTTG TTTCGCAGGT 2400 

GTTCCTTGGA ATAATGTCGC ATCACCAAGG ATTTCTAATT CTGCTTTTCC GTATTCACGG 2460

TTTGCGGCAG GTTCTACTTT TCCGCCTAAG TTATGAGTCA TTAATTGCAT ACCGTAGCAG 2520

ATGCCAAGAA TGGGAATTCC TAACTCATAA ATTTCTGGAT CAATGCTAAA AGCATCTTTG 2580

TCATACACGC TGTTTGGTCC ACCTGAGAAG ATGATTCCTT TAGGCGCGAT TTTGCGAATT 2640

TCTTCTGCCG TAGTTCGGTG GCTTAATAAC TCAGAGAATA CGCCAAATTC ACGAATACGA 2700






CGAGTAATCA ATTGGTTAAA CTGACTACCA AAGTCTAAAA CAATAATTTT TTCGACAGAT 2760

GTCAAATCGG CAACGTTAGT CACGTCTATT ACCCCTTATC CATTTAAATT TATCTAAAGA 2820

AAAGGCGGTT AAAAAGTGGT CAGCACCAAG AATTAAGAAA GGATTTTCAA AAATTGTTTT 2880

ACGATTTTTA GAAAATTTTG GCTTAATTCC GCAGGGgCTA CTTTTAAACA GCCGAAAATC 2940

AGGCGGTTAA AAAGTGGTTC ACTTTTGTtC CGTACTCTTC TTTGTCTAGG TATACTTATT 3000

AAAACATTTT ACTCCTTGAA AGTCATGCAT TCTGCTGAAA ATTTTCAAGG ATATTTCTAC 3060

mAAAGATACA GGATATTTCT TAGCTCGTCA ACCCTTAATA TTTACGTAGA TAGATTTGGT 3120

CAATCAAATG ATTTTCCGTT TTATGCAAAA TAATGTCCGC ACGTCCGCGC GTTGGCAAGA 3180

TATATTCCTC CAAGTTCGGT AAGTTCACGG TTTTCCAAAC ATTCCGTGCC ATGGCGAAAG 3240

CATCTTCTCG TTTGCCAATT GCATATTGGT AATAATAATT ATTAGGATCT AAGAAAGCCG 3300

TATCTAATAA GGCCCCAAAA CGTTCTAGGT ACCATTTTTC AATTAAGGCT GGATCGGCAT 3360

CTACAAAAAT AGAAAAATCA AAAAAATCAC TCACATAAAT TTGTTGATTG GCGGGTAACT 3420

GTAATGTGTT AATGCCTTCA ACAATCAGAA TATCTGGTTG TTGTATCAAC TCATATTCGC 3480

CTTCGATAAC ATCGTAGACA CTGTGAGAAT AGACGGGTGC TTTAATTTCG TCTTTGCCGC 3540

TTTTGAtTCG TTTAAGAAAT TAATTAGCTT CTCCATGTCA TAACTTTCAG GAAACCCTTT 3600

TCGGTCCATA ATTCCCTGTT CTTcTAACAC TTTATTGGGA TACAAAAATC CATCCGTkGT 3660

AATGaGTTGA ACGTTTCTTC GTTTAAACGT TCGGGCCAGA ATTCTTTGCA AAAGACGAGC 3720

GGTGGTGCTT TTACCAACGG CGACACTGCC GGCAATTCCA ATGATGAATG GTGGTACAGA 3780

GACATATTCA TGTAGGAATA ATCCTTTGCT AAGTGTTAGC GATTCAAATT CTTTCATGTA 3840

CAAATGAATA AGATGCGTTA AGGGTACATA AATTTCTTGT ACATCTTTTA GTGAAATTTG 3900

GTCGTTCACA CTTTTAATGT TATGCAATTC TGCTTCGGTT AACGGTGCTT TGCCGTCATG 3960

ATAAAACCCA TGCCATTCTT CTCTGGAAAT TGGGTAGTAA TTCATTTTAT CGTCCATTGT 4020

TCCTCCACC 4029

(2) INFORMATION FOR SEQ ID NO: 299:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

AACCGGTGTG ATTACATCAT ACAATACTGA TTCAGATTTG TAAACCGCTT TCTAAAAAAA 60 

ATAAATAAAC ATAGTCCATA AAATGGACTA TGTTTACGGC CTTAAAAAAA GGTAGTTATA 120 

TTTGTAAAAA TTTACTCACT AATAATCTCA ATCTGATAAA AATCAGAAAG TATGAAACTT 180 

TGACTATATT CAATCGGAAC CCCATCGTTT CCTCTAATCA AACGTTTAAT ATTGAATAAT 240

GAAAAGCTAT CTTCTTCACC AAAACTTTTT CTTATTATTC TCAACGGCTC TAGTTCATCT 300 






GAATCCTTCG 


TTAATAAAAT 


CGTGCTCACC 


CCTTGAAGAC 


CCaATTCATC 


GGATAAACTA 


TCCCTAAGTT 


TATCTGCAAA 


GAGATAGCGA 


AATGCAATTT 


GGGTTAGCTT 


AAGCAACGCT 


GCGATCTCTT 


TATCAGTAAA 


TTCTTCCGAA 


THTTTTTCGT 

X \J L X x x x V* *w X 


TATTGGACAT 


CGATAACTTC 




CAAAGGTTCC 


CTCGCCCtGA 




CCTTTACTAC 


TGTTGTATTA 

X \J X x- \J x x x ± i 


TCTGAATAAA 


CTTTCTCACC 


CGTACTAAAC 


CGCTTTATAA 


TCTCATATTT 


AGGCAACATA 


CTTCATTGTA 


AAGAACTTTT 


ATATTTATGT 


ATTTCGCACA 


CCAAAATCAT 


AGCTAAAAGC 


TTGATTTCAA 


TAAATTTGAG 


TTCTTTTTTA 


ATTAAATGAA 


TGATGrAGCG 


AATGrCTATC 



(2) INFORMATION FOR SEQ ID NO: 300:
360
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1136 



TCCATCGTCA TCGGAAGATT AGCCATATTT 
GCTCCTTCAA TCAGGCGTTT TACTGAATCA 
TTTTGAATTT TCCATGGAAA ATCATTCGTT 
TTGCTGCCTG ATTTGTCGCC TAATATCTGT 
ACTAAAGTGA TCGTTTCCTC AACCGACTTT 
TCTGAGAAGA AAACTTTTCT ATGCTTCAAA 
CGCCGAATTA AATACCCTTC ATTTaCCAGA 
CTGACATTAT ACTTTCTTTT TAAATCACCT 
TCTCCAGATT CAATCTTCTC TATGATATCC 
TGATCCTCTC CTAACACGTt AACCGTTAAT 
AAATAGAATG AGTAATAATT ATTTGCTCAA 
TTGGCATGAA AGGGCGGAAG TTTTTGGGAA 
GTATAATTAG CTTATAAAAA TACTGACCGG 
AAAAAACAAn CACCTACAAA AATTTA 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3914 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

AGATCGGTCC nTAATAATTT ATTTTTGCCC TCCAGCGTCG nTAATGTCTC CTGATGAGGA 60 

TGTTTAAAAC GATTATTCAG TCCAGCTGrA AyCAAgCyTC TCGCGGATTA AGCTGTTTGA 120 

TAAATCTTTT AGTAGACGAC GTTTTACTAC CATGATGACC GACTTTCAAG ATGTCGATTT 180 

TCAACTGGGG ATAGCGAGCC AACACTTGTT CTTCTCCTTC GGCTTCCAAG TCACCAGTAA 240

ATAAAAAATG CCGTCCGCCT AATTGTGTGG AAAACACTAT CGAGTCATTA TTGCCCCCAA 300 

TACTGCTCTT ATTTGGTGAT AAAATTTGAA AAGAAACGGG CCCATTAATT CGCTTGGGCG 360 

CTAAAATTGG CCGACAGTCT GTCCCGCTTT TTTGGAGTTG TTTCAAGACT TGGCGAAACG 420

AAAAATTGGC TTCAGCTGCT CCTTTAGGGA AATAAAGAAC AGAAATTGGA ACCTTCTGCG 480

CAATGGCTAA TAAATCGCCA AAATGGTCTT CATGTGCGTG TGTGATGATC ACTTCATCTA 540

AACGTTTGAC TCCTTGACTT TTGAGAAAAG GAATCACTGA ATATTCTGCT CCTGATCGAC 600

TATTTTTCCG TTGCCGCCAA GCTTCTTTGC CAAAAGACAA CCGCCCCCCT GTATCGATCA 660

GTGTATTTTT TCTGTGAAAA GGCGCTTGGA TAAACAAACT ATCCCCTTGG CCTACATCGA 720

TCATTGCAAC AATGCCAATT GGCGAAAGGT ATTTACTATT TATCAAAACA ATTAGTAGTC 780

CGATTGAGAA AAAAATTTTT TTAGACTTGT CTGCTAAATA AATCATTGAA AATCCGATTA 840

GTAATAACAA ACTAATTTGT ATCCAAGAAT TTAAATAGCC GATGGTCAGC TTCGGATTTA 900

AGTGAACTAG CACCCACTCA AATAAGCCAT TCATTGTTAA AAATAGTGGT TCGAGCAAGG 960 

CTGCTAAATA ACGTGCTGGC ATAATAATCA CTAGAAAAAA GCTCAACGTC AAAAGCGGTA 1020 

ACAAGAACCG TTCAAAAATT GGTAACAAGA GAAATGTTAA GAAACTTCCT AACAAGGGCC 1080 

ATTCATAAAA TGATTGTACT AAAATAGGTA AGGAAGCCAC AGCCAATAGG AAAGAGCCTA 1140

AACAATTTTT AATGAAAGGC GAAGAATAAC CAACCAATGT TTGCTCTAAA AACAGCAGAA 1200 

CAAAGGAAAG TAAATAACTA AATTGACCGC CGACTGAAAA AAAGAGGGTT GGCTGAACAA 1260

TCAAGTGCAC TAAGAGAGTT AAACTCCAAC AATCGAGGGC CGGAACCTGC CAATTAAACC 1320 

GTCGATTGAA CGTATTGAGG TTTGTTTGTA AAAGCGCCCG CATGACACTA ACTGAGAATC 1380 

CTGTTAAACC AGCATATATC AGAGAAAAAC CGCATTGCCA CAAAAAATAA CTTTCTTGCG 1440

TTCGGCGAGC AATTCTTAGC ATTAGCAAGC GAAATAAACG AAGGAAAAAA GAAACATGCA 1500

TCCCTGACAA ACTAAAAAGA TGCAAAATGC CTAATGCTGA AAATTTATCG CTTATTTCaG 1560

CAAATTCTTC aTTACGAAAA CCTAATAGCA ACGCATTCAT ATACATTGCA CTAGTCTCAG 1620

CAAAATTTTT CTGAATATGT ACCATCGCCC ACTTCCGCCA GCCGCTAATG ATTTCTTTAA 1680

ACCGCCAAGA AAAAAAGGAA CGCGGTTGTC CTATCTCCTC AATGAATAAA ATAGCTTCGA 1740

TTCCTTGATT CTTTAAAAAT AGTTGATAAT TGAAGCCATT TTTGTTGGTT TGACCTTGGG 1800 

GCGAGTCTAA AATACCTACA GCGGTTGCCT GAATGGTTGT GGTTAGTTGT TGCCATTGCT 1860 

GTTTTTCTTT CACACTTTTT AATTGATAAA ATGCCATCAC TTTCTGCCTT TTGCCTTTTA 1920 

CGATCCAATT GGCAGGAAAT TTTACTTGAT CACCATCGAC TTGTATTTCA TCTGGTAATA 1980 

TCGTTAGCTG TCCTGACAAG CGGACTTCTT TAGGCAACAA AGAGACTGGC TTTAACCACT 2040

GGAAAATCAA TATGCCCCCT ACTATTAAAA AACTGCACAA AAAACCTCCC AATATAACAG 2100 

GAAAACTTTT TGTACAGACA ATACGGATAA CAGTGATAGA AAAAAGAAGA GCTCCTAATA 2160 

GACTAGGTGC GAAAAAACAC CACAGCCCAC AACTAGTCAG CAGAGCTATG AAGAACCAAT 2220 

TATTCCTCAG CTGAGGAAGG ATTGTCCTCA AGTGTGGCAA ATTCTTCTCC CCATTGTAAT 2280 

TCGGCAAAAT ATTTTGGTAC TAAAGTCACT TTTTCAACCG TGGCTCCTAC TTGTTCAATT 2340 

AAATTTAGGG CGTACGCATC ATTACGGTAA TCCTTTAAAT AATAAATCTT TTTAATCCCT 2400 

GCTTGTAAAA TCATCTTTGT ACATTGCAGG CAAGGGAAAT GTGTCACATA AATTTCTGCC 2460

CCTTCTGTAG GGACGCCAAA CTTCGCACAT TGTAAAATTG CATTCATTTC TGCGTGGATT 2520 

GTTCGAACGC AATGATTATC AACAACATAA CATCCTTCAT CAATACAATG CGTTCCGCCA 2580 

CTGACTGAAC CATTGTAACC TCCTGCAATA ATTCGTTTGT CGCGAACAAT GGTTGCTCCG 2640

ACAGTTAACC GTGTGCAAGT ACTTCTTAAT GATAATAAGA CGCTTTGTCC CATAAAATAT 2700 

TGGTCCCATG GAATTCGTTC CATAATAAAC CTCCTTTATT TTTAATGACT CCTGTGTTCG 2760

ATTTGCCAAA TCATTTAATC TTATTACAAA TAGTATGTTA TCCGCATCCC ATAGTCAAGA 2820

ATTCTCAGGA ATTTTCTGTT AAAAGTTAGC CAGTCACTGT CAACATGTCT TTTAAACGTT 2880

CTACTGTnTT CTCTCCAATa CCTGGgAACA TTTTTCaGTT CCTCGACGGT TTTAAACGGT 2940

CCTTGTTCGT CACGAAmACG AATAATCTCT TGTGCTTTTy TCGCACCGAT TCCTGAmATC 3000

GTTTGTAGCT CTGCTTCTGT TGCCGTATTA AGGTTGATTT TTTCCTCCTG ATTTTGCTGA 3060

GCAGGTGCTG ATTCCTGCAG CGTTTCTAAT GATTGTGCCA CTGGnTCTCC TTTTTTGGGC 3120

ACGTAAATAA TCATTTGATC TTTtACTTTT tGrGCGTAAT tGACCTGGGT AGTATCAGCT 3180

TCCTcGtCAC GCCTCCAnTA AAGtAAAGCA TCCCAAATCC GCTGCTGATT TTTTAGTTGA 3240

TAGATTCCTG GTACTTTCAC TGCGCCTTTA ATATCAcATA GATCATTGTT TCTGCCCTTT 3300

CTTTTGTGGC ATCTGTGGCG ACCTCTGTCG TAGTGGTCAA CTTGTTTCTw CmcACATTGT 3360

GGTGTCTACT TGTAGCTCCT TATTAACCAT CAAATAAATG CCCACTAATA GAGTTATCAA 3420

AATGACCATG AACACGCCAC TACCTAGCAA TAGCCACTTT TTAGGTAACT GTTTCAACCA 3480

ATCCATCGTT TGCCCTCCCT TCAAAATATA ATTACGCATT TTTTAACAAA AAAATGCGCC 3540

TGGACCAAAA ATCACTAGTA ATTTTTGGTC CAGGCCTACT TTTATTTTAA TTTCTCTAGA 3600

TAGTTCAAGG CATCTTGAAC CgTTyTTACT GGGACAATTT TCATTTTTGT CCCAATTTTT 3660

TTCGCAGCTT CTTGTGCTTC TTGATAATTC GATTTGATTT TTGGCTCAAC TTTTTTCATT 3720

TCTGAAGTTA TTTCATCATC AGGTGCAAAG AAAATTTCTG CGCCGTTCTC ACTGGCAGTT 3780

ACAACTTTTT TGTCAATGCC TCCGATTCTA CCAACGATTC CTTGACTGTT CATCGTACCT 3840

GTTCCAGCAA TTTCATGACC TTTACGTAAA TCTTTATGAC TTAATTGTTC ATAGGTCTGC 3900

AAAGAAACAT TAAA 3914



(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

TCGTGGCAGG AAAAATGTTG GnACTTTATC ATTACGGATT AAAAAAATCn ATGnTCCTGA 60 

AAAAGCTGTC ATGACTTCAG GTGGTAATTT TTTAATCGTT CTGGTAACCC ATTGAAACCA 120 

CTCCTTCTTT TTTATCATAC CACAACATCC ATTTCTCTAA AAGCGCAGCA CTTACTACTG 180 

AATATCTGCT AAAATTTGCC TGATAATCTG CTTTTCAGAC AGTTGTTCAG ATAACCGCTG 240 

TTCTCCCGGA AACCCTAAAT AATCTTCTGG TAGCCACCAA CGTTTCATTT CCACGACACC 300

AAATTCTTGA TTCTTGTTTC TCTGGCTATG TCTTTTCAGC GTTTCTTCAA AAGAAATATC 360 

AAAATAATAA ACCTGAACGT TTCCTTCAAA GAGATGGATT ATTTCCTGAA AAAAGGAATG 420

ATAAATGGCT TTTTGAAAAA TCCCTTCAAC AATCACATAC TGGCATTCTT GATACCCAAA 480

AGCTACCAAC TGTTTTAACA AAGCAATACT TAGATTACCT GTTTCATCTT TAACACGAAG 540

CATTTCTCTG CGCACCACAT CTTGAGAAAT CAATAACGAT TGTTCTTTAA GACACTGATG 600

AAGCGCGTTA gcTATCGTTG TTTTACCACT ACCAGAATTA CCACGTAAAA TAATTAATTT 660

CGGTTGCATC TGTTTCCCTC CATTTTCTTG ATTGTTGAAA ACCAGCACAA AAGAGCAAAA 720

AACTTGGAAA AAACTCTCGA AGTTTTGCTG TTATTTTGAT GAATGTTTCG TTGAATCTCG 780

CTGAATGAAT TCCGTTCCGA CAATAATTTT TCTAGGAACT GGGGGGGGAT tCAAGCATAA 840

CTCTTTCATT GTCTCAACGG CTAGCTCACC CATCCATTCA GTATGAACTT aACTGTkGTT 900

AATGTTGGTG AAACATATTT GCGACACTAA TATCCtTAAA ACCAATGATA GAAATATCTT 960

CTGGCACACG AATGGCTGCT TCTTGTAACG CCCGCATTAC ACCAATCGCC AATGTATCAC 1020

TTGcAGCGAA AAAAGCGGTT GGACGAGTGT CCGCTTCTTC AATAAAGTCT TTCATCGCTT 1080

GATAGCCTTC TTCAACAGAA AAAGCTGCTT CGACAACCCA TTCTTGATGA TAAAGGTCCA 1140

GATCTTTTAA GGTTTCTTTA AAAGCGACAA AACGTGGATC AACCAATGCC ACGTGGGACT 1200

CTTTCGTATA TTCTTGCCCA GAAAGCATAC CAATTTTCTG ATGCCCTTGC GCCAAAAATT 1260

CTTGCACAAT TGTTTTAACA CTTCCAGCAA AATCAACAAC AATGGAGTTT TGGCCAAAAG 1320

CCATTGCATC AAAGTCTACA AATAGAAGAG GTGCGCCTAA GGCGGCTAAG GCCGCCACTT 1380

GCTTTTCATC AAATTTTCCT AGGGCAATTG TCCCATCAAA TCTTTTTTGA GACAGGTTTG 1440

TCAGAGATTC TTTGACTAAA AGAAAACCCA ATTCCTCTGC CCGCTTTTCA ACACCTAAAC 1500

GAATCGATAA ATAGTAAATA TCATCCAGTT CTTCTGGATC GCTGAACCAT TGGACAAACC 1560

GAATTTTCTT TTTGCCTGCT TGTTTTTTTC TGGCTCGCTT TTTATAATTT AACGTTTCTG 1620

CTGTTTCAAA TATTTTCTTC TTCGTTTCAT CACTAACTGA TAATGTTTCA TCATAGTTCA 1680

AAACTCTCGA AACTGTCGCC GAAGAAACAC CGACTTGCTC TGCTATATCT TTAATGGTTA 1740

CGATGTTCCT CACCTCAAAC TAATCATTTT CTATACATTG AATAAATCGT TCAAAAGCAC 1800

GACGACCTTC TTCGTCACGT TTAAAGACGC CGGCATCTGT TAAAATTTGA ACAAAAATTG 1860

TCCCAATTCC TTGTTGGATC ACTTCTTTGA CATTGGCTTC AGTGATGGTT TGCTGACTGA 1920

GTTCATCCGC CCAAGCTCGA TGACTTTCAG CAATTTGATT GGGTTGCTTT AATAAATACT 1980

TTTCGACTTC TCTTAATTCT TGTGCCAGCC TTGGTGGCAA AATAGCTAGT CCCATTACTT 2040

CAATTAAGCC AATATTTTCT TGCTTAATAT GTTGTACTTC TGGATGTGGA TGAAAAATGC 2100

CATCTGGAAA TTCCTCCGAA ACATTATTAT CTCGTAAAAC TAAATCCAAT TCAAATAATG 2160

ACCCTTTGCG TCGGGCAATT GGCGTAATCG TATGATGCGG TGTGCCATCT TTTGAATAGG 2220

CCCGAACTGA GACACGTTCA TCAGAATAAT TGCGCCATTT GGTCAAAATT AATTCGGCCG 2280

CCTCAACTAG TTCCTCTTGA CGATCACTTT GAAGACGAAT CACAGACATC GGCCATTTTA 2340

AAATACCAGC AGTTATCTCA GGATATTTTT TCAAAGAAAA TCTTTTTTCC ATTGGTGCTA 2400

AATCCATTGG AAATGTATGA CGTCCTGCTT GGTAATGATC ATGAGATAAA ATAGAACCAC 2460

CAACAATTGG CAAATCTGCA TTAGAGCCCA CAAAATATTC AGGGAATAAC TCCGTAATTT 2520

TAAGGAGTCG CTGGAACGTT TTTTTCTCGA TTTTCATTTC TCGATGTTGT TGGTCCAAGA 2580 

AAATCGCATG CTCATTATAG TAAGCATATG GCGAATATTG AAAGCCCCAA GCTTCGTCCT 2640

CTAAATTTAG TCGGATAATC CGATGGTTTG TACGTGCTGG ATAATTTACA CGGCCATAAT 2700 

AGCCTTCATT CTCCATACAT AACATACACG CTGGATATTT TTTTTGACTC ATTTTCTTTT 2760 

CAGCCAAAAT TTTTCGAGCA TCCTTTTCTG GCTTCGATAA GTTAATGGTA ATTTCTAGTT 2820 

TACCATAGGG TGAGTCCATT GGGAAAATAA TATTTTTGGC GATTGCTTTC GTTTTAATGT 2880 

AGTTATTCGT TTTACATAAC TGATAAAAAT AATCCGTTGC TTTTCGTGGA GACTCTTGAT 2940

AATAATCCAT AAACTTCTGA TTAATGATGG TTGGGGTCGG CGTAATAAAA TCCATTAACT 3000 

CCGCTTCTAG TTGTTCTGCT TCCGTAATTG AGTCATCGAT TTGGTGATTG GCTTGTGCAA 3060 

TATTTAAAAG ATGTTCCACC ACACGCAACG GTTCTTGCTC ACAAGCAACC GCTTCTGGTT 3120 

GTTCAAAAAA ATCACAGCCA ACTAGTTTGG CAATCCGATT CGTCAAATAA TAGCGGTCTG 3180 

TTTCCGTCCA ATCGCCATTT TCTATGATTA TCTCTACAAA TGCAGCAATC CATTTACTAC 3240

TTTCTTTCAT CATCTGATGT TCTCCTATTC ATTATAGCCT TGAGGATGTT TCACATGCCA 3300 

TTGCCAAGCG GTGGCGATAA TATCCTTGAC CTCAGTTACT TCTGGTTGCC AGCCTAAAAC 3360 

TCGTTTGGCT TTTTCACTAG AAGCAATTAA CGTACTAGGA TCCCCTGCTC GGCGTGGGGC 3420 

GATTGTCGCA GGAATTTCTT GACCTGTCAC TTCGCGAGCA GCATCTAACA TTTCTTTAAC 3480 

AGAATAGCCG TTGTTGCTAC CCAGGTTAAA GACGTCACTT TCGCCGCCAT TTTTCAAGTA 3540 

TTCTAAAGCC AAAATATGTG CTGCAATCAA GTCTTCAATG TAAACATAAT CTCGAATGCA 3600 

CGTGCCATCT GGTGTATCAT AATCATCCCC AAAAATACTT AGCTCTGCTC GTTGGCCCAA 3660

TGCCACTTGT AAAATAATTG GCACAATATG CGTTTCTGGC GTGTGATCCT CACCAATTGA 3720 

GGCATCTTTT TTTGCTCCTG CAACATTAAA ATAACGCAAA GCAACATATT TCATTTCATA 3780 

AGCGTTGTCG CACCATTTCA TGATTTTTTC CATCATTAAT TTACTTTCCC CATACGGrTT 3840

TTTAGGATTC GTTGGGGTTT CTTCTGTAAT AGGCATTGCT TTTGGTTCGC CATAAGTGGC 3900 

CGCTGTGGAA GAAAAAACAA TGTGTTTCAC CCCGAATTCT TGCATGACTT CCAACGCAAT 3960 

TTGAGTGCCG TGAACATTGT TATTGAAATA CATTAACGGT TTCTCCACGG ATTCTCCTAC 4020

TAAAGAATTG GCCGCAAAGT GCAACACCCC TTCAATTGAT TCCTTTTCAA AGACGCTACG 4080

TAAAAAAGCT TTATCACGAA TATCGCCTTC ATAGAAAGTC GCTTGTTCAT GAACTGCTGA 4140

ACGATATCCC GTTAATAAAT TATCAACAAC AATAACGGCA TATCCTTTTG AAATTAATTG 4200

ATCGACGGCA TGTGAGCCAA TGTAGCCTGC GCCACCTAAT ACTAAAATAG ACATAAATAC 4260

GCTCCTTTTA CTCCCTAATT TCTTTTTCTA TTGTACCATT AAATAAGGTC TACTTCTGAA 4320

TTATTTCATT TTGTAAACAG GATTTAATTT GGCCATTTCT CCTTATTCAT TTTTCCTATA 4380

AAAATACAGA AAAGACTTGG TAAATTTTAA CAAATCTCTA AATTAGTCGT AGAGTTTATG 4440

AAAAATCATT GTAACTATTG GATTTTTTTG CTTTTTTCCG TCATAATAGA AACAAATACA 4500

ATTTGAAAGA AGGAATGAAC GATGAAAAAA TTTGTATCAG GTATGTTAGT CGGTACTGCT 4560

ATCACAGTTG CTGCTTTAGC CGGTGTGGCT ACAACTATTA AAAAaACAGT CATTGATCCA 4620

ATTGAAGAAA AAGAAGATAT GATTGAAGAA AACCGTAAAA AAGCGATGCG TAAGCGTATT 4680

GCACGTTAAA AAATAGCTGT GACGTTGAGA TAAAAAATCT AACGCCAAGA AATCAACAGA 4740

AGCGCTTCTG GCTCTGTTGA TTCTCTTAAG mATAACAAGT GGTCATCCAT CGCTTTGATG 4800

GATGACCACT TGTTATGCTT TCTTTTTTAC CAAAGGAACT GACTATTTTT CTTAACTATT 4860

AAAAATAGTG GCAGCAATCC GATCCATATC GCGCATATTC CAAGGATCCA CACCAGGACG 4920

AGCTGCAGCG GCTGTTAACC AATGAATACC TTCTGTTGGA ATACCACAAC TAAAAAGGCC 4980

TGTCGCCACC TGTCCATCTT TTGTCATGAC TTGATTCGTG GCAGGCACAA TCAAGAGTGC 5040

ACCTGTTTTG ACTATTTGAC CATCGGCTAA TGCCAGTTGA TGGAGCGTAg cTAGCTTATC 5100 

GGCTAACAAC TGCCTCGTTA AAGGATTCAA ACTTTTTTCA TTATTGACTT TGGGAATGCG 5160 

CGCTTCAATT AAAAAGTGAC TATTATACCG ATTTTCCGGA AACAAATTAG CATAAGTAAT 5220 

AAATTGACCC GCTTTCATTT CGACACGCAT TTCTGGCCCC AGAATCGTGA CTAATCCCAG 5280 

TTCAACTAAC GCCTGTAGCT CTTCTGTTCT TTCAACAGAA GGTCCCACTG ACAAAAAGGT 5340

ATTCAACGGG GTAAACCAAT GCCACAACCA CTTTTGACTC TCGTCATTCG TTAATAATCC 5400 

TTCATCCAAA ACAAAGCGAA TCTGATCTCT TAAATCTTTT AAACTGTCTA AAGCAGTGGT 5460

CAAAGGACCG CTAAGATTAC CTTTCTCTGC TTCTTGAATA TCCCATTTTA AATACTCTAG 5520

TAAAAACGAA CGAAAACTTT CATGAGTAGT TTTCTTTGTT TCAGGATGTT CTAACGCTTG 5580

CCAATCAAAC CAATCTTGTT TTTCAAAGTG GGCTAGGCTT TCCGGCGCTC CCTTGGTTTG 5640

AATAAACAGC TGTTGAAAGG CCTCTTTAGA AAGATGCGGA TATTTTTCTT CGATTAAGCA 5700

AGTGTAATAC ACATATTCCA CTTCTTTTTT TAAATACCCA AAAAACTCTG CGCCTGTAAC 5760

CTGTCCTTTT TTACGACATT CAGCTAAATA CGTTTCGGTT AAAAAAACAG GCTGATAGGT 5820 

TTCCCCGTAG CCTTTTTGAT TATTCCCTCG AGGATGAGAA GGAAAACCGC GACCAGAACC 5880 

AGCAATAATA ATAGGCTCTC TTCCAGAAGG TTCATAACGT AAGCGGCCAT TTTGACGAGA 5940 

AAAAACACCC CCGCGGCCAA TCGTCAAGGC AGATAAATAG TCAAAAAATG CCAAACCTAA 6000 

GCCTTTTAAC ACAACGGGTT GTTTTGCAGG TATATTGGCA AGTGCCAATA AAGCATCCGC 6060 

CGCATTTTTA GGAGAAGAAT AAAACAAACG ATGCTCCTTG GCGTAGTCCG CAAGCTCTTG 6120 

TTCAACAGTC GTTAATTCAT TTTCTTGGTG ACCCAAAGCT AAAATTACTT TGTCTGCCGT 6180 

AAACTCAATG GATTGAGTAT ACACATAATA GTTATCCTCT CCTGCCCGCA CAGCAGTTAC 6240

TGGCTCTTTA AAAAACGTCA GCGATGTTTG TTCGTTACAC CGCGTGGCTA AATACGTATA 6300 

AAACCATTTT TGGTACATGC CATAAAAAGC CCTCGAGCAG TGATCATTCG GTCCAAGGCG 6360 

ATGACATTCT TTCAACAACT CAAGATAATT ATAGGGTTGT TGCTGTTGAA TGTACTGCGC 6420

AgcTTCTGTT TGGGTCCATT CATAGAGAGT GGGGCCCTTG ACAATTGGTC CTTCGCCACT 6480

AAAAGATTCA TCTGTAAATA AAGTGACTTG ACTAATCACG GTGTTCATTA ATAAATAATT 6540

AGGTTGCTCT GCTCGCCAAA TTTTCCCACC TGGACCGTAT GGATCAAACA ACGTGAGCTG 6600 

AACATTTATT TCCGTTTTAC GAGCCCATTC AATGACTCGT TCTGCAGCTG ATAAGCCTCT 6660 

GGGTCCAGCA CCTACAATTG CAATTTTCAT CCAATCACCT ACTCTATTTT TCTAATAATT 6720 

CTTTTAAGAC AACCGCTTGA TTATGCACTG TATCTTTCGC CCCATAGACC AGCGTGACTA 6780 

CTGGCTGATT TTTTACAATC GTCTGTAACT GCTTAAACGC TGTGGCAGTT GGCTCATGTT 6840

TCAATTCTTC TTGGTATTTT TGTTTAAACT CGGGATATTT ATTTGCTTCA TGATTAAACC 6900

ATTTCCGTAG CTCTGTACTT GGGGCCACTT CTTTGAGCCA TTCATCTAAC TTTTCAGATT 6960 

CCTTGGATTT TCCTCGTGGC CAAACACGGT CAACTAAAAC ACGGTAGCCA TCTTGCTCTG 7020 

TCGGTGCTTC ATATGCTCGC TTGATTTGAA TCATCTGCCT TCACCTCTTC TCTTTATTGT 7080 

AGCTCAAAAA GTGAGGACTG ACAAAAAGCA ACATCTCTAT TAAGCTTTCC TATATAAAAA 7140 

GAGTTGGTCT CAATGAAGAC CAACTCTTTT TGTTATTAAC TATTTCTCTT TTTCACTTTC 7200 

CCAGTCCAGC CATCATAGCC GCCTTTTAAA ATATAAATAT CAGTATAGCC ATTTTTGCGT 7260 

AATAGATTAG CTGCTCGGAT ACTTAAAGCT TTCTTCTGAT CATAAAGATA AACAGGTTGA 7320 

TCCTTACGAA GAGAACCAAT TGTTGTTTTT AACATGCTGT ATGGCATGCT GCGCGCCCCT 7380 

AAAATATGCC CAGCATCAAA TGTATCTTTC TCACGAACGT CAATCACTTG TGCTTTACGC 7440

ATTGTTTCTT TAAATTCTTC TTCCGTCAGC ATTTTTGCTG AACGTTTCAC CATAATTTTC 7500

AAATATAATT CATTGAATAC CATCGCTAGT ACAATAACTA GCAAAATGCC ATTGATTACC 7560

CATAAAATAC TCATTCTAAA AAGCCCTCTC TATTTCTCTT TTGAAAATTG TAATGCTAGT 7620

GAGGCGGCAC CAATGACGCC GGCCTCATTA CCTAGCTCTG CTAATTTGAT TTTTGTACTA 7680

TTTCGAACTT GCGGGAAGGT AAATTCTTGG AAATATTTTT CGACACGACT ACGTAAAAAT 7740

TCTCCTGCTG CAGAAACACC TCCACCGATA ACCACGCTAT CTGGATTTAA CGTGTTACCT 7800

AAGTTTCCTG TGGCTAAGCC TAAATAGAAA CAAACACGAT CCACAACCAT TAAAGCAAAA 7860

TGATCGCCTT TTTCGGCAAA TTCAAAAACG TCTTTACTCG AAACATCTTG TCCATCATCA 7920

ATCGCTTGTT TCAGTTCAGA GTCACCTGCA AATTCTTCTG ATAAATGACG TGCAACACGA 7980

ACGACACCAG TGGCACTAGA AACAGTTTCT AAACAGCCAC GTTTCCCACA GGTACAGTCA 8040

AAGCCGTTAG GATCAACCGT CACATGCCCA ACTTCACCAG CACAGCCAGC AACACCGTGG 8100 

AGCAATTTTC CCGCTGCAAC AATGCCTCCG CCGACACCTG TTCCTAAAGT AATAAAAATC 8160 

ACGTCGGGAT TGTTTTCCCC CGCGCCCTTC CAACGCTCAC CTAAAGCGGC AACATTGGCA 8220 

TCATTATCTA AGGCAAACGG AATTCCTAAA GCAGATTCGA TTTGTTCTTT GACAGGCTGT 8280 

ACAGTCGTCC AGTTTAAATT ATAGGCACCT ACAACAGTTC CTTTTTCAAT GTCGACACTT 8340

CCTGGTGTTC CCATACCAAT TCCCACAAAA TCTTCTTTTT TCATATTATA AAGATCTATA 8400

CGGTGACGAA TACTTTCAAT AATTGATGGC ACAATATGCT TGCCATCTTC TAAAATATTC 8460

GTTTCAATAC TCCATTTTTG TTGAACAACG CCATCGGTTG TTAAAATAGC AAATTTGATG 8520

GTCGTTCCAC CTAAATCAAT CCCAATAATC TTCTTATCCA TTTAGTCGAT TCCTTTCTGC 8580

TTCCCTCTAG ATTCTTCTAT TCGATGCTCT CTTTGCAAAA TATGCCGGGC CTGTAAATAC 8640

GTATCTCGGT CTACCAATTG ACCTTCATAC AATTTTTTTA ATTCAATCAT CATTAACTCA 8700

ATATCATAAA TCCGTGCACC GACATAAATA TACATCCCAA ACTGTTTAAA AAGTTGTTGT 8760

ACATCATATA ACGTTTCCAT ACGAAAACAC TTCCTTATTA TACAGGTAAA CCGTATTTTT 8820

TAAAACCCAA TAACAGACAA ATTACGACmA GAAAAATAAa AATTAACGTG GCCATAATTC 8880

GCTCGTGCTT ATTAAATGCT TCTTCACGAT TTGGAACAGC AAAAGCCGTC GCGACTAATA 8940

GTCCACCAAT TAGACCGCCA ACATGACCCA TCATATCTAC AGAGCTACTA AACAAGTTAA 9000

ATAACAAATT AATCACAATA AACATTGAAT AGCGTTGCAC CATATACATA ATCGCTGGAT 9060

TGTCTCTAAA ATGGCGTCCT AAAATAACAA ATGCGCCGAA CAAACCAAAA AGCGCTGTAC 9120

TAGCACCCGC TGAGACACTG TTGGGTGTCC CAAAGGCAAA ACTGGCGATA TTTCCAGCAA 9180

TCCCACTCAA TAAATAAATT CCCAAATAGC GCCAATGCCC ATAAATTGCC TCAACCTGTG 9240

CGCCGATATA ATAAAGAGTA ACCATATTCA AAATAATATG CATAAATCCA ATGTGAAGAA 9300

ACATTGGCGT GATAAACCGC CAGTATTCAT GATTTTGAGC TACCAGTGGA CGAACCATTG 9360

CTCCCCAATT AACTAAGTTT AAAATATTTT CTGATCCTCC AGTTAATTCC AATCCAAGGA 9420

AAACAATGAT TGAAATACCT AAAAGGGCAT ACGTAATGAA CGGCTGATTT TTTAGCCGTT 9480

TCAATTTCAT TTCTGTTTGA TAATTCATTC CATTCCTCCT ATTGTATAAA TAATTGTTCG 9540

ATTGGCTGAT CAAACGCTTC TGGTTGCCAT TGCTCATGGA GTTGTTCACT AAAAACTAAA 9600

CTACATGAAT GGCCCGAAAA ATGTTCCAAA TAACGATCAT AAAAACCACC GCCGAAGCCT 9660

ATTCGATAGC CAGCACGATT GAAAACAATC CCTGGCACAA TcAATAAATC AATCGCTGTA 9720

GCTGTTATTT CTGCCGCTGT TAACGGAGGT tCTTCCACGC CAAATGCACT GGTATCATAA 9780

ACCGTCTCTG GAAAGACTTG ATAAAAGTGC ATTTTGCCTC CTTTAAATGT tCGCGGCACC 9840

GsCAcTTGcT tGCTCtCTTG CATGGsCCGk TCGAAAATTG GCTGTGTTGA TAATTCTAGC 9900

GGTAACGACC GAATCATTCC GATCGTTTGT GCTTGTTGCC ATTGCTGACT AGcAAcAACT 9960

GTTCATAGAG ACTTGATTCT TTATGTTGTT TAATTTCTGG ATGTTGTTTT AGsCATTCCA 10020

GGGCAGTTAA GCCCACTTTT CTCAACTTGT TTTTCTCCAT GTACACCTTC AACCCCTTTT 10080

CTGTTAAGTT TATAGAAAGT GTTCTTAAAA AACAAGCTGA TTCCCCTTTT CTCTTTGAAA 10140

AATCCCTTTA TTTTTCTAAC GTTTTTGACG CTTCTTTAAA AAAGTCAACA ACCAAATCCA 10200

TTGTTTCTCC TTTCACTAAA TGCCGCTCAC CGATTTCCGT TATAAATTGC GTATTTCTCG 10260

CATACGTTTT CCCAGACACT AATTGTTCAA AATCAGCCAT GTCTTCATAG GGAATTTTTT 10320

CATCTTCCGT TCCATGCCAA AACAGCATCG GTCGTTCTTG TAGTCTTTCT GGCGTTTGTG 10380

ATAAATCGTA TTGATGCACC CAACTTAGCA ACAGTGGCAA GTCTTTAGGT ACAAAAATAG 10440

CAAATTCTTT CGCCCGTTGA ATCACTCGCT GAATATAGCG CGCCGGATAG GGTGTTCCCA 10500 

TTAAGCAAGC AGCAGCTTGA ATCTCAGGAT GTTGGGTCAT AAGAGCGCTG GTAGTAATAC 10560 

CGCCCATGGA TACACCACCC ACGCCAATCC AATTTTCTTT AATCAACCCT AATTTATGAA 10620

AATGATGGaT AATTACTGGA AATTCAATTA AATTATATtG AATGCTACTC CAAAATGTAA 10680 

TCGAAGGaAT CGTGGaAATA gGTtCTGnTT TTCtTTCTCC ATGGtTCATG GcATCTGGTA 10740

AAATGACCCG AAACCCCTCG TGTGCTAATT TTCTTGCTTG TGTTAAAGAT AATTCTTTAG 10800 

CTGaTTGCCA ACCATGATAG TAAACAATTA AAGGTAATGC TTCGTTTTTT TGCTCTTCTG 10860 

TTgTCACTTC TAATGCTGGA ATTTTTGCTA TGTAGCGATG TCTAATACTG ATTTTCATTC 10920 

ATGATTCCTC CATTATAAAA TATCTTCTCT TATTTTACCG TTTTTTGATA AAACCTGCTC 10980 

TTTATATGAT TTATCAAAAA AATCACAAAA TATGCGAATT CGCATTTTCC TATTTTATTA 11040

CTTTCCGTTA TAATCATTTT ATTAAATACA AAGGGGTGTT TTTATGGTTA AAATTGAGGA 11100 

AAAAGGTAAA GTAACATTTG GACAAGCTTT TAAAGATTAT TTTCGTGGAT ATGTTGATTT 11160 

TAAAGGTCGG ACCACACGTG CTGGTTATTG GTGGATGACA TTGGTATTAA GCATTTTAGC 11220 

ACTTATTTTT TATATTGCCA TTGTCGGCAA AGCTGtGTCA GCAATTTTAG CAGCTGAATA 11280 

TTTTGAAACC TATGATTTTG GGAACTTATT ACCGTTGATG CTTTTgCACT TGTTTTATGG 11340 

TTAGCTTTAT TATTAcCAAC GTGGGCAtGT GTGTcCGTCG TTATCGTGAT GCTGGGATGA 11400

CAGGTTGGGG CGTGCTTGTT TTATATCTAC TTTCTATTGC CTGTAGCTAT ACACAAGTTT 11460

TCTCTGTTAT GTCAACACTG AAATATGATG TTCAAACAGA TACTGTGATA ACAGGTGGCA 11520

GTCCCGTCTT CTTATTTTTC ACTCTTGTGA TTAGTCTGTT TTTCTTCTTA CTAACAGTCT 11580

TACCGACAGA TAAACTAACA ACAACTAGCC AAAATAGCGT CTTACGCTTC TTCTTCCGTT 11640

ACAAAGAAGT GAAGTAAGGA AAAGTTTTAA GCCTTTCTTT TAAAAGAAGG GCTTTTTTCT 11700 

TATTTTAAAA TCTTTATGAT AAAATAAGAA CTAGTCTAGC TAAAAAAATG GAAGGAGAAA 11760

ATATGGCTAC TTTTAAACGG ATTCTGAATC TAGAAAATTA TATTTTTACT TTTTTTATTT 11820 

TATTTGTCTT TTACGGCGCA ATTAACTCTC CCTATTCGCT GGTAAAAATT CTTTGGGAAA 11880 

AGCCTAAGCA CTTGCTTGTT TTACTTCCTC TATTAATTAT TCTATTGTTG ATTATTTATA 11940 

AAAACAAAAA ATTTCTTTAC ACTAAGATTC AAGCGCTCTG GCACTTAATA AAAACCCATG 12000 

ATCGTGTGGT TATTATTGGT AGTTTGATCA GTTTATTTTT ACTACAACTT TTACTCCTGA 12060 

CGCAGATTAC TGTGCCGATT GGCTGGGATG TTTTTGACAA CTTTCATAGT ATTACCACTG 12120 

AAAATAAGGA TTATTCAAAA ATCGTACTTT CTTTAAACCC CAATAACGAA TTTTTCTTTT 12180 

TCATGATGTA TTACCTTAAT AAATTTTTGC GTTTCATTGA TGTAACAGGG AGCTGGAGCA 12240

ATACTTGGTT TAGTTGGCAA GTGGTTAATT GCTTGTTTAT TAATAGCAGT TTATTTCTCT 12300 

TTTATCATGC CTCTAAGCGT GTCTTTAATC CATTAACGGC CTTCGTTGCT TACAGTTTAT 12360

TTTTTCTTTC TTTCGGATTG TCTCCATGGT TACTCACGCC TTATACAGAT ACGGCCGTCT 12420

TATTGTTCAT CAACCTAGTC TTTTTTGCAT ATAGCCTTTT TGACCAAGTT TCCCCTCCCT 12480

TCGTAAAATA TTGTTTATTA CTGTTTATTG GGATTGGCCT AGCTTGGTGT TTTTTAATGA 12540

AACCTTCGTC TATTATCTTT TTTATTGCAT TTAGTTGTAT CAAAGTGCTG CAACTTCTTT 12600

TAGTAAATCG GAACAAACAG TCTATCGTAA AATTAACCGT GGTTGCTCTC TTTTTACTGA 12660

CTGGTTTCGC CAGCGCATAT TATAGTTTTC AATTTTTTGT GGAAAAACAA ACGATTACGG 12720

AAATCGATAA AGAACAAoCC AAACCTTGGA CACTTTTTGT AATGATGGGT TTGACTGGAA 12780

CTGGCGGTTA CAATGATGCA GATACACAAG CAGTAAATCA GTTGCCTACA CAAGAAGCCA 12840

AAAAAGCTTA TACTATAAAA ATGATTCAAG ACCGTTTAAA AAATAAAGGA TTCTTCGGTT 12900

ATCTCCGTTT TCTGGCTCAA AAAAATCGTC ATAATACTGC CAATGGTGAT TTTGACTGGG 12960

GTTGGGATGG CGGTGATTTA ATCCCAGAAA CACCTTCTAA AAATCGTTGG CAAGAACACT 13020

TGCGGTCACT GTATTATCCG CAAAATCAAA AAAGTAACTA TCTACGAATT TATATGCATT 13080

TTTTCTATTT aCTcACATTa CTaGGTcTAT TGTTCAGTAT CCCTCTAAAA GATTCAAAAA 13140

ATAACTACGC GATTTTAAAA TTAGCTTTCA TTGGAGCTAT TCTGTATTTA TTGCTCTTTG 13200

AAGGCGGCCG CTCTCGCTAT CTAATCCAGT TTATGCCGTT TTGGTACTTG TTATCAGCGA 13260

GTGGTTGGCT GGGATTGAGA GAGATTCGAA GATACAAAAA AATAGTAAAA TAGAAAAGAG 13320

GATTGAGTTA AAAATGAAAA AAAGTTTGCT GGGGTTGATT GTTTTTTTAT CCTTACTGAC 13380

CCTTACTTCT TGCAGAAATA AAGTGACCAC AAAAGAGCTG ATGGCAAACG AGTGGGcCGT 13440

AAACTCCAAC GTTGATGAAG TAGTAATGAT TGTATCATTC aGCGAaGATA CCGCTACTTT 13500

CAAAATCAAT ACAGnTGAAC ACaCATCAAC TGCAAAAAAT GAGTTGGnAA AAGCAGGCGA 13560

AGAATTAGGT AAACAAATTG CAAATAAAAT AGAATACAAA GTCAAATACC ATCTAAAAAA 13620

CAATCAAATT CGTTGGGAAA ATGAAGGAAA AGAAGTAGCT TACAAGATAA AAAAAGAAAA 13680

GCAAGATCTA CTTTTCACTC CTACTAAGAC AAACAATTCT GATAACCAAA CAAAACTAGT 13740

TTTGAAACCT TACACAAAGA AAAGTATTGA TTCTTCTACT CAAAAAGATA AAACGGAAGA 13800

AACCAGCTCT AATTATCAAA ACGTCTCATC TGAAACAAAC CAATCTACCT CTTCATCTAC 13860

TACTAAAGAA CCGCTACCAC AAGTTAGCTT AGCTGATTTT ATAGGCGGTT GGGGTATTCC 13920

TCAAAGTGAT AACTTATTTT TTATAAATGC TGACGGAACA CTCACTAGCA TAACTCAATC 13980

GAATGTTCCT CTTCAAAATG TAAGTTTTTC TGTGGATGAG AATGGTAATC AAATAATGAC 14040

GTTTCTTTTG AATAATACGC CCCGAACAGT AACGAAAAAT AATGATGGTA CTTTAACTGT 14100

TAATGGACAA ATATACACTT ATCTAGGTAA TATTACGTTG GAACAATTAA TTGAAAGAAA 14160

TAATCAAACT CAACAAGTTT TTGAACAATC TGAACAGCAA CCACCACAAA ACTCTGATTC 14220

TCGTGAACAA ATACAAAATT CTAAATCaGA CCAACCTATA TACGATACGG TACGAAGTGG 14280

TGAAGGTGGG CGACAGTTAG CCGAAAGAAA TGGTTTAACC TTAGAAGAAT TATTAGCATT 14340

AAATCCAGGC ATTGAAACTT CTGTTTTTTA TCCTGGTCAG TCATTACGAA TTAAATAGAA 14400

ATTTAGGTAT ACTAATAGAT AATAGTCTAT AAAAGTAGAA CTGGACCATC TACCATTTAC 14460

TTGGGATACC AGTTGATGGC AATAGTTTTG ACGAAGTGGA ACTAGACCTT GTACCTACTG 14520

GCTCAGCAAA AATTGCGACA TTTGATATAG CAGTTGGGCA TTGATTAACA GCATGCATAT 14580

AATTTGATAA TATGGAATAT ACACAAAAAT GTCTAAGGAG CCTAACATTC TTTCAAGAAA 14640

GAATGTTAGG CtCCTTTTAT TCAGACAATT TCAGCCACTC CTTGCGATTG CAATAGGACA 14700

TAGTTTTTAC AAAGACAAAA TGATAAAGCA AACCGTAAGC TAGCCCTAAC ATATTTAACA 14760

AAATTCTTAA GTAAAGACTT GTTTCTAGCC CAAATATCAT AGAAGCTGCC ATGATAGCAC 14820

CAACACAATT AAAAATCGTT TTACTTTTAT AAGATGTGCA TAAGCCTAAA CAAAGTCCTC 14880

CTAAAATACC AAGTAGGTTC GTTGGTATAT AAAATAACAA GATAGCCGAT ACTAGAGAAC 14940

CAATAACGAC CCCTATTATT CGTTCTTTTG CTCGTTCAGA CAATTTAAAC GTCTCATACC 15000 

CAGAAAATAA TGACGAACTA GCAAATGTTG CCCACATAAA GCGGTCAATC TGAAGGTGCG 15060 

TTCCTATAAA AAGTAATAAG CTAATGCCTA AAGCGTAATA ACCAAACCAA ATATTTCTTT 15120 

GATTAAAAAA ACCATTTTCT GTAACCATCT GTATAAAAGT AATCTCTTGA TCCAACTTTT 15180 

TATGTTTCAC ATGATAAACA AAAGCTAAAA GTAAATAGGC AAACACTAAT ACAAAGAAAG 15240

TTTGTTCTAA TTGCTGAAAC GATTGATAGT GAACCGTGCC AACTAAATAT AAGTAGGAGA 15300

ACGTATACAA GCCCGGATTA CCCATTTTAG GGTTTTTACC AGTTAAGAAA AACAATGCCA 15360 

GCAAGCAGAT AAAATGAAGA CCCCATTGTA AAAATGATAC AGAAATCAGT GAAATTAGTG 15420

GACTAACACC TAGAATTGTT AACACGATTC CTAAACTAAG CAAGGCCTGT TTTTCCCTGT 15480 

AGCCATATGA GACGAAACGG ATACTCAACA GCAAACAAAA TAATACAATT GAAAAAGGCG 15540

CTACAGCTTT TCCAAAGAAA AAAGTAATCG TCGACACCCA TAAAATCGCA AACGAAACTA 15600 

ATAAAATATC CCTTATTAAC AAGGCGCGCC AAAAATATCT CCGCTGTTTT TTAGTGTCCG 15660 

CTTGATGAAT TTTTTGTTTT AAAATAAATG GATCTAACTG CAATAATTGA TAAAAAGTCA 15720 

CCTAAAATCA TCTCCTAAAA ATCGACTCTT CTCAATTCAT TATTCCAACT TTTCCytCtA 15780 

TGaAATCaTA AGaAAAgaAG TkGTGACGAA CATTTTAcTA CaACTTTTTT TCTGAATCAA 15840 

TACATGCTTT ATTTCTTATT CCaATACCtT TCCCgATACG ATTTGCTTAC TTCATAAAAA 15900 

ATAAGCCTAG AAAAAATCAA AACGATTAAT TCTAGGCTCA AAATTAAATA ACTGTTGAAC 15960 

AAAAGctAGC GCTTTTTTAC ATAAACTTTT CCAACAGATC CATTTCTTTT TCTAATTCTT 16020 

TTTCATCATA CTTGTCATAC TTACCAGCTT CGTGTGCAAT TTTTTTAATC TCATGAATGG 16080 

CTTTTTTTTC AACAATCCAT TTCTTCATAC TATGTTTTTT TGTATCCTCA TCTAGCGAGC 16140

TCAACTCTTC TAAACTTGCA TCCAATTTGT TCAAAACGTC GCAATTTTTG CTAATGCTTG 16200 

CGCTTCTTTT TCTTCATAGT TTGACATAAT AGACACCTCT TCTTAAGTAT TTTTCTACTT 16260 

TAAGTGTACG TCTCTGTCGT TTTATTTGCA ACTAATAACT CAACTGCTTC TCGGTACGGA 16320

GGATAAGGGA TTCGAACCCT TGCACGATGT TACTCGCcTA ACGGTTTTCG AGACCGTCCC 16380

CTTCAGCCAA ACTTGGGTAA TCCTCCCTCT ACATTCCATC TACAAATGAA CATGTAATGA 16440

ATCTATAGAT AAAAAGCCTA GACCCTTACA CAACAAGGAT TCTAGGCTCT ATCTAACTAA 16500

ATTATTTAGT TTCACGGTGT AATTTTTCTG CAACACCATA TTTAGACAGA CTAGAACTCA 16560

CAAGGGTTTG CGTAATATCA TTTTAATTCA AATTAGCAAA ATAGACTATA ACAACAAGAT 16620

AAAATCTACA TCAAAAAATT AAAAAAGGCA AGCGAATAGA TATATAAATC ATACCGAATA 16680

AACAAAAAAA GCTCTACTTC CTGTGATTGA GAAGTAGAGC TTTTAATTTA TCTATTCAGC 16740

ACCGTTAACA CTAAGGTGCA GATACCTAAT AAAGTTAAAC TAAATGATAA TAGTTCTTTC 16800

TTTTGCAAGC TCATGTTTTT CTTCATCTCG TTAATTCCCT TTAACACTAT TATAGCGACG 16860

ACAATAAAAA GTATATAGTA CGTTCGCATA CCTGAAGCCC AAACAGTAGG AGAAAAACCC 16920

ATAATTAATC TTGAACAAAA ACCCATCATC ATTAACACTA CTAAAAAAAC AGCAGATTTT 16980

TTATCATCAA AACTTATATA AATTCCTATA ATTAAACATA ATAATAATCC CAAAACTATA 17040

AAAGTAGCTA CCCAAGTTCC AGGGTGATAA ATTGATAGTT TAGTTCCAAA TCTTGTAAAG 17100

ATATTCCCCA GATTGTTGCT ATTCCAAATC ATTGATCTTT TATTTCCATT GACATAAGTA 17160

AAACTTTGTC CCATTGTGTT TCCGAAAAAA GTAATCAAAT TTAGAAAAAG CGGCAATGCG 17220

GTTnATATAC GTATATAATA GTTCTGACAT TTCTTATAAG ATAGAAAGAA TAATATAAAA 17280

AAGAGTAATA AAAATAAAAT ATTTGTGTCT AAAAAAAACG GTTTTCCAAA TGATGATAGG 17340

GAGCGTCAAT AATTTTGTGT AAATAAATTG TCCTCCTGCA AAATAATTAG TTACTCAGTA 17400

AACATTGAAA CTAATGTATC GGTTACCTGT TGAAAACCTT TATGGCTTCT GTTTAGAAAT 17460

TTTTGATTGT ATGTATCAAA AATGCTGACT AGAAAGCGTT CTAGTGATTC TTCATTTTGA 17520

AACTGCTCTT TTCTACGGCT GTATCTTTTA ATTTGCTTAT TGAAAGACTC GATTAGATTG 17580

GTTGAGTAAA TGGTTCTACG AATGCyAGGT GGrAAATCaT AAAAaGTTAA TAAGTCyTGG 17640

TTTTCTATGA GTGACTGCGT CACTTTAGGA TAGTTTnnCT TCCATnTCTC AATCATGCCG 17700

GATAAGAAGG TATTCGCTTC TTCAnTGAGT TAGCTTGGAT AAACAGCCCT AAAGTCATCA 17760

CAGA 17764

(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 
GGGATTAACT TGCCTCnTAA GTTTAGTGCA GATTnCGATA CAAAATTAAG TGCTGAAGAA 60

ATAGCTACTC TTGAAAAAAC GGCnCTAGAA ATGAACAAGA ATTTTCCAAC GAGCAAAGAA 120

GATGAAAAAA ATAAAGACGT GATGTGGGAT ATTCAACATT TAAGTGCAGA TCAGAAAAAA 180

GAATTAAGTG TATACACAAC AGAATTATTA AATGATGTGG GAAAAAAATT GGGTCTTTCT 240

CAATTAAGTG TATCGGACCA ATCAATCAAA TTTGCTTGGG ATATTGCGAA ATATTCTGAT 300

ACCGGAGAAT ACATGCATGA TGTGATAGCC ATTAATAAAG CAGCTAAGGA AAATGGCTTT 360

AAAGAATATC CTGGAATGAA TTACTATGAA AACCTAGGTG GGGGGTATTA CGAAACTGAA 420

AATGGTAAAG TTTCTAAATA TACTCTCCAA GAAAGTATCC GCAAAATGCT TGTCAATATG 480

TTGTTTGATG ATGGCCGCTT AGGTTATAGC CATTTACATT CTTTATTACA AGACGGAAAA 540

ACAGCATTAG GCGTTTCACT ATCTGGTGAA AAAAATTCTA TTTCTCCTAA GATTCATATT 600

ATTTCTTATG GTAAAGAAAA ACTGGAGGAT AGTAGTCAAT ACCAAAACGG CGAAGTGGCA 660

AGCATGAAAT CTAAAGAAGA ATTACAACAA GAGATAGCTA GCAACCAAGA AAAATTAGCT 720

ACCGCACAAC AAGCAGAATC AGACGCACAA CAAGCAAGAA GTGCAAGTCA GCAAGCCTTA 780

AATACAGCCA AAACAACACA AGCAACAGCA GAAAAAGAAC TATCTGTTCA TAAAGCGACA 840

TTGGCTAATC TTCAAGCAGT TGCGACTAAr AGTACGACAA ACTATGAAGA AAAAGTACGA 900

CAAACTGCrr CGGCAGAAAC AAACCTTCAA CAAACAAAAG ATCAATTGGC AACAATCAAT 960

GAGTTAATTC AGAATCGAGC TGTTGTTTTA GAAAAAGCGA AAACAAAAGT TGCAGACGCA 1020

CAAGCGATTG AACAAACGTC TGCTAAAGTG CTGAAAGAAA AACAAGAAGC CCAAAAAGCA 1080

GAAGAAAACA CATTGAATAG CTTGAAGGAA GTATTGGATT TAGCAAAAGA AAACTTAAAT 1140

CAAAAACAAG TTGCGCTTAA AACAAGTACA CGTTCATTGG CTCGTTTAGA AAATGCTCAA 1200

CCAACATACG AAAAAGCAGT GAACGAGTTA AACAAAGCAG AAGCGGCAGT GGTCCAAGCA 1260

AAAGAAGCCT ATGAAAATTC TCTGAAATCA TTGGAGGAAC TCAAAGAACA ACAAGCCGTT 1320

GCTACACTTG CTTATACACA AGCACAAGAA GACCTTTCTA ATGCGAAGTT AGAGCTACAG 1380

CAGTACCAAG GCATATTAAG AGACTTAGAG GCACAACAAG CCGAACAGCG CCGACAAGAA 1440

GCGTTGCAAG AGCAAGTAGC AAAAGAGCAA CAACGCCTTG AACGAGAAGC AAAGCAACAA 1500

CAAATGTTAG TAGCAAGTGC TACTTCAGCA GATAAAACAC CTGGTCTCCA ACAGTTATCT 1560

TTTTCTAAAC AAAAAGAACA GCCAAAAGCA CAAACACTAA CACATTCAGA ACCTCGTAAG 1620

ACGAAACAAG TAGCAAAAGC CCAAGAATCC TTACCACATA CAGGAGAACA AAAAAGTATC 1680

TGGTTGACTA TTTTTGGATT ATTCATGGCA GTAGGTGCGA TCAGTTTCAA GAACAAAAGA 1740

CGGAAAAATA GTTAAAAGTA TAATAGGTGA GGAGCGAAAT TTTCGCTCCT TTTTATTTTA 1800

AATACATGAC ATGAAAGGAG AAAACATTGG CAAAGAAGAT AAACCTTGTT TCCATATTTT 1860

GTTCTCACGA AGCACTAGAA CAAATTTATA TTCTATCCTA CCCGTATTGT TTAATTAAGC 1920

AATATCTATT CAAGAGAATC GAAGATATTT TTGGTTCTTT TTTTATTGGA GAAATGACAT 1980

TGCTGTGGCT GAATAGAAGT CGCTTAAAAT AAAGCAGTAG AATCCCCTTA TTGTTTTATT 2040

TGATTATGCA AGGTGCGAAT CCTTGCCAGC AAGTAGGGTT TACGGTAAAT GATCCCGACT 2100

GCCGGTTAGC GAAAACCGTA TAGAGATTGA GCGGTAGTGT GGCAAGCAAC AGCTCTTATA 2160

CCATTCATAA GCCTACCGCA AAAAGAATGG CAATCTCAAA AAAACAGGAA ACTGAGAGGT 2220

CGCTCCTTTT AGGTTGGCGT GTAgCATCTG GGATGCAACG GACATGAAGG AAGATAAGAG 2280

GTAGGTTCGA CTCCTACCAC GCCAGTAAGT AGCTTTGCTA CGAGAGAAAA AGAATAATAG 2340

GAGGAATAAC AATGAAAAAT TATGATCCAA ATATCCGATG GGGCTTGCAT ACAATCAAAG 2400

TAAGTTTCCA ACGAGGTGTA TACAAAGGAT TTGTTACCTT TGTAAAAAGT GGAAATTGTA 2460

AAGGGTTAGA CGTATTAGGC ATAGACGAAG AAGATTTATA TGACATGAAA TTTAAAGAAA 2520

ATCCAATCAA TTTTAGACTG CTTGGTGAAG ATGATAATGG CGATGAATGG TTCGCAATGA 2580

CTTTGAAGAA TGACAAAAAA GATGAATTGT TAGTTGAAGA TGTGTGGGAA GAGTTAAGTG 2640

AGTACATTGT GAGGATTGAA ATTATTGATT TTGAGGAGGA AAAATAGAGG AGCGAGAAAA 2700

TACTCGCTCC TCTTTTATAT ATAGAGAGGC TACCCAGCAA TGATAACTAG GTAGCCACAG 2760

ACATAAAATT ACTATTAATC TGGGAATAAA TTATTTATAC CATGAAAAGG GTTACTTATC 2820

AAGAAAAAAT ACCAATGAAT AGGAGAATGT GGAATGAAAT AAAGTAATAC ATCTATAAAA 2880

AAGAGAGGAG GAAAAAGTAT GACTTGTGTA CGTGCGCCAT GTCGGAGAAA TCGTTGTTTG 2940

TAAAGAAAAG AATAATAGGA ATTAAAAAAT ATAATTTACG AGGAGAAGGA TTCATGAAGC 3000

AACAAACAGA AGTAAAGAAA CGTTTTAAAA TGTATAAGGC AAAGAAGCAT TGGGTGGTAG 3060

CCCCTATTCT TTTTATAGGT GTGTTAGGAG TTGTAGGATT AGCTACTGAT GATGTACAAG 3120

CTGCGGAATT AGATACGCAA CCAGGAACAA CGACGGTGCA ACCCGATAAC CCCGATCCGC 3180

AGgTAGGTAG TACAACACCT AAGACAGCAG TAACTGAAGA AGCAACAGTA CmAAAAGACA 3240

CTACTTCTCA ACCGACCAAa GTAGAAGAAG TAGCGTCTGA AAAAAATGGA GCTGAACAGA 3300

GTTCAGCTAC TCCAAATGAT ACCACAA 3327



(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

TTCGCTGGTA CGTTTTTtCA ATACTTAATA ATGCACTAAC GTTATATAAA ATAAACCGAG 60 

TTGCATCATC TACACTGATG CCCTCTTGTA AGCAATTGGT TTCAGGTTCA CTTTCAAAAA 120 

TAGCCGCAAT ATTATTGATA TAGGTTTTTT TGAAAATTTG ATACAGCGTA CGATTGTAAT 180 

CAGGGCCCAT TTCTTGAAAC ATATCGTTCA ACATTAAGAA AAGATTGGTC GGATGTTCAT 240

CTAACAAAAC TTTGATTAAT TGATAAAGTT TTTCTTCTAC TGATGCATTT CCAGAAACAA 300

TTGGGACTAA TTCTTCCTGA ACACGGGACG TTAATTCTTC AATCACAGCT AAATACAGTT 360

CTTTTTTATT TTTAAAATGG TGATAGAGAT TCGGTTGGGT AATGTtCGCT TTTAAAGCAA 420

TTTCACGAGT GGAGGtATTT TgAACCCCTT tGTCATGAAT AATTCAGAAG CGACAGATAA 480

AATAATTTGC CGTGTACGTC TAnATCAGCT 510

(2) INFORMATION FOR SEQ ID NO: 304:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2047 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304:

GACATAAGTC GTTTTGACTT ATGTCCTATT TTCGTTTTTT ATTTCCAAAC ATTTTACGAG 60 

AATATTTTAA AATTTAGCGT ATACTTAATA TAAGCAAGTT AGGAGGGCTT ATTTATGAAA 120 

CCATATGTTT ATGTAATCAG TGATATTCAT GGACAAGCAG ATTTGTTTGA TGCCTTACTC 180 

ACAGACTATG ACCCAGTCGA GCATCAGCTA GTTTTAATTG GTGATTTAAA CGACCGTGGT 240

TCGGATTCTA AAGCCTGTTT TCTCAAAGGA AAAGAACTTG TCGAGCAACA CGGCGCTGTT 300

TATTTACGAG GcAATCATGA AGAATATTTT TTGCAGTTTC TTCAAAATCC AGAGGACTGG 360

TTTGCAGGAT ATGTTCGCAA TGGTGGCAAG GAAACAATTG AAAGCCTCTT GCATCCAGGA 420

GCTACTGCAG AATACTCACC AACAGAAATG GCCTTAATGA TCCGTAGTCG CTATCCAGAG 480

TTAATTGATT TTTTAACAAA ACGACCACTT TATTTTGAAT GGCAGCATTA CTTATTTGTG 540

CATGCTGGGG TTGATTTAAC GATGGAAGAT TGGCGGCAAA CAGCACCTAA GGATTTTTTG 600 

TGGATTCGCG AACCCTTTCA TCAAGGGAAA AATAATACAG GCAAGACCAT TGTTTTTGGT 660 

CATACCATTA CTCCGATGTT ACATGGTGAT ATGCAGACCA CGGACCTTTG GCAAAGTGAT 720 

GGAAAAATTG GGATAGATGG TGGTGCTATT TTTGGTGGCT CTGTGCATGG TGTGATTTTT 780 

AATGAAAAAG GCATTGTCCA AGATATCGAA TATCAAAAAC GCACCCcTGC GTGGCAACCA 840

GAATTTTGAT TCCTTAATCG TATGAATCAG TTACCTCACT TCTATATTTA GAAGCAAAAA 900 

TATGCTAAAA TAGATAAGTA TGCATAAGAA AAGAGGTATC TAGTTAATGG CAGAAGCTTA 960 

TAACGAAGAA GTAGTAGCGC TACTTCAAAA AGAATTGACA ACGTATCGTT CAAAACAAAT 1020 

AACAACGGTT TTAACTTTAT TAAATGAAGG AAATACAGTT CCTTTTATTG CCCGTTACCG 1080 

GAAAGAAATG ACTGGAAGTT TAGATGAAGT CCAAATCCGT GAAATAGAAG AACGTTATCA 1140 

TTATTTACAA AACTTGAAAA AGCGGAAAGA AGAAGTTCTC CGTTTAATTG AAGAGCAAGG 1200 

AAAACTAACA AAAGAACTCA AAACAGATAT TCAAAAAGCA GTAAAAATGC AACAAGTGGA 1260 

AGATTTgTAT CGACCATATA AACAAAAACg TCgGACCAAA GCGACTATCG CGAAAGAAAA 1320 

AGGGTTGGAA CCATTAGCAG ATTGGTTGCT CTCTTTACCT GAAAACGCGG ATATCTTAGC 1380 

CAAAGCGGCT ACTTTTATTA ATGAAGAAAA AGAAGTTGCA ACCGCCGAGA TAGCCTTGCA 1440

AGGTGCTCAT GAAATTTTAG CAGAACGTAT TAGCGATGAA CCAAAATACC GAATATGGTT 1500

ACGTGATTAT ATGGTTAAGC ATGCGCAATA CGTTAGCGTA GTTAAGGATG AAGAAAAAGA 1560

CGAAAAGCGG ACCTATGAAA TGTATTACGA TTTTGCAGAA CCAGTTAGCA AAATGGTGCC 1620 

GCACCGTGTC TTAGCAACGA ATCGTGGTGA AAAAGAAGAC ATTTTAAAGG TTTCCCTTGT 1680 

CGCTGATGAA ACaAAAATTA ATGATTATTT CCAACGTCAA CTAATCGGCA AGCAAGCAAC 1740

AAGTTTAGcT GcTCCATACA TTGAGGCTGC TTATTTAGAT AGCTATAAAC GATTCATTGG 1800 

ACCAGCGATT GAACGGGAGA TTCGCAATGA ATTAAcAGAA AAAGCTGATG AACAAGCGAT 1860 

TGCAATTTTC GGTGAAAATT TACGTAATTT ATTATTACAG TCACCACTTA AAGGAAAAGT 1920

TGTTCTAGGA TTTGACCCGG CTTATCGGAC AGGCTGTAAA TTAGCAGTTG TCGATGAAAC 1980

TGGTAAAGTT TTAGCAATTC CAAGTGATTT ATCCACATAA ACCAGCGACT GCTGCGAAAC 2040

GTGAAGC 2047 
(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305:



CnCCCGGGGG AAATCAnGAn GAATTTCCCG AAATTGCTCT TTAATTTTTG GAGAATTTCG 60

GCTTATTTCC GAAGGAGTTG GTTCTGTTCC CGCCGTTTAT CAGTTTTTGA GGGTGGAGCA 120

AAAATCCAAA GTGATTTTTG TCCCACGCTC ATTTTTCTTA AATGAAAGTT TTTAAAAAAT 180

TTCCATGATT TTTCAAGTAA AATGAGGTGG GCACTTTTGT TCGATTCCAT TTTTATGTAT 240

AATGAGAAGT GATGGAAAGA GGGAAGTTAA ATGGCACAAA AAAGAGCAAC AAATAAGAAA 300

AAAAGTACGA CAAAGAAAAA GACCAAAAAA CAACAGCAAC AACAAGAACG TTTAAATTAT 360

ATGTTTCTTG GCCTGATTTT TATTTTATTC GGTGTCTTTG GACTCTTTCG CTTAGGGTTT 420

TTAGGCACGT TATTAGCGAA TTGTTTACGC TTAGTTGTAG GTAATACCTT TCCATTTGCA 480

GCGATTTTAC TAATCCTGTA TGGCTTATTA GTCATGATTT ATGGGAAAGA TTTTCCGTTA 540

AAAAGAGGGC GCCCTATTTT TGGGGCTGTT CTGATTTATA TTAGTGTTTT GTTATTTTTC 600

CATGCCTTTA TGTTTCGAAA TGTTAGTGGC AGTCAACCAG ATATTTTAGG TAACACTTGG 660

GAATTTTTGC AAAGCGATCT AAAAGCTAAC CAAGTGACGC AAAACGTAGG GGGCGGTATG 720

ATTGGCGCAC TGCTTTATCA AGGAACGTAC TTTTTGGTGG CACAATTTGG TAGCTATTTA 780

ATTGCCACCT TGCTACTGTT AGCGGGTATT TTCTTGATGA GTATGTGGGA TTTTCAGCmA 840

ATTGTTGATC ATTTCCAAAG TATTCAAGAT CGCCTAAGTC ATGTTTCGGC AAAATCACAG 900

GCGCGACAAG AAGAAAAAGA GGCGAAgCGA gCAGCGAAGA AAGAAGCCAA AGCTGCAGAA 960

CGCCAAGCAA AAATTGAAGC CGCTGCCCAA CAAAAATTAC AAGAGCGGGA ACGCATGGAA 1020

CAAGCGGCAG CCGAGCGCTT AACAAAAACA CCTGTTGAAA CCCACCAACC AATGGTGGAA 1080

GAACCGGCCG CACCAACGCC TGTTCAAATT GATTCGTTTC AACAGCAAAA TCAGGCGATG 1140

CCTGTACCAC CAATTGCAGC AACAAAACCG CAACGAGAGC AGGAAGAAGA GGCGGCAGAT 1200 

GAAGCTGGTG TTCTCGAATT TGAAATTTCT GAAGAAGCGG AAGATCGTGA TTATCAACTA 1260

CCACCAACTG ATTTGTTAGA TACAATTCAA GCAACGGATC AAAGTGGTGA ATATGAAAAA 1320 

ATCGAAAAAA ATATTGGCGT CTTAGAACAA ACCTTCmAGA GTTTTGGTGT TGATGCTAAA 1380 

GTAGTCAAAG CTAGTTTAGG ACCATCTGTC ACAAAATTTG AAGTTCAACC CGCTGTTGGG 1440

GTAAAAGTTA GCAAAATTGT TAACTTGACG GATGACATTG CTTtAGCGCt TGCGGCCAAA 1500 

GACGTTCGTA TGGAGGCTCC GATTCCTGGG AAATCACTAA TCGGGATTGA AGTTCCTAAT 1560 

AGTGCAATTA GTACAGTATC CTTTAGAGAT ATTGTTGAGG CACAACCAAG TCATCCAGAT 1620 

AAACTATTAG AAGTACCTTT AGgTCGAGAT ATTTCAGGGA TGGTTCAAAC TGCAGATCTA 1680 

TCAAAAATGC CGCATTTATT AATTGCGGGA TCGACAGGGa GTGGGAAATC GGTTGCTATC 1740

AATGGCATCA TCACAGGTAT TTTGATGCAA GCGAAACCTC ATGaAGTAAA ATTAATGATG 1800

ATTGACCCGA AAAtGGTAGA ATTGAATGTG TATAACGGGa TTCCTCATTT ATTAACGCCA 1860

GTGGnTTACT AATCCGAGAA AAGCGGCCCA AGCCCtGCAA AAAGTAGTTC AAGAAATGGA 1920 

ATTTCGTTAT GAAAAATTTG CAGCAACAGG CGTCCGAAAT ATCACTGGTT ACAATCAACT 1980 

AATTCAACAG AAAAATGCGG AAGATGGCGA AAATCGTCCA ATTCTACCAT TTATTGTTGT 2040 

CATTGTTGAT GAGTTAGCTG ATTTAATGAT GGTGGCAAGT AACGAAGTAG AAGATGCGAT 2100 

TATCCGTTTA GCCCAAATGG CACGGGCAGC AGGTATTCAT ATGATTTTAG CGACACAACG 2160 

GCCAAGTGTG GATGTCATTA CAGGAATCAT CAAAGCAAAT GTCCCTTCTC GAATGGCTTT 2220 

TGCGGTCTCA AGTGGGACAG ACTCGCGTAC CATTATTGAT ACCAACGGGG CGGAGAAACT 2280 

ATTAGGTCGA GGCGATATGC TCTTCTTACC AATGGGAGAA AATAAACCAA TTCGTATCCA 2340

AGGGGCCTTT ATCTCTGATC AAGAAGTCGA ACGAGTGGTT GCGTTTGTAA CGGATCAACA 2400

AGAAGCTGAA TATCAAGAAA GCATGATGCC AACAGATGAA CCGACCACTT CAGGCGGTGG 2460

CGAAGCTCCT CAAGATGAGT TGTTTGAAGA AGCGAAAAAC TTGGTGGTTG AAATGCAAAC 2520 

AGCTAGTATT TCATTGCTAC AAAGACGTTT TAGAATTGGC TATAATCGTG CGGCTCGCTT 2580 

AGTAGATGAA TTGGAAGCAC ACGGAGTTAT TGGACCATCA GAAGGTAGCA AACCAAGAAA 2640

AGTCTTTCTA CAAGCGGAAT CAGAAGAGGC AGCAACCGAG ACGCCAGAAC AATAAACAAA 2700

AAGAACAAAA ATAGATAAAT TTATCTATTT TTGTTCTTTT TTTATAATTT CAGCAAAGCA 2760

TCATGTTATA ATCATCATTT TCTGGTAAAA TAGAAAGGAA TAGTGCGGTT GGGAGGGAGC 2820 

GTATCATGAG AAGAAAAAGC GTGCTTTATT TAGAAGTAGC AGATCAGATT AAAGAAGATA 2880 

TTTTGAGTGG CAAGTATCCA GTAGGAACGT TTTTACCAAC AGAAACAGAA TTGGAAGAGC 2940

TGTTTAACGT TAGTAAAATT ACGATTCGGC GGGCAATCGA AATGTTAGCA ACAGAAGAAT 3000 

TTGTTGAGAA AAAAAGTGGT CGAGGCACAA CTGTGTTAAG CAATCGTCCC TATAATAAAC 3060

TATCAAAAGC AGGGACATTC ACTGAATTTT TGAATGAATC AGGTCAGAAA GTAACTAAAA 3120

AAGTTTTACA AGTTGAAAAC TTGACGTTGT CAAAAGAGAT GCCGGCTTAT CAATTTTTAG 3180

GAGAAGAAGT GGTTCATCTT TCACGGTTAT ATCTAATCGA TGAACAACCC TATATTTATT 3240

TTAATTACTA TTTGCCAGCT GCCATGAAAG ACGTATCATT GACTGAATAC AAAGAAGAAT 3300

CCTTGTATCG TTTGATGGAT CGACATGCCA TTGAAATTTA TAAATTTGAA GACCGTTTTG 3360 

AAGTTGCAAC TTTGACAGAA GCAGAGCAAA AGATGTTGCA AACAACTGAA ACAACGGGCT 3420 

TAAAAAGAAT TCGGCGTTCG TTAAGCCCAA CAGGTCGTTG CGTAGAATAT TCAGAAGCTA 3480

TTTATAATAC GAAGATTCAT CCTTATGTGA TTGAATATGA AGCATAACTT TTAAAAGAAT 3540

AACAGTGCAC TTACAGTAAT GTAAGTTCAT TGTTATTCTT TTCTATTGTC GTTTTTTTTC 3600 

GTTTTCGATA CACTAAATGA GTGAGAAGAA TAGGAGTGGA AAATATGTTA GAAGTACAAA 3660 

ATTTAAAAAA AGTTTACGGA AACGAAATTA AATATGAAGC CTTAAAAGGG ATTAATTTAA 3720 

CTGTCCAAGA TGGTGAATTT ATTGGTATTA TGGGACCTTC TGGAAGTGGG AAAAGTACAC 3780 

TTTTAAATTT ATTGGCAACA ATCGATAGTC CAACAGACGG TGAAATACTT TTAAATGGAA 3840 

AAAATCCAAA CAATTTGAAT CAAGAACAAA TTGCTAAATT CCGTCGTACA GAATTAGGAT 3900 

TTGTTTTTCA AAGCtTTAAT TtAATGCCAA CATTAACTGT TGAAGAAAAT ATTATTTTGC 3960 

CATTGACACT TGATGGAGAA AAAGTCTCTG TAATGAAACG ACAATTAGCA GAATTAAGTG 4020

AGCGTTTAGG TATCAATCAT CTGTTGAAAA AACGAATTGC AGAAATTTCT GGTGGACAAG 4080

CACAACGGGT CGCCGTTGCT CGGGCAATGA TTCATCATCC ACAATTATTA TTGGCTGATG 4140

AACCAACAGG AAACTTAGAT ACGAAATCAT CTAAGGATGT GATGGGCTTG TTACAACAAT 4200

TAAATGAAGA AGAAGCGGCG ACTATTTTAA TGGTGACCCA TGATCCGCTA GCCGCAAGTT 4260

ATTGCAAGCG CATCGTCTTT ATTAAAGATG GTGAATTAAT TGATGAAATT GTCCAAAATG 4320

GCAATCAAAA AGAGTTCTAC GATTTAATCA TGGTGAAACT GGCTGAGATT GAAGGTGTCG 4380

ATAATGAATT TTAATCAGTT TGTTATAAGA AACACGATTC GAAATAAGCA TTTATACTTG 4440

GCTTATTTCC TAAGTACAAT GTTTTCTGTG ATGATCTTTT TTACGTTCAC TGGTTTTGTC 4500

TTTCACCCAG CCTTAGCTAA TGGTTTAAAT CCTAAAGCAC AAATGGGAAT GACGGCTGCG 4560

GCAATTATAA TCTATGGTTT CTCTTTCTTA TTCGTTTTAT ACTCGATGGA TGTCTTTATC 4620

CAATCTCGAA AAAAAGAGTT TGGAACCTTG ATGATTCAAG GGATGAGTCC AAAACAGTTA 4680

AAGAAAATGA TTTTTATTGA AAACTTAGTG ATTGGTTTCT TTGCTACAAT TTTTGGTAGC 4740

ATTTTAGGTG TTGGATTTTC ACAGTTTATT TTATGGATTA GCAATTTATT GATGCATCTA 4800

GGTTTAGGTT TTTATTTGCC AGTGATGCCG TTTATCATTA CTGTCATTTC ATTTGGGGTC 4860

TTGTTCTTAG TAATTTCTTT CTTTATTCAA TTCCGCTTGC CAAAAGCGAC ACTTCAAGAG 4920

CTATTAAAAG CAGGAGAAAT GGGTAAAGGT GAAATCAAAA GCTCTAAAGT TAAATCTTTC 4980

TTAGCCGTGT TACTTTTAAT TGTCGGTTAC GGAATTGCTT TAGTTGCTAA GGGACAGTTA 5040

GTTTTAATGG TGATGTTCCC AGTAATTTTC TTAGTTATTT TAGGGACAAA ATTCTTATTT 5100

GmCCAATTGA GTGTTtCTGT CATTGAaCGT TTAAAmCGGA AaCCAAAAaT CTTTTGGAAA 5160 

AAAmCCAATA TGGTCGTTTT ATCAGATTTA GCATTCCGCA TGAAAGATAA TGCACGTTCG 5220 

TTCTTCTTGG TTTCGGTGAT ATCAACGGTT GCTTTTGCTG CCATTGGAAC GTTATACGGC 5280 

GTGAACGAGA TGATTTTTAA AGGAATAAGT TCTGTACCGT ACGAACTGAC ATTGTCTG 5338 
(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 



TGAATTCGAA GAAGAACTCC CTGCAGAACC TATTTCCAAA TTAGGTGATA TTTATCAATT 60

AGGAAGGCAC CGTTTAATGT GCGGGGACAG TACAAATTCT TTAGAAGTAG AAAAATTAAT 120

GGGCAATAAA AAAGCCGATC TTTTGATTAC TGATCCTCCA TACAATGTAG CGTACGAAGG 180

TAAAGGAAAA GAAGCACTAA CTATTAAAAA TGATAGTAAA GAAACGAATG AATTTCATTC 240

ATTTTTATAT GAAGCTTTTA GTGCAGCCAT AAATAATATG AAATTAGGAA GTTCATTCTA 300

TGTGTGGTAT GCCTCATCAG AAGTAGTGAA TTTTCATACT GCTTTAGAAG AAGCTGGGTT 360

TTTAGTAAAA CAAGAATTGA TTTGGAATAA AAATAGTATG GTTTTAAGTC GTCAAGATTA 420

CCATTGGAAA CATGAACCTT GCTTATATGG TTGGGCTTCT GGGGGcAGTC ATTCTTGGTA 480

TTCAGATAGA AAGCAGACTA CTATTCTTAA TTTTGACAGG CCTACAGTTA ATAAGGAACA 540

CCCTACTATG AAGCCAGTTG CACTATTCGA TTATCAAATT AAAAATAGCA GTAAGCAGGG 600

AGACTGTATT TTAGATTTGT tTGGAGGtTC TGGTACTACG TTAATAGCAT GTGAGCAGAA 660

TGaAAGAGAA GCCTATTTAA TGGAGTTAGA TCCAAGATAT GTTGACGTCA TTATAGCTAG 720

ATGGGAAGCA TTCACAGGAG AAGTGGCTGT GAAGATATCA GGTAATGATA CGGCGGTGGT 780

TGATGGTGGC TAGAAATGAT AATTTAAAAC CATTTTCAGA GCGAAGCGTG GATGAAGCTr 840

GAGAGCTTGG AAGAAAAGGC GGTAAAGCAA GCGGAGAAGT TAGACGAAAA AAAGCTGACT 900

TGAAACGAGC TATTTCTATT GTTTTATCAT CAGAAGTACC AAGCTCTAAA ATGGCTAAGA 960

CGCTAAAAGA AATGGGATAT GAGAACACTA ACGAGATGGC CATGGTCTTA TCTATGACGC 1020

AAAAAGCAAT TAAGGGAGAT GTCAAAGCAG CATCCTGGAT TTCCAATATT ATTCAACCTG 1080

CAAAGGTAGA GCATGAAGTT GAAATGAGTG TGGGCGTTGA TGAGAAGCGG AAAGTAGCAG 1140

AAGAATACAT TAGGGGCCTG TTTAATAATG ACACTGGAAA TAGCACAGAA GAGAACAATT 1200

AAGCTTTTAA AAAGTTCAAC GCCCAAAGAA AAACTCAATA AGTTTGTAAA AGGTTATGTT 1260

CCAACTCACT ACAAGAGATT AAGTATATCA ATGGAAAAAG CAATAGAACT AGCCATAATT 1320

GGGGCAACCG AAAGTCTAGC ATATTATGGT GATCGATTAT ATTTTACGCA GGCTCTATTA 1380

ATGGGGGCTG TGGTAAGTGG AGAGTATGAC"AACATTATTG TCGTTACCCC TTCACAGTAT 1440 

GGTAAAAGCT GGTTGAGTTC GAGAATAGCT GTTTGGCTCG CTGATCACAA TCGGCGTTGT 1500 

TACGTGGCGG GTGGTAAAAA AGATACCACT GATATTATCA TGCAACATGT TACAGATACA 1560 

CTACAGACTG TTGATGAATC AATTGCAAGA AAATTATTAG AGCCTGTTGA TAAGCTAGAA 1620 

AGACTTCAAA CGGGTTTATC CAAAAGAAAG ATTTCTTTCA GTGGCGGAGG ATCGATTGAA 1680 

GGGATTTCAT TAGGTGAACA TTTCAAAGGA AACAAATCTG GAAACCAAGC GATTGGTCGT 1740

GGTGGAGATT ATATTATTGA TGAATCAGCG TTCGTCTCAA ATGAAACATA TGCTGAGCTT 1800 

GGTCGTAGAA ATTTTGCAAA TGTGGATGGT AAAAACTATC TATCTTTTGA AATATCTAAC 1860 

CCACATAATA AAGGTCGATT TTATGACAAA CTAACTCAAG AAAATATTCC AAAAGGCATG 1920 

TTGGTTGTAT GGGCAGATGT TAGGACCGCT TTTGAAGAAG ATCGAGTTAA AAGTATTGAA 1980 

CAAGTAATAA GTTCTGAATT TTTTCAAAAT AAATCTACAT GCCAACGTTA CTTTTTATGT 2040

GAGCTTCCAG ACGAAAATGA AGATGGAATG TTTGGGACAC CTCAAACAGA AGAAGAACAT 2100 

ACCGAAAAAA ATTGGGAGTA TTTCCTTGGT GTAGACAGTG CTTATAAAGG AAAAGATAAA 2160

ATCAAAGCCA CGTTATCAGC ATTAGATGCA CAAGGACAAG TACATGTTAT AGACACTATA 2220 

GAAATTGAAA AAGGTGACTG GCAAGACGGT GTGACTAGTA AAAAGATAAT TACTCAATTG 2280 

TTGATGATTA TAGAACATTT TGAAGTTAAA GGTGTATGTG TCGACGTAGG TTACGGTGTT 2340 

TATATTGTTG AAGGTTTAGC ACATATTAAC GGAGATTTCG AATTACACGG CATAAATTTC 2400 

GGTGCTGGTA CAACTAAAGA AAGGGTGGAA AAAAACCACT ACTCGGCAAA ATATGGGGCA 2460

AATAAGCGTG CTGAAATGCA CATTGACTTA CAAGAAAACA TTGATAACAG AAATATATTT 2520 

TTCACTGAAA AAGTATATGA AGAAGTTATA GATGAACTAG TTCTTGTAAG TAGCaAGATC 2580 

aAGTCTAACG GAAAAACAGC CATTGTTCCA AAGGAAGAAA TCmAAGCTAA GTTGGGCCAC 2640

TCACCAGATA CACTTGATTC AGTTCTGCTA TCGCTACATG CGATTATTCT ATATAAATTA 2700 

AACGAACGAT TCTATATCTA TTCTTGATGA AAGGAGGGAA ATGAATGTCT GAAACTGAAT 2760

TAGTTGGGAA AGATGAACTA TTAAAAGCGA TGACTATGTG TAATAGTTGT CCAGATTTTA 2820 

ATTTAAACGA TTTAAGAGGT AAGGGTGATA ACAACTATCA GCTTTATGAT TGGTTAATTC 2880 

ATAATTTGCC AACAGCACAA TATGTTCTTG GTAAGTTAGT AGAGCTTATT TTTTCTAATA 2940

ACTTAACTAC AGGCGATGAA AAACAAGATG AAATATTGAA TAACTTTTTA TATGGTCAAA 3000 

CGAATCCAGA AGGAGTTACC AACTATCACG TACTTGTTCA ATCAATTAAG GAATCAATTG 3060 

TATACGGTCG ATCTGGTTTA CGTTTTTTAT CTAAAGATGA TGGATTGATT AACGTAAAGT 3120 

GTAATCATTT TGGCGTTGCT CAAATATTGA ATAAAGAACA TTACGGATAT AAAGAATTGA 3180 

TTGGTTTCGT TATTGACAAA AAAGGTCGAG CTATTACAGA TGTCGATCTC CGTGAAGGAG 3240 

AAATTGATTC AGAAGAATAC TTTAAAAAAG GAATATTTGT TTTTAAAAAC AATGACAATA 3300 



WO 98/50555 PCT/US98/08985 

1372 

TTTTATTGCC ACCCGAAAAA TTCGTTAATT TAAGGGTTGA TACATCTACT CCCAAAGGAT 3360 

CGAGTGTCTT TGATTCGGAT ATCCAACGAG TCCTGCTTAT AGCTTCGGTA TATAAACGAC 3420

TACTATATGA TATCGAATAT GATGGCGCAG GAAGATTAAT TTTCTGGGCA GATAATGCCA 3480

ATAGTAATGA AGAAAGCTCA AATAACTTTT TGAACGACAC TGAATCGGCA ACCAAACGAC 3540

GGCAGGATAA ATATAAAAAA GAAGTTGAAG AAATAATGAA GCTCGTAAAA GACAGCAATT 3600 

CAACAAGCGT TTTAGCAGTT TCAAATGCTT TTAAAAAGAT GGACCACTTG CCACGTGTAA 3660 

CTAAAGCGAC GGAGTTT ~.TT AGTTATTTAA ATcAGGAAGG CGAGATTATG GCGCAAGTAT 3720

TCGGTGTTCC TAATGTTCTG CTTGGATTAG GTAAAATCAG TGGGAATATA TCAATGGAAA 3780 

AAGTAATAGA TAATGCAATG CTTAATTCAA TTATTCCTTT ACGTGAAAAT ATTGCGACTC 3840 

AAATTTCGAG CATATTAACA AATAATTTAA AAGTTCCAAA AGTCTATTTT GACAAATATG 3900 

AACTAAAATC TCAGTCAGAT ATAAATGATA GGCGATTGAA AGTTTTAACT GTTGCTGAGC 3960 

GACTTAAAGC ACTAGGTAAA GAGGATTTAG CTAATAAAAT TATTGAGGAG GAAATTCAGT 4020 

TATGAGTATT TTAGAAGATC TAAGCAAAGC AAAAAAGAAA GCAAAACCAT ATGCAGTAGT 4080

CGGTGGACGT GAGGTTTATG ATTATGACTC TTTAGAGAAA AAAGTTGAAA TCGATCAAGC 4140

CGAAGCGAAA GTTGGTGGTG GCCAAGTAGA TTTAGGGAAA ATGAAACCGA CAAAAGATGG 4200

ATGGGGGTAT ACCGACATGG GAAATAGTTT CTCGGCAATT CCTCAAGATA TCTTATTTAT 4260

CaATCGTTAT AAAAAAGAAA aTGATGCTTA TTTAATCGTA ACAGACTATC GTGcCATTAA 4320

AGAACAGTCT AGTGGTCAAa TTTATGCATC AAGTGTTCAA GCGCTAGTTA TTAAAAATaA 4380

AGGCAAAAAA GGTGaAGAAG AGATGTATCT CGAAAGTATT CGAAATGTAT CTGATACTGA 4440

ATTTATCAAC GATTTTACTG GAGAATTGTC AAACATTTCA ATGGCAAAAG TTTTTGAAGC 4500

TATCGATAAA GACAAGGTGA AAGAAGTAAG CAAAGATGAA ATCAGTTTCT AATATTGCTT 4560

TAGAGTACAC CACAACAAGC AATGAACAAA AGCAACGTGA ACTATTATTG CTATTAGTTT 4620

CTTACTTCCT AACTCTTTAT GATATGGAAA AAGAAAACTT TGCTGATGAA CTTGGTATTT 4680

CCAGTGAGTT TGTAGAAGAT GCTCAAATTA AGTTTGATTA CATGAAGAAA ATTGAACTGA 4740

CATTGCAAAA TATGCGTGAG ACTGTACAGA GTGAAAAAGA CAAAGACAGT GAAGCTCTAA 4800

CTGCTCTATA TTTTAATCGA ATTTTAAATA CAGATGGAAA AAAAGCTAAA GAATTAGCTC 4860

AAATCGAAAC TGCAAAACAT TTAACTCGAC TTGATCGTTC TAAGAAAATT AGAAAGAGAT 4920

GGAAAGCATT TTCAGGATGC TGTGATGTGT GTAGAGCAAT GGATGGAGTA ACAGTAGCTC 4980

TTGATGAGCC ATTCATGTAT CAAGGTCAAG TTGTTGAATT GAGCAGTGGT GAACGTTATA 5040

TCAACAATTA CGCAGCGATG GACACGCCAA ATGCTCATCC AAACGATAAA TGCTCTATTG 5100

AATTTATTAT CGAATAATTG AAGGGGGGTG TAAACTATGG CGAAAATTAA GTTACCAATC 5160

GTAACAATTA ACGCAAAAGG TGGCGCTCAA TATGTTGTTG AAGGCGTCGA AGCAGTAAAT 5220

GCGTTAACTC AATTAGAAAA TTATCCTGAA CGTGGTATTC GTATCAAAGT TGATGGGAAA 5280

ATGACAGTAA TTACAGAAGG TTGTTTATGT TQTGCAGCTG TTACTGGAGA rGTTGAAGTA 5340

GATGTTCCTG AAATCGTXXG . XGAAGAAGTA GAGTGTGAAC CAATTTCCAA TATTTTACCA 5400 

AATCCAAAAA ATCCAACTGA TCCAGGGGAA AATTCAAAAC CAGAAGAAAA AGCTGGCGAA 5460

AAATAAAACT AGGAAGTGTC AGTATGAAAC AATTTAAAAA CAATTTAGAG ATGTTTCGTT 5520 

ACATTGATGC CATTAATAAA GAGCCTAGTA GAGAATTTAA AATTGAGCCA TTACGAGAAG 5580

AGATTATTAT TGAACCTATC GAAGAAGACG TAGAATCAAA AGCTAATAAA AAAGCTAAAA 5640

ATAAGAAGGG GGCGAAGGAA TGAACTACTT TAAATTAGTT GACGGTATTC GTTCCCCTCA 5700 

GTCTATCGAT GTAGTACGTT CAGAAAACGG ATATAAAAAA TTTGGTTGGA TACGTGTCCT 5760

TCCAGATGAA CGATATCCGT TGGGAGACGA TGAAGCATTT ATTCAATCAT TAGAAAATGC 5820

TAGTGTTGAA AAGCTTTACT CCGACAAATT AGTTACTGAG CTCGAAAATA ACGGAATTCA 5880

ATTTGAAGTC TTTAACGGCG GATGTTGTGG CGGAAAAATC AAAAAAGTAA GTTATAAGAT 5940

CATCGATATT GTTAGAGATG AGGTGTAACA TGTTTGATTT TATCAAGAAA CGTGATGCTC 6000

AAACGCAAGC ACGAAAACGT ATGAAAAAGG ATTTAGACGA AATTTTCGAA TTTAATAATG 6060 

AAGAAAAACA GCAACATACA TTGAACCACA TTGTTCAACT AGCTAATAAT GATTTGAAGG 6120 

AAATTGGCTG GGTTCGATTA CTGGATGAAG GAACAGTATT ATACGGTGAT GGAAGTATTC 6180 

GAAGTTATAT AAAACGTGGA ACTATAAAAG AATTTTATGA CTCTTTAGAA GATGATTATA 6240

TTGGCTATAT CAATATTGGA CACATTAATT TTGCTACGTT ACCAATATTT GTTGGTCAGT 6300 

GGACTAAAAA TGATTTGCGC TTAGTAGATA TTGGAGAAAA CCGTCAAGCA TTAGAAGTCA 6360 

ATATGAAAAT TGATGAAAGC TTATCTGTTA TCCAAGATTT GCAAAAAATG CCATATACAA 6420

TTGGTATTTC AGCCGAATTT ATGGCAAGTT ATGACGAAGA ATTnTCTTAC GAATATGAAT 6480

TTCCAGTTAT TGAACATCTT TTTATCATGG GATTTGGCAT CGTTGGAGAT GTAGGTAATG 6540

TGAATAGTAG CGGTATAAAT TTATCCGCAG AGGAGGCAGA CAAAATGGCT TTAGCAGATT 6600 

TATTTGGCAA GAAAAAAGAA AATGATCAAG AAACGATCAA AGAACCTGAA ACTAAACAAA 6660 

CAGAATCAAA AGAAGAGCCC AAAGAACCTG AAGTGGCAAC AAAAAAATCA GAGGAACAGA 6720 

ATGTTGAAGA AAATGACAAA AAAGAATCTG AAGCGGAACT TTCCGAAAAA GAAGACCATA 6780 

CATTCGAACA ACTTTATAAT TTATCTATGG AGCAAAATGa AAAAATGGTT GTTGAGCTTG 6840

AACAGTTACG TGCTGaAAAT aAATCTTTaA AAGAAGaAAA ATTGgAAGCA AGAACAAACA 6900 

AATGaAAAAG CTGTTCAACG CTTGGAAAAA TTGATGAATC GTATTGAAGT TTCAGCGCTG 6960 

CCACAAGCAA CTGGATCTAC TAAAAATAAA TGGGGGGAAT AACAGATGGT ATTAGAAGTT 7020 

AATCAAGCCA TCGTTAATGA ATCGTTACAA ACAGAGCAAG TTATGGAACA ACTAAGCTCT 7080 

ACTGTAGATG TCATTGTCGA TAATATCGAA AAATATACaA ATGCTGCTCG TTATGGTAAA 7140

GGAAATAATT ATGCTTTAAA TAACTTGCGA ACAGCAGTTC AAAACCAATT TCCTTTAGTT 7200 

GATTGGTTAT TAAGCACAAG CCTATCTCAA ATGATTGAGA ACGCTTGGCA AAATGGCTCA 7260




TTGcCAACTG TAACTGATGA AGATGGAAAT GTTTATTTAA AAGCACCTTT AAATGTATTT 7320 

ACGACACCAC CAAAAGATAC TAAAGGTGAA TGCTGTTGGT.. TACCTTJTGA JTATTGCAGCA 7380 

TGTGGGGGCA AAGCACCAAT TAATATTCTA TGTTTAAAAG ATTGTGTGGA TATGTTAAAT 7440

CACTTATTGG ACCGTAAATT AAAAGTTCAA TCGAATGATT TAATTGGTTT CTTTAAGCAA 7500

GCAGGGCAGA CTTATGAAGA AGTTCGTGAT TTCATGAATC GTGAAAGTAT GGCTTTCTAT 7560

ACTGCCAATA CAATTGTTAA TGGACAACTA GATGTTACAA CACCTATCTT AAAAAAATTC 7620

CATGGATTAA TGGAAATTTT AAAACGCCCA GAAGTTTTCA AAATGCAAGG AACAAATATT 7680

TTAGCAGCAT TTGATTCTAT TGGATATCGC ATGGACGTTC TAGGTGGATC ATTCATTTTT 7740

GCAGCTCATC CATTAACTGT AGCAAGCATT AAAAATGCCA TTCGTCCTAA TCGTTACGGT 7800 

ATTCTTCCAG ACGGCTGGAC CATTAACGGA GAAAGTATTT TCTATAAAGG AGCACAAGTT 7860 

TTACCTGATA AAAcTGTACC AGTTGATGTT GAAAATGGCA CAGGCGTTAT TTGGCAATTG 7920

TCTGGTGAAT CCGTAGTGTA TTCTTGGGGA CAACTTtGCG TCCTGCTGAA GACTATATTA 7980

TTCGCAATCa ATTTACAACA ACAAACGATG TAAATAAAGG ATGCGCAACA GAATGTGATA 8040

TTTATTACAA CTTAGGAAGT GTAGTTACAA ATAATGTAGC TCGATTAGCT GTAATTACAG 8100 

ATGTTCCTTT GGCTGCTGGA tCAACGCCTA CTCATTAGCt AATTTATTAG ATCGaGTATA 8160

TGTAGAnACA TTAGCTCCCT AGAAAGTAAG tGCTtACCTT ATGGATGAAA TCATTGAGCA 8220

ACTGAAAGAA TTTtGCGATt GTTTTCCTTG TGATGTAGAG GACAACAAGT TAGAGAAAAC 8280 

AGTTAAAGAA GCAATTCATT TAATAAGTTT ATTAACTTGT TGGaCTCAAC GTCCTTGCGA 8340

AACATTTTTA ATGAGTGAGA GACAAGAAGT ATTTGATATG GACAATTACT TACCTTGTTC 8400

GTGTGATGAT GGAATTATGG AATTAGATTT ATTCTATGCA CCATTTGCTT TAGCATCATT 8460

CCGAGTTTTT TCTGTACATC GAGAAGGTGT AAAAGAAATC ATTAGAGAAT TAGATGAAAA 8520 

AGAATTTGGT TATAGCGTTG TAAAAGACAA ATTACTTGTT GATATTCGAA ACTATGCAAA 8580

TAGGGAAAAC GGATGTTGTG TTTGTAAAAA AGAACACCAG TTGCTAGTCT TATATGATGC 8640

TGGATTTGAA GAGTTACCTC AATGTTTATT ACAACTTTTC TGCGATTTAA TTCATGTAAT 8700 

TTACAACAAG AATAACTGTG ATTGTCATGC TTGTGCTACT TGTCAAGATA ATTCAGATAG 8760

CGGCTTTATT GCAAATGAAG CGATGACAAC AGATGAGTTA GTAGAAAGCT ACCTCAATAA 8820 

ACTAGTAATA GACAGCTACC GAAAACAATT AGGTCTTATT AGCCTTTGTG GTAAAAGTCT 8880 

AGAACAGATT TGGGGGATTA GAGTATGAAA GTTCGATTTT TAGGCGTTCG TGAACACATT 8940

AGCGCTACAG GTTGTTCGTC CTGTGGAGCG AAAAGGTATG CCAGTGGTTT AAAGACAGAA 9000

AAGACATTTT TCTTACCAAG CGGAAAAAGA TTAGATGTTG AGTTTAATCA TGAATATGAA 9060

ACCACTGATA CAGACGGAAT GTTTTTGCTT GAAAGTGGGC ATTTCGAGGA AGTGTTTTAA 9120

ATGGCACTAA AAAAGATAGT TATTCAATCA ATTGAAAAAG AGATAGATAA TTATGCAAAA 9180 

AAAATGGAAA AAATTATAAA AGAAGAAGCT CACGTTAAAA CAGGAGCGTT GAGAGATTCG 9240

ATAACAATAG AAAAGGAAAG TGACGGGAGT AQATTAATAG GAGTTGATGT CGCGAAACTG 9300

AAATCCGATC CTCGCAATGT TGGAGGCTTA GATTACTCTA TTCCTTATTA CAAAGGTCAT 9360

AGTGGCTACA CGATAAGGCC AAGAAAGGCA AAGGCTTTGA GTTGGGTTGG TAAAGATGGC 9420

AAACGTCATT TTGCTAAAAG TGTTTATATA CCACCCCACG CAGGTGATCC TTTTTTGAAG 9480 

CGAGCTGTGT TACGTAGACC AAAATTATAG GAGGTATATC AAATGGCGAA ACGTGCAATG 9540 

AATGCATTAG CAGCAGATGG TGAAACGACT TATCAGTTAT CAGCTAAAGA CTATACAATC 9600 

GGTGATTCTG AAGTCACAGG TGTGTATGAT AATGAAAAAG CTGTAACTAT TAAATTATTT 9660 

GTAGACGACG TAGCAGTGGA TGAGATCACA CCTGATAATT CAAAAAATAT TTATGCTATT 9720

TCTACAAGTA AAACTACTAT TGTTAAAGAT AGTAAAGTGG AAGTTGCAGA ATATGATGCA 9780

GATAAGAATG AATTAACAAA AATTCTAGTT ACTGTAATTG ACCCAAATGG AGGAGGAAAT 9840

GAGATGGATG AAAAAGAAAA GATTGATAAA TTCATTTCAC GAAAATTAAC AGTATTAAAT 9900 

GAAAAAGACG GTATTGTCTA TGAACAGCTA GCAATCCGAG TTATTCAAGT TAATCAAAAA 9960 

TAATAGGAGG AATTAGACAT GGGAAAATGT AATACAGAGC AAGTTCTTTC AATGATTGGT 10020 

GTTAACAAAT TAACGAAGGC ACAGGAACTG TTTTTTTCGG TTCTACAGGA CAATGACTCT 10080 

TGTGTAAAAA TTAAAACCAG TGACTACTTG GAAATCATTG GAGACAAATC AGCATTCAAT 10140 

AAATATTTAC GACCAGAGGA TGCATTTAAT TGCTTAGCAG AAGGATGTAG AAACACAGGT 10200 

GGATTATTAA TTACAGGCAA CGAATTTCCG TTAGGGGCTA CCTTTAAAAA AGTTACTGAT 10260

GCGACTGATT TTTATGCTGG TGCAACCACT TTTTATTTAG ATTTGCCAAA AGATGGAACA 10320 

TACACTATCG AATTTAAAAT TGCAGCAATT AATGACAATA GTTTTGTAAA TGCTGATGTA 10380 

TATAGAAAAA AATTCACTGG AACTAAGGGC TATAATCCTA TTTTTATTGA TTTTTCAGTC 10440 

GTTCCTGAAG AAGTTTTAGG CGAAGGTTGG cAAGCAAATG AACGTGGTGT TTATGTGTCA 10500 

ATTACTGTGA CAACTGAAGA AGAAATTCCA TTAAAACAAA TTCATATTTC TTCGATTAGT 10560 

TTTTACAACT CAATCGAAGA ATTACAAAAT GATGAAGTTG TAACGATTGG ATGTATTACA 10620

GAATACGGTG GAGACATGAC TATGGATGTT GCCGATAGTG TATGTTTTGG TGCTAAGTAT 10680 

GATCCATCAA GCGCTAGTAT TACTCGTACA TTCACAGGTG GCAAAACAAG CGGAAACTAC 10740 

TGGTTACTTA ATCCATTTAT GCGTCGAGGG GATTTATCAA AAGGGTGGAC AGTTGTTAAA 10800 

GAGAAAGATA AAGTTCGTGA ACTCACTATT GACGGCCGAC GTTATGGGTA TATCTTGTTA 10860 

AACGGTTTAT CAAAACAAGA ATGCTCTTTC TCTAAAGCTT TAGTTGCAAG TGAATGTAAC 10920 

TTTACTGaTG CAGAATTAAC TAAAGTCAAC CTCCCaGATG TAGCAGTATt AAACGAAAAA 10980 

CAATACCAAA TTATTAAGCA TGGCGAATAT GATGGATATT TAATTGTTCA TGAACGTTTG 11040

ATCGGACAGC CATTGTTATA TGCATACCCT AAAGAGGTTT CTATTGAGCA ATACGTCGGT 11100

GAAGATGACG CATATGAAGG ACGCCGTGTA CGTCTATTCT TCCCAACTGT TCAGACAGAT 11160 

GGTGTGAAAG TGAACTATAT TTTCAACAAT GTATTAGTTA CTTCATTCCC TACGACTTTA 11220 
AGTAATACAG ATGAAACTAC ATTTGAATTT G^AGTATCTA TTCAAAAGGA TAACAATGGT 11280

CGATTCTTTG AAGTTCAGAA AATTATTGAA TAATTATTTA AGAAAAGGGG ATTTTTGTAA 11340

TGAAACAAAG CGATTTAACG AAAATGATTA CAAAAAATGA TGTAATTGAT ATGAAACATA 11400 

AAATGGATAA AACTCAGGAT ACAAGTAAGC CATATGCAGT AATTGACAAC GATAATATTG 11460

CTGTTGTTGG TGACGCAAAC GAGATTCAAA AAGTTGAAGA AAACTATGTG ATCAAGTTTC 11520 

GTGTTCCAAA AGAATTCtTT GAAGAAtTCC GTATGGAGCA aCTACTGTAG ATAAATATGT 11580 

TATTTTCGAA GTTGAATACA GTAATGCAAG TGTAACAGGT ATGAATGATT TAAAAATTGT 11640 

AGATGCACTT TTGAACATTC AACCATTTTT GAAGGAGTTT TACcAAAAgA CGAAGACGGC 11700 

AAAGTAACTA TTGkGGAAAa GACAGaTCGA GAaGTyCTtA AAATGTtATC AACATCAGAG 11760

GACGAAGTTA TTtaTGGGTT TTATAAAGTG GTGGCAGCTT TcTTAGGTGt AGATGAAGAG 11820 

TTAATCGAAT ACATGCTTCC ATTTTCAGTG ATTG 11854 
(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4888 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

AGTGTTTTTC ATCGAGAAAG TTAAAAAAGT TTTTTTGTTA TAAATAACAT ACTGTTTATT 60 

TAAAGTGGTG TATACCACTn AATTGAGCAT TTTAAATAAA AAAATAAAAA AGGGTATACA 120 

AATGCAATCG TTTGCGCTAA AATGTGnTCG AGTATTAGAA AATACACGAA AAGCGACAAT 180 

TAAAGGAGAG AnTTAAATGA ACACTGCTGC AATCTACCAT CGTCCAGAAA GTGACTTTGC 240

TTATCTTgTA TACAGaAGAA ACCATGcATA TTCGTTTGCG TACGGCAAAA GGCGACATTG 300 

cCTCTGTCCA ATTAGTGcAA GGAGATCCTT ATCTTTTAGG GAAAGAAAAG TGGTACCAAC 360 

AGTCGTTAAC CATGGAGAAG TTAGTATCTA CGGAGTTATA TGATTACTGG TTTATTGCAT 420

TAAGTGCAAA ATTTAAACGC CTTTCATATG CTTTTACTTT AGTAGGGGCG GATGGTTTGA 480

GGGCGTTTTA TGGCGAACAC GGCATTTATC CTTTAGAAGA AAAGTATTTA GAGATGGCGA 540

ATAATTATTT TCGGATGCCG TATTTTCATG AAATCGATCG TTTTAAAGCA CCTGAATGGG 600 

TAAAAGAAAC GGTTTGGTAT CAAATATTTC CAGAGCGTTT TGCAAATGGT GATCTCGCCA 660 

ATGATCCTGA AGGGACGTTG CCGTGGGGAT CTAAAAGACC AGATCGACAA GATTTTTTTG 720 

GTGGGGATTT GCAAGGGGTG CTAGATCATC TTGATTACAT TGAAGATTTA GGTGTTAACG 780

GCTTATATTT TTGCCCAATT TTTGAAGCGT TTTCTAATCA CAAATATGAT ACGATTGATT 840

ATTTAAAAAT TGACCCAGCT TTTGGTGATG GTGCTACCTT TAAGCGTTTA GTCGAAGAGT 900 

GTCATCGACG GGGCATCAAG ATTATGTTGG ACGCCGTTTT TAACCATATG GGGGACACTT 960 

CGCCTCAATG GCAAGATGTG TTAAAAAATG GCCAGGACTC AATTTATGCC GATTGGTTCC 1020 
ACGTAAATGA GTTTCCAGCT AGCTATGAAG AAAGTGCTGA TTTTGAAGAA GCAAGCAACA 1080

TTACCTATGA TACTTTTGCA TTCACTCCGC ATATGCCTAA ATTAAATACA GCAAATCCAG 1140

AAGTGCAAGC CTATCTTTTA GAAATTGCTC GTTATTGGAT TGAGGAATAT GATATTGACG 1200

CCTGGCGTTT AGATGTGGCA AACGAAGGGn ATCATGCGTT CTGGAAAAAA TTCCGGCAAA 1260

CATGTGATGA CGCAAAAGAA GATTTTTATA TTTTAGGAGA AATTTGGCAC TCGTCGCAAA 1320

GCTGGCTCCA AGGAGACGAA TTTCATGCAG TGATGAATTA TGCATACACA GATGCGATTA 1380

TGGGCTACTT TGTCAAGCAA GAACTGTCCT TGGAAAAGAT GCTTTCTGAA ATGAACAACC 1440

AGTTAATGCT CTATCGACAA CAAACGAATC AAATGcAGTT TAATGTTTTA GATTCCCACG 1500

ATACACCACG CTTATTGCAT GAAACGAAAG AGGATAAAGA GCTGATGCGC CAAGTATTAG 1560

CGTTTACGTA TATTCAACCA GGTGTCCCTT GTCTTTACTA TGGTGATGAA ATTGGTATGA 1620

CAGGTGATAT GGATCCTGAT TGTCGTCGTT GTATGGTTTG GGAGGAAAAT CAGCAAGATT 1680

TGGATTTGAA GGAATTTGTA AAATCATTGA TTGCCCTTCG CAAAAAGTAT GCGGCTGTTT 1740

TTTCTGGTGG AACAATCGAT TGGACAGGAA CTTCTTTTGA AACGGGGTTG ATTAGACTCG 1800

CTTGTCGTTT GGAGGAACAG ACGATTTATG GGATTTTTAA TACTGGCCAA GAGACACAAT 1860

TTGTCGATAG AACTGATCGA GAAGTTTTGC TAAGTTCTTC TATCGAAATT GCTTCAGAAC 1920

AGATAATGGT TTTACCAAAA GGTTTTCTTT TATATCAAGC ATAATAATTA AGCAAAGGAA 1980

TGTGAAGAAG AATGGAAAAA CATTGGTGGC AAGAAGTCGT GGTTTATCAG ATTTATCCTC 2040

GCAGTTTTAA GGATrGCAAT GGAGATGGCA TTGGAGATTT ACCAGGAATT ATTGAAAAAT 2100

TGGATTATTT GGAAACGTTA GGCATTGGGG CTATTTGGTT ATCACCAGTT TATCAATCAC 2160

CCAATGATGA CAATGGTTAT GACATTTCTG ATTATGAAGC GATTATGACA GAGTTTGGGA 2220

CGATGGCGGA TATGGATCGT CTAATTGAGG AAGCGAAAAA GCGAAAGATT GAAATTATTA 2280

TGGACTTAGT GGTTAACCAT ACGTCAGATG AACATCGTTG GTTTATTGAA GCGAAAAAAA 2340

GCAAGGAGAA CCCCTACCGA GACTATTATG TATGGGCTGA TCCTGCTTCA GATGGCGGTG 2400

CGCCTAACCG ATTGAAATCT GCCTTTTCGG GTTCGGCGTG GACGTTTGAT GAAGCGAGTG 2460

GGCAATACTA TTTGCATTTG TTTAGTAAAA AACAACCAGA TTTAAATTGG GGAAACCAAC 2520

AAATGCGTCA GTCGGTTTAC GAGATGATGA ATTTTTGGAT TGATAAAGGA ATTGGCGGTT 2580

TCCGTTTAGA TGTAATTGAC TTAGTGGGTA AAATTCCAGG AGAAGAGATC ACCGCGAATG 2640

GGCCTCATTT ACATCGTTAT TTACAAGAAA TGAACGCCGC GACATTTGGC GGCAAAGAGC 2700

TGTTAACTGT AGGTGAAACA TGGGGGGCCA CACCTGAGAT TGCAAAAATG TACTCTAGTC 2760

CAGAGCGTCA TGAATTATCT ATGATTTTTC AGTTTGAACA TATGAGCTTG GATCAACAAC 2820

CAGGTAAAGA AAAATGGGAC CTTCAGCCAA TGGAAGTTGC CAAGTTGAAA CAGGTTTTTG 2880

CAAAATGGCA AACAGAATTA GGAAATGATG GCTGGAATTC TTTATTTTGG AATAACCATG 2940

ATTTACCGAG AATGATTTCA CGTTGGGGAA ATGATCAGGA ATATTGGTTG GAAAGTAGTA 3000
AGTTATTCGC TATTTTACTG CACATGATGA AAGGCACACC TTATATTTAT CAAGGGGAAG 3060

AAATTGGTAT GACCAATACG CCGATTACAG ATATTCGTGA AGCGCGGGAC ATCGAAACGA 3120

TTAACATGTA CCATGAGTAT CTAGAAAAAG GATATTCTAA AGAAGAAATT TTGCTGAAAA 3180

TTAATACTAA AGGTCGTGAT AATGCACGCC GGCCTATGCA GTGGACCGCT GAAAAAAATG 3240

CGGGCTTTAC GACTGGTACA CCTTGGATTG ATGTTAATCC CAATTATCAA ACGATTAACG 3300

TCGCTGCTGC TTTGGCAGAT AAAAATTCAC TATTCTATAC GTACCAAGAA ATGATTCGTT 3360

TGAGAAAAGA ACATCCTTTA ATCGTTTGGG GCAACTTTGA ATTGCTGGAA ACAGTGGAGG 3420

AAGTGATCAG TTTTTATCGT ACCTATGGTG AGGAACGCTG GCTTGTTGTA ACGAACTTTT 3480

CTGACAAAGT TCAGCCATTT TCTGCAGACG TTTCTGTAGA ACAAGTAATG ATTGAAAATA 3540

TGCCGAOAGA TGTCACTGCT TTAGCAGACT ATTCGTTAGC GCCTTGGCAA GCATTTGTTG 3600

TGAAAGTGAG TCAATAAGAT GGACCAATGG TGGAAAAATG CTGTCGGTTA TCAAATTTAT 3660

CCTCGGAGTT TTAAAGATAG CAACGGGGAT GGCATTGGTG ATTTGCAGGG AATTATTGAG 3720

AAATTACCGT ATTTAAAAGA ACTAGGGGTC GATTTTTTAT GGTTAAATCC GATTTACACT 3780

TCGCCAAACG TTGACAATGG TTATGATATT GCGGATTATC AAGGGATTCa GCCaGAGTTT 3840

GGTACGATGG AAGATTTTCA AGAACTtTTG GACCAACGCA TCaATTAGGT TTGAAGATTA 3900

TTCTTGATTT AGTGGTCAAT CATACCAGTG ATCAACATCC GTGGTTTGTT GAAGCGAAAA 3960

AAAGTTTAGA TAATCCGTAT CGTGAGTATT ATTTATGGGC CGATGCGACA CCAGATCGTA 4020

TGCCGAATGA ATGGCAAAGC TTTTTTGGTG GCTCGACTTG GACGTATGAC GAAGGTACAA 4080

AGCAAGCATA TTtTCATGTG TTTGCAAAAG AGCAACCTGA TTTGAACTGG AAAAATCCTA 4140

AAGTTCGTGA AGAGATTTAT GCGATGATTC GCTGGTGGTT AGATTTGGGA ATTGATGGTT 4200

TCCGATTGGA TGCGATTAGT CATATCCAAA AAGAACCTTG GGATTTTAAA ATTACCACGA 4260

ATCCGTGGGC TCCATTTATG AATGTAAAAG GCATTGAAGA CTATATGCTG GATTTGAAAG 4320

TTATTTTTGC TGAGTATGAC ATCATGACAG TGGGGGAAGC TAGTGGCGTT TCAAGTAAAA 4380

AAGCGGTGGA ATGGACCAAT GATGCGGGCT ATCTGAATAT GATTTTTGAA TTAGAGCACA 4440

ATGTTCGAGA AGGAAAGCCA GGAGAAGAAC GGTTAAATAT TTTAGGCTAT AAAAAaGTAA 4500

TGGCTCGCTG GCAAAAGCAT TTAGGGACAG AAGGCTGGAA TGCATTATAT GTAGAAAATC 4560

ATGATAATCC CAGAATCAAT TCAATTTTAG GCAATGAAAC CTCTCATTCT GCTAAGGCGA 4620

TTGGAACAAT CGCCTTGCTC TTGCGAGGAA CACCGTTTAT TTATCAAGGA CAGGAAATCG 4680

GCATGGTTAA TTATCCGTTT CAACAGATTG ACGAGTTAGA TGCTAAAGAT TCTCATAACC 4740

ATTATCGTTT GCTTATmGAA AGTGGCTACG ATGCGAAcAA GCATTAmAmG AAGTGGCGCA 4800

TTGGACGCGG GACCcATTCG cGTAyGCCTa TGCAGTGGAC GAGTCaAGAG GCAAGTTCAT 4860

TCACCTCGGG GCATCCGTGG TTAGCAAT 4888


(2) INFORMATION FOR SEQ ID NO: 308: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 



AGAAGTCAAA ATTCGAACAT TGTAATCAAA TGCTGACATA TTTTTGAGTA TAGCCCGCCC 60

AAATTGTAAA AACATTGTCT ACGAAAGTAT AATACATATT TTCTTGTAGT AAGGTATAGC 120

CTTTTTCCCA CTCGGCTTGT AAAACAATCG ATTTTAAAGT AGTACCTTCT TTTTTAATTA 180

ATAAAGATAC TAATAAACTA TCATGAATTA TTTTTGTATT GTCACCCATA TAAACGAAAT 240

CAATCACTCC GCTGGCATTG CTTGTATCGT TATCTGCAAT AATAATTGAA AACATTTCAA 300

TTGCCGTCCC ATTCCCCTGG GTCTTCTCTT GTCCAAAAAT TACTTTACCT CCAmGCTCTT 360

GAATATGAAC ATTCGGAATA ATAAAGGtTT TTTTCATCCC AAAATCnTTA TAAATATTTk 420

GATATATtTC TAGATtGTAT TGTAAAATAT TTtGGATTCG TGTGATACGC TCTACTTGTT 480

CTGGATCATT AGAATATTTT TCTTTTTTCA CTAAGTCTAG CCCTAATTGT TGGATGTATT 540

TCTCTTTTAA TTCTTGATAG AAACGTTCTT TAAATGCTTG ATCAACACGA TTCATATCAA 600

AAATAAACCA GTTTAATGTA TTAATAATTT TTAACCGATA TGCTTCAATT CGATTTGGAG 660

CATGTTGTTT TAAGTACTCC AACATTTTGT CATGTAAATT GAACTCATCA CGATAATTCA 720

TTCCATACGT AGACATCAAG CCATCTTGTC TAACGATTCG TGTCAGCAAA TATTTGTTTA 780

GTGCAACAAT TTTTTCGGTA CCTAAtAATG CTGTATAAGT GAAATAAGCA TCCTCGCCAA 840

CAAATTGTTC TAAGTTTACC AAGTTATTTT CTTTGATGTA TTCTAATAAA AATAATTTAT 900

CATATAAATA AGGGGTTACT TCATCCAATG AAATTTCTTC ACCAGACATA CGTCCGCCAT 960

TAGTTAGTAA TGGATTGTGA ACAACGTGCA CTTCTCCATT TGGTAACTTT TCTTTCATGc 1020

CAAATAACAG AATATCTGTT GCACTATCAT ATGCACGATA ATAAGCGTCT TGTAATAAAT 1080

TCGCTTCAAA ATGATCATCT GCATCCAGAA ATTGCAAATA CCGTCCTCGT GCCATCGAAA 1140

TGCCCACATT TCTTGCGTGT CCAGGATTAC TTTTGCCAAT AGAAATCCCC GTAATTCTCG 1200

GATCTTTTAC TGCAAATTCG TTAATAATAT CCATGGTATT ATCTGTAGAG CCATTCTCTA 1260

CTAAAATGAT TTCAATCTCA CGTAAAGACT GCTCACAAAC AGTTTCTAAT GTTTGTCTTA 1320

AATATTTTGA aGCATTGTGA ATCGGCATGA CAACACTAAC TTTATACAGA TTTTTCAAGT 1380

AGTCCTCATA ATAAgCTGCT GGgATCATTg CAATTTGATA AAATtGTTTG ATGTATTCcA 1440

AAATCGATAT CATnCAGCTG GTCTGTTTCT AAAGCTTGtC TAAAATCTAC TGCCcCTTTT 1500

gAATCACTTC ATAAACCGAC TCATCACGTA AATGGGTTAA TTTTTCAATC GCCCAAAAGT 1560

ATTTTTCGAA CATTTGTTTA TAAAAATAAC CTTTGTAGCG TTCAAATTTa TTTJcGTTCAA 1620

TTAAATTTGA aCmAATTAGT TTAAATTGTT GAAACAAGAA AGGAmCCTTT GTTGCCACAT 1680

TATTTACTGT CGAATTTGGG TTGTCTTGTC TATAAAAATA CAwACAtCTT TAATAAAAAT 1740

AACATTCTGA GATGCTGAAT TGATTTTCCA AAATAGACCA GTGTCCCCAT AACTTTTAAT 1800

TTCCTCATCC CaAGTTACGT TTTCTTTCTC AATCATACTT TTTTTATAAA TGGCATTCCA 1860

TGGATAAACT TGTACAATTA AATCCGTCGT TTTTAAGTCA ATCAATTGAT TGTATTTATC 1920

GGGATATTGC CATAATATAT CTCGCTCAAC CTCTTCTTCA GAAGAAAATT CAACCCAATT 1980

ACAACGGACA ACATCTGCAT GATGACTTTT AGCCGTATTG TATAAACGCT CATAGGCATG 2040

CAATGCTACA TAATCGTCAC TTTCAAATTC AGCAATATAC TCACCTTTTG CTTCTGAAAC 2100

ACCCAAGTTA AAGGCTTTTC CAATTCCCCC ATTTTTAACA TTGAAAACGC GAAAACGTGG 2160

aTCATCTTGT GCAATTTCTT CAATAATTTT TTGACTATTA TCAGTAGAGC CATCATTAAT 2220

AATTAAGAAT TCAATTTCTT TCAATGTTTG GkTTTTCAAA CTCATTAATG CCTCCnCCAA 2280

ATAGGGTTCC ACGTTAATAG GACnGGnACA ATnGACTGGA CACTTTCACC ATTTTTCCCA 2340

TAAAACCAAT TTCCCCCCAC TTAACCATTA AAAGTGGGAA ACCCCAAAAT AACCCCGGAT 2400

TTAAAC 2406



(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 



TTTTGATTGG TGCCTGTTTC GGTTCTTTTT TATGCTTAGT GGCGGACGCT TACCAGTAGG 60

GCGTTCTTTG TGGTGGCcCC CTTCCCACTG TCAAGGTTGC CACCAGCCTC TACAACTCTA 120

TGAATTAATT CCAGTTGTTT CGATTTTGCT GCAGCGCTTT CGTTGTCGAA AGTGTCAACA 180

GCCAGTAGCT AAGTGTTATC TTTTAGCAGA ACTTGTCATG GGCGGGTTAA CCGCAAGCTG 240

TTTTTTTGCT GGCCTGACGA TTGATGCTTT GATTCTCTGG TGGTGGCTGA CGAGCGCTTT 300

CACATTATCC TTGATTGATT ACTGGTATTT AGTGGTTGAA CCGAAGATTC TTTATCCTAG 360

TTTTTTTGTG CTTTGCTTGC TGAAAATTGC GGGTCAGCAT TCGTTTTACT TACTGACTGG 420

TCTTTTTTGT TTTTGTTTTT TCCAAGCCGT CTTACATTAT TTTCCGGAAG CCATGGGACG 480

AGGGGATCTT TTGTTACTGG GTTTGTGGGG CTGTTTTTTA CAGGTACCTC AGTTGCTGAT 540

GCTACTATTT TTTGCCAGTA GTTATGGCCT AATCTATGGC TACAGCTGTA AATTTTTGGG 600

GTATCCAGTT GAACAGACAC TTCCCTTTGT CCCCTTTTTA AGTTTGGGCT TACTCACAAT 660

CAGTTGGTTA TAGCAAATAA AAAAAGCCGT AAGGAGCATG CTCCTTAsGG nTTTTTAGGG 720

tTTATTTATT GATGTTTTTC ATCGCATCTT CAGCAGCCAT TTGTGCTTCA ATATCTGAAA 780

TACTATAAAC GTTTTCGCTA GCTACGCCAA CTTCTTTTGG TTCCATGTTA CGGTAACGAG 840

CCATACCAGT ACCAGCTGGG ATGATTTTAC CAATGATAAC ATTTTCTTTC AATCCAAGTA 900






ATGGATCTTT TTTGCCACGG ATAGCAGCAT CTGTTAACAC GCGTGTTGTT TCTTGGAAGG 960

ATGCAGCAGA TAAGAAGCTG TTTGTTTCCA ATGACGCTTT GGTAATACCA AGTAAGACTG 1020

GACGACTAGT AGCAGGGACG CCACCAGCAA CTAATGTGTC GTAGTTACGG TCTTTAAATT 1080

CTGCGATGTC CATTAATGTT CCTGGAAGaT TTCAGTGTCA CCTGGATCCA TTACGCGGAT 1140

TTTACGTAAC ATTTGACGAA CCATTACTTC GATATGTTTG TCGCCGATTT CTACCCCTTG 1200

CATACGGTAA ACACGTTGTA CTTCACGTAA TAGGTAGTTT TCAACAGATA AAACATCACG 1260

AACTTGTAAT AATTGTTTTG GATCGATTGA ACCTTCTGTC AACGGTGCAC CACGGTGGAT 1320

GATGTCGCCT TCAGCAACTT TCATACGTGC AGTGTAAGGG ACAGTATAGG TACGTGTATC 1380

TGTTTTTCCT TTGATGGTTA CTTCTTTTTG ACGTGTGGCA GGATCTTCAG AAATATCGAT 1440

AACTTCACCG GTTACTTCTG TAATCACTGC TTGCCCTTTC GGATTACGTG CTTCAAAGAT 1500

TTCTTGGATA CGAGGTAAAC CTTGAGTGAT ATCGTCACCG GCAACCCCGC CGGTATGGAA 1560

CGTACGCATG GTTAACTGTG TACCAGGTTC CCCGATTGAT TGAGCGGCGA TTGTACCAAC 1620

TGCTTCTCCG ACTTCAACGT CAGAACCAGT TGCAAGGTTA CGTCCGTAAC AGTGCTTACA 1680

TACACCATGT TTTGTGTTAC ATGTAAAGAC AGAACGGATG GTTACTTTTT CAATACCAGC 1740

GTCCACGATT TCTCTGGCAA GATCTTCAGT GATTAATTGA TCCGCACCAA TGATGATTGC 1800

ACCAGTTTCT GGATGAACAA CTGATTTACG TGTATAACGA CCTAATAGAC GTTCGTCTAA 1860

TGGTTCGATA ATTTCGTTAC CTTCACGAAT TGCTTCGATT TCAAGACCAC GGTCAGTGCC 1920

ACAGTCGTCT TCACGAATGA TTACGTCTTG GGCAACGTCA ACTAAACGAC GTGTTAAGTA 1980

ACCTGAATCG GCTGTCTTCA AGGCCGTATC GGTCATCCCT TTACGAGCCC CGTGAGTTGA 2040

GATAAACATT TCTAAGACAG AAAGTCCTTC ACGGAAGTTT GAGATGATTG GTAATTCCAT 2100

GATTCGTCCA TTCGGAGCGG CCATCAAACC ACGCATACCA GCTAACTGTG TAAAGTTGGA 2160

GATGTTACCA CGGGCTCCAG AGTCAGACAT CATGAAGATT GGGTTACGAG CTTCCATACT 2220

TTCGATCAGT TTTTGTTGGA TTTCGTCTTT GGCACCGTTC CAAACGCCGA TTACACGTTC 2280

ATAACGTTCG TCATCAGTAA TTAAACCACG ACGGAATTGT TTAGTAATTG TTTCAACTTG 2340

TTTGTGGGCA GCATCAATGA TTGCTTGTTT TTCATGAAGA ACCATGATAT CGGCAATCCC 2400

TACTGTCATC CCAGCATAAG TTGAGTGTTT GTAACGTAAG TCTTTCATGC GGTCAAGCAT 2460

TTTAGAGGTT TCCGTAATGT GGAATCTCTT GAAGACTTCA GCAATGATAT TTCCAAGATT 2520

TTTCTTCTTG AATGGTAAAA CTAATTCTTG CTCTTTAATG TGTGCAGGAA TGTCTGTACC 2580

AGCTTCTACG AAGTATTTAT CTGGTGTTTG AACAGTTAAG TTATAGTCTG TTGGTTCGTT 2640

CAAGTAAGGG AACTCAACAG GCATGATTTC GTTAAAGATG ATCTTACCAA CTGTGGTGAT 2700

TAAGATTCGT TCTTTTTGCC AATCAGTAAA TGGTTTGTCA CCTAAAAGTG TTGTTTGAAC 2760

CCCAATCCGT GAGTGTAAAT GCACGTAGCC ATTTTGCCAT GCTAAAACAG CTTCGTTCAT 2820

GTCGCGGAAG ATCATGCCTT CCCCTTCACG GCCTTCTTCT TCCATTGTTA GGTAGTAGTT 2880
ACCTAAGACC ATATCTTGAG ATGGTGTAAC AACTGGTTTA CCATCTTTCG GGTTCAAGAT 2940

ATTTTGGGCA GCAAGCATTA ACATACGAGC TTCTGCTTGT GCTTCTTCAT TTAGAGGCAC 3000

GTGAACCGCC ATTTGGTCTC CATCGAAATC GGCATTGTAG GCTTCACATA CTAATGGGTG 3060

AAGACGAATC GCACGACCTT CAACAAGAAC AGGTTCGAAA GCTTGGATTC CTAATCTATG 3120

CAACGTAGGT GCCCGGTTAA GTAAAACTGG GTGTTCTTGG ATAACTTCTT CTAAGATATC 3180

CCAAACTTCA TCTTCGCCAC GTTCGATTTT ACGTTTCGCA TTTTTAATGT TTGTCGCAAT 3240

TTCACGTTGT ACTAATTCAC GCATTACAAA TGGTTTGAAT AATTCAATCG CCATTTCTTT 3300

TGGTAAACCA CATTGGTACA TTTTTAAGAA CGGACCAACG ACGATTACGG AACGACCAGA 3360

ATAGTCAACA CGTTTACCTA GTAAGTTTTG ACGGAAACGA CCTTGTTTCC CTTTCAACAT 3420

GTGAGAAAGA GATTTCAATG GACGGTTACC TGGTCCAGTA ACTGGACGGC CACGACGACC 3480

ATTATCGATT AAAGCATCGA CCGCTTCTTG TAACATCCGT TTTTCGTTTT GAACAATGAT 3540

GCTTGGCGCA TTTAAGTCTA ATAAACGTTT TAGACGGTTG TTACGGTTGA TAACACGGCG 3600

GTATAAGTCG TTTAAGTCAC TTGTTGCAAA ACGACCACCT TCCAATTGAA CCATCGGACG 3660

TAAATCTGGT GGAATAACTG GGATAACATC CATTACCATC CAGCTTGGTT GGTTACCAGA 3720

AGCACGGAAG GCTTCTAAGA TGTCTAAACG ACGGATGGCA CGTGTCCGTT TTTGACCAGA 3780

AGCTGTTTTT AATTCTTCTT TTAATTGCGC AACTTCGCCA TCTAAATCGA CGTTATCTAA 3840

TAATTGTTTC ACGGCTTCGG CACCCATAGC AGCTTTGAAG GCTTGGCCAT ATTGTTCACG 3900

TTTTTCACGA TATTCGCGTT CCGTTAACAA TTGTTTTTTC TCTAATGTTG TGTCACCTGG 3960

TTCGATGACA ACGTATGAAG CAAAATAAAT GACTTCTTCC AATGCACGAG GGCTCATGTC 4020

TAAAACAAGA CCCATCCGAG ATGGAATTCC TTTAAAGTAC CAGATATGTG AAACTGGTGC 4080

GGCTAATTCA ATATGTCCCA TACGTTCACG ACGTACTTTT GAACGAGTGA CTTCAACGCC 4140

ACAACGGTCA CAGACAATAC CTTTATAACG GATACGTTTA TATTTACCAC AAGCACATTC 4200

CCAGTCTTTG GTTGGACCAA AAATGCGTTC ACAGAATAAC CCTTCACGTT CGGGTTTTAA 4260

TGTACGGTAG TTAATTGTTT CGGGTTTTTT AACTTCCCCA TAAGACCAAC TTCTGATCTT 4320

TTCTGGTGAA GCAAGCCCTA TTTGCATACT TTCGAATTTA TTTACATCGA TCAAAAGGAA 4380

TTCCCTCCAT TTCCTTATTT AGAGAAGCTG AGTCAGCAAC TTGTTAGCAA ATGGCAAGCA 4440

TCACTTGAAA GAAGGTCTGA CGACACTTGG AACAAGTAAT GGCTTGCGCA GTTGCAACAA 4500

AGTCGCAGTT GAATCAACTC ATCAGAGTGG TAGCGACAAA TTAGTCTCTG TCTTCGGCTG 4560

TTTCAATTTC TTGAACAACG TCTTGTGCTT CTTCTTTCAC GACTGATTCT GCTTCTTTTT 4620

CCAGCTGTTT CGCTGATTGT TGTTCCGCAA ATTTGGTTAG TGCGTCAACG GTGATTAAAT 4680

CATCATCGTC ATCGTCCATA TCGCGTAACT CAATCTCAGC TTCCTCAATG TCCAAGACAC 4740

GCATGTCTAA ACCAAGTGAT TGTAATTCTT TCACTAATAC TCGGAATGAT TCAGGCACAC 4800

CTGGTTTTGG AATTGGTTCA CCTTTAACGA TGGCTTCGTA TGTTTTCACA CGTCCGACAA 4860
CGTCATCAGA TTTGTAAGTT AAGATTTCTT GTAATGTGTA AGCAGCACCG TAAGCTTCCA
GTGCCCAAAC TTCCATTTc'c CCGA^ACCTt" GTCCACCGAA TTGAGCTTTA CCACCCAACG 
GTTGTTGCGT AACAAGAGAG TAAGGTCCGA TTGAACGAGC ATGCAATTTG TCATCAACCA 
TGTGGGCTAA TTTAATCATA TACATGACAC CAACGGAAAT ACGGTTATCA AATGGTTCAC 
CTGTACGTCC ATCGTAAAGA ACTGTTTTAG CATCGCTAGC CATACCAGCT TCACGAACAG 
TTTCCCAAAC GTCTTCATCG GTTGCCCCAT CGAAAACAGG TGTTGCGACG TGAATGCCTA 
ATTGGCGAGC AGCCATACCT AAGTGTAATT CAAGTACTTG TCCGATATTC ATACGAGAAG 
GTACCCCTAA TGGGTTCAAC ATGATATCAA CAGGTGTTCC GTCAGGTAAG AATGGCATAT 
CTTCTTCCGG CATAATACGG GAAACAACCC CTTTATTTCC GTGACGTCCC GCCATTTTAT 
CTCCTTCGTG AATTTTACGT TTTTGAACGA TATAGACACG AACTAACATG TTGACACCTG 
GTGATAATTC ATCGCCAGCT TCACGAGTAA AGATTTTCAC ATCGTGAACG ATACCGCCGC 
CACCGTGAGG TACACGGAGA GACGTGTCAC GAACTTCGCG GGCTTTTTCC CCGAAGATTG 
CGTGTAATAA ACGTTCTTCT GCAGATAATT CTGTGACCCC TTTAGGTGTG ACTTTACCAA 
CTAGTAAGTC GCCATCTTGA ACTTCAGCAC CAATGCGGAT AATCCCCATT TCGTCTAAGT 
CTTTCAATGC GTCTTCCCCA ACGTTTGGAA TTTCACGGGT AATTTCTTCA GGTCCTAATT 
TTGTATCACG TGCTTCTGAT TCATATTCTT CAATATGCAC AGAAGTGTAA ACATCGTCTT 
TAACTAAACG ACGGCTCATG ATAATGGCAT CCTCGTAGTT GTAACCTTCC CATGTCATGA 
AGGCAACTAA GACGTTTTGC CCTAAAGCCA TTTCGCCTTC TTCCATAGAA GGTCCATCCG 
CTAAAGTATC GCCTTTTTCA ACTTTTTCAC CTAAGTGAAC AATTGGGCGT TGGTTGTAGC 
TTGTTCCTGA ATTTGAACGA CGGAATTTAG TAACCATATA TTTGTCTAAT GCGCCATTGT 
CGCGACGAAC GCGAATTTCT TTTGCATCGA CGAATTCTAC GACACCGTCA TGTTTACATA 
GTAAAGCAGC ACCTGAGTCA TGGGCTGATT TATATTCCAT CCCTGTACCT ACCCACGGAG 
AGCGTGGTTG AATTAACGGC ACCGCTTGAC GCTGCATGTT GGCACCCATC AAGGCACGGT 
TGGAGTCATC GTTTTCTAAG AAAGGAATAC ATGCTGTTGC GACTGCGACT ACTTGTTTTG 
GTGAAACGTC CATGTAGTCA ACTTTGTCTA CGGCAACTTC TAAGTTTTCA CTTTGTAGAC 
GCGCCATAAC AACATCATTG GCAAATGTGC CATCTTCATT TAAAAGTGAG TTCGCTTGCG 
CTACGATATA ATGGTCTTCG ATGTCTGCTG TTAAGTAATC TACTTGATCA GTAACACGGC 
CTGTCGCACG ATCAACACGG CGATAAGGCG TTTCGATGAA ACCAAATTTA TTCACTTTTG 
CGTAACTAGA TAAACTATTG ATCAACCCGA TATTTGGTCC CTCAGGCGTT TCAATTGGAC 
ACATACGACC ATAGTGAGAG TAGTGAACGT CACGAACTTC ATAACCGGCA CGGTCACGAG 
TCAAACCACC AGGCCCTAAG GCTGATAGAC GACGTTTATG GGTTAACTCA CCTAATGGGT 
TTGTTTGGTC CATGAACTGT GATAACTGTG AAGAACCAAA GAATTCTTTG ATACTTGCTA 
CCACTGGACG GATGTTGATT AATTGTTGTG GTGTCAATGT TTCTGTGTCT TGAATAGACA 



4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 
TTCTTTCACG AACCACACGT TCCATACGGG CTAAGCCAAT ACGGAATTGG TTTTGTAATA 6900 

ATTCACCTAC TGAACGGATA CGACGATTAC CTAAGTGGTC GATGTCATCG ACATTGCCGA 6960 

TATCTTCCAT TAAGTTGAAG AAGTAGCTCA TTGAAGCAAC GATATCTGCT GGACGAACTG 7020 

TTTTTACGCT GTCGTCTGGA TAGCCGTTAC CAATCACATT TACGATACGT TCAGGATCTT 7080 

TTGGTGAAAG AACTTGAATC ACTTGGATTG TCATTGGTTC AGTTACTACA GCATCTTCAC 7140 

TTGGATAGTA AGTTACGCTG TTTAAGCCGT TGTCAATGTA TTCGCCTAAT GTTTCCATGA 7200 

TTTGGTGTGT TAAAACCGTG CCTTTTTCGA CAATGATTTC ACCAGTTTCT GGATCAACTA 7260 

GCGTTTCCGC TAAGGTTAAG TTTAATAGAC GTGTTTTTAA GTCTAATTTT TTGTTAACTT 7320 

TGTAGCGACC AACGTTTGCC AAATCATAAC GTTTTGGATC AAAGAAACGT GCAGTTAACA 7380 

AGCTACGTGA GCTATCTGCT GTTTTTGGTT CGCCTGGGCG AAGACGTTCA TAAATGTCTT 7440 

TCAAGCCTTC TTCTGTACGA GAATCACTTG CATTTTtGTG TAAATCTTTT TCAATTGTGT 7500 

TGCGTAAGCT TTCGcTGTCG CCGAAAATTT CGAAGATGGT ATCATCTGAA CCGAAACCTA 7560 

AAGCACGAAC TAACACAGTT AAAGGAATTT TACGTGTGCG GTCAATCCGA ACaTAAGAAA 7620 

TGTCTTTCGC ATCTGTTTCC ATTTCTAACC ATGCACCACG GTTAGGAATG ACTGTTGAGC 7680 

CaAAACCTTC TTTGCCGTTT TTGTCCACTT TTCCATGGAA ATAAaCACCT GGAGAACGAA 7740 

CTAATTGGGA AaCGATAACA CGTTCTGCCC CGTTGATGAT GAAGgTACCC ATTTCTGTCA 7800 

TTAATGGGAA ATCGCCGAAG AATACTTCTT GGGATTTAAT TTCACCTGTT TCACGGTTGG 7860 

TTAAACGTAA TGTTACATGT AATGGCGCAG AATAGTTGGC ATCATGTGCG CGGGCTTCTT 7920 

CTACTGTGTA CTTTGGTTCT TTTAATTCAT AGTCAACAAA TTCTAAGGAT AAGTTTCCTT 7980 

GGAAATCATC AATTGGTAAA ATGTCTTCAA ACATTTCACG AAGTCCCTCA TCTAAAAACC 8040 

ATTGATAAGA GTCTGTTTGG ATTTCAATTA AATTTGGTAA TTCCAATACT TCACTGATAC 8100 

GTGCGAAACT TCTACGTTCG CGATGCTTTC CGTATTTTAC TACGTGTCCA GCCAAGCTCT 8160 

TCACCCCTCT AGTTATAAGT CTATCAAAAC ACCGATCCGA AAAATTACAT TTACGTTACA 8220 

TATATTAAAT ATATGTAACA AAAAGCATTT TTTGCAGTTT TTAAGCAAAA AAAAACCAAA 8280 
AATAGAACAT GCTTCCGAAA AATTTTCTGG CGTTTCTACT TTTGTTTTCC TATAAAAAAC 8340 

CGGAACGGAC AATATTTCGC TCCTGCTTCG TCACTTTCAG ATAGTTTTTG CTCTATAGAT 8400 
TCTACTACCT TTGTCTAGAA AAGTCAAGGG TTTCTTTAAG GTTTTCTTTT TCATAAAAAG 8460 

GAGTTTTTTT GTAAAGGGTT TTAATGAAAA AATGCGTAAT AAATAATTAC GCATTTTCCC 8520 
CTATCTTCCT CTTCACACAA AAAAACGAAT GCAAAGATGC ATTCGTTTTT GACTAACCGC 8580 
CACCTAATAA AAAGTCTAGG ACATTcATGC CGCCGCTTGT TTGGCTGCGG TAAAGTTCGG 8640 

TGTAGCCGCA TTGGGTACAG CTAATTGTAA TAAATTTTTT ATTTkGAmCA TCAAAAATTT 8700 
TCGCAAAGTT CCCACCTGTT GCTTGAAATT GGTCCGACAC ATAGTGCATG TTGCCACATT 8760 

TCTCACAAAC ATATTGTTGC TTTTCCATTC CCCTCACTCC TTCTCAATAA AACAAGTATA 8820 
GCAAACGCCT GCTTCATTTT CCTCCATCCT AAGATGGATT TTTTCACTAG TTGACCAAGC 8880 

CTTTTCCTTT CCGCTACAAT AAGCATAACG AATCGGAGGA AAAAATATGT CGACAAAAAG 8940 

CCAAGTCCTA ACTTTATTGA TGAAACAAAC GCCCGCATTC CTTTCGGGCT GTCTCTTGAT 9000 

CAGATCTTGA TCCCCTGCGC CATCAGATCC TTGGCGGCAA GAAAGCCATC CAGTTTACTT 9060 

TGCAGGGCTT CCCAACCTTC CCAGAGGGCG CCCCAGCTGG cAATTCCGGT TCGCTTGCTG 9120 

GTCCATAAAA CGGCCCAGTC TAAGnATnGC CATGTAAGCC ATGCnAGTAA CTGCTTCCC 9179 
(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4289 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 



CATAAGTCGA 


AACGACTTAT 


GCCCCACTTC 


CGTTTTTTTT 


ATTCCTTCAA 


TGAATTTTGA 


60 


AACGTGCTGA 


TTTTTTCAGC 


CAATCTTTGA 


TAAGTCGTAA 


AAGAcGGATG 


CTGTAGCTCT 


120 


TGTTCTTCGT 


TAACGGATAA 


AGTTTGTTGT 


GTCGTAATGG 


CCTTTGTTAG 


TGCATCTACA 


180 


AAATGAATAA 


AAGAATTGTT 


GGTTTGTTTA 


TTGTATCCTG 


GTGAATAATA 


TTTCGATTGA 


240 



CGATAAAAAG 


GCCGCAGGCT 


GTTTCTGTGA 


TTTTTTCCTG 


TTCATCTGAA 


AAGTATTCAT 


300 


CATAGCTGAC 


ATGTTTTATT 


TTCTGCTCTT 


CTGTTATCAC 


TTGAAGTCGA 


GCGATTAGGC 


360 


CATTGCCTAA 


ATCTTCTGCG 


TAACCGATAT 


ACGTTTCTTT 


CGATGGCTCT 


TTTAGCCAAG 


420 


TGTCCATCTC 


TGCTGCTAAA 


GGCATTAATC 


CTTCACGCGC 


AGAAGTTGAT 


GAACCGCGTA 


480 


CCGTATAAAA 


ATTACCTGTA 


ACACGATTTT 


CGTCACGCAT 


TTGTTTTTCT 


ACATAAGTAA 


540 


TACCATTAAC 


AACGGTGACT 


AACGTATCGT 


CTGTACGCGT 


GTTGCTTGCT 


TGAAATTCAG 


600 


GATAATCCGA 


CAGGCGTTTG 


TTGACACCCG 


AATAGCGTGT 


ATCATAATAA 


GGATCACTAG 


660 


CAAACTCTGT 


AAACTCAACA 


TTCAGAATCC 


TTTGCGTTTc 


AGGTTCCGTA 


ATAATCTCTA 


720 


CAATGCCGTA 


ATTTCCGCCA 


GTAAATACAC 


CTTCATTTTT 


ATAATAGTTA 


CCCGTCATTA 


780 


ATCCCAAAGG 


CGGTTGATTG 


GACCAAAACA 


TTTTTTTCTT 


TTTTTCTTCC 


GGTGTATATT 


840 


TTGCTGGTGG 


GGTTGTTCCA 


AATGTCGTTT 


GTGTGGCCCC 


TGTAACCACT 


TCGTACGTGT 


900 


ATTCATGAGA 


ATCAAAAGTC 


AATTGATTAG 


CAATCAGTTC 


GTCATTTTTA 


GTCGGCGCTT 


960 


GTTTTTCCAC 


CGCAGAAGTC 


GTTTGATTGG 


TGACGGTAAC 


TTGCTTCACT 


TGCTTTGTCT 


1020 


CTTGCGGACC 


ACTTGTTGTT 


TGTTTCGTTG 


GCTCTTGATC 


TGGCGCACAA 


CCAACCAAAA 


1080 


TAAATAATCC 


GATAAAAACC 


AACATTCCTT 


TAATATGTTT 


CATTCACTTC 


TCTCCTTTTT 


1140 


ATAAACGCTT 


AATTTCCTCT 


TATCTCCAGT 


GTAGCAAACA 


TTTATAAACC 


AAAACTGACG 


1200 


GAAAAATGAA 


TAAATAATAA 


ACAAAAACGC 


TAAGTCTCCT 


GCAGCTTAAG 


AGACTTAGCG 


1260 
TTTTTTTATT 


TAATTGCTTG 


GCGCTTTTGG 


CGCAACCACT 


CTTCTTTAAT 


GTGTTCAGAT 


1320 


AGGCGAATCA 


TTCGTTTACC 


AACTTCACTA 


TATTGGTTAG 


TAGCTAAATT 


GGCTAAAGAA 


1380 


ATTCGAACTG 


ACCATTTGTC 


ACTCCCGAAA 


GCATCCGCTT 


TTAAAAGCAT 


TAATTTTyCT 


1440 


TTTTCTGCTA 


ACGAAGTTAA 


TACTTTTGTG 


ATCGTCCACG 


ATTTCGTTAA 


ATAACTTGAA 


1500 


AATTCTGGGC 


CATAACGTTT 


TTCTGTCCAA 


TCTCTAAAGT 


TAATTTCACA 


ATAGTATGCT 


1560 


GTATTTAAAC 


TTGCTAAAGG 


CTCTTCAATG 


CCTAAGGTGC 


GGAACAAGAG 


CTTTTCACGC 


1620 


GTATGACAAA 


TATCCATCAC 


TTCGTCTTTA 


TATGCTTGAC 


CGTCTTTTAA 


CAAAGCATAC 


1680 


AAACTAAATA 


AAGCCATCAT 


TGCTTGCTGA 


ATAGAAGaAA 


GTCCCGCTGC 


ATGGTTCAAC 


1740 


GCAATATCGC 


GGCTGTctGC 


GACTAAGCGG 


CTGATAAAGT 


TAATTTGGCT 


TGTGTCTGCA 


1800 


TTCAATGTAG 


CATaACGTGC 


CTGTAATTCC 


ATTTTTCGAG 


CCACTGGCAA 


TTCTTTTAAT 


1860 


AATTGATCAA 


AGATGTTCTC 


TTGACTGACC 


GCAATAGTCC 


CCACACGCCA 


ACCGGTAGCG 


1920 


CCAAAGTATT 


TAGAATAAGA 


ATAGATACAG 


GCTGTATTGT 


AAGGTAACTC 


CGTAAATAGT 


1980 


GAACGGAAAG 


CTGGCACAAA 


GGTTCCGTAA 


ACGTCATCTG 


TTAAAATCAT 


TAACTTAGGG 


2040 


TTATCCACTG 


CAGAGATTTG 


TTTGATTTGT 


TCAATGGTAG 


GTTTTCCCAT 


AGCATTAGCA 


2100 


GTTGGATTGC 


TGGGATTTAC 


GACAAAAACC 


GCTTTGATGG 


ACGGGTCTTT 


TAGTTTATCA 


2160 


ATTTCCTTGT 


TTGAGTATTG 


ATACGTCGTT 


TTTCCATCAA 


TGATCATTTG 


CTCTGCTTTG 


2220 


ATTTTAACCA 


CGTCAAAGTC 


ATAACGTGGT 


AGTTCTGGAA 


TTTCTAAATA 


AGGAGCAAAC 


2280 


GTTGGTAATA 


ACAACGCAAT 


ACGATCACCT 


TTTTCTAGTA 


AATAATTATT 


TGCCAACGTA 


2340 


TCAAACAAGT 


AACAAATCCC 


TGCTGTACCA 


CCCTCAACGG 


CAAAAATATC 


AAACGGTTGT 


2400 


GCTTCGCTGG 


AAAATAATTC 


TTGATTTAAA 


TATGCTTTAA 


TCGGTTGTTC 


ACACGCCTTC 


2460 


AAGCAACGAA 


CAGGATTCGG 


ATAATTATCC 


CCAATCACAT 


AGTCTAACAT 


GGCATCTAAC 


2520 


CACATTTCCT 


TATCCATCCC 


AAAATAGTTA 


TGCTCTGCTG 


TCCAAATTTC 


tTGTAAGAAG 


2580 


GTTGCCCCTT 


TGCTTGGATT 


TTCTGCTAAA 


AAGTTTAAGA 


AGCGTTGCGT 


TCTACCAAAA 


2640 


CTTGGCTGAA 


TCATACCAGC 


CGTTTGTTCG 


CTCCCCTCTc 


TTAACGTTTC 


TTTTGTAGCA 


2700 


AATTGACCTA 


ATAAAAAAAA 


GGCTTCCCTT 


GGTGTTGGCG 


CCGTCCAATT 


AGGATTGCCT 


2760 


CGCCCTGCAT 


TTAACAATGT 


TTCTAAATTA 


TGATTTTCTT 


CATGTAATTT 


CTTCTCTAAA 


2820 


AATAGACTTA 


ATTCAAACGG 


ACTAAGGTTT 


TCTGTCAATT 


CATCACTTGT 


TATCATTTTA 


2880 


TCCTCCTGAA 


GCACACACAG 


TCATTTAGTA 


TTTAGTATGT 


AGTCTATACT 


ACTCTTCTAC 


2940 


CTCCAGATGT 


CAACCATTTA 


AGCGGAATAT 


TCTGACAATT 


AATAAAAAAG 


AAACAATCCT 


3000 


TTATAAAAAA 


TAAAGGATTG 


TTTCTTTTTT 


GCTGATTAAC 


CAAAGAACCG 


TTGCATCTCA 


3060 


ATTTTTGCTG 


ATTGAACAGA 


ATCAGAAGCA 


TGGATTAAAT 


TTTTGGTGAC 


AGACAACCCA 


3120 


TAATCCGCTC 


GTAAGGTGCC 


ACTTTCAGCA 


TCTAACGGAT 


TCGTTGCTCC 


AACTAATTTA 


3180 


CGCACTTTTT 


TAACCACATT 


GTCCCCTACA 


ACGACCATCA 


CAACCACTGG 


GCCACTTGTC 


3240 
ATATAGGCTA ATAATTCAGG GAAAAATAGT TQATCTTTTA 
ACTGTTTCTA CTGTTAGTCG TTCGAATTTA AGGTGCAATA 
AAACGTTGAA TGATTTTGCC AACTAACTTT CGtTCAACGC 
AAACTTGTTT CCTTCATTTT TTCCATCCCT TTCAAAACTT 
GGTAGCTACA GAATAAAATG AAAGCGGATA CTTTGTCAAT 
TGATTAAATA AAAAATTAAA GATTATTCAC AAGATTTGTG 
ATTGCTTGTT TCTTTTAAAA ACAAATAAGG TTGCTCCTCT 
ATAAACTTTA GCTTCACCAT TCACTAAAGT AAATAAATAG 
TGTCGCACTG TCAGAGTATA CTGCCACCAG ACAATAACCC 
CCAATTTAAC CAAATGGGAT TGTTCCCAAT AGCTGGTTGT 
TAAATCATCA GGCACTTGAA TCGTGTAAAA AGCTACAGAG 
ATAGTTCTGA TTGCGTTCTT GACCCCAGGT TTCCATGTAT 
TTGCTGATTC CAAATCGTCC CTTCTTCTGT AACTAATTCA 
TGTTTGCTCT AATTCCAACG CCGTGGTTTG ACTAACTTCG 
TTCTTCAGTA TTTACATTGT TTGTTTGTAC GGTTTGTACT 
TTTTTGCCAA CCACAGCTAG ATAAAACGCT TCCAACAACC 
TAGTTTCATT CCTTAAACTC CTGTTTTTAT TTTTAtTCnA 
CAATCCCTTT CTTTGAATAG TnAAAACCA 
(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 
AGCTCGGTAC CCCCAATTAT TATTTAAAAG CTATATTGAA TATAGCTTTT 
AACTGAATGA AAAAAAACTG TCCAATAATA TATAGAAATT TCTGAAAAAT 
GTAAATTATT TTAATAAAAT TAATAAGAAA GGAAAAAAAT ATGCAACTAG 
AACGAATCAT ATGAAAAGAA ACATGGTTTT AACTAAAGAA AATCAAGTGA 
TCGTATTCGG TCAGAAACAG TTGGCTTAAC AGATTTTGAG AAAAAAAGGA 
AAAAGTAGCT CGAACGTTGA AACGTTTACA AGAGAATGAG GGTTTTGAAA 
ACCTGTGAAT GCGGATATTC GAGGGAAAAT GGGAGGTATG CGCCAATTAG 
AAATCATAGT GTAGCGGTGG ATAAGTTGAT GAAAACTGCT CAGTATCTTG 
TGGCATGGTC TATGAGTATA TTTGGTTAAT TGGCGTGCCT TTAGTAAAGA 
AATTGATATT AAAGAAACAT TTTCTACGGC GTTAAATAAT CTATCTGAAA 
AATGCTCATA 
AGTCAATACC 
CATCTGGCTT 
CCTTTTGCCC 
GATAAATTTT 
AATTGTTCTT 
ACTGGCTGTT 
ACATGCTTCT 
CCTTCGCTGC 
ATTTTAGGTT 
TGACCTGGTT 
TGTCCTAACT 
GTTAATTCTT 
GTTGCTGAAA 
TTCTGTGATT 
GCTATCATAA 
TCGAAATAAA 



ATGTTCCTTC 
TTTATCTTCA 
AATAATTACT 
TTTCCCAAGG 
CTTTTCCTTT 
TCAGTTCTGT 
CTTTACTCAC 
GGGCAACTTG 
CTGTTTCTGA 
CATAACTCAA 
GGTACGCTTG 
GTTTTGCTTT 
GTGTTTGCGT 
TTGCTCTTGT 
CCTCTTTCGA 
TAATCCCGTT 
CCTAAAATCT 



AAATAATAAT 
CATTTkTAAT 
AATATCCATT 
TTGCGTATTA 
AAACAAAGAA 
TTGTTCTTTT 
TGGACAAAGA 
AAAATGAAAT 
AAGAGCGGTC 
AAGTGGTTAA 



3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 


4260 
4289 



60 
120 
180 
240 
300 
360 
420 
480 
540 

600 
AGGACTTGGT 
ACAAGAAGTA 
GTATTACTAC 
GACTAGTCAG 
AGAATTGTCT 
GGTCACGATT 
TGTGAAGTTA 
TGCTCGTGCG 
ACAAAAAAGA 
TGATGATATT 
ACAATTACGT 
AATCCGTGCA 
ACGTTTTTCA 
TTTCTTTACT 
CACGTTAGAA 
TATGAATCTA 
TTGGTTGGTA 
GTATTCTTCT 
ACAGTTTATG 
ATTTATTAAG 
ACTTGATCCA 
AAATAATATT 
AGCAGAAGTG 
ACGTTTAATT 
GATTAAAGGA 
TGAGAAGAAA 
TGAATTAGAC 
TTCAAAATTT 
GATTTTTCAA 
TTCGTTTAAT 
CGATGGTACA 
TTTACGTTAT 
TCAAGGACAA 



TTAGAAGTAG 
TATCAGAATT 
CAAGCCTATC 
AACTTAGATA 
AATGAGTTTG 
GATGGTAATC 
CAGGCTGGTT 
AGAACACGAA 
AAAATTGTTG 
CCGATTATTG 
GAGCGAAAGA 
ACGTTTGATA 
ACCAGTAATT 
TCGTTACACG 
GAAAAAGAAA 
TTGCTTGCAA 
TCGGGAGATA 
CTATTAGATG 
CGAACGATTA 
ACGTTTAATT 
ATTGTATTAT 
AACGAGGATA 
GTGGCTGAAC 
AGTCATTCAG 
TCGATTTTAG 
GTGACCATTT 
GAGAATCAAC 
GGGTCACGCA 
TCGTCACCAG 
AATTTTATGG 
GGCTTTGGAA 
TTAAAATTAC 
TGTCTCATGA 



GAGTAGCGGA 
TGTCAGAATT 
AATTTTTAAA 
ATTTACTAGC 
GGAAAAGTTA 



ACTTGCTAGA 



TTGCTGAAAC 
CAAAGAATAT 
AGGGGCAACA 
ATTGGAAAGC 
AAAATTTGAT 
ATGTCTATCT 
GGCAACATAC 
CAGGTACAAA 
GTCGGAAACA 
ACAAACAGAA 
CAGGTAACGG 
TGAAAGTGTT 
ATGATCCAGA 
TTGTGACACT 
TTGATGAAAC 
AGTGGAAAAT 
GGAAAGCAGG 
AGAAAGACGT 
AGTTAGCCTT 
TAGAAATTGA 
GGTTATCTGT 
ATCGGAAAGA 
AAGGACAAAA 
TATTAATTAC 
CGGTGGTTTG 
CAGTCAATGA 
AAGATATGTT 




GGATTGGGAA 
GTTAGTGGAA 
TAATATTCAA 
GAACAAAATC 
TATTGCTCAT 
ACTGGTTCAA 
AAAAGGCCAA 
CATGGAAGAA 
TTCACTTGAT 
AGTATTAGTG 
GAATCGACTA 
TTTTCAAGCA 
GTCGACGTTA 
GACAGGGTTT 
AATTGTTTCT 
TATTGAAGGA 
GAAATCAGTA 
ATACATTGAT 
ATATCAAAGA 
TGATGTTCGT 
AGAAGCAATC 
GCCACATAAG 
CAAACAAGTT 
TCATGAAATG 
TTCTCATGGT 
AGATTTAGAT 
GACTTTAATG 
AGAAACAGTC 
GATTCTAAAA 
GCAATCGGTC 
TTTCGATGAA 
GATTAATTTA 
TAACCAAGTG 



GAAGCTTATA 
CGATTGACAG 
CATGAGAAAA 
ACTCCTTTTC 
TTACCATTGG 
AAAATGTCTT 
TTAGCGTTAT 
GCACATTTAG 
GATTTAGATC 
ATTAGTGGGA 
GATAGCTTAG 
ACGTTATTAG 
GAAACATTTT 
TATTTGGGAC 
GGCTCTAAAA 
AAGAAAACAA 
TTTTCTAAAT 
CCAAAAAAAG 
AAATACCCAC 
AAAAAAGAAA 
GCAACGGCAA 
ACGGCGATTA 
GGATTTTGGC 
GGTCGTTTTT 
GAAGTAGAAG 
TTACCCACAG 
TTTGCATTAG 
ACTTTCTTTG 
TCGATGAAAC 
AACGATGTGT 
GTGGATAATC 
AAATGGGTAT 
AATCGAATCG 



AAGACCAAGA 
AAGACGAGTT 
AAGAACTTTT 
GAGGTGGGTT 
GGGATTGTGG 
ATCCTGTTTC 
CAGGACGTTC 
CAGGTTCAAA 
AAAAAATTGA 
ATTCAAAAAA 
GAATTCCGTT 
GCAACTTTCA 
CAGAGTTGAA 
GTGTGGACGC 
ATATTGTCTA 
ATAATCCCCA 
GGTTATTTTT 
AAGTTCGCCA 
TTGATGTGGC 
ACCATGGGGT 
AAGCTATGTT 
ATGAAACAGT 
ATGTAATTGA 
TATTATCTAC 
GACTTTCTTT 
ATCGGCACGA 
GCACGTTTTG 
ATGAAGCGTG 
GGATCGGACG 
TAGATAATGG 
GAGAAGGGAT 
CGAATATGAT 
TGATAGACGT 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
TATTTTTGAA 


GAATGGTTAG 


AACTCTTTGA 


AACAGTTGAT GATACAGAAG CGTCCGTTAT 


2640 


GGAAAACGAT 


TATGTGGCGA 


GGTGGTGAGA 


AAGTATGAAA TATCTCATTG ATAGTTATAC 


2700 


AGCCTATATG 


GTGAATGGTG 


GTTTATTAGA 


TAAAGGATCA GAAGTCGTCA ATTGGTTCTT 


2760 


TATTGAATTG 


CCTTTTTTCT 


TTTTGCGTAT 


GGGGGCAATG ATTTTCTTAA TTATGGAGAA 


2820 


TGTCATGAAT 


CAGTCAGATT 


ATTTTGTTGG 


TAAACAACAA GAAGCATACG ATTATTCTTT 


2880 


GGATATTTTA AAAGGATTTG 


GTGGAACTGG 


GATTGTCAAA GGGAGTTTGC TTGGATTGGC 


2940 


AATTATCCTA 


AGTGCCTATT 


ATCTATTGTA 


TAGCTTTTTT TCTAATCGGC GTAATTTCAT 


3000 


GAAAAGTTTG 


TTGCATTATT 


TTGCGGTGTT 


TGCCTTGTTT ATTTGTTGGT TTGGACAAGT 


3060 


AAAAACAATT 


GATGGTAAAA 


CGCAAAATGG 


GGCAATTTTT TTGATTAGTT CGGTTAGTGA 


3120 


GATGACAAAA 


CAAGTACAAG 


GCAAGTTTAC 


TTCTAATGTA AACTTTGGAG GGGATACAAG 


3180 


CCAAGAAGTA 


GACGATGGAA 


AGAAAAAAGT 


GTATCAAAGC CCAATGTTTG ACGCAACGGT 


3240 


TTTACAGACG 


TTTAACTATG 


TCAATAGTGG 


CTCGATTGAT GGCAAAATGG CAAATGGGAA 


3300 


GAAGCTTGAT 


TATGATAAAC 


TATTAGAAAA 


GCCTAACCTG AATGATAAAG AGAAAAAGGA 


3360 


TTTTAGAGAA 


AAAAGAAATG 


CTTATATTGA 


ACAGCAGGAG AAAGACAATC CTTATTTTTC 


3420 


GCAAGATACC 


TTAAAAACTA 


TGGAAAAAAG 


TTTTGCGGTA TGGACGGGTG CAACAAATTT 


3480 


ATTTATTCTA 


GCAATTCCTG 


TGTTATATAT 


TAATTTAATG TTGTCGTTGA TACAGCTGTT 


3540 


AGTTGTCTTT 


TTAATTCTTA 


TATTTCCAGT 


TGTTTTGTTG GCTTCCTTTT TTCCACGGTG 


3600 


TCAAATGTTA 


CTATTCAAGT 


TCTTTCAAGG 


TTTAATTGGT GCGTTATTCA CGCCAATTAT 


3660 


TTATGGTATT 


TTTTTATCGG 


TGCTGTTTTG 


GaTTAACAAa CTGATTGATG GGGCATTTTT 


3720 


AGGTGTCGCA 


AAAAAAGTCA 


GTGGCAGCTT 


GCTTGAATTG ATTTCAGGTA GTACGGTTTA 


3780 


TATGATCGTT 


TTATTTGTAG 


CaGTGGTAGT 


GAAGATTGTa GTTTTACGAA AAGTTTGGAA 


3840 


AAATAAATAT 


GCCATTCTTG 


CTTACTTTTC 


AAATGGGCAA GTGTCACAGC CTGTGTTTGA 


3900 


ACAAAAAGTG 


GAACAGTTGT 


CTGAACGAAC 


GAAAGAAGTA GCAACTGGTG GAGCGCAAGT 


3960 


TGCCCTAGGT 


GCTTATACAG 


GCAATCTGGG 


CATGGTCGCA AACGGTGCAG GTAATATTCt 


4020 


ACCGAATCAA 


GATAAAGCCA 


TTGAAATGGG 


CCAAGAACAT TTTATTGATG ATAATGGTTC 


4080 


ATTTACTGGG 


GTAAAAGCGG 


GTTTACAGTC 


TATTTTTAGG CAACCAGTGA CAGAAGAAGA 


4140 


ACAAGGGGAG 


CTGAACGAGG 


AACAGTTTTC 


AGAAGTGGAA ACGGAATCGA TAGATCATGA 


4200 


GTTAACAGAT 


ATTGAAGAAA 


TAGGACAGAC 


AGAGTTAGAA TCTGATAGTT TAGATAACTC 


4260 


TATTATTGAA 


TTAGAGCCTA 


TGACGAGTGA 


AGATTTAGTA GTGGATGTCG ATAATATGAG 


4320 


TTCTGATATA 


GATGTGGAGC 


TAGCATCAGT 


AGAAGAAGTT GGGGAAGAGC CACCAATGAA 


4380 


CGCAGAGGAA 


TATTCAGGTT 


ATGATAATGT 


GATTGTGACC AACTTTGATG AATTGGCTAT 


4440 


GGCAAGAGAA 


GAACGTGATT 


ATTTTGACGG 


TGGCAAAGAA CAAGAGCTAG TCGAGACAAA 


4500 


TGGAGTTGAA 


TTGGTTAATA 


ATATTTATCC 


GTTTGTTCAA ACAGAAGAGT ATCTAGCTAA 


4560 




TCTTGAACAG ACAGAAGCCC AGTTTTTTGG AACAGATGAG TTTGAGAAAG AATTAGAACA 4620 

GATAGAGGAG TGGGCATAAC CACTTCTCTT TTTATGTTAG GTGGTGCAAA TGAAAAAAAT 4680 

CTTTTTAGGA ATCCTTGGCT tGTTTAGCTT TTTGTTTTTA TTGCCTTTTT TGTTATTTTT 4740 

GGGCACCATG TCAACAGAAA TTGGAAACAA TACACAATTT CAAGCAACAA CGCCACAAGA 4800 

AAAAGTAGCG CTAGAGGTAT CCAATTTTGT TACCAAAAAT GGTGGATCGT TACAATTTGC 4860 

CTCTGCATGG CTAGGAAATA TGGAACATGA GAGCGGATTA AATCCTGCTA GAATACAAAG 4920 

TGATTTAACG TTTAATTCAG CGTGGGCGTT TAATCCTTCG ACCAATGGTT ATGCGTTAGG 4980 

TTTAGCAATG ATGGACGGTG AGCGCCGTGT AAACTTGTTA AATTTTGCCA AAGAACAAAA 5040 

GAAAGATTGG CAGGCTGTCC CAGTACAACT CGAGTATATG TGGAATCATG ACGGCTCAGA 5100 

CAGTGCCTTG CTGAAACGTA TGTCGAAAAG CTCTGATGTG AATCAATTAG CTGTAGATAT 5160 

TTTGGTACAT TGGGAACGTG CAGGCACTAA AAATGATCCC AACGAACAAA TCAAACGAAA 5220 

AACAAGTGCG AATAATTGGT ATAAGAGACT GTCTACAGGT TCTATGGGGG CAGGTTCAGC 5280 

CAATATTGGT GGTGGCAAAA TTGATGTGTT AGAACAAATG TTAGGGCAAA CAGTCAATGG 5340 

AGGTCAGTGT TATGGGGGGA CTTCTTATTA TGTTGAAAAG ATGGGCTTTC AATCTTTAAT 5400 

GAATACAGGG CATATGTTTG CCAGTGAAAT TGGCAATGAT TATGCGTGGG AACAATCTGG 5460 

CTGGCAGGTA ATCAAAAATC CAAACTATTC TGATGTCAAA GCaGGTGATG TGATTAATTT 5520 

CGCAATGGGC GGTTATGCGA CCAGCGTTTA CGGTCATACT GGAATTGTAG cTAGTGTTGA 5580 

AGGTAATGGA AAGCTTGTTT TATATGAACA GAACGCTGAA AAAGGACAAA TAATTGCTAA 5640 

ATATTTTCGA CAATGGGGAG CAGAATATCC TAACGTGACA AGTATTGTAA GGAAGAAATA 5700 

GAGGAGAGAT TTTTGATGAA AAAAGAAGAA ATGGACGTTG TTTCATTGCG GATTATAAAG 5760 

CTTTATTTTT TAGGCATTAG GGAATTAAAT TTCCCAGATT ATAATAAAAG GTTTCAGCAA 5820 

AAAGATATGG AATTGTTTAT TCAACTAGCT GATATGATGG AAAATTTACC AAATCTTGAT 5880 

GAACAACTAA TTTATGAATT AGAAGAGTTG AAAGATTATT TATTCTATGT CAAAACCGAA 5940 

AAGTATTCTT TAACAGTTCA TGATATGTTT TTGGAAATGA AATCCGAGCT TGAAAAAATA 6000 

ATTTAGAAGA AATGTAGTGT CAAAAAAGAG AGGGTGTAGT GTCAAAAATG TTTAGAAAGA 6060 

AAAATGGTTT AACAAATCGG CAAGTATTAG CGATTTGTAC AAAATTTCAG TGTAAAAAGA 6120 

ATGAGTTTGT CGCCCATCGA TTAAGTGATG GGTATTTAGT AGCAGTAAAT AATAAAGAAT 6180 

ACCGAGTAAA ATTTTCGGAA GGCTTTTTTT CAAAAATTGT ATATGCAAAA GAAGTGCAAC 6240 

GGAAAGGAGC TCGGAAATCG TGAATAAGCT ACCTTTACTT ATTTTATTGT TAGGCGGAGT 6300 

GTTGCTTGTT AGTGGCTGTC AAAGCCATAA GGaAGAAAAC AAGTCTAGTA AAGTATCGAC 6360 

AGAAGAAACG ACAGTGATTG AAACAGTAGC AAGGGAACAA TCGAAGGAAT CGTTTACGAG 6420 

TGAAGCAACT AAAAAACAGA CAGAAACAAC GAAATTAGAA GAACCAGATC ATGTAAAACT 6480 

TCTAGAAGCT TATGGAAATG CGTATGCGAA CTTTACAAGT ATTAATGATC GCAATGAAAA 6540 






GCTAAAGCCC 


CTCATGACTG 


AAAAATGTAT 


CAAAAAAAAT 


GGAATTGATG 


TTAAAACTGG 


6600 


AGTAGCGTTA 


GTTTCCGTAG 


GAAAGGTTAC 


AACGATTTAT 


AAAAATGATC 


AACATGAATA 


6660 


TGCTTTACTT 


TTGGATTGTG 


AACAAAATGG 


AACGCAGACA 


CGAGTGTTAC 


TTTTGGCTAA 


6720 


GGTGAAGAAC 


AATAAAATTT 


CTGAAATGAC 


CTATAATTCA 


GTTAAGCAAG 


AGTATTAGAA 


6780 


ACTCTTGATA 


TTAATAGATT 


CCATGCTAGA 


TTTATGGTAC 


AATTACACAT 


AATAAATAAA 


6840 


GGAGAAAACC 


AATGAAAAAA 


AGTATTGTAA 


CGtGTTATTT 


TTGATGTTGG 


GGATAGTTGT 


6900 


ATTTACTAGT 


TGTGGGTCTA 


AAATGGATAA 


GACAGCAGAA 


GAGTTTAAGA 


ATAAGATTAT 


6960 


AAGTCATGTC 


GGTGATCTTT 


ATGATGAAAA 


ATATCAAGAA 


AAAGAATTTA 


GCTTTTTAAT 


7020 


TTACAAAGAT 


AAGGATACAA 


ATGAATATTT 


AGCTAAAGCT 


TTGGTTCCTT 


ATGAAGGTGA 


7080 


GCCTAATAGT 


GTAGAATCTA 


AGTATTTTTA 


TAATGTAAAT 


AAAGAATTAG 


AAACTATTGA 


7140 


ACCATTTAGT 


GGAGGTAGAA 


CGTTTGATTA 


TGCTAAGTCT 


CATGGTAACT 


ATGAGGTAAT 


7200 


ATATAAATCA 


GGAAAATTTA 


AATAGAATTT 


GAACTTAAGA 


TAAATTAAAT 


TTATGTTTGA 


7260 


GAGCAGTGCT 


TAGAGCATTG 


CTCTTTTTTG 


TTTTAGGAGG 


GAAAACAATG 


TTAATTGTAA 


7320 


CAAAAGAAGA 


CCTAATGAAT 


TTAGGATATG 


GAAAATATCA 


AGCGGAAGAT 


ATAATTAGGA 


7380 


AAGCAAAGGC 


AGTAATGGTG 


TCTAAGGGAT 


ACGCATATTA 


CCTAAATAAA 


CGATTAGGAA 


7440 


GAGTTCCTGT 


TGGAGCTGTC 


GAAAGTGTAT 


TAGGAATAGA 


ACTAGAATCT 


ATAGCAAAAG 


7500 


AGGTTGAAAC 


GGTTGGTTAA GTTTAAAAAA GATAAGAGAG 


GAAAGTATTA 


TTTTACTGCT 


7560 


AGTTTGGGTT 


TTGATGAAGT 


AACTGGTAAG 


AGGGTTCAAA 


AAATGCGTTC 


AGGGTTTTCC 


7620 


ACGATTAAAG 


AAGCAAGGGA 


AGCTTATGCG 


GAAATAATTA 


ATAATTTCGG 


TAAAGAAGCA 


7680 


TATTCTTCAA 


ATAGCACAAT 


GCTGTATGAA 


GAATTTTTTT 


ATACTATTTT 


TTTGCCATAT 


7740 


TATAAGGGAC 


GGGTTAAAGA 


AAGAACTTAT 


AATAATAGAC 


TTAGTATGAT 


AAAAAATACT 


7800 


TTTGCCTATT 


TTTTTAAAAT 


GAAATTGAAA 


AGTATAGCTC 


CTGTTCATAT 


ACAGAAGTGG 


7860 


CCGAATGAGC 


TGTTGGAACn 


GTATGAAAAC 


ACGTATGTAA 


GAAATATTTA 


TGGACTGTTT 


7920 


CnGAT 












7925 



(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 
TAAAATAwTw TTTATAATAG AaAGTTATCG AAAGGATACC TATGTTTAAA TTATTAAAGA 
CTTTTCAAAT TGTGTATGAG CAAATGAACT TTTCCAAAGC AGCTACCTGC CTATATATAT 
CACAACCAGC CGTTTCTAAT CAAATCAAAC AATTAGAAGA AGAATTAGGC TGTTCATTGT 
TTTTAAGAAA 


TGGTCGACAA 


GATGTAGTCC 


CTACTAGGCA 


AGCGGAAGTC 


TTGTATAATC 


240 


GTTTGCTAAA 


TTTAGCTGAT 


GATTGGCAAG 


AAACATTAAC 


AGCTCTTCAT 


CAGGCGAGAT 


300 


TACCTAAAGA 


AACTTGCCGA 


ATTGCTGCGT 


CTAATACTTT 


TGCAGTTTAT 


TATTTGCCGC 



360 


AACTGATGCA 


GCATTTACAG 


ACACAGTATG 


CAACGATCAA 


TTTTGTTTTA 


GAAATGAACA 


420 


ATTCAGAAGA 


AGTAGTGGAA 


AAAGTTGAAA 


AACATCAAGT 


TGATTTTGGC 


TTTATCGAAA 


480 


AACCATTAAT 


TACGAAAGGA 


GCAACTCGTG 


AAGAAATTAT 


TCAGGATCAA 


TTGGTTTTAG 


540 


CTGGGGACCC 


TgCCAATcAG 


AATTGGTTGG 


TACGTGAAAA 


GGATTCTGGG 


GTCTTTCATT 


600 


ATACCCAACA 


GTATTTAGAA 


GAAAGTAATC 


AAAGCCCAAC 


ACTCATGACG 


GTTAAAAACA 


660 


ATGAAATGAT 


TGTTAAAATG 


TTGGAATTGG 


GCATGGGGCA 


ATCGTTGCTT 


TCAAGAAAGG 


720 


CGATTACTGA 


AAAAATTCCT 


TTCCAAACGT 


TAGGTGAAAA 


GTATTGGCGT 


TCCTTTTATT 


780 


TTTTGACACG 


GGGACATTTA 


AAATCCTCCT 


TGCTTCAAGA 


AGTAAAACAA 


GCAATTTATC 


840 


GATTCTATCA 


GACGGAAATG 


AATAAATACT 


AAAACGTTCT 


CTTTGTTTAT 


TTATCAATCT 


900 


TTGACTACAC 


TAAAGAAAAC 


GAAGGAGGGA 


ACCACAATGC 


CTAAATTTTC 


ACCATGTTTA 


960 


TGGTTTAACA 


CACAAGCGGA 


AGAAGCTGCT 


AACTTCTATA 


CAACAATCTT 


TGAAAAAGGT 


1020 


GCGATTCTAA 


GTAAAACGAA 


CTACGTTAAT 


GAGGAGCATC 


AACCTCaAGG 


aACATCATTA 


1080 


ATGACCATTG 


AATTTACCTT 


GGCGAACCAA 


ACAATCATTG 


GTTTAAATGG 


CGGACCAGAA 


1140 


TTTTCCTTTA 


CGCCAGCTAG 


TTCATTTTTT 


GTTGAATGTA 


CAACATTACC 


ACAAACAGAA 


1200 


ACGTTGTGGA 


AAAATTTAAC 


AGCAGATGGA 


CAGATATTGA 


TGCCTTTTGG 


GGAATATCCT 


1260 


TTTAGTCCTT 


TATATGGTTG 


GGTAGTGGAC 


AAATTTGGTG 


TCTCTTGGCA 


AGTGTCTTTT 


1320 


TCAGGAAAAG 


AACAAACAAT 


TGTACCAACC 


TTTATGTTTG 


CTAATGAAAA 


ATATGGCGAA 


1380 


gcAGCCAAAG 


CTCTATCAGA 


ATGGTTGGCG 


ATTTTCGGTC 


CAGGCGAAAT 


AATTGAAAAA 


1440 


GTTGAATATG 


AAGATGGGAA 


CATTGCGCAA 


CACTTTTCAC 


ATTACAGGAG 


CAACCGTTTC 


1500 


GAGTAATGGA 


TGCACGAGAT 


AAACATGATT 


TTGATTTTAC 


CATGGCCTTC 


TCCATTTATA 


1560 


TTGATTGTGA 


AAATCAGGAG 


GAAATAGATC 


GGCTATGGCA 


ACAAGTGACT 


GCTAAAGGGA 


1620 


AAGAATGGCC 


GTGTGGCTGG 


ATGGAAGATC 


AATTTGGAGT 


CAGTTGGCAA 


ACAGGCAATC 


1680 


CTGAACTGAA 


ACGTTATTTA 


TCTGACTCAA 


ATCCAGCTAG 


AGCAAATGAA 


GTGACCAAAA 


1740 


AATTGTACCA 


AATGAAAAAA 


ATAGATTTAA 


ACCAGTTAAA 


AGAAGTGTAT 


GACAAATATA 


1800 


ATCATTAGAA 


AAACTGAGGT 


TGTCTGATCC 


ACTACAcTTC 


GTTACGTTGG 


TATCAATTCC 


1860 


AATACATGGA 


ACATACTAGA 


GTTGAAGGAC 


TTGTCTTTTA 


AGGATCAAGA 


CCTTCAACTC 


1920 


TTAAAAAAGG 


CTCGATTCTT 


GCGAGTTTCA 


AAGAAGCCAC 


TTCTATTCTG 


TTGTTTATCT 


1980 


TAAGGGTTAG 


GACTCTGCCG 


ATTTTTTTAT 


AAGATTATGC 


TATAATTAAG 


AGTAGCAATT 


2040 


ATTGAAGGAA 


AAATGAAAAG 


TCCAAAATAG 


GAAGTssTAA 


AATGATTGAG 


TTAATTGTGT 


2100 


TGTTAACAGT 


TTTTACTTAT 


GGAACAAACT 


TTGTTTTATA 


TCTTATTTTG 


AAAAATAAAA 


2160 
AGTCCATGCC AATTGTCGAA AAAGCTTCTA TTGTATTAGG AATTAATATG TCTGTGTTAT 2220 

TATTAGATGG TATTTTTGCT TTCGITGGTA AATTCATCAT GTTATCAGAA TGGGAATTAT 2280 

TAGCGCCTAT TTTTGAAATA GTGTAAAAAT CATCATAAAG TCCTATATCC AAAAAGAATG 2340 

AAGTTTGTTA GGATAAGAAA GATGAAAATT TAGAAATAAA AGAGGTGGGA TTGCATGGCT 2400 

AAAATCATGA TTGTCGAAGA TGAAACAACC ATTCGAGAAT TAATTAGCGA GGAATTACAA 2460 

AAATGGCAAT TTGAAACGAT TGGAACAACC GATTTTAATG ATGTATTGGA TGATTTnCAA 2520 

GAGGAGAATC CTCAATTAGT CTTGATGGAT ATTAATTTAC CTGTTTACGA TGGGTACTAT 2580 

nGGTGTCAGA AAAT 2594 
(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 



TGTGCTAAGC 


GTTTnTTAAC 


TTTTTTTAnG 


TAGTTTGGCA 


GGTTCATTAG 


GAGnTGGCGC 


60 


TPTTTTATAT 



AGTGTCATAA 


TAGGTGTTnT 


AGGATTTGCG 


GACTATCAAA 


AGATGTTTTA 


120 


TCGTGTCGAA 


CCTATTTATC 


CAGACGACTT 


AAAAATGATC 


ACAGAAGTTA 


GTTTATTAAA 


180 


AGAAATGGTT 


GGTTTATGGC 


CGTTTATTTT 


TGTGGTAGCG 


TTAGGTTGTG 


TAGCACTTTT 


240 


TTTCTTAGGA 


AAAGCCTTTT 


ATAAAAGTTT 


CTTTTTATCG 


AAGAAAAAAC 


AAACGATTCG 


300 


TGTGTTAAGT 


CTTGTTCTTT 


CGATTGGTCT 


ATTTTCATAT 


ATTGGTGATT 


TTAACAATCC 


360 


GCAAAATTTG 


TTACGTAAAG 


CCTATAATAA 


AACCGCGTTA 


TGGATTCCTT 


ACAGCCAAAA 


420 


AATGAATTAC 


TACAATACAG 


GATTyATTGG 


TGGTTTyCTA 


TATAATTTAA 


AAGTAGAGGC 


480 


TATGGACCAA 


CCAAAGGGCT 


ATTCAAAAGA 


AGCTATTTCA 


GAAATTACTC 


AGAAGTATCA 


540 


AAAGTTGGCC 


GAGGCGAAAA 


CACTGCCGAG 


CAATGAGCAA 


CCAAATATTG 


TGTTTGTCAT 


600 


GAGTGAAAGT 


TTTTCAAATC 


CAGATCATCT 


GAAAGGCTTG 


TCCATTATTG 


GTAATCCTTT 


660 


AGCTGATTAT 


CAGCAAATCG 


CTGATAAAAC 


TTACAGTGGC 


CAGATGCTTT 


CACAAAACTA 


720 


TGGGGGCGGA 


ACTGCGAATA 


TTGAATTTGA 


AGCCTTAACA 


AGTTTTcGAT 


GGAATTAATG 


780 


AATGCTCAAA 


TGACTACACC 


TTATACGATG ATGATTCCTA AGATGACgGA ATTGCCTTCC 


840 


ATTGTTTCAT 


TACTTAAGGA 


ACAAAAGTAC 


CAAACAACGG 


CTATTCATCC 


GTACAATACA 


900 


TCTATGTATA 


AACGAAAAGA 


TGTGTATAAA 


ATTTTAGGTT 


TTGATCAATT 


TATCAATGAA 


960 


AAAACAATGA 


ACTATACGGA 


TAAACTAGAA 


AATAATCCTT 


ACATTTCAGA 


TCAAGCAGCG 


1020 


TATCAGGAAG 


TGACGACACT 


GCTTAAAGAA 


AAAGAGCAAC 


CGCAATTCAT 


TCATTTAGTT 


1080 


ACTATGCAAA 


CACATATGCC 


CTATGAAAAC 


AAATACACAA 


CGTCCAATTA 


TTTTTCTCAA 


1140 


GGAAACGGCA 


ATCAAGTCGC 


GCTCAATAAT 


TACTTGCAAG 


ATATTGCCTA 


CAGTAGCGAA 


1200 
yTTTGcaAAA 


ATTTCCTACA 


AGAAATCGAA 


AAAATGCCCA 


AACGCACATT 


AGTCGTTTTT 


1260 


TGGGGAGATC 


ACTTACCAAG 


TATTTATGAT 


GAAAACATTC 


AAGCGTTGAA 


TGAAGGGGCA 


1320 


GCCCTTCACC 


AGACAGAnTT 


TCTAATGTAT 


GATACGGCGA 


ACCAGTTAAC 


AGrAAAAAAT 


1380 


CAGCATCAAG 


CAATCATTAG 


TCCATTTTAT 


TTnGCTCCCT 


CACTTTTCCA 


mCAAAGTGGC 


1440 


TTACCGCAAT 


CGGGATTTTA 


CGCTATGCTA 


AATGAGGTAC 


AAGAACAATT 


GCCTGCTTTT 


1500 


GAAAAGGGGA 


ATTACTATCT 


CGGTGGGGAA 


TGGAAAAAAA 


CAGTAGAAAT 


GAATAAAAAG 


1560 


CAAGAACAAC 


TATATGAGGA 


ATACCGACTG 


ATTCAATATG 


ACATTGTTTC 


AGGGAAGCAA 


1620 


TATAGTTTAG 


AAAATCAATT 


TTTTAGCTAA 


AAAGrGCCTG 


GGACaAAAAT 


CACTTTGGAT 


1680 


TTTTGTCCCA 


GGCTCAAAAC 


CTGATAAACG 


GCGGGAACAG 


AACCAACTCC 


TTCGGAAATA 


1740 


AGCCGAAATT 


CTCCAAAAAT 


TAAAGAGCAA 


TTTTCGGAAA 


TTTCTTCTTA 


TTTCTCGGAG 


1800 


CTAAACGGTT 


CTGTCCCsAC 


CTCTTGAGTT 


TTATTTTAAA 


GTTTTACGAT 


AAACAATGAC 


1860 


ATAATAGCTG 


AAAGCTAACA 


GACAAGCAAA 


ACTTAAAAAG 


TAAAGCGGCA 


AGACCCAAAT 


1920 


GGTACCATTG 


TGAAAGTAAC 


TAATCAAATA 


GGTAAAGAAA 


CTAACAAACA 


AGTAATAGAC 


1980 


AAAACTAAAA 


ATCCCACTGG 


CAGTACCAAT 


TACAGATTCA 


TAGCCTTTTA 


AGGCCATACT 


2040 


TAAGGCATTG 


GGCAAAGTAA 


TATTAATCCC 


TAAGAAAAAG 


AGAAAAATAA 


ACAAAATAAG 


2100 


ACCAATGATT 


GTTTCAAATT 


GTGTCATTAT 


TAATAAACAG 


CAAGCCGCAA 


AGATACTTAG 


2160 


CAGCAAACCA 


GCAAAGGCAA 


TTTTTTCCGC 


TGACCAATAT 


CCTAGCAACC 


AATTGATAAG 


2220 


AACCGCAGCA 


AGAATACTAG 


ACACAGCTAA 


AACTAAACCA 


AGAAAGCCAT 


ATTGGACACT 


2280 


GCTAAAGCCA 


AAGTGATTGA 


TGAAAATAAA 


CGGAGCTTCT 


GCATAGTAGC 


TAAACAATAA 


2340 


ACCATTAATG 


CCCCCAATCA 


ATAAGCCGTA 


GCACCAAACT 


TTCGAGTCGA 


AGGAAAGGCG 


2400 


TTTCGCCACG 


GACCAAAAAG 


GCGCTTTCGT 


TAAGGTATTT 


CCTAAGTGAA 


AGGTTTCTGG 


2460 


TAAACGATAA 


AAACTATAGA 


AAAGTAATAG 


AACAGCCATG 


ACAATTAAGG 


TGCTAAAGAC 


2520 


ATGCTGATAG 


CCAAAATAGG 


TTTGAACAAG 


TCCGCCGACT 


AAGGGTCCGA 


TGGCAGGAGA 


2580 


TAGCGATAGG 


GCAGCGCTTG 


TTTGAGCAAA 


AATTTTCCCT 


CTTTGAAGAT 


CGTTAAAGCT 


2640 


TTCCCGCATC 


ATCGATTGGG 


TAATGACAGA 


ACCAACGCTT 


GCACCAAATG 


CTTGAATTAA 


2700 


ACGGAAGAAT 


AACAGCCAAG 


GAAATGTCGG 


ACTCAAAAAA 


AGGCCAATAT 


TTCCAATTAA 


2760 


ATAGCATAAA 


ATTCCTAATA 


ACATAGCTGG 


GCGACGTCCA 


ATCGTATCGG 


ATAGAATTCC 


2820 






CAAAGGCAAC 


AAAATAAATA 


CTCATGGTTA 


ATTGTGCCAC 


2880 


GCTATCGTGA 


ATGTTAAAAG 


CAGAACTAAG 


CGCTGGCAAA 


ATGGGTGTGA 


AAATCGACTC 


2940 


GCTGATTTGT 


GGAAATCCCA 


CAAGCAAAAT 


AATAATTAGT 


AATGATGGGA 


CGGTGTTTCG 


3000 


GGTGTTTTTC 


ATGATTTTTC 


TCCTTAAAAG 


TTTAATCTAA 


TCAAAAACAC 


CCATTGTCTT 


3060 


TTACAACCGC 


CAAATTTAAA 


ATAAGTAAAA 


TAATGTCACC


AACTTTCCTT 


GAAATTTTTT 


3120 


ATTATACAAA 


GTCTAAATGA 


AGCAGCGCTT 


CTTCTTGACT 


AAATTCATAA 


GTGTAGCGAC


3180 
CGGATAAACC GGCAAGCTCA CCTGATGCTT GAAATAGTTT ACCTGGGCAG TCTAAAAGAC
CCTGTGTAAa CGTCCCTTTT TCGCAAGCAG TAAAGGTGCC TTTTTTTCCT AAATAAGTTC 
CTTCGAAATG GAGGAAACCG ACCACTTGAG CCACAGCTTC AAGCCCATCT GCCATTGAAG 
AGCCTTGATA AGTCATTAAG TATTCCACAG AAGcTGTCCC AGATAAGGCG CCTGACAATT 
GATAAGTAGC GCTCGCTTGA TTTACTGGGA AATTTTCATT AAGATTGTCA ATCGGTTGTT 
CTTGCCAACT AGATACAGTA AATGATGCTT GCTCCATTGC TGTCCCTTCT TTCTTATTAA
TAAGAATAGG GTAGCGGGAA GCGCACTATT TGTCAACAAT CCACCACTTT AACTCTTCAT
TTTTTGCACA AAAGAATGTA ACGCTAAGCG GTTGATGTGA TAAGCAGTTC CTTGTTTTTG 
AATAAAGCCG CGCTCTCGAA ATTGTTTGAA GGTATAAATA AGATGGCGAT AACTAACCCC 
TAAGTATTCG GCAATCTCAG TATGTTTTTC TTTATAAACA TCTTCGTGAT TAGCGGCAAC 
CAATAGCAAT TCCGCTAGCC GATAACTTAG TTCATAGGTC TGATTCGTTG TAAAGTGGTC 
CACCCGTTGA AGCATTTTTT TCCCTAGATA ATGGCTGAGT TGCTGCATAA AAAGCACCTT 
GTTTTTTAAA ACAGTTTCTG CGTAAGCTAA AGGTACAGCC AAACAACAGG TTTCACCTAA 
GGCGATAACG TCCTTGGTTA TCGTTTCCGC TTGAACTAAT GTTAAATCGC CAATTAAATC 
ACCTGCCGTT AAAAACTGAA CAATGGATTG GCGGCCATTG CGTTCTGTTT TTAAAATTTT 
CGCTTTTCCT TTGATTAAAT AAAAAAGATA ATTTAATGGC TCTGCTTGGT GATGAATGAA 
TTCTTGATTT TCAAAATCGA TAATAAAACT GTGTTTTAAA TCAGTTTCAT TAAAAAAAGG 

CGTATTTTTA GCTAATTCTT GTTTCGGTGA TCGT 

(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 
TAAGGTTGCA AAGCCAATTT TTTTATTCTT ATCCGTAAAA TATTTTTTGC 
CACTGAATGA ACTTTGTCTT CAAGGGTTCG TAAAGATAAG CTAATCTTTT 
ATCAACATCA ATAACTTGAA CGGTTACTTT TTGTCCAACG GTTAAGAGTC 
TTTGGTAAAG CCAGCTTGTA CTTCCGAAAC ATGAATCAGT CCTTGTGTTT 
AGAAACAAAT GCGCCATATG GTTGAAGTCC GGTAATTGTG CCAGAAAGTA 
TTTGTACTTC ATCAAATTTC CTCTTTTCAT ACGATAAATT TCTTTTTCCG 
TTTAAAATAA TAGCATTTTT TCAACGTTTG TGCATTATTT CTTCCCTAGT 
ATTGTTTCAT CTTTTAAGCA AAATCTACTA TACTATAGGA AGTAAGACTT 
AAAAGGAGTT ATCTTATGTT TAATTTTAAT GAAACCATTG AGCGTCGCCA 



GTCGATGGTT 
GAGAATATTC 
CATGGATATT
CTTCGTCTAA 
CCATACCAAT 
TTTCTACTCA 
TGTTAAAACT 
TATATAGAAG 
CACGAATTGT 



3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900

3960 

4020 

4080 

4140 

4200 

4234



60 
120 
180 
240 
300 
360 
420 
480 
540 
GTTAAATGGG


ACACCGTTGA 


AGCGAGTTAC 


CACGAAAAAG 


ATTTACTACC 


TTTATGGGTT 


600 


GCTGATATGG 


ATTTCAAAGT 


ACATCAGCCT 


ATTCTTGATG 


CCTTGTCACA 


AGTTATTGAA 


660 


CAGGGCATTC 


TAGGATATGC


AGTTGCACCT 


AATGAACTGT 


ATCAAGCAAT 


TCAAGATTGG 


720 


CAACGGCAAC 


ACCATCAATT 


GATTGTTGAA 


AAAGAAGAAA 


TTTTGTTTAA 


TAGTGGCGTG 


780 


GTACCCAGTT 


TGGCCACAAC 


CGTTCAAGCG 


TATACAGCAC 


CTGCTGATTC 


TGTCATGATT 


840 


TGTGATCCAG 


TCTACCCCCC 


TTTTGCAGAT 


GTGGTAAAAC 


AAAATGAGCG 


TAGACTCGTG 


900 


CGCCATTCTT 


TACTAGAAGT 


TAACGGACAT 


TATGAAGTTG 


ATTTGGTTAA 


AATGGAACAA 


960 


CAGATAATCG 


AAGAAAAGGT 


GAAACTGTTA 


CTTTTTTGTA 


ATCCCCATAA 


TCCTGGTGGA 


1020 


CGTGTTTGGA 


CAAAAGAAGA 


ATTGCTGGCA 


ATTGGCCGTT 


TATGCCAAAA 


ACACCAAGTA 


1080 


ACTGTGGTTA 


GCGATGAAAT 


TCATCAGGAT 


TTGATTTTCA 


AACCGCATAC 


CTTCACTTCC 


1140 


TTCACTGTCG 


CTGATGATGC 


ATTTAAGGAA 


TTCACTGTTA 


CCTTAACCGC 


AGCAACGAAA 


1200 


ACCTTTAATT 


TAGCGGGGAT 


TAAAAATTCA 


ATGCTGTTCA 


TTCCTAATGA 


AAAACTACGG 


1260 


CAATCTTTCG 


TATCTTTACA 


AGACnAAAAT 


CATCAAGGCG 


GCATCaATAC 


TTTTGGCTAC 


1320 


GTTGGAACGG 


CAGCTGCCTA 


TCAAACAGGG 


GAGGAATGGC 


TAACGGCATT 


ACTTGATTAT 


1380 


TTAAAAGAAA 


ATATTGATTT 


TGCCCTTTCT 


TTTTTCCGTG 


AAGAGCTGCC 


TAGCGTTCGT 


1440 


GTAATGGAGC 


CTGAAGGGAC 


CTATCTTTTA 


TGGTTAGACT 


TTAGTTCTTA 


TTCACTAACG 


1500 


GATCGAGAAC 


TTCGCGATAC 


CTTAATTCAT 


AAGGGGAAAG 


TGGTTCTAAA 


TCCAGGAATT 


1560 


AGTTTTGGTC 


CCCAAGGTTC 


TCAACACATG 


CGTTTAAACT 


TAGCTTGTTC 


AAAAGAAACT 


1620 


CTCGAAGAAG 


GCCTTTTACG 


TATCAAAAAA 


GCCTTTAATT 


AAAAACAGCT 


GTCCTAGTTT 


1680 


TTAGGACAGC 


TGTTTTTAAT 


TCTTTCTTTT 


GCGTTTAGCT 


AAATAAACTC 


TGACGAACCA 


1740 


GATACCTGAA 


AATAGAaGTA 


AAATTTCTCC 


AATTATGATT 


GTATAAAATC 


CTGGTTCATG 


1800 


AACGATTCCT 


TGCTGATCTA 


CATACGTAAA 


AAACGTTCCT 


ATAAATGGGA 


GTATTACTGC 


1860 


AAACAAAGCC 


AAGGCCATTG 


GATATCTGTA 


TTTTTCTAAC 


AAATTTGCTT 


CCTCTCCTTC 


1920 


TATTCTTTAA 


CTCAGTATCC 


TATAAAGACT 


TAATTAAAAG 


AAGAGATTTG 


GATGCTTTTA 


1980 


AAAAAAAGTT 


AGTCGTTTTA 


AGAAAwAAAT 


AAAGAGTACC 


AACGAATTTG 


TCCCTCGTTG 


2040 


GTACTCTCTT 


TTTCAATTGG 


TTAAACTTAT 


TCACCAATGA 


TTTCAACTGT 


GTCCATCACG 


2100 


ACGTCAAATG 


TTGGACGGTC 


TTGGGCATCC 


CGTTGGACTC 


CACCAATTTC 


ATCCACTACA 


2160 


TCCAtACCaT 


CAACAACATG 


ACCAAAAACA 


GTATGACGGA 


AGTCTAACCA 


AGGTGTTCCG 


2220 


CCTTGtTTGT 


AAGCTTCAAT 


AATTTCTTCT 


GGAAAACCAG 


CATCTTCTAA 


CTGACTCATC 


2280 


ATTTGAGCTG 


GAACATTTTG 


ATTAGTAACA 


ACAAAGAATT 


GGCTACCATT 


TGTGTTAGGA 


2340 


CCAGCATTTG 


CCATTGACAA


TGCGCCACGC 


AAATTGAAAA 


CATCCCGAGA 


AAATTCATCC 


2400 


TCAAATGCTT 


CTCCATAAAT 


ACTTTCGCCG 


CCCATTCCTG 


TTCCGGTTGG 


GTCACCGCCT 


2460 


TGAATCATAA 


AATCAGGAAT 


CACACGATGG 


AAAATAACGC 


CATCATAGTA 


> GCCCTTTTTC 


2520 
GCTAATTCAA CAAAATTTTT AACCGTTTTA GGCGCTTGTT CTGGGAATAA TTGAACAGTG 2580

AIGTCGCCTc .GG.TJTGTTTT AATGACCGCT TTTGGGCCTT TAGCATTTTC TAAATCTAAT 2640 
TG 2642 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 



v^MVjL> 1 1 rtrt I J. 


TTCCTTCATT 


TACATAAAAT 


TGAAACATAG 


TGTGACAAGG 


CGGCAACGCC 


60 


A T T P. A a C & C A 


CATCTTCTGG 


ATTCCAAGCG 


GAAACAATTA 


AACGGCGAGA 


ATCAGGATTG 


120 


^TTTTAJVTCA 


TTTCAATCAC 


ATTAGCTAAT 


TGATCGATAA 


AGCTAGCATC


TTTGGTTTCC 


180 


pA^TPApnpp 


ATTGTGCACC 


ATAAATATTC 


CCTAATTCAC 


CATATTTTTC 


TGCAAACTCT 


240 


P.P ATP A TTT A 


AAATGGCATC 


ACAGAATTTT 


TGATGTTCTT 


CTTTGTATTG 


TTCTGCAAAC 


300 


P,PTP,P,ATPTT 


GTAAGACACG 


ATGCCCAAAA 


TCCGTCATGT 


CTGGTCCTTG 


ATAATCTGCG 


360 


PTTTTT APPiT 


AACGTTCGAA 


CGCCCATTCA 


TCCCAAATAT 


GATTATTTCG 


CTCTAACAAA 


420 


T AGPG AAT AT 


TCGTGTCGCC 


TTTTAAAAAc 


CACAATAACT 


CACTTTTAAT 


CaAGcCAAAC 


480 


Pt^TAPTCGTT 


TGGTTGtTAG 


TAAGGGAAAT 


CCTTTGGCCA 


AATCAAAACG 


CATTTGATAG 


540 


CCAAATAAAC 


TATACGTCCC 


aGTTCCTGTA 


CGGTCTTCTT 


TAAAATGACC 


TTCTTCTAAA 


600 


ATCTTTTTTC 


CTAGCGCTAA 


GTATGCTTCT 


TCCATCTTAT 


TCCCTCCGTC 


TCTTAATCCG 


660 


CGTATTCACT 


CAAb J. Al 1 Ol- 


P A APf^ATTPA 


1 i 1 1 i 1^1 iv> 


TAAPAATTGT 


TCCGTTGCTG 


720 


TTACTGCTTC 


TTGCAATTCT 


TGTAAACGAA 


CAAAATCATC 


ACCCTGatGA 


TTCATTTCTT 


780 


CAGTTAATGT 


TTCTAATTTC 


GTTtCTAATT 


GTGTGATGTC 


TTCTTCAATT 


GTtGCCCACT 


840 


CTTTTTGTTC 


CATATAAGTC 


AGCTTTGTTT 


TttCTTtCTT 


TtCTGTCGAt 


GCTCTTTTAC 


900 


tGGTTtAtGA 


gsGGTTyTTc 


TTTGACtTTT 


CTTGTTCTTT 


TtGGTTCGCC 


AAATATTCAC 


960 


TCATCGTGCC 


ATAAAATGAT 


AGGAkCTGAC 


CATTTCCTTG 


AAAAACTAAC 


AATTTAkCCA 


1020 


TTGTTTTATC 


CAAGAAATAA 


CGGTCATGAG 


AAACCGCAAT 


TACCGCTCCT 


TTAAACGTTT 


1080 


GGATATAATC 


TTCTAAAATG 


GTCAACGTAT 


CAATGTCCAA 


ATCATTGGTT 


GGCTCATCCA 


1140 


ATAATAACAC 


GTTAGGCTGA 


CTAATTAATA 


ATTTTAAGAG 


ATATAACCGG 


CGTTTTTCTC 


1200 


CACCAGACAG 


TTTGCCAATC 


AGTGTGCCAT 


GCATAAAACG 


AGGAAATAAA 


AAGCGCTCTA 


1260 


ACATTTCAGC 


GACCCCAACG 


CTGCTTCCAT 


CACTGCGTTT 


CACTTCTTCT 


GCTGCTTCTT 


1320 


GCAAATAAGC 


AATCATCCGT 


TGATTAGGAT 


CCATTTCTTC 


GTTTTGCTGG 


GTATAATAGC 


1380 


CAATTCGGAC 


GGtTTCACCG 


ATACTATATA 


GACCACTCTC 


TAAAGGAATG 


CGTCCAGCTA 


1440 


AAATATTTAA 


TAATGTGGAT 


TTGCCTGCAC 


CGTTTTTACC 


AGTAATGCCA 


ATTCGATCGT 


1500 
TTGCTTGAAT


GAGTAAATTA AAGTCTTTCA 


AAAGCTGTTT 


ATGATCAATG 


GCATAATTTC 


1560 


CTTCCtTGAT 


TTCCAGAACT 


TTTTTCCCTA 


AACGCTGCGT 


GGCAAAGTCT 


GCTTCTAAGG 


1620 


TGCCATTGGT 


TTGTACCTGA 


TGCAAGTtTT 


CTTTTAAATC 


CTGAAACCGA 


TCAATTCTTG 


1680 


CTTGTTGTTT 


CGTtCCTCGT 


GCTTGAACGC 


cTGCTCGCAT 


CCaCGCAAGT 


tCTTGTTTGA 


1740 


AGAGGCGTTT 


TCTTTTTTCT 


TCTTGTTCTA 


CAGCGACACG 


TTCACGTTCG 


GCTTTTTCCA 


1800 


TCACATACGT 


TTCATAGTTT 


CCTTTGTATT 


CATACAGTTT 


TCCAAAAGAA 


AGTTCGAAAA 


1860 


TCCGATTGGT 


TACTCGATCT 


AAAAAGTAGC 


GGTCATGGGT 


CACCATTAAA 


ACCGCGCCAC 


1920 


GATAGTTATT 


TAAAAAGTTT 


TCCAGCCATT 


CAATCGTTTC 


ATAATCCAGA 


TGGkTCGTTG 


1980 


GTTCATCCAA 


TAACAGTAAA 


TCAGGCGCTT 


CAATTAAGAC 


TTGCGCCAAG 


CTTACTCGTT 


2040 


TTAACTGCCC 


ACCAGAAAGC 


TCACCTATTT 


TCTTTTCCAA 


TGTTGGAATG 


CCTAATTTTT 


2100 


GTAAAATAAT 


TTTTGCATTG 


GTATCTGCTG 


TCCAAGCATC 


TTGTTTATTC 


ATGGCTTCTT 


2160


CTGCTTGCGT 


GTAACGTCTT 


TGAACCCCTT 


CGTCTAAACC 


ATTCTCACTT 


AAAGCAGTAA 


2220 


CGCTAATTCA 


TAATTTTTGA 


CCGCTTTTAT 


GATCGGGCTG 


TCTCCCTGAA 


ACACGGCTTC 


2280 


TGCAATCGTA 


ATATTCTCGT 


CAAATGTTTG 


ATTCTGGGAA 


AGGTACCCAA 


TTTGATAATC 


2340 


ACTTGGCTGC 


TGAATCGCAG 


CAAGATCACC 


ATCACCGCTA 


TCTACGCCAG 


CAATAATATT 


2400 


TAATAAACTG 


GTTTTCCCTG 


TGCCGTTAAT 


CCCAATTAAA 


CCAATGCGAT 


CTTTTTCATG 


2460


AATCAAAAAA 


GAAAGATGAT 


CAAATAGTGT 


TTTTTCCCCA 


TATGTTTTGG 


TTAAATCTGT 


2520 


TACTTTTAAT 


TCTTTCACCG 


ACACTACCAA 


CTCCTTCTAT 


TTTGGTCTTG 


CTTATTGTAT 


2580 


CAAAAATTGG 


ATTATAAAAA 


AAGTGCATGA 


GATAAAAATC 


GTCTTGGTTT 


CATCTCATAC 


2640 


ACTATTACTT 


ATTCTTTATT 


AAAATTTTTA 


ACAAATCGTT 


TCAGCGGAAC 


ATTATTCACT 


2700 


TCGCCGCTAA 


TCGAATAACT 


CTCTAATAAA 


TTGACATTTT 


TTCGTAACTC 


GTTCGCTTCG 


2760 


ATAATTGAAA 


TGCGTTCAGA 


TTGTTCGTAA 


TGATCAATCA 


TCCGTCGTTC 


TTCTTGGAAG 


2820 


CCTAATAATA 


GGTCTTTGAC 


ATAGTCTGTC 


TCCGTATGAA 


CAATATACGA 


ATCAATAAAT 


2880 


TCGCCTTTTT 


CAAACTTATG 


GGCTAAATGC 


AAGCGTCCGT 


CAATGAAGAA 


ATCAATTAAA 


2940 


TCATCATCAT 


AGACACCGCG 


CAAATTATCT 


AAACTCTTCA 


GAATAACTTG 


AGAATTTCTT 


3000 


AGAAAAACTT 


GGCGAATATT 


TTCTAGTTCT 


TCATTTCGTA 


ATTCTTTTTC 


TTCTTCAGAA 


3060 


GCTTCACCAC 


GATTGAAAAA 


TAGCTTAGGA 


TGTAAAAGAA 


TCGAAATGAT 


TTGTCGAGCA 


3120 


AA 1 AA 1 AALC 


AAAAGCCTAC 


AATCGATAAT 


Zi "h. T T (2 T T"T*f^ (Z 
t\r\ 1 lul i loo 


CAATCGACCG 


TTGCATCCGT 


3180 


GTAATCAAGC 


GTTGATACAT 


ACGATAACTT 


CTGACACTAA 


TTTCTCCTTG 


TTTCAGACTT 


3240 


TCTTCTAAGC 


CGTCTCGTTC 


AATGGACACA 


ATTAACGTCC 


GCAACTCTTG 


CACTTCTTGT 


3300 


TTTCGTTCTG 


CaCTTAGCTG 


cTCGGTATAC 


AATTCTTTTA 


AACGATCTTG 


ATAATTTTCA 


3360 


ATGACCGCAT 


TCATTTCACC 


ACGAGGATTC 


TCTTCATTTT 


GCTGTTyTAA GACATCAATG 


3420 


ACATCACGTA 


ATAAATCAAT 


TTCGACTGAC 


TCTTCATTTT 


CCACTTCCGT 


CTCTGAATCA 


3480 
G77AAA.A73 g7aaaa::ag aatcccgcca a7cagag7:a c:a*aa7aac acaagccgta

A7AAA7AG7A AtAAZG-CG 77C777AA^ TGCTCGCC3T 7AATCGC7GT TGGCAAAATA " 
A^GA7GG7G3 C7AAAC7:AC 7G773C77TA ACACCGC7AA AA7-TTAAAA7 CaGCATTTCA 

-caTT:-:c-r 7777Aa::77 c77AAG7:ca tg7Tgaa:-:g tstaaacac? gaaaatggct 
lic-aj^zg--^ t:aaaaa:-ac aa:gscg:ta ataattag.-a caaccag7A.g taggaaccag 

7773AA7A77 G77S377AA7 C7AAAT7GGC GAAAACAC7T GAGrAAG77C AATTCCTAAG 

AA7AAGAAA3 C7AA.3GCA77 TAAAGTGAaC TCAATCAAaC TCCAAATACT TTCGCCAACA 

777GACA3C? CAG~7 3AAA AA3CGAAACT TT7TTCAAAC 7AGACGC77G CATAATTCCA 

G3AATCA7A3 CAGCAA7AA7 77CAGAAACG COAT^ C7GAAA7TAA ATAAGCAATA 

- AA? 3G C- AG A G7AA773AA7 G.ACAAG7AG CC7GTAACAT CT7TTGCCAC TGCAGTTTCA 

AG7AAA73AA CAA7r7G:7G T7TTAA7AGA ACSAACAC3A TTCCGACTAC CGCTCCACCA 

ATGCTGGAAA 7AAC7AAGGT CCCTGCCGCT TCCACCGG2G AAAATGACCC CGTTAATAAA 

GCC3CTAA73 CAAA77G^AA GGCTGTAACG CKGA7GCAT CA7TAAT GAG TCCTTCACCA 

GA7 AAAA7 GT T7A77GAACG T7CAGGAATG TCAAT7CGAC C7GATAACGA ACCTACCGCC 

ACCGCA73C3 T7GGCCC7AA ASCCGCTCCA AAAGCGAA3C AAGCAGC7AA TGGTAAAGTA 

GGCAAAA7CA TA7C-7AC-3G7 CCAGCCGACT COACTAA3G 7 A--T T AACAC ACCAAAAAAG 

GCCAAGAAAA GAA73A7:~ GAAA7TT7TC ATCATTGC7T TCAGAT7GAT TAGCTCCCCT 

ICt3TA^A7A AAAG3GG7GC AA7AA7CATT AC7AAAAAZA C77CAGGC7C AAATTCAATT 

• TCC7GC:C7A 7777AG77aA ACCAA7TAGA ACGCCTAC.-A TAATT 

(2) INFORMATION FOR SEQ ID NO: 316:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2726 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316:
7nTAC77CTA C7G7CACr.7G CGGTAnTAnT TTCCTTTG7G GA7TGTA7TT GTTTATGCAC 
C7A7T7GC7T 77CG77A7GG G3ACATGGAA AT7GGCTAGG GAAAATGGGA GTACTGGATT 
AT7CAG3CGG A77AG7A:-73 CA7ACTA3CG CTGGAATA3G AAGTCTTG7T TTAGCAATAA 
CA7CAC7AAT A3GTTTAAA3 AAT7CAT7GA, TATTTAAA7C ACAAGAAATG ATAGCTTTTG 
7AGGAA7GCT G777AT7ACC 77AG3A7GGT TCGGATT.AA TA7GGCACCT TCTGGAAAAA 
7CG3TGAASA AGC3AT77AA G777GGT7AA ATACACT7AT TTCAATT7TA GGCGGTAGCA 
7TAGC7GGAC 7777ACC7AA 7G3A7TT7AA TAAAAAAAGT A77TATC7AT TCTATAATGA 
ATGGAA7AA7 C3GCGGA77A G7AG3TAG7A CT7GCTCG7T 7GGATA7G7A wCTCCTGCAT 



3540 

3600

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4605 



60 
120 
180 
240 
300 
360 
420 

480 
TTAGTTTATT


AATTnCGGTA 


AtTGkATGtA 


CCCTTTGtCC 


AATTGTwATt 


CACATAATGC 


540 


ATCTAAGCAT 


AGCAAATTTT 


GATGACGCTG 


CAGATTCTTT 


TGGGATGAAT 


GCAGTGGGTG 


600 


GAATTGCAGG 


ATCGATTTTA 


ACTGGTGTCA 


TGGCAGAAAA 


AGGGGACTTT 


TTTCTTCAAT 


660 


TATTTGGGAC 


TTTTTTAATC 


AGTATTTGGA 


GCCTTAGTTT 


AAGTCTTTTG 


ATTCATTATT 


720 


TATTGAAAAA 


AATAGTAAAT 


GATTGTTATT 


CTCAATGTAT 


TAAATAAAAA 


AATTAAAATT 


780 


ATTPPTAGTG 


TGTATG^ATA 


TCAAAATAAT 


AGTCAATAAT 


AACACTCAGG 


AAGAAATTTG 


840 


AAf^APTTTAA 


ATTTTAGTGG 

i \ x x x x nw x \j \j 


AGGTAAATAA 


TGGAAAATAA 


AGAAGTTATA 


ATTATTGGAG 


900 


p n p n. pp,p r 


TGGPGTAGGT 


ATGGGGGTTG 

fx X wVJvVJW X X \J 


CTTTAAAAGA 


TTTTGGCATA 


AATAATTTTG 


960 


PTZiTTTT Af^ A 
Linl X 1 1 nun 


HAGAAAGCaG 


GTAGGTAATA 

x n \j vj x **** x * + 


GTTTTATTAA 


ATGGCCTGAG 


CAAACAAGAT 


1020 


TTATPAPTPP 


TTCTTTTAfA 


AGTAATGGAT 


TTGGGATGCC 


TGATTTAAAT 


GCAATAGCAA 


1080 


TTf^ATAPATP 


TPPTTCATAT 


ACTTTAGGTG 


AGGAACGTTT 


ATCTGGAAAA 


GATTATGCTA 


1140 


AATAPPTACA 


ATTAGTTTCA 

xi X X £*\J XXX x^/* 


GAAGAATATA 


AGTTACCTAT 


TAAGACAAAT 


TGTAAAGTTC 


1200 


AATCTATTAA 


AAAAGAAAAA 


ACTGGTTATT 


TGCTTGAAAC 


AACAAAAGGA 


TTTATTTATG 


1260 


CAGAATATGT 


AATTTTTGCG 


ATGGGAGAGT 


TCTCTTTTCC 


AAATAAATCT 


AGTATTAAAG 


1320 


GTGCTTATAA 


GAATAGTTTA 


CATTATGGCG 


AGATTAATTC 


TTGGATTGAA 


ATTAAAGGGG 


1380 


ATAAGCAGAC 


GATTATCGGA 


GGAAATGAGA 


GTGCGATTGA 


TGCAGCTATT 


GAATTAGCTA 


1440 


AATTAGGTAA 


ACGAGTTACA 


ATTTATACAG 


ATACTTTGGG 


GTTAAATATA 


AGAGATGCTG 


1500 


ACCCGAGCAA AAGgTTAGCA 


CCACGCACAA 


GACAGAGATT


TTTTGATTTA 


CGTGTAAACC 


1560 


AAAAAAAATT 


AGATAGTATT 


AAAATATACA 


CAACTACCAA 


AGTTAAAAAA 


ATTGAAAAAA 


1620 


AGTCTACTAG 


TTATATATTG 


ATTACAGAAA 


ATGGGAAATC 


ATTACCTGTA 


GAAAATATAC 


1680 


CAATTCTTTG 


TACAGGTTTT 


AAAAATGGCA 


CAAAATCTAT 


TACTGCATCA 


TTGTTTAAGT 


1740 


ATAAGGAAAA 


TGGAGAGGTT 


CTTTTAAATG 


ATTTTGATGA 


ATCGACAATA 


GCTAAAAATA 


1800 


TTTTCTTAAC 


CGGTCCAAAT 


GTACGTAAAG 


GTAATACCAT 


TTTTTGTTAT 


ATATATAAAT 


1860 


TTCGCCAACG 


TTTTGCCGTA 


ATCGCTAATG 


AAATTGCCCG 


ACGAAAACAT 


ATTACTATTG 


1920 


ACGAAAAAAA 


ATTGTCATAC 


TATAAAAATC 


AATCTTTTTA 


TTTAGATGGT 


TGCTCTGGGT 


1980 


GTGAAGTAAG 


ATGTAGTTGT 


TGACAAATAA 


GTAATTGTTA 


CAACTTATTT 


TATAAATTAA 


2040 


AAGATTTTTA 


AATAATTTAA 


GTCAAGTCCA 


GACTCCTGTG 


TAAAATGCTA 


TACAATGTTT 


2100 


TTACCATTTC 


TACTTATCAA 


AATTGATGTA


TTTTCTTGAA 


GAATAAATCC 


ATTCATCATG 


2160


TAGGTCCATA 


AGAACGGCTC 


CAATTAAGCG 


ATTGGCTGAT 


GTTTGATTGG 


GGAAGATGCG 


2220 


AATAATCTTT 


TCTCTTCTGC 


GTACTTCTTG 


ATTCAGTCGT 


TCAATTAGAT 


TGGTACTCTT 


2280 


TAGTCGATTG 


TGGGAATTTC 


CTTGTACGGT 


ATATTGAAAG 


"GCGT'CTTCGA" ATCCATCATC 


2340 


CAATGATGCG 


CAAGCTTTTG 


AATATTTTGG 


TTGATCGATA 


TAATCATGAA 


TCAATCGATT 


2400 


TTTAGCCTCA 


CGCGCTAAGT 


TAATATCTGT 


GAACTTAAAA 


ATTCCTTTAA 


CAGCTTCTCT


2460 
GAAAGATTTT GAATTTTTTT TAGGAATGGT GGTAAAGATA TTTCTTAGGA AGTGAACTTG 2520

GCATCyTTGc CAmCTTACGT TGGTGAAGGA TTTTCTAATG GCAGAGACTA ATCCTTTGTG 2580

CGCATCAGAA ATAACGAGTT CCGTAcCTTG TAAACCGCGT CCTTTAgGAw TCmAAAAATG 2640

TkGTCCAGGc TCTCGCTTCG cCACTTGAAT CAGaAGCgAT AATTCACGGC GCAnCTTGGT 2700 

ATCCAACGGC AAAGAAGCTT TGAGnG 2726 
(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

ATAAAAAACT ACAATTTTTT GGTACAAATA GTAATGTTTT TTTAAGTTTA TTCAATACTG 60 

TTGGGACAGT AATATTGATA GCTATAGGCA GCTTATTAGT ATTAAAGAAT AGTATAAATT 120 

TAGGAAATTT AACAGTATTA ATTTCATTAT CAGGAATAGT AGTATCATCC CTAAATCAAT 180 

TTGGGTCGTT TCAAGCGCAA TATGAATCGA TGCAGGTGGC TAGTCATAGG CTTGAATCTA 240

TATTAATAAA TATGGAAAAT GAGAATGTAT GCGGGGAAAt AATTTTAGAC AAGAAAATAG 300

AATCAATAAG ATGTAAACGT GTTTCCATAA AAAAAGGAGA TACTTTATTA CTCGACACTG 360

TTAACTGTGA AATCTATAGA GGGAAAAATC TTTCAATACG AGGTGAAAAT GGTTCAGGTA 420

AATCTACATT AATTAAGTCT TTAGTAAGAT TAGATGATGA CTATAGGGGA CAGATTCTAA 480

TTAATAATAT TGACATAAAA AAAATCAACT TGGACTGTCT ACGATCAAAA TTAGTGTTTG 540

TAGAGCCCAA TCCTAAATTT TTAGAAGGTA CAATTAGAGA TAATTTATTA TTGGGACATA 600 

AAGTTCCGAA TAGTATATTT AATAAGTTGA TAAGAGATTT TGAAATAAAC AAAATATTAG 660 

ACGATCTACC TTTAGGTATA AACTTtCCTG GAGAAgCTGC AATAAAATGC TTATCTTCTG 720 

GACAAAAACA AAAGCTAGCC T 741 
(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

AGTAAATAAA TGAATACATT GCAATTATTT AAGCAGGCCT TTTTTTCAGT TTCAGAATTA 60

CGGCATGCGA AAAAAAT-GG C- -GTTTTGGAAA GTAATTCTCT ATGCTGTTTT TTTAAGTGCT 120 

ATTTTAGCAC TACCTATTAC CAAACAAGTC TTTTCTGTGC TACAAGAGGT GCAACAAGAT 180 

GGCCAAAAAA TTGCAGAAAA GTTGCCTGAC TTTTCTATTG AAAATGGCAC GTTGCAAACA 240
AAGGCCAAAG


AATCTGGCTT 


TATTTATCAA 


ACCGACTCGA TTATTTTTAC 


GTTTGATCCA 


300 


GATGGAAAAC 


GTACAGCCGC 


TGATGTCCAA 


AAGGATTTAA 


TTGGTAATGC 


GTTTGGTTTG 


360 


GCTTTTTTAC 


AAGATGAATT 


TGTAGTCGCC 


TTACCAAATT 


CAGGCGCAAC 


TGAATCAATT 


420 


CTAGGCACCG 


ACCAATTTAT 


CTTCCCCTAT 


AGTAAAGGAA 


CCTTAGATGG 


CGTTAACGCT 


480 


CAGAGTATCA 


AGACTGCGCT 


AAGTGAGGCA 


GCGATCCCCT 


GGTGGACAAA 


ACTCATTGTC 


540 


TTTTTAGTGG 


CTATCTACCC 


TGTTCTAATC 


GGTTTAGTTT 


TAGACTTACT 


GATTGCCGCT 


600 


ATCGGTGCAT 


CTCTCTATTC 


AAAATTACGT 


TTTTATTCTT 


TACGTTTGAT 


TGATTGTTTA 


660 


AAAATTATTA 


CGTATTGTGC 


TACCGTGCCA 


GTTATCTTAA 


GTGCTATTTT 


AAACTTCGTA 


720 


AATCCGATGT 


TTAACGATGG 


CATCCTAGTT 


ATTTTACTAA 


GTCTGTTCTT 


TTTCTTTGTC 


780


ACAACCAAAA 


ACGAACCACG 


GAATATCCCT 


GGTTCTTAAA 


AAAAGGTGTT 


GTCTTAAAGA 


840 


CAACACCTTT 


TTACTATTCT 


TCAAATTCTT 


CTTCATCTGT 


TAATTCAGGT 


AAGTCAATCG 


900 


TGGTCACCGC 


AGATTGGCCA 


ATTTGCGTTA 


AATATGCAGC 


TAATTCATCC 


ACATTTTCCA 


960 


CAGGTGAGAC 


GGCTGCTTCA 


ATTGCCATTG 


CTAAATTAAG 


TCCAGCGACT 


ACTCGCAATT 


1020 


GCGGATAAGT 


GCCCATGGCC 


ATCATAGCCA 


CATTACAAGG 


TGTTCCTCCT 


TTTAAATCAG 


1080 


CTAATACAAG 


CGTCGGGACA 


TTGCCAGCTT 


CTTTTAAGAT 


TGCTGCCAGT 


TTTGCTTGCG 


1140 


TGCCGCTCAA 


GCCATCTTCT 


GCGGTCATTG 


ACACAATCGC 


AGCATCTGCT 


AATTCCCCGA 


1200


CAATCaTTTG 


aGTAGAAGCT 


AAGGTTtCTT 


CTGCCATCCG 


TCCATGGcTC 


ATTAAAATTA 


1260 


ATTTTGGTTT 


CATCATATCC 


TCCTAAAAAT 


ACTGCTGATA 


AATTTTTGTT 


CGTGCTTTCa 


1320 


GAAAGCCACG 


TTCTTCACGA 


GTGGGaTGAA 


TCCGATTCGT 


TAACAAAATT 


AGGGCTTCTT 


1380 


TGGTCTCTTT 


TTTCATACCA 


ATCGCCGTTC 


CAGTAAAGCC 


AGTATGATAC 


AAATAGTCTG 


1440 


CCCCTGCTTG 


CGTCTTTTTC 


ACTTCCCAAC 


CAAATGTACG 


TCCCCCTTGA 


TTCATAGACC 


1500 


AGAGAGCAGT


GAACATTTTT 


TCAGAAAATA 


ATGGTTTGCC 


AGACGGGTAG 


CGATTGTTCA 


1560 


TAAAACACTG 


aACAAAAGTG 


AGTAAATCTG 


GTAAAGTTGC 


AAATAAACCT 


GCACTGCCAA 


1620 


TTGGCGCTTT 


TAGTTGATAG 


GCTTTCGAAT 


CGTGGACCAC 


TCCCTGAATA 


ACGCCGCGTT 


1680 


GTTTTGTAAT 


CTCTGTGGGC 


ACTGCCTGGG 


CAAAATCGCT 


GACCGTAAAA 


CTTGTATGCT 


1740 


GTAGGTTCAT 


TGGTTGAAAT 


ACGTATGTTT 


GAAAGCTCTC 


TTCTAATGAG 


CAGCGATCCA 


1800 


ATACTTGAAT 


AATTTCTCCC 


AATAAGTAAT 


ACCCTAAATC 


AGAATAAGTG 


GTTTTCCCTC 


1860 


GTTCAGATAA 


CGAATGCGTT 


AAAATGATTT


CTTGCAAGCT 


TTTTTTTGTG 


ACATTTTTCT 


1920 


TATCGACAAC 


ATCAGCTGGC 


AGTCCACTCT 


GATGCAGTAG 


AAGCTCCCCA 


ATCGAACAAG 


1980 


ATAGTCCTTG 


ATAATCAGGT 


AGGATTTCCC 


CCACAGTGGT 


TGAAAATGTA 


AGCTGATTTG 


2040 


TATCGATTAA 


TTGTAACATT 


CGATTTGTTG 


TCCCAATAAC 


TTTCGTTAAG 


GACGCTAAAT 


2100 


CATACAGGCT 


AGCTGATGAA 


ATCTTTTGTC 


TTGCAAATGG 


ACCAGTATTA 


CCTAAAAAGC 


2160 


CCGTATACTG 


TTCTGATAAG 


CTATCTTCTG 


TGATCGTCCC 


CCAAGCGACA 


CCCGGAACGA 


2220 
CTTGCTGATT


AATCAGTCTC 


TGAATTGTTT 


CTTGAATTGC 


GATTTGATTC 


ATTGTTCACC 


2280 


ACTTCTATGC 


TAAAACTTCT 


GAAATCCGTC 


CGTCATGTTC 


TGCTAATAGT 


TGTTTGGCTT 


2340 


TTTGAAAATC 


AACGTGGGCT 


TTCGCCATTA 


CAATGGCTGG 


TGCCACTTCT 


AGCTGTGCTG 


2400 


CCTCCAAGTA 


TTCTTTGGCT 


TGCGATTCCT 


CGATTTCTGC 


TGCTTCTTTA 


ATGATATTCG 


2460


TGGCACGTTG 


AATTAATTTT 


TCGTTCGTTG 


GTTGTACATT 


GACCATCAAA 


TTTTGATAGA 


2520 


TATTCCCTAC 


TTTAACCATA 


ATTCCTGTTG 


AAAACATATT 


TAACACCATT 


TTCTGTGCTG 


2580 


AACCCGCTTT 


CATTCTAGTT 


GACCCTGTAA 


TGACTTCCGG 


TCCTACAATT 


GGCGCAATCC 


2640 


CAATTTCCGC 


CAATTGATTC 


ATCGGACTTT 


GATTATTACA 


GGTAACTGAA 


ATGGTTAGAG 


2700 


CGCCCACTTT 


TTTTCCGTAT 


TCAATTGCGG 


AAACAGCATA 


TGGTGTCCGA 


CCACTAGCAG 


2760 


CAATCGCAAT 


TACGACATCT 


CGGGCAGTCA 


ATTGATGTTG 


CGTTAAATCT 


TCGATAGCTA 


2820 


ATTCTTTCGA 


GTCTTCAGCG 


CCTTCAATTG 


CTTGATACAT 


TGCTTTTTCA 


CCACCAGCTA 


2880 


AAATACCAAA 


TGCGCGTTCT 


GGCGACACAC 


TATATGTGGG 


TGTTAATTCA 


ATCGCATCCA 


2940 


AAGCACCTAA 


ACGTCCAGAC 


GTTCCTGCAC 


CACAATAGAT 


TAAACGGCCC 


CCTTTTTTAA 


3000 


ATCGTTCTGC 


CGCTGCATCA 


ATTGCTGCAG 


CAATCTGCGG 


AAGCACCTTT 


TCAATTGCTT 


3060 


GTGCTACCTT 


TTGGTCTTCT 


TGATTTATTA 


AAGTTACCAT 


TTCTAATGTT 


GACACTTGGT 


3120 


CAATCTTTTT 


AGTCGCTTCA 


TTTCTGGCTT 


CTGTCGTTAA 


TCCTTCTAAG 


TTCATTTGCT 


3180 


TGCACACCCC 


TACTTCTCTT 


TTAGACGGCT 


GACAAGTTGA 


CTCGTTTCAA 


TCATATTGCG 


3240 


TTCAATTTCT 


GTGTCTAAAT 


CATCTTGAAT 


TGAACCTAAA 


TACAGAACAT 


CGCCAATTGC 


3300 


CATTGACGAA 


GCAATCGATG 


AAATAGCCCC 


AACTCGAATG 


AGGTGCTCAT 


TATTGGGGAC 


3360 


TAAAAGAACT 


TCATCGCTCA 


ATGATCGCAC 


GTTTTCTCCA 


TCGTTACTGG 


TAATAAAAAT 


3420 


AACCGTTGCG 


CCTCTATTTT 


TGGCAATTTC 


ACAGGCGATT 


AAGGCTTCTT 


TTGTCAAACC 


3480 


ACTATAGGAG 


ACCGTGATCA 


AAATGTCTTG 


TKCCGTTGCA 


TAATTTAAAA 


ACTCCAACAT 


3540 


CATGTGACTA 


TCATAATTAA 


AAATTGCTTG 


GCGACCAGCG 


CGGTTAAACT 


TGTGATAAAG 


3600 


ATTATAAGTC 


GTTAAAGAAG 


AGGCACCAAT 


TCCAACCAAA 


TGAATTTTTT 


CCGCTTTTTT 


3660 


AACATGCGCG 


ATTGCTCGTT 


CCAAGGCTTG 


TTTATCCACT 


GTATAAAATA 


AATCCTCAAT 


3720 


CGTAGCATTT 


AATAAAGATT 


CCACTTTCAT 


ACAAATAGTA 


TCCACCGAAT 


CATCCACCGC 


3780


TACAATTGGA 


TCAATATCTT 


TTTTACCGTT 


ACTGGCAGCT 


TGTACCGCTG 


CAATGGATAG 


3840 


TTTTAACTCT 


TCCCAGCTGT 


CAAACCCTAA 


TTGTTTAGAA 


AAACGGGTAA 


CTGACGCCGG 


3900 


AGAAGTTTCG 


CTATTGCTAG 


CAATTTCGTT 


TGCTGTTAAT 


GAGAGAAGAC 


GGGTCGGATC 


3960 


CTTCAGTATA 


TAGTCTGCGA 


TTCTCTTATT 


TACTTTGGAA 


AAGTCCTTAT 


ATTTTTGTCT 


4020 


GATTGAACGG 


ATTGGCTCCA 


TCTTCTCATC 


TCCATTTCTA 


TTTTATGAAA 


GTTAAAATTG 


4080 


CTTCGTTTGG 


TTCAGAAGGG 


ATCATCTGTG 


CGGTAATAGT 


CACCCCATTA 


TCTGTTAAAC 


4140 


GACGAACGGT 


TTGAATATCT 


TCATCGGTTA 


ATGCAACAGA 


TTTTTTAATT 


GCTCGACTGC 


4200 
CTTCTTTTTG


AGAAATATTG 


CCGACATTGA 


AGCTCTGCAA 


CGGAACACCG 


CCATCAATTA 


4260 


AAGCCGCCAT 


ATCTGCAATG 


TCTTTCGTAA 


TTAAAAAGAC 


TTTTTCTTCA 


TCATAATTGC


4320 


CTGCTAAAAT 


TTTTTCAATC 


GCTTTACGTT 


TTGATAAAAT 


GGATAATTTA 


ACTCCTGCaG 


4380 


GCTTAGCCAT 


TTTTAAGGCA 


CCAATTTGTA 


ATTGATCTTT 


GACAGCGGCA 


TCATTTACTA 


4440 


CCATGATTCT 


TTGTGCGCCA 


ACCGTATTCG 


TCCAAACGGT 


TGCAACTTGT 


CCGTGAATTA 


4500 


AACGCTCATC 


TACTCTTGCA 


TGTACAATTG 


TCATAATAAA 


TCTCTCCTAT 


TATATACTAT 


4560 


TTTTTTAAAA 


TAAACCGCCA 


ACCCAAGCCA 


ACATTGGTTG 


TAAATTGCCT 


AAAACCATGC 


4620 


CAATtAAAAT 


TAAAACAAAG 


ATTAGCCGTG 


TTGAATTCAT 


TTTCTTTTTC 


CCTAGTAACC 


4680 


AATAGGATAA 


AAGAACAATC 


CCTAAAGGAA 


TTAATGCGGG 


TAAAATTTTA 


TCCAACATTT 


4740 


CTTGAACAGA 


TAATGTTACA 


TCTCCCATTT 


TATAAGTTAA 


ATCTAATTTG 


TAAGTAATTA 


4800 


CCGAAGGAAT 


CAAACCGCCG 


ACCACTGTTA 


ACCCTAAAAT 


GGCTGCrCCT 


TGCGTAAACA 


4860 


ATTTCATTTT 


ATCCGCAAAT 


TCTGTCGCAA 


TTTTTTTCCC 


TTGATGGTAA 


CCAATCCAAG 


4920 


TGAATTTATA 


GCGAACCCAT 


AGGACCACTG 


CTCCGGCAAT 


TAACGGGATC 


AGTAACCCAA 


4980 


CAACCTGGCC 


CCCTTGCGCC 


ATGTAAGCTG 


CAATCGAAAA 


AACAATCGsA 


CGrTAAATnG 


5040 


CGATAAAAAT 


GGTATCACCA 


ACACCAGCAA 


ACGGACCCAT 


TAGACCTGTT 


TTCAAACCAG 


5100 


TAATGGCCGT 


TTCATCTTCA 


ATGCCGACTT 


CTTCTTCCAA 


AGCCATATCA 


GCTCCGACAA 


5160 


TTAAGTGCGA 


CATGGCTGGC 


GTTGTATTCA 


TAAAGCCAAG 


ATGTGTCTTC 


AAGGCTTTCT 


5220 


TCATAGCTAC 


TGGATCTTCA 


CTGTATAGTC 


GCTTTAATGC 


AGGCATCATC 


ACCCAAGCGT 


5280 


AACCAAGTCC 


TTGCATTTTT 


TCATAGTTCC 


AGCCCCACTG 


ATAGCCAAAA 


AGATTACGGA 


5340 


CGTAAACTTT 


ATTAATATCC 


TTTTTTGTTA 


ATTTGTTAGT 


CATCAATCTC 


AACCTCCTCA 


5400 


TCATCTTCGT 


AAATAACCGT 


GCCACTTGTT 


GTAGTGCGCC 


GCCTTTGTTA 


TGATTCATCA 


5460 


CATAAATCGC 


TGCTAAAGCC 


AAGCCAACCA 


GTGCGACACC 


TAAAACTGAG 


AAGAATTGGG 


5520 


CCCCATAAGC 


AAGTAACACG 


AACCCAnTAA 


TAAAaTAAGG 


CCAGTATGTT 


TTAATTGGCA 


5580 


AGTAACGCAT 


TAAAATAGCA 


ATCCCCaTCG 


CTGGTAAAAT 


AGCGCCTGCT 


GCTTTTAACC 


5640 


CATTCATCAC 


CCAAATAGGA 


aTCCAATCAT 


TGATAGCGTT 


TACAACTTGT 


TCACCAAAAG 


5700 


CTAGGCCAAT 


AAAAACTGGG 


ATGACACGAG


AAATCGTCCA 


AGGAAAAATT 


CCTAAAACAT 


5760 


TGCAGCGCTC 


GACCCCTTTA 


TAATTTCCTT 


CTTCCGCATA 


ACGATCTGCT 


TTGTGTTGAA 


5820 


AAAACGTATT 


AGCCATTCTG 


CCTAAAATAT 


CTAGTTGTGT 


TAATAATAAC 


CCAATTGGAA 


5880 


CCGCAACTCC 


GATACCATAC 


TCTGCCCCTT 


TCCCTGAAAT 


GATTGCATAA 


GCCGTTCCCA 


5940 


TAATGGCCCC 


TGAAAGAAAG 


TCTGGGACTG 


TTGCTCCTCC 


ATAAGTCGCT 


ACACCTAATG 


6000 


TCATCAATTG 


AAGCGTTGCC 


CCAATCATCA 


AACCTGTTTG 


AACATnAaCC 


TAAGACTAGG 


6060


CCAGTAAACG 


TTGCAGCAAA 


CAATGGTGTA 


TATGGACCAA 


TCTGAATAGA 


AATTTGGTCA 


6120 


AGTACACCAA 


CAACTGTGTA 


AAGACATAAA 


ATCAAAA 






6157 
(2) INFORMATION FOR SEQ ID NO: 319:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2378 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 



CAATCTCAAA 


TAGCAAtCAG 


GCTTCAGTGG 


ATATTTTAAA 


GACAACTGCT 


CATAGGGGAT 


60 


TAACCCCAAG 


CAACATAAGT 


ATCGATCCTT 


CACCCAGTTT 


TTCGACGAAT 


GGGGATCAAA 


120 


TAATGTTATC 


TAGTGTGACT 


AGAGAATCAA 


TGATAAACAA 


ATCAATTAGG 


TTTTCTTATG 


180 


CACCTCAAGA 


TTTAATAGAA 


GGTTCTACAA 


CAGGAACTAA 


ATCATTTAGT 


GGCACAATCA 


240 


ATTTTTTTGA 


AGTTAGCAAT 


TTAACTAGTG 


ATTTAAAACC 


TAATTTAAAA 


AAAGTATATT 


300 


CAGGAGATAA 


AGTAGAATGG 


ACTTGGGAAA 


TAATTAATAC 


ATCTTCTAAA 


CAAGCGTCGA 


360 


ATATTTATTC 


GGAATTAAAT 


TTACCTGTTG 


GATTAATTAT 


TGATCAAAAC 


AGTATTATAA 


420 


AGAACGGAAA 


TGTAGCAACG 


ATAAATGATC 


TTAACGGAAC 


GAATAACCTA 


GGAGCATTAG 


480 


ATTCTAATCA 


AACAATAAAA 


TTTACTTTTG 


aaactatagt


ATCTGGAAAT 


TCAGGAGAAT 


540 


GGCTGGAATT 


AAACGGAAAA 


GTAAATTGGG 


AAGATCATGG 


GGCTCGTTTT 


GTGACAAGTT 


600 


CAAATCAAGT 


GAAAATAATA 


GATGATGAAC 


AAAAGGATAA 


AGCTGAAGAG 


ACAGATGAGT 


660 


TAGAGATTCT 


TTCAGTACCA 


CAATCTTTTA 


AATATGGCAT 


CCTAAATAAA 


AGTAATGTCG 


720 


AACAAATCAT 


TCATTTGAAC 


ACnAATAATT 


ACCAGACAAA 


TACTTCGGTG 


GTTACGGATG 


780 


GATTTTATAC 


ACGTATGAGA 


GATGAGCGTC 


CTAGTCTTAG 


TGGrTGGrAA 


CTAACTGCAC 


840 


AGTTATCyGr 


TTTTAAAGAk 


GTTmCAAtTy 


CAcGGTGTTA 


aCGAATtCTG 


GGrTAGCyTT 


900 


AAAAATGGaA 


AATATGAAAt 


TGAATCTATA 


AAAAATCGAG 


ATAsTCCACA 


AGAGAGTATT 


960 


GATCAAAATC 


CAACAGGAAC 


ACCTTCAACT 


GTTAGCACsA 


aTGAAACGCT 


GATATCAGGA 


1020 


CAAGCAGCAA 


AAACATTGAT 


CAATGCACAG 


GCAAATGAAG 


GTCACGGTAC 


GTGGCAACTA 


1080 


AGGATTCCtT 


TTGATAAAGT 


GTCCTTGACT 


GTACCAGCGA 


ATACAGGGGA 


AATAAATAAA 


1140 


AATTATACAG 


CGACTTTGAC 


ATGGTCATTA 


GATGATACAC 


CATAGGAGGA 


GAGTTCTGTT 


1200 


TATACGAGTA 


TGGATTGATT 


CTTGAATCGT 


ATTCTTGCCT 


AACTTTCCTC 


TATTTTGTTA 


1260 


TTAAGGTATG 


GTCCATAGGG 


TGTGGACATT 


GGTGGGAAAT 


CAACAGAATT 


ATTAGTAGAC 


1320 


TTTGGTTTGA 


TGTTTATAAC 


TTAGTGTATG 


tAGTTGAAGC 


TGCTTATTAT 


ATCTTGATTA 


1380 


TGTTAAGCAG 


CTTAATATTT 


yTATCTTATT 


AAAAAAGGGG 


GAAGCTCAAT 


GACAAGAGTA 


1440 


CTGATAATTG 


CAAGAAGTCC 


TTTAGCGGAA 


CAAGAGTTGG 


AATGTACACT 


TCAGCGGTCA 


1500 


TTCGATGAGG 


TCTTTTGTTC 


TTCTGAATTG 


ATGAAAGAGG 


TCGAAAAATA 


CTCGTTAGTT 


1560 


ACTCAATATT 


TTTCAGTGGT 


CATTTTTAGT 


GATACAATCT 


CCACCAGTGA 


AATGGCCACA 


1620 
TACTTACCAT


ATTTTAAGAA 


ATTGGGCTTG 


TCTATTCTGC 


GAAAAGGACA 


GAAAGAACAA 


1680 


TTGAAAACAA 


CAGAATATAC 


GTATCTAATA 


GATGAAATTG 


ACGATTGGTT 


TGATGAACAA 


1740 


ACATCACCGA 


ACGTGCTGAT 


AGAGAAAATT 


GTTAAACTTG 


ATAATAAATC 


TCAAAAAAAC 


1800 


AGGATACCTA 


ACCTTCGTAT 


AATTGAATCG 


GTACAAAAAT 


CAATAGTTAA 


ACGAGATCAG 


1860 


GTTATCATTT 


ATTTTACGAA 


AAATGAACGT 


AAATTATTGT 


ACTATCTATA 


TAAATCTAAG 


1920 


GGTAAATGTG 


TTTCAAGGAG 


AGAACTGTGT 


CAATTAATGT 


GGGGAGAGGA 


AATAAGCAAT 


1980 


TCTTCCCTGA 


GTCAATTATC 


TACATTAGTA 


GCACATGTGC 


GAGAAAAATT 


aACCCATGTC 


2040 


GGGTTTGAGG 


AGTTATGGAT 


AAAAACGATT 


TGGGGGAGAG 


GCTATATTTT 


AAGTGATGAA 


2100 


TTTATCGACT 


ATTTGTCTCA 


AAATAATACT 


TtcAGTAAAG 


GTcGTGTGTc 


TAAATTaTTG 


2160 


TAGATTTGGA 


AATACATTTC 


GTTAAAAAAG 


ATTTAAAACA 


ATATAATAAG 


TGTTGTGCTA 


2220 


CATAGTTAGT 


AATGCAGCGA 


AAAATTTAAC 


CGAAGCTAGC 


GATTGCTAAC 


TTTGGTTAAA 


2280 


TTTCAAAAAT 


ACAACTGGAT 


TAGAAACTTC 


TGGATAGTCn 


ACTTTCnGGT 


AAGTATTTAT 


2340 


ATACCAAATC 


CTTTTATTTT 


AGnAATGAAT 


AGGGGATC 






2378 



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 



GATTAAAGCT 


ATTAATCTCT 


TGCTGCAGTT 


CAATAGTTTC 


TTGAAATTCT 


TCCGAGTTTT 


60 


CTCTCAAAGT 


ATCGTAATAT 


TTAGTATTGG 


CTTTCGTTnG 


GGATTTCTTT 


GATAACAAGA 


120 


TTAAGACAAA 


ACTAAATAAC 


GTTGGCAAAA 


TAACAGCTAA 


TCCCATTTTG 


ACATTACCAA 


180 


CCAAAAGCAT 


CACTGAAATA 


AAAGGGAAAA 


ACAGAGCCAT


ACCGCCGGAT 


TTAGGTATTG 


240 


CATGACTCAT 


CGCATGCTCA 


ATACCTTCAA 


CGTCAGACAT 


GATAGTTTGA 


GATAAATCTG 


300 


ATAAATTATG 


TTTTGAAAAA 


TAGGATAATG 


GTAAATTTGA 


CAAATCGTCT 


GCAATTTGCA 


360 


CTCTTAAATG 


CGCACTTTCT 


TTATAAGTAC 


TATTATATAA 


GTtTTCATAC 


TCCCTATCTA 


420 


ACAACCAATA 


CATGAAAAGC 


AAAGTAACAA 


ACGATACTGC 


AAAATAAAGC 


CAACGGGGCT 


480 


TtGCATTTTC 


TAAAACTAAT 


TCGTCTATCA 


GCATCATTAT 


AATGAATGCT 


GGTACCATAT 


540 


TAATGCAGTA 


TACAAAAAAA


CTAGAAATAG 


AAGCTTTAGT 


TAAAGCTTCT 


GCGCCCTGAT 


600 


CAGTTAAAGC 


AAATTTTTTT 


TTATAAAAAT 


TTTTCATTGT 


CAAGCCTCCA 


ATCATTTGCA 


660 


TTCTCATAAT 


TAGTGACCAA 


TTGTTTATAC 


ACTCGATGTT 


TAGCCATTAA 


TTCATTATGA 


720 


TTTCCACGTT 


CTACTATTTT


TCCTTCTGAC


AAGACAAGAA 


TTTCATCTAC 


AGATTTAATG 


780 


CTGGATAATC 


GATGAGCTAT 


CATAATAACT 


GTTTTTTCTT 


TCATCAAATA 


TTGAAAAGCC 


840 


TTTTGTAGTT 


CATACTCATT 


ATCTGGATCA 


ATAGATGCAC 


TAGCTTCATC


CATAATAACA 


900 
ATCTTTGCAT


CTTTTAAAAT 


TGCTCTGGcA ATAGCTATAC 


GTTGTTTTTC 


TCCTCCTGAC 


960 


AGATAAACTC 


CTTTACTACC 


AATTAAGGTA 


TGCTCTCTTT 


CGGGAAATTT 


TTTTAAAATC 


1020 


gATTCGCAAC 


CGGCTAATCT 


TAGCGCGTCA 


AATACTTtAC 


TTGCtGGCGC 


TgAGGATTTG 


1080 


CTATAGAAcG 


TTATCGTAAA 


TACTCTGATT 


GAATAGTTTC 


GTATCTTGAA 


AGACAAAAGC 


1140 


AATTGTGTTA 


ATAATTGTGT 


CTTTTGAATA 


GTGATTTAAA 


GGCTTACCAC 


CAATTTTAAT 


1200 


PTRTCCCTTA 


TCCACATTGT 


AGAAACCTGA 


AATTAGTTTA 


GCTATCGTAC 


TCTTTCCGCT 


1260 


TPPTRATTTT 


CCTACTAATG 


CATAAATTCT 


TTTTTGTTCT 


AAAACTAAGT 


TGAAATTTTC 


1320 


TAAAACACAC 


TCTTCTTTGT 


ATGAAAAGCT 


GACATTTTCA 


AATGCAATGT 


CACAATTTTT 


1380 


AATCTGTGTC 


TCGTTCCCAT 


ATTTTAGTTG 


GTCTTGCTTC 


ATTTCTAAAT 


AAAGGGAGTC 


1440 


TAGTGTTTCG 


ACTGCATATC 


TTGCTTCAAA 


CAAGTGCATA 


CTTACATACA 


TTATTCTCAT 


1500 


GAAAGAAACA 


AAAATAATTC 


CACTTAAAAA 


AAACAACATC 


AACAAGTCAA 


CAGCTAATAT 


1560 


CTTGGGAGTG 


GTAAGAGAAG 


TTATAAAAAA 


GGTTATAGGA 


ACTATCAAAA 


TGCAAACTAT 


1620 


GCCAAAAAAT 


AGCCATTGAA 


AAAGAACAAA 


AGGTGTTTTC 


CCACTCAAAG 


AATACTTATA 


1680 


TGCAAACTTA 


GCATAATCGT 


TAATTGCTTT 


ATTCAAGGCT 


TTAAATGATG 


TTACTTTAGC 


1740 


ACCAAATATC 


TTTATAACTT 


GAATTCCGCG 


GATATATTCT 


ACCGTTTCAG 


AGCTTAACAC 


1800 


ATCTAACGAT 


TCTTGATATT 


TTTGAATAAA 


ATTTGTATTC 


CCTGTCATTT 


TCTTTAACAA 


1860


GAATACACCA 


ATTACTAACA 


TGGCGATTAG 


AACCAATCCA 


ATTTTAAAGT 


TAATGAAAAA 


1920 


GCCTAAGGCA 


ACTGATAAAA 


CAGGTGTTAC 


TATGGCTTGA 


GCACTATCTG 


GAATCAAGTG 


1980 


AGCAATTGCC 


ATATGTsTTT 


TAGCCGCGTT 


GTCATCAATT 


GTCTTTCGAA 


TATATCCCGA 


2040 


AGAATTTAAA 


TCAAAAAAAC 


GAAAACTTGC 


TTTTGTTAAa 


CCGCTAATTC 


CCTTTTytCT 


2100 


CAAATTAGTC 


TCyAAGCGAA 


ACCCAAGGAT 


ATGAGAAAAT 


AAGCCAGAAA 


AAATATAAAA 


2160 


TAAAGCTCCT 


AAAGTTAACC 


AAGCAACAGT 


TTGAACCGCA 


TAACTCTTAG 


CTAAATATTC 


2220 


ATCTCCTACA 


ATAATTAGAG 


ACTGAAGAAA 


TTTGTAAATA 


CTGTAATATC 


CATAGACTGT 


2280 


TAATACGACA 


GAAAAATCAG 


AAAATAAAGT 


GGCTAACCAT 


CCGTAGTATT 


TCATTTTAGG 


2340 


GACATAACTA 


AATAATTTTT 


TGTATACTTT 


CATTTTTTCC 


TCCCCCGCAT 


TCAATTTCTT 


2400 


CTCTTATTCG 


TTTACAACTG 


TTAATGCTTT 


CCGTAATGGC 


GTAAATTAGA 


GATGTACAAA 


2460 


CAGATAGAAG 


GTAAAmTACG 


TTCTATTTTT 


TGTATTTTTT 


TAAGACTTTT 


CTTCCAATCA 


2520 


ATACACCAAT 


CATAGATAAA


CTTGCTACAA 


CTAACACTGA 


AATAATTCCC 


CATATACCCA 


2580 


CCGCGTAGTG 


GATCATTGCA 


TCCATTGTTG 


CAGCAGATTG 


TCCTCTATCT 


ATCCAGTCTT 


2640 


GACGAAAACG 


ATCTGTAAAG 


AACCATATGG 


GAATCCACTG 


TCCGGCAGCA 


CTGCCTAGTT 


2700 


GCATTAGAAC 


CCAAGACACA 


ACTTGAGGAA 


TTGCTCGATC 


ATATCCAAAT 


ACATATAGAA 


2760 


TAATATCAGC 


TAAGATCGCC 


ATACTAATCG 


AACTGATAAA 


CCAAGGTAAG 


TACGCCCCGC 


2820 


CCATTAACGT 


AAAAAATAAC 


AATAAAACAA 


TTGTGTAAAT 


AGTAAAAATT 


CCTTTTCTAG 


2880 
GGTAGTTATA


GCTAATATAA 


ACAAATATGA 


TTCCAGATAA 


TAATCCAAAA 


ATCCCTGGGG 


2940 


ATATGCTGTG 


ACCAAACGCC 


CCACCAATAC 


TACTAACCAT 


TATTGAAAAA 


AAGTAAATTA 


3000 


CTAGGGAAAT 


AACTGTAGCT 


AAACCAATAA 


AAATATAATC 


TTTTGTTTTC 


ATATATATTT 


3060 


TCCTCAATTC 


TATTTCGTTA 


TGTCATTAAA 


AATCTGTTCT 


ATCAAAGTTG 


AATCTTCTAC 


3120 


TCTAAGTTTT 


ATATCTTTTT 


TTATTCCACC 


ATTTTTTAAA 


TAAATAATTC 


TATTGCATAT 


3180 


TTtATAAAGA AATTCCGGAT 


CATGAGTGGC 


TATGACTACT 


GCACCCTTTT 


CTTTTAAATC 


3240 


TTTAATAAGG 


TCAGCAATCC 


TTTCCATATT 


GATATAATCC 


AACCCACTTG 


TAGGTTCATC 


3300 


AAATAGAAAT 


AAATTAGTTT 


TTGACAATGa 


AGCTATAGAA 


ACCAATAATC 


TTTGTtTTTC 


3360 


TCCACCGGAT 


AAATCAAACG 


GTAATkGATC 


TTTCATTTTA 


TAAAGTTCCa 


TTTTTTTAAA 


3420 


ATTTTTTCTT 


GCTGATCTTT 


ACTAACTATT 


TCATTCCCCA 


GGGCTAACTC 


GCTCCTAACT 


3480 


GATTCTGTGA 


AAAATTGGTA 


ATCCcATTTC 


TTGCATTACT 


AAAAACGGCG 


ATGACAAATT 


3540 


TTTnnnATTT 


TCGATTATCT 


TTCCTTTGGT 


AGGGCGAATA 


ATCTTCATGA 


GTATTCTTAG 


3600 


TAAGGTTGTT 


TTCCCCaCAC 


CGTTTTTCCC 


TACCAAACCA 


ATAACATCTC 


CaACTTTGaC 


3660 


ATCAAAATTT 


ATGTCTTTTA


GAATATTTTC 


ATAAGTAATT 


TCTTCAACTT 


TTAATACATC 


3720 


TGCACTTGCC 


TTTTCGATAT 


AGGGAACTTC 


AATCGAACTA 


TTAAAAATGT 


CAAATTTCCT 


3780 


TAATGCGTTA 


TTGTTTTCAG 


TTTTTAACAA 


CTGAATAGTT 


GTAGTTTTTA 


CAGTTCTATT 


3840 


TTTAACAAAT 


AGAACTTGAT 


CAATTAATTC 


TTTCAAATAA 


TAAAAACGAT 


GTTCAGCTAT 


3900


AACAATAGTA 


AATCCTTTAT 


TTTTTAAATC 


ACAAAGTAAG 


TCAGCAAGTT 


TCTTAGTCGC 


3960 


TTGGAAGTCC 


AAATTTGAAG 


ACGGTTCATC 


CAATATAATT 


AATTTAGAGT 


TTAGCGATAA 


4020 


ACTACATCCA 


ATAGAAATTT 


TCTGCCGTTC 


ACCACTTGAT 


AGCGAAAAAA 


TGTTTTTATC 


4080 


TAATATATTT 


GAAAGATCTA 


ATAATTCAGC 


TAATTCATTG 


ATTCTTTTAT 


CCATATGCTC 


4140 


TTTTGTAAAT 


CCAAAGTTTT 


CCATCGAAAA 


TACTAACTCA 


CTCGTTGTAT 


TTGTTGTAAA 


4200 


GAATTGTCCT 


CTAGGATCTT 


GACTTACGTA 


GCCAATATTT 


TTTGATATAC 


TTTCTTGTGT 


4260 


TAAGGATTGT 


AGCGAGTTAT 


TTTCAAATAA 


AATTTCCCCA 


ACCAATTTTC 


CGTCATAAAG 


4320 


TCTTGGAATA 


AGTCCATTCA 


TTAACTTTAA 


TAGTGTGGAT 


TTCCCACTCC 


CAGAAGAACC 


4380 


AGTAATCAAT 


GTAAGACTAC 


CTTTTTTTAT 


TGAAAAATTA 


AGGTCAACAA 


GCGCTTGTTC 


4440 


CTTAACATCC 


TTATATGAAA 


AAGAAACATT 


GTTAAATTCA 






4480 


(2) INFORMATION FOR SEQ ID NO: 321: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 



WO 98/50555 

TACAGATTAC TTGAAAATCT TTACGTTCCA GTTTTATTCT nTCTAAAAAA CGTACTCACT 60

TTTTTTCTTA GACTTCTTTT TTTGATTTTC CTCTTTTAGG TCTTGTCCTT TTTGTCTTCA 120

AAATATTTAA ATAAATCCCT TTCAAAAGTA TTTATAGGTT TTGGACAATA AGGAATATCT 180

TTTTTCTTCA TATAAACACC TCATTAAACT ATTTTATATC ATCACATTAT CAAATTAACA 240

AATGTATTAA AAGAAAGTTT TTTATTAAAA CTTTCTTGGC CTTTTGGGAG ATTTCAGTGC 300

GATTTCTTTT TATTCCAATA CTTTTTTGTT GGTCACTTTC GTATACCCAT TCTAAGCGAT 360

AATCGGCGAC AATGTTTTTA CAACGAATGG TAGGACAAAA CAATGGGAGA AAAAACTTTT 420

TCTACAGCAA TCCCTTGTTT TCACCTACCA CATGTTGTGA ACCCAAAGCA CGCCGACGAA 480

CGCTTAGTTA TACGAAAATG ACCTACCAAA AACAAAAAAA GTGAGGTGAA TAGAAACAAA 540

AAGAAATTGA AATCTCCCAA AAGGTCGTCC ACCTCGGGGG CAACTTTCTG CCGAAAATAG 600

TAGTTGGGCC TATGAACAAA CTAACGTTAC TTTTTGGTAA ATTATTAAAT TAACCGAAAA 660

AACAGCATGG TTTCTCCCCA TCAAAAAGCA AACTGTAAAA ACATATACTT CTATGAGTGG 720

AGAAACTATG CCAATGCTAA ATCAATTTAT CgrTTATCCm CTTgcACGaA CCAAAGACAG 780

CATTGTTAGA AAACAAATTA ACCCgCCcTT ATATACCAAT AAATTATACG GTAATCGTAG 840

TTGGGATACA TTTTACTACT ATGATAATCA ACTGGTTTAT GAAaGCAGTT aTTACTAAGA 900

GGGAaTTTTC TTCTTAGTTA GCGTTAAAAA TAACAAAAAT AAAAGGAACA AGAACAATGA 960

AAACAATAAC TCTCTATGAA ATCTTAGGTG AAGAAGGGTC TGACAAATAT ATTTCTGTAT 1020

TCAAATCTTT AATTGCTTCC AATTTTACAG AGAATGAAGA CATTTTATCA AGGTGTCAGC 1080

TATTTATACA CAATAATGTC AGTGCTGAAG AAAATTTATA AAGATGAACC AGAAAAACTG 1140

AAAGAGACGG CAAGCGCAAA CTTAGAATTT TATAACACGA ATCCACAAAT GTTGACATTT 1200

ATTACCAGCA TGCAGTTAGC GATGTATGAT AATGATCAAA GTGTCTCAGA TACACGCAGC 1260

ATTAAGATGG CACTTATGGG ACTCCTATCA GGAATCGGAA ACTCGATTGC TCGATTTGGG 1320

ATTGCCTCCT TATTTTCAAC AATATTTGCA GGATTAGCGA TGAATGGTCT GGGATTTGCT 1380

CTGATGTTTT TTTGGTTATC GATGTTAATC TCAATGCTAG TCATCAAATT ATTAATGGGA 1440

GGGATATTTA GGGTTTAAAC TTGGAACGAG TGTGATCGAA ACCTTAAGTG ATAAAATTGG 1500

TAAAATTTCT TCAGCAGCAA ATATTGGCGG AGTGGCTGTT ATTGCGGCAT TAAATACTTC 1560

TTTTGTTAAG GCAAATATCG CCATTAAGTA TTCACATAAT GTAGAATCTG GTGAAAAACA 1620

AGTAATTGCT ATTCAAGATA TTTTAGATAA GATTACGCCC AAAATGCTTC CAGTCATTTT 1680

AACTATTTTA GTTTTTTATT TGATCAAAAA AAGAAAATGG AACACGTATC AATTGTTATT 1740

TTTTTTATTC GTGATTGGGA TTCTAGCCTC TGACTTAGGG ATTTTAGCAT AGTAAGTGAG 1800

TGTAGTAAGG GACAGGGTTT TCGATTTCGA GTAAACCTTG TTTCCTTATT TATAAGTTTT 1860

CTGAGTGATG GGCATAATAT TTTTTCAGTT AACGGAAATA AAATTTGAAA GCGAGAACAC 1920

GATGAATCTT CAAAAAATCG AAAACTACCA ACTAAAGTTT TATCAGAAAG ATTTACTTTC 1980

AGGATATTTG GAAAAACATT CAAAACTTTT GGAACCACTT TTTGAACGTA CTTATTTTCT 2040

ATTAAAAGAT CAGATTATTT ACAATGATGC GAkGGATATG GAAGCCTGTT CTATTCCGTA 2100 

TTCTCTAAAA GAAAATACAC GGAATCGATA ACATGGAGAT GACCCAGAAT gGCTCTTtAT 2160 

GTTGAATTGA CAAAGTTTtC TTCTAGATTT ATCACAGGCA TACGCGTTAA TAAAAGAAAA 2220 

ATGCTATTTA CAAAAATGGT GCAGTCTGCT TCTTGATTTT ATTCAAGAGG AAGGCGAACC 2280 

AAATTCTACT AATAGGAATG TTTGGCGACA CCTTGATGTT GGAATTCGAG TGATGAACTG 2340

GATGAAGAGT TTGACTTATA TTTCTATTGC TGATTATAAA CAATTGGGAA TTGATAAAGT 2400

GTTGCGAAAC GCTTTGCTGG TTAATCTCGA GTATTTGGAG AGGAGCTATA TCGATAAGTA 2460

CAGACTTAGT AATTGGGGAG TATTGGTAAC GGGTGGAATG GCTGCTATGG ATCTATTTCT 2520 

TCCAGAACTA GTGAACAGAG TAAACTAATA TGGGACCGGT TAACTGACTT ACAATTTTAT 2580 

TCAGATGGCA TTCATTGGGA ACAGAGCGCA TTGTATCAAT ATGAAGTTTT GATGACGTAT 2640 

GTATATCTAT TACAGATTTC TGAATATCTT GAAATATCAT TGCCATTGGA TCTTCGAACG 2700 

AAATTGAAGA TACCTATCTT ATCTACCTAC TATATAACGG ATAATCAAGA TGTTCTGAAT 2760

CCTATCAACG ATAGCGATCA TGTCAATTTT CGATATGTAT ATGATAGCTA TCGTAATATG 2820 

AAGAAAGAAC TTGGAAAACA CTGCAGCCAG AGAAACTTTT TTCGTGGTGA ATCAAGTGGT 2880 

TTAGTGTTTT ATAAAACAGA GGATATTTAT TTTACGCTTT TTAATGGATT GCATGGAAGC 2940

TCACATGGTC ATGCTTCTAC AGGGAGTTTT ACGCTACAAT TACAGGGTGA TGATTTGATA 3000 

TCTGATAGTG GTTGTTACAG TTATGTCAAC AAAGCAGAGT GGCTTCAACC GAAGGAGTGT 3060 

GATTCGCATA ATACAATGTT TATTGCAGAG AATTCTCATA CTTTAGTACT GATACATGGG 3120 

GCTACGGGAA ATTACCAACC CCATTATTTC AGCGAATAAA AGAAATTCCT GTCGGTTTTT 3180 

TTGCGGAATG CGGCTGGATA GATAAAGATG ATCAGAACTT AATGATTTTT AAGCGTAGTT 3240 

TTATCTACTT GAAGGCCATT AATTCAGTAA TTATTATTGA TAGTTTTATA GGTCAAAAAG 3300 

AGACTGAAAT TACGAGTACT TATAATTTAG CACCTACGCT AAATTGTCAA AAAAGCAGAT 3360 

CAATTCAATT TAACTACAAA TTCACAAAAA TATAAACTCC TTATAGTATC AGGTCAAACG 3420

GATCAAACTA CTGTTGAAAG ATCAGAGATT TATAATCAGC TGATCGAACT ATATACGCTT 3480

GACCAATAAA TTTCATTATA AAACGGGGAA AGAAGTTCAG GATACCGTGA TTTCCCCTTT 3540

GGAAGATATC CAGATTATGC CAGTAAAAGT GAACCAAGCG GGAGAAGACG AGCCATTCTG 3600 

TCAGGCCAAA GGATTTTGCG TGATCACTGG AGAGAAAAAA TTTTATATTT TTGTTATATG 3660 

TGGAGATATT GTTAAAGGGA ATAAATTATT AGTGAGTATG GTCAATTTTT TATGGCCGAC 3720

TTGTTTTGAT TGATCAAAAT GAAGCGATTA TTCGTATTAA ATAAAGGAGG AGAAGAATGA 3780 

ACTTTTTAAT AACTGGACAT AGTGAATTTG CTATTGGAAT GCAAAGCGCC TTAACGATGA 3840

TTGCAGGAGA AATAGAAGAT GTGATGTTTA TCCCTTTTTT AGCCAATGAA ACGACAGAAG 3900 

AATATCGTCA AAAAATTGAA TCTTCAATTG AGATGTATGA ACATGTCGTT TGTTTTACCG 3960
ATTTATTAGG


CGGTACCTCA 


TTCAAAAAAT 


GTATTGAATT 


CTCTATGACA 


AAAAAACAAG 


4020 


TTTTTGTTAT 


AAGTGGTACA 


AACATAGCGA 


TGTTATTGGA 


AGGATTGATG 


ATGCGGAATT 


4080


CTGGTATTGA 


GCCGTTTGAA 


CTTGCCAAAT 


CAGTGGTAAA 


AGTTGGAAAA 


AGTAATATTG 


4140 


TGCTTTTTGA 


AAAAAATAAT 


GAAGCTGAAG 


AACTATCAGA 


AGAGGTAATA 


TAAAATAATA 


4200 


GTTGTATAGC 


GATTATCTAA 


AGTGACTGGT 


CAAACGTTTC 


TAATCGTATA 


TCAGTCATTC 


4260 


AGTGAGTATA


ATTTAAACCA 


TAACCTACCA 


ATTGCCAAGG 


AAATATAGCG 


TATTTTACTC 


4320 


TTGTCAATTG 


ACAGTTTATA 


GTTTAGAGAT 


GGCTATGCCA 


AACAACTAAT 


ATATGAAAAA 


4380 


GGAAACTAAA 


CATCATTCCT 


CTATTGGAGC 


AATTTCTTCT 


TTGATGGCTT 


ATAAAATGTA 


4440 


CTAATACTTC 


ATTTGAGGCA 


GTAGAAAGTT 


GAGAAATAGT 


TGTTTCTTGT 


GCTGTTTTTG 


4500 


TCGTTATTAA 


GTTTGCCAAA 


AAGCAACTTT 


CAACCAACAT 


TTTTGTTTCT 


TATCATATCA


4560 


AGAACCTGTT 


TTTTATTTTT 


CTATAAAAAA 


TGTCGAAATT 


ATTAAGTTAT 


CGACATGATT 


4620 


TTGTTATATT 


TAAGATAAAT 


TAGTCAAAAT 


TTTTGCCAAT 


ATTTTTTCCA 


AATTTTTCTT 


4680 


TATTTTTTCT 


TTCCTTACCT 


TTTTCAATTA 


ATATCATTTG 


GTGTTAATAT 


ATCTCTGATA 


4740 


ATACAAAAAA 


ACATTGACCT 


ATCATAAGAT 


AAAGAGTGAT 


AGTTTAATAC 


TAAAAGAGGT 


4800 


ATATTAATTT 


TATTTAAAAA 


TTGACATTAA 


TGACTCTTAA 


TCAGAGGGTC 


TAGGGTTCGA 


4860 


ATCCCTATGC 


CCCCATTGGG 


TGCCAAACCC 


ACGAGAACGG 


TATTCTGTCG 


GCGTCTTAGG 


4920 


ACGTTTCGAA 


GaCGGAGTAC 


GCGTTCTCTT 


TTTTTACATA 


AAATGTCGTG 


TGTCGTGAGA 


4980 


GATACACTAG


TTAAATCGGT 


TAAAGAAtnT 


ACGCTTAGAA 


GAGCGAGAAA 


GTACGTTATA 


5040 


TAAAAAAAGT 


TAAAAAAATT 


TCAGGCTAAA 


AAAGCGAATT 


TAACAAAAAG 


CGCACTTTTT 


5100 


GGTTCCTATG 


AAACGAAAAA 


GTACGTACTA 


TACAGCATGT 


ACGGCACGGT 


ATATGATGAT 


5160 


ATCAGAAAAC


GTTAAAGGAG 


CGAATCAAAT 


GATTACATCA 


CTAGCAGATA 


CAGTAACATT 


5220


AAACAACGGC 


ACTAAAATTC 


CAGGAATGGG 


TCTCGGGGTA 


TTCCAAATAC 


CAGATGAAGA 


5280 


GACAGCAAAA 


GTGGTCGAAG 


AAGGGATTAT 


CAATGGTTAT 


CGTTTAATCG 


ATACAGCGCA 


5340 


AATCTATGGC 


AATGAATCTG 


GCACAGGAGC 


GGGAATTAAA 


GCAGGCTTAG 


CGGCAACTGG 


5400 


TTTGAACCGT 


GAGGACTTAT 


TTGTTACCTC 


GAAAGTTTGG 


AACGCGCATA 


TTTCCTATGA 


5460 


TGAAACAATT 


CAAGCTTTCA 


ATGATAGTTT 


GGAACGTTTA 


GGTTTAGATT 


ATTTAGATTT 


5520 


GTATTTAATT 


CATTGGCCAG 


GAAATAATTC 


TTACAAAGAA 


TCATGGCAAG 


CTTTAGAAAC 


5580 


TTTATATGCT 


GAAGGAAAAG 


TGAAAGCAAT 


TGGTGTCAGC 


AACTTCCAAG 


TTCATCATTT 


5640 


AGAAGATTTA 


CTTTCTTATG 


CAAAAGTTGT 


TCCGGTAATT 


AACCAAGTGG 


AACTACATCC 


5700 


AAAATTGGAT 


CAAAAAGAAG 


TGCGTGATTT 


CTGTGAAAAA 


CATGATATCA


AAGTCCAAGC 


5760 


TTGGTCACCA 


TTGATGCAAG 


GCCAATTATT 


ATCTAATGAA 


ACCATTTTGG 


CGATTGCTGA 


5820 


GAACCATAAT 


AAATCAGCAG 


CCCAAGTGAT 


TTTACGTTGG 


GATATTCAAC 


AAGATATCTT 


5880 


ATTAGCGGTG 


AAATCTGTCC 


ACAAAGAACG 


GATGATTAGC 


AATGCAGCGG 


TCTTTGATTT 


5940 
TGAACTAAGT GCAGAAGAAA TGGCACAAAT XAACCAATTG AATGAATCGT TACGTGTGGG 6000

ACCAGATCCA GACACGTTTG ATTTCTAAAA ATTAACTAGG AGGAGATACA ATGAGTGAAG 6060 

CAGTTAAAGG TTTTTCAGTA GTCAACCACG TGGGGATTAC AGTCAGTAAT TTGGATGAAT 6120 

CAGTGAAGTT TTATGAAGCA TTAACAGGAA CAAAAATCGC AAATCGTGAT GAAATTGGTG 6180 

GAAAACGGAT GGCCCAAACA CAAGGGTTAG ATGATACATT AATCAAGTAT GCCAATGTGC 6240

ATCTAGAAAA CATCAATATT GATTTATTAG AATATGTTAA ACCAAAATCA GAAAAAGCTT 6300

CTTATTCCAA CGAACAAATT AGTGCAATGC ATATGTGTTT CGAAGTGGAA GATATTGATG 6360 

CAGCTGTTGC ACGTTTAAGA GCGATTGGCG TAGAACCTGA TGGAGAGCCG ATTACATTCC 6420

AAAAAGAAGA TGGCTTGAAG AGCGGTTTTG GTACAGGGGT TGCTTACTTT ACCGATCCAG 6480

ATGGTACAAA TTTAGAATTG ATTGCTCCTA AAGGTCCATT TACACGAAAT TAAGTTAAAA 6540 

GACTTTCAAA CmAACATCAA AAAGACCCCC TCATTCTAGT ATCACTAACT AGAATGGGGG 6600 

CyCTTtTATT TTTTgAGACA GAAGTCGCTA AAGAAAAGAT ATTCCAAATC CGTAATTGAT 6660 

GGTATACTAA AACGTAACAA ATAAAGGAGG nnAGACAATT GATGCTTAGC GTCTTAACAG 6720 
(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 



GTAAGAGGGG 


AGTATGAAAA 


TATGGGAAAA 


GAATATCATG 


TTGCCGTGGT 


AGGAGCAACA 


60 


GGTGCTGTCG 


GGACAAGAAT 


GATTGAATTG 


TTAGCAGAAA


CAAGTTTGCC 


AATTGCCAGT 


120 


GTGAGGTTAT 


TGGCGTCTAA 


GCGTTCTGCT 


GGGACAAGAG 


ATTCAGTTTA 


AAGGGCAACC 


180 


TCTTATCGTG 


GAAGAATTAG 


TCCCGGAATC 


ATTTGAAGGA 


ATCGATTTAG 


CATTATTTAG 


240 


TGCTGGTGGG 


GAGTATTTCC 


GAAAAGTTTG 


CACCAGAAGC 


mGTTaAGCGC 


GGACTGTCGT 


300 


TGTTGACAAC 


ACTAGCCATT


TTAGAATGGC 


AAAAGATGTT 


CCTTTAGTTG 


TTCCGGAAGT 


360


GAACCCGGAA 


GCCTTGAAAA 


ATCATCAGGG 


AATCATCGCA 


AATCCCAATT 


GTTCTACGAT 


420 


TCAAATGATG 


GTGGCCTTAG 


AACCAATTCG 


GCAAGCGTTT 


GGCTTAGAAC 


GAATCATCGT 


480 


TTCAACGTAT 


CAAGCTGTGA 


GTGGCGCAGG 


TATCCGTGCA 


ATTAATGAGC 


TAAAAGAACA 


540 


AGCCGAAGAT 


TTTTTAGAGG


GAAAACCTGT 


CAGCGAATGG 


TCAGCTGAAA 


TTTTACCAGC 


600 


TGGCAGTGAC 


GAAAAACATT 


ATCCAATTGC 


TTTTAATGCG 


TTGCCTCAAA 


TTGATGTTTT 


660 


TTCAGAAGCA 


GGGTATACCT 


ACGAAGAATG 


GAAAATGATG 


AATGAAACAA 


AAAAAATCAT 


720 


GGGAGACAaC 


AAGCTAAAAG 


TCGTtGCCAC 


CTGTGTCCGT 


ATTCCGGTAA 


TTTCAGGACA 


780 


TTCTGAATCT 


GTTTATATAG 


AAGTGAAACA 


AGCAGATGCA 


TCGGTCTCAG 


CCATCCAAGA 


840 


ATTAATGGCG 


CAAgcTCCAG 


GTGTAATTTT 


GCAAGATGAG 


CCAAGTGAAC 


AACTATATCC 


900 
ACAAGCGCTA


AATAGCGTCA 


ACCAGAAAGA 


AACATTTGTC 


GGCCGTATTC 


GTCAAGACGT 


960 


TGACGTGAAG 


AACGGTTTCC 


ATTTATGGAT 


TGTGTCAGAC 


AACCTATTAA 


AAGGAGCTGC 


1020 


TTGGAACTCG 


GTCCAAATTG 


CTGAAACGCT 


ACACGAAATG 


AATTTGCTTC 


GTGTACCTGA 


1080 


AAACGACTAG 


AAGGTTTGTA 


TTGTATACTC 


GGGAGGGAAT 


AGGATGGATT 


TAACGAATGC 


1140 


AACAATTATT 


ACGGCAATGG 


TTACGCCGTT 


TCAAGAATCG 


GGAGAAATAG 


ATTTTGATAA 


1200 


ACTACCGCAA 


TTGGTAGACT 


ATTTATTGGC 


CAATCATACA 


GAAGGCGTTA 


TTTTAGCAGG 


1260 


AACAACTGGT 


GAATCACCCA 


CGTTAACTCA 


TGAAGAAGAA 


TTACAATTAT 


TCCAGCGAAT 


1320 


TATTGaACTG 


ATCGATGGAC 


GTATTCCGAT 


TATTTGTGGT 


GTAGGTACGA 


ATGATACACG 


1380 


TGACTCAGTC 


GCTTTTGTGa 


AAGAGTTAGC 


GACAATCGCT 


GGtATCGATG


CTGTTTTGGc 


1440 


GGTTGTTCCT 


TATTATAATA 


AACCAAATCA 


AGAAGGTATG 


TATCAACATT 


TTAAAACCaT 


1500 


CGCAGAAGCA 


AGTGAACTGC 


CAATTATTTT 


GTACAATGTT 


CCTGGTAGAA 


CGGCTGCATG 


1560 


TCTAGAAGTG 


GAAACTACTT 


TGCGcTTAGC 


ACAATTrGaA 


AAAATTGTGG 


cAATTaAAGA 


1620 


ATGTGCaGGG 


TTAGATGCAA 


TTACTGAGTT 


GATTGAAAGA 


GCGCCAAAAG 


ACTTTCTAGT 


1680 


GTATACTGGA 


GAAGACGGGT 


TGGCTTTTGC 


CACCAAGGCA 


CTTGGCGGAC 


AAGGGGTTAT 


1740 


TTCGGTAGCC 


AGTCATGTCT 


TCGGTTCGTC 


AATGTATGAG 


ATGTATCAAG 


CGCTGGAGCA 


1800 


GGGGAACTTG 


CCGGAAGCGG 


CTAAaATTCA 


ACGACAGTTA 


TTACCAAAAA 


TGAACGCATT 


1860 


GTTTTCAGTG 


CCTTCGCCGG 


CACCCGTGAA 


AGCGGCCTTA 


AATCATTTAG 


GTATTCCTGT 


1920 


TGGAAATTTA 


CGTTTACCTC 


TAGTTGCTTG 


TACACCAGAA 


GAAGAGCAAA 


GAATTATTCG 


1980 


CACGTTAGAG 


ATTTAAAGAA 


CGTGTAATCA 


TAGAGAGGTG 


AAAAGAGTGA 


GTACAATAAA 


2040 


AATCGTTCCA 


TTAGGCGGCG 


TTCGTGAAAA 


TGGAAAAAAT 


ATGTACGTTG 


CTGAAGTGGA 


2100 


AGATGAAATT 


TTTGTACTGG 


ATTGTGGATT 


ACAATATCCA 


GAGAACGAAC 


TTTTAGGAAT 


2160 


TGACGTCGTC 


ATTCCGGATT 


TTACGTATTT 


AGAAGAAAAT 


AGTGAACGAA 


TTGCAGGGGT 


2220 


CTTTTTAACA 


CATGGCCATG 


CTGATGCAAT 


TGGGGCCTTA 


CCGTATTTGC 


TTTCAAAAAT 


2280 


TCAAGTGCCT 


GTTTTTGGGA 


CAGAATTAAC 


GATTGAACTA 


GCAAAACTAA 


GTGTCAACAG 


2340 


TCATGCGGAA 


ACGAAGAAAT 


TTAAAGATTT 


CCATGTGATT 


GACCCGCATA 


CAGAAATTGA 


2400 


TTTTGGTCAA 


GCAACGGTGA 


GCTTCTTTAA 


AACAACGCAC 


ACCATTCCAG 


ATTCGATTGG 


2460 


CGTTAgCATC 


AAAACAAAAG 


AAGGCAACAT 


TGTTTATACT 


GGTGATTTTA 


AATTTGATCA 


2520 




b/ioH 1 vj 1 I C 




1 ouLubl 1 1 vj 


vaU 1 uBnHl 1 o 


tzn, & jx zx a J\ (1 ft 


2580 


CGTCTTGGCG 


CTACTTAGTG 


ATTCATCAAA 


TGCTGAAAAC 


CCAGCCCAAG 


TTGCATCGGA 


2640 


AGCCCAAATC 


GCTGATGAAG 


TGTTCGACAC 


GATTCGCTAT 


TGGGAAGGAC 


GAATCATCGT 


2700 


GGCATGTGTG 


GCAAGTAATT 


TGCAACGGGT 


CCAACAAGTA 


TTGAATGCAG 


CGGATCGTTC 


2760 


TGGTCGTAAA 


GTCGTTTTGA 


CAGGCCAAGA 


TTTTGAACGG 


ATTATTCGGA 


CGGCGATGAA 


2820 


ATTGGAAAAA 


TTACAATTGC 


CAAGTGAAGA 


TCTATTAGTG 


AAACCAAAAG 


AAATGAAAAA 


2880 
ATATGCACCC


GAACAGTTAT 


TAATTTTAGA 


AACAGGCCGG 


ATGGGAGAAC 


CTATTAAGTC 


2940 


ACTTCAAAAA 


ATGGCTAATA 


ATACACATGG 


TGTCGTTCGA 


ATTGAAGAAG 


GCGATTTAGT 


3000 


TTATATTACG 


ACAACGCCGA 


CAACGGCCAT 


GGAAACAACG 


GTTGCCAAAA 


CAGAGGATAT 


3060 


TGTCTATCGG 


GCCGGTGCTA 


CGGTGAAGCA 


AATTTCCGAC 


AATTTACGTG 


TGTCTGGTCA 


3120 


PGCTAATCCG 


AATGACTTGC 


AATTGATGCT 


GAACTTAATG 


AAACCAAAAT 


ATTTCATTCC 


3180 


Tf^TPPAAGGT 


GAATACCGCC 


AGTTAGCTGC 


GCATGCGGAc 


TTAGCACACG 


AAATTGGCAT 


3240 


^PfTTATAAA 


GATATCTTTA


TTACAGGTCG 


TGGGGACATT 


CTAGAGTATA 


CAAAAGGTAG 


3300 


AATCJTrGGTA 


GCTGGCAGCA 


CAACGGCCGA 


AAATATTATG 


ATTGACGGGA 


TTGGTGTCGG 


3360 


CGATATTGGT 


AATATCGTTT 



TACGGGATCG 


TCGTATTCTT 


TCCGAAGATG 


GTATTTTTGT 


3420 


GGPTGTTGTG 


ACAATTAATC 


GTCGAGAAAA 


ACGAATTGTT 


TCACCAGCTA 


AAATTACTTC 


3480 


T A A CZ TTT T 


HTTTAPGTTA 


AAACAAGTAA 


AGACCTAATG 


AAAGAAAGTA 


GCAATATTGT 


3540 


TAPAQAAATT 


GTTGAAAAGC 


ATTTAGAAAG 


TGACGATTTC 


GAATGGAGCA 


AACTGAAACA 


3600 


Af;AAATrrr^r 


GAAAACTTAA 


GCCGTTATTT 


ATTTGAACAA 


ACCAAACGTC 


GTCCTGTGAT 


3660 


PTTAPPAnTT 



ATPATGGAAG 


CGACGCAACG 


CAAGCGTCCC 


AAAAATAACG 


CATAGTTCGT 


3720 


PPTGATAAAA 


GCAGGTGATT 


TTAGGAGAAA 


TTCTTCTAAA 


GTCATCTGTT 


CTTTTTTGTA 


3780 




ATAAATATGG


TACTATTAAT 


CAGAGCGCTG 


AATAAAGGAG 


GCGATTTTAG 


3840 


TGGAAGAGAT 


TGTGTTGTTC 


CATACCAATG 


ATCTTCATTC 


TCACTTTGAA 


AATTGGCCCA 


3900 


AAATACGGCG 


TCTGGTTAAA 


GCCAAACGTT 


CGCTGTACCA 


AAAAGAGGGC 


AAAACAGTCG 


3960 


TAACCATTGA 


TTTAGGTGAC 


TTTTCAGATC 


GGTGTCATCC 


TTTGACCGAA 


GCAACAGATG 


4020 


GTCGAGCAAA 


TGTAGCTATA 


ATGAATACAT 


TAGCGTATGA 


CCTGGTTACA 


ATTGGAAATA 


4080 


ATGAAGGAAT 


CGGCAATTCA 


AAGAAACAAT 


TAGAGCATTT 


ATATGATCAG 


GCAACATTTG 


4140 


AAGTGGTCTT 


AGCTAATTTA 


GAAGAACCAA 


AAACCCAAAC 


GTTACCAGAT 


TTTTGTCAAG 


4200 


CATATAAAAT 


TATGACAACT 


AAAGAAGGCA 


CAAAGTTAGG 


GTTTATTGGC 


TTAACCGCAC 


4260 


CATTTCCTTT 


AACGTATAAT 


CCCAATGGCT 


GGACAATCAA 


ACAAGTGGAA 


GCAGTCCTTC 


4320 


CTCAACTGAT 


TACAGAAGTT 


GCGCCTCAAT 


GTGATGTTCT 


CATTTTGCTT 


AGTCACTTGG 


4380 


GGATAGATAC 


TGATTTTATG 


ATTGCGGCGA 


ATTATCCTGA 


AATTCAAGTG 


ATTTTAGGTT 


4440


CCCATACACA 


TCATTTATTT 


AAAGATGGTG 


AAAAAATTAA 


TCATGTGCAA 


CTAGCAGCTG 


4500 


CTGGCAAATA 


TGGACAGTAT 


ATTGGTGAAG 


TGCATCTGTT 


TGTGGATGCC 


GACACGAAGC 


4560


AAGTCACAAG 


TTATGCGAAA 


ACCATTGAAA 


CAGCAAGTTT 


AGAAGAGCAA 


GCAACGGATG 


4620 


CCAAGGAAAT 


TGCGGGTTAC 


TTAACGGAAG 


GTCATCGACT 


ATTGCAAGCG 


CAGcAAATTG 


4680 


CACAAATCCC 


TGAAACCTTG 


TCAACGGACT 


TaCGGCAACC 


GCATGCCTTT 


ATAACTGTTG 


4740


CCTTGAAGGC 


GTTAAAAGAA 


GCGGGGCAGA 


CCGAAGCGGC 


TGTGTTGAAC 


AACGGGTTAT 


4800 


TTTTAGCAGA 


TTTACCAGAA 


GGAATAATCA 


ATGCAGACCA 


ATTACATGAG 


GCCTTGCCTC 


4860 
ATCCAATGCA TTTAATCAAG GTTACCTTGA AAGGGTCAGA CATGAGCCGT TTGATTCGTG 4920

AAATGGAAAA AAGTCGTCAA TTTCTGCGGA AATTTCCGAT TCGCGGGATG GGcTTCCGCG 4980

GGAAAATTTT TGGCGAATTA TGTTATGATG GGATTCGTTT TGAGCCAAAT AGCCAGACCG 5040

TTTTTTGGCA AGGGAAACCT ATTCAGCCA 5069 
(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6279 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 



ATTTTAGGAT 


TATGGTTAAA 


TCAAACAGAA 


TCTAAAAATC 


GCTGGATGCA 


AGTATTTGAT 


60 


GAACTGAAAG 


CACGTGGTGT 


CGAAGATGTC 


TTTTTCATTT 


CGATGGACGG 


TGTTTCTGGC 


120 


CTCGAAGAAG 


GCGCAAAAGC 


GATCTTTCCA 


TCGGTAATTG 


TTCAGCGCTG 


TATTGTTCAT 


180 


CTTGTTCGAA 


ATGCATTGCG 


CTATATTCCA 


AGTAAGGACT 


ATAAGGAAGT 


CTGCCGAGAT 


240 


ATGAAAAAGT 


TCTACGGTGC 


TTCTTCTCTA 


AACGCTGCAC 


ATGCTGCTTT 


TGGCAGTTTT 


300 


CAAAATCGGT 


GGTCTCATTA 


TTCTGGTGCT 


GTAGATGTCT 


GGAAACGGAA 


CTTTGCACAT 


360 


GTTGAACAAC 


TATTTGATTA 


TGGTAGTGCC 


ATTCGAAAAA 


TTATGTACAC 


TACCAATGCG 


420 


GTTGAAAGTG 


TCCACTCCAG 


CTTCAGAAAA 


GTCACAAAAA 


AAGGAGCGTT 


CTCTAATGAG 


480 


AATGCACTCC 


TAAAACTACT 


TTATTTACGC 


ACGAAAGAAT 


TACACGCGAA 


GTGGTCAGGT 


540 


GGTCGAATTC 


AAAACTGGGC 


TATGGTGCTT 


AAGCAGTTAA 


TGATCAACGA 


GACCTTCTCC 


600 


TCTAGAATGA 


AAAGTATGCG 


ATTTACCTTC 


CGTAACAACG 


GTAGGTTTAT 


TCAACTATGT 


660 


GTCCGATCAA 


AATTTATCTG 


ATTTCAAAAT 


AGCCGAGAGA 


ATTATGGTGG 


TTAAAAACTC 


720 


TTAACTATTT 


AACTTCCACA 


GTTTAGTTGA 


CAAACCCCAA 


TTAGTTCTCT 


ATTTTCACAT 


780 


ACTATTTTCA 


ATAGTGTGTT 


TTTTATTTAT 


ATTCTTAGTC 


AAGCTAACTG 


TATGAGTATT 


840 


TAGTAAAGTT 


GTTTCTTTCa 


AAAATTCAGA 


AGTCTTATCm 


CCTTGTATAC 


CTAACCAAAT 


900 


CAGTGTCCAA 


GGTTCCTTTT 


TATCTGTATA 


ATACTTCACT 


TTTTGACCTT 


GAAAAAAAAG 


960 


AAAATATCGT 


GTTCTATAAG 


TTCAAAATGT 


TGAGCTTCTA 


CTTCTAAGAA 


TCCCTTACCA 


1020 


GATAAAACAT 


AATGAAATAT 


CCATTTTTGT 


TGAACTATAT 


AATTACGAAT 


AGAAGAATTG 


1080 


GGATGATAAG 


ATTCATATCC 


GCATTGGGAA 


AGGGAAATTG 


GAGCAGAAAC 


TGACTTATTA 


1140 


TGATACCAAA 


AATCCTCCAT 


TATTATCCTC 


CTTTATAGAA 


CTAACTATGA 


AGAAAATACC 


1200 


ATGTTTTAAG 


TATAGATTAT 


CATGTTCAAA 


TGGTCAAAAA 


AATCTATACT 


ATAATTAGAT 


1260 


AGCGCTTTCT 


AACTATTTTT 


TGAAGGAGTC 


AATCATAATG 


AGTGaGAAAA 


TAATGAATAC 


1320 


AAyCCAGAAT 


AAAGTTTTAC 


CTATAGCTAC 


TAAGATTGGG 


AATCAACGAT 


TTTTAGTAGC 


1380 
GTTAAGAGAT


TCATTTATGG 


GTACGATGCC TQTAATAATG ACaGGtTCGG TTGCAATATT 


1440 


ATTAAATGCT 


TTTTTAGTAG 


ATTTTPPTAT 


GCAATTTGGC 


TATGAGAAGA 


TTACTGATTA 


1500 


TTTTCaATGG 


TTAGTAGATA


TAAATAATTT


GaTATCaAAA 


GGAAGCATTT 


CTATTGTTTC 



1560 


CTTACTGTTT 


ATTTATTGTT 


TAPPAPTAAA 


TATTGCTAAA 


ATATATAAAA 


CAGATACTTT 


1620 


GTCTTCAGGA 


CTAGTCGCCC 


TPTPTTPATT 


TATTATTTCT 


ATAAGTAATT 


CTATGACTTC 


1680 


AACTTATAAT 


TTATCGAATA 


ATAATAATAT 


TGACTTAACA 


ACACTATTTA 


CTGATGTTGA 


1740 


GGGAATAACA 


GTTACAGGTA 


ALAu 111 nrin 


TPTPAPTATA 


TPAGGATTGT 


TGCPTGGTAC 



1800 


ACAAATTAAT 


TCAAATGGTT 


Al X J iA^LuL 


AATAATPATA


PPTTTTTTAT


PTTPAATTAT


1860 


TTTTTGTAAA 


CTCATGAATA 




HATTAAATTA 


PPTPATAPAP 



TTPPTPPAGC 


1920 


TATCGCTAAA 


CCATTTTTAT 




PP.PTTTAP.TT 


TPTPTTTATA


TAATTPPTAT 



1980 


TTTGACATTT 


TTGTTAAATA 




AAAAATTTTA 


APAPATPTAA 



TTTAPGAAAT 


2040 


ATTACAACGC 


CCTATGTTAG 


onU 111 LLLft 


ZIZIPTTTTTTT 


P.TTPTTATAT 


TAPTTPPTTT


2100 


CTTAGTTCAA 


TTTTTCTGGT 


r P r r r P r rpf7/"2Tft.T 



TPATPPTPPA 



AATPTTATPP 


PAPPTATTAT 



2160 


GGAAGGTGTA 


TTTGGAGTAG 


PP.P TTTT APT 


TAATTTAPAA 


PPTTATPAAA 


APAATGAAAC 


2220 


TATTCCTTAT 


GTTTGGACAA 


PTPTATPATA


PPPATPPTTT 


PTPTGGTATG 



CAACTTTAGG 



2280 


TTTATTAATT 


GCGATTTTTT 


PPP.TB.TPAAA 


AAATAPTPAT 



TATAAAPAAP 


TTGCAAAACT 


2340 


TGGAATTATG 


CCTGTTTTGT 


TTH 21 T ATTPP 


TPAAPPAPTA 


ATPTATPPAT 


1- APCTACTGT 


2400 


ATTAAATCCT 


ATTTtATTTA 


1 lH~nl 1 Kit i 


PTt- ATPTPPT 


PPTPTAATGt 


PTAGTGTGGC 


2460 


TTATTTAGTA 


ACAGATTTTA 


APTTPPPTTT 


CTCCAGTAAC 


ACAAAATGTA 


ACTTGGGTAA 


2520 


TACCACCGGT 


CTTATATGGA 


rp m r*> m rp m rp f> 7\ 


CAGGATTTGA 


TTGGAGATCA 


ATTATTTTAT 



2580 


CAATTATTAA 


TTTATTTTTA 


nL.t^nV-ul 1 nn 


TTTATTTGCC 


ATTTGTTAGA 


ATPPPAAATA 



2640 


AGAAAATTGG 


ATAATTTTTA 


Z1P.TA APTPPT 


AGTTAGTGTG 


TATTAGAAGT 


ATGATAAGCC 



2700 


TCATTTTTGA 


TATATTAAGG 


A APPr-PA AP A 


CaTGCAAaCG 


TTTGAGATAA 


A rP ArPAATT 


2760 


TTTGGTAGAT 


GGAAAACCAA 


PTAAATTPAT


TAGTGGCgCA ATCCATTATT 


TTCGCATGAC 


2820 


ACCGGCACAA 


TGGGAAGATA 


PTTTATAPAA 


TTTGAAAGCA 


CTGGGTGCGA 


ATACCATAGA


2880 


GACTTATATT 


CCTTGGAATC 


TGCATGAGCC 


AGTGGAAGGC 


GTCTATGACT 


TTGAGGGCAT 


2940 


GAAAGACATC 


GTTGCTTTTG 


TGTCATTGGC 


GCAAGAACTT 


GGCTTGATGG 


TGATTTTGCG 


3000 


GCCATCTGTC 


TATATTTGTG 


CGGAATGGGA 


ATTCGGCGGG 


CTACCTGCTT 


GGTTGCTGAA 


3060 


AGAGCACGTG 


CGATTGCGTT 


CAACTGATCC 


ACGATTTATT 


GCAAAAGTCC 


GTACTTATTT 


3120 


CAGTGTGCTT 


TTACCAAAAT 


TAGTTCCTTT 


GCAAGTGACA 


CATGGCGGAC 


CAGTCATCAT 


3180 


GATGCAAGTC 


GAAAATGAAT 


ATGGTTCTTA 


TGGCATGGAA 


AAAGAGTATT 


TGCGGCAAAC 


3240 


AAAACAGGTG 


ATGGAAGAAT 


TTGGCATCGA 


TGTTCCGTTA 


TTTACGTCAG 


ATGGTGCTTG 


3300 


GGAAGAAGTG 


CTGGATGTGG 


GCACACTGAT 


TGAAGAGGAT 


GTTTTCGTGA 


CAGGAAATTT 


3360 

CGGCAGTCAT TCCAAAGAGA ACGCTACAGT GATGAAAGCA TTTATGGCAA AACACGACAA 3420

AAAGTGGCCG ATTATGTGCA TGGAATACTG GGACGGTTGG TTCAATCGTT GGGGAGAACC 3480

GATTATCAAG CGTGATGGCC AAGATTTAGC TAATGAAGTC AAGGATATGC TAGCCTTAGG 3540

TTCATTGAAT CTCTATATGT TCCATGGCGG AACCAATTTT GGTTTTTATA ATGGTTGTTC 3600

TGCTCGTGGT GTGTTGGATT TGCCACAAGT GACTAGCTAT GACTACGATG CGCTATTGAC 3660

GGAAGCTGGC GAGCCGACTG AGAAATACTT CCATGTACAA AGAGCCATTA AAGAAGTATG 3720

TCCTGAAGTG TGGCAAGCCG AGCCCCGCAG GAAAACATTT GGTTCTTTAG GAACATTCCC 3780

TGTGCAAAAC AGCGTCTCAC TGTTGGCGGT AAAAGACCAG ATGATGACGG CmCAAGAAAC 3840

GATGTATCCA ATAACGATGG AAGAAGCAGA ATCTGGTTAT GGGTATATGC TTTATTCAGT 3900

AAACTTAAAA AATTATCATC ACGAAAATAA GCTCAAAGTT GTGGAGGCCA GTGACCGATT 3960

GCATCTTTTT GCAGACGGTA GCCTGCAGAC AATTCAATAT CAAGAAAACC TTAGGGAAGA 4020

AGTGATGATC AAGGGTACGC CAGAGaAAGa ATGGATTGaA CtAGATGTTT TAgTAGAGAA 4080

CTTAGGCCgA GTGAAcTATG GCTTTAAATT GAATGGGCCG ACACAAGTGA AAGGCATTCG 4140

TGGGGGAATC ATGCaAGACA TCCcATTTTC aTCaAGGCTA TCgGCAATAT GCaTTGaCAT 4200

TGTCAGCAGA TCAACTCaAA AAAATCGATT ATACTGCCGG TAAAAATCCg GCCCAGCCTT 4260

CTTTTTATCA AGCTGAATTC ACATTGACTG ACCTCGCGGA TACATTTATC GATTGCCGCA 4320

GCTATGGcAA AGGGGTGGTC ATCGTCAACG GTATCAATTT AGGTCGTTAC TGGCAAAGAG 4380

GACCGATTCA TTCCTTGTAT TGTCCGAAAG AGTTTTTGAA GAAAGGCACA AATGAGATCG 4440

TGATTTTTGA AACGGAAGGG ATAGAAATTA ATGAGCTCAT CTTTTGTGGA CAACCGATAG 4500

TCAAGAAACT TTTGACTAAT GATTTTTCTG AAATAGGTTC AAATATCCAT AACCATATAG 4560

ACAGCTAGTT TAAATTTATT GCAGCAAACC AATCATTGTA TGTGTCGCGA TTTTTCCAAA 4620

TCTCAATACT TATTCTAACG CATCACAAAT ATTTATCTCA ACAAGTCTCG AATGATTTTT 4680

ACTTTTCATT TATGGTTCAA GCAGTGAATA ATTTACACTT TCTTTTGAGA TTGACTTATA 4740

TTCGCGATAA ACATCCTAAA TGATAGAGAT AATTTAAGCA TTTGATCGTT GATAAAAGGT 4800

ATAATTATAG ATTCATCACA GATACATTGC TTTTTTCGAA GAAACTATAA AAAATGAGAG 4860

TCTAGGTTAT TTGAGAATAT CAATTGAGCA AGTACAAATA ACTCCAAATT TAGTAATATT 4920

CCTAATTGAT TATTGGTGAG TAATTAAAAA GGAATACTCA AGCTACATGT ATAGCATAGA 4980

AAATCAGCAT TCAATCAGAC AATTAGGATT GAATGCTGAC TTTTTTGTTA TATTGATATT 5040

TTCAATTATC TTCTTTATTT TCTTCTAGTT TTTCGAAAAA TAGGTATTCT CTGATCGTTG 5100

TATTAGTGGA AAGGGGATAA AGGAGCATAT TGATCGTGCA ACAAAACAAC AATGAGATAT 5160

CTACACAACA CCAATTTGAT GCCTACTGTA AAAAAGTATT ACGCAATGAA GCTAAGTCTA 5220

TTCGAAAACG AAACGATAAA ATAAAAGAAA ATGAGGAACC ATTAAATGAT TTAAATGAAG 5280

GTAAGTATTC ACAAACTCAT TTAGATGAGC AAGATGTTTA TTTCTTATTT GGAATGGAGA 5340

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TATTAATTTC 


AGATCAGAAG 


TTATCAGCAG 


CAATTGATCA 


GTTATCGGAT 


ACAAGGAAAA 


5400 


AGATTGTTTT 


ACTTTATTAT 


TTTGCAGGTT 


TTAATGATAC 


TGAGATCGGT 


AAAATTTTCA 


54 60 


ACATGAGTAC 


AAGTGGTATC 


TGGTATCAAC 


GAACGAAAGC 


GGTTGAACAA 


TTGAAAATGG 


5520 


AATATGGCTT 


ATGGTAGACA 


AACAAAGGAA 


CCAACATTAC 


CCATTAGTTC 


CTATATCTAT 


5580 


TATCTTATCT 


GCTTTGAAAG 


AAGACCCAAT 


TTCAATCAAA 


ATGATACTTA 


AACATTACCA 


5640 


CGGTTATATA 


ACTAAGTTAT 


GTTTAAAAAA 


TGGtTTTAAT 


GaGGTGGGGC 


AATTTaTTAC 


5700 


CTATGTAGAT 


GAATaTrTGC 


TAAGACAGCT 


AGAAATTAAA 


TTGATAGAGG 


CTATATTGAA 


5760 


ATTTAAAGTT 


AACTAATTAA 


AAAGTCCCCT 


TTCCACTTTT 


GCTATCTAAA 


AGAATCTTCT 


5820 


AATATTTGAG 


AGATATGTTT 


TTTAGTTAGA 


AAACCTATTG 


AT CAT TG AAA 


ATTAAATAAA 


5880 


TTCTATCAGT 


ACGTTAACAC 


TTCAAGGAGC 


aTATTTTAGT 


TGCGCCaTGA 


TCTCTTATTG 


5940 


TAAAGAAGGA 


GCGATTAAAA 


TTCTGAAATA 


GTTTTTATAG 


TCAATAAAAA 


GCGATGATTT 


6000 


GAAGTTGAAA 


TAATGATACT 


TCGGTAAGGC 


TCGCCGTAAA 


CTTGGGGAGA 


GGTGAAATTC 


6060 


CTATGTACTT 


TTGACAAAAG 


TGGCTGATAG 


ATTAATTATT 


GTTAATAGAA 


AGGAAGAGAC 


6120 


AATCATGTCG 


AAGAATAACA 


ATAAAAGTAA 


TTCATCAGAG 


TTAGAAAAAT 


ATTTGTTTCA 


6180 


TCCAGAAAAG 


ATAGAGGATT 


CTAATAATAA 


AATGAAGAGG 


ATAAATCAAC 


TACCATTAGT 


6240 


TTTAACGAAT 


AAAGATTTAA 


AAGAACAATT 


TCAAATAAG 






, 6279 



(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 

GCGTTCATGG AAnCCAGAAG AGACTTGtTT CAGCAACAGA TAAAACAGGT CAAGATGTCC 60 

CATTCGArAA AATCACTGTT TCAGGTCAAG TTGATACTAG CAAAGCAGGC GTTTATCCAA 120 

TCATATACAG TTACGAAGGT AAAGAAGAAA CAGCCCaTGT GGCCGTCAAA CCCGACCAAT 180 

CTAAGTTAGA GGTTAAAGAT ACAACGATTT ATGTTGGTGA TTCGTGGAAA CCAGAAGATA 240

ATTTCGTTTC AGCGACAGAT CGTGATGGTC ATGCTATATC CTTTGATAAG GTTCAAGTAA 300 

AAGGGGAAGT TGATACAAAG AAAACAGGAG AGTACCAGAT TTCCTATACC ACTGAGCCAG 360 

TTAATGAAAC CAAACCAGCT GTGCAGTCAC GACTTTTCTC AATGTTTTCA AACGAAACAC 420

CTCGTCAATT AACGACAGTA GCAACTGTTC ATGTGATTGA TCGTAATCCT ACACCGCTAC 480

CAGATAAAAA TGAGAACAAC CAGACTAGTT CTTCAACGAA TCAAACCACG ATAAAAAGTA 540

GTCAGTATGT CACACACATT GTGAAACCTG ACAAGCAAGG ACGTtATCCC AAAACGGGAG 600 

aACAAACTAA TGGTCTATAT CGTGTGCTAG GACTTGTTGT ATTACTTATT GTGATTATTA 660 

GTGGTATAGT CATCAAGAAG AAACGAAAAT AATTAAATGA GCGTGGGGCA AAAGTTTAAA 720

GTAACTTTTG TTCCACGCTT AAAAGATGAA TAAGTGCATG AATGAGGGTA GCTGTTTCCT 780

TGGAAAATAG GAAGTTTTAC AAGTGACTTT TATTTTAAGA ACACCTATTA GCAGGTTTTT 840

CTGTTAAGAT AGTTATTTTT ATGAAATAGT TTGGAGGAAT AATAAACGAG ATGAACCAGC 900

GAACTATTTA TTATGGTAGT ATCGGCACAG CCTTATTTCT TTCTTCAGGA ATGTTGATTG 960

CTTCAGTGAA TAGCTATGCA AACGAAGCGA CAACACCAGG CAAATACACC GTGACTTATG 1020

AAGTGGATGG CAAAATTGAA ACAGCAACCA TTACGGTCAA AGAAAATCAA ACGGAACTTA 1080

AATTAAAAGA CACGTTTTTA GGAGTTAATC AAGCATGGTC ACCATTAGAC AATGTCGTTT 1140

TTCTAAAAGA TAAAGAGGGT CGCCCTTTAA ATTTAGAAAA TGTAATTATA GAAAATAATG 1200

TAAATACTCA AAAAGTAGGC CTTTATTTTG TAACCTTCCG TTACGAGTCA TACCTTTCTA 1260

TTGCGAAkGT GTATGTCAAC CCTACTAATT TTGCTGCaGA GGGGaATaTG cTTGCaGaGc 1320

TTCCtTATAG TCAAGCAATG ATTGGCCCAG AAAGATCaAT TATTGCCtCG AGACaGGACA 1380

AGGAGCAACA AACAAAACAG GAGAAAAAGA AAAATAAAAT TGAACAGGAA CGAGTAAATA 1440

AAAAGGTGCT GTTAAATGAT GAGTTGTTGA AAAAAATAGA TACGATAGCC CCAGTGGAAA 1500

TTACTGGAGG AGCGGCAAGT GGGTCCTCTC AATTTGGCAC CACTTTAAGC TTCCTGTCAG 1560

GCTCACTTTT GTATGGAACA AGGAGAGTTT AAATGGAATT CTTGTTATTA GATGATCGGG 1620

AAAATTTAAA ATTAGCGATT ATTAGAAAGT TAGAGCAACA ATATTCATTT TCTGAACGAA 1680

AAGATCGTTT ATGTAAGCAA TTAGCTATTT CTCCTTATTT GCTAGAACGG AACATAACAG 1740

AAATCAATGC AGATTTGCAA CGCTTTGGTC TGATTAATGA GATGGAAATC AGTGAAAAAA 1800

ATAATGAAAT TTTGTTATCT CAAAGCCTTC GTATTTCTTC AAGTATTATT GAAGAATATT 1860

ATTTAAAAGA CTCGTTAGAA TTTAATCTAT TAAAAACGAT TTTCTTTCAT CAGTTTACCT 1920

CAATCAAAAA ATACGGTGAA AAGCACGGGA TGAGTCGAAC GGTTGTATAT AAAATTGTTG 1980

ACCATATCCG GAAAGAATTA GGTCAATACG GTATTAAGTT ATCCAAAAAT TTACAACTTA 2040

GCGGCAATGA AATGGCGATT AGACAATATT TTACTATGCT TTATTACCGA ATTTACAAAG 2100

ATTCTGAAGA ATTATATAAT CAAACAGATC TTCsCsCAGT GAACCAATTA TTGGCTCAAT 2160

TGAAAGGCTC TTATGAAAAT ATCACTAATT TTCATTTATT TAAACATTAT GTATTGGTGG 2220

CGTTAGAACG AACGCAAAGA AAAGCTAATT ATTTTTTATC CCAAGAAGAA AATCCATTCG 2280

CTTTTGATGA AGAAAGTTCG ATTtATCAAG AAATTCAAAG TTGGATCAAC GAAGgAnTGA 2340

AGGCGACTCA TGCAGAGAAA AACGCAGAAA TTCAAGgAaT TATCGGCAAT TTAgGTCTAT 2400

CAtGAACTTA TTTCGacATT TGCTGAGTcT CACAATGAGG CAATTACGGC TACGAAAACA 2460

TTATTCTTTT CTTATATGCC GTTTACAATT TCGGACGAAG AATTTTATCA AGAAATTGTT 2520

CCGATTATTT ATCAACATCG TTTTATTACG CCGTTCATCG ATATCACCTT ACGAATTATG 2580

GATTTAGAAT TTTTTCAGGA ACGCTACCCA ATTGTTTTTA ATAGCTGTCG TCAGTTTTTA 2640

TTTGCGTTGG ATTGCTCGGC GTTTGAGTTT AGTAAGCTTT CTTTATTTTT TGATTTGCTA 2700

TTAGTGTTAT CCCGTCTTTA TGACCAAAGG AATGAAAAAA GCACGATCAA TCTTTATGTG 2760

AATTTTACGC AAGGAGAAAA ATATACTCAG TTTATCAAAG AACAAATCAA AATATTCGAA 2820

TCTTTCAGCA TTCACTTTCA TTCAGCGATT CGACCAGATA CTGATTTAGT AGTTTCGGAT 2880

TATTTACCAA AAACGTTATT TTCAGTAAAa TGTTTAATTT GGCTAGCGCC CCCGCGAGCC 2940

AGTGACTGGC AAAATTTTGG CAACGAAATT GTCCGAATTA ATAAAGAGCT ACAACAAACG 3000

AAACAAAGAA AAAGCGAATA AAAACAGTCG CAACGTGTTT TAGTTGCGAC TGTTTTTTGT 3060

TACTGAATGT TTATAGTAAA CTTTCTCATT ATCACTGATG TAAAAAATTT TAAATAAAAG 3120

CCAAATAGGA TTGAAGAAAA AGAAAAAATT GCTAGTGAAG GACCaATTCT CTAAAGTTTT 3180

TCTCAAATAT AGTAGAATTC CTTTCTTTTC TTCTGTGAGA AAGATATAAT AAAAAACGTA 3240

TGAACAAAGG AAAAGGGATG ACGAATGAAT AGAAAAGAGA AAACAAATTT AGATAGTCGA 3300

ATTGATTACG GCGTGATATT ACCAGTTTTT CTTTTATCAC TAATTGGTAT GCTATCGCTT 3360

TACGTAGCGC TTTACAATGA TCCAAGCAAA CCTAAAATTG GTAGCTTGTT AATGAAACAA 3420

GGACTGTGGT ATTTAGTCGG AGGATTAAGT ATTGTCATTA TTATGCATTT TAGCTCAAAA 3480

TTACTCTGGC GGTTAACGCC CGTTTTTTAT GCACTTGGAT TAGTATTAAT GGGATTAmTA 3540

CTGAAATTCT ATGACCCCGT GCTAGCAGAG CAAACAGGTT CTAAAAACTG GATTCGCTTT 3600

GGTGGCACCA CATTCCAACC ATCGGAATTG ATGAAGATTG CGTTTATTTT GATGTTGGCC 3660

TATATAGTGA CCATGCATAA TGTTAAATAT GTGGACCGAA CACTCAAAAG CGACTTTTGG 3720

CTAATCGCTA AAATGCTATT AGTGGCGATT CCGGTAATTG TTTTAGTACT TTTACAAAAA 3780

GACTTCGGGA CGATGTTAGT TTTCTTAGCC ATTTTTGGGG GAGTTTTCTT AATGTCTGGC 3840

ATTACTTGGA AAATTATTGT GCCAGTTTTT ATTTTAGCTG CTCTAGTGGG GGCAGGAACT 3900

ATTTACTTGA TTACAACAGA AACAGGTAGA GATTTATTGT CTAAGTTAGG GGTCGAAGCC 3960

TATAAATTTG ATCGGATTGA TTTATGGCTA AATCCTTTCC ATACAGATCC AGACCGCTCA 4020

TTCCAACCAG CGCTAGCATT AACAGCAATT GGTTCTGGGG GCCTTTTCGG AAAAGGCTTT 4080

AACGTGAGTG ATGTGTATGT CCCTGTTCGT GAGTCAGATA TGATTTTTAC AGTTGTTGGT 4140

GAAAACTTTG GCTTTATTGG TGGCTGTTTC ATTATTTTGT TGTATTTTAT TTTAATCTAC 4200

CGCATGATTC GCGTTTGTTT TGATACAAAT AATGAATTCT ATGCCTACAT TGCCACAGGA 4260

ATTATCATGA TGATTTTGTT CCATGTTTTT GAAAACATTG GAGCGAATAT TGGTTTACTA 4320




GGA1 1 LbAl i 


ALL.G I 1 I Al 1 


AG 1 GAAGGTG 


rp rp rp rp j\ rp 

Gl Tb J. Ibt-Al 


111 Hub 1 AHU 


4380


ATGATTGGGG TGGGCTTAAT TATGTCCATG CGTTACCAAC AAGAAACGGT TCGTACCAGA 4440

AGTGGTCGTT AAAGAAAGAA GAAAGAAGGG GTAGAAATGT ACACACTTTA TGAATATCCT 4500

AAATGCTCAA CTTGTAAAAA AGCGAAAnCG wGGTTAGACC AACAAGGCGT TAAATACCAA 4560

GCAATTGATA TTAAAGCAAC ACCGCCTAGT AGTGAACAAT TAGCAAAATG GATGAAAGAG 4620

ACCGGCTTGC CTGTTCGTCG TTTTTTTAAC ACAAGCGGGG TTTTATATAG AGAACAAGGC 4680

TTAAAAGATC TTGTTGACAG TTTTTCTATT GAAGAAGCAA GTCAGCGTTT AGCTGCGGAT 4740

GGCATGTTGA TTAAGCGTCC AATTCTCTTA AAAGACAATA CATTTCTAAC GAATGGTTTT 4800

AAAGAAGCAG ATTACGAAGG AGTGCTTGGA AAATGAGTGA AATGAAACAA ATCGATCAAT 4860

TATGGATCTT ACCAACAGAA GCAGGTTATA AAATTGGTTT AACCGCCGAA GCCCAAGAGG 4920

ATTTAGGGAA GATTACTTTT GCGACATTTC CCAAAGTCGG ACAAACATTA GCTAAAGGTG 4980


ATTCTTTAAT 


TGAATTAGAA 






* rri rp p n r; t A T 




5040 


GTAAGGTAGT TGCGATTAAT GAAGCGGCTG ATCAAGAGCC AAGTGCATTA GATGAAGAAA 5100

ATGCCTGGAT TGCTGTTTTA GCAGATGTTG AACCAACAGA ATTTGAAAAT CTTTAGTCTT 5160

ACGTTGACAA GGCCAATGaT TATTGcTATA ATTTTCTCAA TCTAATCAAA AGAAAGCTTT 5220

GaAAAGAAGA GTACATGTTa AGGACGTTTG AAGAGAGCCC kGTTTGCTGA GAaTGGGtAA 5280

CAGATTTaAG ATGGrAGATG GtCyTTGrGc aGAGTAAGTG AGCGAaGCAG 5330



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 
TGACAGrGaC AATTkGTkGA AgCTTTCAGG AATATTGGCT ACACTTGCTC CGAATAATTG 60 
TAAAAAAGTA TCGTTAAAAA TAACACCTTT TAACCCTAGG AACAATCCTG TTGCTGCAAT 120 
TACTGGAACA ATTGGAATGA ATATTTCACT TAAAATGCGC ATTGTTTTGC GCATGCCTGT 180 
TTCATTATTT TTTAAAATTG CTGTTTGCTC TGCTTTTGaC ACAACATTGa TTCCAAGTTT 240

TTCTATTTCA TCATACACTT TAGTCACAAT ACCAGTTCCT AAAATCACTT GGTATTGACC 300
ACCATTATAA AAAACCCCCT TGACTAAATC GATTTTTTCT AGTTGTTTAC CATCTACTTG 360
TTCATGGTCC TTCAAAATAA AACGTAACCG CGTTTGACAA tGGGTGACCG ATATAATGTT 420
ATCCATACCG ATTAATTGAA TAATGTCTTT GGCTGCCTGC TGATAATTTT CTGCCATGTk 480

ATACTCCTCC TAGCTACTCA TTGATAGTAG nCAGTAGTCC TTGATAGCTn TTGAATAATG 540

nCCCnGC 547 
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 
CCTTGTTCAT TTGACACCGA CGTATTTAAG TCGTTTATTA AAAGAGGAGT TTGGTATTCC 60

CTTTAGTCAA TTTTATCGAC AATTAAAAAT CACATGGGCT AAACAGCTTC TTTTGGAGAC 120

AGATAAAACA ATTGCCCAAA TCAGTGAAGA TTTAGGGTTT GTTGAAGATA GCTATTTTGT 180

CCGGATCTTT AAACAAGAAA CGGGTGAAAC ACCCCTTAAA TACCGTAAAA GAAAGAACAG 240

AGCATAAAAA AACAACGACT CCTAGAAGTC GTTGTTTTTT TACACCTTTT GCTCTTGCTG 300

TGGCTAATGC AACAAGTGCA AAAGGCGCAT TATATCTGTC GTGCGACAAA GTGATACTAT 360

TAAATATGAA TTGGTAAACC TAAAGCCAAT TCAGCTGTAT CCATAGTAAT TTCGCCCAAT 420

GATGGGTGTG GGTGGATTGT TAAAGCAATG TCTTCTGCAT TCATGCCAGA TTCAATAGCT 480

AAAGCTAATT CAGAAATCAT GTCACTTGCA CCGACACCGG CAATTTGTGC ACCGATGATG 540

ACATTGTCTT CTACAGTTGT AACTAAACGC ATGAATCCTT CAGTTTTATC TAAAGAGATT 600

GCACGGCCGT TACCAGCAAA TGGGAATTTG TAGCCTTTTG CTTCGATTCC CGCTTCTTTT 660

GCTTCTGCAA CAGTCATACC AAGGCTTGCC AATTCTGGAT CAGTAAAGGC AACAGCTGGC 720

ATTGCTTTGT AATCAACTGC AACTTTCTTA CCAGAAATTG CTTCAGCAGC AATTTTTGCT 780

TCGTAGCTTG CTTTATGCGC TAACGCAGCA CCTGGTACGA TGTCGCCGAT TGCGAAGATG 840

TTTTTCACGT TAGTACGTCC TTGGTTGTCA ACTGGGATTA AACCACGTTC GCCAATTTCA 900

ACGCCCGCTT GTTCTAAGCC TAAGTCGTCT GTGTTTGGAC GACGTCCAAC AGTGACCATT 960

ACGTAATCTG CTTCAACACT TTCTTCTTTT CCGTTAACTT CATATTTAAC AGTGACGCTA 1020

TCGCCATTGT CAACAGCTTC TTTAGCCATT GCAGAAGTCA CGATTGTTAC GTTTTTCTTC 1080

TTGAAGTGGT CTGTGACAAC TTTAACCATA TCTTTTTCAT AAGTTGGTAA AATTGATGGG 1140

CTACCTTCTA AAATTGTTAC TTCTGAACCT AAGTTAGCAT AAGCGCCACC TAATTCAGCA 1200

CCGATGACAC CGCCACCGAT AATAACGAAT TTTTTAGGAA CTTCTTTTAA GTTTAAACCG 1260

CCTGTAGAAT CTAACACGCG TCCGCCAAAT TTGAATCCTG GGATTTCAAT TGGACGAGAA 1320

CCTGTTGCTA CAATAGCATT ATTGAATGAG TAAGTTTGTG CTGAGTCTGG GTGAATAACA 1380

CGCAATGTAT TTTCGTCAAC GAAGAATGcT TCGCCTTCAA TAATTTCTAC TTTGTGTTTT 1440

TTCAATAACA TGCCAACGCC GCCTGTTAAT GATTTTACAA CTGTGTTATC TTTCCAGTCT 1500

TGTGTTTTTG CAAAGTCTAA TTCGACTCCT TTAGCTGTTA CACCAAAAGT TGAAGAATCT 1560

TGTGCTTCTT GGTAATGATG TCCAGCAGCA ATTAAAGCTT TTGAAGGAAT ACATCCAACG 1620

TTTAAACAAA CGCCTCCGAT GTATTCACGT TCGATAATCG CAACTTTTTG ACCCATTTCT 1680

GCGGCACGAA TTGCGGCAAC GTATCCTCCA GGACCAGCTC CGATTACGAC TGTATCTAGT 1740

TCAATGGCGA AATCTCCTAC TACCATTTGT TAATCATCCT TCCATTAATA GTAATTCTGG 1800

ATCAGCTAAT AAGCGTTTAA TGTTGTTCAT TGCTTTTTGA GCAGTTGCGC CGTCAACGAT 1860

ACGGTGGTCA AAGCTTAATG ATAATTTCAT CATGCGTCCC ACAACGATTT CGCCGTCTGC 1920

ATTAACAACT GGTTCTTGTG CAATTGTACC AACGCCTAAA ATAGCAACTT CAGGGTAGTT 1980

GATTACTGGT GTAAACCAGC CGCCACCGAC TGAACCAATG TTACTAATTG TGATTGTACC 2040

ATCACGCATA TCTTGTGCAG TTAATTTACC TTCGATAGCC AATGCTGCTT TTTCGTTGAT 2100

TTCATCAGCG ATASCAAACA TGCTCTTCGT ATTA3CATTT TTAACATTTG GTACATATAA 2160

GCCATGATCT GTATCAGTAG CGATACCAAT GTTAAAGTAA TTTTTGTAAA CAATTTCTTG 2220

TGCTGCGTCA TCGATTGATG CATTCAAGAT TGGGAATTTT TGAACAGTTG AAGTCAATGC 2280

TTTTACAACG TATGGTAAGA ATGTTAATTT TGTACCATTT GCAGCAGCAA CATCTTTAAA 2340

TTTCTTACGG TGATCCCATA ATTTAGAAAC TTCTACTTCA TCATGTAATG TTACGTGAGG 2400

AGCAGTGTGT TTGCTGTTAA CCATTGCTTT AGCAATTGCT TTACGTGTTG GTGTCATTTT 2460

TTCACGTGTT TCCATTTCGC CTAAATCAGA AGTAAAGGCT TTTGGCGCTG CTTTAGGTGC 2520

AGCTGCTTCC GCTTTAGGTG CTGCTTCTGT TGCGGCAGCT TCAGTAGCTG GGGCTGCTTG 2580

AGAACCACCA GAGACAAAGG CATCAATATC CGCTTTAATG ACACGGCCAC CTTTACCAGT 2640

TGCAGTTACT TGTGTAATAT CAACGTCTTT TTCACGCGCA TACTGACGAA CAGATGGCAT 2700

TGCTAAAACG CGTTTGTTTG GATCAGCGGC TGCAACTACG CCTGTTGAAG CGGCTGGAGC 2760

TGATGCTTCC GCTTTAGGAG CGTCAGTAGC TGGTGCTGCG ACTGACGGTG CTGCTGAATT 2820

ATGACCAGGT GCSTCAATTT CAACTAACAC GTCACCCACA TTGGCAACTG TTCCTTCTGG 2880

TACAACGATA TTTTTTACAG TACCTGTTAC TGGTGATGGA ATTTCTTCTA CTGATTTGTC 2940

ATTTTGkACT TCTAATAATG AATCATCTTC ATwAATTGTG tCGCCCGCTT TAACGAACCA 3000

TTTAACGATt TnCnTTCGGC AATTCCTTCA CCGATGTCTG GTAATTgAAT TGGAAAAATC 3060

CGCCAGCTGC TTCGGTTGTT GGTACAGCAG CAC-3TTGTGC TGGTGTTTGT TCTTGTGCAG 3120

GAGCTGCTGG TGCTGCATCG TTATCTTCGT GACCAGGTGC gTCGATTtCG AyTAACACGT 3180

CACCAACGTT TGCAACTGTT CCTTCTGGTA CAACGATATT TTTTACAGTA CCTGTTACTG 3240

GTGATGGAAT TTCTTCCACT GATTTGTCAT TTTGTACTTC TAATAACGTA TCGTCTTCGT 3300

TGATTGTATC GCCAG3TTTT ACAAACCATT TAACGATTtT CCTTCGGCAA TCCCTTCACC 3360

GATATCCG3T AATTTAAACT GATAAGCCAT TTTTTAAGTC TTCCCTTCTT CGTTGGATCT 3420

gtctatt:?c 77.-.GGAGCAA AACGCGGGGA AACGACTATA GTAGCTGAAC ATTTCATGCC 3480

TATAGCCGTA ATCCACCTGT TTAATTCTTC GTTTATCTAC TTCGAATAAC TATTAAAATT 3540

CGACGATTTC TCTAGCTTTT GCTTCGATAT CTTTCGCATT TGGTAACCAG ATATTTTCTG 3600

CTTGTCCGAA TGGGAAGATT GTATCTGGAG CAGATACACG TCCGATTGGT GCTTCTAATG 3660

ATAATACGGC ACGTTCAGAA ATTTCAGAAA CAATCATAGC GCCAACGCCA GCTTGTTTTT 3720

GTGCTTCTTG AsGGACAACC ACACGGCCAG TTTTTTCAAC AGAGTTAATA ATTGTTTCCA 3780

CATCTAAASG AGCCACTGTA CGTAAGTCAA TGATTTCTGC TGAAATATTG TCTTTCGCTA 3840

AGCTATCAGC TGTTTTAATC GCTTCACGAA CCATAGCACC GTAAGTGATG ATTGATACGT 3900

CTGTTCCTTC ACGAGTTACA GCCGCTTTAT CTAAAGGCAC TTCATAAGCT TCGTCTGGCA 3960

CTTCCTCACS GAATGAACGG TATAATTTCA TGTGCTCTAA GTAAACAACT GGATCGTTGC 4020

TACGAATAGA TGAAATTAAT AGTCCTTTTG CATCGTAAGG GTTTGATGGA ATAACAACAC 4080

GAACACCTGG TGATTGTGCG ATTAATCCTT CTAAGTTATC TGAGTGAAGT TCTGGTGTAT 4140

GAACACCACC ACCAAATGGG GCACGAACAG TAATTGGCAT ATTACGAGTT CCACCCATAC 4200

GGTAACGCGT ACGAGCCATT TGACCAACGA TTTCGTCAAA TACTTCAAAA ACGAAACCAA 4260

AGAATTGGAT TTCAGGAACT GGACGGTAAc CTTGCAAGGC AAGACCGAAA GCCAATCCAC 4320

CGATACCAGA TTCAGCTAAA GGTGTATCGA AGACGCGGTC TTCACCAAAT TTTTCTTGTA 4380

AACCTTCAGT tGCACGGAAA ACCCCACCGT TGTTACCAAC GTCTTCACCG AAGATTAAGA 4440

CATTTTCGTC TTTCTCTAAT TCAAGAGCTA AGGCATCTGT AATTGCTTGG ATCATAGTTT 4500

TTTGTGCCAT TTTTATTTCG ACTCCTTCGC TTCATAAAAT GCAATTTGTT CTTTAATTGT 4560

TTGAGGTTGA ACTTCAAACA TATTTTTCAA GAAATCAGAA ACTTTTTGTT TTGGCGCTTT 4620

ATCCGCTTCT GCAATCGCTA CTTTGATTTC TTCTTTTGTT TTTTCAATAA TTTCTTCTTC 4680

TTTTGCTTCA GACCATAAGC CTTTATCTGT TAGATATTTA CGGAAACGAG TCAATGGATC 4740

TTTTTGTACC CATTCGTCAT CCATTTCTTT TGAACGGTAA CGTGTTGGAT CGTCTCCAGA 4800

TAAAGTATGT GGACCATAAC GATAAGTTAA TGTTTCAATT AAAACTGGAC CGTTTCCTGC 4860

AGCTGACCAA TCGCGTGCTT CTTTTGCAAT TGCGTAAACT GCTAATGGAT CCATACCATC 4920

AACTTGAATA CCAGGAATTC CTGCTGCAAC AGCTTTTTGA GCTAAAGTTT TAGCCGCTGT 4980

TTGTTTTTCA CGAGGTGTAG AAATCGCAAA ACCATTGTTT TGGATAATGA AGACACCGTT 5040

TGCTTGGTAA GCACCAGCAA AGTTAATTGC TTCATAGAAG TCCCCTTGTG AAGAACCGCC 5100

GTCACCAGTA TAAGTGAAGA CAACATTTTC TTTTCCACGT TTTTTCAATC CTAAAGCAAC 5160

ACCAGCTGCT TGGATGTATT GAGCACCAAT GATAATTTGT GGTGGTAATG CATTTAAATC 5220

TTCCGCGTAG TAGTTCCCTG cTACGTGACC ACGAGACCAT AAGAAAGCTT CTCTTAATGG 5280

TAAACCATGT TGTACTAATT GAGGTACATC cACGGTAACC TGGTAGTAAG TAGTCTTCTT 5340

TTTCCATTGC AAAT 5354 

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

CGTAATCTTT TAAAATGACA AGTTTAGAAC AGAGAGATGA GTGCGGACAC TTATTTTTCT 60

GTTCTTTTTC TTTCTGmAAA AGTCTTTGAA ACTTATCATT TCGGAAGGTT TTCTTTTGAG 120 

AATATGATAG AATGAAGCTG AGGTGAGCGA akgGnCAAAA AAAATCAATT TACCGAaGCC 180 

GTTATTTTCA GAAAAAGGAA AACGGGCGGT CCTTTTATTA CACGCTTATT CTGGCAGCAG 240

TAATGATGTA CGGATGTTAA gCAGACGTTT AGAAAnAGAA AATTATACTG TCTATTCGCC 300

TAATTTTTCA GGACATGGCA CGCTAGTTCC TGAAGATATT TTAGATCAAA CAACGGAACA 360

ATGGTGGCAA GATACACAAG AAGCAGTCGC TTTTTTGAGA GAAAGAGGCT ACCAAGAAAT 420

TGCCATTTTT GGTTTATCAA TGGGCGGGAT TTTCAGTATG CATGCATTAA CGGAACAATT 480

AACAGGATTA ATTGGTGGTG GATTTTTCTG TTCACCGATC TACCCTGTCG AAAATAAAGT 540

CCCaGAAAaT TTTGTATTAt ACGCAGAAAA CGTGATGAAA ATCGCAGAAG TTCCTGCAGA 600

GGAACAACAA AGCCGGTTGC AATCGATTGA GCAACGAGTC AAACAACAAT TAGGAGCGAT 660

TGAAACGATT GCTAGTCAGA CGGCTGACAA ACTAAACAAA ATCCACGCAC CTCTTTTCTT 720

AGCGCAAgcT GGcAAAGATG AAATGATTGA GCCAATGGGC GTCTATCAAA CGGCCCAAGC 780

ACTAACACAA GCACGTGTGA CCTTACAATG GTACCCAAAC AGCGGACATG TCATTaCGGT 840

GAGTGGAGAA CACAAACAAT TAGAGCAAGA TGTGGTGCAA TTTTTAGCAA CACTACCTTG 900

GAATGAGGAG AAATAATGAC AAAACAAACA ATTAAAGAGC AAATTTTACA TTTCATGGAA 960

AGCCAAAAAA AGAAAAGCTT TTCAATGGAA GAAATTGCAC AAGGCTTGAA TTTAGAAAAA 1020

AGTTCAGATT TTAAAATTTT AGTGCAAACC ATTGCACAAA TGGAACGAGA AAAGTCAGTT 1080

AGTTTCAACA AAAAAGGCAA AGTCCTGTTA CCAATGAAAG ACTTATTAAT AGAAGGAACG 1140

TTTCGTGCAA ATGAACGAGG CTTTGGTTTT GTAACCATCG ATCCTGAAGA ACCAGATGTT 1200

TATATTCCGA AAGAGGCAAC GAACTTTGCA ATGGATGGAG ATACGGTTTT AATCGACGTG 1260

ATCCAACATG CGGATCCTTT TTCAGATCGC GGCGCAGAAG GTAAAGTCAA AGAAATTAAA 1320

GAGCGAGCAG TGAGCCAAGT TGTCGGAGAA TTTGTGGCAT ATAGTGAAGA AGAAATGGCA 1380

GAAATGGGAC TGTATGGCTA CATGATTCCC AAAGATAAGA AATTGAATCA GTATACTGTA 1440

TCAATTGCAC CTGAAGGGAT TAAGCCAGTA GATGGTAGCA TTGTCATTGC TGAAATTACC 1500

TATTATCCAG ATCAAGAATA TCCaACGAGT ATGGAAGGAC TAGTCAAACA AGTGATTGGT 1560

CATAAAAATG ATCCAGGAAT GGATATTTTA TCAATCGTGG TGGCTCATGG AATTCCCACA 1620

GCATTTCCTG ATGAAGTTTT GGCTGAAGCG GACCAAGTAC CAGAAACTAT TGCAGAAAGC 1680

GATTTAGTCG GTCGTCGGGA TTTACGTGAT CAGTTGATTG TGACGATTGA TGGAGAAGAT 1740

GCGAAAGATT TAGATGATGC AGTAACGGTA CAAAAGTTAG CAAATGGCAA TTTCTTTTTA 1800

GGGGTGCATA TTGCAGATGT TTCTTATTAT GTAACTGAAG GAAGCCAATT GGATATGGAA 1860

GCGTATGAAC GTGGTACAAG TGTCTATTTG ACAGACCGAG TTGTGCCAAT GATTCCGCAG 1920

CGATTATCAA ATGGGATTTG TTCGCTAAAC CCACATGTTC CACGTTTAAC CATGAGTTGT 1980

GAGATGGAAA TTACACCAGA AGGAGAAGTT ATTTCGCATG AAATTTTCCA AAGTGTAATC 2040

CAAACAACGG AGCGAATGAC CTATACAGCA GTCAATGAAA TTTTAGAAGA GCAAAAACCT 2100

GAAACGTTAG AACGCTACAA AGAACTAGTT CCCATGTTTA AAGAGATGGG TGAGCTGCAC 2160

CATATATTAG AAGAGATGCG TATGCGCCGT GGTGCGATTT CTTTTGAAGA TCGTGAAGCC 2220

AAAGTCCTAG TTGATGAAAA TGGTCATCCC AAAGATATTC TTTTACGCAC GCGTGGTGTG 2280

GGTGAACGAT TAATTGAATC CTTTATGTTG GCAGCGAATG AAACTGTGGC GCGTCATTAT 2340

CATGACTTAA AACTGCCATT TATTTATCGG ATTCATGAAC AACCAAAAGA AGAAAAAATG 2400

CAACGTTTCT TTGACTTTGC TGCAGTACTT GGCATTCTTG TTAAAGGAAC AAAAGAAAAC 2460

ATTTCACCGA AAGATTTACA AAAAGTTTTA GAGCAAGTTG AAAATAAGCC GGAAGAAGTG 2520

GTTATCAATA CGATGTTGCT AAGAAGTATG CAACAAGCGA AATACTCAGA AGACAACTAC 2580

GGACACTATG GGTTGGCTGC TGAATATTAT ACGCATTTCA CTTCACCAAT ACGTCGTTAT 2640

CCAGATTTAA TTGTGCATCG TTTGATTCGC AGCTATAGCC AAGATCAATC TGAAAAAAAT 2700

CAAGAAAAAT GGAACGAAGC ATTACCAGAA ATTGCCAATC ATAGTTCAAG TATGGAACGT 2760

CGCGCAGTTG ATGCAGAGCG TGAAGTGGAC GCCATGAAGA AAGCCGAATT TATGGTAGAT 2820

AAAGTGGGAG AAACGTATGA CGGGATTATC AGCTCAGTCA CAAAATTTGG TATCTTTGTG 2880

GAACTGCCTA ATACAATAGA AGGCTTGATT CACGTGAACA ACTTAAAACA AGATTATTTC 2940

CACTTTATTG AAAATCATAT GGCGTTAGTT GGTGAACGGA CGGGAATGAC TTTGAAAATT 3000

GGTCAGAAAG TTCAAATCCG TGTTGAAAAA GCTGATCCAG AAACAAGAGA AGTTGATTTT 3060

GAATTGATTT CAGCTGAGGA AGTCGCGCCA GTCGAAGGAC CAAAAGGACG TAAAAAAGGC 3120

AAAGCCAATT CTTCAACTCG TTCAAATAAT CmACGAAGAA ATAAAAAAGA TGAATCATTT 3180

GATGGTAAGA AAAAGAaAAA TAAGAAAAAA GGCAAAGGCA AAAAACAaCC TTTTTATAAA 3240

GAAGCAATGA AACAAAAAAA TAAAAAAGGG AAAAAGAAGA AGTAGAGGAG GTCGCTTATG 3300

CCAAAAGGGG AAGGAAAATT AATCGCACAA AACCGCAAAG CTAGACATGA TTATTCAATT 3360

ATCGATACGG TTGAAGCTGG GCTTGTTTTA CAAGGAACAG AAATCAAATC TATTCGAAAT 3420

GGTCGTATCA ATTTAAAAGA TGGTTTTGCA CGAATTCGAA ACGGAGAAGC CTTTTTGTAT 3480

AATGTTCATA TTAGTCCATA TGAACAAGGC AATATTTTTA ATCATGACCC ATTACGTACC 3540

AGAAAATTAT TATTACACAA AAAACAAATT AATAAGTTAA TCGGCGAAAC CAAAAATACA 3600

GGTATCACAT TAGTACCTTT GAAGGTCTAT ATTAAAGACG GCTATGCCAA AGTTTTGATT 3660

GGTTTAGCTA AAGGGAAAAA GCAGTACGAC AAGCGTGAAG ACTTGAAACG TAAGGAAGTT 3720

GACCGACAAA TCAGCCGAAC ATTAAAAAAT AACCGACGTT AAAAATGTAA ATTTATTCTT 3780

TTTCGCATAT TTGCTTGCCT TTCTTCGGAT TAGGTGTAAA ATCTGGTAAG TGTTGAGTAA 3840

ATGAATGTAA ATGATTCATT TTTTAATCGC TTCGGAAAGC AGTACCACCC TGCATTAGAT 3900

TGGTTTATGG AAAAGGTTTT CCAACCAATT TTCAACACAT GAAATTTTAA AAGTAAACTA 3960

TTTGTAAATT ATCAACAAGT GGTGGTACGA TAGCTTTGCC AGGGATTATC ACTCAGGAGT 4020

AGGATGAAAC TTCTGAAAGA TTGTTGGAAA AGAGGTGAAT TGATTTGGAA GAGAAGAAAT 4080

TACTCTTCAA TATCGGACCA ATTTGGTTTG ATGGAACCAT TGTTTTGATG GTGCTGTTGA 4140

CGTGTATCAT TGTTTTCGCC TTTGTCTATG CTTGTACGAG AAATATGAAG CTACGACCTA 4200

AAGGCAAACA AACGGTCATC GAATGGTTAG TGGATTTTAT TCGTGGAATC ATTACAGACA 4260

ATTTACCTCG TAAAGAAGTG AGCAATTTTC ATTTAATGGC GTTTACCCTC TTCATGTTTG 4320

TACTTGTTTC AAACATCTTG GGTCTCGTTA CAAAAATCGT TGTGGGCGAC GATCTCAGCG 4380

TTTGGAAAAG TCCAACCGCC GATCCAATCG TAACCTTAAC GTTAGCGATG ATGATGATTG 4440

TCTTAACACA CTTCTTTGGT ATGAAACGCT TTGGTTTTAA AGGCTATCTA GTTAATAGTT 4500

ATTTAAGACC CGTTGGTTTC TTACTACCGG TAAAATTAAT GGAAGAATTC ACGAATCTAT 4560

TAACATTAGG CTTGCGTTTG TACGGAAACA TTTTTGCCGG CGAAGTTCTA TTGGGCTTAA 4620

TTGCTGGAAC CGTAGCAAGC GTCGGATTAT GGGTTATCCC ACTAGCAATT CCGCTAGAAA 4680

TGATTTGGGT AGCATTCTCA ATCTTTATTG GATGTATCCA aGCGTTTATC TTTGTGACAC 4740

TTTCAATGGT TTATATGTCA CACAAAATTG AAACAGAAGA ATAATCAAGA AATACGAAGG 4800

ACAAAAATTA TCTTTTAGGA GGATTTTACA CAATGGAAGG ATTAAATTTT ATCGCAGCAG 4860

CAATTGCAAT TTTCGGTTCA GCTATCGGAG CCGCTATCGG TAACGGACAA GTTATTTCAA 4920

AAACAATCGA GTCAATGACT CGTCAACCAG AAATGTCTGG TCAATTAAGA ACAACAATGT 4980

TTATCGGGGT CGCTTTAATC GAAGCTGTGC CAATTTTAGG TGTCGTTGTT TCTTTATTAC 5040

TTTTATTCAG ATAAGCAATA AAATGGAGGC AGGACGCTGA TTATTATCGG GACCGTCACT 5100

GCTTTCTTCT ATCTTTAGCC TTTTCAATAG CTGGTGATAA AGCAGTTGAA GAAGCTAAAT 5160

CGATTTTTGC AGGAAAGGAA GTTGAGACTT CATGCTACTT ACTACATTGG TAGTCGGTGA 5220

AaCGGCTCCT AGTACCACAC TAGGCACGAT GATTGTTGTC AGTGGTGCAT TTTTAATCTT 5280

GATGCTCTTA TTAAAGAAAT ATGCATGGGG AGCAATCGTA GATATTTTGA CACAACGTGA 5340

AGAAAAAATT GCCAACGATT TAGACTCTGC AGAACAATCT CGCGTAgcAG CAGCGAAAAT 5400

GGAAAAAGAA CGTCAACAAC AATTACTTTC TTCCAGATCA GAAGCAGCTG AAATTATTAA 5460

AAACGCGAAA GAAAGTGGAG AACAGACTCG CCAAAAAACA TTAAAAGAAA CAACTGCGGA 5520

AGTTAcTCgc TTACGCGAAA AAGCTCGGAC AGATATTTCT CAAGAACGTG AAGAAGCATT 5580

GTCTTCAGTA AAAAATGAAG TGGCAGATCT TTCTCTTCAA ATTGCAGCCA AAATTTTAAA 5640

TAAAGAATTA ACACCAGATG CACATGAAGC ATTGATTGAT TCTTATATTG AAAGTTTAGG 5700

TAAAGCAAAT GAAACTAGAT AAATATACAG TTGGTAAACG CTACGGCAAG GCTTTGTTTG 5760

AGTTGGCAGT TGAAAAGaAC CAAGCTGaAG CAmTTTATCA AGAATTATTG ACGCTTCGcG 5820

AAGTCTATCA CCAAGTACCT GGTATTGGCG ATATTTTAAG TGATGATCGC TTAGAGCCTT 5880

ATGAAAAAGA CAGCATCATG GnAAAACTAG TAACAGGTTT TAGTGAAATG ATGCCAAAnT 5940

TTTTGCGAGT GGTTTACGAA TATCGTnGCA TGTACGATCT GCTCTTGATG ATTGAnGGAT 6000

ATGAA 6005



(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 

TTCGTTGCAA CAGAGTTTTC ATTTTTCTTA GGAATTCCAG TAATGTTTGG AGCAAGTTTT 60 

TTAAAAATCT TTAAATTCTT GGCCAAAGGA AATACGTTTG GGTCAGAAGA AATCATTATT 120 

TTAATAACTG GCTCTATTGT TGCCTTCGTT GTTTCCATCA TTGCGATTAA ATTCTTATTG 180 

AATTACTTGA AGAAAAACGA TTTTACAGTT TTCGGTTGGT ATCGCGTAAT TTTAGGTGCG 240

ATTTTAATTG GCTACTGGTT ATTCTCATAA AAATAAAACC GTTGCTTTCA ACAGTGAAAG 300

CAACGGTTTT ATTTTTTTAA ATCACCAAAC GTTATTCGCT CAATCCCCTG TTCTTCAAAC 360

CACTGACGAA CACGAGGATC ACAAATCGTT GCTAATTCTT TGCAGCGTAA AGTGGTATAT 420

GAAGAATTAT CTAACACAAA TTGATCCAAA TACCCAACAT CAAAATGCAT CTCAGCAATT 480

GCCGATTCCG GCATCCGTAA TAAACCAAAG TCATCTTCTA AAAAATTCTC CACTGATACG 540

CCTTCGTGaA TaTAAGATAG CGCCCCAGAC TCTAACAAAT GTGTTACCTG GAAATAACCT 600

TGTTGcGGkG GaTAwTCtTT TTcCCCCACT nATAATTnCA GGG 643
(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3384 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 



TTTGGCAAAC TCTTCATTAT ACATGTTAAT AAGCGCATTA ATATGCGGCT GAATAATTTC 60

TTCAAATAAC GCTTCCTTAG AAGTAAAATG TTTATAAAAA GCACCCGTAG TTACGCCTGC 120

GGTCAAGCAC AGTTGCCTTA TCTTAGTTTG CTGAAATCCT TCTTTTTCAA AATGGCTTAG 180

TCCACTTTCT ATTAATTTTT GGTGAGTTGT AATATAATCT CTGGTTTCTA TGGGTACTTC 240

CTCCTAATAA GATGAATTAA CTATATTATC TTACACAGGA CTTGAAGTTT CAATCGTTAA 300

CGTCTATTAC TGAACTCATT CCTTAGAGCT AAACTAATGA GAAAGAGCAA AATAACTGAT 360

ACAAAAAGCA ATCCTACAAG AATTAATAGA CTCATCTAAA TCATTCACCT TTTCTAAAAT 420

AATCCGAGAA AGATTTAAAA AAATTATATC CAGCTATTAG GATAAGTCCT ATCATAATCA 480

CTCCAATAGC CATTTTCTTA TCCTCCCTAA AAATAAAGTC TTTTACTTAA AACCTTTTTT 540

GTAATCGTCA ATAAATTTTT TCCCTTTATC AGTTAATTGA AATCCTATAT CTAAATAGTC 600

CTCATATCTA AGTTTAGCCA GCATCTCACT TACAGATGCT GGACTAACAC TTAGAACTTT 660

TGCTAAATCA TTATTTTTTA ATTTCTTTTT TCTTTTTGAT AATTCCTGAA TTGCTAGTAA 720

ATAAATTAGT ATAGTAGTTG TTTTAATTTG TACCATACCT ACCCTCCAAA TTATTTCGAA 780

AGGCCTTCAG CAATTTTATC CAGGTTCCAT TTCATCATCG CATAGTAACT ATCACCATCT 840


TGTCCTTTTT 

1 \JJ 1 V* X X X X X 


CTGCAATTGA 


ATCAGTAAAA 


ATCGTTGAAT 


AGATAGGAAT 


ATTGGTATCT 


900 


TTTGATAPTG 
111 \Dr\ 1 i u 


TTTTCATCGG 


TCTATCGTCC 


ACACTACTTT 


CTACGAATAA 


GGAGGGAACT 


960 


rp rn rnp m fp r< rp o p 


PTA ATTTTTP 

O 1 Ai/i 1 X 1 1 A 


AACTAAGTGT 


TTTATTTGAT 


CTGGTGTTCC 


TTCTTCTTCA 


1020 


/"* T* O T ""P f" n TTT 

(j 1 b 1 ludl 1 l 


ppp a A atgt A 


AGCAGAAGGC 


ACATTATACG 


CTTTCGAGAA 


ATATTTaAAG 


1080 


/"*» T\ rT*/^/"* T 1 ^* 7\ P* 

CATLC1 


(pipPT A AP A AT 


PATTTTTTTA 



TCATTCGGAA 


TTGAAGCAAA 


TTTAGATTTA 


1140 


j-i^rprp/^pprprp/^rp 


pf np APA ATP 


CAACTTTTCA 


ATATACTTAT 


CTAGATTTTC 


TTTATAGAAT 


1200 


11 1 1 Ini lrti 


PAGGATPTTT 


TTCCGCTAAC 


CATTTTTCAA 


TATTTTTAGC


GTAAATAATA 


1260 


u^ui i i x^irt 


AATTTAACCA 


AGCATGGGGA 


TCTTCCTTCC 


CTTTCTCACT


CTGACCCTCT


1320 


t\t\\jj l J-\J\M. 1 rtrt 


PATPTATGCC 


ATCACTTGCT 


GCAAAATAGT 


CTTTGTTTTC 


CTCTTTGTTC 


1380 


PPBTVTTTTH 


PTAATTTTGT 


AAACCAAGCA 


TTTCCTCCAG 


TkTCCAAGTT 


AACACCGTTA 


1440 




AATPTGPTTT 


TGAAGTTTTT 



TGAACATCTT 


CAGGCAAaGG 


TTCATATTCG 


1500 


1 ooUvart ltl 1 


r* PP A AT AP^PT 


APGATACTGT 


GTAAATCAAT 


TTTATCTTTT 


GCTATATTTT 


1560 




TPPTAAAATP


GAATTAGTAA 


CTACTACCTT 


TAATTTTTCG 


TTCGAAGCCA 


1620 


Llulnl vl 1 X 


PTPTTTAPTA


TTTGTTGAAC 


ATGCCCCGAA 


TATAAAAAGA 


GCAATCAACA 


1680 


1 1 ^L.Lnuri^- x 


TPPPAPTAPT 


ACTTTCCATT 


TTCTCATCAT 


AACGATCTCT 


CCTTCTTTCT 


1740 


In/vin/il 111 


TPTTTTGGTG 



AGATTAGAAA 


ACTAATTACA 


AATAAGGATG 


CCGCTGTTAA 


1800 


HHPZ\QTI\PTT 
/irtV_,rlri 1 -oV— 1 1 


PATPPTGPGG 


PTATGTTAAA 



ACTATAACCA 


ATAAATAATC 


CaAGCAATGA 


1860 


tpa apttp^pt 

1 uriri^ 1 1 o^> l 


CPGACTGTTG 


CAGAAATAAA 


TATCATAGTT 


TTCAAGCTAT 


TTGCATATAA 


1920 


ATATGPTGTA GCTGCTGGCG TAATTAACAT TGCAACAATT AATATAGTTC CTACACTTTG 1980

PATAGPTGTT ACTGCTACTA AAGTTAATAG AACCATCAAT AGATAATGAT AAAAATTGAC 2040

attpatpppp ATTGCTTTTG CCATTAATGG ATCAAAAGAA GTGATTAATA ATTCTTTAAA 2100

AAATAGAAPT ACTATAAPCA ACACTAAAAC AGAAACGCCA ATTGTAATCC ACTTATCAAT 2160

ATCTTGTACA GCTAAAATAT TTCCAAACAA GATATGAAAT AAATCGGTTG AGCTATTTGC 2220

AACACCTATA AGGATAACTC CTAGTGCTAA AAAAGAACTG AAAGTAATAC CAATTGCTGT 2280

ATCTCCTTTT ATTACACTAT TGCTTTTTaT AAAaGTAATA ATGnTAGrAG CTAATAGACC 2340

AAATACAATC GCACCAATAA AAAAGTTAAT GCCTAAAATA TAAGAAATTG CTACTCCTGG 2400

CAAAACAGCA TGAGAAATAG CtCGCCCATA AGAGACATTC CTCTTAAAAT AATAAAACAA 2460

CCTACTGTTC CAGAAACAAT CCCTATAACT ACAGATGTAA TTAGAGCATT CTGTAAAAAG 2520

TGAAAGTCAT ACAGACCATT AATAAAATTT TGAAGCATAT TATTCCCCCT CTCCAATAAA 2580

TATAGTATCT CCATAGGCTT TTTTTAAATT CTCTTCATTA AATACAGATT CTACACTTCC 2640

GTGAGCAATT AGATTTCGAT TCAAAAGAAT AATATTATCA AAATACTtCT TTACCyTGcT 2700

TAAwTCATGA TGCACAATTA GCACTGTTTT TCCCTGATGT TTCAATTCTT TTAACGTATC 2760
CATAATTATT CGTTCACTAA CTAAGTCTAT CCCTACAAAT GGTTCATCTA AAAATATAAA 2820

ATCCGCATTT TGAGCAAGAC ATCTTGCTAA AAGTACACGT TGAAATTGTC CTCCAGATAA 2880 

TTCTCCTATT TGATGATTAG AATATTTTCC CATATCGACC TTTTCCAAAG CCTTTGAAAC 2940

TTTTTGCCAT TCTGCATTTT TAATTCTTTG AAAAACTTTC ATTCCGGCAT ATGTTCCCAT 3000 

AGATACACAT TCTTTTACTT TTATTGGAAA AGTAAAATCA ATATCAGCTT TTTGTTCTAC 3060 

ATATGCAATC TTTACCAGTT TTTTATTGAT ATCTTcATTA TTAATGGAGA CCTCTCCCTG 3120 

ATGAGGGATA ATATTTAACA TTGCTTTAAG CAAAGTAGAT TTTCCAGCAC CATTTGGACC 3180 

AAAAATACCT GTTAAAGCAC CTTCTTGTAT TTTTAACGAA ATCTCGTTCA ATGCTAAAAA 3240 

ATCGTTATAA CAAACmnGTA AGCTTTTTCA CTTCTAACAT AACTTTCTCC TTCTAAATGA 3300 

TAATGATTAT CACGATTGTA TGGtACACaA AAATAATAtT TaGtCAaGCC TAAAATTTTa 3360 

AATTgGGATT TTnCgGTgGT TGGA 3384 
(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 



AnAATATAAA TCCTGAGTAG TTGCACTGAG AAAGCCAAGG ATTAAAACTT CAAAGGAAAA 60

GCCTGGGACA TCAATCATTT TGGATTTTTG TTCCAGGCTT AAAAATGATA AAGGGTGGGA 120

GCAGAAGCAA CTACTAACAA ATAAGACTTT TCTTTACGAC AAATAATGCG GATCGGTCTT 180

AGTCGTGCTA ATGTTGTCGC GGATTTTTTT CAAGGAATCG GCATTTTGAT CAAAATTGTG 240

ATACATCGTA TTTGTATATA AAATATCCAT TACCGTTAAG TAACTGATCC GTGAAGCCAA 300

GGATTCCGCA CGAAACAAGC CTTCTTCGGT GACGACAATA ATTGCTTCGT CGGACAAGCC 360

TGCCAACTCT GAACCGCTGT TACCAGTTAA CGTAATCATT TTTGCGTTGG TTTTTTTAAC 420

TTGTCGTGCT AGGTTGATTG ATTCTTTGGT TTTGCCTGAA TGGGAAAAAA TAAACACGCA 480


ATCTTCACTG GTTAATTTTG TGACAAAACT TAACTGCATG TGGTAATCAA AAATGTAGTT 540

GCACCGATAG TTAGTCCGAA TGAATTTATG GAAACTATCA AAAGCCACAA TCGAGGACCC 600

GCCTTGCCCA AAAAAATGAA GCGTTTTGGC AGAGTACATC AAAGCCAACA CGTTATCTAA 660

CAGCTCCTTT GTTAAGAACT GTGTGGCGTT ACTGAGGGAA TAAATATTGG CTTGTAAGAC 720

TTTTTTAGCA ATGGTAATAC TATCATCATC TGGTGAAATA TCATCGGCGC CATTCACGCT 780

ATTGACGTTT TGAAAAGTCG GCTGATAATT AGAATTTCTG GCGATATCAA TTTTAAATTC 840

TTGGAACCCA GAATAGCCAA TCTTTTTTAC AAATTGATAC ACGGaAGATT GGGaGACGCC 900

AATTTCGTTG GCAAGACTTT CTAACGTCAT TTGTGTCAAT AATGaTTCAG AATGAACAAA 960
GTAATCGGAA ATTTTCTTTT CTGTGAGGCT CAAATTCGGT TTTTTGaTCC GAATCCGTTG 1020

ATCCAAATAG TGGkTTTTCa TTAACACGTT CCTCCAATCT TATTTTTTTA AAAATAAAAT 1080

GAATGCCTTT TCATTGTTGG TATAGACTGA TTATAGTGGA AAAAGAGGAC AAATACAATT 1140

TCTCTTTTTC TCCACAAAAA AGAAATTCTT TAAAAAAAGC CAAAAAAAAA TAAATAATCT 1200

TTTTTCTGAA AAATGAAGGG GTCTTGAAAA GACTTCTCTC AACTAAAGAA AGTAGCCAGT 1260

AAAAAGAACG TGATACCAAG AGATTTACTT GTGTTTTTTG TATGAAAGCT CCTTAAAAAT 1320

AAAAACGTTT TACCTTTTGA GTAGGCAAGA TTTTTGCTAA AAAATAAAAA ATATTTTTTA 1380

ATTAGTGTAA TCGTTTACAT TATTTTTTTT ATCCTTTATG aTTGaAGTAT CAAAAGAGGC 1440

AGGTGaAAGA AAAATGGACA TTGGCTTTAT TGGACTtGGG AAAATGGGTT tGAACATGGC 1500

ATTAAATGTT CACGAACAAG GCTGGCCCAT TATTGGCTTT GATGTAACAA AGGAAGCAAG 1560

AGCGACTGCT AGAGAACAAG GTCTTTCCGT TGTAGATTCG TTGTCAGAAT TATTGAAAGC 1620

GCTTAACAAA AGGAAGGTGA TTTTTTTAAG TACCCCCGCT GGTCAGATTA CCAATCAATT 1680

AGTGGCAGAA TTGGTAGAGC AATTAGCACC AGAGGATATC ATTGTGGATA GTGGCAATTC 1740

AAATTTTCAT GATAGTGTTG CGAATGCTCA GCTTGCTAAG GAAAAAGGAA TTTATTTTAT 1800

TGATTGTGGT ACTTCTGGTG GCATCAAAGG AGCGCGGGAA GGAGCGTGTT TAATGGTTGG 1860

CGGTGCGCcT GAAGCTGTCA AAGTGTTAAC CCCATTTTTT GAAGATTTAG CTTGTGAACA 1920

AGGGTATCTT TATGCTGGGA AATCAGGAGC TGGTCACTAT TTGAAAATGG TGCATAACGG 1980


GATTGAGTAC GTCATGATGC AAGCCATGGG CGAAGGATTC AATTTATTAG AAGCTGCCGA 2040

ATATGATTTT GCGTTGGAGA AAGTAGCCGA TGTGTGGAAT CACGGCTCGA TTATTGAAGC 2100

TCGCTTGATG GGCTTAGCGA AAGAGGTTTT TGCGGAAAAT CCTACGTTAG CCAATCTAGA 2160

AGGGAAAGTG GCAGCAAATG GCGAAgcnAA ATGGATGATT GAAGAAGCAT TGCGTTTGGA 2220

GATGCCCGTC CCAACTACGG CGTTGTCCTT ATTTACTCGA AATGAAAGTA TGCTAGCAAA 2280

TCATTTTTCA AACAAAGTGG TCGCTTCTTT GCGACAAGGA TTTGGCGGGC ATGAAGTCGT 2340

AAAAAGTCAG TAAAGGAGTT TAGTCATGTT AAAAAAACAA AAAATTGCCA TAGTGAATTC 2400

CAGTAGTTTT GGTCAAATTT TTCCTGAACA CTTGACGCGC TTAGAAAAAA TAGGCACCGT 2460

GAAGCATTTT ACAGTCGACA GTGAAATTGG TGGCAAAGAG TTAGCGGAAC GTTTACAAGG 2520

ATATACGATT ATTATTGCTA GCGTGACGCC ATTTTTCACG AAAGAATTTT TTGAACATAA 2580

AGATGAATTG TTGCTGATTT CTCGCCATGG TATTGGGTAT AACAACATCG ATTTAGACGC 2640

TGCAAAACAG CACGACACAA TTGTTTCCAT CATCCCAGCT TTAGTAGAAC GAGATGCCGT 2700

GGCGGAAAAC AATGTCACAA ATTTGCTAGC TGTTTTACGA CAAACCGTGG CTGCAGATGC 2760

CAGCGTTAAA GCGGATCAAT GGGAAAAACG AGCGAATTTT ATCGGTCGGA CGTTATTTAA 2820

CAAAACTGTG GGTGTCATTG GTGTTGGGAA CACAGGAAGT TGTGTCGTGG AGACGTTACG 2880

AAACGGATTT CGCTGTGACG TGTTAGCGTA TGATCCCTAT AAATCCGCCA CTTATCTGCA 2940
AAGTTATGGA GCCAAAAAAG TAGACTTAGA TACGTTGCTA GCTTCAGCAG ACATCATTTG 3000

TTTATGTGCC AATTTAACTG AAGAAAGTTA TCACATGATT GGCTCGGCGG AAATTGCCAA 3060

GATGAAAGAC GGCGTCTATC TTTCCAATAG CGCACGAGGC GCCTTGATTG ATGAAGAAGC 3120

CATGATCGCC GGTTTACAAT CGGGCAAAAT TGCTGGGCTT GGGACAGATG TTTTAGAAGA 3180

GGAACCAGGT CGGAAAAATC ATCCTTATCT TGCGTTTGAA AATGTAGTAA TGACGCCGCA 3240

CACCTCCGCT TATACCATGG AATGCCTGCA AGCGATGGGA GAAAAATGCG TGCAAGATGT 3300

GGAAGACGTG GTGCAAGGTA TTTTACCTCA ACGGGCAGTG CAAGAAGTCA GTCGTTACGT 3360

TAGTTAAGAA GGAGGAGGAG CATGTTAACT GATTTAAAAG TGAAAGTTGG ACCGCAATTT 3420

TATCGTTACC ACGAAGGCGC CTTAGCTTCG GTGCCGTCAC TTTTCAAGGA ATATCACGCC 3480

CAACGAATTT TGGTGGTGCA TGGCACCGTA TCCTTTGAAA AAGCTCAGCC ATTTTTACCA 3540

TTCTTGGCAG ATTCAGAGTA TCAATTCTTC TATCACACTT ACACAGGAGA ATGTAGTTAT 3600

TTTGGCGCCG AACAAATTTC ACAGCAAATA AAGGAACACC AGATTGACTT TCTTCTGGGA 3660

GTCGGCGGCG GTAAGTTAGC CGATTTAGTG GGCTACAGCG CCCATTTAAA CAATCTGAAT 3720

TTTGGCTTGG TCCCCACTTT GGCTAGTAAC TGTGCGCCGT GGACACCGTT AGCTGTGATG 3780

TATCAAGAAA ATGGCGCGGC CGAAGGCAAA ACGGAACATT TCTTTCGCCA AGCTGCGTTT 3840

TTAATCACCG ATCCTAAGTT GCTTCTGGAT GCGCCCCGTG ATTATTTTGT CGCTGGTTTA 3900

GCTGATACGT TAGCGAAATG GTATGAATCA GAGACCATTT TACGCCAAGC CCATTTGCAA 3960


GGGGAACCCT TTTTACAATT AGCGGGAGCG ACCGCCAAGC TTTCGCAAGA AGCGATTATG 4020

CGGGACTCAA AAGCAGCCTT AGCAGCGATG GATGAGGGCA AACTCACGCC TGAATTTGTC 4080

CATCTTTCAG AAATTGTTTT CGCTGTTTCT GGATTAGTCG GCGGCTTCGG CGACAAATAC 4140

GCCCGCAATG CAGCAGCACA CGCCATGCAT GATGCTATGA GTAAATTTTT ACCCAAAAGT 4200

CATGACTATT TACACGGCGA AAAAGTAGCT TACGGCATCT TCTATCAATT AGCATTGGAA 4260

AAAAGGTGGG CCATCATAGA CGCATTAATC CCTTTTTATC AAGAGCTAAA CTTGCCCATG 4320

TCGTTACGAC AAATGGGCCT TTACCCAAAA GAAGAAGCAG TACTCGATGC AATGGTTCAA 4380

TTTATTGATT CAAAAGAAAA GGTTCATTTA ATTCCTATTG AGATTTCGGA AGAACGTTTA 4440

CGTCAAGGGA TTAAAGAGCT AGAAAAGTAT ATCCAAAACC AAGGATAAGA AGGAGGAACT 4500

ATGGAAACGT TAAGTGTTTT ACAAAGTTTG ATAATGGCGC TTTGGGTGGC AGCAATTATG 4560

TCCAGATGGC TTGGAGGAGG AGCAACGTTA ACGTTGCGTT TCTCGCCCTT AATGACAGGG 4620

TTAGTAGCAG GTTTAGTGAT GGGCGATGTC CCGAAAGCGA TGATTGTCAC CGCAGCATTA 4680

CAAATGATTT ACATGGGTGT TTTTTCACCA GGTGGCTCTA TGCCAGCAGA ACCATCAATT 4740

GCGGCAGCGA TTGCTGTGCC AGTTGCGTTG CTAGGGAATT TGAAGCCGGA AGCGGCTATT 4800

GCCGTAGCGG TTCCTGTCGG TCTTTTAGGA AGTTATTTAT ATCAATTCCG TTTCTTTATT 4860

AACACGTTTT TAGGGAAGTA TACCGATCGT GCGGTGGCTG AACTAAATTC TAAAAAAATT 4920
GCCCGCTCAA TTATCTGGTA TCCAACGATT GCCTCATTTA TTTTATTCGT TCCATTAGTC 4980

TTTTTTGCTC TTTATTTAGG GGCACCAGTC ATTGCCGATA TTATTAAAGC ATTAGAAGGA 5040

ACCGTTGTGA TTCACGTGTT AGAAGTAGTC GGTGGTGGTT TAGCCGCAAT TGGGATTGCG 5100

ACAACGGTCT ATGTCATTGG TCGAAAAGAT TTCTTGGTTT TCTTTTTCCT TGCATACTTC 5160

ATGAGTATCG TCTTTAAGTC ATTGGAAATT ACCATGGTAA CGTATGCCAT CTTTGGCGTA 5220

ATTATTGCGT TAATTTTTGT GCAAGTCCAA AAAGGCAAAC CTGTAGCAGA AAGTGCTGGT 5280

TCCGCCAGTG CCACCACTGA TTTTGATGAT GATGACGATT ACGATGACGG GTTTTAAAAG 5340

AAAAAGGAGG AACTGAAATG ACGGAAACAA CAGAAAAAGT CGTGCCGAAA ACAACGAATT 5400

TAGCACCAGA AGAAATTACC AATAAAGATG TTACAAAAGC CTATCTCCGT TGGCATTTTG 5460

CCAATGAAAT TCCCCATTCC TTTGAACGTT ACTTGGCGCC TTCGTTATTG TACGCAATGA 5520

TGCCACTTTT AAAGAAATTA TATAAAGACG ATGAACAATT AAAAGCCGCT TATATGCGTC 5580

AACTGCTGTT TTTCAATACG CAATTAAGTT GGGGCGGCGG GGTCATTACT GGGCTGATGG 5640

CTTCCATGGA ACAAGAGCGG GCCAAAGAAG AACATGAAGG CCGCGAAATC ATGATGCAAG 5700

ATGATTTAAT GTATAACACC AAAGCTGGCT TGATGGGCGC GTTAGCGGGG ATTGGTGATG 5760

CGATTGACTC AGGTACGGTT CAATATATTT TTATTGCGAT TGCGGTGCCG TGGGCGCAAC 5820

AAGGTAGCGC ATTAGGTGCG ATTTTCCCAT TTGTGGCGTT TGCCCTTTAT CAAGTACTGT 5880

TAGGTGTGTT CTTTGCGAGA CAATCCTTTA AAATGGGGCG GAATGCCACA GGCTTAATGC 5940


AAAGTGCGGG GATTCAAAAA GCCATTGAAA TGTTATCTGT TTTAGGGTTG TTCATGATGG 6000

GGATTTTAGC TGGTAATTAT GTCAAAGTTT CTTCAACCCT TCAATTTAAA CTTTCTGGCC 6060

GTGAGTTTGT GGTTCAAGAT ATCCTTGATC AGATTGTCCC AGGACTCTTA CCGCTAGCTG 6120

TTGTAATGGG CGTGTATTGG TTCTATACGA AAAAAGGCTT GAAAGTTACG CAAGCATTAT 6180

TGTGGTTAAC AGGTATCTTA ATTGTCTTAG CAACAGTCGG TATTTTATAA AAAGAAAAGG 6240

ACGTGAATCA GATGGGCGTA GTAAATTTAG CAAGAGTAGA TGAACGATTA ATTCATGGAG 6300

AAGTGATGGT GACCTTGTCC CAAAAAAGTG GCGTTAATTC CATTTTCGTT GTCGATGAAG 6360

TGGTGGCAAA GGATAAATTT ATGCGTGATT TATACAAAAG TGCAGGCAGT CGGACAGGAC 6420

AAAAAACCAT TGTCATTACG CCGGAAAAAG CTAAATTTTA TTGGGATGAA TACCAATTCA 6480

AAGAGTACAA CTGTATTTTA ATCGCTAAAA CCGTCTCTGT AATTTATGAC TTAGTAAAAC 6540

ACGGAGTCCC TATGAAGGAG TTAAACATCG GCGGTATTGC ACAGAAGAAT CCAGAAAAAG 6600

ATTTATTAGT AACCAAATCA GTTTATTTAA ACAAAGAAGA TGCCGAAAAA TTAAAAGAAC 6660

TACATGAGGT GTATGGCGTA GAAGATATTT ATTTCCAAGC AACGCCTTCT GCACCAAAAA 6720

CTAGTTTAAA AGAGGTTTTA GGCAAATTTA ATTTATAAAA AAGGCCAGTC GCTTGAGTCA 6780

TTGCATGACT GTTTGAAGTA ATGACTCAAG CCACTGAACT GAAGGAGGAA CTTTTATGTT 6840

AGGGATTGTG ATTGCTACAC ATGGTGCACT AAGTGACGGT GCGAAAGACG CAGCTACCGT 6900
GATTATGGGC GCAACCGAAA ATATCGAAAC CGTTAATCTA AATAGTGGAG ACGATGTCCA 6960

AGCACTAGGC GGACAAATTA AAACAGCGAT TGAGAACGTC CAACAAGGGG ATGGCGTGTT 7020 

GGTCATGGTT GACTTATTAA GTGCGAGTCC TTACAATCAG GCAGTTTTGG TGATTAACGA 7080 

ACTCGAACCA gCTCTACAAA AGAAAATCTT tGTGGTGAGT GGTACCAACC TACCAATGGT 7140

CTTAGAAGCG ATAAACCATC AATTATTGGG CACACCGATT GCTGAAnCGG CGCAAGCGAT 7200 

TGTTGCGCAA GGC 7213
(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

ATCCAAAAAG AAAGGAAGTG CCTTATGGCT AAAAAATCAA AAATTGCTAA AGCAAAAAAA 60 

CAACAAGCAA TGATTGCAAA ATATGCGCCT ATTCGCCAAG CCTTAAAAGA AGCTGGCGAT 120 

TATGAAGGTC TTAGCAAATT GCCAAAAGAT GCTCATCCAA GTCGCTTAAA ACTCCGCGAT 180 

CAAACAGATG GGCGCCCACG TGGCTACATG CGTAAATTTG GCATGTCTCG GATTCGTTTC 240

CGTGAATTGG CTCACCAAGG CTTGATTCCT GGCGTCAAAA AAGCAAGCTG GTAACTCATT 300 

ATTTCGATAA AGACAACTAG CGAGGACAGT CGCCTTTGGG CTGACCTCGC AGCCCCCTTT 360 

ATCTTTTATA CGCAATTAAA AAAATCAAAT TGGAGGACGG TaCTATGCGC CAAACCATTA 420

CGTTAGCTTG CGCTGAAACT GGCGAGCGCC TTTaCTTAAC ArGTAAAAAC ArACGAAATA 480

CACCAGAAAr AtTACAGCTG AAAAAAtATT CaCCAAAACT aCGGcGcCGs GsACTTTTcA 540

CkGAAGTCnA ATraCCGGGG GGAA 564 
(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

CAAGAATCGT GTGGAACAAA AAGCGGAAGA CGGCGACGCA AAAGCTCAAA AATTGCTTAA 60 

GGTATTGCAA GACCCCAATA ACTTTTTATC CACAATCCAA GTAGGGATTA CTTTGGTTAA 120 

TATCCTATCT GGTGCTTCAT TAGCAGAGAC TTTATCTAGT CGTTTGGCCC CTGTATTAGG 180 

TGGCGGCGCg CAGCTAAGAG TCTTGCCAGT ATCATTATCT TAGCTTTGTT AACGTACGTT 240

TCTATTGTAT TTGGTGAATT ATACCCTAAA CGAATTGCTA TGAATAAATC AGAAGAAGTC 300 

GCTCAATTGA CTTCTGGTGC AGTCCGCTTT TTAGGAGTTA TCGCGCGTCC CTTCGTTTGG 360 
TTGCTATCGG CTTCGACCGA TTTATTATCC AAAATCACAC CAATGACTTT CGATGATGCT 420

GATTCGAAGA TGACACGAGA TGAAATGCGC TACATGCTTG AAACAGAAGG TGTTTTGGAA 480

AACGAAGAAC TAGAAATGTT ACAAGGTGTC TTTTCATTAG ATACAAAAGT GGCACGAGAA 540 

GTAATGGTTC CTCGGACAGA CGCATTCATG GTTGATATCC AAGATGATGT CCAAGAAAAC 600 

ATCAATTTAA TTTTAGGAGA AAATTATTCT CGGATTCCTG TTTaTAGTGA AGACAAAGAT 660 

AAAATTGTTG GTATTTTACA TACCAAAACG TTACTAAAAG CTGCACGTAA TTTAGGCTTT 720 

GAAAACATTG AATTAGGCGC GATTATCCAA GAACCATTAT TTGTCCCAGA AACCATTTTT 780

ATTGATGATT TACTTTATGA GTTAAAACGA ACACAAAACC AAATGGCGAT TTTACTAGAT 840

GAATATGGCG GTGTTGTCGG TTTAGCAACA CTGGAAGATT TACTAGAAGA AATCGTTGGT 900 

GAAATTGACG ATGAAACAGA T 921 
(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 



TGAACGTCCT ATTAGTTATA GACGAATTTG AGGACCAAAA AGTTTCAATT TTTTCAATTT 60

AGTTAGTTGT TTATTTAATA AAAAATAAAA GGTTGCTAGA GGATGAAAGG AGTATAATTT 120

ATTTATCTAG AAGAGTAATC GGAATGAGAA AGGGCTGTGG AAATGGATCA TCAACAGGTG 180

TTGAAAAATG AAATTAAACA ATATTTAGTG AGTAAAAACT GTGAGAAGAC GTATTATCAT 240

TGTATGGAAG TTGGAGAGTA TGCCTATCmA CTCGGTGAAA AATATTTAAC TAGTCCaGAA 300

AAAGTAAGTA TTGCmGGaTa TTTACATGAT ATCTCTGCGG TTTATCCAAA TAATCAGrGA 360

ATTAGTGTCG CACAAAAATA TGGGATTGAA TTAAACGAAA CAGAAATGGC CTTTCCAATG 420

ATAATTCATC AAAAAATTTC TAAGTCTATT GCGAAAATGG ATTTTGGTAT TGAAGACAAT 480

GAAATTTTAT CAGCTATTGA ATGCCATACA ACATTGAAGA AAAACTATTC GGACATAGAT 540


TTAGTTTTGT TTGTCGCTGA TAAAATAAAA TGGGATCAAG AAGGGAAGCC ACCATACTTA 600

GATGGTTTAT TACAAGCTTT AAATTGTTCT TTAGAAAATG CCGCATATTT CTATATCGAT 660

TATATTTTGA AaCACGATAT TaAAGTCGTG CATCCTTGGT TATGGGATGC TTACAATCAA 720

CTTAATTTGA TAATTAAATA ACAAATAAAT ATAGCCGTTT TTAGTCGAGT TTTGCTGGTG 780

AGAAAAGGTG AAAAGTAGTA CAGTATTCTT GTCAGAAAGA AGGGATTCCC AATGAAAAAA 840

ATAATCTTGA TAGGTCTTTT ATTGAGTATT TTATCAGGAT GTTCTTCCAA CAAAAGTGGT 900

GATAGCACTG CAAGCAAATC AAGCAAGCCG ACAGCAACAT CAGAAATGAA AGCACAACAG 960

TAGCCAAGCG AGTCAAATGA GCACGGAGTC AAAATCTAGT ACAACTCAAA GTAATTCTTC 1020

TGCTTCTACT AAAACTGAAA ATCAGTTGTG GTCTGTGCAA AAAAAAGCAC AGTTACAACA 1080
ATTTATGGCT GACTGGGGGC AACGGATGAA CCAAACGTAT AAAGAATATA TACCAGGAAA 1140

TTTAATCAAT TTTAATGGCG TCCAAGCGCC TGATGGTCTA GTAGGAAATG TTACTATGCA 1200 

ACCAGCGATT GGTCAGCATC CCATTTATTT AGAATGGTCT GAAAaTGGAC AAGTGAAGGA 1260

TGGATATGCG TTAGTTGCTG TTTACTCTGA TGCAGAAACG CAACCAGAAA TGCAAAAGCA 1320 

TTTGTATTTA TTCACCATCG TTAACAATCA GCCCTTAGTT TTAGTGACGA TGCAAAATCA 1380 

AGGAGATCCT TATGGCTATC TGTATTTTGG AGCAACAGAT AATGCAGAGT TGAGGGCTGG 1440

CTTTGAAAAA ATTGTAGGTG CCCCGTCAAT TACTAAAGAA CAAATCCCAA tATTTCAGTC 1500 
AATCCgTGkc tCaAAgAGGa GCGGnTGnAT TTTA 1534
(2) INFORMATION FOR SEQ ID NO: 334:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 
TTGATTTGGC AAATATTTCA GGAAAAGCGG GTAAAGAGCT TGGTAATTTT GGCATGATGA 60 
TTCAAGACTT AACGAAAACA GTGCCCTATT TAACGATTAC AGAATTGGTG AAAGAAACTT 120

TACAACGTAG TGGTTATCGG GAAGCTTTGA TGGCGCAAAA TAATTTAGAG TCTCAAGCTC 180 

GCTTAGAAAA CTTGGACGAA TTTTTATCTG TGACACAAGA ATTTGATAAA CGTTTTGAAG 240

CGCAAAACAA CGACGATCCA AATGGCGAAG AAACAAAATT GGCGGATTTC TTAACAGATT 300 

TAGCCTTGGT TTCTGATTTA GATAATTTAG AAGAGAGCTC TTCGCAGGTA ACCTTAATGA 360 

CGTTGCATGC GGCCAAAGGG CTGGAATTTC CAGTGGTCTT TTTAATGGGC TTAGAAGAAG 420

GGGTTTTCCC ATTATCACGT GCCATGCTAG AAGAAAGTGA ACTGGAAGAA GAACGGCGCT 480

TGGcTTATGT TGGtATCACA CGAGCGGAAG AAGCCTTGTT CTTAACGAAT GCTTATTCGA 540 

GAACATTATA TGGTCGCACG CAATATAATC GACCAAGTCG CTTTTTAGAA GAAATTGAGG 600 

CGGAgCGTTT AATGAAACAA GGGGCGGCAg CCAAtTTACA GAAAGCTCCA GCGCGCACCT 660 

TTGATCCTAA AGTATTTAAG CCCAGCGCAA GTAAACCAGC TTATACACAA CCAGCTACAA 720 

AATCAGTTAG CAACAAAGTG GCAAGTGGTG GCGAAAAAGT TTCTTGGCAA GCAGGTGATA 780

AGGTCCAACA TAAAGCATGG GGCGTTGGTA CAGTCGTTCG TGTAGGCGGT TCGGCCAAAG 840

ACTTAGAGTT GGACATTGCT TTCCCAGAAA AAGGCATTAA ACGGTTGTTA GCGGCATTTG 900 

CACCAATTGA AAAAATTTAA CATGCAAAAT AGGTCAGCGA TTTAATGAGA AATCATCATT 960 

AGTTAAATTC TTTTTATTTT CTATCTTAGT GTAAATGTGT TAAAAAGACG ATAGGAGCAT 1020 

TACGCTTATG AGGAGGAGTT CTACGATGGA ACAACAACCA TTAACATTAA CGGCGGCGAC 1080

AACTCGTGCG CAAGAATTAA GAAAACAACT CAATCAATAT TCCCATGAAT ATTATGTCAA 1140 
AGATCAACCA TCCGTAGAAG ATTATGTTTA CGATCGTTTG TATAAAGAGT TAGTCGATAT 1200

TGAAACGGAG TTTCCTGATT TAATCACGCC AGATTCGCCG ACACAACGTG TAGGTGGCAA 1260

AGTTTTATCT GGTTTTGAAA AAgcTCCGCA TGACATTCCA ATGTATAGTT TAAATGATGG 1320

GTTTAGCAAG GAAGACATTT TTGCTTTTGA TGAACGTGTT CGTAAGGCGA TTGGTAAACC 1380

CGTTGCTTAC TGTTGTGAGT TAAAAATTGA TGGGTTAGCA ATTTCATTAC GTTATGAAAA 1440

CGGGGTGTTT GTACGCGGAg CAACACGAGG CGATGGAACG GTTGGTGAAA ACATTACAGA 1500

GAACTTACGA ACGGTTCGCT CTGTACCGAT GCGTTTAACG GAGCCCATTT CAGTGGAAGT 1560

TCGTGGCGAG TGCTACATGC CTAAACAATC CTTTGTCGCA TTGAATGAAG AACGTGAAGA 1620

AAATGGCCAA GATATTTTTG CCAATCCTCG GAATGCAGCG GCGGGAAGCT TGCGCCAACT 1680

AGATACAAAA ATTGTGGCTA AACGGAACTT AAATACGTTT TTATACACAG TTGCTGACTT 1740

TGGACCAATG AAAGCGAAGA CGCAGTTTGA GGCATTAGAA GAGCTTTCTG CAATTGGTTT 1800

TAGAACCAAT CCCGAACGGC AATTATGTCA GTCAATTGAT GAAGTTTGGG CCTACATTGA 1860

AGAATACCAT GAAAAGCGGA gcACTTTGCC ATACGAAATC GATGGAATTG TGATTAAAGT 1920

GAACGAATTT GCGTTGCAAG ATGAATTAGG TTTCACTGTC AAAGCACCTC GCTGGGCGAT 1980

TGCCTACAAA TTTCCACCAG AAGAAGCAGA AACAGTAGTG GAAGATATTG AATGGACAAT 2040

TGGGCGCACG GGGGTGGTCA CTCCTACTGC GGTCATGGCT CCTGTACGAG TCGCGGGAAC 2100

CACGGTTAGT CGGGCAAGTT TGCATAATGC AGATTTTATT CAAATGAAAG ATATCCGTTT 2160


AAATGATCAT GTTATTATTT ACAAAGCCGG CGATATTwTC CCaGAAGTAG CGCAAGTGCT 2220

AGTTGAAAAA CGTGCAGCCG ATAGTCAGCC TTATGAAaTG CCGACACaTT GTCCaATTTG 2280

TCaTAGTGAG TTGGTGCATT TAGATGAGGA AGTGGCGTTA CGTTGTATTA ATCCAAAATG 2340

CCCTGCGCAA ATTAAAGAAG GGCTGAATCA CTTTGTCTCA CGAAACGCAA TGAACATTGA 2400

TGGCTTGGGG CCACGAGTGT TAGCACAAAT GTATGACAAA GGATTAGTGA AAGACGTAGC 2460

GGATCTGTAT TTCTTAACAG AAGAACAATT AATGACTTTG GATAAAATCA AAGAAAAATC 2520

AGCAAACAAC ATTTATACTG CCATTCAAGG GAGTAAAGAA AACTCTGTCG AACGTTTGAT 2580

TTTTGGCTTA GGGATTCGCC ATGTGGGGGC GAAAGCGGCG AAAATTTTAG CGGAGCATTT 2640

TGGTGATTTA CCAACCTTAA GTCGTGCAAC CGCGGAAGAA ATTGTGGCTT TGGATTCAAT 2700

TGGCGAAACA ATCGCAGACA GCGTTGTGAC GTATTTTGAA AATGAGGAAG TTCATGAATT 2760

AATGGCTGAA CTGGAAAAAG CCCAAGTGAA CTTGACATAT AAAGGTCTCC GGACAGAACA 2820

ATTAGCAGAA GTGGAGTCAC CTTTCAAAGA CAAAACAGTT GTTTTAACAG GAAAATTAGC 2880

GCAGTACACT CGCGAAGAAG CCAAAGAAAA AATTGAAAAT CTTGGTGGAA AAGTTACAGG 2940

GAGTGTTTCG AAAAAAACTG ACATTGTGGT TGCTGGCGAA GATGCAGGCA GTAAATTAAC 3000

CAAAGCTGAA AGCTTAGGCG TGACCGTTTG GAATGAACAA GAAATGGTCG ATGCGTTAGA 3060

TGCAAGTCAT TTTTAAGAAG AAAAATTAAC GTTGTCAGAT TTTATGTAGT AATTTGATGG 3120
TATGCTTATT TTAGTCATAA AAGTTATATG AACGAGGACT TTCTGAGTTG AGAAAGGAGG 3180

TTGATGCTTA TGCGCAATTC TTCTTATCTT AATGAAAGGA GCAATGTTTT TGTCCGTTGA 3240

AGCAGCGTTG GGACTGATGA TTGGTTTTGC AACACTTGTT GTGACCATTA TCTTCGTTAT 3300

CTTAGCGCTT GTCTTAGACA ACAAAAATAA CCGTTCATAA GCTTTGAGGA GCTGAACGGT 3360

TAAGCGTTTG AAATAAAACC AATAACTCAG TAAGCCGCTA GTTCTTATAA CTAGTAGTGG 3420

CTACTAGAGA GGTATGTTCC TGCATATCTC TCTTTTCATT TTCATAATAA CATGTTTCCG 3480

TCATTTTTGC GAGCGACAAA TTATCAATGA AATTCAAGGT TTCATCTAAA ATATCCGACC 3540

ATTTTATGGT AGAATAGGAA CAGAGTGAAC AGCTGTGCTA AGCTGTTTAA AGAAATACAA 3600

TCCAATTGAT TAATCTGAAA ATGAAAATTT CAGTAAATGA TAGAAAGAAG GGTATCTATG 3660

GCAATTACAG AAGAACAAGT CAAACATGTC GCTAAATTAT CAAAATTATC TTTCTCCGAA 3720

GAAGAATTAG CGGACTTTAC CAATCAACTA GATAAAATTA TTGATATGGT GGAATTACTA 3780

GAAGAAGTAG ATACAACCGG GGTTCCTTTT ACTTCAAATG TGAATGAATC AATCAATGTC 3840

ATGCGTGAAG ACGTAGCGAC ACCAGGAATG GACCGTAAAG AATTAATGCG GAATGTTCCG 3900

GAATCAGAGA ATGGTTACAT TAAAGTGCCT GCAATTATGG ATAACGGGGA GGCAGGAGCC 3960

TAATGGAAAA ATTATACGAC AAATCGTTAA CAGAGTTACA TGATTTATTG GTCTCAAAAG 4020

AAATCACTGC AGTTGATTTA ACCGAAGAAA CATTAAATCG TATTCAAGAT ACAGAAGAAC 4080

AACTAGGTTC TTTTATCACA GTTAGTGAAG AAAAAGCAAT GGCATTAGCC AAAGCGATTG 4140


ACTTAAAAGG CATCACTGAA AGTAATCCAT TAGCTGGGAT TCCTATTGGG ATTAAAGACA 4200

ACATTGTCAC AAAAGATATT TTAACAACAG CTGGcTCAAA AATGTTACAC AATTTTGACC 4260

CAATTTATGA TGCGACCGTG ATGGATaAAG TCTATCAAGC AGATATGATT CCTGTCGGAA 4320

AATTGAAcAT gGATGAAtTT GCCaTGGGCG GAAGTACGGA AACGTCTTAC TTCAAGAAAA 4380

CAAAAAATGC TTGGGATCAA ACCAAAGTTC CTGGCGGTTC TTCTGGTGGT TCAGCTTCtG 4440

CTGTCGGAGC CGGACAAGTC CCAGTTTCTT TAGGGAGCGA TACAGGAGGT AGTATCCGCC 4500

AACCAGCCGC ATTCAACGGT ATTGTGGGcT TGAAAcCAAC TTATGGACGT GTGTCTCGTT 4560

TTGGTTTGAT TGCTTTTGCT TCAAGTTTAG ACCAAATTGG TCCaTTGACA CGAAATGTTA 4620

AAGACAATGC CTTAGCTTTA AATGCAATTA GTGGTTATGA CGAAAAAGAC GGCACTTCTG 4680

CAGGCGTTTC TGTTCCTGAT TTTACGGCAG ATTTAACTGG CGACATCAAA GGCATGAAAA 4740

TTGCTTTACC AAAAGAATAC TTAGGTGAAG GGGTACAACC TGATGTTCGT GAAGCGGTAT 4800

TAAAAGCAGC AGAAAGGTTC AAAGCTTTAG GCGCAACAGT GGAAGAAGTT AGCTTACCAC 4860

ATTCTAAATA TGGGATTGCC GCTTACTATA TTATTGCCTC TTCAGAAGCA AGTTCAAACT 4920

TACAACGCTT TGATGGCATT CGTTATGGTT ACCGTTCCGA AAATGTCCAA AATCTAGAAG 4980

ACGTCTATGT TAATTCGCGT TCCGAAGGCT TTGGTACTGA AGTCAAACGT CGGATTATGT 5040

TAGGAACCTT CTCTTTAAGT GCTGGTTATT ATGATGCGCA CTTTAAAAAG GCTGGTCAAG 5100
TCCGTACACT AATCAAACAA GATTTTGAGA ATGTCTTTGC GGATTATGAC TTAATCATTG 5160

GCCCATCAAC ACCAACTGTT GCTTTTGGTT TGGGTGAAAA TATCAATGAC CCAATCACGA 5220

TGTATATGTA TGATATTTTA ACTGTTCCTG TTAACTTAGC TGGGTTACCA GGCATGTCAA 5280

TTCCAGCAGG CTTCTCAGAA GGATTGCCAG TTGGTTTACA AATTATTGGA AAACATTTTG 5340

ACGAACATAC AATGTATAAA GCAGCCTATG CTTTTGAACA AGCGACAGAT TTCCATACGA 5400

AAAAACCAGT CATTTTAGGG GGGAACGATT AATGAACTTT GAAACTGTCA TTGGATTAGA 5460

AGTCCACGTA GAATTGAAAA CCAACTCTAA AATTTTCTCC TCTGCGCCAG CGCATTTCGG 5520

AGCAGAACCA AACAGCAACA CAAACGTGGT TGACTGGAGT TACCCAGGTG TTTTACCTGT 5580

GATGAATAAA GGTGCTTTAG AATTTGGCAT GAAAGCAGCA CTTGCCTTGA ACTGTGAAAT 5640

TTCAAAAGAA ACACATTTTG ACCGTAAAAA CTATTTTTAC CCAGATAACC CGAAAGCATA 5700

CCAAATTTCT CAATTTGATC AACCAATCGG ACATGATGGT TGGATTGAAA TTGAAGTCGA 5760

AGGCAAAAAG AAAAAAATTC GCATTGAACG TGTGCATTTA GAAGAAGATG CTGGGAAAAA 5820

TATTCACGGC ACAGATGGCT ATTCTTATGT AGACTTGAAT CGTCAAGGCA CGCCTTTAAT 5880

TGAAATCGTT TCTGAAGCAG ACATGCGTTC ACCAGAAGAA GCGTATGCTT ATTTAGAAGC 5940

GTTGCGTTCA ATTATCCAAT TTACAGAAGT TTCTGATGTG AAAATGGAAG AAGGTTCTAT 6000

GCGTTGTGAT GCCAATATTT CTTTACGTCC GTATGGTCAA GAAGAATTTG GTACTAAAGC 6060

GGAATTAAAG AACTTGAACT CTATGAACTT TGTAAAAAAA GGTTTAGCTT ATGAAGAAAA 6120


ACGCCAGGCC AAAGTCCTAT TATCTGGTGG CGAAATTCAA CAAGAAACAC GCCGCTTTGA 6180

TGAAGCAACT AGTACAACGT TGCTTATGCG TGTTAAAGAA GGCTCAAGTG ACTACCGTTA 6240

CTTCCCaGAG CCAGATGTTC CTCGTTTTTC AATTGATGAT GAATGGATTG AAAAAGTTCG 6300

CGCTAGTTTG CCAGAAaTGC CAGcCTCACG TCGTGCCCGT TATATCAGTG AATTAGGGTT 6360

ACCTGAATAC GACGCAATGG TCTTGACTTT AACGAAAGAA ATGTCAGATT TCTTTGAAGC 6420

AACTTTAGCA AACGGTGCAG ATGCGAAACA AGCATCAAAC TGGTTAATGG GCGAAGTTTC 6480

TGCTTACTTA AACAGTGAAA AAGTTGAATT AGCCGATACT AAACTGACAC CAGAAAATCT 6540

AGCGGGTATG ATTACTTTGA TTAACGATGG CACAATCAGC TCGAAAATTG CGAAGAAAGT 6600

GTTCAAAGAG TTAATTGAAA ACGGTGGAGA TGCTAAAGAA GTTGTCGAAG CAAAAGGCCT 6660

TGTTCAATTA TCTGATCCAG CGCAATTATT ACCAATGATT AACGAAGTTT TAGACAACAA 6720

CCAACAATCA ATTGATGACT TTAAAAACGG AAAAGACCGT GCAGTTGGTT TCCTTGTTGG 6780

ACaAATTATG AAAGCAACTC GTGGACAAGC AAACCCAGGT GTGGTTAACA AGTTATTGCA 6840

AGAAGAGCTT TCAAAACGAT AAGAAAGTAG CAGGCAAACC ATGAAAAAAG CTAGAGTGAT 6900

TTATAATCCA ACGTCAGGAA AAGAGTTAAT CAAAAAGAAC TTAGCCGATA TTTTATCTAT 6960

TTTAGAAGAA TGTGGTTATG AAGCCAGTGC ATTTGCGACC ACACCAGAAG AAAATTCAGC 7020

ACGCAATGaA GCACATCGTG CTGCGCGGGC AGGATTTGAT TTACTAGTAG CTGCAGGTGG 7080
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AGATGGGACC 


ATTAATGAAG 


TCGTGAATGG 


GATTGCTCCG 


TTGAAGCGGC 


GCCCTAAAAT 


7140 


GGCTATTATT 


CCTGCTGGAA 


CGACGAATGA 


CTATGCACGG 


GCCTTGAAGA 


TTCCTCGTGA 


7200 


TAATATCGTT 


AAGGCAGCAG 


AAGTGATTAA 


AAAAAATCAA 


ACTGTCAAAA 


TGGATATTGG 


7260 


CCAAGCCGGC 


AAAAATTACT 


TTATCAATAT 


TGCGGCGGGT 


GGTCATTTAA 


CGGAACTGAC 


7320 


TTATGAAGTT 


CCGTCAGAGT 


TGAAAAGTAT 


TTTTGGTTAC 


TTAGCGTACT 


TAGCCmAAGG 


7380 


AGCCGAAATG 


TTGCCGCGAG 


TGAAGCCGAT 


TAAAATGCGC 


ATGACGTATG 


ATGAAGGTGT 


7440 


GTACGAAGGC 


AATGCATCAA 


TGTTTTTCCT 


AGGGCTGACT 


AACTCGGTGG 


GTGGTTTTGA 


7500 


GCAAATCGTA 


CCAGATGCTA 


AATTAGACGA 


TGGTAAGTTT 


TCATTAAksA 


TTGTAAAAAC 


7560 


AGCCAATATT 


TTTGAGATTC 


TTCATTTAGT 


TGCGTTAATG 


TTAAATGGTG 


GAAAGCATGT 


7620 


TGAAGATCAT 


CGACTGATCT 


ATACAAAGAC 


CAGCTATTTA 


CATGCAGAAA 


CGTTAGAAAA 


7680 


GAACAATAAA 


ATGATGATTA 


ATTTAGATGG 


TGAATATGGA 


GGCGATGCTC 


CAATGACCTT 


7740 


TAAAAATATG 


CATCAACACA 


TTGAAATTTT 


TGCAAATGGT 


GATGCACTGC 


CGTCCAATGC 


7800 


AATTATGGGT 


TCTGTCTTAA 


CTGGTAGCGA 


TGAGATTGTC 


GTAGAATCAG 


AAGACGAAGA 


7860 


GGAAGAAGCA 


TATAmCGAAG 


CCAGCAAAGA 


ATTTGTCAAA 


GAAGTTGAAC 


GACTAACAGA 


7920 


CGAAGATATT 


GATGGCGATG 


GAAAGATTGC 


GGAAAAAGAA 


AAGCACTAGG 


TAGTCCGTGA 


7980 


ATAGCTAATG 


AAAGATTTAG 


TTGATAAGAT 


CTTTCATTGG 


GTATTTTTTG 


ACTAAGGGAA 


8040 



AGGGTCTCAT 


TCTTTTATTT 


ACCCAAAAAG 


CTCAAACGGA 


ATTGCTAAAT 


CGCCATGrAA 


8100 


ATTCGTTGTT 


GCTTTTCTTA 


GGTATTTTCA 


TTAGAATGAG 


TGAAGTGTAT 


TAAAAAGAAG 


8160 


GAGCGGACTT 


ATGAAAAATT 


ATCCCAGTAA 


AGAAAAATGA 


CGTCATTGAA GTAGAAATCA 


8220 


TTGACTTGAC 


GCATGAAGGT 


TTAGGCGTGG 


CAAAAGTGGA 


CCATTATCCA 


TTATTTATTG 


8280 


AAAATGCCTT 


ACCAGGTGAA 


AAACTTGAAA 


TCAAAGTCTT 


AAAAACAGGC 


AAAAGTTTTG 


8340 


GTTATGGGAA 


AGTATTGACT 


GTCCTAAAAA 


GCAGTGAACA 


GCGTGTGCCC 


GTGAAAGATG 


8400 


AAAACTTTAC 


AAAAGTCGGG 


ATTAGTCCGT 


TGCAACATTt 


AGCGTACGGs 


GcCCAmCTAT 


8460 


CATTTAAAAC 


mCAmCAAGTA 


GAAAATGTTA 


TGCAACGTGT 


CGCTAAATTG 


CAAGAAGTCC 


8520 


CTGTTTTACC 


AACAATTGGG 


ATGAACGATC 


CGTGGCATTA 


CCGGAACAAA 


GCACAAATTC 


8580 


CCGTGCGTAA 


AATTGACAAC 


CAACTACAAA 


CAGGTTTTTT 


CCGGAAAAAT 


AGCCATGATT 


8640 


TAATTCCGAT 


GGAACATTTC 


TATATTCAAG 


ACCCAGAAAT 


TGATGCAGCA 


ATTGTTAAAA 


8700 


TCCGTGACAT 


TATGCGTAAA 


TATAGCGTAA 


AACCTTATAA 


GGAATCTGAT 


AATACAGGGA 


8760 


ACTTACGCCA 


TATTGTCGTT 


CGTCGTGGCT 


ACCATACTGG 


GGAGATGATG 


GTTGTTCTAA 


8820 


TTACGAGAAC 


ACCGAAATTA 


TTCCCAATTA 


GTAAAATTGT 


GCCAGATATT 


TTGGAAGCTA 


8880 


TTCCGGAAGT 


AGTCAGCATT 


GTACAAAATG 


TAAATCCAAA 


ACGAACCAAT 


GTCATTTTTG 


8940 


GCGACGAAAC 


CATCTTATTA 


CATGGGTCTG 


AAAAAATTAC 


AGATACTATT 


TTTGATTTGA 


9000 


AATTTGAAAT 


TTCCGCGCGT 


TCATTTTATC 


aAGTAAATCC 


ACaACaAACA 


GAAGTGATGT 


9060

ATCmAAAAGT TAAAGAGTAT GCGGCTTTAA GAGGCAATGA AATTGTGGgT GGaTGCCTAT 9120
TGCgGGaTTG GaACGAnTGG cTTAACTTTA GCCCAAGATG CCAAGCCAGT TTACGGTATT 9180 
GAAGTAAnTG GAGGAAGCGG TCCAAGATGC CG 9212 
(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335:



CCCTGGGAAA 


1 X bbnbbAl 1 


aaaa^ppaaa 


Tr^aaPaaTTP 
1 n/i/ib/iAl 1 b 


TaPaPPaTTa " 
l/ib/ibb/il 1 1\ 


PTPTPTZiZiZiZV 


60 


bbAbAlAl 1 1 


a ppa a paTPP 


paaTP.aTP.aT 
L-riM 1 brt 1 o/i 1 


TCCGATTAAG 


GCACTATAAG 


pp.p.pziTP.app 


120 


AAb lbb IbAl 


Abb 1 b 1 1 itl 


TPPT Zi a aP a P 
i bb 1 A/iHbHb 


ACCTACTAAT 


GACATCGCGC 


paa app.Tap.p 


180 


I bb I bbAb i 1 


A/iftAAl bbbb 


bbbbbrib i bl 


TCCACCAGTA 


TACATTAATA 


ZiPPTPZiPPPa 
ribb 1 LriL/OOn 




1 AAAb X bAbA 




/il bb/il/ibbb 


AACAAATCCT 


GCAATCGCAA 


TPPTTPPaTT 


300 


I I It, I blbbl 


111 bbbbb Iri 




TAAGCCaATT 


TCAACAAGCC 


aaaTpaTPpa 


360 


n a a a rpT a a p 


PTTPaTTPPa 
b 1 1 bril I bbn 


zvt Zi zi a P.T zi p.p 


TAGAACAGGT 


AAAAAGACAG 


TTPTPATTPP 


420 


ATAAGCCATG 


ATTGCGCTAA 


TACCAATTCC 


TAACGCTAAA 


AATCCGTAAA 


TTTTGGCATA 


480 


AAATTTATTC 


AAACCTGTGC 


CAGTTTGGAT 


ATACTCTTGA 


TTATTCATAA 


GTnAAAGTCC 


540 


CTTCTTTCTC 


TGTTAGTTCA 


TCAATCGTCA 


CTTGGGGTGG 


TTGAACCATC 


ACCACTGCAT 


600 


CTAACACGCG 


ATTTTCTTGC 


TCATTAACAG 


CCTCAACTGA 


AATGGTAATC 


ACATCTTTCA 


660 


TTTTGTCGAC 


TTTGATTACT 


TCTAATTGGA 


AAGTCAATGT 


TTCGTAATGA 


AAAACGGGTT 


720 


CCACAAAGTT 


CaCAGAGAAA 


TTCaCAACAT 


GTGaACCTGG 


TCCTGGTAAA 


TGTTTAGAAA 


780 


TCGCACTAGT 


GATAATTCCC 


ATCAGCATAA 


TTGATGGTAC 


GATTGGTTTT 


TCATATTCTG 


840 


TTTTTTGGGC 


ATAATCATGT 


TGGATATATA 


AAGGATTCgC ATCATTGGTC 


AAGCCCAAAT 


900 


AAAGTAATAA 


GTCCTTGTCT 


TCAATTGACT 


CTGTCAAGGA 


CAACGAATCT 


CCCTCTTCGA 


960 


TATCTTCGAT 


GGTTTTGCCT 


AATTTTCTCG 


GTTTTCCGAT 


ATTCAAATTA 


GTACACCTCC 


1020 


GTTATTGATA 


AGATAATTGT 


ACCAAAATGT 


ACTTAAAAGG 


CAAGCTAAAA 


AGTAGTCTTT 


1080 


AAAGGATGTT 


TAAACAAAAC 


CAACTTATCT 


AcCATGCtAG 


CATTCTAAAA 


AAAACAACGG 


1140 


aACAGGaGCA 


GCTTCTTATT 


TtTTGAAGCC 


AGATACCCCT 


GGnCCCGATT 


TTGGTTATGT 


1200 


CCCCTATAAA 


ACGGTCnAGT 


nGGGTACCTA 








1230 



(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:

GTCACAAAAG AnGTTTCAGT CAATACTGAA ACTGGTTTAA CACAACAAGA GGCCCAACAA 60

CGGCTGAAAG AAAATGGCCG CAATCAATTT GAAGAGGCTA AGAAGGATTC TGTGTTAAAG 120

AAATTTATTC ACAGTTTATC TGATTTTACA ACGATTATTT TATTAGTTGC TGCGGCAATT 180

TCGTTTTATA CAGCAATCGT AACAGAACAT GGCGAATATT TTGAAGGAAT CTTAATTATT 240

GCAATTGTGA TTATCAATGC TGTTTTAGCC ATTGTTCAAG AAGGAAATGC TGAAAAATCT 300

CTGGCGGCTT TACAAGACAT GAACAAGCAA AGTAGTGCAG TCTTACGAGA TGGCAAGGTA 360

ATCGAAGTCG ATGCTGAAGA ACTAGTAGTA GGAGATGTTC TAGTTCTTGA AGCTGGTTCT 420

ATGATTACAG CCGATGCACG ACTTATTCAA GCTTCTCAAA TGCGCGTAGA AGAGTCmGCA 480

CTAACaGGTG AAAGTGAACC TGTTGaAAAA GATCCTACAT ACGTGGGGcA TGACGATGAT 540

GGCTTAGGaG ATCAAATAAA TATGATTTTC AAAgGCTGTA CCGTTGTGAA TGGCCGCGGA 600

CGAGCAGTAG TAACTGCAAC AGGAATGnAT ACGGAATGGG TAaAtTGCTG GGTTACTGAA 660

TAACAGCGAT CAGCAAAAAc aCCATTACAa AAACGCTTGa ATCmATTrGG CmAACGTATT 720

aGTTTgTTAG CCtTAGGGGC TGCAGCAATT GTTTTCATTA TTGGTGAACT GCAAGGAGAA 780

CCGCTTCTTG AAATGTTTAT GACAGCCGTt TCCTTAGCGG TGGCAGCTGT ACCAGAAACA 840

CTAACCGTGA TTGTAACGTT AACCTTAGCA TATGGTGTTC AAAAAATGGC nAAGAAACAT 900

GCGATTATTT CGTTCGATTG 920

(2) INFORMATION FOR SEQ ID NO: 337:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2381 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

AATACTATAA AACCTTTTTA TACATTTGTA CACAAAAAAC CTCACTTTCT GAGGnAGATT 60 

TTATTATTTA GCATTGCCAA AAAGTCAAAA GCAAGAGCCC CCAAAACTTC TTTGTTAGGG 120 

TTGcCTTGAA AATAAAGAAA AACAAAAAGC CGAAAACGCT AAAGACGTTT TCAGCTTTTT 180 

GTTTTTTaAT GGrATAAATG GcTCTTTTGT GCTAAAGCCG CAAATTTTTT CGATCTGTGT 240

TCCAGATGTG GATAAAGAAT TGCACCGCCA ATACTAAAGA CGACTGAAAT CCCCACCATA 300

ATCCAAATGG CAAACCAAGC ATTTGGCCAG TAAATGCCTC CTGCTGATTC CCTTAGCAAG 360

TTCACGGCAT GCGTAAACGG CAAGAACGGA TTAATCATTT GGAAGAATTT ACCTGAGACT 420

TGAATTGGGT AGTTCCCACC CCCACCAGAG ATGGACAGTA CCAAGATTAT TATCGCAATT 480

CCTTTCCCTA CATTGCCAAA GAGAGCAACA AGCACATAAA CCATTATCAT AAACGTAATG 540

GCAATCAATA CCGCAAACCA GACGCTATAC GCTGGATTAC GAACATCTAC GCCTAACGCA 600


AAATAATTAC


CTAAGGTTAC 


AATCAGTGCT 


TGACCAATCC 


CCATGACAAT 


GAAGGTAAAC 


660


ATCCGTGCTG 


AAAATTGTTC 


CCGTTTAGAG 


AAGCGTTTTT 


TATCTTTTCC 


TTCAAGATAA 


720 


ACATCCGTTG 


TCGCAACACT 


TGAGAATAAT 


ACAGCACCAA 


CCCATAAACA 


CAAAGCTGTA 


780 


tap.a ATfjfinG 


TACTTGCCGA 


ACCATTATTA 


GCAATCGGAT 


AAACAGCATG 


TTCTTTGACT 


840


TPTi TiPTP.^TT 
I CMAC 1 wu 1 1 


GTGTAAAGAA 


GTCACTTTCT 


TTATTGGCAT 


CTAATTTTAG 


TAATTTAAGA 


900 


Al 1 1 L-oCL- 1M 


AATCAACTTC 


TTTTTCACCT 


TTACGGATTG 


CATTCGCTGC 


TTTGTGTAAT 


960 


r^i r~* rp rp rp rn rp T\ 


TATTTGGCCA ATCATTAATA ATTAGCTCGT 


TGGCTTTGTC 


CAAGGCCGCT 


1020 


TCAALA1C1 o 


GCATTTTCTC 


ATTGACTGTT 


TTCAAGGTAT 


TCGTTAAATC 


TTTTCGGATA 


1080


L bun 1 Ao 1 


CATTTTGCAT 


AAAGGCTGCC 


GCTTTGCTGA 


CTTTATCTTG 


AATGACTGGT 


1140 


AAA i lo 1111 


TATACAAATC 


AGCACCACGA 


TTAATGCCAT 


TGACAATGGT 


TTCCATATTG 


1200 


PPftfTPZiZiPZi 
CCA 1 1 LnnLn 


TCGTATTCGC 


ATCATGAATT 


TCTTGTTTAA 


TTGCCGGCAT 


TTCGGCTTGA 


1260 




ATAAAGAAAT 


CGCATTAGTG 


ACCGTTTGAC 


TTGTTGAGCT 


CAATAAACCT 


1320 


TPZ1 Zi APTPA A 


TTTGTTGCGC 


TTTGTTTAGT 


TGACCTTGAG 


CGkTTTGGgA 


TGGTAGCAAT 


1380 


1 AMI 1 1 old 


AAAATCGTTT 


TTACTGTGTT 


TTCAATTGCA 


TCAGGGGTAA 


TACTATTGAC 


1440 




CTTGTTGAGC 


TcTGCTTGAA 


TAGCATTTAA 


AtCTGCATCA 


GTTACCGCAT 


1500 


1 LnLn 1 CMM 1 


GTTATCAACA 


TCAGTTTTTA 


ATCTAACTAC 


AATACCATCA 


ATCGTATTCA 


1560 


2\ T P T 7\ HP TT Zi T 
Al C inl 1 1 Ml 


TGTGTCTTGT 


AAATCAGTGT 


TACCTGTGGC 


TTCTTGTATT 


TGCGTAAACA 


1620 


PTTP.Z1 ATP Af^ 


CTGTKjbAi A 


1111 tni (oil 


P.TTTTPPT Af^ 


GCTTTCAGAC 


AGTTGTTGCT 


1680 


T2i ATT^PTTP 


TCTTTCTTCA 


ACTGGTAAAC 


GATGATTTTT 


TAAATCTTCT 


AGTGCTGTAT 


1740 


ttapappa^p 


GCTAATTGTT 


CCAATTGCTT 


TTAAAATCGT 


CCCCACACTT 


TGAGTAATTC 


1800 




CTTTTCCAAT 


TGAGTGGCGC 


CATCTAATGT 


GACATTCCCT 


AAATCATTGG 


1860 




TAGTTTACGA 


ATATCTGGTA 


ACGCCGTTTG 


AACTTGTTGA 


ATGATTTCTA 


1920 


ATPPTTflTTT 


TGCTTCTTGG 


ATTCCTTCGC 


TCATTGTTTG 


TTCCACAGAG 


GCAAAGTCTT 


1980 


CATCAATCAT 


AGCAATTTGT 


CGGCCAGCGT 


TTTGAATTTC 


TGGAATTTTT 


TCTTGTAACG 


2040 


TCAAAATAAC 


ACTAGCTTGT 


TCTTTGATGC 


TTGGCATTTT 


AL-L.A1 1 InuL 


TP A AP A ATPT 


2100 


TCTCTCCTAA 


AGCATCAACT 


TCAGGTAAAT 


ATTTCATCGC 


ATCGTTGGCT 


TTCGCTAATT 


2160 


TTTCTTTTAG 


CTCAGGCATC 


TTACCATGTA 


AGTCTGTGAC 


TTGTTTTGCG 


TACGTATCAA 


2220 


TGGTTCCAAT 


GTTGGCATCT 


GTATCTAAAA


TCATGctCTT 


AACTTTTtGA 


ATGCTAACCA 


2280 


nGTTTTTATC 


AATAtCATAG 


cCAATAayCA 


TkGAAgGTTT


gaTaACGtAc 


TACTTGcCGG 


2340 


tTTwAAATGG 


AAATCCTTCC 


CGAAAATTTT 


GGTGGAACCG 


G




2381 



(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13154 base pairs 

(B) TYPE: nucleic acid 




(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 



GCCAGATTAT 


CGTTCCTTAA


AAGGTGTCTA 


TCAAGGAATC 


CAGCAACCGG 


aATACTTATT 


60 


GAGCCGGACT 


TGCTTAAAAA 


CAGAGCCAGA 


AAAGTTTTAT 


CAATTCGTTA 


AAACGCTCTA 


120 


TCATCCAGAC 


GCACAACCTA 


ATATCATTCA 


CCAGAAAATG 


GCCCAATTAG 


AGCAAATGAA 


180 


ACGGGGAAAA 


ATTGTTTCAC 


AAAATATTGA 


TGGGCTTCAT 


CGAAAAGCAG 


GTAGCCAAGA 


240 


AGTGGTGGAC 


TTTCATGGTA 


ATTTATATGA 


ATGTTATTGC 


CAAACTTGTG 


GTGCAACGGt 


300 


TCCTTGGCAA 


GACTATTTGC 


TTAGTGATCG 


GCACGCTGAT 


TGTCATGGAC 


AAATCCGTCC 


360 


AGCTATTACT 


TTGTATGAAG 


AAGGACTATC 


CGAAGAGGCA 


ATTGAAAAAG 


CCATTCAGGC 


420 


CGTTGCTTCT 


GCCGACTTAA 


TCGTGATTGT 


CGGCACGAGT 


TTTcAAGTGC 


ATCCATTTTG 


480 


CGATTTGATT 


CATTATAAAC 


AACCAACGGC 


AACTATTTTA 


GCCATTAATC 


AAACGCCGCT 


540 


ATTTTTACAA 


CAACCTTATT 


ATTTTTTAGA 


AGCGAAAGCC 


GAAACGATAT 


TTGCAGAATT 


600 


AACGATAAAG 


GAGTAACGGA 


ATGGAAGAAT 


TTAAATCAGA 


AAAAGAAAAA 


ATGATCGCAG 


660 


GTGCACTATA 


TTTTGCCTCA 


GATCCTGAAT 


TAGTGGCTGA 


TCGCAAAAAA 


GCCCGTGAAC 


720 


AAATGGCGTT 


AATTAACCAA 


CAACCAGACA 


CTTATATTCG 


TCGTCAATTA 


ATAGAAGAAA 


780 


CGTTTGGAAA 


AGTGGGTGTC 


GGGACGTACA 


TCGAACCGAT 


GATTCAATTT 


GATTATGGCT 


840 


ACAACATTTC 


TGTGGGAAAA 


AATTTCTATG 


CCAATTTTAA 


TAACGTTTTT 


TTAGATGTTT 


900 


GTCCAATCGA 


AATTGGCGAT 


AACTGTATGT 


TTGGACCCAA 


TGTGCAACTT 


TATACAGCCG 


960 


AGCATCCATT 


ACAAGCGGCC 


AAGCGTAATA 


GCGGAATGGA 


ATCAGGGAAA 


CGAATTATCA 


1020 


TTGGCAACAA 


TGTTTGGATT 


GGCGGTGGAG 


CGATTGTCCT 


TCCTGGCGTA 


ACATTAGGCG 


1080 


ATAATGTAGT 


GGTGGCAGCA 


GGTGCTGTTG 


TGACCAAATC 


GTTTCCAGAA 


AACTGTGTGA 


1140 


TCGCAGGTAA 


TCCAGCACGG 


ATAATTAAAG 


AATTGACTGA 


AGATGACGCC 


CCAACAACTT 


1200 


CTTTAGAACA 


ACAACGAGCA 


AAGATTAACC 


AAATCGATAA 


GGAACTGGTC 


CGTTTATTAG 


1260 


AACAGCGGAT 


GGACGTCGTC 


GCAGAGATTG 


CTGCAGTGAA 


AAAGAAAGCA 


GGTCATGCGG 


1320 


TTTTTGATTC 


TGAAAGAGAA 


CAACAAGTAT 


TAGAAACTAT 


TCTGAATCAT 


GTTGAAAATG 


1380 


CCGAGTATGA 


AGAAACATTA 


TCTGAGACTT 


TCCAGGGAAT 


TATGGACGCC 


TCTAAACGTT 


1440 


TCCAAGAAAA 


ACATTTAGGA 


GAGTGAGGCG 


AAGGTCGTGG 


ATGGAGAAAT 


CAGAAGAAGA 


1500 


GAGATTGTTA 


AAAGATTAAG 


AGAGACCGAA 


AAGCCAATTA 


GTGCCACTCG 


TTTTGCAAAA 


1560 


GCTTTTGATG 


TCAGCAGACA 


AATTATTGTA 


GGAGATGTCG 


CTCTATTACG 


GGCGACAGGG 


1620 


GTCGAAATTG 


TCGCAACAGC 


TCGAGGGTAT 


ATGTTGGAAC 


AACCTTTAGA 


AGGAATTGAA


1680 


CGGAAAATTG 


CTTGTCAACA 


TACACCTGAA 


CAAACGAGAG


AAGAATTGTC


AGTCATTGTT


1740 


GCCAAAGGTG 


GAGAAGTGGT 


AGATGTGAGT 


ATTGCACATC


CGCTTTATGG


GGAATTAATC


1800


GGCAGTTTGC


GCATTCAATC 


AGAGAAAGAT 


ATTGATAAAT 


TTATGGACAA 


GTATCAAAAA 


1860 


GAGCATGCTA 


CATTACTATC 


AGTTTTAACA 


GGAGGCGTCC 


ATCTGCATAC 


TATTCGCTGT 


1920 


GCAGACGAAG 


AAACTTTCCA 


AGAAATCAAA 


GAGGCACTTC 


GAAAAAAAAG 


TATTTTGTTT 


1980 


GAAGGATAAC 


GAGAAAAAGA 


GAGGACGTTG 


ATGTTTGTTC 


AAGCCCTTTC 


TTTTTTTTCT 


2040 


TAGAAAATTA 


CCAAAAAAAG 


ATAGAAGAAT 


TTGTGAATTT 


TCTTCTATTT 


TTTTAACTTT 


2100 


TTGCTAGAAA 


TTTTTTATGA AACAGTATAC 


TATTAAAAAA 


GGAGCCATTT 


AAACAATAAC 


2160 


TATGTCAATT 


ATATATGTTG 


GTACATAAAA 


aTAAATGGAG 


AAAAATAAAA 


AAGGAGAATG 


2220 


AGTATGAAAA 


AACAATTTGC 


CCTGTTCTCT 


ACCTTATTAC 


TTTTAGGTTC 


ATCAATGCCA 


2280 


GTAGGTGCAT 


TTGCAGAAGG 


TAATAACTTG 


GTAGCTGAAA 


ATAGTTCTGT 


CGCAGAAACA 


2340 


ACCGCAGAAG 


CAACAACTGC 


TGAAGCCACT 


GAAACGACAG 


CGACTAGTGA 


AACAACAGAA 


2400 


GCGACAGAAG 


AATCAACAAC 


TGAAACAGAA 


AGTTCAACCG 


AATCAAGTGA 


ATCTGCAACC 


2460 


ACGGAGTCCA 


CGGAAACGTC 


AGGAACCGAA 


ACGACAGATT 


CCACAACGGA 


TTCTACATCC 


2520 


ACAAGTACGA 


CAGAATCGAC 


TACTGACTCG 


ACCTCGACGA 


GTACAACAGA 


GTCAACCACA 


2580 


GATTCTACGT 


CAACAAGTAC 


AACTGAATCA 


AGTACCACGC 


CAACGACCAC 


ACCAAGTTCT 


2640 


TCTAAAGAAC 


AGCCAAAACC 


TGGCACGAGT 


ACTTCTGAaT 


CAAAACAGCC 


GGCGAAACCA 


2700 


GTGACACCAA 


CAGCGCCAGC 


TGAAAAGCCA 


GTAGAACAAC 


CAGCAGCGTC 


AAGACCGCAG 


2760


CCTGAAATAG 


TCCCACCAGT 


TACAAATGAA 


ACGGTTGGGT 


TAGTGGAAGA 


CGATGAGACG 


2820 


TTTACGGTCA 


GCAAAACGAA 


GAAGACAGAA 


GAGTTTATTC 


AAGAAATCGG 


TGAATCTGCT 


2880 


CGGAAAGTGG 


CCAAGGACAA 


AAACTTATAT 


GCTAGTGTGA 


TGATTGCTCA 


AGCAATTTTA 


2940 


GAAAGTGGCA 


GCGGCAATAG 


TAAATTAAGC 


CAGAAACCAA 


ACTACAATTT 


GTTTGGTATT 


3000 


AAAGGCGATT 


ACAAAGGTCA 


ATCTGTTTCT 


TTTATTACCT 


ATGAAGACAA 


TGGTTTCGGT 


3060 


AACCTATACA 


CAGTAGAAGC 


AAAATTCCGT 


CAATATCCAA 


CGTATAAAGA 


ATCAATGGAA 


3120 


GATTATGCGA 


AATTATTGAA 


AAATGGTTTA 


GATTCCAATA 


AAGATTTCTA 


TCATGGTGTC 


3180 


TGGAAAACGG 


AAGCCAAATC 


ATACAAAGAA 


GCGACACGTT 


TCCTAACAGG 


AAAATACGCA 


3240 


ACGGACAAAG 


ATTACCATAA 


GAAATTAAAT 


GCGTTAATTA 


AAACATACGA 


TTTAACTTAT 


3300 


TATGATAAAG 


AAAAAGCAAC 


CGTTGAACCA 


ATGGAATCAA 


ATTTCCCAGC 


CTATAATGGC 


3360 


AAAAACTATG 


ACACCTTTAA 


TAGTTATGCT 


TGGGGCAACT 


GTACACAATA 


TGTCTACAAC 


3420 


CGTATTACGC 


AATTAGGTAA 


ACGTGTGGAC 


TTAACAATGG 


GGAATGGTCA 


AGATTGGGGC 


3480 


GAAACAGGTC 


GTGCACGCGG 


GTACAAAGTA 


AGTCGTACAC 


CAAAAGCAGg 


TGCAGCGGTT 


3540 


TCCTTCCCAG 


CTGGTGTTTT 


AGGCGCAGAC 


AATACTTATG 


GTCATGTCGC 


ATTTGTGGaA 


3600 


AAAGTCTTTA 


AAGATGGTAG 


CATCTTAATT 


TCTGaAATGA 


ACGTGAAGGG 


ATTAAACGTT 


3660 


GTTTCGACAC 


GTACAATCTC 


AGCCGATGAA 


ACACATTTGA 


TGAACTATAT 


CGTTCCAAAA 


3720 


GATAAATAAA 


AAAATCAGCT 


CTTTGTCAAT 


AAGAACGAAT 


GAGTTGGAAA 


AwTCAAAGCT 


3780


AGCACTGAAG


AAGCAATTTT 


AATTCTTCAG 


TGCTAGCTTT 


TTTTTGTGGC 


AAAAGTTCTC 


3840 


TCCAATAAAG 


TTTAATACAT 


TCAGATATTG 


CCAGGAGGCT 


TCAGGTCGGG 


ATGAGTTAGC 


3900 


AACAGTCGCT 


GAGAGCTATA 


TTTTTTTTAG 


GTAGAATCAT 


TTACTTTTAG 


GTGGAAGATG 


3960 


TCTTATAATT 


GTAAGCGGAT 


ACAAAAATAA 


AAAAAGAGGT 


GGGTAAATGA 


GTAAAAAAGA 


4020 


AATAAATCAA 


GTAGTTGCCA 


GTAGCTATCA 


ATTGTATATT 


AATGGAGAGT 


GGACAACAGG 


4080 


TAGTGGTAAC 


AAAATGATTG 


CTAGTTACAA 


TCCTAGTAAT 


GGCGAAAAAT 


TAGCAGAATT 


4140 


TGTAGATGCC 


ACAAATGCAG 


ATGTGGATCG 


AGCTGTAGAA 


GCAGCCCAAG 


AAGCGTTTCA 


4200 


GACATGGAAA 


GATGTTGACG 


TTGTAACAAG 


AAGCAATCTT 


TTGTTGAAAA 


TTGCTGATTT 


4260 


GATTGAAGAA 


AATCAAGAAC 


ATTTGGCTAT 


GGTGGAGACT 


TTAGATAATG 


GAAAACCGCT 


4320 


TCGGGAAACG 


CAATCGATTG 


ATGTCCCTGC 


CAGTGcAGAC 


CATTTTCGGT 


ATTTTGCTAG 


4380 


TGTGATTCGT 


GGAGAAGAAG 


GATCTGTCAA 


AGAATTTGAT 


AAGGATACGT 


TATCCATTGT 


4440 


TGTGAAAGAA 


CCCATCGGTG 


TTGTGGGTCA 


AATTATTCCG 


TGGAATTTCC 


CCTTATTAAT 


4500 


GGGTGCTTGG 


AAATTAGCAC 


CAGCGTTGGC 


AGCAGGTAAC 


ACGGTTGTGA 


TTCATCCGTC 


4560 


CTCGAGCACA 


TCATTAAGTC 


TGTTGGAATT 


GTTTAAAATT 


TTTGATCAAG 


TCTTGCCGAA 


4620 


AGGAGTAGTG 


AATTTAATCA 


CCGGTCGTGG 


TTCTGATTCA 


GGAAATTATA 


TGTTGGCACA 


4680 


TCCAGGGTTT 


GATAAGCTAG 


CTTTTACAGG 


CTCAACAGAG 


GTGGGGTACA 


CTGTCGCTAA 


4740 


AGCGGCGGCC 


GACCGCCTAA 


TTCCAGCCAC 


TTTAGAACTT 


GGCGGAAAAT 


CAGCCAACAT 


4800 


TATTTTTGAA 


GATGCCAATT 


GGGAACGTGC 


ATTAGAAGGC 


GTGCAGTTAG 


GGATTTTATT 


4860 


CAATCAAGGG 


CAAGTTTGTT 


GTGCTGGGTC 


TCGTGTGTTC 


GTTCAGTCAG 


GTATTTATGA 


4920 


TCAATTTGTA 


GAAGCTTTAA 


AGGAAAAGTT 


TGAACAAGTG 


AACGTTGGTT 


TCCCGTGGGA 


4980 


AAAAGATGTT 


GAAATGGGCG 


CTCAGATCAA TGAGCATCAA TgGAAGAAAT 


TTTAAAATAT 


5040 


GTCGAAATTG 


GTGTGAAGGA 


AGGAGcTACG 


CTGATTACTG 


GTGGGCAACG 


TTTAACAGAA 


5100 


AATGGGCTAG 


ACAAGGGGGC 


GTTTTTAGCA 


CCTACGTTAT 


TAGCGAATGG 


TACGAATGCA 


5160 


ATGTGTGTGG 


CCCAAGAAGA 


AATCTTTGGT 


CCTGTTGCAA 


CAGTGATTAA 


ATTTGAAACG 


5220 


GAAGAAGAAG 


TCATTCGTTT 


AGCTAATGAT 


TCTGAATATG 


GTCTAGGTGG 


TGCCGTCTTT 


5280 


TCTCAAGATA 


TCAATGTGGC 


ATTACGGGTT 


GCTCGTGGTG 


TACGGACAGG 


TCGAATGTGG 


5340 


GTCAACACAT 


ACAATCAATT 


GCCTGCGGGC 


GCGCCATTTG 


GCGGTTATAA 


AAAATCAGGA 


5400 


ATTGGTCGAG 


AAACGCATAA 


ATCAATGCTA 


GATGCTTATA 


CGCAAATGAA 


AAATATTTAC 


5460 


ATTGTGACAA 


AAGAAGAAGC 


AGATGGACTG 


TATTAAGTAG 


TGTAGTGAAT 


TCTAGATAAT 


5520 


AAACTTATTT 


TTGGTTATTG 


TTATTTTTCG 


AGCTAAACAA 


AACAATAATT 


TTTTTAAAAA 


5580 


GTTGAGTGAT 


GAGTAAAGTA 


AAACAGATTA 


AAAGTAAAGC 


CTGGTACAAA 


AATCACTTTG 


5640 


GATTTTTGTA 


CCAGGCTTTA 


AAGCGGAGTA 


ACAATAGTGT 


TCACTCTTGG 


TTGTTTTTAA 


5700 


TTTTTGACTA 


TTGTAGAAGG 


AGTTATTTTG 


CTTCTTCTGC 


TTCAAATTTA 


GCAAAAGCTG 


5760

TTTCAATAAC TTCTTTTAAT GCAGAAGCAG AAGCTTCCAT TTTTTCTTGT TCAGCaTCTG 5820

TTAATGGAAT TTCAATGACT TGTTTAACTC CtTGGCGGTT GATGATCGCT GGTGcACCGA 5880 

TATAAATATC GTTTTGACCA TATTCACCTT CTAAATAAAC AGATAATGGT AACACAGAGT 5940 

TTTCATCGTT TAGGATAGCT TTAGTGATAC GCGCTAGTGC AACCGCGATT CCATAGAAAG 6000 

TAGCTCCTTT TTTCTCGATG ATTGTGTAAG CAGCGTCGCG TACGTTGAAG AATAAATTAA 6060 

CCATTGCTTC TTCATCGACG TCAGGATTAT TTTTCACCCA TTCGTAAATT TGTAAGCCAG 6120 

CGACATTCGC ATGTGACCAA ACTGGGAATT CTGTATCTCC GTGTTCCCCT AAGATGTAGG 6180 

CATGGACATT TCGTGCATCA ACGTCAACTA ATTCGGCAAT TGCTTGACGG AAACGAGCAG 6240

AATCTAGTGA AGTTCCTGAA CCGATTACTC GTTCTTTCGG GAAGCCAGAG AATTTCCAAG 6300 

TTGAATAAGT CAAAATATCA ACTGGGTTTG CGGCAACTAA GAAGATACCG TTGAAACCAG 6360 

AATCAACAAT TGTTGTAACG ATTTCTTTAT TAATTTTCAA GTTTTTATGA ACTAAGTCTA 6420

AACGAGTTTC GCCTGGTTTT TGAGGCGCAC CAGCTGTTAA GACAACTAAG TCTGCATCAT 6480

GGCAATCGTC ATAAGTAGCA GCATAGATTT TTTTAGGAGA AGTAAATGCT AATGCGTGAG 6540

ATAAGTCCAA CGCGTCTCCT TCAGTTTTTG GTACATTAAT ATCAATAATC CCAACTTCTT 6600 

GAGCAATATT CTGAGTTACT AAAGCAAAGG CATAGCTAGA ACcTACGGCA CCGTCCCCGA 6660 

CTAAAATTAC TTTTTGGTGA TCTTTATTCC CTGCGGCTGC AGTCATGTGT ACCATTCCTT 6720

CCTCTACATT CTTTTTCGTG AGCAACGAGT TTTTATTTCC TGTTGCTAAA GTGATAGTAA 6780 

CATGGTTTGT AAAAAATGTC ACGAACTTTG ATTGTAAAAA TGATACAGTT TTCAGACCTG 6840

TTTCAAAATG GTAGCTCTTA CATGTGCGTA ATTTCACTAT CTTACGCATA TTTAAATTTC 6900

TAAAAATTTT TTTACATCTT TCTTAATCTT CAAAGACTGC GAGTTTTTTT ATTTCTATGG 6960 

TATAATGAAT AAAACTGGGT AGCGCATACG CTTCCCAGCC ATTTTGTGTT GGCAAGCTTA 7020 

TAATCCAAGG AGAATGAAAG ATGAAAGTAA TCGTAGGACT GGGAAATCCC GGTAGTAAAT 7080 

ACAAAGAAAC AAAACACAAC ATTGGCTTTA TTACGCTAGA TGAAATTGCG TATCGGCAAA 7140

ATGTTTCGTT TAATAATAGT AATTTTGAAG CAGATATTGC AGAGTTTTTT ATAGGGACTG 7200 

AGAAAGTATT GTTAGTGAAA CCATTAACAT TTATGAATGA ATCAGGTCGC TCTGTTGGTC 7260

CGTTGTTGAC CTACTTTGGG GTAGATGAGG AAGATTTGAT TGTTATTTAT GATGACTTAG 7320 

ACTTAGAGAT CGGTAAAATT CGTTTACGGC AAAAAGGTAG TGCTGGCGGC CACAATGGGA 7380 

TAAAAAGTCT GATTGCCCAT TTAGGAACAA ATGTCTTTCC TAGAATTAAA ATTGGCATTG 7440

GtCGGCCCTC aAAAAATGAT ACAGTTATCC ACCATGTACT AAGTACTTTT CCCAAAGAAA 7500 

CACACGAAGA GATGTTACTT GCTGTAAAAA AAGCAGCCGA TGCCGCTTTA TATGCATGCG 7560 

AAGGACATAC GTTTGTTGAA ACAATGAATC AATTTAATGG AAAATAGCAG AAAGGAGGGC 7620

CCTTGTTTGA ATATTATCGA ACGAATTAGC CAAACAGAAA TTGCTAAATC ATGGCGCAGT 7680

CAACTAATGA CAACAGGAAC GCGCCAACTA ATTACTGGGT TATCAGGCTC TGCTAAAACA 7740


TTACTGATGG


CAGGCGCTTT 


ACAAAAGACG 


CATCAAAAAA 


TTGTTATTGC 


GGTCCCAAAC 


7800 


CTATACTATG 


CCAATCAATT 


AGTAGATGAT 


TTTTTAAATA 


TTTTACCTTC 


AGAACAAGTT 


7860 


CATCTATTTC 


CAGTAGATGA 


AGTGTTATCC 


GCCGAAATGG 


CCTTTTCTTC 


ACCAGAAGCG 


7920 


CGCGCAGATC 


GAGTGAGTGC 


GTTAAATTTC 


CTGATGACAG 


CGAATGCAGG 


AATCGTGGTC 


7980 


GTACCAGTAG 


CTGGACTACG 


AAAATATTTG 


CCAACGAAAG 


AAACGTGGCA 


AAATGCGCAA 


8040


CTTTATTGGG 


AAATCGGGAC 


AGAAGTAGAA 


CCTGAGATCC 


TCGCGCAACG 


CTTAGTGTTA 


8100 


ATGGGCTATG 


AACGCCAAGC 


AATGGTCGGA 


AAGCCTGGCG 


AATTTAGTAT 


TCGCGGAAGC 


8160 


ATTGTTGATG 


TCTATCCGCT 


AAATGCCGAA 


TACCCTGTTC 


GGGTGGAACT 


GTTTGATGTT 


8220 


GAAATTGACT 


CAATCCGTTA 


TTTTGAAGCA 


GATACCCAAC 


GTTCATTAGA 


AAATCTTGAA 


8280 


GAAGTAACGA 


TTTCACCAAT 


GACTGATTTA 


GTATTCTCAA 


AGGCAGATAT 


GAATCATGGC 


8340 


ATGACGAGGT 


TACAAGATGC 


TCTAGAAAAA 


CGTTTAGCGA 


CAgCTAAAGA 


TCAAACGGAG 


8400 


ACCGATTTTT 


TAGAAGAATA 


TTTTGGTCAG 


TTGCTGTCTT 


CTTGGGAACA 


AGGGATTCCT 


8460 


ACAGAAAATG 


CTCACTATTA 


TACAGATTTT 


CTGTACCAAC 


AAAAAACTAC 


GCTGTTAGAT 


8520 


TATTTACCAG 


AAAATAGTTT 


GCTTTTTGTG 


GATGATTATT


CTCGTATGAT 


GGAAACGGAG 


8580 


CGAGAAATCG 


CTAGAGAGGA 


AGCGGAATGG 


CAAACCCTGA 


AAATTGAAGA 


AATGCGCGTG 


8640 


TTTTCTGAAC 


AAACTTTTGG 


CGTAGACTTT 


CATGATCAAT 


TGCGAAAAAC 


TGCACGAAAT 


8700 


ACGACCTTCT 


TTTCACTTTT 


TCAAAAAGGA 


ATGGGGAATC 


TTCGTTTCCA 


AGAAGTCCAT 


8760 


AATTTCCAAT 


ATCGTTCAAT 


GCAACAATTC 


TTCGGTCAAA 


TGCCGTTGCT 


AAAAGCGGAA 


8820 


GTGGACCGTT 


GGCAAAAGCA 


AAACCAAACG 


GTCGTTGTTT 


TTGTTCCTAC 


GAAAGAACGG 


8880 


ATTCAAAAAG 


TCGAAGAGCT 


ATTTCAGGAC 


TTTGACATAC 


CTAGCGTAGT 


AAGCAATTGG 


8940 


GATCAGTTGT 


TAGATGGGCA 


TGTCCAAATT 


GTCCAAGGGG 


CATTACAAAC 


AGGGTTTGAA


9000 


TTGCCTAAAA 


ACAAGCTCGT 


GGCGATTACG 


GAAAAAGAAA 


TTTTTCATAC 


AACGACCAAA 


9060 


AAACGGGCAC 


GTCGTCAAAC 


AATCTCTAAT 


GCGGAACGTT 


TGAAAAGTTA 


TAGCGACTTA 


9120 


AAAACGGGCG 


ATTATGTCGT 


TCATGCGAAT 


CACGGGATTG 


GTAAATATAT 


TGGTATGGAA 


9180 


ACCTTAGAAG 


TTGATGGGGT 


CCATCAAGAT 


TACATGACTA 


TCTTGTACCA 


AAATGATGAC 


9240 


AAGTTATTTA 


TTCCAGTAAC 


ACAATTGAAC 


TTGATTCAAA 


AGTTCGTTGC 


TTCCGAGTCT 


9300 


AAAACGCCGA 


AAATTAACAA 


GCTAGGCGGC 


AGTGAATGGA 


CGAAAACTAA 


ACGAAAAGTT 


9360 


GCTTCGAAAA 


TTGAAGATAT 


CGCGGATGAT 


TTAATCTTAC 


TTTATGCGAC 


AAGAGAGTCG 


9420 


GAAAAAGGCT 


ATGCTTTTCC 


GCCAGATGAT 


GCGTATCAAA 


AAGAATTTGA 


AGAGGCTTTT 


9480 


CCGTATTCTG 


AAACAGATGA 


TCAATTACGT 


AGTGCCGCCG 


AAATTAAACA 


CGACATGGAA 


9540 


AAATCTCGAC 


CAATGGATCG 


TTTGTTAGTT 


GGCGATGTAG 


GGTTTGGTAA 


AACAGAAGTG 


9600 


GCTTTACGAG 


CGGCTTTCAA 


AGCTGTTAGC 


AACAATAAAC 


AGGTTGCCTT 


TTTAGTGCCA 


9660 


ACAACTATTT 


TAGCGCAACA 


ACATTATGAA 


ACGATTCAAG 


AGCGGTTTGA 


GGGCTTTCCT 


9720

GTAGAAATTG GTTTATTAAG TCGT7TCCGC ACGAAAAAAC AGCAAAATGA AACAA7TGAA
AAAATTAAAC ATGGCCAAGT AGATATTGT3 ATTGGTACCC ATCGTTTATT GTCGCAAGAT 
ATTAATTTTA GTGATTTAGG CTTG77GATT ATT GAT GAGS AACAACGGTT TGGAGTGAAA 
CATAAAGAAC GACTAAAACA ATTGCC-TTCA CAAGTGGATG TCTTGACGTT AACAGCAACG 
CCAATCCCAA GGACCCTTCA TATG7CGAT3 TTAGGTGTCC GTGATTTATC CGTCATCGAA 
ACCCCACCAG AGAATCGTTA TCCGA7CCAA ACGIATGTCA TGGAAAATAA TCCTGGGGCG 
ATTCGGGAAG CGATTGAGCG AGAAATGGCC CGAGATGGTC AAGTCTTTTA TTTATACAAT 
CGAGTAGATA CGATTGAACG AAAAGTTGAA GAG7TACAAG CACTAGTTCC AGMGCACGA 
ATTGCGTATG CACATGGGCA AAT G ACAGAA GTACAACTGS AGAATACTTT GT7TGATTTT 
ATTGAAGGAC AATATGATAT ATTAG7CACT ACAACAATTA TTGAAACAGG AG7AGATA77 
CCTAATGCTA ATACCTTGTT TGTASAAAAT GCG3ACTACA TGGGCTTATC TACGTTGTA? 
CAATTACGTG GTCGTGTGGG GCGTAgcAAT CGASTGGCCT ATGCTTATTT TA7G7ACGAA 
CAGCAAAAAA TCCTAAACGA AMAS76AGA AAC3TTTAGA AGCAATTAAA GACT7TACGG 
AATTAGGTTC TGGGTTCAAA ATTGCSATGC GTGACCTTTC CATTCGAGGG GCCGGGAACC 
TTCTTGGTGC CCAACAACAC GGcKCATTS ATT3GGT7GG GTTTGATATG TACTCGCASA 
TGTTATCTGA GGCAGTTGCT CGTMACAAG GAAAAAATA7 CCAAGACCAA AAAACGTC7G 
TTGAAATTGA TTTGGGCATT GATGC7TATA TTCCAGGAAC GTACATTACA GAC GAACGAC 
AAAAAATTGA AATTTATAAA CGGA77CGTC AAT7AGAAAA TATGGATATG T AC GAAGAAT 
TAGAAGCGGA CTTGTTAGAC CGC777GGAG AGTATCCTGA CGAAGTGGCG CA7TTATTAA 
CCACTGGCCG TATCAAAATG GsTGGCGACC GTG7CTTGTT AGAAAGTATT CG7AAACGCG 
ATCAAAAAGT CAAGTTTGTG TTAAG7AAAA TTGGCACAAA AACATACTCT GTGGAGCAGT 
TGTTTGAAGC GTTATCGGCA ACCAG3TTAA AAGCGGATCT AGCAGTTGAG AAAGAACAAA 
TGACTATTTC CTTGAAGTTG CCGAAA.GA77 GTAAAGAAGC -TGTTTGGATT CAAGAAAT7G 
CTGCTTTTAC AACCGCATTG CGACAAGWA AATACA^ACA TTCGCAAGGA AC7-GATTC7C 
TTTAAAAGGA GGCCTAGAAG ATGG3ACAAA AAGAAATGCA ACGAATGATG CAAGG7GC7G 
TTGTGTTAAC CATTGCTTCC TTCA77GCAA AAG7GTTGA3 TGCGTTTTAT CGCG7TCC77 
TTCAGAACTT TGTAGGGGAT GAAGGATT77 ACG7GTATCA GCAA3TTTAT CCAATCTA7G 
GGATTGCGAT GACATTAGCA TTG73A3GC7 7AC:GCAAT7 TATT7CCAAA AT7G77GC7G 
AACAACCAGA TATTCGGAGT CAA-AACAAG 7GT7GCGCCA ACTA7ATCCC 7A7G7GTTA7 
GGTTAGCGAT TGCTTGTTGG GCC773TTT7 7CTTTGGTAG TCAA3AAATT GC7A77AG7A 
TGGGGGATGC TGCACTCCAA CCA77AATG3 AAG7TGTTTC C7TTACC7TT TTAC73GT7C 
CAATTCTTTC TTTCTATCGT GGGAA77T7 7 aAG3TCATT7 G7TGATGGTG CC7AG7GG7A 
TTTCACAGGT GATGGAACAG TT7G737G73 7CG33G7TA7 TCTAGTAGCT GCG77GTC77


ATCATTACTT


TGGTGGCTCC 


ATTTACCAAA 


GGGGGACGGT 


GGCCATGAGT 


GGTGCGTTGG 


11760 


CTGGCGGTAT 


CTTAGCGGTA 


CTTGTCTTAT 


GGTACTACAA 


TCGAAAAATT 


CTTAGTGGAA 


11820 


GTACGGAATA 


CTTGCATCAA 


TGGAAGATTA 


TGCCTCAAAC 


CACAGGACTA 


TTTAAACGTT 


11880 


TAATGATTGA 


GGGTGGTTTG 


GTCTCTTTAT 


ATAGTGCTTT 


TCTGATTCTA 


TTTCAATTAA 


11940 


TTGATTCCTT 


TAAAGTAAAA 


AATGCCCTCA 


TGCTTTTCGG 


GCTCTCTGAT 


TTAGCTGCGA 


12000 


AAGTGGAGAA 


AGGGGTCTAT 


GATAGAGGAC 


AACCACTAGT 


TCAATTAGGA 


TTGGTAATTG 


12060 


CTTTAGCTTT 


AAGCTCGACC 


TTTTTACCAG 


GTTTGACAAA 


ATATTTTATG 


AAAAAAGACC 


12120 


GTCAACAATT 


TTTACAAGTT 


GCAAAAATCT 


TTTTGCGCCT 


TACCACAACC 


TTGGCATCGG 


12180 


CGGCTTCGAT 


TGGTCTAATG 


ATGTTATTGC 


CGTACATGAA 


CTTTACGCTT 


TTTAAAGATT 


12240 


ATAAAGGCAA 


CGATGTTTTA 


GGGGTGTATG 


TTTTATCCAT 


CGCTTTTATG 


GCCATTATTC 


12300 


AGGCCTACCA 


AAGTATTGAA 


CAAAGTCGCA 


ATCGTTTTAA 


AGGTCCTTTA 


GTGGCAGCTG 


12360 


GTGTGGGCTT 


ATTAGTGAAG 


TTACTGACAA 


CGGGCTTTTT 


CACCATTCGT 


TGGGGAACAT 


12420 


TAGGTGCTAG 


TTGGTCTACT 


ATTCTTGGGT 


TACTAGCGAC 


GTTATTCGTT 


TTAGTCCGAC 


12480 


AATCGGATGC 


AGCGATTAAT 


TGTTTTGTTC 


GAGAACGAAA 


CTTTTTGAAA 


AAACTGTTGC 


12540 


TTAGTTTAGC 


AATCATGATG 


CTGTCATTAC 


TGGTCTATCA 


AGGAATTATC 


TCCATCTTGT 


12600 


TCCAAGGTGT 


GCATCACCGA 


AGTCAAGCGT 


TTTTCGTTAC 


AGTTTTAGGT 


GTTGCAGTGG 


12660 


GTGGTACAGT 


TTTTATTAGC 


ACCATCATTA 


AGTTAGAATT 


ATTTACGATA 


CGTGAATGGT 


12720 


TAAGTTTACC 


GTATGGCGCA 


AAAATTTTAC 


GGATGAGACA 


AAAAAAATAA 


AAGAGGTGAA 


12780 


TGTGATGCGG 


TTAGATAAAT 


TTTtAAAAGT 


TTCTCGCATT 


ATTAAAAGAA 


GaACAGTTGC 


12840 


CAAAGAAGTC 


GCTGATAAAG 


GCCGGATTCA 


AATCAACGGA 


GTCTTAGCCA 


AATCTTCTAG 


12900 


TACTGTGAAG 


ATTGGTGACC 


TTGTCAAAAT 


TCAATTTGGG 


AACAGAATTT 


TAGAAGTAGA 


12960 


AGTCCTGCAA 


CTGAATGACT 


CTACGAAAAA 


gAAGATGCCA 


CAAAAATGTA 


TGAAATAAAA 


13020 


TCAGAAACAC 


GTGTCTCTGA 


AGAATAAACA 


GGTGAGGATG 


GAGCGCTTGG 


CAAGTTCATT 


13080 


GAAAAGATGA 


ATATTCCAAG 


CGCTTCATCC 


TTTCAGCGTT 


TTAATGAAAC 


AGACGCGCCG 


13140 


CTTTTGAAAA 


AAAT 










13154 



(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

TCGTCGTTGC TGAATGCGAC AGCAGTTGGG CCTGTAAAAG ACTCGTCTAA GCCGTCTAAA 60 

CCAACTTTTT TAGCTGCACG AGAAAGGATT GAGTTTTTGA TGACTTTCAT TTCTACGCCT 120 

GCATCACGTA ATTGTTTACG TAAGTTAGTG ACTTCTTCAA CTGTTAAACC ACGGTAGTCC 180

ACGATTACAA CTGATGCTGC TGATTCAAAT TTCTCAGCGG CTGCTTGAAC TAGGGTTTCT 240

TTTTTAGCGA TTGCTGCTTC ACTCATTTAT TTCACCTCCT GATTTTGATG GTGGAGTTGG 300

ACTTGGTTTA CGTTAkGTAA AGTAGAAGAA TCaAATGAAA CTTcGCTnAC TTCGTTCGCC 360

AAGTATTGTT TTACATCGGC TTTTTTTCAA AAAGTCGTGT AAAACAACAA AACTCCATGC 420

CACCGTrGAC ATAGAGGGaC CGAATTGTAT ATAACTCaAC AATTGTCCTC GGTAGGaAAT 480

TmAAGACTTG CGTCACCTAC TGTCTTCGGG ACAGTTAATT ATTAACCC 528 
(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 



GGGGAAGCCA 


AGCAAGAACA 


GAAGCGACGG 


GTTATGGAAC 


GGTTTATTTT 


GTTAAACATT 


60 


TATTAGcAGA 


TAAAAACGAT 


ACGTTTGAAG 


GCAAAAAAGT 


CGCTGTTTCT 


GGTAGTGGGA 


120 


ATGTCGCAAT 


TTATGCCATG 


GAAAAAGCAA 


CTGAGCTAGG 


CGCCACAGTA 


ATCACATGTT 


180 


CCGATTCGTC 


TGGTTTTGTC 


TATGATCCAG 


AAGGAATTGA 


TGTGGCGTTA 


GTAAAAGAAC 


240


TAAAAGAAAA 


AAATCGTGAG 


CGCATTTCAA 


AATATGTTGA 


AACACGTAAA 


GGCGCAACTT 


300 


ATTATGATAA 


AGAATCTGTT 


TGGAATTTCG 


AAACAGCGTA 


TGACATTGCG 


TTACCATGTG 


360 


CAaCACAAAA 


TGAAATCmAC 


gAAAAGCAAG 


CCGCTATTTT 


AGTGAAAAAC 


GGAGTGAAAG 


420 


TCGTGGCAGA 


AGGCGCAAAT 


ATGCCTTGTA 


CACTAGAAGC 


AGTCGCCGTT 


TTTGCTAAAT 


480 


CAGCTGTAAT 


TTATTGTCCA 


GGCAAAGCCG 


CTAATGCAGG 


TGGTGTTGCG 


GTTTCTGCAT 


540 


TAGAGATGAG 


CCAAAATGCT 


GAACGTTTAG 


CTTGGTCCTT 


TGAAAAAGTT 


GACGGTATGC 


600 


TAGATCAAAT 


TATGCAAAAC 


ATTTATGAGA 


CTTGTCGTGA 


TACAGCTAAC 


GAATATCAGG 


660 


CACGAGATAA 


CTTTGTTTTA 


GGTGCGAATA 


TTGCTGGCTT 


TGAAAAAGTA 


GCGGCAGCGA 


720 


TGCTAAGTCA 


AGGACTCGTT 


TAATAACAAA 


AAATGCTAGG 


ATTTCCTAGC 


ATTTTTTATT 


780 


GATTTGTTTG 


ATTAAGAGAA 


TAAATATCTC 


TAATTAAATA 


GTTAAAATTA 


AACAATTGGT 


840 


CTATCCAAAA 


GAATTTGCTT 


GAAAAAATGT 


CTAAAAACAC 


GTAAAGTATA 


CATGTAGTAA 


900 


GTTTACTTTC 


GAGGaGGAAA 


AGGAATGTCA 


CACmTTCmAT 


TaGATTATTC 


CAAATTaGCA 


960 


CCATTTGTTG 


CTGACCATGA 


ATTGGAATAC 


aTGcAAcACm 


AGTCACTGCA 


GTGGgACAAa 


1020 


GCATTACGTG 


AAGGtACAGG 


AGCTGGtAAT 


GATTTTACTG 


GcTGGaTtGA 


TTTACCgaAA 


1080 


ACTATGACAA 


AGAAGAATTC


GCGCGCmTTw 


AAAAAGCAGC 


AGCAAAAaTT 


CAATCTGACT 


1140 


CTGAGGTATT 


AGTAGTCATT


GGTATCGGTG 


GTTCATATTT 


AGGTGCCCGT 


GCAGCAATTG 


1200 


AATTTTTGAC 


ACATTCATTT 


AATAATCTAT 


TATCCAAAGA 


AGAACGGAAA 


GCGCCACAAA 


1260 
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TCTTCTTTGC TGGAAATAGC ATCAGCTCAA CTTATTTAGC TGATTTAATC AATGTGATTG 1320 

GTGATCGTGA TTTTTCTGTG AACGTTATTT CAAAATCTGG AACTACGACA GAACCAGCGA 1380 

TTGCTTTCCG CGTCTTTAAA GAGTTATTAA TTAATAAATA TGGCAAAGAA GAAGCCAATA 1440

AACGTATTTA TGCGACAACT GACCGTGCTA AAGGAGCAGT AAAAGTTGAA GCTGATGCGG 1500 

AAGGTTGGGA AACATTTGTG ATTCCTGATG ATGTGGGCGG CCGTTTCACT GTTTTAACGC 1560 

CAGTAGGTCT TTTACCAATC GCTGTTAGCG GTGCGGATAT TGATCGTTTA ATGGAAGGTG 1620 

CAAATGATGC CCGTAAAGAA TACGGCGCAA CGAGCGACTT AAAAGAAAAC CAAGCCTATC 1680 

AATATGCTGC GTTACGTAAC ATTTTATATC GCAAAGGCAA AACAACTGAA ATGTTAATTA 1740 

ACTATGAGCC TGGCATGCAC TATTTCTCTG AATGGTGGAA ACAATTATAC GGTGAATCAG 1800 

AAGGAAAAGA TGGCAAAGGT ATTTTCCCAG CAGCCGCAGA CTTTTCAACT GACTTACATT 1860

CCATGGGTCA ATATGTTCAA GAAGGTATGC GTAACTTATT TGAAACAGTG GTTAAAATTG 1920 

AAAATCCACG CCATAGCATT TCAATTCCAG AACAAAATGA AGACTTAGAT GGCCTAGGCT 1980 

ATTTACAAGG AAAAGAAATT GAcTTCGTCA ATACAAAAGC GTTCGAAGGT ACGTTGCTTG 2040 

CTCACACAGA TGGTGGTGTG CCAAACATGA TCGTTAAAGT GCCAACAATG GATGCTTATT 2100 

CATTAGGTTA TGTTATGTAC TTCTTCGAAA TTGCAGTGGG CATCTCAGGT TACTTAAATG 2160 

GGGTTAATCC ATTTGACCAA CCAGGTGTTG AAGCTTACAA ACGCAACATG TTCGCTTTAT 2220

TAGGTAAACC AGGATTTGAA GAATTAGCAA AAGATTTAAA TGCACGCCTA TAGAATAGTA 2280 

TTCTAAGTTA ATTTTTAGCt TTGTGGTGAT ACAAAGATGT TTGCTTTGAA ATATTTTTGG 2340 

TGATACATTT GGTGATACAT TTATAAAAAA ATCAGGTAAC GTTTTAGTTA CykGATTTTT 2400 

kGTGTgCATT kGATATAATt CGAGTTTATT TACAGCTCGT TCTTTCACTT CAGGAGAGAC 2460

ATGTGCATAT ATCTCAGTCG TTTTAACATT CTTATGACCA gmATGTCTTT AATGTCCTCA 2520 

AGTGGAACTC CTGCTTGTCG TAAACGAGCA GCATTAGTAT GGCGACCGTC ATGAATTCTG 2580 

ATTTTTGGTA AGTTTGCTTT TTGAATGATA CGGTGCCAAG CTGTGTCAAC GGATCGTTGA 2640

CGAATAGGCA AGCCAACGTT TTTTCCTCTA TGATTGATAA ATAAAAAATC TTGGTTTGaG 2700 

TTAGTCATGG GTAATACATT ACTTGTAAGA TCATAATTAA TACTATAGAG TTCTAAGACG 2760

GAGTTTCTCA CTCTGTTTGT CATTTTTAAA CTTCTCTTCC CAGCATCAGT TTTAACATCA 2820 

TCTAAAATTA CTTGACCAGG TTGCTCTTTT CTATACAAAC GCTGCTtGTC AAaTGTGAGA 2880 

aTATyATCTC TAAAATCAAA GTTAAACCAT CGTAAGGCCA TTACTTCGCC TTTCCGCAAG 2940 

CCTAAGTCGA AAATGATTAA AAAAAATACA TACCACATAA AATCGTCAAA ATTTCTAGCG 3000 

GTGTCAAGAA ATAGATTTTC CTGCTCTAAA GTGTAGAATT CTAGTTTCTT TTCTTTATTT 3060 

GTTTTTTCCT TAACCTTATC TAAAGGAAAT TCAACGAAGT AGGTAGGATT GCTTTTTATA 3120 

TACCCTAATT TTTGTGCCTT TTTGAATGCG TTGGAAAGAG CTGCATTTAC AATTTTTACT 3180 

GTATTGTAAC TAAGCCCATT TTTACTTTTT CCTAGTTCTT TATTCGTAAA CAAGCTATTA 3240











ATAAACTTTT GGTGATCCGC AGgTTTATAT TTATTTAATT GATAATTACC TATATAGGGA 3300

TCAATATACA TACGGATAGC ATCTCGATAG ACAATTCGTG ATCCTTCCTT AACGTTATTT 3360

TTGTATTCGT TTATCCAAAT ATTCATAAAT TTTGAAATAG TCATTTTTTG TGGATTTTCT 3420

TTAACGAATT CTTCAGATGC AATTTGCCCT TCAATTATTT TTGCAGCTTT CAAAGCTTCC 3480

TTATGACTTC TAAAACCTCT CTTGTGTAtT TTtTGTTGCC TGCCAGTACC ATTGCTACGT 3540

CCATTTGAAA CGAaGTATTC CCAACGTTCT TGACCATTTT TCAATTTATA AGATTTTATA 3600

GAAGCCATTA ATCTTCACTC CTGAACATAG ATTTTGGAAG GGTATAAAGT TTAGGTTTAT 3660

ACCAGCCTTT TTCATAGAAC TTATCCATTT TTAATTCTTT TTCTTTCCAT GAAAATAAGG 3720

GGAGTGTGAC CATATTATCA GTTTCAATGT CTTTTTTTAG TTTTAATTGA TCCTTCAGCA 3780

CATCTTGATA TTCTTCTTCA CTATAATATG GGTTtGTTAA TATTGGATAA ATTGTTACTT 3840

CTGGAACATC TCTTTTCAAT CTTTCTCTCA ATTCATATGG THTfATPATT atzxctpa atp x\ 1 no x v*rtrt 1

CTTTCGTAGT TTTACTGTTT AAATAAAATT AGTAACATAA AATAGAAGAT TTATATTAAA 3960

AATCATTTAT TTTTTCAGTA TTCAGCTCAA TGGATAAGAA CGTATGTnCT TTTTGGTTTT 4020

AAAAGAAAAG CCCATAGGCT AATCTTACTT AGCGGATTTA AACGTACATC CGCAATTATT 4080

ACACnCCATG TGTTTTTnTT ACCCnTTTTT CCAGCGAAAC CAGCTA 4126



(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13884 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 



GACGATTTAG GCGTTAAATT AAAAATCGAA GAACTAGGCT TTGATGCTCT GTTAGGGGCC 60

TTAAAAACTG GCAAGATTGA CATGGTCATC TCTGGCATGT CGCCAACGCC TGAACGATTA 120

AAAGAAGTAG CTTTTTCTCA ACCATATATG AACGTTCAAC AAAGAGTCAT TGTTCGGAAA 180

GAAGACAAAG ATAAATTTCA TTCCACTAAA GATTTCGAAG GCGTTCGTGC AGGTGCCCAA 240

AAGCAAACGA CACAAGAAGA ATTAATTAAA ACCGAATTAA CCGGTGCTGT CCCTACTTCT 300

CTTCAAAAAA TTCCAGATTT AATTATGAAT CTAAAAAATA ATAAATTAGA TGCAGTCGTT 360

ATGGAAGAAC CGGTTGGCGA ACTATGgTTT CTCAAAATGA GGATTTAGCT TTTGCTGATG 420

TTACCTTTGA AAAAGGCTCA AAAGATACTG CCGTTGCTTT TCCTAAAGAT GCGCCCGTTT 480

TGACGGAGAA AGTAAATGCC TCTATTAAAC AAATTAATGA TCAAAAATTA ATGGACGGTT 540

ATAAAAAAGA AGCCAATCAA TTAATGTTCC AAAAAGATCA AAGCTTTTTA CAAAAGTATG 600

GAAAGTTCTA CCTTAACGGT GCAGGCTATA CCATTTTCCT AGCTTTCATT GGTGTTTTGT 660

TTGGTGCCAT TTTAGGGGCA TTACTTGCCT TAATGAAGTT AGCCAATTCC AAAATTTTAC 720

GTGGCATTGC CATTGCTTAT ATTGAATATG TTCGTGGCAC TCCTTTGCTT GTTCAAATTT 780

TTATTGTCTA TTTCGGGACA GGCGTTCTTG GCTTAGACTT ATCAAAAGTA GCAGCTGGTT 840

GTATTGCCCT TTCTTTAAAC AGTGGTGCAT ATGTAGCAGA AATTATcGTG CTGGTATTCA 900 

GGCAGTTAAT AAAGGACAAT TGGAAGCTGC TCGTTCTTTG GGAATGAATC AAAACCAAGC 960 

AATGCGATAC ATCATTTTCC CACAAGCTAT TAAAAATATT TTACCAGCGT TAGGCAATGA 1020 

ATTTGTTACA GTAATTAAAG AATCTTCCGT TGTTTCAGTC ATTGGTGTTT CAGAACTAAT 1080 

TTTCCAAGCT GGCAACGTTC AAGGAGCAAG TTTTAAACCC TTCTTACCTT ATTTAATTGT 1140

ATCGTTAATT TATTTCGTCC TAACCTTTAC TATCTCACGT CTATTAGGTG TTGCTGAAAG 1200 

GAGAATGAGT ACTAGTGATT AACATTAAAA ACTTACAGAA AACATTCGGA AAAAATGAAG 1260 

TACTTAAAGG AATTGACTTA GAAATTGAAG CTGGTGAAGT TGTCGTAATT ATTGGCCCTT 1320 

CTGGTAGTGG TAAAAGTACT TTTTTACGTT GCTTGAACTT ATTAGAAACA CCAACTGGTG 1380 

GCAGCGTTGA ATTTGAAGGA AAAAATTTAT TAGATAAAAA TACCAACATT GATCAACTTC 1440 

GTCAAAAAAT GGGAATGGTA TTCCAAAATT TCAATCTTTT CCCGCATAAA AATGTCTTGG 1500 

AAAACTTAAC GATTAGTCCT GTTAAAGTAA AAAAAGAAGC CGCTTCTGAA GCCACAGAAC 1560 

ATGCTCTTTC TTTACTGGAA CAAGTTGGTT TATCAGATAA AAAAGAAAAT TATCCCTCTC 1620 

AACTTTCTGG TGGGCAACAA CAACGGGTAG CGATTGCTCG TGCTTTAGCC ATGCGCCCTG 1680 

ACGTGATGCT GTTCGATGAA CCAACTTCTG CCTTAGACCC TGAAATGGTG GGCGAAGTGC 1740

TATCCGTTAT GAAAAATTTA GCTATTGAAG GGATGACAAT GGTAGTGGTT ACCCACGAAA 1800 

TGGGCTTCGC AAAAGAAGTC GCGGACCGTG TCATTTTTAT GGATGCAGGA ATCATTCAAG 1860

AAGAAGGCAC ACCTGAAGAA ATTTTTGATG CGCCACAAAA CCCAAGAACA CAAGATTTTC 1920 

TAAGAAAAGT TTTATAAAAA GGAGCACCTA GTATGGATCG CAAGAATATC GCAACACGTT 1980 

ACCAACAACC TACTGAAAAT TTACTCATGG ATATTGCAAC ATTGGCGAAA AAGACCCCAA 2040 

ATTTAATCGA TTTATCGATT GGTGACCCTG ACTTAATTAC CGATGAGCGG ATTATCGAAC 2100 

AAGCAGCTAA CGATGCCAAA AATGGGCATA CAAAATATAC CGCTTCTGAT GGTAGCGAGG 2160 

CATTCATTGA GGCGGTCATT CAGTTTTATC AGTCACATTA TCAATTATCT TTTCAACCGA 2220

ATCAAGTCCG GGCCACAGTT GGCGCTCTAC ACGGGATGTA CCTTGCCTTA CAAGTGATTC 2280 

TAAATCCTGG AGATGAAGTG ATTATTCATG AACCTTATTT CTCTCCTTAT AAAGACCAAG 2340

TCCTTTTAGC AGATGGCGTT CCGGTCTTTT TACCAACCTA TGAAGAAGAT GGTTTCCAAA 2400

TTGATGTAGC TCTTTTGAAA GAAAAAATCA CGCCTAAAAC CAAAGCGATT ATTCTTAATT 2460

CACCGAATAA CCCGACAGGC GCTGTTTTCT CTGAAGAAAC TTTCAGAGAA ATTGCACAGG 2520 

TAGCGATTGA ACACAACTTA TACATTCTTT CAGATGAAGT TTATGAAGCC TTTTGTTTCC 2580 

AAGAAACGTT TACTCCCATG GCTACGTTTG CTCCAGAAAA TACTATCACT TTTGGCAGCT 2640

TTTCAAAAGC CTTTGCAATG ACTGGTTGGC GTATTGGTTA TATGATTGCC CCAGACTATA 2700

TTAACGAAGT AGCCAAATTA ATTAATGAAG GCGTTACGTA TTCTGCGCCT ACGTTGTCAC 2760

TCATAATGGA AATATAATTG ATCGTTTAAC TTACGAATCA TCGGATTATC CTTAGGCGTA 4800

CCAAAAATAA TCAAGTTATG ATTTTCTAAT GCCGCATTTT TAGGCGCTTT TTTATAATAC 4860

GTAATCTCAC CGACATTACT TTCGGCATAA TTACCAATTA AGTTAAATAG ATTGGTCAAC 4920

ACTTTAAAAT AGTTATCGTC CATTTTTTCT GGTAATAAAA TCGCTAAATC AGCAAAAGAA 4980

CGGCTTCTGA TAAAAATGTT CGGATAGTTG TTAAATAATA TGTCATTCAG TTCTTCTGTT 5040

TGAATAAACG CATTCGAATT ATTTTCAATA AAGGCCCATG GAGTCTGACC ATTTCGTAAA 5100 

ACTTCTGGTG ATTTAACATT CAAATCAAAA GCAACTTTTA AAACAAAACT ATCAGCAATT 5160 

TCTAAGTTCT TTGGAAATTC GAGATTTAAT TCATCACCGT TCGCACGAGC AGCGGTTAAT 5220 

TTCTTGCTGC CAATTGGTTT GTCATTGGCA TAGACCGTCA CTAAAGAAGA ATCAAAATCT 5280 

AAGTTTTCAG CGTATTTAAA ATGTAAATGC ACGCGACTTC CATTGGCATT GTTTCGATCC 5340 

ACTGGTAAAT TGACAAAGTA CGTTTGTTCT TGGTGATAAG CCCCTTCTAG TTTGTCCCCC 5400

GTTGAGGTTA ATGGATAATT TCCATCAAAT TCTAAAGTTG AACTAAACGT ATCGGTATTC 5460

TCATCAACTG TTGTCTCTTC TTTATCTGTT TGGGTCATTA ATTCATAGTT TGCTAAATAA 5520 

CGTCCCGCAC GAACAAGTAA GTCCTCATCT TTAGAAGTAG CTACTAAAAC ATGCTTATCT 5580 

GGTTGATTGA AAAATTTTAA GACAGCCTGG TCCTCCACGC GTTTGCTATC AATTTGACTT 5640

TTATACTGAT CTGGTAATTT ATCATAGCTG GCAATAATCA ACTGATAGGG TGCTGACTGT 5700

TCTTTGTTCA AAGAAGCCAT TGGCAACAAT TCTTCTGAGG TAGTAATCAA ACGAGCAGCA 5760

CCAGcTAACG CATGGGTCGC TGCAGCTAAT TCTTTTTCCG ATGCATTTTC TGGTGTCAAA 5820 

ATAACACTGT GTTTATTCGC AATCGTGTCT GCTCCCACAA AATGATTATA AAAAGAATGA 5880 

ATCGTATTCT CTGGTAATTG TAAATCGTAT TGGAAGTTTA CGTTAGATCC TTCATACATC 5940

GTTAACCAAT TTGCTGGAGT TTCAATTAAA TTATACATGT CGTTTCCTGC CTGATTAATA 6000 

ATTTGGCCTT CAACAGTCAA TGTATTTGTC TCTTTAATCA GTTCCAAAGG GATTTCAATC 6060 

GTCTTTTGTT GAATACCAGT CGTATTTTCA GGACGCCAAG AATAAAACTT CACACCATTA 6120 

ACTGCCACAG TTAAATCTGA GGTCTGATTA TTTTTTAATT GTGTGATTTG GtAGGTCATA 6180 

TTGAATGATG CTtTTTTTAC GTCCCAATAA TCAATTTTTG TAAAATACAT GGTGCTACGG 6240

ACAGAAGTAC CTGTTaAAGA GATTGTTTTA TTTTGAAAAG GTTGTGTAAA GGTATGGtkG 6300 

tCTkGTtCtT tGTTTnCGGC ATATACGGTC GGACTtGCGA CACCAATCAG AATTGCCAGA 6360 

AAGAACAACC CTATTAAAGT GACTGTTTTT TGTTTCATTG CACGTTTCCT CCTTATTTAA 6420

ATCGTTCAGT TTTGTACCAT TTTGTTTGTG TTTGTTTATG AAAAACATTT TCTTTAATAT 6480

ACATCACCAT GCCATAAGCG GCGACTACTA GCCACATCTG ACTGTATGTA ATATACATCA 6540

AGGCAATAAT TAAGATATTC TCTAAAGTCA TTTCGCCTTT TTCGGTTGTA ATACTGACAA 6600 

AAGTACTGAA AATAAATAAC AAAATCGCTA ACAACCACAA GGAATTGCTA AATCCTTGCA 6660 

GGGTCGTTGT TAAGTAACCA GACATTGATA AAACTAACAT GATGTCTGAC AAAACGAGCG 6720

ATGTCATTAA GAGAAAATAA ATGGACAAGA AATATAAAAT A7CAAAACGG ATCCGACTCG 6780

CTTTAGGATT AAACAATAAT TTAGCATTTT TAACAA7GAC A7AAATATTG CCTTTGACCC 6840 

ATCGCGTCCG TTGTCGGAAC CAAACTGACA AAGTTTGTGG C7CTTGCTCC CACGTCACTG 6900 

CTTTTGGTTG AAACTTAATT TTATAGCCCA TCATATAGAT GCGGAAACTG ATTTCTGTAT 6960 

CTTCGGCTAA CGCTTTGTCA TCCCAGCCGC CAATGGCTTC AATAATACTA CGACGAACAA 7020 

TGAAATTTGT GCCAGGAATC GTGCATAACT TAAATAATTG AAAACGACCG GCTTGTGCCA 7080 

TCCATTGAAA GGCAAGTGTT TCGATATTAA TAAAGCGTGT CAACAGGCTT GCATTGCGAT 7140 

TTCTCGTTCT AAATTTACCA ATAACCGCGC CGTATTCCTC ACTAGCAATT AACTCTCCAA 7200 

CTAAATAACG GAGTGCATTC GGTTCAGGTG TATTGTCCGC ATCATAAATG GCAATCACGT 7260 

CCCCCTTGGC TTGCTTGAAA CCAATGTTTA AGGCAT7AGA C7TACCTTTC CCACCTGTTA 7320 

TATTGTCTGT ATTAATAATC TGCAAATTTC GATTAACCGC CTTTTTTTGG ACGTTAGCTA 7380 

AAAGCTCAGC ACTATTATCA CTTGAGTTGT CATTAATCAC AATAATTTCA TAGCGATCTT 7440

GCGGATAATT AAAATTCAAT AACGCTTCAA CTGTTCGCAC AATGACAATT CCTTCATTAT 7500 

GGGCCGGCAC CATAATTGAA ACAAACGGCG CGTCTTCGTC C7CTACAGGT GGTGGTGCGA 7560 

CAGGATCTGT TTtTAAATAA TAAACATAGC CAAAAA7AAC TAAGAGAATA TTTACAATTA 7620

GTAGACTCCA AATAATGATA GTTGCTGTTA ACATTACAAT G7CAGCAATG GTCATTTGTC 7680

TTCCTCCTTA TTAATATATT TCGACCGATA ATTCTT7CGT CCAATCATAA ACAAAATAAT 7740

CAGGATGATT AAAGTCAGCG TGACAACGCC AATCAAAATA TTGTTTGTAA TACTGAAAAA 7800 

GAATGAAAAT AACCCTGTAA ATCGTTGTTT TTCAAC7GTT GAAGCCCCTG TATCAGCCAA 7860

ACCATTGACA ATTTGATTGT TTAAGAAGAA CCGATTATTC CGAAATTCGA TTTTCTGCGT 7920

TTGTGTTTGA ACAGTGAAAT GTTGATCCGC TTCTGAAACA CTAAACACTA TTGGTGCTTC 7980 

CTTGACTTCT TGAATTAGAT GATCCACTTC TTTTTTAGAA T7TGGAAATG AAAAACTAAT 8040
GGTGGTCGGC ATTGGAAATC GATAGTCATT CGGATTATCT TTATCCGTCC ATTCAATCCC 8100 
CGATAAATAA TCATACGGTA AATCAAACAA AGCATTCTTA TAAGTGATTG ATTCTCCCGT 8160 
TTGGTTTCGA TAAAAAACGC GTTCAATCGG CGGATTTTCT CTTAGAATCA CTGTATCACT 8220 
AATTGCTAGA CCATCCTCTT GGTATTGAAG GTCTTGATTC CAGTATCCTG GAGCAGAAAT 8280 
GCCTACTGGA TAGACATGTC TGTCGACCAA TAAACTAATC TGCTGCTCCA TCAATTGTTT 8340 
CAAGGCACGT TGGTCATTTA AATCAACATT GTTAACGACC GGTGTTTCTA AGAAAATGAC 8400 
CCCCGTCGAA GCAAACGCAC GTAACGCATT CGTAAATAGT TTATAAGGTA AAATCGTATT 8460

GTTTTGTGAA ACGCTCGTAC TGGATAAAAT ATAAGGATAC G7GGTTTTAA CTAATTCTTC 8520 
TTGCAAATAG GTTGCCACAG ACATATCTCT TAACGGGTTA AAATCAGAAA TCGTCAAAAA 8580 
CGGCCGTGAC ATTTTTTCAC TTTCCAGCCA TTGACTAATT ACGTCTAAGC TTTCTAAAAA 8640

AACAGCGCCT TTGCGTTCAA ACATTGGTAG AAAAGCATGC CCATTCTCAA TTACACCAAA 8700 
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TGGATATTCT 
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TTCATCGTCA 
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ACTGTCATAG 
AACTTCTACT 
CCCAGCAATA 
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TTGTTTATTT 
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CTAAAAACAA 
CAACTAATAA 
CGTTyCTCTT 
ATAATAGATA 
TAAAAAATTT 
TATTAGGCTT 
ATGGATAATT 
TTTTCATTTG 
TTTGCAACCC 
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GAACGATAAA 
ACAATCGTTG 
CAATAATCAG 
AGATAACTAA 
CAGAAAATGC 
TAAAATAAAT 
AACTTTTTTT 
AAAAACAACT 
AAAGTTTATT 
TCGTTGTTTC 
CGTAGCTATC 
CATTATCTCC 



TCTGGTGCTA 
ACAACTGATT 
CGATATTGAC 
CGCATGTTTC 
TCAGAGGCAA 
CCATCATTTA 
CCAAAACTCG 
TTCAAGCTGT 
AAGCCACAAA 
TTCAATTCCC 
AATAACATCA 
ATGACGTACT 
ATTTAGATAA 
GAGTCCTCTG 
GAAAGCAAAG 
AGATTGtTCC 
TACTAAGTAA 
TAAATAATTC 
TTCAGAAGTT 
ATATTGCGGG 
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CAAATATTCT 
AACTATTCTT 
TAAGACACTC 
CGTATAGGAG 
CAAAGATACG 
AAGATAAACG 
AAGCCAAATG 
ATTAAAATAA 
ACCATTGACA 
CAAGCCATTA 



ATTCTTGTGT 
GATCTTGAAA 
GATGACTTAA 
GACCAATGTG 
TAACGCCGGC 
GCAATTCGCC 
TCAACACTCG 
CATAGACAAG 
AAACGAAAAA 
TCTACCTCCC 
GAAACAATTT 
AAGTCACTAT 
ATTCCTGTCA 
TATAAACCAA 
TTTTTAGGAA 
AGCGGTGCAC 
ACATTTGGGA 
AATAACACCC 
GGTGACACAG 
TTGATTCCGC 
GTCATACTGT 
GTTGTAAAAC 
GACAAGTAGA 
TGGCGTGTTG 
CTGAGCCAAG 
ACAGAAAGTA 
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TCTTCGGTTA 
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GCAACGGTAA 
CGCATAATTG 
CGATAAAGAC 
ATCGGATAAA 



TTTTAATCGG 
AGGCAATACT 
TTCTTTCCAC 
TAATTGTTTG 
TTGTTCAGGC 
AGGTACATAA 
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TAAGACATGT 
GAGTAAGAAA 
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TGAAAAATTG 
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CCAACCGTCT 
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TAATCCAGCA 
AAGTACTACT 
CGTCACCCGA 
CGCTTCTGTA 
AACCAACAAG 
CAGAATATAA 
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TTGAGTGGAA 
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AATGCTTGTC 



GACCTTGCGT 
GGTTTTTTTC 
AATATTGTTT 
GAACTCGATC 
TAGAAATCAA 
CCACACTCTG 
TTTCTTTTTT 
CAGCCAGCGC 
CAATAAGATA 
TCTTGCCACA 
AATACTGAGA 
GAAATGAAAA 
AATCTGGGTT 
ATCACGCCTA 
TTATACACCA 
TTCCCCACAT 
CATAATTCAG 
AAAAAATCTT 
GAACGTCCCT 
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ATATCGTTAG 
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ATAATTACGC 
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TCCCTAAAAC GGCCAAAAGA CTTGGAGAAA TATAACGACT CAACCATTCA AAAGCCGAGT 10740 

TCATGTATTT TTTGATCTTT TCCATGCTTT CACCTCATCT AGAAATCATA GGCCAAAAGA 10800 

GCCATTAAGT TATTATAGGA ATAGACTTGT TTCGTAACTT TATCACCAAA TCCTCCATAA 10860 

ATCGGACTGC TTGAATCCAT AATTTGAAAA TTATTAAGTA CTGTAATTGC CGCACGATAC 10920 

AACTCTTTGT CATTTTCGGC AACACCAATT AAAGCCGCTA TCGCATAACT AGCTGCAGAT 10980 

TGATTTTTAT CCACGGGCGA TCCATTTAAA TCATACGAGT TAAAAAGCGT TCCTTGCTGA 11040 

ACTTGTTTTC GCACAAAATC AATACTAGTC TGCTTCGCCA TACCCACTTC ACTTAAATGG 11100 

ACAATCGTTA ATAAAGATTC AATAATATTA AGTGTTCCGT TGTTTTCGTA TTTATTTGTT 11160 

TGATGATTAA ACTTCGTTTG ATAAAGAGGT AAATCATCAG AAATATAGCC ATCATCTATT 11220

AATTGGTAAT GATACTGCAA GTAGTCTGCT GAAATCCCAA ATTCTTTGTA TAGATAACCC 11280 

AGCGTAATTA AGTCAATATA AAACAAAGAA GTTTCTGAAG ACTGTTGTTT TGATTTACTA 11340

TCGTAAAAAT CAATTAGAAT ATTATCTTTC ATAGAGGTTT TCATAAAACT CTTGCCTAAT 11400

TTTTTAATTT CTTGATCATA CTGATCTGTT TTAAAATGAC CCCCTGCTTC AATTAATGAA 11460

CGAATGATCC GTAAATCATC AATCGAGGCA TTGACATTGT ATTTATTGCC TTGCTCGTCA 11520 

ATTCGATAAC TAAATTGGAC GCCGTCGTAA AAGGTGTCTT TCGTCTGCTT GTAAAAATTA 11580

TCAAAACGTC CCTGCGTATC AAAAGCTAAA TTACGCAAAT ATAGACCCGA AGACTCACTT 11640

AGCAATTCAT GTCCTGTGGC GACATCTTTG CGTTCAGCAT CTGGTAAATA ATTGGTTAAA 11700 

AAGCCGCCGT CTGTTTTGAG CCGGGTTTCA ATAAAAAACT TCAACAAATC TTTGCGTTCA 11760

GCTGTTTCTT TkGTGTGAAA TTCATGTTTT GTTAAAAAAT GTTGTTTGtC TGTCGACATT 11820 

AAGCGTTGAA TTTTTGGATT CACTTTTTCC ACCTCTACTT TATAGGATTT TTCTCTAAAC 11880

AAAAGAAAAT ATCCTCCCAG AAGCACCACG AATACCCCAA TAAAAATCCA AAGTTTTCTC 11940 

TTCATTGTTA CGCCTCTTTT CCTCACTTCT TTATTGTCAG TGTACACAAA AGTACTTGTT 12000 

TTTGCGAAAC TTACGCTACA TCTTCAAGCG CAAGAAAAAA AGCAATCCAT AAAAATAAAA 12060 
GGATTGCTTT AACCGTTTCG TCGATCATTT CCTGTCATCG TAATCTCTTT TGACAGATAA 12120 
CGCACACGTT CCATAATTCG TCTGGCTTTT AGCGGCTCTT GTTCTCCTCG TTGCGTCACG 12180 
GTTAAATGCT CTTCTAAAGC TTTTAAATCT AGATTGGAAG AGAAAAAAGT AACTAATTGT 12240 
TCTTGCATTC GGTATTGCAA GATTACGCTT AAAACGTCGT CACGAATCCA ACTAGTCATG 12300 
GATTCCGCAC CAATGTCATC AATCATTAAT ATAGGGGATT TTTTTACTGC ATCGAGCTTT 12360 
TCACCAACTT GATCGCGACC AATGGCTTGC TTCATTTCTA CTGTAAATGT TGGAAAATGA 12420
ACAATTGTCG TAGTAAATCC ACGTTCTGCT AAAGCATTAG CCATCGCACC TAATAAGAAA 12480
GATTTTCCTA CACCGAAGGG gCCTTGTAAA TAGAGCCCTT TATGAAATTC TTTTGGTGTC 12540
GCAGGATATT CTCTTAAAAA TTGCATTGCT TCTGcTAACG CTTTGGCTCG ACCTTGAGAA 12600 
GATGGATCGA AATCTCGTAA ATTTGCTTCT CGGACATCTT TTGGCATGTC CATTGCCCTT 12660
ACTCGATTAC GAATTTCTTC TTCCTTCTGA CGGGCGATTA AATCAGCCGT TGGGACATAC 12720

GTTACAtCAA TGTAATGAAA ATtCAACATT AATCTtGGCT CATAGCCAGG GGCAATCATt 12780 

GCCGGATCGT TtACTTGAAA TTTCyTTTTT TCyTGAACAA ATTCATACaG CTTAGCATAA 12840 

CTTTTACGAA TATCTTCATC GGTTAAACGC TCTCGGTTTG CTTGTATAAA GGCTTGAACA 12900 

TCTTGATCTT TCAATACTtC GTTAACCAGC TCTTCATATC GTTCATTGAT GTCACGCTTC 12960 

TGGATAATTT TTGACATTTC TTTTCCTACG TCTTCCATCA TTGGTCACCT CCTTGGTTTA 13020 

AAAATTCTTG AATTTGTCGA TCTAACTCGG CTTGTTCTTC TGGGGAAAGT TTTGTTTCAT 13080 

CGTTTGGCTG ATCTGCCCAT TCTGGCAATC GCTCTTGTCG AACAGTTTGG CGTTTCCCTG 13140

AATAATTTTG TTGTCGCTGT TTTtGTTTGG TTTGGCTTTG TTTTGCCAAT TCTCTGACAT 13200 

GTTCAATTGC TTTTwCTGGT GAATGAATGC CaCTnTGTCC CCaTyCATyC GCAATyCGAT 13260 

TCaCATACTC CGCTTTTAAA GTCGGTTGTT GTTGAATATT GTAAACATAG TTAATCAAAA 13320 

TATTAATCAC ACTAGTCGGT AAGCCTGATT TGGCAACCAA ATATTTCACT AATGAACGTT 13380 

CTTGTGGTGT CGTATAGCCG CCTCGACTAT TTTTTAAAGC TTCTAAATAT TGAATAGGCG 13440 

GATTCTTTTC ACTATCCATA ATCATTTGAA TATCCAGTTC AGaAAAACCA TTCATTTTTA 13500 

ATGCGTTATA TCGATACGTT nGTGGCTcTy CCAcCGTCaA TTGTGCTTGC TTGTTTgkCm 13560 

CTACGTTTGT TGaTTGTGGT TTTTTATCAG GATCaAACGC TTGATGAATC GTTCGCACTA 13620 

ATTCCTTAGG CAAAATTCGA CTTGTATAAT AGTCAAATGA TTTCGAAGCA AAGTCTACCA 13680 

TATCTAACTC ATTGATGCCA TATAATTGAT GAAAAACATA GACTTCTTCT TGAAAACTAG 13740 

CCTCATCTTC AGGCAGTTGT AAACCTAAAC GGTGCAAACC TTCCACAAAA TAGTGCCAAT 13800 

CAAAAGAATC CGTTGATACA GAAATTTTCT TCGATTTTTC TTTTTGAGAA AAAGTTTCTT 13860 

GAATCGTTGT CAATCGGTTC GCTT 13884 
(2) INFORMATION FOR SEQ ID NO: 342:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

ATTAATACCT GTATTAATGT CTTTTACAAC TTGAAATTTG ATTTTTTCCA TGGCGACATG 60 

CTTCTTATCC CAGTATTTTG GATTTTTAGC AATCGTCCAT TTATCTGAAA TACCAACACC 120 

GTTTAAATCA GTTAATGTAA ATGCTCCATT CGTTACAATG TTTTCTTTTG TTGTTCCATA 180 

TTCTTTACCG TACTCTTTTA CTGCGTTTTG TTGGACTGGA AAATAAGCAG TCAATGCTAG 240

CAAATCTGTA AAATAAGGTG TAGGTTCTTC TAATTCGATT TCTATTGTTT TGTTTCCCTT 300 

AGATTTTACA CCAAGTGTTT CTTTTTGTTG CTTTCCGATA CTAATCTCTT TTGCATTTTT 360

TATAGCAGCA AACAGTTCTA CGTTTGGCGA GGCAGTTTTA GGATCAAGAA CTCTTTGCCA 420

TGCCGTTACA AAATCATCAG CAGTAATATC TGTwCCATCG GCCCATTTTG CACCGTCTCG 480

AATCACAATT GTATATGTTT TACCAyyATT GGATATTTTT GGTTGCCCAG CTGCAATTGC 540

TGGTTGTGGT TGATTTTTTT CATCTAATCG GTACAACCCT TCCATTACGT TGGCCAACAA 600 

TGTAAyACTT GTTTGATCCA CTGCTGTTGC TGGrTCTAAA CTTTGAACTG GTGTACCTGC 660 

CGATAAAGTT GCTATTTGTT CTACCTTAGT TTCATTTGTT TTTTTTGAAT CTTCTGTTGA 720 

TTTATTTACT CCACAACTTG CAAATACnAA CAACACnAAT ACTAAAACAC CAACTnTTTT 780 

AAGC 784 
(2) INFORMATION FOR SEQ ID NO: 343:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3907 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 



CCCTTnATTA TTCCTATTTT TTGGnGnTAA AAATCAGAAA TTACTTGTAG GATAAGCTGT 60

nnATTAAPTA ATAGGTAAAG TAATTCGTTA TTCAATGGTT TCCCATCTGT 120

CAATATATCG ATAATTCTTT TTACAAACGG AGGGGAATTT GAACAGAAAC GATTGCGGCA 180

ACCATACTTA TTAGTAAGGG TATTAAAATG CATAACATTG TTTTCTTGTC AATAAATAGA 240

TTAAATAAGT TATATTTTTC CTTATGCTAC AGGGCGCTCC CAAGGATAAC GACCGTATTG 300

TTGTTTATAC CAATTAATTT CGTAAGTCGT TAGCTTAGCT GCTAAAGTAT AAGGGGTAGG 360

GAATTTCTTA ATCATCGCTT TAATTTTTGC TAACATTAAT TTTCCTCCTT TCATTGTTAT 420

GCCTATAAGA TTATAATAGA TATAGACAAT TACATTTTAG CATACTTTAA ATAGATTtTT 480

AcTATTTTTG TTAAAATATT GATAATTAAT TTTGAAAGTG GGTGAACACA AATGTTTAAA 540

TTTTATATTT TTATATTTGT AGTATTGATC GTAATATTCT TGTGGTATTT TTATTCATTT 600

AAACCTCTTT TGAACAATAA AAAAGAGGTT ACCCTACTAT CTTTAATTGC AACAATATTA 660

GAATCTCTTT TTTTCTGGCT GTAATCCTTT TAAGGCAACA AAATTATTTA TAACAAATGT 720

TGGTTGTGTT GCAAAGTTTT AAAATAAAGT AAGCTACTTC AATTTTTACT GTTTAGCTTA 780

AGCTGATTTG AATAGAGTAT CAATTTCTTC CCACGGGGAA AAGTCGAAGA GACCAATTTC 840

TCTTCGACTT TCTTTGTATA AAGCAGAAAC AACTTCGATT CCTTTAATCG TTACTGCAGC 900

TGTATAGGTG GACTGAAAgT TGTTGCCATA AGGAAGCTTC CCCTTCAACT GTCGGTGATC 960

CTGTTCAAGA ATATTATTGA GGTATTTCGA CTTCCAATGC TTCACTTTTT GGTATAGAAT 1020

TCCTTCTTCT TTCAGCTCTT TGATCGCTTT TAATGAAGGA GCATATTTAT CTGTTACAAT 1080

GGAACGTGGT TGACCGTAGA CTCTGATTAG ACGCTTGAAA AAGGCTTTAG TAGAGACAGT 1140

ATCACGATGG TTTCGTAGCC ACATATCTAA CGTATTGCCC TTCGAATCAA TGGCTCGATA 1200

AAGATACCGA TCTTGACCTT TCACTTTGAC ATAGGTCTCA TCAATCCGCC AGGATTTTGA 1260

TAAGGATCTA CGTTTATTCC ATAGATGATA AAGGATTTTT CCATACTCTT TGTTCCAGCG 1320

GTAAATTGTG GTATGATCCA CGCCAACACT AAATTTACGC ATGATTTTGG CACAATCACG 1380

ATAGCTCAAA GAAAAGTGAT AGTATAACCC GATGGCTTCT ATAATAATTC GATAGTCATA 1440

CTGTTTTCCT TtaAACTGAT TTTTCATGAC TGTTCCCTCT CGTTCTTCAT AGATTTACTT 1500

GAAATTATAC CGTAAAATAG GGcTAAATAA AACTTTGCAA CAAAACCCAG CTCTTTTCTC 1560

TTGACTTTTC TTTATGTAAA AAGCCTTGTA TTGCAAGCAA TACAAGGCTT TTTATGCTAT 1620

AATAGAATTG ATAGTAAAGT TTAATGACGG TGGCTATCCA ATCGAAAGTG AGGTGGTGCG 1680

TATGAACGTA CTTCCAAAAT CtATAGAAAG GAGAAGCCTT TTGTCCGTTT CGGAAAGTTT 1740

GCAACTAATG CyTGCGTTTG GTGGGTTTAC ATTGACCCTA tCcACAACCA TTGTTGCCAy 1800

cTGAcTATAA AGACAAAAAG AAATAACCGT CACTtTAGCA GGAGACCGGT TATTTCTAAA 1860

TAACTTATAA AAACTCTGCC ACCGTCTTTA GCGGTGCTAT CAATGAGGGC GTATTACCAG 1920

TACGTCCTCT TTACATAATT ATTATACCAC ATTTTGTGCG AAAAGGTAGG CGAATGCCTA 1980

CCTTTTTtCA TGCGCGCCTA AACGACGGTG GCGGTGGGAG TGTGGCGCGC ATGAnAAAAC 2040

AGATCTATTG ATTGTAGCCA TATTCAGACA ACCCGTTCAT GGACGGTTGT TGGTAGACAT 2100

TTtGTTGTGT TGGTGCTTGA TnCATTGCTG GTTGtTCATT AACCGTTGGG GTTGATTGTC 2160

CTTGTGGGAC TGCGTGTTGT TTTTCTTCTC GTTTTTGTAC GGTTTCTTTG GACTCTAAAA 2220

ACTCAAAATC TTCTACAACT AATTCTGTCA CATAATGCGT AACGCCTTGT TGGTCATCAT 2280

ACGAACGAGT TTGTAGTTCC CCATTTACAC CAACTAATGC GCCTTTTTGC GTGTATTTTG 2340

CAAAGACTTG GGCGGGTTTG CCCCATAAAA CACATTGAAT AAATGTGGTC GGCACATTTC 2400

CGTTAGTGTC TGGTCTTTTT CGCTTACAAG CCAATGTAAC GTTTGCGACA CTCTTTCCGT 2460

TAGTGGTGGT ACGTAGTTCG GCTCCTTTTG TTAAACGCCC GACAGCTGAT AATTGAATCA 2520

TGATTGTTTC CTTCTTTCTG ATTTTTTATT TATATCATGC CTTTTTTAGT GGCAATAAAT 2580

ATCTTGTTAA GTGGAAGTTT TTTTGAAATG AGAGTGATTT GCCTGTCTAG GCAAATCACC 2640

TCCTTCTATC TGCCATGCCC TTATGAGGGC ATGGCTTTTT CAATCTCTAT TTGCCATTGT 2700

TCAATGGCTT CTTCTACTTC TTCTGGTACG TGTCCCCGAT TGTATTTCCC TTGCACTTGT 2760

ATCACTCTTT TTTCACGGTA TTCTAAGGTA AAATATGGTT TATCTGGTTC TTCTTTTCTT 2820

cc r* a T A Zi A & h. TTTTCCTTTT u 1 n 1 O 1 1 O 1 1 CAAGGTAATG TTGGCTACCA 2880

ACACAATGGT GCAATACCTG CCCTTCCTCC ACAATTTCTT GCAACGAATG TGGGGCGAGA 2940

AAAAGTAAGT CGTCAATTTC GGTTTCTAAT GCTTTCAGTT GTTTTTTTCG TTCTTGGTAT 3000

TGCTGTTCTT CAAGCTCTCT TTCAAGTAAA TTCAACGTAT TTGTGGCATT GTCATGGGCT 3060

AGTTGCAAGT TTTTAGGGAA AAGCACATCA TCAGTAAATG GTGTATTCAG TTCCTCTAGC 3120

ATGTGTAAAT AGTCCATATA GTAGCTAAAA CGCTCGCCTT TTCGCATAAA CCAATGTTGA 3180

AACTTGGTTA AGTTCACTTC TTTAGGTAAT TGCTTAATTT GTGAATAATG TAAATAATTT 3240

TCAATACCTT CTATTCTCGC AACTTTTCGC TCCTTCAAAG TTTTTTCTAG CATAAGCATA 3300

TGAAACGTTG GGTTGGTGGT TTTTAGTCGC TGTTTATTGG CTTTTAGCCA GTTCTTCGTA 3360

ATAATTCGCA TATCTACTTT TTTATGCCAT GTGCCAAAAA CATTTCGTTC GTCATTAAAA 3420

AGTATTTCGT TTGCTAGAGT AGTTGCGCCT ATCTTTTGCA AAAACTCAAT TTCATTGCGG 3480

TATTTATAAA TATGCGCCAG TTCTTGGCGA CTAATTACAG GTAACTTGAG GTATTTCAAT 3540

TCCGAATTGG TCGTTAGATT TTCTTGCCAG TTATTTGGAT AAAAAACAGT GTTCGTGTAA 3600

GCAL 1 AL* i L-A t n. t 2x a en n t a r\ 1 1 J. l r\t\\^ AAGCCCTCAT GCCAATGTCC GTTCTCATAA 3660

TGGGCTTTAA CATGGGTATC GTTTGAAAAC CGTTCaAGGT TTGACGGCTC ATAGATAAGG 3720

GTTTCTGTTC CATCTGTAAT CGTCTGCCAA TAACTATGGG TTTGAATTTC AATTCGtTTT 3780

GCACGAACGA GAATGATTCC AAAGGAGTAA AATTCGTTGG AAACGATAAT CGTGAGTATT 3840

TCGTTAATCG TGTTTTTATG GTGGGGGAAT TCTTGCGAGA ACTCGCAAGA ATTGTCTCCT 3900

TTnnGTn 3907



(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 
TAAAGAGTCA AACAGAA^C TTTTTTAGCT GCTTCTTTAT TGCTATACTA CAGAGGGAAA 60

AAACGAAAAG GAGTATGAAA AATGTTTTTT AAAGAAATTC ATCATGTGGC AATCAATGCC 120

TCAAATTATC AAGCAACTAA AAATTTTTAT GTCGAAAAGT TGGGCTTTGA AGTTCTTCGC 180

GAAAATCATC GTCCCGAAAA AAACGATATT AAATTAGATT TAAAATTAGG CAGCCAAGAA 240

TTGGAAATTT TTATTTCTGA TCAATTTCCC GCTCGCCCCT CTTATCCTGA AGCTTTAGGA 300

TTACGTCATT TAGCTTTTAA AGTTGAACAT ATTGAAGAAG TGATTGCyTT TTTAAATGaG 360

CaAGGAATTG gAAACAGAAC CGCTACGAGT AGaTGaTTTT aCGGGTAAAA AAtGAcCTTC 420

tTTTTGAtCC AGATGGACTA CCyTTAgAaC yTCCwgGAAT AATAAATAAA GGGAGAGnAA 480

nC 482

(2) INFORMATION FOR SEQ ID NO: 345:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22960 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:

ACAGTTCGgT CCCTaTCCGT CGCGGGCGTT GGAAATTTGA GAGGAGCTGT CCTTAGTACG 60

AGAGGACCGG GATGGACTTA CCGCTGGTGT ACCAGTTGTT CTGCCAAGGG CATTGCTGGG 120

TAGCTATGTA GGGAAGGGAT AAACGCTGAA AGCATCTAAG TGTGAAGCCC ACCTCAAGAT 180

GAGATTTCCC ATTTCTTTAA GAAAGTAAGA CCCCTGAGAG ATGATTAGGT AGATAGGTTG 240

GAAGTGGAAG GCTAGTGATA GTTGGAGCGG ACCAATACTA ATCGGTCGAG GACTTAACCA 300

AAGAATGGAT AAGTAAAAGC AACTTGGTTA TTTTGATTCA AACTTCAATC CAGTTTTGAG 360

TGAATGAAGA TTCAACTCAA TAAGAAACAA CACCCAGTGT GGTGGCGATA GCGAGAAGGA 420

TACACCTGTT CCCATGCCGA ACACAGAAGT TAAGCTTCTT AGCGCCGATT GTAGTGAAGG 480

GTTTCCCTTT GTGAGAGTAG GACGTCGCCA CGCGAATTGT TGTTTTTGGA GGTTTAGCTC 540

AGCTGGGAGA GCACCTGCCT TACAAGCAGG GGGTCAGCGG TTCGATCCCG TTAACCTCCA 600

TTTTTTGAGC CGTTAGCTCA GTTGGTAGAG CATCTGACTT TTAATCAGAG GGTCACAGGT 660

TCGAGCCCTG TACGGCTCAT ATTCATATTT CATTTGCGGG TGTGGCGGAA TTGGCAGACG 720

CACTAGATTT AGGATCTAGC GCCGCAAGGC GTGGGGGTTC AAGTCCCTTC ACCCGCATAT 780

TTAGCCGGCT TAGCTCAGTT GGTAGAGCAT CTGATTTGTA ATCAGAGGGT CGAGGGTTCA 840

AGTCCTTTAG CCGGCACCAT TTTGCGGAAA TAGCTCAGTG GTAGAGCACC ACCTTGCCAA 900

GGTGGGGGTC GCGGGTTCGA ACCCCGTTTT CCGCTTAGCC AATTCATTAC GCCGGGGTGG 960

CGGAACTGGC AGACGCACAG GACTTAAAAT CCTGCGGTGA GTGATCACcG TACCGGTTCG 1020

ATTCCGGTCC TCGGCATTGT TTTATATTTA GCACCCATAG CTCAACTGGA TAGAGTGTTT 1080

GACTACGAAT CAAAAGGTTA GGGGTTCGAC TCCCTTTGGG TGcATAGTTG AGTGATTGAC 1140

TAAGTTTTAA AAAGGTGCAC GTGAGTTTCT TTTTAATTCG ATAGTCTTTT TTTATTACTT 1200

TAACTTCATA GCCCGGGAAG TAGCTCAGCT TGGTAGAGCA CTTGGTTTGG GACCAAGGGG 1260

TCGCAGGTTC GAATCCTGTC TTCCCGATTG TGAAAAACAG TACTGATCAT CAGTACTGTT 1320

TTTTTTTTTT GTTTAAAATT TAAGTATCGA AGTAGGTGTA TTTTTTTGTG ATGTGTAATT 1380

CTTATCGAAA TTGCGACTAA AAATACAGGA ATAAAGGACA TTGAGAATAT ATTCGATGGC 1440

TAGTTGAGAG GAAAAGGTTG AGACTTTGAA ATCTGCATGT TCAACGTTAG GAATGGGCAA 1500

AATCACTGTG GCGTATTTAG CTAATTGGCT GTTCAAGTCT GAAGTAATGG CAATAATCGG 1560

GGTGCCTCCT TCTGCAAAAA TCCGTGCTTC ATTTACGATT TCTGCAGTGT TACCACTGTA 1620

AGAGACGAGG ATGGCTACGT CATTTTTGGT GGCATTGATG GCTAAATAAA ATTGTTCAGC 1680

TTGATAAGGG ATTAAATGGA TAAATAGCTG AATGCGTAAT AACTTTGTTT GAAAATCCGT 1740

TAAGCGAATG TAGGAGTTAC TGACACCCAC TGCTAAAATA TTTTCAGCGG TTAAAAGTAA 1800

ATCAATGCTT TTATTTAACA TGGCCTCGGT TAATAACTGT TGGGTGGATT GAACAGTTTC 1860

AGTCGTTAAC TGCGCAATTT TCTTGGCAAT TAGTAAATCA GAATCTTGCG CTAAAAAAGG 1920

CATATTCGCA TCAACATTCC GGATTGCTTG GTAGTGAATT TCTAAGTCGC GGGACAAGCG 1980

GATTTTAAAT TCTTTGAAAC CGCTAGTCTC TGTTTTTTTA CACAAACGAA CAATCGTGGA 2040

AGTTGAAGAG TTGGTGGCTT CTGCTAGTTC ATAAATCGAC ATATGCAGGA CTCGTTCAGT 2100 

ATGAGATAAA ATAAAATCAA TAATTAGCCC TTCGTTATAT GTATAGCGTT CTTTTTGTTC 2160 

TAAACATTCG ATTAGATTCA ATGGGTTCAC CTCTCTGTGG TACTTTGTTT CGCTTTCAGT 2220 

ATACCTTAAG CGTATGAATT ATGAAACATA GTTACAGAAT TAGTTAGTCG TATTTACATA 2280 

AGAAAGCGGT TTCTTTATGA TGAGGATATC ACAAAGGATT AAGGAGAGAA GTGAAATGGA 2340

TAAGTTAACA AACTGGGTGG AGCAAGCCGT AGTGCCTAAA GTTTCAAGGA TTACAAGTTT 2400

ACGTTATTTT CAAGCCTTAA GAAATGGTTT TTTTGCGATT ATGCCGTTAA CAATTATTGG 2460

ATCGATTTTT ATGTTGATTA CAGATTTTCC AGTAGCAGGG TATGGCGATT TTATGGCTGG 2520 

AATTTTCGGT GCAGGTTGGG CGGATATGAT TTCACCCGCT TATCGAGCAA CTTTCAACAT 2580 

GATGGGAATT ATTTTTGCCG GGACGATGTC CTATAAATTG GCGGAAAGTT ACGAAATGGA 2640

TCGTTTAACT TCATTAATTT TAGGAATTGT TGCTTACGTA GTCGTTTTAC CGAAAACGGT 2700

GACCACTGAA TCAGGCGAgT TGTGACGAAA GTTTTATCGT TTGATTGGTT AGGAACGCAA 2760 

GGGGTGATTA CGGCCATTAT TATGTCAATT CTTTCGGTTG AGTTGACTCG TTTCTGTATT 2820 

AAGAAAAAAT TAGTCATCAA AATGCCTGAC AGCGTGCCTT CTATGGTGAG CCAAGCTTTT 2880 

AGTGCATTGA TTCCTGGGAT TTTTGTGGTA GCGGTGGCTT TATTGATTAA TGGGATTGGG 2940

GTTTCGTTTG CGGATTCCTT CCCACAATTA ATTTATGCGG TGATTCAAGC GCCACTGCAA 3000 

GGTTTAATCG GAACCCCTTT TGCAATTATT ATTGTGGCTG GCTTAAATGG ACTGTTCTGG 3060 

TGGTTCGGGA TTCATCCAAC TGTCATCAAT TCCATGCTAT ATCCTATTTT ATATGCGAAT 3120 

GCAGACAAAA ACCAAAGCTT AGCGGAATTA GGACAATTAA CTGCTCAAAG TGGCAACTTT 3180 

GGCACGGTCC AAATGCTCGA TCAATTTGCT ACAATTGGAG GCGCTGGTTG CACGATTGGC 3240 

TTGGCTATTG CAATGGCCAT CGTCGGACAT TCATCTAGAA TGAAAGCCAT GTCGAAAATT 3300 

TCGTTTGTTC CAGCATTTTT CAACATTAAT GAACCCTTGA TTTTTGGACT CCCAGTTATT 3360 

TTTAACCCTT TATTATTGAT TCCGATTGCG GTCGCGCCGA TTGTATCGGT CTTGATTGCC 3420 

TTTCTGTCAA TGAAAATTGG GTTCATGCCA ATGTTTACAA ACATTCAAGC ACCATGGcTA 3480 

CGCCCTTTCT GTTTTCGGGC TTTTTAGTGG GAAAATGGCA AGGTGCAGTG ACGCAAGTTT 3540 

TAGCTGTTGC CGCTAGTGTC GTTATTTATT ATCCATTTGT AAAAGCCTTA GATCGACAAT 3600 

ATCAAAAAGA AGAAGCCCAT GAAGTACCAG AAACATTGTA GAGAAAGAGG AGTGAAGGAA 3660 

ATGCATGTTG AGAATGATGA AAATATTTGC TGTGAAGTGG AAAAAGGAAT GGAAAAAATG 3720

TATCATCCGA TGGTGCCCGA GTTGGTTCAA ACCTCTTCCT TTTATTTTCC AACTTACGAA 3780

GAATTTATGG CAGCGAGTAT AGATGAAAAA AATAATTATG TTTACACACG TGGCACCAAC 3840

CCCACAACAG AAATTTTAGA GAAAAAAATT GCTCGTTTGG AACGTGGAGA AAAATGTAAA 3900 

GTTTTTGCCT CAGGAATGGG CGCTATTTCG GCAACGCTGT TTACGTTGCT CCAACAAGGT 3960

GATCATGTTT TAATGGTCAA TACAATTTAT GGAGAATCTG TTTCGTTTGT GCAATATTTA 4020

GAGAAATTTG GCGTATCCTT GACCAAAGTA GACGTTGCAG AAACAGAAGA ACTTTTTCAA 4080

TTTGTTCAGG AAAACACCAA AGTCATTTAC TTTGAAAGTC CGTCCTCACA AAAGTTTGAA 4140

CTGCTAGATT TAGAAAAAAT AACTGCGTTA gcACAAAAAA TTGGCGCATA CACTGTCATT 4200

GATAACACGT GGGCGTCACC GTTATTTCAG CATCCTTTAT GCCATAATGT GGACCTGGTG 4260

ATTCATTCTT GTTCCAAATA TATTGGTGGT CATTCGGATA TTGTAGCGGG GGCTGTCATT 4320

GGCAAAGCAG CGCTTGTAGA TCAAATTTTT GAACATGGTC ATCAAGCATT AGGCGCAGTC 4380

AATAGTCCCT TCAATTCTTG GTTAGCGTTA CGAGGTATTC GTACCATGCC CGTTCGGTTG 4440

GCGCATCAGT CTCAAGCCAT CCAAACGGTG CTGTTGGCTT TGCAAAAGGA TCCGCGAATT 4500

AGTCGCTTAT ATCATCCTTT TGTAGGAAAT CAACAACAAC AAGCGTTAGC AGAAAAATAT 4560

TTGACAGGTT ATGGCTCATT GTTTGCGATT GATTTAAAAG ATACCGATTT TGAACGTTTG 4620

AAAACATTTG TGAACGCATT GACGCTGGTC ACAATTGGGG TCAGTTGGGG TGGCTTTGAA 4680

AGTTTGGCCT TGCCTGTATT CAAAGGCAAT AACTTGGCGG CTGTTCAGAA AAGAGGCTTG 4740

TCGCCAGCGC ACATTCGCAT GTACGTTGGT TTGGAAGAGC CAACATCTAT TATCGAAGAT 4800

ATTCAACAAG CCTTGGATCA AGCCTATGGC GAAGAATCGC TTTTGTGAGA CATTAAGATA 4860

ATCAGCTGAA CAAAAGCAAC TCcTTCGGAA ATAAGCCGAA ATTCTCCAAA AATTAAAGAA 4920

CAATTTTCGG AAATTTCTTC TTATTTCTTG GAGCTAAACA CTTTTGTCTC AGCCTCTGTG 4980

ATTCAATAAC ATATTTTCAT TTTTAGGAGG GGAACTTAAG TCAGTTTGAC TGAGGTTGCT 5040

CCTCCTGTTT TGCATTTTTT TATTAAAGTC AAAGCGTTCC TCTTTATTTA ATAGGCaAAG 5100

TTCGATATAA TAATGGTCAA TATTAGTCAA GAGAGAAGAA GGATGGAAGA CTATGAGTAA 5160

TCaAAATACG TCAGATTTAA TCGAGGCTTA TTTAAAAAAG ATTTTAGAAG AAAGTAATAA 5220

AATTGAAATC CGTCGGGCGG AAATGGCTAA TTTATTTAAT TGCGTTCCTT CACAAATTAA 5280

CTATGTAATT AATACACGTT TTACCATCCA ACGAGGcTAC GCAGTTGAAA GTAAACGTGG 5340

TGGAGGCGGC TATATTCGCA TTGTCAAAGT ACAAATTTCA GATAACGATC AATTACTAAA 5400

ACAAATGGAT CAATTGATTG GCGCAACCTT AACAGAAAAA GATGCGCTAA CATTTATTCA 5460

AACATTGTAT GAAGAAGAAG TGATTACGAA AAAGGAAGGC AATTTAATGC TAGCTGCTTT 5520

AAGTAAAAGC ACATTGAATG GTCTGGGGAA TCACGAAGAT TTTCTAAGAG CTCAAATAAT 5580

GCGTTCATTT TTAGAGCGCT TGAGCTATGA GGAGGAATAA CTATGGATGA ACTATTTACA 5640

GAAAGTGCGA AAGCCGTACT GGCGATTGCC CAAGAAGAAG CGAAGTATTT TCGTCACCAA 5700

TCCGTCGGTT CTGAACATTT ATTATTAGCA CTCGTTTTAG AACCAAATGG GATTGCCGGA 5760

AAAACATTAC GACAATTGAA TACTGACACA GAAGACATCC GTGAAGAAAT TGAGCATTTA 5820

AGTGGCTATG GCACCATGCA AAGTCCAATG GGAAATAACA ACTTATACTT ACCTTATTCA 5880

CCAAGAGCCA AACAAATCTT TGCCTATGCT GGAGATGAAG CGAAgcsCTT AGGGGCTCAA 5940

AAGATTGGTA CCGAGCATTT ATTATTAGGC TTGTTAAGAG ACGAAGAAAT TTTAGCGTCT 6000

AGAATTTTAG TGAATTTAGG ATTAAGCTTA TCCAAAATGC GTCAATTATT ATTGAAAAAA 6060 

ATGGGCGTTT CCGAACCCAA TGGCGCACAA CGTCGTCGGA ATGGTCAAAA TAAAAATGCG 6120 

CCACAAGGGA CGCCAACATT GGATTCTTTA GCAAGAGATT TAACCAAGTT AGCACGAGAA 6180 

CAAAGCTTGG ATCCAGTAGT TGGTCGCGGT ACCGAAGTCC GTCGTTTGAT TCAAATTTTG 6240 

AGTCGCCGTA CCAAAAACAA TCCTGTCTTA GTCGGTGAAC CAGGTGTTGG GAAAACAGCG 6300 

ATAGCAGAAG GGTTAGCCCA GAAAATTGTC AACCGAGAAG TCCCTGAAGA TATGCAAGGC 6360 

AAACGTTTAA TGATGCTAGA CATGGGTGCC TTGGTTGCTG GAACCAAATA TCGAGGTGAG 6420

TTCGAAGATC GCTTGAAAAA AGTTGTCGAT GAAATCTATC AAGATGGACA AGTCATTTTA 6480 

TTTATTGACG AATTGCATAC CTTGATCGGT GCTGGCGGTG CAGAAGGGGC GATTGATGCT 6540 
TCAAATATTT TAAAACCTGC TTTAGCACGT GGCGAGTTGC AAACAATTGG TGCGACAACT 6600

CTGGATGAAT ACCAAAAATA TATTGAAAAA GATGCCGCTT TGGAACGTCG TTTTGCCCGT 6660 

ATTCAAGTAG CAGAaCCAAC CCCAGAAGAA GCAGAAGAAA TTCTAAAAGG CTTGCGTTCC 6720 

CGCTATGAAA AACATCATGG CGTAGAAATT ACAGATGAAG CTCTACATGC AGCGGTTCAA 6780 

TTATCGATTC GCTATTTAAA TGACCGCCAA TTACCAGATA AAGCAATTGA TTTAATGGAC 6840

GAATCTGCAG CAAAAGTTCG TTTAGATAAG GCAGATCAAC CTTCAGAAAT TAATGAATTG 6900 

CGTACCGAAA TTTCTCAATT AATTACGGAA AAAGAAGAGG CCATTCAAAA CCAATCATTT 6960 

GAAAGTGCTG CTCGGATTCG TCAAAAAGAA AAACAAGTGA TGGAAAAACT AGAAGAATTG 7020 

ATTGCAGTGA AAGAAAAAAG TCTGTCTGGT TACAGTACAC AAGTAACAGA AGAAGACGTG 7080 

GCTGGGGTTG TTTCACAATG GACAGGCGTT CCTTTGCAGC AGCTAGAGAA GAAAGAAAGC 7140

GAACGATTGA TGGAATTAGA AACCATTTTA CATCAACGGG TCGTTGGCCA AAATGAAGCA 7200 

GTTGAAGCTG TTTCACGCGC TATCCGTCGT GCGCGTAGTG GCTTGAAAGA CCCAGCAAGA 7260

CCGATTGGTT CCTTCATGTT CTTAGGACCA ACAGGGGTTG GGAAAACAGA ACTTGCCAAA 7320 

GCCTTAGCCG AAGCGATGTT TGGCAGCGAA GAAGCTTTGA TTCGCGTGGA CATGTCTGAG 7380 

TTTATGGAAA AATACAGTAC TAGCCGTTTA ATCGGTTCTC CTCCAGGTTA TGTCGGCTAC 7440

GAAGAAGGCG GTCAACTAAC TGAAAAAATT CGTCAACGTC CTTATTCCGT TATTTTGCTA 7500

GATGAAGTAG AAAAAGCCCA TCCAGATGTC TTTAATATTT TATTACAAGT CTTAGATGAT 7560

GGCCATTTAA CGGATGCCAA AGGACGAAAA GTCGATTTTC GTAACACGAT TTTAATCATG 7620

ACCTCCAATT TAGGGGCAAC AGCGATTCGT GAAGAAAAAC ATGTTGGCTT CAATGTGAAA 7680

GACATTTCTA AAAATCATGA ATTAATGCAA AAACGAATCA TGGAAGAATT GAAAAAAGCC 7740

TTCCGTCCaG AATTCTTAAA CCGAATTGAT GAAACGGTTG TATTCCATTC CTTGAAACAA 7800

GAGGAAATTC ATGAAATCGT AAAAATTATG AGTCAATCTG TCGTGAAACG GATGGCTGAA 7860

CAAGAGGTTA AAGTGAAAAT CACACCAGCT GCAATTGAAG TTATTGGTAA AGTCGGGTTT 7920

GATCCAGAAT ATGGCGCACG TCCAATTCGT CGGGCCTTGC AAAAAGAAGT GGAAGATCGT 7980

TTAAGTGAAG CCCTTCTTTC TGGTCAAATT CAGTTAGGGG ATAAAGTAAC TTTAGGCGCT 8040 

AGCAAAGGCA AAATTACGTT AAATGTGCGG GCACCAAAAG CACCAAAGAC AGAAGCGAAA 8100 

GAATTACAAA CTGTATAGAT GAGATGCAAA GGAGGTCGGG ACAGAAGTGT TTAACTCCGA 8160 

GAAATAAGAA GGAATT'iCCG AAAATTGTTC TTTAATTTTT GGAGAATTTC GGCTTATTTC 8220 

CGAAGGGgTT GCTTCTGTTC TCGCCGTGTA TCAGTTTTTG AGCGTGGAGC AAAAATCCAA 8280 

AGTGATTTTT GTCCCACGCT CCTTTTTCTA TTTTCCAGTG GCGATTTCTT TTTGAGAATT 8340 

GCCAGATTGA TTTTTCTGTG CTACGTTAAT AAGGAGGGGT GATGGAATGG AACAATCTTT 8400 

TTTTCGAGAA TTACTACATT ATCAACGTCC AAAGGATAAT CCTTTTCGAC TAGTGTTTGC 8460

TGGCTTGGTT ATGTTCAGCA TTCTTTTTTT GGGCTATATC AATCATCAAT TATTGATTTC 8520 

AAGCTTTGGC TCGTTAGGTA TTTTTACTTT TTTATATTAT CAGCCGCTAC CATTAAAACA 8580 

GTTAATGACT CGTTTATCTG TGGTCGGTAG TTACCTTTTT TTAGGAAATT TGTTAGGCAT 8640 

GTTATCGACC CATATTGCAT GGCTTATTCC CATTGTTGTT GCACTGGTTG GTTTTGGTGG 8700 

CCGCTTTTTC TTTGAAGTCT ACGATATTTC AAAACCAGGG GCCTTCTTTG GTGTGATGGT 8760

GACAGCGATG GGTGCCAGTA CGAGCATTCC GCTAGCAAAA ATTCCGTTCA TGAGTGGCTG 8820

TATGTTATTA GGCATTGGCT TTTCCATTTT GTTTGCACTA GTGTTGCATT TTACAGAGAA 8880

GGAAATCAGT GTCCCGCTAG CTAAGCGATC ATTACGGGAA CGAATTTATC AGCATCCAGA 8940

AGCGCCATTA GATAGCATAT ATTATAGCTT TGTTTTATTT TTTGCTGTAT ATATCAGCGA 9000

AAGTTTACAT TTACAAAATC CGTATTGGTT AGTCGTTTCC TGTGCATCCA TTCTACAAGG 9060

AGACAATTTA CGTGCGATTA AGCAGCGAAA TATTCAGCGA ATTTTTGGCA CAACGATTGG 9120

ATTGGTTATT TCCGCTTTTC TGCTTAATTT AGCTTTGACA ACTTTTGAAT CGATTGTCGT 9180

GATTACAATC TTATTTGTGA CCGTTGAGTA TTTTATTCGT CGTAATTATG GATTGGCGCA 9240

ATTTTTCACA ACGCCTATGG CTCTGATGCT TTCATTGCTC GTTCGTCAGC AGTACGTGAT 9300

AACATTGATT CAATTCCGCT TTTTGGGGAT TGTTTTAGGC AGCTTGTTAG GGcTTAGCAG 9360

CAGCGTGGCT ATTCACCGTT GTAGTAACGT TTTATGAACG AAAATATCAC TTGAACGAGT 9420

GAnGAAGAAC AGATGACCAG TAATTAAAAA AAGAGCATTT GGTAGGTAAG CGGATGGCGC 9480

TTTTCTACCA AAATTTTTTC GGAAACATTG TGAAAAATGT ACGTTTAATA CAGATAGGCT 9540

CTTTTAAAGT TTGTGAACGT ATGATACTAT TAAACGGTCA TAAAGAAGGA GGCTATATTT 9600 

ATGATTGAAC TGATTGCAAC TGCAGAATCT gTTGAGCAGG CTGTAGAATT ACTCGCAACA 9660 

GGTGTGGATA CATTATATAT TGGAGAAGAA ACATTCGGGT TACGTTTGCC CGTATCTTTT 9720 

ACACGAGATG AACAACGCGA GATTGTGAAA CGTGCACATG AAATGGGCAA ACAAGTGCTA 9780 

GTTGCAGTAA ATGGCATTAT GCACCCTGAA AAAATGAAAT TAGTCCCAGA GTATCTGACA 9840

TTTTTAAAAG AAATCAACGT CGATAAAATT ACTTTGGGTG ATCCGGGTAT TGTGTTTATT 9900 






ATGCAACGAG ATGGATTAGA AATTCCTTAC GTTTATGATG GCGAAACCTT GGTAACGAGC 9960

TCACGTCAAA TCAATTTTTG GAGTAAACGA GGAGCCATCG GAGCTGTCCT AGCTCGGGAA 10020

GTACCTTTTG AAGAAATGGT CGCAATGGAA GAAAACTTAG CAGTTCCAGC TGAAATTCTT 10080

GTCTATGGGG CGACCTGTAT TCATCAATCG AAACGCCCAT TAATTCAAAA TTACTACAAT 10140

TACACAAAGA ACGACAAAGG GGTTACGAAA GACGAAGGGC TGTTTATTTC TGAACCCAAA 10200

AAACCAGAAA CCCACTATTC CATTTATGAA GATAGTCATG GTACGCATAT TTTTGCCAAT 10260

AACGATGTGA ACTTGATGAA TGAGTTAACA AACTTGTATG ACCATCACTA TCGTACATGG 10320

AAGCTCGATG GCCTTTACAC GCCAGGTGAA AATTTTGTCG CGATTGCTCG CTTATTTGCT 10380

GAAGCGAAAG CAGCACTTGA AGCGGGAACT TGGTCAGAAG ATCAAGCAGC TCATGCAACA 10440

GCTGAAATTG AAAAATTACA TCCAACAGGT CGTGGCTTAG ACAATGGTTT CTTTGATTTA 10500

AGTGCCGATG CAATAAAATA AATTGATTCA TTTAAGGGGA AGAAGAACAT GAAAGTAATG 10560

ACAGAACGAA CATTGAAGCG TCCCGAGGTG TTGGCTCCAG CCGGGACGCT AGAAAAACTA 10620

AAAGTCGCAA TTCGTTACGG TGCCGATGCC GTTTATATCG GAGGGAATGC CTACGGTCTG 10680

CGTACCGCGC AGGAAATTTC ACGCCAGAAG AAATGGCGGA AGGCGTAcTT TGCCCGCGAA 10740

CATAATGCGA AAGTTTATGT CGCTGCCAAT ATGGTCACTC ATGAAGGAAA CCAGGAAGGG 10800

GCAGGAGCGT TTTTCAGAGA ATTGCGAGAC ATCGGAATTT CAGCTGTGAT TGTTTCCGAT 10860

CCAGCTTTAA TCGAAATTTG TGCGACAGAA GCACCAGGTT TACCCATCCA TTTATCGACA 10920

CAAGCTTCAG CAACCAACTA TGAAACATTA GAATTTTGGA AAAATGAAGG CTTAGAACGT 10980

GTGGTTTTAG CGCGCGAAGT TTCAATGGAG GAAGTTGCAG CGATTCGTGA AAATACGGAC 11040

ATTGAAATTG AAGCCTTTAT TCATGGAGCA ATGTGTATTT CATATTCAGG ACGTTGTACT 11100

TTATCCAACC ATATGTCGAT GCGGGATGCC AATCGTGGCG GTTGCTCTCA ATCTTGTCGT 11160

TGGAAATACG AGTTATTTGA TATGCCTTTT GGGACAGAAC GTCGGAGCAA AACCAGCGAA 11220

GGGGAAGTTG AAGAAGAGTT TTCAATGAGC GCCGTAGATA TGTCTATGAT TGAACATATT 11280

CCAGAATTAA TTGAAAATGG TGTCGATAGT TTTAAAATTG AAGGCCGCAT GAAATCCATT 11340

CACTATGTTT CGACAGTCGC TAATGTGTAT AAAAAAGCCG TCGATTCTTA CATGGAAGAC 11400

CCTGAAAACT ATGTCTGCCA ACAAGAGTGG ATTGATGAAT TATGGAAAGT AGCACAACGT 11460

GAATTAGCAA CTGGTTTTTA TTACAATACA CCAAGTGAAA ATGAACAATT ATTCGGCGAA 11520

CGTCGTAAAA TTCCGCAATA TAAATTTGTT GGTGAAGTGA TTGCGTACAA TGAAAAGACG 11580

CAAGTCGCAA CGATTCGTCA ACGGAACCTT TTCAGTGTCG GCGATGAAAT TGAATTTTAT 11640

GGTCCTGGGT TTACCCATTT CCACCAAACC GTTAAAGAGA TGTATAATGA AGACGGTGAA 11700

GCGATCGATC GGGCACCTAA TCCCATGATG TTATTAACAA TGGAAGTGAC GCAACCTGTT 11760

GCAGTCGGCG ATATGATTCG TAAGAAAAAA TAAACCAGCC AAAAGGAGGA GTAATTATGA 11820

GTGTCCGCGT GAATAATTTA TTAAAATTAC CGTCTTTGCG AGAAGCAACG GTACTTTCTG 11880

GTAAAAACCA ATTGGACACA TCTGTTGCTT CTCTTTCTTT TTTAGAAGTC TCTGATATGT 11940

CTTTGTTTTC GGAAAAACTT CACAAAACAA ATGAATACCA TGCTGGCGAA ATTTTGATTG 12000

GTTCATTTTG TGCCATTCGT CATGATGTGG TGAAGCAATG CGAAACGATT CGGCATCTCC 12060

ATGAATTAGG CGAAGTAGGC CTAATCTTGT ACTACGTTGG GATTATCGTA CCGAAAGTTG 12120

CAGAGGAAGT GATTCAATTA GCGGATTCTT TAGGTTTCAT TTTAATCCAA ATGCCTAAAA 12180

ATGATCCCAC ATTACGCTAC AACGAAGTCA TTTATGAAGT CATGAAATTG TTAGTCAATA 12240

AAAATAAAAT GGCTTCTTTT GCAACGGACC TCTTGGAAAA AGTTTCCCTT TTGCCTGAAC 12300

AAAACCGCAG TGTGGAAATG ACCTTGAAAA TCCTATCCGA TTTTTTAAAA GTCAATATTG 12360

CGTTAACCAC ATATACCAAT GAAATCATCA GCACCATCAA TTGGCCGCGA AgCACGAAAT 12420

TGCCCTTAAC CAAGTTACTT TATCAATCAC AAGAGAAAGT AGCGCTGGGA GAAACGACCT 12480

ACTTCATTAC CTCGCAAAAA CTTCCTCAGA AAGATGGACA AACATTGAAG CTTTACTTCA 12540

TTCGAGAAGA AGAGCAGTTA ACCTCTTTTG AGATGAATCA AGCGGTGGAA GTTGTTCAGA 12600

TGGCTCTGAA TTTATGGGGG AAAAACTATG ACGAAGTCAG TGaGTATGCC TTAGTCCAAG 12660

CAATCATTAA CGATGAAAGT GATAAGATGT ACCAATTAGC CAAAAAATTA ATGATTGATA 12720

TTTCTTCTGT CGAAACGATG TGGCTTTTGC CCATTTGGCA ACAAGGACAA GAAATCCACG 12780

CTATTCAGCA AGATTTGAAA GATTTCCTAC GTTACTATTA CCAAACAACG ATTGTCCAGC 12840

AAACAGAAAA TTATCTGATT GTTCTCTTAG GTAATTATTT ACACAAACAG CCTGAAGCCA 12900

CGATTAGTAA AGAATACTTG CAAGCCACAG ACTATCAGGA CTATTTAGGT GAAATCATCC 12960

TTTGTCCGAA AGTTCGTAAC ACGACTGAAG TGCGTGTCAG CTACCAAACC GCTAATCAAG 13020

TCGCGCCGTT TTTAAAGCGT GTCTATCCAC AAAGAAAAGT GTTATCAATC GCGGAAATTC 13080

GTACAGTTCA ACAAATGATG ACGTGTCTCC AAGCAGGAGA AGAAGCAATC CAACAACGCT 13140

TAGCAGTTAT CCAACCTCTC TTAAAAGAAT CAGAGCAGTT AATGACATTA GGAACGTTTT 13200

TACTAGATGC AAGTAGTGAT TTTAATGAAT GCGGCGAACG CCTATTTGTT CATAAAAATA 13260

CGGTCAAATA TCGTATCAGT AAAATTAATG AAGCGCTCGG CTATGATATG ACTAACGCCG 13320

CCGAAGCCTA TGAAGTCTAT TTGGCGATGA TCTTATATCG CTTGGTCACG AATGAATGAA 13380

AAAAGCAGAC AGAGGTCGGG ACAGAAGTGT TTAACTCCGA GAACCACGTA GGTACTGTTC 13440

AACACTCTCA AAGGCTCAAA TACATCGAGT AATCGCAGAT GTCATCTATT TGCTTTGCAA 13500

ATAGTGTTTC TCAGCAATAA GAAGGAATTT CCGAAAATTG TCATAAAACA CAACGAGGTA 13560

CGAGTTGATG TTGCATGATA CCTACCCGGT GTTCTTTAAT TTTTGGAGAC CAAGTAGGTA 13620

TTGTTCATCA TCGAAtCACT ACGTTCTTCG TGATTCTCAA CAATTTCGGC TTATTGTTTT 13680

CTCGCCAAAG GCTCGTCACA GTTTTATCCT TGTAAGTTCC GAAGGACCTT CACCAAGTAG 13740

GTACGATGAT TTCATCTGTT CGTGCTTCAC AGTGAAACAT CGCAACTCTT AGAGCTGTAG 13800

CTCTTTTGAA GGCTGTACTG TGACGAGACG CATGTCGAGA AAACATCTTA GAAGTTGTTG 13860

AGATCGAAGC AACGCGTAGT GTTGCCTCTG TTTCCGCCGT TTATCAATTT TTAAGsCTGG 13920

GACAAAAATC CAAAGTGATT TTTGTCCCAG GTTCTTGTCT ATTTtTAtTT TAAGCCAAAA 13980 

CTTCCTCGAT ATTTTTTTTA ATTTGAAAAA GACCTATGAC TGATTTTTCG TTATAATTGA 14040

AAAAATAATT AAGGAGGAAT GAGAATGAAA TTAAAAAACG AGCAACAACA ATCGTGGCTT 14100

AGCTTAGCAT TTGTTTGGGC GGGCGCAATG ATTAGCGTGC CAGGATTAAT TATTGGAGGA 14160

ACGTTAGTTG CAGGCATGCC TTTATGGGAA GCTTTGTTAA CAGGATTTAT TGGTTACGGT 14220

ATTATTGTTA TTTTAATGAT TTTACAAGGA ATTCAAAGTT CCGACTTACA AGAACCTTCT 14280

GTCAAAGTCG CTTCTCAAGT GTTTGGCATT CAAGGTTCGC AAAAGATCAT TTCTATTATT 14340

CTAGCGATTG CTTGCTTAGG TTGGTTTGGT TTGCAAGCCA ATGTCAGCGG CGGCGCCTTC 14400

ACAAACTTTT TAAAAATTTA CGGCATTGAT TTACCTGTGT CGCTTTCTTC CTTAATCTGG 14460

GGAATTATCA TGCTTATTTC GGCTCTTTAT GGCATAAAAA TATTGAAAAT TCTGAATTAT 14520

TTTGCGGTTC CTGTGTTAGT TCTGGTTTGT CTTTATGGCT TGGTGGCTTC GTTAAGAAAC 14580

AACGGTTGGG CGGCTGTTAG TCAATATACC CCTCAAACAG CAGGAAGCTT TATGTCAGGT 14640

CTTTCAATGA CTGTCGGTTC TTTTGCATTA GGCGCAGTAA TTGCTGGAGA TTACTCTCAA 14700

TATGTTTCTT CTCGTAAAGA TGTGGTCAAA GCGGCCACCT TAGGGATTTT ACCGACAGGC 14760

TTGTTGATGA TTGGCGTTGG TGCGGTCTTG ACCATTGCAT CGAACACCGC AGATATCACG 14820

GAAGTTTTTA TGAATCTCGG TTTTCCAGTT TTAGGAATTA TTGCGCTAAT TTTAGCAACT 14880

TGGACCACGA ATGCCGTGAA TGCTTTTTCT GGCGGACTGG CCTTGATTAA TGTCTTTGAT 14940

ATTCCAAAAG AAAAAGAAAA AGTCGCTGTC GGTGCGGCCG GCGCGATTGG AACCTTATTA 15000 

GCAGTTGTGG GGATTTTAAA TTACTTTACG CCGATTATGT CCGTCCTTTC TGCGATGGTG 15060 

CCACCAGTGG CTGGTGTAAT GATTGCTGCC TATTGGTTGA TTAACAAAGG CGATCGAACC 15120 

AAATGGCAGC CAACGCCTGG CGTGAATAAA TTAGGTGTTT TTTCATGGCT AATTGGTGCT 15180 

GCAGTGGGGG GCATTCCGGT GATTATGAGT TTCTTTCCCA ATGCCCCTCA ATTGCCTAAC 15240 

CAACCGCTAA TTGGCATTGT TTTATCGTTT GTTATTTATT ATTTCGGTGC AAAACGTCAA 15300 

CAGACGAACA CCGAATCATT AGAAATGGAA TGAAGAAGAG GAGTGTCGGA GAATGCGTTA 15360 

TTTAGATGCA GAAGCAATTG AGAATATAGC CACAGGAGCA GCTTTTTTAG GGACAGGAGG 15420

TGGCGGTGAT CCCTACATTG GTAAAATGAT GGCTTTGTCC GCCATCGAAG AAAACGGACC 15480

TGTCAAACTG GTTTCTCCAG AAGAAATTGC CGCGGAGGAT TTTTTCCTAC CCGCCGCAAT 15540 

GATGGGTGCC CCATCTGTCG CAATTGAAAA ATTTCCCAAA GGCGACGAAT TCGTCCGTGT 15600 

CTTTGAGAAG TTAGGAAAAT ATTTAGACCA AGAAACGATT GCGGGAACCT TTCCAATGGA 15660 

AGCTGGGTGC GTCAATTCAA TGATTCCAAT TGTTGTTGCA GCGAAgcTAG gCATTCCCTT 15720

GGTGGATTGC GATGGTATGG GTCGGGCCTT TCCAGAATTG CCTATGGTAA CGTTCCATTT 15780

GAATGGGATG TCAGCGACCC CTATGGCAAT TACCGATGAA AAAGGAAATA TTGGCATTAT 15840

GGAAACGATT GATAATACTT GGACAGAACG TCTTGCTCGT GTTCAAACGG TTGAAATGGG 15900

CGCCAGTGCT TTAGTGAGCA TTTATCCCGC GACAGGCAAA CAATTACAAG ACTATGGGAT 15960

TCACAACATC GTGACATTAT CAGAAGAAAT TGGCAAAGTG ATTCGAGGTA CCTATGCAGA 16020

TGAACAAGAA AAACGCCAAG CATTAGTAGA AGTTACGGAT GGCTTTGAAT TGTTCCAAGG 16080

AAAAATTCTA GATGTGGAAC GAGAAGTAAA AGGTGGCTTC AATTTGGGAC GTGTCAAATT 16140

GAGTGGCTTA AACAGTGACG CTGGTTCAGA AGCAGTCGTC CATTTTCAAA ATGAAAATTT 16200

AATTGCCGAG AAAGATGGTC AGGTGATTGC GATGACGCCT GATTTGATTT GTATGGTAGA 16260

TTTAGAAACT TTAACGCCTG TGACAACAGA AAGCTTGAAA TACGGCAAAC GTGTCCAAGT 16320

AATGGGCTTG AAAGCGAATG CCGCTTGGCG AACGAAAAAA GGTATCGAAA CAGTGGGTCC 16380

TCGGTATTTC GGCTACGAAA TGGATTATCA ACCACTAGAA AACTTAGTAG CAAAGGAGGA 16440

CAAATAAATG TATAAAATCG GTATCGATGT CGGCGGCACA AATACAGATG CTGTTATTTT 16500

AGATCATCAA TTAAACTTGA TTCATTCAGT GAAAGTTCCA ACAACCGATG ATATTCAAAC 16560

AGGGATTGCC GGTGCCTTAA ATAAAGTCTT GGCAGAATCA GCAGTGGACC CAACCAAAGT 16620

CACACATGCA ATGTTGGGCA CGACCCAATG TACGAATGCC ATTGTTGAAC GAAAAAAATT 16680

AGCAAAAGTT GGGGTCCTTC GTTTGGGCTA TCCAGCAACC GCTTCGGTCT TGCCTTACAC 16740

AGCTTGGCCC AAGGATTTAG TGGCCACTCT TTCAGAAACA TATGCTTTAG CACATGGCGG 16800

CTACGAATAT GATGGACAAC CATTGACTGC ACTGGACGAA GAAGAATTAC GTGGGATTTT 16860

GGCCTCTTGG CGAGGAGAAG TAGAGGCAAT CGCAGTAATT GGCGTTTTTT CTTCTTTAAA 16920

AAATGACCAA GAATTGTTTG TTCAAGCACT TGCGAAGGAA GTGTTAGGGG CAGATGTTCC 16980

TGTATCTTGC TCCTCAATGA TTGGGTCTGT CGGTTTAATT GAACGGGAAA ATGCGACAAT 17040

TTTGAATGCT GCGTTGCACA AAGTAATTAA AGTGACCAGC GAAGGCTfTG AACAAGCCTT 17100

AGAACAAGAA AAAATTCATC ACGCCCAAGT GTACTTATGT CAAAATGATG GTACGTTAAT 17160

GTCGTTAACG TATGCCAAAC AGTTTCCTAT TTTAACCATT GCTTGTGGAC CAACCAATAG 17220

TATTCGTGGC GCTTCATACT TAGCTGGCTT GAAAGATGCG GTTGTTTTAG ATGTTGGTGG 17280

CACAACGTCC GACATTGGCG TGTTAGTGGA TGGCTTTCCA AGAGAATCTT CACTGGCCGT 17340

TGATGTTGGC GGGGTACGGA CCAATTTTAG AATGCCAGAT ATCGTGTCAA TCGGTGTCGG 17400

CGGCGGCAGT CTAGTGCGTG AACAACCAGA TGGTTCTGTG ACAGTTGGCC CCGATAGTGT 17460

CGGCTACCGT ATTACACAAG AGGCCCTTGT TTTTGGCGGC ACACAATTAA CGACCACTGA 17520

TATCGCGGTT CGCTTAGGCC ATGCCCAAGT CGGTGATCCT AGCAAAGTGG CTCATTTAGA 17580

CCAAGCGTTT GCGGAAAAAG TGTACCAAAA AATTGGCGAA TTGGTCAGTG AAGCGATTGA 17640

TCGCATGAAG ACCAGTTCCG CAGACGTGAC GGTCGTTTTA GTTGGTGGTG GGAGTATTAT 17700

CATCCCCGAA GAATTGACAG GCGTTAAAGC GCTAATTCGC AACAAAAACG GAGCAGTGGC 17760

GAATGCCATC GGTGCCTCTA TCGCTCAGAT TAGCGGACAA TACGAACAAA TTTATGTTTA 17820

TTCAAAAATC CAACGAGACG CGGCGTTGAC TGATGCCCAA GAAAAAGCCG TCCAACAAGC 17880

TGAACTTGCG GGTGCAGTCC CTGGCACAAT CGAGTTAGTC GAGGTGGAAG AAACACCACT 17940

CGCCTATCAT CCAGAAAATG CAACCCGCTT ACGAGTAAAA GTGGTTGGGA ATATGTATTG 18000

AGGACCAGTG CAAAGCAAAA GTTATAAGAA GAGTTATTTG AAATATGAGG TCGGGCTAGG 18060

GCTGTCGATT GAAGTTTTAT CCTGAAGAGT ACAGAAGAAG CTATATCTGT CTCGCCGTTT 18120

ATTTTAAAAT TCAGAgtTAg agTaAGTCyA AATGACTTAC CTTCCTAAgT TCATTTTTTT 18180

GTGTTGACAT ATTTTTTTAG AAAAGGnCAA AATGTTAACA TGGTATGTTG TTGGGGTGTT 18240

ATGAAAAAAG GAGAGAAAAT AATGAGTATT TTAAAAGAAC ATTTAAAAAG AAATTGTGAT 18300

GCACGAATGG TTGGTGTTTT ATCGTTAAGC ATAAGAGAAG CATATCGCTA CTTAATGGAG 18360

CTAACAGAAA GAGAATCTAT TTTTCAACGA GCAGAAATGA AAAAAGTTTA GGGACACGTT 18420

CGGCATGGAT TGGTTGATGT TGGTTTAAAA CAAGTGTTAA CTTCATCAGC AATCCCACAT 18480

GAAATAGCTG ACAAAGCTAG TAGTCGATAT GTCAATGGTC ATACGTACTT AATGATTGAA 18540

ACCAAAGGCG CAATTTTAAC GCCAGCGAAA GTTTTATCAG AAGCTTCAGT TCCTAAAAAG 18600

GCATTATTTA GAAATCGTGG GAGCTTATTA AATAAGCAGT ATAATTTATT TGACAAACCA 18660

GAAGATTTAA ATGAGTATTA TGATGCCAAT CAACCGCCAT TCTTATTACT AAGTTATGGC 18720

GGATCGAATC ATCAATTACG TTTTGTTCGA TTAGGGTTAC CAGATATTGG AGTTGGTCGT 18780

TGGATTGACC AAATTGATAT TACGCAGGCA CCAGTTCTTT TAAAAAATCC AGAGGAAGTT 18840

CGTAAGGACC TACATTTGAC TTTTACGTCA ACCGCTGACG AACTGATAAG GAGAGGACTA 18900

GAAAATGAAC GAGAGGTTGA TTTTTAATCC AAAGCAATTA ACTTCGGCTC GCATTGCACG 18960

AGGATTAACA ATGAAAGAAT TAGCCGAAAA AGCAGAGCTA TCTAGGCAAA TGATTTCGAA 19020

TTACGAATCG GGAAAAACAA TTCCTAAAGC AGATAGCATA TTAAAACTAA TTTCTGTTTT 19080

ACAATTTCCC CGTAGTTTCT TTTCAGCAGA TACATTTGAA TTGTATTCAG GCGCAACTTT 19140

TTTTAGAAGC CGAAGTGCGG CGACGAAGAA AGTAAGAGAT ATGCAAAAAG AGCGGTTAAA 19200

ATATGTGCAA GAAGTGTATA AAGTATTAGC AACATATGTG AACTTTCCCA AAGTTTGTTT 19260

ACCAGAATTA ATTGAAAAGA GTATTTATGA AATCACAGAA GAAGAAATTG TGAAAAAGGC 19320

TGAGGAATTA CGGAAAATTT GGGGCTTAGA CTTGATTTCG CCAGTCCCTA ATCTCATTCA 19380

GTTGGCTGAA CAAAATGGTG TGATTATTTC AGAAGCCAAC ATGTCCAATC CAACACTAGA 19440

TGCGGTCTCG CGTTGGATGA TTGGTCGGCC GTTTATTATG CTCACAGATA ATCATGAATC 19500

TGCTGTCCGT CGTCGTTTTA ACGTGGCGCA TGAACTAGGA CATATTTTAT TACACAACGG 19560

TGTTGAAAGT ATTCATGAGT ATTCGCAAAC AGAATTAAAG AACATTATTG AGTATCAAGC 19620

AAATCTATTT GCTGCGCATT TTCTCCTTCC TTCAGCGGCT TTTTCTGATT CGCTCTTGTC 19680

TATTTCGTTA GAATACTATA TTGATTTGAA AAAGTATTGG AAAGTTTCTC TTCAATCAAT 19740

GATTCAAAAA ATATACACAT TAGCGTTAAT TAATGATGAT CAACGTCTCT ATTTAAATAA 19800

AAGAATTGCT AAAAATAAAT GGAAAACGAA AGAGCCTTAC GATGAACAAC TGACTATCGA 19860

ACAGCCAGAA GTTTTCCAAA AAGTTTACCA CATGATTTTA GAGCACCAAG TCATTCCTGC 19920

AAGTCAACTA AATCAAGCTT TTAAATTACC TAAAGAAGAA ATAGAAAAGA TATTAGGAAC 19980

AGCTATTCAA ACGGAGCAGG ACAAAACAGA AATTGTTCTT GAATTATTAC GTTCTAACAA 20040

GTAATCAAAA AGAAGGAGGA ATAACTATGG CTAAATTATT CGCCTACCAA ATTGGTCAAA 20100

ACCCTCGTAT TCAAACGGAC CTTTTGGTAG ATCCTCAGCT GTTTGAAGAT GAGCATGGCT 20160

GTATGGGTGC AGTGGGCTTC GGGTTAGCAG ATTGTGTTCA AACCGGAATG TTTACAGATA 20220

TTGAAGTGAT AAAACGGTAT CTTCACGAAG CAACCTACGT TTTTATTAAT GGTGACTTTG 20280

ACCGACTAAG TTATTTAGAA ATTGGGATTG CACTTTCCTT AGGCAAAACA CTTTACGTCA 20340

TTACAATGAA CCCAAACGTA ACGAAAGAAG ATTTAGGAAT TCCCTTCGAC AATGCTACGA 20400

TTGAATTCCT TTCCCCCAGT GCTTTTACGG AACGAATCCA TGAAACAGAA GCAGCAGAAA 20460

ACTAAACAAA AGTAAGTAAT ACGACTGTCT TTTTTTGCTA TTTTTTGCGA TAATAAGGGA 20520

GAAAGTTCAA ATTGAAAGGG AGCCAGAATA AATGAAAACC TATGATTATA TTGTTATTGG 20580

TGGCGGTAGT GGCGGGATTG CTTCTGCTAA CCGCGCAGGA ATGCACGGAG CTAACGTCCT 20640

TTTAATTGAG GGCAACGAAA TTGGTGGGAC CTGTGTCAAC GTTGGTTGTG TGCCTAAAAA 20700

AGTCATGTGG CAAGCCAGCA GCATGATGGA AATGATGGAA AGAGATACAG CAGGCTACGG 20760

GTTTGATGTT GAAATCAAAA ATTTTAGCTT TAAACAATTA GTTGAAAATC GTGAAAAATA 20820

TATTGACTTC TTACATGGTG CGTATAATCG TGGTTTAGAC AGTAACAACA TTGAACGTAT 20880

CCATGGTTAC GCTACGTTTA CTGGGGAGCA AACAATTGAA GTCAATGGGA CAGAATATAC 20940

GGCGCCGCAC ATTTTAATCG CGACAGGTGG TCGTCCGAAA AAACTAGGTA TTCCAGGAGA 21000

AGAATATGCC TTAGATTCAA ACGGCTTCTT TGCGTTAGAA GAAATGCCAA AACGTGTGGT 21060

CTTTGTCGGT GCGGGTTACA TCGCAGCCGA ATTAGCAGGA ACCTTACATG GCTTAGGCGC 21120

AGAAACACAT TGGGCCTTCC GTCATGAACG ACCATTACGA AGCTTTGATG ACATGCTTTC 21180

TGAAAAAGTC GTAGAACGCT ATCAAGAAAT GGGCATGCAA ATTCATCCGA ATGCAACACC 21240

AGCTAAAATT GAAAAAACGG CGCAGAATGA GTACGTTATT ACATTTGAAA ATGGTGAATC 21300

AATCACGACA GATGCCGTGA TTTTCGGGAC AGGCCGTCAG CCGAACACGG ATCAATTAGG 21360

CTTAGAAAAT ACGAAAGTAG CCCTTGATGA AAAAGGCTAC GTCAAAGTAG ATAAATTCCA 21420

AAATACCACG CAAAATGGCA TCTACGCAGT CGGCGATGTC ATTGGTAAAA TCGACTTAAC 21480

GCCAGTCGCG ATTGCCGCTG GTCGGCGCCT GTCTGAACGC TTATTTAACG GACAAACCGA 21540

TTTATATCTG GATTATAACT TGGTGCCAAC GGTGGTCTTT ACCCATCCAC CAGTTGCGAC 21600

AATTGGCTTG ACGGAAAAAG CAGCGCTTGA AGAATATGGT GAAGACCAAG TGAAAATCTA 21660

TCGCTCTAGC TTTACACCAA TGTATTTTGC TTTAGGAGAA TACCGTCAAA AATGCGACAT 21720

GAAATTAATC TGTGTCGGCA AAGAAGAAAA AATTGTCGGT TTGCATGGCA TCGGCATCGG 21780

TGTTGACGAA ATGCTACAAG GCTTTGCAGT CGCTATTAAA ATGGGCGCGA CCAAAGCTGA 21840

TTTTGATAAT ACTGTTGCGA TTCATGCAAC AGGCTCAGAA GAATTTGTAA CAATGCGATA 21900 

AAAGGCGAAT AAACCCCAAA AAAGAGCGTT GCTGATGCAA CGCTCTTTTT TGAATACACG 21960

AGTTATACTA TTTGATTCGT AAAAAGTAAT TTTTTATATC ATACCAATAA CCACTTTTCC 22020

ATCTTTTAAC TACTGACTTT TCCAAGAAAT ATGCTATGaT TAAAAAAATC AGCAAAACAG 22080

AGGGTAGGGA GTTAGAGGAT GGATAAAATT AGAATCAACA ACTTGAAATT TTTCACAAAA 22140

AATGGTGTTT TGGCAGAAGA AAAACGTCTA GGACAGCAAG TAGAAATCGA CCTTGAAATG 22200

CAGTTGTCGT TAGCTGAGGC AGGGCGAACA GATGATGTCA CTCAAACTGT CAATTATGCC 22260

GAAGTGAATG AATTGATTGC CCAACGAGTA AACAATCACT CGTACGATTT AATCGAAGGT 22320

TTAGCGAGTG CTATTTTAGA TGATATTTCC GCAGATTATC AAGAACAATT GAACAAAATT 22380

ATCATTAAAA TTAGAAAATA TAGTGTGCCG ATGCCCGGGC TTTTTGACAA TATTGAAATT 22440

GAAATGGAGC GAGAAGTATA GATGATTGGC TATGTAGCAT TAGGTAGTAA CGTCGGCGAT 22500

CCATACGCAA ATTTAGTGAA AGCCCGCAAT TATTTAAACG AAGACCCGCA GATTCAGGTC 22560

GTGGCTTCTT CAAAACTTTA TGAAACGGAT CCTTACGGCT ATACAGACCA AGATGATTTT 22620

TTGAACGCCG TATTGAAGAT TGATACTAGC TATACCGCTG AAGAGTTATT GGCTGTTACC 22680

CAAGGGATTG AGCAAGCAAT GAAACGTAAA AAATTAATTC ACTGGGGTCC GCGGATTATT 22740

GATATTGATA TTTTATTATT AGGGGGGCAG TCAGTCACGT TGCCCCACTT ACAAGTTCCT 22800

CATCAAGAAT TAACAAAACG GTCATTTGTT TTGATTCCCT TAAGTGATGT TTACGCAGAG 22860

GAACAACTAT TGGGCGCGCC AATAGCTACG TGGATTCAGC GCAGTGGCAA TCAAGAAGAA 22920 

GTGCGAGTAA GCGAAAAGGA ATGGTAGAGA TGGAGCAGGA 22960 
(2) INFORMATION FOR SEQ ID NO: 346:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2966 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 
ATAAAGGAAG AtGCCTTCAC CGATTCCACC AGCTAAAACA GGAGTGACGA TATwgAAAAG 60 
GCTATGTTTC CAACCGrGAC CtAACAgTGT TCCaACAAGG GTCCCAACGC CCATTGCTAA 120 
AATCATTCCT AATGCCATCG GGACAATCAT GCGCATTAAA CCTTGGACTA AAATTTTGCG 180 
GTTCATTCCa AGGATACTTC CACAGACCAA ACATGCAATA TAGAAATAGA GAAAGTTTGC 240 
TTGTTTCATT AAAATATCTG TTGAGTCTAA GACATTTTGG TTGAGTAAAT TGAAAAAGAC 300 
CATAATTGAT GGTACTAATA AAGATAAAAT TGCTGGACCG CCAAAATGTT TCAAAATAGG 360

AATATTTCCA CCAATTGTTC CAAGTAACCA ACCCATTGTT AAAATTACGG CAAAACCACC 420

TAACATATTG ACTGGTAGTT GTTCTAATAG AGCCGTAACA AGAATAATGC TAGCTGTTAC 480

AAGATAAACA GGTAACGTAA CCGAACCAAT CCTCGTTTTG GTTAATTTGT TGCGCCAATC 540

TGTTTCATTA GCAGCTGTTG CGGGcAATTT CTTTTCCATG CTATTTCCTC CTATAACTTT 600

AATCTATATT CATCATAATG TAAACGTTTT CTGTAGAGTA GATTATTTAA TTAATTTAAC 660

TAAAGAAAAG TTTTTTAACC AAATCTTCCT ATGCTATACT TTATTTGAAG AGAAAAGAGG 720

GATGGAATGA AGCGTGGCGG GAAATTATGG TGTTTCTTAT CAGTGATTTT TATTGCAACA 780

TTAATTATTA TGATTACTTT CTTTTACGGA GTAACGACGG TTCAAACAAT TAAAGAAGTT 840

CGTAAAAACC AAGAGCAAGC ACTTTTAGCC GTCGGGGAAC AGTTGGCTAT TGAACCGAAT 900

GTTATTGAAG CCTTAAAGAA TGATCACTAT TCAGATGAAT TGGAAGCGTA TACGGTTCGT 960

TTAGGTGAAA TCCATCAGTT GGATTTTATC GTGATTATGA ATATGCAGGG GATTCGTTTA 1020

ACACATCCTG ATCGCCAAAA AATAGGTAAG CATTTTGAAG GgGCGATGAA GTGCGAGCAT 1080

TAAAAGGGGA GGAACATTTG TCTGTTAGTC AAGGGAGTTT AGGAGAATCA TTACGAgGTT 1140

TTGTCCCAGT CTATGATCAA GGAAAACAAA TTGGGGTTGT gGCAATGGGC ATaAAATGAC 1200

CTCGCTTTCA CAATTGATTG AACGAACmAA AAATGATTAC ACAGTGAGTG TCCTTTTAAG 1260

TGTTGGTTTT GGCTTTATAT aGCCATCGTG GtTTCnTATT ATTTAAAAAA caACTTCACG 1320

ATTTAGAGCC AAGAGAAATT GCTCGTCTAT TAGAAGAGCG AaACGCCATG CTTGAGGAAA 1380

CGAAAGACGC AATCCTTGTG ATTGATACCG ATCAAAATAT TTTATTAGCA AATATTGAGG 1440

CGACTAAAAT GTATCATAAT ATAACAAATA GTGAAGAAAA TTTACTAGGA AAGAAACTTT 1500

CAGCATTGGT TTTATCTCCT GAAAAATTAG TCGTACATTC AAAAACAGAG CAATTCTATC 1560

GTCAAAATGG CCAGGATTAT TTCGTCTCAA TCGCCCCGAT TAATGTCCGT AAAAAAACAA 1620

TTGGGCATGT TATCTTTCTG AAAAATGCGA CAGAAACGTT TATTGTCGCA GAACAGCTTG 1680

TCAGCACaAC AACaTATGCT TCGGCATTAC AAAGTCAATC ACACGAATTT ATGAATAAGA 1740

TGCATGTCAT TTATGGCTTA GTCGATTTAG AAGATTATGA GGCGCTAAAA CATTATCTAG 1800

CCGATTTATT GAAACCAGAA AAGGAATTTG CACAGCGCTT AGCTATTTTG GTCAGAAATC 1860

CTATTCTAGC AGGTTTTTTA AGTGGGGAAA GAATCAAGTT TGCAGAAATT AAAACACAAT 1920

TAGCTATTGA AATTTATCCA GAAATTCCAC CAAATAAACG AGATGAAGAT ACGCAAAATT 1980

TGATTGCTAT CTACCGTTAT ATTCATCGTT TCCTGATGGA GCAAACGTTG CCAGAAGAGA 2040

TCATTGAAAC CATTGATTAT CAGCCAGGAA GTTTGACGAC GACTTATTCG TTTGCTTATC 2100

CAAAAGAACA ACTGGAACGC TTTGAGCAAG AGTTTTTCAC TTCTTATTTG GCAAGATTAT 2160

TAGAGAATGC GGAAGCGACA TTGACTTGGG AAAACCAACA AAACAATTGG CTTGTCCTTC 2220

GCATAAATGT TCATTACGAA GGAGCAGAGG AAAATGAACC TATTGATTAT TGAAGACGAT 2280

CCTATGGTTG CATATATACA TCAAAAATAT CTTGAAAAAT TAATTCACCA ACCGACTATT 2340

TTTACGGTTG CAACAATTGC TGAAGGTCTA CTACTAACGA AAGAAAAACA ACCAGCACTT 2400




GTTTTATTAG ATGTTCATTT AAAAGATGGC AATGGCTTGA CCTATCTGGC AACAATTAGA 2460

GATGAAAAAA TTGATACAGA AGTTATCCTG ATTACGGCTG CCAACGAGTT GGAAAATGTT 2520 

AAACGAAGCT TACATTTAGG CGTTTTAGAC TATTTAGTTA AACCCTTTTC TTTTGAGCGA 2580 

TTTCAACAAA GTATCGAAAA CTATCAGAAA AAAACTGCTC AATTCACGCT GGAAACGAAG 2640

GAGCTTTCGC AAACAAAAGT GGACCAATTA TTTCATTATT CACAGACAAA CGCACGTAAA 2700 

AATGAACAAG CTCTTCAAAA CATGTCCTTA GAAAAAGGAC TAACACmAGC AmCATTACAA 2760

TTATTGTTAA AAAAAATAGA CGAATTTACC GACTATTTTA CGATTCAAGA ATTATCAGAA 2820 

GCGAGTCAGC TTTCACATGT TTCTGTTCGC AAATATGTGT TGTTTCTAGG AAAAAAATAA 2880 
CCTTTTAGAA AGTAAAAATA GCTATTTAAA AGTGGGGCGT CCCTATCAAT CGTTCnGAGA 2940

ATTTAGTATT TTTTAATTTT TAGGAA 2966 
(2) INFORMATION FOR SEQ ID NO: 347:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 



AAATTATTTA AAATTCAGAC TCATTAGTAG ATCAGATTAA TATCTAACTT TAATGCTCAA 60

AAGTGCTGAT GATGAATAAG CCATCATGAC AGAATTTAGC GGCATCATAA AAATTATTCA 120

GATCAAACGT CAACTATGAT TATAGCAGAA AAACAATGGG TTCAAAATTG AGATAAGAAA 180

CCAGCTAGAC TGTCrACGTA AAATGATAGG CTAAAAAAAG TGATGTGTGC TTGmTACrAT 240

TTTTATACAG TATTTCrTGA TGTrATAAAy AAATTkTCTA TTAAaaTaTG TtnACTAaaA 300

AGACAAACTT TTTGGGGGCA CTACATATaC CTTGTTGGTC TTTTnTATAT TCCAATTGTT 360

ATTAGGTATG TAGCAGGACA TTTAAGGTAT ATTTTGATTT GTGTTCTGGG TCAGCGCACC 420

AAAAACAAAA ATGAAACGTG TGTCAGAGGA AGTATGTGTG GTTTTCCAAT AAAAACAGCT 480

GATGGTCCAA TTTTAAAGAA TAATATAGCA CAGGAGAATA TGAATGAAAA AAGGAAAGTT 540

ACCATTAATG ATATAGGGAG AAATwTAGTG TATTTArGGA ATTTGAGTAA CGGTAAaTAG 600

TTGATGAATA TATACTTTTT TAGTrGAAAA AAAACACTGA TAGGAGGTAT AATATTGAAA 660

AAGAGAATAT GTATCACTAC TGGAGTTGCA ATGTTATTAG GATTATCACC GATATTAACA 720

ACAGGGGCTA CAGTATACGC AAATGAAGAG AGTAACTACG TATAAGTAAA TTCTACAACA 780

AAAGAGCTAG AGCAACGGAT GGAGGCGCTA GCCAATCAAT ACAAAGGAGT TTCTCTATTT 840

GAAACCGAAT TCAATAAAGA AAATATGACT CCTGTTCAAT TAGAATTATA TAACAAAGTT 900

TTAGATCTAC AATATGATAT TGCTTTACAA AATGGAAGTG CAAAAGGGAT GTcTAGAGAG 960

CAATATAAAC AATATCTATC AAACGCGATA CAAGGTTTTA TAATACAACC TAGAAATACT 1020

GGACATGGTT TGATTTCTGT TGACGTCTTA GAAACAGTTT TAGATACTGC GCTTAATGTT 1080

GTATTAATTA TCGGAAGTTT TGGGTCAATG TCTGCAATGG TTAAAGAATT GGGTCAAGAA 1140

GGCGCAAAAA ACTAATTACA AGATAAAGCA GTTGGATTAA TTATTAGTGT GATTCTAAGT 1200 

TCTGGAAAGC AAGCTGCAAA ATTATTAGAT TCTATTGACA AAATACCCAA TAATGGCTAC 1260 

ATTGAATTAA CTTAATAGAG TTTACGAAAG GATTGAAAAA GATGTACCTT GTAATAGGGT 1320 

CAATTCTAAT AATCTTTATA GTCTCAGTTT TTATCTCTAC TATAGGGGCT TTGAATGAAA 1380 

AAaTAAAAAa TAAATTCTAC TTATTTGTTA AATTTTCGTT TAATATTTCT ATTGCTTGTA 1440

TTTTTTTAGG TGGTATATCT TTTCTTTATG CTTACTTTGT TTTTGATGCG TTTAAAAATT 1500 

TTAAAATTAT CTTGATATCC ATATTTTTAT TTGTAATCTC TTACAAAGGT TATATGGATA 1560 

AAAAATATTT CCCAnGAAGA 1580 
(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 



AAGATATGGT AAAAATTAAG GCTAAATCTT CGCATCTTTT TGCATATCAG CTGTAGAAAC 60

AAATTGTAGA TTTGTTTTTT TACCATCATT AAGCTTTATT TCTTTTTCyT CaATtATCAA 120

rGAATTACTA TCaCTTTCtA AATTAACATT TATATTATCT TTGATTTGTA TTTCATTCTT 180

ATTATTTTTt TCTTTTATGA ATACTTTTAT TTCACTACTC TTGGAATAAA ACTCCAAaGT 240

TTGTTCTATA TATTTTATTT CTTTTCCATT CATATTACTA GAAATTTCAT TTGCCATATT 300

ATGAATCTTT TCTTTTCTCG TCTCCAAATA TGTTTTAGGA AAGATAAAAA AAACTAATGA 360

ATGAACTAAT ATAATTATTA TTCCAAGAAC AGAAAATATT TGTATGAACA TTTTTGGAAA 420

TATCTTTAAT TTTTTCATTT TCTCTCCAAT TTATATCCTA CATTTCTTAT AGTAGTTATA 480

CAGTCTAATT GTAATTTTTT TCTAAGTTCT TTTAtATATA CATCTATTAC TCGATCATAA 540

GGAGkTTCyT CGCTATCTTT CCATACATAA TCaATAATTT GCATTCtTGT TAACACTTGT 600

CCAATAATAA CCAATAAACA TTTAATACTC CA 632



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:

AGTGCCTTAT GCCAGAGTTT CaTCCmyTGG ccAAAaCTkG GAACGGCAAA TTCAAGAGTT 60

AAAAAAATTA GGAGCGAAAA AAATATTTGT AGAGAAAAAA TCTGGCGCAA GTATTGaACA 120

ACGACTAATT TTTACAGAAG CTATCTATTT TGTGAGAGAA TCCGATATTT TTATGGTAGA 180

AGCCATTGAC CGATTAGGCA GAAATTACGA TGAAATTATT CAGACGGTTA ATTTATTGAA 240

AAATAAAAAT GTTCGACTCA TAATTACAAG CCTTCCTATT ATGGCTGAAG TGGTTGGTAA 300

TCCATTATTT GATCGTTTTA TTAAAGATTT GATAGTCCAG ATTTTGGCAA TGATTGCGGA 360

ACAAGAAAGA ACAGAATCTA AGCGACGACT AGAACAAGGA ATTAAAATTG CAAAAATGAA 420

CGGTGTTTAT CAAGGACGTC CTGAATTATA TAGTCCAACA GCAAAAGATC TCCAAAAAAG 480

AGCTGTCTAT AGAAATATTA TAGAAGAGTT ACAAAAAGGC ACTGCTATTT CAAAGATTGC 540

AAAAAAATAT GGAATTAATC GACAAACAGT TTATAGAATA AAAAAAGACT ATGAGTCAAA 600

TCAAGCAGAT TCTTAATGAA TCATAAAAGA AATAAGAGGG CTTATTTGGT GAAAAmCAGA 660

AAGGAGTAGT TATTCTGCAA ATTATTAAAG ATACAAAAAA CACGTAGAAA AAGAAAATAA 720

GCCAATGGTT ATTTTTGATA TGATGTCAGT AAAAGTGGAA AAATTATTAT TAGAGTTGTC 780

AGAAGAAGTA TTTCGAGTGT GTCCTAAAAC AAAGAAAGAA TTTAGAAAAA TGGTTTCAAG 840

ATATAGCTTT AGAAATGGAG AGAATAAGAT GGTTGTTCTA AAAAAATATG ATTACTGGGA 900

AATATAAATT ACTACGTGGA ATAAGGCAAA TATGGTTGCT TATGTTGATT GTGACCAATT 960

GGCTAGAAGA GAGTTAGGAA AATATAATCA TCTTATGTAT AGTAATTGTA AAAAATATAA 1020

ATTAATTAAG AGCGTTTGCG TAGAAATAAG CGCAGAAAAA AATAAAAAAA GGTGAAACCT 1080

TGATAAAATA GTAATAGGCA ACAACTAACT CCTCAATATT AATATAATAG TGTCAATTAA 1140

AATGCATAAA TTAATCCCTA AATTGTTAGT TATTACCTTC ATAATAAATA GAATTGTGAA 1200

ATTTATTGTC TAAGGTGAAT AGTTGTGAGA AGAGAGTACG ATTATCAGAA AATTGAAAAA 1260

CAAGTGTTTA CCAATGGCTG TGAACTAGAA TCCAAAATAC GTACTACGTG AACAATGTGT 1320

AGTATGTATT TTGGTGTTTT CTTAATAACT AATTTTGAAT AATCATTCAA TTCTTTTATC 1380

AATAATTGAA CGTAAGACTT TTTCTCTTAA ATCCAATGAA TTAATCTTGT TATTTTGTTC 1440

ATATTCATGT ACTAACAAAT CACAAATATA AAGTTGTGAA ATTTTTCCAG CTAAAGAACC 1500

ACCATTTAAA AATTCTTCGA TTGCTGTTTG TAAAACTAAA TCTGCAGATT TTCCTATCGG 1560


IGAAl CaAAI A 


I AAi ICij i I A 






1 1M1 11 i 1 1U 


P H ZiTTTTT ZXZi 
Lrtn 1 X X 1 I nn 


1620 


TGAGTCATAA GTATCTTTTG TTTTTCCAGA TAAAGAAAAG ATAATAACTA AATCTCTATC 1680

TGTTAACAAA GAAGCAACTT GAGCTTGATA ATGAGGaTCT AATACAGCTT TAGCTTGTAT 1740

TCCTACCCTC AAAAACATAC TTTCCAAATC AAGACTAGTA TTcCACTAGA TCCTACACCA 1800

AAAATATATA nGcTGCtTGC tGatTAATAA GTTGGGATGG CTTTGGTnAA TTGCCTCCAT 1860

TAATTAAA 1868



(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3274 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 



TTCTAATCTT GAAAAAAACA GACATTTTTC CTGATGTCTG AATTATTAAC CAAAATACTT 60

TGAAAGTCAA ACTATTTTTC TTATTTTTTG AGAAAAAGTG CTGGAAACCG CATGAAATAA 120

TAGTTTGATA ATCAAAATAA ACTCGGAGGT ACAGTATGAA AAGAGTGGTC ATAACCGGCA 180

TGGGTGCCGT AACACCATTA GGCAATACCG TAAAAGAATT TTGGCATAAT CTAGTGGATG 240

GCAAGCTAGG AATTGGCAAA ATTACTAAAT TTGATtCTGA AGATACTGGG GTGGCTTTGG 300

CTGGTGAAGT GAAAGAGTTT GACCCAAGTG CTGTTTTAGA ACGCAAAGAG CAAAAAAGAA 360

TGGACTTATT TTCTCAATAT GGCTTGGTCG CTGCTTTAGA AGCGTGGGAA ATGAGTGGTT 420

TAACAGAAGC AACGATTGAC CCGACTCGTT TAGGTGTGAT TGTCGGTAGT GGGATTGGCG 480

GGATGACTAC CTTACAAGAC CAAGTACGGG TAATGGATAA AAAAGGCGCA AAACGCGTAA 540

CACCATTTTT CGTGCCCATG GTAATTGCTA ATATGGCAGC GGGAAATATT TCGATTCGTC 600

TAGGTGCAAA AGGACCTTCT CAAACTATCG TGACTGCTTG TGCTTCTGCG ACGAATGCGA 660

TTGGTGAAGC TTTTCGCACG ATTAAATATG GTTTAGCGGA CATGATGGTA ACTGGGGGGA 720

CAGAAGCCAC AGTATGTGAA ATTGGGATTG CAGGCTTTGC AGCATTGAAT GCTTTAAATA 780

CAACAGAAGA TGCGACTCGG GcTTCGATTC CTTTTGATAA AGAACGAAAA GGTTTTGTGA 840

TGGGTGAAGG CGCAGGAATG CTGGTTTTAG AAGAACTCGA GCATGCACAG AAACGTGGCG 900

CAACTATCTA CGGCGAAATT GTTGGATATG GCAGCAACTG TGATGCTAGC CACATGACCG 960

CTCCTTTAAA AGACGGTAGT GGCGCGGCAG CTGCGATGGA AATGGCGATT GCAGAAGCAG 1020

GGATTACACC GGAACAAATT GGGTATATCA ATGCTCATGG TACGTCAACC CCAGCGAATG 1080

ACGCTGCGGA AACAACGGCC ATCAAACGTG TCTTTGGTGA GCGAGCATCG CAAATTCCAA 1140

TTTCAAGTAC AAAGAGTATG ACTGGACATT TATTGGGCGC GGCTGGCGGT ATCGAAGCGA 1200

TCGCTTGTGT GCAAACATTG CAAGAAGGAA AAGCTCATCC GACAGTAGGC TACCAAGTTG 1260

CTGATCCAGA TTGTGATTTA GATTATGTCA CAGAAGGTGC ACGTGACATT ACAGCGGACT 1320

ATACAATTAG TAATTCGTTT GGCTTTGGTG GCCATAATGG TGTAATTTGC TTGAAGAAAT 1380

GGGAGGAAAA CTAGTATGAA AAAAGTAATG ACTGCAACAG AAATTATGGA AATGATTCCT 1440

AATCGCTATC CGATTTGTTA TATTGATTAT GTGGATGAAA TTATTCCAAA TGAAAAGATT 1500

ATTGCAACAA AAAATGTGAC AATTAACGAA GAATTTTTCC AAGGACATTT CCCTGGAAAT 1560

CCAACAATGC CAGGCGTTTT GATTATTGAA GCATTGGCAC AAGTAGGTTC GATTTTAATC 1620

TTAAAAATGG ATCAATTTGA AGGTGAAACA GCCTATATTG GCGGTATCAA CAAAGCCAAA 1680

TTCCGTCAAA AAGTGGTCCC TGGTGATGTC TTGAAATTAC ATTTTGAAAT CGTCAAATTA 1740

CGTGACTTTG TCGGCATCGG CAAAGCGACT GCTTACGTGG AAGATAAAAA GGTCTGCGAA 1800


TGTGAATTGA


CGTTTATTGT 


GGGACGATAA 


GAATCAATCA 


TV rp rp rn rp 7\ rn y\ 


Lb 1 1 Ab J. J. 1 b 


1860


CCTAAGAAAA 


GGAATAAGTC 


ACAATGGCTT 


ATTCCTTTTT 


TV 7\ TV T"* A OP A 

AAAT1 1 Al Al 


AAA 1 AALAL 1 




AACGCAAAAA 


AACATGCTAT 


AATCATAAAA 


AGCTTATGTG 


AAGGlbb 1AA 




1980


AAAACAATTT 


ACAGGAGGGT 


TTTTCTTTTG 


GATATTTCAG 


TAGAGTTTTC 


GbCa Ab A 1 AAA 




TTGGCGAATG 


TGTTAATGAA 


CGCATCAGGC 


ATTCATTGTA 


TGACAATTAA 


GGAAA 1 bbAb 




GAGTTAGCGG 


CTTCTCAAGC 


AGGTGCTTTT 


GTCGCTAAGA 


CGGCAACACC 


TAATCCTbbT 


2160


CAGGGCAATG 


AGGAGCCTCG 


GTATTTTGAT 


ACACCATTAG 


GAAGTATTAA 


CTCAATGGGC 


2220


TTGCCTAATT 


TAGGGATTGA 


TTATTATTTA 


GATTATCAGA 


TTGCACGTCA 


AAAGGAATTT 


2280


CCTGAGGAGT 


TACGCTTTTT 


ATCTGTATCT 


GGTATGAATT


ATGAGGAAAA 


TATCGCTATT 


2340


TTAAAAAAAG 


TGCAAGAAAG 


TGAATACACA 


GGAGTGACAG 


AATTTAATTT 


GTCTTGTCCC 


2400


AATTTGCCTG 


GTAAACCGCA 


AATTGCTTAT 


GATTTTGAAT 


TAACAGAAAA 


GCTTTTAACA 


2460


GAAGTTTTCC 


AATTTTTTAC 


TAAACCATTA 


GGTGTAAAAT 


TACCCCCATT 


TTTTGATATT 




GCACATTTTG 


ACGCAATGGC 


TGAAATTTTA 


AATAAATTCC 


CTTTAGTATA 


CGTGAATAGT 


2580


ATTAATAGCA 


TCGGTAATGG 


TyTaTATATT 


GACAGTGACA 


AGGAmGAAGT 


GGTcATTAAA


2640


CCAAAAGGAG 


GCTTCGGTGG 


ACTGGGCGGC 


GAATATGTCA 


AACCAACAGC 


GTTAGCCAAT 


2700


GTTCGTGCGT 


TTGCGCAACG 


TTTGAAACCA 


GAAtCAAAtT 


ATTGGAACGG 


GCGGTATTAC 


2760


ATGTGGAAAa 


GATGTTTTTG 


AGCATCTTTT 


ATGTGGTGCG 


ACATTAGTAC 


AAGTCGGCAC 


2820


ACaATTGCAT 


CAAGAAGGTC 


CACAAGTTTT 


TGAGCGTCTA 


GCCaAAGAAT 


TACAAGAAAT 


2880


CATGGCAGCA 


AAAGGTTATG 


AAAGTATTGA 


AGAATTTCGT 


GGGAAATTGA 


AAGAGATGTA 


2940


ACAAAAAAAC 


CGTCAAGAAA 


TTTTCTTGAC 


GGTTTTTTGT 


TATTTCACAG 


/-1 rp rp T\ 7\ /"» 7\ rprn 

CCTlAAbAi 1 


3000


TTCCATAAAG 


GCTTCCAAAT 


ATTCAGGTTG 


TGTTAATTGT 


rp /-»/"■ rp rp 71 pTipp 

TGGTTALAbb 




3060


ATGGCAAATG TAATTGCCTT TTTTCGTGTA GGTTCCGTCG CCGCCTGATT TGGTGGTAGC 3120

TAGGAACATA GACACATTCG TGATTGTTTG ACAGATGGAG CAAACACCTT TAACAATGGT 3180

GGGTGAAAGG GTGCCGTGGA CGCCGATTAA XTTATTTTCT CGATAAGCTA AGATAAATAT 3240

TTTCTGAGTC CCAGCGTCGT TCCAACCGAC ATAG 3274


(2) INFORMATION FOR SEQ ID NO: 351: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 
TnGGAAACCC AGACTTATTA ACTTTTAnGA TTTTCCACTT nGCATTCGTA GAACCTTTAC 60

TCAACCAATC TAATCGAGTC TtTCAATAAG CAAATTAAAA GATACAGCCG TAGAAAAGAG 120

CAGTTTCAAA ATGAAGAATC ACTAGAACGC TTTCTAGTCA GCATTTTTGA TACATACAAT 180

CAAAAATTTC TAAACAGAAG CCATAAAGGT TTTCAACAGG TAACCGATAC ATTAGTTTCA 240

ATGTTTACTG AGTAACTAAT TATTTTGCAG GAGGACAATT TATTTACACA AAATTATTGA 300

CGCTCCCGTT AACTTTTGAA TCCCTAATTG ACTAGCTGCA TAAAATAGTA AATATTTTAT 360

CGGATTTTGC GCTAAATAAA TAATAATACT AGATTTTCTT AAAAAGTCAC TTCCAGGAAT 420

ACGTATATTT TCTAAATTAA AACTTAAAAT AACCACTAAC ACAGCAACTA TTGGTAACGT 480

AAAATAATTT TGTGCATCAT AAAAAACACC TAATTGTTGC GGAAATTTCA CTTCCTTTAG 540

TCCTACATAA GAAAAAATAA CAATACCTAA AACTAAGAaT AAATTATTTA AAGGCTTTTC 600

AAATACATAG TTTGCTAAAA AATAGCCTAT AATCACATAA CAAATCGTAG ArATCATCGA 660

AAAgATATAG AACCATAAAA AGGCACTTGT TTAATCCATT CAATGTCAAA GTACCACTTA 720

CCATAAGTAG AAGTAAATAC TCCTATCAGA AAGCTAACAA TTCCAATTGT TAATAATACT 780

TTCTCACCAA AATAACTTCG GATGTAATAA ATAATAAATA CACTCATTAC TAATGCGATT 840

AAATACCAAG AAACACCTAT TCTGGGGACA TAGCCTAATA ATATGCTCAG TATAAATGAT 900

ATTTTTCCTC TTAAATCCAT TTCCCCTTTA ACAAAATTcA TAATAGATAC GATAAAGAAG 960

GGAATGTAAA aGATCAACCA CGAAAGATAT ACTTTTAAGA TTCTTTTTAT ATAATTGAAT 1020

AGTATTTTTT TATCTTTTTG TTTTTCTCTA AAAGAAATTT TTTTAAACAA AAAATATCCA 1080

GATACTATGA AAAAAAACGG AACAGAAAAG CGGGCTAATA TTTTCAAATC ACTACCCAAA 1140

ATAGATTTAG AGGGAAAGGT ATGAATTCCA ACTACACTTA ATGCAGCAAT AAATTTGAAG 1200

GCATCCAATA CATCATTGTT TTTAGTCATT TTTCCCCTCT CTATTCTCCC ATTTTTCTTC 1260

AGATTTCATG ATAGATAATT CTTGCAATAA TGTTTTAATG CGATTTTCGT GTTTTGAAAT 1320

CAACACAGAA TTTTTAATTT CCATGATTAA CAGGACAATC ACCGCAACTG ATAATAAGAA 1380

GTTTGATGTT GTTTCAAACC CAAAGGcCAT TGCTAACCAT TCTGCCACAT GCGGGnAAAT 1440

aGCaAAAnTA ATTAaGACmA TTGAGATTAA TAmCCAAATC AAAGCATTTT TTAGGAGAAA 1500

AATATTTTTA TTAATCCCtC GAATAATGTA TAAGAAAAAG CCAATTGCAA ATAAAAATAA 1560

AGCTAGCCAT AACACGGTTG GTAGCATTAG TCTCCCTCCT TCATAAAGGC AGCAATCAGA 1620

ATTGCTGAGC CGACTTCCAG CATATACTTC ACAGAAGCCA AGGCACGAAT CGAAGAAACA 1680

CCACCAAGCC GTTCCATCAT ATTGACAGGT CTTTCAACAA TCACAAAGCG TTTTTTAATT 1740

AAGTGAACAA TCGATTCTGG CTCGGGATAA TTTGTCGGAT AACGCTTTGC AAAAAAAGCA 1800


ATCAC ii TCC 


GATTTCCCGC 


TCGATAACCA 




p a t Zi 71 A T T C T 


TTTATPAftAA 


1860 


GCCATTCGAA TGCAAAAAGA TAACAAACGA ATACCAAAAC GGCGCATTTT CGTACTTTGA 1920

AAAGACGCTT CGTTTCCAGG AATGAAACGA GAGCCAATCG AAAAATCACA TTTACCCTCT 1980

GGTAATGGTT CCAGTAAAAT AGGTAAACTA TTGATATCAT GTTGACCATC ACCATCAAAC 2040

TGAACCGCAA CATCATACTC ATTTTCTAAG GCATATTTAT AGCCGGTCTG TACCGCCCCG 2100

CCaATACCTA AATTCaACAC TAAATGAATG GCATTAATTT GGTTGGCTTC TAAAATTTGT 2160

TTGGTTCCAT CTGTCGAGCC ATCATTAATT ACCACATAAT CCAATTCATG TTGAAAATGC 2220

GTAACTTCCT GTTTAAATGT TTCAATGGAA GCAATCGTCC GCAAAATGTT TTCTTCCTCA 2280

TTGTAAGCAG GAATGATTAA AAGTACCTTC ATAAAGAGAA ACACCTTTCC ACTTTTTCTC 2340

TAACAAAACC CCTCACTTAC ACAAAACAAT TTATTGTTGA ACATTTTCTA ACATTTGTGC 2400

TAAGGCTTCT TGCCAAGTAG GAATTTTAAA GCCTAACGCT TCTGTTTTAC TTAAATCCAT 2460

GACAGAATAT TGTGGTCTTT GCGCCTTTTG TGGAAATTGA GTTGAATCCA CTGGCAATAC 2520

TTCCACTTCA GTATCTTTTA AAATTTCTTT AGCAAACTGA TnACcAGCTA CAGCTGTTTT 2580

CATTAGACAA ATGATAAACA CCAAATGGCG CTTTTTCTGC AATAACAAAT GCCATAAATT 2640

CGGCTAACGT TCTTGTCCAA GTTGGGCGGC CAAACTGATC GTCCACTACC TTTAGTTGGT 2700

CTCTTGTTTC CGCTAATTTT TGCATGGTAA AGACAAAGTT ATGACCGTAT TGACCAAATA 2760

CCCATGATGT TCGAATAATA TAATAGTCGT CCAGGATTTC TTGGACCGCT TGTTCACCTA 2820

ATAACTTCGT CCGACCATAC TCATTTAAAG GATTCGGTTG ATCATCAATC GCATAGACGC 2880

CTTCTTTTTT AGTTCCATCA AAAATATAAT CTGTACTTAC GTAAACAAGT GTCGCCCCCA 2940

CTGCTTTAGc AcTTCTGCAA CATGACGTGT GCCATCTACA TTCACTTTTT CATCTAATTC 3000

CTTGCCTTCG TCTTCTGCTT TATCAACAGC TGTATAAGCA GCACAATGAT AAATGACGCT 3060

TGGTTGAATT TCTTTCACTT TTGCCAACGT ACTATCCGCA TCAGTAATAT CTAATTCTGT 3120

GGAATCCGTT GAAACATACT CAATTCCTTG TTCGTCTAAT AGGTGACGTA ATTCTGTGCC 3180

CAATTGGCCA TTTCCACCAG TAATTAAAAT CATTTTTTTC GCTCCTTCTT GAAAAAAAGG 3240

CAGGGCAAAG TAGCTTTTGT ACTTTGCCAT CGCCTTTCGA CTGTTATCTC ACATGTCAGA 3300

AAAAATTACT GACCATTTTG TGCATATTTT GCTTCAACAG CTTCTTTATC TGCACGCCAC 3360

CAGTCTTCAT TTTCTGTATA CCACTTGATT GTTTCTGCCA AGCCTTCACG GAAATTCGTG 3420

AATTCTGGTT CCCAACCTAA TTCTTCGCGT AATCTGGTTG AGTCAATCGC ATAACGTAAG 3480

TCATGTCCTG CACGATCGTT GACATGCTCA TAGGCATCAA CAGGTTGTCC CATTAATTCT 3540

AAAATTAATT CCATCACTGT TTTATTGTCT TCTTCACCAT CGGCTCCAAT TAAATATGTT 3600

TCGCCAATTT GCCCTTTTGT TAAAATTGCC CAAACAGCCG AAGAATGGtC GTTTGTATGA 3660

ATCCAGTCAC GAACATTTTT ACCAGCACCA TAAAGTTTTG GTGTAATGCC ACTTAAGACA 3720

TTGGTAATTT GACGAGGAAT AAATTTTTCA ATATGTTGaT ATGGcCCATA GTTATTTGAA 3780

CAATTTGAAA TTGTTGCTTG TAAGTTGAAA GAACGAACCC ACGCTTTGAC TAACAAGTCA 3840

GATCCTGCTT TTGTTGAAGA ATAAGGACTT GAAGGATTGT AAGGCGTTTC GGCAGTGAAT 3900

TTTTCCCcTT CGCCTTCTCC ATGTCCTGGT AAATCTTCTC TTAAAGGTAA ATCGCCATAA 3960

AGTTCATCAG TCGATACATG GTGGTAACGA ACATTGTTTT TACGGCAAGC TTCGATCAAT 4020

GTGTACGTCC CAATCAGATT GGTTTGTACA AATGGGAATG GGTCGTTTAA TGAGTTGTCG 4080

TTATGCGATT CAGCAGCATA GTGAACAACC GCATCTGTTT CAGCGACTAA ACGATTAACT 4140

AATTCAGCAT CAGCGATATC TCCTACGACT AATTCCACGC GATCACTTGG TAGACCTTCA 4200

AGATTTTTTT CATTTCCTGC ATACGTTAAT TTATCTAATA CAGTTACATG AACTTCTGGA 4260

TGATTCTTTA CAACATAATG AACAAAATTT GAGCCGATAA AACCAGCTCC ACCTGTTACG 4320

ATGATTTTTT TCATAATTGA AACTCTCCTA CTTTTTTCTT AAATTTCGCC ATAGATAAAC 4380

GGATTTTCCG CTTCAAATTC TTTTAAGGTT GGATGTTTTT GATCCTTTTC TGACGTAATC 4440

GCTTTTTCTG GCGCAATTGG CCAATCAATA GctAACGCTG GATCGTTAAA AGCAATACCG 4500

CCGTCTGCTG CCGCATTGTA GTAATTGTCA CACTTGTACA TAAAGTTCAC GTTCGGTGTC 4560

AAAGTGACAA AACCATGGGC AAAACCTTTT GGTACTAAAA GCTGACGGTG ATtATGTTCT 4620

GATAAAATGT AGCCTTCCCA TTGACCATAG GTTGGACTCC CTTTACGAAT ATCAACAATC 4680

ACATCTAATA CCGCACCTGT TACAACGCGA ATTAGTTTTG TTTGCGCmTT CGCCTTTTTG 4740

AAAATGCAAT CGACGCAAGA 4760



(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1028 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 
AACCGGGAAA GnTCTAAAGA TTCTTTCAAC TTACAGGAAT TTTTTTAGGC TTTTCTTGGC 60 

AGTTTCnATG AAAATGCCAA GTAAACCTAG AAGGGATGGT AACTGATTAA TTGAAAAGGT 120 

AGGGAAAGAG CCTGGATGTT GAATAAAGTC GACCTGAAAA AAGGGTTAAG TACAGAAGAA 180 

GTTGCCAAGC AGAAAGAGTT AGGCCTTCAA AATAATTATG AGGAAAATGT AGCAAAATCT 240

ACAAAAGATA TTATTTTTGA TAACGTCaTG ACGTTATTCA ACTTTTTAAA CTTTGCAATC 300 

GCTGTTTGTT TACTATTTGT CGGAGCGTAT TCCAATCTGG CTTTCTTAGC GATTATTATT 360 

GTGAACATGT CAATCGGGAT TTTCCAAGAA ATTCATGCCC GTAATTTAGT TCAAAAATTA 420

TCTATTGTGG CAAAAGAAAA TGTTCATGTT GTTCGAAATG GCGTACAACA AGAAATTGAT 480

ACAAAAGAAC TAGTTATGGA AGATATTGTT ATTATCTCTG CAGGGGAACA AGTTCCTTCT 540 

GATATGGAAG TTATTGACGG TAAAGTCGAA GCGAATGAAG CGTTGTTAAC AGGTGAATCA 600 

GATTTAATTG AAAAGGAAAT CGGCGATACC TTGCTTTCTG GAAGTTTCAT TGTTAGCGGA 660 

CAAGCCTATG CCCGTGTTAT CCACGTTGGT GCCGAAAACT ATGCTGTGAA AATCACCCAA 720 

GAAGCTAAAG TGCATAAACC AATTCAATCA GAGTTAGTCA ATTCGATTCG TAAAGTCTCT 780 

AAATTTACAA GTTGGGTGAT TATTCCTCTA GGGATTATTT aTTTGTTGAA GCTTTTTGGC 840

TAAGAGaTGC TGGCATTAAA ACATCTGTTG TTGCTTCTTC AGCTGCTTTG TTAGGGATGT 900 

TACCAAAAGG ATTAGTACTA TTAATTAGTA TTGCGCTGAC AACAGGTGTC ATTAAGTTAG 960 




CCAAAAAACG TATTCTAGTA CAAGATATGT ACTCAATTGA AACATTGGCC CATGTGGACA 1020 
CTTTGnTT 1028 
(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2749 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 



GCGgTGGCTT AGGACTCGCT ATGCTACTAG AGAGACCTCA GCAAAAAAAA AACAGCCGAT 60

TTTATCAAAT GAGTCCTGAG GAACGCCTGG CTTCTCTTCT TAATGAAGGC CAGATTTCTG 120

CTGATACAAA AAAAGAATTT GAAAATACGG CTTTATCTTC GCAGATTGCC AATCATATGA 180

TTGAAAATCA AATCAGTGAA ACAGAAGTGC CGATGGGCGT TGGCTTACAT TTAACAGTGG 240

ACGAAACTGA TTATTTGGTA CCAATGGCGA CAGAAGAGCC CTCAGTTATT GCGGCTTTGA 300

GTAATGGTGC AAAAATAGCA CAAGGATTTA AAACAGTGAA TCAACAACGC TTAATGCGTG 360

GACAAATCGT TTTTTACGAT GTTGCAGATC CCGAGTCATT GATTGATAAA CTACAAGTAA 420

GAGAAGCGGA AGTTTTTCAA CAAGCAGAGT TAAGTTATCC ATCTATCGTT AAACGGGGCG 480

GCGGCTTAAG AGATTTGCAA TATCGTACTT TTGATGAATC ATTTGTATCT GTCGACTTTT 540

TAGTAGATGT TAAGGATGCA ATGGGGGCAA ATATCGTTAA CGCTATGTTG GAAGGTGTGG 600

CCGAGTTGTT CCGTGAATGG TTTGCGGAgc AAAAGATTTT ATTCAGTATT TTAAGTAATT 660

ATGCCACGGA GTCGGTTGTT ACGATGAAAA CGGCTATTCC AGTTTCACGT TTAAGTAAGG 720

GGAGCAaTGG CCGGGAAATT GCTGAAAAAA TTGTTTTAGC TTCACGCTAT GCTTCATTAG 780

ATCCTTATCG GGCAGTCACG CATAACAAAG GAATCATGAA TGGCATTGAA GCTGTAGTTT 840

TAGCTACAGG AAATGATACA CGCGCTGTTA GCGCTTCTTG TCATGCTTTT GCGGTGAAGG 900

AAGGTCGCTA CCAAGGCTTG ACTAGTTGGA CGCTGGATGG CGAACAACTA ATTGGTGAAA 960

TTTCAGTTCC GCTTGCTTTA GCCACGGTTG GCGGTGCCAC AAAAGTCTTA CCTAAATCTC 1020

AAGCAGCTGC TGATTTGTTA GCAGTGACGG ATGCAAAAGA ACTAAGTCGA GTAGTAGCGG 1080

CTGTTGGTTT GGCACAAAAt TTAgCgGCGT TACGGGCCTT AGTCTCTGAA GGAATTCAAA 1140

AAGGACACAT GGCTCTACAA GCACGTTCTT TAGCGATGAC GGTCGGAGCT ACTGGTAAAG 1200

AAGTTGAGGC AGTCGCTCAA CAATTAAAAC GTCAAAAAAC GATGAACCAA GACCGAGCCA 1260

TGGCTATTTT AAaTGATTTA AGAAaCAwTA AAAAAcAGTT CAGCAGAAAT TATTCTGCTG 1320

AaCTAtTTTT TTTCACATTA gGTAGCCGTT TCAdGcCACg AATTGGTTTT ACTTTTAAGA 1380

CATCTAAGAA GAAAGTGAAA ACAGGGATTC CTACAATAAG TCCCCATACA CCAAAAAGaC 1440

GCTCACTAAC TAATAAAATG ACAAACGTAT AAAAGATTGG TAACTCTGTT TTACTAGACA 1500


TGAATTTAGG ATTTAGCACG TAAGATTCAA ATAAATGGAC GATTGTAATT AAAGCTAAGA 1560

TATAAATCAC ATCGTTCAAG CCACCTTGGG AATAAGCGAT GAAACTTAGC GGAATGCAGG 1620

AAATAATCAC ACCAGCAACT GGGACTAAGC TCAAAATGAA GATCATAATG GCTAAACTAG 1680

GTAGCTGAGT AAAGCCGATT ATCGCCAAGG CCAAGGTAGT AATGACTGTA TTCACCACAG 1740

CAATGAAAAA CTGAGCTTCC aTTACGaCAC CaAATGTGTT GACGAACTTG TTGGCAAAAT 1800

aATaGATATC cTGmnaAAAC CmATCmAAAT CACtTTTTAA AAwTAACTTr GAAAAATCAG 1860

CCaTTTGCTT CTTCTCAATC ATAAAGAAGA AACTTAAAAT AAAGGACATG ACAAATGATA 1920

AGCCGACAGC GCCAATATCT TGAATGTAGC GTAAAATCAT TGAAGCACCA TTCTGGAGTT 1980

GAGAGAGTAA GTTAGATTTT TCCAAGTATT GATCAATAAA TTGCAATACT TGATTTGTAT 2040

CTGTATGTGG ACTTTGATAA AAATCAACTA CAGAATTAAT CATTTGAGTC GTTTGGTTAA 2100

CTAGAACGGG AACATATTTA GTAATCGCTA AATAGATTAG AAAAACAACC AATGCATAGG 2160

TTAAGAGAAC GATGACAATC GTCGGGATTT TCATATAGCG TTGAACAAAA TGGACCAAAC 2220

GAACAGCTAA ATAGGTAAAG ATAAATGTTA AAAGAATTGT TGTAATCATA CTTCTGGATA 2280

AATAAAGCAC AAAAATAwTC AAAGCAAGGA CGCAAAAACG TCGCAAACGT TCATTTTGGA 2340

TAAATTTTGG TTACAAACTC ATAGGGGAAA CTCCTTCTTT ACTGGATAnT CAATCCTGTT 2400

AAGAATAACA TAACCATACC GATGGCAGCT AGGCCGCAAA CAATAATATG GCGGCGTTTG 2460

TACATCACTA CAATGGCGAT GAACTCTCGA ATAAACGCGT TAGGTAAAAA ATAAAAAGCG 2520

GTTTTTGCAC CAATGCCATC AGCTTTTAAG CCAGCCATTC GAGCGAAAAG ACCCGCCCGG 2580

AAAAGGTGAT AATTGTTTGT AGTAAAAATG GCTTGAAATT AGCATTTCCA AAATCCTCAA 2640

TCATTGTTTC TTTGGAAAAA CGCATATTTT CTAGTGTGTT TTTTGAATGA GCTTCTACTA 2700

AAATATCATC ATCAGGAATA CCTTGTGTCA AAGCATACAT TTTCATCGC 2749
(2) INFORMATION FOR SEQ ID NO: 354:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 793 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354:

GGACGCGATT GATTTTTACG AAGGAATGTA TAAAAACnCG GCAAcGaATT yCAcTCAAaT 60

TGACTGGCAT AACTATCaGC GGGCAAATTG GCGCGAGGTA GAAACAAAAG GCGACACGTT 120

GACATTGCAT TTTGCCAACG CGGGCGGTGC TGGGGGCTCA TACACACAAT TCACGAAAGT 180

TGGTACGAAT ACTGTTGTTG TATCATTTGA TGGCAATGCT GCTTATCCCG ATAACCCAAG 240

TAGCGTTTTA CTCGTGCAAA ATAGTGATTA TAAAGTTTTA AGGACTTTGA ATCAGTAATA 300

GGGGAAATAT ATCGTTTACT TAAACATAGG CAAAGGCGGG TTGCTTACCT TTTTATAAGT 360

AACCTACCTT TGCCTATGTG CTATTTTTTG GGATAGAGGT GCTCAGAGTA GGGGATGTTT 420

TTATGGGAAA TTTTTATGCT AGATCTCTGT TTGACTAGGC TCAAATAGAT CTGAATCAAT 480

CGTTGAAGTG aTATCTGTTT ATATAAAGAT AGGCTAAATC GTCAATTTTT TACAAATCGT 540

CGTtCATGTT AAAATAAATG GaGAAATGTT TTAGTAwTCA TAGTAAGTAT TTtATCGTAG 600

AATAAAAGAT ATACTAACAA nnGGAGCGAT GAAAATGCTA AAAGAAGAAA GATTAAAGGC 660

GATAGTCTCT CTAGTAGATC AAAAAGGGGC AATCAAAGTT ACTGAAATTA TGGAACGATT 720

AAATGTTTCG GATATTGACT GTTCGTAGAG ATTnTGACAG AATTTGGAAG CCAGCGGGGC 780

GTTTTnAAAC GAG 793
(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 
CAGTTGGCAG TGTTTCGCTA CATGACAAAG ATATTCCGTT ACAGACAAAA ATTTATTATG 60 

AAGTGAAATC TTCCGAACGT CCAGCTAACT ATGGCGGAAT TACCAAAGAA TGGGGCATGA 120 

ATGATGTCTT GGACACGACC CATGATCGTT TCACAGGGAA ATGGCACGCT ATTACGAATT 180 
ATGACCTTAA AGTAGGGGAT AAAACGTTAA AAGCAGGAAC AGATATTTCT GCCTACATTC 240

TTTTAGAAAA CAAAGACAAT AAAGACTTAA CGTTTACGAT GAATCAAGCA CTATTGGCCG 300 

CCTTAAATGA AGGAAGCAAT AAAGTAGGCA AACAAGCTTG GTCTGTGTAT CTGGAAGTCG 360 

AACGGATTAA AACAGGTGAC GTAGAAAACA CGCAAACAGA AAACTACAAC AAAGAGCTGA 420

TGCGTTCTAA TACsGTGGTG ACrCATACGC CTGATGATCC AAAACCAACC AAAGCCGTTC 480 

ATAACAArAA AGGGGAAGAy ATTAATCATG GAAAAGTGGC TCGTGGTGAT GTTCTTTCTT 540 

ATGAAATGAC TTGGGACTTA AAAGGGTACG ATAAGGACTT TGCCTTTGAT ACAGTCGATC 600 

TTGCGACAGG CGTTTCTTTC TTCGATGATT ACGATGAAAC GAAGGTGACA CCAATCAAAG 660 

ACTTACTTCG TGTCAAAGAT TCTAAAGGGG AAGACATTAC GAACCAGTTC ACGATCTCTT 720 

GGGATGATGC CAAAGGCACG GTGACGATTT CTGCCAAAGA CCCACAAGCC TTTATTTTGG 780 

CGCATGGTGG GCAAGAATTA CGTGTAACTT TACCAACAAA AGTTAAAGCC AATGTTTCTG 840 

GTGATGTGTA TAATTTAGCG GAACAAAATA CATTTGGTCA ACGAATTAAA ACCAATACCG 900 

TTGTCAACCA TATTCCAAAA GTGAACCCTA AAAAAGACGT GGTTATTAAA GTCGGTGATA 960 

AACAAAGTCA AAATGGTGCC ACAATCAAAT TAGGGGAGAA ATTCTTCTAT GAATTTACAA 1020 

GTAGTGACAT TCCTGCAGAA TACGCTGGTA TTGTGGAAGA ATGGTCGATT AGCGATAAAC 1080 

TAGACGTCAA ACATGACAAT TTAGTGGCCA nGGTCTGTGT TTGCCAATTC TAmkTTTGTy 1140 

TTAGCAGACG GAACCAAAGT GAATAAAGGG GACGACATTT CGAAACTATT CACGATGACC 1200 

TTTGAACAAG GGGTAGTGAA AATCACGGcC AGTCAAGCCT TTTTGGATGC GATGAATCTA 1260 
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AAAGAAAACA 


AAAACGTTGC 


GCACTCATGG 


AAAGCGTTCA 


TTGGTGTAGA 


ACGAATTGCG 


1320 


GCAGGAGACG 


TTTACAACAC 


GATTGAAGAA 


TCTTTCAATA 


ATGAAACGAT 


CAAGACTAAT 


1380 


ACGGTAGTGA 


CGCATACGCC 


AGAAAAACCA 


CAAACACCAC 


CAGAAAAAAC 


AGTGATCGTG 


1440 


CCACCAACAC 


CAAAAACACC 


ACAAGCACCA 


GTAGaGCCAT 


TAGTGGTAGA 


AAAGGCAAGT 


1500 


GTAGAGCCAG 


AATTGCCACA 


AACAgGCGAA 


AAmCAAAtGT 


CTTATTAACG 


GTAGCTGGAg 


1560 


TTTAGCTGCn 


ATGCTTGGCT 


TAGCAGcTTA 


GgCTTtAAaC 


GTAGAAAAGA 


AACAAAATAA 


1620 


TTGAAGACGA 


mArGAaGGAC 


GAaCTGATTG 


TCCTTCTTTT 


TTTAGTTATG 


GAGGGGAAaC 


1680 


ATTGGAAGCA 


GTAGTAGTnG 


AAgAGAGGCA 


AAGGGGATGA 


AAGAAATTGC 


CATCCAAGAA 


1740 


AAAGATTTGA 


CCTTACAGTG 


GAGAGGAAAC 


ACAGGTAAGT 


TAGTTAAAGT 


TCGATTrAAr 


1800 


AATACACGTG 


CAATGGAAAT 


GTGGTACAAC 


AAACAAATTA 


CCGAAGAAAA 


CATTCAAGAG 


1860 


ATCACCACGT 


TGAATATyAT 


TAAAAATGGA 


AAATCTTTGG 


CATTAGAAGT 


ATATCCAGAA 


1920 


AAAAGTATCT 


ATGTGAAACC 


AAATTTAGGC 


AGAATCAATG 


TGCCTGTCTT 


TTTTATCAAA 


1980 


ACACCTATTA 


ACAGAGGAGT 


ATTTGAAGAG 


ATTTTCGGCG 


AAACGTTAAA 


AGCATAAGTA 


2040 


AAGGAGAAAG 


AAAAGCCAAT 


GTATGTATCT 


TTTGTTCTAA 


GTTGTTTGCT 


TGGATTTTCT 


2100 


GCCTACAGTC 


TATTAAATAG 


GCTAAATTCG 


TTGGAATTTG 


TGGATGTTTG 


GTTAGACAAA 


2160 


GAAACACAAA 


AAATCACACT 


AAAACGCTGT 


TTTTATGATA 


CGTCTTTCAA 


GAAACAAACA 


2220 


CTAAAAGAGT 


TAGAACGAGT 


ATATTTCCAA 


TTAAAAGAAA 


TAATCAACGT 


GCAAATAAAC 


2280 


AAGCGGTCTT 


TAAATACGAA 


TGACATACGT 


AATGTACGAG 


AACTAGAGGA 


AAAACAACAA 


2340 


GAAATAAAAC 


GATTCATGTT 


AGACGTTTTA 


GAAGATGCTT 


ATTGGAAAGA 


ATTAGCAAAT 


2400 


ATGCCAGAAG 


ACCAACGACA 


CTTAGACGAT 


TGGGATTTCT 


TTTGAAATTG 


GTTACTGAAA 


2460 


GGAAGCGATG 


GATTACAATT 


TGTTTGATTT 


TTAACCAGCC 


ATGCTAAGTT 


TTTCCGTTTC 


2520 


GGTTCCTTTG 


GTTTTTTTGT 


TAGCGATAAT 


AAAAAAATGA 


TGCCTGACAC 


AAAAATATAG 


2580 


GAGGGAAAAA 


AT GAT T ATT T 


TATCAGGACT 


TTTTATTTTA 


GGGATAGGAA 


TAGTTGGAGG 


2640 


TTATCAGCTA 


GCCACGTTAC 


CTAAATTAAT 


TGAAATGAAA 


CAGCACAAAG 


CAATACAGAA 


2700 


CCATTTCAAT 


GTTAAAGGAA 


ATGAGTACAC 


CTATTATCAA 


GAAGACAGTG 


AAAATTACAT 


2760 


TTTATCCTTG 


GAAGATACAG 


AATATCGAAT 


TAAATTTTCC 


AAGAACACCC 


CGTTAAAAGT 


2820 


GGTATTCACT 


GAAATTTTAG 


AACCTATGTA 


AGGGTGAGTT 


TATGTATATC 


TCTTTTGATT 


2880 


m T\ OTl TV /""■/■"• TV TV 

TAGAAGCAAC 


TV /""• TV IN 7\ 7\ 7\ 

ACAAGCAGAA 


TV TV TV Tvnt7\/**/"* , nt.« 

AAAATAGCTy 


TACCwGTCCA 


TCAALGATl t 


TCGCTyGTTC 




AAGGTCTTTT GyTACTTGAA CAAGCCAATC AAGAAGTTTT GGCCACAAAA CAAGGAGGCA 3000

CGATAGCCTA TTTTTTATAT ACAGGAAAAG AACAAGAACT ACCTATTTTA GAAGCGGAAC 3060

TTTTATTACC GGTCAAACAA AAGGATATGT TAGACGTATT rCTTGAACCA GTCACAAATG 3120

ATCCAACGCT TGATCCAATA GAAGTAAAAA ACTTCTTACA GTTAGTTAAT CAATCGTTGC 3180

CAAAGAAGCA ACGAGCTAAA AGTAACAAAA AArGTACAGA AAAGAAGGmA ACTAyTGATA 3240

CAAAACCAGT GAAGAAAAAg TTTTTTTgTC CGTTAGTCTT ATCACACTTr TTGGATTGAT 3300

TCTACTTATG AGTGGTTTTT TATTAGGGAA AACCAATTCT ACACCTTAyC AAGAAAAAGA 3360 

ATTAGAGGTr AAAAAAGAAC TCAATACCTT AACAGAACAA GTCAATGAJcC AAAATAAAGT 3420

AGAAACATTT ACTCGATTCT TTCTAACTAA TTACTATTCA GGAAAAACAG ATGCCAAAAC 3480 

CAGACAAGCA GTTTTAAAAA AGTTTGTAAA AAAAGAAAyC TTATCTGATT TTTTATCGGA 3540 

GAAAAGCCGA GCGAGAAGTA TGTTTCCCTG GGAAGTAAAG AAGAAAGACA AACAATGGAC 3600 

GATAGyGTTC ATTGTTGTAT TAATCAATGA AAAAGmCGAG CAATCGACAA AGAAAATTAC 3660 

GTTTACAGCT GAAGAAACAA AAGAACAAyT GGTCGTAATG GAACGACCGA CAGAAGAAGA 3720 

TTTTGAACTT ACAAATTAAA ATTAAAATGG AGGAAATAAT GATGAAAAAA ATTCTTTTTG 3780 

CTAGTTTATT TAGTGCCACA CTACTATTTG GGGGAAGTGA AATTTCTGCT TTTGCACAAG 3840

AAATTATCCC TGATGATACT ACGACACCGC CCATTGAAGT ACCAACAGAA CCAAGTACAC 3900 

CAGAAAAGCC AACAGATCCA ACACCGCCAA TTGAGCCACC TGTAGACCCT GTAGAGCCAC 3960 

CTATTACACC AACGGAGCCA ACAGAACCGA CAGAGCCGAC AACACCAACA GAACCTACAA 4020 

CTCCTACAGA GCCAAGTGAA CCAGAACAAC CAACGGAGCC AAGTAAACCA GTAGAACCTG 4080

AAAAACCAGT TACACCAAGC AAACCAGCAG AACCCGAAAA AACTGTGACA CCAACTAAAC 4140

CAACAGAATC TGAAAAACCA GTACAACCAG CAGAACCAAG CAAGCCAATC GACGTTGTTG 4200

TAACGCCAAC AGGGGAATTA AATCACGCTG GAAATGGTAC ACAACAGCCA ACAGTCCCTA 4260 

TTGAAACAAG TAATTTGGCA GAAATCACGC ACGTGCCTAG TGTAACAACA CCTATTACAA 4320 

CTACAGACGG AGAAAACATT GTAGCTGTAG AAAAAGGTGT TCCACTTACA CAAACAGCAG 4380

AAGGGTTAAA ACCTATTCAA TCrAGTTACA AAGTATTGCC TAGCGGAAAT GTAGAAGTAA 4440 

AAGGTAAGGA CGGTAAAATG AAGGTTTTAC CATACACAGG TGAAGAAATG AATATCTTTT 4500 

TATCTGCCGT AGcgGtATCT TGTCTGTAGT ATCTGGGTTT GTCATCTTTA AAAAACGCAA 4560

AGCTAAAGTA TAAGCTAGCG TATAGATAGG AGCGTACCAT GAATAArAAA AGAATTrTCT 4620

TTGGGCTTCT CTCTTTTTTC TTGCCTATAT TTTTAGTCTT TGGAGGCTTA CTTTTTTTTC 4680

TTTTATTATT AACGAGTACG TCAGATACTT CAAAAAATGA TTGTATTCAG CCAAGTATAA 4740

ATAATCCAAC TGATGCGACA GATACACCTA AATCGATCGA GCAGTTTGTA AAAAGCCATA 4800 

AAGATGCTTA CCTTTTATCA TGGAAAGCAG GTGGCTTTTT ACCGTCTGCT AGTATTTCTC 4860

AAACGATGGT AGAAAATGGG TTTAATTTTA CTAATCCATC GGGGACGTCA TTTTGGCAGG 4920

CrCACAATAT GGGCGGTGTT AAAACGTCAA AAAAAGAAGA TTTTCCTGTA ACTTTAGCAA 4980

CATTCGGCCA AGATTCTGTT GATATTTCTG GTACAAAGCC AGGGTCAAAC GTCGGTGATG 5040

GCACTGGTGG GGCATATACC TGGTTTAAAG ACTACAATGC TGGAATTGTT GGAAAAGCAG 5100 

AATTTATGGC ACACCAGACA CTGTATACAG GTGCTATCAA TAATACTGAC GGATTAAGTA 5160 

CTTTATCAGC TATTTATTCA GGAGGATGGG CTACAGACCC TACTTACCTC ATGAAGTTAC 5220

AGGCCACATA TAATAGCTTA GGCAAGCAGT TTCAATGGTT GGACCAAGAA GCAATACAGA 5280

AATATGGTAA CGCGCCGTTT AAAAAGAGCG AACTTGTCCT AAATATTCCC GGAAAATCCC 5340

CAATCACAAA CGAAAAATAT GGTAAAAATT CTGATTGCGT TGTTACTTCA GATACATCCG 5400

ATCAAGTTAC AGGACAAAAT ACTGCCCCAT CATTAGAAGT ACCAAGTGCG TATAAAGGAA 5460

AGTTAACACT TCCGCCTATC GATTCGAATG ATTACGCTGG AAATAACTAT CCTTTTGGAC 5520 

AATGTACGTG GTACGCCTAT AATCGTATGG CGCAAATTGG CAAACCAATT GAATGGTTTT 5580 

CAGGTGATGG CGGAAATGGC GCAGGTTGGG CTAATTCTGC TCGTGCGAAA GGGTACACAG 5640

TAGTAAAAGT AAAACCAAGT GTTGGTTGGG CAGCTAGTAT GCAAGGTGGC ATAGGAGGTT 5700 

CTGCACCGCC CTATGGGCAT GTGGCAGTTG TAGAGTATGT CAATTCTGAC GGCAGTATTC 5760

TTGTGAGTGA AGCAAATGTT ATTAATCAAG GCTCTGGTAC GCGTTCATGG CGAGTATTAG 5820

ACAGAGCAAC GGTTGAACAA ATTGATTTTA TTCAAGGAAA GGGAGCGTAG CAATGAGGGA 5880

GCGTTTAAAG CAAGCACAAC CACTAGTGAT CGTTCTTTTA CTTGTTTTAC TGACCACTTT 5940

AGGAGGCGGT TATTTCAATG AGAAAAAAGA AAATGCACAA ATAAAAAATC AAAAAGAGAA 6000 

AGAGCAGCAA GTCCTAAAGA AAACTTTAGA AGAAAAAAAC GGACAACTGA ATGATTTATC 6060 

GAGAAAAATT GAACAATTAT TGGATGAAAA AGAAGATCCT GTTAATGAGC CATTAACAAA 6120 

AGCAGTGAAT GCCTTATTTT CACTTkTGTT TGACTATGAT TCTAGCAATA AAGAACAAAC 6180 

AGTCGCTAAA AGAAAAGAAA AAGCAACTGT TTATGCGACA GAGCAAGGGC TATCCGGGGT 6240

ATTCCCAAGG GATkCAGATA AAGTTGTTCC TAGTGTTGTG ACCGTTTCTC AATTAGATAA 6300

GGCACCTGAA ATTTATCGAA CCACACAAAA rGATACAGGA AAAGAAAAAA CAGCATTGGT 6360 

CCGGCTAGAA TATTCAGTTC GAATCGCAGG AAGCCAArCG CAACGTGGAA ATTTTATTTA 6420 

CAAGGTAACT TTCAATGAAG AAAGCAAGAA ATTTACAACC ATTGAGAATT TAGGTGAGGA 6480 

AAATCAGTAA AAAAGTACGA ATTACAACrC TTATTAGTTT AATGCTTGTT TTAGGGGGGA 6540

TAAGCCTCTT TTTATTTATG GTAAAACAAA ATGAAAAGGC AATGCTCAAT CAGGAAAAGG 6600 

AGCGAGCGGC CGTAGTTTTA GCAGACAATG AAACACGTGT CTTATTTTTT TACAAAGATA 6660 

CGTGTGCTGA TTGTCAAAAG GTCTATCCGA yGGTTTACGA AGAGGACCAG GGCGAAAACA 6720 

ACATTGTATT TATCAATTTA AATCAACCkA AAAACCGTCA CTATATAAAG ACCTATACGT 6780

TGGAAAAAGT ACCAACGTTT GTTACACCGA AAAGAAGATA CGTAGGAACA GATACGAAAA 6840

AAATTCAGCA ATTAATTGCT GAAAATAACG ACGAAAGAAG TGAAGATTAA TGCACGAAAA 6900 

TCATGTGAAT GAAAAGGAGA CAGCCGTAGA AAATACGGAA CGTATTGCTA AAAATTATGC 6960 

GTATGAACGC CCAGCGATTC AAACGGCGCT ATTTATCTTA TGGCGGGTGC ATAATAAGCA 7020 

GTATCAAACm GGCGCTCGGA TATTTTACGA CGAATTAGAA AAGGCCACCA AGACGTCTAA 7080 

GACCGCTTAT AAAGAAGCAT TAGCTTTTTT AGAAGGTGCA GGCATGGTGG TAAACGAGGT 7140 

TGTCGTAGAA AGTAAAGTCC CCCAATCGTT AATTCAACGT TACGGGATTT TAAAAGATGA 7200






GTAACAAAAA GGCTGTATTC AATCCrCAAG CAGGGAAGCA AGCCyAGAAA GAAAAAGTGA 7260

GGAAAGAGAC ACGTGTTTCT TCATTGGTTC AAATAATCCG AGCAATCTTC AGGAAGAAAG 7320

AGAAAGAACC AGAAAATTCT CGAACCATTC GTTTTTCGCC ACGTTCGGCG AAAAGTATTT 7380

ACTCAATGGT GTTTTTCCTT TTATTTTTCT TGATAAGCTT ATTTTTATTA ATGAGTTTTG 7440

GACGATTAAA CACGCTTACA AGGCTTGCGA TGCAAAAGCA AGTAAATCGT GAGGAAATTA 7500

TTGAAAGCGT GAATAAAGGC GTAgCTGCAA GTGATCAATT GAAATATGAC GGAATGAAAC 7560

TTGTGGATCG ATTATTTACA GTGTCCTCAA AATTAGAAGG AAAAGAGTAC TGGGAAGCGC 7620

AAATAACCCC TTATTTAGCT GCTGGATTGC GTGCAGAAGA TTTAGGTCTT GATAAGACAA 7680

ATGATGACCG CGTTGCACGA AATGTACGCT TTATTCGATT AGAAACAGTT GATTATAAAG 7740

AATCACTTTA CAGCTTGTAC TATGATGTCC GTTTTACTGA AGGGAAAGAA TGGCGCCAAG 7800

TGCAAGTGAT TTTGCCTGTT AGTTATGCAG AAAATGAACT GAAGTTrCTT GATAGACCAA 7860

CGTTAATGAA TTTAGCGAAG ACAAGTGAGA GTAACAAAGT TGykTATGAC GAGCAACGTT 7920

TTATACCTAA AGGAAAAGAA GTAAATGAAG AAGAAACAAA AAAACTAACG GAATTTACCA 7980

ATCGTTTCTT TGAATTATAT GTAAAAAATG ATGAAAAGTT AGGACTAATT GCCAATGTAA 8040

AAGGACTAGA ACATGCGACG CTAGAAAAAG TAGATATTAC AAGTCTAAGA GAAACAGGAC 8100

AAGGGGTCTA TGATGTTCGA GGAACCTACC AATTCAGTTA TGAAGGGAAA AGTCCGCTAA 8160

CGTCTAACTT CTCATTACAA ATTGAAGCAA CAAAAGATAG TTATTTTATT AAAAAAATGA 8220

ATGGAGTGTA AGAATATGTT TGATTTATCT GGTTTAGCAA ATAATATTTC CTCACAGGTG 8280

ATTATAyTAT TGGTGATTTT AGGACTGGTA TCCTTAGTTG TAGCGGTTGC AACACAAGGG 8340

GCCGCACGAG GAATTGCGAC AGTCTGCTTA ATCTTAATTT TAATCGCGTT GGTTTTAGTG 8400

TTAAAAGATG CTGAAAAAAT TGGTACCTGG TTGAAAGACT TAATTTTCAA ACCAAATGCC 8460

GGCTTTATTT TCCCAATAAA AGGTTGGCGA TTTTGATGGA ATATAATTAC ACACGAGAAT 8520

TTAAACAACC TCACAAGATT TACAGTATCA AAGGGGTAGC ATTGCCTTTT GCGCCAAACG 8580

GCATTCGATT GGAACAAATT TTTGTAGGTA TTGGTATCCT TATTCTCTTA TTGATATTCG 8640

CGATTATTTC ATTTGTCGCT AAAATCAATT TTTTTACCAC AATTATTGCT AATTTTTGGT 8700

TGATTCTTAT AGTAGGAGTG GGCGTCCTTG TGTGGACGCT CTTTTCTTTA AAATGGGATA 8760

ATAAGAGCTT TCTTGATTAT CTAATTGGTC GAGGAAACTT CTTGTATCAA AAGAAAAAAA 8820

GATATGAACA TGGGCTACTT GTCCAATTCC ATAAAGAAAA AGTCCAATAC AAAGTTAGAA 8880

AATAAGAGTA AAGTGTTTAC AAAGAGATTG TGAGGGATGA AGGTGGATAA AAGAATTGAA 8940

GCTGTGACAA AATTTTTAGA GTCTTTAGGA ACTGTAGAAG ATTACACCGA GGATGTAGCT 9000

GTTAAGTATA GAAACTTGAT TCTGAAATCG TATGAATTAT ACGAGAATAA GTATAACGAT 9060

ACTGTTGACG ACAGTTTATG TATAGAAGTT TGGTCAAACG GGACTTATGT kGTTACAAAC 9120

GAGGATTTAT CTTTTGATTG TGAGAGCGAA GAAGATTTAC AAAAGCTAAA AGAATTGTTT 9180

GTTAACACGT CATTTtATAT AACCATTAAT GAATTAAATA AAGTAGGTCA TAAGGCTACT 9240

TTATCCGTCA AAGCTAAAGC AAAAAATTTA AGGAAACTGG GGCAGTTAAT TAAAGAATAC 9300

AGAAGTTGTA ATTGCAAATA TTTAAAGGAT AAAGTAACCG AAATAATTGG TGATGATGGG 9360

CGTGTTTATT TAGATAGGAT AAGTGAACGG ATGGACTAAG AAAAACATGC TATAATGGTT 9420

AGAAACCATA AAGAAGCCTT ATTTACAAAA GTAGTArCAA AAAATACAAO TACATGAGTA 9480

GGAGGTAAAT ATCTTGATTA ATAACGTTAC ATTAGTTGGA CGATTAACCA AAGACCCAGA 9540

TTTAAGGTAT ACGCAAAGTG GAACAGCCGT AGGTCAATTT ACGTTGGCCA TTAATCGCAA 9600

CTTTACCAAT GCTAACAATG AAAGGGAAGC AGATTTTATC AACTGTGTTA TTTGGCGGAA 9660

AGCTGCAGAG TCATTAGCAA ATTATGCAAC AAAAGGGACT CTGATCGGTT TAACTGGTCG 9720

CATTCAAACA AGAAACTATG AGAATCAACA AGGCCAGCGT ATTTATGTAA CTGAGGTTGT 9780

CACAGAAAGC TTCCAACTAT TAGAATCAAG AGAAGTAAAC GAGCAACGAA AAGAACAGGC 9840

TACAGGTAAA GCTACGTTTG ATAAACAGTC AATGGATAAA CCTGATCCTC TGGATCCATT 9900

TTCGCCAGAA AATAGCATAG TGGATATTTC TGATAATGAC CTGCCGTTTT AATGATTTAT 9960

AAGTGAATAA ACAACAAArG ACGAGTGAAC AGATCACTCG TCTTTTTTTT GTACAAGTGT 10020

AAAAAAATAG TGATTGCAAA GAAAGGAGTT GATAAGGTGC CATTAAAAAG TAAAGAGCCG 10080

CGCTTAGACT TTCCAATTGA AGCCATTACG AATAATTTTC TTTTTACAAC CTCTAACGAC 10140

GTGTGGGTAG GTTATAAACT TGCACACCAA GTATTTCCTT TGAATAATTT AGACTTTTTC 10200

AAGGAATACA TAGAGGACGG AAAAGGCGTA TTCGAGCAAG ACACGTTCGA CTACCATTTT 10260

ATGAATATTC CAGAATACTT TGATCTCGAC GAACAGATTG AAGCGACAAT CGATAACCTG 10320

GTAAAGGGAA GCTTTGCGGA TTTAGGAGAA ACCTATTTCC GACAAGCGGG TGATATTCTA 10380

AAAGACGAAG TCCAAATGAA TAAATATAGT ACGTATTTGT TCATTCGTTT TACCGCACCT 10440

ATACAAGTTG CTAATCCTAT GGAGTATATA GAATTGTTCA AGGAAGTTGG ACGTAAGGCA 10500

ATCAATGCTT TAACAGGGGT TCATGTACCA GTTAGCCAAC TTTATCGCGC ATATCAAGGA 10560

TTAGAAAACA AGATTTACAA AGACTTATCA AATTTTAAAA ATGTGGAACG CATGGATCCA 10620

AGAACCATAG GTCGATTGTT TTATTATTTC TTTCACCGTG CAAATACCCG TTTACCTGAA 10680

CGAACGTTAC TTCCGGAAGA AATGACGGAG GGGATTATTG AAAATAACAC GGGATATATG 10740

ACTATTGAAC AATTGGATAA AACACATTAC CTTGCTTTTT TACCGTTGAT CAGTTTGCCA 10800

ACGAGTATGT TTGGAAGTGC GATTATACAA AACTTGCAGG ATAGTTTGAG TACCACGATT 10860

GAAACCCATG TCAAGGTTAC ATTTCGCCAC CCAGATAAAG ATAAACGCGA TGTACACAAA 10920

CGTCGTAAAA GAATCTATGA ACAGGACAAA GAGCAAACAC AAGTTGACGG TATTTTAGAC 10980

GAAGACGAAG TGATTTTATT TGGGGAAGAG CGATTGCACG AACTAAGTAA AAAGATACGT 11040

AGTAAAGAAC GACGATTGTG TCGTATGACG ACAACGTTTG TACTTTCAGC CGATAGTAAA 11100

CAAGCATTAG AAGAGAAAAT AAAAGAACTC GAATTTGTTT TGGACGGTAC AGACTACAAG 11160

TTGTATCGTC CGCTAGTCGA CCAATTAACA TTGTTTAATC AGTGTTTAAT TGGCGCTAAG 11220

AGTCAATTTA AAAGCTATGA GCATGTTGTT TCAACAGGTT ATGTAGCGGA CCTTGGAATG 11280

GATTTAGAAA AAGAAATTGG TAACCGCTAT GGCATGCCTC TAGGTCGTGT CATAACTGCT 11340

AAAAAATTCA AGTCCGTACA ACAGGCACTT TCTTTATCAA GCAAAATCGT ATGGTTTTTC 11400

CCAAACTTGA CTAAAAAAGC CATAGAAGGC GCACAATATA CAAATGGGAA TACGCTAATC 11460

ATTGGCCCrC CTGGGCAAGG AAAGTCAGTA CTTGTTAAGT ACATTTTCTT GTGGTTAACG 11520

TTTTTAGGAC AAAAAATACT TTATATTGAT CCTAAAAATG AAACCGTCCT TTTTTTCAAA 11580

CGAGCATTAG ATAAGTTCGG ACATATTCCA GAATTTAAAG CTTTGTATCA ACGCATTAAC 11640

TTTGTTTCGC TATCAAGTGA AGAAAGrTAT CGAGGAATGT TAGATCCATT GCTATTTCTG 11700

CCAAGAGAAG AAGCTATTCA AACCGCACGA AACGTGTTAG AAAATTTTGG TGAAGTTACG 11760

ACAGACAGTC ACACTGCAAG TGAyAAGAAG ACGCTrATTT TAGATTCTGT CAATGCGGTG 11820

ATGAAAGGTA AAGGrAAAAA ACACCTAACG AAAGTAATCG AGGTTATTCG CGAAAAAGAC 11880

CCTAAATTAG CmAATTTAAT CTCTGGCCAC AATGTTGGGT TAGGyAAAAT TCTTTTAGGG 11940

AAGGATTACA GCGAACCTAT TCGCTTTGAA AATCAAATCA ATGTCTTAGG TACCCAGGGG 12000

CTTAAAATCC CTACACAAGC GGAAATAGAC AGCGGACGAT TAAACAATGA ACAAATCGCA 12060

GGCATGTCwA TTATGGAAGT TATTATGAAA ATGACAACGA TCTTCTCTAC AGATAAAACT 12120

GAAGATGCTG CGATTATTTT CGATGAAGCG AAAGGATTTG AAGACACGGC CCAAGGACGT 12180

TTTCTAATTG AAGGAAGTTT ACGyCAAGGA CGTGCGAACA TGACCGATAT ATACCTTGTC 12240

ACACAAGCGT TTATGGATTA TGATAAAGAA GATAAAAAAG AACTATTGTC TTATAAATTT 12300

GCGTTCCGGC CAAACCAAAA AGAAGCACAG AAAAAAGTTT TAGAATTTTT TGGTATGGAT 12360

ACGAACCCAG CAAATATTCA GTTAATTAAT GAACTAAAAT CAGGGACTTG TTTATTCCAA 12420

GACCATCGAG GGCGTAGCCA gCCTATTGCA ATTGATGTTT TATTCGATAG TTGGTTAATG 12480

GCCGTTTCGT CTACTAATAA AGAGGATGAA GCTACTCAAA TGGCGTTGGC AATGGAACAA 12540

GGATCATAAA AAAGGAGAAG TGAGCATGAA AAAAGTACTA CCTTTTATTG CCTTAGTCGG 12600

CTTGTTATTG TTGTCAGGTT GTGGAACAGA TATGAAAAAG ATATTGACTG CCGATGGTGG 12660

TAAATGGAAA GTGGAAGAAA CACGTGCAAC TTACACTTTT TTTGATGACG GTAAATTTTC 12720

AGCTAATGAC TCAGAGGATA GTGTTAGTGG GACATACACT TATGATGAAA AAAATAAAAA 12780


A AT71 APPTTT AAi AALL. ill bALin 1 1 AC 1 A GCAGrAACtC TTTCATTaTG 12840


TAmCAAGATT ACAGGGGAAA TTGGCGAAAA ACAAAGAACA CTTATAAAAC AAAAAACAGA 12900

ATAAAGAGGT GATACTTTGG ATAAAAAGTC ACTGaTAAAA GCAGCCTTAC TACTTTTAGT 12960

CGGCTGTTTT TTTGTGTTTA CACAAACTAC ATCGGTcAAT GCCGCACCTA AACCTGATGA 13020

AAATAAGGTT ATTGTTACGG AGCCAACTAC TAATGTAGAT ACAATATATG ATACTTACAA 13080

AGACAATGCT TTTGAGTTAA TGACTAAAGA AAAAGAAAAG GATTCTGCTT TTGGGGTTAA 13140

AGAAGCAATT ACTAACGCTT CTGCAGCAGT CAAAACATTT GTATGGGCTG GGGTAAAAGG 13200

ACTCGGAGAG TTTAATGCAG TTATGGTAAA AACACTATTT AGTATGGATA TTATAACTGC 13260

TATTAAGCAA CCGATTATGA ACCTAACGTC TAGCATAGCA ACTAATATGT TAGGAATTGC 13320

GGGAACGATT GGTATTGCCT TTGTCTTTGT TATTTTAGGT GTAAAATTTA TTGGACAACA 13380

ACGATACAAA CGGTTTTTTG GTATTTTTTT AATGACGATT CTTATTTTTA CGGGTCTTTC 13440

CGTCCTAAAA GATGCCAACA CGTCCAATTC TTTGTTTGAT ATGATGTTTA GTGTAGATAA 13500

AGAAGTTGAG ACCGCATTTG TAAATATTAA TCCGGTCTTA GGTGATGTTA GTGTACCAAT 13560

GACAGAGAAA GGGAAAGATA AGAACGGAAA TGAAGTAGAG CAAAAATTAT CTGCAGATCa 13620

ACGAGCAAAA AGTGCAGGAA ATTTAATTGC TTCACGTGTT TTTTACACCA ATGTGTATGA 13680

ACCTTACTTG CTCATGAACT ATGGAACAAG TGATGTTAAT AAAATCCGTA AGAAAACAGT 13740

rAAGTATAAA GAyAAAGAAT ATGATCGAAT CAACTTATTG TTAGACAACG ATATGAATAG 13800

CGAAGAAAAT AATAAGTTAA TGGAAGAAGT TGTAAATTAC GAAAGTAAAG ACTTAAAAAA 13860

CCGTTCAATT ATGTATTATA ACAACTGGAC CAATACTTTT TATGGCTTGT TTTATCTGGT 13920

AGTTAACTTC ATTCAAACAG TGGTGTATTT CCTGTTAAGT TTCTTACGAT TAATTATTGC 13980

kGTCATTCAA TTGTTCCTAT TGCCGTTGTT ACCGTTATTG TTGTTCGCAG GACTCTTTTT 14040

AACTGAAACs AACGTCTTTG CGAATTACTT TAAAACCTTT GGCATGACTA TTTTTATGAA 14100

GGGAATGGTT GGTTTTGCGA CTATTTTCTT TGCAAGTTTC TTATCACTAG GGTTTCAATT 14160

AAGCAATCAG ACAGAAAACG TGTGGCAGAA GATATTAACG ATACTTATTT ATCTTCTTAC 14220

ACCGCTTGGC TTGTATGTCT TCCGGAAATT CTTTGCTAAT CTTGTCACGG GACGTGTTTC 14280

ACTTTCTGAT GGTGTCGGGT TTATCGGAAA TCCTTTTGGC ACAGAAGCTA ATATGCGTAG 14340

AGCGGCAAAA GAACAAAArC AAGAGAATAA GGAGCsCAGG AAACAGGCAC AAGAAGAACG 14400

TAAAGCAGCT ATTAAAAAAC GTCAAGAAGA AGCAAGGAAA CATGGTAAAG CAGAAACTGG 14460

CTTAAAACAA CGGCCATTAA ATAAAGAACA AGAAAAACGT AGTGCrctAC GCCGTGArTT 14520

GAAGCCAAGr CCTCAACATA AAGCGCCAAA TGCTATCGAA AAAGCACAAG AAGGATTAAG 14580

ACAAGCGCAC GAAAAAGGAC GTATGCAAGA ACAGAAATCT GAAGAACAAC TCAAACGGCA 14640

GCAACATGAA TCAGCTCGyC AAAAAGAATA TGAAAAAGAT CGATTGAAAA AACAGGAATC 14700

ACTTAAAAAA ATTAATGATC AAAGCGACGA ATCAACCAAT TTAACAGCTT TAAGAAATCA 14760






GAAGTAAAAA ACGCGAAACA GGTCATCAAA CACTTATGAA 14820


ACGACAAGGA GAAAAAGTAC TTCAACCAAC AAATAAACCA AAAGATATGA AAAArCAAGT 14880

GGTTGCTCGT AGTCCAAGAC GAACAGGACA ACCAACGACT CGTCAAAATA TCCAACCAAA 14940

AGTAGCGGCC CAAATGGTCA AACGGAGTGG TGGAACGTTG GGAAGTAATC CTGCCTATCG 15000

ACGTCCAGAA GTACGwAAAA AAATGGATCA AGTGAAGCAA GTAACGCAAG TTGATGTACC 15060

TCAAACGCAT TCTACACCAC AACGTGTACA AyCCAAGGTA AATGAGCCAA TCACACGTAT 15120

GAATAGACAT AATACGCCAA TCATTAAGAA AGAAAAGAAA ACTAAACAAy CACCAGTTGT 15180

TCGCAGAGAA CGCAAGAAAC CTGGTACACA GAAGATAACA AACAAACCTA AAGGAGTGAG 15240

ACAACCTCGT GCAAGGAAAT AATTTGTTTA ACAATGGsTT GCTACAACCA AAGAAACCGC 15300

CAATAGGGCA AGAACAACAA AAAATGACTC GCTACCACCA ACTCCGACTA GTCTTTGGAT 15360

TAGTCGGAGT TCTTTTATAT CCTTTTGCGA TTTTAGGCAG TATTCCATTA tTAATTGGTC 15420

TAGCGGTTGA TAAAAAAGAC AAGGCTGCAA ATGTATTTGA TATGGATTAT GAGAGTTTTT 15480

TGAAACGAAA TAGTATTGTy TTTCTTAGTT TTTCAGCTGT GTTATTTGTA ATAAATGTCT 15540 

TTGCATTTAT CCTCTGGATT CCCAGAGGGT ATCTGTCTGC GTATCTTCTT TTCCCCTTAA 15600 

ATCTTCTTCA TACaGCACTG CGTTTTAATT GGGAGACAAT AGTCGCTTTA CTTATTGGGT 15660 

CAAGTGGAAT GGGAGCCATT TTTTTGGCAT TTTCTTCATT TGTGGCAAAA CGAAAAGTGA 15720

TTTCAAAGGA AGACGAACGG AAGAAAATAA CCGAATCAAA AGCCTACAAA GACCGAGAAA 15780 

AAAATAAATT TGAAGAGTCG CAACGCTTCA CAGATGAACA AGAAGAAGCG TATGAAGAAG 15840 

CAGTAGAAAC GGTTGATATT GACAAGTATA AAGAATTATC CAATCAATTA TTGCTTGGTA 15900 

CAAGTGAGTT CGGGTTGCCG TATATCATCA ACTTTTCAGA GTTCAATCAG CATGTTCTTG 15960 

TTCCCGCAAC GACAGGTTCA GGAAAGACAA CATTACTACA ATTAATCGTT CAACACGCCG 16020 

TAAAATTCAA CCTCCCTGTC ATTCTCATTG ATGGGAAAGG CGCACGCGAT ACGCTAGAAA 16080 

GTATGCGAGA GATTGCACGG TATTATGATA AAGAAGTACA TGCCTTCACA GACGATGGCG 16140

ACATGCGATA TAACCCAGTA GAACACGGAA ATGACGTGTC GATTCGAGAT AAGTTGGTGT 16200 

CTTTAGCAGA AACGGAAAGT GTTTTCTATT CCAGTGCCGC GAAACTATTG TTAGAAAATA 16260 

CCGTCCAACT GATTGATTTA TTTTCTACAC GTGACGACGT GAATCGAAAG TTAGAAGACA 16320 

TTCAAAAATA CTTATTACCT CGGAATGTTT TGCGTTTGTT TGCGGATAAA ATCGAAGGAA 16380 

AAAATCCAAC GCTCTATGAG ATAGAAGTAG AAGTCAAGCA AGCAAAACCA AAAAAGAAAT 16440 

CAAAGAAAAA TGAAACGAAA GTCCCGATTA TTGATGAAGA AATGGACGAA GTAAGTGaAG 16500 

CAACCACACC AACAAACGAC GAGGTAACGA GTGAAGATTT AGAACTAGAA GAAGGTGACA 16560 

TTCCTCCTGT AGAGACAGAA ATCATTGTCT TAAACCCAAA CACTTTACAG CTAGACGATT 16620 

TTTATTATTT GTTAAAACGA AATCTTTTTT ATTTGTCAAA AAAAGaAAAA ATAATGTTTG 16680 

AACGTTTGTT TACACGTTAC GAACACAAAG ACTCACCCTT TTATTTATAT GCCACTTCTG 16740 

AATCATTACA AACCAATATC AACATGTTAT TAGACAGTGA GTTAGGCCAT TTATTTGATA 16800 

CAACAAATGC AAAAAGTGTT CTTGATGTCC AACAAATTGT CCGTGACCGT TCACTTGTTT 16860 

ATGTGTCATT CAATGGATTA ATTTACAAAG AATATATCCG TACATTGGCG CAAATGTTAG 16920

TCGGTGATGT GAATTATTTT GCGTCCGAAA TGTATCGAAA AAATGTCAAA CGTGAAGTCA 16980 

TTGTGATCTT TGACGAACCT GCTTCTTATT TAAATGAACA GTTTATTGAT ATGGTAAACA 17040

AAGGACGAGG AGCGGGTGTT CATGGAATTT TCACACCGCA AACGATGGCA GATATTGCGA 17100

AGTTAGGTGA CAAATTAATG GAGCAATTAG TCGGAAATGT AAACACCGTA TTCATTGGAA 17160

AGACTAACGA AAAGGGCGAn CGGAgTATTG GAGTGAAaCA ATGGGAACGT ATCAAGATAT 17220 

AGATGTAACG TCCGTCACTG AACAAGAAGA TGGCTATTCA GACGTCGGAA AATCAGATTG 17280 

GTCAGGTGAT CGAGGCACAA AACGCAACGT CGATCGCTTC AAAGTGAACC CAAATGTAAT 17340 

AAAAAGTTTG CGTACAGGAG AGTTTATTAT TTACAGAACC GCAGAAAATG TGAATGTCCC 17400

GCCACAGAAA GTTTATGTGC GAAATGCACT AGAATGGTTG AGAACTCACA ATGGAATTTG 17460

AAAAATATGA ACACTTAAAT TATTGAAAAA AACGGCCTAA AGGTATGCCT AATAGTATGC 17520

CTAAAGGTAT GCTTAAAAGT ATACCTAAAG GTGTGCCTAA TTAAGAAAGG TGGAACCCGT 17580 

CCTCGGCGCT TGGTATCAAA GGCTTTCTCC GCCTGCAAAA TTCCTGACCA TTTTAGCGCA 17640

ACGTGGGGTG AATACCCCTC TTATAsGCTA AAATGGTCAG GAATTTTGCA ACTACGAAAA 17700 

AAATACCAAG CGCCGAGGAC AAACAGAATA GAGAAAAAAA TAATGAGTAA AGTAATAAAA 17760 

AATAAAGAAA AAAGTTTAAA AGATGAAGTT CAAAAGATCA ACCCCTACCC CAGTCTTGGG 17820

GTCTCATAAC TATCTCTTTC CGTTCTCTTT CAACCTCTTT TGAAAGAGAA AACTTACAAC 17880 

TCCCCCTTAC GCCTGCGCCC CCTCACCAAA TCTTGATGGG GCTGTCGCCA AGACCAGAAA 17940 

CCCAAAAGAG TGGTTCTGTT TGATAACTAG TGGACTATTG ATGATTGATT TAAAGTAACA 18000 

ACCCCCTAAA TCCCCCAACA AAACTTGGGG GACTTAAGAT CCAAAAAAGA ACTTCTCCAC 18060

AGCATTACGA AATTCTTTTT TATTGAtAya AATAgGCGTT GCCTATTTGT TCCTTAAAAA 18120 

ATTCCAGTAA ATAACTACAG AACTCAAATA GGAGGCTTCC ATGCATAAAC TAAAAGAAAT 18180 

TGAAATACTA GAAGAAGAAA TACAATCACG AATACTCGAC CATCGTTATG CGGACAGTTG 18240 

GGAACAAGCG ACTCAAATTT CAGCAGATAT TAAAAAAAAA GAAGCGGAAC TCGCTAGGCT 18300 

AAAAAAGAAG GTTAGCAAAT GAATCCATTA CTTTATAACT ACTTTATTGA TGAAACGTCT 18360 

TCTAAAAAAA ATATGATAAA AGTATTGAGT TTATTACATG AATTAAGAAT CATTTCTCGT 18420 

TCACAACTGT TATTGTTACT TAACATAAAT GCGAATGTTG CTGAAAGAAC GATGAATCAA 18480 

ATATTAAAAT ATCTTAAAGA CAACAACTTG ATTGACAAAA TAAAAAAAGG TAATCAAGCT 18540

TGTTATTTCC TGACAAAGGA AGGTCACCAG AGTATTGGTG GCTATTATAC TTTGCCAAAG 18600 

GTACCAGAAT ATAATTTGCA ACACCATTTA CAAATAAATG ATTATTTAAT CAAAATGCTG 18660 

GAACTGACAA AGGATAGAAA AAATTTGAAA TTTGTTCTGT CCGAACGAaG GCAAGTCTTT 18720 

GAAACCAAaG ACCTTGCCAA CAATCGAAAT CGAAAAAAaT ATTTTGTTGC TGATTTTATT 18780 

TtCCGTTTTA GAAGTAAAGA AAATAAAGAA GTCAATTGGT CATTTGAAAT TGAGTTGACG 18840 

ATGAAAACTA GGCGGCGATA TCGGGAGGGC ATTTTCCCAA AGTACATTGC AGAATTAAAA 18900 

CGTCTGCCGT ATGCTCGACT AATTTATGTG ACACCTAGTC CGTTAATCGA AGAAGAACTA 18960 

GAACGATTTA AAGGGTATTT TATACGAAAA GAAGGTGAAG ATAATGCAGA AGTATTTGAT 19020 

CGCCTGCATA TTTTTTCAGC CGAAGAATTT GAACAGGAAA TTAAACGATT ATTAGCAGAA 19080

GATCAATTTA TTAATTGGGA GTGAGTTTTA TGAGAAGAAT AGAGAAAGAA TTCAACAAAA 19140

AGCTAGCCGG TTACGAAAGA GAGTTAAAAA AATTAGGCTG TCTGGATGAT GAAACAGGAC 19200

TTATTCCCAT AAGCAAAAGA CGATGGCATG TCATTTGGTG GCGACCTGAT ACACCTGCAA 19260

AAACAATCGT GAGATCCTAT CGATTAACGC TAGATAATGA AAACTTATGT ATTTTAGGAG 19320

ATGTAGAAAT CACTATTTAT CATGATGGAA CATATGGTAT TTCTAAAGAA GGTGTCCCTA 19380

TTTTTATTAA TGACTTATTG TCTTTAAAAA AATTATTTAC CATTTTCTAT GGAACACCTT 19440

TTAACTTAAA TTTTGAAAAA ATTCGCTGTG TTAGTTTTAA CAGGTATTGT ATTACTATTC 19500

CTGAAATTTA TGTTGAAAAA TTCGAAGTAT TAATAAACTA CTCGATGATT TTAAATAGCT 19560

GCCTACATGA AATACAAAAG CACGTAGAAT ATGATTAAAA AGCTTTTCTT CTTCCAGAAG 19620

AACACCAATG AATTGGTGAA GGTCAAGGGC ATTCCGGTCA AAAATTCCTT TTCTGATGTC 19680

AGAACAGTGT CACGCAGCCT TTTTCGTTTT CAGGCCGTTC CTTTTGCCGT GTGTAACGTG 19740

CTGTTCACTG TTTGTTACAG TACCATAGCC AGTCGAAAAA ACAAGGGTAT ATAAACGAGC 19800

AAGCTCGCCC TTGTTTTTTC TTTATGTCTA CGGTCCTTTC TCGGCACAGC GAACGACACG 19860

AACACACTGG CGAAAATGAA CAACCTAATC GAAAACGAAA AAGGGGGGAG ACTACACCGT 19920

TTGACGAAAA GAAATTTTTG ACCGTCCAGC TTCACTTCCT CAAACGCCTA ACAAAAATGG 19980

TTGTTCAAAA ATCAAAACCG AAAGGAGCTG AAAACATGAA TACCCCACAT TTAACATTCA 20040

AACTGGAACA TGCCAGAAAA GAACACCAGA AATTAAGTGA AGCCATTATC ACAAATGACA 20100

CAGTGACGTT ACTACTTAAT TATGGTTGTT TGAAAAATGC GAATGACCGT TTATATCAAC 20160

TAGAATATTT TCTAAATCAT AAAGAATGGA AGGACTGAAA GTATATGAGA AACAGAATtG 20220

AAGAAcTAAA AGAACAAGCA AGAACAGAAT TAAACGAATG GGGATTGATT ATTGACGGTT 20280

GCTTTGAGGG CGACTTTGAA ACATGGATAG GCTGTTATGC ACGACCAAAA GACAAACCGA 20340

CAGCCCTTGA CCCAATCAAC GAAGAAGAAG CCAAAGAACA AGcAAAATAT GCGGTCAATG 20400

GTTTCCCTCA AGATTTTACA GAATGGTACG AGTGGGAAAT TAACAATGGA AAACTTAAAA 20460

ATTTACTATA AAAnAGaAcC TTTTGCTAAC GACAAAAgAT TTCTAAAaAG GTACTTACAC 20520

TTGAATTAAT TGTrACTCAA GTATAACTTA CCTAACGACC TTTGACAAGT GAAAATTTGC 20580

TCCCATCTTA AATTTTAAGA GGGGAAGCTT ACAAGAAATG AGGAAACGTT ATGGACTTTC 20640

AAAATATACA AACTGTAAAA GAATTAAGAT ACGCACTGAA ACAATACAAA CAAACACTCG 20700

CAACTACGAT TTTTGATGAT ACACAAGGCT ATATCAGTGT CGCCTTTAAA AAAGAAGTGA 20760

TAGAAAAAGA CGGAAAAAAC GTTCAGTTTT ATGTTTTTAT CCTAATTATG AGGTCGCTCT 20820

TTATCCAGTC GAAACGTTAC TGGCTTACCT TGAAAAGTTA 20860

(2) INFORMATION FOR SEQ ID NO: 356:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3182 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 



TTTAACTcTT TTCCAGaATA AAGAACCTGA GTCTCTCCTA ATTCTATTCC CTCAAATTCT 60

TCATAAGGAT CGCTATATGC CCAACCGAAA AAATCCAAAA ACGGCGTTCC GTTCTTGACG 120

GGAGAGGTAT CATGCTTAAA ATGTTCAATA TTATGCGTTG CCCaATCGCT ATTACCtAAA 180

GCTAAAAGAG TTAGACAGTC TTGAACAGTA TCAAAATCTA TCTCCCCTGT TTCAAAAGCG 240

TATTTAATAT TCATATAAGG GTTCAAATAA TAAGCACGGT CCAGTAATTT TTCTCCTACG 300

TTTTCGACAT AAGCCTCTAC CATTTTAATT GCTACTTCCT TGGTTAACTC CTTACGAACA 360

GCCTTTTCTT CTTTGTAGGT TTGTTGTTCT TCTGATACAT CTAAATACTC ATCTACCTTC 420

AAAGGATTCG CTTGTTGACG AACAATAAGT GTTTCTGATG AAAATTCATT TAATATCGGT 480

AGCCCTGCTA TTTGTGAATA TGTTTTACTA GCTAAATCAG CTTTCATACC AATTTTATTT 540

ACGATATTTT GCCAAAGGCA ATCGTTTTCC TCTTTTGTAT AGCTTCTGTC TGTATCTATA 600

ACTAACCGAT ACCTTAGACC AAAATCTGAA ACATTATTTT TTATCGTAGG ATAGAGAATA 660

TAACTAATTC CTTTTAGGGA TTCAGCAATC TTTTTTATAA AATTTTTTAA ATCTCTATCC 720

TCTAGCTCGT CATAATCGAT AGCGATTAAT TCTTTTCTTA ATAGATTATC GTTTGAGCGT 780

TTACCTATCA GTTCACCAGC GAGAAAATAA CTTGCTTTTC GCTTAATCGG TTCTATTGAT 840

TCTTTGGTAG CGTGTTCTCC TAGAGTTACT TTCTCTGGTA GGTACTCCAC AAGATATTGA 900

AAAGGGGAAA CGCCCTCTAC TATCTTCATT TTTTTACTAA ATCCCTTACT TACATAGATA 960

CTCaATTTTw AATCACATCC TCTATACACA TTCATTAAAC ATTTCAGCAA GTGCATTTAC 1020

TTTTTTTGAT ACTAAATATT GCAAGGTTAC CATTTGATCA ATGCATCTCG ATGTTTCATA 1080

AGCCaATATT TTATCAGAAT TTTTCTGGCC ATTTTCATAG TCTATTnTCn CnCTCGGAAA 1140

AGATGCGAAA ACCTCCkGTA AAAAGAACAG TAAAtGTTTA ATATCTCTTA ATTCGTCGGT 1200

TATTGTATCT TTATCAAATT CTGGATACAT TACTCGCACT CTCCTTTATT AAATTTATAA 1260

AAATGTTCAT CaAAATTCAT GAATAACACG GTGAAAGCAC CGATAAACAG TGCTAGTTTT 1320

AAGATTGTAG GTAGAACAGA TGCACCGCCG CCTAAAAGAA ATGCGACTAA TAGTAAATTA 1380

AtTTTCGTTT TCATTTTGAT TTTCCTCTCA TTCTTTGGTA AAATGAAAGA CAAAAGACGG 1440

CTATATTTAG CCTTTTTGGT GCCTTTAACA CGTTCACTTT GGTCGGTTAG TGTGTTAAGG 1500

CTTTTTCTTT TGCTCTCATT CATCATAATT GCTCCTTAAT TTGTCGCTAC TGGCATTTTA 1560

AGCGTTGTTT GATATAAAAC ACCTTGACCA TTCAAAATGC TCTTAATCGT GTCATAATCC 1620

ATTTTAAGAG TAAGCAATGT AGCTACTTGT TGCTCTCGTT TGTTCACTGC TTCGATCTCT 1680

TCTGACGTTA AAAAATCTAA TATCGTTGAT TTTTTACTTA CCTCTCGAGC TTCTCTTAAT 1740

TGCTTAGCAT TAAAACCTAA TGCTGTCTTA TAAACAAGGT CTGTATAATG CTTGTAGTAA 1800

TGCGGGGAAA GCCCTAACTC CTTGATAACA TCCGTCATTG ACTTACGCAC ACTTTTTTCC 1860

TTTTGTCGTT CCATACGTCT AGCGTAGAGT CCGTTTTTCA JGTCATAGAA CTGTCGCACC 1920

AGCGCTTTCT TAAATCGTCG AACTGGTTCG GTATTATCTA GATAAGTGAT AAGCAAGGTC 1980

GCTTGTTCCT CGTTTAATTG ATAGAATTTA GCTTTTTGCC CACTCTCCAT TGCTCGGATT 2040

TCAAATCCGA CCTTGCCAAA CTCTTCTAGG TCTTTTTGAT ACTTTTTTAT TAGTTTAGAA 2100 

ACACTTTCTC TTTTAATTTG TGAATACTTT GCAATAACCT CATCGGTCGT ATAAGGTTCT 2160 

TCCTCGATAT ACTGTGAATG TAAAAATACT AATTCGTTCA TCTATCACAC CTCCATTTCT 2220 

TTTAAGCTTG GGAACTGACT TTCTAACCAT GGAATCAACA CAGACAAATT ATGTCGTGCT 2280 

TGTTCTTCAG TAAGATTAGC TATCCTATTT TGCATAATCA TCAAATCGGC ATAAAGCTTG 2340

GCGTTTCGTT TTTCATGTTC TCTACACTCT TTTAAAATCG TTTCTGCTTC TTCTACGATC 2400

TCACTAATGG AAAAATTCTT GTACTTTTTT GGTACCCATA TTAATTTGAG CATTGTTGCC 2460

TCCTCAAGTT AAATCTGTAG TAACGGACGA TTGACTATGT GCTATCTCGG aTTCTCTATT 2520

GGCGGGTGGA TwTTTATTAT TCATTTTCGA GAAGCCAAGA TCTTAgTTGA TGCTTACTCC 2580

AGCAGGTTTT ACGATCAGAT AAACGCCTTC CtGTkGGTAG TCCtTTCgCT ATCaATTCaT 2640

CtAAAtAAgC TTCgGAACAG TTGTCtAAgA AACGTCTTAC TCCaACGCGT GTATAgATGA 2700 

GGTCGCTGTC TACTTCACTG TCTCGTCTAG CTCGCTCAAT CGCTTCTAGC GTGGTTTCAT 2760 

ACACTTGTTT ACTGATTGCA TTCAATTGCT TTTTATCAAG CGTATGGCAC ACCTCAACAA 2820 

CTACAGGCGG TGTTTGCAGT TGCTCCATTG TCATGCTGTC ATACCTCCTT ATAAATTTTT 2880 

TCATTTACCA GCCAGTCAAC TAATTTTTTG TAGACTGTTT TACGCACAGA CAAGATTTTT 2940

TCATTTTCTA TCTGTCCTAG CGTTTTAGAT GAAATACCAA TTTCATGAGA TGCCTTTTCT 3000 

AAAGTaATAT TTTTTCTCGC TCGGCGCTCA CGTAGCTTGG cTATCATCTC TAAATCTAGC 3060 

TCAAACATAT TCtCACCTCT tCCGATTTCC CGtTTTTTGG GaTTkGATAC CyTGaTTawA 3120 

ATTCCCaAAA AAwGGGaAGT CAAGTTATTT CTCATTTTTG GGAATTAAAT TTGATTTCTG 3180 

CC 3182 
(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 

AnATAnnATA AATGTAATAA TTAACCACAT TATTCCgGTr AATTGAACGG CTAAATAAAT 60 

ATTATAAmmm mCAAATCCTA AAGTGAAGAG AAAAAGTAAA AmATAACAGT TACTTCTTTT 120 

AAAATGTGCT TTCCATGATT CAAATGCTCG TTTAACGGTG ATTGCTTGAT AATTCATGCC 180 

CTCTTCCAAA ATGAAATCAG TCATCATTTG GAGTGCTGGA CCAACACCAA AAATAATCCC 240

TCCAGCTAAA CTAAATAAAA CAAACAGTAA ATTTAGTTTG ATTACTGTCC AGGAAAGGTA 300

AAACAACCGT TGAATCCCTG TACTTTCCAT CTTCCTCTTC CTTTCTATTA TGTAAAAGGC 360

TAAGCCAAAA GCGCTCAGCT AATGCCTCAG CCTCCTGTGA TTATTTTTGA GATTGGATAT 420

ATTCGTCTAA TTGTGTTTGC ATTTCTTTTT GAACTTTATC CCaGCCAGCT GtTTTTAGGT 480

CATcCATTAA TTTTGGGAAG TGTTTCTT 508 
(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 



CAGTCACGAA TTGATTTAAA TCCACATTTG TGGAATTAAC AAATTCCACG AL LA 1 L AtjU A 60

TAAGTCGTTG GGTAGTTGGC CAGACGACTT TCAGAcCAAT GATcCTTGAy CAaTCGCAGG 120

nCCwTAGaCA GAAATTGGtT TAATCGTAGA cCaACTTGAC GATCGGCATT AAAGGCATGG 180

TTGTTTTGAT TAATATCGAA GTTtCTGCCG CCGATmAmAC CAAGGATACG ACCAGTAGCG 240

TTATCCATTA AAATATTGCC AGTTTCAATC ATTGTATTCC CATTGACATC TGCAGTGCCA 300

TCATCAAGCA AATAACCATA ATTTGCGACA GCGGTTTGCa TCGTGTTGTA AATATTCTGa 360

TCAATGGTTG ATTGGATTGT ATAGCCTTGC GACTGGATTT CGCGACGTGC TTGTTCTTCA 420

TATTGATCTA AACCAACTTG ATCTAAATCA TCACGATTAA CTTTAGCTTT TTTCATATCT 480

AAATCCATGA CAATCTCAAC TGCTTTATCT AACACCGTAT AGTATAGATA GCCTTCTGTA 540

TTGACATTGG CCTGTTCAGT TGGTAAAAAG TCTTTTTTCA AGTCGTAAGC TTTGGCTTCT 600

TCATATTCCT TTTGCGAAAT AGCTTTTTCT CGGTACATAC TGAAAAGAAC AAAATCTTTT 660

CTCTTCATTC CTAAAGACAA ATCGTCTTTC AAGGCACCAG TGTTAGTATA GGGCGTGTAA 720

ACAATGGGAC TTTGGGGAAG TCCTGCGATA AAGGCCGCTT GAGGAAGATT CAAGTCCTTG 780

GCACTTTTAC CAAAGAGTCC TTTGGCTGCT TCTTCCACGC CyGCAATATT TTCACCTTTG 840

TTATTACGAC CAAAAGGGGA GACATTTAGA TAAGTTGTCA CGATTTCGTC CTTAGAAAAG 900

TA 902



(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 
TTTTTGTTGT TCTTTAATTT TTATTCGTTG GTTGCTTGCC ATCTGTCCAC CAAAGCTTAA 60

TTAAGGCCTG TGTGCCATCA TTTTCAAACC CTTTATCTGC AAGgCTTTCA TATAATTCTG 120

TCGCTTTTTG CGTCGCTGGC AATGGAAGGT CAAGTTTTTT TGCTTCATCT AACGCAATCT 180

TTAAATCTTT GATAAAATGT TTAACAAAAA AGCCTGGCGT ATAATCTTCT TTTAAAATAC 240

GCGGACCATA ATTACTTAAT GACCAGTTTG CAGCACTACC ACCACCGACT GTTTCCAGAA 300 

CTTTTTCTAA AGTTAATCCT GTCGCATTTG CATAAACGAG CATTTCAGTC AAACCGGTCA 360 

TTGTGCCTGC AATCATTAAC TGgATTCGCC ATCTTAGTAT GTTGCCCTTT TCCAGCTGaG 420

CCGTGtAGCA TAAACGTTTT ACCAAATGkT TTGGAAAATk GGtAAGACGG nGkCaTAACT 480 

TTCyTGgATC GGCCACCAAc CATAATTGGT TAAAGTACCA TTTTTAGCAC CTAAGTCACC 540

ACCAGaAACT GGCGCATCCA ACGCATGAGC ACCCACTTCT GCTGCTTTTT TGGCAATCTT 600

TTCAGCCAAG GTCGGCGTAC TTGTGGTTAA ATCTACTACT ATTTTACCTG TAAGATCTGC 660

TTGAAAAATC CCTGTTTCAT TAAAATACAC GCCTTCAACA TCTGAAGGAA AGCCAACCAT 720 

TGTAAAAATG ATATCACTCG CTTCCGCAAT TGCTTTTGGC GTATCATACC AAACTGCTCC 780 

TTCAGCAACT AAATCATCTG TTTTACTTTT TGTTCGATTG TAAACATTGA CAGATAAATT 840

GTTTTTCATC ATGTTACGAA TAATTGATTT TCCCATGACA CCCGTTCCAA TAAAGCCGAT 900 

TTTAGACATA GTAGCTTCCT CCGTTCTTAA AATAAAAAAC TTTCCTCCTT CAGTATAAAC 960 

GAAGGAGGAA AGTTTTGCAG TTTATTTTCT TAAAGATGTA AATTTTACTT TAATTTTTTT 1020 

TATTGAGTCC GCATTTTCTT TTTCCATTTG AAGTTGATCA AAGTAAATCA ATGTATCTTG 1080 

ATAAAATTCA ATCATTTTGT TGCCAAAGTT TGTAGAAGAA ATTTCAAATA ATTTTTCATC 1140

TAAAATAGTT TGGTCCATTT TAATATTTGC TTGTATATAG TCAATCAATG TTGTGGCAAA 1200 

ATCACTGTCT GTCTTAAAAG TTTTGCCTAA GCTTTCATGA TCAAATAAAT TATTTAAATA 1260 

GGCGTTCCCT TCTGCAACAC ATTGAACGCC AGCGGCCATC GCTTCTGTAT AAGTTAAGCC 1320 

TTGTGTTTCC GACGTTGAGG CGCTGACAAA ATAGTCAGCA GCTTTATAAT AAATGGCGAC 1380 

TTCTTCATTT GGCACTTCAC CCGTAAATTG AACGTATTCA CTGACTTCTA GTTCTTCTGC 1440

TAATTCTTTT AAATCCTCTA GATAGGGACC ATTTCCAACT ATTACTAAAC GAGTTTGTGG 1500 

TAATTTTTCA ATAACTTGTG GTAACCCTTG AATAATTGCT TGAATATTTT TTTCATAAGA 1560 

AATTCGACTT AACGATAACA ACATAATCTG TTGCTCTTCA ATACCTAGTT GTTGACGCAT 1620 

TCCAGCGATC ATTTCCTCAG TAATATCGGG ACGTAAAAAC TTATCAATTT CGATCCCTGT 1680 

TGGAATAATT CTCATTGGTG CGGTTACTCC ATAATCCCGT AATTTTTCAA TTACACGTTC 1740

ACTGGGACAA ACAACACCTG TCGTATGATT TGTAAACACT CTTGAAAAGA ATTTAACATG 1800 

TGATGGACGG ACAACTTTTC CTTTAGCAAT ATAATGTAAG TAATCTTCAT ACATCGTATG 1860 

ATACGTATGG ATGACAGGAA TTTTCATTTT TTTCCCAACC ATTTTCCCTA AAATACCCGC 1920 

ACCAAATTCA GTATGTGTAT GAATTAAATC TAATTCCAAT TCTTTGGCAA TTAAATAGGC 1980 

GTACCACATG CCGCGAACCA CTACTCGGCG ATCTTTAAAA GATACGAATG GCACACTAGG 2040

CATTCGTATG ACATCTTCTT CAAAATCAGT CGCGTTAGGA TCGGTTGTCG TAAAAATATA 2100

CACTTCATGG CCGTGCTTTT CTAATTCATC TTTTAAGGTC TTAATAGAAG TCGCTACCCC 2160

ACTCACTTGC GGGAAATACG TATCAGTAAA AAAACCAATT TTCACTCTCA TTACCTCCAT 2220

TTCATCTAAC AATAACTGCC TTCAATATTT GTTAAGATAA GACTGTTTCG TAGACTTCTT 2280

TTAACTCGTA ACCAATTTGC TTGATACTTC TTTGCTCTGC TACTTGATAG CCTGCTTCTC 2340

TAGTGCTAGG TATTTTCCCT TCTAATAAGC CTTCAATATA TTTTTTAAAT TCTTCAATAG 2400

AATGACCCAT ATAACAATTT TCATTAGCTA CTAGCCAGCC TTGATAAACT GGAATATCAC 2460

GCACTAAAAC TTGTTGCTGA CTAGCTAATG CTTCTAAGAC AACAATTCCT TCCGTTTCTT 2520

CACGCGAAGG GAAAAAGAAG AGATTCGCTG CCGCATAGGC CCCTTCGATG ACATCTCCTT 2580

TAATATAACC TGGGAAAATA ACATTTTCTG GATGATCTTC TTTAACTAAT TGTCGAATAT 2640

TTTTTGGGAT AGAATACATT GGTGTATCAC CAAACCAAAT AAATTGATAC TCTGGTAACT 2700

GTCGCGCAAC TTCAATAAAA TCAGTGATTC CTTTACGTTC AAAAAATAAT CCCACACAAA 2760

TAATTACTTT TTTCTCTTCA TCTATTTTAA AATATTCTCG GAATTTTTGT TCCTTTTCTT 2820

CTGAAGGATA AAAGCGTGAT AAATCAATAC CATTTGAGAT CGCCGAAATA GGCACTTTAA 2880

TGCCATATCC TTCTAGCAAT GTTTTAGAAT ACGGTGTTGG TGTAATTAAA TGATCCGCTT 2940

TGGAATACAA ACTGATAAGA TATTTTTTTA CTAATGGTGC CAGTTGATTA GACCCAATAA 3000

AGGAATTACG AAAATCTTCT TCTGTCGAAT GGGCATGATA AATAACTTTT TTTCCTAGTT 3060

TGCGTGCTTT TCTGACCATA CGATGGCTAT TTACGCCATA TGTATTAATA TGCAATATAT 3120

CATAGTCACT GCAATCAGCA TCTAATGTAT ATTCTATTCC AACTTCAGAG AGCGCTCGTT 3180

TCTGGTGATC TAATGCTCGA CCAATACCTG ATTTGGCTAA GATTTTCTCA CCTTCAAAAT 3240

ATAACAATAT TTTCAAACCC GCTCCTCCTT TTCTGCTTTA CTAACTATAA TAAACACATT 3300

ATACCATAGC TAAAATTTGA TGTATTCATT CTTCGGTAGG ATCCTTTTAA TCTTATAAAT 3360

GTTTTTAAAG TCCCTTCTTC AAGCTTATCA AAAAATAGTG AATAAAACAA AAAAAGCAAA 3420

AACCAAGTTG GTTTTTGCTT TTGCTCAACT CCGGCAGTAG GACTCGAACC TACGACATCA 3480

TGATTAACAG TCATGCGCTA CTACCAACTG AGCTATGCCG GAATAATCGC GTGGCGACGT 3540

CCTACTCTCA CAAAGGGAAA CCCTTCACTA CAATCGGCGC TAAGAAGCTT AACTTCTGTG 3600

TTCGGCATGG GAACAGGTGT ATCCTTCTCG CTATCGCCAC CACACTGGGT GTTGTTTCTT 3660

ATTGAGTTGA ATCTTCATTC ACTCAAAACT GGATTGAAGT TTGAATCAAA ATAACCAAGT 3720

TGCTTTTACT TATCCATTCT TTGGTTAAGT CCTCGACCGA TTAGTATTGG TCCGCTCCAA 3780

CTATCACTAG CCTTCCACTT CCAACCTATC TACCTGaTCA TCTCTCAGGG GTCTTACTTT 3840

CTTAAAaGAA ATrGGAAATC TCATCTTGAr GTGGGCTTCA CACTTAGATG CTTTCAgCGT 3900

TTAwCCCTTC CCTACAAAGC TACCCAGCAA TGCCCTTGGC AGAACAACTG GGTACACCAG 3960

CGGnAAGTCC ATCCCnGGTC CTCTCGTAC 3989

(2) INFORMATION FOR SEQ ID NO: 360:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3287 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 



CCTnCCTAAT ATATCTTCnA TCACtGTAAC TTTACTkGTA AtATCAGATA TTGaAAATTT 60

ATTTTCyTGt AAAGTTTTCA TTTTATTAGA TAGAGAAATC ATTTTATTTA ATATATTTTC 120

CATCTCATCA AAAATAGCAG TGTCAACTTT ACCTTTCATT AAGCTTCCTA ATTTCTCAAA 180

AATATTATCT TTTCCACCAA TAACTTCAAC TGCTTTATTG ATGTCATATA TATTATTTAC 240

AAGTTTTGCA CTATTTATAT CTTCTGAAGC ACTAGCTAGA CGTTTCAATC CATTTGACAC 300

TTCTATCAAA CCATCGATAA CTTGTACATA TGTCGATGCA GTAAAAGTAC CTATTATTCC 360

TTTGAATAAA GATGAAATTT TTCCAATACT GGATTTCGCC ATATAGCTTA AAcTTCTTCT 420

AAACTCTCAA GCTTATTTTT AACACTTGTA ATATCTTCAG GAACCTTTTT ATTCAATTGG 480

TTAATTGATT CAGAAACTAG CATTAATTCT CCTGCTAGTA GTGCTACAGC TGCCAAACCA 540

GCGATTGCTG CAGGGAATGT AGCAATACTT AAACCACCTA TTACTCCAAC TAAAAGACCC 600

ATTGCACCAA TTGCTATACC GATATTAGCA ACCTTTTTAG CTATTGTTGC AATATCTGAA 660

GGTACTTTTT CATTTAATTG TGATAGTGCT TCGCTTGCAA TCATTAATTC TAGTGATAGG 720

AGTGCTACAC TAGCTAATCC TGATATAGCC ATCATTGGAT TAGCAGATGA AAAAGCGCCT 780

GCAATAACAA CTAATCCTGC CATAGCCCCG ATTGCAATTG ATAAACTCCC TAGTTTCTTA 840

GCGACAATTC CAATATTGTT TGGTACTTTC TCATTGAGTT GGTTCATCGC TTCACTTGCA 900

ATCATTAAAT CTACTGAGaT TAATGCAATA CTTGCAATCC CTTTTAAATT ATCAGTAAAA 960

TTTAATTTAC TAGCTATAAA AGCCAGACCC CCAATTGCAC TGATAGCCAG ACTCATGCTC 1020

GCTGTTTTTT tAGCTAGCCC TGTAAAATCA TTyGGTATTT TTTCaTCTAG TTGTTTCATT 1080

GCTTCTGCGC CTTCTTCAAG CACTTTTATt GCTCCGAATA ATAACGCAAG GTTACCAGCA 1140

TTTTTTAAAA AACCAGTCCC AATAGATTTT AAACTCTCTA ATGGTTTTAC GATTTCTGAT 1200

GCTCCTCCAC CTTTACCGAA TTTAGGCAAT TTTATTTTAC CGAATACTCC AAAAACTTTG 1260

GATAATATAG AAAGCCCTTT GGCACTAATA GCTATAAACT TTAAAGCTAC CCCAAATTCA 1320

AATAATCGTG GAATAAATGC ACCTAGGCCT TTCAAAGTAT CTCCCTTACC TAAAAATTCA 1380

AATATAGGTT TCAATAGATT GAATAATCCA ACAGCATCAT CTTTTACACC CTTTATTCCT 1440

TTACCTAGTC CCTCAAAAAA TGAAGCATAA TCAATAGTGC TTAAAAATTT CACTACTTkG 1500

TCAAAAAAAC GAAATATATC ATCTGAATGT TCTTCTGCAA AAGCTCCTAT TTTTTTTAAA 1560

TTATCTTCGA AAGCAGCACC AATTTTTCCA AATACACCTG CAATTCCTCC CAGGCCTTTA 1620

TCCTTCGCCA TCTTATCAAA TGTATCGATA ATCGAGGTCA CTCCACGTGT TATAGCTGAA 1680

TTCATAATAG ACATACCAGT TCGAACGCCA CCAACAGAAT TTCTAGCTTG TTCTTCAAAT 1740

GATTTGAATC CTTCTGCCCC GTTTTTATTC ATCTCCACAA TTCTATCCAT AAATTGATCC 1800

ATAGAGATAT TTCCATTTCT TAAAGCTTCT CCCAATTGAT CAGATGTTAA ACCAAAACTC 1860

TTCCCGATTT GATCCACTTG GGCAGGCATT GCTGTTAGTA GCGAGCGCCA TTCTATCATG 1920

TCTGGTTTAC CTTTAGAATA AGATTGAGAT ATTTGCTCAA GTGCGCTTGA TTGTATTTCT 1980

CGTGACATGC CACCCGCTAA AATTGCATTA TTTAATGCCA AGAATATATC CGCAGATTTC 2040

TGTACATCTT TATTATTTGC TGTAAACCTT TGTACTGAAG CAACAGCATC ATCTAAAGTT 2100

GTAGGCAAGC CATTTAGCCC TTTTACAAGT TTGCTTTTGG CGATATCGGC TTGGTCGGCT 2160

GAAATATTCA TATTTGACAT TACTTTTTCA AAGTTATTTA AAGTATCTAA TCTTGAAATA 2220

GAACCATCGA TAGAATTTGT AATCATACTA GTtGCCTTAT CAACAGCCTT TACAGCTAGT 2280

CCTACACCAA GCATTTGATT TAGTTTGGAT GTTAAAGGAT TGAATGCTCC TAATATTTTA 2340

TTGCTTGCAT TGTTAGCTAG CCCGCTCAAC TTATCTAGAT TTGATGAAAC ATTTATCTTG 2400

TAGGTCCGTT CACTCATACT TCGAAGACTA TTTTGTAATT TTCGACTCTC TTCTCCAGTT 2460

TCTCCTAGAC CACGACTATC AATTTGTAAC ATAGCCTTGC GATTATTCAA TCGACTAATC 2520

TCATTGTCTA GTTTACGAAG TTGTGCTTCT GAACCTTGCA ACTGAGTTGT TACTATTTGT 2580

AAATTTGCTT TCCTAGATTG TAGTTGTTTA ATTTGTkGAT TTAAGGTkGA TATTTTTtCT 2640

TTkGCAtCTT TAGTATTTAC ATCTGCCTTA ATGGTTGCTT TCTGTGCTTG CAATTTTTTT 2700

ATATCAACAT TAATATTTTT TATTTCTTCT TTTGCTTTTT TTAGTTTCGT AGTATTCCAC 2760

TCGATAGCAG GGCGTTGACC TTTTAGTTTA GCTATTCGTC GATCTAAACT ATCTAGTTCT 2820

TTTTCTATAT CTTTAATATT GTCCTTAATA CGAATGGAGA TATTTTCAAC GGCCAATTAT 2880

TCCACCTCCT CCAGCTCATC TTGCGATATA AATTGAACAA CAAATGGTTG TGGTTGCTTA 2940

GGTTTTGGTG CATTTTCTTG TGACGACTTC CAATCCATAA AGTTTTGATA ACTTTGCTCA 3000

TTAGCATAAT GGCCATATGC TACAATCAAT TCAGAAACGC CCCAACCACT CAATACATGG 3060

CTGGGACGTT GTCCTAAGAT TTTTGCTACA AAATGAGCCA TATACGAATA ATGATtAAAC 3120

TTAGCAAAAT ACGTTyCTgC GCTgTTGTAA TGAAACACCA TtGTTtAAAC TGtCACGCAC 3180

GATTATCCGA AAAAAGCATC TGCtTCTTTG AACACtTCTG GGtGATTTcT GTtAATGCAG 3240

TAAAnGTTCA ATCACTGAAA TGGAAGCaGk ATCGATTACT CTCAnCA 3287


(2) INFORMATION FOR SEQ ID NO: 361: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361:

CAAAATTAGG CCTTATCGGA CCATGTTTTT CTACATTCAT TGGTTGAAAT AGCATTAATT 60

TTAGTCGTGA kCGAATTGTC TTATCGTTTT ATCGAACATC CACTGACACG ATTGTCTTAT 120

AAGGATGTCT GGACTCAATT CACTGAGTTC TTACGGAAAC CGTGGGACCT TCGTGAAAAA 180

GGGACAATGG CCTTTATGAC AGTGATTAGC GTGATTGCAG TATTCGGTTT AATTGTGGCT 240

CCAGCAAACG CTAAAAGTGC GCAACAAGAG CAACTAGAAA AAAATATTGC GAAAAATCAG 300

CAAAAAATTG AAGAACGTAA AAAAGAAGTC CAAGAAAGTG GCACGAAAAA TAGTAGCACG 360

TCCACAGAAA GTAGTACAAA ACCGAGTGAA AGCCAAGACG TTCTACCCAC ACAACTCACA 420

CCAGAGCAAG TAAAAAAAGC GCAGAATTTA GAAATCACAA CTATTGGTGA TTCTGTTATT 480

CTAGATGGTG CTAGCGGGTT ACAAGATATT TTCCCTAAAA TGATTATCGA TGGGGAAGTG 540

GGCCGTCAAT TGTATAGTAG CATTTCACTA ATTGGTGAAC TAGACAAAAA GAAAATGCTT 600

AAAAATACTG TTTTAGTGAG TTTAGGTACG AATGGTCCTT TTACAGAGGC ACAATTTGAT 660

GAATTTATGA AAGCGTTaGG TAATCGAAAA GTTTATTGGA TTAATGTTCG CGTCCCAACT 720

AGAAGATGGC AAAATCAAGT GAATAGTTTA CTTAGTCAAA TGGACAAAAA ATACGATAAC 780




X X \Ji\\* 1 \J\D X i 


TAATTATAGT 


AACGCCCATG 


ATGATTGGTT 


TTATGATGAC 


840 


CGAGTTCATC CAAATGTGGC AGGTGGCGAG CAATACACAC ACTTTATCGC GGAGAAAATT 900

TTACAGTAGC AAGAAACTTc CAGCTCAGAT GAAAGGGGCT GGgAAGTTTT TTGTTATAGG 960

GAAAAGCAAA TATAGGTAAT TTAATGATTT TCTTGGAAAA CCTAAGCTTT TTTTTTTAGA 1020

TATCnGGGTA TACnGGTTTT TTGGTTACCA GnTCCCCGnA AGGGATTTAG CCAGGTAATT 1080

GGCCGGTACC TTACCTAAAA GATG 1104



(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 

GATTnCGTGA AGCGGTTGAC ATATGATTGC ATCTTCATTT ATAATAGATT CATGAAAGGT 60 

TATTCATGTA AACGGAATAA TCGCTATTnT ATAGGGAGTG AAGGTGTTTA AATGCAAAAG 120 

AGTTATAAAC TAGAAGGACT TGATTGTGCA AGTTGTGCAG ACAAAATTGA AAAATCTGTC 180 

CAAAAAATTG ACGGAGTAGA AAAAGCGCGT GTGGACTTCA TGGCTGAAAA GATGACACTG 240 

GAAGTTGAAT CAGGACATGA TTTAGAAGTC GAAAATGAAG CGCGTGCAGT GATTGGTAAG 300 

TTAGAATCAG ATGTCAAAGT TATTTCTTTA AAAGATGTCA AAGAAGAAGA GGGACGTAAT 360

CCAAATAAGA ATCGCTTAAT TCGTATCATT ATTGCATTTG TTTTATTTTT TAGCACTTAT 420 

TATCATTAAA CCAAGTnATA ATTGGGTTGC TTTGGCATCT TATTTAGTCG TTTATGTCTT 480

AATTGGTGGC GACATTGTCA AACGAGCGGT AACGAATATT TTTCGTGGGG AAGTGTTTGA 540


TGAAAATTTC 


TTAATGTCTG ..T.GG.CA&CGAT 


CGGGGCATTC 


TTTATTGGGG 


AATATCCCGA 


600 


AGCTGTGGCA GTAATGTgTT CTACCAAGTC GGcGAATGGT TCCAAArTGC TGCGGkTGAT 660


P7A 7ATPPAP.A A 


AATCmATTGC 


CAAATTAATG 


GATATTCGTC 


CAGATTCAGC 


AAATCTTTTA 


720 


rp tv n TV TPP Ti P 

b I AAA I bbAU 


AAATAAAAGC 


GGTCGCTCCA 


GATACAATTG 


AAATTGGGCA 


ACAAATATTA 


780 


GTbAAAbbAb 


GTGAAAAAGT 


TCCATTAGAT 


GGTCAAATTA 


TTGATGGATC 


AAGTATGGTA 


840 


GAT AOL 1 L 1 b 


CGTTGACTGG 


TGAATCCGTT 


CCAAGAACGG 


TCAAAGTCGG 


CGATGAAAtC 


900 


TTAGgTbbl 1 


TtATCAATAA AAATGGGgCC 


TtGACCATTA 


ATGTtACGAA 


AAAaTTtGGG 


960 


GAT ybAabAb 


TTaGCAAAAT 


TTTAGATTTA 


GTTGAAAATG 


CTAGCAGTAA 


AAAAGCCCCT 


1020 


GbAGAAAAb A 


TTATTTCTAA 


T\ rn rp rn rp rp 

ATTTbCTCbf 


TACTATACAC 


CAGTTGTCGT 


TGTTTwAGCy 


1080 


AtCCTAwlbb 


CAGTGaTTCC 


aCCatTTATT 


TTCCCTGATA 


CTTCTATTAA 


TGAATGGGTT 


1140 


rn TV rp /-i /»*• T\ /•*'/■"• TA rp 

TATCGAbbAI 


TAACGTTCTT 


GGTTATTTCT 


TGTCCATGTG 


CCTTAGTCAT 


TTCTGTTCCT 


1200 


mm ?\ Tl /"■ rn rn ft* /"* rp 

TTAAb I i 1 b 1 


TCGGTGGAAi 


1 bbbbbAbbA 


GCAAACTAGG 


TGTTTTAATT 


AAAGGAAGTA 


1260 


Ti r" , rpTvrprprp7\/" , T\ 

ACTATT lAbA 


tv Tv rp m rn rn tv r~* 

AATTTTAGCb 


AATAbbbAAA 


cAATTGTTTT 


TGATAAAACT 


GGTACGCTAA 


1320 


CAAAAbbAAA 


TTTTGTGGTT 


b AAAAL A i AA 


CTAGTGTTGT 


CTTACCCGAA 


GAAGAGTTGT 


1380 


1 bbb A 1 I AAb 


AbbAAbAbbA 


bAAbAAl IA1 


CCACTCATCC 


AATTGCTATT 


TCTATTAAAG 


1440 


7\ t\ TvrprnTv tpp 

AAAbb 1 A i bb 


P7\7\7vP7A7ARPT 

b AAAb AAAb I 


rTTrr npp n ta 

b 1 1 bb rtotnn 


CAGCAATTGA 


AGAGGTTGCC 


GGACAcGGTA 


1500 


i bAAAbbbAb 


AA1 IbAAbbl 


Zi 7A Ti ZAPTAPTTT 


TAGTCGGTAA 


TGCGAAGTTA 


ATGAAACAAT 


1560 


rprp f"*rp t\ rp /""■/"' 71 

j. i. bb 1 Ai bbA 


RPPTiPPTA "*TA 7A 
AbbAbbAvjAA 




CGGGAACTTT 


AAtTTTTGTC 


GCTATTGATA 


1620 


Tv rp /~» n T\ rp rp m /~« /-• 

A I bAAl 1 lbb 


7A PP P T 7A T T T Ti 
AbbblAl 1 In 


PTPTATTP.PP.P. 
Ij 1 bn 1 1 0\<uo 


ATCAATTGAA 


ACCAGACGCC 


ATTTCAGCAA 


1680 


lAAAAbAAl 1 


AAAGGCTGAA 


GGGGTCAAAC 


AAACTGTTAT 


GCTGACTGGA 


GACAATCAAC 


1740 


Ti 7i PTTPPTA P. 7A 
AAb 1 IbbAort 


AGCTATTGCA 


AAAGAAGTTG 


GTGTTGACAA 


AGTTTATGCA 


GAACTTTTAC 


1800 


p n p n Trr t ta a 


AGTGGATCGT 


TTAGAAGAGT 


TGTTAAAAGC 


ATCTTCACCT 


AAAAACAAAG 


1860 


i 1 v~ 1 1 1 0 1 


TGGTGATGGT 


ATGAATGATG 


CACCTGTTTT 


AGCCCGGGCT 


GATGTTGGGA 


1920 


ttp.pp atp,p.p. 


TGGTTTAGGA 


AGTGACGCGG 


CAATTGAAGC 


TGCTGATGTG 


GTGATTATGA 


1980 


ATGATGAACC AAGTCGTATC GCCTCTGCGA TTAAATTATC AAGAAAAACA TTGAGAATTG 2040

TTAAACAAAA CATTATCTTT GCGATTGCTG TTAAAATCAT TGTCTTGGCA CTAGGTGCAT 2100

TAGGCTTGGC TTCAATGCAA GCAGCAGTGT TTGCTGATGT TGGTGTAACT ATTATCGCTG 2160

TTTTAAATGC TATGCGTTGT TTACGTGTAG AAAAGATGAA AGATAATAAC TAAGCCTTAA 2220

AAAGTACAGC TGAGACGAAC GTTTTTAGCT GATTATCAGG TTTTGAGCGT GGGACAAAAA 2280

TCACTTTGAC TTTTTGTTCC ACGCTTTTTC TCCTACACAA TTTCTATTCG TTCTTAAATT 2340

GCCAGAGTTC TTGTAAAGAA ATCTTTAAAT TTTTCGCTTT GTGGCATATA ATAAGAAAGA 2400

ATTTGTACGA AAATTGCGCT TCGCTTTTTT TTCATGGTAC AATAAGCAAG AATGGATTGA 2460

TTTACAGGAG GGAACCTCAA CATGGCATCA A&TGAAAAAA CACGATATAA AGCAATCATT 2520

GCAGACCATA CGTATACAAT TATTGGCCAA GAGTCAAAAC AACATATGGA...TTJAGTCACA 2580 

AAAATTGTCA ACGAACAACT GGCGGAAATT AAACATCTTT CTCCGCAAAC TGATACGGAA 2640

CAAGCTTCTG TTTTATTAGC AATTAATGCA ATTTCAGATC AACTAAAAAA ACAAGAATAT 2700 

GCTTTAAAAT TGGAAAAACA AGTAGCAGAT TTAAAACAGA AAACCATTCG TTTGGCTGAA 2760

TTAGAGAATC GTGTGCGCCG AATGGAAATT ATCGAAGAAG AAGCACGAGA TGTTTTGAAA 2820 

AAGAATGGAC AAGAAGATGT AGAAATTCAT AATCATGTGG AAGCCCAACA AATTTTGAAT 2880 

GAAAATCGAA AACAGCAGAT TCAAAATAAA GGTGTTTCGG AGTAGTTCCT GAAAGGATGA 2940

CAAGATGTTA ACAATACTTA TTTTATTACT TTTAGCCTTT GGATTTTATA CAGGAGCTAA 3000 

ACGAGGCTTG ATTTTACAAG TGCTCTATTC GGTAGGTTAC TTAATTTCAT ATTTTGTTGC 3060 

TCGTACCTAT TACAAAGAGG TCGCTTCTCA TTTAGAATTG TATATTCCAT ATCCATCGGT 3120 

TACGCCAACG TCAAAGCTTG TCTTTTTTAA TCAAGAGATT TCTTTAGACT TGGACAAAGC 3180 

CTTTTATTCA GCGGTTGCGT TTCTATTATT GCTGTTTGCa GGCTGGTTGG TCGTTCGTTT 3240

TTTAGCTATC TTTTTACATG GATTAACATT TATTCCAGTT TTGAAACAGG TAAATGGACT 3300 

ATTAGGTGGC GTATTAAGTG TTCTTGTGTT ATACGTGGGT CTTTTTCTAG TGTTAGCTAC 3360 

GGCGTCAATG ATTCCGTCAG ATATTGTCCA AAATCAGTTT CGGTCAAGTG GCTTGGCACG 3420 

AGGCATTGTA AAAAATACCC CAATTCTAAC AAAACAAGCA TATGAATTAT GGGTTGAACC 3480 

AATTACGAAA TAAAGCTGAA TGGGACTGAG ATAGAGATAA GAAGTAAGTG AGCAAAGCGA 3540 

AACTTCTTAT GTTATGTCTC AGTCTTTTCA ACAAATGAAA AAGTTAGAAA AGTAGGTGGA 3600 

AAAATGAATC AGCGTATTCT ATCAACATTA GGGTTTGATA AAGTAAAACA ACAGTTGTTG 3660 

CAATTTATTG TGACAGCTCA GGGGACAAAC GAAGTTTCGG AATTATTACC AATTGCTGAT 3720

GAAAATAAAA TTCAATCATG GCTAAATGAA ACACAGGATG GTTTAAAAGT TCAACGATTA 3780 

CGTGGTGGAA TCCCCATTCC TAAATTAGAA AATATCCAAC CACACATGAA ACGGATTGAA 3840

ATTGGTGCGG ATTTAAATGG CATAGAATTA GCACAAGTAG GCCGTGTCTT GTCGACTACT 3900 

TCAGAACTGA CTCGTTTTTT TGATGAGTTA AGTGAAAATG AAGTTGATTT TGAACGTTTA 3960 

TATATGTGGC GAGAGCAACT AGAAGTTTTG CCTGAATTAA ATCGCCAACT CAAACAAGCC 4020

ATTGATGATG ATGGCTACGT CACAGATGAA GCTTCGCCAG CCTTAAAGGC AATTCGGCAA 4080 

AATATTCGTC GTAGTGAGCA GACCATTCGA GAAGAACTAG ATAGTATTAT TCGTGGGAAA 4140

AACGCACGTT ATCTGAGTGA TGCCCTTGTT ACTATGCGAA ATGAACGCTA CGTAATTCCA 4200 

GTCAAACAAG AATATAAAAA TATTTTTGGT GGCGTGGTTC ACGACCAAAG TGCTTCAGGA 4260

CAAACCTTAT TTATTGAACC TAAACAAATT TTAGAAATGA ATAATCGTTT GCGACAACAA 4320

CAAATTGCCG AAAGAAATGA GATTACTCGC ATCTTGGCAG AACTGTCGGC TGAGTTGGTT 4380

CCTTATCGCC GAGAAATTAC GCATAATGCT TATGTCATTG GTAAACTTGA TTTTATTAAT 4440

GCTAAGGCGC GACTAGGAAA AGAACTAAAA GGAGTTGTTc CAGAAATTAG TCmAGCAAAT 4500

CmTGTAGTTT TTAAACAAGC CCGGCACCCA CTATTAAaTC CAGAAAAaGC TGTAGCCaAT 4560

GACaTTGTGA TTGGAGAAGA ATATCAAGcA ATTGTGATTA CAGGGCCTAA TACCGGCGGG 4620

AAAACCATTA CTCTGAAAAC GTTAGGTCTT TTACAATTAA TGGGTCAAGC TGGATTwCCT 4680

ATTCCAGTTG AAGAAGAGAG TAAAATGGGG ATTTTTACAG AAGTATTTGC GGATATCGGG 4740

GATGAACAAT CGATTGAACA AAGTTTAAGT ACTTTCTCTT CACATATGAC GaATATCGTG 4800

TCTGTCTTAA AGAAAGTCGA TCATCAAAGT TTAGTACTAT TTGATGAATT AGGTGCTGGG 4860

ACAGATCCGC AAGAAGGGGC CGCTTTAGCG ATTGCCATTT TAGATTCATT GGGCGCTAAA 4920

GGAGCCTATG TGATGGCAAC AACCCATTAT CCTGAATTAA AAGTGTATGG CTATAATCGA 4980

GCTGGAACAA TCAATGCCAG TATGGAATTT GATGTAGATA CTTTGAGTCC AACCTATCGT 5040

TTATTAATTG GCGTGCCTGG CCGAAGTAAT GCTTTTGAAA TTTCGAAACG TCTTGGATTA 5100

GACAACAGTA TTATTGAGGC TGCAAAACAA ATAATGGACG GTGAAAGTCA AGATTTAAAC 5160

GAAATGATTG AAGACTTGGA AAACCGTCGC AAAATGGCCG AAACTGAATA TTTAGAAGCT 5220

CGCCACTATG TCGATGAATC GGCCGCTTTG CATAAGGwAC TGAAAGAAGC CTA 5273



(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13932 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 



GGTTTAAGGC TTCCACTTTC CCAATGTTGC TTGTAGGTTT GCTTTAAGGC TCGATTTCTG 60

CTGATGTTTG TTGCTCTTTG AGTTGTTTAA ATGTGAAGAT TTCTCGGTTG AATAGGTTGA 120

rGTCTTGATG GGTTAGGTTC ATTTTATTCC CTCTTTTTAG TATCTTCTAG TTACTCTTAA 180

TAATTTTATT TTTCAATAGC CATTGTAAAA ATAACACAAG GACTACAATT GGCCAAAGCG 240

TGATTGGGAA AAAGAGCCCT ACAAAAATGC CTGGTACAAG AATCGTCCCT AAAATCTGTG 300

TTGTTAAACT CGTAGCAAAA ATTGCGACAA CAAAAAGCGA GTCAATTGTT GGGAAAAATT 360

GCGCGACCGC AAACCCGATT AAAATAGCTG AAAAAACGAC GGGAAAAATA TCGCCACCTA 420

TCCAACCTGT CTGTAAACAA ACTTGTAAAA ATACTAGTTT CATTATCGCA GCGATTACCA 480

AGATATACCA GGCTTGTTGA CTACCAAAAC TAGGAACTAA ATGCATAAAA GATTGTCCTG 540

AAAATAGTAA TCTTGGCATT AAAATAACAA AGAGAAAAAT ACAGGCTGCA CCAAAGGTTA 600

CCAGTATGCT TTGCTTTTCT TCAAAAGGTT GGCATAGTTT AGCCAGAGCA GCTTGCAACC 660

ATTGATAGGC GTTACCAATC ACAAAAGCAG TTAATAAAGC GGGTAAAATC AGCCATAACG 720

AAGCCATTTC AATTTTTCCA GTTCCTAAcT TCGTAATAAA CGATGGGTGT CCCACCAATT 780

TCATTAAAAA GGTAAACACC ACCAACCCGT TCACAATAAA TAACCCGTTC ATCACTTGTT 840






TCAGTTTCTT TTTATCTACA GGAGCGAGCA TCGTATCGTA GCGCTGAACA AACTGTTGCG 900

GATGTAGCAA GCGCTGGATT TTTGTCCAAA ATACTTGCTG TTCTTGCTCG TCATAATGAA 960

AATAAAGGTA CCGTAACTTA TCAGATTGCC AGACTGACAA CGAAATAATC GCTCCCAGCA 1020

ATGCGGCTTC AGGACCGACA CCAGCCCCGA AAATTAAAAT TACCAAGGCG GCCAATAAAT 1080

TTCTAAACAC ACCTGAATAA TCGACCGATT GTTTCGCCTT CAATTCTGTC AGAGCCTCAT 1140

GCGCCGTTTG TGGTATTTGC CCGAAGCGTT TCTTCAGATA ACTCAATACC AACGCTCCTA 1200

GCACACACAA GACAAAATAA TAGAGAATTG GCCATTTCAT TTGCGCTGGC AACACCTCCC 1260

AGAGCAGAGT CGTTAGCTCT CCTTCAATGA TTAAAAATGC TCCTGAAATT CCCGCGATGa 1320

TACTACTTAA AAATAACCCA CTGACGATaA TTGtCCACAT TTTTCTGTTA CGCATACTTC 1380

CTCCAAATGA TAGTTGTTTC TTTtATAGTA GCAGATAGTT AGTGGnTTGT GTTTGAAGAT 1440

TTtGGGAAGG ATTaTTTTTT TGAAAAATTG AGGTTATGAA AAGATTCCAT TTTTTATTGA 1500

TCAAATTATC CTTCTTGTAT AATCGTTTCT TTTAGCTATA TATATTCCAT TAAATTAAAA 1560

ACCGCCACTT ATATTGCTTC TTAAGTTCCC ACAAACAACG CCACTTCCAG CGCCAACAAG 1620

TCCACTTCCT TATCAAAGTC CAACAGCACT AGCTGTTCAA TTTTCTCCAA TCGCTGTAAT 1680

AAAGTATTAC GATGAATAAA GAGTTTTTTG GCTGTTTGAG ACAAGTTTCG TTTTGATTGA 1740

AAATAGACAT TCAGTGTATG GACAAAGTCG GTATGTTGCT GTTGATCACT TTTCTGCAAT 1800

GGTTGAAGAT ATTTTTTTCT GAGGGTTGCT TGTCCTTCTT CTAAAAAATA ACGTTGCAAT 1860




A ATf^AAAATT 


CGAGTAGGAA 


GCAATCCTCA 


TTTCAGGTTT 


TTCTGCAACG 


1920 


AATTGGCGTT GAAAGGCTAA AACGTCAATC GCTTGTTGAT AAGATTCATG AATCTGGGTC 1980

AGACCTTGAA TCTTAGGTCC TATTGTTAGG AAAAGTTCCA CATCTTCTTG TACTTTCATT 2040

AACTGATTTA TGCTACTTTG TGCTAGGCGT TGCATCATTT TCAGATAATC GCCTTCTTTT 2100

TCAGGTGGAT TTTCAACTAA TAAGACAAAG GAACTGTCAA TTTCAATACA AACAATTTTT 2160

TGTTTGTCCC CTAAAATAAC TTGGTTTAAC TGGTTACCTA AAATAGACAT TCTTTCTTGC 2220

CTTTGGCGTT CATTTAAGGT TAATGAAATT AGCAAAATCG CATAGGGCTT ATCTATTGCC 2280

AAGCGATAAT ATTGTTGAAA TTGTTGGAAA ACAGCTTGGT CCCTCTTTTC ATGCGCAAAC 2340

AGCTTAAAAA ATAATTCGTT TCGTTGTTGG TATTCTTCAA ATTTCCGTGA GTATTCCAAT 2400

TGCAATTGTA AAAGTAAGTA ATGAGCTGTG ATTTTAAAAT GAGCCACATC GACTGGCCGT 2460


AATGGGGCCA 


AGGTGTTCCA 


GACGAGTAAA 


TAATATAAAA 


TAACATTTTT 


CI ill lAAi, I 


2520


GGATAAATTT CAATTGGCAC CACTTGGTTC TGGAGTTTCA ACTGTAAATG AATCGGTTCT 2580

TGATAAAAGC GTAATAATGG ATGACTTTGT TCCAGCAATT GTTTTTGGAA CTCTGAGGTT 2640

TCTTGTCTAT CTTCTAAAAT GCTAGAAATA CGAATTGGAT TTTTAGGCAA GTCAAAAAAA 2700

TTAACTCCTT CAAAGTGGTC ATTCAATAAC AAGACAGGAT TTTGAATGGT TTCTGCTAGA 2760

TCCTGCAGCA AATGAAAACT GCTACTTCCC GTTTCTAAGA TTTGGTAAAT TTTATTATGC 2820

AATGCGGTTT CTTGCGCATT CATGTCAAAT TGTTCTAACG TGACACGACT TTTCAACACG 2880

TCAGCAATTT GCGAAAAGGT GTAATTGTAA GGAATTGCTA AAATAGGAAC GTTCATTTCA 2940

TCTGCTAATT TGATTAGGTC TGGTGGAACT TCTTCAAAAT AACGTTGGGT TTTAAATCCC 3000

AAGGCAGCAC AATGGTGTTT GGCTAGTTCT TTCAAAAAAT TTTTCTGTAA TTTCCGATCA 3060

TCTTTAAATA TGTACCCTGT CGTTAACAGC AGCTCACCTG CtGTTAGCCa ATCTAAAATA 3120

TCAGGATTTT CTAAAATATT GACTTTTAAA ATCGGTTGGT CCTGATTTTT ATGTCCTGCC 3180

ACTAAGGTAA TGCCAGGCAA TTCTTCTAAC GTTAATAAAT CACTCACTTT TAACATCATT 3240

TATCCTCCCA ATGTACAAAT TGTCCATTCA AACTTGTAAA TCAATGTCAT TTTTAGCAAT 3300

TGTTTAGTGT GCTGGATTCT TTTATTATGA AGTCGTGAAA GCGCTATTAA AAAGGAGGTC 3360

GTTGGAATGA CAAAAATTTT AGCTTATGAC CAAACGATTT TATCGAAACT CCCTCATCAA 3420

ACATTGACTA GTTTGCAATG TTTTACGCAC GCTATTAATT ATGGTCATGT AGACACCCAA 3480

GAAATTCTCT TGACCATTTC AACGCATCCC GAACATATAG CACCTAACAC GATTATACTC 3540

GATGAAAAAA AACTTAGCCA TTTCGCTGAT TTACCTGTCC AAATTGAGCC CCTGTTTATC 3600

CGGTTAGGCC GACGACAATT AACACTAGCC GATGCCCAAA TTTGGCAACC AACGCCACTT 3660

TATTTAACAC CCAGTAAAAC AAAAATAGCT ATTTTTTTAA CGTATATTCA GCAAACTCGT 3720

TTCAATCCTT GGCAAGCGCT TTTTCCTGCT TCGTCGTCAA CCTTTCTTTT TGATTCTTAT 3780

AGTCGGACCC AGATTCAAAA AGAAATGATT CATTTTCAAC AGGCTTGGGA GCAAAAATCT 3840

TTTAAAATAG CTCTTGATTG TTTAACGAAG ATTTTGGGAT TGGGGATTGG TTTAACCCCC 3900

ACTGGCGATG ATTTTATCAC AGGATTACTT GCTAGTTTTT CTGCATTACA GCAACTTCCC 3960

TCGAATTTCC AACAGCTGGC TGTGCTAGCC AAGGAACGAA CGAATGCGGT CAGTTATGCG 4020

GAAATCCATG AAGCAGTGAA CGAGCGGTTT TCTCAGCTAG TTCAGCGTGT TTTTTTATCA 4080

ATTGAAAGTG GAAATACAAA CGAAATGGCT CAAGCGGTTT CAGCTTTAGA GGAGGTGGGC 4140

TCAACTTCAG GAAGCGACAT ACTTTGCGGC ATTATTTTTG GTTTAAAACT ATTTTGTGAG 4200

GAGAATAATC ATGACAATTC AAACAGTAAT TAAAGAGAAC GCCTATTTTG ATTCAGTTAC 4260

TTTAATGACC ATTTCAACAC GAGCCAATGA ACTGGCTGGA GTTAAAACTG CAATGATTGG 4320

GATGGGAACC GACATGAATC TTGaAGTGAT TCGGAATGTT GGTTTATATA CACCCGCCTT 4380

GGATCaTGTT ACAACAGGTG ATTTATTAAT TGTTTTAGAT CTTGATGACC AAGCGAACTC 4440

AGAAGAAATT TTACAGCAAG TCGACGAGTT ATTTACGAAG AAAAAGAAAA CCGCTTCTTC 4500

TGAAGTAACC TATAAGACGT TAGATTCAGC CCTTCATGAA GAGCCTGATG CCAACTTAGT 4560

TGTAATTTCT GTCAATGGAA AATTTGCTGC ACGAGAAGCG CATAAAGCTT TAGATCAGCA 4620

AAAACACGTC ATGCTCTTTA GTGATAACGT GACGGTTGAC GAGGAATTAG CGTTAAAACA 4680

AAAAGCACAC GAAAAAGAAC TGTTTGTGAT GGGCCCTGAT TGTGGAACAG CGATTATTAA 4740

TGGCGTCGGG TTATGTTTTG CTAACGAAGT ACGTTCAGGA GATATTGGAA TTGTCGGCGC 4800

TTCTGGTACA GGCAGCCAAG AAGTTAGTGT CCAAATCCAT AAATATGGTT ATGGGATTTC 4860

ACAATTAATT GGTACTGGTG GCCGTGATTT GTCTGCTGAA ATTGGTGGGC TGATGATGCT 4920

TGACGGTCTG GATACTTTAA TGGTTGACGA ACAAACTAAG GCAATTTTGC TTATTTCTAA 4980

ACCGCCTGCT CCAGAAGTAA CAGAGAAAAT TTTAAAAAAA TTAGCTGTTT CAACTAAACC 5040

AGTTGTTATT TATTTTATCG GCAGTGAACA AACGGAACGA AAAATAGACA ACGTCACTTT 5100

TGCTACTTCT TCTCTGGATG CAGTTCAAAA GGTGATTCAA TTTAGTCAAA CACAGGAAGC 5160

GGAAACCAGC CTCTATCAAA ATCCGACAGG CGCAACGTTA ACAGCCATCC AACAGCAACA 5220

AGCCCCTTCT CAAAAATTTG TTCGTGGGTT GTTCTGTGGT GGCACTCTTT GTGATGAACT 5280

TTTATATGCT TTAACAGAAG TCTCTGATGA TGTTTATAGT AATATTCATA AAATTTCTGA 5340

ACGGCAATTA GCAAATCCTG ACCACAGCCA AGCACATACA TTGATTGATT TCGGTGATGA 5400

TCGTTTCACA GAAGGCCGCC CACATCCAAT GATCGATCCG ACTTCTCGGA TTACTCGCAT 5460

TCTTCAAGAA GCCAAAGATC CTGAAGTGGC GGTTTTAGCG TTAGATTTTG AATTAGGTTA 5520

TGGTTCTCAC GAAAATCCCG TTGGCGTGTT AGCGGAGGCG TTAAAGGAAG CCAAAGCAAT 5580

AGCGAAAGCA GATGGTCGTG AGTTAGCCAT TATTGGCtAT GtTTTAGGGm CCGAAGAAGA 5640

TCCaCAaGAT ATTCATGAcA GCGGAAAnTT TTAGAAgATT TAGAGGTTCT CGTTGTCGAc 5700

AGCAGTCATC AACTTTGCGA AgCcACCAAA ACTTTTGTAA AAGGAGCGAA CGAACATGAG 5760

CAGAAATGAT CTTTTTAAAC AGCCTTTAGA TACGATCAAT GTAGGCATTG ACTTTATTCA 5820

TGAAGATATG AAAAAACAAG GGATTCCTTC ACATCAAGTA AATTGGGCAC CGCCAGCCAA 5880

TGGCGACCCA GAGCTATTAA AATTATTGGA TCAGTTAAAA AATCCAACAC TTTATGAAAA 5940

AATCCAACAA GCCAACGAAG AAGCGGTTAC CCGCATTATT CAATCAAAAC CTATTTTAGT 6000

AGGATTTGAT AAAGCAATTA ATGTCATGCC CGATATGACT GAAACGACCA TTTTACATGC 6060

TGGGCCACCC ATTACTTATG AAAATATGTG CGGACCGATG AAAGGTGCCG TTCAAGGAGC 6120

GCTTGTTTTT GAAGGATTAG CAAAAGATCT AGCTGATGCC GATCGGGTTG CTCGTTCTGG 6180

CGCGATTACG TTTTCTCCTT GTCATGAACA TGATGCCGTT GGTTCCATGG CTGGCGTGAC 6240

TTCTCCGAAT ATGTACGTTC ATATTATCAA AAATGAAACG TACGGCAACA CGGCTTTTAC 6300

TAATTTAAGT GAACAATTGG CGAAAGTTTT ACGATTTGGT GCAAATGATC AATCCGTCGT 6360

GGATCGCCTG ATTTGGATGC GCGATGTTTT AGGTCCATTA CTTCATGATG CCATGACTTT 6420

TTGTCCAGAA GGCATTGATT TACGCTTAAT GCTTTCGCAA GCCTTGCATA TGGGCGATGA 6480

GTGCCATAAT CGCAATGTAG CCGGAAGTAC GTTGTTAGTC CAAGCATTAA CACCTTACAT 6540

GGTTCAAACT GATTTTTCTC GTGAACAATT AAAAGAAGTA TTTGAATTTC TTGGCAGTTC 6600

TGACTATTTT TCTGGTCCGA CTTGGATGGG TGCCGCTAAA TGTGCGTTAG ATGCTGGTCA 6660

TAATGTGGAG AATAGTACAA TTGTCACCAC CATGTGTCGC AATGGGGTCG AATTTGGAAT 6720

TCGTGTCAGT GGGATTGGTG GAAATCACTG GTTTACGGGG CCTGCCCAAC GGGTGATTGG 6780

TCCGATGTTT GCTGGTTACA CACAAGAAGA TGCTGGTTTG GATATGGGCG ATAGTGCTAT 6840

TACAGAAACG TATGGTGTTG GCGGATTTGC TATGGCAGCT GCAGCAGCAA TTGTCCCATT 6900

AGTTGGCGGA ACAGTGGCAG AAGCATTGAA TTATTCAAAA GAAATGCTGG AAATTACTAC 6960


PAAAGAAAAT 


CCCAATGTCA 


CGATTCCTGT 


TTTAGATTTT 


ATGGGGATTC 


CTACAGGCAT 


7020 


PP ATP.TPTTA 
Curt 1 Ul<ul -Lrt 


AAAGTCTTAG 


AAACAGGGAT 


GCTGCCTGTC 


ATTAATACTG 


CTATTGCTCA 


7080 


Tni\ziP.AAPPA 


GGGATTGGTA 


TGATTGGCGC 


TGGATTGACT 


AATCCGCCAG 


CGAATGTCTT 


7140 


1 M^unoo^h 


TTAAAAGCTT 


TAGTAGCTAC 


AATAAACTAA 


TTTCAAAAAC 


TTAGTAAAAT 


7200 


(j 1 Urt 1 Hill 


TGGAGCGATC 


AT AT G AAAAA 


TAAATTGATC 


AATTACGAAG 


CCGTTTTAGC 


7260 


TrTTflTTflTA 


AAAATCTTTT 

riiiiif* x w x x x x 


TGGGAATGTT 


CCTGTATTGG 


ATCATGAATG 


ACTTTCGACA 


7320 


lulu nn^ 


x x X X X X \^ X X 


ACAATGACGT 


AGTTACCAAA 


GTGAATCAGc 


AGGGATTTTA 


7380 




TATTTTGTTA 

x n. x x x x >j x x *i 


TGAACTTCAC 


GGAAGGGGAA 


TTTTACGGTG 


GACTTTTTAC 


7440 




TTAPTAATCG 


GCGGTCTGAT 


TGCTTGGCAA 


CTTTATCGGA 


AAAATTCAAA 


7500 


ATpr;pAf;^A 


TTTGCAATTG 

xxx w vnrv x x w 


CTGGTGGTTC 


TGGTGCTTGG 


CCTTGGGTAC 


TTGCTTCTCA 


7560 


PPTTAPTTTPP 


PTPTTTTTAA 


CTATTTATGT 


TTTTGATTTT 


ACACGCTTTT 


TTACAAAAGA 


7620 


M.O 1 \J 1 lUlVJO 


TTAPPAAPGT 


TTATTGTGGT 


TGTCGGCACG 


CCGCCAGCTT 


TAACGCTCGT 


7680 




GGPTGGAAAA 


AATTAGGAAC 


GATTTCTCTG 


TTGAGTGCGC 


TTTTCACCTT 


'7740 


TPPPTTTf^PP 


AATTGGPTGA 


ATGCGCAATT 


GATGCCCCTT 


TTAAATGTAC 


CTGGGACAGT 


7800 


TTPPA APP,TT 


APPAPAATGG 


CTTTTGTTGG 


CTGGATTGTT 


TCAGCCATCT 


GTCATCAACT 


7860 




PiTTPPTTPTG 


TTGCACCAAT 


CGCTCAGAAG 


CGGCAAAAAC 


AACCTAAAAT 


7920 


TGAAAATACG 


CAAACCTTTC 


AATGGTCAAT 


ACGACGCACT 


TTTGCTGACT 


TCTCCGAAGC 


7980 


TTP,GTTTTAT 
i 1 uu i x x mi 


GGAAATGAGT 


TAGTTGGAGG 


TTTGGTAATT 


CTAGGCGTTT 


TAACAGATTG 


8040 


O 1 1 1U1 unn i. 


ATTAATCATA 

xi x x nn x v**** x n 


TTACAAATGG 


TTCTGGCTTG 


GTTCCTGACA 


TTTTAATGAG 


8100 


ACAAATGATT GCTTCGGCTG TTGGTGTTTT TCTTTATCGA AAACATTTCG CTGAAGAAGG 8160

TTGGTATCCA ACGTTTATGC CACTGGTCAG TATTGTTCCT GGTGTGATTT TAATGACGGG 8220

TGGTGGCTTT TGGCTTTCTC TGGCCATCGC CGTTTTAGCA GGAATCAGTG CGGCTCCTGT 8280

CGGTAACTAT ATTGCTAAAC GATTACCTCC GTTTGTTCCT GGTGCTGTCG GCTTTGTATC 8340

AGGAATGGCT GTTGTAACCA TTTTATTGTC AGCCGTTTTG CATACATTTG ATATTTTTGT 8400


ATAGTGAATA 


AAAAGAAGCG 


AA T GACGA TT 


tbLl itl 1 1 1 


111 rtrt 1 1 onL 


PAATPPPPTT 


84 60 


TTTATCCGTA 


TTCTTAAGCA 


AAAGGAGGTT 


TTGCCATGTA 


CTCCATGTTA 


AAACGTGTGA 


8520 


TAACGGAAAA 


AGATTTGCTT 


CGGCAAATTC 


GTTTGTTGGA 


ACAGTTGTTA 


AATGTTCCCC 


8580 


AGCTGACAGC 


CAAGCGTTTA 


GCCGCGCAAA 


TCCAAACGAC 


AGAGCGAACC 


GTTTTTTCGG 


8640 


ATTTACAGTA 


TATTCGCAGT 


CAATTACCCG 


CCGACTGGTC 


CATTGAAACC 


GATAGCAGTG 


8700 


GTATTCGGTT 


ACGCAATCAG 


GGGAATGCGC 


AAACCAATGA 


. GCTTTGGTCA 


TTATTTTTAC 


8760 
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CTCAATCCAT 


TAG TAT TC AG 


CTATTAAAAG 


AGCTACTATT 


TACGAAAGAG 


TTAGTGACAA 


8820 


CTTCTTTTTT 


GAGCACTAGT 


GGTGTTTCTT 


ATGAAACTTT 


GAAGCGACAT 


ATCAAAAAAA 


8880 


TGAATCAGGC 


CTTACGTGAT 


TTTCATTTAA 


CGATTCAACT 


AACCACGATG 


ACCATTCAGC 


8940 


TAATCGGGGC 


AGAAAGCAAT 


ATTCGGATTT 


TTTATCATCG 


TTTGTTGGTG 


CCTTTTACTC 


9000 


ATAATAATTA 


TTTTTTTGAC 


GATTATTCGA 


TTCATGAGGA 


GCATTATTTC 


CAATTTTTGA 


9060 


AACAAGTTTA 


CAGTAGTGAG 


TTGACGGTGG 


AAACGGAAGA 


AATTTTTGGC 


GCTTGTTGGT 


9120 


TTTTTATTAA 


CACGATTCGC 


AACAAAGCCA 


ATTGCCGAGT 


GAGTCAATTT 


TCTTTTGACT 


9180 


CAAAAGATGT 


CTTGTTTCAA 


CTTTATCAAC 


CGTCCTTGGC 


AAAGCTTTAT 


GCTTCTGAAG 


9240 


GTATCTATTT 


GCAAGGAGAA 


GAAAGTTTTT 


TCGCCTTCTT 


TTGTTTCTTA 


GAAAGCTGGA 


9300 


ATTACGATAA 


TGTCTATGGC 


GAAACGTTGG 


CTTCTGCGCT 


CCACACTCAT 


TACAGTCAGC 


9360 


TGCGAAAATC 


ATTACAGCAG 


TTTGTCACTA 


ATTTGTCCAC 


TGAAGAAGCC 


CGGCCTGATT 


9420 


TAATCCAAAC 


AAATTTATTG 


GATAATCTGT 


TATTGTTATT 


CATTAAATAT 


ACCGAGTCCC 


9480 


CAACACTTTC 


TGAGCAATTT 


CAATTGGAAT 


ATCAAGAATT 


AATGACCGAA 


CAAGCCGCGG 


9540 


AGGACCTTGC 


GTTATCAAAG 


AGCAACCAAG 


AACTTTTGGA 


GATTTTATCT 


CGCTACACCA 


9600 


CGATTGAAGA


ACCGACTTAT 


TTTCTTAGTT 


TGGCTTCTTT 


GTTAGAAAAA 


CAAGCCATTT 


9660 


ATTCCATTCA 


GGCGCAAACG 


ATGACTGCCT 


ATTTCCTCTT 


TCAAGGGGAA 


CCTGCTTGGA 


9720 


AAGCTTTTTT 


ACAACAAGAA 


TTAGCTGCTT 


ATTTAGGTAC 


TCGTGTGAAA 


TTACAAGCTA 


9780 


TTGAATATGT 


GGAACTAAGC 


CAGTTGACGT 


TAAATGAAGC 


CGACATAATC 


ATTTCCAATT 


9840 


TTCCCTTGGA 


TCATCTTGAT 


CTTCCTGTCT 


TTTATCTCTC 


TTTAATTCCA 


ACAAAAAATG 


9900 


AACTCCGCCG 


ACTAGCAGAG 


CTCACTCTTC 


ATTCTTATTT 


TTAAAGGAAC 


TAGCGAGGAC 


9960 


ATTCGCTAGT 


TTTTTGGCTT 


TCTTTCGTCA 


TGAATCGTTT 


GGTTATATAT 


TTTTCCCCTT 


10020 


CAACAAGGAA 


AAACAGACTC 


ACAACATGGA 


CGAAAATCAT 


TGCTAATTGA 


AGGCCATTTA 


10080 


AAGAAGTTGT 


GCCAATTAAT 


TgctGCGCTA 


CTGGCCAGTA 


AACTACTAGC 


GCTTGTAGTA 


10140 


AAAAGAGGAT 


TCCTAATGAA 


GCAAACAACA 


CTTTATTTTG 


GAAAAAGCCT 


GTGCTTAACG 


10200 


ACGGATCAAC 


CAACTTACGA 


CAGTTGATCA 


TATACGCTGc 


tTGCGCTAAG 


ACGATACTTT 


10260 


GTAAGAGGAA 


CGTTTGCTGT 


AAGGCTTGGC 


CCTCAAATTG 


TAACGCCATG 


AGGTAACCTG 


10320 


GTACCATAAT 


GAGTACCGAC 


ACATAGACGA 


TTCTAAAAAT 


ACTGTACTTT 


GTTAAAATGC 


10380 


Li 1 1 1 OnL 




GGTCGTTTCA 

vJvj X v> O J. J. J. \*r\ 


TGGTATCAGC 


ACTCGCTTTT 


TCAAAACCTA 


10440 


ATGCATAAGA 


CAAAGTAATC 


GTAGTTACCA 


TGTTTACCCA 


TAATATCTGG 


ACAGGTGTCA 


10500 


GCGGTAGAGG 


ATGATTCATT 


AAAAGCGCCC 


AGACAACAAT 


TAAACCTTGT 


GCTAAAGAGG 


10560 


TTGGCAAGAA 


GAAGGTAATC 


GTCTTTTTCA 


AGTTATCAAA 


AATGCGGCGT 


CCTTCTTTCA 


10620 


CGGCTTTAGC 


AATCGTATGG 


AAATTATCAT 


CTGCCAAAAC 


GATATCTGCG


GCTTGTTTGG 


10680


TTACTTCACT 


TCCTTTAATT 


CCCATGGCAA 


TTCCGACATC 


AGCTTTCTTT 


AGTGcTGGCG 


10740
CATCGTTCAC


GCCATCGCCG 


GTCATCCCGA 


CAATTTCACC 


ATTGTTTTGT 


AAAGCCGTGA 


10800 


CAATTCGTAA 


TTTATGTTCT 


GGCGTCGTTC 


GTGCAAATAC 


ATCGACTTTT 


TGAACGTGCT 


10860 


GAGCTAATTC 


TTCATCTGAC 


ATAGCATCAA 


TTTCTAATCC 


TTCAAGGACT 


TTTTTCGTAT 


10920 


GCTTTAAACC 


AACTTGCTCG 


CCAATGGCTT 


GCGCTGTATC 


TTTATGGTCT 


CCGGTAATCA 


10980 


TTTTGACGGA 


AATACCT jT 


TCTTGAGATT 


CTTTGACGGC 


TTGAATGGCA 


CTTTCTTTTG 


11040 


GCGGATCGAT 


GATGCCTGCC 


AGTCCAGCAA 


ACGTCAATCC 


TGACAATGTT 


TCATGTGTCA 


11100 


GTTCCTGTTG 


TGACGTAACT 


GTCTTATAAG 


CGAAACCTAA 


TACCCGTTGT 


CCTTTTTGCG 


11160 


CTAATTGCGC AgCTTGTGCT 


TGCCAAGCAC 


CTTTTTGATT 


ATCAGATAAC 


GTAGAAAGTT 


11220 


GTAACAATAC 


TTCTGGCGCA 


CCTTTGACAT 


AAATAATTGA 


ACCTTCTGCT 


TGCGGATGGC 


11280 


GAGTGGCCAT 


GTATTTATAA 


CTTGAACTAA 


ACGGAATTTT 


CTTTTCTACA 


GGGCGTAAAG 


11340 


ACAGTTGATC 


CTGATCAACA 


TATTGTAGCA 


AAGATAACTC 


GGTAGGATTC 


CCCTGCAAGT 


11400


CGGCGATTTT 


TTGTTGGTCT 


TGTAATTTCA 


ATTCTTGACA 


GTTGGCCATA 


ATCTCTTTGA 


11460 


CAATGGTCAT 


TTCTTCAGTC 


ACTACATCCA 


TTACCGTCAT 


TTCATTTTTA 


GTTAGCGTTC 


11520 


CTGTTTTATC 


TGAACAGATA 


ACTGTCATTG 


AGCCTAATGT 


TTCAACGGAA 


GGCATGCCTT 


11580 


TAATAATCGC 


ATTTTCTCTA 


GCCATCTCAT 



GAACACCCAT 


CGATAAAATC 


ATCGTTAACA 


11640 


CAGCAGGTAA 


GCCTTCTGGA 


ATCATTGCTA 


CAATTAATGC 


AATCATCGCA 


GAAAATAAGA 


11700 


GGCTCCATTC 


CATGCCATGA 


CGAAAAGTGG 


TAAAGAAAAT 


CAGAAACACG 


ATCAAGACCA 


11760 


TAATCCCTTG 


GAAAATTPGC 


TTGTTTAATT 


GGTGCATTTT 


CCGCACGAGC 


GGCGTTGTTT 


11820 


TTTGATCTAC 


TGACTGCAAC 


GCGTGATTAA 


TTTTTCCAAC 


TTCAGTGGAA 


TCTCCTGTTT 


11880 


fTAPTArAAC 


GCCCAAGGCT 


GAGCCCGCTT 


GGACAAGAGT 


TCCTGAAAAA 


GCCAAATTCT 


11940 


TTTGATCGCC 


TGTTGGCAAT 


TCCTCATTCA 


AGGAACCGGT 


AATTTTCTCC 


ACGGCTTCTG 


12000 


ATTCGCCTGT 


CAAAATTGAT 


TCTTCAATCA 


TCAGATTATG 


AACATCAAAT 


AAGCGCAAAT 


12060 


CTGCTGGAAC 


TACATCGCCA 


GCTTGTAACG 


TGACTACATC 


TCCCAGAACC 


AATGTTTCAG 


12120 


AAGAAACGGT 


TGTTTTTTGT 


CCATCAATTA 


GAACGACTGC 


CTCTTGGCCC 


ATCATCTGTT 


12180 


TTAATCCGTC 


TAATGATTCT 


TCTGCTTTAC 


GCTCTTGCCA 


ATAACCGACA 


AAGCCATTGA 


12240 


CAAGCACCAC 


TAGAAAAATG 


ATACTTCCTT 


CGACCACTTC 


GCCTGTGGCA 


AATTTTAGAA 


12300 


TTGCGGCAGC 


GAGTAAGACC 


ACCATTAGTA 


AATCTGTAAA 


ATGTTTGGCT 


AGCTTTTGCC 


12360 


ACGTTTTCAG 


TTGTTGTTTT 


TCTTCAATTT 


TATTTGGACC 


ATTCGTTTGT 


7\ a a / r ^/~* r r r n^/ r> T 1 
1 1 UL, J. 


12420


GACGTTCTTC 


GCTAGAAAGA 


CCCGTTTGCT 


TCGTCTTAGT 


CTGCTCGATT 


ACTTTTGTTG 


12480 


AGGAAAGTTG 


ATACCATTTC 


ATTTCGTGTA 


CCTCCTATTT 


ACTTGATACC 


TTTATTATAG 


12540 


AGAGGATTTT 


CTTCACAAAA 


TGGTCGATAA 


TTTCAGTGGC 


CCCTGAAAAA 


AATGGGGCCA 


12600 


AGATTATTAA 


AGAAAGGAAC 


AACCCGCATT 


TTAGCGGGTT 


GTTCCTTTCT 


TTTAAATATT 


12660 


TTCTTGTTCG 


ACCACCAATT 


GACTTTTGCC 


AATTAACGCT 


TTAACATCAA 


TAATTTCACC


12720

ATTTTTACGT CGGTCTGCTA ACTCAACTGA GGTGGTAAAC AACAAGTCAC TAGAAGAGTT 12780
CAACGCTGTT TCAATGGAAT CTTGGACTAC GCCGACGATA AAGCCAACGC CGACGACTTG 12840
CATGGCAATG TCATTGGAAA TGCCAAACAA ACTACATGCA AGCGGAATTA ACAACAAGGA 12900
ACCACCAGCA ATTCCTGAAG CCCCACAAGC AGAAACAGCA GCGATGATGC TCAACAATAA 12960
TGCTAAATAG ATTGGTACGG ACATGCCTAA TGTGTGGACT GCCGCCAATG TCATAATAGT 13020
AATCGTAnAT GCTGCGCCGC CCATATTAAT CGTTGCGCCT AGCGGAATAG AGACCGCATA 13080
AGATTCTTCA TTTAAATCCA TTGCTTTCGC CAATTCCATA TTAATGGGAA TATTGGCCGC 13140
AGAACTGCGT GTAAAGAATG CGGGAATCGC ACTTTCCTTT AATACAAAAA AGACTAACGG 13200
ATACGGATTT TGCCGAATAT TCCAAAAAAC AATCGCTGGA TAAACGACTA AAGCAACCAC 13260
GGCCATCGTA CCAATTAACA ATAATAGTAA CTGACCATAC TTGGCTAACC CTGCAATGCC 13320
TGTTGTTGCC ACCGAATGAA ACACCAAACC TAGAATTCCA ATTGGCGCAA TGCCGATAAT 13380
CATTTGAACG ACCGTGGTAA TCCCTGTTGA AAGATTGGCA ATCACATCCT TGGTAGCGAC 13440
CGAACTTTGG CGTAACCCAA TACCGATTAA TGAGGACCAA AATAAAACAG ATAAATAATT 13500
GCCTTCAATC ATCGCTTGAA TCGGATTTTG GACAACGCTT GTTAACACAT TGCTTAAAAC 13560
ATCACTAAGT TGCGTTGGGG CTGCTTGCGC TTCCTGCGCC GCCTCCAAGA CAATCTTAAC 13620
GGGAAACAGA TAGCTTGCCG TCACGGCAAC AACAGCTGCC AAAAATGTCG CTAACAGATA 13680
CACAACTAAA ATTGATCCAA CATATGTTTT CGCCCCAGCC TTTTGTTGGG CTAGCGACGC 13740
AATAATTAAG ACAAAAACTA AAATCGGCGC AATTGCTTTT AAGGCGCCCA CAAACAATTC 13800
ACCAAGAACG CTGATGAAGG TCCATTCTGG CACCAAGAAA CCCAAGGTGG TTCCAATCAC 13860
AATCCCTATC ATGATTTTTT GAATTAACGA TAATTTCCGA ATCGCTTTGA CCATTCACTT 13920
CACGCTCCTT TA 13932

(2) INFORMATION FOR SEQ ID NO: 364:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9509 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364:

CTGAATTGGT TATTGTTTCG ATTTTAGAAG AGAAAGAGAT TAATGTTACC AATCGTTAGA 60
TAAAAnTTAT TGGCAAGAAC AGTTGGCGAA ATTAGAGAAA CAnACGGAAA AATATCAACA 120
AGAAGCGTTA GCGAATGGCA TCGATAAAGT AAGTGTAATT GTCAATGAAG GGAACCCAGG 180
AGAATTAATT ATTAATAAGC TTATTCCTTT GAATAAACCA GATTTGTTAA TTATTGGCTC 240
CAAGTCAACT AGCAAATTAA AAAGCTTTTT TGGTAGCCAA GCAGCTTATA TGGCTCGATA 300
TGCACCAATT TCTGTAATGA TTATTCGCTA ACAAAATTGA AACaGTTAAA AAAGCCGCCA 360

CGATTTACTA


TTCGTGGTGG 


CCTTTTTATA 


GTAATTTTTG 


TTTATATAAA 


TAAACAGCCA 


420 


AACTTAGTTG 


TAGCGAACTA 


TCAGGAAATT


GTAAAGTTAA 


GTGAAGAATC 


TCTTCTATTT 


480 


TTTTTAAGCG 


GTAAATAATT 


GTATTACGAT 


GTACATACAT 


ATTTGTGGCA 


GTCAACGAAA 


540 


TATTTTTATT 


CGATTGAAAG 


TAGGTGTTGA 


TGGTTTCTAG 


TAGGGAACTT 


TCATGTTTTT 


600 


GATCTTCTTT 


TTCTAATGGT 


TCAATCAATC 


GTTGATATAT 


TTTTTGTTTG 


GCTTGTTTGG 


660 


TCAGGGTTTC 


GATCATTAAA 


CGATAAAAAA 


AGAGGTCGTC 


ATAATTCAAA 


ATAGTTCGTT 


720 


GTTGTAATTC 


TTGACTTGCT 


TGTGCTAAGT 


GGAATGTTTC 


TTGTGCTAAT 


TTATAAGAAT 


780 


CACACAGATT 


GGTGATTTTT 


TTGACAGTTG 


GCCCAATCGC 


TAATTTAAAT 


GATAATTTTG 


840 


GATTCAGTTT 


CATTAGTTGC 


TCTAGTAAAT 


TCTTGGCAAG 


ATGAGTGGTT 


TGTGCAAGGT 


900 


AGGTGGCTTC 


CTCTTGCGCT 


GGATTTTCTA 


GTAATAAAAT 


AAAGCCAGTT 


TGTGTAGGAA 


960 


AGGCAGTAAT 


TTTTTGTCGA 


TATCCTTCTA 


AAATATGATT 


CAATTGATGG 


CCAATGCTAT 


1020 


TCATTTGAAT 


AGTCGGATTA 


AACAGTTGTT 


CCAATTGTTC 


AATGGCTACA 


AGCAAGAGAC 


1080 


TAAATGTATC 


TTTTTCATTT 


AATTGCAGAT 


AATTACTAAA


TTCTTGCCAA 


GCTTGAGGAT 


1140 


TGGTATTTTT 


TTCTTGGATA 


AGGTCTTGAA 


ACAGCGCCCC 


GATCTTTTGC 


CTCGTTGTTA 


1200 


AATTTTCCGA 


AGATtCTAAT 


TGAAGTTGTG 


GTAAAAGGTA 


GTGAGCACAG 


ACCTCCACAG 


1260 


ATAATAAATC 


AATCGTCTGT 


AATTTTTTGG 


TAGTTTGCCA 


AATCACTAAG 


TAAGATAAGC 


1320 


TGAGATTTTT 


CTTTTTGATG 


GGTAAAATTC 


GCAAAGATAA 


CTGATGGTTC 


GCTAAGGAAA 


1380 


ATTCAACAGT 


CAATGGTTCT 


TTATAAAATA 


ATAGGAGCTC 


ATTGTTCTGG 


GCTTTTAAAG 


1440 


CTGTCTCAAA 


CTGTtTTTTG 


AGAGTCGCGT 


TATTTAAAAT 


ACTGGTGATT 


CTTGTTGGCG 


1500 


CATCAGGTAA 


ATCAAAATGA 


TTCATAATCT 


GAAAACGCTC 


GTCTAAAAGC 


ATTACAGGCG 


1560 


CTTGAATCGC 


GGCACTTAAT 


ACTTCCAGTA 


ATTGGGTACT 


TTTTCCGCCA 


GCTTTGATAA 


1620 


ttgtataaaa


TTTATTATGC 


ATATGAACAG 


AATGTTGGCT 


AATCGTTTCG 


TCAGGTTGAT 


1680 


TGACTCGGCG 


ATTGACCAAT 


GAAATAATTT 


GAGAAAAGTT


GTAATTATAA 


GGGATTGAGA


1740 


GAATTGGTAA 


TCCGACGTCA 


TTCGCTAATT 


GGATCATCTC 


TTGTGGAACT 


TCATTGAAAT 


1800 


AACGTTGGAC 


TTTAAAGCCT 


AGAGCAGCGC 


AGTTCATATT 


TGCTAGTTCT 


TGAATAAGAC 


1860 


GTTTTTGCAA 


GCAAGGCTGA 


TCTTTGAAAA 


TATAGCCGGT 


AGTTAATAAA 


ATTTCTCCTG 


1920 


GCGTTAACCA 


GTCAAAAGCA 


TCAGGATTTT 


CTAAAATATT 


TGCTTGTAGA 


ATTGGCGCAT 


1980 


CTAAATTGGT 


CTCGTGCGTC 


AATAAGGTTA 


AACCAGGTAG 


TTCTTCTGCA 


TGGAGTAGAT 


2040 


CATTGACCGT 


CATCATTAGC 


AAGGCCTCCT 


TTTGTATGAA 


TTATCTAAAA 


ATTAAGTGTA 


2100 


TTTATTGTAA 


GATGTTACAA 


AACTATTTTA 


GGGAAAATAA 


GGTATCCTTA 


TACTAATAGA


2160 


AAAAAGTTCC 


AAGTCATCAC 


GTGATTGGTA 


GTCACAAGTG 


ACCAAATAAG 


AACAATCAGC 


2220 


GCTTACAACC 


TATAGAAACG 


CATCGTTTAA 


AGATATAGCA 


ATAGTATACT


TAATTTTTAA 


2280 


TGCTTTTACT 


ATTGTTTCAG 


CTACTTTAAA 


TTTCTCAAGT 


ATTTCTATAA 


TCTGTAAGTT


2340
TTTTGAGACT


CTTATCAGAT


ATGCTTCTTT


TTCTATATTT


TATCAGAAAA TTCTGGTAAC 


2400 


TTTGAAGGTT 


ATAGTTTGTA


AAATGGAGGG


ATCGGTATGA


AGAAATTAAA AATGTTAGGA


2460 


TGCGTCGGGT 


TGCTTTTAGC 


TTTAACGGCT 


TGTCAGGCGG 


GAACGGGAAA CTCGGCTGAT 


2520 


AGTAACAAAG 


CAGCGGAACA 


AAAAATTGCA 


ATTAGTTCTG 


AAGCGGCTAT TTCGACAATG


2580 


GAACCACACA 


CAGCGGGGGA 


TACGACCTCG 


ACTTTAGTCA 


TGAATCAAGT TTATGAAGGA 


2640 


CTCTATGTTT 


TAGGTAAAGA AGATGAATTA 


GAGTTGGGGG 


TCGCTGCCGA AGAACCAGCG 


2700 


ATTTCTGAAG 


ATGAAACCGT 


TTATACATTT 


AAGATTAGAG 


AAGATGCCAA ATGGTCGAAT 


2760 


GATGATCCAG 


TAACAGCAAA 


CGACTTTGTT 


TATGCATGGC 


AACAAGTTGC TTCCCCTAAA 


2820 


TCAGGATCGA 


TTCATCAAGC 


TTTATTTTTT 


GATGTCATTA 


AAAATGCTAA GGAAATTGCT 


2880 


TTAGAAGGCG 


CAGATGTGAA 


TACTCTTGGG 


GTTAAGGCGC 


TAGATGATAA AACGTTAGAA 


2940 


ATAACTTTAG 


AACGGCCCAC 


CCCTTATTTG 


AAATCATTAC 


TTTCGTTTCC TGTTTTGTTT 


3000 


CCACAAAATG 


AAAAATATAT 


CAAAGAACAA 


GGGGATAAAT 


ATGCTACTGA TGCAGAACAT 


3060 


TTGATTTATA 


ATGGTCCTTT 


TAAATTGAAA 


GAATGGGATA 


ATGCCTCTTC TGATGACTGG 


3120 


ACCTACGAAA 


AAAATGATAC 


GTATTGGGAT 


GCTGAAAAAG 


TTAAATTAAC AGAAGCGAAA 


3180 


GTTTCAGTAA 


TTAAGAGCCC 


AACGACAGCG 


GTGAATTTGT 


TTGACTCGAA TGAATTGGAT 


3240 


GTAGTGAATA 


AGCTAAGTGG 


TGAATTTATT 


CCTGGTTATG 


TTGATAATCC AGCCTTTCTT 


3300 


TCAATTCCTC 


AATTCGTCAC 


ATACTTTTTA 


AAAATGAACA 


GCGTTCGTGA TGGAAAAGAA 


3360 


AATCCGGCTT 


TAGCGAACAA 


CAATATTCGT 


AAAGCGTTGG 


CACAAGCTTT TGATAAAGAA 


3420 


AGTTTTGTAA 


AAGAAGTCTT 


GCAAGATCAA 


TCAACGGCTA 


CAGATCAAGT AATTCCGCCG 


3480 


GGACAAACGA 


TTGCGCCAGA 


TGGAACAGAT 


TTCACAAAAC 


TAGCTGCTAA GAAAAATAAC 


3540 


TACTTAACCT 


ACGATACAGC 


GAAAGCAAAA 


GAATTCTGGG 


AAAAAGGGAA AAAAGAAATT 


3600 


GGGCTGGATA 


AAATCAAATT 


AGAATTTTTA 


ACAGATGATA 


CAGACAGCGC CAAAAAAGCT 


3660 


GCTGAGTTTT 


TCCAATTTCA 


ATTGGAAGAA 


AATCTAGATG 


GATTAGAAGT GAATGTTACT 


3720 


CAAGTTCCTT 


TTACTATTCG 


TGTTGATCGT 


GATCAAACGA 


GAGACTATGA TTTAGAATTA 


3780 


TCTGGTTGGG 


GAACCGATTA 


TCGTGATCCA 


TTAACAGTTA 


TGCGCATCTT TACTTCGGAT 


3840


AGTACCTTGG 


GCGGCGTAAC 


GTTCAAGAGT 


GATACGTATG 


ATCAATTAAT TCAAGAAACT 


3900 


AGAACAACAC 


ATGCGGCTGA 


TCAAGAGGCT 


CGTTTAAATG 


ACTTTGCTCA AGCACAAGAT 


3960 


ATTTTGGTGA 


ATCAGGAAAC 


GGTTTTAGCA 


CCAATCTACA 


ATCGAAGCAT TTCTGTATTA 


4020 


GCTAATCAAA 


AAATCAAGGA 


TCTGTATTGG 


CATTCATTTG 


GACCCACGTA CAGTTTAAAA 


4080 


TGGGCTTATG 


TTAACTAATT 


GAATGAAGGT 


AGGAAGGGGT 


TTGTCATGCT TTTAGAAAAC 


4140 


TGTAAATTAG 


AAACAGGATT


TTCTACTGTG 


AATCAGCGAA 


TTGTTGCGAC AGAAACAGCT


4200 


AGCTATGATA 


TTCGAATTGA 


AAACGAGGTC 


TGTACTGCGG 


TCGCACCGCA GTTAAGGCCT 


4260 


TTAGAGAACG 


AAACAGTTAT 


CGATGTGAAT 


CAGCAACTCG


TTTTACCTGC GCTAAGAGAA


4320 






ATGCATATTC 


ATATTGATAA 


AACGTATTTT 


GGGGGACCTT 


GGAAGGCATG 


TCGTCCACTA 


4380 


ACGAAAGGAA 


TTTTAACACG 


TATTGAAGAA 


GAAACTTGGC 


TGTTACCTGA 


ACAATTACCA 


4440 


ACTAGTTTAG 


AACGTGCTTG 


CCAAGTTATT 


GAGCAGTATA 


TTCAACAAGG 


CCATTATCAT 


4500 


ATTCGCAGTC 


ATTGTAATGT 


AGATCCATCT 


ATTGGAACGA 


AACATATTGA 


AATTACTAAA 


4560 


GAAGCATTTA 


GTCGATATGA 


ACAATACATT 


ACTCATGAAA 


TTGTGGCTTT 


TCCTCAACAC 


4620 


GGATTACTCC 


GCTCACAAGT 


CGAACCTTTG 


ATGCGCGAAg 


CnTTACGCAT 


GGGGGCAACT 


4680 


CATGTCGGCG 


GCGTTGATCC 


AGCCCTTGTG 


GATCGTCATG 


TGGATCATTC 


AATCGCAAAA 


4740 


ATATTTGAAT 


TAGCTACCAC 


ATTTAATAAA 


AAAATTGATG 


TTCATTTACA 


CGCCCGTGAT 


4800 


ACTTTGGGAC 


TTTATGAATT 


TAATAAATTT 


GTTGATTATA 


CAGAACAAGC 


AAAGATGTTT 


4860 


GGCAAAGTCA 


CTTTAAGTCA 


TGCTTTAGCC 


TTAGGGAGCT 


TAGAAGAAGC 


AGCAATTAGA 


4920 


GACATTGCGC 


AAAAATTCAT 


CGAAACAGGT 


ATTGATTTGA 


CTTCGACTGT 


ACCAATTGGA 


4980 


ATGCCCACCA 


TGCCGATTCC 


AACCTTGGTT 


GAACAAGGTG 


TTAAGATCTC 


CGTGGCTCAT 


5040 


GACAGTTTGA 


CGGATCATTG 


GTCGCCTTTT 


GGCAGTGGCA 


ATACAATAGA 


AAAATTAAAT 


5100 


ACAGCCGCTC 


AAAGATTTAA 


AATTAGCGAT 


GAGTATCGCT 


TGAATCGTTT 


ATGGGGCTTA 


5160 


GCGAGTAATT 


TTGTAACGCC 


TCTTGATCCT 


AACGGGCAGC 


AAGTTTGGCC 


TAACGTCGGC 


5220 


GATTCTGCAG 


ATTTCCTACT 


ATTTAATGCT 


GAGAGCACTG 


CTCACGTGAT 


TGCACGCCAA 


5280 


CAGCCAATCC 


AACAACTGAT 


ACTAAAAGGA 


CAACTGGTTG 


AAGCCGTTCA 


AAAGGGGGAA 


5340 


AAAGAATGAC 


AGCATCCaTT 


GTTTATTTGA 


ATAATGTTCG 


TTTAGAAACT 


GGTTTTGCTA 


5400 


AAGACCACCA 


CGAATTAACG 


TATACTAAAA 


CCGCACGCTA 


TACTTTAGCG 


ATTCAAGAAG 


5460 


GCAAAATCCA 


AGCGATTATT


CCGCAAAATC 


AAGTCACTGA 


AAAGCAACAA 


GGCACTGATT 


5520 


TAAACGGACA 


ACTGGCTATT 


CCCGCTTTTC 


AAGAGAGTCA 


TAATCACTTA 


GATAAAACCT 


5580 


ACCTTTCATT 


AGGCTGGCGT 


GCTAGCCAAC 


CTGTTAAAAA 


TTTAAAAGAG 


CGTTTAGCAG 


5640 


ACGAAGCCAG 


TGAACTTAAA 


TTGTTAGCTC 


CGTCAACAGA 


ACAACGAGCG 


ACGGCCATGA 


5700 


TTGAAAAATT 


GATTGGGTAT 


GGAGCCTCTT 


ATATTCGGAC 


ACATGTCAAC 


ATTGATCCTT 


5760 


ATGTAGAACT 


AGAAAACTTC 


TGGGGCGTCA 


AAAGAGCCTT 


AGAAAAATAT 


GCGCATGTGA 


5820 


TTGATTATGA 


TATTGTTGTC 


TTTCCACAGC 


ATGGATTATT 


AAAAAATCCC 


CAAACGGTTT 


5880 


TACTCATGAG 


AGAAGCTCTA 


AAAAATGGTG 


GAACAATGGT 


TGGTGGTTTA 


GATCCTGCAG 


5940 


GCATTGATTA 


TGCAATTGAA 


GAATCTTTAG 


AAACTATTTT 


CGATCTGGCA 


GAAGAATTTC 


6000 


AAGTAGGAAT 


CGATATCCAC 


TTGCATGATA 


CGGGGGAAGT 


GGGCGTTTAT 


ACAATTGATA 


6060 


AATTTTTAGA 


CATTTTAGAA 


GAACGAAAAT 


TTACTCAGCG 


AACCGCTATC 


AGTCACGCCT 


6120 


TTGCGCTGCT 


AGATGTACGA 


CCAGCAGAAA 


AAGAAAAACT 


ATACCAACGC 


TTGGCAACGC 


6180 


ATCAAACGGC 


GATTATGTCT 


ACAATTCCCT 


ATGATCCTCG 


TTATTTATTG 


CCACCAATTG 


6240 


ATGTGTTACG 


CCAAGCTGGT 


GTATCGGTTC 


ATTTAGGTAG 


TGATGGCTTT 


TTTGACTCTT 


6300
GGAGTTCTAA TGTTTCAGGT GATTTATTTG AAAAATTACG TAATTTTTGT GAAATGACAG 6360

GAAAGATTAC AGAAGAGCAA CTTACTCAAG CTTACGTCCA TGGCTGTGGC AAAGTAGCAC 6420

CTTTTTCATT TGAAGAAGAA CGCCTTTGGT TTACAGAAGG CGACGAAGCA AACTTTATTT 6480

TTACAGAAGC AGCTTCTACA GCAGAAGTGA TTGCTAGAAA ACCACAGAAA CGCCAGATTA 6540

TGTTGAAAGG ACAATGGGTT TAACTTGCTT TAAAATTAGG TAGAAATAGA ATGCTAAAAA 6600 

GCCAAACAGA GGAAACAAGG TTCAGCTCTG AGCAACGCAT AGTGTCAGTC TAGCGGTTAC 6660 

TCGTCACAGT TTAATCCTTA TAAAGTGCTT CTGCTCACAA CGTGTATCAG CGTTTCAAGT 6720 

TTGTTTGACT CACTTTAACA TAGCACTTTT GCGTATCTTC CTTTATAATA AGACAAAATG 6780

AAGGAGATGA ACCAAGCATG ACTGAACAAA CGAGAGAAAA CAAAAAAGCA GCCTACGAAT 6840 

TGTGCTTACA ACAATTGAGC GGTTTGTTAG AAATAGAAAC GGATGCTATT GCAAATTTAG 6900 

CAAACTCGTC CGCGTTACTT GCACAAACAT TACCAGACAC CGTTTTTGCT GGGTATTATT 6960 

TATATAAAGA TGAAGAATTA GTTTTGGGGC CTTTTCAAGG GAAAGTCTCT TGTACCAGAA 7020 

TTAAAATGGG CAAAGGTGTC TGTGGTGAGT CCGCGGAAAA ACAAGCTACA TTGATTGTTG 7080 

ATAACGTGAA GACACACGCC AATTATATTT CTTGTGACTC AGCTGCGATG TCTGAAATCG 7140

TTGTACCGAT GGTAAAAAAT CATCAATTGG TCGGTGTTTT AGATATTGAT AGTGGTGTAA 7200

CAAATAGCTA TGATGCGGTT GACCAACTGT ATTTGGAAAA ATTTGTTACC TTGTTACTAG 7260 

AAAAATCAGA TTTTTAAGCA GACTATTTTT GCTTTCTGAC AAATCATGAG AAAATAAAGT 7320 

ATAAAATTTG ATCATGAAAG AAGGTGAGTC ATTTGTTAGA GCTGTATATA AATTTTAAAG 7380

GTGAAGCAAA AGAAGCAATT GCGTTCTATG AAGATGTTTT TGATACAAAA TGTCAAAATT 7440 

TAATGACTTT TGGCGAAgcA CCTGAAGATC CTGAGCATCC TATAAACGAC GAGATAAAGG 7500 

ATTTAGTGAT GAACGCAAGT ATCATTATTG AAGGAACCTA TGTAATGATT TCAGATGTTC 7560

CTGATATGTT TGGTTTTGAA GTTACGGAAG GAAATAACCT TTCGCTCGTG GTTAGCACAG 7620

ATGATGATGA AAAAATTGAT CGCTTATTTA AGCGATTATC TGAAGGTGGC ACTGTTACAA 7680

TGCCGTTTCT GAAACTTTTT GGTCCAAAAA GTATGGTTCC CTAAAAGATC AGTTTGGCAT 7740

CCATTGGATG TTTAATTATT ATGAAGAAGA TTAAAAAAAT CCCCAATCCG TTATTTTCAT 7800 

TGGATTGGGG ATTTTTTATT CATGAAAGAA TTGCTTCTTA GCGATTTGAT ACTGCCGTTG 7860

CTTTTCTGTC CAAAATTGCT GATCGTTTTC ATCAATTTTG CGAATGACTT TTGCTGGAkT 7920

ACCCCCAACA ATCGTGTTGG CAGGAACGTC TTTTGTAACC ACAGCACCAG CAGCAACAAT 7980

AGCATTTTTT CCAATTGTAA CGCCAGGTAA AATGGTGGCA CTTCCGCCAA TCCACACGTT 8040

ATCTTCAACA ATAATTGGCG TACCGAATTC TAATTCTGCA GTTCGAATCG TTGGATCAAT 8100 

TGGATGGCCG GCAGTATAAA AACTAGCACG AGGGCCAACC ATGACATTAT CACCGAAGAT 8160

GATTTTGTTT ACATCTAAAA AAATGCAATC CATATTTGCA TAAAAATTTT CTCCTACCTC 8220 

AATATGGCGA CCGTAATCTA CATGGAGGGG TGGGGTGATA TATAACTCAT TGCCAGTTTT 8280
TCCTAATAAT


TCTTTTTCTA 


GGGCCACAAT 


TTCTTTCACA 


TTATCGATAG 


GAAGTTGATT 


8340 


GATTTTTTGA 


GCTAACACCT 


TGCCTTTtGT 


TGATTTGTTT 


TCGGGTAAAA 


TGGTGGGGGC 


8400 


GTAATATAAT 


TCACCTTTTA 


ACATTTGCAT 


ATAATCAAAG 


TCATTTTGTT 


TCATGTAAGC 


8460 


GCTCCTTTAG 


GCTTTTTCTC 


TATTGTATCA 


AATCAATACA 


ATAAGTCGAG 


GAATATGTAA 


8520 


AATAAAAAAG 


GACAGATAGC


AAAATAGCCT 


TGCTTATCTA 


TCCTTTATCT 


AAGGTTATCC 


8580 


AATTTTTTTA 


ACTCTCCCGC 


CAATTTTATC 


TGCTAATTTT 


TGTGCTTCTT 


TTTCAGAGTG 


8640 


GTAAACTAAA 


CAACCTGATT 


TATCTCGCGT 


ACGTTTACCA 


TCTGCAGTTA 


AATAGCCATG 


8700 


AAAAAATTTG 


ATTGCATACA 


TAGACATCAC 


CGCCTTTCTA 


ATTGAAAATA 


GTGTAAATGG 


8760 


AGCGATTAAA 


AATGTCAAAT 


AATAGAAGTG 


CTCTTAACCA 


TTCTTATCTT 


TTTTAGTTCG


8820 


AAGATAAGCT 


TGGTTAAGAG 


CACCATTTCA 


CTTAATAAAT 


TAGTTTGAAA 


CGGGTCGTTG 


8880 


AGTGAAATTC 


CTGTTCAGGT 


GAGAGGGTGA 


TAGAACCGAA 


CAACGGAAAT 


CTTTCAGCAC 


8940 


CAGGACAGAC 


TTGGGTTTCA 


AATGTGAGCG 


CTCCGTGATT 


GACTAATTTT 


TGCCCTTGAA 


9000 


TATCGGGACC 


GTTATCAGCA 


AAATTTGCTG 


AAAAAATAAC 


AACTGAAGGT 


TGATCAGTGA 


9060 


ACATCTCTAA 


GGTGATATCT 


TTAGTAGGAC 


TTTTTAAGAT 


AACATCCGGG 


TTAATTAAAT 


9120 


CATGATTGGT 


TAAGATGAAA 


GGGTGATCGA 


AGCCGTTCAT 


CTTTTGGATT 


TGTTCATCGT 


9180 


TACTTGAAAA 


AACCTCAGCT 


AATTTTTTAG 


GTTTTTGTAA 


ATCAAAAGCT 


GTCTGGTCGA 


9240 


CGGAAGCTAA 


TTCTCCAAGC 


GGTAAAACTC 


TTTCATCAAG 


CGGGACAAAC 


TTTTGGGCAT 


9300 


TTATTTGTAA 


CTCATGCGCA 


TCAATGGGGA 


TTGTAGGATT 


TCCAGTTAAA 


TTAAAGTAGA 


9360 


CATGGATTGG 


TTGGATTAAA 


AAGTGTTTGT 


TCATCTGTAG 


TTGCCTGGAT 


ACTCAATTTG 


9420 


CCATTCATTT 


TCTTCAGTCA 


ATGTATACGT 


GACTGAAACA 


GTGAGATTCC 


AGGAAAACCA 


9480 


TTTTCACCAG 


CAGGACTCTC 


TAGCCAAAA 








9509 



(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5247 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:

CCATTCCTAA AAAAAATTCA AAATCTTTCA GAGAAGCTGT TAAAGGAATT TTTAAGTTCA 60 

CAGATATTAA CTTAGCGCGT GAGGCTAAAA ATCGATTGAT TCATGATTAT ATCGATCAAC 120 

CAAAATATTC AAAAGcTTGC GCATCATTGG ATGATGGATT CGAAGACGCC TTTCAATATA 180 

CCGTACAAGG AAATTCCCAC AATCGACTAA AGAGTACCAA TCTAATTGAA CGACTGAATC 240

AAGAAGTACG CAGAAGAGAA AAGATTATTC GCATCTTCCC CAATCAAACA TCAGCCAATC 300 

GCTTAATTGG AGCCGTTCTT ATGGACCTAC ATGATGAATG GATTTATTCT TCAAGAAAAT 360 

ACATCAATTT TGATAAGTAG AAATGGTAAA AACATTGTAT AGCATTTTAC ACAGGAGTCT 420
GGACTTGACT


CGGAAGTCCA 


AACCAAAAAC 


AAACTAATTA 


TTAACCAACC 


TGAGCAAACT 


480 


GAAAAACCAA 


TGAAAGATGT 


TGCTAAATTG 


CTGAAAATTT 


TACCGCCAAC 


ATTTAACGAA 


540 


ACAGTGCTCC 


TTCCAGTCCA 


AACACAGTTA 


AAAGAACACT 


ATCCGCGAAC 


GGTTTCCCTC 


600 


GTGAAAAACT 


TAGTCAGTCC 


GCTGCCAGAA 


GAATACCGTT 


TCTTGACAGA 


GGTACTCTTA 


660 


TCTGTATTAC 


TCAGCGGAGA 


AATTTCTGAA 


ACGATTCCTT 


ACCAAGCGCT 


CCTCGTGGCG 


720 


CATGGTGAAT 


CGACTGCCAC 


AAGTATTCAA 


GCCGTTGCCA 


ACAAATTATG 


TGGTACCTAT 


780 


ATTTTCGATG 


CGATTAACAT 


GCCCTTAACC 


TCCTCCGTCC 


GAGATATCGT 


CGCCGAAGTT 


840 


AAAGAGTGGC 


TTTCTCAGCG 


GGACACATCA 


CAAGGCGTCA 


TTATGCTCGT 


CGATATGGGT 


900 


TCTCTGACAC 


AACTTTATAA 


AAGTTTGAAA 


CCGCAAATTC 


TTGGAGAACT 


TTTGGTGATC 


960 


AATAACCTGA 


CCACTGCCTA 


TGCGTTGGAA 


ATTGGTCATC 


AATTAATGAA 


CGAACAACTT 


1020 


TTCTATGGTA 


TTGCTAAAAC 


CGCTGAAAAA 


AAATTTAAAA 


CCGATGTACA 


GTACTTTGAA 


1080 


GGCTTTTCAG 


TAGAAAAAAA 


TATCATTGTC 


TCTAGTATTT 


CAGGTCTGGA 


TATTGCCAAG 


1140 


CAAATCAAAC 


AAATCTGTCA 


AAAATACCTG 


TATACTGACA 


TTAAAGTCAT 


TACTCTAAAA 


1200 


TACAAAGATT 


TGGTAAACAC 


GTTAGACATT 


GCCAACGCCG 


AAGAAAATTA 


TTTAAAAGAA 


1260 


ACCTCGCTCA 


TTTTGACGAC 


GTCTTACTTA 


GACAACCATA 


CGAATGTTGC 


TAGCGTGAAT 


1320 


CTCTTAGACA 


TGTTAGACGA 


AGATGCAGGA 


ACCCAGTTGA 


TGGAACCTTT 


CCAAAACTTG 


1380 


ATGCATCCCA 


ATAACATTGA 


TAGCATGATT 


AACGAATTTG 


TTCACTTCTT 


CTCGAAAGAA 


1440 


GGCTTGTCAG 


AAAAACTAGA 


ATTTCTAAAT 


CCTGACGTCA 


TTATTAAACA 


AGTCGAAAAC 


1500 


GTTACAAAAA 


ACATTGAAAA 


ACGCTTCGAC 


CTCACACTCA 


GCGGCAAAAT 


GAAATTTAAC 


1560 


TTAATGATGC 


ACAACGCGCT 


CATGGTAGAG 


CGAACCATGC 


TCGGCGTTGA 


AGACTATGAA 


1620 


ATCCCTGCTA 


ACCTCGAGGA 


ACTAACCATT 


AACCAAAAAC 


CATTTTTCCA 


AAATGCTAAA 


1680 


AACATTTTCT 


ATACTCTAGA 


ACAGTTTTAT 


CGAATAGAGA 


TTTCCAACTG 


GGAACTCTAT 


1740 


GTGATTTATG 


AAATTTTGTC 


TTCTCGGTGA 


GGGGGATGGG 


AAAAGCCACT 


TTCCAGAGGA 


1800 


AAGTGGCTTT 


TTtAGTTtGT 


GtTCTGCGTT 


AAAAATTCAT 


ACATATTAGG 


TTCTACAGGT 


1860 


GTTTCTAAGG 


TAAAGTCTAG 


TTTAACGACG 


CTTTTTTTCT 


TACCCTCTTT 


GTCTTGTTTT 


1920 


TCGACTTGTT 


GGATAGTTyC 


TTGAAGAGGG 


GCTACTtCGC 


CTAGATAATA 


GAAATCTGTT 


1980 


TCGTTCTCTT 


TTTGATTATA 


TTTCCGTTTC 


ACAAATAAAT 


GAAATTTATA 


TTCCCCAGGA 


2040 


TTCCGCAAAA 


GTTGAACTTC 


TGGCGAGCGT 


AGATTTCTAG 


GCGCTTTTGT 


ATACCAATGG 


2100 


AATGTTCGTT 


CATCAATAAA 


TTCATCCTCA 


TAAGCAATAC 


GTGCAGCTTT 


AAAATCTTTG 


2160 


CCTTTATCCA 


AAGTTACAAA 


AATAGCAAAA 


TGATTATTAC


TGTTTGTATA 


GCCACCAATA 


2220 


CTTTGTTCAT 


TTTGATTAAA 


GTTCATATTT 


AATAATCGTA 


AAACATCTTT


ACGACGATAT 


2280 


TTTTTGTATA 


AATTTAACGG 


TTGGGATAAA 


TCATACTCCA


TAGATTTGTT


AATCCCTGTA 


2340 


TGGATTACAT 


CTAACATCAA 


ATTGATGAAA


TAAGAATTCT


TTTTAAACTC


TCTGAATTTT


2400
TCTGTTATTT


GTAAACAATT 


TTCTTTCATC


TTAAACATTT GTGATAACAA ATAACGTTTT 


2460 


CTAATTGTTG 


ATTCGTAAAA AGATAAATCy


AAGACTTTAA CAACAGATGC TAGAGTATCT 


2520 


GTCGTATACG 


TCAGGTTAAA 


CGTATCAAAC


AGCGCTGTGA TTTCCTCTAA AGATAACTGT 


2580 


TCTTTATCTG 


TCATTAAAAA 


TTCTTCAATA 


ACATAAAGTT CATGCGGGCG AATTCCTGAT 


2640 


AGCAATTCTA 


CAGAAACAAA 


TTTTAGTGCA 


CCGTTTTCTT CTTCACTTAA TTTAGCTTCT 


2700 


GTTTCCCCAA 


TTCTCTCCAA 


GAAATTATAG 


TATGTTTTTT CTTTACCTAT AATCAACAAT 


2760 


GGATCCACGC 


CATCTAACAC 


TTGAAAATCA 


TTCAGGTAAG GAACCCTATT CAACCGATTT 


2820 


TTCAACTTAA 


AATAACTAGC 


TTTTAGTTCC 


GACATCGCAT CTAATTTAGC TTGATCAATT 


2880 


GATTTAAAGA 


TACGTTCACG 


AGCAACTTTT 


TCAAAATTAA TTGAAGATAA ACCACTGATA 


2940 


TACGTCACAT 


CAATTGTATC 


GCGACGTAAA 


CTATTCTTAG TTTGAGaACT ATCTCCAGAT 


3000 


AAAGCCATAG 


GAATCATATA 


GTTATtTTTA 


TAATTaCCAA TAAAaTCaAT TACCGTAACA 


3060 


AATTCTTTTG 


ATGGGTCTTT 


TCTTAGCCCC 


CGACCTAaTT GCTGAATAAA AATAATACTA 


3120 


GATTGCGTAT 


TTCGCAGCAT 


AATTACTTGA 


TTTACTTTAG GGATATCAAT GCCTTCATTA 


3180 


AAAATATCAA 


CCGTGAATAT 


ATAATGAATC 


TCTCCAGTAT CTAAACGCTT AATTTCCTTT 


3240 


TCTCTTTTAT 


CAAAATTATG 


GTCACCAGAT 


AAATAAGCAC TCGGAATTCC ACGATCGTTA 


3300 


AACATCTCTG 


CTAACTGCCT 


TGCTTCGTCT 


TTACGACTAC AAAACACCAA GCCTCGAGGA 


3360


CTATCTCCCG 


AATAACCATA 


ATAGTTAATT 


TTTTTTATTA AAAAATCAAC TCGTTCTTTC 


3420 


TGTAGCAAAC 


GTTTTAAATC 


GGTTGTTTCA 


TCAATGATTT CACCATCTTT TTCATAATCA 


3480 


GTTACGCCAA 


AATAATAAAA 


TGGACAAAGC 


AGTTCTGCCT CTAAAGCGGC TTGCAAACGT 


3540 


ATTTCATACG 


CTACATTATA 


ATCAAATAGC 


TCAAAGATAT TAAAATTATC TGTTCTTTCA 


3600 


GGTGTAGCCG 


TCATTCCCAA 


TAAAAACTTG 


GGATTGAAAA ATTCAATGGC TTTAATATAA 


3660 


GATTTCGCTC 


CAGCTTTATG 


GACTTCATCA 


ATCAAAATAT AGTCAAACGC TTCTTTTCCT 


3720 


AAACGCTCTT 


GATATTCACT 


TTTAGATACT 


GTTTGAATTG TTCCAAATAG ATATTTTGCA 


3780 


TCAAAATTCT 


TAGAATTCCC 


TGATAAAACA 


CCATAATCAT CATCTTTTCC ACCTAAAATC 


3840 


AATTTGAATG 


ATTCTAATGC 


TTTCATTAAA 


ATTAGCTCAC TATGAACAAT AAATAATACA 


3900 


CGTTTAGGAt 


GGAATtGCTG 


cACATCAAAk 


GCTGcTAGAT ACGTTTTCCC TGTACCAGTC 


3960 


GCCGAAACAA 


TCATtCCCTT 


AGTTGCGCCC 


GTTTCTCTAA GCGCTTTTAA ACTTCTAAGC 


4020 




GCATAGAGTT 


CGGCAAAATA 


TATTGcGTAT CTAAACACTC GTTTTCAGAA 


4080 


ACATCTTGCA 


CTAACATCTT 


TCTATTAACA 


ATAGGATGAT AATTTTTTTC AAATGCATCG 


4140 


ACCCATTCTA 


GGGTTAGCGA 


CTGTGCCTCA 


TGCCATTCTT GAATGAGATG CTTTTGAATA 


4200 


GAAGAAACCA 


TCTCGCCATT 


CTCTCTTGAA 


GTTAATCTAA CATTCCACTC ATAGTTAAGT 


4260 


TTCAAAGCAG 


ACATTGTTAG 


GTTCGAACTA 


CCAATAACAA AGCTTTGATA ATCCTCCATC


4320 


TCAAACAAAT 


ATCCTTTTGA 


ATGAAAACCT


TCTTTTTTAG AAACTCGTAC TTCTACATTT


4380
GGAATATTCA ACAATGCTCG AAATACGTGT GGATCATTAA AATTTAAGTA TGTTGATGTt 4440

AAGATTTTtC CAGAGATTCC ACGCAAAGCC AGATCAGCCA GTTGTGTTTT TATTGCATTT 4500

AAGCCACTAG GTGTtATAAA TGCAACGGCT ATAAAAAAAG TCTTACAGCT TTTCAAATCT 4560

TTACTCAATG TATTCAATAA ATAATCATGT TTTTTGGGAT TGTTAACAAT AAATTTAGGT 4620

GTAAGTTCGT CACCTAGAGT TTCCTTATTA ATAAATGCCT TTTGTAACGC CAATTCTAAA 4680

TTATCCATTT TGTTCAGTTC CCACACCTTT TTCTACTAAT TTwTCCACAG TAGGAATATC 4740

AGCAGGCGCC CACTCAAGTT GGTTTAATTC ATTTGGCGTT AGCaTTTAAT TTGTAGATGT 4800

TCTGTCAACT GAGGTTCACC TGATTCTAAA TAACAGATAA ATGTAGTTAA ATTAACGAAA 4860

CCAAAATCAT ACTCATATCG ACAAAAAGCA TACTCTTCCT TGACGATCGT TACTTCTATT 4920

TTTAATTCTT CTTGCAATTC ACGTTCaAGC GCTTGaACTT CCGTCTCACC ATTCTCGATT 4980

TTTCCACCAG GAAATTCCCA AAGATTTGCT AATGCACGTT CAGGTCCTCT TTGACAACAC 5040 

AAAACTTTCC CCTTCTCCAC TATGATTGCT CCTACTACGC GAATTTCTTT TTTCATATAT 5100 

TTTAACCCTC TAAtCTAATT GATTTTGTTA GTTTwATTAT ATTTAGGATC GCATAAATAA 5160 

GCTAGGGTTG CAATATwAGt GGCGkAAATT TCATTTATTA TATTTGTTTT TTGATTTACT 5220 

ATACTTTAAT TATTTAATAA CTCTTCA 5247 
(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

GGACTTTCTC CTCCTGAGTA TTTGACTATA CTTACAGTCA ATGGTCGGAA GATCGGGTCa 60 

GTACCCcTTT GGaAATmAtT TArGTACcAT TATTTAGCAC CTTtCCcTGa CTTAGAAGAA 120 

GTGATTATTC GCTTAGATGA ATTTGATCAG GTTTTGTTCG CCAATCGCAC AGCTACAAAT 180 

GGAAAAAAAA TGGATATGTC ATCAATTAAA AAACATGGCT TTCAACTAAT GCGAATTATC 240

TCCGATCCTT ATACAGGACA ACCACTGGGA GAATTGTACA CTGTGTTTTC AAGTCAAGAA 300 

TTGCTAGGCA ACCAAGCAGA CTTATTGAAA AAATCAGGTA TTAATGCGTT TATCTATGAT 360 

AGTGCTGGGA ATCAAATTTT TTCTGAAAAA GCGCAATTTA CCAAAGAAGA AGAGCGGCAA 420 

CTAGATAAAA GAATGCACAC CGATAGTGAC ATTCAACAGG TTTTTCATAA TCGCTACGAT 480 

ATTACGGAAA TAGAAAGCAG TGGCCGGAGT ACCATTCTCT TATTAACAAG CCGCCGAGTC 540

TTATTTCAAC AATTGTTTAT GAATTATGCG GCCATTTTAG GGATTGGGTT ACTTTTGATT 600

GTTATTTTAT TAGTGGGCTT AAACCGCCTA TTTAAACGCT ATTCTCAACA AGTCCAATTA 660 

ATTTTAGAAG CGACACGAGC CATTGGCGAT GGCAATTTAA AAGAACGAAT TGACACGAAT 720
CAAGTTCAGG


AAGAGTTGAA 


TGATATTGcT 


TCCGCAATTA 


ATTTTATGGT 


AGACAGTTTA 


780 


GATCAATACA 


TTCACGATAT 


TTATACTTTA 


GAAATTAAGC 


AACGTGATGC 


CCACATGAGA 


840 


GCGTTACAGT 


CGCAAATTAA 


TCCTCATTTT 


CTTTATAATA 


CATTGGAATA 


TATTCGCATG 


900 


TATGCCTTGA 


GTCGGCAACA 


AGAAGAACTA 


GCTGATGTTG 


TCTATGCTTT 


TTCAACGTTA 


960 


CTTAGGAATA 


ATATTAACCA 


AGAAAAAACC 


ACGACCTTGG 


CAGAGGAAAT 


CTCATTTTGC 


1020 


GAAAAGTATG 


TGTATTTGTA 


TCAAATGCGT 


TATCCCGATC 


AATTTGCCTA 


TAAGTTCGAA 


1080 


ATAGAAGAAA 


CAATTGCTGA 


TGTTGAAATT 


CCTAAGTTCA 


TCATTCAGCC 


GCTCGTCGAA 


1140 


AATTATTTTG 


TGCATGGTAT 


TGACTATCAA 


CGACAAGACA 


ATGCTATCAA 


GGTCCACGCC 


1200 


TACCGAGAAG 


GCGAAAAAAT 


TATTGTGGCT 


GTTGTGGATA ATgGAAAAGG 


AATCACCGCC 


1260 


AATCGGTTAG 


AGGAAATTCG 


CGAACGTTTA 


AATCAAACAG 


AAATAGACAC 


GGAACAATCC 


1320 


ATTGGGTTAA 


GAAATGTTCA 


TGAACGTTTA 


CAACGCTTTT 


TTGGGGAGAG 


TTATGGTTTA 


1380 


ACCATTGAAG 


GGAAAGAAGG 


AGAAGGGACG 


ACAATTCGTC 


TTTCGTTTGT 


AGCATAAGGG 


1440 


GTGAAAAGAT 


GTATCGAGTG 


ATGTTTGTTG 


ATGATGAATA 


TATGATTTTA 


GAAGGCTTAA 


1500 


AATGGATTAT 


TCCTTGGCAA 


GAGTTAGGCT 


TTGAAATTGT 


TAAAACAGCG 


CGTAGTGCGC 


1560 


AAGAAGCATT 


GGCTTTTTTa 


GgAaaCAGAA 


TCCATTGATG 


TTTTATTaAC 


AGATATTACG 


1620 


ATGCCAGAAA TgAGCGGGAT 


CgAaTTAaTg 


AGCAgCA 






1657 



(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 



GCTAGACTCG GAGTCGCCTC CAACATCATA ACACTCTAAA AGAAACATCT ATAAAGTGAT 60

CGGTTTATTA ATTATTTAAC AAATTCGATA ACTACCATTG GTGCTGCATC GCCGCGTCTT 120

GGTTCTGTCT TCAAGATACG AGTGTAACCA CCTTGGCGTT CCGCGAAACG AGGTCCTAAA 180

TCGTTAAATA GCTTTTGTAA AGCTGATTCA ACAACGATAG ACTCATCTTC TTCACGAACA 240

CTAGCAACTT CATTACGAAC GAAAGTAGCT GCTTGACGAC GTGCATGAAG ATCTCCACGT 300

TTACCTAAAG TAATCATTTT TTCTACAGTT GAACGTACTT CTTTCGCACG AGCTTCAGTA 360

GTTACGATGC GTTCGTTGAT GATTAAATCA GTTGTAATAT CACGTAGCAT AGCTTTACGT 420

TGGCTTGATG TGCGTCCTAA TTTACGGTAA CTCACGTTGG GTTCCCTCCT TCGTTTCTGT 480

TCTCACGAGC CACTGCTGTT AAGCAGGACT GGAAACTTCG TAAAAGTCTT AATCGTCCTT 540

ACGTAATCCT AACCCTAGAT CATGCAATTT AAGTTTTACT TCTTCAAGAG ATTTACGACC 600

TAAATTACGT ACTTTGATCA TTTCTGGTTC AGATTTGTTT GTAAGTTCTT GTACAGTATT 660

AATGCCTGCA CGTTTTAAAC AGTTATATGA ACGAACTGAT AAATCAAGTT CTTCAATCGT 720

CATTTCTAGC ATTTTCTCTT TTTGAGTTTC TTCTTTTTCC ACCATGATTT CAGCGTTTTT 780

CGCTTCGTCA GTAAGGTTTA CAAAGATATC TAAATGCTCA GTCATGATTT TCGCAGCTAA 840

ACTTAGTGCT TCTTGAGGAA TGATTGAACC ATCTGTCCAT ATTTCCATTG TTAATTTGTC 900

GAAATCATCA CGACGACCAA CACGTGTATT TTCTACTTGG TAGTTAACAC GAAGAACGGG 960

TGTGTAGATG GAATCAACTG GAAGTACACC GATTGGcATA TCTTCTTTTT TGTwTTCGTC 1020

TGCTTGAACA TAGCCACGAC CAGGTTTCAC TGTTAAGCGA GCATGGAAAG TAGCTCCTTC 1080

AGCAACACTA CAGATAACTA AATCTTTATT TAGGaTCTCA ACATCACTGk CCaCGATAAt 1140

ATCGCCGGCA GTCmCGGkAG CGGGtCCTGT AAwATCGATC TCaAGGGTTT CtCyTCyTcm 1200

GCATAwAATT cmATGCTAgC cTTaATAtTC aAAATAATTT GTGTTACGTC TTCTCGTACA 1260

CCTGGAATGG TTGAGAATTC GTGTAAAACA CCATCGATTT gAATATTCGT AATTGCAGCT 1320

CCTGGTAAaG mAGACAATAA AATACGmCGT AGAGAGTTAC CTAAAGTAGT TCCGTAACCG 1380

CGTTCTAATG GTTCAACGAC GAACTTACCA TAATCTCTAT TyTCATCAAT TTTTTCGATT 1440

CTTGGTTTTT CAAATTCAAT CATTCTTATC TATACCCCTT TCAAAACGAA AAGTGTCTTG 1500

TTCAATGACG TTCCTAATTC AAACTCAGAA TGAGTGGCGC TCATTAAACA CGACGGCGTT 1560

TTGGAGGGCG GCATCCATTA TGAGGAACTG GAGTCACGTC ACGAATTGCA GTCACTTCTA 1620

AACCTGTTGC TTGTAATGAA CGAATTGCTG CTTCACGTCC AGAACCAGGT CCTTTAACTG 1680

TTACGTCTAC AGTTTTAAGT CCATGTTCCA TTGCCACTTT AGTTGCGGCT TCTGCAGCCA 1740

TTTGAGCGGC AAAAGGAGTT GATTTTTTGC TTCCTtTAAA GCCTAATGAT CCTGCTGATG 1800

ACCATGCTAA TGCATTACOA TGTGTATCAG TAATCATTAC GATTGTATTG TkGAATGTAG 1860

AATGGATATG TGCTACACCT GATTCTATAT TCTTTTTGAC ACGGCGTTTA CGACTAACTT 1920

TTTTTGCTGC CATGAAGTTC TAACCTCCTT CACTTAGGAA TTATTTTTTC TTGCCTGCTA 1980

CTGTTTTAGT AGGGCCTTTA CGAGTACGTG CATTATTTTT CGTGTTTTGT CCACGAGTTG 2040

GTAATCCACG ACGGTGACGG ATTCCACGGT ATGAACCGAT TTCCATCAAG CGTTTGATGT 2100

TTAAGTTCAC TTCACGACGA AGATCCCCTT CAACTTTTAA CTTATCAATT TCCGCACGAA 2160

TCGCATCTGT TTGTTCGTTT GTTAAATCAC GTACACGAAC ATCTTCAGAT ACGCCAACGT 2220

TTGCTAAAAC TTTTTTAGCT GTAGTGTTAC CAATACCATA AATGTAAGTA AGAGAAACTA 2280

CTACACGTTT GTCACGAGGG ATGTCTACTC CTGCAATACG AGCCATTATT CGTTACACCT 2340

CCTAATTATC CTTGACGTTG TTTATGTTTT GGATTTGCTG GGCAAATAAC CATAACACGT 2400

CCTTTACGGC GAATTACTTT ACAATGTTCA CACATTGGTT TTACTGATGG TCTTACTTTC 2460

ATGATAATAC CTCCTGTGGT TTTACGGAGT ACAATTATTT AAAGCGATAA GTAATACGGC 2520

CACGCGTTAA ATCATACGGT GATAATTCAA CTGTTACTTT GTCACCAGGT AAGATACGGA 2580

TGTAGTGCAT ACGGATTTTA CCTGAAACAG TAGCAAGAAC TTGGTGTCCA TTTTCTAGTT 2640

CGACTTTAAA CATTGCATTC GGCAAAGTTT CGACGACTGT ACCTTCGACT TCAATCATAT 2700

CTTCTTTTGC CACGCACAGT ACCTCCTTGT ACTTGTTTCG CGTTTTCACA CGATAAAACG 2760

GCGGAAACTC CAGCTCCCAC CTAGAAATTC TCAACTAACT TCGTTATTCA TTTTGATGTC 2820

ACAAAACTGT AGAAGTCGGA CTGATATATT CTACCATATT TAGTTATGTT TTGGCAAGAG 2880

AAACTGAATA CAACaACAGG GTTAAACTGC GGGCAATTGG AAATTAGTTT TCGTCAATGA 2940

TTTTTTTCAC ATCAGAAAAA ACAGTATCGA TTTCGCGATT GCCATCGATT GTATGCATCA 3000

AGCCTTGTTC TTTATAGAAA GCTAAGATTG GTGCGCTACT TTCAATGTTG ACTGCCAAAC 3060

GATTTTTAAC CGTTTCGGGC TTATCATCTT CTCTTTGGTA GAATTCATGT CCGCCACAGC 3120

GATCACAAGT GCCTTCAACT TTTGGCGGGT TGAATAATTT ATGATACGTT GCGCCACATG 3180

TGCGACAGAT GAATCGTCCA GCTAAACGCT CGATTAATAC ATCTTCTTCT ACATGGATGT 3240

CAATGACAGC ATCAATTTTT TTGTTTAATT CTTTAAGCAT TGTATCTAAT GCTTTTGCTT 3300

GATCCAGTGT ACGTGGGAAA CCATCTAATA GGAATCCTTT GTCTGTATCT GGTTCAGCAA 3360

GGCGTTCTTT AACAATtCCG TTTGTTACTT CGTCAGGAAC CAGTTCACCT TTATCGATAT 3420

AAGAnTTCGC TTCTAGGCCC AAGGCAGTTt CATtTTTCAT TGCTGCACGG AACATATCTC 3480

CAGTAGAAAT ATGTGGAtGC CATACGTATC AATGATTTTT TCTGCTTGGG TGCCTTTACC 3540

TGCTCCAGGT AATCCCATTA AAATGAGGTT CATATATTCT CCTCCTAATT TTAAAGACCG 3600

GCTTATGAAG CTAGCmATAT GTGAGAATGG TGCTGAGGTA AAACCTCAAC ACTCGATTCT 3660

TACTTATTGA TAAAGCCGAC ATATTGACGT TTCATCATTA ATCCTTCTAA TTGTTTCGTT 3720

GTTTCTAATG CAACCCCGAT AACGATTAGT AAACTTGTTC CACCTAAACC GATTGATTGA 3780

GGTAAGTTCC AAACCATTTG CGCAATGATT GGAAGTAAAG CAACAAGTCC TAGGAAAATT 3840

GAGCCGACAA CACTTAATCT CATTAACACG CCAGATACAT ATTCTTCTGT ACCTTTACCT 3900

GGCCGCACGC TTGGAATGTA GCTTCCTTGT TTTTGTAAGT TTTCCGCTAA TTTCTCAGGG 3960

TTTACTTGAA CAAATGCATA GAAGAACGTA AACGCAACGA TTAGGACAGT ATAGATGATT 4020

GCCCCTGGAA CTGTGTTATA ACTGAAGATT TTTGTCATAA TGTCATACCA ATTTTCACCA 4080

GCGAATTTTG ATGAGAAAGC TTGTAAAATG GCATTTGGTG TTGCAATTAA CGAGCTGGCA 4140

AAGATAACTG GAATAACCCC AGCAGCATTT ACTTTTAACG GTAAATAACT ACTTGTTGGT 4200

GCACCAGAAA CACGTTTTGT ATATTGGATT GGAATCTTAC GTTCTGCTTG TTGGAAGAAC 4260

GTTACGACTG TAACAATCAC CAAAATAGCA ATAACTAAGA TTGCAATGAA AATAACAGAT 4320

TGCCAAATAC GAGAAGACTC GATATTGACG AAGTAATCTT CATAGATTTC TTTGACTGCT 4380

CCTGGCAAAC GGGAAATAAT CCCGGCAAAG ATAATCATTG AAACACCATT TCCGATTCCT 4440

TTTTCTGTAA TTTGTTCACC CATCCAAGTC ACAAACATCG TCCCAGCAGT TAAAATAACT 4500

GCAATCATCA CAAAGGTACC TAAGGTTGGA TTGTTGACAA TCCCAGTTTG ACTTAAGCTA 4560

TTAAAACCAG CAGTGATCCC CATTGACTGA GCCACACCCA AGACAATCGT TAGATATCTT 4620

GTTGCTTGAT TTAATTTCTT ACGCCCAACT TCCCCTTGTT TTGACCATTC TACAAATCTA 4680

GGTACAATAT CCATTTGCAA TAGTTGAATA ATAATAGAGG CTGTAATGTA TGGCGAAACC 4740

CCCATCGAGA AGATAGAGAA GTTTTGCATG GCACTACCAC TCACCATATT CAACATATTC 4800

AAAAAGGGTA AGCTACTTAA ATCAGACAAT CCCTTTGCAT TCACCCCGGG CACAGTAATG 4860

TGCGCACCTA GGCGAAATAC AAACAAGATT AAAACTGTAA ATAAGATTTT TGATCTAATG 4920

TCTTTGACTT TAAAGGCGTT CTTTAATAGC TTGAACAATT AGATCACCTC GATTGAACCA 4980

CCAGCAGCTT CGATAGCTTC TTGTGCTGAT TTCGAGAATT TAGCTGCTTT CACAGTTAAT 5040

TTTTTCGTTA ACTCACCATC TGCAAGAACT TTGATTCCCG CTTTTTCGTT TTTAACGATT 5100

CCTGCTTCTT TTAGGACAAC AGGAGTTACT TCTGTTCCAT CTTCAAAACG GTTTAAAGTA 5160

TCTAAGTTTA CAACAGCGTA ATCTTTACGA TTAATGTTTG TAAACCCACG TTTTGGTAAA 5220

CGACGGAATA ATGGAGTTTG ACCCCCTTCA AATCCTAAAC GAACACCACC GCCTGAACGG 5280

GCTTTTTGCC CTTTTTGTCC ACGGCCAGCT GTTTTACCAT TACCAGATGA AGTACCACGT 5340

CCAACACGGT TACGTACTTG GCGTGAGCCC TCAGCAGGTT TTAATTCATG AAGTTTCATA 5400

GGTTTGGCAC CTCCTTAAGC AATCTTCTAT TACAAATCTA TAAATCTTAA ACTTCTTCTA 5460

CGTCCACTAA ATGTGAAATA GTGTTAACCA TACCTTTGAT TGCTTCATTG GCAGGTTTTG 5520

TAACAGTACT ATTCACTTTA CCTAAACCTA ACGCTTTAAC TGTAGCACGT TGGTTTTGAG 5580

GACGTCCGAT AACAcTGCGT TTTAAAGTAA TTTTTAATTC AGCCATTATT TTTGTCCTCC 5640

TTAACCGATA ATTTCTTCTA CAGATTTACC GCGAAGTGCT GCCACTTCTT CGGCACGTTT 5700

TAATTGTTTT AGACCTTCAA CAGTTGCGCG AACAACGTTA ATAGGTGTGT TTGAACCTAG 5760

TGATTTAGAT GTAATATCTG CTACACCAGC TAATTCCAAT ACGGCACGAA CTGGTCCACC 5820

AGCAGCTACC CCAGAACCTT CTACTGCAGG TTTCATAAGG ATACGGCCGC CACCGAATAC 5880

GCCAATTACT TCGTGCGGGA TAGTAGAACC AACCATAGGC ACTTCAACTA AGTTTTTCTT 5940

CGCGTCTTCG ATTGCTTTAC GGATAGCTTC AGGTACTTCT TGTGCTTTAC CAGTACCGAA 6000

ACCAACGTGT CCGTTTTTGT CACCGACAAC AACTAAAGCT GCGAAACGTA AACGACGTCC 6060

ACCTTTAACA ACTTTTGTTA CACGGTTGAT CGCTACAACG CGGTCTTCTA ATTCCAAATG 6120

TTTTGGATCA ATATAAACCA TGAATGGTGT TCCTCCTTCT CCTAAAATTC TAGTCCATTT 6180

TCACGAGCAG CTTCAGCTAA AGCTTGCACG CGGCCATGGT AAAGGTATCC ACCACGGTCG 6240

AAGACTACTT TTTTGATGCC TTTTTCAGCG GCACGTTCAG CAACTAGTTT ACCAACAGCT 6300

GCAGCTGTTT CTGTTTTTGT TCCACCTGAA ATTTCTTTAT CCAAGGCAGA GGCACTTGCT 6360

AGCGTCACAC CCGCTACGTC ATCAATGACT TGCGCGTAGA TGTTTTTGTT AGAACGGAAA 6420

ATGTTCAAGC GTGGGCACTC AGCAGTACCA GAGATTTTGT TACGTACACG ACGATGTCTC 6480

TTTTGACGTG TTTTGTTTTT ATCTGGTTTT GTAATCACAA TTTTCACCTC TTTGATTTAA 6540

TGTTTACTAC TAATATTTCA AGCAGTCAGC AGATTACGGA AAAGAGAAGC TGGCAGACAA 6600

GACAAAAGCA TTTTATTGGA AAAATCCCAT AAAATGACAG TTGCTTGTTG TTGACTTCTG 6660

TTTTCTACAC AGCTGTTATT ATTTACCAGT TTTACCTTCT TTACGGCGTA CAAATTCACC 6720

AACATAGCGA ATCCCTTTGC CTTTATAAGG TTCTGGAGGA CGAACGCCAC GGATGTTAGC 6780

AGCTAATTCA CCAACAACTT CTTTGTTGAT GCCTTTAACG ATAACTTGTG TGTTCGCAGG 6840

TACTTCAAAT GTTACGCCTT CTGGTGCTGT CATTTCGACA GGATGAGAAT AACCAACGTT 6900

TAAAGTTAAT TTGTTGCCTT GTACTTGGGC ACGGTATCCG ACCCCGATAA GTTCAAGTGC 6960

TTTTTGGAAA CCTTCAGAAA CACCAACAAC CATGTTGTTG AAGTTTGCAC GAGTTGTTCC 7020

GTGGATTGTT TTCATTTCTT TTGAATCGTT TGGACGAATG AATGTCACTT CGTTTCCTTC 7080

GATGTTCATT TTGATATCAG AAGAAAATTC GCGTGTTAAT TCACCTTTAG GTCCTTTAAC 7140

TGTAATGTTG TTTCCGTCTT GCTTGATTTC AACACCAGCA GGAAGAACAA CAACTTTATT 7200

ACCAATACGG CTCACTTAGA GACACCTCCT TGTGTATTTT TTAGAATTAC CATACGTAGG 7260

CGATTACTTC GCCGCCGATA TTTTTCGCTC TAGCTTCTTT GTCAGTAATT ACACCCTCTG 7320

ATGTAGAGAT GATTGCAATT CCTAGACCGT TTAATACTTT TGGTACTTCG TCAGCTTTTA 7380

CGTATGCACG TAAACCTGGT TTAGAAATAC GTTTTAAGTT TGTGATAACA CGTTCTTCGT 7440

TTTTACCATA TTTTAGGAAA ACACGGATTA CGCCTTGTTT GTCATCTTCG ATGTATTCTA 7500

CGTCGCGGAT AAATCCTTCA CGTTTCAGGA TTTCTGCGAT ATCACGTTTG ATTTTTGAAG 7560

CAGGTACTTC TAAAGTTTCA TGTTTAACCA TGTTCGCATT ACGAATGCGA GTTAGAAAAT 7620

CTGCAATTGG ATCTGTCATG ACCATTGAAC TATTTACCTC CTTTATGCGA GTTTACTTGT 7680

TTACCAGCTA GCTTTCTTCA CGCCGGGAAT TTGACCTTTA TAGGCAAGTT CGCGGAAGCA 7740

AATACGGCAA AGATGAAATT TACGATAAAC TGAATGTGGA CGTCCACAAC GTTCACAACG 7800

AGTATAAGCT TGTGTTGAAT GTTTAGCAGG ACGTTTGTTT TTAGCAATCA TTGATTTTTT 7860

AGCCACGTAG TTCGCCTCCT TATATTATTT TTGGAATGGC ATACCTAATT GTGCCAACAA 7920

TTCGCGAGAC TCTTCGTCAG TGTTCGCTGT TGTAACAATT ACGATGTCCA TACCGCGTAC 7980

TTTATCTACT AAATCATAAT CAACTTCTGG GAAGATTAAT TGTTCTTTAA TACCTAAAGT 8040

ATAGTTACCA CGACCATCGA AAGCTTTTTT GCTGACACCG TGGAAGTCAC GTACACGAGG 8100

TAAAGAAACT GTTACTAATT TATCTAAAAA TTCGTACATT CTTTCGCCGC GTAGnnTTAC 8160

TTTCGCACCG ATTGGCATTC CTTCACGTAA ACGGAATCCA GCGATAGATT TTTTAGCTTT 8220

TGTGATAAGT GGTTTTTGAC CAGTGATTAA TGCTAATTCT TCAAcAGCTT TGTCTAAGTT 8280

111 nb^n 111 CATGTTGATA /i lull un x ACGATTTTAT CAACTTTAGG 8340

TGTTTGCATA ACTGAGCTGT AATTAAATTT TTCCACCAAT GATGGTGTTA CTTCTTTAAT 8400

ATATTTTTCT TTCAGGCGGT TCATTCAGTA AGCCCTCCTT CCGCAATTTT TATTTATCTA 8460

AAACTTCACC TGTTTTTTTA GAAACGCGGA CTTTTTTGCC GTCGACTTCT TTAAACGCAA 8520

CTTTAGTAGC TTCTCCGTTT GAAGGATCAA TCACCATTAC ATTAGAAACA TGAATTGGCG 8580

CTTCCACTTC AAGGATTCCA CCTTGTGGGG CTGCTTGGTT TGGTTTTTGG TGTTTTTTAA 8640

CGACGTTGAC ACCTTCAACG ATAACTTTAT CTTGTTTAGG AAACGCTGCT AAGACAACGC 8700

CTTCTTTGTT TTTATCTTTA CCAGTGATAA CTTTTACTTT ATCGCCTTTT TTAACAAACA 8760

TTACTGTTTC GCACCTCCTT TAACGTGGGC TCGAGATTAT AATACTTCTG GTGCTAGAGA 8820

AACGATCTTC ATGAAGTkGT TTTCACGTAA TTCACGTGCA ACTGGTCCGA AgAwACGTGT 8880

TCCGCGTGGG CTTTTATCGT CACGGATAAT TACCGCAGCA TTTTCATCAA ATTTGATGTA 8940

AGAACCGTCA GCACGACGAG CGCCAGATTT TGTACGAACG ATAACGGCTT TAACTACTTC 9000

ACCTTTTTTA ACAACCCCAC CTGGCGTTGC TTGTTTGACA GTAGCAACAA TAACGTCACC 9060

GATGTTAGCA GTTTTACGAC CAGATCCGCC TAGGACTTTG ATCGTTAAAA TTTCACGTGC 9120

GCCAGAATTG TCAGCGACTC TTAAACGACT TTCTTGTTGG ATCACGATGT GTATCCCCCT 9180

TTCAGATTTT GTGCTTCAAT CTATTAGGAA ATTCTCGTTT TATTAGATAA TAACTGCTTC 9240

TTCGACTACC TCTAGTAAAC GGAAACGTTT TGTAGCTGAT AATGGACGAG TTTCCATGAT 9300

TTTTACGATG TCTCCAACTT TCGCTGTGTT GTTTTCATCA TGCGCTTTAT ATTTTTTAGA 9360

ATATTTCATA CGTTTGCCAT AGATAGGGTG GTTTTTCTTT GTTTCAACGA CAACTGTGAT 9420

TGTTTTATCC ATTTTGTCTG ATACCACGCG ACCTTGGTAA ACTTTACGTT GATTTCTTTC 9480

TTCAGTCATA CGCGTTTGGC CTCCTTCCAC TATTAGTTAG CTTGTTCACG CAATACTGTT 9540

TTGATGCGTG CAATCGATTG ACGTACTTCT TTAATACGTG CAGTGTTTTC TAATTGACCT 9600

GTTGCTAATT GGAATCTTAA GTTGAACAAT TCTTCTTTTA ATTGTTTTTC TTTATCTAGC 9660

ATTTCGGCAG TGGTTAATTC TCTGATTTCT TTAACCTTCA TTCGATTCAC CACCCATTTC 9720

CTCACGTTTT ACGATCTTAG TTTTTACCGG CAATTTGTGA GATGCAAGAC GAAGAGCTTC 9780

ACGAGCTACT TCTTCAGGAA CGCCTGCGAT TTCAAACATG ATTTTACCAC GTTTAACTGG 9840

TGATACCCAG CCTTCTGGTG CCCCTTTACC TTTACCCATA CGTACGCCGA TAGCTTTACT 9900

TGTGTAAGAC TTGTGAGGGA AAATTTTAAT CCATACTTTC CCGCCACGTT TCATGTAACG 9960

AGTCATTGCA ATACGGGCTG CTTCGATTTG ACGGTTAGTA ATCCAGTGAG ATTCAGTTGC 10020

TTGTAAACCC CATTCACCAA ATGCTACTTC TTTTCCGCCT TTAGCTTCAC CGCGCATTTT 10080

TCCGCGGAAT TCACGACGGT GTTTTACACG TTTAGGTACT AACATGATTA TTTCCCTCCT 10140

TTCTCAGTGT TTTTTTTCGT TGGAAGAATT TCTCCACGAT AGATCCACAC TTTAACTCCT 10200

AGTTTTCCGT AAGTAGTGTC CGCTTCTTCC CATGCGTAAT CAATGTCCGC ACGTAATGTG 10260

TGAAGTGGTA CAGTTCCTTC AGAGTATCCT TCTGAACGAG CGATATCCGC ACCGTTTAGA 10320

CGACCTGATA CTTGAGTTTT GATTCCTTTA GCGCCAGCAC GCATTGCGCG TTGGATGGCT 10380

TGTTTTTGAG CACGACGGAA AGCAACACGG TTTTCTAATT GACGTGCAAT TCCTTCGCCT 10440

AGTAATTTTG CATCTAAATC TGGTTTTTTG ATTTCAACGA TGTTGATGTG AACTCGTTTG 10500

CCAGTTAATT TGTTTAATTC TTTTCTTAGG TTTTCGACTT CAGATCCGCC TTTACCGATA 10560

ACCATACCTG GTTTAGCTGT GTGAATTGAA ATGTTAACAC GGTTTGCAGC GCGTTCGATT 10620

TCAATTGTAG ATACAGCAGC ATCAGCAAGT TTTGTTGCGA TAAATTTACG GATTCTTAAA 10680

TCTTCGTGTA AGAATTCGGC ATACTCTTTT TCAGCATACC ATTTCGCATC CCAGTCGCGG 10740

ATGATGCCTA CACGCATTCC AATTGGATGT ACTTTTTGAC CCACTGATTG TCCCTCCTCT 10800

TAAATGTTTA AGCTCTGTTT TCATAAAGAA AACAGCCTAT TTCATGAGAC TTATTTCTCT 10860

GTTACAACTA CTGTGATGTG ACTAGTACGT TTGTTGATTG GTGAAGCTGA ACCTTTTGCA 10920

CGAGGACGGA AACGTTTCAT TGTTGGTCCT TCGTTAACAA ATGCTTCAGA TACGACTAAG 10980

CTTTCAACGT CTAAGTCAAA GTTGTTTTCT GCGTTAgCAA CTGCTGACAT TAAAACTTTT 11040

TCAATGATTC CAGCAGACTT GTTCGGTGTG AATTTCAAGA TTGAAATTGC ATCCGCAACG 11100

CTTTTCCCTC TGATAAGATC GATTACTAAA CGTGCTTTAC GAGGTGAAGT GCGAACTGTT 11160

TTTGCAGTTG CTTTAGCTGA TGTAATTTGT TCTGACATTT GCATATCCTC CCCTCAAAAT 11220

TAGCGTTTAG TTTTTTTATC GTCAGCAACG TGGCCACGAT AAGTTCTAGT TGGTGCAAAT 11280

TCACCTAATT TATGTCCTAC CATGTCTTCT TGAATGTAAA CAGGGACATG TTTCCGTCCA 11340

TCATATACAG CGATTGTGAA ACCTACGAAT GATGGAAAAA TTGTAGAGCG GCGAGACCAA 11400

GTTTTAATTA CTTTTTTCTT TTCGGCACCT TGTTGTGCTT CGACCTTTTT CATCAAATGA 11460

TCATCGACGA AAGGTCCTTT TTTTAAACTA CGACCCATGG TGAACCTCCT CTCAAAATGT 11520

ACGACACATG GTCAAAAAAA TTATTTAGTA CGACGACGAA CGATAAGTTT GTCTGATTTA 11580

GCTTTTTTAT TACGTGTTTT GTATCCAATT GCTGGTTGAC CCCATGGAGA TACTGGCGCT 11640

TTACGTCCAA TTGGTGTTTT ACCTTCACCA CCACCGTGTG GGTGATCGTT AGGGTTCATT 11700

ACGCTACCAC GAACAGTTGG GCGTTTACGC ATCCAACGAG AGCGGCCAGC TTTACCAATG 11760

TTGATTAATT CGTGTTGTTC GTTACCAACT GAACCGATTG TTGCACGGCA AGTTGCTAAG 11820

ATCATACGAA CTTCGCCAGA GTTTAAGCGG ATTAATACGT ATTTGCCTTC TTTACCAAGT 11880

ACTTGCGCGC TTGTTCCAGC TGAACGGATT AATTGTCCAC CTTTACCAGG TTTCATTTCA 11940

ATGTTGTGGA TAACTGTACC AACTGGAATG TTTTCCAATG GTAATGCGTT ACCTACTTTA 12000

ATATCTGCTT CTGGAGCAGA TACTAGGCGC ATGCCTACTT CTAATCCTTT TGGTGCTAAG 12060

ATGTATGCCT TAACCCCGTC TTCGTAATGA ACTAACGCGA TGTTAGCAGA ACGGTTTGGA 12120

TCGTACTCGA TCGTTTTAAC AACCGCTGCG ACGTTATCTT TATTACGTTT GAAGTCAATC 12180

ACACGGTATT GACGTTTGTG ACCGCCACCT TGGTGACGAA CCGTAATGCG ACCGTTGTTG 12240

TTACGACCGG CATTGTTTTT TAATGGCTGT AACAATGATT TTTCTGGTGT TGAAGTTGTG 12300

ATTTCAGCGA AATCAGAACT CGTCATGTTA CGACGGCCAT TTGTGGTAGG TTTGTACTTT 12360

TTAATCGCCA CGTCTTATTT CCCTCCTACG TTGATATTGT TCAGGACTGC TTATTCAGCA 12420

GCTTCGAATA ATTGAATTTC TTTTGAATCT TCAGTTAATG TAACGATCGC TTTGCGACGT 12480

TTTTTTGTAT AACCTGCATA TTTACCCATG CGTTTGAATT TTGGACGCAC GTTTAAGATG 12540

TTTACGTTTG CTACTTTAAC ATCGAATGCT GATTCAACAG CTTGTTTTAC TAAAGTTTTG 12600

TTTGCGCGAG TGTCCACTTC GAAAGTATAT TTCTTTTCGT CCATAGCTAA CATGGATTTT 12660

TCAGTGATCA CTGGGCGTTT GATTACGTCT AGTAATTCCA TTATGCAAGC ACCTCCTCAA 12720

TTTGAGTAAG AGCTGTTTGT GTTGCCAAGA CTTTGTTAGC TGACACAACA TCTAACACAC 12780

TAACGTTATC AGAAGTTACT ACAGAAACGT TTGGTAAGTT ACGAGCTGAT AAAGCTGCAA 12840

AGTCGTTACC ATTTTCTAAA ACAACTAATA CTTTAGTATC GATAGACAAG TTAGCAAGAA 12900

CTTGTTTGAA TTCTTTTGTT TTTGGTGCAT CGAAGCTTAA ACCTTCTACT GCTACCAAGT 12960

TATTTTCAGC AACTTTGTCT GATAATACAG ATTTCATTGC TAAGCGACGA ACTTTTTTAG 13020

GAAATTTGTA GCTGTATGAA CGCGGTGTTG GTCCAAAGAC TACGCCACCT CCACGCCATT 13080

GTGGTGAACG GATTGAACCT TGACGAGCAC GACCAGTTCC TTTTTGACGC CATGGTTTGC 13140

GGCCACCGCC ACGAACAGCG CTACGGTTTT TAACTGCGTG TGTTCCTTGT CTTAATGAAG 13200

CACGTTGCAT GATGATTGCA TCATAGACAA CACTTTCATT AGGTTCGATT CCGAAGATTT 13260

CTTCATTTAA AGTGATTTCA CCGTTTTGAG TTCCATCTTG TTTGAATAAT GCTACATTCG 13320

GCATTCCTTA GTTCcTCCTT TCCcTACACT TATTTAGCTT TCACaGCTGA TTTGATTGTG 13380

ATTAAaGATT TTTTCGCTCC aGGAATGTTT CcTTTGaTTA AGATAACATT TCTTTCTAgT 13440

CCATTTAACA ATTCaAGGTT TTGaATtGTT TACGCGkTCG CCACCCATAc GgCCGGCTAG 13500

TCGTTTATTT TTAAATACAC GGTTAGGCGC TACTGGACCC ATTGACCCAG GACGACGGTG 13560

ATAACGAGAA CCATGAGACA TAGGTCCGCG GCTTTGGCCG TGACGTTTGA TTGCCCCTTG 13620

GAATCCTTTA CCTTTCGTAG TACCTGTAAC ATCAACAACG TCTCCTGCTT GGAAAACATC 13680

TACTTTGATT TCTTTACCTA CTTCATATTC TCCtAGCTCA ACATTCTTGA ATTCTTTAAT 13740

GAAGCGCTTA GGAGCCGTGT TTGCTTTTGC AACATGACCT yTCGCAGGTT TGTyTGATAA 13800

AACTTCACGT TTGTCTTGGT AACCAACTTG GATAGCTTCG TATCCGTCAG TTTCAACAGT 13860

TTTTaCTTGt AAAACTACGT TTGGCGTAGC TTCaACTACT GTTACTGGAA TTAATTCACC 13920

AGACTCAGTA AAGATTTGTG TCATTCCCAC TTTTTTCCCT AAGATTCCTT TGGTCATGAG 13980

TACACCTCCA TCTTAATTTT ATAGTTTGAT TTCAATATTA ACACCAGACG GTAAGTCAAG 14040

CTTCATTAGA GCGTCAACTG TTTTTGGTGT TGGGTTCACA ATGTCAATTA GACGTTTGTG 14100

AGTACGCATT TCGAATTGTT CGCGAGAATC TTTGTATTTA TGAGTCGCAC GAATAACTGT 14160

GTAAAGTGAG CGTTCTGTTG GTAATGGAAT CGGACCTGAT ACGTCAGCTC CAGTTCTTTT 14220

TGCTGTTTCC ACAATTTTAT CCGCTGATTG ATCTAAAATA CGGTGTTCAT ACGCTTTTAA 14280

ACGGATACGA ATTTTTTGTT TTGCCATCTT GTTCCCTCCT TCGCCTATTT TAAAAGTAGA 14340

CATAGCTCCA CGAAAATTTA TCCGGCATGC TCGTTCATGG CAAAGCGTCC GAGCGTGTCG 14400

CAACCTCTCG CTTCACAGCC GGCAAATCAA ATCGTTGATC TACCAAnGCT TTTTACACTC 14460

CTGTAAACAG CACCTTTTTG ATTATACTAT GAAAGGATAG TGTTAGCAAG GATTTTCTGC 14520

GTTTTTTGAA AGAATTTGnC nnGTTTAGAA AAGCATT 14557
(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 



CCCCCCGGAG TGTAAAAGTA ACTATGATGC ATACTTTGGA TAAAATTcTT CCAAAATAAA 60

mCCGTkGGCC CATkGAGACA AAGCGCTCcA AGAAACTGGC TTTATTGGCT AGATTTATCG 120

AACCATGAAC GACATAAaCT ACTTTTTAAc TGCTGTTTGT GCGGTAGAAC ATCCAGATAT 180

TGCCAGCGGC CTAATAATTC TTCTTCAGCT TCTAATCCTT GACGATCCGT TTGAATTTGT 240

TTCAAAGATT CTTGAATATT CTTTAATTCC GTCAAATAGC TTTGTAATGT TTCTTCATCA 300

AAGGTTTGTy CCAACTCTTC TAAGGTATAG ACTTTTCTTT TTATCTTTAA CCCTTTTCCT 360

GAAGAGCGTG TCAAAAATAA AAGAAGCTCT TGCAATTGTT GTAGCATCGT TTCATATTTA 420

CGTAGCTGTT CAGTTGACTG TTCTGCTTGT AAAAAAGAAT ATTGACTCTT TAAGGCTTGT 480

TCTTCTTCAG GCGAATGAAA AATCCGTTTT ACTTCGACAA TTTGCAAGCC TTGAATGGCT 540

TGTAGAATTG CTGCTTCTTT TTCGGCAGCG GCAATTATTG TCATTTTTTC CATTTTACTA 600

ACTGCCATAA ACGTCTTTCA CCCTCTCAAT AATTGAATCA ATGACTTGTT CTTTGTTCGC 660

TTCATAACGT TCGTGAAGCG CTTGTTCAGT GCGCTTTGTT TCATCTAAAA GTTGTTCTTT 720

TTCAAGTTGT AACATCTTTT CTTGCGTTGC TAGTGAATCC GTCAGCACTT TTTCAACAGC 780

TTCTTCACTC TCTTGACGTA GTTCAGCTAA ACGTTCTTGC TTCTGTTCTT CATATAAACG 840

ACAATTCATC TTAGCTTGTT GCCGATGCTC TTCCACCGCA TCTTCCGCTG CCTTGATTTG 900

CTCTAGTACT TCTTGTACCA TTGTCTCCAC CTCCTTAGTT TTTACACCTT TAGCATATAA 960

GAAAAAGGAC TAGCGAAATA GAGATACGCT CAGCCCTTTT GTTTTAGGTA TGCAAAAAGG 1020

CGCAAACTCA AAAATTGTTC TTCTCACCCC TTGAAGAACT GAATCCTTGA AGCTTTCGCC 1080

CAACCATTCT TTATTCAATA CTCTACCATC TTAGCAAAGT TTTTCGCAAA ATTCAATACT 1140

TACCCAAATG AATTAAAATA ACAAATTTTC ACTTTTATAA TAGGTAATAA AAAGGCATTA 1200

TTTTTTCGCT TTATAATTAT TGATCTTGAA TTTTTTTCAC ATAGATGTGC TATACTTACG 1260

TAAACAATTA AATTTATTTr GAGGGATCTC TGATGAATAT ACGATATATT TCAATCTCTG 1320

GTAATACACG CTCATTTGTT CAACGTTTAA CAACTTATTC TGAAGAGCAA CATCAGCACA 1380

ACGAAAAAAA TCCTACGATT ACTTTCAAAG AAATTTCGGA AAATTCTCCG TTAGAAGTAG 1440

AAACAGAACC ATTTTTTACC TTTGTGCCTA CCTATTTAGA TGGTGGCGAT GGCATCAACA 1500

ATGGCAATAC CGAAATTTTA ACAGAAGTCA TGCGAGAATA TCTTGCTTTT GAAAATAATT 1560

ATCGTTATTG CTCTGGCGTG GTGGGCAGCG GCAATAAAAA CTTTAACCAC CAATACTGCC 1620

TGACAGCTAA ACAGTATGCG GAACAATTTA AGTTCCCCTT CTTAGCTGAT TATGAATTAA 1680

GAGGTACCCA AGCGGATATT GAACGAATTT ATGCTATTTT AAAGGAAAAC CAATAAGCAA 1740 

GTTTCATTAC TAAAAAGCTC ACCTATGATA GACTACATTT TGAGGTGAAG AAAAATGAGA 1800 

AATACATGGT TAGCAGAACA ACTTCAGTCA ATCAGTGAAG AACCCAATTC TTTTATTATC 1860 

GAGGAAACCA TCaAATATAT TGAACAATTA GAAGATGATA ATGAAAGTCT TCAAGTTGCA 1920 

TTGGAAGGGA CTATTTGGGA GTCCTAAAAA TGGGACGACC ATTGGGAAAA TATTTAACnG 1980 

A 1981 
(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

TCATAGGGCC TATTTTTTGT CAAGGAATCT TCTTATCTTT TTTGTTTAAT CTATCAAATT 60 

AAAATATTTT ATTTTTTAAT ATAAAATTAT TTCAATAATT ATTGAACATT TCAAATTGAA 120 

TTGATTTTTT CAGACGTAAC TTTAGGcTTG AGCCAGAATT TTTTGCGCTT CATCTATTGT 180 

TCGACAATCA CTGAGCCGCT GAATGTTTTC GAGATCTGAA CAGAAAAAAG CTAATTCTTG 240

AAGCATGCTT AGTTGCCGTT GACTTTTTAA AGCAACGACA AATAACAAAG TAGCTATTTT 300 

TCGATCTGCT TCCGTACCAA AGTTGACCCC ATCAGGCACT TGAATAAAGC AAATCCCTTC 360 

TTTTAAAACT TGTTCATCTT GTCCTTCCGC ATGAGGTAAT GCCACGAAGT TACCGATATA 420

CACACTGACT TTTTGTTGAC GTTGATACAT TGCTTTGACA TAGTCAGGCG TGACCAACTG 480 

ATTTTTTAGC AGTAATTCAC CAGAAGCTGC AAGTGCTTCC TCCCAACTAG CATACGATTG 540

ATTCAACATT ATTTGCTCCG CTGTAATTTC AAACATCTCT CAACACCTAG CCTTTCATTT 600 

CTTCAATTAA TAATGACGAA AGCAACTGAT AAACAATGGC TtCATTTCCT GAATGAAAAA 660 

TTTCTGTATT TAAATCATTC ATAATAATCG CACCACTGAT TTTTCCTAAA ATTTTCCCGA 720 

TTGTTTCATC AATCGGTATC GGCGCTAGCA TGAGTAACAT ACGTTGTAAC TGCATCGGCT 780 

TTTTATCCAT ACCTTCAATC ATAAGTGGCT GGGCCAGATT AAAAATACAA AAAACAGGCT 840 

GTGTCACTGC TGTACTTGAG GTATGGAATA AGGCCATTTC CGTCTGAGGA ATCCCAATAG 900 

GCGCTTGCTC GTAACGTTTC ATCAATTGCT GACAGACGAT TGCACTATCG GTGATTAAAT 960 

CTGCGGAAAG TTGCGCGAGA GCTTGTTGAA CGGTTTCTGC TAAATCCGCT TGATTATTTA 1020 

AATCTGCTAT GAAAAATGTT TCTAATAGTT TACTAATTTC TTCCATGAAA GCGACCACCG 1080 

TTTCGTAGCT TTCTTCGCCA CCTATTTTTT TCACAGGCGC TTTCCTAAGC GAGCGTTTTT 1140 

CGTGACTAAT CCGCTTAAAC TCTTCTTTCA ATTGGcGGAT TTCATCGTCT AAAAGTAACG 1200 

GTGAGATCAA TTTATATTTA CCATTGtATC CtGGTAATAA CGArTGgAAA TCACTAAGTC 1260 
ATAGTTTTCC TCAACATTCA CCTTACCTAA ATCCGCAATC CGAAAAAAAT GAATCTGGTT 1320

GATAAAAGGA AAGTAGCGCC GTAATCGCAT TTCCAGCATA CTTGTCGAAG CCAAACCGCT 1380 

AGGAGAAAAA CCAGCAATAT CAACTTCCAT AATTTTGGGA CTCCGTTCTA AAGAATTGGC 1440

AAAATGAAGC ACCATGTAGG CAATTTCCTC TTCCGATAGA TTCTTTTGCG GAAAAATTTG 1500 

CGGCAACGCT TGGCGTAAAG CgCAGCTATT TCTTGATATT GATTCATGAT CCGTTCTAAA 1560 

ATGGGATTCG TTAAATCTnC TTCTTGTAAA ACCGCCCGAG AAAAAACACC TGATAAATGC 1620

GTCAACAACA TTnTATACAA ATnGGTATCT TC 1652 



(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1742 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370:



GTGATTCGAG nCGGTGCAAA nTGGnCCGAT GGTACCGnTA TTACTGCTGA TGATTTTGTA 60

ACGGCATGGC AAAGAGTGGT AGATCCCaAA ACTGCTTCTC CTAGCGTTGA GCTTTTCTCT 120

GCTATAAAAA ATGCTAAAGA AATTGCTTCA GGAAAACAAT CGAAAGAAAC GTTGGCAGTA 180

AAAAGTAATG GCAATAAAAC GATAGAAATT GAGTTAGAAA AACCCACACC ATATTTTACA 240

GATTTATTAG CATTAACAGC GTACTATCCT GTACAACAAA AAGCGATTAA AGAGTATGGG 300

AAAGACTATG GCACTTCACA AAAATCAATT GTAACAAATG GGGCATTTAA CTTAACAAGT 360

TTAGAGGGAG TGGGAACTTC TGATAAGTGG ACGATTTCTA AAAATAAGGA GTACTGGGAT 420

CAAAAAGAGG TTTCAATGGA GAAAATTAAC TTCCAGGTAG TTAAGGAAAT TAACACAGGG 480

ATTAATCTTT ATAATGATGG CCAGTTAGAC GATACCCCAG TTACAGGGGA ATATGCTAAA 540

CAGTACAAAG ATAACAAAGA ATTTTCCACT ACTTTAATGG CTAATACCAT GTATTTAGAG 600

TTGAATCAAC GGGAAAAAAA TTCTATTCTA CAAAATAAAA ACGTTAGAAA AGCCATTAGT 660

TATGCAATAG ACAGAAGTAA TTATGCTAAA AACATTTTAG ATAATGGTTC TATTTCTGCT 720

GTTGGTGTTG TTGCTAAAGA CGTTGCTTTT GATCCTAGTA CAAAAAAAGA TTTTGCTAAC 780

AAAATGTTGG TGCATTTTGA TACAGAAAAA GCGCAATCCT ATTGGAATAA AGCGAAAAAA 840

GAATTAAATA TTAAAGAACA AGTAACTTTA AACATTTTAA CCAATGAAGA AGAAACAACC 900

AAAAAAGCAG CTGAATACAT TCAAGGACAA TTAGAAGAAA ATCTAAAAGG TTTaAAAATT 960

ACGATAACAC CAGTTCCTGc AAaTGTACAA aTAGAGCGAG TTATGAAaCA TGATTTTACT 1020

ATTAGTcTAA GTGGCTGGCA GGCAGATTAT CCTGACCCTA TGAGTTTTTT AGGtAmCTTT 1080

GrAAGTwACA GkGTGTTGAa TTTTGGAGGT tATAGCmATA CTAAaTATGA TGAATACTTA 1140

AAAGATACCT CAGATAAGCG TTGGAAATCT CTAAAAAAAG CAGAAGAATT ACTTCTTGAG 1200

GATGCAGGAG TAGTTCCATT TTTACAAGTA GGTAATTCTA AATTATTAAA GTCAACGATG 1260

GAGAAATATA GAAACGCATT CGATAGGTGC AAAATATGAT TACmAAAAGA TGAAAAGACT 1320

AGAGTAGTTA CTCTAGTCTT TTGGTTATTT TCTTTTCGTT TAGtTTTaTA wGaTTAtTTT 1380 

CyTTAGtAAG tAAAGtAwCy GGtTTACyCa TAGaACCaTT ATTTTTTTTT GAGATCATTA 1440

GATAAATGtT AwCtTTCaTt AATTTTGcTA GCTCGACcAT TTcTAAGTCy CCcTAATAGA 1500 

TCGACATATT TAATgAGTTA TTTAAAGAAT CCaTGTTGTT TGTTAAAAAG TAACCAACTA 1560 

AATTATAAGT TATTAAATAA TTTAGTTTAT ACTTATATGA CGGGAATTTT TGCAATAAAT 1620 

CATTGTAGAT ATTCAGAACT TCTGTACAAC TTTCAAAATC TCGGTTTTGA ACGAGTTTAG 1680 

TAGTAATGTT TTCAAGTAAA TAGTATGCTG CTTCACTAGT AACTGTTGGA GCGTTTGAAT 1740

CT 1742
(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 



TAGTATTTCC TGTAACATTA TTTAGAAATA AATTTATTTC TAAATTAGGC AAGCTTTTTT 60

AATACAACAT GATTAACAGT AATAATCACT TTCTTTTTGC GATATATTAT CATTAAAAGG 120

ATATAGATAA TTTAAAAAAT ACCCCCCATT GTCGTTCAAG AATTTTGGAG TTTCCTAGCC 180

ATTAGTTTTC TTATATTCAA TTATTTTTCA TTGTCCTATA TGAAATTTTG CAACTATTCT 240

AAGGCTATGA AAAAAACTAA CCATCCTTAA AGAATTTATA TCGGCATTAG TAATTCTAAT 300

ATTTATTTCT TAGAGCAAAA CCACTAATTG TTAGTTAAAC TCTTTTTTTA TTATACAATG 360

CTTTTACTGA TAAAAGTTCT ATCCTCATTT TAATAAAAGA TTATTTAAGT TTCTATCTAA 420

TGAATTTTAC GGATTTGCTA ACGTATTATT AATTGAGTAA CCTTTTTCAA AATTGTCCCT 480

AATGCTGTGA AACCAATACT AATTTCAACA CAAATAACTA AAACTTTAAT TATCTTCtTT 540

TTCATTAGCT TTTATCTCGr AAACAAAAAT TATAAAACGT TTTTTAACTT ACCCTAACTA 600

TTCTTATTnT CTAAATGTTA CAGAATAGTT ACGGTArGTk AAATATAGTA CTTTCTTTCT 660

ATTTCATTGC CTGTACGCTC TTTTTTATCT ATATATTATC CATCAAATAC ACATTCATAC 720

TATCTATCAT AATTTTTATA TTCTATTCgT TGTACAATAG TTATAAGAAA TATATTCAAA 780

CAAAACGCCT TTTCTTACTT AACTATTTTA ATTTTCTAGA AAGCATAATC TTTTGTACTA 840

TCTATTTACC TTAAATAATG CAATTCTATT TATCATGGAG ACAAAAATTT AAAATATTAC 900

TTTATGCTTT TTGACTAACA CTATTATATT AAATTGAtAA CGTTTACTCT TGTAATTGAT 960

AACAAATCTC TTTAGTAGTA GAATAAACTT TACTATATAC TTGAAAGAAT TTATTATATA 1020

ACAACACATT TTTTTCAATT GGTTGATATA CTTTTTTATA TTTCACAAAC ATTTTAATAC 1080

AACTATCAAT GTTTGCACAC ATACCACAAC CAATAACTGC TAATATTGCA GCTCCCAAGC 1140

TAGGACCCTG TTCCGTTTCC AATGTAGTGA TAGGAAGGCC AAAAATATCA GCCTGTATCT 1200

GAAGCCATTG TTGATTTTTC GTTCCTCCAC CTACAGAAAT AATTTTTTTT ATTTTCAAAT 1260

TTTTTCTTTG TTCCATGATT ACTTTAGATT CTTTTAATGA AAAAGTTATT CCTTCAATAA 1320


CTGATCGTGT 


AAAATGTTTT 


AGACTATGTC 


GACTATCGAT 


ACCAATAAAA 


CTCCCCCGAA 


1380 


TTTGACTATC 


TTGATACGGA 


GTTCTTTCTC 


CTCTTATATA 


GGGAGTAAAA 


ATCAAGCCCT 


1440 


CACTTCCTAC 


TATTACTTCA 


TCAATATTCG 


CTAACAAATC 


AGAAAATTCT 


ACATCTTTAG 


1500 




TPTAAACCAT 


TTTAAAGAAT 

x x x fuuiwriri x 


TACCTGCTGC 


CAACGTTACT 


CCCATGGAGT 


1560 


AwTArGAATT 


AGGCAGAGCT 


GAATTAAATA 


AGTGGAGATT 


ACCTTGATAT 


TCGTTCTCAG 


1620 


CATCGTCTTC 


CAT TG AC AAA 


AAGACGCCAC 


TAGTACCAAT 


TGACAGCATT 


GCTACTTCAG 


1680 


AATTAACTAt 


CCCAGCTCCA 


ATAGsCGCAC 


mAGCATTATC 


AaGCTCCACC 


TGGCAAAAAT 


1740 


TCCACTTTGG 


TTCCAAGnCC 


AAACCTATTA 


ATCCACGGAC 


CTTCTCCATA 


TnACCACTTT 


1800 


AGCTGAAGnC 


CCAAATAAGG 


TTGGCAAAAC 


CCCCTAGGAA 


ATTAACTTTC 


AGGAATGTCT 


1860 



(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 



TTATTTGATT TTACCGATGG TnAnTCTTCC CACATTTACT TGGCGGTGGG AATCAAATtG 60

TTTTAGCTTT AGGCAATCAA CCGACTACGA TTTGGGCTTT AGTCGGCTTA CTGGTTTTAC 120

GTTTTGTTTT TTCAATGGTT TCTTACGGTT CGAATCTCCC TGGTGGTATT TTCTTGCCTA 180

TTCTAACTTT AGGTGCCATT ATCGGAACAT TGTATGGCAG TCTACTTGTC CAATACGTAG 240

GGATGGATCC AATTTTTGTG AAAAACTTTT TAATTTTTGC CATGGCTGGT TATTTTACTG 300

CGATTGGTAA AGCACCGTTA ACAGCGATTA TTTTAGTAAC AGAAATGGTC GGCAATTTTT 360

CTCATTTAAT GTCTTTAGGG GtTGTCGCTT TAGTTAGTTA TATTACCATC GATAGTCTTG 420

GCGGCAAGCC AATTTnATGA AAGCTTATTA GAACGTTTGG TXCCTTCTAA AGTAAGTAAT 480

ATTCGTGGCC ACAAAACGAT TaTCGAATTA CCTATCACTG CAGAAAGTTC CTTAGATGAT 540

ACGATGGTAC GTGaTTTTGT CTGGCCTAAA CAAATGTTGC ycACTTCCAT TCGGCGTGGC 600

GAATCGGAAA TTTTAACACA TGGTGATACT GTTATGCATG TTGGCGATGT GCTCATTATT 660

TTGACCGATG AAAAACTCGC TTACCAAGTC aAAAAAGAGA TTTATCAAAA AACTTTACCG 720


AATGATTTAG TGAATTAAAG AAAGCtGCTG AACCTGAGTT CAGCAGCTTT CTTTTTTTCT 780

TATGACTGTT TTTTAATTTT TTTGAATGTT TTTGACCGTT TTAGATTGAT TAGCGTTTTC 840

ATTTGTGCTa TTATATCCaT GAAATGAGGG ATGAAAATGC TTACGGAAGA ACGACATCAA 900

AAAATTTTAC AATTGTTAGA TCAAAAATCA GTCGTCAAAT CGCAAGAACT TGCTAGCTTG 960

TTCAATGCTT CAGAATCCAC AATCCGCCGG GATTTACAAG AATTAGAAGA CGCCAATTTG 1020

TTAGAACGCA TTCACGGCGG CGCTAAGCGT AtTCTTAATT TAGGCTTTGA ACAAAACATG 1080

ACGGAAAAAT CAATCAAAAA CACGCACGAA AAGCAAAAGA TTGCTTCTTT AGCCGTTAGT 1140

TGTGTACAAG ATGGCGATAT GATCTATTTA GATGCTGGTT CAACCACGTT AGAAATGATC 1200

CCTTTTTTAG TTGGGAAACA AATCCATGTA GTGACGAATT CCGTGCATCA CGCTGCAAAA 1260

CTTAGCGATA TGGAAATTCC CACTATTATG CTTGGTGGTA CTTTGAAGCT CTCCACTAAA 1320

GCAATCATTG GTGCCACTAG TATGGAACAG CTACAACATT TTCGTTTTAA CAAAGCTTTT 1380

TTAGGAACTA ACGGCGCACA TTTAGAGTTT GGTTTAACGA CGCCTGATCC CGAAGAAGCA 1440

GCGCTAAAAG CCTTGGCTAT TCATCAAGCA GAAACAGCCT ATGTTTTACT GGATCATACA 1500

AAATTTAATG AAGTAACGTT TACCAAAGTG GAAGAATTAG AGTCAGTTAC TTTACTAACT 1560

GACTACTGCC CTGCTGAATC GCTTCAAGGA TTCCAGCAAA AAACTATTTT AATGGAGGCA 1620

AACAAATGAT TTACACAGTG ACTTTAAACC CTTCTATCGA CTTTATCGTT CATGTCGATC 1680

ATCTACAAAT TGGTGATTTA AATCGGATGA CGAATGATTT TAAATTACCT GGCGGCAAAG 1740

GAATCAACGT TTCACGTATT TTAAAACGTA TGGAGACAGA ATCCACTGCC TTAGGTTTTT 1800

TAGGTGGTTT TACAGGCTCA TTCATTGCTG ATTGGTTGAA AAAAGAAGAA ATCCAAACGA 1860

ATTTCACTCC CGTGTCTGCT GATACACGAA TCAACATTAA ATTGAAATCA GATACTGAAA 1920

CAGAAATTAA CGGCTTAGGA CCTGCTCTAA CAAACGAAGA AATTCAAGAA CTAAAACAAG 1980

CAGTTAGTCG TGTTCAGGCT GGTGATATTG TTGTTCTTTC AGGAAGTACA CCTGCTTCTC 2040

TACGCAAASG ATTTTACGAA GAATTAATCC AAATTGTGAA GGAAAAAGGC GCAGAATTTG 2100

TTATTGATAC GACAGGTGAA GATTTAATGA ACGCTTTGTC ACAAAAGCCT TTATTGGTCA 2160

AGCCAAATAA TCATGAGTTG GCAGAATTAT ATCATACAAC CTTTACAAGT ATTGAAGATA 2220

TTTTACCTTA TGGACATCGG TTACTAGAAG AAGGAGCACA ACACGTAATT ATCTCAATGG 2280

CTGGTGATGG TGCATTACTT TTCACAACAG AAGGTGTTTA TCGTTCCAAC GTTTTAAAAC 2340

GTCCACTGAA AAATTCTGTC GGCGCTGGCG ACTCAATGAT TGCTGGTTTT ATTGGCAACT 2400

TTTCAAAAAC ACAAGATCCT TTAGAGGCTT TCAAATGGGG TGTCGCTTGC GGTAGCGCGA 2460

CCGCATTTTC TGATGATTTA GCCTCAGAAG ATTTTATCCA AGAACTTATT CACGAAGTAA 2520

CAATTGAAAA AATTTCAGAA TAAATCCACT AAGGGAGGAA AATAGAGTGA ACATTAATGA 2580

TTTATTAATA AAAGATGTCA TGATTATGGA TTTACAAGCT ACTGATAAAA AAGGCGCGAT 2640

TGATGAAATG GTCCAAAAAT TATACGATGG TGGCCGCATC TCTGACATTG ACACTTACAA 2700

AGAAGGAATT TTAAAACGCG AAGCACAAAC ATCGACTGGT TTAGGCGACG GCATCGCTAT 2760

GCCTCACGCA AAAAATAGTG CCGTAAAAGA ACCGACAGTT CTTTTTGCTA AAAGTAAAAA 2820

AGGTGTTGAT TATGAAGCAC TAGATGGACA ACCTACTTAT CTATTCTTCA TGATTGCTGC 2880 

TCCAGAAGGC GCCAATGATA CACATCTCCA AGCGCTTGCG GCCCTTTCTC GATTACTTAT 2940 

CGATCCAGAA TTTGTTGGCA AATTGAAAGA AGCGGAAACT CCAGAAGCTG TACAAGCACT 3000 

TTTCCAAGCA GCGGAAGATC AAAAAGAAGC GGAAGAAAAA GCAGAACAAG AACAAACAAC 3060 

GCCAGCACCA GAAAGCAATC GTAAATATAT TATTGCTGTG ACTGCTTGTC CTACAGGAAT 3120 

TGCTCATACT TACATGGCCG AAGATGCGCT GAAGAAAAAA GCAAAAGAAA TGGGCGTAGA 3180 

CATTAAAGTA GAAACAAACG GTTCAGAAGG AATTAAAAAT CGTTTAACTG CTGAAGATAT 3240 

TGCTCGTGCT GATGGCGTGA TTGTCGCAGC CGATAAAAAA GTAGAAATGA ATCGATTTGA 3300 

TGGTAAAAAA TTAGTCAATC GTCCCGTCAG TGATGGAATT CGTAAAACCG AAGAACTAAT 3360 

TAACTTAGCC ATTAGTGGTG AAGCTCCTAC TTTCCATGGC AGTGACTCGG CTGCAAGCGA 3420 

CCAAGAGGAT TCAGCGGAAG GTTCAATCGG TTCTCGTATC TATAAAGATT TAATGAATGG 3480 

TGTTTCTCAT ATGTTACCTT TCGTTATTGG TGGCGGAATT GCCATTGCTC TTTCCTTTAT 3540 

GATTGATCAA TTTATCGGGG TTCCTCAAGA TCAATTAGCC AATTTAGGGA ATTACAATGA 3600 

CGTAGCTAGC TGGTTTAACC AAATTGGTGG CGCAGCATTT GGCTTTATGT TACCTGTCTT 3660 

AGCTGGTTTT ATTGCTTCTA GTATCGGGGA TCGTCCAGGC TTAGTTGCTG GTTTCGCGGC 3720 

TGGTGCATTA GCAAACGCCG GAGGAGCTGG GTTCCTTGGG GCCCTAATCG GTGGTTTCTT 3780 

AGCTGGTTAT GTAGTGGTTC TTTTACGTAA AGTTTTCAAA GGCTTGCCAA AATCATTAGA 3840 

TGGAATTAAA ACCATTTTAT TCTATCCAGT TTTTGGTTTG ATTATTACAG GCTTATTAAT 3900 

GTTAGTCATT AACATCCCAA TGAAAGCAAT TAATGATGCG TTAAATCATT TCCTACTTGG 3960 

CTTAGATGGC ACAAATGCTG CTTTACTTGG TGCCTTATTA GCTGGTATGA TGGCCATCGA 4020 

CTTAGGTGGA CCAGTTAACA AAGCTGCTTA TGTTTTCGGT AcAGcTACGC TAGCAAGTAC 4080

TGTTGCAGAA GGTGGCAGCG TGGTCATGGC TTCTGTAATG GCTGGCGGAA TGGTTCCTCC 4140 

ATTAGCAATT TTCGTAGCAA CTCGTTTATT TAAAAATAAA TTTACAAAAG ATCAACAAGA 4200 

TGCCGGCTTA ACCAATATTG TAATGGGACT TTCTTTTGTC ACTGAAGGGG CAATTCCTTT 4260 

TGCAGCAGCT GaCCCATTaC GTGTGATTCC AAGTTTCGTG GTAGGTTCTG CCTTTACTGG 4320

TGCGTTGGTT GGTGCGTTTG GTATCAAATT ATTGGCACCA CATGGAGGAA TCTTCGTCGT 4380

CTTCCTATTA AGTAATCCAT TACTATATCT ATTGTTCATC TTAATTGGGG CGATTATTTC 4440

AGGAATTGTC TACGGACTTT TGAAAAAACC TGTCGAAGTA CCTGCTTAAA AAAAGCAAAA 4500

AAGCGATTCA ACCATGGTTG AATCGCTTTT TTATTATTTT CTCTTTATCG CATAATTTTC 4560

TAAATACTCA GGATTTCCTT TGCTTCTGAT ATACGTGTAC ACGCCCATCA CAAAGGCCCC 4620

AAGGGTATTG GTAAATAAAT CCCCCATCGT ATCCATTAAA GCAGCACGCC CAATAAATGG 4680

TTGGCCGTTG GACATATTAT AGCGTTGTAA GTTCATGTTC CCTAATGAAT CGCAGATAAA 4740

TTCCCAGAAT TCCCAGAAGA CGCCACAAAG CCCTGCAAAT GCAAACCCAA ATAATAAAAA 4800

TAACCACGGG GAAACATCGG CGTAcTTCGT TTTCTTCAAT AGGAAACCAG QAATACCATA 4860

ACCCACAGCA GTTAACAGCA TCGGACTAAC AAAGTGTAAA ATTTTATCCC AGAACGAAAT 4920

GATTGAGATC ATATGCAACG AGGTTCCTAG GAATACTGAA ATAAATAAGA AAAACCAGTA 4980

AAAATAAACG ATAGTCGGTG GCATAATAAT TTTTAAAAAT TTATTGACAA GATCCGGTAA 5040

AAAGATTAGA CCAATTCCAA GTAAACATTC GATAACAAAC GTCACACCTT GTTTTGTTGA 5100

ATACTTAGCC GTAAAAAATT CCCAGAGATT GTACGTCAAG CATAAGATGC CAAACACCAC 5160

TAATCCATAT TTAATTTTCG AATAATTTTT ATCCaTTTTG ACACACCTTT CAAAAGTTAT 5220

TTTATATGAA AAAAACTGTA GATGAAAGAC ACCTTCCATC TACAGTTTTA GAAAACTCTA 5280

TCGAATGAAG AGCTTATTAA AAATGTTTAT TTTTCAACAG CTGTTTCAGT AATTGTTTCA 5340

ACAGCACGTT TCATACGAAT ATCGTGTTCT AACATGTCGT TGTTTAATAC TTTACGTACT 5400

TGCGCTTCAG GCATGTTGTA TTGTTCTGCT AAATCTTTCA CTTCAGCATC GATGTCTTCT 5460

TGTGTTACTT CGATGTTTTC AGCTTTGGCA ACTGCTTCAA TCACTAAGTT TGTTTTTGTG 5520

CGTGTTTCTG ctTCGCCTTC AAATTGTTTG TGTAAATCTT CTTCTGTAGA ACCAGTTAAT 5580

TGGTAGTACA TTTCAGGAGA AATTCCTTGG CGTTGCATGT TGTTTAAGAA TTCATCCATT 5640

GAACGGTGAA CTTCGTCGTG AACCATTACG TGTGGTAATT CAACGATTTC AGCATTTTCA 5700

ACAGCTAAAC GAATTGCTTC TTCGTCTTTT GCTTCTTCAG CTGCTGCTTC TTTTGCTTCT 5760

GTCAATTCTT TACGGAATTT TTCTTTTAAT TCGTCTAATG ATTCTACTGA ATCGTCAACG 5820

TCTTTTGCAA ATTCATCATC CAATTCTGGT AATTCTTTTG CTTTTACTTC ATGAACAGTT 5880

ACTTTGAAGA CTGCTTCTTT ACCAGCTAAA TCTTCTGCTT GGTAATCTTC AGGGAAAGTA 5940

ACGTTTACTT CTACTTCTTC GCCAGCTTTT TTGCCAACTA ATTGGTCTTC AAAGCCTGGA 6000

ATAAATGAAT TAGAACCTAG TTCTAATGAG TAATTTTCGC CTTTGCCGCC TTCAAATGCT 6060

TCGTCGCCTA AGAAGCCTTC AAAGTCGATA ACTACAGTAT CGCCATTTTC AGCAGCGGCA 6120

TCTTCTTTGA TTACTAATTC AGCTTGAGAT TCTTGTTCGC GTTTCAAACG TGCTTCCACG 6180

TCTTCATCTG TTACTTCACG GTCTTGTTTA GACACTGTTA AGTTTTTGTA TTCGCCTAAT 6240

TTTACTTCAG GTTTTACAGT TACTTCAGCA GTGATAACCC AGTCTTCCCC TTTATTCATG 6300

CTTTCAACAT CAATTTTTGG TTGTGCTACT GGTTCAATTC CTGCtTCTAA TACAGCCGCT 6360

TCATATGCTT CTGGTAAAAC AGCGTTTAAC GCATCTTCGT ATAACGCTTC TTCTCCATAC 6420

ATACGGTTAA AAACAGTACG TGATACTTTT CCTTTACGGA AACCAGGTAC GTTTAAGTTT 6480

TkTTTCACTT TATTGAACGC TtGCGTTAAT CCTTTTTCGA TTACTGCTtG TTCGATTGAA 6540

AAAGTTAATA CACCATCATT AGTGCCTTTT TTTTCCCATT TCGCAGACAT TTGTTTCCCT 6600

CCGAATAACA TTTAATTTaT ACaTCAAACT TwATwakGCA TACCTTAAAA TAGACACTAT 6660

CCCTGGnAAA TGTA 6674




(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 



TAGCATTnAT GATTGTCTTT AACTACATTC CAATTTATGG AATTATCATT GCTTTTAAAA 60

ATTATACGAT TGTTGACACG GTTTCTTCTG CACCATGGGT TGGACTTGAA AATTTTCGAA 120

TCATTATGGA AGATAGTTTT TTTTGGGAAG CAGTTCGTAA TACCTTAGCG ATTAGTTTAA 180

TGAAACTTTT TTTAGGATTC GCTATTCCGA TTATCTTAGC GGTAATGATT TTTGAAATGA 240

GAGATGGACA TTTAAAAAAA GTGATTCAGA CAATTTCTTA TATTCCGCAC TTTTTTTCTT 300

GGATCGTTTT AGGCGGGATG TTGATTTCGT GGCTTTCAAC AAATGGCTTT ATCAACCAAG 360

TCATGATGTC GTTAGGCTTA ATGAATCAGG GCGTCAATCA TTTGTTAGAT CCTGATAAGT 420

ATTGGTGGAT TGCTGTTCTT TCTGATTTGT GGAAAGAAGT AGGTTGGGGC ACGATTCTCT 480

ATCTTGCTGG TATGTCTCGC ATTGATCCAA CGTTTTATGA AGCCGCGAGA ATTGaTGGTG 540

CTTCAAAATT AACACAAATC CGAACGATTA CTTTGCCCTT ATTAGCGCCG ATTATTTCGT 600

TGAACTTAAT TCTAAATGTT AGCGGTTTAT TGGGATCGAA CTTAGATCAA ACGTTGGTTT 660

TAATGAATGC TCAAAACCAA AACAAATCAG AAGTAATTAA TTCATTTGTT TATCGTATGG 720

GGynAACGCA AGGGGACTTT TCTTATGCGA CGGCAGTTGG TTTAGGGATT TCTGTTATTT 780

CAATTGTTTT ATTAGTCATT ACAGATCGCA TTACTAGAAA AATGAACAAT GGTCGGTCGG 840

TTATTTTATA AAGGAGGCAA CAGCTGTGAT TAAAAGAAAA TCAAAAGAAA ACAAATGGTT 900

TTACTTTTTC AATACGTTAA TTCTTGTGTT GTTTTCTTTA TTAATTATCG TGCCAATTTG 960

GAATATTTTA GTTTCCTCAG TTTCGTCAAG TTCGGGTTTA GCCGGTAAAG GCTTGGTTTT 1020

ATGGCCGAAA GGATTTACGC TAGAAAATTA TCGGAAAGTT TTTTCGGATA GTAGCATTCC 1080

GCGAGCGTTT TTGATTTCTG TTTTAAAAAC AGTTATTGGT GCGTCGACGC ATACACTGTT 1140

CTGTGCGATT GTCGCCTATG GACTTAGTAA ATCTAGACTA GTCGGGCGTA ACATTTATAC 1200

AACGATGGGT GTCATTACAA TGTACTTTGG TGGGGGCATG ATTCCCACGT ATCTTTTGAT 1260

TAAGTCATTA GGACTCTTAG ATACCTTCTG GGTATATATC ATTCCAGCTT TATTTAGCTA 1320

TTATGATGTA GTTATCTTAA TGAACTTTTT CCGAGAAATT CCACCATCTT TAGAAGAATC 1380

CGCCAAAATA GACGGCGCCA GTGAGTTTCA AATTTTTTAT AAAATCTTTT TACCTTTAAC 1440

AAAACCGGCA TTAGCCACCA TCATTTTATT TAATGGGGTT GGTCAATGGA ATGACTTTAT 1500

GACCACAAAA TTATATATTA CCAAAGAaTA TCTGTATCCA TTGCAGATGA AAATTtATGA 1560

AATTATTGTt CaATCAmACT TGAGTaCCAT GACAGAGAgC GGCAGTACAG ATATGGTGGT 1620

GCAGGCAACG 1630
(2) INFORMATION FOR SEQ ID NO: 374:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 
TCATCCATAA TTCCTTTTTC CGCTGGTTCT TTTCCTAATG CAAAACCAGG TAACCCATCC 60 

GCGACCACGT TAACAAACAA CAGTTGAACT GCAGTAAATG GCGCACCCCA ACCAAGTAAC 120 

ATGGCAATTA AAACAATAAA AATCTCAGAA ATATTACAGC TTAATAAGAA GTTAATGGCT 180

TTACGAATAT TTCGGTACAC GGCACGTCCT TGCGCCACTG CATCAACAAT AGTTGCAAAG 240

TTATCGTCCG TCAAAATCAT ATCTGAAGCA CCTTGGGCCA CATCTGTTCC AGTAATCCCC 300

ATTGCGCAAC CGACATCACT AGCTTTCAAG GCTGGCGCAT CATtcACGCC ATCTCCGGTC 360

ATTGCTACAA CAGCGCcAGA ACGTTGCCAT GATTGAACAA TTCGAATTTT ATCTTCAGGG 420

GTTACACGTG CATAAACGGA TAAATCTTTA ACACGTTTAT CTAATTCTTC ATCACTCATT 480

TTTTTCAGTT CTGAACCAGA TAAGGCTTCA CTTTTATCTT TTAAAATTCC TAATTCTTTA 540

GCAATTGCAC TGGCTGTTAC CACGTGATCA CCAGTAATCA TGACTGTTTT AATTCCAGCT 600

TTTTTCGCAC GAGCAATGGC GCCCTTACTT TCTGGACGTG GTGGATCAAT CATGCCAATT 660 

AAGCCTAACA AGCGAAGATT TTTTTCTAAA GCCTCTGATG TAATTTCCTT GGGtGGTTCA 720 

TCATATACGG CATAACCGAC AGCAATAACT CGTAACGCTC GTTTTCCGAA ACGATCATTA 780 

ACAATAGCCG CTTGATCGAC ATCTCCGAAA CCAAAACGAG GTAGTAACAC ATCAAAGGCT 840 

CCTTTAGTAA TTGrGrTGTA CTTTTTCCCC CATTGATGGA CCGTA 885 
(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7664 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

GCGTGATTAT CTATACGTTG GTTCAATGTT ATTCGGTTTG TTTTTCGGTG CTGGAAATTT 60 

AATTTTCCCT GTACATATGG GACAAGAAGC GGGGGCTTCC GTCTTTTTAG CAAACTTGGG 120 

CTTCTTAGCC ACTGGAATTG GCTTACCTTT TCTTGGTGTC ATTGCTATGG GTGTTTCTAA 180 

AAGTTCAGGT GTTTTTGACT TAGCTTCTCG GATTAATCGT CGTTATGCTC TTATTTTTAC 240

CTTACTATTA TATTTAACGA TTGGTCCTTT CTTTGCTTTA CCAAGGTTAG CAACTACTTC 300 

TTTTGAGATT GGTCTTGCAC CATTTGTTCC GAAAGACTTG CAATGGCTGG TTCTTGCTGG 360 

TTTTTCCGCA TTATTTTTCT TAGCTGTTTG GCTCTTCTCT AGAAAGCCAA GTAAAATCTT 420

AGATTATGTG GGTAAATTTT TAAATCCGTT ATTTTTAGCA TTGCTAGGAA TTTTGATTGT 480

TTTAGCCTTT GTAAAACCAT TAGGTCATGT GGCCAGTGCC GCTGTTGGTG AAAATTATAT 540

TTCAAGTCCT TTCTTTAAAG GATTTACAGA AGGTTATAAT ACGCTGGATG CTTTAGCTTC 600

ATTAGCTTTT GGGATTGTTG TCGTTTCCAC TTTACGAAAT ATGGGCGTCG AAAAACCAGG 660

AGATATCGCT AAAGGAATGA TTAAATCAGG TGCTTTCAGT GTTGTATTGA TGGGGATTAT 720

CTATACCTTA CTTGCTTATA TGGGAACAAT GAGTATCGGG GCTTTTCCAA TCAGTGAAAA 780

CGGCGGGATT GCCTTAGCCC AAATTGCCAA TTACTATTTA GGCCTACCAG GGAGTATTTT 840

ACTGGCTTTT ATCGTAGTTC TTGCTTGTTT AAAAACCGGT ATCGGCTTAA GCACAGCCTT 900

TTCTGAGACT TTTGAAGAAA TGTTTCCGAA AGTTTCCTAT CAAAAATTCT TGGCAATTGT 960

TGTTATTTTA CCAGCTATTT TTGCAAaTGT CGGCTTAACC AACATCATTC AATTTGCCGT 1020

TCCAGTCTTG ATGTTCCTTT ATCCATTGGC GATTACTTTA ATGCTTTTGG TAATCATCAG 1080

TCCTATTTTC AAACATCGAC GGGAAGTCTA TCAAGCAACG ACCTATATCA CATTGATTGC 1140

TGCTGTTTTA GATGCATTAA ATAGCGCGCC TGCCTTTATC AAAGATACGG CGTTTGTAAC 1200

AACGCTTTTA GAAAAAGCCG GCCAATACTT ACCATTGTTC ACTATTGGTA TGGGCTGGGT 1260

CGTTCCAGCA ACGGTTGCTT TTCTAATTAG TTGGGTTTGG GTCATGATTT CTAAAAAAGA 1320

TCCTATTCTT GATTAATTCT TACCAAAACG ACACTAGAAA AAATGCGGAA GCGCATCTTT 1380

TCTAGTGTCG TTTTTATGTA TATTCGTTGT ACCAAACTCT TGACTATGGT AAATTGAAGT 1440

TACCAACAAG TACTGGAGGA AATTGCCCAT GTTAGTTACA ACAACCGAGA GAATTAGCGG 1500

TCAAGAATAT GAAATTATTG GAGAAGTTTT CGGGTTAACC ACACGTTCTA AAAATATGTT 1560

TAAAGATTTA GGTGCTGGAC TAAAAAGCGT AGTTGGCGGT GAAATCAAAG GTTACACTGA 1620

TATGCAACGA GAAGCACGCG ACCAAGCAAT CGAACGTTTA AAAGCAGAAG CCTCTAAATT 1680

AGGTGCAGAT GCCGTTGTGA TGATGCGTTT TGATTCTGGC ACAATTGGCA CAGACATGCA 1740

ATCTGTCGTG GCTTACGGAA CAGCTGTTAA ATACATCTAA ATTTTGCAAC TAAAAAGACG 1800

CAGACCTTTT CTTGTAAGGT CTACGTCTTT TTAGTTTTCA TATTTCACTT CTGCTAAATA 1860

CAATCCTTCT GGATGAGCAG TCGGACCTGC TAAATTTCGA TCTTTGGCTG CGATAATTGC 1920

TGGAATCGCG GTTGGTTCCA GCCGCCCATT GCCGATTTTC AATAGCGTAC CAACTAAAAT 1980

CCGAATCATT TTGTACAAGA AACCATCGCC CCGAAAAGTG AATGTTAATT CATCCCCCGT 2040


I I LA 1 I LAL A 


1 CjAAL AL I 


1 LA 1 


1 1 LuoftL 1 


TT , 2^^ 1 ^ , T >, T , / w,f ^ , ^ 

1 Inl i 1 ^ lt\ 


LLLrxriLl ILL 


?i no 


TGAGGCACAG AAAGAGGTAA AATCATGGGT GCCTAATAAA TCAGGCAAAG CTTGTTGGAT 2160

TTTTTCGAGA TCAATGGGAT ACGGAAAATA ACTAGCATAA AAACGTCGAA ATGGACTTCG 2220

CGGTTTGCCA ATATCCACTT TAAATTGATA GGTTTTTTCG ATCACATGAT AACGTGCATG 2280

AAAGTCTTCT GAAACACGTT CCCACGTGCG CACAGCAATG TCTTCGGGTG TTTGTGTATC 2340


TAACGCAAAG CGCATTTTTT CCAACGGACG TTCCTGCGGA TAATCAAAGT GAATCACTTG 2400

GCCCaTTGCA TGAACACCTG CATCTGTTCG TCCTGAACCA AAAACCGTCA CGGAGCGTCC 2460

ATTATTCATG CGTGCTAACG TTTTTTCTAT TTCTTCTTGA ACGGTACGCT CATTAGGTTG 2520

CTTTTGAAAG CCATGAAAAT GAGTGCCATC ATAGGCAATG ATTGCTTTAT AGCGGGGCAT 2580

AATTTTTCTT CACCTTTTCT AAGTTCTTAA TACAAATAAG CCAATGGTTA ACAAGACAAA 2640

GCCAAACATT ACTATTGTAT CTCGTAGGTG CCAATGAAGA ATGCGGTATT TTGTCCGTCC 2700

TTCACCACCT TGATAACCTC TTGCTTCCAT GGCAGTCGCT AAGTCTTCCG CACGATTAAA 2760

ACTACTTACG AAAAGCGGAA TCAAAAGTGG AACAATAGCT TTCATTTTTT GAAGTAAGTT 2820

TCCTTCACCA AAATCAACAC CACGGGCACG TTGAGCATTC ATAATCTTTT CCGTTTCATC 2880

CATTAATGTT GGGACAAAAC GTAAGGCAAT TGATAACATC AATGAAACTT CATGAACTGG 2940

AAATTTCACT ACATGAAGTG GGCGTAGCAA ATATTCAATC GCATCTGATA ACTCTAATGG 3000

TGCCGTCGTT AAAGTTAATA ACGTTGACAT GAAAATAATT AAAACAAAAC GACAGAAAAT 3060

AAATAGCCCA TTCGTCACAC CAAACTGGGT TATGGAAATA ATTCCCCACT GAAAGTACAC 3120

TTCCCCCCCT TGTGTAAACA AGGTTTGAAG AGCCaCAGTA AACAGAATTA GCCaTAATAA 3180

TGGCCGAACA CCGCGGATAA AGAAGCGGGC GCTGATTTTT GATAATGAAA TAGCAAAAGC 3240

AGTAAAGGCT GCTATTAAAA CATAACTCTG CCAATTATTT GCTAAAAAGA TAATTCCAAT 3300

AAAATAAAAA CTAGCTAATA ATTTTGCACG CGGATCTAAA CGATGGATAA AAGAATCGCC 3360

TGGAATATAA CGCCCAAAAA TCAATTTATT CATCATTTTG CCTCACCTGC TTCCTGCATT 3420

TTTTTCAGAA GCTGATCGGC TAATTGATCG GCCGTTAATG GTAATTGCTC AAAAGAAAAG 3480

CCTTTTGCCA CTAATTTTTC AGCAAATTCG GCTGCAGTTG GCACGCCTAA TTGTTTTTCT 3540

TTCAGCCATT GTGTCTCTTG AAACACTTCT TGTGGTGCTC CCGCTCGGAC GATTTGTCCT 3600

TTTTCTAAAA CAATCACGTG ATCTGCGTAA TTAGCCACGT CATCCATCAA ATGTGTCACT 3660

AAAACGATGG TCATATTATG TTCCTTATGC AGACGACTAA ACATCTCCAT CATTTCTTTA 3720

CGACCCTTAG GATCCAAACC AGCAGTCGGT TCGTCTAATA CTAAGACTTC TGGTTCCATC 3780

GCTAGCACAC CAGCAATTGC GACACGGCGC ATTTGACCAC CTGATAACTC AAACGGAGAA 3840

TGTTGTAAGT ATTTTTCATC TAAGCCGACT AAATCAAGCA TTTTCTTAGC TAATTTTTTC 3900

GCTTCTTCGT CTGAAACACC AAAGTTTTTC GGTCCGAAAG CGATGTCTCG CTCCACTGTT 3960

TCTTCAAACA ATTGTGCTTC TGGAAATTGA AAAACGATCC CAACTTTTTT ACGAATTGGT 4020

TTTAAATTCT TATTATCAGT TTCTGGCGTA ATTACACGCT CTCCAATTGT CACTTGGCCT 4080

TTAGTTGGTT TCACCAAAGC ATTTAAATGT TGTAATAGCG TTGATTTGCC ACTGCCAGTA 4140

TGTCCGACAA TCGCTGTATA AGAGCCGTCT TGAATCGTTA AATTAATATC GAATAATGCC 4200

CGTTGCTCAA AAGGAGTATT GGGTTGGTAA GTAAAATCTA CTTGTTTAAA ACGGATGTCC 4260

ATAACCAATC CACCATCCCT TCTTCTGTTA AATAATTCGT TGGCACAACA ACGCCACGCT 4320

CTTTCAAAGC TACTTTTAAT TTTTCAGGAA ACGGTAAATC CAAGCCCATT TCAACAAGCG 4380

CTTCTCCTGC AGAAAAAATC TTTTCTGGTG TTCCTTCGTT TGTCAATTGT CCTTGGCGCA 4440

TCACTAAAAT ACGATTCGCA TTAGCCGCTT CATCAATATC GTGGGTAATA GAAATAACTG 4500

TTAAATTACT TTCTTCTTTA ATCTTTTGAA TGGTAGCAAT TACTTCTGCA CGACCTTCTG 4560

GATCTAACAT ACTTGTCGCT TCGTCTAAAA TAATAATATC TGGTCGCAAA GCAACCACAC 4620

CAGCGATAGC TACTCGTTGT TTTTGGCCGC CAGATAGCCG AGCAGGTTCT CGCTTGGCAA 4680

AATCTAACAT ACGAACCCGT TCTAATGCAT CGTGAACACG TTCGACCATT TCATCACGCG 4740

GAATACCTTG ATTTTCTAAA CCAAAAGCCA CATCGTCTTC GACTGTTGAA CCGACAAATT 4800

GGTTATCTGG ATTCTGAAAA ACCATTCCTA CCATACGCCG GATATCCCAG ACGTTTGCTT 4860

CATTTAATTC TTTGCCGCCA ACTTTGATAG TCCCAGCAGC TGGCAGTAAC AGCCCATTGA 4920

TCGTTTTGGC TAACGTTGAT TTACCAGAAC CATTATGGCC AATAATAGCG ACCCATTCGC 4980

CTTGTTGAAT GGAAAAAGAC ACATCTTTTA AAGCGGGAGA AGCATCTTCT GGTTGATAAT 5040

TAAATTGGAT ATTATTTAGT TCAATGATTG GTTGCATCGC ATACTCCCTC AATCTATTTA 5100 

TTCTTTCTAA CACTACCAAA AAAGTAGCGT TTTTTCAATC ATCTGTTCTG ATTCTTCACA 5160 

CTATTTTTTA TTATACCAAA AGAACGGTCA TTATGGGAGC TCTTATCTGT CTTGACAGGT 5220 

CTATTTTTCA TAAAAATAAC TTCTTGTTCT CATAAAACAA GAAGTTACTG ATTTTCTTTT 5280 

AAAAATGTGT CTTCAAATAT TCAAACGCAT ATTGGGCATA TAACTCAGCA CCCATTGCCA 5340 

TCGCATCTTC ATCAATATTA AAACGACCAT GGTGATGTGC CCACTCCGTA TCTTTCTCTG 5400 

GATTACCACT TCCCACTAAG GCAAAACAAC CAGAAGCATG TTCAGTATAA TAACTGAAGT 54 60 

CTTCGCCACC AGTTGTCGGC TCTTCTTGTC TTAAGGCCGC TTCACCAAAA TTTTCCTTAA 5520 

TTAGTGTTTG GGCAAACAAG GCATCCTGCT CATCATTAAT CACTGGCAAC GTCCCATATT 5580 

GATAATCTAA CGAAGCCGTC CCACCGTAAA TAGCGGCTGT TTGTTCCGCA TAGCGTTGTA 5640

GCGCCTGTTC TACTCGATTA CGCGTTGCTA CACTAAAGCA ACGAACCGTT CCTTCCAAAC 5700 

GTGCATTCTC AGCAATCACA TTAAAACGGG TCCCCACATC CATCCGTCCA ATCGTGACAA 5760 

CTACTGGATC TAAAGGATCT GTCTCCCGAG AAACAATCGT TTGCAGATTC ATCACGAAAG 5820 

AAGAAGCTAT GACTGCCGCA TCAATACAAG CGTTTGGCAT GGCACCATGG CCACCACGGC 5880 

CTTTAAAATC AACAGAAAAA ATATCTGCGG ATGCGAAACT TGAGCCCACT CGACACGAAG 5940

CGGTTCCTAC AGGCATTTGT GACCAGATAT GTAAGCCAAA TACATCATCA ACACCGGTCA 6000 

TTGCGCCTTG CGCAACCATT GCTTTAGCAC CTTGCGCATT TTCTTCAGAT GGTTGAAAAA 6060 

TTAACCTTAC TGTTCCCTGA AGTTCTTCCT GGATTTCTTT TAATACTTTA GCCACTGTTA 6120 

CTAACATCGC GGTATGGGAA TCATGCCCAC ATGCATGCAT CTTGCCAGCT TCTAAAGATT 6180 

TATAAGCTAA ATCTTCATTT AATTCCTGTA CAGGTAAGGC ATCCATATCC GCCCGCAGCG 6240

CAACCACTCG GCCTGGTTTG CCGCCCACAA TTTCAGCAAT TAAACCCGTC GGTTCTGTTT 6300 

TGCGATAAGT AATTCCTAAT TGATCAAGAA CTGCCGCGAC TTTTTCTGTC GTTCGAAATT 6360

CTTCAAATTG TAATTCTGGA TGTTGGTGTA AATCCCGTCG AAAAGCAATC ATTTCTTGTG 6420

CATGCTGCTT GACTAGTGCT TTGATTTTTT TCATACTTAC GCCCCGTTTC TACAAACTTA 6480

ATAAAATACT TGAAAAAATA CCCGCTAAAA TAACCGAAAC AATGGTGACT GTTACGAAAC 6540

CTGCCACTAA CATCGGTGGC ATTAAATGGC TCGTTAAAGC CGCCTGTTCT TGTTCATCTT 6600

CTGTCAATGA TTGAATCACT TCATTAGTAA TAATATAATC GGCTGGAAAG CCATATAAAG 6660

CTGTCAACGA AACAGCGAAT GCCATCTCTT TACTCACACC TAATACACGA CCAACAATTG 6720

CTGAAAAGAT GTACATCCCG ATAACACCAA TCACAATAAT CCCAATCATT GGTAAAAATA 6780

AATCCATCAA CATTTCAGGT GTGGCTTGTT TTAAGCCATC AAAGACAAAA AGCATTAACC 6840

CTAAAATCGC AAAACCAAAG CTATTTGCTT TCTGCAATGG TTGCTTTTCT AAAAAGCCTA 6900

AAGACGTTGC AATCACGCCA AAACATAAAC ACAGCACAAA AGGACTAATC GTCACAAACG 6960

GTGCCGTCCA CACAGAAACA TAATAAGCAA GTGTGGCAAC TAAGGCTAAA CGTAAAATTC 7020

GTGAATAATC TGTATGATAT CGTTTGGGTA CTTTTTCAAA TAGTTTAGGC ACATCTGGTT 7080

CTGCTCCTGT TTCTTGCCCC TCTTGTTCAT GAACCGGCTG CCATGTCCCT GCTCGATATT 7140

CTTGAAGCTT CCGCTTTCCT TCTTTTTTTA GAACAATTGA AGTTAAGGGA TAACCAGCAA 7200

ACCCTTGAAT GACATAAATT AAAATTGCCA AAACCGAAAG TGAGGCTAAA CCAGCTGATT 7260

GAGCCGCTTC TGACATTACT AATGAAGAAA CTAAACCTCC TACTAACGGC GGGATTGCAA 7320

CTAAAACTGT TTTTAAGTCA AAAACAAAAG TCCCCACAGC TAACAACAAG GCAATAATGC 7380

CCAGAATACC TGACAAAGCA ATCACAATCG TTTTCCATTG ATTAACCAAT TCTTTAACAG 7440

AGAGCAGAGT GCCCATATTA GTAATCAACA GATACATTAA TAAAGTGGCA ACCACTGTAG 7500

GAATCCCTGC CACTTCCACA ATATTTTGTG GAAAAATAGT CCAATAACCA AGAATAAATA 7560

AAACACCACA AACAAAAACG GATGGAATCC AAGCTTTTGT TTTGATAGAA ACTACATCAC 7620

CAATGTATAA AATAACCATC AGCAACGAAA AAGCAAACAT TTGC 7664



(2) INFORMATION FOR SEQ ID NO: 376:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

AAAAAAATTT TCAAAAGTCA AAAATGAAGG GGATTTTCAA GATGGAACAA TCATTAATTG 60 

TGAGAGGGAC ACCACAAGAG TATCTTTGCG AAATTGGTTC GTGGGAATTA TTAGAAGGAC 120 

AACTATTATG CAGAAATATT CATCGTGTAT TCATTTTACA CGGGACTGAG TCATGGCAAG 180 

CGGCAAAACC GTATTTTCCG ACGTTTCAAA AAGTAACAGC TGTTTTTGAA AACTATGGAG 240 

GTGTTTGTAC AGATCAACGT GTGCAAGAGT TAGAAGCGCA AGTGCTGGAC AATCATTTAG 300

AGGCCATCGT CGCTGTTGGT GGAGGAAAAG TGGCCGACTT GGGCAAAGCC TTAGCACACA 360

AAATGGCGTT ACCGGTAATT ATTTTGCCCA CTTTAGCAGC TACTTGTGCA CCGTGCACGC 420

CATTGAGTGT AATGTATCGA GAAGATGGAG CAATGGAACG ATATGACGTT TTTCCGCAAG 480

CAAACGCTTT GGTGTTGGTA GAACCAAGAG TGTTATTACA TTCGCCTCGG TCCTTAATGG 540

TGGCAGGTAT TGGGGATACA TTAGCTAAAT GGTACGAAGC AGATGCGATT ATTTCGCAGT 600

TAGATGTCCA AGTGTTACCT ATTCAAGTTT CTCATTTTGC CGCGGAAAAA TGCCGCGATA 660

TTTTGTTAAA TGAAAGTATC AATGCTTTAA AAGCGATGGA AGAGCAACAA TTAAATCAAT 720

CGTTTATTGA TGTCATCGAA ACAAATTTAA TTATTGGTGG AATGGTGGGT GGCTTTGGCG 780

ATGATTATGG TCGGACAGCA GGTGCsCATT CCATTCATGA TGCACTAACT TTGCTGCCAG 840

CTAGCCATCG ACAATTGCAT GGAAACAAAG TGGCGTATGG TGTGTTTGTT CAATTAGCCA 900

TTGAAGAAAA ATGGCAAGAA ATCGCTGAAT TGATTCCTTT CTATCACCAA CTAGGTTTGC 960

CTATTTCTTT AAAAGAAATG GACATGGACT TAACTGAAGC CGAGTATCAA GAAGTAGCAG 1020

AmCGTGCTTG TATAGAAGGC GAAACGATTC ATTATnTGAA ACAAAAAATT cCCtGAAATT 1080

GTGAAACAGC GtGCAaGrTT TrGAAAAaTA TaCAGCGACA AAATAaGTGC CAAaCAGCAA 1140

CTAGTTCATT TGAATTATTT GAATTAGTTG CTGTTTTTTA AGGATCCTTT TTTGAAAAAA 1200

GAAACCTTGA GAAATCCTTG CAAATTCAAT AAAAAAAAAG TATGATGATG GGAGGACAAA 1260

TTATAGTAGA AAGTAGGGAA ATGAATGACG GAACCAGCCA TTCGTTATCG TTTAATCAAA 1320

AAAGATAAAC ACACAGGTGC GCGTTTAGGT GAATTAATTA CTCCTCACGG CACATTTCCA 1380

ACACCAATGT TTATGCCGGT AGGTACACTT GCGACAGTTA AAACAATGTC ACCAGAAGAA 1440

TTAAAAGAAA TGGGTGCCGG TGTTATTTTA AGTAATACTT ACCATTTATG GTTACGCCCA 1500

GGTGAAGATT TAGTCGAAGA AGCAGGCGGT TTACACAAAT TTATGAACTG GGATCAACCT 1560

ATTTTAACTG ATTCAGGTGG CTTCCAAGTC TTCTCATTAA GTGATATGCG GAAAATCGAA 1620

GAAGAAGGTG TGCATTTTAG AAATCATTTA AATGGTTCaA AAATGTTCCT TTCTCCAGAA 1680

AAAGCGATCA ATATTCAAAA TAAATTAGGA TCAGATATTA TGATGAGCTT TGATGAATGT 1740

CCACCATTTG ATGAAAGCTA TGAGTATGTC AAACGTTCGA TTGAACGGAC TTCTCGTTGG 1800

GCTGAGCGTG GCTTAAAAGC CCATGCGAAT CCTGATAGAC AGGGATTATT CGGTATTGTT 1860

CAAGGTGCGG GATTTGAAGA TTTACGTCGT CAAAGTGCTA AAGATTTAGT GAGTATGGAT 1920

TTTCCTGGCT ATTCGATTGG TGGCTTATCT GTTGGTGAGT CTAAGGAAGA AATGAACCGA 1980

GTCTTAGACT TTACGACACC GTTAATTCCT GAAAATAAAC CACGTTACTT AATGGGTGTG 2040

GGTGCCCCTG ATTCATTAAT TGATGGAGTG ATTCGTGGGA TTGACATGTT TGATTGTGTG 2100

CTACCAACAC GTATTGCGCG CAATGGTACT TGTATGACTT CACAAGGTCG CTTGGTTGTT 2160

AAAAATGCCC AATATGCAAG AGaTTTnCGT CCGCTTGATG AAAAATGTGA TTGTwAkGTT 2220

TGTCGTAATT ATACGCGGGC CTATATTCGT CATTTGATTA AATGTGATGA AACATTCGGA 2280

ATCCGTTTAA CATCGTATCA TAATTTATAC TTCTTGTTAG ACTTAATGAA AAATGTCCGT 2340
CAAG 2344 
(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377:



TAAGTTTTGT TTCAAATTCT TCTACATATT CTCGATACAT AAAAACACAT CTTGAAATAC 60
TCGGTCTAAG TTCCTTAACC CTCTCTTCCA ATTTTGTACA TTCTCTTTTT ACATAAGAAA 120
CAGTTAATTC GTTACTGTCT TCAAGTTCTT TTACACTCAA TTGAAGTTTA TGAATAGAAC 180
TCATCACATG AATCATTTTC TCAATAGTGG CATCCATAAG TTTTATATTT TTTTTGATTA 240
ATTCTTTATC CATTCTATCA ATCCACCTTT CTAAAACGGG TACATTTCTT CTAATTCGAA 300
CTCTCGTTCC ATATAGGACC GAACACTTTT ATTATCTCTA TTGGTAAACA ACAAGTCTTT 360
TGcATCaACa CTCaCCATTT CaATAATCTC TTCaTGCTTT TCTTTTnAAT TCTGCTAAAT 420
CAATGCGTTC CATCAATTCA TTTGTTCCTA ACAGATTATC AATCTCTTTA ATCATCTTTA 480
ATGAAGGGGC TAAATAATTT GATAGCCATC GAAGCGTCTT TTCAATAGAC /"i ffi rri Try fry TV 540
CTTGCACAGA CAATTTTAAG CCTTCCAATT TATCAACAAC ATCATGCCAA GATTTACAAT 600
AAACTTTCAT ATCATCTTCA ATGTCATAGA CAGTCAAAGA TTGATTAATA ATTTCTTTCA 660
CTAACCAATC TAGCGACTCA CCGCTTGATA GAAAATATTC TACAAACAAA AACGCCTTTT 720
CATCAGAAAG GCGAATTTCG TATCGATTTT TAATTTCATG TTTTTCTCTT GCTTCTTCAA 780
CACTCATACT TTCTTTTCTT GCTAACTCGT AATCTTTTTG GTAAAAGTTA AAATACAGTG 840
GCGATTGACG ACTACCAAAA TAAAGGGATA AACCTTTATT CACCATTTTT TTATTTTCGT 900
AAACAAATCC ACCGCTAAAA TCAAAATTTT TAAATGTGGT GTCAACAAGC CCCTGCTCCA 960
TTTTTTCTTT TAAAACATAC AAATCAAAAT TTTCTTGCCC CTTCACAATC AATTCATCAA 1020
GAGCAATATC AATTCGAGTG ATTTTTGTAT CTACTAAAAT GCCTTGACCG AAAATATCGT 1080
CATCATAAAG AGAGCGAAAA AATTCTGTCC aCGATCGGTk GTTATCTTCC TCAAAGATTT 1140
CTTCTAATTG ACGACAGCCT tCiGCCAGACA AGTCCaACAT AATTCCCATA TTGACATTTT 1200
CCGGATTCCG AAAAATCCGA ATAGAGCTAT ATGAAAATGT TTCCGTGTAA TGATAAAACC 1260
CTCTAGATTC ATGAGTAAAC CAGTTCATAT CCATGTGCAA CAACTTTTCA AyAACTGTCC 1320
GAACATCTAA TGTTTTAAAT CGTATTCTTA AAAAGTCAAT TTTTGCTTCT ACCGAAGCGT 1380
GCCCACCATC TAAGTTCAAC ACATTAATAA TCTTTTGTTC TAACTCTTTA GAAATAACAC 1440
GTTTTCCAGA TTCTATTTTT GAAATTAACG AACGCTCrAT TCCTAATTTA CTTGCCAATT 1500
CTTGCTGTTT TAATTTTAGy TCTTTTCTAT ATCGCTTTAA ATCAACACCT CTCACTGCTT 1560
TAAACCTACT TCCAAAAAAt GTGACAACTT TAGTCGAAAA ATGTGACAAA ATCATAATCG 1620
CTCAATCCCT TGTAGGACAT AGCTTTTAAG CAACTTTTTT ATACGGTATA TTAAATTCTA 1680
CCCCCATGTT ATAAATAGGG GGTTTTtGAC GTAAGGTGGG CATTCGCCCA CCTTACGTCT 1740
GTTCATTTTA AAAGGATTTT CCGTTGCTAA AAGTATACGT GGGCTTTCGC TCTGCTTCGC 1800
CCACGTATAC TTATTACTAG CAACAATCCT TCAAATAAAC AGCCGTTATT CAGCTGTTCC 1860
TCGTTCTAAT GCACCAATTC TCTCTTTTAA GCTTGCATTC TCCGCAAGAA TAGATTGAGA 1920
CACCTTTAAA AGTTGCCTCA TATCTCTTTT CAAATCCTCA ATTTCTTGTT TATTAATATG 1980
TGTAGCTTTA TTCTCATCAG ACATAAATAC TTCATTTATT GCCTCATTAA GCGGAACATT 2040
ATTTTCACGC AAACGTAGCA ACTCTTGAAA ATTTTCTATA TCTTTAATAG AAAAAACCTT 2100
ATAGCTATAA CTATGACCTG CACTATTCTT TGCATATCGG CGTTCAAAAC GTGTATCAGT 2160
CATTTTTTCA ATCCTATTTG CCCAATTTTT CAGTGTTTTC AAAGCAATAG CATTGTCTAG 2220
CTCTAAAATT ACTTGATCAA ATGAATACAT TTTCCACACC TACTTATTAA ACATTTCTTT 2280
CCAAGCAGAa tCCATATGCT CCTTAAATCT TGTTTCTAAA TACTTATCAA CTTCTTGTGA 2340
TGTTGAAAAT TTCTTTTCCT CAATTGGTCT ATCTTCCTCG TAAACAGCTT CTGCATCTAA 2400
AACAGAAATT ACTAATAATT TATA 2424



(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1468 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 



GACGCTCTTT TACCAATCAA AGAGTAAGGC ATAGCCTAAA GAAAGCAGCT CTCGACCAAA 60
ATTTTTGATA TCTTTTGGAA AAGAAGAAGC CGCAGTGATG CCTGAAACAT TTGAGATGCC 120
TGCTCGCTTG GcTAAAAGTT GCGAGCGGTA CATGTGAAAG TCACTAGTAA CAATCACTAG 180
CCGTTTAAAA GAAAGGAGTT TTTGTGAATT GACAATATTT TCTTTGGTGC GTGTGGCACG 240
ATTTTCTAAG AGAATCTGTT TTTCTGGAAC ACCTTTATTT AGCAAGTATT GTTTCATAAC 300
CGCGGCTTCC GTAGCCGGCT CATCTGCTCC TTGCCCTCCG CTAACAATAA CGGGTGTCTC 360
GGGGTGTTGT TTAATATAGG GGATAGCCAC ATCTAACCGA GCTTGCAAAA CACGGCTGGG 420
CAGTGCGGTT TCTTTAGACG TCCCTAAAAC ACGGGCCCCT AAAATTAAAA TAGCATCTGG 480
TTTATCGGTG GGTTGCGTAT TTTTGCCAGT AAAAATAAGT AGACCTAAAA TCAAAAGAAT 540
ACAAAAGCCA AAACTGACAA TGGATAAACT TGTTATTTTT AACCAATGCA TGAAACATCC 600
CTCCTTGTAA TGTTAGTGTA TACGAAGTAC CATAGAAAAA GCGATAGAAA ACAAAAGACT 660
TATAAAAAAA TCTATGTTTT AATAGAAACT TTTGAAGAGA GGGAGCGAAT GCATGCGTAC 720
CAAGAAGAAC AGAAGAAGAA ATGTTTCAAC TTATTATGGA TGTAGCAAAA ACATTCGAGC 780
AAAGATAGTT TGTGGGAATG GTGGGGTCTC GCACGAATGT GAAAGCGCCA TTCAAGATTT 840
GAAACAGCAA CGACATTGTT TATATTGTAG AACCATGTGC TGAGTTTTTT CGTGGA7AGC 900
ATGACCTTAT AAAATTTGGA CAACCATTAA TTATGCAACG TCCCAAGGAA TTCCCACAGA 960
GGCCAACGTA ACCGAAGACG CGGGAAACAT TTTTAATGTT ATTTGAAGAT TCGATTTAAC 1020
GATTCTTTAG GTTGTGTCCT CTCGCAGAAA AAGATAATTG GCATGAAGGC CAATTATCTT 1080
TCTGATAAAA GTTAGACAAA GATGAAAATT TGCCGCCTTT ACCTGTAGCA ATTATACAGT 1140
GAATTTTGGT AACAGTTCCG GATCAGCAGC AATTTAACGA TTGTTGCAAT GGGTTAGTAC 1200
GCTGTCACAC GTATGTAGTT AAGGGGCTTT GTCGGAATGA ACTATTTTAC ATCTCTATGA 1260
GCGGCTTGGC ATATTGTCAG CAGGAACTGT TGCGCTTGCT TTCTTGGCAA AGGAACCGGA 1320
TATGTCACTC GCCAATTTCT GTTGGGAAAC AATTTAAGTA TCTGAAAAAT CTGACACGAT 1380
AgCcTkGTTG GGATCAATTA GCATCGTTGc TGGaTTTTTC ctAGTaAAGg GaACaGCtTA 1440
ATwAAgrCcC CAAGCCTTTT TTGGACGT 1468

(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13085 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 

AGTTTAGGnC ATmCGTAAAC ATCTTCTCCT CCTTTATTCG TTCATTTTTT TATAGATAAC 60
GTAnGGGAmA AnGGATACGA AgCTTACTCA CTTTCGTATC AAAAAAgCAw TCTTCACTCA 120
AGTGCTACAC GCTTTTTTGG TTTTAAAaTG ACATTTAAAA GGACTGCTAC CAAAGTGGTT 180
AAGATAACTT CCGAATTACC GAAAATCGTT GTTACCCAAG TTGGAAAACC AGCTAAGCTT 240
CCAGTAGACA ATGTGACACC AATTCCAAAT GCTAGAGCTG TTCCGACAAC CCCAGTATTT 300
CGAGGAGTCA TTTCCTGCGA AGCAATCATC CGAATACCTG TCATAGAAAT AGAAGCAAAA 360
ACTGAAATCG TTGCGCCACC AATTACCGCG TAAGGAATCG TAGTTAATAA TGCAGCTACT 420
TTCGGAACAA AACCAGCAAT TAATAATATA ACTGAAGCGA AAACCAGAAC ATATTTATTA 480
ATCACTTTTG TTACCGTAAC CAGACCAACA TTTTGTCCAA AAGTTGCCAC TGGGATTGAT 540
CCAAATAAAC TGCCAATAAA GTTAGTGAAA CCACTCCCCA TAATTCCGCC AGATAATTCT 600
TCATCCGTTG CATCCCGATC CATTGCCCCA ACCGTTGTTG CTGTAAATTG ACCGATGGCT 660
TGAACAGCAT CAACAATAAA CATCACCACA AGCGTAAAGA TTGGTACCAA TTGAAAATCC 720
AGTCCAAAGT GAAACGnGGT AATTACCTGA ACGATTTTAG CATTTTGAAC TGGCTCAAAA 780
CTAACCATCC CTAAAGCTAA AGAAATCAAA TAGCCAATAA CCATTCCGTT TAAAATTGCG 840
GATAACTTCA AAAATCCTTT TGCAAAATAA TTGAAGTAAA AGACAATCGC AAAGGTTAAT 900
AATGCAACTA ACCAATTCCG GGCTGAACCA AAATCAGCAC TACCAGCTCC TCCTGCCATA 960
TACTTAATCG CAACAGGAAA CAAGGATAGT CCAATGCTTA AAATAACCGT TCCTGTTACT 1020
AAAGGCGGAA ACAACACACG GATTTTTTTA ATGaACAAAC CAACAATAAT AACAAGAATA 1080
CTGCCAACTA ATTGGGAACC AAAAATTGCT GCAATCCCAA AATCAGCTCC GATGGCCATC 1140
AAAATCGGTA CATACGCAAA ACTTGCCCCC ATCaTAACTG GTAACCGTGA ACCCACTTTT 1200
CCAAAAATAG GGAAAAGTTG AATTAACGTT GCAACTGCTG AAAAAATCAA TGATGTTTGA 1260
ATCAAAATAG TCGTATCACT AGGACTTAAT TGGCAAACTT TGGCAATCAT GATTCCCGGT 1320
GTGACAATCC CTACTACTGC TGCGACAACG TGTTGCAATC CCATCGGAAT TGCTTCAGAA 1380
ATTGATAGTT TCGCATCATA TTCAAATAAT GCTTGCTCTT TTTTTTCATT CATAGGATCG 1440
TTTCCTTCTT CTTTTTATTC TAAAAGGATT GTTCCCCTTA ATCAATAAAG AATGCAATCC 1500
TTTTTCCCTC AGAAACATCT TAAAAATTAT CCTTTGACAG CAACTGCTTC AATTTCAAAT 1560
AAACCATCTT TTGGCAACTT GGCTACTTCT ACACAAGAAC GTGCAGGCAA AACTTCTGAA 1620
AAGTAATTCC CATAGATTTC GTTAATTTTG CTAAAATCAG ACATATGTTG TAAAAACACC 1680
GTTGTTTTCA CAACATTATC ATAGGTCATC TCGACCTCTT TTAAAATCGA ACCTAAATTG 1740
ATAAAAGCTT GTTTTGCTTG TTCTTCCACT GTAGTCTTCA TTTCGCCCGA TTGCGGATCT 1800
AATCCTAGTT GTCCAGAAAT GTACAACGTA TTTCCCGCTA AGACAGAATG TGAATAAGGT 1860
CCTACAGTAG CTGGTGCCTG TGCAGAATTA ATCATTTTGT TTGTCATCGT TTTCCTCCAT 1920
TCTATGATTC TTTATTTTCA ATTAAAGTCC CTGTTTTTCC AGATAATCCT TCTTTGGCTT 1980
TTTCTAATAA CGTAATCAAT GTTTTTCGAC CTGGCTTAGA TTCAGCAAAC TTAATCGCTG 2040
CTTCAACTTT TGGTAACATT GAACCTGGAG CAAACTGACC TTCTTGCGCA TATTGTTTCA 2100
TTTTTTCTGT TGAAACATTC CCTAAGGCTT CTTGATTTTC TTTACCAAAA TTAATACAAA 2160
CTTTTTCAAC TGCTGTTAAA ATCACGAGCA GATCAGCATC CACTTGTTCA GCCAGTCGTT 2220
CACTACAAAA ATCTTTGTCG ATGACTGCAT TGACACCTTT TAGTCGGTTC CCTTCTTGAA 2280
TGACTGGGAT GCCACCACCG CCGCCAGCAA TGACTGTTTG GCCAGCATCA ACCAATGTTT 2340
GAATCGTGAG AAGTTCGACA ATATTTTTTG GTTTAGGAGA AGCGACGACT TGGCGATAAC 2400
CACGGCCAGC ATCTTCCATC ACTTGTATAT TTTTTTCTTT GACTAATTGT TCTGCTTCTT 2460
CTTTTGTCAT GAAGCGACCA ATTGGTTTCG TTGGTTTTTT GAACGCCGGA TCATTTTCAT 2520
CCACGACCAC TTGAGTGACT ACGGTGGTGA CGGCTTTACT TATATTTCGT TGACGCAACT 2580
CTTCTTGTAG CGCATTCTCT AAATCATAGC CAATGTAACT TTGACTCATG gCTACGCACA 2640
TGGATAACGG AACAACTGGA TATTGTGGAT TATTAAAGGA AAATTCTTCC ATAGCTAATT 2700
GAATCATCCC CACTTGAGGA CCATTGCCAT GAGATAAAAT AACTTCATAT CCTGCTTCAA 2760
TCAAATCTGC AATCGCTTTG GAGGTCTGTT TTACGGCAGT CATCTGCTCC GTCAAATTAT 2820
TTCCTAAAGC ATTGCCACCT AAAGCTACTA CGACGCGTTT GCCCATTGCT TCACCTCATT 2880
TTCTGTTTTA TAAAAAGGAC GATGCTTTTT CTTTTTCACA ACGAGTTTTC GTATTAACTG 2940
TCTGTTGTCG TTGCATGAAG TCAACGCACA TGACTAAGCG TTCTTGACTC GAGACAATCA 3000
CACATTCATG AATAGACTTG GGAAGGTCTT ATTTGTTCAT GTTGAAGAGG CAACTTCTAA 3060
TGATTCATTG TGTAAAGAAA AAGCATCTGT CTTCATGCCA AATTCATTAT TTTACTTCTC 3120
TTGGTGTCGC TTTATCCGCT AATTCTGTTA AGGTTTTTTG CGGATTTTTT ACTTTACTTA 3180
AGAAAATCAT AGCCGCAATA ACATACGGTT TGTAGCTAGC TTCTTTGTAA AGTTCGACCC 3240
GATAACGATC GAAAACAGAA GCTTCCACTT CGCCTTCTTC ACAACTAACG CCAGTAATAT 3300
CAGCTGGTAG ACAATGCATA TACAAAGCTT TGCCGTCTTT TGTTGTTTTC ATCAATTCTT 3360
CTGTACATTC CCAATCTTTG TGCTTTTTAT TTTGAGAAAG TAATTCTTGT TCTAATTGAT 3420
CAATTCCTGC TTGGTCACCA TTGCCATATA ATTCAGTTCG CTTTTCCATT GCCGCAAAAG 3480
GTGCCCAGCT TTTCGGATAA ACAACATCCG CATCTTTGAA TGCTTCTGCC ATACTATTCG 3540
TTTTAGTAAA GTTTCCACCA AATTCTGCCG CATTTTTCTT CGCAACTTCT TCTACTTCTG 3600
GCATAATTTC ATAACCTTCT GGATGTGCTA AGACGACATC CATTCCTAAA CGGGTCATTA 3660
GACCAACAAT TCCTTGAGGA ACAGATAATG GTTTTCCATA AGATGGTGAA TACGCCCAAG 3720
TCATGGCAAC TTTTTTCCCT TTCAGGTTTT CAATGCCACC AAATTCATGA ATTAAATGCA 3780
ACGCATCCGC CATTGCTTGC GTCGGATGGT CAATATCACA TTGTAAATTC ACTAACGTTG 3840
GGCGCTGTTC CAATACCCCA TCTTTATATC CTTCTTGGAC AGATTCTGAT ACTTCGTGCA 3900
TGTAAGCATT GCCTTTACCA ATGTACATGT CATCACGAAT ACCAATGATA TCAGCCATAA 3960
AAGAAATCAT GTTTGCTGTC TCACGAACAG TTTCTCCGTG AGAAATTTGA CTTTTTCCTT 4020
CGTCTAAATC TTGAACTTCT AAGCCCAAAA GATTACACGC TGAAGCGAAA CTAAAACGTG 4080
TTCTTGTAGA GTTGTCGCGG AATAAAGAAA TCCCCAAGCC ACTATCAAAG ATTTTAGTTG 4140
AAATATTATT TTCTCTTAAA TAGCGTAATG TATCGGCAAC TGTGAAAACA GCTTCTAATT 4200
CATCTCTTGT TTTTTCCCAA GTTAGGAAAA AGTCATTTTC GTACATTTTT TCAAATTCTA 4260
ATTTATCTAA TTTCTCAATA TATTCTTTAA ACGTTTCCAT TCCATCTGCT CCTTTTTTCT 4320
TGTCTAATAA TCCTTTACTT AACCACAATA GACCGTTGGT AATGCCGCAT AAACGGCTGC 4380
ACAACGAACC AAATCATCTT TCCAAGTTTT TTCATTTGGT GCATGTGCTT GTGCTTCGGC 4440
CCCTGGCCCG AAACCAATAC AAGGAATACC ATTACGTCCC ATAATTGAAA CGCCATTTGT 4500
TGAGAAGGTC CACTTATCTA ACAATGGGCG TTCTTCACGC ATTTCCACTG TTTCTACAGA 4560
GCCTTTCCGT GTTTCTCCAT AGAGATTTTT ATGCGTTTCC ATCAAGGCTT TTGTTACATC 4620
ATGATTTTCT GGAATTACCC ACGTTGGGAA ATAACATTCA ATTTCATAAG TTAAATCCGT 4680
ATAAGAAGGA CGATCATAGT TGTACATTGA AACCGTCACA TCATCTCCAT ATTTTTTAAC 4740
AGCTGGTAAA TTGCGAATTT CTTCTAAACA ACTTTCCCAA GTTTCTCCAG CAGTCATTCG 4800
ACGGTCTAGT GAAACGGTAC AGCCATCCGC AACAGCACAC CGACTTGGTG AAGAATGGAA 4860
AATTTGCGAA ACAGTCACCG TACCACGTCC TAAGAAACGG GCTTCTTGCC ACTCTGGATT 4920
GTACTTAGGA TCTAACATAC GAACTAAACC ACGAATCGCT GTGCTTTCTG TATCTCCATT 4980
GTTGTTTAAG GCACGAACAT CTTGTAAGAT ATCCGCCATT TTGTAAATGG CATTATCTCC 5040
ACGTTCTGGT GCTGAGCCAT GACACGAAAC GCCTTTGACA TCAACTTTAA TTTCCATTCG 5100
GCCTCGTTGA CCACGATAAA TCCCACCATC TGTTGGCTCT GTTGAAACAA CAAATTCTGG 5160
ACGGATACCG TCTTCTTTGA TAATGTACTG CCAACATAAG CCGTCACAGT CTTCTTCTTG 5220
GACAGTAACT GTTACTAAAG CAGTGTATTT TTCTGATAAT AGACCTAAAT CTTTCATGAT 5280
TTTAGCGCCA TAAATAGCCG AAACAATCCC GCCCTCTTGG TCAGACGTAC CGCGACCACC 5340
GATCTCAGTT TCTGTTTCAT AGCCATCGTA AGGATCAAAT TTCCAGTTAC TCATTTCACC 5400
AATTCCGACA GTATCCATGT GCCCATCAAA AGCAATTAAT TTTTCACCAG ACCCCATATA 5460
ACCAAGTAAA TTTCCTTGAG GATCAATGTC AATTTTATCG AAACCTAATT TTTCCATTTC 5520
GGCTTTTGCT CGCGCAATTT TGTTGCCTTC TTCTGCACTT TCTCCTGGAA TTTTGACAAG 5580
GTCTCTTAAA AATTTGACCA TATCTTCTCT GTAACCTTCT GCTGCTGCAT TTACTGCATT 5640
AAAATCCATT CCTTTTACCT CCATAAAATT TTTTTATAAT TGTCTGGATC AGTGTcCCCT 5700
TCGGTAGAAA ACATTAAAAC GTTTGACGTC TCATCCAGTT GCAATGCCTC ATGTAATTCT 5760
TGATAGTCGG GATCTTGCAT CACTGTAGCT ACTAAGCCCA TTGCTACTGC CCCAGATTCG 5820
CCTGATACAA CTTGCGGATC CCCTTTTAAT GGCGCGCCCA ACATACGCAT TCCTTTTTCA 5880
GAAACCCAAT CTGGCGCTGA AACAAAGAAG GATGTATGAT TTTTTAGAAT ATCAAAAGAA 5940
ATCGTATTAG GTTCTCCACA TGcTAAGCCA GCCATAATTG TCTGTAAATC ACCATCGACA 6000
AAGCGAATTT CTCCGTCTTT TTTAATTGCT GATTGATATA AACAGTCAGC CGCCTGTGCC 6060
TCAACTACGA CCATAATCGG TGGGTTTTCT GGAAAAGCAT TAGCAAAATA ACCAATCACT 6120
GCACCAGCTA AACTGCCGAC ACCAGCTTGA ACAAAAACAT GACTAGGTCG TTCAGAGCCA 6180
AACTTTCTTA ATTGTTGAGA GGCTTCTAAC GCCATCGTTC CGTATCCTTG CATAATCCAA 6240
GTCGGAATTT TTTCATATCC GTCCCAAGCG GTGTCTTGAA CCATGACACC GTTTTCAGTT 6300
TCTTCGGCCA TTTTATTAGC CATTCGCACA CATTCATCGT AATTAACTTC TTCGATTGTG 6360
ACGGTTGCGC CTTCTTTTTC AATATTTTCT TTTCTCGTGT GTGTTGAACC cTTCGGCATG 6420
AGTACAACTG ATTTTTGGCC TAATTTATTC GCAGCCCAAG CGACTCCTCG TCCATGGTTA 6480
CCATCAGTTG CTGTAAAAAA CGTAGCTTGT CCAAAAGCTG ATCGTAATTG ATCAGAAGTT 6540
AATTCATCAT ATGTTAGTTC CGAAACATCT TTCTTTAATT TTTCGGCAAT ATAGTTAGCC 6600
ATGGCAAACG AACCACCTAA TACTTTAAAA GCATTTAAAC CAAAGCGATA AGATTCATCT 6660
TTAACAAAGA AATCATTTAA CCCCAGATAC GCGGCCATAT GATTCAATTC TGCTAATGGT 6720
GTTTCTGAAT ACTGTGGAAA ACTTTTGTGA AAAGCCAAAG CTTTCTCAAT TTTTTCTTTA 6780
GACATCAGAG GTAAGTACTT ATCCTCCGTC TTCGGCATTT CATTTGCTGT CCATTTAATT 6840 

TTTTCCAAAA ATACTTCTCC TCCTTCTATT TTTAAGTTGC TTTACGAAAA ACAAATTTTC 6900 

CTTGATTTGC CAAGATGACT TGCCCAGCTT CAGCAACTTT TTGACCTCTT AAATAAACCA 6960 

TTCGAGCCTG TCCTTGTGTT TCAAATCCTT CATACGGCGT ATAATCGACA TTTTGTAACt 7020 

GCGTTCTCGC AGCAATCACG CCGGTTGTTC TGGGATCCCA AACAACTAAA TCGGCATCAC 7080 

TGCCTTCTTG CACCACACCT TTTTGTGGAT ACATCGAGAA TTGTTTAGCA ATATTTTCAG 7140 

AAAGCAGAGC AACCATTTTT TCCAAGGTGA TTCGGCCCTT CGCCACACCT TCCGTGTAAA 7200 

TTAATTCTGG TCTCGTTTCA ACACCTGGCA TCCCATTGGG AATTTTACTA AAATCATCTT 7260 

TGCCAACTGT TTTTTGTCCG TAAAAATTAA AACTGCAGTG ATCTGTCGAT ATTGTATTGA 7320 

TTGCCCCCTC TTTAATCCCT TGCCACAAAG CCCGTTGATC CTCTCGTTTG CGTAGTGGCG 7380 

GTGAGCAGAC ATATTTTGCT GCTTCAAAAT TAGGCGCATC ATACAAGTGA TCATCTAGTA 7440 

AAAGATATTG AGGGCAAGTT TCTACATAGA CTGATTGTCC TCTTTGACGA GCCCGTTCCA 7500 

CCGCTTCTAA TGATCGTTTT GTGCTTAAAT GCACAATATT AACCGGCAAA TCAGCCATTT 7560 

CCGCTATCAT CAAATATCTT GCTACCGCTT CTGCTTCAAC AGCCGCCGGT CTTGATAACG 7620 

GATGATAGTG CGGGGTTAGC TTTCCTTGAG AAACATATGA TTGGaTTAAT TCATCTACTA 7680 

AATCACCATT TTCACAATGA ACGCCTAACA TTCCATTAAC TTTTTTTATT TCCTTCATCG 7740 

CTTCAAAAAT TTCCGCATCT GTGGTCCGCA AATTATCATA AGCCATATAC ATTTTGAAGG 7800 

AAGTAATACC TGCAGCAATC ATTTCCTGAA TTTCAGCAGC TATCGTTGGT TTCCATTCAA 7860 

TCATGGACAT ATGGTAAGTA TAATCACATG AACTCTTGCC TTCAGCCAAT TGATTCCAAG 7920 

TAGCTAAACA GTCTTTCAAT GAGCCGCCTT TATTAGGTGT AGCCATATCA ATAACCGTAG 7980 

TCGTACCTTT GGCCACCGCC GCTTGGCTTC CAGTGGTAAA GTTATCCGCC GTACTTAACG 8040 

AACCTTTGCC ATTATTTAAT TCCAGATGCG TATGCGCATC AATaAAGCCT GGTAATACAT 8100 

AACACCCCGT AACATCCTCA ATCTGAGAAT TTTCTACTGG TAAATTTGCC CCCATCTCTA 8160 

CAATTTGCTC GCCGTCAATT CGAATATCCA ATTGACGACG ATTATATGCA GATACGACGG 8220 

TTCCACCTTT GAGCAGAATA GACATTTATT TATCCCCCAG TCGTAAATTT CCTTTTATGC 8280 

TTGCATACAA TAAGCATATT CTTCAATCAT AGTTACCATT AATGCAATTA AATCTTCTGG 8340 

TACATCTAAT CCTTGATCTG TTAATTGAAT TGTTGACACC GTTCCCCATA GACGCAGTAA 8400 

ACAAATTTTA TCCACAGGAT TTTGAACATA GAAACCTGAA TTAGTACTGT CATAAAAATC 8460 

AGCTTCACTG TTAAATAAGG TAAATTTGTC TTTGTAAGGG GCACTTGCAT AAGGGCAAAA 8520 

TGTTTCACAG TTGCCACATT CGTTACACAT CCGATCAACA TGGACAATTT GTGGTTTTCC 8580 

TTCGACATAC ACAACAATAT TTGCTCGGTT TGGACAGACA TCCATACAAG ATTCACAAAT 8640 

TGTGGAGCAT TCTAAGCAAC GACTAGCTTG ACTACAAGAC ATCTCGTCTG TCACAAGGAT 8700 

ACCCCGTTTG TTTCTAACAA ATGCGACATC ACTATTTAAA TTATCTTTTT CATAGTGATG 8760 
TATTGGCCGC TTTGGTTGCA TCAGCGATTG CTTCAACAAT 8820 

CGTCACCAAT GACATAGACA CCTGGTATAT TTGTTTCCAA 8880 

CTTTCCCATA GTTGTCTGTG TGAATGCCTA ATGCTTGGTA 8940 

CACCAACAGC GGCAATCACT GTATCAGCAG GAATATCAAT 9000 

GTCGACGGCG TCCAGAGGCA TCTCGTTCAC CTAGGACCAT 9060 

GATTTTCATG TTTAATTGGC GAAAGTAATT CTAAAAAGTC 9120 

AATAAAGTTC TTCTTCATCT GCTGGCATAT TGCGTTTATT 9180 

CTTTTTTAAC ACCAGGTAAT GTTGTGGCTG CACGAGCACA 9240 

CACCAACGAC GACAATTTGT TCTCCGTAAG GATTAATCGT 9300 

TTAAAAATTC TAACGAGTTC AATGCTCGAC CAGATTCTAA 9360 

ATGCACCAAT GGCATAAATA ACATTCGTAT AGCCTTGGTT 9420 

GCGCCTCTTG TCCTGTACGG AATTCGGCTC CCATAAATTC 9480 

GAACAGATTC CATAGAAATT CGAAATTCTG GTACAATCTG 9540 

TCTCGGATTT TTCAAACACA GTAACAGGCA TGCCTTCACG 9600 

AGATCCCTGC AGGGCCTCCA CCCACAACAG CTGTTTTAGG 9660 

GCTTTTCTAG TGTCGTCAAT AATTCATCAT AAGCATGTTC 9720 

CCCGAATATG AATAGATTCT TCATAAAATT GACGTGTACA 9780 

CACAAATGGT TCCAGTAATA AAAGGCAACG GATTCTTATC 9840 

TTAAATAATT ACCTTCACTG ACATAACGAA GGTAAGCAGG 9900 

AACCACCGTC ACTTCGACAA GGAGCAATAT AACAATCTGT 9960 

GTTTGGTACT TTCTGGTAAT TTAATTGATT TTTGGTAACG 10020 

CAACAACTTG AGCTAATTTA TCCAAGTTGA CATGGACCAT 10080 

TTAATACATT AGCAACTTGA TTCATTCGTT GATAGCCACC 10140 

CCATAGTGAT TGGCCAAATC CCAGCATCGA ATATTTCTTT 10200 

CGCCAGAATA AGAAATTTGT AGTTTCCCAT CAAATGCTTC 10260 

AAATACTAAG AGGAAATAGC GAACGTCCTG ACATGTACAT 10320 

TTGCTGCAAT CGTTACCGGG AAAGTGTTCG TAATTTTAAC 10380 

TATTAGCTAA TAATTGCAGA CGTTGTAACA TCGGTACTGC 10440 

CTTTAAAATG GTGATCGTCA AAAACCATAT AATCAAACCC 10500 

GCGCATATTC ATAACCTAAC ATGGTTGGAT TACATTTAAT 10560 

CACTTAATAA ATAGGCCGCA ATTCGTTCAA TTTCGTCTGA 10620 

ACAGCGTAAT AGAATGACAG ACTTTAGGAC TAATTGCTTC 10680 

TTTTAAAATA AGATAGATAT TTTTTGGCAG CTGCTTGGCA 10740 
TTCCGCCCAA ATCGGCGTTC CTTCTGCGTT CTGCATTTCG TTAATATAAC GATCAATTTT 10800
TGGTGATTGA ATCCCTGCCA AATCATAGCC AACACTCATA TTAAAAATGA AACCATTCGG 10860
GTCACCTAGC TCGAACTCTT TACTAAGTAA CTTCAAAACG AACCATGCTT TGACGTATTC 10920
ATCATAAGCT TGGGGAACGC GCAATTCTGT GGACCACTCA ACGTTGTAAC ATTCATCTGC 10980
TGCTGCGATA CATGGTTTAC TCACAGGTAA ATCTTCTCCA TCCAGTATCT GGACTGTCTT 11040
AACTTCAAAA AAACGAGAAC CTGTTAAATA GGCCGCAATA ATATTTTGTG CTAATTGTGT 11100
ATGAGGACCT GCTGCTGJAC CACAAGGGGT TTCCATTTTT TCTCCAAATA ACGAAATCGT 11160
TTTTGTAGGA TCCGCATGAT AAAATTTACG AATGCCAAAA ATTGTGCCAT CTTGTTGATA 11220
CTCACTAAAA ATCCAATTTA ATAAAGCCTC AATTGAGATG GGATGCATAA TATCACTCAT 11280
TAACGTCACT CCTTTTATTT ATAATGAAAG CTATTGATTT ACTCGTTTCC AGAAAGATTG 11340
TGCTTGCGTT TGCGCATCAT GCCAAACTTT CTCTTCATCA ATTCCTTGAA CTTCACGATT 11400
ACGCATCCGA ATTTCTCCAT TAATCACCGT GTCAGTGACC ATCCCACCAT TCATGCCGAA 11460
TAAAATGTGC ATGTTATAAT TCTCTTTTGT CATTGGTGTC GGGCCATGAT AATCAACAAT 11520
AATGACGTCT GCTGCTGCAT TAGGTTCTAA AACGCCTAGC TTTGTTTTAA AATAGCGATT 11580
AGCCATTTGC GGATTATTAT CAAACAGCAT TGTGGGAATT TCTGCCCAAG CCGCATTTGG 11640
ATCAGCTAAA TGATGTTTGT GGATAACATT TCCTACCTTA TAGGATTCCG TCACATCATT 11700
CGTATAACCA TCTGTGCCAA GACCCATCAA AtACCATACT CATTAAACAT GCGCATCGCA 11760
GGTGGGCAAC CGACCGCATT CCCCATATTC GACTCTGGAT TCGTGACCAC CATTGTGTCT 11820
GTATCTCTTA GTAATTCCAT CTCATGAGGT CCAATATGGA TGCAATGGCC TGCCATTGTC 11880
TGTTTTCCTA AAATCCCTAA ATCAAATAAT CGGTTCACAA TCGGCTTCCC ATATTTTTTC 11940
AAAGAATCAT GAACATCGGC TAAATCCTCT GCAATATGAA TATGATAACC AATCCCATCT 12000
GGCGTATTGG CTGCACATAA TTCTAAAGAA GCATCTGATA AAGTAAATGC TGCATGCATC 12060
CCCATCATCG CTTTTTGCAT ATCATCATTT CTTAAGGAAC TGGCTTTAAT AAAGGCGGCA 12120
TTTTCTTGAA CTGCTTCCCG CATTTTCTGC TCACCATCGC GATCAGAAAC TTCATAACAA 12180
AGGCAGGTCC GAATCCCTAA CTCGTCCGCT GCATTAGATA ATTGGGTTAA ACTCCCTGTA 12240
ATTTCTCCGT AACTAGCATG ATGATCAAAG ACAGTTGTAA CGCCGTTGCG AATACTATCT 12300
AAATAAGCAA CCTTACCACT TTGATAAGTG TCTTCTAATG TCAAGGCACG ATCAATTCGC 12360
CACCATTGAT CTACCAAAAT ATCCATGAAA TTTTTAGGAT GATAATTCGT TAAACTTAAC 12420
CCACGTGCAA AGGTACTATA AATATGATTG TGCATGTTAA TAAAGCCTGG CATAATCACT 12480
CCACCTTTAG CATCAATAAA TGTCGCTTCT GGATAAGCTT GTTTTAATTC TTCTGTTGTT 12540
CCGACTTGTT TCACAACTTG ACCATCAATC GCCACACACC CATTTTCAAA AAAATGATTC 12600
TGCGCATCCC GAGTGATTAA ACGCCCATTT CCAATTAATA ACATGTATTC TTCTCCTATT 12660
TTCCAACCTA TTCCCCACGT TTAGCAAACG CTTACAAATC AATCGCTCAA TCATTCACGA 12720
ATATAGCTTG TCTAGCTTTT CTAATCTATA TTTCCTACGA AAAGCATTTG AAGCGCTTGC 12780 

ATTTTTTTAC TAACGAACTC ATTAACTTGA TTAAAGTATA TCACCTTACC TTAAAAAAAA 12840 

CAATGCTACC TAAAAAATAT TTTTTAGAAT TTTGACAGCC GTTATCTTGT ATTATTATTT 12900 

TTATCATGCT ATCATTATTT TTCAGAACAG GTAGTACGCG ACATTAAAGG ATTTAGCCCT 12960 

GCCTATTCCA TTTAAAAATA TTATGTTAAA TTTGACACAA TCTTATAAAA ACTGTTTGTG 13020 

CCAGGCACAT TAGAAAGGAG CTTATTTATG TTAACCGACG AAAAACTTAC TTTTTTAACA 13080 

CAAAT 13085 
(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 



TTTTGGTATG AAACAGAAAA AGTGGTTAAT CGGACTTGTT GCACTGGGCT TGGTTTTAGC 60
aGCATGTGGA AGTGGCGGTT CGAAAACGAC CTCAAACGAA CCAGCTACAC AGAAAATTAA 120
CGTCGCATCT GGTGGTGAAC TCTCGACATT AGACAGCGCT CATTATACAG ATGTCTATAG 180
TTCCGATATG ATTGGTCAAG TAGTTGAAGG CTTGTATCGA CAAGATAAAA ACGGAGATCC 240
TGAGCTAGCT ATGGCGAAAG CAGAGCCACA AGTTAGTGAA GACGGGTTAG TCTATACATT 300
CAAGTTACGA GAAGCAAAAT GGACAAACGG GGATCCAGTT AAAGCAGGGG ATTTTGTAGT 360
TGCGTTTAGA AACGTGGTCG ATCCAGCATA CGGTTCAAGT AGCAGTAATC AAATGGATAT 420
TTTTAAAAAT GGGCGTGCGG TGCGGGAAGG ACAAGCCACG ATGGAAGAAT TTGGTGTCAA 480
AGCAATCGAT GACCAGACAC TAGAACTAAC ATTGGaAAAT CCAATTCCTT ATTTAGCCCA 540
AGTCTTGGTT GGGACACCTT TTATGCCTAA AAATGAAGCC TTTGCCAAAG AAAAaGGTAC 600
TGCCTATGGG ACTTCTGCAG ATAATTTTGT TGGCAATGGG CCGTTTGTAA TTTCAGGTTG 660
GGATGGCAAT TCCGAAACTT GGaAATTGAA GAAGAATGAT CaTTATTGGG ATAAAgAACA 720
CGTAAAATTG AATGAAATTg ATGtTCaAGT AGtGaAAgAA ATTGGCACAG GAGCCAATCT 780
TTTTGATAAT GGCGACTTAG ATTACACTGT TTTAGCAGAT ACTTATGCAC TTCAGTATAA 840
AGAGTCAAAA CAAGCGCATT TTGTACCTAA AGCCATGGTG GGTTATTTAA GCCCCAATCA 900
TCGCCGTGAA ATTACCGGCA ACGAACATGT TCGAAAAGCT TTTTTACAAG CGATTGACAA 960
AGAAACTTTT GCAAAAGAAA TTTTAGGAGA tGGCTCGACA GCTTTAAATG GkTTTGTACC 1020
AGCTAATTTT GCaAAAATCC AGATACAGGT GAAGATTTCC GCAAAGAAAA TGGTGATTTA 1080
TTGCCATATA ATATTAAAGA AGCCCAAGCT AACTGGAACA A 1121



(2) INFORMATION FOR SEQ ID NO: 381: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 



nTCATTGAAT GCnCGGAACA ACGGAAGTAT GGATGGCGAC CCACCAGCGG CGATGCGTTA 60
TACAGAAGCT CGTTTATCAC AGTTAAGTGG TGAGTTGTTA AAAGATATCG ACAAAAATAC 120
CGTCGATTTC GTCTGGAATT TTGACGATAC AGAAAAAGAA CCCACTGTTT TACCAGCCAA 180
ATATCCTAAT TTATTGGTAA ACGGCTCAAC GGGGATTTCA GCAGGCTATG CAACAGAAAT 240
TCCTACGCAT AATTTAGCGG AAATCATTGA TGGAACTGTG TATTTAATTG ATCATCCTAA 300
TGCAAGTTTA GAGAAATTAA TGGAATACAT TCCAGGACCA GATTTCCCAA CAGGTGGGAT 360
TCTACAAGGA AAAGCAGAAA TTAAAAAAGC CTACGAAACC GGCCGTGGTA AAGTAATTTT 420
ACGTGCGAAG ACAAAGATTG AGCCTTTAAA AGGCGGCAAA CAACAAATTG TCATTTCTGA 480
AATTCCTTAT GAAGTCAACA AAGCAACGCT AGTGAAGAAG ATGGATGAAA TCCGTTTAAA 540
TAAAAAAGTA GACGGCATTG CCGAAGTTCG TGACGAAAGT GACAGAACTG GCTTGCAAAT 600
TGTTGTTGAA CTAAAAAAAG ATGTCAATGC AGAAGGTATT TTAAATTATC TATTTAAAAA 660
CACTGAATTG CAAATCAATT ACAACTTTAA CATGGTAGCC ATCGACAACA TGACCCCGCA 720
ACAAGTGGGC CTAAAACGGA TTTTAGAAAG TTACATTACG CATCGCAAAT CAGTTATCAT 780
TAACCGTTGT CAATTTGAGT TGGACAAAGC TCGCAAACGT GAGCACATTG TTGCTGGTTT 840
AATTAAAGCA TTGTCTATCT TAGATAAAGT GATTGCAACA ATCCGTGGCA GTAAAGATAA 900
AAAAgATGCG AAGAAGAACT TAGTATCCGA CTATGCGTTT ACTGAAGAAC AAGCTGAAGC 960
AATCGTTACT TTACAGCTAT ACCGTTTAAC CAATACGGAT ATTACTGATT TACAAGAAGA 1020
AGCGAAAACT TTAGAACAAC AAATTGCTGA GTTATTGAAC ATTTTAAACA ATGAAAAAGA 1080
ACTATTCTCA GTCATGAAAA AAGAActTCG CGAAGTTAAA AAGCAATATG GCAATCCGCG 1140
CTTAACTCAA ATTGAAGAGG AAATCCAAGA AATCAAGATT GAAACAGCCG TGTTAGTTGC 1200
GCAGGAAGAC GTGGTCGTAA ACnTGACGCA CGAAGGCTAT ATCAAGCGGA GTAGTATTCG 1260
TTCTTATACA GCATCAAA 1278



(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4184 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 
ACTTTTTGCT TAAATTTCGT GATGTTTTGC ACTGACTATT CTTCCTCTTT TTGGTAACGA 60 

TAAATGATTC GTCCATTTAA CCCCGCTTCT CCAACAATTT CCGCGCGGTC AGGTAAAGAA 120 

ATCCAAGCAT CACGAAATTG GAAAAATTCA TTTTGCTCCA CGATTAATTC AGAGATTTCT 180 

CCAGCTTTTA ATGAAGCAAT TTTTTCTACA AAAGGATTTT CAGCCATTGA TTTCACCACT 240 

CTTTCTCTTA AATATCAACT CTTATAGTTT ACTTGAAAAT GGCGCAATTG TCTGTAAAAA 300 

AATCGAATCT TTGCCTTTTT TTAAACAAAG TCTTTTACTT GATTCTCTTA CAAAAAATGC 360 

TATAATAACC TCAAAGGTCA AAAATGGTCA AGTGGTTTTA TAGACAGAAT TTATTCCTGC 420 

TATCTAGTAC TTGGCTGGTT TGAGGAAAAA TGGTCTGTTT ATTAGAAATA ATAGAGCAAA 480 

CAACTATTTT TCTGGCCTCA GTTATTTTGA AAGGATGTGT TTCATTCAAT GATTTGTCAA 540 

AATTGTCAAC AAAATGAAGC AACAATTCAT TTATATGCAA ATGTGAATGG TCAAAGAAAA 600 

CAACTGGACT ATTGCCAAAG CTGCTATCAA AAATTAAAAA ATCAAGCAAA TAATTCCCCT 660 

CAAAATATGG GACAAGACCC TTTTGGCTTT GGGAGTTTAG ACGATTTATA TCGTTCTCTT 720 

TCTCGCCAAA TGCAACAAGG TAATCCTTAT GAACAACAAA CACCACCCAC TCAATTTGGT 780 

GATGGTGGTA ATGGAGGTCA ACCCCCTCGC GGAGGCGCTG GCGCTGGTCA AGGTTTACTA 840 

GGCGAGTATG GGATTAACAT CACCGAAGCG GCACGCCAAG GAGATATCGA TCCTGTGGTA 900 

GGACGTGACC AAGAAATTAA GCGTGTCATT GAAATCTTAA ATCGCCGTAC AAAAAACAAC 960 

CCCGTTTTAA TCGGTGAACC AGGTGTTGGT AAAACAGCCG TTGTCGAAGG TTTAGCCCAA 1020 

AAAATTGTCG ATGGCGATGT GCCCCAAAAA TTATTGGACA AAGAAGTCAT TCGTTTAGAT 1080 

GTGGTTTCAC TTGTTCAAGG AACTGGTATT CGCGGACAAT TTGAAGAACG GATGCAAAAA 1140 

TTGATTGAAG AAATCACTGA AGCTGAAAAT GTCATTTTAT TTATTGATGA AGTCCATGAA 1200 

ATTGTGGGTG CTGGTGCTGC TGGTGATGGC AATATGGATG CTGGCAATAT TyTAAAACCA 1260 

GCCTTAGCAC GTGGTGAATT ACAATTAGTC GGTGCCACAA CTTTAAATGA GTACCGTATT 1320 

ATTGAAAAAG ATGCTGCCtT AGAACGTCGG ATGCAACCAG TCCAAGTAGA TGAGCCAACC 1380 

GTTGCCGAAA CCATAACGAT TTTACATGGT TTGCAAAAAC GTTATGAAGA TTATCATCAT 1440 

GTTAAATATA CAGATGAAGC GATTAACGCA GCAGCAAACT TATCCAATCG TTACATTCAA 1500 

GATCGCTTTT TACCAGATAA AGCGATTGAC TTGTTAGATG AATCTGGTTC AAAAATGAAT 1560 

CTAACTATCC AACTCGTCGA TcCAAAAACA ATTGATAAAA AATTAGCAGA AGCGGAACAA 1620 

CAAAAACAAC aAGCTTCCGC AGAAGAAGAT TTTGAAAAAG CGGCTTATTA TCGTGATCAA 1680 

ATCAATAAAT TACAAGCAAT GAAAGAAAAA CAAATCAGCG ATGAAGAAAC ACCAGTCATC 1740 

ACTGAAAAAG ATATTGAAGC CATTGTGGAA CAAAAAACTG GCATTCCTGT CGGTGACTTA 1800 

AAAGAAAAAG AACAAACCCA ATTGAAAAAT TTAGCTGTTG ACTTAAAAGC TCATGTTGTC 1860 

GGACAAGATG ATGCCGTCGA TAAAGTAGCT AAAGCGATTC GTCGCAATCG TGTAGGCCTA 1920 

GGCAAACAAA ACCGCCCAAT TGGTTCCTTC TTATTTGTTG GGCCAACAGG CGTTGGTAAA 1980 

ACAGAATTAG CAAAACAATT AGCTTTTGAG CTATTCGGTT CAGAAGATTC GATGGTTCGT 2040 

TTTGATATGA GTGAGTATAT GGAAAAACAC AGTGTTTCTA AACTGATTGG TTCTCCTCCA 2100 
GGCTATGTTG GTTATGACGA AGCTGGACAA CTCACAGAAA AAGTCCGTCG TAATCCTTAT 2160
AGTTTAATTT TATTGGATGA AATTGAAAAA GCGCATCCTG ATGTTTTACA CATGTTTTTA 2220
CAAATTCTCG ACGATGGTCG TTTAACAGAT GCACAAGGTC GTACAGTCAG CTTTAAAGAC 2280
ACCATCATTA TCATGACAAG TAATGCTGGA ACAGGAGCTG TGGAAGCCAA CGTTGGCTTC 2340
GGTGCTGCCC GTGAAGGTGT GACTAAATCT GTTTTAGGTC AATTGAACAA CTTCTTCACC 2400
CCCGAATTCC TTAACCGTTT CGATGGTATT ATCGAATTCA AAGCATTAAG CAAAGAAAAC 2460
CTGATGAACA TTGTCAGCTT GATGTTAGAA GAAGTAAATG GCTTATTAGC TAAACAAAAA 2520
TTACACATTG AGGTTCCAAC AGAAGTCAAA GAGAAACTCG TTGATTTAGG TTATGATCCT 2580
GCAATGGGCG CTCGTCCTTT ACGTCGAACA ATTCAAGAAC AAATTGAAGA CGGCATTGCT 2640
GAATACTATC TAGATCATCC TGAAAATCAT CAATTAGTCG CAGCGTTAGA TAACGAAGGT 2700
AAAATTATCG TAACTGGCGC ACAAGAAGTG ACTAAAACAG AAACTTCAAC ATCTGACCAA 2760
GCAGAATAAT CAAAAATCCA CACGCATTAA TAACAATGCG TGCGGATTTT TTTATGCTTT 2820
TCACTTTTTA TTTTCATCAA AAAAACTATA ATAGAAGAAA GATATTCGTA TACTTGTTCC 2880
TCATAGAAAA AATTCAATCT GGAACTCCTG TACATAATTG GAAACGCTCT TTTGCAGCAC 2940
ACAACTATAA TGAGGTGATA AAATGAAATT AGTGAATGTG ACCAATAGCT ACAAACAATT 3000
AGTAAATAAA CAATTGGAAA ACACGGATGC TTACTTTGTA AAAGTCTACT CAGCAGGAAA 3060
TACGACCGCT GTCTATTCAG AAGCAGCTCA CCATGTGGAA GTTCTGATTA TGAATAAAAA 3120
TCGTCGCATT CGTCCGACAG AAGTAAAAGA AATTTTAGCA AAACTGTTAA AACGATTACC 3180
TAAAGAGGCT TATGACCCAG ATGAAATTTC TATCATTGAG TTAAACCATG TGACCGAAGT 3240
GTCTGTTCCA TTAAAAGTTT CCCTCATGGA AAACTAAAAA AAACTGCCGA AGTTTCCCCA 3300
CTTCGGCAGT TTTTTTAATT TATTGTGACG TTATTTTTTT TACAAGACTT TGATAATTTT 3360
CCGACCAGGC TTCCTCATTT GTCGGACTGA CTAATTTATT TTCCTCGATT GCTTTTTTTC 3420
TTAAGTCATC CACTTCTTTT TTGAAATTTT CCCCCGATTT TTGAGCATAA TCTTTAGAAA 3480
AATATTTTGA TTGGCGAGAA TAAGCCTTCA GTTTCGCATC TTTAATCTCT TCTTCTGTAT 3540
CTGCAAAATA TTCATCATAA CGAAGATCGG TTATTTTCCC TGAATCTTTT AACGTAATAA 3600
CAGTTAAACG TCCAGAAAGC CCCTCCTCAA AATCTTCTGT TAGACTTGTA TAATGTTCTT 3660
TTGAAGGTTC TTTGATTTCC TTGGCTAAAG CCCTAGCAGC AGGAAGATAG CCATTATTAG 3720
CGCTATTGGA TTGACCTTTG GCTGTCTGGA AGGCGCCATC TAGGCGATTT TCAGCAACCA 3780
TTTGATTTTC TAGAAAAGTC ATCGTATTAA CAACGGTCAC TAACGATTGA TCAGTTCGGG 3840
CGTTGTTCGC TTGAAAATTT GCATAGCCGG ATAAACGTTT tGTCACTCCT GCCCATTCTT 3900
CAGAATAATA GTTTTTAGGT CCACGTTCAT CAAATTCCAC TAACTGAATT TGTTGATCCT 3960
TAACAACGAC ATCAATTAAA GCCGTATAGC CGCCATCAAA CACGACTTCA TGATGATAAT 4020
AATCCCCTTG AACTTCCCCG ATTTCTGGAC GACCTGACCA ATACATCTTT TGAGCTTTTT 4080
CTTCTGGACT AATTCCTGAT TGTCCATTAT TAGCTTCTAC TGTAGCGCCA GAAACAGCAT 4140 
CAACTTCAAA GGTATGTCCA TTACGCTCAT AACTCACTTT GGCA 4184 
(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6159 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 



AAAAACACGT GTGTCTGTTC CATaATAGAC ACGTGTAGGA TCTGCGCCAG GAACATATAA 60

AACGGCCATC AGCGTCACCG AAATTAAAGC GCCGATTAAA ACTAAGCTGA AAATTCGACC 120

GCGCTTACGA ACAAAGCGTT CCAGAAGAAT AAAAAGTAAT GGCCAAATTA AATAATTTTG 180

CGCTTCAACT GCCAATGACC AAATATGTAC AAATGGCGAT TGTGTCGCAA ATCCATCAAA 240

ATAAGAAAAC CCTTTAAAAA TTTGCCACCA GTTGTTGTAA TAAAAAATAC TACTAGCGAC 300

AACGCTGCGA ATATTGTTTA ATAAATCCTG CTGAAAGAGT GTAATATAAC TAACTGCACT 360

TAAAAGCATC ACAATTAATG CTGGATACAG CCGTTTCATC CTTCGCACAT AAAATCCTTT 420

TAAATCAATG GTCTGATTTT CTGCCCATTC TTGCCGCAAC AAGTCGTTAA TAAGATAACC 480

TGAAACAACA AAGAAAATCG GAACACCTAA GTAGCCACCT CGCATGACTT GGGGAAAAAG 540

GTGATACAAA ATCACGCCAA TTACAGCGAT CGTCCGAATG CCATCAAAAC CTGTAATATA 600

GCGTTTTTTC TTGCTATTTG CTGtCTGATT CATTTGtGCT TTCCAACTTT TCTATAAAAT 660

TAwTCtTTkC CyTTAtTTAA AGaAGCAAGT AACTTTaATA TACGtTTTTT CCtAGGAAAT 720

GGCAAGCAAA CTTTGCCTCC TTTAGGCTTT TTTTACAATT CAAAAAACCA GCATTTTTTA 780

AATCACAAAA AAACAACCAA AGCTTATGAA AATCACTCAT AAGCTTTGGT TGTTTACTCA 840

TATCGTTTTC GTTTaGakGA rGATGGaATC CCTAGGGCTy CACGaTwTTT TGcTACCsTT 900

cGCCGGGaTA TTTCCakGCC kGAATCTTTw AATCGCTCGA cTrATTTTGg ATCcAGATAA 960

CGGTTTACTT TTTACTTCTT GATCAATTAA TTTTTGTAGT TGCTTTTTAA CCATCGTTGT 1020

TGAGACATCC TCGCCATCTT CTTCTGATGA AAGAATACTT GTTGAGAAGA ATGAGCGCAA 1080

TTCAAAGACG CCAAACGTGG TTTCTAGATA TTTTCCATTG ACCGCACGAC TAACTGTCGA 1140

TTCATGAATA GACAAAGCAT CCGCAATTTC TTTTAAAGTC ATTGGTTTTA ATGGCCGATC 1200

TGCTTCTAAA AAGAAGGCTT CTTGACGACG CACAATTTCT TGCCCTACAC GTAAAATTGT 1260

ATCCCCCCGC TGATTCACTG CGCGCTCTAA CCATTCAAAT TCGTTTTTCT TATCCTTAAT 1320

ATATTCTTGG ACTTCTTTGT CGTCAGTCTC TTGCATTCGT TTGAAATAAT TTTCTTGAAA 1380

ATTTAATGTC GGTCGACCAG CTTTGTTGGA AATCACCACT ATTCGATCTT CCTTGATTCG 1440

GACAGTTAAA TCTGGTCGAA TATACAATTC CTCGGTCGCA TCAAAAATAT TCCCTGGCGC 1500

TGGCGATAAC GTTTGTATAT AATCAAAAAT CAATTGAATT TCTGCTAATT CAATCCCGAA 1560

TTTTTTAGCT AACGGGCCCC ATTTACGACT CACTAAGGCA TCAAATTCTT CCTCCAATAA 1620 

AATATACGCT AAATTAGGCG CGGTATCGTC TCGTTCTGTT TGTAGCATCA AACATTCTTG 1680 

TAAATTGCGT GCCCCCACAC CTGCTGGATC TAGCTGTTGT AACAAAGTTA ATGCATCTAG 1740

CATTTGAATG GCGCTTGCTT CGGTTTTCAC TGCCGCTTCT TCTAACGAAA TCGTTAAATA 1800 

ACCATTCACG TCTATATATT CTACTAAAAA CAATACCAAT GTTCGCAAGT ATGTATCGCG 1860 

GTAATTCAAA TGAATTTGAT CAATTAAAGA CTCAAATAAT GATAAATGAT AGTCTGGAAT 1920 

TTGATTCATA TAATTATTTT CTTCGTCTTG ATTGGTGTAA GAACGACTAG TTATCGGAAA 1980 

ATCTGTGGTG TACTGCGTGT CTACTTGTAA ATCAATCAAT GGATTCTCTA GTGCTTTCGC 2040

TTCAACAAAG GCAGCGAGTT CTTCCGTATT AAATTGTAAA ATTTGAATGG ATTGTTGAAG 2100 

CTGTTGAGTC ATGGCCAACT TCTGGACTTG TTTTTGTTGT TGGGACATTC GTTGCTCAAA 2160 

TTTCATTATT TTCGTCCTCC CTCTTGTAAT TTTTGTCAAT ATCAGCTAAA CTAGTATTCG 2220 

TGCCGCGCCC TTAGTGTAGT GGATATCACG TAAGATTCCG GTTCTTGAGA CGGGGGTTCG 2280 

ATTCCCTCAG GGCGCGTTAT TTAAATATGT TGATTTATCA ACGTTTCTGG ACACTCTTTA 2340 

TTCATCTAAA GAGTGTTTTT TTCTTGTTTA GGAATGATTT TTCTATCCGT ATTTTTTTCA 2400

AATGTTGCTT GTGGCAAGCT CAATCGCACT TAACGCGACA ACATTTGTTC TGCTTCAGCT 2460

ACAAATCCAA TAAGGTTAAA ATTCCTCCGT TCCTATTGTA ACGCATAACG TTTCCTTTTT 2520 

CTATCTGCTA AACAAAGTAT TACGACAAGC AAACACCTGC TTCTCTCTTA AAGTTCCCTT 2580 

TCTGATTATA TAGTAAATTG CTTTTAATTT GTTAATTATT TTACCTAAAA CTTAATACCC 2640

TAATTAAATA AAGAACCTGG GGCAAAAATC ACTTTGGATT TTTGTCCCAG GCTTAAAACC 2700 

TGATAATTGG CGGGAACAGA ACCGTTCAGC TCCGACCTCT AAAATACTTA ATTAGAAATA 2760

TACTACCAGC AATCACACCA ATTATGATTA CTAGCCAAAA TAATCCACTT ACAAGTAACA 2820 

GCTCAGTTAA GGTCGTCTTC AGATTCTCCT CCTTTAAATG GCCTCTGCTT TGTTTTGCTA 2880 

TGTAATTAAT GGCGAGATCA GACAACAGCT ATCAAAAAAA CGGACCAACA CGTGGTCCGT 2940

TTTCATTCAT TATTAAGCTT TTACGATGTT CGTTGCTTGT AAGCCGCGTG GACCTTCTTC 3000 

AGTATCAAAA GTTACTGCTT GGCCTTCGTC TAAAGTTTTA AATCCATCTG ATTGAATAGC 3060 

TGAAAAATGT GCAAATACAT CATTGCCATC TTCACCTGTA ATAAATCCAA AACCTTTGTC 3120 

TGCGTTAAAC CATTTTACTG TACCGTTGTT CATATATATT TCCTCCTAGT GCATAAAAAT 3180 

ATGCAAGTTA TTGCAAAAAA TGTTGAGTGG CAAAAGGAGA AGTATAGTTT TTAAATATAT 3240 

AACTCAAATT ACTTATCAAA ATTGATTACT TAATTACTAT AACATATTTA TTAGGAATAA 3300 

ACAATGCCCA CGACTCTTCC TTGGACTTTC TAAAATAATC AAAGATGCGC TTGACAAAGC 3360 

CCTAGCAAAA AACACTCCTA TTCTCTCTAT TTCCAGTTAG TTTTGTTTAA CTTTAAATAA 3420 

TCACTACGCG CCAAATAATC AAATAGCGGA GcTAATCCTG GTGAAAAATT TACTTCCTCA 3480

AACCGCTTAA TAAATTGTAC GCCTTTAATT TTATTTGGTT CGTTCAGGGT CACTTTTCCT 3540

TGGGCTTGTT TGGCAAAATA AAAATGTCCT TGCCAAACGA TTGAATCAAA AACAAGTGGT 3600 

CCAGTAAAAA TTTCTTCAAG TTCAGTAATT TCCACTGCTA ATTCTTCTTT CATTTCACGA 3660 

ATCGCACATT CCAGCGCTGA TTCACCAAGT TCTTGTTTTC CGCCAGGGAA ATTCCATGTG 3720 

TCTGGCCGAT CTTTTAAAAT AAGAATTTCA TCAGATTGAT TTTTAAGAAG TACTCGCGAA 3780 

AATTCTTTTT TCATTTCGTT TAACCCTCTT CTTCATTTTT TATTGAGAAG CCACACTTGG 3840

TAGCATAAAA ATATAATGCA AAATACGATC CGCATACGAC GGATTTTTTT CAAAGTCATT 3900 

GACTTCTTCA TTGACTAACT CCAAATCATC GGTAATCACT CCAGCAACGT CCATAAAGAG 3960 

CATTTTACTC AATACCTCTG AGTCATTAAC TGTCCAAGCA TAAACTGCTT TTTTCTGTTG 4020 

GTCAGCTTTT AAAATGAAGG TATCATTCAA CGTAGTAGCT TCCATTGTAT ACAAATCAGC 4080 

TTGTGTTTCT GGAAAAGTGA AATTATAGGG CAAAATAAAA CTAACTTTTA CTTTTGATTT 4140

GGCTTTTTTA AACGCTTTGA TCACATTGTA GTCGAGAGAT TGAACCTGAT GGTTATTCTT 4200

TATCAGAGGA CGTTCATATT TTTCAATAAA GTTAGAAAGC GCACCTTGAC TATCTTGTTT 4260

AGACGTTTTA ATTTCTACTA GTAATTTTTG TTTGGCTTGG TTTGCTTTCG CTAAATAATC 4320

ATCAAAACTG GCAATTGGTG CTTCTTGGCC ATTTTCTTTA ACGGTCATTT TTGTTAATTC 4380

AGCCAACGTA AATTCTTGCG GCGTGCCATC AACGCCGGCG AGTTCTTGAA GGTTTGCATC 4440

ATGCATCACA ACAAATTGAT GGTCTTTGGT TTCTTGAACA TCAATTTCAA TATAATCTGG 4500

TTTAAATTTA ATGGTTCTTT CCATAGCAGG AATGGTATTT TGTACGCCAT TACTATTGGT 4560

GACCCCTCGA TGCGAAATAA TCAAAGGTTT TGATTCTAAT AGCCCTGTTA AGTAAACAGC 4620

GTTGAAAATT ATATAGCCGC CAAAAATGGT GAATAGTCCT AGGCCCATCC CGATAGTCAC 4680

TAATTTATTA CGTTTCGTTT CTTTGTCAAA GGCAACTTTT GTCGTCGTTT CTGCTTGAAC 4740

ATTTAATTGT GTATATAAGA AATTAATCAG TAATACTGAA AGCCAAGCAT TAGAGACAAA 4800

CTGTAAAAAT TGGACAACGG TTAAATTTAA AATCCCACCT AGCAAGCTAA TGGTATCGCT 4860

CAAGGTATCT AACTTCAACT GTAGAAGATA AAGTAACACA TAAATAACAT ACGTTGAAAC 4920

AAATACCGCG ATCGTCACAA ATGCAATATT ACGAATAATG AACCATAATC GTTTGTTTGT 4980

CATTGACCAA CTAGCCTTAA CTGCCTCGCC CGCTTTTTTT CTCTCCAGAA TCATTAATGG 5040

AAGCGTGTAA ATGAAACGAA TAGCTAAATA CCAGACnnrt AACCCAAAGG CCAGTAACCC 5100 

CACCAAGTAA GGAATTCGTT GGGACAAAAA TTCAACAATA AAATCTGGAA TCACAAATTT 5160 

CGTCAATAGA TTACTTTGGA AAATCAGGTT ACCAAAAGGG ACAATGACAA CAAAATAACC 5220 

GAAAAAGAAT AACCACGAAC GGGCACcTTG CTTTTTAAGA aCCTTAAAAG ATGCCTTGGT 5280 

AACTCCTTTA AAATGAAAGC GATGGTCTTG CCGTATTTGA AAAATCCCTA ATAATAAAAA 5340

GGTAAACTGA TAAAAAACTG CACCTAAAAA AAGTAACAAC AAGAGAATTA ATAACACCAG 5400 

AGCAAGCGGC TTTTCTAACA ATAAATTCAG GTTATTAAAC GAAAGATAAG GAATACCTTG 5460

AGTTTTTAGT GCGTTGGAAG TAGCCCAACT AAAAGCCCCA CCCACCAATA ATAATAAGAC 5520

ATTTAAAGTA AAAAATAACC ATAAATACTG CAAGATGTCT TTAAAAAATT GCCTTGTTCC 5580

ATTTTTAAAT GTTTGCCAAG CAGACATACA AACTCCTCCA TTCTAAAAGT ATCTGATTAA 5640

AGTTCTTCCA TACGAAGCCT AAATAAAACA ACGGCAATCC AAATTGAAAT AAATGATGGC 5700 

AAAATAACAA TAATGATTTC GACTGGCTCA TCAATAGTTC CTTGTAACGT ATTTTTTACT 5760

TCTGAAGAAA CAATaAACTG ATCAAAAATC ACTTGATAAA TAGTAACATA AATCAATACC 5820 

CATTGAATAT AATATTCAAT TATTTTCaTT AAAGCGATGG GTAAAACAAT TTTTTTCGTC 5880

GTTAAATTCA ATAAAAAATA TAACAAAAGA CTCGTACTGA TGATAACGAC TATCCCTAAA 5940 

AGAAATAACG ACTCTAACCG CTGCCCTATT ACGTTTGCAA CGCCCATTGT TAATAATAAA 6000

TACATAGCTA CCGAAATTAA GATGCTCACA CTTAGCATAA CAGAAAACAA AAGTAATGCC 6060 

TTAAGAAAAm sCCACTTATC AACTCTTTTA TTTCCAATCA ATAAACCTAA AAGAATCACC 6120 

GACAAAAGTA AAATCAATTC AATCATTCGT TCCTCCTCC 6159 



(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1366 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 



GTAATAGCAG AAATGAGGAA AAGAAATGCA GAAAGAATCC ATTTATGGAC TGACGAGAGA 60

ACAGCTTGTG GACTGGTTTT TAGCCCATGG CGAGAAGAAA TTTCGCGCAA CACAAGTGTG 120

GGAATGGTTG TATACAAAAC GTGTTGCTTC ATTTAGTGAA ATGTCGAACA TTTCTAAATC 180

ATTGATGACT TTATTAGAAG AAAATTTTTC ATTAAATCCA TTAAAACAAG TCATTGTGCA 240

AGAAGCACAA GATGGTACCG TAAAATATTT ATTTGAATTA CCAGATAAAA ATATGATTGA 300

AACAGTCTTG ATGCGTCAAG AATATGGATT ATCTGTTTGT GTGACCACTC AAGTTGGCTG 360

CAATATTGGG TGTACATTTT GTGCTAGTGG GCTATTAAAA AAACAAAGAG ATTTAACTGC 420

TGGCGAGATT GTTGCCCAAA TTATGTGGGT GCAACATTAT TTCGATGAAC GTGGGTTGGA 480

TGAACGTGTT AGCCATGTCG TTGTGATGGG AATTGGTGAA CCATTTGATA ATTATGCCAA 540

TGTCATGAAC TTTTTACGAA CGATTAATGA TGATAAAGGC TTGGCTATTG GTGCCCGTCA 600

TATTACAGTT TCAACAAGTG GTTTAGTGCC AAAAATTCGT GAATTTGCAG ACAGTGGACT 660

ACAAGTAAAC TTAGCAATCT CATTACATGC TCCTAACAAC GAAtTCGTAC ATCAATTATG 720

CGGATTAATC GTAGTTTTCC AATTGAAAAA TTAATGGCAG CAATTGATGA ATACATTGAA 780

AAAACCAACC GACGTGTTAC ATTTGAATAT ATTATGTTGA GTCAAGTGAA CGACCGACCA 840

GAACATGCTC AACAATTAGC AGATTTACTA CGGAATAAGA AAAAACTTTC TTATGTTAAT 900

CTGATTCCGT ATAACCCAGT GAGTGAACAC GATCAATATA GCCGAAGCTC TAAAGAAGCG 960

GTTTTAAAAT TTTATGATGT ACTCAAAAAG AATGGCATCA ATTGTGTTAT TCGTAAAGAA 1020

CACGGAACAG ATATTGATGC GGCTTGCGGA CAATTACGCA GCAAACAAAT GAAAAAAGAA 1080 

AAAGTCmAAA ACCAATAACG TAGTTAAAAT CGGATCAGAA AGGTTTCTTT TTGATTCGAT 1140

TTTTCGTTTA TTTAATAAAA AATCGTCGGT TTTTCAATTC TTGTTTaTAA ACaAAAGATG 1200 

rATGCyCtAA AACtTTTATG aCAAwTTTyC TTTTtGTGGt TTTcaAnCTT GaAATyCTAA 1260 

AAGGAAATAG CTATACTGGA AAGGAGATCC GTCnCCnGAT TTTTTCCTGG AAACCGCCTA 1320 

TTATTTTCCG nTAnCCTTTT CCAAAAAGnT GGGAATTTTG GTACGC 1366 
(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

CATTTCCGAT TATGCCGGCG TTTAGTTTTT TGAAAATCGA TTTTAGTGAT ATTCCGATTC 60 

TACTCGGAAT GTTCTTGTAC GGACCGTTAG CAGGAGTAAT TACTGCTTTT GTTCGTTCGT 120 

TGCTACACCT GTTCTTAACC GGACTAGCAC CGCAAAATAT GGTGGGAGAT TTCGCTAGCT 180 

TTTTAGCAAG TAGTATCTTC ACCTTGCCAA TTTTTTATTT CTTTGGTAAA AAGAAAAATA 240

TCCGTACAAA TCGGATAGTG GGCTTAGTAA GTGGGATCTT AGCCTTGACA ATTTTCATGA 300 

GTATTGCGAA TTATTTTGTC ATTACACCCA TTTACTTACA ATTATATGGT GTGACCACAC 360 

AACAATTTTT AGGAACATCT TTAGCAAGCT ATGTGGCGAT TGGTATTGTG CCATTCAACC 420 

TTATTAAAGG CCTCTTAGTC AGTGGTGTTT TTCTAGTACT ACATGCGAAG TTATTGCCAT 480

GGCTATCAAA AAAACAACAT ACTATTCAGA AAAAAACACC GTTAACAAAA TAAATGATAA 540 

AAAACCTGCT GTTGAACAAT GTTTGACAGT GGGTTTTTAA AATTTACGCC TAAAAGAAAA 600 

GAGGkTGTCA TAATCTGtCA TCATTCTGTT AAAAAATACT AAACCATCTG CATTGTAATT 660 

TTAACTTTCC CTTGGTATGA TAGAAATCAT CAAAGAAAGA GGAGTTTTCC TGAGTGAAAA 720

AGTGTAAAAA TTGTCGTCAC GTGAATCGTG ATACGGAGTC CTTTTGTGAA GAATGCGGCG 780 

CTCCGCTAAT GAATGAATCA ATGCATCAAG AGGAAAACCA AGCACAACCA TCAATGAATA 840

AAGGGGACGA TCTACTCCTC TCAGATCAAA AAAGAAGCTG GATCTGGGCG TTCCTTTTGT 900 

GGTCCATCCG TCCTGGGGCC GGGAGCCnAT TTCCTCGGGA CCGCCATnAT TTTCCAAAGG 960

ACCACCAAAT TCCCTAATTT AATTGAAGCC GATTGAGGAA TnGGGGTGnC CCAAGGAATT 1020 

AA 1022 

(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8145 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386:

TGGTTCAGGT AAAGGTCGTT TTATTTATGA AATGGCTAAA GCACATCCTG AAATTAATTA 60 

TATTGGTATC GATATGCAAT TAAGTATTTT ATCGATTGCA TTAGATAAAC TAGTGGCCGA 120 

TCCGTTACCT AATCTACAGT TATTGCGAGT AGACGGCGAA CATTAACAGA ATATTTTGCT 180 

GAAAATGAAG TAGATTTAAT CTATTTAAAT TTTTCAGATC CTTGGCCAAA GAAAAAACAT 240 

GAAAAACGTC GTTTAACGTA TAAAAACTTT TTAGCGACAG ATGAAATTAT TTTAAAACCA 300 

AATGGAGAAA TTCATTTTAA AACGGATAAT CAAGGGCTcT TCGAATACTC ATTwAgtAGC 360 

TTTTCTAAAT ATGGCATGAT TATTGAACGT GTGTGGCTAG ATTTGCATAA TAGCGAGTTT 420

GAAGGCAACA TTATGACAGA ATACGAAGAA AAATTTTCTA GTCGTGGTCA ACGAATCTAT 480 

CGAGTAGAAG CACGTTTTGT TGCAAAATAA AAACAAGCAT TCCTTTATTT TAAGTGAAGG 540

AATGCTTGTT TTTTATTATT TAAGAAAGCC TAAGTTGTCG TTCTTAGAGC ATAGGCGTCC 600 

AATAAAATTC TGGCCAGAAT GGATAATGGT AACCGTGAAC GCACTTCATA ATCGGGGAAA 660 

AAGGAGCCTT CTGATTTGAA GCCTACGAAC GGAATTTCAA TGTATTTCAT TAAACTAGAG 720 

TGTTTGCTTG CAGTTAAAAG GATAGTCCCA ATTTCTGAGT CGTAACAGTT TTTACCAGCG 780

ACAACTAATT CTTGGGTTTC TCCGTTTAAA GAAACAAAAA GAGCCACATC CTTTTTTGAT 840

AACTTTTTAC TAATATTTTT AATAATTTCT GGGTCCGTAT GCAGTTCACA GTATTTGCCA 900 

GCCAATTGAA ACTTAACCAT CATTTCCTGA GCAATTAATT CTGAAAAACC TCTAGCAAAG 960 

ATAATAATCC GTCGAGAAGC CTTGATTTTT TGGATCGCAT CTTCGATATT ACCAATTTCA 1020 

ATCATGTTAA TCGTGCGAGT TACTTCTTCT TCATTCTTTA AAATGGCTGT CCGAATTCCT 1080 

CGGTCTACTT TTTCTAATGC TGAAAAATTT ATATTTTGGT TTTCTTTATC TTTCAAGTGA 1140 

TGCTTGAAGG ATGTGAAACC TTCATAGCCT TTCTTTTTCA TGGTTCTTAC AATCGTCGCA 1200 

GTTGATACAT TGGCTCGTTC GCTTAATTTT ACAATGGAAT AATCAGGGAT TTCTAAAATA 1260 

TGTGTTTCGA TAAAATGCCA GAGATATTGC TCTGACTCAC TCAATTTTTG TTCCATAATG 1320 

ACCAGCCCTT CTTTCTTTCA CTAAAAAAAC ATTTATAAGC CTATTTTTAA TGAAAAAAAA 1380 

TAAAAAAGCT AGAGAAATTA TGAAATTGTA AATGTTTTCT ATATTTTTGC GAAAGAATGA 1440

AAACCTTTAC ATGGCATTTT GAGACATAAT AAAGAGGAGA AAGGAGGAAC TACAGTGACT 1500 

ATTTATACAT GTACATTAAA TTTAGCCATT GATTTATTTA TCGAAACAGA AGAGCTAGTT 1560 

CCTTTTGTCG TGAATCGTAC GAAGGAAGAT GATATTCAAG CAAACGGGAA AGGAGTCAAT 1620 

GTTTCCCTAA TTTTAAAGAT GTTAGGGATT GATAATACCG CTTTGGGAGT CAAAGCTGGT 1680 

TTCACTGGGA ACTATGTCGA GGACTATTTA AAAGAAAAAG AAATAACCAC TGATTTTATC 1740

GAAGTCGCTG GTACAACGAG AATTAATGTG TTTACAAAAG TTACCCAAGA TCAAAAGGAA 1800 
ACCAAAATTA TCTGAAGAAC ATGTACAACG TTTTTTGAAA 1860

AGGGGATTAT TTGTGCGTAT CAGGAAGTTT ACCACAAGGG 1920 

TGAAATTAGC CGAATTTGTT TTGAAAAACA GGTGTTCTTA 1980 

AGAAATTTTA GATTGTTTGC CTTATCAACC ATTTTTATTG 2040

TCAAGAGTGG TTTCATACAG AAGTGCAGAC TAAGGACGAT 2100 

ATTACTGAAG CGGGGAGCCA AAAATATTTT ACTTTCTTTA 2160 

TATGAATAAT GAAAAGGTTC TTTCAGGTAA TTCACCAACT 2220 

TTGTTCAGGG GATGCAATGT TAGGTACCTT TTTAGCTGGT 2280 

GGAAAAAAAT TTGAAAAGAA GTATTGCAGC AGGTAGTTCA 2340 

GACAGATTTT TCAGATGTTC AGGAGTTAGA GCAACAAATA 2400

AACAGATGGC TACGTATCAA TTAATAGCTG CTACAGGTTG 2460

CGTATATGGC ACAGGAGGCA TTGGAACAAG CTGCGAAACG 2520 

TTGAAACACA TGGTCAGATT GGTATAGAAA ACGAGTTAAC 2580 

CGGAGGCAGT CATTATTGCC GCCGACAAAG ACGTTCAAGC 2640 

GAATCATTGA TGTCCCTGTC AGTGTGGGAA TTAAAGAAGC 2700 

CGCTCGCAGG CAAAGGATCG ATTGCGGCCG AAAACCAAGC 2760

AAACACAGAT TTCTTCAGGA AATGTTGGAC ATAGTATTTA 2820 

TCTCTCACAT GTTGCCGTTT GTAGTAGCAG GCGGCGTTTT 2880 

TTTGGGGCAT TTATTCATTT GATCCAGAAA GTAGTCAGTA 2940

TAAAAAGTAT TGGTGATGCC TCAATGGGCA TGATGGTGCC 3000 

CAGAGGGGAT TGGCAAACGA CCTGGTTTGG TTGTGGGCTT 3060 

ACACTGGTGG CACTGGCTTT TTAGGAGGGA TATTATCTGG 3120 

TTCTTTTATT ACAAAGAGTA TTTAAAGGCT TGCCTAAATC 3180 

TTTTCCTTTA TCCAGTCATT GGCGTTGCAG TTATTGGTAT 3240

ATCCGATGAA AGCTGTTAAC GAAGGAATGA TGAGCTTTTT 3300 

GTCCGTTGGT TTTAGGAATT ATTGTGGGTT GTATGTGTGC 3360 

TTAATAAAGC GGCCTATGTT ACTGGGACGG CACTCTTAGC 3420

TGGCAGGAGT TTCTGCTGCC TGCATAGCGC CACCTTTAAT 3480

TCTTTGGAAA ATACTTTGAC ACAAATGAAC GCAATGCGGG 3540 

GCTCCACACA TATTACAGAA GGCGCTATTC CGTTTGCGGC 3600

TACCAATTAT GATGCTCGGA TCTTCAATTG CGGCAGTCCT 3660 

AAGTGCCAGC ACCTCATGGA GGATTTCTTG TTCTACCAGT 3720 

GGGTTTTGGC AATTTTAGCT GGTTCATTGG TAGGTGGCTT 3780 
ATTGCTAGGT TTTGTACAAA AAAGTAGAAT CGCCAAACAA GAGCAATCGT TATCAAAAAA 3840

ATGAAATAAC AGGAATTAGG AGTGGATGGA ATGGCTTTTA TTAAAGAAAA TCATATTTTT 3900 

TTAAATCAAC AGTTACAAAC GCAAGAGGAT GTTTTTCACT TTTTAGCAAA AAAATCAACA 3960 

GAATTGGAAG TTGCTCAGGA CGCACAGGAA GTTTTCGATA AACTGAACGA GCGAGAACAA 4020

GAAGGAACAA CGGGGATGAT GAATGGATTT GCGATACCTC ATGCAAAAGC AGCAACGATT 4080

CAACAAGCGG CCATTATCAT CGTTACCTTG GACCAAGGTG TTGAGTGGCA GAGTCTTGAC 4140

AACCAATTAA CGGAGTTTGT AATTGCGCTG TTTATTCCAG ATGCCGAAGC TGGCACGACA 4200

CATTTGAAAT TACTCTCTTC TGTTGCGCGC CTCTTATTAA GAGAAGAAGT TACTTCTGGA 4260

CTGAAACAAG CAAGTTCGCC AGCTGAAATT GCTACGTTAC TAAATAATCA ATTAGGAGAG 4320

GGAACAGAAT GAAAAAAATT AGTGAACAAA AACGTAAACA TTTAGAAAAT CTGGTAGATG 4380

ATCAAGGGAT TATTGGCGCA CTAGCCATTG ATCAACGTGG GGCCTTAAAA CGCATGATGG 4440

GAAAATACAA AGAGGTGACT GCGCAAGAAA TCAGTGACTT TAAAGTATTA GTATCTCGTT 4500

GCTTAACACC AGAGACTTCA GCAATTTTGT TGGATCCAGA ATATGGCCTA GCTGCTGCTG 4560

AAAATCGAGC ACAGACTTCA GGTTTGTTAC TTGCGTATGA AAAAACGGGC TACGATGCTT 4620

CGACACCAGG ACGGTTACCC GATAGTCTTG ATGTTTGGTC GGTTAAGCGT TTAAAAGAAG 4680

CGGGCGCAGA CGCGTGTAAA TTTTTGCTTT ACTACGATGT TGACGAAAGT GAAGCAATCA 4740

ATGAGCGAAA GAAAGCTTAT ATAGAACGGA TTGGTTCAGA GTGTTTAGCC GAAGAAATTC 4800

CATTCTTTTT AGAAATTGTC TCTTATGATG CAAACAATTC GGATAGTGCT TCAAAAGAAT 4860

ACGCTAAAGT AAAACCGCAC AAAGTAATTG AAGCGATGAA AGAATTTTCA AAGGATCGCT 4920

ATAATGTTGA TGTTTTAAAA GTAGAAGTCC CTGTGAACAT GAATTTTGTA GAAGGTTTTG 4980

GCACTGAAAG TCTTTATAGC CAAGATGAAG CACAGGCCTT TTTCAACATG CAAAGTGAAG 5040

CAACACAACT TCCATTTATT TTCTTGAGTG CAGGGGTTAG TGCTACGATG TTTCAGGAAA 5100 

CATTGAAATT TGCAAAAAAA GCAGGTTCCT CGTTTAATGG AGTCTTATGT GGTCGCGCAA 5160 

CTTGGGCAGA CGGCGTGCTT CCATTTGTTC AACAAGGTGC TGAAGCAGCC GTTGCGTGGC 5220 

TTGAAACAAC TGGTAAAACA AATGTTGAAG AATTGAATCA GGTTTTAAGA GAGAGCGCAG 5280 

TATCAGTCTT TGAAAAAATA CAGTAATTAA GCTGAAAATA TATACTGAAT TTAAACTATT 5340 

TGGTCATCTG AATTAAATTT TTTCAGAAAT CGTCTTCTTT AAGTGAACTC AATGTACGCA 5400 

AACTCAAAAG ATATGATATA ATTATTTCGA TATCGATGTA TGAACACCCA AAAAGTAACA 5460

TTGCTTTACT TTATTTGGGG AAAGTTGGGG TATTATGGGA ATTCCTAAAG AAGGCGAGTT 5520 

TGTAACCATT CAGAGTTATA AACATGACGG ACATTTACAT CGAACGTGGC GTGATACGAT 5580 

GGTACTAAAA ACGAGCGAGT ATTCTTTAAT TGGTGTGAAC GATCACACGT TAGTAACGGA 5640

ATCCGACGGA CGTCGGTGGG TGACTCGTGA ACCAGCCATT GTTTATTTTC ACAAGAAATA 5700 

TTGGTTCAAT ATAATAGCAA TGATTAGAGA AAAGGGGGTT TCGTATTACT GCAATCTGcT 5760

TCGCCATATG TTTTAGATGA CGAAGCGTTG AAGTATATTG ATTATGATTT AGATATCAAG 5820

GTTTTTCCAG ATGGTGAAAA ACGTTTACTA GACGTTGATG AATATGAATT TCATAGCAAA 5880 

CTAATGGATT ATCCAGAAGA TATTGACTTC ATTTTAAAAG AAAATGTCAA GACTTTAGTG 5940 

GATTGGATTA ATAACGAAAA AGGCCCCTTC TCGCCCGAGT ATGTAGATAT CTGGTATCAA 6000 

CGTTATCAAC AGCTATCAAA AAAATAGCCG ATGTAAACAG CCCACAAAAG GrGCTTGGAG 6060 

CATTAGCCAA AATGGCTAAT GCTCCAAGCs CTAAATTTTC CGATTTAATT GGTGTTATAG 6120 

TTCTTTGTTC ATTTCGTAGT GTTCAATGCC AGCATCCATA AAAGGTGCTC CTGTTTTTTG 6180 

ATAACCTAAA CGTTCATAGA AGCCTACAGC GGTGACTTGT GCTCCTAGAG TAATCTTGTG 6240 

GTAGCCTTGT TCTTTGGCAA AATTTTCCGC CGCTTCCATA ATCACTCGTC CATAGTCGGC 6300 

GCCACGGTAA GCTTTTTCAA CGGCCATTCG TTGTAATTTC ATAACGCCAT TTTCTAGTGG 6360 

AAGCAAGCGA CaAGTTGCGA CAGCTACATG GTTCGTGTAC AAAACAAAGT GAATACAATG 6420

TGCTTCATAT TCATCGATTT CCATCTCTTC TGGTACCCCT TGTTCCGCTA CGAATACGCG 6480

TTGACGAAGC CGGACAGCAT CTAAATAAAG ATCACTCATG GTATCTTTTG TTTGAGCTAC 6540

ATGCATnGAA AaaCCCCCAg TTCAGAAcTG tkTCTATTTT AyTATACCtG AAAAAGTTCT 6600 

AGCmATGAAG TATTCTaCAA TGACTGAGGG GGGGAAGGTG TGATAAAtAA AACGCAACAA 6660 

TTGAATAAAn ATAAAGGAGG AGCCGCTGTG ATTAATCGAT TTAAAGAATC GAAATTATTT 6720 

TTCTGGTCAG TTGAATTAOT TGTCGTGGCC AtGCTATtAt TTAwTGCTTC GAAAATTAAT 6780 

TTCTTATTTG CACCGATTGG GACATTCTTT TCAACGTTGT TTGCCCCAGT CTTAGTAGCA 6840 

GGCTTTTkGT ATTATTTACT AAATCCTGTG GTCAATCTAC TAATGAAGAC GAAAATGAAA 6900 

CGAATTTATG CCGTCTTACT GGTATTTTTA TTACTAATCA TCGCATTAGT TCTGATCTTA 6960 

TTAACAATTA TCCCTAAGTT GGCAGATCAG CTAGCTAGTT TAGCATCAAG TATGCCGGAT 7020 

TTCTTTAAAC AAGTGGAGAC CTGGATTTAT GAGATTGCGG AGTTACCTAT TTTTAAACAA 7080 

ATTGATTTAA CAAGCTATAT TGAAAAAATG GATATTTCTT ATGCCAATAT TATCCAACAA 7140 

TTTTTAAGTA GTTTATCAAG TAGCTTAGGT TCAATCGTAT CAACGGTGGC GTCAACCACT 7200

ATTGTTTTGG TCACTGCGCC ATTTATTTTA TTTTATATGT TAAAAGATGG AGATAAGCTA 7260 

GTACCAGCGA TTCAACGCTT TTTACCTGAG AAAAGAAAAG ATGATATTGT GGATCTTTTA 7320 

GGACAATTGA ACCAAACCTT ATCCAGTTAT ATTAGTGGAC AAGCGATTGA ATGTTTATTT 7380

GTCGGTACAT TTACGATTAT TGGGTATTCA TTGCTTGGCG TTCGTTATGC CtTTTTATTT 7440

GGTGTCATTG CCGGCTTTAC GAATTTGATT CCCTATCTAG GACCTTATCT AGGATTAGCG 7500

CCTGCTGTCT TAGTGACCAT CTTTAATGAA CCCgTCaAAG CAGCCtTATG TTGTTTAgTG 7560

GTGCTAGTGG TTCaACAATT GGACGGAAAT ATTATTTATC CAAACGTTAT CGGAAAATCT 7620

CTAAAAATCC ATCCATTAAC GATTATTTTA ATTTTATTGG TAGCAGGAAA TTTAGCAGGT 7680

TTGCTAGGTA TTTTCCTCGG TGTCCCATTT TACGCAATCT GCCGGACCAT TATCTACTAT 7740

GTTATTGATA TGGTCAAAGC GGGTCGTAGT GAAAAAGTTA CCAATGCCGT TCTTTTAGGG 7800

AATGAAACAA ATACGAAAGA AAATAACGGT TAATGATTGA ATCCGCTAAT CTTTCGTGCT 7860

ATGATTTTAT GGATTAGGAT AGAAAATAAC CTGTCATAAT ACTGATTGTG TCTGTTTCTA 7920

AGCCTAACCA TAAAAAGAGG AAGAACGTTA AAAGAGTACG AAAGTACAGC CTCGGAATTA 7980

TCAACAATTA AGGAGAGAGA GAACATTATG AATAATGCTG ACCCTGAGAG TCAGTCGCTA 8040

ATTGCGCAAC TTCTATTATT AGTCGTGTTG ACGTTTATCA ATGCCTTTTT GGCAGCGGCG 8100 

GAAATTGCAG TCGTATCTGT TAACAAGAAT CGTGTGGAAC AAAAA 8145 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 



AAAAGAGGCT TTAACAGCCA AGAACATCAA ACCTAGTGAT TTAGCGAAAA AAACTGGCAT 60

CGGTAAATCT TCGATCAGCG ATTGGCTAGC TGGTCGTTAC GAAGCAAAAC AAGACAAAGT 120

TTATCGCATT GCAGATGCAT TAGACATTAA TGAGGCCTGG TTGATGGGAC AAGAAGTTCC 180

CATGGAAAAA AATGCCTCAA CTATCGACCG CATTTATAAA AAATTAGAAC CCCAACGACA 240

AGCCATCGTT TATCAATTTG CCGAACAACA ATTACACGAA CAACAAACGC AAGCAGAAAT 300

TCTCTCATTC CCTCGCCGTG ACGAAATGAC ACTGGCTGCC CACGCTGGGG ATCCAGAAAA 360

GATATTTTCA AAAGAAGAGA TCGAGAAAAT TCACGATTAC TTGGATGAAA TTGATGCCAA 420

ATATCAACAA TCGATTTCTT CTGACAAAAA AGAGGATTAA CTTCCCATCT ACTCTTTTTA 480

TTAAGTAATG GCAGGTGAAT GGTAAAGTGA ATGATTATGA AAAACTCGTC AGTTCCATTC 540

AAAAGGATGT GACGGTTTTA GAAATTGATT TATACAATCA AACAGGTTGT TATGGGTTGT 600

ATCGAAATGG CAAAATTTAT ATTGAAAAAA CATTGAGTAC CCGTCAAAAG AAGAATATTT 660

TAGCCGAGGA ATACGGGCAT TATCAAACAT CCGTTGGCAC TATTTTAAAT CAAAATTGTA 720

CAGAAAATCG GAAGCAAGAG TTGAAAGCTC GCAATGTGGC CTTAGAGCAA TTAGTTACAT 780

TAGACGATTT AATTCGTTGC TCTGAAGCTG GCTTAAGTAA CCATTATTCT TGTGCTGAAT 840

TTTTGGAAAT CGATGTTGAA ACATTAAAAA ATGTGATTAC GTACTATCGA CAAAAGTATG 900

GCGCCACTTA TCTTTATAAA GGACGGATTT TTGAATTTAG GGATTATTCT GTGATGGTAT 960

TAAATACTGG GTTAACTTAA AAaGCATGGA ACAAAATCAA TAAGGATTTT GTTCCATGCT 1020

TAAAAGTATT CTGGAAAATT CAAGAGCGTT TTTTGTACCA ATGGTTAATA TGTCCAACGC 1080

CAAGCATAAG ATTGGGGACA AGTATAAGGA CGAATATAAA TGACGGCCAA AATCCGATGA 1140

ACTGAATAAT ATGTTCCCGC ATACTATGTA GTACAAAGAA GAAAAGAAAA AAGTACAGGG 1200

AACTCTTGCC TGAACCCATT TCTTTTAAAT AACGATCTCC ATTTTTATAG AAAACAACAA 1260

TTCTTTTCAT CTTTGCACCT CCATGACTAT TTTATCAAGA AATCATGAAA AGAAAAAGCA 1320 

CACCATTCGT CTGGTGTGCT TTTTCTTTTT AATTATTCAC CTTGATTCAA CTTACGAATC 1380 

AACTCATCAA TGTGATTTCC GTATTGGACA GATTCGTCTC TTTCAAAAAT CAACTCTGGT 1440

GTTTTATACA AGGTTAACCG TTGTCCTAAT TCTTTACGAA TCAGTCCTTT GGCTTTGTCT 1500 

AAGCCTTGTT GAGCTTTTTG TTGTTCGGAA GCTAAATCAG ATAACAAGCT GTAATAAATC 1560 

GTTGCTTGTT GTAAATCACC GGTCACACGA ACATCTGTAA TAGTGATGCC TTGGACGCGT 1620 

GGATCTCTGA TGCGTTTGTT CAAGATATCA TTCACTTCAC GCATGATTTC TTGACCTACA 1680 

CGACGGTCAC GATAATTTGC CATAGtGTCT TCCTCCAATT ACTAAAATTA TTGTTTGATT 1740

TCTTCCATAA TGAAGCCTTC AATCGCATCA TCTACACGTA AATCATTGAA GTTTTCAATC 1800 

ATAGCGCCAC ATTCAAAACC AAGTTTGACT TCTTTCACGT CATCTTTAAA GCGTTTCAAG 1860 

CTTGCTAATT TACCTTCATA AATAACAATA CCATCACGAA TGACACGCAC GCCACTATCG 1920 

CGGCGAATAA AGCCTTCTGT AACATAACAG CCAGCGATTG TGCCAACTTT AGATACTTTG 1980 

TACAATTCAC GAACCGTCAT TTGACCCGTA ATTTTTTCTT CAAATTCTGG ATCTAATAAC 2040 

CCTTTCATCG CTGTTTCGAT TTCTTCTAGG GCTTTATAGA TAATACGGTG TAAACGAATA 2100 

TCAACTTCTT CTTGTTCTGC TTGTTGTTTC GCTTGTGGTG TTGGGCGAAC GTTAAAGCCG 2160 

ATAATAATCG CATTACTTGC TGCGGCTAAA GTTACGTCGC TTTCATTGAT GGCCCCAACT 2220 

GCCGCGTGAA CAATTTTCAC ACGAACGCCT TCTACATCAA TTTTTTGTAA ACTTGCTGAA 2280 

ACGGCTTCGG CAGATCCTTG TACGTCTGCT TTGACAATGA TATTAACTTC TTTTAATTCG 2340 

CCTTCTTTTA GACTTTCAAA TAAGTTATCC AATGTCACAC GGCTACTTGC TGAACGTTGT 2400 

TCAAGCAACG CACGTTTAGC CCGTTCTTCC CCTGCTTGAC GTGCTGTTTT TTCATCTTCA 2460

AAGACCACAA AACGATCACC CGCTTGAGGA ACATCATTTA ATCCGGTAAT TTCTACAGGA 2520 

GTTGCGGGTC CTGCTTCTTT ATCACGGCGG CCCATGTCAT TGGTCATCAC ACGAACACGT 2580 

CCGTAAGTAT TACCAACAAC GATTGGATCA CCAACATGTA AAGTTCCTTG TTGAACGAGT 2640 

AATGTCGCAA CCGGACCTTT ACCTTTATCT AAGCGCGCTT CAATCACTGT TCCAATTGCT 2700 

TTTTGCGTTG GATCAGCTTT TAAGTCTTCT ACTTCAGCAA TCAATAAAAT ATTTTCCAAT 2760 

AGCTCATCAA TGTTTTGATT GAATTTCGCT GAAATATTGA CGAAAATCGT ATCTCCGCCC 2820 

CATTCTTCAG GAATTAATTC ATGTTCACTT AATTCTTGTT TCACATGATC AGGATTTGCA 2880 

CCAGGCTTAT CAATTTTGTT AACTGCTACA ATAATCGGCA CTTTTGCCGC TTTGGCATGG 2940

TTAATCGCCT CGATTGTTTG TGGCATTACT CCATCGTCCG CTGCAACAAC TAAGATGGTA 3000 

ATATCAGTGA TGCTGGCACC ACGCGCGCGC ATACTTGTAA AGGCCGCATG TCCTGGTGTA 3060 

TCTAAGAACG TAATTGGTTT GCCATCAATA TCTAACTGAT AGGCACCAAT ATGCTGCGTG 3120 

ATACCGCCTG CTTCACCTGA AGTCACACGA GAATGGCGCA ATGTATCTAA TAATGTTGTT 3180 
TTCCCGTGGT CTACGTGTCC CATGATTGTG ACAACAGGGG GACGTGTAGT TAAGTTTTCT 3240

TCAACAACTG CTTCTGGTTC GAAGAATTTA TCAATGTCCG CAATATCAAC TTGAACTTTT 3300 

TCTTGCGGTT CCATGCCATA GTCCACAGCT AATAATTCAA TTGTATCTTT ATCTAAGGCT 3360 

TGGTTTTGGT TAACCATGAC ACCCATCATA AATAATTTyT TGATGATTTC CGCTGGTTCG 3420 

CGGTGGATTT TTTTCGCAAT ATCCGCCACG TTCATACCTT CTGTATATTC TAAAACATCT 3480

GGTAATTCAC GGAATTTACG TGCTGGTAcT GCTGGgCTTC GCTGATTCTT GTTGGATGTT 3540 

TGGCCTTTTT TCCCTTTTTT GGTTGGAAAC GATTGCGGTT TTGGnTTATT AAAAGTTATT 3600 

CCGGGTTAAT TACGATTTGG AACCGCCACC AAAATGGTGG TTACGGGTTT GGCCGCCA 3658 



(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

TTTAGAATAT GGGCAAATTT CGTGGGCTTT TTTAACAAGT TCTTCTGCTT TAGCTTGATC 60 

TACCCCATCA ATATGAACTT CTAATTCAGC ACCAACGCTA AACCCATCTT CTCCTTGGAA 120 

TAAGGATACA CGTGCAGTTA CTGTACTTTT ACCTTCGACT TTTTCTTGAT CCATCACTAA 180 

TTCTAATGCA CCGTTAAAGC AAGCACTGTA AGCCGCTGCG AATAATTGTT CTGGATTTGT 240 

TGTATTTTCT TTCTTGACAC CTGGCGAAnA ACCGCATAAG AAAATGATTT ATCCGGTGAG 300 

TGAACTTCTC CTGCACGTCC ACCCGTATTA ATGATAGTTG TTTCATAGAT TTTTTTCATG 360 

TAAATTCCTC TCCTTTGTAA AGTTTTACAC TTCTGAGAAC TTGTTACTTA CATAATTATT 420 

GTAGGAAACT TTGCCAGATA CAGCAAACGT TGTGCTTGTG rATAATAAAA AATaACATTA 480 

ACGTgTAwTT TTGACGtGAA AAAGGACCCC ATCTATGAT 519 



(2) INFORMATION FOR SEQ ID NO: 389:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 697 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

TCAAAATTAT ATCCAACTAT AACAACAATC ATTTCATACC TTATTTCTTT TTATTTTCTG 60 

AGTAAAACTA TGCAACATTT ACCACTTAAT ATTGCTTACG CAAGTTGGTC AGGTTTAGGA 120 

TTAGTATTAA CAACAATTGT TTCAGTTCTT ATTTTCAAAG AACAAATAAA TTTAATAAGC 180 

ATTATTTCAA TTATTTTAAT AATATTTGGT GTTGTTTTAC TAAACACATT CGGATCATCA 240

CACTAATTAC TTTATTCCAA TTTCCTTTTC GGTAGAAACT TTGACCCCTC ACACTCCCCA 300 
AAGATGAGCC ACCAGGCAAA AAAACGCCCT GGTGGGCCCA CTTTGTTTAG TGTATATGAG 360

GGGTCTTTTC TTTCCGAAAA GTTTTCTCGG CATAAAAGAT CATCTGATGT TTTTTATCTA 420

AAAATCATTA TTTTTCTAGA AAATCGTATC ATTTTACAAC TTTTATTTTT GACTGTATAA 480

TCTAGTTAAA TCCCGTAAGG GCGCACTTAT ACGTCGATAA ATCGACGTGT GACGAAACTG 540

ACCTGcGTcA GTTTTTTAAA ATTAAATGGA ACGGcTTGAG TTaTyCCGAG CAGTTAAAAA 600 

AATtAATATG cGtGAATATC GCTACTGCAT TCtTAAAAAA GTGGgGCAGA TTTTAGCCCC 660

ATAACTTTTT GGATTACATC GACGATACGG CGCGAAA 697 
(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 



CTTCTGGTkG GCyGcTTTCt GCgGTTTcTG TTGtGCCgTT CTCtTCaGTk GTCaCAcTtG 60

GTTTTTCaGG TATGATTGGC TCaAtCGTGC CyaGTGTTTC aCTTTCaTCT GGATTTTCTT 120

CTtCtGTTGG TGAGACATGA ACTTCTGGTT GGCTGCTTTC TGCAGTTTCT GTCGTGCCAT 180

TCTCTTCAGT TGTCACACTT GGTTTTTCAG GTATGATTGG CTCAAtCGTG CCtAGTGTTt 240

CACCTTCATC TGGATTTTCT TCTTCTGTTG GTGAGACATG AACTTCTGGT TGGCTGCTTT 300

CTGCGGTTTC TGTTGTGTCT GTCTTTTCAC TTGTTTCTGT TGCCTGTTCC GAAGTAGGTG 360

TCTTTGGTTC CCATTCATCT ATTGTCTCTA AAGGAATGTC TAGACTTTCT TCTGTTGGCA 420

TTTCCACATT TTGATTGCCG CTTGCCTCGC CTTCTACAAA AACATTTTTA ACCTGTGATG 480

TATTTTTTTC ATTCGTTGCG TCTTGATTGT TTAGTTGATA ATTGATGTCA TAACTATTTT 540

CAAATGTTGC TTGATGTTGG CCTGCTTCTG TGATTGTCGA AGTGTAACGg ACGATAAAGG 600

AAGTAAAGCG TGCTTTATCC CGATAAAAAC GTAAGTTAAA GTCATTATCT GTTACGAAgT 660

CAATTTTGCC ATAACCTTGT TGCTCAAACT CGGCAAGTGA AATATATTTA GTTTCTTTGT 720

CATTCACAAT ATCAAATGTA AAACTCTCTT TATTTAATTG TTGACCACTT CCTTGTCGAT 780

CCGCAATTGA AATATCTTCT GTGACATCGG ATTTATTGAG GTTCACATTT AAAAACCAAC 840

GTACTTGATT TGACTCTCCA GCCAAATCGC CTACTTTATA AAAAAACGGA TAGTCTCGCT 900

CAATTTGGCC AGTCTCTGTG TTAGTCACTC CTTCAATCGT CAAACGTTGA GTCGCCGTTG 960

CTGTTCCGAA ATCCGTTTCT ATCACATTCG GTTGAGAACT ATTGGTGATA AGCGTTTTAA 1020

TCCCGAAAGA AAAATGCCCA TTCACATTAT GTAAAGATTC AACTTTTTCG TTAAATGTTG 1080

CTACAACATG ATCTTTATAG ATAAAAACTT CCCCTAAACC ATTTAAATTG ATTTTTCGTG 1140

GTGAACTATC GTTCTCGGTC ATTCCAACTA GTGCGTCTGG TAAAGTTAAA GTAATAGTAT 1200

CTCCAGGTTT TATTTTCTGA TTTTCTTTGT CACTAAAAGA TACTTTAATT GAGGTCATCT 1260






CTCCTTGATA 


TAAAP.T AP.TP. 


TPATPTA aTT 
1 bn 1 b 1 AA 1 1 


PTaPTTTPTP 
b 1 Ab 11 1 b 1 b 


a flPPBTTP A A 
AAbbAl IbAA 


CTTTTGCTCA 


1320 


ATTPP,PrPr,PP 




ppappa a tp a 

bbAL-LrtA 1 bA 


A T A A fTTTTT* 

A 1 AAbbbbbb 


bbbAAl T.GGT 


AGAATCATTA 


1380 


ppaapap.taa 


PAPTAAAAQT 


TTTAPAPTTT 
ill HtHL 111 


TTPTPATBA A 

1 1 b 1 bAl AAA 


A A A TT A A 
AAA1 IATTCA 


CCCTTCTTTT 


1440 


rTPTTZHTTTT 


TTPPAPTTAP 


TPPTPTTTTT 
1 bb 1 b 1 i 1 1 1 


PEPnAPTTPA 

bAbAAb 1 1 bA 


PAATAPPAAA 

bAA 1 Abb AAA 


AAGATATATC 


1500 


TPAPAPAATA 


TTA ATP.PTPA 


1 1 1 AAA! Abb 


PTTTTTTTA A 
b 1 1 1 1 1 1 1 AA 


AATTTCTAAT 


TCACTTTCTA 


1560 


APP A A A APPP 
Ab bAAAA b b b 


I 1 i Al I Al bb 


1 Ab 1 1 ALA I 1 


AbAlAAbAl 1 


TGTTACAAAA 


TATCTAATGC 


1620 


TPPPA A ATTT 


PAPTTPPATB 
Ijnb 1 1 bbAl A 


A ATA ATATAT 
AA 1 AAb A 1 A 1 


TTPAPATPA A 

1 IbAbAl LAA 


T AAT G AAT AA 


AAATACTTAT 


1680 


1 ILAlbblLA 


i i 1 Ibl 1 1 bb 


LAbbLLAJ/T. 1 


ATTCTTGTTA 


m TV TV m TV 7\ TV r*fT\rT> 

TAATAAAGTT 


TAT GAG AT AA 


1740 


ppapp^aaap 
b bA b b a. AAAb 


AAl bAAAAA I 


TCATATTTTG 


ACGGTGGTTT 


AGCAACGTAT 


ATCGGGACTT 


1800 


P A ATTTTAPP 

LnAl i 1 lAbb 


A -i PPT TP ATT 

AatL 1 1 bAl I 


ACTGTCTTTA 


/■* TV rn m tv -n m 

CATTAGGCAT 


TTGCGCACCT 


TGGGGAATTT 


1860 


olAl b A 1 b 1 A 


bAAb 1 bbAAA 


7\ rn tv tv v - * tv rn tv 

Al CAAGCAT A 


CCGTTATTGA 


TGGTAAACGC 


CTTTATTTTG 


1920 


Abbb lALAbL 


T 1 7\ TPP A A (PT A 

1 Al bbAAl X A 


TTPPPnr»/"> t\ m m 

1TCGGTCATT 


GGATTAAATG 


GCTTTTATTA 


AC T AT AAT T A 


1980 


tl 1 1 Abb AM 1 


1 1 A 1 bb 1111 


IbbTiGAATA 


TCCGTTTGCA 


ACAATGGATT 


ACAAAACATA 


2040 


bbbAbAbAb 1 


TTPA T A A A A A 

1 lbAlAAAAA 


p T\ pp a a pp a rn 

bAbbAAGGAT 


GAACTGAGTT 


ACGGTCCATC 


CTTCCTCTTT 


2100 


1 1 A 1 yM 1 A 1 


b 1 AAL AbAb 1 


bl lATbbTAC 


TCTTTGAAAA 


TV TV ^* TV TV TA TV m 

AACAAGGAAT 


TTATGCTATA 


2160 


till un 1 AAb 


1 A 1 bAAbbbA 


TACTTATGTT 


ATTTCGGGGG 


CGTTACGGAT 


TCGACAGGCA 


2220 


TAPTTP.APPT 


TP A aTTPPPT 
1 bAAl 1 bbb 1 


1 I bbrAGGTT 


ACGGCTACGT 


Tl TA TV TA TA ^ m m TV 

TAAAACGTTA 


CAGTTAAATA 


2280 


A rxrt^ 1 1 AM 


a a z\rr a a a a p 
AAAbbAAAAb 


AAl ibl I 1 Lb 


CTTTAGCTGC 


i^N rp 7\ TV TV TV T\y~«/~>TA 

CTAAAAACCA 


^ try TV /"» TA TA ^ 

GCTAGCGAAG 


2340 


Al tv- 1 bbbbb 


PATPPPPPaT 
bAl LuLtLHi 


b 1 bb ibbbbT 


CAGGGTCCTA 


ATCGAAGTGG 


t** TV #n TV ^1 y"i m TA TA 

GATACGCTAA 


2400 


1 1 1 1 1 \^\^ O I 


Ulu 1 AAAA1 1 


TAfArrAPrT 
1 AbAbbAbL i 


TACCAGACTA 


/T*Tk T\ Tl TV TV ^ TV 

GCAATACAGA 


ATGCCTGTCA 


2460 


\* 1 VrfOvjv- nLuL 


TPT A A APPP a 


BPPTTTA A AT 

Abb 1 1 1 AAA1 


bAb 1 bl b 1 Al 


TV AOOrnA/^A/™" 

GAACGTAGAG 


tv m m m tv ta ^ ^ 

ATTTAAGTGG 


2520 


PA ATaTPTTT 


OunLuLbub 1 


1 bbAb 1 bbbb 


bbb 1 b I bbAi 


rnmrnrrtA^'A A A A 

TTTTAGAAAA 


^* m TV TA TA TV TV T\ 

GTAGAGAAAA 


2580 


APAAAAAAPA 
AV— AAAAAAVjA 


ppt a A A A PPT 
bL 1 n/WiLL 1 


TATAA A ATA A 

1 Al AAAAI AA 


AGGTTTTAGC 


TCTTTTTTTC 


TATAAGCAAG 


2640 


AP.A AP.P.PTPP. 


TT a PPTPPTP 


TT APAPTTPP 

1 1 AbAb 1 i bb 


1 AibALATTA 


TCTTTCTAAA 


tv /™* tv omm ^ ta tv> 

ACCACTTCAT 


2700 


rp rn fp rp fp ni fp rp 
,11 1L1 1 1 1 


1 A 1 1 ob 1 On 1 


111 bbb 1 Al 1 


PP-iTAPA APi 

bba l AbAAba 


fn^l^^fTlTV TV TV TA TV 

TGGCTAAAAA 


TATTATTGTT 


2760 


ATPAAAAGAf; 


TTTTTA A P. AT 
11111 nnbn 1 


TTP A A TT TPT 
1 1 LAA1 1 1 b 1 


pptppA a mrim a 

bblbAAlbl A 


TPPA APTATA 

1 bbAAb IAI A 


o A /*• a a a a r*r*/~* 

C AC AAAAGC G 


2820 


ATGATAAAAG 


TAATT APT AT 
i nn i i r\\*. 1 1 


ttppp t a a p a 

I 1 *~V_-Vj A AAbA 


TP AT p'PTT T* A 

1 b A 1 b 1 1 1 1 A 


AATTPATTAP 

AAl IbAl 1 Ab 


bl rAGbbCAC 


2880 


ATTGAACTGC CCATAAAGCT AGTTGTCCAG CTACAATAAT AGCATTCGTC TTTAACCCAT 2940

TGGCTTTAGC GAAGCGTAAT ACAGTCACTG CTAATTCTTT cCAGGCTTTC TTTTGTGCAG 3000

CTTTTACAAT TGCGCTGATA CTAATCATTG CAAAAAACTC ATCTTTGATT TTGTTAGCAA 3060

CGCAACTTCC TAATGCATTC CAATTAAAAC GTGCTTGAAC TACTGTTGAA GCATCAGAAA 3120

TATTTACATT TAATTCTTCC TCAACTGAAG AAATCTCTTG CTGAGCAAAA TTATTTTTGA 3180

CATAGTTTTT AAAGTTATCT TCCGTCACTC CATTCGCATA CATTTCTTCC AGTCCTTTTG 3240

CTACAGCCTC AACTTCTGAA TCCTCTAGCT GGTCAGATGC ATAAACACTT GTACCAGCGC 3300

TGATAAAAAT ACCTAATGTC ATTAGTAATG CCATCGCCAA ACAAAAAAAT TTAGCCTTTT 3360

TTGTCACCTG TAATTCCATA AATAAATCCC tACTTTCTTT TTCCTTGTTA TAATCATAAC 3420

AAGGAAAAAG ATAATTTATC ATCGAAATCC CTATTTGATG CTTATATTTT CCTATATATC 3480

CTAGAGCTGA TTTTTCTCAC TAGTGATCAC ATCAATTTGa ACTAATGaTT CAAAAAAAGA 3540

CACAGGTTAT TTGAGGTACG TTCGAGGTAA TGCTAGGTTC TATTTTGAAA ACAACGCAGC 3600

TCTCTTGaAA TATTTTTTGa GCTTGTCTaA ACCTTCTGAA ACGAATATAG AAAATAAAAA 3660

AAGAGTCCCC TTTCTCAAGG AAACTCTCAA TTGCTAAGTC AAAATAACTT CACGktATCT 3720

AAATCTTTJtA AATCAATCCT ATGGTtCGCG GrACAAAGGa GCTAAAAaCT TAATAGAATA 3780

AATCGCACAA ATTCCGACAA TAATATAGAT AATTTTTGCT AATGGTGTGn TTGCACCGCC 3840

ACTAATTGTT GCAACTAAAT GCAAAATTA 3869
(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

GGAGGAAnCG AATGGTAAAA AACGATTAAT TATTGAAGCT GTGGATTCTG ATAATGATTT 60 

aACAATTGAA CAATTGTTTG AAAATTATAA AGAAGAGaAG GTGCATGTAA CTATTCAAGA 120 

TTTAGGAAAT GCGGTAGGAA ATGAAAAATG GTAAAAGTTC CTCATCAAGG TGATATTTTA 180 

TTGTTGAATA CGGCTCCTAG ATCAGGACAT GAACAAACAG GTAAGAGACC GTATATTGTT 240

TTAAGTCATG ATATTATTGC AGATTACAGC AATGTTGTTA TTGTAGCTCC TATATCAAGT 300 

ACAAAAAGAA ACTATCCTTT ATATGTTTCT ATTAATCCTT CTTACGGAAT GAAAACATCA 360 

GGTAAAGTAT TATTAGATCA ATTAACCACC ATCGATTATG AGGCTAGACA ATGCGTGTTT 420

TTAGAAACAG CACACGAAAA ATTAATCGAT GAATTATTGC TAAAAGTTAG AACCGTATTT 480

CAAAAAGTAA ATAAAACAAA TAAATTTTAA CTAATAAAAA GGCGAAAAAA CGAACAAGTC 540

AAAAGTTTCC GACTTGTTCG TTTTTTTCTA TCGATTAAAT GCTCCCACCA GAAAGTAATG 600 

ATACAGGTAA AATATAATCC TTACTTATCT TTTCTCCCTT AATCTTTTTC AACAAAACAT 660 

CAACAATTAG TTTTGCAAGT TCATCAATAG GTTGTTTAAT TGTGGTCAAT TGTGAAAAAA 720 

ATTGTTCAAT AAATGATGTT CCATCGTAAC CAATAATTTT CACATCTTCT GGAATATTTA 780 

AGTTGAGTTT TTTTACAATT TTCATAGTAA GAATAGCAGT TAAATCATCT GAAATAAAAA 840

TGCCATCTGG TTTATTAGTT GaAATAATCG ATTTAATTTC CATCTCTCGA CGAATTGTAG 900 

ACAAACTATT AGGAACTTTA AAGACTTTTC CTTCTGGAAT TTGGAAAGAA AATCCTAAAG 960

CACGTAAACC TGTAGGAGAA TCTGTATTAT CATTTCCAGT aATCATAATA ATATTTTGaC 1020

AACCATTTTT CCGTAAAAcG CTTGCTGcCA ATTTACCACC TTCaAAATTA TCTGATGCGA 1080 

TGATAGGAAC ATTTGGTGCT AAATTTCTAT CAAAAGCAAT AATTGGTGCT TGAACACGCT 1140 

CATAGTCAGC AATCCCAAGA TTATGACTAG AAGAAATAAT ACCGTCGACT TGATTTGCTT 1200 

CAAGCATTTC AAGATATTCA CGTTCTTTTC CAGGATCATT TTCGCTATTA CAAATAATTG 1260 

TTTTATATCC TTGATTAAAT aATTCTATTT CAAGGGTATT CAATGAGTTC CGCATAGAAT 1320 

ATGTTACTGA TATTAGGGAA AATGAGA 1347



(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392:



I AAL-U 1 bA 1 A 


A I b L AA X Lbb 


blAl bblbl 1 


bAbbAbb lib 


Tana naar 
1 AAA 1 b AAAb 


bbbbbb I bAI 


bU 


AAAAbb 1LI i 


AAb 1 bb 1 1 L-/\ 




b Ab 1 1 AAAtb 


RTTrrra at a 
Al ibbbAAlA 


Ibl AAlbbAI 


ion 








bb 1 AAAAA 1 b 




b AA I b Ab Ab A 


i on 
loU 


TTAGTAAAGA GTCAAAGTCT GCAAAACACA AGTCGTACTG CAACAGGTGT AAATaAGAAT 240

CCCCTGGAAA ATcAATCAGC AGGACATCAT CAATAATTAA TTGAGTTTGT GTACGAATTT 300

CTCGTCCTTT TTTCTCTCGA GCGTACTTAC ATAATTTACA ATTACAAAAA ATTCCTGGAA 360

TCCGTTCTGC TGCACCCGTG CCTAAATAGT TGATTTTCAT TGGGAGtACC TwACCtTTcT 420

TTGaCATTCT TCATACTTAG TAGATAACGC TGAACCTTAA AGTAAATGGG CATGGAAACT 480

gAAACaTTTG TTGTGTCCAT CTTACCTTTT TTAGAGAAGA ATTGAAATGG AATTTTTTAA 540

GTAAATATGC GTAAAATrGA AAAAGCTAGA CGCCTTTTTT GATAAGCTGA ATTTACTGCG 600

AATGCTCGTA CTATTTCCGA AAAAAAGTTT TACCAAAGGC ACTTTCATAT CCATCTGCTA 660

TTAGTTATAG AAAAATAACA AGATGGACAT AGCTTATTAG AGCAAAAATA ACTACGCTAT 720

TGATAAAGAG TGAGGCGTGT TTTGAACAGG TTGCGTGGGC AAATGAATTT GAATAAAAGG 780

GATGACAAAA GATGACTATT GAAAAATTAG AATATTTTTA TACCATTGCA AAATATAATA 840

GTATTTCCAA AGCAGCCTCT GAACTCCATG TTAGTAAATC AACCTTAAGT GCGTCATTAA 900

AAGATTTGGA AAGTGAATTG GGCCATTTAC TTTTTAACCG TAACGGTAAT TCTTTGACGT 960

TAAATTCCTA TGGTGATAAA ATCGTCCAAT CTGTATATAT CATTTTAAAC GAGGCTAAGA 1020

AAATGAAACT TAATTTGCAT GAAATGATTG AGAATCCAGT GATGCGTTTA GGTTTTGGAA 1080

ATACGTCCTT GATGTATAAA GTGACAGAAA ATGAGGATCA ACTGAATCGT TTTTGGGAAT 1140

GTTACCATGG TTCATCGTTT GAGTTATTGA ATAAATTGGA GAATCATGAG TTGGATTTTG 1200

TGATTACTTC TGCAGATGTG AATAGTCCAG TTTTAAAGAA AGAACAATTA ATTGAACTAA 1260

AAATGTATCT TTGTGTTTCA AGAGAAATTA AACAAGAAAT TGAGGAAGAA GGCTTCAGCT 1320

GTTTGaCCAA CTATCCATTT CTATTTTTGC CGCATCACTT AGATCATTTA GAAGCTACCA 1380 

AGAGTGTGCT AGAAATGTTG CAATTAACAA GTCCGCTTGT TTGTTGCTAC GATACGTTGA 1440

TGCTTACTCG TCTCATTGAA AAAAGTAAAG GAGTTTATGC GGTGATCTCT CTTCGGAAAG 1500 

AACAACTGCA GGAAATTGAT TCTAAACTCT TCTTTTTACC GATTGAACGA AAACAAAAAT 1560 

TTTATTTATA CCGAAATGTT TCCAGTTCGG TTTTTGTCCA ACCTGGTCAA ATCAAGGCAA 1620 

CTTTACAAAA GTTGCTAGAG ACTTAGCCAT TAAAATTACT TTGGAATAGT TACCAAAAAT 1680 

AGCTTTATTC GCTTAGATTT ACTTCTGCTG CGGATAAGGC TATTTTTGTT GTTCATTGTT 1740 

TAAAAGAGAT GCTCAATTAT GGGGCTGTTC GATATTTCTG AACAGAATGT TCAATTATTT 1800 

AATAACTTAA GAATCGATAA ATCGTATCTA TTACGGTTCA ATTTAAAGGG GTTCTGGTGT 1860

ATAGTTTGGC TATCTAAAGA TTTACTTTTT TAAATATAAA ATAACTAAAT GGCAAGGAAT 1920 

ATAAAGAACT GTTGATACGT GAGTACTTGC GATAACTGTG TCGTTAGAAG TTTAAGTGTC 1980 

ATTGTTGCTT TTCCTGTTTT ATGTAGTtGC TAGAAAGGAC TGTTTTTATG ATCGATAGAA 2040 

AAAAAGTTAT TTTGTATACT TGTTGCATGA GTTTGTTTGT AGTGACCATG GATGTTACGG 2100 

TTGTCAATGT GGCGTTACCA TCCATTCAAA GTGATTTTCA CACGAATCTG TCTACATTAC 2160 

AGTGGGTAAC AGATGGCTAC ACTTTAATGG TAGCATCCTT ATTATTGTTG TCTGGGTCTA 2220 

CAGCAGATCG AATTGGCCGT AAACGAGTCC TTCAATTGGG CTTAGCCTGT TTTGGTTTAG 2280 

CGTCTTTCCT ATGTGGGATT TCGCAAACCC CAGGGCAATT GATCGCGTTT CGCATGTTGC 2340

AAGGGATTGG TGGTTCTATG TTAAATCCGG TAGCAATGTC TATTATTACA CAGGTATTTA 2400

CCGAAAAGTT AGAAAGAGCG AAAGCAATTG GCTTGTGGGG CTCTGTTACA GGGATTTCCT 2460

TAGGCATGGG CCCGATTATT GGTGGACTGA TTGTTTCTTA TTTTAGCTGG CGGTATGTCT 2520 

TTTTTGTAAA TGTACCGATC ATTGCTGCGG CAATCATCCT TACACAAAAG TTTGTACCTG 2580 

AGTCAAAAGT AGAGAAGACG GTGAAAAATG ATTTTGTTGG TCAAGCATTG ATGATTCTTT 2640

TTCTATTTAG TTCTATCTAT TCCATTATCG GACTACCTAG AAAAGGGCTT TTCGCGCCAG 2700 

ATATTTTAAG TACTGGGATA ATCGGCTGCT TAGCCATTGT TATTTTCTTT ATTTATGAAT 2760 

ATAACATTGA CAATCCGTTA ATCAACCCGC GTTTCTTTTT ATCTATTCCA TTTACATCGG 2820 

CTTCTTTTTT AGCTATTTTT GGCTTTATCA TATATAACGG CTATTTATTT TTAAACACGC 2880 

TATATTTGCA AGAGATGAGA GGCTTCAGCC CGTTGGAAGC TGGCTTATCA ACCATTCCTT 2940 

TGGCTTTTGT TAGTTTTCTT GTTGCACCGA GAGCCGGCGA AATGGTAGGG AGAATAGGGA 3000 

CGAAACGTCC TATTATGCTT TGTGGTATTT CAATGTTGGC TGTTAGCTTT TTACAATTAT 3060 

TTGTAACTAA AACAACGCCT ATGATTATTT TATTTATTAT TTATATCTTT TTAGGCATTG 3120

GGTTTGGGAT GTTAAATTCA CCGATTACCA TTACAGCGAT TGAAGGAATG CCACTTTCTC 3180 

AGTCAGGAAC TGCAGCAGCC ATTGCGGTGA CATGTAAGCA AATTGGCAAT TCTTTAGGGG 3240




TGGCGCTACC AAGTCTTTTA ATTACAAAGC CTATTACTAG TTCGCTTACT CGAACACCTT 3300

TTACAAACGT ATGGCTTTaT TTGGATGTTG CGCCATTGCG ATTATCtTTT TAAGTTATTT 3360

aTCCAATTCT CCGCTAGCGA AAAAGTCATT GCGCAGAGTT AGGTTTTATT TTTAACAAAT 3420

CAAGATTTTT TTGAAGGGAA AGGAAGACAA ACTTAGATGA AAAAAATAAA TAACCGATTA 3480

GGACTCTTTT TATTTTTTGT TTTACTCTTT TGGGCAAAAA CGATTCTGGC GTATTACCTA 3540

GATTTCTCAT TGGGTGTTAA GGGCAGTCTC CAGCGAGAAA TTTTATGGTT CAATCCaGTT 3600

GCTACTACGC TGTTATTATT TGGTTTTAGT TTGTATTTTA AAAAGCATAA AACATTTATG 3660

CTTAGCTTAA CTCTTTtAAA TGTTTTAAAT ACATTAATTT TATATTTGAA TATTATTTTT 3720

TATCGTGAAT TCACTGACTT TATTACCATC CAATCTGTTT TTGGCTTTTC AAAAGTTTCA 3780

GAAGGAATTT CTGGTGATTT ACTTGCCTTA ATGAGGGTCC ACGATGTCTT TTATTGGCTA 3840

GATACTTTAG TGCTGCTCGT TTTGGTATTT TACTCTTTTA AACAGCGACG GAATCATCAA 3900

GTCTTTGTGA CCAAACCTAT CAACAGAAGA TTGGCGATCT CAATTAGTTG TTTAAGTGTC 3960

TTGCTCTTTT CGGTTAACTT ATTTTTAAGC GAAACCGATC GTCCTCAATT GCTTCAAAGA 4020

ATTTTTGACC GTAATTATAT TGTAAAATAT CTTGGCTTGG ATGCATTCAC AGCCATCGAT 4080

GGTGTCAAAA CAGCCATGTC AAATGAGGTA AGAGCAGAAG CTAGTTCAAA CGAATTACCT 4140

GAAATCATGA AATTTACCAA GCAAAACAAT TTACCGCTTA ATCCACAGTA TGCCGGGATT 4200

GCCAAAGGAA AAAATGTATT TGTGATTCAT TTAGAAAGTT TTCAACAATT TTTGATTAAT 4260

ATGAAAGTCC AAGGCCAAGA GGTTACACCG TTTTTAAATC AGCTTTATCA AAACAAAGCG 4320

ACAATCAGTT TTGATAATTT CTTTCATGAA GTAGGGCAAG GAAAAACGAG TGATGCGGAG 4380

AACATGTTGG AAACAGGAAC ATTCGGGTTA CCGCAAGGCT CTCTTTTTAC ACAATTAGGA 4440

TCAGACAATA CTTTCGAAGC GGCTCCCGCA GTTTTAGGAC AAAAAGGCTA TACmACCGCT 4500

GTTTTCCATG GAAATGTGGG GAGTTTTTGG AATCGAGACC ATGTGTATAA AAACTTAGGA 4560

TATCAATACT TTTTTGATCG GTCATATTTT TCAAAAGATG ACAAAATGCT AGGTTACGGT 4620

ATTTTAGATA AACAATTGTT GCGTGAATCA GCTAGTGAGC TTGAACATTT ACAGCAGCCA 4680

TTCTATGCGA AGTTTTTAAC TGTGACCAAT CATACCCCTT ATTATACGGA TGACCAAAAC 4740

TTCCCGTTTC CTAAATTGAA CACAGGCAAT TCGATTGTGG ATGATTATGT TCGAACAGCG 4800

CACTATTTGG ATCAATCGCT AGAACAATTT TTCCATTACT TGCAAACAAC AAGTTTATAC 4860

AATCATTCTA TTTTTGTTAT TTATGGTGAC CACTTCGGTA TTTCAGATGC GAACAACAAG 4920

GACTTATGtA AAgCTTTCAA CCGCGATCCA AAAACGTGGA CAAACTATGA CAATGCGCAA 4980

TTGCAAAGAG TACCTTTGAT GTTTTACATG CCAGGCTATA CGCAAGGGAA AATCATGCAC 5040

GAATATGGTG GGGAAATTGA CGTTTTACCG ACACTGTACC ATCTTTTAGG TGTCGATGAT 5100

AAAAATTATA TCCATTTTGG CACGGACCTT TTATCTCCTC AATATAAACA AGTGGTTCCT 5160

TTTAGAAATG GCGATTTTGT AACGCCTCAA TTTTCGTATt AGGTGGGGAA ATCTACAATA 5220

ATCAAGGGAA AAAATTAGAC CAAGTTCCAG CAGATTTAAA AGCAGAAGTC ACGAAAGATA 5280

ACGATTGGGT GAAAAAATCG TTAAGTTTnT CTGACAAATT AAATCGGGGA AACCTTTTAC 5340

GnTTTTATAA ACCACAAAGG GTTCCAAGAA GTGCAACCAn AAGGG 5385
(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 



CGTGTGGTTA TCTCTGCTCC TGCAGGCGCA ATGAAAACAA TCGTTTACAA CGTCAATGAT 60

GATACACTTG ATGCCAACGA TAAAATCATT TCTGCTGGTT CATGTACGAC AAACTGTCTA 120

GCACCGATGG CTTACTTCTT AAATAATGAA TTCGGTATTG AAGTTGGTAC CATGACAACT 180

GTCCATGCCT ACACATCAAC ACAAATGCTA TTAGATGGCC CTGTTCGTGG CGGAAATCTT 240

CGTGCGGCAC GCTCTGCTGC AGACAATACT ATTCCCCATT CAACTGGTGC TGCTAAAGCA 300

ATTGGCTTAG TTATTCCAGA ACTTAATGGA AAATTACAAG GCCATGCCCA ACGTGTTCCC 360

GTTGTGGATG GATCCTTAAC AGAATTAGTT AGCATTTtAA AAACAAAAGT AACAGCTGAT 420

CAAGTAAATG AAGCAaTGAA AAAACACACC ATtGATAACC CtTCTTTCGG TTATGAwGAT 480

CGTGAAATTG TTTCTGGAGA CATAATCGGC ACGACAGAAG GTTCGATTTT TGACCCaACG 540

CAAACagAAg TAACGACaGC TgGCgATTTc CaAtTaGTgA AAAcTgTGGc TTgGtAtGAc 600

AATgAAtAtG GtTTTaCTTg CCaAAtGATT cgCTTatTGG aAAAAtTcGC tAATTTatAA 660

TTCCaATGaA taAAAAAcTg CCtTGcaTAA AtTaATgCaA gGCaGTTTTT TTAATGTTTA 720

CTTTGACAAC TTTGACAAAT ACCATGAAAA GTTAAGCGAT GATCTTTGAC TAAAAAATGA 780

TACCGACTTT CAACAATTTG CTCTACTTCA CCAAGCAAGT CTTCTTCCAC TTCCTCAATT 840

GTGCCACATT CTAAGCATAA CAAATGGTGG TGAAAGTGTT TGGCGCCTTC TTTTCTTAAA 900

TCATAGCGAG CTAAACCATC GTTAAAGCTC ACTTTATCAA CGACTTTTAA GTCTGTTAAA 960

ATTTCTAGCG TTCGATAGAC TGTTGCTAGA CCAATTTCCG GACTTTTTTG TTTTACCAAG 1020

AAGTAAATTT CTTCTGCTGA TAAATGATCT TTTTCAATTT CCAATAATAC AAGTACGGGT 1080

GCCTCCCGGT GTGGGGGTAA ATTnAAGnCn GAATCCnGGA ATGGGTTTTG GCCTTTTAAA 1140

GCCGCC 1146

(2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394:

TGGCTATGTC AACATTGCgA CAaGgCCGTw AaGCTCTGGt GCGCCAcGGC GGAAGTGCCT 60 

ATCGAAaCCG CCAwmAAATT GTTGAAAGAC ATGGGGGGCA GTTCCATTAA ATATTTCCCG 120 

ATGAAAGGCT TGGCTCACAA AGAAGAATAC CAAGCAGTAG CAGCAGCTTG TGCCAAGTAT 180 

GACTTTTATT TAGAGCCAAC AGGAGGCATT GATTTAGAAA ACTTTGAAGA AATTGTTCAG 240

ATTGCGGTGG ATGCCGGCGT TAAAAAAATC ATTCCTCATG TCTACAGCTC AATTATCGAT 300 

CAAGAAACAG GGGACACTAG AACAGAAGAC GTGAAAACGT TACTAACCAT GATGAAAAAG 360 

ACATTGAATA AGTAAAGGGC CAATCTTTAG TTGACTTTCT CCACCAGATT TGCTACTCTC 420

AAGTGGAGAA TAAATAAAAA CCAACAAATG AATGTGGGGT ATTTTGGTGA GTGTCGTAGG 480

TATCTAATTG ATAAGTTGAA TAACCAGTGA ACCAAGCAAG GTTTATTTGG TGTGCTTAGA 540

AAGTGATACC TGTCAGTCAT CTCAATTACT AAATACCTCC TTTAAAAGAG GTTTCTTTAG 600 

TAACCTTTAT CCAATGATAA AGAGGACAGC GGTAGAACGC TGTCCTCTTT TTTCTGTAGA 660 

AACTGGTTTT TATTTAAAGG CGCCAGGTAT TCTTTTCGTG ATAGAAAGGA AGAAAAAACA 720 

AATGAATCAT GaAACCATTC AAAAATTACA ATTTACAACA ATTCTTAAAG AAGTTCAAAC 780 

ACGTGCGATT GGCGAGTATA GTAAAGAACG GTTGGCGAAA ATGGTCCCTG CAACACGTTT 840

AGAAACAGTG CAGTCCCGCT TAACGGAAAC AACCGAGGCC CGCTTAATTA TTGATAGCGG 900 

ACAACACGTC CCTTTCATGG GCTTGAGCCA AATTACCCGC TTGCTTCAAC AAGTGAAAAA 960 

AGGCTTAATC CTTACACCAA ACGAATTGAT TGAGGTCGCT GACTTTTTAA GAAGTAGTCA 1020 

GCGAATCCAG AAGTTTTTTG AAAAAAATCA ATACCAAACA CCACGCCTAT TTAGTTACAG 1080 

CCAGCACTTA GCTGATTTCA GAGCAATTGA AGAACAAATT TATACGAAAA TCCATAATCA 1140

AAAAGTTGCC ACCGATGCTT CTCGACAACT TCGGAAAATC CGGCGTCAAA TCAATGAATG 1200 

CCAACAAGAA ATTGAAACAA AAGTCACCAA ATTTTTACGT AATAAAAATA ATCAACTGTA 1260 

TATTCAAGAA AATATGATGG TGAAAAAAGG GGAACATTAC ACGGTACCAA TCAAAGCTAG 1320 

CTATAAAAAT AAGGTTTCAG GCACAATCAT TGAGCAATCC AATAAAGGAC AAACGGTCTT 1380 

CATTGAACCT GTAGCAATTA GTAAATTAAA TGAACAATTG ACACTCTTAA AAGCGGAAGA 1440

AACAGCGGAA GAATATCAAA TTTTGGCAGA ACTAACAGGG TTAATCAATG AGCAAGAACG 1500 

ACTAGTCGAT CAAGCCGTCG ATACAATGAC CACCTTAGAT ATTATTTTTG CACGAGGAAA 1560 

GTATAGTCGC GAAATTGGTG GGATTACGCC AAAAGTAAAC AAAGAAGAAC gCTTGCGCAT 1620 

TGTTCAAGGA AAACACCCTC TTTTATTAGA ACACGCAGTA CCATTGACGT TTTCTTTAGG 1680 

AGCAGAGTAT CGAGGACTTG TCATTACAGG TGCCAATGCT GGTGGTAAAA CGGTTGTCTT 1740

GAAAACAGTT GGCTTATTGA CTGTCATGAC ACAATTTGGG TTGCAAATTC CAGCACAAGC 1800 

AGGCACAGAG ATTCCAGTAT TAGATGAAAT TTTTGTCGAT ATTGGTGATC AGCAAAATCT 1860

GGAAAATGCG TTAAGCACTT TTTCTGGGCA CATGAAAAAT ATTGCGGAAA TGTTGCGTAA 1920

TGTTAAGCGA CACACGCTTG TTTTATTAGA TGAAATTGGC AGTGGTACCG AGCCGAACGA 1980

AGGAGCTGGC TTAGCGATTG CTATCATGGA AACGATGTAT CAAAAGGGAG CCTTGGTAGT 2040 

TGCAACGACC CATTATGGTG AAATTAAACG TTTTGCCCAA GAGCATGACG ATTTTGTGCC 2100 

CGCCGCAATG GCCTTCGATC GTGAAACATT AACGCCCAAA TATCAATTGA AAATTGGTGA 2160 

AGTTGGCGAT AGTCAGGCGC TTTGGATTGC ACGTAAAATG AAGATGGAGC CACAGCTCAT 2220 

TGAAAAGGCG GCTCATTATA TTCAGCGGAA AGAATACCCT ACTAATCGAA GTCTATTTAA 2280

ACCCCTGAAA GCAAAAGAAA CGACCTTTGC CACGTTTGTA GAAGAAACAC ACCGTTATCA 2340

AAAAGGTGAT CGTGTCCTAT TGACAGAAAC ACAACAAATT GGGTTGGTTT TTACAGATGA 2400 

GGGTGAACAA GAAATCCAAG TATTTGTGAA TGATAAAATT GAAAATGTAC CACGAAGACG 2460

GTTGAAATTA CAAGCGAAAG CACAAGATCT ATATCCGCAA GACTATGATT TAGAAAGTTT 2520 

ATTTACAGAT TTCCATGAAC GAAAGAAACT CAAAGATATT GAACGCGGCT CAAAAAAAGC 2580 

GCAGAAACAA TTACGCAAAG AGGCCGAAAA AAGAGCAGCA AGACGCGATA AATAAGGAAC 2640

TTAACAGAAG CAATACAACA ACTTAACACT TTGTTTACTT GTTATTTATC AGAAATCAAC 2700 

TAAGACTTGT TATAGTCAAT GTATGGGTAG ATATGaAGGA GGAAACAAGG AAATGAAGAA 2760

AAGAGCTTTG CTAGGGGTTA CCTTATTAAC ATTCACAACA TTAGCGGGTT GTACAAATTT 2820 

ATCTGAACAG AAAAGCGGCG AAAAACAAAC AGAGGTTGCT GAAGCGAAGG CAACTGAATC 2880 

TGAAAAAGCA TCAGTAAAAA ATGTTATTTT TATGATTGGA GATGGCATGG GGAATCCGTA 2940

TACAACGGGC TATCGCTATT TCAAAGCCAA TCACTCAGAC AAGCGTGTTC CCCAAACAGC 3000 

TTTTGATACC TATTTGGTCG GACAGCAAGC CACTTATCCA GAAGATGAAG AAGAGAATGT 3060 

CACCGATTCA GCTTCCGCAG CGACAGCGAt GGCTGCCGGA GTGAAAACCT ATAATAATGC 3120 

TATTGCACTC GATAATGACA AGTCCAAAAC AGAAACAGTG CTCGAACGTG CGAAAAAAGT 3180 

GGGGAAATCA ACGGGTCTTG TAGCAACATC TGAAATAACA CATGCAACCC CTGCTGCATA 3240

TGGCGCACAT AATGTTTCAC GCAAAAATAT GGCAGAAATC GCCGATGACT ATTTTGATGA 3300 

TCAAATCGAC GGACAACACA AAGTCGATGT GTTACTTGGC GGCGGCTCCG AATTATTTGC 3360 

CCGGAAAGAT CGTGATTTAG TCAAAGAATT TTCCCAAGCG GGTTATGGTC ATGTCACAGA 3420

CAAAAAGTCG TTAAATGAGA ACCAAGACGA CAAAATTTTA GGCTTGTTTG CACCAGGCGG 3480

GCTACCTAAA ATGATTGACC GAACGGAAGA AGTCCCTTCA TTAGCTGATA TGACAGAAGC 3540 

GGCTCTTCAA CGGTTAGATA AAAATGAAAA AGGTTTCTTT TTAATGGTTG AAGGTAGTCA 3600 

AATTGATTGG GCCGGGCATA GCAATGATAT TGTTGGCGCG ATGAGCgAAA TGCAAGACTT 3660 

CGAAGCGGCG TTTGAAAAGG CCATCGATTT TGCCAAAAAA GATGGTGAAc ATTGGTGGTT 3720 

ACAACTGcAG ATCATTCAAC AGGGGGCTTG TCTTTAGGCA AAGGAGATCA ATACAACTGG 3780 

TTGACGGAGC CTTTaCATGC GGCAAAACGC ACGCCTGATT TCATGGCAGA AGAAATTATT 3840

AAAAATGGTA ATGTGGAAAA AACAGTGACT GAGTATATTG ATTTTCAATT AAGTGAGGCT 3900 
GAATTGAAAG CAGTGAAAAC AGCGTCGGAG TCAAAAGATG TTGAAAAAAT CGCTCAGGCA 3960

TTAAGAAAGA TTTTTGATGA ACGTTCGAAT ACTGGTTGGA CTACTGGCGG ACACACAGGA 4020

GAAGATGTAA ATGTCTATGC TTATGGCCCA CAAGCAGAAG CTTTTTCAGG ACAAATTGAT 4080

AATACAGACC AAGCGAAGAT TATTTTTGGC TTAG7AGATG GCACCGGGCA AAAAGCTGAG 4140

ATTAAAGATA AAGGTATTGG CAAATAACAA AATGAAGAAA AAGCAGAACG TATAGAAGTT 4200

CTGCTTTTTC TTCATTGAAG GCATAAAAAT TTTATCTCTG TTATAATGAA GGTAACCAGA 4260

TAAAAGTGAG GTCAAATAAT GAAAAAAATG CATGTTGAAA GTGAATTTGC ACCATTGCGT 4320

TCAGTTGTTT TAGCACAATC GCAATTTTGT TTACCAGATA AATTTGATGA AGCGGATACG 4380

ACTTTTTTAA CAGAGGAAAA TGCCCGTTTA ACACAGAATA ATGAAGGGCG TGATTTAGCA 4440

GACTTTGCAC CAGAACAACA AGTGCGTTGG GAAAAAGAAA AAGAAGTCAT GCAAGGAGTA 4500

TTAGAATCTT ATGGCGTAGA AGTATTTCGT CCACGTTTAT TGACAACCTA TGAAAAAGAA 4560

CATGGCAAGG AATTGGGCGT AGGATATAGT AACTTCTTTT CACGAGACCC TTTTTTTACA 4620

ATTGGCAATC TCTTAATAGA AGGAAACTTA AGATTTCAAC ACCGTCGACA AGAAATTTTA 4680

CCAATTCGTC CGATTATTCA GCAGTGGACA CAAGAAGCAG AAGGCTATTA TTTTGctGCG 4740

CCACAACCTG ACATTTCTGA AGGGGCTTTA AGTGAAGCAG GTCCCTTTAT TGAAGGCGGC 4800

GATGTTTTAG TTCTAGGAAA AACAATTTTT GTGGGCTACT CTGGCTTGGC AAGTAACCTA 4860

GCAGGAATTC AGTGGTTAGC CAATATGATT GGTCATTTTG GTTATGAAGT GGTGCCGGTT 4920

CGGTTACATC CTCATATTCT CCACTTAGAT TGTGCGTTAA GCTTGTTGAG AGAAGGCTTG 4980

ATGATTGTTT GTGAAGAGGC CTTTTTGGAT GGACTTCCTG CGCAATTAGC AAACTGGGAA 5040

AAAATTCACG TTACTTTGCA AGAAGCAGCG TATTTAGTGA CAAATGGTCT ACCGCTTAAC 5100

GAAGAAACGT ATATAACGGA CCAATCGTTT ACAACTTTAA TTCCTCAAAT AGAAGCAAAA 5160

GGTATCAAAG TTGAGGCAAT TGATTACCAT GTTTCTAGAA TGTTAGGTGG TTCCTTCCGC 5220

TGTACAACGC AAGCTTTGAT TAGAGAATAG TCAGCGAGTG ATTATCAATT TTCGTAAGAA 5280

AAAATGTACG TTTTGTCTAT GAAAACGATA TAATTTTAGT TTTCACTATT TTTTTTAAAA 5340

ATATTGAGGA AATTTTGATT TAATAAATGC GAGAAAACGT CAAAATGTTA GTGTTTGTTA 5400

CTAATGTTTT GACGTTTTCT CGCTGTTTTT TTATAAAATT ACCACTTTAT ATATGAAAAA 5460

AAACAATATC TTTTTTGAAA ATTACGTTTT CAAAGACACA AAAAGTCGTT GGAACCTTGC 5520

TGTTATAGTA AATTCATTTT TTTTGCATGA GGTGTTTAAA ATTTGAAATC ACAAGCAGTC 5580

TTTTTTTCGC TATACTTAGT TTACGTTCTA CTGAGATAGA AAAGTTTTTC ACTTTTTTAT 5640

TCATTAATTA ACCAAGATAT AAAGGAGAGA GTAAAATGAA GAAAAAAACA TTGGTAACTT 5700

TATTTGCAGG AACGGCAATT TTAGGGGCAA GCATCGCTCC TGCAGCAGCA CATGCAACAA 5760

CGACTGGTAC AACACCAGCA AACGTAGAGA TGACAAGTGG TTCATTACCA GGCGGTACGG 5820

GTGATGGAGA TGACGGAAGC ACACAAGAAC CAGAACCAGG CTCGAATACA AATTTTGATT 5880

TGTTATATGT TCCAACAGAA TTCAAATTTG CTTCGACAGA AGTTAGTTCT GATTTATCAG 5940

CGATTTCGTT AGATGCGACA GGTACCCAAA CAAAACGTTA TGCCGTTGGG GATGTTCGTG 6000 

GAACACAAGC GGGTTGGTCA GTTACTGCCG GCGTTGCTGA AATGAAAAAT GGCACTGCTA 6060 

CTTTAGAAGG CTCAATTACC TTTGCACAAA CAGGAGCAGT AGCGAAATAC GATGAAACAG 6120 

CAAAAACATA CTCACGTGAT GTaGCTGCCT TTGCAGCAGA TCCAGGAAGT CCTGAATTTG 6180 

CTGGAACAAC TATTCCAGTA GGTGGAGCAG CTGTATCAAT TGCTACAGCA GCAGTTGGTA 6240 

AAGGTCAAGG TACATGGGAT TCTGAATTAT CAAATGTAAA ATTAAACATT ACAACACCTA 6300 

GTTCACAAAT TACAAATGGT GCGTATACAG GTAACGTAAC TTGGACATTA gTGcTGCGCC 6360 

ATAAAAAGTT ACTAACGAGA ATACTGCATT ACTAGAAAAG GAGctGCGCA GTGGTGCACG 6420 

GCTCCTTTTC GATACATAGG AGGAACAAAT AATGAAGACA ATGCGCTGGC TAATCGTCAC 6480

AGTTTTTATT TGTGTGATGG CATTAAGCAT CCCTTCTGCT GTACAAGCAG AAGAAGCGGA 6540 

TGACAACATG AATACAAAAT TTACAGTTAA ACCCAATATT CCTGAGAATC AAGCGAAAAA 6600 

TACGCAATCC TACTACGATC TCATTGTAAA ACCAGGACAG AAACAAGAAG TCGAATTATC 6660 

ATTAATTAAT AAAACAGATG AAGAATTGAC GATTGATTTA AATTTGGCGA ATGCTGTGAC 6720 

GAATGACAAT GGTTTAATTG TGTACAATGA TTTCGAGAAA AAACCAGATT CAAGTTTAAA 6780 

AGTACCCTTG ACAACATTAA TTAAATTACC TGAGGAACAT GTAAAAGTTC CCGCTAAGAA 6840

AACCGTAACA GCCAAAATGA CTGTCGAAAT TCCAGCGAAT GGTTTCCAAG GATTAGTTTT 6900 

GGGTGGTGTT TATGctTCGC TTGCAGAAGA TGAAAAGGAA GAAAAAGGAA AATCAACGGG 6960 

TTTAACCAGT CGATATGGAT TTAATGTAGC AATTGCTTTA CGAATGTCAG AAGATACACC 7020 

GATGTATCTA GCAGAGACTC TTCAATTAGT TAAATTAGTT CCGACCATTG CTTTAGGAAA 7080

TAAATCGGTC CAAGCGGTTA TTCAGAATCC GACATCCGCT ATTTTTCCTG AAGTACGTTT 7140 

AGAAGGAAAA GTAACGAAAA AAGGCGAAAC GAAAGAATAT GCCAAACGGA TTTTACCAAG 7200 

TGTTCGTTTT GCGCCAAATT CGACGATGAA CTTCCATTTA GATTTCGGGA AAGAAGATGT 7260 

TCGTCCAGGA ACCTATATTT TTGAAGGAAA AGCGGTCTTG AAAGAAGATG AACAACAAGT 7320

GTGGCCGTTT AAGCAAGAAT TTACCATCAG TAGTCAAGAA GCCAAAAAAT TAAATAAAGA 7380

AGCAGTAGTG AAATTGGTCC TACCGACTTG GTGGAATCAA GCTTTTTATG GCTTGTTAGT 7440

TGCAACAATT GTGAGTCTAA TTCTAGCTAT TTGGCGTTTT ACGCAACAGA AAGCAACACT 7500 

GAAACAACAA GCCTATTTAC AACAAGAAAA ACAGGCAGCC TTGAAAAATA GACAAGGAGC 7560 

ACGCGATGAT GACAATGAAA AGTAAAGGGT CACTTCTGGT GACGTTGGGA ATACTTTTAA 7620

CCGTTGGCAT TGCGAGTCTA ATTGTTTCTT CTGAGAGTTT TGCAGAAGAA GTAGGGCAAA 7680

CGAATATCGG TGTAACGTTC TATGGAGGAA AAGAGCCACT AAAAACGGAA GGTGTCATTA 7740

AGCCAATAGA GCAACCAGTC ACTGATAAAG ATAAAAAAAC GTCACAACAA CAAGACAAAG 7800 

TGAGCAGAAA AACCACTGCT AAAACGAATC CGACTAATGC ACAGACGTCA TTACCAAGGA 7860 
CAGGTGAACG AAATAGCACG TGGCTTTACA GCCTTGGTAT TGCCTGTTTA CTCGTAGTAC 7920

TAACAAGTTT CTATTATTTG AATAAAAAAA GGAAAAAGGA AAAATAATGT GCTGTTTTTA 7980

TAAATAGAGT TAATGTAGAA GGAGGAATAT AGATGGAGAT TACGATGTGG ATTTGGATTT 8040

GTGCTGGCTT GGCCGTCCTT TGTTTTGCTT TATTTTTACT GTTCTTAGTA AATAGCATGA 8100

ATTTTGTACA TCTGGCAAAG CCTAAGCAGT CTTTAGTAAA ACAATCAATG TTTAGTTGGC 8160

TGCTTCTGGT GATGTGTGGC GGCTGGGTAG CGGTGGCTAT CTTACTGAAT ATCGCATTAC 8220

AAACACAATT AACATAAAGA AAACCCGTTG TAAGTTTTTA AGAAACTTGC AACGGGTTTT 8280

TGTTAACTTT AAAATAAAGA ACGGTACAAT TTATCAAAAA AGTACAGTAG ACATTTGTAC 8340

CCAACTTAGA TAGAAAATAA AGCGCTATCA TTATATGATG ATTACAGGAG GCGGACACAG 8400

CTGTGAATTA TAAAATAAGT TATCTCCTTG ACGAGTATGC AGGGCAAAAA ATTGCGCTCT 8460

CTTTGGCTCA ACAAATAACC GATCTTTCGC AACGAGAATT AGTGACGCAA CTTTCGACAA 8520

TAGGCGGAGT AGTTCAACAG AATACCGTCA GCATTCCAAA CCTTTCCGCA AATGAATGGG 8580

GAAAACAACT TTTTCAACAA CGTCAAGTGA TCTATAGCGA AACAGAAAGA CAAGCACTTA 8640

TTTTTTTGCT CACTTATTCA GAAATCGAAG AACTATCGGT GTATCATTAT CAAAATTTTT 8700

TAGACGTTTC GAAAGGCACT ATTTTAGCGG ACATCAAAAA AGTACGACAC CAACTAGCAA 8760

CTGAAAACAT TGCTTTATCC TACGAACGAA AACGAGGGTT TTACCTAGAA GGAGATGAAT 8820

TTCGGATTCG TCGTTTCGGG AAAAATTGGC TTGCCCGCCT CTTACAACAA AAGTCGGGCA 8880

CCTTCGCTTT ATTTTGCTGG CTCAGTCAGC ATCAAATGAG CCAATACGCA AAAATACGTG 8940

ACGGCATTCA ATTAGCCGTC CAGGAAGCAC AGCTTCAACT GGTACCCAGC AGATTGGATG 9000

AAATTAGTTA CTTTCTTGCT TTTTCTCAAA AAAGATTAGG CAAATATGGA ACAGTCGTCT 9060

GGCCTGAGAA CCAGTTAATT CAGTCCTTAC GCGCTTATCC CGTCAGTCAT CGTATTTTAA 9120

GTGATTTAGT TGGGCCAGTG GCGAATGTAG AAACTGAGAG TCTTTTTTTC ACGATTTGTT 9180

TAATGACAGC CTTGCAAGGA GAATTGCGAG ATACGAAGTT AGAGTTTTTA TTAGATATTT 9240

CTGGTGAAAT TATTCGGAAA ATGGAATTGT TGGCCGTTGT TCAATTTGAA CAGCCTCGTG 9300

AATTGCTCAT GAATGTGTAT TATCATCTGG TTCCTGCCTA TTTTCGGATT AGCTATGGCT 9360

TTTACTTGCC GAATGTTTTA ATCAAGGATA TTCGACAATC GTATCGGTCG CTTTATCATC 9420

TTTGTGAACA AGCGTTAgct CCTTTAGAAA AATTAACGAA ACGGTCAATC CCGTCAGAGG 9480

AAATCGGTTT TTTTACCATA TTATTTGGTG GAGAAATTTT CGGGCAAAAA AATGAACAGC 9540

GGCAAGAAAA GCTAAAAGCG CTAATTGTTT GTCCCAGTGG TATCAGTTCT TCGTTGATTT 9600

TGCAATCGGA ATTGCGTCGG CTATTTCCAA TGATTGAATT TAAAGAAACG AATGCGGTGA 9660

GGGAAATAGA GAATGTATCT GAAAAAAGCT ATGACATCAT TTTCTCAACG GTCCCTATTG 9720

AAACAGCCAA AAAGGTTTAT CTTACCAAGC CCATTATGAG CTCTTTGGAG AAAAATCAAT 9780

TGATGCACCA AGTTCAATCA GATTGGCTAT TGCCAGGTAT TTCCATGCCA GATGTTAAAG 9840

ATATTTTAGA TGCGTTAAAA CCTTATATTT CCTTAAAAAA AGGAGTCACA GAAGCGCAGT 9900

TATATCGTGT TTTACATCGT AAAATGAACA AAATTTTAGA AAAAGAGGAG GATCTGCGTC 9960

CAATGTTATC GGAATTATTA ACAACAGAAA CAGTCCAAGT CCTGCCTGAA GTTGCAGACT 10020

GGCGAGCGGG AATCACAGAA GCTGCCAAGC CGTTATTAAA AAATGGCAGT ATTACGGAAA 10080

AGTACATTGA TGCAATGATT CAAAAAGTCG AATCGTTTGG GCCATTTATC CATATCTGTC 10140

CAGACGTAGC ATTGCCTCAC GCGCGACCAG AAGATGGCGT TAATCAGTTA GGCATGTCAT 10200

TACTAAAAAT CCAAAAAAGT GTCGATTTAT TAGAAGATCC TAAGCATGAA ATTCACCTTT 10260

TTATTTGTTT AGCTGCTTCA GATAACGAAA CACATTTACG GGCCTTATCT AGTCTGACAA 10320

AAATTCTATC AAAAAAAGAA TCATTACATC GCTTAGTGGC GGCTGAAACC GTCCCAGAAA 10380

TACTATCAAT TATCCAAGAA GGAGAGAAAT AAGATGAAGA TTTTTACAGT TTGTCAAAGT 10440

GGTTTAGGAA CAAGTTTTAT GGTGCAAATG AATATCCAAC AAATTTTAGA GGAAGTCGGT 10500

GTTGATACAG ATAATTTTCA AATTGATCAC ACAGATGTCG GGAGTGCCAG TGGCGATATG 10560

GCTGATTACT TCTTTGTCGA AAAAACATTG CAAGATGCTT TAGGGTCAAT TCCTGCTGAA 10620

AAAATCATTC CTTTAAATGC GATTATTGAT TACGATGAAA CGAAGGAGAA AGTCTTGAAA 10680

GTTTTAGATG ACAATGGCAT TGCACATCAA TAAGAAGGAG GAAAAACATG GAAACTTTAT 10740

TGAATTTATT TGTTTCAATT GTTAGCCAGC CAGCGATTCT GGTTTCATTA ATTGCTTTGT 10800

TGGGATTAGT TTTACAGAAA AAGAAATTTT CTGATGTTGT TCAAGGAACG TTGAAAACAT 10860

TCGTCGGCTT TTTAGTTTTA ATTGGTGGCG CAGGGCTATT AGCGAACTCG TTAACGCCCT 10920

TTGCGGAGAT GTTCCAAACC GCGTTAAATA CACAAGGCGT GGTTCCTAGT AATGAAGCGG 10980

TTGTAGCGAT TGCCTTGCAA CAATATGGGA CACCAACTGC TCTGATTATG TTGGTGGGCA 11040

TGGTTGTTAA TATTCTGTTG GCGCGTTTTA CACGTTTTAA ATATATCTTT TTGACAGGTC 11100

AAGCGATGAT GTACGTTTCG TGTTTGACAG CCGTTATTTT AGTGGGGGCC GGTTTTAGCG 11160

TTAGTTTGCC AATGATTCTT TTAGGTGGTT TATTTGAAGG AACTTTATTA ACAGTGACAC 11220

CTGCTCTTTG TCAGCCCTTT ATGAAACAAA TTACTGGAAA TGATAAAGTA GCAATGGGGC 11280

ATACAGGGAA TATTGGTTAT GCGGCAAGTG GCTTTATTGG AAAAGTCTTT GGGAATAAAG 11340

AAAAGTCAAC GGAAGATATT AAAATTCCTA AAAGTTTTGG TTTTTTAAGA GACTCAACCA 11400

TCTCAATTAT GATTTTGATG TCGATTGTCT ATGTGATTTT GGCTTTGCTG GCTGGGACAG 11460

GCTATGTCGA GCATGAGTTA AGCAATGGCG AAAATGCGAT TATCTTTTCA TTAATTCAAG 11520

CAGGAACCTT TACGGGTGGT TTCGTAGTGG TTTTACAGGG TGTGCGGATG GTTTTAGGAG 11580

AAATTGTGCC AGCTTTCCAA GGAATTGCTA AAAAATTAGT GCCTAATTCT AAACCAGCAC 11640

TTGATGTTCC CATTATCTTT CCTTATGCGC CAAATGCTGT CTTGATTGGG TTCTTTGTGA 11700

GCTTTATTGT GGGAACTATT TCAATGTTAG CGATGGTTGG TCTGAATACC ACCGTGATTA 11760

TACCAGGCGT TGTCGGGCAC TTTTTCTGTG GTGCCGCGGC AGGTGTTTTT GGAAATTCCA 11820

CAGGGGGAAG TCGGGGAGCG ATTTTAGGGG CTGCGTTTAA TGGCTTATTA ATTAGTTGGT 11880

TACCGTTATT TATTTTACCA GTCTTGGGTG ATTTACAGTT AGCATCCTCA ACGTTTGCGG 11940

ATACAGATTA TTTAATCCCA GGCCTATTTT TAGGCAAATT AGGTGAGGCA GGTCCGAGTC 12000

TTTTAGTAGG AGGAATCATT GCTTTTGTAG TGGTTGTTCT TTTGGCTAGC GCTGTTTTGA 12060

CTATGCGGGA GAAAAAAGTC GCCGAAGAAA ACTAAGTGTT TTGTTCTAAA AAAAGGCAAG 12120

TGGGACAGAA GTCGTTTTGA CTTCTGTCCC mTTTGTCTTT TACCATTGTT CTTGGATAAA 12180

CTGTGCAATT TGTTTCCGCG CTTGTCGTCC TTCTGGCAAG TATTGTTTCA TTGCGAAAAC 12240

GTGTTGCATG CCAGGCACCA ATCGGCAAGT ATCTTGTGGT CGTGGCCCTA ATTGTTTTAA 12300

AAATTGGTGA TAGGAAGGGC AAAAAACTTC ATGTGTGCCG CCCACGAAAA AAAGTGGCGG 12360

AAATTTTTGA TAATCAAAGG ACAAGCAATT TAAAGAAAGT GTGGCGTGAT TGGCCGTGTC 12420

GTCAATCTCT TGGGCAATTA ATTGTAAATA GTTCATGCCA ATCATTTTGT CCTTTTTTTG 12480

ATAACTTTTG GCAACAGGAA TGTCTAAATG TCCATCTAAC CAAGGAGAAA TTAAAACAGC 12540

AGCTTTGGGT AAGCGTTCAG GGAAATCGGT GGCTAAATTC TGCAATACGT TTAAAACCAA 12600

TGTGCCACCA GCAGAATCAC CTAATAAAAT AACATCTTCG GGATCATAAT GCTGTAAAAT 12660

TTCTTGATAA GCAGCATAGC AATGATTATA AATATCAGCA AAAGACTTAT TTGGTGCTAA 12720

TGGATATACA GGGATAAAAT AATCTGTTGC aAAAGGGAGA TCTTCAATTA GTTTATAACT 12780

AAAGTCTAAT TGGCTATAGG CAAGATTCAA TGTAAAGCCC CCACCATGCA GGTACAGTAC 12840

AGCTTTCTTG GTAGGCGCAA TAACGGAGTT TGACATGCGA TACATCGTAG GAAATTGAGT 12900

CATTTCTTGA AGAGGAAAAA GATGACGTAA ATGTTTTGGC AAAGTTACTT GTTCATTTAG 12960

TGATCGTTGC CAATGATAAA AAGCCTTGGT GACTTTCGGA TTTCGTTCAA GGCGAGCAAT 13020

TTGGGAAAGA CGTAAAGAAT CTTCAAAAAG ACAGGCTTTT AAGGATTTTT GACGAGAACG 13080

TGCGTGAAAC AAATATGCGC TAGCTAACAA CAAACCAATC GATTTCTTTT TCAATTAAGG 13140

CCCTCCATTC AGCAAAAAGT AAATCATTTT TTCTACAGTA TAACATTTTA AAAGTAGAAC 13200

AACGAAATTT GTTGAAGGGG TGAAAGGACG GTTTAAACGA ATGTACGTAT TTATTTGTGG 13260

TTGCGTAAAG TAAATGATTG TAGTAAATTA AATGAGATTA AGAGGTCCGG TGAATGGGCG 13320

CGTTCACGGC AAGGAGAGAA GTAGGATGAC AAATAAAAAG GTTACAATTA AAGATGTTGC 13380

CAATGATTCA GGGGTCTCCA TTACTACCGT TTCACAAATT TTAAACGGGA ATGGGGCACG 13440

GTTTAGTGGG AAAACGGTCG AAAAAGTATT GGCTGCAAAA GAGCGATTAA ACTATCAGCC 13500

CGACTATTTT GCGCAACGTA TGGTGATGAA AAAAAGTAAG ACGATTGGCG TAATCGTACC 13560

TGACATTACC AATCCATTTT TCGCGCAATT GATTCGAGGA ATTGAAAGTG TTTTATACAA 13620

AGAAAATTTT ATACTGATGT TATGCAATGC GGACCAAGAT GTTACAAGAG AACATGAATA 13680

TTTAACAGAG TTGATTCGTC GTAGTGTCGA TGGCTTTGTG ATTGCAAGTT CTGAAATTTC 13740

CAATCAAACG ATTAATGAAA CGCTACGTGC GAAAAAGATT CCGTTTATTG TACTTGACCA 13800

AAAAAAAGCA GAAGGCTTTA GCGATGCTGT TTTGACAGAC GATTATAGAG GTGGGCAACT 13860

AGCAGCAAAG CATTTACAAG AGCAACGACA TGAACAAGTC ATCGTGGTAA TGCCGCCACA 13920 

TGCGCCAGTC AATATTCAGC AACGGTTAAA AGGTTTTTGT TCTGTTTATA CGGAAAAGGT 13980 

CCAGTTAATC GAAACAGAAT TATCAAAAAC CGGCGGCTAC CAAGCTGTCC CCGAAATTTT 14040 

AAAAACAGAG AGCACCGGAA TTTTTGCTAT CAATGATGAA ATTGCTTTTG GTTTATATCG 14100 

AGGATTAGCA GAAGCGGGCA AAAAAATTCC TGAGGATTAT A 14141 
(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 



ATnATACCTA TTATCCATTA GGCGACATAC GTCCTTTTCC GTTTAAAAnA AnnTAAATCC 60

ATATTTCCAT TTTTACCAAT ATCCGAAGTC TAAATTAAAg TAmGAAGATA ATcCGTAAAT 120

ATTCCATATT TTGAATGAAG GGTAATTCTT CTTTTAGCaT TGAATCaTTT AATAATAATT 180

CaGTTGCAAA TATATTTGCT TCATTTTCTA AAGTAGCTTC AAAAGCTAGG CTGTGCATAT 240

CTATACAGAA TACATTTAAA TGTTTGTGTA AGAAATAATG TCCTAGCTCA TGAGCAATTA 300

TTTTATCTTG TTTTGTTTCA TCAGTAATAT TGCTATTAAT GTGAATGATG TAAGTGTGAT 360

TATAGTATTG TAGAAAACCT TCTACATTAC CTAAGTTAGA TTTTAAAACA TGAATTCCTA 420

AGAATTCTGA AATTCTTTTT GGATCATTAG ATGAATAAAA ATTAATTATA CCTTCAACTt 480

CGCTTTTTAT TTTTGAATAG ATGCTACGTA TCGGACTCAC TCCTTAGTCT TTTTTGTCAT 540

ATCGTCAACA GTTTTTATTG CATTTTTCAT TGCAATTTTT AAAAGACGTG CAGTTTCTTC 600

GTCAATATCG CCGTTTTTTC CCATAAATAG AAGAGTATCT TCATTTTTTA ATTCTTCAAT 660

AGTTTCTTTA ATAGTACTAT CCAATTTAGT GTCATTTTTA TCTGGTATAT CTAGCAAGTA 720

ATTTGCGCTT ACATTAAAAT ATTCAGCAAT tTTTTTATTG TTTCTGGaTT AGGAGtAGAA 780

CGATTATTTT CATAGTCACT AATTGAACTT TGTCTTAAAT GAAGAATTTC TGCAAGTTCA 840

CTTTGCTTGA TTCCTTTTTT CGTTCGTAAC TGTTTTAATT TATCGGAGAA AGCCATAATA 900

TCACCTCTTT TCTGCAATAA TATCACTAAA TGTTATATGG GTTAACTTTA TTTGTAAAAA 960

AAGAATATTA GTTGTTGACA AAACGTAAAA TACGATATAT ACTAAAAaCG TAATAAACGA 1020

TATGGTTTAA AAAAAGGAGG TAAAAAACGT TATGAAAATT AAGTTTTTGA CaGCAAATGA 1080

AGTTAGTGaA TTAATGAATT GTTCAAAAGC TAAAGCCTAC GGAATTATCA AAGAATTAAA 1140

TAGCGAATTG AAAGAACAAG GAATGAAAAC TTCAGCAGGA AGAGTGAATG AACAATTCTT 1200

TGCTAAAAAA TTTGGTTTTG AGCATGAGTA ATAAAATTGG ATATTTTGTT TTATTAGAGT 1260

ATCTGATTAT ATGTCGGATA TTCTCATAAG GCAAAAAGCC TTAATAAAAT TAAATAAACA 1320

AAAAAGCGTT CACGAGAAGA AAATGAATTT CTGTTTCGCA AACACTTAAA GTAATACCTA 1380

GACAATATTA TTTTAAGCGA TTTGACsTAA TAATACAAGA ATGAAATTTT TGTTCGAGGT 1440

TATAAGTCTT GTTGAGGTGG TGTGTGCCAG CCGTTGAATG GAACACAAGG TGGCAGTAGT 1500 

ATCTTTAACT AGAGCCATGT TGATGACAGA TTAGTGCCGA GCTGTCAGAT TATTTCCGAT 1560 

AACCATGTCG TGATGACACG TTGGGAATAG TTGCTGAATT AGATTGAATG AAAAAATCTA 1620 

TGCAAGCTTA GCAAGAGGAG GAGAGCTGTA TACCGTATGT CGTAGGTCTA TATAACGTGC 1680 

CGTGATGTAA TTTTAATTAC TAGCCGTTGA AAAATGATTC TATGAAACGA ATGGATAAGG 1740

GGCTATTTCA TAGCTAAGGC GAACTTTTAG TCAGTTATCT GCAGAAATGC ACATTCTACT 1800 

ACGATGAATT TTAGAATCTA AATTGCTATA TAAGTAAGAT TGTTATTTGT AAGAGTGGaA 1860

AAGATTTTTT CTAAAAGGAT AACCAATCTT AGTTTAAGTA TGCACAGCTG GATTACTCTG 1920 

GTTGTGCTTT TTTATTTGAA AGAATAGAAT TTGTTATAAA AAATAAGGAG GATAGGAATG 1980 

AAAGAAGAGT TATTACTTTT TGTTGAGAAG TTTGTTGCAC GAATGAAACG TCAGAAAAAA 2040

GCATTCTCTA TTACTGATAT AGAAAAAAGC TATAATTTGG AACGTAAGAA GTTAGGTAAG 2100 

AGTGCTGTTA AGTTAACAAA CATGGAACGT TTAACAATTG AAAGTCGATT ATTAAAAAAT 2160 

CAGATTTTAC AACGrACCTA TAAAATGACA GGyTATCATA ArCCTTGTCA AGTTGTGTTT 2220 

TTTAGTTAGy AATAAGAATT GCATAAAGAG AxAAArGCAG TyAGTTGTTG ACTGTTTATT 2280 

TTTrTAyAAA AAtTAAKTGA AAGAAGGAAA TTATTTTGGC AAATAGAATT CArGAAATTG 2340

CATTAAswGA wAAAAATyTA GGAAGTAAAA TTAAAGCTrT TTATAAAACA GAACGTGATA 2400

TTTATGTrAA TGGkGCrCCw GTwCGyGGTC GTGTrTTyAC yGCTTCAAGC GAAAArCAAC 2460

GTGATTTTGA AATTTTTGTC CCTCATTTAC GAAAyTCACA AATGTTGAAT GGTCGTGAAG 2520 

AAGTTAAATT AATTGATCCA GTAGCACGAA TyAATGTGAA aCGAAATAAC ATTGGTAATA 2580 

GTCGCTCaAT TGATATGGTA GTTTATGCaA AACGTTTAGA AGTAGTTGGA GGTAATAAAT 2640

AATGGGATTA ACATTTGAAA ATAATACrAT TGAAAATTTT GACGTACAAG syACATTTGG 2700 

TAyGATGAAT TTyTTAGAAG TAGTkGCAAT TAATGArCGT GATGGwGAAG rmAATGTCAC 2760

TGAmGAAGTT CGTGAACArC GwGTGACkGT TTATTCTAGC AAATTAAATG ACCAAGTAGA 2820 

AATTGyGATT GAyCCAGATT ATGATGTyAG TGGTATyAAy TATGATGATG ArATTGAATT 2880

GACTGGAAAT GTGACTGCTC GTGGTTGGCT TAATACTTTy AAAGGsTwyA ATGACAATGT 2940 

rCArACwGAA CAAGCyTTTA AAGTACGTkC rGCwGGTATT AAAAArGTAG GGGCTGTTAA 3000 

ACmAACrCAG rCACCTCArC CAAaGgAAAA ACAAGAAAAT AAAyAATAGA AgTAArTmAA 3060

TATTGTATTC GGACrAAtCG GTAGCGCCGT TTAACAGCTG TrAAGGGCAT TCTGACGAGT 3120 

GCCCTTTTTA TTATTAATTT GTGTCAACAA GGTTTGTTAA TAGACATTAA TAATAAAAAA 3180 

TGAGGTGGTT AGTTAAAAGA TGAATCGATT AGAGGAATTA ATCAAGAACC CAACAAAATT 3240

TTGATTCATT AAGGGAATTA TTTGTTACTT TTGAAACGAA 3300

GATATGATTA TJSCTCGCAGA TACTTAACAC AACTATATTT 3360

ATTTAGTTCA AAGCATTTTG AGTGAGTTCA AAAAGAGTGG 3420

GAGTGATTGG ATGAAGCAAG GGAAACGCCC AGTTAGAAAA 3480

TCGCAGATTG AATGCGAAGA ATTGGTTAGT GGAAAGACGT 3540

TCTTCATAAA GAAAGTGGTA ATTTGAAAGA GATTTTTTAT 3600 

TGGAAAAAAA TAATGAAAGT ACTAAATTAT TGCAAAGAAA 3660 

TTGAAGGTGA AATGGAGTTT TATGTGTTAC GTCCATTATT 3720 

AAGCAGTAGT TATGACATTT CAAGATGTTT ATGaTAATTC 3780 

CAGAGGGATT ATATCAAACG ATTGTTCGGT GGATTGAAAA 3840 

ATGTAAACTG AGAGGGGGTG TCTTGTAAGT TAATGAGTAG 3900 

ATAAACAAGT AAGGTATCGA GCGTTTTTAG AAAGAAGATT 3960 

AAAGTTTTAA TTATTACCGT GACTTGTACT TAAAATTATT 4020 

TTATTCAATT CTTATTACTA GATGATTCTT TTTTTGAAAG 4080

TGTTAGATAG TTTTTTGGAT CAATTAGTTC GTGCGTACGA 4140 

TTGAAAAGAA AGTGTATTTT GAAGAATATC CATTAGGAAT 4200 

GATGGTTTGA TGGCTTATAA TGGCTTTAGA ATAGACGTTA 4260

AATGACAATG AATGAAATTA AAGAGCGTGT GAAGCCAATA 4320

TGTGATATAT GTGTTTGGTT CTTATGCTAG AGATGAAGCG 4380

TTTTTTGGTT GGCACTACTG GTTTGCCAGA AAAATATCGG 4440

TTTCGATGAA TTGAAGGAAG CCGTTGAACA TGAGATCGAT 4500

TGAACAACCA ATTGATTCAG AATACCAGAA AGAGTTTTAC 4560

AGTTAAAGTA TTTGAAAGAG AAAAATAGAG ATATTCGTGT 4620

ATTGTGAAGA TGTAAAATCG GCACTCAATC GTTGTAGCAG 4680

ACGATAGAGT GtTTTTTCAT GCAATCAGTA TGTCTTGCAT 4740

ACTCTTTTTC TGAAGGGTTT AAGGAAAGAA ATAGTGGTGC 4800

GGTCAATTAG AAATATTTTT GCACATGCGT ATAGTAATAT 4860

AACTTGCAAC AGTCTTTTTT CCTGTTTACG AAGAATTTTG 4920

TACAAAAAGA AGGTAGGTAA ATGTTTAAAA AATTATTTAG 4980

GTTATTCTTC AAGAAATTTG TTGGCACTCn ATCGAGTGTT 5040 

TTTGTTTAGG CTATTTTGTG GGCTATAAAT GGCTTTATCC 5100 

CTGAtTGGAA AATTTATATT GTGCCAGCGT TAATCATTGT 5160 

ATGGGGnAAT 5190

(2) INFORMATION FOR SEQ ID NO: 396:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 

CAGATAGAGA AAAATATTAC CCCGCATACA TTGCGGCATA GnTTTGCGAC ACATTTGTTG 60 

GAAAATGGCG CAGATCTTAG AATCGTGCAA GAATTATTAG GAnATGCAGA TATCTCCACG 120 

ACACAAATTT ATACACATAT TACCAAACAA CGAATGGCAG ATGTTTATAA GGAACATTTT 180 

CCTCGGGCTT AATTTGCGAG AATGCCCACA AACGAATAAA TTTATGGTAA ACTAGGAAGT 240 

CGAGAAAAGA AAGTAGGTGT CGTTTTTGCA AGAAATTAAT TTAAAACTAG ATGTCTTTGA 300 

AGGACCACTG GATCTTCTAC TACATTTAAT TCAAAAATTA GAAATCGATA TCTATGACAT 360 

TCCCATTACA GCAGTAACTG AAcAATACAT GTCTTATATT CATGCGATGC AAACCTTAGA 420 

GTTAGAAGTC GCTGGAGAAT ATCTAGTCAT GGCTGCCACC TTAATGGCTA TTAAAAGTCA 480

GATGTTGCTT CCAAAACAAG AGCTTGAAAT CATTGATGAT GAAGATTTTT TTGAAGAGGA 540 

AGACCCACGG GAAGCTTTAG TAGCTCAGTT ATTGGAGTAT CGGAAATTTA AATATGCAGC 600 

GACTGTTTTA CATGAAAAAG AAGAAGAACG TAAACTCTAT TATACAAAAG AACCAATGGA 660 

TATGGATGAT TATAAAGAGG AAGATACAAC ACTACCGCCT AATCAAATCA ATACAATTGA 720 

TTTATTTCTA GCCTTTCATG CCATGTTAGA AAAGAAAAAA AATCGCCAGC CTGTTGAAAC 780 

AACTGTTGCC AGTGACGATG TTTCCATCGA AGAAAAAATC TCCGCTATTT CAGAGCGTAT 840

GCGTCaAGTA CAGAAAGGGA AGGCAGTGTC GTTTGATTCT TTCTTCGATT CTTATTCCAA 900 

ACAAGAAATT GTGACAACGT TTATGGCTCT TTTAGAATTA ATGAAAACGG GCgCGATTTA 960 

TGCTGAACAA GAAAATAATT ATAGTGAAAT TTTGTTATTT AATACAGAAA CCCAGCAGGA 1020 

AGATACAACA GAGGTGGAGG AAACACAGTG ACATTAGTTA GTCAAATTGA AGCAATTTTA 1080 

TTTGTGGTTG GCGAaGAaGG AaTTGGCTtG GAaGAGCTtG CGTACTTATT AGAGAAGTCA 1140

ACAGCAAAAA CCTATGAAGA ATTAACGAAA TTGAAAGAAC ATTATGCTTC TGACAATAAA 1200 

TCGGCGTTGA ATATTTTAGA AGTCGGCAAT CATTTTGTCT TAACAACTAA GAAAAAATAT 1260 

GCCTCtTACT AAAAAAATAT GCGCAATCGC CTATGTCGAA CGCCTTATCA CAAGCGGCGC 1320 

TAGAAACATT ATCCATTATT GCTTATAAGC AACCAATTTC GCGTATTGAA ATTGATGAAA 1380 

TTCGTGGCGT ACAGACTTCT GGCTCTATCC AAAAGTTAGT TGCCCGTCAG TTGATAGAAG 1440

AAAAAGGCCG CGTAGATGGT CCTGGTAGAG CAATTTTGTA TGGCACAACG AAGTATTTTA 1500 

TGGATTACTT TGGGTTGAAA AGTTTAGATG AATTACCAGA TATCCAACAA ATGGAAGATG 1560 

AATTAGAAGA AGAACTACCA ATGGATCTAT kTTTTGATCG TaTCAAGAAm CGAACCCAAT 1620 

GTCTGAAACA ACAGAAGGCG AGGAAGCTTA AACAaTGGaA AGtTACmAAa AggCAtTGCT 1680

CATGCTGGGG tGACTTCgCG TCGCAAAGCa GAGG 1714
(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397:

TTCAATCACT GCAGAATGCA TCTCGTGGAC TTCCATTACT TGGCCATCTT TTTGTTCATC 60 

GATGACAAAG GAACCGTTGG ATGTTCGTTC ACTGATTGGA TAGTTTTTCG ACTGAATAAC 120 

TTCTTGTGTG CCTTTTTGCG TAGTAACTAA TAAATCTAAA TCGGTACTTT CAGACATAAA 180 

GACAACACGA TGTGGATTTT TCTTCAATTC CCGCAAGACC ATTAGTCCAC GTTTTGCTCG 240 

TCCTAATTGT GTTAATTCTT GGGCTAACAT GCGTTTAACA CCGCCACGTT GGGTAACGAT 300 

GACAATCGGT GtATCCCCTT CAGAATGAAC CAGTAAACCA TTAACGACAT AATCGTCTTC 360 

TTTGAGGTTC ATTGATTTAA CTCCAGCCGC TTTACTACCC ACGACTGGTA CTTCATATAA 420 

TGGATAACGT AAGCCAAAAC CGCGATTGCT AACGAGGAAA ACATCCAATA AGTCTTGATC 480

ATTTGTTAAA TAAACATTGG TAATTTCATC TTGGtCTGAT TTTAATTTCA TACAAGATGT 540 

CGGGCGGCTC TTATAAGTGC GCCAAGGCTC AAAATCAGCC ATTTTGGTTT GTTTAATCAT 600 

GCCAGCTTTT GtAATGAAAA CAAAGGTTTT TGTCGGACTT AACTCTTTaT AAGGATATAC 660 

AGCAATAATC GATTCATCGA TAGCaAGATT TAAAATGGkT TGaGAAATAT GTTCGCCnAT 720 

TTCyTTCCAG CGTAAATCTG GCAACTCATG GACGGGGCGA TAGATCATAT TCCCTTTATT 780 

TGTTACTAGT AAAAGATGAT CTAATGTATT GACTTCGCCA GCATATAATA AAAAGTCGCC 840 

TTCTTTCATG CCGATTCTTC TGGTTTTGAT GCTGTATAAG AACGA 885 
(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2927 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 

GCAGCCACAA TATCAGCAGC CGCAACAACC GGTTGCGcAc AACTtGGAGC TCTTGTAAAA 60 

GCGCACTGCG TTCCTTTTCT TTATTCTTTT GATCTTGAGA ATCTTCTAAA AATGCCGAAA 120 

AGAAATGTTG GGAAGAGAGC GTAATCAGTT TAGAAATGCT CTTGATGGTA GCTTTATGTT 180 

GATCCATTCT TCTGCCTCCT TTACGAATAA AATAGAATAA AACTCAAATG ACTAATTACC 240

TGTATTTTAC CTAATTTTGT GATAAAATTC AAGAAAATAT GTTCGCCTTC AATAATTATG 300 

AGAATTTACC AACGTTTCTG TTTTTCGAAA AAGAAAGTAG TATAATGTGT GAGCATATAA 360

GAAATACATA AGATTATCGG ATGTAGTCAA TAGATAGAAA AGGATGAATA GTATGTCAAA 420

AAAACCATCA GGTTACCAAA AAAAAGTTCT CACGGTGTTG TTGGTATCGT TTGGCTTTGG 480

CGGGTTGTTA GCGCAGTTTT CGCCGATGAA ATCAGGTGGG TTTTCAAGTT GGTTTGATGA 540

TTCATATGTC AAAGCTAGTG CTGAATCAAG TAAAACAAAA GAACCAGCAC CAGTTAAAAT 600

TGAGAAAAAA GTCAAACCTT TAAGCTATGG ACAACAGGTC AATCAAGAAA TTGAAAAGAA 660

GCAATATGAT GGACATCTGG ATTTGCCGTT AGAATTGCAG ACAGATGCTA AATGGAAAGA 720

CACCGCATAC GGATTTGGCA ATGTGGATAA GCCGAATACA ATCGAAATTA ATGGCTGTGC 780

GATTGTATCG CTTGCAATGG TTGGTTCATA CATGGATCAC CAAGAAGTTA CCCCTCTGGA 840

TGTGTTAGCT TGGGCAAAAA ATGACTTCTT TATGGAAGGG CAAGGGACGG CGTGGTCTAT 900

TTTTAGTGCA TATGCTGAAA TGAAAGGCTA TAACTGTCAA GAAATTGGGG ATATTGAAAC 960

AGTGGCAGCT TTCTTGAAGG AAGGTCATCC AGTCATTATT TCTGTAAAAC CGGGCTATTT 1020

CACTACAACT GGTCACATTA TGGTGATGAG TGGTGTGGAT GAAAAAGGCG ATTTCTGGaT 1080

TAATGATCCA AACGATTCAG AAGAAAAGGG CCATTCAAAA CGGACATTTA CAGCCGAAGA 1140

AGTGATGAAT GAAGCGTTAA ACTTCTGGGC ATTTTATTAG GAATAAGAAA AGGTGTGGAA 1200

CAACAGGCAg TACCCTTTGT TCGCATCTTT TTTAGCAAAT AAAAGGAGGC AGGACAATGG 1260

AGCGATGTAC TTGGGCAACG AATACAACTG AAGAGATGCA AGCCTATCAT GmTGAyGAAT 1320

GGGGACGTCC AGTTCATGAA GAGCAACAGT TATTTGAGTT ATTGACCTTG GAAAGTATGC 1380

AAGCAGGCTT AAGCTGGGCG ATTATTTTGA ATAAACGAGA AACGCTTAGA GCAGCTTATG 1440

ATGCGTTTGA TTATCGGAAA ATTGCGCGAT ATGATGAGGA AAAAATTTTA GCTTTACTAG 1500

CAAATCCTGG TGTGATTCGT CATCGTTTAA AAATTCAAGC GACCATCACG AATGCGCAAG 1560

TTTTTCAAGA AGTTCAGGCA GAATTTGGGT CCTTTGATCG CTACCTCTGG AACTTTGTCG 1620

ATCAGCAACC AATCGTTAAT CACTGGCAGC ACCCAGAAGA GGTTCCAGCC TCCACGGAAT 1680

TATCGCAACA GATTAGTCGA GCGTTAAAAA AACGAGGATT TAAATTTTTA GGTGCCACGA 1740

CAGTCTATTC CTTTTTACAG GCGGCGGGAT TGGTGAATGA TCATTTAGAA ACGTGTCAGT 1800

ATAAATAAGT ATAGAAGAAT TAAAAAGGAA GGAACTAGCA AAACGAATGG CTAGTTCCTT 1860

CCTTTTTTTG CAGTGATTAT GGACACTCAT TTAACATCGT TGAAACAGTT TTCGCCAACT 1920

TTTCTATTAA GAATAGTGCA GGTAGTTGGG ACGTTACGTC TGCAATGGAA CTTTTCTCGG 1980

TTGTTATATA GTAGGGGATC GTGACTGTTG AAAGTTGAGC AATCGTGCTT TTTTCACTAT 2040

TTGTAATAGC AATTACATCA ATCCCCATTT CGATAAAATA ATCAATATTT TGCCGAACTT 2100

TTTGTGTTTC ACCCGAAACT GATAAGGCGA TAACACCCAC TTTTTTAGCC GTTTCTTCGG 2160

ATAATTTGCT AACGGGATGG GAAAAAAGGT GGGATACAGA AGCAGATAAA TTATAGATAC 2220

TGGAAAAATA AATTTCACCA AATTCAGCTA GGACTTTGGA CGTTCCTTCG CCAATAAAAA 2280

GAACAAATTC TTTTTTTGCC AGTAATTCCG CAGCTTGAAC AATCCGTCCT TCTAATAATT 2340

CTTCTGATGA GCGTTGCAAA AAATGAATCA GCATTGTTTC ATCTAAAGAT TTTGGTTGGG 2400

CTGTTTTTTT TCGTTCGCTT AGATAATTCT TCAATTTAAA TTTAAAATCT GTATAGCCTT 2460

CACACTCGAA TTTTTGACAA AAACGCCAGA TTGCCGTTTT ACTGAAATGA ATTTCATCTG 2520 

CTAATTCTCG AATTGTCATA TAAGGAACTT TTTCAATATT GTGGCTAATA TACTTGTAAA 2580 

TCTCTAATTC TGTTGGATTT AATTGTGGAT CGTAGGTTAA AAACAGCATG CTGATCCCTC 2640

CTGTCATTTG TGAACGTAAT GTCCTTTTTT AGGGACATGG TCTTATTTGT AGATTATAGC 2700 

ACTCAATCAG AAAACTAGCT ATAAAAAgGC GCCACTTAGC TACAATCAAT TAGAGAAACA 2760 

ACTAAAAATT CATGAAACAG AAGAGAGAGT GGGGAaAAAT GAGCAATAaA AAATtAATTA 2820 

TCAACGCTGA TGATTTTGGT TACACGCCAG CGGTAACGCA AGGAATCATT GAAGCACACA 2880 

AACGTGGCGT GGTTACTTCA ACAACTGCAT TACCAACTTC TCCTTAT 2927



(2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 

AAAAGTTATT TCTGCTTAAT GATGGTCTGG TCTTTTAAAC AGTCAATTTC TAAsAACCCT 60 

TAGTTATCTC CATAATTTTT TGGAAGATGT GAAGATAACT AAGGGTTCTT TTTCTGTTTT 120 

CAATAAAAGA AAGCAGGATT CTATAGAAAA GGCTACTAAA AAAgAGCGGa AATATGCTAT 180 

TATTATTAAG AAGTAAAAAA ATATAGAAAC AAAGGGATGC CGATGAAAAA GTCCTACCGT 240

GTCAAGAAAG AAAAAGAATT TCAACAGGTG TTTAATAAAA AACAATCTTG TGGCAAATCG 300 

TCGTTTCGTG GTGTACGTTT TAGAGAAACC ACaACAAGCC CATTTTCGAG TGGGGATTTC 360 

TGTTGGGAAG AAAATTGGAA ACGCGGTCAC AAGAAATGCT GTGAAgCGGA AAATCCGCGC 420 

GAGTTTATTT CAATTAAAAG ACCGTATCTC TCCAGAAATT GATTTTATCG TGATTGCACG 480

TCCAGGATTG GAAAAGTTAT CGTCTGAAGA AGTGAAAGCT AATTTAACAC ATGTGTTAAA 540

TTTAGCTAAA ATATTAGATG TAAGAGAGGG AATTGAGTAG TGAAGAAGTA TAAGCGCTTA 600 

TTATTAATGG CTGGGTTAGT GACCTTAGTT TTTGTTTTAT CCGCCTGCGG AACCGCGCCC 660 

GTAAGTGAAA GTAGTACAGG AATTTGGGaT CGATACATTG TTTATTATTT TGCCCAAGCG 720 

ATTAAGTTCC TATCATTAGG CGGCAGTGTC GGAATTGGGA TTATCCTATT TACGTTAGTC 780 

ATCCGGATTA TTTTATTGCC GTTGATGCAT TTCCAAACGA AGAGTATGCG TAAAACACAA 840 

GAGTTGCAAC CGCAATTAAA GGCCTTGCAA CAAAAATACT CTTCAAAAGA TCCTGAAACA 900 

CAACGCTTAT TCCGAGAAGA ACAGCAACGT TTGTATGCAG AGAATAATGT GAATCCTTAC 960 

ATTGGCTGTT TACCACTGTT GGTTCAACTA CCAATTATGA TGGCTTTGTA TCAAGCCATT 1020

TCACGCGTGC CTGAGTTAAA AGAAGGAACG TTCTTGTGGT TAAGCCTCGA TAAGCCGGAT 1080

CCGTATTTAA TTTTACCGAT TTTAGCGGCT GTCTTTACGT TTGCTAGTAC GTATCTTTCA 1140

AGTATGAGCC AACTAGAAAC AAATGCCTCA CTAAAAATTA TGAATTACGT TATGCCAGCG 1200 

ATGATCTTCT TTATGGGGAT TTCATTAGCC AGCAGTTTAT CTCTTTACTG GGTGGTCTCA 1260 

AATGCCTTCC AAACAGGGCA AACATTGTTA TTAAACAATC CATTCAAAAT CCGTAAAGAG 1320 

CGGGAGGAAG CAGCACGTCA AGCCAAAGCA CGGGAGCGAG CGCTTGAACG AGCAAAAAGT 1380 

CCTAAGAAAA AAGGGAAGAA AAAATAAAGG AGGTCGTTCA GATGCCGATT TATGAAGGAA 1440

ATACAATCGA AGAAGCAACG CAGAAAGGCT TACAAGCACT TGGTTTAACC AAAGAGGATG 1500 

TTACAATTGA TGTGTTAGAC GAAGGAAAAA AAGGATTTTT AGGTCTAGGA AAAAAACTTG 1560 

CCCAAATTTC AATGGAACCA ACGATTAGTG AGCAAGTCAC AGAAGCGGTC GAAGAAACAG 1620 

TGGAAGATAT TGTTGTTGCC GATGAAGCAA CAGCAGTGGA AGAAGCAGTT GAGGAATTAA 1680 

CCGAAGCAAT TCCTAGTCTC TCAGAGGAAT CAACACATTC TTTAGAAAAT CTGGAAGACG 1740 

AAGCTGCTAT TATTGAATTA GCGATGTATT TAACGAATAT TTCAAATGAA CTGAACGCCC 1800 

CTGCCATGGT TCGCATAGCG CGCGAGAATG GCAATATTAT CTTCCATTTA GAAACAGAAA 1860 

AACAAGGTAT CTTAATTGGA AAACACGGAA AAGTGTTAAA TGCGTTACAG TATTTAGCGC 1920 

AAGTCTTTAT TCACCGCATT GCAAGCAATA AGCTATCGAT TGTTGTGAAT GTCGGAAATT 1980 

ATCGTGAAAA ACGTCATGAA ATTTTGGAAC GTTTAGCGAA ACGCACCGCG GAAAAAGCCA 2040 

AACGTACTGG GCGGCCAGTC TTCTTGGAAC CAATGCCGGC TTTTGAAAGA AAACAAATTC 2100 

ATCACACGCT AAGTAAAGAT GAGCAAATCA AAACCCATTC TGAAGGGGAT GAACCTTATC 2160 

GGTATTTAGT GGTTGAACCA GTTAAGAAGT ATTTTTAAAC AAACAAAGAA ACGTTGGCAA 2220 

CTCTCAGCAG GGACTGAATA GTTGCCAACG TTTCTTTGTT AAATGATTTT TTAGCCGTCT 2280 

CCGTTTCTTT TTTTATGTTA TGATAATCAG TAATTACTCC AGTAGTAAAT TCAAATGAAA 2340

AAGTTTTGTA AAGAGACGTA GGGGGAGCAA CAGGTATGCG TACGGAGAAA ATACTGGTTG 2400 

TAGATGATGA TGCAGCAGTT CGTCGCTTGA TTTGGAAAGC ATTACAATCA ACAGGTATTC 2460

TTATTTACCA AAGTGAATCA ATTGAAAAAA CAAAAGATAT CGTGTCCAGA GTTACCTTTG 2520 

ATTTGTTCAT TTTAGATGAT AACTTGGAAC ATGACAGCGA TGGTTACTAC TTAGTCCAAA 2580 

TGATTAGAGA GAAATATCCA CTGGTGCCAA TTATCTTTTT TAGTAGCAAA AAAGAAGAAG 2640

AGGATATCAT TGCAGCCTTG GAAATGGGTG TGGATAGTTA TATAACGAAA CCTTTTTCCT 2700 

TGAATGTTTT TAAGGCTCAG GTGATTGCTT CGTTGAATCG AGCTCGAATG ATTCGAAAAC 2760 

AATTGAATGT TTATAAAAAA GAAGAGCTAC AGGTAGGCGA TTTCCGTTTT GATTATGGTC 2820 

GTTATCAACT ATTTAAAAAG GATTTACCTA TTTCATTGTC TTCGAAAGAA GTTCAATTAA 2880 

TTCAATTTTT TCTGGAGAAT CCCGAGCAAG TATTTTCTAA AGAACAAATT TACAGCAGTG 2940 

TTTGGGGTAC TGGGGATGTC GATGCGAATA CGATTATGGT TTTTATTAAT CGTTTACGAA 3000

GTAAATTAGA AGAAAACCCG AAAGAACCTT GTTACcTTCG GACCGTTTGG GGCATTGGTT 3060

ATACCTTTAC GCCAGATGGT TGCGTACAAA AAAAATGAAA AAAGACCTGC AGXXGCACGA 3120

GCGACTGCAG GTCTTTTTTG CTTATTCAAT TTGCCGACGT TCCATTTCTT TCAAAATAAA 3180

TTGAAGCGAA CGATTTAGAT GTTCATGGGT TAAAAGTGTG ATCTGGTCAA TATTCTTTGT 3240

TTTCATTCCG CGGAAAATTA ACCAATGTTC TTCTAGGGCA ATACTACGAG GTTCTGAGGC 3300

ACGAATCGCA ATATCTCTGA AATAGATTAA GTAAGCATGG AGATCAGTCA CAATTTTTTT 3360

TAAACGTAGC ATCTGACTCT TAGTATAAAT AAAGGAATTA AAATCTGAAA AGTTCTGTAG 3420

TAAGTCATCT ACTTGGTTAT TTTTATTGTA TTGTTCTCCT TCGAGAAGTA AGGCTTCCAA 3480

TTCATCAAAA TCTTCTGGTG TCATGAGTTC CATGGCTTTA ATGGTTGCTA AAGTGTCCAA 3540

AGATTTACGA ATATCATAAA TTTCATAAGC ATCTTTCATA CTGATCCCTT TAACCACGAT 3600

ACCCACCATA GGTATATGTT CCACCAATTG TTCTTTAACT AATTCTTGTA AAGCAAAGCG 3660

AATGGGCGTC CGACTAATGT TTAATTGCTC GGAAAATTCT TTTTCGTTAA TCCGTTCACC 3720

TGCAGGAATA TCCCCTAAAA TAATGGTTTT TTTAAAGGCT TCATAGGTAC AAATTTTTAG 3780

AGGTTTACTG CTAGTTAAAT CTAAGTTCCG TTTTACTGCT TCAACCGTTG GATTCATATG 3840

AGGCACCTCC CTATAGGTTA TTTTTATAGA TAATAAATAT AATATTAAAA ATGAAAATAT 3900

AACGCAAGAA AAGACAAAAT GAAAAGAAAA CCGAGGAGAA AAGGTACTCC TACGGTTTCA 3960

GCGACGACGT TTAATTTTGT GGAATAAATA AAGGCATGTG ACCTAAAGCA ACAATTGTAA 4020

CAACGAAAAT AACGAAAATT CCTAAGGCAT ATTTTGCCGA TTCTTTTTGC CATTGGCCCA 4080

TATCTAAACC AGTTAAACGT AATAGCAAGT AAATAAATGC CACTAATGGA CTTAATAAGT 4140

GGAACGCTTG CCCCATTAAG GAAGCTAAAG CCATAGACAT ATTGTCAAAT CCATAAGAAC 4200

GTCCAGCTTC TGCTAGGACT GGTAAGACAC CGAAATAGAA ACCATCGTTT GAAATAAAGA 4260

ATGTTCCTGG TGCTGAAATG AACGCAATCA CTAAGCCCCA AGCGCCGGCT AATTGTTTTG 4320

GAATAATATT AACAAAGCTA TTTGCTAAGG CATTGGCCAT ACCAGAACCT TGGAATAAGC 4380

CCATGAAGAC GCCAGCAGCA AAAACTAAGA TAACTACTTG CACAGCATCA CCAGCATTTG 4440

CACTGATGCG GCTGCTTTGA TCTTTTAATT TTGGATAGTT AATCATTAAA GCGATACAAG 4500

TACCAACTAG GAACAATAAT AACGGTGGCA TTTCAATCCA CGAAGGCAAA CCTAATTGAC 4560

CAACGAATGA ACTTAGGACT AACCAACCGA TTAAGACAAT CGTTAAAATA CCATTGAACG 4620

CAAAATTCTT TGGACGACGA ATTTCTAATA ATTCTGGGTC GTTAACAATA GTCATTTCTT 4680

CCAATTCTTT ATCAGTAAGC GTTGTCACGC CTAAGCGAGC ACGTTCTTTT TTCCCCATGC 4740

GATAAGCAAC AAAGAAGATA ACATACAATA CAGATAAAAT CATACCAGGT GCTAAATAAG 4800

AAAGAATGTC GGCATCAACT TCTAGTACGG CCATGGCACG AGCTGTTGGA CCGCCCCAAG 4860

GTAATAAGTT CATAATGGTG TTTTGTAAGA TAACAAGCAC ACCTAAATTC ATCAATTTCA 4920

TATTAAGTTT TTTGTAGATG GGAATAAACG CAGAACAGCA AATCAAGGTC GTGGTTGTTC 4980

CATCTCCGTT TAAGGAAACA GCAGCAGCAA CTACCGCAGT AGCCATCAAG ACTTTCATTG 5040

GATCGCCTTT AGCAAAGTAA ATCATTTTTT TAGTGATTGG GTCAAACAAA CCAGCATCCA 5100

ACATAATAGA GAAGTAAAGA ATGGCAAATA ATAACATAAT ACCAGTTTTT GAAGTAGTTG 5160

AAATCCCATC TAAGACGAAA TTTCCAATGT TGACTTCAGG GACAACACCG TCAGCACCAG 5220

GTGGCGGACT CGCTACTCCA GTTGCAATTG CAATCAAAGT AAAAATAAGA GGAATAATTA 5280

CTAAAGATGT AAACGGTGAC AATTTCTTTT TCATGATTAC ATACATAAAG ATGATAATCA 5340

TACCATAAGC TAAAATAGTG AGTAACATGT TTTTCTTCCT TTCTTTTTTA GAACTTTTTT 5400

GTAAGCGTTA ACAACCGGTG TTAATAAAAA AGAATAGCCA TGAGAAATGT TTCACAGTAT 5460

TCTGGTTTAC AACAAAAAAT ATTTTTTGTG TTTGTCTATA GTGTGAGAAT ATACAAACTT 5520

TCGCAGAGAA TAAATAGCCT AAAACAAAAA GAACATCAAA AAAACAATTC AACAATCAAA 5580

TTCTTATTAA TAACTTAATA TAATATTAAT AGAATCTTAA CGGCTGATAA ATGCTTATAT 5640

ACCAATGTTT ACAGAGTTTT TTTCACAAAA TAAACAATAA AAGAACAATC TGAGCTAGTG 5700

TTTTCACTAG GTTTTTTAAC GGTTTTTTCT ATAATGAACG TATACCAAGA ATTAGTGAAA 5760

GGAGAGACAA TATGGGCCTA TTTAAGTCCT TATTCAAGAA AGAGCAACAC ACACAGGATA 5820

CCGAGATTAA TGAAAGCGTT ATCCAACGTG AAGCGGATAA CTTGGAAAAG CGATGGATAG 5880

ATATACCAGC ATATATTCCT GCTGAGCCAA AAGAATATCA ATTGGTAAGT ATTGTGGCCA 5940

CTGCCATCGC AGCAGGAGAG TTTCCAGAAA GTCGTTTTGT TATCAAAAAA ATCGCAAAAC 6000

GAAATCCAGA AGCAAAAGAA GTTGCCATTA TTGCAGCCAG TATAGCTGCA GAGCTTTCAG 6060

AAGAGTCTCA GCTAGTTGTC AAAAGAATTT CAAAACAAAA CCTTGAGGAG GATCATTATG 6120

TTACGCAAAT TTAAAATTTC AATTGATGGG AAAGAATATT TAGTCGAGAT GGAAGAAATC 6180

GGTGGCGTCC CACAACCAGC ACCTGTTGCG CCACAACCAA CAGCGCCAGT TGCTACGACA 6240

GAAACACCAG CCCCTGCTGT TGAAGAAACC CCAGCGTCAg CAGCACAGCC AGCCGCCCCA 6300

GCAGGAGCAG ATGCAATGCC TGCGCCAATG CCAGGAACTG TGTTAAAAGT ATTAGTCAAT 6360

GTTGGTGATA CGGTGTCCGA GAACCAACCA TTGTTAATTT TAGAAGCAAT GAAGATGGAA 6420

AATGAAATCG TTGCAGGAAA AGCAGGTACG GTAACAGGAA TTCATGTTAC CCAAGGCCAA 6480

ATTGTGAATC CAGGGGAACC ATTAATAACA ATTAACTAGA ATTAATTTAA GAAGAGGAAG 6540

TGATTTGTGT GGAGACACTG ATTCAAGGTG TTATAGGAAT GGGCCAAGAA CCAGGACGCA 6600

TTGTGATGAT GCTGATTGGT GGACTATTAA TGTATTTAGG AATAAAAAAA GAGTATGAAC 6660

CAACACTTTT AGTGCCGATG GGTTTAGGAA CGATTTTAGT GAACTTTCCT AATTCCGGCG 6720

TATTGAGCGC TGGCGGCGAG CCAGGTCCGT TTAATGTTCT TTTTGATTTT GGAATAAAAA 6780

CAGAATTATT CCCGTTATTA TTATTTATTG GGATTGGGGC AATGATTGAT TTCGGGCCAT 6840

TGTTACAAAA TCCATTTATG TTGATGTTTG GTGCCGCTGC ACAGTTTGGG ATTTTCTTTA 6900

CAGTTATTAT GGCTGTTTTA TTAGGTTTTG ATTTAAACGA TGCTGCTTCG ATTGGGATTA 6960

TTGGTGCCGC TGATGGCCCA ACGTCTATTT TCGTAGCAAA TACCTTACAT TCAAAATACA 7020

TGGGGGCCAT AATGGTTGCC GCTTACTCCT ACATGGCGTT AGTTCCGATT ATTCAACCAG 7080

TAGCGATTAA AGCAGTGACG ACGAAAGCCG AACGTCGTAT CCGTATGACT TATCGGGCCG 7140

GTGAAGTTTC TCAAACAGCG AAAATTCTTT TCCCGGTTGT TATTACGATT GTCGCAGGGT 7200 

TAATTGCGCC GGTGTCATTA CCATTAGTTG GTTTCTTAAT GTTTGGGAAC TTATTAAGAG 7260 

AATGTGGCGT TTTGGATCGT TTATCCATTA CAGCTCAAAA TGAATTGGTA AATATTATTA 7320 

GTATCTTGCT AGGACTATCT ATTTCCGTTA GCATGAACTA TGCAGAGTTC TTAAAAGTCG 7380 

ATACGTTGAT GGTTATTGGC TTAGGGTTAG TCGCCTTTAT TATGGACTCA GTCGGCGGCG 7440 

TGCTTTTTGC TAAATTCTTG AATTTATTTA GAAAAGAAAA AATCAATCCA ATGATTGGTG 7500 

CTGCGGGTAT TTCCGCTTTC CCAATGTCTA GTCGGGTTAT TCAAAAAATG GCTACGGATG 7560 

AAGACCCACA AAACTTTATT TTAATGCATG CTGCTGGTGC CAACGTTTCA GGACAAATCG 7620

CTTCTGTAAT TGCTGGTGGC TTACTATTAG CCTTTTTAAC ATAGGACGGA AAGGAGAACG 7680

AATACATGTC TGTTAACATG GAAGATTTAA AAATTGCTTT TGAACTGTTA GGTTTTGGTT 7740 

GGGGCGGCGT TTTCGTTGTT TTATTTATCA TTTATCTAGC ATCTAAATTA CTAACAAAAT 7800 

TATTTCCAAT TAAAAAATAA AGGAGGTTAT TGCTGTGAGC AGCGTATATA CAATGAAAAG 7860 

AATCTGGTTG CAAAAAGATA AAAAAGCTTA TCAAGCTTGG CGCGAATTAA TGGAAAAAGC 7920

CGAATTGCAA ACGACAGAAG CACTTGATTA CACCGTTGGC ATATATGATG ACACAGCATT 7980 

AATCGCCTCT GGCTCGCTTT CAGGCAAAAC GATTAAGTGC GTGGCCGTTT GTAAAAAATA 8040 

TCAATCTGAA AATTTGTTAA CCCAATTGGT GGTTCATCTA CTTGAAAAAC TGAGAGAAGA 8100 

AAACCAATTA CACAGTTTTT TGTATACAAA ACCAAAAAAT GAACAAATTT TTCAGTCGCT 8160 

AGGCTTTCAA AAAGTGGTTG CGAATCAAGA GGTGCTATTT ATGGAGCAAG GCAAACCAGA 8220 

TTTTGCCGAT TATCTTGGTT ACCTGACACA GCATAAAAAA GCAGGGCCAG CCAGTAGCAT 8280 

TGTCATGAAT GCAAATCCTT TTACGAAAGG CCATCAGTAC TTAGTCGAAA AAGCAGCCCA 8340

AGAAAGTCCG CACGTTTATG TGTTTGTTTT ATCTGAAGAT AAATCGCTAT TTTCAAAAGA 8400

AGCACGTTTC GCAATGGTGC AAAAAGGGGT CGCCCATTTA CCCAATGTGA CAGTCCTATC 8460

TACAGAAGAT TACTTAGTTT CGTCTGCGAC TTTTCCTACC TATTTTTTGA AAGAAAAAGC 8520 

GCCATTAGAG GTAGCTGCAA TTCAAGCAAC GCTTGATGCA ACGTTATTTA AAGAAAGAAT 8580

TGCGCCAATA CTTGAAATTC AGCAACGTTA TGTGGGAGAA GAGCCTTATT CAGAAGTGAC 8640

CGCTGTCTAC AATCAAGCAA TGCAACAAGT TTTCGGACAA ACGATAACGC TAACCATTGT 8700 

CCCGCGATTA GCGAGTGATG GAGAGCTGAT TAGTGCGACG AAAGTTCGCA AAGCGATGGC 8760

TGAAGGCGAC AAAGAGACAC TCAAGAAGTT TTTACCAGCA ACAAGTTATC AATACTTAGT 8820 

AGAACACAAG AAACTAATGA GGTGAAGAAA GTGAAGATTA TCAAAAATGC TACTGCAGGG 8880 

ACAGTAGAAT CAAGTGATAT TATGATTACC GTCCAACCCA TTGAATCAAC AGAAAATACC 894 0 
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ATTGAATTAG AAAGTAGTGT CGAAAAACAA TTTGGCAATC AAATTCGTCA GGTGATTACG 
ACAACTTTGG AACATTTAGA TGTCCAAGGC GTTGCGGTCA AGGCGATTGA TAAAGGGGCA 
TTAGATTGTA CAATCAAAGC TCGGATGATT ACAGTTTTAC ATCGAGGCGC TGAAAAAGAA 
ACGTATGATT GGAAGGAGTT GGACTCATGG AACGTTTAAG AAGAACAATG ATGTTTGTCC 
CTGGAGCGAA CGCGGCAATG CTAAGAGATG CACCTTTGTA TGGGGCAGAT TCAATCATGT 
TTGATTTAGA AGATGCAGTT TCATTAAAAG AAAAAGACTC TGCCCGTGTT TTAGTCCATT 
CTGCTTTGAA AACTTTCGAT TATGGCAATA TTGAAATTGT TGTTCGGATC AATGCTTTAG 
ATGCTGGTGG TGCAGAAGAT ATTGAAGCAA TGGTCTTAGC TGGTGTCGAT GtGATTCGTT 
TACCAAAmAC AGAAACAGCC CAAGATATCA ■ TTGATGTCGA GGCAGTCATT ACAGAAGTAG 
AACAGCAAAA TGACATTCCC GTAGGAACAA CGAAAATGAT GGCTGCCATC GAATCGGCCG 
AAGgTGTTCT AAATGCGCCC GCGATTGCCA AAAGTTCAAC TCGCTTGATT GGCATCGCGT 
TAGGCGCCGA AGACTATGTT ACCAATATGA AAACACGGCG CCATCCAGAT GGCCAAGAAC 
TATTCTTTGC TCGTAGCATG ATTTTACATG CCGCTCGAGC AGCTGGGATT GCGGCTATTG 
ATACGGTCTA TTCAGATGTA GACAATACGG AAGGCTTTGA AGCCGAAGTT CGTTTGATTA 
AACAGCTAGG TTTCGATGGA AAATCGGTGA TTAACCCTCG TCAAATCCCA TTAGTTAATA 
CTATTTATGC GCCAACAGAA AAAGAAATCC AGAATGCTAA GGAAGTAATC TGGGGCATTC 
GGGAAGCCGA AGCAAAAGGT TCCGGTGTTA TCTCCGTAAA TGGAAAAATG GTTGATAAAC 
CAATTGTTGA ACGAGCTGAA CGGGTCATTG CGTTAgcATT GGCGGCTAAA TTAATTACCG 
AGGAGGAAAT ATAGACATGG TAGTAAATAA AGTAGGCAAA GACATTCCTC AATCTTATGC 
AGAGCAATAC GGCATTTATG AAGGAGAATT AGCACACATT GGTGCTTATC AAGAAGCGAG 
CCGAGCAATT AAACCAGTGA AACCTCGGGA GACAAAACTT TTAGGAAGTA TTCGTGAAGC 
AATTGAAAAA ACAGGGTTGA AAGATGGCAT GACCATCTCT TTCCATCACC ATTTCCGTGA 
AG GAG AC TAT GTGATGAATT TAGTTTTAGC CGAGATTGCC GCCATGGGGC TGAAAAACAT 
TTCGATTGCA CCTAGTTCAA TTGCAAATGT TCATGAACCT CTGATTGAAC ACATTAAAAA 
TGGAGTAGTC" ACCAACATTA CTTCTTCTGG TTTGAGAGAT AAAGTAGGAG CCGCTATTTC 
AGCAGGTATC ATGGAAAATC CTGTGGTGAT TCGTTCACAT GGTGGCCgGG CACGGGCAGT 
AGCAGCAGGC GATATCCATA TTGATGTAGC TTTCTTAGGT GCACCAAGTT CAGACGCTTA 
TGGAAATGCG AATGGAACCA AAGGAAAAGC TACCTGTGGT TCTCTTGGAT ATGCGATGGT 
CGATGCAAAA TATGCGGATC AAGTGGTCAT CATTACTGAC ACATTAGTAC CATATCCTAA 
TACCCCTATC AGTATTCCAC AAACAGATGT TGATTATGTG GTAGAAATTG ATGCAATTGG 
TGATCCAGAT GGAATTGCCA AAGGCGCGAC ACGTTTTACA AAGAATCCTA AAGAATTATT 
GATTGCTGAA TATGCCGCAA AAATAATTAC ACATTCTCCT TACTACAAAG AAGGCTTCTC 
ATTCCAAACA GGAACAGGTG GCTCTTCACT AGCAGTTACG CGGTTCATGA GAGAACAAAT 



9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10600 
10740 
10800 
10860 
10920 
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GCTTAAAGAT GGCATTAAAG 
ACTTTTGGAA . GAAGGCTTAG 
AGCTATTTCT TTAGGGGAAA 
ACCGCTTAGC AAAGGGGCGG 
AGTGGACACA GATTTTAATG 
TTCTGGCGGA CATTCCGATA 
TGTTCGTGGA CGGATTCCAA 
GAGTGTCGAT GTCGTAGTAA 
AATTGATTGT TTTAAAACCT 
AGCCTACAAC ATTGTTGGCA 
AATTGAATAC CGAGATGGTA 
TAACACTTTT AGAGATGTTG 
TAGAAACAGC ACCAGAAGCT 
AAAATTCGCC CACCTTAACG 
TGCTGGATCA AGTACCGATT 
ATTTGGCGGT TTCGTTAGCC 
CACATCCTTA TGGTCGTTTA 
GCCTTCATCG AGGAGATTTA 
CGAAAGTCTG TGGACGAAAT 
AAATAATTTT GACACGAAAG 
GTGCTACGGG ATGGACAACA 
CCGATTATTA AAACAATGGA 
ACGTTTGATT CCTGTGTCCG 
CGCAAAGAAG TTAAAAATAC 
TATCGTCATT ACGCTGATGA 
ATTGATATTA TTCGTATTTT 
CAAACAACAA AAGAAGCTGG 
ATTCATACCA TCGATTACTT 
TCCATTTGTA TTAAGGATAT 
AGCAAAATGA AAGATGCCAT 
ATTTCTGAGA TGACCTATTT 
ATCTCTTCTT TCGCTGGCGG 
GAAGATTTAG GCTTTGAAAC 
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CAAGTTTCGC TT.TAGGCGGA ATTACGAATG CAATGGTGGA 10980 

TAGAAAAAAT TATJGACGTC CAAGACTTTG ATCATCCATC 11040

ATGCCAATCA TTATGAAATT GATGCCAATA TGTATGCCTC 11100 

TTATTAATCA ATTAGACACA GCTATTTTAT CTGCTTTAGA 11160 

TAAATGTCAT TACTGGTTCA GATGGTGTCA TCCGTGGTGC 11220 

CAAGTATGGC TTGTAAAATG AGTCTAGTGA TTGCACCACT 11280 

CAATCGTTGA GCAAGTAAAT ACAGTCGTTA CACCAGGTAC 11340 

CAGAAGTCGG CATTGCAATC AATCCAAAAA GAACAGACTT 11400

TAGATGTTCC TCAGTTTACG TTAGAAGAGT TGAAAGACAA 11460

CCCCGGAACC AATTAAATAT GGAGACAAAG TCGTTGCGTT 11520 

GTCTAATTGA TGTGGTACGC AATGTTTAAT GGGGAAAAAG 11580 

GATGCTCGCG AACAAAGAGC TGCCACACAG AAGGAGTTGT 11640 

AGTCTCTTAT CTGCCACAAT GAATATTCCT GGAGAAGTAA 11700 

GCTGTTTTTT TAGAAGTAAT TGACGAAGTC GAGCAACAAT 11760 

GTTAATTTTT ACCGGAATGA AAAAACAGGT CCTGAATATT 11820 

CCACAAGAAT TAAAACAAAG AATGGTCAAA ATTGAAGAAA 11880 

GTGGACCTAG ATGTTTTATG GGGAAATGAG GAGCTAAAAA 11940 

GGCTTGCCAC CACGGCGCTG TTTTATTTGC CAAGAAGTGG 12000 

CGACGTCACA GTCTTGAAGC AATGCAAGAA AAAATAACAG 12060 

GAGCAGCCAA GTGAGTAAAA AAATTCGTTT TACAGAAACG 12120 

AAGTTTAATT GCAACACGGA TGCCAACCTC AGATATGCTG 12180 

CGAGGCAGGA TTTCATGCAT TAGAAATGTG GGGCGGGGCA 12240

CTATTTAAAC GAGGATCCTT GGGAACGTCT TCGCCAAATT 12300 

CAAACTTCAA ATGTTATTAC GAGGACAAAA TTTACTAGGA 12360 

TGTGGTGCGG GCCTTTGTCG AAAAATCAGT TGAAAATGGG 12420 

CGATGCCTTG AATGATGTCC GTAATTTACA AACAGCAATT 12480 

CGGACACTGT CAAGCAGCGA TTTCTTATAC TACGAGTGAA 12540

TGTAAAATTA GCCAAAGAAT TAAGTCAAAC AGGAGCGGAT 12600 

GGCAGGCGTT TTGACGCCAC AAACTGGTTT TGAATTAGTG 12660 

TGATTTACCT TTAGAGGTGC ATACGCATGC GACTAGTGGT 12720 

AAAAGTTGCT GAAGCAGGTG CGGATATCAT CGATACAGCT 12780 

TACGAGTCAA CCAGCCACAG AATCTGTAQC CATTGCGTTA 12840

AGGTTTAAAT ATGGAAAAAG TAACAGAGAT TGCTGACTAT 12900 
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TTCAATCCGA TTCGTGATCG 
ACGGAACCGA AAACCTTGAT 
CAATTAACGG AGCAAGGTTT 
GTAAGAGCAG ATTTAGGGTA 
CAAGCCTTAA TGAATGTGAT 
GATTATGTGC GCGGAGAATA 
AAAATCATTG GGGATGAAGA
TTGCATAGTT TAGAAAAGGA 
TATGCGATGT TCCCGCAACA 
GATGTACCGT TACAGAAAGT 
ctCCGCAAAT GTATAAAGAA 
CTTCATCAAA TrGAAAAAAG 
CTAGCAATTC ATCAAAAAAA 
AACCATCATG ATTTAGCCAA 
GCCCAAAATC ATGAATTAGC 
ACAGATGGAT CAGCTGTTCT 
GTTGAAGGCA AAGCTTTACT 
CTGGAACAAA AATCAGTGGA 
GCAGGAATCC ATTTAGAAGA 
CAACAACGTT TGAACATTCC 
TTAGCTGGGC TAATCAACGC 
GTGATTAACG GAGTTGGTGC 
ATTACGCATC TCACCTTAGT 
AATCCTTACC AACGTGCTTT 
TTAGCCACCG CTGTCGTCAA 
ACGCCAGCCC TGATTAAATC 
AAACCAGAAA TTGAACCAGC 
TCTTCCAAAT ATCCGAATCA 
TTATTGGCAG CGAAAGGCAG 
TTGGCCGCTA TGATTTCAGA 
GGCGTAGTTG ACACGGTTTT
GCAGAAGAAG GGACGTAAAA 
GCACAATTGG CACTACTCTA 




CTTTAGAAGT GAAGGCATTT TAAACCCTAA AGTTAAAGAT 12960 

TTATCAAGTG CCTGGTGGTA TGTTATCAAA .CITAT.TAAGI 13020 

ACAAGACAAA TATGAAGAAG TTTTAGCTGA AGTACCTAAA 13080 

TCCACCACTA GTTACGCCGC TTTCACAAAT GGTTGGAACA 13140 

TTCTGGGGAA AGATATTCCT TAATACCGAA TGAAATCAAA 13200

TGGCAAATCA CCAGCACCTA TCGCAGAAGA GATTAAGCAA 13260 

GGTAATCACT TGTCGACCAG CGGATTTATT GAAACCAGAG 13320 

AATCCAACAA TATGCAAAAT CTGAAGAGGA TGTCCTATTA 13380 

AGGAAAAGAC TTTTTAGGAC GTCGTGAAGA TCCTTTTTAC 13440

GACAGTCAAA CTTGACTTGC CAGAATAGAA CGGGCAAGTG 13500 

GCAAGGGTTT GAGCGACGAT AACGGTTGTT CAAACCCTTG 13560 

GCAGCCCGTG AAAGGAAGTG ACAATGTGTT AGAAGAAGTT 13620 

TATTGGTGTG CTAAGTCTTG AAGCAACAGA AGCGGTGTTA 13680 

AATGTATACA CCTGGTGTCG CAGAATTGAG TTTAATGATT 13740 

GCGCGAGTGG ACCATTAGTG GTAAGTTAAT TGCGGTTATT 13800 

GGGTCTCGGC AATATGGGCA CGCAAGCGGG CTTGCCAATT 13860

TTATAAAAAT TTGGCGGGGG TCGATGCTAT TCCTTTAGCA 13920 

CGAGATGGTT CAAACAATTG AAAATTTACA AAATAGTTTT 13980 

TATTTCTGCA CCAAAATGTT TTGAAATTGA AGAAAAACTT 14040

TGTGTATCAT GATGACCAAG AAGGCACAGC CATTGTCGTT 14100 

AGCCAAAATT AAAGGAAAAC CTTTAAATGA ATTGCGTGTG 14160 

TTCTGGTGTA GCGACCGCTA AGTTATGTAT TCAAGCGGGA 14220

AGATCGCCAA GGTGTTTTAA GAGAAGAAGA CCCGACATTG 14280 

GCTTCGACAG GTAATAAAAC CATCAGTAGA AAACAAAGAT 14340

TCAGGATGTT TTTTTAGGCT TATCAGAAGC GGATGTGctA 14400

AATGAACCAA GACCCAATTA TCTTTGCTTT AGCAAATCCT 14460

CCTCGCCCAA GCCAATGGCG TCCGTTTATT AGCAACTGGT 14520

AGTGAACAAC ATTTTAGTTT TTCCAGGTCT TTTTAAAGGG 14580

AAAAGTCGAT GTCGGTTTGG AAATGACAGT AGCACGAAGC 14640

ACCCACAGTA GAAAAATTCA TACCTAATGT ATTTGATGGC 14700 

TAATGCTGTA CTAGACTATA TAAAACAAGA AAAGACCACG 14760

GGCAATGACA AAGCAAGCAA GGAGTAAATT AGCGCAgcAA 14820 

TGAAGTGACC TGTTTAAATA AACCAGGCTT GGTGGATCCA 14880
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GTCGATGCAG GCGCCCATCA AGATATGGAT GTTTTCACTT TTTTAGAAAG TAGTGTGGTT 14 940 

TTGACTCCTT ATTTTGAAAC CTTCGTTCAA GCTGGGACAG AGCTACGTCA TTTGCCGATT 15000 

GAGGCAACAT TTCGCACGAT TCGACAAGTT GGTTTAGAAG CAGAACGAGC GATGTTTGAG 15060 

GCAACAGAAG GTATCAATAC ACATAAAGGA GCAATTTTTT CTCTAGGTAT TTTTTTAGCT 15120 

GTCTGTGGTC GTCTAATGAT TTGGGAGACC CCTTGCACCT TACAATTGTT TCAACAAACG 15180 

CTTCAAACAA TGACGCAAGA CTTATTGGCT GACTTTGAAA ACCTCGCTAC GAATGAGCTA 15240

GAAAAACTAA CTTGGGGTGA ACGACTATTT GTTCAGCATG GCTTGACTGG AATCCGTGGT 15300 

GAGGCGCAAC AAGGTTATCC CGCTGTCTTT GAACAGGGAG TGCCTTATTA TCAAAAACAT 15360 

CAAGGCACAC AACAACAAAA ACTCATTGAT ACGCTCCTTT ATTTATCGCT CTATGTTGAG 15420

GATACGAATT TGATTAAACG AAGTCAAAAT GTCCAAATTT TTGAAGTATA TCAACCGTTA 15480

ATCCAGGAAT ATTTTGCTTT AGGGGGGACA CAAACGTTAC GGGGAACCCT TTGCTTAGAA 15540

AGATTGAATC ACTTATTCAA AGAAAAAAAT TGGAGTATTG GTGGCAGTGC AGATGCTTTA 15600 

ATTTTGGTGG TGTTTTTAGA TAAACTAATG AAATTAAATT GGCTGACGCC TTAATTAAAA 15660 

AGATAGTTAC AACGCTGATT TGGTCAGTTA TTTTGACCTT ATATGCCTAT GGAAAGGTCT 15720

CAAGAGAATT CAAGAAAGCG ATTGACAAAT ACCACGAAAC ACAGTATATT TTATTACGAA 15780 

ATCATTCTTA TCAGAGTATA CAAATACGAT GATAAACAGT TGAAGCGAAT CTTGATTTGC 15840

TTTTCCTAAA CCTGATTCTT TTAGAATCAG GTTTTTGTTT ATCTAAAAGG CGGCATTCGT 15900 

TCAAAAACGG CTGTCGCCTT TTTTTATGAA ATTTTTTTAA TATAAAAAAT TTCATAAAAA 15960 

AATCTTAAAA ACAGAAAAAA TTGTTATAGA TTAGAGTCAT TTTTTTGATG TAAAGTTTTT 16020 

ATAACTTATG TCTATAAAAT TGATAAAAAC AGTCAAAGTA GTAGAAAAAA TAATAGTGGA 16080 

ATTTCTCAAA AAAATAAAAA AATTTTCCGT TATTTAATTA TCGATTTATC AAACGATAAG 16140 

TTATAATAAA ATAATTTATT TATCTTTAAT GATCGGGGAG GAACGTATAA ACTTCCTCaT 16200 

TTTCCTCTCA TTGTATAATA ACTCACTAAT GAATTAGTGA GTTTTTTAAC ATATCCAGGA 16260

TGCTTGTTAT CAAATGAATT TTTATATAAA AATTCACAGC ATTATATTTT GAAAAGCTGA 16320 

AGAAATGTAC TAAACTGCAG GTGTTAAATC CATTTAAATC GCTAAAGGGG GAGATGAACA 16380 

ACTAGGGGAA GCTAATGATC TTGGTATTTA TCGTTTATTT TAAAGAAAAG AGGGACGATC 16440 

AGATGAAAAA GAAAATTGTT GAGGATTTTA ATCGGAAAAG TCAGCATAAA AAATGGACAA 16500 

AACGCAAGAT GCTTAATTTA GCAATATCAA GTGGTTTATT ATTTACGTCA TTAGCAATCC 16560 

CTGTAAGTAT AGCTGTTACC TCTGGCACAA TCAGTGCATC AGCAGCGGTC TTGGATATCG 16620 

AACTATTATC AAATGTTACG TCAAATAATG ACAGTGGCAC TTCAACGAGT AATCGTTGGA 16680 

CAGCCGCAAA CCAAAATCAA CCAGTTAATT TCACGGTTTC TGGTGGCGCT TTAGCAGATG 16740 

CTTCCGCTGT GTTTAGTGGA CAAAAACAAG CGGTGTTAGT GGTTCCTCCT GAGTTAAGAG 16800 

GAAATGTAGC TGCAGCAGGC AGCGCAgcAA TCAATACCAA TGTCACGATT GATCTTTCAA 16860
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AAGTTACTTT TTTGACTGCC GTTTTGAATG C^GCCAATGA TTTAACCAAT GTGATTACTC 16920 

AAATTACCAG TGGGGCGTTA GGGAATTTAA CTGGTGTTGA TATTGATTTG ACGGAAGTGA 16980 

ATCGTCAATT GGAATTAGTT AATAACATTG AAAACTTAGG TGCTGCTTCA TTTACAGCTC 17040

CGGAAACGTT AGCAGCTGAC GGCTCATACA TTAGTGCACC GATTAGTGAT GGTTTAGGGT 17100

TAGTTTTAGC CCAAAATGTT TCAAACATCT TACAAGATTT GAATGCGGCA GTTCAAGCTT 17160 

TGGAGGCAAA AGGTACCAGT ATCCCAAGTA ATCTTGTCGC CGCAGCTATA AATGCAGCCT 17220 

TGCTTCCTGT CAAAGGCACG GTAAACGTGG CTGTTTCAGG TGCTTTGCCT TTATTAGCGG 17280 

TTGGTGGTTC AGGCGTAAAT GAGTTAGTGG ATGCTTCTTT ACTAGGCACA ACCACGGTTA 17340

CTTTACCAAC TACCGTTTCA ACACCTCAAA ATTTATCCAA TAATTTAGAT GCTCGTTTTG 17400

TAGGAACAGT CGTTCAAACA GATCTTTTAG ACGTTAATTT ATTAGCAACA GCAGACGGTG 17460

TATCCAACAT TTATTTTGCT GCAGGCACTA CTAGTGAAGT AACCGCACCA ACAATCACAG 17520 

GAGTAACAGG TAATTCAACA GCAGGTTACG AAGTTAAAGG AACTGCCGAT GCCAATGCCA 17580 

CGGTTGAAAT CCGAAATGCA GGAGGCACCG TAATAGGCAC AGGTACCGCT GATGGGACAG 17640

GAGCGTTTAC AGTTACCGTT CCCGCAGgTG AAGCAGGCGC CAATGAAACG TTAACCGCCG 17700 

TAGCGAAAAA CGCCAGCGGy ACAGAAAGyA CGCCAACAAC GTTCCAAACr CCAGCGGATG 17760 

AAGCAACCGT AACCGCACCA ACAATCACAG GAGTGACAGG TAATTCAACG GCAGGTTACG 17820 

AAGTTAAAGG AACTGCCGAT GCCAATGCCA CGGTTGAAAT CCGAAATGCA GGAGGCACCG 17880 

TAATAGGCAC AGGTACCGCT GATGGGACAG GAGCGTTTAC AGTTACCGTT CCCGCAgGTG 17940 

AAGCAGGTGC CAATGAAACG TTAACCGCCG TAGCGAAAAA CGCCAGCGGC ACAGAAAGTA 18000 

CGCCAACAAC GTTCCAAACA CCAGCGGATG AAGCAACCGT AACCGCACCA ACAATCACAG 18060 

GAGTGACAGG TAATTCAACA GCAGGTTACG AAGTTAAAGG AACTGCCGAT GCCAATGCCA 18120 

CGGTTGAGAT CCGAAATGCA GGAGGTGCCG TGATAGGTAC AGGTACTGCT GATGGGACAG 18180 

GGGCATTTAC AGTTACCATT CCCGCAGgTG AAGCAGGTGC GAATGAAACG TTAACCGCCG 18240

TAGCGAAAAA CGCCAGCGGT ACAGAAAGTA CGCCAACAAC GTTCCAAACG CCAGCGGATC 18300 

CTAATACGCC CGTGGCGACG CCAATTGTTG AGACTGTAAC AGGTAGTACA ACAAAAGGCT 18360 

ATGAGGTCAA AGGGACTGCT GAAGTTGGCA CCACCATTGA GGTTCGCGAT GCAGCTGGCA 18420 

CGGTCCTTGG TACTGCAACA ACTGGAACTG ACGGAAAATA TACAGTGACT TTAGATTCAG 18480

GAACAGCAAC AGCAAATCAA ACGCTGAGCG TTGTAGCGAA AAACGCTAGT GGCACGGAAA 18540

GTCAACCAGC AACGGCGACA ACACCAGCTG ATGTCACTGC ACCAACAGTT GATAACATCA 18600

CAGGCAACTC TGGTTCGGGT TATGAAATTA CAGGAACAGC AGACCCTAAC ACAACAATCG 18660

AAGTTCGTGA TCCATCTGGG GCAGTCATTG GTACAGGTAC CTCTGATGCG AATGGTGATT 18720 

TTACTGTAAC GCTACCAACG GGAACGACCA ATCCTGGGGA TACGTTAACA GTGATTGGAA 18780 

AGGATAACGC GGGAAATGAA AGTCAACCGA CTGAAGTCCT TGTTCCTGCT GATGCCACGG 18840
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TTACAGCACC 


AACTGTAACA 


GbAb I AAbAb 1 


b 1 AA 1 I b Ab 1 


tp.ptp,p.ttat PAnp,TP,APAG 


18900 


GCACCGCTGA 


TCCGAATGCT 


ACCATLbAAA 


i 1 Lb 1 bAibb 


APATP.fZP.AAP P.TP.ATTP,PAA 


18960


CAGGGACTGC 


CGATGGGACT 


GbT 1 Lb I i I b 


b 1 b I bAAbb 1 


TrranrT^r; app^ppaaatp* 


19020 


CGAATGAAAC 


ATTGACAGCG 


TTAgCCAAAG 


ATCCTbbibb 


rTA TAT7APAAPT APAPPP.APA A 




CCTTCCAAAC 


ACCAGCAGAT 


GAAGTAGTGG 


CACCGLbAAb 


TPTrrRPTiJin /**TT7APTPPPA 
IblbbAbAAA b 1 i Ab 1 bbbA 




ATACAACACA 


AGGATATCAA 


GTGACAGGTA 


CCGCTGAAC I 


r VrT*H , Tif~ t CT±CC 7ATTf^TA TAPTTf"' 

IbbbAbbAbb Ai l«hHvjl 1L 




GTGCAACAGA 


CGGAACAGTT 


TTAGGCACCG 


CAACAACTGG 


AbbbAClbbb bAA-LAlAblb 




TGACGTTAGC 


TTCAGGAAAA 


GCAACAGCTA 


AACAAACAGT 


^ ta ta m m tv f~* rp m /■"< rri ta 7\ TV T\ 7\ T 1 f** 

GAATGTAGTI bb 1 AAAAA 1 b 




ATACTGGACT 


TGAGAGTCAA 


CCAACTACAG 


CTATGACACC 


n m ta ^ ^ ^>^t TV ta ta TV TV 

CGCTGATGTT AbbAbAbbAA 




CAATTGGTGA 


CATTACTGGA 


GATTCAACAA 


CTGGTTATGA 


AATCACTGGG ACbbbbbAbb 




CTAATACCAC 


CATTGAAGTA 


CGGAACCCAG 


ATGGAACAAT 


TATTGb I AbA AbbAbAAbbb 




ATGATCAAGG 


AAACTTTACT 


GTGGACCTTC 


CAGCGGGAGC 


bbblAAibbi bblbAlAbAI 


19560 


TAACAGTTGT 


TGGAAAAGAC 


GGTGACGGCA 


ATGAAAGTCA 


AbbAAbbbAA b 1 bAbbb 1 


19620


CTGAAGATGC 


AACCGTAGCA 


GCACCAACTG 


TGACGACTGT 


TACAGGAACA AblbbbAblb 


19680


GGTATCAAGT 


AACCGGCACG 


GCAGAGCCAA 


ATGTCACCAT 


m t\ TA m m TV TV TA Tl ^ TV 7\ f~* 7\ f"* 

TGAGATTCAC AAibAAbbAb 


1 Q"7 4 fi 

JL y / fi U 


GTTTAGTTAT 


TGCTACGGGA 


ACGACTGATG 


GTGCTGGCGC 


ATTTACAATC ACTCTTCCGA 




CGGGCACAGC 


AACAGCTAAC 


GAAGCCTTAA 


CTGCCATTGC 


GAAAGATGCT GCTbbbAAAb 




AAAGTAATCC 


GACTGCTTTC 


AAAACACCTG 


CTGATCCAGA 


TGCACCAGTC GbbALAbb i A 




CTGTTGACAA 


AATCACTGGT 


AGCACGACAA 


ACGGCTATCA 


AGTAGTAbbA bbAbbAbAAb 


J. «? O *J 


TTGGTACAAC 


AGTTGAGGTG 


CGTGACGCCG 


ATGGCACAGT 


CCTTGGbAl b bbAAb i Ab 1 b 


20040


GAACTGATGG 


CAAATACACA 


GTGACTTTAG 


AGCCAGGGAA 


GGCCTCAbbI AAbbAAAbAA 


20100


TAACTGTCGT 


AGCGAAAAAT 


GCAACAGGAA 


AAGAAAGTCA 


GbbAbblAbA bbAAb lAbAb 


20160 


CAGTCGACTT 


AGCCACACCA 


ACCATTGATT 


CTATTACCGG 


AAAT I b 1 Ab J. AAAbb 1 1 Abb 


20220 


AAATCACTGG 


AACGGCGGAG 


CCAAAAACCA 


CTATTGATGT 


bbb 1 bAbbbA bAbbbAAbL-A 


20280


TCATTGCTGC 


TACAACTGCT 


AACGAAACCG 


GCCAATATAC 


bblbAbiblA Otnu^lvjuvylj 


20340 


TAGTGACACC 


AGGAGAAACG 


ATTACGATTA 


m ^i TA Z" 1 ^1 TV TA 7A 7\ 

TTAGCAAAbA 


r rr*r s rT*/~ > T\<~r* r P TA 7ATP7A TA TAP. TP 
1 bbbbbAbu 1 r\r\. 1 ort-rtrto 1 v_ 


20400


AACCAGCTAC 


AGCCGTTATT 


CCAGCGGATG 


TTb III 1 Abb 


r"PPPPP7A TAPT STTAmflAfin 


20460


TTGAAGGAAA 


CAAAGCCAAT 


GGCTATACAG 


TCAbTGGAAL 


rpr , r"vr* tv tpp ta ta taTPTPAPPP 
IbbibAibbA nnlolLnvjbvj 


20520 


TTCAATTTTA 


CAATAGCAGT 


GAACAATTAT 


TGGCAAGTGG 


CAATACAACT ACTGGAGGTA 


20580 


CCTTCTCCGT 


TCATATTGCA 


GCAGGGTTAG 


CAACAGAAAA 


AGAAACGTTA ACCGCACTAA


20640 


CCACAGATAC 


ACAAGGAAAT 


GTGAGTCCTA 


AAACCACATT 


TATGACGCCA GCCGATATTA 


20700 


CGGGAGAACC 


AGAGATTAAA


ATTGCGGCAC


CAACTGTTTC 


TTCAGTTTTA GGAACGTCTA


20760 


AAGCCGGCTA 


CCTCATCAAA


GGAACAGCTG


AACCAAACCG


AATCATTCAA ATTAGTAACC


20820 
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GACTATTAAG AAGTGTGATT GCTGTAGGTG 
AATTAACAGC GGGACAAGCG ACTGCTCAAC 
CAGGACATTA CAGTACGGCT ACAACCTTCA 
GAGGCAATGG TAACACTGGC GGAAATAACG 
GCGCAACTGG CGGGAATAAT GGGAATGGTT 
CTGGTTTAGG CACAACAGGT TCTGGCTTAG 
GTAGTGGCTA CCACCCTAAA CTAAGTACCA 
CAGGCTACTT ACCTAGCACA GGTGAAAAAG 
GCGCCTTTGT CGCACTCCTT GCGAGCATGG 
TCTCTCCTTT AATTCCTATA GATGAGAAAA 
ACGTTTTCTC GTTCCATTTA ATGAAGAAAA 
ATTTTCAGAA AATCGAAGGA ACAAGCAGTG 
TGCAGGAGGA GTGGGCGCTT TTTTTGTTGT 
GCTACAATGG AATTTGATAC AATTGCGGCG 
AGTATTGTTC GTTTAAGTGG CGAGCAAGCA 
GGCACTAAAG ATTTAGCTAA AGTCCCCACG 
CCGCAAAACG ATCAATTGAT TGATGAAGTC 
TTTACACGAG AAGACGTGGT AGAAATTAAC 
TTATTGCAAT TACTGTTACG TGAAGGCGCA 
CGTGCTTTTC TAAATGGCCG AATGGATCTT 
CGTGCCAAAA CGGATAAAGC GATGAATGTT 
ACGTTAATTC GTTCATTAAG ACAAGAAATT 
ATCGATTATC CAGAGTATGA TGATGTCGAA 
GCCGAATTTG TGAAAGCGCA GATTCAACAA 
TTGCGAGAAG GCCTAAGTAC AGCAATTATT 
TTAAACCATC TGTTACGAGA AGAAAAAGCT 
GATGTCATTG AAGAATACGT TAACGTTCGC 
GGGATTCGCG AAACAGAAGA CATTGTGGAA 
TTAGCAGATT CGGATTTAAT TTTGTTAGTC 
GATCGTCAAC TGTTGGAAGC AACGAAAGGC 
GACTTACCGA CAAAACTGGA TCCTAACGAA
TTATCCGTTT CGGTGTTGTC AAACACAGGC 
CTTTTCTTTG GCGGCCAAAC AGGCGAAAAA 
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CCACCGATGC TGAAGGCAAC TTCGCTATCC 20880 

AAAGTTTACT TGCGACAGCT ACCGATGGCG 20940

TGACGCCAGC CGACCCAACG AATCCTGGAG 21000 

GCAATACAGG CGGCAATACA GGAAACAATG 21060 

CAAACACAGG TTCAAATCCA AATGGAGGTT 21120 

GTTCACTAGG CAATGGCCTC GGTACAAATG 21180 

TCAGTTATGG CACTGGAAAT CACGGGAAAA 21240 

AGTCTTCAGC CGTGACAACA AGTTTGTTTG 21300 

GAATCATCAA ACGCAAACGT AAAAACTAGC 21360 

CGTATTGTTG AGAATTCCCC ACTCAATGAT 21420 

ATTGCCAAAT AGAATAAAAT GTGCTATTCT 21480 

AAAGTGCTAA ATCCAACCTG AATTTTTTAA 21540 

TATGAAGAAA GGAAGCTAAT TATGCAATCA 21600 

ATTTCTACGC CGCCAGGTGA AGGTGCGATT 21660 

GTTGCTATTG CAAACAAAGT TTATCGGTCA 21720 

CACACGATTC ATTATGGTCA TATTGTTGAT 21780 

ATGCTTTCTG TGATGCGGGC ACCGAAAACA 21840 

TGTCATGGCG GCATTGTCGT GGTGAACCAA 21900 

CGAATGGCCG AACCTGGGGA ATTTACAAAA 21960 

TCACAAGCAG AAGCAGTGAT GGACTTAATC 22020 

GCATTGAACC AATTGGATGG TAACTTGTCG 22080 

TTAAATACTT TAGCACAAGT AGAAGTTAAT 22140 

GAATTAACCA CGAAATTACT TTTAGAAAAA 22200 

TTATTAACGA CCGCTAAACA AGGAAAAATT 22260 

GGCCGACCAA ATGTCGGGAA ATCCAGTTTG 22320 

ATCGTTACAG ATATTGCTGG AACGACACGT 22380 

GGCGTTCCTT TAAAATTAAT TGATACAGCG 22440

CGAATTGGTG TCGAACGTAG CCGCAAAGCG 22500 

TTAAATCAAA GCGAAGAATT GACAGAGGAA 22560 

TTAAAACGCG TTATTCTTTT AAATAAAATG 22620 

TTACAAGAAT TAGTCCCTGC AGAAGAAATT 22680 

TTAGACCAGT TAGAAGCGAA AATTGCTGAT 22740

GATGCTACCT ATATTTCTAA TACACGTCAT 22800 
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ATTGCTTTGC 


TGGATCAAGC 


AGCGCTTTCG 


TTACAAGAAG 


TGATTAACGG 


CATCGAAGCA 


22860 


GGTATGCCAG TCGATTTAGT


TCAAATTGAT 


ATGACCCGTT 


GTTGGGATTA 


TTTAGGGGAA 


22920 


ATCGTCGGCG 


ATAGTGTCCA 


AGATGAATTA 


ATTACACAGT


TATTTAGTCA 


ATTTTGTTTA 


22980 


GGAAAATAAG 


ACAAGAGAGG 


AATTTTTTAT 


GAATCAATAT 


CAAGCAGAAT 


CATATGATGT 


23040 


TATTGTCGTC 


GGTGCTGGAC 


ATGCTGGCTC 


TGAAGCAGCC 


TTAGCAGCCG 


CACGAATGGG 


23100 


CGTTAAAACG 


TTACTTTTAA 


CGATTAATTT 


AGATATGGTT 


GCTTTTATGC 


CATGTAACCC 


23160 


TTCTGTTGGT 


GGACCTGCCA 


AAGGAGTCGT 


TGTCCGAGAA 


ATCGATGCAT 


TAGGCGGCGA 


23220 


AATGGGTAAA 


AACATTGATA 


AAACCTACAT 


CCAAATGCGG 


ATGCTAAATA 


CTGGTAAAGG 


23280 


TCCAGCAGTG 


CGTGCGTTGC 


GCGCGCAAGC 


TGACAAACAT 


GCCTATGCGA 


CAGAAATGAA 


23340 


ACATACCATT 


GAAAAAGAAG 


AAAATTTAAC 


CCTTCGTCAA 


GGGATTGTTG 


AAGAATTAAT 


23400 


CGTGGAAGAT 


GGCGTTTGTC 


GTGGTGTTGT 


AACTTCAACA 


GGTGCCGCAT 


ACCGTAGTCA 


23460 


AGCAGTGGTG 


ATTACTGCCG 


GTACTGCCTT 


GCGTGGCGAA 


ATTATTATTG


GTGAATTAAA 


23520 


ATATTCATCA 


GGACCAAACA 


ATTCGCAACC 


TTCCGTTGGT 


TTAGCCAATC 


ATTTAAAAGA 


23580 


ATTAGGTTTA 


GAAATCGATC 


GCTTTAAAAC 


AGGAACGCCG 


CCACGAGTAA 


AATCAAGTAC 


23640 


GATTGATTAT


TCTGTGACAG 


AGGAACAACC 


AGGCGATAAA 


GAACCCAACC 


ACTTTAGTTA 


23700 


CAGCACGCCA 


GATAGTGCCT 


ACAATCAAAA 


CCAAGAGCCT 


TGTTGGTTAA 


CTTATACGAA 


23760 


TGAAACAACG 


CATGAAATTA 


TTCAAAAAAA 


TCTACACCGC 


GCCCCAATGT 


TTACTGGGAT 


23820 


TGTGGAAGGT 


GTGGGCGCTC 


GGTATTGTCC 


ATCCATCGAG 


GACAAAATTG 


TTCGTTTTGC 


23880 


AGACAAACCA 


AGACACCAAT 


TGTTCTTGGA 


ACCAGAAGGG 


TTGAATACCG 


AAGAAGTTTA 


23940 


CGTCCAAGGC 


TTATCTACTT 


CTTTACCAGA 


AGATGTCCAA 


ACAGAAATGC 


TTCATTCTAT 


24000 


TGAAGGATTA 


GAAAAAGTAG 


AAATGATGCG 


TACAGGTTAC 


GCCATTGAAT 


ACGATGTGGT 


24060 


AGTCCCTCAC 


CAATTACGCC 


CAACATTAGA 


AACAAAGGTC 


ATTGAAAATC 


TTTATACAGC 


24120 


AGGTCAAACG 


AACGGAACTA 


GTGGCTACGA 


AGAAGCTGCG 


GGTCAAGGCT 


TGATGGCAGG 


24180 


AATTAATGCT 


GCATTGAAAA 


TTCAAGGCAA 


AGAGCCTCTT 


GTTTTAAAAC 


GTAGTGATGG 


24240 


CTATATTGGC 


GTCATGATTG 


ACGATTTAGT 


TACAAAAGGC 


ACTAACGAAC 


CTTATCGCTT 


24300 


ATTAACGTCA 


CGTGCGGAGT 


ATCGTTTGAT 


TTTACGACAT 


GACAATGCGG 


ACTTACGTTT 


24360 


AACAGAAATG 


GGTCATGAAA 


TTGGCTTAGT 


CAAAGAAGAA 


CAATATGCCG 


CCTATCTAGT 


24420 


CAAAAAAGCG 


GCTGTTGAAG 


CGGAAATTGC 


CCGCTTAGGC 


AAACATCGGA 


TTAAACCAAC 


24480 


CAAAGAAGTG 


CAAGCATTTT 


TGGAAACTAA 


AGGCGCTGCT 


GGCTTAAAAG 


ATGGTATTTT 


24540 


AGCTAGAGAT 


TTTCTGAAAC 


GCCCAGAAAT 


CAGTTATCAA 


GAAGTCGCTC 


AATTTATTCC 


24600 


CGCACCTGAA GAGGCGCTCG 


ATCCGAAAGT 


AATTGAACAA 


GTAGAAATTC 


AAATCAAGTA 


24660 


CGAAGGCTAC 


ATTAAAAAAG 


CCATGGAAAA 


AGTTGAGAAA 


CTAAAACGCA 


TGGAAGCAAA 


24720 


ACGCATTCCA 


GAAAACATTG 


ATTACCAAGC 


GATTAACGGC 


TTGGCGACAG 


AAGCAAAACA 


24780 
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AAAACTACAA 


AAAATCCAAC 


CAGAAACGAT 


TGCCCAAGCA 


AGCCGTATCA 


GCGGTGTGAA 


24840 


CCCCGCTGAC 


ATTAGCATTT


TGATGGTCTA 


CATCGAACAA 


GGAAAAATCG 


CGAAAGTGCA 


24900 


AGGATAATAA 


AAATTTGAAG 


CAGTTGCTTC 


TTCACGGATG 


ATTGATTGTG 


AAGAAGCAAC 


24960 


TGCTTTTTTG 


TTTTGAATTA 


GTCTACTCTT 


TTTTCTAAAC 


TACAAAAAAA 


GACTGTCTTA 


25020 


AATAATTTCA 


CAAACTGACC 


TATACTGAAA


GCGCAAACAA 


ATGATAGGAG


TGGAACAAAT 


25080 


GACAGATTGG 


TTAGGAATTA 


AAGGAAAAGT 


AGTGATTGTT 


ACAGGAGGCT 


CATCTGGTAT 


25140 


TGGTCACAGC 


ATTGTTGAAA 


GCTTGTTAGC 


ACAAGAGGTC 


TATGTCGCTA 


ATTTTGATAT 


25200 


TTCAGCATCA 


GAAACGCAAC 


ATGAGCATTT 


ACTCTTTGTG 


AAAGTCGATG 


TAACTTCTCG 


25260 


AGCAGAAGTA 


GAAGCAGGTG 


TCGCTCAAGT 


GGTTGAAAAA 


TTTGGAACCG 


TAGATGGCCT 


25320 


AGTAAACAAT 


GCCGGCATCA 


ATATTCCACG 


TCTATTGGTG 


GATAAAAATC 


ATCCTCATGG 


25380 


GCCTTATGAA 


TTATCGGATG 


AGGTATTCGA 


CAAAATTGTA 


GCCATCAATC 


AAAAAGGGCT 


25440 


CTACCTAATG 


AGCCAAGCAG 


TGGGCCGGAT 


TCTTGTACAA 


AAACAAGCAG 


GTGTCATTAT 


25500 


TAATATGTCT 


TCAGAAGCTG 


GATTAGAAGG 


CTCCGAAGGG 


CAAAGTGcTT 


ACGCTGCAAC 


25560 


GAAAGCAGCG 


GTCAGCAGTT 


ATACACGTTC 


ATGGGCTAAA 


GAGTTAGGAA 


AATCAAACGT 


25620 


CCGTGTCGTG 


GGGATTGCGC 


CAGGTATTAT 


GGAAGAAACG 


GGctTGCGCA 


CTTTAGCATA 


25680 


CGAAGAAGCA 


TTGGCTTATA 


CACGAAATAT 


TTCTGTTGAG 


CAATTGCGAG 


CAGGTTATTC 


25740 


AAAAACAAGT 


ACGATTCCAT 


TAGGTCGTAG 


CGGAAAACTC 


CATGAAGTCG 


CTGACTTAGT 


25800 


ATGCTATTAT 


TTATCAGACC 


GTTCAAGCTA 


TATTACAGGC 


ATCACCACAA 


ACGTAGCTGG 


25860 


AGGTAAAACA 


AGAGGATAGA 


GGTGGGAAAA 


CTAAATGACG 


CTAGTAAATA 


GATGGTACCA 


25920 


AGTAATAAAA 


TTACTAGTTG 


ATCACAAAGG 


AATGAGCTTA 


CAGGAATTGC 


AAGAGAAATT 


25980 


AGCCGCAAGT 


CCTCAAACAG 


TTCGTAAAAA 


CATTGATACA 


TTGAATGATG 


AATTAATCGG 


26040 


TATCGCCCAA 


ATTATTCAGA 


AGGAAAATCT 


CTTTCAGTTA 


GAAATCAATA 


ACTTTGAAGG 


26100 


TTTTGAAGAG 


GTTTTGTCTG 


GACGGCTAAA 


AAGGGAATCT 


GATTTTAATT 


CTTCAAGCAA 


26160 


GCGTGTTTCC 


TACATTATCA 


AACGATTAAT 


AGAAGAAGAT 


CAATTTATTT 


CAACTTATGA 


26220 


TTTATCCGAA 


GAACTCGCCG 


TTAGTCGCGG 


CACAGTTAAT 


AAAGACATTA 


AGCGCATGAA 


26280 


AGAGCTGATT 


GCTCCTTGGC 


AGGTTGCGGT 


TGTAGGCACG 


CCAAACCGTG 


GCATTCATTT 


26340 


AGAAGGAAAA 


GAGTTTGATT 


TACGCTTACT 


TCATGTGAAT 


TACGTCCAAG 


AATATTTTGA 


26400 


AGAACAGTTT 


CTTCATGAAA 


CTACCAAACA 


AATGATACAA 


AAAATAATCA 


AAGAAACAAG 


26460 


ACTAGCGAAA 


CAAGATAGTT 


TTCTTTTACG 


AAGAGTGGTT 


TCAATTGTTT 


TGCAGCGTGT 


26520 


TTTAGCTGGA 


AAACTCATTA 


CGGAACTACC 


ACCAGAATAT 


GTTGATTATG 


TTCGTCATAA 


26580 


CGAGCAGATT 


GAAGAACTGA 


TGTATCACTT 


GGAAATAACC 


TATAATGTAA 


CGTTAAGCCA 


26640 


ATGGGAACGT 


TCCTTTATCA 


GCTTTCCTTT 


TAACATCAAC 


ACAAACCATA 


TTCGTAATTC 


26700 


TTTATTGGCT 


GATGAAGGAC 


TCTTAGCTGA 


CTATTTTCAA 


AAAATGATGA 


AAAAAATTCA 


26760 
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TCATTCAGTT 


GTTGTTGAAT 


TTGATGAGGA 


CTTCTTGTTT 


TCAGAAATGA 


AAGATCATTT 


26820 


AAGAAACGTG 


ATGAATCGAC 


TCGTTTTTCA 


TGTGGAGTGT 


CACGATTTAT 


TTTACGGAGA 


26880 


AATTGAACGA 


CAATATCCTT 


TGGCCTATGA 


ATTAGCCAAG 


ATTGGCTTGC 


AAGAATTGGG 


26940 


CCGTTTGTTA 


AATCGTTGTG 


TGCCGACGGT 


TGAATTTGGG 


TATCTCGCTT 


TGTATTTTGA 


27000 


ACTAGCATTG 


AGAGGGAACT 


ATGAAGCGGG 


AGCCAAAAAA 


GAAATCGCTG 


TAATCTGCAG 


27060 


TACAGGTCAT 


GGAACGGCCT 


TGATCATTCA 


GCGGCAATTG 


GAAAAAGTTT 


TGGGTCCAGA 


27120 


CGTCCAGATT 


GCCCATTTTT 


CTGAAGAAGA 


GGCAGAGAGA 


AGAGAATTGA 


ATCAATATTT 


27180 


TGCTATTTTT 


ACAACCATTC 


CCTTAAAAAA 


TATTCGACCA 


CAAACACCGA 


TGATCCATTT 


27240 


AACTAATTTA 


TTTAACGATA 


CTTGGCTCAG 


AAATGAGTGG 


CAGAGAGCAA 


AAGAAATTCG 


27300 


TTCCGTGGAA 


TCTCAAAATA 


TTCATATGAC 


CTATCAATTA 


TTAGAGAATA 


CTCAAGGTTA 


27360 


CCGAGCGAAT 


ATCCAGAAAA 


TGGTTGCTGA 


TTTAGAAGCA 


GAAAAATTAG 


TAGATCCCCA 


27420 


ATTTATCGAA 


CGAATTTTTA 


CTAGAGAAGA 


TCAACAAACA 


ACTATTTTTG 


AATCGGGGAT 


27480 


TGCCTTTCCC 


CATACAATTA 


ATCAAGCCTT 


ACCAACGATT 


ATTTTATCGA 


TTGGGGTTTT 


27540 


TGAAGAATCA 


CTGGTTACAC 


CAGAAGGTAA 


AGTTGATGTT 


ATTTTGCTGT 


TGGGTATTCC 


27600 


AGAAGAGCTG 


ACGGCCACTG 


TTGAAGCAGA 


GTTACTACAA 


CTATATGACC 


GCTTGTTTAC 


27660 


AATCGCAGGA 


GACGAACGTT 


TACGTAATGA 


GTTACGGAAG 


CAACGAGATG 


TTTTAGCGGT 


27720 


TAAAGAGTGG 


ATGCAAAGGA 


AGGGGATTAT


CATATGAGTT 


TTATCTACGT 


ATTCGGGGCG 


27780 


CTTTTTTTAA 


CAGCTTACTT 


CGTTCAGATA 


GCTATGGGAA 


TGAAACAGAT 


TAAACACTTT 


27840 


AATCAAAACT 


ACCAAGAATT 


GCGACAATTA 


GGGAAAGTCG 


CGATTGGTCG 


ACGTGCAGGG 


27900


AAGTTAAAAG 


CAGGCACGAT 


TATCTTGTTT 


GCTGTGGATT 


CGCAAGGGAA 


AATCTTAGCC 


27960 


GCTAGAAAGA 


TGCAAGGTGT 


CACCGTTCTA 


GCTAAATTTC 


ATCCCTTACC 


TCAGTACGTG 


28020 


AACGAGGATA 


TTCATTATAT 


GGATCATTAC 


CATCCACTCG 


TACAAAAGGA 


AAATAAATTA 


28080 


ACCCAACAAG 


CGATGGAAAA 


TGCACGAGAA 


ATTTACTTAC


GTGTTGAGCT 


TGGCAATTAT 


28140 


CAAGAAACAC 


AACCATTAAC 


GCCGCTTAAT 


GGAGCCAAAA 


TTCAATGGCA 


ATTATGGAAA 


28200 


AATAAAGTAC 


AAACAAAATT 


GAAAGGAAGT 


GTCGAGTAAT 


GTCTTATATT 


ACAAAATTTG 


28260 


CGGAAGGATT 


TATGCACCTT 


TTCCAAACAG 


GAGCAGAAAC 


ATTTATTTCT 


TGGATGACAG 


28320 


GCATCGTACC 


TGTTGTTTTG 


ATGTTAATGG 


TAGCTATGAA 


TACTTTAATT 


GCTTTTCTAG 


28380 


GAGAAGAAAG 


AGTAACCAAA 


GTAGCCAAAC 


TTTCTGCCAA 


AAATCCTGTT 


ACGCGGTATT 


28440 


TAGTGCTGCC 


TTTTTTATCC 


GCTTTTATGT 


TAGGCAATCC 


AATGTCCTTT 


ACGATGGCAC 


28500 


GATTTTTACC 


AGAATACTAT 


AAACCTAGCT 


ATTATGCTGC 


ACAAGCGCAA 


TTTTGTCATA 


28560 


CCAGTAATGG 


GGTTTTCCCT 


CATATTAATC 


CTGGTGAATT 


GTTCGTTTGG 


TTAGGGATTG 


28620 


CGCAAGGGAT 


TGATACGTTA 


GGACTAAATT 


CGATGGACCT 


AGCGATTCGC 


TATATGTTTG 


28680 


TAGGGATATT 


AATGAATTTT 


ATTGGCGGTT 


GGGTTACCGA 


TTTCACTACC 


GCCTATGTTT 


28740 
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GCAAACAACA 


AGGAATTACC 


CTTAGTAAAA 


CAGTAGAGAT


TGCTGTGGAT 


TAGAAAGGAG 


28800 


CAAGGTGAGA 


TGACGTACAA 


TAGTATTAAG 


GTAGTTAAAG 


GAAATGGCGG 


TTATGGTGGG 


28860 


CCCTTAGTGA 


TTACACCGAC 


AGAAGAAAAA 


CATAAATTTA 


TTTACATTAC 


AGGTGGCGGC 


28920 


GAGAAACCCG 


CAATCGTTGA 


TAAAATCGTA 


GAATTGACAG 


GAATGGAAGC 


GGTCAATGGT 


28980 


TTTAAGACAT 


CGATTCCAGA 


TGATGAAATC 


GCATTAGCCA 


TTATTGATTG 


TGGAGGCACT 


29040 


CTTCGTTGCG 


GAATTTATCC 


TAAAAAAGGA 


ATTCCTACTA 


TTAACGTTGT 


TCCAACTGGC 


29100 


AAAAGTGGTC 


CCTTAGCACA 


ATACATTAAT 


GAGACAATTT 


ATGTTTCCGC 


TGTAGGCCCG 


29160 


AAACAGATTA 


CATTGACATC 


AGAAGAAGCT 


TCTGCTGATA 


CAGAAAAGCC 


CACCTCGAAA 


29220 


AAAACAGAAT 


TTAAATATGA 


TACAAGTAAA 


AAAATTACTG 


AACAAAAAGC 


GGAACAACAA 


29280 


AGTTTTGTTG 


CAAAAATTGG 


TATGGGTGCA 


GGAAAAGTTG 


TAGCGACATT 


CAACCAATCA 


29340 


GCCCGTGAAG 


GCGTACAAAC 


GATGATCAAC 


ACAGTGATTC 


CGTTTATGGC 


ATTTGTTTCT 


29400 


TTATTAATTG 


GTGTGATTCA 


AGGTTCAGGA 


ATTGGTGAAT 


TTTTTGCCAA 


TATCATGGCG 


29460 


CCATTGGCAG 


GAAACATTGT 


GGGGTTAACA 


ATTATTGGTT 


TTATTTGTTC 


TCTGCCGTTT 


29520 


CTTTCTCCTT 


TATTAGGTCC 


TGGAGCAGTA 


ATCAGCCAAG 


TAATTGGTAC 


GTTAATTGGT 


29580 


GTGGAAATTG 


GGAAAGGCAA 


CATTCCGCCA 


CAACTAGCCT 


TACCAGCATT 


GTTTGCTATT 


29640 


AATACACAAA 


ACGCGTGCGA 


CTTTATTCCC 


GTGGGTTTAG 


GCTTGGAAGA 


AGCTGAAGCT 


29700 


GAAACCGTTG 


AAGTCGGGGT 


CCCTTCTGTT 


TTATATTCTC 


GTTTCTTAAA 


TGGTGCCCCT 


29760 


CGAGTACTAG 


TTGCTTGGTT 


AGCAAGTTTT 


GGATTGTATA 


GTTAAAAAAT 


TTTTTTTGGA 


29820 


GGATGGGAAA 


TGTCAGTATT 


TACAACAAAG 


GtTCAGTCTA 


TTGGACCAGA 


AGCCGAATTA 


29880 


TTTAAAGAAG 


AAAAAATGGT 


CATCTTATTT 


GGAAAAGATG 


CACCAGATGC 


TTTAGCAGAT 


29940 


TATTGTTATA 


ACATTGAGGT 


TCAGCCAGTC 


ACAGAAGCGA 


TTACGGACAA 


GCAAACGTTA 


30000 


GTGATTGATG 


AACAAACTTA 


CCAGATTACC 


GCTGTAGGCG 


AGGTAGTATT 


AACTAATTTA 


30060 


GACACATTGG 


GTCACATTAC 


TATTAAATTT 


GATGGCGCAA 


CAACACCTGA 


ATTACCAGGT 


30120 


ACATTATACG 


TTGAAGAAAA 


AGCTATTCCT 


GAAATAACTG 


TTGGTACAAC 


AATCACTATT 


30180 


CTATAATTAT 


CGTTTAAGGG 


AGGAACGTAA 


AAGATGGAAT 


TTATGCTAGA 


TACAATTAAT 


30240 


CTTGAGGCTA 


TTCGAAAGTA 


TCAGAAAATT 


TTACCACTGG 


CAGGCGTAAC 


GTCTAATCCC 


30300 


AGTATTGTTA 


AACAAGCAGG 


AAAAATTGAC 


TTCTTTGCGC 


AGATGAAAGA 


AATCAAAAAA 


30360 


ACGATTGGAC 


AAGCGAGTTT 


GCACGTCCAA 


GTGGTCGGTC 


AAACGACAGA 


AGAGATGCTA 


30420 


GAAGATGCCC 


AAACCATTGT 


ACAACAATTA 


GGCCAAGAAA 


CATTCATTAA 


AATCCCAGTC 


30480 


AATGAAGCAG 


GCTTGGCAGC 


CATCAAACAA 


CTAAAGCAGG 


CAAATTATCG 


GATAACAGCA 


30540 


ACCGCTATTT 


ATACAGAATT 


TCAAGGTTAT 


CTAGCGATTG 


CGGCAGGTGC 


AGATTATTTA 


30600 


GCGCCATATT 


ATAATAGGAT 


GGAGAATTTA 


ACAATTGATT 


CTCAAAAGGT 


TATTGAGCAT 


30660 


TTAGCTGCAG 


AAATTAAACG 


GACGAACGCA 


AAGAGTAAGA 


TCTTAGCTGC 


TAGTTTTAAA 


30720 






AACGTGGCAC 


AAATTAATCA 


AGCCTGTCAA 


ATGGGGGCGC 


AAGCGGTCAC 


TATCGCACCA 


30780 


GAACTCGTGA 


CACAAGGCTT 


GGCAATGCCC 


GCCATTCAAA 


AAGCAGTAAC 


TGATTTTCAG 


30840 


GAAGATTGGG 


TTGCTGTTTT 


TGGAGTAGAA 


ACGGTAAACG 


AATTAGCATA 


AAAAGAACGA 


30900 


GAAGAAAAAG 


CGGCCATTTT 


TCTTCTCGTT 


CTTTTTTTGT 


TTCACATGAA 


ACATCTACGA 


30960 


TTATTCACTT 


GCTTAAAAAT 


TAGACAATTT 


TCTTTATTTG 


TCAAAGAATT 


TCTTAAAAAA 


31020 


AAGGGTTATC 


TTTTGGGTAT 


TTTTTTTGGT 


AGCAGTATGA 


TGAGGGTATC


AAATGATAGG 


31080 


AGGAAACGAG 


AATGAGATAC 


ACAAATGGAA 


ATTATGAAGC 


ATTTGCTCGA 


CCAAGAAAAC 


31140 


CAAAAGGTGT 


AGATGACAAG 


AGTGCCTATA 


TCGTTGGTGG 


TGGCCTAGCA 


GGTTTAGCAA 


31200 


CGGCTGTTTT 


CTTAATTCGT 


GACGGTCAAA TGAAGGGCGA 


AAATATTCAC 


ATTTTTGAAG 


31260 


AACTCACGCT 


GTCTGGCGGT 


TCTTTAGATG 


GTAAATTTAT 


TCCCCATGAT 


GGCTTTGTCA 


31320 


CTCGCGGCGG 


ACGCGAAATG 


GAAAACCATT 


TCGAGTGTTT 


GTGGGATCTA 


TTTCGTTCGG 


31380 


TACCATCATT 


AGAAGTAGAA 


GATGCCTCTG 


TCCTAGATbA 


GTTTTACCGG 


TTAGATTTAG 


31440 


ATGATCCGAA 


TTCTTCAAAT 


TGCCGCATTA 


TTCATAACLb 


TGGCGAACGT 


GTCCCCGATG 


31500 


ATGGACAATT 


TACTTTATCC 


7\ TV 7\ TV T\ TV O TV O 

AAAAAAGCAC 


AAAAAbAAAl 


CGTTGATTTA 


TTTATGACTT 


31560 


CTGAAGATCA 


ATTAATTGGC 


i\ tv t\ t\ t\ tv T\ rn 

AAAAAAATTG 


TV Tv/^tv rn/^rprnrprn 

AAb A 1 Cj 1 1 1 1 


CGGCGAAGAA 


TTTTTTGAAT 


31620 


CCAACTTTTG 


GATCTATTGG 


TGTTCAAibl 


rnm/" , /^»T» r P'T l rp/"'"j\ 

1 ibtl 1 1 loH 


AAAATGGCAC 


TCTGCCATTG 


31680 


AAATGCGCCG 


TTACATCATG 


LGL1 I 1 A1LC 


ALLAL A 1 L.AA 


AGGTTTGCCT 


GACTTTACAG 


31740 


CCTTGAAGTT 


TACCAAATAC 


rv AT>r , 7\ TV rp 7V r*(^ 

AA1 CAA1 ALU 


AA 1 Lb 1 1 Akj 1 


GAAmCCATTA 


TTAAGCTTTT 


31800 


TAACCAATCA 


AGGGGTAGAC 


TT"PP7\ Tv rp T\ m ^ 

111 LAAi Al (j 


A Ti 7\ C TA TA f P & T 

AAALAALuA 1 


CAATGATATC 


CAAGTCGACA 


31860 


TCAAAGGAGC 


TACTAAAGTT 


GCTCGGCGAT 


TATTGTTGAC 


ACAAGCGGGC 


AAAACCAAAG 


31920 


AAATTCCTTT 


AACTGAAAAT 


GATTTAGTCT 


TCGTAACGAA 


TGGCTCAATT 


ACTGAAAGTT 


31980 


CGACGCAAGG 


AGATCATCAT 


ACACCAGCAC 


CAATTACTCA 


TGAACTAGGG 


GGAAGTTGGA 


32040 


ATCTCTGGAA 


AAACTTAGCA 


AAACAATCCC 


CAGAATTTGG 


GCACCCAGAA 


GTTTTCTGTG 


32100 


AAAATTTACC 


AGATGAAAGT 


TGGTTTGTCT 


CAGCAACbAT 


TACGTGGGAA 


AATTTCGATA 


32160 


TTGAACCGTA 


TCTTTCTCGG 


TTAACACATC 


GTAAATTACG 


CACAGGAAAA 


ATTGTCACAG 


32220 


GTGGTATTAT 


TACTATCAAA


GACTCTAATT 


GGATGATGAG 


TTTCGCTATG 


CATAGACAAC 


32280 


CGCATTTTAA 


AGAACAAAAT 


GATCAACAAT 


CTATAACTTG 


GGTATATGGA 


TTGCTTTCTA 


32340 


ATAAACCAGG 


AAATTATATC 


AAAAAACCAA 


TCGAACAATG 


TACAGGCCAA 


GAAaTTACGC 


32400 


AAGAGTTACT 


TTATCATTTA 


GGGGTCCCGG 


AAGGGGaAAT 


TGAACGTATT 


TCAGAAGAAT 


32460 


CGGCTACCAC 


TATTCCGGTT 


TATATGCCGT 


TTAtCACTTC 


TTACTTTATG 


TTAAGAGAAC 


32520 


CAGGCGATCG 


aCCATTAGTT


GTACCAAACG 


GCTCTAAAAA 


TCTTGCTTTC 


ATTGGGAACT 


32580 


TCGCAGATAC 


TGAACGAGAT 


ACGGTGTTTA 


CAACAGAGTA 


TTCTGTACGG 


ACAGCGATGG 


32640 


AAGCTGTTTA 


TCAATTGCTA 


GAGGTTGAAC 


GCGGAGTACC 


AGAAGTGTTT 


GCTTCTGCAT 


32700 
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ACGACTTGCG CGTCTTAGCA AACTCTGTTT ATTATTTATC TGACAAGAAA AAATTAACAG 327 60 
AAATGGAT 32768

(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5960 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 



ACCCCTTTTT 


AbTIbbl ibb 


a p a a arpr a 
AbAAAbbbAn 


bbb 1 A J. A 1 


PAP.P.nTPPA A 


Tf^APrAA^TA 




TTGGTAAbbA 


att a nrfnTP 
Al lAAbbnbb 


1 A 1 bb A/\ i An 


A ATTP A AP.P. A 


AATP,AAPPTA 


TAAPAATTAA 


120 


t\ t\ T\ 7\ /"■< rp A"* rp t\ 

AAAAGbTb I A 


Hp/-' A AAA APTP 

1 b AAAAAb I b 


I lAAAAAAbb 


A A A APP.P.TTP. 


TTGAAATAfJA 
1 1 \jr\r\r\ 1 nbn 


AAGATTT^PAA 


180 


AG GAGA I 1 AA 


AAA i bAAbbA 


1 1 lAibbAAl 


TTTTTAP.ATA 


ATPATPPTP.A 


AfiAG AA AGAA 


240 


TfP AT7\ A Apr 1 

TTGA1 AAAbb 


bAAAAbbAI 1 


np a A PPTT AT 
Ab AA(jL> 1 J. A I 


AP AP.TTP.ATP. 


<*^A ATTTT AAA 


TGATPAACTT 


300 


AATT1 Ibl lb 


b 1 b 1 Ab A 1 Ab 


TCTATCTATT 


GGTCTGGAAA 


AATTTGGAAA 


TATTCATTAT 


360 


AGTA1 bbAAA 


«pr»pp nr a a tt 
i. bbbAbAA 1 I 


TAATGAATTA 


CTAAGTAAGA 


AGATGGTAGT 


ACTTTTTGTT 


420 


CCTbbbbAi b 


p aptpppapa 
bAb 1 bbbAbA 


CTCTGGCGCC 


AAAGAAAGAT 


ATTTTGGATT 


GAGACAATGG 


480 


AA L MLtL. iul 


PTPA ATPTP.T 


TCGAAAAAAT 


ACCATCATTC 


TTAGAATAGC 


TGATGTTGGC 


540 


TTTPTPTP7AP. 
1 1 1 b 1 b 1 U/Ua 


p. a t p p t a t t a 


CACATTATTT 


GAGCAAGAAA 


TTTTGGATTT 


AATTGAAGCA 


600 


a ap.a a aapap. 


AATTTP.C A^T 
rtri 111 uOn ^ l 


GAAGCATCAA 


GATATAATCT 


TCTATGGAAA 


CTCTAGAGGC 


660 


p,p,AArnr,r,Ar, 


PP,TT ATT ATT 
Lui lr-ii xni i 


AGGTATTACA 


GGTAATTACA 


AAACAGTTAG 


TGTAGATCCA 


720 


P.TA ATTP.ATP 


nAP,Af;rrTTft 

urtOrtvjv^v^ i i o 


GGTAAACAGC 


GCGGATGGTA 


GAATAGATCG 


CCAGTTAATG 


780 


TTTGATTTCG 


TAGAAATGGA 


TTTTTCAAAC 


CGAATAAAAG 


ATAAGATGGA 


ACAGTCAAAC 


840 


TATGAATTAG 


AAAACATTAC 


GGTGATTGCT 


TCTGAAGCAA 


ATAAAAAAAT 


GTATGAGAAA 


900 


ATTTACCCCT 


TATCTAATGG 


TTTTACCTTG 


AAAAAATTAG 


AATTAATGAT 


TGATTCTGAA 


960 


AATACAATTG 


GTATACACGG 


CCTTACGATA 


ATGGAAAGTA 


TTCATTTACA 


ATTAGCGTTA 


1020 


ATAAATAATT 


ATTTATTTGA 


TTAAGTAACA GAGAAAGTAG GAGCgGAAAA ATGAAAAAAG 


1080 


CAGCmATTAT 


GGGCTGGTGG 


TATAACCaAA ATTATGGATC 


AaTCTTAACA 


TACTATGCTC 


1140 


TAAACAAATA 


TGTACAGAAT 


AAAGGGTACG 


AGACGGTTAT 


GATTGATGGA 


CCACTAGGTT 


1200 


ATAAAAACAG 


AAGTAACTTT 


CGAGCATGGA 


TGCCTTTAGC 


GTATAATTTT 


TTTAAGAAAA 


1260 


ATAATATGCC 


CTATACTGAA 


CAATTAACAA 


AAGAAACGTT 


ACCGACATTG 


AACGATTTAG 


1320 


AAAATCTAGA 


TACCTTTATT 


TTAGGCTCTG 


ATCAAATGTG 


GAATCCCTGG 


AATGGTTGGG 


1380 


TAGACGACGA 


TGATTTCTTA 


GATTTTGTTT 


ATCCTCGTAA 


TAAAACAATT 


GCTTATTCTG 


1440 


TTTCATTAGG 


AAAAGCAGAT 


ACAAGTAAAT 


ATGATCCTAA 


ATGGGTTGCA 


AACCGCAAAA 


1500 


AAGATATTAC 


ACAGTTTAAT 


CATGTTTCTA 


TGAGAGAAGA 


TTTTTCTGTA 


CAAATTATGA 


1560 
AAGA1A1 1 1 1


1 UAAvjMAuAM 


pttattpapp, 


CTTTGGATCC 


AACCTATCTT 


GTTGAAAGAA 


1620 


G I GAA 1 A 1 bA 


1 1L1 1 i AGG 1 


patpaappta 

vjM I <^nno ^ Jn 


PPTTTAAAPG 


ACAAGAATCA 


GGAGATTATA 


1680 


TGGG IG 1 Gil 


<"pTTf""*TT/'"*/""'A m 

1 1 1 G 1 1 GGA1 


A T T 21 A P A A PP 
Mil MMVwMMVjo 


AAAAAGTTCG 


TGTAGCACTA 


GCGATCGCAG 


1740 


Ti "J\ TV T\ 7\ rp rp r""* 

AAAAA I TGGG 


A/^TPAAATTA 

AG 1 G AAA 1 1 A 


o 1 1 1 


PTA ATPPTTT 


AAAAGGACGT 


GAAPTAGCGA 


1800 


AAAAAACGTI 


1 GG 1 AA 1 AGA 


PTAPAATATP 
o 1 nunn inly 


TTTPTPAAPA 


TACACCAGAA 


AATTTPTTAT 


1860 


CTGTTTATCG 


AGAAGGAAAA 


TATATTATTA. 
1 Al Al 1 Al 1M 


PTPATTPPTT 


TPATGGCAPT 


PnTTTTTGPGA 

VJ X X X X X Vta/ VJii 


1920 


CTATTTTCCA 


T\ TV TV TV f** rp T* HP 

AAAACCGTTT 


1 GAAi I 1 1 1 i 


TAT A ATP A AP A 
A 1 AA 1 unnLn 


A PP,TP,P, A ATT 
M\^>o 1 ounn X X 


HAPPP,PTTTA 

UnUv^oL XXX n 


1980 


ATAGCTTGAT 


GAACCTGTTT 


GAGG 1 1 GGGG 


Zi A APT AP A PP 


TPTPT ATP.AP, 


r a AAAPAPAA 


2040 


AACAAGAAAT 


m/"* TV T\ TV TV TV T\ rp 

TGAAGAAAAT 


A A A A A "P A TT A 

AAAAAlAi IA 


r'TTTTAPPBPT 
Gill AUUAL- 1 


TP. APT A TP A A 
1 Onvy 1 n 1 Vyttn 


AAP^APPPiPTG 


2100 


GAAAGATTGC 


_ TV TV 7\ 7\ 7\ 7\ A"* 

a G AG AAAAAG 


AGA 111 GAGA 


rp a A PTPfTT Zi 
lAAGiGol In 


APPPATPPPT 


TPAPA AP,TAA 

X UM\^MMO X Mn 


2160 


ATTTAGCCCA 


GATGAAGAGG 


AAG1A1 Al GA 


HjAAl 1 1 Hbn 


PP.ATATATTA 
v^ on X M 1 M x in 


TP A ATA A APP 
X v»nn X nnnL. 


2220 


TTTGGATTTT 


TATCGAGC1 A 


A A A P A APTPT 

AAAG AAG 1 G 1 


TPPP ATTTPP 
1 ULunl 1 1 


PPAPPAATTP, 


TTTTATPHPP 
XXX inivyUVyV/ 


2280 


AAATGGTACG 


ATAAAAAATT 


Of 1 A TiTfTTn A 

CCAATGG I AA 


rpp a A AfTTTT 

1 GAAA\jU 111 


TPPA AP.PT A A 
1 bunnuu I Mn 


PAP AT A AAPA 


2340 


ATTAATTTTT 


TvmPTV 7\ *T1 TV 7\ rp 

ATGAATAATC 


Al GG 1 GAAG 1 


PArTBra zip a 
LAL 1 AL. AAU A 


TTT A ATPP.TP. 
1 1 IMMlL-olo 


A A A APTTTA A 
MMMnVj XXX nn 


2400 


GGGAAAGGAA 


ACGTTCAGTC 


ommmmTipTi AT 1 

CTTTTAGAAT 


GG 1 GGG1 AAA 


1 A 1 o 1 o AAAo 


ATPAAAAAAT 
M 1 oMMMnnM 1 


2460


CGAACATGTT 


tv m m 7A rn rr* rn y*> 

ATGTATTTGC 


ATGCGArAGA 


AAA ACTT A AP 

AAAAGG 1 AAG 


PPT A A APPP A 


A APA APPPAP 
MMV_» Mno o *w n^ 


2520 


aGAAAAAAAA 


GCATTAGCAA 


T TGGG GAG AA 


TPTTTTB AP A 
1L1 i 1 1 AMU A 


APAAAAPAAT 
MoMMMM^wrtM X 


T A ATT ATP A A 
X nn X X n X Onn 


2580 


AATTGTGCAA 


A ATlPAPAPA A 

AATGAGAGAA 


1 1GG 1 1GG1G 


PAAAPAAPAT 
bnnnvy MMoM 1 


PTTTT AP.AAP, 
Ul 1 1 i nonnu 


AATTACCAGA 

nnx X nvvnun 


2640 


TTTAGAGTTA 


TTTGTAGGTC 


G 1 G AA 1 1 AAA 


AP A AT ATP.PT 
AoAA 1 Al uL 1 


TPP.A ATTTTP 


TTppTppTrr 


2700 


T\ /— • /~» 7% T\ rp rp rrt 

AGCAGATCT 1 


i 1 AAi 1 i 1 1 G 


r"AZifiAAPPZiT 
tnnnnnLurt 1 


APA Af^AAP,TT 
nonMonnO l l 


xxx rui x x v3 


TAAAATATGC


2760 


m TV7V7ATVT\rpTVT\ rp 

TAAAAATAA1 


GAGA1 1 GG 1 1 


1 AAG 1 G 1 1 A 1 


PPPTA A AP.P.t- 


TPAAATATTT 


TAGTPPGTGA 

x nu x v^v^vj x vjii 


2820 


TGG IGG1AIG 


GG 1 GGGA1 1 A 


PTTT7A A. ATZi T 
till nnn I n 1 


P APTP.PPTTP, 


AATTATPGTA 


AAATTACGGG 


2880 


GAATGTTTTA 


AGAG rTTGGG 


GAGG 1 GGAAG 


TTT A Zi TTit A A 
111 MM 1 1 unn 


APATPTTATT 
MUM 1 L 1 1 M X X 


ATPTAPTAGA 

n x ^» x ml x non 


2940 


ACACTTGAAA 


1 GGGG 1 1 1 AG 


AA 1 ubuvyMuft 


T A AT ATTPPA 
1MM1M1 1 ^^_rt 


PP.AAPPATTf; 

uOMM^r^nX x o 


GTGGCGCAGT 


3000 


TTATATGaAT 


GG 1 GGGAGAG 


1 1AAGGA1A1 


T A AT AP.PATP. 
1 MM 1 nuun 1 o 


TTTPTTP.AAP, 

X X X v> X X vjii-TiVJ 


PTAPGATTGT 


3060 


AGA 1 G AAAA 1 


P<" Af^AAATTA 

GGAGAMA1 1A 


ziAPTTTTAAA 

nnu 1111 nnn 


TAAAf^AAPAT 
1 MMMOnnvjn I 


HTTP A AT TT A 

ui i Vynn xxx n 


GTCATCGCTA 


3120 


1 AG 1 1 G A ill 


A 1 GG A 1 G A 1 K. 


PTP Zi P TPP a T 
U 1 uny 1 uun 1 


T ATTPT AP A A 


ApTAAATTAC 


AGATTTCAGA 


3180 


AGGTAATTTA 


GAAAATATGG 


TAAATGACAT 


GGTAGGAACT 


GTCGAAATTA 


GAGAAAGGAT 


3240 


GCACCCACTA 


ACACATCCAA 


ATCATGGGTC 


AACTTTTACT 


TGGGGACGGG 


CGCCTCGATT 


3300 


AATTCAACAA 


GCTGGCTTAG 


TTGGTACTAG 


AATTGGCGGA 


GTAAAAGTAT 


CTGAAAAACA 


3360 


TCCAGGTTTC 


TTTATCAATG 


TTGAACAAGC 


CTCTGCTCAA 


GACTATGAAG


CATTAATTTA 


3420 


TTTAATTATT 


GCAAAAGTTT 


ATGAATTTTC 


TGGATTTTTA 


TTAAAACCAG 


AAGTGAGAAT 


3480 


TTTAGGTGCA 


AATATGTGGG 


AAGCAAGTTT 


AACTTTTAAA 


TAAACAGGGG 


AGTGTGTGAC 


3540 
ATGAAGAAAA TAATCACAAT AATTGTGTCT TTAGCAATTG GTATCGGTTG TGGTTTTTGG 3600

TCAGGATATT ACATCAGTAA AAAATATTCA GAAAACCAAG AGACAACTAT TTCCAAAAGT 3660 

ACTAACTCAG AACAACCTTT GATTACAAAT GGAGAGTATC AAGTAACAGC TAATTGGGAT 3720 

TgGAAAACAT TAGTTGTTAA AGATAATGTG TGGACATTGG ATGGCAAACG TATTTATGAT 3780

GTTTCCACTA CAAATGATAA TATTGTGATT TTAAAGGGAA AAAATTTGAA ATCACTTGTA 3840

GTTTATAAAA TAGAAAAAGA TAACGAAAAA ATCAACTTAT ATTCTTACAC AAAAGAAGGC 3900 

GTTGGGAAAA AAGTATTAGC TACTCTAGTG AAAAAATAAA TAATGCATTT GGTGTTTGAA 3960 

ATAAATTGAA AGTAACTTTG TTAGGAGAAG TTAGTAAATG AAATACAATA CGAAAAATTA 4020

TGACTATTTA ATTGTTGGTT CGGGACCTTT TGGGGCAACG TTTGCTTGTG AAGCAGCTAA 4080 

ACGAGGAAAA AAAGTATTAG TTATTGAGAA ACGTGCTCAT ATTGGTGGCA ATATGTATAC 4140

ACATAAAGAA AACGGGATTA ATGTCCATGA CTATGGTGCC CATATTTTTC ATACTGACAA 4200

TAAAGAAGTA TGGTCTTATG TCAATCAATT GACGGAATTC AATGGCTACA TCAATCAAGT 4260

AGTTGCTAAT TACAAAGGGG AGCTCTACAA CCTACCATTC AATATGAACA CGTTCTACCA 4320

AATGTGGGGG GTAAAAACAC CAGCTGAAGC ACAAGCAAAA ATTTCAGAGC AGCGGGAGGC 4380

AGCTGGAATT AAAGGCGCTC CmAAAAATTT AGAAGAACAA GCGATTCGTT TGATAGGAAC 4440 

GGATATCTAT GAAAAATTAA TTAAAGGCTA TACAGAAAAA CAGTGGGGAC GTAAGGCTAC 4500 

GGAATTACCA AGTTTTATTA TTCGCCGTTT GCCAACTCGT TTTACTTTTG ATAATAATTA 4560

CTTTAACCAT CGCTATCAAG GCGTwCCTGT CGACGGTTAT ACAGCTATGT TTGATAAATT 4620

GTTAGAGAGT GAATTAATTG ATATTCAATT GAATAAAGAC TTTTTGGCAG AGAAAGAAAC 4680

TTACTTATCA GAGTTTCCAA AAATTGTTTA TACAGGGATG ATTGATGCAT TCTTTGATTA 4740

TTCTTATGGC GAATTAGAAT ATCGTTCAGT ACGTTTTGAA AATGAAACAG TAGATTCAAA 4800

CAATGCACAA GGCAATGCCG TAATTAATTA CACAGATGCG GAAACTCCCT ATACACGGAT 4860

AATGGAATGG CGACATTTTG ATCAAAAGGC CGATGAAAAC AAAACGATTT TGACGAAAGA 4920
ATACCCACAA GATTGGGATC GTTCAAAAGA AGCCTATTAT CCTGTTAATG ATGAGAAAAA 4980

TTCACTTTTG TTTAGAAAAT ACAAAGAAAA AGCAAACGCT CAAGAAAAAG TTATTTTTGG 5040

TGGACGTTTA GCCAATTATC AATACTATGA TATGGACCAA GTTTTCGGAG CAGCTTTGAA 5100 
AGCAGTCGCT AAAGAATTTG GTGAAATGGC CAACTAGTGA ACGGTTAGAC TCGGAGGAAC 5160 
AAATGGAAAA AGAACTGAAA GTTAGAACAA AATTACTTAC CAAAGAATAC AGTTTAGCGC 5220 
AAACACGTAT TGACAAATTG AAGACACTTT TTAGTGTTTT TCAAAACAAA GTTCCCACAT 5280 
TTTGGGCGTT AAAAGGTGTT AGCCTCGACG TATATTCTGG AGAAACAATT GGTATTATTG 5340 
GTTTAAATGG TTCAGGCAAA TCCACTTTAT CAAATATTAT TTCAGGCATC ACCCCTCAAA 5400 
CGTCAGGAGA ACTTGAAATA AATGGTGAAG TTTCAATTAT TTCGATTGGT GCAGGACTAA 5460

ATAATAATTT AACAGGACGA GAAAATATTC GAATGAAATG TTTAATGTTG GGGGAAAAGA 5520 




ATAAAGAAAT TGATGCCAAA ATCGATGATA TTATTGAATT CTCAGAATTA GGCGTTTTTA 5580 

TCGACCAGCC TGTTAAAACT TATTCTAGCG GTATGCGTGC TAAGCTTGGT TTTTCAATTG 5640 

CGGTTCATCA AAATCCAGAT ATTTTAGTGA TCGATGAAGC GTTATCTGTA GGAGATCAAA 5700 

CCTTCTATAA TAAAGGCTTA AAAAAAATGC TTGCATTTAA AGAGCAAGGC AAAACTATCT 5760

TTTTTGTCTC TCATtCTATT CAACAAGTAG AGCAAATTTG TGATCGTGTC GCTTGGATGC 5820 

ATTATGGTGA CCTACGTGCA TTkGGkGAAA CTAAGCATAT TTaAAGGAtA TCGTGCTTTC 5880 

TgCAcCGTaT AAyCATTTCm CTGAACCGCA AAAAGAATCG TATCAGAGAG AGGCAAGGCC 5940

AAACAGAGAA ATTTTCCTTG 5960 
(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 



AAAAGTTCAA 


AACCTTTTTG 


TTTAATCACA 


TTTTGTATGG 


CGGCGACGAC 


TTCTTCAATC 


60 


GTCAAGCCTG 


TTGTATCGAT 


TTTCACAGCA 


TCACTTGCTT 


GGACAAGGGG 


TGATACTTCT 


120 


CTGGTTGAAT 


CTAAATAATC 


CCGACGTTCA 


ATCTCCGCCT 


TCAAGGTTTC 


AAAATCCGTC 


180 


TCAATCCCTT 


TTTCTTGATT 


TTCTTTAAAA 


CGACGTTCTG 


CACGTTCTTC 


AACGCTAGCC 


240 


ACCAAGAATA 


TCTTTACTTC 


GGCCTTTGGT 


AAAACTGCAG 


TACCAATATC 


TCGACCATCC 


300 


ATGACAACTC 


CGCCGGCTTG 


ACCAATTTTC 


TGTTGTAACG 


CCACCATTTC 


TTCTCTAACT 


360 


TTTGCATGTT 


TTGAAACAGC 


TGAAACAGCA 


TTCGTTACAT 


CTGGCTGGCG 


AATCGCTTCG 


420 


GTCACTTCAT 


GACCGTCGAT 


AAAAACACGT 


TGGCCATTCT 


CTGCTTGCTG 


AAAAGAAATA 


480 


GTGTGGTTGA 


CACAAAGCGC 


TACTAATGGC 


TCTTCCGCTT 


GAATGTCGAT 


TTGATTTTGC 


540 


AAGGCCAGAT 


ACGTAATCGC 


TCGGTACATG 


GCACCCGTGT 


CACAATACAC 


ATAATTCAAT 


600 


TGTTTTGCTA 


AAATTTTTGC 


AACTGTACTT 


TTGCCTGACG 


AAGCTGGTCC 


GTCAATAGCA 


660 


ATACTGATTT


TCCCCATGAA 


AAAACTCCTT 


AACTGATATG 


CATCTTTTTA 


AATGCGCTTC 


720 


TTCTTTTATA 


TTAAAATCAG 


AAGAAAAGAA 


CTTGGATTTA 


AACTGATGAC 


GAGTCTCTAA 


780 


CGAAAAAGCA 


ATAAGCCGAA 


ATCCCTCGAA 


AGACGCAAAA 


CATCTTTCGG 


AAATTTCTGC 


840 


TTATTGGCAT 


TCTCATGTAA 


TCATCCGTCg 


CGGTTTAGCC 


CTTGAAAGTG 


CTGTCCAAAC 


900 


CTCTTTTTCT 


GGATTATTTA 


ATACGTAATG 


GTTGTCCTGG 


ATAAAACACA 


GATGTATCAA 


960 


TCCCAGGATT 


TAACGCTAAA 


ATTTGTTCTA 


ATGTCATTCC 


GTTACGTTCG 


GCAACTTGAC 


1020 


GTGCCCCTTC 


GCCACTTTGA 


ACGGTATCAT 


AAACAGGTTG 


TTGGTTCTGC 


GCTTGTTGCT 


1080 


GTTGTTgATT 


TTGAGCTTCT 


TGTTGTTGTT 


GCTGCTCTTG 


TTGTTGCTGT 


TGCTGTTGTT 


1140 
GTGCnTGTTG GTCCTGTTGT TGCTGTGCAG CnATGCTTGA ACTTGAAGGT GTTGTATTTT 1200

CAGCTGGTTG GCTGCTTGCT GGTTCTGAGC TTTCCACTGA TTTTGGACGT ACTTTCTTTG 1260 

TTTCnTTC 1268 
(2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

CTTTTGTGCC ACAATCCGTT CCTCCCACCT TGGCGTGCCC AAACAAGGTT AATGATTTGT 60 

ATAAACAATT TCCAACATCA ATAGTTGGGA CCACTAAAAT ATCTGCATCC CCCATGATAG 120 

GACCGCTGTA ACGTTTATGC GCAACCGCTT CTTCAGAGGT CGCTAAATCA AGCGAAAGTG 180 

GCCCAAAAAC CGTAGCCTCT TGTTGATCAT TAAAATGTGC CGTGACTTCT TTTGCTAAAA 240

CAGACGAAGG CATTTTAGGA TTGAAATTTT CCGCTGCGCT TAACAAAGCA ATTTTCGGGT 300 

GATGCAGTCC CAATTTTTGG GCGACTTCTT TAGCATTTTC AACAATTTCA ATGAGGGTCG 360 

CTTGAGTGGG GGCGATATTC ATCGCACAAT CGGTTAACAA GAAGGTTTTT CCCGCAGGCA 420

GCTCCACCAT TGCTACATGG GAAAGAATCG GTTTATTTTT TAATTGATGC TCACTTTTCA 480

ACATTTCTTT CAGTAATGTG TGGGTCTGAA TAATTCCTTT CAATAAAATT tGTGCTTGAC 540 

CGGTTGCAAC TAAACTGACA GCTTCCTGTG CTACCGCGGC CTCATCTGAG CAATGAACAT 600 

ATTTCCAAAG ATTTTCAGTA TCAAGATTTT CATTTGTATC AAATACAATA AACTGCAACG 660 

GTTGCTCGGC TTCTTTTAGT GCTTTTTTAA CTAACTGTAA AATCTCTGGT TGTGAACCTC 720 

CTGCAATTGA AACAGTGATC ATTTTCTCAC CTCATTTTTT CTTCTTTACA CTTTAAATCT 780

ACCTTAGTAT GTAAACGCAT ACAAATTGTT AATTTTTATG TCCTCATTAA TATTTTTTAT 840

AGAGAAGGAA AAGCTTAGTA AATCAAGCTT TTCTTTCTCT AAACAATTGT TATCTAATTT 900 

ACTTTTCATG AAAAAACTTC TTTTCTAAGA ATGAAAGGAA AACTTATGCT TCGGCACAGC 960 

TGTTTATTTA TGTCAGATCC TAAAGCATGT TATACTTTAG ATACTTATTT TTAGGAGGAa 1020 

TTAAATGGCA CTACTACTAT TCTTTTTGTT TATCGCCCTA CTAGGATTTG GTATATTAAA 1080 

AATTAATAAC CGCAGtATCC TCGGCGGTAT CACtCTGGCT tCCGGCACCT TATtGTCATt 1140 

AGtCaCCTtA CTATTTATCG GATTAGACAA AATTTATTTA CATTTTAAGA ATGGCGACCT 1200 

AATT 1204 
(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403:



ATAACAAAAC 


ACTACTCTTC 


TATCmATAAT 


TGaCTTAATA 


aTTGAATTTC 


TTGrTTCTTA 


60 


GkTAAGTCAA 


TTGtTTTCAT 


kGTAmCCGCT 


GTGGCTAAmC 


CTAAATAGAT 


TACCACCATT 


120 


TTAGGAAATA ATnCGACAAA TTTyCCTTkG yCCATTtGkA CATATTTTTt 


ATCCAATAAT 


180 


AAACGCTCTT 


CAAAAAAAGT 


TTGAAGCATT 


TGCTGCAATT 


CTTCTTCATA 


ACCATGTGCT 


240 


TTCATAAATA 


ACGCCGTATA 


GATTTTTTGA 


TGTTTTTGGG 


CAAATAATAT 


AAAGTTCAGA 


300 


CCCAATGCAA 


TCACTGGGTC 


TGAGGCACCG 


ACATCTGCTT 


TATAAAATAA 


TTGCATACGA 


360 


TGAAATAACG 


ATTTAATTAA 


GGTTCGTTTT 


AAATCGTCCA 


TGTTGACAAA 


TTCTAAATAA 


420 


ATTGGCTGCG 


TAGAAATCCC 


CATAGCATCT 


GCAATATTTC 


TAGCAGTTAA 


ATTAGAAAAA 


480 


CCTTTATCTA 


CTAAAATTTT 


TTCAGCTACG 


TTTAAGATTT 


TTTCCTTTGT 


ATATACCTTC 


540 


GTTCTAACCA 


TTTTCATACC 


CCTTTTCTCT 


TTTAGTTAAC 


AATAACTTTA 


TTATGTTCAC 


600 


ACAAAAGAAA 


ACCATTTTCA 


TTATATATAT 


AATATTTTAC 


ACTTTTTTAT 


GCGTTTCTTT 


660 


TCTCAAAATT 


TTCTATACAT 


TTCAAAAAAA 


TTGAGAATTT 


TCATCGCCTA 


TAAACAAATA 


720 


GGTCCAGAAC 


AAAAGGTCAT 


CACCTTTTGT 


TCTGGACACT 


AAACGGCTGC 


ACTCTTTATT 


780 


CTTCTATTTG 


TTTGCCTTGT 


TCTTTGGCAA 


GCTCATTTTT 


ATCATAAGCT 


ACAAAGAACG 


840


GATAGTAAAT 


CAAAATTGCC 


ACTAGAATTA 


AAACAATATT 


CAAAACAGCA 


GCACGCCAAT 


900 


CGCCGCCTGT 


CGCCATAAAG 


GCGCCAATTG 


GTCCTGGTAA 


GGTCCAAGGA 


GCGGTAAAGA 


960 


CAACTTCACT 


TACTAAACCT 


AATTTTGTGG 


CAAACCAAGC 


AATCGTTGCT 


AAAACCATTG 


1020 


GTGCAAAGAT 


AAAAGGAATC 


ATTAAAATCG 


GATTTAAGAC 


AATCGGTACA 


CCGAAAATAA 


1080 


CTGGCTCATT 


AATATTAAAA 


ATAGACGGTG 


CTAAAATAGC 


TTTTCCTAGT 


TTTGAACCAT 


1140 


ATTCTGATTT 


TGCCCGGAAG 


GCTAATAATA 


TAGCTAAACC 


AATTGTGGCT 


CCAGCACCAC 


1200 


CAATCCAAAT 


AAACCATTGG 


TAAAAAGGTT 


CTGCCGCAAT 


CCGAGGTAAT 


GcTTCGCCAG 


1260 


CTGCTTTAGC 


AGTCGTGTTT 


TCCTCTAATA 


GTTGTAGCCA 


TAGAGGACGC 


GCCAACGATC 


1320 


CTACGATTGA 


AGCGCCGTGA 


ATCCCAAAGA 


ACCAGAAAAA 


TGTGGTTAAA 


AAGACTAACA 


1380 


GCAAGACACT 


CAATAAAGAG 


TCCGCTGCTT 


GAACTAGTGG 


TCCCACTAAG 


TTCCCTACAA 


1440 


ATGTGTGCCA 


ATCAAAGCCT 


AAATAATAAG 


TGATTGAACC 


AATCAATAAC 


ATGACAATTA 


1500 


AAGTCGGAGT 


CAATGATTCA 


AATGAACGAG 


CAACAGCAGG 


TGGAACGTTa 


TCAGGCATTG 


1560 


TAATTTTGAA 


CTTAGACTTA 


TCTGTCACTC 


GATAGATTTC 


GACAGCAATA 


ATTGACGTCA 


1620 


CAATCCCGAC 


AAACATCCCT 


GCACCACCCA 


AATTAGCCAT 


TGGTAAAACA 


AACCCrGrAA 


1680 


CGCcAGCTGC 


TTTACTGGCT 


tCTTCAGGAa 


TCGCTACAGG 


TACTAATGTT 


AATAAAAAAG 


1740 


CAATCGTAGC 


CAAAATCCCT 


CCAGAGACTT 


GATCAAGATT 


ATATGATTTC 


GCTAAACTAG 


1800 


CGCCAATGCC 


AAAAGTCGCA 


TATAACGCCA 


TAATGTACAT 


TGTCATTCGG 


TAAGGCAATA 


1860 
AGATTGTAGC


AGCATTACTT 


GTTAAAAATT 


GAGTAATGCC 


CCAAGATTCT 


GGTAACGGCG 


1920 


GAAACGCAAT 


AATTAGAAAA


AAAGAACCGA 


CAATAATTAA 


AGGTAATGTC 


GCAATAATCC 


1980 


CATCCCGCAC 


CGCACGTAAA 


TGGCGCTGAT 


TGGCAATTTT 


AGCCATTGGT 


CCAGCTAGAT 


2040 


GATTTTCAAT 


CCAAACAGTC 


AGTTTATCTA 


ACATAAATAA 


CACTCTCCTC 


TGTTTTTCTC 


2100 


TTAAAAAATA 


ACTCCTTTAA 


TCAGCCATAm 


ATAAGAGATA 


AGCTCTAATA 


AAATAAGAGA 


2160 


rATTAArAAC 


AACCAAATAA 


AACGCTTACA 


AAAACCTGTT 


TTCkTTTcAA 


TAAAATAATG 


2220 


TTCaTCGGCA 


ACGGCCGCCA 


CTTGTGGCAA 


GTGCGTGATA 


CACAACACCT 


GCGAATTTTC 


2280 


TGAAATTkGA 


TAAATTTTAT 


CGGCAATCGC 


CTGTGCTACT 


CGGCCACTaA 


CTCCTGTATC 


2340 


CACTTCATCA 


AAAACAATAC 


TAGTGATCCC 


TTGCGTTTGA 


GAAAAGATTG 


TTTTCATAGC 


2400 


CAACATCACT 


CGCGAAAGTT 


CTCCGCCAGA 


AGCCACCCGA 


ACTAACGGTT 


TTAATGGTTC 


2460 


CCCTGGGTTA 


GTAGTAATAT 


AAAATTCTAC 


TCCGTCTAAG 


CCATTTTCTT 


GTAAATGCTC 


2520 


AAGTTCTGTA 


AAGCGGACTT 


CAAATTCAGT 


TCGCTCCAAA 


TATAGTTCTT 


TTAATTCGGT 


2580 


TAAAATTTGT 


TGTTCGAGCT 


CTTTGGCTAG 


GCGCTTTCGT 


TCTTTTCGTA 


AAGcTAACGC 


2640 


TTGTTGATGA 


GCCGCTTGTT 


GTTTCTCTGC 


AAGCAACGCT 


TCTAAGTCAC 


CTGTACCGCC 


2700 


TTCCAAAAAA 


TCAGCCTCTG 


cTAACTCTTT 


GGTGATTTCT 


TCATAGTAAG 


ATAAAATCGT 


2760 


TTCGATTGaA 


TCGCCATATT 


TACGTTTCAT 


TTGACGGATT 


AATTCCAAAC 


GATTTTCTAC 


2820 


TTCATTCAAG 


CGGCCTTCGT 


CTAGTTCTAA 


GCCATCAATC 


AACCTAGAAA 


GATCTCCACT 


2880 


GGCTTCTTGT 


AGTAAGTAGT 


AAGCATTTTG 


AACAGTATCT 


GACAATGTTT 


TATATTCTGA 


2940 


ATCAAGGGAT 


TCAATCGAAG 


CGAGTTCATT 


CATACTTGTT 


CCGATTTtAT 


CCAAACTACT 


3000 


ATCGTCTTCA 


CCATTTAGCG 


CGGcATAACT 


AATCGTcAGT 


GgCATCAGCA 


ATCTTTTGAA 


3060 


AATtGTTCAG 


TTwATtGCGT 


tCTtCyAACA 


ATTGTTCTTC 


TTCGCCAGCG 


ACTAACTGTG 


3120 


CACTAGCAAT 


TtCATCACTT 


TGAAAaTGAa 


GCATGTCCAT 


tCTTTGGGCA 


AATTCTTTTT 


3180 


CATTTTTTtG 


tCGCTTTCTG 


ACTTTGGCTT 


CGAGTGCGCG 


ATACTCTTGA 


TACGCCTGTG 


3240 


TATATTTTTC 


TTTGACTGCT 


AAAAGTTTTT 


TCCCACCAAA 


TTCATCTAAC 


ATATCAATAT 


3300 


GGCGTTCACT 


TTGCATCAAT 


TCTTGATGTT 


CGTTTTGGCC 


ATGAATATCT 


ACTAAATATT 


3360 


CCCCAATTCT 


TTTTAAATTA 


GTAATGTTGA 


CAATCCGTCC 


GTTGACACGG 


CAAACATTTT 


3420 


TACCAGAAGC 


GGAAATATCT 


CGTTGAATCA 


CTAAAGAATC 


TTCTTCTGTT 


TCAATACCTA 


3480 


ATTCTTCTAA 


TAATTTCTTT 


AATTCTTGAC 


TTTTCGGCAT 


TGAAAAAAGT 


CCTTCTAAGG 


3540 


TGCATTTATT 


TGCTCCTTGA 


CGAATATAGT 


CACTGGAGCC 


GCGTCCGCCT 


GTGAGTAATC 


3600 


CCATTGCATC 


AATAATGATG 


GATTTCCCCG 


CACCCGTTTC 


TCCCGTTAAA 


ACGGTCATAC 


3660 


CCATTTGAAA 


CTCTAATTGT 


AACGAAGAGA 


TAATCGCAAA 


ATTTTTCACG 


GAAAGTTCTT 


3720 


GTAACATCTT 


GTTCACCTCA 


TTCTTTATAG 


ATAGCTTAAC 


AGCTCTTTAT 


GAAGTTTTTC 


3780 


TGCTGCTTCC 


TCAGTTCGTG 


CAATCATGAG 


GACACTATCA 


TCATCGTTGA 


TGACGGCAAA 


3840 
AAGTTTCTCT


TGGTAATTCT 


TTTCGATTAA 


ACTACCACAC 


GCCGCCGCAT 


TGCCAGGAAT 


3900 


GGTTCGTAAT 


ACGACATATT 


TTTCCATTTG 


TTCTGCAGCT 


ACAAAGGCAT


CTTTTAAAAG 


3960 


CTTTGCAAGT 


TTAGCACTCG 


TATTAGCTTG 


TGTTTCTAAT 


GGAAGGCTAT 


ATCGATAACC 


4020 


CCCTTCTGCA 


GATGGAACTT 


TGATCAGTTT 


CATATCTTTA 


ATGTCGCGAG 


AAATTGTTGC 


4080 


TTGTGTTaCA 


GTACGCCCTT 


TTCTTGTAAG 


TAGTTGACAA 


AATCTTCTTG 


CTTTTGAATA 


4140 


TTTTTTTCAG 


TCAATAAACG 


TGTAATTAAA 


CGGTGTCTAT 


CTTGCTTTCT 


CATGGCGCCG 


4200 


CTCTCCTTTT 


CAGTTTACTC 


TTGCCCTCAT 


CATAGCTTAT 


TTTTATGCAT 


TTTAAAAGAT 


4260 


GAATTACTAG 


ATTTATACAA 


AAAAGAAGCT 


TTCACCTATA 


GGCGTCCACT 


TCTTTCTTCT 


4320 


AACTTTTCTC 


ATCATTCCGC 


TAATTTTTGA 


TGCGCATGCG 


CGaCAACTTC 


TTCAATGGAT 


4380 


TCTGGCGCTT 


GATAGGATCC 


TTCCCCTTCA 


ACAGAAACTA 


AATGTGCCAA 


AAATTCAATG 


4440 


TTTCCTTCGC 


CACCAGTGAT 


TGGTGAAAAA 


TCTAAGTTTT 


TCACATCATA 


ACCGTTATTC 


4500 


ATGGCAAAGC 


GTGTAATTTC 


CTCCACAACC 


ATTTGATGGG 


TTTCTGGATC 


ACGCACGATT 


4560 


CCTTTTTTCC 


CAACAAACTC 


TTTGCCAGCT 


TCGAATTGTG 


GCTTAATTAA 


CGCCACCACA 


4620 


CTACCGCCTT 


TTTTCAAAAT 


ATCATGAAGA 


GGTGGCAAAA 


TTAAGCGTAA 


TGAAATAAAG 


4680 


GAAACATCAA 


TCGTTGCAAT 


ATCCGGCACC 


CCTTCAGTGA 


AATCTTCAGG 


TTTACTATAG 


4740 


CGAAAATTGG 


TACGTTCCAT 


TACCACCACG 


CGTTCATCTT 


GCCGAATTTT 


CCATGCTAGT 


4800 


TGGTTATAGC 


CAACATCTAA 


AGCGTAACTC 


AAACGAGCCC 


CATTTTGTAA 


AGCAACATCG 


4860 


GTAAAACCAC 


CTGTGGAgGA 


aCaATATCTA 


ACATTGTTTT 


GCCTTGAACG 


TTGATTGCAA 


4920 


AAACATTTAG 


TGCCTTCTCT 


AGTTTTAAGC 


CCCCACGAGA 


AACATAAGGC 


ATTACTTGGC 


4980 


CTTAnTATGT 


AAGGGT 










4996 



(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 

CACCGTGGnG AACTCTTTTT TGGGTAACCn GGATGGTTTT TTTTGGACAT ACTATTCCnC 60 

CCTTGTCTAT CTAATTGACT ACCnTCCGAT AAATCTTATG TATTTTCTTA TATGCTTCAC 120 

ACATTATACT ACTTTCTTTT TCGAAAAACA GAAACGTTGG TAAATTCTCA TAATTATTGA 180 

AGGCGAACAT ATTTTCTTGA ATTTTATCAC AAAATTAGGT AAAATACAGG TAATTAGTCA 240

TTTGAGTTTT ATTCTATTTT ATTCGTAAAG GAGGCAGAAG AATGGATCAA CATAAgCTAC 300 

CATCAAGAGC ATTTCTAAAC TGATTACGCT CTCTTCCCAA CATTTCTTTT CGGCATTTTT 360 

AGAAGATTCT CAAGATCAAA AGAATAAAGA AAAGGAACGC AGTGCGCTTT TACAAGAGCT 420

CCAAGTTGct GCCGCCCAAA AATCCTTAGT GGTCTTACAA TTGAAAGAAG TCCCTACAGC 480
GCATAAATTC


GAAACGGTGA 


TTGGCTGGAT 


TGTCTCAAAA 


AATATTTCGG 


ACAATATCGT 


540 


TGTTCGTTTA 


CAAACGGATG 


AACAACAGTT 


GCGCATGATT 


CCTGTTGCTT 


CTGTCATGAA 


600 


AGTCAGCACG 


TTAGCCAATC 


GTCACCAACG 


TCAAATTCAC 


TAGAATATTG 


AAGTCTTTAT 


660 


AAAATAAATG 


GAAACCACCA 


ACCCTGGCGA 


TTGCAGGTTG 


GTGGTTTCTT 


TTATTTATGA 


720 


CTTTAGTCAA 


AATCCCATGA 


AACTTCTGAC 


TTAGGACTCA 


ACTTATTTCT 


TAGGAACAAA 


780 


TAATTGCGGT 


GCTACTTTTG 


CCAATAAGCC 


TAGAACTAGA 


CTTACATACA 


AACAATTGAC 


840 


GACGGTACTT 


GTACCGTTTA 


GAACAAmGGA 


ATAGAGATAT 


GGGCTCATTC 


CTTCTGGTGC 


900 


ATACATGcCC 


CAAACCAATA 


CACCCGCCCA 


AAAATGCCAA 


AACCAGCGGG 


CGAAGACTGC 


960 


CACAACACTT 


CCTAAAATAA 


TCGTACGAAT 


GGCTGATTTT 


AAACGATTCG 


CTTGAAAATA 


1020 


CAGTTGAATT 


TTTCTAGCGA 


ATACGCCACC 


CATTCCTCCA 


AACGCAAAAG 


CAAAGGGATA 


1080 


TTCAAAAATA 


ATCTGTGGAA 


CGGACAAGAA 


ATTTTTCATG 


GCTGTACCGA 


GAATAATATT 


1140 


TAACATGCCC 


CAAACAAAGC 


CTGCTGCAAC 


TCCAGGAAGC 


AACCCACGAC 


GAAACGAATA 


1200 


AAGAACTAAT 


GGGACCATTC 


CCAACGATAG 


GTCCAAGCCA 


CTATTGGCAA 


ATTCAATCGG 


1260 


TAAAAAAGAG 


AGGGCCATTG 


CCATTGCTGC 


GACGACTGTT 


CCTTCTACCC 


AAATACGTAA 


1320 


ATCCATTTTT 


CTAGCCAAAC 


AAATTTCCTC 


CTATGTtCTG 


ACGCTCTCAA 


CCACTAATAC 


1380 


ATCGTTATTT 


TACCATAGGG 


ACTTTAGAAA 


AACACGTTTG 


TCGAACAAGC 


AAAAAAATAG 


1440 


CTGATCAGCA 


TTTAAGGGAA 


TCAACTCCTA 


CTCAAACGCA 


TGCCAGCTAC 


TTTTTATCCA 


1500 


TTAATCGTCC 


CGCTCAGTTG 


TCAAAATTTT 


ACCAGAATGA 


GCATCGATTT 


TTACCTCTAT 


1560 


TTTTTGTTGG 


CCCTCTTTCA 


CTTTCACTTC 


CCAATAGGTG 


ATTCCTAATT 


CTTTCTCTAA 


1620 


CTTCCAATCT 


GTTGCCTGGC 


CGACTTTCGC 


TGCTTTTTCA 


GCCAAGGTTG 


TGGCTTGTTC 


1680 


TCTGGAAATG 


ATATTTGTTA 


AATCAAGTGC 


CTCTTCTTGC 


ATTTTAACGC 


CATTTTGTTC 


1740 


ATCGGCATGC 


AAGGTTTCGA 


CTTTTTCTTT 


TGTGAATTCG 


CCTGTTTCTG 


CGTTCACTTC 


1800 


AACTTGATAT 


TCTTTTTGCT 


GATCCACGCC 


TTCTATTTCA 


TAAAAATAAC 


GTCCAAAATC 


1860 


TGTATCTAAC 


TGCAGACTTG 


TTATTTTAGC 


TTCTGGATAT 


TTTTTCTCAA 


ATGCTGTCGT 


1920 


TACTTT 












1926 


(2) INFORMATION FOR SEQ ID NO: 405: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 
GATTTCTGCT CACTCTTTTT GATGGGAAAA GTCATCaGGA CACGGTTCGT TACATCAATC 60 
TCGAAGGAGG CGAAAATTTT GAGTAGTGCT GTGGAAACTG TAACAAAGTT AGTCACACCA 120 
ATCTTAGAAG AACAAAATTT TGAACTTGTG GAAGTAGAGT TTGTTAAGGA AGGAAAAAAT 180

TGGTTTTTAC GAGTATTTAT CGATAAAGAA GGTGGCATTG ATATTGAGGA ATGTGCCTTT 240

GTCAGCGAAA AATTAAGTGA AAAACTAGAT GCAATGGATC CAGATCCAAT TCCGCAAGCG 300 

TACTTTCTGG AAGTTTCTTC TCCGGGAGCA GAACGTCCGT TAAAGAAAGA AAGCGATTAT 360 

GAACAAGCGG TAGGTAAGTA CATTCATATT TCTCTTTATC AAGCAGTTGA TGGAGAAAAA 420

CAGATTGAGG GTACCTTAGT CCATCTTGAT TCAGAACAaT TAACATTAAG TGTAAAAATT 480

AAAACCAGAG TGAAGGAAAT GACTTTTGAA CGTAAAAATA TCGCAAAAGC TCGTTTAGCC 540

ATTCAATTTT AATCAACAGA CTCGAATGTA AAAAAGGGTC CACTAAAAGT TTTACTTTTA 600 

GCTAGGCCTG TGACAAGTTT TAACGAGAAG TTTTTCCCTT GGAAGAAGAA CCACTCGTTC 660 

AGCTTTAACG AATCCATAGG AGGAAAAGAA AAACAATGAG CAAAGAAATG TTAAATGCAT 720 

TAGACGTATT AGAAGCTGAA AAAGGGATTC CTAAAGATAC GGTGATTGAA GCGTTACAAG 780 

CAGCTTTAGT TTCTGCATAT AAAAGACATT ATGGACAAGC AAGCAACGTC GAAGTTGAAT 840

TTGAACCAAA AAAAGGCAAT ATTCATGTGT ATGCTGTCAA AGAAGTAACA GAAGAAGTAT 900 

TTGATTCACA ATTGGAAGTT TCATTAAAAG ATGCCGTTGC ATTAAATGGT GCATATGAAA 960 

TCGGCGACAA AATTCGTTTT GAAGTTACGC CAAAAGACTT TGGCCGCATT GCTGCCCAAA 1020 

CAGCAAAGCA AGTAATTTTA CAACGTGTCC GTGAAGCAGA AAGAACAATT ATTTATAATG 1080 

AATTTAGCGA ATATGAAAAT GACATCATGC AAGGaATTGT TGAACGTCAA GATCGTCGTT 1140 

ATATTTATGT AAACTTAGGT AAAATCGAAG CAGTCTTATC AAAACAAGAT CAAATGCCAA 1200 

ATGAATTTTA CCAACCGCAT GrTCGGrTTA AAGTTTACGT ATCACGTGTT GAAAACaaTt 1260 

CAAAAGGcCC ACAAGTTTTT GtAAGCCGTA GTCaTCCAGa TTTaCTAAAA CGkCTTTTTG 1320 

AACAAGAAAT TCCAGAAGTT TATGATGGAA TTGTTGAAAT CGTAAGCGTC GCTCGTGGAA 1380 

GnGGGCGATC GTTCAAAAGT AGnCCGTTCG TTCAACAGAT CCnAATATTG GAnCCCGTGG 1440

GGTACCCGTG GTTGGTCCAA AGGGCCACGT GTCCAGGCG 1479 
(2) INFORMATION FOR SEQ ID NO: 406:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5454 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

CAGGTGCAGT AGGAATTGCT TTTTTTGCTG GAAATATGAn ACAAGACAAA AGGATAGCAG 60 

ATAGGCAAAA CAAaaAAAGT GAGAAGAaGT ArGGAGAATA TTTATGCCAG TTAATAAAAA 120 

TGGTTTACCT TTTTATCTCG ACAGAAAATT AGATGTAATA CGCTTTGGAG TTGATCCGGA 180 

TAATCATGTG AAAGAACAAC AACTTTTTAA ATATCCGGTT AAAACGGATA AGCCAAATAA 240

AAAAGAACGT TTCATTGAAA AAGAAAGTGA ATCTTTAATT CAATTAGGAT ATAAAATTGT 300 
AGTTGATACA


GTTACTGGTG 


TTAATTATTT 


AGAGAGCCCT 


AGAGGAGGAA 


TTACACCTTT 


360 


GCTTGATGAA 


ATTGGGAAAG 


TTTGCATCGA 


TAAAATTAGT 


AAGTAAATAA 


AACCCACCCC 


420 


TGAAAATCTT 


TTTTTGGGGG 


TTATTCTATA 


TTCAATTATT 


AATGATAATt 


ATaGTGATGA 


480 


GAGAGCAATT 


CTAAGCCATT 


TTCACAAATA 


TTGATCGGTA 


GTTATGCAAT 


GAAGAGATTT 


540 


TAACGTTCGC 


TCTTACAGGA 


TTAAATATGA 


TATGTAGATG 


AAAATAAAAA 


aaTCTCTAkT 


600 


TAAaTACGAA 


TAATTGTTCA 


tACCAGGCAt 


GTGGCCAaCC 


ATCAAAAGAC 


GAATAACTGG 


660 


TTTCGCTTTG 


CTCAACkGGC 


CTTCAACGTT 


TTGTATAGTT 


TCCCTCTCTG 


CGTTCAGTCC 


720 


AACTATGCCA 


AAAGTTTACA 


GGTGTATAAA 


AAAATACGGA 


AAACTGAATT 


TTACTAGTTG 


780 


CAAAAGAGAA 


CATTCGTTCG 


TATAaTGGGT 


TAGTAGAAAG 


ArGGGATTTk 


GGATGaATTT 


840 


GACGTTCGAT 


TATACGAArG 


AACCCAGTCG 


TGATGTGTTT 


TGTATCGACG 


TAAAAAGTTT


900 


TTATGCTAGC 


GTGGAATGCG 


TAGaACGTGG 


ACTTGATCCA 


TTGAAGACAA 


TGCTTGTGGT 


960 


CATGAGCAAT 


TCTGgwAAAT 


TCAGGTGGCC 


TGGTACTCGC 


TGCTTCCCCT 


ATGGCAAAAA 


1020 


AAGTATTAGG 


TATTTCCAAT 


GTTACAAGGA 


AAAATGAAGT 


TCCGGACCAC 


CCAAACTTAA 


1080 


TTATTGTACC 


TCCACGCATG 


AAATTATACA 


TGAAGAAAAA 


TCAAGAAATT 


AACAATTTAT 


1140 


ATAAACGCTT 


TGTTTCTGAT 


GAAGATCATT 


CTGTATTCAG 


TGTCGATGAA 


TCGTTTCTTG 


1200 


ATGTGACTGC 


TTCGCTGACC 


TATTTTAAGT 


GTGACACCGC 


CTATAAACTG 


GCCAAGATTA 


1260 


TTCAACGTGT 


GATTTATAAC 


CATATGGGAT 


TGTATGTGAC 


AATCGGAATT 


GGGGAAAATC 


1320 


CGTTGCTGGC 


CAAGTTAGCA 


TTGGATAATG 


AAGCAAAGAA 


TGCACCAGGC 


TTTGTGGCTG 


1380 


AATGGCGCTA 


TGAAGATGTG 


CCAGAAAAAG 


TTTGGCCAAT 


CTCCCCTCTT 


ACAGAATTTT 


1440 


GTGGGATAGG 


AAATCGCATG 


GCTGCTCGCT 


TAAAAAAGCT 


AGGTATTCGG 


TCCATTTATG 


1500 


ACCTAGCGCA 


CATTGAACCT 


TACATGTTAA 


AAGAGCGTTT 


TGGTATCATG 


GGGCTACAGC 


1560 


TGTACGCGCA 


CTCATGGGGC 


ATAGATCGTA 


GTTTTTTAGG 


GCAAAAAGCG 


GGACGGCCAA 


1620 


CAGAAAAATC 


ATTTGGCAAT 


AGTCAGGTTT 


TACCCAAAGA 


TTATGCAAAT 


AAAGAACAAA 


1680 


TAAAATTGGT 


TTTGAAGGAA 


TTGTCGGACC 


AGGTTGCTAG 


TCGcTTACGC 


ATGGCATCGT 


1740 


GTCAGACAAC 


CTGTGTCTCT 


TTGTTTGTTG 


GCTACTCAAA 


AGGTCAGACA 


GACAAATACG 


1800 


GCCAAACAGG 


CTGGCGTAGA 


CAAATGAAAG 


TAGAACCAAG 


TAATAATACA 


AAAGTTTTGA 


1860 


CTGAACATGT 


ACTGCGTCTT 


TTTGAAGAGA 


ATTACGCACC 


AGGGGTCGAC 


GTACGAAATT 


1920 






r*T T (Z T A TfZ fZ A 


APAAA AATTT 




TTATTTTCTG 


1980 


TTCCGGAGGA 


GCAAATTCAT 


GAAACGGATA 


TGTATTTTCT 


TATTGATAAA 


ATACGTCAGA 


2040 


AATTTGGGTT 


TAAAGCTCTG 


ATACATGCTT 


CTTCACTGAT 


GGAAGGCGCC 


ACAGCAATTT 


2100 


CACGAGCAAG 


CCTTGTTGGT 


GGCCATGCAG 


GTGGAACTGT 


TGGCTTAGGT 


ACAACCAAAT 


2160 


GAAGCAAAAT 


AGAAAAGAAT 


TTTCTTCTTA 


TTTTTCTCGC 


TCAATCAAAC 


AAAATAAGCC 


2220 


ATTATATTTA 


TTACTTATGT 


CTTCAGAAAC 


CAATCCTTTT 


CCTATACCAG 


TtATAGGAAC 


2280 
TTTTCGAGGA


TATGCGGAAG 


AAAACAAAAT 


AATAATTGGT 


GAAGATTCCT 


ATAGTATAGA 


2340 


AGAAATTGAC 


TCCGTGTGTA 


TGTATCGTCA 


TTCTAGCAGA 


ATAAAACAAG 


AAGAAAGAGT 


2400 


AAATTCAAAC 


CCTTTTcAAG 


GTTTGGTCTC 


CCCTCCTTTT 


TcAAAAAAAG 


AGCAGAAAAA 


2460 


GTTAGATGAG 


TATTTTAGCG 


ATTGGGGGTT 


TCGTAATGAA 


GGCATATGAT 


TATAATGGTA 


2520 


CCATTATTCC 


TCTAGGAGAT 


AGCGCCCGCC 


AAGTGGCTTA 


TGTAGAATGC 


AGCTGTGGCT 


2580 


GTCTAGCAAG 


CAGAGTTACG 


AATGACTCAA 


GCAAGTATAA 


ATGTTCCTGG 


TGTAAACGAG 


2640 


TTTACACGCT 


TAGAAAAGAT 


CATAAAACAG 


CTAGATAAAT 


TGTTGAAGGT 


TTTATTATTA 


2700 


AATTGGCAGA 


ATTTCAAATT 


ATGATATAAT 


TmATGCGGCA 


GCTCGCCTCG 


ATTGGAGGTG 


2760 


TGTTATTTGT 


GAAAGATTTA 


ATGTCGTTGG 


TTATCGCACC 


AATCTTTGTA 


GGATTGGTTC 


2820 


TGGAAATGAT 


TTCTCGTGTG 


TTGGACGAGG 


AAGACGATAG 


CCGAAAGTAA 


GCTGCTATCA 


2880 


ACACACACGC 


TAGAAGTCGC 


AACTAGTGTA 


AAAAAAAGCA 


ATCCTATTCG 


CCGTAGGATT 


2940 


GCTTTTTGTG 


TTATCTGTAC 


GATTTAATGT 


CGTTTCGCAC 


TTTTAGTATA 


GCATATTTTT 


3000 


ATTTTGGGTC 


AAGTTTTGTG ACTATGCAGG AATTGGTAAA gAATACAGTG 


GTAGCAATTT 


3060 


TCATCGATGC 


TATTTTATTA 


ATAAAATAGT 


AGTAGAAAAA 


TATATTTATT 


GATAAACTTA 


3120 


TAGTTATGAA 


TCTGTATAGT 


TAGTTATAAT 


AATTGGTATT 


TTTTTAGGAA 


AATTTGAGCT 


3180 


TTTGAATTGA 


ATAAGAAGGA 


GTGATTTTAT 


GGATTTAAAG 


TACAATGTTT 


TTGGTAATTC 


3240 


AATGTATTCT 


TTGAAAGAAA 


TGGAGCTAAT 


TCAACTAGCT 


TCACAATGGT 


TTTATGATGC 


3300 


AGGAAGGTTG 


CCTAGTAGTG 


CATGGGCGCA 


AGTAACCTAT 


GGGGATGTCG 


CTGTTGATAG 


3360 


TATACGGAAA 


ATTTTATCTA 


GTCAAAGtGA 


GCCGTATCAC 


TTTGGAGGAG 


TTGATAATAT 


3420 


TTTTCCTCCC 


TATTTACATG 


CTATTAGATC 


AGCAGTAGAA 


ATGTATAATG 


ATTTTTGTAG 


3480 


ACATGCACCA 


CCGTATTTCT 


TTAATAAGAC 


TTCAGATGAA 


GAAATAGAAA 


AAGTAATGAA 


3540 


TATTTCAGCA 


AATTTTGGTA 


TTGAGTTAAA 


GAAAGAATAT 


TTTACACGTT 


TCACAATGCT 


3600 


AGAGGAAATA 


TTAGTTAAAA 


TAAAAGATGA 


TATTGATTTC 


TTTGAAAATA 


ATTCAAAACT 


3660 


GATGGAATAA 


TAAAGTGCTA ATTATACGGA CAATACTTTA


TAATTAATTA 


TACAGAGCTT 


3720 


TTAAGGCTAT 


TTCATAAAAA 


AACAAGTAGT 


AAGGGGCACA 


AGCATTTCCT 


TTATTGCTTG 


3780 


TTTTTTGATA 


ATATATTACT 


GCAAACGCAT 


TAATATATTA 


AGAAAGGAAA 


CTTACCCACC 


3840


ATGCCTAGAA 


AAACCTTTGT 


ATATCCACAC


CCAATCAATA 


CTTATATCAT 


TAAGCAACTA 


3900 


GGGATTACTG 


TAGAGGAATT 


TTGCGAACTA 


CACTCCTTCT 


CTCAAGGAAC 






TGGATCACTC 


GCAACAAAAA 


AATGGAAACT 


TTACCAGTCT 


CCTTTATCTA 


CTCTCTTTCC 


4020 


TTATCTGCTA 


GCAAGACTAT 


GGACCAAGTC 


TATTCAGAGC 


TTTTAAAACT 


GCAAGACGAT 


4080 


TATTTGCTAC 


ACTTAGAACG 


CCACAGACGA 


ACTAAAAAAA 


TTATTGAATG 


AAAATACTTT 


4140 


TTTACTATCT 


TACTATTTTA 


CTACAAAAAA 


TAAAAACACA 


AGAGCAATGT 


TTGTACAGCA 


4200 


TTTCTTGTGT 


TTTTATTTTA 


TTAAACAGTA 


TCTTTTATTT 


TTGTTTCTTT 


TTGTCTCGTG 


4260 
ATTCATAAGC


TTTCATATAC 


ATTTCAAACA 


TTGTACGTTG 


TTCTGGTTCT 


AATGTGTTAT 


4320 


CTAATCTATC 


TAACATGAAC 


GATACAATGT 


CGTCTTGTGT 


TTCAAAATCT 


AAAGTATTCT 


4380 


GAAACGCATT 


TATTCTATTG 


ATGTTACTCT 


TTTGTATCCT 


AATTGTACTA 


TATACTTTAT 


4440 


TTTTAGCAGG 


TCGCCCAATT 


CGTTTTGATC 


CTGAATCACT 


TGCTGAACTT 


TCTTCTTTTG 


4500 


AAAAACTACT 


TAAATCAAAA 


GTAGTTTCAG 


GTTCTGAAAT 


TTCAGTTGCk 


TTAATTTCTT 


4560 


TTGATTGTTT 


ACGTTCAGGA 


TCCTTATGAA 


ATGTATACTT 


ACTCATTAGC 


AGTCGTCCCT 


4620 


TCTGTTCGkT 


CAATTATTTC 


ATCAGATAAT 


TTGTkATACA 


ATtCATGAAC 


TTTTCTATCA 


4680 


TGAAAATCAT 


ATAATTCCGT 


ATATCCTTCT 


TCTGATATAC 


CTTTACGATC 


ATAACGTTTT 


4740 


AACCGTTCCA 


TGTGTCTAAC 


AATTGTATTA 


AACAAAGTCT 


CATCACCAAA AGCGTCTTTG


4800 


GCATCTTTAA 


TTATTTGATT 


ATCAATTCCA 


GAATCGTTTT 


yCAACAGCAC 


TGGAAGTACC 


4860 


CCAGCGATAT 


CAAAGTCAAT 


GTTTTTATAA 


GTATCATACA


AGGTTTGCAA 


GTATTCCCCA 


4920 


AAATGCTTCT 


GCACCATCAA 


GGGAGCGTTG 


CTGCGTTTGC 


AAAACAATTA 


CAATGTAATT 


4980 


AGAAGAATAA 


AGAGCTGTAT 


CGGTAAATAC 


TGAAAGTGTA 


GGAGGAACAT 


CAAATATAAT 


5040 


GTAATCATAA 


TCATTTTCAA 


TTGGTTTAAG 


TAATTCACTA 


AAAAATGCTA 


TTCGTCTCTC 


5100 


TTTGTAGTTT 








AAATCAGGAT 


AATTAACAAA 


5160


GTCTTTGTGT 


GAAGGCAATA 


AATACAAATT 


AGGCATTACA 


TTTACAACAG 


CTTTTCCTAA 


5220


ATTTCCTTCT 


TGAATTGCTA 


CCATCATCGT 


TTCTTTTATA 


GGTAGTTCTT 


TATTATTTTG 


5280 


TAATCCATAT 


GTTCTTCGCA 


ACAATTGCGT 


TGAATTTGCT 


TGTGGGTCTA 


AGTCACAAAC 


5340 


TAAAACTCTA 


AAACCTTTTT 


TAGCTAATTC 


ATAGGCTACC 


ATTACAGAAT 


TAGTAGTCTT 


5400 


CCCAACGCCG 


CCTTTGAAAT 


TTCCAAATAC 


AATTTTTTTA 


ACCATACAAT 


GACT 


5454 



(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 
ATCCCGGGGA ATGAGCCAAT CTATCAGCAA GCTGCAGTAG CGTTACCGCT TGGATGAAAG 60 

AAAATCACTA TGATCTAGTT TATGGCGGTG GCAATGTTGG TCTAATGGGT ACGGTGGCTG 120 

ACACGCTTTT AGCAGAAGGT GGAGAGGTAA TTGGTGTCAT GCCCACTTTT TTAATGGAAC 180 

GAGAGATTGC ACATAATGGC ATTACAAAGA TGCATACAGT CAGTGATATG CATGCGCGTA 240

AGAAAAAGAT GATTGAATTG GCAGATGTGT ATTTAGCTTT ACCCGGTGGT CCCGGCACGC 300 

TAGAAGAAAT TAGCGAGGTG ATCTCTTGGG GCCGAGTCGG TGAACATATG AATCCTTGTA 360 

TTTTATACAA CGTGAATGGT TACTATGATT TACTAGCTGC TTTTTTTGAT AAAATGGTAG 420
AAACTAATTT


CTTAACAGAA 


GCGGATCGAG 


CTAAAATTTT 


CATTTCAGAT 


TCCTTAGAAG 


480 


AAATTGGTGC 


ATTTATTGAT 


TCCTATGAAC 


CGCCAATGAT 


TCGTCAATAC 


AAAAAATAAC 


540


AAAAGAGGCA 


GTCACTCTAA 


CCAGAATGAC 


TGCCTCTTTT 


TTACTAGTTT 


TTCGTTAAAA 


600 


TCCGATGTAA 


GAACTCTTTT 


GTCCGAGCTT 


CTTTTGGATG 


GACAAAAATA 


TCTTCAGCGG 


660 


TCCCTTCTTC 


GGCAATGACA 


CCTTTATCCA 


TGAAAATAAC 


ACGATCGGAT 


ACGTCCCGCG 


720 


CAAATTCCAT 


TTCATGTGTT 


ACAATCACCA 


TTGTTAGCCC 


AGTATGGGCA 


AGATTTTTCA 


780 


TCGTTTTTAG 


CACTTCGCCA 


ACCATTTCAG 


GATCCAATGC 


AGAAGTTGGT 


TCATCGAAAA 


840 


GTAAAACATC 


TGGATCCATT 


GATAATGCTC 


GAGCAATGGC 


TACCCGTTGC 


TTTTGTCCAC 


900 


CAGACAATTG 


aGCAGGTTTG 


GCATCAATAA 


AACGAGCCAT 


GCCAACTTTT 


TCTAAATTTT 


960 


CTAAGGCAAT 


TTTTTTGGCT 


TCTTCTTTGT 


TTCTTTTTAG 


GACAGTCATC 


TGACCAGAAG 


1020 


TACAGTTTTC 


TAAGACATTC 


ATGTTATTAA 


ATAAATTGAA 


TGATTGGAAA 


AGCATACCTA 


1080 


AATGGGTCCG 


ATATTTGGGC 


AAACTATAAC 


CACGTTCTAA 


AACATTTTTA 


CCATTATAGA 


1140 


TGATTTTGCC 


ATCTGTTGGT 


TTTTCTAACA 


AATTAATACA 


ACGTAGTAAC 


GTTGATTTAC 


1200 


CAGAACCAGA 


AGAACCGATA 


ATGGTGACAA 


CTTCTCCTTT 


GTTGACGTTC 


ATATTGATAT 


1260 


CTTTTAATAC 


TTCGTTTTCG 


CCAAAGCTTT 


TTCGTAAATG 


TTCAATTTCA 


ATAATTGCAC 


1320 


TCATCTGTTT 


GCCTCCTAGC 


TTTCTTTTCC 


TTCTTCAGTT 


AATTCTTCTA 


CTTTAACGTA 


1380 


AGCAGAAGGG 


CCATCCATTT 


TACGTTCCAC 


AACACGTAAG 


ATTCGTGTGA 


TCACAAAAGT 


1440 


CATTACCAAG 


TACATGATGC 


CGACAATTGT 


AAAGGTTTGG 


AAGAACTGGA 


AGTTTGCGCC 


1500 


AGAAGCAGCA 


TTCCCTTGGA 


AGAATAAATC 


TGCAACACCG 


ATAACACTTA 


ACACGGCCGT 


1560 


ATCTTTGATA 


TTAATGACAA 


ATTCATTCCC 


TGTTGCAGGT 


AAAATATTAC 


GTAATACTTG 


1620 


GGGaATAACG 


ACTTTACGCA 


TTGTTTGTCC 


ATGGGTCATC 


CCAATCGCTT 


GTGCTGCCTC 


1680 


GAATTGTCCT 


TTATCTACTG 


CGAAAATACC 


TCCGCGAaCA 


ATTTCAGACA 


TATAGGCACC 


1740 


AGTATTCACA 


GAAACGATAA 


ATAATGCGGC 


CACTGTTCGG 


TCTAAAGAGA 


TGCCAAAAGC 


1800 


TAGAGCTAAA 


CCATAGAAAA 


TAACCATTGC 


TTGAACCATC 


ATTGGTGTAC 


CACGGAAAAC 


1860 


TTCAATATAA 


ATAGAAAGAA 


TTAAATTTCC 


TAATTTTTGG 


AAGAAACGAG 


CGACTGGGTT 


1920 


TTCAGAATCT 


GGAATTGTAC 


GGAAGACACC AATyAGTAAA CCAAGGGTTG 


TTCCGACAAC 


1980 


TGTCCCAATT 


AACGCGATAA 


ATAGCGTTAA 


ACCAGCACCA 


CGTAAAAACA 


TGTCGCCATA 


2040 


TTGATTCCAA 


ATATTTTTGA 


AATCGTTAAT 


AAGTCCTGTT 


TTTTGTTCAT 


CACTATCTGT 


2100 


TGCTGCAGGT 


TGATCCTTGA 


TTGCTTGGTC 


CATAATTTTT 


GTCCGTTCAT 


CTTGAGAAAT 


2160 


ACCTGCTAAA 


ATTTTGTTTA 


CTTTTTCAAT 


GTCTGGATCG 


CCTTTACGCA 


TGCCGACTGC 


2220 


TACTTGCGAG 


TCTTCAGCGC 


TAGCATCAAA 


GCCCTTTTCT 


TTTGGAAATc 


TAACATTTTT 


2280 


AATTCTTTAT 


TGACACTAGT 


TGCTGTTATG 


CCTTCAGGGC 


GTTCACTAAC 


ATACCCGTCA 


2340 


ATCATGCCAG 


AAGCAAGTGC 


TGTTCGCATA 


GCTGAAAAGT 


TATCCATTGC 


TTGCTGTTTA 


2400 
TTGACATTTG


GAATTTGATC 


AATTAATCCG 


TAGTGGAATG 


T A III A<j 1 lu 


7\ rTTT"V B 7A TT 


2460


TTAGCATCAG


CTAAATCTTT 


CAAGCTTTTT 


GCATTAGCGT 


t\ rri fli "T> /** /"** Z^ 1 7\ 1*1/** 

ATI TGCCAIL, 


I 1 I 1 1 1 I ALA


2520


ATTACGACAA 


ATTGTGATTC 


ATAATAAGGG 


TTTGTAAAAG 


CAATTTCTTT 


m rrt m rn ^* rp trt 7\ 


2580


GCGGTTGGGC 


TCATTCCGGC 


GATAATGGCA 


TCAATTTTTC 


CAGATTGTAA 


GGCGGGCGLA


2640


AAGCCATCCC 


ATTTGGTTTG 


GACGATTACT 


AATTTACGAC 


CTAAACCATC 


GGCAATTTTT 


2700


TTAGAAATTT 


GAACATCATA 


GCCGCCTGCA 


TAAGAGTTGC 


CTTGAATAGG 


CACTGCGCCA 


2760


TGTGCGTCAT 


TTTTcTGTGA 


CCAGTTAAAA 


GGGcGTAGCC 


AGCTTCCATG 


CCGACACGAA 


2820


ATTCACCATT 


TTCCTCCGCC 


GAAGCGTTGG 


TTGTCATGCC 


AAGTAACGAA 


AATAGCGTGA 


2880


CAAGTAGTAG 


ACTGAATGAA 


AAAACTTTTT 


TGTTCATCTT 


TTTCTCTCCT 


TTAAniTTTCT 


2940


TGTCTGATTG 


ATCGTTTTGT 


CTTTTTTCAA 


TCAACAAGCT 


TTGATAATCA 


GCTGAGcAAA


3000


AGCGATTCTG 


TCTTTTTAAA 


GGAGAATACT 


TCTGTCACAA 


CCTATTTTTG 


AATACAAAAA 


3060


AACAACCAAA 


CAGATGTAAG 


GTTGCTAGGT 


AATGTCGAAA 


AAAATTCCTC 


AACGAAACAT 


3120


AGCACAACTT 


AGCATCACTA


CTAAGACAGT 


CCGCAAGCTC 


TTAACTTACG 


TCCCAACACA


3180


CAACAGAAGA 


GGCTCCGTTA 


TGCATTTCGG 


CGATTGATCC 


TAAACTTGCT 


TCCCAGTGTC 


3240


CTCTCACTCC 


ACAAGCGTAA 


TAAAAACCGC 


GACCTCTACC 


TCTTTTTTGA 


GGCACTATTC 


3300


AGTTATTAAA 


TTCAGTTATT 


AATTTTaGGA 


TAAATCCAGA 


AGAAAGTCAA 


TATGTTCCAG


3360


CAAAGGAATT 


TATCAAGAGG 


ACACAAACAC 


CATGAAtTAA 


CGTAACATTT 


TCCCTACGAA


3420


ATAGAAACTA 


ATACGAAAAA 


AGCAACCATT 


CCTAAAGGAA 


CAGTTGCTTT 


TTACTTAAGC 


3480


TTTGACAGCT 


TTTGGTTCAC 


GACCTAAAAT 


AGCTTGAAGA 


ACAGTCAATA 


CACCTAAAGA


3540


AGCTAAAATT 


GTATAAATTA 


GACTTGAGTA 


AGTTAAGATA 


CCGAAATCCC 


ti T* T* t* nrp /■*• rn 7\ 

AAGTTT I(j 1 A


3600


AACTAATAAA 


ACAATTGCTA 


AGGCGATAAT 


AGCGATAACG 


TTGAAGAACG 


C AAAAb LA, *- A 


3660


AAGATTTCCG 


TTTTCTTTAC 


GAGAAAGATA 


AAAAACAGAG 


AACCAAATGC 


rprn TV TV TV T\ ("T' 
TTAAAAL-LjLL, 


3720


CCAAGCGATG 


CAGGCAATAA 


TTGCAGCCCA 


ATAAATAATA 


AA I CjLAaALL-A 


T T C T T r P T* T r* 
1 lbl luJ. 1 1L 


3780 


ACCTCACTTT 


TACTCATGCA 


TATTTTACCA 


CGAGTTGAGA 


AAI 1 lbi LAL 


7A 7A lir'TrTTTP 
rVrlrt^ 1^1 1 1 v> 


3840 


AAGATAAATC 


AATATATAGG 


TAAGAnGTTT 


tCmAAAAGTG 


GAAAAaCTTT 


tGGTGCTAAT 


3900 


GTTAAAATAC 


GTTATAATAT 


AGAAGATTAG 


GAGCTGAATC 


GATATGCAAG 


ATATCACGCA


3960 


AGCCGAGGCA 


ATTCATTGGT 


TGGCAACCCA 


ACAAAAAATC 


ACATTTnTAA 


CAGGTGCGGG 


4020 


TATCTCGACA 


GCCTCAGGTA 


TGCCAAATAA 


TGC 






4053 


(2) INFORMATION FOR SEQ ID NO: 408: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1758 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 
TAAACATATT


CTTTACTTGT 


TAAAGGATCT 


GTTATTTTTA 


TTTAAAAATT 


GTTAGCTAGT 


60 


TAGAGCAAAT 


GGTTCGTAAT 


CATTCGGTTG 


ATTGTTCGAA 


TTTATCATAG 


TAGTTAAAAC 


120 


GAAATTATCT 


TGTGTATATA 


GAAAACACCT 


ACTGAAGACG 


CACAATACTT 


CCCCAAGTTG 


180 


TGTATGTTTA 


GTAGGTGTTT 


ATTTATAATT 


GTATATTTGA 


ATTATAAATA 


AAATACAATT 


240 


GTTTTGTAAA 


ATGAAAACTt 


GAAATTTAAA 


AAaGAAAGCG 


GTAAAATATC 


TTaCCAAACA 


300 


ATTTTATTTT 


TCATTTTATT 


ACTACCTCTT 


GTCGCCTTAC 


CCCAATGAGG 


CGGcTTTTTT 


360 


TACATAAAAA 


TATTGAACTC 


AAAAAATAGA 


TATTGTAAAA 


TATCTATACA 


TATTTAAACT 


420 


CTTATTTTTT 


CACGCAGACC 


GCCTTTTCTC 


AATAAGGTGG 


TCGTTTTTTT 


GTTGAAAATT 


480 


GAAAATCAGT 


AAAGTAAAAT 


TGTTATATAT 


TAAGCTATAA 


CTAGTAAGAA 


CTATTTTATC 


540 


CCCAATAAGC 


TAGTTATTAC 


TTGGACCATT 


AGCTCAGCTG 


GTTAGAGCAA 


ACGGCTCATG 


600 


ACTATTCGGT 


CGATGGTTCG 


AATTCATCAT 


GGTCGATAAT 


AGAATTAGAC 


AGGTAGTGAA 


660 


TTATTTTGAA 


AAGAACAGGA 


ATTCTCTTAG 


TGAATTTAGG 


AACACCAAAA 


GATTCTTCCA 


720 


AGACGGAAGT 


AAGGAAGTAT 


TTAAAAACTT 


TTTTATCAGA 


TAGAAGAGTA 


ATAAAAATAC 


780 


ATCCTATAAT 


TTGGAAACCG 


ATTTTGAACG 


GTATTATCTT 


GAATATACGT 


CCAAAAAAAT 


840 


CAGCAAAACT 


CTATCAAAAA 


ATTTGTACTG 


AAAATGGATT 


TCCTCTTTTG 


GAATATACTG 


900 


AGAAACAAAT 


GGAAAATTTG 


AAAAATATAT 


GTCCAGAAGT 


GGAAGTTACA 


ATCGGAATGT 


960


CATATAGTGA 


ACCGAGTATA 


GAAACTGCTC 


TAGATACATT


ATTATCmAAA 


GAGATTGAAG 


1020 


AACTCAATGT 


CATACCAATG 


tATCCGCAAT 


ACTCTGGGAC 


AACGGTAGGA 


TCGGTGTTTG 


1080 


ATTCCGTTAT 


GAATTATTTT 


ATAAAAAGTG 


ATAGGATAGT 


GGATATTAAA 


TTTATTCGAT 


1140 


CATTTTACAA 


TAATCCACAG 


TATATAGATT


ACTTTTCAAA 


AAAAATAAAT 


GAAGCTTTGA 


1200 


ATGAAAGTCC 


AATAGATGCT 


ATCGTTTTTT 


CATACCACGG 


AATTCCTATG 


TCTTATGTAA 


1260 


AAGATGGGGA 


TAACTACCCG 


AAGGAATGTA 


CTAAAACAAC 


AAAATTAATA 


ATGGATAAAT 


1320 


TAGGAGATAT


TCGTTATTAT 


CAAACCTATC 


AATCGAAATT 


TGGGCCATCT 


GAATGGTTAA 


1380 


AACCAGCAAC 


TGATGATACC 


TTGAAAAAAT 


TACCATCAAA 


AGGTATTAAA 


AATATATTAA 


1440 


TTGTTGCmCC 


AGGTTTTGTT 


GTAGACTGTC 


TAGAAACAAT 


AGAAGAATTG 


GAACACGAAA 


1500 


ACAGAAACTA 


TTTTTTAGAA 


AATGGCGGGG 


AAGTCTACAA 


ATATGTTCAT 


CCATTTAATG 


1560 


GAGATATTGA AtTTGCmAAA TTAgTAAAGG 


aCATTAtTTC 


TTTGTAGTAA 


TTTGTTTTAG 


1620 


TkTTTTATTT 


TAGTTTGAAT 


TTGTTAAACA 


ATTTATTCAG


GGTTTGCACG 


AATATATnTA 


1680 


CTGGnATACA 


TATAGGTAGA 


ATTAAGCCAA


AATTAAACGG


CCTATCCCCC 


GGTCCAGTTT 


1740 


AnGGAATAAG 


CCGCCTAA 










1758 



(2) INFORMATION FOR SEQ ID NO: 409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1091 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 

ATGCTGGCGT TTCAGCTACT AAATTTCTAG CAAAGAAATT TAAAAAAGAT ACGGACGTAA 60

CAATCACATT GATTGATCGC CATTCTTACC ACACTATGAT GACCGAATTG CATGAAGTAG 120 

CAGGCGGACG TGTTGAACCA GAAGCAATTC AATACGACTT ACAACGTTTG TTCTCACGCA 180 

AGAAAAACGT AAAATTGGTA ACAGATACTG TTACTGGAAT CGACAAAGAA AACAAAGTTG 240

TGAAAACTTT AGCAGGTTCT TATCCATTTG ACCAGTTAAT TCTGGGTATG GGTGGCGAAC 300 

CAAACGACTT TGGAACACCA GGCGTGAAAG AAAATGGTTT CACCTTATGG TCATTTGATG 360 

ATGCAGTGAA AATTCGTCAT CACATTGAAG CAACCGTTGC TAAAGCAGCC ATCGAACCAG 420

ATGCTGAAGT ACGCAAAGCA ATGTTAACAT TCGTTGTCTG TGGTTCTGGT TTTACTGGGa 480

TCGAAATGGT CGGCGAATTA ATCGACTGGA AAGATCGTTT AGCGAAAGAT GCCAAAATTG 540 

ATCCAGACGA AATTACATTA ATGGTTGTTG AAGCAATGCC AACCATCTTA AACaTGTTAT 600 

CTCGTAACGA TGCTGCCAAA GCAGAACGTT ATCTAGAAAA GAAAAATGTT CAATTATTAk 660 

kGAACTCACC AATCGTTGAA GTTGCAGCAG ATCACATTAA ATTAAAAGAT GGTTCTGAAG 720 

TACCAACGCA TACGTTAATC TGGACAGCTG GTGTGAAAGC AACTTCTGAT GCCGCTGACT 780 

TTGGTTTAGA AGCAGCACGT GGTAGTCGTT TAGTTGCGAA TGAATACATG CAAGCAAAAG 840 

GTTACGAAGA CAAAAACATT TATATCATTG GTGACTTGGT TTATTACGAG GAAACACCAA 900 

ACACACCAAC TCCTCAAATC GTTCAAGCAG CTGGAACAAA CAGGTCATAC TGCGGCTGCC 960 

AACATCGTTG CAGATATTAA AGGCGGCGAA AAAACACGCC TTTTAAGGGG AACTACCCAG 1020 

GGATTCCTGG GTTTTCCATC GGCGGCGAAA TGGGGGCGTT GGCCAACTTA TTCnGATnAA 1080 

AATTCCATTT G 1091
(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1229 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 

TCATCTATTA ACTCTCCATA GGTTTCATTT AAGGATTTTG GGGAAACTTT ATAATTTTGA 60 

ACTTTTCTTT CTATTACTAA TTTCATGACC CAGTCAACTG CTTCTAACAC TTGACTGTTC 120 

TCTAATTCCA TTTCTATCAT tAATkCCTCA TCGTTTATGT TAGTCTGAAT TGCACTCGGT 180

ACTATTmCAC TCTTCAAATC ACTTAGATTA AGAAAGTTAA CAGTCATATA ATTTCGATCG 240

TTTATTTTTA TAGACTGTTT TTTTAAGTTC TTATCCTCAA GAACCTTTCC ATGATTCATT 300

AAAATATATC TATCAGCATA TTTTTCAACT TCTAAAATAT TGTGTGTGAC CACGATGATT 360 
ATATGCCCCT


TATTAGATAA 


TTTTTTCAGA 


TATCTCCATA 


TTTTTTGCCG 


TCTAATAGGA 


420 


TCCACATCAT 


TAGTAGGTTC 


GTCAAGGACT 


ATTATTGGAG 


AAGGATATAC 


AACGGCCATT 


480 


GCAAATGAAG 


TGAGCCTTCT 


AAGTCCGCCT


GAAAGTTTCT 


GTCCTGGTAT 


ATCTTTCCAT 


540 


CCATCTATTT 


TTAATTCTTT 


TAAAATTTTA 


ATACACTCTT 


CTTCAGCTAT 


TTTCTGACTG 


600 


CCTCCTCTTA 


TTCTTATTAT 


TGACTCGACA 


GATTGAGCCA 


TCGTTATACC 


TGCCAAGGGA 


660 


GCATGTAGTT 


GAGGCATCAT 


AGAGACATAT 


TCTCTTGcAA 


TTTTACTATC 


TCTCGCAAAT 


720 


GATAGACCAT 


TAAATGTTAT 


AGAGCCCTTT 


GTTGATTTCG 


TTATACCTAT 


AATCTGATTC 


780 


AACATTGTTG 


TTTTACCAGC 


GCCGTTATGT 


CCAGCTATGG 


CAGTTATTTC 


ACCTTTATTA 


840 


AAGCGCAAAT 


TTATATTATC 


ATTTGCATAG 


CTCTCTGATT 


TCTTGTACTT 


TTTACTTAGC 


900 


GAATGTAATT 


CTAACTGTTG 


ATTCAAAGCA 


AACCTCCTAT 


ATTTAGACTT 


AATTAAATTG 


960 


ATTGATAACA 


GGTTGGAACC 


ATAACCAAAT 


TATCATTACT 


AGCACACTTA 


TGCTTATTAC 


1020 


TTTAATAATG 


ATTAATTTCA 


TAATTGCGAC 


CTCTTCAAaG 


TAGAAaGGAT 


TATCAACCAC 


1080 


AAAAGAAATC 


AAAaTGTTAA 


TATCTGTTTT 


TTGTATAATG 


ATAAATTGCC 


ACAATATCCT 


1140 


TGTTATAAGT 


ATTGATGCCG 


CAATAGAATT 


AACTCCAACA 


CTTTTATTAT 


TCATAATCAA 


1200 


AAAAACTAAC 


AAGAATATTG 


CATCCGACG 








1229 



(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 



CATCAGTATT 


AGTTGGTGTA 


GGnCTTGTTT 


TTGGGGCAAC 


TGGAATAGTT 


AATGCACAAA 


60 


TGGGTGAAGG 


AAGATTAGTA 


AATTATTCTG 


CTTCAGGAAA 


TACGTTTCAA 


GAAAATCCGG 


120 


GATATACGAA 


GAATTATAAT 


TTCTCGGATT 


TACAATTCAA 


CCCTAAAGCA 


ATAACTGGTG 


180 


ATGTGTTACA 


AGGAAATACA 


ATTGATTTTG 


AGGTTTATGG 


GAAACATAAT 


ATTGCAGCTT 


240 


CAACTGCAAA 


CTGGGAAATT 


CGTCTTCAAT 


TAGATGAACG 


ATTGGCCCAG 


TATGTTGAAA 


300 


AAATTCAAGT 


TGATCCGAAG 


AAGGGCGTAG 


GAAATAGTAG 


ACGAACTTTT 


GTAAGAATTA 


360 


ATGATTCGCT 


TGGCAGACCT 


ACAAACATTT 


GGAAGGTTAA 


TTACATTCGA 


GCAAATGATG 


420 


GACTATTTGC 


TGGGGCAGAA 


ACAACTGATA 


CACAAACTGC 


TCCTAACGGT 


GTGATTACAT 


480 


TTGAmAAAAA 


TTTAGATGaA 


ATTTTTAAAG 


AAATTGGtGc 


AGATAATCTT 


aAAAGCGACC 


540 


GTTTaATGTA 


TCGTATCTAT


TTgGtAAGTC 


ATCAAGATGA 


CGATAAAATT 


GTACCTGGAA 


600 


TAGAAAGCAC 


TGGTTACTTT 


TTGACAGATC 


AAGATGATTT 


CTATAATAAG 


TTAGATGTAT 


660 


CCGAAAATAA 


TTCTGATCAA 


TTTAAGCATG 


GATCGGTAAA 


TACTAAATAT 


GAAGAAGCTA 


720 
ATATCCAAAC


AAAAGACGGT 


TCTGGGTCTA 


CTGGTGCTAA IGtjAbL IA1 A All I iAbftlt


780


ATAAGTTAAC 


GAAAGAAAAG 


AATTTTTCTT 


ATTCAACTTC TGCAAAbGGA ACGCCi Ibbl 


840


ATGCTAACTA 


CAAAATCGAT 


GAACGTTTAG 


TACCATATGT AlClobAAlA LAMlbLAlft 


900


TGGTTCAGGC 


TGATAAAGTG 


GCATATAATG 


TTGCTTTTGA ATCAGGAAAA AAAGTbGLib


960


ATTTAGCAAT 


TGAGCGACGT 


GAAGGTCATG 


AGAATTATGG TATGGblTCA ATCACTGALA


1020


ATGATTTAAC 


TAAACTTATT 


GATTTTGCTA 


ATGCTAGTCC ACGACCTATC GTTGTTAGAT 


1080


ATGTTTTACA 


ACTGACTAAA


CCTTTAGATG 


AAATCTTAGA AGAAATGAAA GCAGCGGATA 


1140 


AAATCGAAGA 


AAATGCGCCA 


TTTGGTGAAG 


ATTTCATCTT TGATTCTTGG TTGTCGGATA 


1200


CGAATAAAAA 


ATTAATCCAG 


AACACTTATG 


GAACAGGTTA TTATTATTTG CAAGATATTG 


1260


ATGGTGATGG 


AAACCCTGAC 


GATAAAGAAG 


AGAGCGGAGA CACGAATCCA TATATCGGGA 


1320


AACCTGAATT 


AGAAGAAGTA 


TATGATGTTG 


ACACAACAGT TAAGGGGAAA GTATTCATCC 


1380


ACGAGTTAGC 


GGGAACAGGT 


CACAAAGCGC 


AACTTGTTGA TAAAGAAGGT ACTGTATTAG 


1440


CAGAAAAAAC 


TATCGCTCCA 


AATGAAAAAG 


ATGGGGCTCC AATTTCAGAT ACTGTAGAAT 


1500


TTGAATTTAC 


GGGTGTAGAT 


TCAAGTAAAC 


TAATCGCGAA AGATGAATTA AAAATCCAAA 


1560


TCGTTTCTCC 


AGGTTTTGAT 


AAACCAGAAG 


AAGGTTCAAC CGTTATTAAG GAATCACCAA 


1620


AAGCGGTTGA 


TAAACAAACC 


GTGGTAGTTG 


GATTTAAACC AGATGCTAAA GAATCAATTC 


1680


GGAATAATAA 


AAACTTACCT 


GAAGATGCAG 


AGTATTCATG GAAAACAGAG CCTGATACTT 


1740


CTAACGTTAC 


TGATAGTACG 


AAAGGTATTG 


TAACTGTTAA GATAGGAAAT CGAACTTTCG 


1800


ACGTGGATGT 


AGAGTTTGCT 


GTAAAAGCTT 


CTCAAGCTAT GGAAAATGAT GCAACATACG 


1860


TACCTATAAC 


AACAACCCCA 


GAAACGACAA 


TTCAAAGTGG TAAACCTACA TTTGATAAAC 


1920


CAGATGTTCC 


TCTAGCTAAC 


GATGCCTTTT 


CAGTTTTAGA TGTTTATAAT AAGGACTTCG 


1980


GCAATGCAAG 


TGTTGACGCA 


AATACTGGTA 


TTGTTACATT CACTCCAGCT AAAGGTGTAG 


2040


GAGAATCGGA 


GCCGATTACT 


GGAACAATTC 


CTATTAAAAT TGTTTACCAA GATGGTTCTG 


2100


TAGGCACGAC 


CGATTTAGCA 


GTAACTGTAA 


GTAAAAATAT TTATGAAAAT CCAGGAGAAA 


2160


ACATTCCTGC 


AGGCTACCAC 


AAaGTAACCT 


TCACCGCAGg AGAAGGAACA AGTATlbAAA


2220


GTGGAACAAC 


AGTCTTTGCA 


GTGAAAGACG 


GCGTAAGCTT ACCAGAAGAT AAACTlv-LGb 


2280


TGTTGAAAGC 


AAAAGATGGT 


TATACAGATG


CGAAATGGCC AGAAGAAGCA ALbCAAUUAA 


2340


TTAAAGCAGA 


TGATACAGAA 


TTTGTATCAA 


GTGCAACAAA ATTGGATGAT ATCATTGAAA 


2400


ACCCAGGGGA 


TAACATTCCT 


GCAGGCTACC 


ACAAAGTAAC CTTCACCGCA GGAGAAGGAA 


2460 


CAAGTATTGA 


AAGTGGAACA 


ACAGTCTTTG 


CAGTGAAAGA CGGCGTAAgC TTACCAGAAG 


2520 


ATAAACTTCC 


GGTGTTGAAA 


GCAAAAGATG 


GTTATACAGA TGCGAAATGG CCAGGAGAAG 


2580 


CAACGCAACC 


AATTAAAGCA 


GATGATACAG 


AATTTGTATC AAGTGCAACA AAATTGGATG 


2640 


ATAAATCTGA 


TGCTGACAAA 


TATACTCCTG 


AAGGTCAAAA AGTGACTACA GAATTGAATA 


2700 
AAGAACCTGA CGCATCTGAG GGAATTAAAA ATAAGAAAGA TTTACCAAAA
ATACTTGGAA AGAAAAAGTA GATATTAGTA CAGCTGGAAA TAAAAAAGGT
TGACATATTC AGATGGATCA TCTGATGAAG TTGAAGTAGA TGTCACAGTA 
GCTCTGACGC TGATAAATAT GAGCCAACAG TAGAAGGTGA AAAgTAGAAA 
AGTrGATTTA ACAGATAaCG TTACTAACTT ACCAACGTTA CCACAAGGAA 
AGATGTTACT CCTGGTGGTA CAATCGATAC TAATACACCA GGTAATTACG 
TGAAGTAACG TATCCAGATG GTACAAAAGA TACAGTAAAA GTTCCAGTAG 
CAATCGCTCT GACGCTGATA AATATGAGCC AACAGTAGAA GGTGAAAAAG 
TGGTAAAGTA GATTTAACAG ATAACGTTAC TAACTTACCA ACGTTACCAC 
AGTAACAGAT GTTACTCCTG GTGGTACAAT CGATACTAAT ACACCAGGTA 
TGTCATTGAA GTAACGTATC CAGATGGTAC AAAAGATACA GTAAAAGTTC 
AACAGACAAT CGCTCTGACG CTGATAAATA TGAGCCAACA GTAGAAGGTG 
AATyGGTGGT AAAGTAGATT TAACAGATAA CGTTACTAAC TTACCAACGT 
AACAACAGTa ACAGATGTTA CTCCTGGTGG TACAATCGAT ACTAATACAC 
CGAgGTGTCA TTGAAGTAAC GTATCCAGAT GGTACAArAG ATACmGTAAA 
rAGTAACAGA CAATCGCTCT GACGCTGATw AATATacaCs TArAGGTGAA 
GGTGGTAAGT GnTTTACnGA TACGTTACTA CTTACCAACG TTACCACAGG 
C 

(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
GATGCTAAGT

ACGGTTGTAG 

ACAGACAATC 

TCGGTGGTAA 

CAACAGTAAC 

AAGGTGTCAT 

AAGTAACAGA 

TAGAAATCGG 

AAGGAACAAC 

ATTACGAAGG 

CAGTAGAAGT 

AAAAAGTAGA 

TACCACAAGG 

CAGGTAATTA 

AGTTCCgTAG 

AAGTAGAATC 

nACCACGTAA 



2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3721 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 
nCAAAnGAGG GCTAATTAAT GGAnAAAAAT GGATTGAaAA AAACmAATAT TTTGTTGGAG 60

CAATTGGAGT TATTCTCaTT GCTCTTTTTT CATTCCtTCT TTTTGAAAAC CAACAAAAAG 120

AAGAAGTGAA AGAAGCTATA AATACAGTGG TATCaAGTGG CTTAGAGGAA GCGGAGAAGC 180

CAAAAAAACA TGTAGAGACA AAATCTAGTG AAACTGTTCA AAGTAATCAA GTAGATGTTA 240

TTACTGGTAC GGAAGCACAA GAAGAaACmC CTATTTCAAC AGTGATTTCT TTAACCGAAG 300

GCGAACGTCC GTCTGTTAAA ACGATTCAAA ATGCCCAAGA AAATTTTAAT GAAGTGTCGG 360

ATTCTGTTTT AGGGTCTATT GAAATTGATT CTATTCAATT AAAATTACCT ATTTTAATGG 420

GTGATAGTTT TGACAATATG CTTTATGGAG CGTGTACTGT TTTATCAAAA CAAACGATGG 480

GAAAAGGAAA TTATGCTTTA GCTTCTCACA ATGCTGGGTA TGAAGGATTA CTTTTTACCT 540

CGTTAAATAA AGTTTCTGTA GGAGATTTGG TCAAACTTAA TGATCGTGAA GGACATTCGT 600

TCATATATAA


GGTAAAAGAG 


CAAAAGCATG 


TTGATATGAC 


TGATACAACT 


ATGTTGAATC 


660 


TAACAAGAAA 


GCCAACATTA 


ACTTTAATAA 


CTTGTGACCA 


AGCCACAAAA 


ACAACAGGAA 


720 


GAATTATTGT 


GATTGCGGAA 


TTAGTTTAGA 


GGAGGGAAAA 


AGTTGGTTGA 


AAAAATAAGG 


780 


GAATTGTCAA 


AAGAAGAATT 


TGTTAATCAA 


TACATGTTTG 


ATTTTAAGGT 


ATTGATTTTT 


840 


AAATCTAAAA 


ATCCTGAAAA 


AATAACTGCC 


AAGCAAGAAA 


GACTTTTAGA 


ATGTGAAAAG 


900 


AAGATTGCTG 


TGTTGTTTTA 


TGAAACGTAT 


AAACGAAATA 


AAGGTTATCT 


GCCTGATGAA 


960 


AAAGAACTGG 


GGAGGATTGT 


TCAAAGAAAT 


TTTCTTGACC 


GTTTAAAGTT 


ATTTCGAGTG 


1020 


GAATATGATG 


TGATtAGTGA 


AGAAAAATTT 


tGTGGTCTAC 


ACGTACAAAT 


GGTCAAACAG 


1080 


CAAATGCCAT 


TAGAAAAGTA 


TCGAACAGAT 


GATTtATCTT 


ATATTTTAGG 


TCGAGAAAAG 


1140 


AAGATAGCTA 


TTAATTATTT 


TATCGCACAT 


GATGATTTTC 


CTATGGGTTA 


TGAAGAACTA 


1200 


ATGATAAGTC 


GTTCTAAACA 


AGCAGTTACC 


CAAGGcTTGG 


AAGAGTTAAG 


GGGAGAATTt 


1260 


ATGGAAAGAT 


ACCATAAATA 


TTACAGAAAA 


ATGGAAAGGA 


GTTGTATTTT 


ATGAATCAAC 


1320 


ATATTATGCC 


AATTTTTAGA 


GAGCATAGTA 


AATTTTATCA 


AAGGTATTCA 


AATGTACCAG 


1380 


TTCCTGAAAT 


GCTTCGCTTG 


CATGGATTCA 


TAGTTGATGA 


AATGCCAATT 


GATGAAGCAG 


1440 


ATATTGTTTA 


TATGGAAGAA 


TATAAAAATT 


CTTTACTTAA 


AATGGGGATT 


ATTATTGATG 


1500 


AAGTGCCTTT 


TAAGAAGGAA 


TTTGTTGCGC 


A 






1531 



(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 

ATTGTAGGAG ATAATGGTGG CTTGGCACGG GTCTCTGAAT CGTTTATTTT AGGTTCGGTC 60 

TCATTAATTT TTTGGACATT GACTATCTTA ACCACGATTA AATATGTAGT TATTGCTTTA 120 

AATGCTGATA ACCATGGAGA AGGTkGGATT TTCTCTCTTT ATACATTGGT TCGTAAAAAA 180 

AGTAAGTATT TGATTATTCC cGCAATGATT GGCGGTGCCG CTTnacTGgc TGATGGCGkk 240 

TtGACCCCAG CTGTGACAGT GACGACAGCG ATCGAAGGCT TACGGGGGAT TCCAGCATTT 300 

TTTGAACGTT TTGGCAATGA CCAAACGATT ATAGTGGTTA TTACGTTGAC GATTATTTTA 360 

ATTCTTTTTT CTGTTCAACG ATTTGGAACA GAATTGGTCG GGAAAGCTTT TGGGCCGATT 420 

ATGTTCTTGT GGTTTACTTT TTTAGGCATC ATTGGTTTAA TGaATTTTAG CCAAGATTGG 480

ACGGTTATTC GTGCaTTAAA tCCATATTAT GcTTTGcAAT TACTAGTCAG TCCAGAAAAT 540

AAATTAGGTC TATTTATTTT AGGGAATATT TTCTTGGCTA CAACTGGTGC TGAAGCTTTA 600 

TATTCTGATT TAGGGCATGT AGGAAAAATG AATATCCGCA TTAGCTGGCC GTATATTAAG 660 
ATTTGTTTGA


TTCTTAATTA 


TTTAGGGCAA 


GCTGCGTGGT 


TGTTAACGGT 


CAAAGAAAAC 


720 


CCTGAAATGC 


AGGCATTAGC 


TGAAATXAAC 


CCATTTTTCC 


AAATGaTTCC 


ACGAGGTATT 


780 


CTCGTATTTG 


GCGTTGTTTT 


TGCaACAATC 


GCCGCAGTGa 


TTGCTTCGCA 


AGCGTTAATA 


840 


TCAGGTTCGT 


ATACTTTAGT 


TTCTGAAGCG 


ATTAAATTGA 


AGTTATTACC 


AAGACTTAAA 


900 


ATTATTTATC 


CTGGGAGCAA 


TATCGGTCAA 


ATGTATATTC 


CAGCGGTAAA 


TTTAATTTTG 


960 


TGGTTAGCTT 


GTTCAGCAAT 


CGTGCTGGCT 


TTTAGAACAT 


CGACACACAT 


GGAAGCTGCG 


1020 


TATGGTCTAT 


CGATTACGAT 


TACGATGTTA 


ATGACAACCA 


TTTTATTGTT 


GTTTtATCTT 


1080 


TtGGATAAAA 


TTCCAGCTTG 


GTCTGCGTAT 


CTAATTTCCT 


TGTTCTTTGC 


TGCCATTGAA 


1140 


GTTGTCTTTT 


TCTTTTCAAG 


TGCTGCGAnT 


TTTTCCATGG 


TGGATATGTT 


GCTGTGGGAA 


1200 


TGGCAGTCTT 


CTTGTTATGC 


ATCATGATTA


TTTGGGAACG 


CGGCAACGAG 


ATTAAGGAAG 


1260 


CGACGGCGGa 


ACAAGTGTCG 


TTGAAAAAAT 


ATGTACCACA 


ACTTAAAGCT 


TTGAAAGAAG 


1320 


ATACATCTGT 


ACCTATGTAT 


CAAACCAACG 


TGGTCTTTTT 


AACCTCCGAT 


CGAGTGGATG 


1380 


GCGAAATTAA 


TCGTAATATT 


ATTTATTCGA 


TTTTGGACAA 


ACAGCCAAAA 


CGAGCGAACG 


1440 


TTTATTGGTT 


TGTTAATGTT 


CAGGTAACAG 


ATGAaCCCTT 


TACTCAAGAG 


TATTCTGTTG 


1500 


ACATGCTAGG 


AACTGATTTT 


ATTGTGCAAG 


TTCAGCTATA 


TTTAGGCTTC 


CATATTTCTC 


1560 


AAGAAGTGAA 


TGTATATTTA 


AGACAAATTG 


TTCATGACTT 


GATGAAAACA 


GGCCGCTTGC 


1620 


CAAAACAGCC 


ACAACGTTAT 


TCATTAACGC 


CAGGAAGAGA 


AGTTGGCGAT 


TTCCAATTTG 


1680 


TATTAATTCA 


AGAAGAATTA 


TCTAACGTTT 


CTGAACTGAA 


AAAATGGGAT 


CGACAAATTA 


1740 


TGCAGGCAAA 


ATTAGCTATC 


AAAAACTTAA 


CTACCTCTCC 


TGAAAGTTGG 


TTTGGCTTGG 


1800 


AATACAGTGA 


AGTCAAATAT 


GAATCCGTTC 


CTTTAATCAT 


TGGCCCACAA 


CGGAAAACGC 


1860 


ACTTAGTCGA 


ACGAAAGAAT 


CGTTCATAAA 


TAGAAGAAAT 


TTAACTAAAA 


GATTGAGCCT 


1920 


AAGTCCTTGA 


GATACATGGC 


TCAATCTTTT 


TTGTAGAATC 


AGAGTAAATA 


CTGTAATTCC 


1980 


ACAAAGAATT 


CATTTAAAAG 


CGTTTGGTCT 


TTAAAGGAAT 


TCATCCGCTT 


AAAGATGGCC 


2040 


AGTAAGACTT 


CTTTTTGAGG 


TTCAGTTAAT 


TGATCTACTT 


TTAACAAAGG 


AAGAAAAGCG 


2100 


ATAATCAGAG 


AATAATCCAA 


TTCTGGATTT 


TGAGAATGTG


TAAATAATTC 


ATGGAGTTCT 


2160 


GACAAAATAA 


ACGGATGATA 


ATAAGATTGG 


TCAGACTGTT 


CTTCTGCAGT 


TGTTGGTTCA 


2220 


TAAATAGAAA 


TACTCTCTTG 


GGAATGAGAG 


GGGGCTTCTA 


CTGTTAAAGG 


AAAAGCATAA 


2280 


AAGGACTGCT 


GTAACTTTTG 


CGCTTGAATC 


ATAATTTTTA 


AGGAATCGTG 


ATAGGATAGC 


2340 


TGTTCTTCAG 


ATAAACGTCC 


CATCAACTGA 


GTAAAAAAAG 


GTGGTTTTAG 


CAGTTGTTTA 


2400 


TTTTGAAAAT 


TTtCTTTTAA 


TAGCTGGCGC 


ACAAATCGAT 


CGAAATCACC 


ATAAAGTAGC 


2460 


CAATCTACTG 


TTGTGTCACA


GGCTTTAGCG 


ATTTTTTTTA 


AGTATCGTTT 


ATtGGGTAAA 


2520 


TTCAAACCTA 


GTTCCCmGTT 


ACTAATkGkA CyTCCACTrG 


ATAAmCCAaT 


TTTATkGGCT 


2580 


AAATAACGTA 


AAGTGTACTT 


CTTCTTTTTT 


CGAGCTTCAC 


GCATGCGTGC 


CCCGACTTGC 


2640 
TTTGTAGTAA TTTTCATTGA AACACTCCTT TTGTAATAAA TATGAGGATA AGGTGTTCTT 2700
AAAATATACA TACGCAAAAT TCGGCTCTTT TTnTTCTAAA CnAAAAAAAG TnATTTATTC 2760

(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414:
TTCGGCTACC CCTAATTTTA TCGGCtTAGA tAGAaATAGT GAATTAAGAG GTTAGCAGAC 60 

AACAAAATAA GAACCGACTG kCaTAGGACG GGAATCCTAG AGGACAGTTA AACAATTCtG 120 

ATACAGCATG AACTATACTt CGGTTCTCGT TTGTTGCAAC ATTAGTTACA ACGTATAGTA 180 

TAACAATTTT TAATGTAAAA TTCTAGGCTT TTTTAAACTC CTTTATTGGC CTAGGTAAAG 240

TTTTTTACAA TGAATTGTTT TAATTAGTTG TATAAATGTT GGGACAGCGG GGAATGTATA 300 

CAGTTCATGT ATATGTTCCC CGCTTTTTTG TTGTCTGTTA ACTTGTTGAA AAAATAAAAA 360 

AACCAAGTAG CAAGCATTCT CATTTTCTAA AAATGGCCCG CTTGTGGAAT AATTCAATGC 420 

TTTCACGCAT TGTTTGTACT GGAACTACTC GTAAGAGCCG TGCAAACAAC GTGTTGCATG 480

GCTCTTTTTC TACTTGGTTC AGAAAGTAGG TGAATCAATG ATTGAACTGA AAGCAACAGA 540
TTTGCACCGT ATTACGTGTC AACTGACCTT GTTTAATCGA CAAAGGTTTA AATTGTATAA 600 
CGGCACAACA GAACGTATCG TATATGATTT TTCAGGAAGA AATCTGTTGA TCAATCCTGT 660 
GAGTTTTGAA ACGGAGCAAG ACATGGAACG TTTCTTTCGA CAAGTAAAAC TGATTTATTT 720

TGATGGAAAA GTGGTCGGTT ATCGTGGCTG CTCGGAACTT CCGTTAAAAC TTGAATGTAG 780 
AGCGTtCAAA AAAATGGnTA AcGACaAAAG ATGTTACTTA ATGCACCGTT CAACTATAAG 840 
GTTTTTGAAG AAAATGAATT TAGGGAATGG TGCAAGAAAA TGAGAAGCTA TAAGTAGTGA 900 
ATTTGGTTTT TTATACTTGG TCTTTTTGTT TGTTAGTTTT AACTAACATG TGCAATTTAC 960 

CAATAGCTAG GTTAGCTATC GAAAAAGGAT GCAGAAAAGC ACACGGACAC GAAGGTGCAA 1020 

ACTGGCTACA CGTCAACTTT TTTGCAGATT TGGGAGATCC TAAAAAATGA ATACCACAGG 1080 

nCTTATCCAG TCCATCAGCG TGCCTTAAAA GTCTAGGnAA GGGnAACCAn CCTAAGAGTC 1140 
CG 1142

(2) INFORMATION FOR SEQ ID NO: 415:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6285 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415:

CTTTAGTTCA


GGCTGGGCCT 


TTTGAAAATT 


TAGA1 GAAbb 


A 1 1 X b A lvjlL 


lAtbtLAt 1 b 


60


AAGCCTTTAA 


TGCGCCAGAT 


GGTCGTGCAC 


TTGbbb IbAb 


t t c n 7A t t n rr 


1 lbvvVwHvjnnn 


120


TCACTTACCC 


AAGTGATGTG 


GAGGGTTGGG 


/"•TV TV TV TPPPTT 

CAAAlbbbi 1 


AAb 1^1 Vjvj 1 1 


AHrtortAL 1 b A 


180


CAATTCACAA 


CGGGAAACTA 


TTTCAATATC 


CAGTTTCTGA 


MLKb/wi 1 b 


bl IbblbAAI


240


CCGCTACTAC 


TTTATCAAAT 


GGCTGCCATT 


TCTTATCTAC 


TGCTTCTTTT 


bAAl 1AGAAG


300


TGGATATTCC 


CAAAAATGAG 


ATTGCTTTTA 


TTCGGCTTTT 


AGCGAACGAA 


AbGGGTTCAA 


360 


AAGGACTTTT 


AATTACAATT 


GrTACGATTC 


ATGGTAAAAT 


AACCCTTGAI 


CgAAbATl TG 




CTGGCCAATC 


TTTTGCTGAA 


AAGTATGGCA 


CAATTCGTGA 


AACTAAAATT 


Tv TV TV TV TVTArnTV 

AG G AAAAAT A 




AGTCAGTTCA 


GTTAACTATT 


TTTGTTGATT 


GCTCTGTTGC 


TV /** TV TV TA fTl TA fT* 

AGAAATCTAT 


G T AAAT AAAG 


D4 U 


GTGAAAAAAC 


GATGACTGGT 


CGCTTCTTTC 


CAGATAAAGC 


/-I TV TV 7\ m TV OH 

GCAACAGTAT 


CTTCATCTAT 


bUU 


CCAAGACGGC 


AAAAGCTTGT 


TTTTATGAGC 


TGGAAAATAC 


GAATAATTAG 


GAATGATGGT 


c r» 


GAATTTTGAT 


GGTGGTTAAA 


TTAACGGATG 


TAGCAAAGCT 


TGCTGGGGTG 


AGCCCGACAA 


TO f\ 
/ Z U 


CGGTAAGCCG 


CGTGATTAAT 


AATTATGGTT 


ATCTTAGTCA 


TV TV TV TV TA /"i TV TV fTl TT\ 

AAAAACAATT 


GAT AAAG lib 


7 £3 n 


ATCAAGCGAT 


GGAAGAATTA 


AATTATCAAC 


CTAATGGATT 


TA /"* /""* TV /"^ TV TV /"* 

AGCCAGAAGC 


y-*i m r" 1 TV A y^* TV TV 

CTCCAAGGAA 


o 4 U 


AAAGTACGCA 


GCTGATTGGT 


TTAGTCTTCC 


CTTCTGTTAG 


TCATCCATTT 


TTTGGTGAA1 


yuu 


TAATTGAAAC 


ACTGGAAAGA 


AAGCTCTTTG 


TTCAAGGATA 


TAAAGTGATT 


TTATGTGATA 


q c n 

t 


GTGAAAAAGA 


TCCAGAAAAA 


GAGCGCGCCT 


ATTTACGAAT 


GCTCGCTGCA 


t\ tv m tv tv t\ m /*• /*• 

AATAAAGTGG 


lUz U 


ACGGTGTAAT 


CACTGGTAGC 


CATAACTTAG 


CTATTAACGA 


ATATGAAAAT 


GTTTCACTAC 


lUo U 


CTATTGTTTC 


CTTTGACCGT 


TTCTTGGCAC 


CTGGCATTCC 


AATTGTCTCT 


TbbbAAAAb 1 


I 1 A Cl 

I I y u 


TTCAAGGGGG 


CCAAAAAGCC 


ACTGAAGCCT 


TATTTGCAAG 


TGGTGCACAA 


A A A T> T> A A 

AAGAT. I Gb AA 


1ZUU 


TTATTACTGG 


TGCTAATAAC 


ACAGGCGCAC 


CTAGCGATTA 


m /■* tv ffi On ^* /"* /■» m 

TCGATTGGCl 


^ /"^ T 1 T* A T> A A A P* 

Gbl 1A1AAAG 


i o en 
liDU 


AAACAATGGA 


AAAATATGGC 


GCAGAAAAAA 


CGATTCTACA 


tv t\ rn nn /~* tv m tv 7\ m 

AATTGATAAT 


rppflppTrn a 
GGGAbG 1 LAA 


1 TOfl 


CAACATTAAA 


AAATCTAGAA 


ATCGAACGTT 


TGCTTCAAAA 


T»T\ TV TV nPTTTA 

TAAAACTG1 A 


b ACbbC A 1 b I 


IjOU 


TTTGTACAGA 


TGATTTGACA 


GCAATTACAG 


m m m T\ TV f¥i T\ m 

TTATGAATAT 


TGCTCAAAAA 


rprp/-> A A ^ A rp7\ m 

1 I b AAb A i A 1 


J. fi H V 


CCATTCCTGA 


AGAATTAAAA 


GTAATTGGTT 


ATGATGGGaC 


AAAAI i AAi L 


AAAAb AA lib 




CCCCACAACT 


ATCAACCATT 


GTGCAGCCAA 


TCGACGAGAT 


bl GlbAbb i 1 


A 1 b A 1 1 b Ab 1 




TACTGCTTCG 


TAGAATGAAG 


GATCCTGATG 


TTGCACTTGA 


rT*T\ A A A T* T 1 A T 1 

GGAAAAl 1 A 1 


bb 1 Ai 1 bbbA 




TTCAGCTATC 


ATTGTCTGAA 


TCCTGTTAAA 


T\ TV TV ^ TV rri 

AAAGACACCT 


III tyAbb I b 1 


blllllllAl 




TCTTAATTCC TCATTTGCCT ATTTTCCCTT AATTCCAAAG CACAGAGATT AAATAATACA 1740

ACAACATGAT CCCTACAAAT ACACCGAGCA TTAGTAAAAC ATACCACCAA TTTTTGGCGC 1800

TATAGGTTTC ATTTTTATTT TTTTCGGTTC GTTTCCCGGC AGCCCATTTC CATAAATCGT 1860

AATTATTCAT CTCTCTCACC ACCTTTAATG GGTTTATTAA CTTAAGCCTA ACACGGAATG 1920

AGAGAACTTG AAAACAAAAG GATCACCCGC TTATGCAATG GATGATCCTT TTTTGGGCAC 1980
i"P7\ iT ,r pp f T ,r rr , Tr'

TATTCI lblb 


AlAAbbbAl 1 


HI ill lnnln 


aptgattggt 

t\\~t i ofti 1 oo i 


ArrAATAAAA 


ACTATCTTTT 


2040 


TTAATbbblb 


b 1 AAA1 b 111 


T ft. B PTP Ti TP. P. 
1 nnb 1 Ln 1 


TPTTPAPGAT 


TAACGTAAAT 


AAAACCGTAG 


2100 


/** /"* ti 07* t* t\ 
bbT i III IbA 


ftl bb 1 1 bHl b 


bbnnJ~ 1 1 /-inn 


ATATPfATGA 

ft 1 ft 1 OV-fft X on 


CCGACCAAGG 


GCAATAGCCA 


2160 


AACAACTbAA 


bAbbb 1 bAbA 


a.ziTZiPPPP.PA 

nn 1 r\b b b o 


tgapaagpag 

1 oftv>nnoOfto 


CAATATGATC 

^ftft l n l On .1 \^ 


ATGCAAATAC 


2220 


GCAATTCGAT 


AATbA 1 b A 1 b 


ftrtl 111 ibbb 


1 v_v^ lLlul 1 ft 


ou i oft ivi uo 


TGTGrCTAAG 

1 O X U\rfVy i, ftftO 


2280 


CCGTTTTCTG 


TAATbAI 1AA 


ppppb. iid.ppp. 

bbbbrtnnbbu 


1 ft 1 1 OrtVwOft I 


AATAATPATT 


TAAAACTAAA 

l ftnnnv^ l nnn 


2340 


CGTAGACCAG 


TTGGATCAAT 


TTbbbbAbbA 


T 7A PTT np. & P. P. 
1 iib 1 1 ftbftbU 


TTTTTTAZlfiTP. 
11111 nnn 1 \j 


TTP. ATTTTTT 
1 1 on llliii 


24 00 


TCGATTTTAA 


AATAACCGTA 


TAbAl bAAAA 


TPTA T\ TP T 1 PPT 

1 bAAl b 1 bb 1 


TTTPTTTPP.T 
1 1 1 b 1 1 I bo 1 


APPPAATHHP, 
n^>\rf <^nn l OOO 


24 60 


TGTTGCTCAT 


CTGTCGGTAA 


ATAbbl Abbb 


APTATAPPT PP 

Ab 1 A/ibb 1 bb 


\3t\ L Mft 1 ftft 1 i 


AAP,AP,PP,ATA 


2520 


AAATCTGGTT 


TAGCGGCTTT 


CAAAATTGbT 


IbAlbl Ibib 


PPTPTPTTTP 
bb Iblbi 1 lb 


P. n, n. H T A A A t a 

ooon 1 nnn 1 o 


2580 


CCGCATTTGG 


mm TV TV TV mj TV ^* m i** 1 

TTAAATACTC 


TTGATAGTAA 


CCAGGGTATT 


CGCCAAAACA 


7\ t AP ATTTPT 

ft 1 ft.V-.ft 1 i 


2640 


AGAGCGTAAT 


TCGTTTTAAA 


GTTATCATTC 


ATACGAGCTG 


CCCAAACATC 


1 1 Lnuuv* l ift 


2700 


TTAGAACTAG 


Ti m t\ tv /■* m tv m 

GATAAGTCAT 


CGTTGAGGAA 


ACAGAGGGAC 


CGACTTTTCC 




2760 


AATTGGTGAC 


AATCATTCGT 


CGcTAACGCA 


TGGGCAACAA 


ACATATGGTA 


ATPPATTTfiT 

ft 1 V^v^ft 11 1 O 1 


2820 


GCACGAATTT 


TTTCAACATT 


TTCTGGTGCA ACATCGTACA TATTCATTCG 


TTCATCAAPA 
1 1 Vrfft 1 V—ftnXwrn 


2880 


CGAATCATTA 


AGTTTTGTTC 


GTTCGTAACT 


TGCCAATTTT 


TGACTCGATC 


APPAAATP,PT 
ftl^^ftftft luOl 


2940 


TGATAACAGA 


CTTGCGCATA 


GCGTTGAAAT 


GCGGAAACGC 


AGCGACGATC 


pnpprAACPA 

\s o Onnv Oft 


3000 


TTGTATTTTT 


CCACCAACGC 


AAAAGGTAAA 


TCAAAATGAT 


AAAGTGTGAC 


AAAPHP,PAAA 
ftnn*w o o v-i nnft 


3060 


ATATCGTTTT 


CTAAAAGTGT 


ATCAATGACA 


GCATTATAAA 


AATCGATGCC 


ooO 1 lonl lo 


3120 


ATTTCGCCAT 


CACCTGTTGG 


AATGATTCGT 


GCCCATGAAA 


GAGAAAAACG 


AT AHP^TTTTT 

ftlftOOl X 1 1 1 


3180 


AAGCCTAATT 


CTTTCATTAA 


CGCAATGTCC 


TCTTTGAAAC 


GATGGTAAAA 


ft 1 V_rOV_> X UUV/U 


3240 


ACTTTCGTAT 


CTGCTTGTAC 


AGCTGACTTT 


TTAAAAGAAT 


TGTAATCAGC 


oftO l o l l noo 


3300 


CCTTTGCCGT 


CbTbATTbbA 


AGCGCCCTCA 


ATTTGAAAAG 


CGGAGGAAGA 


AGCGCCCCAT 


3360 


AAAAAATCGT 


TCTTAAA1 1 b 


TTTAACCATT 


AGTTTTGCTC 


CTTTACTTGC 


ATGAAAAATA 


3420 


AATTTGTTTG 


TGGTGTAATb 


TCTGTTTGTT 


TGGTTGTCTC 


AATTACAGCC 


ATTTCCGCAG 


3480 


TATTTGTTAC 


GATGACTAAb 


ACGGTTGGAT 


CATAACCAGC 


AGCTTGGATG 


ATTTGTTT AT * 

ft 1 X X O X X X ft X 


3540 


CAAAAbAbAb 


bAbb 1 I I I b 1 


CCTAGCCTGA 


CTCTTTCTCC 


TACTTTTACG 


AAGGTTTCAA 


3600 


Al Gbbbbbbb 


Til 1 AA1 lbb 


ACGGTATTGA 


TTCCAATGTG 


AAGGAGGATT 


TCAGCACCAT 


3660 


TGTCTAACTT TAAGCCAATT GCATGATGAG TTGGATATAA AGCCGTCACT TGCCCATTTC 3720

CTGGTGAAAC AATTTGTTCC TCTGTCGGAT AAATAGCTAG CGTTTTTCCC ATTTGTTGAG 3780

AAGCAAAGAC TTCATCGTTG ATTTGGTCTA AAGATACGAG TTGTCCTTGA ACAGGTGCTT 3840

GAATCCGTTG TTTTTCTTTG ACTGTCCCTG CCTCTGTCTT GAAGCTGGGT TCTTGACGTG 3900

ACTCGTCAAG CGGTAAATCA TTAAAGCCTA GAAAAATCGT TAATAGAAAT GCACCAAAAA 3960
AGCAATACCA CAACCTAATA AATAACAAAT AAAGGCTTTG GTTCCTGCTT CTGCGTAAAC 4020

JGGGATGGTC AAGAGTCCTT GGTTAGCGAA GGCTGTCCCG TAGGAACCGC CCCAGCCCAT 4080

TAAGGCCCCG CCTGCTGcTC CGCTAGCCAC CGCATAAATC ATCGGTTTTT TTAGCCGCAA 4140

GTTTGCACCA TAAATGGCTG GTTCTGTGAT ACCAAATAAT GCAGTCACTG TTGCAGAGGC 4200

AGCAACAGCT TTCAAGTCTT TATTTTTTGT GCGGACAAAC ACGCCAAATG CGGCGCCCGC 4260

TTGAGAAAAG TTTGCTGCAC CAGCAAACGC TAGCAAATTT TGCCGTCCTG TTCGAGCGAC 4320

ATCATTAATC CCAATTGGCA CAAGCCCACG GTGAGCCCCA AAAATGACTA AGACACACCA 4380

AATTCCCCCG ATAAAACCGC CTAACAAGAT GGGACTCAAA TTCATTATAT AATAATACAA 4440

CCAATTCACG CCGTCACCTA AATAAATCCC TATCGGTCCA AATAATAATA AAGTGGCTGG 4500

AATCATTAAA AGTAATGACA GCGTAGGAAC CATAATAATT TTTAGAACTT CTGGAATGAC 4560

TTTTTCCAAA AAGCGTTGCA CATAAGCTAA CATAAAAATT GCAAGAATAA TAGGAATTAC 4620

TGTGGAAACA TAATTGGCTT TGGTCACATG TAATCCAAAC AAGGTAACCG CTGATTCTTC 4680

AGTCATTAAT GAAACGATGG TTGGATAACA TAAGGTGGCA CCAATTACCA TGGCGATATA 4740

TTCGTTTGTT TTGAAAACCT TAGCAGCTGA ATAACCTAAA ATAACTGGCA TAAAATAAAA 4800

AAGTGCATCG GAAGCGGCGC TTAAAATAAT ATACGTATTC AAAGGTTTTA AATCGACCTG 4860

GTAAAAATTG TTTGCGACCA TCACAGCAAT AGCTAGAATT CCTTTTAACA TGCCTGAAGC 4920

GGCAATTGCT GGTACGACAG GTGTGAAAAT CGCCGCAACA GTATTTAAAA TTCGATTACC 4980

TAATGAATTT TTTTCTTTTG TAGATGTTTT GGTCGTAAGT TGCTGAGCCA ACAAAGGTTC 5040

TAGAGCATTA TAGACATGCG CGACTTTATT GCCTAAAACC ACTTGATACT GCCCGCCGCT 5100

TTCAACAACA GAAATGACAC CTTCTGTTTG TAGTAAGGCT TCTTTGTTCG CTTGTTTCGT 5160

ATCTTTCAAT ACGAAACGCA ATCGTGTATA ACAATGTGTG ACATCCACGA TATTGTCTTT 5220

GCCACCTACG TCTTTCAAAA TTTCTTTTGC TATTGCTTGG TAGTCCATGG TGACCACTCC 5280

TTATCTATAA AATAACAGCT CAATaAAAAA GACCAAATCA CACATGCACG TTTCAGCAAA 5340

CACACTCAAT ATTTGCTCGT CGTGAATGTC CCTTTGATCT GATCTACGTA ATAACTATGT 5400

TTCTCACGAT AGAATAGCTG ATTGAATTTA TTTTTAATTT ACCATTATTT AGAGGGATTA 5460

TCAATGAATT AGGTAGAGCA ACTCAGTGTT TTGTTACTAA AATCAGCAGT GAAAATAGTA 5520

GTTTACTACA TTCACTGCTG ATTTATTTGA TTACAAGACT GGGGACAACA AGCGTGAAAA 5580

GTTTTGCTTG AATCGTAGCC AGTACCCTTG TTTTTTAATT ATCTCGTCGG TTAGTAAGAC 5640

ACTTTTTTCT ATATCTTCTT CAAAGACTTG CGCTAACATC CACGCAATGT CGGGATTGTA 5700

AATAAACGTA CTTACCTCAA AATTCAATGA ATAACTGCGA ATATCTTGGT TTGTTGTCCC 5760

AACCATAGCT AATTCATCAT CAATCACCAT TGTTTTTGCA TGAATAAACC CACTATCATA 5820

AATATAAATT TTAATGCCGC GTTTGTGTAA GTAATTTGCG TAATATTGTG TCGCGCGGTA 5880

AATAAAGGGA TGGTCAGGCA TACAAGGAAT CATAATTCGC ACGTCGACAC CTGAACGAAC 5940

CGCCACTAAC AACGCATTAA TCATACTATC ATCAGGGATT AAATACGGTG TTTGAATCCA 6000

GACACTCTTT TCTGCAGAGA AAATCATCCG AACAAACCCG CTTTTCAAAA TTTCTTCTTC 6060

TGAATCAGGA CCATCTGAAA CAACTTGTAA GGACAAATGC TCCACTTCAT GATCTTCTGG 6120

TAATTTAAAA TATTTTTCAT GATACGTCAT ACGTTCTTCA GGATATTTGA CTGAAGCATT 6180

CCAATCCATG ATAAATATCT CTTGCAATGA AAATACGGCC GTCCCGACCA AACGAATATG 6240

GGTATCTCGC CAGTAACCAA ATTTTTCGGT GACATTCAAA TATTG 6285



(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 



AAATTATTnG AnTTTCTTAC CCGCCATTAT CCATAATCCT TTAGTTCTCT CTTGGTTTCg 60

GGATTCGAAG TAGGCGTATC aCAACTACTA AAaTGGTTAG tATGGAATGG AGGACTgCAG 120

CATTATTTAC ATGTTTTACT TTAGCTAATC CTTTAACAAT TTTAACTTCT TTTATAATGG 180

CTCCAGTTGG CGCACTAAGT CCsGTATTAT CmGTAGGAAT GTTTTCAGCC CTAATGGAAG 240

CTAGTATAAA AAAACCTACG GTCAATGATT TTATGAATGC ACAAGACGAT ATTTCATCAA 300

TTAAATCGAT TTACAAAAAT AGATTGTTAA AGGTGGGTTT AATCTTTGTG TTAGCTAGTG 360

CAGGTGGGGC AATAGGTAAT ATTATCGGGG GAATAGAATT ATTTAAGAAT TTAATTTAAT 420

TAATAAACTA GGAGGAATTT AATTTATGAA GAAGAGAACG ATGCTTTCAT TTTTTTGTGT 480

AAGTGTGCTA GTTTTGATTT TAGGAGCTTG TAATGGAAAA GAAGGAACAA AAAATAACTC 540

TAAATCAAAA GAAGTTGAGC AAGTAGCTAC TTTTTCAGCT ATGACACCTG TTCAAAGTTT 600

AGACCCAGCA ACAGCAGTGG ATCAAACAAG TATTACATTG TTGGCCAACG TAATGGAAGG 660

GTTGTACCGA TTAGATGAAA AAAaTCmrCC mCmACCmGCm ATTGCmGCyG GGgCAmCCAA 720

AAttCCmATA AtGGTAAAAC ATATACAATT GTGATTCGAG ACGGTGCAAn ATGGGCCGAT 780

GGTACAG 787



(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1092 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 
CTACGTTAAA TGAATGGAAA AAACGATTTC CGGACATTAA GGACACCATA nAAAGAGGTA 60

AAGAAGTTGT AGACCGCCAA GTGGAAAaTG CaTTGTTTAA AmGTGCGGTG GGTTACGArT 120

ACACTGrAAT TACTAAAGAG TTAACTGACT CgGCATGCAA AGTAACTAAA CGAGTAACAA 180

AACAAGTGGC CCCTAACCCA ACATCTGCTA TTTTCTGGTT GAAAAATAGA AAACCAGATG 240

AGTGGAGAGA TAAGAAAGAA ACTCAAGTTT CTGGTGAAAT GTCTGTTAAC AATCCTTTTG 300

CTGGTTTGTC TGAAAAAGAA TTAAGAAAGT TAGCTGAAAG TGATGAATAA GATTGTTTTG 360

GGTGCGAAGT TAGAATTATC CCGTCGTTAT TTTTGGGAAT ATTGCAAATT AACGGCATCT 420

GACTTTTATA AGCAAGACAG AGAGTACTTA AAAGAGTTAT GTGATGACTT GCAAGAATTC 480

ATTTATGATA GCGATAATGA CGTTCTAGTT ATCAACGAAC CGCCGAGACA CGGGAAATCG 540

AGAACTGCTG GTAAATTCGT AGAATGGTTG TTAGGGAATG ACACTCGAAA AAAAATAATG 600

ACGGGATCAT ATAACGAAAC GTTATCTACA ACATTTTCTA AAAGCGTAAG GAATACTATT 660

CAAGAAATAA AAGCTGATAA AAACAGAGTT GTATTTTCAG ATGTATTCCC TGGTGTAGAG 720

ATAAAGTCTG GTGATGGAGC CATGAACTTA TGGAGTTTGA CTGGCGGATA TAACAATTAT 780

CTAGCAACGT CACCAACTGG CACaGCTACA GGGTTTGGTG CAGACATTAT TATCATTGAT 840

GATTTAATTA AAAATGCTGA AGAAGCAAAT AACGCTATGG TATTAGAGAA ACACTGGGAG 900

TGGTTTGTTA ATACGATGCT ATCTCGTTAG GAACCnGGCG GCAAAATCAT CATCATTATG 960

ACCAGATGGG ATTCnAATGA TTTAGCCnGT AAAAGCTTAA AAGGATTGCC GCAATCCAGG 1020

CTATAAAGTA AAACCTTATT TAGCCTGGAA GGCATATTAC CGAGGAAACC GGCCACCAnG 1080

CCTTGTGGAG TC 1092



(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

GTTTTCGCAA TGTCCATGTT TCCCCAAGTG TCATACCAAA TATAGGTATG CGCACGTTTA 60 

CCTTGGGCAA TAGCACTTGC TACTTGCGTT TTATATGTGT ATTGTTCATA AATACCGCTA 120 

GCATTGTAGC CGCCAATCTG AGCAATGGCG AATTTATCAT GTGCATAACC AAAACGACCC 180 

TGTTCGCCTT GATAAATCGC CCAGTCAACG CCTTGGTCAC CTTTTGCAGC AAATACATTT 240

AAAGGCATAA AAAATAGAGC GACAAGCGCT CCTACTAAAA TTTTCTTTTT CATTTTTACT 300 

TCTCCTTGTC TTTTAAATTA TATGCTGACA CACCTGTTAC TACTCCTAAA AAAGTTGCAA 360 

TGGCATTGAT AGTTAAAACA GCCATATCTG TTTGTTGCCA ACCGTAGGCT TTACCTAGCG 420

TGGCAATCAA AACAGaACTT GCAGGAAGCA CAGTAAGCAC TCCCCACTTG ATGATTTTGT 480

AATACTTGTC TGGTAATATC ATTTCTAAAT TCCTCCTAAG TATTTCGTGA TTAAATAGAC 540 

AGCAACAGAA ACCCCAATTC CTGCAATTGT TCGCCACGTC CACTTTTGAT TCTCTTTTAT 600 

TTCCGCAATA TCGCCTTCAT TGTTTTTGGC CATTGAGAGC GCTATGTCTG CTTTCTCTCT 660 
TAATTGTTCA TGATTATCCA ACTTTGTTTC AATCCGTGCC AAACGATCGA CGATTTCAAT 720

TAAAGGATCA TCTTTCAAGT TATCGTCTCC ATCCCTCTAA CAAAAGAACC GCCTAGCTTT 780 

TGCTAAACGG TTCACCTGTC ATTTTAGTAA ATTCATCTTC TGTAATACAA CTAGGCACAA 840

ATTCTGCAAC CTGCTCTGGA GTAAATAGTC CCCAATCATA CATCAGTTTA ATGTCATCAT 900 

ATGAATACAT TATTTTGCAC CTCCGATTTG TTCTTTAATA GCATCAATTT CTCTTGTATT 960 

TTGAAGAGAA GTAAGCATCG TCTTAGAATT GATTTGTGCA AGTGATTCTG CTTTAGCAGT 1020 

TAGCTTTTCA TTTGCTTGTT TTAACTCACT ATTTGAAACT TCTAAGCCAT TAGCAAGATT 1080 

TTCTAACAAA TTCAATTTTT TTGTATAATC CTGTGTCACT GCTTCTTCCC ATTTTTGTTC 1140

TGAAAAGTTA AAGAATTGGG ATTGTTCATT CACTAAATTT TCTAGTGGTT TCTCCTCTAC 1200 

AAATGGGAAG GCTAGAACAT AATCATCTTG AACTTCGAAA ATTTGAAATC CTACTGGGTA 1260 

GAGCACTTTA TATATTTTTT TCATTTTATT ATCCACTCCT ATTTACGACG CATAACGCCG 1320 

ATTTCTCTAA AAACAATTTT TTTATTTGCT TCATTTAATG TGTTTGCAGT ATCTCCTGTA 1380 

ATCACTAAAG ATGTACCAGA TTTAGAAATA GATATTTTTT TATACGCTAT TGTTGGGTCT 1440

GTTGAAGCAG AACCTACCCA TGCAACTGCT GGTAATTCGT ATTTTAATTT CGTAATATTC 1500 

GGAGTAACTG GTATTGAGTG AACAATACCG CCACCGTTAT CGTTGTAACG CGAGAATATC 1560 

AAAACAATTT CATCCACATT GCTGCTATTT GAAATTGTTA CTGATTGAGA TTCTGTCAAA 1620 

TAAGCTCCAG TTCCATCCCA GTAGTCTTTC ACCACCTCAT AGGTCGGTTT TGCATTAACC 1680 

GATACAGATT TACCACCGAT TTGAATCCCA TCTTGAAAAT TCTTTGTCCC TAAAACAGTC 1740

TCATTTCCAA CGGCCTTTAC TAATTTTCCT TCCACGCCGT CAATAGCATC TGCATGTGTT 1800 

TTCATATACT TTACAACACC ATTTTCTTTT AGCTGAACGA TATCTGCCAT TACGCTTCAC 1860 

CTACCTTTTC AAATGTAAAA ACTGGTAATG CATCCAATTT TGCTTTATCC GTTTTAGACA 1920 

TTAAACCGTC TTTTTCAGAA GTGGCATTGC TAGGAAGCGT TGGAATAACA GTTGTGTCTG 1980 

GCAGTGCTTT TACATCAGAA GCAGTTAAAA TAACTTCACC TGTATGACCA TTTACAGACG 2040

AGACAGTGCC TGCTTCAGCA CCACTAATTT TTCCATCAAC AAATTCATTT AATCCAACAA 2100 

CGCCAGCTGT ACTAGTTTGT ACATCAATAG CTACGCCGTC TTTTTTCACT ACATATAAAT 2160 

CAGGCATTTA TTTCTTCATC TCCTTTTACT TTTTCAAACT CAACACCAGA ACCACCTAGT 2220 

TTTCCATCTT CATAATCGGC TATGATTTTT AACATTTTGT CATACTCCTG TTTCGAAATC 2280 

ATAATCCCAT CAATAGGTAA ATCTAGGTCT GCACGCGTAA TAATGACTGC GCCTGTATGT 2340

CCATTTACTG AAGATACTTT TGAATTACCA GCCATTATCT CTGTTAATCC AAGGATTGCC 2400

GATACGTGTG TCATAGGAAA AAACTGACGT TTAATCCCAT TTTCATCAGT TTCCATCATT 2460

CTTTTAGCAT CAACCATTGT CTACACCTTC AATCGTAAAA ACATTTTGTT TTGAATCATC 2520 

AACTGTCGCT ATAATTAACG CCCCTTCTTC aATTGGATGA TTAACTGTTC CCAATACTTC 2580 

AACTTCATGA TTCTTAGAAA ATGAATCATC CTCTAAAATT TCTAACGTGT TTACATTACC 2640

GTATTTGATG GTGTATAGCC GTTCCTCTAA CCGATGATAC AAATAGTTCA TATCAGCCAA 2700

TAAACGCTCA GAAAGTGAAT TATGACGTAC TCCTTGTATG TCTACACGTG CATCCATTAG 2760

CTCGGCTAAC ATTGTTCCGC CTGGATCAAT AGTTTTTAAA ATATCTTTGA TTGATTCGAA 2820

CCATTTTAGA TAATCTGTTT CTTGGCCGTT CCGCCAAGCT TCAAAAGTAT CTTGTTGATT 2880

TTTACGCCAT TTTTCAAACT CTTCTTTTCT AGCGTTCATC CACGCTGTGA AATCGCCCTT 2940

GTTTTCATTA ATAAAAGCAG TCATATCAGC TATCAAATCC TCGATTGACT GCCAATATGA 3000

CCCCATTTCA CCTTCTGTTT TAGAAGCAGC ATTCACAACA AAGTAAGAAA AGTTCTGCGT 3060

TGAGCCAATT AGATTGTCGC CTTTATGAAT ACTGAAGTAT GCTTCCTGTC TATGCAATGA 3120

CTGCATAGAA TATTCATCAA AGGTATACTG GATAATCCCT TTTTTGGCAT TCACAATTTT 3180

TGCTGCTCGT TGAATCGGAT ATTTATTATC AATAACTGAT TCAAAAAAAA CTTCGCAACC 3240

TGTTAAATCA AGTGGCAAAG CATTTTCAAC TAGTATGGCT TCTAAGACTT CTGTGTTTCG 3300

ATTTCCTTGT CGTACATTCT GAATCCCAAT GTAATTGTAA GGTTCAGTTG TACTTAGCGT 3360

TGCTTGCCAT TTAACCATTG AAAAATCCTC CTCTCGTTAT TTTGGTGGAA TAACAATCGA 3420

TTGAATAGAA TTAGCAAAAT ATAATCGGTC ATATTTTGCG ACAATTTGTC CTTGCTCGGC 3480

ATTCTGTTCT ATCGTTTGGA TACGTCCGTT ATTTAAGCCG TAAATCACGC CCGTGTGACC 3540

ATATGTTGGG TCTACTGTCC AACCTGgTTC CCCATTGGnC ACTCGTCTAA TATTGACGAT 3600

GCCCCTACnA CTAAAnCTGA TACGTGGATT GGATACTCGC CAACTACGnA TCCAATCAAA 3660

GCTCACCATA TTGCAGCAGA GATGATCACC ATACAGTGAA AGCATAATG 3709



(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1649 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 

AAAATAAATT CCCTGTAGGT TCATCTGCAA AAATAATTTG CGGATTATTA ACTAAAGCAC 60 

GCGCTATACA TACCCGTTGT TTTTGGCCAC CAGATAAATT ATTCGCTTTA TTTTGAACTT 120 

TATCTTCCAA ACCGACTGCT TTTAACGCAT CTAAAGCCAT CTTTTTCCTC TTGCTACCGG 180 

AAATCCCACC AATTTTTAAT GGTAAAAGAA CATTATTTAA AACTGTATCC TTGGCATTCA 240

TAAAGAATTG TTGAAAAACA AAACCAAATT CTTCATTTCT CGTTTTATTC AGCACTTTCT 300 

TTCTAATACT AGTAACATTT TTACCATTTA AATAAATATC TCCTGAAGTT GGTTGATCTA 360 

ACAATGCCAA AATGTGCATA AATGTTGATT TCCCAGAGCC ACTTTTACCA ATTATGGCCA 420 

CAGATTCACC TTTTTCAACT TTTAAATCAA CACCTTTTAA CGCGTCAAAT TTTGTtTCGT 480

tCTTTCCATA ACTTTTTTTG ATATTTTTtG CTtCAATTAC CGCCATGTTA CCCCTCCATT 540

TTTCATTCTT TGTCGTTTAC TTATTTACAA AGTTTTTTAC TCTTTTCATA CTCCTTTCTT 600

CTGTTTCTAT TTTGACGGAA GCCCTTTCCT TTGTCATGGT ACTTCCGATG TATCTCTCAA 660

TTAAAACAAT AAGAACATGT TCCAAATTCA CCTCATTGAA CATGTTCTTA TTAATTAGTT 720

ATGCATTTAC GCGTTTAATT TCACGATGTT TATTCAATAA ATAGGCCAAG CCAAGATTAA 780

TACCAACAAA GCAAcTGATT GTCAGAAGAA GTGGTAGGAC AGTAAAACCA GTGGATTGAT 840

AACCAACCAT TGTTTGCAAG ATGTGGAGCA TTCTCAGGCG CATTGGTGCC CCCCCTAATT 900

GTGCAATCGT GCCCAGTAAA ATAGGAATCA TTAATAACAG CAATAAAACA ATGAGTTTTT 960

TTACGCCATC TATCCGATCA TTAAAGACAC CTGCTAACAA GCCAAGTGAG CCACTAAGCA 1020

TTAGTAAGAT AAAAAAGAGC AACCACGACA TGAAAAAGTT TCCTTGTGCA TAGACATCAA 1080

TTAATTTCAT TGaTAATTgA AAATGAGAAA TGAAATTGcC GCTAAATACT TTAATCAGTA 1140

CAAGTACAGC CAAGGrTCCA ACCAAAGATA rAaTwGCATT aCTTACmAAA TTCmCCAAAA 1200

AAATTGTCCA GCGGGAAAGC CCGTTTTGAA TAAACAGTTT AAAATCTGTG TTCATCCCCA 1260

AAAATGATAG TATCCCCATA AAAACCAGAC AAGGAATTAC TGCATCAGAG CTGACAGTGT 1320

TCACGTCGTT AGAAAAGAGC AACCCAATAA GAGGAAAAAG AATGCCGAAC AAAGCmTAAA 1380

AGCCAAAATA AATTGCTAAG GrACGGrCTT GATAAATAAC ACGGTACCGC AAAGCTGTTT 1440

TAAATTTCAT TTTACTTTTC CyTCTTTCTA TTGGTTAATk GTmCAAAATA TGTTTGTAAA 1500

TTTAGCGGCG CAATTTGAAC CTCTGCTTTT TCAGTCGGCA AGTCGCCGTA CACGTAAGCC 1560

GTAACTGCTC CACCTAAAGT GTCCATGCCT AAGATTTCCA ATGACTGAGT ATAGCGCTCT 1620

ACTnGTTCTT TCGGCCCnGA AACAATGCG 1649



(2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 

AGGGTAGCAC AGAAAAGAGG GAGGTATGAC AATGGAACAT CATCAACTAA AAGCATTTCC 60 

GCAAGATTTT TTATGGGGCT CTGCTTCGGC AGCTTATCAA GTGGAAGGTG CTTGGAATGA 120 

AgACGGCAAA GGCGCATCTG TTtGGGGATG AATTTGTTCG GCTACCAGnA AAAACGTTCA 180 

AAGAGACAAC TGGCGATTTA GCCGTCGATC ATTATCATCG TTTCAAAGAA GATGTGGCTT 240 

TAATGAAACA ACAAGGTTTA AAAGCATATC GTTTCTCGAT TGCATGGACA CGAATTTTAC 300 

CTGAAGGGCG TGGTCGAGTA AATCAAGCGG GACTGAAATT TTATTCAGAT TTAATTGATG 360 

AGTTATTAGC GGCAGGAATT GAACCAATGG TAACTTTGTA TCATTGGGAT TTGCCAGCTG 420

TTTTGCAAAA AGAATATGGT GGCTGGGAAT CAAGAAAAAT TATTGATGAT TTTGTGGCCT 480

ATGCGAAAAT TTTATTTGAC GCTTTCCGTG GCAAAGTTCG CTATTGGATT AGTTTAAATG 540




AACAAAATGT GTTTACTTCG TTAGGTTATC AATTAGCCGT GCATCCTCCC GGCGTAACAG 600

ATAATAAACG GATGTATGAG GCCAATCATA TTGCTAATTT AGCCAATGCA GCAGTGATTA 660 

ATGCGTTTCA TGAATTAGAA ATGCCTGGCA AAATTGGCCC TAGCTTTGCT TATACGCCTG 720 

TTTATGCCGT TGATGCGCAT CCTGCCAATG TTTTGGCGGC AGAAAATGCA GAAGATTTGT 780 

TGTCGCATTT TTGGTTAGAT GTCTATCTTT GGGGAGAATA CCCAATTGCC GCATTAAATT 840

ACTTACAGGA GCAAGGTGTG GCCCCAACGA TAAAAGAGGG CGATTTAGCG TTACTACGTT 900 

CAGCCAAACC TGATTTTCTA GGTATTAATT ATTATCGGAC GGATACGGTT GCTGCTAATC 960 

CATTAGATGG CGTCGGAATA GGGAAGATGA ATACAACGGG TGAAAAGGGT TCTGAAACGG 1020 

AATCTGGGGT ACCGGGTTTA TTTAAAAAAG TAAATAATCC ATATGTTGAA CGGACGAACT 1080

GGGATTGGGC CATTGACCCG CAAGGTTTGA GAATTGCCTT ACGACGATTA GCTAGTCGCT 1140

ATCAAGTCCC TATTTTAATT ACTGAAAATG GGTTAGGCGA ATATGATACC TTAACAGAAG 1200 

ATAAACAAAT TCATGATACG TATCGCATTG ATTATTTGCG AgCCATaTCC AAGCGATTCA 1260

AGAGGCGATT ACGGATGGTG TTTCAGTTAT TGGTTATTGT ACATGGAGTT ACACCGATTT 1320 

ACTGTCGTGG CTCAACGGCT ATCAAAAGCG TTATGGTTTT GTCTATGTGG ATCAAGATGA 1380 

AACACAAAAA GGTTCCTTAG AAAGAATCCC GAAAGACAGT TATTATTGGT ATCAAAAAGT 1440

CATTGAAACA AATGGACTTA TCTAATTATA AGAAAAGAGT GTGGAAAATA TGACAGTAAA 1500 

AATGGGGTTT ATTGGATTTG GTAAAAGTGC CAACCGCTAC CATTTACCTT ACGTAATGAT 1560 

TCGAGAAACA CTAGAAGTGA AAACGATTTT CGATTTACAT GTTAATGAAA AAGCAGCAGC 1620 

GCCCTTTAAA GAAAAAGGCG TAAATTTTAC CGCTGACCTT AATGAATTAT TGACGGATCC 1680 

AGAAATTGAA TTGATCACAA TCTGTACACC TGCGCATACA CATTATGATT TAGCCAAACA 1740 

AGCCATTTTA GCTGGTAAAT CAGTGATTGT GGAAAAACCA TTTTGTGATA CGTTAGAACA 1800 

TGCAGAGGAA TTATTTGCTT TAGGGCAAGA AAAAGGGGTA GTAGTGATGC CTTATCAAAA 1860 

TCGACGTTTC GACGGCGATT ACCTAGCGAT GAAGCAAGTA GTAGAGCAAG GCTTTTTAGG 1920 

AGAAATCAAT GAAGTGGAAA CACATATTGA TTATTATCGA CCAGGAAGTA TCACGGAACA 1980 

AGGACCTAAA GAAAATGGTT CGTTTTACGG ACTAGGCATT CATTTAATGG aTCGCATGAT 2040 

TGCTCTTTTC GGGCGTCCAG ATCAAGTCAC TTATGATATT CGTAATAATG AAGTGAGCGA 2100 

GGCAGTCGAT AATTATTTTG ACGTTGATTT ACATTATGGT TCaAAATTAA AAGTAAAAGT 2160 

TAAAACGAAT CaTTCTGTCG CTAGTCCgTA TCCaCGTTTT ATTGTTCATG GCAGCAATGG 2220 

CTCGTTCaTC AAATATGGTG AAGATCAACA AGAAAATGAT TTAAAAGCCG GAATCATGCC 2280 

AGATGCACCA GGCTTTGGAG AAGATTCACC AATGTATTAT GGAGAAGTGA CCTACCGTAA 2340

TGGCAATGGC GACTGGATTA AAAAACAAAT TAAAACGCCT GTAGGTGATT ACGGTCGTTA 2400

TTATGATGCC GTTTATGAAA CGTTGAAAAA TGGTGCACCG CAACTAGTAA CTAAAGAGCA 2460

AGCATTAACT AACATCGAAA TTTTAGAAGC AGGTTTTCTT AACCCAAGTC CAAGTGTTTA 2520 

TCATTTGAAA GAAAACTAAA GCGTATTTAA ATGACTTGAA AAGsGCCTGC CCTACAGTCC 2580

AGACAACTGT GGGGCAGGCG CTTTTATTTA TAAAGAATTG TGAATTTTAA TATAAGAAAC 2640

GTATATTGTT AGTAAAAATA AaAAAAGGAA AGACATCTTA GTAGCACTAC TTCCCCAAGA 2700

ATTGCTACTA AAATGTCTTT TGTGATGCTC GCTCAATTGA AGAGCCTTAA ATAGGATATC 2760

GAATATCCAG TATAAATGCA AGTAAACTAC TTTATAGTTA ATATTAGTAT ACTTCATAAA 2820

ATAATTTTTC AAAGAGAAAA AGGCTTGTTA TGAGAAAAAA TTAATTTTTA ACTCGTTTTT 2880

TCTAAGCAAG ATTCGGGTGA AAAAGACAAT AATTACTGAA GCTGAATGGT AATTAAAGAG 2940

GTTCTTTGCT AAAATAGGAA AGACAAGAAA GTTTGGTTAA CGAAAGAACC AAATCAGAAA 3000

GGAGGACAAC CAGTGGAGAG TTGGCTTTTT TTACTACTGA TTGCCTTGAT TGCGTTTGTT 3060

GCAAAAAATC AAAGTTTATT AATTGCGTCA GTTGTTGTGC TTGCTTTAAA AGCTTTACCA 3120

AATAGTGCGA AAATCATGTC GTGGTTGTCA GATAAAGGCA TTAATTTAGG CGTCACAATT 3180

ATTTCAATTA CCATTTTAGT CCCGATTGCT ACAGGACAAA TTGGCTTAAA AGATTTAATC 3240

CAATCATTCA AAACACCGAT GGGATGGCTG GGGATACTTT GTGGAATTCT GGTGGCTGTT 3300

TTATCCTCAA AAGGAGTAGG TTTAATTAAT CAATCGCCTG AGATTACGGT GGCCTTAGTT 3360

TTTGGAACGA TTCTTGGCGT CGTTTTTTTA AAAGGAATTG CAGCTGGTCC GATTATTGCA 3420

TCGGGGATGA TGTACGTAAT CATTACTACC TTTCAAGCCT TTCAACACTA AAAAAGCAAA 3480

CAAAACGCGT AAACGTTTTG TTTGCTTTTT TAGTGTTGAT AAATGTCCAC ATTACTATCT 3540

AATATGCCAC TTTCATTTTT TAAAGTAATG GTTCCCGCAT TCGGATGAAT CACTAGCTGT 3600

AAATGCCCTG TTAAAAACAT TTGGTCATAC GTCGAAGTAT AGACGTGAAA AGAATCATTA 3660

GCAATTGTTT CGGTATAAGG CGCTTCTGCT TGATCAACAG GAACGATAAT GAAACGATCG 3720

CCAATCATAC AACAACCTTC TGCGGAAGAA TAAGGATTAG AGCCGTCATT TGCTACCAAC 3780

AAGAAGGTGT TCGTTTCAGT TAAGAAAGGG GCTAAACGTT GTTTTACTTC TTCAGAAATA 3840

GTCAGAGACA TTTGCCTCAC TCCTTTAAAT AGTATGTAAA AGGAAGAGGG TAGCTTGTCG 3900

CTATGCCGCT CTTCAAAGGT TTTGTTATTT GCTTGTTTCT TTACTGCTTT TTTTGAAAAG 3960

AGCAATAGAA AAGGCGACAA CTAAGACAAC TGTTGCAGCT AAAACAAATG GATTTTTTGT 4020

TGCTGaTGTG AACCCTAGGa CTGTTGAAcC AATCGCACCT ACGTAAGATA AAATAATCGC 4080

TGCTTyCTTt GGkGTTGAAA GTTCTGTCAA AGTG 4114



(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 
ATGnAGGCCA GATCCCCTCC AAAAAAAATG GTGGCCnnnT TACATTACCC CGCGGCATTT 60

GCCTTTGACT ATCAGAACTG AAGAACTTCA AGGAATTTTA GTCCTGTTGC CGACTCACCC 120 

ACGGACATTG GAAATCGTAA ACATCGCTGT CTCAGAAGAA AGCCGAGGGC GGGGCATCGG 180 

GCAAGAATTA TTACATTTTG CCATTGATTT TGCCAAAAAA GAAAAGTATG ATTCCTTGGA 240

GATTGGGACT GGGAGCACAG GCTTTCAACA GCTATATCTC TACCAAAAAG TCGGTTTTCG 300 

AATGACACAT ATTGAACCAG ATTTTTTTAT CCATCATTAT GATGAGCCTA TTATAGAAAA 360

CGGCTTGCCT TTAAAGGATA TGGTTCGCTT ACGGTTACAT TTAAGTTAAA AAGAGGTCGG 420

GACAGAACCG TTTAGCTCCG AGAAATAAGA AGAAATTTCC GAAAATTGCT CTTTAATTTT 480

TGGAGAATTT CGGCTTATTT CCGAAGaGTT GGTTCTGTTC CCGCCGTTTA TCAGTTTTTG 540

AGCGTGGAGC AAAAATCCAA AGTGATTTTT GTCCCACGCT CTTTTTAATG CCCAAGAATT 600 

GGTAACAACT CATCTAGCTG TTGAATCGTA TAATTGGGTT GAATGGTTGG TGTTGCTGGA 660 

GCGGAGGTCG GATTGAGCCA AACAGTTTGA ATTTTCGCTT GTTGACCTCC TTGAATATCT 720 

GAATGCAAAG AATCGCCAAT AATCATTGTT TTTTCTCGAC TAAAATTAGG AATCCGTTCA 780

AAAACAAAAT CAAAATACTC TTTCATCGGT TTTTGATAAC CCACTTCTTC TGAAACAAAA 840

ATATCTTTAA AATAAGGCAT TAATTTCGAA TCTTCCAACC GACGATACTG TGTTTTTGCC 900 

ACTCCGTTTG TCACAATATA TAAATCATAA TGGGGTGCTA ATTTTTCCAA GATACGTTGA 960 

CTATTGCCAA GCAAGTCATG TCCTTGACTT AAATATTCTC GGTAGTTTTC TCCCATTTTA 1020 

ACGCTATCAA CTTCTCGATT AAACGCGTCA AACAATAAAC CAAACCGTGT ATCAAGGAGA 1080 

ACATCTTTTG TTATTTTTCC TTCTTCAAAT GAACGCCATA ATCCTTGATT AATTCGGTGG 1140

TATGTTGACT CGACTTCMA TGTAAATGGT ACATTTTCTT CTTCAAATAA GGCATGTAGC 1200

GCCTTCTTTT CAGCCAATTT AAAATCTAAC AATGTATCAT CAACATCAAA AAGCAATGTT 1260 

TTCATTTCCG TTCCTCTTCT CTTTTCAAAT TACTCGGTAA CGCTTTGCTT GTTTGTTGTT 1320 

TTTCAATAAA AAAATAAGGC AATAGCAATT GTAACTCTTC TCTTGTTAAT GAACGATAAG 1380 

CACCTAAAGG TAATTGCGGA TCTAGCCGCA ACGGCCCCAT AGTTTGGCGT TTTAAAGCAG 1440

TCACTTTTTT CCCCACGGAT AAAAACATTT TTTTCACTTG GTGAAATTTT CCTTCTTGAA 1500 

TAGTCAGGAA AACATGACTC TCCGTTTCAG TCGCTGCAAG AACAGTTATT TTTGCTGGCT 1560 

TACATTGAAT GCCTCCTTGC AAAACAATCC CATCTTTAAA TTTAGCACAA TCTTCTGCAG 1620 

AGACCAAACC ATTGACTTTC ACTTCATAAC ATTTCGCTAC TTCTTTATTT GGCCGAATTA 1680 

ATTGATACCC TAATTGTCCA TTATCTGTTA ATAGAAGTAA CCCTTCCGTA TCGCCATCGA 1740

GTCTGCCCAC TGGATAAAGA CCAGGGCGTC TGTCTTGTGG CGCAATCAAA TCAATAACTG 1800 

TTTTTTTAGA TGCATCTGAG ACAGCGCTAA CCACACCTTT TGGCTTATTC AACATATAGT 1860

ACACATGGGT CTTATGCGAA ATTAATTGTC CTTTTACTTT AATCATTTGT AATTGCGCAT 1920 

CGACATTTTG GCTTTCATTT AAAGTCACGA TGCCATCCAC AGTGACCTGC TTGCCTTTGA 1980 
TTAATCGTTT CACCGTTTTG CGTGATCCAA ACTGAAGTTC TTCTAGTAAT TTATCTAATC 2040

GCATTGTTCA GCTCCATTCT GGTATAAGTA TAGTGGAATC CGTTAAAGAT GAAAAGTGGA 2100 

AAAAGGTAAT TCCTTGACGT ATAAAAATAT AAGTGTTACG GTTAATAAGT AAAGTAAATA 2160 

AAGGTTcTCA aAAACAGCGT AATGACCTGG GAGAGAACac GATAACatGA ACCATTCCCa 2220 

TGAGTGTTGC CTGGCAATAT TTGTGGGGAT TTTGTATGTA CATTTCAGCA GACACGAAAA 2280 

GGTCT 2285 



(2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 



ATCGgTCAtT GAAAcGGTTG ATCAGrATGa TCAAACmACG CTGTGCCTAA CTCAGATTCC 60

ATTTTTTTCA ATGCTAAAGA AAGTGTTGGT TGGGTGACAA ATAAATTGCG AGCGGCCGCC 120

GATAAACTGC GGGTATTAAC AATTTCaATG AAGTACTTTA AAAGCTGAAT ATTCATCAGA 180

AACTCCCTTT CTTTTTAAAA GCGCTTACAC AAATGTGCTT GGATAAAAAA TAACAAATAG 240

ATCACTTATA ATATAACATA TGTTTTGCCT TTTTATTGCT ATAAAAGTAA GAGATAGAGA 300

TTTTCTTAGC AGATTTTTAC CTAAAAATAA AAAATTGAAC GGAGGAAAAG AAATGAAGAT 360

TATTATCGCA GGTGCTGGCG CTATGGGCAG TCGCTTTGGA CTAATGTTAC ATCAAGCAGG 420

AAATGATGTT ATTTTTGTCG ATGGTTGGCC AGAACATATT GAAGCAATAA GAAAAAACGG 480

GTTAGTTGCT GATTTTAATG GCGAAGTGGT GACCGCTAAA ATTCCTATCT ATCCTCAGAC 540

AGAAGTAGCA AGCATCGATT TTACAGCAGA CTTAGTTATC CTATTTACAA AAGCAATGCA 600

ACTGGCAGGA ATGCTTGAAG CATTAGAACA TTTCCTTAAT AAAAATACGA AAGTTTTATG 660

CTTGTTAAAT GGAATTGGGC ACGAAGATAT TATCAAAAAA TACGTGCCTT ACGAAAATAT 720

TTTATTAGGA AACACAATGT GGACCGCTGG GTTAGAAGGA CCGGGCAAAG TAAAATTATT 780

TGGTAATGGT TCGATTGATT TACAAAATTT AGGGGCGCAA AGTGAACAAG CAGCAGAAGC 840

AGTCATTAAG GTATTAAATA ACGCTGGCTT AAATGCTCGT TATTCTGAGA ATATTTTATC 900

ATCCATTTAT AAGAAGGCCT GTGTGAATGG AACAATGAAT GGTTTATGTA CTTTGTTGGA 960

TTGTAATATG GCAGACTTTG GCGAAACAGA GCAAGCAGAT CAAATTGTTC AACAAATAGT 1020

TAATGAATTT TTAGCAGTCG CTGAAGCGGA ACAAGTTACG TTAGATAAAC AAGCAGTTCT 1080

TCAACAAATC CGCTCGTGTT ATAATCGAGA AACAATTGGT TTGCACCATC CATCGATGTA 1140

TCAGGACTTA ATAACCAATC ATCGTTTAAC CGAAATTGAT TATATCAATG GAGCAATTGT 1200

TCGCAAAGGG AAAGTCTATG GGATTGCTAC ACCGTACTGT CAGTTTTTAA CAGAGTTAAT 1260

TCATTGTAAA GAAAATTTAC TTCATGCTAA ATAACAAGTA AAAACAGAAA TACTCTTGAG 1320
1320 
TCGGTCTTGA GAAGAGCTTG ACCTACTTCT TTCTATACTT TGCCAGTGTA CAAGGCTGAG 1380

ACAAAATCCA AAGTGAGTTt GGCTCAGCTT TTTTGTTTTT TAAAGAACAC TGCAGAAAAT 1440

AAATCAAATC CGTGaAAmAT ATGTTAAACT AGAGAAGGTG TTTTTTCGCA AGAnTCTACG 1500

CGGaACACAG GACTCGTTGA AGAATTGAAT CAAAGAAACA ATGTGTTAAA GAAGGGATTT 1560

GAACAGATTA TGACCATTCA ATGGTTTCCC GGCCATATGG CCAAAGCAAG AAGAGAAGTT 1620 

TCTGAAAAAA TCAAATATGT CGATATTGTA TTTGAACTAG TTGATGCGCG GTTACCGCTT 1680 

TCTTCCCGTA ACCCAATGAT GGACCAAATT GTCCAACAAA AACCACGATT AGTCCTTTTA 1740 

AATAAAGGGG ACTTAGCAGA TAACGACCAA AATAAAAAAT GGCAACATTA CTTTCAAAAG 1800 

AAAGGGTACC AAACCCTAGT TATCAATGCC CAACAAAATA AAGGAATCAA TAAAATTGTG 1860

CCAGCAGCCA AGGAAGCGTT AAAAGAAAAA TTGGCTCGTG AACAGGCGAA AGGCTTGAAA 1920 

CCGCGTGCGA TTCGCGCCAT GTGTATTGGG ATCCCCAATG TTGGTAAGTC AACCTTAATG 1980 

AATCGTTTAG TCGGAAAAAA AATTGCGCAA ACTGGGAACA AGCCAGGTGT GACAAAAGGA 2040

CAACAATGGT TAGGCTCTGG TTCCCAATTG GAATTACTGG ATACACCAGG GATTCTCTGG 2100 

CCAAAATTTG AAGATGAAGA AATTGGAAAA AAATTAGCTT TAACTGGCGC TATAAAAGAT 2160 

CAATTATTGC ATCTAGATGA TTTGGCGATT TATGGATTAG AATTTTTTGC ACGTTTTTAT 2220 

CCGCAACGTT TGACGGAACG ATATGGTTTA ACAGAGGAAG AGCTATTTTT ACCTGCCCCA 2280

GAACAGTTGA TGTTAATTAG CCAAAAACGT GGTTTTAGAG ATGATTATAA TCGAGCCAGC 2340

GAAATGATTA TTTTAGAAAT TCGCAGTGGA AAATTAGGTA CGTATACTTT GGACCGCTGG 2400

GAAGAACTAG GAGACGAATA ACATGGCAAA AGAGTCGATT CAGGCAATCA AAGAAGCGTT 2460

GTTAACTGTC ACTGATGCAA CCGATGAACG ATTAGCAAGC TGGCGAGAAG ATGAACGTTC 2520 

AGGAGTGCAA CAAGCGATCA AACAATGGGA ACGTCGCCAG TTAGCCAAAG AAAAAGAGTG 2580 

GCAACTTTTT AAAGAAATGA GTCAATTTGA GGAAGCGGCC TATAAAAAAG GCCATCGTTT 2640

AATTGCAGGA ATTGATGAAG TAGGCCGCGG ACCATTAGCT GGCCCTGTTG TGACAGCTGC 2700

AGTTATTTTA CCGAAAGATT GTCAGTTATT AGGCTTAAAT GATTCAAAAA AACTATCTGC 2760

TAAAAAACGG GAAaCGTTAT ATAACCAGAT TCmAGAACAG GCTGTCGCTA TCGGGTTAGG 2820 

TATCGCTGAT CmAGGAGTCA TTGATCmAtT AATATTTATC CAGCCACTTA ACCAGCGATG 2880 

GAATGGGCAT TGACGATTTA GCTTTTTACC CnGATTATTT ATTAATTGAT GCCATGCnGT 2940

TAGCGTTCCC CCACCnCAGA AGTTTGATTA AGGG 2974
(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3078 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:

GCCCTGTTTG ACGCTCTATA GAnCTTTGGC TCTCTTGTnA CGTGTCCATA AAACCTCCTT 60

GGTGTGAAAA GAGCCGCAGC TATTCCGTCT GCGGCTCtAk GTAAGTTTaC ATCaGCCCtT 120

AAgCGGTAGG cCAGTTTGTG TCCGCCCCGC TGCTGGCTGT TGTACTTTTC CAGAATGACG 180

CGGGCAAAGC GCAGGGCGGC TTTATTGGTG GAGAAGTCCA TCTTGCCGCG CCGGATGATT 240

TCTTCCGGGT CAACCGCTGA CAGCCGCTTA ATGAAAATGC GGTCGTTCAG TTCGGTTTCA 300

TAGGTTTTCA GAAACAGCGC CATGCCGGAG AGCATCGAAG CACGGAGGGA AATGGATGCG 360

CCGTGCCAGG TGCTGTCTAA GAGTACCAGC AGGCGGGAAA AAGCTGTGCC GCCAAGCAGA 420

CGGTAGGCGT TGATGATGGC GCGGGTAGAA AGGATTTCGT AAGCGTCGCC GCTCTTTTTA 480

TCCAGTATCC AAACAAAGCC CGCAGCTTCC ACTAGCCGCT TGATTTCCAG AACCTCCGCA 540

TCCGTGCCGG ACTCCAGCAG GGCTTTTGTG GCGTGGGACA GACGAAGCCG CCCTTTGGAC 600

TGATCCAGCT TAAAATACAT CTCGGCTTCC TGCTCATAGG TCATGCCGGT GTAAATCAGG 660

CAGGGGACGG TCACgTTgCC GCCGTCTGCC ATTTTCCGCA TTGnGGCGAt CCGGTGCTGG 720

CCGTCCACCA CATTAAACTT TCCGTCGCGG AAnCTGACCA CCAGCGGGGT CaGgGCGGTC 780

GTCCCATTTG GCAATCAGCC GGTCAACGTC CTCCGGCTCC ACGGGCCGCT GGTAGGGCAG 840

GCCGGAAGTA AGCTGGTTGC TGGATAAGTC GCGGGGGACG CCGGGGTTGG CGTACTGGAT 900 

TTCTGTTAAA AGGGTTTGTT GATCCGTCCG CTCGTTACCT GGCGGACGGC GTTTTTTTCG 960 

ATTGCTCATG TCGTTATTGT CCTTTCCATT TGATTTAATA ATTCCTCCAT CGCAGTAGAT 1020 

ATGCTTTCAA AGCGCTGCCG GACAAATTCA AACTGTACCT TGGAAATGCG CGGGAACACG 1080 

ATAGTACAGA ACGGGTCGTG ATACCATGCG AACCCTTGTG AAAGGTCTGC ACAAAGCCGT 1140

CGATGTCTGC CAGAAGCGTG TCCGGCGTGT AGCTGCAATC CTTATCCGGG TTTTTCAGGT 1200 

CGGCTATGGA TTCCTCCAAA GTGTCATAGT GCTGATCGCy cwGCGTATAG GGAACGGAGG 1260 

GCTGCGAGGG AAGTTTTGTA ATCGGCTCTG AGGTCTGCCC TTGTTCCTGC ACCGGTTCCG 1320 

TTTGTTCTGG TTTCTGTGTC AGTTTGGACT GTTCTGGTTT CTGCGTTTGC TCAGCCTGcT 1380 

CCGGGATTGG TTCGGGCTGT TTCGGTTCCG GCATTGGCTT GGGGTACTTT GGCTCCTGCC 1440

TTGGTTCCGC CTGTTTCAGT TTCGGTGCTG GTTTGGATTG TTCCAGTTTC CGCGCCGGTT 1500 

TGGGAAGTCC CGGAGGTTCT GCCGGTTGAA ACTCGTCTGC GGACTTGATT TTACCAGCCG 1560 

CCAACTGTGC GGCGGCGTGT TCCTGCTGTT CCGGCTCCAT GCGGGACAGC TTCAGCAAGT 1620 

CCTTTTTCTT GATTTTGCGG TCGGAACCCT GAATGACTTC CTGCGCTTTk GGGGTtATAT 1680 

CCCTGGCAAT CTGGATATTT yCTtCCACCG AACGGGCGGA AATGCCCAGC TTTTTTGCTG 1740

TGTCCCGTGT AAAGGACTTT GACGTGGACG CGCCGCGCTC GCTCACGTTG TTGCCCTGGG 1800

CGCGGTTCAT TGCGTTTGCC TGCGACACGC CGCACCTTGT TTCCGGATGC AGGATTTCAT 1860

AGATTTCTTT CCGCCGCATC AGCAGCTTTC CAAACTCCAC CGTTTCCAGA TCGGTGCGTA 1920

CAAAGTTTTC GTCAATCTCA GCAAGTTCCG CCTGCAAGCC CTCCAATGTG CTGATATGGC 1980

ACTCGATTTC CGACCAGCCA AGGAGCTTTG CCGCTTCCAG CCGGTGCAGT CCGGCAATCA 2040

GGGTATAGTC CCGGTCAACG GTGATCGGAT TTAGCAGCCC TACCGCCGAA ATACTGTCCG 2100 

CCAGTTCGCG CACGGCTTCC GGCAGGGCTT CCCGCCGCCC GGAATTGACC TGGATTGCGT 2160 

CAATGTTGAT CTGCATTTTC ATCACCTCTT TCGGCCCTGG CCGCCTTATT TGTTTTCTCC 2220 

TGCCGGATGA AACACGTTGC GCCGCAGGAA TACTACGGCG TAAGGTCTTG CATCCCGCAG 2280 

GAAAGGGAAA GCCCGCGCCG GTTTTGTTTT GCGCCCTcAT TCTGTGAGGA AATATGACCG 2340

GCGTGGGTTT CCGCTTACTG CATTTCTTAA AAATCTGTGC CGTATTATCA GGCGGCATGA 2400

CCGCCTTACT GCCCACCCCC GGCACGGGAA TATGTAAGCC CAACATGGCA CGTCGGCGCG 2460

ATACCAACAG ACGGCGGCGG CAGGTGACGA AGCTGCCCaT ACCAATCCCT CAGTATCTCC 2520

CTTTTACCCT GCGATACGGG TTTTTGCGGT GCATCCCGCG TCTGCGGCGC TGCCATGCGC 2580 

CGGACAGCCA TGCCGGTATC GCTCGTGCCT TTGATGTGAT CGGTTTGACG GGCAGGGACA 2640 

GAGGTTTACA CCCCCTTTCG CTGGCCCTGT CCGCAGGCAG TCCTGGCGCG GAATGGCTGA 2700 

TGCAGGATGC AATCACTTAC ATACTGGATA TGAACAGGCG CTTGTCCTGC TGTTTTTCAA 2760

GGTGCTTTGG TTCTTACGCT CAGGGAGCAA ATTGGTGGAG AATACTGTCC AGTGTGGCAA 2820 

GGTCATCACC GGTAAGTTGT GCGCTCAATT TGACAAGCAG CGCAGCCTGC CGTGCCGTCA 2880 

ATTTCCTGCG ATAAAGGTAA AACCCATCCG CAACCTTGAC GCCGCCGCCA TACCGTCCGC 2940 

GTACCGTTTC AATGGGATAG GAACACATAA GCACGGCAAT ATCcTGACGG ATGGTTTCTC 3000 

TGGAAACTTG AAATTCAAAT GCCAGATTGC GGTAGGTATC GTGCyGCCTT ATGCACAGCA 3060 

CTTCCAGAAT CGCTTACT 3078 
(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

AAATATTACG ATaTTCmAGG GGCTTAAAAG CCCcTTATyC AtGCaAGTTA TTAAGTCGCt 60 

AAAGGACAAG TGGTGTGGGT ATTkGCACAm CCTGTTAGTT TAyCCTATCT CCTAATTCTA 120 

TTCTTAGGGA TTGTGTTAAC TGGCTTGTTT TGGAAAGAAG TACCATTACC ACAAATTTTA 180 

GGAATTAACT TAAATTTGTT AATCATGGTG TTGGTGCCGA ATAAAGTTGC TCGTTGGTAT 240 

TCCGAAAAGG AATTCGAAGG AAAAACAGGC TTTGGTTTTT TAAAAGATGG TTTTGTCTAT 300 

GTGAAAAATT ATGTGTTAGA TAGTAGACCA ATTGTAGCTT TCGAACGAGT AAAAGAAATA 360 

GAAGAGTTTT CATTTAAACG TTAGTACAAA AAGCGCACTA TATCTATGAA TATAAAAAAA 420

CAAGGAGCGC ATAGCATTAG CTCCTTATTT TTTATGTCTA AATTATGAAT TTTTATATTT 480

TAAATAAAGC ATGACTGCAC AGACTATTAT TCCGATGACT AAAAGTAGCA CAGTAACAAT 540

ACTTTCATTT TTATTTATTT TTCTTTCAGG TCTATAGCTA GGATTAATTT GTTTTGCGTT 600

AAAATCAGTA ATTCCAAAGT CTAAAGTAGG AAGTAAAAAC AGTAATAAAA TAATATATAT 660

AATTCCCGCT ATAAGGAAAA TAGGAGCGTT GGTGAAAAGT GAAATAATGG TACCTATGAT 720

TATAGGAGTT AAAATGTATA AAAAGGATTT GATAACATTT TCTTTTCTCA TTTTAATTAC 780

CAAATTTTGT TATTCATGTT TCCCATATTT CCTAAAGGAA CACAATAACC ACCGCTCATA 840

TTTTTTCCAC CTGTAGACCA ACCACCTTTT TTAAAATAAA AACCACCAGG GCAAGAGCTA 900

CCAAATTCAG CTTCTCTGGC ACTAGCTTCT GAACTAAAAA TAAAAGGTGT TACCATTGCG 960

AAAATAGTTA AAGCTAAAAA AACTTTTTTC TTCATACACT TTCCTTCTTT CCAATTATTA 1020

TTTAAAAGCT ATATTAAATA TAGCTTTATT TTATAATAAC TGAATGAAAA AAAGTTGTCC 1080

AATAATATAT AGAAATTTCT GAAAAATCAT TTTTAATGTT AATTACTAAA ATAAAATCAA 1140

CAAGAAAGGA AAAAATATGC AATTAGAATA TCCATTAACG AATCATATGA AAGGAAATAT 1200

TGTTTTAACA CAAGAAAAAA CAGCGATAgC TTACTATCGT GTTCGTTCTG AAACGGTAAT 1260

GTTGACTGAT GTTGAAAAAA AATTAAAAAC GAAAAGAAAA GTTGCTCGTG CGTTAAATCG 1320

CTTAAAAGCA AATGATGGAT TTGAAATTAA CCTTTTGCCA GTTAATGCCG ACATTCGAGG 1380

GAAAATGGGT GCTATGCGTT CTTTAATTGA TTCTAATAAT TATGAAGTAG GCGTAGATAA 1440

ATTGATGAAA ACGGCAATGG TTTTAGAACA AGAAATGGGT ATGGTTTACG AATATACGTG 1500

GATTATTGGT GTGCCTTTAA TCAAGAAAGA TCTCGCTGTT GATTTAAAAG AAACCGTTAC 1560

TAATTCAATT AGCAACGTAT CGGAAAAATT TGTTAAAGGG CTAGGTTTTG ATGTGGAACT 1620

AGCAGAATAT TGGGAAAAGG ATTACCGAGA ACAAGAATTA GAAGTTTACC AAAATCTATC 1680

TGAATTGTTA ATTGAACGAT TAACAGAAGA TGAATTGTAT TATTACCAAG CGTACCAATT 1740

CTTGaAAAAT ATTCCTCATG AGCGACAAGA TGTTCTTTCT AGTCAAAATT TAGaTAATTT 1800

aATGGCmACa AAAATTAAAC CGTTATTTGG CgGGGGATTG CATTTAAATT GTGATTATGG 1860

CGAAAGCTAT ATATCCTATC TTCCAATGGG AGATGTAGGT GTTTTTCTTG ATGGTAATCA 1920

TTTTTTAGAG GTCGT 1935



(2) INFORMATION FOR SEQ ID NO: 425:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 767 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425:

TTTGTAAAAC TTTTCTTTTG GCAAGCAAAC TATTTTGTAG AAAGCGGTTC ACTCTGTTAT 60

CATAATAGTT GTATGAAAGA CGTATGTTAC ATACTAAAAA ATAGGGGGAT TTAAAAAATG 120

ACTTACACAT TACCAGAATT ACCATATGCT TATGATGCAT TAGAACCTTA CATTGACGTG 180

GAAACAATGC ACTTACACCA TGATAAACAC CACAACACTT ATGTGACTAA CTTAAACGCA 240 

GCGATTGAAA AACATCCAGA ATTAGGCGAA AAATCTGTAG AAAACCTAAT TTCAGATATG 300 

AATGCTATTC CTGAAGATAT CCGCACAGCT GTTCGTAACA ATGGTGGCGG TCACGCAAAC 360 

CATACATTCT TCTGGGAAAT TATGGCACCA AATGCTGGTG GACAACCAAC TGGCGCTATT 420

AAAGAAGCAA TCGATGAAAC ATTTGGTAGC TTTGaTGAAt GAAAGCTGCT TTCAAAACAG 480

CTGCAACTGG CCGCTTTGGT TCAGGTTGGG CTTGGTTAGT TGTGAATAAC GGTAAATTAG 540

AAATCACTTC AACACCAAAC CAAGATTCAC CaTTAATGGA TGGCCAAACA CCTGTTTTAG 600 

GTCTTGACGT TTGGGaACAT GCGTATTACT TAAAATATAA AAACGTTCGT CCAGATTACA 660 

TTGAAGCATT CTGGaACGTk GTTAACTGGG aTAAAGTAAA CGAACTTTTk GCtGCAGCAA 720 

AATAAGTTTG CCAAtGGTAA AAGGAAAGAG TCAAATGACG GCCGnCA 767
(2) INFORMATION FOR SEQ ID NO: 426:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 

GCAGGTACTG GGATGGCCGT TTTCTTCGCA TTACAATTAT ACCTTGACTT CATCAACTTA 60 

TTCTTAGCTT TCTTAAGAAT TTTTGGTAAA AACAACTAAC ATGAACGTAA AAGAGGTnGG 120 

GACAGAAGCG TTTAACTCCG AGAAATAAGA AGGAATTTCC GAAAATTGTT CTTTAATTTT 180 

TGGATAATTC CGGCTTATTT CCGAAGGAGT TGCTTCTGTT CCCGCCGTTT ATCAGTTTTT 240 

GAGCGTGGAG CAAAAATCCA AAGTGATTTT TGTCCCACGC TCTTTTTTTA ACCCATTCCA 300 

ATAGCGAAAT TCAACAGTTT ATCGGAGTAT GTTGGGTTTt CyAAATCGGT TTTCATTGTT 360 

TCGTTCAGTA GACTGAGCTG ATCGGTAATG ATGCCATCGA CGCCATAAAA AATCATTCGT 420

GTCATAACAT CTTCATCATT TGCTGTCCAA GCATAAACCT TTTTCCCATC ATGATGAGCG 480 

GCATTTACAA AGTTACGATT CATTGTACTG TATTCCATAG TAAAAAAGTC AACTGGCATT 540 

TTAGGCGGTC CCACAATGTT GAAAGGAATA ACGTAGCCTA CATAAAAGCG TGGTTCTTCT 600 

TTCTTTAAGG CCATCGCCGT ATCATAGGTT AAGGTATGAA GGATATGGCC GTTTTCAAAA 660 

ATATTTTCAC GGTATTGTTT GACAAAGCGC TGTACTAAAT CAGGGCTATC TTGAGGCGTG 720 

GTTTTGATTT CAATTAATAA ACGTTGTTTT AATTGATTGG CTTTGGCTAA ATAATCATCG 780 

AAAGAAACCA TTTTAGCAGT TTGACCATTT TCTGTGACAT TCATATTCGT TAATTCTTTA 840

AGTGTTAATT GATTCGGTCG TTTATTAACG CCCGTTAACG CCCGTAAATT AAAATCGTGC 900 

ATGACAATAA ATTGATGGTC TTTGGTTTCT TGAATATCCA TTTCAACATA ATCAGGTTTT 960 

GCCTTATTGG TCTCGATTAA AGAGTCCAAG GTGTTTTGAA CGCCATTGTT ACCATTAACA 1020

CCACGATGGG AGATGGTAAC CGGTTTTCgG TCAGGTGTAT GACTTAGGTA ATcGTATTAT 1080

AgGTGCCTAC ACCAATnCCG AAGATGAnTG CAATCaAACm AAAAAGAcGA CtCtGAAGGn 1140

GGtCCATCnA CCcGAmCcyG ATTTCGGGGT GAACCATTCT GGCGTGTCAG GCAAAATCCC 1200 

TTCATCTTCC ATATCATCna CaATwaTGtA AAAaATAGCm ACaCTwGaCA ACACCAAGTT 1260 

TaAGAGCATC ATAmCCTGAA GGAGTGTCAT AGCGATGACa GCGCTAGGTA AAGACCACTC 1320

AGGTTTAAAC GTTTCGACTG CGGTTTGACC TAAAAATATC AAGAAATAGC CAGCCGTAAA 1380 

AATCCCTAAA ACAGTTCCTC CAATGATAAT AAATTGCCCT AAAATCTGGA AAAAGCGCCG 1440

CTTTGTAATT TGCCAACTAC GTTTTAAAGA CTCACGAAAc CAACATCACG TAAAATCAAC 1500 

TCTGGAAGGG CAAAAGCAAG TCGAATCGCT AAATATAAGA TAATCAGTTC TAACAATACG 1560 

AAACCAATAA TAATAATGAC GCGATTTGCA AAAATAAAAT CTAAGATAAA GGCTGGAATC 1620

TTAACTCTTG TTAATAAATA AGAGCGGAAG GTTAACCCAC TTAATGGTAA AATCAAAAAG 1680 

AAATATGCCA GGAAAAACAG AAAGGTAATT CCTCGAACTT TTTTGATCTG AAGCAAGGTG 1740

CCGTGCAATA ATTGCGTTAG AGTAATTGGT TGCTTTTTCT TAATAAAATA GACACTAAGC 1800 

AATAAAAAGG TAAATTCAAA AAAGAGCGCC AGTAAAATTA ACAGCCCAAT TAGTAGAAAA 1860 

GTCAAAAGGA CATAGGGATG TTGCGTAGca ATTGCCCCTA AGTTATCGTA GGAAATATAA 1920 

TCAATCCCAC CTTCTTTGAG TACCAATTTG GaCAAACTTG TCAATAAAGG CAATAGAACG 1980

AATAACATAA ATCCGTGCAT CAAGAGCACA TCGCGaAAAT ACGCCGTTGT CCCTTTAAGA 2040

AAATCTAGCG TATTCACmAT ACTATTTTTA AAATAACTAA ACATCmAAAG TCCCCCAGTT 2100 

TCtACCmAAA TTCC 2114 
(2) INFORMATION FOR SEQ ID NO: 427:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1065 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427:

CAATTGTCTG AACACATTTA GCGCCGTCCT GAATTAATTC ATGGCAGCCT GTTGAAAAAG 60 

ACTGGAAAAT ACTTCCCGGT ACGGCAAATA CCTCGCGCCC GTAATCCATC GCCGCTTGTG 120 

CAGTAATTAA AGAGCCACTA TTCTTACGAG CTTCCATCAC GCAGACCCCT AAAGATAATC 180 

CAGCAATAAT GCGATTACGT GCAGGAAAAT GATATTTTTT CGGCCCACTC CCATTCACAT 240

ATTCTGTTAA AACAAGCTGA TTCTGCTTCA TAGTTTGCTG TAATTCTTTC TTTTCATACG 300 

GATAATATAC ATCTAATCCA GTGCCTAAAA TTCCAATTGT CTGACCCCCG TTCTGGATGG 360 

CCATCTCATG GCTACGACTA TCGATACCTT TGGCAAGACC ACTGACGATT GTGAACCCTT 420

CTTTGACTAT TTTAGGAACC AGAGCTTCAG TCACTCTTAA CCCATACGAA GTGGCATAGC 480

GAGCGCCGAC GATGCCAATT TTCCTTCTAC TTAACAATTG TAAATTTCCT CTATAAAAAA 540

GAATtGCTGG TGGATTATAA ATTTCTCGTA ATAATTTAGG ATAGACGGCA TCTAAAATTG 600 

TCATAAATTG GTGTTGGCAT TGTCGTATAT ATAACGCTTC ACTATGAATC GTATGGTGTA 660 

ACCAAGATTG TTCAAATAAT TCACGATATT TCTTGATTTC AGCAATTCTA ATAATTTCTG 720 

TACTGGAAAA ATCAACACGC TTTTGTGCGA TAGCTTCTTG TAACACTTTT AAGATCCCTA 780 

AATTGCCAAT TCCTTGGCAT ACCGCTAATT TAAATAATAA TTGTCGTTGT AAAATTTCCA 840 

ACAAAAAACA CCTCTTTCAC ATCTTCTTAA AATAAGAATA CGTAAAAAAA GTGTTTTATT 900 

TTTAAATCAT ATCTTTTATT GGCGCAAACG TACGTCGATG AATGGGCGTT ACACCGTATT 960 

TTTCTAACCC GAGAgAkGTT CTTtTGTTCC aTAACCTGCA TTGynTTTAA ATCCATATCC 1020 

AGGATATAAT CCATCATATC CTTCCATTAA TCGTCTCTGG TACTT 1065 
(2) INFORMATION FOR SEQ ID NO: 428: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 



CTGACnAGTG CACACAGTTG TTGGCGCGAT GCTGGACGCC CtCGtTTTTG awcTTaAgyT 60

TgGGCtCtct TctGtTkGtT AGCCAGTnCA GCTCTTTTGC TGCTGGTcTT TTTCaGCCTT 120

tGCTTTTycC rCTyCCGcAt TTTtCGcTAA tyCTTmAAgC TTGGCTTGTT CTTGtTTCGC 180

AGCTGTTAcT GCTTGTTGAT CTTTATCATT TTGyACTTTA GCAGCATTCG TTTCTTTTGC 240

TTTTTCTTCG ACAACAGCTT GTTGTTGGTT AACGTCTGTT TGAGCTTGAT CyACTACTTT 300

TTGTTGGTCA TCAACwGCCT GTGTATCCGT AGCCACTTGC TCTTTGGCTT TTTCAATGGC 360

TGAAGGTGTT GCTTCATCCA CGACTTTTTT AGCTTCGTCT ACAACTGCTT GTTGGTCAGT 420

CACTGCTTGT TGACTTTGAT CCAATGCGTC TTTGTTTTGA TCGACCACTG CTTGTTGGTC 480

TTTAACAGAT TGATCAATGG CGTCTTTTTC TTTTTTCGCA GTGTCAGCGA CTTGTTGTTT 540

TTGATCGACA ATTGCTTGTT TCTCTGTTAC TTGTTGCTGT TTTTCAGTAA TAGCTTGCTC 600

CGTTGTTTGT GTAGCTTTCA CTGTTGGTTG TTCTGTAGAA CTGTTTTCAG GTGTTTTTGG 660

TTGCGCTTGT TCTGCTGCCT GTACTTCATT TCCCCCAATA CCTGCTACCC CTGTTGCGAC 720

TAAAATACTT AACCCTGyAC TTGCGATCTT TTTCATTTTA TTTTCCTCAC TTCCGATTAT 780

TTTTGTTGTA AATGGGATAC CCATTGTCTT TACGCCATTG TTGTAACGCT GTAGTTGTTA 840

TrTGGAACTC TTCCGCTATT TCTTTAAACT GTAArCCCAT TyTTTGATAG GCTTCAAATC 900

TTTCCTTGCT AATATGAGAA GAAAATTGTC TGGTGCTTGC TGTTTCTATC TTTTTTGCAA 960

GTTCTTGTAT TCGTTTAAAA CGCTCGGTAT CTTCATTCCT ATACCAGTCT GCGTCTTCAT 1020

TCATCAATGC CAACATCTCT TTACGCCAAA TACTTTGCTC TTTTTTTGTC ATACCCCTCT 1080

CACCTCAAAT TAATAGTAAA AAAATAACAT TTATTAAAAA AATACTATCC TTCTTTTGTC 1140

TtCTTGGACT TACCAAAAAA TAAAAATATT TTCGACTAGT TTATGGCGTC CGGAACTAGA 1200

AACCCTAATA GACAACGGTA AATGTAACAA AATTATTGTT ACAAGTGCCT ATATGCATTG 1260

CAGCACACAA CATATCTTTT ACAGATATGC GCTCTCCCAT TACGAAGAGC CTATTAAATA 1320

TTACTTAATA ATAAGTGTTT TAGCCTAAAT CCCTTAGTAG CTGCTCTGAT TAATTTAGGC 1380

GCTATAAATT GCTTCCTCTC TGGTTCAAAA TGTTTGTTTA CTAACTATCG TTATCATAGT 1440

TATTCATTTG CAAAAGCTTA TGATTACGAC GTTTGCTTAC ACTAACCAAT TTTTTGTTGT 1500

TAATGTACTA AAACAATCCT TCGTCCGAAA TAAAGGAGGG CCACTCCACC GCTGAATCAT 1560

TAAGCCTATC TTAATCTTCC ACTCTCTCCA CATTCTTCTT ACAGAGATTT CACAATTGTT 1620

TTTTTAAGCA AGTCCAAGAA GACmAAATCT ATCTTATGAT TTTTTATCGT AGTCCAAAAA 1680

TTTACTCATT CCTCTACCGG TAACTTTTAA CGGACCACTA GGAGAATTTC TTCTAGCGAT 1740

TAATACCCCA TTATGGTCAT AAACATGTTT AATTTGTTTA TAGACACTGC GCAAAGGAAC 1800

TTGACCAACT TCTCTTTCGC ATAACTTTTT CCCTTTAAAA TACTTCTCTA TTTGAACAGC 1860

AAACATCCCC TCAATTCTAT ATGTTTTTGT CTTTCTAACA AACTTTGATG TAGCATTGGG 1920

TAACTTACGC TTCTCTCGTT GACCAAAAAC AATAATTCGG TTGCCGTAGT ACCCAACATG 1980

TTCAATTCGT AATACTGAAA AGCAACTCAT TTACATTCCC CCAATCGAAT ATTTTTTAAA 2040

TAAAAATAAG AGCCATGCAA CACGTTGTTT GCACGGCTCT TACGAGTAGT TCCaGTACAA 2100

ACAATGCGTG AAAGCATTGA ATTATTCCAC AAGCGGGTCA TTTTTAGAAA ATGAGCGTGC 2160

TTGCTACTTG GTTTTTTATT TTTTCAACAA GTTAACAGAC AACAAAAAAG CGGGGAATAT 2220

ATACATGAAC TGTATACaTT CCCCGCTGCT CCAACATTTA TACAACTAAT TAAAACAATT 2280

CACTGTAAAA ACTTTTCCTA GACAAATAAA GGAGTTTAAA AAAGTCTAGA ATTTTACATA 2340

AAAAATTGTT ATACTATACG TTGTAACTAA TGTTGCAACA AACGAGAACC GAGTATAGTT 2400

CATGGGGTTC TGTTGCAAAG TTTTAAATAA AGAATAAAAT CCCTTACGGT ATCTATGATT 2460

TAAGCTGGGA TTCCCAATAA TACCTTGATT TCAGTACAGA CCGAAAACCC GAAGAGAGTG 2520

CCTTCTTTTC GGGTTTTCTT ATATAATCCT CGAATGGCTT CCATGCCTTT AATCGTGGTA 2580

GAGGCAGTGC GTAAACTTCG ATAGAATTTA TTGCGTCTCT TTACTGGACG ATGGTCTTGT 2640

TCAATCAAAT TATTCAGGTA TTTAATGGTA CGATGTTCTG TCCCTTGATA AAAGCCGTAT 2700

TCTTTTAGTT TCTTAAAGGC ACTTGTAATA GAGGGGGCTT TATCTGTGAC TAcAACtTCG 2760

GTTCATCAAA CTGCTTCACT AACCGCTTAA GAAAAGCATA GGCTGCTTGT GTGTCCCGTT 2820

TTTTACGTAA CCAAATATCC AAGGTTAAAC CATCTGCATC GATGGCTCGA TACAAATAAT 2880

GCCATTTTCC TTTAATTTTG ATGTACGTTT CATCCATTTT CCATGAATAA AAGGATTTTT 2940

TATTTTTCTT TTTCCAAATT TGATAGAGTA GTTTGCCATA TTCTTGCACC CAACGATAAA 3000

TCGTCGTATG AGAAACGTTA ATGCCACGAT CATATAAGAT TTCTTGAACT TCACGATAGC 3060

TAAGGTTATA ACGAAGATAG TAGCCCACGG CTACAATAAT CACATCCTGC TGAAATTGCT 3120

TTCCTTTAAA ATGATTCATC GTCATTCCTC CTGCTATCTT TTTCTATTAn ACCACACAAT 3180

ATATTTCCAA TTACCTCTTT CTAATACTGG CA 3212
(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 



ATTGAGTGAG 


GTATTACnGG 


AnATTATGGT 


TCGCTATGGC 


GAATTATCAA 


C T AAAGG An n 


60


AAATCGTAAA 


ACGTTTATCA 


TGCAGTTAGC 


ACAAAATGTG 


AAACGTGCGT 


TAGCTGATTT 


120


TCCAGCATTG 


AAAATCCATG 


CAGATCGAGA 


CAGAATGCaT 


ATTTTaTTAA 


ATGGGGAAGA 


180


TAGTGAAGAA 


GTGATTCCGA 


AACTAAGTAA 


AGTGTTTGGA 


ATTCAAAATT 


TT ICGCCAAG 


240


TATTCGTATT 


GAAAAAGAAA 


TGCCAGCGAT 


TCGTGCAATG 


GTTCAAGAGG 


r I G i 1 GGGGA 


300


AGTTTATACG 


CCAGGTAAGA 


CGTTTAAAAT 


TACTGCTAAA 


CGCTCGGAGG 


AI i GA1 1 1GA 




ACTTGATTCA 


AATGGCTTGA 


ATCAAGAATT 


AGGTGGGGCC 


GTTATTGAAG 


f^r*T\ tt^pppp a 


420


AATTCAAGTA 


CAAATGAAAA 


AGCCTGATAT 


TAACTTAAGA 


7\TTP7\ 7\ 7\ TTP 

All GAAAi 1 L. 


P T 7i 7A 7A P 21 T P C 


480


GGCGTATTTA 


TCTTATGAAA 


CGATTCGCGG 


I GGGGGI GGA 


t tp pp 2i p. t p. p. 




540


TCGTGGTATG 


TTGATGTTGT 


CAGGAGGCAT 


TT^TV r P r P/^7\ 

1GA1 1 CA^LA 


oil ul i 1 1 




600 


GAAACGTGGC GTGGAAGTGG AAGCTGTCCA TTTTGCTAGT CCGCCTTATA CAAGTGAACA 660

AGCGTTACAA AAGGCTAAAG ATTTAGCTGA AAAATTAGTC CCTTATGTAG GAACTATTCA 720

ATTTATTGAA GTGCCCTTTA CAGAAATTCA AGAAGAAATA AAACGAGTTG TTCCCCAAGG 780

CTACTTAATG ACCATTACAC GACGTTTAAT GTTACGCTTA ACAGATGCAA TTCGTGAAAT 840

GCGGAAAGGT TTAGTCATTA TTAATGGTGA ATCGTTAGCA CAAGTGGcTT CGCAAACACT 900

TCAAAGTATG GTAGCAATTA ATGAGGTTAC CTCAACACCG ATAATTCGAC CCGTTGTTTC 960

AATGGATAAG ACAGAAATTA TCGAAATTGC CGAAAAAATT GACACGTTTG AATTAGCCAT 1020

CCAACCCTTC GAAGATTGTT GTACAATTTT TGCGCCACCG CAACCGAAAA CACGACCTCG 1080

CTTGGATAAA GCGCAAGAGT ATGAAGCGCG CTTAGACTTG GAAGGTTTAA TGGCTCGGGC 1140

CTTAGAAGGA CTCAAAATTA CAGAGATTTC AGCAGAAACA GCGAAAGATA AACAAGAAGA 1200

CGAATTTGCA GATTTCCTAT AAGArATAAA TACAGTGACG GTCGCTTAGA CGTCACTGTA 1260

TTTATTTTGT ACTTATTTCA CmAACAAGTT GCTCGGAGAG GGGAACCGTG ATAAACTGAA 1320

TATGTGAAGT AGGTAACAAG GAGGACAGAG GATGGAGAAA AAAATGCCTA AAGCAACGGC 1380

AAAACGTTTG CCGGTTTATT TACGTTATTT AAAAATGTTA GGGgATTCTG GTGTTAAaCG 1440

CATTAAaTCA CGAGaGTTTA GTGAAATGAT CCAGATTCCT TCTGCAAC 1488
(2) INFORMATION FOR SEQ ID NO: 430:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 



ACGATGGTAT GTGCCCTTTT CCGACGACCT GTCAACnCnA AAACCTTGTA TCCTAGCAGA 60

CTGTTCGTCT TTTCCGATCT GATCTCTTGA TTGTACTTGC CATTCTTTCG TACTTCCTCG 120

TTCATGGCTA AAGATCAATG TATTTnCCTT TTAGTTTTTC TGTATCTAAC TGAGAACTAT 180

CGATAGCACC GTCAGTCATT GAnCTGCGAC AGCGACAATA TATTCCATCA TATCTGTTGC 240

TAAATAGAGT TTTGGCTCTT TGCCCTCGTT ATCTAACGAT CCAACCAATG AAATCGGTAC 300

ATCCATTCCC ATCGCTTTTA ATTTCATCTC TAAGTTGAAT GCTTTTTCTT TTTTGGCATC 360

TACTTGAATC TTCCCCAGAA AGCGATGCGT CTTTGATTTG CGTGATGAGC ATCCCAATCA 420

TAGGATTCGT TTGTGCACCA TCTTCTTGTG AAAATTTCAT GTCACTAATT GACATAGAGA 480

AATCyCAAaC CCCAGATTCT TGTGCGTTyT GATTGCCGAT TCCTTGAATG AACTCATCTT 540

TCGCACTTTT GCCACAACCT GTCAGTAACA AAGCTGAACA GAAAACGCCT GCGACAATCC 600

CCACTTTTCT TTTTAACACA CTTTTTCCCC CTCATTATAT AAAAAATTAA TTCCCTTTTA 660

TTAAGAATAG CAAACATTGC TTATAAATAG CAATGTTTAA TTTTAGATTT CCTCTCTTTT 720

AATAAATGGG ATTGATTTAA TCTTTTAGGT TTGAATCATT AAGCTGTATT ATTTTTAAAA 780

TAAAATTGCT TTTCGCCAAA AAATACTGGG AAAAGTTCTG CTAAACTGCC AGCTAAAGGA 840

CCTTCTCGCA ATTCTTTTTC AGTGACCCAA AACAAGGGTT GTTCCGTTTC TTCTGGTAGT 900

TCATCCGCTG TTTCTGCAAT GAATAATGTG GCTAATTCCC GCGCACCTGA ACAATCATTC 960

AGCCATTCTG CAACACCTAC TAATTGTGGC TTCAGCACTA ATCCTGTTTC TTCAAGGAGT 1020

TCGCGAACCA TGGCTGTTTC CATGcCCTCG TTTTTTTCAA CATGTCCGCC AGGGAACGTC 1080

CAACCAGGCC AATCCTTTTT TTGTCGTTCT TGCACAAGGA TTTCATTTTT CTGATTCTAA 1140

TCATA 1145



(2) INFORMATION FOR SEQ ID NO: 431:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4628 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431:

CAGAAACACT CTTTTTTTGT AAAGGATTAA ACTTTAAAGA AATCTATTTA GAAAATGCTG 60

TAAAAGACGG CCTAGAAATC TATGTTTCTG AGGTGCCTTA TGAGGTTCCG GCGCAACTAG 120

GAATTATCGT GACTGACATT AAAAAAGCAA TGGCTGTGTT AAGCATGGCT TTTTATGATT 180

ATCCACAAAA TAAATTAAAA CTGATTGGCT TTACTGGAAC CAAGGGTAAA ACAACAGCGG 240

CTTATTTCAC CAAATATATT TTAGATGTTG CTACACAACA AAAAACAGCC TTGCTTTCTA 300 

CCATGAATTC GACATTAGAT GGCAAAACAT TCTTTAAATC TGCCTTGACT ACACCCGAAT 360 

CTTTGGACTT GTATCGAATG ATGGCAACCG CTGTTGCTAA CGGGATGACT CATTTTATTA 420

TGGAAGTTTC TTCCCAAGCC TACAAAACTA ATCGTGTCTA CAAGTTATTT TTTGATGTCG 480

GCATTTTTTT AAACATTACC CCCGACCATA TCAGTCCAAT TGAACATCCA ACCTTTGACG 540 

ATTACTTTTA TTGCAAACGT CAATTGATTA CACATTCTAA AGTCATTGTG TTGAATCACG 600 

AAGCTGATTA TTTTCCATTA CTTAAAGAAA CTGCACAACA GCAAAAAGTC CCAGCCATTG 660 

TCTATGGCAG TCAGCCGGCT CCAGAAGTAG ACTATTCATT CGCTGTTTCT TCAGAGGATT 720 

CCTTACGTTT TATCGTTGAA AGTCCTGCAG ATGCGTTAGG ATTGGCCGGC TCTTATCACC 780 

TTCGTTTAGG TGGTGATTTT AACAAAGGTA ATGCGTTAAG TGCGGCGATT GCTAGTGTCC 840

TTGTGGGAGC TTCAAAAGAA GAGTGTCAAC AAGGAATCGC AGCAACCACC GTCCCTGGCC 900 

GGATGGAATC TTTAACGAAT ACCAACGGTG CAACCGTCTA TGTTGATTAC GCTCATAACT 960 

ATGACAGTTT GAAGAATCTT TTAACCTTTG TCCGTGAAGA ACATCCTGAC GGCCGCTTAA 1020 

TTGTTCTTGT CGGTAGCACG GGAGACAAAG CGATTTCTCG CCGCAAAGAC TTTGGTAGGG 1080 

TTCTTTCTGA ATTGGCTGAT GTTGCAGTCC TAACAACCGA TGACCCAGCC AGTGAAGATC 1140 

CAGCAAAAAT TTGCCAAGAA ATTCAAGCCC ATATTACCAA AGAAATGCCT GTTTACACCG 1200 

TTTTAGATCG TGGCGAAnAA TTGCTCACGC ATTATCTTTA AGCACAACGG CTGATGATGC 1260 

GATTGTCTTA GCTGGTAAAG GCGCCGACCT TTATCAAAAA GTAAACGGCG TAGATGAACC 1320 

TTACGCaGGA GACTTTGCCC TTGCGGAAGC TTTCATTAAT AAAAAGrACT AAATCATTTA 1380 

AaCaCACATG AAAAAAACAA GCTGCAACCA CTTATCAGCA AAGGTTGCAG CTTGTTTTTT 1440

AAAGAACTTC TTTTATTTTC GTTACTTCTT CTTCTGATAA GCCATGACCG CCATCCACTA 1500 

GTAAAAAGTC GACTGGAAAA TGTTTCTTTA ATTGATTAGC TAAAGAAAGA ACTTGTCCCG 1560 

GAATACTCAA CTCATCCTGA GCCCCTGTTG TGACAATCAC TTTTGTCGCA AACTCACCAG 1620 

ATACATATTG ATAGCCTAAA TTGCTTGGAT GCAGTAAAAT AACAGTGTTA GCAATAGTTG 1680 

GGTTCTTTTC TAACAGGCCT AAAATAAAAT TGGCACCGTT AGAGTACCCT ATAAAAATAA 1740

CTTCATCGTA TAACTTTTGC GGTTGCCAAA CTTCTTCTAG AAACGCCGTC ACTCGCTCAT 1800 

CAAAGTTAGC ACGAGATAAT GCTCCATTGA CTAATGGCGC AAAAAAGCGT CGCTGTTCGC 1860 

CTACACCTTC TGTCCCTAAG TAACTTAGAA CACTCGCTTC TGGATAAAGG GTTGCGATGG 1920

TGGTTAATAA TTGATATTCA TTGCCACCAG TACCATGGAA AGCGACAACT AATTGATTAG 1980 

GTTTCGACCC TTTTTGAAAA AATTGTTTCA TTTAATAAAC CTCCAAGAAA GGAATTTTAC 2040


TATGTAXbbA


(~* A 7\ ATTP ATP 

bAAATXbAXb 


AC A X b X b X A 1 


TTTAAfiTPPL 




CTACATTTGA 


2100 


CTTCTATGAX 


7\ AT l A/~'AP ,f P/'~ > /'"* 

AA1 AbAb J. bb 


GAX XAAAAX X 


At X AA X Vj AAA 


rV\_^ vj rt I V- nr\^ \* 


AAGATGACCA 


2160 


CACCATGXAX 


b Ab 1 X b X X X X 


1L1 Lnb A X AA 


I tr\OtH.rY^orV 




AATTAACTTT 


2220 


TTTTGAATTG 


AAXbAXbbtb 


nnpTiTPZiziTP 
AHun X t AA X t 


till obbntn 


AATAPC^ATTG 


AACGAACTAT 


2280 


TTTAAAAGTA 


CCAACGCGCG 


Gb X b X X XbbA 


X 1 1 1 1 uouun 


n a ArriTrTTf; 


AP,PAAPiPPiKG 


2340 


TATCTGCCAC 


TATGGTGTGG 


AAAbAX X XAA 


tt AAt AtttM. 


r\ I 1 1 lu^ol l 


TTP.AA(^PnPP 


2400 


CGATAATACG 


CAAATGGCGC 


TTGTTCCGTT 


AbblbAAl 1 1 




A APATT ATTT 


24 60 


TCCAGCTGAG 


CATTCCGAAA 


T>i*ppnPT)vr'7\ 

TTCCAGTAGA 


AGAXGGAAX X 




&TPPA ATTPA 


2520 


ATTACGCGTC 


CAATATGCCG 


TAGCTACCAA 


AGAAGG I X X A 


b X bbA 1 Inl 1 


TliP ATTPPPA 


2580 


TGAAAAAGAA 


ACAGTACCTT 


TTTTTGAAAC 


TGGGGAX GAA 


uinnLLul 1L 


TPPAA A ATP A 


2640 


TCATCCACAG 


TTTTATCAAG 


AGGTGCATAT 


^'A'T"TT , *A r P^*A/™ , 

CAT IGAI GAG 


pprp 7\nT^ A J\mp 

bb X AbbAAl U 


PPPTTPPP AT 
tut 1 1 Ottnl 


2700 


TGAAGGAATT 


GGTGGCGTTC 


ATCACGTCGC 


GTX XGGGGXG 


b AAb A X ntbL 


Z1Z\PAATTAAP 


2760 


GCAAGTTGAT 


AGCCTGTTAG 


TV Tt TV TV /*> TV "A 

AG G AAAAG AA 


m rp T» f> A OA A A P 

TTTTACAAAG 


r n/™ , A/ r T'A ATI 1 A 

XbAbbAAX XA 


21 flPAPPPPPA 
AALj At t t kart 


2820 


ATTTTTCAAA 


TCATTGTATT 


TTAGAGAACC 


GAACCAG X X G 


X XAX XX \d AAU 


THPPPAPAPA 
1 riot ^_-rt.v»,ri^rTi 


2880 


AAAAGGCCTG 


CTAGATGCCG 


AAGCCTATGA 


TV TV TV l"P /~i TV TV TV /^/™t 

AAAT C AAAG G 


X b AAAb X X X b 


HTPZ1 A ATPPP 


2940 


CCTTTATTTG 


CCTCACTTTT 


TGGCAGATCA 


AAGAGAAGGA 


a TrfAnrrr a 
A X bbAAbbbA 


TTTTAPP APA 


3000 


ACAACGCCAT 


GAGTAAACAA 


AAATAGCGGT 


TCTAACCGGT 


AX lXXXbXXX 


TTTA AT ATPP 
111 rlrll ri 1 


3060 


TATTATTGCC 


CTTTTTTGAA 


AAAAAGGAG I 


AG AG X AA X Ab 


T7iP Zi m PA APT 
X Ao A i Vj/\HA- L 


PPPP AAGGAG 


3120 


GTGAGTACAT 


GTTATCTAAG 


GAAGAAGTTC 


TAGAG X XAX X 


A APPA APPA 

b AAC 0 AA^jt rV 


na A A AAPAAP 


3180 


TCGATCGTTT 


AGAAACAAAC 


CGCCAAGAAb 


AX 1 1 AvjLjXAA 


TTra ATP A AP 


TATATTGAAA 


3240 


ATGAATTACA 


GCTCCAACGT 


GTCTTATCTC 


AAb Jl bbAAGG 


1 lnl untt/vin 


PTAPTTGGTT 


3300 


AATTGATTAC 


GCATTAAATG 


AG AG AGG AG L 


A APP7\P7\TTP 

AAGGAGAX 1L 


Z\PPTPAD.TPA 


ATPTPPTTGP 


3360 


TCCTCTCTCa 


TTTTTTACTA 


GAL X AAX X bb 


X AAGG AAb i. X 


r"vrr' r rrT , h zit 


AATTTTTGCG 

An J, 1 1 JL 1 vj 


3420 


GTTGTTTCAG 


CGATTTCTCC 


A PT A A A rT*f~* A 

AG1 AAAbbbA 


7\ TPP ATTPTT 

AX bCAX XGX X 


pp n TZiTPTTP 


APPTPAAPPP 


3480 


ATCGTCATTT 


TACTTGTCAA 


AATAbl GGAG 


bAAAGGAb I b 


P7ATPPTTTTP. 


TPPAAAAPTT 


3540 


TGATGCAATT 


CGTTGGCTAA 


AGGGAXGG X X 


X 1 A I X X Ab 1 1 


PTTPTPPTTT 
bl 1L1 111 


PPPTTPTPiTP 


3600 


CGACCAAAAA 


AGTAGCCCAG 


AG AG A 1 AAGG 


bbbbb iorlriri 


PDPPAPPAPA 


AATAPACTTC 


3660 


GCTCCTCCAA 


CACCTACAGG 


a a a f A P A A 

AAAG GG AG AA 


bbbAl but^A 


PZiPPTTPA AT 




3720 


GCATCGAAGT GTTGCCGAAT GGTCGCCACA ATCGCTTCCG AGCAATAAAA ATCACCGTTA 3780

CGATAATAGG CTTCCGCTTC TTCCCTGATT GCTTGTAAAC AAATTCTCTT TTTCAAAATG 3840

CCCTCCATTC TGTATCCGCT TTCATTTTAT GAATAAAAGG ATGCCAAGCA TCCCTTTATT 3900

TTATAGATTT TCTGCAGAGG TAAATACTTC TGACACATCA TCATCGTCTT CTAACTTGTC 3960

AACCAAGCGC TCTAATTGCG CTTTTTGTTC ATCGTTCAAG GTTAGTAAGG TTTGCGGAAC 4020

CATTGTTAAT TCAGCTTGTG CTAATGAATA ACCTGCTTGT TCTAAAGCAT CGCGAACCGC 4080

TGTAAAATCT TCTGGTGCTG TGTAAATTTC AAATACTTCT GGTGATGTTT CTATATCCTC 4140

GCCGCCTGCT TCCAAAACGA CTTCTAGCAT GTCGTCTTCC TCAATTGCAT GATCTTCTCG 4200

CTTAATGACA ATATAGCCTT TACGATCAAA CATATAATTC ACCGAACCCG TTTCACCTAA 4260

AGAGCCACCA TTTCGAGTAA ACGCGACGCG AACATTGGTG GCCGTTCGGT TGCGGTTATC 4320

TGTTAGTGCA TGAACTAAAA CTGCCACGCC ACCTGGCCCG TATCCTTCAT AAGTTACCTC 4380

GTCATAATGT TCCCCTTCAC CAGCACTACT GGCTTTTTTT ATGGCGCGCG CAATATTATC 4440

ATTGGGCATA TTGGCTGACT TAGCTTTATC GACTGCTAAA CGTAAAGcAG GATTCATTGC 4500

GGGGTCAGGG CCACCTGCTT TTGCCGCCAT ATAGATTTCC CTTGAAACTT TTTGGAAAAT 4560

TTTCCCACGT TTGGCATCTT GCGCATTTTT ACGCCCTTGA ATGTTGCTCC ATTTGCTGTG 4620

TCCTGACA 4628



(2) INFORMATION FOR SEQ ID NO: 432:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432:



AAATTATTGA 


AGTTATTACA


GAnAAATTTG 


AGAGTAGTGG 


AGaAATCGAT 


ATATTATTTA 


60 


ATAGCTTTGA 


TTTAGTCTCA 


TATTGGTTGG 


wACAATACTT 


TTCAATAmAT 


GAACTTTCTG 


120 


AAAATCATGT 


AATATATTwr 


CGCATAGAAG 


AAATTTTAgA 


AAAATATTAT 


TTAGTGGATG 


180 


AAGTTAATAC 


TGAGGAACCC 


GCAATAAATA 


TAAGCAAGAC 


mGrAAACaTT 


ATTATATGgA 


240 


ATATTACTyC 


ACCaATTaTC 


mAAAAATwGG 


rATgGCAAAA ACAAGTAGTT ATGAACGGAg 


300 


CTCATTAGTG 


AACTCATTGA 


TACGCTTGAA 


ACTTCTGATT 


TGGATACTTT 


AGAtCGAATA 


360 


TTTTATCCTG 


AATATAAGAA 


AAAAATGACT 


GGTTTGCTTA 


TTGaTGATAA 


TGGTAAACTT 


420 


AGAGTTCCTA 


CTCACGGATT 


TCAACTTTTA 


AAAATTAGTG 


AGTACGAGaC 


GAACCAGTTG 


480 


TTGGATGAGT 


TAGGAGAATA 


CTTGAAGGGA 


CAAGGCTATG 


TATATGGTGC 


GATTCCTAAA 


540 


AAAGACAATC 


TACAATTTTG 


TAATAAAATA 


GTTGGTTTTT 


TGTATTCTAT 


TTTAGAAAGA 


600 


GAAGCAAATG 


TATTCAATAA 


AAATCAATTG 


TTAAAACTTT 


TAATAGCTCA 


AATTGAAACC 


660 


CTTCTTCCTG 


TACAATTAAG 


AGGAGAATCA 


TCCTATAATA 


ATGATATAGC 


TCTTTCAGTT 


720 


CAAGAGAAAG 


ATCGTTTTTT 


TGAGCAGTTA 


AATGAGGATA 


ATAGAAATTC 


AATAGCGACT 


780 


AAATTTTTGT 


TAGAATATGT 


AGTTGCATCA 


CCAATTACTG 


GAGAACAAAA 


TGTAGGAAAG 


840 


TGGGAGATTG 


AGCGACTTTT 


AGCAATATGT 


TCATTAATAA 


TTGAATGGGC 


ACACCGGAGT 


900 


GATTATTTTA 


AATATAATTT 


TGTAGATACT 


ACAATGAATT 


TTTTACAATC 


AAATAGAATT 


960 


GGTATTAAGA 


AGAAAGACTT 


TAAAAATGTA 


AATTCGGCAA 


TGTTAGCTTC 


TAGGAATCTT 


1020 
CAATTGGCAA


ACTCTAATTT 


ACCTATTTCG 


GAGAACAGAA 


GATATGTTGA 


AAGAGTGAAC 


1080 


CAACTTTTTA AATCTAAGTT 


AGATAGTGCA 


TTTGTTGAAG 


CATTTGGATA 


TTCATATGAG 


1140 


GAATTTAATT 


TAGTGATAGG 


AGGTCTAAtA 


GATACACACA 


ATAATTTAGA 


GAAAATAGTA 


1200 


TGGATTGAAG 


AGGAAGAGGA 


GTTAGTTCGC 


AAAATATTTA 


ATGATTTAGA 


CAAAAAAATT 


1260 


AGTGAGATTA 


AAATAGTATC 


AGTATTAAAT 


TCTATTGCTC 


TTGTAGAAAG 


AGAAAAATAT 


1320 


TTAACTCCAC 


CAATAGGATT 


TGAAAAACTA 


GATATTTTCC 


CTTGGAGATT 


CAATCGTCGA 


1380 


TTGTCATTCA 


TTAGAAGACC 


TCTAGTAAAG 


TATGAAAATA 


AGTATATGTT 


TGGAATTAGG 


1440 


AATGTAATAC 


ACGCTCATAA 


ATATTTAATG 


CGACTGATTT 


GGGGTGGATG 


TTTAAAAACA 


1500 


AATAGTCGTA 


AGATGAAAGA 


TGTTATGTCT 


AAATTACGAA 


ATATACTAGG 


TGATGAATTT 


1560 


AATGATCGGG 


TAAAAAGTAT 


ACTCGAAACA 


TATCCTAACT 


TACAAGTAAT 


GAAAGGGGTA 


1620 


TCTAAAATTG 


GAAAGAAACG 


TATTTCTGGT 


GAAAATAATA 


ATACATTGGG 


CGATATAGAT 


1680 


GTTTTCGCTA 


TTAATCGAAA 


ACAAAAAAAA 


TTATTTGTTA 


TCGAAACCAA 


AGATTTTAGC 


1740 


TTTTCAAGAA 


ATCCCTATGA 


ATTAGCTATG 


GAGCAAGAAA 


AAGTATTTAT 


AGGAGATAAA 


1800 


GCTTTTTTAA 


ACAAGCACTT 


AAAAAGAAGT 


GATTGGATTC 


GCAAAAATTT 


AAAATATGTG 


1860 


ATTGATAATT 


ATGATTTAGA


AGATGGAAAG 


TGGGAAATTG 


TTACTATGTT 


TGTTGTTTCA 


1920 


GAACATTTGA 


TAACAAAAGA 


CTTGGTAGAT 


ACTAAAGGTG 


TTAAATTTAT 


TTCATTGAAA 


1980 


GAGCTAGATT 


ATAATATGTT 


CCATTAATTG 


GAAACTAGTA 


AAGTTTTAAA 


GTTATAAAAA 


2040 


ATAGTATTGT 


TCTGATTCAC 


GCATATTTTT 


GGGTACGTCT 


TTAGGTACGA 


TGGTCTTAAA 


2100 


ATGTGGTGAA 


ATTCATTAAA 


ATCGTACCCG 


TTGAAAAGTT 


ACTCAGTTTA 


TTGCTGTTAA 


2160 


AAGGTTTTCT 


AACTCGTTCA 


AAATCTCTGT 


CACAGAACCA 


GGTCTTAAGA 


AAGCTCGTAA 


2220 


AGCTTCACAG 


TTCTCAAAAC 


GTTAATTATT 


TGGAGTCGTT 


GCGAACGTGT 


TCGCTTACGA 


2280 


ATCAATTACA 


ATGTTTCAAG 


ACACTTTCCA 


TTTTTTGGAA 


GGTGTCTTTT 


TTTATGGTAG 


2340 


CGACAGTGTG 


TGTTCTGTCA 


TAAACCAGTA 


TATAAAAATT 


CACATACTGA 


CAATATGTTT 


2400 


ATCTTTCTTA 


CCAAGAGAAT 


AATTGATTCC 


GATTTCAGCA 


AGTATCCTCA 


TTCCGAATAA 


2460 


CGCACTTACG 


CAAATGATAC 


ATTTCTTTTT 


AGTTGATTTC 


CTCATTTTGA 


TCGCCGCTAT 


2520 


TTCTTTTTTC 


TTTGAATTCA 


TTATACGTTT 


TTGGGAGATA 


TTTCTTTATT 


AACTTTTTCT 


2580 


CTCTTGCGTT 


TTGGAGGGAG 


TAAGTTGCGG 


TTTTTTGAGA 


GATTAGTATG 


TTTAAACTGA 


2640 


AGAGTTATTA 


A 










2651 


(2) INFORMATION FOR SEQ ID NO: 433:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1966 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433:



nAAGGCCCGT 


GTGGTTGGAA 


AGCCGnTACG


CAT.TG.CCAAT. JAGTCGnACA 


AAAGGCAAAT 


60 


ATTTTGACGT 


ATATCAAAAc 


aATTAGGTCT 


GACAAWbbAA 




GTGGCTAGAA 


120 


ATACGTTTGA 


TGTAGATGAG


ACTTTAGAAA 


AAGAATTTAA 


TTGGTCCCAC 


TATAAACGAT 


180 


TAGGCGCCTA 


TATTAAGCCT 


TACAAAAAAG 


CTGTTTTTAA 


AACATTATTT 


GTCATCATAT 


240 


TAGCTAATCT 


GGCAAGTATG 


TTAGGCCCTT 


ACTTTACTAA 


AATTGCGATT 


GACCAAGTAA 


300 


TTCCACAAAA 


GAATCTCTCG 


TTACTCTTGA 


TACTAGGGGC 


TATTTTCCTA 


TTTTCCTTGG 


360 


TGATTATTGG 


TTGGTGTATG 


CGTTATCGGA 


TATATGCCAT 


CACAGAAATT 


GGACAAGATA 


420 


TTTTAAAAGA 


TATGCGTTTT 


TCAATTTTTG 


AGCATCTACA 


AAAATTACCA 


TTTTCTTACT 


480 


TTGACAGTCG 


GCCGCATGGA 


AAAATCTTAA 


TTCGGGTGGT 


CAATTACATC 


AATACGTTAA 


540 


GTGATTTATT 


AAGTAACGGA 


TTGATCAATT 


TAATTTCTGA 


TTTATTTAAT 


GTGATTATCA 


600 


CCTTAATCTT 


CATGTTATTT 


ATTGATGTGA 


AATTAACCTT 


ATACAGTTTG 


CTGCTATTGC 


660 


CTGTGTTGTT 


CGTGATGGTT 


CTATTTATCC 


AAGGGAAACA 


ACGGAAAGCT 


TACCAAGAGC 


720 


TTAGCAATAA 


ACAATCAAAC 


TTAAATGCCT 


ATATTCATGA 


AAGTATTTCA 


GGAATTAAAA 


780 


TTACACAATC 


TTTTGCACGA 


GAAGATGAAA 


ATTTCCAGAT 


TTTTAACGAG 


GTAAGCGAAG 


840 


AATATCGTCA 


ATCCTTTATG 


AAAGCTGTGC 


GTGTCCAGTA 


TTTATTATGG 


CCAGCAGTGC 


900 


AAAATATATC 


CGTCATTACG 


ACTTGTTTTA 


TCTATTTCGT 


AGGAATCCGT 


CAATTAGGTG 


960 


TGTCCGTTAC 


CACTGGGACG 


CTAATTGCTT 


TTATCGGTTA 


TATCAATAAC 


TTTTGGAATC 


1020 


CAGTCATCAA 


TATTGGTAAC 


TTCTACAATT 


CTCTGATTAC 


TGCAACTGCC 


TATTTGGAAA 


1080 


GGATTTTTGA 


AACGATGGAT 


GTTGTTCCAG 


AAATTCAAGA 


TGCGCCGCAC 


GCTATTGAGT 


1140 


TGCCGCCAAT 


TAAAGGAACC 


GTCGACTTTC 


AGCATGTTTA 


TTTCCGTTAT 


GAAGAAGGAA 


1200 


AAAATATTTT 


GACCGATGTT 


AGTTTTCATA 


TTGAGCCAGG 


ACAAACAATC 


GCCTTGGTTG 


1260 


GGCCAACAGG 


TGCGGGTAAA 


ACAACAATCA 


TCAATTTGTT 


AAGTCGTTTT 


TATGATGTGA 


1320 


ATGAAGGGGC 


CGTTAAAATT 


GATGGTTATG 


ATGTTCGCGA 


TGTGaCACTC 


CGTTCGTTAA 


1380 


GAAAACAAAT 


GGGGGTAATG 


CTTCAAGATA 


CGTTTATTTT 


TTCAGGAACG 


ATTATTGAAA 


1440 


ACATTCGGTA 


CGGAAATTTA 


GCAGCCACAG 


AGGAAGAAGT 


CATCCAAGCC 


GCAAAAATTG 


1500 


TTCGTGCGCA 


CGACTTTATC 


AAGGATTTAA 


AAGATGGCTA 


TGAAACAGTT 


GTGGAAGAGC 


1560 


GGGGTAGTAC 


ACTCTCGGCA 


GGACAACGCC 


AATTAATTTC 


ATTTGCTCGT 


GCTTTACTGG 


1620 


CAGATCCCAA 


AATTTTAATT 


TTAGACGAAG 


CAACCTCCAG 


TATTGATACA 


AAAACAGAAG 


1680 


AATTGTTACA 


AGAAGGACTA 


CAACAACTTC 


TGAAAGGACG 


GACATCGTTT 


ATTATTGCTC 


1740 


ATCGACTTTC 


TACCATCAAA


AATAGCGATA 


AAATTTTCTA 


TATTGATGGT 


GGTCGTATTG 


1800 


TGGAAGAAGG 


ATCTCATGAT 


CAACTAATGG 


CAAAACATGC 


GTTATATCAT 


CATTTATATC 


1860 


AATCGCAATA 


CGACTTATTA 


mAGAGCTAGA 


TGATAAAACA 


GAGAAAATTC 


TTAGTTGAAT 


1920 
CGTTCTCTGn nTTTCTGTAT CGTAACCTTA ACAAAGGnnA AAATGG 1966
(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2809 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 



CCTTCTATAA 


ATATATAGTC 


AAACTTAAAg 


TTcTGaCCTG 


CTTGTTnnaG 


TTAGAAATAT 


60 


AACGCAACTA 


GCTTGTGTCA 


TATTCAACAT 


ACCATGACTG 


TACTCGAAGA 


ACTGTTTAAA 


120 


AAGATTTGGT 


CTACTCTTTA 


ACCCAAGCTT 


CCGTTTCAAC 


TTCTCCaTAA 


TTGCTCGCCA 


180 


CAATGTATTC 


ATAAGGTAGG 


CTGTCTATAA 


TGTTAGGAAT 


ATCGCGTACT 


TTTATTACTT 


240 


CCGTTCCATT 


AAATTTTGTA 


GAAACAATAA 


AGAATTCTAT 


TTCTTCATTG 


TAATCATTAA 


300 


GAAGTTGTAA 


CATATCTTTA 


TCTTTACTAA 


AAACATAGTG 


ATCAAAAATT 


TGGGGTTCTC 


360 


CATGATTGAC 


ATCTGATACA 


GAGATTACTT 


CAGCCATTGA 


TAATGCAATT 


TCTTCATTGT 


420 


ATTCTATTAA 


AAATATTCCA 


TGAGCTTTAT 


TACCCTTGTA 


AATTTTTAAA 


TTTTGAATAT 


480 


GAGTATCTAT 


GTTTTTCTTA 


CACGACTTTA 


ATAGGTTTTC 


CTTTAAATGA 


TTATTAGAAT 


540 


ACGTTAATCT 


ATGTCTAGCA 


AAAGTATCTT 


CATCGATCAG 


TCCTTCTTTT 


ATCTCATTAC 


600 


TTCGACTCTC 


GAACATTCTT 


TGTCTTTTTA 


TATGCTCGGC 


TCCTTTTTTC 


AAAGTTCTAC 


660 


ATTCTTGTTT 


TGTTTGTTGT 


TCTTCATTAT 


TCATTTTCTT 


AAACTATTCT 


CCTGTCTTTT 


720 


TTTAATATTA 


TGCTTTTACT 


TTATTTTGCA 


CGATAATCTT 


ACTCCACAAC 


AAATTGTTAA 


780 


ATTAGCATTA 


TTAAATGTAA 


AAATTGAGTA 


TACTCATATT 


TTACACAAAT 


GACGTATGCC 


840 


"CAACTTGAAT 


GTCCTACTGT 


TTACCTAGTA 


ACAATTGGAA 


TATGAAAAAr 


AACCAACACG 


900 


GCATACCCTG 


TTGGlcTCTTT 


TAGTTAATAT 


ATTTTAATAG 


AAAATTTATT 


TATTACATCA 


960 


TGAAATACTG 


TATAAAAATC 


GTAGCAAGCA 


CACATCACTT 


TTTTTAGCCT 


ATCATTTTAC 


1020 


GTCGACAGTC 


TAGCTGGTTT 


CTTATCTCAA 


TTTTGAACCC 


ATTGTTTTTC 


TGCTATAATC 


1080 


ATAGTTGACG 


TTTGATCTGA 


ATAATTTTTw ATGATATTGy 


TAAATTyTGT 


CATTGATTGG 


1140 


CTTAyTTCAT 


CATCAGCACT 


TTTTGAkCAT 


TTAAAATTTA 


GATATTTAAT 


TCTAGATCTA 


1200 


CTAATTGAGT 


CTAGATTTTT 


TAAATTAATT 


TTTTTAAAAG 


TTTGTTTATT 


rTCTACAGAT 


1260 


AAGATATATT 


CTATCATATA 


CAAATTAGCT 


AAATAAAAAT 


TCTGTAAATA 


ACTTTTATTA 


1320 


TTCAGTATCT 


TGTCTTCTTT 


ACTAAGTATT 


GGGTATATTA 


GTTTTAATTT 


CCTATTATTT 


1380 


TTGTCTATTA 


AATACTCATA 


TATAACATAC 


AGTAAGAATG 


TATTTAGCAC 


ATATAATGGT 


1440 


AATTCTTTAT 


TTTTTTTTAT 


TATTTTGCAT 


ACCTCATTGC 


TATTTAAGAA 


AGTTGACTGA 


1500 


TTATATTTAT 


TAAGTAAGCT 


ATTTGATACA 


ATATAATACA 


TTTCGAAAAT 


ATAATCCTTA 


1560 


GTAACTAATT 


TATATATATT 


GTATCCATCA 


TTCATAAACC 


CGTTAAAATT 


AAAAGGAACT 


1620 
ACAGTAATCA AGAATATTGA AAAATTAAAT AAAGAAAAAT ATTGAAAAAA GTCAATTTTT 1680

ACAAAAAAGG TTAATGTAAA AAAAAAGACT AAATATTGGT CCTCCGGTTA ACAACCTCAG 1740

AATTTTCTTA TTATTAAACG TAGTTGGCTT ATATCTAAAT AATCCACCAA AAAATCCCCA 1800 

CTGTGTATTT AATCTAAyCT TAAACTGwTT TTCCCACGAA AGTTTTATAA ACCCAAATAT 1860

TAATGACTCT GGTCTAACTT TATTAAATAA TCCAAAArCT AAATGACCTA ACTCATGTAC 1920 

AAAGATAGAC AATAACAAGG AAAAGTATGC AGTAGACACT AAATTTAATT CAAAAACAAG 1980

AGTAAAGATT ACGAGAAGTA ACGTAAAGAT CATCGGAGAT ATAAATTTCA TCTATTTTAT 2040 

TCTCACCTCT TTGTATTTAA GCATGCTATT TTTATCTATA AAAGAATCAG CATTTTCAAA 2100 

TAAAGTATTT AATACTTCAT TTGAATCTTT AGAATTATCT ACATTCTTTG ACATAATAGC 2160 

TGCTAAAGtG CGCTTACTTC TGGAS3TTGCT AAACTTGTTC CAAAAGAAAG TGTATwACCA 2220 

TCTGGgAAAG TCAGCAGCTT TGCCTAGTGG AGAAACTAGC GATGTAGGGT AATAAGTCAT 2280 

CATCATTtCA CGAGCATCAA TTTGtCCTGT TAwTtTGtAA TJcGTCACCAT ATCCTCCTGc 2340

AGGACCATAT ATwGAgTrCA TTAGACCCAT AATTAGAATA GTCAGCAATA TCACCACTCT 2400 

TTTTTGTCGC TCCGACGGTA ATTACAGACT CTAGTCCTCC TGGTATATGT TTTTCATTAC 2460

CAGTGCTTAT ATCACGCGAC TCATTTCCTG CTGATGCAAC AATTAGAATG TTATTTTTTC 2520 

TTGCATAGTT AACAGCTTTT CTGAATGCTT CTACAGTAAA TCTTTCGTCG TCTATTTCCA 2580 

TATTTTTATA TGATCCAAGA CTCACATTTA TTATATCTAC TTGATCATTT GTAGCATCAA 2640

CTATAGCTTT AAGCATATTT ATAGAGTTTC CATCTGTCCC ATCCATCACC TTATAAGAAT 2700 

TTAAATTTAC TCTTGGAGGC AATCGTGTCT ATTACTCCAG CAACTTGTGT ACCATGACCA 2760

TATTCATCTA ACTCAATATC ATTAACATAn GTTTTTTAAT CTTAnGnTA 2809

(2) INFORMATION FOR SEQ ID NO: 435:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4149 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

TTAATATCTn TCTTATTTAT TGACATAAAT TTATTACCCn TCGCAAGTAC AGTTAGCGTC 60 

TCAAGAACAT TTTCTTTTAA TATTAATGAA CTTTTTATAG AGTCTTCTAG AAAAACTTTG 120 

GCTTTTATTT CTTGATATTG TGTATTTTGA AGCAGCTCTC CAGATAACAT CATTGTATCA 180 

ATAATAAATG TATAGATAAC TGGATCAATT TTAGTACTAA TTTCTTTTTC ATTTTGTAGT 240 

TCTTTAACAG TACTCCAGTA AACGTCAAAA AAACTTCTAT TATTATAGAA TTGTTTAACC 300

CATTCAGTTA TCATTTTTTT ATCACTAGGA ATTTTATTAT AAAATATGAT GAGTTCAGGA 360 

TGTTGCTTTC CTAATTTTAA AATCATTTCA AACATTATTT CTAAACGATC TTTTAGTTTT 420 
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rtfViuonu^u 1 


PP. T T T A T A P.T 
Oo i 1 ml no l 


APTAAPaTAT 


TCATCTAAAA 


TAACAAAAAC 


GTGTTCTATC 


480 


ppp aptptt a 


AP ATAPPTTP 
ACAi AOO 1 1 O 


111 / i 1 1 VJ 1 1 VJ 


AAATACTGAT 


ATAATGATCC 


TACACTTAAA 


540 


rrv Cjy f~* Op Op A P 

Iblbbl 1 iAL 


1 bbb AAi 111 


APP.AAPAP,AA 


GTTP,PTTPAT 

O 1 IVJVvl 1 Oil 1 


AACCAAPTCT 


CCCAAATTCA 

vwvinnni i\^n 


600 


TCTAAbAl b I 


APPATAAAAT 

Abb A 1 AAAA 1 


Zi ZiP,TTP*PTTT 
nnu l l v3 v-' ill 


TTTTPTAATT 
11 x i o i rvi i i 


TTAGTCGATT 


AAATAATTCA 

ruui J. fin i i VvH 


660 


Gr i be i bi 1 1 


TATAPATAAP 
1 A 1 MUM 1 HMO 


TATPTPPTPA 

1 1 UiVw^/iVjft 


AAA A AG AT AT 


TkTAATTTAG 


Ta TACPATAT 


720 


TTAi lAlATLA 


A A A APP APPP 
AAAAO VjnO O Or 


PTP.aTTTTTT 


TGCGATCCTT 


AATTGAAtCT 


TrGrTTGa vA 

1 X» v_J w X X Vj CI V 


780 


AAAATTAAA1 


TAPATATAAA 
1 Mb A 1 Ml MMM 


APAflTTAArC 


TPT^TAAGAG 


AAAATTGPCA 


AAPTATPTAP 


840 


CTTCTbAAAA 


prppTD TT A TP 
b lbl Al 1 AlO 


pTTTATPaaa 

LI i J. r\ 1 <^t\r±r\ 


aaT AP^PTATA 


T ATTTATACT 


TPTTTAT AG A 

X \^ XXX ** X f4\Jil 


900 


AGAATAGTAb 


A A PT A A T A PP 

AAb 1 AA 1 Abb 


PTZiZiPTPZiTZi 
b 1 nnL 1 V3rt 1 M 


TPZiTZiPZiTTT 
1 on 1 nyn ill 


Zi A ATP APT A A 
fxr\r\ l OnO 1 nn 


ATTTPTAGAT 

n 1 1 1 U 1 ni?n l 


960 


TGCGTCATTT 


n"rrrrp A ATT 
1 ibbbbAAl 1 


1 1 Mrtb 1 b i 


TZiPTPPZiTPZi 
1 no 1 Oh 1 


Zi PP ZiPTTTTT 
nOOno 11111 


PPPTATATTP 

OOolnlnl lO 


1020 


CGTATATAAG 


GA1 Ab I Al 1 A 


opTT A A T* A A T 

bb 1 1 MM1 MM 1 


1 Al Al 1 1 1 1 A 


PZiPZiTTTTZiA 
unun 1111 nn 


TAT ZiP A A A AT 
lnl nOnnnn i 


1080 


GGAATGGTAA 


/-* rp rn rp try tp rprp 

bb libJ.1111 


PAPAAAAATT 1 

bAbAMMMMl 1 


ATZi ZiPPTPTT 
A i AAou loll 


PTP Zi Zi TPPPP 
\j l unn 1 oOOxj 


TP ZiTPTT ATT 
1 on l vj 1 lnl 1 


1140 


GCTCTGATTG 


CTTT i bAbAA 


on/-> A PPPTP A T* 
1 bAbbbl bAl 


PPPZlTPTPZiT 
bbbAI b 1 oA 1 


TPTPTPPPPP 
1 O 1 O 1 OOouO 


APA APPPPAA 
nOnnoOVj^nn 


1200 


TCATTTAATT 


a pppt {^^r^tr^r^ 
AoGoTGCClb 


1 bbAAAb A 1 A 


TT* AP A PPPP A 
1 bAbAbbbbA 


PPTPZiTTZiTP 
bblOAl iAIO 


A APPPPTPPP 
nnuooO 1 OOO 


1260 


TTTGCCAATC 


TCTGTTb Tbb 


AAA1 1 b 1 Al b 


bb i b 1 1 1 bAl 


Zi PPP Zi ZiTPTP 
AOOOAA 1 vj 1 O 


PPATPPAATP 
OOn X VjOnn 1 O 


1320 


GCAGGTAGTT 


pp app a atpp 
bb Abb AAi bb 


APATAATPAA 
Mb A 1 Art 1 brtrt 


ZiPZi ZiTTPTZiP 
AbAA 11^1 e\\3 


PPPTZiPTTPT 
OOo 1 nu 1 lul 


PPPAATPTTT 

OOOnnlui 1 1 


1380 



TAAATTGGTA 


TTGATbbAbb 


A A TTTTAfTT 
AA 1 I b 1 Ab 1 1 


TP AP A A ATTT 
IbAbAAAl 1 1 


ZiTZiPPTi-PZiT 
Al AOo 1 Uonl 


TPPPP AAP AG 
1 OOOOnnk^nU 


1440 


»-i mm tv rr> /"* tv rp A 

CGTTATCATA 


/">/-' /~i T\ T\ A PPP T 

oGGAAAbbb 1 


1 1 b 1 1 Mb 1 bM 


A ZiPZi PPPPPT 


PZiTPPPZi A AP 
OA1 OOOnnnO 


PPATTTAAAA 


1500 


m tv or»/^*A <t> A on 

TAToATbbAl 


rrATTP A TT A 

bbAi I bAl 1 A 


TPA A ZITTTTT 

1 b AHA 11111 


TTPPZ1 TP. ZI TP 
1 1 OCA 1 bn 1 C 


TPPZiTPZi AAA 
1 oOn 1 U/irtArt 


APTTTGACGT 
no i i j. UHLi o i 


1560 


C 1 b 1 1 AAAbb 


A 1 AbboAA 1 b 


PP Zi PP A Zi Zi PP. 


PTTPTTTT EP 
01101111 MO 


PAATPPPPTA 
Onni 0>Ov3O in 


TGJ T T T T TT T C 


1620 


T b Abb Ab AAb 


A A T A APPP A T 
MM 1 HACLbn 1 


P. Z\ T T T P T P P, Zi 


TTaaaTAZiAT 


PAAPAATTAA 


GAAGATATAA 


1680 


1 bbbAI 1111 


I bbb 1 rt.bbL.t3 


ZiZiPZiTZiPPTZi 
nnLn i Moo 1 1\ 


LPZiTPPPTPZi 
nun 1 Ooo 1 


P A Z\T A PPPTP 
Onn 1 nuUU i 


PAATGACPGT 


1740 


TlbllblbiA 


A A TPTPP A A T 
MM 1 b 1 bbHA 1 


PTZiZiTZiPZi ZiT 
L. i AA 1 AC AA 1 


T T P Zi Zi T Zi P Zi T 
1 lOAAl AOM1 


TPPTTPTPTP 
1 OO 1 1 Vj 1 O 1 o 


PPPTTPATTA 

v^*wv». x x v-» n x x n 


1800 


b AAbb I b 1 1 b 


PTTPPPPATT 

b 1 1 bbobA I I 


Z\ Zi Zi Zi TP Zi PPP 
AAAA 1 IjAuLu 


ZiTPPTZiTZiPP 
A i O o 1 A 1 A Vj o 


TPAATTTPA A 
1 onn 1 l l Onn 


PPTPPGTTGT 

OOlVJOOl 1 VJ 1 


1860 


opTf' A TP ATPr 1 

1 1 bA 1 bA 1 bb 


PAP C^C ATTPT 

bAbbbAl 1 b 1 


PPPZiPPZiPTP 


ZiTTTPZiZiPPP 
Ai 1 loAAuOC 


PZiPPPTTPPP 
OnOOO 1 1 OO 


TAAGTATTTG 


1920 


1 1 1 AAi 1 1 bb 


bbb Ibbbbi A 


ZiPPTTTTPPP 
nut 1111 Cvjo 


T T Zi PP P Zi T Zi Zi 
I InLuunlnn 


ATTPTTPPTP 

n 1 1 O 1 1 OO 1 O 


AATAAPTTCT 

nn x nnu x x x 


1980 


T TP A P 21 TPT T 


LI 1 tut 1111 


Zi ATP.PPTPTT 
r\r\ 1 lull 


PPPTPZiTTAA 

lj O O L Urt 1 1 c\r\ 


TAATAPPiTCT 
i nn i noo x o j. 


GAAGAGAAAT 


2040 


("■TTTAApATT 
lAAuAl i 


T T Zi T A P Zi P P P 


PTP,ZiTZiTPP.A 


PTZiTTTPTPT 
olnl 1 lolul 


TTPTTPAPAT 
1 1 O 1 1 unO^n i 


CAATCACTTG 


2100 


TCTTTTCGTC 


CGAATATCAA 


CGACGCTTGC 


TTTAAAATAT 


CATTCTCCAT 


TTCAAGCTAC 


2160 


TTATTTTTCT 


TTCGTAGTGC 


GATCAATTCT 


ACTTATTCCG 


GGGTTAAGGT 


ATCTTTTTTC 


2220 


TTGAACGATT 


CTGTTGAATG


TGTTTGCTTC 


ACCCATTTAT 


CGAAGGAAGA 


AGGCGTAAGC 


2280 


TCATGTTCTC 


GAATGATTTC 


TACACGCGGC 


TTGCCTGCAA 


GGTAAAGATC 


GACGACGTGT 


2340 


TGTTTGAATT 


CTTTTGAATA 


GGTTCTTCGT 


TGACGTCTTG 


ACATAAAAAA 


TCCTCCAGTG 


2400 
TGTTTTTTAT


TATTCTACAC 


ATCTTATTTT 


TTCTTTCTAG 


TGTAGCCGAT 


TCAGAAAAAG 


2460 


AACCGATAG.T _TAAT1££ACC 


ATATTCATTA 


TAGTTAAAGA 


CATGTAAATG 


AAATGATCTC 


2520 


TCAAAGAATA 


TGTATTCCAT 


TTTTCTCTTA 


ATGAGAACTT 


ACTAAATAAA 


AATTGACAGA 


2580 


TTGTCTCATT 


GCATTACTTT 


ATTGTACATC 


TTAAAATTAT 


CTGCCTTATT 


TTGTTTGACC 


2640 


ATTCACTTTT 


TTTTTAGATC 


CTAAATATAC 


TTTTATCCGT 


TCATAAGAGC 


AGCCTGAATA 


2700 


AACACAACCT 


ACAATAATAC 


CCAAAACTAC 


TGCTCCAATA 


AGATTGTAGA 


AAAAGCCAAT 


2760 


TTTTTTTACA 


AATGTTTGAT 


AAAAATTTGT 


CATTATTATC


ATCATGACTA 


CACTGTTACA 


2820 


CAAGTCATAC 


ATAAAAAATA 


ATTTAAGAAA 


GAGAAAAGAG 


ATCATTCCTT 


TTTGATCAAT 


2880 


TCTTCTATAA 


AAATTATATA 


TCCAATACAA 


ATAAAGAAAC 


ATACTTCCTA 


CCAATAAAAG 


2940 


TAATTTTAGT 


ACAAACCACC 


ACGaAAAATG 


TGGATAGAAT 


AAAAAATCAA 


TCAAACATAA 


3000 


TAACGGAATA 


AAATAAAGGA 


ATTGATGAGA 


GTGGGTTAGA 


TATTTCCAAA 


AGGTTTCTTG 


3060 


TTTTAATCGT 


TCCATTTGTT 


GATAAAATTT 


TTGGTCATCT 


ATTATTTCCA 


TTTATTTACT 


3120 


CCTTATCTTA 


ATATCATACA 


TAGCAACATG 


TACAACCTTA 


TAAAAATAAA 


TAATTTACTT 


3180 


ATTTATTTTC 


ATCGcAATCG 


ACATTCATTA 


GTAGATATGT 


TkGTTAGTAT 


TAAAAGAACT 


3240 


TGTGATTATT 


AAATTAGTAA 


ATGTTTTATC 


ATATAGAACT 


TATTGATGGT 


TATTAAAAAA 


3300 


AAGCTTCTTT 


TCTATAATTT 


AAACACATCT 


ATAGGTAGAT 


AGATATAGAA 


ATTTATACAA 


3360 


CATTCAAGTA 


AGAATTTATT 


TATTAAAAAT 


GTGGTCTTTC 


TAGTAAAAGA 


AACAAAAAAC 


3420 


CAGTTAAGCA 


TTCTATCTAA 


CTGGTTTTTT 


ACTAAATTTC 


AGTTGTATCT 


TTCTATATTA 


3480 


ATTAAGTTGT 


TATTTTACGC 


GAGGTTCAGT 


TTCTTCTTTT 


ACGTTTTGTA 


CACCATCACT 


3540 


TACTTTTTCA 


CTTGTACGCC 


TAAAGCTTCT 


TTTGATTTTT 


CAAATtGTTT 


kGAAGCAAAT 


3600 


TCTCCAGTAG 


CAGAAGCTGC 


ATCGGAAAGA 


TGATCTTGTA 


GAGTAACTGA 


ATCATTTTCA 


3660 


TGTTGTTCTT 


TCGTTTTTAC 


ATCTACGACG 


TTTACATTAA 


yCyCAATTAC 


ATCTAGGCCA 


3720 


GTCATTTTCT 


TAACTTCATT 


TGAAATAACT 


TGCTTCATTT 


TATCATAAAT 


TGTAGTTACA 


3780 


TCTTTACCAT 


ATTCAGCCAC 


TATTGAAAGA 


TCTACTGCGA 


CTTGTTTTTT 


TCCTACTTCA 


3840 


ACATCCACTC 


CAGAAGTTGT 


GTTATCCGTA 


TTTACTAGTT 


TTCCAGCTAT 


ATTTGAGAAG 


3900 


AAGCCTCCAT 


CAATCGTTAG 


AAGACCATCA 


ATCTCATCTA 


ATGCAATACC 


AATTATTTTT 


3960 


TGAATTACTT 


TATCTTCAAA 


AAATAGTTCT 


CCTTTAATAG 


TCTCAAAAGT 


TTTTGACATG 


4020 


TCCkTTTTTT 


TGAkCTCTGT 


TTTTGTATTA 


TTTGTTTTGT 


TTTCCATTAT 


TTTTCCTCCT 


4080 


AAAAATATAA 


AATTTATTTT 


TACCGTAAAA 


ACGGTTATTA 


ACTAATTTTG 


TTTGTCAAAA 


4140 


AAACCAACA 












4149 



(2) INFORMATION FOR SEQ ID NO: 436:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 



CCTCTATGGA 


ATATTTTTTT 


CCTCAAGAAA 


TCATTTTTTT 


ATAAAAACCA 


TCCTTAAATA 


60 


TGATACCAGT 


GGCTGGCTTT 


ATTTTAAATA 


GACAAAAAAG 


AGTATGAGAC 


AAAAATCACT 


120 


TTAGATTTTT 


GTCTCATACT 


CAAAATCTGA 


AAAACAGTTC 


GATCCCGACT 


TCTTCGTTTC 


180 


TATTGTTTGG 


TTCTGGCAAC 


AACTAACAAG 


TAACTAATTT 


TACGAAATCC 


ATGACAATAT 


240 


TCATTATTGA 


GCATATTCAA 


TCCTTCTACT 


AATTTTCGTA 


TAAGATAAAT 


ATTGATTGAA 


300 


TGTGTTGACA 


CAACGAAAGG 


CCGTCCTATT 


CGCATATTGA 


TCATAATAAC 


GCTTGGTATA 


360 


CCACTGAAAT 


TCTTTCGTCC 


AAGGTTTGAC 


ATGGCCAGTA 


AAGTGGATAA 


TGGATGGTGC 


420 


TCTTCTCGTT 


TCTTCATATT 


GTTTTTCTCC 


ATAAATAGTT 


GGATGTTTTT 


TCGCTTTTGA 


480 


CAGAATATAA 


CCTTGGGCGT 


TCCACTTAGG 


ATGCAACAGG 


GTCCAACGGT 


CGTGTAAAAC 


540 


CGCATTCAAT 


GCATCTTGAT 


CATGGAAACG 


TAATTTATCT 


GGATTTTCTT 


CAATAAATCT 


600 


TAAGACTTTA 


GTGGTTACAT 


CTAAATTCAA 


CCATTTTTTA 


ACATCGATTA 


GCAACAAGCC 


660 


TGAGTTAAAA 


TAACACATAG 


ATTCTGCAGG 


AATCGCCATT 


TTTTCCaAAC 


GTTGATGGAA 


720 


CCCTGCATCT 


TCCaCAGCGG 


CGATGATATT 


TTCACCTAAA 


TCAACTGTCC 


ATAATTTTGC 


780 


GACATCATCT 


AAGGCAATCA 


TGTCACAATC 


CATATACAAC 


AGCCGTTCGA 


TCTGGCTTCC 


840 


TCTAAACAGC 


TCTGGAATAG 


CAATCCTGTA 


GTAAGCTGTT 


TTGGGAATTC 


GTTCACTGGT 


900 


TACAACATTT 


TTGAAAAAAt 


GGGGGTTGAT 


TTTTAAAAAT 


GTTAACTCTG 


CATTGATTTG 


960 


GGTATGTTTA 


ATCGTAAAAT 


ACAGTAGTTG 


CTTGCTTTCA 


AAATTGATAT 


TGTCATCAAT 


1020 


CACATAAAAA 


TGAACAGCTG 


CCGCTGAAGG 


ACTATTTTCT 


AAAATTGAAA 


CAAACATAGC 


1080 


AGCAAGATGC 


GGTACAAAGT 


TGGTATTGCA 


ACAGGAGACA 


ATCGCTAATT 


CTTTTCTATT 


1140 


TTCCATTTCT 


TGCACCTACT 


TTCTTTAACT 


TGTTTAGATA 


AAGAGCTGTA 


TAAGGATGAT 


1200 


TTTCTAACGT 


ATTCCAAGGT 


TTATCGTGTC 


CAGTAAAATG 


AATAATCGCT 


GGCTCTTGAT 


1260 


TTCCCGCTGT 


ATATAATTTT 


TGATAGGCTA 


CATTAGGCGC 


TTCATGACGA 


TTAAAAACTA 


1320 


AAGACGTCTG 


CATATTCCAT 


CTTGGTTCTA 


GAGCTAGCCA 


ATCTTCATAA 


AGAACAGCGT 


1380 


TTAAAGCATC 


TTGATCATGA 


TAAACTATGC 


GGTCCCCATT 


TTCTTCTAGA 


TACTGAATTG 


1440 


TTTTTTGGGT 


GATTGCTTTC 


TCGTTCCATC 


GGTCAATATC 


AATAATCATA 


ACTCCTGAAT 


1500 


TAAAATAATA 


ATGCGGAGGA 


AkGGATGCCC 


AAACGTTCAA 


nGCATAAGCT 


GACCAGGATC 


1560 


ATAACCGCGC 


CAATCGTTGG 


TTCAC 








1585 


(2) INFORMATION FOR SEQ ID NO: 437:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437:



CGAGCTCGGC 


CCCCCGATGT 


nAAAGTGGGC 


GACACTGTTA 


AAGTGAAATT 


TAGTGTTGAT 


60 


GCTTGGGCnA 


CTGGCGAAAG 


CTATTCCGCA 


ATGGGTAAAA 


GGAAACAGCT 


ATAAAGTACA 


120 


AGAAGTAACT 


GAGAGCAGAG 


TATTGCTTGA 


AGGTATCTTG 


TCATGGATCA 


GCAAAGGCAA 


180 


TATTGAATTA 


TTGCCAGATG 


CGGCAACTGT 


TCCTGATAAA 


CAACCAGAAG 


CGACTCATGT 


240 


GGTACAATAC 


GGCGAAACAT 


TATCAAGCAT 


TGCTTATCAA 


TACGGAACAA 


ACTATCAAAG 


300 


ATTGGCTGCA 


TTAAATGGAT 


TGACAAATCa 


AATCTTATTT 


ACCCTGGCCA 


AATTTTGAAA 


360 


GTAAATGGAT 


CAGTAGTAAG 


CAACATTTAC 


ACAGTTCAAT 


ACGGTGATAA 


TTTATCAAGT 


420 


wTTGCAGCTw 


AGCTTGGTAC 


GACTTATCAA 


aCCTTAgCTG 


CATTAAACGG 


ATTAGCAAAT 


480


CCTAACTTGA 


TTTATTCTGG 


TCmAACaTtG 


AGCTATtGGA 


TTTTTkGCCA 


AAaTCGGkTG 


540 


aAATTTAGAG 


GtACCTCATT 


tACaATAGAG 


TTACCTTTTC 


ATATTATGAC 


TCTTTTCGAT 


600 


TTAGAAAAGA 


GGTGCTCCTA 


TCTTATGcCc 


AAGTCCTTAG 


GTAGGAGCAT 


TTTAATTTTA 


660 


CTCATACGTA 


GCTTAGTTTA 


AAGCTATTGT 


ATTACATAAG 


AAAACACCTA 


CCACAGATGC 


720 


ACGTACTTCC 


CCAAGCAGTT 


GTCTGTGCGG 


TAGGTGTTTT 


TTTAGATTTT 


ACTTGTTAAT 


780 


CATAACCGAT 


TTTCTTTTTT 


TATAACAAGG 


TTAAGCTTGA 


ATTTATGAAA 


AGGATAATGT 


840 


GTACGGAATT 


TATCCTAATA 


AAAAATAGAA 


TTAGCCTATT 


TCTAAGCTAC 


TATTACTTGA 


900 


GAACAGTTTC 


CTTTTAAATA 


AGTGGAAAAT 


TGTAGTACAG 


CTACTTTAGC 


GATATTCCTT 


960 


AATATCCCAG 


TTCTAATTTA 


AAGTCAAACA 


CAATTTTGTT 


AATAAGTTGT 


GAATAATTTT 


1020 


ATTGGTTGAA 


TTAACAAAAC 


TAATTTGCTA 


GAATAATAAA 


GCAATTACTT 


GTAGTAAAAG 


1080 


ACTACAGGsC 


CCTTGTATCA 


TAATTGTGCA 


aGGGGGTTTT 


TGTATCGTAT 


ATTTATATTT


1140 


CATGTCTAGA 


GTAGATGGAT 


TATAACCTTT 


TTATAGTATT 


GGTcCCATCA 


TGtTTTAATA 


1200 


CACTCATTtG 


CTTTATAGGA 


AaTAGTATGG 


tATTATAAAT 


AAGTAATCAA 


TTTTGACATG 


1260 


TAACGAGAGC 


GCTATACATA 


AGACTATAAT 


TCTCCTTTTG 


CTTATCAAAC 


ATTTTTTGCA 


1320 


ACAAATCGCA 


TACAGGGTAT 


GCACAAGGAG 


GAAACATAyA TGAACAACGG 


TACAGTAAAG 


1380


TGGTTTAACT 


CAGACAAAGG 


TTTTGGATTT 


ATTACAGGTG 


AAGATGGCAA 


TGACGTATTT 


1440


GCTCATTTTT 


CAGCAATTCA 


AGCGGATGGT 


TTTAAGTCAT 


TAGAAGAAGG 


ACAAGCAGTA 


1500 


ACTTTTGATA 


TTGAAGAAGG 


ACAGCGCGGT 


ATGCAAGCTG 


TTAATATTTT 


TAAAGCATAA 


1560 


TAAAGGTTTT 


GGTAAAACGT 


TCCAATTTAA 


TTGGAACGTT 


TTTTTTATAA 


GATTAAATTG 


1620 


ATAGGCTCTG 


AATGTTGAAA 


GCGAAAGGAG 


ATCATCATAT 


GGGATTTGGA 


AAATGGAAAG 


1680 


AAaTTATACC 


ATCAGATAAA


AGTAAACAGT 


TTGAAGAAGG 


ACAGAAAGTA 


TCATTTATTT 


1740 


ACAGAAAgAA 


tTAGAgGJtaT 


AATAGCgTTT 


TATTAATTAA 


TcAGCAGaAT 


TAATATTATA 


1800 


GACAGTTCTA 


AGCCAGAAAA 


CnnAAAAGAn 


AAAACAGT 






1838 
(2) INFORMATION FOR SEQ ID NO: 438:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1239 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 



nnATTTGGGG 


TCTTAAGCGT 


TCTCTATTTT 


GTGGAACCCT 


TTGAGAACAA 


TCTCTTGGTA 


60 


TTGTATGTCG 


GATCGGCTGT 


TTTAGTGACG 


ATTTTAGAAT 


ATGTCACTAG 


TTATGGCTTG 


120 


GAAAAATTGT 


TCCATGCCTC 


TTGGTGGGAT 


TATCATGATG


TACCTTTTAA 


TTTAAACGGT 


180 


CfiCGTGGCTT 


TACCGGTTTC 


TCTTTTTTGG 


GGATTGGGGT 


GTGTCTTGAT 


TGTCAAAGTT 


240 


ATTPAACCAG 


AAATCGCCAA 


AGTTGTCAGC 


TTTTTACAAG 


CAACCTTTGG 


CAACTACTTA 


300 


C;r ACTTTGCA 


TAGTGATTGT 


AATGGGGCTT 


GACTTAATTT 


ATACGTTGCT 


GAACATGCAA 


360 


GGTTTCAAAA 


AACTTATTAC 


CGAGATGGGC 


CAAACGTTGG 


AAACGAACCA 


ACAAGAATTT 


420 


AAGCAACGGT 


TGAACACAAA 


AATGGAAATC 


GCTACAGCTG 


ATTGGCAGAA 


GTTAAAAGAA 


480 


CGAACCAAAA 


AAGAGCATTT 


CCAAAATCGG 


TTAAATTTCC 


AACAACGTCG 


TTTTATTAAC 


540 


AATTACCCAA 


AACTAACTTT 


AAAAAACATT 


AAAAATTCCA 


AAGAAGTTCG 


GCATCTCTTA 


600 


GAAGATTTGA 


AGAAAAAGGG 


CCAATAATTA 


ACAAAAAATA 


TTACAAAATA 


TTTCAATCAA 


660 


TGGTCTAATG 


GGATGAAAAA 


AAGAAAAATC 


GTCAAAAGCT 


AGTTTTTAAk 


GCGTCTTAAG 


720 


AAATTCTTAA 


AGAATTGAGA 


TAGAAATGGA 


CTAATGAATA 


AACGCAGGAT 


AAAAAAAGAG 


780 


TTGCATTTAA 


ATTGTAACAA 


ATTGTAACAT 


AATTGAAATA 


AAACAAGCCC 


TCCTGACTAG 


840 


TATAATGAGT 


AGGTGTTTTA 


GATAAGTTAG 


GAAAGGTGTT 


GGATTTAGTG 


AAGGTTAAAA 


900 


AGTTGATACC 


ATTTTTAACA 


ACAATGATGC 


TTATCAATTT 


TGGACTTCCA 


GTCGCTGCGG 


960 


CAGCAGAATC 


ACTTGATTCG 


TTGCAAGAAA AAGAAGCGCA AgCGrAACAA TCTGGGGAGT 


1020 


TAATTAGTAA 


AGAaATCAAT 


TCAGCCTTAG 


ACGATGTAAA 


TGCTAAATAT 


GCGGAAATCG 


1080 


AACGTTTAAA 


AACGCAAATT 


TCAAAAGCGG 


AAGATACGTT 


GAAAACTTCG 


AAAGAGGAAA 


1140 


TAGAmAAmAC 


AGAArAAAAT 


ATTTCTCGTC 


GTAAAGAAGC 


TGTTGGCGAT 


CGTaTGwaGG 


1200 


ATaTTCAATT 


AAaTGGCGAC 


CAAcGTACTT 


GGCcAgCCT 






1239 


(2) INFORMATION FOR SEQ ID NO: 439: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2025 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 
TTGAAAAATC CCAAATTAAA GTGGAACCAG CAGGCTTTGA CTGCGACTAT GTGGACATCC 



60 
AATGGTGATT AAACTGGGTC GCTACGGAAA ATTTTATGCA TGTAGTAATT TCCCAGATTG 120

TCGGAATACA AAAGCGATCG TTAAAGrAAT TGGCGTkACg TGTCCTGTGT GTCACGAAGG 180 

ACAAGTTATC GAACGCAAAT CAAAGAAAAA TCGGATTTTC TATGGCTGTA GTCGGTACCC 240 

AGAGTGTGAC TTCACTTCTT GGGACAAmCC aGTAGGCCGA CCATGTCCmA AATGTGGGCA 300 

aTATTTAGTA GAGAAAAAAG TTAAAgGCGG CAAACAAGTC GTTkGTATCa ATGGcGATTA 360 

TGAAGAAAAC GTCCAAAAtT AAGAAAAATT GAGACCGAAG AATTTACTTC GGTCTTCTCT 420

TTTAGTAAAA TGAATTGAGT TTGTAAGAGA CAGCTCTTAA CTTCTTGATA TAGAAAGGAA 480

TTTTTATACA TGACAAAAAA AGTCACAATT ATTGGTGCCG GTTTAGCCGG AraGTGAAGCG 540 

GCATGGCAAG TGGCCAATGC TGGGGTACCA GTTGATTTAT ATGAAATGCG CCCTGTCAAA 600 

AAGACACCAG CGCATCAAAC AGAAAATTTT GCAGAGTTAG TTTGTTCTAA TTCATTACGT 660 

GGAaATAGTT TAACCAATGC GGTAGGTGTT TTGAAAGAAG AAATGCGCCG TTTAAATTCA 720 

ATCATTATTG GTAGTGCAGA TCAAACCGCT GTACCAGCTG GCGGCGCCTT AGCAGTTGAC 780 

CGTGATTCTT TTTCTGAAAC AATCACTGAA AAAATTAAAA GTCATCCATT AATTACAATC 840

AAAAATGAAG AGATCACCGA TATTCCGGAA GGGATTGTAA TTATTGCAAC GGGTCCTTTA 900 

ACCTCTGAAA GCTTATCTCA AAAAATTCAA GAATTTAATG GTTCTGAAGG CTTTTATTTT 960 

TACGATGCAG CAGCACCAAT TATTGATAAA TCAACAATTG ACATGGATAA AGTTTATTTA 1020

AAATCTCGTT ATAACAAGGG GGAAGCGGCT TACTTAAACT GTCCGATGAC TGAAGAAGAA 1080 

TTCAATGCTT TTCATGAAGC ACTAGTCAAT GCAGAAGTGG TTCCTTTACG CACGTTTGAA 1140

AAAGAGAAGT TTTTCGAAGG CTGTATGCCC ATCGAAGTAA TGGCGCAACG TGGCATTAAA 1200 

ACCATGTTGT tTGGCCCAAT GaAACCTGTC GGACTAGAAG ACCCTAAAAC AGGTAAACGC 1260 

CCTTATGCTG TAATTCAATT ACGTCAAGAC AATGCTGCAG CTTCTTTATA CAATATTGTT 1320 

GGTTTCCAAA CGCATTTAAA ATGGGGCGAA CAAAAACGAG TATTCCGTAT GATTCCGGGA 1380 

TTAGAAAATG CTGAATTTGT TCGCTATGGC GTCATGCATC GTAATAGTTT TATGAATTCT 1440

CCAGAGTTGC TAAAACCAAC CTACCAATCA AAAAAACGTG ACGACCTGTT TTTTGCTGGA 1500 

CAAATGACTG GTGTTGAAGG ATATGTTGAA AGCGCGGCAA GTGGTTTACT CGCAGGTATT 1560 

AACGCAGCTC GTCTTGCCAA AGGGGAAGAG CCGATTGAAT TCCCTCGCGA AACGACTTTA 1620 

GGAAGTATGG CTTATTATAT TACACATGCA GAAGGCAAAC ATTTCCAACC AATGAATGCA 1680 

AACTTTGGgC TTTTCCCAGA GTTACCTGAA CGAATTCGTG ACAAAAAAGA ACGCTATGAA 1740

GCAATTGCTA ATCGGGCCTT AGATGTTCAA GCTCAAGTTA TCCAATCTTT AGATTAAATT 1800 

AGAAAAAATA TGAGAAAAGA CCGAGCATCA TTGCTAAGAA ATGCGATGAT ACTCAGTCTT 1860

TTAATTTTAG AAAAAAATAA CAAAAGCTAA nGAATGGTAA AATTGTGACC ATATTTAGAA 1920

ATATTGGnTA ATTCCTGACC ACCTTAATTG TCTTTTGAAA AAnGGTTATG CCTAAAGTTA 1980 

CTTTGTAATC GAATGGGGAG GGCTATCACC TGGGAGGAAA AGAAT 2025 
(2) INFORMATION FOR SEQ ID NO: 440:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 



CATCATGGAA TGGACTGGGG TTCATTAGTA TTAGGAATTC TATTTGTTCT GACTGCATTA 60

ATTTCTTTCC AAAATCCTGC CGGTAATTTA ATTGCCATTG TGATGGTTTT TGCGATTTTT 120

GCTATTATTA AAGGGATTTT CGAAATTTTT GTACGTAATC GTATGAAAGA ATTGCTAGGC 180

TACAAAGCTT ATGCACCAAT CATCTTAGGG ATTATTGATA TCCTCATTGG TGTTTACTTA 240

TTGTTTAACT TAAATATTGG TGTCGCTGTG TTACCATTkG TTTTTGCAAT TTGGTTTTTA 300

TTCGACTCAA TTTTCGGCTT ATTTACTTTA GATTTTGCTA AACGAGTAAG CACAGGTTAC 360

TTCTGGTTCA CATTAATTGT TGACGTTTTA GGGATTATCT TAGGTGTAAT GTTATTATTT 420

AATCCATTGT CTTCAGCACT AACTTTAAGC TTCTTAGTTG GCTTCTACTT TATGATGTTT 480

GGTATTAGTA ATATTGTATA TGCTTTTAGA TAAAAGAAGT AAAAAAACAG GTTTTCCGAA 540

AAGGAAAACC TGTTTTTTTT TAATAACGAC GGTGGCAGTT GCCATTATTT CCCGAAATTG 600

TTCGAACAAG CCACACGATA ATCGCTACGG GAACTAAAAT TTTTACTGCT AGCCACAATA 660

AGCTTCCCAC GATACCGACA ACGGCACTAA AGAGTGAACC GACAACTAAA AATAAAAgGA 720

CCACTAAAAG AATATTTATA AACATTGGTC GTTGATTATT CATACGTACT ACCTCCAAAT 780

TTTTCCTGTT TTCTATATAC AGAGTATAGC AAGAACGCGC aACAAGCACC TCCATCGGAA 840

GGCTGATTTT TTTCTCTGGC GAATGATGGA GCCGCTGTTG GTTATTTTGG ACTTTCTTAT 900

TGArATAATC AGAATATTCT GGTGTATATT ATAAAAAATG ATTTAAAGTG AGGGATTGGA 960

TGTCGACTGA GGGATTAATC GCTTTgTTAG AAAAGAAACG AAATCATTTA AGTAAAATTG 1020

AAAATCAAGT ACTGGATTAT ATTTTGACGC ATATGACGGA AATTAGTTCA CAAACCATTT 1080

ATGAGGTGGC GCAGGCGTTG TTTGTCTCAA CAGCCACTAT TTCTCGGACA GCAAAACACT 1140

TAGGTTATCG AGGCTTCCAA GAATTAAAAT ACGCAATCGT CCAATCGGTG CAGAGCGAGA 1200

GCACAGAAAG CGATAGTCGT AGTTTTCaAG CTATTaCGCA ACAGTTGATT GCCaATGTGC 1260

aAGATAGTTT TCAACAAATG GATGAAGCCA AAGTCGCGGA AATGCTTAAA ATGATTGAGC 1320

GTGCGAATAC GATTGAAGTG TTCGGTGTTG GTGGCAGTTT TCCTATTTGT ACTGATTTTG 1380

CACGAAAGCT AACATTTTTA GGAAAAAAAG CCTATGCACG ATCTGATTGG GATGAACAAG 1440

CGGCAGCAGT TAAGAATCTG GATGGACAGG ATTTAGCCAT TTTTGTCAGT TATACAGGAG 1500

AAACTAAGGG GATTTTAGCG TATGCACGTG TAGCGCAGGA ACAGCAAGTG CCGATGATTA 1560

GTTTGATTTC GACTAAAGGA AGCACCTTGG AAAAATTGTC GACGACGACG TTATTTGCCA 1620

AAGGGACGAC ACGTTATCAT CAGCGCGTGG ATTTAAGTTC AAGAATTGCT GTCATTTGTT 1680

TGTTCGATAC GGTTTTATTA ATGTACGCAG AGGCAAAGCA ACCGCGGTAA TTTTATGCAA 1740

CACCATACTA AATGATGTAA GGGGATACAG AAAGAGTGTA GCCTCTTTTT TCTTTGTTTT 1800

GGGCTAGTTA TAATGAGGGC AATGAATGTA AACGCAATCA TTTTTAGTTT TGTAAAGAAT 1860

GAATGGAGGA GTTAAGAAAT GGCAGTAAAG AAAAAGAAAG CGACTTTCTG GGAGTTTTTC 1920

CAAGGATTAG GTAAAACGTT TATGTTGCCA GTGGCTTTAT TAGCTTTTAT GGGGATTTTG 1980

TTGGGATTAG GAAGTTCTTT TTCCAGTGAA TCCATGATTG AAACGATCCC GTTTTTAGGA 2040

AAGCCCGCAG TGAAAATTAT TTTCCAATTT ATGTCGACAA TTGGTGGTTT TGCGTTTGCC 2100

TATTTACCTG TCATGTTTGC AATGGCCATT CCGTTGGGGT TAGTTCGGAA AGAAAAAGGA 2160

ATTGCTGCGT TTTCAGGTTT CGTAGGCTAT ACGGTGATGA ATTTAGCGAT TAATTTCTAT 2220

TTAGTGCAGA CGAATCGCCT CGTGGATCCT GAGCAGTTGC GGGAAGCCGG CCAAGGAATG 2280

GTTTTTGGGA TTCAAACGAT TGAAATGGGT GTTCTTGGTG GAATTATCGC TGGGCTTATT 2340

GTCTATAAGT TGCATAATCG ATTTTATACG GTGCAACTAC CAGATAGTTT TGCCTTCTTT 2400

TCTGGCGCAC GGTTTGTGCC AATTATTACT TCATTAGTAA TGGCATTCGT GGGCTTAGTA 2460

ATTCCGTTAG TTTGGCCACT TTTCGCGCTA ATGATTATGG CGATTGGCCA ACTTATCCAA 2520

CGTTCTGGTA TTTTTGGGCC TTTTCTGTTT GGTTCAGGGG AACGTCTACT GTTACCGTTT 2580

GGGTTGCATC ATATTTTGGT TTCCATGATT CGTTTTACTG AAGCAGGGGG TTCAGCGGTG 2640

GTGGCTGGCA AAGAAGTGTT TGGCGCGTTA AATATTTTCT ATGCAGAATT GCAAAATAAT 2700

TTACCGATTT CACCAAGCGC AACAGCTTTC TTGTCACAAG GGAAAATGCC GACCTTTATT 2760

TTTGGCTTAC CAGCGGCGTC ACTTGCGATG TATCATACTG CAGCTCCGGC GAATCGACAT 2820

AAGATTAAAG GCTTACTCCT TTCGGGTGTA ATTGCTACGG CTATCACTGG CATTACAGAA 2880

CCAATTGAAT TTTTATTCTT ATTTATTAGT CCATTATTAT GGCTGTTTCA TGTCATTATG 2940

ACTGGTCTAG GTTTCATGGT AATGGCTTTA TTAGGTGTTG TGATTGGGAA CACTGATGGC 3000

GGGCTACTGG ACTTTGTTAT TTTTGGCTTG TTACAAGGAA CTTATACAAA ATGGTGGTGG 3060

GTTCTCATTG TCGGCGCTAT CTGGTTTGTT GTTTATTATT TCGTCTTTAA AACAGTCATT 3120

GTAACCTTTG ATTTAAAAAC ACCTGGACGA GACAAGGTCT TAGATGAGAC TGAATATACC 3180

GATCAAGAAG TTCAATATAA GAAAACGGGT GGCTATGATG CGCCTGGGAT TTTAGCGGCT 3240

TTGGGTGGGC AAGAGAATAT TCAAGCTATT GATAATTGCA TTACGCGTTT ACGTTTAGTG 3300






AHATCJATnAC 
nun i on l un^ 


AAATTAAAGC 


AATTAGGGGC 


TTTAGGCGTT 


3360 


GTACATTTAG ATGCGCAGAA TGTACAAGTG ATTATCGGAA CCAAAGTGAC AACGGTCCGT 3420

AATCAATTAG AGATGATTTT AGGTTAGAAA AAAGACAGAA GGTGAAGAGT GTGTCGTTTG 3480

ATAAAGCAAT TGATCGACGA GGAACGTATT GTACACAGTG GGATTTTGTA GAAGACCGAT 3540

TTGGGGAAGC GGACTTGCTT CCCTTCACTA TTTCCGATAC GGATTTTGCT GTGCCAGAGG 3600

CAGTTCTGGC AACATTACAG GAGCGTTTGA ACCATCCAGT ATTTGGCTAT ACGAGGTGGA 3660






ACCATCCGCA ATTAAAAGAA GCAATTCAAA CGTGGTACCA GACAAGATTC CAGACGAAGA 3720

TTGAAGAACA CTGGATTATG TATACCCCGA CCGTTATCTA TGGTATTTCT GCTTTGATTC 3780

AATTATTGAC TAACGAAGGG GAGGGCATTA TTTTACAAAC ACCTGCCTAT GATGCATTTT 3840

TTAAAGTTAT TCAGGAAAAT AAGCGGCAAG TAGTGGCGAA TGAACTGCTT TATCAAGAGA 3900

AGCGCTACAG CATTGATTTT ATTGATTTAG AAAAAAAACT AGCGCAACCA GAAAATCGCT 3960

GTTTGCTTTT ATGTTCCCCG CATAATCCAA CAGGACGTGT TTGGGAGCAA TGGGAATTAG 4020

AGCGAATAGT TTCATTGTGC CAACAGTATG ATGTGTTTTT ACTTTCTGAT GAAATTCATA 4080

TGGATATCGT TAATAAAGGC CATGTTCATC GACCAATTAC GCAATTTGAT TATAAAAAAT 4140

CTGCCATTAT TACCAGTGGC ACGAAAACCT TTAATTTTCC AGGGCTGATT TTTGCTTACG 4200

CCTTAATTCC AGATAACGAA TTGCGTGACG CTTTTCAGCT CAAATTAAAA AACGCCGATG 4260

GTTTATCATC AACAAGTATT TTGGGAATGC TAGCAACCAT GACGGCCTAT CAAAAATGTG 4320

GTACTTGGGT TGACGAGTTG AATGATTATT TAGCGGAGAA CCAACGTTAC GTGAAAGATT 4380

TTTTACAGAC ATATTTGCCA AAAATTAAAG TGACGGAGCT GGAAGCCACT TACTTAATGT 4440

GGTTGGATGT ATCAGCGGCA GTTCCAGATG TGGCCCGCTT ACAAGCGGCT TTAGTTTCTG 4500

TGGGGAAAGT CGCTATTATG GACGGCTCGA TTTATGGCGG TAATGGGCAA CGATTTTTAC 4560

GGTTAAATAT TGGCTGTTCG CAAGCAAAAC TACACGAAGG ACTTGAAAGG ATGCGCCAAG 4620

GTTTTGAAGC TGTTTTACAA AAGGATGGCA CAGCATCAGC GCTTAGGAAT AATTGAGAAA 4680

AGATAGCATT CCGAAAATGG TTGCAGTACA ATGGAAAAAT AGAAACAAAT AAAATAACAA 4740

AAAGTAACAA CTTGTCATTT ACTGGGGGAA AAGAAATGAA AACAATTAAA ACAAAGCAAG 4800

CACCACAAGC AATTGGGCCA TATTCACAAG CAAAAATTGT GAATGGTTTA TTGATTACTT 4860

CTGGTCAGAT TCCACTGGAT CCTGAAACAG GCAAGGTGGT TGGGACAACG ATTAAAGAAC 4920

AAACCAATCA AGTACTGAAA AATATTCAAG GGATTTTAAC AGAAGCTGGT AGTGATTTTG 4980

CTCATGTAGT CAAAACGACA TGCTATTTAC AAAATATGGC GGATTTCACG GCATTCAATG 5040

AAGTCTATGC TGAGTATTTT AGTGCGGAAT ATCCTGCACG CACAACGATT GAAATCTCTA 5100

AATTACCGAT GGATGTGTTG GTGGAGATTG AGATTATCGC AGAAGTAGTG GCGTAAAAAA 5160

TACAGTTAAA AGAGTGAGAA AAAATCGTCT TAGATTTTTT TCTCACTCTT TTTTTCAATA 5220

AAACAATACG ATTCAGCCAA AATGTCTAGC CGTGCTGGAA AATTTTCAGC AAATAGGCTA 5280


CCATGGAAGA 


AAATPHATAA 


A APflA A Af:TT 


A A AT/*: A A TZiP 
AAA 1 onn I Ho 


AAAAb AbA I A 


TGAAAGAGGC 


5340


TATAAGATGA AAAAAAATAT TAGCCAACTA ACGCAtGGTT GAAAGAACAC AGTTTATTAT 5400

TGAAACTAAT TTTTCTTGGT TCTGTATTAG TTTTCGTGGC TAACCAAGTG ACACATATCG 5460

CTCAAGGCAT GAGCTGGGCG GATATTTTTT CCACGATGGA ACAACAAAGT ACGGGGCGTT 5520

TAATTGGTAT GGTCTTGGCA GGACTATTAG GGGTTATTCC gATGCTCCTT TATGACTATG 5580

TAGTCGTTAA ATTATTGGAA AAAGAAGGTA AGCCACCAAT GAAACGAATG GATTGGTTAA 5640

CTTCGGCTTG GGTCACCAAT ACCATTAATA ACTTAGCGGG TTTTGGCGGC GTGGTTGGGG 5700

CAACGCTACG GATTAATTTT TACGGAAAAG ATGTACCGCG GGGCAAGGTT GTAGCAACGG 5760

TCTCAAAAGT TGCTCTGTTC TTGATTTCAG GGTTATCTAT TTTATCTTTT GTGGCGTTTG 5820

TTGATTTATT TTTTATTCGA ACCCAAAATG TTTTTCGTGA ATATTGGGTT TGGTTATTAT 5880

TGGGCAGCCT GATTGCGCCA GCGCTCTGGT TTTTCACGTA TTTAAAACGG CGAACATTGT 5940

TTAAAACATT TTTTCCAAAA GCAGTGCTCC TATTATTCGG CGCTTCATTA GGTCAATGGC 6000

TAGGTGGGAT GTTTGCTTTC TTGATGATAG GTCGCTTGAT GCAGGTGCCA GTTTCGATGG 6060

TTTCTGTTTA CCCAATGTTT GTTATCGCTA CGTTGATTGG GATGTTAACT ATGGTTCCTG 6120

GTGGTATGGG AACGTTTGAT GTCTTAATGA TTTTGGGCTT ATCACAATTA GGGATTGATC 6180

GATCACAAGC GATCGTCTGG TTGTTGTATT ATCGTTTGTT TTATTATGTG ACGCCTTTTA 6240

TGACAGGAGT TATTTTGTTC TTGCAACAAG CAGGCATGAA AGTCAATCAA TTTTTTGATA 6300

ATTTACCTCG GTTATTTTCA CAAAAAGTGG CGCATTTTAT TTTGGTGGCG GCGTTATATT 6360

TTGCAGGGAT TATGATGGTT CTATTGTCGA CTGTGACGAA TTTGTCGAAT GTCAGTCGTT 6420

TATTCCAAGT GTTACTTCCC TTTTCTTTTA ATTTTTTAGA TCAAACATTG AATTTATTTG 6480

TAGGATTCTT GTTATTAGGT TTAGCTCGAG GCATTTCTAT GAAAGTGAAA AAAGCTTATT 6540

GGCCTACAAT TATTTTACTA GGATTTTGTA TTGTAAATAC GGTGGCACGA ACGACTTCGT 6600

GGCAGTTGAT TGCTGTTTAT GCTGTTATTT TATTGGCGGT TATTTTGGCT CGTAAAGAAT 6660

TTTATCGTGA AAAGTTTGTT TATTCATGGG GCGCTTTAAC GGTAGATAGT ATTTTGTTTG 6720

GCTGCTTGTT TATCGGTTAT GCGGTTGCTG GCTATTACGC CGCACGACCA GCTGGTGGCA 6780

ATCAAGTTAT CAATCACTTT TTGTTGTTTC CGTCAGATGA TGTTTGGTTT AACGGATTGA 6840

TTGGATTAAG TATTTCCTTG ATTGGTTTAT TCTTTTTGTA TCAATATTTA GCCGAAACAA 6900

CTGTTACTTT AGGTGAGGGA TTTGAGAAAG CGCGGTTGAC ACGCTTTTTG GAAAAATTTG 6960

GTGGCAACGA AGGCAGTCAA TTTTTGTATT TAAAGGATTA TGGTCATTTT TATTACCAAG 7020

AAGAAGGCGA AGATCAAGTA CTTTTTGGGT TTCAAATGAA GTTTAATAAA TGCTTTGTCT 7080

TGGCGGATCC GATTGGGCAA CGAGAGAAAT GGACAGCGGC GACACTGGCG TTTATGGATC 7140

AAGCTGATTT GTTAGGTTAT CAATTAGTTT TTTATCGTAT TTCGGAAGAA TATGTTATGA 7200

ATCTTCACGA TTGCGGCTTT GAATTTATGA AAGTTGGGGA AGAAGGCTTG ATTCAATTTG 7260

ATGAGCCGTC AACAGTGAAT CAAACAGCCT GGACGGAAAC AGTGACTGAA AAAATAGCAG 7320

CGGAAGCAGC GGATTTTCAG TTTGAATTTT ATCCCGAAAC GATTAGTGAT GCCTTATATC 7380

AAGAGTTAGA ACGAGTTTCT GCGGACTGGT CACGGAATCA AAAAGAGCGC TACTTTATTG 7440

GTGGACGGTT GGATCCCGAG TACTTGAAAT GCAGTAGCGT GGGCTTGGTG AGACAGAAAC 7500

AAACGGTCAT CGGTTTCATT ACTGGAAAAG AAATGGAAAA AGGCAAAAGT ATTTCGTATG 7560

ATTTGTTACG TATTCGTTCA GACGCACCTG CTTTTACTAG AGAATATTTG TTCACTCATT 7620

TTATTGAAAC CTATCAACAG CAGGGCTATC AATTAATTGA TATTGGGATG GCGCCGTTGG 7680

CTAACGTAGG GGAGAGCAAG TACTCTTTCT TAAAAGAACG CTTTGTTAAT ATTTTTTACA 7740

AGTATAGTTA TCAAATTTAT GCGTTTCAAG ATACACGCAA GCGCAAAGAA CAATATGTCA 7800

CTAGTTGGCA ACCACGTTAT TTTGCTTATC CTAAGCGGAC CAGCGTACTC TTTGCTTTTG 7860

TGCAACTTTC CTTGTTGATT ACGAAAGGAA GACATCAAAG TGTTTCGCTT GTGGAAGAAG 7920

CAATGACTGA AATTTAAAAA AATCACGCGA TGACTTAGCC ATCGCGTGAT TTTTTATTTG 7980

TCTTCTTTTT TACGGTTTCG TCTAGCAGGA ATCAAGGTTA CGTAGGCAAC GAAACCACCA 8040

GTACATAGAA CGAATAACAC GATGCCAGGA ATCAATCCTT CTGGTAAATA AGAGAACTGA 8100

ATAGTATGGG TTCCAGCGCT GACAGGAACG CTTAGAAAGG CATCTTTAAA GGCTTTTGGG 8160

GTAACTTTTT TGCCATCAAT TTTGACACGC CAACCTTTGT CATAAGGAAT TGTAGTGACG 8220

AGTACTTGAT CTTTGTCGGC GGTAAAGGTA CCTGAAGCAG AGCGTTTGCC AGTTGTTAAA 8280

TCAACGCCTT TTTCTTGCAC GGCGGAAATG GCTTTTTCAA ATGCATTGGT ATTTAAGCCA 8340

ACCACTTGCG GCTGAACAAA ACTTACTGCT TTGGTTCCAT AAAAACTTAC TTTGAAGTTA 8400

ACGGTTGTAT CTTTGGGATA GTAACCTAAG TTATAATATT GGCCAGTAAT GCCAATTTGT 8460

GATTGCTGAC TTGAGCCGTT CACGGTGACC GTCGCAGTGG AACTTTCCAA TTGAGCAAAA 8520

TCCGTTGGGA ATAAACTCAA ATAAGCTTGC GTATTGGCGG GCACATTGAC TGTATAAGAA 8580

ATCTCTTTGG CTACGTTATG TTGTTTCTCC GTAAAAGTTA CGCCAGCTGT ATTTTGCGTA 8640

ATTGTCACGT TATTTTGAAG CGTCATCGTT GGTTGATAAA AGGTAAAATA CCGTTCATTA 8700

GTATTGGCTA AAGCATTTAG TAAATTTGTT TGACTACCTA AATTATCATT GAGAGGTTGC 8760

CGGACATTGT AAATCTCTTT GTCTGCTAAA AATCCTAACG GCAAAGCATT TTCGTTACGA 8820

TAAAGCTGAT ATTTACCAGC AGCAGCTTGT CGCTCAAAGC CGAATTTCAT TGGATTGTTT 8880

TCAGCAATAT TGTATTTAAT GCCCATCAAA GCATCCATTA ACAAGGTATT ATTTTGGTAA 8940

CGAATATTTA AATTCGTGCC TCGTGAGCGG AAACCTAAGG CGTTTAGGTA TGTTGAAGAA 9000

TGTCGGTTCC GTACGGAAGA AAACATGCTA ATGCCACTGT AGCCATAGTT AATGCCGTCA 9060

TTGGCCGAAA CACCATTTAA ATTTTCCAAG CGGTAAAAGG TATCATTTTC TTTTTTTGTT 9120

TGATCGACTA GATTTTTTAA GTCAGGATAA GGCTCAGAAT ACAAGCTGCG TGATGCGTAG 9180

TTCCAATCAT CTAAAATACC AGTGACCATC GCCCGTGTGT TAATGATTGC TTCACCAGAA 9240

ACGAGCAGCA ATAACAAAAT CACTAAGTAT TGCATAGGCG CTTTTTTCAA TTGATAAAAA 9300

GCAATCCCGA AAAAGTATAA TAGTAAAAAA GTTGCCGTTA ATACGAAAGA GGTTAAAGTC 9360

ACATACGTAT AACTAGTAGC ACCTTTTGTG CCCATTGCAA GAGCAAAAAT TGCTAAAAGA 9420

ATCAAAATCA AGCCGCTTAA GACACCGAGA TCATCCTTTT CAAATTTCTC CCAACCATAA 9480

CCTGCAAGCA AAATAACTAA GAATGAAAAG AGGAAACTAT AGCGGAAAAG AAACATGTTC 9540

GGCGCATGCA TACCGTGCCA GAATAAATTT AAGGGCGTGA TGTAAAAGCT AGCAATCAAT 9600

AGGACAAACA AACTGCCAAA AAGTAGTTTG TTTTTCAATG GAACTTCTTT CGTAACGAAA 9660

TAGAAGAGAC AAAAAATCAA AGGCAACAAC CCAATGTAAA TAAAAGGAAT GGACCCATAT 9720 

TTGGTTGTAT CGTAAACGCC AATCATATTT TTCATAATAA TATCTAAAAA GGCAGTCGCT 9780 

TCTGTTTTAA AGGTTGTAAT TTCAGAAAGA GTTTCACCAT TGGTTCGTAA ATCTAAGACA 9840

GCAGGTAAGA CCATAATCAT GGATGCTCCA CCAGCTAAAA GAGAAGTAGT AAAATACGCA 9900 

ACGATTCGAG ATTTATATCG TTGCCAGTCT GTAAACGTTC GTGCAAAATA GTACAGGAAG 9960 

GAAAACAACC CAATCATAAA GCCAAAATAA TAATTTGTAA TGAAAAGCAA GAAATAACTG 10020 

ACAAACAATA ATGTCGGTTT TCGTTGGTCC ATTAAACGAT GAATTCCTAA AATTACGAGT 10080 

GGCAGATAAA TAAAGGCATC CAGCCACATA ATTAATTCTG AATGGGCCAC AATGAAGGAC 10140

ATTAGCGCAT AAGACACACT TAAAGTCACA TGGGACCATT TAGGAATCTT AAAGGTTTGT 10200 

TTTGCCAAGA ACCAAAAACT CAAGCCAGCA CTCCCAATTT TCAACAACGT TAGGAAATAG 10260 

AGCGCATCAG GCATGTTTTG ATTGTTAAAG AAAAAGACGA GCGGCGTAAA TAAGCCACCA 10320 

AGATAATAGG AAATTAAGGA CAAGTAGTTT AGTCCCAAAG AAGCATTCCA CGTATAGAAA 10380 

AGACTTTGTT TTCCATGAAG AACATTGTTA AAGCTTGCGT GAAAATTTGA AAATTGAGAA 10440

AAGGCATCGC TGGCTAAAAC ACTGCGACTA CTTCCTGGAT AAATCCCAAT ACTCAAATAG 10500 

ATAATTGCCA TAATTAAAAA TGGAaTGAAG rAACTgscTA ACATATAAGG CCAGTTGTtT 10560 

TTTAAAAACT GCcTACTTT AtTTTTCATA AATA 10594 
(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1087 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 

ATGAATTAGA GCCTTTAATC GAGTTGGGCG TAACTGGCCT CCGGnTnCnn TAGCGGCATT 60 

ACGATAGAAC AAATGGCTCA TGCTTCACAT AAGATTGATA TAGGATTGAA TGCTAGTACC 120 

ATTACACTGG AAGAAGTGGC TGAGTTAAAG GCTCACCAAG CGGACTTTTC TCGCTTAGAA 180 

GCTTGGCATA ATTATTATCC AAGACCTGAA ACCGGCATTG GAACAACATT TTTTAATGAA 240

AAGAACCGCT GGCTTAAAGA ATTAGGCTTA CAAGTTTTTA CTTTTGTACC AGGAGATGGG 300 

CAAACAAGAG GCCCAATTTT TGCTGGTTTA CCAACATTAG AAAAACATCG GGGGCAAAAT 360 

CCTTTCGCCG CTGCCGTTGG TTTGATGGCT GATCCTTATG TCGATGCTGT TTATATTGGT 420 

GATCCTACAA TTAGTGAGCG AACCATGGCT CAATTTGGTT ATTATCACCA AACTAATCAA 480

TTTCTGTTAG AAGTAGCACC TAGCGAGAGC CGCTATTTGA AGCGAATACT TGGGACTCAT 540 

ACCAATCGTT TAGATGCTGC AAGAGATGTT TTGCGAAGTG AATTATCAAG AACGAGTGAA 600

ATGTTCAGAA AAGATGAAAT TGCaACGATT GAGTCAGAGC AGACTGAAGC ACGTCCAGTG 660

GGaACCGTAA CGATTGATAA TGaAAAATAT GGTCGCTATA TGGGaGAAAT TCaAGTCACA 720

CTTGTnGGAC TTGCCGAAAG ATGaAAAAGT CaATACGATT ACACGAATTA TTGaAAAAGA 780 

TCiuAACGATA TTGCCGTTAA TCcAAGGCAG GCnATCaATT TACGCTAGTC ACGGAAGGnA 840 

CGATAGAGAA TGAATTTAGA AAACTTAACA ACTGAACGTA GAAATGAAAA TACAATGGGC 900 

TTAGATGAGA TGAGTGTCAA AGAAGCTTTA CAAAAAATGA ATCAAGAAGA CCAAAAAGTT 960 

GCAATGGCAG TGGGTCAAGA ATTAGCAGCC ATTGAACCAG TGGTAGAAGC CATCCATCAA 1020 

AAGCTTCAAT CAAGGGGGCC GCTTGATTCT CCnTGGGCCC TGTACCAATT GGCCGGTAAG 1080 

GGGCCTA 1087 
(2) INFORMATION FOR SEQ ID NO: 442:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442: 



ATTGTGATTT TTTCAATTCA GATTCTTCAT CGTTAAGATG TCATTAAAGA CACTGACTAA 60

GTAATCGkTG ACTGTTTCTA AATTAGGTTC CATTCTGTGC ACCATCCATA TACTTTGATT 120

ATCAAAACAT TTGATGTTCA AACTATATCG TAAATAAGCA AAGAATGCAA GAGAATTTTC 180

TATATTTTTA ATGAAATATT CTTCAAATAT TTCAAAAAAG GCGCATTTTC AAGGGACGAG 240

GCTTTTAAAA AAAGGATCAA TTACTTTGAT ACCCAAACTA TTTTCACAGA AAAAGGAAAG 300

AAGTGCAACT AAAAAGCATC ACTGGCACTC CCCTGATTGA GAAGTGCCAG TGATTTTTGT 360

TATATAATTC TTAAATTCCC ACTAAGCTAT TGCTACAACT TGACTGGATA ATACGAGAAG 420

AGAGTGAACC AGAAGCTTTG GTGCTTCTAC TCCACTCCCT TATTAAAATG TAGGACGATT 480

AATTATTGAA TTAAGTCGTA AATTTCCATT GCTACAATAT CGATGTTATC GAATTGATAG 540

CTTTGGTGCG TATCAGATTC TGTTAATTCA AATGTTTCTG TTGATTTATC AAAACTTACA 600

ATACATTTCT CTACGCCTTC ACGTTCAAAG CGGCGTACTT GTACTTCATT ATCTGCTGCT 660

TCGATCATTG CTTCCAGTCT CTTAATAATC GCCACAAGTT GTGAATGTTT CATCAGTGGG 720

cCTCCTTTTG CAAACTAGTT CTGTACCATT GTAACAAAAC AT 762


(2) INFORMATION FOR SEQ ID NO: 443:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2012 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443:

CAnnAbAbb 1


GCTTCTGTTG 


^tvttvtaPPTATaT 
n A f AAbb AA 1 


1 1 1 b 1 lb 


p a fa a a a tpp 


a aap.TPTaap 

rlrVrtb 1 b 1 M/ib 


60 


bbAIbl lb 1 b 


bAAA 111 Abb 


rnrnrn f T\ AO-it, 7\ 


P T ti TrTltT a a 
biniblbl rtM 


aaaacppapp. 


aT^aTpparp 


120 


/"-» 7\ Tl TV TV A T» /"•/"• 

GAAAAbA X bb 


ATTTCGATAA 


Till bAbAI b 


rpm*"»rp A a 'FlfT"! 1 
lib 1 H/Al 111 


pTrvTraaTaa 

1 ri 1 i^rtrV 1 rtr\ 


PTTTPaPP.TT 
bill b/ibb 1 1 


1 fth 

lOv 


TCTTCGCTCC 


TATTCTGCTG 


r m a a Arr*TAr 
CTAAAbb 1 Ab 


Pfi APTaTTP-P 
b>iAb 1 H 1 1 bb 


bbb 1 Hbbb-H 1 


TTTTPT'Pr'i TV 

1 1 I 1 b 1 I bAA 


0 Ad 


TGGTTGTGCC 


AATTCTACCA 


ATTGTTCTTT 


I b 1 AA1 Al Ab 


bbbAl I bbAl 


A A A rn rp i_ 

AaGsAATTtb 


oUU 


TTCcAAACAa 


GsGACTTTTA 


tv tv m rn m m ^ ta 

AATTTTGACG 


m m rp rp +-/"■• A 7\ T> 1. 

TTT1 tbAAl K 


b 1 f TbAATAA 


ATGkTGACGC 




TTCTAATAAG 


GATTCGTGTG 


TGCCTGkATC 


T AACC AAG C A 


AaGCCACGGC 


tv rrl rn tv rn s** 

CCATTACCTC 


4 20 


AACAGAAAGT 


TTATTTTTTT 


n ^ rn it ^ tv 

cAAGGTAGAC 


TTTATTAACG 


TCTGTTATTT 


z^i rn tv tv m m f** f> f** 

CTAATTCCCC 


4 80 


ACGTTCTGAC 


GGTTTGATCC 


CTTTTGCAAT 


mmoiv TV /"• r"» TV rp 

TTCAACCACT 


rn/"'Tvrrirri/"»rn/""</"*rp 

TCATTGTCGT 


AA TV 7\ "A rp 7\ f~* 7\ 7\ 

AAAAATACAA 


b4 U 


ACCTGtCAcT 


GcGtAwTTkG 


ACTTCGGTTG 


TGCTGGTTTT 


rp r~*rnrr\ r+ t\ tv rp tv r 

TCTTCAATAG 


TV TV Ar^rO/^rr 

AAAGCGCCCG 




CATCTCTTCA 


TCAAATTCTA 


<<S TA TA ^ A tv TA 

CCACACCGAA 


ACGCTCTGGA 


TCATTTACAT 


GGTAAbbAAA 


DDU 


GACTGTTGCC 


CCTGATTCTT 


TAGAAGCCGC 


TCGTTGCAAC 


ATTTTTGATA 


AGbCGCCACC 


"7 o n 


ATAATAAATA 


TTGTCCCCTA 


AAACTAAGCA 


GACACTGTCA 


mon f~*t~* tv tv rn tv tv 

TCACCAATAA 


ATTCTTCAbb 


Ton 


AATAATAAAT 


GCTTGCGCCA 


AACCATCTGG 


GCTTTCTTGT 


ACCGCGTATT 


CGATATGAAT 




GCCTAAATCA 


TGGCCATCGC 


CGAATAAGCT 


fn rn ta tv tv tv r 4 ^ ^ m 

TTCAAAACGT 


GGTGTATCTT 


CTGGTGTAGA 


yuu 


GATAATCAAA 


ATTTCATTAA 


TTCCCGCCAA 


y-i tv mm tv tv m m 

CATTAACGTT 


GACATTGGGT 


A A rp Tv Tv t\ rrv O T\ rp 

AATAAATCAT 


yt>u 

t 


TGGTTTGTCG 


TAAATTGGCA 


TTAATTGTTT 


TGATGTTGCT 


TTTGTTAACG 


/-> tv rn a m a a t^r^r* 

GATATAAGCG 


i n on 


TGTCCCGCTT 


CCGCCTGCTA 


AAATAATTCC 


mm m tv m^ tv tv ^* 

TTTCATGAAC 


tv Tvrri/™»m/™ l '/ , 'mmTv 

AATCTCCTTA 


A A rpA r^rnrn (~^rr\y\ 

AATACTTCTA 


xUoU 


TTTTTAGTTT 


CTATAAAACC 


AAATATCCTT 


/-^ /™* m tv mm tv m 

CCCTATTATC 


GCATTAAGCA 


TAGGGtTTbl 




CATGTTTTTT 


ACGTAAATTT 


TACAAATCTG 


m tv ti t\ tv /"* mmffi tv 

TAAAAGTTTA 


GCGTTGTTCC 


rprprpA A AOrP r P 

TTT AAAbbb 1 


i onn 


GATGTAAAGC 


TTTTAATAAA 


TTCACACGCC 


AATTTAATTT 


TTGTTGGAAA 


^mrp^rprprprp/->/~- 

CTTb 111 lbb 


i o f;n 


TAATAATAGG 


GCTGACATTC 


GCTCCATGTC 


m m /rn a tv tv /■"* 

TTCGGTAAAG 


CACTAATGbl 


lbl 1 1 AAlbA 


i ^on 

1 JZU 


GACCCGTTTG 


CTTTTTCCGT 


/"•/"•TV /""/TP A TV m TV 

GCAGCTAATA 


AGbCAAlbbA 


bAl Ai bAI bb 


TVPTAPPfi A PTT 

A 1 Abb/irlb 1 1 


i JOv 


CTGGCGGAAT 


GGGTAAAATG 


ACGTTTTTCA 


TCTCTTbAbb 


AAA rrrr a rp a 
AAAbbbbA 1 A 


PPTPPPPPTA T\ 

bb 1 bbbbbArt 


1 A A fl 

1 fi *i U 


TATAGCCACT 


TTTTATCGCA 


m m m f~* /~~* tv tv tv 

TTTCGCCAAA 


ACCCTGGTT 1 


P A PTTTTP P TA 

bAb 111 lbbA 


TA TaPTTTA ta ta tat 
AAb 1 1 AArlA 1 


i ^nn 

1 jUU 


AAGAGGGATT 


GGTAACTTGT 


tv Tv tv rn /~* /~> tv rn 

AAATCCGCAT 


/-> T\ TV T\ TVrprpmrp 

CAACAATTTl 


b AAb 1 bAb 1 A 


A rp A TAPPTAPTT 

AlAAbbAbl 1 




bbA 1 b I b Abb 


blbbbl 1 IbA 


7\ tv A r P7\T r T l PTr 

AAA1A1 1 b lb 


1 b Ab 1 b 1 I b 1 


1 fib 11 inl 1 I 


c^paappaaa 


1620 


CGTCATCTTG 


ATCTGCTAAA 


7V A rp A rp/*>T\/TflfTI 

AATATCACTT 


CGCCTTTbb x 


TV TPPPT TA TA P ^ 

Al bbb I AAbC 


PPTA BTaTP^TA B 
bbAArtl bC/irt 


1DOU 


AATTAGCAAT CACTCCTTGC CCTGGaCCTT GTAACAATTG AATTTGGGGA TAATTCGCTG 1740

CATACGTTCT CAAAATTTCC AACGTATGaT CAGTAGaAcC ATCaTCTGAA ATAATTAGtT 1800

CATCyTCtTC ACTGACyTGT AAAAGAATAC TATCTAATTG TTCCGCGAGA TATTTTTCTC 1860

CATTATATGT CGCAATACAA ACTGAGATCA TAGATTATCC TTTCATGAAC TTGAGAAATT 1920

CTGCCAATGT TTGCCGCCAA ATTTGGCGAT TTTTGACAGT TAnAAATnGC TTACTACTGC 1980

GTAAAAATAG ATGGCGTCGA TGnGGGAAAA CT 2012
(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 



GACATTGGTG TTATATTACT TACAAGTTGT TTTTTCATTG GTTTTTATCA GGGTCCTGTT 60

CTGATAAAAG TTAGTGAAGA GACCTTCACT AACGTAACGA CTCGCCGTGC TAGACACCGA 120

GACGGTACGT TTTTTTTATT GGTTAAAATA GCTGCCGCTT GACATCATCG CGTCATGCGG 180

TTTTTTTTGA AAATAAGTAT TGTGGGCGCT ATTTATTTTG AATTTATGCT AAAATGAACA 240

CGATTGATAT TTGAGGAGCT GAAGATAGTT GACGAACCAT ATTGTTTTAT TTGAACCACA 300

AATTCCAGCA AATACTGGAA ATATTGCACG GACATGTGCA GCGACGAATA CACCGTTGCA 360

TTTAATTGAA CCTTTAGGAT TTTCAACAGA TGATAAACAC TTAAAACGTG CCGGATTAGA 420

TTATTGGAAT GATGTAAATA TTACGTATCA TAAAGATTCG GCTGCCTTTT TAGCCCATGT 480

GGCAGGCAAA CACTTGCATT TAGTAACCAA ATTTGCTCAC CAAACATACA GTGAGGTCAA 540

TTATAATGAT GGCGAAGACC ATTATTTCCT TTTTGGAAAA GaAACAACGG GCTTACCAGA 600

AACGTTTATG CGAGAAAACG AAGAAAAGTG TATCCGAATT CCTATGAACG ATGAACATGT 660

GCGCTCATTG AATCTTTCTA ACACTGCCGC ACTAATTGTT TATGAAGCCT TACGTCAGCA 720

AAATTTTCCG AACTTAGAAC TGACGCATCA TTATGAAAAT GATAAATTGG ATTAAAAAAA 780

GTTAAAACAG CAGCGCTTTT TTCAAAAATG TCAGAAGAAA GTGGTTTTAT TTTTACTGTT 840

GCCCAATTTC CAAGCCTAGA ACAAAAAACA ATTTAGATTT TTGTTCTAGG CTTATTTTGC 900


t t a n a t t a c t 




ATflHATATTT 




CSTflATAAAPT 


AKAAACIACAT 


960 


ATGAAGAGAT AAAGAAGTTT GGAGAAGTTA CATGAAATTA TATTTTACAC GACATGGAAA 1020

AACTGAATGG AATCAACAGA AACGATTTCA AGGAATGACA GGAGATTCGC CGTTATTACC 1080

AACTAGTTAT GACGAAATTA AACAATTAGG CCAATATTTA CAAGATATTC CGTTTGAAAA 1140

AATTTATTCT AGTCCGCTAT TACGTGCCAA AAATACTGCG CGGGGAATTC AACAAGAATT 1200

GACTCATCCA GTTGAAATTG TTTATACAGA TACCTTAAAA GAATTAGGTT TAGGTCGTCT 1260

AGAAGGACAA TATATTGAAG AAATGCGTAA TTTTTATGGA GAAGAATTAG ATCATTTACG 1320

GCATCGTTTA GATTTGTATG ATCCAACAAT TTTTGATGGT GAACCAATCG AGCAAGCGAT 1380

TCAACGGATT TCAGAAACAG TTGCTGAAGC GGCGAAACAG CACGAAGGTC CAGTTCTATT 1440

TGTCGGACAC GGAGCAGCGC TGACGGCGGC AATCCAAGCG ATGGCTGGTA AACCATTATC 1500

AGAATTACGG ACAATGGGCG GGCTGTTAAA CAATAGCCTC TCAATCCTTG AAACCAAAGA 1560

GGCATCAAGG AACATGCCGT ACGACTTAAC GTTATGGAAT GATACGTCTT TTTTAGCTAA 1620

AGAAAAAGCA CAATAAGCAC ACAACAGAAA GGAGGGCTTG TAGTGGTAAC ACTTTTTGGT 1680

GAGGACGAAG AAAAAGCCTT TATTGTTGGA ACTGTACAAG CAATCTTTTT TGAAAATCCC 1740

AGTAATTTTT ATAAAGTCGT TCTTGTAAAT GTGACCGATA CAAATACAGA TTATTTAGAA 1800 

AAAGAAATTG TAGTAACTGG AAGTTTTGGC CAAGTTCAAG AAGAGGAACC CTATCGTTTC 1860

TTTGGTCATT TTGTGGACCA TCCTCGTTAT GGCAGACAGT TTCAGGTGGA TAGTTATCAG 1920 

CAAGAAAGAC CGACTTCGGC TAGTGGGGTG GTTAACTATT TATCAAGTGA CAAATTTCCT 1980 

GGCATTGGTA AACGAACCGC TGAAAAAATT GTTGAAGTAT TAGGGGAAAG TGCGATTGAC 2040

CGAATTATTG ATGATCCTAG TGTTTTGGAA GAAGTTACTG TTTTAAATGA AAAAAAACGT 2100 

CAAGTAATCG TCGAAACAAT TCGCTTGAAT CACGGGATGG AGCAAGTGAT TGTGGGACTA 2160 

AATCGATACG GCTTTGGTAG TCAGTTAGCT TTTTCAATTT ATCAAACCTA TCAAGAAGAA 2220 

ACCCTTTCTG TGATTcAAGA AAATCCCTAT CaGTTAGTTG AAGATATTGA GGGAGTCGGA 2280 

TTTAAACGAG CGGACAATAT TGCTGAACAA ATCGGCATTC AGGCGGATTC TGCTGTTCGA 2340

ATTCGAGCGG CGATTTTACA TGAAGTGTTT GAACAGTCTA TTCGTTCAGG CAATACGTAT 2400 

GTTCAAGCAG ATGTTTTGTT AGAAGAAGCG ATTCGGACCT TGGAAGCAAG TCGCCCAGTG 2460

GAAATCTCAC CTGATCAAGT GGCCAATGAA ATCATTACGT TAgTCGAACA TGGAAAAATC 2520 

CAACAAGAAG AGACTAAACT TTTTGAGAAC AGTCTCTATT TTTCTGAATG GGGCATTGGT 2580

ACTTCGATTC AACGTTTACT TTCTCGTAAA AAAGAAATTC ATTATGAGGA AGAAGAAGTC 2640

CAAAAAAATA TCCGAATGAT TGAGAAACGT TTAAATATTC AGTATGGCGA CTCGCAACAA 2700 

GCAGCGATTG AAGAAGCGAT TAAATCGCCG TTATTTATTT TAACTGGAGG GCCCGGAACA 2760 

GGGAAAACAA CTGTTATCAA TGGGATTGTC TCGTTGTTTG CGGAACTAAA TGGCCTTTCA 2820 

TTAGATCTCA AAGACTATAC GCAGGAGATG TTTCCAATCT TGCTAGCGGC GCCAACTGGG 2880

CGAGCAGCGA AACGTATGAA CGAAACGACA GGTTTACCTG CAAGTACCAT CCACCGTTTG 2940

TTAGGCTTGA CAGGTCGTGA AAAAAATCCG AGTTTAACCG CCAAAGAATT AGAAGGTGGT 3000 

TTGCTAATTG TAGATGAAAT GTCCATGGTG GATACTTGGT TAGCCAATAC GTTATTAAAA 3060 

GCTATTCCGA CAAACATGCA AGTGATTTTT GTTGGTGATA AGGATCAGCT ACCGTCTGTC 3120 

GGTCCAGGTC AAGTCTTACA TGACTTATTG CAAATTAACG AAATTCCCAA ATGTGAGCTA 3180 

AACGAAATTT ATCGTCAAGG AGACGGTTCA AGTATTATTC CTCTTGCCCA TGAAATTAAA 3240 

GAAGGAAAGT TGCCTGCTGA TTTTCAAAAA AATCAAAAGG ATCGCTCGTT CTTTGCAAGT 3300 

GATATTGGTC ATATTGAAGA ATACATTCGA CAAATCGTCA CAAAGGCAAA AGCAAAAGGG 3360 

TTTACGCCAC AAGATATCCA AGTTTTAGCG CCAATGTATC GTGGCGCAGC CGGCATTGAT 3420

GCCf TGAATA AGATGATGCA AGAAATCTTC AATCCAAATG ATGGTAAGAA AAAAGAAGTC 3480 

AAATGGAATG ACACCGTCTA TCGGATCGGG GACAAAGTTT TGCAACTAGT CAATACGCCA 3540

GAATTAAATG TGTTCAATGG AGATATGGGA GAAATCGTCG GAATTACTTT AGCCAAAGAC 3600

TCAGAGGACA AAGTGGACGA GTTGGTTTTA CAGTTTGATA ATAATGAAGT AACGTATAAA 3660

AGAAATGAAT GGAATAAAAT TACGCTATCG TATTGTTGTT CGATTCATAA AGCGCAAGGT 3720

AGTGAATTCC GCATGGTGTT ACTGCCAATG GTTCATCAGT ATAGCCGAAT GCTGCAACGA 3780

AATTTGcTGT ATACAGCAGT GACACGTAGT AAAGAATTAT TAATTCTACT AGGTGAGGTT 3840

TCTGCTTTTG AAACATGTGT CAAAAACGAA TCGGCTAGTC GCATGACGAT GCTAAAAGAA 3900

CGAATCGTGA ACGCAGAACA AATGACCTTA ACGACTAGAA CTCaGTTAGA AGCCTATGAA 3960

GAAGGTTTAA CGGCAGACCA TCCATTTACT GAAACGGAAA CAAAAGCTGT CTTTTATGAA 4020

ACTGAACAGC AATCAACGAA AGCCGATCAG ATCAAAGAmA CAGACGAGCA ATTAGTTGAC 4080

ACAACGGTAC AAGAGGTTTC ACTATTTGCT GATGAAGGGG AAGAGTCAAC CTCACCAGCT 4140

GAACCACAGA AAGAAGAACA ATTACCTGAA AATCCTGTAT TGTCAATTCA AGAAATTCAA 4200

GAAAATAAAA TTAATCCAAT GATTGGTATG GCTGGAACCA CTCCATATCA GTTTATGTGA 4260

AAATAAATAA TTAATTATGT TTGTCTGTTT AAGAAGTCAA TTTATTTTAT TTGATTCAAT 4320

ATTATTAAAT AGTTTTGAAT GTTTGTTTAA GCTTTGCAAC AGTTTTACAA AAAAATAAAA 4380

AAGTGAGACG ATGAATAAAT ATTTTTTACA GATATTTATT CATCACTTAA ACTGATAGAG 4440

AGGAATTAAT TGTAAAAATG AAACGAAAAA GTCTAAAATC ATTTATATAT GTAAGCGCAT 4500

TACTTGTTTT TTTGAGTGGT TTTGTTGGTG TTAGTTTATT CGTTAAGTCA GAAGTAGTAG 4560

AGGCTACAAC GATAGGTGAT GACTATCCTG TTAAATGGAA AAATCTTCCT TTAGGCGGGG 4620

CGATTGATGA TTGGAGAATG TATACTAGAT ATTGTACCTC ATTTGTCGCT TATAGATTAA 4680

GCACAGCTAA TAACTTTGAG CTTCCAAGTG GCTTTGGAAA TGCTGATCGT TGGGGAACAG 4740

AAGCAATGGC GAGAGGTTAC AAAGTAGATA AGAATCCTAA AGTAGGATCG GTGGCTTGGT 4800

GGACATCTAC GCATGTTGCG TGGGTAGCAG AAGTATCTGG CGACAATGTT AAAGTTGAAG 4860

AGTACAATTA TGGATTCGAT GGTAAATACA ATACTCGTTG GATTAACAAG AATTCCGCCA 4920

ATGGATACAT TCATTTTAAA GATATGCCAC AAACACCTGT AGGTTGGTAT AACAACGGTC 4980


r\\* x. c\ X un 1 In 


1 1 lnl inl 


fir" a c a can a a 


r'aaarrrr'aT 


1 KjKj 1 U iftAL 1 




5040


CGAAACGCTA TAATTTCGAT AAAAATGGTG CCATGTACAA GAATGCTTGG ACAAACAGCG 5100

wTAAATATTC ATATTATAGT ACATCAGATG GCTCATTAGC GGTTGGTTTA ACTTGGGTAG 5160

GTACGAAACG CTATAATTTT GATAAAnATG GTGCCATGTA CAAGAATGCT TGGACAAACA 5220

GCGTAAATAT TCATATTATA GTACATCAGA TGGCTCATTA GCGGTTGGTT TAACTTGGGT 5280

AGGTACGAAA CGCTATAATT TTGATAAA 5308



(2) INFORMATION FOR SEQ ID NO: 445:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:



GCTAAGCTTC 


CTGTTAAAAC 


CACGTTATTC 


GGAGATTTTG 


TTAGTTTTCT 


TTTCATATGT 


60 


TTTCACCTCA 


TCTACTAAAT 


TTTACACTAG 


AAAGGCTGAT 


TCGTCAGCCC 


ATCTAGCTT A 


120 


AAATTGTTAA 


TTATTTATCC 


GTATCTTTTA 


GGAAAATACT 


TCCCGTTGTT 


GTAGAAGCAT 


180 


TAATTTGAGC 


CATTGATTCT 


TCCAGTACAC 


GTCTAAAATG 


TAATTGTTGA 


TTGCCGCGAT 


240 


CTTTCTTTTC 


ACGAACAACT 


TCAATATCTG 


TTAAACGGCT 


GTTAATACTA 


CCTAAATTAG 


300 


TTTTCACTTG 


CCCTTCAACA 


CCTAAGTCAT 


TTGGCAAGGC 


CAATTTGATA 


TTCCCATTTG 


360 


CAGAGCTTGC 


TTCAACACGG 


CGAAcGTTTT


TTCTTTAGCT 


GTAATGCGGA 


TATCTCCATT 


420 


AATCAGAGAG 


ACACTTAGAC 


TTTCTGGTGC 


CGCGGAAATA 


GACACATCAC


CATTGACTGT 


480 


TTCAATAATG 


TTATCTAAAA 


TCGTTCCTTC 


TAGGACTTTA 


ATTTCACCAT


TCACACCTTC 


540 


AATTTCTAAC 


ATAGTTGCAT 


CAATTTTnTT 


AAACGTAATC 


GTCCCATTCG 


TTGATTTTGT 


600 


GTAAACATCT 


TTCGCTGTTA 


ACTCTTCTAC 


TAAGACATTT 


CCGTTTAATA 


ATTTAACAGA 


660 


TACATGATCG 


TAAGTGCGTT 


TTGGTAAATA 


GAACGTTAAA 


TCTGCTTTCA 


CCCGTTTGTT 


720 


TGGCACTGGG 


AAAGAAATCG 


TTCAC 








745 



(2) INFORMATION FOR SEQ ID NO: 446: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

TGGGTACggG CCCCCCCTCG AGTTTTTTTT TTTTTTTTTT TCTTTTAAAG CATTTATTTA 60 

GAAAAATAAC CGTTCCCTGC CCGCCTTGCT CTGTGCCATG GGACTTGAGC ATCTTTCTTA 120 

ATACTCTTCT GCATTGGGCC TGCAACCCAC AGTGGTGGGG TCCAGCACAC GCCCCTCGAA 180 

ACGCTGCGGC ACTGCTGGGA GGCCTCACGC CCTTCCAGTC GTGACAGTTC CCGTGTTCAC 240

TCCTTGTCCA GGCCTCTGTC TCCGGGTCTC CAGGTGGACA GGGGCCTCTG CGGGATGGGC 300 

GCAgcTCTGC CACAGTCTGT GTGCCCCTCC CGArGCTCTG CTCTCCCCGC ATCsGAAGCT 360 

CArGGCTGGA ArCGCGGGGA rCTGACATGA rGTTGGTGGA ATGCsTkGGC gCTGTaCACC 420 

ACCTCsGGGG GCGAgGCTTC TTGCn 445
(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447:



TGATGTATGT 


TTCGTATATT 


CCGCAAATTT 


ATGGAAATTT 


ACATGGGGAA 


AAAGGCAATC 


60


CGACGCAACC 


ATTAGTGGCA 


ATGATTAATT 


GTATTTTTTG 


GACGATTCAT 


GGGTTATATG 


120 


GAGACGATGG 


AGAAACACGA 


GATAAGTCCA 


TTATTTTTGC 


TAATGTGCCC 


GGAATTATTT 


180 


TTGGyTTTyT 


TGCGTTTATT 


ACAGCATTAT 


AGTTTCCGAC 


GAATTGGAAA 


GTATTCCAAT 


240 


TCTGTACATA 


AAAAAGAGCG 


GGGGAAATTT 


CCCACGCTTG 


AAGAATTAAA 


TCTTTTTCAA 


300 


TTGTTTGTAA 


TTAATAACAA 


TCACACCATT 


GCTGTGCACC 


ATTAAGTGAT 


TGTTTTTTTg 


360 


CGtTTCGTCA 


ATTTCGACGA 


CAGCGGCATT 


TTTTAATTGT 


TTCGTAACAc 


GCCTGTTTGT 


420 


GGCTGTCCGT 


GAACAAGAAA 


AGTAACAkGG 


GkGyCAaCGG 


CGAAATCCTT 


ACCAGCATCC 


480 


GATGGAACAA 


CAGGAGTAAC 


CATACCATTA 


AAATTTGACA 


TACAAAAAAC 


CATCCTTTCT 


540 


TGATATCTCT 


ATTATACCAC 


ATTTTCAGAA 


AAAAGTTTGT 


AAAGTCGCTA 


TTTGGCTTAT 


600 


TTTAAAGAGA 


TTTTGCATAA 


ATAAATAACA 


AACAGACTTT 


ACAAATAAAG 


TAGAAAGTCC 


660 


TATAATAGAT 


TTGTTTCAGG 


CCAGCTTAGC 


TCAGTTGGTA 


GAGCAACGCA 


CTCGTAACGC 


720 


GTAGGTCGTA 


GGTTCGAACC 


CTATAGCTGG 


CATAGATTGA 


AAAGACAGAT 


ACACCAAGCC 


780 


GTAAAGGTAA 


TAGGTATATC 


TGTCTTTTTT 


GTGTTTTGAA 


AGTTTTCGCT 


GCCTAAAATT 


840 


GCCTACTTAT 


TAAAAATTAG 


TATTTAAAAC 


ATGTTATTTT 


ATAGCTCGAA 


AATATCATAA 


900 


TTATCAAATT 


TATCGCTCAT 


TGATTTTTGA 


GAATTAGGGT 


ATAAATGACC 


ATAAGTATCA 


960 


ATTGTTGTAG 


TAATTGAAGA 


ATGACCTAAT 


CGTTCTTTTA 


TAGTAGTATT 


CTTTTCGTTA 


1020 


TTGTGAATAA 


GAAGAGCTAC 


GTGTGAATGT 


CTAAAGTCGT 


GTAATCTTAT 


AGGTTTAAGG 


1080 


TTCAAGTTGT 


TTTTTACAAT 


TATTTTTTTT 


ATTTGTTTAG 


AGTAATAATC 


GCTCGAAGGG 


1140 


TGTTTTTCAC 


TGTATTGAAA 


AACAAGTAAT 


TTGTCACTAT 


CAATATTTTG 


AAAATTATCT 


1200 


ATTAGTAATT 


TTGGTTGTAT 


TTCTTGCCAT 


TTTTTTAACA 


TGGCGGTAAT 


TTTTGAGTTG 


1260 


ATAGTAATAT 


ATCTGTTGCT 


GCTTTCCGTT 


TTTGGTGTGG 


TAGTTACGTA 


ATTACCATTA 


1320 


ATTAGTTTAG


CACTTTTATT 


TATACGGACT 


GTTTGATTAT 


AAAAATCAAT 


GTCGCnCCAC 


1380 


GTTAAAGCAA 


TCATGnCTCC 


TGGCACGCAG 


TCCTGGAAAA 


A 




1421 



(2) INFORMATION FOR SEQ ID NO: 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 

AACATGTAAA CAAATCGTAC AGATAACACG CAAGTTGCGT GTCGAGAGAG AGGCGGTTTC 60 

TATTCTTATA GAAACTGCTT CTATTTTTTT ACGAAAAAAT GAAAACTATA GAGGTGATTA 120 

AATGGAACAA AAnAACGTAC GATTATTCCC TGCCGTTTTA GCGACAGGAA TTATGTCTTT 180 
TGCTGGCGTA TTAATTGAAA CAGCAATGAA TGTAACCTTT CCGACATTAA CAAAGGAATT 240

TGGTGTTTCC ACCGGTACCG TGCAGTGGGT GACAACGATT TATTTATTAG TTATTTCTAT 300 

CATGGTGCCC TTATCCAATT ATTTATTAAA AACCTATTCG TTAAGACGCT TATTTATTGT 360 

TGCGAATCTT TTCTTTTTGA TTGGTTTAGC GATTGATGTC TATTCGCCAT CTTTCAGTAT 420 

CTTGTTATTA GGCCGACTCT TCCAAGGGGC TAGCACTGGG ATTGCTTTGC CACTGATGTT 480 

TCATATTATT TTGAACTTTA CCCCGCTGGA AAAACGAGGA ACGATGATGG GGGTAGGCAC 540

ATTGACCACT TCAATTGCTC CAGCCATTGG TCCAACGTAT GGTGGAATCT TAACATCTTC 600 

CCTGTCTTGG CACGCCATCT TTTTATyCTn AATnCCtATT TaCTGCTTyC TTJcGTTwAtG 660 

GGcTtAtCTG CTATcCAGAA aTACCTGTnA AAAAAACCAC CACTCTgrAT TgGtaGTTaA 720 

TgGgAtgCCC yGTaATaGtG GtTGtgAgGT TTAAGnAAAT GGCCATGTTG GCGGCTTCAC 780 

TCTGCAG 787 
(2) INFORMATION FOR SEQ ID NO: 449:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 901 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449:



TTTTACCTGC 


ACCATTCGGA 


CCAACTAATC 


CATAAATTTT 


CCCTACTTCT 


AAATCTAATT 


60 


GATCAACCTT 


TAAATTGAAT 


GTATCACTTA 


CTTTTTTATA 


TAAATTATTT 


ATATTTATTG 


120 


TTTGCATTTT 


GCAATTCAGT 


CCCTTCTTGA 


CAATGAAATA 


AGTGAAAGTA 


ATGTAGTTAA 


180 


CACGAACCAA 


AAACAAACTA 


CAATAATATT 


AGAAACATTA 


ACAGTATTCT 


GTTCATAAAA 


240 


CAGTGATCTT 


ATCAAATTAC 


TCGATGGAAC 


AAATGGTAAA 


AATTCAAAAA 


AGTTCACTAA 


300 


AGCAGAAGGC 


AATCTATCTG 


TTGGATAAGT 


AATAGGCGAG 


AATAAAAGTC 


CACCTATCAT 


360 


AATTATCTGA 


GTTGgCTAAT 


CCTACTACAT 


TAGGTGTTAA 


CCAATATGCC 


AAAGcAAAAC 


420 


CTATAGAGAT 


CmTACATAAT 


TGAATTAGAC 


TAAGAATAGC 


yAGGGCCTTC 


ATtGAGAATC 


480 


yAATTTGGAC 


ATCGAATCTT 


AAATATGCCG 


CTAAAAATGA 


CATAAACACC 


CCAGGTAGTG 


540 


AGGCCATTCC 


CCAAATTATA 


AAGTCGGAAA 


TTAATATGTT 


CAACCGAGAA 


ACTGGCAATG 


600 


TCCTTTGATA 


GTCCACAATG 


CCCTCCTGTT 


TTGCTGTACT 


TACTATTTGA 


GCAGCAAGAA 


660 


CGCACCCTAC 


CGCAATAATC 


CCAAGAGTAG 


TTGCTCCAGA 


TGACAGATAG


ATAGCTGTCT 


720 


GATTATTCAC 


ATCTGGTATT 


AGAAGTGCTA 


ACCCGTAGAC 


AATTGCTAAT 


GATAGAAATA 


780 


CTTGAGTTAA 


TGTAAAAACA


GGCAATAAGT 


ATTTAATGTC 


TTAATAAACT 


CCACCCTACT 


840 


AAAGGCAATA 


ATCCTTAnTC 


CATGATnGCT 


CTATAATTTG 


nATCATTTCC 


ACnATTAACT 


900 
(2) INFORMATION FOR SEQ ID NO: 450:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4239 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 

GAATACGAAT GATACTGACA CTTnATGAGA TCAAAGAACG TGTGAAACCT GTAGCTGAAA 60 

AACACAATGT TCCAGTAATC TACATTTTTG GTTCCTATGC TAGAAGCGAA GCAAAAGAAA 120 

CAAGCGATAT TGATTTTGCT GTAGACATAA CTGGAGAAGC TGATGAAGAA GAaGCTTACT 180 

GGAATyTATA TGGAGATTTT TTTGaAGAGT TGAAAGAAGC GGTAGGACAT GATATTGATT 240

TAGTCGAATT AGAGACATTA TTTGATTCAC AAGATAAACC AGAAAGAAAA GCTTTGCTTG 300 

AAAATATTTT GAAGGAGCGT GTAAAAGTAT TTGAAAGATA GTACAGAAAA AGATTTGCGT 360 

GTTCTAAAAG GGACGTTAAT CTATATAGAA GATATTGAGG AAACACTCAA TCGTTGTGGA 420

AGACAGTATG AGGTCTTTTA TCACGATAGG GTGTTTTTTA ATGCCATAGG AATGGCTTTA 480

ATGCAAGTTG GAGAACTAGG AACAGGAAGC AATGGATTAT CGGTAGAATT TAAAGAGAAA 540

AATTCAGATA AAGTAGACTG GAATAAATTA AAGAAATTAA GAAATATGTT TGCACATTCG 600 

TATGATTCAA TGAATGAGCG ATTTATTTTC CGCATGGCAA CAATGTTTAT TCCGGTGTAC 660

AAAGAATTTT GTATACAAGC AATAGCAGAA TTGCAAAAGA GACGAGGTAA ATAAATGTTA 720 

AAAAAATTAT TTAGATATAG AGGAAGGCGT ATTCGTTATT CTTCAAGAAA CCTGTTGGCA 780 

CTCTATCGAG TGTTATTTTT TATGCCAATT TTAGCTTGTT TAGGGTATTT TGTTGGCTAT 840 

AAATGGATTT ATCCTTTATA TCTTTCAAAT CCGCCTGATT GGAAAATATA CATTGTGCCA 900 

GCGTTGATAA TTGTAGGTAT TTCTATAGGT GCAATTGTGC TAATTACATT GCTTATTAAA 960 

GCTTCAATTA TTAATTCGGG GTATTTTTCT AAAGTGGaAC AACGTCAAGT ATTAGCaCAT 1020 

ATGATTATTG ACAACGGGTA TTACACaAAA AAACAAGTGA AAAGTwsTGA yGGyAAAACA 1080 

ArAGAAAAAA TCAAGTTTCC AAAAATTTAT TATAAGTCTG CAAAAAATAG TATTTTyGTT 1140

TCATTTGAAA CAGCrGGTAA TAAATTTCAG GAAAAATTTG AAACCATTGG TGGCTTTTTG 1200 

GAAACAGCGT TTCATGCGGA CAATTTAAAT AAGATTGATG AAAAAGGCTT TATAACCTAT 1260 

GAATTAGCTT CTGACGTTTA CAATAAGCGT ATTTGGATAA AGGATATGCA AGCGGACAAA 1320 

GGAAAGGTTC AATTAATGAA AGGGTTATAT TGGTATTTTG ATAAAGACCC ACATTTATTA 1380 

TTAGGTGGCG GAACTGGTGG AGGTAAAACC TTTACCATTT TATCTCTAAT TTATGCGCTT 1440

TGCCGTGTGG GTGAAATGGA AATTTGCGAT CCTAAAAATT CTGACTTGAT GGCACTAGGT 1500 

AAGTTACCAA TGTTTGCAGG AAAAGTGCAT ACAGGAAAAA CAGATATTGT TAATTGCTTG 1560 

AAAAATGCAG TTGAATTAAT GAATGCACGT TTTGAAATGA TGAATAATAG CCCTGATTAC 1620 

AAAATGGGGA AAAATTATGC GTACTATGGT TTAAAGCCTA AGTTTATTGT TATTGATGAA 1680 
TTTGCAGCAT TTAAAGCAGA ATTAGCCAAT GACTATTCAA CAGATGGAGA AGTTGATGAG 1740

TATTTAACGC AATTGATTTT AAAAGCTCGT CAGTGCGGTA TTTTTCTAAT TGTCGCTATG 1800 

CAAAGACCAG ATGGTGAATT TATTAAAACG GCTTTACGTG ATAATTTTAT GTTTCGAATG 1860 

AGTGTTGGCC GTCTATCTGA AACAGGrATC CTTATGATTT TTGGTGATGA AAACAAAAAT 1920 

AAAAATTTCA AATATGTAGA GAAAATCGAT GGACAAAAAG TATACGGTCG TGGGTACGTT 1980 

GCACAAGGCG GTTCTGTCGC AAGGGAATTT TACAGTCCAC AAGTACCAAC AGATTTTGAT 2040

TTCATTGAAG AATTTATCAA AATTTCTAAA GAATTAGGTT ATGAAGATGT GTCAAAAGAA 2100 

GTGCAAGAAG AAGTATCTCA AAAAATAGCT AAGCATGTAG ATAAAGAAGC CCTTGCGGAA 2160 

ATTGATGAAG AGTTAGAAAA AGGTCAAGCA AAGTTAGCTA AGTTAAGTGA AAAATTTGAG 2220 

TAATGAGGAA GGAGATTTTT ATTGGATTCT AGCGAAATAA TCAAACGTGT TAGAGAACGA 2280 

GTGTATGTTG AAGTCAAGAA AGGATATAAA AAGCCTGATT TAGATATACG TATTCGTGAC 2340 

ATTCTGTATG AACAATCAGA AAGCTATGCA AAATTAGTGA GGTTTTCTAA CGGAAAGCGA 2400 

ATAAAGAAAT TAGCTGACCC TGTAGAGTTT GAAAAATTTA TGGAAAATAG GGGGGCTAAA 2460

ATTGTTGATG AAGTGATTGT TGGTTTAAAT GAAAAAGGAG AAGTATAAAT GGAATTTAAA 2520 
AATAGTAATC AACTTATGGA ATATTGGGAA ACAAAAGTAA AGGAATTTGA AAAAGAACTT 2580

AATAGTGGAA TTTCCCAATT GTCAAAAGAA GAGAGACAAG AaTCATGTGT TGTTGaAGCa 2640

AAAAwATATT TAGAAGaAAA aAGAAAATAT GCGAATAATC TTCATGCGTT ATTTTATGTT 2700 

GAAGGTAGTT ATCAGAATTC TAATGAAGTA GATAGGAAAT TTGATAGAGA ATTCTATCCA 2760

GCGTTTTATT CGTTTTTAAG TGAAATTGAT ATTTTAGCTC mTTAATAGrA AGGaATAGTG 2820 

ATTGGTATGA ACGGtAAATT GACTCTTGAA GAGTTTTATA AAAAAATGTC TTCGGAAATT 2880 

TATAGAAAAG TGAAATTAAA ATACAAGAAA AAAGATTTAG ATGATAGGTT TAGTCAGGTA 2940

TTGCATAATT CGTCTTTTCG TTTTATTTAT AGAAAATATC AGAATCGTCC TGATAGTTTA 3000 

CTTACTTATC AGGAAAGTGA AATGGAATTG GATAAAAATT TAGATGGTCT TGTCGATGAA 3060 

GTATTAAAAG GATTAACAAA TGTTCGTCAA ATTGATTTTT CGGAATATCT GGAAACTGTC 3120 

AAACGTGCAA CATTTAAACG CTGTTCTGAA AAAACAACAA AGTATTTTTC TAGTCAAGAT 3180 

TTTAATTCTA TTTTTAGAGA AGAATGTTTT GACTTTGTGA AGAGTGCTTT TAAAAGAGAT 3240 

TCAGACGGAG AAAGTGTTAT TTGCTGTGAT GATTTAGATA TTTTAATGGA AATAGTAGTG 3300 

AAGGATTGTG TTGAAAAAGT TATGCGTGTA ATTAATAAGT AATGCAGGGG GAGCAGGTGA 3360 

GTGAATATGT TCTCATTTGA AGAAGTGCTT TTTGAATTAG ATAATGCTAT TTCTTTGAAA 3420

ACGTTGAAAA ATTGGGCAAA TAAAATTGAA AAATTAACAG ATGTTCGATT TGTTCGCCAA 3480

TACGCTAAAA ATAAACAGGG TAGAAACTAC TCGTATAAAG TTTTTAGTGT AGAACAGGTT 3540

GAACAGTTAA AGCAATTGGT TCAATTAAGG AAGCAAAATG TCCCGCTAGA TCAGGCAATG 3600 

ATTGAAGTGT TTATGTCTGC AGAAGAAAAA GAAAGGCAGC AAGTAATTAC TATTGCAAAA 3660 




ATAGATTTTG AAGAAAATAA AGCAACGGTG AAAGAATTAA TTGACTTAGT AAAAGATGTC 3720 

TTGTCGGATA ATGCAGAGAT AAAAAAGCGA CTGAAAGTTC TAGAAATGAA GCAAGGAATG 3780

ACAAGAGAGT AGACTAGTAG TTGCTAGGGA TTGTTGCGTG ATTTGTAGGT GGTGTAGGGG 3840 

TGGCGCCCTA AGGCGCCCGC CCCTACACCA CCgTACGCAA AATTAGCAAC GTATATCCCA 3900 

AATACTTCTA GTCGTGAGGT GGGCTTGTCC CACCTCAAAA GTTTCTACCC CCGTTTTATA 3960 

ACATGGGGGT AGAATTTAAC ATACCATATA AAAAAGTTAC TTAAACGCTA TGTCCCACAA 4020

GGGATTGAGC GATTTTGTTT TTGTCACATT TTTTGACTAA AGTTGTCACA TTTTTTTGGA 4080

AGTAGGTTTA AAGCAGTGAG AGGTGTTGAT TTAAAGCGAT ATAGAAAAGA GCTAAAATTA 4140

AAACAGCAAG AATTGGCAAG TAAATTAGGA ATTGAGCGTT CGTTAATTTC AAAAATAGAA 4200

TCTGGAAAAC GTGTTATTTC TAAAGAGTTA GAACAAAAG 4239



(2) INFORMATION FOR SEQ ID NO: 451:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 



GCATATTAAT 


GTAGTCCATT 


TCTCGCTTCC 


TTTCATGGTG 


TATGCATCTA 


TAGTATCCGT 


60 


TTAAACAAAT 


TAAATCAAGA 


TAATGATAAT 


TAACACCTTC 


TTAATGTTTA 


GGACGTTTCG 


120 


ATTAAAGTGG 


TTATCAGAGC 


TTTCCGATAG 


TAGCGGTTAT 


AGGAACGTTC 


CTATAACGTT 


180 


GGCAAGCAAT 


GAnCCGGAAT 


ATCCTTTTGG 


GATATTCCGG 


GTnGCTTGTT 


GGGGTTTCCC 


240 


CAAGCCCCAG 


GCCGTCCGCC 


TGCGCGTCCG 


GCnGAACACG 


CCTTTCGGCG 


TGGCTTTGCG 


300 


CTCCCTGCGG 


TCGCTTCCTC 


CTGCACATCC 


GCGCAGGAGA 


GGGCGGCGCA 


AAATGCGCTG 


360 


TCAGGAGGTT 


TACGAGATTG 


ACAGAAAACT 


TTATTTCTCT 


GAATCGTTCC 


CTGTTTCGTT 


420 


CTTTGTGATG 


GTAAGCTGCC 


CTTGCTGGCA 


TTTCACAGTG 


ATTTTGTCGC 


CGACAGAAAA 


480 


TCCGGCCTGT 


TCCAGCCATT 


TTCCTTGTAA 


AACAATCTGC 


GTCGGAGCCT 


GCTTGTAATT 


540 


GCCATGCCCG 


TAACATACGG 


TCAGTTTGCG 


GTTATCCATC 


CGTCCCCCTT 


TCCCGCAACC 


600 


TGCCGGCATA 


CGATTTCAAA 


TACATTCCCA 


TCCTCAGTCC 


TGACAGTGAG 


CAGCCCGTTC 


660 


CGCTCGTCTG 


TGTCCAGGTC 


GGCAATCCCC 


ATCCCCTCGC 


TGTCATTCAG 


TAAATCAAAA 


720 


AGCCTGTCCT 


TGAAATAGTT 


CAGTTCCATT 


TTTCAGTTCC 


CCTTTCGTAT 


TTTATTGTTG 


780 


CTTATTAGCG 


AATACTTAAA 


AACAAATCCT 


GCCGCCAGTC 


TGCCGGTTAA 


GCAAAACGGA 


840 


TGCTGCAAAA 


AATCAGTGCC


GTCCCGGTTG 


GAAAGGGGCG 


GCGCTTTTGC 


GTCACCGTTC 


900 


CGGTTTGGTC 


TGTTCTTTTT 


CCTGCGCCTG 


CTCCGGCGGT 


TCCAATCCTA 


AAATCTTATC 


960 


CATGTTGCGG 


AGTGCGGTTt 


GATAAGTAAT 


CATATCCTGC 


CGCAAgTCTT 


GTACTGTtCA 


1020 
TACTGTTCCC GTTTTcTTCa GCAGCGCGGC ATATTCCGCT GAAAGTGCGC TACTTTGGGA 1080



ATAGCCTTGC CATTCAGGGC ATCAA 1105

(2) INFORMATION FOR SEQ ID NO: 452:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1095 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452:

CGCCAATTGA AGCCCCACGA ACAAAAGGTT CATTTTGTAA GGCTTTGAAA TTTTTGGTTT 60 

GTCCGGTGAG CATGACTCCC CAAATAGGTA ATTTTTCGAC TTTTTTATTT yTaTyTkGTT 120 

GGATAAmCTT TTGGCCTyCT TGATTGTCAG CaAACAACGG GGATTTTTCT AAGGTTTCAA 180

AAAAGTGTTG CTTGTCGTTT TCAGGATCAA ATATTTCGTT TTGATCGTCC ATGCTTAATT 240

GAAAACCATA AGGCAGCGTA CCAGAAGGAC CTTCTGCTTC ATTAACCGTT TCGGAGTAAA 300

CATAGAGCCA AACTAAGTTG ACATTGCTCG GTAACATTTt GCGGACTTCT TCATATGGAT 360 

AAGGTTTATC AAAAGAAATC GCCACTTCCG CTACTTGGTT TTCCATTTTA TCTAATTCAC 420 

TTAATTCGTT TTTGATACCA CCATAGTAAT TGGTAATCGC TGGGTTATAA AAAGATGCTA 480 

CTTTCTGCTT GGTTTGTCGG TTGTATTCAT AAAAATTCTT AGAAGAATAA TAGCTACCAG 540

GATAGACTTC ATTCCAGTCA ATCGAATTTC TGAAAAAGCT ATATCTGCTA CGTAAATTAC 600 

TCCaCTCCAC CGTGTAACCA TCAATATTTT TTGAACGATT GGTCACAACT TCGCCCCCAA 660 

AAGGCGAACT ATTAGCAATT ACTTGGGAAT CTAATTGAGT GTTTGGTGAA GCAATCATTT 720 

CTTGATCATT AATTACTTCT AGTAGGGTAT GATAGTTTTT CGTCGAAAGT TTATCCATTC 780 

CCACAAAAAA ACCGACAAAC AATACGGCAA AACTAACAAC GGTTATGGCA CCAATCACGA 840

TAAATTGTTT TCTTTTCGCT TTACGAATAC TTTTTTTTAA ATCCACTTCT ATTCCTCCAG 900 

TTCTTTTTTT AATTTTTGAC GTGTGCGATA CAATTTTTTG CGAACATTTy CAaCmGATAA 960 

TTCyAAAATC mACGcAaTCT CGGcATACGA TAATTGaTAA TAATATTTCA AAAGTACTAG 1020 

TTCTTGTTCC TCATCTTTTA AGGaATCGAT CAATAATAAT AACTCTTGCC CTTCCATTTT 1080 

TTCTATTGAT TGCAC 1095 
(2) INFORMATION FOR SEQ ID NO: 453:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453: 
TGTCGGGGAG ATGGAGTTTC CTTCCTCAGA GCTTTTCTAT TACCAGCAAC AACAATTTGC 60


TTTTCCGATT


GATACTTCAA 


TGAATGTAGA 


AATTGTCACC 


AATAAAAAGG 


CATTAGCTAT 


120 


CGTTCGCAAC 


AAGAAAAAGG 


AATTAAAAGA 


TTTAGACAAC 


CACGCTTACC 


AATCTGACAA 


180 


TGAAACGAAT 


TCCAATGTTT 


TGGATGCTTT 


GGATTCCGTT 


GATGAATTGG 


AGACCACCTT 


240 


GGATCAGTCA 


AAAGAATGAT 


GAGGAAAAAT 


AGACTAGTCA


AAACAATGCC 


TACAATCCGT 


300 


AAAATTTTCC 


TTTTCATCTC 


CTTCACTCCA 


TTCTCTcTTG 


aACCGGTGGG 


CGAGTATCAA 


360 


AGGCATGGAA 


GAGTTCTTCA


AAGACCGGGT 


GAACCTGAAC 


CACACCTACC 


CGACCATACA 


420 


AATCCTGCAT 


CAAGCATTGG 


CCGTTTTCCA 


AATCTCGTAG 


GCGTTTTTGA 


TTACTTTCAT 


480 


CTTCTTGATC 


CAACCCAAAA 


AATTCCAAAG 


TCTTCTTGAT 


TTCATGAATA 


TCGGTACTAC 


540 


GAAAAGCAAA 


TTTCAATCCG 


ATATTATTCT


TCAAGCGTTC 


ATCCAGTAAA 


TCGTCACTAT 


600 


TTTGCGTGAC 


AAAGTAAACA 


CCGGCATTCA 


TGGCCCGACC 


AGCACGAACG 


AGCTTATTTG 


660 


AAAGGGTCTT 


tCCTTGAGCC 


ACTTGAAGAA 


AACTCCAAGC 


CTcATCCAGG 


TcCACAATcT 


720 


TGAAGATTtC 


TCGATTACTA 


TGGATAAAAT 


CCAAGGCAAA 


CGTGGaAATG 


ACGATTAGCA


780 


TAGAGACACT


TAGCAACTCC 


ATCGTGGTAT 


ATTCTTCAAA 


AGTGGTTTCT 


GCATCTGGTA 


840 


AAACCAAATC 


CGCTACTTGA 


ATGATATTCA


ACTGCTTATC 


TAAACTAATT 


GATTGGGTAA 


900 


TCGAACCATC 


TGAGAAAAGC 


AAATGGGCAA 


AATCATAATC 


GGTAAAGGAC 


TCGATATGAT 


960 


CGGCGATATT 


TTCCGCGACT 


GGAGAGTTCT 


CCTTTCGAAG 


CTCATTAATG 


ACTTGCAGCA 


1020 


ATCCTCGTTG 


ATCACTTTGG 


GTAACCGTCC 


GAATGGCTTT 


GCGCAATACT 


GGAAACTTCT 


1080 


CCCCATCCCG 


ACTGGAAATT 


CCTGTTAAAA 


AAGTCAGGAT 


ATCAATTGCT 


AAACTTTCTG 


1140 


AATCTTTCTT 


TTTCTTCATA 


ATCACATAAG 


GATCAAGTAA 


GCCTTGATTT 


TCTGGTTTAC 


1200 


TAGTCAAGTT 


GAGAATATTG 


ATTTCATCCG 


CAATTTCTGG 


CAAGTTTTCT 


TTCCAACCAC 


1260 


CTCGTTCTCC 


TTTTGGATCG 


ACAATCACAG 


CTTGGCCACC 


AAAAAGAACA 


GCATAGTAGA


1320 


CCAAGAGATT 


GTTACTAAAG 


GATTTTCCGC 


CACCAAGAGA 


ACCAAGAAAA 


GCCGCAGCTA 


1380 


ATGCATTGGT 


TACTGAGCCC 


TTTACTCCTT 


GTGAGGCTAA 


TGCGGGTTTC 


AGGTAAACAT 


1440 


TACGCCCCGT 


ATCCGCATTG 


TAGCCAAAGT 


AAATCCCTTC 


CAACTCTCCT 


AACATTTGAG 


1500 


TGGCGCCAAA 


GCCAAGTCCT 


GCTAAAAAGT 


CAGATGTTAC 


ATACTGGATA 


TAGTCATTCA 


1560 


TATACCGTTT 


ACTAGCTGGT 


AGGAATTCTT 


CATGTAGCCC 


CATCGTATCG 


CCAAAGGGCC 


1620 


GGACTAGTTT 


CACATTGGTA 


TCATCATAAA 


AATCGAACAC 


TTCATCACAA 


CGGCGTTTTA 


1680 


GTTCATCCAC 


TGATTCTGCT 


CTCACCCGGA 


CCACATAGCT 


CAATTTGTAC 


ATGGATTCTT 


1740 


TTGACTGATC 


CAAGGTGGTC 


TCCAATTCAT 


CAACGGAATC 


CAAAGCATCC 


AAAACATTGG 


1800 


AATTCGTTTC 


ATTGTCAGAT 


TGGTAAGCGT 


GGTTGTCTAA 


ATCTTTTAAT 


TCCTTTTTCT 


1860 


TGTTGCGAAC


GATAGCTAAT 


GCCTTTTTAT 


TGGTGACAAT 


TTCTACATTC 


ATTGAAGTAT 


1920 


CAATCGGAAA 


AGCAAATTGT 


TGTTGCTGGT 


AATAGAAAAG 


CTCTGAGGAA 


GGAAACTCCA 


1980 


TCTCCCCGAC 


AATGGTATTA 


ATTGTCAAAT 


AGGTTACATA 


AGACTCATGA 


TTCTCATGTT 


2040 
CAATCCTTAA


GTAACGAGGT 


TTCTCTTCAA 


TCAAACAACG 


ACTGGGACGT 


AATAAGTCAT 


2100 


.&GCGTTTCAC 


CAAaGTTTCT 


GATTTCAATT 


TTTTCTTTGG 


TAACTGAAAG 


GCGTATTCTT 


2160 


CAAACGGTGT 


ACCTTTTTCG 


CCATAAATAT 


GTTCAATCAA 


GTAAGTAAGA 


TCACTGGGAT 


2220 


TCACTCGTCT 


GACTTTGAAT 


CTCCGCGCTA 


ATTTACTTTC 


CATCAGTTTT 


TCAAGCTTGG 


2280 


TATATCGGCG 


AATTTCTTCA 


TTTGATAATG 


AAACAAAATC 


GCCCATCAAA 


TGATGATTGA 


2340 


CCTCATAGAT 


AAACTCTTGA 


AAGCCGGAAA 


AGAAGGATTT 


TTTGAGACTT 


TTCAAATTCA 


2400 


CTTCTTCATC 


TGTAGCAATC 


AGTTTGAAAC 


CAATAAAAAA 


GCGATAATCA 


ATCTGACTAT


2460 


CGCCAATCAT 


GGATACTAAT 


GCGTCTGTCT 


GTAAATCAAT 


CCGTTGTAAT 


GCTATTTCTT 


2520 


TGAGTCTACC 


TATAACTTCC 


TTTTTAGATC 


GTTCCTGAGT 


GGCACGAATA 


CTACTTTCTG 


2580 


TCGCAATTTG 


TAGCGCATGA 


ATCTTACCTT 


CTCGATTTTG 


GGCAATCAAT 


TGGCGGAAAT 


2640 


TATCATGCAA 


TTGATATTTT 


TGTTCAGGAG 


ATAAGAAGGA 


ATAGTTATAA


GGTATTAACT 


2700 


CATAATAAGC 


AAAACACTCT 


CCTTCTTGAT 


TAAAAACAAG 


ATTATTTTCG 


ATATACTTAA 


2760 


TCGGGTACAT 


AGTTCACACT 


CCTCACTAAG 


GTAATGGGTT 


CATCCCATCG 


TTTTTTCTGT 


2820 


TCTTTAACTT 


TTTTTCCTGC 


AAAAGTGACC 


TTAAATCGTA 


AATGATAACT 


GATAAAGGAT 


2880 


CGTAAAAATC 


CCAATGGTTT 


CTTGCCATCA 


AAAGTCTTTT 


GCGACATAAA 


CCAAGTAATC 


2940 


GCCACCGGGA 


TGCCAAAGTA 


TTTTAAAAAT 


GCCCCCTCTA 


TCACAGATAG 


TGGAGGCACA 


3000 


TGGGCAAATA 


CAATGACAAA 


CAGCAAGGAT 


AAAACAAACC 


AAGCCATTTG 


GTTAAAAGTT 


3060 


ACAGGAAAAG 


GCAGCTGAAA 


GTCATTGATT 


GCGTAAATGa 


CTTTTTCAAC 


TGCCCAGATA 


3120 


CTGGTGTAGC 


TTTTTATTTT 


TTTCATTAAC 


TCATCTCCAA 


ATATATTGTT 


TTTTAAATTA 


3180 


AATATTCTGT 


TATAATTAAA 


ATTAATTAGG 


GGGTAGATAA 


GTTTGAAACA 


AGAAGAGCTT 


3240


AATCTTAAAT 


TATATGAGTT 


GCTATTGAAA 


ATCCAAAATG 


AATCTGACAC 


ACTAACAAAT 


3300 


GCTGACATTT 


CAGAATCAGA 


CGCAGATTTT 


TATGGATATC 


TTATCTCTAA 


AGAATTAGTT 


3360 


TATAATTTAA 


AAATAGAACA 


CTATTACAGC 


GGTCCAAATA 


TAGATGCTAG 


CCAAGCAATA 


3420 


ATCACAGATA 


AAGGATATGC 


ATTTATTAAT 


CAAATTGGTG 


AAGCAGATAA 


ACCTATTAAA 


3480 


ATGAATCGTG 


AGAATCGATA 


CAAACAACTT 


CAGATATTTC 


TCAAACGATT 


AGATGACCAT 


3540 


GACGCTGATT 


TAAGAACTCC 


AAATTATCGT 


GTTCAAGAAA 


TTGACTACTA 


CGACTTAATT 


3600 


AAATATGCTA 


TTGATTCAAA 


TCTCGTTAAG 


GGAATTGCTA 


TTCGATATGC 


TAGCAATAAA 


3660 


CCCCATCTTT 


TTTTATCTCA 


ATTTGCAAGG 


GTTACTACAG 


AGGGATATGA 


TGTATTAGAT 


3720 


ACTCCCTATC 


CAAAAAATAA 


TACACCCGGA 


GCAACCATCA


GTAATACATA 


CAATATCTAT 


3780 


GGTGGAGATC 


AAAAGGGTGC 


GAGTTTTGGC 


AGTAACAACA 


CAACCAATAA 


TAATTAAAAG 


3840 


GAGAGATTAG 


TATTGGACTA


TCGAACCAAA 


GAAAAGAAGT 


TAAAGCAGCT 


CTTAAACTTC 


3900 


ATTTATGATG


GAATTACAGT 


TACGTCAGAC 


TTATTGGAGT 


TATCTCAACA 


AGAATTCGCA 


3960 


TCATTACTGG 


AAAGCGCAAC 


AAAATCTGGc 


TACATCCgAG 


GTGCATCCAT 


AACAAAAACC 


4020 
AAAATGTACC CCATAATTTG GACTGACGAT GATATGTCAA ATGGACGTAA AGATAAGTGA 4080

TTTTTTTCGG CACCAGTAAG AGGT.GCCGIT TTTGATTTAT TGGATAGTCC CGACAAATTT 4140

GTAGTAGATG GTAATGCGCT TTTCCTTACC CTTGCGGCAG CGCCCCATAC CCGGCGTTTC 4200

GTAAATCGTG ATCCGGTCAA TCAGTTCGTG CAGCATGAAG CGGTCGAGTT TTTCAAAGCT 4260

GGTGTATTTT TGGATAAGGG CGGCGAAGCG GTCAATATCA GCCTTTGCAT CCCGAACCTT 4320

TTCCAGTGCT TTTTCCAGTT CTGCAACTTT TGCTTTCAGT GkTTCCCGTT CCGTATCATA 4380

GTCCGCGATA AACATCGCAA AGATATGGTC GGGCAGTTTG CCGCTCATGT TGTCCTCGTA 4440

GGTGCGTtTC AGTTtCACAT CCAGGTCAGC CAGCCGTTTT TTTGCTGATG CAAGGTCACG 4500

CTTCATTGCT GCGGTCTGTT TCTGCTGGTC GGCGCATTTC AGTTCCATTA GCTTTTGCGC 4560

GGCTTTCTCG GTGTCCTCGC TGAACAGCCT GGCGTTGCGC CGAATGTCCT CAAGCACAGC 4620

CTGATGAAAG GTGTCACGGT TGATGTAGTG GGCGGAGCAG GCTAAATTCC CAATCGCATG 4680

cTGCGCCCGC AGGTATAATA GAAGTATTTC TGCTGGCTGA CTTTCGGAAC CATTGAAGTG 4740

CCGCAGkTTC GCAGAAGAAC AGTCCGGCAA ATAAATCGGG TTCGTCTTTG GCTTTGATGG 4800

TCTTTTTCCG CGCCCCCATA CGTTTCTGCG CTTCGTCAAA CAGTTCCCGT GATACAATAG 4860

GTTCGTGCAT ATTGGGAACA ACGATCCAGT CCTTTTTGGG TATCGGTTCC CGCTTTTTGC 4920

TGCGGTAAGA GGGTGTGTAC TGCCTGCCCT GCACCATGCT TCCTACATAG ATTTCGTTTT 4980

TCAGCATGAA CTGCACATAA GTCCGTGTCC AGAAACTGCG GCGCTCAAAT TCGCCGTCAC 5040

GTTCCGGGTT GTGCTTGCGG AAACGGGTGT ACTCTGTTGG AGACAAAATC CCCTCCCTGT 5100

TCAATGTCGT TGCAATGCTC TTTGACCCTA TCCCGGCCGC ACACATTTCA AACATCCGCC 5160

GGACAACCGG CGCGGTTTCT TCGTTGATAA TCAGTTTGTG CTTATCGTCA GGATGCCGCA 5220

GgTAGCCGTA GGGgTCGAGT GCGCCAAGAT ACTCGCCGCG CTTTGCTTTG GTATGAAGCG 5280

TAGACTTGAT TTTCGTGGAA ATATCTTTGG CGTACATATC GTTCAGGATG TGCTTGAACG 5340

GCGTAATATC CATGCTGCTG GTCTGGTTCA GGGTGTCAAC GCCATCGTTC AGGGCAATAT 5400

AGCGCACATG ACGCTCAGGG AAGTAAATCT CGGTGTACTG CCCGCACATG ATGTAGTTAC 5460

GGCCCAGACG TGAGAGGTCT TTGGTAATGA CCAGATTGAT GCGCCCTTTT TCAATATCGG 5520

CAATCATCCG CTGAAAGCCG GGACGGTCGA AATTTGTACC TGAGAATCCG TCGTCGATAT 5580

AGGTGTCAAC AATCTCCCAG CCTTGTTTCT GCACATACTC GGCCAGCAAT GCCCTCTGGT 5640

TCCCAATGCT CATGGATTCA TGGTCGGCTC CATCTTCGGA AGAAAGCCGG CAATAGATAG 5700

CGGCTCGGTA ATTCTGTTGT yCyCTCATAA AACTGCtCCT wATCAGTCCG aGaAACAACA 5760

GGGTTTACGC TCGTCTGTCA TCTATAnTAT ACCACATTAA AGCGTTAAGA GTCAAGCTGA 5820

CAGCCnTn 5828

(2) INFORMATION FOR SEQ ID NO: 454:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1246 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454:





TCCTncrzcvn. 

A 1 ouu^ 1 VJ 


LLAL 1 1 AL Ab 


tp ta a p a &pp a 


PTTB ATA A pp 
L 1 1 AA 1-AAbb 


t AL 1 TAGACG 


60




TT Z\ 7i a A. TPTfl 
1 1 AAAAlb 1 A 


bbb 111 LL 1 L 


Al Al AAAb 1 L 


L1LAH X Ibb 


TTAATTGACT 


120 


A iti /- rn A A A A A A 




bi AiblAi Al 


I AAAAAbAb 1 


bL 1 AL AAbLA 


ATATTTTAAT 


180 


Lb 1 Ab 1 ILIA 


b 1 AAA 1 AULA 


A I AL I I 1 1 A 1 


/"* A A A 7\ rn rp /-< a T 1 

L AAAA I I LA I 


GTTTGGAAAT 


ATAATTTGAA 


240 


A AAT , A r P r PT"T"T' 

1111 


AAL 1 AbLAbA 


A A A A /"» 7\ rp rp m/""" 

AAAAL AT T T C 


TAALAAAAAG 


TATTGGCTCA 


AAGTAAAGAG 


300 


AiLu 1 1 1 1 b 1 


1 Ab AAA I b Ab 


A A A rp /"•/"» /-i TV 7V/™ 1 

AAAT b b C AAC 


ATAAAAGTTT 


TCTATTTTTC 


TGCAATTAAG 


360 


ArnrTT TP a a 
ALAb 111 L AA 


AAb 1 bAAT TL 


ACTAGGAATA 


TGATTCGCAA 


TTGAGTATTC 


AAAAATATGC 


420 


LL/rtl IbbblA 


bb 1 AAbL AA I 


rji rp fti m fri /*i /™i t\ 

1 IbTTCTGCA 


/""'/"'/""•'TV rn /"^ TV /■-»/"• 71 

CGCATCAGGA 


AATCATTATC 


GGTTAAGTTT 


480 


AAA 1 HI 1 1 1 i 


ill L,LAALbb 


A r*' A A 7\ /~» /"» TV 

AbAAbbALbA 


ATCLCAGAAA 


TGCTGACCCG 


CGCACTTTGA 


540 


nl 1 ] i 1 i bn 1 


bbAAl 1 b 1 i L 


/"*r"^ 7\ A A A f A T» 

LbAAA I Al A I 


rn /~» 7v rn tv 71 7\rp/^^ 

IbAl AAAlbb 


TV /* l 7l r'fTlfTKTir'Tl TV 

AGACTTTCAA 


TTCTTTCATT 


600 


o J. OrxriLj 1 1 V^o 


P&A.fTAAA.fT' 

bAAi lAAAbb 


AALbbbLA 1 L 


LAAbl bl A I 1 


i" 1 A ATA TifpTPTV 

LAATAATTGA 


GGGCACATCT 


660 




PT aa ppf* A TiT 
b 1 AAbbbAAl 


AA 1 I 1 1 b 1 Ab 


AL 1 AAL i L 1 1 


pmipri rp /-> rp a A T* 

L 1 1 L lb 1 AA1 


ACTTAATTTT 


720 


TCAGTCACCC 


AAGGCTCAGG 


TACTTCTGTT 


GCAAAGCCTA 


AAACCACGGA 


TTCAACTTTT 


780 


CCATCGTAGG 


TTGAGTAAGT 


GCCGGTTAAC 


GGAGAAAATT 


TAGTAATGGT 


TCGATTTTGA 


840 


AAGTCTTGAA 


CAAAGCTGCC 


CGAGCCACGT 


CGTCTGACAA 


CATAACCTTC 


TTGCACAAGC 


900 


AAATGTAAGG 


CTTTTGTAAT 


GGTTAGTTCG 


CTGACCTGAT 


ATTTTTCTGC 


AAGAAGACGA 


960 


CCTTGGGGTA 


ACTTTTGGTT 


AATTGGATAA 


ACACCTGCTT 


CAATATCCTG 


TTTTATAGCT


1020 


GCATAAATTT 


CTTTATAACG 


TGTATTTGTC 


CGCATCAGAG 


TCcTCCTTTA 


TCACCAGTAT 


1080 


TTCTCTCATT 


ATACCATGAA 


TTTAACGTGT 


TAAAACTATA 


TATTTATTTT 


TAACGTAACC 


1140 


ACTTAATAAT 


CAAGTACTTA 


AATAAAAGTA 


GTTGACTTAT 


TCTTTTTAAA 


ACTATATAGT


1200 


TTTAAAAGTA 


AACAGAAGAA 


CTTATTGATC 


TGATTGGTAC 


TTTTTG 




1246 



(2) INFORMATION FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

CAGTTTAGGT TGGTCAAGAG TnATTGGACA CCAGTTATTT GAAAATTGGT GGAATAAATT 60

TCTTCCCTGC TTCGTCATTA TCTACAGCCA AAACAAGtTG GACTTTTTCC GtTGTTAGTT 120 

TCGACTTTTC TAATTGTTCA AGAACGGTTG GCTTTTCTTC CTCTTTGACA TAAGAGCCAA 180 
TTTCATTGGC AATAAGCGTA GATACTGCGC CCTTTTTAAG TCCATTCGCA CAAAATAATA 240

CCGCATCTCC AATCGTCTCT TTTTTTAATT CGTAATAACT CATGAGATCT ATAGGCGCTT 300 

CAAAAACAAT GATTTTGATT GGCTTTTCTG ACGTAGCTTC TGCAATTTTT GGTGGATACC 360 

CTACACGAAC AGTCAACCCA TAATAGCCAT TTCCCCAGAC ACGCTTTAAC CGTCCTCGTT 420 

CGCCATGTAG CTTCTTATTT TCCCAAATAC CTTGTAAGGC AACGCCTTTA ATTTTTTCTT 480 

CAAGACCAAC GTGCTTGAAG ACTATTACTG GTTCACTTTG TCCGGTCTCT TTATCTGTAT 540

ACGTGGATTG TGCCATGAGG TTTTGTTCAA AAAAGAAATC AATCGTTTCT CTTGATAACT 600 

TACGTTCCTG AAGCAAGTAA TCAATGGTGG CATTTTGTTG TGTATGTTCT TTCATATAAT 660 

AATGAAAGTT CGTCGGAGTA GGTTCTTTCA ACGTTTCAAT AGCCCCTGCT TCTACGGTCT 720 

GTAAATACTG CAATGCTTCA GCGTGTGTAC ACTCTTTTAT CACTTGAACA AGCTTAATAC 780 

TCCCACCACC AACTTGGCGA GAGTTCCAAT AAAAAGAATT CTTCTTAGGG GTTAAAACAA 840 

AACTATCGTG CTCTGACCAG GTATAACTTT GTCCCTGTCG TACAAGTGTT ATTCCTAAAC 900 

TTTCTGCGAC ATCCACAATA CTTTTTCGTT CAATTcTcTA ACACTTTTTG TCGTtGTGCA 960 

TGTGTCATtG GCATTTtCTG TtCCTCCTTT TTCTATTTTC TGAAATAAAA AACGCACGTC 1020 

TTTTTAAACG TGTGTTTCTT CTTtCAGTTC ACTTGTAATT TCTCCATGGT CAATAATATC 1080 

kGTTGCCGAG ATTTTATCCA AAATAGCTTC TTCATCAGTC ATATATCGGC GCCACGTATA 1140 

CCACTTATCA TCAGTCGGTC CTGTTGCTAG TTCATCTGCT CGGTCATGCT GAAAAGCATA 1200 

GAATTTTTTA TCTTTAAAAA CATGCTCTTT TGATATAAAA ACCAGACACT CGTCTCCTCC 1260 

AATAAGCGCA ACTTCACTCC TGTCAATCAA ATCGCGCCCG TATTTATCTC GATTTTCGGA 1320 

TCCTCCACCT TGATTTCCTT TATTCATGCT ATGTTTCCGA ATAGATAATG TTTGTTTCCC 1380 

TGCACGCGCA GACAGATACT TGGTGGTTTC TTCTTCGTCT CCACCAAGGT ATAGTAAACA 1440

CGCACAGCCA TTTAAAATGT TTTGCCACCC ATTCCGATAA AGCGTTTGTA ATTGGTTCAA 1500 

GGACTGTAAA ACAATGGTAA ACGACATTTC TCGCTTACGA AACGTGGTCA ACATTTTCTC 1560 

AAAGTGAGGA ATTCGGCCAA TATTGGCAAA TTCGTCAATT AAGAAGCGGA CATGTAGGAG 1620 

TTCTTTTCCT TCTTCTAGTA CTCGTTTTCC TAAACGTACT TGGTCTGATT TATTTCGCAA 1680 

CACTTCTCCG ACTGTCGCAA ACATCAACGC TGCAATAAAA TTAAACGACG TATTAGTTTC 1740 

AGGTATCGCA ATAAACACCG CTGTTTTTTC TTCATTCCAG GAGTCAATAT CCATCGTGTC 1800 

TGTACGAATC ATATTGACCA CTTCTTCATG GTCAAAAACA GAGTAACGTG CAGCAGCTAT 1860

CCCTAGAACG CTTGCTCTCG TTTCACTCTC ATACAAATTG TTAAACAAGG TCCATTGTCG 1920 

GTAAGCGTAG TTGTCAGGAA TGGCCTCATT TAACTCCTCA AACCATTCTT CCACTGGGcT 1980

AGGTACATTT TyATCTTTTC GTTTTAGGTG ACGTAACATG TCCGCAATCA TTCCTAAATG 2040

TGGCGTATAA TCATTTCTGC GACCGTCAAA CCATAAATAC GCAATAAGAG AGCGAACCAA 2100 

TAAGCCTTCG GCTTTTATCC AAAAATCTTC CCCGGTCGTT TTTTCCCCTT CTTTTGTTGG 2160 
CTTCGGTGAC AGCTTCTAAA AACTCGGTCC ACGTCTAAAT CnGAGTGCAT A 2211

(2) INFORMATION FOR SEQ ID NO: 456: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 



TACCGATTGA 


ATTCATTGCT 


TCCGTTTGCG 


GAGAATGTGT 


CAACACATAT 


GATAAAACCG 


60 


TTTGTAAAAT 


AACTGATGCT 


ACAGCGAAAA 


CTTTTAAAAA 


ATCGCTGATA 


TTCAGATTTA 


120 


TTGGTTGTTC 


TTTACTCATG 


TACGTCCTCT 


TTCTTTCAAT 


AATTTAGCGT 


GTAAGTAAGC 


180 


GCTCGGATTA 


AGAACTTACT 


CACACGCTAT 


TCACGCTTCT


TTGATTTACA 


TTGCTAGCGA 


240 


TTCATAATTG 


GCTAACAAGA 


TGCCGCGTTC 


TTCAATCGCT 


TCTTTGACTT 


CTTGAGACGT 


300 


TAATATTTCC 


ACTTCACGAA 


TTCGAGGCAT 


ACAATACCCT 


GATTGATTTT 


GTAAAATAGT 


360 


ATCGATAAAC 


GCAGGATGAC 


AATTGATTTC 


AAAGACCTCT 


CCTTCACTAC 


AAACAACCAT 


420 


ATCAAGCAAT 


TGCAAGATGG 


TTTCAGTTGA 


AATCGCCTTG 


TCATAAAATT 


GGTACAACAT 


480 


TTCATCTGGC 


GTGCGAACAT 


CCTGATATAA 


CTCTAAATAG 


TCTTkTGTTT 


CAATACTTCT 


540 


TGAGGCGTTA 


CGTAAAGGTA 


ATTGATATTT 


GCGAGCAAGT 


GCCAAAGcTA 


CGCCCAATAA 


600 


CTTTTTATTT 


TTTCCGTGAA 


CATTATGATG 


AGAATCAATA 


TGATCAGGGC 


GTCTGCCACT 


660 


TTTCATAAAT 


GAAATAATTT 


GAGCATCCCA 


TTCGTTATAA 


ACCTCTTCTA 


AATTGACTTT 


720 


TTCTTCaAAA 


ATCGATTGAT 


GCCAArAATA 


ACCAGCkTCA 


TCTACTAAGG 


AAGGGACCAT 


780 


TTCTCGAGGT 


AAAATAGGTT 


TTGCTTGGTn 


CAATGTCAAT 


GTTAAA 




826 



(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 
TAAGCTAGAT GCGATAGAGT TATATTTAAC AACAGAGTTG TCCTATCAAG AAGTAGCTAA 60 

TCTTCTAAAG ATTCGCAATC CATCATTGAT TGCGAATTGG CTTAGAACGT ATCAGAAGTT 120 

TGGTATAGAG GGACTCTCTA AACAGAAAGG ACGTCCACCT ACCATGTCTA AAAAGAAACG 180 

AAATGAACCC CAACCACTAC CAAGTGAACG ATCTCAAGTA GAGAAAyTAG AAAAAGAAAA 240

TAGAATGTTA AAAAkTGAGA ATGCCTATTT AAAAGAGTTG AGCAGACyGC GTTTAGAGGA 300

CGAATAAAAA ATGGACGAAT CAAACGAATT ATTCATAGTT TCCAAGGACA ATTTCGATTA 360

AAAGTTATTC TGGAAACCTT AAGGTTTCCA AATATCACTT ACATGTACTG ACAAAAACGT 420

CTCGACAGAA CAGCTCTAAC ACAAATAATC GAAGAAGAAA TTCAAGTGAT TCGTAAAAAA 480

AACATAGGCA TTGTACAAGA ATTTGGATAA ATCTACCTTC CAACAAATTC TTAACACTAA 540

CTTAGTCACG AGTTAGTGAA AAGAGTATCA ATCTTAACAA AGTTATTCTT GAAAAGAACG 600

GGTATCTTAG TTGGTACCTG TTCTTTTTAT ATGATAGATA TTGAAATTTA AGGATAAAAT 660

CTTTAAAATA TGGATATCTT TTAAAGATTT TATAATTGAC CTTCTTATGA GAGATATCTT 720

GTTCAAAAGA AGGGAGCTCC TATCTTTTTC CAAGTTTTAA GTAGGAGTTT TTATATGTAA 780

TACTTTGCTT AGTTTAAAGC TATCGTATTA TATGAAGAAA ATACTTATTA AGGTATCaTC 840

aATTATGAAT GTAAAAAAGA AAACTATTTT TATAAATnAT TTATATAAAA AGTATTGCAT 900

TAAAAAATAG GTTGTTTTAA AATAAAGTTG TCATCATAGT ATGACTCTTT TCGATAGAAT 960

CTCCTATTGG AAACGATATG GATTCTTTAT TGAAAAAGAA TGTTGGAGTA ATGATTAGTC 1020

AAGTCCAGAC TCCTGTGTAA AATGCTATAC AATGTTTTTA CCATTTCTAC TTATCAAAAT 1080

TGATGTATTT TCTTGAAGAA TAAATCCATT CATCATGTAG GTCCATAAGA ACGGCTCCAA 1140

TTAAGCGATT GGCTGATGTT TGATTGGGGA AGATGCGAAT AATCTTTTCT CTTCTGCGTA 1200

CTTCTTGATT CAGTCGTTCA ATTAGATTGG TACTCTTTAG TCGATTGTGG GAATTTCCTT 1260

GTACGGTATA TTGAAAGGCG TCTTCGAATC CATCATCCAA TGATGCGCAA nTTTTGAATA 1320

TTTTGGTTGA TC 1332



(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458:

CTATATAATT A*\AGTATTTT AGATCTTAAC TAGGATCTAA AAACTGTATA TAAAAAACCA 60 

TCTATACTTT ACAAAAATAA TATTTTTGGA GTTATAAATA ATGAATGAAT TTTAATAGTA 120 

AATTAAATTT TGTTATACTT CAAGTTCTGG GTTGCATGTG CAGAATGTGC AGGTGGCTCC 180 

AGCAAATTAT wATwCmCCmC AmCAAaTCGT GATTGGTGGT GAGGTTGCTG CTGTTGATCA 240 

AGCGATGACA CTTCTCAAAG AAGCTGGTGT GAAGCGAATG ATTCCGTTAA ATGTGAGTGG 300 

CCCTTTCCaT ACGGCGCTGT TACAACCaGC ATCaAAAAAA TTGGCTCAGG ATTTAGCAAA 360 

ATTGAACTTT CAAACGATGC AAATTCCTGT CATTAGTAAT ACGACTGCCG AAATTATGCC 420

CCAAGAGGCA ATTCAAGCGT TATTGGAAAA GCAAGTCATG TCTGCGGTAC GTTTTGAAGA 480

CAGTATCGAA ACGATGAAGG CTATGAACGT AGGAACGATG ATTGAAGTTG GTCCAGGGAA 540

AACATTAACT GGTTTTGTTA AAAAAATTGA CAAAACAATT GAAATGCACC GTGTGGAAGA 600

TGTTGCAACA TTAACAGAAA CGTTAACAGC GCTTACTGGG AGGTAAAAAA TGGAATTAAC 660

AGGAAAAAAC GTATTTATTA CTGGAAGCAC TCGTGGGATT GGTAAAGCAG TTGCGTTAgC 720

TTTTGCGAAA GAGGGCGCCA ATATTGTCTT AAACGGTCGG AGTGAGATTA CGCCAGAACA 780

ACGACAAGAA ATTGAAGCCT TTGGGGTAAA ATGTATTGGC CTTTCTGGAG ATATTTCCGA 840

TTTTGATGCG GCAGGTGAGA TGATTCAAGC AACAGTTGAC CAATTAGGCT CGATTGATAT 900

TCTGGTGAAT AACGCTGGGA TTACGAATGA CAAATTATTA TTACGAATGA CAAAAGAGGA 960

TTTTAATGCC TGTTTAGATA TTAACTTGGT AGGAACTTTT AACATGACCC AGCAAGCGGT 1020

TAAACGTATG ATGAAACAAC GAAGTGGTCG GATTATTAAT ATGGCTAGCG TTTCTGGTTT 1080 

AATGGGAAAT GTTGGTCAAG CAAACTACGC TGCGAGTAAA GCCGGTGTCG TTGGTTTTAC 1140 

TAAATCGGTG GCACGAGAAG TTGCGCCGCG CGGTATTACC TGCAATGCGA TTGCACCAGG 1200 

GTTTATCCAA ACAGAAATGA CGGATGTTTT ATCGGAAAAA GTTAAAACAC AAATGAATGC 1260 

GCAAATTCCC TTACAAACGT TTGGGCAAGT CGAAGATGTC GCAGCTACAG CGATTTTCTT 1320 

AGCTAAAAGT CCCTATATTA CTGGGCAAGT CGTGAATGTC GATGGCGGCT TAGTCATGCA 1380 

CGGATAAACG AAAGGAGTCA AAAGGATGAA TCGAGTAGTT ATTACCGGTT ACGGTGTCAC 1440

CTCGCCGATT GGCAATGAAC CTGAGACTTT TTTAGAAAGT TTGAAAACAG GGAAAAATGG 1500 

TATCGGACCG ATTAGTAAAT TTGATGTTAG TGAAACAGGT GTCACCTTAG CAGCTGAAGT 1560 

GAAAGACTTT CCAATGGAAA AATATTTTGT TAAAAAAGAT GGCAAACGCA TGGATAAGTT 1620 

TTCGTTATTT GGCATTTATG CGGCCTTAGA AGCAATGGCT ATGAGTGGCT TGGATACCAG 1680 

TCAACTAGAT GTGGATCGGT TTGGCGTTAT GGTGGGTTCC GGAATTGGTG GTTTGGAAAC 1740 

AATTCAAAAT CAAGTAATTC GGATGCACGA CAAAGGACCT GAACGTGTGG CACCGTTATT 1800 

TATTCCAATG GCGATTGGAA ACATGGTGGC GGGAAATATT GCTTTACGTG TTGGTGCCAA 1860

AGGAATTTGT ACTTCAACAG TGACAGCTTG TGCTAGTGCA ACTCATTCAA TTGGGGAAgC 1920 
GTTCCGCAAT ATCAAACATG GCTACTCCGA TGTCATTATT GCTGGTGGAG CAGAAGCGCC 1980 
TATTACAGAA ATCGGTATTT CAGGATTTGC TTCCTTGACC GCTTTAACAA AGGCGACTGA 2040 
TCCaGAAAAA GCCTCAATTC CCTTTGATAA AGAACGTAGT GGATTTGTGA TGGGAGAAGG 2100 
AGCGGGTGTC TTTATTTTAG AATCATTAGA TCATGCTCTA GAACGAGGCG CAACTATTTT 2160 
AGGTGAAGTG GTTGGCTATG GTGCTAATTG CGATGCCTAT CATATGACAT CACCAACACC 2220 
AGATGGGAGC GGCGCAGCGA AAGCCATGGT ACTAGCAATG GAAGAAGCGG GGATTTCTCC 2280 
AGAAAAAATT GGCTATATTA ACGCCCATGG AACAAGCACA CAAGCCAATG ACAGTGCAGA 2340

ATCTAAAGCA ATTGAATTGG CTTTAGGTGA TGCCGCAAAA ACGGCGTATG TAAGTAGTAC 2400 
AAAGAGTATG ACGGGACATC TCTTAGGCGC CGCTGGTGGA ATTGAAGGAA TTGCTACCCT 2460

AAATGCTTTA CAACATCAAT TTATTCCGCC AACCATTAAT GTGGAAAATC AAGATGAAGC 2520

AATTACGGTG AACGTTGTTT TAAACGAAAG CAAAGAGCAT AAATTTGaTT ATGCTTTAAG 2580

TAACTCGCTA GGCTTTGGTG GCCaCAATGC AGTTATTTGC TTAAAACGCT GGGAGGATTA 2640

ACATGCAGTT AGAAGAAGTA AAAGCACTAT TAACACAATT TGATCAATCA ACTTTAACCG 2700

AATTTGATTT ACGAGAAGGC TCGTTTGAAT TATATATGAA TAAAAATACA GTTTCTGGGC 2760

GAAGTGCCGT AGAACCAGTG GCGCAACCAC AAGAAACACC AGTAGCCGCT AGTGGTGTGT 2820

CAGTGCCTGT TGAAACAGTT TCTGTCGTTG AAGAAACGCC AACTAATACA CCGACAGCCA 2880

ATGAAAAGAC AGAAGAAATT ACGTCACCTA TTGTAGGAAT CGTTTATTTA CAACCAGCAC 2940

CAGATAAAGA GAACTTTGTC AAAGTAGGAG ACACTGTTAA AACGGGTGAC GTTGTGTGTA 3000

TCGTTGAAGC GATGAAaTTA ATGAATGAAA TTACGGCGAC AGTTGACGGT GTGATTACTG 3060

AAATATTAGT CAACAATGAA GATGTGGTTG AATTTGGTCA GCCGCTATTC CGAGTTGCTA 3120

AAGGAGAATA AAACATGAAA TTAACaATTA CAGAAATTCA AGAAATTTTA CCACATCGCT 3180

ATCCCTTTTT ATTATTAGAT AGTGTTGAAG AAGTCATTCC CGGGGAACGC GTTGTAGCGA 3240

AAAAGAATGT AACGGTTAAT GAGCAAGTTT TTCAAGGTCA TTTTCCTGGG AATCCTGTTT 3300

TGCCAGGAGT TTTAATTATT GAATCGTTGG CTCAAGCAGG CGCCGTGGCA TTACTTTCAA 3360

TGCCTGAATT CAAAGGGAAA ACGGCATACT TTGGTGGATT AGATAAAGCA AAATTTAGAC 3420

ArAAGGTAAC ACCAGGCGAC ACATTAATTT TAGAGGTTGA ACTTTTAAAA GTCCGCGCTT 3480

CTGCTGGAAT GGGTAAAGGT GTGGCAAAAG TTAATGGAAA AAAAGTTGCC GAGGCTGAAT 3540

TAACCTTTAT GATTGGATAG GTGAAAAATG TTTTCGAAAG TATTAATCGC AAACCGCGGA 3600

GAAATCGCTG TTCGTATTAT TCGTGCCTGT CGCGAATTAG GCGTTCAAAC AGTTGCTGTT 3660

TACTCTGAAG CGGATCAAGA AGCGTTACAC ACACAGTTAG CAGATGAAGC CATTTGTATT 3720

GGACCGGCGA AAGCAACTGA TTCTTATTTA AATGTTCAAG CTGTTTTAAG TGCAGCGATT 3780

GTTACCAATG CTGAAGCGAT TCATCCAGGG TTTGGTTTCT TATCTGAAAA TAGTCAATTT 3840

GCTTCAATGT GCGAAGAATG TAACATTACT TTTATCGGTC CTAAAGCAGA AACGATTGAT 3900

GCAATGGGCA ATAAAATAAA CGCACGTCAA CTGATGCAAA AAGCCAAGGT ACCTGTTATT 3960

CCTGGGAGTG ATGGCGTCAT TGATTCTGTC GAAGAAGCGT TGACGATTGC TGAAGAAATT 4020

GGTTACCCAG TGATGTTAAA GGCAGCTGCT GGTGGTGGCG GAAAAGGAAT CCGAAAAGTT 4080

CTTTCAAAAG AAGAGTTACC GAAACACTTT ACTTCAGCGC AgcAAGAAGC CAAAGCCGCA 4140

TTTGGTAATG ATGATATGTA TTTAGAAAAA ATTATTTATC CAGCACGTCA TATCGAAGTG 4200

CAAATTTTAG GTGATCAATA TGGGCATGTG ATTCACTTAG GGGAACGTGA TTGCTCACTA 4260

CAAAGAAATA ATCAAAAAGT GTTAGAAGAA TCACCATCGA TTGCTATTTC TGAAGAAAAG 4320

CGTCAGATGT TAGGCGAAAC GGCCGTTCGA GCGGCACAAG CTGTGCATTA TGAAAATGCC 4380

GGAACCATTG AATTTTTGAT GGATCCAGCT GGCGACTTTT ATTTCATGGA AATGAATACC 4440

CGCATCCAAG TCGAACATCC AGTTACGGAA ATGGTAACAG GGATTGATTT AGTCAAAGCA 4500

CAATTAGAAA TTGCTTCAGG TGAACCTTTA GGCTATACCC AAGAGGATGT TATCATGAGT 4560

GGGCATGCGA TTGAGTGCCG AATTAACGCT GAAAATCCTG CCTTTAATTT TGCTCCTTCA 4620

CCAGGTAAAA TTCAAAATCT CTTGTTGCCT AGTGGTGGAA TGGGCTTGCG TGTAGACAGC 4680

GCTATGTATT CAGGGTATTC AATTCCGCCT TATTATGATT CAATGATTGC AAAAGTCATC 4740

GTACATGGTG AAAATCGTTT TGATGCCTTG ATGAAAATGC AAAGGGCTTT AAATGAAATT 4800

GTAACAGAAG GTATTATTAC GAATGCCGAA TTCCAATTAG ATTTAATTAC ACATGACAAT 4860

GTTTTGACAG GTGACTATGA CACAAGCTTT TTACAAGAAA CATTTCTACC AAATTGGGAG 4920

CCAGAAAGCA ATCATTAAAG AAAGAACGCC GCAAGAGCTG TGCTCTTGCC GGTAAAATAC 4980

GAGGAGTAAG CAAATGUCAT TATTTAAAAA GAAAAATTAC ATTCGCATTA ATCCAAATCG 5040

TGCTCATGCA AATGATGCAT CAAAGAAGCC TTCTGTTCCA GATAATATGT GGGCAAAATG 5100

CCCTTCTTGT AAGCGAACAC TGTATACAAA AGAAATGGGT GCAGAAAAAA TCTGTCCGCA 5160

TTGTGGCTAT AGTTTTCGGA TTGGTGCATG GGAACGATTA GCGATAACCG TTGATGAAAA 5220

AAGTTTCCAC AATTGGGACA GTGAATTAGT GACAAAAGAT CCACTAAATT TTCCTGGTTA 5280

TCTTGAAAAA ATTGAAAAAA TGCAAGAAAA AACTGGGCTA GATGAAGCTG TCTTAACGGG 5340

AGAAGCAACA ATTGAAGGAC AAGCTGTTGC AATTGGAATC ATGGACGCTA ATTTTATCAT 5400

GGGCAGTATG GGAACGATTG TTGGTGAAAA AATCACACGC TTGTTTGAGC GGGCGACAGA 5460

AAAgCATTTA CCAGTAGTGA TTTTCACTGC ATCTGGTGGT GCCCGTATGC AAGAAGGAAT 5520

TTTTTCATTG ATGCAAATGG CGAAAATTTC GGCCGCTTTG CAACGGCATA ACAAAGCAGG 5580

CTTGCTGTAT CTTACGGTAT TGACTGATCC AACGACTGGC GGTGTTACCG CAAGTTTTGC 5640

GATGGATGGC GATATTATTT TGGCAGAGCC TCAGAGTTTA ATCGGTTTTG CTGGCCGCCG 5700

TGTAATTGAA CAAACGATTC GTCAAGAGTT GCCAGATGAT TTTCAAAAGG CCGAGTTTCT 5760

TTTAGAACAT GGTTTTGTAG ATCAGATTGT GCCAAGAAAT CTTTTGCGTC AACGATTGAG 5820

CGACTTGTTA CGTTTACATT CTTTGGAAGG GTGGCGCTAA CATGGAAAAG AAAACAGCCA 5880

ATGATGTTGT TACCTTGGCA AGAGCCCAAG ATCGTTTGAC AACTTTAGAA TATATAGAAG 5940

CAATTTTTGA GGACTTTTTA GAATTTCATG GGGATCGTTA CTTTGCCGAT GATTTAGCTG 6000

TAGTCGGTGG TGTTGCTACT T 6021

(2) INFORMATION FOR SEQ ID NO: 459:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2813 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:

TTTTTTAAAC CGAACAATTT TGGCAAAAAA CGAAAAAAGA ATTAACAGAG GGGACACGAT 60

TTATCGCAAT TGGTTATGGG CCCAATATTG GTACGGCGAA AGAGTTTGAG ACGAAGTTTA 120

CAGAAACAGT TCGCTTACCT TCCCAAGGAT TtGAAGTCGA AGCCTATATG CATGGGCCTT 180

ATCTTGAAGC GGATAATACA CATCTTTTGT TTTTTATTGA AAATCAAAGT CCTATTCAAG 240

CACGTAGCCA AGCCTTGAAA AATTATCTAG CCAGTTATGT TGGCCAATGT TACACAATTA 300

CAACGCGTTC AACGAGAGCA TCAAATACGT TAGATTTAGC AAGTGATGTC GAAGAAAAAA 360

TTAGTCCTTT ATTGTTAGTT ATTCCATTTC AATACTTGGC GCATCAAGTA GCTACAAGTA 420

AAGGAATTGA TTTATCTCAA CGGATTTTTG ATGATTTTGA TTGTGTCCTA AAAAGTAAAA 480

TAGCAAAGAA AGAGGGAAAT CAGCCATGTT AAAATTTAAT GAAGAACAAC AAATCAAAGA 540

CATACGAGGT GCCTTAGCCT TGCGTTCAGA TGTTGaAAAA GTTGTTGaTC AAGTTTGGAC 600

GAAAGGATTT GATGCGGTTT ATTATATAGG AATCGGCGGT ACTTATGCTT CaGCCcATGc 660

AAGCaGTTaC TTATGCGaAT GGAAAAAGTA ATTTACCAAT CTATGTTCAA CATGCTGCTG 720

AGTATTACAC AACAGGCAAT AAACGTTTAA CAGAAAAATC ATTCGTCATC TTGTCTTCCG 780

TTACTGGGAC AACACAAGAA GTGGTGCAAG CCGTTCAAGA AATCAAAAAG GTCGGCGCAA 840

CGCTATTTGG TTTTATTGAT GCAAAAGATA GTCTTTTAGC AGATTTATGT GATTATGTGG 900

TGACTTATCC AGCTCCAGGA ACTGAGCAAA TCAAGTTCTT TATGGTAGCT GATCGTTTAA 960

TGCAAAAAAA CAACGAATTT ACAGATTACG AAAATTATTA CCAAGAATTG GAAGAATATT 1020

TGCCTACAGG TTTAGCTGAA GCCGAAAAAA GTGCCGATGC GTTTGGCTTA GCTTTTGCGG 1080

AAAAACATCG TCATGATAGC ATTCACTATT TTATTGGCGC AGGGAACCAA TGGGGAGCGG 1140

TTTATTCTTA TGCAATGTGC TATTGGGAAG AGCAAAGTTG GCTTCGTTCA AAATCAATTC 1200

ATGcAGcAGA GTTCTTACAT GGaACGCTAG AAATTGTAGA TGAAACTACG CCAGTAACGT 1260

TGTTTATtGG AGAAGACGAG CAGCGTCCGT TGGcAGAACG AGTAGCaAAA TTATTGCCGA 1320

AAATCTGTGC CAATTATACA ATTATTGATT CGAAAGcGTA TGAACTTCCA GGrAtTAGTG 1380

AAAAATATCG TGGCCGAGTT GTTTCCATTT CTCCCTAATG CATGCGGTAA CGCAACGCAT 1440

TGATGCGCAT GTAGAAAAAT TAAATTGTCA CCCATTAGAG ATTCGTCGTT ATTATCGTCA 1500

ATTTGATTAC TAAGACAAAA TAGCCTCTTT TTCCAGCAAG CCTGTGGTAG ACTGAAGAGA 1560

GAATACTTCA GAAAGTAGGA AAACCAATGA CAAAACCGAA AATCTATACA ATGACCTTTG 1620

CCAGTGTTTA CCCGCTATAC ATCCAAAAAG CAGAACGTAA AAATCGAACG CAAGCGGAGG 1680

TTGACGAAAT TATTTACTGG TTAACTGGTT ATGATGAAAA AGGCTTGCAA CAACAATTGG 1740

CTCAAAAAGT AACGATGGAA ACATTTTTTG AAGAAGCACC CAATTTTAAT GAAAACGCCC 1800

AATTGATCAA AGGCGTAATT TGTGGcTATC GCGTGGAAGA AATTGAAGAT CCTTTAATGC 1860

AArAAATCCG TTACCTAGAT AAATTAATCG ACGAATTGGC CAAGGGAAAA GCGATGGAAA 1920

AAATTTTAAG AACAkGAAAA cAGGGTCtCa AkGAGACCCt GTTTTCaTGc yCcCtTGTAC 1980

ACGGAATTTA ACTATTTCCT GTATTAATGA AAGAGGTAAC TCTTGCGTGT AGGGAAATTG 2040

AATGGCACCT TTTGAGGTTT TAAAAGGGCG TAATTCTTTG CTGAAACGTT GGATGGCTGA 2100

TGGCGTAGGA TAGAAACCTA AATGTTGCTT AGCCGCCGCA AAATGCACTA AATTGCCATT 2160

TAAGTAAAAG GTCGGCATTC CATAAGCGAT TTTTTCAGTA GTCTCCTGAG GAACAACTGC 2220

TAAAATTGCT TGGTAGATTT CTGTTAACTG TGTTTGATGT TGGGGTTGCT GGCGAATATA 2280

GTTTG.CAM.G ATGGTCATAT AAAAACACCT CCGTTTGTTT CAGTATACTA TGTAAAAAAT 2340

AAAAGGCGAA GGAGATACAA AAAAACCGTC TAGTATkTTT TCTTTACTAA ACGGCTTTTA 2400

GCTTAAATAA CTACCTAAAG GGTCTTGCAT GGAAATGTTT GTCACkCCTT AGTTTAATTG 2460

TTTTAAGTGT GCAATTGCTT CTTCAATTGT GTGACCAGTT GCTGTTTTAC TTTCATTTTT 2520

kGCATCAAAA ACAATGAAAA TATTTTTGTC TGTGTCTAAT TGAATACGGT ATTGAATGTT 2580

TTTTACTTTG TAAGTCATAG TTATTCTCCT TTCATAAAGT ACTCTAAAAA GGCGACGATA 2640

ACTGATAACG AAATTCCATT GTTACTATAC CACTGGAAAA TTATTTTTGC AACAAATAAC 2700

ATTTTTTCTT TACCATTCCT TAGACTTCTT TGCTAGCTAC TTCTTTACAT TCTTTTTCAA 2760

GAAGCATAAA ATAAATACAT TACTAGTTTT TGAATAGGAG ATCGATGTGA AAA 2813

(2) INFORMATION FOR SEQ ID NO: 460:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2818 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 

CACTTCTTCC TGTTGAATGA GTCGCTTAAG CGCCAATGTa GGTCACGCCm CTACTATAaG 60 

CAtTACATTA GGTAAGACAC GATACGAAAT CGTTTGATCa TTGATGACAT TGCTTTCAGA 120 

AATAATGATT CCATCAGAAG TGACCGCTTC AACAAAAGCA ACATGTCCAT ATTGATTATC 180 

TGCGCCAGCA ACTCCTGGTT GGAAACTAAT CGCACGACCA GCTTTAGGCA ATGAACTAAC 240 

TTGATAGCCT AATGCACGTC CTTTTCTTCC CCAGTCACTG CCATTTCCCA TAAATTCATC 300 

CACAGAAGTG CCCAACTGTT TGAAACGATT ATAGACATAC CACGTACATT GGCCCaCGGG 360 

ATAGCTACCA CTTCGATTAT AATTGATCCC ATCATAGACT GGAAAATGCa TTTGTTGcTG 420

tTCCGCTTCC GACAGyTTAT TTTTAGATTG GATAATCAAA CCGTCCACTG TTTTTGGTAA 480

ATCAAATTGG GTTAAGTTAT AAACCGCAAT TAACGAATTT AATTTATTAT CGTATTGCTT 540

gTCTGTTGCA TATTTACCTG TTAAAAAACG CGTTGCTTGA AGGTAGTTTT TGGCTTCCGA 600

ACGCCACGCT GGTTTATAAA AATCTTGATT TCCGTCAATC CCTTGTCGGA GTAATTGAAC 660

ATAATCTTGC AGTGAGTCAT TATAGCCACC GTAATCGCGA AACCCAGCGG AAATTGTATA 720

AAGTTGGCCC CTCTGgATCA GCCTCTTTTG TTGAAAAGCT AACACTATTC CTGGAACGAA 780

CCTTTCACGC CAAAGAGATa TAATAAGGTT CTTTGGCTAG TTGACTTTCG CCAGAGCCAC 840

TTTCTAAAAT AGCCTGAGCT ATCATTACTG ATGCAAAAAC ATCGTATTTT TCGCCAAGTT 900

CTCTGGCTGT CTCACCAATT TTTTTAATAA ACGCTGTGGT TTGTTCTGAC TTCTGCATTT 960

CAAATGACGC ACCATATGTT GCTAAAACTT CTTTGCCATA CTCCGTTAAT ATGAACTCCT 1020

TAAAAGGATT AGTCACTACC AAGCGTTTTT CCTCTTTTGA GTCTGGTTGT TGTTTGGCAG 1080

TATAGATATT CTCTAACTGA GCGATTACTT CTGGTTTCTC TTTAGAATCC GATTCCGCTG 1140

CGGCAATTAA ATATTTTCCT TGGCCArGAT AAACACCATT TTGAAGCGTT TTTCCTTCAG 1200

CTGTTTGCCA AAAAATCAGA TCACCAGGTT CTGCCTCCTC TATAGAACGA ACTTTACCAA 1260

ATGTCATTTG TTCTTGCGGC GTACTGCCAA TGTCTATTTT AAATAATTGT TGGTAAATTT 1320

TTTGGGCAAG GAAACTACTG GTCAATGCTT GCTCCGTCAA ATTAAATTCT TTTTTTGTTT 1380

TCCCAACGTG TCTTAATGCT TCAGCGACCA CGACTGCCCG TTTCTTTTCG CCAAAAGACG 1440

ACAACAGTGG AAGTTCGTAG CCTTTTAAAT TTGAAGAAGC TATTTCATCT ACTTTGAgGG 1500

ACGGATTTAA CGATACAGCC TGATTTTCTT CAACAAAGTT ATTGTTGGCT TCAGGAACTT 1560

GgAATAGGTT CTTCGGGTTC TTGCTTGGtC ACAGGAACTT GTGGtGGtTG CGATTGTTcA 1620

ACAGGCTGCG TGaTTTCTGa TGATACCGTT GGcTCTGTAG GTTCTTTTTT TGTTTCAGAA 1680

CTAGTAwTTG TGGTTGTCGT TTCTGTCGTT gACTGACTCG TTGATTCGGT AGTCGTATCG 1740

GTAGAACTAC TTGATTCAGA TGACGAACTA CTAGTGGACG TTGTCGATGT AGCAACAGTT 1800

GTTTCGGTTG TTTCGGTTAC TTTTTCTTCC GAAGTAATTG TCGTTTCCGA GGACAGAGCT 1860

GTTTGTAGCG AGCTACTTGG TAATTCTGCA GCCAACGTAA CAACTGGAGC GACTTCCAAA 1920

CTCAGAAGAA CGAGAATACT TGTTAAAACA AGTAGTTGAT TCTTGATTTT CTTTCTTTTC 1980

ATTCGTAAAA ATCCTCTCCT ATTACTTACT TGCGCATTTG CACACTGTTT TTGTACAAAT 2040

GCGCCTTGTA AGTTAATGAT ACGCTAATTT TGCTCTTTTT TATTTTTGAA GAATAATTTT 2100

CCTAAACCAA TACTTACTAT CAATAGAAAA CCACTGATTG ACCAGATTGG CGAAACAACT 2160

TCATTTGCTT TTGGCAAAGT ACCAGAAGTG GTGGTTTTCG TTGTCGTTCC ACCTGTTGCT 2220

GTGCCCTTTC CTTGGCTGGT TACACCACCA GTACCGCTAA CCGTGCTATG ATTTCCATTG 2280

TTGCCTGTAG AGTTGCTACT GATTGGCTCT TCTGCCATTG CTTCAATGGC TTGTGATAAA 2340

ATACTTAAAC ATTTGTCCAC AGCTTCTTGC GTCGCATTTG GGTCCTCAAC AAGTGCTTTT 2400

GCTTGTTTAA TTGCTTCAGC TAAAGCTTCT TGCGATGCTT TTGTAAATTG TTTTCCTGCT 2460

TTAGGTTTCA AGGTTTCTGC TTTTTGAAGC AGTTCTTTCA ACTTTGTTTT ATCCGTTTGA 2520

AGGACTTGTT CCTCTAAACT ATCCAAAGCA GTTTTTACAT TTTTTTCAGC AATATCAATT 2580

TGTTCTTGCC GAGCATTTTT ATCAGCTAAA ATCCCTTCCG CTTGGTCAAT TGCTTCAGTT 2640

AAGCGCGCTT TAGACTCGCT GGTAAATTCT TTTCCGGCAG ATGGTTTGAT TGCTTTTGCT 2700

TTGGCTATCA TTGCCTCTAA GGCTTTTTTA TCTGTAACTA ACACTtCTTC TTGTAACCCA 2760

TCGATGGCAC TAGTTAATTC AGCAACCGCT TCATCAATCA TTTTTgATTA GCTGCTGG 2818



(2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461:

ATATTCTTAT GGTCACTTCT GCCACAGCGG AATATTGTAA TAGTTAGAGC GCATTGCACT 60

CACCAAATTA GTTTACCACA GACAAAATAA GCTGTAAAGA GAATTTTCTT GAAAGTTCCC 120

TTATTTTTGC TCTTCTTGTT TAATTTCCTC TAATAACTTC AACATTTCAG GCGTCATTTC 180

TACTGATTCC AACATGTCAG GTCGACGTAA AAAGGTCCGT CGTAATGATT CTTTTAGTTG 240

CCATTCTGCA ATTAATTTAT GGTTACCAnT CGTTAGCACG GCAGGGACTT CCATATCGTT 300

GAAAATAGCA GGTCTGGtAT ATtGTGGATG CTCTAATAGG CCAGTGGaAk GkGAATCCgT 360

TTGTGcAGAA AgATkGTtAC CTAAgrCaTC aGGTAATAAA cgGACTGTCG CATCAATCAT 420

GACCATTGCG CCTAATTCGC CCCCTGTTAA AACGTAATCC CCCAATGAAA CTTCATCTGT 480

GACCATCGTG CGAATTCGCT CATCATAGCC TTCATAATGT CCACAAATAA AAACTAAATG 540

TTCTTCTTGT GAAAATTCCT CGGCCATTTT CTGATTAAAC GGTTTTCCTG CTGGATCTAA 600

TAAGATGACC CGTTTTGGTT GCTGGTTTGT CTCTTCTTCA ATTGCCTTTA GGTTATCATA 660

GATCGGTTGT ACTTTTAAAA GCATCCCCGC GCCACCGCCA TACGGATAAT CATCGACCGT 720

TTGGTGTTTG TTATCTGAAA AATCGCGAAA ATTAGAGACA TTCAATTCTA ATAACTGCTT 780

TTCCCGAGCT TTCCCAATAA TTGATTCCCC CATTGGtCCT TCAAACATTC TAGGAAAAAG 840

AGTTAATACA TCAATTCTCA TCGTCAATCA ATCCTTCTGG AATGTCCACA TCAATCCGTT 900

GATTTTCGAC GTCGACTTTT GtAACAACGC CTTCAATATA CGGkAAAAGT AAATCtTTTT 960

TCyTTGGyCG TTGGACGACC CACACATCAT TCGCACCAGG AGATAAAATT TCTTTAATTT 1020

TTCCCAGTTC ACTGCCGGTT TCATCGTAAA CAGTAGCACC AATGATTTGG TGATAATAAA 1080

ATTCGTCCTC TGCCAATTCA GCGAGATTGT CTTTTGCAAC TTTTAAAATA CCGTCACGAT 1140

ATTTTTCAAC ATCATTGATA TTTGGATGTC CTTCAAAGCT TAAAATGTCA AAATTCTTAT 1200

GCTTGCGGTG GGACTTCACG ACCAATTCAA TTGGTGCTTT TTTCTCTTGA AATAGTAGTA 1260

ACATAGCCCC TTTTTGATAA CGTTCTTCTG GAAAATCCGT TTTCGAAATG ACTCGAACTT 1320

CTCCTG 1326

(2) INFORMATION FOR SEQ ID NO: 462:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462:

CAGAGAAATT ATTTTTCTAA TTACTGGAAA AATAGTTTTT CTTTTTTTGT GTAATCAAAG 60

AATCAACCAA TTAATTCATT GACACAAATT TTAAATGGAA CGATTTCTTG TACATCTAAA 120

GTAGTTTTAG AAAGCATTGT AAAAAGCAGT TAAAAGCAAA CTTATGAGCT CACTCTTACC 180

TGCTTCAATT TATTTCTATA GATTAGTAGT TGCTCTTACA ATTCTTCATC AATGAGTTTA 240

GTCTTGTTAT TCTCCATCAC CTTAAAAAAT GGATAGTAAA TAGCAGCACT AATTAGGATT 300

AATACAAAGC AAAAAACAAT ATTTCGCCAG TCCATACTTG AGAGATATGC TTGTGCAAAA 360

AATGGTGTGA AAGATGGGTC GATGATAAAT CCAACGCTAA TCAGTCCAAC ACTCTGCGCA 420

ATATACGTCA ACAACAATGA AACAATTGGT GTAAGCAAAA ATGGAATAGC TAAAATGGGA 480

TTGAAAACAA TCGGAAGCCC AAATATGACC GGTTCATTGA TTGAGCATAT CCCGGGAATA 540

AGGGAAAGCT TTCCGATTGT TCGATATTCT GAAACTTTGC TACGTATCAT TAAGAGTACT 600

AAACCAAGAG TTCCACCCGC ACCACCTAAA ACCGATATAC GGAACATTTG TAGATTCATC 660

AGATGAGTCA TTGCTTCTTT ATTTGCCATC TGTTCTGCAT TCAAACCTGT TTGTGCCATT 720

CCTAAAGTGA AGACGATTGG SaAAATGATA GATGAACCAT TGATACCGAA TAACCATAGA 780

ATGTTTCCCA AAGTGACAAT CAATAAGAAT CCACCAAGGC TACCAGCAAT ATTTGTAGCA 840

GGCGTTAAAA CCTGCATCAC TGTTTCTGGG AAAATCTTTT TCGTTGTCGC TAAGAAAATC 900

AGATTGATGC CATAAAAAAG AATGATGTTT GTCAACAGTG GAACTAAAGT ATTGATGAAA 960

GTAGCTACCA TAGGAGGGAC AGTATCTGGT AATTTTAACT TGATGTTTTT CACTTCAAAC 1020

ACTCGATTGA CTTCAACAAC TAATAACCCG ATGATGATAG CTACGAAAAG GCCATTAGTT 1080

CCCAAATAAT TTAAGTTtAT TGTTCCTTTT GTCACCGGGG TACATACCAT CAGGTAAATG 1140

ACAGAAGCGA CCATACCATT CATAGATGGG TTGGTTTTGT ATTCATTTGC AAGAGAGAAC 1200

GCAATACCGA AGACACTGAT TAATCCGATG ATCCCCATTG TCAAGTTGTA TGGCGCAGTA 1260

ATCAAGTCAT AATTAGCGAC TGCAAAATCC TTCCATCCAG CCATAATTTT CATAAAGAAA 1320

TTAGCGGTCT CTGGATTATA ATGATCAATA TTTATTGGTG GATTTGCAAA AATCAGAAAA 1380

AAAGAACCAA TAACAATAAA CGGTAATCCA AACATCATTC CACTAGAAAT GGCTTTAAGA 1440

TGTCGCTGAT TTCCGATTTT TTCAGCAATC GGGCTAAGTA TCTCATTCAA TTTTGTGATA 1500

ATCTTAGAAT TTCCCATAAG TATTCCCCCT TATACTTGCC AAGGATTAAA AACAAACGTA 1560

CTTGGTACGA CATGATTCGG ATCATATTTC TTAATAATTT CTTCATATTG TTCTTTGTAA 1620

TCATTCTCTA CTTTTGCTTG CGGGATAACT TTACTTGCTT CATGTAAGAA ATGCTCTGAT 1680

CCTCGTCCCA TCTCCTGACC TCGTGTCGTG TGTTTATCAA GAGCGATATC TGGAATTTCA 1740

GGGACATATC CCATAGCAAA GCTTTTAATT ACAATATTTT TTAAAAGATC TGAGGAACGA 1800

TCTTTCTCTG ATCCGCATAA GTAGCGAATG GCATGAAAGA AAAACATCGC ACGATCTGAT 1860

TCATTATATT GAAATTCCTT ACGCATCATA TTCAAGTTAT TGATCATCAA TGCTGCTATA 1920

GGATTTCCCA TACCGATATC CTCAACACTG ATTGCCAACA AACGTCTCCA AAGCTTTTCT 1980

TCAAACTGTG GTGAAGTAAT ATACATCTCG TAAGCAAATT CGCAAGCGTC TTTTTCTTTT 2040

CCTCTACGGA TAGATTTTTG TAATGCAGAA ATGACTTCGT CACCAGCTAA ACCGTTTCGC 2100

GTAGTCATCT TTGCCCAAGG ATCTTGAATA AATTTTTTTT CTGTCATGTT ATAATACTCC 2160

TAACTAAAGA ATTTGAGGTG GACCTTATGG ATACTGAAAT GCTAATCGAT AAATATCAGT 2220

TGAATGATTC AGAAGCTGCT GTTTTACGGT TTATGGATAA AAACCGTGAC TGCTTCAAAA 2280

GTCTTGGAAT TCGAGAAGTA GCAAAGGCTT CTTATGTCTC AACTACTGCA ATCATCAATA 2340

TGGCAAAAAA AATCGGTTAT TCTGGTTATA GTGAGCTGGT CTTTGCCTAT GGAAACAAAC 2400

ATTCATCAAT TAGCTTTCCT GAGAATTTCT CAAACGAAGA AAAAGATGAA TTTATCCGAT 2460

TATTATCTCG CTATCGAGAA AAACGGATTA TGGTTCTAGG CTCGGGTTTT TCACAAAATA 2520

TTGCAAACCA TTTTTCTGAA ATGCTTAACC TTTATGGATT TCGTGCAACT GCCAACAGTC 2580

ATTTAGAATT CCTACGAGAA AATGTTGAAA AAGATGTCCT GATTTTTATT GTTTCCAACT 2640

CTGGTGATAC TCTTCGGTTA GCAGAACTTG TTAAATTAGC AAAAAATCaT CATATTGATC 2700

TCaTkGCTTT CgTTGGtGaA AAGAAATCCA AAATTGGACA ACTTGCTACA CTAACTATAA 2760

GCaCgGAAAC mtwTaAtCCy CaGGkAGtTT CAGAATATCA ACCAAACCTG TTCTTTGGGA 2820

CTGCnTTGAA TCAATTCGnA CTATT 2845

(2) INFORMATION FOR SEQ ID NO: 463:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2373 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463:

ACGACCACAC CAATTAATGA AATGGCAATC ACTGTAGCTA CACCTGCACT TGTTCCAAAt 60

CCaTATTAGT ATTtGGCAAT ACTTTTGGTT TTTCTTGTGG AGTTGTAACA TTCACTTTGT 120

TAGAATCTGT CGGTTCATCA TTAATCACCA ATTGTGCCAC GTTTGGAATC ATAATATTCC 180

CTTTATCATC GACATATTGC GAAAAGTCTG CGTCTTTCAA TTTCCCTTTA ATGTACACAA 240

ACAATTGTTT TTCTTTTAAT TTGTTTGGTT CATTAGCATT CCAAGAAACT TTTTGTGTTT 300

TTTCATCAAC ACTAATTTTT CCTTGATCCG TAATGTCTGT CCCATCTGAA AGAGTGACTT 360

TTACTTCTTT ATTCGCTTCC AATACATCTT CTAAATCATC TTGaATGACT AAAGACGTCA 420

TATCTTTGGT ATTTGGTACT TGCACATCTA ATTTGAACGT CACTTCATCG CCCACTTTGG 480

CACTGGTCAA ATCCACTTCT TTATCGCCtT CGAGGATTTT CTTAGCGATA GACGGGTCTT 540

TTGGTGTGAC AGTGGTTGTT GTTTCATCAG TTGGGACTTC TTTGTTGTTT TGGATAATTT 600

TTGCTTTATT TGGAATTTGT AACTTGCCAT CAATAAGGTA CTTATCTAAG CTGGCACCTT 660

CACGAATTTT TGTTTGGAAA TTGAATTTGT ATTCGTGATT ATAGAAATCC GCTGTTTTTA 720

ATACGTCAGC CTTTGCAACC ATACGTACTA AATTGCCTTC ATGctTCGCT TCAAATTTAT 780

TGGTAACGTC CTTTCCTGCT TCGTCCGTGA CTTTCACATC AGACACACGT TCCAATTCAG 840

GAATCAGTTC GTCTTGCATT TCATAGTTGC TATAATAAAA TTCTTTCATT TCATCAGGAA 900



WO 98/50555 PCT/US98/08985

1701 



ptttatptpt 

Li 1 1 Al ulul 


PATPPT AT AP. 
un 1 Lb 1 ri 1 Mb 


PTATATAPTt- 
L 1 /\ i ri 1 riL i L 


ppppTAATCC 


TTCTAACGTA 


TTTTTATCAA 


960 


p 7\ TTTPTTTP 

LA lllblllb 


a T P A f"T Zi T P T 


blLALLbl 1 1 


T&P.TAf;PiAAT 


CATCGGCGCT 


GTCCGAACTG 


102a 


bTl 111 IbbL 


AL 1 A 1 AbLLA 


AAA J. AAL Inl 


APTP.PTTATT 


ATTTTTTGAP 


GGTAATGGAT 

OVJ 1 rVfi 1 *OVJ/i 1 


1080 


TCLLTbAAl I 


rp rp m rrtrn /-* f* f* f rp 

1 1 1 1 1 Lbbb 1 


PTAPPPAAAT 




CGTGAAATPP, 


AATPGAftAAC 


1140 


tv m rr> t> t\ t\ m tv 

GATTAGAATA 


bAAbAALb 1 A 


7\ A A TP t^c ct\c 
AAA 1 L ALL AL 


PATTATPAPP 


ATPATPTAAA 


HHAATATfAT 


1200 


CAGAATCGTA 


> tv * Ti /■* m m m m rn 

AAAAGTTTTT 


llAllATltt 


rprprp/" , T 1 T/'**TTT 
I 1 1L 1 1L 1 1 I 


b irii A irt/il b 




1 9fif) 


TCGTTGGAAC 


GTACAATTTA 


TCGATTGCTT 


rn Tt rp 7\ /-»/-< A ATP 

I Al ALbAAl b 


TTTPTPAATP 
1 1 I L 1 LriM 1 L 


P.TPZ1TTPPTT 
\3 l l 1 1 i 


1 ^90 

1 JLU 


GTAAATTATC 


<n tv m TV m /™* tv t\ rti 

AATATCAATC 


CATGAATAbT 


AbLL 1 bAAAL 


TPPPTTTTPT* 
i bbL I 1 I i L 1 


TTPTPTP-Tf^P 


1380 


CATGTTCTAG 


/*> m ta ^rn TV HP 

GTACGTCCAT 


GTTTGTTTCA 


PPPATPAP A A 
LLLA1 bnun/i 


APPAPPTTP.P. 
/\LLr\LL 1 1 uo 


TAP,(^TTP,PAA 


14 40 


TTGCGGATAC 


7\ m rn m m T\ tv ti t\ 

ATTGTTAAAA 


bLAAL AA 1 A 1 


TnTTTPZiPAP 
i A 1 1 J. brtLML 


Lninl 1 1 lol 


P.P AT TTTT AT 
u^nl 1 l 1 irtl 


1500 


ATTGTTCCCA 


ATCTTGAAlb 


G I AAl I 1 1 1 A 


AA 1 LAA 1 1 bb 


TPTPPPATPA 
1 b 1 LLLrl 1 Lri 


TP ACPTf^T AP 


1560 


CTACATTGTT 


GTAACGAA1 0 


LALbAI IbLL 


Li i 1 L 1 1 Lb 1 


TTTPPTTZiPA 
111 1 1 riV^rt 


TPAAP,APPP,A 


1620 


TTTTCCCGTC 


CCATTTTTGT 


bbAbbLA 1 A I 


T A A PPT A A AP 
i AALb 1 AArtL 


ATP AT TPP TP 


PAPPPATPAP 


1680 


CACCTGTTTC 


AACCGTAGTG 


ALALL 111 AA 


T A A A TTTTPP 

1 AAA1 111 bL 


Zi AT Zi A A PPT A 
AA 1 AAHbb 1 r\ 


TZiPTTTPA AP 
1 e\\j 111 ijruAv^ 


174 0 


CAGCTTCATC 


ATCTGGTGGT 


CACTACAGAA 


bLAl lLbbLb 


PATTA ATPAT 

LAI lAAlbH.1 


+■ P Zi ATT A P.PP. 




CTTGCACGTT 


GGAAAAAATC 


TGCTTTCACT 


rp rp fp rn ^ T\ 

b 1 1 1 b 1 bbl A 


L 1 1 b 1 LLbL 1 


TTPT+- (ZTfZCT 


18 60 


TGAACCATCA 


TTGGCATTTC 


CATGCCTGL I 


AAAATTPTTA 

AAAA1 lb 1 1 A 


P APTTPPT A A 
LAb 1 1 bL 1 i\n 


TPPTTP.A AT A 
1 Lv- 1 lVj/vilrt 


1920 


ATCGTTGTTT 


rn m m ^ t\ 7\ ta t\ m 

TCTTCAAAAT 


7\ ni rn rn f+ txs rp rn 

Al I ILL1 L 1 1 


TTPA TTATTA 


ATTTTTTPTA 


PTTPT ATT TP 

Oi l^i.rt.1 1 1^. 


1980 


ATTTCTCCTA 


GATTTTGCAT 


AAAAAA1 Abb 


BP APPPZiPP A 
AbHb L L rtL bn 


ATPTTPTTPA 


TTP^TTPAPP 

1 1 Ov3 1 1 


2040 


TTTCCALAbl 


Ti rp 7\ Z"* 1 T 1 7\ 

1 Al 1LAL I 1 A 


AAAAb 1 1A1 1 


TPTATAAAPT 


ATATAfimTA 


ATAPAGGAGC 


2100 


rn rp/"^ ^P rp t\ rri rp 

TTL1 rbAl 1 L 


rnrnm rprp 7\ rn TPT 
i 1 1 1 1 Al 1 L 1 


rr zi attpttp 

LLrxril 1 L 1 


TTTA A A ATT T 


APATTTTCAT 

rlvn 1111 \^r\ X 


ATACATACTC 


2160 


CTCAACAACT 


ATACAACTTT 


TCTTTGCTAT 


ACTAGCATTA 


AAAATCAACT 


CATTTTCTAA 


2220 


AGAAATTTCT 


GAAAGAACAA 


AACCTTCTTT 


CATTGAATCT 


GCATGCtTCA 


AACTATCTCC 


2280 


CTTTAAATAG 


AGTTTCTTTT 


CATAAAAAAG 


TAATCTATCC 


CTTACTCTAT 


CAACAACTAC 


2340 


AGGAnACAAT 


TCTTCTAACT 


CTCGAnCATC 


TCT 






2373 


(2) INFORMATION FOR SEQ ID NO: 464: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11036 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 

CGATGATGAC CTTAAGCGAC CGAAATGAAG CGATTGATGT GGTAACAATC AAAGATCGCC 60 

TAGAACAAGA GAATTTGTTA GAAGATGTTG GTGGACTAAG TTATCTGTCA GATTTAGCTT 120 

TAGCTGTGCC GACAGCTGCC AATGTCATTT ATTATGCAAA AATTGTTGAA CAAAAATCGT 180 
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TGCTGCGCAA 


CTTAATCCAA 


ACCGCTACAG 


AGATTGTAAC 


TAAAGGCTTT 


GAGCAAGGTG 


240 


AAGATGTTGA 


ATCTATTTTG 


GAT GAT GC AG 


AACGAAGCAT 


TTTAGAAGTC 


TCCGAAAAGC 


300 


GGaATCGTAG 


TGGCTTTTTA 


TCAATTTCAG 


ATGTtCTAAA 


TACaACTATT 


GCGAATATTG 


360 


ACCAATTATA 


CCAAAATGAT 


GAAGAGATTA 


CTGGTTTACC 


TACAGGCTAT 


CAAGCATTGG 


420 


ATAAAATGAC 


TGCTGGCTTG 


CaAGCGGAaG 


AGCTAATTAT 


tCTAGCGGCC 


CGTCCAGCGG 


480 


TAGGGAAAAC 


CGCTTTTGCA 


TTAAATATTG 


CCCAAAATGT 


CGGCACCAAG ACAGATAAAg 


540 


CTGTCGCTAT 


TTTTAGTTTA 


GAAATGGGGG 


CAGAATCATT 


AGTGAACCGG 


ATGCTTTGTG 


600 


CAGAAGGATC 


CATTGAGGCC 


AGCCATTTAA 


GAACAGGGCA 


ATTATCTGAA 


GAAGAATGGC 


660 


AAAACTTAAT 


TATTGCGATG 


GGTAGTTTGT 


CCCGTGCGAA 


TATCTATATT 


GATGACACAC 


720 


CAGGGATtAA AATTACCGAA ATTCGTGCGA AATGTCGTAA ATTAGCACAA GAAAAAGGCA 


780 


ACTTAGGTCT 


AATTTTGATC 


GACTACCTAC 


AATTGATTGA 


AGGAACTGGA 


AAAGAGAACC 


840 


GCCAACAGGA 


GGTTTCTGAT 


ATTTCACGTC 


AATTAAAAAA 


ATTAGCGAAA 


GAATTAAAAG 


900 


TCCCGGTTAT 


CGCTTTGTCA 


CAGCTTTCTC 


GTGGCGTGGA 


ACAACGTCAA 


GATAAACGTC 


960 


CTGTGTTAAG 


TGATATTCGT 


GAATCAGGTT 


CCATTGAGCA 


AGATGCCGAT 


ATCGTTGCCT 


1020 


TTCTTTATCG 


TGATGATTAC 


TACGATCGTG 


GTGAAGGAGA 


AGACGGTGAT 


CATGAGCCAC 


1080 


CAGAAGTTGA 


CAATGTTGTA 


GAAGTCATCA 


TTGAGAAAAA 


CCGTAGTGGT 


GCGCGTGGAA 


' 1140 


CGGTCGAATT 


ACTTTTCATT 


AAAGAATATA 


ATAAATTTTC 


TTCGCTTTCG 


CCGAGGACAG 


1200 


AATTTTAAAA 


AACGAACTCA 


TTTTTTGAGT 


TCGTTTTTTT 


GCTTATTAAG 


GGATTAAAAT 


1260 


CAGAGAGTAA 


AACACGAACA 


ATATTCGTAT 


ATTTTATTAT 


TTTACAAAAA 


CATAACTAAA 


1320 


AATGTTCGTA 


TTTTTGACAG 


AAACGCTTTT 


GATTGTGAAA 


GAATATTCGA 


TTTTTACCCT 


1380 


TGAAAGGGAC 


TGGTTTTACT 


GGTACAATGA 


TTCAGGACAA 


AAAAATTATG 


AACGAGGTGT 


1440 


TCGAATGTCA 


TCAGTAGTAG 


TTGTTGGAAC 


GCAATGGGGC 


GATGAAGGTA 


AAGGAAAAAT 


1500 


TACAGACTTT 


CTAAGTGAAA 


ACGCAGAAGT 


GATCGCTCGT 


TACCAAGGCG 


GAGATAACGC 


1560 


CGGACATACC 


ATTAAATTCG


ACGGCGTAAC 


GTATAAATTA 


CATTTAATTC 


CTTCAGGGAT 


1620 


TTTCTATAAA 


GAAAAAATTA 


GTGTAATTGG 


TAACGGCGTA 


GTTGTGAATC 


CAAAATCTTT 


1680 


AGTAAAAGAA 


TTAGCTTATT 


TAAAAGAAAA 


TAATGTTGCC 


ACTGATAATT 


TACGTATTTC 


1740 


AGATCGTGCC 


CATGTCATCT 


TGCCGTATCA 


TATTAAATTG 


GATCAATTGC 


AAGAAGATGC 


1800 


GAAAGGTGAA 


AACAAGATTG 


GGACAACGAT 


TAAAGGAATT 


GGCCCAGCCT 


ACATGGATAA 


1860 


AGCAGCACGG 


GTGGGAATCC 


GTATCGCTGA 


TTTATTGGAT 


AAAGAAATTT 


TTGCGGAACG 


1920 


TTTGCAAATT 


AACTTAGAAG 


AAAAAAATCG 


TCAATTCGTT 


AAAATGTTTG 


ATAGCGAaCC 


1980 


ATTGAATTTG 


ATGATATTTT 


TGAAGAATAC 


TATGAATACG 


GACAACAAAT 


TAAACAATAT 


2040 


GTTACAGATA 


CTTCGGTTAT 


TTTAAACGAT 


GCATTAGATG 


CTGGAAAACG 


GGTTTTATTT 


2100 


GAAGGGGCAC 


AAGGTGTCAT 


GTTGGATATC 


GATCAAGGAA 


CCTATCCATT 


TGTTACTTCC 


2160


TCTAATCCAG


TAGCTGGTGG 


CGTAACTATC 


GGTAGTGGCG 


TTGGTCCATC 


AAAAATTAAT 


2220 


AAAGTGGTTG 


GTGTCTGCAA 


AGCGTAGACT


TCACGTGTCG 


GTGACGGCCC 


ATTCCCAACA 


2280 


GAATTATTTG 


ATGAAACAGG 


AGAAACCATT 


CGTCGTGTCG 


GTAAAGAATA 


CGGAACAACA 


2340 


ACAGGACGTC 


CGCGTCGTGT 


CGGTTGGTTT 


GATTCAGTAG 


TCATGCGTCA 


TTCAAAACGT 


2400 


GTATCAGGGA 


TTACAAACTT 


GTCATTAAAC 


TCGATTGACG 


TGTTAAGTGG 


TTTAGAAACG 


2460 


GTGAAAATTT 


GTACAGCTTA 


TGAACTTGAT 


GGTGAATTAA 


TTTATCATTA 


TCCAGCAAGC 


2520 


TTGAAAGAAT 


TAAGCCGCTG 


TAAACCAGTT 


TATGAAGAAT 


TACCAGGTTG 


GTCTGAAGAT 


2580 


ATCACTGGTT 


GCAAAACTTT 


AGCCGATTTA 


CCAGCTAATG 


CTCGTAACTA 


TGTGCATCGG 


2640 


ATTTCAGAAT 


TAGTTGGTGT 


GCGCATTTCA 


ACATTCTCTG 


TAGGTCCAGA 


CCGTAACCAA


2700 


ACGAACGTTT 


TAGAAAGCGT 


TTGGGCACAA 


ATTTAAGCAG 


ATTTTTAAGG 


GAACGTAAAG 


2760 


GATGGCTTAT 


GCCAGTTCAG 


CCTTTACGTT 


CTTTTTTAAT 


TTTATATAAA 


AATGAAAAAC 


2820 


TATGGTAAAA 


TAAAAGAAAA 


CATGAAGGGG 


GCAATTGTCC 


GTGAATTATC 


AATTAGTGAC 


2880 


AGATTCTTGT 


TGTGACTTGC 


CGTATACATA 


CTTAAAAGAA 


AACCAAGTAC 


CTTTTATTAG 


2940 


CATGAATATT 


CAAATTGACG 


GGAAAGAATA 


TCGCGATGAT 


TTAGGTGAAA 


CGTTTGATTA 


3000 


TCAGAACTTC 


TTAACGGCTA 


TTAAAAATGG 


AAGTATGCCC 


ACTACCTCTC 


AAGTTAATGT 


3060 


AGGACGCTAC 


AATGAATTTT 


TCCGTCCTTT 


TGTGGAGCAA 


GGGTTACCGG 


TGATTTATTT 


3120 


AGCTTTTTCA 


TCTGGGTTAA 


GTGGCTCTTA 


TCAAAGTGCG 


TTGCAATCAG 


TAGAGATGTT 


3180 


AAAAGAAGAG 


TATGATAATG 


TAGAAATTCA 


TATTATTGAC 


ACAAAAGCAG 


CTAGCTTAGG 


3240 


TCAAGGGATG 


TTAGTCCGAG 


AAGCCATTCG 


ATTACAAACA 


GATGGTCATT 


CATTAGGAGA 


3300 


AGTTGTTGCC 


TATCTTGAAG 


AGCAAAAAAT 


GAAACTTCAC 


TCGTGGGTAA 


CGGTAGATGA 


3360 


TTTAAAACAT 


TTAGAACGTG 


GCGGACGGAT 


TTCTAAAACA 


GCGGCTGCAC 


TTGGTGGCTT 


3420 


AATGAACATT 


AAACCAATTA 


TTCGCGTGGA 


TGCGGCAGGC 


AAACTAGCAT 


CCGTTGGTAA 


3480 


AACTCGTGGA 


CGAAACAAGT 


CCTTGCAAAA 


GATTGCGCAA 


GAAACGATTC 


AAGGCATTGT 


3540 


AGAGCCAATG 


AAACAAACAC 


TATTGATTGC 


ATATGCGGGA 


ACGAAAGATG 


ACGCGGAAAA 


3600 


AGTCAAAGAA 


TTAATTGAAA 


AAGAAATAGA 


GGTTAATGAG 


ATACTGATTT 


ATCCTCTAGG 


3660 


ACCCACCATT 


ACTAGTCATA 


CCGGAATAGG 


TTGTATTGCT 


GTGTTTTCTT 


TTGGTGAAAA 


3720 


AAGAAAATAA 


AAAATAGAGA 


AGTCATTTTT 


TAATGACTTC 


TCTATTTTTT 


TAATAATCTG 


3780 


AACTGAATAA 


AACGTCCATT 


TGTTTTTGAA 


TTTCTGGGTT 


TTCCAAGAAT 


TCATCATAGG 


3840 


TTGTTTCTGC 


CCGATCAATG 


ACCCCTTTAT 


CAGAAACAGC 


AATAATTCGG 


TTCGCTAATG 


3900 


TTTGGATAAA 


TTGGTGGTCA 


TGTGAAGCAA 


AAAGAATTGA 


ACCAGTGAAA 


GCCATCAACC 


3960 


CATCATTTAA 


TGCAGTGATT 


GATTCTAAGT 


CTAAGTGGTT 


CGTTGGATCA 


TCTAAAACTA 


4020 


AGACATTGGC 


TTTGGAAAGC 


ATTAATTTTG 


AAAGCATGAC 


ACGCACTTTT 


TCGCCTCCGG 


4080 


AAAGAACATT 


GACTGGTTTT 


AGTACCTCTT 


CACCAGAGAA 


TAACATCCGA 


CCTAAGAAAC 


4140


TACGTAAGAA


CGTATTGTCA 


TCTTCTTCTT 


TACCAGCAAA 


TTGACGTAAC 


CAATCTAAAA 


4200 


TGGTTAATGG 


CTCTTCAAAG 


TCTTTGCTGT 


TGTCTTTTGG 


TAAATAAGCT 


TGGCTAGTTG 


4260 


TAACGCCCCA 


ACGAACAGAA 


CCTGTATCGG 


GAGTAATTTC 


GCCCATAATC 


ACTTTGAATA 


4320 


ACGTAGTCGT 


TGTAATATCT 


GAGTCAGCGA 


TGAATGCCAC 


TTTATCATCT 


TTTGTTAAGT 


4380 


TGAACGAGAT 


ATTATCTAAG 


ATTTTTTTGC 


CATCAATCGT 


TACAGAGACA 


TTTTCAACTT 


4440 


GTAATAAGTC 


ATTGCCGATT 


TCACGTTCTG 


GTGTAAAACC 


AACGAATGGA 


TAACGACGAG 


4500 


AGGAAGGTTG 


AATGTCATCT 


AAGGTAATTT 


TATCTAACAT 


TTTTTTACGA 


GACGTTGCTT 


4560 


GTTTTGATTT 


TGAGGCATTG 


GCGCTAAAAC 


GAGCGATAAA 


GTCTTGTAAT 


TCTTTGATTT 


4620 


GTTCTTCTTT 


TTTGGCATTA 


GATTGTGCTT 


GCAATTTTGT 


CGCTAGTTGG 


CTTGATTCCA 


4680 


ACCAGAAATC 


ATAGTTACCA 


ACGTAAAGTT 


TGATTTTACT 


AAAGTCTAAA 


TCTGCCATGT 


4740 


GAGTGCAGAC 


TTTGTTTAGG 


AAATGACGGT 


CATGGGAAAC 


CACGATAACG 


GTATTTTCAA 


4800 


AGTTGATTAA 


AAATTCTTCT 


AACCAATTGA 


TTGAGCGGGT 


GTCTAAACCA 


TTTGTCGGCT 


4860


CGTCTAATAG 


TAAGACATCT 


GGTTTACCAA 


AAAGAGATTG 


AGCTAATAAT 


ACTTTGACTT 


4920 


TTTGACCAGC 


TGTTAATTCG 


CTCATTTTTT 


GATCATGTAA 


TTCTTCTGGA 


ATGTTTAGCC 


4980 


CTTGGAGTAA 


AACAGCTGCT 


TCAGGTTCTG 


CTTCCCAACC 


GTCAAGTTCA 


GCAAATTCGC 


5040 


CTTCTAGTTC 


TGCGGCACGA 


ATCCCATCTT 


CATCTGAAAA 


ATCTTCTTTC 


ATATAGATAG 


5100 


CATCTTTTTC 


TTTCATTACT 


TCGTAAAGAC 


GTTTATGTCC 


CATAATTACA 


GTTTCTAAAA 


5160 


CAGTGTAGTC 


TTCGTAGTCA 


AAGTGATTTT 


GTTTCAGCGT 


TGCTAGTCGC 


TCATTAGGAC 


5220 


CCATTGAAAC 


CACGCCAGTT 


GTCGGTTGAA 


TCTCACCTGA 


TAAAATTTTA 


AGAAACGTTG 


5280 


ATTTTCCTGC 


ACCGTTTGCG 


CCGATTAAGC 


CGTAACAGTT 


GCCAGGAGTA 


AATTTAATAT 


5340 


TCACTTCTTC 


AAAAAGTTTA 


CGATCTGGAA 


ATTGTAAACT 


TACATCATTA 


ACAGTAATCA 


5400 


ATTGTTTTCC 


TCGCTTTTAA 


TATATTCTTG 


AACCTCTCTG 


CATTATATCG 


GCTACAGAAC 


5460 


CTTTTTTCAA 


GGAGTTTCAA 


TCTTTTTACA 


GAAACGAATA 


AAAAGGTGGG 


TTTTTTATAG 


5520 


AAAAAAGGCG 


CTTTTTTAAA 


ATGTTTCATG 


TGAAACAGAA


ATTTTTTTCA 


TGAAAGGGTT 


5580 


TTTAGTGTAT 


TTTTTTAATG 


ATACACTGTA 


TCATGTAAAT 


TAATTAGTGT 


ATCGATTAAA 


5640 


GGAGCCATCA 


AAAATGAAGC 


GGATTGAAAA 


AATCTATCAA 


TATGTTAAAG 


AGCAAACGAA 


5700 


GAATTTAACG 


CCAATGACCT 


TAACAAGTGA 


TGCGGGAGTC 


ACGACACAAG 


AAATTTCAGA 


5760 


ACGTTTAAAC 


ATTCAGCGGA 


CCAATGCTAG 


TAAAGATTTG 


AATCAACTGG 


TTCGTGAAGG 


5820 


GCGCTTGAAA 


AAATTATCTG 


GTCGACCAGT 


AAAATATGTC 


GCGCAGGTGG 


CGTTTCAGCA 


5880 


TCGACCATTA 


ACCAAGCCAG 


TTAAAAGTTA 


TCGGGAAAAA 


TCAATGGATC 


ACTCCACAAA 


5940 


AACTTTTATC 


GAACCATTGG 


TTGAACCAGT 


AGTCACGACA 


AGTAAAACCC 


CAGAAGTTGA 


6000 


AGATATTTTC 


AAAAAAATTA 


TCGGTTCAGC 


TGGAAGCATG 


AAAACGCCGG 


TGGAACAAGC 


6060 


CAAAGCAGCG 


ATTTTATATC 


CGCCAAAAGG 


CTTGAATTGT 


TTAATTACTG 


GACCAACAGG 


6120


TTCTGGGAAA ACTTATTTTG CTCATGCCAT GTTTCAATTT GCGAAqCTGa ACCAAATTGT 6180

GGCCAAAGAA AAAGAGTTTG TGGTGTTTAA CTGTGCAGAC TATGCGCATA ATCCAGAACT 6240 

TTTGATGTCG CATTTATTTG GGTATGTAGA AGGGGCCTTC ACAGGCGCTA CGAAAGCGAA 6300 

AGAAGGGATT ATTGATGAAG CAGATGGCAG CATTCTGTTT TTAGATGAAG TACATCGGTT 6360 

ACCACCCGAA GGACAAGAAA TGATTTTTTA TTTTATGGAT CACGGCGTGT ACGCGCGTTT 6420

AGGTGAAACA GTGAAATCAC ATCATGCGGA TGTTCGGATT ATTTGTGCTA CGACGGAGAA 6480

TCCTACTTCT TCTTTATTGA ATACTTTTGT GCGACGCATT CCGATTATCA TTCAATTACC 6540

GAACTTTAGT GACCGTCCTG CGAAAGAAAA AATTGACCTC CTAAAAGTGA TGGTTTCAAT 6600 

GGAAGCAGCG CGGATTCAAC GGCGAATCTC TTTGTCAGAA GATGTGGTCA AAGCGTTAAT 6660 

TGGAAGTGTT TCTTATGGAA ATGTCGGTCA GCTAAAATCG AATGTGCAAC TTGTGACTGC 6720 

GCGAGGGTTC CTTAATCAGA TGGAACAAGA GGAATTAATG ATTACGATGG ACGAATTAAC 6780 

GGACAATATC AAAGAAGGCA TTATGCAACT TGCCAGTAAT CGAGAAGTAT TGTCGGAGTT 6840

ATCGAAATAT CTGGAACCGC AATTAGTGGT ATCACCAAAT GAATCTTTAG TAGCTATTCA 6900 

GTCAGATGCC TATGAATTGC CTTACAATCT ATATGAGATT ATTGGAGATA AAGCGGCTCT 6960 

CTTAAAAGCA GACGGCTTGG ACCAAGAACT AATTAATAAT TTTATTACAA CGGATATCAA 7020 

TGTTCATTTA AAATCTTTCT ACAAAGATTA TGGGTTCACC TTTGATACAG AAAATAAATT 7080

GGCTGAAATT GTTGACCAAC GAATTATTGA TGTCACTAAG AAAATTTATA ATTTTGCAGC 7140 

GAAgCGGTTA TCGTATTCGT TTCAACCCAA CTTTATTTAT GCAATGAGTT TACACATTAG 7200 

TTCTTTCTTG AAACGCATCC AATTAGGCAA AGATCCTAAG CATCCTTTAA ATGAAAGTAT 7260 

TCGAAATATG GTTTTAGATT ATCCAACAGA GTTTGAAACA GCAAAAGAAA TTAAAAATAT 7320 

TATTGAAAGT AGTTATCAGA TGGAGATTCC TGAATCTGAA AGTTACTATT TAGCTGTTTT 7380

GCTCATTTCT TTAAGAGAAA ATCCTGAGGC TGGCCGGATT GGCATTGTCG TGGcTGCGCA 7440

CGGAAATAGT ACCGCTAGCA GTATGGTACA AGTGGTTTCT CAATTGTTAA ATGTTGATAA 7500

TTTGAAGGCT GTGGATATGC CGTTAGATAT GCCACCGAAA GAAGCTTTGC GTAAAATCGT 7560

TGAAGCAGTT GGTGAAGTGA ATGAAGAGAA TGGCGTACTA TTATTGGTGG ATATGGGTTC 7620

ATTAAGTACT TTTTCAGAAG AAATTGTACG CCAGACAGGC ATTGATGTGC GGACAGTAGA 7680

CATGGTGACC ACGCCAATTG TTCTTGAAGC GGCCCGCAAA ACCGCATTAA TTGATACGCA 7740 

ATTGGAAACA CTTCACGAGT CTTTAAAAAA CTTTCATGGC TATGCCGATA TTCGCCAAAG 7800 

TGAAACGAAA CAAATCATTG AAAATTGGAA AACAAGAGCA ATTATTGCTA TCTGCGCTTC 7860

TGGAGAAGGA ACTGCGCGCC GAATGAAAGA GCTTATTGAG GAAGCTGTCT TGCCACAAAT 7920 

AGACTGGCAT TTAGAAGTGA TTCCTTTGTC CATTGTAAAT ATGAAAGAAG TCTTGCCTAA 7980 

AATTCAAGAA GACTATGAAA TTATTGCGAC AACTGGGATT ACCAATCCCA AAATTGGTGT 8040

TCCTTATATT TCCATGGAAA ATTTCTTTTC CGGTGAAGCA GAAAAAATTA TTCAGCAATT 8100

ACTAGCAGAG


CGGAAAGAAG 


AAGTAATGAA 


TAAGCCGTTG 


GACGAAGCGG 


CAGCAAAAGA 


8160 


AATTTGTCTA 


AAATATATGG 


AGCAAAGTTT 


CACTTTCATT 


AATGGAAAAA 


AAATCATTGA 


8220 


TCTGTTGTGG 


TCATTCGCTA 


AAAATATCCA 


AACCCAATTA 


GCGATGCCAG 


AAGAGTACAC 


8280 


ATTTTACATC 


AATTTGATTA 


TGCACACATC 


GGGCATGTTG 


GAGCGTATTT 


TACGGAATGA 


8340 


TACACTGACG 


GTTTCAGAAA 


AGGAACTTGG 


TCGATTGGTT 


CAAGAACCAA 


TTTATCCAGT 


8400 


GATTGTGTCC 


TCAATTGAGA 


CTATGGAAGA 


AGCTCTTAAT 


ATGGATGTTC 


CAGCAGAAGA 


8460 


AGTTTATTTT 


ATCGCACAGT 


TAGTCAAAAA 


TGCTCAGTGT 


ATCGAAGATA 


AAATCACCGA 


8520 


AATTGATACA 


CTAGATTAAT 


ACAGAAAAAC 


GATACACTAT 


TTATTAGTGT 


ATCGTTTTTT 


8580 


TTATTTTCTT 


AAAAAAACCT 


ATAATTTTCT 


AGGTTTCGAA 


CCGATACACT 


AAAGTTGGCA 


8640 


CGAAACTTGC 


TTATAATAAG 


GTGTCCCTTT 


TTACCTAAAA 


TTTTTAAGTA 


AAAAGACAAA 


8700 


GGTCCTAACC 


TTTAAAAATG 


AATTTAAAGG 


AGCATTCCCG 


ATGGTAATGG 


ATATTCGTTT 


8760 


AGCTCGAATT 


GATGATCGAT 


TAATTCATGG 


CCAAGTAGCA 


ACTGTTTGGG 


CAAAATTGAC 


8820 


GAACATTAAT 


CGCATTTTAG 


TGGTTAGTGA 


TTCCGTTGCT 


CATGACAATT 


TGAGAAAAGC 


8880 


CTTGTTGGTT 


CAAGCTGCTC 


CACCTGGTGT 


CAAAGTAAAT 


GTCATCACTG 


TTCAAAAAAT 


8940 


GATTGATGTT 


TACCCTGATC 


CTCGTTTTGA 


TTCGTTTCGA 


GCCATGTTGT 


TATTCACTAA 


9000 


TCCTTTGGAC 


GTGAAACGAG 


TGGTCGAAGG 


CGGTGTACAA 


CTAACTTCTG 


TGAATATTGG 


9060 


CGGAATGAGT 


TTCAGCACAG 


GCAAGCGAAT 


GATTACGAAT 


GCAGTCGCAG 


TGGATGCCAA 


9120 


CGATTTAAAA 


GCATTTCTGT 


TTTTAAATGA 


ACAAGGTATT 


GAATTGGAAA 


TTCGAAAAGT 


9180 


CGCAGCAGAC 


AGCCAAGTGA 


ATTTGATGGA 


TTTACTGAAA 


AAAGAGCGTA 


gCAAAGAAAA 


9240 


TAACGCCAGC 


TGACGTTAAT 


TTTTACAGAC 


GAAAGACATT 


GAAAGGATGT 


GGTGGCCTAG 


9300 


GACCAACATC 


ATTGCTTCAT 


GAACTGTATT 


GATAAAGGCA 


AGTGCTGATG 


TTGAAAAAAA 


9360 


TCAAGAACTA 


TTGAAATTTT 


TTCATTTATT 


TTTGTCAGAA 


TCTGATTTAT 


CTGATTTTTT 


9420 


TCAAAAAGTG 


AAAGCGTTTA 


GCAGTACAAC 


CATGAAAAAA 


AACATCAATT 


ATACAGGAGG 


9480 


TATAAAAATG 


GTAGGAATTA 


TCCTAGCAAG 


TCACGGTGAA 


TTTGCTGAAG 


GGATCTTGCA 


9540 


ATCTGGTGCA 


ATGATCTTTG 


GCGAACAAGA 


AAATGTAAAA 


GCAGTTACGT 


TAATGCCTAG 


9600 


CGAAGGTCCC 


GATGATGTAA 


AAGCAAAAAT 


GCAAGAAGCC 


ATTGCATCGT 


TTGACAATCA 


9660 


AGACGAAGTA 


CTATTTTTAG 


TTGACCTATG 


GGGAGGAACA 


CCATTTAACC 


AAGCGAACTC 


9720 


ATTHTTAGAA 


GATCATAAAG 


ACAAATGGGC 


AATCGTTGCA 


GGTATGAACT 


TACCAATGGT 


9780 


TATTGAAGCA 


TATGCTTCTC 


GTTTCTCTAT 


GGAATCAGCA 


CAAGAAATTG 


CGACACACAT 


9840 


TCTTGAAACA 


GCAAAAGATG 


GTGTCAAAGT 


TAAACCAGAA 


GAGTTACAAC 


CAGCAGAAGC 


9900 


ACCAAAAGCT 


GCTGCGACAG 


AAGATGCACA 


GCCAAAAGGT 


TCGTTACCAC 


CAGGTACAGT 


9960 


TGTTGGCGAT 


GGCAAAATTA 


AATTTGTGCT 


AGCACGTATT 


GATTCACGTT 


TATTACATGG 


10020 


TCAAGTAGCA 


ACAGCTTGGA 


CAAAAGCAAC 


ACAACCAAAC 


AGAATTATTG 


TTGTTTCTGA 


10080


TGCCGTTGCA


AAAGaTGACT 


TACGTAAGAA 


ATTAATCGAa 


CAAGCAGCTC 


CTCCAGGAGT 


10140 


AAAAGCCAAT 


GTTATTCCAA 


TCAGTAAAAT 


GATTGAAGTA 


GCGAAAGATC 


CACGTTTTGG 


10200 


CAATACAAAA 


GCATTATTAT 


TATTTGAAAA 


TCCTGAAGAC 


GTCTTAAAAG 


TGGTTGAAGG 


10260 


CGGCGTAGAA 


ATCCCAGAAG 


TAAACGTTGG 


TTCAATGGCT 


CACTCTGTTG 


GAAAAGTCGT 


10320 


TGTAAGCAAA 


GTATTATCAA 


TGGGTCAAGA 


AGACGTTGAT 


ACGTTTGATG 


AATTAAAAGC 


10380 


CAAAGGCATC 


AAATTTGATG 


TGCGTAAAGT 


GCCAAATGAT 


TCAAAAGCGA 


ATATGGACGA 


10440 


AATTTTGAAA 


AAAGCAAAAA 


ATGAGTTAGC 


AAACGCGTAA 


CTCGTCAAAT 


ACAAATCAAA 


10500 


ATAGGAGGCT 


TATCATGTCT 


GTTATATCAA 


TTATTTTAGT 


TTTACTCGTT 


GCCTTTCTAG 


10560 


CTGGTATGGA 


AGGGATTTTG 


GATGAGTTCC 


AATTCCACCA 


ACCGTTAGTA 


GCATGTACAT 


10620 


TAATCGGATT 


AGTTACTGGT 


AATTTAGAAG 


CCGGAATTGT 


TTTAGGTGGT 


ACTTTACAAA 


10680 


TGATCGCbl 1 


Abbl ibbbbb 


AAbA L Lbbbb 


bl bbbb 1 1 bb 


APprp 7\ Terr 1 


uLi 1 1 rtu^ri 1 


i m a n 


CTGTTGCCTC 


AGCAATTATT 


TTAGTTTTAG 


GTGGCCAAGG 


TGTTAAAGGT 


GTTCCTTCAG 


10800 


CGATTGCCAT 


CGCAGTTCCT 


TTAGCGGTTG 


CAGGTTTGTT 


CTTAACAATG 


ATCGTTCGGA 


10860 


CAGTTGCGGT 


TCCAATCGTT 


CACATGATGG 


ATGCGGCTGC 


TGAAAAAGGC 


AACATTAAAC 


10920 


AAGTTGAAtG 


TaCACATTTT 


CGCTGTATGT 


TTACAAGGTA 


TTCGTATTGC 


GATTCCAGCA 


10980 


GGTGCGTTAC 


TATTCATTCC 


TGCTGACACT 


GTTCGTAATT 


TCTTAGAATC 


AATGCC 


11036



(2) INFORMATION FOR SEQ ID NO: 465:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 

AACGGTGATG CAGGAAAAAG AATCCTCTTC GGCAATCGCC GAAAGCAGTA GCGGAAATGC 60 

GGTTGCAGTA ACTACTGAAA CCACAAATGA GATACAAAAT AGTGATACAG ATGGaAAAGC 120

TGTTTCTGCr GAGAGTGTTT TTTCAGAAGC AGaTTATAAA CAAGCAACTG CACTGGAGTT 180 

AGCAACTTTA GTAAGAGAAA AAAAGGTGAC GAGCGAAGAA TTAGTAAAAA TTGCGTTGGC 240

CATCACTAAA CGTGAAAATC CTACATTAAA TGCGGTCATT ACTTTACGAG AAGAAGCGGC 300

TTTGACAGAA GCAAAAGCTT TACAAGATAC AGGTCAACCA TTTCTAGGTG TGCCGCTCTT 360 

ACTAAAAGGG TTAGGACAAT CCTTGAAAGG TGAAAGCAAC ACGAACGGTT TTGGTTTTCT 420 

CCAAGATCAA GTAGCTGGTG GGACATCGAC CTTTGTCAAA GCACTACAAA ATGCCGGATT 480

CATAATCATT GGTCAAACGA ATTATCCAGA ATTAGGTTGG AAGAATATTT CAGATTCAAA 540

GTTATATGGC GTTTCCGTCA ATCCATGGAA TCCTAACCAT TATTCAGGTG GTTCTTCAGG 600 

CGGACnGGGT GCTAGTGTCG CCGCGGCATT TGTTCCCATT GCTTCTGGAA GTGATGCTGG 660 



WO 98/50555 PCT/US98/08985

1708 



TGGCTCTATC 


CGCATCCCTG 


CTTcTTGGAC 


AGGCACCGTA 


GUI lGAAALG 


1 1 G i AG AGG A 


"7 9 0 


GTAATCATTG 


GTAATTCTAA 


fn tv m ta ta TV 7\ 

TAGTGCAAAA 


GGTCAAAC IG 


1 1 LnL 111 ob 


1 1 1AAG1GGA 


Ton 
/ou 


ACCGTGGCGG 


ATACAAATGC 


ATTATTTGAA 


TV /""• /""■» rn rn TV rp m 71 TV 

ACCTTAT I AA 


LGAAAAAAGA 


1 G 1 1GG I GGG 




GGACATTTAA 


GTCAAGCGCA 


ACCCATTGCT 


m TV m TV ^ TA TV ^ TV 

TATACAACAG 


AATCCCCTGC 


CGGAACGCCT 


yuu 


ATAAGTGCCG 


AAGCAAAAGA 


GGCGGTCGCT 


GAAGCTGTTG 


CTTTTTTAAA 


AGACCAAGGG 


O C f\ 


TACACATTGG 


TTGAAGTGAA 


GCACCCTGTT 


GATGGGGAAC 


GTGT. AATGAA 


AAATTATTAT 


i i r\ 


ACTGTAGCTG 


CTGGCTCAGC 


AGGAATTGCC 


GATTTTATGG 


CGCGGCAGAA 


TV rp m tv tv jv Tir'iv 

ATI G AAAAGA 


i r\ a n 


CCGCTTGAAC 


GAAATGATGT 


AGAACTGTTA 


ACATGGGCGC 


T TT T TC AAAC 


AGGAAAAAAJ. 


1 1 4 u 


ATAACGAGTG AAGAGACAAC TGCGGCTTGG ACAGATATTG CTTTACAAGC ACAAGCGATG 1200

GACGAATTTT ATCmACAGTA TCtATCyTAt TAmCACCAAC GACGGCTGCa CGGCACCTAg 1260

TAaTTGATAA TCCGTTACTA AAACCAGAAC ATGCcAGCAC AGATtGGAAA AAAaTTGATC 1320

mATTGTCACC cAGCcAGAaC aAAaACmATT GgtTTaTGaC CaTgGCTGAC GGCGTTCACa 1380

TTACmcCgtT TACCCCACCA AGCCAAATTT TAnTTGGGGC CATCCCGCCA TTAAGGGGGT 1440



(2) INFORMATION FOR SEQ ID NO: 466:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 



TGCACTGAAT CATCCTAACC TAGAAGTTCG CTTAATAACG ACGGTTGCAG GGAATGTAGA 60

TGTAGAAAAA ACAACAAATA ATGCCTTGAA ACTTGTCGAT TTTTTTGGGA AAAAGGTTCC 120

TGTAGCCAAG GGGTGTAATT GTCCGTTACT TATTCAATTA GAAGATTCTG CTGAGATTCA 180

TGGGGAGACT GGTATGGATG GCTTTGAGTT TCCGCAACCT ATCTCTACGT GTTTAGATAT 240

TCATGCAGTG GAAGCTATGC GAAAAGAAAT TCTGTCTAGT GATGTTCCTT TGACCATTGT 300

TCCAATTGCT GCATTAACGA ATATTGCTTT GTTACTGACC CTCTATCCAG AAGTCAAAGA 360

AAATATTGCA GAAATCGTCA TGATGGGTGG CTCATTAGCA AGGGGAAATA CAAATACAAG 420

TGCCGAATTT AATACGTATG TCGATCCACA CGCTGCACAA ATTGTTTTCC AATCAGGTGT 480

CCCGCTAACA ATGGTAGGAT TAGATGTTAC AAGTCAAGCT GTGTTAACGA ATCATGAAGT 540

AACAGCCATC AGAGCGCTTG GTAGAGTTGG GGAAATGTTT TATGGTCTTT TCCGTCATTA 600

TAGAGGCGGC AGTCTAACTA CAGGTTTGAA AATGCATGAT GTCTGTGCGA TTGCCTATTT 660

AACTTCACCA GAATTATTTG AGACAACTGA AACGTTCATT GAAGTTGcTT TGGAAGGTCC 720

AGCTGCGGGA GcTACTGTTG CGGATTTAAA AATGAAGTAT CATAAAAACA CGAATGCCGT 780

TGCCTGTAtA GATGTAAATG TTGAAGCTTT TCAAAAATGG G 821


(2) INFORMATION FOR SEQ ID NO: 467:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1420 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467:



CTTAGGGATG 


GTGACTACTA 


GTGACGGTCG 


AAGCTTCGCT 


GCAbLAbAI 1 


.LbbbAbbLTT 


c a 
bU 


AATTGAAGGA 


GCTTCTCAAG 


GTGTCGGCTT 


Tl f-> TV TV T\ TV /"* 

AGGAACACAG 


TTCTTAbbLL 


ALA 1 i bAACG 


i o a 


GACGCGTGTT 


ATTTTACATG 


TCATTGATAT 


GAGTGGCATG 


GAArGACGbb 


Aibb X 1 AlbA 


ion 
lo U 


AGATTATTTG 


GCAATCAATA 


AAGAATTGGC 


TTCACACAAT 


TTACGTTTGA 


TGGAGCGACC 


O A A 


TCAAATTATT 


GTTGCCAATA 


AAATGGACAT 


GCCTGAAGCA 


TV TV /"^ TV TV TV fTA m 

GAAGAGAATT 


TAGCGAAATT 


o a a 


TAAAGAGCAA 


CTTGCGAAGG 


AACGCACCGA 


TGAGTATGCT 


/™* tv f + tv /■* m tv 

GATGAGCTAC 


CTATTTTCCC 


ocn 
JbU 


AATCTCTGGG 


GTTACTCGCA 


AAGGTATTGA 


ACCTTTATTA 


TV 7\ m /"* TV T\ /"* TV 

AATGCAACAG 


CTGACTTGTT 


yl O A 


GGAAGTAACG 


CCTGAATTCC 


CATTATATGA 


AGATGAAGTG 


^#Tl TV TV TV TV 

GTAGAGGAAG 


AAACCGTTCG 


/OA 
H O U 


TTATGGCTTC 


CAACCAGAAG 


GTCCAGAATT 


TACGATTGAT 


y - */"* ff^ ^» TV TV TV /"* 

CGTGAACCAG 


ATGCTTCATG 


C / A 


GGTTTTATCT 


GGTGAAAAAT 


TAGAAAAATT 


ATTTGAAATG 


tv tv tv m fn m y"* 

ACGAATTTCG 


n m ^ t\ rp tv m tv 

ATCATGATGA 


/"aa 


AACAGTAATG 


CGTTTTGCAC 


GTCAATTACG 


TGGTATGGGT 


GTCGATGAGG 


CATTACGAGC 


£LCC\ 

ooU 


ACGTGGCGCG 


AAAGATGGCG 


ACATTGTTCG 


GATTGGTAAT 


TTTGAATTTG 


AATTTbTCGA 


■? o n 


ATAAGCATAA 


AACCAGTTCA 


CTCAGTGAAC 


TGGTTTTTTA 


TTTTAAAATT 


7\/- , rnTvrprpTV7\7\rp 

AGT ATI AAA i 


/ oU 


TAGCGAATAG 


tv m m V"* * t» tv m t\ tv 

ATTGAAATAA 


TAtTTTTAAT 


ta ±* t\ m t\ 7A rn 

AGCtTATAAT 


TTTaAATGtw 


AbaAAA 1 nft^ 


0 H U 


AAACGAAAGG AGCGAAATTG AAATGGAAAA CAATCAAGCA ATGGTTCCTA AGTTTTCAGT 900

TAAAAGTGCC GGTTTTCTTT TTAGTATGGG GCTCTTATGT GGTGTCCTGT TGCCGAGTGT 960

TTTATTATGG ACTTTCAGcT GCCGTTTAAT ATTGGTGCCA TGATTTTTTT GCCTGTTGGG 1020

ATTGCTTTTG CGTTAGCGTA TAGTCATTAT TTATTGAAAC GAAAACAGGT TTTTGTAAGC 1080

GTTTTATnTG GTTGTTTTTA ATTTCTCTTA TTTTATAGAG CTTATCTCTT ATTTATGGCT 1140

GATTAAAGGA GTTATTTTTT AAGAGAAAAA CAGAGGAGAG TGTTATTTAT GTTAGATAAA 1200

CTGACTGTTT GGATTGAAAA TCATCTAGCT GGACCAATGG CTAAAATTGC CAATCAGCGC 1260

CATTTACGTG CGGTGCGGGA TGGGATTATT GCGACATTAC CTTTAATTAT TGTCGGTTCT 1320

TTTTTTCTAA TTATTGCGTT TCCGCCGTTA CCAGAATCTT GGGGCATTAC TCAATTTTTA 1380

ACAAGTAATG CTGCTACAAT CTTATTGCCT TACCAATGAC 1420



(2) INFORMATION FOR SEQ ID NO: 468:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468:
GGTATATCCA tGTATwACAG tATAgCCmTT aCAtAAtCAT GTAACTaTCA ACAATCaTCA 60 

AAGgTTTCCw AAtCTTTATG GATTGCTGgT ATCCTATCGA TGGTGTGTCA CCATATCCCA 120 

TTGTATGCTT ATTTCCCAAG CATGGATGGT ATTGGCATTT GGCAATTCAC ATCCGCTTAT 180 

ATTGCAGGTG GTTTAGATGG TAACGTAGAT TTAACAGGAA TTACGGATAG TGGTTATACA 240 

GATACAaATA AACCAGAAAC GGATACGCCA GCAACAGATG CAGGCGAAGA AATTGAAAAA 300

ATACCTAATT CTGATGTTAA AGTTGGCGAT ACCGTCAAAG TGAAATTTAG TGTTGATGCT 360 

TGGGCAACTG GCGAAGCTAT TCCAGATTGG GTTAAAGGAA ACAGTTATAA AGTGCAAGAA 420

GCAACTGGAA AGCAGAGTATT GCTAGAAGGC ATTTTGTCAT GGATTAGCAA AGGTGATATT 480

GAACTATTAC CAGATGCAAC AGTTGTCCCT GATAAGCAAC CAGAAACAAC ACATATAGTT 540 

CAACATGGCG AAACATTATC AAGCATTGCT TACCAATACG GTACCGATTA TCAAACTTTA 600 

GCTTCGTTAA ATGGATTGGC TAATCCAAAC CTTATTTACC CTGGACAAGT TTTGAAAGTA 660 

AATGGATCAG CAACAAGTAA TGTTTACACA GTCCAGTTGG GTGATAATTT ATCAAGTATA 720 

GCGGCTAAAC TTGGCACGAC TTATCAAACA TTGGCAGCTT TAAACGAATT AGCAAATCCT 780 

AACTTGATTT ATCCGGGTCA AACGTTGAAC TATTAAGAAA TAAGTAAAAG GCCTACTTCT 840

CTTTGTGAGA GGTAGGTCTT TTTTCATTTA TACTAATATA TACATATCAT TGAATGCTAA 900 

AAAATTAAAA TTTGTCTTAA GTAAATTTTC AAGTTATGGT ATACTTTGCT GGTGAATATG 960 

GGAAGGAGCA TAAAATTTGA AGAAACATTC TAGGAAAATA AAAAATAATA AAGAACTAAC 1020 

TTTACTGATT ATAGGAATTT GGGTTGTTTT TTTTCTTTTT AATATGTATA TGCCGACAGT 1080 

GAGAGCTGAT GACTTAGTTT ACGTAAATAG ATTAGATAAA TTAGGATATT TGGGTGCTTC 1140

AATAGAACAT TATAAAACAT GGAGTTCAAG AGTTATTATT GAATTATTCT TGATGTTTTT 1200 

TTCTAAACAC CTTATGTTAT GGAAATTATT AAATTCAACG ATAATGACAG GTTCTATAGT 1260 

ATTGCTTTGC AAATATGTTT TCAACAAGTT ATATTCTAAA AATTTGCTAC TAGTTTTTTC 1320 

TATATTTTGT CTAATTCCTT TAACAATAAT GGGGGAAACT GGTTGGATCG CTACTACGTT 1380 

AAATTACCAA TGGCCAGTAG CTTTTGGTTT GCTAGCTTTT TACCCTTTTT TTCAAGTGTT 1440

AAGCGGTAAA GAAATAAGCA AAAAAATATA TTGTATAAGT ATTCCTTTAT TAATATTCTC 1500 

TGCTAACCAA GAGCAAGTTA ACGTATGCTT TTTTGTTTTA ACTAGTCTAG TAAGTTTGTA 1560 

TTTATTATTT AGAAGAAATT ATAATTATAA GTTATCGGCG CTTTCCATAA TAAGTTTAGT 1620 

GGAATTATTT TTTTCTTTGA CGGCCCCAGG GAACACATTG CGGGCTGCTC ATGAAATTAA 1680 

TAAATGGTTT CCTGAATACA AAAAATTTAA CTTTGTGAAT AAATTAGACC TTGGTATATC 1740

ATCAGAGAGT GTAAAATATT TTGTGTAAAT GAAAAAATCC ATACAAAAAA GGAAGTCGCT 1800 

TCTGTAGAAT AAAGTTAACG ACAACCAATT CACAGAAAAG AGGACTTCCC TATGAATGAT 1860

TTTACTACAG AAATTGTGCA AACTCTAGTC ACTAAAGGCG ATTTAAATGA ATTATTCCGT 1920

TCGCACTTAG AAAAAGCGAT AAACACACTC CTACGGACTG AATTAACGGC TTTTTTAGAT 1980

TACGAAAAAT ATGATCGCAC TGGTTTTAAT TCAGGTAATT CGAGAAACGG TTCTTACTTT 2040 

CGATCAATCA AAACCGAATA TGGTGaATTA ACATTGGgAA tACCTAGAGA TCGTAATGGT 2100 

GAGTTTAAAC mACmAACTTT ACCAGCCTAC mAAAgGAmCA cGrTACaTTG GaaCCmCTAT 2160 

TATCCnTTTA TCCGAAAAG 2179
(2) INFORMATION FOR SEQ ID NO: 469: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 



TTTATATAAA 


TAAAATAGTA 


ATTGGAATGG 


CAATCAGCAC 


AaTCtGtGTA AAgCCaTGAa 


60 


TAGTGAaTTG 


GCTGTgATTT 


GGTCATCATC 


GAAAaGAGnC 




uu lul nrt ill 


120 


TCAACTTTAT 


CACCCATAAT 


AACTTTTGCG 


AAAATAAAAT 


rr a Trr z\ Ann 




180 


GCGGT TAAAG 


AAGTATAGAC 


AATAATTGGT 


TTCGACAATG 


btnnLu 1 r\c\ 1 


mrnmfpr- T>r; n a 7\ 


240 


ATTTGCCACT 


TGTTGGCACC 


GTCAATCATA 


GCCGATTCGT 




rcn zv a t czcztcz 

l/Uunn 1 uu i o 


300 


TCGAAGAAAC 


CTTTGGCGAT 


GTAAAAACCT 


AGCGCCGAGC 


CTGCAGAGTA 


AACCAGAATT 


360 


AAAGCGAATA 


AACTTtCAGT 


TAAATTTAAG 


GCTTTTAAAA 


TATAGTAAAC 


GGCAATCATA 


420 


CTCATAAAAC 


CAGGGAACAT 


ATTTAAAACG 


AGCGCCACTT 


TTAAAAATGG 


TTTTCTTAAG 


480 


CGGAAACGCA 


ACCGTGACAG 


CGCATAAGCC 


ATTGCGATTG 


TAATGAAAGT 


TGATAACAGA 


540 


CAGCTTGCGA 


TGGAAACGAA 


CAAGGTGTTC 


AAAAACCAAC 


GGACAAAGGG 


GAAGTTGCCA 


600 


GTCTGATTTG 


TTAATAATAC 


TTTATAATTT 


TCTAAAGTGA 


ATGTTTTGGG 


AATGATATAG 


660 


GGAACAAATG 


CACCGCCTTC 


GCCACGAAAA 


CTTGTTAAGA 


TAATCCACAC 


AATGGGGAAC 


720 


ATCCAAAAAA 


CTGCTAAAAT 


GGTTAGAATG 


ATGTAAACGC 


TGGTAAGTGC 


AGTGCGTTGT 


780 


TTTTTCTGGT 


AACTTTTTGC 


TTTAGCCATA 


AGTTACGCCT 


CCTTAAATGA 


ATTCGTACGT 


840 


GTATAGGCAA 


TTAAACTGAA 


GACTGCGGAG 


AGAATGAAGA 


TTAAAATACC 


AATCACAGAG 


900 


GCTAAATTGT 


AATCCATTGT 


ATCGACAGTC 


AATTTGTATA 


GCCAGGTAAC 


TAAAAGGTCA 


960 


GTTGAGCCTG 


CACCGTAGAA 


GTTAGAATTG 


GCTGGCCCAC 


CACCTGTTAA 


TAGATAAATG 


1020 


ACATTGAAGT 


TATTGATGTT 


GCCAATAAAC 


TGTTGAATCA 


GCGCTGGGGT 


CATCACAAAT 


1080 


AGAATTTGTG 


GAAACGTAAT 


ACTTTTAAAA 


ATTTGCCATT 


TATTGGCACC 


GTCAATTCTA 


1140 


GCTGCTTCGA 


TTTGATCTTC


AGGAAGGTTT 


TGGATAATCC 


CTGTAGAGAT 


TAACATCGTA 


1200 


GCAGGAATAC 


CAATCCACAT 


ATTGACAACA 


ATAATGGTTA 


ATTTTGCCCA 


ATTCGGATCT 


1260 


GTTAAGAAGG 


GGATAGAAGA 


ATCGATTAAG 


CCCCAATTTA 


ACAACAGAGC 


ATTGATAGGT 


1320


CCAGAACCAT


TTAAGAGGTT 


TCGCATAATT 


AAAAGTGAAA 


CAAACGGAGG 


TACGGCCATC 


1380 


GTGATAACAA 


AAATAGTCCG 


CCAAAATGCT 


TTATATTTTA 


AGCCTTTTGT 


ATTGATAAGC 


1440 


AAGGCTAAAA 


TGATTCCAAA 


AAAGAAACAA 


GTTGCGGTTG 


CGAAAACAGC 


CCAAATCAAG 


1500 


GTCCAGCCAA 


GTACTGGAAA 


GAACGTTGAA 


GCCATATCGC 


CTGTGATTAC 


ATTACCTAGA 


1560 


TTTGCCAATC 


CAACCCAAGT 


AAAAAGGTTT 


TTCGGTGGCA 


GATGGTTATG 


GTCGTAGTTG 


1620 


GTAAAGGCAA 


TTGAAATCaT 


ATACAACAAC 


GGCAAGATGG 


TAAAGAAAAG 


AACGCCTAGT 


1680 


AAGGGGATCG 


TCATTAAAGT 


TGCATGGAAA 


CGTTCGTTTA 


ACAAGCTACG 


TAAATCATCC 


1740 


ATAGTTGTTG 


GAATTTTTTT 


GCCGGCTGTC 


TTTAATTCAT 


ATAGGTGACG 


GGCGCTTTTT 


1800 


AAATTGATTA 


CATATAAAAT 


GATTAGCGAT 


AAACAGACTA 


AAATTGCTGC 


AATTCCGAAT 


1860 


AATAATAAGA 


GCATTGAATT 


ATCGCCTTCT 


TTTAGAACTT 


CAATCCCTAA 


AGAATCATCA 


1920 


TAtACTAATC 


CTTGCTGTTG 


GGTCCCTAGT 


GTTCCTAGCA 


TCATTAATGC 


GTGGAAACCA 


1980 


TTACGGATTA 


ACCAATAGAT 


AAAGC 








2005 



(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 
GGTCCACCAA TATATTTAGG ACGAGAATAT AATGCTGTAA CATGGGCCGC ACCTAACGTT 60 

CGAATGCTCA AGCGAAGAGG AACTTGCACA TGTTTTACAT GCATACCGAT AGATGTGTCG 120 

CCAATGTCAA TCCCTGCTTG AGCAACAATA TGTTCAACTT CCACAGGTTC ATCAAATTTT 180 

TCAAAGGCAG CCACTGAACA AGAACCGCCA GCATGTAATT GTGGTACTAC CGAGACAATT 240 

TCAAAGCCTT TTTCTTCAGC CACACGACGT TCCATCGmTA AAGCACGATT AATATGTTCG 300 

CATCCTTGGA CAGCCAAAGA GATGCCTTTT TTATCTAATT GTTCAATTAA GGTTGAGACA 360 

ATCCATTGTC CTACTTCGGC ACTGGAATTT TTACCAATGA CGCCACCTAC TACCTCACTG 420 

GTGGTACAAC CCAAAACGAA TAGGTCCCCT TTTTTTAAAT TTCCTGCTGT AATGACTTCT 480

TCTGTAATTG TCACTAACTG TTCTTnTAAC ATTTTnTCAT CGATAGTCA 529 
(2) INFORMATION FOR SEQ ID NO: 471:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 
AAGTTTTAGT CGTATCCACT TGATTATCTG GAATAATACT TTCTACTGTA AaCCCCATTT 60

TTTCTTCTGT TGAAGTAyTC TCTGCTTGAA CAACTGAAGG AAAGAAAATA GATAATAGTA 120

AAAACGCTAT AAACCCCATA AAAAATTTTT CTCGTTGTTT CACCTACATC CTCCTAAAAA 180 

AGGAGAGGAT GCATGATCCT CTCCCAAAAA ATTTACCGAT TAGATCGTAT CATTCAATGA 240

CCAAGTCAAT GTCCCACTGT ATTGTTGATT AATCTTCGCT GCATTCGCAG GAACTTCTAG 300 

TTTAACATTT GACATCTCCA TAGCTGTTGA TCCTawGAAG cTAGCGCCGC CAGTTCCAGC 360 

ATCTTTCATA ACAGGCACTG CAGCAgCTCC AGCAGTCACT GATTGTGAAG AAGCATTAAC 420 

GGTTGCCGTA cGTGCTCCTT TATCAACAAT T 451
(2) INFORMATION FOR SEQ ID NO: 472:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 

TTCCTTGATG CAATACGCTT TCTGCATCAA TACGTCCTTT AyAGTTyTCG ACTAATTGTT 60 

TyGCAATCGA TAGACCTmAG CCATTACCAC CTTTATTTCT TGCACGGGCT TTATCTACTC 120 

GATAGAATCG ATCAAATATT TTCTCCAAGT CTTCTTCAGT GATTCCTTCA CCAAAATCTT 180 

GCACAGCTAT TTCAAATTCA TTCATCGTTC GAGAAATGGA AATATGCACT TCTTTTCGAT 240

CGGTTGAATA TTTAATCGCA TTATCTAATA GAATAATTAG CAATTGTTCA AAGTGATTAC 300 

GATAGATTTT CAGCTCAACT TCGGTTGGTA AATCGTCATC TAATGTGATA TGAAACTCAG 360 

GATAAACCAA TTGGAAGTTA TTAAATACTT GGTAGACTAC TTGTTTAGCA TCTGTTCGTT 420 

CATTTGCATA TTGGGTGTCC ACTTGTTCAG CGCGTGAAAG GTCAAGCATT TCTTGGACCA 480

AACTCTTCAT ACGACTAATT TCTTGTAAAC TGGCCTTTAA TGATTcATCT AAAATTTCAG 540

GATCGTCTTT CCCCCAACGA TTTAAAAGGT TTAAATGGCC TTCCATAATC GCAACGGGCG 600 

TTCTTAATTC ATGGGAAACA TCTTCAACAA ACTGCTCTTG TTGTTCGATA TAGCGTCTCA 660 

TACGATCTAA CATTTCATTA AAGATTTCCG AGATATCTGC TAACTCATCT CTCGTATTAA 720 

TCTCCGGCAT GTGGACATCT GATTGCGGGT CTTTGCGGAT TGTATCCATC GTGTCTCTCA 780 

ATACTTTTAA TGGTTTTAAG AAATAAGAAG AGAGGATAAA GCCTAAGACA CTACTTACAA 840

TCAGAGAAAT CACTTCTAAA ACTACAAGTG TTAATAATAA ATGATTACGA ATTTCATAAA 900 

AAGAAGAAAG TTCATAAAAC GCTTGAATAT AGCCAATCTT TTCTCGTGTT TCTTTTGAAA 960 

AAATAGGTTC CACGGAATAA AAACCGGTTT TATCAAAAAC GGTTCGAACA ACAGGTAATT 1020 

GTCGATCCAA TTGTAATAAC TTATCATATT CATTTTGTGT TTTAAAAACT AGCTTTTGGT 1080 

TTGTGTCATA CACAGATAAA TAAAGTTCTG GTTGTCCTAG TTCAGATATG AAACTGTCCA 1140

TTTCCATAAA CGAACCTTCT ACTGCTGTAT GCTTATTATA GTAGTTTTCA TCTCTTTCAC 1200 

TCGGCGTTTT TAAATAGTCA AAAACGTCTG TTACCGTTAA ATTTTCATTG GCATTAGCTA 1260 
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ATCGATTTGT TACTTCTGCA ATCGTTGCTT CGACATTTTC CTTTTCTTTG GcAACAATAA 1320 

G ACT G AC AG A AGACTTATAG GtAATCACCG CAAAAATGGt AAAAACTACA AATATAAAGA 1380 

AAGAACTTGC GAAAGCCCAC TTTATAGTTA AAgATGGGcC TTCCAGTTCT TTTTTAATCG 14 40 

TtCTTTTCAT nAATTAGTTC TCACGAAcGC ATAAcGTAGC CAGTTCCAcG GACAGTTTGG 1500 

ATGTAGCTTT CTtCTCCAGG TACGTCAATT TTATTTCGTA gTAGCGGATA G 1551 
(2) INFORMATION FOR SEQ ID NO: 473:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 



TTCGTGAATT 


TTTGTAGCTy 


TCAATTTTTC 


CtAAACGTCG 


TAGCCtATAT 


TGCTACCTTA 


60 


AGTATCTCCC 


TGACCATCGG 


ATTGAATCTA 


CTGGTACCTT 


TTTACTTAAT 


TTATACAAAA 


120 


TCATCGCAAT 


TCAACCCCAA 


AGGCTACTTT 


ATCAATTTAA 


TGACTTTTAC 


CTTTTGCCAA 


180 


TTTCTCCCAA 


TTAAAATGAC 


TTTTATCCAA 


CAATCTAAAG 


CATTTTTAGC 


CGGATTACTT 


240 


CTTTTTTTAA 


TCGCAGTGAA 


ATTCTATAGC 


TTTGTTTCTC 


AATATTATGC 


AAAAAGAACA 


300 



TCTGAAAAAA 


ATGAGTTTAG 


CTTTTTCGCT 


GATTTATTGA 


ATTCATATGG 


TAACGAACAA 


360 


GAATATCAAG 


CTAACTTAAA 


CAAATTAGCC 


AAATTGACTC 


AACAATTGTA 


TTTCCAAGcA 


420 


TATCTTTCTT 


CTAGTTCTAA 


AAACAAAATT 


ACTCCTAAAA 


CTCGTATTGC 


ATTCAACTAT 


480 


GCCTTTTTAT 


CCCAACGTGC 


TCTATATTTT 


TTTACAAGTA 


TTGATAAATT 


GGTAAAGCCA 


540 


GATTCAATTA 


TTGAAAAATG 


GATGACAAGA 


ATTTCTGATT 


TTTTTGAAAA 


GTCCAGTCAG 


600 


TTTACTACGA 


TGTCAGTTAT 


yGAAAAAAAT 


GAGTTGAGAA 


TACAAGGGAA 


AATACTTATA 


660 


ACTTCTTTAG 


AACACTCCGA 


TGAAGTAGCA 


CTGATACTTT 


TAAGAAACCT 


GCTTATTCAC 


720 


AGTTTATTTA 


TATTACGTTT 


ATTGGATAAT 


GATCCAGAAA 


TAAGCAATAA 


TCATTGGTCA 


780 


ACGCCTCTCC 


mTCaCCrGAC 


CAAATATCTT 


TyCCAACACA AaTTTAGaAT 


tGATTCaTTT 


840 


GaATTAcGTG 


TTcGCaCtAC 


GTTTCaGTAT 


CGTTCTGGCT 


TTCTGTTTTT 


CTTTATGCAC 


900 


ATTGACTCAT 


ATTACTCGTG 


GAACTTGGTT 


ACCGATGAAT 


GCTTTCTTAC 


TGCTTCAACC 


960 


AATGAATGAA 


GAGACTAGTT


ATCGCTTAAA 


GACTCGTTTT 


TTAGGAACCA 


TCTTTGGATG 


1020 


TTTGATAAGT 


ATCTACACTG 


TGCAGCTCTT 


TAGCAACAAC 


TTACTCCATC 


TAATTTTGGC 


1080 


TAGCTTATTA 


GGGATCTGTG 


CCTTTACGAT 


AATTCCAGGA 


ACTACATTTC 


AAGCAACTTT 


1140 


ATCAACTATT


TTCGCATTGT


TCCTATCATC 


ATTAGTTATG 


TCTAATTTAA 


TGGCCGCTGG 


1200 


ATTGCGCTTT 


GTTTATGTTT 


TATTAGCTTG 


TTCATTAGTT 


TTTCTCGTCA 


ATCAATTCAT 


1260 


TCTTCCAACA 


GGCATAAAAA 


ATCAATTTCG 


ATTTAATTTA 


CAAGAACTTT 


TTCATATTCA 


1320 
TCAGACTAGT


TTAATATTCC 


TTCAAAATTC 


ATTTCATCAC 


AAGATTGATT 


ATGGcACAAT 


1380 


CGCTGACTTA 


CAAATGaATT 


ATCACTTACT 


CCATGGTGAA 


ATTAAGAATT


ATTTCGGCAA


1440


AAGCGATAAA 


TTGAAACAGA 


TATCCTACAA 


AGAATTTTTA 


TCTATATCTT 


GGCAAATGAT 


1500 


TACTATAGTT 


GAACAACTTT 


TACTACTTAT 


TGACAGTAAA 


AAAATTACAG 


AAAATGATAT 


1560 


TACAGTTATT 


GAAAATACAA 


TTGACTCTCT 


ATGTTACGTA 


TTGTCTCTAA 


TCCAAAACTT 


1620 


GTTAGACTGC 


AATTTAAAAA 


CATTAAAAAT 


ACCTGAAGTA 


CTTTTTAAAC 


GTGaTTTAGC 


1680 


AATTGaTCCT 


GAAATAGCGT 


ATCTGckATT 


TCaATACTCT 


aAAAAATTaT 


CAGCmATGtA 


1740 


TCGCCTCGTA 


CTTTGnATTG 


TCATTCAATT 


AAATTTCGAA 


AATrTTCGAT 


AATCTACTAT 


1800 


CTAAGGGAAT 


TATAAAGGGT 


ATTCTTATTA 


AAGAATGAAG 


AATACCCTTT 


AATTTACCTT 


1860 


TTCAAAATCT 


ATCCTTTACT 


CTTTTATCTG 


TAGTGCTAAT 


ATAAATGATT 


GTAGAGATAG 


1920 


TTTGTATATA 


AGGATATTTT 


TGTTCAGCTT 


AAAGGAGTTT 


TTACTCTATT 


TAATGATTAC 


1980 


TACATACAAT 


TATAGAAGTA 


CAAGTGATAA 


CaATAAGTCT 


ATACGCATAG 


GrAATAGyCA 


2040 


TyCATTAGCa 


AkGmCTTCCC 


ATTTTTTTGC 


TTTTTGTAAT 


ATTAAAACGT 


ATTGAAATTT 


2100 


TGATGTAATC 


TTAGATGTAG 


GATCAAkaTA 


TTTAACTAAA 


ATATTTGTCT 


TTATTGCTTG 


2160 


TCCTTTTTTC 


TGATAATTGA 


TACTGCTAAT 


TTCAACAAAC 


TTTAGATTCT 


TATAAATTGg 


2220 


CTCCAACACA 


TCTTTTTTTA 


CATAAtATCC 


tAGTTCTTCT 


TTAGAAGCkG 


TTGGATATAA 


2280 


TTTAAATAGA 


ACCTCTAAAA 


ATTCTTCAAT 


TTCTTCTTwG 


TkCTTTTCTT 


TAATCATCGA 


2340 


ATCATCTGTA 


AACTGCTTAG 


TTTTATAATT 


TGCTTTAACA 


GGTTTACTAA 


TCATTGTTGG 


2400 


CAAAGAAGTC 


ACAATACAAT 


TTCCAAGTCT 


ATCTTCAAAA 


ACAGAAATTC 


GGTATGCTGT 


2460 


TTTTTCTTGT 


AAAGCTTTTT 


TTTCCTCTAC 


TATTTCTTGA 


TCCACCAAAA 


ATGATACGAC 


2520 


GAACCATTTT 


GACTTTTTCT 


CAACATTTAG 


AATTTTCACT 


GAGTTCACCT 


TAGAACTTGT 


2580 


TGGTATATCT 


GCTCTAACCA 


TATCTTGACT 


TAAATCTAGT 


ACATCTTTTG 


TAAGATAATG 


2640 


CCCTAGAGAA 


CGTATTCTTT 


TTTCAATTGC 


TACTCTATCA 


TTTTTCCAAG 


AGAAATATTC 


2700 


TTTGGgCAAA tCTACAGTAA AATTTTCTAC ACCTGAGGgT 


ATTCACAATT 


TtTTCCTTCA 


2760


CAACCACTTT 


CTCATGAGTT 


GTATGTTGAT 


CAATAGCGGT 


GAAATTTTTG 


TAGATTCCAA 


2820 


AGGATAAACT 


ACTAAGAAGT 


AATAACCAAA 


GGACAAGCAC 


CATTTTTCGA 


TGCTTCCCCA 


2880 


TTGAAACAAC 


TTGTTCTTTT 


TTCTTTCCCG 


AAATCTTCTT 


TTTATTTCGT 


TCTATTTTTA 


2940 


TCTTCATCAT 


CTGTCCCCTT 


TTCTATTTTT 


ATACCGACCA 


GCACCAATGA 


TATGTTGCTG 


3000 


CCAATAAGTT 


TCGGTAAGAT 


TAGTCCATCC 


AACGGGATCA 


CCTGCATGGA 


ACATCCGATT 


3060 


GTCTCCAGCA 


TAGATACCTA 


CATGTGTCAC 


ATAAGTTCCT 


GCATCATACG 


TTGAGTGAAA 


3120 


GAAAATTAAA 


TCTCCTGGCT 


TAGCATCTTT 


CAACTCGATA 


TGAGACATAG 


CATCATACTG 


3180 


ATCCTGCGCG 


ACTCTGGGAA 


GTTGCAAACC 


AACTTTGGCA 


TAACACCACT 


GCGTTAATCC 


3240 


GCTACAATCA 


AAACCTGTTG 


CTGGACTTGA 


ACCTCCGAAC 


ACATATGCCG 


TCCCTTCATA 


3300 
PTTAAAAGPT


TPATPAAAAA 


TTCCTTGAAC 


GGTTTTATCA 


TCAAAtTCGT 


TGTAGTAAGA 


3360 


T ATTGTTTPA 


PTAACATTAC 


ATAAAACATA 


TTTCCATAGT 


TATAACGCCA 


ACCACCATTC 


3420


TPPTTAATAG 


CGATTGGATT 


CGAATAGGTC 


ACTTTTTCCC 


CATCTGATTT 


TTCTTTTGAA 


3480 


AAAGATTPPG 


PTAAGGCAAA 


GCTGTACTTC 


TTTCCATTTT 


TTCCAACATA 


GTCTAAATAT 


3540 


c c li p p a r* p a t 


AGTTATAPGA 


TTGAATGATG 


CTCTCTTGaT 


CACAACCGAT 


ATCCTTTGCT 


3600 


r* /"* a t t ci zi P T A 


AGGAAGCAAA 


ATAAATGCAC 


CCTTGCTTGa 


TTGAAGCTTC 


GGTGCTTAAC 


3660 




P^GnAATPPPG 


CTGGATTCnG AACTTTGGCA yCACATCTTC 


CaTCGTACCG 


3720 




PPTPa ATA a T 


gcTAGAAGaA TCGGaACATA 


CTCTAAAtAT 


CCGaTkCTTT 


3780 




TrTAPAAGTG 


GaCGATGCTT 


TAAAACTTCT 


AAGrAAACAT 


TACTCCCTCC 


3840 


AgAGGACAAT 


CCAGTACTGC 


TATTTCCTTC 


ACTATCGTCA 


CCAAATAATA 


GACCGACACA 


3900 


AATCAAAATT 


CCCATAAACA 


AAACCGTTAA 


AAGAACTAAT 


AGAACGGTAC 


TTAACTTTTT 


3960 


GATTGTCATG 


CCATATCACC 


TGTCTTTACC 


TTCTCCCGGT 


TTATAGAATT 


CTTTTGATAT 


4020 


TGCTTCTTTC 


CATTTTTCGT 


TAGAGTGGAT 


TGATTTTTCA 


TTTTTTTGAT 


ACTCATCATC 


4080 


TTCGGTGCAA 


TCTTAGGATA 


AATGGTTTGT 


AGGGAATCCT 






4120 



(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474: 



AGAAAGAAAA CACAATTGTC TGAAnCGTTT TACGCACAGA CGACACAGTT GTTTTGGAGa 60

AAATGAaCAC AACAACTGAT GGATTAAGTG CACAGGAGGC TCAAAAGCGT TTAGAAACGT 120

ACGGGGAAAA TGTGCTAGAT GAAGGGAAGA AAAAATCACT TGCCGTGAAA TTCTTTGAGC 180

AATTCAAAGA TTTTATGATT ATTGTTTTAT TAGCGGCAGC GGTCATTTCA GCCGTCTTTT 240

CCCATGATGT GGTAGACTCG ATTATCATCT TAGTGGTTGT TATCTTGAAT GCCATCTTTG 300

GGGTCATCCA AGAAGCCAAA GCGGAACAAG CCATTGAAGC ATTAAAAGAA ATGTCTTCAC 360

CAAATGCTAA TGTTCGCCGT GATGGACACG TCATTACTGT TAAAAGCGAT GAATTAGTGC 420

CAGGTGACAT TGTTTTATTA GAAGCAGGGG ATGTCGTTCC AGCCGATTTA CGTTTATTGG 480

AAGCGAATAG CTTAAAAATT GAAGAAGCAG CGTTAACCGG TGAATCTGTT CCAGTTGAAA 540

AAGAAGTAAC CATCTTAGAG GGAACCGATA TTGGTATCGG TGACCGAATC AATATGGCTT 600

ATTCCAATAG TAATGTGACG TATGGACGTG GTCTTGGCGT TGTGGTTGGC ACAGGAATGA 660

ATACAGAAGT CGGTAAAATT GCTGGCATGC TAGCAAGCGA ACAAGAAACG GAAACGCCTT 720

TAAAACAAAA CTTAAACCAA TTGGGTAAAA TGTTAACAAT TGCCATTTTA GTCATTGCAG 780

CGGTGATGTT TGTGGTAGGA ATGATGArTG GTCGTCATGG ATTGACATGT 830
(2) INFORMATION FOR SEQ ID NO: 475:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 



TTGTTCATTT TTTTTGAAAG TTTTGAGAAT ACTATAGATA GGAGTTTTTT TATGGAGGAA 60

GATATTGTAA GGnTGTTTTA TAAAGGATCT GAAATTATTT TAATTGGGAC CAGTCATATC 120

TCAGCTGAAA GTGCTGATTT GGTCAGAAAA ACAATTCAGG AAGAAAATCC TGACACTATT 180

TGTATTGAAT GGGATCAAAA AAGATACAAA AAAAACATAC ATCCAGACGA ATGGGATGAT 240

ACTGATATAG TTAAAATAAT AAAGAATAAG CAATTTCCAG TTTTTATTTT TGGTGTAATT 300

TACAAATTAT TTCaAAAAAA AGTTTCTCAA GATATGAATA GCCTTGTAGG AAAAGAATTT 360

GTAGTGGCTG TTGATGAGTC AAAAAAATTA AACATAAAGT TTTATCTAAT TGATAGAGAT 420

TCTTCTCTTA CTTTTAAAAG AGCATGGAGG ATGTTAAATT TTCGTGAAAA AGTGAAATTA 480

CCATATGCTT TTGGAAAAAT ATTTGAGGGT GCGGAAGAAA CTGAAGAAGr GGTTCAAAAT 540

CTGCTAGAAT CkGAAAATTT TGAACCTGTT TTTGAAGAAT TGAAGGAGTC CTATCCTAAT 600

TTGTGGGAAG TTTTCGTAAC TGAAAGAGAC GATTATTTAG CAACAAAAAT TCAAAATACA 660

GCGAATGGAA AGACTGTTGC TGTTTTAGGG AAAGCTCATT TAAAAGGGGT TTCTGACAGA 720

TTAAAGAATA ATCAAAAATC CGATTTACAA AAATTAGAGA TTATTCCACC CAAAAAATTT 780

GGAAGTAAGT TGCTTGAATG GATTATACCA GGTATATTAC TTATTCTTCT AGGAGTATCC 840

TTTTACCAAG GGACACAAGT GGGAATAGAG CAATTGTTAC GTTGGCTTTT ATGGAATGGT 900

GGATTAGCTG CATTGTTTAC AGTATTTGCT TTAGGACACC CTTTAACTGT TCTAACGTCC 960

TTAATATTTG CTCCTTTAGC TACTTTACTT CCAATGGTTT CTGTTGGAGT ATTTTCAGCA 1020

ATAGTGGAAG CTACTGTTAG AAAACCAAAA GTAAAAGATT TTCAAACGAT GGATTTAGAT 1080

CTTCAATCTA TTAAAACAAT ATACAAAAAT CGTGTTTTAA GAGTGTTTTT AGTATTTTTC 1140

TTATCAAGTC TTGGAGGGCT TTTAGGTAAT ATTATTGGTG GGTTAGGAAT AGTTAAAAAT 1200

TTAnTTTAAA GGAGTGAGT 1219


(2) INFORMATION FOR SEQ ID NO: 476: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:
ATATTTTTGA AACGTTTGTT TAGTTACAAC ACGTGTTGCT ACCACCAAGC CTTGGTAAAT 60

TAAATAGTAT TTGATTTTTT TCTGACAAGG TAACCACATC AGTTGCCAAT CTCGTTGGTT 120

AGCTTGACTA ATTCGTTGAT TTTGGTTTGC AAAGCGCGTC AGAAAGAGCC AATCTTCCTT 180

GAGTTTTGCC GCTTTTTCGA AGGCTTGTTT TTCAGCAGCC TCTTGCATTT GTTGAGCAAT 240

TCGTTTGATT GGTTCTCGTC CTTGATTATG GAAAAAGTCG AACAATTCTT GTTGAAATAG 300

TTCGGTCGGA ATGGCACTCT CTTTTTGAAA GGTTTGATGC CAGTAGTTGG TTGGGACGAG 360

ATCGTAGATA CTATCCAAAA TTATTTTTAA ACGGTTTAAT TTTCGGCGAA TCGAATAAGG 420

ACCAAAACAG TTCTCTTTAG TGGGAATATT TAAGAGTTTA ATCTCTAATT GCCGCTCATT 480

AACGGCTACC GATACATAAG AATATTGTTC ATAAGCATTC ATTTGGCGAT TGTAGCGCGG 540

CCGATACTGT TGAATCAGTT GACATTCTAG GAGGAGTGCA TCCAATTCTG TGTCAGTATG 600

GAGAACATCG AAGTCTATAA TTTGTTTGAC TAAACGCATC GTCTTTCGAG AATGTTGATT 660

ACTATTAATA AAATAGCTCG ACACGCGATT TTTTAATTTT TTGGCTTTAC CCACATAAAT 720

AATTGTGTCG GTTGCGTCTT TCATTAAATA AACACCCGGT GTTAACGGTA AtTGATGGGc 780

TTTTTCCTTT AAATATTCTT TCATAAGTtT CCTCTTTtCt TAAAAGGATC GGCTTCAAGT 840

aTACAAAGTT TTCTGAAATA GTCAAGTAAc AGAAATAGnC ATAGGTAATT TCACTGAAAA 900

ATAGTTTTCA TTTTTTTTAA AATATGGATA ATTCnAGGGA ATT 943



(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 



ATAGATGTCT TTTCTGTCTC TTTTTTTGTA TAATAAACTT ATCAAATGAT GTGTATCTTA 60

CAACTTTGTT CGTTTGCAGA AAGAACAAAC GATGTTTTTT ATTGATAAAA AGTTATATAC 120

TAAAAGTACG ATGAGTATAA CTAAAGGAGA TGAGAAATGT GAGTGTTACC CTATCTAGAA 180

AAATTGTTAA TATTATTTCC ATTTTGGGCA TTGTTGCAAC CATAATTGCA ACCATCTATT 240

TTATCCGATT AGGTGTCTTC AAGGACATCA ATGCACTAAG AGGACTAGTA GGGCAATCTG 300

TCATTCTCGG TCCGATTATT TTCATGTTAA TTCAGATTAT TCAAGTGGTT ATTCCGATTA 360

TTCCTGGTGG TGTTAGCTGT GCTGCAGGGg TcTTGATTTT CGGGCCAACA ATGGGCTTCG 420

TCtAtAATTA CGTTGGTATt GCACTTGGCT CTATTATCAT TTTCTTATTA GGTCGTAATT 480

ATGGGAAACC ATTCATTATG AGCCTTGTCA GTGATAAAAC ATATAACAAG TATATCGGTT 540

GGTTAGACAA TGAAAAACGT TTTGAACGAT TGTTTGCTTT AnCAATTTTC TTCCCGATTG 600

CACCGGATGA TGCGCTTTGT TTAATGGCCG GTTTAACAAA AATGTCTGTG AAAAAATTTA 660

CTCTGATTAT TTTATTAGCA AAACCTGCTT CTATCTATTT ATATAGTCTT GCGTTAATTT 720

ATGGCGGTAC ATTTTTAACT AGCTTACTGG GAATGTAAAA TAAATAAAAA AGCGAACGGA 780

CTAAATCAAA ACTCTGCCTG TGtCAGTTTg ATTTAGTCCy TTTAwTAtGG AATTTTGGtA 840

GAAcTGGGGG rctTTTGawG cTTAgtTAtt TTAAAAAtAG tATTGkGaAa CgCCCAGATT 900

TTCCTTAA 908 



(2) INFORMATION FOR SEQ ID NO: 478: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 



CGAACGAGAT TTAAAAAAAT nACGTAGCCT TTTTACAAGA ACAAGGCCTG CATTCTTGGG 60

ACGAAGTTGA CCGTTACATG ATTACAGAAT TTTTGCAATC CCTTCACGAA GAACAGCAAG 120

CTTCAGCaAG TGTGATTCGG ATGATTTCTA GTTTGAGAGG ATTTCATCAA TTTTTACGTC 180

AAGAACgCTT AAGCGAACAT AATCCGATGC AACATATTGA CTCCCCCAAG AAAGTTCAAA 240

AATTGCCAAG TACATTATCT GTTGACGAAt GACTCGATTA ATTGAAACGC CTGATACAAG 300

TAAACCACTA GGTATGCGGA ATCGTGCTAT TTTAGAAGTG ATGTATGCAA CGGGCCTGCG 360

AGTTAGTGAA CTAATTGAAA TAAAATTAGG GGATCTGCAT TTATCCATTG GTTTATTGCA 420

AACAATCGGA AAAGGGGACA AAGAACGAAT TATTCCTTTA GGCGATTATG CGATTCAATG 480

GATTGAGCGC TATATGAACG AaGCCAGACC TCAATtGATT AAAAACCATC CGAATGAAAC 540

ACACCTGTTC GTTAATCATC AlcGGCGAACC GTTATCTaGA CaGGGCATTT GGAAAAATTT 600

AAAAC 605



(2) INFORMATION FOR SEQ ID NO: 479: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 955 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 



TGGCGTCGTT CTTACGATAC TCTTCCTCCA TTAATGGAAG CAACAGATGA AGGTTCTGCA 60 

GCAAACGACC GTCGTTATGC AATGTTAGAC CAACGCGATA TTCCTGGTGG CGAAAACTTG 120 

AAAGTTACTT TGGAACGTGC ATTACCATTC TGGCAAGATG AAATTGCGCC AGCTTTAAAA 180 

GACAACAAAA CTGTTTTAGT AGCGGCTCAT GGTAATTCAT TACGTGCTTT AGCTAAACAC 240 

ATCGAAGGCA TTTCTGATGA AGATATTATG GATCTTGAAA TCCCAACAGG TAAACCATTA 300

GTTTATGAAT TAAACGATGA TTTAACTGTG AAAGAAAAAT ACTACTTATA AGAACTTGTT 360 

TCTTATTAAC AAAAGCGGTT TGCCAAGGCA AACCGCTTTT TTATGGgATT AAATAAATtG 420
ACTTtCGGkT AAACmATCTT TATAAAyCGC TACTAwAtCC GCTGCATCCA ATGGTACATA 480

AGCCTTTTCG GCAATTGCAC TATGGTCCAC AGCTTGTTGT GCCATTTGTT CAAATTTATC 540

TGCTTCGATG CCAACTTCTG GTAATGTCAT TGGAATGCCA CAAGCCACAA AGTAATCATA 600 

CAATGCTTGA ATACCTTTTT TCGCTGCTAC TTCCTCCTCT TGTTCGACTA TTCCCCAAAC 660 

ATTTCGAGCA AACTGTGCAA ATTTGCCTAC GGTTTGTTCA GACAAAACGT AATTCATCCA 720 

ACGAGGCGTT AAAATTGCTA AGCCAATCCC ATGCGTAATG TCATAAAACG CACTTAATTC 780 

ATGTTCCATT GGGTGACAAG ACCAAACACC TTGTTTGcCG CGCCCTGTTA AACCATTtAA 840

GGsTAAACTA mTTGsCCACA TTAAATTAGC ACGTGCATCA TAATCnTCyG GGATTTCyaA 900 

AGgCAaTAGG rCAATTTTga TAACsGTcGs ATtAAGCCTt CTGACACnAA ATCTT 955 



(2) INFORMATION FOR SEQ ID NO: 480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480: 



TTGGCTTTTA TCTGTGTTAT ATTTTTTTTA GTTACTTTTA CTATTGTGTC ACTACCATTA 60

ATCACTGCTC TTAAGACACA AGGATTTTCT TCTGAAGAAT TAATTGAATT AGCTAATTTT 120

GATTTAAATG TTGATATCCC TTTTTCTGAA CTAAACGTCT CAATTATTTy AATTAGTTTT 180

TCTGGTGCCG TTATTGACGG AAGTATGGCT ATTTGTAGTy CTACTTACGA AATATATACG 240

AAGAACCCAG ATTTAACTTT TAAAGAACTA TTTAATTCTA GTTTTAACGT AGTAATTGAG 300

GTATTAAATT CGACTATATA CACTCTCTTA TTTGCTTTTA TAGCATCTAA TTTTGCATTA 360

GTAATTTACT TACAGGATTT AAACTATTCA TTTTTAGAAT TAATTAATTC TAAAATTTTT 420

GTAGGGGAAC TATTAGTAAG TATTTTAACT GGTTGTGCAG GAATTTTAAT TTTACCTTTG 480

TCCTGTTTAC TAGGTAGCTG GATATTAAAA AAAAGATTCC ACACTTTGTA TTGAAACCCa 540

AAAGCTAGGC CGAATACCTT TTATAGAAAT ATATTTGGTA TTTTTATACA ATTAACCGTT 600

CACTTATATT GAACAAAATT GCAGCCAAGT TTCGTTTTGA TTTATTGGTG ATTGAAATAA 660

TTAAATCCAT TTGTTATGTA AGTCTTGATT AGTGTGAAAA AACACCCTCT AATCCAGTCC 720

GCTCTTTTCC AAATACTTAA ATAAAAGAAT CCACAACTAA TTTATCTGTT ATTCTATGTA 780

ATATACTCAA CAAAATTCAC AGTAACTTAA CTATGACTTT TCAGTAAAAA TCAATTGTGC 840

ATACTGATAC AGTAAGTTTA AACTTTTATT GTAATAGTAT AATTCTTAAT TTATATTTTT 900

TACTTCTATT CCTAATTGAT TTATTCAAAA TTATCACAAA ATCGTAAAAG TTAAAATTGC 960

ATATTCTAAC ACAAGAAAAT CAAAAAAAGA TTTTCTTGTG TTAATATTTT TGAATATACT 1020

AAAGTTCGGA TAAGTAGAAA ACACCCTACT GATTTATATA ATAGTAGGGT GTTTTGTTAA 1080

TATAATTTAC TTAAATTCTA TAACTCACAG TCTATAATCA CCTTTTATTC ATTTAAATTT 1140

AATCCGATAA CATACTTAAC TCTACGTCAA AGTATAATTT TACCACACGA ATCACCATAT 1200

ATAGTTGTAT AACTATTTAA GATGACTTCA TTAACTTGTT CAGGTTTAGT AAAAACAACT 1260

ATGACAGATG GATCTTTTCC TGCTTGTTTA ATTTGATCAA AATCAACTTC GGCTAATAAT 1320

GTTTCGGTAG TAACTTTCTC TCCTGCATTA ACAAAAATTG AAAAAGGTTT ACCCTCCAAT 1380

TCAACTGTGT CTATACCAAT ATGAATTAAA TACTCAATAT CTGCATCGCT TTTTATACTA 1440

ATTGCATGCT TTGTTGGAAA AACAGCTTCA ATTCGTCCAG TCAAAGGAGC ATATATTTTG 1500

TTACCTTCGG GAACAACTGC AAATCCTTCT CCAAGTACTT TTTTTGAAAA TATTGGATCA 1560

TTTACTTCTG AAATAGGAAT AATCCTCCCT GAACAAGGAG CAGTAATGGA AATTCCTTCT 1620

TGTTTTTTAT TTTTAAATAG ATTAAACATA AGATATCTCC TAATTCTCAA TTTCTAAAAT 1680

TTGATTAATT TCTTGACTAA GTAAATCAGC TTTACCGCCA AAAACTGCCT GAATACCACC 1740

ATTCACTTCA AAAACAGCAG TAGCACCTAG TTGTTTAATA GTTTCTTTGT CAACCATTCG 1800

GCCATCTACA ACAGCTAAAC GTAAACGAGT AGCACAAGCT TCAACGGgAT TCAATATTGG 1860

TTTTACTACC AAGAGCTTTA ATGATTATTT GAGCTTCATC ATTTAATGAT GTTTTTTTAT 1920

TTAATTTATT TTCAGTAGTC GTTGGCAACT CTTCTCCCAT TCCAGGAATA GCTACTTTAA 1980

ATTTCTTAAT GAAGAAAGTA AATACGCAGA AATAAACAAT TGCCCAAGCA ATTCCGAAAG 2040

GAATTAATTT CAGCCAATTT GTCTTATCGT TTCCTTGGAG TACACCAAAT AATAAGAAAT 2100

CAATTAATCC ACCAGAAAAT GAATTTCCTA CTCGGATATT TAAAATATCA GCAAAATAAA 2160

AAGATAGACC ATCTAAGAAT GCATGAATTA TATATAACCA TGGGGCTACA AATAAAAACA 2220

TATATTCAAC TGGTTCAGTA ATCCCTGTTA AAAATGAAGT CAAGCCACCA CTAAAATAAA 2280

GTCCACCATT CTTTTTACGA TTTTGTTTaG GTATAGCACG ATACATAGCA TAACAAGCTG 2340

CTGGTAAACC AAACATCATT GTCGCGAATC TACCTGCAAA GAAACGAGTA CCATAAGTAA 2400

ATAGTCCTGT ATGGTTTGGA TCAGCTAGTT GCGCAAAGAA AATATTTTGT GcACCAGAAA 2460

CAGTAACTCC AGCTACACTT TCTGtaCCTC CTAAAGACGT wTACCAAAAA AGTGGATAAA 2520

TAGTATGATG TAATCCTACT GCGCCAGTCA AACGAAGTaA AAATCCATAC AAAAAAGTAC 2580

CTAGACTTCC CATTTGTGCA ATATGTTCTC CAACTACTGC TAATCCATTT TGAATTGGTG 2640

GCCAAATTAA ATAAAAGAAA GAACCGATAA TAATTGCCGC TAAACTAGAA ATGATTGGAA 2700

TAAAACGCGA TCCGCCAAAA AAGCCCAAAA ATTGCGGTAG TTCAATTTTC CGATATCGAT 2760

TGTGTAGAAA TGCAACAGTT CCGCCAATTA CAATAGATCC AACAACACCA GTATCAAGAG 2820

TTGCATCTTT AGCAGAAAAT AACGCAAGAA AGCCGCTGAT CGTTGCTGTA TACACTAAGT 2880

AAGATACTCC ACCGGCTAGA CCTGCTGTTC CTTTATCCCC ATTTGCTAAC CCTACAGCAA 2940

TACCTATAGC AAAAATCAAT GCTAAATTAG TAAAGACAGC ATTTCCTGCA TAACTCATAA 3000

TACTTAATAC TGTATGGATC CATGGCTGTT CTAAAATTGG ATACGCATTC ATTGCACTTT 3060

CATTTGTTAA AGCACCACCA AGGCCTAAAA GTAAACCTGC TACTGGTAAG ATTGCAATCG 3120

GTAACATAAA TGCGCGACCT AATTGTGAAA ACTTCTTAAA CATCTTTTTC CTCCTTATTC 3180

AATGCATCGA CAAATCTTTT AGCAATTTCT TGTGGTCGTG TAATTGCACC ACCAACGACA 3240

ATTCCTGCAA CACCCAGCTT TTTAATTTGT CTTGCTTGTT CAGGTGTATG AATTTTCCCT 3300 

TCAGCAATAA CACAAATATT TTCTTGTACT AATTGTTTAA TTAATTTCAT ATCTGGGCCA 3360 

TTTTGTTTTT TGCTCTCCTT TGTATAGCCA CTTAAGGTTG TACCAACAAA ATCCACACCA 3420 

GCTTGATAAG CATTCTTTCC TTCCTCAAAA GTAGAAATAT CAGCCATTAA TAATTGTTGA 3480

GGATACTTTT cTTTACTTTT TCTATAAArG TATTAATAGA TAATCCGTCA kGACGTTaCG 3540 

TAAAGTACAG tCCAAAGCaA TTACCTCCaC AcCAATTGcA cTAATTCAcT ACTCCCCCAT 3600 

AGTG 3604 
(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 

GCGCATGGTT ATATGGGTTC GGCAGAAACG ATGTCTGTCT TTGCGAAAAT GTATCATGAT 60 

TGGGGCTACA ATGTGTTAGC GCCAGATGCG CGTGGTCACG GGAAAAGTCA AGGAGATTAT 120 

ATTGGGTTCG GCTGGCCCgA TCGTAAAGAT TaTGTCCAAT GGATTGAGAA AGTTTTAACC 180 

GAAAATGGTC AGCAAGAACA AATCACTTTA TACGGTGTCA GTATGGGTGC TGCTACAGTC 240 

ATGATGACGA GCGGGGAAAA ATTACCAGAT AACGTGAAAG CCATTGTGGA AGATTGTGGC 300 

TATTCTACAG TCAATCAGGA GTTGCAATAT CAATTGAAAG AGCTTTTCAA TTTACCAAGT 360 

TTCCCATTAG TTAATGTTAC TAGCGGGATT ACGAAACTTA GAGCTGGCTA TTTCTTTGGG 420 

GAAGCAAGCG CTGTTAAACA ATTACAAAAG AATCATTTAC CAATGCTCTT TATTCATGGT 480

GAAAATGATA CGTTTGTGCC GTTTAGTATG TTAGACGAAk TTTATAATGC AACCCAAGGa 540 

CCTAAAGAAA AATATGTCGT GcCTGGaGGC AGAGCATGGC GAAAGCTTTA TAATAAAAAC 600 

CCAGAAAAAT ATAAGGGAAC AGTAnGCGGC TTTTCnAGAT AAATAT 646
(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 
AAAGAAGAAT TAATATTATT ACTGAATGAT AAAGATTGTG CCTATGATGT ACGGATTCAA 60 
ATTCAACAAT TTCTTGAAGA AAGGTGAGTA AAGACAAGAT GTTCAGAAAC AAATGGAAAA 120 
ACAGTTCACT AATTAAAAAT GTGTAAAAAG GGTCTGATGA AAAAAGGAGA CATAATAGTT 180
ATTATCTTTT TAATAGCTAT CTCTTTTTCT CCATATTTTA TTTTTTTTCA CAATAATCCA 240
TTTAACTCCA AAAGTTTTGA CGACACTAAA TATGCTGTGG TCAAGATAGA TGGGAAAGAG 300
ATTGAGCGTA TAAATTTAGA TGATTCAAAA GAATTTATCA AAACATATTA TCCATCAAAA 360
GGGCAATATA ATACTATAGA AGTTAAAAAT GGGCACGTTC GTGTAAAAAA AGATAATAGT 420
CCAGATCAAA TTGCGGTGAA AACAGGATGG ATATCAGAAC CAGGGCmAAC TAGTATCTGT 480
ATTCCTCACA GATTCATTTT AGAAATTGTT CAACAATATT CTAAGGATtA TTaTATTtAC 540
TAAAATCATT TTTTGGCATA CATGCTAAAA AATGATTTTT TGTAAATATT TTATTGCATT 600
AATATCAATA TAGTTTAGTT TTATTTGGAA GAATGTAATA ACAAATATGA GATGGGATTG 660
TCCTTATTAA ATAACATTTA TAAATGTTAT ATTTTAAGTG AAAGTAATGA TTTACAGTTA 720
ATTAAATGGA GGTTTTATGA ATAACCTATT TATTTTtGTT TGCTTTTGTT TTATGTGGTT 780
AATGATTTTT GGAGTAATTT TATACTATGT TGGTCTAGTT AATCACAGAT ATATACATCA 840
TACTTTAATT TTGGGTTTAG TAACTATTAT TTCTGGTACT CTATGTTGGT TATTTGTAGG 900
ATACTCATTA AGTTTTTTTG GAAATATTCA ATATAGTATC kTTTATAGCC CCTTAGCTTC 960
TAGTGAAATT GTTTCTATAC TGATACAATT ACTTTTTTGC TTATATTCTG TAATTATGAT 1020
AATTGGTTCA GTATTAGAAA GAGGTAATTG GAAATATATT GTGTTGTTTG TTCCTTTGTG 1080
GATTGTATTT GTTTATGCAC CTATTTGCwT TTCGTTATGG GGACATGGAA ATTGGCTAGG 1140
GAAAATGGGA GTACTGGATT ATTCAGGCGG ACTAGTAGTC CATACTACCG CTGGAATAGG 1200
AAGTCTTGTT TTAGCAATAA CATCACCAAT ACGTTTAAAG AATTCATTGA TATTTAAATC 1260
ACAAGAAATG aTAGCTTTTG TAGGAATGCT GTTTATTACC TTAGGaTGGt TCGGaTTTAA 1320



PA 1322 
(2) INFORMATION FOR SEQ ID NO: 483: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 



GACCTGGTTT GGATCGGATT AATGCGTTAT TAGAAAAAAT TGATCATCCA GAAAACAAAG 60 

TGCCAACGAT TCATATCGCA GGAACCAATG GTAAAGGCTC AACCGTGACA TATTTACGCT 120 

GTCTATTAGA AGAAATGGGC TTGAAAGTCG GAACCTTTAC CTCTCCTTAT ATTGAATCAT 180 

TTAATGAAAG AATTGCGATA AATGGTCAAC CAATTTCAGA CGAACAATTG ATTACCTATG 240

TGGAAAAGTA TCAACCAATC ATTAAAGAAT TAGACCAGAT CACTGAAGTG GCTGGAATTA 300 

CAGAATTCGA AACATTGACA GGAATGGCCT TGGACTATTT CGTGAATGAA CAAGTGGATA 360 
TCGCCGTTGT


TGAAGTTGGC 


TTAGGTGGAT 


TACTTGATAG 


CACCAATGTC 


GTGAAGCCAT 


420 


TATTGACGGG 


AATTACGACA


ATTGGGAAGG


ATCACACAGA 


AATTTTAGGT 


GAAACAATTG 


480 


CTGAAATTGC 


CTATCAAAAA 


GCGGGTATTA 


TTAAAGAAAA 


AGTCCCAGTA 


GTAACAGGCA 


540 


ACATTTGTGC 


TGATGCGCTA gcAGTCATTG 


AAAAAGTCGC 


ACAAGAAAAA 


CAGAGTCCAA 


600 


J.1111 All 


TGGCAAAGAG 


TATCAAGTGG 


AATATCTACA 


TCCTGATACA 


CAATGGGGGG 


660 




1 1 1U1 i yun 


GAAATGGGCA 

O ruin ± tjvj\JV/n 


AACTGACAAA 


AATTAAAGTT 


CCCTTGTTGG 


720 




AGTAGAAAAT 


GCGGCGGTAG 


CTATTCAATT 


ATTTGACAAG 


TATTGCCAAC 


780 


I nLnrtv^n 1 i i 


GCCATTTAAA 


GAACGAGACA 


TTACTCAAGG 


GTTGGCTAAA 


GCGCAATGGC 


840 


tnbL 1 ^Ort-rl l 


GGAACGTCTT 


AGTGATGAAC 


CGTTGATTGT 


TTTAGATGGT 


GCCCACAATG 


900 


H 1 1 u^-rUj 1 


AAAACGCTTA 


GTTGAAAACT 


TGCGCAAAGA 


GTTTCCACAA 


CACACGATTC 


960 


ATATTTTATT 


TTCAGCTTTA 


GCAACTAAAG 


ATGTGGATGA 


GATGATTCAG 


GACTTAAAGC 


1020 


AAGTACCGAA 


TGCGCATCTT 


TACTTAACTA 


GCTTCGATTA 


TCCTAAAGCT 


ATTGCACTTA 


1080 


CAGAAATGGA 


AAAATATGAA 


GACGATTTAA 


CAGAAATTGT 


GTCTTTATGG 


CAATTTGGCT 


1140 


TAGGCGAAAT 


TTTAGAAAAA 


ATGTCGACAG 


ATGATTTATT


ATTAGTAACA 


GGTTCATTAT 


1200 


ATTTtGTTTC 


TCAAGTAAGA 


GAATTACTTT 


TAACTATAGG 


AGGAAATGAT 


GAAGAAATTT 


1260 



GATGGCGT 

(2) INFORMATION FOR SEQ ID NO: 484: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1217 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484: 



nAAGCAATAA TTGAACCGTA AATTGCaCCA ACCCCtTTGT TCGTACCaAT CATGATTAAA 60

GGCGCAACTG CTTGAATGGC ACAACCTAAA ACGACTGGCA AGCCAATCCC AAAAAACTTA 120

TTCACCGTTA ATTGTAAAAG TGTCGCAACG CCGCACATAA AAATATCGAT TGAAATCAAA 180

TACGTCATTT GTTGCGAATC AAAATTCAAG CCGGTTCCGA TTAATAAAGG TACGGCAACT 240

GCTCCAGCAT ACATCGCTAA TAAATGTTGC AATCCTAAAA CGGCGGCTTT CCCATTTGGT 300

GTTTCCGTCT GTTGTGGTGT ATTCAATTGT TTTGTTTCCA AGATTATGCA TCCTCCTCTA 360

AAAACGTCAC CGTTCCTTCA CTTAATGAAG CAATACGTGC TAAAGAGACT ACATTTaAGC 420

CCATGTCTTC TAATAACTGa CGACCATCTT GGaATGaTTT TTCaATTACG aTACCAATTC 480

CTTCAACCTT TGCACCAGCC TGTTGGCATA ATTCTACCAA TCCTTTGGCA GCTTGTCCGT 540

TAGCTAAAAA GTCATCAATA ATCAACACTT TATCAGCATC AGATAAAAAT TTTCTTGAAA 600

TGGAAATTTG ACTTGTCACT TGTTTTGTAA ATGAATAAAC TGAAGCCGTT AAAAGCTCTT 660

CATCCATGGT TAAACTTTTT GCCTTACGAG CGAAGATCAT TGGCACGCCT AATTTTTGCG 720

CTGCGTATAA GGCAGGAGCG ATTCCTGATG CTTCAATTGT AATTACTTTG GTAATGCCTG 780

CTTCTGCGAA AACCTCAGCA AAACGATTAC CCATAGCCTC CATTAATTCT GGATCCACTT 840

GGTGCGTAAT AAAACTATCT ACTTTTAAAA CACCTTCACC TAATACACGG CCATCATTTT 900 

TAATACGTTC GACTAATTCC TTCACAATAT GAACTCCTTT ACTTATTTTT CGATAGATGC 960 

TTTGTTCTCA TCCTGAGGAA AGGACGGTTC GTTAAAAACG AACGAAATAA AACAAAGTGC 1020 

TATTTATATT CGTGTTCTAA CAAAAGAArA AAGCCCATTT TGCCAAAGCA AAAAAGGGCT 1080 

ATCCGCTCTT CTTCACTTAT AGACCAGCAT TTACGGTGCT GGGTAGAGAn GGTCAACCGT 1140 

ATTTTGACCT TATACAAGCT TCTATGTGTC TCTAnGGATA GTACAAAAAT CTCTAACTnT 1200 

CCAATGGGTA ACAACTT 1217 
(2) INFORMATION FOR SEQ ID NO: 485:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1929 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 



AACCATCCCC TAAGGATTGA TTCAACTAAC GCTTTACGGT CGATTGAGTA AGAAATTGCT 60

TTACGTAAGT TCGCATTTCT AAATGGTGAT TTTTCATCAC GTTGATTTAG TTCCATATAT 120

TGTGTTGATG CTTCTTTTTG ACTAACAAAA GCTGGGTCAT TGGCCATTTG TTGGGCTAAT 180

TCACCAGAAA GAACGACATC GTCTGTTTGT CCATCTTGGA ACAAGTTCAA CGCGGTTGGT 240

GATTCTTTCA CGACATTCAC ATCTACTGAG TCCAGTTTCA CAGTATCTTT ATCCCAATAT 300

TGATCGTTTT TCTTGAATGA CCATTTTGTA TCTGTACCAG GACCATCAAA GCCGTCTAAG 360

ACGAATGGAC CATTGTAGAC AGCACTTTCG CTGTTTGATG CATAATTTTT ACCATATTTT 420

TCCACAATGT CTTGACGTTG CGGGAAGAAT GATGGGAAAG CTAATAAGTA ATCAAAGTAT 480

GGTGTTGCTT TTTCTAAAGT GATTTCTAAT TCTGTATCAC TGACTGCTTT AATTCCTAAT 540

TCTGATTTAT CTTTTTCCCC TTTAGCAATG GCATCACCAT TTTTTACAGA GGCATACAGA 600

TAAGCATATT CAGAAGCTGT CGCTGGATCA ACTGTTCGTT GCCATCCGTA AACATAGTCA 660

TTAGCAGTCA CTGGTTTACC GTCTGACCAT TTTGCATCTT TATTTAATTT AATTTTGTAT 720

GTTAGTCCAT CTTCAGAAAC TTCTGCTTTT TCCGCTGCAC CTGCAGGTTG GACTTTGTTA 780

TCTTTGTCTA AACGATAAAT TCCTTCATAT ACATTATTTA ATGCAATAAA ACTAATTCTG 840

kCTGTTGCTA GTGATAAATC AGCTGTTGGC ATTTCTTGTT GTTCTGTGAC ACGTAAAACT 900

TGTTCGCCAC TTGCCTTACC GCCACCAGAC GCGCTTGAGC TATcTGACGT TCCGCCGCCT 960

CCACAAGCCG CTAAAGTTGT TACGCTAAAT AATGTAATCA CACCGAATGT TAATGACTTT 1020

TTCAATTTCA TAGTGTTGCC TCCTAAAATA ATGCAATCTG TATTTTCTGA AAAATCaGaA 1080

kTTAATAATT ATTaTkGCTA TGGaATATaT CTtACATTTt TCATATTAAA AtgCAAGTCA 1140

AAtGkTAACT TTTtTTAATT tTccTCcCTA TCAGAAACCA CTCTTTTTCT TCCTGTCTTT 1200 

TATTGCAATT CTTTTGTCAG TTTTCTAAAC ATTATGGTAA ATTAGGAAGT AAAGGAGTGA 1260

CAAAATATGA AAAATTCTAC GTTTTTAAAG GATTTGTTTG CTATTGATAA AGCAAATGAC 1320 

GAaTTATTTC GGTTaGTTGG TGTCGCAATT TGTATGGCAA TTCCATTATT AATTGGTTAT 1380 

TTTTCCAATA ATTTACTCAT TGGGACCTTT GGCtCGATGG GCATTTATAC GTTTATTTAT 1440

TATCAGCCGC TTCCTTTGCC TCAATTATTG CGCAGACTGA ATATTGTTGG CTTTTTTATT 1500 

GTTCTGGGCA ACAGCTTAGG AATGCTAAGT CACCATGTAC CTTGGTTGAT TCCCATtACA 1560 

ATTGCCATCG TCGCTTTTCT TGCACGCTTA CTTTTTAGAT TGTATGGCAT TGAGAAGCCT 1620 

GGTGCTTTGT TAGTTATCAT GTCAACCGCA ATGGGAACAA GCAACAATTT TCCTTTACAT 1680 

AAGATTCCCA TCATGGCCAG TTTTGTTCTT TTAGGAGTAG TGACCGGCAT TATCATGGGC 1740

ATTGTGCTTC ATTTCATCGA CAAGCGCCCC TACGTTTTTC AAAAACGGAT GTCTCTTCAA 1800 

GAACGACTTT ATATCGATCC CGCTTCTTTA TTAGATGCTT TGCATTATGC GGCCATTTTA 1860 

TTTTTAGCGG CTTATTTAAG CCAATCGCTT CATTTAGTCA ACGCCTATTG GATGACGTTT 1920 

ACTTGCGCA 1929

(2) INFORMATION FOR SEQ ID NO: 486: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3893 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486: 

TTGGTGATTA CGACTGCATT CAGTATTTCT ACGATGTTTT TCAATATTTC GGCTACCTAT 60 

CCCTTCTTTA TGGTAGCTTT TCTACAAATT GTGACCTTTG CCGGTATCTA TTTCAAATTG 120 

GGTGACATTA TGAGTATGTT CAACCTTCAG AGTAATGACA GTCAGTCAAT GGGCAGACGG 180 

GTCATGCGAA GACCGCAAAT GTTGATGAAC CGTAAATTGC GGCAACTCAA TCGAAATGTG 240

GGCCGAACTC TGGCTTTTGG CGGTGCTGCA GCAGTTGGGA ATAAGTTGGC AAAGGAACAA 300 

TCAAAACCCA AATTTAAGCC AGCAGGTTCT TCTCTACGAA AGAATTCACG ATTACCTAAC 360 

GATCGCGAGG TATCCTCAGA CTCGGCTAAG GAGAATCCAA TTTCAAATAA TAAGAAACAA 420

TCCCGAATGA ATTTGACTGG GAGAAAAATA GGCAAAGTTC TGGATACCCA GGCTTTGGTG 480

AAAGATAAGG CAAAACAAGT AAAGGATCaG GTACGGAATA CGCCGACCAA CTTGAAATAC 540

ATCCTCCATA AAGGTCTGGA AAAGACGAAA AAAGCACCGA AAGAATTTAA GCGAGGGCTT 600 

GTTCAAGAAA AAGCAGATCG AGAGAAGCTG CGAGACAAAC AACGACAACG CAGAGATGAG 660 

AAAATGGATG AAAAGCGAAA AAACTTGGGC GAAGTAACAG ACCGTCATGG AAAAAGAAGA 720 

AGTAATGTTT CGATAAAAGA GGATCCAAAA CCTCAGAAGG ATAGGATACT GAAAAATGAA 780
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CTTCCAAAAC 


GGATTGTGAT 


GGTAAAACCG 


AATTCTGAGG 


TAAGGAGAAG 


ACTTGTTAAA 


840 


CAGAATTTGG 


TTGCGAGAGA 


GAATAACCAA 


CTATTAAGTA 


AGAAAAATTC 


TTTCCAGCAA 


900 


GTTAAGCAAC 


GATCTACTCT 


TCCGAAGAGA 


GCCAATCAAA 


AAGTACAAAG 


AATAATGAAG 


960 


GCTCGTCCaA 


AACCAAAGTC 


GGGTGATAAA 


AAATGAATTT 


AAAAGAATTG 


AGTGTCATTT 


1020 


CGTTAACCAT 


GTTGATGCTC 


TCTTTTATGG 


GCATCTTGAT 


GTGTGTAAGC 


TTATTGTTTG 


1080 


GGGAAGATAG 


TGAGGGGAGT 


GGTACTTCTG 


GTATTTCTCC 


TGGTGGCAAT 


AGTGTCTCAG 


1140 


AAGAAGTCTT 


GAAACATCGC 


TCAATGGTTG 


AAAAATATGC 


TAAAGAATCG 


GACATATTAG 


1200 


ACTATGTACC 


AATTCTTTTG 


GCAATCATCC 


AAGTTGAATC 


TGGTGGCACA 


ATGGAGGACG 


1260 


TCATGCAAAG 


TTCTGAGTCT 


GCGGGTTTAC 


CACCAAATTC 


ATTAAGTACC 


GAAGCATCTA 


1320 


TCAAGCAGGG 


AT G TAT T TAT 


TTTGCTTCCT 


TGGTTAAGAG 


CTCCAAAGAA 


CTAGGGTGTG 


1380 


ATCAAGATAG 


TATTATTCAA 


GCCTATAACT 


ATGGTGGCGG 


CTATTTGAAT 


TATGTCGCAA 


1440 


AAAACGGGAA 


GAAGCATAGT 


TTTACTTTAG 


CCGAATCATT 


TTCAAAGGAA 


AAATCGGGTG 


1500 


GTCAAAAAGT 


AGACTACCCC 


AACCCAATAG 


CGATTAAAGA 


AAATGGCGGG 


TGGCGCTATA 


1560 


ACTATGGCAA 


TATGTTTTAT 


TGCCTATTAG 


TGAAGCAATA 


CCTGACCACG 


ACACAGTTTG 


1620 


ATGATAAAAC 


AGTCCAAGGG 


ATTTTTGACG 


AGGCTTTTAA 


GTATGAAGGT 


ACTGCCTACG 


1680 


TATTTGGCGG 


TTCGAGTCCC 


GCAACGGGTT 


TTGATTGTAG 


TGGATTGACC 


CAGTGGTGTT 


,1740 


ATGCCAAAGT 


TGGTCTGAAA 


CTTCCCCGGG 


TGGCTCAAGA 


TCAGTATGAT 


GTCATGACTC 


1800 


ATATTGACTT 


AAAAGATGCC 


AAACCTGGTG 


ATTTAATTTT 


CTTTCACTCA 


ACCTATGATG 


1860 


CCGGTACTTA 


TGTGACTCAT 


GTAGGTATTT 


ATGCAGGAGA 


GAATCGCATG 


TTTCACGCAG 


1920 


GAAATCCAGT 


GGGTTGGACG 


AATCTTACAG 


AAAGCTATTG 


GCAGCAGCAT 


CTGATTGGAG 


1980 


CCGGCCGATA 


CAATTAAGAG 


AAAAGAGGAA 


AAATAATGAA" 


GATAAAAATT 


GAACGGATTC 


2040 


AAAAGGAAAA 


GATGTCAAAA 


AAGAAAAAAG 


AGCGGACAAT 


TTCAGTGGGC 


AAGCATCGAA 


2100 


AAATGGTGCT 


TGCCCTTTGG 


CTTTTACTGT 


CaTGTAGTCT 


TGCtTTTGGg 


TATCTACAAA 


2160 


TATTTCACCG 


CCATTGaTCA 


GCATACaCTC 


ACGTAAAAAT 


GGTTGTAAAn 


GAAAArGTAA 


2220 


TAAaTACTTC 


TGGAGTTGAA 


AGTTTTACAA 


AGAATTTTGT 


GAAAGAATAC 


TTTTCATGGA 


2280 


AGAATAACAA 


GGAAGTAATT 


GAAAAGAGAA 


TGAGTAATCT 


TGGACAGTAC 


CTTACAGAAG 


2340 


AAGGCCTTGC 


TTTGAGTCAA 


GATATGGTTC 


GAGTAGATAT 


TCCCACTAGT 


TCAGAAGTAA 


2400 


ACTCAGTAAA 


AATTCTAGAT 


GTTGAAAAAC 


GTTCGAAAGA 


TTTTGTCGTT 


TCATTTTTAG . 


2460 


TAGACCAAAA 


GATTACAGAA 


GGGAAAAAGT 


CAGAACAAAT 


TTCTTCTGCG 


TATCGAGTGA 


2520 


AGATATTTGA 


AGATGAAAAA 


GAAAATCATA 


TAGTGACGAG 


TTTACCAACT 


ATGATTAGTA 


2580 


AGCCAGGTAA 


GGCAAAATAT 


GAAGTAAAAC 


AAGTAGAAAG 


TGATTCTGAC 


ATTGATGCAA 


2640 


AAACAACGCA 


AGAAATTACT 


GAGTTTCTAG 


AGACATTTTT 


CAAACTTTAT 


CCAACTGCTT 


2700 


CTGAAAAAGA 


GCTAGAGTAT 


TATAGTGAAA 


> ATGAATCTAT 


GAGCCAAGTT 


1 AGTAGCGATC 


2760 
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TTAGATTTGT AGAAATAACC AATCCTATTT ATCTTGAAAA GGACGGTTCG GTACAAGTAA 2820 

AAATCGTAGT AAAATACCTA AATGAAATTG AAAAGGTGAC AAATAGTTTT CAATACAATC 2880 

TATTGTTAAA CAAAGAAGAA AATTGGAAAA TTATTAATGG AGAATAATTT TTTAAGGAGA 2940 

ATATAAAAAA CAAGGAAAGA ATGAATCATT GTAAACTTGT GGGAGCGTCA ATAATTTTGT 3000 

GTAAATGATT CTTCCCTACT GCAGATTGTT TCTAATTCTC AATCAGTTGG CTTAGCATTT 3060 

GTTCAAGTTC GCCTTCGGCC TGTTGAAAGC CTTTATGACT GCGACCTAGG AATTTTTGGT 3120 

TGTAAGTATC AAAAGACGAG ACTAAGAAAC GTTCCATCGA CTCTTCGTTT TGGAACTGTT 3180 

CCTTGCGGTG GCTGTATTTC TTGATTTGCT TATTAAAGGA TTCAATCAAG TTCGTAGAAT 3240

AAATACTCTT GCGTATGGCC AAGGGGAAAT CATAGAAAGT GAGCAAGTGA TCGTTCGAAA 3300 

GAATCGATTC AACCACTTTT GGATAGCTGG TTTTCCATTT TTCCGCAAAA GCACCACGTG 3360 

CTTCCAACGC CACCTTTTTA GATGAGGTTT GATACACCAT TTTAAAATCA TCACAGACTT 3420 

CCTTACGATC ATCGACACGC ACTTTGTGAC TGATATTACG GGAAACGTGT ACACAACAAT 3480

GTTGAAAACG AGCTTTGGGA TAGAACCGAC TAATCGCTCC TACCATCCCC TTTAAGCCAT 3540 

CCGTGATGAA CAGGAGGACA TTTTTCAAAC CGCGCTCTTG AAGGTCCAAT AAAATTTCCT 3600 

CCCAAATCGT GATGGATTCA GTCGGTGCAA yCGCATAGCT CAATACTTCC TTTGATCCGT 3660 
CCGGGCGAAT GCCAACTGCG ATATGAATGG CTTCCTTGGC GACGGTTTTC CGCTTTAACG 3720 
GAATATACGT TGCGTCCATA TAAATAGCGG CATAACGGTC ATGAAGCTCC CGCCCTTTAA 3780 
ACGCCGTTAC CTCTTCTGTA AATGATTTkG TTATATTGGA CATGGTTTGG GgCGTGTAGT 3840 
GaTGCCCATA CATTTTCTCA ATCAAGCTGC GATTTCGACA TGGTAATACC TTG 3893 
(2) INFORMATION FOR SEQ ID NO: 487:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487:
CTCGTTCATC TTATCCAGCA CCGCCCGCTG TGCAGGGGTA AGGACACGGG GCGGGGTGAT 60 
GCGTACCCAT TTTTTCGGCA GTTCAAAGGA AAGCCCGCCA TATCCATTAT CCCCGGTCTG 120 
CCGTACCTGT TCGGGGTAAG TGCCGCACAG TTCGGAAAGC TGCCTGCAAA GACTGGTATT 180 
GTGGGTGTAG ATGCTGGCGG CGGTTTCTGC TTCGTTATAC AGCACGATAG TTTCCCTCTC 240 
GGTTCCGGAC AGCTTCATCG CTCCACCTCC TGTCCGCGCG GGGGGTTCTC CGGCGCGTCA 300 
CGGCCTTTGG CGGCATCTTC CCGGCACTGG TTCAGGGCAT CCAGCACCGA GGGTTTGGGC 360 
GGCTCGTTAT TGATGATCCC GTCAATCTGG TTATAGTTGC CCTCCACGCT CATTTCAGCG 420

GCGGCAAGGT AATTTTCGGG GATtACTTCC TGAAAgGCGC GGACGTTCCC GTCCCCCGCC 480

AGTACCGGCA CATAGAGCCG CCCGGTCTGt AAATACATCC CGTCCAGGCC AGCCCCGCcT 540

TTGTAGCGTC cTGCGtGTCA CGCTCACACT CCAAAAGGGC AATCCCGTTT GCCAGAAGAT 600

601 

(2) INFORMATION FOR SEQ ID NO: 488:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

ACTCTTCTGC TAATTCTAAA AATACTTGAT CAACCACTTC ATCACCAATC GAATGGCAAT 60 

CAAAATGTTC AGGATAATGT CCAGTTAGTT CCTTAAAGCG CGTCAATTGC GCAATCGTTT 120 

CAAGTCGTAC TTCTTCATAT ACGAAGGTTA CTTGCTTTTG TTGACGATAA TAAGAAGAGC 180 

GATGAAAATT GCCTTGTTCA TCTACCAGTG AAGGAATCAT ACATGGGTCC GCACAAGGTT 240 

TCCCTAATAA AAAATTCGTA TGCTGACCTA CAAATAATCC TGGATAATTT TTTGCTAACG 300 

CAACGGCGTG TtCTGCTGTT GGTAAATTGG GCATCAAAAG TGTCGACGTT GTTAAACCAT 360 

CGACATAACT TTTGATAATG CCATGATTAA TCGCTTCTGA CATCCCAAAA TCATCAGCAT 420 

CAACAATTAC TTTAACCAAC AGCTCCCCCT CTTTTCTTTA CGTATTTTTG aTGACAATTC 480

CGACAATAAT ATTAACAGCT GATTCACAAT CATTAATCGT GCGCTCAAGC TGaTCATAAA 540 

CATCTTTCCA CTTGATAACT TTCAATAAAT CTTTTTCGTT AGTCATCAAA TCTTTCATGG 600

CGCTACTGTA TAATTTATCG CCTTGAGATT CAATCGTATT CACTTCATCA ATCAATGAAA 660 

CTAACGTTTT GGAATTCTTA AATTTTGCAA ATTCCTTCGT TGCTGATTTA ACGCCTTCTG 720

TCGCTTTAAC AATATAAGCA GTTAATTCTA GCGCGGGCGC AATCATTTCT TCGACAACTA 780 

AATGATCAAG TAAATAAGAC AAACTATTAA TGCTATCAAT AATGTTATCT AAGTGATCCG 840 

TAATATCGAC AATATCTTCT CGATCAATCG GCGTAATAAA GGAAATATAG AGTTCATTTA 900 

AAATTTTGCG AACAATTTCA TCATTCTCTT TTTCCAATTG ATGAATCGCT TCTGATTTTT 960 

TTACTAAGTT TTCTAATGAA TAATTGTCTA CAATTTCTTG CAAAACCTTT GCCGCTTCAT 1020 

AAGCATTTGT TGCTAACTGG TTTAATTCTC CAAAATAATC AAATTGTTTT TTACGTGCCA 1080 

TGTTTTTCTC TCCTTAAAAA ATTGCAATAA ATAATTTTGA CATCACATAA GCGATCrGCC 1140

CACATCCTGG GAAAGTTAAA ATCCATGmwA TCAGCATTTC TTTCACGATA CGCCAGTCAA 1200 

CACTTGAGAT TCGCTTAGAA GCACCTACTC CCATAATCGC TGTTGTTTTT GTATGAGTAG 1260 

TTGAAACTGG AATCCCAAAA GcCGaTGsTw AAAATAAACA AaTAGCCGTG CTTAAATCTG 1320 

CAGTGAAGCC TTGATAGCGT TCAAGTTTTA CCATATCCAT CCCGACTGAT TTAATGATTC 1380 

TCATGCCGCC CACAGAAGTT CCAATGCCCA TTGTAaTGGA aCACrGAaCC ATAcCCATAA 1440

CGGAtCACAA AACCGCCACt GTTTTTTCGA CTAAATTATT ATAAAAAAGC ACTAACATGA 1500

ACACGCCCaT AAATTTTTGC CCATCTTGCG CACCATGTAA AAAGCATTAG CTCCTGCACC 1560

AAgGCTTGTC CAACTGTAAA AAATTTGTTG GCTTTTCTTC TAGGCACAGC CGAGAAAGCT 1620

TTCACAACAA GTTTGGCAAT TAAATAACCG CCGCCAAAGC CCATAACGGT AGAAACAACT 1680

AGACCAACAA GCACTTTGAT CCATTCCGAG CCATTGACTG CACCCAAACC ACCTAATGCC 1740

ATTGCCGAAC CAGTCAAGCC GGCAATTAAC GCATGGCTTT CACTTGTGGG AATACCAAAA 1800


T AC LAC b C Ab 










/ibuLAC^ 1 b I 


I o bU 


GAAGCGTTTC CATTTGCGTC AAAACTAACA ATGTTACTAA TCGTTTCAGC TACTtGGGCA 1920

TTAAAAAAGG TCATGACTAA TGCACCTAAA AAGTTCATCA CAACAGCCAT CCAAATTGCG 1980

ACATTCGGTT TTAACACCCG AGTCGAGACA GCGGTTGCAA TTGCGTTCGG CGCATCCGTC 2040

CAACCGTTAA CAAAAATTAC ACCCATTACC AAAGTAATCG TTACCAATAA TGCAATACTC 2100

ACAAAGACA 2109



(2) INFORMATION FOR SEQ ID NO: 489: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7712 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 

GTAAATTCAC TTCAATTGCC CCTTCCTTCT CCTGTTCTAT TTTACGTTAT CTTGCTGGCA 60

GTGTCACGAA AAGAAGAAAA GAATCTGGAA TAATTGCCGT TTGACAGTTT ATCCAGATTC 120

TTCTTACACC ATTTCTAACG AtACGTAATC CCTAATGTAA CATCCTTACG AATTAAAACT 180

GGCGTTTTTT CCAGTGTTAA ATCGGCGCCA TCGACTTGAA TGGCGACGTT ATTTGAGGCG 240

CCTAGAATCA GTGTTGCATT GGCGGCATTC GCAGGTAAAG CTGTTGTTTG GGTCTCTCCT 300

GCTTGCAGGG TGTGTTGGTA CGTATAGTTG CCATCGACCA TCACGCCAAT CCAGCAAGAG 360

CCTTCAGTCC CTTTAAAGGT AATGTTGATT GGACTATTCG CATCTTTAAC CGTTGCAGTC 420

ACTTGATTGT TAGCATCACT CACCACAGCC ACAGCTGTTT TCTTCTCTGG TTCGCTTGAA 480

GAAGACGGCG GTGTACTACT TTGTGTGCTA CTTTCTGGGG CTTGACTTGA TTCTTTGGGC 540

TTTTCCACAC TGGCACTAGT TGCCGGTGTT TGAATAATCG GTTCGGCTTT TTGATCTTGC 600

CACATAATGT ACAGCACCAC GACCGCAATG GCTAAACCAA TTAACGAAAA GGCAATGGCC 660

GGTAAGTTCC GCAGTAACCA ACTGGATTGC TTTTCTTCGT CATGCAATTG CTTACGTGAT 720

CCTTCAAGCT CTTCATACAC TGGCTTGGGT TCTTCGACAA TTGCTGGTTC TTTGCCATCA 780

TAGATATCTA CCAATTGATT GCCATCTAAG CCGACCGCAC TTGCATATTG GCGGATAAAC 840

GCCCGCACAT AAAAGGTGCC TGGTAAGGAC TCAAAATCGT TTTGTTCAAT AATTTCTAAA 900

TATCTTTTTT GAATTTTTGT CATTTGTTGC AGTTCATCAA TTGAGATGTT CTTCTGCAAT 960

CTTGCTTCTT TTAGTGTTTC TCCAATATTT ACGCTGGCCA CTCGTCCATC TTCTCCAGTC 1020

TATTTTGTTA ATTTATCGAT TGTCTCATCA GTCTTTTTCA ATTGCCAAGG AAAACATTCC 1080 

CGTCCTTTAA GAATACCACA AAAGAAGCCA ACTGAGGATA ACAAAATTGA AAGATTTCAA 1140

AGATTTTTCT TTTTCTTAAA CAGTkGGTTT ATTGCTCGGT TTCTCCTTTA GGATACATGT 1200 

AGAAACGGCT CAATCCTTCT TCACGAATAA AGGTCTCTGC TACTTGGCGG ACGTCGGCTA 1260 

ATTGAATACT TTCAATGACT TCAATCTTAT CAAACAGCGT GGTTTCCCCA AACAATGATT 1320 

GTGAAAATTG GTTGGCGATA TACTCTAAAG AGTTTAAGGA TTGGAAATAT TTACCAATCA 1380 

TTTTTTTCTT CAATAAGGAC AAATTTTCTT CCGTTAATTC TGGACTTGTT GTGGCAGATA 1440

ATAAAATAGC TTCAATTCGT TCTGCTAAGA CTTCTGGTTG CTCACTATCG CCGCCAATAT 1500 

CCGCAAAATG GAAACTGCGA TCTAAATTAA AGTCATAACT AAAACTATCA TCTAACAAGC 1560 

CTTCATCATA AAGAGCCAAA TAGTTTTGAG AGGTATTACC AAACAATAAT TGTAACAACA 1620 

AGTTCACGGT GACTTGATAT TTCATTAAGT CACGACCCGT TGTTGGCACT TCATCTAAGC 1680

CTTTCAAACC AACAAGCACT TTTGCCCGTG TGACTGTCAT TTCAATGGCG CTCTCTTTGA 1740 

CAATGTCAGC AGCTGTTTCT TCAGGAAAAT GCCGACGAAT AGGTTGGGCG GCAGGAAACT 1800 

CTTTCGCCGC TTGGTTTTCA CGAATGAAGG CCATTAACGC TTCAGGGTCC ATTTTCCCTA 1860

CCACGAATAA GGTCATATTA CTTGGATGAT AAAACGTATT GTAACATGTA TACAAATCTT 1920 

CTGCGGTAAT CTCCGCAATA CTTGCCACTG TGCCGGCAAT ATCAATATGC AATGGATGCT 1980 

TCGGATAAAG ATTGCCTAAA ATCCCAAAGA ACAAGCGCCA GTTAGGATCA TCTAAATACA 2040

TTTGAATTTC TTGTCCGATG ATGCCTTTTT CTTTTTCGAC CGTTTCCTTC GTGAAATACG 2100 

GCGATTGCAC AAAATCAAGC AACGTTTCTA GATTTTGTGT CACTTGATCC GTCGTTGAAA 2160 

ACAAATAACT CGTTTTCGTA AAACTGGTAA AGGCATTCGC TGAAGCCCCT TGTTGCCCAA 2220 

ATTTTTGGAA GACATCGCCA TCTTCTTTTT CAAACATTTT ATGTTCTAAA AAGTGGGCGA 2280 

TTCCGTCAGG CACTTCAATC ATTTCCTCTT GACCAATTGG CACGAACGTA TTGTCAATGG 2340

AACCATAATC GGTCGTAAAC AAGCCATACG TTTTTTGGAA ATCATTTTTC GGTAATAGGT 2400

AAACCTTTAA ACCATTTTCC AGTGTTTCTG TATACAATGT TTCTTTAATT TGTGCATAGT 2460

CTTTTTTAAT CATTTTCCGT TTCTCCTTCC AAAAAGAAGA TCGCTTGAAG TTCCAAGCGT 2520 

TTGGCTACTT CTTGGATTTC TGGAATAGTC ACGGCATTGA TCCGCGCAAT CCATTCCTCC 2580 

GCCGTCAGCA TCGTCTGCGG CATTAATTCG TTCAAGTATT CTTTTTCCAG CCACGCGCCT 2640 

GCATTATCTA ACGCTAAGAT ATACTGATTT TTTAACATTG CTTTCGTTTG TTCAATTTCT 2700 

AATTCCCTAA TCTTACCTAA GCGAATATTT TCCAATTCTG TCGAAATCAA ACGCAAGACC 2760

TGATTCCGAT TTTTGCCATC GATCCCTGTT TGAACCGTCA TGAAGCCaCG GAAGGTGTCG 2820 

ATACTACTAG AAGCATAATA CGCTAAATGC TCTTTTTCAC GGACATTCAT AAATAATTTA 2880 

GAATGCGGGA AACCGCCAAA AATCCCATTA AACACTTGCA ACGCAAAATA ATAGCTATCG 2940

C U A 1 AA1 AAA TATCTGTGTT GTAAGCCAAG TTTAATTTCG ATTGGGCTAA CACCTCACGC 3000

CTTCAATCAC GTTGCGAATT GGTTGATTGT AGAAAATCGC GGCTTTGCCT 3060

LACGAG GTGTAAACGG TAATTGGTTG AACAATGGCA CTAACTCCGC CTCATTGACA 3120

TCACCTAAAA CAAAAATATC CACTTGATCT TCCGCCAGCA TTTTTTGATA ATACGCTGCT 3180

AATGACGCCG CTGTTTCCTC AGctAACGCC GCAACAGTCC CGAAACTAGG GATTTTTTGA 3240

TCCTCCGATT GATTAAAATA GACACTTTGA AGCGCCAAAG AGGCATAGGT TTGCTTATCC 3300

TCCACAATAC TTTCCAAGTA CGCTTTCAAG TTCTCTTTTT CCCGTTGGAA GGTTTCCGCC 3360

TCAAATTGGC CCGCTTGAAT ATTGGGGGCA AAAATAATTT CCTTCAAAAA GTCCACCGCT 3420

TCGGCGAGAA CCTGACTATC CTGTAAATAA TGATCATTGA CAATGTTCAT AGAAATATTA 3480

AACCAATGTT GATTGCCTTT TTTGGAAACG CCAATCCCAA AGcTGGCGCC GTAAAGcTCC 3540

GCCAAACGCT CACTTAACTT GACTTGATTC GGGTAATTCA AGCTGTTGGT TTCCATCAAA 3600

CTGGACAACA GCGTTCTTTT TGTAATAGTT TCATGATTTA ATCGTGTGTT AAAGCGGACT 3660

AAAAGACGCA CCGTTTTATA TTTTTCTGTA GGAATGACGT GTAAATTCAC GCCTTTAACT 3720

AATTGAACAG ACATTGCCGC ATCGCTCCTT CATTTGCTTC ATCTTACCTA AAGAATGCTG 3780

TGAAATATAT GTATCGCGTA AATTCTAAAG CATTTTTTAA TGTTTTTTCC AATAGCTCCA 3840

TTATAACACA GCTTCAGGTG CTTTCTGAAT GATAAGAGTT TTTCAGTTCA AATTTTCAAG 3900

AAGATTCGGT ATACTGTTTT TAAAGTGAGG AGGGAAAAAA GAATGATTAA AGAATTTAAA 3960

GAGTTTATCA TGCGCGGGAG TGTCTTAGAC CTAGCCGTTG GTGTTGTCAT TGGTAGTGCG 4020

TTTACAGCGA TTGTGACACA AGTGGTGGAA GGCTTGATTA CCCCGTTAAT TAGTCTGATT 4080

TTTGTCTTGA CGACTGGGAA AAAGAGCGCC GACGATGCAT TGGGTGCCCT TGTCTATAAA 4140

GTCGAAGGCG TTGAATTCAA TATTGGTAGC GTGATTAGTG CACTTATTAC ATTTTTAATT 4200

ACGGCCTTCG TCCTCTTCTT GATTGTCAAA GCGGCCAACA AAATGAAAAA CCGCGGCAAA 4260

AAAGAAGAAG CGGCAGAAGA AGAAGTGGTA CCAACGAGCG AAGATTACTT AAAAGAAATT 4320

CGCGACTTAT TAGCGGCGCA AACACCGCCT GCTGAAACTG TCAAAACAGA CAGCACTTTC 4380

ACAGAGAAAT AAGCAAAAAG TCATCCTAAG GGGATGACTT TTTTGTTTGG ACTTACACAG 4440

ATGGTCCTGC GACGAGGTTC CACGTGATAC TGCCTGTGTA GGCCCCTGCT TTAATATTGT 4500

TGGCTGGTGT TGTGATGTTT AAAGCTGTAT TAAACAGTGA AAATTGCCAT AATCCTTGGC 4560

CTTGACCAGC TGCTGCGTTA gcCATTAATT GGGCTTCCCC GCCTAAATCA AGGGTCCAAT 4620

TTTGCGGAgC AGCCAGTTGT GGTCTGACTT CTGGTGTAGA GACTGCTGTC GTCAGAGCTA 4680

CATAACTATT GCTTGTTGCC TCGTATTCTA AATCAAGTTG AGTAATCAGC GCCACTTTAA 4740

TGTTGCCTTG TAAGCTTTCA GTGCCTAATT TTAGGCCATT AGATTGGGCC GTTAAATGCC 4800

AGCCTTCTTT GCCGCCGCGT ACGTCACCGA CACCAATCAT TTCTGTATGG CCGTTGGTGG 4860

CTTCGGGGTC TCTTTGGTTA AGAATTGGTT TCGTTAAATC ATCTGAAATT GATAATTTCC 4920

CAAAATTAAA TTCTCGAGGA ATAAACAGTA AATCAAAGTT ACTGTTTGAG CCACTCGGGT 4980

CTGGATCAAC GGTATTGGCA XCGCCGTTGC CATCTGGCAA CGTGCCGCCT TCAAATGCTG 5040

CTGTGACTGG CGTATTCCCT GTCGTAGTCG CCTGTGCCGA AAGCGGGGCT AAACTTGCGC 5100

CAACCACTGC GCTACCAACT AATAAACTTG CCATCATTTT CTTTTTCATT TGTGTTCTTC 5160

TCCTTTACGA TTGAATGATT ATTAGAGGGC AGACCAATAG CAGTTATCCG CTGATGGTCC 5220

GTCCTCTGTT CTTTTGTTGC TTGTGCTTAA ATTGATGGTC CTGCGACTAG GTTCCAAGTG 5280

ATATTGCCTG TGTAGGCACC TGCTTTGATA TTGTAAGCTG GTGTCGTAAT ATTTAAATAT 5340

GGTCTAAATG AAAATTGCCA TAATCCTTGC CCTTTTCCTG CTGTAGCATT CGCAAGAAGC 5400

ACTGCATCTC CGCCTAAAGC AATTGAAAAA GCTGTTGTAT CAACAGTTTC CGGTTTGGTA 5460

ACTGGGTCGG CAAGATCAGC ACTTTGATAA TCATACGTTT TAGTTTCCTC ATTATAAATT 5520

AATGGATACA AAAACATTGC TCCTGTTGTA ATACTGCCTT CCAAACTTTC CTCGCCGAGC 5580

TTCATACCAG TCGATTGAGC AGTCACATGC CAGCCTTCTT TGCTCCCTCG TATATCTCCT 5640

ACACCAACAG ATTCCACATT TCCAGCACGT CCTTCATCTG TTTTATTAGG AATTGGTTTA 5700

GTTAAATCAT CAGAAATAGA TAATTTCCCA AAATCATATT CTCTTGGAAT AAAAAGTAAA 5760

TCAAAGTTAC TATTTGGCTT CGTCGGGTCT GGATCGACTG TATTTGGAAC GCCATTACCA 5820

TCTGGCAACG TGCCGCCTGC AAATTCTGCT GTGACTGGTG TATTCCCAGT GGTCACCGCT 5880

TGTGCTGAAA GAGGTGCTAA ACTTGCGCCT ACCACTGCGC TACCAACTAA TAAACTTGCC 5940

ATCATTTTCT TTTTCATTTG TGTTCTTCTC CTTTACGATT GAATGATTTA TTAGAGGACA 6000

GAATAGCTGA GGTTCTGCCC TCTGTTTTCG TTGCTTGTGC TTAAATTGAT GGTCCTGCGA 6060

CGAGGTTCCA TGTGATGTTG CCTGTGTAGG CACCTGCTTT GATTTGTTGG TATGGAGTAG 6120

TAACTTCAAG ATTCATGGTA CGAATACCTG AATACCATAT GCCTTGTCCT TTTCCTGAAC 6180

TTGCTGTCGC AATTGCAATA GCATCACCGC TTAACGGAAT ATTCGTTCCC GAAAAATCAG 6240

GTGTAGTTGG ATCAGCTTCA TAATTAGCTA AACTCTCAGC ATCAGTTAAA TAAACATATA 6300

TTTGTTTTGT TGCATCAAAT TTTGAATAAT ATCCGATTTG ACTAAAATTA AGGTTTCCAG 6360

CTAAGGCTTC ACTCCCATTA GCCAAAGTTG CTGAAGCTGT GACATGCCAC CCTTCTTTCG 6420

TTCCTCTTAT ATCTCCGATT GCATAGCCAA GATATTGTTG TGTAGAACTA ATATTTTGTA 6480

GACTATCGCT AATTTTTGTG GTGTTAAATG TAACTGCTTT TGGTATCTTT ACTAAGTCGA 6540

AATTAGTGTT AGGTACTGTC GGGTCTGGTG CTGTGTTATC TCCAGCATTT CCTGTCTCGT 6600

TATCTAATGT TCCTCCACCA AACTCCACCT GCACTGGCGT ATTCCCTGTC GTTACAGCCT 6660

GTGCCGAAAG TGGTGCTAAA CTAGCGGCAA CGACTGCGCT ACCAACTAAT AAACTTGCCA 6720

TCATTTTCTT TTTCATCATA ATATGTACGT CTCCTTTAAC TGTTCTGAAT TTCCAGAGCG 6780

CGCGGTTCTT TTTTCTGCGC TTCCGAAGCT ATAGCCTTAT TATACAAGAA TCTGCCCTCT 6840

TCGTAATTTT AAATATTTAA CAGCCATTCA TGAAAATTGA TGATTCCTTT AAAACCAAGC 6900

ATTTCTCTTG ATTTCAAGCT TTAATTTCAG CACTTTTTGA TATTCACAAA TGTTATTTAA 6960

TGATTTTTTC GTCAAATTCG TTTTCTCCTA AAATCTTATA CATTTACCGG CGGTTCTTGC 7020 

GAAAAAACCG TCAATTTTCA TTTAATGACA GCCGTTTTTG TATTTGTTCG TCACTAAGAC 7080 

TTACAATTTA CGGCGACGGG CGAATTCCAC AAACTTAAAT TTGTTTAAGC GGTGACGGGA 7140

TTCCGTATAG CCAAAACAAC GGGTATCTTC CAAGTTCATG ATACTACGAA CCACGACGAC 7200 

ATGATTGCCT TCGACTTCAC CAATCAGCTG CTTGTCCATC TCTGTGAGCG GTTCCACGGT 7260

AATTTCTTTC TGGGCATAAC TAATCGCCAA GCCAAGGTCG TTTTCCAGAT AATCGAAAAT 7320 

CGAATCTTCG GCACGGTGCT TCGGCAGTTC TGGGACAATC TCTTTAATCA AGTAATCAAT 7380 

ATCTAAAATA TCGACTTCCC CTTCAATCTC CCGTTGCCGA ACTAAGCGCC ACACAGGGGC 7440

CCCTTCCGTC CAACCAGTCA ACTGAGCCAA GTTGCGCTCC ACAGGAATTT CTTCTAAGGA 7500 

AACCACATGA GTATGACTTT CAATATGTTG CGTTTGTTGT AACTCCTTGT AACTGGTCAC 7560 

CCCTGAAATA GGAAAATTAA ACTTTTTAAA GTTGAGCACA ATCGAGCCTT TGCCTTGTTT 7620

TTTCTGAATG TAGCCAGCGT TAATCAGTAA ATTCAAGGCT TTCCGAATTG TTtCaCGGGA 7680

AACATTGTAA GTTTCCACTA ATTTGATTTT CA 7712
(2) INFORMATION FOR SEQ ID NO: 490:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490:

CGAACTTTGT GTTTTTCGCC AGCCATGTCA AAATAACTGA CGGTAATTTT GTTATGGCGA 60 

ACGACATACC CAGGTACATC CCGATCGACA GATAAACAGC CTTCCCCTTC ACCTAAACAT 120 

ACATCTTGCA CAGAATGGCT CAAAATTTTA GGGTTGTACA TCACCGTACT TAATGAAGGC 180 

GTTTCGTTTT CTGGGTCGTT ACTAGGAACA TGGACAGCAA TAATTCGTTT AGAAATATCT 240

AATTGTGGTG CAGCTAAACC AACGCCACCA CGTAATTGTA ATTCTTCGGC TTTGACTGGA 300 

TCTTGACTAT TTTTTAAGAA AGTCAACATG TCCTCACCTA ATTGTCGGTC TTCCTCAGTG 360 

ATAGGGACAG GCACTTCTTC TGCCACAGCA CGAAtGTGGG GATTCCCTTC ACGAATAATA 420 

TCTTTCATTG TAATCATTCA ATGGTTCCTC CTTTGAAACA TAAAAAAATT CTTCTTTTAT 480

TTATTCaTGC GCTGrAATAA TCAGTTGtTT ACCACCyTTT AGtGkAnCmC aAAAyCmCCA 540 

AAAAAGGGGs tAgAwTGgTT TAAAAAACn 569 
(2) INFORMATION FOR SEQ ID NO: 491:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1125 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491:

GnTTTACAAC TTATAATCGT ATCTGGCGTT TACCCACGTC TCTGCATCGC CATTAAGTCT 60

GGAATAAATC GCCTGCATTT TGCGAACGAA AAAGTTGCAG TAAAAAGAGA AATTCATCAT 120

CTTTCAACCA ATTTTTTGGT AATGATCAAA TAGTAAGTTT GAGATTGTTG TTTCACCAGC 180

TTCTAAATAT TCACGAATTG ATAACATTGT TGTTGTCCAC TTCCCCTTGC TTGTTTCaCT 240

TATTGCACTT GATAAACATC AAAATAAATA GCTTTCATAT AATCAATTAA ATAACTAGGA 300

TTCAATTCTT CTCCCgTTGC ATCATAAATT AATTGATTTG GTTTACGTGA AGCGCCGTAC 360

TGATGAATAT GTTGTGTTAA CCaTTTACGG ATATCAGAAT AATCCTCCGA AGCTAAAATT 420

TCATCAACAG ATAATTCTTG TTTCATTGCG TGAAATAGTT GCGCGGCGTA CATATAACCC 480

AAGGCATAAG ATGGGAAATA ACCAAAGCTA CCTCCAGACC AATGAACATC TTGAAGGGCA 540

CCTTCTAAAT CATTTTCAGG TGATACGCCT AAGTACTCTT GGTATTTTTC GTTCCATACT 600

TTTGGCAAAT CAGCGACTTC CAAAGAACCA TTAAACAACA TTTTTTCAAT TTCATAACGA 660

ATAATGATAT GCAAAGGATA GGTGAGACTA TCGGAATCAA TCCGTATCAA ACTTGCTTTC 720

GTTTCTTTTA AAGAAGCATA GAAATCTTCA AAAGCAATAT CATCAAAAGT GCCTTCTGCG 780

CATTCTTGGA AAAACGGGTA TTGTTTTTGC CAAAAAGCTC GGTTGCTACC AATGATAATT 840

TCATTAAAGA GCGATTGTGA TTCGTGAATC CCCATAGAGG CGCCTTCGCT TAGTGGTGTA 900

AAGTCAAATT TAGGATCAAA ATTTTGTTCA TACATCCCAT GACCCGCTTC ATGAATCACA 960

CCAAATGTTG CCATTGAGAA ATTATTTTCT TCCCAGCGnG TAGTAATCCG CGCATCATTT 1020

CGATTTAGAG CTGTCATAAA TGGATGAACT GTATCATCGA ATCGGnCTTT TGAAAAAnCA 1080

TAGCCTAATT GTTCAACCAC ACCAATCACA AAACGGnGGT GGTGG 1125



(2) INFORMATION FOR SEQ ID NO: 492:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492: 

GAAATtAcaA CAACATTGTC TAGTGCTGCT CCAATTCCTC CTAaCATTAA GCTGGCAAGA 60 

AAaACAGTTC GATAAGGTCT AGTTAAAAAT TGCATTTCTT CATAGCGCAG TCCTTGATCG 120 

TTTAATAATT TCATAGTAAT TGTGGTAATA AAATAaGrTa GAAAaaCACT AaATATTGTT 180 

GCTAAAACCT TCCTTAAATC AATTTTTTTT AGTCCGTAGA GAGTACCAGT TGATATAATA 240

ATaGCAATTA TTGTATAGAT TGATATTAGC AATAGTAAGT TTATGGATCT GTTTCGCTGA 300 

TTTATCCAAA GCAAGAAAAA AAGAAAGATT ATATTGAATA AAATACCAAC TAAAGAAAAA 360 

AATCCTTTTT TTCCAGCGAT ACCTAAGACT AATGAAACAA AACTAACTAC TAAAATAAAA 420

GTATAGCCAT CTCTTTTTAG AGAAATAAAT TCCCACTTTT TAGTGGAATT ATCTAATAAT 480

ATTTGATTAT TTTTATTAAA TACTGGaTTG GTACTTTCAT TTCCATTAAA TTTTACTTTT 540

ACAACAACAG TTTTTTTTGt ATTATTTACT AGTTTAACTG TTAGTAATTG TTCGCTGkCa 600

ATTATTTTTA CtGaTTTTAC aATACCmACC nGGGGAnGTA TAAAAGGTAA C 651
(2) INFORMATION FOR SEQ ID NO: 493: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493:

GATGGTTCAA CTAATTTCTA TTATTTTTGC GTTTTTGATT ATACACACTT TATTTAAAAC 60 

AAGAGAAACA AGTAATtCTT CAAGAAAATT AAGAATGTTA GTGTACACTT TTTTGGGGAT 120 

AATTTTTCTA TTAGGCTTAG TAAGTGTCGT ATTAGGAACC TATGGGCTGT TGAATTCTTA 180 

TAACATTAGT GGAAATTGGC GTATAAATAT GTCCTCTATT TTAGTTGTGT TGGCTGGAAT 240 

TTTACTAGTT GTACTTTCTT ATGTTGGTAT TTCAAAATTT AAAATGAATT GTTAGGTGAA 300 
TTAAATGAAG AAAGTTATCT CTACCATTTT ACTAGTAATC CTTTTTGTGG GATCAGGTTA 360

TGTGGTAAAC GCTGCTCCTG AAATGAAGAG TTTGACTAAT TTCGAAACGA ATGATCAACA 420 

TTATGATTTA CAAGGGATAG AACATTCCTT TTGCTAAAGT GTTAAACTTT AAAACAAGTA 480

GAAAAGATAT TACAGCAAAA ACGATATTAA AATTTGTATA TCTAAAAAAG ACAGACAATG 540 

AATTGAATAA TGCTTCTGCA TCTATTAAAG ATTGTATTCG CTATGCCACT TCTATTGGGG 600 

TAAATATGGA ATAGCTTGAT GCAATGAAGA ATGAACTTGA TTTCAACCAA GTGAAGAGTG 660 

AGATTGATAA GGATCATCCA GTATTACTAT GTTATAAACC AAACAAAACC GATTTAATGG 720 

AACCTTATTT ATTAGCTGTA GCGCATGGAT ACGTATACGT TGCTCCAGAT GGTCAAAATA 780 

CTAAAGAACC ATTTATGAGT ACAAGTATCT GGATGTCGAA CTATATATAT CCTTCTATCC 840 

TGACTACGTT GGCGAATAAA AATAAGTAAC AGTTACATAT TTTTCAGTGA ATAATGCGCC 900 

ACAAAAAGAT TATACATTTA TTGGTATGAT TATTTTTAAG TAAGAAAGGT TGTAAGGAAA 960 

TTGAAAAAAT ACATTTTGTG TTTGAACACA TTTGATTGAA GAACATCAAC GCGACTATCT 1020 

CTATGGTAAT TAGGGGCACG TACTAAGTGC CTTTTTGAAT GTAGAGTCAT TTATTATGCT 1080 

TAAGAAAAAA ACTTACACAA AACTCTATAC AGATCCGATT TCACTGTAAC TGCTTTTAAA 1140

GAAAAGACGA TATAAAGATA GTATAAAGAA ATAAACTTAA GTTTCACACT ATCTTTAGGT 1200 

CTTTTTTTTT ATGCTTACAG AGTAAAGAAA ATTAGTTAGT TTATTTAACT GGTTGTTTTG 1260 

AAAAGGAGTG AAGTGAAATG ACGATTATTT TAGGAATTAT TGGACTCTTA TTAATGATCT 1320 

ATCTTTTCTG GTTTTTATTT AAGGGGGAAG AATTATGAAT AGCTTGATTT ATCAAGGATT 1380

ATTTTTCCTA TTCATATTAT TGTTATTGGC AGTGCCGTTA GGcTGGTACA TTAAAGAAAT 1440

CATGCAGGGG GAAATCCCTA AAGGAGTATC CTTCTTACAG CCAGTAGAAC GTTTATTTTA 1500 

TAAAGCGATT GGACCTATCA GCAAAAAAGA AATGTCAGCG AAACGCTATG CACTGAATGT 1560 

GCTGCTGTTA AGTTTCTTCT CAATCGTACT TTTGATTGCT ATTTTAATGA CACAGCACTT 1620 

TTTACCGGGA GGTTCCTCAG TAAAAAATCT AACGCTGCCG TTAGCGATAA ATACGGCCGT 1680 

CAGTTATGTT ACGAATACCA ACTGGCAAGC ATATGTAGGT GAAACGACTT TATCAAATAC 1740

ATCGCAAATG TTTGGTTTGA CAGCTCAAAA TTTTGTTTCA GCAGGTGTGG GACTTTCTGT 1800 

TTTGGTAGCA TTGCTTCGAG GACTTTCACA AGTCAAAAAG AAAGCTTTAG GAAATTTTTG 1860 

GCAAGATTTA ACTCGTAGTT TAGTTTACAT TCTGTTACCG ATATCAATCA TTCTTGCTGT 1920 

TTTATTAATT TCACAAGGAA CCGTTCAGTC TTTCCAATCC GGAGTCGCTT ATCAAGGTTT 1980 

GGAAGGGAAA AGTCTTTGGT TGCATTTAGG TCCGGTAGCT AGTCAGGTGG CAATCAAACA 2040

ACTTGGTACA AATGGCGGTG GTTTCTTTGG TGCTAACTCA GCATACCCAT TCGAAAACCC 2100 

AACACTATTC TCAAACTTTT TAGAGAATAT TGCGATCTTG TTGCTCCCAG CGGCATTGAT 2160 

CTTTGCTTTT GGCTTTTGGG TCAAGGATTG GAGACAAGGA CGAACGATAA TGATAGTATC 2220 

CTTGTTTTTC TTACTATTAG CTTTCGTCGG CGTTGCTTTT AGCGAGTATT ACGGGCCTGA 2280 

ATTTACGCAT GTTCTTGGAT CTACCAATTT GGAAGGAAAA GAAATTCGCT TCGGCATTGG 2340

CTGGTCTAGC TTATGGTCCG TCGGTACGAC CGCCGCTTCG AATGGTTCGG TGAACGCGGT 2400

GTTGGATAGT TTTACACCGC TTGGTGGCGC CATTCCAATG TTCTTAATGC AGCTTGGCGA 2460

GATAATTTTT GGTGGTGTTG GGAGCGGACT GTATGGAATG TTGGCATTTC TATTGCTAGC 2520 

AGTTTTTATA GCAGGTCTTT TAGTTGGGCG GACACCTGAA TATTTAGGTA AAAAAATCGA 2580 

AGCTTTTGAT ATCAAAATGG CTAGCCTAGT TATCTTAACA CCATTGATGT TGACGCTTTT 2640

CGGGGCAATG GCTCTAGTCC TACATCCAGA GGTAATGTCT TGGTTAACGA ACCAGGGTCC 2700 

CCATGCTTTC ACTGAATTGC TTTATGGGGC AACTTCATTA GCCAACAACA ACGGAAGTGC 2760

TTTTGGTGGG TTGCTAGCTA ATACGACATT CTTAAACCTT TTAGCTAGTT TAATGATGGC 2820 

TGTTTCAAGA TACTTGCCGA TCATCGCCAT CTTACTTTTA GCGGAAAATA TGGGAAGTAA 2880 

AAAGAAAGCA GCTTTAAGTT CAGGAACATT ATCCACGGTT AGTCCAACAT TTGTTTCTAT 2940

GTTGATTATT GTTATTTTGG TAATTGGAGC TTTAAGTTTC TTACCATCAT TGGCACTAGG 3000 

TCCGATTGCG GAATTCTTTA CGATGAAATA GATGGAGTGA GTAAAAATGA AAAAAATCTA 3060 

TCAATGGGCA GTGGGACAAT CCTTTAAAAA ATTGGATCCA CGCCAACAAG TTAAAAATCC 3120 

CGTGATGTTT GTCGTTTATT TAGGTGCGCT CATCACTACA ATATTATGCT TCTACCCAAT 3180 

GGGTATACCG TTATGGTTTA ACATTTCAAT AACCATTTTT CTTTGGTTGA CTTTGCTTTT 3240 

TGCGAATTTT GCTGAAGCTG TTGCAGAAGG ACGAGGAAAA GCACAGGCCG ATAGTTTGAA 3300 

ACAAGCAAAA AAGGAAGTCA TGACTTATAA AATTAACAGT TTAGAAGATA TCAAAGAAGA 3360 
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GAATTTCATT GAATTACAGT CTTCTGATTT AAAGCGAAAC GATTTGGTCT ATGTACGTGC 3420 

AGGAGAGCAA ATCCCAGCTG ..ATG.GTGATG.T _ AATCGArGGG GCTGCaTCAG TAGATGaAAg 3480 

TGCCATTACT GGcGAATCCA G 3501 
(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494: 

ATTCATCAAC GACAGCGTTG ACAATCGCTC CTAAAATAAT AATCGTGGCA GCAAAATTCA 60 

ACCATAACAT CAAGATGATA AAACTCCCGA TGATTTGATA ACTTGCGATT CTAGAACTAA 120 

AATATTTTAC GTACAAaCCA AAAaTTTGTG ATAATAGCAT CCAACCGACT GTCGAAAAAA 180 

TAGCCCCTGG CAAAATAGAA CGTAACGATA ATTTCCGATT AGGAACGACC GCATAAATCA 240

AACACATAAT CACTAATAAA ACAACGGTCG TTAACGGCCA TTTTAACGCT TGGAAAGTAT 300 

CGATCACAGA TGTTGAGTAA TGGAAAATTG GTTGCAGTAA TTCAATAATG TACTGACCCA 360

GACCTAAAAT TACTACTACA CCAACAATCG CTACCATAAA TAACAAAATA ACTAAAAATG 420

AGACCACTCT TACGAGAATA AAATTTTTCC TTTGTTCGAC ACCAAAAGCT TTATTCATCG 480

CATTTTGTAG TGCATTAATA CTTTGACTGG CACTCCAAAA TGCTGCTAAA GCGGAAACAG 540 

AAAGCAACCC GCCTGAACGT TGAGTCAGTA GTGAACGAAT CGCTGGTTCT AAGTTCTTAT 600 

AAACATCTTT CGGAATCICC TCTGCAATAT AAGGCAACAC AGAATTAGGG TCGATTCTTA 660

AATAAGGTAA GACATTACCT ACCGCAATCA GTAATGGAAA CAGCGAGAGT AATAAATAAT 720 

AGGCAACAAC CACTGAGCTG TTGCCTATTT CGGCTGTGAC CATGTGACTT TGCGTTGTCT 780 

cAATAAAACG CATCAAAGAT TGATTCTTTT TGATGTTGTC GAGAAACTTC ATCACACACT 840

CTCCTTATTA ATAAGTCAAT TCTTCGCCTT GTGACGTTAA GATTCTTGGA CCTTCTTTCG 900 

TAATCGCTAA TGAATGTTCA TATTGACAAC TTAGACCGCC ATCTTCAGTA TAAGCAGTCC 960 

AACCATTTGG GTCCATTTTC ATGCGCCAAG TTCCTGTGTT AACCATTGGC TCAATGGTAA 1020 

TAACCATTCC CTCTTTTAAA CGTAGTCCTT TACCTGCTTC ACCATAGTGA GGAATCATTG 1080 

GACTTTCATG GATAGTTGGG CCAATGCCAT GACCTACAAA GTCGCGAACT ACCCCGTAGC 1140 

CTTCTCCTTC TACATACGTT TGAATTGCAT GACCAATGTC GCCAATTCTA TTTCCTACTT 1200 

GTGCTTGTTC AATCCCAAGA TATAGCGCTT TTTTGGTTAC TTCCATCAAG CGATCAATTT 1260

CAGGTGTTGA TTCACCCACG ACATAAGACC AGCATGAATC AGAAATTGCG CCCTTTAAGT 1320 

CGACACACAT ATCTACTTTG ATTAGGTCGC CATCTTTTAA TACTTTTTTA CGAGGAAATC 1380 

CGTGGCAAAT TTCGTCGTTG ATACTACAAC AGGTAGCATA TTTGTAACCT TCATAGCCAA 1440

TTTGAGCCGC CACGCCACCA TGACTTTCAA TAAAGTCTCT CACAAATACT TCAATATCCC 1500

AGcTTGTAAT GCCAGGTTTA ATAAACGTCC GTAAATGACG GTGAACATCC GCTAATAATT 1560

CCCCTGATTC ATCCATCATT TCAATTTCTC GTGGTGATTT TAATGTAATC ATACTATCTC 1620 

TCCTAAACTT GTTTCATTCA AATTATTTTA TCACATTTTT TGCAAAAAAG ATTGTTCAAA 1680 

CACCAATTTT CAAAAAGTCC CTTAATAAAA GCACTCGCTA GTCTTTTATA AGAAATGCGC 1740 

ATCTGCTATA ATAAGTGACA GAATTGTATG AGGGGGAAGA AAGTTTATGA CGTTAGCTAA 1800 

AATTGTGTAT GCAAGCATGA CTGGGAATAC AGAAGAAATT GCCGACATTG TTGCAG 1856 



(2) INFORMATION FOR SEQ ID NO: 495:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6645 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495: 



ATATGGTTAA GGAATTTGGA AATCGTGTAG TAATGCCTTT TGTAGATCAA GTAAATGTTT 60

ACTTATCTGA TATTGCAACA GATATGGGAT ATGATGAGGA GAGAAAATAT ATGATACATG 120

TAAGTGGTGG ACAAGCTCAG GTAAATATTT CCAACGATAA CAGCACCATA AATGCTAACC 180

AAAATGTTCA AATCAATCAA TCTAAAGTTG ATGATTTAAT TTCTGATTTG AAAAGTAATA 240

TTGAAAAGGA ATTAGTGGAT AATGAACCTA TTAAAGATGT TTTGTTATCT CAATTGGCTC 300

TGATTGAGTC ACAACAGACT GAGAGTGAAC CTCAAAAAAA TGTTTTGAAG ACGGCATTTG 360

AGACAATGCA ATCTTTACTT AAGACTGCCC CATTAGCAGT AACTGCTGTT GAAAGTTGTA 420

ATAGATTGTA TGAACTGATG TCACCTTTAT TTAATTAGGA GCAAAAATTA AAATAGTTTT 480

AGAAGCCAAG AGCAAAAAaT GTtCTTGGCT TTttACTTTG CAaTGAaTGA GGTtATTTAT 540

GACATTTTtA GATTtATTtG CCGGGaTtGG TGGTTTcGTT tAGGTATGGA ACAAGCCGGC 600

CACCACTGTA TCGGCTTTTG TGAAATTGAT GATTTTGCCA GACGGAGTTA TAAGGCAATT 660

TATGATACAA GTGAGGAGGT GGAAATGTAT GACATCACAA GTGTATCAGA TGAGTTTATT 720

CAATCCCTCG GACCAGTGGA TATCCTTTGC GGCGGATTTC CGTGCCAAGC TTTTTCAATT 780

GCGGGAAAAC GGCAAGGATT TTCCGATACT CGAGGTACTT TATTCTTTGA AATCGCTCGG 840

TTCGCCGCTC TTCTCCAACC TAAGTTTTTA TTCCTTGAGA ACGTCCGGGG ATTACTCAAT 900

CACGAAGGAG GGGCTACGTT CGAGACAATC CTCCGAACGT TGGATGGATT GGGGTATGAT 960

GTGGAATGGC AAGTGCTTAA CTCAAAGGCC TACGTTCCCC AAAGCCGCGA GCGGGTTTTC 1020

CTTATCGGAC ATTCTCGAGA CATATGTACC GAACAAGTAT TTCCTATCAT TGAATCGTCT 1080

GCAACATCTG ATCAAAACCT CAGAAACTTG TTAAATACCA ATCCTTCCAA CCGAGGAATG 1140

AGTGAACAAG TCTATGGTTC AGATGAGTTT GCCCCAACCG TAACAGGCGA TGAAGGAATT 1200

AAAATCGCAC TACCAGTAAA TGATGGAATT TCTGTAGTGG GTATGTTGCC GGGAAATTTT 1260

GAACAAGGTA ATCGGATTTA CGAGGTAACT GGAACAGCTC CCACCTTGTC AACGAAGCAA 1320

GGGGGAACGA AGATTATGAT TCGAAAAAAT AAAATAGAAT ATCAAGAGGT TAAGCCAGAG 1380

GGCAGTGTGA ATCTGGCTTT TCCTCATTCC AAAAGTCGCC GAGGTCGTCT TGAAAAGCAA 1440

AGCGTCCATA CGTTACTAAC AGGAGATCAA CAAGCAGTCA TTACAGACCA GTATCAAATT 1500

CGGAAGCTTA CACCAAGAGA ATGTTGGCGT CTTCAGGGTT TTCCCGATTG GACTTTTGAT 1560

CGAGCCTCTC AAGTCAATTC AGATAGTCAG TTGTATAAGC AGGCCGGAAA CTCTGTAACA 1620

GTTCCGGTCA TTTTTGATAT TGCCAAGAGA TTAAAGGAGG TGAAAACGGA TGATTTATGA 1680

TGAATTGATT GGAGAAATCT AATGGGTGAT TGGCAAGATT CACTCAGATC CGGAACTGAA 1740

AGAAGAATTA CGTCAGCTTA ATTTTGATAT TCGCAAAAAT GTTGTGAAAG TCACTGGTGA 1800

TTCGTATGTA ATGGACGAGG GAACAGATAC TCGAATTGAG CTGAATCAAG TCATCAAAGA 1860

ATTAGAACGG ATTGCGTACC TTGCGAAAGA ATCGGATATC CGTCAATACT TGTTTGAGAT 1920

AAAAGCAGAA CTGGAAGTAG GTGGCATTAC TGAAGGGTAG CAGCCGAATA AACTAACTGA 1980

TGGGCGCCGA CCGCGGGGGA TTTGCGAAGC AGAAATCCTC TGTGGTGCTG TGCCTGTTCA 2040

TGGACGGCGT CAGCCGGCCA TGATGTCTCA CCCCCCCGTA TCTAACAGGG GGGTACAAAA 2100

ACACAAAAAA TGAGCAAGTA ATCATTAAGA ATTATTGGTG TAGCAAGGAT ATTAAGTAAT 2160

TTGCATAGGT GTTAACTCAT TAAAAAAAGT TAACACCTTG AATTTGGAGG TGAGAACTTG 2220

GCACAACGGA ATTTAGATTA TCGATTATTA AAAGACCGAC GTAATGAATA TGGTGTTTCA 2280


CAAAATAAAT 


TAGCTATCGC 


TTGTGGGCTT 


AGTCGGACCT 


ATTTAAATCT 


GATAGAAAAC 


2340 


GGTGGCGTAA 


CTGCCTCTAC 


AAAAACCATG 


AGAAAAATTT 


TTGACCAACT 


GGAAAGTTTT 


2400 


AATCCCGATT 


TACCTTTAAC 


GTTACTGTTT 


GACTATGTAA 


GGATTCGTTT 


TTCCaCAACG 


2460 


GATGTGCGGA 


AGATTATTCA 


AGAGCTTCTC 


CATCTGAAAT 


TTGAGTATAT 


GCTTCATGAA 


2520 


GATTACGCCT 


TTTACTCCTA 


TCAGGAACAA 


TATGTCATGG 


GAGATATTGT 


AGTGATGTTG 


2580 


TCTCATGAAG 


AAGATAAAGG 


TGTTCTTTTG 


GAATTAAAGG 


GTCGTGGTTG 


CCGGCAGTTT 


2640 


GAAACTTTTT 


TACTCGCTCA AAAGCGTAgC 


TGGTACGACT 


TTTTCGAAGA 


TTGTCTAAAA 


2700 


ACCGGTGGTG 


TGATGAAACG 


CTTGGACTTG 


GCAATCAATG 


ATCGAGTGGG 


GCTACTGGAT 


2760 


ATTTCTGATT 


TAACGAAAAA 


ATGTCAGAAG 


GAAGAATGTA 


TCTCCTTGTT 


TCGTACCTTT 


2820 


AAAAGTTATC 


GTTCCGGGGA 


ACTTTTAAAA 


GCTGATGAAA 


AGGATGGCAT 


GGGAAATACC 


2880 


TTATATATCG 


GCAGTCTCAA 


GAGTGAGGTT 


TATTTCTGCT 


TGTACGAAAA 


AGATTATGAG 


2940 


CAATATATCA AGTTAGGGAT TCCATTAGAC [remainder of row illegible] 3000

CGACTGAAAA ATGACCGTGC TTATCATGCG ATTCAAGATT TGTTAAAAGG TCGTAGCATT 3060


GAGAGCACCA CGTTTTCTAT TATCAATCGT TACTTGCGAT TTGCGGATAA GGTGGAAGGC 3120

AAACGGCGAA CCAACTGGCC CTTGAATGAA CAATGGGGTC GCTTTATTGG ACGGAATCGG 3180

AAGGAAATTC AATTAACTTC TGAACCGAAA CCTTATACCA TTGAACGAAC CTTGAATTGG 3240

CTTGGACGTC AAGTCGCCCC TACGTGGAAG ATGGCAAAGG AATTGGATCG GCTGAATCAG 3300

ACAACGTATA TACAGGATAT GGTACGGAAT GCACGGTTAT CAGACCAGCA TAAAAAAATT 3360

TTGGAACAAC AAAGTATGGC AATCGAAAAT TTGATTGTAT GAAAAGAGGA ATTTTGATGG 3420

GAATTGTAAA AGATATTTTA TTGGTATTTG GTGGCAGTAT GCTTGGTGTT ACTGTGATGT 3480

GCTTGATGCA TGCAAGTACA GCAGCAGACC GAGTAATGGA AAAGAAAGGA AAGTGACGAG 3540

ATGAATTTTG GACAAAATTT ATATCAATGG TTTTTGACCA ATGCTCAATC CTTAGTCTTA 3600

CTAGCGATTG TAGTCATTGG TTTATTTTTA GGATTTAAGC GGGAGTTTTC TAAATTAATT 3660

GGCTTTTTGG TGATTGCCTT GATTGCGGTA GGTTTGGTCT TTAATGCTTC TGGTGTGAAG 3720

GATGTCTTAC TCAATTTATT TAACCGAATT GTTGGTGCAT AAGGGGGAAT CACGATGGAG 3780

CAGATGCGTG TTTATATTGC GAACCTTGGA AAATACAATG AAGGAGAGTT GGTGGGGGCG 3840

TGGTTTACTC CACCTGTCGA TTTTGATGAA GTCAAAGAAC GAATTGGGTT AAATGATGAT 3900

TATGAAGAAT ATGCGATTCA TGATTATGAA CTACCCTTTG AGATTGATGA GTATACATCA 3960

ATTGAGGAAA TTAATCGATT GTGTGGCTTA GCGGAGGAAT TAGAAGGAAC GCCAATTGGT 4020

GAGGTTGCTT CAGAGATTCA GCACGCTTTC TTTAATTCTT TCGAGGAAAT GGTGGAGCAT 4080

GTGGATGATA TTGTGTATTA TCCAGATTGT AATGATATGG AAGATTTAGC TTATCAAATG 4140

GTAAATGAGG GGTATTTAGG AGATGCTCCA GAAAATTTTG TAAGGTATTT CAATTATTCC 4200

TCTTTTGCTC GCGACTTGGA GATTGAGGGA AACTATTTAG TTACTAATCG AGGTATTTTT 4260

GAATATCCAA TCTAAATATA AGCCATTCAA CTTGAATGGC TTATATCTCA ATTTATTAGT 4320

GGAAGTTTAT CAGTGTTAGA AAGCGCCCAA ACAATTGCAT TACCTGCTAA AGTAACAATT 4380

TTAGGATGTT TTTCGAGAAA ATCAGAAAAT CTTTCTAAAA CGTTAGGTTT AAAATCTTTA 4440

GTTTCTACGA TATTGATAAG CTCTTTGCCA ACTTGTTGCT CTTCAGGTGG TAAAGCATCA 4500

ATATATTCTT TTAATTCATC AAGAGAATTA TTCCAATTGT TTGTAACGGT ATTATTGTTT 4560

CCGAATGTCG CTCCTCTATA GTCACCTGAA GAGTTAAAAT TAAAGGTATT TGTCGTCTGA 4620

ATCGATTGAC TATTTTTTTC TGAAGGATGA ATTTTCTCTG CCCCCTCTTC TGTTAGTTCG 4680

ACATCATCTG ACGAAAAGAC GAATAGACGT AGAGGAAAAT GCCGAGAAAA ACAGGGGAAA 4740

TGCCCGATTA TCTGTCAAAA CTCCGACAAA AAATGCAGAA AAGATGGCCT ACCTCTGGCA 4800

CACACCACAA AAAAACAGAA TATCGAAACC ACAAGCCGTG CAGTAACTGA CAAACAATCA 4860

GCTACTGCAC GGCTTTTTTT GTTATAGATC GCTTTGCCCC GACGACACAG ATCGCCGGGG 4920

CTTTTTTCAT TTAGACAGGA GGCTTTGAAA ATGCCGGAAA CATTGAATCT GAATTACTAC 4980

TACGGGAATG AAGCCGACCA GTACAGCTTT TACCGTATCC CCAAAACCCT GCTGACCGAC 5040

CGCCGCTATA AGGGCGTGTC ACTGGAAGCA AAGGTACTGT ATGGCTTACT GCTTGACCGC 5100

ATGGGGCTGT CGGCCCGGAA CGGCTGGCTG GACAATAATG GGCGGGTATT CCTCTACTTC 5160

ACGCAGGAGG AAGCCATGAC CATGCTGGGC TGCGGCAAAG ACAAGGTGAC AAAACTGTTC 5220

CGCGAATTAG AGGGGATTGG CCTGATGGAA CGGAAAAAAC AGGGGCAGGG ACATCCCGCC 5280

CGGA1ATACG TTAAAAATTT TATCTTGGAA CCGGAACCCA CCGCGCCGGT TCAGACTGCG 5340 

GAAAATCAGC AGTCAAGACC GCTTCCCAGT GCCGCCGTCA AGACTGCTGA AAAACCGCAG 5400

TCTGCACCAC GGAAAACCAG CGGTCAAGAG TGCGGAATTT CCGCCCCTAA TAATACTGAT 5460

ATAAATAAAA CTGAAAAAAG CGATACTGAT CTATCCATCC TACCCCCTAG CCCCTCGGCA 5520 

GTTTCTTCTC CTGCTGGTAA AnrCGCTCCG CGCAGGATGG GATTGGATGA GATGGAAAGC 5580 

TACCGTGAGC TGATCATGGA GAATATTGAC TACGACATTC TCTTAGAACG TTATCCCTAC 5640 

GAAAAGGACT TGCTGGACGG CTATGTGGAG TTAATGCTGG AGGTCTGCTG CTCCACACGG 5700 

GAATCTGTCC GTATCTGCGG GCAGGAAGTA CCCACCGAAG TTATCAAAAG CCGGTTCCTG 5760 

AAGCTGAACA GCGAACATAT CGGCTATGTC ATGGACAGCC TGAAAAGCAA TACTGCCAAA 5820 

ATCGGCAACA TCAAGGCATA TACGCTGGCG GCACTTTACA ATGCCCCTGT CACGATGGGG 5880 

CAATATTACA CATCCCTTGT CAGCCACGAT ATGGCGCATG GGCTGCTGAG TGGTTAAACC 5940

ACCCAGGGCT TTGCGATACA TCAATCAAAC CAAAGGAGGA CATTTGCATG GAACATGAAA 6000 

TCAACATTCT GGTGGTGGAA CCGGGCAGAC CGCCCAGGCC GGCACGGGTG GACTACTCGC 6060 

TGGAAACTTT TTCACAGCTT GTCGGCGGGG ATCTGGCAAT GGGCTGTTTC CTTCCCCAGC 6120 

GGGTCATGCT GCTTTATAAC GAGGAGGCCA ACCGCCAGGG ACTGCCGCCC AACCGCTGCA 6180 

ATCCCTTTGT GAACGAGTGC ATCAGCGGTA CATTCCTGCT GTGCGGGCTT TCAGATGGGG 6240 

AGGGCTTCTG CTCCCTTTCT CCTGCCCAGC AAGCCGAGTT TCAGCGCCTT TTCGCCAGCC 6300 

CCGGCGAGTT TATGATGCTG GGGGCAAACC ACATCTGCAC TTCGCCTGCC GAGTTTGCCG 6360 

AAACTGTCAG CAGGCTTTGG GGCGGTATGG CAAGCGGTGA GTCTGTGGTG CTGACCAAGT 6420 

GGGGCGGCAC GGAAAGCGGG GCCTGAACAT GACAAAACAA AATAGCAATA CCGACCGCCG 6480 

CAGTTTGCAG GACTGCGGCG TTTTTTCATC GGCAGAATAC CATTCAGTCG GGTTAAGCGG 6540 

CGGCAAGGAT TCCTCGGCGC TACTGCTCCT GATGCTAGAA AAAGGGATGC CGATTGACAG 6600 

CGTGATATTT GCCGATACCG GCATGGATTT CCCGGAAATG GCGGC 6645

(2) INFORMATION FOR SEQ ID NO: 496:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1075 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496:

GGAACACAAG TnAGTACACA AGATACTGTT GAGGTTGATG GTGTACCAAT TTATCAAGAA 60 

GaATATGGTT ATTaTTTATT CTATAAACCA AGAGGGGTTA TTTCTGCGGT GTCTGACGAT 120 

AAAGGCCGTA AAGTAGTAAC GGATTATTTT ACAGATGTTA CTGAGCGAAT TTATCCAGTC 180 

GGTCGTCTGG ATTATGATAC TTCTGGGTTG TTACTTTTAA CGAATGATGG AAGTTTTTCA 240

CAAAAATTAA CCCATCCTAA ACATGAAGTA GATAAAGTCT ACGTAGCAAA AATTAAAGGT 300

GTTCCAACTA AACGGGATCT ATTGCCATTA GCCAAAGGCA TTCGCATTGA TGGCAAACGC 360

ACAGCACCAG CCAATTTTCA AATTCTTTCA GCGGACATTA AAACAGGTAG CAGTGTCGTG 420

GAATTAACGA TTCATGAAGG ACGTAATCAT CAAGTGAAGA AAATGTTTCA AGCTGTTGGT 480

TTCCCTGTTC AAAAATTAAA ACGTGAACGT TACGGTGAGT TGACCTTACA AGGTTTGCGA 540

CCAGGGCAAT ATCGTGATTT AACTAAAAAA GAAATCAGTC AATTATTAAA TCAAGCTAAA 600

TAAGATAAAA CGACTCGATC TCTTCGCGCT TATGGCGAAG GGATTGTTTT TTTTGAGAAA 660

CGAAGCAATC AATGAAATTT AATAAAAAAG CTTAGTTAGT TGCATTCATT GTTCAAATCG 720

GTTACACTAA GTAAGTAAAA AAATATAATA CAAGGTTCGT CTTCAGGGGC AGGGTGTAAT 780

TCCCGACCGG TGGTTATAGT CCACGACTCG TTTTTAACGA TTGAATTGGT GTAATTCCAA 840

TACCGACAGT ATAGTCTGGA TAAAGAAGAT AGGGCTTATT TGAGACGCTT TTTCATCAGA 900

TAATCCTACT CTATTTTTCC CTGCAGAAAA ATAGGGTTTT TTTGTATGAC AAAGAAGCGA 960

ATCAAAAAGT TCGTTGAAGa TGGGTCCTTA ATTGGAGGAT TtCAGATGAA CAACAAGGTA 1020

CAAAAAATGG TCAGCATTGC AATGTTGGCC GCAATnGGTA CAGTATTACA ATTTG 1075



(2) INFORMATION FOR SEQ ID NO: 497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497: 



TACCACATCC CACCACTTTC TACCACTATA AATCTAAAAA CTTCTATTTT TTAACTGTTA 60

TTTAAAATTT CTGTAATATT TAnAGGGTTT TATGTAACAA GCATCACCCA ACAAAAAATG 120

TTATTATAAC AAGGTTTTTA CTATGTTTTT TATAAAATAA CCCCACAAAT AACAAAAAAA 180

CCCCACGGAG gaAGtTGTGG GGGACATTTT CCCACAGCGT ATTTCCCCTA TTGTCTCCAC 240

GCTTTATTAC AGTAGATAAT GACTGATATT TAGCAAGATA AGCAACACAT ACAACACAAA 300

CGTTAGTAAA AACGTCAACC GCCAAAACAT CTTGAAATAT CGTCTATATA AAATCTCTCC 360

ATAATAATAA GCTTGAAACA GGGCAACAGC GACTCCTAAT AACAAAATAG AAATCACAAA 420

ATACGGCATA ATCGATTCAT CATAACTATC TTTGGATAAC TCATGGATCC CTATGAATAA 480

AAACGGAATG GCCAAATCCG GTGCTTTTAT TTTCCAACGT TCCGTTAGGG AAAAAGTGGA 540

CACGAGaAAA TTGCACGCAA AGAGCACAAT TACCGGAAAA ATATACCAAA ATAACATTAT 600

TGCCGAAAAC GACACCATTC TCTCAACAAA TCCTTTCATT CCACCTGCTA TAACTATACT 660

TCATCAACGT TACACAGATG TTTCATTTCG TTTAAATATA TGTGCTAAAA GCCCTTTTTG 720

CTTGCTTTTT CATATAAAAA ACGGATTACT TATCAAACCA TTTTCAAAAC GAGGCCTCAC 780



WO 98/50555 PCT/US98/08985 




CTATCGCGGC CACTTATTTT TGAATGTTTT TTATAAGCTT TTCTTCTCGT ATTGTGTCAT 840


ACTTCTATTA TAATGGATTT AGAACAAAAA ATGTTGATTC GGGGTGAATC TTTTGATTAA 900

TTATTTATTA TTGAAAAATG ACTATTTTGA ACCTTGTTCA CCAGACGATG AAGCCCTAAG 960 

TTGGATTTCG GTAGAATCTC CTACAGAAGA AGAAATCGAG CGTCTCGTCA ATCAATACCA 1020 

CTTACCAACA GATTATCTTA CCGGCGTTTT AGATGATGAA GAAAATGCTC GAGTGGAAGG 1080 

CTTCCGCCAC GAAAAATTAC AAACGCCTAC GTTACTTTTA TTTCGTTATC CCAAAGCTAG 1140 

TATCAGTCCT AGTGGCTATC TGCAAGTAGA AACGGTGCCG ATTGCTTTAA TCGCAACCAT 1200 

CGATAACAAA CTAATTACTG TTAGCAACGG GCCTAACGAC ATCGTTCATG GGATTCAAAA 1260 

AGAGGCGTTC ACCCATCAAG ATCTTTCTAT TGAGAAGGCC TTAATTTTAG CCCTTTCATG 1320 

GAAGATGGCC CTTTCCTTCA ATAAAAACTT ACAAGCACTT ATCCAACAAA CCAACAAACT 1380 

TGAAGGAGAG CTCCAAGTAG CCACAGAAAA TAGTCAATTG TATCAGATTA TGGACATTCA 1440

AAAAAGTCTA GTTTATTTCG AAGCAGCTTT AACGGATAAT TTAAAAGTAT TAAAACGTTT 1500 

ATATAGCGCT GAAATTTTTA ATCATCCAGA AAAACATTTG CCATTTTTAA GGGATATTCT 1560 

GATTGAACTA GAmCAAGGAC TGAATACGAC TAAAATACAa CTAAAATTAG TTGATAcATT 1620 

AGTAATCCTT TTTCAGCAAC CGTTCCAATA CTTAAATAAT GCATGAAATT TACAnCTTAC 1680

AATGGCCTG 1689



(2) INFORMATION FOR SEQ ID NO: 498:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498: 
ATATnATTTG ATTAnnAGTT CATCAAATTA CCCTGCATTA CAACAATCTA TATCAGGTAT 60 

TTACTCAATT TCAAATAAAT TATTTTTTGG CTCATTAATA TTTGCAGGAG TTGTAGTTTC 120 

ATTATTATTA TTCTTATGGA TGAATGCTAG AAAGAAAGAA ATAGCAGTAT TACTATCATT 180 

AGGTATATCA AAATTAGAAA TTTTTGGTCA ATTCCTAATT GAGATGGTAT TTATATCAAT 240

CCCCGCATTC GTTGGTTCCT ATTTCTTAGC TCAATATACA GCTGACAAGC TTGGGAATAA 300 

TATATTGAAT AAGGTTACTG GTGATATAGC TAAACAAATA GCTAAACAAT CTGCATCTAG 360 

CCAATTAGGT GGTGGGGCAG AAGCTGAAGG ATTTAACAAG ACATTATCAA GTTTAGATAT 420

CAATGTATTA CCTAAATTCA TGATTTATGT AGTTATATTT ATGAGTTTAG TACTTCTTGT 480

ATCATTGATT ATTTCTTCAT TCAATATATT AAGAAGGAAT CCArAAGAAT TATTAATTGA 540

CAATAATTAA AATTTTCGGT GCTTTTTAGC ACCAAAAATG AGAGTGTAGT ATAAAGGCTC 600 

TTtGTCAAAT GATGTTGGTA AAGAGAAATC TTAGAATAAT TTGGACAGAA CAGArAGAAG 660 

AAACCCTCTT CCtGTTCTGk TTATyTGGTT AAAAATGtTT TGCCATGACG AATCATGATC 720

CGTAGTTTGA AATTGTAmAA GCTGCGAAAG CCGTACGCAT TGCGTTTTAA TACTTTGATG 780

TGGTTGTTTA AACATTCCAA TGGTCCATTT GAATAGCTGT AGTGAAGCGC ATTTTGGATC 840

TGTGGCAGAA ACTTTTTAAA CGTTCGTAAA GTGGTTGTAT AGGTTTCAGG AAGACCGCTG 900 

ACTTCTCTGT TTAAACAGTC CTCAAATAAT GACTGATCTC TTTTCTTTAC CGCATAAAGA 960 

ATGTCTTGGT ATAACTGATA GCCTTGGCGC AGGTTGGGAC AAGCAGATAA CAGTCGATCG 1020 

ACAATCTCAC TTTCAGTCAA ATAATCTTTA AAACTGGCAC GCCAAACCCT TTTTTCGTAG 1080 

TTTAACTTTG CTTCTTCTTT TTGTAGGAGT TTCCAATAGC GTTTCAATTT TTTCCCTAAC 1140

CCTCGTTGAG AGTGGGAAGT TCCTTTAAGC CGTGCCGTGG TTTCTTTCAC TCGATGATTC 1200 

AAGAAGGTTC TGCCAATATG TTGGACGATg TGGAAACGGT CGATGATCAA CTGAGCATTC 1260 

GGGAAAAGCT TTTTCACCAA TGAAACATAA GGCGCATACA TATCAATAAC AATCCATTGA 1320 

ACGGCTTCAC GAGTCGCTAA TGAAAACCGA GAAAATAGCG 1360 
(2) INFORMATION FOR SEQ ID NO: 499:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499: 



TTTTTTATCG GnCTCGCCAT GCCCGTTTCC AACTTGGACG GGAAGnTTGA AGCTTACTGC 60

GAAAAGGAAG TTGGCTGCTT nCAGGAAAAG TTGGAATTCG CAACCACTTT AAACCAATGA 120

GgGAAtTACm AGkTGrAGCT TCTTCmATGn ATAwTAArGT CTCTGAGtTA aAAGarAAAA 180

TGGCGGATaA TGaAGAATTA TTGGCTCAAA TTTCGACAGA CAAACAGAAA GAGCAACAAC 240

GCTTAGTCAG CGAAAAAGCA GCAGCTGAGG CACAGAAAAA ACGTGCGGCA GAAACTGCCA 300

AAGCAGAAGC AGCGAAGAAA CAAGCAGCTG CCGACAAACA AGCGCAAGCA GATGCGGCGG 360

CTAAACTAGA AGCCGAAGCT GCTGAAGAGT CAAGCAGTCA AGCATCAAGC TCTTCTACAA 420

GTATGGAAAG CAGTAGCACT GAATCTTCAA GTTCAGCTAG TTCATCCGAA TCGTCATCAA 480

CGCCAGAATC ATCGACAGGC GAATCATCAA CACCAGGAAC TGGyTCTGGT CGTGTGCTTC 540

AGATGGAATC AACTGCTTAT TCTTGGCGCG AAGCTGGTGC AAGTAATCTA AGTGCTaCTG 600

GTATTGACTT AAGTAAAGAA AGCAATGTAG TTGCAGTCGA TCCAAGTGTA ATTCCTCTAG 660

GTTCATTAGT GAAAGTCTCT GGTTATGGCT TTGCTATTGC TGGGGATAcA GGTGGCGCTA 720

TTCAAGGCAA TATTATTGAT GTTCATTTTG ATTCAGTAGA CCAATGCCGT CTTTGGGGAC 780

GACGTCAGGT TACAGTAGAA ATTCAATAAT AAATTTGTTA TACTAAAGAT AAGATAAAGG 840

GAATTGGCTA AGTCAGTAAT GATAGTAGCC AGTTCTTTTT TAGTTAAATG TAAAGGAGAA 900

TTACAATGAT TCGCTCAGCA ACAAAAGAAG ACGGTCAAGC GATTGCTCGG CTGGTCTTAG 960

TTATTTTAAA AGATATGGAG TTACCAATTT TAGAAGAAGT CTCAGAAGAA CAAATGATTG 1020

ACCTGTTAGC AGAGGCAACG GCGTATCCAA CCTATCGCTA TGGCTATCAA CGAATTTTAG 1080 

TTTATGAACA TGCAGGTGAA GTTGCAGGGA TTGCTGTCGG CTATCCTGCA GAAGATGAAA 1140 

AGATTATTGA TGAACCTTTA AGAGAAGTCT TTAAAAAACA TGGCTTAGCA GAAGATGTTC 1200

GGCTATTTAT TGAAGAAGAA ACCTTGCCTA ATGAATGGTA TTTAGACACG ATCTCTGTGG 1260 

ATGAACGTTT CCGTGGAATG GGAATTGGTT CTAAGTTATT AGATGCTTTA CCTGAAGTAG 1320 

CCAAAGCTAG CGGGAAGCAA GCCTTGGGTC TAAATGTTGA TTTTGACAAT CCAGGGGCGA 1380 

GAAAGTTATA CGCAAGCAAA GGCTTTAAAG ATGTCACAAC AATGACTATT AGTGGTCACT 1440

TGTACAATCA TATGCAAAAA GAAGTGTAAA AATAACGAAG CTAAAGTCAA TCACTTTAGC 1500 

TTCGTTATTT TTCATTAATC CGTTAAAGGA TCAAAATCTA ATCCAAAATC ATCGGAAGTT 1560 

TTTTGTTTTT TCTTATTATA TAACCAGTAG TCAAGGACAG CTGCTATGCA GATGCACAAC 1620

GGTGCGTCCT CATCATTGGC AACAGTTAAA ACAAAA 1656
(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 



TTACTTGGAC TGCTTCTGCT TCAGTACTTT CTTTCATTTT TTCAGTTCGT TGATCTGGTT 60

GGCACCCCAT TAAAACAATC ATTGAACAAA ATAACCCGAC AACTATCTTT TTAGTTTTCA 120

TTTTCCCTCC TTTTAAAACT CATTGAATTA CCATTTAATT TTAATATAAG TAGAATGAAA 180

AGTATAGCTA AGCAATCGTT TTTTCTTCAT ACGGCGATAA ACTTAACTGT CCAGATAAAT 240

ACTCACCTAA ATAGATTTCC TGTAATCTTT GAAATCCTTG TTCTTGGATT TGCGTTAATA 300

GCCAGTGCTC ATCTTTTTTG ATTAACTCTA GGACATCCAT GTTGGCTTGA CCGTCCACAA 360

TAATCGGATA ACGAATACTT TCATCGCCAT ATTCGACGAT TGTTAATTGA CCATTTTGTT 420

CTAAAACCGC CCGTTTAACA AGCTGGACTT CATAAATACC ATTTGCTCGT AATTTAAACA 480

TTAGATCATT TGCAGAAATA CCATTTTTCA AACATTCCTT CACCACAACC TGTCCATCCT 540

TGATTAAAGT AATGGGTTTA CCATCCACAA TTTGCTTCAC ATAGCGATTG TGTTCTTTTG 600

CAAATTTTAA AATTAAGACT AATAATGTCC ACAAAATTAA CACTAATACA AATTGGAGCA 660

CCGTAATTGA ATCATTATAA ATAACACCAC CAATAATGCC CCCCAATACA TAGTTTTGCA 720

CTTGGTCCAT AGCCGATGTG GGCGCCAAAT TCCCTTTTCC CATCAAATTA ATTTGGACAA 780

TTAAACAAAT AATACCTAAA GCAAATTTAA TAATAATTGG ACTATAAAAC TGCATTAATT 840

TTCCCTCACT TTACTTTATG ATTTCAATCG TTGGTGTTAA TAAATACGTT TCTTGTAACG 900

TATAAGACTG CTGATCCGTA CTTAAACTCA TTTTGTAAAA CACTTCGTCA ATTTTAACTA 960

TGGTTCCGTC AGCCAATTGT GTACCATTGA CATAAACTTT TTCTGGAGAA ACTTTTTTTT 1020

CCTGTGCCAC TTGTTCGACA AAATGTACCA TTTGTGAAGA TTGTGAATGC TGATTTTGGT 1080 

TCAGTTGATA ATTGGAATAT TGAACCCCTA AAACAAATAA TAATGATAAA AAAATAATAA 1140

TACTTAATTC ACGATATTTC GTTTGAATTT GATGTCGCAC ATAAAAACTA rACACAATGA 1200 

CTAGCCCGAT TAACAAAGcA AAAATAAAAA AATACTTTAA GTAATCATTT AAATTAGaTT 1260 

GGCTTTTTAA ATACTCGATG CCATAAAAAT TCATGTTCTT GTCCACTCCT TGATAGGTTC 1320 

GTTATAGTAA GTATTCTATC TCAAAAAAAA CAATCACCCC TACTTTAATA CCTTATrACA 1380 

AGAAAAAGCT CAGCGCAATC ATCTTGAGAA TGACTGCGCC GAGCTTCAAT CTTCTAAAGA 1440

TTCAGCTACT GATTGATTAC AAATTAATTT TACATCAAAC TTTTCAGTTC AACATCTGGA 1500

TATTTATCCG CAAACCAGcG TTCCGCAAAT TGGTTTTCAA ATAAGAACAA AGGTTGATCG 1560 

AAACGGTCCC GTGCCAAAAT GTTTCGGCTA GAGCTCATGC GTTCATCTAA ATCTTCTGGA 1620 

TTAATCCAGC GGGCAATTTT AGAACCCATT GGCGTCATTA CCACTTCAGC ATTGTATTCA 1680 

TTAGACATTC GGTATTGGAA AACTTCAAAC TGCAATTGAC cTACGGCGCC AATGATATAT 1740

TCTTCCGTTA AATATGTTTT GTATAATTGA ATGGCTCCTT CTTGAACCAG TTGGTAAATT 1800 

CCTTTATGGA AAGATTTTTG TTTCATTACG TTTTTGGCAG TCACTTTCAT GAATAATTCA 1860 

GGGGTAAATG ACGGCAACTC TTCGTATTGA ACATTCATTT TCCCTTCATA CAAAGTATCG 1920 

CCAATTTGGT AATTTCCTGT GTCATACACG CCGATAATAT CGCCTGCAAC AGCTTCTGTG 1980 

ACGTTTTCGC GGGCATCCGC CATAAATTGC GTCACGTTAC TTAATTTGAC TTTTTTGCCT 2040 

GTTCTACCTA AGGTCACGTC CATGCCTCTT TCAAAAACAC CTGAACAAAT TCGAACAAAA 2100 

GCAATTCGAT CACGATGGGC GGGGTTCATA TTCGCTTGAA TTTTAAAGAC AAAGCCTGAA 2160 

AATTCTTTTT CATAAGGACT TACTTCGCCA CCCTCTTCTG TTTTATGCGC ATGTGGCGCA 2220 

GGCGCAAATT GTAAGAAGGT TTCTAAAAAT GTTTGAACCC CAAAGTTTGT CAAAGCAGAA 2280 

CCGAAGAAGA CTGGCGTTTG ATCACCACGA GCGATTTTTT CCTCGCTAAA TTCATCGCCT 2340

GCTTCTACCA ATAACTCAAC ATCTTCCAAC ACTTGTTGAT AAACGCTGTT ATTGTGCAAT 2400

GGTAAATCGC TTGGTATATC GCCATCTTTT AAmGGAATAA AACGCTCCCC GCCATTATTT 2460

TCTGGACGAT ATAATTCCAC GCGTTCATTA TAAATATCGT ATAGACCTTC TAGGCCTTTT 2520 

CCCATACCAA TTGGCCAGTT CATTGGATAA GATTCAATAT CCAATAGTTC TTCCAATTCT 2580 

TCTAAAAGTT CCAATGGTTC CCGTCCGTCA CGGTCCAATT TGTTAATAAA AGTAAAAATT 2640

GGAATACCAC GTTTTTTAAC AACTTGGAAA AGTTTTTTCG TTTGTGCTTC AATCCCTTTG 2700 

GCACTGTCAA TAACCATCAC AGCACTATCC ACGGCCATTA AGGTCCGATA CGTATCCTCT 2760

GAGAAATCCT CATGCCCTGG GGTGTCCAAG ATATTAATCC GTTTGTCTTG ATAGTCAAAT 2820 

TGCATCACAG AACTCGTTAC CGAAATTCCC CGTTGTTTTT CAATTTCCAT CCAGTCAGAT 2880 

TTGGCAAAGT TCCCTGTTTT TTTCCCTTTG ACTGTTCCAG CTTGGCGGAT GGCGCCCCCA 2940

AATAATAATA ATTGTTCAGT AATCGTTGTT TTCCCGGCAT CCGGATGGGA AATAATCGCA 3000

AACGTACGGC GACTATCGAC TGCTTCTTTT TGaTTGATCG TCATAAAAAT CTCCTTATCA 3060

CTTnGTGATG TyaTyTATTT ATTTyCTTCA GCtAATTTTg ATTAACCTTT GCTAGTATAT 3120

CATGTCAAGC CTTGAGAATA AAGAGCTGCT GTTGGCTGTT TTTTAAAAGG AAAACCTGCG 3180

ACATTAGTCA AAACGACTGA TGTCACAGGT TATTTTTAAC GTCTTGGTTC GTCATCTTCC 3240

ATTTCTTTaC GGAAAAAACG ACGATTTTCT TCTGGTTCTT CATCGACAGT TTCTTCATCG 3300

TAGAAATGAA CGCGTAAGAC TAGCAAACGT GTGCCTTCCA TTTCTTCAGC AGTTAACTTA 3360

ATATTGCCCA CTTCAAAAGA AGGTTTTTCC CCTTCATCAG GAATGGTTCC TAATGCTGTA 3420

ATTAAGTAAC CCGCCATTGT ATCAACATCG CTCATGTGTA AATCTGTTTC AAATACTTCA 3480

TTAAATTCGT CAATTAACAT GCGTCCTTGA ACCAnnTATT CA 3522

(2) INFORMATION FOR SEQ ID NO: 501:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1693 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501:

TTGTACTAGn AGATaACTTC CtGATcTAGT GCCtTTTTCA GAAGCAATCT CTTGGGCAAT 60

AATTGAAGAA TAACTGcTGa TGAAGACAAA TAACAAAATT GTTATCCCCA TTGCTAACGC 120 

CATTTGAATC CCTGTGTTAT CTTCTCCTGT TTTTATTTTG CCATTTTCAT CAAAACTAAT 180 

TTTGGCTTTT TTAAATGTTG CTTGTTGATT CAACATGGCA ACTTGCTCAG CTGTTAGGTT 240 

TAAACGATTG GCATTCGCCA TTGTTTGTAA TTGGTTTAAC ATTTGAGGAA TTAGCATCTC 300 

GGTTGCATTT CCTAAAGAGC GTTCAGAGTA AAGCTGACCA GAAATCTGAT TATCTGTAGT 360 

GTCCAAAAGT AAATAGGCAT CAATTTTCTC GTCTTTTAAT TGTTCTTGGG CCTTTTTCTC 420 

AGAAGAAACA GCTTTGAAAG AAAAGTCCTC TGTTTTATTC TTTGTCAACT CTTTAACGAG 480

CGCTTGATTT TCCGAAACCA GACCAATGGT AGTATCTCCT GAAAAACCGc TAGCTAGAGA 540

GCCCGCTAAG TAAACAATTC CTACAAGAAG AAATGGCGCT AAAATCATGA TTAAGAAAGA 600 

CAGCGACTTC ACATTTTTCT TATAGACATC AGTCGCAATA ATCCAAAATT TACTCATCTT 660 

GCCCACCTAC TTTCATTTTG AAAATTTCTT CCAATGTTGG TGGTTGTTGA TTAAACATTG 720 

GAATGTAGCC GTTGGCTGTC GCTTTATCAA ACAACACTTT GCCCACTTCA GCGTTTTCtA 780 

ATGTCGCTTC TAAGACGCCa TCTTGtCGCA AGTGTGCTTC TTTCACAmCT GcTGTAGCAG 840 

CAACTTCCTC AACAGATAAC GGTGATTCTA AGAATAATTT TGTTCGACCA AATTGTTGTC 900 

GAATTTCATG AACCTGACCA TCTAAAACCA TGCGTCCATT TTTTAACATG ACTAAATGGT 960 

CACAGATTTT CTCCACATTG TCCATATTAT GACTGGAAAA AATGACACAA GAACCTTGAT 1020

TTTTCAATTC AATAATGCCG TCTTTTAATA ATTCTGCATT AACTGGATCC AAGCCACTGA 1080

ATGGTTCGTC TAAAATAATT AACTTCGGTT CATGAATAAG TGTCGCAATC AATTGAACTT 1140

TTTGTTGATT GCCTTTTGAT AAAGATTTAA CTTTATCTGT TTTTTTGCCT TTAACTTGAA 1200 

ATTTTTCCAT CCAAAAGTCA ATTTTAGGTT CAATTTCTTT TTTAGTTTTT CCCCGCAACG 1260 

AAGCGAAATA AATTAGTTGG TCTTCAATCG TAATTTTAGG ATATAAACCC CGTTCTTCTG 1320 

GTAAATAACC AATAATATTG TACTCTTTTT CATTTAATGG TTGACCATTC CATAAAACCG 1380 

ACCCGCTATC TTGTGTTAAA AAATTTAAAA TCAAGCGAAA TGTAGTTGTT TTCCCGGCAC 1440

CATTTTGGCC AATTaATCCT AAAATTTTCC CGTCCGGTAT CGTAAAGGAT ACATGATCAA 1500 

CTGCTTTAAA ATCCCCAAAG GTTTTGACTA ACTCTTTTAC CTCTAACACA TAAACGCCCC 1560 

CTTCACTTTT TcTAAACCGC TGTTTAGTTA CyCATATCGT ATCTTTCCTG TGTTGGGCTT 1620 

GTCAATGACA ATTGTnATGG AAAGATGTTG GACTTTAATC AGACGCTTnA TTTTCCGCCn 1680 

AACGAAAAAG GCC 1693 
(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 

nAAAnCTCGG TGGTGAAAAA ATGACAGACA AAAAGACAAC CGAAAAAAnA GAnACAGAAA 60 

CTGCTGCAAC CTTTACAGAA GAACTAACAG CAGTTCTTGA AGAAATAAAA GAAGAACCTG 120 

TAAAAGAAAA GAAAAAAGGG GAAATTGTCA CCATTGTGAA TGAAACAGAT ATTCAAATTG 180 

GTGAAAGAGA GTACCGAATT GTTAAAAATC ATCGTGATGC CTTTGATCCT GAACGCTTAG 240

GTGAACGGTT TAGTGAAGTA TTAGCTCGTT ATGATTATAT CGTAGGAGAC TGGGGCTATG 300 

AACAGTTACG CTTGCGTGGC TTTTTTGATA CTTCTAATCG TCGGGCAGCG CCGGATCAAC 360 

GCATTGATAC ACTAGAAGAC TATTTGTATG AATATTGTAA TTTTGGCTGT GCCTATTTTG 420

TAATTGAACG AACAGGGGAA CGTAAAGAAA AAACAACGCA TCGTCGTCGT CGCCATAAAA 480

AAAATAATCG TTCTAACGCG TTTGTCGAGG AAAAGAGTGC GCCCGCTAAT AAAAATAGCA 540

AGCCAGTGAT TCGTACGAGA AAAGAAGAAA ATAAAAAAAC GACTACACCC AAACAGCCTA 600

GTCCCAAAAA ATCTGGCAAA GGCTTTACGA TTCGTCAACG TGAGGAGTAA ACAGAGTGAA 660 

AAAAGACTAT CAAGGATATT TAATTGATTT AGATGGAACG ATTTATTTAG GAAAAGAACC 720 

AATTCCAGCC GGAAAACGTT TTGTCGAGCG GTTACAGGAA AAAGATTTGC CTTTTTTATT 780

TGTAACGAAT AATACCACTA AATCTCCTGA AACAGTGGCA CAACGTTTAG CAAATGAATT 840

TGATATTCAT GTCCCAGCAT CCCTTGTTTA TACAGCGACT TTAGCGACGA TTGATTACAT 900 

GAAAGAAGCC AATCGGGGCA AGAAAGTCTT TGTCATCGGC GAGGCTGGTT TAATTGACTT 960

AATTTTAGAA GCAGGTTTTG AATGGGATGA AACAAATCCT GATTATGTTG TCGTGGGACT 1020

TGATACAGAA TTATCTTACG AAAAAGTGGT TTTAGCTACC TTAGCCaTTC AAAAAGGGGC 1080 

GTTGTTTATT GGGACCAATC CAGATAAGAA CATTCCGACA GAAAGAGGCT TATTACCTGG 1140

TGCAGGTTCA GTTGTGACGT TTGTGGAGAC AGCTACTCAA ACAAAACCAG TTTATATTGG 1200 

CAAACCAAAA GCGATTATCA TGGAACGAGC CATTGCGCAT TTAGGCGTGG AAAAAGAGCA 1260 

AGTGATTATG GTTGGAGATA ATTATGAAAC CGATATTCAA TCAGGGATTC AAAATGGTAT 1320 

TGATAGCTTG TTAGTTACAT CAGGTTTTAC CCCTAAGTCA GCTGTTCCAA CTTTGCCAAC 1380 

GCCACCAACC TATGTTGTTG ATTCCTTAGA TGAATGGACT TTTTGAGTAT GATGAGGTCG 1440

AAATTGCGTT TAAGGGAAAC ACTTGGACTC ATTTGTTTGT TTTTAACGAT TGTGG 1495
(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 

GAGGAGTGGA AATGAGCATT GCATTACCAA AACAAAAACA TTTAAGAGGT CCGTTACTTG 60 

ATTACCTGAG AATTGCAGAG GATAATGGCG ATTTGTTTGC ATGGAGAAAG GCGTGCGAAA 120 

TTGGGCGAGA AATATTCTCA GGTGATTTTT CAGACAATGC TAAACCATTA ATTGTTATAT 180 

ACAAAGATGG ATCTTCAGAA GTATTTAACA CAAGAGCTGA TGTAATTTCA GCGTGTCGAA 240

TCGGAAATGA AACTTTGCGA AAATGTTTGG AGACTGGCAA ACAAGATAGA TTGGGTCGCT 300 

GTTATGACTA TGCCATTTTA GAGTAGCAAA TTGTAATAGC TAATCGTATC TTTTGGCTCA 360 

AGAGGTTAGT CATTATGAAT TTTAAAATTA TGAGTTTGGA GGGGGAGCAA TGGGAAAAAA 420

TTTACTCAGA GAGAAGAAGC GATTGATACG ACAAAAAATT CTTTTTCTGA CGGGTGAAAA 480

CGAAtCTTGG gATGAAGAAT CCAGAAATCG TGAAAGAAGT CCAGAGGCTC TCTAAGCAAC 540 

TAGAGTCaGA TCTTATAGCa GATaAGCgAC CATTGCCCAG TTTAGATCCT GATAAATTAA 600 

CAAAAGAGAA GTATCAACAT TTCTTAGATT TAGGTTATCA AGTAGGAG 648
(2) INFORMATION FOR SEQ ID NO: 504: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1243 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 
CACTGTTTTC GGATCTACTT TGGCAAGTTC TGTCGTATCC CCTGTCACAA TGTATTGTAA 60 
AGAATCGCCT CGTTTCACAT TTAAATCATT GATATTTTTC TGrCCATCAG TGACtTCTTT 120

ATTCGGmACA AGAACAGGAA CCACATTGGC TTTGTATCGA TTGAATTCAA TCGTTGCTTT 180

TTCTGGTTGC TTTGGACGTC CTTTTTCTTG ATAAGGCGTA ATTGATTTTG CTTTTAAATC 240

CGTATTAAAT GCGAACCACA TCGAATTTCC GTCATACCCT GGATTGTTAA CAATTTCATT 300 

CCCGAAAGTG AATTTAATTC TATTTGGTGT TGAAATAACA GTTGCGCCAA TATATTCTTT 360 

GCCTTGAACT GCTTCCCAGT CACTTATTGA AATCCCATTC GTCCCTACAT CAATATCCGT 420

TTTAGAATAA ACTTTATTTC CATGACGAGC TACTTTTGAT CCATTAATTT CATGAACAGT 480

ATCCGTGTTT CCTACTTTTA CGAACTCATA CGTAATGTTT TCACCACGTG AATTTAATGA 540

AGCACCTGAA AATACAAATG GATTCTCTTT CGTTGGCAAG ATTTCTTTGC CATCTTCTCC 600 

AAAGAAGCGA ACGAGCATTT CAATCTCAGC ACCAGCCTTC TTTGTACGAG GGTCAATGTT 660 

AATTCCATAG ACAATTGTGG CCACTGGATC ATCATGTAAG AAAGCATTCA ATCGACCTTT 720 

TGAATCAGGT GTGGCCTTCA CTGTATATTG GAATTCAGCA CTAGCAATTT TTTTACCCAA 780 

ATAACTAGCA TTCAAATTGC CATAGGTAAT AGTTACTGGT TGATTTTTGT GAAGAACAAC 840

CAATGTTCCT CCTGCACCGT TATTATAATA AAAATGTGCT TTTTTCGAGT CTTTAGTTGT 900 

TaACCCGCTT GAATCTACAC TAAGCATACT GATAAGTAAT TTTTTATCTT TATGTTTATC 960 

AGTTAGTTTT TTAGCGtCTA TATATTCTGC AGCTTTTACA GAAACAACTT TGGAATTTTT 1020 

TGTTGCTTCA CGAkCGAAAA TTAGCGTTTT aTTTACTGGT TTTGCkACAT AGCCATTTTC 1080

ATTTTTATGT TTGTTGTATT CCGCGACTTC TTTCTCGTAG CGTTCTTTTT CCGCTTTGTt 1140

TTTTGgCCGC AATTTCTGGC ATTTTCTTTG nGCAATTTTT TCAGTTTTGG GCTTTAATTC 1200 

AGCTGCTTGT TGCGCAAACT TTTTCGCTTG nTCTTTTTGT AAT 1243 
(2) INFORMATION FOR SEQ ID NO: 505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505: 

ACAAACTGGT ATTTAAATCT TGTCTTAGCG AAAAAACAGC TGCTAAACAG GCAATCATAG 60

GTGAGCCACG ATGGAAAAAT TGAAATAAAA GnATACACAG CATGACAGCA ATTGCTGTTT 120

TTAACGTTCG CATGCCTAAA CGAAATCGAC CCACTATCAT AAATATTCGC TCCTAAATAA 180

CTTTTCTCTT TAATGATAAC CGAAATAGAG GTAAAATTAC TAGAGAAAAG TGAAGAATTA 240

TTATTTTTTA AGGGGAGTAT GCATGAAAAT TGGTTTACGA ACAGTCAAGA CAGCCATTAG 300

CGCTGCATTA GCAATGATTA TTGCGGAAAA ACTGGGATTA CTTTATGCCC CTTCGGCAGG 360

GATTATTTCA GTTTTAAGTG TTACTAGCAC GAAAAAAACA TCTGTCATGA CAGGAATTTA 420

TCGTTTGTTA TCCTTAGCTT TAGCAACTAT TTTAGCGTAT ATTTGTTTTA CTTTTTTAGG 480

TTTTACAGCG ATTGCTTTTG GTATTTTTTT ATTATTATTT ATTCCAGCAG CCGTGTATTT 540

TCAATTATCT GATGGGATTG TTGTGAGTTC CGTTTTGGTA ACTCATTATT TAGTCGAAAA 600

AAATCTTTCT TGGGCGATTA TTGGAAATGA GTTTTTATTA ATGAGTATTG GTGTGGGACT 660 

AGCGTTATTA GCTAATTCTT ATATGCCTGA TACAGAAAAA CGTTTAAGAG AAGACCAAGA 720 

AGTCATTGAA ACAAgTTTCG TAAAATTTTA AGGGAAAtGG CGCTACATTT AAACAACGCT 780 

GCAGGAGAGC GAAATTTGGT CATGCATTGT GCAGATTTAA AAACCTTTAT TCGTACAGGT 840 

GAGACGTGGG CCAAAAATCA TGGCCGAGAA TCCAATTGCn GGTCTACCCC AATACGkACt 900 

ATTTAGAGtA CTTTGCgATG AGAaAAATGC aAAGTAATAT TCtAAAAAmC ATGTTGGAAT 960 

TATTAGaAGA TATTmCyGTT GATGCGCAAC AAGTTCGCTA TATTCAACAA 1010 
(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 



GATAAACTCT TCGATTCTTT CGCCGTTGAT CCATTAAAGA AAGAAGTACC TCTTGAAGAA 60

CTGGCAAAAC AACGACAAGA AGTAAAAAAA GAAGAAGCAA AAAAACAAGC AAAAGAGTAA 120

TTTTGCTTGT TTTTTTCTAT AGAAATTTTT CAATTGCTGT CCAAGCTGCT TCTTTTCCTT 180

CTTTTGTTTC AGAAGAGAAA ATAATAAAGT CATCTTGTTT ATCAAAATTC AAGGCTTTTT 240

TGATGACACT CTCGTGTTTG TTCCATTTGC CCCGCGGAAT TTTGTCTGCC TTAGTTGCTA 300

CAATAATAAC TGGCAAATCA TAGTATTTCA AAAATTCATA CATTTGGATA TCATCTTGGG 360

TCGGCGGATG ACGTAAATCA ATTAAAGAAA CGACTGCACG TAATTGCTCA CGTTGCGTGA 420

TATACGTTTC AATCATTTTT CCCCACTTCG CACGTTCTGT TTTAGAAACT TTGGCATAAC 480

CATAGCCAGG AACATCCACA AAGTGTAAGG CATCCTCAAT TAAATAAAAA TTTAGCGTTT 540

GCGTTTTACC AGGTTTGCCT GAAGTTCTTG CTAAATTTTT ACGATTAATC AAGGTATTAA 600

TAAAaGATGA TTTCCCAACG TTCGAACGTC CAGCTAAAGC AATCTCCGGT AATTGAGATT 660

CAGGaTATTG kTTaGGGGga TACCGcACTA ATCACGAwTT nCyGCATtAt GTACGkTCAt 720

kGTTTTcCyC CGkTCyTCyA tTCAgTCTTT CATGCAAAAG CTTAACAnAA ACAGCTCTnT 780

CGGAGATCC 789


(2) INFORMATION FOR SEQ ID NO: 507: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507:

GGGAGTGAAA CGCCCTATTT GGTAATGGAT GAGCGAGGAA TGGCCATGTC GACAACAGAA 60

TTGGCTGAAA ATCTAGCAAC ACTCTATGAA TCGCCAAATA ATGTTTTATT GGTGGCATTA 120 

GCGGGAGAAG CGATCATTGG GACCGCTTCT GTGTCTGCTT CTTCTAAAAA GCGGATGGAG 180 

CATATTGGTG AAATTGGCAT TAGTATTTTA AAAGAGTACT GGGGCTATGG ACTGGGCAGT 240 

ATCCTGATGG AAGAACTTAT TCGCTGGGCG CATGAAAGTC ATGTTATTCG TCGCTTGGAA 300 

TTAACGGTAC AGGATCGTAA TCAACGAGCG ATTCATGTCT ATAAAAAATT AGGTTTTGAA 360 

ACAGAGGCAA TTATGCCTCG GGGCGCCAAA ACAGACCAAG GTGAATTTTT AGATGTTCAT 420

TTAATGCGTT TGCTGATTGA TTAATTATTG GTTAATAATT TTAATTTCTT AAATAATCGA 480

ACGATAATTG CATGAAAAAA CAAAATTTTC GCAATTATCG TTTGTTTCAT ATTTATTTTT 540

CTGATATTTT TGTTATAATT ATAGAAGAAT TaACGAAAAA ATGAAAATTT ATTTTGGGAA 600 

GGAATGTAAG AAATGGCGAg GCAACGAAAA TGAAGTTAAC TAT 643
(2) INFORMATION FOR SEQ ID NO: 508: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6298 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: 

TCTTGCTCAT CTTCATATAC TCCGTTTCTG ATTTTTTCTT TATTTTCAAC GACAATTGTC 60 

GACATATTCT GGTGCGAAAC ATTGCTATTA TACAGGGAAA TGCACGAAGA ATCAAAAAAG 120 

GCATACTTAG AAACAAGAAA ACTGTCTGAA TCTGCTCGCT AGAATGAGTG ATTCAGACAG 180

TTTAATCTTT TTACGGCTTT AACTACGTAA GCCACGCCCT GTTTCTACAA GGTACCAACA 240 

AATGCCATAT AAAACGACAA TAAATAAGAT TAAAATGGTC ATAGAAATAC CAATTGGCAC 300 

ATCTTGAACC CCTAGGAAAC CGTAGCGGAA CCCAGAAATC ATGTAAACAA TCGGATTAAC 360 

TTTTGAGACA GCTTGCCAAA ATGGCGGCAA CATGGAAATG GCATAGAAAA CCCCACCCAA 420

ATAGGTTAAA GGCTGTAACA CAAAGGTTGG CACAATCGAT ACATCATCAT AGGATTGCGC 480

AAAAATACCA TTTAACAAAC CAGCTAAAGA AAAAAGAATC GCTGTCATCA ATAAAGTCAT 540 

GATGACCATC AGCCAAGAAA CAACGTGCAA TGGCACAAAA AATAGAGAAA TGATGGTGAC 600 

TAACGCACCG ACCAAAATAC TACGACCAAC GCCCCCAATC ACAAATCCCC AAATAATCAC 660 

ATGAGTTGGA ACTGGCGCCA CGAGAATTTC TTCGATATTT TTCTGGAATT TTTGTGAGAA 720 

AAAGGAAGAT GAAACATTGG CGTACGAACT TGTAATGGCT GACATCATAA TTAAACCTGG 780 

CACGATAAAT TCCATATAAG AAAAACCGCC CATGTCGCCA ATTCGCCCGC CAATCATTTT 840

CCCGAAAATC ACAAAATATA AGGACGTCGT AATTACTGGC GGTACCAGTG TTTGGACCCA 900 

AATCCGTAAA TAACGGTTGG TTTCCTTCAC AGCTAAACTT TTTAAAGCAG TAAAATACAG 960

ACTAAACATG TTGATCCTCC CTTTGGTACG TATCCTCGGT AATCTTCAAG AAGAGCTCTT 1020

CCAAACGGTT GGACTTATTC CGCATTGACA ATACTTTGAT TCCTTGTTGA CTTAGCTGTT 1080

CAAACAAGTG ATTGACTCCT TGGTTTCGTT CCACTTCCAC CGCTAACGTT AACTCGTCCT 1140

CAAATACGCT TTGATAGCCT TCAATAACTG GCGCCTGCGT ATAAGGTGCT AAATCAAAAA 1200

TAAACGTTTC AAATTGTAAT TTTGCTAACA AGTGCTTCAT GCTGGTGTTT TCAATCAATT 1260

CACCAGACTG AATAATCCCA ATGTTGCGAC ACAACATCTC AGCTTCTTCT AAGTAATGGG 1320

TCGTCAAAAT AATCGTGGTT CCCTGCGCAT TTAACTCTTG TAAAAAGGCC CACATTTCAC 1380

GGCGCAATTC AATATCGACA CCTGCAGTTG GTTCATCTAA AATCAATAAC TTCGGCTCAT 1440

GCATCAGCGC GCGAGCAATC ATCAAGCGAC GTTTCATTCC GCCAGAAAGC ATTCGCGCCC 1500

GTTCGTTCCG TTTTTCCCAT AAATTGGACT GTTTTAAATA CTTTTCACTC CGTTTCATGG 1560

CTTCTTTACG AGACACCCCA TAATAGCCCG CTTGATTGAC CACAATCTGC TGGACCGTTT 1620

CAAACGGATT GAAGTTAAAC TCCTGCGGAA CTAACCCAAT CTGTTGcTTC GCCCGCACCA 1680

TCTCCGTATC TAAATCATAC CCAAAAATTT TGACTTTCCC CGAAGTCTTA TTAACTAACG 1740

AGGTCACAAT CCCAATCGTC GTTGATTTTC CTGCGCCATT CGGTCCCAAA AGAGCATAAA 1800

AATCGCCTTC TTCGACTGTT AAATCAATCC CCyGCAACGC TTCAACACCT GAAGCGTACA 1860

CTTTGCGTAA ATCTTGTATC ACAAGAGCAT CAGTCATTCG TAACTCTCCT TAATGTATAA 1920

ATATTCATGA ATATTATTGT AACACTTCTC GATAAAAACG GGGGTTATTT TGATTAAATT 1980

TTACAGTTTT GCTTAGCTAT TTTTCAACAA CAAACTAGCC AACGATTAAC GTGTTTATTT 2040

TGATTCAAAA CTGTTTAAAC AGTTTCAAGA GCGTTAAAAC CCTTTCAAAT CAAGGATTTA 2100

GAGTTCTCTT TTTTTGGCAC AGAACTTGCT TTATATAAAA GTGAGGTGAA AAGAATGAAA 2160

AGAAAATTTT TAATTGCTAG TCACGGAAAC TTAGCGAAAG GCTTCCAAAG TTCCTTGGAT 2220

ATTCTCGCAG ACAAAGGAAA AGAGCTGGCG GTCATCAATG CCTATGTGAC TCCTGAGGAC 2280

TACACGCCAA TTATTCAAAC GTTTCTTCAG TCCCTTGGCG CAGAAGAACA AGCTATTATT 2340

TTAACAGATT TATATGGTGG TAGTGTCAAT CAAAAAATTG TTCAAGAAGT CATGACGACA 2400

AAACCAGACA ATGTTTTTAT CATTTCCAAT GCCAACTTAG CCATTGCCTT GTCGCTGATT 2460

TTCTTAAAAG AAGGCGAAAA GCTAACCAAA GAAGATATTC AAGCAGCAAT TGCTGAAGCA 2520

CAAATTCAAT TCGTTGAATT AAACCCATCA AATGAAGAAG AAAACTTTTT TTAGGAGGAA 2580

ATTAAAATGA TTACACAAGT ACGAGTAGAC GACAGATTAA TTCACGGCCA AGTAGCCGTA 2640

GTTTGGACCA AAGAATTGAA TGCCCCCTTA TTAGTTGTAG CGAATGATGA AGCAGCCAAA 2700

AACGAAGTAA TGCAAATGAC GTTAAAAATG GCGGTCCCTA ATGGTATGAA ACTATTAATT 2760

CGCTCAGTCG ATGATGCGAT TGATGTCTTC AATGATCCTC GGGGCAAAGA CAAACGAATT 2820

TTTGTAATTG TGAACAGTGT AGCGGACGCT ACTAAAATTG CGAAAAACGT GACAGATATT 2880

GAAAGTGTGA ATGTTGCCAA CGCTGGCCGC TTTGATAAAT CAGATCCAGC CACCAAAACA 2940

ATGGTTTTCC CAAGTGTCCA ATTAAACCCT GAAGAGTTAG AAGCTGCCAA AGAATTAGCC
AGTTTGACTC ACGTGGAAAG CTATAACCAA GTCCTGCCAA CCAACTCAAA ATTAAGTCTA 
AAACAAGCCG TCAATTAAAA GGAGGAAATG AACAATGCTT ATGCATGCAA CAATGGCAGC 
CTTAGCTGTA TTTATCTGTT TTGCTGGGAA TTATTTAACT GGTCAAAGTA TGATGGAACG 
TCCCTTAGTC GTTGGATTAG TCACAGGAAT TTTAATGGGC GACATGAAAA CAGGGGTCTT 
AATGGGGGCC TCTTTGGAAG CAATTTTCTT AGGAAATGTT AATATCGGTG GTGTTATCGC 
AGCGGAACCT GTAACTGCGA CTACTTTAGC CACAACGTTT GCAATTATTT CAAATGTCGA 
ACAAAAAGCC GCCATGACGT TAGCCGTACC AATTGGGATG TTAGCCGCTT TCGTGGTGAT 
GTTTTTAAAA AATGTCTTTA TGAATATCTT TGCTCCTTCT CTTGACAAAG CAGCACGTGA 
AGGCAACCAA AAAATGGTTG TGACACTGCA TTACGGAACG TGGATTATTT ATTATTTAAT 
CATCGCTTCG ATTTCATTTA TCGGAATTyT AGCAGGAAGC GGGCCAGTAA ATCTGTTTGT 
TGAAAGTATT CCACAAAATT TGATGAACGG CTTAAGCGCA GCTGGCGGAC TTTTACCTGC 
CGTTGGGTTT GCTATGTTAA TGAAACTGTT ATGGGATAAT AAATTAGCCG TCTTTTACAT 
TTTAGGCTTT GTTTTAACCG CGTACCTACA ATTACCAGCC GTTGCGGTAG CGGTCATCGG 
GACAGTAATT TGTGTCGTGA GTGCCCAACG GGATGTAGAA TTTCGCGACA TTCTAAAAAG 
AAAAcTGCGG CGTCTTCTGC GGTTGAAGGC AGCGCGAAAG AAATCGAAGA GGAGGACTTT 
TTCGCATGAA GTTAAAAGAG AATTTATCTA AAGAAGAGAA AAAAATGATG CGCTCTGTTT 
TCTGGCGTTC TTGGACGATG AACGCTAGCC GGACAGGAGC TACACAATAC CACGCAGTGG 
GTGTCATGTA CACTTTACTC CCAGTTATCA ATCGTTTTTA TAAAACCAAG GAAGAACGGG 
CAGAGGCATT GGTGCGTCAC ACCACTTGGT TCAATGCAAC CATGCATATC AATAACTTCA 
TCATGGGGCT CGTTGCTTCG ATGGAAAAAC AAAACAGTGA AGACGAAAAT TTTGATGCTA 
GCTCTATTAC AGCAGTTAAA GCCTCATTAA TGGGGCCACT TTCAGGCATC GGTGATTCCT 
TCTTCTGGGG CATCTTACGT GTTATTGCTG CTGGGATTGG GATTTCATTA GCCAGCACTG 
GTTCGCCATT AGGGGCGATT GTCTTCTTAC TTTTATACAA CGTTCCTGCG TTCTTAATTC 
ACTATTATGC GTTGTATAGC GGCTATTCAA TCGGTGAAAG TTTTATCCAA AAAATGTACG 
AATCAGGCGG AATGAAGATA CTCACAAAAG TATCAAGTAT GCTAGGCTTA ATGATGATGG 
GCAGTATGAC TGCTTCAAAT GTAAAATTCA AAACGATTCT TGAAGTTTCC GTTAAAGGAA 
GCAAAGACGT AGTTAAAATC CAAGACTATT TGGATCAACT ATTCGTCGGT ATCGTACCAT
TAGCCGTGAC ATTACTTGCT TTCTGGCTAT TACGCAAAAA AGTCAATGTT AACGTGGTGA 
TGTTTGGTAT CATGTTCTTA GGCATCTTGC TAGGATTACT AGGCATCTGT TAAAATTGAA 
ACAATAAACA TTTAATTATT AAACACTTAG AGCCTGAGAC AAAAATCCAA AGTGATTTTT 
ATCTCAGGCT TAAAAGCTGA TAAACGGCGG GAACAGAAGC AACCCCTGTC CCGACCTCTC 
TTTTGGAAGG ACGATGAATC ATGCGCTTGG AGACCTACCA ACACTTAGTT GATATTTTAG

AAACCAGCAA CACCCCGATG AGCACACAAG AAATTGCCAC AAAAATTCAA TTAAGCCGCT 4980

CGGTAACCAG CTTATACTTA AATAAATTAT TGGAAAAAGG TGAAGTCCAA CAAACGGGAA 5040

AAAAACCCGT TTATTGGCAA TTAACCCGTG CAACAACCCC CACGACCGAT GTGTTTCGAC 5100

AATACATCGG TAGCCAAGGT AGCGCCAAAA AAGCCATCGA ACAATGCAAA GCCGCTATGT 5160

TGTATCCCCC ATTGGGTATG CCTTTATTAA TCCATGGAGC CAGTGGCGTC GGCAAAAGCT 5220

TTTTGGCAAA ACTGATTTAT GAGTATCTAA AAAACGAACA AATCATCGGC CTGGAAAAAT 5280

TCTATACCTT TAACTGTGCC GATTATGCCA ATAATCCCGA ATTGCTATCA TCTATTCTCT 5340

TCGGCCATAC CAAAGGCGCA TTCACTGGGG CTGAATCAGA AAAACAAGGA CTCTTGGCCC 5400

AAGCTAATAA TTCCGTACTC TTTTTAGATG AAGTACATCG ATTATCCAAC GAAAACCAAG 5460

AAAAACTCTT TCAGTTTATG GATACAGGGA CCTTCCGCCC TATTGGTGAA gaaggaaaaa 5520

TGGTGCATTC CAAGGTCCGG CTACTATTTG CGACCACCGA AAACCCGAAA AAAGTCTTAT 5580

TACCAACCTT TTATCGCCGA ATTTCCGTAA TCGTCTCACT TCCGAACTTC AAAGAGCGCC 5640

CCATTCGAGA ACGGATTGCG ATTTTGAAAA ACTTATTCCA CCGAGAAGCG AAACGAATGA 5700

CCAAAGACAT CAACGTGGAC GAAGAAATCT TTACGGCATT ATTAGAAAAT GACGAACCTG 5760

GCAACGTGGG GAGTTTATCC AACAAAGTGC AGCTCCTTTG CGCGTCGCAA TTACGAAAAA 5820

CTTTGCCCAA TCAACCTGTC GTCATTGGCG ATGCAACTCA ACCAATGATT GCGATTCCTT 5880

TAGACAAAGA AGTGTTAGAA GAAGACACAC TTTCTTCCGA TATTTTCGCA ACATTTGAAG 5940

CCTTATTTAC AAAAGAAAAA ACACTGGCCC ACTTAAAAAC TGAACTGACA CAATTCATTA 6000

AATATTGTTT AGACGATGAA ATAACCTTGG AAAATGATTA TTTTTTACAA AACCTAGTCA 6060

CGGAAGTCGA TAAAGTCCGT ATAATTGTGT AAAAGTAAAA AGGCCATATA ACAGTCCTTT 6120

TACGGTACAA TGTTTTTAAC GACAAAAACA TACCCAGGAG GACTTTTACA TGaCCCAAGT 6180

ACATTTTaCA CTGrAAAGCG AAGAGATTCA AnGCATTATT GAATATTCTG tAAGGaTGAC 6240

GtTTCTaAAA tATTTaACAc gGtATTTAAT CACTATGGGG AAATCACGGA CCGAATTT 6298


(2) INFORMATION FOR SEQ ID NO: 509: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 

CAAATAGCCC TTGATTAAAC GCCAAGACAT CGACGCGTTC TTTTTTCATA AGTGCCTTCT 60 

TTCTATCTCA ATTGTAACTG AGTTAACAAG TTTGCTAAAT GTTCTTCTGA AAAGTTGGGA 120 

ACATTTGCTT TGATTTTTTC TAAAACAGCG CTACCTTCTG CTAATTGATG TGTTAGCGCA 180 

TCTTTCGCGC CAgCAATCCC TAGAAGAGCT GGATACGTAC TTTTATTCAA CGCTTCATCT 240

CGGCCTACTT TTTTACCTAA ATCTGCTTCT GTACTTGTGG CATCTAATAA GTCATCACGA 300

ATTTGaAAAG CCAAGCCATA GTGAgCGCAA ACTGTGTCAG TAAACCaATG ACTTCTTCTG 360

kTTGATTGGg CTAATACGCC CCCAGCAATT AAGGsGrATT CAATTAGCGs TCCGGGTTTC 420

TTTTCATGAA CAGcCGcTAA ATCTTCTnAA GGCAAGCTGA ATTTTTCTCC TTCAAATATC 480

ACCCATTGGC CJ3GAAACC 498



(2) INFORMATION FOR SEQ ID NO: 510: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 



ATTAAAAGTG nACGACAAAA TCAAAGCCGC TTTACCTTAA CCAAAATGAA TACGTTCATC 60

AAAAACTTCC TTGGTAGCTA CATTCTGTTA AGTTTAGAAA TTTTAATTGC TGCTGATATT 120

ATTGAATCCA TTGTCAAACC AACTTTTCAA GATATTTTGA AATTGGCAAC CTTGGTGATC 180

ATCCGTACAG TTATTTCTTA TTTCTTACAT AAAGAAATTG AAGACACTAT GACCGATATG 240

GAACAACAAG AAAAAGAACC AGAGACCCCA AAAAATTAGC GGGTACGCTG GTTCTTTTTT 300

TATTTAATCG CTTTAAAATA ATGAAGTGGT TCGTTAGACA AACGCTGCAT TTTACGTTTT 360

TTATCTTCAA TAATTGCTAA AATCTCGTCT GAAAAATTTT CTAAAATAAC ACGGCCGCCT 420

TTATAGGTGT GTCCTCCCAT TTCAGCAACT TCGCTGGCCG TCAAAATGAC TTTACGATGC 480

GCAATTTTTT TGAAAAATTC TTCAATTAAG TACGTCGGTA AGTAAAAATT ATTTTGAGCA 540

GAAAATGATT CCAAATAAAT AAAATCATCG ATGCTGATGA AACAATCATC CATCGATAAG 600

TATTCAATTG ATCTGTCTAT ATATAATTCA CGCTTTTGTA AAATTTCTTT GACTTTTTTA 660

TTTAATAAAT TCAATGATTC AATTTCTTTA ACCAAATGTT CTAGTTCACG ATAACGTACT 720

TCTTTTCGCT CTTTGTAAGA tGAATGTAAG GCGGCACTTG CTGTGACTAT TAGCGCACCG 780

ACGATTACAC CGACCATTCC GGCGATAAAA ACAAACATAT TTTTCCCCTC GCTCTTTTCT 840

AATTTTCCAT TTTAGTATAA CATAATCCAA AAGTTTCGGA TAAAAAATGA CACAAAATTT 900

TTCAAGCGAC TtCGTTGACC TTTATTGACC AAAGAGGTAT ACTTTTCATA GGCATTCAAA 960

GTGCTTTGTG ATAGAATGAC TTTAAAAGTT TCTTTATATA GGAGGATTTA CTTATGAATT 1020

TAATTCCAAC AGTTATTGAA CAATCATCTC GCGGTGAACG TGCTTATGAC ATTTACTCTC 1080

GTTTATTAAA AGACCGCATC ATTATGCTAA GCGGCCCTAT CGATGATaAC GTGGCAAACT 1140

CAGTGATTGC ACAGTTATTA TTCTTAGATG CCCAAGATTC AGAAAAAGAC ATTTACTTGT 1200

ACATTAACTC TCCTGGTGGC AGCGTTTCTG CAGGCTTAGC GATTTTCGAT ACCATGAATT 1260

TCGTTAAAGC GGATGTCCAA ACGATCGTAT TAGGAATGGC AGCTTCAATG GGTAGCTTCT 1320

TATTAACAGC TGGTCAAAAA GGCAAACGTT TCGCTTTACC AAATGCGGAA ATCATGATTC 1380

ACCAACCACT TGGTGGCGCT CAAGGGCAAG CAACAGAAAT TGAAATTGCT GCTCGTCACA 1440

TTTTAGACAC TCGTCAACGT TTAAATTCAA TTTTAGCTGA ACGAACTGGC CAACCAATTG 1500 

AAGTGATTGA ACGTGATACC GATCGTGATA ACTATATGAC TGCTGAACAA GCAAAAGAAT 1560 

ATGGTTTAAT CGATGAAGTA ATGGAAAATA GTAGCGCCTT AAATTAAAAA TAAAAAAATG 1620 

GAAACAGCGA CAGTCGCTGT TTCCATTTTT TTTaTTTTTC AATGTATCCG CGTAATTCTT 1680 

CTTCAGTGAC ACTTTCGTCA AAGATATACT CACCAATGAT GATCGTTGGT ACAAATTGAA 1740

TGTGCGCCGC ATTGGGCTTC AGCAATCACT GGCGGGAnAC AAGCGTTGGC ATCTTTTTGG 1800 

TTCTTTTAAG CCCAGATTTT TTTCAGCATA CGTCGGCTAC TTCTTCTAAG GGTAAGTTTC 1860

CCCATTCATC nTGCGTGAAA CCATTTTAGG CAnGCTGACA CGCTGGTTCA GGCGCTGAAT 1920 

AGT 1923

(2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1613 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511: 



TTCTCGCTTA ATTTCTGGAA AAGGATATGT GAGTGCCGAT AGTCAAGAAA AAATTCTCAA 60

AGCAATGAAA GAATTAAATT ATACCCCGAA TTATGTGGCG CGGAATTTAC GAGCAGGTGC 120

CACAAAAACC ATTGGCTTTT TAGCCCCTAG CTATTTAGGT TCCTTAGGTA TTTTCTTGAA 180

TCGCTTTATT TCAATTGCCA AAAAATACAA TTATTCCGTA ACCTTATTTT TTACAGATGG 240

CGATCAGCAA AAAGAAATTG ATGCCTTGAA CCAATTAAAA TACAAACAAT TAGATGGCTT 300

GTTTTTATTG ACACGAACAA ATGAATGGTC GGTGATTGAG CCTTACAGCG CGTATGGTCC 360

TTTAGCAACC TGGCATCGTA TCGATTCACC TAATATATAC TCTTCCTACA TTGATCATTA 420

CTCTGGTTAT TATCGTTCCT TAGAATATTT ACAACAACAA GGATACCAAA AAATTGGTCA 480

TGTCCTCGGT AATCCCGAAA ATTTAAACAC GCAcGCACGA AAACGTGCAA TGAAAGATTT 540

TTATGAAAAT ACTCAGGCAC CTTGGCAAAC TGAGTGGCTG ATTAATGATC AGTATACTAA 600

CCATTCTGGT CGCAAAATTG CCCATTACTG GCAACAATTA ACCGACAAGC CAGAAGCTTT 660

GGCCTTTTAT AATGATCACG TAGCAGCTGA GTTTATTTCA GAACTTGAAA ATCTAAATTA 720

TGcCGTTCCT CGCGATGTCG CTGTCATTGG CTTTGATAAT AGTTATGTCA GTGAGTTGAT 780

GCATATTACG ACTGTTGATT ATGCCATTCA GTATCAAGCC GAAAACTCAT TTATTTACCT 840

ATACAACCAA TTAAATCAGA CGAATATTCC TGAAAAAAAA TTAACCGTTC GTCTAGTTGA 900

ACGACGAACG GTGCCTAAAA AAGACCCGGC AGATCAGTCA CTTTGACTGA TTTACCGGGT 960

CCTTTTATAG TTAAACAAGG ATTATTTATG CTTTTGTTCA TTTTTTAAAA TAGAAATTGA 1020

CTGTTTTAAA GATTGAATCA TGCCGCCTTC ACTGTATACA AGAACGTTTG ATTTGTATAA 1080

ACGAGCAGAG AAGATTGTTA AAATAATTCC AAAAACTGCC AAAATCACTA ATGAGACAAT 1140

TGCTTCCATT GTTGATGCCG TTTCGGTCGC CAAGCGTACT GGCATAATGT ATGACGAAAT 1200 

CAGCGGAATA AAGGAAGTCA CTTTCACCAC GATATTTTGT GGATCATTCG TCCCAAACGT 1260 

AATGCCGATC ATATAACCGA TTAAACCAAT ATACATGATG GGCTGAACTG CTTTTGCTGT 1320 

ATCCTCTGGT TTGGAAACAA GAGAACCACA AAGTGCTGCC AAAACAGCAA AAACGAGAAT 1380 

ACCAATAATG AAAAAGCCTA ATGTAAAAAT AAAGGTTGAT GAAAAAATAC TTTGTGGTGA 1440

AATACCACTT AATAGACTTT TCATAAAATC AAGATTTTTC AATTGAGAAT AGCCGAGAAC 1500 

AAACGCCACT CCGTAAATGA AAATTTGTGT TAAAGCTACT AGmATAAcCC CAGTTAmTTT 1560 

GCCATAAAAA TGGGTTgCGC yTTTGTACTA GAGAGAATAA CTtCCATGAT TCT 1613

(2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 



ATAAAATTTC ATATTATcTC TATCCTTTCT ATAACTATTA ATGATATGAA CTTTTAAGTG 60

TAAAATAAAA CACCTAAACA ATTAATTTCA TTTCCAAAAG TTAGATAATT ATTTAAAAAA 120

TGTTTCGTAC TAAAACATTT TTTAGTCTAA ATTTAATGTA AATTTAGAAT CAACTGACAT 180

CTGATATTTT TTAGGAGTCG TCCCTGTTAT CTTTTTAAAA ACTTTAGTGA AGTGAGACGT 240

ATCTGAAAAA CCTAATAATA GTGAAATATC ATTAATAGGA TAATTAGATT TTAACATTAC 300

TTTAGCTTCT GAAATTTTTC TTCTTAAAAT ATAATTATGT ATACTAATGT TCATTTCTTT 360

TTTAAAAGCT GTGCGTAACC TAGCTTCACT CATTGAAAAA TATTGAGCAA TCTGTCTTAC 420

TGTAATCTTT GTATACATAT TTAGACCAAT GTACTGAACA ATTGATGATA TCTGAGGAGA 480

AAGATTCCTT ACTTTAGCTT TTCCTATTTC AGAGGTATAA AATACTATAG CAGTATCTCT 540

AACTTTTAAT ACTTCTTTTA AATTTATACA TAGTTCATTT TTCCTTATAA AACTATCTCG 600

ACTCTGATAT GCATTTATTA AGTCCATTCC CATATTTATA GCTGCTTGTG CTAACTTCTC 660

AAATACAACT ATTGAATAAT TCTTTcAGAA CGTACGTTAT CCCCACTAAC CACAGGwACA 720

ACACCATTAC TTAGTTTAAA TATCATATTA CTTAATTCTT GTACATTACC ATTTACAACA 780

TATTCTAAAA TTTTATTTTC ATATAAAAAT AAATAAATTT CTGGATCATA ATTTTGAGAT 840

AATAACATAT CTATCCGTTC TATTTGGATA TCTTCTGAAA ATTTTTTCTC ATAATCATGT 900

AACGGCTTAT GAAATAAGTC TTCAATCTTT CCCGTAAAAA AATAATTTAT AAGGATAAGA 960

ATATCTCTAA TATCCCCTAA TGAAAATAAT GGTAATTCAT TTAATAATTT ATATAAGGAT 1020

TCTTTATCAG AATGCTTTAT ACTTTTATAT TGAACCATAG AAAGTCAAGT CCAGACTCCT 1080



WO 98/50555 



1760 



PCT/US98/08985 



GTGTAAAATG CTATACAATG TTTTTACCAT TTCTACTTAT CAAAATTGAT GTATTTTCTT 1140 

GAAGAATAAA TCCATTCATC ATGTAGGTCC ATAAGAACGG CTCCAATTAA GCGATTGGCT 1200

GATGTTTGAT TGGGGAAGAT GCGAATAATC TTTTCTCTTC TGCGTACTTC TTGATTCAGT 1260 

CGTTCAATTA GATTGGTACT CTTTAGTCGA TTGTGGGAAT TTCCTTGTAC GGTATATTGA 1320 

AAGGCGTCTT CGAATCCATC ATCCAATGAT GCGCAAGCTT TTGAATATTT TGGTTGATCG 1380 

ATATAATCAT GAATCAATCG ATTTTTAGCC TCACGCGCTA AGTTAATATC TGTGAACTTA 1440

AAAATTCCTT TAACAGCTCC TCTGAAAGAT TnGAATTTTT TAGGATGGGG nAAGAA 1496
(2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 
GGTACTTTAC AAGAATCACG TAAACGTGAA TTCTACGAAA AACCAAGTGT AAAGCGTAAG 60 

AnAAAATCTG AAGCAGCTAG AAAACGTAAA AAATTCTAGT TTTGAATGAA TTGGAGTTGG 120 

TGGTATGTCA TTGCTAACCA CATTGAATGA CGACATTAAA ACAGCGATGA AAGCTAAAGA 180

CAAAGAAACT TTAGCCGTAC TTCGTATGTT AAAGACTGCC ATTCAAAATG ATCAGATTAA 240 

AGCGGGCCGT GAATTGAACG GAGAAGAAGA ACTCACTGTA TTGTCTCGTG AGATGAAACA 300 

GAGAAAAGAT TCTTTATCAG AGTTTGAAAA GGCGGGACGC GACGATCTTG TTGAAAAAGT 360 

AAAAGTTGAA ATAGCAATTG TTGAAAAATA TATGCCTCAA CAACTAACGG AGGAAGAAAT 420

TCGTCAACTA GTTCAAGAGG TAATGACGCA AACTGGCGCC ACTTCACCAA AAGATTTTGG 480 

TAAAGTGATG GGCGCTATGA TGCCGAAAGT CAAAGGCAAA GCcGATGGcA ACCAAGTTAA 540

CGCaATTGTG aAAGAATTAT TACAAGaAAA CTAAGAGCGT TwTCGAAACT AGGACAGGAT 600 

ACTGcCgawT TtTTTTTaTG sCcaaGAgAA AcCGGTAGCC TTTTTTACGG TTTCCCCTTT 660 

TTTGGACCTT TCCAAGGTCC AAT 683 
(2) INFORMATION FOR SEQ ID NO: 514: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 

CGAATAGTAT GAcGTACACG GAATAATTTC CTTCGTCAAC TGTTTTGACT TGAATAAAAA 60 

AATAGTTATT TTTAGAATAC GTTTCCACGC TTTCTATCAA TAACTTGCCG ATACCTTTGC 120 

GATGATACCG CTTTTTAACT GCCATACAAT GGATTTCTAC TGTATCTTTA CTTGTTTCTG 180




AAAGTGTTAT AAAACCGAGT ATTTTATTCT CTACATCCTT TGCCACCCAC ATAGGTAAAG 240

TGCTAGCTAT ATCGATATAC TTGCGGGTCT CTTTTTCAAG GCCAAACCAT TCTGGAAGAT 300

TAAACAATAC TTCTTGTGCT GCCTGAGTCT TTTTATCTGC ATTTTTTTCC CTTATAATTG 360 

TAAATTCTTT CAATTTACAA CCTCCTTTTT GTTTACATAA CAAAATTATT GTATACCCAA 420

TCTTTTTTAT TAAATAATAG CATAGGTAGT TTAAAAATGT AAGTTAGTCA TTTATTCTTT 480

ATTATTTCCT TAAAACGACG CCTTTTCTCG CTAAATCTGC CCTTTCTTTA TTTTTTTCTG 540 

TTAAACTAAA CGTATTAAGT CTATGAGGAG CGTGCGTATA TGGATTATCT GAACACTTAT 600 

AAGCGGTTGC TAGCATTGGC AGATGCTGGT CTTTTTTACG GGAAAGATGC TTTTGATCAA 660 

GAACGTTACC AAGAACTACG AGAACTTACG TkAmAACTTA TtAGCGCAAC CGGACACGAA 720 

GCGAcTGTCT GGCCCGACTT AGGAAAAAAC CTTACCTAAA GAAGAnGGTA TCCAACACCG 780 

GAAnGTAGAT GTT 793 
(2) INFORMATION FOR SEQ ID NO: 515: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 



ATAAATGGCG TCGATGATGG GAAAACTTTT CTTGATCATA TCGACGATAA AATAACGTTT 60

CTGCTTCAAT AATAGAGCGA TAACGTTGAG GACTCATTGT ACGATAATCT AAAACAGACA 120

ATTCTTGTTT TAGGTGGATT GGTAAGACTT CAATCTTTTT ATTGGCTTGA TTCAATTGAT 180

AAAAGAACCA ATGATCTAAA TAGTCCAAAG GAAATTCTTC CGAAAATCCT TCCAACCAGC 240

GCAGTGTTTC TGCCGTAATG ACTGTACCAG AATTGATAGC CATCAACGGT TGGTTGGCTA 300

TCCCTGCTGT TGGCTTTGCT CCTTTAAGTC CAACATATTG ATCACTATAA ACTGGCGAAA 360

TTTGTTGTCC ATTTGCTTCT ACAATTGGAA CATAGACTGC CACAGTCGGA TCTAACGGCA 420

TGATGATCAA CGTATCAAAA TAAGAAGCTG GCACTTCTGT GTCTTGGTCA AGGAGCAACA 480

ATAATTCACA TTGATTCGTT TGACTAAAAG CAATCGCTTC ATTATAAGCG GTCGCTAGGC 540

CTGGATTATC AGGATTATGT CGATAAGTAA CATTTTGTTG TAAAAATAAT GCATCTTCTT 600

GAGGAAGTGG GCTGTTGTCA TAAATAAATA AGTGCAATTC GGGGTGATCT ACCACTTCTT 660

TTAATAACAA ATAATTCGGC GTATCAGCCA TTTTCATTTG ATATAAGACG ATGACTACCT 720

CTAATCTTTG ACTCATTAAA AGAACCTCCA GCCCCATTTG TTAAAAAATT TTCCCATTGA 780

TTGCATAAAG ATTTTAAACA ATTTTCGACT TTTATGGGCG CCCTTTTCAT ACATGTGAAC 840

AACCGTTTCA AAAGGCGTAT AGAGAATCCG ATAGCCTGCT TTGCCAAAAC GTAAACATAA 900

ATCGTTGTCT TCaAAGTACA TGaAGAAACG TTcATCGAAC CCACCAATTT CAACGAATTT 960

TTCACGATCA ATCAACATAA AACAGCCTGA GCCCATTTTA ATATCCGTTG TTTCTGTATC 1020

CGACAAATCG CGACATTCAT AAATACTCAA ACGTTTATCA AAAATTTTCT TTACAAATTG 1080

AAAGGGAATA AAACGTAACA TATAATCGAA AACATCTAAT TTTTGACGAA CTAAATATTG 1140

CGTCGTGCCA TCTTCATTTA GCACTTTAGG GCTAACGACT GCAATGTTCT TATCTATTTT 1200

GATACGGTTT AATAGACGAT CAAGCACGTC TTTAGTAACC AAAACATCGG GATTAAAAAT 1260

AATTGCATAC TTTGTCGAAG CATTGAATAA CACTTGATTA TGACCATGAC CAAACCCTTG 1320

ATTTTCTTCA GCGCGATGGA TAGTAATAAA TGGTTCATAT GTTGTTAATT TTTCAAGATA 1380

CGCTGTTTCA GAATGATTGT CATAGATATG AATATCATAG ATACTATCAG TACCCaGTTC 1440

GGCTTTTAAT TGGTCTAGTA CATTAAAAAT GTAACGACTA TTATAAGTGA CAATCGAGAT 1500

TGTTACCATT TCTTGCATGC TCATTtCTCC TTACTACTGA AACATCGTCT TAAAAAAAAA 1560

GCATATGTTT CACACATATA CTTTTTTAAG TTTAATGGAA TTATATCATA GGTTTACGCT 1620

CTCGCAAAGC AACCTAAAGA TTTCTTTACT TTCTCTTAGA ATGCTTGCCA AGTCGCTTTT 1680

TCATTTGCTC CTGACGATAT TCACGATTCC CTAAAATCCG TAACAAATAC ATCAACGGTT 1740

GATGTCCTTC GCCAACTAGA CCGATTAGTT CAATGAATAG TTCTAAGCCA ATTAAACAAA 1800

AGACAATTAA TAAAATTGAT GCTACTGTAC TTGAATAGCT GAACAATAAA GAGACAAAGG 1860

AAAAAACTAA CGCTAATGCA TAAATAGTCA TGACCGCCCC TTTATGGGTA AAACCTAAAG 1920

ATAACAAGCG GTGATGCAAA TGCATTTTAT CTGCTGAGGA AATGGGCTTC TTGTTCAATA 1980

GCCGTCGAAT AATTGCATAA ACCGTATCCG TAATTGGCAC ACCTAAAATC ACCATTGGCG 2040

TAATTACCGT AATAAACGTA GCATTTTTCA AGCCCTGTAA CGACATTACT GCAATCATAA 2100

ACCCGAGGAA TAACGCCCCG GTATCTCCTA GAAATATTTT AGCCGGATAA AAATTGTATG 2160

GGAAAAATCC CGCAATGCTC GCAACTAAAA TAAAAATAAC AATTGGGATA TAGACCGTTT 2220

TAGCATGTAG GAAAAAATAC CCTGTAATAC CAATCGTGGT TAATCCAATA ATGGATACGC 2280

CTGATGCTAA ACCATCCAAA CCATCAATTA AATTTACTGC ATTCGTAATC GCTAAAATCC 2340

ACAATAAAGT TAGTGGTAAA CTAAACCAGC GCAAATCAAT CATTCCAACA ACTGGCAACG 2400

TCACAAAATC TATCCGAATT CCAGCAACAA AATAAATAAC TAATGCTGCT AACAAAATAC 2460

CGATTGTTTT TTTCATTGGA GTAATTTCTT TAATATCATC AATGAGGCCT GTCAAAACAA 2520

CCATTCCACC AGCCAAAATA T 2i T R Zi. T C T T CZ rjAAPGAAAAA 2580

TCAATAAACA TGAACTAGCA AAAGCnATGT AGATTGCAAG TCCTCCkGCT GTAGGCATAT 2640

TTTTGGTATT GaTTCGTCGC TCTCCTGGTG CATCATAGGC GCCAATCCGA AACGCTAGAA 2700

GCTTAACAaG TGGTGTTAAT AGCAAAGATA AGATAAATGT CATAAATAAA CGTATCAAAA 2760

CTTCATAAAT AAATAGTGAC ATAGTCGAAC CCACTTTCTT CTCGT 2805



(2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5101 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 



AAAGGTGGCG 


TCATCGAAGG 


AAAAGTTTCT 


TCAGTGGAAC 


AAATTACTGC 


GTTGGCAAAA 


60 


CTACCAAACC 


GCGAAGGTTT 


ACTATCTATG 


CTATTATCTG 


TATTACAAGC 


GCCAGTCCGC 


120 


AACGTGGCTT 


ACGCTGTCAA 


AGCAGTGGCA 


GAGAAAAACG 


AAGAAGTTGC 


TTAATTGCAG 


180 


TCTAATTCAA 


ACCAATAAAA 


AACAAACAAT 


TTTAATTTAA 


ATGGAGGAAA 


TTCAAAATGG 


240 


CATTAAACAT 


TGAAAACATC 


GTTGCTGAGT 


TAGAAACAGC 


AACTATCTTA 


GAACTAAGCG 


300 


AATTAGTTAA 


AGCTATCGAA 


GAAAAATTTG 


ACGTATCAGC 


AGCAGCTCCT 


GtAGCAGTAG 


360 


CTGGACCAGC 


AGCAGGTGGA 


GCAGCAGAAG 


AACAAACTGA 


ATTCACTGTT 


GAATTAACAG 


420 


CAGCTGGCGA 


TCAAAAAGTT 


AAAGTTATCA 


AAGCAGTTCG 


TGAAGCAACT 


GGCTTAGGCT 


480 


TGAAAGAAgC 


TAAAGCTGTA 


GTTGATGGCG 


CTCCTGCACC 


AGTTAAAGAA 


GCTGTTTCTA 


540 


AAGAAGAAGC 


AGAAGCTTTA 


AAAGCTGCTT 


TAGAAGAAGT 


TGGCGCTTCA 


GTAACAGTAA 


600 


AATAATTCAT 


TCAACTTTTG 


AATGTTCGAG 


AACCCTTGTG 


TTGCAAGGGT 


TCTTTTTTAT 


660 


TAAGCTTTTA 


AATGGTGGTA 


AAGTTAGTGA 


ATTGATTGTC 


AAAATAATCT 


TcGTTTAATA 


720 


AAGtgCtTCC 


TAGCTAAggA 


AAATCCAGAG 


GATCTTAACC 


TCTGGATTTG 


ATAATTTTCT 


780 


ATAAATTCTC 


GTTACCAATA 


TTATTTGAtA 


AAAAGCAAAT 


TATTTAAATA 


GTGACTGAAT 


840 


AGTAGGTCTA 


TTTCTTTTTA 


TnTTTTATAA 


TGAATATACT 


AACTAAAGAT 


AGTAAGATGA 


900 


TTCCAATGAT 


AGTAAATAAA 


ACTGATTTTT 


TTTCGCCAGT 


ACTGGGAAAT 


ATCCCCTTTT 


960 


TGTGCTCGGT 


TGAACTTGAT 


GTAGTAGCTT 


TCTTCGAACT 


GCTCGTAGTT 


GTTTGTAATG 


1020 


TAGATTCTGT 


ATTAGTAGTA 


TCCGAAGAAA 


CACTTTCAGC 


ACTTTGATTA 


ATTGACGTAT 


1080 


AATCGTTTGC 


TTTTATTTTA 


TCTGCCATTG 


CTGCTAAACC 


ATAATCACCA 


CGGACATCAG 


1140 


ATGGGTAATT 


TTGGTTAGCG 


TAGTCTACCT 


TGACTTGATT 


AATTAAATCG 


ACGAGTGTTT 


1200 


GATCGCTTAA 


ATTATTtACA 


ACATCACTAT 


TTAAATcATA 


TACATAAATT 


AACGAAAGTC 


1260 


GAATTTGAGG 


TAAATAATTT 


TTTAAGTCAT 


CAATCGTTAA 


tGTTGaGTAA 


TCTGTAGAGT 


1320 


AGGAGTCCTC 


CGGGATAGGT 


TTCTCACCAT 


ATAATTTAGT 


AATCATTCTC 


GCTGTTCCAC 


1380 


TGATGTCTTG 


GTTTACAAAA 


TGAAACGAAA 


CTTTTTTTGC 


GTAGTCCAGT 


TCGTCATCAG 


1440 


ATAAACGATT 


TATCAAGGTT 


TCTGTTAAAC 


CATACATACC 


TACGTAATTA 


AGAAGTGTTT 


1500 


GACGAAGCGA 


TGGAAGTTCC 


TCGCTTTCAG 


CTGTAAGTGC 


TGTTGAATCA 


GTTGAAGCAT 


1560 


TAATTGTTGT 


TGCTGAAGTT 


TCTTGAGTTA 


AAGAAGTAAC 


TGTTGACTTT 


TCCACAGAAG 


1620 


CTTCATTAGC 


CGAATCTTCT 


GCCCATACAT 


TAGAAACTGA 


TAGTGTGGTA 


ATCGAAACAA 


1680 


AACAAAAGAG 


TCCTAAAAAC 


ATTTTTTTCA 


. TTTTAATCAT 


TCCTTTCGTT 


ACCTATTGTA 


1740 


ACATTTTTGA 


AATATTTTTG 


TAACAATTAA 


> ATATTAAGAA 


. AAGCTTAAGA 


> ATATATACTA 


1800 
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GGTGATAAAA 


AATATAAATT 


TTAAACTATT 


AATCTATCAA 


AAATACGATC 


CGTAGAGACT 


1860 


ATT AC AT TAG 


TGTCAAAATA 


TTTCAAAGGT 


GTTGGTTAAA 


GAAGTACTGT 


ATTTGCTTAA 


1920 


TGGTAGTTTC 


GAGTAAATTA 


GTCATTATAT 


AAAAGATTGA 


CAAGTATGCT 


GCATAGCGAA 


1980 


AACTTCGTCA 


ATCTTTTAAA 


AGGATAAAAA 


T AC AG ATT T A 


AGAATAATGA 


CAATTTATTT 


2040 


TAAGTCAAAA 


ACTGTTGACG 


ATATTGTCGA 


GGTGTCATCT 


CTTTTTTCTG 


GAAGAAAAAT 


2100 


TTTGAAAAAT 


AAGAAGGTGA 


AGAAAAACCT 


GTTTCAAAAC 


TGATTTCCGT 


GATGGTTAAG 


2160 


GAACTTTTTC 


TTAATAGAGA 


ACAGGCTTTT 


TGCAATTTTA 


AGTTTTGTAC 


ATAAATACTA 


2220 


GGATTTGTTT 


GTTCATAACG 


CTGAAATAGC 


TTAAATAAAT 


AGCTACGATC 


AATGTTATAC 


2280 


TCAGAAGCTA 


ATTGACTAAT 


TGAAAGATTG 


TTTGAAAAAT 


TTTCGTGAAT 


AGCCATTTTT 


2340 


ATTTGGTTAA 


ATAATTGTTG 


GCTTTGATTT 


TCGTGCTGTG 


TAACGGAATG 


ATGCTCTAGA 


2400 


TAATTCAACA 


ATTGAAAGCT 


TTCACTTAAG 


CTGCCAAAAT 


GATTATGTTT 


CTTCGCTTCT 


2460 


TGGTAAATAT 


TTTTAAATAA 


ACTTTCTAAT 


TTCTTATCAA 


CAGTTCCTTT 


TAAAATACAA 


2520 


TTTTCGGTAG 


TCAGGCCTAT 


TTTTTCTAAA 


TACTGAGACG 


AAAGATCTCC 


AACAAAACCG 


2580 


ACCCAATGAT 


AGGTCCAAGG 


ATTTTGCCGA 


TTAGGATAAT 


AAATATTGCG 


ACAATTGGCT 


2640 


GGGATCATAA 


ACAAATCGCC 


AGCGGTAATG 


TGTTGTTTTT 


CCATAAAGGG 


TGTTGAAAAA 


2700 


TAGCCTTCAC 


CATCGCTACA 


ATAATGCAAA 


ACCCAATAAT 


TAATGGGGGC 


AAACTCAGTT- 


,2760 


GGTTTTGTCG 


GACGGCATTT 


TTCATAGCCA 


GCTTCATAAA 


GATGAAATTC 


TTGGCTAATA 


2820 


GGTTGATCAT 


TAATCAGGAT 


GCTGACATTA 


TTTGTAAAAA 


CATTTTGCAT 


GCTATCACCT 


2880 


CTTTATAAGA 


GTATGTAAAG 


GCTTTCTAAA 


AGTCAACATC 


TTTACAAGAA 


ATAGGCAACA 


2940 


TTTTTTATTT 


TTAAAAAACT 


GTAAGCGTTT 


TACAATGTGT 


GAGTACTAAT 


TAATTTGATA 


3000 


AAGAGAGGAT 


GTAGTAAAAT 


GGCAACACAT 


TCACGGGAAC 


GATTAGGTTC 


TGTCGCATTG 


3060 


ATGCTCATGA 


CTTTTTCAGC 


GGTTTTTGCC 


TTTCCTAGCA 


TTGTTAACAA 


CAGTATTCAG 


3120 


ATTGGCCTAG 


CAACTATTCC 


AGGCTATTTA 


TTCGGCTCTA 


TTTTTTACTT 


TTTACCATTT 


3180 


ATTCTAATGA 


TTGCAGAATT 


TGCTTCTGCT 


AATTCTGAAA 


ATGAATCAGG 


TGTTCACAGT 


3240 


TGGTTGGAAT 


CTGTGCTAGG 


ACCTAAATGG 


GCATTTTTAG 


GCGCTTGGTC 


GTATTTCTTT 


3300 


GTCAACTTGT 


TTTTCTTCTG 


TTCATTGCTA 


CCGAATACAT 


TGATTTATGG 


TTCATACGCT 


3360 


TTTTTAGGTC 


AAAACGTTTT 


TCAAGGAAAC 


CATAGCACTA 


AAATAATTGC 


AGTGATTTCT 


3420 


ATCCTGTTAT 


TTTGGTTAAT 


GACATGGGTT 


TGTATTAAAG 


GGGTTTCGTG 


GATTTCGAAA 


3480 


GTGACTAGCT 


TAGCGGGTGG 


TGCTCGTTTA 


TTTATGGGCG 


TAGCTTTTGT 


GGTTTTAGCC 


3540 


TTTGTCGTCG 


TGTTTGGTTT 


TGGCAATGAG 


CCAGCACAAG 


AGTTTACCAC 


GACGTCAATT 


3600 


ATGCCAACTT 


TTAATTGGAC 


TTTTTTCATG 


ACGATGGCGT 


GGATTTTACA 


AGCTGTTGGC 


3660 


GGTGGTGAAA 


GTATCGGTGT 


ATATATTAAG 


GATGTTAAAG 


GCGGCAATAA 


AACGTTTGTC ' 


3720 


CGCACGATAA 


TTGGTGCAAC 


GATTGCGGTC 


GGTATCATGT 


ATATTCTTGG 


TGCCGTTGCT 


3780 
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GTAGGTTTGG 


TAGTGCCaAC 


GGACGTCTTA 


AAAGGAAATT 


TTTCAAATGG 


TATTTTTGAT 


3840 


ATTTTTAAAA 


TTTTAGGTAC 


GTATTTCCAT 


ATTCCCGCAG 


CAATGATGGT 


GCGCTTAGTA 


3900 


GGAATTATCT 


TATTTGTCGG 


AAGTCTTGGT 


TCTTTGGCTC 


TGTGGACGGC 


AGCACCAGTT 


3960 


AAAGTATTTT 


TCTCAGAAAT 


TCCTGATGGG 


GTCTTCGGTA 


AGTGGCTAGT 


GAAAACGAAC 


4020 


GAAGAGGGAA 


ATCCAACCAA 


TGCTTTGTTA 


GTTCAAGGAA 


TTATTGTGAC 


GATTTTAGTT 


4080 


GCCATTCCAG 


CATTAGGAAT 


TGGAAATATG 


GATAGCTTTT 


TAGAAACGTT 


AATTAATATG 


4140 


ACAGCTTCTA 


CGTCATTAGT 


GCCTGTCTTG 


TTCTTGTTAA 


TCGCCTATAT 


TGGGCTTCGT


4200 


TGGAAAAAAG 


AAACAATGCC 


GAGAAGTTTC 


CGCTTTGGTA 


ATCGAACCTT 


TGGCTTAATT 


4260 


GCGGGAATTT 


TCTTATTAGC 


CATTTTTATC 


TTTGTATTCT 


TTATGTCAAC 


TGTGCCGGAT 


4320 


CCAAAATTAA 


TCATGGAAGA 


AATCAATTGC 


ACGTTACCGA 


AAGGAACCGC 


TAGTCCACTG 


4380 


GGCATGTTGG 


CGTATAACAT 


CATTGGGTTA 


ATTGTCTTTA 


TGGGCTTTGC 


TTGGATTTGT 


4440 


TGGAAACGAT 


ATGAAACAAA 


AGAAAAAAAT 


GAAGTTGGTA 


AAGGAGAAAT 


GGATTATGAA 


4500 


GACTTGGGAA 


AATTATAAAG 


TAGATAGTAT 


TAATCGGTTG 


CCTGGACGAG 


CACACTTTTC 


4560 


GAGCTTTCCT 


TCGAAAGAAA 


CAGCGTTATT 


AAATGAAAAT 


AAATATACAC 


AAGCATACAA 


4620 


AAATTTAAAC 


GGATGCTGGC 


ATTTTCTCTT 


TTTAGAAGCG 


CCAGAATACA 


GCCCAGAAAA 


4680 


CTTTTTTGCG 


ACGGACTTTG 


ATACAAGCCA 


AATGGATCAA 


ATCACTGTTC 


CTGGAAATTG 


4740 


GCAAGTCCAA 


GGATATGGCA 


AAATGCATTA 


CTCTGATTTA 


TGGTACAATT 


TTCCCATTAA 


4800 


TCCGCCGTAT 


GTACCAACGG 


AAAACCCAAC 


AGGTATTTAC 


AAACGGACAT 


TTGCTATAGA 


4860 


TGAAACATTT 


CACGATAAAA 


AAATCATTTT 


ACGTTTTTGT 


GGCGTAGATT 


CAGCTTATCA 


4920 


kGTTtGGGTG 


AACGGCCATG 


AAGTTGGCTA 


CAGTAAAGGt 


GCaCGAAATG 


AAGCAGAaTT 


4980 


TGgACATTAC 


GCCTTACGCT 


AAAAtTGGgT 


GaACCAAATG 


aTTTamCGGg 


CgAGTCTATC 


5040 


aATGGtCaGA TGGkACgTAT 


TTwrGAGGGT 


CCAAGATATT 


nTGGnnGGGT 


TAAGTGGGGA 


5100 


T 












5101 



(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1084 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 

ACAATTAGAT GGTACCCGAA TCGGCTTTAT CCTTAAAGGC ATCGTCGTCG GAATTCTTGT 60 

CGGCTTTGTT GTCAGCTTAT TTCGCTTAGG CATTGAAAAA ATTGGCGAAC AAGTCGTTCA 120

TCTGTATCAA ACGTTTCACG AAAAAACCTT TTGGATGATT CCTTGGTTCT TATTTTCTCT 180 

CGTGCTTGCT TATTTATTAG GACGACTGAT CAAAAGCGAA CCAGCAATTA AAGGCAGTGG 240 
CATTCCCCAA


GTGGAAGGAC 


AACTTCAAGG 


TCAATTAGAA 


ATTCACTGGT 


TTCCAGTGCT 


300 


TTfiGAAAAAA 


TTTATCGGTG 


GGATCTTAGC 


GATTAGTCCT 


GGGCT&TTTT. 


TGeGA££TGA 


360 


Ar^TCCTTCA 



ATTCAACTAG 


GTGCAGTGGT 


TGGACAAGGC 


TATAGTCAAT 


GGCGGCAAAG 


420 


wrtw w/xttw J. w l 


GAAGAAAAAA 


TTTTAATCAG 


TAGTGGTGCC 


AGTGCTGGTT 


TAGCCGCCGC 


480 


T TT T Q 21 T^P A 
111 1 rir\ 1 \J^r\ 


fCAATCGCAG 



GACTTTTATT 



TGTTCTAGAA 


GAGGTTCACC 


ATTCCTTTTC 


540 


ALL 1 1 1/iu 1 1 


i. OU 1 1 nnw x 


CTTTTTCTGC 



GGCCATTTCC 


GCTAACTTTG 


TTTCTTTGCA 


600 


1 1 i w 1 1 luuA 


1 1 u^no w ^> r\w 


TTPTTTATAT 



TGGTCCGGTT 


AAATCTTTGC 


CTTTAGAGTA 


660 


L 1 A 1 1 oonUft 


1 i r\\z 1 rtVrf 1 vjw 


X WWWWW X X X X 


GCTAGGATTA 


TTAGGTTGGA 


TATATCAAAA 


720 


AALA1 lbl 1L 


TPPTT apr A A 
1 ww 1 1 rt w Lnn 


AAP4T ATATftft 


AAAAATCAAA 


GGCCTTTCTT 


CTAATTATTA 


780 


TGGTTTTGTT 


TCTTTTATTT 


TGATTTTACC 


GATTGGTTAT 


TTCTTTCCAC 


ATTTACTTGG 


840


CGGTGGGAAT 


CAAATTGTTT 


TAGCTTTAGG 


CAATCAACCG 


ACTACGATTT 


GGGCTTTAGT 


900 


CGGCTTACTG 


GTTTTACGTT 


TTGTTTTTTC 


AATGGTTCTT 


ACGGTCGAAT 


CTCCCTGGTG 


960 


GATTTCTGCC 


ATTCTAACTT 


TAGGGCATAT 


CGGACATGAT 


GGAGCACTGT 


CCAAACGAGG 


1020 


AGGAnCATTT 


GGAAACTTAA 


TTTGCAnGCG 


GATTACGGTG 


GAAnACGTAC 


ACGTATTAGA 


1080 



(2) INFORMATION FOR SEQ ID NO: 518: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2677 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 



CCACCTTCCA 


ATTTTGGTCC 


AAGTTGGGGT 


TAGCGGGArG 


CACGCAACrG 


CACAGCTTGA 


60 


AAAAAGCCGC 


TGATTATTAT 


GGCAACGGCA 


CACGCAATCC 


CTATCTGCGG 


ATCAAGAAAA 


120 


CCCAAGCGAA 


TTGGAGCTTA 


ACGGCGCAAC 


TGTCGCAACC 


GAAATCAGCG 


ACAGACAGCT 


180 


TGCCAACAGC 


GACCCGCTTA 


TTATTAGGGG 


CGGCTCCTGT 


TTCGAGTTTC 


TCAAATTACA 


240 


ACCAACCTAC 


CGAGTTGAAA 


AATGCGGTCG 


GTACTACGAG 


TGCCATTAGT 


TTGAATGCCA 


300 


ACAACACGGC 


CACTAGGATT


ATTGCCAATC 


AACAATTCAC 


TGGGAGTAAT 


ATCTATCAGT 


360 


TGGACTTCAC 


CTTCAACAAT 


GTCAAACTTG 


AAGTGCCAGC 


CAACCAAGGG 


GTCAAGGGGC 


420 


AACAATACCA 


AGCCGCAATT 


ACGTGGAACC 


TAGTTACAGG 


TCCTTAAAAT 


AGTTAAGTTA 


480 


TCATGAGGAA 


AGCTCAAAAT 


GTTAGTATCA 


AACACTAACA 


TTTTGaGCTT 


TTGTTTATGG 


540 


AAATTAAGGA 


ATTTGGCTGA 


AAATCAGAAG 


CAAAATGGCA 


GATTCTCTTA 


ATTTTCGAGT 


600 


GTTTTTCTAG 


TGAAAAAGGA 


CTCTTCTACT 


TTGGTATAAT 


AGGAATCGTC 


GGAAAAGTAG 


660 


CTTAGCTGTA 


GCAAAATTTG 


AAATTTCAGG 


GCCACCTTTT 


TTTTGATAGA 


ATAATGGAGA 


720 


AAATTAAAAG 


CTGTACATAT 


ACCAATGAAC 


AATAAATCAA 


TACATAAAAA 


AGGAGAATAG 


780 
CATGAATCAA


AAATGGCAAA 


AGTTACTTCC 


AACAAGGAAC 


TCACTAAGAG 


GTGGTTCAGC 


840 


ACAAGCTATC 


ACGACTGACC 


CTTGCGCCCT 


GTATGAGGGA 


ACAAGGGGCA 


AGCAAATCTT 


900 


AGCATTCTCA 


GCATAGCCTT 


CATCGGCTTG 


GCGGGAATGG 


TACATAGAAA 


GAAGGGACGA 


960 


CATGAAGAGA 


ATTAACTGGC 


AACGCTTAGC 


AACCATTGGC 


TTGTGTAGTT 


CTTTAGTGAT 


1020 


TAACGCCTTT 


TTTGGCGTAA 


CAGCCATTGC 


GGAAACAGTC 


ACGAGTGAAA 


GTAGCCCGAC 


1080 


AGTGGCAAAT 


AGTATTAAGG 


AAGCAACACC 


AGCAAGTAGC 


GCAAGGCAAG 


AAAGTACGGA 


1140 


AAGCAGTCAA 


GAAACGACTG 


AAACGAGTCG 


GGAAAAAGTA 


ACACAGGAAA


CAGCGAAACA 


1200 


AGCAGAGACA 


AAAGCGCCAC 


TGGAAACACC 


AGAGATAAAA 


ACAAACACAA 


CAGCACAACC 


1260 


TGCAGAAAAT 


AGTAGTGAAA 


ATGACCGACA 


AACGAGTATT 


TCGCCACGTT 


GGGTGCCGAA 


1320 


TAATCAAGTG 


ATTTATCCGA 


GAATAACCTT 


GGCTTTCACT 


AATGAATCAG 


GGCAAGCTTT 


1380 


TGCTGATGAA 


AACTTAACGT 


TATCAGGGAC 


GTATACTAAA 


TTTGGATATG 


AAGACGCAAC 


1440 


TAGTTTATGG 


CGACCATATG 


CGGTTAATTA 


TCCAATTACT 


TCTTCTAATG 


CAGGTGGTGG 


1500 


TCAATATCGA 


GTAGCTGTCA 


ATGCAACGGT 


TCCTTTACCA 


TACGATTTTT 


TCACAGATTT 


1560 


ACCAGCAAAT 


TATAGGATGA 


ATATATACGC


GGTTGAACAA 


TTAAAGATTG 


ATAACACGCT 


1620 


AAAGTATGTC 


GACAGTATTG 


TTGCTGTGCC 


CGATTCTACA 


CTGAGGATGA 


CCAGATATAG 


1680 


TGCGCAACAA 


ACAACTGAGT 


TACCGCAGTT 


TGGAAATCAT 


ATGACTCCtT 


TTGtATTACG 


1740 


CACAAGAGAT 


AGCATATTTT 


CTAAAAGCAG 


TGAAAATGTT 


TTCTCTCTAA 


ACACGGATGG 


1800 


TAATGAATTT 


TTAAGAATCC 


TGAGTGTCGC 


CCCCATTGGC 


ACTCCAGTAG 


AACCAGTATT 


1860 


ATACGGATTT 


GATGGCACTG 


CAACGTACCG 


AGAAACACTT 


AATTACGTAG 


TCACTCGGAA 


1920 


ACAAGTCACC 


GAGAAGTTCG 


TAGATGCTAA CGGCGTAgcT 


ATCACGCCGC 


CAACAGGTTT 


1980 


TACCCAAAAC 


AAGAAAACAC 


CAATGACCAG 


CAATGACTTC 


ACCTTCAAAC 


AAGCTAGTAC 


2040 


TTTGCCGGAC 


ACCTATCAAG 


CAGACGGCAA 


AACCTACAAA 


TTCAAAGGCT 


GGTACAAAGG 


2100 


CAAAACCAAG 


CCAAATACCT 


TGACCACCAC 


CAAAGCGCCA 


AGCTATGCGG 


TGACCTATGA 


2160 


TGGGAATGAC 


GATTTGACGG 


TGGTGTATGA 


GGCGATTCAA 


GAAGGAAAAG 


GAATGTCATC 


2220 


TGTTCAGGTG 


ATGAATAAGG 


CGATTCAGAA 


TCCAGTGACT 


TTGCCTGATC 


GCATGTATCG 


2280 


TGTCCTATTT 


ATCAATGAAA 


CAGGGGGGCT 


GACATTTGCT 


GATCGTTATG 


GCTTTACTGG 


2340 


AGAATTAGTT 


GAGGTTGCGG 


ATGGCAAAGT 


GACGCCAATT 


GGCCCAGTCC


CGACTAACAA 


2400 


TAATAGCGCA 


GTAAAAGAAA 


TCACTATACC 


AGGTCGGGTT 


TTCGATACAG 


AAAGACCTAC 


2460 


GCCGTTGCAA 


TATGGAGTTC 


GCAATGCAAC 


CTTTACGTTG 


CCCAAAATGT 


ATAAAaCCGT 


2520 


AACGTATCAA 


CCTGGCCCAA 


ATTATACAGG 


TACAGCCTAC 


TCTATTCCAG 


AAACGTATCG 


2580 


tCTTAGTTTA 


AGAAGTGGTG 


TCmAAmGTGA 


CATCGATACm 


ACTGGATGGC 


GAATTGGCTT 


2640 


GACCCGACAA 


CAGATCCACA 


AAAGTTTACG 


ATGAGCG 






2677 



(2) INFORMATION FOR SEQ ID NO: 519: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 



AGATTGTCCn 


AA1 J. 1 1AAI 1 


ri/TTt A TPP AP 
bbbAAl bbAb 


TPPPP A P.T AT 
1 bbbbnu Iril 


I lUUU nl U 


TTTTPPn TPP 
ill iv^^ILl^,^, 


60


rn m /"* rr* /** T\ rp rp 7\ 7\ 

TTGTGAT 1 AA 


TV A TPPP^ 7_i 

AA 1 bbbnAAb 


PAPTAAATPT 
bAb -L AH/\ J. b 1 




Vj O lw ^ rt v« rt 


P.TPPAAP,Tf;A 


120 


CCAAATATGA 


GbbblAibi 1 


1 1 bb 1 bbA i 1 


PTTPPP A ATT 


APP A A PPP AT 
nb>— nnbbbn 1 


TAP.ATP.T A AP 


180 


TAAGTTTGAT 


ACA I bbbAAA 


b 1 b Ab AbA J. A 


TP 7A P TP A. T A T 

A bAb 1 b A 1 A 1 


PTTTTS TPPT 
b 1 1 1 1 A 1 bb L 


TPTAAP.P.AAT 
1 o 1 nnuunn 1 


240 


TACAAACATT 


AGATGTGTCG 


TV A rn rp m rp t\ rp 7\ 

AAT ill b A I A 


bbAb 1 b AAb 1 


P7AT7AT7AT7A TP 

bAl Al AlAlb 


APTP APATP.T 
Ab 1 bAb A 1 b 1 




TTGGTGAGTG 


TAGCGGATTA


/-i t\ tv A A rn rp tv /"^ 

CAAACATTAb 


7\Tr" , T/ r "'TPPTv7A 
ATGlblbbAA 


mrpipfppj\fri7\ PP 

I I 1 1 bAi Abb 


APTP A APTP. A 
Ab 1 bAAb 1 bA 


360


TATATATGAG 


TGACATGTTT 


GGTGAGTGTA 


/^T\/™ , /™ , 7\rprnA/"" , A 

GAGGAT TAbA 


A Ar^ATTAPAT 

AAb A I lAbAl 


b 1 b 1 bbAAl 1 


420


TTGATACGAG 


TCAAGTGATA 


AATATGAGTA 


GTATGTT1 bb 


TPATTPTAPA 

IbAl 1 b 1 AbA 


PPATTAPAAA 
bbAl lAbAAA 


480


CGTTAGATAT 


AACTAATTTT 


b A T Ab GAG T b 


AAPTPAPAAA. 

AAb i bAbAAA 


TATPA.PTA.PT 
1 A 1 bAb 1 Ab 1 


ATPTTTP.PTP 
Al b 1 1 1 bb 1 b 




ATTGTAGAGG 


j\mm A /"* A A A r"*P 

ATI AbAAAbb 


I 1 AbA i A 1 AA 


PT A ATTTTPA 
b 1 AA 1111 b A 


T APP. APTP A A 
1 AbbAb 1 bnn 


P.TP, APA A AT A 


600 


TGAGTAATAT


GTTTGATGgT 


1 blAbAbbAl 


T A P A A A P A TT 
1 AbAAAbrt 1 1 


AP.ATP.T ATP A 


AAP.TTTP.ATA 


660 


/™"7V7Vr"*rnAAA/"*rn 

CAAGTAAAbT 


AAbbbb 1 A I b 


TV rp rp TV rp T\ rp rp 

AI 1 1 Al A 1 b 1 


TTAP.AP.ATTP. 
1 lAbAbAl lb 


T A CCC P. ATT A 


PAAAPATTAP, 


720 


ATGTAAb J. AA 


b 1 I 1 AAi Abb 


A P T P A A P T A 7A 
Ab i b AAb 1 AA 


P AT ATATP.TP, 


P.AATATP.TTT 
unn iniui i l 


TPTP*P.TTnTA 


780 


b 1 bbAl 1 AbA 


aa.pptta./'" , ap 
AAbb 1 1 AbAb 


TT A TPP A ATT 
1 lAlbbAAl I 


TTP.ATAPPAP, 


TPAACiTAAPA 

1 v^nno X nnv^n 


AATAPGGATG 


840 


AbA lbll 1 b A 


T A A TTfTr TAT 
1 AA 1 1 b 1 b A 1 


P.PPPT A A A A A 


AP,ATAAPATT 
nun 1 nn*_n l l 


C1GGGGCTAAA 

ooooov-. i nnn 


TCAATATTTG 


900 


GGACTAAGAC 


TAATACTAAC 


TTGCCGTCTA 


TAGCAGrTAC 


TTCTTTGTAT 


ACTGGAAGAT 


960 


GGATTGGAGT 


AAACACGAGT 


AACACTTATA 


GTGATAGTAA 


TACATTTATG 


TCTAATTATG 


1020 


aCGGATCaGT 


TCCaGATACG 


TATGTATGGG 


aAAAAGCTAG 


CGTTCTAAAC 


aGTACTTTaG 


1080 


AGCCTTCTTC 


aGTAAGGGtT 


CATTCaGAAA 


GTGAAGTAGA 


ATGGACTTGG 


AAAATAACTA 


1140 


ATTCGTCGTC 


TAAGTCAGCC 


GAGAATGTTT 


ACTCAGATAT 


AACCTTACCT 


GAGGGTCTTA 


1200 


AAATTGACAA 


GAATAGTGTA 


AAAAAGAACA 


ATTTGCCTGT 


GTCGGTAGAT 


GATATTAATG 


1260 


GTATGAACAA 


TTTAGGAACA 


CTATCTAGTA 


ACGAAACAGT 


TACATTTACG 


TTTAAGACAA 


1320 


TAGTATCAGG 


GAAACCAGAT 


AAAtGGTTGG 


AATTAATGGG 


AAAAGTGACT 


kGGGAAGATA 


1380 


ATGcmTCCGC 


ACAGTCAATA 


GTTCTAATAA 


AGTGrAAATC 


ATCGATGAGG 


AGCAAAAAGA 


1440 


TAAAGGGAAT 


CAAACAAATG 


ATTTAGAGTT 


ATTATCAGTT 


CCAGTTGGGT 


TTCG 


1494 



(2) INFORMATION FOR SEQ ID NO: 520: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 

TATTTATGCT GATTCGTTAC TGGAACAACG TATTAAAGCA ATTAAACGTA TGGAAAGAAC 60 

TATCGCTTAA AAATTAAAAA AATTAGTCTA TTCTTTTGAT AGGCTTTTTT TTTTAACCGA 120 

AAGAAAACGC TTACGCATTT TTCTATCTTT TTTTTCAAAT CATATTATTC AATTAAAAAT 180 

GATTCTAATT TCAAGATTAT TTTTTTCATT TCTTTTTTTT CGCCGTCAAA TACCGTCAAA 240

CGTTTTAATA AACATCTGTT ATTTCAAGGT AGTATATAGT TTTTCATAGT TTTCAATAAA 300

CTAGGATAAA AAACTTGTTT ATCCATTACA TTTTACCATC AATCTAATCT TTTTTGAAAA 360 

AATTGGTATT AACTTCTTAA GTAAGTTTAT ACATATAGTT AGTTATTTTA AAACAAAGAG 420

GAGAGAAAAG AGATGAAAAA AAATTTAGTA GTAGGTTTCG TATCAAGTGT CGCTTTAGTA 480

GGAATTGCTA TCGCAGGAGG GGGTAATGTT CAAGCAACTG TTACTGATAG TGCAAGTACT 540

GATATAGGAA TTGGATTTTC AGGACATGGA CCAGGAACGA CACCGGGACC GTTGGATATT 600 

CAATGGGCTC CAATAGGATT AGATTTTAGT AATTCGAACA TGGTAAATAC TACAGCCGTA 660 

CAAGCATTTC CTGAAACCAC TGGAACCAAT AAATATGTTG TTGTAAGTGA GAAACGTTCC 720 

AATGAGCCAA CTAGAGAATG GAGTTTGACT GCGCAATTAT CAGACTTGAC GAATGCGGGA 780 
AGGTCAGAnA CACTTACTGG GGnTGGACTA AAAGTTTGTG CTGCTGTAAA AAGGATACAC 840

TGGAACGAnT ACACCAGAAA GCCCAGGATC AATTGTGAAA nAAG 884
(2) INFORMATION FOR SEQ ID NO: 521:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2583 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 

GAAACGTACA GAAACAAGAC AATTAGAAGT ACTTAAATAT ATATACGAAC AAGTCGAATT 60 

AAAGGGCTAT CCGCCAACCG TTCGCGAAAT TGGGAAGGCT GTTGATTTAT CCTCAACTTC 120 

TACCGTTCAC GGAcATTTAG CTCGTCTTGA AAAAAAGGGT CTAATCTTAC GTGACCCAAC 180 

AAAACCGCGA GCAATCGAAT TAACACCAGA AGGCTTGGAA AAGATCGGAA TTCAGCCGAC 240

AACTATTCCG ATGTTAGGTG TCGTAACTGC TGGTGAACCA ATTTTAGCGG TGGAAGAAGC 300 

ATCAGATTTC TTCCCGTTAC CTCCkGATTt ACGTACAGAA GAAAATGCTT TATTwATGTT 360 

GACGATTCGA gGAGAAAGTA TGATTAACGC AGgTATTCtA GATGGCGACC AAGTGATTGT 420

TCGTAAGCAA TCGAATGCCA ATAATGGCGA CATCGTAATC GCaATGACCG aTGAAGATGA 480

AGCAACCTGT AAACGATTCT TCCGAGAAGT AGATCATATC CGCTTGCAGC CTGAAAATGA 540

TGCATTAGCG CCTATTCTTT TAGATAACGT TACAATTTTA GGTAAAGTtG TTGGTTTATA 600 
TCGCAATCAT


ATTTAATAAA 


AAAAGGAAGT 


CACTCCGTTG 


AGCGGCTTCC 


TTTTTTGCTT 


660 


GTTAACAgTG 


AAtAGCCTTT 


TGGGCTTCCT 


CGATAACCTC 


TTCTTCAGTT 


TCTACAGAAA 


720 


GTAGATAATT 


TAAAGCTTCT 


AAACTTTGCT 


CGGGTTCyCT 


TTTTGcGAGC 


TGACCTaACG 


780 


ACCAAGCAGc 


TGTTCCGCGA 


ATGACAGGAC 


GCTGATCATT 


TAAACAGAGA 


ATAATTTGTG 


840 


GGATAGCTTC 


ACGGCCGCCC 


AAATTcGtAA 


CGCAATTAAA 


GCATTCCGTT 


GCAAtGGTTT 


900 


TTTACCACGC


CAAGAGCCTG 


CTAAGTGTCC 


GAATTGCTGT 


TTAAACTCTT 


TATTGGAAAT 


960 


AGTTAwTAAC 


GGTGCTAATT 


TGGGATAAAC 


TTCTTCAATC 


TTTGGTTCCA 


TCTCTTCATG 


1020 


AAAATGAAAA 


TPTTTTPPTT 

X *w X X X X Vh> V,* X X 


TATTATAAGG 


GCAAACCAGC 


TGACAAATGT 


CGCAACCATA 


1080 




PHP ATTTTTT 

WWW.TXX X X X X X 


TCCGATAGTC 



TTCAGGCATC 


ATGCCTTTTG 


TTTGCGTTTG 


1140 


ATAAGACAAA 


CATTTTTGAG 


CATTCATTCG 


ACCGTCCCCT 


AACAAAGCCC 


CTGTAGGACA 


1200 


www x o a Ort i n 


PAAPGAGTAC 


AATCACCGCA 


ACCATTTGGA 


ACAGGTTCAT 


CTGGTTCAAA 


1260 


TTf; A AT ATTG 


GTGGTPAPPT 

W A Ow X ^_*£^ww X 


CACCTAAAtA 


TACAAAGGAA 


CCAAATTCTT 


CGGTAATCAG 


1320 


TAGAPPATTT 


TTACCAATGA 


ATCCTAAAPC 


CGCCCGCTGA 


GCAACAGCGG 


TGTCAACATA 


1380 


x i wo wwrtu i r\ 


TCTACTTGCG 



GCCGAAAGCG 


CCAGTGTvCT 


TTTTCTGCTT 


CAGTGGCAGC 


1440 


TTGPTPTTPA 

1 1 A w A 1 \zf» 


ATAAAAGCAA 


TTAATTTTTG 


CAACCGATCC 


TGCAAAATGT 


GATGATAATC 


1500 


AATTPCCCAA 
rtrt i i www^rxrv 


GAGGCTCGCG 



CAAATTGTCC 


ACGCTTTTCA 


TCTCTAGGCA 


TTTTTTCCTG 


1560 


AGCTTTGGTG 



GGATAAGCTA 


AAGCAATGGA 


GATAATACTT 


TTAGGATGCT 


CAAACGTTTT 


1620 


TTPTGGGTAA 

j. i w a wu w a nn 


ATTCGTTCTT 



CCACATTTTG 


ATGTTCAAAT 


CCCGAGTTAT 


ATCCTCGCTC 


1680 


TPGTTGTTPA 

A WW A i \J X A W/l 


TGCAGAGAAT 


CTTCTAATTC 


AATAAATGGT 


TCAGCATGAG 


TAAAGCCGAT 


1740 


TTTATCGATG 



CCTAAACGTT 


GACTCTCTTG 


AATAATTTTT 


TCTTTCAATG 


TCGGCATTGC 


1800 


TPATGACTCC 

x w n x uriv x ww 


CTTCATTTTA 


TTATTTATCG 


TTTCGATGAT 


ACGTTTCATA 


GTCATGTGCA 


1860 


TAGAGATTTT 


TPTCGTCCAC 


GACGCCGGGA 


ACAGTGGCAA 


CTTTTTCCCA 


AACAGACCAG 


1920 


TPGATTTPTG 

X W ^Ji \ X X A W X w 


GAAAGAAGGT 


GTCGCCTTCA 


AACGTTTCAT 


GAATCATCGT 


CCGATATAAG 


1980 


APATPGPATG 
n^>n i- w w w^i x w 


CGGGAATCAA 


TTCTTTAAAA 


ACGACAGAAC 


CTCCACCAAT 


AACGGTCACG 


2040 


CCTTCATACT 


TGTCAGCATA 


CGCAAGAATT 


TCCTCTACGG 


AATGCATAAC 


CAAAACGCCT 


2100 


TCAGCTTGGT 


AATCCGGATT 


ACTGGTAAGA 


ACAATCGTTG 


TTCGATTTGG 


TAACGGACGA 


2160 


CATCCCATAC 


CTTCGAAAGT 


TGCACGTCCT 


AAAACCAATG 


TATTATGAAT 


TGTTTTTTCC 


2220 


TTGAAAAATT 


TCAAGTCGTT 


GGGTAAATGC 


CAAGGCAATT 


TGCCTTCTTT 


ACCAATCACT 


2280 


CCTTGTTCAT 


CTTGGGCCCA 


AATGGCTGCT 


AACATAGCTT 


CTCTCTCCTT 


TTTTATACGG 


2340


CAATCGGCGC 


TTTAATCGTT 


GGATGTGGGT 


CATAGCCTTC 


TACTTTAATA 


TCTTCCATAT 


2400 


CAAAATCAAA 


AACAGAAGCC


TTGTCTGGAT 


TCAAAACGAG 


CGTTGGGaAA 


GAACGAACTT 


2460 


CTCGTGATAA 


TTGGTTCTTG 


GCATTTGGTT 


CCACATGGAT 


TTTTGATATA 


AGTGGGGGCA 


2520 


TCTCCTAGCG 


TGnTGGTACA 


AACTCCCCCA 


CTnCTAAAnC 


TGTTTCATGC 


GGCAATCAGA 


2580 
(2) INFORMATION FOR SEQ ID NO: 522:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

GATGGTTTTT CGATCAAATT TCTTAnnCnC TGTAGCAGTA TAAATTAGTA ATAACTTACT 60 

CAGAGATTTT ATAGAGAAAG ACGTTGTTGT AGCTTCTCTT AATTGCTTTA ACTGCTTTTG 120 

CAATTTTAAC GGATTTTTAT CTTTTAATAA TTTAATTTTC GTTAAACGAA AAATGTGTTC 180 

CTGATGCTCA AATGAACGTT CAGCATCTGA ATGACTAAAA CGAAGATATT CTGAAAGCAC 240 

CTCTTGATTT ACAGCAACTT TTCCTTTTGG TAAAAACCAT TCTTTGCCAT TTTGATCTAT 300 

TAGATGCCGC TCACTTTGAT TTGAAATAAC AATTTCTTTA TCGTCTTCAA AAAGATACGA 360 

AAAAAGCATT GCTTTTTTTA TAATTATGTA CTCAGTTACT TCATATGAAG ACAATTTTCT 420

TAATTCTTCT ATCAAACTAT TCCATTGATT ATTTTGAACA AATTTACTTA GTTTGTGAAT 480 

AGGTGGTACT CGTAAAATGA ACTTCTCATC TTTCTCTATT TTCACGATAT AAAAAGTTGT 540 
GTCTAAGTCA TTTTCCCAAA TAAACAAAAT AATCATTTAC TGCCCCCTAG ACTTGTTGTT 600

CCGATACTTA ATAGTCAAAT AAATATATTT AGGAATTTGC ATTAACCGAT CGTATCTCTT 660 

CATTGTCAAA CAACGATATA ACCATTCCAA ATTGTGATTT ATATAAAATT CTGGCGCTCT 720 

TTTTACTGAT CCAGCTAATA CGTCAAAAGT AmCGCcTACG TCTAAAAATA CATTAGCTGT 780

CAGGTGATCA ATTGTTTGTT CCAAAAAAAT CTCTTGTTtC GGTGAACCTA ATGCTACaAA 840

GACAAACCTT GGkTCAGCTT TGkTAAwTTG JcTCCACAATT TCTTGCTGTT TTTTCTTTGT 900 

ATAGCCATGT AAAATACCGC TAATAACAAT CCCTGGATAT TGTCGTTGAA TATTTTTAGC 960 

TGCTG 965 
(2) INFORMATION FOR SEQ ID NO: 523: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 

CATnCCGGGG ATACAATTGG TGGATTGGCT TTAAACTATA TTCATTTTGC GACAAGTGAA 60 

AGTAAGTTAT ACCGTGCGCT TTATTTAGAA GAATATGGTG GCGGCAAAAA AATGCCAAGA 120

ATTCTCTTAT CACTATTTTA CTGAAGCGGT AAAAGCTGAT CCTGAGTATG CTAATTTAAA 180 

CGATGTTCAA ATTGACTCTT TACACATGGG AACATGGGTG GTCGCTACTG GTGTGGCTGC 240
ATTGATGACT TCTGGAATTA TCCATCCTTC AGAAGAACAA ATCATTCATT TAATGAAAGA 300

TAGTATTGAA GCAATTCTTG AACGAGATGA ACCAATTGAT ATTGACCTAT AAAAATTAAA 360 

AAAAGTGTAA CCTCATGCTG ATAACTTCTG TGAGGTTACA GCAAAAGACT GTTTCTTAAA 420

AGAACGGTCT TTTTTTGTTA AGTGTATTAA TGGTAGATTA TTCGCAACCA TGCTACAGTA 480

AAGAAGAGAA ATAGAAaGGG GCAAAAAACA TGAACCGAGT GAAAGATAAG CGTTTATGGC 540

TAAGTATTGT AGGAATTACC GGACTAGGGC TCTTGTCGGG CTTATTTTCA GTAAATGCGA 600 

GAGAATATTA TCAAGCGTTG AGATTACCAA GTTTTGCTCC GCCAGGCTGG TTATTTGGAC 660 

CAGTGTGGTT GGTGTTGTAT ATTTTTATGG GCATTACTTT TTATTTCATC TTAGTTCATC 720 

CAAATAAACA ACAGAAAAAA CGGATGATTA CGCTATTTAT TTTACAATTT ATTGCGAACT 780 

TTTTCTGGAC ATTTTTCTTT TTTTCTTTAC AAAATAATTT ATTATCTGTG ATTGATATCT 840

CATTGTTATG GCTATTATyA ATAATTCAAC AGTGGGACTA CTTGCGTTAT AAATTAATAA 900 

CGCGTGGCTG GTTGATGATT CCTTATATTC TTTGGGTTAC TTTTGCGGCA GCATTAAATT 960 

ACAGTATTTT ATTTTtAAAT TAGGAATATC AAAACACCAG AGAAAAATGG GAACTAGCCA 1020 

TTTTTTTCTG GTGTTTTTTT TATTTATTTC AACGTAGTTA TTCTGCGGCA GTTGCTAATT 1080 

TTTTCGCAAA TTCTTCTAAA CGTTGGAT 1108 
(2) INFORMATION FOR SEQ ID NO: 524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 



TTAGTAATTT 


CCTGTACTTC 


TCTCCCCATA 


CAATTCATAA 


TATCCTCACA 


TTGTTTCGTT 


60 


GTACGTACAT 


CATTCCATAA 


AATAGCAGGA 


CGAATTATTT 


CATTATCCTT 


GTCTAATAAT 


120 


ACTAAACTAT 


GCATTTGTCC 


AGATATACTA 


ATCCCTTTCA 


ATTCTTTTTT 


CATTTCGGGA 


180 


AATTCACTCA 


ATAAATCATT 


AATTACATTT 


TCAATTGCTT 


TAATCCAATC 


GTGTGGATTT 


240 


TGTTCACTAT 


ATCCTGGTCT 


TGAAGAAAAC 


AATGGATAAT 


CTGATGAAGA 


CGTAGCAATG 


300 


ACATCCCCTA 


TTTCAGATAC 


CAATAATCCT 


TTTAATGAAC 


TTGTACCTAA 


ATCTATACCT 


360 


AAAACATAAC 


TCATAATTAT 


TAAATCCTTT 


CAAAAAAgCT 


GAAATACCTA 


CAAAAGATAT 


420 


TTCAGCTCAT 


TTTTCTTAAT 


ACmGaTaATC 


aTTTAAAATA 


TTTTTTACAT 


ATTCTATATG 


480 


ACTTGATTCT 


ATTATTGGTT 


CATCATTTTT 


TAGGGCATAA 


TTAGTTAAAC 


TTTCTAAATT 


540 


TTCTTTTCCA 


GAAATAATTT 


TTGCTCCAAT 


ACCTTTTTTA 


AACGATGAAT 


AACGCTGTTC 


600 


TTTTAGATCC 


TCAAAGAAAC 


GATCCTCTTT 


TAATTTCATC 


GCACTCTTTA 


ATCCTCTCGC 


660 


AAATGTATCC 


ATTCCAGCAA 


TGTGAGCAAG 


TAATAAATCT 


TCCATTTCAA 


AAGATGAaCG 


720 


TCTAACTTTA 


GAATCAAAGT 


TTATTCCCCC 


AGGTTCAATA 


CCCCCATTTT 


CTAGTATTTC 


780 
ATACATAGCT


AATGTAACAT 


CATAAACATT 


AGTAGGAAAT 


TCATCTGTAT 




840


CAGAACATCA 


CCTTGATTAG 


CATCAATTGA 


ACCCAAGGCA 


CCATAGTTTC 


n n n r t a c n t t 


900


AAGTTCATGT 


TCAAATGTAT 


GTCCAGCTAA 


AGTTGCATGG 


TTAGCCTCTA 


oAl 1 1 Abb I 1 


960


AAAATCCTTT 


TCTAACCCGT 


ATTTTAAAAT 


AAATGCCATA 


GTAGTTGCAG 


bA 1 tonAAlt 


1020


GTATTGATGT 


TTTGAAGGCT 


CTTTGGGTTT 


TGGTTCAATC 


AAAAATTGTG 


bb 1 I bl bl bb 


1080


AATTTTTTCA 


CCGTAAAATA 


TAGCCATTTT 


GAATAGACGC 


GCAATATTAT 


tl 1 b AAA 


1140


CTTCATATCA 


GTATTTAGTA 


ATGTTTCATA 


TCCTTCACGT 


CCACCCCAAA 


AAAbAlAAl i 


1200


TTCTCCACCT 


AATTTTTTTG 


AAACaTCTAA 


ACCTTTTTTT 


ACCTGAGCAG 


b 1 bbbAI tAC 


1260


GTATACGTTk 


GCATTATTTG 


TAGATGCTGC 


GCCGTTGACA 


TAACGAGGAT 


1 ibAAAAba I 


1320


ATTTGCAGTA 


TTCCACAATA 


GTTTAATACC 


AGTtTCaTCC


ATCTTTTCTT 


I lATTAAATb 


1380


TGTAATTTCA 


TCAATATTAG 


AAAAGAATTC 


TTTTAAAGAG 


TTACCTTCTG 


bAbCAATAl b 


1440


AATGTCATGA 


AAACAAAAAT 


ATTCAACATC 


TAGCTTTTCT 


AAAATTTCAA 


AAAAAbC x A I 


1500


AACTCTTTTT 


TTTGCAGTCT 


CCATAGGAGA 


TTCTCCCAAC 


CaACTTCGCT 


rp 71 rp rp 71 Tiril,rr 

1 ATTAACKGC 


1560


TTTTCCAAAA 


GGATCTGAAC 


CGTCTTGTGT 


CATAGTGTGC 


CAGTATGCTA 


bAbbAAATbb 


1620


TAAATGTTCT 


tTCATAGTCT 


TTCCCATAAT 


TATTTCTTCT 


GAGTTATAGT 


G JL CGAAACGb 


1680


GTACATATTT 


TTTGTGTTCG 


TTCCTTCATA 


TTTAATTTTT 


TCGATTTTTG 


bbAAb I AAAb 


1740


CATGTTAAAA 


GTCTCCTTAT 


CATCTTTAAA 


ATTGTTATTT 


CAATTCGATT 


AATATTTTTA 


1800


TAATTTATAT 


ACTATTGAAA 


ATTAATTAGT 


CAGAGGTAGT 


TTTAATTTTA 


bib I lbb 1 b 1 


1860


TCTTCAGCAA 


TTTGTAACTT 


CGGTATTTTA 


ATACTTTCGG 


TACCTATTTT 


bAA 1 AAA 1 b J. 


1920


TCTACAAGAT 


TTTCCTCTTT 


TATAGCAAAG 


AATGATTCTA 


AAACCATTCC 


b AAA I l AAb i 


1980


CCAGTTATTA 


CCTGAATTCT 


TGAGTCTTCA 


CTTGCTAGCA 


TCATGGCAGT 


7\ rp rp rp /- p rp p p t\ 

Al 1 IbblbbA 




GTCCCACCCA 


TCAGATCTGC 


TAAAATCACA 


ATTCTTTTTG 


CATCATTTAA 


AGTACCmATa 


2100 


ATTTTATCTA 


ATTTATTAAA


GGTCCCTTCA 


ATACCATCAT 


CAGGCGTCAT 


AGAAACaACA 


2160 


GGATAaTTTA 


TTTGATCACC 


TAwGATCATT 


tGAGCCGAAT 


TCAAAATTTC 


TTTCGCCATA 


2220 


AGGCCATGGC 


TCATTAAAAT 


TACTTCGTTG 


GTCATAATGA 


TCTCCTTTAC 


AATATATCTT 


2280 


TTAAAAAAGT 


CTTTTTATCA 


CCGGGGGCCA 


TAATnGTGGT 


AATTCATATC 


TCGACTTAAT 


2340 



AGTTCCnCA 2349

(2) INFORMATION FOR SEQ ID NO: 525: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 
TACGGCGTCG


AAAGAAAAAT 


AAATAGACCG 


TACTAAGAAT 


AAAAAAGCGA 


CTGAACTTTT 


60 


CAAAAGATCA 


GTCGCTTTTT 


TGTGCTTATT 


TTTAATTTTA 


AGCCCGTTTA 


AGCCGTTTTA 


120 


AGTATTTTTC 


TAGTAAAAAA 


GTACTTAGAA 


ATTAAAAACG 


GCTCTTAAAA 


CGCAAATATG 


180 


AGPPAAATAA 


ATAAATTTTA 


ATTTTTATCA 


GTACGAGCGA 


TTTTTTACGT 


ACTGCCCATT 


240 


T Ei P A T n P A A A 


TTTAAAAATT 


AGCAAAAAAT 


CGTTCCGAAT 


TTGCATTTTA 


AATGGGTAAA 


300 


1 1 uLnu 1 i J. 


AATCGCATAG 


TTCTCGCAGA 


GGCTAAAAAA 


TAAAAAAGCA 


CTGAAATATT 


360 


l^AVj 1 V3*~ 111 


TTTTATTTTT 



CTTCTTCAAT 


TTCGTTGATT 


GTCTGATGAT 


TAATTTTTCT 


420 


TTTTTPTTfT 


TCGGTTAATT 


CGTGAAGTTC 


ACTCGCCAGG 


TACTCTTTCT 


GATTCCAAAT 


480 


AT A AT A A ATT 


TGATACAAAT 


ATTCAGTTGG 


ATCAATTTCT 


TTGAGATAAT 


CCAATTCTCC 


540 


f^TTTTTT AGT 
O 1 1 1 1 1 l nu x 


TTCTTTCTCG 


CCTCTTTAAA 


AAGTCCTGAA 


TAGACTAAAA 


TCTGCTTACC 


600 


TTTT A A AH AT 


TTATAATACG 


CGTCAAATAT 


TTTTTGATTG 


ATAAGATAAT 


CGCTATCTTT 


660 


A PPPP A AT AP 


TTCGCCATTT 


CGTATAATTC 


TTTGTTGTTA 


TTTTGCTTAA 


TTTTTTGAAC 


720 




GTGATCTCCG 


GCATTCCTGT 


GACATCTCGC 


CACATTTCGA 


GCCATTCTTT 


780 


TTPTf A A AT A 


TAAAPPTTAG 


AATCTTTAAA 


ATAAGATTTA 


TTTACAGCAA 


TTAATACATG 


840 




TTAAAATCAT 


CTCGTTGcTT 


GTTGTATGTG 


aTTTCTAATT 


TTCTGACGTA 


900 


aPPTTTPATP 


ATPGCTTTAA 


CATTTTTTCT 


TTTCACTAAC 


TTTCCAAACG 


ATAAATTAtA 


960 


TTTTTTT ATT 
llllllimi 


TPaTCTTCTA 


ATTCTTCAGC 


TTTCACATTT 


GGTGTCGTTA 


AGGTTAAGAA 


1020


AZiTAAAPTPT 


TTTTTCTCTT 



CTTGCTTAAT 


GTACTGCATC 


ATCAACGATA 


GACCTAACGC 


1080 


ATPTTTTPtC 


GCTCGTCGCC 


ACGCACACAT 


CGGACAGAAA 


CGATTTTTAC 


ACATATTATT 


1140 


TTTATAPAGT 

x i X c\ x nv^nu x 


TTCTTTTTTG 


TCTGCTCTTT 


ATCCGTTACA 


AACGCTAAAA 


ATGTGTTACA 


1200 


GTPPTTAATC 


AGATCCATCT 


GTTTTTCTCC 


CACAAATTTA 


TTAATAAAGT 


TCTGAAACAC 


1260 


TTGATTTPTC 


TTCTTTTTCT 


CAGTATACTT 


TTCCATATCA 


GATATATAAA 


ACAACTGGTT 


1320 


PTGTTPPAAA 


GTTTTAAATA 



AAGAATAAAA 


TCCCTTACGG 


TATCTATGAT 


TTAAGCTGGG 


1380 


ATTCCCAATA 


ATACCTTGAT 


TTCAGTACAG 


ACCGAAAACC 


CGAAGAGAGT 


GCCTTCTTTT 


1440 


CGGGTTyTCT 


TATATAATCC 


TCGAATGGCT 


TCCATGCCTT 


TAATCGTGGT 




1500


CGTAAACTTC 


GATAGAATTT 


ATTGCGTCTC 


TTTACTGGAC 


GATGGTCTTG 


TTCAATCAAA 


1560 


TTATTCAGGT 


ATTTAATGGT 


ACGATGTTCT 


GTCCCTTGAT 


AAAAGCCGTA 


TTCTTTTAGT 


1620 


TTCTTAAAGG 


CACTTGTAAT 


AGAGGGGGCT 


TTATCTGTGA 


CTACAACCTT 


CGGTTCATCA 


1680 


AACTGCTTCA 


CTAACCGCTT 


AAGAAAAGCA 


TAGGCTGCTT 


GTGTGnCCGT 


TTTTT


1735 



(2) INFORMATION FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526:

AAGATTTAAA AGAGTCTGTC AACAGTTTAG AAGATAAGTT TTTAACAGTG GATGAATTAC 60

GAGCTTTTTT AAACTATGGG ATAGTTTATG AAGAGTTACC CATGTCTGTT CTTTTTCATG 120 

TTTTGTTTTA TACTGGGTGT CGTGTGAGTG AAGCGCTTGC TTTACAGCCT CAAGaTATAG 180 

ATTTTGAACG AAATGAAATT TTGTTTTATA AACAGACAGC AGTAAAAGGA AAAAGTAAAG 240

ATTTTCGAAT AGAAACTACA AAAACTGTTA GTTCTGCAAG AAGAGTACCC GTCACACCAT 300 

TAGTAATGGA GAAATTACAA GAATTAATTG ATGTATTAAA TAAAACAAAA AGAAATAGTA 360 

ATTTTGTTGT AGATGAAACT TATTTATTTG TTTATCTCGA TCCTGGAAAg CGTGGAGTTC 420 

GTTATCGTCG TGAATATGTA AACGATCATG TTAAACGTTG TGTAGAGCGA TGTGGAATTA 480

ACAAAGATTT TCACACCCAT TTGGCTCGTC ATACTATGGC TAGTTTAGTG GCAGAgtAtT 540 

GCmGTJcGGGA TGkTTwAAAG GATCGtTTAG GTCATACnGA CAGTACAACC TCAAAAATAT 600 

ATCGCCATTT AACATCAACT GAAAAGTTAA AGCCTCTGAC TGCTTTTAAT nCTTTAGAAG 660 

GTTAAAAAGA TGGTGAAAGT ACTGGTATTA CAGGATTCTT GAAAAACATT TTTTTAATAG 720 

GTTAGTCAAA GTGTAGTCAG AGGC 744
(2) INFORMATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527: 



TGAAGAAGTT 


ATATTGGTAG 


TTTCnCTCTC 


nGAAAaAAAG 


TTArGCAAAa 


CAAtGGATaA 


60 


aGAGTATCmA 


GaATGCTAAT 


CCtTATTAGT 


TAAAGTTCTA 


TCTGGAAAAm 


GCaAAAGTAT 


120 


TTGAGACACA 


ACACGCAGCC 


TATAGAGATT 


CAAGTGATCT 


GTGTAAAGAA 


TTAAAATAAA 


180 


AAACGAACGT 


TCATGTTGAA 


ATTAGAAATT 


AAATGTTTTA 


TCATACTAAT 


AGTATAAATT 


240 


ATCTGTAATG 


GATAATTATT 


TTTTATAGAT 


TGAATAAAAT 


GTTCGTTCAT 


TTTATTGTTT 


300 


TGTTACTTAA 


ATAGTTTTGG 


AATTTTAAGG 


AGGAGAAAAA 


ATGTTTTTTA 


CTGGTAAAGA 


360 


AAGGAATTTG 


AAATTTGCTA 


TTAAGAAGAA 


ACACGGTGGC 


GGTGGTGCAG 


CATCAGTAAT 


420 


TATCGGATGT 


ATTATTTTCG 


GAAGTATTGC 


GTTTGGTGGG 


ACAACATTAn 


CTTACGCTGA 


480 


TGAAGTGCAT 


AATAGTATAA 


ATCAGGATAT 


ACAAGATTCT 


GGTAGTACAA 


TTATTGGAGA 


540 


AAATGATTCT 


TCTACCAAAT 


CAGCTGAGTA 


TAAAATGATT 


CATGAAATTG 


ATGGAACTAA 


600 


AATTAGTAAC 


GGTGAAAaTA


GTnAaGAAaC 


AaCTACAAGT 


TCAGGAACTA 


TACTGGCTGA 


660 


AGAAGCAATA 


GAAAGTTCAA 


ATCAAAAAAA 


TTCAAAGACA 


AGTGAAGTCG 


AACAGGATCT 


720 


TCATAAAGAT 


GTATCAGGAT 


CTGAATCAGT 


AAAACAAGTA 


GAAACTTCTG 


ATTCTATAAA 


780 
AAAATCTGAA


GAATCAGCTG 


TTAAAACATI 


AAAICTGGAT 


GA1 I CACAAG 


AGAATACTAA


840


TTCAATAACT 


ACCAAGGCAG 


AAAATGATGC 


GCTATCTACA 


GTTAAI GATG.. 


AAAAAGTATT


900


AAATGAAAGT 


GATAGTATTA


TP7V 7\ ATP7V AT 

TCAAA1CAAI 


1 CC I iCbbAA 


AC Ab Ab AA 1 b 


TCGATAATAA


960


TAGTAAAAGT 


GATAATAGGG 


ACTTAGTAGA


AAACTCAG i G 


G AAAC I AC i A 


AGGAAATAAT


1020


GAAAGAGGCT 


GATAAAACTG 


AAATAGCTAA


AAGGAATGAC


AATATCTCTG 


TATTGAATGC


1080 


TGGAACTGAA 


TCATTTAATG 


ATAGTGATAA 


AATAAGAAGA


TCTAAAAGAG 


ATAATGAATC


1140 


TCCTGTTCCA 


AGTTTGAGTA 


TTCAAATGCC 


AGTCACAAAA 


GAGATTTACG 


AAAATGATAG 


1200 


TACTATTAAA 


GGAAAGTCTG 


AACCAGGAGT 


AACGATAAAA 


ATTTATAAAA 


ATGGTATTGA 


1260 


ACTGGGGTCT 


ATTAAAGCAG 


ACGATGATGG 


AAATTTTGAA 


TATCCTCTAG 


AATCACAAGC 


1320 


TATCAAAAGT 


GATAACTACT 


CTTTCGTTGC 


GGAAAAAGAT 


GGAGCGAAAA 


GTATAGAGAG


1380 


CTCTGTAAAT 


GTAGTAGAAA 


ATAAAGAAAT 


AGTAGGTAAT 


CAACGAGATG 


CGTATACTCC 


1440 


ATATGGGTTT 


GGTATATATA 


ATCGAGGTTT 


AGTTACTACT 


GATGATGGAA 


AGAAACATTA 


1500 


TACAGTTGAT 


ATGGAGGGGA 


ATTTrGCATC 


TGAAGCGTAC 


TTTGATCTTG 


CAGGCGGAAC 


1560 


AATGTACTAT 


TCAATTGATG 


AAAAATTTGC 


ACCATATGTT 


GAAAAAATAG 


TTCTAGATGG 


1620 


TAAAACAATT 


ATTTCTAAAA 


ATCCGTATGA 


ATTGCCCAAT 


AAAACCAATA 


TTTGGTCTTC 


1680 


TGGTATTCTT 


AG T AC AG AT A 


AAAGAGGAGG 


ATTGGTTCGT 


GCTGCTTTAG 


TGGGTCGTAC 


1740 


TAACGGAACA 


ATAGATATTT 


ATTTTAAAGA 


TGATACTCCT 


CAAGAAGTTC 


TCAATAGTGA 


i a a a 
1800 


AATACCTTTC CAAGTTTGGG CTAGATTTAA AGAGAAAAAA GAGGGAGTAA AAAATGATCT 1860

AATTTCTGAT TTTGATCTAA ATATTTTACT TCAAAACAAT AAATTAATAT TAAGTAAAGA 1920

TACCTTTTTC AAAGATGAAG ATATTGTCAA TAATAATGGT CCTAAAATTA GTTCTAATTT 1980

TAACTATACC AATAATACTA TTGATATTAA TTATGGAGTA AATACTGGTA ATTATGGCTT 2040

AACGATAGGG AAACGTAGTC CTTATAACTT GCATATTAAT ACAGGGAAAG AATTAAGTGA 2100

TTTAGTTGAT ACGATAAAGA TTGACAATAA AAATTATGAG TTCGAAAAAC TTCCTGATGG 2160

TTCGTTAGTG ATTAAAGATA TATACAAAAA GGGAATTTTA GGTGGAGCAT TACTTAACCG 2220

CCAAGAGATT ACAATCAGTT TAGGATTAAT AGCGAAAAAA AATCTGGGCG ATCTCATTGA 2280

TTTCAAACAA GAATTATTAA AACTTGATGT ATCTATTAGA GATAATACAA CCTTTAAGCT 2340




AGTAAAAGGA 


GAAGCTACAG 


AAGCTATCCG 


mft t. t\ tv rp t\ /■*• T\ rp 

TAAAATAGAT 


TCCTCATTCG 


A 1 AC AT. T. I AA 


9 ^ on 
Z h uu 


ACAGGAATTA ACGAAATGGA TTGAAAAAAG AGATAGTAAT CCACAGGAAC CTATAACTAT 2460

TAATTCTGGA GACAATACTT CTATAGTGGA TCCTAAAGAG ATGGTTGAAT TTGCAGTAAA 2520

AAATCCAACC ACAGAAAATA TTAATAAAGC ACGTAGTTTG GTTTCTAAAA TGTGGCTTTG 2580

GGACAAAAAC AAATATAATC CAGTTCTTAA TGCTGCTGAA GAAGTAAACA GATATCTTTC 2640

AACTACGTTA AATGGTTCAT CAAATGATGA TGGCTTGATA TTAAATAATA CTCATGCGGC 2700

AACTTTCGTT AAATTAATTG ATACGGATAA AGATGGGATA CTTGATAGAT ACGAAGCAAA 2760
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CATGGGAATT GGAACCAACT ACACAAATGC TGATTCTGAT GGTGATGGAA AAAGTGACGG 2820 

ATTTGAAATT TTTAATGATA CTGATCCTTT GGTGAGTCCG TACGATTGGT TTGATAAAAA 2880 

TGGAGAAAAA ATAAAAATAG TAACAACCGA TACTGATACG ATAAGTGGAA GAATAGGAAA 2940

TAACAACTAT GATACTGAAA ATGTCTACCC TAGAACTGTT CAATTAATAA AGGTTACTGA 3000 

TAACGGAGAA AATCTGATTT CTGAAATTAG TTCTTCTGAA GATAGAAAAG GGACTTTTGA 3060 
ATTTACTGGA TTGAGTGGTA AATTAAGCAA GGGAGATAGA CTCGTTGTTA AAATTATTAC 3120

TAATGAAGTA CAAAGAGAAA TTGATCGACA AAGAAATGTT GTTCAAGTTG GTTATGATAA 3180 

TCCAGAGGTA TCTGAAGAAG TAATTGTTCA GGGCGCACAA GTAACAGTCT CTTTTGACTT 3240

TAATTATAAT CAAACTGATC CTGACTTTGA CCGTCAACCA GTTACTCAAG TAGTAGAATT 3300 

AGAAAAAGGT TCAAGGCTTT TAAATAGTTA TTTTACTCCA GAACGAAAAG GCTATAAATT 3360 

TATTGGATGG AATACTGATA AACTTGGAGT AGGACAAACT GTCAATGATG GTTCTAGTTT 3420

TAATGAAGAT ATAACGGTAT TTGCTCAATG GGAAAAAGAA CCTAACCTAG CTGGAGAAGT 3480

ACATGCTCCA CAAGGTCCTA TTGAAGAAGG TAAAAGTATT GTTCTAAATA ATGTAGTACA 3540 

GTCCAATAAA TCAGGGTCTG TAATTGTAAC TGGAGAACTT CCAAAAGGGC TCTTTGGGTT 3600 

ATCTATAAAT GAAACAGGTA ATCTAATTGG TACTCCACTT ATCAATGACT GGATTGATGG 3660 

AGAAAATAGT CGGGAAGTAA AGATTCCAGT TACAATTAGT AATGGTGACG AAAAGGTTAT 3720 

GGTGGAAGTT CCATTAACGA TCCTGCATGA TACTCGAGCG GCAGACGCAG TAAAAGCAGC 3780 

AGAAGACGCA GGCAAAGCTG GAGCAGATAA GAAAGCCGAA GTGGAAACCG ACGGTTTAGT 3840 

GACTCCAGAG GAAAAAGCGG CAGTGGATGG CTTGAATGAC ACGACTACCG CGAAGAAAGA 3900 

AGACGCAAGC AAGTTAGTGG ATGCCTTGCC AGAAGGACCA GTAAAAGATA GCTTGAAAGA 3960 

TCGCTTAGAT AAAGTGACAA CCTCTGAAGT AACAGTGAAT GATGCAGATA GCAATGGCAA 4020

AGCGGACGAT GTAGATTTAG CTGAAAAAGC AGCGGCAGAC GCAGTAAAAG CAGCAGAAGA 4080

CGCAGGyAAA GCTGGAGCAG ATAAGAAAGC CGAAGTGGAA ACCGACGGTT TAGTGACTCC 4140

AGAGGAAAAA GCGGCAGTGG ATGGCTTGAA TGACACGACT ACCGCGAAGA AAGAAGACGC 4200

AAGCAAGTTA GTGGATGCCT TGCCAGAAGG ACCAGTAAAA GATAGCTTGA AAGATCGCTT 4260

AGATAAAGTG ACAACCTCTG AAGTAACAGT GAATGATGCA GATAGCAATG GCAAAGCGGA 4320

CGATGTAGAT TTAGCTGAAA AAGCAGCGGC AGACGCAGTA AAAGCAGCAG AAGACGCAGG 4380

CAAAGCTGGA GCAGATAAGA AAGCCGAAGT GGAAACCGAC GGTTTAGTGA CTCCAGAGGA 4440

AAAAGCGGCA GTGGATGGCT TGAATGACAC GACTACCGCG AAGAAAGAAG ACGCAAGCAA 4500

GTTAGTGGAT GCCTTGCCAG AAGGACCAGT AAAAGATAGC TTGAAAGATC GCTTAGATAA 4560

AGTGACAACC TCTGAAGTAA CAGTGAATGA TGCAGATAGC AATGGCAAAG CGGACGATGT 4620

AGATTTAGCT GAAAAAGCAG CGGCAGACGC AGTAAAAGCA GCAGAAGACG CAGGTAAAGC 4680

TGGAGCAGAT AAGAAAGCCG AAGTGGAAAC CGACGGTTTA GTGACTCCAG AGGAAAAAGC 4740
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bbLAb 1 bbA 1 


bbL 1 1 b 1 1 Ho 


AAATAAAACA 


fiTCTTPATTT 


ATGCCGTTTG 


A A A ATTTATT 
xxx n x x 


4800 


I I LbAL 1 ALA 


HA I bH 1 1 nv A 


par APTTTPP 
tn^no ill 


TAAAAPTGGT 


GAAAAATPTP, 


ATTPTATTTT 

n ll^lnilll 


4860 


AAlCAI 1 1 Al 


P^APPTTTTAT 

bbHbb 1 1 1 HI 


T ATTPTTA AP 
lnl 1 \* 1 1 nno 


TA^TATAf^nA 


TTATTAC;C1AA 


TAAA A AAAAP 


4 920 


* m * ta tv tv rn 7\ t\ rp 

aaaaaataai 


ALbAAl lAAb 


1 1 lol 1 luln 


TTPTTPTTTB 
1 lv^l l^i l in 


AP.AAAP1P1ATA 


PPTPT ATA A A 
\jo ± \j 1 n 1 nnn 


4 980 


TTTTATCAAT 


AAAAAbL I bA 


PT ATTTPTP A 
L1A1 1 1 o 1 LH 


A ATA^AP,TTf^ 


ATAPA AT^AT " 


AATAAAAPAT 
HH 1 nnrinun 1 


sod n 


CATCTAAGGT 


Tv /— » rv rn rp rp 

AGGATT" L1L 


i 1 1 1 L 1 AL 1 1 


1 rtud 1 vjrt^* 1 1 


TTTPTTTAAA 
1 1 lul 1 1 nMH 


TATPZiPATAA 
1 H 1 LHVjH 1 HH 


si no 


ATTTTGATTG 


GACCTGTAGT 


rn/-» A A T A T A P A 

I b AA i A I AbA 


1 L 1 b 1 Lb 1 nu 


TTAPAPAAPP 
1 1HLHLHHLL 


T A a a r rr*rT' a t 
1HHH1 LbLH 1 


JiOU 


ACTTTTCAAT 


TCTAGAGGGG 


AAGTC1 lb 1 I 


bAb i A 1 1 Abb 


*T ATT A nff AT 
lnl IHHoLHl 


PBTAPPPP AP 
LH 1 HbLLLHb 


coon 
ozzu 


TTTTGAATTG 


CCTAACCACT 


TLbLTTbl AA 


rp rp pmrp rp T\ *Ti/^ 

1 1 L 1 1 1 LA 1 b 


PPT A A B.P A A A 

Lb 1 AAHLHHH 


PTD PTTTTAP 
o 1 nu 1111 r\\3 




GAGTACATTC 


TCATTAGTTA 


Ab I 1 bAbl Ab 


LAI 1 Al L 1 AA 


TPPAAAAAPP 

1 bLAAAHHLL 


TTTPPPTTTT 
111 LLL 1111 


J J 1 u 


AGCTCATAAT 


•n 7\ rn nmm rn T\ /TP 

AATCTTTAL I 


AAbTL 1 LL 1 L 


L 1 1 1 L 1 1 LbA 


PTATT ATTTP 
b 1 Ai lnl 1 1V^ 


ATTTPTPTAT 
HI 1 ILiVjlHl 


5400 


AAATTGCATC 


TTTTTCTTCT 


rp fTi tv rp rp rp/*» tv 

1 L I Ab 111 LA 


TPATTAAPAA 

1 bA 1 1 AAL 


PTAAAAATTT 


TTTT APA APP 
1111 nunnu o 


54 60 


tv ^ t\ TV TV /" , t\ rp TV 

ACACAAb A 1 A 


Al ALA1 i 1 Ai 


PPTPTAATAP 
bL 1 b 1 AH 1 Ho 


AT A A AP APHA 


T A ATT ATP.TT 
1 nn 1 Inlol i 


TAATPAAAAP 

1 nn X v^nnnno 


5520 


GAAGAGAAGC 


ti /"* m rn c** tv t\ tv rp tv 

AGTTCAAATA 


rbLAAl LAAA 


a a/""a a aa ?\rr 
AAbAAAAAbb 


bbbbbb 1 nun 


PP RTTTTTTT 
toL-nl 111111 


5580 


GTATTATCGG 


AG TAG T ATT G 


T AAG AAAAL T 


AAAf*A r P r PA A A 

AAAb A 1 1AAA 


AATAPPAAAP 

AA 1 Ab L AAAL 


pp A r-'ATTATT 
bbHiHl lnl 1 




TCCAkTTtCT 


TATTTTTGCA 


7\ AAA A T 1 Z*"' A A T» 

AAAAATGAAl 


b 1 1 LAI btb 1 


ATPA A APTA A 

A I b AAAb I HH 


PP A A T A A ATT 
LLHH1HHH1 1 




TATCTTTTCy 


CgAAGGAAGT 


TTATTTTTAA 


TAATTTbAbA 


bbb IbAl 1 Al 


ATTPTPTTTT 
Hliblbllll 


D f OU 


GAAAATAAAG 


GGTATTTTTT 


m /"■» TV TV TV rp /*i rp r*> TV 

TGAAATCTCA 


G3LA1 1 bL lb 


AATTTPTTAA 

AAI 1 1 b 1 1HH 


APTAPAAAP.A 
HL 1 Honnnon 


JOtU 


AGGAAGCAAC 


GCGGCTCAbL 


TAf^ATTAAAT 

1ALAI 1AAA1 


AL 1 1 Al bAL 1 


ATATTTTPZiT 
HI HI 1 1 1 oH 1 


TA AT ATTPTA 
inn lnl loin 


5880 


AAACAATTGT 


y^i rr* TV TV /"* TV /"* TV 

GTAAGCAGCA 


A A A A A A A TTP 

AAAAAAAl 1 b 


A 1 1 1 AL 1 1 AH 


TAAAPTPRRA 
1 HHHL 1 LHHH 


P A ATTTPA AP 
oHn ill ^nnu 


5940 


ACTATATTAA 


/*» otv » mm m T\ 7\ rp 

GCAATTTAA1 


rp/— tv A T P A rn 

1 bAAb 1 bbAI 


bLLAHHHl 1 1 


PTPATTATPT 
L 1 LH 1 l/ilLl 


TTTTTTPTTP 
1 1 1 1 1 It 1 


6000 


ATTAATTGTA 


GATT rbAAAL 


rp/-' TV HP A T» A A A 

1 b A 1 AAAb AA 


b 1 HH J HbHL 1 


PTPTAPAAAP. 


TPT AP A A A A A 
lol M^nnnnn 


6060 


GATGCTCTTA 


j^-i T\ TV /""""p 7\ /TP TV, 

GAGACTAC T A 


1 1 b 1 bbb 1 1 A 


AAi 1 1 LHbbb 


ATAAPATAAA 
H 1 HHLH 1 HHH 


TAAPTATAAA 
1 nrtt 1 H 1 HHH 


■ 6120 


TTAATAGTTC 


TV m TV TV TV /TP TV TV 

AT AC AACT AA 


ATATTAATTT 


1111 1 AbAAL 


P7ATPTTAAAP 
LAlbl 1HHHL 


P APT AT PTP A 
unO lnlul on 


6180 


TATTTACAAT 


TCCAILI Lbl 


A 1 b 1 bL i AAL 


L 1 1 1 1 H 1 LL 1 


TAAAPPTATT 
i HHHo Unl I 


TPPTTT A TPT 
ILui l lnlol 


6240 


/TTt TA TV 7\ A /"*/Tp 

GTAGAAALbl 


bAb I I 1 1 I b 1 


ttppta.pptt 
1 1 Lb 1 ALL 1 1 


HLHLLHHLuH 


A A ATP ATT A A 
nnnlUnl inn 


APTPAATAAT 
no l v^nn l nn i 


6300 


GAAAATGGGA 


LbAbAAl bbb 


7A P A A T APTP A 

AL AA 1 AL 1 LA 


PA A A A A A ATP 


TPATTATTPP 
lunl lnl 1 U^. 


PPPATTTTTT 

V_, Vwv^/i 1 11111 


6360 


/■* tv rn rp rn /"^ T\ TV 7\ 

TGATTTCAAA 


b 1 AL AL AAAA 


a. pp a ptp a tt 
AbLALlLAl 1 


PPTPTAAP.AT 
LL lol HHVjH 1 


TA A AP.TP.APP, 
1 HHnO 1 Vjn^Vj 


AAAPPPAATP 


6420 


AGAAAGGAAG CGAGTGCTTT atttttaaaa TTATACCATC AATCAAACAA CTTTACCATT 6480

AGATATGGAA CAATATATTC CTGAAACAGA TGTCGCTTTT GCGGTCAATG ATTTAGTCGA 6540

AGCCATGCCA CAAACACTTT ttcttaaatt AGAAGAACAA CTAGGATGAC TGGCCTACCA 6600

CCCTAAAATA ATGTTAAAAA TTATTCTATG TGCTTACACG CAGCGTGTTT TTTCAGATCG 6660

TAAAATTGGA TTTCTATTAG ATGACAGCTA TCGTATGCGT TGGCTAGCAA ACCACGAACA 6720
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AGTCAGTTAC CGGACCATCA ATCGCTTCCA AAGTCAGGAA ACAACTGCTC ACCTTTTAGC 6780 

AGAAGCCTTT GTCTTATTCC GCTGTCAATT AATCACCAAT CAGGTAATTG ACAATGAAGC 6840

GCTATATATT GATGGCACAA AAATCGAAGC GGATGCCAAT AAATTCAGTT 7TGTCTGGCG 6900

CAAAGCAACA AATCAGTATG AAGCTTCTTT GGATAAACAA TCAAACGAAT 7C7ATCAAAC 6960 

CCTTTATAAA GAAGAAATCT TGCCTTCTTT AAAAGAAGAA AACCAAAGTG ATAGATTGAC 7020

AAGCAACTAA TTGGAAGAAA TCGTGCATCA CTTGGAGACA GAATTAGTGG CGACAGAAGA 7080 

ACAACTACAA AATGAGAAAA GGATAGA 7107 
(2) INFORMATION FOR SEQ ID NO: 528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528: 

TGAATGGATT TGGCAATATG CTAAAAAAAA TAAGAATAAG TTACTGCTAG CTGTTATTTT 60 

TGTCGTTATT AACGCTATTT TAATTGTTAT TATTCCATTG TTATCTGGCA GAATTGTAGA 120 

CCAAGTAATC AATCAGGGGA AAACTGAAGA GCTTGTCCCA ATTCTTGGGA TTATGATTGG 180 

AGTGACCGTT ATTCGGACAG TTGTTCGTTA TAGTTATCAA GTCTTGTTTG AATCGGTTGG 240 

ACAATCTGCG TTATTTGACT TAAGAAAAGA TATGTATGTC AAACTTCAAG AATTAGATTT 300 

TGATTTCTTT AATCATACAC GTGTTGGCGA CATTATGGCG CCAATGACAG GGGATACAGA 360

CGCTATCCGC CATTTTGTwT CTTGGGTCAC TTATAACCTC TTAGAATGTA TTTnATGGkT 420 

CATCGCAGCC GTGATTGTGA TGGGGACGAT CAATTGGCAA TTGATGCTAG CATtAATTTT 480

AATTACACCT TTGATTTTTC TGTTAACACA AAAAATGTCG AAGGAATCGC ATCCGGTATT 540

TTTTGAGATC CGGGAAAGTT TTTCTCGGCT AAATTCGATG GTCGAAGAAA ATATTGGTGG 600 

AAATCGGGTG GTTAAAGCTT TTTCTCAAGA ACCTTATGAA ATTGAAAAGT TTAACAAGCA 660 

CAATGAAGAC TATAAAAAAT GGAATATGGC TTCTGCAGAT GTCTCAAAAA AA7ATTTACC 720 

GTGGTTAGAC GGTTTTGCAG GTAGTTTAAA TGTTATTGCC TTAGTTCTTG GTGGCCTCTT 780 

TGTAATTCAA GGTCGGATGA CAATTGGTGA TTTAGTGGCC TTTAACGGTT TTCTCTGGAT 840 

GTTGAACATG CCGATGCGAA TGAGTGGCTG GTTGATCAAT GATGTGCAAC GC7TTACGGC 900 

ATCGTCCATT AAAATTCGTC AGTTATTAGC AACGAAACCG AAGATTCCCA TTGCTCGGGA 960 

AAAAGAAGCA GAACCAATCC AAGGCTATGT GACGTTTGAT CACGTGTCCT TTCATTTTTC 1020 

AGATGATCCA GAAACCCCCA TTTTATCGGA TGTTTCATTT TCAGCAAAAC CTGGTGAAAC 1080 

AATTGGTATT TTAGGGGAAA CTGGTTCAGG CAAAACCACT TTAGTTAACT TAATCGCTCG 1140 

GTTCTATGAT CCAACAAAAG GCACTGTTTA TATTGATGGA AAAGATGCTA AAAGCTATTC 1200 

TGTTCGTAAA TTGCGAGAAA ATATCTCCAT GGTTATGCAA GATGTCTTTC 7T7TTTCCAA 1260 
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TACAATAGAA GATAACATTG CTTTTGGTAA CACATTGGCT AACTTGGAAC AAGTGCAAAT 1320 

GATGGCTGAA ATTGCTGATG ' CtCATAGCTT "f ATTTCCAGA ATGCCCGAAG GTTATGCAAC 1380 

AATTGTTGGT GAACGCGGCG TTGGCCTATC TGGTGGCCAA AAACAGCGGA TTTCACTGGC 1440

CCGTGCCTTA ACCAAAGATC CAGCTATTTT AATTTTAGAT GACACAACTT CCGCTGTTGA 1500 

TATGGAAACA GAATCAAAAA TTCAAAAAGA ACT 1533 
(2) INFORMATION FOR SEQ ID NO: 529: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529: 

TTCTAGGCGG CGTACCCTTT TCCGGCTAAA AACGGTCCGG AAACTCTTGG AAACCCCTTG 60 

GTTTTAACTT GGGTCGGAAA TACTTGGGTC nTAGTTTCCT CCAAACCGTC AATCCCCTGG 120 

TTCAAAATTG GATTTAAATA ACACATCCAT ACGGAGTTTT TTCTTCTTTT TGCCCAATAA 180 

GGAATGAACT CTTTTTGGgA ACGCAACGAC CTTCGTTAAG GTTTCTTCTA ACAAGCCGAA 240 

ATCTTGTGCA GCGCGTGCTT TTAGCCAATC TGTATGTGCT TGGTTTAATA CTTTAACATA 300 
TTCTTGCATA CGTTCAGCTG GAACATTTTT ATTCAAAGCA TATTCTTCTT TTACTTTTTC 360

AAAAACCAGT TTCCCTAATT CAGATAATTC TTCAGGGCGT GTCTCAAAAT AGGCCAATGC 420

TTCTTGAATA ACTGGGCCAA TACTTCGTTC AAAATATAAT CCGGTTAAAT ACCCTTCTAC 480

TTCTCCTCGA AAAGGACTAC TTTTTTCTGG CATCCCAGTT GATGAATCCC AGTCTAATAA 540

TGTCAAGGCA TTTTTTAGTA ATTGAATTTC TTTCACTTCT TGTAAAAATA CAGCTTCCTT 600 

CATTTTTTTA CACCAGCTTT CTTAGTCTTC CCGTGGTTTT CTTGGTGGAC GTGTTCCCCA 660 

GTATTGGAAT AAATCTGTTC GTAAGGCTCC GTTGTAAAGC TTcCGTTTTT TAGTAGCTTk 720 

GGCACCATAA TATTCTTCAA AAGCAAGATC ACTCGTCAGT ATATACTTGc TCCAAGTAGT 780 

TAACGGACGG AAAACATGGn CCATTCTTTA TACAACGGGG AACACTTCCT CTTCTCCTAA 840

ACGCTCACCA TAAGCGGGTA GCTACCATAC ACCATATCCT CTCGGTGGAA AACTT 895 
(2) INFORMATION FOR SEQ ID NO: 530: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530: 
TATGAATTTT GnATAAAAAT GGTGCCATGT ACAAGAATGC TTGGACAAAC AGCGATAAAT 60 
ATTCATATTA TAGTACATCA GATGGCTCAT TAGCGGTTGG ACATCAACAA ATTGGTGAAA 120 
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ATAAATATAT TTTCGATAAA AATGGTGCAC TATATAAAAA TATGTGGGTA GATTTAGACG 180 

GAAATCGTTA CTAGAGTAAA GAAGATGGTG CTCTCGCTAT TGGAGAACAA ATCATTGATG 240

GAATCACTTA CATGTTTGAT GAAAGTGGGA AACTAGTTAA TAATTAATTT TTGAGAATTA 300 

GAAAATTTAT TTTAGAATAG AGATGTAAAT CTGTTGCAAA GTTTAAGAAG ATATTCGGAT 360 

TGAGATAGAT AACACAGCrA TrAAAACATA CCAAGATTaT GAATTAATCT CGGTATGTTT 420 

TTATTTTTTA AAACCAAAAC AAACGTTTTT GTGTGGTATA AACCATATCA ATACTTCGTT 480 

TGCCTAAAAT CTCGCCAtCT GCGTGAATGG TTTGTGGAAT CGTCGAAACA AT 532 



(2) INFORMATION FOR SEQ ID NO: 531: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1129 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 



AGTATCCAGT TTAATGCTTT GTCAATCGAA ATACAAGAAA GTGGnACAGC CGTTTGACGA 60

ATGTCCCACT TTTCTGTTTT ACTATTAACA GGTTTGTCTC AATGTTTGTG TAAAATGCTT 120

GGATGGTGTT ATAAAAGATT CCTGCGAACA TAAAGATTAA CCAACTAAAC CGCCAtCCCC 180

TAAAACAAAA CTmATAAATA GATAGAGTgC CACAACTAAG GGCCAATAAA TGGATGCAAT 240

AATTGCACCG ACAGGATTTC CCACAGAACC TTCCTCTGTT TTTGTTTGCT TAGGGGCATA 300

CTCAGGATGA CCAACATGTT CTGAATAAAA ATAAGTTTCA TTCAACAATT TATTAAAACT 360

TCCTTTAATT ACTCCACCAA ATATAAACAA GAAGACACCA ACCGATACTA AAAAGAAGAG 420

CCAACTAATC CCCATCGTTC CATTATAATA TTGTCCGCCC ATCATTTGAT GCATGGAACG 480

GCGCATCATC AGATGAAGGC CACTACTAAA GACAACAACT GGGATTAAAC CCAAAATACA 540

AAAAACTACG CCCGCTGCCA TACAAAAAAT AAATGATCGT TGAAAACTTT GTTCTCTTTG 600

TGCAACTTCT TTTTTTAACT TAATCGGAAT CAATCGTTGC CCCAGCTGTT TGTCCATTGC 660

AACAAACCGA TTACCAGCAA CAATAAATAA TGGAACGCCG ATTGCCACAC CTATGAAAAG 720

ACAAACTAAA CCAAGACCGT TAGAAAGAAC ACCACGCATA CCAACCAGCA AGCCAACACC 780

CACCATGATT GCAACAACAC CAAGACCAAT TTGAATCGCA CCTTTACGTT GGACAGCTAA 840

ATAGTCAAGT CCTTCCTCCA CTGrAATTTC TGGCAAACCG TTCATTGAAT CGTTTGGTTC 900

ATCCACTTGT TTATTTTCTA AATTCAATTC TTCCATTAAT TCATCAATAG TACCAAATTC 960

AGArATAATT GTACCArTTG ckTCATTTtC CGATTTTCCT TGGCTGAGCA AGTCTTCATA 1020

ACGATCCTCA GCATTTGCTA ATrAGTCCGm CTTTAATTGT TTTGTTTCAG ATGTTTCAAC 1080

AACGCCTGAA AATAAGCTAT CAATAAATTC TTCTATAATT TTCATTTTC 1129



(2) INFORMATION FOR SEQ ID NO: 532: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 



AACCCCCGAT AATTACTCTT TACTCGCAAT TAAGCGAAAA ATTGATCGTT CATTTTTACC 60

TGATCATATT TCTTCCGTCT TATTAGATAC AGAGACGTTT GGTAGCAAGT TAGCCTTATT 120

ATTGCTTAAC AAACGCAAAG AAAAAACAGC CTTGATTAAT GAGGCTGAAA AATTGGTGCT 180

AGATCACAAA AATACATTAG GCATGTCTAG TGTTAATCCA CATAGCAAAA TGaTTGTAGT 240

GGGTAGTTTA AACGTTGATA ATTAtCTGTA TTCAACAAAT TTACCACATa ACGGCaAAAC 300

AAATTTTTTG tCTAGGTATG CTAAATTCCC CGGCGGAAAA GGATTAAATC AAGCTGTTGG 360

CTTAACCAAA CTGGGCCATC AGGCCACCTT AATTGGTTGT TTAGGCTCAG ACACAGATGC 420

GAATTACTTA tATAAAGAAT TAGAAAAATA CCACGTTACG ACAGATGGTA TCACACGTAT 480

TCAAGATACT GAGACTGGTC AAGCTTATAT TTATGTTGAG ACAAGCGGCG ACTCCATGAT 540

TTCAATCTTA CCTGGTGCAA ACACAGCGCT TACGCCTAAA AAAATCGCAC AGCAAAAACA 600

CCTATTTATG GATGCCAGTT TTTGTCTCAT TCAAACAGAA ATTCCTTTGT CTGCTGTAGA 660

GAAAGCTTGT GAAATCGCCC AACATTCAGG GGTACCGATT ATTTTAAAAC CAGCTGCTAT 720

TCATCATATT CCAGTGAATA TACTAGAAAA GGTTGATTTT TTCATTCCCA ATGAAGATGA 780

ATTATTGGAA CTTCAACCAG ATACTGGTAC ATTAGAAGAG AAAGCGGCGT ACTTCTTAGA 840

AATGGGCGTA AAAAATGTCA TTGTTACTTT AGGAAAAAAA GGCGTGTTAT TAAAAACACC 900

CCAAGTGTGC CACTATTTCC CTGCAACAGA AAATATAGCT GTGGATAGCA CTGGTGCCAG 960

TGACAGCTTT ATCAGCGCGC TCGCCTCCTA TCTTTCGAAA GgTTATCCGA CTGAAGCAGC 1020

CATTCAAATA GCAATTCAGG CTGCAGGATT TTCAGTTTCT AAAGAAGGGG TGATTGATTC 1080

TTTAGTTGAT CACGTCACTT TGGAAAATTA TTTGATAAAA AAAGAACCCG CTCTTTTTGC 1140

TCATCGTAAC ACCTGTATCG ACTGACACAG GnGTTTTCCT GCTGAATATC TTGTnATTTT 1200

CTCATnAACC AATGAGAATT GTGCTTTTTA TTACAATTTT ATTAGGCGAA CTATTATAAT 1260

AAAAGAGAAG TTATAACTAA GGG 1283


(2) INFORMATION FOR SEQ ID NO: 533: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 
TTTTCGGGAT TCCTAGAAAA GCTTGGACTG TAACTTTAAC TTGGTATTAA CGCGGGACTT 60 



CTTTTTATTC ACTGAAATCC ACGTCGTTTC AATTCTTGGG TTACCCTTCT ATAGCCATAA 120 

TGTTGATGTT TTTTACGAAT CGCTTGAATT TCTTCTTCAA TGATTTGTGT TGAAGCTGTT 180

CTGTCGAGGC GTTTTTGCCA GTACATGTAA GTAGATTTTG GAAACCTTAG GGTTTCCAAA 240 

ATATCTTTTA ATCGAAAGTG TCTTCGGAGA CTGTGAATGA TTCGTGCGAT TCGTTCATTT 300 

TTTGTTCGTC CTCTAAACGC AGTCTCCTCA ATTCTTTTAA ATAGGCATTC TCAATTTTTA 360 

ACATTCTATT TTCTTTTTCT ArTTTyTCTA CTTGAGATmG TTCACTTrtA rTGJcTTkGGG 420

TTCATTTCGT TTCTTyTTAG ACATGGTAGG TGGACACCCT TTCTATTTAG AGAGTCCCTT 480

ACGCTAGACT TCCGATACGT TCTAAGCCAA TTCGCAATCA ATGATGGATT GTTCGTTTTT 540 

AAAAGATTAG CAACTTCTTA ATAGGACAAC TTTATCGCAT CTAGCTTAAA TTGAACAGAA 600 

TAATTCTTAC TTTTTTGACT ACGAAGTAGT CCTTTTTCAC CAAGTCTTTA TAGACATTAA 660 

TCCATTTGTT TATTTGAGAA CAAATTTAAG TTTAAATTCA AAATTTTATT TTGCCATACA 720 

AAAAAACTCA AAAGTTAGAT TTCTAGGTCT AACTTTGGGG GAGCACTACA GAACTGTAAT 780 

TCTCTTAAGC GCATTTAATA TAGATTCAAT CTTTTGTAAA AATCCCTATA TTTTGTTTCT 840 

TGAAAAATTT AATCATTTTA ATGAACTAGG GATGATCGAT TTCGAGAACA TAAAAAACAT 900 

ATTAATACAG GATTGCTTTA TCCTTGATAC GTTTTAAGAG TACTCCGGTT TGTTGTATCT 960 

TTTATTACTC CAAACGATGA TCCCGCAAAC GATAGCCCAC TCCAATCTCA GTTATCAAAT 1020 

ATTCTGGTTT CAATGTATTA CTTTCGATCT TTTTACGAAT ATTAGACATA TTCACCCGCA 1080 

ATGTCTGCGA GTCATTGCTG TAGGGGCCCC ATACTTTTGT CATCAAGGTT TGATACGCAA 1140

CTACCTTTCC CAACTGACGA AACATCAGCG CTAAAATTCG ATATTCGTTT TTTGTAAGAT 1200 

GGATTTCTTG TCCCTTTTTA TAAACTAACT GTTTTTCGAA ATCAATTTTC AAGTCTTTGT 1260 

TTTCCACGAC AGTTAGTGAT TCCGTTTGAA CGTTTTGATG GCGCAAAGCT GTACGAATCC 1320 

TCGCAAGTAA TTCATTCGTT CCAAATGGTT TAGTTACATA ATCATCCGCA CCTAAATCCA 1380 

ACGCTTTAAC TTTCTCCTCT TCATTATTTC TAGCAGAAAT AATGATGATG GGCAAATGCA 1440

TTCGCTTGCG AAGGATTTTT AACAAATCAA TCCCATCAAT GTCTGGTAAT CCTAAGTCTA 1500 

ACAAAACTAA ATCAAACGAT GTTGTTTGAA AAGCAGTCAA TGCCTCCATT CCAGTACTAG 1560 

CAATCGTTAA CTGATAGCGT TCCTGTCCCA AAACAACTTC CATAAAATCC GTAATATTTG 1620 

GATCATCTTC AATCAATAAA ATGGATTGCT TCATAATTCT TCCCCTCACT TCAAATAAAT 1680 

TCGAACCAAT GTTTTGCCCC TACCTATCGT CATTTCCATT TTACCATTGT GCGCATGGAC 1740

GATTGTTTTT ACAATACTCA ATCCGATTCC TAAACCATTT TTTGAGTCAA CCGGTACTTC 1800 

ATTTGTGCCG GACAAGTTTG ATTGGATCTT TtGAAATTGT TTTAATGGAA TTTCGCCGTG 1860 

ATTTTCAATT TCAAAAACTG TTTGTTCTTT TTCCTGATAG ACGTTTAGTT TGACTGGTAG 1920

ATCATTTTCT CCATGACGAA ACGCATTTTC TATCAGATTA AACAACGCTT GTTCAATCAA 1980

AATGGGGTCC GCTTGAATAA AAATGACCTC TTCTGGCAGG TGAATATCCA CTTGGCCGTC 2040
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TGGATAGACC 1 


TTCCTCAAAT GCTTATAAAC AGCCTCGATG ACCTCTTCTA CTGGTTCAGC 


2100 


AGTTTTATTA ACCTTCATAG TATCCATGTT GATTCGCGTG ATGGATAAAA GATTTTCAAC 2160

CATCCGAATC AGCCATTGTG ACTCTTCTTG AATATCCTTT AATAATTTTC GCTGTGTCTC 2220

TTCTTTTAAA TCATTTCCAA TGCCTAATGT TTCAGCAATC CCTGAGATAA CAGTTAATGG 2280

TGTTCTCAGA TCATGAGATA CGGCTCGTAA TAAATTGCTG CGGACTTTTT CCCGTTCATT 2340

TTCTAACTCA ACTTGCTCTT TCTCATCTTT CAATTCTGTT TGTTCCAAGA TGACCGCAAT 2400

TTGAGTCAGC ACCAATTTTA AGTAATTGAG CTGATCATTT TCCAAATCAA GATCTGCATT 2460

TCGTTCAATG CCTAATACCG CCAACGTCTT CCTGCCTGCA GCAATCGGCA AATAGAATCC 2520

TTTAGCTCCA TTTAGTGTGT CTGTACCATT CCCAGCTTCT TTTTGGTTCT TTGCTGCCCA 2580

AAAGGCTACA GCTGCTTCGT CTTCATTATT TAATATTGAT TTTTTTTCGT TGATACAATG 2640

AACACTCTCT GTCTTCACTT GACGATCAAA AATAATCACT TCTCGTTCCA ATAATCTTGA 2700

TAAATAAGTG GCAGAAATAT CAAGAATTTG TTTTCGACTT TCCACTAAAA CATATCGTTT 2760

GTTCAATTCG TACAAAATTT CCATTTGGTG CTCTTTTTTC ATAGAAGTAT CTGCTTGTTT 2820

CTTTAATCGA ATCATTAAAT TACTGCTCAT CAGGGCAACA ACTAGCATGA GCAATAAAGT 2880

AAAAGGATAT CCCTGTTTAT ATACTGTCAG AGAATATAAT GGTTCGACAA AAAACCAGTT 2940

AAAAGACAAA ACACTTAAAA TAGAGGATAA TGAGCTCCAA AAATAACCGG ATGTCGTTCG 3000

AGCGACTAAT AATACGAATG AAATATAAAT AAGCATCAGA TTTTGATCGC CAACGTGGAT 3060

ATATTGCATC AATTCCGTCA CGACCGTCGC TAGAAAAACA CCACCAATTG CAATCACTAA 3120

ATCTTTTCCG CCACCCTCAA TGACTTTACG AGTCTTAAAA AATAAAGACC GCTTTTCTTC 3180

ATTGAAAGGA ATTAAATGAA GTTCCGTATC TTTCAATCGT TTTAACAAGC GATCATCAAA 3240

CGCTTCAATA AAGATTTTTT CATACCACGG CTGCCTTAGA TTTTTCCCCA TAATAATATC 3300

AGTAACACCC GTCATTTTTG CGAACTCGAC AATCGTCTCA AAACTATCAT CCTCTTCAAT 3360

ACTAATAACC TCCGCACCTA ATTTATCGGC TAAAGGAATG TTCGTTGGTG TATTTTCTTG 3420

CGTACGAACT TGAATCACTG TCCAATCGAC AACTAGACCT TGCGCTAAAC GTGCTGTCCA 3480

GCGAATACAT TTTTCGGTCA TTTTTGGAAA GGCATCGTTC ACCACCGTCA ATAATTTACT 3540

TTGTATCCCA ATTGTCTTTC CACTAATTCG ATTAATATGA TCCGACGCTC GCTGAATTGC 3600


m T\ TV m/^p 7\ T 1 

TAATCCACb 1 


aaTTrrTrra ATTTTTftTftf; AATAAAAAAA TTTTTCAAGG CACGTTTCGC 


3660 


GTTTTCATTC GCATATATTT TTCCTTGTTC AAGTCGTTCG ATCAGCTCAT CGGGCTCCAC 3720

ATCAATCACT CTTATTGTCG CTTGCCGTAG AAAAGTGTCC GGCACCGTTT CCTTGACCTC 3780

AATTCCGGTA ACTTCTTCAA CGATATCATT CAAACTTTCG ATATGTTGAA CATTGACCGT 3840

AGTAAAAACA TCGATTCCTG CATTCAACAG TTCATCAATA TCCTGGTATC GTTTACGATT 3900

TCGAGAACCT TCCGCATTGG AATGAGCCAG TTCATCAATC AACACGATTT CGGGCTTCTG 3960

TTGAATGATT TGATCAATAT CCGGCTCTGT TAAAACCATT TGTTTATAAA AAATATTCTT 4020
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AGGAGGAATT 


TGCGGTAATC 


CTTCTAATAA 


TCGATTGGTA 


TCTGGACGAT 


GATGAGGTTC 


4080 


GATATAGCCG ACAACAACAT GTTTTCCCAT CATCAATAAT TCATGAGCCT CTGTCAACAT 4140


APPGTAGnTT 


TTCCCAACAC 


CAGCCGCAAA 


TCCAAAAAAT 


ATCCGCAATC 


GACCTCGGCG 


4200 




GATTGCATTT 


CTTCTGTTGT 


CTGCATCAAC 


AGTTCTCCTC 


TCTTTGTATG 


4260 


PfiPAAPAATT 


CCCTAACCAT 


TTTTGGTTAG 


AGAATTGTTC 


TATATTATAT 


TCCTAGTTTA 


4320 


1 b 1 nri 1 I r\ 


nriT'TTAACTG 

\J\J 1 A X r\JTW* X VJ 


CAAAACATTG 


ACATAAAATC 


GGTCTGAAAA 


CCAATCTTTT 


4380 




TATPGATGAG 


TTTACGAATT 

JL X X il\m* VlUi X X 


GATTGTTGAG 


ACAAGCCACG 


TTCTTTAGAA 


4440 


Ai 1 buuVjO/i/i 


PTTGAAAATG 

a a vjruwi a u 


AGCTGCTGCC 


AAACTAATAT 


CCGGATCCAC 


ACCACTAGCA 


4500 




PGAGATCATT 


CGGAACTTTT 


TTCTCTGCTG 


GATTTTTTTC 


CAATTCTGCA 


4560 


Abbb 1 uk/Ou l 


TPTPAAPTAA 


TTGPTTTTGP 

A A W w X X X A 


TCCGCCGAAA 


CAGGTGATAG 


CTGACTAACT 


4620 


1 bb 1 l^ri**, i rv^ 


f^TPPTGAAAA 


ATATTTTGPP 

x <** x x x x yj \* \y 


TGTTCAAAAG 


TTTGCCCAAT 


CAACTTTGAA 


4680 


bbAAbAHb 1 I 


PATTATTPAP 


TP,AAAPPTTA 


CTGCCGTTAG 


CCTGATCTGA 


AAAGAACAAT 


4740 


1 bAbbHrt. 1 


PAPTAAPTPtP 


AAPGGTATAT 


AGCCCACCAA 


AAACGATCAG 


GCTAAAAATC 


4800 




ELZiPTPPPTAP 


A AT APTTTTT 
rtrt x i x x x x 


TTPATTGTTT 

x x \_>n x x u x x x 


CTACTCCTTT 


ATAAAATGAT 


4860 


CAGCGAGAGA 


ATCAGATCTA 


TAATPTTGAT 
1 nn I a x ort a 


AAAAATAAAT 


GGTGCAATAA 


TTCCCCCAAG 


4920 


TCCATAAATC 


AGCAGATTAT 


GAPTPAAAAT 


TTGACTGGCC 


GGTTTTTCTT 


GATACCTCAC 


4980 


ACCTTTTAAT 


GCTAGTGGAA 


TPAAGGCCAC 

i LnnuuL \^ fill 


GATAACGACT 


G CAT TAT AAA 


TGACAGCGGA 


5040 


TAGAATCGCA GTCAGTGGAC TACCTAATCC CATAATATTC AATCGATCCA GTTGCGGATA 5100

AATACTGTAA AATAGTACTG GAATCACCGC AAAATACTTA GCGATATCAT TTGCGATACT 5160

AAACGTAGTT AACGCGCCGC GAGTCATCAA CAATTGTTTC CCAATCTGTA CGACTTGCAG 5220

CAATTTTGTT GGACTGGAAT CCAAATCAAT CATGTTCCCC GCTTCCTTCG CGGCTTGTGT 5280

TCCTGTATTC ATCGCCATCG CAACGTCGGC TTGCGCTAAT GCTGGTGCAT CGTTTGTCCC 5340

ATCACCAGTC ATTGCAACTA AGTGACCCTT TTCTTGATAC TCGCGAATCA GATTCATTTT 5400

ATTTTCTGGG GTAGCTTCTG CCAAGAAATC ATCTACTCCA GCTTCCGCTG CTATCGCTGC 5460

CGCTGTTAAG GGGTTATCCC CAGTGATCAT GATGGTCTTG ATACCCATCT TACGCATATC 5520

AGCAAATTTT TCCTTTACAC CATTCTTAAC GATGTCTTTC AAATAGACGA CACCCATCAC 5580

ACGATTATTT TTTATAACTA CTAGAGGTGT TCCACCTGCT CGAGCGATTT TATCAACAAT 5640

CTTGTCACAT TCAGATGGAT AGTCCTCTCC TTTAGATTGT ACATATTTTT TCATGGTGTC 5700


TGCGGCACCT 


TTGCGAATCA 


CGTCACCGCG 


ATAATCAATT 


CCGCTCATGC 


b lb 1 I 1 i bub 


fin 


ACTGAAATCG ATAAATTTAA CCTCTGATTG CTGGAATTCT CTTTCCCGCA AGTTGAAACG 5820

TTCTTTTGCT AAAATCACGA TACTTCGACC TTCTGCTGTC TCATCAGCTA AAGAAGATAA 5880

TTGCGCCGCA TCCGCTAACT GTTCTTCACT AACACCATGG ACTGGCAAAA AATCGCTTGC 5940

TCGTCGATTT CCCAATGTTA TGGTTCCTGT TTTGTCGAGT AAAAGAACAT CCACGTCACC 6000
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AGCTGCTTCG ATCGCACGAC CACTCATGGC AATGACGTTT TCCTTCGTCA AGCGGCTCAT 6060 

ACCTGCAATC CCGATCGAAG AGATTAAGGC CCCAATCGTC GTTGGCGCCA GACAGATTAG 6120 

TAAAGCAATC ACAATAACCA GTGATAAAGC TTCTCCTTTA CCAGATAACT GACTGCTGAA 6180 

ATCAGTAAAT GGGACTAAAG TAATTGATAC AGTTAAGAAA ATAATGGTCA AAGTAATCAA 6240 

GAAAATCTGC AAACCAATCT CATTCGGAGT CTTTTTTCGT TGAGTTCCCT CTACCATGGC 6300 

AATCATTTTA TCTAAGAAGG ATTGACCGTT TTCCGAGGTA ACACGAATCA CTAAATAATC 6360 

GGAAACGACG GTTGTTCCGC CAGTAACCGC ACTGCGATCC CCACCAGATT CACGAATTAC 6420 

TGGCGCTGAT TCGCCAGTAA TGGCACTTTC ATCTACTGAT GCAGCCCCCT 6470 
(2) INFORMATION FOR SEQ ID NO: 534:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 



CAATTAATTT GGAGATTTCA GCCATAGTAA TGCCTTTTTG AAATAATTGG ATAATGGTCG 60

TTTCTAATGA ATCATTGGTT CGCTTATATG CTGGTAGTGT TTGTTGAGAA AGTTCGCCAT 120

TCCGATCTCT AGGAATAGTT AAATTCAACT CGCCATATTC TGTTTTAAAT GTACGTGAAT 180

AATTCCCGTT ACGGGAATTT CCTGAATTGA ATCCATtTCA ATCATATTTT TCGTAATCTA 240

AAAAAGCAGT TaGCTCTGCT TGTAGAAGTG AATTAATCGC TCGCTCTAAA TGATCCTGAA 300

ATAATTCATC TAAATCCCCT TTATTGAGTA GTGTTTGCAT AATTTCTATA GTAAATTTAG 360

TCATGAGAAA GTCCTCCTAT AAAATTTCTG TGTGGTAACT TTAATTTTAC AGAATGGACT 420

TTTTTTTCTA CCCTAAATTT CTATTTACAC AAAATATTTT ACGCTATCTT TTATTTTAAA 480

ATATGGGATA TTATGATAAA CTAACTATGG ATGATATTTG AAAAAACCAT ACTGTATGTT 540

TGATTTAATT TTTATTTTTA AGCTCATATC GTAAGAATGG GACTTGTAGT CTCACTTCTT 600

TATGGAGGTG ATAATTATGA ACGTTGGTTT TATTGGAGCC GGCAAGGTAG GTTGTTCATT 660

CGGGAAGTAT TTCCAGGAAC ATAAAATACA AGTAACAGGA TTTTATAGTA AAAGCGAAGA 720

CTCGTCATTA GCAGCTTCGA ATTTTACATC TAGTAAACAA TATTTAAACT TAAGAGAATT 780

AGTAGATGAA AATGATACTA TTTTCATTAC GACTCCTGAT GGTCAAATTC AGGAGGTATG 840

GCAAGAAATT AAAAATTATC AAATCAAAAA CAAATTAATC TGCCATTGCA GTGGTGCAAT 900

ATCTTCAGAT ATCTTTTCAA ATATACAGGA CTATGGTGCT TATGGCTATT CTGTCCATCC 960

TATGTTTTCA ATATCAGATA AATATAATTC ATACAAAAAA TTAAAAAAAT CTTTTATAAC 1020

AATTGAAGGC GATGAAAAAT ATGCAAAGTT TTTATGTGAT TTCTTCAAAA AGTTAGGTAA 1080

TAGTACTGTA ATTGTATCTA AGGAGAATAA ATCTCTTTAT CATGCCGCAT CAGTAGTATC 1140

TAGTAACCTT GTGCTAGGAC TTATTAATAC AAGCGTTACA TATCTAATTC AATGTGGATT 1200
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TACTGAAAAA ATGGCAATTG AAGCTTTATA TCCATTAATT GAATTTAATA TTAGAAATAT 1260 

AAAAGAAAAA GGAATTATAG AAAGTTTGAC AGGACCAGTT GAACGTTGTG ATATTTCAAC 1320 

AATTAAAGGT CACTGTGAAG TTTTATCAGA TGAAGACAGA GCTTTATATA TGCTATTATC 1380 

AAAAAATGTA TTAGAGATAG CTATGTkrAA AAATATTAAT AGAGATTATA GTGAATTAGA 1440

AAAGTATTTA GGAGAGAGTT AGGAGAGAGA TAGATGAAAA ATACAGCAGT TACATTTAAA 1500 

GAATCTAAAT TAAGAAATGA AAAATTAACA ATGCTTACAG CTTATGACTA TTCAACAGCA 1560 

AAAATAATTG ATGAAGCTGG GaTAAATGGA ATATTAGTAG GaGATTCTCT TGGTATGGTA 1620 

TGTCCAGGAC ATGAAGGATA nCACTTTCAG TAACCATGGG AAGCCTGATT CCACCATACT 1680 

AGAGC 1685 
(2) INFORMATION FOR SEQ ID NO: 535: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535: 

GGGAAAAtAA akGyCCCCTT AtcCCTTTkG GTAtTCCsCc AgtGATAAAC yCCtGTTAAG 60 

aTAGTGGCtA TATAAaAAAG GcmGrTaCTT AAGaAGaGTA TATTCaTGaT TyCCCyCCTA 120 

GTTTAAAtGa AAGTCaTGAA TGaTTGTACC aAGATTTTTT AGAGAAGTTA AGATTGAATA 180 

GAAAAGTAAA TACGATAAAA TTTTTATAGA CTGTTACGTT GAAATGTTCT TCAGAGACTA 240 

TTTCTTCCAA TACCGCCTAC TAGATAGGTG GTTTTAATCA TTATTTTCGG AGGTTGTGTT 300 

TTGGCCAAAA AGTAAAGATT TAATGGATGT AGGTTCATTT GTGAGCCCAT CTAGATAGAA 360 

CGTTTCAGGA CTTTAGTGAG GCATGAAACG ATTCGATGCA AGCATTATCT GAAGGtGTCC 420

CTTTTCGGGA CCTACTCATG GTAATGCCTT TTTTTATTTG GTTCTGGTAG TCAAAAGAAG 480

TATAAACAGA GCCCTGATCA CTATGCAATA GACAATCCGT TGTTTGAGGC AATTGATCAA 540

ATGTATCCAG AACAAAATCC GTATCTTGCT TATCACCGAT GGTATAGGCG ATAAATTCAC 600 

CATTGTATAA ATCCATAATT GAAGACAAAT ACAATTGCTT TTTACCAAAA GGCAGATAAG 660 

TAATATCTGT TACCAATTTC TTTAAAGGAG CGTCTGAAGA AAATTACTTA TTTAGCTTGT 720 

TCTCGACAAC CGTGTAAGGT TGTCCCACTT TACTGCGTTT CTTCATTTTC ACGCGACAAT 780 

TCTAATGATA TTTCCGCATA ATCTTTTGTA CGGTGTTTTT ATTTACTTTG TATTCCTGAT 840 

TAATCAACGC AtAATCTTTC GATAGCCATA TCGAAACTTA TGCTGGAAGC AAAGTCGACG 900 

AACAGCTTCt TCAAGAGGCG TTAACTTTCC CAAGTCTTTT CGATGAGTCC AACGATAATA 960

AGTAGCTCGC GAAATGCCGA AAAGTTGACA CAATGCAGTA ATGGGTAATT TTCCTTTCAT 1020 

TCTTTTAACT AATTCTACAA ATACTTCTGG CTCCACGTTC TTTCCAATTC TTTGAACTTT 1080 
TTAAGCACCT CAATGTGCAT TTTTAAATAG TTATTTTCAT TTTTAAGCTG CTGTTCATTT 1140 

GATAGATTTT CTGGACCTTT TCCATAAGCA TATTTTTTTC AACGGGTTGA ATCAGACGAT 1200 

GATGATTCTC GCCATTACGA TACCCCAACG CCACCATGTG TCAATTTGTG TTTTGTTTTT 1260 

AATTCCTAAT TCTTCCATAA TCGTTTTTAC AGGAATATTC TCTTTTTTCA TCTGTATAGC 1320 

CATTTCTTTG ATGTAAATAG GATAAGATAC TCAATTTCTC ATTGAAAAAG CACCTCCTAG 1380 

TTAAATTCAT TTTAACGTGA ATTCAACAAG AGGTGCTTTT yTTGTCTCAG TTTATTGGGT 1440 

CAGTTCCCTA TTTGTTTGGT CACTTTTATT TTAGCAAAGG TGTTCTTATT TTTATACAAA 1500 

AAATAATCCA GATGTCAATy AATCTTATTA ATGGTATCGG AATTTCAAGC nCTGTTGnAC 1560 

AAAnTT 1566 
(2) INFORMATION FOR SEQ ID NO: 536: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536: 



CAATAAAGGT TTCGCTTAAC GTCCCAAATA AAACTGATAA AAtmAGAATC ATCcAGAAGA 60 

TTAAATAAGC CAGAAAAATA AACGGTTAGA TAATACTTTG TCCATTGTGT TTTTCCTGAT 120 

TGATCTTCCT GATCCATATC CATAATATTT TTTAGAATCC AATCGATTAG CATATCGCCC 180 

TCCTcTTCAA GTATTGCACT AAATTATAAC GAATATAGCG ACTAATAGAT AGATAATTCC 240 

TTAATTAGTC ATTATCCCTC CCTAAGTAAG AAATAGTGCC TTGACCTGAG TATAATATCT 300 

TCTTGAGACT GGTACTTCCG TTCCGTCTCT CATAGTTAAA AGATGCGTTT TATAATTTAA 360 

TTTTGACGCA TATTTTTTAT TGATGATGAT TGATTGATGG ACTCTAATCA AATCTGGATG 420 

CAGTTGTTCA ATTTTTGATA ACGTTCCATA AAATTCTCTT TTAAAATTTT TACCAACCAT 480 

TGACACACGA TGATCGTATT CCGTTTGAAA GAAATAAATA TCATTAATAT CCATTTTTAT 540 

AGAAGACGCT TTTAAGTTAA TGATAATATA TTCACTTTTT GTCTTTATAT TTTCTTTTTG 600 

TTGAATATAT GCATCGACAC ATGTTTCGAC CTCTTTTTGG AAATCTAGAA AATCCACGTT 660 

CTTTTGAATA AATTGAAGAG CTGAAACCAT ATATTTGTAG GAAGTTAAAA CTAATTCTGT 720 

GTGAGTAGAA ATAAATACAA TCAGACTTTC ACTGTCTGTT TTTCTGATTT CTTTAGCAAC 780 

CTCAAATCCT GCCTGGATAT GATTGTTCAT TTGAATATCT AGGAAAAAGA TATTTAGTTG 840 

GGCGTCCTGG TGTGCCACTT GTAATAATTC ATCTGAACGT GAAAATAAAT GGATTGTTTC 900 

ATTATAAATA TTTCTTGAAT GTAGGATATT TGTTAACAAT ACTTCTAACG CTTTTGCTTG 960 

AATAATTTGG TCTTCTAATA TATAAATAGC CATTTGTTCA CTCATCCCTT TCTCAAGAAG 1020 

ATATAAAATT ACACAACAAA AGACAAATGA TTATAATGTT CAATCAATTT ATACACCAAT 1080 

AAAAGAGATA AAAAATTGAT TGTCTGTTAG TATATTCACA TAAATTTAGT ATAGCGATTT 1140 
CAGTAGTTTC CTGCAAACAA AAACATAAAA TAACAAAGGA TCAAGAAAGA GTACTTTGGA 1200 

CTATTCATCT TTTTCTAACG AATCATGAGG ACTATTGTCT CTTCGTTGAT GAAAAACAAA 1260 

GAATCAACCG CCAAACTGTT CTTGTCATTT TATACATGAC AAAATCaATT CATTGGTCGA 1320 

TTCTGGTTAA TTTAATTTGT TAGTcACACA TGGGATCAtt GTACTAAtTt TtTACTTA 1378 
(2) INFORMATION FOR SEQ ID NO: 537: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 



TAAAATATCT AAACCACGAC GACCTCTCAC CCGTTTTAAA CTGTCTCCAG AATCAGCAAT 60 

GTATTGTAAT TCATATAAAA CAAAGTTAGG GATGAGCAAA ACACCTTCTA AAAAGCCTGT 120 

TTTAGCGATA TCGTAGATTC TGCCATCAAT GATGACACTG GTGTCTAAAA TTTTGTATTT 180 

ATGGAAGTTG TCTTCCACTT TACGTTCCAA CATTTGTCCT TCGGTGTCTT CCGTATTTGC 240 

TTTCTTTTGG CGAGGTGTAA AAACTTTTTT CCACTCCTCA ATCCGTGTTG TTCCCATTCG 300 

GAAACCTAAG TAACCCAGAA TAATCATAAC TAGAATTGGC AGTAAACTAT TCACGACAGG 360 

AATTTGTAAG TTGTACATTG GAATGGAAAT AATGACACCA ATTGTTAAAC CAATGATGGC 420 

ACCAATACTT CCAAATAATA AATAGGTTAA ACTCATTTCA TTTAAAGCTG CTTCGATTTT 480 

TTTGACAGCG GAAACAATAT ACTTTGCTAA CCCTAAAGAT AAAATAAAGA AAATAATTGC 540 

ACCGATTAAT CCGTTCGTAA AGTTATTGTT TAACCAACTG TTATTGCCTT GCTTAGCCAT 600 

TTCCCAAGCC ATTGGGAAAA GAGAAATTCC TAAACTTGCA CCAGCAATGA TCATCAGCAA 660 

CGTGATGATT CGTTTTTGCA TATAGCCATC CTCCTCGTTT ATTTAATAAA GACGTCTAGT 720 

GTTGTGGGGC GTACAAAAAT CATACGCCAC CAAAAACCTA AACGGATTAT TTCTACTTTC 780 

ATTATTTAAA AACTTTTCTT AACGTTTCAC CAATTGTTGA AACGCCAATA ATTTCAATCC 840 

CTTCAGGCGC TTCCCAGCCA CCAAGGTTAT TTTTAGGGAC ATAGACTTTT GTAAAGCCTA 900 

GTTTTTGCGC TTCACGCACT CGTTGTTCAA TACTATTCAC GCGGCGGATT TCACCAGTTA 960 

AACCAATCTC ACCAATGAAA CATTCCGAAG ATGAGGTTCC TTTTTCTTTA TAGCTAGAAG 1020 

CAATACTGAT TGcTAACGCT AAATCAATAG CTGGTTCATT AATTTTGACT CCACCAGCCG 1080 

CTTTTAGATA CGCATCCTGG TTTTGTAACA AAAGGCCTGC CCGTTTTTCT AAAACCGCCA 1140 

TGATTAAAGA AACGCGATTA AAGTCCAACC CGGTGGTCGT TCGCTTGGCA TTGCCAAACA 1200 

TCGTTGGGGT AACTAACGCT TGGATTTCCA CCAAAATAGG GCGTGAACCT TCCATTGCTA 1260 

CCACAATTGA AGATCCGGTG GCTCCTTCTA AACGTTCTTC TAAAAAGACC TGAGAAGGAT 1320 

TCATAACTTC AACCAATCCA TGTGTCTGCA TTTCAAAAAT ACCAATCTCA TTCGTTGAGC 1380 
PaAaPPPATT 
bAAAbbbr-J, 1 


tttpapappp 


PGTAAGATTC 


TAAAGGTGTG 


ATGCTTATPT 


CCTTCAAAAT 


1440 


a. a nm arprT 
AAAbAAbbb 1 


blLl /IbbH. 1 ri 


l o x x u^nn\vn 




TGCGATAGAC 


PCTTCCTTCG 


1500 


1 1 ACG 1 G 1 


rafzi ar a a a a 


n x vj o \^t\r\ i 




TGCCAGTTTC 


AATAATTCGG 


1560 


/"•A /""■ rp rp /** rp rri rp/ - * 

CAGTTG1 1 1C 


1 LvjuAL i 1 un 


ptpapaptap 


PAf^PAAPP.PT 




PP*PTGrGTCA 


1620 


TTGTTTGAAT 


O A f~* I 1 A A HP A 

CG AG 1 C AA 1 A 


A TP AP AT APT 


PTP.P.TTPTAA 


TTTTTP A ATP 
11111 1 o 


PPTPPTPATA 
ov^ i 1 un l rt 


1680 


TTTCGTGCAT 


7\ rp /"^ rp ffi rp rp /""* Hp 

ATCTGx 1 1G1 


rr p t A T A A P T 
bbb ini /\rtb 1 


AAAATP.TPTP 


PTTPAPPPTP 




1740 


CCGCTCGTAA 


TTTAATrTGG 


1 L/lbUAL 111 


PTTP APP AP A 
b 1 1 bAbbAbA 


AAPATATAAA 


APPPTAPPAP 


1800 


CTGTGGCTGC 


TAATTGTTGG 


G A 1 AG 1 1 G 1 A 


7i P li PP B. Zi TPT 
Ab Abb AA 1 b 1 


TP APTT APPT 
1 bAb 1 1 Abb 1 


ATTPPPPPAT 
rVl l^^ooL^rVl 


X O \J \J 


CGCCACCGAT 


TAAAACi AAG 


GAAGG 1 GGAA 


PZi Zi PPPPAPP 
bAAbbbbAbb 


APPT AAA APP 


PPATTPAPPT 


1920 


CCACTAATTC 


t\ f m m rn m rp T\ /"^ 7\ 

AGTTTTTACA 


GGAGGGTG1 I 


ill lAbblAl 


PfiPTTPZiPZiT 
bAb 1 1 b AbA 1 


APPPPTTPTP 


1980 


GTTGTGTCTT 


TTTACCAGTC 


AAAG 1 1 AG 1 G 


1 AbGAGGAGG 


ATPIiPAIiPTA 
A 1 b Ab AAb 1 A 


TPT TP A AT A A 
1 b 1 1 o/xrt 1 rtrt 


2040 


CCTCTTCCAC 


OA HP tv 7\mfpp 

CATCGAATTG 


GA1 1 Gl GGGG 


AA1 1AGGAGA 


bbbbbbbAbA 


T ATTTTPPAP 
1 rt 1 1 1 lu ort o 


2100 


ACACATACCC 


AOAO/-"T ,r P rn OA 

ACAbCTT 1 GA 


GA1 1 GAAA1 1 


paapTTTTPP 

bAAb 1111 bb 


TTTTTTTPPP 
1111111 bbb 


ATPTAPAATP 


2160 


TCCTTATCTG 


TTGTTTATTT 


rp TA 7A TA rp rp rp 7A rp 

i AAG A 1 1 1 A 1 


rp A 7\ A rn rp rp rp 

1 AAA 1 bb 111 


PfifiTZi&PPTT 
bAAlAAbbl 1 


PPPTAPTPA A 




AAAATGGGCT 


TTATTTTTGT 


C C AG AC G AAG 


oa a a a ccrcr* 
GAAAAGGGGG 


PPTTPTT r PP ,r P 

Gb llbllibl 


PPPTPPAPTT 
bbb 1 bbAb 1 1 


£. £. O VJ 


GGTCATCATC 


/*•/■•/*■» m a t\ m tv t\ a 

CGCTAATAAA 


AACGGCI 1 AA 


A A A TP^WPP 

AAAiCCC 1 1G 


rrr* a a ttppt 
GGGAA1 1 bb 1 


TPaPPTTTTT 
1 bAbb 11111 




CAATGACAAC 


ATCTTCATAT 


/-i/*"* a a A /"• m m rn A 

C C AAAG T T T A 


AAA ATTTPAn 

AAAATi GGAA 


GAi GAl Al G 1 


PPTTPPTT&T 
bb 1 I bb 1 1 Al 




TCTCGTTATT 


i\ rti TV 7\ rri 7v 7a rp /"« 7\ 

ATAATAATCA 


GTG1 GAA1GA 


r* r T t <T~T' npnpp 
G 1 GGGAbAbb 


BTTPPPTAAA 
Al IbbblAH-ri 


APT APP A ATP 


2460 


GTTTTAGCGG 


/Tn rp /"> A 7\ r""* A A 

GTTCGAAGAA 


GGAG 1 1 AGGA 


Al 1 G 1 AAA1 A 


TTPAPAATPA 
I 1 bAb AA 1 brt 


PPPATATAPP 


2520 


CCTTAATTCC 


rp /~ m m rp rp T\ TV. /""" rp 

TGTT I 1 AAG1 


A A A 7\TTrTTT 

AAAA1 ibbl 1 


1 bbbA 1 1 1 1 b 


TPPAPPTTPT 


TPTAAPTTPP 


2580 


AAAGACTTGG 


T AG AAG GAL G 


p rn rp rp f /-» p f"« rp 

bill LbuL 1 b 


PTTP A A A ATP 
b 1 1 b AAAA 1 G 


AT APPPPPPP 


PPAPPTTTTG 


2640 


TTGCTCGTTG 


TbGGACAC 1 G 


& aPPPPTPTT 

AAbbbblbi 1 


PTTP.ATAPTT 
Li 1 bri 1 Hlv 1 1 


APAA ATA ATP 


TPAAAACCAC 


2700 


GTTGTTTCAT 


7\ mm Tv rn rp /-» rn 

ATI GA1 1G1G 


LLL 1 Ab 1 b 1 b 


T A A APTAPTP 
1 AAAb I Ab 1 b 


PTATTTTAPP 


ATAATPTCTG 


2760 


ATACTCTGGT 


m /"* rprp rp rp rp TV /"* rn 

TGTTTT 1 AG 1 


AAG 1 1 1 G 1 G A 


1 1 1 b 1 AAAAb 


PPP APTT A AT 


PTAPAPTAPG 


2820 


CAAAAAAGAG 


GAGAATTTTT 


G 1 1 1 Al 1 1 1 G 


PTTPTTTTflP 
bllbllll Ab 


TPTATPATPT 


TPTPTTAAAA 


2880 


m Ti TV TV /""* 7\ 7\ O 

T AAAC AG AC C 


AAA1 GGGAAA 


AAb 1 Lutnnu 


ATTPPPTPAA 


A A APTTPAPA 


PTTTPATTAA 
Sw> x x x x x nn 


2940 


ACTTTCTTTA 


/*> m TV rp rp TV rp rp "t\ 

CTAT 1 GA1 1 A 


rp rp rp J\Ppj\ /""T 1 T 1 

1 1 1 AbCAb 1 1 


TPTZiPTflPPP 
1 b 1 Ab 1 Abbb 


PPTPATTTTP 
1 on 1 1 l 1 o 


TTATAATPAP 


3000 


rp/-* m tv 71 7\ rp rp rp 

TGTCAAAT T T 


TAG 1 G i AAAA 


AbAAAbbAbb 


PAPTTATPAP 


AAAPAAAPTP 


PAAPAGGCCA 


3060 


AACGAATTGT GATCAAAGTC GGAACCAGTT CTCTGATTTA TCCCAATGGC AATATTAACT 3120 

TAAAAGCAAT TGATCAATTA GCCTTTACTT TGAGTGATTT AAGCAACCAA GGCAAAGAAA 3180 

TTATTTTAGT TTCTTCTGGC GCAATTGGTG TCGGACTGAA TAAATTAAAT CTATCTGTAC 3240 

GCCCAACGAC AATTCCCGAA CAACAGGCAG TCGCTGCAGT CGGCCAAGCA GAATTAATGA 3300 

ACATTTATAA TCAGCGGTTT TCAACCTATA GCCAACAAAT GGCACAAGTT TTATTAACTC 3360 
GTGATGTTAT 


TGAATATCCT 


GAAAGTCGCA 


7\ T7\ A TrTT CP 

A J AAlbl 


l^rirl 1 t\K* 1111 


P A APA APTf^T 


J "l ^ V 


TAAAAATGAA 


TATTATTCCT 


ATCGTCAATG 


AAAA I b A 1 AC 


TPTPPP A ATT 
1 b i buCAfil 1 


P A AP A ATTPP 


.-2. ^ 


ATCACTTAAC 


CAAGTTCGGG 


GATAAbbAib 


AA1 1 b lb Ibb 


PATTPTPTGP 


PA A ATTPTTP 




AAGCTGACTT 


GTTAGTTATG 


TTATCAGATA 


TTbAlbbl 1 1 


b 1 1 1 1 bAtj AL. 


A APPPT APTP 
AAbbb i Ab 1 b 


jDUU 


TCAACAAAGA 


AGCCACGTTA 


TTTTCTGAAA 


TCAATGAAA1 


rp tv tv rpp TV jvpjvp ' 

1 AA 1 b AAbAb 


rprnpfprppp7\ Tv rp 

1 1 b 1 IbbAAl 


JDDU 


TAGCCGGTGG 


TAAGGGAAGC 


CGCTTTGGCA 


/-•rn/"" , p»Tippp Tv rp 

Clbbl bbbAl 


b 1 b 1 Ab 1 AAA 


1 lAAAAbblb 


J / £. U 


CTGAACGTGT 


GTTAGCTAAT 


CAACAGGCGA 


TGATTT1 Abb 


AAAb b b AAAb 


tv 7\ a a a a 
b Ab b b AAAAA 


"7 Q fi 


TCATTTTTGA AATTTTAGAA GGAAAAGATA 


TCGGTAbGl 1 


Al 1 1A1 1AAA 


bb Abbbb A 1 b 


Jo H \J 


AAAb 1 b Ab 1 b 


ATTTA A AGPA 


ACTAGGGCAA 


TV TV ^ /"^ m TV 7\ 7A 

CAAGCTAAAG 


tv nrppTrTTA 
AAbbblb 1 1 A 


1 Abbb 1 Abbb 




mrp/^ArpppTvrpA 

TTGA1 bbAl A 


PTPPTPA A A A 
C 1 bo 1 v^rt/vrtM 


A A AT APTTTG 


TTAAACAAAA 


TGGCCGCAbb 


bAl 1 bAAbbb 




AATbbAbbbb 


PPATTTTAPA 


AGPAAATGPA 


TTAGATTTAG 


TV /"•/^•TV TV f~* f~* f~T* f 

AGCAAGCCGC 


rf Ti rinpTipppTv 

TAbbbAbbbA 




AT J: 1 b 1 b/Wi 


P A ATP.P A AH A 


PPGPTTAPGT 


TTAACAGAAG 


tv t\ r** tv rn TV O 

AACGGATCAC 


AbbbAl bbb I 


zl OR 0 


bAuoo^n 1 AC 


GPPA AGTPGP 


TAPTTTACCC 


GATCCAATTG 


GGGAAGTAbA 


rp A AA A Tr'TrT' 

i AAAAI b 1 bb 


fi JL ^ VJ 




PPPG APTGTT 
^uuunO lull 


AATTGGGPAA 


CAACGCGTGC 


CTTTAGGTGT 


rp T\ rp rp tv tv rp 

TATTGbAAl b 


/i 9nn 


All 1 AlbAAl 


PPPGGPPA A A 


PGTGAPAAPA 


GATGCCGCTA 


GCCTTTGTTT 


CAAATCGGbA 


/i 0 

H £. OU 


AA J. bb iulin 


TTPTTPGAGG 


CGGAAAAGAA 


GCGTTTCACT 


<^ tv tv m tv t\ rn 

CCAATCAGAT 


A rprrtr>/-'rpy-'TVpA 

ATTbbT GAbA 




A T T T T A r* A A G 


AAGPGCTAAT 


CCAAGAAGCC 


GTTTCTCCCC 


tv m^>m/~'T\ rr> rp o 7\ 

ATCTCATTCA 


A 1 1 1 b 1 1 b A 1 




P AT APTTPT A 
Kj A 1 AC 1 l^ln 


GAGAAACAGC 


CCAACAACTT 


ATGCGCTTAA 


ATGAClAb 1 1 


bbAl b 1 bb 1 b 


4440 


ril iV/Lyvuiu 


GCGGCGCCAA 


CTTAATCAAA 


ACTGTTTTGA 


LAAbAbbbAb 


rpp rn rp pp rp p rp rp 
1 b 1 1 lull 


4500 


ATTGAAAPGG 


GCACGGGCAA 


TTGTCATATT 


TACGTGGAbA 


AAbAbbbbbA 


PTT A APA ATP 


4560 


GCAACTGAAA 


TTATTGTCAA 


TGCTAAATGT 


CAACGTbbl 1 


prpprppfppmT\7\ 

b lb Ibl b 1 AA 




4620 


ACGTTATTAA 


TTCATCAAGA 


GGTGGCTGAA 


GCCTTTTTAC 


/"> Tv TV TV TV rp rn p tv 

bAAbAAl IbA 


AAAAbbb 1 lb 


4680 


AAGGAGTTTC 


ACGTGGAACT 


TCGGGCAGAT 


GAACGTGCGT 


TAbbAAl 111 


TP A APA APPP 
i b AAbAAuCL- 


4740 


ATTCCAGCGA 


CTGAACAAGA 


TTGGGAAACG 


GAATTTCTCG 


ACTTTATCTT 


TAP/" , 7Ar nr PP7iAA 
AbbAb 1 bAAA 


4800 


GTCGTTGATT 


CTTTGGACGA 


AGCCATCCAG 


/"> TV rnTl (TimTi TV Tl/"' 

CATATTAATb 


rp t\ 71 TV rn tv p 
bb 1 AbAA 1 Ab 


AAAAb A 1 1 V- 1 


4860 


GAAAGTATTA 


TTAGTGATAA 


TTACTTTGCC 


TV 7\ /"> TA 7A /"» T\ 7\ (Tl 

ACACAACAAT 


rp rn rp rp Tv /** TV TV p TV 

TTi lAbAAbA 


tv prppp a rnr*(~*T\ 

Ab 1 bbA 1 bbA 




GCCGCTGTTT 


ATGCGAATGC 


TTCAACTCGT 


TTCACGGACG 


bl Tl 1 bAAl 1 


ibbAl i ibbb 


4980 


GCCGAAATCG 


GCATTAGTAC 


GCAAAAATTA 


CATbbbbb 1 b 


b Abb 1 A I 000 


TPT APP AG AG 


5040 


CTTACTTCGA CAAAATATGT TATTTATGGA AATGGGCAAG CACGTTCTTA GAAAGAAAAA 5100 

CCCTTGATTT AAAAATCAAG GGTTTTTCTT TCtCCTCTCT AATGTGCATT TTTACACATG 5160 

AGATTGTTCA CAGTTCCACA ATCTATTCTT GATATTTTTT TTGAATCCTC ATTATACTTT 5220 

CTACATATGC TTGTTTCACT TGGACTGGAT CTAAAATTGT CACTTCTTCA CCAAAGCCCA 5280 

TAAAATAGTT GGCTAAAAAT TGTGTTTCTG TTGGATGAAT CTGTCCAACT AaAAAACACT 5340 
GCGTGGbbbl 


Tl b I i bbAAb 


I b AA 1 A 1 1 b 1 


PA A Af^P^ATft 


T T T AT A A A A A 


aattpttthc 


5400 


CTTTTTCTGT 


AATGATTGCT 


TTAAATGGAA 


TP/IT APflTnP 


PPiTPP»TTTP*T 


TnAT^nrnAT 

1 \jn 1 oo^urt 1 


5460 


ACGATTCTTT 


CAATTCTTCA 


GTCGTAAAAT 


PPaCTP-TTTP, 
bb Ab 1 b 1 1 lb 


PTTTA ATTPP 


T(^A ATAP,PPT 


5520 


CCATAAAATC 


ACAGCGGTAA 


GTACGCCAAG 


Lb 1111 1 Ab 1 


PAPATPHTZ.P 
\^ Ab A 1 b A 1 Ab 


pptp a BPa bt 

bb 1 bAAbAAl 


J JOU 


ACCAATACCC 


TTCTAAAATA 


GTTAACCGGG 


mfnppm TV TV p> TV rp 

i 1 bb 1 AAbA I 


rprp rprfirprp^/™' rp rp 
111111 V_tj 1 1 


b 1 1 bb 1 1 bAi 


0 OH \J 


ATCTGGTATA GTTAAACTGA ATTACTTGTT 


GCGCCAAAAT 


npAA TP 7A 7\ 7\P 

Ab AA 1 b AAAb 


Tv ^/^"T>r , /* , rp/TpTi 

Abb 1 bb 1 b 1 A 


D / UU 


AATTbbb 1 bb 


TP.PTTPT APT 


TGTTCAATTC 


CTTCGTAATA 


bAbbAbb 1 1 A 


b 1 b A 111 bAb 


D f OU 


m t\ rn rri rn rp 7\ t\ TV 

TGAT I I 1 AAA 


TTnCP,TTf^T 


TCATCTAAAG 


AATGCAATAA 


rn rp rn rp rp p" f rp rp 71 

1111 Ibbi 1 A 


7\ rn rp rp/^p rp/™" t\ rp 

Al 1 1 bb 1 bAI 




AbbAA 1 bAbb 


A A APP.P-TP,AP 


TCTGCGACTA 


ATTTCAGTAA 


rpmprp n tv P'p- 71 t. 

1 1 b 1 AAbb AA 


AAAAAbAi lb 


JOOU 


b 1 AAbAb 1 1 b 


ATTTTPP^TTA 
M 1 1 1 Iv-Ol Irt 


AAATAAATCG ■GCGGCAACAA 


bbbl 1 lb 1 1 1 


a a a apTppaT 

AAAAb 1 bbr. 1 




Abbb AbbA 1 A 


APP APP ATTP 


TCCACATATA 


GCGGCACGCC 


1AAA1 1 1 ibb 


7\ TV 7\ ppttp a a 
AAAbb 1 1 b AA 


DUUU 


1 Al LtLbCAA 


APPTP-TCXPTT 


T T AG A AAT CC 


CAAAAGTTTT 


Pn*T"P7\ 7\ P""P P 1 TV 

CAT i AAb 1 bA 


rn it* T\ T\ T\ T 1 T 1 A T\ 

1 1 1AAA1 1 AA 




Ab 1 bAbbAl b 


PPTTPTP A A(7 


AAAAAAAGTT 


CCTGATTTAA 


7\y^i^>rprp^rp/^7\rn 

Abb 1 1 b 1 bAI 


rp m mmrrii^ 7\ rp rp rp 

11111 bAI 1 1 


fil Of) 




PPPTf^TPAPT 
Uoo lul 1 


TTTTGACACC 


TTTTATTGTT 


Al bbl 1 Al bl 


mnrp/^Tv Tirp/rpfp 
1 Al bAAl b 1 1 


DIOU 


AbbAbb A 1 


AAPAATP.AAA 


AACACACAAT 


TTGCCCTTTT 


1 Al bb 1 b 1 1 A 


AAbbbdbbAb 


D ti *i U 


bAAbbbcLAbb 


PTTP APP.TTT 
b 1 L 111 


TACCaAGAAG 


CCTTTGCTGG 


<PP7\ rp TV rp rp rp rp 

1 bAI Al 1 b 1 1 


r p r p r paaaa r p r pa 

1 1 1 AAAAl In. 




P^TAATPAaPa 
b 1 AA 1 b Artbrl 


ATTTA A AP-Afl 


CGCTTAAATC 


CTTCTTTGAC 


p» T\ rprp pi p*p»rp 

bAI Ibbbbb 1 


ppppaapa ap 

bbbbAHoArib 


OJuU 


aPTf^P-PTTTP 


TP APTPfi ATT 


TTACAAATGG 


ACACTTTTCA 


TV rp rp 71 /-< 7\ TV /^rp tv 
All Ab AAb 1 A 


PPTP a T A ATP 
bb 1 bAI AA1 b 


O *i c- \J 


PTTTP-TTTP-P 


AGGAATGAAT 


ACCACGCCCA 


bAbAAbAAi 1 


bAbbb 1 bAbb 


TTS APPPTPa 

1 i AAbbb 1 brt 


fi-i ft n 


aT ap,ppp,tpa 


rTPTGCCAAA 


ACAATTTTTG 


AAAAACTCAC 


b 1 bbbAlbbb 


bAA 1 bb 1 b 1 A 




TTATfrAAGA 


ACCAATTGAA 


AATGAATTTG 


CAACATTTTA 


lbbbAl lb lb 


a a a p aTPPTT 
AAAbA Ibb 11 


O D VJ \J 


TTP,PTTTAPiT 

1 1 O 1 1 1 i\VJ 1 


PArTCAAATT 


ACTCATGAAA 


AACAAGCAGA 


rp /->/-' rp 71 r>r*T\ TV 

Tbb 1 Abb AAb 


AAAbb AAb 1 1 


DDDv 


AGCCGAAGAA 


TGCGAACCTA 


TGAATCAAAA 


GAAGCCTTGA 


1 1 bAbbbbAi 


tp a a at a ppt 
1 bAAA 1 Abb 1 




TCACAAAAAT 


ATTTAGCTGA 


ATTTGCAGAA 


ATTCCTGAAA 


paPTTAAAPA 
bAb 1 1 AAAbA 


TPAPPPAATT 


6780 


GAAACAGTAG 


CTAAAACACC 


TTCAGAGAAC 


TTAGCCTATC 


A ATT AP-P-TTP- 


flPTPAAPTTG 


6840 


CTGCTTTCTT 


GGGAAGAACA 


AGAACAACGT 


GGTCTGACCG 


TTP A A APPPP 


AnPTf^AAKGC 


6900 


TATAAATGGA 


ATCAACTGGG 


CGCGCTCTAT 


CAATCATTTT 


A TP A A A PPT A 


TGGACAAATG 


6960 


AGTTTAGAAA 


GTCAGCTGAT 


TGCGTTGCAA 


GACACCTTAG 


a a n a att apt 

AAAAA 1 1 Ab 1 


TPATTP,P,ATT 


7020 


GACTCGCTTT CCGAAGACGA ATTATTTTTA CCTCAACAAC GGGCTTGGGC GACCACCAAA 7080 

GCACAATGGC CTCTTTGGAA ATGGATTCAC ATTAATAGCG TTGCCCCTTT TACTAGTTTC 7140 

CGAACGCAAA TTCGCAAATG GAAAAAAGCT TGTCTTTAAA AAGTTATCCA CATTTTTTCA 7200 

AAAAAAGCTT GACAAGTATT TTTAAATTCT CCACAGAAAA CTTTGTGGAT GAAATTTCTT 7260 

ATTTTTTCCA AAAAACTTTT CCACAATGCT TATTTGTTCG ACATTAAATC TTTGTAAATT 7320 
CTAGAAAAAA GAAGATGAAA ACGTTGTTTT AATAACGTTT TCATCTTCTT TCAACTTTAG 7380 

GGATTCCTTT ACTTTTAGCA AACAGGACTT CTGAACTTAC GTCTGTTTTT TTTAGCAACT 7440 

CGAAAACTTA TCCTTGTGGA TAAACCGAGA TAAAAAGCTT TTTTGTGGTT TtCAACAGAG 7500 

TTATGCACAA AGTTATCCAC AGGGCGTTTT CAGCAATTTT TCGGAGAATA TTCGTTGTTT 7560 

AATCTGGTGA TTTAGCATTT GAATGTCAAT CCCTTACTAC TAATTCTTCT tCACTCCACC 7620 

CTTAGGATCC TAATAAAATT TTAAAGTkTT cTcTAtTGGT ACATCArATA AAGAGTATGr 7680 

kAAACTATAG GTACTT 7696 
(2) INFORMATION FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538: 



CTCCTATACA AACATAGGCT ATTGCTCAGA AATGTCGAGA GACACCAATG AGTAGAACAA 60 

GACTGGTCGA GGAGAAGGCT TTTCTCAAAA TAGCTAAGAG TGATCTTTAC GTTGTATAGT 120 

GCTAAAGCTC AACGTTCATG TTGATGTATC TTTTTAGTCT CTAGTCAATC AGTTAAGTAA 180 

GTACTTTTAT TTAAAATTAA GAGAGGAGTG AAAGTATGAC TACTATGTTA GCAATCATTG 240 

GAACTGTTGC AATCATCACA GTTCAAGTTA TTGAATATAA TATGACAGAA ATTCATATTC 300 

CTACTGCAGG AACATTTAAT GACGATGATG CAGAACCCCC ACATTCTTCT GTAACCCTTG 360 

TATTTATTCG CACTAtACyT AtCACTCTCT CACTTAACTT TCCATCTCAt TTTTTTGAGA 420 

wGGkTTTTTT ATGkATyCGr GAAAATwATr AACmAAAAAG ACyGtTAAAy CAATGATTTA 480 

ACAGTCTTTA GTATACCGGC GGCCGGGGTC GAACCGGCAC TCCCGTGAAG GAACTGGATT 540 

TTGAGTCCAG CGCGTCTGCC AATTCCGCCA CGCCGGCAAG ATATTTTGGA AGGCGGTAAC 600 

CGGATTTGAA CCGGTGATGA AGGTTTTGCA GACCTCTGCC TTACCACTTG GCTATACCGC 660 

CATATTCCAT TTTTGTTACm AAAAAACTGG GGTAGCTGGA TtCGAACCAA CGCATGACGG 720 

AGTCAAAGTC CGTTGCCTTA CCGCTTGGCT ATACCCCAAT AAAATTACAA AGGGCGACCG 780 

ATGGGAATCG AACCCACGAA TGCCAGAGCC ACAATCTGGT GTGTTAACCA CTTCACCACG 840 

ATCGCCGTAA TCTTAAAAAG TGTATCTATT AAATAATTAA ATGGCAGGGG CAGCAGGAAT 900 

TGAACCCACA CCAACGGTTT TGGAGACCGT TGTTCTACCT TTAAACTATG CCCCTAAAAT 960 

GGAnGAAAGT GGATTCGAAC CACTGAA 987 


(2) INFORMATION FOR SEQ ID NO: 539: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 



CGCACCTCAC GATCGTAAGC AACAAATTTT TCCGTAAATA GCAAGGGATT TTCCAAGCCT 60 

AACGCTTTAG CAAAGGTAAA CAAGCTTGTT TCTTTTAAAG AAGCTGCTTG AATTAATTTT 120 

TCATCTGTTA AAACAGCAAC TGGAGTCGCA TCTGCTAGTA AACGTCCATC ACAAACTACT 180 

AACGCACGTG TTGTATATTC CAACATTAAG TGCATATCGT GAGTGATCAT TAAAATAGTC 240 

ACACCTAAGC GGTTCAATTC TTCTAAAAAT GTCATCATTT CTGTATAATG TTTAAAATCT 300 

TGCCCAGCTG TTGGTTCATC TAAAATTAAT AATTCAGGTT CTAAAACTAA GATTGATGCA 360 

ATAGTTACCC GTTTTTTCTG CCCAAAACTC AACGCAGAAA TCGGCCAGTT CCGGAAGGGA 420 

TACAGCCCAC AAATATGTAA AATATTCGTC ACACGTTCCT CAATTTCTGC TTGTGGCACA 480 

TCTCGTAAAA CTAATCCTAA AGCTACTTCT TCAAAAATCA TTTTTTTAGA AATCATTTGA 540 

TTGGGATTTT GCATCACATA ACCAATTTTA TCTGCACGTT CTTtGATTGA ATAATTGGAA 600 

AAATCCTGTC CTTCCCATAA AATCTTGCCA GATTGAGGCG TGATGAACCC GCAAATTATT 660 

TTAGAAAGTG TAGACTTCCC GGcACcATTT TTTCCTACAA TGCtGATCAT TTCtCCAwGA 720 

TGaATAGTTA CwGAGAAGTC ATCTAACACT TTTTCACCG 759 



(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540: 



ACCGACGCCT AAATCCTTTT GCTTCAAAGT AGACTCCATT GCCATTCGCG CATGAAGTTT 60 

CGTATTATTT TCTTTGCTAA GATGACCTAA ATAAATGCGT TTTGTTTTAT CACCTAACAC 120 

ATCCGCCATC ACAAGAGCAC CATCATCATT GGATAAATGG CCTTTATCTC CTAAAATCCG 180 

TTGTTTAAGA CTCCATGGAT AAGGTCCTGC TCGCAAAATT TCAATTTCAT GATTGCTTTC 240 

GACTAAATAA GCATCTGCAT TTTCAATCGT ACCACGAATA TGATCGCTAC AATAACCTGT 300 

ATCCGTTAGA ACAACAAAGG AACGATTATT GCGATGAAAA CGATAAAATT GAGGCGCAAT 360 

CGCATCATGT GAGACACCAA AACTTTCAAT GTCCATATCT CCCAAGGTCA AAACTTTGCC 420 

CATTTCAAAA ATATGTTTTT GAGCGACATC GACTTTGCCA ATCATGGGAT CCATCGCTTG 480 

CCACGTTTTT TCATTGGCAT AGACATCTAA CTTATATTTT CGCGCTAACA CGCCAACGCC 540 

GTGAATGTGA TCCCGATGTT CATGTGTAAC TAAAATCGCA TCTAAATCTT CTGGTTTGCG 600 

ATTGACTTCT GCTAACAGTG AAGTGATTTT TTTGCCACTT AAACCCGCAT CTACTAATAC 660 

TTTCTTTTGG TTGGTTTCAA TAAAAAGGGA ATTGCCTGTA CTGCCACTGG CAAGAACACT 720 

GATATTAAAA GCAAACTCTG AATCCATTTA AAACTCTTCC CTTCTACATT ACAACATTAA 780 
CGTGTTTCAT TATACAACGG ACCTTAATGT TTTTCCACCT TTGGCACAGT ATTATTTGTG 840 

ATGATGGTAT TACTTACGGC ATTCACCTGT TCAATTTCAA CTGAATCTTC ACCAGTAGAA 900 

ATTCCTACAA ACCAGACTGG AACGTATACA TTTTTCTCAC GAATTTTATA AATTCTTGaA 960 

TAGGCCAATT TAAtGAATGT TATTTTGGAA TTACTAgGAA TCCGGTTATT CATATAGAGC 1020 

GTGTCTATCG CATCACGATT GGAATATAAA TCCGTTTTGT CTCGTAACTC TTCAATATTT 1080 

TCAATGTGCG TTTGCGAATA TTTACTAATT TGCCATAATT CACCTGACGA ATCTGCTAAA 1140 

ATACTTAnTT TTGCCGGGTC GnnCCCGAAA A 1171 
(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541: 



GCAATCATTG TATTGGGATA GAGGTTAAGn CCTAGGGCAT CCCCATCTGG AAATGCTGTA 60 

TGGTCATGGn CATTCCTAAC CAGGATTAGG AnGGAGGGAT CGCAATTyCC ATCTCGTTCC 120 

ATTTAATTTT GCTTAAGGAT TGTGCCATCA AAACACCAAC AATAATCAAC GCTGGTGCTG 180 

TAACTTGCGA AGTGACAACT GCTAACAAAG GAGAAAATAG CAAACTGAAA ATAAATAAAA 240 

TCCCGGTTGT AATAGCTGTG ATTCCTGAAC GACCACCCAC AGCAATCCCC GCAGATGATT 300 

CGACATATGC GCCAACTGGC GATGTCCCTA ATAGAGAACC AGCTAACATC GCTGTTGAAT 360 

CAGCGGCTAA AGCTTTCCCC ACACGCGGCA TTTTATTGTC CTTCATAAAA CCTGCTTGAT 420 

TGGCTAAGCC CACCAACGTA CCAGCCGTAT CAAAGAATGT TACTAATAGG AACGTTAAAA 480 

CCACGACCCA CATTTGAaTC GTATTgATAT CTTTCACATG TCCTAACGCC ACTAAAAATG 540 

TTGGTTCTAA aCTTGgCcGC TTGAAATAAt CTTATCAGGC ATTGGAATAA CGCCTGTCGC 600 

TAAACCAACA ATCGTTGAAa TCGCCATCCC AATAAAAaTA CCACCCGGTA CTCGACGAAC 660 

TAACAAAATT GCAGTTACGA CTAAGCCGAC AATGGTAACC CACGTTGAAC CAACATTCAA 720 

AGGTCCTAAG CCAACGAGTG TCGCATCATT AGCCACAATT AAGCCACCTT GGCTTAAACC 780 

AAGAAAAGCA ATAAACAACC CGATCCCACC TGaAATGGCA AATTTCAAGT CTGCAGGAAT 840 

TGCATCAATA ATTAGCTCGC GTAATTTAAA AATTGTAATT AAAATAAATA TCAATGAAGC 900 

CACAAAAACG CCCGCTAATG TCGTTTGCCA AGGAATCCCC ATACCAACAG AGACTGAATA 960 

AGCAAAGAAT GCATTAATTC CTAAAGCTGG TGCTGTCGCA ATCGGATATT TCGCCAACAC 1020 

ACCCATTAAA ATACAACCTA AGGCACTCGC TAAAGCGGTG GCAGTAAAAA CAGCGCCTTC 1080 

GTCCATGCCA GAAGCACCAA GTACGGTTGG ATTAACGAAC AAAATATACG CCATTGAGAT 1140 

AAACGTGGTG AAACCAGCTA ACATTTCTCG TTTCATATTC GTATTCAATT GTTCAAGTTC 1200 
AAAATATGAA CTTATTTTTT CTTTCATGTT CCGTCCTCCT AAAATCTAAA TATTTTACAA 1260 

TGTTCGTAAA ATATTCAATT TATTAATGAA ACAACGTGCT TCCTTGAGTG GATATTATTA 1320 

AGCACGAATA GCCCCATTTT ACTTGATACA ATAGGAATTT GCAACAAAAC ATAATTAGTT 1380 

AAATACACAT AGTTTTGATC AAAAGCACAG ACCAAAGCAC AGACCTTTTT TAATCCATCA 1440 

CGCCAATAGT TACATATTAC GTTTACTGTT TTTTAAACCC AACGGGAAAA GAATAGTTAT 1500 

ACCTGATATA CCTATACTGA TAAAAAAATA AATGAGGTAA CTTGTCCAGT CAGCGGTACC 1560 

CCCATCTATA TAACCTGAAA TTGAAAAATA TGAAAaCAAA ACTATCACAC TGACTAACCA 1620 

CTGATATCTT AACCTTAATT TTAATTTTAT TGTCATTACG TAATACATCA AATACACGCC 1680 

TATCAGTACC AACCATACGA TGTAACTACT GATAGAATAA TGACCATCGA ATAAGcCCAC 1740 

ACTAGATTAA TCAAAATAAT CAATATTATT GGTAGTAATA CTTGACATAA AGTGTTAGAT 1800 

CGATAGATGC ATACTATTTT ATCGATAATT TTTTTCATTG TTTGCTCTCC AAAAGTTTAT 1860 

TTTTAATATA AGCTAATAAA GTTCCTAATT CAATTACTAC CCATGCTGTT AACAGAATAG 1920 

TCCAGAATAA ACTTCGTACA TAGAATAAAG CTATAAATGT AAATATTAAT ATTGCCAACT 1980 

GACTGATTAT CACTAATAAA TCTTTTTTTT GGGAAAATTT ATAAGCTAAA AGACTGCACA 2040 

CTATGCCAAT TACTGGGATA AAACGAATAT AACTAGGAAC CCCTTCCATT GGTACAGTAA 2100 

ACAATAGAAT TATAGAATAC CAAAGTTCAT CAGGAACCTT CGACGATAAT GCCCTTATTC 2160 

TTCCCCAATT CATAAGTTAA TCCCCCTTTT TCATTCTCTT ATACTTGTCT TCTATACGCT 2220 

TATCCTGTTT TTTCCCGCCT GCATGAAAAG CTAGAATAAT CATTGCCTCT GTAATAACAA 2280 

ACTCCCATAC TGACCAATGA AAAATAACTG TCATCCCTAG AATTACTAAC ACAGCTGATG 2340 

CTAGAATAGC ATCTTTTGTG CCAAAATCTA TCCCTATAGG TAACATAAAA AACCTCCAAT 2400 

CTCACTTTTT TTATCCTAAA GACATTGCCA AAATAAATAA TATCCAAGGA ACTAAAAAAC 2460 

TAAAAGCACT TAAAAGATTA GCGACAAGCA ATATAAGTTT TAACCCTAAG TGCTTTATGT 2520 

ACATACTAGG AATAGCTCCT ATCAAGCCAA TTGGTAACGT CCACAACAAA TCAAAGGCAC 2580 

CATGATAGGG ATTGTAGAAA AAGAGGTACT TAATCCATAC AAGTATATTC AAAAGAACGA 2640 

ATAGGACACT CGCAACAACT GCTGATCGAT TATTTAGTTC TATTTTCATA GTTACTGACC 2700 

TCTTTGGTTT TTATTTTAAT TATATCAGAA AAGACTTCAC TGATAAAAAT CTTTTCTTCC 2760 

CCTCAAACAA ACGAGCTATA ATAAAGTAAC TAGAAAACCA GCAAAAATAA TTTGTAATAA 2820 

TTTTAAATTA TTGTGCTTTT TTATATTTTA GGTATAAAAA AGGTAGCCGA CTATCCGACT 2880 

ACCTTTTAAA CACTATTCAA TTACAGTAAC ATTTACCGCT TTTGTTCGCC CTTCGTATTC 2940 

TGCGGTTTCG TACTCCACTG CTTGGCCTTC AAGCAATTTT CTAAAGCCTG GCATACAAAT 3000 

ACCTGTTTGA TGTACAAAAA TCTCTGATCC ATCTTCGGCA ATAATAAAGC CATATCCTCT 3060 

TTTATGATTA AAACTTTTAA CTGTTCCAGT TTTCACTTAT ACTTCCCCCG TATTATGATT 3120 

TAAGTTGCCC ACGATCAAAT TTCATAGCTC TTTCTACTAG ATTAGGAGCT GAAACATAGC 3180 
TTGAAATGCC TATTACACTT CTGTTATGGC GAGTAACTCC GATGATTTTA TTTCCTTCGT 3240 

CTACTACCTC ACGATTATAG ATTGCATTAG CTTTTACTTT AACTGATTTT ATAAACATAG 3300 

CCTTTTCATT TAATTGTTTA AGCAAAAGTG CTATTTCTTG TTCCACGGGC TCTAGTTGTT 3360 

TTTCAATTGC TTGTCTGTAT TGAGAGTCAA TATCTCCTAC TTTATTAGGA AAATGAGTAA 3420 

AACAAGATTG CTGTAGCTCC TCGCGTTCTT TCTTCATTTG TGTTAGTAAA TGATTATTTT 3480 

CwGTTAAAAG CCTGTTTATT TCAGGTTTTC TCTTAAGACT ATCTACCGTA AAAGTTTTCG 3540 

TATTTTCTAG CGTTTCTAAC TCTTTCTCTA AGACAGATTT TTCCTGtTCT AAATCATTAA 3600 

TCACTTTATC TACTCCGCTA AAAATTTCTT TTAGCTCGGA AATAAGCTCA ACTTTTCCCG 3660 

GAATATCGTA GTTAATTTCT ATCATTTTAA TAGCACCCCT TATTTATCAT ATTTAGACAT 3720 

GATTTTTTTA AATGGATCAG GTTTATCTTC TTTCTTATCC TCTTTATCTG ATTCCGTAGA 3780 

ATCACCTAAG CTTTCTTTAA TCAACTCTTC TAATGCTTCC GAAGCTTTAT CAATGTCAAA 3840 

TTGCCCTTCT TCATCTACAA ATTCCTCTAA AACTTTTTTA ATATCTTCCG CTGTCATATT 3900 

TCTTACCTCT TTCATAATTT TTTATATATT TTGGTGAAAT TTCACCGAAA TAAACATATT 3960 

CTGCTTGATA CATGATATTG CCTAATTGAA AAGCGTCATT TACTTTGTCC TTGTTTGTAC 4020 

ATAAGTGTGT ATGGAATTGC nGnATCCAAT AAACTAGCAG CCGTGTGATC TGGTTCGAAT 4080 

ATTAGCAATG GCAGCATTCC GATAACACAA TATTACTTAT TAACGGCACn CCAT 4134 



(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 



AATTTTTTTA GTTCGTCTGT GGCGTTCTAA GTGCAGCAAA TAATCGTCTT GCAGTTTTAG 60 

AAGGTCTGAA TATACCTGAT CCATAGTCTT GCTGGCCGAC AAAGAGAGAG AATAAATAAA 120 

TGAAATAGGA AGTGTTTCAA TTTTyTTGTT GCGAGTAATC CAAGAAGAGA CAGTTCCTTG 180 

AGAGAAAGCA TGTAATTCAC AAAATTCTTC TACAGTAATC CCTAGTTGCT TGATAATGTA 240 

TGCGTTGaTA GGGTGTGGAT ATACAAAGGT TTTTCTmGGC ATGGTGGGTA AGCTTCCTTT 300 

CTTAGGTTAT TAATGCGTTT GCAGTAATAT TGTACTAAAA AACATGCAAT AAGGGAAGGG 360 

CGTACCTGCT TCTTTTTTGT TGACCTTCTT TAATAATGTG TTATACTAAT AATGCGGAAG 420 

ACATAAGTTC GATATGAmCA GCACAAkAGm CCAAACcTGT GACGAGGTTT GGGCTATTTT 480 

TTTCGTCTTG CGGATAGACG TGTGGGCTAG GACAACTTGC ATGGTTATCT GCTAATCATC 540 

TAACCAATGG TCAACTAGTA AGATTACTAG ACCCACAAAA AGTGGGGCAA TAACCAAAGA 600 

CATAAGTTCG TACATAAACA GCACCCCCAA TCGGAGGCAA GTTGCCGAAA TAATTATAAC 660 

ATAAATTCGA ATATACCTAT ACAGTAGTAA ACCTTCAGTG TTTTACTTCA ACTTTTAGCT 720 
CTCTTTTTAG TACGTATACT CGTTTACATA ACGAACATTT GTACTTACCT GAATCACTTG 780 

ACACTCTGCT CGCAAGACAC CCaCACGCAC ACTGTACGTA TGCAACGCCA CTTATTTTAT 840 

TGTnGTCyTT AATCACTGcA CCCtGGTAGT TAAaTTyCTT CATAAAATTC TCCATCATCG 900 

TAATAATCTT TCATAAAAGC ATCGATTGAT yCTtGTTCTT CATCTGATAA TTCTGGAAGT 960 

ACGACGTCTG CGAAAGGATT TTCCAATGTA TCTACAGCAG ACCA 1004 
(2) INFORMATION FOR SEQ ID NO: 543: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 

CCCCTTyCAA ATAAgCtCGT TCgTGCTGGT CGGGCCakGA ATGCCgGTGT TTAcTTTGTC 60 

aCGCaAAATA GTGaCGATTT aCTGGATGaA CGCTTGAAGA ATAATATCGG ATTGAAATTT 120 

GCTTTTCGTA GTACCGATAT TCaTGAAATC aAGAAGACTT TGGAATTTTT TGGGTTGGAT 180 

CAAGAAGATG AAAGTAATCA AAAACGCCTA CkAGATTTGG AAAACGGCCA ATGCTTGATG 240 

CAGGATTTGT ATGGTCGGGT AGGTGTGGTT CAGGTTCACC CGGTCTTTGA AGAACTCTTC 300 

CATGCCTTTG ATACTCGCCC ACCGGTTCAA GAGAGAATGG AGTGAAGGAG ATGAAAAGGA 360 

AAATTTTACG GATTGTAGGC ATTGTTTTGA CTAGTCTATT TTTCCTCATC ATTCTTTTGA 420 

GTTTAGTTGG AAAGGTAGCG GAAGCGACTG GATTAGTCGA TGATACAGTG AAAGCTGGTA 480

ATCTCTATTC TCAATATTCT CTTGGTAACT ACCAACTAGA TTTTTTTGTA GACAATTCTT 540

GGGACTGGCT ACCTTGGAAC TGGGGGGATG GTTTAGGAAA AAGCGTGATG TATGGTCTCT 600 

ATGCCATCAC AAATTTCATT TGGACCGTAA GCTTGTATTT GTCGAATGCC ACGGGTTATG 660 

TGGTTCAAGA AGCTTATAAA TTGGATTTTA TTTCAGATAC GGCAGAAAGT ATCGGGAAGA 720 

ATATTCAAAC TCTAGCCGGG ATTAACGAAA ATGGACTTCA GCCATCTGGT TTTTATTTTG 780 

GCTTTTTGAT GTTAATGATT TTGGTTTTAG GCATTTACGT AGTCTATACA GGATTACTTA 840

AACGGGAGAC TACCAAAGCA GTGCGAGCTG TTATTAATTT TGTAGCGATT TTTCTACTTT 900 

CGGGTTCCTT TATCGCCTAT GCCCCTAACT ACATTACGAA AATCAATGAC TTTAGTTCGG 960 

ATATTAGTGA ATCTGCATTG AGTCTTGGCA CTAAGATTGT CGTACCAAAT TCTGAAAGTC 1020 

ACGGTAAAGA TAGTGTTGAC TTAATTCGTG ACAGCTTATT CTCAATTCAA GTCCAACAGC 1080 

CGTGGCTCTT GTTGCAATTT GATGATTCCA ATGTAGAAGA AATTGGCGAA GAGCGAGTTA 1140

ATAAAATTCT ATCCGTGAGC CCGGATGAAA ATAAAGGGAA GGATCGCGAG GAGGCGGTTA 1200

AAGCCGAAAT TGAAGACAAT GAAAATGCAA ATTTGAGTAT AACAAAGACC ATGAGCCGAT 1260 

TAGGGACCGT GGTTTTTTTA GTATTGTTTA ATATTGGAAT TTCCTTCTTT GTCTTTCTAT 1320

TGACAGGAAT AATGCTATTT TCCCAAATTC TTTTCATTAT CTTTGCGATG TTTCTCCCAA 1380

TTAGXTTCCT GTTGTCCATG TTACCGACCT ATGAAAGTTT GGGAAAGAAA GCGATTGTTC 1440

GCTTATTTAA CACGATTATG ATGCGAGCGG GAGTGACATT GGTCATCACG ACAGCATTTA 1500

GTATCTCTAC GATGTTTTTC AATATTTCAG CAACCTATCC GTTCTTTATG GTAGCATTTC 1560

TACAAATTGT AACCTTTGCC GGGATTTATT TCAAATTAGG CGACATTATG AGTATGTTCA 1620

ATCTCCAAAG TAACGA'i/vGT CAGTCTATGA GTAGGCGTGT GATGCGAAAA CCTCAAATGT 1680

TAATGAACCG TAAATTTCGT CAACTCAATC GAAATGTTGG CCGAGCGTTG GCTTTTGATG 1740

GCGGTGTAGC AGTTGCAGGT AAATTAGCAA CTGATTCTAC AAAAGAAAAC AGAGGTTCAA 1800

AAAGCAAAAG ATTATCTAAT CGAAGAAACC AATCGATACA AAGTAATCAT GAGCAAAATA 1860

CTGATCCGAA GAAACAAGAA AAGACTACTA ATAGTAGAAC CACTCGATTG AATCTTATGG 1920

GTAGGAAAGC TGGAAAGGCA ATGGATTCAA AATCTATCGT GAAGGACAAA GTAAAACAAG 1980

CAAAAGACCA TGTAATTGAT ACGCCAACAA ATTTGAAATA TAACCTGCAT AGAGGAATTG 2040

AAAAGGCGAA AAATGCCCCA GGGGAATTTA AACGCGGGCT TGTGCAGGAG AAAAAAGAAC 2100

GAGAAGAACA AAGAAAAGCT AATCACCAAC AACGGCAGAA AAGAATGGAT GAAAAACGCC 2160

AAGTGTTAGG 2170
(2) INFORMATION FOR SEQ ID NO: 544:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1141 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544:

CAAAGAATAT TGGAGGACAA AATGATCAAT CAACCAAATT CAGTTGATCA ACCAATCCTG 60

ACGAACCAAG AAAAAATGGT CAAAGGCTCA GCTTGGATGA CAGCTAGTAA TATTATCTCC 120

CGCATGTTAG GGGCGATTTA CATCATCCCC TGGTACGCTT GGATGGGCGA GCACGGAAAC 180

GAAGCGAACA GTTTGTTTTC AATGGGGTAT ACGATTTATG CCTTATTCCT AATGATCTCT 240

ACCGCCGGAA TTCCAGGTGC GATTGCCAAG CAGACCTCGC ATTATAATTC ATTGAACGAA 300

TATAAAwTTA GTCGACAGCT TTTTwATCGk GCCTTACaAC TAATGGGtGG CTTAGGTGTG 360

GkTTTTGCTA ttGtGATGTA TTtGGCATCT CCGGCATTAG CAGCTTTGTC TGGCGGAGGA 420

CCTGAATTAG TTCCAACAAT GCGTyCATTA AGTTTAGCCG TTTtAGTTTT CCCArGCATG 480

AGTGTCATTC GCGGTTATTT CCAAGGAAAT CAAGAAATGA TGCCCTTTGC TTTATCACAA 540

ATTGTTGAAC AAGTAGCACG AGTTTTTTAC ATGTTATTAA CTGCTTTTAT TATCATGAAA 600

GTCTTTGAAG GTAATTACGT TACAGCGGTG ACCCAAGCCA CTTTTGCGGC ATTTATTGGG 660

ATGCTCGCAA GTTTTGCGGT TTTAGGTTAT TACATGTACA AACAAAAACC ATTATTTGAT 720

TATTTAGAAG AACACAGCGC CAATGAACAT GAAATTGCTC CAAAAGAGTT ATTGATTGAA 780

ACTTTTAAAG AGGCGATTCC TTTTATCGTA GTCGGCTCAG GAGTGACTGT CTTTAAACTT 840

GTCGATCAGT TTACTTTCTC AAACTTTATG CGTTTGTTTA CGGaTTATTC AGATACCCAA 900

TTACGAGAGC TTTTCGGTAT TTTTAATGCC AATCCGGACA AATTAACCAT GATTGTCGTG 960

GCTTTAGCGA CTTCGATTTC GGCTACTGGa TTACCGTTAA TTACAGAAGC GGTCACCTTA 1020

AAAaATCACC GTGAATTAGC GCGTTTGAtT AGTAATAATT TACmACyTTT TGtGkTTGTA 1080

ATGtGCCAGC AACGgTCsGt ATGATGTTTT AGCGnCACCA CTATACACTG GTTTTTACAT 1140

G 1141



(2) INFORMATION FOR SEQ ID NO: 545: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545: 
TTAGATATGC CCTTATCAAG TCAGGCTCTG TATTTTCATT TAGCGATGCG TGCCGATGAT 60 

GATGGATTTA TCAATAATCC CAAAAAATTG CAGCGAATGG TCGGTTGTGG GGAAGACGAT 120 

CTAAAATTGT TGATGGTTAn AAAATTTATT CTAGTATTTG AAAGTGGTGT GATCGTTATC 180 

AAACATTGGA AAATTCATAA TTATATTCGC AGTGATCGTT ACAAACCAAC CTTGTATCAA 240

GAAGAGArAA ATCAGATTGT TGAAAAAAAT AGCAAAGCTT ATACGTTTAA AGCAGAATCG 300 

TCTGTCGGTG GTCAACCAGC TGACTACCAA CGGTTACCAC AGGAAAGCAT AGTCCAGTCT 360 

AAGTTAGGTC AGAGTCAAGG GCAGTAGTTC AGAAAACGAT TGTTTAAAGA TGATTTATCA 420

TTTTaTGAGG AAAACGGCTT gGTACACTGG CCTCAAAAAC AAGCCArGAT TTTaAGTATT 480

GGTTGCAAGA TTTTATACAA AAAGGGGCTA GCCAAGAGGA AGCATGCCAA TTAATCTTGC 540

ATGcTTTAGG AATTGCCGTC GATCGAAATA AACGGAATTA CGGCTATGTA AATGCTATTT 600 

TGAAAAGTTG GGAGCAACAA AATTATTTAT CCGTACATGA AGTTCTGGTA AATGATAAAA 660 

AACAAGTGTT GGAGCATGCG CCGCAAATGA CAGAAGAATA TCAAGAGTTA GGTTTTTAAA 720 

GAAAGGAGGA AATCAGTATG CATGCGACAG ATCnAACTTT TCCAATACTA TTGAGTCAAT 780 

TGTTAGAAAA AGTTGAAGAC CGTTGTCCTG AATGTGGCAG TGAACAATAT GTTTGGCCAC 840 

AAAAAAATAA AGATGGCACm GAACGTTGTG CCCCAACTTG TTGGTCGTGT GGGTAtAAAA 900 

TGCTAAAAAA ACATGAACAA nGAGCCAATC AACAACGTTC TCAAGAGAGT TTTATGGCAC 960 

GTAAC 965 
(2) INFORMATION FOR SEQ ID NO: 546:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 716 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546:

GACAAATGCT GCTGCCGCGA AAGGATCGAG AGACCCTTAG TTGGTTGAAC AGGAATAATC 60

GTGAAGGTTA TTCATAATGT AGCTGATACA TGTTATCTAC GCCCTATAGT GCTAAAGmTC 120

AACGATAGAG AATCTTATTT AAAGATGGTA TTTTGGGACG TACTCTATTG TTGAGTAGGT 180

CCTTTTTTGT GTGTTTGGCA AAAAAGCAAT TATTCACTGG AAGTATAAAT AAGAAAAAGA 240

AAGGATGACT GTCATGGAAG ACCCTGGAAG TGAACAGAGA AACAAAATAC AGTCACCAAT 300

GAAAGGAGAA GATTTTAGCG CTCTTTTCGG CAGGTGACCA CAACCTTGGA GGAAAAGAAA 360

ATGATGAACA AAAAAACAAT GGaTATGAGA AAAGCAACCA AAGaTCAAGa ATTACGTAAA 420

TTGGCTTTGC TTTCTGAACG GGAATTAATG ATGGAATTAC GGACATCTGA AAAAGGACTC 480

TCGAATGAAG ATGCCGAAAA AAGATTAGAG GAATTTGGAC CAAACGAaGT TTCTGCACAA 540

AAGCCAACCC CAGCAATTAT TTTATTTTTA AGTGCATTCA AAGATCCGTT TGTCTATGTC 600

TTAGCCTTAT TGATGGTCGT TTCAACTTTA ACGAAAGATT TCGAAGCCGC TATCGTGATG 660

GGCGTGATGA TTTTAGCCAG TGTCTTGAnT GCTTnATACA AGAGTATCGT nCTCAA 716



(2) INFORMATION FOR SEQ ID NO: 547:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 963 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547:

TAGACAATTG GTCCCCACTA CAACTATCCA GTTATAACnA AGAGCGCCCA CTCGAACCTT 60

TTTCTGTAAT ACGCCTAATG TCATTACCAT CATATTGCCT GTTGCTAACT CACCACCAAC 120

AAAAGTAAGC GCTATCAACC CAACTGGAAA TAAAGCTCCA CCCAGAAAGT TGTTAAAcTT 180

CCCCmTTCTG TGGGGGCAGT TCCaGTGATA CGAATAAACG CCAAGTAACC CAgTGCAaTA 240

CAAAAACCAG CCaTGATGCC TAAAATACAG AGTCGAACAA ACGATAATTT CCCTTTCATT 300

TCTGCCTTAT CACCTAAGCT TGCAACTATT TCTTCACTAC TGAAAAAATC CATTACGATT 360

CCTCCTTTTT ATCGATACCA CTGAGACAAT TCTTCTTTTC GATTGACATT TTTAAGTTGA 420

TTCTCTTTTG ATAGTTGCAC CGTTTTTACA GATAGACGCT CAAATTCTTG CCGAATCTGA 480

TAGTTATTTT GCGCAAGCTG TTTTTGAAAT GTTGGTAAAC AAGAACATCG ATAAAAGCCA 540

AATAATGGTT CTTGACGACC TTCAAAGTCA AAGATTGTTA CTTGATTTTG TCCACGCTTA 600

GAGGCTAACT GATACACAAG TTTAGGAGAA AAAGTTGGTA TATCACAGGC AATTACAAAC 660

ACTTCAGCTT TCTGAGTATA GTTCATTGCT GTCACAATCC CACCTAATGG CCCCTTTCCA 720

AGATACATAT CTTGAATGAT TGTGCATTTT CTAAACGCCT TAGGAAATTT TTGACTATCA 780

TTCGTTACTA AAATAACCTC TTCGAACAGT TCCATTAACT TTTCAGCTGT TTGCAAGATC 840

ACGTATTGCC CTTGAATTTT CAATAATGCT TTATCAAATC CCATCCGTGA ACTTTTCCCG 900 

CCGCATAACA GCACTGckGT ACAATCAGCA ATAATTTTCG CCACCCCTTT TACCATAATn 960 

Tnn 963 
(2) INFORMATION FOR SEQ ID NO: 548: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2744 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548: 



CACATCCTCT CTAAGTTTTC TTCTACATCG TCGGAAAGCT CGTTTTCTTC TAATTCTATA 60

TTTTCCATCA AACAAATAGC TGTTTCTATC TCTTCAATCT CATCTAATAC GCTCTCTATA 120

AGCCTGTGAG GGCAATTAAC CTCTTCTTTA TACTTGAATC GCCTAATGAG CAATTGTTGC 180

TCTAAAGGCT TAAGAGTAGC CCAAAACCGC TCAAAATGTT TTTGTCGAAA TAATGTGCGT 240

TCGATTCGTT TATCTATTAG CTCTAGCTTT AACAAAAGTT TTTCCACTTC TTTATCTGGT 300

CGGAAGCCTT TTGTAGTCAT TCCTAACTCA TCAAACACAA TATGCGTTGC CATATTTTGG 360

CTGAAAAATT CAATACGCCT TAATTTTTTT ATTGAAAGCA ACTTTTGCAA CACTAACGGA 420

AATGAGAGAT AGTTTTTTAT AACAATTTCA TCAAATGGAT CCATTGACAA CGTTCCTCTC 480

TTTCCTGAAT TGTCTTTTTT AGGCTGCCGA AAGTGCCGAC TTTTTTAGGT GTCTGATTTG 540

TCAGATTTTA AACAAAAAAG AGGACACAAG ACCTCACTCA ATTAAGAGTA AGCTGTCTCA 600

TATCCTCTGG TTCTTCCTAG TAAGATACAT AGCGCTAGAA TAGCAAGATA TACTTATTTA 660

ACTGTCTTTA TTATAGCACT TTTTTCGCCT AATAGATAGT TTAGACTGCT AATTCTTAGG 720

TAAAACATGC GATGTAATTG TATCGTATTT TACTACCTTC CCATCTTTTA CAATCAACTT 780

AACTTCCCCA AATTTTGGTA GTTCTACCAC TTCCGTTTTT TTACTGTTAT CAATGTATAC 840

TTTATCCATG TTAGCCTCCT GTTAATAAAA ATCTTTTAAA TTCTTTAGTT TTCTTAATAG 900

TAAAAATAGC TGTGCACGTT GTTAAGTAGC TCTTTAGCAC TCCTTAAGAT GCATAATTGT 960

GTTTGTTTTG AAATATTTTA ATTATACAGC GTTAGTAATT GTTAGTATTC ATACTTTTTC 1020

ACACCTAATT AAGGAAAAAG TCAAGTTTAG TCAAGTTTTT ATATAGAATA AAACGAGATC 1080

TTGTAAGGkT CTTTTTTAGG ACGGAAGAAG CGGAAGATTT CATGCCGTTA TTTCTTCTTC 1140

AATCTTTTTA GTTTCTTCGA TAATCGCCGT TTTTCAATCT CTTTAGCGTT GTTCCCTTTA 1200

GAAGCTCTAC GGTCTTTAGA CTGCACATTT AATGCATGTT TTGTGCCTGC TGGTTTTTTC 1260

TCTCTAATCA TGACTATTAT TCCTCCTCAT ACCAAATAGA AATAGTTGTA ACTGACGTAA 1320

CCTCCTCAGa ATCTCCGTAG CTTTCCCCaG TCTGTTTAAT GTGTTTGaTT TCAATATACG 1380

GGTTCTCCTC TAAAAAATCG TTTAATTCAC GCTCTAACTT GTTGATATTT CCtCTTTGAA 1440

AAATTTTAAC TTTCATAATC GCTACTCTCC TTGTCTTATA ATGTACAATT TGTAGTATTT 1500


TGTAGTATTT AAAATTTGGG ttgactatca AGATACATTA TAAACTCATC CGAACATAAC 1560

AGAAGGAAGG TAAATTTATC AAAAAGATCT ACGTACAAGC TCTCCTGATT CTCAGTAAGA 1620

TTATCTTTTG CTAAGGCAAA GTAAAGCCGA ATCAATCGCG CTAATTCTCC TTTAAGCTGT 1680

TCGACACTCG AAAAATTTTC ATCTTCTAAT AAAGACTGTA CAATTACTCC TTGCATGCCT 1740

CCTGCAGCTT CTTCTAACCA ATTAGGGTCT TTTTCCATTT CCGCTAAAAC GATAGGGGAA 1800

AAATGAGAAA TATCTCCAGT AAAATGACCT CGTGTAAAAG CCAGCATATT GGATAAAAAA 1860

CTTCTTGTAA TTTCTTCATA TTCAAAATTC GGTTCTTTTT CTGTTGATAA TATAAGCATA 1920

AGATTTCCTC TTTTCTTTAT CTTTTTTTGT TTTATTCGTT TGGTAACTAA TTCTAAAAAA 1980

AACTGGTTAC CGCTTAAAGC ATTGGGGGAG TAAGGATAGA GCCATTTGGT AACTAGTTAC 2040

CAATTCTTTT TACTATTCCT CAAACTTTTA TTAAAACTTT ATTTACAAAT TTCAATAGAT 2100

AAAATTAGTA ACTGGTTACC AAAACGCTCC TCCCTCACTC TCCCAAGGGG TGGGAGTGGT 2160

AACCGATTTA TTTTGAATTG GTTACCATTT TTTTATAAAT TTCTCTTCCC TCTTACTCTC 2220

CCAAGGGATT TGGTCGGTTA CCAATTTATT TTTAAATCGG TAATTTAACT TATCAGCTTT 2280

AAATCGTTTC TTACTAAAGA TTGGTAAGGC ATTCCATTTT CTGGTGGTTC TTCATAGCTG 2340

AAAAAATCTC CCAATGCTTT TATTTTTTCT ATCTCATTCA CACTATATTT TATTTTTTTA 2400

TTTATCCAAC TATCCCCTAA TAAATATGTG AACTGTTTAG AAAATTTAAT TTTACTCAAT 2460

GGTTTAAAAT TATTTTCCTT GCAAAACTCT TTATATTGAG AGTAAACCAT ATAAAAAGGG 2520

ATTTTGTTTA TTCCTAAAGG CTTGAATACT GTTTCTCTAA AATCAACTAG AGGGTCATTT 2580

TCTTGTTTAT ATTCTTCTAA AAGTTTTTTG GAAACATCAG GTACTATAAA ACGTTCAAAA 2640

TCCATGTTTA CCGCTTTATG CAAAACGTAT TCGAGCACCT CTTTTCGGTT GAtATACTCT 2700

TCTTTTATTC TCCAATCATC TTTAGCACCT TCAAATGAAG CATT 2744


(2) INFORMATION FOR SEQ ID NO: 549:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549: 

ATTnATCTTT TTCnGGCTCG GAACCAATAG ACCCATTTTG CCATTCTAAA CTGATCTAAA 60 

ACGTAATAAA ACAGTTTGAT CCATCTGGAT TGATTTCAAC CCATTTAGGT AGTTTGTAAG 120 

AACGTTTAGC CAAATTTCAA TACTTCTTTA ATTGGCAAGC CCTACGTTCC TATAGATAAA 180

CTTTTTGCTT TTAATTGTAC ATTACCATTC GCCATCACGT AAGGATCAAA ATATAAGTAA 240

AAAGGAATGT CATGCCCTAA CACCTGAAAT GTTCCATTTA ATAAAGCATC ATTTTCTAAA 300
T A HZilTTTfiT
1 r\J\f\r\ 1 1 i,ril 


ATTTP ATPTP 


Pf^AACCTTTT 


TGAAAGTCGG 


CTAAATAAAA 


GTCAATCAGT 


360 


TTPTTP APTT 
Hull bnb l x 


PTTGTTTTTT 

OX 1 U 1 i 1 1 1 X 


TGACTGAATG 


GTGACGACTG 


GCTCGCCTTC _ 


TTTTTCAACA 


420 


Ml Ib^l xr\ 


TTTTTTTTRA 


ATCTGGTTCG 


CGAACTTGTG 


TTGCTCGAAA 


TGTCACGAAA 


480 


c r"r a p a pt & p 

bL Jl rtbrtt 1 nt 


ATPATT AA 
bnn Jl ort 1 1 nn 


APPCACTAGA 


ACAAGAAAAG 


CGATTTTCCA 


GGGATTTCGT 


540 


TiAAbAl Ibb 


1 1 uu 1 luibi 


PTTTTTAATT 


GATTTACTTG 


TTTTTGGCTT 


TTCTTCATTC 


600 


n m rnrn rp T\ rp rp /"* t\ 

ATTTTAI 1 LA 


t 1 Att 1 it lb 


ATTTPiP,TT AT 
nl 1 1 llni 


CCATTCTTTC 


TCTGTCTTGA 


CCATTTCATC 


660 


^* a/^a tv t\ rp 

CCGGACAGCT 




TTTPCT APPP 
111 oo 1 rio^-^ 


AAPATTATTP 


GGATGGAAGC 


GATCTTCTTC 

sj^l X Vip-* X X \^ X X w 


720 


ATATAACAAA 


1 1 b 1 1 1 AAA 1 


tt 1 t 1 1 1 tt 1 


APPAPTAPTT 


PPTGTAGTTT 


CTGAGTCACC 


780 


ACCAGTTACG 


CCAAb 1 I bAI 


tbttAtbHtt 


TTTATAPAPT 


AAATPATTPA 

nnn 1 v— n i. X on 


TAGGAATGAA 

a nvjunn a \jnn 


840 


ATAGGCTCGT 


rp rp rn rp /•■• /"* rp rp rp 

TTT Ibbib 1 1 


bAAbtHl lit 


TTPPPTTPPw 


TPATTPPAAT 


TATPAAPGAT 


900 


m m m n*i r^ 7\ T* rp 

TTCTTGCATT 


i b 1 b 1 AA1 1 1 


PPPA A A AP.TT 


T A APT APAAP 


PPATTGTAAA 

uun x x vj a nnn 


TTCCTAAGAC 

x x V- * V— * x no\jnt> 


960 


ATAAATAGGG 


GCb i 1 b Ibb 1 


TPT APTPTPP. 
1 b 1 At 1 t 1 tt 


PATTTPTTPA 
\jr\ ± x lbl l brt 


APTAPTPPTP 
no i no x oo x 


TTAPTPGGCG 


1020 


TTGATAGGC1 


111 Lb 1 bbAb 


P ATTPA A PP. A 


PPTPAPTTTP 

VJb 1 Vflb 111b 


AAPTPPAAAA 

/wvj x \-.^jnnnn 


TATTACTACT 

x x x r*v x iiv> x 


1080 


AATTACTTTC 


All AAb 1 b A 1 


TTPP APPA AP 
1 1 ttnttnnL 


PPTTAPPPTA 
b\j l l nubvj l n 


ATPAPGTCCG 


CAGAGGCAAG 


1140 


GCCCTTTTGA 


A I 1 I b Abb 1 1 


1 t 1 t 1 1 1 bHl 


TPPTTTTAA A 
1 b w 1 1 1 1 nnn 


ATTTPATPAP 

n 111 On x v— n^_/ 


TACGATCCCC 


1200 


ATTTTTCCCA 


AAAl lb lb 1 b 


111 unntfttt 


PTTPAAPTTP 


TAPTPTTPTT 
x nvj i vj i i x 


TTAAATCATC 

x x ruui x x w 


1260 


Mri ^ ^ TV r™ 1 TV TV <T> TV 

T G C C AC AAT A 


bbAAbAAAAb 


PTPP APT ATT 
tlttnt lnl 1 


PPTPAAATPP 
bo l brvvn l bu 


PP AATTPPTT 

Uvnn a i a a 


CAGTTAAAGA 


1320 


Tv m z'"* 1 TV /** TV rn TV 

ATCACCGATA 


bb 1 b 1 Al AA1 


PA ATP.APTTP 
unn 1 oHt 1 lb 


TTTTTPATTT 

11111 Urt X 1 i. 


TTTTGAGCAG 

x x x x urwjwovw 


TTGTGGCCAC 


1380 


TTTLI t I Ibb 


ill AA 1 Ab 1 b 


tt 111 ill 


f^PPPATGGPP 


ACACTTAACA 


ATGTaAaAAC 


1440 


ACCTAbtbLA 


A 1 b AbbA 1 Ab 


PTPTTA APAP 


PPTPAPPAAA 

x wnuv^nnn 


ATATGCTGTG 


TTTGTTTTTT 


1500 


tAl b 1 1 Lu Jl b 


Mb 1 1 tt 1111 


AAATAAAPPA 


ACAAGAAATT 


TTCCTTTTCT 


TGTTAGAAAG 


1560 


TV z^* 1 rp rn /~* /-» /-» rp 

AGGTTGCG I L 


1 Ab Ibb 1 1 1 A 


APAPTPAPAP 


PPAAPCTCGT 


TATTTPTTAG 

X ♦* XXX \*r X X 4iW 


TCTGTATAGT 


1620 


AC AT AA I Abb 


AAAbbbAl i 1 


TTr-PTPTATP 
1 i tt loiniu 


TPiTTGCAATG 


ACTGGaTTCG 


TATGTAATAC 


1680 


GGbaAl Albb 


Aiblbl 1 1 & A 


APATTPPTTP 
nun 1 1 Ob 1 1 U 


TAATPCTTCT 


TtGAATCCAT 


TtGCTAGTTC 


1740 


TAbbbbAl Lb 


gb A 1 bAbArtn 


TPPP A ATTTP. 
1 utbrvl 1 1 1 \J 


TPPiAAPATTT 


GGAATCTTAC 


TTAATTCTGA 


1800 


fprprp/"*7\ TV T 1 T 1 r™* ^* 

111 bAA 1 i bb 


TP A A APPATT 


TATTAAAPPiT 
1 X X nrtrtbU J. 


TTTAACGCCT 


CGGCCTTTCG 


CTACAGGAAT 


1860 


CAATTCCGTA 


TTTTCAAAb I 


bbA 1 bAbHrtb 


TTTPATATTA 
ill bn 1 n 1 1 n 


A A AATPTTTP 
rvrirtrt 1 O 1 1 X O 


ATAATAATCC 

r\ x nn x nn a 


1920 


TGTTGTACGG CTGATTCGTC CACCTTTAAC CAAATTATCC AATGTTGAAA TGCCAATGTA 1980

TAATTTAGTA TTTTGTTTGA CGCGTTCAAT TTCAGCTAAA ATTTCTGGAA CACnCGCTCC 2040

CGCTTGTGCC AATTTTGCTG CTTGAATGAC TTGGAAAGAC AAACCTTGGT CTGTAAAATC 2100

GCTATCAATC ACCGTTACTT TACTTGATGA TAAGTTGCTA GCTTGACGGG CCGCTTCAAn 2160

CCGTT 2165



(2) INFORMATION FOR SEQ ID NO: 550:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9797 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 



GGTGAAGCAA CCCTTCTGTT GCTCCACCAA CTCCTAACTA GACCGTTTCT CGATGCCTTA 60

GATTATTTTG ATTCTTTTTC GCCTGCTTCT AAGTAGCTGA CATCATTGAC TTTATCCAAA 120

GTATATTCGC CATTTTTGTA ATGAATTGTT GTGACACTAG CATTCTTCAA ACCGCCTTCT 180

GGGACTTTAA AATCATCAAA TAAAGTTGCT AACAACGCTG AAATACTCAA GCCATGAGAG 240

ACCACTAAAA CATTGCCATT CCCAGAATTG GCTGATTCTG TGGCAACAAT TTTATCTAAG 300

CCTTTTTTCA AACGTTTAGT AATTGTAGCA TAGTCTTCTG CAGGCCAGTT ATTCTTGCTT 360

TCCTCGCGCT GTTGATCCAG TTTAGCTACA CTATTGGCAA AGGATTCAGG AGTCATGTTT 420

TTCATAAATT CTTCTAAGGA AACACCTTGA TCATCAGCAA TATCCTGCCA CATTGTCTTA 480

TTTAAATCCC CTTCATAGCT ACCAAAATTA AATTCACGTA AATCTGGGTC ACGCACGACT 540

TCAAGGTCTT TGCCTGCTTT ATTTTGATCT AAAATAAGTT GAGCAGTTTG CAAGGCGCGG 600

CCACTATCAC TACTATATGC ATTTTGAAAG GCAACATCTT TCAGTCCAAT CCCAGTTGCT 660

GTCACAACTT TTTCACCTTC TGGTGTTAGG ACCGCATCTG ACCATCCTTG TACGCGGTCC 720

GTCGTATTTA ACATGGTTTT TCCGTGGCGC ACAATGTAAA GAGTTAGTTC TTCAGGTTTT 780

GTCTCtTTTT CTTTTGTCGT TGCTTGCTGA TTTTTACCAC AACCAACCAT TACTAAAATG 840

GCCAGAAAAA GCATCCCCAC AATCGTCAAT CGTTTTTTCA TTTTTTATTT CCCCTCATCT 900

TCTACTCTGT TGTTAAGTTT TCACCATTAC TAGCGATAAC TTGTTTGTAC CAATTAAATG 960

ATTTTTTCTT AGTTCGTTTC AAGGTTCCCT GACCTTGGTC ATCCATGTCT ACATAGATAA 1020

AGCCGTAACG TTTTTTCATT TCGCCAGTCC CTGCTGAAAC TAAGTCAATA CAACCCCATG 1080

GCGTATAACC AATTAAGTTA ACCCCGTCTT CTTCTATTGC GGTTCTCATG GCTTCAATAT 1140

GTTGTTTAAA ATAATCAATG CGGTAATCGT CGACAATCGA GCCATCCTCT TGCAATGTAT 1200

CCACCGCGCC AAACCCGTTT TCAACCACAA ACAATGGcAT ATTGTAACGA TCTGACAACT 1260

GACACAATGA AATTCTTAAC CCTAAAGGAT CGACAGCCCA GCCCCATTCA GATTCTTGCA 1320

AATAAGGATT TTTAACCGCA GGCATTTGGT TGCCTCCGAC ATATTTTACT TCTTCGGGAT 1380

AAGCAGTCGA AACAGTCGAC ATATAATAAC TAAACCCGAT ATAATCAACT GTTCCTTTTT 1440

GAATAATCGC TAAATCATCT GACTCCATTT GAATTGTCAT TCCTTCACGC TCAAATTCTT 1500

TGAGCTTATG CGCTGGATAA TAGCCCTTAG CCTGGACATC CATATAAAAT TCTTGCTCTC 1560

GGTTAAATTG AACAGCTGCT ATCACGTCCT CAGGTTTGCA GGACAAAGGA TAACTTGGAA 1620

TATAAGCCAC CATCATACCA ATTTGGAAAT CTGGATTGAT GGAATGTCCT AATTCAACTG 1680

CTTTCGCACT TGCTACAAAT AGATGATGCG CTGCTTGATA TTTACTTTGT TTATCATTAT 1740

TGTGAATCCC AATTTGCGTC CAACTTCTTA AAAAATTAAT TTCATTAAAA GTCATCCAAT 1800


ACTTCACTTT ATCCTGATAA CGTGTAAAAA TGGTTTCGCA AAAGAAAACA AAGAAATCAA 1860

TCACTTCACG ATTTTTCCAA CCATCATATT CATCGGCCAA ATACATCGGA ATATCAAAAT 1920

GGTTAATCGT CACAACGGGC TCAATTCCAT GGCGCAGCAA AGCATCAAAA ACTTTATCAT 1980

AAAAGGCTAA GCCTTCTTCA TTAATCTCTT TTGTTCCATT TGGGCAAATC CGGCTCCAGG 2040

CGATAGACAG CCGAAAACAT TTGAAACCCA TTTCTGCAAA TAAGGCGATA TCTTCTTCGT 2100

AATGATGGTA AAAGTCCGTC GCTACATGAC TAGGGTAGTA TAATTCTTCA TCGATATAGC 2160

CAATCGCTCC TTCGGGTAAT GCTTGATCGC GCGTACACGA AGCTTTTTGG CCTTCTTTAG 2220

TTAAATACGT AATCCGTCGC GGTTCTGTTT GACTACCACC AGTAATTGCA TCTAATGTAC 2280

TCAGACCTTT GCCTCCTGAA AGgTAGCCAC CTTCATATTG GTTAGCAGCT GTTGCGCCGC 2340

CCCATAAAAA ATTTTCTGGA AATCCCATAA AATCCCTCTT TCCCTCTGTT TTTAGGCATC 2400

ACAAAGAAAC CATCAAGGGA ATTGACGCTG GCAGAAGAGT CCTTGTTTTT TCTACCAACG 2460

TCAAACGCCC TTTAATAGTT GCTAATTTTA TTAATTACTT AAATCTTCAC CATTTGTTGC 2520

GATTACTTTT TTGTACCAAT CAAACGATTT CTTCTTGCTT CGTTTTAACG TACCATTTCC 2580

TTCGTTATCG CGATCCACAT AGATAAAGCC ATAGCGTTTT TTCATTTCAC CTGTTCCAGC 2640

TGATACTAAA TCAATACAGC CCCAAGTCGT ATAGCCTAAC AAGTCGACAC CATCTTGTTC 2700

AACAGCATCT TTCATCGCTT GAATATGCGC CGCCAAATAA TCGATACGAT AATCATCGAC 2760

TACATCACCA TTTTCGTCTG GTGTGTCTAC CGCTCCTAAG CCATTTTCAA CAATGAATAG 2820

TGGTTTTTGA TAACGATCAT ATAAGTCATT CATCGTAATT CTCAAACCAA GCGGATCAAT 2880

TTGCCAACCC CATTCACTGG CTTTTAAATA AGGATTTTTC ACAGAGGCAA AGATGTTACC 2940

AGCTGTTTGT TCATTAATTT CTGGATCTGT TGATGTGACA CGAGAAGAAT AATAAGAGAA 3000

CGAAATGAAG TCCACCGTAT GTTCTTTTAA AAGTTCTTCG TCGCCTTCTT CCATTTCAAT 3060

TTGAATACCT TTACGGGCCA TTTCTTTTAA AGCATACGCA GGGTATTCCC CACGTGATTG 3120

GACATCAATA AAGAAATAAT TTTCACGATC TGCTTGTCGA GCAGCGAAAA CATCTTCTGG 3180

TTTACAAGTA TAAGCGTAGT TGGAACCAGC CGCTAACATA CAACCTACTT GGTTTTCAGG 3240

ATCTACCTCA TGAGCAATTT TTGTCGCAAT TGCACTTGCT AGTAGTTCAT GATGGGCCGC 3300

TTGGTATTTC ACTTGTTCTT TGTTTTCGCC TTCTTCAAAG TATAGACCTG CTCCCATAAA 3360

TGGTGCATGT AAAATCATAT TAATTTCGTT AAATGTTAAC CAATATTTGA CTAATCCTTT 3420

ATATCGGtTG AAAATTACAC GGCAAAGATT TTCATAGAAA CCTACTAACT TACGACTACG 3480

CCAAGCGCCA TATTCTTCCA CTAAGTGCAT TGGGcAGTCA AAGTGTGTAA TCGTTACTAG 3540

TGGCTCAATC CCATATTTAT GACATTCTTT GAATAAGTCT TCATAGAACT TCAAGCCTTC 3600

TTCATTCGGC TCTGTTTCAT CTCCCATTGG AAAAATTCGA CTCCAAGCAA TGGATAAACG 3660

GTATGTTTTA AAACCCATTT CACCAAACAA GGCGATATCT TCTTTATAGC GATGATACAT 3720

ATCAATCGCC TCTTGTGCAG GATAGAAATG TTCCTCATCA AAGTTAAACA TTTTCTTTTC 3780


ACCAGTGATT ACTGGAAAAC GATCTGGACC AGTTGGCGCT AAATCGACGT TTGCCAAACC 3840

ACGTCCGCCT TCGTTATAGC CACCTTCACA TTGATTGGCT GCTGTTGCGC CACCCCATAA 3900

AAAGTCTTTT CTAAATGCCA TAAATTCTCT TCCTCCACAT CTTTTTATAT TAAAGCGGTC 3960

AGCAAATCAT CACTTGTTGA AACTTCATTA CTGCCGACTT CCAAAATATC TAAATAATCC 4020

GCTGTATTCG TCACAACGAT TGGAATTTGT GTACTAAACC CAGCTTTTTT AATTCCTTCT 4080

AAATCAAAGG TTACTAATGT TTGACCTTTT TTCACGCGGT CACCTTGTTT TACAAAAGCT 4140

TCAAAGCCTT CGCCTTCTAA TTGAACTGTA TCAATCCCAA TATGAATCAA TAATTCTGTG 4200

CCGTTATCTG AAATCAAACC TAATGCATGT TTTGTTGGGA ATAGTGTGAC AATCGTTCCA 4260

TCAAAGGGTG CAACCACTTT ACCTTCCGTT GGTTCAATCA CGACCCCATT TCCTAATGCT 4320

CCATTTGCAA AAGCAGGATC CTCTGCATTT TTTAAAGATA ACACTCGTCC TTTGACTGGA 4380

CTTGTAATAT TTTCTTTTTT AATCGTTGTT TTATCAATAA TCACTTCTTC TTCCTCAACT 4440

GTATTGTCTT TCCAGAAGAA GAATGTTAGA CCAAAACCAA CGACCGCTGC AATGGCAATT 4500

GCAATAAAGG ACTGGATCAT CCCACTTGCA TCATCACCAT TGATAAAGTT TAGCACACCA 4560

AAAATTCCTA AACCACCCAT TGTATAAGCT GTCACATTGT TGATCATCAA ATATAAACCA 4620

CCGACAGCGC CACCAATCAT TGAATAAATA AATGGCCATT TTTTTGGTAA AGTGATTCCA 4680

TAAATTGCTG GCTCTGTTAC ACCAAAAATA CCAGAGATAA TTGCTGGCGG ACACAAAGCT 4740

TTTAACTTTT TATCTTTCAA TTTGAAGAAC ATCGCTAAAA CAACCGCTGT TTGCGCAAAA 4800

CTAGCTGCAA ATGATCCCGT TAATACTTGG CTTGAACCTT CTTGTGTCAC TTGCATAATA 4860

GCTAATGGCA CAACGCTCCA ATGTAAACCA AAAATAACTA AGACTTGCCA GAAGAAACCA 4920

AGAATCAAAC CATACAAAGC TGGTGAGAAA CTCATTAATG CTGTAAAGCC AGCGCTTAAT 4980

AAATCTGTTA GCATGCTAAC GATTGGCCCA ATTACTAAGA AACCAATTGG TAAGGCAATC 5040

AACAAGACGA AAAACGGTAC TAAGAACGTT TGAACAACTT CTGGAATAAT ACGTTTAAAC 5100

ACTTTTTGAA CTTGTGCCGC AAAAGCAATA ATGAAGATGA TTGGTACGAC ACTACTTGTA 5160

TAGTTCGCAC CGACCCAAGG AATCCCCATA AATGTATTAT AGGCAGGTAA ACCGAATAAA 5220

TTATACGGTG CTGCGGCTCC GGCACCCGCT GTTGTTTCAA AGGCCGTTTG TAATGCACTT 5280

CCTTGAATCG TTGGGTAACA TAAAGctGCG CCAATCACGA TCCCTACCAT TGGATGCAAC 5340

CGGAATTTTT TAGCTGCTGT ATAACCTAAA ATAACTGGCA TAAAGTAGAA AATTGCATCG 5400

CCAATCCCAT TTAACATGGT ATAAGTCCCT GACGTAGCCG TATAGAGCTT TAAGAAAACC 5460

AATAAGGCAT TTAACCCTTT GACCATCCCA GCCGCTGctA ACGCCCCTAA AAAGGGTTGG 5520

AAACAACCAC TTAAAATATC TATTAAGCGA TCAAATAAAT TCCCACTGGA AGCTTCTTCT 5580

TCACGTTCTC CTGACAAGCC TGCAATACTA ACTACTTCTT CGTAAACGGC TGGAACATGG 5640

TTCCCAATAA CGACTTGGTA TTGTCCACCG CTTTTCATAA CAGTTACCAC ACCGTCCATA 5700

TTTTTTAAGA CATCATCATT CGCTTGACTC TCATCTTTTA ATTTGAAGCG AAGACGTGTG 5760


ATACAATGTG TCAAACTATT AATATTCTCT TGCCCACCGA CATTTTTTAC AATTTTTTCT 5820

GCTAGTTCAT GATATTTTCC CATCTTTTTT TCCTCCAAAA ATTTATTTTC ATTTGAAGGT 5880

TTAGCCAACT GAATGTCACT ATCCAAAGTG GTGCGTTTAT TGTTCCGTTG TTTGTACAAT 5940

TCGTGCAATG TGAATCGTTA GGTACAAGCG TTCTTCATTT GAAACTTGAT AGTCATAGTT 6000

CTCATTTAAA TACAAGCTGA TCTTTTCCAC GCAACGATAG GCGTTGAGAT ATTTCTTTTT 6060

GATTACCGCA TACAGTTCTT CATCATCATT TTCGACAACT TCACGTTGGT GTAATAAGCG 6120

ATAACAGAAA AATTTTAATG TGTGCTAAAT CGATAAAAGT AGACAGAGTC TTCATCAAAA 6180

CTTACCTTAA AATAGTATTT GACGATATTT GTAATCTCTT GCATCAACTT TGTCAATTCG 6240

TAAATATTAC CGACTTCTTC CTCCATCTCC GCATTTACCA AGTGCAATGC GATAAAACCT 6300

GCCTCATCCG TTGACAATTC CATTCCTACG CGCTTTTGTA CCATTTTTAA AGCATTTACT 6360

CCTACTTCGT ACTCTTCAGG AAAAAAACGT TTAATATCCC ATAATAATAA ATTCTTAATT 6420

GCAATTCCTT GTTTTTTTCT TTCTAAAGCT GTGTACAAAT GATCCGTCAG AGAAATATAA 6480

ATGGTATCAT TCAAACTCTT TTTCAACGTT GATTTTGCAT AAACAATAAT GTCATCCGAG 6540

AGTTTCATCA TCTCTAAAGG AATATTTCCT AACAATTCTT GAAACTTAAG TGATGTTTCT 6600

TGATTGGCTA AATAAAAAAC TTGATCGATA CTTTCATCAG GAACCCAATC CCCAATGCGT 6660

TTCTTAAAAG CAATTCCTTT GCCCATAACA ATTTGCTCTC GTTTATCATC ATCTAATGTA 6720

ATAACCACAT TATTGTTTAA AATTTTGTGA ATAATCAATG CCCCACCTCC TTTGTGTTTC 6780

GTTGAAAAAA GAAAAACCCA ATTTCTTCAA AAAGAACGAC ACACCTGTCG CTTGATGAAA 6840

AAATTAGGTT TAGCTTGTCT TTAAACAATC ACTATCCATC TATTTACTTA tCCCtACaTC 6900

CaAAGAATAA CATATTTACA ACAAATGTCa ACGCTTTCAA CGACATTTTT TTGTAATAAA 6960

AATAAAAAAA GCATACAAAC ATTGTTTGTA TGCTTCACTA TGaCCCGTAC GGGACTCGAA 7020

CCCGTGTTAC CGCCGTGaAA GGGCGGTGTC TTAACCGCTT GACCAACGGG CCATTAAAAA 7080

GAAAAAACGG AGAAGGAGGG ATTTGAACCC TCGCGCCGGT TTCCCGACCT ACACCCTTAG 7140

CAGGGGCGCC TCTTCAGCCA CTTGAGTACT TCCCCAAGAA ATAAATGGGC CTAAATGGAC 7200

TCGAACCATC GACCTCACGC TTATCAGGCG TGCGCTCTAA CCAGCTGAGC TATAGGCCCC 7260

AAATAAGAAA AAAGCGGGTG ACGAGAATCG AACTCGCGAC AACAGCTTGG AAGGCTGTGG 7320

TTTTACCACT AAACTACACC CGCAAATTTT TCTTGGATGG CGCGGGACAG AATCGAACTG 7380

CCGACACATG GAGCTTCAAT CCATTGCTCT ACCAACTGAG CTACCGAGCC AAAAACGGTC 7440

TGGACGGGAC TCGAACCCGC GACCTCCTGC GTGACAGGCA GGCATTCTAA CCAGCTGAAC 7500

TACCAAACCA ATTCGTTTTT GCTTTATGCA AATGTATGCT TTAAAAAAfA AAAGCAATTG 7560

CGGGGGCAGG ATTTGAACCT ACGACcTTCG GGTTATGAGC CCGACGAGCT ACCTGACTGC 7620

TCCACCCCGC GATAATTAGA CCAAGGAAAC TTAATTCCTA AGAAAGGAGG ATAAGGGATT 7680

CGAACCCTTG CACGGTTTTA CCCGCCTGAC GGTTTTCAAG ACCGTTCCCT TCAGCCGGAC 7740


TTGGGTAATC CTCCGTAACA AAACTATGAC CCGTACGGGA CTCGAACCCG TGTTACCGCC 7800

GTGAAAGGGC GGTGTCTTAA CCGCTTGACC AACGGGCCAT TTTATAAAAT TATGGGCCTA 7860

AATGGACTCG AACCATCGAC CTCACGCTTA TCAGGCGTGC GCTCTAACCA GCTGAGCTAT 7920

AGGCCCAAAA AGAAAAGCGG GTGACGAGAA TCGAACTCGC GACAACAGCT TGGAAGGCTG 7980

TGGTTTTACC ACTAAACTAC ACCCGCATGG CGGTCTGGAC GGGACTCGAA CCCGCGACCT 8040

CCTGCGTGAC AGGCAGGCAT TCTAACCAGC TGAACTACCA AACCATACTT TTCTTTTTAT 8100

GTGCTTTTGA AATCAAAAGC AATTGCGGGG GCAGGATTTG AACCTACGAC CTTCGGGTTA 8160

TGAGCCCGAC GAGCTACCTG ACTGCTCCAC CCCGCGATAA TTAGACCAAG GAAATTTAAT 8220

TCCTAAGAAA GGAGGATAAG GGATTCGAAC CCTTGCACGG TTTTACCCGC CTGACGGTTT 8280

TCAAGACCGT TCCCTTCAGC CGGACTTGGG TAATCCTCCA TGATAAAAAT AAAACTATAA 8340

ACAACGTCAC AATGGACCTT GTAGGACTCG AACCTACGAC CGGACGGTTA TGAGCCGTCT 8400

GCTCTAACCA ACTGAGCTAA AGGTCCAGGC TCTTCCAAAT GTATAATCGC GGCGGAGGGG 8460

ATCGAACCCC CGACCTCCCG GGTATGAACC GGACGCTCTA GCCAGCTGAG CTACACCGCG 8520

AAACTCTTAT ATTAAAATAA GAATGGAGCC TAGCGGGATC GAACCGCTGA CCTCCTGCGT 8580

GCAAAGCAGG CGCTCTCCCA GCTGAGCTAA GGCCCCATAA AAATTATTAT CGGGAAGACA 8640

GGATTCGAAC CTGCGACCCC TTGGTCCCAA ACCAAGTGCT CTACCAAGCT GAGCTACTTC 8700

CCGTATATAA CTAATGCACC CAAAGGGAGT CGAACCCCTA ACCTTTTGAT TCGTAGTCAA 8760

ACACTCTATC CAGTTGAGCT ATGGGTGCTA AATATAAAAC AATGCCGAGG ACCGGAATCG 8820

AACCGGTACG GTGATCACTC ACCGCAGGAT TTTAAGTCCT GTGCGTCTGC CAGTTCCGCC 8880

ACCCCGGCGT AATGAATTGG CTAAGCGGAA AACGGGGTTC GAACCCGCGA CCCCCACCTT 8940

GGCAAGGTGG TGCTCTACCA CTGAGCTATT TCCGCAAAAT GGTGCCGGCT AAAGGACTTG 9000

AACCCTCGAC CCTCTGATTA CAAATCAGAT GCTCTACCAA CTGAGCTAAG CCGGCTAAAT 9060

ATGCGGGTGA AGGGACTTGA ACCCCcCAnC CTTGCGGCGC TAGATCCTAA ATCTAGTGCG 9120

TCTGCCAATT CCGCCACACC CGCAAATGAA ATATGAATAT GAGCCGTACA GGGCTCGAAC 9180

CTGTGACCCT CTGATTAAAA GTCAGATGCT CTACCAACTG AGCTAACGGC TCAAAAAATG 9240

GAGGTTAACG GGATCGAACC GCTGACCCCC TGCTTGTAAG GCAGGTGCTC TCCCAGCTGA 9300

GCTAAACCTC CAAAAACAAC AATTCGCGTG GCGACGTCCT ACTCTCACAA AGGGAAACCC 9360

TTCACTACAA TCGGCGCTAA GAAGCTTAAC TTCTGTGTTC GGCATGGGAA CAGGTGTATC 9420

CTTCTCGctA TCGCCACCAC ACTGGGTGTT GTTTCTTATT GAGTTGAATC TTCATTCACT 9480

CAAAACTGGA TTGAAGTTTG AATCAAAATA ACCAAGTTGC TTTTACTTAT CCATTCTTTG 9540

GTTAAGTCCT CGACCGATTA GTATTGGTCC GCTCCAACTA TCACTAGCCT TCCACTTCCA 9600

ACCTATCTAC CTAATCATCT CTCAGGGGTC TTACTTTCTT AAAGAAATGG GAAATCTCAT 9660

CTTGAGGTGG GCTTCACACT TAGATGCTTT CAGCGTTTAT CCCTTCCCTA CATAGCTACC 9720

CAGCAATGCC CTTGGCAGAA CAACTGGTAC ACCAGCGGTA AGTCCATCCC GGTCCTCTCG 9780 

TACTAAGGAC AGCTCCT 9797 
(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1807 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 

CCTCTTTCGT ATATGTnnCT GTTGTAACGT AATCTCTTTT CAkGATGTGT TCyTCcTAAA 60 

AGTTGTTTTT GTTTACACAT TGACTTTACC AAGTTTAGCG AAGAGTTTAC TATCCACCAA 120 

AAGTTAGAAA ACGCTTTCTT TTTCGGCGTA ATTTTTCATT AAAAAAAGTT TATTTATGGT 180 

TATAAACCGT TGGTACACCA GGGGTTTATA AAAGTTTAGG CGGATTTCTC TCTAATTTCT 240

GACCAATCAT AGATTTGAAA ATGGTATAAT TATTTTGAAA AATCATTCAC TTAAAATAAA 300

AGATGAAAGG GGTTTCCAAT CGTGACATTT GTTTTTATTT ATAACATCTT GTTGATTATT 360 

TTATATTCGT TTACTTCAAC CTTTACCTTA AATCTTTACT TGAAAAATAA ACAACCGATT 420

TTTTTATTGC TTCTCTTTTT AATGGTTATT TTTATTTGTG ACAATGTGAT TGTTTATATG 480

ACTGAGTTCA TTAATTCGTT TGcAACCGaa TATaATCaGa CCTTTATGAC CGCACCTTTT 540

TTAAAgACCA TCATTTTTAy TTGkTGtAAT TTTGCCTAnC TGGCTATTAT CAATACTATT 600 

AGTGGTCGTC CTTTTAAAAA TTATCAATTT GTCTGGTTAT TTTTAATTGG TTTATGGATG 660 

CTAGCTATTC CATTTTCACA AAATTCCGCC TTAAAAGTTT GGTTGTACTA TTTACCCAAT 720 

CAGCTATTTT TAATTTACCT TGGATGCTAT GCGCTCTATC AACTACGAAT AGATCCTTTA 780 

TCAGCCCTTG CGAAAAAATA TCTTCGTTTT ATTGGTTGGC TAAGCATTGG TTTCGGTGTG 840

GCGATTCTTT TGGAAGATAC CtTCGTTATT TTTAACATTG ACCAATATTC TGATATTGTC 900 

TTTAAAATTA ACAACCGAAA CGTTTCTGAA GATATTTATA CAATCATTCT CTCCATCGCC 960 

ATTATCTACT TCTGTAATCG TGATTTTCCT CTGTCGGTTC TTGAAAAAGA CGCTGCTAAA 1020 

TTGGAAGAAA ATCAATCAGA TGAACCCGTA CTTTTGGCGC CTTTTTGTGA TGCCTATCAA 1080 

TTAACGCAGC GAGAACGAGA AGTGCTTTCG TTACTTTTAG AGTGCAAAAC AAATCAAGAT 1140

ATAGCCAACG AACTATTCCT ATCAATCGGG ACAGTAAAAA CACATATCCA CAATATTTTC 1200 

GTTAAATTAG AAGTAAACAA AAGAGCAGAA GTTTTTGTCA GTTATCAACT TTTTTCTCAA 1260

CAACAAACTG AACATTTAGC GAGATAATTA AAAAGGAAGC CTCTCCGTGA TCGGTGGCTT 1320

CCTTTTTGAT TAACTATTTT CTTGTACTTG TCGAACTCTC TTTTCAATTT CTTGAAAGTT 1380 

AATAATTTTA GCTTGGCGAT GAATTTCTTT TCCTTCATAG TATAAGATAA TGGTCGGTGC 1440

AGAAAAAGCA TTTAATTGTC CAATTATTTC TGGACTTTTA TCAGCTTCTG CAATGTATGT 1500

AGGAAATTCA TATTTTGTTG CAATTTTTTC GACAATCGGT TTATCAACTG TACAAACACT 1560

ACAATTTTTT GCTGTCAAAA AGACGGCTTG TAGACGATtG TCTGTAATTT TTTCTTCTAA 1620 

TcyTCtAATG TTGTtACATA TTCCaCTGaA TAGCCCTCCT TGTCyTTTGt CyAAAATAAC 1680 

AGATTTACyA GCAAAATAAC AACTTAtTGt TTGGcyAGGa AkGCyTTAAC aAAAaAGTTG 1740

tCGcCCaTTT TAtaAAkGGG GsGaCAACTT TTTACTGGAT AACCGCTTAT TAAnTAAAAA 1800 

ATTTAnG 1807 
(2) INFORMATION FOR SEQ ID NO: 552: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 

TGAAGTCTGG TTCACCATTG TTTGAACAAA TTGATAAATC CATCGACAAG AAAGTGAATT 60 

TATTCTCAAA CAGCTTTTCC CGCTACGCTG TTCGTGCAAT GCTTGCTTGT CTATTTTTAA 120 

CGTTGGGAAC AGCCGTTGCT TTTGCAATCG CAATTAAAGG AGAAGGAATT TCTCATAGTT 180 
TAGGGAAAAT GCTTTACGCC TTCATGTTCA GTTGGtCACT TGTGaTGATT TTATATATGA 240

ATGCCGAACT TGGtACCTCG AATATGTTGT ACATGACTGT TGGGGTCTAT CGTAAAAAAG 300 

TCAACTTTTC CTTAgCTGCA AAAATTTTAT TTACCTGTAT CTTATTCAAT TTAATTGGTG 360 

GCGTCTTGTT TGGCTTCTTG GTATCACTTA CGGTGCCGTT CCAAGATTTG CCAAAAGATA 420

GTTTCTTCTT TACATCGATT GCTGGAAAAT TAGAAAAAAC CACCCTACAG ATTTTAGTTG 480 

AAGCTATGTT CGCCAATATT GTAGTTAATA CAGCCGTATT AGTGAGTATG CGAATGAAAG 540

ATGACGCCGG CAAAGTCGCG GCCATTATTT TCATTATCTT TATCTTTGCG TTCCTTGGTT 600 

TTGAACACGT GATTGCCAAC TTCCCAGCCT TAAGTTTAGC TTACTTTGCT TCGAACGGCG 660 

CAATTGAAGC TTTCACAGCT GGTAATGTTG TTCATAATCT GTTCTGGGCC TTTATCGGTA 720 

ACTTTATCGG TGGTGGCTTA ATTATGGGTC TCGGCTATGC cTGGTTAGAT AAAGACAcAA 780 

AAaCTTAACG TATTTkGATT AATTGACCAA CCAACTATAC TAAAACATGG ACACTGTTTT 840

AGTATAGTTT TTTTATnATA AG 862 
(2) INFORMATION FOR SEQ ID NO: 553: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553:

GAGGTTGTCG TTGTCCCATG ACACTCACTC CTCTATAAAT GAATnTATTA CaCTTATTAT 60

GTTACTACTA TTTTACAAGG TGTCAAATAT TTTCATCGTG CTTTCTGAAA ACCATTTCAA 120 

TGATAACTCC ATTAAAAACA ACGTATCACT TGATTGTCTC TTTTTTGGTG CAAAATTGTG 180 

CCTTTTTTGT ATAAATAAGT CCCTCGTTTT CCCACAAAAA AACCGCTACT TTTTCATCTA 240 

GTTTATAAAA AAGTAACAGT TATAAATTAT TTTTTCTTTT TCCCAATCGA GAGAATCCCT 300 

CTTTTTATGA CATCAAAGAA ACTCAAAGAT TGGACCATAT GATCTGGGTC AACATATTCA 360 

CGAATAgctC GTAACACTCG CkTCGGCkCT TTTGAAGCAA ACGTATACGT CCCATTTTTC 420

TTGGTTTGAA TCGCATAACG AGGaATCCAT TTTCCTTTGa ACaTTACCGG AA 472
(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1062 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554: 

TATAnTTTTC AGCAATCGTT TCAAATAACT TAAATAATGT CATGAAAATT TTAACATCCT 60 

TAACAATTGT CCTGACCATT CCGACCATCA TCGGAGGAAT TTACGGCATG AATGTGAAAC 120

TCCCTTTTGC AGAGCATGAA TATTCTTTTT GGATTATTTT TGCCATCACT ACCTTAATCT 180 

GCGTAATTAG TATTCGAATT TTAAAGAAAA AGAATCTCTT ATAACTTTAG GTTGCTGTTT 240 

TCAGGAAAAG GTTGTACACT ATTAGCAGAA ATTAAAAAGG ACGTGTCATT TATGAACGAC 300 

AAAGAACAAA GTCGCTTTTC AAAAATCACG AAAGTCGTGG TTTGGCTAAT GCTGATCGCA 360 

ATTGTAGGTT CAACTGTTTT AACAGCGATT CTTAGTCTTT AAACTGGAGA ACTTTTTCAC 420 

ACATTAAAAG TAAAAAAACA GCCCCTTCTA TCAATTGAAA GGGCTGTTTT TTTACTTGCT 480

CAAATAGAGC GGCTATTTTT TCGCCATCGC CAACAGCTCT TGTGCATGTT CAATGGTTAA 540 

TTTCGTAATT TCACTTCCTG CGAGCATACG CGCAATTTCG TTTACTCGCT CTTTTTCAGA 600 

TAAAATTCGG ACGCTTGTTT CTGTCCGCCC CGCCACAATT TCTTTTTCAA TAAAATAATG 660 

TTCATCGGCA ACGGCCGCCA CTTGTGGCAA GTGCGTGATA CACAACACCT GCGAATTTTC 720 

TGAAATTTGA TAAATTTTAT CGGCAATCGC CTGTGCTACT CGGCCACTAA CTCCTGTATC 780 

CACTTCATCA AAAACAATAC TAGTGATCCC TTGCGTTTGA GAAAAGATTG TTTTCATAGC 840

CAACATCACT CGCGAAAGTT CTCCGCCAGA AGCCACCCGA ACTAACGGTT TTAATGGTTC 900 

CCCTGGGTTA GTAGTAATAT AAAATTCTAC TCCGTCTAAG CCATTTTCTT GTAAATGCTC 960 

AAGTTCTGTA AAGCGGACTT CAAATTCAGT TCGCTCCAAA TATAGTTCTT TTAATTCGGT 1020 

TAAAATTTGT TGTTCGAGCT CTTTGGCTAG GCGCTTTCGT TC 1062

(2) INFORMATION FOR SEQ ID NO: 555:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1237 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555: 



TTCCTGATTG AAGCAAAGAG AGGTGTAGCG CTATGGAAAT GGAACATATT AATGAAAA 1 A 60

CGATTCGTGT GTTAATTGGC AACGAAGACC TGGCGGATAG AGGGATTACT TTTCTTGACT 120

TACTAGGAAA CCACAAAGAT GTTGAAAATT TCTTTTATAG TATTTTAGAA GAAGTTGATG 180

TCGAAGATGA GTTCCAAGGC AGTGAAGCAG TCACTTTCCA AGTCTTGCCT AAAAATGATG 240

GTTTAGAACT GTTTATTAGT AAAAATGTCG CTATGGACGA CTTGTCTTCC TTAGAAGGTC 300

TGAGCGAAGT CaATGCAGAT GTTAGCGAAT TGATTCGCAA ACAAATTGAA GCAGACAAAG 360

CAGCCGCCGA TGAACTTGAC GAGATGGAAG CCACAGATGA AACAAACAGA AATGTAATCT 420

TTGAATTAGA TAATTTTGAA GCGATGATTC AATTATCCAA AGAAGTCTTC ATGCAATCTG 480

TTTTGACTAA CTTATACACG TACAACGATC GTTATTACTT ACAAGTTTTG TTCTTAACCG 540

ATGAATTAGA AAAAACAAAT GTTGACAATG AAATTGCACA GATTTTAGAA TTTGCTCATA 600

AAACAACTGT CACTCAGGAT ACTTTAGTCG AATATGGCAC TTGTATTATG GAACGCAGTG 660

CATTGGAATT AACACGTTAT TACTTTAATG ATTAAGCAAA AAAGCAAAAA CTTTCGAbl 1 720

TcTTkTTkTT tGGaTGAAAA TTAAGAAACT TTGCGTAGTT 1 Pil 1 IA1 Vj ALj 7i C C T f* H H & A T 780

CATGTTAACA GCTTTAACGG CAGACAACCA AGAATTATTT TCTTTAATCG ATTGCTCGAT 840

TGATGAATTA ACACAGATAC GAAGCGAACA AGCTTTTGTG TGTCCAATGT GTCATCAATC 900

GGTCATTTTG aAGGCAGGCC CCATTAAAAT ACCCCATTTT GCTCATCGTA AAAAGAATAG 960

TTGTTGGtAC GAAGCAGAAG CTGAAACAGA AGAACACCTC CGCTTGAAGC AGCTTTTTGC 1020

GGAAAAGTGT TTACGCGAAA AGCTGTCCTT CCAAGTTGAG GCATACTTAC CCACCTTGAA 1080

GCAACGCCCA GATTTACTCA TTGGGGAAGA TAGCCATTGA AATTCCATGG TAGTCCATTG 1140

GCCATTAAAn CGTTTGGTAG AACCGGACGG AGACTTAATC AAACTCCATG GCTAATnGAG 1200

TGGTTTGGGA TTnTGGTGGA CCGATTAAGG CCTAnAG 1237


(2) INFORMATION FOR SEQ ID NO: 556: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1306 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556: 
GCTATTCAAG ATGAnTTAAG CGATACTAAA GACGAAATAC GTGAAGTGAC GAAACAAGCG 60

AGAGAAGAAA CAnAAGAGTA TAAAGAAATC TTAGAAAGTA AAATAAAAGA CCATAACAAA 120

GTGGTTGATA AATCGAATAC AGCGCTAAAA GTGATGACAA AAGGCGTTAC AAATATATTT 180

TTTGTGCTAA TTATTTTTGT ACTTGTAATG CTTGTGACAG GTCCGATAGG TCATTTCTTT 240

GGTATTGAAC ATTTATATAG CTTTATTAAT GGTTTTATTG ATGACCACGA AAGTGCATGG 300 

CGATATTTGA TGTTGATACT TTATGCAGTA CCCTATGTAT TCTTCGCTAT CATTCTGTGG 360 

CTCATAGCAT TGTTTTTTAG AGTATTTAAT GACCTTTAGG TAGATTTATA TATAAAAAGG 420

GCTAAATACG CAAAATACCC CCTTAAAATG CAAAATAAAT GCAAATAAGG GGGTAACTCC 480 

TTATGAAAAA ACAACTAATA GTATTATAAC ACGTTTTTAT GCCGAGAAAA CTTATTGGTT 540

GGAATGGGCT ATGTGTTAGC TAACTTGTTA GCGAGTTGGT TGGACTTGAA TTGGGATTAA 600 

TCCCAAGAAA GTACCAACTC AACAACACAT AAAGCCCTGT AGGTTCCGAC CAATAAGGAA 660 

ATTGGAATAA AGCAACGAAA GGAGTTGAGT AAAATGAGTG AGAAAACGAA GAAAATCATT 720 

CGAGTATCAT TAGCTTTTTC ACAAGTAGTA GTGCAATCAT TGATAGTTTA TTATACTTAC 780

CAGCAATATA AATTATCTAA AGAGAATTAA GTTTTGATTA TGTAATAAGT TTTGGTACTA 840 

AAAAAGACTT GATCTAATTA GACCAAGTCT TTTGTTAGTG TTATATTAAT AACAAAATAA 900 

AAATAAGAAG TCGCTCACTC CCCGACCAAA GTTTGTGAGC AAAATAAACA GTCGATGTTT 960 

GTTTCTTAAT TGTATATCTT GATATTAAAC GATTTAAGAT TATTCTTCAA GATATATATT 1020 

CGCAAATGTA ATTGGAAAAG CGACTTCTTA AACAAAATTA AGGAGTCGCT TTTTATGGCA 1080 

AAAATAACGr wAAATGAGCA AGAAAATTTG GTGGAAAATT ACACAAAAAA ACCGGCGTAC 1140

TCCAAACTAG TCAAGTCCAG ACTCCTGTGT AAAATGCTAT ACAATGTTTT TACCATTTnT 1200 

ACTTATCAAA ATTGATGTAT TTTCTTGGAG AATAAATCCA TTCATCATGT AGGTCCATAA 1260 

GAACGGCTCC AATTAAGCGA TTGGCTGATG TTTGATTGGG GAAGAT 1306 
(2) INFORMATION FOR SEQ ID NO: 557:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557: 

CTATTTAATC TTGCTCACGT TGTTTTCTCT TGGCTGCTCG TTCTGCGCGG nGGGCAGCCC 60 

TTCTTGCTTC CGCTTCCGGA TCAACTACGC GTTCTTTTTT CACTCGTGGC TTCTCGTTCT 120 

CTGGCGTATT TATTTCTGGT TCACTTTGTT TTTCAAGTTT TGCTCGAATT TTTTCTTCTT 180 

GACCAGCTAA ACGAGCATAA TTATAGAACC GTCTCTTAGC ATCATTAATT GTTTTATTAA 240 

ATAACAATTC CGCTGTATCT GGTTGACTGG ATTCTAAGGA CGCAAAGCGA ACTTGTTGGC 300 

GCATAAACTC TTTCATTAAT GAAAAATCTG GTTTTTTGAA ATCCAATATC ATTGGTTCTT 360

TGCCTTTTTC TCGCAATAAT GGATTATAGC GATATAAAGA CCAGTAACCT GAATGAACCG 420

CGTCTTTCGC TTCTTTCAAC GTTTGACTCA TCCCACCTGC TAAACCATGT GTAATACATG 480

GTGTaTAAGC


AA1 AAi AA1L 


p. a t p. cz t p p a p. 


nP,AATTTTTC 


TGCTTCTTCA 


AAAGCTTTAA 


540 


TCGTCTGCAT 


1 1 tju 111 bUn 


pptpapppa a 


TTTGAP.CGAC 


ATAGACATTT 


TCATAAGTCA 


600 


TTGCCATCAT 


CCX1 AAA1 Ul 


tpTTTTAPaPP. 

1111 IrWjriuu 


PATAPTTPCC 


ACTGGCAGCA 


AATTTCGCAA 


660 


TCGCTGATGC 


/"■/-•/■"■TV /**rprnpr , T' 

CGGAGT, IGC i 


T T C P 7A 21 P T T T 


PPPPAPPTPT 


ATTGGAATAG 


APTTPATTAT 


170 


CTAGAACTAA 


CATATTTACA 


TCLuL-tjCLAU 


TaPPTa APAP 


PTP,P.TPAATA 


PPPPPPTAPP 




CAATATCGTA 


AGCCCAGCCG 


TCTCLAL-LAA 


t c a t r* r* a p t P 
lOAlvA-AU 1 o 


APTTPP.TTT A 


APAZiAAAPaT 


0 ^ V 


CTTGATCGTC 


ATAAATTGCT 


TCCAATAAAG 


fTTPn r P r P/^ , /'~' l T 

bl IGA1 1L*(j1 


1111 Iwl 1 LA 


appa aappTP 




CACGTAATTT 


AGCTGCACGT 


TGTTGCGTAC 


CTTLAL 1 X 


taljAL-AoA 1 o 1 


cic f a tpp a a t 

uL 1 A 1 LLnn 1 


you 


CTTCCATCAA 


TAAACGTAAA 


GAATCACTGG 


C T AC AG AAAA 


AGC 1 i Ivjo 


A 1 1 1 1 AL. 1 A« 




CTAAACGTTC GCGACGCGCT TGCGTTGCTa ACAACATGCC GTAACCAAAC TCGGCATTAT 1080

CCTCTAAAAG AGAATTTGAC CAAGCTGGAC CTTGTCCTTG TTCATTAGTA GTGTAAGGTG 1140

TCACACCCGC TGCAGCACCC CAAATCGATG AGCAACCTGT TGCATTAGCA tCAGCATTCG 1200

GTCACCAAAC ATTTGCGTCA ATAATTTAAC ATAAGGCGTT TCGCcGCAAC CAGAACAAGC 1260

ACCAGAAA 1268



(2) INFORMATION FOR SEQ ID NO: 558: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 



GTTTTACCTT AAAACAAAGG GCAATCAGTG TTAAAACAAT AGTCATTAAA GGAGCAGCAC 60

CTAAATCAAT TAAAGGTTTA AAAATTGTAT TTGCTAGGTC AATAATCGTA TTCATTTATT 120

TTCTCCTTCA TAAGCTAAGA TAACAAAATT TTTATCTTAG TTTTGTTATT GTTGACAGTG 180

AAGCGTAAGT AATTAGTCTG AAAAGGAAAT CATTTGTTTA AAGAGTGAAT GATACTTTCC 240

ATTTGTTGAT AAACAGGTTC AGCCATTGCT GGAATTCGAT AAAGAATTGC GCCTGCATCA 300

ACTACAGGAA TCGTGATTGG GAAACCCAAA TCTGTTTTGG CGATTTGGGC GAAAATATCA 360

TAATGGGATA GCATCTCTTG ATTGATATCT TTAATCATTA CGGCATCGCA AGAAACTTGG 420

TAGCCTCTTT TAGATAATTC TGTTTCTAAT GCAGATTTGA TTTGATGGCT AGAGTTTACC 480

CCAGCCCCAC ATGCAGCTAA AATTTTGATC ATGTTAAATC TCTCCTTTAA AATTTTTTTG 540

TAAAAAACGA TAAACTTCTT CTTTATCTTC ACATTGAAAA AAAGCCAATA GTGATGAACG 600

GTCTGTCGTG TTAATAAAAT CCaTGATAGT CGCTAAAAGA CCCGTTTGTT CAATACCGTT 660

TTCATTTAAA ATCATGAAAA GAAAGGrAAC aGAGAGCGTT yCATCTGGGr TAATCATGtT 720

AtGAAATgAT AAGGATTGTT TcAATTTAAT nCGGGATAAT TCTTGTCGTC CGTACAAAGG 780

TACTTTCGGT ATGCGGAATG GCGATATTCG GTAGTGCAGG ATCAATCGGT GTTAAACTTA 840

AACCAGTCGG ATAGTTGTGT TCTCGTTCCA ATAAATTAGT TAGAAAATnC AGGTGTAACT 900 

AATTTCTTTT TTAACAAGTC TAAATAAACT TCTTCAAAAA CTTCTTCTTG TGTTTTTTTG 960 

TTTGAAATAT AGGTAGTATC TAATGAAAAA ATCGATGTTT TAATAACCAT AGAACCCCTC 1020 

GTTTCTTTAA TATATTATTA nTTATTTACA AGTATAATAT 1060 
(2) INFORMATION FOR SEQ ID NO: 559: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559:

AnATnCCAAA AGAATTAAAA GACAAAAATT TACTTTATGA AAAACGTATT CCTCCGAAAA 60 

AACTCTCTAC TGGTGAATTC AAAAACTATC CAAACAGATT GTATTTAGGT AAGCTGGAAG 120 

TTTCAGCAAC AGATGTTTAC GCCATCAGTA ATGCTAGCTA TACAAATGAG TTTCCCGAAA 180 

GTGGGAAAAA CCAACCTTCG GATAAACATG ATAAAATCGG CGTTTCTTTC GAAAGTGGGA 240 

AAAACCAACG TCCAGAAGAA TCCAATAATA CCAGTACTTT GTTCGAAAGT GGGAAAAACC 300 

AACCCAATCT ATATCTAACT AATTCTTTAG ATACAATAGA TACAATAGAT ACTGAAAAAG 360

AACGATTGCA ACAACAATTA TTGCTCGATC AATTTTCAGA AGTTCAAGAG CATACATTTC 420 

TTAGCAAAGA TAGCTTAAAA TTTATTGCTG CCTTTTCTGA TACTATTCAA GAAGCCCATG 480 

AAATGGTwGG AACAATTATT CGTGCTAAAA CTAAAGTAGA AAAAGAATAC AATmTCGTTC 540

TAATTGGTGA AGATTATCAA GAAGAaATAG ACAAATGTTT GCGACGAGTG ATGCATAAaA 600 

TCAAGACGGA TTCTACTGTT AAAAGTCCTA AAGGCTTATT TTACAAGTCT TTTTACAATT 660 

TATTTGTTGA ATGTGCACTA GAGAAAAAAA GTCAATTGAA CAAGAAAAGT AATTCTAACA 720 

CACCAGGAGT AATCACTCAT AACTGGGTTG AAAATCaATa AAGTaGCTGT TTaGTaTGaA 780 

TTTAAATTTA TGGAGGAAAA TTATATGTCT TTTGAATCTG AAAATAAAAA CGTAGAAAAA 840

ATAATTGTGT CTAATAAAAC GTATTATCTT GTTGGGACAA GTCACATTTC AGAGAATAGT 900 

GTTAAACTTG TTAAAGAGGT TATTGAAAGA GTTCAACCAG ATACTGTTAG TATCGAGCTA 960 

GATAAAAAAA GATATGAAAA ATACACTAAC TCTAACCAAT GGGGAAATAC AGATATAATa 1020 

AAAATTATTA AAGAAAAAnA ATtAGTTGTT CTAATATCCA ATATTGTTTA CAGTGCATAT 1080 

CAAAAAAAAT TAGCAAATAC TAAAGGAACT ACTCAAGCTG GTGAATTAAT TCAGGCAATT 1140 

AAAAGTGCTA AGGAAATAGG AGCAAACATT CAATTAATCG ATCGTGATAT CCAAGTTACT 1200 

TTTAAAAGAA TGTGGCGTCA TTTAAGCTTT TTGGAAAAGC CAAAACTTTT TATGACATTT 1260 

TTTACTGAAT TTGATGATAT TGAACAAGAT AAATTAGAAG AATATTTAGA GTCTGATAGT 1320 

TTTGACAAAG TATTTATTTC GTTATCAAAA AAATATCCTT CGCTGTATCA GGATATGATC 1380

ACGGATAGAG ATAAGTATAT GTCTACTAAA TTGAmAAATA ATTCAAGTCA AGTGAATGTT 1440

GTGGTGGTTG GGAAAGCTCA TATGAaAGGa ATTAAAGAGA AGTTAGAAAA AAGAACAGAG 1500

TTTTCCTTAG ATAATTTAAA TGAAATTCCA CCAAAAAAAT TArGTmCTAA ATtATtAGaA 1560

TTnCC 1565

(2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560: 

CCGTGGAACC TTCTAAGTCA CCACCAATGA TTCCAGGTAG CCCCATTAAT AAAATATCGT 60 

TGGATAATAG GCTCTTTTAA CnCGATCAGC ATCCCACGTC CCATAATTAA TAATGCAAGC 120 

CTAAAACAAT ACTCGGAATT TGAATAATAA TATGTGAACG GAAGGGGTTA ATATCTCTTA 180 

ACCAATGCAC TTGAGTGAAG GCTTCTGGAA TAGTGGCTAG GAGCACCATC ATAATTCCTG 240 

AAAAATAAAG TAAAACCACT TCTAGCTTGT GGGCAATTTC CAAAGTAATT TGTTTTAGCA 300 

GTTGACGATA ATGAGTATCC AACTTTTGGC TTAAGTGTTT TGTAAAGAAC AAACAGCCAA 360 

TTAAGAACGG AATAATATAA TAGAATAAAC GATAGAGCAA TAACCACAAA ACAATAATTT 420 

CACGATCGAC ACCTAAGTTG CTCAGTCCTA AAATCATCAT GACATCAAAA CTACCTAAAG 480 

CCCCTGGAAT CATCGAAGCA ATCCCAATAA TCGAAGCTGC TACATATAAA GGAACAATAT 540

CAATTAAAGG AATCGGTACA TCTAATAAGT AACCAATACT GATAAAGGTA ATTAAAACAC 600 

CTGTCCATTC TAAAAACGAA GGAAACCACT AGAGATAAGC GTTTTTTGCA TCGAGGGTAC 660 

CAAnGGACTA TTTTTTTGGG AATGTAAATT AGTTAAGTAC nGGAATTAAG CCGCACCTAA 720 

TTACCnATCC CATATTGCTG TAATAAGCCT CTTTGnTCAA CTGACCAGGG AAAGCAAGAT 780 
(2) INFORMATION FOR SEQ ID NO: 561:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561:

GAATGTGGAT TTAATTGATC GTGATGTGTT TGGTTATGCT GTTGATAAAG GCTGGATGTG 60 

TGTCCAAGTA TTCTTTGTCA GACAAGGAAA ATTAATCGAA CGTGATGTAT CGATTTTTCC 120 

ATTTTATGAT GATGCCAGTG AAGCATTTTT AACTTTTATT GGACAATTTT ATCAAGAGAA 180 

TGAGCATTTT GTGCCAAAAG AAGTCTTGAT TCCTGATGAC ATTGATAAAG AAAGCGTCGA 240

AGCTTTATTA GCTACGAAAG TCTTGCAGCC GCAACGGGGA GAAAAGAAAA AACTGGTTAA 300

ATTAGCCAGT AAAAATGCTG CAGTTGCTTT AAATGAAAAA TTCGATTTGA TTGTACGGAA 360

ACAAGAACGA ACAATTGGCG CAGTTGAAAA ATTAGGCAAT GCGATGAACA TACCTGCGCC 420

CATTCGAATT GAAGCATTTG ATAATTCAAA TATTATGGGA ACAAATCCTG TTTCTGCGAT 480

GGTTGTGTTn ATTGACGGAC GGCCAGCTAA AAATGAGTAT CGAAAATATA AAATAAAAAC 540

CGTCCAAGGT CCTGATGATT ACGCTTCTAT GAGAGAAGTC ATTTATCGCA GATATTCACG 600

AGTTTTAAAA GAAGGCTTGC CTTTTCCTGa TTTAATTTTa ATCGATGGTG GTAAAGGTCA 660

AGTCGATGTT GCCAAAGATG TGTTAGCCAA TCAACTAGGA GTCGATATTC CTGTAGCTGG 720

ATTAGCCAAG AACGACAAGC ACAAAACGAG TGAACTTTTA TTTGGTCCAA ACTTAGAAGT 780

TGTACCATTG GAAAGAAACT CACAAGAATT TTTCTTGTTG CAACGAATCC AAGATGAAGT 840

CCATCGTTTC GCCATCACAT TCCATCGTCA ATTACGTAGT AAAAATAGTT TTGCTTCCAA 900

ATTAGATAAC ATTGAAGGCT TGGGACCGAA ACGCAAAAAA AATCTTTTAA AAGAATTTAA 960

ATCGCTTAAA AATATTACGG CAGCCAGTGT TGAAGAATTA AGAAAAGCTG GGTTACCTGa 1020

AACTGTCGCA AAGAATGTGT ATCGTCATTT GcATCAAGAA ACAACCTCTG AGATAGAAAA 1080

ATAAATGAAA ATGaaCCAGa CCmATTaGGT GCTTTTCCTA ATTTGtCTGG TTTTTTTATT 1140

CAATTTTTTC ATGAAAAGTT GAACTCTCAA GAAAATATGA TAACGATTTC TTTCGTGATA 1200

TACTAATACT AACTTAAAAG AGAGGACGTC TGATCATGAA AATACTAACA CCCGTTTATC 1260

AAATAAAATT TAAAAAAGTA CCAAATCAAT TACTGATTGA TGAGCGTTTA ATGAATGAAT 1320

TGGGTAGATG GCTGAATCGT ACAGGTCGAA TTTGGGCCTG TCAATCATCA AAATCGGCAA 1380

AGGAATTTAA AAATACATTT TATGAAAGTA CAGGCTTGTC CGCAAATGAA GTGTATATTT 1440

CTGCCGAGAC GGATGGCCTT TTTCGTTaGC AGAAATAAAG CGCTGTTAAA GTGCACATCT 1500

TTCCATAACA ATGTCATGnC AGCCACACGG AAGTACGTTG GTACTAACAC GGTTGAAAAn 1560

GAAGGGGCGT TAGTGTAnGG AAAGGTAGCA ACCTTGGGAT TAACnGTGAC 1610


(2) INFORMATION FOR SEQ ID NO: 562: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 
TGAATAATAA ATAAAGGAGA GAAACAAAAT GAAAGATACC TTTCGCTTAG AAAACCAGAC 60

GATTTATTTT GGTACAGAAC GGGCAATTTC AGCCAGCCCT CAAACGATTT GGCGCTACTT 120

AACTGAGACA GACAAGCTAA AACAATGGTT TCCAGAATTA GAAATTGGTG AATTAGGTGT 180

TAACGGGTTT TGGCGGTTTA TTCTGCCCGA TTTTGAAGAA ACAATGCCTT TCACTGATTA 240

TGCAGAGGAA AAATATCTGG GCGTGACTTG GGATACTGGT ATCATCTATT TTGACTTAAA 300

AGAACAAGCA CCTCATCAGA CCCTGTTAGT CTTTTCAGAA TCTCTCCCAG AAAATTTTAC 360

AACACCTCGC CATAAAGATA TCGCTGGTTG GTCAATTGTT CTCAACCGCT TGAAGCAAGT 420

AGTTGAAACC CCTGATGCAG CACCAGAAAA AATAGATTTT CCGCAAATAG AAAATCATTA 480

CTTGGaAAAA TTAACGAATT TAGAAAATTA AAAAGTGTGG CACAACGGGT AAATCACCCT 540

CTGTGcCACA CTTTTTAAAA ACGTAAAATA CGCTGATAAA CAAATAAACC TAAATCAATT 600

AAGCAAACAA CAAAAATTGC CAATAATACA AAAGAGACCC GCAACTCTGC GGCGTACATC 660

CCTCGTAAAC TAATTGTTGC CAAACATAAC CCAAAAATCG TTGCCATTGT CACTTGTAAC 720

ACGGTTCCTG GTTTTAGGGC ATAAACAAAA ATGCCAATCA AACTAGAAAC AATGAGGTGT 780

GTCAGTGGCA aTTGAAGCCA ATTGACAACA AACAGACTAA GGAAACATCC TAAAAGAGTC 840

CCCCAAAAAC GTGTCTTAAT ACGCGTGTTC ATTTCTGCTT TCACTGGATG CAAGAGTAGA 900

AAAACATTCA TCGGAATCCA AATTATTTTA ATGCCTCCTA CACTCCACAT TAAAATAAAA 960

CTCAGTGTAA TGGTGAAACT CATACGAATA GCTAAACGAA ATCGTgCACC CTTTCGTCAG 1020

GACTTTCTAA AAATTGTTTG AAACTTGTAA TCCCAAAAAA AATCTGTTGT AAGTCGTGCC 1080

AGAAATTAAT AACTGCCAAT TTTTGTTGCT TCATATTTAA TTGAATGCTT TTAAAAAAAC 1140

GAACATGGCC CCTACTCTTT AAGCTAGTTA GCTGACGTAA TCCTGTTTGC CACCAAGCGT 1200

TTAATGTTAA AGCCAATAAA AAAAGAC 1227



(2) INFORMATION FOR SEQ ID NO: 563:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1619 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563: 



AGGATCCCGC ATATTTAAAG CGCTCGTGAT nCAGTCAATT AAATCTTGAG GTGTGACTAC 60

CATTGTTGCT CCAATATCAC GAGCTTTTTG TTCTTGTTTC CTCTTTTCAT TTGTATCTTG 120

ATCTATTATA TCGCCAACCG ACGTGAGGTG ATGTCCAATT TCCTCTGCAA CCGTACTAGT 180

TAATTCTCTA GGATGTTGTT GAGGATTTAA ATACACAACA TTGTTAATAT ACAATCCTTT 240

TTGCTTTTCG GGCATGTTTG GTTCAAACTT ATACGTTAAA TCTGAAAAGC GAGCCATCAG 300

TTCTTCTGAA GTTACCATTT AAACACCTAC TACTTTAATT ATTTGTGATC GCGTTTCTTA 360

ATATAATCAA TGAAAGAGAG AATCTCATTC ATTTCTGTAT CTGAAACGTT ATCGTCTATA 420

TGAGCAGCGA CGGTTAATTG TTTTTTTGAT AAGGAAGTAT CTTCAAAACT TTCTCTCCCA 480

TGTAAATAAT CTAAACTTAC ATTGAAATAG TCAGCAATCT TATTTTGGAT ATCTATATCT 540

GGAGTTCTTC TTCCTTGTTC ATAGGAAGAG TAAGTCGTTC TAGCTACACC AAGTGCATTT 600

GCAACATCAG TTTGTGTTAA TTTTTTTTGT TTCCTTAATT CTGTTAAACG TGTACCGaAC 660

ATAATGTTAC CTCCTACGAT TCGTTATGTA CATATAATAC TACGCAATCT GAGTAGTTTG 720

AAGTTAAATA TATTAAATGT GTCAAAAAGA GTAATTAATT GTTGACATGT GTCAGAATGC 780

GTAgwATTAT AAAAGTACGC AATACGACAC ATAAAGGAGG ACACAAAATG AGAGATTGGC 840

TATTGGAATT GCGTATAAAA AATAAAATGA CTcAAGAAGA GGTTgCCgTT aAAgCGGaAA 900 

TTTcAAGAAC CaCGTATGCA TCTATTGaAC AGGGAAGAAG AAGGCCATCT GTGGaAAGTG 960 

CTATGCGAAT CGCATCAATC TTaGATTTTg aTTgGACAAT TTTTTTTtGa GTAAcTATGA 1020 

CTCGTTATGA CTCTTTATAT TTGGATAGGA GGTGAAAATA TGCCAGAAAA ACAAAAAATG 1080 

ATTGATTTCA TTTTGGAAAT CATGCCAAAC GTAAAGCTTA TTGTTGAACT AGCmACAGAT 1140 

GATCAGTTAG TTCGTCTGTA TGrACAAGCA CAGGATAAGT TGAACTATCm ATTGGATCTA 1200 

ATCTAATTGT ACaAAAAAAT ACATCATATA AAAAGATGGT TAATTAAGAA AGGAAGAAGT 1260 

AAAAGTAAAT GTCaATGTCa GTAATATTAA GAAAGTCGTT AATCGAAGTT CTTAATAAGA 1320 

AAAATGAAAA GAAGAAAGAT GTAGCaCGTG AAATCAGTAC ATCTCAACAA TCATTGAGTG 1380 

ATTGGACTTC TCAGAATAAT GTTAAACCAG TAACAATTGA AAATGCTCTA AGGCTTAGTG 1440

ATCACTTTAG AGATTCGACT TTCACAATGG AAGTTATTCA TCAGTTTTTT GGAATGTTTA 1500 

AAACTTGCGA TGGAGATGTT TATCGAAAAG ACCCTTCGTC ATTGGACCGA TTACAGAAAA 1560 

TTGAATCTAA TGAGAGAnnA GCATTGAAGC ATGACATAGA AAGAATTGTT TTGAAAAAT 1619

(2) INFORMATION FOR SEQ ID NO: 564: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564: 

ATCCCGCAAG TCGTCGGTnC TAAAGGAAAA CTAGGACGAA ACAATGTAAT TTTTACTTTA 60 

TCTTTGGCTT ACTATGCGTC AAAAATTGAA CAAGACCGCT GTTTTGATGA TATGGATGTA 120 

TTCAACAGTT CTTTGAATTA TCCTCTATCA GTTTCAGAAA TAAAGAGAAT TGTTAAGAGT 180 

GCTTACTCAG GTAAATATAA AGGCGCAAcA AAACCTATGT GCAAGAACTA GTGTCAAATT 240

GGGGATTAGA ATCGTTATCT GAGGAAAAAA TGTTTACACA GCAAAGAAAT ACTTGGTACA 300 

AATTTAAGAA GGAACGTCAT AAAAGAAAAA AATCTCACCT TTCCGAATGG AAAAAAGATA 360 

TTTTAGATTA CCTTGAAACC CAGTGTTATA GATATCGACC TGAAGTTAGT ATGAAGAAAA 420

CAGATTTGCA AGCGGCTGTT ACTTTCAATG GGCAATCGAT TCCTAAAAGA TCACTTGACC 480

GTGCACTACA AGAATTAGTG GCCGAAGGTA AATTATTTGT ACAAACAAAA GCCGGTAGGG 540

GTGGAGGACT AGTAGTTGCT ACTCGGAAAG CATTAATTCG TACAGTTATT CAAGTAAAAC 600

AGCAAGTCAA ATATGCTTAT AAACAGGGTA TTAAGACATT CTTTAAAGAA GCTGATATGT 660 

TAGTAAGACT GTTTGAACAA ACTGATAAAn ACAGTCAAAG AGCCAGAGAA TACAAGCAAT 720

TAAAACTATG 


GAAbAb I bbA 


1AAA1 1 L AAA 


PATPTPAAPT 


GCCAATACAT 

vj v- nn x nv>ri x 


TATATTTCTT 


780 


TTATACTTAA 


CAGbTATAl A 


Al 1 1 1 lulAl 


T ATTTTTTTT 


APTTATTTPT 

rwj x x x \ x x x x 


T AAAAT TAT T 

x ruirvi x x ** x x 


840 


TTATACAAGC 


CTTAT 1 Al 1 I 


bu 1 i Ab I AAv- 


TTTTAAPAAP 


GGAGGGTGAT 


AAAAA TGAAG 


900 


AAGTACAAAA 


AGTTTTGTTT 


111 Abb 1 b 1 1 


TPPTT ATT AP 


PTTTPPTTTT 

1 1 1 VI O 1111 


APPTAPTTGT 

rtUU 1 t\\J 1 1 VJ J. 


960 


GGTACAAATA 


y-f rri /"* y*" 1 m TV 7V TV 7V 

CTGCTACAAA 


1 bA 1 1 LAtAn 


PATATAAPPP 


AAAAAAAAPT 


APA APAPPTA 


1020 


GCGACTTTGG 


CTGCAGGGAC 


Abb 1 b 1 1 bAA 


APTTTAPATP 


PAPPAAPTPP 


TPTTPATPAA 


1080 


ACGAGTATGA 


CTTTATTATC 


bAAbb 1 bAl b 


PA APPTTT AT 


ATPGATTAGA 


TGA AAA AAAT 


1140 


CAACCGCAAC 


CAGCAATTGC 


Abb i bbbbAA 


ppzirA T i T iP.TP,A 


PP,A AT A ATPP 


PAAAAPATAT 


1200 


ACCATTGTGA 


m m tv ta t\ m r** 

TTAGAGATGG 


bbb 1 AAb 1 bb 


PPTP. ATPPTA 
ub 1 bM 1 00 1 rt 


PAPAAATAAP 


TPPPAGTGAT 


1260 


TTTGTGGCTG 


CGTGTCAAAG 


AGTTb 1 AbAl 


bb 1 AAAAbAb 


PTTPTPPA A A 


TPTPPAACTG 


1320 


TTTTCTGCTA 


TAAAAAATGC 


/"•TV TV TV f~* Tv TV Tv rp fp 

CAAAGAAA1 1 


bbl IbAbbAA 


AAPAAPPAAA 
AAbrxrto ^rtJ-lrt 


APATAPTTTA 


1380 


GCAGTGAAAA 


GTATTGGTGA 


G AAAAbA I 1 A 


P 7A A ATTPA AT 
b AAA 1 1 bAA 1 


TAPTTPA APP 
inul 1 oi^Lrtv_Vw' 


A APAPPTTAT 


14 40 


TTTACTGATC 


TGTTATCCTT 


t\ t\ prrPTTA f* 

AACGGbTl Ab 


1 A 1 bbAb 1 Ab 


AP.PA^:A A APP 


A ATT A AAPAP 


1500 


TATGGGAAAG 


ACTATGGTAC 


rp rri /™»rp/~« 71 tv t\ t\ TV 
lib! b AAAAA 


rppn 7ATTPT A A 
1 bAA 1 1 b 1 AA 


PA A ATPP APP 


ATTT A APTTA 


1560 


ACAAATTTAG 


AGGGAGTAGG 


/~i t\ O rp f"P /"""T> /"* TV rp 

CACT 1 b 1 bA 1 


AAb 1 bbAbbA 


TTTPTA A A A A 


T A A APAPTAP 


1620 


TGGGATCAAA 


AAGATGTTTC 


fry ta rn ta ta t\ t\ 

TATGG A f AAA 


Al 1 AAb 1 1 bb 


AAPTAPTPAA 
AAb 1 nu 1 Lnn 


APA A ATT A AT 


1680 


ACAGGAATAA 


ATTTGTATAA 


m/"»'7N rp /•"*• TV jT> TV TV 

TGATGGAbAA 


^ r PAP7\ r PP7 , i r PP 

b 1 Ab A 1 b A 1 b 


P APPTTTPPP 


TPPPP A AT AT 


1740 


GCAAAACAAT 


ATAAAAAAGA 


1 AAAbAA 1 A 1 


TPAAPAAPAT 


TAATPPPPAA 


TAPAATGTTT 


1800 


T TAG AAAT b A 




PP A A A A T A A A 


PTTTTACAAA 


ACAAAAATGC 


CCGAAAAGCG 


1860 


ATTAGCTATG CTATCGACCG GGAAAGTCTA GTTGAAAAAT TACTAGATAA TGGGTCCGTT 1920

GCTTCTGTTG GCGTAGTACC AAAAGAAATG GCTTTTAATC CAGTAAATAA nAAAGATTTT 1980

GCTAATGAAA AATTAGTGGA ATTTnACAAA AAAAAAGCAG AAGAATATTG GGGATAAGGC 2040

AAAAAAAGAA ATTGATTTAT CCAAAAATAC TCTTTAGGTT Tn 2082


(2) INFORMATION FOR SEQ ID NO: 565: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1091 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 

AAATGACTTG ACTGTTGCTT CAGTTGnCAG CATTCACCTT GCTTTTAGCA GGCTGTAACA 60 

CAACCAATAA TAAAGGCACG ACCGCAACGA GTGATTCTGT CGCTAAATCC AGTGAAAGCA 120 

AAACAAAGGA TTCGGTCGAC TTAAATTCAC TAGATTTACC ACAACTAGAT GACAAAGTTG 180 

CCGAAGATCA AGACTTAGTC CAAATGGTTA CTTCAATGGG AAATATTGAA ATCAAACTAT 240

TTCCTAAACA GGCTCCAAAA ACTGTCGAGA ACTTTATGAA ACATGCCAAA GATGGCTATT 300

ATGATGGCTT GACTTTCCAC CGCGTCATCA ATAACTTTAT GATCCAAGGT GGCGATCCTA 360

GCGGTGATGG aACTGGCGGA AACAGCATCT GGaATAAACC ATTTGAAGAT GAATTTTCAA 420

ACCAGTTATA TAACATTCGT GGCGCGTTAT CAATGGCGAA TGCaGGrCCT AATACTAACG 480

GCAGTCAGTT CTTTATTGTC CAAAATACAG ATGATCAATC TGATGGTCTT TTATATGATG 540

ATTATCCAAA AGCAATCATT GaTGCTTACa AAAAAGGCGG CTATCCTTCT CtTGATAAAA 600

AACaTACTGT TTTCGGTCAA GTAACCAAAG GAATGGATAT TGTCGATAAA ATTGCCAAAG 660

TCGAAGTTGG TGCCAACGAT AAACCAAAAA CTGATGTTAA AATTGAAAAA ATCAATATTC 720

TTCAAGAAGC AAAAAAATAA CAAAAAGTCA GAACGTACGC TGAAGATACG TTCTGACTTT 780

TTGTTATTTT TGAATCGCTT GTAATGCCTT TcTGGtACTT GGkCyTGkTT GACCTCyTCa 840

TACGTATTGA TGTGGGcCCC TTwATTkGrG mCTTTtAGaT AATGrTCTGk TTTTAAwTCa 900

kGGTCCGCTG tGaAAGkTAw TTCyCgCGkT TTGCCTTCCT CATCTGCGGC TACTTGCGTA 960

TAACTAGCAA TTCCATAGCC ATTAACGCTA ATCGGCTTTC CTTGTTTTAA CATACACATT 1020

TCCTGCTTTA AnCCAATGGG ATTTAAATGA TCCTAnCACA CCAGCTAAAT TCGCCAGAGG 1080

TCCnCTCCTG C 1091



(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566: 



ATTTCGTTCA CTGGTAAATT GAGATATTTC AAAACACCTT CTCGATTTTG AACTTCATCA 60 

AAACAAATCA CTGTACCAAA ACCAGTACCA TCTTCACTAT CATCCAAATC ATTTGTTGAT 120 

TGTGTAATCA ATACTAAGAA ATTATTAAAC GAACGTCCAA TACGTTTCAT TGATTTTAAA 180 

ATCGCTTTAC CTTCTGGCGA AGATTGAAAA ATCCAAGATT CATCAAAGAA CGTCACTGTT 240 

TCTTCTTTTG GATTTCTTGA ACCAAACTTC GAACAAAATG TTCCTAAAGC AAACATCAAT 300 

GTAACAGAAA GACGTTTATT TTCATCTAAA TTTTCATTTT TATCAGTTGG TAAGTCTAAG 360 

TCCTCAATTT CTAAAATTGT TACTTTCTTA TCGAATGATA GACCCTCAAC ATCACCATGA 420 

GAAAAAGCTA ATTCTAAAAC TGAACCTTTA ACAGTCGACA AAATAAATCT ACCCATATCT 480 

TTCACGTCAG AAATTTCACT TTCCACTAAT CGCTCAATTA CATGCCAAAA TCCAACCTTT 540 

TTGCCTTCTT TCCGTTCTAA AACAACTTTC GCAATCGCTT CATTAATTTC TGTTTTTTGC 600 

TTTAAATTCC ATGTTCCTTC GTATATATTG GTTAACATTG CTTTAGTTGT TGCAATCGCT 660 

TCTGTTTCAT CAAATAAAAC AATCGGGTCA AGCACGCCAT GGTTTTTCGT ATCACGAACA 720 

TCTAAAGTGA CGAAATTAAA GGACTTCACa AAATCAACAT CTAAAGGATA TTTCTTTTGA 780 

TATTCAGGGT CATTAATTGT GTACATAAAT TGCTTTCGCA TTTCTTTTTT TGGATCAATG 840 

TATAACACTC TGTCTTTTAA AAGTGCCATT TGCATAAATA AGGTTTTAGC CGTCACTGAT 900 

TTTCCATTCC CAGTATCACC TGTAAGCGCC CAATGTGGGT TGTTTGTTTT CTTTCCATCA 960 

ACCCCTTCTT TATTTGCTAA TAGCATGTTC ATAAAAATTA GATTTTTTGA ACTAGAAATA 1020 

ATAGATTTTC TATCTTCTTT TTCTTCCAAA GTTGTATCCA CTCGTCCAAG GTAAAAGCCT 1080 

AACTTCGTAC CTGCATGAAG TGAAGTAAAG AAATTCAACT CGGCAAACGA TTCAATCGTT 1140 

GACGTGTGTT GCCAACGAGA AGTCTGCATT CGCAAAAAAT TACCAAATAA GGTTGATTGA 1200 

AATAAATAAG GATTATCAAA CGTTGCTTTA ATAAGTGGAA TATCTAAACT ATCAAAACGA 1260 

TTGATTAAAT ATTTTTTTCT TGCACGCATT TGTTTTTTCG TGCTACCACT AACAACAATC 1320 

ACTGCTTTCC AGTCAATAAT ATCCACACCA TCATCAATTT TTTGGTCTAA ATCGTCaAGT 1380 

GATAATTGAC CTTCAACAAT CTTTCGTTTT TGTTTAGAAC CAGCTAAGTG TGCTTCCTGC 1440 

ATGATATTCT TAGTGCGGGT TCTTGCACGT GACGAACGCC CAGACAGTGC TAATTGCCCT 1500 

TTACTTTCTG CAAAAGTCGC TTGAATCTTC ACTTCGACTG GGAAAGGCAT TTTTTGAACG 1560 

ACCTCTAAAA AATGT 1575 



(2) INFORMATION FOR SEQ ID NO: 567: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: 



ATGGTCATTT AAATTTGATT GAACGCATGC GnAAGTTATT TGACGAATcA TTTAtTGGCG 60 

TTTTTATAAA TACAAGTAAG CAAACATTAT TTACGCCAGA AGAGAAAAAA TATTTAATTG 120 

AAGAAGCAAC TAAAGAAATG CCAAATGTGC GTGTTATTAT GCAAGAAACC CAGTTAACTG 180 

TGGAAAGTGC AAAATCATTA GGGGCTAATT TTTTGATTCG TGGCATTAGA AACGTCAAAG 240 

ACTACGAGTA CGAGAAAGAC ATTGCTAAAA TGAATCAACA TCTGGCGCCA GAAATTGAAA 300 

CGGTTTTTCT ATTAGCAGAA GAACCATATG CACATGTAAG TTCCAGTTTA CTAAAAGAAG 360 

TTTTACGTTT TGGGGGCGAT GTGTCTGACT ATCTTCCTCC CAATATTTAT CATGCGTTAA 420 

AACAAAAGAA GAATGACTGG aGCTAAAATT ATGAATAagA AAACAGAAAA ATTTTCGTTT 480 

AAATCaCTCA TTCCAaTGGT ATTGGCATtG CtGCTAATTG GTTTGTTTAT CGTCCCAATT 540 

CCTTATTATA TTGAAGGACc AGGAACGACA GAAAATTTAA AAGAGTTTGT AACTGTCGAT 600 

GGAAAAAAAG ATACGCAGTC TgGTGCTTTT TATTTAACAA CGGTGGGCAT TCGTTCGGCC 660 

ACTATTTTTT CTGCCATAAA AGCAAACTTT TCGGATTTTC AAGAAGTCAT GTCCAAAAAA 720 

GAATTAATGG GTGATAGTAG TAATAGCGAA TATAATCGAA TTCAACAATA CTATATGGAT 780 

TCCTCAAAAA ACGCCGCAAT TGAGCAAGCC TTAAAATTAG CGAAGGTTCC ATATGAGATG 840 

AAATTTAAAG GCGTCTATGT GTTAGCGATG GAAGACAACT CAAGTTTTAA AGGAAAAATT 900 

GAAGTTGGCG ATACCGTGAC AGGCGTGGAT GGTAAATCTT TTAAAAGCAG TGAAGAATTA 960 

ATGAATTACA TTAAAGCACA AAAAGTGAAT CAAAAAGTCA CTGTTCAATT TATTCAAGAT 1020 

GGCAAAGCAA AAGAAGCGAC CGGCAAATTA ATTGAATTGC CAACAGATAA AAAAGCGGGG 1080 

ATTGGTATTG GTTTGACCGA TCaTACAGAA ATTGATTCTT CTnTCCCAGT ATCTATTGAA 1140 

GCGGGTGATA TTGGTGGG 1158 
(2) INFORMATION FOR SEQ ID NO: 568: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568: 



AATCAATTCG TGACTGTTCG GAATGCCTTG AATTGGTCTT TCAATATCCC TGTCGTTCAT 60 

GTTAATAATG AGCTAAGAAA AAAAATGGGT GATGaTAATT tCyCTTACAA TCACTACTTA 120 

AGTAAAaTGA ACTATCCAGC TTCAGATGCT TGGGCTTACG AATCTGCGCC ATTGGGTTCA 180 

GTAGAAACAA ATGTTGTAAC ACAAACAAAC GGCTTCCAAG CGCTTGCTAA CAAAGGTAAA 240 

TATCAAAAAG CCTACATGAT TGAAAAAATC ACAGACAATA GCGGACATGT GGTCTACGAA 300 

CACAAAGATG AAGGAACGCA AGTTTATTCA CCGGCGACCG CTTCTATTAT GAATGACTTG 360 

TTGCGCTCAG TGGTGGATTC TGCGAATACG ACAAAATTCA AACCAACGTT AGCTGGCTTG 420 

AATCCTCATC TAGCTAGTGC TGATTGGGTC GGAAAAACTG GGACAACTGA TGAGTTTAAA 480 

GATTCATGGC TGATTGTCTC TACCCCAACT GTGACACTAA GtTCTTGGGC AGGACATGAT 540 

CTACCAGCAC CAaTGACGAT GACCAGCGGC GACAATAATG GGAATTATAT GGCGAACTTA 600 

GCCAACGCGC TGTACTATGC GAATCCAGAA TTGTTTGGCA TTGGTCAAAA ATTTGAATTA 660 

GATCCAAGCG TTATTAAATC AAAAGTTTCT GaATtTACTG GTGAAAAACC TGGCTCAATT 720 

ACCTATAATG GTGCCAAGTT CAACACaCCT GGCAAAACAA CGACTTCATA CTATGCCAAA 780 

GATGGTGCAC CACAAAGTAC CTATAAATTT GGGATCGGTG GAACAGACAG CAATTACGCT 840 

TCTTACTGGG GAAATCTTGC TCCACGTGCC ACTACAAATA ATAACAACAA TAACAATAAA 900 

AACAACGATA ATAAAAAGAA TAATAATTAA CTCGCTAAAA TGAGCCAGAA aaCGTATTTT 960 

cTGGCTCATT TTTAGGTATC AAAAAcGGCT AAACTAAGAG TTAaCGCTCT TAGTTTAGCC 1020 

sTTtAATTCA ATTTATTTTG TTGAACCTTT TTGATCAATT CCATAAGGTA AAACAcTGTT 1080 

TTTTCTTCAA TTtCTTCTTT GTTCATTAGT TTTGTTAATA AACGCATTGA CACGGCACCG 1140 

ATATCATATA AAGGTTGTGT AATACTTGAT AGACGTGGAC GAGAGACTTC TGTAAGCAAT 1200 

GAGTTATTGC TTGTAATGAT TTCAAATTCT TCTGGAACTT TCACACCAGC ATCTAGCATC 1260 

CCATCTAATA AGCCAATTGC TAGCTCATCA TCTGTAACAA AGGCAGCAGT TGCCCCACTA 1320 

TTACGGACAC GCTCAGCCAA ATTAATGCCT GCTTTAAATT TATATTCAGA TTCAAAAACT 1380 

AAGCCTTCAT TATAAGATAA GCCATTTTCG GCTAAAGCTT CTTTGTAGCC TTTCATTCGG 1440 

TTTTGACCAT TAATTGGATC GATTAATGCA CCACTAACAA AAGCAATTTT TTTGTTTCCA 1500 

TTTTTCGCTA AAGTCGCTGT TGCATCTTTC GTTGCTTCTG TGTAATCAAT ATTAACGCTA 1560 

CCAACTTGTT CGTCTGGATC AATTGAACCT GCTAAGACAA CAGGTGTTTT TGAACGTGAA 1620 

AATTCACCAC GAATATCATC TGTAATGCGA TGACCCATAA AGATAATCCC ATCTACTTGC 1680 

TTGGCAAGTA AATTGTTTAA TACAGTAACT TCTTTTTGAT CGTCTCCATC TGAATTAGCT 1740 

AAAATGATAT TGTATTTATA CATTGTAGCT ACGTCGTCAA TGCCACGTGC TAATGAAGCA 1800 

AAAAATGCAT TACTAACATC TGGAATAATG ACACCCACTG TTGTCGTTTT TTTACTTGCT 1860 

AAACCACGTG CGACTGCGTT AGGACGGTAA TCTAAGCGAT CAATCACTTC TAAGACTTTT 1920 

TTACGCGTTG CTGGTTTTAC ATTGGGATTA CCATTGACAA CACGAGAAAC AGTAGCCATA 1980 

GATACATTTG CCTCTCTAGC AACATCATAA ATTGGTAATT GTTTGTnTTT CCATGTTTCT 2040 

TCTCCCnATA AATATGAATT TGGnTCTCAT TTTCTGAAAA CAnCTGGTTA CCATTTCTGT 2100 

CCTAGCCC 2108 
(2) INFORMATION FOR SEQ ID NO: 569: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569: 

TTTTGACTTT TAATCACTAT TTTGCAAAAA AnTCAGACTT TTTTCATTAT TTAAGAGACn 60 

TCCCGCAAAA AGAACTTTTC TTTTTTTGCT ATTTCTAGTA AAATAATTCA TTGAGAGCAA 120 

AACTATACGG AGAGGAGAGA GTTTCAATGG CAACAGGTTG GGTAGTATTA ATCGCAGTAA 180 

TCGCTTTATT AGTAGGCGCT GCAGGGGGCT TTTTCCTTGC ACGCAAATAT ATGCAAGATT 240 

ATTTCAAAAA AAATCCTCCC GTTAATGAGG ATATGTTACG CATGATGATG GCATCAATGG 300 

GTCAAAAACC GTCAGAGAAA AAGGTTAGAC AAATGATGCA ACAAATGAAA AACCAAGGAA 360 

AATAGTAAAG GTTATTCAGG TAATCATGAA TAAGTGTAAA AACGGGCTGA TTTTCAGTCC 420 

GTTTTTTTGT TGTTTTGCGG GAGAATAGGA TAATCTAGGA GGAAAAGAGC GAAACGTTTA 480 

ATCAAAAACA ACAATAACGA AGAAACCTTC TAGCTCGAAA GTAGTCTTCG TAGAACAACA 540 

AAGTAAAAGG AGGGACAACT GAATGTCCAT ATTCAAAAAA TTAGGCTGGT TTTTTAAGCA 600 

AGAGAAAAAA AGTTATATTA TTGGGGtTTT CTCATTAATG ATGGTCGCTC TTGTTCAATT 660 

AGTCCCGCCC AAAGTTATTG GCGTCGTTGT AGATGAAATC GTTaACAAAG AAATTCGCTT 720 

AACGAAAATT ATCGTGTGGG TTGCACTCTT GATTGGTGCT GGGCTTGCCC AATATCTTTT 780 

TCGCTATATT TGGCGGATGC ATATTTGGGG GAGTGCGGCT CGTTTGGAAA AAGAGCTACG 840 

GACTCAATTA TTTCATCATT TCACAAAAAT GGATAGCATC TTTTATCAGA AATATCGGAC 900 

AGGTGACTTG ATGGCGCATG CAACCAATGA TTTAAATGCC ATCCAAAATG TTGCTGGAGC 960 

TGGGATTTTA ACGTTTGCCG ACTCTGTGAT TACGGGAGGm ACAACGATTA TCGCAATGGT 1020 

TCTATTTGTC GATTGGCGCT TAACATTAAT TGCTTTATTA CCGCTGCCTT TATTAGCCGT 1080 

CACTTCACGA GTTCTAGGTT CTAAGTTGCA TGATGCCTTT CGAGATTCAC AAGCTGCTTT 1140 

TTCAGCGATT A 1151 
(2) INFORMATION FOR SEQ ID NO: 570: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 736 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570: 

AAAATAATGA AGGAGAATAG ATATGGGAAA ATTTCAAGTA ATTGATCATC CGCTAATTCA 60 

ACACAAACTA ACAATGATTC GCGAAAAGAA CTGTGGGACA AAGGTTTTCC GCGAaGTTGT 120 

CAACGAAATT GCTATGTTAA TGGCTTATGA AGTATCACGG GATATGCCTT TAGAGGATGT 180 

TGTCATCGAA ACACCGATGG GTAAATCAAC ACAAAAAACA CTTTCAGGGA AAAAAGTAGC 240 

AATCATTCCT ATTTTACGTG CCGGCATTGG CATGGTTGAT GGGATTTTAG AATTAATTCC 300 

AGCAGCGAAA GTTGGACACG TTGGACTTTA TCGTGATGAA GAAACATTGC AACCGCATGA 360 

ATACTTCGTG AAATTACCTG AAGACATTGC AAGTCGTCAA TTATTTGTTG TTGACCCAaT 420 

GCTAGCAACA GGTGGTTCCG CAATTATGGC CATCGATTCA TTAAAAGAAC GTGGTGCTAG 480 

CAACATTAAA TTTGTATGTC TAGTGGCCGT TCCAGAAGGT GTGAAAGCTT TACAAGAAGC 540 

GCACCCAGAT GTAGATATTT ATACAGCAGC TTTAGATGAG CGCTTAAACG AAGACGGCTA 600 

TATCGTTCCT GGTTTAGGGG ATGCTGGTGA CCGTTTATTC GGAACAAAAT AATAACAAAA 660 

AACCGTTAGg TAACCGATTA TGGkTACTAA CGGTTTcTTT TGCyTAAAGA mCTTycTAGG 720 

TGTwTAtAAA AAtaaA 736 
(2) INFORMATION FOR SEQ ID NO: 571: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571: 
CGAAAGCTAG ATACGGCGGT AAGTAAGCAA TCGATTATCG ATTGACAACT ACGAGCAAAA 60 
GCTGAGACAC ATGTCTCCAG CTGACTATCA TAGGATTTAG taTGTGGGAC AGCAGTCAAG 120 
CATGACTTTT GTCCCACATA TTTTCTTTTA TATAATATGG TGGCTCAACT GGAAAAGAAT 180 

ACTTCTTTTC CGTTCATTAT GTTATGATAG ACAAGAAAAA AACACCAAAG GAGAGAGAAG 240 

AAAATGACGA TCAAAGAATT CATTAAGAAA AGTCGCTTGA AAACATGGGA CGTTATTATT 300 

ATTGCTATTT TAGTTCTTGG TTCTTTTCTG CCCCTTGTTG TTTTTGCCAT GCAAAACCGT 360 

GGTCAAGAAG CGGCTACGTA TCAAGCAGTC TTGAAAGTAG ACAACAAAGT AATCAAAGTC 420 

TTTGACTTAA AAAAGGACGG CCCGCATTAT ACATATAAAT ACGAAGCAAA AGACGGTGAT 480 

TACAACCTGA TTGAAGTAGA TGGTGATCGT ATCCGCGTGA AGGAAGCCAA CTGTGCTGAC 540 

TTAGTGGATG TTCGACGAGG CTGGATTTCC AAACCTGGTG AAACCCCGAT TGCCTGTCTA 600 

CCCCACAATT TGTTTATCAC CGTGGAAGCT TCTGATGGGA GTGAAGACGG TAGCTTAATT 660 

TATTAAGCAC GACAACTATG AGCAAATTAC ACAAGAATAT TTATATTGCT ATGCTCGTTG 720 

CCCAAGGGGT CATTATTGGC TTAATCGAAA ATATGATTCC TTACCCTTTC GCATTTGCTC 780 

CTGGGGCAAA GCTCGGTCTA GCAAATTTAA TTACCATTAT TGCGATTTTC ACCATGCGTA 840 

AgcGTGATAG TTTTTTCCTG GTCTGTTTAC GTTTGATTTT AACGACATTA CTGGGCGGCA 900 

CACTTTCGAC CTTTTTCTAT AGTGCCAGTG GTGCCTTATT AAGTTATTTC GGCATGTTGC 960 

TTGTCAAGCA GCTGGGGCCA AAACGGGTCA GCATTATCGG GATTAGTGCC GTTGGTGGTT 1020 

TCTTACACAA CGTTGGTCAG TTGCTAACGA CCTCTTTCTT TGCTCATTCG TGGGCGCCGA 1080 

TGCTGTACTT ACCAnTTTTA TCTCTGTTTg GGGTtACTCT CGGGGaTGCC ATTGGGgATT 1140 

GCCGCTaATn ATtACyACAA CATGTcCAGA CGTTACGAAA ATTCCAACTG AATTATGAAC 1200 

AAACAACGAA ACATAAGTGG AGTCAATTTT TCTAATTGAA AACCTGTCCG CCACAATTAA 1260 

AGGAGTCATC TTATGAATAT ACATCCTATG TGGAATACCT ATCCTACGTT ATCAAAAGAA 1320 

TTAACAACTA CTCTTCGCTT GATGGAACAA GCGGTTCAAA TCGATAATCA AGAGATTCAA 1380 

GCTGCTGTCC ATGACATGAT TCATTCTGGC GGCAAATTGC TACGTCCAGC CTACCAACTG 1440 

TTGTTCTCTT ATTTTGGTGA ACAGCGGGAC CCTAAAAAGG CGACAGCGTT AGCGrCTTCC 1500 

ATTGAATTAC TGCATACCGC CACGTTAGTT CACGACGATA TTGTCGATGA AGCTGATACG 1560 

CGTCGTGGCT TGCCAACATT GCGTTCGCGC TTTGGAAATA GCACAGCCGT GTATACAGGC 1620 

GATTATCTTT TTGTTTGCTG TTTCAAGCTC TTATCTGATT ATTCTTCTTC ATTAAAAAGC 1680 

ATTCAATTAA ATTCGCGCAG TATGGAAAAA GTCCTGACTG GTGAGCTTGG TCAAATGGAC 1740 

AATCGCTATA ATTTCGAGGT AACGATTGAC CAATATTTAA AAAATATTTC CGGGAAAACA 1800 

GCGGAATTGT TTGCGCTAAG TTGTTTTGTC GGTGCCTATG AAAGTGGCAC CTCCCAACGT 1860 

TTTGCCAAAC GTTGTGGTGA AATCGGCGAA AATATCGGAC TGGCTTTTCA AATTATTGAT 1920 

GATATTTTGG ACTATACACA AACTGTGACG CTATCGGCAA ACCCGTCTTA GAAGaCGTAC 1980 

GTCAAGGCGT GTATTCTTTG CCCTTGATTT ATGCACTTGA AGCCAATCGT GAAGTCTTGT 2040 

TGCCACTTTT AATGAAAAAA GAAGCCTTGA CCGATGAAGA AACACAGGAA ATTTATCGTC 2100 

TTGTCCATGA ACTAGGTGGC GTAGAAAAGG CCCAAGAATT AGCCACTCAT TACACAGAAA 2160 

AAGCGCTGAA AGAAATCAGC AAACTTCCTG AAACAAAGGC CCAAGCGAAA GAACAATTGT 2220 

ATGAAATTAC ACAAACGATT TTGACAAGAG AAAATTAAAA AAGCCGCTGG ATCCTATTGA 2280 

GGATTTAGCG GCTAGCTTTT TAGGTGATAT CTTGGTATAA TAAACACGAA GAGAGCGTAC 2340 

TAGACATACG CTCTCGGGTA GACCGTTTAA GACGGTAGCT TGATTTTGAT ACGGAAATAA 2400 

ACCCGTCAAT CCTTGCTAAA GATTATGACG GGTTTATTTT TTGTCGTTTT TATTACTGAT 2460 

TTTGACAATT AATGTTGTTA CTTATACAAC TTAACTACTT TTGATGCTTC ATTCCATCTC 2520 

TAATTTGTAT TACCTAGTGA ATACATGTTA TACTAATTAA AAAATGGAGG TGTTAGCTAT 2580 

GACATCAACA AAAACAAGAA AACAAGGAAA CTCATTAGTT ATAACAATTC CTGCTACTTT 2640 

AGGAGTTAAA GAAGGGGAAG AATTTGTTAT TCTTCGCAAA AATAATGGCT CCATCGCATT 2700 

GATTCCAAAA GTAGAAGATT TTTTTGAAAA TACTGCTGAA GGAGAGTTTT ATCTTCCTGA 2760 

ATTAGCGATT GATTATTCAC CTAGCGGAGG AGAAGTTGAT GGGCTATAAA CCAACTCAAA 2820 

GAGATATTGT TATTATCGAC TTTGCGCCTT CTAAAGGTTA TGAAATCAGA AAACGACGAC 2880 

CTGCTCTCGT AATGAGCAAA GACAGTTACA ATATTTCAAC AAATTTAGTT ATTGTCTGTC 2940 

CAATAACCTC GTTGGATAAG GAACGTCCTT TTCTTGTCCC TATTTATAGC GAAAAACTTC 3000 

ATACCTCCGA TAATGCAGTA AGTAAAGTGA ATACGCTCCa AGTCTATTCA CTCGACTATA 3060 

CAGAACAGGC ACAACGCAGA ATAAAATATG TAGATACGTT AGACGAAGAA ACATTTTACG 3120 

AAATTGCTCA AAAGTTTTTA CAAAACTTTT CTTTTGCTGT TTAATTCGGC TTCAACTGAA 3180 

CACGCATATA TGATTTACTA TTCGACTTTT CCGTTTCAAC TTGCTATAAT AAACACGAAG 3240 

AGAGCGTACT AGACATACGC TCTCGGGTAG ACCGTTTAAG ACGGTAGCTT TAATTTATTG 3300 

GTtAATAAAC CCGTCAATCT TCGCTAAAAG TTATGACGGG TTTATTTTTT GTCGTTTTTA 3360 

TTACTGATTT TGACAATTAA TGTTGCGAAA gcAATCGCAA GCATCAATGC TTCAAAGACA 3420 

GTCACTACTA TTCCTTTCTA GGGAAAGAGC AAACAAGCAA AAACATTGGC AGTCCCCCTT 3480 

TTTTTGAGAA ATTGCCACCG TCATAAACTT CCTACTTGTG TAGTATAACA CATTTGTACC 3540 

TTCCTAGAAT ACGCTTACTT TCCTGATTAT TCAACAGCCG CTTTCTTTTG TAATAAAAAT 3600 

CGCGCACCGT TTACTGGGAT AAAACGATTC AGCCAACTAA AATTTTTTAG ACTAAACAGA 3660 

CCGACCATCT TTTTTGATGG TCGGTCTGTT TAGTCTATTG CTCCGCCTTT ACTCCTTCTC 3720 

CGACAATGCT CGCCAGACAT CAAATGTTTG TTTTAAGCCT TGGACATTAT GAACACGTAA 3780 

AATTTCTACG CCTTGTTGTG CTGCATACAA GGAAGCAGCG GCTGAGCCAA AATCACGGTT 3840 

GGCTGGTTCT GGTTCGTTCG TTAAAGCGCC AATGGTTCGT TTGCGAGAAA CGCCGTATAG 3900 

TAACGGATAA TCTTGGTAGC GAAAGGCGTT AGGATCTTTT AACAGTTGAA GGTTTTCTGC 3960 

GACAGTTTTA TGAAAGCCAA TCCCTGGATC GAAACAAATT TTTTCTAAGG AAAGCCCATA 4020 

TTTTTGGCAT TGTTCAATTT TTTCTGTATA AAATTGGTGC AACTCTTCTT TTAAAGAAAG 4080 

CGATTGGCGT TTTCTAGAAT GCATAATCAC AACTTGGACT TCAGGATATT GTGCGAGGAC 4140 

TTCTGCCATC CCCGGAGTGT CTAATCCTTT GATGTCATtA ATCATTGTGG CGCCAgcTTG 4200 

GATCGCGGCT TCCGCTACTT CGGGATAGTA GGTGTCCACA GAAATCGGCG CTAGAGAAGT 4260 

CTCACGTATT TTTTTTATTA ATGGTAATAC TCGGTCTGCC TCAACTTGTG GAGAAACTTC 4320 

TTCGTAGCCA GGACGTGTGG ATTGGCCGCC AATGTCTAAA ATATCGGCCC CCGCTTCTAG 4380 

TAGATGAAGC GCATGTTGAT AGGCCACGTC GACCGTTGTG TAGCTACCGC CATCAGAAAA 4440 

AGAGTCGGGG GTCGTGTTGA CAATCCCCAT AATTTGGTAA TTCTTTTTTA AATCAATCAT 4500 

TGTCACATTG CTCCTTTATC TGCTGAATCA TCTTTCGTAA AGCCCGTATT CTAGGACTTA 4560 

TCTTCATTCG TTCACTCGTG GACAACTCCG CTAAGGTTTT GCCTCGGTCT GGTAAATAAA 4620 

TAATCGGATC AAAGCCATAA CCGCCTGTAC CGCGCGGTTC AACTAATTCA CCAGTCAAAG 4680 

TGGCTTCCGT TTGCAACAGT TTGTCGTCAT TGAGTACATA GACCAACGTG GCTGATAAGG 4740 

TTAACTCTCG GGTCGACTGT TGCCCTTCAA ATAAGTGCAA CAGTTCGCGG TTCTGTTCTT 4800 

CCGGATTGGC TGAATGGAAA AAGCGGCTTG TATGAATACC TAATAAATCA GGAAAAGCAG 4860 

TTAACGTTAA GCCGCCATCA TCACCTAACA CTGGGCGGCC AATTAATTGT TGGAAAAAGC 4920 

GAGCTTTTAG ATAAGCATTT TCAGCATACG TAGTGCCAGT TTCTGCAGGT TGTTCTTGAC 4980 

TGGTTGTATA CTTTCGATAA GAGACAAGTC GAATAGCGGG GTCCTTAAGT CCTGATTGCA 5040 

TTTCTTTTAA CTTACCTTGG TTATTCGTGC CAACAATGAT TTCCATTTTC TAACGTATCT 5100 

CCATTAAAAA TTGATTTTTC CACTCTTGTT CTTTAAATAG TCCGGTAAAT TGAAATGTTT 5160 

TGGTGCTACT TTGTGGTGTT TTCACGCCGC GCATCGTCAT ACACATATGT TCCGCTTCAA 5220 

TGGCTACTGC AATGCCTTTG ACGGGAATAT TTTCTTGGAG TTTGCGGGCA ATTGTCACTG 5280 

TTAAATCTTC TTGAACTGTG GGACGTTTGG CACAATGTTC TACTAATCTT GGTAACTTGC 5340 

TTAGTCCCAA TACTTTGCCA CCTTCTGGCA GATAAGCGAC ATGTACTTTT CCATAAAATG 5400 

GCAATAAATG GTGTTCACAC ATTGAATAAA AGGCAATATC TTTTACTAGA ACCATTTCGC 5460 

CTTCATTTAA GCTATCAAAT AATTTATAAT CGTCAAATTC AGGTTCGGTT AATCCACTAA 5520 

AAACTTCTGC ATACATTTTA GTACGCGTTT GGGTGTATCA AGGAGTCCTG CCCGTTGAGT 5580 

ATCTTCCCCC ACTGCTTCTA AGATTGTGGT AACTGCCTGC n 5621 


(2) INFORMATION FOR SEQ ID NO: 572: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572: 
CAnATATCCA ACAATGCTTG GAATCTCCTC CGCTTTCCAG TGTAAAATGT ACTTGGGTCA 60 

TGGTAAAAGT CCCCCTGGGG TATGGTTTTT GGTCGTTAAA AACCATGTAC CGTAAAAGGA 120 

CTGTTATATG GCCTTTTTAC TTTTACACAA TTATACGGAC TTTATCTAAT GATTCCCTAA 180 


CAGTATTTTA 


CAATTACTTT 


TnTAI I 1 bL-b 


PPPTTTPPPP AATAACCPCC TATCTTTTTP. 


240 


CATAAAAATA 


TTGAATTAAA 


ATGGTA1 Ibl 


PTAPPTAPCP BPTTTTPCAP ATTTTPAPAT 


300 


AGATCTCCTT 


TTTACCGAGA 


AGGTGGCCAT 


mfpmmmmf rprn/i 71 Q A ATT^S Zi li ZITCACTAZiAC 


360 


TrATATTATT 


ATATAtTAAG 


CTATAACTAT 


nTiPPBrTBTir BBPTHTTTTQ PPTT A AT A AC 


420 


CTAGCTCTTG 


CTTGGACCAT 


TAGCTCAGCT 


rrTTararra BflrrrTTfTiT AAPPrrTTpec 


480 


CCACAGGTTC 


GAGTCCTGTA 


TGGTCCATAA 


7\ nTATTTAPT P B p t 11 A P B AC CACTATTTPT 
AA I A I 1 1 AU I ortb i. AAL AAo urtVj irtl i i l 


540 


CATTTGAGAG 


ATAAGCCTTG 


TTTTAGTTGT 


nriTAATrnvr PTTT ATATRT TPCAACTTCC 
AG1AA1LLAL L-illAlAlAI l^jftrtbl iuo 


600 


ATATCATTTG 


AAAACCAAAT 


TAGTGATACT 


/■•tv /™"7\ /**"T>Ti r P r r ,f P TnTTTBSTRT PBBBBTPATfi 
CAGACTTTTI lAbllAAlAi U AAAA 1 L.A 1 A 


660 


AAATCTCTAT 


TTGACAGTGT 


TTGTTCACGA 


ti j\ 7\ 7\ 7\ 7\ t\ 71 A ArnrTPTTrB TTTBTAAPTT 

AAAAAT AAAA AbAb 1 b 1 1 LA J. 1 1 A i AAb 1 1 


720 


CTTGAAAGAG 


GAAGGTTATT 


TGTCTTAAAT 


GAATTTATAG GATAALbbi.1. LllAbJLAiAb 


780 


AATGTAAAAA 


AAGCCAGCCA 


GTATTTGGCT 


GGCTTCTGTG GTAAAbCbLA bbLbilllLL 


840 


GCCAGTTGCA 


CTTTAGGTGC 


GTGTTAGCAC 


TTGCTTCAAT AAAGAAliAA bAAJ-iAblAl 


900 


TATTATGCCA 


TAAAAGAAAA 


ACTTTGAATA 


GATATGGATA GCAAAAAALb AblbAAIAAA 


960 


ATTCGACTGG 


CAAGATAGAT 


TTTGGATAGA 


ji rnmr , »/ , a"rT Ti P P'A A A TITTTAP RRTTBTTATC 

ATTCCTTGAG GAAAAbllAb AAllAllAlb 


1020 


CCACCATGGA 


GGAATAGTTG 


CTATGATGTT 


m ti rp v™> t\ A A TVT'fpPTVT'TAPA BRTABPBPBBB 

GCTATCAGAA AIibAIlALA AAAAbALAAA 


1080 


AGTAATGTTC 


ATCGATTATA 


AAAAATGTTG 


a 7\rprprprpT\/-«7\ t\ r , r^ r VT\ TP" TP TAT CTCTATAACT 
AATTTTALAA LbiAlblbAi vjlbiaiArtbi 


1140 


ACATTTGCAA 


TCTCCTATAT 


TATTTGATAT 


(pppAPTfTPT PZ1BTCTTAAT CCACPATACA 
I bl_Ab i 1 1 b 1 oAAIollA/al VjVjrtov^riifWjJA 


1200 


TTAATTGGTA 


GAGCAGCGGT 


CTCCAAAACb 


BTPBPTBTAP CTTPAATPPP PATTPiTTPGT 
A I Lnlj 1 A 1 no oil v^/T-tt. 1 v^.^Vrf \*r\ l l o i i x 


1260 


AGCATTGCTA 


CTTAAATAAA 


AGAATCGCCA 


btt\PBTPTTT PTCPTC A ATT P ACC ATfiAHT 


1320 


CACTAGATTG 


CAAGTGTTCC 


TTTTTTATTT 


PPPBPABPBA TCAACAATTA Tf^TATAATAA 


1380 


CTTTTGCAGA 


AAAGTGAAAA 


GATAATGGCT 


BB.TPTPTTPA ATAAACCCCT nATCPPTATC 


1440 


GTTCATAGCT 


TTATCTCTAG 


ATGTAGTAGA 


r^BTPTPTPPA TTPCAAATAT TATTHPTAAT 


1500 


GGTTGCATTC 


GTAACGCTAG 


TTTTGCTGGG 


p b TTi P BPPPT ATCAAPAPAA AAAATAAPPA 
bnlnon^vul rtlonnLn^nn rtrtrtrtirtrtv^urv 


1560 


TCTCACTTGA 


CGGGGCGTAg 


CTGGTTATTA 


BBBPTBBTTT ATCAPACTAA PPCTPTTTTT 
AAAb 1 AA 111 A 1 unL/iu 1 rir\ 1 l i i l i 


1620 


AACGGTTCTA 


CATAAGGATA 


TGTTAATAGC 


ACATTTTTTT CTATATATAT TATACCATGT 


1680 


AACAAAATAA 


ATTCATTCAA 


GATTACTCAA 


CATAAGTGAG TGGTCTTTTT TGTGAATAAA 


1740 


AAAGCCACTA 


GGACAGGGAA 


CCTAGTGGTC 


AGGGTAGCAA ATATTATTAG TCGTAAGTGT 


1800 


ACAATAAAAA 


TAATTTTTAT 


AGGAATATTT 


AATCAACATC ATAATACTAT TATTATAAGG 


1860 


AGTTGCTACT 


CAGAATTATT 


ATATCAAATA 


CTAAAATAAT TTTGCAATAA AAAAGTTATA 


1920 


CACTAAAAAG 


CCACTAGGCT 


GTGGGCCCTA 


GTGACTAGGT AGCAAGCATT TTCAATTGGA 


1980 


GTTTTACACA 


AAATTTGAGG 


ATTATTATAA 


TACTTATCTG AATCGATAAa TTATCATATA 


2040 
TATAGGGAGT 


TGCTALbbAA 


Irtl 1 1 In 


rzp a a a a 


ATT TACT AAA 


TCAAGTACAT 


2100 


AGAATTTAGA 


G AAAb 1 AAAb 


AAA i riv_/i 1 H 1 


oil rtMV-rirlrto 


GATTAGCTAG AAATGTGGTG 


2160 


AGGCCATGGC 


AAAATATACA 


AAb i bbb 1 AA 


b 1 bA 1 bHHuu 


ATTAATTAAA 


ATTGAAAGGG 


2220 


TGAGCACGAG 


AGGGTCTCGC 


AGA1 b a Abb A 


A 1 Ay i Ab AAA 


ATATTGGGGA 


TTAAAAGACC 


2280 


AACCTATGAT 


TGaAAAAATA 


AATATTCTGa 


bA I I Ab 1 b Ab 


ACCCTAAAAA 


GaGAAAAAGA 


2340 


AGTTGTAGAT 


TGCCAAGTGG 


AAAACGCACT 


7\TTT7l A A A A 

A 1 i 1 AAAAb A 


GGTGCAGGCT 


ATGAATACAC 


2400 


CGAAGTTACA 


AAAGAACTGA 


CAGAAAATGG 


GAT. AAAAA1 b 


ACaAAAAAAG 


GTTACAAAAC 


2460 


AATTGAATTA 


TATTGAGATT 


TAAGGTTTTT 


A A T /-*/■*■ A A A f"' 

AATGbAAAbb 


ATGTCACTTT 


TAAATTTTTG 


2520 


ATCTGAAGGC 


ATAGCATAGA 


AAATTATTTT 


a A A A A rp T 1 rp rn 

AGAAAAi ill 


GTGTAGCCAA 


CAGGAAAGCA 


2580 


TCAAAAACGC 


GATCTTCAAA 


AGTATGCTGC 


ATGTGTTGTA 


GTTAACTTAT 


TACCATTTGG 


2640 


TGGCGTTGTT 


TGGGATATAC 


TTAATTGTGA 


tv tv ^ m /"* rn t\ tv m 

AAATCTAAlb 


CAGGTTTTAA 


TTAATCCGTT 


2700 


GCAATGAAAA 


AAAAATTATG 


GTTATGATGT 


TAAAATACTT 


TTAGATGTAG 


GGAAAAGAAG 


2760 


TTTAACTCCT 


GAACAATTTG 


CAAAATTCAA 


TGCTTTAACT 


ATTGCAGCTG 


GTATAGCAAT 


2820 


TACTGCTGTG 


GCTTATTAGG 


GAATTAATTA 


GTTGAGCGGT 


TATGTGAGTA 


TATACGATTG 


2880 


TTAGAAGAAA 


AAATAATATC 


ATTTTTAAAG 


TTTTTAAAAG 


TTAACCCTCA 


GAATAATTGT 


2940 


TTTACATGTT 


ATAGTTATTC 


ACACCCTATT 


TTAAATAGTC 


TTCCTTTTAG 


CCTTTTGGTG 


3000 


CAGTGTTTTG 


AGATAAAGGA 


CTATTCTGCT 


TTAGGGTTGG 


ATTCTGGATA 


TTTATTTACA 


3060 


ATAGAAGCTG 


AGraAAAATG 


CTATTATCAT 


ATTGCTAACG 


TAACAGGAAA 


TGATTTTTCT 


3120 


ACTAGAAATT 


TTTTCGAACG 


GTATTGAAGG 


T\ f\ /"-• » m TV TV TV *"n 

AACmATAAAT 


TGAATAAAAT 


ACTGGAAGAA 


3180 


ACAAGTGACA 


TACGAAATAT 


TATTATATTA 


ATAGACb TAA 


TGGGAGGAAC 


ACCTGCAAAT 


3240 


GTAACTATGA 


TCAAAGCAAG 


TATGGATAGA 


__/■*» A A rn rp /*"» A 7\ 

mGAAT. ibAAb 


TAGTAACTGG 


AATGAATCTA 


3300 


AGAATGGTAT 


TAGAATCTTA 


CTTTAAAATT 


A Arp/" , AA/ ,, AAA 

AA lb AAb AAA 


AACTTGCTGA 


GCATTTAATA 


3360 


GAAATAAGTA 


TAAGTAGTAT 


rf\ Tv T\ TV TV m TV rn 

TAAAATACCT 


bAb A 1 Ab Ab A 


TTAGTGGCGG 


AGAAGAGTAA 


3420 


TTATGATGTA 


ATAATCTTAG 


CAGCGGTGGG 


GTGTGAabiA 


TAT AC AAAT A 


rp rn y\ tv T 1 TV ryi rp rp 

1 1 AAi Al lib 


3480 


TGTTAAACAT 


TTTTTTGATA 


T AAG AA I b 1 1 


rp /->/-> a A T/^ 7i 7A 
b 1 bbAA 1 bAA 


ATTCTAGAAA 


TATTTTAGAC 


3540 


TATAAAATTT 


ATTATACGGG 


GAAAATACAA 


CTTTTTAGAT 


AAAGTCCGTA 


TAATTGTGTA 


3600 


AAAGTAAAAA 


GGCCATATAA 


CAGTCCTTTT 


ACGGTACAAT 


GTTTTTAACG 


ACAAAAACAT 


3660 


ACCCCAGGAG 


GGACTTTTAC 


ATGGACCCCA 


AGTACATTTT 


ACACTGGAAA 


GCGAAGGAnG 


3720 


ATTCCAAAGC 


ATTAnTGGAA 


TAATTCCGGT 


AAAGGGATGA 


CGGTTCCTAA 


AAAAATATTT 


3780 


TAACCAACGG 


GAATTTAATC 


CACCAAT 








3807 



(2) INFORMATION FOR SEQ ID NO: 573: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573: 

CGTTGCACGT GTCACACATA CTTGnACTTT GCnGGCACTC TTTAGAATCC TTGCnATGAC 60 

AAGATGnTGA TTGCTGTGGT CCGTGCACTT CAGATAACGA GGAAGCTCGA ATGACnGCGC 120 

TTCCCATCGT GACAACATCA AATAATTTTT TCGTCCGTAA ACGGTTTCTA ATCTTTTGGT 180 

CTTGTTCCAC GTTGGCGTTT GCGTTCTTCA CGTTCAATGC GTAGACGTTC GCGTTTTTCT 240 

TCAGGATTTG GATCACTGAT CACTGTTCCT GCTTCAATTT CTTCTAACGC ACGACCTACA 300 

CTTTTTACAG AATCAAAGGA TTCAACGGTT GGTTGAACGC CTTCATCTAA TTCATGCGCA 360 

CGTTTGCTTG CTAAAaTAmC CAAAGAaTAT TTkGAAGGGA CTtCTTTTAA CAATGAGTCA 420 

ATTGACGGTT TTmACATCAT AAGGCTACAT CTCCTCTAAC ATTTTAATAT ATTTTCCAAT 480 

TACACGGTCA ACACGGAAGT GCTnCACTCG CGATAATGTC TTTAATTCGT TGCACGGCTr 540 

AAGGAACTTC ATcATTCACT ACCGCATAAT CATACAGCGC CATCATTTCG ATTtCTTCTC 600 

TGGCCACAGC CATTCGkTCT TCAwTCaCAG ACATTTCGTC TGTtCCGCGG CCAaTAATTC 660 

TkGATTTTAA TTCCGCTAAA TCGGGGGGTG TCAAAAAGAT AAATACGCCA TCAGGCACTT 720 

TGTCTTTGAC TTGTTTAGCC CCTTGGACTT CAATTTCTAA AAAGACGTCT TTGCCTTCAT 780 

CTAACGTCTG GTTAACATAC GTTAATGGTG TCCCATAATA GTTACCAACA TATTCCGCAT 840 

ATTCCAACAT TTCTCCAGCT TCAATCATCG CTTCAAATTC TTCTCTTGAA CGGAAATAGT 900 

AATCTACTCC ATCCACTTCG CCTTCTCGCT GTTTCCGAGT GGTCATTGAG ATTGAATATT 960 

GAAAATCATT TTCTTCACTG TCGAAAATAG CCTTTCTTAC AGTCCCTTTC CCCACTCCTG 1020 

aAGGACCTGA CAATACGATT AATAACCCGC GCTCTGaCAT AACGACTTCC TTTCAAAACa 1080 

TAATTTCACA GTATTAGTTC yAwTTTGcAC TTTTTTTCTG CAATTTTCAA GACTTyCTtG 1140 

GAGATtCACT GAAAAAATAm CATTTCTTCT TTAATTTGGC TTTCATAACT TGTTCTTTTT 1200 

TTAAACAAAT TACCTAGAAT TCCTATACAG GCAGCCATTT TTTAAGTAAA ATGAAGTGAT 1260 

GACTCCTTTA GGATGAAAAA ACGTGAATAA GAAAGGCTGA GAAACATCGC ACGAATTAAA 1320 

CCATTTACAT TGTTTCTGCT AGGCTTAGTT GTCAGTTGTG GGGCTnTTTT GCCCAACCTA 1380 

CTGTGACTCA CGCAGAAGAG ATATAC 1406 
(2) INFORMATION FOR SEQ ID NO: 574: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574: 

CATTAATCGG CCTACACACC CATAAnGCCn TTTCTCTATA AGCAGTAAAC GCCCTAACTG 60 

ATTATTTTAC GkTGCGtATc TGtGaCCAAT tAGGTtATAT AaTAACTGTt GGTACGTtCC 120 

tTTTCCGTTg ATCCATCACC CATAAGCCAG ATTGATTTTT yCCTTGAATA GJcTCCCGTTA 180 

ATCGCATCAC TTATTACCTG CCATAACAGA GTAGTTATTT GTGCATCTCC GCAAGCAATT 240 

TCGTTCAACC AGCTCTCTAC ATCCCAATCA TTAAATTTAG GTAACGACGG ATTTTCTACA 300 

TATGCGGTTG CAATTTTTGA TGTGAATACA TAATTGGGCG TAAACGGTTG TAGTTGTTTC 360 

GTTTTAAGGT TAAATACCCC GTTTTTCACT GGTATAAGAT ATCTTGAGAC CGTTTTAGGC 420 

TTTATTTTAG CTTCTTTCCA AATGTGAAAA ATAACATCAT TTACsGCTCn AGCATTGGGG 480 

TTCGGGTCAG CCACCCAATC ACTCGGTTAA TTGGGTTCAT TGGGGAAAAA GCCTCTTTGG 540 

CGAACAAGCA ACGnGATTCT CCAA 564 
(2) INFORMATION FOR SEQ ID NO: 575: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575: 

AACTATGCTG AGTCTAGGGT TATTATTAAC TGGGTGCGGA CTCAAGGAGG CACAACTGAA 60 
AAATCCAGCG AACGAATTAG TAACAAACCA GTTAAAATTG TCGCTGTTGG TTCCACCGCT 120 

TTACAACCGT TAGTAGATGC TGCGAAAGAT CAATTTACAC AAGAACATCC TAATTATACA 180 

GTTTCTGTCC AAGGTGGCGG CAGCGGAACA GGACTTTCGC AAGTTGCAGA TGGCGCAGTG 240 

ACCATTGGAA ATTCCGATGT GTTTGCCGAA GAGAAAGATG GCGTGGATGC CTCTAAATTA 300 

GTCGATCATC GGGTGGCCGT GGTCGGTATG GGACCAGTGG TCAACAAAGA AGTCGGCGTG 360 

AAGAATTTAA CGAAGCAACA ATTGATTGAT GTCTTTACTG ^CAAAGTCAA AAACTGGAAA 420 

GAAGTGGGCG GCAAAGATCA AGAAATCGTC GTAATAAACC GCGCAAACGG AAGTGGCACC 480 

CGAGCAACAT TTkAAAAATG GGGCTTAGAT GGAGCTAAAC CAGTTCmATC ACmAGrACAA 540 

GtTCTTCgGG AcAGTTCGTA AAATTGTTGA ACAAACACCA GGaGCAtCAG CTATTTAGCT 600 
T 601 

(2) INFORMATION FOR SEQ ID NO: 576: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576: 
TCATGGAAAC TGCAAGAAAG CnTAAnAAAT TCCTGAAGTG GAAAGAATTC TTTAGACTTT 60 

TCTCGTTTTA ATTTAACTTT TCAATAAACC ACGATCATTT TGACAATTTC CGCATAGAAC 120 

ATATTGCCAC CATTTTGATA AAGGTAACCC CCATTGTAAA CTAACGCTTG CCAACGATAA 180 

TAACGATATT TCGTATTTTG GTCATTTCCT AGCAACGGTG AAAGGACTTC TCTTGAATAT 240 

TCTTCTGCTA ACTCTGTCGT GTGTTTCCCG CCGCGTTCTT TAACAAACTG AATATAATCC 300 

AGTCCATAAT TATATGCTTG TACGCCTGTC CACTCATCAC AGCCAGCTTC TTTTGCTTTC 360 

GCTAACGCTT CCGCGAGAAA TTTGACGCCA TGTGCGATAC TTTCCTCTGG TGTTTCAATC 420 

ACTTCTTGTT GTCCATAAAC ACTTTCACTG CTCTGCATAA TATCGCCAGA TTTCCCTTTA 480 

GATTCTGTAT AGATAATCGC TAAAGCTAAT GTTTTGTATT CAGGTATTTT ATTTTTTTTG 540 

ACTTCTGCTT CTACTTGTTT TTCAAAGGTC ATTACCGTTT CAACATTTTG ATGTATTTTA 600 

TAGCCAAGAT AGCCAAGACC AGCAACTAAA CCAACAAACA AAAGTAGCAT GATTTnATAA 660 

ATTATTTnTC TTACTCTACG CATCGAATCA TCCAAACTTT TTCTTTTA 708 
(2) INFORMATION FOR SEQ ID NO: 577: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577: 

TTGAAAAAAT CACAATTTAA TGATTTATCC ATTACGGAGA TGCACACAAT CGAAGCAATT 60 

GGTATGTACA AGAAAAAAAC GTCTTCAGAA GTAGCCAAGG AACTGTCAAT TACCGTCGGA 120 

ACACTTACTG TAGCGATTAA CAACTTAGTT AAAAAAGGCT ATGTTGAACG TTTGCGTAGT 180 

GAAGATGATC GACGTGTAGT AAAGCTTGGC TTAACCAAAA AGGGAAAATT ACTTTTTAGA 240 

GTCCATCAGC ATTTTCATCG TGAAATGGTT AAAAACATTT TGAAAGGCAT GGAGCAAGAA 300 

GAAGAACAAG CGCTGTTAAG AGCACTAAAA AATCTTCATG ATTTCTTGCA AGAATACAAA 360 

TAAAAAGTGA GGATCCCCAT GaAGAATTAT GCACGAATTA GTTGTACAAG CCGTTATGTT 420 

CCAGAGAATT GTGTCACGAA TCACCAACTG AGTGAAATGA TGGaTACCTC TGATGAATGG 480 

ATACATTCGC GAACAGGCAT TAGTGAACGT CGCATCGTTA CGCAAGAAAA TACGTCTGAT 540 

TTATGTCATC AAGTCGCAAA ACAGTTATTG GAAAAATCTG GAAAGCAAGC AAGTGAGATT 600 

GACTTTATTT TAGTTGCAAC AGTGACACCA GATTTCAATA TGCCTTCTGT TGCTTGCCAA 660 

GTGCAAGGCG CAATTGGCGC CACTGAGGCC TTTGCTTTTG ACATCAGTGC AGCGTGTTCT 720 

GGTTTTGTCT ATGCCTTAAG CATGGCAGAA AAACTAGTCT TAAGTGGTCG TTATCAAACA 780 

GGTCTAGTAA TTGGTGGCGA AACATTTTCC AAAATGTTGG ATTGGACCGA TCGCTCCACA 840 

GCGGTGCTTT TCGGTGATGG GGCTGCCGGC GTCTTGATTG AAGCGGCTGA GACACCGCAT 900 

TTTCTAAATG AAAAATTACA AGCAGATGGA CAACGGTGGG CGGCTTTAAC GTCAGGCTAC 960 

ACGATCAACG AAAGTCCTTT TTATCAGGGG CATAAGCAGG CGAGTAAGAC GCTACAGATG 1020 

GAAGGACGTA GTATTTTTGA TTTTGCGATT AAAGATGTTT CACAAAATAT TTTATCACTC 1080

GTGACGGACG AAACCGTGGA TTATCTTCTA CTACATCAAG CAAATGTGCG CATTATCGAT 1140

AAAATTGCAC GTAAGACGAA GATTTCCCGC GAGAAGTTTT TAACGAATAT GGATAAATAT 1200

GGCAATACCT CAGCTGCTAG TATTCCTATT TTGTTGGATG AAGCAGTTGA AAATGGGACC 1260

TTAATTTTAG GTTCCCAACA ACGCGTGGTG CTAACAGGAT TTGGCGGCGG CTTAACATGG 1320

GGCTCACTGC TCCTAACGCT GTAAAAAACT GTTCACATAT AAATGAAGAA AATGACTATT 1380

TTGGAGGAAT ATACACATGG TATTTGAAAA AATTCAAGCA ATTATCGTAG AAGAATTAGG 1440

AAAAGATGCT GAAGAGGTAC AATTAACTAC AAACATTCAA GAAGAATTAG ACGCGGACAG 1500

CTTAGATTTA TTCCAAATCA TTAACGAAAT CGAAGATGAA TTTGATGTGA AAATCGAAAC 1560

AGAAGATGGC ATCCAAACCG TTCAAGATTT AGTAACATAT GTAGAAAAAC AACAAGCAAA 1620

CTAATACATG AAAATAGGGG GCTGGGACAT TTAATTGCTA AATGACCCAG TCCCTAAAAA 1680

AACAAGTAAA TAAGCCGTAA AAATCCTTTT TTACTTGAAT GAAGTGTACT TATCTTAGAA 1740

CTAAAGGACG TATAAAATCA ATGAATCAAG AGTTATGTGA GTTGCTTGGA ATCAATTATC 1800

CCATTTTTCA AGGCGGTATG GCTTGGGTAG CCGATGCTTC ATTAGCAAGT GCCGTGTCAA 1860

ACGCTGGTGG ATTAGGGATT ATTGCTGGCG GCAATGCCCC AAAAGAAGTC GTAAAAAAAG 1920

AAATTAAAAA AGTTAAAGAA TTAACGGAGC AACCCTTTGG TGTCAATATT ATGTTACTTT 1980

CACCTTTTGC CGATGAAATT GTCGATTTGG TTTGTGAAGA ACAGGTTCCT GTCGTAACGA 2040

CAGGTGCAGG CAATCCACCC AAATACATGG CTCGTTTTAA AGAACATAAC ATTAAAGTAA 2100

TCCCAGTAGT TCCTTCAGTT GCTTTAGCAA AAAGAATGGA AAAAATTGGT GCCGATGCTG 2160

TCATTTTTGA AGGAATGGAA GCTGGTGGAC ATATTGGCAA GTTAACCACT ATGAGTGGCT 2220

TACCGCAAAT CGTTGACGCT GTGTCAATTC CTGTGATTGC AGCAGGTGGG ATTGGTGATG 2280

GTCGTGGTAT GGCTGCGGCC TTTATGTTAG GTGCTGAAGC AGTCCAGTTA GGCACACGTT 2340

TTTTAATTGC CAAAGAATGC AACGTTCATG CAGATTATAA ACAGAAAGTT TTAAAGGCAC 2400

GTGATTTAGA TGCAGTAATT ACCTGTCAAC ATTTTGGCCA TCCAGTGCGT ACTTTAAAAA 2460

ATAAATTAAC CGCTCAATAT AATCAATTAG AAAAGCAAGA ACTCCAAAAA GAAGTGCCTG 2520

ATTTAGAAAT GTTTGAAAAA ATTGGTCAGG GCGCTTTGCG CAAAGCTGTC GTTGACGGGG 2580

ATATGGATTA CGGTTCCGTC ATGGCGGGAC AAATTGCCGG GTTAATAAAA AAAGAAGAAA 2640

CAGCCCAAGA AATCATTGAT TCACTCATGT CTGAATGCAA AGCGATTGTA CATAAGATGA 2700

ATCAGCGTTG GGGCTAAGTG AATAATTTTA ACAAAAAATG AGGTGTCGTA TGAAAACAGC 2760

GATTTTATTT AGTGGACAAG GAGCCCAGTA TCAAGGGATG GGTGAAGAAT TATATCACCA 2820

AGAAGCGATT GTTCGGGAAA CTTTCGATGA AGCAAGTCAT ATCTTAGGTT ATGAGATGGC 2880

AGAACTTTGT TTTACTGAAA ATGAACGTTT AAATGAAACA GAATATACGC AACCTGCTAT 2940

TTTAACAGTC AGTGTCGCAT TTTACCGTCT TTTGCAACAA AAAGGACTAA CGCCTGATGT 3000

CGTAGCGGGT TTAAGCTTAG GGGAATACAG TGCTTTGGTT GCCAGCGGGG CTTTGCGCTT 3060

TTCAGAAGCA GTGGCCTTGG TCCaAAAGCG CGGTCAGTAC ATGACAGAAG CAGCACCACA 3120

AGGAACTGGC AAAATGGTTG CTGTTATGAA TGCTGAGCGT GAAGTAATTG AGAAAGCCTG 3180

CCAAGAAGCC AGTGCTTTCG GAATTGTGGC TCCAGCAAAT TATAATACAC CACAACAAAT 3240

CGTGATTGGT GGTGAGGTTG CTGCTGTTGA TCAAGCGATG ACACTTCTCA AAGAAGCTGG 3300

TGTGAAGCGA ATGATTCCGT TAAATGTGAG TGGCCCTTTC CATACGGCGC TGTTACAACC 3360

AGCATCAAAA AAATTGGCTC AGGATTTAGC AAAATTGAAC TTTCAAACGA TGCAAATTCC 3420

TGTCATTAGT AATACGACTG CCGAAATTAT GCCCCAAGAG GCAATTCAAG CGTTATTGGA 3480

AAAGCAAGTC ATGTCTGCGG TACGTTTTGA AGACAGTATC GAAACGATGA AGGCTATGAA 3540

CGTAGGAACG ATGATTGAAG TTnGgtCCAG GGrAAACATT aACyGGtTTT GtTaAAAAAA 3600

TTGACCAAAC AATTGAAATG CACCGTGTGG AAGATGTTGC CACATTAACA GAAACG 3656



(2) INFORMATION FOR SEQ ID NO: 578: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578: 

CCACACTTCa sCsTCCTGTA TAAGAGGAAT CAGATTAAAT TCTATAAACA TCTTTTAAAT 60 

GTCTTTGCTC CAAAAAATTG ATGTCTAATC TTTAGAAATC ATTGTCTATC AGAATTATTC 120 

TTAAAAAGTT TGATTTTTTT AATCAAGAAA GAGCTAATTT CTTCCAGAAC TCATATCATC 180

ATATTTTGCA ATGGCTAAAC AACTCAAAAT CGTTATACCA AGCAAACATC CTATAAACAT 240

GCCTAACAAA AATAAAAACA TGGTTAACGC CACCTCTACA TGaTTCAATA GACTTAAATA 300 

AAAAAAATGr AGTAATCTTT CAGTATTTTT TATATTTTTT TACAAAAGTA GTTAAGTGTA 360 

tTTGAACATA TCTTTTGaAC AGTaATaATA AAGAGTGGtA gACGGAGCGG TT 412 
(2) INFORMATION FOR SEQ ID NO: 579: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 

ATCCATGACT GTTTATCAAA CGTAGCTTTA CCTGTAGCCT GTTCTTTTCG TTGCTCGTTT 60 

ACTTCTCTTG ATTCTAATAG TTGGAAGCTT TCTGTGACAA CCTCAGTTAC ATAAATACGC 120 

TGGCCTTGTT GATTCTCATA GTTTCTTGTT TGAATGCGAC CAGTTAAACC GATCAGAGTC 180 

CCTTTTGTTG CATAATTTGC TAATGACTCT GCAGCTTTCC GCCAAATAAC ACAGTTGATA 240

AAATCTGCTT CCCTTTCATT GTTAGCATTG GTAAAGTTGC GATTAATGGC CAACGTAAAT 300

TGACCTACGG CTGTTCCACT TTGTGTATAC CTTAAATCTG GGTCTTTTGT TAATCGTCCA 360

ACTAATGTAA CGTTATTAAT CAAGATTATT TCCTCCTACT CCTATAGTCA TTAATATTTT 420

CTTGTATGGA TTtAATAAAT CATGCTATCT AATAACTCCT AATCATTATA ACATGTGCTA 480

ATCTACTTTn CTTTTATTCT ACTAGTCTGA TTTCCCCTCG CAATCTCTCT TTAAACATTT 540

TATTTTCTAA CTTTGTATTG GACTTTTTCT TTATGGAATT GGACAAGTAG CCCATGTTCA 600 

TATCTTTTTT TCTTTTGATA CAAGAAGTTT CCTCGACCAA TTAGaTAATC AAGAAAGCTC 660 

TTATTATCCC ATTTTAAAGA AAAGAGCGTC CACACAAGGA CGCCCACTCC TACTATAAGA 720 

ATCAACCAAA AATTAGCAAT AATTGTGGTA AAAAAATTGA TTTTAGCGAC AAATGAAATA 780

ATCGCGAATA TCAATAAGAG AATAAGGATA CCAATACCTA CAAAAATTTG TTCCAATCGA 840

ATGCCGTTTG GCGCAAAAGG CAATGCTACC CCTTTGATAC TGTAAATCTT GTGAGGTTGT 900 

TTAAATTCTC 910 
(2) INFORMATION FOR SEQ ID NO: 580: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 



CCCnGAAAAA AATATTCAGT TTGATCAGAT TGATTACACA aTGTACAAAm CArGTsCCGG 60

TATCCAATTA ATCAAAmCAT CTGAATATAA TAAACTCGCA AAATTTATAG ATGAACCAAC 120

TTTAACAGGT GAGAAAGTAG TTAATTTATC ACAAAAAGAT GCTCCTTCTG CTGCATCAGT 180

AGATGTCAAA TCTGTTCAAT TGGCAAATGA TCAAGTTTTA AACATTGATA AAACAGAAGA 240

AACCAATGTA TTACCGACTT TTGGCGGTGT GGTTGTTATC CCAGATAGTA CCTCAACAGA 300

TGGAATCAAA GCGGAAACAG AAACAATTTG GCAGCCCAAA GCAGGAACCA ATCGTGAAGA 360

ATTGATAAAA GCAGAAAATA TTATTGGTGA CAAGTATCCC ATGTTAGTAG GAACAGCTGC 420

AATGTCGGCA GCCATTACGC AATATTATAC GCCAGTTCTT TTTGTCGGAC TATTTATCGG 480

TATTGTGTTC TTTGTTTCGG CGGGTAGTTT CTTATACTTC CGTTTATACA GTGATATGAA 540

TACGGACGTT GAAAAATTCA AGATGATTTA CAAACTTGGC TTGACGAAAA AAGAATTGAA 600

GAAAATGATT TACCAACAAG TAGGTATTTT ATTCTTTACC CCAATTGTTG TTTCATTTAT 660

TCATGGAGCT GTTGCTTTGA AAGCAATGTA TGCTGTCTTA GATCAGCCAA TGCAATTAGC 720

TGGCTGGGAA GTTTTAGGCG TTTTCTTATT GATTnCAGGT CGTTTACTAT TTAATTGCCA 780

GAACGTT 787



(2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581: 



TTCCCGGGGC CAGCACCAGA TAAATTATGG ATCGATTAAT GTCCTACCAC TCCTTGGATn 60

TTCAACTrGG CAAtCATGAT ATGATTTGGC TGGCTGCTTA TAGTGGTTCA CTGGCTTGTT 120

TAGCCAATGT AGTACGTATT TGTGCTCGCT ATGGAAATTT AGATTTGTTA GAAGAACGTT 180

ATGCTATCGA TTTAACCGCC TTAAAAAAAT TTAGTTTAGA AACGTATAAA GAAAATCCTG 240

CTTTTGCACC CAAGAAAAAT CCCTATCGTG CGTTAACTGA AGCTGAAAAA CAAGTAGCGA 300

TGCGTGTGCA CAAGCCATTG CCATTATTCA AGAAAAACTA GAAGGCCAAA TTATTGGTCG 360

ACGCCCTGaT TTTAACTTGG CACATCGGtA CGCTTAGATA AGATTCAAGG GGaAACTATT 420

ACTTTTGACG AATGTCGTTA TACATTAATT AACAGCTGTT TCCAAACCGT TTCTGAAGAG 480

CAACCTTACC AATTAACACG TGAGGaAAAA CAGATAATTG ATGATTTATT AACGCAATTT 540

CAGTCTTCAC CGCGGTTAAC AAAACACATG CGCTTTCTAA TGGAAAAAGC TTCTCTTTAC 600

CTTGTGTATA ATCAAAATTT ATTGATTCAT GGCTGTCTGC CTCTGAATGC GGACGGTACT 660

TTTCAAGCTT ATACATTTAA AGGCCACTCC TATTCAGGAA AAGCTTTAGT TGACTTCTTT 720

CAGGAAATGT TAGAAGAAGC TTACGCTCAG CCTGCTAGTA CAGATGACTA TGCAACAGAT 780

TGTTtATGGT ATCTTTGGTG TGGCGAAGGT TCCTCATTAT TTGGTAAACG AGCTATGAAA 840

ACCTTTGAGC GTTATTTTCT CGCTGAAAAA GAAACACATT ACGAAGAAAA GAATCCTtAT 900

TACTCTTTGC GT 912



(2) INFORMATION FOR SEQ ID NO: 582: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1641 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582: 

GAAATATCTT GTGCTTTTTG TTCAAGACTT GCGCATCCTG ATAATAGCAA AGTGGCTGCT 60 

GTTAGCAGTA GCCACTTTTT TCCAATAAAT AGTTTCATTT CCTAACTCCC TAACTTTTTT 120 

TCCGTTTTAA AGAATACTCA ATTTTCTTTT TTTCTAACCT ATCAAACGTT AATCCTTCAA 180 

CAGCTCGTTC CCATTTTGTT TGTTGTTCTG TACGTTTCTT TGTACTGTCT GCATATATTT 240

TTTGTAAATC TCCATTAACC ATGAGTCACT CCTCCTTAAA ACGGTAAGTC ATGATCTGAA 300 

ATCTCAATAG AACTACCTTC TAAATCTGGA AAGGGTTGTT GCGTATTGCT TTCCTGCTGC 360

CCAACTGGTG TAGGATTAGA AGCGTTAACA CCACGTTGTT GACTAACTTC TTTTGACTCT 420

AACAATTGGA AGCTTTCTAC AACCACTTCT GTCACATAAA TACGTTGTCC TTGTTGATTC 480

TCATAGTTAC GTGTTTGAAG TCTACCCACA AGACCAATCA ATGTTCCTTT TCGTGTATAG 540

TTTGCTAAAG TTTCTGCAGG TTTACGCCAA ATCACACACT GgAATATAGT CTGCTTCACG 600

TTCACCATTT TGATTCGTAA AATTCCGGTT AACkGCTAGT GtAAAAgCsG CAACAGCTAc 660 

ACCaCTkGAC GTATAACGTA AATCACAATC TTTTGTAAGT CTTCCTACTA AAGTCACATT 720 

ATTAATCATT TAAAAAACCT CTTTCTATTA TTTTTTTAGA AGTGATAGCA ACTGGTACTA 780 

TCACTTCTAA ATATTTACTG TTTTACTAAA CCACCAAACA TATTTTTAAA CTTGATTTCA 84 0 

CCAAACTCTG AAAGGAACGG ATCCGTTTCG TTTAAGCCGT AAGTGTATTG GTAATTCTCT 900 

ACAATACTTT GATAATCCAT TTCCCAGTCC TCTTCATACG GAGATATATA AAACAAGTAC 960 

AACATTGTTA TATCCCCCAT ATACTTTTGT TCAATCGCAT AGTATACAAG TATATTGTTT 1020 

TCTTTCTCGA TAATTTCGAT TGCTTTTTTT CGTTCTTCGT TGATCCAATA GTTGGCGCCT 1080 

AGATATGATT TTCCCGAAAA ACTAATTTTT CCTTCCTCAA TAAATTGTGC AATTGTTGGT 1140

TCATGAATAC CTAAAATCTT CATTCTTTTT CTTGCTTCCA TTTTCATTTT TtCTAAATTC 1200 

AATTGAATCC CTCGTTTCTG TTTATCTTTA TAATCATTAT ACCATACATT GTATACAATG 1260 

CATAGTATAA TAATGAGGAA TATTGACAAA TTTTGGkTTT TTATATACTA TATTTTGTAT 1320

AGTATATATA GGAGGGTAAT TCAATGAAAG ATCTTTCTCG TGAAGAAGTT TTAACTTATT 1380 

TAGAAAACAA TGTGGTAGAT AAACAAGGTG CTGCAAAAAT TACAGGCCAA TCATTAAATG 1440

CCTTCACACA ATCCGTAAAA TTGAATGCAA TTAAGCCTTA CTTTGAAATT AAACATGTAA 1500 

ACGGTGAACG ACCTACTGTA CGCTTGTATC ACGTAGACGA TTTAAAGGAA TACGCAAAAA 1560 

ATAAACGAAG ATAAAAGAGA ACTCCCTTCT TTTATTCTCG TTTATTTTTT TATATAAAAT 1620 

CCTTAGTTTA TTTTAnGACT A 1641 
(2) INFORMATION FOR SEQ ID NO: 583: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583: 

TGTGCCCGAC CATTACGAGA GAGCGCACAC GATGCTGCCA TGGCCGATTG AATGAAAAGA 60 

CGnGCAGnTT CGTTTTGATA CCAGGAACTG GAGAGAACGG nACAGCAAGT GAAGAAGTGG 120

CTAGAGGAAC AAATTGGAAA GAAGTGAATA TTTTTTTAAT GGCGTAAAAT GTTTTGAAAA 180 

TTTCAGAAAA AACAGTGAAA GTCTTkGTTT ttATTAAAAa aTGGCAAaGa TTCAaCGAAG 240 

AmCCAGTTCT AAaGrCCTaG TAATaGCGCA ATTTCATGtT ATAATATAGG CAACAAAAAT 300 

TATACAGAAT AGAAGAGGTG AACGTTATGC GTTGTCCaAG ATGTCATCAT AATAATTCCC 360 

GTGTTATTGA TAGCCGTCAA GCAGATGATG GACGTGCAAT TCGCCGTCGT CGTGAATGTG 420 

AAAATTGCAG TTATCGGTTC ACTACATTTG AAAGAATTGA AGCGGCGCCA TTACTTGTAA 480

TCAAGAAAAA TGGGGATCGT GAAGAATTTA ATCGTGATAA AATTTTGCGT GGGCTCATTC 540

GCTCTGCGGA AAAAAGACCC GTTGCTATGG AACAAATGGT TCAAATTGTC GATAATGTTG 600 

AAAATCGAGT GCGTAGTTTA GGGGAAAATG AAGTCTCTAC AACGCTTATC GGTGAATATG 660 

TGATGGAAGA TTTAGTCAAT CTAGATGAAA TTGCCTACAT CCGCTTTGCG AGTGTTTATC 720 

GACAATTCAA AGATATGAGC GTGTTTTTGA AAGAGTTACA GGACATTGTC GACAAAGCAA 780 

AATCTTCTTC TCCTGACTCA GAAAATTAAA AAGGAGGTCT TTCTTTGAAA GGAAATCGGC 840 

AAAGAATCCA ACCAAAAAAT ATTTTTCAAG CGGTTATTGA TTCACCGCTT TCTGATCAAG 900 

AGAAAGAGGT TTTAACCTTT TATATCAACC AA 932 
(2) INFORMATION FOR SEQ ID NO: 584: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584:

TAAACCGACA GCACGGCCTG AACCGTTCAT CAAAGCAAAA GCGACAAAGA TAAACCAGAT 60 

ATTTACCCAA TAGAACATGC CAGATAAACA GAAGGCAATT GTTAAACCAA TTGTAGCAGC 120 

ACTTGATAAC CAAGTGAATA AACCAGCAAT CAATGCTAAA gCAATACATA CTTCAACGAT 180 

TGTCATGAAT TTTTGCATGA ACAAAGCAAC TTCTTGGTTT GGCATCATAA ATTTCATTAC 240

GCTTTCGAAC CATTTAGGCA TGTGATCAAA CACTTGCATT GGTGTTTCGC CGTAAGCGTA 300 

gcTTAAAcCA AAGTGCGCTG CTTGTGTTGC CGCATCGGCA CCACCGCTTG CCGCCGCATC 360

CGCCGCACCA CTTGCAGCaG TTGTTGcTGT TTCAGTTGCT TGAGAAGCAC CTGTTGTTAC 420

TTGTTCTTGT AACCAAGGGA ATGGGAAAAC GACTTTATCT GTGAACCAAG AACCGTCACC 480

AAACCAAGTG CTTGGTTTCA AGTAATCGCC GTTGCCGACA ATTTTCTTCA TTGATTCGAC 540

TAACCAAACC ATACCATAGA AGACACGTAG CGGAACACTC CACAAGACAT TGCCGTAACG 600

AGACGTATGT CCGCGAGCAA CAGAACGATC ATCTTTAATA TGGAAGATTT CGTGCATAAT 660

GTATTGGAAC ATGTAGTAAC CAGAACGAAT ATCAAAGAAG TATTTCAAGT TAACGATATG 720

TTTCATAATG ATGGCTAAGA AACCACTCAA ATGAATTTTA TCGAATAAGT T 771 
(2) INFORMATION FOR SEQ ID NO: 585:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 623 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585:

TTTCTCTTAA ATAATGAAAA AAGTCTGATT TTTTTGCAAA ATAGTGATTA AAAGTCAAAA 60

ATATACGCTC TTAAATGTCA ATAGGACCTT TAAGAGCGAT TGTTGCTTTA GTTTTGTCCT 120

TGAATTTTTT CTAATAATTC TGGATCAAAT GTTTGATTTT TTAACATCGC AATTTCAAAG 180 

CCGTAAGGGg CCTTTTTATT TTTTTTGTCT TCTCCGACAT AAGGTGTTTC TAAAATTTTT 240 

GGTAAAGCAG TTAATTTTTC ATGGTGAACC ACACCGTTTA AAGCTTCAAA ACCAATAGTG 300 

CCAAAGCCAA TATTGGCATG ACGATCTTTA TGTGAGCCCA TAGGATTTTT AGAATCATTC 360

ACATGAATAA CTTTTAAACG ATCCAATCCG ATAATTTTAT CAAATTCAAC TAATACGCCA 420 

TCAAAATCAT CTTTTACATT ATAACCTGCA TCaTTAATGT GGCAgTATCC AGAGTGACAG 480

AAAGCyTATC ATTTAGCGTA ACTCCGTCAA TAATCGCAGC TAGTTCGTCA AACGTACGGC 540

CGATTTCGGT TCCTTTACCA GCCATGGTTT CTAAAGCGAT TTGAGGGATT TGCTCTTTCC 600

nCAAGACTTC GTTTAGACCT TTA 623 
(2) INFORMATION FOR SEQ ID NO: 586: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586: 



CCGCCTTTCT CTAAGnGTAA AACAAGCCAA TGTTTCATGT GAACATTGGC TTGTTTTACT 60

TACTATAATC ATCGCACTTT ATTtGyCGTt CTCTGCGGTA GTCGTATTCy TGGATACGGG 120

GACTTAACCA ACCTTTTTTG GCTGTTTTTC GTAATTGATG ATCAAATCTC CCTGGTGGTG 180

AACTATCTAG TAATTCATGG GTACGATTAT TCCGATTTGC ATAATTAAAC GATTCACCAT 240

TCAAGACTTT TTCCTGGTAT TCATTGTTTT TGGCCGCTTG aTAATAAGCT GGaTTACTAA 300

TTAATCTGCC TCGCCAGTCT TCAACTAACA CATCCCACTG GGgTCACAAA GGtCCCTTCT 360

TCcATTCaAA ATAAACTCAC TGTGaAAATT AGGTGTCAAT CGTTTCTTAt TAATTAAAGA 420

ATAAGATGCC CCCTCAGGGT ATTTGTTGTG CAAACGAGCA GAGCTTTTTT TCAGCTTCCG 480

CCCACCCATC CCCTTGCGGC TTGGCCGCAA AAACATAGCG TTTTAACGCC TGTTCATCGG 540

TGTCCCCTTC TTGTTTAAAA AAATCGCGGA TATACGTGAT ATTTTGACGA TCAATATACA 600

TTCTTAATTG ATGTATCTTT TTCGCTTTCT CTTGGTTAAA GCTCCCAAGA AGACCTAACG 660

GATAGGCTTT TTGAACTGTT TTTGCCAACA AATCCCAAAA CGTCGCATGC GGTGCAACCG 720

TcTTCGAGaA ACGATAAGGT CCTACTAATT GCAACATGCC ATGCTGATCC AACACTGCAC 780

CTAATTGCCA TAACAGTAAC GAGAGCTTTT CTTGCGGACG ATAAGCCGAA AAGCGAAATA 840

ATCGCTCATA CGCTGAAGAG CCCACTTCTT CCACTTCAGA AAACAGTTCC GCGACTGCTG 900

TTTTCGCTTG CTCAAGTGGC AAGAAACGCG TGTCTTTTAT GGCGCTTTCC ACTAATCCAT 960

AAACAGCTTG GCGGTCCCAG CCAGCGTCAA TCGCCGCCTC TACACCAGCA CTCATTGCTT 1020

GTTGTAATTC TTGCCAACGT CCCATCTTCT CGCCGCCTTT TCTTATATAC TTCTTCTATA 1080

ATGATAATCC TTTCCCCACA AAATGCCAAC TGCTTTAACG CAAAAAAACA ACCCGCCCTT 1140

TGGGGAAGGG CGGGCGGAGG AGTTAAAAAA ATGAAAAAGT GTTTTGTTAA GGGTTTTGTG 1200 

TATCGGGTAT GGTTAAATAA TAGCCGTTAA AATGTGAAGA AACTATGGTA AAA 1253 
(2) INFORMATION FOR SEQ ID NO: 587: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587:

GTnTCAATGG TGATTGAAGG TCAACATTTA GTAGAAAATG GGTTGTTATA TGTGGCTTAT 60

CCGAnAAAAG GCAATAaAAT GTATCCAACC TTTGTCCACC GAGATGAAAT TTTTCCAACA 120 

TTACAAGTAA ACGAGGCAGA TGGGTATATT AAAGGCTCAA CCTTAAAATT TAATCGCATG 180 

GTTAGTTTAG ATGAGACTTT TACGGTTGTC GGCATGAAGA ATGTTCTTAA AAAGCTAGCG 240

AAGAATCAGA CCAGTTCTGC TGTTGGGGAT TATGTTCAAT ATTTACCGGA AATTGAAGCG 300 

TTGGTACAAA CAGAACCAGC TGCTGCAGCG ACATtCGCTG CATTAACGCC TGGTTATCAA 360

AAAgAGTGGG CGCGGTATGT ATTTAGTGCA AAGCGAGCGG AGACACGGAA AAAGCGGCAA 420

ACTGAAATGT TAGTCATTTT AAAAGAAGGC TATAAAACCra AAGCGCTTaT CAGCmAAGGA 480

rGAAtAATGG CGCAGTCAAT TGAAACAGTT ACAGCATACA TCGAAACAGT GGAACCAACC 540 

nGnCGAnTGn CTA 553 
(2) INFORMATION FOR SEQ ID NO: 588: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588: 

AAAAAATCGT CGCATATTTT TCTTCGATAA TTTCAGCTAA GCGTTTTCGA AAGTGTTCGT 60 

TGAAATTATT GTCCATGATA GTTAAGAAGA AAGATTTGTT CTCTTGAATA TTAATATAGA 120 

GTTGCGTCAG AAGTTCTTCA ATGTGCTTCA CGCGGACACG GTTGCCATTA ACCAGTTGTT 180 

GCGAATCAAG AATAGCGGTA AAAGCACTAA GCGTAAAGGT GAAGATTGCA TCATATAAAT 240 

GCTGTTTGTC TTTAAAATGA GCATAAAAAG TCGCACGATT AATCATAGCT TCATCAGCAA 300 

TATCCTGAAT CGTCACCGCT TCGTAGCCTT TCGTTTCTAC AAGTTTAAAA AAAgCTTCAA 360

TAATCATTTT tCTGGTTCGT TGTACTCTAA GGTCTGTTTT CTTTGCCATC ATAAATTCCT 420 

TTCGAGATAG TCAACATTCG CGGgAAACTT tGTkGTTawC tCACaAAATA GGGTAAATicG 480

CyTGTtGCAA ATCATTTAGA ACCTTTCTAA AATAGGTATA TCnATnATAA CCAACAGTTT 540

GTnTAATAAT CCACCAACTG TGGAAAAATG AnGGGG 576

(2) INFORMATION FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589: 

GAGCCTGTGA CTGTTGAGGT AmACGTTCAA CTTCTCTTGA ATGATAATCA TTTGTCTCTT 60 

TGATTGCTTC CGTTAACTCA CCTAGAGGAA ATGCAGTTGG CCATTTCAAC TGAAATGGGC 120 

GATCTATATA ATCATTATAA GGTGTAAATT CTTTTTTAGT TTTTTTCATT GGTTTATCCT 180 

CCAAGACCAG TGTTGCCACC AGCGTGACCA CCAACTAATC CGGCTCGATT CACAGCTGTT 240 

GCTCCATCTA AAAGAGAAGA AGCGTGAATC AAGGCTTTGA AGCCAAATTT TCGACGTATT 300 

TTATCGATCA AAAAATCTAA TTGGGTATTA ATAATTTGTT CTTCAGGAGG GGAGAAGAGA 360 

TCCAATTGAA GTGTGGTATC CCAAACCAGC TTTCCmAAGG AGACTCCTAA aTTTCGAATA 420

TCCATACCAG GAACATAGTT TTCTCGAAAA AGCTTCAATA CATGTTCGGT TAACACTTTT 480

GTGTTGTTGC TACGAGCGAT TTTAAGCTGT TTTCGCCAAC CAGACCTGCC CAGTCCATCA 540 

ACTTGTCCTT TAGAATAACC AACAAAAATT GAAACACACT CAGTTTGACA GTTTGAATTT 600 

CTCAACCTGG TTGCTACTTG GTCTGAAAGT TCCTTCAGCA CTAATTCAAT TTGATCTC 658 
(2) INFORMATION FOR SEQ ID NO: 590: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 940 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590: 

AAAGAATCTA TCAAATAGTA GAGAAATATG GATATATAGA ATAAAGATGT AGCATATAGT 60

TATGCAATTC TAAAGAAAAA GTTTTGTCAG GAGTAAATCA AAAATTTGAA CTTGGGAAGT 120 

TTTATGCGAT AGTAGGAAAG TCAGGAACAG GAAAATCTAC ACTTCTTTCC TTACTTGCTG 180 

GaCTTGATAA rCCTCAAACA GGAAAAATAT TGTTTAAGAA TGAAGATATA CAAAATAAAG 240

GATATAGTAA TCATAGAAAA AATAATATAT CTTTAGTATT TCAAAACTAT AATTTAATAG 300 

ATTATTTATC ACCAATTGAA AATATTAGAT TAGTAAATAA ATCAGCAGAT GAAAGTATCT 360 

TGTTTGAACT AGGTTTAGAT AAAAAACAAA TAAAAAGAAA TGTTATGAAA TTATCTGGTG 420

GACAGCAACA AAGAGTAGCT ATTGCTAGAG CATTGGTATC AGATGCTCCA ATAATACTAG 480

CTGATGAGCC TACTGGTAAT CTAGACAGTG TTACTGCTGG AGAAATAATT AATATATTAA 540

AGACATTAGC TAAAGATAGA AATAAATGTG TAATAGTTGT AACACATAGT AAGGAAGTAG 600

CAGATTCTGC GGATATCATT TTAGAACTAA GTGGTAAAAA GTTGAAAAAA GTAAATAAAA 660

TGAATTTGGA GGTTGAATAA TGATAAAAAA TGCATTTGCT TATGTAACTA GAAAAAGTTT 720 

AAAATCATTA ATTATCATAT TAGTTATTTT ATCTATGTCA ACTTTAAGTC TCATTAGTTT 780 

ATCTATTAAA GATGCTACGG ACAGAGCTTC AAAAGAAACA TTTGCTAATA TAACaAATAG 840 

TTTTTCTATG GaGwkAAaTA GACAgTAAAT CCaGGAaCcC TAGAGGtGGG GGrAATGTaA 900 

AGGTGaGaTt TAaAAgAaTC TCAACnGATA GTAAGACTCC 940 
(2) INFORMATION FOR SEQ ID NO: 591: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: 



TGAAAACGAC TAGGTGATTC AAACTTTCTG AAATTCTAAA TTGAATCATA CCAGAAGAAA 60

CTGGCCGAAC CAATAAAGTA AGGACAATAT ATAAGCCCAT AATGATACCA TTGGCTAAGA 120

CTGTTCTTAG CTTTCGATTG GCTTGAAACG AATTTTGCAT ACTTTTTCCT TCTTTCTCCG 180

AGTTTTTTTA CAAGGGATGG TTGATGAACC TTGGTTACAA TTGCAACAGT TCACATTCTA 240

ATATGGAAAG TCAAAATAGT CAATGAAAGC CCTGTTTTAC TTTGCACTTA TTTGCAGGTT 300

TCACCTCTCA TGCATTGTTT CGTGCCTTTT TCTAGTCATT TTTTACTTTT TTAAGCAATC 360

TTTATTCGCA GAAGTTCCTC CCTCTGGTAT AATGGACAAT AGCGATTCAA AGGAGGAGCT 420

CAATGaACAT TCACTACTAT CTTATTATCA ACGAGCAGGC AAGTAGCGGC AATGGCCGAA 480

AAGTCGCTAG AAAAGTGATT CAACAACTCA AGCAACAAGA ACTTAAATAT ACGGCTCTTT 540

ACACTGATTA CGCTGGACAT GAAAAAGAAT TGACGAAGGA ACTTGCAGAA ACAACTTTAC 600

TTCCTTGGTC TGaAGACTTA GACGTTTcAA CTTTTCCAAT CCTAGTCGTG CTAGGCGGCG 660

ATGGCACACT ACATAATGTC ATTAATTCAT TACTTCCATA TGATTCAACT ATTCCTTTAA 720

GCTATATTCC ATGCGGCTCT GGAAATGATT TTGCACGAGG GGTGGGATTA TCAAGAAATA 780

TTGATAAAGC ATTGCATCAA ATCCTGCGCA CCAGACGACC AAAAGAAATT CAAACAATTC 840

ATTATGTAGA AGCCAACCAA GAAGAAATCG GCTTAGCCAC CAATAATGTT GGCTTAGGGT 900

TGGATGCgGC AATCGTGGAA AAAACCAACG AATCGTCATC AAAAAAAGCC TTAAATAAAT 960

TTAAGCTTGG CTCGCTTTCC TATATTTCTT CAATCATcAT GTCyTTTTTA GACAAAAAGG 1020

CTTcCAATTT TAGTTGAAaT GAATGGkAAA CAATACmCaT TwAATCGTGC TTTCCTATGk 1080

ACGGTCACAA ATCaTCCTTA wTTTGGcGGG GGCGTGnnnA TTATCCGACG GCCAATCCTC 1140

GCAAAGCAGT CGTGGACTTA GTTGTCGTGG AACGAATCAA TATTTTTAAA ATTTTGTGGC 1200

TTATTTTCCT AATTGTCGCC AAAAACAAGG GAAATCTAAG CATTTTCATC ATTTTCAATC 1260

AAGCAAAA 1268
(2) INFORMATION FOR SEQ ID NO: 592: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592: 

GGATGCTTCA GCTCTAGGAC CGATGGAATT ATTCATGTTT TTCGGTAAAC CTACGACAAA 60 

TTTCGTGACT TCGTATTCTT TCACCAATTC ACCTAATCGC TCAAAACCGA AGTTCTCTTC 120 

CTCTTCATTG ATGCGAATAA TTTCGATGCC TTGTGCTGTC CAACCTAATA AATCACTAAC 180 

AGCCACACCG ACTGTCCGTG AACCTACATC TAGCCCCATG ATCCTCATTT GATATCGATT 240

CCGTGGTTCG CTAAGTAATA TTTGGTTAAT TCTTCCATGA TTTCATCACG TTCATGACGA 300 

CGAATTAAGT TTCGAGCATC TTGATAACGA GGAATGTAAG CCGGGTCCCC AGAAAGTAAA 360 

TACCCTACTA TTTGGTTAAT AGGATTGTAT CCCTTTTCTT CTAATGCCCG ATATACGGTT 420 

TCTAACGTTT CACTGATTGC TTTTTTTCGA TTATCATCAA AATCAAAGCG TACTGTTTCG 480

TCTGTAAAAC CCATTTTACT ACACCTCACT TCTACTTTTA TACAATTTCA TTTtACTAGA 540

AAAGAACCGA AACTACAAAC AAAATTcTTC TCCTTCTCTC tTTTTTGkTA TTCyTAGkCy 600 

TTatTAATAA ArGCTtCGkA AAAAAATGAC AGCCATCGAT AATTTTAAAA TTATAAAAAC 660 

AAATACACAA CTTTATAAAA CAAGAATAAA ACTCTGCTAA ATCGCCAGAA TATCACCTTT 720 

CCCTCCAATT CATTTTGCGA AAAAATCGAC GTTTTTCTTT TAAAAACTTT TAAATATCTA 780 

TGATTAAATA ATATCAGCCT TTTAATGTTA CAATTCGCTA ACAATCTGTT TACGATGTAA 840 

ATTTTCGATT GCAGCTGTTA TTTATTTCCT CTTTTTGGTT CGTTTGTAGT ACACTAGAAG 900 

AAAATACATA AAGTGTAGGC CTGCCGCAnG GACTCCGnCT CCATTTGGTT TTTACTTTTT 960 

AGTAAACCnG TTTGTTAGCC GGGTAAAAAT TGnGGATCGT AAAGCGATGG C 1011 
(2) INFORMATION FOR SEQ ID NO: 593: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593: 

TCTnTTTTAT TCTAAGATTG TCTACTAAGC ACTTTTGTCC TAGTAGCGTT TTGGTTTCTA 60 

ACTTTATTAT TTACAGGGGA AGAAAAAATA AAGCACACCT TTACTGTGAT TCCAAAAATC 120 

ACGTGTTTAT CATAATATAT TTTTGTTTCA ATCCCAAGTA TTCTGCTCAC AAAGTGTCGC 180 

TTATTTAATA AACATTGCAT CGCCAAAACT AAAGAAACGA TATTTTTCCT CAATAGCATG 240

TTGGTAAGCA GCCAATGTTA AGTCTTTTCC TGCAAAAGCA CTCACAAGCA TCACTAACGT 300

AGATTTTGGT AAATGGAAGT TCGTTGAAAA AGCCTCTACG ACTTTAAATT GATACCCTGG 360

TGTGATAAAA ATATCGGTCC AGCCACTGTC AGCCTGaATT TCACCATTAA ATTTTGTGCC 420

AATTGTTTCC AACGTACGAA TAGAAGTTGT TCCAACCGCA ACAATCCGaC CGcCTGCTTG 480

GCGTACTTCA TTTAATTGCT TAGCCGCTTC TTCAGTTAAA CGATAGAATT CGCTATGCAT 540

ATGATGTTCT TCTATATTAT CaACGCTCAC GGGACGGAAT GTTCCTAAAC CGACATGTAA 600 

AGTTAAATAA ACGAGATGAA CACcTTTTGC TTtGATCTCT tCTAATAGTt CTTTtGTAAA 660 

ATGCaAACCC GCTGTTGGTG CTGAAGCTGG AACCCATTTT CTTCAGCCAT AAACCGTTTG 720 

ATAACGATCT GGGGCCATCT AGACGGTCTT TGGATATATG GGCGGGCAAT GGGCATTTTC 780

TCCCTAAACG GATTCCTAAA AAAnTTCTAA GGAAAATTAC CAnnCAnAAT TTGAAATTC 839 
(2) INFORMATION FOR SEQ ID NO: 594: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594: 



CnATTAGATG GTTCCTACTA CATTCATTAT GAGACACTAG CAGCTGGTGT CTCTTTTTTA 60

TTTGTAAGTA TTAGTTACAC TTGACCGAAC GTTCGTTCTC GTTTATAATA TAAATTGTAA 120

TCCTTATGAT TATATCTACT AGAAAGAAGT CAAGAAAAAT GCTTGTTTTT CTGTCTTCTT 180

TTTGATTTAA TTATTTTAGT AGGTATAATT AAtAcGGAGT GTTTATTAAA TGGTAAACAA 240

GAAGAGACAA GTAAGATTTG AATTTTTCCa GGTCaATGGA AAAGCTCAaG AAGGAGAAAA 300

AATTGTAAAA GGACTTTTCG ATCTATATCC aTTAGCTGAT AGGATAAATA GTATAAGTAA 360

CTATACTGAT AGAGATGTTA TTCTTTTTGG GGAAAAAGTT AGAATGGACA GATTTTTTGA 420

GGTTTCTAGT AGTCCAGAAT TATATGCAAT GCATTTTACT AGATTAAGAA ATGATAAACC 480

TGCGTATGTA GAATTAAATA ATGAAGTTTT GAAAGAAATT CCGTTAAATC CTGGAGAATA 540

TATTGCTGAG GATATTAGTT GCTTGTATGA TAGAGAGCTA TCTGTTTTAA TGGtACAAAG 600

AAATATTCAT AGTTTGTCAC CTTCAGGTAT TGrAGATTAT TTTACTGAAA TGAGTGATGA 660

TTTAGTAGAG ATTGAATTAT TGCCAGTkGT AAATAAaGAG ATTATTAGTA AnGCCTTAGC 720

TAATGAAAAA TTTCGTAAAC TAGAGTTAAG AGCAGGTTCT ATGAATACAA CTAGTGATAG 780

AAGTGGATTG AGAAAAGTAT TAGGTCCTTT TATGGAACTA TTTGAGAAAT TTGAAGGTAC 840

AAATTTTGTT ATAGAAATTA GTTCAGGGAG ATCAAAAAAA GATTTATCAG AAGATCAAAT 900

GAAAGAAGTA ATAACAGCAA TTGAACAAGA TAAGAGTTTA TTTAGTTCAG CAATTGTTTC 960

AGCAAAAAAA TCAAAAGAGG TACCTGTTGA AAAGTATGAT TTAATAAATG GAAAGCTCTA 1020

TGTGTATCGT TCTTTTGATT TGCCAGATGG TGCATTTTTA AAATCAGATA GTGTTATAGA 1080

CAATATTAAA AATTATTATT TTCATCCTAA TGAAGGTGGG TATAGAAAAC AAATTATCGA 1140

TGCTGTAAAA TAGTTTAAAG AAGAGGGTGA GATATCCTGA 1180
(2) INFORMATION FOR SEQ ID NO: 595: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595: 



GTAGACCTAT TCATCTAACT CAATATCATT CACATAGTnT TTTAATCTnA GGTTATTGTC 60

TTGAAGATTA GGATGAAGCC TATCAATCCC CGAGTCAATT AGAGCTATGG TTATATCTTT 120

CCCTAGTTGT TCTTTCACTG GATCAATATG AGATAAAATT TTTTTATATG GCCAATTAAA 180

AGACTTAAAA TCTTTGTTTA AAATATTAAT GTTTAAAATA GAACTATTAA CTTCTTCAGG 240

TTTAAAATCT GGTAATTTTT TCCCTTCCGA AACATTAAAA TATTTACCTA TAAACTTCCT 300

ATCACTATCA TCCAGGTTTT TAAAACTAAC TAATCCTATT TCCTGTATAT AATCTACTTT 360

TTCAGAAGAT AATTCAGACT CAATTTCTTC TATAGCAGTT GTTTGCGAAT TATCAATAAA 420

AAATGAAATA TTATTTGAGA GATCTGATGC ATAACCTGTT GTTCCTAATG TTAAAAAAAT 480

ATATGAGATT AAAATATAAG TCAATCCTCT TTTTTTCATT TTGTACCTCC AAATTGTTGT 540

TTTTTGTAAT ACAGTGAACG ATATAACTCG CTATTTTCAA TAAGATCTTC ATGCTTTCCA 600

ATACATACTA TTTCACCATT ATCCATTAAT ATAATTTTAT CAAAGTTTCT CACCGTACTA 660

ATTCTGTGAG CAACAGTAAT AACTGTTCTT TTCTCATCTA ATAAATTAGA AAAAACTTCA 720

AATTCTGAAA TATTATCCAT TGCaGATGTA GGTTCATCTA AAAGTAATGT ATTCACATTA 780

GAATAGAACG CTCTAGCTAA AGCAATCTTT TGCCTTTGAC CACCAGAGAA ATTACTACCA 840

TTTTCTGATA CTATAGTTTT CTCATACTGA GGAATTCCTA AAAGTACCTC ATCCATTTTT 900

GATTTACTCA TACTTTCTTT TAGTCTTTTT TTCTCATTTA TAGAAGAATT CGGTTTAAAC 960

TCTAAAGAAA TATTTTTTTC GATAGTTTCA TTAAAAATAT GCGCATTTTG ATTAACATAA 1020

AAAATGTTTC TTCTATTATT GGAGTTGTTT GATAAAGGAT AGCCTTCATA CAATAtTTCt 1080

CCATTAGAAg GTTGTAACaA TCCaGCCAAT AGTTTTAGTA AAGTAGATTT CCCTGACCCA 1140

CTTCTACCAA CAATAGCAAC TTTATCCCCT TTTCTTATGT CAAAAGAaGG GAGCGTCAAT 1200

AATTTTGTGT AAATAAATTG TCCTCCTGcA AAATAATTAG TTACTCAGTA AACATTGaAA 1260

CTAATGTATC GGTTACCTGT TGaAAACCTT TATGGcTtCT GTTTAGAAAT TTTTGaTtGT 1320

ATGTATCAAA AAkGCTGACT AGrAAGCGkT CTAGTGawTC TTCATTTTGA AACTGCTCTT 1380

TTCTACGGCT GTATCTTTTA ATTTGCTTAT TGAAAGACTC GATTAGATTG GTTG 1434


(2) INFORMATION FOR SEQ ID NO: 596:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596: 



GTAGTCCCTA CAGTTCTTCT ATTTATAAGG yTTCTCGTTT AGACAAGCCA CGTTCTAAGA 60

TTTTTTGGCT TCTAAGATAC TCTGAATTTT CTGTACTTTC AGAAAAAGAC AACTCTGTTT 120

CGATCTCCTC TGACTCAATA CAAATGAGTT CTTTGCCATA CCATTGTGCT TTATACAACA 180

ATTTAGCTAA AAATAAAGAC CAAGCCAGTT CAGAACGATC ATGTTTTGGC GGCCGCTCAT 240

TACTGTGaTG GGcTTTTTCA ATACAAATAA CATCGTAACT TGaAACCAAT TCCGTTGTAA 300

TTTTGTTTAA AATATCCGTT CGTTGGTTGG cTATTTTTTC GCGAATTTTT GCGACTTTTA 360

TTTTTTGTTT TTGATAATTT TTTGCTTGCG AAAGATCGAC ACCTTTTTCC AAAGCAATCA 420

TTTTACGCTT GTTTAATTTT CGTTCTTCTT GACGTAGTTT TTGCTTTAAA TGCTTTGAAC 480

AATAGCTTTT TTCGAAGCTT TCACCATTCG AAAGCAGCGC AAATTTTTCC CTACTGCTAC 540

AAATTCCCAC CATTTTTTGT TGCTTTGTTT TCGTTTCAAT CGGCGCTTCA CACAAAATGG 600

AAACATAATA ATTGTGACTC GCACTCATAG AAATAGTTAC AGATTTTAwC TCCCCCAGCG 660

GTTGGcGGTG TACCTTCATT TtAATTAAAG ATTtCAATTT AGGTACTTTC AAATAACGAT 720

TATCCAAAAT TTTGACCGTG CCATGTTGAt TGTTCGTAGT GTAGGAATGC ACCGTTTCtT 780

TCTTTTTCTT TTTTGGAAAG CCCATCCCTT TTtCCCGATA AAAATTTTTA AAGGCTCGAT 840

CCAAATAAAC TTGtGCATTt GCCAAGGCTA AACTATCAAC TTCTCTGAGG ACAGGGTATT 900

GTnTCTTATA TTTAGCTGGC GTTGGCATTT TAAACGTTTG TTGAGGATTT TTTCGCATCT 960

CTTTGTAGAk ATCAATCCGG GCTTGTAACA TTAAATTATA CACCAA 1006



(2) INFORMATION FOR SEQ ID NO: 597: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597: 

TGGTAGGTGA AGAGGTGATA AGGTGGATAA AAGTATGAAA ATAAAATGGA AGCAGAATTT 60 

GTTAGTCGCA TGGATTGGTT GCTtTTTTAC AGGGGCGAGT ATTAGCTTAG TTATGCCGTT 120 

TATTCCAGTA TATGTTGAAC AATTAGGGAC GCCTAAGAGT CAAGTAGAGT TGTTTTCTGG 180 

ATTGGCAATT TCTGTCACGG CGTTTGCTGC TGCGATTGTT GCACCGATTT GGGGAAATTT 240 

AGCTGATCGA AAAGGTCGGA AAATTATGAT GATTCGGGCT GCAGCAGGAA TGACGATCAC 300 

AATGGGGGCG TTAcATTTGT CCCTAATGTA TACTGGCTGT TGATTATGCr TTTTATGAAC 360

GGGATTTTAT CCGGCTATAT TCCTAACGCA ACAGCGATGA TTGCGTCGCA GGC3CCAAAA 420

GAGAAAAATG (TcTGGGCTCT AGGGACATTA TCAACAGGTG CAATTGcTGG GaCGTTAATT 480

GCGCCATCGA TTGGCGGTGC GTTGGCACAG TGGTTTGGAA TGGAAAATGT CT7TATTAT7 540

ACCGGCGTTA TTTTATcATT ACAACGTTGT TGACTATCTT TTTAGTTAAA GAAGATTTcA 600

ACCAGTTGAG AAAAAGGATT TATTAACGAC GAAAGAAATT TTTAGTAAGA TGGATCATG7 660

TTCAGTATTG ATgGTTTGTT TGTGACTACG TTAATTTaCA ATTAGGAATT ACAAG7ATCA 720

GCCCAATCTT AACGCTATAT ATTCGtCTTT AAGgGCGATA CAGAGAAgTA TTATtTGTTA 780

GTGGTTtGAT gTTcGATGCC GGCGTTCAGC AATAATTCTC TCCnACCTTn Gn 832
(2) INFORMATION FOR SEQ ID NO: 598: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598: 

CTTTTGGGAC aATnTGrTaC CTAtrGCgCC tTaCaATTrG CaCnGTACGT TTCCm?GtCG 60 

CaAATAATGG caTCCgCGTG CaACCCCcAT GTGGTAGAmG GAATCTATGG CaATGATGaA 120 

AATGGAGCTT TaGGAAAAAT TTTAAAAGAA ATTGAACCAA AAGTTTTGAA TAAGGTGAAT 180

ATTTCTGAAG ACCAGATAGG AATTCTTCAA CAAGGTTTCT ATAATGTGGT AAATGGAACT 240

AGTCCTTTTA CAACTGCACG TGGTTTAAAA TCAGACAAGT TTTCGATTGC TGCTAAAACA 300

GGGACCGCCG AAACGCAAGC AACAGATGCT AATGGAGTAA ACCATACAAC GGTTAATAGT 360

AACTTGGTGG CTTATGCGCC GTACGAAAAT CCTGAAATTG CAATTAGCGT TGTTTTACCA 420

CACTTAAATG ATGAAGCTTC CAAACCAAAT CAAACGATTG CCAAAGAAGT TTTAGAAGCG 480

TATATGGAAA TGTACAAAAA ATAAAATTTG TAAAGTAGAA CTACTTTACA AATTTGCCCA 540

AAAGGGTAGT CAAATGACAA AAAAAATGGT ATGATATTCA AGTCTAAGAA ATCTCTTGGA 600

AAGATCAATT ATTAGTAGGA GGGGAAAACA TGCGCGTAAA CATCACTTTA GAGTGTACTT 660

CTTGTAAAGA ACGTAACTAC TTAACAAATA AAAATAAACG TAACAATCCT GATCGTTTAG 720

AAAAACAAAA ATATTGCCCA CGTGAAAGAA AAGTTACTTT ACACCGTGAA ACTAAATAAT 780

TTATTTGAAT ATAGCCTAGA ATCCTTGCTT TATAAGGGTT CTAGGCGCTT TTTTTGTTGT 840

TAGGAATTTA CTGATTTAGT TATTGATGTA GATTTGCTAT TTTAAAAATT TACATAGC7A 900

CTGAAAGCAT CTGTTGCTTC TTTAGTAACT TCATCAGAAA CATGAGTGTA AGTATCCA7A 960

GTTATTTGTA AAGAAGAATG TCCTAAACGT TCTTGGATTA TTTTAGACCT AACGTTATCT 1020

GATTCGAATA ATAATGTTGC GTGGGTATGC CGAAAACCAT GACAACCAAT AGAATGTAAG 1080

TTAGCTTTTT CTGCCAATCT TTTAGAACGT TGGTAAATGT CTTGACTTCG GGACATGG.A 1140

CCATCAATTT TTGkAAAAAT GaGTtGTGTT TTAAATCCAC CTTTTTTCAT TAAAGCTTCA 1200

TGCTGTCTAA GTTTCCATTT TTTTAAGATA TAAGCAGTCT TGTTATCAAA AGAAATTTTA 1260

CGAATAGAAT TGGGAGTTTT AGGATCGTTT ATAGTCAATC CATTTGTACT GATAGCAGTA 1320

GTTTTATTTA TATTAACTAC CTGCTTTTTT AAATCAATAT CATTCCAATT CAATGCTAAA 1380

GCTTCACCAA CACGTATACC AGTAAAAGAA AGTAAGCGAA AAATAGCACA GTCTAAGTCG 1440

GCATAGTATT TTAGAACTAA ACTTTCTTCT TTGGCTTGAT TGGCAATGCT ATCAGCTGTA 1500 

TTTAAGAAAT GTTCCAGTTC GTCTTTTGTA TAGAACTTTC TTTTTGTATk CTTTtCTAAT 1560 

TTCTkTAGCG AACTAGGTTT AGTTATTTTC TTAAATGGGT TTGAGTCTAT TATTTCTAAA 1620 

CCAACAGCAT AGTCACAAAC AC 1642 
(2) INFORMATION FOR SEQ ID NO: 599: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599: 

AGTGATCTTT ACCCTTTCCG CAGATACCGC AGCCAAAAAA GAACTACAAC GGATTCTTTT 60 

GCAGTATAAT AGAAAATAGA AACGAAAGAA CAGGCACCAA CACAAAGGAG GTATTTTTAT 120 

GAAAGTCACT CAnCTTTCTA GTGAAaCCCT TGACCGAGCG CACGAACGTT TTGAAGAAAC 180 

ACTTGCGCAA ATGACCGTTG CCGAAGCAAA CACGATGCCC GCACCATTAA TCAAGTCTGT 240

TACTTGGCTG ATGTGGCACc AGCCAGAGAA TTAGACCTAC AAATTTCCGC ACTTAATCAT 300 

AGTGATCCGT TATGGCTTAG TCAACACTGG ACAGAAAAAT TTGCCTTGGA TTTGCCAGAT 360 

GAAACAGAAG ACTGGCATCA CACACCAGAG GAAGCGGCGA AAGTCGTGGT GGCAGAAAAG 420

CAATTGCTTA GTGACTATTT GGCTGCCAGT GTGGCACTTA CCAAAAGCTA TTTAGACCAA 480

ATCAAAGAAG AACAATTGTC AGACGTGATT GATAAAAATT GGACCCCGCC GGTGACTAGA 540

CAAGTCCGTT TAGTTTcAGC GATTGATGAT GCAtGATGCA TTCTGGGCAA GCCGTTTATA 600 

CACGAAGATT AGTCATTGGA AAATGATAAA TAAGCAAAAA CCAAAAATGC TTCTGCTTAA 660 

GCGGAGCATT TTTTATTGAG AGAAAAATAC ATAn 694
(2) INFORMATION FOR SEQ ID NO: 600: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600: 
AATTAATTTG ACATAAAACC AAGTAGAAAG ATAACATTCT TTATCTAATT CAGTTATATA 60

TCACTTTTTT TGCAGGTGCT TCAACAGTTA AGATTTCAAT AGAAACTGGA CTATTTTCTA 120

ATATACCyTC TTCTTTTGCA TGTTTCCtAC ATAAATTATT TTTTGTAGAT AGTACTCCTA 180

TGTCCTATTG AAAATATTTG TATTATAATT TCTGAATCAC AAATATTAGC CAATATTCGA 240

TAATTTCCAA TTCTGTAACG CCATTCACCA CTTCTATTTC CAGTCAAGCC TTTTCCATGT 300

TGACGAGGAT CAGTAGTTCC TTCTAAGTTT TTTCTAATCC AAGCTACTAT TATTCTTGCC 360

TGATTTCTAT CCATTTTTTT TAATTGTTTC TGCGCTTGTG GCATATATAC AACTTTATAT 420

TTCATTTAGT AAATTCTCCA TTTCCTCTTG AGAAATACCA ATCTCTTTTT TCTCTTTGGA 480

AATTTCTATT GCGTTCCGTA AGgTTTTTAA ATCAATTTCA TCTTCAATTT TATCTAATAC 540

AGCTTTTCGA TAAAGTTCAG TTAATGAAGA ATTATGAGCT TTTGCATATT CTAACACAAG 600

TTCTTTCTCA TCCTCAGGaA TTCTAAATGT AATTGkTGaT TGaGACATCt TATTACCTCC 660

ATTCGCACTT kGtAAtAACa TTGTATTAAA AACAAGGGGA TTTGGCAATA CAATGTTATT 720

ATAAATGTGA TGATACTCCA ATATATAGAA TAAACATGA 759



(2) INFORMATION FOR SEQ ID NO: 601: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601: 



ATTCATAAAA TTGGCATTAA TATTAATCAA ATGGCCCGTC TGGCCAATCA GTTTCATGAA 60

ATTTCGTCTG AAGATATTAA AGATTTGACC GATAAAGTAC AGAGTTTAAA TGCGCTGGTT 120

CAGAGCGAAT TAAACAAATT AATAAAGAGA AAGGAACAAT CGTAAATGGT GTATACAAAA 180

CATTTTGTTA TTCATACATT TGATAAATTA AACAATGCTT GCTCATATAT TGAGAATGCA 240

GAAAAAACTG AAGTCACGAA TGATAATCCG TCTGAACACT TGGAACATTT ATTTCAATAT 300

ATTGTGAATG ACGATAAGAC GTACATGAAA AAATTAGTTT CTGGGCATGG CATTGTGGAT 360

CCAACAAATC CTTATGAAGA ATTTAAATTA ACAAAATTAC AAGCAGCAAT TCAACGAAAA 420

ATCGGGTACA CATTCGATCC AAAATCAGAA CGATTGCTTC CGCCAACGTT AACAGAATTA 480

GAAAAAGGCA ACGCgctTTT AGCACACCAT TTAATCCAAT CCTTTTCTCC AGAAGATGAT 540

TTAACGCCAG caAAAAATAC ATGAAATAGG GTACAACACG GTGATGGAAT TGACGGGTGG 600

AGAGTATGAA TTTGTGATCG CCACACATGT CGATAAAGAA CATTTACmCa ATCmTAtTAt 660

TTTTAGGTTC CAACCAACTT TAAAAACCAG G 691


(2) INFORMATION FOR SEQ ID NO: 602: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3274 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602:



TTTTACnTCT TCTATGTTAT TTATnTCTAT ATTATCTATA TATTCTTCAT AGAAATAGGT 60

ATTTCTAGTA ACTTTTTTAA CTATATAAAT ATCTGCCTCT AATTCTTTAT TTAAAAATTC 120

AAAAAAATCT CTTAATTTAT TCTCGCTATT AATCTTTGGC TTGTATACTA TCTTCTTTCC 180

ATCAGAAAAA GTTAGTGTAG AAACAGTTTT TCCGCGACTA TGAGAGTCTC CCTGTGATTC 240

TGATATAGAG TTAAGCTCTG AGGATTGTAT GTTAAAACAA TTTTGTATGC TAGGTAAATC 300

TTCAGTCACT CTTATAAGCA TTTGTTTAGT ATTATCTAAA AAATATCTCA TTCTAACAAC 360

AGTAATACGC ATCAACTCAG GATAGCATGT ATAAAAAGCT ATTATATCTT TTTTAGAGTT 420

AAATCTTTTT TTTAGATAAT AGATAAATCG CTTGCTACTA TCATTTCCCT TTAGAGGTTC 480

ATTTTTTTTA AAAGTATGCA AATCTAGCAC TAATGTTTTA CTTGTTAAGT GAATAAGCTC 540

TTGTGTTAAA GTTTCTAATA AATTAATAAT GAATTCTTTT GTACAAATAT TTAACTCTGA 600

GTTTAAATCA AGTAAAAATA AACGaGCATA CTGTAAGAAA TATCTAAAAG GATAACTAGC 660

ATCTACTTCT AATAATTTCT CTTCACTATC ATTATAAGAA TCTAAAATGG aTTCAAAAAA 720

AAtATACCAT GaTTGaTcTk gTAATGTAtT GAAtTGCAmC TTCTTTATCA GGAAAGTTTT 780

CAATAGGTGT TATTCCTAAT CCAAAATTAT CCAGTGATTC ATATTTATAT TTAATCAAGT 840

AGTCTAAATC ATCTTGTTTA AGAACACTTT TTCTTTCTTT CCAAGCTTGT AAATTTTTAA 900

TGTCGTACTT CTCTTTACCA CTTTGCTTTA GTAAAAAACA TCTTTCATTT ATGCTTAAAA 960

CATTTATTAA ATTATCTTCC ATGTAAGCAC TCCTTTTTTA TGTATAAGAG GGCTAGTTTC 1020

ACTAGCCCTC TTAATATATA AATRAGCAAA ATTTAGCTGA AAATAATGCA CCTACTCCTA 1080

AGCCTATGGT AAAACATGCT GGAGTTGTCT CAGCCTGAAC ATCTCCACTA CCTTGAAtCG 1140

CTTCCATTTC TTCTAAACTT AACTCTTCAA AAGAAGGACC AACAAGTTCT AATTTATTAG 1200

AGTAATAGTT TTCTTGATTT TCCTTATTTA GCACTGTGCT TCACCTCACT AAGTTTTATA 1260

GTATATTTTA ACAATGTTTT AAAGACACAA CTACAGTTAC TCCAGTAAAA ATACCGCCAC 1320

CAACCCAGCC ACAAGCAGCA CTACTTGCTG CAGCTGTCGC CGCaACAGCA CACACCGGTG 1380

TTGTCTCAGC CTGAACATCT CCACTACCTT GAATCGCTTC CATTTCCTCA ACACTTAGTT 1440

CTTCAAAACT AGGrACTACA CTTAAATTTT CCATAATTCT TACCCTCCAT CATAAAATTG 1500

TTGTATATTG CATAACATAA TATGTTACAA TTTAATATTA TAGAGCtTTT AATCTTTTGT 1560

CAAGTGACAC TTGACAAAAA GGAGGAATCA AAATGGCTAT TTTTATTTTT TTATTGGATA 1620

TCATTTCTGT AGTCTACTTn GTTTTTAATC TGAnTGATTC AAAAAAaGaT GmGCGAATTT 1680

TATATwCaAT TTCtTTAACA TCTACTTACA TGTACTTATT TATTAACTTT TCCATACTTA 1740

TAGTAATTGC TATTTtTCTT TTACTACCTA ATGTTATTAC TAAAAACTAT CTATTTAATT 1800

TTATCTACTA TTACTTCTCT CTATCAATGA TATTGCACTC TCTGTTACTA CTTTTTTTCA 1860

ACAAGAGGAC


TTCATAAATG 


ATAATGAA 1 A 


Ti PTT 2l A A AT T 
AG 1 1 AAAA i. l 


AATTAPiAfiAft 


AA AA21 A AAAA 




TTAGCCAAAG 


fn/"* T\ ta mm tv ^ rn 

TGAATTAGCT 


/— > mrp m t\ rp rn tv /"■ 

GCTTTATTAG 


AA/^ rnr PA./~ , PiiP 
AAG I 1 AuLA<j 


AP A AZ1PTATT 
AL AAA\_ ini 1 


AA 1 uo 1 A 1 Ao 


i qq n 
l yo u 


AAAAAAATaA 


ATATAACCCT 


TCTTTACAGT 


rp7\/*"'/'" , A r r rn A. Ii B 

TAGGA1 IMA 


nnl luLl 1AL 


1AGG 1GG-A1 A 


zU4 U 


CTCCACTAGA 


AGATATTTTT 


CAATGGCAAG 


G 1 onA 1 AAAA 


AAo A 1 1 1 i 1ft 


AAi 1 1 GAGGA 


0 1 A A 


CAGAATATTT 


AAATTAAAAG 


AAAGGTGCTT 


oprpTi rn/-« T\ A i*"* A 

Lb 1 A J. unnbrt 


AAAAu 1 1 Iftl 


A1GGG1 1 G 1G 


Zlbu 


ATTTTATTTT 


TAGGAATAAT 


TGTTGGCATA 


/'""T , AAr - " , A/"^*T 1f P r P 

GT AACAG 111 


A T 7\ rp #7i rp rp AAA 

A1A1 1 1 iAAA 


pp7\pns A A rp 71 tv 

GGATAAATAA 


2220 


TTATGACTGT 


TGTTTTATAC 


TTTAATCATA 


ACTGATTTTT 


1 A 1 LAG 1 AAA 


AGGG 1GAG1 1 


2280 


ATGATTACAT TCATTTAAAT AAAAACTAGG CTCAAATGTA TAGTAAAGAC TATCTCTTTT 2340

TTTGGGGGAG ACAGTCTCTA AGGTTATAAC TTATTTTAGC CTTTCAATTG CCTTGATATT 2400

ACTTTTATTT CAAGATTATT TAACATGGAC AAATACTTCT GAATCTTTTA cataaatgtc 2460

GTGGTAATAA TTAGATTTCA TTAAGTCACT TTCTTCTTTA TTCAACGCGT TAATATCAAA 2520

GATCTCCATG TTCACATTTA AGCCAGTGTA CGTTCTAAAT AAAAACTGAT TTGGCCAATA 2580

AAGGGGTTCA TTTGGTGGAA CCAGTTCTTT TAATATAGGA TAATTCTTCG AACTATTGAC 2640

AAATACTGGC GAATCCTTAA ATAAAGAGGT ACTATGcACG GTCACcGTGt CACGGTTCAC 2700

TAGATCTTTT AAATCACCGG CTAATATCAT ACTTTGTCGT TCAAAAGATT CCTTTTGGTG 2760

ACTTAACGCA CTAGCATAAA CAAATGTAAA AGATAAAAGG TAAACACAGA AAAGGCAAAC 2820

CGCTGTTTTT GACACATAAC CTACCGCGGG TTCTGTTGCA AAGTTTTAAA TAAAGAATAA 2880

AATCCCTTAC GGTATCTATG ATTTAAGCTG GGATTCCCAA TAATACCTTG ATTTCAGTAC 2940

AGACCGAAAA CCCGAAGAGA GTACCTTCTT TTCGGGTTTT CTTATATAAT CCTCGAAi GG 3000


CTTCCATGCC TTTAATCGTG GTAGAGGCAG TGCGTAAACT TCGATAGAAT TTATTGCGTC 3060

TCTTTACTGG ACGATGGTCT TGTTCAATCA AATTATTCAG GTATTTAATG GTACGATGTT 3120

CTGTCCCTTG ATAAAAGCCG TATTCTTTTA GTTTCTTAAA GGCACTTGTA ATAGAGGGGG 3180

CTTTATCTGT GACTACAACc TTCGGTTCAT CAAACTGCTT CACTAAcCGC TTAAGAAAAG 3240

CATAGGCtGC TtGTGCCCGn TTTTACGnAC CAAT 3274



(2) INFORMATION FOR SEQ ID NO: 603: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603: 

ATTCCACGAG CCATAGCAGC ATTTGCAATA TCTGTTGGAT CACCAATTAA GATTnGTTCT 60 

ACTAATTCAT CAGATTTTAA GCGGGCAGCA GCACCTAAGA TTCTAGGATC TGTTGCTTCA 120 

GGGAAAACAA TTTTGATATT ACGGCGTACG ACTTTAAATT TTAAGCTATC GAATAATTCC 180 

ACGATAAACC CTCCAGTTTT ATATTTACAA AATCCATCAT ACACTATTCT CAAAGAAAAT 240

AACAGTTTTT TCTAGCTGTA TTAGTCAAAA AAACCAATAA TCCGATAAAC TTTTCACAAA 300

CTAACTCTTC TACACTATAA CAGCATACAC TGTTACACTA CACTGTTTCA GGAAGTTGCC 360

AATCAATCGG CGTTTCCCCT AAACGTTCTA GAGCGGCATT CGCTTGAGAA AACGGACGTG 420

AGCCAAAAAA ACCACGATGT GCTGACAATG GACTTGGGTG TGGCGATTTG ATAATCACAT 480

GCCGAGTCGT ATCAATCATT TTAATTTTTT CTTGAGCAGG ACGTCCCCAT AAAATGAAAA 540

CAACAGGTTT TTCACGCTCA TTCAACTTTT CGATAATGAC ATCTGTCAGT TGCTCCCACC 600 

CTTTACCTCG ATGGGAATAG GCTTGACCTG CCCGAACGGT CAACACAGTA TTTAATAATA 660 

ACACGCCTTG TTTGGCCCAG CTTTCTAAAA AGCCATGGTT GACTGGTTGG TACCCTAAAT 720 

CAGCTTGTAA TTCTTTGTAA ATATTCGCAA GAGATGGCGG CACTTTCACA CCTGGCTGGA 780 

CAGAGAAACT TAGTCCGTGT GCTTGATTGG GTCCATGATA TGGATCTTGT CCCAAAATTA 840 

CTACTTTTAC TTCTTCAAAA GGGGTTAGTT CTArCGCTGA AAATAAATGA TACATATcTG 900 

GATAAATCGT TTGTGTCTGA TATTCTTTTT TCAAAAATTC CCGCAATTCT TGATAATACG 960 

GTTTTTCAAA TTCAGCAGAT AAAATGTTTT GCCAACTATT GTGAATAATT TCTTTCATAA 1020 

TGCTCACTCC TTCGCTCTCT AAGCATAACG AATTTCTCCA AACCTGACAA GAAATTTCTG 1080 

GGTACTTTCT CTTGGAAAAA CGTGGTAAAC TATAGGTAAG CTTTAAATAG ATAATGAGGT 1140

GTAGAGAATC GATGATTAAA TTAATTGCTT CAGATATGGA TGGAACACTT TTAGATGCAA 1200 

AAATGAGCAT CACAAATGAT AACGCTTCAG CAATTCGCGA AGGAwACGTT TaGGAaTCgA 1260 

ATTTAtGGTT GCTACTGGCC GCGCATAtAC cGaAGCGAAA CCCGCGTTAG AAGAAgCTGG 1320 

aATTGACTGC GCCATGATTA CTTTaAACGG CGCCCAAGTG TTCGACAAAG ATGGACACTC 1380 

ACTTTTCACG GCTGGGATCG AAAAAGAmAC AGTAACAGAA GTACTAACTA TTTTGAGTCA 1440

ACACAATGTT TACTATGAAA TCTCTACAAA TAAAGGTATT TTTTCTGAAC ATCAAGAAAA 1500 

AAGAATTGAA AACTTTGCCG CTCATATTGC AGAATCCATG CCCCATTTAA CTTATAAAGT 1560 

GGCGATTGCC ATGGCTTCAG CACACCTATC TTTATTGCAT ATTACCTATG TTGATCGTTT 1620 

GGACGACATT TTAAAAGATG ATTCGATTGA AGTCCTTAAA ATTATTGGCT TCAGTATGGA 1680 

TGGGCCGAAA GTATTAGGTC CTGCTGGGAT GGAAGTAGAA GAACTAGATG ATTTGGTTGT 1740 

TACCTCTTCT GCGCTTAATA ATATTGAAAT TAACCACCGC CTAGCACAAA AAGGGATTGC 1800

AGTTGCGCGC GTCkCTAAAG AACGTGGGAT TCCCGCTGAA CAAGTCATGA CTATTGGTGA 1860

TAATTTAAAT GACGTCAGCA TGATTCAATG GGGCTGGGGT AAGTTTTGCA ATGGGGCAAT 1920 

GCCGAACTGG AATTAAAAGA ATACGCTAAA TATGAAACAG CCACTAATTa GAAAAgGTGT 1980 

TGGCGArCaT CCgCGTGCAA TAAGAGAAGA TTtGTAATAA GCACTaCCAG TTACAGCACT 2040

CATTTATATA TAGAAAGGAG ATCGTGTCGT GTCTGAATTT TTTATTCAAG AACAACAGTT 2100 

AGGAAAAGTT ACGCGAACCA TCGTGAAAGA CCAAGCAGGC CGTTCACTTT TTCTATTGGT 2160 

GGGGCGCTGG GGCACACGAG GAGATGCGCT TTCTTTATAT GCAATGAACG GAGAAATTTT 2220

AGCCAGTATT AAACAAGTTT CTTGGACTTT TGGCACACGA TTTGAGTTAT ATCAGCGTTT 2280

TGAAAAAGTG GGTACTCTAC GGAAACTGTT CAATTTGAAT GCTGATTTCT ATTATGTTCA 2340 

AGGCCTTCAC TGGGCAGTTG TTGGGGATAT CAAAGCGCAC CAATATTCAA TTTATCAAGT 2400 

CCATAAAAAA ATCATGTCAA TGGATAAAAC CATGCTTTGT ACAGGCGACT ATTTTGTTTT 2460

AATGTTGTC 2469
(2) INFORMATION FOR SEQ ID NO: 604: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604: 



GAATCACCTC AGAGTTAATT TTGTCAAAGC AGGTTTTTCC TAAAAATATA GAAATTACTT 60

CTTTTTTAAA TGTAGTATTT AATGTGGAGT TTAAAAATTA TGTAATGAAA TCTAGAACTT 120

TGATTTTATC AAGAACTGTA CGTGTTATAG AAGGATGCTC AGAGAATGAA TATCAGAATT 180

ATAGAAGAAA ATTATTAAAC TTTGTAGAAG AGTATTATGA GTCGGAAGAA GTGTCAAAAA 240

ATATTTCAAA ATCTTCTATA TCGAAATGGG TAACAGGAGA ATAATAGCAT GTATAATCAC 300

GTTTATTTAA AAGAAGATAT AGATGCTTTA ATAAATGAAT TTGGAAAATT GAATGCTGAG 360

GATGaATCTT TATTTAGaTT TCTAAGTAAA AAGATTATAT TCTTAAAGGA AATAAAAATA 420

AGTATAGTAA ATACTACTGT GATATTTTAT ATAGACCAAA TGATATCGGA TTTAATGTAC 480

TTAATGGCTA GTTATCATAA GGGTGAAGTG AGATATTTTT ATTTAAATAT TAGATCAGTA 540

ATTGAGGCAT TTTCCCGTTT GTTTAGTGAG GTAGAAACAA GTACTAATAG AATAACTATG 600

ACAACTTTAT TAGATAATAT AGCTAACTAC ATTACTTTAA ATGATTTGCG TGATAGTnAA 660

GAGGrTtCTC TGGACTATCC TAGACTAAAA GGTCTTTATA GAGAATGCTG TCTTTACGTA 720

CACGGAAATA TTAATGCnAA ATATTCATTA ATTGAGTTTT ACAACGAGTT GCTTAATGAA 780

GAAATAACTA GTGCTCA 797



(2) INFORMATION FOR SEQ ID NO: 605: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 936 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605: 

TCTGCGTTAT AGAGCTTAAG AGGTAATnAT nCAAnATCAT TGCAATCAAG GTGGTAnCGC 60 

GAGCAATCGT CCTTGACTGC AATGGTGTTT TTTGTGTACA CAATCTGCCA CTTAAGAGGG 120 

TACAAAATAT AAATAGTTAA TTATTAATAG AAAGTAGGAA AATCAATGAA AGAATTAACA 180

AGTAGTCAAG TTCGCCAAAT GTATTTAGAC TTCTTTAAGT CTAAAGGACA TTCAGTAGAA 240

CCAAGCGCTT CTTTAGTACC AGTAAACGAT CCAACATTAT TATGGATTAA CTCTGGTGTT 300

GCAACCTTGA AAAAATATTT TGATGGCTCT GTTGTGCCAG AAAATCCAAG AATTACGAAT 360

GCTCAAAAAT CTATTCGTAC AAATGATATT GAGAATGTTG GGaAAACGGC TCGTCATCAT 420

ACAATGTTTG AAATGTTAGG GAACTTCTCA ATTGGAGATT ATTTTAAAAA TGAAGCGATT 480

CACTGGGCTT GGGAATTTTT AACTGGTGCT GAATGGCTTG CTTTTGACCC AGAAAAATTA 540

TATGTGACTG TTTATCCGAA AGATACAGAA GCAAAACGCA TTTGGCGTGA TGAAGTTGGT 600

TTATCTGAAG ATCATATTAT TGATGTAGAA GATAACTTCT GGGgATATCG GTGCTGGTCC 660

ArTGGTCCGG ATACGAAATC TTTTATGATC GCGGCGAAGA ATTTTTAGAT ATTCCAGAAG 720

ATGATCCAGA AAaTTATCCT GGTGGCGAAA aTGrACGCTA wTTAGrAATT TGGAmCTtAG 780

kGTtCyCyGG AATTTAATCA TACACCAGAA GATACGTACG AACCATTACC ACATAAAAAC 840

ATTGATACGG GCATGGGCTT GGAACGTGTT GTATCTATCA TTnCAAGATG GCACCAnCAA 900

ATTTTGGAAA CTGATTTATT TATGCCAATn CATTCA 936
(2) INFORMATION FOR SEQ ID NO: 606:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 890 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606:

GCTGTTCCGG TTCTTCCTCT AGTTCCTTGG TTaTcAAAAa CGATATCTGC TCGTTCTTTT 60 

TTCATTTCAA TCGGCCATkG GCTAGCAATT CGkTGTTGCG CTTCTTCTTC TGTTAATgGT 120 

TTCGGGCCAT CAGACGCTCT TTnTGAATCT TTTCTGGTAC ATATACCACA GCAACTTGAT 180 

CCATGATCGC CTCATAATGA GCTTCATAAA GCAGAGGAAT ATCAACAATA ACAAGGGCGG 240

CTTTCTTTTT AGCTTCTTCA ATTTGACGTA AAATTTCTTT TCTTAAAAAA GGTTTCAGCG 300 

TTTCATTTAA CAACTCTCTT TTTTGAGGAC TAGCGAAAAT TAATTGTCCT AATTTTTTCC 360 

GATCCAATTC ACCAGTTGTT AATACTATTT CTGGTCCGAA AGTTTCAACG ATTGCGGCTA 420 

AGGCTGGTTG CCCTTTTGCA ACAATTTCGC GGGCGATAAT GTCGCCGTCA ACAATTGGAT 480 

AACCTGCTTT TTTAAAAAGA GCAACAACCG TACTTTTACC TGTCGCAATC CCTCCTGTAA 540

TGCCTAAAAC TTTTGTCATT TCAATCGACG ACCTTTCAAT TGTTGGCATT GTGGACAATA 600 

ATGCGTACCA CGTTGTGCCA CTTTTGTTTT AACAATTGGC GTACCACAAC GATTGCAAGG 660 

TAAGCCTGTT TGCCCGTAGA CATTCAACGC TACTTGAAAA GTCCCTGCTT CTCCTAACGC 720

GTTTAGAtAG GTGCGGAtGG TGGTtCCACC TGCTTCAACA GCGCGTGCCA AGACGTCAAT 780 

GATTGCTTGA TACAACGTAG CAACTTCCGC AGTTTTAAGG AATCTGCTGG TTGTTCTGGA 840

TGGATTTGCG CTTGCCAAAG TGCTTCATCT ACATAAATAT TCCCCAAACC 890
(2) INFORMATION FOR SEQ ID NO: 607: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1067 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607:

GCCAACGTTA GGGAACCAAC CAAGGCGTTT nGGACCAACT TTGGAAAACT GACGGACTAA 60

AAGGGCCTCC nGATTATCTA AnGGCTAAGG CCnTAAGGGA ATCCGAATCC CCTGTCATTG 120

TGTAGCCGTT TAACGCACCA TATACAGAGA TTAAAATCCC AATAGTTACG ATTTTGCCGC 180

CAATTGGTCC AAAAATAGCC ACAGAAGCAT CCGAAGCTGC ATTCAAATTC CCAGCGATTT 240

GCTCAATTGG TAATGTTTTT AGAAACACAA AATTAATTAA GGCATAAATT AAAGTAATCA 300

ATAACAAACC AAAAATAATG GCTTTAGGCA AGTCTCTTTC TGGTCGCTTC ATTTCACCAG 360

TCACGTTACC CACACCCAGC CAGCCATCAT AAGCAAACAT GGTGGCGACT AAGGCGCCAC 420

TAAAAGCAAC TAAAAAGCCA GTATTCGcTG TTGTTTCAAC AGGAAAAAGT GAAACAGCTA 480

CTTGTCCGGG AGTAAATAAT CCTACTAAAG AAATcAAGGC AATTGGAATC AGTTTTACTA 540

CCAATGTCGT TGATTGGACC AAAGAAGCAA TTTTTGTTCC TAATAAATTA ATTATCGTGA 600

TACTGGTTCC CGCAAGAATG GCAATTGGAA TCAATAAATT TGCCGATAGA TGAAACAAAT 660

TGATTAGTTG TGTACTAAAA ATAATGGATA ACGCCGAAAT GTTGGCAGGA TAATAAATGA 720

TACTTTGTGC CCACCCTAAA AGAAAACCAG CTAATTTACC ATATGTATAT TCAATATACT 780

TCACTGCGCC ACCTGTTTCA GGGATTGCTG TAGCAAGTTC GGCGCTGGTT AATCCTGCAC 840

AAATGGTTAA gGCACCACCT AACACCCAAG CAAAGATAGC TArACTAGCT GACTGTGCGT 900

GTCCAACGAC ACTCGCAGCT TTAAAGAAAA CACCAGCGCC AATCACCGTA CCCATTACCG 960

TGGaTAACGc ACCAAACGTT GTTATCTCTC TTTTTAATTG CTTTTTCTCC ATrGcTCCCt 1020

CATTTCATTT AAACTTAAAA AGCGTAACAA AATTTCCACT TTTTTGG 1067



(2) INFORMATION FOR SEQ ID NO: 608: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608: 

TACCCGTCGC CAATGGCTGA CAATGGATAT GATATTtCAG ACTATTATGG TATTTCTAGC 60

GATTTTGGTA CAATGGCAGA CTTCGATGAA CTCATTGAAG AAGCAAAAAA ACGAAACATA 120

AAAGTTATTT TAGATTTGGT GGTAAACCAT ACATCTGATG AACATGCTTG GTTTCAAGAT 180

GTTTTAAAAA ATCCTCAAAG TCGTTTTCGA GACTTTTATA TTATAAAAGA AGGACGAGAA 240

GCACCTACAA ATTGGCGGTC TAACTTCGGT GGTAGTGTTT GGGAAAAATT ACCTGGGGAA 300 

GATGCCTACT ACTTTCATGC CTTTCATAAA AAACAGCCGG ATTTAAATTG GGAAAATCCT 360 

GAACTACGCA AAGAAATCTA TCAAATGATT CGTTTTTGGT TAAACAAAGG AATTGCTGGT 420 

TTTCGTGTAG ATGCGATTAA TTTTATTAAG AAAGATTTGA CTTGGACAAA CTTACCTGCT 480

GaTGGGGcAG ATCAATTAGC CAAAGTAACA AAAGCTAGTA GAAACATGCC AGGAATGAGT 540 

GACTTTCTGA ACGAATTAAA AGAAAAAGCT TTTGCTGGCT TTGaTAtTGT CACAGTTGCC 600 

GwArCGGCCG GCGTCAATTA TCTAAATTTA TCAGAATTTA TTGGAGAAAC AGGGTATTTC 660 

GACATGATTT TTGACTTCAA aTGGGCTGAT TTGGAwGtGA AATCAGGAAG CGAATGGTTT 720 

TATAGAATCG ATTGGtCTTG GAATGATTTA CGCACATTAA TTTTCAAACA GCAAGAAGCC 780 

ATGCAAGAnG C 791 
(2) INFORMATION FOR SEQ ID NO: 609: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609: 



AAGACGACGT tCCAATGTGT AAwTGAACAC AACCTGTTTT ACAGTGGTGT TGCCTAATAC 60

GGATTGCTAC TTGCTCAGCm ATTTCTTTTA CGACCAGTyC GATTTCATTT CTTCTAGTAT 120

AATCTCGTGG TAACACTTGT GAATTGCCAT AAGATTTTTC CTTTGACGGT TCTAGTGGTA 180

TTGCTATGTC AGTTCGATCA ATACCATTTG CATGGAAGTA CAGCTGTAAC CCAATCACGC 240

cTAAACGGCT CTTAATGGTG TATGGATTCC AATTAGCTAA ATCTTTGATA CTCATGATCC 300

CCATTTGGTT GAGCCTTTTT TTCATTCGAG AACCGATACC CCAAAAGTCT GtCATTTCAG 360

GAATATTCCA TACTTTATCA GGAACATTTT GATACGTCCA TTCGGCAATA AATCCTTCmT 420

TGkTCTTCGC TtCGTTGkCT AArGCAAGTT tGGCTAACAA GGGATTATCT CCGACACCTA 480

CTGCAGCAAT CAATCCTAGC TCTTCTTTAA TACGTTCTTG GATCATTTGA GCGAGCTTCT 540

TTCTACGTTG GCTTCGTGTT CCTTCAGTCG TAAAAAGATT CAGTGATCGG GTCACTTTTA 600

AGATTGATTC ATCGATCGAG TAAATCAGTA GATCTTCATC AGCCACATAT CTTCTGAAAA 660

TATTATTTAC CTGCATATTT CGCTTGATAT ATAGTTTCAT ACGTGGTGGA ACAACATGTA 720

GTGTTTTAGG AAATGGTTGT GGTAAAtCAC GkGGTCTACT CACATTtGTA AtACCATACC 780

GCTTTTTkGC TTCAGGAGAA GAAGCTAATA TCAATCCTGA ACCAGTATTG TCACCTCGAC 840

TCATAACAAC AAGTTCTGTT GTTAATGGAT CTAAATTTCT TTCTATACAT TCGACACTCG 900

CATAAAAAGA CTTCATGTCG ATTAGAAAAT AATCATTTAC TGGTTCTTTC GAATAATCAA 960

ACATGAATAA CACCCCTTCT TTTTACATTA TACAAACGTA TGTTCGATAT TGTAAAGAAG 1020

AAATAGCTAA ACACACAAAA AAAGAAGCGT AGGTAAACTC TTACCCTGCT TCTTCTTTTG 1080

ATTTGTATGA TCAACTCATT GtATCACTTT ATTAAATAAT TTGAAAATcT AATaAAAACT 1140

CTCACTCTAA AAGAGTGAGA GCCTGCAAAT TAGAGTACGG AATTCTTTGA GAGTTCCGTT 1200

CCTTAATATT AACAAATGGT CTTAAAGCAT ACAAGGCGAA TTA 1243



(2) INFORMATION FOR SEQ ID NO: 610: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610: 

TAGATGnAAT nTGAAGTGAC TGGTGAAAAT CGACATATCA TTAATGGCCA TACACCAGTT 60 

AAACGAACAA AAGGCGAATC GCCAATCAAA GCAAATGGCA CGCTTCTCGT AATTGACGGC 120 

GGCTTTTCAA AATCTTATCA GACAATTACT GGGATTGCTG GTTATACATT GTTGTACAAC 180

TCGTTTGGTT TACAACTGAC CGCGCATAAA TCTTTTTCTA GTAAAGAAAC AGCCATTTTA 240

AATAATCAAG ACATTCATTC CATCAAACAA GTCATCGATC GCCCCTTACA GCGGCTATTG 300 

GTGAAAGATA CCACTATCGG CAAAGAGCTA TTAAAGCAAT CGCAAGCACT CCAAAAACAA 360 

ATGAAACAAA ACCAGTAGTG CCGTTAAAAA GACGCTTGCG CTCTTATCCG AAAAATGATT 420 

AAATGGaGGA GTTGAGAAAA ATATCTATTA GAGAAACGGA AAGGGTTTtA CATTGAAGAa 480 

CAGCmACTCT CACTTTACGG CACCATTTTC TTTCTTTGCT TTnTCCTAAT TGTGGGCTTC 540 

TTCTnCTCAA GTCGTGGCGA AGATTATGCC AAAGATCTGA ACAAAA 586
(2) INFORMATION FOR SEQ ID NO: 611:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611: 

GCACGCrCAA CTTTCAAGCA AAAACGAATT GGGAACTATA AAGAAACAGC CGATTGTTAT 60 

TCTCTTCAAT CGGCTGTTTC TTTATTTATT TGCCTTGTCA TCAGTGTTTT CTTCTGCAGC 120 

AATCCGCGCT TGAACTTGTT GACGTAAAGC TTCTAGTTCT TTTTCTTTCT TTTCTAATTC 180 

AGCTAAATCT TCAGAAATTT CATGGGCTTC TTCTGCTGGC TTAGGTTGTA AAAAGACGTT 240 

TAAACCAATC GCTATAAAGG TCCCAATGAA TGTATCCAAC ACACGTTCCA TTGCATATTG 300 

aAAGGaATCG CTTTGCGGaA TACTTAAAGA AATCATTaAA AGTGTCGCAG TTGCaGAAAT 360 

GATCCCAGCA TTGTTrTTAA TCCCATCAGA AACAACGaTG aCGaTAATGA CTAAAAGTGG 420
(2) INFORMATION FOR SEQ ID NO: 612:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612: 



TTTTTCAATA AATAgTTATA AAGATAATAT TACTAGAATT AACGAAGAAT GGAATCAATA 60

TCAGTCAAAA GAGACCATTA ATAAAATAGT AGAGTTGGTA GAGAGAATCT TGGGGGGATT 120

TTGTGAAAAG TAAGGTACGC ACTGAATATA TTGGGGGATT ACCAGTTGAC GTATTAACAC 180

ATCAAGAAAT TATGGATGAC TTAGAAAGTT ATCTCAAAGC TAAAGAGAAA ATGATTGTmA 240

CTAGCGTTAA TCCGCAGATT TCATTGTTAG CATTTGAACA TCCAGAAGTT AAGTTATATT 300

TGAAAAGAGC AACTCATCGG ATTGCAGATG GGATAGGTGT GGTCAAGGCA TCGAAGCTAA 360

TGGGTGGGTC TATTCGTGAA AGAGTTACAG GAATTGATGT CaTGGATAAT TTTTTAGAGT 420

ATGCAAATCa ACATaAGGnA CGAATCTTCT TGTmTGGTGC aAAGCAAGAA GTAGTTGAAG 480

CAGCAGCTAA AAATATTCCA CGACCATATC CAGGGATTGT TATTAGCGGT ATTTTACATG 540

GCTATACCAA GGAAAAACAG CAGGAATTGT GGACCAATTA ACCAAGCTGA ACCAGGTTGC 600

TTTGTAGCAT TGGTCnCCGA ACAAGAGTTT TTGGACAACA TTGTCnCCTG ACACTATGAT 660

TTAGACGAGC GGnCTTGCGT TACTGGn 687



(2) INFORMATION FOR SEQ ID NO: 613: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 889 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613: 

CGGCTTGAAT TTTTTATTCT TCTATAGTAG TTACAGCAAC TTTGGAAATC TTTTCTACAA 60 

AGGCATCAAG TAAATCGACA ACTGCTTGTT GTGTTTCGAC TTGGTTAATC GCAACTTTCA 120 

CTTTGGCTGC TCGCGAGATT CCTTTTAGAT AGTAAGAAGC ATGTTGGCGA AACTCACGGA 180 

CCGCAATTTT TTCACCTTTT AAATCCACTA ATCGTTGTAA ATGAACTTTT GCAATGTTAA 240

TTTTTTCAGC TGGTGAAGGC TCAGGCATCA ATTCACCCGT TTCCAAGTAT TCTTTCGTCC 300

GTTGAATCAT CCAAGGATTG CCCAAAGCAG CTCGACCAAT CATCACACCA TCCGCTCCAA 360

CATATTCTAA CATGCGTTTA GCATCTTCAG GCGTTTTGAC ATCGCCATTT CCCATAAAAG 420

GAATGGTTAA ATGCTTTTTC ACTTCCTTTA AAACGTCCCA GTTGGCTGTC CCTTCaTACA 480

TTTGGACACG TGTCCGACCA TGCATAGCAA tTGCCGAAGC CCCTGCTTGT TCGGCAGCCA 540

AAGCGTTGTC TACAGCATAT AGATGATCTT CATCCCAGCC TGTTCGCATT TTTACAGTGA 600

CCGGCAAAGA AACTGCCGAA GAGACTGCGT GAACCATCTC ATGCACTTTA TTCGGATCCA 660

GCAACCATTT CGCACCAGCT TCAgCTTTAA TTATTTTATT GACAGGGCAG CCCATGTTAA 720

TATCAATAAT CGCCGCTTGG TATTTTCTTC CACAAACTTA GCAGCTTCCA CTAATGTTTC 780

GTTATCTCCA CCAAAAATTT GTACACTTAA AGGATATTCC CGCTTCATCA ATATATAGCA 840

TTTGCTAAGG TTTTTTGGTn TCTTnATTTT AAnGCCTTTG GTCACTAAT 889
(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614: 



TGGTGATGAC TTTGTTTAGA ATACAAGTAG GATTTTGCTA CGACCGAAAT AAACATAGCC 60

ATAATCCCCA AGCCTACTGT AGCATAAATC ATGGTGAAAA TTTTTCCCAA GTCAGTAACT 120

GGATGCACAT CTCCATAACC AATCGTGGTT AACGTCATGA AACTTAAATA TAACGAATCT 180

AATGGCGACA ATTTTTCCAC TGTTGAATAA AAAATAGTCC CACTCAACAA CGTGGTTGCC 240

AGTAAAAAGA ACAAGGCACG AGTGTCTTCT TTTTTTAAGA TACCTAAAAA AGATGAAAAT 300

AATTTTTTGA CAGAAATAAG TAAAGATAGC AAAAAAATGA CCTCTTTCTA TTGTTAATCT 360

CTTCTTAACG TTAGCACTTT TTAGATGAAG TTCAACTTTT TAATTGGTGT GGTTAAAGAA 420

AATTTATTTT TTTGTTATTT ATAATACACT TCATCTAAAA ACAACCCTTG TGCAGGAGCC 480

GTTTCACCTG CGCCTTCTCG GACTTTTGTT GCGAACAATT CATCAATTAT CGCTAATTCT 540

TTGGTGCCTG CACaATTTCT AAAAGTGTCC CAACGAGGAT GCGGACCATT TTATGCAAGA 600

AACCATTGCC CACAAAAGTG AACTGTAACA TTTCGCCGTT TCTTTCAATC GTAATCTTTT 660

CAATCGTTCG AGTAGTCGAT TTTTTGGtTT TTTTCAATGC TGAAAAACCA ATaAAATCAT 720

GGGTCCCTAT TAATTTTTGG CAAGCTTGTT CCATTTTTTC TTGATCCAAC GCTTTGGGAA 780

AGTGAAAACT ATGATACCGT TCAAAGGcAT TAGGAATTgG GcTATTCCmA ACATAGTAGC 840

TGTATTGTTT GCCTGTGGCG TTATACCGTG CATGAAAACG TTCAGGCATT CTGGAACTTC 900

TTTAACAACA ATATCGCCAG GTAAATATCG ATTAAAAAGG ATAACATTCC GGTGTTGTAA 960

GTGACCAGnC GTTTAAAATC GCTACTGGAC CACGG 995



(2) INFORMATION FOR SEQ ID NO: 615: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615:

TCAACCATTG TCGGCGCCAA TGCGTTTAGT TTATATTGGC TTTTGTTATC TGAAACAACA 60

GACTCAGAGG AGAATGGCTC GCTGTTTCAT GCAGATTTAA TTAGTTTGCT GGACTTGAGT 120

TGCCAACAAT TAGAAGAAGC TTGCTACAAA TTAGAAGGCA TTGGGTTGCT AGAAACGTAT 180

AAAAAAACAG ATCGGGAATT AGGAGATTGT TATTTATATT ATTTAAAAGC ACCTGAAACA 240

GCCGCTCGTT TTTTTAAAGA TGAAGTGTTA GCTTTAGTGT TATTTAATCG GGTTGGaCAA 300

CGGAAATTTG ATCAATTAGT GCAAAAGTTC CAACCGCATC CTACTAAAAC AGAGGGCTAT 360

CAAAATGTTT CCGCTAGCTT TCaAGArGTT akgCCTTarr GrAgACaAAT TGTTTcCGAA 420

GCGAAcCGrT TGACcACGAT ccAGrAACtT TTTCTCAAAA G 461



(2) INFORMATION FOR SEQ ID NO: 616: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616:

CAAAATTTGT TAAAACGAGA TATTGCCAGT nTTTTAAGAC AAATGCACGG TTTAGATTAT 60 

ACAGATATTA GTGAATGTAC TATTGATAAT AAACAAAATG TATTAGAAGA GTATATATTG 120 

TTGCGTGAAA CTATTTATAA TGATTTAACT GATATAGAAA AAGATTATAT AGAAAGTTTT 180

ATGGAAAGAC TAAATGCAAC AACAGTTTTT GAGGGTAAAA AGTGTTTATG CCATAATGAT 240

TTTAGTTGTA ATCATCTATT GTTAGATGGC AATAATAGAT TAACTGGAAT AATTGATTTT 300

GGAGATTCTG GAATTATAGA TGAATATTGT GATTTTATAT ACTTACTTGA AGATAGTGAA 360

GAAGAAATAG GrACAAATTT TGGAGAAGAT ATATTAAGAA TGTATGGAAA TATAGATATT 420

GAGAAAGCmA AAGaATATCm AGATATAGTT GAAGAATATt ATCCTATTGa CtATTGTTTA 480

GGAATTAACA ATATTAAACC GGAATTTATC GAAATGGTAG AnAAGAAATT nTAAnAGG 538 



(2) INFORMATION FOR SEQ ID NO: 617: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1164 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617: 

GCAATCAAAC GCGTACGTAC AAGTGCTAAC GCGAATGCGA AAAATTCATC TCAAACAAAC 60 

GCAATGCtAC AGaATCAAGA AATTTGAAGA AGCTGTAGCT GCTGGAGCTG ATAACGTCGA 120 

TGCTTTATAC AACGAAGCTG TTAAAGCTGT TGATATGGCT GCAACTAAAG GTTTAATCCA 180 

TAAAAACAAA GCTAACCGCG ATAAAATTCG TTTAAGCAAA TTAGCTAAAT AATGTGAGAT 240 

AGAAGTTAGC ATTCATGTTA GCTTCTTTTT TCATCCAAAA ATGTCCAAGT CTGTTTTCAC 300

AAAGACTCGG ACATTTTTTT ATAAAGGTGA ATCAATCCAC GGTAAATCCG TTGTGAGATG 360

TTGACTAAAT AATTTAGTCT CTTCTCGCAA AGCTAAACGT TGTTGTCGTC TCGTTTCATA 420

ATTGGAACAA ATCAACTTTT CTTCTGCAGT TTCTGGGACG ACTGCCGCTA ATTGGAAGTC 480

ACTTTCGGGA TTCATATGAG ACGGAATGGC TACAAAAGTT GTAAAACAAG TCGCCGCTAA 540

ATAACGCTCT CCAGTTGTTA AATCTTCTCC TACAACTTTA ACAAATACTT CCATCGATTT 600

ATGATGCACG CCAGAAACAA ACGTTTCAAC ACACACTGAG TGATTTTCTT TTAAAGGTTT 660

TAAAAAATTC AAGTTATCTA AAGATGCTGT AACTGCCCCT CTCCGACAAt GTCGTGAAAC 720

GGAAATGGAT GCGGCGTCAT CAATTAAAGA CATTAATTTC CCACCAAATA GTGCACCAAA 780

GGGATTTAAA TCAAAGGGrA ATACACGATG TGTTTGAATA ACTCGTGATT CTCGACAAAA 840

1111 1L1 1 nTPATTAACG ATTCCTCTAC TTTCATTTAT AAACTTTTCC ACGCTTTACT 900

TTAGCTTACT TTACCATTgA TGCAATCCTT ACTTTTTAGG AAACATTTAA AAAATAACAA 960

ATTTGCAAGT CCCATTTATT CATTTTTAGG AAAGTTGCTG TATAATAATA ACGGTTAAAG 1020

AACTTTTAAG GAGATGGaTA CACaTGAAAA wTGCaATTGC TGGTGCCGGC GCTATGGGCA 1080

GCCGTTTAGG AaTTAkGTtG CATCAaGcGG rAATGakGTA ACTTTAATCG ATCAATGGCC 1140

CCGCGCACAT TGAAnCTATT CGTA 1164



(2) INFORMATION FOR SEQ ID NO: 618: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618: 



ATTnTCCCAT TCCTTGTTTC AATAAAGCAC TGGAATCTTT GACCGGACGT GTAGTTAACT 60

AAATATCTAT TACTAAACTC TTCTTTCGTC GAAGTATACC AATTTCTGTC AAATGTAGAG 120

TAAAATAAGC ATTCTCTATA TTTTTTCTGA GCTTTTAAAT AATAATCTGA GGTATCTTCT 180

ATCCCTATAT GAATATTCAT TTCCCTATAG TCTATATCAT CAGCAACATA AAACGACAAA 240

TCATTTTCCG AAAACATAAT AGCGGGTATA AAATCTCCTT CGCCAAAAGC TCCTAACAGT 300

CCAGGTTCCG ATAATTGCAT TTTAGCTTTC ATAAATCTAG GTAAAAATAT ATCATCAGTT 360

TCTACTTGTG AGATTGCTCT AAGTTCCATT GCGTTAAAGT AAGCCATTGG AATATTTCTA 420

TATATATTAC TTAATTTAAG TGAAATATAC ACACTTCCTC CTAAAGTATG AACTATTATA 480

ATTATGAACA ATTGATTATT CATAATATTA TGTTTATTTA GTTTTTCTAC TATTAATTCT 540

ACTCTGCTGG TTATTTTTTC TGTATAATCA ATATTTTCTT CTATACCACC AAAAATACCA 600

AAAGCTATAA GTAATTTATC ATTGGCAATA TTAAAAATTG CCTCCTTATA TTTTCTTTCA 660

TCCAGTAATT TTATAGCAAA ATTATTCTCA GCAATTTTTT CATGTTTCAA TCTCTTAAAA 720

GACTTTCTTA TATCATTCCA CATTTCTTCA AATACGTATA CTTCAGATAT TATTGTGGCA 780

ATTTTACTGG ATAATGCGTT AGCGACaGAT GTTATGTCTA ATATAAGTAC GCCTTCCCCA 840

GTATCCAGAT AAaGGCTTAG TTAAAAAGAG GATGTTGGTT TTGGGAAAAT GGTTTTTGCT 900

TTTCTAAATC AACATCTTCC TTCAnTATTG GATATTnCAT GTATTCCAGG TAATTnGTTA 960

ACTAAAAAGC ACCAAGTTTC CATAAAACC 989
(2) INFORMATION FOR SEQ ID NO: 619: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2013 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619: 

ATTGATCAGT GAATATTATG TTATGTAnCG nTTCTTTAAA CCAATTGCTC TTGATGAAAn 60 

ACAAAATTTC TCTAGTGTTC AGCTATTCCT TAACCTTTTT ATATTTAATG CTTTATTCAT 120 

GTTTGTTATG CTAACAAACT TTCATCAAGT TTTTGTAGGA AAAGAATTTA TTATTTTTGG 180 

TAGTTTCATG GGTGGAATAT TTACGATTTG CTTAAAGGAA CAACTAAAAG AATACCAGAA 240

GCTAATGAAT CAATTTATAG ATATAAAGAA AGAAGTACAT TGATGAAAAA TAGAATTAGT 300 

TAGAACAGCC ATGTATGGTG AAATATTGTA GTATATTACT TGTTAAGGAC GATGAACAGG 360 

ATGAGTATCA AAAAATATTT ACTTTTTTAT ATTATTAGCA CATGTGCCAT TATTTTTGTT 420

TCTCTTACTT TTTTCTTTTT ACCTCTAAAA TCGATTGATT CTAGGAGTTA TGAAGTACTT 480 

CATACGATAT CTCAATATGG AGTAATTTCT AATAAAGATG AGTGGAGGAA TATTTTAGAA 540

TTAACAAGAT ACGGTAGAAA AGTAACTAAT GTGGATAATT TAAATACATT AGTATATGGT 600 

GTTAACAAAC ATAGTTCTGT AAAACAAATA AGTAGTGACA AAGATATGGA TAATATGGAA 660 

ACTCTTAAGT TACCAAGTAT TTCAAAGTAT GAGGATCTTA CAATTATTAA CATTCCAAGT 720 

GTCTATAGCA ATGATGATAA CTTCTCAATT AAATATGCAT CAAAGTTGAC AGATTTAATC 780 

GAATATACTT CTGGTAACAT TGTTTTAAAT TTATCAAATA ATTATGGCGG GCTGAAAGAA 840 

CCTATGATTA TTGGTGCGAG TTCTTTAATT CCTAATGGAA TGCTCTTTAG CAATATTAAT 900 

AATAAAAAGG AAAAGTATCC TGTTTACTTA AAAAAATGGT AAATTATTTG GTGGCATTCC 960 

AGGAACGATT AACGAACTGA ATCTTTATAA ATTATCATTA CACAAAAAGA AACTAGGGAA 1020 

AAAAGTTGCA GTTGTAATAA ATAATAACAC AGCAAGTGCC GCTGAGAGTT TACTTTTAGC 1080 

GTTAAAGAAT AATCCmAATG TAAAAGTTTT TGGTATGCCT TCTGCAGGAT ATACTTCGGT 1140

TAATyTAGGA CGTTTTTTTA CTAATCAAAA TTCCGATAAT TATTGGTGGt TTGTATATAC 1200 

TGTTGGCTAT TATGAGACAA TAAAGCCCAT AAAGGGAAAA TGATTGTTTA ACAATCAACC 1260 

TGTCCCGCCA GATTATTATG TAGATTTATC GATGTTAGAT AGAAATACTC AAGATATTAC 1320

ACAATTTTCT AATACCAAGA AACTATTCAA AGAAATAAAA AATTsGATTA ACTCTGATTG 1380

AGTTGAAGGT ACATTAAAGT AAATGAATTC TTAATGTATA GTATAACTTG TAAGCATAAA 1440

ATTTTTATAG GAGAAAGGCT TTTGCTAAAA TTATATTGTG GTAAAAGTCA CATGAAAGTG 1500

GGGGGGAGAA TCGATGACGA AAGGGAAAAA AATTTTGTTT AGTTTTATTT CTTTAGTTTT 1560

TGGAACTTTT TTATTTGCAG TATTTGTTCA TTATCAGTTC ATGTATTCAT TACCTATTTA 1620

TTTTATTAAG TTTGAAGAGT ATCCTTTTAT AGGGAATTAC ATTCAACTAT ATTTGTACTG 1680

GGGTAGTGTT ATAGGTATGG TCTTTATCCT TTTATTTATT TTAGGGATTG TTTTAAGTCC 1740

AGCAGAAGTA ACATCCGTGA AATTAAACGA TGACAAAGGA ATATTAGAAA TCAAAAAAAG 1800

TGCTATTGTA GGAATTGTTC AGTCGCAATT AGATCAGTCC AATTTATTAA GGGATTCAAA 1860

AGTAAATGTA AAAATGTATA AGAAAAAGAT AAAAGTACGT ATTACAGGAA ATACAAGTGA 1920

CAATCTAGAT ATTATAAATC AAACGAATCA ATTAGTAAAA AATATTGAGT TATATCTTAA 1980

AACTTTTATA GGATTAGATA CTTCAATAAA AGC 2013



(2) INFORMATION FOR SEQ ID NO: 620: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620: 



ATAAATTTCT GGTGTTnAAA AAGTTCCTTT AATTTCTTTA TAACCTAGTA TAGATAAnTC 60

ATCACTnATA TACGnATATT CAATATTAGG AATTTTTACA TTAGTTTCTA AATTTGTATT 120

TAAAAAATTA TATATTGCTT TTTCTTTTGC ATAACCTTTT TTCTTATTAG TACTAAATTT 180

TGTTTTAAAA ATGTATTCAT TATTAACTAA ATATGCCACA CTATCATAmC CACTACCGAT 240

TATTTCmATA CTATCTACTT TGAAATtATC aAAGTAATGC TCAATTAAAT ATTTCaTTGC 300

CTTAACATTT gTGGCATTAT CATCATATCT ATATTCCATT AAATAACAAT CTTCTTTTTT 360

GCCCTCGTGT AATTCATGTT CTGGCAAATC TTCAATAATT CTAAAACCAG ATTTTTGGTA 420

TGCCCTTATT GCTCTTGGAT TATTTTTATG AGGGTCTAAA ATAACTGCAT TAGCATTTCT 480

TTCTTTTTTC AAAAATTCAA AAATCAATTT AATATATCTT GTACCAATTC CTTTACTCCA 540

ATAATTTGGC TCTCCTATAA ATTGATCCAT ACCATAGACT ATCTCATCAG TTTTTGGATA 600

ATGATAATCA GTATATAACT CATCATACAT TTTATATATT TGTCCATATC CAATAGGAAC 660

ATTGTTATAT TCAATAATTA CTCTAAAAAC TTCATCTTCC CAAGGCTCTG TAf AAT.GTTT 720

TTTTAATGAT TCTAATGTAT ATTTTTTATC TCTACCACCA TAAAATTCTA ATACTCTTTC 780

ATCAGTTAAC CATTTTAACA TCAAAGGaAA ATCATCATCT ATTAAAGTTC TTATACATAT 840

TTCATTTTCA ACTATATTCA TTTATTTATC ACCTTTTTCA TAATCATATA CATATACTAT 900

TTCATCTTTA TAATCATTTT TACCACCTAA TTTTTCATAT ACATGGCAAG CTCTAGGATT 960

ACCTTTATCA GTTATTAAAA ACATTTCAGA ACAACCAATC TCTTTAGAAT ATTCCTTAAT 1020

AAAAGATAAT AATTXTCAAC CATAACCTTT GTCTTGATAG TTAGGTAACA TTCCTATTGA 1080

GTGTAAATAA AACATTGTTT TTCCATCAGG nCTTAAAAGT GTATAGCAAT ATGCAAATCC 1140

TATAATTTTA TTATTTkCTT TAGCTATAAA CCCAAATGAA CTGGtATCAT TAAGAAATCC 1200

TTTTAAATTA TCAATATCAA AAACCATATT ATCATCAATT AAAACTTCTT CCATAAACTC 1260

AGTCAAGTCC AGACTCCTGT GTAAAATGCT ATACAATGTT TTTACCATTT CTACTTATCA 1320

AAATTGATGT ATTTTCTTGA AGAATAAATC CATTCATCAT GTAGGTCCAT AAGAACGGCT 1380

CCAATTAAGC GATTGGCTGA TGTTTGATTG GGGAAGATGC GAATAATCTT TTCTCTTCTG 1440

CGTACTTCTT GATTCAGTCG TTCAATTAGA TTGGTACTCT TTAGTCGATT GTGGGAATTT 1500

CCTTGTACGG tATATTGAAA GGCGTCTTCG AATCCATCAT CCAATGATGC GCAActTTTG 1560

AATATTTTGG kTGATCGATA TAATCATGAA TCAATCGATT TTAGCCTCAC GCGCnAAGTT 1620

AATACTGTGG ACTTAAAAAT CCCTAAnAGC GTCTCTGAAG AATTTGAATT GTTTTA 1676
(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621: 



ATCCCCTTAG TATAAATACA AACTCAAAGG AGAGACAAAA CATGTCAATT ATTACTGATA 60

TTTACGCACG CGAAGTCTTA GACTCACGCG GTAACCCAAC AATCGAAGTA GAAGTTTACA 120

CTGAAAGCGG AGTTTCGGCC GTGGAATGGT TCCTTCAGGA GCTTCAACTs GTGAATACGA 180

AGCAGTTGAA TTACGTGATG GCGACAAAGC TCGTTACTTA GGTAAAGGTG TTACTAAAGC 240

TGTAGATAAC GTAAACAACA TCATTGCTGA AGCAATCATC GGTTACGATG TACGCGATCA 300

AATGGCTATC GATAAAGCAA TGATCGATTT AGATGGAACT CCTAACAAAG GTAAATTAGG 360

TGCTAACGCT ATTCTTGGTG TGTCTATCGC CGTAGCTCGT GCTGCTGCTG ATTACCTAGA 420

AGTGCCTTTA TATCACTACT TAGGCGGATT CAATACAAAA GTATTACCAA CTCCAATGAT 480

GAACATCATC AACGGtGGAT CACATGCTGA CAACAGTATT GACTTCCAAG AATTCATGAT 540

TATGCCTGTA GGTGCGCCTA CATTCAAAGA gCTTTACGTA TGGGGTGCAG AAGTATTCCA 600

CGCATTAGCT yCAaTCTTAA AAGGTCGCGG TTTAGCwACT TCAgTAGGTG ACGAAGGTGG 660

TTTCGCTCCT AACTTAGGTT CAAACGAAGA AGGTTTCGAA GTAATCATCG AAGCTATCGA 720

AAAAGCTGGC TATGTTCCTG GTAAAGACGT TGTTCTTGCT ATGGATGCTG CTTCTTCAGA 780

ATTCTACGAC AAAGAAAAAG GCGTTTACGT TTTAGCTGAC TCAGGCGAAG GCGAAAAAAC 840

AACTGAAGAA ATGATCGCGT TCTACGAAGA ATTAGTTTCT AAATACCCAA TCATTTCAAT 900

CGAAGACGGC TTAGACGAAA ACGACTGGGA CGGATTCAAA AAATTAACTG AAGTATTAGG 960

CGACAAAGTT CAATTAGTTG GTGACGATTT ATTCGTAACA AATACAACTA AATTAGCTGA 1020

AGGTATCGAA AAaGGTATCG CTAACTCAAT CCTAATCAAA GTTAaCCAAA TCGGTACTTT 1080

AACTGAAACA TTTGAAGCAA TCGAAATGGC TAAAGAAGCT GGCTACACTG CAGTTGTATC 1140

TCACCGTTCT GGTGAAACAG AAGATTCAAC AATCTCTGAT ATCGCTGTTG CAACAAACGC 1200

TGGCCAAATC AAAACTGGTT CATTAAGCCG TACTGACCGT ATTGCTAAAT ACAACCAATT 1260

ATTACGTATT GAAGATCAAT TAGGCGATGT AGCTGAATAC aAAGGCTTAA AATCTTTCTA 1320

CAACTTAAAA AACAAAaTAn TTCATTAAAT yTAAAGGGrC mATTGGTTAA TGAmCCAATT

CTCCTTATAC TCGGACGTAA AAACcGCCTT TCTGCAAAGA AAGGCGTGnT 1440

nTTTTATATT TATAAAAATA TTGAAAAAAT AGATTAGAAA TAACTTCAAC CGTACATCGC 1500

TCTATAATCA CTTGTTTAAA AATAACAAAA CATTAACAAA ACAACATTAT ATAAAAATTT 1560

AATAAAACAA AATAATTGTT ATTTTATAAT TTTAGTATTG ACTGGTTTCA GATAACGCTG 1620

TACGCTTAAA GAAAT 1635



(2) INFORMATION FOR SEQ ID NO: 622: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622: 



AGAGGGTGCT TTGGTCCGTG TTCAAGCCTT TGCAACTATT AATGGAGAAG ATATTACCAC 60

TAGTTTAGCG GCGGACGCTT GAAATCTCTA AAATCAGTTG GTAGTAAAAA TCAATTGTCC 120

ATTTTACAAA TTCAAGAAGA AGTATCCAAA TATTATCATG TGCCTCTTAA AGATTTAAAA 180

GGGAAAAAAC GGGTGAAAAC AATTGTGGTT CCTCGTCAAA TCTCAATGTA TCTTGCTAGA 240

GAAATGACCG ACAATTCCTT GCCAAAAATT GGTGCCGAAT TTGGCGGCAA AGATCATACA 300

ACGGTGATTC ATGCGCACGA AAAGATTCAG CAATTATTGG AAAAAGATCC AGCGATTCAA 360

AAAGAAGTCA GTGAAATTAA AAATTTATTG AACTCTTAGG ATGTGGAAAA CTAAACAAAA 420

AAGAACAAAG TTATCCACAA CTTATGCACA AGTGGaTAAC TTAGTAAAAT CAATGCTCTT 480

TTGACTTTTC CACAGAATCA ACATGCCCTA TTACTTTTAT TATTTATTTA TATATAATAT 540

ACAAAACAGT GGCGTACAAA ATTgGAmCGT ATTGAAAATA CTTAAAAAGG AGTTTACATT 600

ATGaAATTAA CGGTCnAACG AAGTGtCTTT TTAcAAGAAT TACAAACTGT TCaACGAGCA 660

ATTtCTTCTA AAACAACGAT TCCcAATCTT AACAGGTGTT AAAATTGTGT TATCAGAAGA 720

TGGCTTATCA CTTACTGGGA GTAACGCGGA TATTTCAATT GAAAGTTTTT TAAGTAAAGA 780

TGATGAAAAA GCTCAAATGA CCATTGAACG CACAGGTAGC ATCGTTTTAC AATCTCGTTT 840

CTTTGGCGAA ATTATTCGTA AATTACCAGA AGATATGTTC ACAATGGAAG TTCTAGATAA 900

CAATCAAGTT GCAATTACTT CAGGAAAAGC TGATTTTACG GTTAATGGAT TAGATGCTGA 960

TAATTATCCG CACTTACCAG TAATTGATAC TCAAAACCAA ATGAAATTAC CTGTTCATTT 1020

ATTGACAAAA ATTATTAGTG AAACAGGTTT TGCTGTATCG ATGCACGAAA GTCGTCCAAT 1080

TTTAACTGGG GTTCACTTTA TTTTAGAAAA TCAAAAATTA CTTGCCGTTG CGACAGATTC 1140

ACATCGTTTA AGTCAACGTG TGATCCCGAC AGAACAAGCA GTAGAAGACT TTAACATTGT 1200

AATTCCAGGA AAAAGTTTAA CTGAACTTTC TCGTTCATTA ACCAATGAAG AAGAAATGGT 1260

TGAAATCAGC ATTATGGAAA ATCAAGTGCT ATTTAAAACA GAAACAATGT ACTTCTATTC 1320

TCGTTTGTTA GAAGGAAATT ATCCTGATAC CAACCGTTTA ATTCCAACTA GCCATAACAC 1380

GCAAATTGAA TTTTATGTAC CAGAATTGCT TTCAGCAATC GAACGTGCCT CTTTACTTTC 1440

TCATGAAGGA CGTAACAATA TTGTTCGCCT TTCAATTTCA CCAGATTCTG TTGTTTTATA 1500

TGGAAATTCA CCTGAAATTG GAAAAGTCGA AGAAGCTTTA AACTATGAAA ATGTTTCTGG 1560

TGAAGCGTTG GATATTTCTT TCAACCCAGA TTATATGAAA GATGCGTTGC GAGCGTTTGG 1620

CGATATGAAT ATTACCGTGA AATTCCTTTC TCCAATTCGT CCGTTTACAT TGGAGCCAAC 1680

CGAAACAGAA CTAGATTTCA TTCAACTAAT TACACCGGTT CGTACAAATT AATCATGTTT 1740

AAAAAAGACC TTCGGAAAAT CAATATGATC GCCGAAGTTC TTTTTTGTCG TCTGTTTAAA 1800

GCCGATAAAC GCTGCTAGAT AAGCTTTTTA TAACTAGAAT AGCTTTTACG CTAATATGAT 1860

TGTGAATGTT TGAACTATCG TTTGACGAAT AAATTCTGCC TTTTTTTCGT GGATAAAGAA 1920

AAAAGCCCTT TAATTTTTCG TTTAAGGGCT GTTCATTTTT TTATGAATAT TTCCTAGTGA 1980

AAGAAGTTTA CGTTGCTTAA AACGACGCTC AGACCTCTAC AAGCTACTTT TCTTCATTGC 2040

GCGCCCTCTT CGTTTGTCTT TGAGTTACCC CTAATTGTTT TTTAGTTAAA CGCTACTCGT 2100

CATTTCTGCC AAATCAGGCC CCTTTTTTCT CAAGAAAATA TCAATTTCCG TTTTAAAATG 2160

CTTTTTAAGA GTTTTTTTAT CCTAATAATA TTTTCGGTCA TTTAGAATTA AAATTGCTTA 2220

GAATGAAAAT TAGACAGATA AATTTGTTTT CTCTTATGAA AAAGGGTATA ATATATATGA 2280

GAACGTATTA GAAAAAAGAG GGATGCACAT TGAAAAAAAC GTTTGTTTTA GCAACTGAGT 2340

ATATAACGCT TGGTCAGTTT CTTAAAGAAA TCAACGTCAT TGGTAGTGGC GGCCAAGCCA 2400

AATGGTATTT GGCAGATAAT AGCGTGTTTG TTGATGGCGA GTTGGAAAAT CGTCGCGGAC 2460

GGAAACTTTA TGCGGGCATG ATGATTGAGA TACCTGAAGA AGGTACTTTT TTTATGGTGA 2520

AAAACGGAAA CGAGTCCGAC GATGCGGCTG AATGAGCTGA CACTTCAGCA TTATCGTAAT 2580

TATGAAACAG TCAGCTTAGA TTTTCCAAAA ACGTTAAATC TTTTTTTAGG CGAAAACGCC 2640

CAAGGAAAAA CCAACTTGTT AGAAAGCATT TATGTTCTAG CAATGACACG AAGTCATCGA 2700

ACCAGCAATG AAAAAGAATT AATTGGCTGG GAACAAGCAG CAGCTAAAAT TAGCGGTGTG 2760

GTTGAAAAAA AGACAGGCAC CGTGCCGTTG GAAATTTTAA TTTCCAACAA AGGACGTAAA 2820

ACCAAGGTTA ATCACATTGA GCAAAAAAGG CTCAGTGCGT ACATTGGGCA ACTGAATGTT 2880

ATTTTATTTG CCCCAGAAGA TTTATCTTTG GTGAAAGGCT CACCTCAAGT CCGCCGGAAA 2940

TTTATTGACA TGGAACTCGG ACAAGTGAGT CCTATTTATC TTTATGATCT TGTCCAATAT 3000

CAGTCTGTTT TAAAGCAACG GAACCAATAC CTAAAACAAC TAGCTGAAAA GAAACAAACA 3060

GATACTGTTT ATCTGGATAT TTTGACAGAG CAGCTAGCTG AATTTGGCGG GAAAGTTCTA 3120

TACGCTCGGT TAGGTTTTCT TAAAAAATTA GAACACTGGG CAAACTTAGT CCATCAAAAA 3180

ATTAGTCATG GACGCGAAAC GCTCACCATT GACTACGCAT CAAGCATCCC AATTGACAAC 3240

ACGGATCTTT CCTTAGAAGC ACTTCAAAAT CAATTGCTCC AACAATTAAT GAACAATCGT 3300

AAGCGTGAAT TGTTTAAAGC GAATACTTTT TTAGGCCCTC ATCGTGACGA TTTGCTGTTT 3360

ATTGTCAATG GTCAAAATGT TCAAACGTAT GGCTCACAAG GCCAACAGCG AACGACAGCT 3420

TTAAGTATTA AATTGGCAGA GATTGATTTG ATGCATTCAG AAACAGGAGA ATATCCTGTG 3480

TTATTACTAG ATGATGTCAT GAGTGAATTA GATAATGAAC GTCAAATTCA CTTGTTAGAA 3540

ACGATTGAAG GCAAAGTTCA AACGTTCTTA ACGACAACTA GTTTAGATCA CATAAAAGAC 3600

AAGCTAACAG TTGAACCAGA TATTTTCTAT GTTCAACAAG GCAAGATAGA AAGGAATTCA 3660

GCGACATGAC AGAAGAAGAA AAAAACATGA GAG/VAC GTGC GCAAGAATAT GACGCTAGTC 3720

AGATTCAAGT TTTGGAGGGC TTAGAAGCTG TCCGGAAACG TCCTGGTATG TACATAGGTT 3780

CTACTAGTGG CGAAGGGTTA CACCACTTAG TATGGGAAAT CGTTGACAAC TCTATCGACG 3840

AAGCCTTAGC AGGATTTGCT AAATCAATTC AAGTAATTAT CGAACCAGAC GATAGTATTA 3900

CTGTTATCGA TGATGGTCGT GGAATTCCAG TCGGGATTCA AGCCAAAACA GGTCGCCCAG 3960

CTGTGGAAAC AGTGTTTACA GTGCTACATG CCGGCGGGAA ATTTGGCGGT GGCGGCTATA 4020

AAGTATCAGG TGGCTTGCAC GGTGTTGGTT CCTCTGTTGT TAATGCTTTA TCGACCAGTT 4080

TAGATGTTCG TGTTTATAAA GATGGCAAAG TGTATTACCA AGAATATCGT CGTGGTGCGG 4140

TTGTTGATGA CCTAAAAGTT ATCGAAGAAA CGGATCGTCA CGGAACCACT GTTCACTTTA 4200

TTCCAGATCC TGAAATTTTT ACAGAAACAA CCGTTTATGA TTTTGATAAA TTAGCAACAC 4260

GGGTTCGTGA ATTAGCCTTT TTAAATCGAG GCTTACACAT TTCAATTGAA GACCGTCGTG 4320

AAGGACAAGA AGATAAAAAA GAGTATCACT ATGAAGGCGG GATTAAGAGT TATGTTGAGC 4380

ATTTAAATGC CAATAAAGAC GTTATTTTCC CAGAGCCAAT CTTCATTGAA GGAGAGCAAC 4440

AAGATATTAC AGTGGAAGTG TCAATGCAGT ATACAGATGG CTACCACTCA AATATTTTAA 4500

GTTTTGCCAA CAATATTCAT ACTTATGAAG GCGGAACgCA TGaATCTGGT TTTAAAACTT 4560

CTTTAACACG TGTGATTAAT GACTATGCAC GTAAACAAAA ATTGATGAAA GAAAACGATG 4620

AAAAATTAAC TGGGGAAGAT GTTCGTGAAG GCTTAACCGC AGTGGTTTCC ATCAAACATC 4680

CCGATCCTCA ATTTGAAGGA CAAACGAAAA CGAAATTAGG GAACTCAGAA GTACGGACCG 4740

TTACGGATCG TCTATTCTCA GAATACTTTA CAAAATTTTT AATGGAAAAC CCAACAGTCG 4800


GCAAACAAAT TGTCGAAAAA GGCATGTTGG CTTCAAAAGC TCGTTTAGCC GCTAAAAGAG 4860

CCCGTGAAGT CACCCGTCGT AAAGGCGCTT TAGAAATTAG CAACTTGCCA GGGAAGCTGG 4920

CTGATTGCTC AAGTAAAGAT CCGGAAAAAT GCGAATTATT TATCGTCGAA GGAGACTCGG 4980

CCGGCGGTTC AGCAAAACAA GGACGTAGCC GTGAATTCCA AGCCATTTTG CCAATTCGTG 5040

GGAAAATCTT GAATGTTGAG AAAGCCAGCA TGGATAAAAT TTTAGCCAAT GAAGAAATTC 5100

GATCATTATT TACAGCTATG GGAACAGGCT TCGGCGAAGA TTTCGATGTT TCTAAAGCAC 5160

GCTATCACAA ACTAGTAATT ATGACCGATG CCGATGTCGA TGGTGCGCAT ATTCGAACGT 5220

TGTTATTAAC CTTGTTCTAT CGCTTCATGC GGCCAATTGT TGAAGCTGGT TATGTGTATA 5280

TTGCCCAACC ACCACTATAT GGTGTGAAAC AAGGGAAAAA CATCACCTAC GTTCAACCGG 5340

GCAAACATGC TGAAGAAGAA CTAGCAAAAG TGTTAGAAGA ATTACCTGCT TCACCAAAAC 5400

CAAGTGTTCA ACGTTACAAA GGGTTGGGTG AAATGGATGA TCACCAACTT TGGGAAACAA 5460

CAATGGACCC AGAAAAACGA TTAATGGCGC GAGTGAGTGT CGATGATGCC ATTGAAGCTG 5520

ACCAAATTTT TGAAATGCTA ATGGGCGATC GTGTGGAACC ACGTCGTGCG TTTATCGAAG 5580

AAAATGCCCA TTACGTGAAA AACTTGGATA TTTAATGTGT CGCAACGTTG TAAGAGACGC 5640

TAGAACATAT ATTTAGAGGG GAAAAAATTC ATGAGTGAAG AAATTAAAGA AAACATTCAA 5700

GACGTCAATC TGACCAGCGA AATGAAAGAA TCTTTCATTG ACTACGCAAT GAGTGTTATC 5760

GTAGCCCGCG CGTTACCTGA CGTTCGTGAC GGTTTAAAAC CTGTTCATCG CCGAATCTTA 5820

TATGGAATGA ACGAATTAGG GGTAACCCCT GATAAACCAC ACAAAAAATC AGCCCGGATT 5880

GTTGGGGATG TTATGGGTAA ATATCACCCC CATGGGGACA GTGCGATTTA CGAATCAATG 5940

GTGCGGATGG CACAACCTTT TAGTTATCGG GCTATGTTAG TTGACGGCCA CGGAAACTTC 6000

GGTTCCGTCG ATGGTGACGG CGCTGCCGCT ATGCGTTATA CCGAAGCACG TATGAGTAAA 6060

ATTGCTTTAG AAATGCTACG AGATATTAAC AAAAATACAG TCGATTTCCA AGGAAACTAT 6120

GATGATTCAG AACAAGAACC AGTAGTTTTA CCTGCTCGTT TTCCAAACTT ACTAGTTAAC 6180

GGAACAACGG GGATTGCGGT GGGGATGGCA ACAAATATTC CACCACACAA TTTAAGTGAA 6240

GTGATTGATG CAACAAGTTT ATTGATGGAC AATCCTGATG TAACGACGAA TGAATTGATG 6300

GAAGTGTTAC CTGGACCAGA TTTTCCAACA GGCGGTTTAG TGATGGGGAA ATCAGGGATT 6360

CGCCGAGCAT ATGAAACAGG GAAAGGTTCG ATTACTGTTC GTGCAAAAGT TGAATTGACT 6420

GAAATGCCGA ATGGAAAAGA ACGTATTTTA GTAACTGAAT TGCCTTATAT GGTGAATAAA 6480

GCCAAATTAA TCGAACGAAT TTCTGAATTA CACAGAGATA AACGAATTGA AGGAATTACT 6540

GATTTGCGGG ATGAATCTTC TCGTGAAGGC ATGCGGATTG TCATCGATGT TCGTCGAGAT 6600

GTGAGTGCCT CTGTTGTGTT AAACAACTTG TACAAAATGA CTGCCTTACA AACATCTTTT 6660

GGTTTTAACA TGTTAGCCAT CGAAAAAGGC GTACCGAAAA TTTTAAGCTT GAAACGTATT 6720

TTAGAAAACT ACGTCGAGCA CCAAAAAGAA GTCATTACTC GCCGGACGAT TTTTGACAAA 6780

AACAAAGCAG AAGCACGGGC GCATATTTTA GAAGGTCTAC GAATTGCCTT AGATCATATC 6840

GATGAAATCA TTGCCATTAT CCGTGGGTCA CAATCAGATG ACGAAGCAAA AGCAACGTTA 6900


ATTGAACGCT TTGAATITTC AGATCGTCAA GCGCAAGCGA TTTTAGATAT GCGTTTACGC 6960

CGTTTAACAG GCTTAGAACG CGACAAAATT GAAAATGAAT ATCAAGAGCT ATTGAAATTC 7020

ATTGCGGACT TAGAGGACAT TTTAGCCCGC CCAGAACGCG TCATTGAAAT CATTAAAACA 7080

GAGTTAAATG ACGTTCGCAC AAAATTTGGT GATGCACGAC GCACAGAATT ATTAGTAGGT 7140

GAAGTCCTAA GTCTTGAAGA TGAAGATTTA ATCGAAGAAG AAGAAGTCGT GATTACATTA 7200

ACCAATAACG GCTACATTAA GCGGATGGCA AACTCCGAAT TCCGGGCGCA ACGCCGCGGC 7260

GGACGTGGTG TCCAAGGCAT GGGCGTTCAT GATGATGATT TCGTGAAAAA CCTAGTTTCT 7320

TGTTCAACAC ATGACACGTT ATTATTCTTT ACGAACACTG GGAAAGTTTA CCGAGCAAAA 7380

GGTTACGAAA TCCCTGAATA CGGTAGAACA GCAAAAGGAA TTCCAGTGAT TAACTTACTG 7440

GGAATTGATT CTGCAGAAAA AATTCAAGCG ATTATTTCTG TTGAGGGCAA AGCGGAAGCA 7500

GGTAAATACT TGTTCTTCAC AACCTTAAAA GGAACCGTCA AACGGACAGC CGTAACAGCC 7560

TTTTCTAATA TCCGTAGTAA TGGATTAATC GCCATTAGCT TAAAAGAAGA TGATGAGTTA 7620

GTTAACGTAG TAACGACTAA TGGCAATCAG AAGATGATTA TCGGAACACA TGCAGGATAC 7680

TCTGTCACAT TTGATGAAAA TACTGTACGT GATATGGGCC GGACAGCATC AGGTGTTCGT 7740

GGAATCCGTC TCCGCGAAAA TGATTATGTG GTCGGCGCAG CGATTCTGGA TGAAAATAAA 7800

GAAGTCCTAG TCATTACTGA AAATGGTTAT GGTAAGCGTA CAAAAGCCTC TGAATATCCA 7860

GTTAAAGGAC GTGGCGGTAA AGGGATTAAG ACAGCAAATA TCACTGAGAA AAATGGTCCA 7920

TTAGCTGGTT TAACCACGGT CAATGGTGAT GAAGATATCT TATTGATTAC GAACAAAGGC 7980

GTCATTATCC GCTTTAACGT T 8001



(2) INFORMATION FOR SEQ ID NO: 623: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623: 

TTTATGATTT AACGATTATC GGCGGCGGTC CTGTTGGATT ATTTGCCGCT TTTTATGCGG 60 

GAATTCGTAA AGCGAAAACG AAAATCATTG ATAGCCTGCC GCAATTAGGT GGTCAGCTAA 120 

CAATGCTGTA TCCCGAAAAA TATATTTATG ATATTCCTGG ATTCCCTGCT ATTAAAGCCG 180

GTGAGTTAAT TGCCAATTTG GAAAAACAGA TGCAGCCATT TCAGCACGAT GTTTGCTTGG 240

AAGAAGAGGT CACACACCTT GCACAGGAAG CAGATGGACT TTTGCGCTTA GACACAACTA 300

AAGGCACACA TTATTCTAAA ACGGTTATCT TTGCGATTGG TAACGGCGCT TTCCAACCTC 360

GACGTTTAGC TATCGAAAtG TtGaAGCCTT TGaAGGTGAG TCcAATCCAT TATTATGTGA 420

CTGACATGAA AAAATTCGCT GGCAAAAAAG TTGCGATTGC TGGTGGCGGC GATTCCGCAA 480

TTGACTGGGC TTTAATGCTA GrAAACGTCG CAGAAGAAGT ATCXATTATC CATCGTCGCC 540

CACAATTTCG CGGCCACGAA CATAGCGTGG AACAACTGGA AAAATCTAGT GTTTCTATCA 600 

GAACTCCCTA TATCATtAGT GATATTTtAA AGGAAAACGA AACGTTCACA GGCATCCAAT 660 

TAACGGAGAC AAAAGGTGAC CAAACATTGG ATCTCCCTCT GGATGATTTA ATTATCAATT 720 

ATGGCTTCAC TTCTTCTTTA ACACACCTCA AAGAATGGGG ATTAGACGTT TCTAGAAATG 780 

CTATTAATGT TCATtCCGAT ATGTCAACTA ACATTCCTGG TGTATACGCT GTAGGGGATA 840 

TCTGTTCCTA TGAGGGAAAA GTGAAATTAA TCGCTACAGG CTTTGGTGAG GCACCTACTG 900 

CCGTAAATAA TGCTTTACAT TACTTACGGC CCGATGCTCG ACGTCAACCA GTTCATAGTA 960 

CAAGTTTATT TGAAAACGGC GTACCTAAAT AAGGTTAGCA CTGGACTTTA GCAGTTTTTA 1020 

AGCCAAGGAC AAAAATCTAA ATCGGATTTT TGTCCTTGGc TTTCTGCTTT ACTGAAAAGC 1080 

CTTTCTTTTT CTCTTTTAAA TTTATTTTtG ATACAATAAA AAGTGACTnG GAGGAAGAAA 1140

GATGAACCAA CCAGTTATTT ATTTAGGTGA AGTATTCAAC GAAGAACAAC TGGAACAAGT 1200 

GAAAGCTGTG GCACCCAATT ATCTAGTTAA AACATCTACC GATsATTTAT CATCTGCTGA 1260 

AGAAGAGGCC ATCGAAATTA TGTTAGGCTG GCATAAAGAA ATTgGTCCGC GTTTaTTAGC 1320 

ATCTGATACT AGTCACCTAA AAtGGATTCm GCTCATTTCT GCTGGTGCTG aTTATATGGA 1380 

TTTTGATmAA TTAAGaGAAA AAGGCATTCT GTTGTCTAAT GGTAGCGGAA TCCATAGTGT 1440

TTCTATCTCA GAACATGTTT TAGGCGTTTT ATTAGCACAT ACCCGCGGAC TGCAAGAAAG 1500 

TATCCAACAA CAAATGCaAC ACACATGGAA TCAAACAGCC CCTTCCTATC AACAACTTTC 1560 

TGGGCAAAAA ATGTTAATCG TAGGCACCGG GCAAATCGGA CAACAATTAG CTAAATTTGC 1620 

TAAAGGTTTA AATCTTCAAG TTTATGGCGT GAATACCTCT GGACATGTGA CGGAAGGCTT 1680 

CATTGAATGC TATTCACAGA AGAATATGAG TAAAATTATC CATGAAATGT CCATTGTTGT 1740 

GAACATTCTT CCTTTGACTG AAACAACAAA ACATTTATAC AATCAGGCAC TTTTTGAAAA 1800 

AATGGCTCCT GAAACAATTT TCGTAAATGT TGGTCGAGGC GCCTCAGTAG CAACGAACGA 1860

TTTGATAACT GCCTTAAACA ATAAAACCAT TGCTTTTGCT GCCTTAGATG TCTTTGAAGA 1920 

AGAACCTCTT CCCGAAGATA GCCCTTTATG GGAGATGCAG AATGTTTTAT TAACACCTCA 1980 

TATTTCTGGC ATGACACCGA AGTTCAAATC AAAATTACTG GCGATTTTTA TTCCAAACTT 2040

AAATcAATTT GTGACGGATC AGACCCTCGT CAAAAACCAA GTTTCCTTAA AAAAAGGCTA 2100 

TTAACTCAAA AAAGGCCTTC AACCATCTTT CAAGTAATTG TATGGTTGAA GGCTTTTTTT 2160 

GCTATTTTTT ACCAGATAAC CTTGAAGTTA TCATTTGTCG ACGCTTGAAT GATTGGATGT 2220 

TTTTTTAAAT ATTCTGCAAT CAGTTCCGTC ATATCAATTT GAATTTCTCG AATAATTTTT 2280 

TCAGGTTGGA ACATTTGATA GTTTCCGCCA CCGACTGCTC GGTATTGATT TGTCACGATT 2340 

TCCAAAACAT CTTCTGCTTG CACTGCTTTG CCATGATAAT TAAGTTTCGT CACTCTTTCA 2400

CCAAACGGTT GtCTTAAATC AAGCGTATAC TCAATGCCTT CcAwACcwwA yCATAAtnAT 2460

AGTAwTGTGG TTTTGGATCG ACATATTTCG GGTTGAAAAT TGGTTGGC 2508
(2) INFORMATION FOR SEQ ID NO: 624: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624:



TTTAGGTTGT CI 1 1 ILAlyL TATAGAATAC TTAGTAGAGG AGGAAGTTGA 60

TTAAATTATC AGAAAAAAAG TGTTTGGAAA AGTCGGATTG GTTGCTTAAA AATAGAAAGA 120

TATGGGGAAA AATATGATTA AAAAAATAAT AGTCGTTGTT GCTTTCATGC TTACAGGCTT 180

CTCGCTAACT GCGATGAGTG CATCTGCAGA AGAAATAACT GATTTATTTT TACAAAAAGA 240

AGTGACATAT TCTGGTGTAG AAGGAGGAAA AATTGGAGAG AATTGGAAAT ACCCTCAATT 300

TGTTGGCGAA AAAGCTGTCG ATGGCGATGA AACAACACGC TGGTCCGCTG ATAAgcAAGA 360

TGAACAATGG TTAATTGTTG ATTTGGGTGA AGTAAAAAAC ATTGGCGAAC TCGTTTTACA 420

GTTGCATGCA GAAAGTCCTG TGTATGAAAT CTTAGTTTCA ACAGATGGTG AAAGCTATCA 480

ATCTATCTTT AAAGAAGAGA ATGGGAAaGG TGGCCAACCT ACTAAAAAAT ATATTGATGG 540

CAaTAATGTm CAaGCACGTT TTGTAAAATA TCAGCAGATG AAAaTGTGsg CAACACACCA 600

AATAAGCAAT TTTACAGTTC AAGTATTATT TCGTTTGarG CATATGAnAA AAAACGACTG 660

GCCAGAAGCG ATTAAACTTT TTAACCGAGA ACCTGGACTA TTAGTGGAAA AAAGGAAAGC 720

CACCGCTAGC CTTTGGAAGT GTCCnCCAGC CGGGAGTAGG TTA 763



(2) INFORMATION FOR SEQ ID NO: 625:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625: 



AACCCACTCT GTGGTAATGC CAAAAATCCC AGAAATCGTT TCATTCTGGC CTGCGATGGA 60 

TGCAGTGATC AATGATACGT ACAAAGGCAA TATCAAACCT GCGGATTATC AAGCAAAATT 120 

AGATAAATTA GTCCAAGACA CATCAAAAGA AGCAAAAGAA TAAAAAGAGT AAAGAAATGG 180 

GaCGAGCTTC AATCTGCTGT CCCATTTCCT CTATCTTTTT TATTTAAGTA GAAAGTGAGT 240

GAACGAAGAT GTTCAAGAAA AAGAAAGCCC AGACGTCATT TCGAGAGGTC TTTAAAAAAG 300

GCGACTTCGC TACAAAACTA TCTTTTTTAG TTATGGGTGC TGCAAACTTC GCCAACAAGC 360

AGTGGTTAaA GGGaATCmTC tTTTTAACaG cTGrAATTGG CtTATCyaTT GGGGTAATCC 420

GTAATnGGTT CCnCGCCATT AAAnGA 446
(2) INFORMATION FOR SEQ ID NO: 626: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626: 

AATTTTGGTT TGTTGGTGTT GAAACCAAGC ACTTTGTTCA AAAGGTGCTG ACAATGTTTG 60 

TAAGTGATGC ACTTCGATCT GATTGGTTGC CTGcGGTTCT GTTGTGAAAT CATGGGCCCG 120 

AATACCTATA GAATGACAAG AAATAGGAAC TTCTTGTGTA ACCATTAGTG TTTGTTGCCA 180

GCCAATCACT TGGACTCTGT GCGCATCTAT GCGTTTGACA GGCCAAATAT TTTTACAACC 240 

AGTGAGTTTA GCTGCTTCAA TCGTTTGAGG TTGGTTAAAT AATTGAnTTG TTGCACCAAA 300 

AAGTGTTTGG TTTTTTGTGA TTATCACTAA ACTCTGACAT AACTTATACA GTTCATCTaA 360 

ACTATGACTC ACAATTAAGG CATGTTGATT TAAATTACTT AATCGCTGtT GAAGkTCaAT 420

TtGCaGCTCt TCCTTTAAGG GTGCATCTAA TGCGGAAAAT GGCTCATCTA ATAACAGGTA 480

AGCCGGCTGA GCCGCTAGCA TTCGCGCAAA AGCCACTCGC TGTCTTTGTC CTCCAGAAAG 540

TTGACTAGGA TATTGTTGTT GGACTTTAGT TAAGTGAAAA GATGCAAGCA ATTGAGTGAC 600

CAATTGTAAA TCTTTTGTCA CGCACGTCAG ATTTTCTACT ACCGACAAAT GAGGAAACAA 660 

CGCATACTGT TGAAACAAAA GACCAATTTT TCGTTGTTGC GGCGACAAAT TGATCTTTTT 720 

TTCAGTATCG AATAAGACTT GATCATTTAA AACAATCTGA CCTTTATCCG CAGTCTCAAC 780 

ACCCGCAATA CATTTTAACA ACATACTCTT GCCACAACCA GAAGCACCTA mAATTCCTGT 840 

TGTTTGAGkC TTTGTTTCAA ACTGAATTTG CAAAGTATGA CTTTTCAATT CTTTTTGAAT 900 

ATCAACAATT AATCTCATTC CGATCTTCTC CAAGCGCGCT GaCTAAAAAA TTCCGTAAAA 960 

AAtAAAATAA TCAGGCmAAG TAATAACATr ATTACAACAT ATTGATTGGC TAAACGCCAA 1020 

TCCCCACTTG CTACAGCAGA GTAGATAGCT AAAGGAAGCG TTCTTGTTTT ACCAGCAATA 1080 

TTCCCTGCTA ACATTGTCGT CGCTCCAAAT TCGCCTAACC CACGGGCAAA GGCTAACACG 1140 

CCACCGGsCA AGAGACCGTT CATCGACAAA GGTAACGCAA TTTTTCTAAA ATATCGACTT 1200 

TCCGAAAATC CGAGTGTCTG GGCCACAGCC AACAATTCAC TATCCATTTG TTCAAACGCC 1260 

GTTAAGGCTG AACGtACtTA AAGGAAATGA AACAGCAACG GCAGCAATAA CCGTAGCTGG 1320 

CCACGAAAAA ACmACTTGAa TGGCmAAAAA ATCmAGTwAA AACTGGCCCA CTGGTTGCTG 1380 

AACACCmAAA aTAGTTAAGr GaAAAAAACC AAAAACAGTC GGCGGTAAAA CTAATGGCAA 1440

CGTAAATAAA CTATTCAAAA GAATTTTTAA TTTACGAT 1478
(2) INFORMATION FOR SEQ ID NO: 627: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1661 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627: 



CCTCTTTCGT AGAACGTACA GGAATTTCTT TAAATATTAC CGATGTTGGT TGGGCACCTT 60

TGGCCACCAT CACTTGGGGA TCTCCTTACA CCCTTTACTT TCTTTTAATT ATGGTGATTG 120

TCAATGTCAT TATGTTAATT CTAAATAAAA CAAACACATT AGATGTGGAT ATTTTCGACA 180

TTTGGCATCT ATCGATTGTT GGTTTATTTG CTATTTTTTG TGGCGCAAAC TTAATAATTG 240

CAACAATCTT AGTTATTTTC ATAGGTGTTT TAAAAATTAT CAATTCTGAC TTAATGAAAC 300

CAACGTTTAA CGaTTTACTT AACGCCCCAG ATACCAATCC TATGACTACC ACACATATGA 360

ACTATATGAT GAACCCAATC ATTATGGTTT TTGATAAAAT TTTTGATAAG CTTTTTCCTT 420

GGTTAGACAA ATACGATTTT GACGCTGCCA AATTGAATAG TAAAATTGGC TTTTGGGGAT 480

CAAAATTTGC CATTGGGATT TACTTAGGGA TTTTTGTGGG TCTTTTAGCT GGCCAAACAC 540

CCACGCAAAT TTTTTCTCTA TCCTTTACGG CTGCAGTCTG TTTGGAACTA TTCTCTTTAA 600

TTGGAACTTG GTTTATCGCT GCAGTTGAGC CTTTATCCCA AGGGATCACT GATTTTGCTT 660

CGAATAAATT AAAAGGACGT ACACTTAATA TTGGTTTAGA TTGGCCTTTC TTAGCTGGTC 720

GTGCTGAAAT CtGGGCAGCA GCTAACGTAT TGGCTCCAAT TATGCTATTA GAAGCCATTG 780

TCCTACCTGG CAATAAACTT TTACCACTTG GCGGAATTAT TGCAATGGGT GTAACCCCTG 840

CTCTACTAGT AGTAACAAGA GGTAAACTCA TCCGAATGAT TATCATTGGA GCAATCGAAT 900

TACCTnTGkT CtTATGGTCT GGTACCCTAA TTGCACCGTT CGTTACCGAA ACCGCAAAAA 960

AAGTTGGCGC TTTCCCAGCG GGTCTAAATA ATAATACTTT AATTTCACAT ACTACCATGG 1020

AAGGGCCAAT GGAAAAATTC TTAGGATACT TAGTCGGCAA TGCATCACAA GGTCAACTCG 1080

AATTTATTTT ATACGCATGT CTGGCGCTAG TCGCCTATCT ACTGATTTTC ATTTGGTATG 1140

CAAAACAAAT GAAGAAGCGA AATATTATTT ATGCTCAAAA AGCTCAATAA GGGGArcTAA 1200

AAAGAAGAGG TGCATCTGCA CCTCTTCTTT TTAGTAGATC ACCCTGACAA CTTTTTCCAA 1260

TACTTTAGAT GTGTCAAATT TATCTTTATT TGTAACTACC GCTGTCACTT CTTCTAATTG 1320

ATAAAAAGAA TAAAAATCCT GTTTCTCTAC TTTAGAAGTA TCCATTAACA AATATTTTTC 1380

TATTGAGTTA TCTAATGCAA TTGCTTGTGT TTGTCCTTCT TCAATAGTGG CTGTCATAAT 1440

TTTATTATCT TTTAACGCAT TGCAACTAAA AAATGCTTTA TGGAAATGCA TTTCTTCTAA 1500

TGCTTTATTA GCGATTTCTC CAAAAAAAGA TTTAGTTAAA AGTCGCATTT CTCCACCAAC 1560

CAAATAAATG GTCAACGCAT TCTTTTTTTT ACTTAACGTT TCAAACACAG GCAAACAGTT 1620

TGTTACTACT CTTAAATGTT TAAATTCCaT TGcCTCGGCC A 1661



(2) INFORMATION FOR SEQ ID NO: 628:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2359 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628:



tpagttppag 


TTGGGTTTCG 


TTATGGCATC 


TTGAACAAAA 


GTAACACACC 


ACAAACTATC 


60 


w/\ 111 r\r\r\ X w 


PGAGAAATTA 


TCAAACTCAC 


ACGAACGTTG 


TCACGGATGG 


TTTCTATACA 


120 


CC T T TH AfZG G 
1 1 lunouo 


ATGATCGPAC 


AAAaGATAAC 


GGTTGGAAAC 


TAACTGCGCA 


GTTATCTGAT 


180 


TTTAGPGATG 
1 X IrtwOwrtiw 


AGACAAATTC 


AACGGTGTTA 


ACGAATTCTG 


GGATAGCTTT 


AAAAATGGAA 


240 


a atatgaaaa 


TTGAATCTAT 


AAAAAATCGA 


GATACTCCAC 


AAGAGAGTAT 


TGATCAAAAT 


300 




PAPPTTCAAP 


TGTTaGCACs 


AATGAAACGC 


TGATATCAGG 


ACAAGCAGCr 


360 




T vAATGPAPA 


GG CAAATGAr 


GGyCACGGTA 


CGTGGCAACT 


AAGGATTCCT 


420 


TTTGATAAAG 
111 ort l /"vruiw 


TGTPCTTGAC 


TGTACCAGCG 


AATACAGGGG 


AAATAAATAA 


AAATTATACA 


480 


HPPAPTTTGA 


PATGGTPGTT 

w/Ai.wwl^wl 1 


AGATGATACA 


CCATAGGAGG 


AGAGTTCTAT 


TGATACAAGT 


540 


ATnfSATTGGA 


TAPPTTTGTT 

1 r\\s O 1 1 1 w A 1 


TTCTTGCAGC 


AAGTTCTTTT 


ATAGGTTTTC 


TCTATTTCTT 


600 


TTTGAAAGAT 

111 w.ttrtrt w/T. i 


AGTCGATTGC 

rtw 1 ^»Uri J. J. VJV 


GTGTGGACAA 


CGGTAAAAAG 


TCAACGGAAT 


TACTAGTAGA 


660 


1 1 U 1 OU 1 1 1 V3 


ATGTTTATTT 

1 W 1 1 1 " AAA 


CTTTATTATC 


AATAGTTGCA 


TCTGTTCTAT 


TATATCTCAA 


! 720 


PTGGCAAGAA CAGTTACAGT ATTTTCTAAA CTGATAAAAA GAAGTAACTT CAATGACGAA 780

GGAAAGATGT GTCATTTGAA GATAGTCAAC TTACAATTAT CTAATGGCAA AAAAGCTATT 840

GCATCATGGT TTCAAATTAA TCAATTAAAA GCTCTTTAAG AATTATTGAT TTAAAGGAAT 900

ATACATCTTA AAGGGGCATA TCAAGTTGAT TATCGTATAG TGTTACTTCA TAAGAGAAAC 960

TAATATTGAT CTTAATACAG AATAGTCTTT GATATTCAAA TATTTAGTAA CTGTGTTGTA 1020

TTTATAGTAA ATACTCATTA GGAGCTGCAC CAGTTGCTTA ATTAGTTGGC AATGAGGAAC 1080

TATTAAGAAT TGTTTAACGA ATTTAACTTA GTTAAAAGGG GAAAATAGTT AATGAAAAAG 1140

CGCGTAGTTA TGACTATTGC GGGGATTATG GGAGGAACCT TTACCTCGAG TGCTGTTGCA 1200

TTAGCACAAA CAAAAACAAG TGTGGAAGAA AGGGGTCCAG TACAAGAGAG TACGATTTCA 1260

ATACCAGACA GTGAGATATT GGCGGAGGAA AAGGAGGTAG CAGCAACAAT AGAGTCTACT 1320

ACCACGACGT CCACCATAAA GACAACCAGC GATAGCCTCG AAAAAGCCCT TGAGGCAACT 1380

ACCTCAGAGA CTTCTTCGAC AGAATCGGGG AGCGCAAGTC AGTCAAGCGC TTCACAACAT 1440

AGGAAAGAAA TAGAAACAGA GGAAACTTCT CTAAAAGCAC AAACTACATC AACGGTTTTA 1500

ACAGGGATGA TAGGAACGAG TCAATGGAGT TTAGATCCTA CATCTGGAGA ATTAACTATA 1560

GGCCCAGGAG TTTTTAGTGA TGATGTATCA TCTGCGTTCG ACTCCATATC AAATGTTAAA 1620

AAGGTAACAT TTTCTGGACC GGTGAAAGCA GGAAAAAGTT TAAAAAGTTT ATTCAGTAGC 1680

TGGTTTGCGC TATTGTCCAT TGAACGTCTG GATTATCTGG ACACGAGTCA AGTGACAGAT 1740

ATGAGTTATA TGTTTACTAA TTGTACTGGA TTACAAACGT TAGATGTATC AAATTTTGAT 1800

ACGAGTCAAG TGACTAATAT GAGTATGATG TTTTATAATT GTCGTGGATT ACAAACATTA 1860

GATGTATCAA ATTTTGAAAC GAGTCAAGTG ACAGATATGA GTTATATGTT TGATGCTTGT 1920

AGCGGATTAC AAACATTAGA TGTATCAAAT TTTGAAACGA GTCAAGTGAC AGATATGAGT 1980

AGTATGTTTG ATGCTTGTAG CGGATTACAA ACGTTAGATG TGTCGAATTT TGATACGAGT 2040

CAAGTGACTA ATATGAGGGG TZXTf^TTTTTT PTTTftTACTG GATTACAAAC GTTAGATGTA 2100

TCAAATTTTG ACACCAGTCA AGTGACAGAT ATGAATAATA TGTTTGCTGG GTGTAAGGCT 2160

TTACAGACAC TAGATTTGTC TAATTTTAAT ACAAATCAAG TGACAGATAT GGCTGGGATG 2220

TTTTCGTCTT GTGATAAAAT CCAAACAGTA AATGTGTCAA ATTTTGACAC CAGTCAAGTG 2280

ACAAATATGg GGGTATGTtT TCTCATtGTt GGGAaTTACA AGCATTAGaT GTAcTAAGTT 2340

gtACATCCAA AGGAnGTTG 2359



(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1130 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629: 



CTCCACAAGA CTGACATTGG TACTGATGGA AAACTGCTTT TTTCCGTTCT ACTAAAGGCG 60

GTGCATAACG GGCACCGCCA GTCTTAGCCA ATAACTCTTT AAAGTCCTTA TCTTTATGCT 120

GATAGCCTTT GCCAGCTAAA TGTAAATGAT AGTGACATAA CTCATGCTTA ATGACTTTTT 180

CCAGTTCCTC TTGTCCATAC TTTAAAAAGA CAGTTGGATT AAAATCCAGG TGATGCGAAn 240

TAAATGATAG CGACCGCCCG TAGTTTTTAA CCGCCGATTA AATGTAGCTT GATGCAGAAA 300

AGGJcTTCTGA AAAAATACGA TTGAAATTTT TTCCACTAGC GTTTGAAGCG CTTGATCTGT 360

CATCAAGACT GGTTGAGACT CTTTCATTGT TACTCTTTTC TATCAGCCCG AGGCAACATG 420

GTTAAACTAA TACGCCCCTT TTTCAAGTCA ACATCTTCTA CCCAAACTGT GACTACATCC 480

CCCACCGCAA CAACATCCGT TGGGTGCTTA ACAAATTTAG TACTTAACTT AGAAATGTGG 540

ACTAAGCCAT CTTGCTTCAC ACCAATATCG ACAAACGCAC CAAAGTCAAT TACGTTTCGA 600

ACAGTCCCTT GCATTTCCAT TCCTGGTTTC AAATCTTCCA TCGTCAACAC GTCTTTACGT 660

AAAAGTGGTG CTGACATCTC ATCGCGCATA TCACGACCTG GTTGGATTAA ACTAGCAATA 720

ATATCCGTGA GTGTTTCTGA ACCTACTTCT AACTCAGTAG CTAATTGATT CACTGGkAAC 780

TGtTTCAATT TTTCGATAGC TTCTGCTGTC CCTAATTCTT TAGAAGCAAT CTCTGCTTTT 840

TCTAAAATTG CTTGTGCTAC TGGATAACTT TCTGGGTGAA TGCCAGTGTT ATCTAACACA 900

TTTTTGCCAT TAGGAATTCG TAAAAAGCCA ATTGCTGTTC AAATGCTTTG GnCCTAAACG 960

AGTACTTTTT GACTGATTCG TGCTGTAAGA CACCATTTCA CACGATAAGG ACAAGTTGCG 1020

CGTGGTTTAT TAAGCAGAAT AGTGGTAAAG TGAGACCGCG GATCACATAC ACCACTGGGT 1080

ACGAGTTCAC ACAAATCCAT GCCGCAGCGT TGCGAACGCA GCGAATGACG 1130
(2) INFORMATION FOR SEQ ID NO: 630: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2132 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630: 



CACTACAAAT GAAnAnGAAA GAAAAACAnG CCAATTAAAG AAAAACAAGA AGAATCTACA 60

AmAAATATCT ACGCAAAATC CAaGCGGATT ATCTACCAAG GAmACAAAAA TATGAAACCT 120

ATAACCGATT TTTTCAAGAG CGAAATAGTT TTACTAAATC AGATACAGAT GCCACATTTA 180

TGCGAATGAA AGATGATTAT ATGCGCAATA GTCAACTAAA GCCAGGCTAC AATTTACAAA 240

TCGCAACAGA AAATCAATAT GTCCTAGCGT ATGATCTGTT CCCAAATCCA ACAGATATAA 300

AAAACTTACA GACATTTTTA GACAGCTTTT TGGAACAACA CAACGAATTA CCCGAGTATA 360

TTGTAGCGGA TACCGGCTAT GGAAGTGAAG AAAATTATAC GTATATCAAC GATGTCTTAC 420

ATAAAACGCC ATTAATCACT TATGCTAGTT ATCGTAAAGA AAATAAAAAG ACCTATAGAA 480

ACAGTCCTTT TATGGTTGAT AACTGGCGCT ATTTGGAAGA AGATACATAT ATTTTTCCTA 540

CCCATCGGGC AGTTCCTTTC AAGAGATATA GTCGAGGAAA AGAGAAAGGC GGATTTGTAC 600

GTGACTTCAA AATTTATGAA TGTGaAAATT GTCGAGATTG TCCTGTTCGT AGTCAATGTA 660


CGAAaGCAAA 


AAGTGaGCAA 




iiii hkj i una 


cpi\f A/^TiTP BP 
1 1 0/iw 


VjL. InL 1 1 




GCAGAATGTA AAAAGAAGCT TTTAGAGGAA AAGACAGGTT CGATTTATCG GAAACGAAAA 780

ATCAATATCG AACCAGTGTT TGGCCATCTA AAGGCTCATT TGGTGTTCCA ACATTTCCAC 840

TTACGAGGGA AACAAGGAGC CGAGATTGAC ATTGGATTGG CACTCATGGA ATTAAATTTA 900

AGGAAATTAG GGAAATAATT TCTGAGTATT GTACCGGATT AGATGCCCAT CAAAATAATG 960

TTGTGTTTTG AAAGGTCCAT TTATTCAAAT AAATAGAATG TTGGAGTATA ACTATATTGC 1020

TTATGTAATA TAATTTATTT TTATTGTATA TTTGTTCCCT TTTCGTACTT TTCAACTTTT 1080

ACTACCrGrG TwTmCTAAAa TATGCTATAT TTTTACTACA GTAGTTATAA aTTTGAGaTA 1140

AAAAAGGAGT TTTTTTTTGG ATAAAGGAGT CGCTGATATT GTAAAAATTA AGCAAGTGTT 1200

GAAACAAGAA TCAGTAAAAA GCTTAGTAGA AGGAACTGGG TTAAGCAAGA GTACTATTTC 1260

AAGTTTAAAA TCAGGTACTC GAAAAGTAGA AAAACTAAAC CTTTCTGCAG CTATTAAACT 1320

GACAGAATAT TCAGATCAAG TTTTTAGACC TATTATTGAA ATCTGGGGAG AAAAaCCGAA 1380

AAAaTAACCa CAGTTATTTT ATGAAACTAA TAGGATTAAA AATAGAAGAA CTAGAAGTTT 1440

TTTAGATGAA TGCTTGTTTA GCTCTATAAT GGATAGAACT GAAAGATAAT ATTCTTCAGT 1500

CCTACCATTA TAGAGCTATA ATTTTATTCA ATTTCAAGAA GTCTTTTTTT TACCAATAAT 1560

TTAAGATATT TCATAAGATC TTCTTCAAAT TCTATAGATT CCCTATCTTG TTGGTACAAA 1620

GATAAATAAA GAAATATTTT TTTTCGATAG AATACGTTAA AATATGAATA GATATAGATA 1680

GTAATTATAT TATCTATAAA TAGTAGAGTA TAACGATCTT TTATTTTTGG ATTTTCTATA 1740

AATTTTAAGT AGTAAGAAAA TCTTTTTCGT TCAAACTTTT CTATAATCTC TAAATTTTTA 1800

ATTTGAACAG AATTAb 1 IbA a z\ t n 2i rzc zi t zx TAAAAATTTA ATAGTAATTG CTCCTTATCA 1860

GATTTTAGAC GTACTCTTTC AATTATATTC ATGATATATT CATCGATGGT AGAGCTTTTA 1920

TCAGCAATTT TTTCTAATTC AGAGTTTATT ATATCCAAAT TATACACAAT CACTGCCTCA 1980

TATAAATCAT GTTTTGTTTT AAAAAAGCTA TATACGGTAG TAGTGCTGGT TTTAGCTTCA 2040

TTAGCAATAT CTAAAAGTTT TGTTTTTyCA TAACCAAATT TAGaAAAgTG GTTCCATTGC 2100

GGGTAAATAT AATTTCTnCC nTTTTnAATA TC 2132



(2) INFORMATION FOR SEQ ID NO: 631: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631: 



nTnAGGTGGA AGGCCCAACG GATTACTGGT TAGCCCTGTA CATGGTTAAT ACTAATAGAT 60

ATGCTTGATT GTAAATTTGT AGTGAAGTAC AATTTATTTT AAGAGTATAT TTGTAATATT 120

CTGTATAAGA AATTATTTAA GGAAGGAACA AATTTAATGA AGAAGAAAAA AATGATGAAG 180

AACTTAAATT TTGTAATCTT ACCTTTGTTA TTAGGGGTTT GTATTTCATT AATATGGAAT 240

ATACCATTTT TTGCkGTTTC kGGrGCTATT TACTTTGTAC TTTTAGTTTT TTTGGTTCCG 300

TCAGTTGACT TTGCCTTTAC AGATTTCAAT ACTATACGAA TAAaTCCGCA TTACAGAGGA 360

AGAAGAAAGA TAATTTTATC TAATGATACA TTGACATTAG TATTAGTTTT AATAGCACTA 420

ATAGTATCTG TGGTACTATC ATATTTTTAT TATAGATAAG AAGTATAATG GTAAGAmGGG 480

ATGAAAAAAA TCAAATATAA AACCATAAGG ATAGATAAAT ATTTATArGG TCAAAAACTT 540

TTAAGTTTTT GACCTCTTTA TTTTTTCGTG CTATACTTGG AATTATCAAG TGTCAAAAAA 600

TTATGTTCAG AAAAGGTGAG GAAATTTTGG CCATTTATGG TTATTCAAGA GTGAGTACTA 660

GTAATCAAGA TTACAAAACC CAGATTCAAA AACTAGAAAA AGCCGGAGCA GAAAAAATAT 720

TCAGTGAAAA ATATACGGGA ATAAAAAAAG AAGGAAGAAA AGAATTAGAA GAACTACTAT 780

CTATTGTAAA ACCTGGCGAT AAAGTTTTAG TGACGAAAAT TGATCGGCTA GCTAGAAGTA 840

TCGTTGATTT AAATTCGATT ATCTCTTCTT TAAATAATTC AGGAGTGACT ATCraCCTTCT 900

TAGwTAaTGC CCTAACCTTT GAcCAAATAA AaGTGATtCT ATGCAAACGT TAATGATGAA 960

TATGCTGGGT AGCTTTGCGC AATTCGAAAG AGATTTAATC GTGACGAGAA CGCAAGAAGG 1020

GAAACAGTGG CATCGAGCGA ACAATAAAAA TTATCGAGAA GGTCGTCCGA AAAGAGTTCT 1080

AAATGATAAA TACAAACATG CACTAGAGTT AATGGAAACA AACmGCmTGC gAGaAGTGGg 1140

AAAGGcAAAC AGGTATTTCT TTATCTACTC TCAAACGAAT CaAGAAACag CCAAgGAAGA 1200

ACmGTTACTT AGTGrGrAAT AATTCCAGGA GTACTTATAA TGGAAAATAG AGAGAAAATT 1260

ATTCArTTGT TCAAgAATCC TTTAGTAACA GGTTATGGGA TTGAGATAAT GTCAAACGGG 1320

CGACTTTATT CAGCAAACTT CCAAAGATAT AAGAATCGAG yAAAGAAAGA AGAGAACCCr 1380

TTGATTATTT TTGAGAGCAT GACAGAAAAA GTTGAGCAGG TATTTTTAGA ATTAGCTGAA 1440

GAAGTCATAC GAACGAACCC TAAAACAAAA CAAGAATTCA AkGAAATGAT TArAGAATAT 1500

AGTTATAAGG AGrATArCAA ATGrTAGyTC GAAAAAmATA TGATCATTTT GGTATTGAAA 1560

TTGGTATGTG GAACCGAGAC AATGTAGTAA ATAAAATAGA GTGTGATTGT 1610
(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 626 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632: 



TCAACAAATT ATTGATTATT TACAAAATGA AATTGATTCA GGGCGATTGC CTGTTGGAGC 60

GCAAGTGCCT ACCGAAAAAG AATTATCGGC TCAGTTTAAC GTCAGTAGAA TTACGTCCAA 120

ACGAGCGCTA ACTGAGTTGG AAACGCAAGG CGTAATCCAA CGTCACCAAG GAAAAGGAAG 180

TTTTGTTCAA GCACCAAAGC ATCCTCTTCA TACCTCTTTA AATAAAGTTC TTTTTTTATT 240

GCCTTTTGCA GACGATTTAT CAGTAGGAAA TTTCTATAGC GGTTTGGCCC CTACTATTCA 300

AGCAGCTGGT TTAGAAGTCT TTATGACCTC GCCTAATTTT TTGAGAGAGA AAAATGCTGC 360

GGACATTGTG AATGAGTTTG CCGGGCTTGT CTATTATGCT GCTACTAGTA ATGACTATCT 420

CGACTTACTG TTCGAGTTGG CTTkGATGAA TTTTCCAGTC GTTGyCTtGG ATAAAAAGAT 480

CCATGATTTA CCTTTTGCAA CCATTCAATC AGACAACTTT GCTGGTGGTA AAACAGCAAC 540

CGAACGCTTA ATTGGCTTAG GACATACAAA AATTGGATAT ATCCTTTCAG GAAGTCCAGC 600

CCTACAATCC GTTCGGCAAC GGTATT 626



(2) INFORMATION FOR SEQ ID NO: 633: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5678 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633:

ACTCAGCACG TCAAAACATG CACAAATTTA GTGGCCAATG GTCTGTGTTT GCCAaTTCTA 60

ATTTTGTTTT AGCAGmGGGA aCCAAAGTGA ATaAAGGGGm CGACATTTCG AAaCTATTCA 120

CGATGACCTT TGAACAAGGG GTAGTTAAAA TCACGGCCAG TCAAGCGTTT TTGGATGCGA 180

TGAATCTAAA AGAAAACAAA CACGTTGCGC ACTCATGGAA AGCGTTCATT GGTGTAGAAC 240

GAATTGCGGC AGGAGACGTT TACAACACAA TCGAAGAATC TTTCAACAAT GAGAAGATTA 300

AAACTAATAC GGTAGTCACA CATACGCCAG AAAAACCACA AACACCACCA GAAAAAACAG 360

TGATTGTACC ACCAACACCA AAAACACCAC AAGCACCAGT AGAGCCATTA GTGGTAGAAA 420

AAGCAAGTGT GGCACCCGAA CTACCTCATA CAGGTGAAAA AGAAAACACC CTATTATCTG 480

TACTAGGTGC CGGAATGCTA GTTGGTCTGG CTTGGTTTGG TTTGAAAAAA CGTGAAGTGA 540

AATAATGAGT AAAAAGAAAA AGCACTGTGA AAAATCAGTG CTTTTTGTTT ATTTTAGAAA 600

AAGTGTGGTG TTGTCAAGTG TGGATAATAA TTTTTGCGTT AAGTTGTATC GTAGTATCTA 660

TAATAATTGG AGCATTTGTC AGTGCAACGG TCGATGAACG ATTGGGAGAC AATATTTTTT 720

TGCTACTTCT AGTTATTTTA TTTTTTTGGT TTATTTTGAT TTTTAATTGT TAAGTATATT 780

TTTTTATATG CTATTTTAAT ATATTTAATA AGACTAGGTG AAAGTATTAT TAATTCAGAG 840

ATAGCTGCTA TTGTAGCTCC GAAATAAATA TTGGTCATAC TTTTTGAAAT AAACGACATT 900

ATTATAGAAG GTAGCATTAT AAATATTGTT GGCAAAAAGG CAAAATAATA AGTTTCTTTG 960

GTGATAGATT CGTTAAATTC TTTGATTTTT TGAAAGAAAG ATAGTTGTTT ATGATAGAAT 1020

ATCTTAATTT TCTGTATCTC GTTTAAAGAA TAAATTAAGC GCACAAATTC TTTATTTAGG 1080

TAATAAGTTA GAAGAACGGT ATCTTTGTTA ATCATTGAAA CTAAGAAAAC TTTTGTATTA 1140

CATGGGATAT CATTTAAAGA AAGTTCTTGG GACAGTACGT AAAAGTTACT ATGCTTACTT 1200

CTTGAATTAG AAGGTATTAA GGTTCTATAA ATATCAGGGA GCTTTGCTAT TGAGAAAATA 1260

AATACAATAA AATAAATGAT AAAGTAAGTT AAGTAAGCTT TACCCTCTTT AGTAAGGTTA 1320

TTTATATCTG AAACAAAATC GGTGAATGCT GTGTCTGTAG TTATTAATGC ATAAGCAAAA 1380

ATACCAGATG AAAGTAGTAA AGAGGATATT AGATTAATAA AACCTTTAAA GATTTTATAT 1440

AATCCGATTT CAGTGGGAGA GGAAAAAAGG TTTTTTTTAA ATGAATCACT TATTACGTAT 1500

CCATAGATGG AGGAATAAAT TGTGAAAAAG CCTATAACTA TAGGTAAAAG TATACCCACT 1560

GGTTTTATTA ATTGGTTAAA AAAATCAATG TTCATATAAA TAATTACTCC TTATAAACTT 1620

TTTATAATAA TTATATCAAA AGGACGTGAA TATATGTTAA TAAAATTAGG TACAGAATTG 1680

ACGAAAGAAG AATATGTTAC AAGGTATATG CGAAACTTTC AGAAATTATT GTTGCTCGGC 1740

GATCGGCCAA AAGTATTAAC GAATCGTGAA GAACAACTCT TGCAGTATGA AAAAGAATTA 1800

TGTGTGCTGT TTTATGAACA ATTTATAAAG AAACATCATA GAGCGCCTGA TGAAGCCACG 1860

CTAGACGATC AAGTTAAAGC GAATTTCATA GAGCGCTCTA AAATTTTCGC TCGATCTCCC 1920

TTAGTAATGG ACGAAGGAAA TTTTACGCAA GCACATATTG GTCAACTAAA ACGGTTGCGT 1980

GAATTGCGCA TGGAAGACTA TTTGCCAGAC AATTATACGC ATATTT7ACA ACGAGAAGAA 2040

GAATTGGCAC GTAACTATTT TCGAAAACAT GATGATTATC CATTTGGGTA TGAGTGTTTA 2100 

TGTATTAGTC GGTCACGAGA AGTGGTGAAC CAAGGGCTAG AAAAGC7CTT AGAGGGGTTT 2160 

TATGACAGTT ATCAGGTTTA TTATCGTCGG TATCGTAAAA ATGGTTA3AA AT3GAAGTAG 2220 

GAATAAAAAG CGCAGGAGGT GTAGCGAGCA ATGGCAGAAG AATCCAAATA TCCGCCAGTG 2280 

AAAGAAAAAT CTGTGTTTTA TCAGCGTTAT AGTCGTGTTT CGACGATTTC TATC-TTGATG 2340 

GATTTAGGTT TTAGTGTGAG TGAAGCCCAG ATTGATAATG AAGATATTGT TTATATGGAA 2400 

CAGTATAAAG AGTTTTTATT AAATAAAGGA ATTGTGGAAG AAAAAGTTCC TTTTAAAC3A 2460

GAATTTGTAG CGGAGTATCC ATTAAGCAAA ATGCGACGAC TTTTTGTAGA AGGCTATTAT 2520 

TTACGAAAAC CGAGTGGTGA AAAAATGTAT GGTAGCTACT ATTACGAITT TTATTTTAAA 2580 

GAGTATTCAG GAATGGGGAT TTATTGTACG AAAGCAACAA AATCTCTTTT TTTACATGTT 2640 

TTTGAACAAA GAATTAGAAA AATAAAAAAA TTAAAACATA GATTGGAAGA AT3AAAATGA 2700 

AAAAAATGAT TATTATTGCC TTATTCAGTA CAAGCCTTTT AGCAGGC-GGA AGCAGTGTTT 2760

CTGCTTATGC GCAAGAATCA GAAGGAAATC TTGGTGAAAC AACAGGGAGT GTTTTACCAG 2820 

ATGAACCGAA TGTACCAACT GACCCAATAA CGCCAAGTGA GCCAGAC-CAA CCAACAGAGC 2880 

CAAGTACACC AGAGCAACCA TCGGAACCGT CAACACCAAC CGAACCTAGT GAGCCTTCAA 2940 

AACCGACGGA TCCTTCGTTA CCAGACGAAC CGAGCGTACC AACAGAGCCA ACAACGCCAA 3000 

GTAAGCCAGA GCAACCAACA GAGCCAACAA CGCCAAGTGT ACCAGAGCAA CCAACAGAGC 3060 

CAAGTGTACC AGAAAAACCA GTAGAACCAA ATAAACCAAC CGAGCCAGAA AAGCCTGTGC 3120 

CAGTTGTTCC TGAAAAACCA GTTGTACCAC AACAACCAGA GCAACCAACA GATGTGGTGG 3180 
TAAAGCCAAA TGGAGAAATT GCAACAGGAG AATCTACACA ACAGCCAACT GTTCCAATTG 3240 
AAACGAATAA CCTTTCAGAA GTAACACATG TCCCAACTGT GACGACACCG ATTGAAACAG 3300 
CAAGCGGAGA AGCAATTGTC GCAGTGGATA AGGGCGTTCC TTTAACACAA ACGGCTGATG 3360 
GATTAAAACC GATTAAAAGT GAATATAAAG TATTACCAAG TGGCAA73TA CAAGTGAAAA 3420 
GTGCTGACGG AAAAATGAAA GTACTTCCTT ACACTGGTGA AAAAATGGGC ATAATTGGGT 3480 
CAATCGCTGG TGTATGTTTG ACTGTTTTAT CAGGAATCTT AATTTATAAA AAACGTAAAG 3540 
TGTAGTGAGT GATGTATGAA ATTTTTAAAA AAAATTCAAA CAAAGGAAAA AAGCAAAAAG 3600 
AGCCCAAGAG TTCGTTTACT ATCACAAAAA AAAGCGAATC GATTAGTATT AATTGTGGGG 3660 
TGTGCTCTTT TTGGATTGTC ATTGATTGGT GCTATTCGAG CAAATGTGAT GGCAGGCAAC 3720 
GTGATCCGCC TCAGTCAACA ATTAGAAACA ATAAATAAAG ATGTCAAAAA AATAAAAGAA 3780 
AAAAAGAGAG TGTATGATTC GTCGGCACTC TCTTTTTATG TCAGAAATTT TGTC-AATGAG 3840 
TATATCAATT TTAATGGAAA AGCAACAGAT GAAGAGAAAA AAGCACGTGA TGAGAAATTA 3900 
GCGAGTTATT TTTCAGGAGA TTTAGCGTTA GATAAAGGAA CAGACAATGC TC-CAAAAGAC 3960 



TTAATTCGTA AATTATTACA AGTCTCTGTT ACTCGTGTAG AAGAATCTGA AGAGCTATTA 4020 
CTTGTGCATG CGAGTATATC TTATAAAGTG CAACAAAAAG GGCAGGATTT ACTAGATACA 4080

CAAGAAATTG TTTTACCTAT TCAAGAAAAA AATGGGTTGT TTTCTATTGT TGGTCGTCCC 4140 

TATTTTTTAG CAATGAATGT TCCGAAAGGA AAACAGGAAC CATTGCAATC AGTGAAAGAA 4200 

CCGCTTGATG TTGAAAAAAA AGAACGACAA GGGATTGAAA AATTTTTACA ATTATTTTTT 4260 

GAAAAATATG CGAAGGGGAA TACAGAAGAA TTGTCTGTTT TAATGAAAGA ACCAGTTAAA 4320

ACAAGTGGTC AAGCAACTGT GTCGTCTATT GAAGAAAAAG AACTACGTTT TTTTGAAACG 4380 

AAACAAAAAG GTGTTACAGG CGTTCAGGTG TCAGTGAACT TTAAAGACAA AGCAATAAAT 4440 

TTCCTTTACA CGGAAGATTT TTCGTTGTGG TTAACGAAAA CAGAAAATTC GTATTTTGTA 4500 

CAAGAGTTAA AACATTATTA TACAGAAAAG GCAGGAGATA AATAGTGGAT ACAAGTAAAT 4560 

TAAGTCAATT TATGAGTAGT GGATTGATTA CACTTATTGC GTTTGGTGTG ATCATTTTAA 4620

TTTTAAAGCA TTGGAAAGAA GCTGCGTGGT TAAAGATTGG TTCAGTTATT GTCATTGCAT 4680 

TGATTTTAAA TGACTTTGCG ACAAATAATG GGCAGAACAT TTTCAAAGTT GTGAAGTGGG 4740 

TCTTAGGATT ATTTGGGATT CAGCTGTAGA GGAGAGAAAT AATGAAACAT TTTGATTATT 4800 

CGAGGGGCTT AAAAGCCCCT TACTCATTAC AAGTTATTAA GTCCCCAAAA GGGAAAATTG 4860 

TTTGGTATTT TGCACAACCC TTAAGTTTAG CTTATTTAGT GATGTTATTT TTAGGGATTG 4920 

TACTAACAGG CATTTTTTGG AAGTTTGTAC CATTGCCGTT AATTTTTGGT ATTAATCTTA 4980

ACCTAATGAT TATGTTGTAT TTTCCAAATA AAGTGGCTCG ATGGTATACA GAAACAGAGT 5040 

TTGAAGGGAA AACAGGATTA GCTTTTTTAA AAGATGGTTT TGTTTATGTA AAAAATTATG 5100 

TGTTAGACAA TAGATCAATT ATATCCTTTG AACGAGTAAA GGAAATAGAA GAATTTTCAT 5160 

TTAAACGTTA GTAATAAGAA AGAGTGTATT ATGGTGAAAA CATGAAAAAA AGAAGTGTGT 5220 

TGTAATAACT TCTTTTTTCA TGTTTAAATT ATGAATTTCT ATATTTTAAA TAAAGCATAA 5280 

CTGCACAGAC TATTATTCCA ATAACTAAAA GTAGCACAGT AACAATACTT TCATTTTTAT 5340 

CTATTTTTCT TTCAGGTCTA TAGCTAGGAT TAATTTGTTT GGCGTTAAAA TCAGTAATTC 5400 

CAAAGTCTAA AGTAGGAAGT AAAAAGAGTA ATAAAATAAT ATATATAATT CCCGCTATAA 5460

GGAAAATAGG GGCGCTAGTG AAAAGTGAAA TAATGGTACC TATGATTATA GGAATTAAAA 5520 

TGTATAAAAA GGATTTGATA ACAGATTCTT TTCTCATTTT AATTACCAAA TTTTGTTATT 5580 

CATGTTTCCC ATATTTCCTA AAGGAACACA ATAACCACCG CTCATATTTT TCCCACCTGT 5640 

AGACCAGCCA CCTTTTTTAA AATAAAAACC TCCAGGAC 5678 
(2) INFORMATION FOR SEQ ID NO: 634: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634:

CCTTTTTATT GGTAACAAGG TACGTATGAT CGAAGGGTAA TTnCCACTTT ATACGGnCCA 60 

ATTTTTTCAT AACTTGCCGT TGAATTTCCA CACCCAGTTC AAATTCAGAT AATTTGGATT 120 

CTGTAGGCAA AnCCAAGTCG AACAAAATCT TCGGCTCTTC GCCCATGGGT AACTAAGCGG 180 

ATATCATGGA CTTTTAAAGA CGCGTCAAAA GAACGAATAA TCTTTTTGAT TGTTTGATGA 240

ACAAATTGTT GTGTTGGGTC ATACAAATTC ACTGGnATCA ATATGACAAA CAAGATTAAC 300

ATCCAACTCT TCTTTAAACT TCGCTTCAAT CGCATCGATT GTCTGATGGG CTTTATTTAA 360

ATCCCAGCGA TCATCAATTT CAATGTGAAC AGACGCAAAC GTTTGGCTGG GACCATAATT 420

ATGAATCAGT AAATCGTGAT ACCCTACAAT TGTTTCCATT TTTGATAAGA CATTCTTCAT 480

TTCATCAATT TCTGTTTGGC TCGGACGCAT GCCCATCAAC TCATTAATAA ATTCGCGAAT 540 

TAACTGCAAG CCACTATAAA TAATATATGC AGCAATTAAA AAACCAATAT ATCCATCAAT 600 

TCGCAAACCA GTGACGCCTT CAATAAATGC AGAAATCAAA ACAGCTAATG TAGTATAAAC 660 

ATCATTAAAG CTATCCTTAG CAGAAGCAAC TAATGCTTGA GAATCAATTT TTGCTGATAC 720 

TTTTTTGTAA AACAGCCCTT GCCAAATTTT AATCCCAATT GATAAAGCTA AAACAGCAAA 780 

TAAAATGGGT GtCAsCTTGA TTGATTCTGG ATGCAGGATG CGATCCACGG AGGTCGTTAG 840

AAACTCAAAC CCAATAAACG TAATCACTAA TGATACAAGC ATTCCACTAA TATATTCAAA 900

ACGTTCATGG CCATAGGGAT GTTCTTTATC CGCTGGCTTA CCAGCAATAT AAAAACCGAC 960 

TAATGTTAAA ACTGAAGAAA TAGTATCGGA CAAGCTATTA ATGGCATCTG CCATGATTGA 1020 

AACACTACCT GnAAGCAAGC CAATCATAAA TTTACTAACG GATAGAAT 1068 
(2) INFORMATION FOR SEQ ID NO: 635: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635: 

TCGGAACGGC AATTACATAC ATCTCTTTTT CATGAATTGT TACATCACTC ATGACTTTCA 60 

TTCCACTTAC AGAAATGATG CCACAAACAA CCACAAAAAC ACCGCCGATA ACTGGTGCCG 120 

GAATGGCCGA AATCAAGGTG GAAAGTTTTC CAGATAAGCC AAATAAGACA AACCACGCAC 180 

CTGCAGCCAC AAAAACCTTG CGACTAGCAA CTCCAGTGAT TGAAATAATC CCAGCATTTG 240 

TAGAGTAACT AGTTACTGGC GTGCCGCCAA TGAATGAAGA AACTAAGCAA CCTAAGCCTT 300 

CACCAATTAC GCCGCGATTA ATTTTTTCTT CGGTCAAAGG TTCAGCGACA ACATGACTCA 360

CAGCAAACCA AGTGcCTGTT GTTTCCGcCA tTAAAACCAG ATwAATaACT AACATCGTGG 420

CAATAGCTGA CCAATcAAAT TGGaAAGAAA AGTTAACTAA TGGTAACTGA GGCAAACTAA 480

ACCAAGCCGC TTTACTAACG GCTGATAAAT CAAGtACCCC TAATAATTGT GCCGCGATGC 540

TCCCAAATAA TAATGCCAAA ATGaCAGAAG AAACGCGGAA AATCCGCCCT TTTTTCAAAG 600 

CAGAACCTAA GAGCGCAAAG AAGATCAACG TACCCGCTGC TATAAATGCT AATAAAATAT 660 

TTTGTCCTAA TTCCCCTTTT GCGTGGTAAA TATTATCCgA TAAAGCGACT GGCATTAACG 720 

AAAGCCCCAC CACAAAAATA ATTGTTCCTC CCACAATTGG TGGAACAAAT TTAGTGATTA 780 

AACGATTAAA AATACCAGTA AATCCTAATA TAATAACAAC AATTGCCCCC ACTAAACTAG 840

CACCTAAAAt GCGcCCCAGC CATTTCCTTG tTGGTTATTG GcAAAATAAA TTCCTGcAAT 900 

TGCCCCGATT GGaATAAATG AAGGGcCTTG CGcAACTGGt AAtCnCCATA CCAAAAATGG 960 

GGAnTGGAAT TAATCGTGGG CAAGTnCCTG CTGGC 995 
(2) INFORMATION FOR SEQ ID NO: 636: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636: 

CGnCTATCCA AGATGGCTGG AAAAATCCnG CACGTTCCAA AATTCCATGG AAGGAATTAA 60 

AAnAAACCGG GAAAGnGGGC ATAAGTCGTT CCGACTTATG CCCCCCCCCT TCGTAATCTT 120 

AAGATTATCa GsTAAATCaA AGTGAGTTCT TTTTAAAGAA ATACTTTTAT TTTAGCCTTT 180 

TATAGATCTA AATAATTTTC TAAGTAATCT GCTAAACCAT CTTGATCATT TGTTTTTTCA 240 

GTGACATCGt TGGCTACGGA TTTAATTTTA TCCGTGGCAT TGTTCATAGC AACACCCCAA 300 

CCAGCATAAC TTAACATTTC TTCGTCATTA TGTTCATCGC CAAAAGCAAT GATATCAGCT 360 

GGTTTGACAG ATAAAAAGTT CGCCACTTGA TCAACGCCAT GGGCCTTTTG AATTCCTTTT 420

GCGACCATTT CTAAAATaGG CaTTGGACCG CCCCAAGTGC GGACATCAAT ATAATCACCA 480

TATtGTTTtG TCAATGTATC AGCAACTTTT TcAGCTTGGT TTGGTGTGGT CCGAACCATC 540

ATTGAGGTAG GATTTGTTCG TAAATTTTTA GCAGTAAGTA AATTATCTAA GGTTGCTTTG 600 

GAAGCAAAAA ATTTCGGATC GAACCCATCT AACGTATCGA TATAAAATGT TTCTTTGTTT 660 

TCAGCAGCAA CAAAATCTAA TTGTAATTCT TTTTTCTGTG CCAAAATGTC AAAGACTAGG 720 

TCACGCTCAA TATTGGcTTC GCTTTCCAAG TCCCATTTTT TCTCTGGAAT GTGGACTAAA 780 

GCACCGTTAA AGTTAACCAT AGGGGTCGTT AGGCCAAGTT GTTGGTAAAA TTGATGACTC 840

ATTCTATAGG GACGTCCAGT GGCAATACTG ACGTAATGAC CGTTGTCAAT CGCTTTTTTT 900 

AGTGTTtGTT CGGkTTTTGG aCTGATTAAA GACTGTGaTT aAtGtGTGcC GTCtAAAtCA 960 

ATAGCAATTA ATTTTTGCAA GTAAAAATCC TCCTAAAATG CCTTTnACTC AnTCTAGCAT 1020 

AAGACCGAAG ATTATCCn 1038 
(2) INFORMATION FOR SEQ ID NO: 637:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 551 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637:

AAATATACCA TAGGCCAGTT TTATGGTTTT TTTAAGTACC CATCCCAAGT TATTTGGGAG 60 

TAATTGTATT TATCCTTTAA TGCAATTGGG ATCCTACAGT AACTTCTAAT TCCCTTGGTC 120 

CTGTCTTTGa ATTGAAGAAA ACAAGCAATT CCTATAGAAA CTGCTGTAAA CCCTATAATT 180 

AACAATATAA CAGTCTTTTT ATTCATTCCT ACTTCACTCC TTTATCAAAG ATTCTTCCAG 240

TAATTCATAT GAAGATACTC CTTCTAATCG TTTATACACT TTTCCTTCTT TGATAATAAT 300 

TACAGTAGGA GTCTGTGTTA TATTTAATTG ATCTAAAAGT AAATTACTTT CTTTATTCTT 360 

CTCTTCTAgT CTACGCCATA TACTAATAAA TTTTTGTCTT TAATTACCTT ATTCAAGTCT 420

TTTTTTTGTA CATTACATGG TTGGCAACCT TTTTTAAAAA AATAGATATT CAaGTCTTTA 480

TTTTGAGAAA CCTTGTnCCT CAATACTCTC AGAAATCGGT nATTTTATAT TnCAATCTCT 540 

TCTTTATTTA A 551 
(2) INFORMATION FOR SEQ ID NO: 638: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638: 

nCTCTTAATG GTTTTACnTT TTCGTAAGTT TTCATACCTA TATCTGnCCT TATCTAATGT 60 

TTATTTTAGT TTAACATATA TTTAAAATAG ATTTTTAAAA AAGTTTATTT CAGATGTTGA 120 

CATGTTTGTT TTGGAATGAT ATACTCTGTG TATAAAGTCT GAAACAGATA ATAGACATTC 180 

TGTTTTGGAT AAACGAAATT AAATAAACGA AAAAGCGTTC ACGAGAAGAA AATGAATCTC 240 

CATTTCGTAA ACGCCGCATA ATAAAACCTA GACACTTTTA TTGTACGGCA TTTGACGAGA 300 

TAATACAAGA GTGAAGTTTT CATTCAAGGT TATCAGTCTT TTTGTGGTGT GGTATAGTCA 360 

GCCAATGAAT GACTTGCAAG GTGGCAGTAG TATCTTTAAT TAGAGCCATA TTGATGAAGA 420

TTTTAGGTGG AAGGTTTCTA GCTTTCATAC GAGGACTAGG TTCGCATACT GTGTATTTGT 480

AAAAGATTAA CAGTTAGAAC AATGGAGTAA TTTCCGCTTG GTGATTGAAG GGTGTTCATT 540

TAAAAGAATC AGATTATTTC CGATAACCAT GTCGTGATGA CACGTTAGAA ATAATTGCCG 600 

AATTAGATTG AATAAAAAAA TCtATGCAAG CTCAATAAAA GGAGGAGAGC TGTACACCGT 660 

ATGAGGTAAA TCTATTAATG TGCCGTAAtG TAATTTTTAT tACTAGCCGT TGAACgATAT 720 

yCyATGATTT ACTTTGAATG TAAGAATCCT AAATTTTAAT GCAATtAAGG TTGTTATTTA 780

CACGAGTGGA AAAGATTTTT TCTAAAAGAA TAACCAcCTT AACTGGAAAC GTTAAAAATA 840

GGAGCTAATT ACTTAATTCC tATTTATATA rATATTTTAG AAGTGAATAA TCTTCACTTT 900

TAAAACTTTT TCTAAAACTT TGTCGGTTGC ATATAGAAAT GCAATTGCAC AAAGTCCTAA 960

GAAAAGCAAT CTATTTACGA TTGGTCCGTA AATAAGAGTT GTCGCTAAAA CAGCAGCGAT 1020

AACAAAGATT ACTTCAAAAA GATATTTGAA GTTACGCATA GATACCACCT CCTTTTATTT 1080

AGTTATTTTA TTTTAACATA ATAAGGGAGG TTGGTACTTA TGATGTAGCG ATTGTAAGTG 1140

AAGCATAACT AGAATTCTCT GGTTGTGTTT TTTTGTGTTT AAAAAAGAAA AAATCGATAA 1200

ATAAATTACC AAGAGGAGGG AAGTCATGAT AGAAGAATTA TTACTTTTCG TTGAGAAGTT 1260

TGTTGAACGT ATGAAACGTC AGAAAAAAGC ATTCTCTATT TCAGATATAG AAAAGAGCTA 1320

TAACTTAGAA CGTAAGAAGT TAGGTAAGAG TGCAGTTAAG TTAACAAATA TGGACCGTCT 1380

AACAATTGAA AGTCAATTGT TAAAATATCA GATTTTACAG CGGACATATA AAATGACAGG 1440

TTATCATAAA CCTTGTCAAG TTGTGTTTTT TAGTTAGTAA TAAGAATTGC ATAAAGAGAG 1500

AAAGGCAGTT AGTTGTTGAC TGTtTATTTT TGtATAAAAA TTAAGTgAAA GAAGGAAATT 1560

ATTTgGCmAA TAGAATTCAG GAAATTGCAT TACCAGAAAA AATCTAGGAG TAAATTAAAG 1620

CTATTTGTAn AnCAGA 1636

(2) INFORMATION FOR SEQ ID NO: 639:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 474 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639:

TGGTATGATT ATGACAACTC CAGACCTTGG TGGAGCCACT GAATATTGGA ATATTTTTGG 60

ATACCATGTT GCACAAACAA ACTATGCCTA CCAAGTAATT CCGGTATTAG CTTCCGTATA 120

TCTTTTGTCA ATATTGGAAA AATATTTTCA TAAAAAACTT CCTTCATCTA TTGATTTTAC 180

ATTCACACCA TTATTATCCG TAATAATTAC TGGATTCTTA ACTTTCACAG TTATTGGTCC 240

CACAATGTTA TTACTATCTA ATGGAATCAC AGATGCTATT GTTTGGTTAT ATAATGCAAC 300

TGGGkTCATA GGAATGGGGA TCTTTGGTGG AACATATTCA CTCATTGTTA TGACTGGTCT 360

TCATCAGTCA TTTCCTGCAA TCGAAACACA GTTACTTTCA rCAtGGaCmA ATGGTATCGG 420

ACATGGGGGT TTwayCTTgT AGTTGCTTCC ATGGGCAAAT GTAGCnCAAG GTGG 474

(2) INFORMATION FOR SEQ ID NO: 640:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1094 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640:

TGTGTGACTT CTTTGCTTTA TGAACGTATC CgTCCAGAGT TTCATCtTGC ACGCTGGATA 60 

TATTACGAAA AAGCACGATA TGAGTTAAAA GGGGTAGAAC TTGAAAGTGC AAAAATATTT 120 

TTCAACGGGC TAAAGAACCT ATCCGAATCA GATAAAAAGA TTTTAATTGA TGTTTATTAT 180 

CGCTCAAAAG ACTATTATAA ATTTAACCGA CAAACAGGAT TATATCAGTC TGTCAGACCA 240

ATTTCCGACG ATGCAATAGC AGAACAATAC GGAATTACTA AAAAAGAAGT AACGAAAGTA 300 

AGACGGCAAG CGATAGATCA TTTAGCGGAA GAAATGCGGA AAATTATATT AGCTATAAGT 360 

ACAGCCTTTC ATTTGAAGAT AGGTAAAGAC TTATATCTAG TTAGGCTTAT AAACGAAGGA 420

ACATACAAAG AACAGTTTGT TCTTGGAAAT AAAAGAGAAG CTAAAGTGTT TAGCGCAGAA 480

AAGGAAGACA CAATTAGAAA GTTTATGCAG CTAGGTTTTG AAAGAGAGCC AGCATGaATA 540

AGGCTTAACC TTGAGAGGTA TTCTTATGAC AAAAAATAAG TTATTTCATT TAACAGATAA 600 

GTCCAATATC AATAGCATTA GAAAACACGG TTTAGTAGGA GTAAAAAAAG CAAAGGATCT 660 

TTTAAGAAGA GAGACAGGAA CAGACCATAC GCTCTTTAAT CAGCAAACCT ATTCTATTCT 720 

AAATACTTAC GGGTGGAAAG GCTACGATCT TAGATGTGCT ACTTTTATGT TTGAAGAAGA 780 

TAGTTGTCTT GGCTATGAAT TGCTGCAGCT AATGAATGAC GATCCATATA TTTTAGAAAT 840 

AGAGATAGAC CGTTTAAATA AAGATAAATT ATTTGTGTTT AATACGGAAA TTGCTAGCCA 900

TTTGCTAAAT TATTCGAAAC AAGATCAACA CAGGcTAGCT AAGTTTTATT GGAATACAGC 960 

AATTCCATAC AACACTTATG TACCAAAACC AAGGACAAAG TTAAATGGAC GCTTTTCAAT 1020 

TTAGGCAATT ATCCAGGTAA TCnAAGCAGG AATAAGGTnn AATTTCGGTG GAnATTTCCA 1080 

CCCAAAATTA TTAT 1094

(2) INFORMATION FOR SEQ ID NO: 641: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641: 

ACAACAAAAA AAGCATAGAA CAATCCCGTT AAACATGTTG CGTAACCATT GTACACCACA 60 

TTGTATTACG AAATTCTTCC ATGCCTAGTA TTATCTAGTC ACATGAAAAC GCTATTCATT 120 

TGTTAACAGT GCTTATTCTA ACGAGTATCA TTATTTTTGT CAACCTTTTT CAATTATTTT 180 

TTTATTAATT GTTGACTATT AACAAAAAAG AAGCTCTTAT CGGAAAAAAA GTACCTTCAG 240

CTAAAGGTAC TTTTGTTCAC TAATTTATTC TTTATAGGCA TACTTATATG TGCTCGTTGC 300

ACCGAAGTTA TGATACTGAA TACCTTTGAC ACCTGGTCGT ACAAGCTGAC TACGGGCCTC 360 

TTGGTATAAA GGAATTAAAG CGGCATCTTC TTCCACCAAT ACTTTTTCAG CTTTAATTAA 420

TGTTTGCCAA CGAGTTTCAG GCTCATTTGC GTATTTAGTT GTTGCTTCTT CAATTAGTTG 480

ATCAAACGTT GGGCTAGAAT AGTTCATACG GTTACCACTA ACTGAGCTTT CTAAGAAAGT 540

CATTGGATCC TGATAATCGG GTCCCCACAT GCCAACAGAC AATTCATAAT CCCCTTTTCT 600 

CATTAATTCA ATtGCATTAT TTAAAGGAcT GTTTTGACAT TAATAGTGAG ACCAGGCACG 660 

TTCTTGTACT GGCCTGGATA ATnCAnCACT TT 692 
(2) INFORMATION FOR SEQ ID NO: 642:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1485 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642: 

AGTACAACCA TATGGCAGAT GGCTGAAGGT ATTATTAATC ATTGTTAATC TTGGCGCTTT 60 

GTTTAGTATT CTGCGTCTAT TTATTCATTT GTTGATTTGA AGGGGTCGAA AGGTGAGAAA 120 

ATGAGAAAGT TAATTTTAGT TTTTGTGGGT TGTGCCATGG CCTTTTGTTT ATGGCCTAAT 180 

CAAGCGGCGG CTGTAGCTGG TGGCTATGGC GGCGGGCGTG TAGAAAGTCC TAGCGGAGGT 240

AATCCCTCGA ATGGTTTTTC GCCAAGCGGT GATGGATCAA GAAGTTATGT TCATCATAAC 300 

GAGACCGGGA CCAGCAACAC AACGCCACTG GAAGGCTTAA TTTCAACAGT GTTTTTTGTT 360 

CTTTTTGGGG GCACGATGAT TTGGCAAAGA CGCCGTAGAA ATCAACCAGT CGATGAAGGA 420 

GCTTTTCAAG AAATGTGGGC AkCATTACCT GGAACATCAA AAGAAAAAAA GGCACTCTTA 480 

AAAGAAATAC AACAAACGTT TTTAGAAATA CAACAAGCGT GGGATCAAGA AAACTTGTCT 540

GTGGTCAAAG ATAACTATAC AGAAAAACTT TATCAAAAAC ATCTTGCAGT ATTACAAGCA 600 

AATCAACAAG TGGGCATCAG AAATCATGTG AAAAAAGTGA AAGTAGCGCC AGCTTCGAAT 660 

TATCATCATA TTAGTGAAAC GAGTTTTTCA CTGATGCTTC ATTTkTCTTG TATTGACTAC 720

GAGGAAGATA CTCAAAGCGG TCGTGTTTTA TCGGGATACA AGCATCAAAA ACAATATTTT 780 

TCACAACTTT GGTATTTTGA TTACAATCCA TCTCTTGGAA AATGGCAAGC AGACTTCATT 840

CAACCTAAAG GATAGTGCTA AAAAGCGCTT GGGGTTTAAA TCCCAAGCGC TTTTTTATTC 900 

TTCGATGcGT TTTACAAGCT GTATTTTTTT ATTTAATACG TAAATCAGGA TACCGCCGAC 960 

GGTCATTGAA AGAAGTTCAC CAAGTCCAAC GGTCAACCAT GTGTACCAAA AAGGTAACTG 1020 

ATAAAAGAAC GTTAAGGGTC CGGCGACAGT GAACATGGAC AAAGCAAAAA gAATGGCCGT 1080 

CAGCATTAGT TTAATCAATA AATTTTGCTG ATTTTTTGTC GCAAAATAGG CAATCCATAA 1140

GACTGCGCAC GTTCCAGCAC TACCGATGAC TATATCGATG ATACCAAGAG GCGAAGCGGT 1200 

GTTGGCAATC GCAACCCCTA ACGTAATGGA CCAGATATAT CGCTTGTTAA AAACGCCTAA 1260

ATAATTAAAT AATTCGGCAA GACGAAGCTG TATCGGACCA AAACTAAGCG GAGCGAGAAG 1320 

TACAGTGACA ACGACATACA AAGCGGTAAC CAGCGCCATT TTAGCCAACG CCGTAGCTGA 1380

CCACGTACTG GTTTGTGGAT TTGATGAATG ATTCATTTAA GATTCTCCTT TATAAAACAA 1440

CGACGTGTTT TAGTGATAAC CATAACCAAA GGAAGGGCAG TAAAG 1485

(2) INFORMATION FOR SEQ ID NO: 643:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643:

AGAAATATCA TTTCAAGTTC TTTTTAAAAA TATTTTTAAA TGAAATTACA TTTCTTATAT 60

AATTTTTGTG CTATACTAAA AGAAAACGGT TTAGGAGTGA TTATGTGcGG GTTTTGCTGA 120

TTTATTTCAT AACTATTTTT CTATCCAATA CAGTTGGGGC GCTCTCTGGG ATGGGTGbAb 180

GCGTCATTAT TAAGCCTGTT TTGGATTTTC TTGGGTTCCA TTCGTTAAAT AGTATTGCGT 240

TCTATTCAAG TGTCGCTGTT TTTGTCATGT CCATTTCTTC GACTTATAAA CAGTATCAAA 300

ATGGTGTCCA AATTGAATGG AAAAAAGCCG CGAGTATTTC GTTTGGTTCC TTAGTCGGCG 360

GCATGTTAGG TGATTTACTT CTTAATCAGG CGATTGCACT AGCGCCAAAT GAAGAAAAAG 420

TACAACTAAT CCAATATATC ATCATGTTGC TGACCTTGGT TTTGGTTCTT CTCTACAATC 480

AATTTTCCAA CTGGCACTTA CACCTCAACG GATTGAGCAT TTTTCTGATT GTCGGATTAG 540

GCTTAGGGAT TCTATCAACC TTTTTAGGCA TCGGTGGCGG TCCCATTAAT GTAGCTTGCC 600

TTATTCTCTT TTTTGGAATG GATATCAAAT CAGCAACTGT CTATTCCATC ATTACTATTT 660

TTTTCTCCCA ACTCGCCAAA CTTGGCAATA TCGGCTTGAC CACAGGGTTT GCGGTTTTTG 720

ATTTGACAAT GCTTTGGGCT ATTATTCCTG CCGCTTTATT CGGTGGTTAT GTCGGAGGTC 780

TTTTCAGTAA AAAATTATCC CAACAACGGG TCGCCCAAAT TTATAGTTTA GTTGTTTTTT 840

TAGTTATTTT GTTAAATATG TATAATTTGT GGACTGTnAT CnAAAAAAGA AGCGCTTTGw 900

CGTATAGACA AAACGCTTCT TTTTTAATTA AATCACTGTn AATTGAACAT C 951



(2) INFORMATION FOR SEQ ID NO: 644: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 



TCCAGAGAGT TGCCCTCCAG CTGAGGCTAT CCCATTTGAT GGAGTTTTTT TCAGACTATG 60 

TAAGGGTGAA AACTTTTGCs AAGAAGATTT TCTTACTCAT TTTGAAGCTG GGAAAAGTTT 120 

CCCAGATAAA AAACTGTGTG AAGCTATGGC ATTATCATTT TTTGATAGCT ATGATCATGC 180 

AGAAAAATTA AAAAAAAGAT TTAGAAAAAA ATTnGCAACA TGTTCTATCA AACCTGTTGC 240

AATTATTGAA GATTATGGAG TAGGAATTTT TGAAGAGAAT TCAGGGCATC TTAATTTATG 300

GGAGCACAGG AACGTTGATA TTTTTGCTGA TTTAATGAAG GAGGATAAAG GAAAAGATGA 360 

ATAATCTAAT ATTTTCAGAC ATGGgAATAT ATTTTCGkTT TTGACTATTA TGaTACTCCt 420

TTATTTTTCa TTtCTCGAAG TCGGAAAAAT AATAATTACy AaTTAwTCTA CTGTATAAAT 480

GATGACGTTT ATTTTTACAG TGAGCTATCT GCTTCGGATA TTAATTACTT ATTTAGTAAT 540

CCAAATGGAT ACGACATTCT TCATTACCTT AAAAAGAAGT CTAAATTAAA TTTTGTTATG 600 

GTAAATAATA ATTCTTTTGA TATCTATAGC TTAGCGGAAT ATACAAA 647 
(2) INFORMATION FOR SEQ ID NO: 645:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645: 

GACGAATGGC CCAACAGGGC ACTTACAGAA ACAATATCTG AAGTACTCTC TAAGCAACGT 60 

GTCGCAAAAG TGTGTCGTAA TTGATGGAAG TGTACTTTGT CCAATTGGAC TTTATCGCGT 120 

AATTTGTGGA AGTAATACGT AAGTAAGCGA GGCTCACAAG GCTTATTATT ACGAGAAAAA 180 

ACAAATTTTC CTTGGCTGTG TTCTTTTTGA CACTTCAATG CATTTTTTAC TGTATTACTC 240 

ATAGGAATTA TTCGCTCAGA ATTTATCGTT TTTGTACAAT CGTAGTAAAG TTGAGTAGAT 300 

TCTCCGAACG AGCTTACAGA TACTCTTTGA AAAGTATGCT TTACATGAAT AGTATCCTTC 360 

TTAAAGTTGA TGTCCTCCCA TCTGAGCGCA GTAATTCTCC TATACGTAAG CCGGTATGTA 420 

AAGCGmCAAT AACTGaTAAT CCyTGTTGCT GAGGATTTTt ATAGGCAACA TTCtCCaATT 480

tCTTTkGcyC TTCAATATCT AGTGCACGAA TATGTCGACG CTTTGGTTTA GGAAGCGAAA 540 

TAGTTGAACA GGGATTGCTT TTCAAATGCT TTGCTTTTAC TGCATGATTC AAGGCTCTTG 600 

AAAGGATACC GAATATGACA CAAATCGTAC TTTTACTCAA TCCTTGTGCG ATCAAGTCCT 660 

CTACTAATTT ATTCAATCTC AAATCATCTA TTTCATTTAA TGCAGAGTCT CCCATTGTTT 720 

TTAAGACATA GTTGCTTAAC TTGTAACTAT AACTAGCGAA AGTAGAAAGT TTTACTTCCT 780 

TCTTTACGAT GACTAACCAC TGTTCTATCC ATTCATTAAG TGTAAAAGCG CCTTGACCTC 840 

GAGATTGATA GATAGTTTGT TGATTTGCTT TTGCCACATA GAGTTTTTGT CGTACCTCTT 900 

GATAAGTAGA ACCATAGATA TATCCATACT TTAATTTTCC ATCTATCTTC CTTCCTCGTG 960 

GGAAGCGACC TTCCCATCGT CCATCTTTTC TTTTATATAT ATTTTCGCCA CGTTTACTCA 1020 

TTTCAATTCC TCCGTTTAAC TACTATGTTC TGACGGCTAT ATTGACGGCT TATAGAAAGG 1080

CTTATTTATA GCTTTAAAGA ACATTAGTTA TAATATAAAT CGTTATTATT GCTAAAAAAT 1140 

ATCATTATTT TACTTTTTGT TGACACATAT TTTGGAAGTA GATTATAATT CTnCTGTACG 1200

GGGATGAATG ATGnCCTACA TTGGTCTATA TATGAnTCCC CACAGGGGTT TCAATAAC 1258

(2) INFORMATION FOR SEQ ID NO: 646:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 555 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646:

ACTGTAACAC TTGACGGCCA AGACCGCGAC CTCGTTGCGT TTTTGTCACA ACAAAGCCAT 60 

AAATCCCCCA TTCGTycTTC AAAATGATCC AAACGTAGCG TGGCTAAAAG TTGATTGTTT 120 

TCTTTATAAA TAAGTGTTCG TTGAAGGTCC TCCGGCAAAG GCGTACCTTC GAGTAACCTA 180 

GCCAAGGAAT CCAAATCAGC CACTTGTGGT CGGAGAAGTG TCAGCTTCGC CAACGCCTGT 240

GAGTAAGTTT CTCTTTGAAA AACCAGATAC TGTTCTGAAA AAGCTGGCAC AATTCCTTGT 300 

CTATTAAAAT AATTCAGGCC AACAGATTGC TTGTCGTCCA TCACGAGAAA GACTTCTTTC 360

ATTCCCCACA AAGCTGCTTG CTTTTCGAGT TCTTGAACAA GTTCTTCAAA AAATGGTCCT 420 

ACATTTGGCG CAACAATCGT CGCTTCCAAG GTTTCCCCAT CAAAACAATA CCCTACGATA 480

TATGCTTGAA TGGTTgTTCC TTCCTGtATA AACCATAATA AGTATAAGCT GATtCCTCAn 540 

GAAATGTTGT CCAAT 555 
(2) INFORMATION FOR SEQ ID NO: 647: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1030 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647: 



AAGGCAAGCG TCGATTTTCG ACAATATCCA ATAATGCATG GCTTTGTCCG TCCATCATGA 60

TAAAACTCAT TGCACTGGTC ACATGTCGAA CCGATTTGAA TTCATCAAAG CATAAGACCT 120

TTGGCAAAGT CTGTTTGAAA GGATTCAATG GGCGGTAAAA TCTCTTCATG ACACGATAGA 180

CCGAAGAAGT GGAGATATTT TTCTGACGGG CGATTTCCGT CATGGAAGTA TTCTCTTTTA 240

AGCGAGCAAT AACAGACCAA CGAACTTTTT GACTAATGAA ACAGTATTTT TCAGCGACAG 300

AGGTATCCGC TACGAATGTT CTTTGGCAAG ACTTACATTT AAATCGTTGT TTCTTCAACT 360

CCAGATAAGT GCGATACTCA TATATTCTTT TTCAATCAGC CAATCTTCAT CAAAAAATAA 420

ATGTTTATCT GTTAATCCGA GTAATTTTCT AGTATGATTA TCCATAGAGC GACTCCTTTG 480

AtaggGTGAT GTTTTGGtCG ACTAAATTTT ACCAAAAGTG AGTCTCTTTT TGAGTTGAAA 540

CGAAGaAAAA GGGTGTTGGa AAATCGCCTT CGATTTTCCA ACaACACAAA ATATTATAGA 600

ACCCTTTATT CTAAAGGaTT TCAAGGTGTC TGTTTGcaTT tATTAAAAAA AAgAAAAAGT 660

ATTGGCAACc AGAAGGCGTC ACCaACaCTA GATATTATAG AGCCcATAAT ATGCTGTATA 720

AtATCTAGTG TATAGGCTTA AGTGGAGGTA TATATGAAAA TATTnACTGT TGAAGATGAT 780

AATATGATAA GAGAGGGAAT AAGTGAATAT CTTTCAGAAT TTGGATATAC TGTTATTCAA 840

GCTAAAGATG GAAGAGAAGC CTTGTCAAAA TTTAATAGCG ATATAAATTT GGTTATCTTA 900

GACATTCAGA TACCTTTTAT AAATGGTTTA GAAGTGTTGA AAGAAATTAG AAAGAAAAGC 960

AATTTACCAA TTCTAATCTT GACTGCATTT AGTGATGAAG AATATAAAAT TGATGCATTT 1020

ACTAATTTAG 1030

(2) INFORMATION FOR SEQ ID NO: 648: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648:

AAATATTTCT TCTnTCTTTT ACATAATAAG TAGCTCCTTA GTTTGACAAT TGGTTGTACC 60

CAAGGGTAGG CTTTCTTTTT GCAAGCGGAG CAACATGGTA CCATGTTGCG ATATTTTGAA 120

TGACCCCTAC CAAAAATACT GGTAGGAGTC GTTCAAATTT TTGGGGTTCC CCCAATCCCC 180

ATCGCTACAC TTTGACATTT ATTGACGTTT CGACTTACCA TTTTTGACCT TTATTTTTTC 240

TTTCTTATTG ATAAGGAGCT AATAACTCAT CCAGAAAGAC TAAATACGCG TCATTTCCTT 300

TCAACTTTCG TAAATAAGTA GCTGCTTCTT CGTAAGGAAA TAAATCAATT TCTAATTCTT 360

TTTCTTTAAA ATCATGAATC AACGTATCCA GTTCTTTTAG CACCGTTCCT TTTGATTTTT 420

CTTTCTTTTT CTGGTGGACT CTTGGTTGTT CACTAAGCGG GTCTGCCAAG AAAAATTGTG 480

TCAGAATAAA TTTACTTTCG TCCGTTAATT GGTCCATTTT GTTTTCTCCT TTTCTCTTAT 540

TATTTGTTTT TAAAATCGAG TGGGATTtGC TATTTGAGCA GAAATTTTTG GTTCATTTTG 600

GTGCCTTTTT TGCTCGTTAG CTTGGGACTT TCCCACTTCA CTTTCTTTTT TCTnCGCATT 660

CAATAAACGT TCTTCAAATT TATTTTGGGG gAAGTtGAAT GCCcTTTTTC TATGGATAAA 720

GCATGGTCGG CTTTTTCAAT TAGCGTGCGC GCCcTCTTTC GTGATCGTTT CTAACGATtG 780

CGTAAaTtCT tCTAACTkTT CTCCTTGCTT CGTCCAAGAA GAAAGATAAC CAAAAGAATA 840

CGAGGACGTA TCCAGCCCTA AGTGACGGGA GACAATATAG GCAACTGATT CCGCTTCAAA 900

TTCTTGTCGA GAATAAACGT CATCTCCAAA TATCGCTTGC GAATCTGGGT GAAGCATGGC 960

ATGCGTCATT TCATGAAGTA AGACTTTTAA AGTCATCACT TCTCCTAAAT GTTGTTGGAG 1020

GAC 1023

(2) INFORMATION FOR SEQ ID NO: 649:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 829 base pairs

(B) TYPE: nucleic acid




(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649: 



TAT1 1 IU<jA1 




GCGGCGGCGG 


TGGCCGTTCT 


GTTGATCCAA 


ATGCACCAAG 


60 


AC AAb (j L. C 


PATTT APA AT 


ATACAATTCA 


ATTGAAATTT 


GAAGAAGcTA 


TTTTCGGTGT 


120 


/-i ^* Tl TV TV TV TA /^"* ^ 

C G AAAAACj Ab 


ill rlrlo I riv^ri 


ATPG AG AAG r 


TACTTGTGCm 


ACCTGTGGcG 


GTAATgGCGC 


180 


CAAACCAGGC 




A A APTTGTPA 


TAAATGTCAT 


GGTCTGGAAC 


AATCCATGTT 


240 


GACGTCAAAC 


f*CT* TS. T T A P f T 


PGTATGATGA 


GTCGTCAAAC 


TTGTGATGTG 


TGTCATGGCA 


300 


C AG G G AAAG A 


/\A 1 Lrinnuirtn 


CCATGTCCAA 


CTTGTCATGG 


CACAGGTCAT 


GAGAAAAAAG 


360 


CACATACTGT CAAAGTAAAT GTGCCTGCTG GAGTAGAAGA CGGTCAACAA ATGCGCTTAG 420

CTAACCAAGG AGAAGCGGGT ACCAATGGTG GACCTTATGG TGATTTATAT GTAGTCTTTC 480

GTGTTGAAGA TAGTGATATT TTCGATCGAG ATGGCGCAGA AATCTACTAT GATTTACCAG 540

TAAGTTTCGT CCAAGCGGCT CTAGGGGACG AAtAACTGTC CCAACTGTTC ATGGTGACGT 600

GAAATTAAAA ATTCCGGCTG GAACACAAAC TGGcACAAAC TTCCGCTTAA GAGGCAAAGG 660

TGCGccACGT TTaCGTGGTG GGGGCAATGG TGATCAACAC GTGAAAGTGa AATTAATAAC 720

ACCAAAAAAT CtTAATGaAG aACaAAAGGa TGCCstACgA gCnTTTTGCC AAAAAGCTGG 780

GCCGGGACCn AAAAATGGTT AACCTnGGAA CCCCAACCCA AGGGAAGGA 829



(2) INFORMATION FOR SEQ ID NO: 650: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650: 

TGACACTGTC GGTAATTTCT TAGAATCAAT GCCCAGCTTG GTTAnCTGAT GGTATGGCTA 60 

TCGGTGGTGG TATGGTCGTT GCCGTAGttA TGCATTAGTA ATCAACATGA TGGCAACAAA 120 

AGAAGTATGG CCATTCTTCA TTATCGGTTT CGTAGTCGCA GCGATTTCTC AATTAACATT 180 

AATCGCATTa GGTGCTTTAG GTGTCGCTTT AGCTCTTATJc TACTTGAACT TATCTAAAAT 240 

GGGCGGTTCT TCTAACTCAA ATGGCGGAGG CAACTCTGGC GACCCATTAG GCGATATTTT 300 

AAACGACTAC TAaTTCTGAA GGAGGAGCAA AGAAAATGGC AGAAAAAATT CAATTAACTA 360 

AAAAAGATCG TTTAGCTGTA GCTTGGCGTT CAACATTTAT CCaAGGTTCT TGGAACTATG 420 

AACGTATGCA AAACGGCGGC TGGGCATTCT CAATGATTCC AGCAATCAAA AAATTATATA 480 

AAACGAAAGA AGAACGTTCA GCGGCgTTaA AACGTCACTT AGAGTTCTTT AATACACACC 540

CATACATTGC CtCACCAATT CTTGGTGTAA CATTAGCTTT GGAAGAAGAA CGTGCCAATG 600 

GCGCACCAGT TGATGACGTA GCGATTCAAG GGGTTAAAGT TGGGATGATG GGTCCTTTAG 660

CCGGTGTCGG TGACCCAGTT TTCTGGTTTA CTGTTCGTCC AATGTTAGGC GCTTTAGGTG 720

CTTCCTTAGC AATGGGTGGT AACATCTTAG GCCCAATCTT ATTCTTCGTT GCTTGGAACT 780 

TAATTCGtTG GTCATTCATG TGGTATACAC AAGAGTTTGG TTACAAAGCA GGTTCTAAAA 840 

TTACTGATGA CtTTCAGGTG GTTTACTACA AGATATTACA AAAGGTGCTT CTATTTTAGG 900 

GATGTTCGTA CTGGCCGCAT TAGTTCAACG TTGGGTATCG ATTAAGTTCT TACCAATTGT 960 

CTCACAAGTT AAGTTAGACA AAGGTGCTTA TATCGAGTGG GATAAACTTC CTGCTGGCGG 1020 

TGAAGGAATG CATAAAGCCT TTGAACAAGT GAACCAAGGC TTGGCGCTTT CACCAACGAA 1080 

AGTTACTACT TTACAAGATA ACTTAGATCA ATTGATTCCA GGTCTAGCAC CATTACTACT 1140 

TACATTCTTA TGTATGTGGT TATTGAAGAA AAAAGTTAGC CCAATCGTTA TCATTCTTGG 1200 

TTTATTCGTA GTCGGCGTTG TCGGCCACTT AATCGGCTTA TTATAAGAAT GACGAGGTAA 1260

GTTTATTACT TCTAATCAAA AAAAGCGGAA GAGAAGTCGA GAAACTTTTC TTCCGCTTTT 1320 

TTAAACAAAG ATGTAATTTT GTTTGTTTCG ATCAGGAAAG AAACGTATAA TAAA 1374
(2) INFORMATION FOR SEQ ID NO: 651: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651: 

GGAAAAATTC CTGGCGCCTT CATTAAACGT GAAGGTCGTC CAAGTGAaCG TGCGACATTA 60 

mCCGCTCGTT TAATCGaCCG TCCTATTCGT CCAATGTTCT CAGAaGGCTt CCGTAmCGAA 120 

GTCCAAATCA CGAmCATCGT TATGAGTGTG GAGCAAGACT GTACACCAGA AATGGCTGCA 180 

ATGTTTGGGT CTTCCTTAGC GTTAgCAATT TCAGATATTC CGTTTGATGG ACCAATTGCT 240

GGTGTTGACG TTGGTCGTAT TAATGGCGAA TATGTTTTAA ATCCAACGGT AGAACAAGCG 300 

GAACAAACAG ATATTGAATT AACTGTTGCT GGAACAAAAG AAGCCATCAA TATGGTTGAA 360 

AGTGGTGCCA AAGAAGTATC AGAAGAAGAT ATGTTAGGTG CCTTACTTTT CGGTTTTGAT 420 

GCCATTAAAG AACTAGTGGC GTTCCAAGAA GAAATCGTTG CCGCTGTAGG TAAACCAAAG 480

ATGGACGTGG ACTTATTACA AGTGGACGCT GATCTTAAAA AAGAAATTTT TGATGCGTAC 540 

TACAATACAA TGAAAACAGC CGTAATGACG GAAGAAAAAT TAGCACGTGA AGTTGAAATT 600 

GACAAAGTAA AAGATACAGT CAAAGAAGTT TACGCTGAAA AATTTTCGGA ACACGAAGAA 660 

GAAGCACAAT TACTAAAAGA AGTGAAACAA ATTGCGGAAG ATCTTGAAAA AGATGTCGTA 720 

CGTGAGTTAA TCACGATCGA TAAGATTCGC CCGGATGGCC GTAAATTAGA CGAAATTCGT 780 

CATTTATCAT CAGAAGTAAG TATTTTACCA CGTGTGCATG GTTCTGGCTT ATTTACTCGG 840

GGTCAAACAC AAGCTTTATC CGTTTGTACA TTAGCGCCTT TAGGGGAACA TCAAATTATC 900

GATGGGTTGG


GTGTCCAAGA 


T AbbAAAbb 1 


T TP A TPrATP 
1 IbAlbbAlb 


A 1 1 A 1 AAb 1 1 


bbbbbAAl lb 


960 


TCTGTTGGTT 


CAACAGGACG. 


bbb.bbbb 1 bA 


tLftub 1 bb i b 


(""TP A A ATPPP 

b 1 bAAA 1 bbb 


1 bAbbb 1 o^-v? 


1020 


TTAGGTGAAC 


GTGCCTTAbb 


AbAAAl 1H1 1 


pp n fpr* a p a a p 

bbA I b AbAAb 


A. A. PBrTTl^rP 
AAb Ab 1 X bbb 


1 1 ttV^tt^rlrl 1 


1080 


CGCTTAGTAG 


CAGAAGTGTT 


7\ 7\ 7\ Tip A A A T 1 

AbAA i LmAA 1 


bb 1 lb 1 1 b 1 1 


pnpa a ppzi Zip 
bAbAAbb/\Mb 


Irtl 1 lbbbbo 


1140 


GGAACATTAG 


CATTAATGGA 


rp/' , pppp m PT , p 
Tbbbbblblb 


pp a 7\ T* f r a 7\ nr 
bbAAi i AAAb 


bbbbAb 1 1 bb 


Abb 1 A 1 1 bbM 


1 9 00 


ATGGGTCTTG 


TTAGTGATGG 


CGAAAAbTAT 


AbAAi b I I AA 


PAPA m AT"PPA 

b AbA I A 1 I b A 


JIPPTTT A P A A 

Abb ill AbAA 


1^ DU 


GACCACCTTG 


GCGATATGGA 


y«i rn m rn tv tv tv ^ ro TV 

CTTTAAAGTA 


GCAGGAACGA 


AAG AC G G AAT 


CACTGCTT1 A 


1 -J T A 


CAAATGGACA 


TCAAAATCCA 


AGGAATTACA 


-TV T\ tv tv ta m rn rH 

GAACAAATTT 


TAACGGAAGb 


a m m A /"» A /** r** A A 

ATTAbAbbAA 


i ion 
1 JoU 


GCGAAGAAAG 


CACGGATGGA 


AATTCTAGAA 


TV TA m fY* TV TV TV 7\ 

GAATTAACAA 


CAACGATTGC 


TGb rCCTbbl 


i a a r\ 
LhhKj 


GAAGAATTAA 


GTCAATATGC 


ACCGAAAATT 


tv tv t\ m t\ m m 

GAAATGATTC 


tv tv tv nr m ta tv tv /™* 

AAAT T AAAC b 


a C^t^ A A A A r P r P 

AbbbAAAAl 1 


1DUU 


AAAGATGTCA 


TTGGTAAAGG 


TGGCGAAACA 


j\ m /""^ tv TV f~* TV TV 

ATCAACAGCA 


TTATTGA TbA 


AAbbbbbb 1 1 


1 ^o 


AAGATTGATA 


TCGATCAAGA 


CGGTAACGTA 


AGTATTGCTT 


/~i a rn /-» ti at/" 1 

CATbAbArGb 


IbAAAibAl 1 




AAAAAAGCCA 


TCAAGATTAT 


TGAAGAATTA 


TV TV TV TV /"* TV TA /"* 

ACCAAAGAAG 


TTGAAGTCGG 


TOA urTTTTlP 

TbAAbI 1 1 Ab 


1 DO U 


TTAGCAAAAG 


TTGTCCGCAT 


TGAAAAATTT 


GGTGCGTTTG 


TTAAbbTAAT 


TA A 7\PPP7\ A A 

TAAAbbbAAA 


i t 4 n 

1 / 4U 


GATGGCTTGA 


TCCATATTTC 


TCAATTAGCA 


AACGAACGTG 


m y"^* TV TV TA TV fT* 

TGAACAACGT 


TGAAGATb 1 b 


i q n n 


GTGAAATTAG 


GCGACGAAGT 


CCTTGTCAAA 


GTAACTGAAA 


TTGACAAACA 


AGGbbG Tb 1 b 


looU 


AACGTGTCAA 


GAAAAGCGTT 


ATTAAACGAA 


GAAAACAAAG 


TV TV TV TV TV fTl TV TV fTl rfl 

AAAAATAATT 


TCTTTTTAAA 


i y ^ u 


CCAACGAAAA 


CCAAGGAAGC 


TTCCTTTCTT 


TTCGTTGGTT 


TTTTCGTTGA 


A A A r* A APPP^ 

AAAb AAb bbb 


iyou 


TAAAAAAGAA 


ATGATAAAAA 


AAGCTAAAGA 


j*-* TV m /"* ^> TV TV 

CGATCGCCAA 


CTCTTTTGTT 


1 A 1 Ab 1 b AA 1 


9 n 4 0 


TAGTGGTAAA 


ATTAATACTA 


TCAGTCTGAA 


GAATATCTGG 


GGTGTCTTTG 


lbAAAAArtl 1 


91 00 


TAATCCAAAT 


AAAAATATAA 


TCATTACCTT 


AATTTTAGTb 


TV fTl fT> tv rn rfvfp 7\ 

AI 1 Al 1 I I Ab 


lAAbbMl inl 


91 60 


TAGTTTAACT 


GctGCGCAAC 


GTTCTGAAAA 


AGGCAAAAbb 


AAlblbbI lb 


AAbb 1 bbbb I 


9990 


CAATAATAGC 


GTTAGTATGG 


TGGATCGTGT 


GATTTCTTTT 


bbAbb ibb lb 


mppm^i" a A A A 


2280 


CGGCTTGTCT 


TCAATGGGCA 


ATTTAATTAA 


CAbb 111 AAA 


b AAAAb b AAb 




9?40 


AAAAATTGAT 


AGCTATAATG 


tv tv m m ta f** m /^rri 

AATTAGCTGT 


bbAAAAb AA I 


A A A m ft A A P 

AAb 1 A 1 AAAb 


CYC li Q Zi T Tit Zi 
b 1 b AAtt. 1 1 on 


9 4 00 


TAGTTTAAAA 


CAAGAATTGA 


ATCTGAACGA 


AACGTTAGCC 


A A r~"Ti A T 1 /~ ' AAA 

AAb 1 A 1 bAAA 


AAb 1 AAb 1 bb 


94 60 


GAATGTCATT 


ACGCGTTCGC 


CGGATACTTG 


/"* tv Tv /~* tv <irnfn tv 

GCAAGACTTA 


TTAAT1 Al bb 


a t a zi d. rr a hp 
A 1 AAAb b AAb 


9 ci 9fl 


CAATGACGGC 


ATTGAAGCGG 


GCATGGCGGT 


CATGGCGbAA 


TiTinrppf Tar 
AAAbbbu lAb 


L bbbbbb 1 1 


9 ^RO 


TATTGAAGTC AATGCAACGA CTGCAAAAGT GGAACTACTA ACGTCTAAAA ATGTTAACTC 2640

GAACCATTTC CCTGTGCGAG TAACTTCCGC AAATGGCGAA TCGTTTGGTT TACTTAAAAA 2700

TTACGACAAT AAGACGAATG CATTAATTGT GTCACAATTA ACTGGCGATG CCACTCTGAA 2760

AGAAGGCGAT GTCGTTCAGA CTTCTGGCTT AGGTGGTAAT TCCCCAGCCG ATTTAGCGGT 2820

AGGGACAGTT ATTAAAGTGA AACCAGATAG TTTTGGTTTA GATCGTGAAG TGTATGTGAA 2880

ACCTTATGCG GATGTCTATG GTATCTCCGT TGTGACGGTC GTGAAAAGAT CGGCAGGGGA 2940

AAGCGAATGA TTCGGAAAGA GTATTTAAAA TACGTCACAC CTATT£TCT:r_ GTTTTTCTTA 3000

ATGTTAATTG ATGCTCATTT AACAAAGTCT TTTGAAACGT GGACAAGCAA TGTTTACTTC 3060

GCCAGTTCTC ATCTGTTATT ACTTGCGTTT ATGTTTGCCG TTCCTAATTT TTCTAAACGT 3120

TATTTATTAA TTACTTCGTT AATTTTAGGC TTTATTTGCG ATAGCTACTA CATTGGTATT 3180

ATCGGGATTT ATACAGTTGC ACTGACGATT TCGGTTATGT TGATGTATAC ATTTAAAGAA 3240

GTAATTCAAA CAAATTTACT AACTGGTTTT TTTGGGATCG TCATTTTTAC CACGTTGTAT 3300

GAACTGATTG CGGTACTGAT TCAAGTGGCC TTTCATTTAG CAGATGTCAG CCCGATTTTA 3360

TTTATTACGC GTGTGTTGGG CCCTACATTG CTATTGAACA TGGTGTGGTA TGTTATTTTA 3420

TCTTATCCAC TGAAAAAACT GTTTTCCGTA AAATAAAAAT TGAAAATTTT GTTCGTTTGC 3480

GAAAAATGCT CATTTGTTAG CATTTTTCGC TtTTTTTATT TTCTTTAAAT AACAAAACAA 3540

TGAGAAGCTT ACCCTGGAGA CTTGAAGAAA GTTGTGAAAA AAGTGAAGCT TTTTTTTGTC 3600

ATTTACAAGA AGAAAATTGA AATATTACTG TAACATCCAT ATTATCTCAG TGACATACAG 3660

CTATGCTACA ATTGCTCTAT CGTTGAAAAA GAGAAATTTA TTCAAATTTT AATAAATAAT 3720

AAAAAAGCTT AACAAAAAAG GATTTTCGGA GGACAACAGA TTGAAGAAAA GTGTATTATC 3780

GGCTCTAATG GTATGTTCCA TTACATTAAC AAGCGTAscG TTGCCATCCG CAGCATTTGC 3840

AGATGAATAC GATACAAAGA TTCAACAACA AGATCAAAAA ATTAATGCGT TAACTAGCCA 3900

AATGTCAGAT GCAGAAGCAA AAGTTGCCGC GATTGAAAAT GATATGGTTG AAACGGCCAA 3960

ACAAATCGAT ACATTAACAG CTAAAAAGAA CAAGCTATCA TCAGAAGTAT CTAAATTATA 4020

TAGTGAAATT TCTGATTTGA ATGTCCGTAT TCAAAAACGT GAAGTACAAA TGACAAAACA 4080

AGCACGCGAT GTCCAAGTGA ATGGTCAAAG TGATTCAATT ATTGATGCTG TCTTAGATGC 4140

AGATTCAGTA GCAGATGCAA TTGGTCGCGT TCAAGCGGTC TCAACAATGA TGAGCGCCAA 4200

TAATGAATTA CTAGAACAAC AAAAAGAAGA CAAAGCGACT GTTGAAAAGA AAACAAAGAA 4260

TGTTGAAAAA CAAATTGCTG AATTAGAAGC AGCAACAAAA GAATTAAATG ATAAAACAGA 4320

ATCATTAAAA ACATTGAAGA TTCAACAAGA AGTGGCTAAA AATGATTTAG AAGCACAACG 4380

TTCTGAAGAA CAAGGGAAAA AAGACGGCTT CATTAAACAG AAAAAAGAAG CGGAAAAACG 4440

TTTAGCAGAA GAACAAGCAC GTCAACGTGC AGCTGCTAAA AAAGCAGAAG AACAAGCGGC 4500

AGCGCaAGCA CAAGCCGCAG CACAAAAAGC GGCAGCAGAG CAAGCGAAAG CAACAAAAGC 4560

AGCCAATGAG GCAGCAGCAT CCGCCGCTGA AGAAAAAGCC GCAACACCAG TAGTAGAATC 4620

ATCAACAACT ACTGAAAGTA CCACAACGCA AGAAACAACA ACTTCAAGTA CTGAAACAGA 4680

AAGTGTTGTA ACAACACCTG TGGCAGCAGC ACCTGAAAAA GAAAAAGAAG TTCCTGTAAC 4740

TAACCCAACG ACTCCTGAAA AAGGCAATGA AGCAAAACCA GGGAATGGTG GCGTGACTTC 4800

AGGAAAACAA GCCGCAATTA ATGCAGCTTT AGCAGATGTT GGTAATTCTT ATGCAACAGG 4860






TTGGAATCAA CCAGGCGAAT GTTTAGTATC TGTTCGTCGC TGGTTAGCGG CAGGCGGTAT 4920

TAACTTCGGT TATGGTGGTC CAAACAGTGG CTATGTTGCT TCTGGCGCAA CACAAGTTAG 4980

CTGGAGTAAC GTGCAACCAG GCGATGTTGT TCAATATGAA AGTGCCTATA GTCCAGATAG 5040

CTGGATTGGT GGCGTGCATA CTGTCTTGGT TACAGGTGTA AGTGGTGGAA GTGTTCAAAT 5100

CGTTGAAGCG AACAACCCAG GAGGTTCTGG TTATGTTTCT TCTAACTCTA ACTGGTCACC 5160

AGCACCACCA GCAGGATTCC GCGCTGTTGT ATGGCGTTTT CCAGGCTAAA TAGTCATCTA 5220

TTAATAATTG ATAACAAGGA GTGGTAACTA ACAAAAATCG TTAGTTGCCG CTTTTTTTAT 5280

TCATTTAATT TTAAATATAT GCATAAAATG TATCTGAAAA TATTGTAATC TGGCCGTTTT 5340

TTTGCAATTA TAGAGGTATA TACATATTAG GAGCGGAGGA ATCAGCATGA GCGGAACGCA 5400

CGATACAAAA CAAGCAATCG CTTCAGCTTT AATAGAATTA TGTGAACAAA AGGATTTTCG 5460

TAAAATAAGT GTCCAAGATA TTACCAAAAA AGTGGGCTTG AATCGTCAAA CATTTTATTA 5520

TCACTTTACA GATAAATATG ATTTATTACG TTGGATTTAT CGTCACGATG CATTGATTTA 5580

CTTAGAAACA GATATTTGTT TGGAAAACTG GGAGGAACAA GCCCtGAAGT TATTAAAAGC 5640

GATTAAAGAG AAGAGTCATT TTTACTATAC AACGGTGACC TCAGATTCAG AAGTTCTTCT 5700

TAATGTTTTT TCCGCTTCGA CCAATCGCTT ATTTATTTCA TTATTTGAAC AAGTCGATGT 5760

TGAGAACCAT TTAACAGATA AAGACAAACA ATTTTATGCG AATTTCTTTT CCTATGGATG 5820

TAGCGGCGTC TTAACGAAAT GGATTCTTGA GGAATATCCT CAAACTCCTT TGGAAATGGC 5880

GACGCAATTA TTCCGTTTAG CGAAAGATAC AGAATTTATG GCTTATCATT TGTATGAACA 5940

AGAAGCCAAT GAGTAATGTG TGAAGGATAA AAAGTTTTAT GAAGGAGTGA AACGtTGTGG 6000

TAACTGTGTT ATTTGTTTGT TTAGGCAATA TTTGTCGCTC GCCAATGGCA GAAGCGATTT 6060

TTCGTCAGAA AGTCCAACAA GCCGGTTTAG AATCGACTAT TCAAGTGTTT TCAGCAGCTA 6120

CTAGCCACTG GGAAGTTGGC AGTCAGCCAC ATAAGGGAAC CAGAAAAATT TTGGAGCAAC 6180

AAGGAATTTC CTATCAAGGA ATGCGGGCGA CACAAATCCA GCCAAGTGAT TTTAAAAAAT 6240

ATGACTATAT CATTGGGATG GATACGAATA ATGTGGCCGA TTTAAAAGCA TTAGCTCCTC 6300

AAGAAGAGCA ACCGCGGATT CATTTATTTA TGGAAGTTGT GGCTGGTAAA GAAACCATGG 6360

ACGTACCTGA TCCTTATTAT ACTGGAGATT TTGAGGAAAC TTACCGCTTA GTGGAAGCAG 6420

GTACCTCAGA ATGGTTGAAA AAAATAAAAG CACAGCTGAA AGATTAGGAC TATAGTCCGA 6480

TAGTACTTTC TGTTTCCCCT TGGTACAATA GTGTTAACAA ATTGTTCAGG GAAAAAGAGA 6540

CGGGAGGGAA GTAAGATGAA CAGTAGTTAT TTATCCTATG TTTTTGAATT GAGTTTGTAT 6600

TATTTATTGT TGATAATGAG TCTGCCGCTA GTGTATGCTG TCACGTATCA TTTATCTTTT 6660

AGTAGTATGT ATACAAGTGA ATGGTTAATA ATCTCTGTTT TCTTATCTCC GTTAGTTTTA 6720

TTATTTGCAG GTATTCGTTA TGGTTTTGCT CGTTTGAAAC AGCAAGAACG CCAAGTGATG 6780

AAATAAAGCA AAAAATCCTT GAAAAGTGTC AAAACTTTTC AAGGATTTTT TTATTAAGCC 6840

ACCCAATCGG CTAAAGTTTT TTGCTCCAAT TGGAAAGCCT TCAACATATC GTCGTTGGCG 6900

TCTTTAATAG GCGTCGTTGC TGCTAATGTT CCTTGAACAG CAATTCGCGT CGTTGCTTGT 6960

AAATCGCCAC AGGTAATTAA GGTAATCATA TTTTGACCAG GAACGTCATC AATTAACTCA 7020

ACACGGGTTG GTTCGATTTT TTCTACAGAA GTTATTTTGT ATGTATAAAC AGTAGATAAA 7080

TCAGTGATAT AAATGAGTTC GTCTTTTTTG GTTCTTTCTA AAGGTGAAAA TAAGGAAACG 7140

CCATCTTCCG TTCGATGACT AGCCAAGGCA TAATTGTTTT TCCCCATGAC TTGATCTTCT 7200

TTCATGGTCC CGGCACCAGT TAATAAAGCG ACATTGGACA ATCCTTTAAA AATGGGcAAA 7260

TTAATTTCGA CACTTGGTAT CGCAATGGCA CCAATCACAG GTAAGTTTTT GTTTTCAAAT 7320

TGGGCTTTCA TCACCGCTTC TGTGCTCAAG GACTCAACTG AATCAAAGTC AAACGTTGTT 7380

TCACGAGCCA TATTTTTCTT CACATCAGCT GGTTTCAACT TGCTAACGGC GTACGAGCGG 7440

CTATTTTGTT GAACCACCCA ACTACGTATC TGATTGTTAA AAATTAAGGC TAAGCCAATG 7500

ATAAATAGTA AAACTAATAA ACTGTTGATT AACCAATTTT TTCCTCTTTT TTTCTCTTTT 7560

GGGCGCATAT TTTCCCTCCT TTTAATGTAT CAAGTTTACC ACAATTTATT TGAAAAACCT 7620

ATTTATTTTC TTTGGGGAAT AGGTAAGCTG GCCGGACCTT AATTTAATAA AAAACAACTG 7680

AATCGATTGA TTTTTCAAAA AAATCCTCAT TTTGTAATGG AATTTTAAAG TAAAAGAAAC 7740

AAGCTTTTCT TTTGAAAATA TTGTAAAATG GTAAGGAGGC TAGTAGAGGG GAGATGGGAA 7800

TTTGCTTGAT TTAATTTATG TACATATAGA TGTAACCAGC AACGCGGTTT TATCTAAAGG 7860

AATTACCCAT TCCGATTTTG TTCGTTCAAT TGTGCATCAT CCACAGAACT TACTATTATT 7920

AGATCCTTCT GCCGAAGCTG GCGAATACGA TATGCATTCA GGCTTGAAAA TTATTCGTGG 7980

AGAAGAAAGC GTCAATCGCT ATTTTCAATC GCTACAACGT CGTTGGATGA CTGAAGAGAT 8040

TAAATGGATT GACTTTTCCG ATGTGACGAT GCTAAAAGAG CTAACGCCAT TAGAAATTTC 8100

GGAACTTCTT TATTTTGGAC ACATGAAGAA TAGTTTGCAT TCGCCTTTCT TTTATAAGCT 8160

ACAAAATGAT TTTGTCTTTT TTGAATTTGC AGATCAGATG ACGCGTGTCT ATTATCGTTA 8220

TATTGATGAA TTTTATCGAA TCTTTGCTGA TAAAATTACA CGTGTTGTTT TAGAGAAAGT 8280

CAACCAGAAA AAGGCCTTCT TTAAACGAAA TACGCCTGTC GAAAAATTAA GTCCTGAATT 8340

ACTTAAAAAT ATGAAAGGTA TCTTGCAAGA AGGTGTTATT TTTTGCTTCC AACAAACGGA 8400

AGTGGATAAA GAAGAATATC GGATTCCTAT TTACTTGGTG GAAGATTCAC TTAGCAAGAT 8460

GAAAAATACC TCTTACCAAA AAGAAGAAAT TTTGGCAACG CTTGTCTATA ATAGTGAAAA 8520

ACGCATTTGG CACATTGAAC AAGAGAATGA TTTCGGTGAA ATTTTTATGT TGGCCCATGA 8580

AGGCTGAGAG AGCTCTAACT TTATCAACTA AAGTAGGGCT CTTTTTGTAG CTATGATTTG 8640

AATGAGAAAT CAAGTATAAT AAAATGATAA AAAGGAAAGA AGGTTGTTTA ATGAAAAAAA 8700

TACTCTATGG CGCTCTGACT GGTTTTGCCC TCTTTTTGGT AGCTGGCTGT ACACCCAAAA 8760

CGCCTACTGC TGTTTCATCA ACAAATGAAC AAAAAGAAAC aGCAATATCm ACTAGTGAAA 8820

CraAGTTCATC CGTACAACaA GAaGCAaCGA cGACATCAGA GAGCCAGTGC GTAnTGGTCC 8880

CTTAGAAAAA GTACGCAGCT TTGGGAAAAT T 8911

(2) INFORMATION FOR SEQ ID NO: 652: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652: 



AGTACGCCAT GAGCAGCCAC GTGAAAAAGG CATACAACCA CTGATACCTT TTAAAACTTT 60

TACTTCGGTT ACTTGACCCG AGCCATCCTC GCTAGCCCGA AGCCAAGCAT AAAGCCCTGT 120

AATGTCCATA ATAATATAGA AGCTTTGTAA AAGAACTTCC GCATATACCC GTGTCACTAA 180

CCCAAAATAG CCAATAATCA CTGCACTGAT TAACCCAAAA ATGTAATTGC TGACTTTTCC 240

TTGGGCAATT AAATTCACAC AAATGATGCC GGCTAAACCA CCTACTAACG TAATCCAATT 300

CGATGGATTG ATAGCGAACG CAACCAACTG TACCCCCACC ATCAAAGCTA AAAACAAGTA 360

ACTTCTCGTT TCCCAACCTT TAAAATCATT CACCAAAAAA TTCTTATTCA CGACAAATTT 420

CCGACCTTCC TAAAACCAAA CTTTTCTCTG TTGACGTCTC AATAAGAGTA CTCCGTTTTA 480

AAAAACAAGT CAATCAGATT TCGGAATAAA AAGAAGATTT TAATACTATA TATTGATTCG 540

AAAAATATCr AAACACTACA TATAGTTATT AAGCAAGCTT TCTGATATCC TTTATTGCCA 600

CAATATTTTC TTCATCTTGT TTTAATCTTC TATGTATTCA TGGATTTGTT TCCATCTAAT 660

AATTCCTGCA GCAGAACCCT TTACGACGGT AGCGCTTCAA ATTAATATTG CGAAAGCGGA 720

AGCAACCGCA CATATTTATA CkTCAAACAC AGCCATCGTT CACTCTTTTT ATTGAAATGC 780

TTAAAAATTG aTGGCCTTAT TTCTCGAATC TGGCCACTTA TTAACTTATT ACATGAAAAC 840

ATTCGCCTAA AAAAGCAAAA AAAAACGACT CACCTAAATA AtCGACCTAC AAACAATTCC 900

TkTCATTTGT AGGTCGATTA TCTTTATTAG GTTTGATACG GATGTTTCGC CATTCCTAGT 960

TTCCCTTTAC TGCTCATCTC TATTTATTTT TTTACTGTTT GCCAACAGCT CGTTGTCTGA 1020

AAGTACTATC CGTCCTTTTG TACTTtGTTC AAAGCTGTAA GCTkGCkCAA ATAACGTTTG 1080

cwCTTtATTG GcAGTTTGTG TAAAAGTCAG CCCATAAGGT TGTCCTTCAC TGTTTTTCGC 1140

CAAAGGAACA GTCAATTCAG GATATCCTGC CACAGCACTT AAACCCGTAC CTTCAGAATC 1200

AATAAAGGCA TATCCGTCTA ATtGTTGTTT TTTtAGCAAC GCATCAAACG TTTGCTGTGC 1260

TTTTTTAATG GTTGTTTGAa TGATTtCTTT ATCTGGTTGT TTCTTTTTAA CATCTGCTTC 1320

TAATAAATCT TGCCCATACT gGCACGAACC TTTTTATCTT GTTGATTATA CGCAATTAGT 1380

TCCTCCAGTG TTTTAAATGG TAAGTCATAA CGTTTCGCAA ATTCTTCCAT GCAAACTTAA 1440

ATTCATTGGA AATAACTTTC CCACCATCAA TCCCTTCCAT GTCTGGCGTT ACCGGAATCA 1500

CTTCTGCTCC TGCTTTTTGG AGGGCCTTTT TAGCAGCCGT AAATTCTTCT TCAGCACTTG 1560

GCATTTCTAG CAACCCAATG CGTTGACCTT GTAAGTTATT TTTAGTAAAT TTCGTTGAAT 1620

CAATGtCTCG GAACCGTGTC TTTCTTAAAC GCATTATAGC CTTgCGCAGC ATCAACGACA 1680

CTTCGGGCAA TCGGGCCAAC TGTATCTAAT GACGGTGCCA ATGGCAAAAC ACCTTCGCCA 1740

CTTACTGAAC TGTGGGTCGG TTTAAAACCA ACCACTGATT GATGAGAAGC GGGGGAAACA 1800

ATCGATCCCA TCGTCTCAGT GCCTAAAGAG AAGGCGCCGA TATTCATCGT CACACTAGCC 1860

CCACTCCCAG AACTTGAACC TGAAGGTGTT ATTTTGAGCG GCCCATAAGG ATTAAGTGTT 1920

TGTCCATGCT TGGAACTATA GCCACTTGGT GCTTTCATTG ACATATAATT CGCTAGTTCA 1980

GATAAGTTTA CTTTTCCTAA AATCAAAGCT TGCGCCTCTG TTAACTTCTT AACTACTTCC 2040

GCATCTTCCT TAGGCTTGAA CGTGCGTAAG GCATAAGCTC CTGCACTCGA AATCATGTTG 2100

GTCGTATTTA TATtTTCnTT CAAAGTCACA GGAATACCAT ACAAAGGCTT CTTCGGTGTT 2160

TGAGAGGCTT GCTGGTcAAA TGCCTTGGCT TCCTTAATTG CTTGCGGATT GATTTCAGAG 2220

ATGCTGTTCA TGCCaTTATC TATTTGATCG AACTGTAAAA TACGGTCCAA ATAAAACGCC 2280

GTAAGTTCTT CATAGGTCAA TGTTCCGTTG GCAATCGCTT GCTGTAATTC AGTGATATTT 2340

TTTTCAATAA TCAACGGTGC TTTGGTCCGA GCTTTTTGTA CATCTGTTTC TTTTAATTGG 2400

TCTTCAATCG TTTTAACGAT TCTCTCTTGA TTGTACATTG CTGGGGTTTC TGCCTCAGGA 2460

GTGAACTTTT TGAGTATAAA CCAATACGCT GCAGCGCCTA TTATGACTAA CAAAACAGCA 2520

ATTATTCCGA TTA 2533



(2) INFORMATION FOR SEQ ID NO: 653: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1447 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653: 

AAACACTGTG GGTGAAAAAT GGCCATCACG TGTTCTGGGT ACTTTTAATT CTAGCGTGCC 60 

TACACGTGTC GTAAAGCTGC GCTCATAATA GCCATTTCGT TGACTTTGTC GGTTTTCTGT 120 

TCGTTCATAT TCTTTTGCTT GAATATATTC TGTTCGTTGA TTTTCCATTA GTTGATTAAA 180 

TACCGTTGTT AAAATATTTT TAGAAACGTC ATCCTTTACm GAATATTCAA TAATGCTTTG 240

AATCTCTTCG CTTTTCAGTG TAAAATGTAc TTGGGTCATG TAAAAGTCCT CCTGGGTATG 300 

TTTTTGTCGT TAAAAACATT GTACCGTAAA AGGACTGTTA TATGGCCTTT TTACTTTtAC 360 

ACAATTATAC GGACTTTATC AATATTTGCC TAATATAAAT ATTTAAGTTC GCTCTAAACA 420 

CTAATTCTCC AGTTGATCTA TTACTAAAAT ACATTAGTGG CATACTAAAA AGTTTATCAA 480

TATAAGAAAA CATTAATTTA AAAtCAAaTT CATACTGAAa TTCTGCTACT ACAGAGCTTT 540

TTAAGTATTG AACTACATAT AAAACAGAAA AAAAACTAGT TAATATTAAT AACAGAACAT 600

AGGTAGGAAT TTCTTGAAAA GATCTAATAT TATCTATCGA ATACTTTGTT GLAA I Abb AA 660

TTAAGAGTAA TAAAAGTTGT GAAACAAATG ATAATATTAT TAAACTAAAG AAAI Al 111. 720

TGAATTTTGT AAAAATAAAT GACTTTAAAA AAAACTGTTT CCTTTTGCCT TCTTTTTTAG 780

TCTTTTTCTT ATGTGCGTAT ATTAATATAT TTGAAAAATT TTTTTTGAAT TCTGAAATAT 840

CAATCCAGCG TTTATTACTT GCAGGATCTA AGATTAATAC TTTCTTCTTT TTTATTTTCT 900

CTATGACCAC AAAATGTTGA TTATTCCAAT AACTTATTAC AGGAGTCGGA AGATCTAAAT 960

AATTTGAAAA ACTTGATTTA AATGTCGATA CATCAAATCC ATATTCGTCA AAGACAGTAC 1020

GAATATTCTT GATAGTTAGT CCTCCTTTGG GCACCCCATA TTTTTCCCTT AGTTCTACTA 1080

GTGTACTTTG ATTACCATAA TAATTAAGTA GCATAGTGAT ACATGCCAAA GCACATTCAC 1140

TATGCTCTCC TTGCGCAACG TACTTCAATC TTTTCATAAT TACACCTCTT ATAACTCAAA 1200

AAGTAATGCA TTCGGAATTT CAGAATCTAG GTTTCTCAAT AATTCATAAC CCACACCACT 1260

GATTCCTACA AAGAATCCTA GTGATTCTAA ATACTCACTT cCGGCAACCT TTAGTGTATT 1320

AtTTTTTTCa AAATCaTTTA gCATATAGCT AATAAGTTTA TTTTTTTTAT ACTGATAAGT 1380

TTCAGGATCT TTTTTAGCTA ATTGAATTAG CCCTTCTAAA GTACCTGCAT TTCCATGGCA 1440

CAGACAA 1447



(2) INFORMATION FOR SEQ ID NO: 654: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654: 

TATGAAACAC GAACATTATA TCGAnTATAT ATATAATAGT TAATATTTTG TTGTAAAAAG 60 

ATAGCGATTC TCTTATAATC AAGAGTATTA TTCGAAAACT GGATTTTAAG GGAGAGAAGT 120 

GCAATTATGA ATACGTTTTT CAAATTAAaG GAAAACAaGA CAACGGTATC AACTGAAATA 180 

ATGGCGGGGG TCACGACCTT TTTTGCAATG AGTTATATTT TATTTGTTAA TCCGTCCATT 240

CTTTCAGAAA CAGGGATGCC TTTTCAGGCT GTTTTTTTAG CGACCATTAT TGCTTCAATC 300 

ATCGGTACAT TAATCATGGG GCTGTTTGCT AATGTACCTT ACGCTCAAGC ACCAGGGATG 360 

GGACTCAATG CTTTCTTCAC GTTTACGGTC GTTTTTGGGA TGGGGTATAC ATGGCAACAA 420

GCATTAGCAA TGGTTTTTAT TTGTGGTTTG ATTAATATTT TGATTACAGT GACAAAAATT 480

CGCAAAATGA TTATTAAAGC AATTCCAGAG AGTTTACAAC ATGCTATTGG TGGTGGGATT 540 

GGAATTTTTG TCnCTTATGT AGGGTTGAAA AATGCTGGCT T 581 
(2) INFORMATION FOR SEQ ID NO: 655: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 




(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655: 
TTTTATCATA GATCGGAGGT TATGAGGATG AAGTTAGATA TGGTCGGTAT TATCGTCAAC 60 

GATATGACAC AAGCACTCTC TTTTTACCAA ACGCTAGGTT TTAACGTGCT GGAAGAAGCT 120 

TCGCCTGATT ACACAGAATT ACAAAACAAT GGCGTCAGGA TTTCTTTAAA TTCACTACAA 180 

ATGATTACTG GTGTCTATGG GTTTGCGCCT AGCCAGCAAG GCGATAAAAT CGAACTTGCT 240 

TTTCTTTGTG AAcGGCCGCA GAAGTTGATA ACACCGTGAA AAAAATGAGC CAAGCTGGTT 300 

ACATTGTTTT TAAAGAACCT TGGAATGCAC CTTGGGGCCA ACGATACGCA ATCATTAAAG 360 

ACGTGGATGC AAATTTAATT TCACTCTTTG CCCCTCTATA ATATTACTAA nAmATGCGCC 420

ACTTTCGTTT CTTTTACTCT CTTTCGgAAA GATTAGTGCA GAATGGGtCT TAAGTAGGaT 480 

GTCyTTTCtG tCyCtTTTTT TTGGTATAAT GAACCTTATC CAATGGATGG GGn 533 
(2) INFORMATION FOR SEQ ID NO: 656: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656: 



GCACAAGCAA ATGACATTTG AAGAAGTATT GCCACAGATT AAAGCAGGAA AAAAAGCCGT 60

ACGTAAAGGC TGGAGCGGTT TTGAACTTTT TATTGAATTA CGTGATGAAA TTGGCACCTC 120

CGAAnGTGAA TTTTTACAAG TTACCCCGTA TTTTTTAATC AAAACTTCTG ATGAAGGCTA 180

CAGCATGTTT TCACCAACAC CCTGTGATGT CTTAGCAGAA GATTGGGTGA TTGTTGAAGC 240

ATAACTACAC GCATAAGGTC TGGCAGATTA CGGCGATTGC TCGTTCTTCC AGACCTTAAT 300

TTTATAAATG TTTAAAGGAG TTTTTCTTAT GCCAAAAGAA TTTGAAACAA AACGGGTGCT 360

TGTAACCGGT GCAGCTTCAG GGaTTGGCCA AGCACAAGCA ATTGCCTTTG CTGAGCAAGG 420

TGCTGAAGTT ATTGGCATCG ACCTAGACGA AACGGGGTTA AAGCAGACAG CCGCACTGGT 480

TGACCCAGAT TCTGCTAAGT CGTTTACTTA TTTTGTCGGT GATGTGTCTT CTCCCTCATT 540

TGTGCAAGCC ACGATGAAAC AAATTGTGAA AAACAACGGC CAAATTGATA TTTTATTAAA 600

TACGGCAGGT ATTTTAGATG ATTATCGCCC TTCTCTAGAA ACTTCAGAAG CTTTATGGGA 660

TCAAATTTTA GCAACCAATT TAAAAAGTGT CTTTTTAGTG ACCAATGCCA TATTACCTTA 720

TTTCCTCCAA CAAAAAAAAG GAGTAATCGT TAATATGGCA TCTATCGCTG GCTTAGTAGC 780

TGGGGGCGGC GGCGCAsGTA gcACTGCCTC CAAACACGCA ATCaTCGGGT ATACAAAACA 840

ACTTTCCTAC GATTATGCCA AATTAGGCAT TCGAGCAAAT GCGATTGCGC CAGGTGCCAT 900

CCArAcACCC ATGAACGCAG CTGATTTTGg CAGGAGAAGG TGAAATGGCT GCyTGGGtAG 960

CACGTGaAcA CCCGCGGGCC GTTGGGCACA GCCACAAGAG GTAGCnAAAC TTTCnTTATT 1020

TCTAGCTAGT GnTGACGCTG ATTATATCCA TGGCACAGTT A 1061
(2) INFORMATION FOR SEQ ID NO: 657: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657: 
GAACAGCCGC AATGTGTCAA GACACATAAA AGAACTTTCT GTTATTTCTC ATAAATAACA 60

GAAAGTTCTT TTTTATACTG GTTTATTTTT TTCTAAAGCA TTACTTAAAC GAGCGAATTC 120

TTCTAAAGAC AATGTTTCAC CACGTCTAGA AGGATCAATT TCTGCTTCAG CTAGGCTCTC 180

TTTTAGCCAA GCCACCGTTT GTTCATCTTT CCCATAAAAA TGCGTTAAGT TATTCCACAA 240

TGTTTTACGT CTTAATTGGA AACTGGCTTT GGTTAATTTG AAAAATTCTT TTTCATTGGT 300

CACTGTCACT GCTGGTGTAG CACGGCGCGT CAATTTAATA ATCGCCGAGT CAACATTGGG 360

CTGCGGCACA AAAACTGTTT TTGgGCACAA TAAAGGCGAC ACTTGcTTCC ATAAAATATT 420

GGACAGCAAT TGaTAATGAG CCATAAGCTT TCGTTCCTGG TTtAGCAGCA ATTCGATCAG 480

CGACCTCT 488

(2) INFORMATION FOR SEQ ID NO: 658:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1906 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658:

TAAAGGnGTT ATTTTAACGG ATACTTTTTA CAATTATTCG GAGCAAGTGT AGCCAATATT 60

CCnGGAAGGC TTTCAACAAA TGnCnCTGTC ATTACCAGAT ACCGTCTTGG CCTTTTTGCC 120

CAGCTTTGAT TGTTTGGTCT ACCTTTAAAG CATTTAATGC GACGCCTGTA ATCGGGATTG 180

TTATTGGTTT AATGATGGTA TCACCCATTT TACCAAACGC TTATGCAGTT GCAACACCAG 240

ATTCTGGCGT GAAAGCTATT ATGGCTTTTG GTTTTATCCC AGTTGTCGGT GCACAAGGGT 300

CCGTTTTAAG TGCAATTGCT GCTGGTATAA TTGGCGCAAA GATTGAACTT TTTTTCCGAA 360

AAAAGATGCC GAATATTCTT GATCAAATTT TTACACCCTT TATGACTATG TTAATTACTT 420

TTTTAATCAT GATTCTCGGA ATTGGCCCAA TTTTGCATAC CGTTGAGTTA GGCATGGTAG 480

ATGTTGTCCA ATGGTTGATT GGGTTGCCCT TAGGATTAGG TGGTTTCGTG ATTGGTGCCT 540

CATATCCTTT AATGGTTTTG ATTGGTATTC ATCATACACT AACTATGGTA GAAACATCAT 600

TATTAGCAAA TACAGGATTT AATGCATTGA 1 1 Ab AA I A lb i uLLn iul ML. bbo 1 i J.o\-k-/i 660

ACGTTGGCAG TTGTTTAGCC TTTGCGAAAA AAgbbbAAbA nrtu 1 bbrtbrVv? 720

CCATTGGTTC TATGTTGTCT CAATTATTTG bAbl 1 AGTbA bbb 1 0 1 b 1 In All bbA 1 1 Ab 780

TTATTCGTTG GAACTTAAAA CCGCTGCTAT GTGTCTTGTT 1 AbAi bAbb 1 1 lAbbbbbAb 840

CAATTTTAgC TATTTTTCAT ATTCAATCAA ATTCTTAKGb b 1 1 Abbbb 1 i Ail bb 1 lb 1 K 900

TCTTAATGTA TmTCTATAGT GCACATCAAT TAGTGATCTA TTTATTAGTC bbAbTTTTAT 960

CTGTCGGTGT ATGTTATGCA TTAACCAGTC TATTTGCAAT TCCGCAAGAA GTTTTGATTT 1020

CAGATaAAGT AATCGAAGAG GAAGAACGCG AAGTCTTTGA AATGCAACAC AA1 Abb 1 1 Ab 1080

ATGAACAACT TTTTTCTCCG GTAACTGGTT ACGCCATGAA TCTTACTGCT GTTAATGAbb 1140

CAGTATTTTC AAGCGAGATG ATGGGCAAAG GTTTAGCAAT CATGCCAACA GbbAAbAAbb 1200

TTTACGCGCC TGCAGATGGT TTATTAAACT TGGTTGCAGA AACTGGGCAC GCTTATGGTA 1260

TCCAAACGGA CGCAGGAGCT GAAGTATTAA TTCATATAGG AATTGATACT GTCACATl bb 1320

GACAGGAGGT CTTTCAGACA CAAGTAACCC AAGGTCATCG TGtTAAAAAA GGAGATCTTT 1380

TAGGAACGTT CGATCGAAAA gCCATTAAAG AAGCTGGGCT TGACTCAACT GTCATGGTTA 1440

TCATTACAAA TACTTCAAGT TATTTAAGCG TAGAGCCAAT GATGTCAGAT b AT AAT b AAA 1500

TTACGCCGGA ACAAATCATT TTAAATCTAA ATACACCTAA CTAAACAAAA GArCAGCAAG 1560

AAGTAGTTAT CACTTCTTGC TGTTCTTTTT CTTAATAAAA AACCAATGAT TTTCGGTTGT 1620

ATTTGTCTAT CCGTTTGTAG TATGATTGAG TCAGTTTTAA AGGAAACATG CCAAGTATGG 1680

AAGACCATAC TTGACACGCA TTGACTAATT AAACATGGGG AACGTTGCCA TAAGTTGATG 1740

TTCCAGTCAA TAATGCAAGT AAATCmAACc ATAGCATGTA CCTCCTTTAA TTGATAATCA 1800

AAGTATAACC ATTTTGnAAT GGAATTTACC ATAGATAATA CCAAGTGGAG GTCTGTnCGT 1860

GGAAGGAGnC AGGAGGCCAG AAATTAAACC AAGCCGGGGA AAnCTT 1906



(2) INFORMATION FOR SEQ ID NO: 659: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659: 

GGCTGAAAAG GGGTGGCTTC AnCCCAAAAA CTTTTTTGGG CTTCCGCnAA CGACCGCTGG 60 

TGGGTCAAGT CACnGCTGGT GGACTTGayC CAAgCACCTT GtTTtGAtCG ACCACTGCTt 120 

GgTTGGTCTT tAAaCAGATt GgAtCAATGG TGtCTTTTtC TTTTTTCGCA GTGTCAGCAA 180 

CTTGTTGTTT TTGATCGACA ATTGCTTGTT TCTCTGTTAC TTGTTGCTGT TTTTCAGTAA 240

TGGCTTGCTC CGTTGTTTCT GTAGCTTTCA CTGCTGGTTG TTCTGTAGAA CTGTTTTCAG 300 

GTGTTTTTGG TTGCGCTTGT TCTGCTgCCt GTACTTCATT TCCCCCAATA CCTGCTACCC 360

CCGTTGCGAC TAAAATACTT AACCCTGCAC TTGCGATCTT TTTCATTTTA TTTTCCTCAC 420

TTCCGATTAT TTTTGTTGTA AATGGGATAC CCATtGTCTT TACGCCATtG TTGTAACGCT 480 

GTAGTTGTTA TATGGAACTC TTCCGCTATT TCTTTAAACT GTAAGCCCAT TTTTTGATAG 540

GCTTCAAATC TTTCCTTGCT AATATGCGAA GAAAATTGTC TGGTGTTTGC TGTTTCTATC 600 

TTTCTTGCAA GTTCTTGTAT TCGTTTAAAA CGCTCGGTAT CTTCATTCCT ATACCAGTCT 660 

GCGTCTTCAT TCATTAATGC CAACATTTCT TTACGCCATA TTTTTATTAA TTTTTTAGTC 720 

ATACCCCTTT CACCTCAAAT TAACAATAAC AAAATAATAT TTATTATGAA AATAGTAACC 780 

TTTTAGAGAA AATAAAAAAT GTCTTTTACA TGACAATTAT TAAATGCCAT AAACAACATT 840

ATCTTTAATT TCAAATTCAA ACCATTCTGA AAAATCTTGT TTAAATCCAT TGACGCTATA 900 

TTTTTCTTGT AAAACTACTT CTTCCGCATT CGCAGGGTCT AAATAGGTCG GTTTATCTTT 960 

TGGACGCGCA TAAACACCTA CCCAACTTGT AAAATCCCCC TCAAAAGCAC CATCAATTTC 1020 

ATAACCAACT TCTTCTAGCT TCTGACGAAC TTGTTTCTTT ATATCAATAA TATCTTTAGG 1080 

aATATAATTT TGaTACTTTT CTTGAAATTC CTTAAAAGCT AATACTTCTt TACCATAGAA 1140

CCTTTTTATn 1150 

(2) INFORMATION FOR SEQ ID NO: 660: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1175 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660: 

TCnGATTnTC TTATTATTAA CGTAGTGGCT TATTCTAnTA TCCCCAAAAT CCCCACTGTG 60 

TATTTAATCT AATCTtAAAC TGATTTTCCC ACGAAAGTTT TATAAaCCCA AATATTAATG 120 

ACTCTGGTCT AcTTTATTAA ATAATCCAAA AGCTAAATGA CCTAACTCAT GTACAAAGAT 180 

AGACAATAAT AAGGCAAAGT ATGCTATAGA TACTAAATTT AATTCAAAAA CAAGAGTAAA 240

GATTACGAGA AGTAACGTAA AGATCATCGG AGATATAAAT TTCATCTATT TTATTCTCAC 300 

CTCTTTGTAT TTAAGCATGC TATTTTTATC TATAAAAGAA ACAGCATTTT ACAATATTTC 360 

TATATGATCT AGTTTCAACT TCTATATTTG TTTCTAATGC TGTAAAACCA ATCCCAATCA 420

TGCCTCAAAT AACTAAACTT TTAACTAACT TCTTTTTCAT TAGTATTCAT CTCGAAAATA 480 

AAAATTAGAA AACGTCTTCC ACGCAAGAAT TGTAGCTCCA TTGATCTAGT TTTGTAAACA 540 

TTGAGACAAA GGTTTCTATC TAACAATTTT GTAAGTTGGT GCCCTTCGAA TTTGATTAAA 600 

TTAGATTTTA TTCCGTTTTA TTTCAAGGTA AGTATTAATA CAAGTTTTAA TTTTTATAAA 660

AAAATGAATC TTACCATAGA GATTTACTAA CTAATTATTT ATACTTTTTA TAATAGATTG 720 

GAGGTTACTA TATGAGGGAA AAGTTATTAA TAGGGAGTTT TTGGTTATCA TTCTTTAATA 780

TTGTTTGTAA AATTCTCGGA TTCATTTATC TAATACCTtG GCTAAAATTT ATGGGrACTA 840

TCCACAATCA ACAAACTGCT CAAGCTATTT ATAATGTTGC ATACTTACCT TATGCTTTAT 900 

TCTTATCACT TGGTACAGCT GGATTTCCTA GTGGCATTGC TAAAAAAATT GCTGAATTAA 960 

ATATTAATGG AAATAAAAAC CAAATAAAAG AATTGTTCAA AAGTGGCTTG ATAGTTATGG 1020 

AAATAATAGG GATTCTCTCT GCTTTAnTAA TGTTTATTTT TGCTCCTACA TTAAGTAAAA 1080 

TAAGTCCAAT AGTTGATCAT ACTGCGGGTA TCACAGCAAT CCGTAGCCTT TGTTTTTCTn 1140 

TATTAATTAT TCCTAnACTT AGCGCACTAA GAGGG 1175 
(2) INFORMATION FOR SEQ ID NO: 661: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661: 



TAGTCCAAAC CCCCTTTTAT ATAGGGCCAG GCGCCTTAGG CTATCCACTA ACnAnAAAAT 60

CGGATnTCTG CCCCCTTAAT CATTGGTTTT CCCCCCCGTC AAAGCTACTA ACTGTTGATT 120

AATTTTnTTA ATCTTTTGCT CCTCCCATTT GAATTAAATG TCGCGATTGA TCTCGTTTTC 180

TTAACAAAAT AGTTTCCaAA AAAGAAGCAT CCTTAAGTTT TCTCAAAGTT TCCGCCATCT 240

CTTCAACTTT AAACCCTGAA TCTCTTAACA GTAAAATTTG TTTCAATAAA GGAATTTGAT 300

GTGCGCTATA ATAGCGATAT CCTGTTTTTT TATCAATTTC ATTTGGAACA AAAATTCCTG 360

CTTTTTCATA ATGACGCAGC ATCCGTGGGG AAATATCCAT TAACTTGGCA AATTGCCCAA 420

TTTGATACAT AGAGTCTCCT TTTCATTCAA TTGATATTCT TATCATAAAC CTTGACACCA 480

TGTGAAAGTC AATGAAAAAG TTTCTATTCA TACATTTCTA TTTTATTTGA ATATTCCCGT 540

GATCTTTCTC TAGTTATTCA GAAGTATATT GAGGGGTGAA ATTTTGGAAT TAAAAAAAAC 600

GATTTCCAAC AAAATAATGA TTATTTTAAT TGCAATTATC GTTGCAATCT TTGCAATGGG 660

ATGGATTCTA CCTATTGGCA TCGATAAAGT AACTAGGTTA AGTTACCGAG AATACTTATT 720

TAGTACATAT ACTGTATTTA CACAATTTGG CTTTCTAATG TTTTCGTTTC TAGTCTCTTT 780

CTTTATTAAT AAAGAGTACT CAGGAAAAAC AATTTTATTT TATCGTATGA TGAATACCAA 840

TAGCCTTACT TTTTATATAA AAAAAGTTTT GACACTTACG GTAGAGACCC TAGGCAGTAT 900

TCTAGTTCTG TTATTTATCG TTTCCTTTAT TTTTATGGAT TTTTCTGTAA TcTGCAAATG 960

965



(2) INFORMATION FOR SEQ ID NO: 662: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662: 



ATGTAACAGT TTACAACGAA GAATAAGAAG ACTATnATTA TGTTAACTAT TTTTAGGTGT 60

GCCTAAAAGT GAAATGTGTG AACAAGATAG ATTGAGGGAT GTTGGATGAA AGAACAGAAA 120

ACGGaAAAAA TGCTTAATTA TGCAAATGGA CCTAGTTTAG AAGAGATAAA TAATACCGTC 180

GAGGTACCTA AAAACGCAAG CTTTTGGCGA ACTTTGTTAG CCTATAGCGG ACCTGGCGCT 240

TTAGTGGCTG TTGGTTATAT GGATCCAGGA AATTGGATTA CGTCAATTGC TGGCGGAgCG 300

GAATATAAGT ACACGTTACT TAGTGTGATT TTGCTTTCTA GTTTAGTTnC CnGGGGAnTA 360

CAAAGTATGG CTGCGCGCTT AGGAATCGTG ACAGGAATGG ATTTAGCTCA AGCGACTAGA 420

GAGCATACAA GTAAACGAAC GGGCGTTGCC TTGTGGGTGG TGACGGAGTT AGCCATAATG 480

GCTACAGATA TCGCTGAGGT AATTGGTGGT GCCGTTGCTT TGCAATTATT ATTTGGTTTT 540

CCATTATTAA TTGGTGTGTT GATAACAACG TTTGATGTTT TATTACTGnT GCTACTGACA 600

AAGTTAgGCT TTCsCaAAAT CGAAGCAAwT GtTTCTTGTT TAATTGCAGT CATCTTTTTT 660

GTTTTTGCTT ATGAAGTGGC ATTAGCAGAT CCAAATGTTG GTGAAGTATT ACGAGGTTTT 720

ATTCCAGACA CAAAAATAGC GACAGATAAA TCCATGTTAT TTTTAGCCTT GGGGATCGTT 780

GGAGCGACAG TCATGCCCCA TAACTTATAT TTGCATTCTT CCATTGCGCA ACACGGAAAa 840

TTTGATCGTA ACGATGATGT TGAGAAAGCC AAAGCAATTC GTTTCACTAC TTGGGATTCA 900

AATATTCAAT TAACTGTTGC TTTCGTCGTA AATTGTTTGT TGTTAATTTT AGGAGGAGCA 960

TTATTTTATG GAACCAACAG TGAATTAGGT AAATTTGTTG ATTTATTTGA TGCTCTGAAA 1020

AATCCAGATA TTGTTGGTAA TATTGCTAGC CCAGTGTTAA GTATTCTTTT TGCAATTGCT 1080

TTACTGGCTT CGGGTCAAAA TTCAACGATT ACAGGTACGC TGTCAGGACA AATTGTAATG 1140

GAAGGCTTtA TCCATTTAAA AATGCCGTTG TGGGCACGAC GAGTGTTAAC TCGTTTGTTA 1200

GCAATTGTGC CAGTCATTAT CTGTGTCATT ATTTATGGCG GTAGCGAAAC GGCTGTTGAA 1260

GACtTGTTAC TTTATACGCA AGTTTTCCTA AGTATTGCTT TgCCAGTGTC GATTATTCCT 1320

TTAACGATGT ATACAAGTGA TAAAAAATTA ATGGGCCAAT TTGCGAATCC AGCGTGGGTG 1380

AAATTTTTAG CTTGGATTAT TGCTATTGCC TTGACGTTAT TGAATTTATT TTTAATTTAT 1440

GGCACATTAA CTGGTTTAAC GAATTAATAA AAACAAAGgC TGACTAGAAA AGTTAGCCTT 1500

TgTTTTTTCA TAGAATTTGC GCGTAAGAAA TGGTAGAATA GGAAAGATGA AATACAAGGA 1560

AGAGAGGCTT TCACGTGACA GATTATTTAA ATGTAGGAAA AATTGTGAAC 1610


(2) INFORMATION FOR SEQ ID NO: 663: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663:



AAATTTAGGT GGTGGCGGTC ACTTTACTAA TGCGGCAGaC AATTATCAAA CGTAmCAGTA 60

GCAGAAGTAA AAGAGCAACT ACTTGATGTA ATTCGTCAAA ATATTAATGA AATGTATGAA 120

CAGGAGTGAA GGAAATGAAA GTCATTTTTT TACAAGACGT TAAAGGAAAA GGAAAAAAAG 180

GGGACGTAAA AGAAGTGCCA ACTGGTTACG CCCAAAACTT TTTAATTAAA AATGGTTATG 240

CGAAAGAAGC CAATAAAGGG AGCATGAGTG CGTTAgcGGG TCAAAAGAAA GCCCAAGAAA 300

AACATGAAGC AGAAGTTTTA GCACAAGCAA AAGAAATGCA AGCCTTTTTA GAAGATGAAA 360

AGACTGTTGT AGAAATCAAG GCAAAAGCTG GAGAAGACAG CCGTTTATTT GGCTCAATTC 420

CTTCAAAACA AATTGCAGAA GCCTTAAACA AACAATACAA TGTAAAATTG GATAAACGCA 480

AAATCGAATT AGCCAACCCA ATTCGTTCTC TAGGTTACAC TAACGTACCA GTCAAACTAC 540

ATCATGAAGT TACAGCGAAA ATTAAGGTTC ATGTCGTAGC GGAATAATAC AATCGATAAA 600

AAAGAAGCCG ATTTCTCCGG CTTCTTTTTT ATGCAAAATA AATTACTAGA ATTATTTTTT 660

ATAATATTTA TTAA ATTAAP ftAAACiTATTA TATTTTTATT 720

AATTGTATAT TTTTAAAAAT TGGAAAGTGA ACTAAAAATA GAGAAATCAG CAGAGAAAAA 780

ATGTACTTTT TTAATGAAAT TGTGTTAAAA TGGATAGACT AAACGAACCA GAAAGTAAGA 840

CATCGTGGGA GAGAaCAAAA TGAATGAATT ATTGCAAGAT AGGGTACCGC CACAAAATAT 900

TGAAGCGGAA CAAGCCGTTT TAGGTTCTAT TTTCTTAGAT GCAGATGTGC TAATAGAAGC 960

GATGGAATAT GTGGAACCTA AAGATTTCTA TCGTCGTAGT CATCAGTTGA TTTTCCAAAC 1020

GATGATGAC 1029



(2) INFORMATION FOR SEQ ID NO: 664: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 567 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664: 

CTTTCTCAGA AAGAGAAATA TGATAAAAGG AGTTGTTCAT ATGGACTATG CTAAACTAGC 60 

TTCCGACATA ATTCTAGCAG TTGGAAAAGA TAATCTTATC GCTGCTGCAC ACTGTGCAAC 120 

ACGTCTTCGT CTCGTACTAA AGGATAATAC AAAAGTTAAT CAAAAAGCTT TGGATGAAAA 180 

CCCAAATGTA AAAGGTACCT TTAAAATTGA TGGTCAGTAT CAGGTTATTA TTGGTGCTGG 240 

TGACGTTAAT TTTGTCTATG ATGAACTTAT AAAAAAAACT GGCCTTTCAG AGCTTTCAAC 300 

AGACGACTTG AAACAAATAG TTGATAAAAA TAAAAAATTC AATCCAATTA TGGcATTAAT 360

TAAACTACTA TCAGAAATTT TCGTACCAAT TATTCCTGcA CTTGTTGCaG GTGGtTTATT 420

AAkGGCTCTA CGAAATTTTT TAACTGCCGC GGGtCTTTTk GGACCAAAAT CaATCGrAGa 480

AAgTATCCTG cAATTAAAGG AATTTCCTCT ATGATTCAGC TTATGTCAGC TGCACCATTT 540

ATGGTTTTTA GGATTGTAGT TGGTATT 567
(2) INFORMATION FOR SEQ ID NO: 665: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665: 

TTTCCGTTTT CTGCAGTGAT GAAGCCAAAG CCTTTGTCTG AGTTAAACCA TTTTaCTGTA 60 

CCTGTTTCCA TAATAAATAT CCTCCTAGAT GTGTGTTATA ACACATTCAT ATTTTGCAAT 120

TACTTGAAAG GTGAAAAATG TGGAAGATGT TTATTTGAAA CAGCTACCCA TATTACCGAA 180 

CAAAAACTAC ATCTATTACT ATAGCATACT AACTGGTTTT TGTATAGCGA TAACCTTATT 240

TTCTTTTTTA TCGGATGTTT TTTAAAAACT AGAAAGTGAA GATAAGGCTG TTATAGTGCT 300 

GTTTCAATAA GGCTCAGAAA AAAAGCCCAT TGTAAGGCGC TTTCTTACAA TGGGCTTTTT 360 

GCCTGTTTTT AATAACGGAG GATAGCCGCT AAAGATTTTT CATCTGGTGC AGCTTTTTGG 420 

TCTAAAACAA AAACTTGACC GCCGTTTTGA ATGACTTGGT AAGCAATGGT GTTTAAAAGT 480

TTGCGACGAT CATACTCTTC AGATGACATT TCATTGTTTT CAGTAACAAA GAGTGACGTT 540 

GCAATAAAAA GATGGGAAAT TTTGCCGTCA GCTGCCGCTG GCGAAATATC TGTAAATTGA 600 

TCTAAGAATT TACGATCTAG TAATTTGTTA TAAGAAGCTG TTTCTTTTTC TGTTAACTCT 660 

GCAGCTATTT TTTCACTAGC GCtGCGGATA TCTkGAATGG TTGCTTGTGC AGGTGAAGCA 720 

GCAACTGAAA TTGAGCAATC ATAATAAGGA TTTTTAGCAA TCTTTTTAAA GAGTGTTTGG 780 

TTTTCTGGTA AAgCATAAAG ATAAAGCGGT AATTTTTCAG GATTATCCAG TTGATCTTTT 840

AAGAAAGTAT CGACTGCTTG ATAGTAGTTC ACCCAATCAA TTTCAACTTC TTTATCTTTA 900 

GTATTGATGC TATGATAAGC GACACCTTCT TTAGAACCGT TAGAACTACC TTGAGTTGAA 960 

AAGTTTAAAT TACCACCCGT TAATTCATCA CCTAAAGCTG TGACCACGTC TGTTGGCGCG 1020 

TCTTTAGGCA ATTCAACAGG AGTCACAGTC GTATGATCAA CCTTGTAAAG TTTCATTGAA 1080 

TCGCGATTTA GAGCAAGTAG GTAATAATGG TAGTTAAATT GTTTGTCTTT GACTAGTCCA 1140

AGTAAATAAG GAATCGTGTC CACGTAATAT TGATCGTCAA CGGGCACATC TAATCGTTGA 1200 

ACAAAGGTGT TTTCTGGAGA TAAGATGATT GAAACACTTG CTGTGCCGTT GCGCCAGAAG 1260 

GAAGCATCAG CTAGAAGTGC GTCAATTTTT TCTTGGAAAA CAGACCATTT TTCCTCAGGA 1320 

AATTTTTTTT CAAAGCGTGT TTTTGCCGCT TTTGCAAAAT TTTTCAAAAC AAGCGAATCT 1380 

TTTTCTACGT TTTgATGAGC GACATGGGTG TTAAACAAGA AAGTAACAAA CGGTCCATGA 1440

aCTTCTTCAG AAAATAATGT TGATAAGCTA TCTTTTTTTT GATTTGCCAT AATAAAATCC 1500 

CCCTTCTAAA AGTAAGTGTA GCACAGGAAA TCGAATGTTT AAAACAATAT GTTCAATAAA 1560

AGArAACGCT TTAAATAATC TAAATAAAGG ACTGTTAATG GTGTTCTAAT AAAAGTTTTA 1620

TAAAAAATAT AGCTGAAACT CATGATAAAA AAGGCACAAA CCACTATAtT ACGTTATGCC 1680

AATTATCCCG AATTGGTTTT TTCATACTTA CTCCTACCAA GAGTAAATAA CAACCTTCAT 1740

TTCCACCACA GACGACGGTC GTGGTGGTTT TTTTTGTTTA GGTGCTAGTT TTTTTGCTAT 1800

CAATTGACTA GGAGAAGTGA ACGTTCTTTA AGAATTTGTC ATTTCAAATT TACGGAACCG 1860

CAAGAAACGA TTTGCTTCTT CCTCTAACTG AGTAATTTCT TCTTCTGTTA ACTCTTCCGT 1920

AGTAATGTGA ATATGTAACT CTTTTTGTGT GCTTTTCATA GGTGCCTGAT GATTAATCTC 1980

TGTTTGTTGG GTGCGGCCCA TTAAATAGTC CAAGGAGACA GAAAAATAGT CTGCTATTTT 2040

TAAAAGATTC TCTACTTTAG GCGAAGAGGT ATCCCAACGA CGAATTTGTC CATTGGAAAT 2100

GCCGACGTTT CTTTCTAATT GGGCWVbGbl 1 Avj i LjU 1 1 lu 1 lU J. J. Irvrtl U 2160

CGAGAAATTA AGGTCATTTC AAGTCCTCCT TGTGTTTTCC GTTTGCTAAT TTTAAATTAG 2220

CGTAAAAGCG AATGnTTGTC AATTGCGAGC GTGTAATAAG AAGGTGAACA TCTCATGAGT 2280

GATTTTTTAG AAAATTTCCG ATGAAAAAAA GAAGAAATAG AGTTATTATT AGTAAGTAAA 2340

CTGTTTTGGA GCGAGAGGGG ATGAATCAAT GGAGAAAATT AAAAGTTACC AAGAGGATTC 2400

TGAAGTGCAG AAAAATCGTT GGTGGATCTT AGTTTCTGTG GC 2442



(2) INFORMATION FOR SEQ ID NO: 666: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2413 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666: 



CCAATTCGGn CCAGCCTnTT CGACCAATGT TGCACCAAAn ATAAGTCnTT TAGACAATTA 60

TTCATCTGCC GAGGGAAaGC CTAGCACGAG TGATCGGGAT TCATTTTTTG TAAGAGGAAC 120

GTCGAGCGCA ACAATTCGCG CACCAAAATC TGTTGCTTTT AAAGTCACAC CATTTTTATT 180

CGTCAGTGTG TAAAGAAGCA TTCCTTGTCC AAATGATTCA GTTGAAATAG TCAAAATAAT 240

CAATCCTTCC TTTAAGTAAA TATTTAGTAA ATTTAACTTT GTTGTTTAGA TTATTTAAAT 300

AGTCAAAAAG AAATGCtAAG AAAATAAATC TTTATTTAAT TGTATTGTAC ACGTTTTCAA 360

TTGAAATGAC AATTAAGAAA ATATTAAGTA AACATTTGCA AAAGTTTAGT AAATATTGTA 420

CAATTTGTTT ACTAGTGAAA GGAGCGATTA GTGATGGAAG AACTGTTAAT GAATCGATTT 480

GTAGAAATAT TTGGTGAGAA AGGGACAGCC TGTTATTTTG CCCCTGGAAG AATTAATTTG 540

ATTGGTGAAC ATACAGATTA CAATGGCGGT CATGTTTTCC CAGCGGCTAT TACTTTAGGG 600

ACATTTGGTG TGGCAAGAAA AAGAGACGAC CAAAAAATTC GGATGTTTTC TGATAATTTT 660

AAAGAGGTTG GTCTCATCGA ATTTTCATTA GAAGATTTAA CCTATTCAGA TTCAGATGAT 720

TGGGCAAATT ATCCTAAAGG CGTTTTAAAT TACTTGATTG AAAGCGGACA CAATATTGAT 780

TCTGGcTtAG ATGTGctCTT CTACGGTACG ATTCCTAATG GTGCTGGCTT GTCTTCTTCT 840

GCTTCTATTG AACTTTTAAT GGGAACGATC TGTAATGATT TATATGCTTT ACACTGTCCA 900

ATGCTTGAAT TGGTGCAAAT CGGTAAAAAA GTGGAGAATG AATTTATTGG CGTTAATTCA 960

GGAATTATGG ACCAATTTGC CGTAGGGATG GGAGAAAAAG ATCAAGCTAT TCTACTAGAT 1020

ACCAATAATA TGCACTATGA AATGGTTCCC GCCAAATTAG GTGAGTATAC GATTGTCATT 1080

ATGAATACTA ACAAACGCCG TGAATTAGCA GATTCAAAAT ATAATGAACG TCGTGCAGAA 1140

TGCGAAGAAG CCGTTCGTTT GCTTCAAAAA GAGTTATCTA TTGAATTTTT AGGTGAATTA 1200

GACAGCGAGA CATTTGAACA ATATCAAGCA TTGATTGGAG ATCCAGCGTT GATTAAACGA 1260

GCGCGTCATG CCGTCACAGA AAATGAACGA ACTTTATTGG CAAAACAAGC GCTAACTGAA 1320

GGGGATTTGG AAGAATTTGG CTTATTATTA AATGCCTCAC ATCGTTCGCT AAAAGAAGAT 1380

TACGAAGTAA CTGGCATTGA GTTGGATACC TTAGTGGCTT GTGCGCAAGA ACAACCAGGC 1440

GTTTTGGGCG CACGGATGAC TGGTGCAGGG TTTGGCGGTT GTAGTATTGC TTTGGTACCA 1500

AAGCAAAATA TAGATGCATT TATTGAAGCT GTTGGCCAGA GTTATCAAGA TAAAATTGGC 1560

TATGCTGCTG ATTTTTATCC TGCATCAATT GATGATGGTG CCAGAAAGTT ATTTTAGAAA 1620

GGAAGGCTTT AAATGTCTAT TTTAGTATTA GGTGGCGCAG GCTACATCGG CTCACATGCC 1680

GTCGATCAAT TAATTTCAAA AGGCTATGCC GTTGTTGTTG TTGATAATTT ATTAACGGGG 1740

CATCGTTCAG CAGTTCATGA ACAAGCGACT TTCTATGAAG GCGATATTCG TGATAAAGCT 1800

TTTTTACGTA GTGTCTTTGA AAAGGAATCA ATTGAAGGGG TGTTGCACTT TGCGGCCAAT 1860

TCTTTAGTAG GAGAATCCGT GGAGAAACCG TTAATGTATT TCAATAACAA TGTTCACGGC 1920

ACTCAAATTG CGTTGGAAGT CATGCAAGAA TTCGGGGTGA AACACATTGT TTTTTCTTCC 1980

ACAGCGGCCA CTTATGGCGA ACCAAAAGCA ATGCCTATTA CAGAAGAAAC CCCAACGAAT 2040

CCTAAAAATC CGTATGGGGA AAGTAAATTA ATGATGGAAA AAATCATGAA ATGGTGCGAC 2100

AACGCTTATG AAATGAAATA TGTTGCTTTG CGTTAtTTTA ATGTtGCAGG AGCaAAAAAA 2160

GATGCCTCaA TTGGTGAGGA TCACACGCCA GAAACGCATA TTGTGcCAAT TATTTtACaA 2220

GTGGcATTAG GCCAACGAGC AGAGCTAAGT ATTTTtGGGG rtGATtATGA TACACCAGAT 2280

GGCACGTGCA wTCGAGATtA TGTTTACATT GAAGACTTGA TTGCAGCACA TATTTTGGCT 2340

TTAGAATACT TGAAAAATGG TGGCGAAAGT GACGTCTTTA ACCTGGGTAG CAACAACGGC 2400

TATTCTGTTA AAG 2413



(2) INFORMATION FOR SEQ ID NO: 667: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667:

GCCGnACCCT TAAATCCGGC CAAATGGTAT AATTGGTCCn CTGGrTTmCC CAatArCCcT 60 

cTTCtCyTCt GTCaCAGtGC TTCCgGCTCC aCTTGGgCAT cgCTTaGCTA ATTTTTTCCC 120 

aTAAGCAGTT TGACTTTTAG CATTTTTCTT TGCGGCATTC AAACGGGAAC GCGAACGTAA 180 

ATCTAACGCA TCTTCGCCAA AACGTTCTAA GTCTGGTTTG ATTTGTTGAT ATTGTCGTCT 240

AGTTAAAGGA ATCTCAGGTT CTGAGGTTGG AATATTAAAT TTCCGCACCG TTGGATCTTC 300 

TTCCGTATCT TTCACTTGAA AAGCCGTTGG TTCATCATCA AGAATGACAT ATGAACGACG 360 

ATCTTTTTTC ATTGCTTTTA GCAACTCTTC TTCAGTAAAA CTAGGTTCCT GTTCCTCAGG 420 

AATCACAGAA GCCGGGaTAT ATTTAGGaAC AAAATAAGAA GGtGtCGTTT TTTGCACTAA 480 

TGGTGATGCG GCAGGtCTAC TTGGTTTCGC TaCGaCTTCt TTTtTTTTAt AcGTTTTGGT 540

GGTCCCAG 548 



(2) INFORMATION FOR SEQ ID NO: 668: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668: 



TATGATGTGT GGACTTAGTG GTGGGTATGC TTCGGTAGAA GCCAAAATTG ACTTTTkGAG 60

AATaCGATTT AAGACATTrG ATGTTCGGAC AGTTGTTGAA AAGTTGTTGC ATATGGATAT 120

GAACTGGTTT ACTTATGaAC CTAGAGGGTT TTATCATTAT ACTGAAACAT TTTCGTATAG 180

CTCTATTCGG ATTTTTCGGA ATCCaGAAAA CGTGAATATG GGAATTATGT TGGACTTGTC 240

TGGCGAAGGT TGTCGTCAAT TAGAAAAAAT TTTTGAAGAA GACAACAAGC GATCGTGGAC 300

AGAATTTTTT CGCTCACTCT ATGATGATGA TATTTTCGGG CAAGGTGTTT TAGTTGATAC 360

AAAAATAACT CGAATTGATA TTGCTCTTGA TGAATTGATT GTGAAAGAAC AAGAAAATTT 420

TGATTTATAT GTTTTGAAAG AAAAAATGGA ACAAGGGCTT GTGGATACCA CTTTTAAGAA 480

TTTTGATTTT AGCGGGGGCT TTGCTTATGA AAATAAAAAA ATGGTAAATA AAGGTTTGTC 540

TCTTTATTTT GGTAGTCGTC AATCGCCATT GTATTTTAAC TTTTACCAAA AAGATTATGA 600

GTTAGCACGA AAAGAAAGTA TAAGTGTTGA GGAAGCAAGA GAAAAACATG AAATTAAAAA 660

TCGGTACGAA ATTCGCCTTT CTGATGAAAA GGCGTTTTTG TTTGTAGAAT ATTTTCTTTC 720

TAGTGGTGAA ACGTTGGATT GGCTAGTCAA GGAAATTATT AATCAATCTT TGACTGTCTA 780

TGACATTGAA GATGATATGA AAGTCTATTG CAAAGCTTGG TATGATGTTG TTGATAAATT 840

GGAAGGTTTA AAATTGTCTG TTCAAGGTGA AAAACCGTCT ATTGAGAAAA CACTTCGGTG 900

GTTATCAAAT TATTTAGCCC CTTCATTAAA GATGATTAAA GAAATTGATC ATTTGTTAGG 960

AACAAATGAG CTGATGGAAC GTATCGATTT AGCAGAATTA AAAGAGAAAC ATGAAGAAAT 1020

AATTGAAATG GTCTGTGTTG ATGCGAAAGA TTTATTGTTT ACCACTAATC AAAATAGCAG 1080

TGTTCGATCT TATATGGAAC AAGAATTTGA TTTGGAGGAA GTATATCCAT TTTAAAGATT 1140

AAGGGGGATA TGTATACACT TGATTCGTTA TTTTAAATAG ATATGAGTTT AGTTGAAAGG 1200

AATGTATGAA TTATGAGTAG AGAATTTATT GAACGCAATA CTAAAGTAGC AATTTCTATT 1260

ACTGAAAAAA TGAAAAAAGG TAAAAATGAT TTACAAAAAA CAAAAGAAAA AATTGTTCAA 1320

TTAGATGAAC AAGGGGAATT GaCTaTTcCt TATTTAAAGA TTACTTTTGA AAAGTTTTCA 1380

GAATCAAATG AAGAGCTTTT AAAAGAAATT TCAAGATATG AATATACTTA TGTGGTTCAT 1440

GAAGCAGAAA TGGCGGTTAA AGAGAAAGCA ATTTGGGAAG AATTTTTCAG TATAAAAAAA 1500

TTGTATGATA AAGAGCTTTC AGAATTTGCC AGTTTTAAAG AGAAGTATAA ATATTTTGAG 1560

CCTAAAAATA GTGAAGAATT AAAAAAACAA GCTCGAGTGT TGCTAGAAAA GAAAGGGTAT 1620

ATTGTTGACA GTCCATTTGA AGGAGATTTT GAmaGATGGA TTGGCGTTTA TGCTAGACCT 1680

AAAGATAAAC CAACTTATTT AGATCCAACG GATGGAGAAG AAGCTGGATT ACAAGAATTG 1740

TACAGTGTAG ATGGATTTAA GCAAGACTTT GCAGAATGGT TTGAGTTTGA AGTTGTGGAA 1800

GGGaAATTAA AGGAGGACAT TTTGTAGTGA AAAAGATTTG GCAATTGGTA TGTTTAGAAA 1860

ATAATTATGA TTCACACAGC TCTTATTTTC GGGrTATGGT AAGATATTTT TCCTATACAC 1920

TGTTAGCACT ACTTTTATyC yTTTtAGGTT kGwATTTtAm CATAGGTTGG GTAAGTGGTT 1980

TCTGTTTTGT AGGTTTGATA CTTTTACCAT ATATGGGCTT TTTATCTCGT GATAATTATA 2040

GAGAATTTGC ACCGAGAGTA TGGAAAATAA GAGATGTTCG AGAGTTAGAA GAATTGTAAT 2100

CCTGTAGTTG TTGATAGAG 2119
(2) INFORMATION FOR SEQ ID NO: 669:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669: 

AAAGATTGAC ATGTGCAACA ACCAATTGTA ATTGTTCTTT TTCTGCCACC TGTTCCATTA 60 

CGTCAAAAGG GCCATGGAGT CTACACCACC AGAAACGGCG ACCACGATTG TTTGATGCGG 120 

TTGCCAAAAT CcTTGTTCTk TTCCTCGTTT ATAAAACTGC CAAAACATTG TTCGCCCTTC 180 

TTTCTTCCTT AGAATAAAAG AGGAACCGAG AAATAACCCA TTCGTTATCC CCAGTCCCCC 240

TATTTGTTTG TGCTTAGCCG CGACGTCCGC CACGTCCGCC ACGTTTGCCT TCAGTATTAC 300 

GTTTTAATGA AGATAAACGG TCATCACTAT CTTTCAAGAA AGAGCTCATC AATGAGTCAA 360 

AATCTTGTTT ATTGTTATTG TTATTTGCTT GTGGTTTAtT GAAAGGCTTT TTGCCCCCTG 420

CGCGTGGAnT ACGGTCACGA TTCTCATAAT TGTTTTCACG ACGTTGAAAT TCACGTTTTG 480

GTTGTTGCCC TTCTGCTGGC TCTTGTGCTT TACGGATCGA TAAGCCAATT TTGCCGTCGT 540

CACCAACTGA GGTTACTTTC ACTGTtAmTT CGTCACCTAC TGTTAATACA TCATGAATAT 600


PTTTAACGAA 


TPPATTAGAC 


ACTTCACTGA 


TGTGTACTAA 


CCCAGTTTTT 


CCTTCTCCTA 


660 


2i a TP A ATA A A 


T^PAPCAAAG 


TTAGTAATCC 

X X Hv A **** A V#* v 


CTGACACTTT 


TCCTGGCAAT 


TTAGCTCCTA 


720 


PTTPPATTf^A 
^11 \^\3r\ X X On 


PATAAAAAAA 


TTGTTCCTCC 


TATTTTTTCC 


CTAATTATAA 


TTGCAGTTTA 


780 


TTrarPT^Q a 




TTTCAGTTGA 


AGACTGTGAT 


TGACTATTTT 


TTTCTGAACT 


840 


(jt- iL 1 1 ol— i 1 


TPTTPPPPTP 


PAGAAGTCTG 


TGTTTTTTTC 

X \J X X X X X X X 


ATTTGTTCTG 


GTGTGGGATA 


900 






apagaaggaa 


ACGGCTACGT 


GCTAATTTTG 


CCACATAATC 


960 


ZiTf fiTPTTTT 


AAPA AAPPAA 


PPTPTTTTTT 

VJ X^oXXXXXX 


TAGATCCTTG 


ACCTTCTTAT 


CAACTGCTAC 


1020 


bun 1 i LHU^n 


ATTPPATPTP 


PTPPTTPTTfi 


TTTGAAAGCA 


CCTAAGCGAT 


GATAATCTTT 


1080 


r* a tp a a t T PfZ 


ATPPPAPA A A 


APPPAAAAAT 


AATGAAGGCC 


ACGAGAAAAA 


TCGCTGCCAA 


1140 


b^uL. Jl on 1 ort 


aTPPPPTAAT 


PATTTTT ATP 

\J{ \ X X X X X / i X ^ 


AAAATPATTA 


AGTTAAGGTA 


G AT AC AC AT C 


1200 


TTPTP AT ATP 


ATPAAATGGT 


TTCGCCAAAA 


ATCAATAATC 


AGACAACAAA 


ATGTGCGAAC 


1260 


TCGATATTTT ACACGACTCT CTTTACCAAT TCTGCCCCGA ATTACACTTA AAACGACTCA 1320

ACAGCTTAAC GTTGGCTTGC CACGCmTTAC TTGACTGTAA AACTCTCACT CTTACCGAAC 1380

TTGGCCGTAA CCTGCCAACC AAAGCGAGAA CAAAACATAA CATCAAACGA ATCGACCGAT 1440

TGTTAGGTAA TCGTCACCTC CACAAAGAGC GACTCGCTGT ATACCGTTGG CATGCTAGCT 1500

TTATCTGTTC GGGCAATACG ATGCcCATTG TACTgTTGAC TGGnCTGAAA TTCGTGAGCA 1560

AAAACGACTT AAGGTATTGC GAGCTCn 1587



(2) INFORMATION FOR SEQ ID NO: 670: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670:

CAATATGATA ATCATACAAA AGAAGaACTT AGTGAAATGA ATACAGAAGT CAGCGTTGCt 60 

GgACGTATGA TGACAAAACG TGGTAAAGGA AAAGCTGGTT TCGCCCATTT GCAAGACCGT 120 

GAAkGCCAAA TTCAAATTTA TGTTCGGAAG GACCAAGTTG GAGACGAAGC CTATGAACTA 180 

TTTAAACATG CTGACTTAGG CGACTTCTTT GGTGTGACAG GTCAAGTAAT GAAAACCAAT 240 

ACGGGGGAAG TTACTGTTAA AGCTCAAACA ATTACGTTAT TAACTAAAGC TTTGCGTCCA 300 

TTGCCAGATA AATACCATGG ATTAACGAAT GTGGAACAAC GCTATCGTCA ACGCTATTTG 360 

GATTTAATCA GTAACAAAGA AAGCTTTGAT CGTTTCATGA AACGCAGTCA AATTATTAGC 420

GAAATCCGTC GTTATCTAGa TGGCAATGGC TATGTCGAAG TAGAAACGCC TGTCTTGCAT 480

AACGAAGCAG GAGGAGCTGC TGCTCGtCCA TTTATCACAC ACCATAATGC GTTAGATATG 540

GACTTATACT TACGCATTGC CCTAGAGTTA CATTTAAAAC GKTTAATTGT CGGtGGCATG 600

GrAAAAGtAT ATGAAATnGG 620

(2) INFORMATION FOR SEQ ID NO: 671: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671: 

CTCTCTCGTC ACTGCGTCTn TTGGATCGAA TAAGTCGTGC GTGTATTTCG TGCGAGCTCC 60 

CnGnCATTCT CACACTAACG CCGTAGTACT GTGTAAGTGT ATGATGACGA GCGCCGATTC 120 

TCCTCTGTAG GAGATCTACC GAATCCACTT AGGCTTCATC GTCTGGTGAA CAAGGTTCAA 180 

AATAGTCATT TTTCAATAAT AAATAATTAA TCAAAAGATT CACCCCGAAT CAACATTTTT 240 

TGTTCTAAAT CCATTATAAT AGAAGTATGA CACAATACGA GAAGAAAAGC TTATAAAAAA 300 

CATTCAAAAA TAAGTGGCCG CGATAGGTGA GGCCTCGTTT TGAAAATGGT TTGATAAGTA 360 

ATCCGTTTTT TATATGAAAA AGCAAGCAAA AAGGGCTTTT AGCACATATA TTTAAACGAA 420 

ATGAAACATC TGTGTACGTT GATGAAGTAT AGTTATAGCA GGTGGAATGA AAGGATTTGT 480 

TGAGAGAATG GTGTCGTTTT CGGCAATAAT GTTATTTTGG TATATTTTTC CGGTAATTGT 540

GCTCTTTGCG TGCAATTTTC TCGTGTCCAC TTTTTCCCTA ACGGAACTTG GGAAAATAAA 600

AGCACCGGAT TTGGCCATTC CGTTTTTATT CATAGGGATC CATGAGTTAT CCAAAGATAG 660

TTATGATGAA TCGATTATGC CGTATTTTGT GATTTCTATT TTGTTATTAG GAGTCGCTGT 720 

TGCCCTGTTT CAAGCTTATT ATTATGGAGA GATTTTATAT AGACGATATT TCCAGAAGTT 780 

TTGGCGGTGA CGTTTTTACn AACGTTTGGG TGGAAGGGGT GCCAACCTGC CAAAAACAGn 840 

A 841 
(2) INFORMATION FOR SEQ ID NO: 672: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672:

ACnAAATGTC TTAAAAAAAT GCTTTTTTCn TAAAAAAATT ATTACAAAAA AAAaCAAGAA 60 

GTCAGCCAAG TATTGGCTGA CTTCTTGTTT TTTTTATTTA ATTATTCAGC TAAACTAGCT 120 

TTTAACGCTT CAACTTTATC TGTTTGTTCC CAAGGTAAAT CAATATCTGT TCGACCAAAA 180 

TGACCATAAG CCGCAGTTTG TTTATAAATT GGGCGACGTA AATCTAGCAT TTCAATAATG 240

CCTGCTGGAC GTAAATCAAA GTTTTTACGA ACTGCTTCGA TTAATTTACT TTCTGGAAGA 300

TCACTCGTAC CAAACGTATT AATAGAAATA GACACTGGTT GCGCCACACC AATTGCATAA 360

GCTAATTGGA CTTCTACTTT TTTCGCCAAA CCAGCAGcAA CAaTGtTTTT CGCGaTATAA 420

CGCGCAnATT AACTAGCAGA GCGGtCAACT TTGGTTGCAT CTTTTCCAGA AAAAGCACCG 480

CCGCCATGAC GAGCATAACC GCCATAAGTA TCAACGATAA TTTTACGACC AGTTAAGCCG 540

GCATCTCCTT GAGGGCCACC GATTACAAAG CGACCAGTTG GGTTGATAAA GTATTTTGTT 600

TCATCATCTA AAAGTTCATG AGGAATCACC TCATTGATTA CTTGTTCTTT AATATCTTTT 660

TCGATTTGTT CTAAGGTTGT TTCATCATCA TGTTGTGTAC TGATGACGAT CGTATCCACA 720

CGTTGTGGTT GACCTTGATC ATCATATTCC ACTGTTACTT GAGATTTTGC ATCAGGACGT 780

AAGTAAGGTA AGACTTTTTC CTTCCGCAAT TCTGCTAAAC GGCGAACCAA ACGATGACTT 840

AACGCGATCG GTAATGGCAT TAATTCCGGT GTTTCGTCAA CAGCAAAACC AAACATTAAG 900

CCTTGGTCAC CAGCGCCAAT ATCGTCTTTT TCATCTTGAT CACGCACTTC TAAGGCTGCG 960

TCAACCCCTT GAGCGATATC AGGTGATgCT CATCGATtGC TACAAGAACT GCgACAGTCT 1020

CTCCATCAAA TCCATATTTT GCTCGTGTAT AACCAATATC TTTCACAGTT TGACGAACAA 1080

TTTTTTGGAT ATCTACATAA GCTGTTGTAG AAATTTCTCC AAATACTAGG ACTAAACCAG 1140

TTGTTACGGa TGTTtCACAT GCCACGCGGG CCATCGGATC TTTCTCCAAA ATCGCATCTA 1200

AAATTGCATC ACTGACTTGG TCAGCTACCT TATCTGGATG TCCTTCTGAA ACGGATTCTG 1260

ATGTAAAtAA ATGTCTnTCT GTCATAATGT TATTCCCCCT AAATAGTTTC GGTTACAAGG 1320

CATCTTTAGT AAGTAACGAC AATTACTTGC TAAATCCTTT CGGGAACTTG TAACAAA 1377



(2) INFORMATION FOR SEQ ID NO: 673: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673: 

TGTTAACGTG aACTATTATT CATCGTTTTA AAACGTGTTT CCATTAACTC CACTGTATTT 60 

TCCAAGCATT GaGTAATATC TTTTTTTCCA GTATGAACTT TTCCTGCAAA TAGCGGTAAC 120 

TTGCCTAAAG CCATCAAGTC AGAATTTTTC GGGTCACAGA TTyCmAsmTs ACCcACTCGA 180 

CAAAGCGCAT AGATCAGAGA CAAAATTGTA AAAGTTTTAC CTCCACCaGT GCCTCCACCT 240

AATAACAAAT GTGGATCTTT ATCaAAGTGC CAATACAACC CTTTCATCAA CTGAACTTTG 300 

CCTtCGTCTG CTTGCATATC CTTAATCCAA ATACGCTTAT TGTACACATC TGTTGCTAAT 360 

TCATAGGTAA CAAAACCTTT TTCATCAATC TTATTTAAAT TATCTGCATG AAACGkTGkT 420

TCCAAAAAAC CACCAATGGT TTCAAATTTT TCCTGAAATT TATTACCCGC TGTTTCAAAT 480

GAAACAAAAA TACTATTTTT TGCAGACTTA TAATAAATTT TTGGAAACTT GATTTT 536



(2) INFORMATION FOR SEQ ID NO: 674: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1079 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674: 



CGATTGACTT TCTAACACTA CTTCCATTAT AATTAGCTTT GTTGTGTTAC AACTATTGTC 60

ATAGTAGCTC AGCAGGATAG AGCATCCGCC TTCTAAGCGG ACGGTCGGGG GTTCGAATCC 120

CTCCTGTGAC GTAACGATAA CAAAAGCGCC TCTTAGAAAC CTTGGTTTCT AAGAGGCGCT 180

TTTGTTATCC CAAGTTCATC CTAACCTAGT TTAACTTTCT ACCAACTTGA AGTACCTCCA 240

CCTCCGCCAA AACTCCCACC TCCGAAACTG TCACCGGAGC CATCATCAGA AGAACCCTCG 300

TTGCTTGTAA CCCACCAATC ATGACGATTC GCATACATAC TGGTCTTTTT TAGTTGCCTG 360

TCAGAACGTG AACTTGCTAG AAATTTAGCA ATAGACACAT TTCCATAAGC AAAACGCATA 420

TCATAGTATT CAGGATACGT AnTCTTAATA AATCTTATCG GGTAGAAGGA TTTTGAAGCT 480

TTTTTGTTGG TAATCTTTTT TtCTCTAATT TTTTtCCGCA AtAATTTTTT TTCAATTTwA 540

TATkGCtGGT TGTCtAAATC GTTTGCyAAT TGTTTtAATG ATTTTGATTT TTCTGAACTT 600

ACTTGCTGGA TTTCTTTTAA AAACCTACGG ATCATAGAAT CCAGGTGCAA TCTATTTTGT 660

AAGCCAACAC CTATTGATTT TATATAACCA AATAAGAAAA AGACCGGAAT CAAAAAATAG 720

ACCAGACTAA CCATGGGTAC TAAAGAGTCT TCGTCTTCTA AAAATGTCAG AATCTCTGTG 780

ATTAGTGTGA AAACTGTCGA AGGaAACGAC TGCyCTGATA AACTAACTTG AACATAAATA 840

ATCAGAGTCA CGGATACCAA AAAAGGATGT TTTAATAAAA TCGAAAGAGG AAACAAAATA 900

ATTTTTAGGT AAGCAGATAT TAAGTTATCA TTAATTCCAC TAATCCGACA CGCGGGCAGT 960

TTCTTTTCTG GGAACTTTTT CCCAATATAC TTCACCTATC CAAAAACGCG GGTAAGTAGG 1020

ATCCATGGGT TnGATTCACC CTTCCCAAAA AnnAGGCACA CATAGAAGAG GCTTCnTTA 1079



(2) INFORMATION FOR SEQ ID NO: 675: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1090 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675: 

GTATGAATTG GAAGACTTAA TGAAACGGAT TCCACCAATG TATGAAGAAT TAAATGAAAC 60 

TTTCCCAGAT CAATTAAAAG AAATCGAAGA AGGCTACAAC CAATTATTAG CTGACGATTA 120 

TGTGTTCCCT GAACAAAACT TTGCTGAAGA AATTCAACAT GCGAAAAAAC GTGTCGAGAA 180 

TTCAATGGCT GATTTAGAGA AAACAGAGAT TGCTGCAGTA GAGGTCGCAA ATCGCGACAC 240

AGCAACGGCG ATTGATGCAT TATATGAAGT GATGGAACGT GAAATTGAAG CGAAGAAATA 300

TGTCGTGACC AaTCAAAAAA TTATTGACGA TTATATTTCT CATTCCTTAA AAAATAACCG 360

TCAATTAATG ATTGAATTGG aCCATGTATC ACAAAGCTAC ACCTTAAACA ACAACGAGCT 420

AGGACGTAGT CGTGGCTTCC AAACGGAAAT TGAAGAAATT ATCCGCCGTC AAAAAGACTT 480

GGAACCTCGA ATGAAAGAAC ATACCGTTCC TTACTCTGAA ATTCAAGCGT TCTATAAAGA 540

ATGCTATAAG ATTTTAGATG ATATTGAAAA TCAACAATTA GAAATTGATG CTTCATTGAA 600

AGAACTAAGA AAAGGCGAAA AGGTCGCCCA AGAAAAAGTC GACGAATACG AATTCCGTTT 660

ACGGAGTATC AAACGATATG TTGAAAAGCA ACGATTACCA GGTTTGTCTG CTGATTACTT 720


Ab AA J. I 1 1 1 V- 






THAAGATTTA 


AGCCGAGCCT 


TAAATAAAAT 


780 


GCGCATTAAC ATGGATGAAA TCAATCGTTT GTGTGACTTA TGCGAAGATG ATTTAGAGTT 840

ATTAGATAAG AAAACAAAAG ATTTAGTCmA TGCAGCAGCT CTGACAGAAC AAATGATGCA 900

ATATGCAAAC CGCTmtCGTC ATACTCACGA GAACATCCGA GCGGCTTTAG ATAAAAGCAT 960

GTATTTATTC TCGACAGrAT TCCGTTATCA AGATGCATTA GATGAAATTG GAACGGCATT 1020

GGAAGCTGTT GAGCCAGGGG CGTTCAAACG AATCGAAGaT TTTTATTTTA AAAATAnCAA 1080

TAACCCTAAT 1090

(2) INFORMATION FOR SEQ ID NO: 676:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676:

TATCTTCTTT TTCCGTTATT GAAATAATTA ATAGCGTTAA TAAAAATAAA GGAATAAAGA 60 

GGAAGTGCAT AACCATTTGA GTAACAACTA TCGATACAGT AATCGTGTGC TTGTCAAATA 120 

GCTGAACTAG TATTGGATTA AACCAAACAA TGGCGACTAT TACTAACGAA AAATATTTAG 180 

TCACGCTTTC CAGCCTTATT ACATACAACA AGCCAAAGAA AATTAGTAAA CTAATAATGA 240 

TATCTTTATA AGTAAAAAAA TGTTCATAAT TCATAATTTC TTTCGTTTCA TCTCCACTCA 300 

ACTTGCTTAA CTATTCAAGG TTTAGGTAAA TCACGTTTCC CATTTTCTCA CATGATCGTT 360 

CAAAGAAACA GAACGTTGTT TGGTGTTTTA AAACGTAAAA GTTGAGCAAC ATATCCTGTC 420 

TAAACATTTA CTTGGCAGTT TATTTATAGA AAAATTGTTA CTTTCCAAAT TTTTGTATGC 480 

TTTTTAGAAA TGAAACCAAG CTTATGTTTT AATGCTTCTT ATGCAAA 527 
(2) INFORMATION FOR SEQ ID NO: 677: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1493 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677:



ACACGGATnA TTTATTGATC CCTAGGTCAT TnGGCGGGAT TCAAATTCCT TCGCATAAGC 60

CAGGCCaTTA AAtTmCTCGt GGtCaTAwGa cyCCtTaCaA TATwCyTTTA AAAAaGcTTT 120

TATCaTCTGG TGTCATAATk GTGtTATTTC atATCTCGAC TTAATAGkTT CtCAAAgCAG 180

ATCGCTCTTC aTCAGTGACA CTAATATTgG TCTTAAAACA GTAGTATTAT TACGATTTGA 240

CATATTACCA ACATTGATtC TTTTTATCTC TAAATCATTT TCAATAAATG GTAATAAATC 300

TACGGGTGAT TTGACAATCA TCAGAACATT TTGTCCTTCA TATTTGCCAT TTTTAATATT 360

ACTAATAGCT GTTTCTTCAG TTATGATAGA CGAAGAAATT CCAGCAGGAA CTGCCATTCT 420

TAAAACCTGC TTTTGTAAAT CATTTTTAGC AACGTTGTCG TTCACTACCA TTAAACGTGT 480

TACTCCCAAT TTATTACTCC ACATTGTAGC TACTTGACCA TGGATCAAGC GATCATCAAT 540

TCTAATATTG TTAATACCTT TCATTTCCAT CGAAAATGCC TCCCATAAAT TATTTTATTT 600

TAAAATACCT AGAGCCGCTA AAACAATTGA CAATACCATA ACACCAAGAA TTAAGTAAAT 660

AGGTTTAACG TTCTTCTTTA ATGCCCAGAA AACTAAACCA ACTAATAATG CAGGCAATAA 720

TCCAGGCATT ATTTGATCCA AAATATCTtG ACCTTTCATT GAAAAATTAC CCTGTTTAAA 780

GTTTAAAGTA ATATTAGCTT TAACGACTGA AGGTATTAAT GCCCCCACGA CCATTAATCC 840

TAAAATCGAa GTTGACTCAG TTAATAATTT CAAACTTTTT TCTATAGTAG CAATTAATTT 900

TGTTCCTGAG GCATATCCTG CATTAAATAA TGGAATACTA ACTAATTTTa GCGCTACACT 960

TACAACAATC CATAACAAAA TTCCAACTGG ACTACCATCA ATGGCCATAT TTGCTGCAAT 1020


AGCACCAAAT 


ATAGTCCaAG 


GAATAACTAC 


TV 7\ 7\ m 7\ 7\ "A rp 7\ 

AAAT AAAC T A 


TCACCCAAAC 




i n q n 


GCCCATAAGG CCTGTCTTAA TAGATGTTAC AGCCTCTTCT GTGTTTTCGT CAGCATTCTC 1140

TTCCATAGCT ACATTAATAC CTAAAATGTA AGGTGCCACC CAAGGATTAG TATTAAAAAA 1200

TTGTAAATGA TTTAATACCG CTTTTTGTAA TGCTTCTGGT TTATTTTTAT ATGCTATTTT 1260

TAAAAATGGT AATACTGCAT AACAATATGC TAGCGCTTGC ATCTTTTCAT AATTAGAAGA 1320

AACTGAGCTT GTAAAAAACC ATCTCCACAA AaCACTTCGT TTAGTTTTTG TATCAATGCG 1380

TAATTTTAAT TGTTCGATAT TATTCATCGT CATCCATCCC CCCTGCTATA TTAATCACAG 1440

TATTCTTTAC TAAATCTTGA CTATTTTTTT TATAAATTAT TAATGCAATA GCT 1493



(2) INFORMATION FOR SEQ ID NO: 678:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678: 




CCAATGGATA ATCCTCCTTT AATTTATATA AACTTATTTT TCATTTCCTG TTAGTAGGTC 60 

TATCATATCT CTTTTGTACC AAAAAAGCAA GAAAAAACCG AACAAACATT CGCATATTTA 120

TTTTTTATTA ATGATTCGGC GCCGCAAAAG ATCTAATCCT CGCATAACCG CACTCTGTCT 180

AATATAATTT CGGTCACGGT TAAAATGGAA ACATTCTGCA ACGGTTGGTT GGCCTTCTTC 240

CGCCAGCCCA ATCCAAACAG TGCCTGTCGG TTGACCTTCG AGTGGGTCAC CAGCGACCCC 300

AGTAAATGAT AAACCATAGT TAGATTTAGC TAACTGACGC GCTTTCTCTG CCATTTCTTT 360

GGCACACGCT TCACTTACCG TGCCATGTTC TTCTAAAAGT TCATGAGAAA TTCCTAAAAA 420

GTTCTCTTTT GTTTCTTSCG AATACGTCAC GAAGCCTCCC TTAAAAATCT TCGAAGCACC 480

CGCAATTTCC CCCAGTGTAC TTTGGGAtAG GCCAGCGGTG AGGCTCTCAG CAGCCGTAAC 540

GGGTTTGnCC CGTTTTGGTA nTAACAAATC nACGGTAAAC nT 582 
(2) INFORMATION FOR SEQ ID NO: 679:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679: 



CTGGTTCGCC ATCTTTCATT GTCATAGTGG CACCAATTCC TTCAAAATCA CCAGATAAGC 60

TTTCGTTTAA ATCATTGGCA GCAGATTCAT TTAGATAAGT TGAGTATGGG TCTCCAATGG 120

CTTCAGACAT GCCTTTTAAG GCGCCTTCAA CTAATTCATT TTTATCTACT TCACCTACAT 180

AATTGGTACT AATTTCATTA TATAAATCCT GCACTTTGCT TAAATCAGCA TTGGTAATTG 240

CCCCTTCTTG ACTCATCTTT TTAACACGAT GATCAAAATA AATATAACtA CTTCCTCCAG 300

CTAAAAATGC GACACAGAGG AGCGAAATAA TATATTGATA AAAAGGAACA GTTCGTTTGT 360

TCTTCATAGa TTTCATCTCA TTTCTAACAA ATTCGTCACT ACTTTTGTTT ACTATAACAT 420

GAATGAAAAA AGAGGAAAAG CTTTGTTTGC TTTTCCTCTG GATTTATTTA TTGTTTTCAA 480

GGTACTTTTC CCATAACTCA TCAAAGATGT CCATGTTGCT TAAATAGTCA GCATTCATTT 540

CTAAATAAGA AGAAAGTTCA TGATAATCCT CGGTTTGTTT TGGAAACTGA ATATCTTTGG 600

CCGCTTCATT GGCAAAATCA GTTTCAGAAT CTTTAGCCGG ACCTTTTAAT GTCATTAAGT 660

AATGGTAAAA GCTTCGTCTC ATGTAAATTA ATCACTCCAT TTTTCTTCAA TAAATTCACT 720

TCTACGTTGA TGGGACAATC CATAACGCAA AGCGTCTTTT TTATAAAAGT CTTGATGGTA 780

TTCTTCAGCA GGATAAAATG GTGCTGCTGG TTCGATGGTG GTAACAATTG GTTCTGTAAA 840

ACGGCCGCTA TTAGCTAGAC GTTCTTTACT TTTCTCAGCA ATTTCTTTTT GTTCTTGAGA 900

ACGATAAAAA ATAACTGGCC GATAATTATC GCCGCGGTCT TGAAACTGGC CGAAAGCATC 960

AGTAGGATCC GTTTGTTGCC AATAraATATC CACTAATTGT TCGTAAGAAA TAATCGCTGG 1020

ATCAAAAGTG ATTTCTACTG CTTCCGTATG GCCTGTTGTG TGTGTTAGAA CCTGTTCATA 1080

TGTTGGATTA GGCACGTGCC CACCTGTATA ACCCGAAACA ACTGAGATAA TCCCTGGTTG 1140

TGTATCAAAA GGCTGCACCA TACACCAGAA ACAGCCGCCT GCAAAAATTG CT 1192
(2) INFORMATION FOR SEQ ID NO: 680: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680: 



GTTATCCACG AGCGCCTTAA AATATGTCCn GGGAATGTGC CCGATTCCCT AGGCATTACA 60

AAAACCAAGC CTGGGAAATG TGCTTTAGCT GCTTATTCCC AAACAGTTTC CTGGAAGGAC 120

TTGTATACGG CGATTATAAA CTGTTTTTTT AAACtAACGG TACATTAGGG ATCGTTGAAT 180

GGTTTTAAAA CAAAGGAGCT ATATTATGAG AAAAAGTATT TTATTCATGT TGGGAATATC 240

TTTGTTCTCA ATGACTGCTT GTTCTAATGG GGATAAAAAA GAAGTAAAAT CAAAAGCATC 300

CTCsTCTACT GTTATCTCTG TCAACAAAGA ATCATCTGGT AaTACGAAGA AAAATGAATC 360

AAATTCAATA ACTAAAACGT CTAACGTTGT TTCTTCTACA TCTGTGGAAA AAACACgtGG 420

AAGTAAGTTT AGCgATTTTA TkGGTtGGGG TATtCraAAGT GGAAaCCTTT TTTTATTAAT 480

GAAGATGGGA CATATTCAAA TGGTCAAGTA GACCATTCTT CATTAATTGc CTTAAATTTA 540

GTATTTTAGC AGAcGGAAGA AAATCAATGA GCTCAAATTT AGGAACTCTA ATTAAAGAAT 600

CTGATGGCAC CTTAACAGAT GGTGAACTAG TCTTTCAACC ACTTGAGTTT AGTAACAAAG 660

AAGATTTCCT TGCAGATAAA CAAAAAGAAT ACAAAAATTC TCCCGAAACT AATGCACCTG 720

AGAACGAaGT TGAAATTACT AATACTATAG rAGATATCAC TAACGGAAGT ATCTCTACTG 780

ACACAAATAC GCTCACAGGA TTCTTAAATG TGTATGGTAT GAGTCCTGCA GCTTATAAAG 840

TTACTATTGA AGGAATGAGT GAAGAAGAAG CTTTAAGAAA CACACCTAAA GAAATGAAAA 900

CTTCTGCTGA AATACAATTA GGTATCTCTA AATATGGTAT TCAATAATTT GTCTACGATT 960

TATAATTAAC ATTAGCCATA AAAAATACAC ATAGGAsrCT AAAAAATGAG TGCATTTTTA 1020

GGTCTATCAT CTTTATTAGG ATTATTCGTA ACTTCTGTAT ATTTTGTTTA CTGTATTTTT 1080

AAGAAGAGAA ATAATTTATC CTATTTTGTA TGGATTTATA TGATTCTTAT TGGATTATTT 1140

TTCmCTGCGT TAGAGTTATT TATTCCAGGT ACTATTTTTC TTACTATTTC ATTGTATAAA 1200

ATATTAAAAG AkCCAAAAAC AAAGGaATTG cTACATGaGC aAGaAATAAA AcgAACmGaA 1260

AAAAtAATAG aAAAACAAAA CCCTATTCAA GAATTCACAG AAAGTAATTC AAACGAAAAA 1320

GATCTAAATC AGAAAGTTGC TTCAACATAT TCTAACGATT ATTATCCCTC GGATAAAAAA 1380

GcTGGGCTAT TCCATCTTGG CGTCTATTGT CCTTATTGTA GAAGCCTTAA CGTCCAATAT 1440

1442

(2) INFORMATION FOR SEQ ID NO: 681:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681:



TTGCTTTACG 


CGCCGGCATG 


GACGGGCGGC 


TTGCTCCGTG 


CGGAAGCnAG 


CGACGACTTT 


60 


TCCGGCGTGG 


CCGCAATCTA 


TATTGACGGC 


GAGAAATACn 


ACCGATTTTA 


ACCAACGGCA 


120 


CACTGGACAT 


ACCGCTGCGG 


GATTTGCCGG 


ATGATTACGA 


ACAGCTTTCC 


GTGCAGGCGG 


180 


TGGACGCTGC 


GGGAAATAAA 


TCAAAAATCG 


TGCAGGTCAA 


GAACCCCAAC 


TATCAGGCGG 


240 


AGGATAACAA 


GCAGAAGCCC 


TCCACCGGCC 


AGACACAGCC 


CCCGGCGACC 


ACCGCGCCGG 


300 


TTACATCCAC 


CACGCCTGCG 


CCAACGACAC 


CGGCCACCAG 


CGCAACCACA 


CCCCCGACTA 


360 


CACCGACCAC 


CAATCCGGCA 


ACAACGACTG 


CCACAAAACC 


GGCTGCCGGC 


ACGGGCAGCG 


420 


CAAAGCCCTC 


CGCTTCGGCA AGCGCAgCCC 


gGACGAAGAA 


ACGGACACCA 


CGCCCCGCGA 


480 


TCCGATCCCT 


TTGACCCCGG 


ACGGCCAGGC 


TACCGTTCTG 


GACAACGCCA 


CCGGTGAGGA 


540 


CGGTAAGGAG 


TTTTATACCA 


TCCAGACACC 


GGATGAAAAC 


GTATTCTACC 


TGATTATCGA 


600 


CAACCAGCGC 


GATACGGAAA 


ACGTGTATTT 


CCTCAATGCT 


GTCACCGAAG 


CCGACCTGAT 


660 


GGCTCTTGCA 


GTAAAAGAGG 


ACGATACGCC 


GC AG AC AG AC 


GCAATCCCTG 


ACCCGGAGCC 


720 


GGCCTGCATC 


TGTACGGAAC 


AGTGCGCCGC 


CGGCGAGGTC 


AACACCGCCT 


GCCCGGTCTG 


780 


CACCCTGACC 


CGGAAAGACT 


GTACCGGAAA 


AGCCCCCGTG 


ACCGACACGG 


AAACCGAACC 


840 


CGACGAAAAG 


CCGGAAAAAA 


CGAAAAGCGG 


CGGTTCCGGC 


ACACTGATTT 


TAGTGCTGCT 


900 


TGTGGCGCTG 


GCGGCTGGCG 


GCGCTGGCTA 


CTACTTCAAA 


ATTTACAAGC 


CCAAAAAGGA 


960 


TTTGGACGAT 


GCAGAGGACT 


TTGACGAACT 


GACCGGCGGG 


GACGAGGAGG 


AAACGGTCAA 


1020 


CGAGGACGAG 


GATGTGGACG 


CGCAGGAAGA 


AACCCGGCAG 


ACGGGCGATG 


AGGAACCGGC 


1080 


AGCCTATGAC 


GAGCCGGAGG 


AACCGGACTA 


TCCTGAAAAC 


TATGGCGGGG 


AGGACGACTG 


1140 


ATGCGCTTTA 


CGGACAGCCC 


TTTTGAGCGG 


ATGATGACCC 


AAAGGCCGGC 


CCCCAGCCGC 


1200 


GCCGCGCCTG 


CCTCCCCCGT 


CCATCCGCCC 


GGACACCCTT 


GCCACCGCTG 


TCCcTACGGG 


1260 


CTAAATGCGC 


CCTGCGTGGG 


GATATGCTAC 


AAAAATCTGA 


AAAAGGAGCG 


TGAGAAAAAT 


1320 


GCAGTTAGTG 


ATCGCGGAAA 


AACCGTCGGT 


CGGAATGGCG 


CTGTCTAAGG 


TATTAGGCGC 


1380 


GAAAAGCCGT 


GGCGATGGTT 


ACATGGAGGG 


CGGCGGCTGG 


CTGGTAAGCT 


GGTGCATCGG 


1440 


GCATCTGGTG 


■ GAACTTGCCC 


CCGCCGATGC 


CTATGACCCC 


CGCTATTCCA 


AATGGAACTA 


1500 


TGCCGACCTG 


CCGATTTTAC 


CAAAACCTTG 


GCAGTTCCAG 


GTGCTGCCGG 


ACACCAAAAA 


1560 


ACAGTTTGAA 


GTCTTAAAAA 


CCCTGATACA 


CCGGGCCGAT 


GTCACCGCCT 


TAATCTGTGC 


1620 
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AACAGACGCA GGACGCGAGG 
CACCAAACCC GTCAAGCGGC 
CTTCGACAAC CTGCTGGACG 
CGCCAAGGCG GACTGGCTGA 
GGGCAAGTCC CTGAACGTGG 
GGAAGCCGCT ATTTCCGGCT 
GGGCTTTCGC GCCGCCAGTG 
CGGCTGTGTG GGCAAGCCTG 
CCCGCCCAAG CTGTACGATT 
CACCGCCCAG CAGACGCTTG 
CCCCCGCACG GACAGCCGGT 
CCAGTCCGTC GCTGCGGCGC 
GCAGGTaATC GACAACAGCA 
GGCAAAAACC GACCTGTCCG 
CGCCCGTCTG CTGTGCGCGG 
GGAATGTGGC GGCGCTGTCT 
GGCTATGGAG CAGGCTTTCC 
CCCGCCACTT CCTGCACTCA 
AGAGGGCGCG ACCTCCCCAC 
ACACGCCAGT GCAAAGGAGT 
TCCGGCCAcC CGTGCCGGCG 
GAAAAAGCAG CTTTTCCCCA 
GCTGAAATCC GTCAAGCTGA 
GGAACTCTCC CCGGAGGACT 
AAGCTATCAG GGCGTGACCG 
TGGTATCTGC CCCCGCTGCG 
GGGCTGGaAC gTGCGCCCGC 
AG C a AG C AC A AGGAACTGAC 
GCCATGACCG GCCTGTTTTC 
GAGGACACCG GAGAGAAATT. 
AAGAAAGGAA- ACTGACGCTT 
ACCTCCTGAC CATCAGCGGC 
TGCCGGAGAT CAACCGCTGG 
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GGGAACTCAT TTTCCGCCTG ACTTACCATC TGTGCGGCTG 1680 

TGTGGATTTC CTCAATGGAG GAAGCCGCAA TCCAGAAAGG 174 0 

GGAAGAACTA CGACAATCTC TACGCCGCCG CTCTCTGCCG 1800 

TCGGCATCAA CGGTACGCGC CTGTTTACCA CACTGTACCA 18 60 

GCCGGGTGCA GACCCCGACG CTGGCACTGC TGGCAGAACG 1920 

TTCAGAAAGA GAAATTTTAC ACGGTGGAGT TGGAACTGGA 1980 

AACGCTTTGC ATCCAAAACG GACGCAGGCA GGCTGCGTAC 204 0 

CAACGGTACA GACGGTTTCC CAAAAGGAGA AAACCGAGCG 2100 

TGACTACCCT CCAACGGGAG GCCAACCGGC TGTTTGACTA 2160 

ATTATCTGCA AGCCCTCTAT GAAAAGCGGC TGGCGACCTA 2220 

ACTTGACCGA GGATATGGCG GCGGGGCTTC CCGCGCTGTG 2280 

TGCCCTTTGC GGCAGATTTG TCCTTGCCGG TAAATGCCGC 234 0 

AAGTCACCGA CCATCATGCC ATCCTGCCCA CGACAGAAGC '2400 

CGCTGCCCTC CGGGGAGAAA AACATCCTCT TTCTGGTTGC 24 60 

TAGGCGAACC CCATACCTAT GCGGAAACCG CTGTCACACT 2520 

TTTCCGCAAA AGGCAAAACG GAAACCGCCG CAGGCTGGAC 2580 

ATGCCACACT GAAAAAGAAA CCGAAGCAGC CCGAACCCTC 264 0 

CCAAGGGCCA GAAGCTGACC GCCGGGGGTG CATCCATCAA 2700 

CGGCCCATTT TACCGAGGAC ACGCTTCTCT CCGCAATGGA 27 60 

TTGCGAAACT GGAAGATGTG GAACACAAAG GACTTGGCAC 2820 

TGATTGAAAA GCTGGTGCGC TCCGGCTTTG TGGAACGGAA 2880 

CGGAAAAAGG CACGGAACTT ATCAAGGTAA TGCCTGATAC 294 0 

CCGCTGAATG GGAGGAACGG CTGGGCGCGG TGGAGCGCGG 3000 

TTATGGCGGG CATTACCGCC ATGCTCACCG AACTGGTAAA 3060 

TGGCATCTGC CGCCCTGTCT GCATCCGGGC GCACGGTAAT 3120 

GCAAAAACGT GGTGGAGGGC AAAAAGAGCT TTTTCTGCGA 3180 

CTGCGACTTT GCCCTGTGGA AGAACGACCG CTTTTTTACC 324 0 

CCGcAAgGCT GCCGCCGCCC TGCTGAAAAA AGGGCGCGTT 3300 

CGAGAAAAAA GGCGTCCTCT ATGATGCAAT TGTTGTGATG 3360 

TGTTCGCTTT AAGCTGGAAT TTGACAATAA CACACCTAAA 34 20 

ATGGAAAAGA ACATGAACGC ATTTAACCGC AACCTGCACA 34 80 

ACGGATGCCG CTGAACTGGC GCAGACCCTT AACGTGGAAC 354 0 

CTgmAacsTC CGCCGCGCCG GATGTGTACC AGCTTCAGGA 3600 

AATCGCCAAG CGGTTCGGGA TGCCCTACTC CTACTTTTTT GAGAGTGAGC AGAGCCTGCC 3660 

CAATGTGCCG AAAGTAGCGG CATG_G£TGGG CTTGTCGGAG GAAACGGTGG AAACCCTGCT 3720 

GGCAATGGCC GATACCGAGC CGGAGGATGT CATGGACGCG CTGGACGATG CCATTTATGC 3780 

AATGGCTACC GCTGTCAGTG CTGCGGGGGA GGTGGATACG GAATGAAAAA AGAAGAAGCG 3840 

AAAGACATCC CTATCGGCGG AGAAATCGCT GTCCGGCAAA CTGGCGTCCC CTATGGCTGG 3900 

CTCTACCGGC GCACGGAAC 3919 
(2) INFORMATION FOR SEQ ID NO: 682: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682: 



GGTCCGTCCT TGATATTGAA CGGTTAACGC TTGTAACAGC TAAGTTAAAG CTTTTTCTCA 60 

TGAAATTCCT CCTTTTTTAG GTTTGCCTTA AAATCAGTAT AAGGGATAGT TTAAGAAAAT 120 

GCAAATAATT TTTAGGCGCA CCTAAATACA AGCGCTTTTT TCACAAAGAA AACCGTTTCT 180 

TTCAATAAAT TAAGGAGGAT CCCATGATTT TTATGACAAA AAATGGTAGA ATATGCATGA 240 

AAACGACTAT TATTTTATAT TGACGGAGGA TATCATGACC GAAAACTATC AAGATGGACA 300 

CCTTACGCAT CTTCAGTTTA AATGCAAATC GACCACTTGG CTGAAAAAAA TCGCGnCATC 360 

AGTAGGaACA GAATTAGGAA AAAGCACCGT GcGCCaATTT AGTGaTGGcG AAaTTCaAAT 420 

TAAtATTGAA GAAAGTATTC GTGGCGATCA TGTCTACATC ATCCAAGCAA CGAATGCTCC 480 

TGTAAATGAT CATTTGATGG AATTGCTTAT TTTAATTGAT GCATTAAAAC GTGCTAGCGC 540 

AAAAACAGTT AATGTTATTT TGCCTTACTA TGGTTATGCT CGCCAAGACC GCACAGCGAA 600 

ACCAAGAGAA CCCATTACTG CGAAATTAGT TGCTAATATG TTGGTAGAGG CAGGCGCAAC 660 

ACGTTTGTTG ACATTAGACT TGCATACTGT TCAAGTCCAA GGATTTTTTG ATATCCCAGT 720 

AGATAATTTA TTTACAATGC CTTTGTTTGC TCATTATTAT CGTCAACAAG CATTAGTAGG 780 

GGAAGAAATT GTGATTGTTT CGCCTAAAAA CAGTGGCGTT CAACGCGCAC GTAGTTTATC 840 

GGAGTACTTA GATGCGACCT TAGCAATTGT CGACCATGAA GAGATCGATG GGGTGCGCCA 900 

AGAAGGTTAC GTTATTGGGA ATGTCGCTGG GAAAAAATGT ATTCTTGTTG ATGACATTTT 960 

GAATACTGGT CAAACGTTAG CCACAGCGGC GGAAGTTTTA ATGAAAAATG GTGCGCAAGA 1020 

AGTGTATGCT TGCGCCTCGC ATGGTTTGTT ATCAGAAGGC GCAAAAGCAA CCTTAGAAAA 1080 

TGCGCCAATT AAGGAAATTA GTATTACTGA CTCTGTTTAT ACGACAGCAs ACCGCCAACC 1140 

AGCAACCTTG AATATTATTT CTTGCGCTGA GTTAATGGGG GAAGCGTTAC TTCGTATCCA 1200 

TGAAAATAAA CCAATGAGTC CATTATTCCG TTTAGAACCA AAAGGAGAAT AAGTGATTCC 1260 

TTAAACTAGA AATAGGTCTA ACAACTTTAA TAGTGTTGGG CCTATTTTTC TTATCTCTTT 1320 

ATAAATnGAA GAGAAAGT 1338 
(2) INFORMATION FOR SEQ ID NO: 683: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683: 



TTCAATGAAT AGAAGGCTGC ACCTAATGTG TAAGACACAC CCCCTGCGAC CAATAACGCT 60 

GTTCCTGTAA ACCCAAGAGA TTCATATAAC GGACGAGCAG CTATGATACA GAGCCAACCT 120 

AAAACAATAT AAATAATCGT TGAAATGTTT TTCACTGTTT CTTGTTTATG CAACGTGAGG 180 

GATTTATAGA CAATACCGAC GATGGCGATT AACCAGACTA AACTAAATAA GGTCCAACCT 240 

AACCAACCGC CAATACTAAT TAAGCAAAAC GGCGTATAGC TGCCAGCAAT CAATAAGAAA 300 

ATAGAACTAT GATCGAAGAC CTGAAAGACC TTTTTGGCTC TTGTAAAAAT TAAACTATGA 360 

AATAACGTTG AAGATAAAAA AAGTAGAATC AACATGGAGC CATAAATAGC ATAAGATACC 420 

ACGTGAATCG GTGAGCCTAA ACGAGCTCCT TTAACAAGTA AAATGACTAA GCCAGCAATA 480 

CTTAAACCTG CGCCTATGCC ATGTGTAACT GCATTCAAGA CTTCGTTCAC GATTAGGTAT 540 

TTTTTTGAGA AaTGCGTTTT TTCCAATAAA TTCACTTCCT TTATTTTAAT TTGATGAAAA 600 

ATtCGGCAGT GTCTGTTATA CTATAACTAG ATAATTAACC GTTTAATGGT ATTCTACCAT 660 

GAAAACAGGT TCCTTTTAAA GAAAGAGTGT AAATTTATGA CAAACGTAAA ATCGTACGGT 720 

TCTCATGACG TGGAAAAAGC TAGGGTGATT AATATCATAT GTGCTTATCA TTATGGTGTG 780 

GGTGTTATCA GTGTGTCATT GCAGCGAAAT TATGGCAGTG GCAATGGAGG CTACCCAACG 840 

nGCACCCCAT TGGGATGnGA TAnTGTCCTG GGAGTGCGGA GATTCGTCCA GCAA 894 



(2) INFORMATION FOR SEQ ID NO: 684: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684: 

GTAACGATCG TATTGACAAT TGACCTTAAT TACAGCCATT AGCACCGCTG CCCCAAAAAC 60 

TTGAGAAGCC AAAGTTCATT TCCCCCACAA AAaACGAGGT CATTTGTAAA ATCGCTGCAT 120 

TGACAACAAA GCTAAATAGA CCAAACGTTA ATAAAGTAAA TGGGAGCGAC AAGATTGTTA 180 

AGACTGGTTT CACCAACATA TTTAGGATTG AGAGAACAAA GGCGGCAATA ATTGCAGGCA 240 

TTATGCCTCT CACATGAATC ATGCCAGGAA AAACAACTGA TAAAGAAATG AATGTCAGTG 300 

TATTAACAAC CAATCGCTGA AAATATGTCA TTAAAAGTCA CTCCATTCGT CATCATCAAT 360 

TTTTTCCGCT TCTTTACGTT GCTTTTTATT GGCAGCATAA GGATTTTCAT TGCGAACATT 420 

TCTATTATAA GGATTTTGAT GGCCGTAGCC GCGATCGCTT CCTGTTCTTC CAGAAGGGAT 480 

CAAGACTGCT AAAGCAATGT ATAAGAGAAT GCCTGGAAAT CCTGcTGAGA AAAAGCTGAG 540 

CmAGACATAA ACGACACGAA TAATTGTTGG GGTCAATTCC TAGCCAnTCA GCAATCCCAG 600 

CTAAGCTTCC TGnTAAAACC ACGTTATTCG GAGATTTTGG TAGT 644 



(2) INFORMATION FOR SEQ ID NO: 685: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685: 

GCATTTnnTA GTGTCCGATT CGnCTTAGAA gTATTTTcAA TcGTAAaGCT AATAAAaTTt 60 

CGTtGGCTTt CCGTTtAAaG GTCGTTyCAT CAGTTAAAaT CTGkTGcTtG rCCAATTGCk 120 

GAGTCAaTGT TGAmCCmCCr GAgAcCGCsG ATACCTGTCG CATCAGAAtC AACGCTCGAA 180 

CTAGCGCTTT TGGTACCACC CaTGATGTTC yTCmAAATAT caTCTTCTGT GGAAATAATG 240 

GCTTTTTTTA GCAtGGAGAC ATTTGATCGC CATTAATTCG TGTCCGAATT AAATCTGATT 300 

TGATGTTAGC AATCGGAGTA CCATCTGCGT AAGTCATTTT TGACACTTCG GTAATATCAC 360 

TGATTTCTTT TTGCAATTCT TCCTTCGTTG GGGGCTGTGT ATCTTCAACT AGAAAAGCAA 420 

AaTAGCCCAT TCCAATACCT AAGCCAAGAG AGCCACCTAA GACAATCAGA tAAcCCCAAA 480 

ACCACTAATG CTGAAAACGC G 501 



(2) INFORMATION FOR SEQ ID NO: 686: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686: 

CTTTGTAATT GATGTACGCC AAAATCCTGG TGGACTGCTA GATCAAGCAG AGCGAATGGC 60 

CAkTATGTTT TTAAAAAATG GCGAAACAAT TGTTCAATTT GAAGACAAAA AAGGACGTAC 120 

GATGAAAGAA GTTGCTTCCA AAGAATTAGA CGGCGGCTTT aAAGTAAAAG AACCTGTTGC 180 

AGTTATCATT GATGGCAATA GTGCCAGTGC ATCCGAAATT TTTGCTGCAG CACTGCATGA 240 

ATCTGCGAAT GTCCCATTAA TTGGAACGAA AACATTTGGC AAAGGAACGG TCCAAACGGT 300 

GAAAGATTTA AATGACCAAA CCGAAATAAA ATTGACTGTT TTAAaATGgT TAACACCAAA 360 

AGGCGAATGG ATCArCGAAA AAGGAATTGA ACCAACCATC AAAGCTGATT ACCCCGAGTA 420 

TGCCTATTTA AAATTGATTC CTCGCGATAA AACGCTAAAA GAAGGAGATC AATCTGAAGA 480 

TATTCAGAAC TTAAATGCTA TTTTAGCTGT ATTAGCGTAT CCTGTTGATG AAAACAATGC 540 

AAACTACACC GCAGAAACAA AAGCAGCGGT wAGCGATCTT CAACaGAAAA ATGGCTTGCC 600 

TGTAACTGGA GAAATCGACA ATGAAACAGC CACAAAAATT GAAGCCACTT TAGGTAAATT 660 

AATTCTTGAA AATGACGCTG CTTACGATAC AGCAGTCAAA GAAATTCAAA AAAACTAGAA 720 

AAArGTGGCA CGAaGTTGAA GaAGAAcGTG ATTATGTTGG gTATTtAATG IcACTTyCTGA 780 

AAATnTTAGt ACCAGCAtCg TAgCCGTTTT tATTTtAAGr GGtTTTCCC 829 



(2) INFORMATION FOR SEQ ID NO: 687: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687: 

TAGATTCTAT AATGGTGACA GGGCTAGTAC CATATTTAGC GATGAAAGAT AAGTCAGATT 60 

CGAAGGATGA AGGGGTTAAC CTTAGTGCGT TAAACTTTAA AGAGCAAAGC GTACCGTTCA 120 

AGATATTAAA AATTAAAAAT ACATTTACTG ATGAAATGCG TTTTGAATAT CAAACACACA 180 

TTATGGATAC TGCAAAAAAT ACTCCAATTA TGAATAATGA AAAAATTAGT TTTATCAGCT 240 

ATGAAAAATA TATAGTTACT GGTATGAAAA GTATTTTAAT GAAAGCAAAG GATTCTAAGA 300 

AAAAAATATT GGCATATATT AATAATAATT TACAAAATCT TATAGTTAGA AATGTAATCA 360 

GACCAACTCA AAGATATGCT GATATGTTGG AATTTTCATA CCATCCTAAT TGCTTTTCTA 420 

ATGCAATAGA GAGAGAAAAA GTACTTCATA ATAkGTGGGG CCTATCCCTT ATAAGGAnTA 480 

AAAAG 485 



(2) INFORMATION FOR SEQ ID NO: 688: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688: 

CTATACGCCA CAAACAATCT CCAACATGAG TAAATTGGTG GCTGAAGATG TTTTAGCTTT 60 

TAAAGAmAGA ACGTTAGAAG CCAATTATTC CGTTATATTT ATGGATGCGA CGyaTATCCC 120 

TGTGAAGCGA CAGACTGTTT CGAAGGAAGC AGTGTATATT ACGATTGGTA TTCGTTTGGA 180 

TGGAACCAAA GAAGTTCTTG GGTTTACTAT TGCCCCAACT GAGTCTGCTT ATATTTGGAA 240 

AGAAGTTCTT CAAGATCTTA GAAAACGTGG GTTAGAAGAA GTTTTATTAG TAGTGACAGA 300 

CGGATTAAGC GGTATTGAAG AAAGTATCCA TAGTGTGTAT CCGAATGCCC AATTTCAACA 360 

ATGTTGTGTG CATGTATCTA GAAATATCGC TCATAAAGTT CGTGTTCGAG ATCGAAAAGA 420 

AATTTGTGAG GATTTCAAAT TGGTTTACCA AGCGAATTCA AAAGAAGAGG CATTGGATCA 480 

CATCGACTTT ATGATTAGGA AATGGAAAAA GCAGTATCCA AGAGTCGTCA ATTTACTCTT 540 

GAATCCTGCC CTATTAACCT TTTATAATTT CCCTCACGCC ATCAGACGAA CAATTTATTC 600 

GACGAACCTG ATTGAAGGCT TTAATAAGCA GCTAAAACGA TATACTCGAA GAAAAGAACA 660 

ATTCCCTAAT GAAGAATCTC TAGAGAGATT CTTCATTTCT CAATTTAATC AATATAACCA 720 

AAAATTTTTA GGTAGAATTC ACAAGGGATT TAAAGAAATT CAGGATACAT TAGAGTCGAT 780 

GATTTAACTG TAATTAACAG AATGGATTTT CCATTTACAC ATAATTCTTG ACGCTACCAT 840 

AATCTAAATA ACCATCATAT TCATCTACTA CTTATATAAT TGAGTTTTAT TAAACTTTTT 900 

AAGTTTCTAT TCTATCAAAT ATTTAAACAA TCTCTGCATC CGAAAAGACT ACCTCGTATA 960 

GTTTTACTAA 970 



(2) INFORMATION FOR SEQ ID NO: 689: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689: 



ATAAACATAT TGGTAAAAAT TATGTCTCTT CAAAAAATCC ATCTGkTTTT GAAAATTTTC 60 

TTTATCCGTT ATATCCATAT TAGCATCTCC cTTTTTTAAC GATTACTATT GCTTATTCTA 120 

TCGACTTTTT CCCTATTATT TACTTTGAAT AAATCCTCTT TTTATTAGTG TTGTTAATAT 180 

AGACGAAGCA CACCGCAAAA CAGCGCTTTC ATAAAGGACA ACTATAATAA GAAATAATAG 240 

TTAGTTTTAT TAAATCATAA ATACAGCTAA AATAAATCAT ACAAAAGTCC TAAATCAAGT 300 

TTTATTTTAA GTkTCTAACT TTGATTGCkT TAGTTAAAAA TGACACmACC aTCTTATATC 360 

nTATTCCATA CCATTTTCnT AAT 383 



(2) INFORMATION FOR SEQ ID NO: 690: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 850 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690: 

GnCTATGTTG AGAAnAAAGT CGGTAAATTA GAnAGATATT TCCAGCGATT CACCTGAAGC 60 

AACGGTTCAC GTGAATTTGA AAGTGTACAC TGAAAAaAaC AGCCAAAGTC GAGGTTACTA 120 

TTCCTCTTCC TTATCTAGTT TTACGAGCTG AAGAAACGTC ACCTGATTTA TACGCAAGTA 180 

TTGATTTGGT TGTCGATAAA TTAGAACGAC wAATTCGTAA ATTTAAAACA AAAATCAACA 240 

GAAAAtCTCG TGAAACAGGA CGTAACAATA CAAAAGCCGC TGTCTTCTTA GTTGGTGAAG 300 

AAACAGAAGA AACACCTTCA GAATTAGACA TCGTTCGTAC AAAACGTCTA TCATTAAAAC 360 

CGATGGACAG TGAAGAAGCT GTTTTACAAA TGAACATGTT AGGACACAAC TTCTTCATTT 420 

TTGAAGATGC TGAAACAAAT GGAACAAGCA TTGTTTACCG TCGTAAAGAT GGCAAATATG 480 

GCTTGATTGA AACAGACTAA AGCTTAAAAT ArGAGCTGAA ATGCCTTCGT GCATTTCAGC 540 

TCTTTAAACA TCTTTAAAGC ACGGAGTAAA GAAAAAATAA AGAAATACAT AAAATAACTT 600 

TAAAAGTTCC TCTTTTAAAG GAAAATGTCT GTAAAATAGA GAGAAATGGT TTTCTTTCTC 660 

GTCAAGTAAT GATAAAATAA AACGGATAGA TTGTGCTTCT TGTACAAAAA GGATACAAAA 720 

ATAAAAAGAG GTAATTGAAA GGACACAGTA CCATGGCAAT TTTTTGAAAA AATGTCGAAA 780 

CGTAAAAAGA TTAGCGTTGA AAAACGCGTA ATGTCCTGCA GGCATGACAC TACGTGACAT 840 

TCGGAAACGT 850 
(2) INFORMATION FOR SEQ ID NO: 691: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691: 



ATACTCACCT ACATCTTAAT TCCAACTAAA CTTAATACCC ATTTAACAAC TGCAAAAaTA 60 

TTTTTTCCTT GATTCGTCGC AAAGTCCTGT AAAATTAAGG CAATGACAAT TACTGAACCA 120 

ATTTTTGCCC ATTCGGCACC TTTCCAATGT TTTAAAATCA AGATAATCGC ACCAGCCGCT 180 

AAAATTGTAA TAAAACCTGT ACTTAAAAAA CTACTCAAAT TGCCTGTATT CATTAGTCAT 240 

TTCCTTCTTT CTCTGTAAAG TAATGTTTCA TCGTATTAAT AAAATAAGAA TTCTCCGTTT 300 

TAGATAACCA AAGCGAAAAA TCCTCAGTAT GCGTTAATTT TGTTGTTTTA TCTGAAAAAG 360 

TGACCGATAC TTGAACCCCT GTTACACCTT TCTGCTTTGT ATCAAAAAAT TTCATATCAC 420 

CAGAATCTAT TGAAACAAAT GAGTCTTGAC CTGCTGTCGT TGTTGGATTT TTCATTAACA 480 

ACATTAATTC TTGTTTGTTC CCAACTGAAT ATTtTGTAAA AAACATCTTT AAAAATTTCT 540 

CTATAGATTG TTTTTCTTTC CCATCAACAT CTATTGAATT TTCTGTCTGT TTTAGAGATA 600 

CTGTTTTTCC TTGTGGTAAA CTACTTGCTA CAAAATAAGG ACGTCCTACA ATTGAAAACA 660 

AATTATCTTT TACTTGAATT GGTAAAACCA TATTCTGTGT CACATTTTCA GTCTTTTTAT 720 

CTTTCTCCAA TTCATAATTC ACATTTAGAT AAACCAAATA ACAATCTTCC ATTTTCTCTA 780 

TACGTGATAA AGACGTATTT TTAAACGTAC GTAAGACATT CCCTTGTAGA TTTTCTGTTT 840 

CATCAACATC TTTTACATCT AGTGATAAAT ATTTAGATAA TCGATCAATA CGAGCCTTTT 900 

TAGCTTCATC ACTTGCTTTA GAGTTGTAAT TTAAATACTC TTTCGCAAAA TTTTCGACAT 960 

AAAAAGAGAG GGCTGAATAA TCAAACTCCT CTCTTACGAC GTTATTCTTT TTTGATTGTT 1020 

CGCTTAATGT ATCCACTTTT GAATATAAAC TATCTACATT ACTTGCCATC ACATTGGCCC 1080 

GAATCGCACC AACTCCTGAC ATCACCAGTA AAACACTAGC GCCAATTAAC ACAACCCGAT 1140 

TCGCTTTCTT TTGAGAAAAT GTCTTAACCT TTGGTGTTTT TTCTTTTTTC TCTTTCTTTT 1200 

TAATTTTCTT TAAATTCATA TTCCCCTCCT GCAATCAATA AACCTAAACA ACTTTCAAAT 1260 

GATTTTAAAA AGGTAGATTT AATAACTTTT TCAACATAAA CTCTTTCTTT AACACGACCT 1320 

TTATATGTTt GAAAGCCTtC TTTATACAAA TAAAAATAAT ATGAATTGTA CATTTTATCT 1380 

CCATTAGGTT TTCTCAAGTA TTGACCTTCT GCTATTAAAC TAGTTA 1426 
(2) INFORMATION FOR SEQ ID NO: 692: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692: 

AATAAAGAAA ATGAAATtCT AGGAAaTAAG rAAATTACTA CTAAGcAGAC CAsATGTTGA 60 

GGGGAAAGAT GATATTTCAn ATAGTCGATT TGCTCTTTGG CAAAGTGCGA TnGATTTAAT 120 

GAAATTAAAT CCTATATTTG GCACnTCTCC AAAAGGTTTT GCAGATATAG CTAAAGATAA 180 

ATTGCCTCAA TCTCATATTG CTAAGACAGC TCAAACACCG CATAGTTTTT TCTTCTATTT 240 

ATTAGCAGCT ACAGGGATTT CAGGTACAAT TGTATTTCTA ATTTTCTTGC TATCTAAAAT 300 

GTTTAACTCT GCTAAATTGC TGTTTAGCGT TCGAATTAAA AATTATATGG ATTTTGTGTT 360 

AGATAACCAA ATTGTGTTAG TCATTTTAAT TTCTGGATTA CTGATAACTG AAGTCATTTT 420 

AACAAGACGT TTTGCAACAG TTATCTTTTG GTTATACTTA GGTAAAATTC AGTATCGCAC 480 

TGACATGGAA AATGATTTGA TTAGAGGAAA TGAGCaATGA TGCTCATTTT tTCTACTATA 540 
(2) INFORMATION FOR SEQ ID NO: 693: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693: 

GAAAAACAAA AAGGTGTCAT GACATCTTTA GGAGGCAGTT TTAATGAAAA CAAACTCAGT 60 

AAAACGTTTA ACAATTTCAG CACTTTTAAT TGCAATGGGG ATTATCATTC CAATGGTTAT 120 

GCCAAGAATT ACCATTGGTC CAGCCAGTTT TACATTAGCA AGTCACGTCC CAGTTTTTAT 180 

TGCCATGTTT ATTTCTCCAG TGGTGGCAAT TGCGGTTAGT CTAGGAACAG GTTTTGGTTT 240 

TTTCTTATCC GCAACACCGA TTATTGCTTT GCGGGCCTTA TCTCATTTGA TTTTTGCAGT 300 

CATTGGCGCA tGaTTTTACA AAAGCATCCA GAAATTTTAA TTAATAAAGA GGGGAAATTC 360 

ACTTTATTAA ATGGAAAATT ACAGTTGTTT AACGTGGGAA TTGGAGTGAT TCATTCCGCt 420 

GCCGAGCTAG TGGTTGTTct GTaTTtACAC AATGGGCAAC TTGcCAGGAA CCTATnATAC 480 

AGC 483 



(2) INFORMATION FOR SEQ ID NO: 694: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694: 

AAATGAAAAA TTGTCTGACG GGGAACCAGC TCAATTTGAT GCAAAAGAAG AAACGTTTTG 60 

GAAAAAAATT TCGCAAAAAG TTATTTTATA AAAGTAATAC ATACGAATAA GAGCGTACAC 120 

ATAGAAAGAA GCGTAAgCAG TTTAACACTG CTTACGTTTC TTTTTTAATG AATAGGAATA 180 

ACTGTTTCAC GCTTAATTAA TTTGTGAGGA ATAACGAGAC GGACACCATT TGTCAATTCT 240 

TCGTGGATTA ATTCATATAA TTTTTTCATC GCCATTCGAC CTAATTCTGA CGTGTCGATG 300 

TCAATGCTTG TTAGATAAGG ATGTGTCAAT GTTGgAAAAG ATTGAGTtGT TAAAACTAAT 360 

AATGGATAAC GTTTCCrGGr ACyTGaTAAC CAwAnGATTT GCGCaAGTTg CATCgG 416 



(2) INFORMATION FOR SEQ ID NO: 695: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695: 



GCCTGATACA TACATTTCCA ACATTGAAGC CATGAGGGCT TTTTCGTTtC GTTGATAACG 60 

TTCAAACACT GTGGGTGAAA AATGGCCATC ACGTGTTCTG GGTACTTTTA ATTCTAGCGT 120 

GCCTACACGT GTCGTAAAGc TGCGCTCATA ATAGCCATTT CGTTGACTTT GTCGGTTTTC 180 

TGTTCGTTCA TATTCTTTTG CTTGAATATA TTCTGTTCGT TGATTTTCCA TTAGTTGATT 240 

AAATACCGTT GTTAAAATAT TTTTAGAAAC GTCATCCTTT ACAGAATATT CAATAATGCT 300 

TTGAATCTCT TCGCTTTTCA GTGTAAAATG TACTTGGGTC ATGTAAAAGT CCTCCTGGGT 360 

ATGTTTTTGT CGTTAAAAAC ATTGTACCGT AAAAGGACTG TTATATGGCC TTTTTACTTT 420 

TACACAATTA TACGGACTTT ATCATAATTT AAGTAATACT AATCAATGAA ATTTGAATTT 480 

TGAAATAGTT ATTTATTTTC ATTTTAAATA CGTCTTTTAA TTAACATTAA CCTATGATTG 540 

ACTATAATAA AAAATAGGCT CTCTTTGTAT AGAAAGTCTA TTAAATTTTA ATAATTTATT 600 

TAGTTGACAA TAAGAATATA CCTTCAGCAA TTTTATTTTT AGTAATAGGA ATTACTTCTG 660 

AATAACCTTT AGAGTTTGTT ATAATAACCA TTATTGTATC ATAAAGGCCT GAATTTTTGA 720 

TCATTGGGTT GTCCAATGTT CCAAGAACCT CCCCCTTTTT ACTTTTTGAT TAGCAACAAA 780 

TTTTTTAGCG AATCATTTAC TACTTAATGA TACTGTATCA ATACCAATAT TAATCAGTAA 840 

CATGTCCAGT TTCAAATAGA ACTTGAACAA TACCACATCT ACTGGAGAAT AAATAGTGTT 900 

TCTATTTGGC TTAATAGGAA CTATTTTCTT ATTGTTGATC CGTCTTGTGT TTGTATTGAA 960 

AAAGAAGAAA TAATGAGGGA TTTAAAATTA GCAAAAAAAT TGGTAAAATG AACTAAGGTA 1020 

ACGATCGCAC CCTCAATATA ATAGTGTCAT GAATTTAAAC CTCGATTTTT TTGCGGTTGT 1080 

TACCTTTTTA TTTTAGGGTG GAAAATAAAT GTcTCTTTAT TTCTTTTTGT TATTTCGCAA 1140 

GTGATAAACT CCTGTTAAGA TAGTGGCTAT ATAAAAAAGG CAGATACTTA AGAAGAGTAT 1200 

ATTCATGATT TCCTCCTAGT TTAAATGAAA GTCATGAATG ATTGTACCAA GATTTTTTAG 1260 

AGAAGTTAAG ATTGAATAGA AAAGTAAATA CGATArAATT TTTATAGACT GTTACGTTGA 1320 

AATGTTCTTC AGAGACTATT TCTTCCAATA CCnCCTACTA GATAGGTGGT TTTAATCATT 1380 

ATTTTCGGAG GTTGTGTTTT GGCCAAAAAG TAAAGATTTA ATGGATGTAG GTTCATTTGT 1440 

GA 1442 
(2) INFORMATION FOR SEQ ID NO: 696: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696: 

CTATGATTGG CTTGGAGAGC tTTCGTTAAT TGATTTCTTA CGTGATGTTG GAAAAAACTT 60 

TACAATTAAC TATATGATGA GTAAAGAAAG TGTGAAACGT CGAATCGAAA CAGGAATTTC 120 

TTATACTGAA TTTGCGTATC AATTATTACA AGCGTATGAT TTCTTGAAAT TGTACGAAAC 180 

AGAAGGTTGT TTATTGCAAT TAGGCGGCAG TGATCAATGG GGAAACATTA CTTCAGGGAT 240 

TGAATTATTA CGCCGAGAAC AAGAAGTTCA AGGATTTGGC TTAACTATGC CTTTAATTAC 300 

AAAAGCAGAT GGTACCAAAT TTGGTAAAAC AGAAGGAAAC GCAtTTGGCT AGATGCGGAA 360 

AAAACTTCGC CATATGAATT TTATCAATTC TGGATTAACA CAGATGATCG TGATGTCGTG 420 

AAATTCTTGA AATACTTTAC TwTCTTAACA TTGGATGnAA TTGCAACAAT TGrAGaAGAA 480 

TTTACAGCGA ACCCAGGTCA ACGTGCTGCG CAAAAAGCGT TACCAAAGAA GTGACTACTT 540 

TAGTCCACGG TGAAGCAGCC TATCATCAAG CGGTTAAAAT TTCAGAAGCT TTATTCAGCG 600 

GGGATATCCA AAGCTTAACA GCAGAAGAAA TTAAACAAGG CTTTAAAGAT GTACCGACGT 660 

ATGAGGTGCA ACCAGAGGAT CAATTAAGTT TAGTCGACTT ATTAGTTACC TCArAAATAG 720 

AcCATCCAAA CGTCAAGCCC GTGAaGATGT TCAAAATGGA GCGATTtACG TCAaTGGAGA 780 

aCGTaGACAA GATTTAGCAG CTGAATTGAC AGAAACTGAT AAAaTCGAAG GACAATTTAC 840 

TGTGATTCGT CGCGGTAAGA AAAAATATTT CTTGTTAAAA TACTArAAAA ATGaAAGCTT 900 

GGGACGaAGG TCGTTTaGAC AACAGGTTCC 
GaTTAATTCT GTTACAAAAA ACTCTTTTTT 
TGTGCTATGC TAGAATAAAA AGTGCAGTCG 
CGAAAAGTTA AACTTAACGA AGTATCAGCA 
GTATCTATCT GAATTTGCTT CAGCCGCGCA 
TTTTGCATTT GTGAAAACAA TGGCGGAATT 
TCAACATTTA GTAGAAAATG GGTTGTTATA 
GTATCCAACC TTTGTCCACC GAGATGAAAT 
TGGGTATATT AAAGGCTCAA CCTTAAAATT 
TACGGTTGTC GGCATGAAGA ATGTTCTTAA 
TGTTGGGGAT ATGTTCATAT TTACCGGAAA 
AGCGACATCG TGCATACGCn GGTATCCAAA 
CCGnAAC 

(2) INFORMATION FOR SEQ ID NO: 697: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1627 



AAGCTTTCGC 
aCTATTGGGc 
CTATAAGGAG 
ACTAGTTATT 
GCAATTACCC 
TCAAGAGATT 
TGTGGCTTAT 
TTTTCCAACA 
TAATCGCATG 
AAAGCTAGCG 
TTGAAGCGTG 
GATGGCCCGn 



TTTTAAAAAA 
mTATTCaATT 
TAGGTTATGA 
TTAAATAGAC 
TCAGAACCAG 
GTTTCAATGG 
CCGAAAAAAG 
TTACAAGTAA 
GTTAGTTTAG 
AAGAATCAGA 
GGACAACAGA 
ATGTTTAGGC 



gATAAATTGG 
aCAGATAACT 
AATCAATTGT 
CACAGGGAGA 
TTGAGCTGAT 
TGATTGAAGG 
GCAATAAAAT 
ACGAGGCAGA 
ATGAGACTTT 
CCAGTTCTGC 
CCACTGCTGC 
AACGACGnGC 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697: 

TCAAACGTTT ATCATAAATA AATGTAAATT ATAATATTCT TTTGATTTAA ATAGATTTAT 60 

ATATGTnATA CGTTTGACTT TTTTCTGATT TGTTTATACT ATTAGACTAA CTACTATTTT 120 

CTAATAAAGG AGACATTACT TATGTCAAGC ATCATGAACC AATGGACGGA TGAATTACGT 180 

TATGCGCCTT ATTCTTCTTG GACATCTGCT CACCTCGAAA ATCTAACTTC TATTATCGCG 240 

CAATCTAGTT GGCGTTTTAA GTATCATATT CAACCACAGA CAGGACTACT AAATGATCCC 300 

AACGGTTTTT CGTATTTCAA TAACCAGTGG CATTTATTTT ATCAAGCGTT TCCTTTcGGG 360 

AGTGTTCACG GACTAAAAAG TTGGGCCCAC TTAACTTCCT CCGACTTAAT TCACTGGGAT 420 

TATGAAGGAA TTGCCCTTTA TCCCGACTCT GAATATGATT CTCATGGCGT CTATTCAGGC 480 

TCAGCTTTAA CAATAGATAA CCAACTATGT TTATTTTATA CAGGAAATGT TCGTGATCAA 540 

ACTTGGCAAC GATTTGCATA TCAAAATATT GCATGGCTGA ATTCTTTAGG TGCGATCACA 600 

AAGGArTCAA CACCATTCCT ACCTATTGAC CCCAATTATT CTTCCCATTT TCGTGATCCG 660 

ATGGTATTTC CTTATCAAGA AGGACTTGTT TtATTAATTG GTGCTAGTGA TTTAAATGGA 720 

CAAGGAAAAA TTGTGGTCTA TTTTTCTAAA GATCGAAATG TACACAATTT TCATCAACTT 780 

GGCGAATTGA CGTTCACCAA CCAAGAATTA GGCTACATGG TTGAATGCCC CAATTTGGTA 840 

TTTATTGATG GCCAGCCTGT CTTATTATT 869 

(2) INFORMATION FOR SEQ ID NO: 698: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698: 



TGTCAGATTG CTCGTTTATC GTTTCAATGC AGCATTAAGG AGTAACGAAA TGCGATTCGC 60 

TACTCCTCAA AGACTGATTT CTATTCTTTT TCTGAACCTT CTGTTGCATG ATAGTTCACG 120 

CTATTTGAGG TACCTTTTTT AGAAAAGTTT TCAATTAGCT CTTTCACGTC GAGGCCCGTT 180 

GTTTCTTTTA GTGTTTCTTG TGTACCAGCC AACAAATTTG TTGCATAATT TGTAATACGG 240 

TTGGCGCCAG AATTTTCGCC ACCTGCGCCT GTATCTACCA CTGAAATCTT GTCGATGTTG 300 

CCCAATGGTT GTGCTGCTTC TTTCATTAAT TGTGGCAACA TTTCCATCAC CATACTTAAC 360 

ACAGCTGCTT CACCATATTC TTTGAAGGCA TCAGCAATCT TTTGTTTGGC TTCGGCTTCC 420 

GCTGCACCAC GAGCTAAAAT AGCTTCGGCT TGCGCTTGAC CTGTTAAACG CGTTTTGTTG 480 

GCTTCCGCTT CGGCTAATGC TTCAACTTTG AAGCGTTCTG CTTCGGCTTC TGsCACTTCG 540 

CGTGcCTTTT GAGCAAGAGC TTCTTGTTCG CGTGCGTAAC GATCTGCATC GGCTTTTTTC 600 

TTCACTTCTG AGTCGTATTG TTTTTCACGA CGCGTAATTT CTTTTTCTTC TAACTCAATT 660 

TGTTTTTGAC GTTCAACGAC TTTGACTTCC ATTTCTTGTT CCACCACGTG TTGTTGTGCA 720 

CGCGCGCTTT CCAAGtTATA GGCTTGGTCT GCTTTGGCTt TGGCAATATC TTGT 774 



(2) INFORMATION FOR SEQ ID NO: 699: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 992 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699: 

ATCTGTAAAA ATAATCAGCn TCTCGTTCTT GTGCGTGTTC AATTTGTGTC AAAATTTCAT 60 

CATTGATAGC AGACCCAATC GTTTCAATCG TATCCGCTGC AACTACCTCT TGAATGCGGC 120 

GTGTGTCGTC TTTTCCTTCA ACGACAATGA TTTCTGCAAC TTTCATTTTT TCTTTCATTT 180 

ATCTATCCGA AACAAGCGAT GAGCGkTGAC TCTCGTTTGT TCTGCCACTT CTTCAAAGGT 240 

TAGTTGTCGT aATTCAGCAA TTTTTTCCAC TACGTATCGT GTATAGCCTG GTTCATTCCG 300 

TTTACCACGA TAAGGAACAG GGGCTAaATA GGGTGCATCT GTTTcCACCA ACATTcTTTs 360 

tAGTGGTACC GCCATCGCCG CTTCTTGAAC ATCCAAtGCC TTTTTAAACG TTACGACTCC 420 

ACTAAAaGaa TATaCmAACC TAAATCTAAA AATTTCTTCA TCCACTCGGC ATCACCACTG 480 

AAACTATGCA TGATTCCACC AATATCTGAA ATTTTTTCAT CTTTTAAAAT TCGATAGGTA 540 

TCTTCTAATG CCTCTCTCGT ATGCACACTA AACGGTAAAT TCATTTCCCG AGCAATCGCA 600 

ATTTGACGGC GAAAGACTTT TTCTTGCACC TCTTTAGGAT CCTCCATCCA GTAATAATCA 660 

AGrCCGATTT CACCTAaTGC TACAATTTTT GGCAACGTTA ATtGTTCTTG TAAtCGTCGT 720 

TCAACCTCTG GCGTATAGCT ACCCGyTTCT GTTGGATGCC AACCAATGAT ACTATAAATA 780 

TTGGCATATT CCTGACTAAG TAAAAGAGAT TTTTCAATCG TTGGTTCATC AAAACCGACC 840 

ACGGCCATCT CTGTAACACC CAATTCTTGG GCATGTGCAA TCGTTTCTGG AATATCATCA 900 

TTAAATTGTT CTGCATTCAA ATGTGTATGT GTATCAAAAA TCATTATGAT TGCCTCCTTT 960 

ATCTCCTCTC TACTTTATCA TATTTCTTGA AA 992 
(2) INFORMATION FOR SEQ ID NO: 700: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700: 



GCCAGATTAT GAATTGTTAA TTAAAATCCT TAAAAACGAT TCTAACTATC AATTACTTTC 60 

TTTTGAAACA nGAACATTTT CGTTAnCTTT TGCAArGTtA GTJcGAyCCTA aTACTGTAGC 120 

TCATTCCAAA CAATATTTTG GTGGAGAAGA TAAGGACTTA GGAGTTTTTG TAGATATTTT 180 

TCCAATTGAT GGTATCCCTG ATACTCAAAA GGAGAGAAAT GAACTAAGAA AGATAACGGA 240 

ACAGTATCGG TTAAACTTAA TGGATACTTT AGGTTTAAGC TATGCACGAA gTTTTCATCT 300 

TAGTAAGGCA TTAATAAAAT TAATAACTCG TTATCCTAAT CATTTTAAAG TGAAAAAAGT 360 

AGGaAACAAT CAATATTGGC GTGATTTATA TCAAACAGAG GCAAAAAAAA CCCCATTTGG 420 

AGCAACAAAA CATTGTGGTT ATCTAGAATG GATTCATATA GATTGGGGTG TATTTCCTAA 480 

TGAATGGTTT TTGGAATATG AGGATGTAGA GTTTGAAGGA CATAAATTTA TGGCCATTAA 540 

AAACCGAAAa CAATTTTTAA CTTTACGTTA TGGAGATTAT ATGGaAATGC CTCCAGTAGA 600 

AGAAAGAaTC ACTCATCATC CCwAwGAwww wtACCAAAAA TAmAGATTAT TTATAGGTTT 660 

TTTAGAAAGA AGGCGTACTT AAATGAAGCA GCAATATACA GCAACTGTTG TTATGGCTAC 720 

CTATAATGGA GAAAAAAATA TCTTAGAA 748 



(2) INFORMATION FOR SEQ ID NO: 701: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1002 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701: 

TGGTGCCGAC TTTCGTCATC GCAnCGCACG ACCnTTTCGA AAnGATATGG GTTCCATGAC 60 

GTTGTCTTTG nGCTGGTAAG TTTGTGTTAA CTCCCATTCG ACTCTAGGTA TTCCAAGTTA 120 

ATCACCATAT TCGGTTTGAT TGACGAAGTA AGAACCGTTT CTCGAATTAC CTGATTAAAA 180 

CCAGTGCGAT CATATTTTTC GTAATCTAAA AAGCCGTTAA TTCAGTCCGT AGGAGTGTGT 240 

TTACGCTTTT TCTAAGTGCG AACGGAATAA TTCATTTAAA TCGCCTTTAG TGACTAGAGT 300 

TTGCACAATT TCkGTAGTAA AATCATTCAT AGGGAAGTCC TCTTTTCTGT GAATTGGTTG 360 

TCGTTAACTT TATTCTACAG AAGCGACTTC CTTTTTTGTA TGGATTTTTT CATyTACACA 420 

AAATATTTTA CACTCTCAAG TTAwCGTACG ATAACTTTAA wwCATATGTT GTACAAGTTT 480 

TTTTGTTAAT TGTTTTTAGT TATATTATCA TTAAATTAAG TGAACTGAGA CAGTTTTATT 540 

GGCTTwAAAA TCTATATTAA ATAAAACAAC AAAGACCTTA AGTTGGAGTT TAAGGTCTTT 600 

GTTGTTTTAT TTAAGATTCA TAATCACAAT GCCGACAATA ATAACCACAG CACCAATAAG 660 

GGCATTCGTT GTGATTGTCT CTTTTAGAAT AAGATTTGAA GCAATTAAAA TAGCGATGTT 720 

TGTACAGGCA ACGCCTAAAG GAACAATGTA AGAAACATCG GAACGACTwA TAATTAACAT 780 

CCATAAAATA AAACTAATTA AATAACAAAA TAAACCTAAT AAAGATATAA GTGAGATATT 840 

CATTGaAAAG ATTGTTTGTT GGaGTTGGAT GTTCmAACTG CTAGAACCTA GTTTAAATAA 900 

GACCmGACCA GCmGAAGACm GAAATACGTA AwTAATAAAA rGAaCCmTAA TTTCCCTTCT 960 

CTCAAAAATA ATTTCCGTGC AAAAAAATGC AAGACCTCCA TT 1002 



(2) INFORMATION FOR SEQ ID NO: 702: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 855 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702: 

CTTGAATAAA CGAAATCAAC TTGTCATTGA GAAAATGCTT GTAAGGAAAT CTTTGATTAG 60 

TGCATAAAAA ATGCTCACTC TTATCTGGGA GCAAGACTyT CGTTAATGAC TGTTTAATTC 120 

TGATAAAAAT AAAAGACCGT AGCTTATAGT AGGCTAGGGT CTTAAATACG TTATTGAGTA 180 

AAAGAATTTA ATGTATACTT TATTTTCGTT TCGTTGGTAT CGATATCAGT AATTTGCTCT 240 

TTAGTAATAT AAGCATGAAT ACTTTTTGAA ATCGTAATTT GTACTTCAGA GGTCTTTATT 300 

TGTTTAAATA GCGTTCTTTC AAATGTACCT GaGATATCTC CATGTGCTAA AGACCTAATC 360 

TTAAAATAGT TCTTACTTTT ATTTGCAGCC AAGACCTCTC CTTCTATCTC TATTTCTTcA 420 

GTTAACACGt TTCnTTCATT TATTtGATCA TTCACACTAG ACAACATAAC ATCTAAATTT 480 

CTTTTGATCT TTTCTGATTT TtTTATATTA CTCAACTTAA CTTGATTGGA AATTAAATCA 540 

ATTGATAAAT gCATTTTCTT CAATAACTTT ATACAGCTTA GCAGTTTCTT CAATGAGCTT 600 

TCTTTCATTA AACAGTTCTT CTTCATCGTA CATTGGTATA TAGGCTAAAT TATCTACGAA 660 

GnCAACAGAA AAAGAGGCCT CATCAGTAAA TAATCCCGTT TGnCGATCCT TTTCTAATTC 720 

GAAATCAAGT TTTAGCGACC CTCTTGAAGG TGCGGACATC AAAAGTTTCT TCTGAGTTCC 780 

TAATTCTGCA TGCCTTTGAC TTAAATCTTG CCATGTACTT CTTATAAATT CTAAAGCACT 840 

ATAAACAATA TCAAT 855 



(2) INFORMATION FOR SEQ ID NO: 703: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703: 



ACAGAAACAG TTGATGTTTC ATCAATGATT GATGAATTAG CAACAAAAGC GAATGTCGCT 60 

TTAAAAGCAA TGGAAGATTT TACACAAGAA CAAGTAGATC ACATTGTCCA TCAAATGGCG 120 

ATGGCGGCGT TAkACCAACA TATGCCTTTA GCAAAAATGG CTGTCGAAGA AACTGGCCGT 180 

GGaATCTAkG mmGATAAAGC AATCAAAAAT ATGTATGCAT CTGAATACAT TkGGAATAAT 240 

ATTAAACATG ATAAAACAGT GGGTGTAATT AACAAAGACG AACAAACCGG CTTAATGGAA 300 

ATTGCTGAGC CAGTTGGCGT GGTATGTGGG GTCACACCAA CAACGAACCC AACGTCAACA 360 

ACTATTTTTA AAyCcTAATT GCCTTAAAAA CmAGaAATCC mATTGTTTTg CTTTCCATCC 420 

TAGCGcACCA AAAATGTTCG GCAGAaGCGG cACGTATTGT ACGGGATGCG GCGATTGCAG 480 

CAGGTGCACC AGAAAATTGT ATTCAATGGA TTGAACAACC ATCGATTGAT GCAACATCTG 540 

CTTTGATGAA TCATCCAGGG ATTGn 565 



(2) INFORMATION FOR SEQ ID NO: 704: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704: 



ACGTATGTTC GCTATAGATn ATATGTATTA TTAATAAAAG GAGAaCTACA TATGATTACA 60 

aAAAmaGAAA ATGGTTACTT TGTTGACGTT AGCTTGGGGA TTGATCCAAT ATCaGGCAAA 120 

CAACGTAGAA AGAGACTTAC AGTTTCAACT AAAAAAGAAG CTCAACAAAT TGAATCAAAG 180 

TATTTAAGGT TATACCACGA AAATAAATTA ACAACTAATT CAGACTTAAC TTTAAAAGAT 240 

ATTTACGAaG TTTACGAAGA AAAATACACG CAAAACCTTA AGCCATCTTA CAAGCAAACA 300 

CAAGAAAGAA TTTTTAAAAA CTATATAGAA CCTTATTTTA AAAATACTCA AATCAAACTA 360 

ATAAAAAAAC AACAAATCTA TGATTTCCAA CAGTTTCTAT TAACTAGTAA ACCTAAGAGA 420 

AAAGAGACTT TATCAAATAA GACAATCAAT ATGATTATTA TTCATCTACA GAAATTATTT 480 

AACGTAGCTA TGAAAGAAGG ATTATCCTAC GAAAATCCTT GTAACCAAAT AGACAAGTTA 540 

AAAGTGCAAA AAAAAGAAAT TGATTTTTGG ACATTAGACG AATTTACAAC CTTTATTTCT 600 

CATATTGATA AAAATAAACC ATTTTTAAAA GTATTCTACC AGTTTGCGTT TTTTACAGGA 660 

ATGCGTGCGG AGA 67 3 
(2) INFORMATION FOR SEQ ID NO: 705: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705: 

ACTGnCGAAT TGCTTATCAG CAAGTATTAG ATGTCGTTTC GCAAAATTTG CCAGCAGGAT 60 

TTGAATTAAC AATGCAATAT GGCATGCCAA CCTTTGTTGT GCCGCTTTCA GTATTTCCAG 120 

AAGGCTATCT TAACCGTGGT GATGAACCGT TACCATTTAT CAGCTTAGGC GCTACTAAAA 180 

GGCAAGTAGC TTTATATCAC ATGGGATTAA TGGGAAATGA AGCCGTCCGA AGTTGGTTTC 240 

AAGAAGCCTA TCAGGAACAA GTGTCAACAA AGCTAAACAT GGGGAAAAGT TGTCTTCGCT 300 

TCACTAATCC TAAGCGAATA CCCTATGAAC TAATTGGCGA ATTAGTAACA AAAATCACGC 360 

TAGAAGAATG GCTGCAACAG TATCAACACT ACCAAAAGAA AAAGAAGAAT TATTAGGAGG 420 

AACCACATGT TATTAGGATC aCACGTTAGT ATGAGTGGCA AGAAAATGCT ATTAGGCTCT 480 

GCCGAGGAAG CGGCTAGTTA TGGCTCAACA ACCTTTATGA TTTATACAGG TGCGCCACAA 540 

AACACGCGTC GGAAACCGAT TGAAGAAATG AATATTGAAG CTGGGCAAGC TTTTATGAAG 600 

GAACACAATT TAAGTAATAT CGTCGTTCAT GCCCCGTATA TTATTAATTT AGGTAATACA 660 

ATTAAAACGG AAAATTTCGG TTTTGCCGTA GACTTTTTGC GTCAAGAAAT TGAGCGAGCA 720 

CAAGCGTTGG GAGCAACGCA AATCACGTTA CACCCAGGTG CGCATGTTGG TGCAGGTCCA 780 



GAAGCCGGCA TTAAACAAAT TGTTAAAGGT CTAA 814

(2) INFORMATION FOR SEQ ID NO: 706:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 590 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706: 

TATAAATAGC GAAAATTTTT GACCAATAAA ATCTTTTTCT CTATAAAAAA ACACCTTTGT 60 

CGAACTCGTA TAAAAATAAG TCCAACAAAG ATGTCAATAT ATATATTTAA CCTTTCAAAT 120 

ATTCTCTTTA CTTT-AATAAT CAATAAGGTT GTTTATTGGC GGATAGCTTA ACCCACTTTT 180 

GCTACATACC ACTTATTAGC AGTTAAATAC CCTTGTTTCG TCTTTAACCT TGGTATACCA 240

TTGTTTGTAT ATTCGATTGC TTCAACTTCA ACAATCGTAT TCTTCTTAAC AGCCTGTCCT 300

TTTTTAGTAA ACTCAGTATT ATTGTAGAAA CAATCATCAA TTAAGAGCTT AACTTTTTTT 360

GGATTAGTGA TAAAGTAATT GTTATTGGTA TTTCCAGCAG CTGTCACATA GTTTTTATTT 420

GcAGTAAGAT ATCCTCCTTC TGtCTTCAAG CGAGGaACAC CTTTAGAGGA ATACyCaATG 480

CCCTTAAaCT GGaACmACaG TCCCaGATGa TAAGGTTCTG CTACCATTAC GGAAACTCCA 540 

ACATCnGTAT AAAAGTTTAT CCnGGACACC CGCCAnAAAA ATTTGGGnGG 590 

(2) INFORMATION FOR SEQ ID NO: 707: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1283 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707: 



CCCTAnTGTA AAAATATTAT CCCTAAAAAT AGTCCCAAAC ATCCAAAAAT CAAAGGATAT 60

TTaTaTTTkG ATATGAATTC TaCCaTTGkt TTCTCCtATA AAACTTTTGA TTTAGTCTTT 120

GTATTTTTTT CATGATAATC CTTAAAAATT ATTTCGGTTA ATATTATTTG CTCTGCACCA 180

AATAATTCAT TTAAATTAGT ATTTACCTTT CTTATAAAGG CTGTTTGTCG CTCTCCCGtT 240

TGaAAAGCTT TGCGTAtTTT TCCTTGTATA TAGATTTTGA TTTTCTTTTT CTTAATTACA 300

GATTTTACTG TTGGATCAAC AATAAAAGGT TCTTCTTTTG CAGCAATTAA TACAAAGTTT 360

TCAATTGCTT TTTTTTGGAC TGCTATTTTT CCACTATCAG TCTTTTTTTC TATACAATTA 420

TGTGCTTTAG GATAAAACAA TACAATAATT AGAATAATAA ACAATAATGT TAGCAAACAT 480

CCACTAATCC AAAACAAAAA CTGTGCCATA CTATTACCAA TGTACGGATA GTTTGTAAAT 540

GGAAGTAGGT AAAATGGTAA ACTAACAACA CCTTGCATTG TTATTAATGT GTGTGCTAAA 600

GGAAAAAATA AAAAAAATAA AACAAGGATT AATAATAATT TTAATCTTTT TCGCATAATA 660

CACCTCTTTG TCTACGTTGT GAATATATTC AATTATCTAG TCATTCCTTG AAAATTCACA 720

ATTCAAATAA AATATATACT ATTAAAAATT TCTAAAGATT TTGTCACTTT GACTyCTwTA 780

wCyTTCTCTC CTTTCAACTG TATAACTTAT TTAGGTGAAC TGATAAAATA AATTTTATCA 840

GTTACTATTT CTAAAATTCT TAAAGTAATC AGACCTCaTT TCAGAAAAAT AATGAATGTT 900

TATTTTATAA AATAAAATAT TTGATGAAAC AATATTAACA AGAAAAATCT ACTAAGTCAA 960

ACAAAATGAA CGATCATATT TTTAGGGAAG AAAAATAATT TAAATTTTAC AATGAATGTA 1020

TGAGcAAATT TGTTGAGCTA ATATTTGGAT ATTTTTATAA TAAGTTTAGG GGTATCCCAA 1080

TATCACTCTC TAGGGGTGAC TCAGAGcAGG GGAAGGAGTT ACCCCTAAAA AGTATGTTAG 1140

AATGATTTTA TCGCATCATA TCCTTTTTTA TGTAATTAGT TGATAAAATC TAtCTGTACT 1200

TATATTTAAC GACyGaAATC GGTCACTTAC TGAAAAAACA TCTAAGGTAG AAATGAATCT 1260

CACCTAAGAT GTTTTTTGAT TnT 1283
(2) INFORMATION FOR SEQ ID NO: 708: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708: 





TTTPPAATPP 


AACCCCAnGG 


ACAATCCCCT 


AAnTTTTTTT 

full 1XXXXXXX 


TTAAACnTCC 


60 


1 w 1 LI I 


TAAAPTGGGG 


PTAGGPTTCC 


TTGGGTGGGA 

X X VjvJVJ X \J\JVJil 


ATTctTTCtC 


tAtTCtTCtC 


120 


aTPrrAAAP-f-p 


a PTPPP a ATT 


AATGnaATTT 


At-PtTTTTPt 
r\i.L^L.i ix loo 


AGtAATCGtT 


TPAAtTCaCT 


180 


TTr< A A A-h-TPf* 
I 1 l*nnn L ILL 


TPAGPTG^ A A 


AAAPATGPAA 


TPGATPAAAP 


APTTCTTkGT 

x x v*« x x rw \j x 


APTPGTCCCC 


240 


1 1^1 1 lul Lrt 


ATAA A ATATT 


PTTTAAAPAT 


ATPAAGTTCT 


TPTTTAATGA 

x v./ x x x nn x vJ/k 


TTGAACTAGG 

X X fu*\ii X i*VJ \J 


300 


TGTTAPAT A A 
loll r\\—r\ 1 nr\ 


ATT A AGPGTG 


PATPTTPATA 


ATTTTTTAAA 

x x x x ± x nnr* 


TGTTTAATGT 

X \J XXX *W* X X 


ATTTTGGAAA 

m xxx wvjiuu* 


360 


papappptga 


PTATAAPGTP 


GTTTTGTCTT 

\JXXXX\JX\^XX 


TAGTGTTAAT 


TCTATTTCAA 


ATTGCCACTC 


420 


TAPTTPTPGT 


PPAATACTAT 


CCAAAAACAT 


AAAAATGAAG 


TCAGGAACAA 


AGTATTTTAC 


480 


GPPTTTTTGA 

11111 Uft 


TTTTTC TTCT 


CGTCTTTTAC 

V«<» \J X \* X X X X 4*\^ 


TTCATATACT 


TTACGGCGTT 


CTGATACTAC 


540 


PGCTTTAAGG 


TGCGGGTGGT 


TATTACAAAG 


TTCCAACATT 


TTTATCAGAT 


AAT CAT TAAT 


600 


TTGTAAGTGG 


TGTTGTAAAT 

X V_J X X \J X fliUi X 


TGTATTCAGG 

X VJ X *1 X X V<*>J VJ 


AACTTTTGGC 


AAGGTGTAGT 


ACCCACCAAT 


660 


G TAAT TAT GG 


PPTTPTTTTG 

^ XX X X V X X X X VJ 


TTAGGTAATA 


AAATTTTGTT 


AATCCGATTT 


GGGAACATTC 


720 


CACTAGATTT TTTTCCTTCA GTTTTTTTAG ATTACCGTAT ACACTACTTT TTTTTGCCAT 780

ATTATCAAAT TGACAAAAGT CAATAAGTTG TTTTACCGAT ACAACTCTCA ATTCATGTAG 840

TAAACACAAA AGAATCAAGT GTATCTTATC TTCTTTTGAA TAATCAATAA AGTAATTGTA 900

AAGTAATGGT TTCAAAGTAT CACCGCATTT CTATAAAATG TACAAATAGG CAAAGCCTAT 960

TTGTTTCATT GAAAAAGAGT TTCGGAACAA CGTGTAGAAA TACTTTTTCA GCTTTTAAGT 1020

CCTCCAAGTT TTGGTGGAGG ATTTAGGAGG TTGTTGCTTT GGGCGATAGT CCCTTCAAGG 1080

TTTGGTGAAG GGGCGTCTAT CGCAAGGGGT AGTTGTTAAT CTTTTGAACT TTATCTTTTG 1140

AACTTTTTTC TTTAATTTTT ATTACTTCTA TCACCTTACT CATTCTATTT TTTTACTCTA 1200

TTCTGTTtGT CCTCGGCGCT TGGTATTTTT TTCGTAGTkG CAAAATTCCT GACCATTTTC 1260

GCTTAATGGG GGTTTTAACC CACGTTGAGC GAAAATGGTC AGGAATTTTG CAGGCGGAGA 1320

AAGCCTTTTA TACCAAGCGC CGAGGACACG CCACTACAAT TTATAATTCC ACACAATTTC 1380

CAGTGGTTTT CCTAAGTATT TCCATGATGT TTCCAAAATT TGACACACTA TATTAATCTA 1440

TTTTGAATTA CTTTTTTGTA ACCATTCGAG TGCGTTTCGC ACATAAACTT TTTGTGGTGG 1500

GAGGTTTACA TTTTCTGCTG TACGATAAAT AATAAACTCC CCTGTACGCA ATTCTTTTAT 1560

CTTGTTTGGA CTAATTTTGA AGCGATCTAC ATTACGTTTG GTTCCTCTAT CTCCTGACCA 1620
Al GALjAI ill 


ppa apatptp. 


A AT AAPPATP 


TTCTTGTTCC 


GTAACTGAAG 


TTACATCAAT 


1680 


G 1 G 1 1 G A 1 AG 


G 1 I v^L^n 1 lu 


TTTPAPTPPA 
111 OriO 1 V/vA 


ATAPTCGGCT 


TCACCTTTTT 


CGTTGGTCTT 


1740 


TCCAA1 GAAG 


TA7AAPTATTPA 
AAAo 1 Al I Ln 


PATTTPPGAP 


AAP,TTGTTCC 


ATTAATTTAT 


CGCCTAATTT 


1800 


TGCAATATCT 


GGGA1 GG1 1 1 


bnuwou 1 cnn 


A ATAPPPiTAA 


ACACCTGCAC 


CTCTACCTTT 


1860 


AT TT AC CAT A 


TCAA1 AAAGG 


111 ill nb 


ATA AP,AAfJPA 


GGTTCATCAA 


AAATCACAAG 

rirvn x v-*4ivy 


1920 


GACTTCACGT 


TTAAGAI 111 


TTPPATAPAT 
1 lL.bAlAL.Al 


TTPAP.AAP,PA 


AAATAATTTA 

ruui X XXX 4» 


CGTCTCCGAC 


1980 


TAACATTTGC 


GCT. AAAG I GG 


PP AT ATATTP 


TTTP.TAA ATP 


AATCCATTGA 


ACGATACGTA 


2040 


AACTAACTTT 


/""■i /Tp rp n rp rp /■"" T\ 

CGTTGAI 1 GA 


P TA T\ TTTPTTr 

GAA1 1 IG i 1G 


PAPATPP.AP.A 


APATTTTTTG 


CGTTTTTAGT 

\>rVJ X X X X X X 


2100 


ATCAAATAAT 


TTTCCTAACT 


GAG 1 G 1 G 1 AA 


TAAPATP.TTA 


ATGTTTGTPT 


GTAATGCTTC 

x fin x x x w 


2160 


TGATGTG'GCA 


TACAAATAAA 


tv /~< /-» ^ rp i~ TV 7\ rp/-« 

AGGGI GAA1G 


1 llAlul 1 L>a 


TA A PGA ATA A 


ATAATPGTTC 

r\ 1 /A/i 1 \<U 1 X O 


2220 


GAACAATTTT 


TGTTTGATAT 


m m m r n ^* m rn 
III G 1GG1G1 


TTPTT TP TPT 
1 1G1 1 lblLl 


PTTGGPA A AT 


ATPTTAGGTT 

rx x o x x n\j vj x x 


2280 


TCGTTTGAGT 


AAT AAAT AG I 


AGG AG I G 1 AA 


7ATPT A A TPTT 
A 1 L. 1 nn lull 


TPTGGATTA A 


GGAPAATCGT 


2340 


TTCCGTTTCT 


TTTTGTGCI A 


1 I GG 1 AG 1 1 G 


PP Ti A ATPTTT 
GGAAAlLil 1 1 


PT A A ATTTAP 
1 r\r\r\ 111 rtt 


TGTTPTTAAT 

X VJ 1 X \— ' X X lUi X 


2400 


TTGTGAATCT 


TCCTCTTTT 1 


G I 1GI 1 1 Al G 


A APPTPTP A A 
AAoLn 1 L* 1 unn 


TPAGGPAGTA 


TPTPGCTTCC 


24 60 


TTCTTTCACT 


GAI I 1 1 1 1 G 1 


TAPPTTTTTP 

1 AGu 1 1 1 l I Lj 


T ATTTTPTTT 


TPTAPTTPAA 


TCTCAAATAG 


2520 


TTTGGGATTG 


TtTGGl AAAA 


TTPPTATPPPP 

1 I GGA1 GL.0L. 


A A ATA AP.TP.T 


AAAAPATTTP 


TCGGAAGTAA 


2580 


AAATCGTTGA 


AG AAA I GG 1 A 


H 7A PTTPPTTP 

GAG1 iLljl 


A APP.P.TTTPT 
AAv^Vjvj 1 1 1 V_, 1 


GTPGTAPGAG 


TGTCTCCAGA 


2640 


TAAAGTAACC 


Tl GGG IGG 1 1 


rp Tv 7\ arTprTr 
1 AAAG 1 Go 1 L. 


TA APA ATTP, A 


ATTGTTACTT 
x x o x x no x x 


GCAAAAGAGC 


2700 


TTTnCGctGC 


GCCTGAA 1 AA 


AAGAGAG1 1 1 


PTPTTTPPPP 
L, 1 Li 1 1 ILijot 


TAA AGAPAPT 


AAPTTATCCC 


2760 


GAATAGATAI 


GI GAI 1 1 GGA 


1 GG 1L1 AL* 1 o 


PPTTPT ATPT 
1 i u In 1 U 1 


PATATPGCCA 


TCATCCGTAA 


2820 


AAGCATGCAC 


mm r+ tn m m m ^ 7\ 

TTG T I 1 G 1 GA 


rp tv AA7VTTTPP 

1 AAAA1 1 lulj 


P A ATTTPTPP 
L.AA1 i ILltu 


PATAPTTTPT 


AATGTATCTC 

nn x x ** x v x \** 


2880 


GAGCGCCTTT 


TCGA1 GAA I G 


Hutu tpta ppp 
aa 1 a 1 gagl.l? 


PAAAATTAAA 


TTTGAPTGPG 

1 1 lUnOlUOw 


TGTTGGACGA 


2940 


GTAATTGAAG 


TAAGG1 1 G 1 1 


TTPPPTPA AP 
1 1 GL.L, 1 (jAAL, 


PTnTPP.TTP,P 
L> 1 \3 1 L>0 1 1 


GGGAACAAGA 


ACATGCTGAT 


3000 


TGAACIG1GA 


AAAG 1 I G A 1 G 


AT AT ATP. AP A 
A i A 1 A 1 onLn 


APPPHAAPTP 


ACTCATACCA 


AGTAATAATT 


3060 


GGTTGGA1 AG 


rprp OWfp T* Tv rp tv 

1 IG 1 1 1 Al AG 


TTPTPA ATAT 
1 iol L. AA 1 A 1 


PAAPAPTTTP 


TACTGCTTCT 

1 fro x vjVrf x x N-f * 


TGGTATTCTT 


3120 


TTTCTTGTTC 


A 1 GAG 1 AAAu 


PPTTP.TP APT 
L-G 1 lul UAL* 1 


PTTPAAAPTT 

Oil K*t\t\r\\* 1 1 


ATTTTTTTGC 

C\ X X 1 X X X 1 VJV 


CTATTTTTGT 


3180 


ATI Gil I AGA 


1 1 GGG 1 GAG 1 


TTPTTPPPTT 
J. 1 L. 1 1 L.L.U 1 1 


PPTPTTPPTT 

iV/l 1 OO 1 1 


TGAAATAACT 


TTTCGTTTTG 


3240 


CCACAAATGA AGAAAATGCC AAAAAAATGG CCCCCATTCC ACTTGATCCA ATAAGCAAAG 3300

CGACTATTGT CTCCCAATTA AAACGAAGTG CTGTATGAAG AAGATTTAAG GGGAAAAGAA 3360

GATACGCAGA CAGATACCCT CTGGGAATCC AGAGGATAAA TGCAAAGACA TTTATTACAA 3420

ATAACACAGC TGAAAAACTA AGAAAGACAA TACTATTTCG TTTCAAAAAA CTCTCATAAT 3480

CCATATCAAA TACATTTGCA GCCTTGTCTC TTTTATCAAC CGCTAGACCA ATTAGTAATG 3540

GAATACTGCC TAAAATCGCA AAAGGATATA AAAGAACTCC GACTAATCCA AAGACTAGTC 3600

GGAGTTGGTG GTAGCGAGTC ATTTTTTGTT GTTCTTGCCC TATTGGCGGT TTCTTTGGTT 3660

GTAGCAACCC ATTGTTAAAC AAATTATTTC CTTGCACGAG GTTGTCTCAC TCCTTTAGGT 3720

TTGTTTGTTA TCTTCTGT 3738
(2) INFORMATION FOR SEQ ID NO: 709: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709: 

GCGAGCTTTT ATTCAAATTG TTTTACCCAA TGTGTCTTCA GGAATTTTTG CAGCCTTTAT 60 

GTTAGCATTT GTCAATTCGT TTAACAATGT TCCTGTATCG ATGTTTTTAT CAGGTCCCGG 120 

TGTGACGATG CTGCCTACTT CTTTATTAAG TTATATGGAA TmCaATTATG aTCCaACTGT 180

TTCmGcAATT TCTGTATTGT kGATGCTACT AACaATGGGC TTAATGTTTT TAATTGAAAA 240

AACGTTGGGA CTAGCTTCCA TTGCCTAAAA CATGGAAGAA GCACGAGATA GAAAGAGGAG 300

AAAATTTTGA CTTTTGTGGA TTTAAAAGAT ATTCGCGTCA cTATGACGGA AAACAAGATA 360

TTTTAAAAGA TTTAAATATT TCAAgGAAAA AGGAGAATTA GTTtCTTTAT TAGGACCGAG 420

TGGTTGTGGT AAAACGACTA CGCTAAGAGT GATTGCTGGa TTAATTawGC CAAaCGACGG 480 

ACAGTTTytC GTTGACCAAG AAGAACTAAC AAAGGTTCCT GTCCAnAAAA GAAACTTCGG 540 

GATGGTTTTT CAAAG 555 
(2) INFORMATION FOR SEQ ID NO: 710: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710: 

TAAACCAAAC TTTTTTAATT TTATCTAATC ATACTAnAAA CATTCAATTT GGAGTCACTA 60 

ATCCnGGTTn CTCTTTTGGT TCATCATAAT ACTCTTttCA aCCCcTCTCt ATACTCTtCT 120 

CGAaTCTCAA CCaTTGATTT tGGTtCTTGT ACATTGCCTT tCTCTATAAA TATAACGTCA 180 

TCAAGAAaTG TTTCCATATC TCTCAATATA TGACTACTAA TTATCAACAC TCTATTTTCG 240 

GACAAGGTCT TAACATTCTC AATAAATAAC TCTACACCAT CAGGATCTAA ACCGTTAAAT 300 

GGCTCATCTA ATAATACTAA TGAAGGGTTG TGAGCTGTAG CAAGACCTAG CAAAAACCTT 360 

TGTTTCATTC CTAAAGACAT CGTATTAAAC TTTCTTTTTT TATTTACATT TAGACCAACA 420

TCCATTAATA GAGTCTCTAT CATTAACAAA TTTTTTAACT TGTAATAAAA AGCATGGTCA 480

TTGTATATAT CTTCAAGCAT AAATCCATTT AAACTATCTG AGTTAACAAA GTTAGTACCC 540
ACCTGcAATA ATATTTTAGG GTCTGCTATT TCTACTTGCC GATTGTCTAT TTTAAGTATT 600
CCACTATCAG GTCTAAnGCA AACTACATAG ACATTTCCAT CAAAGTTGGT TTTTACCTGC 660 
ACCATTCGGA CCCAACTAAT CCCATAAAAT TTTnCCCTAC 700 
(2) INFORMATION FOR SEQ ID NO: 711: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1226 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711: 



CGTAnATTGG GTGGnGCGGA GCTGTATTCC AAATGACATt CaCGTATTTC AAACTTTTTC 60

AAaGCTAAAT ATAACCTAAT CATACACTCT TGATAACAAT CTTCATCAAA ACTCCCATTT 120

CTACAAGATT CCTTAATTAG TAACGGTTaA ATTkGTyCAC AATCTGAGCG AACGCTTCTy 180

CATCATCCTT TTGAACTAAA ATAACAAGTt CTACCAATGA TTTCATTTCG TATTTCCTCC 240

AAACACTTAT CACTTTTTTT CAGTGAAAGC TCTGGTGGAT CACGATTCTT ATATATTTTT 300

TTAACTAACG TAgTAAACGT TCTCGTATTT TCTTTAGAAT ACGTTGACGC TTTTTTGTTA 360

CACCTTGTCT ACTTAACGAT AATTGAATTG AAATTTCATA ATCCGAATAT CCAAGAATAT 420

ATTTTCGCAT TAAAAAGGAT TTCTCTTCCC TATTTAATTT TTCAAATTCA GTAGCTAAAT 480

CTAAATTGTC GAATTCAAAT GATAAATTAT CTGtTTCAAC yCTGATCATT TCyTTTACmA 540

AGrAAGTCtT ATCTTCAATT AAGAAATCAA GAAACTTTTG TCCAGAAATA ACCTCTGCTT 600

CTTCTATTCT GGAATTTTTT ACAAAGTAAT TTATCGCaGT ATTATGAACG GTTTttTTAA 660

TATAATTTAT AATTTGAGCA TCATATTTTG TAACATCAAA GTTCTGTTTC ATCGCAGCCA 720

CTTCTTCCTT TGGAAATTCG AAAGGAAAAA GATATACGGA CAAGTTAGGG TGTGTTTCTA 780

TACTGCCAAT ATCCTCTCAT TGCACGCTCC ACAGAGATTG AATCATCTCG ATTTATTTTT 840

GGAAGAATAT TATTCACGAA ATTTCTATAA TCGGGAAGAA TATCATTATT TTTAGCTAAT 900

TCCtGATATA TTTGTTTACC TGACAAAGGT TTTCCTGACT CTTTTAAAAT CGATGCAATA 960

ATAAACCCAT AATCTTTGTA ATTTCGTTGA TGAGTGCCAA TCATTTTTCG CTGAATTAAA 1020

TAATCCTGAA GCACTTTTTC ACTTTTGAAA TTTGnAAATT nAGTTTCTTG AGATAGTTCA 1080

TCACTAGAAT CCGAAACAAT CTTGTCTTCT TCTAAATGAA TCAAGTACTC TTTTAAATCA 1140

TAATACTGTC TTGAAAGTTC CCTCCTTTCC TCCATTAACA GTTTCAGCTG TTCATACACA 1200

TGACATTTTT CATCAGGAAT ATCCAG 1226



(2) INFORMATION FOR SEQ ID NO: 712: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712: 

AGTTGTTCAA TTTTCTACAG AAAATTTTTG GTTTCGCTTT GTTAGTGGAT TAGTGGCTGT 60 

TGGACTTGGT GCCTATTTCG TCTATCTATT TGTTGCTTTT TTGAAAAAGA AAAAGCTCAT 120 

AATTTGGACT TTTTTAGGAA TCAAAGAAAC CTTGCCAGTG GTCTTGTTAG GTTTGGTGAT 180 

AATCTTTTGT GTGGGAATAT TTGGTGATTT CTTATTACGT CTGGAAGGAT TAGAGACGAC 240

CTCTAACCAA CAAAGTGTTG AAGAAATTGT CCAATCTATT CCTAAAGGGC TTATACTTAT 300

TTTTGCAGGT GTGATTGGGC CAATTTTCGA AGAGGGAATT TTTCGTGTCG GGATTCAAGC 360

GTTTTTCAAG CCAGCTTCAA AAATAGGCAT ACTCGTTTCT TCwAwTTTAt TTGCACTTGn 420

GCACACGCCG ACAGATATAG GkAgTTTtGT tATTnATgGG GCAnGGGnTT AGCTTaAGTT 480

GGGnCyAwCw ACGCACGGGC GCT 503 



(2) INFORMATION FOR SEQ ID NO: 713: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713: 



AGTCAAAGAA TTACAAGCAT TGGCGGCAGC AAAAAATCTG CGCCTGATGG TTGGCTTCAA 60

TCGTCGTTTT GCCCCTTATA CGGAAGTATT GAAAAATATT CCTGAAAAAC AAACAATTTT 120

TATTwAAAAG rACCGCATCm ATaCAACGAG AGAAACAAGT TTTATGATGT ATGATTTATT 180

TTTACATGTC GTTGATACCG CAGTTTATTT AGCGGAAGGA CCGCTCCACG TGGTGCAGTC 240

AAAATTAGTG GAAGAAAACG GTCATTTAAA ACGAGCAATT TTACAATTAG AAACCGAACA 300

GACAACGATT GTTTGTTCCA TGGATTTACA CTCTGGGGCC AATACAGAAA CTTTTGAAGT 360

CACCAGCCCC ACAGGTACTT ATCGCTTGGA AAATTTAACG CATTTGACCA TCCAAACAGA 420

AGAAGAATAC CAAGTAAAAG AGATGGGAGA TTGGACGCCA ACATTGGAAA AAAGAGGTTT 480

TTACCAAATG GTTACTGCTT TCATTCAGGC CATTCAAAAA CCGCAAGAGC AAGAGTTAAA 540

GCAAGAAAAA GTCTATGAAA GTCATGCGTT ATGTGAAGAA ATGCTACGCC AACAACAACG 600

GCATGTTCTT TAAAGGGGAC TAGCAACTTT CTTTCTATCA TGCTAGGGAG GTGCGACAAG 660

TCCCTGnTAT GnGTCATGTT TGTCATCTGG AGCCATAGnA CAGGGTnCAT CATG 714



(2) INFORMATION FOR SEQ ID NO: 714: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714:

TTAACTTTTC CCGGTnGCCC CAAAnAcCAg GCyTACACCa ACTAATACTG ATGCAACACC 60

CACACTTAAT TtTTTTATGC TATACTTTAA TACTCTTTTT CTcCCTTTTT TTAAAATCAT 120


ATGCTTATTG 

x* X \D u X X I* X X \J 


TTTTTTCCAA 


ACAAACTAAA 


TTTGCTCCCT 


TCCTTATTTA 


AATTTAAAAT 


1 AO 


TAPATTTTTA 

1 iiv>ri x i i i ± 


TTATAAATAA 


TACAAAATAG 


GAATTCTTGT 


AAATATCACA 


AATATTTGTA 




AAAAAATCAA 


AAAAATAAAA 

*X«U*XXXX X *4iU*ii 


TATTATAAAA 


AAAACCGACC 


TTTACGGGGT 


CACTGGAACA 


300 


TPCCTAGAAA 


TCAAGTAAAA 


AAAATCCAGA 


GGATTTTCTC 

VJW4* X X X X V** X V* 


CTCTGGATTs 


ACTATTACTC 


360 


TAPATTTTTT 


AGTTATTTTA 

ii W 1 1 4* 1111 *i 


TTATTCAATT 

X X X X x x 


AT TCTC AT T A 


AAAATTTTTC 


TTTCTATATT 

XXX X * 4 X X X 


420 


APTTAAPAPG 


AGATTPPGTT 

nun i i Vk/^vj i i 


AATTPTTTTG 

X X X X X X VJ 


CTTTAGAACT 


TCCTTTAGAA 


GPTAGpTPTT 


480 


TTGr-TTTPTP 

1 1 VJ^r 1 1 1 V/ A \rf 


TGTTTHPTPA 

Iwl 1 1 Vjv 1 \*r*^ 


CTGGcAAAAT 


TACTTACTGA 


TTPTGTTGPP 

X X \* X X X WW 


TCACCAACCT 


540 


TATPTTGT A A 


APTAAPGGAA 


TPTTPTTPAT 

1 V-< 1 1^1 1 1 


GPTGTTPTTP 


TGTPTTAAT A 


TPAAPAAPAT 




TTQPATTTAP 

J. 1 111 


TTPAATAAPT 
1 1 ^rirt 1 rt/AV** 1 


TPTAAPPPA^ 


TPATATTTTT 

1 1 rt 11111 


TAPTTPTTTT 

1 r\\^ 1 Xvl 11 1 


GTAATAATTT 

O l r\r\ l nn 111 


oou 


1 1 1 1 1 1L 


ATTrXTAAATT 


TT&TSZiATZiT 
1 Inl ftttn Ini 


PTTTTPP AT A 
\^ 1 1 1 l^^nln 


TTPTfiPAAPA 


ATATPTAA AT 


1 9 n 


ptaptgptap 


LilOl 1 1 1 lVb«l 


PPTAPTTPAA 


PPAPTAPAPP 


TGATGTAAPA 

l un l o l nnLn 


TPATPGGTAT 

1 1 ^VJVJ 1 rt 1 


/ou 


TCACTAACTT ACCAGCTAAA TTTGAGAAAA ATCCACCATC AACCGTAAGC AATCCTTCTA 840

CTTTTTCTAA AGAAATGCCA ATAATTTTTT GAATCACTTT ATCTTCATAC GTTAATTTTC 900

CTTGTACATT TTCCTTCTTT GTTTCTTCCT TTACACTTAC TTTTGTTTTC TCCATTTGTA 960

TCCACTCCTA AATATTCTTT ATTTTATTTA TTTTGGAAAA ATTTTCCAAA TAAATCCTCA 1020

TTTTTAATAT AACTCCCTAT AAATACACCA ATCCCAACCA TAACTAATAA GAGAATTGTT 1080

TTAAAAAAAC CAATAGTGAA AAAGAAAAGT GCTAATACCA ATCCTATTAG TCCTCCAATA 1140

ATTGGATACT TATATTCTAG GAACAATTCT TTCATATTCT CCCCTCTTTC 1190



(2) INFORMATION FOR SEQ ID NO: 715: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715: 

TTTAAAACAG ATACAGGATC GATTCTGATT AAACGTGTGA TTGGTTTACC AGGAGAAGCT 60 

GTGCGTTACG AAAACGATCA ATTATATGTC AATAATCAGC CAATCGCTGA ACCGTATTTA 120 

ACTAAAAACA GAAAAAAAGA TCATGAAACG ATGCCTTACA CTACGAATTT TGATTCAAAA 180 

GAATTGTTAA TGCAAGAAAA ATTACCTAAA GATAGCTATT TTGTGCTTGG TGATAATCGC 240

CGTATGTCCA AAGACAGCCG TTCTTTTGGT GCAATACATG CAGATCAAAt CTTAGGGAAA 300

GCACAATTTG TTTATTACCC ACTCACTCAT ATGAAGATCA TTCCTAAATA AGTAAGACCA 360

mTAGTTCGTT TATTtAAATA ACGACTAGTG GtCTTTCtAA TCTGaAAAAT TGTyCTCTTA 420

GCTGnAAAAT GGTAAGGGTA AAGTAATGGT AGATGTAGGA GGnCATGCCA ATGATTAGAA 480

AAACAATTTT n 491

(2) INFORMATION FOR SEQ ID NO: 716: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 434 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716: 

TAGTACTAGT AATCGTTATT ATTGCTGCGG TGCTATACTT AATTGGTTAT TTTATGCGCA 60

AAAAAAATCA AGAACAATTA GATGAATTAG AAGTTCGTAA AGAAGCCCTG TTTGACTTGC 120

CAGTCTTTGA AGAAATCGAT GACATTAAAA AmATGCATCT AGTTGGACAG AGTCAAAATA 180

GTTTCCGTGA ATGGAATCAA CGTTGGGTAG AACTATCGAC GCGTTCTTTC GCCGAACTAG 240

AGAGTCAAAT TTATGAAGTC GAAAATCAAA ATGAAATTTT CCGTTTCATG AAAGCGAAGA 300

AAGCGGTCGT GGAAGCGAAT GAAACGATGA CCGAaATGGA AGCCGAAGTG GAAGTCaTcG 360

TAATGGCTTG AaAGAATtAC GTGaAAGTGA AGAGCGCaAT TCATTAGAAG TTCAAAAAGC 420

TTnGGAGGnC ATGA 434

(2) INFORMATION FOR SEQ ID NO: 717:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1278 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717:

AGTGCAGAGA TTGCATTATT TTTTATCGCG AAAAAAGAAC CACCTGGATG TGTGGAACTA 60

GAAGCGAATA AGGAAGGAGG AGAACAAGGA ATGCCTACAA TTAACCAATT AGTACGTAAA 120

CCTCGTAAAT CTAAAGTAGA AAAATCTGAT TCACCAGCCT TAAACAAAGG ATATAACAGT 180

TTTAAGAAAA CTCAAACAAA CGTTAACTCT CCGCAAAAAC GTGGAGTTTG TACACGTGTG 240

GGTACGATGA CACCTAAAAA ACCGAACTCA GCTTTACGTA AATATGCCCG TGTTCGTTTG 300

TCAAACTTAA TCGAAGTTAC AGCTTATATC CCAGGTATCG GTCACAACTT ACAAGAACAC 360

AGCGTGGTAT TATTACGTGG TGGACGTGTA AAAGACTTAC CAGGGGTACG TTACCATATC 420

GTTCGTGGTG CGCTTGATAC TGCCGGTGTT AACGACCGTA AACAAAGCCG CTCTAAATAC 480

GGTACAAAAC GTCCTAAAGC TTAATTATAA TAAGACAATA ACAACTTGAA AAATCATATT 540

TCGGAAGGAG GAGTTACGGA TGCCTCGTAA AGGTCCTGTT GCAAAACGTG ATGTTTTACC 600

AGATCCAATT TATAACTCAA AATTAGTAAC TCGCTTAATC AACCGTGTAA TGGTTGACGG 660

TAAACGCGGG ATTGCTGCTA ACATTATCTA TAATyCATTT GATATCATCA AAGAATCTAC 720

AGGTAACGAT CCATTGGAAG TTTTTGAACA AGCAATGaAA AACGTTATGC CTGTTCTTGA 780 

AGTTAAAGCA CGTCGTGTTG GGGGTtCTAA CTACCAAGTA CCAGTTGAAG TTCGTCCAGA 840

ACGTCGTACA ACTTTAGGTT TACGTTGGGT AGTTAACTAC GCACGTCTAC GTGGTGAACA 900 

CACAATGGAA GAACGTCTAG CGAAAGAAAT CATGGATGCT GCTAACAACA CTGGGGCTTC 960 

TGTTAAAAAA CGCGAaGACA CACACAAAAT GGCAGACGCA AACCGCGCAT TTGCTCATTa 1020 

TCGTTGGTAA GATCCTCTCT GTCTGTCGCG TAAAGTGGCA GCAGAAaGTC AATTTACmAT 1080 

CGATTTaAAT AAAGAGAGGA GTAAACATAC AAAATGGCAA GAGAATTTTC ACTTGAAAAA 1140 

ACTCGTAACA TTGGTATCAT GGCCCACGTT GATGCTGGTA AAACAACAAC AACTGAGCGT 1200 

ATCTTGTACT ACACTGGTAA AATCCATAAA ATCGGTGAAC CCATGAAGGT GCTTCACAAA 1260 

TGGACTGGAT GGACAAGA 1278 
(2) INFORMATION FOR SEQ ID NO: 718: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1463 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718: 



CGTTnCCACC AGGAACAACC ATTAACnGAT GTTACnTCCT GGTGGTACAA TCGATACTAA 60

TACACCAGGT AATTACGAAG GTGTCATTGA AGTAACGTAT CCAGATGGTA CAAAAGATAC 120

AGTAAAAGTT CCaGTAGAAG TAACAGACAA TCGCTCTGAC GCTGATAAAT ATACACCTAA 180

AGGTCAAAAA GTAACTACTG AATTAAATAA AGAACCAGAA GCATCTGATG GAATTAAAAA 240

TAAATCGGAT CTACCAAAAG GAACTATGTA TTTTTGGAAA GAAAAAGf AG ATGTGGGTAT 300

ACCTGGAAAT AAAAAAGCTA CTGTTGTAGT AATATATCCA GATGGATCTA AAGAAGAGGT 360

AGAGGTTGTT ATTTCTGTAG TAGATAAAAA AGCGCCGAAT AAACCTCaAG TTGATCCTAT 420

TACAGATGGT GACAAAATTG TTACTGGTAA AACTGaACCm AATGCaGATG TaACaGTwAC 480

GTTACCTGAT GGAAGTCAAT ACCATGGCmC AGCTGATAAG AGTGGTTACT TTAAAGTTAA 540

CGTTCCTAAA TTAGAGGCAG GGACAAAAGT AAAAGTAACT TCAACTGATG AATCTGGCAA 600

TACTAGTGAA CCTACTGATG TAGTTGTTTC GTCAAATGAA TTAAACGGTG GAAAAGGTAA 660

TGGAACAGAC AGTAAGACTA ACAACAATCA AGATAAAAAA CAATTCTTGA AAACTTATCC 720

TAAAAcAGGA gAgGTTgAtA GTAATATTTA TACTATTGCT GGTGGTTTAA TATTGTTAGG 780

AACTTTAGGG TTATTAGGGT ATGAAAAATG GAAAAAAGAG GATGAATAGA AGTTTAAATT 840

ATTAAATATT CTCTGAAAAG AGGAAGAAGT CTCTTCCTCT TTTTGTTTTA tTGTTGTTTA 900

tTTAAATAAT AtTGTGGATT TTATTATtTT TTATTAAAAT AATGTTGATA TTAAAATGAG 960

TAAATAATTA CTAGATAATA TTTATTTACT TATAGTAAAA GTTCATATTT AAGTAAAGGA 1020

GTTTTTTTTT TGAATACGGA TAAAGAAGTT TTATTAAAGT CATTACATGT ATGCTTAGGG 1080

ATACCTATAT TAGTATTTAA TGAAGATTAC ACTCTTGTAG AAGAATATAG ATCTGATAGG 1140

ACTATATCAT TATTTTATGA TTTCCCAACT TTTTTAAAAA AAGTTGGGAA AGACAAGCTT 1200 

AAATTTGATT ATATAACTGG TAATTATAAT GAGTTGTTTT TATTGTACGC ATACAATAAA 1260 

AAACTGTTTT TGTTTGGTCC ATTCAGATGT AATACCATAG AAAAAGATAA ATTCTATTCT 1320 

ATGGGATAAA GTCCGTATAA TTGTGTAAAA GTAAAArGGC CATATAACAG TCCTTTTACG 1380 

GTACAATGTT TTTAACGACA AAAACATACC CAGGAGGACT TTaCmTGACC CAAGTACATT 1440

TTTACACGGG AAAGCGAnGA GGA 1463

(2) INFORMATION FOR SEQ ID NO: 719: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1064 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719: 



CTAGAGTATT GAACCCGCAA GATAAGCAGG TGGGTTCCTC GATCGCAGTA TTGAACTACC 60

AAAAGAAAAG GCATGTTACC AACTTTGATA CAAGCAGGAT CCCTAGATAT CAATAAAAAT 120

GTTCGGTCAA CACAGTGAAA GGACAACGCG TACATCACAG ATTTACCATC TTTATAATAG 180

CATACTCTCT CArAATCGGC AACGAGATTG TTTCTTTTTT TGAAAGCGTA TTATTTTGTT 240

ATTTTTTCTG TTTCCATTCT TGGATACGCT TGAGTTGCTG ATGTAAGGCT TGTTCATATT 300

TACCAATATC AATCGGTTCG TAATAACGTG CCTCTTTTAA TTTGTCAGGT AAGTATTGCT 360

GATTGACCCA ACCGTTTTCA AAATTATGTG GGTATTGATA ACCAATGCCA CGGTTTAATT 420

CTTTGGCTCC AGCATAGTGG GCATCACGTA AATGATCAGG AACATCTCCC ACAGAGCCTT 480

GTCGAATATC TGCCAAAGCA GCATCAATCG CTGTGACTGC AGAATTTGAT TTTGGCGATA 540

AACAkAAATC AtGACGACAC TAGCTAGAGG AaTCCTAGCT TCTGGAAACC CTAGCTTTTC 600

GGCGGCTTGA ACAGCGGTGA CCGTCCGAGC GGCGGCCGCA GGATTCCCAA GACCGATATC 660

CTCGTAAGCA ATGACCATTA AACGACGACA AATAATCGGT AAATCACCTG CTTCAACTAA 720

GCGACCCAAA TAATGAAGTG CAGCATCTAC ATCGCTTCCT CGAATCGACT TTTGGAAAGC 780

GGAAATCACG TCATAATGGG CATCGCCATC TTTATCATGC GTTAAGGCTT TTCGTTGCAC 840

GCACTCTTCA ATAATCGGTA AGGTGATCTT AATTTCGCCT TGTTCATTTT TGGGtGTTGA 900

TTTAACAGCT AATTCTAGTC CATTTArCGC ACTACGTAAG TCACCGTTCg TTGCGCGkGT 960

CAAATGTgTA ATGCTTTTTC TTCTAAGTGG ACAGGATAAT CGCCTAAGCC ACGTGTGCTA 1020

AACGGnTAAn GCTCTTGnAT CGCTTGCTGA ATGCTGTTCC GAAA 1064



(2) INFORMATION FOR SEQ ID NO: 720:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 684 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720: 



GCTATTAnAA TTATATTTTA ATTCnAAAAT GnATAAAAAA GTTAATTCAG AAATATrCAT 60

TGAAgCAAAA TAAAATAATT ATTTCTGAGA cTGCTACATC TATAGTTTTA AAAGAAAATA 120

ACAATATTAA TAAACACGCA TATATAAAAA AATACATTGA GTGTTTAGTT GAATTGTATA 180

AAGATGACAA CTATAAAAAT ATTTCAAATA AAATAAGATT TTATCATGAT TCAATAAATA 240

AATTGTATGA AGATAATTTT GAATATTGGA TGTCACATGA GCAATCAGAA TTAGAGAAAA 300

TTAAGAACAC ACCACCAGAA AAATTAGAAG GTGAAGATTA TGATTATGTG GCTGCAGAGA 360

AGAATATTGA GCATTATCAA AGAAAAATAA AAGATATAGA AGTTAGTTTA ACAAACTATC 420

ATAAAGCTAT TGATTTCTTT ACAACAGTTA TATCCTTATA TTTAAAAATA GArTGGGATA 480

AAGCCAAAGA AGGCCAGTAA CAGTATTGTA AaTTACTCAA TTTAATTAGG CATTTATAGG 540

ATAGTTGAAA GCAGTTAAGC TAATATATGG TACTGTAAGT TAAATTAAAG TTTGCTGAGA 600

TACTTTTTTT TCGATTTGTC ATACGCTATA CTTAAATTCA TAACGTTGAT TTTAATAACC 660

AAATTTATGA ACGGAGATGT CGTT 684



(2) INFORMATION FOR SEQ ID NO: 721: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721: 

TGTCTATTCT ACTCCCATTA TAGAACAGCA AATCAAGCTA AACAATATAT AAACACTAAC 60 

CCATTGCTTA ACTTTATTTC CTGAAACAAC CTATTAAATT GTTACTACCG CTTCTAATAA 120 

TAGCTTTACT TGTTGTATAT CTTCACTATT TTTTTCACTA GGTGTCTCAA TCATAAGATC 180 

TAGTTTACCA ATCGCTTCTT CAATATTATC CGAACCAATA CAAACTTTTC GTTCAAAATC 240

TTCATGTATT TTACACGCAT AAGACCTTAC AAGTTGTTCT AAAAATCCTT CAAATAACTT 300

TACTACTGCT TCTTCATTTT CAAAAAATAT ATTATCCAAT AATAAAAATT GAACTAAATA 360

TTCACTCGTT TCTTCATATA AATTCAAATA CAATTGATGA TAATATGGAA AACTGCACCA 420

ACAACGATAG GCACTAAATT TACCATTTAA AAAACGTCTA TACTTTTGTC GTTTTAATTC 480

TAATtCTTCT GATAACATTC TTCTTTTAAT ACACCTCCTT CGTTTGAAAT TTCCTTCAAA 540

TTACCACTTC CTTTATGAAG AAAAACAATA CTGCTCTTCA ATCTTCTCTC AACTAGCCAA 600

TTTAATGGAT TCAATCTACG ACTTTCCAAT AATTCTTTTT CTTTTCTAAC TGGACGTTTC 660

TCTTGTTTCA TTCCAATCAC TCCCATTTTT TTATTAACTT TACCCACTCT TTTTGAACTC 720

ACTCAAAATA CTtTGAACTA AATCACTACT AATAAAACCA GCAAAATATA GTTGCATTAA 780

GTATCTCCGA GCATAGTCAT ACCTGCTCAT TGGGAAAAAT GGGTTCGTTT CAAAAGTAAC 840

AAATAATTCC CTTAATGAAT CAATCGCTTC ATTtGATAAG TtGAATTTTT TtGGATtCTT 900 

AATTAATTCC TCTAATCGAT tCATCkTTtA ACTAACCACC TCATTTTTTA TTATTAATGT 960 

CTATTAACAA ACCTTGTTGA CACAAATTAA TAATAAAAAG GGCACTCGTT AGAATGCCCT 1020 

TTACAGCTGT TAAACGGCGC TACCGTTCGT CCGAATACAA TATTTATTTA CTTCTATTGT 1080 

TTATTTTCTT GTTTTTCCTT TGGTTGAGGT GCCTGTGTTT 1120 



(2) INFORMATION FOR SEQ ID NO: 722: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722: 

AAnGAnCCTC TACATACAGT AGGCCCATCT AACATGTCTG GTnCTTTTCC TAACATCTTA 60

GCTAGTAGCC TTAGGTnTCT TTCACACTTC TCTTTCCCTA AGTATTTCTT TGTAACCGTC 120

ATGGAAAAAA GTAATCTCGC TTTCAACGTT AAAAATTGGA TCTAATCCTT TGTTGTTCCT 180

CAACTCTTCA TGTATTAGAC CATAATACTT AATTTTATTA TTGTTTCTAA ATAATCGACT 240 

TTGGCCATGT GTGATAACAT TATCTAATTG ATATATTGCA CAACTAACTC TTAATGAATC 300 

TTTAAATTTA AACCTGTCTA ACATATCTAT TAAGAGCCTT AATTTGGTAA GCTCATCTGA 360 

AGATACATAT TCATCTGCAT CAATAATTAA TATCCATTCT TTACTTGCTT TTTGTATTGA 420 

ATAATTTCTC GCATCTGAAA AATCATCTTG CCAAGTATAA TCAAATATTT TTACTTTTTC 480 

GCTGCnATTT CTTTTACTAT TTTTTTTGTC AAATCAATGG ATCCTGTATC AATAAATAG 539 



(2) INFORMATION FOR SEQ ID NO: 723: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723: 



CAGTTGAAAC AATTGCTACT GCCAATTTAT GGCTACGAAC AGCAGATCGG ATTAAAATCG 60 

TCGTTGGCGA ATTTAACGCC TATAGTTTTG ATGAATTATT TGAAAAAGTT AAAGCACTTC 120 

CTTGGGAAGA TTATTTGCCT TTAGATGCAG AATTTCCAGT TGCAGGTAAA TCAATTAAGT 180 

CAAAATTGTA TAGTGTTCCT GATTGCCAAG CCATCACCAA AAAAGCCATT GTTAACCGTT 240 

TAAGAGAAGT TTACCATCGT CCAGCGACGG TGCCTTTAAC CGAAACAGGA GCTTTGTTTA 300 

AATTAGAAGT GGCTCTTCTA AAAGATAAAG TGACGTTGAC GTTGGACACG ACTGGACCAA 360 

GTTTATTTAA ACGAGGCTAT CGCATTGAAA AGGGGGGCGC ACCTCTTAAG GAAAaCATGG 420 

CAGCTGCTTT AGTGAwGtTG ACCAATTGGC GCAAAGaTCG TCCTTTTtAT GATCCAGTTT 480 

GTGGTTCTGG AACAATTTGT ATTGAAGCAG CATTAATTGG GCCATAACAT TGCCCCTGGC 540 

TTTAATCGTA GTTTCACTTG TGAAACATGG GATTGGGTTG ATCCAGCAAT TTTTGAAAAA 600 

GTACGTAACG AAGCCGAGGC AAAGGCAGAC TAC 633 



(2) INFORMATION FOR SEQ ID NO: 724: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 672 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724: 



ATAGTTTTAC TCAGAAAATA AAAAGAAATA AGGTATGAAA TGATTGTTGT TATAGTTGGA 60 

TATAATTTTG AGAAACCGTC TGAAGATTTT AAAAATGCAC TGCCTATTAC TTCACTGACT 120 

ATkGCTAWTA mCAAaTATAA ATATGGCATA GAAATtGCtC CTTTATTAaG TGATATTTAT 180 

ATTTTAcmyT TTAGGTGGCT AATTCaATAA TAAATAGACT TGTATTTTTC TAGTGATTTG 240 

CGAGGTATTG ATTTTTGTGA TTTCAAAAAT TTTAGGTAGG TTCTGAAACG AGTTGAAAAC 300 

GAGTTTCTTC TTGTCTTGAT AATATAGGTA ATTATTCAAT AATATAGGTA ATTATTCACG 360 

GCGAGTTATA TCTACGCTAA ACCGTTGATA TTATTGGTTT AAAACACAAA AAAAGTTGCT 420 

TTTTTCAGCA TACTATTTTA TGATTAAGGT GTCGAATCAA AATCAAAAAT AGAGAGGAGA 480 

AAAAGCAACT GTTTTTTGCT ATAAAAAACT CTTAAGTTGT TTGGTTCTGT TGCAAAGTTT 540 

TAAATCTACT ATCAAATAAG GTAGAATAAT AGAAAAAGAT AGCAGGAGGA ATGACGATGA 600 

ATCATTTTAA AGGAAAGCAA TTTCAGCAGG ATGTGATTAT TGTAGCCGTG GGCTACTATC 660 

TcGTTATACC TT 672 



(2) INFORMATION FOR SEQ ID NO: 725: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725: 

GGTAAACCAG GATGTCCTGA GTTCGCTTTT TGTACGGCTT CAATACTTAA TGTCCGGATT 60 

GTATTAACGC CCAATTGATC TGTCTTGTCA AACAAAAAAA TCATCTCCTA ATTTAGGTTT 120 

TTTTTACCTT TATTCATAGT GTAGCTTATT TCGACAATGA AAGCAATTAC TTTCTGTTGT 180 

GAAGTCCTTT TTCACGCTGA ATTTTTTTTA GTTTTTCAGG AGTCACATCG GTCCCTTCTT 240 

GATCCACGAC CTTCATTCCT TCGATATGGT GACGCATCCC TCCACGAAAT GCTTCTAAAT 300 

ACTCTTTACG TAATTCTTGT TGTTCGACTT TTTCTTTCGC ACTAAGTTCT cCACTTTTGC 360 

TTaTTCGCTA ATTCATTtAT ACGGGCAATT TTTtCTTtCG ATAACATCAA AATCCCTCCT 420 

TACTCTCACt GTCTAwATtA TAAGAAcTGC CTtAAAAAAA CAaCtATATt CTGATATTtA 480 

TAAAaCaCGA aCGCTtGTTt GATTcGTcGC ATTTATGGTA CACTAnTTTT nTTAnGGAAA 540 

GAGGTGCGGG TTTGGCGAAC GACAGAACAA GCAATAGAAG ACTAAATATA TATACGACA 599 



(2) INFORMATION FOR SEQ ID NO: 726: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726: 



TTTATATCCT CAGGCACGAT CTAGATGTAG CTCTTAGTAA ATCTTTTTTA AAGAAAAATT 60 

GTCGCTTTTC TATTTCAAAT TTTGAGTATC TATTTTTATC ACACCATTCT GAAAAATTAA 120 

GATATCCAAA AGCGCCGTAT TGTTCTCCTA AAATnAAAGC AACCCATGTT TTCAACGCAA 180 

CCTCAATTTT CTCTATTGCA TGTAACAAAT GAATTCTTAA ATTTTTATCT TGATAGTATC 240 

TTTTCAAAAC ATCGGAAAAA TATATTCCTT TGTAATCATA TACAGTATTT TCATTAACAA 300 

TTTTCTTATT AGCTAAAGGC GCAGCAAACT GCTTAATTCG ATAATATCCT ATAACTTTTA 360 

ATTTTTCTAC ATCTTTTGAG GTCATCTGCA TACCTCTTTC TTCAAAAAGT GCCATTTGTT 420 

CATCAAACGA TAATGATAGT GGTTCCTTCA TATTTTTCCT CCAAAATAAA AAAACCCCCA 480 

TGTCAGTACG TGACTGTCCT AAAAAGGGAT GGGGGGATTG TCTTCTTTTT TAAAGAATAA 540 

TATACTTTTA AAATATTGTC AATAGCTTTT ACTTATTTGA ATGGACCATA CAGGACTCGA 600 

ACCTGTGACC GAACGGTTAT GAGCCGTTTG CTCTAACCAG CTGAGCTAAT GGTACAAGTA 660 

AGAACCACCT TATTGAGGAT AAAATAGTTC TTAC 694 



(2) INFORMATION FOR SEQ ID NO: 727: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727: 

TATCCAAGGA GTGATAGGAC ATTAATCGAT CCATAATTTT ATCTGGTGCT GGCCCCCGGT 60 

CATAGATATC CCCAATAACA TGTAGGTGAT CGATAATCAG ACGTTGGATT AGATAAGCTA 120 

ATTCTGTTAC GAAACGTTTG GCTTCACCTA GTAAAATAAT TTTTTCAATA ATTTGTTGAT 180 

AGTAACCGTT TTTGGTTGTT GTTTCATCAT ATTGATAAAT CAATTCTTCG AGAATATACC 240 

CATAGGTTTC AGGAAGGGCT TTGCGAACTT TAGAACGTGT GTACTTCATC GAAGAAGAAC 300 

GGACGATTTC AACCAGACGG CGAATCGTTA ATAGCCACCA TTCATnAGaG AGGGCGCTTT 360 

CATTTTCTAA AAGTGTTAAT TtTTCTTCTG GaTAGTAGAT AATAAAACmA AGCTCGTCCa 420 

TCTGATGGGC ATTTAATyCT kGTTTAAATA AAAATkGAAC TTTTTCTCGG aTtCGCCCAG 480 

AACCaTTTCt TAAAATATGG TCAAAAGCTt CAAaTTCACC ATGTAAATCA CTGACAAAAT 540 

GTTCTGTTCC TTTCGGTAAA TTCAAAATTG CTTCCAAATT AATAATTTCT TCAATTTGTT 600 

GATCGCGGTT CAGTGTAGCC AAGTCATCTG CCTTCTTTCT TTTTCGTATA TACCAATTTT 660 

AATTTAAAAG ATTAAAGTCT TGATAATAAA GGTTTTTTAT TCGTATGCCA ATTATTGTAG 720 

GCTAATTGGC GAAAAAAAGC nACCGGAGAA AGTCAGAGGA AAATGGGGGA AATTATAnGG 780 

AGCnACCQGG GTTTAAGCCA AGAATTJTAA TTAAACCAAA TGGAAGGGCC GGGAAACCAT 840 

GTAATTGGGA CAGCCAAAAT GGATCCGGGA AATCCATTAG GAAATCGGnG GTTGGGACAC 900 

CGTAAAGTTG GCGCCAAAAA TCCGGGAC 928 



(2) INFORMATION FOR SEQ ID NO: 728: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728: 

GCCTGGAGCA GAGCATGCGA GAACTTnATA ATAAAAACCC AGAAAAATAT AAGGAAACAG 60 

TAGCTGCTTT TCTAGATAAA TATATAAAAT AACAAATTTA TTCTTTACTA ATAGTAGCAA 120 

CAACAAAAGA CTTGTTCTTA AGTTAACGAC TTAGTGAACA AGTCTTTTGT TGTCAGTTGA 180 

TTTTTTTTAA TAAGTTGGAA TTTACCCATG CAATCTAGGG CATAATCTGA TAGGATAGTA 240 

CTAGAAATGA GGGAAGGATA ATGGAAGAAC AATTAGTCGA TTATTTACAC TATTTACCAA 300 

TAGAGCGGGG cTTATCaATA ACACACGGCG GArTTACGAA CGAGAtTtwA AAAAAtACGT 360 

rGCCCTTTTT ACCAGGACCA AGGCnGCATT CTTGGGCCGA ATTGGACCGT TACCnGGnTT 420 

(2) INFORMATION FOR SEQ ID NO: 729: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729: 

GATGGGAGGC AAAAAAATGA ACATTGTCCT GAACAATCCA TTGGAAACAG AAGCAATTGC 60 

TAAAATCATT GGACAAAAGG CACAAGCTGG TGATGTAATT GTCCTGACTG GTGATTTAGG 120 

GGCCGGTAAG ACGACAATGA CGAAAGGGAT TGCTTTAGGA TTAGGCATTT CCCAAATGAT 180 

TAAAAGTCCC ACCTATACGA TTATACGCGA ATATCCACAA GGACGACTTC CGTTATACCA 240 

TATGGATGTC TACCGTGTTG AAGAGGGTGC GGATGAATTA GGGTTAGATG aGTATTTTGA 300 

AgGAGATGGG CTATCCGTTG TAGAATGGGG CAGCTTGGAT TGAAGAAGAA TTGCCAGAA 359 



(2) INFORMATION FOR SEQ ID NO: 730: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730: 



TCCACTTATT TTTTCATTtT CCATTGACAG AATAGTTGTa ATTGTcCAGA wAATTGTATA 60 

TAATAAAGAA AGTTATTTTC ATATTATTTT TTTTCAATTA AAGGAGAATC CTATGAGTGA 120 

AACACAACAA ACAACCTTAA AAAAACAATT AAGTCGCGCC ACATTACGAT GTTAGCCTTA 180 

GGGGGAGCGA TTGGCGCAGG TCTTTTTAAA GGAAGTGGCG AAGCAATTGG TATTGCTGGT 240 

CCTTCTGTTT TAATCGCCTT TCTAATTGGC GGTGCTGTTC TCTTCATCGT CATGAGTGGT 300 

TTAGGAAAAC TTGTTTTAGA TGGCGGArAC ACACATCATG GTTTGTCTGG CTTGGTTCGA 360 

CCATTTTTAG GTGCACATTC TGCTGATTTT ATTGATTGGG TTTATTATTC CATGTGGACA 420 

ATTAATATTA TCGCAGAAGC GGTTGCCGCA GCTTCCTTTT TACAACTTTG GTTTCCTAAT 480 

ATTCCTGCTT GGTTTTTTGT TTTTATTTTA rCTATTTTAA CGACGCTCAT TwaTCTTTAC 540 

TCGtTCGTTT ATTTGCAGAA AcArAAtATT GGTTGGCGTT TGCTAAAATT AGTGTCATCA 600 

TTTTGTTAAT TATTTTTGGT GTTTATCTGG TGGGACAACA AATGCTAGGC AGTGGTGTCT 660 

TCCCTACTTT ACAAAGTATC ACCGATCAGG TGGCTTCGCC CCACAGGCAT GAAAGGTATA 720 

GTTAATTCTT ATTAGTAGTT ATTTATCCTA GGCGGTTCAG AATAATCGCG ATTACTGTCA 780 

GTGAAGCAGA TGATCCTAAA AAAGCCATCC TAAAGCGATC GTGGTGTGAT 830 



(2) INFORMATION FOR SEQ ID NO: 731: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731: 

TATTAAAAAA AATAGTTGnA AGAGTACTTT sAAATTTGTA AAsAsTAGAA ATGATTTTTA 60 

AyATTAGTAA GsTTAAAATA GTCATTACTA TAGTTGAATA AACTAATCTT GAATAATACT 120 

TTCTGTTATA TTTTATTGGA TTAATATCAA ATTTTCTACT TAGAAATCTA TAAGCTAtGA 180 

TGGTTGTAAA CATCGTACAT ATAATTGTAG CAATCGATAA TCCATACCCT TTAAATAGAT 240 

ATATACATGG AATTTGAATA ATAAGTTTTA ATAAAAGAGT TATTGAAGTG ATTTGCATAG 300 

CTTTCTTATG AAAACTTAGT GCTTGTAAAA TAGATAATAC TATTGTAAAA ATCCCTAATA 360 

GAGATGTGCT AATAATTGTT GCTGATAAAT ATCTACTTCC ATCTAAATTA AATCCAAAGA 420 

ATAATGTATT AAGAGGATTG CATAATATTA TCATTCCTAT CGAAGTAGGT AGTAAAAATG 480 

TAAAGGCTAA TCGTAGTATT TGTGAGGTAC CTTGTAATAA CTCAGGTTTT GATTTTACAG 540 

TACTTATTAA AGGTAGAGCT GAAATAGTAA TTGATCCAAT AATACCTAGT AGTACTGGAG 600 

CTAATTTATT AGGATTTGCA GAGGCTAATG TATACAATGT TTGAAGTTCg CTTAAACTAC 660 

TTTGAGGAAA AAACAAAGAA TACAAAACCT TTAAAGAAAA TTGATCAATT AATTGTAAAA 720 

TAGAAATAGC CGATCCTACA TATACAAACG GCAATGATTC CTTGATTATA TCTATAGCAA 780 

GACCTTTCAT GTGAGATATA GATTTTaAAG GTAAGAAGTA AAAATCTTTA AGTTGAAACA 840 

AATTTCTTTT TTTTCCGATA A 851 
(2) INFORMATION FOR SEQ ID NO: 732: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732: 

TACGACCTAC TAACAAGCGG ACGTCAATCG TAATTTTTTC CAACGGTCGC TGTTGTAGCT 60 

CGGCTTTAAC TTCTTGAGAA ACTtGCCACT CTAATGGCGA CATTTCTAAT AAATCCAACC 120 

GTCTTTCCTC GGGAATGTCA AAGCAATAAG TTATTCGTTC ATCTACTTCC ACAGCTAATT 180 

CCTCGGCAAA ACGTTGGACT ACTTTTgATT GGAATAACTT TGTTTCTTTT CGTCATTGGG 240 

ATAAAAAGCC GCTCGTAATT CCTTTAAGTA ATTCTCTTCA GGAATAATTT TAATCACAGT 300 

CCCGTCAGCT TTTAGTACTC GCCGAAATTC TTGATAATGT GAAGGGGAAA AAATATTTAA 360 

AATCGTATCA AGCCCTTCAT TCGCAAATGG CAAATTCGTT AAATCCGCCA CACACCAAAA 420 

CGCATCAATT GGCTGATTAC TTGCTAAATA GATTCCTTCC TTTGATAAAT CAAACCCGAT 480 

TTTCAGTCCA GATAAGCCTG CTTGACTTAA CTCAGCAAGA AAACTCCCTT CACCACAACC 540 

AACATCGACT ACTGTTTTAT TTTGAGGAAG ATAGTGCATA ATTTTGTTTA ArACAGGCGC 600 

GTACATGCCA CTTTGAATCA TTTTTCCTCG AGAAGTAAAC ATTTTnTnAT nATATnCTGT 660 

CT 662 

(2) INFORMATION FOR SEQ ID NO: 733: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1067 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733: 

AGAATCGATA TCTGTAATTT TCATTCTTAG CAGTACCTCA GGGTCAACCT CATATCCTGT 60 

AAAATCTTGC TTCATAGAAG CCACTAGACT AGCTCCTAGA TAAAGTAATT CGAAAGCATT 120 

AAATTCATCT GATGTAGACG GTCTTTTATA ATTATATTGT TGAAAAAGAG AGCCAATATA 180 

AACTGTTCTC GCTGAACCAT ACATCATTAA TTCCTTAATC ATCTTATTTA AAACAGCTGG 240 

CTTCAAATTT TCAATTTTTT CAGTATCCGC TATCATATCC ATCCAACGAG AGAAAAGGTC 300 

TTCTATTTTT TTACCTGAAA TTGAACGATA ATAGTTATCA ACTTGCAATT CCCTAGTTGT 360 

TTGATGTTCC TTGTCTToCA TTATATTACT ATTTTCATTT CTTATCTCTT CTAGTTTTTT 420 

TTCATGTAGC CTTTGTGGCA ATTCAGATAA GCCATTTTCG ATAGCCTTTT TTATCCCTTT 480 

ATAGCCAGGA TAAATAAAAA TTAAAATAAG TAACTCAAAC CAATTATTTT TTAACAATTC 540 

AAACATTTTT TCCTCTCCTT ATAATTAATG GTAAATTTCC TGTAACATTA ATTATAAAAA 600 

ATAAACGAAA AATAAGCTAG TTTGTATTAT CTCTTATTTT TAATATATTG ATTTCCATTC 660 

CAATTTACCA ATGATTTATT GGATGTTTTA ATATTAATTA CATTTTTTAA ATTATCTGCA 720 

ACTTCTCTAT CAATAGTTGG ATATAAATGT GGGTACGTTT CCTAATGTTA TTTGTATATC 780 

AGAGTGTCCA AGCCTTTTTT GAACTGCTAA AGGATTAGCA TTCAACTCAT TAATCAATAA 840 

ACTTGCATGG CTATGTCTTA ATCCTTTTGA TAGTGACTAT TAATAAAATT CATCAATTAA 900 

AAGGAAGCGA TTTCAACATG TCAAAAGAAA CTTTTCTTAA GTATGGTTTT GTTGTCCTCT 960 

TCATAGGAAT TTTTTTACTT CGTGCCCAAA AATTTATTTC TGAGCCCCTT TCGTACCTTT 1020 

TTTCGGGTAT TCTTTTGCTT GTTGTTGCGT ATTTAATACT CGTGAAA 1067 
(2) INFORMATION FOR SEQ ID NO: 734: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734: 

ACTTAAAGGA GGGAAAATAT GTTCGCAGAA AGATTGAAAT TATTAAGAAA ATCAAAGCCA 60 

AATTTGACTC AACAAGATAT GGCAAATATk TTAGGCGTTG CCAAAACCAC ATACGCATCG 120 

TATGAGCAGG GAAAACGAAC TCCTGACGCA GAATTACAAA ATAAAATTGC TGATTACTTk 180 

GAkGTATCGT TAGACTACCT ACATGGAAGA GACGTTATTA ATAACTCAAA TCTAGATAAA 240 

AATATGAATA GAAACAACGA GCTTTCTCCT GAACAATTAA CAGTAGCCGC ACATATAGAT 300 

GATGATGTTT CTGAAGAAGA AATGAAGGAA ATACTATCTT TCATTGATTA TATAAAGAAA 360 

CGCGATCACT ATAAGTAGGT GTAtTTGATG rATAAATTGG GAAAGCyTGA TGGcyAAwTa 420 

TCCkGAACtA AACTACCnGA TTAGGA 446 

(2) INFORMATION FOR SEQ ID NO: 735: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735: 
AATTCTTCAG CGTATACATC ATCAACTGTT TGTGTTGGGT CGATAACAAT GTATAAGTTT 60 
TTAGTTTTTT CGCTGACTTT TGTAGATTGA TAGATGTTAT CCAAGCCCTC ATCTGTTTTA 120 
TCTTTGTAAG GGAAATAAAC TAAATCTGGT GTGCCATTTT TGTAAAGAAT TTCCATACGT 180 
TCGATATCTT CATATTTTAA AGCACGAGAG AACATCCCTA ATTCTAAGCC ACCTAAGTTA 240 
ATATCGTGGG TTTGAACATG ATCTCCTTCT TGGAAAATTT CAATTTTGAA CCCTTCACAT 300 
GGATGAATTT CAACAAATTC ACTACCATGG ATACGGCCAA AACTTGTTGT AATATGTTTG 360 
ATCCATAAAT CACCAATArA ACGACGATCA ATCGTCCaAG TTTCACCATT TCTGAAATAr 420 

AATTTAAGTC CTGTCAkTTC TCTAGCCATA GCACAArTCT CCyTACTTAA AAATAATTTC 480 

TTACATTATT AGTTTACCAT ACTTTTGATA AACGGTCACA AACTTTCGCT GTAGATTCnT 540 

AAATCTTTTG TnATACTTTT TAAAAAAGGA ATCCAATCnC AACTACCTGC ACGTTGGTGG 600 
AGACCCTTTG GCAATTnGCC GTCC 624 
(2) INFORMATION FOR SEQ ID NO: 736: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 736: 

GTCTTGAAAA AAGAGGAGTT ATTAAGCTTT ATAAAGCTGT AGGTAACAGT GTTGAAAAGT 60 

GGTCAGAAGT AGGAAATCTT GATTTTTGTA GTAGTGGTTC AGAATACATG ATAGAGTGCC 120 

CTAATCTTTT ATTTGTAAAT GATACACCAA TTCTGATTTA TTGTCCTCAG GGTCTTGATA 180 

AATCAGAACT TAGTTATGAT AATATTTATC CGAATACTTA TAAAATTTGT CAAAGTTTTG 240 

ATGAAAGAAC TGCGAAACTT GTGGGTTCTT CTAAAATTAA AAATCTTGAT TATGGGTTTG 300 

ATGTTTATGC AACACAAGGA TTTAATTCTC CTGATGGACG TACTTTAATT GTTAGTTGGA 360 

TTGGGCTTCC TGATGTTGAA TATCCAACAG ATAAGTATGA TTATCAAGGC GCAATGAGTC 420 

TTGTTAAGGA GCTTTCTATC AATAATGGAA AACTTTATCA ATACCCTGTA GATGCTATTA 480 

AAACACTGCG AACTAGTTGT GAAGATTTTA CGTCAAAAAT AGAAACAAGT AATACTTATG 540 

AACTGGAAGT GACTTTTCCG GCTAATCAGA AATCAGAATT ACTTCTTTTT GCTGATGATA 600 

AAGGGAATGG ACTCAGCTTA ATAGTTGrTA CTAAAgAAGG AAAAgTTATT TtAGATCGGA 660 

GtAAAGCAGG aATTCaATAT GCaTTGaTTT GGtAwGaCtC GTGatGCtTA ATAGATCnGG 720 

AGAACCAATG CTAATATnTT GGAGAAATCC ATCATGAGAT TTATAATAAG GGAGAAGTAT 780 

TACnGCGGnA TTCCGAGAGG 800 
(2) INFORMATION FOR SEQ ID NO: 737: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737: 

TGTTGTGACA ACTTCTATAT CTTATCATTA TCTCAGTTGT TTGTCAACAA CTTTTTAAAA 60 

CTTTTTTGTT ATTTTATAAC AAGTTTTTTT GATCAAACGA TTGATTTAAA AGCAATCTCT 120 

TTAACGACTT TGTTATTCTA ACATGTTATT TATCTTTTCG TCAAGAAAAA TTTTTAACTT 180 

TTTAAAACTT TTTTCTTAAC GAACTGTTTG TCAGAACAGA TATTAATATA CCAAGGATTT 240 

TTCTAAAACG CAACTATTAT TTTGCAGAAA AATAACATTT TTTTCAATTT CGAACGATGA 300 

ATATTTTCTG cAAAGaAAAC ATTCGTTTCT TatCTTATAA TCCCTTTTCT CTTACTTAAA 360 

TGTTCGTTAA TCATTCGTAA AAAAAGCAAC GTACTTTATT GTACGTTGCT CTCTTCTTTT 420 

ATTCCAGCGA TTGGGAATAA AAATTCCAAA GATCGTTCCT GGATGCAATA CGCTTTCTGC 480 

ATCAATACGT CCTTATAGTT 500 
(2) INFORMATION FOR SEQ ID NO: 738: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 562 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738: 

AGTAGAATCC GTCTTGATTT TATGCATCAC TCGTCGCAAA CATTTGTCTA TTTCTTCTTG 60 

ATAATCTTCA CCAATTAGAA CGAGATTGTA TTCTTTTTCT ACTTTAGTTT TAGCACGAAT 120 

AATTGTTCCT ACTTGCTCAT GAGCTTCTTG AATAGTATTA GAGAAGGCmG CAATAAATTT 180 

TAAGCTATCT TTGTTTAGGA AAGTGTTCTC TTGAACTTCT GAAAACTGaT CGAGCAATAA 240 

TTGTTGTTGC AATCGTTCTT TTTCAGTATC TATTGTATCT ATTGTATCTG TATTTAGATT 300 

ATTAAATACT GTACTGaTTT TCATACCCTC TGaTTTATCA ACGTTTTGGc AGGGTATGAT 360 

TTtCATACCC TCTTGATTAC CAAGGGATtC AGCGTTTGTT TTTCTTTGGG GGGTGGGTAT 420 

GATTtTCATA CGCTCTGATT TATCAACGTT TTCTTGCATA GTATCATCGA ATGTTTGCAA 480 

TTGATAAATA TCGTTTtCtG TTACTtCAaG TTCCGCTAGA TAAAGTCTAT TAGGATTGTT 540 

CTTACCTAGT GTTTTATTAA AA 562 

(2) INFORMATION FOR SEQ ID NO: 739: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739: 



GCATTATTTT TTCGTTTTAA AATAGTATAA ATAATCCTTG TAATCAAAAG AAGGCTACCT 60 

AAGAAAAGAA ACGCGTTTTC TTTTGTTTCT CCTGTTTTTG GCAATTCCTT ATTACGAATC 120 

GCCTGTTTTT CAATGGTTAT TGAAGTAGTT TCTGTGGTTG AATTCGTGGT AGTATCTGTT 180 

GTCCCACTCG TTGGTGGTGT GGTACTTTCT ATAGGTTTTT CTGGCGTATA GTGATTAGTA 240 

AATACTTGAC CTCCATCATA CTGAGTAGTT GCTACAAGAT TTCCTAAAAG CGCCTCGACG 300 

GTTACTTGAA CTTGAAGGTt GTGCGTAtCG TAGGAAAcAC CTGTCTGTTG CGTGkTTAcT 360 

kCCCGAATGG TGTATTGATA GTTTCCAGGA GTTCAAAGGT TAATGGTGAA AATTGAATTT 420 

TCCCTAAAGT ATCGTTA 437 



(2) INFORMATION FOR SEQ ID NO: 740: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740: 



TCATTTAAGA AAAATGAGCG TGGGACAAAA ATCACTTTGG ATTTTTGCTC CACCCTCAAA 60 

AACTGATAAA CGGCGGGAAC AGAACCAACT CCTTCGGAAA TAAGCCGAAA TTCTCCAAAA 120 

ATTAAAGAGC AATTTTCGGA AATTTCTTCT TATTTCTCGG AGCTAAACGG TTCTGTCCCG 180 

ACCTCATTAT TTCTTATTTT GCTGATTCTT AATGGTATAA GTCGAAACGA CCTTTAGCCA 240 

TCATTTCTTT GGTACCACCT GTTTCAAAGA GCGCTTTGTT CATGATAATT TTTTTCATGA 300 

ATGAAGCTGG GTATCCTTTA ATGTCTGTTT TACCAACGAT TCCTAAAGCA CGTGTATTTC 360 

CGACAGAACA AACAGAACCT TGTGATTTAA AGCTGAAAGG TTTAGTTGCT TCCCCTTTCA 420 

ATTGTGCTTG AATATTTTTA GCAGCATGAG CGCCCATCTT CAAGGCGATT TGTGCTGTTG 480 

TtGGGTAgGT CGAwTGGGTT CnG 503 



(2) INFORMATION FOR SEQ ID NO: 741: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741: 

TATTGACAAT TAAAAATAGC GGTGGTATAC TTTTTTTAAG TTATCAAAAG TATACTAAGG 60 

AGGAGTTGTC AATGGGAAAA GTATATGTTA TTGGAAATTT TAAAGGtGGG GTTGGTAAAA 120 

CTAAAACAGT AACCATGCTT GCATATGAAA GTGCACTACA ATTAGGACGA AAAACGTTAG 180 

TAATTGATCT AGACCCTCAA GGAAACGCTA CTAGAGTATT AGCTAAGACG GGAGATTTAG 240 

CCGAAATCAC TTATACTGTT ACTGAAGCTT TTCAAGAAGG TAGTCTTGAA CCAGCAATCA 300 

CCAATATCTC AGAAAACCTT GATCTCATTC CAGCTAACAC TGCTTTTAGA AATCTCACCA 360 

AAATTTTAAT GACAAAATTT CCGTCTAACG AATTTGACCA AATAAATTAC TTAAATACAT 420 

TATTAAAACC CCTTAAAGAG AAATATGACG CAATTTATAT TGACGkTCCA CCAACTATTt 480 

CAGATTTtAr TGrTAATGCm ATGCyTGCmG CAGATTACTG tAwTAwTGkT CyTCaAACmC 540 

mAGAGTATCT TTAGACGGTG CTCAAACATA TATCGCCTAT ATGCAATACT TAGCTGACAC 600 

ATACGATAAT GGATTTACAn GGCCCTAGGG AAnAATTCCC TGGTATGTTG CGGnCCCGGG 660 

GGTAnGGGAA GCC 673 
(2) INFORMATION FOR SEQ ID NO: 742: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742: 

TTGGTAGGCT GGTAAAGTTn GnGGTTTAAA CTCmCCaTTA CGAtCtCTAG GTATtTCCCA 60 

AkGTTAATTC CmCCATATTC GGTTTTGaTT GaTCGaAAGT AAGAACCGTT TCTCGAATTA 120 

CCTGAATTAA AACCAGTGCG ATCATATTTT TCGtAATCTA AAAAAGCCGT TAATTCAGTC 180 

CGTAGGAGTG TGTTTATCGC TTTTTCTAAG TGCGAACGGA ATAATTCATT TAAATCGCCT 240 

TTAGTGACTA GAGTTTGCAC AATTTCTGTA GTAAAATCAT TCATAGGGAA GTCCTCTTTT 300 

CTGTGAATTG GTTGTCGTTA ACTTTATTCT ACAGAAGCGA CTTCCTTTTT TGTATGGATT 360 

TTTTCATTTA CACAAAATAT TTTACACTCT CCAAAAGAAA TACCATTTAA AATATTTTTT 420 

TCAAAAACAG ATATAGTATA GTAAACATTA TTAACCCTAT AAATAAACTC TTCTTTTCCA 480 

ACATTTCCTT TTTTATTATT AAAATCATTT TTTTCTTCAT AAGTTAAAAC TTCAGATAAT 540 

TTTTGAAAAT ATACATTTAA TAGTAGAAAG TCATTATAAG AGGAAACCAA ACTTAGAATT 600 

GGCTTCATAA CCATTGTTAC TATAGAAACA AAACCAATTA GTGATCCTAA TGACAAAGAA 660 

TTGTTGATTA TTAATTTAAT ACCTATTATT AAAAATAGAG CTGGCATTAC ACTTTGTATG 720 

ATTTCTGGTA ATATTCCAAA AATAGCTATA TATCTATTTT TATTTTTTGT AATTAATAGT 780 

TGAGACGTAA ACATGTTTTT CCAATTTAAC AAAAAACTCT TTTCTGCATT AGCAGATTTA 840 

ATGGTTTCAA TkcCTTCAAT TGCTTCTGTA ATAATTCTCT GGACGTTTCC TTGTTCCATT 900 

ATTTCTTTAT CTACAAATCT TTTTATTGTA TGCGAATTTA TAATACTTAA AAAAGCTATT 960 

AAAGAGATTA GAACAAGAGC TATTATTGTC AGTAAAATAG AATAGTTAAC CATTAAAAAC 1020 

AAATATATCC CTAAAAAAAG ACTATCTATT AAAGTTGTTA TTACCTTTTG AGACAATATT 1080 

TGAGTCAAGT CCAGACTCCT GTGTAAAATG CTATACAATG TTTTTACCAT TTCTACTTAT 1140 

CAAAATTGAT GTATTTTCTT GAAGAATAAA TCCATTCATC ATGTAGGTCC ATAAGAACGG 1200 

CTCCAATTAA GCGATTGGCT GATGTTTGAT TGGGGAAGAT GCGAATAATC TTTTCTCTTC 1260 

TGCGTACTTC TTGAkTCAGT CGTTCAATTA GATTGGTACT CTTTAGTCGA TTGTGGGAAT 1320 

TTCCTTGTAC GGTATATTGA AAGGCGTCTT CGAATCCATC ATCCAATGAT GCGCAAGCkT 1380 

TTGAATATTT TGGTTGATCG ATATAATCAT GAATCAATCG ATTTTTAGCC TCACGCGCTA 1440 

AGTTAATATC TGTGAACTTA AAAATTCCTT TAACAG 1476 

(2) INFORMATION FOR SEQ ID NO: 743: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1081 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 743: 



CCTGAACTGG AATTGTTGAA GTAATATCTC ATGCTGGTAA AGTAACGTTT ATTGAACAaA 60 

AAATTAAAGa AAAATTTTAA TATAGTCTGA CTAGACATAC TAGAAGACAT CTGACCAATT 120 

GTGTTTTTAC ACATTGGTTA GGTGTCTTTT TGTATTTTTA TTAGAAGGAG CGATTCGCCG 180 

ATGTTAGACC AACACGGAAG AAAGATTTTG ATTCAGGAAT ATAAGTCTGA TTTAAAAGAC 240 

GCAAGTCGTC AGCATCGACG AATTGCAAAG AAGAAGTATC AAATTGAAAA AAATGGCAGA 300 

TTAGAAACAA TTGATGACCG CACTGCAGAA GATATAAAGG AACAGTCCAT TTATGCTGAA 360 

ATTATTTCAT CTACCAAATA TGCGTTATAT TGGCTTGAAC ATGGAATTGA GCGACCTCTT 420 

GATGAGGAGG CAGCAAAAAA AATACCTAAA TATCGCAGAG ATAAACATAT TACAAATATG 480 

GATAAGATAT CATACGAAAT TTATTGTAAC CAGTATGAAT CTGCACGTAA TTACCCTATT 540 

ACTGAAGAGA AACAAGAGAT GTTGATTCAA CTTAAAGAGT TGTTATCAAC GTTCaGCGAG 600 

AGAGAACGTG ATTTGTTTGA TTATATCCAT AATCAGCAAC TTACTTATGC AGAAGCTGCT 660 

GAAAAAATGG ATATTAAAGT AGGGACTGCT AAATCCATGT CACAACGAAT TAGGAATAAA 720 

ATTGATGCTT ATTTCGAATA TGGACATCAG ATTTCATTAT TTTAAATTTC ATTTTTTTGT 780 

AAACCATTCC CACCTATAGG TGAGAGGTAA AATTCTCCTA TTCTAAGTTG GTAGAGTAGC 840 

TTAATAATAT TTACTTGTAA AArAATACAA GAGAGGAGGT GTTCCTCCTC CTCTAAATTT 900 

CTACAAATTA CGAGTAAATT AGGTAGACGT GTAGCTCAAT AGGTAGAGCA ATTGATTTTT 960 

AATCAATGGT GCGTGATTGC TTGTGCCAGG TTCGACTCCT GTCACGTCAA TAAGTGGGAA 1020 

ACCACTTAAA TAAAAAATCG GTATATGTCA ATAAATGTTT CTACTACTGT GACACACGAT 1080 

(2) INFORMATION FOR SEQ ID NO: 744: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744: 

TTCGAGGAGA AAGTATGATT AACGCAGGGT TGTCCTGGAT AAAACACAGA TGTATCAATC 60 

CCAGGATTTA ACGCTAAAAT TTGTTCTAAT GTCATTCCGT TACGTTCGGC AACTTGACGT 120 

GCCCCTTCGC CACTTTGAAC GGTATCATAA ACAGGTTGTT GGTTCTGCGC TTGTTGCTGT 180 

TGTTGATTTT GAGCTTCTTG TTGTTGTTGC TGCTCTTGTT GTTGCTGTTG CTGTTGTTGT 240 

GCTTGTTGGT CCTGTTGTTG CTGTGCAGCA GCATCTGAAC TTGAAGGTGT TGTATTTTCA 300 

GCTGGTTGGC TGCTTGCTGG TTCTGAGCTT TCCACTGATT TTGACGTACT TTCTTTTTTC 360 

TTCTCTTTTG ATGATGATTG GACTAATGAA GAAGATGGCT GGCTATTTTG CTAGCACTTT 420 

CGTTCGGTTC CTGGCCTGCA TCATCCATAA GAAAGTCCCA ATTGGAAnGC nATAATTAAT 480 

AATAGC 486 

(2) INFORMATION FOR SEQ ID NO: 745: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745: 

AGTGAATTTA CCnAATAAAT TAaCAGTTTT AAGAATATTT ATGATTCCGA TTTTTATTAT 60 

TATTGTGTCG GTGCCAATGG ACTGGGGAAC GATTAGTTTT GGCGACACAA CGTTAGCCGT 120 

TACCCAATTA GTGGGCGCGA TTATTTTTGC AGTTGCTAGT TTTACAGACT GGTTGGATGG 180 

CAAAATTGCC CGTGCACAAG GCTTsGTGAC GAACTTCGGA AAATTTGCGG ACCCaTTAGC 240 

GGACAAAATG CTAGTGATGA CCgCTTTwAT CGTCyTAGTk GGTCAAGGca AAGTACCtGC 300 

tTGGATTGTA GCaATwATTG TTTGTCGTGa ATTGGCaGTA ACTGGkTTAC gGTTGTTAwT 360 

AgyCGAmCAT GGCGAAkTAA TGGCAGCAGC TTGGCCAGGA AAAGTAAAAm CAGCyACCCm 420 

AATGGTgCaA TATTtACyGk TCaTTAmCAA CATyCCATTT yCAGaCTcyA CCACTAGATC 480 

AAATCATGCC ATATGCCTGC TTGAATTTTC ACCATTTACT CTGGGAG 527 
(2) INFORMATION FOR SEQ ID NO: 746: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 772 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 746: 

ACGTATCCAG ATAAAAACCG GTTGAATCAA ACGTCATnTA ATGGTATTnT AGACGAGTTG 60 

TTATCCTTTT ATGAACAAGC AGGGGAAAAA GTGGATCAAT TTGGCCGAGT TTTTGAAGCG 120 

GCCTTTGCTA CAACCGAAAA TGGCCATTTG TTATCAGGTC CCATGAAACG ATTATTGGGA 180 

TTGTTGTTAC AAGTTTTATG GAGCCAGCAA GTCAATCACT TTGACTTTCA ATTTAAAAAT 240 

TTTATCATTT GGTTTATTCG CTTAGGTTTT CCGGTGGCGT TTCCAAAAGA AATTTTAAGT 300 

GCTGATTACG CAGAACAAGT TTTGCTTCAC ACAGAAACAC TAGGTCCTCC TATGGTGTTG 360 

GAAAAAATCG GTCAACTAGG AGCAGCTTTA AAGCCAACGC CCGATCAACT GTTAACAACG 420 

ATTGAAAGAT ACCAAGAAAT TTTAAAAGAG ATTTGAGGGG AAATTATGTC ATATAGTGaA 480 

AAAATGGTTC AAGCATTACA AGCTGaAAAT TTaGCaGAAG CACAGTTAAT GTTTGaAGAA 540 

GCaTTAAAGa AAGATGaTGa AAATACCTTA GCTGATTTAG GGGAAACGTT ATTATCTTTA 600 

GGTTTTCTAG AAGAAGCCAA ACAAATTTTC CAACAGTTGT TGGAACAATT TCCCGATGCA 660 

GATGGCTTAA ACATTCCATT AGCAGAAATC GCCATTGAAA ATAATGAAAT AGATGACGCA 720 

TTCATCTATT TAGAAAAAAT TCCAGAAACT AGTGACAGTT ATGTCCAAAG TC 772 

(2) INFORMATION FOR SEQ ID NO: 747: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 747: 
GCCCTTTTAA AATTTGTAAC CATAATTCTG TTACATCTTT ATTCTCTTGG TTCGTTAAAG 60 

TATACTTATT TTTATTAAAT ACCTCTACAT TAAaCCAGtC ATtAGTAATA AAaTTTyCrG 120 

AaTaCATTAt CTCTtCACCT CATTACTTtC TATTAGAGAT AGTTAtGCAa TTaATTTtGk 180 

GTAaCACTCT GGGTTATCTG TTAAGCATAA TATGTAATCt ACGCTTGTAG AATACAAAgT 240 

GATAGCTTAA CTAAAATATC TACTGGTACC ATTGTTGCAC CACTCTCATA TCTGGAATAC 300 

GCAGATTGAG AACAATGTAT ATAGTTACTG ACAAAAGATT GAGTATAATT ATTTTTAATT 360 

CTCAAAAATA GCAACCTAGA TTCCATAATT TTTCCCTCTC CTGTATGCTA TCTTTGTTTG 420 

TTAATAACAG TCAATATATT ATTCGGATTC TGCATAATTC TAAAATTTTT ATTATTTATC 480 

AATCACGAGA AAATAAAATA TGTCTATACC TTTATAAAAG GTAAAAATTT GTATTTCTTT 540 

ATTATTTGTT TTATAATATA AGTATCCATT GAGCAAAAAC GCTCAAAATA GGCTGATCGA 600 

GTTAATCGAT CAGCCTATTT TctTTAGATT TATATTATTT ACTAAAACTA TACTGTCACG 660 

AAAAACAACA TATATAATAT TATGTTGTTT TATTCAAACT ATAAATGATA aTATAyyTAA 720 

ATATATAATA GAGACTGTCT CTTGGGGGAG ACAGTCTCTA AGGTTATAAC TTATTTTAGC 780 

CTTTCATTTC TAACAAATAA TCATTCATAG AACTGAACGA TCAATTCATT CAGTCTGAAA 840 

GAAAATAGTT AAATCTGAAA TAGCTATTTT TATTTTACAG TAtCTGCCTT GGCCTTTCAA 900 

TTTCCTTGAT ATTACTTTTA TTTCAGGATT ATTTAACATA AACAAATACT TCTGAATCTT 960 

TTACATAAAT GTCATGGTAA TAATTAGATT TCATTAATTC ACTTTCTTCT TTACTCAACG 1020 

CGTTAATATC AAAGATCTCC ATGTTCACGT TTAAACCAGT GTACGTTCTA AATAAAAACT 1080 

GATTTGGCCA ATAAAGGGCT TCATTTGGTG GA 1112 
(2) INFORMATION FOR SEQ ID NO: 748: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748: 



GGGCTTCCAT TTCnGGnTAT GCnGATGGCG CATCAAAACT TGCACGTAGG TAGTCTCCCG 60 

AAAGAGGaGT AAtGcTAAAA AAaTATGGtC GCmCTTtCGG AAAaGCTTCa CTTGGTACAT 120 

TTkGATTGGC GTGATgTTcA TTATGTAGgC TTTAGCAATT TATTTTTTTC AACAGTTGTT 180 

AGATCATTAT CAAAAAgTTT CCaACTAGGG TTGTTAGTAG CGTATGGAAC GACAATTATT 240 

TTAATTCCTC TATTGTCTTA CTGTGAACAG AAACCGAAAG CTTaCTTaAC TAATGGCATC 300 

TATTTCTATT TGAAGAAATT AAGTTTAATA AAAATGAGCA AGATTTCCTA TGAAGAATAT 360 

CTAAAACTGG GAGCGGGGGC TTTACTACAA AAAGTAGAAG TTGGTGCAGC AGCTGGGAGA 420 

AATATCCACT TGAATTTTTA TGGGCGCTTA TTTCGAGAGT TAATTCCAGA GACGTTATTT 480 

AATTTATTTT TCATAGCATT GATTGATAAA AAATTATTAC CAGCAATTCT AATTGGTTAT 540 

GTGATTGTAT TTATTTTAAC CAAAATACTT TTGAAAACGC TACAAAAAAT GAAAGAAAAA 600 

ACACTTATTT CTGAAGAAGC AATGAATGCT ACTTTGATAC GAGGGATGAC CGAACTAGTA 660 

ACTTTTAGAA TTAATCGAAA ATATAAAAnA GAGATTGAnA ATTACGCGTT AATGGCAGAG 720 

GGAAAATAGT CAAAATATAA CAAAAATGGA CTATGGATCC ATGGAATTTT TCTTTGGGCT 780 

TTTTTGGCCT TGTTAGTTGG CACTTATCCA AGTAAGTATT GTCCGTACTT AGGTTTTACC 840 

TAACGnCGnG GCGCCCAGTT TAGG 864 



(2) INFORMATION FOR SEQ ID NO: 749: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749: 

ACCAACCTTC AAGATAAGTT TTAAGTCTGT GCCCTTACAC GAGATTTTTT TACGGCAAAA 60 

ATAATCCTTT GGTAGTTCAT CCAGCACAAG CACATTTTTA TATAACTGAT TAATTTGGTG 120 

GAATTATAGT TATATCTATA TTGATTAnTA GCTGATCTTG CTAAGCATGG ATTTAATAAG 180 

AATATTTTTG TTAAAAAATC ATATAnCCTT ACGTGTAGTG CTGCAGCACT CATTTCAAAT 240 

TCTTCGCAAA TTTGACGGAA AGACATTTGC TTTTTTAAAC ATTCTTTCAA TGCTTCATTG 300 

TTTATTAAAG CAAGGGATGC AAATATATTA GCTCTTATTT CTTTTGGTTC ATCTtCGCTA 360 

GTGtATCCTT TTCCAGTAAT TAAATCTGAA AAACTTTGAG ACTTGCATTC CTTATTCATA 420 

TCAAAAAAAT AATGTGcAGA TTCATGAAGT ATTGAGAAAT TTTGCcTTCC TTTAACCATT 480 

GtTGGATTAT AGGaAATTCC GCATTTATCG TGAAATTTTA TTATGTTTCC TGaAAATATT 540 

TTTcTkGAAA CATCACCGAA ACTATAGGGA ATTATGTCAA CATTTTCTAC GTCAATCATA 600 

TAATCTTTGT AGTGTTTCCA TCTTAACTCT TTTAGTTCAA TTCCATAGTA ATTAGCAACT 660 

TTTTCAATAA TAATATATGT CTGGTCATGA TATTCAAAAT AAGTATCGTA GTCTAAATAC 720 

ATAGGGGTCA TCCAATCATT TATTTTTAAG TCTTGATTTC ATGAATTCTA AGTACTCTTT 780 

TAATTCTTCT CTGAGCTGAT CTTTTTCGGA TTCGGTkAAa TCTtCTGTAT TTACACGGAA 840 

GAAAGTTGTA ATTTCATCTy CTTCTAAGTT ATTAGAATTT GGATTATCAG TTCGTCCAAG 900 

TAAATAAkCT ACAGATATAT TGAAGTAGTC CGCAACTTTT TTTAATTTAT CAGAACCAGG 960 

AGTATTCTTA CGCCATCTAT TAATTACCCC GTTtCCAaAT CCCAATGTTT CTTCTAATTG 1020 

TCTTATTGAC ATTTTCTTTT CGCTAGCAnG TAATTTTATT TTnTTCGTAT AAATCCACTT 1080 

ATATCAACCT TTCTAGAAAT ACAAAGAAAA TATGTATACA AAAAATCTAC AAAATGGTTG 1140 

ACAATGTAGA AnAAATGTAT ATAATAATTC TTGTAAGTTA ATTGGATAGA AAAAAAGCAA 1200 

ACTAAAAACA TACCTTACAG CATTAAGTTT GGCGACCGAG TGCATAATAA AGGTTTTGTT 1260 

ATAGGCTTAT TTAACTATGT TCTGATTGTA GAATATTTGT ATACATAAGT CAATAGTTTT 1320 

ATAAATTCTA TCTAATTAAT TTACAAAAAA TACTAAAGAA AGGAGAAACT AATATG 1376 


(2) INFORMATION FOR SEQ ID NO: 750: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750: 

TTTTGTAAAT GTTGTCATTC CATTTTATTT GCAGGATGCA CGCAAACTAA GTGCTAGTTA 60 

TGCTGGTCTA TTAATGATGG TATTTCCGTT ATTAATGGTG GTTGGGGCTC CTCTGAGTGG 120 

CTATTTGACG GATAAAATTG GCCCAGGTAT TTTAACATTT GGCGGATTGT TGCTCTTGTG 180 

CTGTACGTCG TTAATGTATA TGTTTTTAGA TATGAATTCG CCTATCTGGT ATTATGTGAT 240 

TGCAACGGCC ATTATGGGCT TGGGAAATGC ACTTTTCCAG TCTCCAAACA ATACAATGGT 300 
TATGAGCAGT GTTGAAAAGC AAGATTTAGG TGTAGCAGGA AgTATGaATt CTTTTGCTAg 360 

AAAnCTtAGG aAkGGyCATk GGGATTGCGT tGTCaACGAC CATTTTAtAtr -CGTGGCATGA 420 

GCGAACCTAA TGGTGAACgA GTAACCaCGT AtCTGGctAA tCgCCCaGAt ATAtTTATTG 480 

TGGGAATGCG TGAAcCTTTT TTGTCGCCTT TCTATTATGT GTGGCGGCCT TTATATTAAC 540 

GATTTTACGT TTTCGGAAAA CAACCAAATA AGTAAAAAGA AACAGACGAG AAAGATTCAT 600 

GATTTTTCTC GTCTGTTTCT TTACTAGCTA AGAAaATAAG CaATGCGATT GTATATTTCT 660 

GATCCCnGn 669 
(2) INFORMATION FOR SEQ ID NO: 751: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751: 
AAAATAATTA AAAACAGGAG TGATAAACGT GAACGTATTA GAATTCGGCG GTACTCGCTA 60 

GCATCAGCAC CACAGTTACA GAAAGTCTTA CAAATCGTCA AGGAAGAGCC GCGCCGAACG 120 

TTTGTCGTTG TATCCGCTCC TGGAAAGCGA ACCCCGCAAG ATATTAAAGT GACTGATTTA 180 

TTGATTCAAT ACTATCAGCG CTACTTAAAT AATGAGGAAA TTGAATCGAC AATCTCTGCC 240 

ATCATTCGTC GTTATGAAGA TCTTTTTGAT GAATTACACT TGGATAAAGC AGTTTTAGCT 300 

GATATTGCTA TTTCGATTCG CCAATTAGCC ACTTTACCAA AGAAAACAAT GCTTTTTTAT 360 

TTGACTATTT TTTAGCTAGC GGTGAGGACA ATAATGCCAA GTTGTTGCCA GTTTTTCAAG 420 

CAAATGGCTT AGATGCTCGC TACATTAGTC CAAAGAATAG GnCTCCTAGT GAnCCTGAGC 480 

CAGCATGCCG CATTTTACCA 500 
(2) INFORMATION FOR SEQ ID NO: 752: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752: 

TGAGTTTTAC AACGAGTTGC TTAATGAAGA AATAACTAGT GCTCAAAAAA GAAAAATGAG 60 

CAATGATATT AAATTCTTAG CGAGTATTTT AACTAATATT TCTTGCTACA GATACTCAAT 120 

TACTTTAAAT GATGTTTTTT TTAGAGGTAA GAATAAACTT GGCTATTtGA TAGGGGAATT 180 

TGGTGTtAAA ATAGTAAAAT CATATtCGAA CTTTATATTC TCATGTTTAT CAGGAGGCGA 240 

GTCTATATAT AATGTGGTTT TAACATCCAA AAAAGGAAGT AAGTTAAAAG TGaTAGAtTT 300 

TAATTCTGAT GaATATGAGA TGTCTtATTG GTGTTAGACC nGATAC 346 
(2) INFORMATION FOR SEQ ID NO: 753: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753: 



GTCGATTGGT TAGTACTTAC CaAAGGAATG AATCGATAAA AAAAATAGCA GTGATGGCGA 60

TAACCGCTTT GACTTC1GCG GTCGGTTTGA ATTTCTTTyT GATTCCGGCA AAAGTTTTTT 120

CGGCAGGGAT GAACGGAATT GCGCAAATCA TTGCTACATT ATTATATACA AATTTAGGAA 180

TCCATATTAA TACAGGGATT TTCATTTTAT TTTTAAACAT TCCCGTCTTC ATTCTAGGGT 240

TTGTAAAATT agggaaacaa TCAACGATTc TTAGTTTCAT CAATGTTATT GGTATTTCCG 300

TTGTCACTAT GTTTGTGCCA ATCGTAACAG TCACAACGAA TCCTTkGATG AACGCAATaT 360

GGGTGGTGTC TaGTCGGCGT TGGCGCAGGc TTcATgAAAA TGGGCTTtAA CACTGGCGGC 420

ATGGACATTA TTTCACTGAT TCTtCAAAAA CAACAGGGaA AACAGTTGGT AATTTtATGT 480

TCTTATTAAA TGGtAwTATT GTGCTGTTGG CTGGTTTTGT ATTTAACTGG GAAAGTGCGT 540

TGTATACTAT TATTTCTATn TACTGTTTGA GTCAGGTGGT CGACATGATT CATACCAGTC 600

ACCAAAAAGT CACAGCAATG ATTGTCACCA CGAGACCAGA AGCCGTTGCC TTAGAAGTTC 660

TCAACAAATC GCTCGTGGCA TGAnC 685



(2) INFORMATION FOR SEQ ID NO: 754: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754: 

GGTGAATGTT TTACCGTCGC TAAGATTCTC TTCCAGATTA TAAACAAGCT GTGGCAACTC 60 

TGCGCCGGTC GTTTGTTCTA CATCAAAGAC AGGCGTTAAA AAATAGCGCG TCTCTTTGAC 120 

AATTTCTCCA TTTTCATCCG TGACTTTTTT CCCATTCTTA TCGACTTTAT CTTTAAAATA 180 

AGGGGCATAC ACATACAGTG CTTTGCTGCC TTTTTTTACT TGACGATCTA ACTTTTTCCA 240 

CGCATTGTAA CCAGCTACAC GACGTATGTT TGGATTTTGT GCCAACAACA AACGAATATT 300 

TTTTGACGAA TATTTATGAA ATTTTGAAGC AAAATCTAAA TAGTTTTTAA ACGTATCACT 360 

TGTTAAATAC TCTTTGaTCC CGTCGTTCAA AkGCTGCGAT AATTTTTGaT AAtCTTTCTT 420 

TTTAATTATC TCTTCCAAGG kGATTtCkGG TtGGCyTTTT tGCTTCATTt GTGTTTTTTC 480 

AGACATTyCT TwATtCCTTt CTCTGGATCG TTTTTTTGCT TCTTGACGCT TTAATTTTGC 540 

TTCTtTTyTC TTTGTTTCAT TCTTTGTTTT CTTTAAAATA TCGTTCCAAT CTGTkTTACT 600 
TTTTCCTTCA ATTAGTTTAG GkTGGTCAAG AGTAATnGGG ACACCAGTGA ATTGGAAAAT 660 



GGTGGGAATA AATTTCCTTC CC 682 
(2) INFORMATION FOR SEQ ID NO: 755: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755: 

TTGAGGGCTT GTCCCTCTAC TCATCGGAGA TTAGAGAGGT CAAAAATTGA CGAGGTAGAG 60 

AAAATATTTG AaAATATTTT TTAATAGGCA TGAGTAACGG CAACATCCGT CAGATTGGAT 120 

GTTGCCGTTG cTCTTaCCTA CTAAGGAAAT ATATACGAAT CTGCTTAAAT GCTCTTTGAA 180 

GCAGGCGAGT TACCGCCGAC GGGTCACATC CGAGGaTAAC TGCAATCTCC TTGTATTGaT 240 

ACCCTTGCCA GAAGTGaAGT TCaATCACTC GcCGCTGCCG GTCtGTCAGA CGGCTTAgAG 300 

CAGCCgATaA ATGCTCATTC TGAACAGAGT CCATAAAATC CTGCT 345 
(2) INFORMATION FOR SEQ ID NO: 756: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756: 

ACGTTTTATA TATCTTTTTC ATTAGTCAAA AAGCGCTTTT TGACTAACGA TGCTACTCAC 60 

TATAATACGT TTTAAATCGT TTAsCAAGTC TTTTTATTAG AGAATGTTCA TCTTAGATAA 120 

GAAACCCACC TATTTTGAAA AAAGGAAAAT CAAAAATAAG TGGGGCAGCA GTCGAAACGA 180 

CTGCTGCCCa CTtGcTaACT ATTAAGATAA TTCCCTGAAc AAAAGTAGCT CCTTTTATTC 240 

CTGTTrAATT TAAATCACTT CTTCTTCCTC TTCGACTTCA GAGAAATGGA ACTTAATCAT 300 

TTCAyTGACG AAAATTGGcA CAAAGGATAG GAAAATTGCT AACATCCAAT GATTTAAACT 360 

AATATGCACT AATCCAAATA GTTCTTGTGT GAAtGGTAAT AAAGcAAkGG TtGTGGTAAT 420 

TAGTAATGGC TAAAACGACC ATTTCAAAGA GCGACTTGTT GCTGATAATT TTACACGGAA 480 

AACAGAGTTC GCACTACGAA CATTAAAGAC ATGTAAGATA GAGGAATAAG CTAAGATTAA 540 

AAATGCCACT GTTTGTCCAA CTTCATAnCG TTTGAnAnAT AGGCTGAAAC ACCGGTACAA 600 

GGCACCTA 608 
(2) INFORMATION FOR SEQ ID NO: 757: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1672 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757: 



GATTPTCCPT 


CATAATCTAA 


CTCAGCTTGT 


CAACTTTAGC 


TCTAGnATTT 


TnCCTTTTAA 


60 


PaTPP,AAPTn 

1 ugrtAVi 111 


TCATTATTAT 

x \**r\ x x *\ x x i* x 


AGCCCCCTAT 


TGTAATTCAA 


ATGTAAAATT 


ATCTATTAAT 


120 


Lllullliiv 


PTATAAATAP 


TGCTATTGCA 


ACTAAAACTG 


ATTTTTCAAT 


ATAATTTACT 


180 


PTTTPT A ATP 


ATAAAPAATP 


AACAACTTCT 


ACATAATCAA 


TCTTTGCCAA 


ATTACAAGTA 


240 


i 1 in Iril 111 


T APTTATTAT 


PTPmATTATT 

^ X wlLul X X *i X X 


TTTrAGcTAT 


CTCTCTCTCC 


aTTATTTAAG 


300 


Uv> 1 1 1 Uu 


PTAAAPTTAA 


rPATTTATTT 

X. xxx c\ xxx 


AAAAtAGTAG 


CTGAAGATCT 


TTCCTCTAAA 


360 




TPTTTPTAPA 

lul x i i riort 


APTTTTAGCT 

riu x x x x x 


AGTCCATCCT 


CTTCCCTTAT 


AATTGGACAC 


420 


pptbpsi nrr,T 


PTATATPAAT 

\^ X r\ X X L»rui X 


ATTAAGGTPP 


CTAACCATTC 


TTTtAATTAC 


AGCTAACTGT 


480 


lijl o^n 1 ^ 1 


TTTPTPPAAA 


ATAAGPTPTA 


TCTGCTGGTA 

X >k*r X \J \^ X VV X *i 


TTATGTTAAA TAAyTtACTT 


540 


zipaapziPtpp 


AAAPPPPTPT 


AAAATGTATT 

n/wi x vj x ** x x 


GGCCTACTAG 


CTCCACATAA 


ACCTTCTGTT 


600 


BZiTPPAPTTZX 


TATTAAPAAA 


AGTACTTGCA 


TTGTCACAAT 


ACATCTCTTC 


AACTTCAGGA 


660 


1 1 c\r\r\ J. n 1 1 u 


PTPTTPPTPT 


APPTCTTGTG 


CATACTTCTA 


TATCTCTTTC 


TAAGTCTCTT 


720 


PPATZX APTAP 
Uun 1 ilrtVy 1 


PTAAATPTTP 


ATTTTTCCCA 

ni x x x x 


AACTGAGTTG 


GaTTTACAAA 


GATACTtACA 


780 


ZiPZiAPTTTAT 


PATTnTPTTT 

v_//^ 1 1 U X V i 1 X 


TGATGPTTTT 

X VJA X X X X X 


CTAATTAAAC 


TTTCATGACC 


TTCATGTAAA 


840 


TATPPPATAP 


TAGGAACTAA 


TCCTACCTTT 


AAGCCTTGTT 


TTTTCCACTC 


TTTTACTTGA 


900 


PTTPTAAPAT 


PATTTAPTGT 

^ri xxx *x w x vj x 


TTTAAATATA 


ACCATATTTC 


TTGACCCTTT 


TAATATAATT 


960 


TTTPTAAAAC 

x i i ^ J. rifuiriw 


ATPATCACTT 


ATTTTAAACC 


CATGTTCTGG 


ACCTGGAAAA 


ACTCCATCTT 


1020 


TTAPTnPPTP 


TATATATTTT 

X *1 X J* X 4* X X X X 


GTAAATGCTT 


TATTCATCTC 


TTCTGATAAA 


TTAGCATATT 


1080 


TPTTPAPAAA 


TTTAGGGGTA 


AAATCACTAT 


ACATCCCCAA 


CATATCTTGA 


TATACAAGAA 


1140 


TTTGTCCATC ACACCCTGCA CCTGCTCCAA TACCAATkGT AGGTATTGAT ATCTCTTTAG 1200

AAATTAACTC TGCAAGCTTA GTTGGAACAC ATTCTAACAC TACAGCGAAT GCTCCTGCTT 1260

TTTCTACAQC TTTAGCAGCT CTTATTAATT CTTTAGCAGC TTCCTTGTCC TTTCCTTGTA 1320

CCTTAAATCC TCCAAATGCA TTAACAGATT GAGGTGTTAA ACCTATATGC GCCATAACAG 1380

GTATAGAAGC CTTTACTATA GCCTCTATTT TATCGCATAC TTCTATCCCT CCTTCTAATT 1440

TAACAACCTG AGCTCGTCCT TCTTTTATTA ATCTTCCTGC ATTAACTACT GAATCATATA 1500

CTGAAGTTTG ATAAGACATA AATGGCATAT CTGCAACAAC TAAAGTGTTC TTAGCTCCTC 1560

TTGTAACTGC TCTAGTATGA TGGAATCATG TCTTCCATTG GTACTGAAGT GTATCTTCCA 1620

TGTCCTAGAC ATACCATACC AAGAGGnATC TCCTACTAAG TATnCCATTT AT 1672


(2) INFORMATION FOR SEQ ID NO: 758: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758: 



TTCTCACACA GATAATACCA GTTGAAATTT TATTCGTCAA CTCAATAAAT GTAACTTTTA 60

CATTTTGATG AAGAAAAGCC TATTCTATCA ACAAAAATTT TTTATTAAAG GAGATTTTAT 120

ATGTTTTTAC AAAATAAGAA TGTCGTCGTA ATGGGTGTTG CCAACAAAAA AAGTATTGCT 180

TGGGGCTGTG CAAAAGCATT AAAAGATCAA GGGGCCAATG TTATTTATAC GTATCAAAAT 240

GAACGCATGA AAAAGCAAGT GGTTAAATTA GCAGATGAAA ATGATTTGTT GGTGGAATGC 300

GATGTTGCTT CTGATGCGTC CATTCAAGCA GCCTTTGAAA CAATTAAAAA TGAAGTGGGC 360

ACGATCGATG GACTAGTCCA TGCGATTGCT TTTGCCAAAA AAGAAGAATT GTCTGGAAAC 420

GTTAGTGrCC ATCACTCGTG ATGGTTTcCT ATTAGCACAA GATATTAGTA GCTACTCTCT 480

TTTAGCAGTC ACTCATTATG CGAAACCTTT GtTAAATCCT GGTTCAGGaA TTGTCACATT 540

GACGTATTTA GGtTCCGAAC GTGCGATTCC TAACTACAAT ATGATGGGGA TTGCTAAAGC 600

TTCTTTAGAA ACTGCCGTGA AaTATTtAGC yTTTGAATaG CtGCmGAaAA tcGTGTCAAC 660

GGAATCTCCG CAGGnCGGAT TAAAACATTG GCTGTGACGG TGTAAAAGAC TACGATCAAT 720

TGATTAGTAT TCnG 734



(2) INFORMATION FOR SEQ ID NO: 759: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759: 



ATTATTCAAC AGACGGAGAA GTAGATGAGT ATCTTACGCA ACTTATTTTG AAAGCTCGTC 60

AAGCAGGTAT TTTCTTTATT GTTGCAATGC AACGTCCAGA TGGAGAATTT TTAAAAACAG 120

CATTACGTGA CCAATTTATG TTTCGCATGT CTGTTGGACG TCTATCTGAA ACAGGTATTT 180

TGATGATTTT CGGTGATGAA AATAAAAATA AAAAGTTTAA ATATGTTGAA AAAATTGACG 240

GTCAGAAAGT CTATGGTCGT GGATATGTGG CTCAAGGAGG AGGAACTGCA CGTGAATTTT 300

ATAGTCCTCA AGTACCACAA GACTTTGACT TCATTGAAGA ATTTATCAAA ATTTCTAAAG 360

AGTTAGGTTA TGAAGATGTT CCAAAAGAAG TACAAGAAGA AGTATCTCAA AAAATATCTA 420

AACATATTGA TAAAGAAGCA CTGGCAGAAA TTGATGAAGA ATTTCAAGCA GAAAAAAATC 480

AGTTATCAGA GTTAAGTGAG AAATATTCTG CTTAAAAAGA AGTCACATAA ATTAGTTGGA 540

AAAGGAAGGT TTTTATGGAT TCTAATGAAA TAATTAAACG TGTCAGAGAA CGAGTATATA 600

GAGAAGTAAA GAAAAAATAT ACTAGAGACG ATTTGGATAC TCGTATTCAA GATGTATTAT 660

ATTATCGTTC AGAAACATAT ATGAAATTAG TTAGCTTTGC AAATGGTAAA AGAATTAAGA 720

AGTTAGCTGA TCCTAGAAAA TTTGAAAAGT TTATGGATAC AAAAGGAGTA AAGATTGTTG 780 

CAGAAGTACT TGATGGATTG AATAATCAAC CAAAAATGCA AGCTATGGrA TACGAACAAA 840 

AGGTACTTAC TAAAGTACGT CAATGGTATC AGAAAAAGAA TCACCCTGAG CTTGTAGATT 900 

TGGAAGAAGA AGCATTTGAA CAACTTGTAG AAAAAAATAT CATCTATAAA AAAATGAAGA 960 

AAAGGTTGTA TGAAGAACAG GATAATCAAG GTTTTGTTTA TTCGGATAAT TTTGATATGC 1020 

AATTGATTCG TGATTCATGT GATATTGAAG AGGCTCTATA CTTAGATATC ACGTTGGGGG 1080 

ATTACTAAGT GAAAGGAGTA AAAAGAAAGT GAAATTTAAA TCTCGAGTAC AAATGTTAAA 1140 

CTATGTGATT GGGAAGAAAC CAGACATCAT GAAAACAGTG ATCGGAGAGC AGAAAAAGAG 1200 

GGGGGTACAG ATGATAATAG GTTACTTGAT TATAAATTAT TAGTAATTTC nGnTTTAGGT 1260 

GC 1262 



(2) INFORMATION FOR SEQ ID NO: 760: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760: 

CACnGACTTA AAACTTATCT GGAAGTTTGT TGATAAnACT ACCACCAAAA ATGGAGGnAT 60 

TATAGTATGT TTTTATTTGG AAATAAAGAA GAAAnAAAGG TAAAGAAAGC AGAAGAAAAG 120 

CAAATAAAAG AATATTTTAA AAATAATCAT GATATTGTAA TAGGCGGTAT TTATTTTAAT 180 

GATTCAGACA AAAAAATATT TATCCCAAAA AGTATTTCTG AATCAAGAAA ACAACAAGTT 240 

ATCAACTATG ATGATTTGAT TAGTTACACA GATATTTTTG TTGGTGGCAA TATAAAAAAA 300 

CATCACGGAA TAACTCGTGC AGTTGTTGGT GGCGTTTTAG CTGGTCCTGT TGGGGCTTTA 360 

GTTGGTGCTG GAACAGGTGG AAAAGAATTT ACTTCCATTA AGCAATTAGG AGTTATGTTA 420 

CATCTTCCTA ATAATCAGAC AGTAAAATAT ATGTTAATCA CAACTGAAAC AAAAACTGAT 480 

TCTATGATTG GTAAAGGTTT AATGGATAAA TACAACGAAT TAATTGCAAA ACTGGaCCAA 540 

ATACTAAAAA CTAATTCAAG TAATAAAGAT AACACTGTAT TATCTTCTGC AGACGAGATA 600 

AGAAAATTTA AAgCCTTGTT AGATGATGGc ATTATTACmA AACaAGAATT TGAAATAAAA 660 

AAACGTGAAw TGtTACmATA aAAATaACGC ACCCTCTCCG GCTAAGAAGT TAGtGCGTTA 720 

AAAATaGAAC CaAAATAGGC TwATTTkGTT ACGCCTATTT TACCACAAAG AAAAGG 776 



(2) INFORMATION FOR SEQ ID NO: 761: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761: 



ATTGCTTTTT CAATTTGGTC CTGAGGATTC CCAGACCCTC CGTGTAATAC TAGAGGTACA 60

TCAGAACCGA CAGCTTCAGC AATCGCTTGT AAGTGATCAA ATGCTAAACC TTTCCAGTTT 120

TCTGGATATT GACCATGGAT GTTACCGATA CCACATGCTA AGTAGTCAAT ACCTGTAGCA 180

ACCATTTGTT TACATTCTTC GATATCAGCT AATtCGCCAG TACCGATGAT TCCGTCTTCT 240

TCACCACCGA TTGAACCAAC TTCACATTCG ACAGAAATAC CTTTAGCGTG CGCTTTTTCA 300

ACTACATCTT TCGCTAATTT TAAGTTTTCT TCGAATGGTA AGTGAGAACC ATCAAACATG 360

ATTGAAGTAT AGCCAACTTC GATACATTCT AAAGCAGCTT CGTAGTCACC GTGGTCTAAG 420

TGAATTGCTA CTGGAACTGT GATATTCATT GAGTCTACTA AATTAGTAAT CATGTCTTTA 480

GCTACTTTGT AGCCGCCCAT GTATTTAGCA GCGCCCATTG ATGTTTGGAT TAGAACTGGT 540

GCTTTTTTAG CTTCTGCTGC TTCTAAAATC GCTTGTGTCC ATTCTAAGTT ATTTGTGTTA 600

TAGCCACCTA CTGCATACCC GCCTTTACGT GCTGCTTTTA AGAATTCTGC TCCTGATACT 660

ACTGGCATAA AAAATTCCTC CTAAGTTTTT AAAAACTTTA TTTGTTAGGT GTTCCCTAAC 720

CAACGGTTCT ATTTTAGCAG AATTCCTGAC ACTTTTCTAG TAAAATTCCC TTTTTCAGAA 780

AATTTTTGTC TATTCTTTCA CATAACCTTC ATTTTTATAA AAAAACCTCC AGTTTTCCCG 840

TTAAGACTTT GGGACCTGAn GGCCGACTTG CTATTTTAAT TnGTCTTGGT GGATAACGTT 900

CACCTTGTAG GATGAAGCnC AGGAAAT 927



(2) INFORMATION FOR SEQ ID NO: 762: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762: 
AATTACCAAA CCACTGAATT AGAAACAAGA CAAGCAAAAG AACGGGCCTT AGCAGCAGTT 60 

TCmGCGATTG TTGAAGGAAA GCCAATCCAC GTTATTTTTT CrGGGAAACC TGGTGTCGGT 120 

AAAAGTCaTT TGGCTATCAG TATTTTAGTT GAAGTCTTAG aACGCTCTGC ATATCaAAAG 180 

TATTGTTTAT TTGTCAGCTA CTCTGAGTTA TTAGAAAAAC TAAAAATGTC CATGAATGAA 240 

TCGGCCAAAA GCCAAGCAAA GGCTCAAGCG TATATTACTA GAATGAAAAA AGCAGACGTT 300 

TTGGTTTTAG ATGATTTAGG TGCTGAATTA GGAATTArAA ATAAAGTTAG TACGGATTTT 360 

AATAATGACA TCTTAAACCG AATTTTAGAA GCTAGACAGA ATAGAGCAAC TATTTnTACT 420 

ACTAATTTTT CCGGAAACAA CTGGTGGAGG CCTATGGAAC ACGCATTAnn CCTCGTCTAA 480 

TGAGCACGCC AATGGCTATG TnTCCATATG AAGACACACA GACAACGA 528 
(2) INFORMATION FOR SEQ ID NO: 763: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763: 



GCTGAAGCTG ATCCAATATA TATCAAAACA CAGCTAGGAC ATAAAGAGAT AATACAGTCA 60

TATGAATACG CATCAGCTAC TGAAGCAAAT AGAAAAAAGA ACAAAGAAAA ATATGAAAAC 120

AAATACAAAG ATATTTTGTA AATACCCAAA AATTACCCAA AGTATATCTA AATTGTTATC 180

TTGATGTATA TATTAAAACA TATAGAATTG TTTTTAATTA AACAGCAATA TTCTTGAATA 240

ATTATCTATG AAGCTGGCAG GTGGCTCTTT ATTAAAAnGC TAAAACCTTT ATTTATTAAA 300

GGTTTTAGCT TTTTTCATTA CCGACATACC CAAAAAGTAC CCAATTTTnT TGTATTATTT 360

TrTATTTCTT GATAGACAAC ACGAACGAAA GTTCGATATA ATTGTATTAG GGAGGACGAT 420

ATCATGAAAA CTAATTATGT AGGAGTAGTT GAAAAGATTA AAATGTnAAG TATGTATCCA 480

AAAATTTTAG GTCGATTCTC 500


(2) INFORMATION FOR SEQ ID NO: 764: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764: 

ATTTATTTTT TTAGTACAAA AAGTCAGCAA TTTTTTGCGT ATGTTTGGGC CATTTTCCCA 60

GTCATCGCCT TTTTTTTGGT TAGTTTTTAT ACATTGGATT TTTTATTTTC CCCAAGCTAT 120

TTAATTATGG TGCCTGTATT TACTATTATG TTTAAAATTA ACTATAAAAA AGATTATTCG 180

TTTGTAGCGC TGCTCAAAAT GAGTATCAGG CAATTTGTGA TCAGTTTTTT AGCATTTATA 240

GCTAGTAGTT CCTTTAGCTG GTCTATTATT TTAAATAGTA GTGTGACATT TTTAATTGTT 300

TATTTATTTT ATTCATCTGC GACACCGATG CGGTATCATA GTTATTTGAT GATGTTTACG 360

TTTTGTCAGT TAATCCAGGT TAAAGGAAAT ACATTTTTGA TGCAATCTCA AGTTGTGGnC 420

TTTTTAATTG GTCTTTTTTT ATTGGC 446


(2) INFORMATION FOR SEQ ID NO: 765: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765: 

TATTTTTTCT GAACTGCTCT TGCTTTCTTC CGCTCCAGAA GTCTGTGTTT TTTTCATTTG 60

TTCTGGTGTG GGATAAATCT GTTCGCCTTC TTTTGAGAGA AGGAAACGGC TACGTGCTAA 120 

TTTTGCCACA TAATCATCAT CTTTTAACAA AGCAACGTCT TTTTTTAGAT CCTTGACCTT 180 

CTTATCAACT GCTACGGATT CAGCAATTGC ATCTGCTCGT TCTTGTTTGA AAGCACCTAA 240 

GCGATGATAA TCTTTCATCA ATTGGATGCC AGAAAAGGCA AAAATAATGA AGGCCACGAG 300 

AAAAATCGCT GCCAAGgTCT GCGTCTAAAA ATCAATTGTT TCTGTTGkTT TTGGAATTCG 360 

ACATACTGTT CTTTCGTGTA GTCATTTTCC AATGCCGCAA CTTTCyTCGA GTTTTTTtCA 420 

TTCTTTCCCA TCGTCTCGCT CCACTCGCTT tGGATTGTCC GTTTTATTAT ACCAACATCT 480 

AAGAACCTnG TCTAATGTtA ATTTCAGAGA AGTTAAAGAA AATCAAGCTT TTTTCTAAAA 540 

TAATCCTTGC CTTTGGAAAT AAACAGCGGT nGATTAnTAA ATTTTTTCCA AAAG 594 
(2) INFORMATION FOR SEQ ID NO: 766: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766: 

TGATTATTTT CCCTTGGAAT ACCTTAGCAG GCACAACGGT GAGTCCATTT GTTATGGTTT 60 

TTGAAAAAAT GAATATTCCT TTTGCTGCTG ACATCGTAAA TTTTGTGATT ATTTTAGCCT 120 

TGTTTTCTTC TATTAATTCT GGAGTATACG CCTCTTCTCG CCTTTTATAT TTTCGCTTAA 180 

AAGATAAAAA AGGACCCATG AGTAAACTAG GGGTTTTAAA CAAGCACCAA GTACCTCAAC 240 

GTTCAGTTTT CTTCTGCGCC AGTGTTTTAT ATCTGGGCGT TATTCTTTCT TATTTTGTTG 300 

GCGATGAATT ATTTGGTTAC TTAGCCGGTT CTCTCTCTTA CACTGTCTTG CTAATTTGGA 360 

TTTaATTAGC GCTGCGGCGT TTGTnCTGTC TTTAAAACGA GGGTCTCTCT TTGAAAAAAG 420 

TATTAATCTC CTTGCGTTAA TCATTTTAGG TCTCATCTTT ATTGGCATCT TATTtACAAA 480 

tTCGTaGGCG TCACTCyTTT AACAGGCCTA CTGTAC 516 
(2) INFORMATION FOR SEQ ID NO: 767: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767: 

TACGGCTTTG AGCGTGTCTT TACTAGTTTA TTGCAAATCA ATGATGATCG GGAAAAGGTA 60 

TTAGCAGAGT TTAAAAAAGT CGTAGATTAC GCCAACCAAT TAGGTATGGA AGTGATGGTC 120

GATATTAATC CTGCATTGTT CGAACAaTTA GGCATCTCTT ATGATGATTT ATCATTTTTC 180

CATGATATGG GCGCTTACGG TATTCGTTTA GATCTTGGTT TTACTGGGCA AGAAGAGGCG 240

AATATGACAA GAAACCCTTA TGGtATTAAA ATTGaAATTA ATATGAGTTC mGGGaCAAGT 300

TATGtAGATA ATATTATGGc CTACTCACCa AACACaGaAA ATTTACTAGG CTCACATAAT 360

TTCTATCCAC ATCGCTATAC CGGCTTAGGC TATGAGCATT TTGTTTATTG TTCAGAAAAA 420

TTTAGAAAAT AtAATTTAAA TACGATGGCG TTTGTGAACT CACACGATGC TAGATTTGGG 480

CCATGGCCGA CACAAGATGG GTTATGTTCT TTGGAAGACC ACCGCGATTT AGAAATTGCA 540




GCATTTAGTT 

\j V-/ *i x x x n\j .i x 


TTAACAGGAT 


TAATCGATGA 


TATTTCTGTC 


GGAAATGCTT 


600 


ATGCCTCAGA AGCAGAATTA GCAGCAATGG CAGAAGCCTT TCATGCACCG TATCCTTCGA 660

TCAAAGTGGA TACAGAACCA GAAATCACAG AAGATGaAAG AATTGcATTG TTCGATAACT 720

TACACAGTTA TCGTGGaGAT CGTTCTGaTT ACGTTTTGCG TTCaACAaTG aCACGtGTTT 780

ACTATAAAGA TCGTCCTTTC CCAGCTCATA CAACCGAGAG ATATGGTTCG GGCGATGTCC 840

TTGnATTGAC AACGnnGGCT ATGG 864



(2) INFORMATION FOR SEQ ID NO: 768: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768: 



ATTTGCTTTT AGCAAATAGG AATCTTAAAC TTTAATTACA ACTAATTTAT TCTAATTTAA 60

GGAGAGAAAA ATGATTTCCA GCAAATACAC TATTAGAGAA CAAAATGAAG CTCAAATACT 120

TAATCAAATT ATAAAACATA GAGAAATATC TAGAGCCGAA TTAAGTAAGA TCACCTCTTT 180

AAATAAAGCA TCTGTTTCTT CAATTACAAA CAAACTAATA AACGATGGAT TCATCATTGA 240

AAAAAGGATC GGAGAAGCTA CAAATCGAGG AAGAAAACCT ATCATGCTGA CCTTTAACGG 300

TAGTTCTTCA GCCTTAGTAA TTGCAATTGA TCTAGGCTAT AACTATATAG ATGGCGCATT 360

AGCTTCTTTA GATGGAAAAG AAATCCATAG GGTTCAACTT ATTGATACCT ATGTTAACAA 420

AAAAAATGTC CAAGATTTAA tTCATAAAGT ATTCAATCAG CTTACTTCAG ATTGTCCTCA 480

TACCCATTAT GGCATAGTAG GTATGACAAT AGCTCTTCAA GGTCAAGTTC TAAATAATAA 540

AATTATCAGT ACTTCCTACA ATGATCTTGC TGAAATAAAC CTAGTTGAAA TGTTAAACCA 600

AGAATACAAc TTCCCAGTAT TCTTACAAAA TGAAGCTAAT TTATCAGCGT TGGGTGAATA 660

TACTTTTTCA TCAGATATAG AAmACTTAGC ATCCmTTmGT CTTCATAGTG GCATTGGTGT 720

CGGACTTGTG AAAAATGGTA AACTAGATGk TGGTAATAAm GGTTATGCTG GACAATTaGG 780

ACATACTATT TTGTTTCCCA ATGGAAGAAA ATGTACTTGT GGAAATCATG GTTGCTTAGA 840

AAAATATTGC TCTACACAAG TGATTTATCA CGAAATTTCG AAAGAGAAAA ATCTCAATAA 900

AATAAATTCA GATGTAGTTT CTATTCTATA TAATAACAAA GAATCAAAAG TTGTTAAAAT 960

GATTGAACAA TATGCCTACT ATTTATCTAT TGGTGTTAAT AATGCTATCA TGTTATATGC 1020 

ACCTGAATTA GTAATTTTCA ATAGTCCATT AACAAAAAAA ATTCCAGATA TAATAAAQAT 1080 

AATCCAAAGA AATTTAAACA ATCAATTTAC AAAAGACGTA TTAGTTATTA ACAGCCCTAT 1140 

CGAATGGAAT CCTGTAATTA GTGGAGCTAT CTCACTTTCA ATACAGAAAT TTTTAAATAT 1200 

TGAACAA 1207 



(2) INFORMATION FOR SEQ ID NO: 769: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 772 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769: 

nGATAATCAC TGCACCATCA TGTAGTGGTG TATTTGGAAT GAAAATATTA ATCAGTAATT 60

CACCAGTAAT ATCGGCAtCC AAATCAATAC CTGTCTCAAT GTATTCATCT AAACCGGTAT 120

TGCGTTCAAT GGTTATCAAA GCACCAATTT TCCGTTTTGA CATGTACTGA ATGGCTTTGT 180

CAAAGGCCTG AATAATACGC TCATCTTCTT GTTGCGCACC TTTATTCGGG CGGAAAAACG 240

AACTTCTCCC TAGATGTTCC AGCCCTCTTC GCACTTCTGG TTGAAAAATA ACAACCGCGG 300

CGATtACACC ATACATGATG ACTTGATTCA TtAACCAAGA TAATGTATGT AGACCAATTA 360

TCTCACTTAG AATGCGGATA ATGATAAAAA CAGCGACACC TTTTAATAGT TGGACTGCTT 420

TTGTTCCACG AACAAGCATG ATTAATTTGT AAACGAGGTA CCATACGACT AGGaTATCTA 480

TGATATkGAT AACAAAATCC TTAGATAAGA AATCCGTTGA GATGACTTGC TGCCAATAAT 540

TCmAATCTAA TAATTGATTC aATkGAAACG ACaTCATACC CCTCACTTCT CTCtCAAGAT 600

TyCCTAACAA GTATACCACA TTATTTTCTT TCATCACAGA ACGCAAGCGG AAACATCAGG 660

AAAATTTAGG AACTTTTTCA AATAACTTGA AATTTTTCAT CAAAAAATCG CATTTCTAAC 720

TAAAAAACGT GTGCTAAGCn AAAAGCTnCA GACACGTTTT TTTGGACTnC TA 772



(2) INFORMATION FOR SEQ ID NO: 770: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 770: 

GACnAGCTTC TAAAACTCGG TCCACGTCTA ATTCGGAGTG CATATACTCA AAAACATTGA 60 

ACGTATCACT ATTTGACAAC GTGTTTAAGT CTAAGACCTT AATCTTGTAC TTGCCTTTTT 120 

TCAATAAAAG TCCAATCTCT GGCAATATCC CACCGTCTGG GTCTGTTGTG ATAAAGCTAG 180 

CATGTCGTTG CATAAGATTG GTTTTGATAA AAGCTAACGT TTTGGCCGCT CCAGAGTCAC 240 

CTAGAATGGC GATATTCTTA TTTTTTTGTA GCTTGATTGG TAAACGTTTA TTAAAAAGAG 300 

AAATTTTTAC ATGCTTACTA ACAATAATAT TGTTTTCAGG AATTGGATCT TCATATTTTT 360 

TCATTTCAGC AGGTGTAGCA AAACGAGCCG AACCGTATTC TTCTCCATGA CGGTAGATTC 420 

CACGATCATT ATCTCGCACA TACATTAATA GCCCTAAAAG AAAACCAAGT ATTCCCACAC 480 

TAAATCCTAA TAGACTTGGT GTTATAAAAA ACCGATAAAA AGTTTGTTCG ATATACGTTA 540 

GAATATCTTT TGTAAAGAGA TAATTCGTTT TTGAAAAATA GTCTGTCCCT GGTAACAGCC 600 

AAAAGAAATT TCCTATTCCA AAGCCAATCA CAAATAGAAA AATACCAAGG AGAGATACGG 660 

GTTCAGTTCC TTTTTGGTAC GCTGCATCAC AGTTCCCTCC TTTTGG 706 



(2) INFORMATION FOR SEQ ID NO: 771: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771: 



CCAAAGGGAG GATTTnATGC AAATTAGGCA GGTACATGTT AGGGCATCGA AAATTAAACT 60 

GCAAAGAAnC ATTTACGATC GCTTTGGGGA CGATAGAATC AGCGGATAGT GCAATTGTGG 120 

AAATTGAAAC AGAAGAGGGA TTAGTTGGAT ATGGAGAAGG AGGGCCAGGC ATTTTTATTA 180 

CCGGTGAAAC GTTGGCTGGC ACGCTTGAAA CGATCGAATT ATTTGGCCAA GCAATCATTG 240 

GCTTAAACCC CTTTAATATT GAGAAAATTC ATGAAGTGaT GGATAAAATA TCTGCTTTCG 300 

CTCCAGCGGC AAAAGcAGCA ATCGATATTG CTTGTTATGA TTtGATGGGT CAGAAAGcTC 360 

mACTGCCTCT TTATCAATTG TTAGGCGGCT ATGATAACCA AGTAATAACG GATATTACAC 420 

TAGGTATTGA TGAACCAAAT GTGATGGCAC AAAAAGCCGT CGAAAAAGTC AAACTAGGTT 480 

TTGATACATT AAAAATTAAA GTCGGAACAG GAATTGAAGC TGATATCGCA AGAGTAAAAG 540 

CGATTCGTGA AGcAGTTGGC TTTGATATCA AGCTCCGCTT AGATGCGAAT CAGGCTTGGA 600 

CGCCCAAAGA TGCAGTTAAA GCGATTCAAG CATTA 635 



(2) INFORMATION FOR SEQ ID NO: 772: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772: 

TnATTAATAT CTATATTTGA AATGTTATCA TCATATAATT CAATTGTCGC TAAAAGTTCA 60 

CCTACAGTTC CCTTACACCA ACTATTGTTT TTAGGTTCTT CTTCAAAGTA TGATTCTTTA 120 

ATTTTATGAA GTAAAGAATT TGCCTTATCT ATCCTATTGA ATTTTGATAG AAGATGGACA 180 
TAGCTATATA TACCGTGTCC GAATCCTCTG AAATTAAAAT AGGGTTCTTC GGATAATTTT 240 

TCAAAAATTT CTAATGAAAA TTTTCTATAT TTTTCATCCT CTGTTATCTC GGATAACAAT 300 

AATAAAACTT TTATTATACT ATTATGCCCG TGGATCCAAT CATTTTTTAA CTCGCCATTA 360 

TTTATCGGTT TCTTTTCAAT CAACATGTCT GGCAATTTCT AnGGACTACA TTTAAAGAGT 420 

TTATATCATT ATTTAAGCGG nAATCTACAA GTAAAGGATA TATTAAACTT CCTTTTCCAA 480 

AAAAAGCTGA CAATATATCT TCAGATGGTA TTGTATAAAT TGAATTTTTT ATACATTCAA 540 

TTACATAGTC AtATTTATGA TycTttGTAA TGTATTTAAG AGCCACATAA AATATGAATA 600 

TACCAGGCAA TCCATCGwAC ATATTATTAT TTAGGATTCC CACATTCCAA TCTTGATCCA 660 

ACTTTATGTC AATCCAATTT ACAGTATTGG TTTTTTTATT AAAAATAGCT CTCTTGAAAA 720 

TTTTTTTCTC TATCTTCTGA CATGCTTGAA TAATTTTACC GTTTAGCTCT AAGCCAGTAT 780 

AAATATATTT ATTTGAATTT TGATTTTTCA AATCATTAAT ATATTTATAT GGATTATAAA 840 

TATTcAAAGC AATTTCTAGC CAAACGGTTT GAATAGAAAT ATCTTCrTCA CArrGaTCAT 900 

TTATTTTATT TAGGCATCTA TTCAGAGCGC TTnCTTGATA AAAATCTTCT ACTAAGCACC 960 

CGTCACTAGC TATTAATGAT GTTTTTGAGA TATTGTTATA AAAAATGGGT ATGTCTCCAT 1020 

CTATTAAGTC TGAAAATTCG TAATGAACTA CCTTTTTATT CTTATATGGA TAGGCCCACA 1080 

TATTATGAAG TACTTTTTCT CT 1102 
(2) INFORMATION FOR SEQ ID NO: 773: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1081 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773: 

TATGATTGTT TAGGTTGTAG TTGTTGCAAT TCGGCTTCTG TTAAGGGCCG ATATTCCCCT 60 

AGTTCTAACG TTTCATCCAA CCATAGTTCG CCCATTCGTA ATCGTTTTAA ATACGTCACA 120 

GGTTTGCCTA CAGTGGCTAA CATCCGTTTG ACTTGATGAA ATTTCCCCTC TTGAATCACT 180 

AAACGGACTA CGGAAGTCCC TTCTTCCTCA TTGATCGTCT CAATGACTAA ACGACTTGGT 240 

TTAACCATTT CTCCATCTTT CAAGGCAAAT CCTTGCGCAA ACAGTTGACA ATCTTCCGCT 300 

GTCACAAGTC CTTCAACTTC AGCTAAATAw TCyTTATCCA CATGCTTTTT GGGJcGaTAAT 360 

AATTGATGCG CTAACTGGCC ATCATTtGTT GgTAGAAGCA AGCCyTCGGk AyCTTwATCC 420 

AAGCGACCAA CTGGGrACAA GTCGGCACGG kAATCCTGAT CAGTtAACAA GTCAATTACC 480 

GTTTCATCTC GCTGATCTTC CGTCGCAGAA ATGACACCTT GTGGTTTATG CAACATGTAA 540 

TAAAAGAATT TCTGATACGT AAGAATTTCT CCCATCAAGG CAATTTGATC TGTTTCyTCA 600 

TTtACTTGTG kTTTGGCTTC TTTtACGACG TCACCATTCA CCGkAACGaC TTTTTTCTTT 660 

AACAATTGTT TGATTTCTTT ACGACTACCG ATACCAACAT CGGCTAAAAA TTTATCTAAA 720 






CGCATCGTCA TTCCTCTTTC CTTTAAAAAA ATAGCTTGGG ACATAAATCG AATCGATCGA 780 

TGTCTCArGC CTTGTCTCTT AAGAGAATCT GCTGAAAGGA GTTCAATCAG CTATTATTTA 840 

ATATTGAATT TTGCACGTAA TTTTGCGGCA TTTTTCCCTA ATAAACGATC AGCTAAGCGA 900 

ATCTTCAAGA CCGAATAGCC ATAAAAAGCG CCTCCTGCAG CAGCCACAAT TACAACGATA 960 

ATGAAGGCTk GGAAAAnGTC TAGTTGGATC TAAAAATAGA TACAAGAACT GACGGAAAAT 1020 

CAAAGnGATA ATCATCATGA TGACTGTnAT ACAGAACATT AACAATGTTC GCCGTAAGGT 1080 

T 1081 
(2) INFORMATION FOR SEQ ID NO: 774: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774: 

TTACTAAAAA ATTGTTTAAG ATTGTCCATA GTACTCCCTC TGTTTACGTT TCTTGGCTAA 60 

GTCTACCATA TTCTTGCAGG TACGACCTAG GTCTAAAGCA TGAAAAATTC TTGAAAAACA 120 

ATTTCAATGG TCTTTTCGCT GTAGCAAAGG AAATATGGCC TTCACTAGAG GTAAATAATT 180 

GCTTACGCTT AGAAACGAGT CTTTATTTAG CCTCTATTCT ATCATGCTTT TCTTCGTTTT 240 

TTATGTAAAT TTGTTTAAGA ATCTCTCAAG GAACTGCCTT TCTAACAAAT TTCAGCTTTT 300 

TTTCCGTTCT GACTTGACCT TTTGAGTAGC TTTGTTACAC TAAATAGCGA ATCAGTGAAA 360 

GGTGGGATTT GGTGGAATAT TCAATGAGAT TACCTGAAAA TTTTTCAACA ATGACTGTAA 420

AAGAACTATT AGAAAATGAn GGCTAGTCCT CGAAAAGTGC GCCATTTTTT n 471 
(2) INFORMATION FOR SEQ ID NO: 775: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775: 

ATGGAAAGAA AGTAATTAAA GGAGCTTCAA AATAGTTTAA TATATAACTT TTAGGAGGGA 60 

ACCCAAATGG AAATTATTTA CCCACCTTTA GTTGAAGAAG GATTAAAGTA TTATCTTGAA 120 

ACGACCCAAC AATCGCTAGA TAAAAGTACT TTTTATCGTT CGATGGTAGA GAGAGGAATT 180 

ATTACTGAAA CGGGTTTACC GACCCAACAG GCAATTGAAA ATGGATTAGT CAAAGACTAT 240

TACGAGGACC AAGGGTTATC ATTTGATGAT TTTTTAAGAA TCTATCCAAT TTTTGAAGAG 300 

TATGACGAGG AGTTATTTCA ATGTATCGAT GGTaTTGGGA GATACCTGTg ATATGAAgAA 360 

AATTTAGCTc ACAATgGAAT CTGGCGAGTa cTTTgAgtGC ACAcAGtcAA GCGTtCTgAg 420 
AcGtAgaTgA gaGGAAtTtG CGcAAAATAT TAGAGGCGAC GGGACGCGAC CCTGCAAAAA 480
(2) INFORMATION FOR SEQ ID NO: 776: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776: 

GTGAGAACCA ATCTTAATTG GTGATCCAAC AGCATATTGC AAATGCTGCT ATGGCTCGTG 60 

GAATCAACAC ATCAAACTTT ATTATTATCA ACCCAAATGA TTATGAAAAA TGGGATGAAA 120 

TGGTCGAAGC ATTTGTTGAA CGCCGTAATG GAAAAGCTAC TAAAGAAGAT GCTGAAAAAA 180 

TCTTAAAAGA TGTCAACTAT TTCGGTACTA TGTTAACTTA CATGGGTATT GCTGATGGTA 240 

TGGTTAGTGG TGCTATCCAC TCAACAGGTG ATACAGTTCG CCCTGCGTTA CAAATCATTA 300 

rAACAAAACC AGGCGTGAcC GTGACAAGTG GTGCTTtCTT AATGATTCGT GGTCGTGACC 360 

AAGAAAGATA TkTATTCTCA GACTGCGCGA TTAACGTGAA CCCAACTGCA CAAGAATTAG 420

CTGAAATTGC TGnTGATAGT GCCAAAACAG CTGAATnATT TGACATTGAA CCAAAAGTTT 480

CATTG 485



(2) INFORMATION FOR SEQ ID NO: 777: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777: 



GTAAATTACT TCATnATTTG GnAACTACGG ATAATTTACC TAnAnATGAC AATTGGCCAA 60

TAACTATTAA AAAAATTACT TTTTTAAATC AAATATCAAA TGACAGCATA AGTAATTTGT 120

ATAAATTGTG GGTATTACCA CTATACTATA AGGCTATATC GGCTGATAAA CCTAATAATA 180

TGAAATACTT TAGTGTATTT AAAAAGTGTA TTCCATTTAT TTCAGATACT CTTTTTACTT 240

ATAAGTTATT ACTAAGATAT CAAGAAACAC AATACGAAGT TTTAAGAGAT TTAGAAAGTC 300

TATACAGTTC AATTTTTCGT AATTTGTTGA AATTTAAAGC TAATGAAAGT CCTGCTATTA 360

TAGTAGAAGA ATTGAAACGT GAAAGATCGG GCATTCTTAA TAATTCCCAG GAAATAATAC 420

AAAAAGCTAT ACAAGAATTA AGCTATTCAA AAAAAACAAA TTCCCCAATG GATACGGATC 480

TAGTCAGGTA TCTAATTGAA AGACTAAATG AGAATAATTT TAATACTCAA AGTGTTGTAG 540

GTGGAATTGA cACATAATTG AgATGAAAAg GamCGAAtGC AGTAGTAACA TAGGAATTTA 600

GTaTTTTTAG AAGGAAtTAT AATGATATGG CTAATAAGAT TAAACAAGAT TTACTTAAAA 660

AACATAATGA TGATGAAAAC AAGGTTGAAA GACTACTTTT AGAAAAGAAG TTTACTGAAA 720

TATACACTCG TAGTGAGTAT CCTCAAGTGA AGGAGCTATG CCGTACTTAC CCAGATGGGG 780

ACTTTACCGA AGTAGAAGTC AAAGAAAGAG CTAAATATAT GGCAAAGTAT TTTATCGATG 840

AGTATTTAGA AATCAATTAA CAGTTAATGT AAGTAAAATA AAACAATAGT TATTGAATCG 900

ATTACATTTT TTTGAGAATA CCAAATGGAT CTATTGATAT GGGGAGTCAA CCTTAAATTA 960

TTACCAATGT AAGAAAACGA TTCATACAAC TCGAAACAGA GTAAGTCAAA AAGGATATTA 1020

CAAAGAGTTA CACTATTTAT GAAAGAGAAA AAATTTAAAA CGTGCCAACA ATGCTTTTAT 1080

AAATTAAGCA ACGTTGGTTT TTCTGTTACT TAAAATCTAA TATCTTTAGT ATAATACCTT 1140

AATGTATTCT TAGAGTACGC CATAAGTGGT TAAAATACGC ATTCAATCTA GTATTCATGG 1200

TGAACTTTCT AAAAATAATA GTATTAATTA CAATTATTTA CTTGAGCGTT AAAAACAAAC 1260

TTAAAGAAAA ATGACTGTAG ATTTCTGAGC TACTTTAAAT TCAAAATAAT TCGGraAACT 1320

GAAATTTTGT TGkTGCaAAA CAGGACCGTT TGGTCCGGAT AATGGTACTC GGAAAGGCCG 1380

GTTTTTGC 1388



(2) INFORMATION FOR SEQ ID NO: 778: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778: 



ACTACnGAAG AGTACnATnG AGCACCTTGA TCCCGCACAA AATCGGTGCC AGATCCTTAT 60

ATGATTGATA CCACACCCAT GGCTGCCACC ATTAAAGCGG GCGAGACCAT TTCCATGACT 120

TCGAAAAATA TGCGACAAAA AGGTCAAATT CTTTTAGAGA AGACTGGGGT AGAAACAGGT 180

ACTGATCTTT GGAATGACAA TTATTCTCTA GCTGGAAATA CATTTGCCAT TCGTAAAGAC 240

AGCCCAGCTG GTGAAATTGT CCAAGAAATA ACAACGGATG AAAAAGGTCG TGCGGAAACA 300

CCAAAAGAGC TTGCTAATGC TTTGGAACTG GGAACCTATT ACGTGACAGA AACTAAATCT 360

AGTAATGGTT TCGTGAATAC CTTCAAACCA ACAAAAGTCG AGTTAAAATA TGCCAATCAA 420

ACCGTGGcTC TTGTTACCAG TAACGTAAAA GGGCAAAACC AAGAAATTAC TGGGGAAACC 480

ACTTTGACAA AAGAAGACAA AGATACCGGT AATGAGAGTC AAGGGGAAGC TGAGTTTAAA 540

GGAGCTGAAT ATACTCTCTT ACTGCAAAGA TGGTCAGCTG TAAATnGAGT GAGCTTTAAA 600

ACAGATTAGT GAGGGACGAA GCTTCTGTGA ACAGTGCTTG GCTTAGTGAA GACCAGTGCG 660

TAACnCTGCA TTACGGATTC TGCAGACCAG CCTGAGTTCT TGTGACGGAT CTGAnnTAAA 720

GTGTATACGA AAATG 735



(2) INFORMATION FOR SEQ ID NO: 779: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 779:

GCTGTACGGC AGCCAGACGT TGCTGTCCGG CACGGTGATA CTGTTCAAGC AGCTCGATAC 60 

CGATATACCG GCGTCCGGAC TGAAGGGCGG CCACACAGGT TGAGCCGCTG CCTGCAAACG 120 

GGTCCAGCAC AATCGCGTTC GGGTGTGTGA AGCTCTCAAT CAGTGGTTGC AGGCTGGTAA 180 

CAGGCTTTTC CGTCGGGTGA TGACGATTGC CTGAATATTT CCAGCCCAGC ACGTCGGGCA 240 

ATGGCTTCTG AGGCA 255 
(2) INFORMATION FOR SEQ ID NO: 780: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 677 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780: 

AGATGCTGAT TTCTTTGCAT TTGAAGCGTG TGAGTATCGC CGTCATTTCT TGGCTTATTC 60 

ACCAGATTAT GCGATTATGA CGAATATCGA TTTTGATCAT CCAGATTACT ACAAGAGCAT 120 

TGAGGACGTT TTTTCAGCGT TCCAAACAAT GGCTCATCAA GTCAAAAAAG GAATTTTTGC 180 

TTATGGTGAT GATAAGTATC TTCGCCAGTT AGAATCAGAA GTGCCAGTTT ATTATTATGG 240 

CGTCAcGAAG AGGgATGATA TCCAAGCCCG AAATATTCAA CGAACAACGG AAGGCTCATC 300 

TTTTGATGTT TATCACAAGG ATGATTTTGT AGGTCATTTT GTCTTACCAG CATTtGGCCA 360 

TCACAaTATC mTGAATGCGC TAGgTGTGAT tGCtGTGGcT tATTTTGrAA AACTtGrTAt 420 

GCmAAAAGTC GCaGAaGAAa TGCTAAGTTT TAAAGGTGTA AAACGTCGTT TTAGCGAGAA 480 

AAAAGTCAGT GACATGATTA TTGTTGATGA TTATGCGCAC CATCCAGCTG AAATTAAAGC 540

AACGATTGAT GGCGCTCGCC AAAAATATCC TGACAAAGAA ATTATTGCTG TCTTCCAGCC 600 

ACATACATTT ACACGAACAA TTGCCTTAAT GGATGAATTT GCTGAAGCAC TGGATTTGGC 660 

AGATGAAGTA TTCTTAT 677 
(2) INFORMATION FOR SEQ ID NO: 781: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781: 
TTTAGGGATG AGACAATTAC CGATGGGCGT TTCTTATGCG GTTTGGACAG GAATTGGTAC 60

AGCAGGAGGC GCTATTTTAG GAATGCTCTT AGGTGAATCA AAAGACCTAC GACGGATATT 120

tTTTTATTTT TTAATCATTG TTTCTGTGAT TGGTTTAAAA TTAATTGGTT AGAAGGCGAG 180

AGAACATTGT TTGGATTATT GAAAGACAGG TGAGTAAAAT TTAAAAAGAG ACAGAACCAA 240

GTGCGGTGTT CTGTCTCAAA ATAGCTTTTA TTTTACAAGG TTACTATAAC GtGTCGCAAA 300

TTCATTGTTG TAGGACTTGC CGTTGACATC TkGACCAACG TCCCAATTTG CAGACCAAGT 360

CATTAAGCCA CGG 373

(2) INFORMATION FOR SEQ ID NO: 782:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 706 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782:

ATGCATGAAA TTAAGTAAAC GACAAGAGCA AATTATCGCT GTTGTAAAAG AGCATCAGCC 60

AGTTAGTGGT GAACGCATTT CGGACATACT AGCCGTTTCA CGTGCGAmAT TGCGTTCTGA 120

TTTGTCTTTT TTAACACTTT CAGGAATTTT AAAAGCAAGT CCGAAAGTTG GTTATACTTA 180

CGAAAGTGAT AATATGGAGG CTTTTTTCTT TTTTGATGTT TTTCAAACAA AAGTTCAGGA 240

GATCATGAGT CCACCaTTGA TGGTTGCCCA AGACACCTCA ATTCGAGATG CGATTACCAA 300

CTTGTTTATG TATGATGTCG GTTCTCTTTA TGTCAtGGAT GAAGCmAAAG AaTTATTAGG 360

GGTTTTGTCA CGAAAAGATT TATTACGGGC CTCTTTAAAT ACAAATATTG ATGGCACACC 420

TGTTGCTGTT TGTATGACAA GAGTGCCACA TGTGAAAACC TGcACACCAG AATTTACTAt 480

TCTTGAAGCT GCTGACACTT TGCAAAAATA CGAAGTGGAT TCTTTGCCAG TCGTTGAAAA 540

AGAAAATCCT AAAAAAGTTA TTGGAAAAAT TACGAAAACG AAAATTTTAA CCTATATTAC 600

GCAACAAGCG AAAGAAGCCG CACAAAACAG ATAAGGGAGC TTAATTATGA AAAAAGAGAT 660

CATTGTTTAT ACTATTTCCG ATTCACTTGG AGAAACATCA CAAAAA 706

(2) INFORMATION FOR SEQ ID NO: 783:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 499 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783:

TGAACAAAAT CGACTTCTTC ATAAGTCACA TCCTTTATCT ATTAATACAG TTAAAAAATC 60 

TGTATTACTT ATAGCATAAA TGAAAGCGAT TAAAAAGTAA AGTTCTCTTG AAAATTCCCA 120 

TGTTCAACTT TTAGGCTAGG TTCTTTTTAT ATGCTAAAAT AGAATAGAAA CGGAGGTTTT 180 

TaATGGAATG GaTTGAAATT AaGCACGCAA CaCaAAATAA TTTGrAGrAT ATCTCTGTCA 240

ATATCCCTAA AAAGCAACTA ACTGTTGTTA CTGGACTTTC GGGTTCAGGA AAGTCCTCCT 300

TAGTATTTGA CACATTAGCC GCTGAATCAC GTCGGGAACT AAATGGATAC CATTTAGTTC 360

GnTTTGGTTC CAAAATTACT TTACCCCAAA TATGGGTCGT CCCAGAAGTT GGAAAAAAAT 420

CGAGAATCCT TCCTGGTTGG CCAATTGGTC CATTGGACCC nGAAAAAAAG TAnGCAGGGn 480

AATTCnCCGT TCCGACAGT 499



(2) INFORMATION FOR SEQ ID NO: 784: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784: 



TATTCATTTC TCCCCTACTC TCTCTGGTTT TTATGGACGT GTATAgaTTA CTgTTTTTTT 60

ATAGGaAAAG AAATGGGCAT TCTTGTATAT TTTGATGTAC AATATAGTAT AGAAAGCGGA 120

TACAAGTAGG AGTGATTTTT TATGAACTAC ACAAATAGTC GAACGTTTAA TCCTGAAATT 180

TTATATGCCT TTGATCCTTG GAACGAAGAG ACCCACCCTT ATAATTGTCA TCATCATGAA 240

TTTCTGGAAA TATCTATTCT TTTAGAAGGT GAATCTGAAT ATATTGTTCA AGGGCAACAG 300

TACCATGCCA CTGCTGGTAC AGTGTTTCTT TTTAATCCGC GAACGGAACA TGGTGAGCAA 360

CAAAAAGCAG GAACTTACTC ACATCAACTA CACATTGGCA TTTCAAATCT CTATTTAGAA 420

GGCTTAGCGC GAAACGTATT TCCAAATAAA TCAGCTTTAT TAGATTTAAG TCATTTACAT 480

GGTGCTTTTT TAGAAAAAGC TTGGCAAATT GTTCATGAAT TAAACCATCA AGAAGTTGAA 540

TCAGCGTTAC AAATTAAAGC TTTAGTTATT GAACTATTGG TCTATATTTT AAGAAGCTTA 600

GCAGTTGATC AAGAAAATAA AATTGAAACT CGGCTATCGA AAACGGAAAA AAGAAAACGA 660

AATTTAGTCA ATCnCAACCA AAAATnGTTG CGATGGTCCA TGGCGAAACG GCGAT 715



(2) INFORMATION FOR SEQ ID NO: 785:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 694 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785: 

ATTAAAATTG GCGCATTTTT CAAAAAAGCC CGTGCTATGG ATAAGCGTTG TCGCTCTCCT 60 

CCAGAAAGTG AACTTCCGTT TTCTCCGACC AATGTATGAT AACCGTCAGG TAGTTTTTCA 120 

ATAAAGTCTT CGCAGTTAGC TAAACGAGCT GCkTTTTTCA CTTCCTCATC TGACGCTTGA 180 

CTATTTCCAA TACGTATATt TTCCAAAATA CTTGTATTAA ACAaAGTTAC CTCTTGCAAA 240

ACAATAGCGA TTTTGCTAAA CAnAGAAGCT GGAGATACAC GTTTAATATC ATACCCATCA 300 

ATTAAAATAC AGCCTTCATC ATAATCATAC AGTCTAGACA CTAATTTCAA GATACTAGTT 360 




TTCCCGGAAC CACTAGCACC TACTAACGCA tCACTTCTCC TTGTTTAGCC GTAAAAGAGA 420

TATGATCTAA AATTGGGGTA TTGTTATCAT ATGAGAAAGA GACATCTCGC AACTCAACGT 480

CAAAAGATTT CAGAGGACTA TCTGATCCCT CTTGGAnACT GGTTTCCTTC ATAGCTCTAA 540 

TTCGCTGTAT TTTAGGnGCT AAATAAAATA TTTCAAGCAC TGCTTCTTTC ATAGAATCGA 600 

AGGAATCCTT TAnTTTTATT GCGGCTAGTA AGTAACCAAC GACGTAGAnG TATAGTCACT 660 

TCACCCGTTA AAAGTAAGTG AACACCTACT AGAA 694 
(2) INFORMATION FOR SEQ ID NO: 786: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786: 



AAGGAACTGT nAACAGAGCA ATTATTTTGG CTAGCTGAAT CTTTTTGGCC ACAAGTGTTT 60

CCAGAAGAGG AAGACTATTT TTTTGGCGTG AGCGGTGGCC TCACATTTTA TCAAAAACAA 120

AGATTGGCCT CGGCACAACT ATCTATAATT ACGCTTGAAA AAGAAAAGAC AGAGGTTCCA 180

ACAATGCCTC GTTATTTTGA TTACTTGGGC AAAGAGCAAT CCTTGCCAGA AGCCTTCAGC 240

GCTTTATATG ACTTGCTAGG GGACGTAAAT CCTGAATACT ATCTTTCGCA AGCGAGAAGA 300

GTCATTAAAC AAGCTCTGCG AGGCCGCAAA GTCAGTACTG TGCCTAATAT TTTTCAAGAA 360

AGTTTGCACT TGACCCAAGT GATTACGATT GACCAGGACC ACTTAAAACT CCTTTTACCA 420

GTAGCAACGG AACAAGCCGA GCCTTTAGAA GCACAGTCGG ATATTCTAGC ATTTTATTAT 480

GAAAAAATTG CTAACCGATC GGCCATCGAA CGCTTAGTTT TTATGCAACA ATTGATTGAG 540

CAACTAGGTA CGAACTCTCT AAGTTATCTT AGAATAAATT AAAGTTACGA TAATTTTATT 600

ATGTAAACTA AAAATCAATA ACAAGAAATG GCGCCGCTTC TTAGTGAGAA ATGGCGTCAT 660

TTCTTTTGCG AACTTTTTTC GTCCAAAAAC CACCGGAAAT GAACTTAGGC TCATAGGAAA 720

TGATAAATGC TCTCGGCTCT AATTGATTGA TTAATTTGTA CAATGTACGT TCGTTTTTTC 780

TTGGGGAAAG GATTTCTAAA ACCATTCGTT CTCCTTCGCG TCCGTAGGCG ACACTTTGCG 840

TAACACCATA GCCGTGTTaC ACGAAGtGTT tCTGGTAAAT GAAATTGTTC AGTAGTAGAA 900

GGGAGGATAA CCGATACCAT AATATAACCC AAGGCCAAAT AGTCTTCGAT TTTAATTCCT 960

ACACTAATGC CAACAGCGTA ACCAAGCGCA TAAACAAGTA AATTTAAAGG ATTGTCTAAC 1020

CGATTCAGAA CCATACTCAG CCCTAAGACA TAAATTGTTA TTTCAGCCAT ACTAACCAGC 1080

GGTGCAATAA CCCGATAACC TTTCATCGTT AACATAAAAC GAATAGTATT TAAGGTAATA 1140

TACGCAAAAT TAATGATAAA AATCATAGCT AACATTTTCA AATCAACGAC CATAAAATAA 1200

GTACCTCTTT TCAAAATCTT TACATCCAGC CCTCATTATA GTACAGaAAA ACTGAGAATC 1260

AAAGGCGAAA CACTTATAGA AATGAATTCT TTTTTGAATA CTCGCTCATA TTATTCGCCT 1320

AATTTAACTG TTATTTGATA TAATCTTAGT ATGAAGTAAG AAGGAGGTAA GCGATGAAAA 1380 

TTTTTGTTTG GATTTTGTTA TTTTTATTAG TAATTAACGT TATTGCTGCC TTAATTACTG 1440

TTTTTAGAAA ACCTCGTAGT ATTTCCAGCG TTTTAGCATG GATGATGACG TTGATTTTCT 1500 

TGCCTGGTAT CGGCTTTATT ATTTACTTAT TCTGTGGGCG AGGTATCGAT GGACAAGAAG 1560 

TTTTTAAATT GTCTGATGAT GAAAAACAAA TGGTTCAACG AATTAAAGAA AAAGTGGACG 1620 

TTGATAACAA AAAAGCAGGG CGCGATAAAC GTTATGATTT ACTTTATGAT GCGAAAGTTT 1680 

TAAATCGTTA TTTCCGGAAT ATGGACGCAT CACCGTTGGC CAAAAGAAAT AGTTTGCAAT 1740

TATTCACAGA TGGCCAAGAA AAATTCCAAG CATTATTTGA AGATATTCGT GCAGCGAAAG 1800 
(2) INFORMATION FOR SEQ ID NO: 787: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 976 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787: 

ATTTGAATGA TTTGTAAAAT TTTTGGTACA AATAGAATTA GTCCTACTAG CACTGCGAAA 60 

CCGGCTGTTA ATAAAACAGC TCCTGCGGCC ATGTCTTTAA CTTTTTTTCC GATTGGATGA 120

AAGTGAAAAT CCGTAAACAT ATCTACGACA TTTTCAAAAA TTGTGTTGAT AATTTCTGCC 180 

AACCAGACAA GAAAAACGGC CAATAACAAC CACAGCCATT CAGCAAGTGT TAAGGAAAAC 240

AAGAAACCAA ATACAATGGC CGTGACACCA AACGCTACAT GTTTGCGCAT ATTACGTTCT 300 

TCTCGAAAAA CTGTTTTGAT TCCTTGCCAG GCAAATTCTA ATGAATTAAT AAAATGTTTA 360 

TTTTTTCCTG TTTGCTCATC TTTTAAGGCC ATACGCATCT AAAaTCTCTT cTGTAAcCAA 420

cATTTyTTTy CacTTCAGGT TCCATATGAT CGTACCCaTT CAAATGTAAA AgCCATGTAC 480 

TGCCAAAAAG CCCATTTCTC GTTCCAATGT GTGCCCATAT TCAACTGCTT GTTCTGCTGC 540

TCGCTCTGTT GAAATAATTA AATCACCTAG ATtACGTGGT AGCTCTGCCA ACTCTTCATC 600 

ATCAAAAATA ATTGGCAATT CGTCTTCCCC TTCATCCTCT AAAGCAAAAC TGATGACATC 660 

TGTAGGCATA TCTTTGCCCC GATAGTCACG ATTAATCATT TGGATACCTG CGTTATCTGT 720 

AAAAGTCACC GACATCTCCG TATCTTCGGc AATTTCTAGA AAACCCGCTG CAAATTGCAG 780 

CAAGTTTTCG ATTTCTTTGA TTTCTTCTGT TGGTACTTTC TCGGTTTCAT CAATGAAAGT 840

AATATCCATT AGTCATCTCT CTTTTCTTGT TCATCTTTCA TTTCTTCTGC TTCAGCTTTC 900 

ATTTTTGGAT ACTTAATCCG TGAATGGAAG GTACTGCATA ATCCATTAAT TAATGATTTT 960 

TCAATGATGC GAATCT 976
(2) INFORMATION FOR SEQ ID NO: 788: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 562 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 788: 

TTAGGAATGA CCAATTTTGA AAGGGGTTCG ATTGATGAAT TCGATGATAT TTCTAGCATT 60 

GATCAATACT ATCGAGCAAT AAAAGAAGGA TTTACGCCCA AAGAAGCCTT GTCACTTGTC 120 

AATTTGCGTA GTCGAGACAA TGCCCGCACG CCTTTTCCAT GGAATGATTC TATGTATGGC 180 

GGTTTTTCAA GCGTCAAGCC TTGGTTAGGA ATGGTGGATA ATTATAAAGA AATCAATGCC 240 

GAAGCAGAAA TAAAAAATAG TCAAAGTATT TTTCATTTCT ATAAGCGGAT GATTGCATTT 300 

CGTCaAAAGA GTCCCTATAC AGACATCTTA CTTTaTGGrA CATTTGAGGG tCTGTCaAAT 360 

CTTCCmGaTA ATGtTATTGC tTATAAACGC mAATTaAATG aAAAAACGAT TTACGCCtTT 420 

TTTAATTTkG GkGAGGcGGk tCCaATCCCy CmTTAtTTAG kTAATGGCAC AGTTATTTTT 480

GATACACAGT CTGAAGAACG AGAGCAATTA GTTCAGCAGG GAACATTATT AAAAAGCTAT 540

CAAGGACTAT TGATTACTAT AA 562
(2) INFORMATION FOR SEQ ID NO: 789:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789: 

CAGTATTAAA AGTACTTnTC AATTACACAC TTAGTGTnCC AGTTTTTTTG GGTCTTTTCA 60 

ATAAAAATGA TTCTTTTTTC AtCGTTTTCT TtCAAaTCyA AACAAAGCAT TCGTTTGTTA 120 

GAAACAATAG CTACTTATTT CCGAATAATT AGaTTTAATT AAACCAAAAG CAAAGGAATT 180 

CACGAACGTC TTTcAATTAA CTTCTTGTTT AAATGGGAAT TTCAGGTCCT CTCACAAAGC 240

ATTTTTTACT TATTTTTCAA AAAAAACAAC GAAAGAATAG ACACAGCCAT TCTCTCGTTG 300 

CTTCATATCC TTTTATTTGT AATCTTCTTC GTAAAAACGT CCTAAAATTT TAACATTATC 360 

TGAGATGCTG ACAAAGGCAT TCGGATCACT TTCTTTCATG GCTGCTTCTA AACTTGGCAA 420 

CTCAAAGCGT GTCACAATAG TTAGCAGAAC AGTTTGGCGG TCGTGGCGAT ACGnCCCTTC 480 

CGCTTCATGA ATAATTGTAA TACCGCGGCG CATTTTCTTT TGAAtCTCAT CCACGACTCG 540 

ATCTGGGkTT TTAGTGaCAA TCATAACTTG CATTTTCTTT TGCTTAGTAT AAACGGCATC 600 

TGTGACCTTC CCACTTACAA AAATTGATAA CGCACTGTAG AACATGTACT GCCAGCCAAA 660 

TAAATAACCT GCAACGAAGA CAATTAGTGC GTTAAAATAA ATTGAAATCG AGCCAACGGA 720 

ACGCCCCGTT TTTTTCCGAA TGGTAATGCT TAAAATATCT AAGCCCCCAG AAGAAACCCC 780 

GTTTTTTAAG ACAAAACCAA TTCCTGATCC AGAAATCGCC CCwCCwAAAA TAGCACAGAT 840
AATTGGATCT TGGGTCATCa CCGTTTGCGG AACAATATGC ATAAATAATG AAGTTAAAGT 900

CACGGTAATG AAAGTAAAAA TTGTAAACTT CTTACCTATT TTTTTCCACG CAATAAAGAA 960

TAACGGCACG TTTAAAAGAT A 981
(2) INFORMATION FOR SEQ ID NO: 790:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 432 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790:

TTTTTCGAGA CGATACAGTT TATAGTTTCG TCCTGTGTTA CTAATGAGAA TCGAACTAGC 60 

ATTTTTGGGT ACATACTTAA CATTCTAATC TTATCAACTA CTCCTACATA ATTAGTTTTC 120

ATACTATCGC CCTCCTAGTA ATATAATTAT ACCGAATGTA TGTTCGTGTT GTCTATAGAA 180 

AAATTTTGTA AATCTATGAA ATGATAAAAA CAAATGGTGA TACATTTGGT GATACATCTG 240 

AAATATCTAT GAAAAAAATG TAAATAGTGA AAGACTTCAA TAACTAATAA ATCATTGATA 300 

TATAAGTAAT CGGAAAGAAT GAAATCTATG GAAATGTTTa TCTAACAAAC GCAACATGTt 360 

CGCtTATTAG GTAAACCAGG TTTgAAGAAT TAGCmAAAGA TTTAAATGCA CGTCTATAAT 420

TgTAAtAAnA GA 432
(2) INFORMATION FOR SEQ ID NO: 791: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791: 

CTGAAAAAAG CAACACTAGA AGTTTTGAAT TTACCTGCGT TGACAGGTAT AACTGATGAT 60 

AATTTACGTC GTCTACTCAA CAATCTAATT ATTGAACTGT ACAAGTACCA GGCGGAAGAA 120 

GAACGTCGAT ACATTCGTGA AACGCAGCAA CAGGGAATTA CACTGGCTAA AGCAAAAGGA 180 

AAATATAAAG GTGGAAAGCC AAAGTACcGA GAAAATGATC CACGTCTTCA GTTAGCATTT 240 

AAATTGTTTT TAGACGGATG CACAGATAAA GrAGTAGAAC AACAGACAGG GATTAATAGA 300 

AGrACTTTTA GrAGATATAG AATGAAATAT GGAATAACAA ATTCTCTTAG ACATAAGAAT 360

GATTAAAACT ACTAATCCAA TGTAAAACAG ATAATATATT AaAGAGGGCT ATAGAGTAAA 420

ATGCAAAAGA AGTTTGAAGC AAAGCAATGC TCAGTAGACG nGTTACTAAA GGCGAnATAA 480

GCGATTATAT AAGGTTTCGA ATTTTCAAAG AACATATGCA TGGAAATATA AAC 533 
(2) INFORMATION FOR SEQ ID NO: 792:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 287 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792: 



CGAGTTTTCG CTGGCGCGGG CCACCCTGCT GCTGGATCGC GACGCCAGAT CGGAGGTCGA 60 

ACAGGGCGTG CGCGATGCCT GGGACCTGGT CGGCGGtTCG CGtGCGCAAC TGGCAGAACT 120 

TCCTGGCCTG GCAGTTCCAC TACGAGCACT ACATCCTGGC GCGCCTGTTC GTGAACCGCC 180 

GCACCGCGCG CGTGCTGGAC TTCGGCCATC GCGAATTCGG CGAGAACCGC ATCCTGCCCG 240 

ACGAGGAGGC GCATCGCATC CGCATCACGC AGTGnTGGCT GCGCAAC 287 



(2) INFORMATION FOR SEQ ID NO: 793: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793: 



AATAAAATGG CCATAATAAC AAACGCAATG ACTAAATATA AATTACTATC TTTCAAATAA 60

TTTTTCATAT AATCCTCTTT TCTTGGTTTC AAGTTGCCCC GTGCCTGCCA CGGAAAATTC 120

TTTTTTATTA TAGTCGGTTT CGAGGGAAAA AGAAACAACT TCTCTTAGGT CTTATGATAT 180

ACTGAAAAGA AAAAAATGAA AGAAGGATTG GATATGACCT ATTCTATTAC ATGGGATTTA 240

GACAGTATTT TCCCCAhTGG GAGCGAGTCC AAAGAACTTA GCCAACGAAT GACTCAATTA 300

ACCGAACAGA CAAAAGAGTA CCATCAATTA ATCACAGAAT GGACACCAGA AAGTGCCCCA 360

nGCAAnGAAC AATTTGCTG 379



(2) INFORMATION FOR SEQ ID NO: 794: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 794: 



CCAAAATTAT CCGACTACGC CATAGATCAT TTAAGAAGGA TAATTTTTGG AAGGGAAAGA 60

GAATCATATA TGAAGTGAAG GAGGTCAGGA CAGAACCGTT TATTTTCGAA GGAATTGGTT 120

CTGTTCTCGC CATTTATCAG GTTTTGAaCC TGGAACAAAA ATCCAAAGTG ATTTTTGTCC 180

CAGGCTCTTT CGCTCTTTTA TTGAGTAAGA AGTCAGTTTA GAATTTATAT TAAGTTTCGC 240

TGTACATTTT AAAATTATCA TCAACAGAAA ATACAACAGG CGAATTTTTA GATAAAAAAA 300

TTAGTAAAAC AAGTATTAAA TAAATAGTAT TTTTGAGGAT AAAGTAAGAG AAATGTTGTT 360

TTTTTAATAG TTGTTAACGG AAAACAAAAT TATTTTATAG CCATAAAATA TATTTTTATT 420

ATAATTATAT TATAAmaymm tGaGCATATG ATTTGCAGAT AAGTTTTTCA GGGCCTATTC 480

TGAATAGGAA CTTAGCTATA AAAATAAAAG AGGAGAGTGG TTTTGATGAG AAAAAATCAd 540

AAGATTGTGC CAATTGTGAC AGCTTCAGAC GAAAATTATG CTCCTTATTT GAATGTAATG 600 

ATGACAACTG TTTTAGAAAA TTGTCATGCG GAGAGACCTG TTCATTTTTA TGTGATTGAT 660 

GATGGGCTGT CGCTATCTAG CAAAAAAGCA CTCCGAGaAA CGGTTAGTAG TAACTCACAA 720 

AATGCAACTG TTGAATTTCT AACAGCAGAC AAAGAAGTCT ATCAAAATTT TTTAGTGAGT 780 

GATCATATTA CAACGACCGC ATACTTACGC ATTTCTTTAC CGTCACTTTT ACAAAAATAT 840 

AGCTATAAAA AGGTTCTTTA TTTAGATGCA GATACGCTGG TTTTAGATGA TATTGTCCAG 900 

TTGTAT 906 



(2) INFORMATION FOR SEQ ID NO: 795:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795: 

GGGCAAGAAG cATCCAAGTC TTGATTCGCC GAGGAGCAAG TCTTGAAGAT GCACAAGATG 60

CAGTGTCGCA AACGTATACA ACGATTTTTT CAATATTACC TGAAATTACA TCAGAAAATC 120 

TACGCCCGTG GTTTTTCCGA GTAACATTTA ATTACTACAT TACGATGTAT CGAAAAAAGA 180 

AACGAGAACG CACTTTTGTG GCAAACAGCC ATCCTGTGCA ATCAATAGAA AA 232 



(2) INFORMATION FOR SEQ ID NO: 796:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796:

TTTATTACAT ATCAAGATTC TATCACCTTT ACAAAGAATG TTGCATCTTG GTATACAGCT 60 

GTATCTTCGG TATTACCATA AACTTTAGCA GCTGTATGAA CATCAGTGAT TAATGTATTT 120 

TCGTATAGTt CTTGcGCTGC ACTTACTCCA CCGCTATTAT GCCCATCGAA tAGaCCTTtA 180

ATTTCTACTT CTACTGTTtC ATCTGCACCT TtCTCATTAT CAGCATCAAA TAAATTAGAT 240

TTaTTTTTAT CTTATCTCCA ACTTTAAGGT TATTTTTTTC AGCTAAATCT TTATGCATTA 300

AGATTTTATT TTTATCTTCA TTTTCTAAAT GCTTTCCTTC TACTAATTTA TATGCTTCTG 360

ATACAAATTT TGTTTCTTTT GTTGAGTCGT TAACCCCAGT TAACATAACT GTTCTTTTAA 420

AATTCTTTGC TCTTTCAGGk GATTGaTTcG CAAGAGTCTC TTGAGTTTCA ATAATATCAT 480

AATCAACTAA ATCTGCAACA CTGTTTATTC TTTTTACATA GGAGTCTATA CTATCTGTTT 540

GAGATATCTT TTTAATATCT TCACCTTTTA CATTTCCCCC ACCTCTAGGT GTTCCTGGAT 600 



(2) INFORMATION FOR SEQ ID NO: 797:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 636 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797:

TTTTGGAGGA CAAGATGATT CGTATACTCG TACTATTACT CGCCGTTGTC ACAGGCGTAC 60

cAGTTATTAT TTGATGAAGA AAAGTGCAGC TTTTTTACCA CTCTTAAAAA AAGAAACTGC 120

AACAGAAAGT CAACAGTTTA TCGAACGTTT TGGGCGTTAT TACTTGATTA TCGCCATTCT 180

AGGTGTTCTA GCGGCTATTT TTAATCGCCC GTTATTGTCA ATTGGTTTTA TCTTTTTTGT 240

TTTATTACTT TCAACGTTGT TTAGTTTAAC TTTTGCAAAA AAAATGTCAT AAAAGACGAT 300

GAATTTAAAG AGGTCGGGAC AGAAGTGTTT AACTCCGAGA AATAAGAAGG AATTTCCGAA 360

AATTGTTCTT TAATTTTTGG AGAATTTCGG CTTATTTCCG AAGaGTTGCT TCTGTTTCCG 420

CCGTTTATCA ATTTTTAAcG GTGGGACAAA AATCCAAAGT GATTTTTGTC CCAGGtTCTT 480

TTTTTACmAA AAAAATGCCA ATATTCAGaA AAAAAAGAAG CGGnTTCAGT TcCkTTTTtC 540

nTTGnTTGnT TTACnAttAT aTngAAmCAt TAAtGaTTtT GTTaCGCTGA AAGGAGCGGT 600

TATTATGTTA CAACAATATC AnAAAATTAT GATAGC 636

(2) INFORMATION FOR SEQ ID NO: 798:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 658 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798:

TGGGTAATTT TTAGGTTTCA CTTCGAAACG ATTATTGACA TACTTTCGTA AATACGCACC 60 

TGTTTGTAGG GAAAGTGTTT GTTCCTCTTT AAAAGATAGA AGATTTTTGA CTGTAAAATT 120 

AACTAGCATG CTGTTCACTG AGAGATCCCC TCATAATTTC CCCAAAGCGT AACCATGTGT 180 

GAATAAATTT TGAGCTAGTA GGGTTGCAGC CACGAGTAAG TCTTCCCTTG TTATTGTGTA 240

GCCAGAATGC CGCAAAACTT CCATGCCTAA GCGAACTGTT GAGAGTACGT TTCGATTTCT 300 

GACTGTGTTA GCCTGGAAGT GCTTGTCCCA ACCTTGTTTC TGAGCATGAA CGCCCGCAAG 360 

CCAACATGTT AGTTGAAGCA TCAGGGCGAT TAGCAGCATG ATATCAAAAC GCTCTGAGCT 420

GCTCGTTCGG CTATGGCGTA GGCCTAgTCC GTAGGCAGGA CTTTTCAAGT CTCGGAAGGT 480

TTCTTCAATC TGCATTCGCT TCGAATAGAT ATTAACmAGT TGTTTGGGTG TTCGAATTTC 540

AACAGgTAAG TTAGTTGCTA GAAyCCATGG cTCCTTtGCC GACGCTGAGT AGAnTTTAGG 600

TGACGGGTGG TGACmAtGnA kTCCgTGTCG AGCGCTGAtT TTtCGGsCTT TAGAGCGA 658
(2) INFORMATION FOR SEQ ID NO: 799: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799: 

TAAAATTAAT ATTTACTAAA TTAAGATTCT TCAATAATTT TTTTGCTTCA TCTACAGTAG 60

AGCATTGGCC CAATACCCAA GGAATAAACT CAAATGGAGA AACATTTTCT TTTCCTTCTT 120 

CAATTTTTTT ATAATCTGCA TAGCCTGAAA AGTTTAATCC AGCCATTCCT AATCCTTTTT 180 

CATTTATTGC ATCATAATAA AGCGGATAAT CAGCAATCCC AGCAGCAATT CCAAyTATTG 240 

CAAAATGATG ATCTAAATTT CCAACTTCTC GAAATGAAAA CTTATAATTT CTCGGCGTAA 300 

TAGTAACCAC CTCATTATAA GAmATTTCAT AATCAAAATT CCTTCCAAAG TAATGATCTT 360 

TTGATACATA AGTAATTGCT GTACACATAA TCAATTCCTC CTATATACTA ATTCAGTCAC 420 

TAAAATAGTA CACCCAATAC TCTTTTTTAT CAAATTTAAC CTATCCACAA AATGGCTAAA 480

CAATATATGA ACATTCAATT TGGTTCTGTT GCAAAGTTTT AAATCTACTA TCAAATAAGG 540

TAGAATAATA GAAAAAGATA GCAGGAGGAA TGACGGTGAA TCATTTTAAA GGAAAGCAAT 600 

TTCAGTAGGA TGTGATTATT GTAGCCGTGG GCTACTATCT TCGTTATAAC CTTAGCTATC 660 

GTGAAGTTCA AGAAATCTTA TATGATCGTG GGCATTAACG TTcTCATACG ACGATTTATC 720 

GTTGGGtGCa AGAATATGGc CAAcTACTCt AtCCAAtTTG GGAAAAAGaA AAATAAACma 780 

TCCtTTTAwT CccTGGGAAA tGGGATGaAc gkACcTCCaA ATTAnA 826
(2) INFORMATION FOR SEQ ID NO: 800: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800: 

AAACCAGATT GGTCTGCTCT GACTCAATCT GGACACAAAA TCATTATTTA TTTAATTAGT 60 

TTTGTTACTT TATTGATTTA TTGGAACAAT CATCATCATT TATTTCAAAG TGTCCGTAAA 120 

ATTGATGGAC GTGTCCTTTG GATGAATAAT TTTCTAATTT TGACGTTGAC GTTTTTCCCG 180 

TTTGTTACTG GTTGGGTGGG TGAACATCCG TTTTCTTGGC CGCCACAAGC TTTGTATGGC 240 

TTAGTAGTAT TmGGmGmTGA TGTTGCTTAT TATCTAGTGG TGCGGGCATT GATTCAGGCA 300 

AATGGGCCCA AATCCGAAGT ACAACGkTTG TTkGGCCACT AkCCAAAATT AACGCTtTCG 360 
ATTGTGTTGA ATATTAnTGC GnTGATTGTG GGTAAAATGG TAGCACCAAT CGC 413
(2) INFORMATION FOR SEQ ID NO: 801: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801: 

TCTTCATTCA CATCAAATGC GCAACTTACA CGAATACTGT TGCTTTGATG GAATTTGTAC 60 

TTTAATAAAG CACTTCCTAA ATTACCAACC CCGATTAATG CGACATTGGT TAACTCATCT 120 

TCATTCAATG TTTTTGCAAA AAAGTTCATT AAGTTTTCAA CATCATAGCC GTACCCACGT 180 

TTACCTAATT CACCAAAATA TGAAAAGTCA CGACGAATAG TCGCACTGTC TACTTGTACC 240 

GCTTCACTTA ATTCAGTTGA TGAGACTTTA TTTTTTCCTG TATCATGTAA CATTCTTAAG 300 

TAGCGATAAT ATAAAGGAAG GCGTCGCGCC GTTGCTTTTG GAATAACTTG ATCTTTCACA 360 

ATTGGACCTC CAATACTTtC ACTTAAATCk TATCCAtTAA CATAATAGCA GTTT 414 



(2) INFORMATION FOR SEQ ID NO: 802: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 749 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 802: 



GCCAATTTTT ATAGAAAAAA AACAAAGAAG AATAGAATAC TGGCTAATTG CTTTTnTAGG 60

AATTTTATTT TTAACAACAT TTATTAATCA TACGTATGCT TTTTCAGAGA ACATTAAGTT 120

GATCATTTGG CAATGTATCT TTTTCTTTTC CGTTTACGAA GTAGGGCGTT CGAATGATAA 180

ATCTGTTTTT AAAGCTTTTG AGTATGTACT TTTAATTGTT TGGACCGCAT TAGTCATCGT 240

TGGGTTATAC TTATTTTTTG CGCGAATCAG CTTTTCAAAA CCAGTAGAGT CTTTGTATTA 300

CGGAATGAGA ATCGGTTTTT TTGAAAATCG ATTATATGGT GTTTTgTGGA GCCTAACTAT 360

GCATGTACCA TCTCATTAGk TtGTATTTTG GTTGGCGTGC GGAATTTtAT TAATACCAAT 420

AGCAAATGGT tAAAAGTGGT TtGTGCAATG GTGATTTTtC yACAATTtAG TTATGTAGCA 480

TTGTCTGGAT CAAGATCAGG AGTTATTCAG TTGATTTCAA TGGTAATTTT CGGGATGTTC 540

TTTACAATTT GGTTCTTTCa AAAAAATACA TCCAAAGCAG TTATAAAAAA AGTTGTGCAA 600

GCAGTATTAG TTTCTTTTTT CTGTGGTGCT ATTGTTTTCG GGGGCCTTAC AGTTAATAGA 660

AAAAGGGTAT ATTGCTACTG CAAATTCAGT GAATATGGAA GTACCAAAAG TTTTAGGATA 720

ATAAAGGAAA ATGGAATTCn AGGGAAATA 749



(2) INFORMATION FOR SEQ ID NO: 803:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 624 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803: 

AAGGCTTCGT AGGTTGTATC AATCAGCTTG GCACCAGCAT CCACAATCAT GGTGGTACCG 60 

GCTACTTTGA TTTCTTTCAT TTCTGTTAGC ATAATCACGA CATCGCGAAT ACCGCCATCA 120 

CGAACAATCA AGTTGCTAGC ATTGCCTAAT ACTAACCATG AAAGTCCTTG TTCTCGGCAA 180 

TACGCCACAA TCTCTTCAAC TTCTTTTTTT GTTTTGGGTA AGGCCAACAC ATCAGCTGGG 240 

CCGCCTGTTT TGGTAAAGGT TACGTTTTTT AATGGTTCAT CTACTAATAA AGTAATTTCG 300 

TTTAATGTTT CTAACATAGC TTTAGTGTTC ACGAAAGTTA GTCCTTTCTT TTCAAAATAT 360 

CTGTCTTATT CTAGCATGAT AGAAAGAAAG TTGCTAGTCC TCTTTAAAGA ACATGCCGTT 420 

GTTGTTGGCG TACATTTCTT CACATAACGC ATGACTTTCA TAGACTTTTT CTTGCTTTAA 480

CTCTTGCTCT TGCGGTTTTT GAATGGCCTG AATGAAAGCA GTAACCATTT GGTAAAAACC 540 

TCTTTTTTCC AATGTTGGCG TCCAATCTCC CATCTCCTTT ACCTGGTATC CTCTTCCGGT 600 

TGGAGGGCAA ATGCGTAAAT TTTC 624 
(2) INFORMATION FOR SEQ ID NO: 804: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 804: 

TTTTTAGTCA TTTACTCCCT CCTTTTCGAG CAGTACTTAC CTTACTGTAG CATAGTTTTT 60 

GATTGGCGCC TATCAGCAAT TTATCTCCAG ATAATAGAAA AGCCCCCCAG CGAAAGTGrA 120 

TGGCTTTTCT ATTGTTAATT CTTTCTGATT TgTAGATAAC TTACACCTAA AACGGCAAAT 180 

GTACTTGCCA GTATTCCTGC TATTAGCATA AACGGTGTTT TTGTTGTGCC TGTTTTGGGA 240

AGTTCTCGTT GCCCGTTTGC TTCATTTAAT TTTTGAATTG GTTCACTAAG CAGTTGGGGA 300 

GCCAACTGTG GGCTTCCTTC ATTTTTCAAT GTATTTAATG GAGCGTTTAG TATTTGTGGT 360 

TGATTTTTTG ATGTAACATT TTTATTTTTC TCCACAGTTG TTTCTGTGGA TGGCTGTTTT 420

TTCTCAGTTG TAGTAATTTC TTTTGTTGGT GAGACATGAA CTTCTGGCTG GCTGCTTTCT 480

GCCGTTcCGT TATACCATTC TCTTCAGTgT kACGCTTGGt TnTCAGGtAt GATTGGTGCA 540

ATCGTGCCTA GTGTTCACTT CCATCTGGAT 570

(2) INFORMATION FOR SEQ ID NO: 805: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805:

TGCATTTTGC CTGGGACAAT CCAGACGAGG ACTTGACCGG GTATTTCCGG CGCTTTCTGG 60

AACTGACCGC CATTAAGAGn GACCGAAAGC GGCGCGTCTA CGTCCTGACC AATTACGGCA 120

GCACCCATGA GCAGGACTTG TACCGGGTGG AAACCCTGCG CGGCATGrGr TATGACCCCT 180

ATGTGATGGT CTACGACCGC CCGGCCGCAC CGAAAATCAC CCGCCAGCTT CAGCGCTGGG 240

TGAATAATAA GCGGATTTTT TACACGGTCA AAAACTTTGC CGATTACATT CCAAACAGAA 300

CAGGAGGGAT AACGTGAAAA TCGACCGATT AAAAACCGAC CTGGCAAAAG CCAGGGAAAA 360

AGCCGCCGAG TGGCAGGCCC GCGTGCGGGA CTTGGAGAAG CAGATCACCG AGCAGGAAAA 420

CCTTGAAATC TTGCAGGTGG TACGCAGCGT AnGGCTTTCG CCG 463

(2) INFORMATION FOR SEQ ID NO: 806:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 806:

AAGnATTnCC GGACGCCGAG AAATTAATTT TAGAAGATGC GGGTGTAATA CCGTTACTTC 60

AAATTGGAAA TGCTAAATTA AGAAATCAAA AAATTTCTGA AATGAAAGTA CACTCTATTG 120

GTGCTAAGTA TGATTATAAA ACCATGGAGA TAAAATAAAA AACCTCTCTT AAAAGAGGTT 180

TTTTTATTTG TCAGATTTTA AATAGCCAAC ACCTTCAAAA GTATCAGCAA TTATATTTAA 240

AGAAGAGGGA ACTTCAAAAT TTTCTTTTCT AGCTATTTCT GTTACTTCTT TAACTAACTC 300

TTTATAAGTA TCATTTTCAC CTAATTGAGA GAATAAATTT ACGATGTTTA CAGCTTTCAA 360

ATATTCCTCC ATATTTCGTT TTTTCAAGAA ATTTCTGAGA TGTTTTAAAT AAATCATttC 420

TAAATTAATA gTaCCATCTA AAATAAAAGA AGGGACTTTT CGAACAATAT CAAATTGTTC 480

AAGGAAGTAA TCAGCTTTTT CAAAATCCTT TTTTTCAAGA AAGACAGTAA CCGCATTCTT 540

GATACATAAC TGAACTGCAT GGTCATGGAT ATCTCCATAA GATTtGGTTA ATtGGGAAAg 600

AGTTTTTCCT G 611

(2) INFORMATION FOR SEQ ID NO: 807: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807:

AAATGAAGAT AAATTTACTC ATTGGACCCC TAATGTTTAT AGTTATGGTG CATATGCTGA 60

TGATAATAGA ACTATTGTTA AAGGCCACAA CGAAAGAAAC TTGCAACAAA TCAATACTTT 120 

TGTAATCGAT TTTGATCGCT CACTAGGTGA AATATTAGAT TCTCAAATGA TTCTCGATGC 180 

TGCAATTGAT CTGGAACTGA TGCCAACGCT AATTTTAGAA ACTCCAGGGG GATTTCAAGC 240

GTATTTCATT CTTGAAAATG CCTGGTACAT TTCTTCAAAA AATAATTATC AATCAATCGA 300 

AGTGGCCAAG AGAGTATCTG AAAATTTAAG AAAAGCATTT GCTGAAGTAT TGCCTTCGGT 360 

CGATTTGGGG TGCAACaTTT TGGTATTGCA CGTATTCCTA GAACAGACAA TGTAGTCTAT 420

TACTACCCAG CACTGACACA TGATATGCAA CAACTAaTCC AGTGGTCCAT GAAATTTGAG 480 

GCCAAAAAGA ACACCTnAAA AAACCGGAAT CTTAATGTTG GTTGCnGGTA AAGATGGAAT 540 

CrlGGATTTAA AG 552 



(2) INFORMATION FOR SEQ ID NO: 808: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1217 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 808:

GGCAGTAGAT ATTTTAACAT CTGCAAAAGT CAACGAAACT TCTAATATAG TAGAAGCAAA 60

AATTGCGGAC GAACAAGTAA CTGGCAAATT AACCGCAACA AAAACCGTTA ATAACGCCAA 120

ACCAAAACTA GGGGAAGAAA TCGAATACAC CATCAGCTTC CGCAACACTA TCGAAAACGG 180

CATCCTAAAC AAAGTAGTTA TCACAGACCA ACTGCCAAAA GGACTGACAT ATGTCAAAGA 240

TAGCCTAACA AGTGTTGGTG ATGAACCAAA GCCAACCAGC TTGAAAGAAA CCAACGGCAC 300

AATCACAGCG GAATACCCAA GTATCACCGA TATGAAAGAA CGGAGTATTC GCTTCAAAGT 360

GATCGTTAAC GAAGAAGCCA AAGCGGGCGA AACAATTCTC AACAAAGCCA AAGTAGATGA 420

CACAGTTAAT CCACCAGAAG AACCAGAGGT GCCTGTGGTA CCAGAAACGA ACGCAGGCAA 480

ACTAGCCGCA ACAAAAACCG TTAATAACGC yAAACCAAAA CTAGGCGAAA CTATCGAGTA 540

CACAATTAGC TTCCGCAAyA CCATCGAAAA CGGCGTCCTA AACAAAGTAG TTATCACAGA 600

CCAACTACCA AAAGGACTGA CGTATGTCAA AGATAGCCTA ACAAGTGTCG GTGATGAACC 660

AAAACCAACC AGCTTGAAAG AAGCCAACGG CACAATCACA GCGGAATACC CAAGTATCAC 720

CGATACGAAA GAACGGAGTA TTCGTTTCAA AGTGATCGTT AACGATGAAG CGAAAGCGGg 780

CGAAACAATT CTCAACAAAG CCAAAGTAGG TGATGGCATC AATCCACCAG AAGAACCAGA 840

AGTGCCCATT ACGCCTGAAG AACCAGCGAA AAATaAAAAG GAAACAAACA AAGTTGTTAC 900

AGACCAGAAC AAACCTACAA AAAATAGTAA AAATGAAATT GCCATAAACA AGAAAGAAAC 960

GTCTAAATCA TCCTATTTAC CAAAAACAGG AGAAAAAGTT CAAAAAATAT TTGCGTATCT 1020

GGGTGTAGGA CTGATCTTAA TTGTATTAAT TCTCTATGTT ATAAAGAGAA ACAAAGAAAA 1080

AGAGGAGTAA ACAAAATCGA TTGTTTTAGA ACGACTAATT AGCTGAAAAT TTAnGnGTTG 1140 

ACGACATAAG TCAAAATGAC CTATGTCGTC AACTCTTTTT GGATTTATAA CAAGGAAATC 1200 

CAACATATTG TGTGTGC 1217 
(2) INFORMATION FOR SEQ ID NO: 809: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809: 

GCGGCGAGCG AAcGGaAGAG CCCAAnCCAA CAAGCTTGCT TGTTGGGGTT GTAGGACTCC 60 

AATATGGTAG TCTGTTAGTA TAGTTGAAGG ATTTGGAAAA TTCCGCTAAA GAGGGTGAAA 120 

GCCCCGTAGA CGAAATGCTG ACAACACCTA GGAGGATCCT GAGTACGGCG GAACACGAGA 180 

AATTCCGTCG GAATCCGCGG GGACCATCCC GCAAGGCTAA AtACTcCCTA GTGACCGATA 240 

gTGaACCAGT ACCGTGAGGG AAAgGTGAAA AgCACCCCGG AAGGGGAGtG AAATAGATCC 300 

TGAAACCGTG TGCCTACAAC AAGTCAAAGC TCGTTAATGA GTGATGGCGT GCCTTTTGTA 360 

GAATGAACCG GCGAGTTACG ATTGCATGCG AGGTTAAGTC GAAGAGACGG AGCCGCAGCG 420

AAAGCGAGTC TGAATAGGGC GAATGAGTAT GTAGTCGTAG ACCCGAAACC ATGTGATCTA 480

CCCATGTCCA GGTTGAAGGT GCGGTAAAAC GCACTGGAGG ACCGAACCCA CGTACGTTGA 540

AAAGTGCGGG GATGAGGTGT GGGTAGCGGA GAAATTCCAA ACGAACTTGG AGATAGCTGG 600 

TTCTCTCCGA AATAGCTTTA GGGCTAGCCT CGGAATTGAG AATGATGGAG GTAGAGCACT 660 

GTTTGGACTA GGGGCCCATC TCGGGTTACC GAATTCAGAT AAACTCCGAA TGCCATTCAT 720

TTATATCCGG GAGTCAGACT GCGAGTGATA AGATCCGTAG TCGAAAGGGA AACAGCCCAG 780 

ACCACCAGCT AAGGTCCCAA AATATATGTT AAGTGGAAAA GGATGTGGGG TTGCACAGAC 840 

AACTAGGATG TTGGCTTAGA AGCAGCCACC ATTTAAAGAG TGCGTAATAG CTCACTAGTC 900 

GAGTGACCCT GCGCCGAAAA TGTACCGGGG CTAAACATAT TACCGAAGCT GTGGACTACA 960 

CCATTAGGTG TAGTGGTAGG AGAGCGTTCT AAGGGCGTTG AAGGTCGATC GTGAGGACGG 1020 

CTGGAGCGCT TAGAAGTGAG AATGCCGGTA TGAGTAGCGA AAGACAGGTG AGAATCCTGT 1080 

CCACCGTATG ACTAAGGTTT CCTGGGGAAG GCTCGTCCGC CCAGGGTTAG TCGGGACCTA 1140

AGCCGAGGCC GATAGGCGTA GGCGATGGAC AACAGGTTGA TATTCCTGTA CCAGTTGTTT 1200 

TTGTTTGAGC AATGGAGGGA CGCAGTAGGC TAAGGAATGC ATGCGATTGG AAGTGCATGT 1260 

CCAAGCAATG AGTCTTGAGT AGAGTTAAAT GCTTTACTCT TTAAGGACAA GTTGTGACGG 1320 

GGAGCGAAAT AATAGTAGCG AAGTTCCTGA TGTCACACTG CCAAGAAAAG CTTCTAGTGA 1380 

GAAAACAACT GCCCGTACCG TAAACCGACA CAGGTAGTCG AGGAGAGTAT CCTAAGGTGA 1440

GCGAGCGAAC TCTCGTTAAG GAACTCGGCA AAATGACCCC GTAACTTCGG GAGAAGGGGT 1500

GCTGACTTCG GTCAGCCGCA GTGAATAGGC CCAAGCGACT GTTTATCAAA AACACAGGTC 1560 

TCTGCAAAAT CGTAAGATGA AGTATAGGGG CTGACGCCTG CCCGGTGCTG GAAGGTTAAG 1620 

AGGATGGGTT AGCTTCGGCG AAgcTCAGAA TTGAAGCCCC AGTAAACGGC GGGCGTAACT 1680 

ATAACGGTCC TAAGsTAGCG AAATTCCTTG TCGGGTAAGT TCCGACCCGC ACGAAAGGCG 1740

TAACGATTTG GGCACTGTCT CAACGAGAGA CTCGGTGAAA TTTTAGTACC TGTGAAGATG 1800 

CAGGTTACCC GCGACAGGAC GGAAAGACCC CATGGAGCTT TACTGTAGTT TGATATTGAG 1860 

TGTTTGTACC ACATGTACAG GATAGGTAGG AGCCGATGAG ACCGGAACGC TAGTTTCGGA 1920 

GGAGGCGCTG GTGGGATACT ACCCTTGTGT TATGAACCCT CTAACCCGCA CCACTAATCG 1980 

TGGTGGGAGA CAGTGTCAGA TGGGCAGTTT GACTGGGGCG GTCGCCTCCT AAAAGGTAAC 2040

GGAGGCGCCC AAAGGTTCCC TCAGAATGGT TGGAAATCAT TCGAAGAGTG TAAAGGCAGA 2100 

AGGGAGCTTG ACTGCGAGAC CTACAAGTCG AGCAGGGACG AAAgTCGGGG CTTAGTGATC 2160 

CGGTGGTTCC GCATGGAAGG GCCATCGCTC AACGGATAAA AGCTACCCTG GGGATAACAG 2220 

GCTTATCTCC CCCAAGAGTC CACATCGACG GGGAGGTTTG GCACCTCGAT GTCGGCTCGT 2280 

CGCATCCTGG GGCTGTAGTC GGTCCCAAGG GTTGGGCTGT TCGCCCATTA AAGCGGCACG 2340

mgAGCTGGGT TCAGAACGTC GTGAGACAGT TCGGTCCCTA TCCGTCGCGG GCGTTGGAAA 2400

TTTGA 2405 
(2) INFORMATION FOR SEQ ID NO: 810: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 537 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810: 

AGAATATGCT AAAGTATTAA GAAAGAATAG TCTGGAGTTG ATAAGAATGA ACTTTAAAGA 60 

AAAAATTACA GAAATGGTTG AAAATGTTTA CGATTTAGCA CGCGATGGTA AGTATGATGT 120 

CAAAGTTGAC TTAATTCCTC GTAAAAAAGG CAACATCAAA AGTGGTGTTA AAGTCACTTC 180 

TTCAAGAAAT ACAGCCCACA ATATCAAATG GCGTTCAAAA AACTAAJcTCG TCTTTAACAG 240 

AAACACCGGA GCAGACAAAA TTATTGTCTA CTCCGGTGTT TTTATTTTTC TAATTCGATA 300 

CCTAAGACAG TTAAATGCCC ATTACCAACT AATTTGATAA CTAATTGGGC TAAGTGGTTG 360 

GGTGTCATAT CTGTTTCGCT nGTGGTCCAC CAATGAAGGG TACCGATGAA GATAGAGGGC 420

ATGTATTCAA TnACAAAATC nTATCGGnAC ATCAATAATC ATTTTCGGTA ATACGCAACT 480

GCGGAAAAAA ATCGTCGCCA TATTTTCCTT CGAnAATTTC AGCTAAGCGT TTTCGAA 537 
(2) INFORMATION FOR SEQ ID NO: 811:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811: 
GCTTTTACTT TTAAACTGAG CGGTACTATA ATTTCTGCTG AATTTGTTAA ACTTTCTGCA 60 

CTAGTAATTT GTGGGGATTG TAGTGAAACT AAAGCTGCTG TAGATAGTAG CAAACCGCCA 120 

AGAATTATCT TTGATTTCAT AATATCATTC TCCTTAAATT CTATTTATTA AGCTCAAAAA 180 

AATAGTTTTT ATCTTAACAT TCTAGCTACT TTAATTCTTA ATACTACAAA AGGAAAACAT 240 

TGTTATATTG AATAAAAACA TGTCCCTATC TCCTTTCTAA TTAGttTCCA CAAAACGTAA 300 

TGATTACGTT TTGTGGAAAC TAATTTTACA AATTTTTTTA AGAATGTCAA TAAAAATGAA 360 

GGTTCATTTT ATTTAAGTGt CAAAAATAGA TTAATATAAA ATTTAACTCG ATTATTTTTC 420 

AGATAGCTAA AAAGCAATTT CATCTTTATT AGCTAAAAAG TAGTCAAGTC CAGACTCCTG 480

TGTAAAATGC TATACAATGT TTTTACCATT TCTACTTATC AAAATTGATG TATTTTCTTG 540 

AAGAATAAAT CCATTCATCA TGTAGGTCCA TAAGAACGGC TCCAATTAAG CGATTGGCTG 600 

ATGTTTGATT GGGGAAGATG CGAATAATCT TTTCTCTTCT GCGTACTTCT TGATTCAGTC 660 

GTTCAATTAG ATTGGTACTC TTTAGTCGAT TGTGGGAATT TCCTTGTACG GTATATTGAA 720 

AGGCGTCTTC GAATCCATCA TCCAATGATG CGCAGCTTTT GAATATTTGG GTGATCGATA 780 
TAATCATGAA TCAATCGAT 799
(2) INFORMATION FOR SEQ ID NO: 812: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812: 

TTTGCAAGTT AGTTTACTTT TAGGCTTCCT TTTAATCTGG ATTCATTCAT TTATGGAAGA 60 

TATTTTATTA GCGCCTCATT GGATGCCGAT AGTTTATAGT TTCCTTGGGC TAGCCTTCTA 120 

CTTCCGGCCA GAGAAAAAAC GAGGAAGACA TGAAAGACCA ACGACACCTA AAAGAAGAAA 180 

ACGAGTGAAA CAAACAAGTC CTGTTAGCAA TGAAGAGCGC CCGATAGCAC CTGTTGACGA 240 

AGAGGGCTGG GaCCAACCTG AAGAATTAAG TCGTGtCCAA AGACATCGCm GATAATTTTc 300 

ATCCTATAGA ACTTTCTTGT CCGCTGACAT TATGGkrATA TAgGGGTTGC ATGTTATTAC 360 

AAATAATGTA ATTACTGGAA GTGAAGAGAG GAAGTTTTTA ATGACATCAA AAAGTGAATG 420

GAGTAATGCA AGACGGATTG TAATTATTTT AGCAGACGTT TTGCTCTATA ATTTATCAAT 480
(2) INFORMATION FOR SEQ ID NO: 813:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 698 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813: 

TTAAGCGATA AATCAGCTGA AGTCCTTTTA GATTTAGATG CTAAGAGCTT CCCAATGAAC 60 

GGCAAAACAG TTCGCGTTGC CCAAGTAAAT ACCGTCGATT TAGCAGAAGT TTTAGATCGT 120 

CAAGCTGAAT TAGAAGCAGC TATGGCCGCA GAAAATGCAG CCAATAATTA TGACTTGTTT 180 

GTCTTAATTA TTACGAATAT CCTAGACAGT GATTCTGAAT TATTAGCGAT TGGTGCGGAA 240 

CAAGCCAAAA TTGAAGCAGC GTTCAATGTA ACATTAGTTA ACAATCGTGC GTTCTTACCC 300 

GGCGTTGTTT CTCGTAAAAA ACAAGTAGTA CCACAATTAA CAGAAGTCTT TAACTAAAAC 360 

CGTArGgscC TAAGcAACGC GATTAAATTC CTTGTTGCTT AGGCGCTTCA ATAGTAAAGG 420

GGCGTTAACA TGAATGATTT AGCAATTTAC TATCATTCTC CGCAACAAAC GAAACAATAC 480

AATCATTTGC TCAACCAATC ACTTTTTTTC CCTCTTGTTA CTTTTATGTA TGAACATCGG 540 

GAAGAAAGAA TCATTTTAAG ACAATTAAAA GCGGCCTTTG CAACAGAAGC CAAGTTAGAA 600 
CAATTTTTAT CAGAAAGGAT TGATTGTCAG TTAATCATnG GGAAAATCGT CAATATCGTT 660

TGAATTTTCC TATTATACGG CAGCTGAAGT GCGAGTTG 698

(2) INFORMATION FOR SEQ ID NO: 814:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 756 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814:

CTTAATCGTT TTTCTTTTAT AGGATATCGC AnTAATCGTA AnACATAATT CATnnGAAAG 60

GATTAACCCA TGACCTTAAG AAAATCAAGT TTAATTATTT TAGTCCCATA CATTACGCTT 120

TCTTCGGGCC GTTACTTTTT TCGGGTATTT CCCCCAATGC AGCGATTAAT GTGACAACTG 180

CTCTCTACAT TATTGGCGCC CTTGTAATGA TTGGCATCTT TTTAAAAACA ACAGAACCTT 240

CTCCTTTAGA GGAAACAGCA ACCTTGAAAT CACCTATTTT TATTTTTTTA CTAGGTGTTA 300

GTGGGATTTT CATTGCCATG CTGATTCAAG GGGTTACTTT TGCCATTGAA GTAGCCATCA 360

CAGGAgAACA AGCTACTTCT CAAAATACAC AAGCTATTGT AGCGGTGATT TTGGCAAATC 420

CTTTATTTAT TTTGGCAACT ACTATCGGTG GTCCGATTAT GGAArAAwTC GTCTTTCgTT 480

ATGCATTTAT CCATTTaATC CAsCCTTTtA CAAATTTCyG GawTGcGGsT ACTGtTAnTT 540

CGGCGATTTT TTCACTCGCA CATGCCGATG GTCACTTTTT TGTTTATTTC TTTATGGGAT 600

TTTTCTTTGC TCTTTTGTAC AAGCAAACTG GAAAAATTTG GACtCAATTA TTGCCCATTG 660

CGGCATGAAT ACGATTGTGA TTATTGTTCA ACTACTTTTG CATAATGGTA CAATTCAATA 720

AGAAAAGGGC CTnTCGTCTA AACGACTAGG CCCTTG 756 
(2) INFORMATION FOR SEQ ID NO: 815: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815: 

GATACGCTGC AAGCGAcrGT TTCaAGTGCC yGcGTTTTCT GGTTGTTCCA AGTCGTTAAA 60 

CCTTGGACTT CTGCTAACGT TAACGAAGGC TTTGCTTGAG GTCGACTAAA AATTAAGTCC 120 

ATGACATATA AAAGTTCTTT TCGTCGGGCA ACGATTTGTA TCTCATCCGT TGGCAGTTTT 180 

AGCATTTGCC ATTGCTCAAA AATATCATTT GCGGCTTGGG CAACTTCTGA CGCATTCGGT 240 

CCCTCTAGGC GAAATGTCTC AACTGGCCAA TTCACCATTC CTGCGTTGAT TAAAGGATAG 300 

TTAGCTAACT CAACATATTC TGAAATGCTC GCTTCGGCTA ATGGAAATTG TTGCAAACCC 360 

GTTTGATAAT AGACATGCTC ATTTTGTTCA AAGTCAGCAT CAATCCCAGC AAAATAATGA 420

GGAAAGACCT CAACAATTTT CAACAATTGC TGAAAAGTCT TAATTGAACG TTTCGCTGAC 480 

TGATGTTTTT CAAACGCAAT CACACCTTGT TGCTGATATT GTGCCTGTTT TACAAAGGAA 540 

AACCCCCAGC TTTCGCCATC TAAGTTCATT CGAATCACTC GATGGTTCGT AGTCGCAGGA 600 

AATTGCTGAC TTCCCTGGTA GCCTTCTGTT GCCATGCACC ATTCACAGCG GGGATAGCTT 660 

TGTGCAGAGA GCGTTGCTTT TGACGCCTCA CTGTGGACCT TATTTGCTAA AAAATCTCCA 720 

TAGACTGTTG AAAAAACAAC AGGTTCTTCC TGTTCAATGA CCGTACCGTT TTTTTGACAA 780 

AGTTGGTAAA AATACTCGGT AGCCTCTTGT GGCTCTTTGG CATAATGTTG TGCAAAAAAA 840

GCATTAACCA CAGAAGGTGG CGGTGTTAAC AAATCCATCA ATTGCACCAT GAACTGTTCT 900 

TTTTGTTGCT CTGTTTTTAC AAGTTGGTTG GCACTCGCAA CTTGACACAA TTGTTCCGCC 960 

AAATCAGCGG CGGGTGTTGC CACAGGCCGT ATGTCTACTT CGCCTAATTC TTGTTCACCA 1020 

ATCATTGATA ATAAACGGTT CTGAAGATAT AAGCGATCCA ATTCCATCCA ACCACCTGCT 1080 

TGAATAGCAA GTGTGGTAAA ATCGGCAATC ATTTGACTCG TGGTCATCGA TTCATTCCTC 1140 

CTATTACTTT TCTTGATATC CATGTGGATG AGCTTGATGC CATTGCCAAG CGGTGGCGAT 1200 

AATATCCTTG ACCTCAGTTA CTTCTGGTTG CCAGCCTAAA ACTCGTTTGG CTTTTTCACT 1260

AGAAGCAATT AACGTACTAG GATCCCCTGC TCGGCGTGGG GCGATTGTTG CAGGAATTTC 1320

TTGGCCTGTC ACTTCGCGAG CAGCATCTAA CATTTCTTTA ACAGAATAGC CGTTGTTGCT 1380

ACCCAGGTTA AAGACGTCAC TTTCGCCACC ATTTTTCAAG TATTCTAAAG CCAAAATATG 1440

TGCTGCAATC AAGTCTTCAA TGTAAACATA ATCTCGAATG CACGTGCCAT CTGGTGTATC 1500 

ATAATCATCC CCAAAAATAC TTAGCTCTGC TCGTTGGCCT AATGCCACTT GTAAAATAAT 1560

TGGCACAATA TGCGTnTCTG GCGTGTGATC CTCACCAATT GA 1602
(2) INFORMATION FOR SEQ ID NO: 816: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816: 

ATTTCCTTTG GCGTGGCTTC TCTCGTCATT TCAAAAATAG ATGCATTCTT TTTTATCTGC 60 

ACAGACATTA CAACACGCAT TGAAACAGTT GAGTGCTTCG GATGTGGATA TTCATGTGTT 120 

AACGATACCT ACCAATAGCT TTTCATGTTC TCTCTCCTAA TTGTTTTTAT GTATATTTGT 180 

TTATTGTAAG ACAAACACTT TAATCCCTGT TATACTCATT GAAAGACAGC AACTCCTTTT 240

TGCTTCATGT AACACTTCCA GTTATTTCAA AACATAATCT GCTGTCTGGC CACTAGATAT 300 

TTTATCTGGT GGTTTTTCAT GCGAAAACAA TCCAAATATC CGACAAAACT TGACAGCTAt 360 

GTkACACTTG ttTTAGGkAG CACTCTkTCA TAATAGCkAA AGTTCATAAA CTACAAGTGA 420 

CACGAGATTT TCACTAACGC TACCTAGCCA CTAGATCCCA TTTCTAGTGG CTTTTTTTGC 480

ACAAAAAATA AGTTTTGATG TGAATAAACG CTCTTTTATT CTTTATTCTT ATTTGATATA 540

ATTTTTTAGG TAGCAACTCC TTTTTGTAAA TAGCAACCAA CAAAAATTTT GCACGAATGC 600 

TACCTAGCCA CTAGATCCCA TAGTCTAGTG GTTTtTTATG TACGArAAAA rGmCCACTTA 660 

CTGAGTGAnC TTwATAyCGm CGTGsCAGGA GTkGAtCtGC AgCCATkGAT TAAAAAwTAm 720 

CkGTyCTTAT TTATTAAkGA mCCTTTyCTT TTTTGGTTCA GwTTCAAAAA CTATTTCTAA 780 

mCTCAAAwTA wTAGTCTGTA AAGACmACgC CATAAmCAGC CCTGAGAAAA ATGTCCATAT 840

ATAAAATATT AGGATTGAAG CAAAGGTCTC TTTATACTGA AATATTTGGA GCATCATTGA 900 

TAGTATCAAA GTAGTAAATG CACTTAATAA AGATATAAAG AGTTGATATT TTAATTTTTT 960 

TCTATAGCAG ATCCTCGTAT ATTCTTCATA AAACTAGTAT CTTTTATA 1008 
(2) INFORMATION FOR SEQ ID NO: 817: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817: 

GTCATAGCTG TTTCCTGTGT GAAATTGTTA TCCGCTCACA ATTCCACACA ACATACGAGC 60 

CGGAAGCATA AAGTGTAAAG CCTGGGGTGC CTAATGAGTG AGCTAACTCA CATTAATTGC 120 

GTTGCGCTCA CTGCCCGCTT TCCAGTCGGG AAACCTGTCG TGCCAGCTGC ATTAATGAAT 180 

CGGCCAACGC GCGGGGAGAG GCGGTTTGCG TATTGGGCGC TCTTCCGCTT CCTCGCTCAC 240

TGACTCGCnG CGCTCGGTCG TTCGGCTGCG GCGAGCGGTA TCAGCTCACT CAAAGGCGGT 300

AATACGGTTA TCCACAGAAT CAGGGGATAA CGCAGGAAAG AACATGTGAG CAAAAGGCCA 360 

GCAAAAGGCC AGGAACCGTA AAAAgGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC 420

CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC CGACAGGACT 480

ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG cGCTCTCCTG TTCCGACCCT 540 

GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA AGCGTGGCGC TTTCTCATAG 600 

CTCACGCTGT AGGnnTCTCA GTTCGGTGTA GGTCGTTnGC TCCAA 645 
(2) INFORMATION FOR SEQ ID NO: 818: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818: 

ATGCAAGCCA GTGGATTAAA ATATACTAAA AAACGAGAAT TACTGATGAG CTATTTGATT 60

AAGCGAAATC nGTATGTTTC TGCACGTGAA GTGTATGAAT TTATGAATGA AACCTTTAAA 120 

GGAGTCAGTT ATGATACGGT TTATCGTAAC TTGCATGACT TTGAGCGCTT AGAATTACTG 180 

GAAAAAACAG AATTAAATGG CGAGCAAAAG TTTCGCTTCC GTTGTTGCCA AGAGGTGGAA 240 

CATCATCATC ACTTTATTTG TACGGTTTGT GGCAAAACGG AAGAAATTCA CATGTGTCCG 300

ATGAATTTTT TTGAAGAACA GTTAAAAGGT TGTTCAATTG AAGGACATCG CTTTGAAATC 360 

CTTGGTCGTT GTGCCGATTG TTGCGAAAAA TAGCGAATAA GACGAAAAAT CAAACAAAAA 420

TTTCGTATTT TATCAGTAGA GTGGTTGACG GTTATTTATG CTTTACTATA ATATAACAAG 480 

GACAGTATTT TAATTTGTGG GTTTTTCCCA CAAATAAGAA AAACGGTTAC TTTTTAAGCT 540

CGGAGGGGAG GGAATAACAT GTCCAAAAAC AGTCGTTCGT AAAAACGAAT CnCCTGGATG 600 

AnGGCTCTTC GTCGCTTCAA ACGTTCCGTT TCCAAAGCnG GGTACTTTTA CCAAGAATCC 660 

ACGTAAACGG GAATTTCCAC GAAAAAC 687 
(2) INFORMATION FOR SEQ ID NO: 819: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819: 

TnGTAnGCGT TCTTTTTTTA AGAATACCAA AACATTATTT AAAATAAAGC AATTTTTATT 60 

TTTAGTAGTA AGATAGTAAA GTAGTAAGAT AGTAAAGTAG TAAGATAGTA AAGTAGTAAA 120 

ATAGTAGAAT AGTAAAGTAG TAGAATAGTA AAATAGTAAG ATAGTAAAAA ATGTTTATTT 180

TTTAATAAAT ACGTCTAAAA ATCCTTTTGT AGAAAGGTTA ATATCATTCT TTTAAAACAA 240

TAAGATAGTA AAATAGTAAG ATAGTAAAGT AGTAAGATAG TAAAATAGTA AAGTAGTAGA 300

ATAGTAAAGT AGTAGGATAG TAAGATAGTA AAATAGTAAA TTTTATAAAT ATnATTTGTA 360

GTAAGAnAAT 370

(2) INFORMATION FOR SEQ ID NO: 820: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820: 

AAAGTAAAAA TTGGCTTAAA TATGTTGTTA ATGATTATCG TTTTAGGCGT AATCTTTTAT 60 

GTCATGGATA ATTCATTAAG TGATATTTTT GCTCAACTGA TGGAAACGAG TTGGCTGGTG 120 

CTGATTGCTG TTATTTTTTT TGGAGTAGTT TATCAATTTG CAGAAGGGCG CTCAATCAAG 180 

GAAATAGCTC GTTATTTTAA CAAAGACTTT ACGACAGTGG ATGGTtTTTT TACTTCTTGT 240 

TATGTAGCAT TTaTCGGATT AATTCATTGG CACGGGGAAC CTTGCCTTTC AGAAATTTAA 300 

TTTTTAATAG GAAAAAGGGG AATTCCCGGT TTCC 334 
(2) INFORMATION FOR SEQ ID NO: 821: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 493 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821: 

CTTCAGACTC ACACCCATTC TACACTGGTC GTCAAAAATT nTTGACAAGC CGACGGACGT 60 

GTGGACCGTT TCAACAAAAA ATACGGTCTC AAAGACGAAA ACGCAAATCC AGACGCATAG 120 

TCTTTCAAAA CATTGTTATG ACAAGGTTTA TAGAGGTTTT CCACTGTGGT GGGAAACCTC 180 

TTTTTTTGTT GGATAGGCAG TTTTAAATTC AGAGAAATTA TTTAAATATT ATTTTATTTA 240 

ACTGAAATAT AGCTAGTCCG TATACCGTAt AATTTAAATT ATTTGGATAG TATACATATk 300

GTAAAGTGCA ATTGTCTTTT TAGAATCTTT AAGGTATAAT TTGTTTTAAT GAGTTTATAT 360 

GTAAGCTTTt tCtcTTTAAA GmCTTtGAyC AtAyCTTTTT tATAagaGAA aCTATgTTtG 420

TGAAaaGGaT ATAAAAGTGn TAAAACAATT GTACTTAACA GTATTAAGAT GGGAGACCGT 480

ACAAGGCATA AAA 493 
(2) INFORMATION FOR SEQ ID NO: 822: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 493 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822:

TTTCAATTCC CTCTTCATAT ATCGAAATAA GATAACTATA ACCTTCCAAT AAATAATTAA 60

TTTGTTCATC AATAGTTTTA TTCCAAGAAC CTTTCTCTTC TTTTAATTTA GAATTTATAA 120

AAAACTCTTC TAGAGTTGAA TACGGAGATC CGTCAAGCGT AATCAGTCGT TCTGAAATTA 180

CATCCAACTG ATCATTAATT TGCTCCATAA GATCATCCAT TTTTGGATGT AGCGTTAGAA 240

ATCTTCCTCC ACGCATATAC CAATGAACTT GATGAATTCT TACTGAAAAT ACGCTTAAAT 300

CGGCTACCAA TTGATTTAAT ATTTTTkTTG TTTCACAAAA TTTCATAGTA TCACTCCTAA 360

TTTAGTGTTA TTTAAAAAGA AATCATTCCG ATTTACGTAC AATTTAATAT AATAAATATA 420

TCTTGTCAAA AATAAAATGA ACGTTTATAT TAAAAAAGTG TTTATAAAAA TCTTTACAAC 480

TTATTCATCC TTT 493

(2) INFORMATION FOR SEQ ID NO: 823: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823: 

TAGAACGGAa CTAACAGAAT CATGCAGATT gCAACnTTTG GTCTGATTAA TGAGATGGAA 60

TCAGTGAAAA AATAATGAAt TTTGTTATCT CAAAGCtTCG aTTcTTCAAG TATTATTGAG 120

ATATaTTTAA AAGCTCGTAG ATTTATCTAT AAAcGTTTCT TCaCAGTACc AtCAAAATCG 180

GGAAACCGGG TGGCGACGGT GAATAATGTG CCAATCGGAG ATAGGCATCG GATAGTATCA 240

AATTCACTAC GCATGATGCG TAGCAATTAC AT 272

(2) INFORMATION FOR SEQ ID NO: 824:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 252 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824:

CGCGGCCTAC GAGAACACGA GGTTTTCnTG AACTATCnTC ATGTTGACTC AATACATAAC 60

CACCGTAACG CCCTAATTTA AAAGCTAATT CAGGTGTTAA CTCCTTATTT GCAATTCCTC 120

TTACACCATC TGTTCCAAAA TATTTACCCA TTTTGATTCT CCTTAGTCAC TTGATTCTCT 180

GTATTTTCAG TTGAACCAGC AGTTGTACTT TCTACTGTTG ATCCCTCACT TGTTGTTTCA 240

GAAGTATTTG AT 252

(2) INFORMATION FOR SEQ ID NO: 825:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825:

GACATCTAAG GTGCCTTCAG CTTTTGAATA AAAGGCATTA TTCGGnGATC GTAACCCCAT 60

TTTGTTACTA TCAGTTTTAT AAATGGTATA AGGTCCTTTG CCTTTTAAGG ATAAGTGAAG 120

TTCTTCGCCT GCTTGTTTAT TTTTTAGTGT ATAACTTTCT TTTAGCGGTG TGACTGATTT 180

TTCCGCTTGC TTGTAATAAT AGACGGCGCC AGATCCACCA ATTGCCCAGC AAAAAATTCT 240

TGTTTGGCTT TTGGTAAAAA CAATGTGATA GATAAGAGGA TTTGTTCTTT AAAGATTGAC 300

CAATCTGCGA ATATTTCTTC ATTGGTTAAA ATCATTTCTA TTTTTTCTGC TATTTTTTTC 360

CAGAGGATCA TTCGCCAGTT ATTTTCACCC ACTTCcATAt TCTTTAAACc ATGCTTAATA 420

aCtGrAGrAT AaTGGGGAAC ATTTGGckGA CAtCgCsGCA AATnAAAAAT CCATTA 476

(2) INFORMATION FOR SEQ ID NO: 826: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 826:

TTAATGACAT GTGACAAi TG AATAATTTGT GGGTCGGTTA ATTCTTTGGA ATCACGAACA 60

CCTAATGATT TTAGTTTGCT AATCATGCTT TTGTCAACAT AGGnCAGCAC AGACAGTTAC 120

TGGACCAAAA TAGCTACCAlc TACCGrCTTC ATCGGaACCT ATCACAGACC AGTTGCCAAA 180

ATCAGCAGGC AGTGTAGTAG TCTGGGGAGA AACTTTTTTC TTTGGTGTGG TACTCGTTCC 240

TTCCCAACGA GCGGCCTCTT TTTCAGCTTG ArGTCCCTGA AACATGACTT TACCTGATTG 300

ATAAGCGGTG ATAGTAGTGG TGCCAACCTT GGCCACAAAG ACAGTATAAG GGACCGTTTt 360

ATTTAAACGa TgGGTTGtAA ACAGTGGTCA TCcTGCTAAT GnTTnTACTA ACT 413

(2) INFORMATION FOR SEQ ID NO: 827: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827: 

TTAAAAGATA CAATGTAATG sAGACAGGCA CATTAAAaCC GACAATTTTT GTTGAGAGGG 60

TTGTTAAAAT TTGTGCCAAG CCTGTTAAGC CACTGGAATA AATATTTCCT GGTTGATAGA 120

AAAAGTTCAT GGCAtTGATG CAAGAATAGC ATAGACAATC GACACAGAAA ATTTTGTTGT 180

ATAGTCATGA ACTGGCAGTC CGTCATA 207

(2) INFORMATION FOR SEQ ID NO: 828:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828: 
AACGTTTGGC TAAATCACCT AGAATTGAAT CAGGCACATC CATCCATTGT GTAAAGTACG 60 

TAGAGAGGAC GCCATCATCC AATTTnAnGG GGGCTTATAA nGTGAAGTCC CCTTTAAAGA 120 

ATGGTACAAG CAGAGAGGCT TGTAAATCAT AGTCAAATTC AGGGTTTTCA AATAATTCTT 180 

TCGCCCGATG CAATAAATGA TCAAGAATCA CTTCCATCCC ACGAGAAACA GGGTGAAAAT 240

AGACTTGGAC ATACATTTGG TAACGACTAA CGATATAGTC TTCCACCGCA TGCATCCCAT 300 

TCATAGCAAA GGCAATGCCT CCTTT 325 
(2) INFORMATION FOR SEQ ID NO: 829: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 957 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 829: 
TTGACGGTTn TTTAAGTCTT TACTTCGTAn TTACAACTTC TTCCATTAAC TTATnATTTC 60 

CTCGCTATTC CTATCGTTGT CTAACAATAA GTTGATTCGA TCATATTCTT TGTCTTTATA 120 

CTTCACTGTT TTCTTACGGA TTTTATTAAC ATCACTTGTT CCTAGTTCTG AGCAAGTAAG 180 

GTTCCnTACA CATTGGTGTA AAAAACACGT GAAGCCAATT AAATTTCCTG CACTTTTTGC 240 

TCGTTGATCT GCAGATAATT TTTGCTCTAC TTCATTTCCG TTCTTATCTT TCCCTTTCTC 300 

TGTCATTGGT ACACTAACAT CACCTAAGAC CGGATTAATA TTTACAAATG CGGTCTCAAC 360 

TTCTTTATCT ACACTAAaCA TCATATCAAA CAAAGAATTG GACGTGTTGG CATCTTTTAG 420

GACGGAAAGA CCCGTAAAAA TAAGAATCGT CATTAAAAAA ATACCAAAAA ACCGTTTGTA 480

TCGTTGTTGT CCAATAAATT TACACCTAAA ATAACAAAGA CAAAGGCAAT nCCAATCGTT 540 
CCCGCAATGC CCAACATGTT AGTTGCTATG CTAGACGTTA GGTTCATAAT CGGTTGCTTA 600
ATAGCAGTTA TAATATCCAT ACTAAATAGT GTTTTTACCA TAACTGCATT AAACTCTCCG 660 
AGTCCTTTTA CCATTGACCA GACAAAAGTC TTTATTGTCG CAGAAGCATT TGTGATGGCT 720 
TCTTCAACnC CTGTCCATGA ATCTTGTTTT TTTTCTTTGG TCATtAACTC GAAAGCGTTG 780 
TCTTTGTAaG TATCGTAAAT CAAATCTACA TTAGTGGTTG GCTCTGTAAC AATAACTTTA 840
TTTTCATCAG GTTTAGGTTC GGCATTAACC GCTGTAaGTT TGgCGTAAAC ACcAAAAAAA 900
ACCnGGCCTA CTAAAAAATn GTAAGGACTG GCTTTTACCC CGTGGATTTC TTAATTC 957 
(2) INFORMATION FOR SEQ ID NO: 830: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830: 

GGATTTTnAC TCCAAAAAAA GTTTAGCATC CGCTTACATT GTTGCCAAGA AAAATGGAGA 60 

ATTTTGGAGT TTTTTAATTy TTATAGTGTA TAATAAACAA AAATTACATT CTAGAGGAGA 120 

TTTTATGAAT TACATTTATT TAATTAATGA ACATTATCCC TTATTAACAA AATCGGAACG 180

AAAAGTTGCT GATTTTATTC TTAACTCTGG AGAATCTATT ATTTACAGTA CTATGAATGA 240 

nATTAAAAnA AAAGCGAATG T 261 
(2) INFORMATION FOR SEQ ID NO: 831: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 831:

ATTGCTATTA AACCAAATGG GAACACTATA TATTCTCCAG TAGATGGTAT TGTCCAAGTC 60

GTATTTGAAA CTGGGCACGC TTATGACCTC AAATCAAATA CTGGTGCTGA AATATTAATT 120

CATGTTGGTA TTGACACAGT ATCATTGAAT GGTAAAGGAT TCACTAAAAA AGTTGGTGCT 180

AAACAAAAAG TAAAAAAGGG AGAGGTTCTT GGAACATTTG ACAGCACAGT AATTACAAAT 240

TCAGGACTTG ATGATACAAC AATGGTTATT GTAACAAACT CTAAAGATTA TTCAGAAGTT 300

ATTCCTATAA CTAAAAATAT AGTTACTGAA gGTGCAGCCT TATTGAsrAT TAAATAGCTT 360

ATTAAAAGTT AATAGACTCT CtATACmAAG AGAGTCTATA TTTTTATTAT AGTTAATTAT 420

AACTATTCTA AGCTAATTAA TTTGACATAA A 451

(2) INFORMATION FOR SEQ ID NO: 832: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 612 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832: 

CCCTTTATGC CCTnTCCGnC TGTAATGTTG TCCGGTTGTG GAACCnGTAT GGAAAAgrTA 60

TTGACTGCCG tGGTGGTAAA TGGGAACTAG AAAATAAAAG TCCAACTACT ACTTACACTT 120

TTTTTGATGA TGAAACTTTT TCGAGGTATA ATTCAAAAAT TAGTGATAGT GGAACGTACT 180

CTTACGATGA AAATAATAAA AAACTCACTT TGGATATAAA AAATAAAGAA CAATTAATAA 240

TGGAAAATGT TGAATATAAA GACGGTAAAT TAAAAGGTGA AATTGGAGGC GTAGAAGGAA 300

CTCTGATAAA AAAATAAAAT AAAGAGGTGA TCTTTTGAAT AAGAAATCAC TGGTAAAAGC 360

AGTCTTACTA TTTTTAGTAG GCTGTTTTTT TGTGTTTACG CAAACTACAG CGGTTAATGC 420

CGAACCTAAA CCTGATGAAA ATAAAGTTAT TGTTACAGAG CCAACCACTA ATGTAGATTT 480

GATTTACGAT ACTTACAAAG ACAaCgCTTT CGAGTTAATG ACCAAAGArA AAAAACAAGA 540

TTCATGGACA GGGTTGAAGA AGCCATCACA AATGCTTCTG CGACAATAAA GACTTTTGTC 600

TGGTCAATGG TA 612

(2) INFORMATION FOR SEQ ID NO: 833: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833: 

CACTTACGGC AAAACTTTGA ATCGACCGTG TCAATTTTTT CACGAATATC TTCCAATGAA 60 

TCTTTGAGGC CTAACCACCG GATAATACCT CTTTCCTGCC AAATCTCCGC TTCACGTTTA 120 

TTACACATCG ATTGTGaACG TcGATAAATC GGcTTcTcTA ACGATGaAAG ACTTCCTAAT 180 

AATCtCAAGC AATCGTTCTG AGATAGTGTT TCGCTTAATA CAATTGCATC AAAATAATTA 240 

AAAAAAGTTA ACTCATGTTC TATAGTTTCT TCACAGCAGT TCGTACAAAA GACTTCATAA 300 

TTTAAATGTT GTAATTGCTG CTGAAATGTT TGCTCAACTA AAATGTTCTT TGTTAATACC 360 

AAAATCCTkC kCATTTcTTC TTTCCTCCTA GAACTCTTTC TGCTACCAAT AACTAAACGC 420 

ACCAATCAAA ATACGTTACC AAATCGCGTG TTTTCTCCAT CGATTGGTTA ATAAACACCT 480 

TTTTATCTTc ATCTCTCTTC GCTCCTTTAT TCTTTATTTT GGTGGTTATT TATATnATAT 540 

nAGAaATGgA TTTTgTTTGk GTGTAAAATT TTTTTAtCCc CGTAAAAAAA AATGaATTyC 600 

CTTTAATATC AACTAATAAA GCkGATTTTT aACAAAAAAg TGCCGAAACT TCCTACTTTT 660 

TGCTAAAACA CGCCAAAAnT CTGTAATTTT ATTATTTTTT TCTCGAGAAA ACATCAAAAT 720 

GTTGATGTTT AACACTAACA TTTTGATGTT TTCGACAAAA ATCTTTACAA TTAATCGGTA 780 

ATTTTCTCTA CTTTTAACGA TAACTACTTT TAAAAAAGAG TTATTTTAAA AGTGACTTAA 840 

ATATTTATCA TATAAAAAGT AAGGGACTAA AATCTTTACG ATTCTAGTCC CTTACTTTTT 900 

ATAATCCACA ACGTTTTTAG TTAGAAGAAT TCTTTTTTGC TTTCCTAGCA GAACGATCTT 960 

TTTGACGACG TTCAATTTTA CGTTTTTTCT TATTG 995 
(2) INFORMATION FOR SEQ ID NO: 834: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834: 

CGTTGGGGTA GCGAAATTGG TTCTTGGTTA GTTCAAGGGA GTAGACACCT TACGGATATG 60 

GGGTTATGGA CGCATATTCT TTCTGGATTC ATTGTATCAG TAACAGGGAT TATCATTGTT 120 

TACTCTTTTC ATAAGAAGCT ATCTCTTTTA GCTTTAACAG CAGCCATGTT GGTTGGCTTT 180 

AATCCTTGGT TTTTACAAAA TTTAAGTTTC AGGTTCGACA GTCCTTACAT GTCTTTGAGC 240 

CTTTTATTTT CTGTTCTTCC ATTTCTATTT TGGAATAAAA ATAAAGTCTT ATTTTTAGTT 300 

GTTTCTATTT TaAGCaTTTT TTTGATGTGt AATACTTATC AGGCGTCTTC GGGGATCTAT 360 

GTCGTGATGG TGTTAGCATT ATCACTGAAG CAGCTGTTAG ACAATCAATC GTTTATTGCT 420 

GTGCTTAAAA AAGCGATTGT AGCTATGATT AGCTATATAA GCGCCATGTT TCTTnATCTT 480 

ATTGAAACAA AATTTAATCC TGAAATAGCA ACGCGAGGCG GTATGnCAAC AATCGCGAGT 540 

GTTAAGGATA TACCCAAAAC AATCCTGGTA AATAGTCAAA TGTATCTGTC AAAAaTAaCT 600 

GAACAGAGTA CAAAACTATG GaTACTTTTG TTTTTTATTC TAGTAATTTT CTTTGTTTTA 660 

AGTACGGTTC TTAATGCGAA AGTATCGCCT GTTAAAAGCT TTTTATACGC AGTATTATTT 720 

CTATTTTTAG GCTCTATTTT GAGTTAnGGC GTC 753 
(2) INFORMATION FOR SEQ ID NO: 835: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1076 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 835: 

TGAGCGTGGA GCAAAAATCC AAAGTGATTT TTGTCCCACG CTCTTGTTTA TTACGGAAAA 60 

ACCATCAATT TTTTATTAAA TTCTGATATT TATTATTGCT TTTTCATTAT AGTTGGCGTA 120 

ACTTTAGAGT ATAAATTTGT GAAGCGAGGG AGTCCGTGTG AGAAATTTTA AAAAATCCGA 180 

TAAATTAAAT AACGTAAGTT ACGATGTCCG TGGTCCTGTT TTAGAAGAAG CAGAACGCAT 240 

GCAAGAAGAG GGCATACGTA TTTTGAAATT AAATACAGGG AATCCAGCGC CATTTGGTTT 300 

TGATGCCCCm AATgAAAwTG TGCGCgACAT GATTGTAAaT GTtCGCGATy CAGAaGGTTA 360 

TtCCGATyCA AAAGgTATTT TttCTGCAaG AAAaGCGATT GAmCAaTATT GCCAATTAAA 420 

AAAaTTtCCy AATGkAACGA TtAATGAwAT TtACACkGGT AATGGCGTGA GTGAACTAAT 480 

TACCATGTGT ATGCAAGGCT TGTTAAATAA CGGCGATGAA GTACTTGTGC CAATGCCAGA 540 

CTATCCTTTA TGGACTGCAT CTGTTTCTTT GGCGGGCGGT ACACCCGTTC ATTATATTTG 600 

TGATGAGCmA GCAGAATGGT ATCCAGACAT TGrTGaTATA AAaTCmAAAA TAACTyCGAa 660 

TACAAAAGCA ATCGTTATTA TCAATCCCAA TAATCCGACT GGTGCTTTAT ATCCGAAAGA 720 

GTTATTACTA GAAATTGTCG AAGTGGCACG TCAAAATGAT TTAATTATTT ACTCCGATGA 780 

AAtTTATGAT CGCTTGGTCA TGGACGGTTT GGTksaTGTA CCGATTGCTA CATTAGCTCC 840 

TGaTTTAwTT GTCGtAACAT TGAATGGTTT GTCAAAATCA CATCGAGTGG CAGGCTTCCG 900 

ATGCGGTTGG ATGGTGkTAA GTGGAGATAA ATCACGAGTA AAAGGGTATA TCGAAGGGCT 960 

AAATATGCyT GCTTCCATGC GGTTATGCTC AAATGTGTTG TCACAmCAAA TCATyCAAmC 1020 

kGCATTAGGT GGCTATCAAA GTGTCGATGG TTTACTTTTG CCAGGTGGCA GAATTT 1076 

(2) INFORMATION FOR SEQ ID NO: 836: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836: 

AGACAGCTTA AAGGTGAGCA AATGGnGGTT GTTACGCATG TTAnTGATGG CcAGCATCAA 60 

TTTAAAATCA ATGATTTTAT CGATTTTACA GGCAACAATC ATACATCAAA AGATAGCATG 120 

ATTGGGATGC TAGTAGGCAT ATTAGGTGGT CCTCTGGGCA TTCTTTTTGG CTGGTTTGCT 180 

GGAAGTATGT ATGGTGCAAG CAAAGACGCC AAAGAAATTC AAGAAGCACA AACGGTTTTT 240 

GAACATGTGA TTCAAAAGAT TGATGAAGGA CAAACGGGAT TGTTATTAAT TGCAGAAGAA 300 

GAAGACAACC GTCCGCTCAA CCAATTGGTT ATGTTTGACT TAGGTGGCGA AATCACGCGG 360 

CTTGATTTAG AGGAAGTCCA CCAAGAATTT ACGATGCGAA CGAATTGCCC AATGGAAGCG 420 

GAACCATCCG TGGGCAGCCC AAAAAAAGGn CCACCCCCnA GGAAGCCACC TTCCGGAAAG 480 

nAGGATTAAG GGnGCCAAGA C 501 
(2) INFORMATION FOR SEQ ID NO: 837: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 837: 

AGGTTCATTG GTTGTTTCAG GGCTTACTAC TGGTTGTTCT TGTGCTnGAT TTTTCACTCC 60 

TACTGTTGGT TGTTGGACGT TTGTGGTATC ATTTGGGGTA GCTGAACTCT GTTacTCmAT 120 

TTTTTTcrGA CGCTACTTCT TCTGCTTTGG TCGGTTGAGA AGTAGTGTCT TTTTGTACTG 180 

CAGATTCTTC AGATACTGCC GTATTAGGCG TTGCACTACC TAACTGCGGA TCGGGGTTAT 240 

CGGGGTGCAC TGTCGTTGAT CCTGGTTGCG TATCTAATTC CGCTGcTTGt ACATTATCAG 300 
TAGCTAATCC TACAACTCCT AACACGCCTA GAAAAAGAAT AGGAGCTACG ATCCAATGCT 360 

TCTTTGcCTT ATACATTTTA AAACGTTTTT TTACTTCAGT CTGTTGATTC ATGTATCATT 420 

CTCCTCGATT TTTTATTTTT TTGTTTGTAA AATAACTCAC GCAACACAAG GCGCTCGTTG 480 

CTTTTTCATT GTCATTTCCC TCCTTACTAA AAAAAGAGAA CCGAAAGTAT CTTCGATTCT 540 

CTTAGTTATT TTATATTCTT ATAAAACACG ATATATATCT TTGTTTTGCT ATATCAAACA 600 

ATTATTAACT ATTTTTTTTA CGTTTTTTGT TCTTGAAATT GATCGTACCT GCTGCCATCA 660 

ATAATCCAAT AATAGTCAAC CAAATACTTT GTTGTTCTCC CGTTTGTGGT AAGGATTCTG 720 

GTGCTTTTGC TACTTGTTTC GTCTTACGAG GTTCTGAATG TGTTAATGCT TGTGCTTTTG 780 

GTTGTTCTTT TTGTTTAGAA AAAGATAACT GTTGGAGACC AGGTGTTTTG TCTGCTGAAG 840 

TAGCACTTGC TACTAA 856 

(2) INFORMATION FOR SEQ ID NO: 838: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 838: 

TGTGCCTCTT GGGATTTCGC TGTGCCATTT TTTACTTTCA AACCCAGGTG TGCCACCGTG 60 

AAGTGTTGTT TGAGTAGGAG TATTAATCAT CGTTTGATAC TTTTTACCCG CTAGTTcAAA 120 

GGAGCCATCG GCAATTCGGC CAGCTGgTTC GACCAATtGT TGcACCAAAA TATaAGTCTT 180 

TAGACAAATA TTCATCTTGC TGAGGGGAAA GCCTAGCACG AGTGATCGGG ATCATTTnTG 240 

TAAGGGAACG TCGGCGCAAC A 261 
(2) INFORMATION FOR SEQ ID NO: 839: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 839: 

ATATAGCTAA AAATGATATA CCTGATGCAG ATATTATTAT GTACAATGAG ATGGATATGG 60 

CATATCTTGA TGAAAAAATC AAGAATACTG GATTGCCTGT ATCATATATA GATTTACAAA 120 

CAAAAAATGC TGATAGTATA AAGAAAAAAT TAGAAAAATT TGCTGAAAAT AAAATACAAT 180 

AATAACTACT ATAAAGTTAC TGATGAAAAG AGCCAACAGG GTATACCCTG TTGGTCTTTT 240 

TGTATGTTCA TGATATACTT GAATTTCTAA GAGGkCaACA GAGTTGAGCA mCAGAATTTT 300 

AAAATAAAGT AATTTTTTrG AGGrGAtTTT CTATGGCAAA AATTGGATAT GCnCGAGTAA 360 

GAGTGGGATC AAAATTTAGA TCGACAGTTA GAATTATTAT CnCAGTGTGA TAAAATTTTT 420 

ACTGTAAGCA AGTGGGAGGA ACTAAnCGnA TGGTTCCAGA AATGTGAATT TATCGGG 477 

(2) INFORMATION FOR SEQ ID NO: 840: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 840: 

GCAACAAGTT CGCTATATTC AACAATTATT GGAATATACG GCAGAAACAT TTGCAGAGAA 60 

TAATGATGGA CAAGAAATTA TGAATCGAAT TAAGACCGTT TATGAAACCT ATCGCACAAA 120 

ACCATTGCCG CAAACAAGAr GTGAATTTGA AAATCGGGCG CGACTCTTTC AGTTTTTACA 180 

ATTGTTTCAG TCATTCATTC AAGTCAAAGC GGACTTTGCC AAATTACAAC TAGATAAATA 240 

AACAGAAAAA GAAACCTTTT TAAACAGAGG TTTCTTTTTT GTnCTCTTAA AAATTCAGGC 300 

AAGTTCTTTA GTATTCATTG AGAATGTGTT AAACTAATGA GAAAGAAAnn TAAC 354 
(2) INFORMATION FOR SEQ ID NO: 841: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841: 

AAGCCAGAAA TATGTTGTAA AAGTTGAGCA CTCGCTGTAT TCACATCAAC ACCAACTTGG 60 

TTTACCGCAG TTTCTACAAC AAAATCCAAT TGCTCGGCAA GACGTTTTTG CGATACGTCA 120 

TGCTGATATT GACCGACCCC GACTGCTTTG GGATCAATTT TTACTAATTC TGCCAACGGA 180 

TCTTGCAAAC GACGTGCAAT GCTGACCGCA CTTCGTTCTT CGACTTGTAA ATCAGGAAAC 240 

TCTTTACGAG CCACTTCACT CGCAGAGTAA ACGGAAGCCC CTGCTTCATT AaCAATCACG 300 

TAAAAGACGT CACGCTTCAC CGCTTTTAAT TGCTCTGCCA CAAACAATTC AGATTCACGA 360 

CTTGCCgTTC CGTTTCCAAT AGCTACCATG TCCACtTGAT AGTCTTCArT GATCTTTTTA 420 

AAGGCTGGAC CAGnGGCTTC ACGTTTCGCA GCAGTCGC 458 
(2) INFORMATION FOR SEQ ID NO: 842: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842: 

ATTTGkCGTA CTTTTTCTTC ATAATTTGTC GTACTTTTAG TCGCAACTGC TTGAAGACTA 60 

GCCAATGTCG CTTTATGAAC AGATAGTtCT TTTTCTGCTG TTGCTTGTGT TGTTTTGGCT 120 

GTATTTAAGA CTTGCTGACT TGCACTTTTT GCTTGTTGAG CGTCTGATTC TGCTTGTTGT 180 

GCAGTAGCTA ATTTTTCTTG ATTGGTTGCT ACTTCTTTTT GAAGGGTTTC TTTACTCTTA 240 

AAGACTGGAG CAGTGCCTTC TTCATAGGTA TTTGCTTCAA CTAATTTTGA TTGATTATAG 300 

CTAATGATAT GGATTTTTGC CGAAATATCA TTTAAATCTC CTGAAATAGA AACTGCCATG 360 

TTTGTTGTGT CCCCTTTTAG CAATGAATCT ATATGAGAAT ACGCTGAACT TTCATCGTCA 420 

AAGAGCATAT CAGCAATTGT TTTTCGAGCA GCTTTTTCAA ACTCTAGTTG AGAAATAGTG 480 

CCATTAATCG TTTCAAAATA GCCCATACTT AGATTTTCAT AAAAGTTTTG CCCTGGATAC 540 

TCTTTAAAAC CATTTTCTTT TGCTGCACGA TTAATTGCAT TTACGTCATG ATCAAACTCT 600 

TTGGGATTAT CATATTTTGC TACATCCCAA GCAAATCTCA TAGCTTGGTT AGAAATCTTC 660 

AACTTTTGTA ACCCAAGCTT TTCGCGAACA TCATTGATAA GTCCTACAAC GTAATCGCTC 720 

ATTTGTTGCT TTTGTTTTyC GGTAGGATTT TTGACATTGA TCATTTCCGT TrCTTTTGCC 780 

rCATCTGCTT GATTTTCAGG AAAAACTTTA TTTAATGCTA ATGCTTCTTT TTCCATTGCT 840 

TGCTTTTCTT GTTCAGwTAA TTTTTTGTAA TAATCAGGTG TGAAGTTAGC AGGyAAGTTA 900 

AkTCCTTTAT GyCCyTTTAr CTCTTCTTTT GAkyTATCTA AAGTATTCTG CGCTTGCTTT 960 

GTTGCTGCTT GCTTGTCTTT yAAAGCATTt TCTTTyGmAG TTAAATCTGC CTGTTTCGCA 1020 

GCTGTATCTG CTTTTGCGTC TGCTACTTTC TCTTGACTAG TTGTCACAAC rGTTTGTTGG 1080 

TCTGCrACTG CTTGATOTTC CTTCGTyTTT TGATCTTTAG CTTTkGCTTC TTCTTTCTGT 1140 

TTGTTAGCCA GTTCAGCTTC TTTTGCTGCT TGTTCTTTTT CTGCCTTTAC TTTTTCCGCT 1200 

TCCGCATTTy TCGCTAATTC TTCAAGCTTG GCTTGTTCTT GTTTTGCAGC TGTTACTGCT 1260 

TGTTGATCTT TTTCATTTTG CACTTTAGCA GCATTCGTTT CTTTTGCTTT TTCATCGACA 1320 

ACAGCTTGTT GTTGGTTAAC GTCTGTTTGA GCTTGATCCA CTACTTTTTG TTGTTCATCA 1380 

ACAGCCTGTG TATCCGTAGC CACTTGCTCT TTAGCTTTTT CAATGGCTGA AGGTGTTGCT 1440 

TCATCCACGA CTTTTTTAGC TTCGTCTACA ACTGCTTGTT GG 1482 
(2) INFORMATION FOR SEQ ID NO: 843: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843: 

GGAATCATGG CTAAAAATGA CCCTAAATAA GGGATTAAAT TTAAAAAGCC TGCTGCAATA 60 

CCTAAAGTGA CTGCATAATC TAAGCCAATG ACAGAAAAAC CAATCATAAA CATAATTGCC 120 

ACTGCGAAAG CGACAGTTAa TTGTCCTCTA ATATAAGAAG AGACTTGATC ATTGACTTCT 180 

GCAgAACTTT TAgGTTGGTT TACGCATCTT CACTGGTAAA AACTTCATTA AATtGGCGCT 240 

AgTTTTTGCC ATCTTTCAGT AAATAAAATA GAATAAcGGC AtCGTAATGA TGGCGACAAA 300 

GATCGTAGCC ACCGC 315 

(2) INFORMATION FOR SEQ ID NO: 844: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844: 
TTTTATTCTA TTAACCAATC CGGCGCATTG GAATATCATT GTTATCTGGA TGAACCAATA 60 
AATATTGAAT AACATCAATA TTGCkTGCTT GGAATGAGGc TGCACATGCT TGCAAATATA 120 
AgTCCCACAT TCGATAGAAg CGCTCGCCTT TTTCGTCAAC AATTTCTGCn TCTATATTAT 180 
GGAAGTTTTT TGTCCAATGT TCCAACGTCA ATTGAnAATC TCTGCGCAAA CT 232 
(2) INFORMATION FOR SEQ ID NO: 845: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 845: 



TCGCTGCATT TACATCTGGA TATGGGCCCA AnATATAACG CTTTGTCTTT TAAAACCTTT 60 

CGAGTAATCA TTAAACGAGG ATATTTTTCG TTTGTGATTT TGATAAAAGG ATAGGTTTTA 120 

TCGTCTTTTA ACATGATGTn ATATTTGGGA TTATTTTTAT GGATTAAATT AATTTCTAAT 180 

AAAAGTGCTT CAATATTCGA TTCTGTCACA ATATATTCAA AATCATCGAT TTCACTGACT 240 

AATCGTTCTG TCTTAGTATC ATGGCTACCA CGAAAATAAG ATCGGACACG GTTTTTCAAA 300 

ATTTTGGCTT TCCCAACATA GATAATCGTG CCGTTTTTAT CTTTcATTAA ATaACAACCA 360 

GGTTGATCAG GAAGCAGTGC TAATTTATTT TkTATTCGTT CGTTCATACG ATACTTCCTT 420 

TCTAACAAAA TGCTTCGtCT ATTATACACC TCTTTTTGTA ATTAAACAAA AAAAGCCGCT 480 

ACTTTATGTA CGAAAGAAAC AGCCTGAAAA GACAAtCGCT AATGATTTCc TCTTCAGACT 540 

GAATCAAACT ATTCGTAATG ACTAATCACG TTTATAAGTG TTTGGCAATC ATTGCTTGCA 600 

ATTGGTCTTT TGTATGAACA CCGACTGCTT TTTCAACAAC TTCGCCATCT TTTTTCAATA 660 

GCAATGTTGG GATGCTCATG ATTCCAAAAC TTGCTGGTGT TGCAGGATTT TCATCCACAT 720 

CCATTTTGAC GATTTTCACT TCGTCTTCAT CATATTCTTC AGATAATTGT TCTAAGATGG 780 

GTGCTTGCAT ACGGCAAGGG CCACACCAAG TTGCCCAAAA GTCGATTAAG ACTAATCCTT 840 

CATCTGTTTC TGTTGCAAAA TCTTTATCGG TAATTGCTTG TGTCATTTCA CGGGATC 897 


(2) INFORMATION FOR SEQ ID NO: 846: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846: 

TCGAACTGCC GACACACGGA GCTTCAATCC GTTGCTCTAC CAACTGAGCT ACCACGCCTA 60 

AAACAAAAGC TTGTTCAAAT AATTTTCAAG TAACAACATT AAATGGCGGT TCCGACGGGA 120 

TTTGAACCCG CGATCTCCTG CGTGACAGGC AGGCATGTTA ACCCCTACAC CACGGAACCA 180 

ACCGAGTAAT GGAGGTTAAC GGGATCGAAC CGCTGACCCC CTGCTTGTAA GGCAGGTGCT 240 

CTCCCAGCTG AGCTAAACCT CCAAAAAGGA ATGTTACTAT TAAAATAAAT GACCCGTACG 300 

GGACTCGAAC CCGTGTTACC GCCGTGAAAG GGCGGTGTCT TAACCGCTTG ACCAACGGGC 360 

CAGTTTGTAA ATATGAAACG GAGAGTAAGG GATTCGAACC CTTGAGACAG TGTTTACCGC 420 

CTACATGATT TCCAATCATG CTCCTTCGGC CTCTCGGACA ACTCTCCATA GCTGAAAGAA 480 

GCACATACTC ATGATATTTC ATATGTTAAA CTCCGGCAGT AGGACTCGAA CCTACGACAT 540 

CATGATTAAC AGTCATGCGC TACTACCAAC TGAGCTATGC CGGAATAAwy GCGTGGCGAC 600 

GTCCTACTCT CACAAAGGGA AACCCTTCAC TACAATCGGC GCTAAGAAGC TTAACTTCtG 660 

TGTTCGnCAT GGGAACAGGT GTATCCTTCT CGTATCGCCA CCACACTGGG TGTTGTnTCT 720 

TATTGAGTTG AATCTTCATT 740 
(2) INFORMATION FOR SEQ ID NO: 847: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 847: 
CATTAATACA GGTATTAATC TCTATAATGA TGGTCAACTA GACGATGCGC CTGTAGCTGG 60 

GGAATATAGT AAGCAATTAG AAAATnACAA AGATTTCATA CGAGAATTAT CTGCTACAAC 120 

TATGTTTTTA GAGGTAAATC AACGAAATAA AAAAAGTATA ACTTCAAATA AACATGCTCG 180 

ACAAGCGATT AATTTTGCGA TTGATCGTGA GGCTATATCA AACAAGATAT TAACAAATGG 240 

AAGTATACCA GCTAAAGGTG TAGTACCGAG TAAACTAGTA TATAATCCAA AAACAGGAAA 300 

AGATTTTACT AATTCAAGTT TAGTGTTTTT GGATAAGAGT AAAGCTAAAG ACAGCTGGGA 360 

AAAAGCTAAA AAAGAATTAA AAGGTACTGA TTTATCTATC GATATTATGG TCAATGAAGA 420 

AGATTTATCA AAAAAACTAG GTGAATATCT TCAAAATGAA TTGCAAGATA CTTTGGATGG 480 

ACTTAAAGTT TCAGTTACTG CAGTACCTGC CACGTTACAA ACTGAACGAT TAAATTCTGG 540 

GAACTTTATG ATTGCATTGA GTGGTTGGCA AGCAGCTTTG CCGATCCAGT TAGTTTTTTA 600 

GCAAATTTTG AGAGT 615 

(2) INFORMATION FOR SEQ ID NO: 848: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 848: 



CGTTGTAACG CCTTTTGCAA CCATGCTTGC TTTACGCAAT TGACGGCGTT GTTTTTTAGT 60 

TTTACCGTGG AAACGGTGAG ATGTAAAGGC ACGGCCACGT TTTAAGCCGC CTCCACCAGT 120 

ACGTTTTACA CGTTTTGCTA ATCCGCGGTG TGTTTTTTGT TTTGGCATGA CTAATTTTCC 180 

TCCTCAAAAT CTTTGCTTCA AGTGAAAGAC TCTAACAGTC ACAACTGTCG AATATCCTTC 240 

TTGTAGAAAT TACCAAAAGT CCGAATGCTC GGTCTCTTCG TTTACTTGTC GTTTTTCGGT 300 

GCCAGCGTTA GAAACATGCT GCGTCCGTCC ATTTTCGCTT TTTGTTCCAC TGTGGCGATG 360 

TCCGCAGTTT CTTCAGCTAA GCGATCAAGA ACTTTCTGAC CAATCTCTTT ATGGGTAATG 420 

GCACGGCCTT TGAAACGGAT AGAAGCTTTC ACTTTGTCTC CTTTTTCTAA GAACTTACGC 480 

GCATTACGAA GTTTTGTATT GAAGTCATTT ACGTCAATTG TTGGACTTAA ACGAACTTCT 540 

TTGACGTTGA TCACTTTTTG TTTCTTGCGA GCTTCACGCT CTTTCTTTTG TTGTTCGAAA 600 

CGGAATTTTC CATAATCCAT AATTCGCGCA ACTGGTGGTT TCGCTCCTGG CGCTACAAGT 660 

ACTAAATCCA AGTTTGCAGA TTCAGCGATT TGTAATGCTT CTGCTTTGGT TTTTACACCT 720 

AATTGTTCAC CATCTTGTCC GATCAAACGT rACTCGCGTG cACGAATGCC GTCGTTAACC 780 

ATCATATCCT TtGCTATGGT CATTCACCTC CAAAATTTTn AAGAGAAAAT CCTAACGGAA 840 

GTCTGCTTTG TTTGCTAAAT GCAATAAAAA AACAGGTTCC TATCCAGAAC CCGCCAACAG 900 

TCTGCCTTAC TTAAAAAAAT AAGGAA 926 



(2) INFORMATION FOR SEQ ID NO: 849: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 849: 

ACTCAAACTA TTTCGAACGA TAATCGTATA CCAGCTGAAT TTGTCCGTCA TGTTGATGGA 60 

GATACCACCG TGTTAAGAAT AGACGGAAAA GAACAAAAGG TTCGCTTTTT ATTAGTAGAT 120 

ACACCCGAGA CTGTAAAGCC GAAAACAAAA GTACAGCCAT TTGGTTTGGA AGCCAGCAAC 180 

CGTACAAAAG AGCTTTTGTC TACTGC 206 
(2) INFORMATION FOR SEQ ID NO: 850: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850: 

ATCAAATTCT TTTTCTTACA ATAATTAAAA ACCATATTTA GAACCTGAAC AAGAGATTGT 60 

AAACTATTTT TAGCATAACC AACATTAACG CCATCAATTC CCTTTTCTTT TAAAATGAAA 120 

AGGAATTTTT GTAGTTCTTC TGATGATAAA CTACTTAATT TTTTACCACC AATCCATCGA 180 

TTAAGTATTT TCACCTGTnC TTCTCGTTTA TCATACGTTG CCACCTTTTG TTCGTTTAGT 240 

GATCGATAAT ATTCAAACCA CTTTTGACTA ACAACACTAA ATTTATCATT ACTAGCAAGC 300 

CTnTnTTGCT 310 
(2) INFORMATION FOR SEQ ID NO: 851: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851: 
CCTTTGAACA TTACTGAAGC CAACACGTAA TCGACTTCTT CCCACGGAAT TTGGATAAAC 60 
TTACGTGGAT CTTTGGCATG GTAAAATTCA AACCCGCGGT CACCTATCAT AATTTGTCCA 120 
TAATCTGTTA ATCCTGTAAA TGCCGTCGCA TCTATAACTA AATCAACTTT TGTGTTTAAT 180 
GATTGTACCA TGTTATCTkC TCCATTTCAT TCAACTAATT CCTCTTTATT ATACTTTTCT 240 
TTCCTCATCT AAACAAA 257 
(2) INFORMATION FOR SEQ ID NO: 852: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852: 
CACACCTTAC TCAGCAAATC TCGTCACAAA TAGTACTACA AAAACCTTCG TGTTATTATT 60 

AAATAGCCAT GAAACTTTTG AGGCTTGGGT TACGAGCTTT ATTGATAAAG CTCAAGAAAA 120 

TGGCTATAAC TGCTTGATTT TAAATAGTCA CGGTAGTTTA GAACAAGAAC TGAAAAATAT 180 

TACCTCGCCA ACTTTAAAAC AAGCCAGTGG CATTATTTGG GACCCAATCA ATGAAAACAG 240 

TAAACAATAC GCGAAATTTA TAGAAGAATT AACGATTCCT TACTATTTCG TCAATCACTC 300 

AACTAACGAA TTACCTTCTC TAGCGGGGCT TTATGAAGAA GCTAGTTATG TTTTAACCCA 360 
AGAATTATTA CAGAATAACC ATAAAAAAAT TAGTTATTTC GTCACTAACC CmAAATwTAA 420 

AGrrGCmATT ATTGCmGGcT ATAAAAAAGC C 1 1 o 1 1 1 b A 1 1 L^rirV 111 480 

AGAAAATATT TTAACTACTA TCCCTGCTGG ATTTCAGTAA TCAGCTTATC ACAGATGGGA 540 

TTACGGGTAT CCTTACTGGA TGACTATACC CAGGCTGTTT TTTTGGAACA GCTTTTAAAG 600 

CAGAGCGGTT TGCGAACCCC CGATAATTAC TCTTTACTCG CAATTAAGCG GAAAAATGGA 660 

TCGGTGCCAT TTTACCGGAT CCnAnTTCCT nCGnCCTAAT TAGATACCGA GACGGTTGGG 720 

TAGCCAGTTA G 731 



(2) INFORMATION FOR SEQ ID NO: 853: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 853: 
TCACTCGTTT ACATACGAAA GACAGCTGCG ATTTTGTAGA mGCATCTGGT TCTTTCTGCC 60 
AAAGCAAGCG ACAAATTTGA TCACGATCCA ACCCTTCGTG GGCATGCTGA TACAAGATTT 120 
TTAAAAAGCG TTGCTCGTTT TT 142 
(2) INFORMATION FOR SEQ ID NO: 854: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854: 
AACATACACA TTTCCTTCTT TAACCAATGG ATTTAATTGA TCTAACACAC CAGCTAATTC 60 
GCCAGAGTTC TTCTCTGCGT AAAACTTGGC CCCAAATAAA CCACCACCAA TAACAATTGC 120 
TGTCACCACT AAAAATTTAA TAAATTTCAT CACATAAATC CTCCCGAGTA ATTATTCTCT 180 
GTTACTTCTT CCAAAGTATA GCAAGTTTTA AACAATCCTT CCTCGGGCGC AACGACCAGT 240 
TTTTCCTCAG ACTTATGTCG TATnTnTGTT ATTTTGAATA AATAACAAAA CACTACTCnT 300 
C 301 

(2) INFORMATION FOR SEQ ID NO: 855: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 855: 

TGCAGCAGCT GCAGTTGCCG ACTGAAAATC CCCGTTGTGA TCCCAAAATA AAGCTTCTAA 60 




CATTTTTGCA CCTCGATAGT TTTATTTTAA TTATCTACTA ACTATCGATA TCTTTCAACT 120 

TAAGATTATC CTCAATCAAT ATTTTTAAAT CAGAAAATTA ATTCAATTTG CATTCTAATC 180 

CATCAAATTC TkGTTAATAT TATTTTGAAT GTCTGCTTCA TCAAAGAAAC CATGCCCACA 240 

ATAAACCAAG CTTACAATCA TCAATTTCCC TTGGCGTGGC TCCCTCGGnC ATTTCCAAAA 300 

ATAGGAnGCA TCCTTTTTAA TCTGGCACAG 330 
(2) INFORMATION FOR SEQ ID NO: 856: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 856: 

AAGGTGTTAG AAAAAATGCT ACCTAACTCT TCTGATGTTA CTTTTTACAA ATTTTTAAAT 60 

AAACAGCATA GTTTGTATGA ACAACAATTT GAATTTTTAA AAGAAGTAAT TCAATTTAGA 120 

AAAAATAATG AATATGACCA ATATAGCCAA ATAGGGTGTA ATTTATTACA CTATGATTTA 180 

TTAATTTCAG GATTACTTTT TGATGTTAAG TCTTATGATT ATTCATATCA ATACTTTTTA 240 

AAATTAGATT TAGCTAACTA TTTTTCTGAA TTAGAGATTC CTGATGAATA CAAAATGTTA 300 

ATAAATAAGT ATCGGGAGAA TGAATCATGA TTAAGAGACC TCATGkAATG CAAGAkAGTA 360 

TACmAGATgT GGTGkTgCAT GCATAGrAAT GrTATGkAAA TTCyaTAATA TTAATATAGr 420 

TAGaaGATAT ATCCAAGAAG AAACTGGCTA TGGAATGATA GGAATATCAT TAAAAGCTAT 480 

GGAAAAATTT TTTTCAAAAG TAGATGCAAA CCCAGAAATA GTAAATATAT CTAAAATTAA 540 

TAGGTTAAAC AAAGAAAATA GGGAAATGAT TAATAACTCA CT 582 
(2) INFORMATION FOR SEQ ID NO: 857: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857: 

TGCAnAAATT AAAAGAATTG CAATAGCTAA ATAGTTAGTT tGATTAAAAT CTATTACAAA 60 

TTTTTTTAAT AGTTCTAAAA CTGTAGATAG TGCAAGTAGA GCTATAAATA TTAATATTAT 120 

ATTTAAGGAA GTTTGTATAT TTTTTTTTAA AGTTTGACTA TTGTTATTTA CACTAAAAAA 180 

TGTTATTAAT GACTCTTTTT TACACATTTC AATAACTTTG ATTCTAAACA GTAATATTTT 240 

TTTTATATTT GTTTTTAAAT CGACTTTTCG TTTTTTTTCA TCTAGAATGA CTTTTCCATC 300 

TCTGGTTAGT TCAACAACAC CCAGAAACAG TTCTTCTTTA TTACTATTCC ATAATAATAT 360 

CGGCCCCTGA CCATGATTTT TATCAATATC AAAATCAGAw ATAATAACTT TATCTGCAAC 420 
TTTTCTTAAA TCATTTTCTA GTGTTTCTGA AGTTAAGTTC TTATTTAATT TATGGATCAA 480 

TGTG 484 
(2) INFORMATION FOR SEQ ID NO: 858: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 858: 

GCGTAACTCA TGTGTTCACC CCCTGTCCGT TGCTCTCTGA AACAAATGGT ATAAGGTAAA 60 

GGGCAGGTGA ACCTTGGTCT TTACGAAAAA AGTTTTGAnA ATTTTTTTGT GCCTGATTGC 120 

GCATTTTGCT ACCTCCGAAA ATTTGAAAAT TAGTGAAATT CCAAACCTTT TTGGAGGTTT 180 

AATGTCTGAT TAGCATATCT GTATACGTTT GATAATGCGC TTAAATGATT TTGAAGCGGC 240 

GCGnTCTGTG CCACACTTTT TTTGACGGGA TACCCGCCGT GTATATAAAA ATAAAACCAT 300 

CCATGTACTG TCCCTTTCTA CGGACAGACA CAAATGGCAA AGGAAGTAAG CCCTGATGGC 360 

GGTATTCTTC TTTGCGCTGG CGTTCCATTT CACGCTCATA GTCGGCAAAG ACCTGATAGA 420 

TTcTCTGGAT ACCTCGACCG TGATGGGCTT ACCGCTACCA TCAAAAATCG TAACGTAACA 480 

CATGGcTCCC TCCTGTGCAG CGTCATCCAg GCAGAGAAGA ATGCCAkGGG GGAAATGTCG 540 

TCATCAGGAT CAAGGGGAGC GnCCCATGCT GCACAACCGA GGGGCAA 587 
(2) INFORMATION FOR SEQ ID NO: 859: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859: 

GCCnCCnTTC AAAGGAGTnG CACCATCAGG AAAATTAGCT ACTGCACAAC AAGCAGAATC 60 

AGCGCTCAAC AGCAAAAAGT GCAAGTCAGC AAGTCTTAAA TACAGCCAAA ACAACACAAG 120 

CAACAGCAGA AAAAGAACTA TCTGTTCATA AAGCGACATT GGCTAGTCTT CAAGCAGTTG 180 

CGACTAAAAG TACGACAAAT TATGAAGAAA AAGTACGACA AACTGCAACG GCAGAAAAAA 240 

GCCTTCAACA AACAAAAGAT CAATTGGCAA CAATCAATGA GTTAATTCAG AATCGAGCTG 300 

CTGTTTTAGA AAAAGCGAaA ACAAACGTTT CGGAAGCACA AGCAATTGAA CAAACGTCTG 360 

CTAAAGTGCT GAAAGAAAAA CAAGCAGCCC AAAAAGCAGA AGAAAACACA TTGAATAGCT 420 

TGAAGGAAGT ATTGAATTTA GCAAAAGAAA ATTTAAATCA AAAACAAGTT GCATTTAAAA 480 

CAAGTACACG TTCATTGTCT CGTTTAGAAA ATGCTCAACC AACATACGAA AAAGCATTAA 540 

ACGAGTTAAA CAAAGCAGAA GCAGCAGCGG TCCAAGCACA AGAAGCCTAT GAAAATTCTC 600 
TGAAATCATT GGAGGAACTT AAAGAACAAC AAGCCGTTGC TACACTTGCT TATACACAAG 660 

CACAAGAAGA CCTTTCTAAC GCGAAGTTAG AGCTACAGCA GTACCAGGGC GTATTAAGAG 720 

AATTAGAAGC aCAACAAGCC GaACAGCAGC GACAAGAAGC GTTGCAAGAA CAAGTAGCAA 780 

AAGAACAACA ACGCCTTGAA CGAGAAGCAA AACAAAGCCA AACGTTAGTA GCAAGTGCTA 840 

CTTCAGCAGA CAAAACACCT GGTCTCCAAC AGTTATCTTT TTCTAAACAA AAAGAACAGC 900 

CAAAAGCACA AGCATTAACA CATTCAGAAT CTCGTAAGAC GAAACAAGTA GCAAAAGCCC 960 

CAGATTCTTT ACCACATACA GGAGAAAAAA ATAATAAATG GTTAGCTATA GCTGGTCTGA 1020 

TATTTGCTTT GTTAGGAGCT GCGGGTATTA TAAGTTTTAT TAGTAGAAAC GAGAAGAAAG 1080 

TAAAAAATAT CTTTAAAATT AAATAATAAA GTCAAAGGAG CGACATATTT CGCTCCTTTT 1140 

TATTGTAATC ATTGGAATAA ATTGAAAAAT AAAAAAATAG CCAACCAAAA TGGTTGGCTG 1200 

ATTGTATTAA AGGCAGAAGA ATCTACCAAA TTTAGGAGCC AACCAGTTTC CGCTGGTCAA 1260 

CAAAAAATAA ATGGATTGTC TCGGATGGAG ACACTTAAAT AGTACCTGAC AATAAAATTT 1320 

TAGGCAACAA AAAAGTCAGC CGTTTTCCGC CGACTGACTA AACGAATAAG ATTTGTAAAG 1380 

AAGTGATTTG TACTTTTTAC ACAATTATGA TAGCACGTTA AATGTTTTTA ACGCAAAAAA 1440 

AAGCTAAGGG AATCGAAAAa CTTTCGGTTC TCTTTTTGTG AGGAGGAATG GAAATAATGA 1500 

TGAAAAAGCA ACGAGCTCCT TGTGTTGCGT GAACTATTTT TACAAACAAA AAATAAAAAA 1560 

TCGAGGAGAA TGATACATGA ATCAACAGAC TGAAGTAAAA AAACGTTTTA AAATGTATAA 1620 

GGCAAAGAAA CACTGGATTG TAGCCCCTAT TCTTTTTTTA GGTGTATTAG GAGCTGTAGG 1680 

ATTAGCTACA GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGGAA CAACGACGGT 1740 

GCAACCCGAT AATCCCAACC CGCAGTCACG AAATGAAACA CTTAAAACGG CAGTATCTGA 1800 

AGAAGCAGCA TTACAAAAAG ACACTACTTC TCAACCAACC ACAGCAGAAG AAGTAGTGCC 1860 

GAAAGGAATT GCTGCTGAAC AAAGTTCAGC TACCTCAAAT GATACCACAA ACGTCCAACA 1920 

ACCAACAGCA GAAGCAGAAA AATCAGCACA AGAACAACCA GTAGTCAGCC CTGAAACAAC 1980 

CATTGAACCT CTAGGGCAGC CAACAGAAGT TGCACCAGCA GAAAATGACG CTAATAAATC 2040 

AACGTCAATT CCTAAAGAAT TTGAAACACC AGACGTTGAC AAAGCAGTTG ATGAAGCGAA 2100 

AAAAGATCCA AACATTACCG TCGTGGAAAA ACCAACTGAA GACTTAGGAA ATGTTTCTTC 2160 

TAAAGATTTA GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCCCAAAA 2220 

GATTGCCCAA CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC 2280 

AGAAATTGCG GCAAAAAATA AAGCGGAAAA AGAGCGCTAC GAAAAAGAAG TGGCGGAATA 2340 

CAACAAACAC AAAAATGACA AAGGCTATGT GAATGAAGCA ATCAGCAAAG ACTTAGTTTT 2400 

TGATTCAAGC ATTGTGACTA AGGATACTAA AATAGACAAG ATTACAGGTG GCAAGTTTAT 2460 

TAAAGCTTCC GATTTTAACA AAGTAAACCA AGGACAATCC AAAGATATTT TTACAAAATT 2520 

AAGTAAGGAT ATGAATGGAA AAGCAACAGG CAACTTCCAA AGCTCAAAAG TGGCTGCTGT 2580 

TGAATTTGGC CCTAAAGGAG GATACGCGGT TCTTTTAGAA AAAAACAAAC CGGTCAATGT 2640 

CACGTATACA GGATTAAATG CTAGTTATTT AGACCGAAAA ATCACAAAAG CAGAGTTCAT 2700 

TTATGAACTT CAATCTGCAC CAAGTCAAAG CGGCACCTTA AATGCAGTAT TTTCGAATGA 2760 

TCCTATTATT ACAGCATTTG TTGGTACTAA AAATGCCAAT GGGAAGGACG TAAAAGTTCG 2820 

CTTAACCATT AAATTATATG ATGCTAATGG CAAAGAAGTT TTACCAGATA AAGACCATGC 2880 

CTTTGCCTAT GCGTTATCGT CATTAAACTC TAGTCTAGGA ACAAATTATA GTGTAGAACA 2940 

TGCGGAATTT GTTTCAGACT TTGGCTCAAA AAATGAGTTC AAGTATATTA ATGGTTCCTA 3000 

CGTAAAGAAA CAAGCGGATG GGAAATTCTA CTCAACCGAA GACCTTGATT ATGGGACAGG 3060 

TCCTTCAGGA CTAAAAAATA GTGATTGGGA CGCTGTAGGT CATAAAAATG CGTACTATGG 3120 

TTCTGGTGTA GGTCTAGCTA GAGAGGGTGG ACGTATTTCT TTCTCTTTTG GTATGACAAC 3180 

CAAAGGAAAA GTCAACCTAT CTGGTGCGCA ATGGTTTGCC TTTAGTACCA ATTTAAATGC 3240 

GAAATCAATT AGACCATACC AAAAGAAAGG GAATCCaAAA GAACCAGAAA AAGCAACAAT 3300 

TGAATTCAAT CGATACAAAG CCAATGTCGT TCCTGTTCTT GTGCCgAATA AAGAAGTCAC 3360 

TGATGGTCAG AAAAATATCA ATGATTTAAA TGTGAAACGA GGCGATtCTT TACAATACAT 3420 

TGTGACAGGG gATACGACAG AACTTGCCAA AGTAGATCCG AAAACAGTGA CAAAACAAGG 3480 

GATTCGGGAT ACCTTTGATG CAGAAAAAGT GACGATTGAT TTATCCAAAG TGAAGGTTTA 3540 

TCAAGCAGA 3549 
(2) INFORMATION FOR SEQ ID NO: 860: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 860: 

AATTCTAAAA TTTATTTTAA AATAAGAATT TGCATTGGCA TTTACTTTGT CAGCGGGTAT 60 

ACTTATACTT GATGTATTTT GCATTTATTT CAGAAAGGAA TGGTGACATG CTTTTTGCTA 120 

ACCTATGGAA AGCAATTATT CTTGGTATTA TCGAAGGAAT TACCGAGTGG CTTCCAATTA 180 

GTAGTACCGG ACATTTAATT TTAGTGGATG AGTTTATTAA ACTCGATCTA AGTAAAGATT 240 

TTATGGAAAT GTTCAACGTC GTTATCCAAT TGGGAGCGAT TATGGCGGTT GTnATtTTAT 300 

ACTTCCAC 308 
(2) INFORMATION FOR SEQ ID NO: 861: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 861: 
CACTATCACA ACAAATCAAC AGTTTGGAAT GCTCAGGAAA AGATTTAGCA GATATAAACC 60 
CAACTAACAC TACAGGTTGA AGAGTCCAAC GCTTGCAAAA CAAGAGATGA AATTACTGCA 120 
TGCAGCATAA AGCTGCAGCT TTCAGGACCA ATAGCAGGGG ATCTCCCCAT CCACC 175 
(2) INFORMATION FOR SEQ ID NO: 862: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862: 
TTCCTTTAGT GAAGAGAGTC TTTGGTTGGT GGAAAAAGAC GTAAAGAAAC AGCGAACCAC 60
ATCTGGGAGT TGACTTTTTG AACCCTTAGT AGGAAAGTCC GGGACCACTC GTTAACAGGC 120
GAaGTTATGA ACTTCATAAG GtCGTGTTGC GAAACACGAA TAAACTAAGG TGGAACCACG 180
ATAATTGATT ATTCGTCC 198

(2) INFORMATION FOR SEQ ID NO: 863: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863: 



TGATTATTTT AATAGATTGG TGTAGTTTAA TATCAAAATC AATATTATAG GTGATAATAC 60
ACATAGAGAA ATAAArATAA TCAAAAAATA GAACCATTTT TTAGAAACTA GCATACCTAA 120
TCCTATCGTT CCTAGTGTGT ACAGTACTAT CAAAATAAGA TAGATTTkkG TTCCATTTCC 180
TGAAAAATCC GTCTTGCTCA CACTAGTTAT CATTCGAAGA ATAGCATATA TAATCCCCAA 240
AGAGCACAAC ATCCAAATAA AGCTACTAAT ATATTTAAGG AGATTCATAA CCAGACCTAC 300
TTTCTATTAn ATTACTnAGA GTAGCCTTTC CAAGTTTGTG TGTGCGTCGC TCTAACATCA 360
AAGTAATCAG GA 372



(2) INFORMATION FOR SEQ ID NO: 864:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864: 
CATTGGTTTA AAAATGACAA ACGGTCTTGA AATTTTATTA CACATGGGTA TTGATAnGGT 60

AGAGTTAAAC GGGGCACCGT TCACTATTAA AGTGAAAGAA GGnGACCAAG TGACTGCAGA 120

TACGGTGGTC GCAATTGCTG ATTTAGAAGC AATTAAAGCA GCTGGTAAAG GAACTGAAAT 180

GGTTGTCATT ATTACCAACA TGGATAAAGT AACACAATTT TCGTTAGAGA AAACTGGCGT 240

GGTTACAGC 249 
(2) INFORMATION FOR SEQ ID NO: 865: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865: 

ATTTTCCGAG GAAATATTTA TAAAATGCAT ATTTACCGCC tAAAGCAAAC GGTGGCGATA 60 

TGAGTTTACC TGTTACCTAT TTGTTAAAAA ATGCCACAAA ATTTATTTAT GTCAACACTT 120 

CATTTTTCAA GGTATTTTTT ACTATTAAAG AACTGCCTTA CATTGCTGTT TTAGATGAAA 180 

ATAATTATTT TTACGGCATk TTAACACACA GCACCTTATT AAATATTTTA GCGCAATCTT 240

GGAATGTAAA ACAAGGGAGC TACGTTTTAA CAATTGCTTC AGTTGGTCAA CAAGGCGACT 300 

TAGCAGCCAT TTCAAAAATC ATTGCCAAAT ATAGCAGCAT CGCCAGCTGT ATTACATTGG 360 

ATGTCGATAG TGAAGAATTT GTTCGTCGCA CACTTATTAC TTTACCAGCC GGCACAACTG 420

CCGAAACGTG TACAGCCATT GTAGAACATT TAGAGCGGAA AAACTTTAAA GTTGTAGAAT 480

TGGAAAATTT AGAAgACGAG TAAAATTTAC GACCTTTAGC TTAGGATTTA mAGCTAGGGA 540

GGTAAGCCAA TTTGACTTAC CTCCcTAGCT TATTTTTTTA GACTTCTTCA CCAATAATTC 600 

GAACTTCCGT TTGCAATTCA ACATCAAATT TTTCTTTAAT CACTTCTTGG ATATGTGCAA 660 

TTAATTCCAC ATAATCTGTT GCG 683 
(2) INFORMATION FOR SEQ ID NO: 866: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866: 

AATGTAATAA TGCCCATTTG AACTTGTGCA TTTGGTTGGT TTTCTTTATC AAAAAAAATC 60 

GTCATTTCTT CTGATTCTGC TGATTTAATA AGCAATAGCT TCCCCTCCTT TTTTAAATCT 120 

TGTATCTTCC ATTATTCCTT TAAATAGAAT CAAAGTCAAT CTTTTTATAG AAAATTCTGA 180 

ATATTCTACA CAAAAAAAGA GAGCATGAAT TCTCTCTCTT TTGnCTCTTT AAACTTTTAC 240

AATCCAGCcT TCTGGCGCTT CTTTGTCACC AAACTGTATG CCAACCAGTT CATCATATAG 300 

TTGcTTAGtC ACTGGtCCAA CTTCCGTTTC ACTATAAAAT ACATGGaAAT CTGTGCCGTT 360

TTGaATCCCA CCAATAGGAG AAATAATCGC CGCTGTGCCA CAAGCACCCG CTTCGGAGAA 420

GTCCGCTAAT CGATCAATGT AAACGTCTCC TTCAAGCGCC GTCATCCCTA AACGTTCTTT 480

TGCTAGCGTC AATAATGAAT ATTTAGTAAT ACTTGGCAAA ATAGACGnTG ATTTCGGCGT 540

AATAAAAGTA CCATCTTTGG TTATGCCAAA GAAATTGGCT GCACCAACTT CTTCAATTTT 600 

CGTATGCGTA TACGGATCTA AATAAATGCA ATCACTAAAT TCTTTCTCAT GTGCTTCTGC 660 

ACCTGGTAAA AGACTAGCTG CATAATTGCC CCCCACTTTA GCTGCCCCAG TTCCTC 716 
(2) INFORMATION FOR SEQ ID NO: 867:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 867: 

TACGTTTCAT CCATTTTCCA TGAATAAAAG GATTTTTTAT TTTTCTTTTT CCAAATTTGA 60 

TAGAGTAGTT TGCCATATTC TTGCACCCAA CGATAAATCG TCGTATGAGA AACGTTAATG 120 

CCaCGATCAT ATAAGATTTC yTGaACyTCa CGATAGCTAA GGTTATAACG rAGaTaGtAG 180 

CCCACGGCTA CaATAATCAC ATCCTGCTGA AATTGCTTTC CTTtAAAATG ATTCATCGTC 240

ATTCCTCCJcG CTATCTTTTT CTATTATTCT ACCTTATTTG ATAGTAGATT TAAAACTTTG 300

CAACAGAACC AATGGTATTA TAACAAGAAT TACTAATGTT AACTTTGCAC TAAGGTTAAT 360 

TAGGACTATC CCATATAAAA GAACTTCAAA TATTGATAAC ACAACTTTGG GTATTGTAGA 420 

ACTAACTAAA GTATACATAC TACTTGCATC ATTTGATAAT ATACTAGCGA TATTACCAGr 480

AAGAGAATGC TCAAAGAAAG AAGTTCGCAT ATTTAGTATT TGTTTTATAA CTTTAGCTCT 540 

TACTTTATTT ACAATAATTA TTCCTATTTT TTGTAGATAA AA 582 
(2) INFORMATION FOR SEQ ID NO: 868: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 868: 

TGATGTCACT TTCTGTAACG GTACATCGTA AAAAGGATCT TCACGACGTC CTAAAAAGTC 60 

TTTTCCTTGT TGCGGGAACA TCGCATATAA TAGGACATCC TCTTCAGATT TTGCATATTG 120 

TTGGATTTCC TTTTCTAAAC TATGCAACTC TGGTTTCAAT AAATCCGcgG TCGACAAtGA 180 

TTACCTCTTC ATCCCCAtGA TTTTTGCTTA ATCTCTcgCG ATAGTGcGGT GATTGCCATA 240

TcTCCGCGCA CATAAyCTTG ATTCATtCGk gATtAAgGAT ATCTTCCCCA GAAATCACAT 300 

CCTTAAGGCT GTGGTCCAAC CATTGTGAAA GCGGCGGACT AGGGGGAACC CAAACTGCCT 360

ACTTAGGCTC AGCAAAACTC TCAATTGCTG GAACTGCCCG GAATGCTAAA AGGTGAnACA 420

AC 422

(2) INFORMATION FOR SEQ ID NO: 869: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 869: 

TACTCAGGGA AATCGAATTT tCTTTCTCTT CCTGCAGGTA CTAAGATGTT TCAGTTCTCT 60 

GCGTCTACCT CTAATCAGCT ATGTATTCAC TGAAAAGTAA TATCCTATAA AAGATATTGG 120 

GTTCCCCCAT TCGGAAATCT CTGGATCATA GCTTACTTAC AGCTCCCCAA AGCATATCGG 180 

TGTTAGTCCC GTCCTTCATC GGCTCCTAGT GCCAAGGCAT CCACCGTGCG CCCTTATTCA 240

CTTAACCTTA TCAACCTTAC GGTTGGGTCT TrATCACTTC TTCTAGCGAT AGAAGGTTAA 300 

TCAATAAATA AGCAATTGAA CTTATtAAAA AACTCATTCA ACGCGGTGTT CTCGGTTTGT 360 

TTTGATTCTT TTTACTTCAA TATCCAGTTT TCAATGAACG AATCTGATAG CAAAAGCTAT 420

CAATGGAGCc TAGCGGGATC GAACCGCTGA CCTCCTGCGT GCAAAGCAGG CGCTCTCCCA 480

GCTGAGCTAA GGCCCCGAAT TTATTTGATT GAGAGTAGAC CTCTCAAAAC TGAACAAAGT 540

AAAGACGAAA TGTGTATTTC CGTAATATTC CTTAGAAAGG AGGTGATCCA GCCGCACCTT 600 

CCGATACGGC TACCTTGTTA CGACTTCACC CCAATCATCT ATCCCACCTT AGGCGGCTGG 660 

CTCCAAAAAG GTTACCTCAC CGACTTCGGG TGTTACAAAC TCTCGTGGTG TGACGGGCGG 720

TGTGTACAAG GCCCGGGAAC GTATTCACCG CGGCGTGCTG ATCCGCGATT ACTAGCGATT 780

CCGGCTTCAT GCAGGCGAGT TGCAGCCTGC AATCCGAACT GAGAGAAGCT TTAAGAGATT 840 

TGCATGnCCT CGCGGTCTAG CGACTCGTTG TACTTCCCAT TGTAGCACGT GTGTAGCCCA 900 

GGTCATAAGG GGCATGATGA TTTGACGTCA TCCCCACCTT CCTCCGGTTT GTCACCGGCA 960 

GTCTCGCTAG AGTGCCCAAC TAAATGATGG CAACTAACAA TAAGGGTTGC GCTCGTTGCG 1020 

GGACTTAACC CAACATCTCA CGACACGAGC TGACGACAAC CATGCACCA 1069 
(2) INFORMATION FOR SEQ ID NO: 870: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870: 
CCTGATGAAA TTGTCGACTT TTATGAAGAC TTACAAGAGA TTGATTTATC AGGAAAAATC 60 
TTCGGCGTTT GTGGTTCAGG AGACACGTTT TATGATGATT TTTGTAAATC GGTGGACGAC 120
TTCGAAGCAG TCTTTACTCA AATT 144
(2) INFORMATION FOR SEQ ID NO: 871: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 871: 
TCnGAAAGTT ATCCACAGAT GTGGATAATG TGGACAATCA GTGGCGAATA TGTGTTTGTA 60 

TTGAAAAAGC TGTTAAATAA AGTAATTTTT TAGTTTGTTC TGTGGATAAC TCTGTGTATA 120 

TGTGTATAAT TATGTGGAAA CGTTGTGGAA AGTAGCCAAA CTTCGTGCTA AAATAAGAAA 180 

GAAGAGAAAA CGAGAAAGTA GTGGAATAGG TGAAAATTAA ACTAGTAACA GTCGGAAAAC 240

TGAAAGAAAA ATACCTGATT CAAGGCATTA ATGAATATCT GAAACGTTTA AATAGCTATG 300 

CCAAGATGGA AATCATTGAA GTACCAGATG AAAAAGCCCC AGAAAAACTC AGTGACGCTG 360 

AAATGCTTCA AGTCAAAGAA AAAGAAGGCC AACGTATTTT GGGGAAAATT AATGACAACG 420

AATACGTCTT TGTTTTAGCC ATTAATGGCA AACAATTATC TAGTGAAGAA TTTTCAAAAG 480

AnATTGAACA ATTAGGGATT AGTGGCAAAA GTAATCTAAC CTTTGTCATT GGTGGCTCCT 540 

TAGGTCTCAG TGATTCAGTC CTTCAGCGAA GTAACCAACA AATTTCTTTT GGCcGCTTGA 600 

CTTATCCACA TCAGTTAATG CGTTTGGTAT TAGTGGAACA AATTTATCGC GGTTTTCGGA 660 

TTATGAAAGG GGAACCGTAT CATAAATAGG AAATAGGCTG GTTGCTCAGT AAGTTCTATG 720 

AATCAAGnCA TTGCTTGAAA AATATATGAA GTAAAATATC TTTAGTCTGT TTGTCTGATA 780 

GAAAATAAAA AAATTCAATC AAACTGGATT GAGAGAAACG TtACCGTTCA AACAATAAAT 840 

AAAAaTGAgA GGAATGGCAA AAAmCAtGGC A 871
(2) INFORMATION FOR SEQ ID NO: 872: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872: 

AATATTTTTA TTAGCGTGTT CAAATCCTGA TTTCTTGTAT CAATTATTTT CTGTTCACTT 60 

TCAATAAGAA AAAaGTCTTT ACTTCCATGG CTAATAATAa GTCTCTATTT TtCAtAATAT 120 

gCAAAATtAT ATCTTAATag AAAaTATTTT TATTTTAAAG ATAAAAAATT CTTCATCCTG 180 

CAATACTTTA TATTCTATAT TGTAATTATT CAGAATATTA CTTACAATAT ATAAACCTAA 240

TCCATTACTA TTTCCCTTAT TTAAATCAAA TTTAACATCA AATATTTTAT CCATATTCGA 300 

AATTTTATTG TTACCGTATG AATTCTCTAT ATATAACCAA TCATT 345
(2) INFORMATION FOR SEQ ID NO: 873:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 443 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873: 



nCCCCCATTA TATGGAATAA CACCTGGAnC TATATTTCCA AAGGATCCCT AAAATTAnGG 60
TTTATGTCCA TGGTTTCAAA TGTATCTAAA TACTTTTCCG ACAAnCCTTC aCkGAGTAAA 120
TCaAATTGTA AAAGTAACTT TTGGCACTGC ACCTCCaGTG CCTTTTCGTC CAAATGATAA 180
TCTATTAATT GAGAATATAA TTTTTTAGTT TCTTTrGTTC TGTTGCAAAG TTTTAAATCT 240
ACTATCAAAT AAGGTAGAAT AATAGAAAAA GATAGCAGGA GGAATGACGA TGAATCATTT 300
TAAAGGAAAG CAATTTCAGC AGGATGTGAT TATTGTAGCC GTGGGCTACT ATCTTCGTTA 360
TAACCTTAGC TATCGTGAAG TTCAAGAAAT CTTATATGAT CGTGGCATTA ACGTTTCTCA 420
TACGACGATT TATCGTTGGG TGC 443



(2) INFORMATION FOR SEQ ID NO: 874: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 874: 



ATTTGTTTAT GAGCATCAAG ACATTCTCTT CTAGAAATGA AAAGTAGGTT ATTTATCCTT 60
TCGTAACATG TTAGGCTTAT TCAACTATGC TTCCAATCGA AATTTAATTA ACTTAACAAC 120
TTAAAATGTT CTCTTTATTG ATTTTCATAA TTTaAATTGT TCAAATTGAA TTAGATAAAT 180
AGTAATACTT CCCATATTAA AATTTATTTT TTTTCAGCCC TATAATACTT ACAATTACCG 240
TGAATAATCC TAAATACATG GGAAGACCAT TATACTTTTC TCCAGTTTTC GGTAACTCCA 300
TAGTATTTTT ATTTCGATTA CTnCn 325



(2) INFORMATION FOR SEQ ID NO: 875: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875: 

GATCAATCCA ATTATTTGTA TTGnATATTC ACATCGGTCT TGCAACATTG CGTAAGCTAT 60 

TTnACTATCA TTAGACATAG TTTTATAAAA TTCTGATGTA AAAAGTACTT TAGGTAATTG 120

GTAATACAAA TTTCTATAGG TTTCATTAGC TGAAATAAAT TGAAAATCGT TCATAGCTTT 180

ACCTCTCTGT TTTTGTTAGT TTATAGTTTA ACCTACTTTT ATTTCCCTTG CAAATACTGA 240

TATAACAGTA TTTGCCTGCT nTCTTAGTAT ATCACGATAT TGCGATATAA CGCAATTGTA 300

ATATTGGAAT ATCGTGATAT TACAATATTT AAAAGATTAT TATTGATACA ACAGCGTTTG 360 

TAGAAAAAAA ACAACAACAA TATTATGATA TTGTTGTTGT CTAATATTGT AATATTGTGA 420 

TTCCACAA 428 
(2) INFORMATION FOR SEQ ID NO: 876: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876: 
GGAAAGnAGT AGTCTATGAT AAAATAGTAA AAAGAGCAGA AATGAGGACA AcgAATGAAA 60 
ATTGGAGTCA TTGGTTTAGG AAATATTGCA CAAAAAGCGT ATTTGCCAAC TTATAGTGAA 120 
TGCCGCCACT TGGCAGAGTT TGTATTAGCA ACTAG 155 
(2) INFORMATION FOR SEQ ID NO: 877: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 877: 

GCCGAATAAA AGAAGTTTCT GATGATATTG AAGTTGTGGC TATCAATGAT TTGACTAGCC 60 

CAACAATGTT GGCTCACTTA TTACAATTTG ACTCTACGCA TGGCACCTAT CCTGGCACCG 120 

TGACAGCCAC TGAAAATGGC ATTGTGGTTG AnGGCGAAGA nACCCGTGTC TACGCAGAAC 180 

CAGATGCAAG TAAAATCCCT TGGGTTAAAG AAAACGGAGT AGACATTGTC CTTGArTGTA 240

CAGGTTTTTA TACCTCTgAA GnAGAAGCAC AAGCTCA 277 
(2) INFORMATION FOR SEQ ID NO: 878: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878: 

AAAAGTATAC TAGTATACTG GTATACTTTC CTTTAGCAAA CCTTTTAACA AGCAAGATTT 60

CTAAGGTATA CTGGTATACC TTTGAATAAA AGTTTAAGTA TACCAGTATA CTGnCACGAT 120 

AAAAGCAACA GTATACCAGT ATACTTACGA TCATATTTTG GAGTATACTG GTATACTATT 180

GCTAAaGAAA ATAAAAAATA ATGATTTACA AATwTAATTA TGTACGGTAT TTTTTAAGAG 240

TATTGAATTT ATAAATATTA AGATAAAAAA ATAGAGACGG TAACTTACTT GATCGCCAAA 300 

TCAAATAAGT TACAGAATCA AAAATGTACA CCCATTTTCA ATTCTTTTGC GTCCCTGGAT 360 

GATA 364
(2) INFORMATION FOR SEQ ID NO: 879: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879: 

CATGAGTGTT GCCTGGCAAT ATTTGTGGGG ATTTTTTGAT TATTGTTAGC ATTTTTCTAG 60 

ACAAGAACAA CTGAAAAAAG AGTTTCTTTT TTCCATTATG TTTTGTTTCT GAGTTTAACC 120 

AAATCACTCA TTATTTCAAG GAGGAATTAG GTTATGTCAG TaCAATTAGA AGTCAAAGAC 180 

AGAGCCTATC CGTCCAAGGT TCAACTAAGG A 211 
(2) INFORMATION FOR SEQ ID NO: 880: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880: 



ATCTTTTATT TAGAATGAAT AGGgATCTGA AaTaTAAGaT TTTTTGATGG AGAAAaCCAG 60
GATGATTTAA TAGTTATyCA TATACATATG ACTCAACACG nAATAGATGG TGATTAACAG 120
AGAGTTATCC AAATTATTAT ATATAGAAAT AATTCAAAAT TATTTTAAAG AAATTCTAAC 180
TTTTCACTCG ATAAAATATA GTGTAAAAAC GCAACTTGAT TCAGAGGATG AAAATGAAGT 240
GTTCAGATAT GAGTTGAAAA ATACTTATTT CAATTGATTG AGTAAAGCTG TTTTATCATT 300
ATTAGTAATA TGAAGATTTT TATATCAGAC TTAGAGAAAA GCTTTACTAC ACTCTTGTTA 360
ACTATAATGC TTTTAGCGAG AAATAATACC TAAAGTGTAA GCTAAAAATT TATCATAT 418



(2) INFORMATION FOR SEQ ID NO: 881: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881:
ACATCTTTTT TnCTGGnACA TTAATTGnGC TACTTTTTTT AATTGTGTCG AGTTAAATCT 60
CCAAGGAACG AACAGTAATT TTAGCGCATG AATTAACCCC TAGTnAGCTA CGTCGATTTT 120
GGCAAAACTG ATTTATGAGT ATCTAAAAAA CGAACAAATC ATCGGCCTGG AAAAATTCTA 180
TACCTTTAAC TGTGCCGATT ATGCCAATAA TCCCGAATTG CTATCATCTA TTCTCTTCGG 240
CCATACCAAA GGCGCATTCA CTGGGGCTGA ATCAGAAwAA CrAGGACTCT TGGCCCAAGC 300
TaATAATTCC GTACTCTTTT TAGATGAAGT ACATCGATTA TCCAACGAAA ACGAAGAAAA 360
ACTCTTTCAG TTTATGGATA CAGGGACCTT CCGCCCTATT GGTGAAGA 408



(2) INFORMATION FOR SEQ ID NO: 882:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 465 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 882:

CTTCAAAAAT CTCGCGATTG TGATCGCGAA TAATATACGT CATTGTAGCT TCTTCTGGCG 60
TTCCAGCAAA CGCTGCTAAA TGATAAAAGC CTTCTTCACC TGCTGTTTTT TCTGGAACTT 120
CGTCAGCAGG TAAGGCATTA TGGAAATCAA TTCCTAATTG TAACGCATTA ATCATTGTAT 180
TTTTGGCTGT TCCTGGATGA ACGTTTTTCC CTTGAATTGT AATTTCTGCT TGCGCCGCAT 240
TAAACGTTTC AAATTGTAAT TCGCCTACTG GCCCACCATC CATTGTATAA GCAAAATCCA 300
CGTTAAATTG AGCGACATCA AATTTGTCGG GCACCTACAC CAATTTCTTC GTCAGGACCG 360
AAAAGCCACA CGAATGGGGG CCATGTTTAA TTTCnGGGGn GATTAAATTA AATATTCCCn 420
GGGCGGTCAA TAATTCCGCA ATAnCCGGAT TTATCAACCA GCTCC 465

(2) INFORMATION FOR SEQ ID NO: 883:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 341 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883:
AAAGTTTTGA TGATAAAAAT TACTATTGCT AACTTATTAG AAGGGGTCCC ATTATTACTC 60
TTATTAACAT TATACAAATA ATCTTTCCAA TCACTCCTGC AAACGGGCTT CGTCTCTAAC 120
TGAAGGTCTA TTACATTAGC ATTGTGCGCA CATGCATTCC TAACAAAATT TATACATTTT 180
AGCCATGATA ATAATTCAAT ATCCGTACAT CTAAACATTT TTGAAAGTTT TCGTTTATTT 240
CTAGTAGACA TAATTTCTAC TAATCTAACA AGTTCACCAA AAGTAAGAAC TTCCACAGCT 300
AACCAAATAC TTGGaAAATT ATCTTTTTCT TTAnTCCTnn T 341
(2) INFORMATION FOR SEQ ID NO: 884: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884: 

AAAGATGGAT TCTTCTTTAC TCACACCTAA AAGAnGTTTT TCTGAATCCT TTAGTAAGAA 60 

AAACCGGTGA CTCTATCGTT AACAGAGAGT AGTTGTCAAC GCAACTACAT AAGGCTTATC 120 

TTGTGAAAGA TAAGTAAATT GAGGTGGCAC CACGAATGCG ATTCGTnCTC TTGGCTCACA 180 

GCCAAGAGGC TTTTTTG 197 



(2) INFORMATION FOR SEQ ID NO: 885: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 885: 



CGGAATTGGA CTAnCCnATT GAAAACCATT TCTTTTTnTC CTAAAAAAAT CGAAGATGGG 60
TAATAAAAAG AATGGGTAAT CTGATAAAAA AACGAACTTT ATCTAATCGG ATAAGAAAGA 120
ATAATCGGTA TTGAAAAAAA GTTTAAAAAA AGTGAAAATA ATGATTGACC AATGCTTGAA 180
ATCTTGGTAT ATTATTTACA TGGTCGCAAG GCGAGTCACA CAGCTGAGOG A 1 i L A\j AACj 1 i
CAATAATTAA ATAACAATTC TTGATTTCAA ATCTCTCGAA AAAAGTTTGA AAAAACAACA 300
AAAAAGTTGT tGACaAATAiu CAAACGAGAT GATAAGATwT AGAAGTTGTC ACAACAGACg 360
ACaACAACCG AGCAAAACAA CTTAAAAAAA CTTTTTTAAA AAAGTTCTTG ACATTCrAAy 420
GAAAGTTTGT TAwGaTATAA AAGTTGCTGC GAGGTAACGT AGACCTTTGA AAACTGAACA 480
AAGAATAGAC GAACCAAATG TGTAGGGCGT CTTGATTCAA TTCAAGACAA CAAACATTTT 540
TAACAAGCAA GCAATATGCT AGCAAACAAA TTGAGCTTAA CAATCGwAAG ATTGTTCTAA 600
CTTTTATTAT GAGAGTTTGA TCCTGGCTCA GGACGAACGC TGGCGGCGTG CCTAATACAT 660
GCAAGTCGAA CGCTTCTTTC CTCCCGAGTG CTTGCACTCA ATTGGAAAGA GGAGTGGCGG 720
ACGGGTGAGT AACACGTGGG TAACCTACCC ATCAGAGGGG GATAACACTT GGAAACAGGT 780
GCTAATACCG CATAACAGTT TATGCCGCAT GGCATAAGAG TGAAAGGCGC TTTCGGGTGT 840
CGyTGATGGA TGGACCCGCG GTGCATTAGC TAGTTGGTGA GGTAACGGCT CACCAAGGCC 900
ACGATGCATA GCCGACCTGA GAGGGTGATC GGCCACACTG GGACTGAGAC ACGGCCCAGA 960
CTCCTACGGG AGGCAGCAGT AGGGAATCTT CGGCAATGGA CGAAAGTCTG ACCGAGCAAC 1020
GCCGCGTGAG TGAAGAAGGT TTTCGGATCG TAAAACTCTG TTGTTAGAGA AGAACAAGGA 1080
CGTTAGTAAC TGAACGTCCC CTGACGGTAT CTAACCAGAA AGCCACGGCT AACTACGTGC 1140
CAGCAGCCGC GGTAATACGT AGGTGGCAAG CGTTGTCCGG ATTTATTGGG CGTAAAGCGA 1200
GCGCAGGCGG TTTCTTAAGT CTGATGTGAA AGCCCCCGGC TCAACCGGGG AGGGTCATTG 1260
GAAACTGGGA GACTTGAGTG CAGAAGAGGA GAGTGGAATT CCATGTGTAG CGGTGAAATG 1320
CGTAGATATA TGGAGGAACA CCAGTGGCGA AaGCGGCTCT CTGGTCTGTA ACTGACGCTG 1380
AAGCTCGAAA gcsTgGGGAG CnAACaGGAT TAGATACCCT GGTAGTCCAC GCCGTAAACG 1440
ATGAnTGCtA ArTGTTGGAA GGTTTCCGCC CTTCAGTGCT GCCAGCnAAA CGOCnTTAAG 1500
CACTCCGCCT GGGGGAGTAC GACCGCCAAG GTTGAAACCC CAAnGAAT 1548



(2) INFORMATION FOR SEQ ID NO: 886:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 486 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 886:

AAAAAACAAC CATTGCCTAT GTAGAACAAC GTAGCGCCTT GGATCTTAGT TTCCCAATTA 60
GCGTTTTTGA AACGGTCTTG CTAGGAACCT ATCCAAACTT GGGGACTATT AAAACGCCCA 120
GGAAAGAAAG AaAAGCAAGC AGCCATGGCT GCATTAAAAA TGGTGCAATT AGAAGACTAT 180
GCGCAACGCC AGATTGGCGA ACTTTCTGGT GGCCAATTAC AACGTGTGTT TATCGCCCGT 240
GTTTTGGCCC AAGGTGCTGA GGTGATTTTT TTAGATGAAC CTTTCGTCGG CATTGATATG 300
TCTAGTGAAA AAGTGATTAT GGATATTCTT AAATCATTAA AAAATCAAGG TAAAATGATT 360
ATCATGTTCA CCATGATTGC ACAAAGTGTC CCACTATTTT GATGAATTAA TCGTTTGAAA 420
AAnCGGCTAA TTGCTGCnGG GCCTGGTGnA CCAAACATTA CTGCAGAACG CTCCCAAGAA 480
CATACC 486

(2) INFORMATION FOR SEQ ID NO: 887:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 271 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 887:

CAGACGCGCA CAACCATTAA AACAGGTGAC GATTCTCTTT TATGTGACTG TGCCAGAAGC 60
CTTTCGnGAA CGGGCGACAC GGTGTTTGAA GCTTGTCTCC CCAAACTGTC CAGTGATTCA 120
ATCATTGGCT GAAAGAATTA CTGTCGAAGA AACAGTTGTA TTTAAAAAGT AAACCATTAG 180
AGACATTATC GTTACGAGCG CTTTTCCCTT TTTGGATGCC TCGTTTCTGG GAATGTCTTT 240
TnTTAGAGGA GCGTAGGGAA ATAGCTGTAG A 271
(2) INFORMATION FOR SEQ ID NO: 888:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 888: 

GCAGGCAGAG GAGGCACCTG TAGGGGGTGG TGGGCTGGTG GCCCAGGAGA AGTCAGGAAG 60 

GGAGCCCAGC TGGtGaCAAG AGAGCCCAGA GGTGCCTGGG GCTGAGTGTG AgAGCCCGGA 120 

AGATTTCArC CATGCCTCAC AGCTCCGACA GCAGTGACTC CAGCTTCAGC CGCTCTCCTC 180 

CCCCTGGCAA ACAGGACTCA TCTGATnGAT GTGAGAAAAT TCAnAGGAAG GAGAAAAATT 240

CGTATTGCCG CCCAGAAGAG CCGACAGAGG CAGACCACnG AAGGCCGAAA CCCTGCAACT 300 

TGGAGAGCGA AGACCT 316 



(2) INFORMATION FOR SEQ ID NO: 889: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 889: 

AGTCAAGACC ATAGCCTTCT AAATTGATTT CAGCAATATC AGCTAaGGGC GTGCGCTGCA 60 

TCGATGTCTT CwTGTGTGCA AGTTGGTGAT nTAAATAATA ATGTATCAGA AATAATCGCT 120 

GAGACCATCA TACCAGCAAT TTGAGCTGGA ATTGTCACGT TATTTTCTTT AAA 173



(2) INFORMATION FOR SEQ ID NO: 890: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 890: 

AAGAAGGCGG CAAAAAAGCG TTTGAATTAA TTAAAGAAGG TATGAACAAT GGTGCGAAAG 60 

TTTTAGGCTT AGCAACAGGT AGTACACCAG nAACATTATA TAAAGArATG ACTGCAAtGA 120

tGTCGACTTT ACAGAAATGA CTTCTGTAAA CTTAGATGAA 160 



(2) INFORMATION FOR SEQ ID NO: 891: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 891: 
AGGTGTTATT TTAACATTAG TAATAATTGC AGCATTACTT ATTTTATCTA GTAGCTTAAA 60
GTTGCAAGTT TTATTGGTTA AGTTTGCCAA TCGTTGGTTT AAACGACCAA AACTAAGAGA 120
TTGGGTGGAT AATTGTAATT TACAAATCTA TTCCTTGCGG GCAGCTGTCC AAACAATTAC 180
ACAAGAyCUC TCAGCGTTGC TTCGAATCTT TTCATGGAAT GTATTTAAAT TATTATTTTG 240
GTACATTATT CCTTACATTG TTTTAGTAGA AAATCACCCG AATATTGATT TATTATTAGT 300
GATGTCCTTT ACCAGTTTTG CTGGTTATTC TTTCCAGGAn TTATTCCCGA CACCTGCTGG 360
CATTGGTCCA TTTGAAATTG TTTACTTATT GGTTGTTTAA ACC 403
(2) INFORMATION FOR SEQ ID NO: 892: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 892:
nAAATACGAA TTCGAGCCCG GTACCCAAnA AACAATTCAA ATTAATTGAT TGTCGCAGTG 60
AAGGAACTAC CAATTATTAC TCACTTTCTC ACGATACCAC CAAAATTGAG CAACTCCAAC 120
AACTATTGAC CCACATTACG TCGATTATGC CAGGGAGGAA GCAAGCATGA AAAAAATTCT 180
CGTGGTCCTC ACAAACGTCT CCCGTTATCA CGGCACCGrA raACCAACGG GsCTCTGGcT 240
AGGTGAAGCC ACGGAGTcGT GGAAGAAGTA ACCAAAGCAn GATTTTCAGT TGACTATGTT 300
AGTCCCCAAG GAGGCTATGT CCCTTTGGAT CCCCGGAGTA TGAAATATGT AGATTCATCC 360
ATTATGGCAG TTTATGAAAG TGCTGATTTT CAAGAACGAG CGCTAGCACA TTCCCTCTCT 420
CTTGAAGAAA TTGAGCCAGA TGAATACGCA GCAATTTACT TTACAGGAAG CCATGGCGTT 480
ATGTGGGGAC TTTCCTGAT 499
(2) INFORMATION FOR SEQ ID NO: 893: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 893: 
CTGATGCGtA TTTTCTCCTT ACGCATCTGT GCGgTATTTC ACACCGCATA TGGTGCACTC 60 
TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGCC CCGACACCCG CCAACACCCG 120 
CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC TTACAGACAA GCTGTGACCG 180 
TCTCCGGGAG CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC GCGAGACGAA 240
AGGGCCTCGT GATACGCCTA TTTTTATAkG TTAATgTCAT GATAATAATG GTTTCTTAGA 300
CGT 303

(2) INFORMATION FOR SEQ ID NO: 894: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 894: 
ATCAATTACG CCTTATCAAG AAAAAGGACG TCCAAAGCAA CCAGAAAAAG CAACGrTTGA 60
ATTCAATCGA 70

(2) INFORMATION FOR SEQ ID NO: 895: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 895: 
GGACCATTTT AAGGAAATTA CCCGGGGGGG AAAATTTTAA AAAGCCCTTT TGGGGGCCGG 60
CCAAAACCCA AATTTTTAAG GAAAGGTTTT TGGGGGTTAA ATnTTTTAAA TTTGGGGGGG 120
GTTGGcaATT TTTTTtCCCC CAAAACCCAA TTTTTTTGGG GGTTCCCTTG GCCCCCGGAA 180
CCAAAAATTG GGGGCCTTAA CCGGGAAAAA ACCCCCAAGG AATTTTTTAA ATGGGTTT 238
(2) INFORMATION FOR SEQ ID NO: 896:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896: 



TTTCAAATAG TATAGTTTGA ATGCTCAAGT ATTTCAGCAA CTTTTATAAC ACAGGAGGTA 60
ATAAATATGG CTGTACCAGC GAGAAGAACA TCGAAAACAA AAAAAAGATT GCGTCGAACA 120
CATGAAAAAT TAAAATCACC TGAAATTTCG TTTGATGAGA ATCTTGGTGA TTATCGTAAA 180
AGTCATCATG TATCTTTAAA AGGCTATTAT GGTGGAAAAA AAGTGATGGA TAAAAAATAG 240
GAGGTAATAG TATTGAAGAA AATGAATATT ATTTATCCAC CACTTGTGGA ACAGAGCTTT 300
CAATTTTATC AAGATTATGA ACAAGAAAGA TATGATAAGT CTGAATTATA CAGAATAATG 360
GTTATGAAGA ATATTATTAA TGAAAATGGC ACTCCAACAG AAGAGGCTCT TAAAAAAGGA 420
TTAGTTAAAG CTTTTATGAA GATATGATTT ATCTTT 456



(2) INFORMATION FOR SEQ ID NO: 897: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897:

TCAAAAAGCG GGTTGGCTTT GAGCGGTCCA GTTGTGAGAC GATTAAACAA ACCTGAATGG 60 

ATTCAATTGT ATTTAGATGA ATCAAGATTA GCATTGTTTG TAATTCCCTG TAAAGCGACG 120 

GACGAAGGCG CTAGAAGTTG TGTAAATCCG AAGTCAAAAA AGAAAGCAGG GTATCGTAAA 180 

AGCTGGTCAG GTAGCATTTT AGAGAAGGTA GCAAAAGCCA GCAAGATGGA TATTGAAAAT 240

CATCGTTATC ATGTTGAACC AGnAAGCGTT GAAGGCTATC CAACTGACTC TTGGTTTTGA 300 

TTTAACGA 308 
(2) INFORMATION FOR SEQ ID NO: 898: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 898: 
CAAGTGCTTG GTACTTTAGC CACTGGTTCT GTTACCGGCA CGTTGCTTGG TCCTCTTTTG 60 



(2) INFORMATION FOR SEQ ID NO: 899:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 484 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 899:

TTCATTAATC ATATTCGTnG CTGCATTATT TTTATTCGTT ACATAGCTGT CAATTTGGGA 60
AAnGGnGTTA TTTTTTTGTG AATCAATTGA TAAGTTAATT GTATAAGTAA CATAAACGCA 120
CAAAGCAGCC ACCGTTAACA AAATAATACT TGTTAACAGT AGACTGTATT TTTGTACTAA 180
TCGATTTAAT AAATATTTTT TTTTAAACAG TTTTTTGAAG ATTCGCTTCA TTTTGTTCCA 240
ACTTTCCATC AATAAACTGG CGATTTTTGC GCGTGACATA GGCACACCAC ATCATAATTA 300
AGGGCAAACG AAGCAAATAG TAGCAGTnCT TTCATTTTCC AGTTAAAGCA ACGATACTAA 360
TTACACCGAA TAAGACTTTA AAAAATACAC CGAGGTTTAA AAAGATACTG ATAAATGGAA 420
GGCTCATTAA ATCAATAGGT TGAGATAAAA TAGCTCCCTC CTATAGTAGC CTATAAATGT 480
AAAA 484

(2) INFORMATION FOR SEQ ID NO: 900:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 181 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 900:

AAAGTCTCTT TTCAATCATT TAGTATACAC CTGAAAAGAA GAATAAACTA TCATTAATTA 60

AAAGAAAAAG TGAAAsAACT AGCGGATTTT TTGTTCAGCT GATTATCTTA AGAATTAGAA 120

AATGAGGTCG GGACAGAAGT GTTTAACTCC TAGAACCAAG TAGGTACTGT TTAACATCTA 180

(2) INFORMATION FOR SEQ ID NO: 901: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 901: 



TTCCGTGAAA TACGGCGTTT GGACAGGCAA AGAATTAACA AATCAAGGGA GCTTTCCAAC 60
ACCTGGCTCT TGGGAAGAAA TGAAAGCGCA TTTATATTCG GTTTATGCAG ACAAAAGAAA 120
TGAGTCGATT AGTGGTGTTG CTTTTAGTTC ACCTGGAGTA GTTGACGAAA AAAGTCAACA 180
GATTTTAGGG ATTTCCGCCA TTCCATATAT TCATCATTTT AACATTTATG AAGAATTAGA 240
AGCGTTGTTT GGCTTACCAG TGACTATCGA AAATGATGCG AATTGTGCTG GTTTAGCTGA 300
GATATATGAA GGCGCGGCGA AAGGGAAAAA AGAAGTGCTT TTTGTCGTTA TTGGGACAGG 360
AATTGGCGGA CGATTTTTCG CAATGGTGAA TTGTACAAAG GCGCTCATTT ATATGGAGGC 420
GAGTTTGGAC TGAACTTTTT AAGTAATGGC CAAACaTTCA GTGAAATTGG CACAGCTGTC 480
AAAATGGCTC AgCGCTATTG TGAACGAATT GGAGTCGAAA AACAGGCGGT GACAGGAGAA 540
GAAGTTTTTG AACTGGCACA ACGAGGGGAT GAGATTGCTC GCGAAGAAGT CAATAATTTT 600
TATGATTATT TAACACAAGG ATTATTCGGC TTACAGTTTT CTTATGATCC TGAAATGATT 660
GTACTCGGAG GCGGTGTTTC TGCTAAAGAA GGCTTATTAG CAGAGATTAA TCGCCGGATG 720
CTAACGCATT TACAAACCTT TGAATTAAAA GATTTTGTTC CAGAAATCGT GACCTGCCAT 780
TATCAAAATG ATGCCAACTT GAnTGGCGCA CGCCCAACTT TCAAGCCAAA ACGATTGGGA 840
CTATA 845

(2) INFORMATION FOR SEQ ID NO: 902: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 902: 
ATATCAAATG AGAAAAAGTT AAATTGATCA TCCTCTTCCG TCAACAATTC TTTCATCAAT 60

GTTGTTTTAA ATAATTCnGT TAAATAAAGT TCTGTTCCTA TTAATAAATA CACTGGAGCG 120

ATTTTTTGCT CTCGCACTTC TTTCAAAGCT TTTTGTAACT CCATTGGCTC GCTCCTTTTC 180 

TCGTTTCTCT TGTTTATCCT ATCATGTTAC TTCTGGAAAT ACTAGTCTTG TTCkTTAACA 240

AATTTGGCTT TAGAAAGCCC TTGCCaAGGt AACCATTGAT AAGAAATCAT TCCTGGAAGA 300 

TCGGTCCGAn AATAATnTAT GGTGTGCTCC 330 
(2) INFORMATION FOR SEQ ID NO: 903: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903:

ATAAGGGGCG TTTTTCTAGT GCTTAGTCGA AAAAAATAGT TGATTCATAA GTTCATTGTC 60 

TTATGAATCA ACTATTTGAT AACTTTCTCT CTTGCTTATT GCGCATTCAG GATCCACATG 120 

AATCCGACAA AAACAAAGAA CAAGCCATAC ATAATTGGCG AAACTTCnTT CCCACGTTTT 180 

GCCGCAATCA TTGTAATTGG ATAGAAGATA AAGCCTAAGG CAATCCCATC TGAAATGCTG 240

TATGTCAATG GCATTCCTAA CAAGATTAAG AAGGAA 276 
(2) INFORMATION FOR SEQ ID NO: 904: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904: 
CGCTTAAAAC TTCTCCTTCA ATAGGTGCAT AAATTTCACC ATTTTCTGGA ACTACTGCAT 60 
AGCCATCkCC CATCATTTTT TGTGAGAAGG CAGGATCATT CACTTCACTG GTTGGGATAA 120 
TGGG 124

(2) INFORMATION FOR SEQ ID NO: 905:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 905: 
GGTTCCtCTA TGCCAACAAA TAACCCGGTT TGTGTCACAT AGCTAACCaT TCGATAGTTG 60 
CCAGAAGCAA GCnCAATTCG GCTCATTGTA TCTTCAACAC GGTACATTTC TGCATCGCTT 120 
TCTATCATAA TTTTACCAGC TAA 143

(2) INFORMATION FOR SEQ ID NO: 906:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 906: 



TATCAAGAAA AGTCTAACAA AACTAGATAC AGAGAGATAG AACTAGGGGA AAAAAGTCCT 60

ACCATTGAAG AATTAATTTC TTTTACAGAT ACTTTAGGTA TAAGAtTATC TGAATTTCTG 120

TATCGGGGTG ACATTACACC TCTAGCAGCA TCTTACTATG GTAAAAGAAA AATTGAGGTT 180

GCTTCTTTAA TTAATAACTT CCCTGAATTA GAGAAACAAT TTCCTGAAAT ATACAAAGAT 240

CGCTTTAAGA ACCTACAGTG TTACACTTTA TTTACCTTAT GCTTAATAAT CTCTAAATCT 300

TTAAATAATT CCTTGTATAC TTTTAAAAAC AAAGATATTA GAGAACTTAA AAATTTTTAC 360

AAAAATAGAG AAGTAATATT AGGTGCTGGA CTATGCTATA CTAAGTCATC TTTACAGGGC 420

AGTTCCCAGA ATATTCGAnG GAACTGGATT TTCTAACGGG AAAACnCTTC CCATTAACCA 480



(2) INFORMATION FOR SEQ ID NO: 907: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907: 

TGGAGAACGT CTGTATCTCA CTGAGCAAAA ATAAACGGAA TAACCCTGAA CGGATAGAAT 60 

TAAAAAAGTA TTCTCCAAAA TTGCGCAGAC GGGCAATTTT CAAAGAAGTG AAATAGGCAA 120 

GTGGtAAAAA GGrGACaAAA TGGGGCC 147

(2) INFORMATION FOR SEQ ID NO: 908: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 908: 

AAGGATGCCC TACGAGCTTT TGCnAAAGCT GGCGGACAAA ATGTAACTGA ACAACAAGAA 60 

GAAGGCTTCT TTGATAAAAT GAAAGATGCG TTTGGCGGTA AGAAGAAAAA ATAAACCAAA 120 

CGAGTCAGAG ACAAAAGCAG TTAGCTCCAA GTGTTCATCA CTCTCGAAGG CTAAATTGCA 180 

ACGAGTAATT GTAGGTGCTG CTTTTGTTCA ACTGATTATT TTAAGATTTA GnGAGTGGG 239 
(2) INFORMATION FOR SEQ ID NO: 909:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 909: 
AAATAGAAGG ATGATTAGAG TTAGCTTGTT TATCACTAGC TTTACTATCC GAACTTTGTT 60
TAGAATTTGT 70

(2) INFORMATION FOR SEQ ID NO: 910: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910: 
CGCATTGCCA GCAGTAGAAA CGCTAGGAAA CTGCGaATGT TATCTGTTCT GATAAGACAG 60
GAACACTGAC ACAAAATA 78

(2) INFORMATION FOR SEQ ID NO: 911: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 887 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911: 



TCATGAAAGG GATTCGTTCC ACGGAACnTT CCGAaGaGGA atGCcTmCTT AGTCGATTTA 60

TCATTAmAAA TTAAAGAAGC GATTAAAAAT GGGTATTATC CGCAACTGTT AAAAAACAAA 120

TCTTTAGGAA TGATTTTCCA ACAATCTTCA ACTCGAACAC GAGTATCTTT TGAAACTGCA 180

ATGGAACAAT TAGGCGGACA TGGCGAATAT TTAGCACCTG GTCAAATTCA ATTAGGGGGA 240

CATGAAACGA TTGAAGATAC CAGCCGTGTA TTGTCACGTT TAGTAGATAT TTTGATGGCT 300

CGTGTGGAAC GTCACCATAG TATTGTTGAT TTAGCTAATT GTGCTACCAT CCCAGTCATT 360

AACGGTATGT CTGATTACAA TCATCCAACC CAAGAATTAG GTGATTTGTG TACCATGGTT 420

GAGCATTTAC CAGAAGGCAA AAAATTAGAA GATTGTAAAG TTGTCTTTGT CGGAGATGCG 480

ACACAAGTTT GTTTTTCTTT AGGATTAATC ACAACAAAAA TGGGCATGAA TTTTGTTCAC 540

TTCGGTCCAG AAGGCTTCCA ATTAAATGAA GAGCATCAAG CGAAATTAGC AAAAAATTGT 600

GAAGTTTCAG GCGGTAGCTT TTTGGTCACA GATGATGCAA GTTCTGTTGA AGGCGCTGAT 660

TTCTTGTACA CGGATGTTTG GTATGGCTTG TACGAAGCGG AATTATCTGA AGAAGAACGT 720

ATGAAGGTTT TCTATCCAAA ATATCAAGTG AATCAAGAAA TGATGGATCG GGCcAGGksC 780

CAATTGtAAA TyCATsCaCT GTTTaCCAGC AACmCGTGGG GAAGAAGTAm CTGATGaAGk 840

AATTGACGGT AAAAATTCTA TTTGTTTCGA TGAAGCAGAA AACCGTT 887
(2) INFORMATION FOR SEQ ID NO: 912: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 742 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912: 



GAACAGCATC CGTTACGATT TAAGACCCTG GnLGb I AAC I AC 1 LoitAt i 1 1 1 ICl nnLn 60

TTGACGAAAC GATAGACCAG GAnACACTAC CATGGCTAGT GATTAGCGAC AAAAATTCTT 120

ACGAACGATT TTTCTTTACa GCTCAaGAAA GCACGCmAGA ACAaGCTTTT GAGAAAATGT 180

TAAAaGAACT AGAAACCGAC CaTGACAGTA CACTAGAAGT TGTTCATAAC TGGTTATGCC 240

GACAAACAGA CGAAGCGCTT TTTCTAGGTA TTCTCAAAGA AGATAGAACG TTAAAAGGTG 300

CTATGACTTA CTGCATAGGA AAAGCCCAAG AACAATCCAA AGACCAATCA AGCGCCATGG 360

TGGCGGACGA AGTGGTTTTC AGTTGGATAA AAGAGTATTT CTTATTAGAG AAATTACCAG 420

AAACCAAAGC AGTTGGTAAA GTAACGACCA AAAAATCAGA AACAAAACCT GAAAGAAAAG 480

AGGTGGACGA ACAAATTGAT TTGTTCGAGT GCGTATGAAT AAAAAACACC AAAACCATTA 540

ACCCCACCCA AAGCCTTTTT TAACTGGTGT ACTGCACAAA TACCAACGTA TGAATGGCAA 600

AACAAAAAGG AGAcAATTCT TGCGAGTTCT CGCAAGAATT GCCCcaCCAT AAAAAAACGA 660

TTAACGAAAT ACTCACGATT ATCGTTTCCA ACGAAATTTT ACTCCTTTGT GATAAAGTnC 720

GACTTTTGCA AGCnCAGTTn TC 742



(2) INFORMATION FOR SEQ ID NO: 913: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 913: 



TGAACTTTGT TTCGCTATCA AGTGAAGAAA GATATCGAGG AATGTTAGAT CCATTGCTAT 60 

TTCTGCCAAG AGAAGAAGCT ATTCAAACCG CACGAAACGT GTTAGAAAAT TTTGGTGAAG 120 

TTACGACAGA CAGTCACACT GCAGTGAkaA GAAGACGCTr ATTTTAGATT CTGTCAATGC 180 

GGTGATGAAA GGTAAAGGaA AAAAaCACCT AACGAAArTA ATCGAGGTTA TTCGCGAAAA 240

AGACCCTAAA TTAGCCAATT TAATCTCTGG CCACAATGTT GGGTTAGGTA AAATTCTTTT 300

AGGGGAACGA TTACAGCGAA CCTATTCGCT TTGGAAAATC CAAATCCATG TCTTAnGTAC 360

CCAGGGGCTT TAAATCCCTA CACAGCGGAA ATTAGACAGC GGACGATTAA ACATGGAACC 420

AATCGCCGGC TGTCCATTAT GGnAGTTATT ATGGAAATGT C 461

(2) INFORMATION FOR SEQ ID NO: 914: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 914: 

CTTACAAATC TATCGTTTAT CATCGCGGAA AACGGCTGAC TTTTTTGTTG CCTAAAATTT 60

TATTGTCAGG TACTATTTAA GTGTCTCCAT CCGAGACAAT CCATTTATTT TTTGTTGACC 120 

AGCGGAAACT GGTTGGCTCC TAAATTTGGT AGATTCTTCT GCCTTTAATA CAATCAGCCA 180 

ACCATTTTGG TTGGCTATTT TTTTATTTTT CAATTTATTC CAATGATTAC AATAAAAAGG 240

AGCGAAATAT GTCGCTCCTT TGACTTaTTA TTaATTTTAA AGATATTTTa CTTTCTTCTC 300 

GTTcTACaAT AAAcTaTAAT ACCgCAGCTC CTAACAAAGC AAtATCAGAC CAGCaTAGCa 360 

ACCATTATaT TTTcTCCGTA GTGGTAAgAA CyGGGcTTgC aCTGTTCGcT ACGAGATCgA 420

AGGTAAGCTG gCTTGGcGTC TTGTTAGAAA gAAcGTGGgA CAGGTTGcGC GAGAGCCTGC 480

ACACGTGCTG TTGCCCGTCA GGTGTGCTTG CACGnCTnAC C 521 
(2) INFORMATION FOR SEQ ID NO: 915: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 915: 

TCCTTGTCAC ACAGCGTTTA TGGnTTATGG TTAAGGnGGT TAAAAAGGAC TACTGTCTTA 60 

ymAATTTGCG TTCCGGCCAA mCCAAAaaGA aGCmCmGrAA AAAGTTTTAG AATTTTTTGG 120 

TATGGATACG AmCCCAGCAA ATATTCaGTT AATTAATGAA CTAAAATCAG GAACTTGTTT 180 

ATTCCAAGAC CATCGAGGGC GTAgcCAnCC TATTGCAATT GATGTTTTAT TCGATAGTTG 240

GTTAATGGCC ATTTCGTCTA CTAATAAAGA GGATGAAGCT ACTCAAaTGG CGTTGGCAaT 300 

GGAACAAGGA TCATAAAAAA GGAGAAGTGA GCATGAAAAA AGTACTACCT TTTATTGCCT 360 

TAGTCGGCTT GTTATTGTTG TCaGGTTGTG GAmCAGATAT GAAAAAGATA TTGACTGCCG 420

ATGGTGGTAA ATGGGAmCTA GAAAATAAAA GTCCAACTAC TACtTACACy TTTTTTGaTG 480

ATGAAmCTTT TcCGAGGTAT AATTCCAAAA ATTAGTGATA GTnGGACCGT ACCCCTTACG 540

TGGAAAATAA TAAAAAACTC nCTTTGGnTA TAAAnA 576

(2) INFORMATION FOR SEQ ID NO: 916: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 916: 



CGTCTGCAAC TTTGGTTTCG CTTTTCTAAA ACAACAGCTC GATTCTGAAT AACTCATTGA 60

TTGTTGCCAA TTGATCTTTT GTTTGTTGAA GGTTTGTTTC TGCCGCCGCA GTTTGTCGTA 120

CTTTTTCTTC ATAGTTTGTC GTACTCTTAG TCGCAACTGC TTGAAGATTA GCCAATGTCG 180

CTTTATGAAC AGATAGTTCT TTTTCTGCTG TTGCTTGTGT TGTTTTGGCT GTATTTAAGG 240

CI 1 1 O/iC 1 THPZiPTTPTT i o^rt^ i i ^ J. J- GCTTGTTGTG CGTCTGATTC TGCTTGTTGT GCGGTAGCTA 300

ATTTTTCTTG ATTGGTCGCT ACTTCTTTTT GAAGGGTTTC TTTACTCTTA AAGACTGGTG 360

CAGTGCCTTC TTCATAGGTA TTTGCTTCAA CTAATTTTGT TTGATTATAG CTAATGATAT 420

GGATTTTTGC TGAAATATCA TTTAAATCTC CTGAAATAGA AACTGCCATG TTTGTTTCAT 480

TTGCATCTAA TAAAGAATCT AAATGGGAAT AGCCAGAATG TCCATCGTTC ATCAACATGC 540

TTACAAGTGC ATTtCGAGCA GCTTTTTCAA AGTCATACAT AGAAACTTTA CCtTCtgTGT 600

CGTAAATCTT CCCaTACTTA GGGtTTCCAT AAAAgTTTnG TCCaGGGaAT TCCTTtAAAA 660

ACaATTTcCT TTgcCGGCCA CCGAATTGGA TCCGCCAATT TACGGGCCCA nGGATTCCCA 720

AAATTCCCTT TGGGGGGATT AAnCCCAAAA 750



(2) INFORMATION FOR SEQ ID NO: 917: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917: 

ATTCTGGnTT GGnATGTTAC CATTGCTTGG TTGCGCTGAT TGTGAAATGC TTAAAAATGA 60 

CAAGnAAAAT ACCGTCTACT TGACCAAGCA CGGTTATTTT TAAATnATAA ATTGGTCACC 120 

GTCTTAAACG GCTCTACATC AGAGAGATAT GTTGGAAGCA TATCTCTCTT ATCTTnATCA 180 

TTATAGCACA ACAACCCTAA AAAGACCATC AATAAAAAAT GCCTGAGCCT CCTATTTCAA 240 

GGCTCGGGCA TTTTTATTTT AGGGACCAGT CACCAAATTC CAAGTGACCG CAGCATTGTA 300 

CTGTTGTCCT TTGGTTCCTT GGTTGGCTGG TACTTCTAAT TTAATGTTCT CAAACGTAAA 360

ATCCAGTTGG TAGACGTCAC TACCTGTGAA CTGTTGGTTT GCGACAACAC TGGTTGCCAC 420

ATTATTCGCC GTTAAGCTCA AAGAACTAGT CTTACCAACT GCTTTGATTT GTTCGGTTGC 480

TTCGTTATAA TTTGTAAAAC TTGAAACATT CGCTGGACCT AACAATAAAC GAGTCGCTGT 540 

TGGCAGACTA TCTGTGGTTG CTTTCGGTTG AGATAGTTGC GCTGTCATCT GCCAATTCGG 600 

TTGATTTTTC TTAATTCGCA AGTAAGGGAT TGCGTGTACC ATTTTCGTAG TAATCGGCCG 660

CTTTTTTCAA GCCTGATTGT TGCGTTTTAC TAGCAATATT CATTTTCCAA AATCAAAGGT 720
TGGGCACGCT GATnAAGCCT TCnGTTGGGT GTAATGGTCC nTGGGGTTGT AACCGTCAGA 780 
CGTAACGCAT CGGGGCGGCT CATGGTCGGC AAATTCCCGT AACACAAG 828 
(2) INFORMATION FOR SEQ ID NO: 918: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 918: 

AGTTATCTTA ACGACTAATT TTGCAGATAA AATTGGTTAT TATACCATTG AAAATACAGC 60 

GCAAGAAGGT AGTCTAGCAA TAATCAAAGT AGACAAAGAT TCAAGAAAAC GTCTAGCAGG 120 

TGCAGAGTTC AAATGGCGTG ATACGGTGAC AGGAAAAGAA GGAACAGTTG TAGCTGACAA 180 

AAACGGACAA GCCCTGATAA CGAATTTGCC AGTCAATCGT GTATACGAAA TTACAGAAAT 240 

TAAAGCACCA AACGGCTATA TCTTAAATAA CAAGACTTAC CGAGTTATCT TAACGACTAA 300 

TTTTGCTGaT AAAATTGGTT ATTATACCAT TGaAAATACA GCGCAAAArG GTAGTCTAGC 360 

AATAATCcAA GTAGACCAAG tTCagAAAaC GTCTAGCaGG TGCAGAGTTC CAaTGGCGTG 420 

ATACGGTGAC aGGaAAGagG AACAGTTGTA GCTGACAAAA CGGACAGnCC TGTTACGATT 480

GCCAGTCATC GGTATCG 497 
(2) INFORMATION FOR SEQ ID NO: 919: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919: 



TGATCTGCCA CAGTTTGCGC TTGCTTCAAT GCTTCACTGT AGACTTTCCA ACTGCTTGCG 60

GTGTAGTCTT TTTCTTGGTG TTTCTCGTTT TCTTTGATGA TTTTCAACAA GTTGGTTTTA 120

TCTACTTCTT TAGTTGGCAC TTTTACCAAT TGCTTCACTG CCGAACGTAA CTCTGTTTCT 180

GCTTGGTCTA CTTCTGCTTG CGTTGCTGTT GTTTGATCTG CCACAGTTTG TGCTTGTTTC 240

AATGCTTCGC TGTAGACTTT CCAGCTGCTT GCGGTGTAGT CTTTTTCTTG GTGTTTCTCG 300

TTTTCTTTGA TGATTTTCAA CAAGTTGGTT TTATCTACTT CTTTAGTTGG CACTTTTACC 360

AATTGCTTCA CCGCCGAACG TAGTTTTGCT TCTGCTTGGT CTACTTCTGC TTGTGTCGCT 420

GTTGTTTGAT CTGCCACAGT TTGTGCTTGT TTCAATGcTT CGCTGTAGAC TTTCCAGCTG 480

CTTGCGGTGT AGTCTTTTTC TTGGTGTTTC TCGTTTTCTT TGATGATTTT CAACAAGTTG 540

GTTTTATCTA CTTCTTTAGT TGGCACTTTT ACCAATTGCT TCACTGCCGA ACGTAACTCT 600

GTTTCTGCTT GGTCTACTTC TGCTTGCGTT GCTGTTGTTT GATCTGCCAC AGTTTGTGCy 660

TGTTTCAATG cTTCGCTGTA GACTTTCCAG CkGCTTGCGG TGTAGTCcTT TtCcTGGTGT 720 

CyTTtTGTTC CTGAAATTAA TTTTCTAACG CTCTTCGGTA ACTTCAGGAG TGGTTTAACn 780 

ATGAATGAAC GCGAGC 796 



(2) INFORMATION FOR SEQ ID NO: 920: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 920: 



AACTTCAGAA AATATTAACA GGnAATAGCC GAATnAAGTA ACGTCTACCA AGAAGAAGGA 60 

CTCTACCGTT CACAnGCGTG TTCAAATCCT ATTGTTCCGA GGTCATTACT TTACATTCGG 120 

CTTTTCTTTA TAATTCTGGT TCACGATTGG CATCTTTGGA ATAAACGTAC ATTAGCGGTT 180 

CTTCACGACC AACCGCATAG GCTGCTTGTG GCACATAGGC ACTCATGAAA ATATAGTCCC 240

CTTTTTCCAC TGGATACCAT TCGTTGTCCA AATTGTACAT CCCTTGGCCA CTAATTAAAT 300 
ACGCGCCATG CTCTTGGACA TGCGTTTCAA TGTAGGCATG ACTAGCACCT GGTTCAAAAG 360

ATAAGATATG CATATTCATA TCAAAGTCAA ACTCTTTCGG CAATAAACTC CAAAGTAACA 420 

CGTCTGTCAT cCTTCATATT CTTCGGGTTG TTGGkCaTGA ATTGACCCAA CCACTTTaTA 480 

AGGtTGATGG cCTGCCAACG GGCTGAnAAG CGGnTTTTAA TAT 523 



(2) INFORMATION FOR SEQ ID NO: 921: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 921: 



TTTCATCAAA CGACGTTCTT CATGTTTAGT CATTTTTTGT TTTCGTTCTT TTTTATCGGG 60 

AATTTTTTCA GTAATCGAAC AGAATTTACA TTTGAAAGAA CGACCATTTT TTCCTTCTAT 120 

GATGACCATT TTTTTGGCTT CTGGCCGCCC GACTAACGGT ATGCGGCCCT TAGCAGATGA 180

ATTGGAGATG GCTCATTATA ATGGACACTT ACTTTCCTAT AACGGCGCTT GTGTCACCCA 240 

TCACGGcAGT CAGCAACAAC TATTTAATCA GACAATTTCA AAGTCATTGa GTCAACAAAT 300 

TTTAGAGCAT TTGAAACAAT TTGaCGTAAT yCCAAkGATT AATGATGrAA CATTaTGTAT 360 

GTCAATGATG TTTTTCATAA CACCTTACAT TTAGAGACTG GTGATTTTAA TATTATTGAA 420

TATGAAn 427

(2) INFORMATION FOR SEQ ID NO: 922:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922: 



TTGATTTTTA nGGATnGGTT CCGGTGGCAT AAGGACAACT AGGTCTTTCA CTGGCAATCG 60

TCAATGGCCT ATTGGnAAAG ATGCCCCTAT CTGTATTATC TGAAAAATCA nACAATCATA 120

ACCAATTAGA AAGGTGGTTA TTATGTATAA CTTATTAACC AAACAGGAAA TTCAGTTACT 180

TTCCCTAATT GAATACTTGT ATGACAGTAA AGAAAAAGTC CCCATGCAAG TACTCCGACG 240

AAAATATGAA TTTTCCCATT aCAATATCAA CAATTTATTG AATCAATTAA CTTTGTTAAT 300

TTCTCGAGTC AATACACATG AAAATGTACA TATTCGTATT ATTAATAATC AACAGTCCAT 360

AGAATTAGTC GCCGATGAAA ATATCCCGAT TGAGTTAATG AAAGAAGCTG TTGTCCGCGG 420

GTCACTAACC TATATGTTAG CCCTGGATTT ACTTTTAAAA CGCTACACTT CAGCCAAAGA 480

TTTTTGTGAA GAGCACTTTA TTAACTTTTC nATTTTTAAA CAAGTCAGTG GCCGGTTAAA 540

TAATCCATCC AGCTAGGATT AATTGGGTAA TTAAACCTAA AAAGGnGGGG AAAAATTTGn 600

GGGAACGAGG AAGGATTTCG CAGGTTnCTT TAATCCCTAT CCTCCATTCC 650



(2) INFORMATION FOR SEQ ID NO: 923: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923: 



GGGCCCCCAG TTCCCCGTAC TTTAACGGAA TCATCAGTAG GTCGCCTCTG TCGTGTTGGA 60

CTnCTnGCTT TTCGCCACTr rTTTTAAATT GACTCCCAGA GAAATACACG aaCACCAGGA 120

AaTTTCATTT AATTGTCCTC TGaTAATTTT AGGTAAATAA TAGACAAcTT GTAAAAAACT 180

AACTTAGGTA AAATGATAAT GTGGACTTCG CCGTCATCTG GCGTAcATTG GCATCGAAAT 240

TGGTAAAGCC ACCAGCAGAA TTCGTCATGG TCATCGTTAA TAATTGGGCT TTGCCATGCC 300

AACGTCTTTC ATCGCCATCA ATTTTTAGTT TGTATTTTTT CTTTTTCATT AATAAGCGGA 360

AAAATTGCTT TGTAAAGGCT AAAGGACCAT ATkTTTGTTT CTCCTGTTGG GTCACATTTG 420

CTGCTGTATC TGCTAATAGT CCAATCGTTA AGGTCGAAAT CATCACATCT TGATTTACCA 480

TCCCAAAGTC AACT 494



(2) INFORMATION FOR SEQ ID NO: 924: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 924: 

TTTGGCTGTA TnTAAAGGCT TGGCTGACTT GCACTTCTTG CTTGTTGTGC GTCTGATTCT 60 

GCTTGTTGTG CGGTAcTAAT TTTTCTkGAT TGGTCGCTAC TTCTTTTTGA AGGGTTTCTT 120 

TACTCTTAAA GACTGGTGCA GTGCCTTCTT CATAGGTATT TGCTTCAACT AATTTTGTTT 180 

GATTATAGCT AATGATATGG ATTTTTGCTG AAATATCATT TAAATCTCCT GAAATAGAAA 240 

CTGCCATGTT TGTTTCATTT GCATCTAATA AAGAATCTAA ATGGGAATAG CCAGAATGTC 300 

CATCGTTCAT CAACATGcTT ACAAGTGcAT TTCGAGCAGC TTTTTCAAAG TCATACATAG 360 

AAACTTTACC TTCTTGTGTC GTAAATCTTC CCATACTTAG GTTTTCATAA AAGTTTTGTC 420 

CAGGGAATTC TTTAAAACCA TTTTCTTTTG CTGCACGATT GATCGCATTT ACGTCATGAT 480 

CAAA 484 
(2) INFORMATION FOR SEQ ID NO: 925: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 925: 

GCAACnTATT TnACACCTTG GATTAnCAGG TTGGCCCATG TGTTCTCTAC CGTCAGATAA 60 

TCAAATATGG AACCCCnCGG CAATCGCTCT TCTTGGCTAC ATCCGAATCA TTTTAAAAAT 120 

ACCTGCACAA CTAAGTTATA TATTTTGTCC GCTTTAAATG GTTTTGAAAA CAAACCGATA 180 

TCAAAAACAG TTACTCGCAA ACTGTCTGAT TTATCTTACA AAGATTTATA TTATCAAGAG 240

ATGAAACTTT TAAAAATTAT TCGTTAATAT ACTTATAGTG GTCTATCAAA AAAAGAATAT 300 

TTATTCCGTT ACATAAACTT TGAAAAAATG AAAAATAAGA CCTTCAACAT ACTCTATCCT 360 

GTTGAAGGCC tTCTCTCTTA TTTAACCCTT TAATTTTTAC GACCtCTAAA TAATCATACC 420

CAAAATGTGC ATACGCAATG GCGTGCAATT CGTCCTCGTG GGTGTCGTTT AATGAAnCCT 480

TTTTGAATAA GAAnGGGTCA TACATACCTC CACCGTTCGG TTCTCACGAT ATTACAGCAA 540

GGACTAACCA CGGACGCGCG ATAATCATAT GTTCACATTT GTCGCATATC AGCTGAGACA 600 

CTGCGAnGTA CnTTGCGCAC GCGCGTTGCA CGA 633 
(2) INFORMATION FOR SEQ ID NO: 926: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 926:



TAGTTTTAGC AGCAACGTCG GTCAGCATTT CAGTAGAAGT ATTAAGAGAG TTGAATGTTT 60

TAAGTAGTAA AGAAGGTGCA CGnnTTCTTG GGGCTTCCGT CGTTGACGAT ATCGTGGTCG 120

TGATTATTTT AAGTGTCGCT GTCGGCATGA TTGGTGCAAG TACAGGCGGC AATACAGAAG 180

TCAGCTTTAT TGTTAAGTTA ATTGAACAAG GCTTATTCTT TATTGGGATw TTCTTTTTGG 240

TCCGCTTTAT TGCTCCCTAT kTACTACGTT TAAGTCAAAA AATGAACATT GGGTCGTCTG 300

TTATTATCAT GTCATTGATT ATCTGTTTGG GCATGGCTTA TTTGGCTGAT TTAGTTGGTT 360

TAAGTTCTGT CGTAGGTGCT TTTTTTGCAG GGATTGCTGT GGGACAAACG GATGCAAAAA 420

CAGAAATTGa TTTCAATATT GAAGCTATTG GCTATGCCGT GtTTAATCCC gyCTyCTTgc 480

CAGCATgGCy TAtCAGTGAC ATTwACACGT AnGAAAGGAT TAnCTTTTAT TTGGGAATGA 540

CGATATGGCG AATTTAACCG AAAnTA 566



(2) INFORMATION FOR SEQ ID NO: 927: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 927: 

TTTTTGTTGG CGTGATTGGG TATGCCAGTT ATCTTTTTAC CCAAAAGTTA TTCCAAGTTC 60 

CTTACCTCGA TTCATTCGCA GCCGCTTTTG TCATTGGTTT GTTGGCCTAT TTGGCTGTGC 120 

ATTTTCACTT AGCGGTCAAC ATCGATAATA TCATTATTGG TGCGGTAATG CCGTTAGTTC 180 

CAGGTGTGGC GATTACGAAC TCTTTTCGCG ACATTTTAGC GGGCCACCTC ATCAGCGGGA 240

CGGCGCGAGG GACAGAAGCT ATTTTTATTG CTGGGTCTGT TGGCCTGGGa ATTGCCTTGA 300 

tTTTTAAATT ATTTaTGTAA GGAGTGTTCA AATGGAcTAA TTTTyCATCT cyCTTTAGTT 360 

TTTTAGCGAC TGTGACGTTg GaTTATTACG AcATcCTCGT AAAcAcTGGg GCTGgGCATA 420

CTGGCATGAC AGGCTGaTGA tTATAwGcTA ACGcaACtTg wgCaGCaACt TGCAACTTTA 480

GGACGCCGGA 490
(2) INFORMATION FOR SEQ ID NO: 928: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 928: 
ACGCTTTTTC TGGTGTGACG GCcAGTTGCG GAAACCGTGA CGATTGAAAG TAGTCCGACC 60

GCCGAAAGTA GTGCCAAGGA AGAGACGCAA GCAAGTAGCG TGAAGGAAGA AACAACGAAA 120

GCCAGTACGG AAAATAGTCA AGTAACAACT GACACGAGTC AGGAAGAAGC AACGAAAGAA 180

GCGGAGAAAG AAGAACCGCA AGCAGAAGTG GAACAAGCAG AAACACCAAT CATTCCTAAA 240

CCAAAAAAAA TCAATATGAG GCACTTATTC ATTTTCTGCA GAACTTATCA GTTTGGATTG 300 

TGATGATCAG GTCATTATAA TCCAGTATTT ACCATTACGT TACTATGCCA AGGTCATGGA 360 

GGCAGATGGT ATAATCGAAG GGnCTAGTTG GTCAAGGATG CTTCACCGTG GAACTTAAGG 420 

ATGTATATGC CACACTCTGn GTATGCnCCA GnCGCATTGA 460



(2) INFORMATION FOR SEQ ID NO: 929: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 929:

CCTATGTAGG CTGCGATATT CAAAGCTCCG CAGCTTGCAT GGACAAATCA CTGGCCTACA 60 

TTCTTACnAA AAAATGCGGG CATCGCCGTC CCCGAATTTC GGTTTTCACA ACGCGGCCTT 120 

CCATCATGAT ATGAACAACA TCTGGCGTGA TATAGTTCAA TAAGCGTTGA TAAtGGGkAA 180 

TAAtTAACGC ACCAAAGTTC TCGCCGCGCA TTTCATTAAC CCCTTTAGAA ACTACTTTTA 240

ACGCATCGAT ATCTAAGCCT GAATCAATTT CATCTAAAAT GGCAAACGTT GGCTCTAACA 300 

TCAATAATTG TAAAATTTCA TTTCGTTTTT TCTCGCCGCC AGAAAAACCT TCGTTTAAGT 360 

AACGCTCAGC CATTTCTTCA GGCATATTTA ACGG 394 



(2) INFORMATION FOR SEQ ID NO: 930: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 930:
CTCCAGCATT GTTGGTCATT TTTTCTAATT CTnCATAGGA GTGTTCTGCA TCACTAAATT 60 

TAACCATTTC TACTTTATGA AACTGGTGTA AACGAATTAA TCCTCGGGTA TCACGACCTG 120 

CACTACCCGC TTCCGAGCGA AAAGAAGGAC TTAAAGCAGT AAAATAAATC GGTAACTCTT 180 

GCGCATCTAA AATTTCATTA TTATAGTAGT TAGTTAACGG GACTTCCGCT GTGGGGATCA 240

GCGTCAAATC ACTTTCTGCA ACTTGGAAAA CATCTTCTTT GAACTTAGGA AACTGACCGG 300 

TGCCAAACAT TGCTTTGCTA TTGACCAAAT AAGGAGGAAT CACTTCGGTA TAACCATGTT 360 

CATTAACATG TTGATCTAAC ATAAAATTAT AGACCGCTCG TTCTAAACGA GCCCCTAAGC 420 

CTTTGTAATA AAGAAAACGG CTACCAGATA CTTTCGCGCC TCTTTCAAAA TCTAAAATTC 480

CTAAGGCTTC ACCGATTTCC CAATGTGGTT TTGGTTCAAA TGAAAAAGTT TTCGGCGTAC 540

TCCACCGCCG AACTTCTACG TTGGCGGCTT CGTCTGCGCC AACAGGTACG CTCTCATGAG 600

GTAAGTTAGG CAAACGTTCT GCAATGGCTG TGCATTGTTC TTGCAAATGA ACAACTTCTT 660 

GATCTAAAAC TTTAATTTGC TCGCTAACTT CTTGCATATT TAGCAATTGT TGTGAAGCAT 720 

CTTTTTGTGC ACGCTTTAAT TGTGCAATTT CTCCTGACAC CTGATTACGG TGTTTTTTCA 780 

GTTCTTCAAC TTTAACAAGT AACGCTCTCC GTCCTTCATC TAAAGCCAAA AAGCGTAATA 840

AAACTTCCTC GTCCACACCA CGTGTTTTTA ACTTTTTCTG TACCACTTCT ACATTTTGGC 900 

GAATTTTTTT CATATCCAGC ATCTTATTTC CCTCTTCATG ATTAAATTTT AAACAATGGT 960 

GAATTTTACT TTTTCAAAAA TTTCATTCTT CATTCGTTGG CCAGTCGCTG TGCCAGCTAA 1020 

GCCACCTAAA CCCGTTTCTC TCAATTCTCG TGGTAATTGA CGGCCGACTT TATTCATGGC 1080 

TTCCACCACT TCATCTGCTG GTATTTTATT TACAATTCCA GCTAAAGCCA TATCTGCAGC 1140

GATTAACGCA TTGCCTGCAC CTACTGTATT TCGTTTGACA CAAGGAATTT CGACTAACCC 1200 

AGCCACTGGG TCACAAACTA AACCTAATAA ATTCCCTAAA GCCATAGCAA AAGCTTCAGA 1260 

AGATTGGCGT GGTGTTCCGC CAGCCGCTTC AACAGCTGCA GCGGCGGCCA TCGCTGA 1317 
(2) INFORMATION FOR SEQ ID NO: 931: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 931: 

CCCTTGTCAC ACAAGCGTTT ATGGTTTATG ATAAAGTAGT AAAAAAGAAC TACTGTCTTA 60 

CAAATTTGCG TTCCGGCCAA ACCAAAAAGA AGCACAGAAA AAAGTTTTAG AATTTTTTGG 120 

TATGGATACG AACCCAGCAA ATATTCAGTT AATTAATGAA CTAAAATCAG GAACTTGTTT 180 

ATTCCAAGAC CATCGAGGGC GTAGCCAACC TATTGCAATT GATGTTTTAT TCGATAGTTG 240

GTTAATGGCC ATTTCGTCTA CTAATAAAGA GGATGAAGCT ACTCAAATGG CGTTGGCAAT 300 

GGAACAAGGA TCATAAAAAA GGAGAAGTGA GCATGAAAAA AGTACTACCT TTTATTGCCT 360 

TAGTCGGCTT GTTATTGTTG TCAGGTTGTG GAACAGATAT GAAAAAGATA TTGACTGCCG 420

ATGGTGGTAA ATGGGAACTA GAAAATAAAA GTCCAACTAC TACTTACACT TTTTTTGATG 480

ATGAAACTTT TTCGAGGTAT AATTCAAAAA TTAGTGATAG TGGAACGTAC TCTTACGATG 540

AAAATAATAA AAAACTCACT TTGGATATAA AAAATAAAGA ACAATTAATA ATGGAAAATG 600 

TTGAATATAA AGACGGTAAA TTAAAAGGTG AAATTGGAGG CGAGAAGGAC TCTGATAAAA 660 

AATnGAATAA GAGGTGTCTT TGAATAGAAA TCCTGGTAAG CGTCTnCTAT TTTTGnGGCG 720 

TTTT 724 
(2) INFORMATION FOR SEQ ID NO: 932:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932: 



AAGGTACCTC CAGTAGTTGT ATTGCCACTT GCCAATAATT GTTCACTGCT ATTGTAAAAT 60

TGAACCGTGA CATTTGGATC AGCAGTTCCA GTGACTGTAT AGCCATTGGC TTTGTTTCCT 120

TCAACCTTCG TAATAGTTGG CGCCGCTAAA ACAACATCCG CTGGAATAAC GGCTGTAGCT 180

GGTTGACTTT CATTACCTGC GCCATCTTTG CTAATAATCG TAaTCGTwTC TCCTGGTGTC 240

ACTACGcCAG CtGGTAGAGT CACCGTATAT TGGCCGGTTT CGTTAGCAGT TGTAGCAGCA 300

ATGATGGTTC CGTCTGCGTC ACGGACATCA ATAGTGGTTT Tn 342



(2) INFORMATION FOR SEQ ID NO: 933: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 933: 



AAATTGCCAG ACCTACTTTT TATGGAGACG ACTTTGGAAA TTATTAAAAC GACGAAATAA 60

CTTCATTCAG GATTTCATGC CAAAAGGGAT TACGAAAGCC TATGGTATTT CCTGTTAGCC 120

AAAGATTTAG GAATCCGTGG CAGAAGAAAT CATGACCCTA GGGACGAAGA AAACGATTTA 180

CCGATGATCG AATACGCGGC TAGTGTCGCC AGGCAAATGC GATTCCATaG TTAAAGAAGC 240

CGCAGATGTC GTCACAGACA CGAATGACCA GnGGCGTTGC CAAAnCGGTC GAAAAATATA 300

TTTTAACGCC GTTGGAAAGG AGGCCATTAA GAnGGGGATT T 341



(2) INFORMATION FOR SEQ ID NO: 934: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 934:



GACTTAATGC TATTGAAGCC CGATGGATGG TGnGTAAGGT CnCGAAGCTn GnGGAATCAT 60

AAAAGTTAAT AAGCCTGGTT TTCCTAATGG GGGCGTCACT TAGGATAGTT TTCTTCCATT 120

CTCCAATCCT GCCGGATAAG AAGGTATTCG CTTCCTCCTT TGGAGTTAGC TTGATAAACA 180

GCCCTTAAAG TCATCCACAG ATTTCTTTTC GGTCTTTGAC ACGTACTTTA TGAGCGATAT 240

TACGAGATAC ATGGATACAA CAATGCTGAT ATTTTGCTTT AGGATAAATT TGATGGATAG 300

TATCTTTCAT GCCTTTTAAG CCGTCCGTAA TAAAAAGCAA GACTTCTTGa ACTCCTCTGG 360

AGTTAATATC CTGTAGCAGC TCATTCCAAA CGTATGTgAT TCAGTTGGAG CAATCGCATA 420

ACTCAGTACT yCTTaGTGCC GTCCTCCCCG AAAACCATnG GCCAAAATAA ACCGCTCCTT 480 

GGGATAACGG TTTGACGTTT nAGTGGAATG nAAGAAGCG 519 



(2) INFORMATION FOR SEQ ID NO: 935: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 935: 



CATATGGGGC TATACGTCAC GATTGGCATT GGAGATAACC CATTGTTAGC CAAGCTGGCC 60

CTAGATAACG AAGCAAAAAA TACACCAGGT TTTGTTGCAG AaTGGCGATA CGAAAACGTA 120

GAAGAAAAAG TTTGGACGAT CTCCCCTATC ACTGAATTTT GCGGGATTGG cmAGCGTATG 180

GCArCTCGTT TAAAGATGTC aGGAATTGAr TCgaTTnATG ATTTAGCACA TGCAGATCCG 240

TACATTTnAA AACAACGTTT TGGTGTCATG GGTTTACAGC TGTACGCTCA TtCTTGGGGG 300

ATTGATCGAa GTTTTTaGGA GGAAAAAAGA CAAGTTGCTA AAGAAAAGTC nTTTGGAAAT 360

AGTCCAAGTA TTACCnTAGA GATTACCGCA 390



(2) INFORMATION FOR SEQ ID NO: 936: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 443 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 936: 



CTATTAAAAA ACGTCAAGAA GAAGCAAGGA AACATGGTAA AGCAGAAACT GGCTTAAAAC 60

AACGGCCATT AAATAAAGAA CAAGAAAAAC GTATGCnACT ACGCCGTGAA TTGAAGCCAA 120

GACCTCAACA TAAAGCGCCA AATGCTATCG AAAAAGCACA AGAAGGATTA AGACAAGCGC 180

ACGAAAAAGG ACGTATGCAA GAACAGAAAT CTGAAGAACA ACTCAAACGG CAGCAACATG 240

AATCAGCTCG CCAAAAAGAA TATGAAAAAG ATCGATTGAA AAAACAGGAA TCACTTAAAA 300

AAATTAATGA TCAAAGCGAC GAATCAACCA tTTAACAGCT TraGAAtCaA ATCGGCGTAC 360

TGGACAGCGA GTTAAAAACG CGAACAGGnC TTCAACACTT TTGAACGCCA GGGGGAAAGn 420

TCTTCACCAC AATTAnCCAA AGT 443



(2) INFORMATION FOR SEQ ID NO: 937: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 937:

AGGAATCACT TAAAAAAATT AATGATCAAA GCGACGAATC AACCAATTTA ACAGCTTTAA 60 

GAAATCAAAA TCGGTCTGGA CAGCGATAAA GGGGGCGAAA CAGGTCATCA AACACTTATG 120 

AAACGACAAG GAGAAAAAGT ACTTCAACCA ACAAATAAAC CAAAAGATAT GAAAAArCAA 180 

GTGGTTGnTC GTAGTCCAAG ACGAACAGGA CAACCAACGA CTCGTCAA 228 
(2) INFORMATION FOR SEQ ID NO: 938: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 938: 
AAGATGTTTA GTTGTAGAAA ATAAACAGGT TTGGGAAGAA AGTGGGACAG GAGTATTTAG 60 
CTTCAAGAAT GAAGAAGCTA CTCCTGTCCC GCCGTTTATT TTAGGATTGG GaCGGATGTT 120 
CTAAGTTCTT CCTAAAtCGT TTCTGAAAAA AtGAAgGAGT GAGGGAGAAG ATGACAGrAC 180 
CAAtGTGgAa TtGAGAaGTt AGGATtaGAA aGCGCATCCt GAaGGgCcGG TtATTTtAAa 240 
CcAAACCAAT AAAAGTGAAC AATTGATTCC GACCAAACGA GGCGAACGGG CCGTnGCATA 300 
CAGCGATTTA ATTnCTATTA AATnCAGAAA 330
(2) INFORMATION FOR SEQ ID NO: 939: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 939: 
TACGGTCAAG ACCGAGTATG GGGAACTTCA TCTCCAGATT CCGCGCGACC GCAATGGCGA 60
GTTCAAGCAA CAGACTGTTC cTGCTTaTAG ACGGACGAAT GACACGTTAG AGGAGAnCCG 120
TCATTCACCT CTTTCCGAAA AAGTATTAnC CATGTCGGAA TCGCAGACTT GATTGAGAAA 180
TGTTGGGGCA TCATACACGC CCAAACCATG TCCATATACC AAATCATTAC AGAGGGTACG 240
CGTTAAnGGG GGACTCATGC CGTATGCGTA TTATATGCGC ACGATATCGT AACGGAACGC 300
GCAAGAGCAT CTAC 314

(2) INFORMATION FOR SEQ ID NO: 940:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 564 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 940:

CATCATATCA AACAAAGAAT TGGACGTGTT GGCATCTTTT AGGACGGAAA GACCCGTAAA 60 

AATAAGAATC GTCATTAAAA AAATACCAAA AAACCGTTTG TATCGTTGTT GTCCAATAAA 120 

TTTTACACCT AAAATAACAA AGACAAAGGC AATACCAATC GTTCCCGCAA TGCCCAACAT 180 

GTTAGTTGCT ATGCTAGACG TTAGGTTCAT AATCGGTTGC TTAATAGCAG TTATAATATC 240

CATACTAAAT AGTGTTTTTA CCATAACTGC ATTAAACTCT CCGAGTCCTT TTACCATTGG 300 

ACCAGACAAA AGTCTTTATT GTCGCAGAAG CATTTGTGGA TGGCTTCTTC AACACCTGTC 360

CATGAATCnT GGTTTTTTTT CTTTGGtCAT TAACTCGAAA GCGTTGTCyT TGTAAGTAtC 420

GTAAATCmAA TCTACATTAG kGGTTGGCTC yGTAACAAtA ACTTwATTTC ayCAGGGTTA 480

GGGTCGGGCA TTAACCGCTG GAGTTTGCGG TAAACACCAA AAAACAGCCT ACTAAAAATA 540 

GGTAAGACTG GCTTTACCAG TGGT 564 
(2) INFORMATION FOR SEQ ID NO: 941: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941: 



AATAATTGAA GACGAAAAGA GGACGAACTG ATTGTCCTTC TTTTTTTAGT TATGGAGGGG 60

AnCCATTGGA AGCAGTA3TA GTAGAAAGAG AGGCAAAGGG GATGAAAGAA ATTGCCATCC 120

AAGAAAAAGA TTTGACCTTA CAGTGGAGAG GAAACACAGG TAAGTTAGTT AAAGTTCGAT 180

TAAAGAATAC ACGTGCAATG GAAATGTGGT ACAACAAACA AATTACCGAA GAAAACATTC 240

AAGAGATCAC CACGTTGAAT ATCATTAAAA ATGGAAAATC TTTGGCATTA GAAGTATATC 300

CAGAAAAAAG TATCTATGTG AAACCAAATT TAGGCAGAAT CAATGTGCCT GTCTTTTTTA 360

TCAAAACACC TATTAACAGA GGAGTATTTG AAGAGATTTT CGGCGAAACG TTAAAAGCAT 420

AAGTAAAGGA GAAAGAAAAG CCAATGTATG TATCTTTTGT TCTAAGTTGT TTGCtTssaT 480

TTTCTGCCTA CAGTCTATTA AATAGGCTAA ATTCGTTGGA ATTTGTGGAT GTTTGGTTAG 540

ACAAAGrAAC ACAAAAAATC ACACTAAAAC GCTGTTTTTA TGATACGTCT TTCAAGAAmC 600

AAaCACTAAA aGAGTtAGca CGAGTaTATT TCCAATTAAa wGAAATaATC aaCGTGCAAA 660

TAAACAAGCG GTCTTTAAAT ACGAATGACA TACGTAATGT ACGAGAACTA GAGGAAAAAC 720

AACAAGAAAT AAAACGATTC ATGTTAGACG TTTTAGAAGA TGCTTATTGG AAAGAATTAG 780

CAAATATGCC AGAAGACCAA CGACACTTAG ACGATTGGGA TTTCTTTTAA AATGGGTTAT 840

TGAAAGGAAG CAATAATAAA AAAAATGATG CCTAACACAA AAATATAGGA GGGAAAAAAT 900

GATTATTTTA TCAGGACTTT TTATTTTAGg GATAGGAATA GTTGGAGGTT ATCAGCTAGC 960

CACGTTACCT AAATTGATTG AAATGAAACA GCACAAAGCA ATACAGAACC ATTTCAATGT 1020

TAAAGGAAAT GAGTACACCT ATTATCAAGA AGACAGTGAA AATTACATTT TATCCTTGGA 1080

AGATACAGAA TATCGAATTA AATTTTCCAg AACACCCCGT TAAAAGTGGT ATTCCTGAAT 1140

TTTAGAnCCn 1150



(2) INFORMATION FOR SEQ ID NO: 942: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 942: 
TTTAATTCCA CGGAGACnTT GATTATGGAC GGTTCTTTCA GGATTAAAAA ATAGTGATTG 60 

GGACgCTGTA GkTCATAAAA ATGCGTACTA TGkTTCTGGT GTAGGTCTAG CTAGAGAGGt 120 

GGACGTATTT CTTTyTCTTT TGGTATGACA ACCAAAGGAA AAGTCAACCT ATCTGGTGCG 180 

CAATGGTTTG CCTTTAGTAC CAATTTAAAT GCGAAATCAa TTAGACCATA CCAAAaGAAa 240

GGGAATCCAA AaGAACCAGA AAAAGCAACA ATTGAATTCA ATCGATACAA AGCCAATGTC 300 

GTTCCTGTTC TTGTGCCGAA TAAAGAAGTC ACTGATGGTC AGAAAAATAT CAATGATTTA 360 

AATGTGAAAC GAGGCGATTC TTTACAATAC ATTGTGACAG GGGATACGAC AGAACTTGCC 420

AAAGTAGATC CGAAAACAGT GACAAAACAA GGGATTCGGG ATACCTTTGA TGCAGAAAAA 480

GTGACGATTG ATTTAn 496



(2) INFORMATION FOR SEQ ID NO: 943: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 943: 

AAAATGTAAA TCCCATTCTC CAGGCACCAC GTGATGTATG GGGTnCGGGA TGACTTGCCG 60 

GAATGTTAAA ACTAACGGAT CCCCCAATTT CGTCGCACGA TCAATGGCTC TGCACAAAAT 120 

TTTAGCAAAG CCGTACACCG CATCATGACG GTAACTCATT GGCGTAGTAC CTGCATGATT 180 

GGCTTGCCCT TTTAAGTTAA CCGTGTAGCG ACGTTGCCCT ACAATACTAT GAACAACACC 240 

GACTTGGAGC TTTTCGTTCT CTAAAATATT GCCTTGTTCA ATATGAATTT CGACAAATGC 300 

TTCAATGTCC TCACGTCTAG TTTGCTCTTG CCgAAAATCA AAGCCTTGGC GATGCATCTc 360

ATCAACAAAC TTTTTACCTT CGTTGTCAGA GATATCGACT ACTTCTTCAC GTTTGGCTTC 420

ACCAACGACA TTTTTGCTTC CCCAGAAAAC CGTTGGAAAA CGACTC 466
(2) INFORMATION FOR SEQ ID NO: 944:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 444 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 944:

TTAAAAACTT GGGCCTCCCG ATGGCACGCn GCCCCAAATG CTCAAAAATC CCCnGATTTT 60

TAGAGGCCCn TCCnCCGTAG CCACGGTGTA TTATCCCGGA CTAGCGAGTC CATCCTGGTC 120

CACGAATTGC GCAAAAACAG ATGTCGGGAT TTTAGTGGCA TGATTTCTTT TGAATTAGCC 180

AAGGATGAGC AAGTCGTTCC TTTCGTAGAA GCGTTGGAAA TTTTTACTTT AGCGGAAAGT 240

TTAGGTGGCG TAGAAAGTTT AATTGAAGTA CCTGCCATTA TGACTCACGC ATsTAkCCCC 300

AAAGAAAATC GTGAAGCCGT GGGAATTAAA GATGGACTCA TTCGTTTATC TGTAGGAATT 360

GAAGCAGTCG AAGATTTATT GGCTGATTTA GAAAAGGGAT TACAAGCATA ArCAAAAGGs 420

CCTCGTCTTT AGACGAAGGC GTTG 444



(2) INFORMATION FOR SEQ ID NO: 945:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 945: 



GATTTCnGCA CAGTCCAAAG TTATTGGTnA TTATTCCACC GGCnGATTGG AGGGAAAGAT 60

TTTTTAGTTG AAAAAGTATT GGTnCCTAaT GAAGTCAGkT TGaCgTwTmC CCACAATGAT 120

CGcATGATCt TTGGCGGTGT AACcACCGAC GACTGAGTcG TTAGAGATTA TTTTGAAmnA 180

AGAATTAGGT GTCGATTATT TcTTGGAACG GcGTGAATTA GGAGTCATTA ATATTGGCGG 240

TCCGGGTTTT ATTGAAATCG ACGGTCGAAA AGAAGCGATG AAGAAGCAAG ATGGCTATTA 300

TGTCGGGAAA GAAACTCGAC AAGTGATTTT CTCTTCAGAA GATGCAGCTG ATCCTGCGAA 360

ATTTTACATT AGTTCTGCTC CGGCGCATCA CAAGTACCCA AATGTAAAAA TCAGTATTGA 420

TGAGATTAAG CCAATGGAAA CTGGGGAAGC ATTGACATTA AATGAACGTA AAATTTATCA 480

ATATATTCAT CCCAATATTT GTGAAAGCTG TCAGCTGCAA ATGGGCTATA CGATTTTAGA 540

GCCGGGCAGT TCTTGGAATA CCATGCCG 568



(2) INFORMATION FOR SEQ ID NO: 946:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 946:

CTTTCCACTC CGCTTTTCGC AATCCTCAAG CTGGCTGtCT gTTCGCAcTG TAcgCTGtGA 60

tCTTATCATT gcACTTAgCA GCATctTTCt TTGcTTTTTC tCGACAACgC yTGTTGTTGG 120

TTAACGTCTG CTgAGCTTGA TCCACTACTT TTTGTTGGTC ATCAACAGCT TGTGTATCCG 180

TAGCCACTTG GTTTTTGGCT TTTTCAAkGG CTGAAGGTGT TGCTTCATCC ACAACTTTTT 240

TTGCTTCGTC TACGACCGCT TGTTGGTCAG TCACTGcTTG TTGACTTTGA TCCAATGCAC 300

CTTTGTTTTG ATCGACCACT GCTTGTTGGT CTTTAACAGA TTGATCAATG GTGTCTTTTT 360

CTTTTTTCGC AGTGTCAJCA ACTTGTTGTT TTTGATCGAC AATTGCTTGT TTCTCTGTTA 420

CTTGTTGCTG TTTTTCAGTA ATGGCTTGCT CCGTTGTTTC TGTAGCTTTC ACTGCTGGTT 480

GTTCTG 486



(2) INFORMATION FOR SEQ ID NO: 947: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 947: 

ACATAACGTT CCACGATCCC ACAAAAATGG CCCAATGCAC ACCCCTCAAT GATTCCTTTC 60 

AAGACGCTAC TAAAAAAGCT TATCACGATA TCGCtTCATA GAAAtCGCTt GTTgCATGAA 120 

CTGCTGAACG ATATCCCGTT AATAAATA 148



(2) INFORMATION FOR SEQ ID NO: 948:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 587 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 948: 

CTTGGTAAnn CTTCTTTGTG AAGCCAAGCC TTTCAATCGG GAAATATTTA AATGGAACTG 60 

CCCCCCATGT CTTTCAACAA TTTGATGGCG GTTTCGATAG GCACTTnCCG CCGTGGCGCA 120 

CCAGAGCTTA ACGGACCTGT CGCAATGTTG ACATAGCCAA CTTTGCCTGT GGGTGAAACC 180 

AAGCCATTAA TCACkGTTTC GTCTTGACGA AGTAAGGcCC CGCGAAGCGC CGACACCTGT 240 

GAAGACTTGG TTCACGTGTT GCGGTTGTAG AACTTCTGAT AAGCGAGCCA CCATTTGACT 300 

TTGATTAGGA TCGCCAGCCC CTAAGCCGAC AGATAAAGCG TTGTTGGTTG CTGCTTGGTA 360 

TTTTTTCATG TCGTCAATAG CTGCTTCATC GGTTTCATAG TTTTTTGAAA GGACGCCTAA 420

CACAACATGC CCTTCTGCTG CTTCATAACA GGCTTGGGCA TTTTCTACTG AATTGGCTAA 480

GACGTTTAAA CAAATGCGTT CTTCTAAATA ATTAGGTTTT AATGACATGA CTTATTTCTC 540

CTTTTTGTCC ATAATTTCTT GTCAATnTTG CACGATTTTG TTCATTn 587

(2) INFORMATION FOR SEQ ID NO: 949:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 468 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 949: 



ATTnCGTTGG GGTTGCCACC AATTATCCCC CCCnCAATTT AAGTGAAGTG ATTGATsCAm 60

CAAGTTTATT GATGGACaAT CCtGATGTAA CGACGAATGA ATTGATGGAA GTGTTACCTG 120

GACCAGATTT TCCAACAGGC GnTTTAGTGA TGGGGAAATC AGGGATTCGC CGAGCATATG 180

AAACAGGGAA AGGTTCGATT ACTGTTCGTG CAAAAGTTGA ATTGACTGAA ATGCCGAATG 240

GAAAAGAACG TATTTTAGTA ACTGAATTGC CTTATATGGT GAATAAAGCC AAATTAATCG 300

AACGAATTTC TGAATTACAC AGAGATAAAC GAATTGAAGG AATTACTGAT TTGCGGGATG 360

AATCTTCTCG TGAAGGCATG CGGATTGTCA TCGATGTTCG TCGAGATGTG AGTGCCTCTG 420

TTGTGTTAAA CAACTTGTAC AAAATGACTG CCTTACAAAC ATCTTTTG 468



(2) INFORMATION FOR SEQ ID NO: 950: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 950: 
CGGGGAAATG AAGTACAGGC AGCAGAACAA GCGCAACCAA AAACACCTGA AAACAGTTCT 60 
ACAGA 65
(2) INFORMATION FOR SEQ ID NO: 951: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 951: 



ATCAAGCAGA TGCAAGTCTA AACGAGAAAG ACTTAAAAGC TGTTGCTGCA GCGATTAATT 60

CAGGAAAAGC TATAGACGTG ACTGCTTCTT ATGTTCTTAA TTTAGATCAA AACACCGTCA 120

CAGCaATGAT GAAAACCAAC GCAGACGGTT CCGTTGTTTT AGCAATGGGG TATAAATATT 180

TACTTGTCTT GCCGTTTGTA GTGAAAAATG TAGAAGGCGA TTTTGAAAAT ACAGCTGTTC 240

AGCTGACAAA TGATGGCGAA ACGGTAACAA ATACAGTGAT TAACCATGTG CCGGTTAGTA 300

ATCCTTCCAA AGATGTAAAA GCAGATAAAA ACGGTACAGT TGGCAGTGTT TCTCTACATG 360

ATAAAGATAT TCCGTTACAA ACAAAAATTT ATTATGAAGT GAAATCTTCC GAACGTCCAG 420

CTAACTATGG CGGAATTACC GAAGAATGGG GCATGAATGA TGTCTTGGAC ACGACCCATG 480

ATCGTTTCAC AGGTAAATGG CACGCTATTA CAAACTATGA CCTTAAAGTA GGGgACAAAa 540 

CGTaAAAGCA GGAaCAGATA TTTCTGCCTA CATTCTTTTA GAAAACAAAG ACAATAAAGA 600 

CTTGACGTTT ACAATGAATC AAGCATTATT AGCAGCGTTA AATGAAGGAA GCAATAAAGT 660 

AGGCAAACAA GCTTGGTCTG TATATCTGGG AAGTCGAACG GGATCAAAAC AGGTGACGTA 720 

GAAAATACGC 730 



(2) INFORMATION FOR SEQ ID NO: 952: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 758 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952: 



TTTCAATAGG GACTGTTGGC TGTTGTGTAC CATTTCCAGC GTGATTTAAT TCCCCTGTTG 60

GCGTTACAAC AACGTCGATT GGCTTGCTTG GTTCTGCTGG TTGTACTGGT TTTTCAGGTT 120

CTGTTGGTTT AGTTGGTGTC ACAGGTTTTT CGGGTTCTGC TGGTTTGCTT GGTGTAACTG 180

GTTTTTCAGG TTCTACTGGT TTACTTGGCT CCGTTGGTTG TTCTGGTTCA CTTGGCTCTG 240

TAGGAGTTGT CGGTTCTGTT GGTGTTGTCG GTTCCGTTGG AACTACTGGT TTTTCAGGTT 300

CTGTTGGTGT TGTCGGCTCT GTCGGTTCTG TTGGCTCCGT TGGTGTAATA GGTGGCTCTA 360

CAGGGTCTAC AGGTGGCTCA ATTGGCGGTG TTGGATCTGT TGGCTTTTCT GGTGTACTTG 420

GTTCTGTTGG TACTTCAATG GGCGGTGTCG TAGTATCATC AGGGATAATT TCTTGTGCAA 480

AAGCAGAAAT TTCACTTCCC CCAAATAGTA GTGTGGCACT AAATAAACTA GCAAAAAGAT 540

TTTTTTTCAT CATTATTTCC TCCATTTTAA TTTTAATTTG TAAGTTCAAA ATCTTCTTCT 600

GTCGGTCGTT CCATTACGAC CAGTTGTTCT TTTGTTcTTC AGCTGTAAAC GTAATTTTCT 660

TTGTCGATgC TCGTCTTTtT CATTGATTAA TACAACAATG AACGcTaTCG TCCATkGTTg 720

kCTTCCTCTT TACTCCCAGG GAAACATACT TCTCGCTC 758



(2) INFORMATION FOR SEQ ID NO: 953: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953: 
AATGCCACGG TTGAnATCCG AAATGCAGGA GGnGCCGTGA TaGGTACAGG TACTGCTGAT 60 
GGGACAGGGG CATTTACAGT TACCATTCCC GCAGGTGAAG CAGGTGCGAA TGAAACGTTA 120

ACCGCCGTAG CGAAAAACGC CAGCGGTACA GAAAGTACGC CAACAACGTT CCAAACGCCA 180

GCGGATCCTA ATACGCCCGT GGCGACGCCA ATTGTTGAGA CTGTAACAGG TAGTACAACA 240

AAAGGCTATG AGGTCAAAGG GACTGCTGAA GTTGGCACCA CCATTGAGGT TCGCGATGCA 300 

GCTGGCACGG TCCTTGGTAC TGCAACAACT GGAACTGACG GAAAATATAC AGTGACTTTA 360 

GATTCAGGAA CAGCAACAGC AAATCAAACG CTGAGCGTTG TAGCGAAAAA CGCTAGTGGC 420

ACGGAAAGTC AACCAGCAAC GGCGACAAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA 480

TTACATCACA GGnAACTTT 499
(2) INFORMATION FOR SEQ ID NO: 954: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 954: 



CTGTTAGGCA TTCATTTACT ATTTTGAGGT GTCTGGCTGA ACAnTTGTTG TTCCTTTTAT 60

TTCGTGGTCC TCAGCGTAAT GGCACTTCTG GTCTTCTGGT GGATGnGCCA TCACCTACTT 120

TGGCTTTGTG AGAATGTTCG CnCCTTCGCT TCAnCGTTAA CGATCACTTT GAAACGAATA 180

CTCCGTTCTT TCGTATCGGT GATACTTGGG TATTCCGCTG TGATTGTCCG TTGGCTTCTT 240

TCAAGCTGGT TGGTTTTGGT TCATCACCGA CACTTGTTAG GCTATCTTTG ACATACGTCA 300

GTCCTTTTGG TAGTTGGTCT GTGATAACTA CTTTGTTTAG GACACCATTT TCGATGGTGT 360

TGCGGAAGCT AATTGTGTAC TCGATAGTTT CGCCTAGTTT TGGTTTGGCG TTATTAACGG 420

TTTTTGTTGC GGCTAGTTTG CCTTCTTTCG CTTCTGGTAC CACCGGCACC TCTGGTTCTT 480

CGGGCGGATT AACTGTGTCA TCTACTTTGG CTTTGTTGAG AATTGTTTCG CCCGCTTTCG 540

CTTCTTCGTT AACGATCACT TTGAAACGAA TACTCCGTTC TTTCATATCG GTGATACTTG 600

GGTATTCTGC TGTGATTGTG CCATTGGCTT CTGTCAAGCT GGTTGGTTTT GGTTCATCAC 660

CGACACTTGT TAGGCTATCT TTGACATACG TCAGTCCTTT TGGTAGTTGG TCTGTGATAA 720

CTACTTTGTT TAGGACGCCG TTTTCGATGG TATTGCGGAA ACTGATTGTG TACTCGATCG 780

CTTCCCCTAG TTTTGGTTTA GCGTTATTAA CGGTTTTTGT TGCTGTTAAT TTGCCTTCTT 840

TCGCTTCTGG TACCACCGGC ACCTCTGGTT CTTCGGGCGG ATTAACTGTG TCATCTACTT 900

TGGCTTTGTT GAGAATTGTT TCGCCTGCCT TGGCTTCTTC GTTAACGATC ACTTTGAAAC 960

GAATACTCCG CTCTTTTGTA TCGGTGATAC TTGGGTATTC CGCTGTGATT GTGCCGTTGA 1020

TTTCTTTCAA GCTGGTTGGT TTTGGTTCAT CACCGACACT TGTTAGGCTA TCTTTGACAT 1080

ACGTCAGCCC TTTTGGTAGT TGGTCTGTGA TAACTACTTT GTTTAGGACG CCGTTTTCGA 1140

TGGTGTTGCG GAAcTGATTG TGTACTCGAT TTCTTCCCCT AATTTTGGTT TGGCGTTATT 1200

AACGGTTTTt GTTGCGGTTA ATTTGCCTAC GTTCGTTTCT GGTACCACAG GCACCTCTGG 1260

TTCTTCGGGC GGATTAACTG TGTCATCTAC TTTGGCTTTG TTGAGAATTG TTTCGCCTGC 1320

CTTAGCTTCT TCGTTAACGA TCACTTTGAA ACGAATACTC CGCTCTTTTG TATCAGTGAT 1380 

ACTTGGGTAT TCCGCTGTGA TTGTGCCGTT GATTTCTTTC AAGCTGGTTG GTTTTGGTTC 1440

ATCACCGACA CTTGTTAGGC TATCTTTGAC ATACGTCAGy CCTTTTGGTA GTTGGTCTGT 1500 

GATAACTACT TTGTTTAGGA CGCCGTTTTC GATGGTGTTG CGGAACTGGA TTGTGTACTC 1560 

GATTTCTTCC CCTAATTTTG GTTTGGCGTT ATTAACGGTT TTTGTTGCGG TTAATTTGCC 1620 

TACGTTCGTT TCTGGTACCA CAGGCAnCTC TGGGTTCCTC GGGCGGAATA ACTGGGGCCA 1680 

CCAACTTTGG GCTTGGTTGA nAAATGGTTC GCCTGCCCTA nCCTCCTCCG GTAAAGATCA 1740

ACTTGnAACG AAA 1753 



(2) INFORMATION FOR SEQ ID NO: 955: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955:
GAGGATCCCC GTGACTATGC TACTGCAATT CTTGTTGTAC CAACCGGGAG CGTCAATAAT 60 

TTTGTGTAAA TAACTCGTCC TTCTGCAAAA TAATGGGTTA CTCAGTAAAC ATTGAAGCTA 120 

ATGTATCCGT CACTTGTTGG AAGCCTTTAT GGCTTCTATT TAAGAATTTT TGATTGTATG 180 

TATCAAAGAT GGATACTAGG AAACGCTCTA GTGATTCTTC ATTTTGAAAC TGCTCTTTTC 240

TGCGGCTGTA TTTCTTGATT TGTTTATTGA AAGATTCGAT TAAATTAGTT GAGTAAATGC 300 

TTCGGCGGAT ACCAGGCGGA AACTCATAAA AAGTCAATAA ATCTTGATTT TTTATCAGTG 360 

ACTGCGTCAC TTTAGGATAA GTTTTCTGCC ACTTCTCAAT CATACTCCCT AAAAAGGTAT 420

TTGCCTCTTC CTTTGAGCTA GCTTGATAAA CAGCCTTAAA ATCATCACAG ATTTCTTTTC 480

GATCTTTGAC ACGTACTTTA 500 



(2) INFORMATION FOR SEQ ID NO: 956: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 956: 



ACGACACGTG TAGGCACGCT AGAATTAAAA GTACCCAGAA CACGTGATGG CCATTTTTCA 60 

CCCACAGTGT TTGAACGTTA TCAACGAAAC GAAAAAGCCC TCATGGCTTC AATGTTGGAA 120 

ATGTATGTAT CAGGCGTTTC AACTCGTAAA GTATCAAAAA TTGTGGAAGA ACTTTGTGGT 180 

aAATCCGTCT CTAAGTCCTT CGTTTCTAGC TTAACAGAAC AGCTAGAACC TATGGTTAAC 240

GAGTGGCAGA ATCGTTTATT ATCAGAAAAA AATTATCCTT ACTTAATGAC CGATGTACTC 300

TATATAAAAG TAGGAGAAGA AAATCGAGTA CTCtCAAAAA GCTGTCATAT AGCGATTGGA 360 

ATAACCAAAG ATGGCGACCG TGAAATTATC GGCTTCATGA TTCAAAGTGG CGAAAGgAAG 420

AGACCTGGAC AACATtTTTT GnATACCTAA AAGAcGCGGT TTACAAGGTA CGGgAACTCG 480

TTATTyCTGA TGCGCACAAA GGGATTAGTC TCTGGCCATT AGGAAAATCC TTCACCACGT 540

AGGnGCCAAA AGATGCCAAG TTCACTTCCT AAGAnATATC TTTACCACCA TTCCTAAAAn 600 

ATTCAAATCT TTCAGAGAGC GGTTAAGGAA TTTTAGTTCC AGTATTACTA GCGCGG 656 



(2) INFORMATION FOR SEQ ID NO: 957: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 957: 



CCCTTATTGG GGAAaGgCGG CAAtAAAAAA GTAATTGTAA AATACTGTTA GGGAATCATT 60

AGATAAAGTC CGTATAATTG TGTAAAAGTA AAAAGGCCAT ATAACAGTCC TTTTACGGTA 120

CAATGTTTTT AACGACAAAA ACATACCCAG GAGGACTTTT ACATGACCCA AGTACATTTT 180

ACACTGAAAA GCGAAGAGAT TCAAAGCATT ATTGAATATT CTGTAAAGGA TGACGTTTCT 240

AAAAATATTT TAACAACGGT ATTTAATCAA CTAATGGAAA ATCAACGAAC AGAATATATT 300

CAAGCAAAAG AATATGAACG AACAGAAAAC CGACAAAGTC AACGAAATGG CTATTATGAG 360

CGCAGyTTTA CGACACGTGT AGGCACGCTA GAATTAAAAG TACCCAGAAC ACGTGATGGC 420

CATTTTTCAC CCACAGTGTT tGAACGTTAT CAACGAAaCG AAAAaGCCCT CATGGCTTCA 480

ATGTTGGrAA TGTATGTATC AGGCGTTcAA CTCGTAAAGT ATCCAAAATT GTGGGAGACT 540

TGTGGTAATC CGTCTCTAGn CCTCGTTCTA GCTAACAGAC AGCTAGACTT GGGnACGGGG 600

CGATCGTTAT TCAGAAAAAT TCCTCTTATG nCGTGACCTT TAAGTCGGAG AAACGG 656



(2) INFORMATION FOR SEQ ID NO: 958: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 958: 

CTGTCGTTTC TTTTTGGTAG TACTCACCTT CTTTATTTTT GTGTGTATTT TTAAGCTTAT 60 

CGAACAACGC TTCATACTCG TTATTTATTT GCGCCATTTC TTTATCAGTA CCGCCCATGT 120 

CTGGGTGGTA CTTTAAAGCT AGCTTTTTAT ATACGCGTTT CAATTCTTCT AATGTCGTTA 180 

CATCTTTGAT AAATTTCATG CTAATCGCCC TTTCTGATTG CTGCTGAACA GCTAAGCTGC 240

CCAGCGTTCC AATTTTTCAA TTGATTTTTT ATCATTTTCA AGGTCTGCTT TTGTGTACCA 300

TGCTGGTATC TCTTTATCAG AAAATAGACT GTAGATAAAT TCTAAGATTT CCATGCkTtC 360 

TCTAGCTTTA ACAGCTGATT CACATTGATA ACGTAATATC tTACAyCTgg CTAAGTCTAT 420

CGACATAgAA TAATGAAACG TTgGtTCCTc yACCATTtGt TAACcGtTTT TCCnTnG 477



(2) INFORMATION FOR SEQ ID NO: 959: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 628 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 959: 



GTAAAGCCCC nGCCCACCTT AGAACCACTT AGTGGTTTAA TTCCATAAAC TATTTTTAGG 60

AGGCTTACAG AAATGAAAAA AGCAAAATTA TTCGGTTTTA GTTTGATTGC ATTAGGTTTA 120

TCCAnTTCAC tTGCAGCATG TGGTGGTGGC AAAGGCAAAA CCcTGAAgCG GCGGTgGcAA 180

AGGGGATGCA GCGCATAGTG CTGTAATCAT TACAGATACA GGCGGCGTGG ATGACAAGTC 240

GTTCAACCAA TCTTCTTGGG AAGGATTGCA AGCTTGGGGT AAAGAACATG ATTTACCAGA 300

AGGTTCAAAA GGGTATGCAT ATATTCAATC GAATGATGCA GCTGACTATA CAACCAATAT 360

TGACCAAGCG GTATCAAGTA AATTCAACAC AATCTTTGGT ATTGGcTACT TGCTAAAAGA 420

TGCAATTTCT TCTGCAGCAG ATGCCAACCC TGATACAAAC TTTGTTTTAA TCGATGATCA 480

AATCGATGGG CAAAAAGAAT GTCGTTTCTG CAACCATTTA GGAGATAATG AAGCCAGCTT 540

ACTTAGCCGG TGTTGGCTGC TGCAAATGGA AACCAAAACG AACCAAAGTC CGGTTTTGGT 600

TnGGTGGTTG AAGAAGGGGT CCGTAATT 628



(2) INFORMATION FOR SEQ ID NO: 960: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 960: 



CTTTACCTCn TGnGAAATAG TACCTGCACT GCCACAATnC CAGGCGTTTC CATTCGTAAT 60 

CGATGATGGA GCCAGGAATT GCTATCCCAT AGCCATTGGC GCCCTTGGTC ACATCGTTAT 120 

CGCCAATAAA TGTTTTTCCA GTCGCTTTTT CAATCCCATG AAACGTTGTT GGCGTAAGTT 180 

TTGTAGCATT ATTGACCGTT GGATTAATCG TCAAGACCTG ATAGGCTTTG ACATTTCCAA 240

ACCAAGTCCC TGTGCTATCC ACCAAAGCAG TTGTATCTGT CGTTGGTTTA ATCCATAAAA 300 

GTTCATCTTT TGTTGGATCG ACAGCTTCAA AATCCGCAAT GTTTACTTTA CtGTACCAGG 360 

AATTGTTTCT AATGTTGCAT GATCCTGaAC CGTTAGTTTG CTAGCTGAGA TTAAATTGGc 420

TGTGTTGGTG aTCCCAATTC CAGCGTTTTT AGATAGATGA ATkGATCCGA ATTCATGGnA 480

AGTCGTACnG GGGTCGCGGC AGTAGCAGAA CGCCATTTTG GATGTnCCAC CATAGCCATA 540 
AATTAAnGCC ATGGGGGCAT TAATCC 566 
(2) INFORMATION FOR SEQ ID NO: 961: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961: 

GTGTTCGATT TTTATGATGA TACCAATGTG AAACTAGTCC GGCCCTTTGG CGATACGATG 60 

GGGCTACATG AAGAATTCCT ACCAGCTAGT AAACGGTATA TGAATGACTA TATCCAGTAT 120 

GTAACATCTG ACTTTTTAGC AGGACTTGGC TTTGGCGCCA CTCAAATGTT AGGAGAGTTG 180 

GAAgGGATTT ACTTTGGCTA CAATGCGGAT ACGGGGCGTA ATGTTTACCT GAAACCCGCA 240 

TTAGCCTCAC AAGGAGTAAA GGGCTCAGTA ACCAATGCAT TAGCTGCGGC TTTTCTTGGT 300 

TCTCTTGGTG GCGGAAAATC CTTTAGTAAC AATCTCTTGG TCTACTATGC TGTTCTTTTT 360 

GGTGGCCAAG CTGTGATTGT CGATCCAAAA GGAGAACGAg TGGTTGGAAA gAAAACTTGC 420 

CAGAAATTGC nGATGAAATC AATAtCCyCA cTTGACyAGt AAaCCrGAAA TCAAGGCnTA 480

CyTGATCCCT AATGGGATTA TGAAGAAAAG AAGATCAGAA GTTAGCATTA TTCC 534 
(2) INFORMATION FOR SEQ ID NO: 962: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962: 
TAAAAATTCT AATTTGATTT TATCCAGCCC AATTTCTTTT TTCCCTTTTT CCCAGAATTC 60 

TTTTGCTTTC GCTGTATCGT AGGTTAAGTA GTTATTTTTC TTAGCAGCTA GTTTTGTGAA 120 

ATCTGTTCCA TCTGGCGCAA TCGTTTGTCC CGGCGGAATT ACTTGATCTG TAGCCGTTGA 180 

TTGATCTTGC AAGACTTCkT TTACAAAACT TTCTTTATCA AAAGCTTGTG CCAACGCTTT 240

ACGAATATTG TTGTTCGCTA AAGCCGGATT TTCTTTTCCA TCACGAACGC TGTTCATTTT 300 

TAAAAAgTAT GTGACGAATT GAGGAATTGA AAGAAAGGCT GGATTAwCmA CaTAACCAGG 360 

AATAAATTCA CCAcTaGCTT AwTCACTAcA TCCAtTCATc GAGyCAAmCA AtTCACCGcT 420 

GTCGTGGGGC CCTAATACTG AACTTTCGCT CnGTAATTAC TT 462
(2) INFORMATION FOR SEQ ID NO: 963: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 661 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963: 



GCATTATTAG CAGCGTTAAA TGAAGGAAGC AATAAAGTAG GCAAACAAGC TTGGTCTGTA 60

TATCTGGAAG TCGAACGGAT CAAAACAGGT GACGTAGAAA ATACGCAAAC AGAAAACTAC 120

AACAAAGAAC TTGTTCGTTC TAATACGGTG GTGACGCATA CACCTGATGA TCCAAAACCA 180

ACCAAAGCCG TTCACAACAA GAAAGGGGAA GATATTAACC ATGGAAAAGT TGCTCGTGGT 240

GATGTTCTTT CTTATGAAAT GACGTGGGAs TTAAAAGGGT ACGATAAAGA CTTTGCTTTT 300

GACACAGTCG ATCTTGCGAC AGGAGTTTCT TTCTTCGATG ATTACGACGA AACGAAAGTG 360

ACcCAATCAA AGACTTACTT CGTGTCAAAG ATTCTAAAGG GGCAGACATT ACGAACCAGT 420

TCACGATCTC TTGGGACGAT GCCmAAGGCA CGGTGACmAT CTCTGCCaAA GACCCmCAAg 480

CCTTTATTCT AGCGTwTGGT GGGCAAGAAT TGCGTGTAAC GTCCCTACAA AAGTCAAAGC 540

CAATGTTTCT GGTGATGTTT ATAATTCAGC GGAACCAAAT ACATTTGGTC AACGAATTAA 600

AACAAATACC GTTGTCAACC ATATTCCAAA GTGGACCCTA AAAAAGACGT GGTTnTAAGT 660

G 661
(2) INFORMATION FOR SEQ ID NO: 964: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964: 

AAGAACCGGA TATTTTTCTT TGATTTTATT CCAAAAGACA TTGTACATTT TCACATTTTG 60 

GACATAGTAA AGTTCGGCAA ATAAATGTGT AGCAAACAAA TAATAAAAAA TTTGCGGACG 120 

AATTTTCTCC AATTGCGGCG TCATGTGTAT TTCTTCTAAT AAGAAATGTT GGACTAAATG 180 

AGTCAACATG TAAACAGGTG CTTCAGTCTC ATCAAAATGC GCTAAAATTG CTTGTGCAGT 240

GGCTTCATCT TGATAAAATT GGTCATTGGC TAACAAGAAA AAGTACACAA AAGTTAAATC 300 

GTTTTTGTTA AGGAAAAGGG TTGTAGCTGT CATCTCkkCT ATAAAGGCTg GTAATGTTTA 360 

TTACCTTCTA AATAAGTtTC TTGCTCGGCT GAAGGTCGAT ATTCGGCGCC TTGCGCACTA 420 

CGGACTTTTG AGATAGCGAC CATATAwTTA ACTCGGCGTG kCACCGTTTC CGCAACTTCT 480

ACGTGAAAAA AGCCATCATT TTGGCGGTTG TGCGACTTCC GTTCTGTAAT AGnCAAAAGC 540

CAGTACACGG ATTAnTCATA AAATAGTGTA CCAAAGGGCA ATCGC 585 
(2) INFORMATION FOR SEQ ID NO: 965: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 721 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 965: 



nCGGGAAGGn AACTGTAAAG CCCCTTCCCA TCAGCAGTAC CTGACCTATC ACGGCACCTC 60

CGCATTCGnA TCTCAACCGT GGCATGGCAT CGGCAGTTCC TTAAACTCGT AACCTGCTGT 120

GGAATTACCT GTCACTCCTG TGATTGTTGG TGCGGTTACG GTTGCTTCAT CCGCTGGTGT 180

nTGGAACGTT GTTGGCGTAC TTTCTGTGCC GCTGGCGTTT TTCGCTACGG CGGTTAACGT 240

TTCATTGGCA CCTGCTTCAC tGCGGGAACG GTAACTGTAA ACGCTCCTGT CCCATCAGCG 300

GTACCTGTGC CTATTACGGT GCCTCCTGCA TTTCGGATTT CAACCGTGGC ATTGGCATCG 360

GCAGTTCCTT TAAtTCGTAA CCTGCCGTTG AATTACCTGT CACTCCTGTG ATTGTTGGTG 420

CGGTTACGGT TGCTTCATCC GCTGGTGTTT GGAACGTTGT TGGCGTACTT TCTGTACCGC 480

TGGCGTTTTT CGCTACGGCG GTTAACGTTT CATTGGCGCC TGCTTCACCT GCGGGAACGG 540

TAACTGTAAA CGCTCCTGTC CCATCAGCGG TACCTGTGCC TATTACGGTG CCTCCTGCAT 600

TTCGGATTTC AACTGTGGGC ATTGGCATCA GCAGTTCCTT TAACCTCGTA ACCTGCCGTT 660

GAATTACCTG TCACTCCTGT GATTGTTGGT GCGGTTACGG TTGCTTCATC CGCGGGCGTT 720

721



(2) INFORMATION FOR SEQ ID NO: 966: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 513 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966: 

AGAAAAAGTG TGAGCCTTGT GCTCACGCTT TTTTGTGCAA TTGAAGTTGG TGAATCTCAG 60 

CGAAAAGAGA AAGGTACCTT AAATAGTGTG TTTCTGGTCA ATAGCACAAT GAAATATCTA 120 

AATACATTAT GAACTGCTTT TCGTTTTATG AAGAGCAGTT TGTGCTATTT GAGAAATTGC 180 

TCAATTTAAT CATTAAGTAG TTTCATCAAT GGTGACTATG CTACTGCAAT TCTTGTTGTA 240

CCAACCGGGA GCGTCAATAA TTTTGTGTAA ATAACTCGTC CTTCTGCAAA ATAATGGGTT 300 

ACTCAGTArA CATTGAAGCT AATGTATCCG TCACTTGTTG GAAGCCTTTA TGGCTTCTAT 360 

TTAAGAATTT TgATgTATGT ATCAAAGAtG GATACTAGGA AACGCTCTAg TGATtCTTCA 420

TTTGAAACTG CTCTTTcTGC GGCgTATTct GATTGTTaTg AAgATCGATA AATAGTgAGT 480

AAAgCTCsGC GaTACCAGcs GAAACTCATA Ann 513
(2) INFORMATION FOR SEQ ID NO: 967: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 534 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967:

CAGTTATCCC CCnGAGACCC AGAGGTGCnT GTGGTnCCAG AACGAACGCA GGCAAACTAG 60 

CCGCAACAAA AACCGTTAAT AACGCTAAAC CAAAACTAGG CGAAACTATC GAGTACACAA 120 

TTAGCTTCCG CAATACCATC GAAAACGGCG TCCTAAACAA AGTAGTTATC ACAGACCAAC 180 

TACCAAAAGG ACTGACGTAT GTCAAAGATA GCCTAACAAG TGTCGGTGAT GAACCAAAAC 240 

CAACCAGCTT GAAAGAAATC AACGGCACAA TCACAGCGGA ATACCCAAGT ATCACTGATA 300 

CAAAAGAGCG GAGTATTCGT TTCAAAGTGA TCGTTAACGA AGAAGCTAAG GCAGGCGAAA 360 

CAATTCTCAA CAAAGCCAAA GTAGATGACA CAGTTAATCC GCCCGAAGAA CCAGAGGTGC 420 

CTGTGGTACC AGAAACGAAC GTGGcAAATT wACCGCAACA AAAcCGTTAA TAACGCCAAc 480

CAAATTgGGG AgAATCGgTA CACmATCAGt TCGCACACCA TCGAAACGGG TCTT 534 
(2) INFORMATION FOR SEQ ID NO: 968: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 968: 

TAATAAAATC CGTAAGAAAA CAGTAAAGTA TAAAGATAAA GAATATGATC GAATCAACTT 60 

ATTGTTAGAC AACGATATGA ATAGCGAAGA AAATAATAAG TTAATGGAAG AAGTTGTAAA 120 

TTACGAAAGT AAAGACTTAA AAAACCGTTC AATTATGTAT TATAACAACT GGACCAATAC 180 

TTTTTATGGC TTGTTTTATC TGGTAGTTAA CTTCATTCAA ACAGTGGTGT ATTTCCTGTT 240

AAGTTTCTnA CGATTAATTA TTGCGGTCAT TCAATTGTTC CTATTGCCGT TGTTACCGTT 300 

ATTGTTGTTC GCAGGACTCT TTTTAACTGA AACGAACGTC TTTGCGAATT ACTTTAAAAC 360 

CTTTGGCATG ACTATTTTTA TGAAGGGAAT GGTTGGTTTT GCGACTATTT TCTTTGCAAG 420

TTTCTTATCA CTAGGGTTTC AATTAAGCAA TCAGACAGAA AACGTGTGGC AGAAGATATT 480 

AACGATACTT ATTTATCTTC TTACACCGCT TGGCTTGTAT GTCTTCCGGA AATTCTTTGC 540 

TAATCTTGTC ACGGGACGTG TTTCACTTTC TGATGGTGTC GGGTTTATCG GAAaTCCTTT 600 

TGGCACAGAA GCTAATATGC GTAGAGCGGC AAAAGAACAA AAGCaAGAGa ATAAGGAGCG 660

CAnGnAACAG GCACAAGAAG AACGTAAAAG CAGCTATTAA AAAACGTCCA GAAGnAGCCA 720 

AGGGAACCAT GGTAAAGCAG GAAACTGGCn TAAAACCAAn GGCCATTAAA TAAAGGACCA 780 

AGGAAAACGT G 791
(2) INFORMATION FOR SEQ ID NO: 969:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 969: 



TGCTCGCGAA CAAAGAGCTG CCACACAGAA GGAGTTGTTA GAAACAGCAC CAGAAGCTAG 60

TCTCTTATCT GCCACAATGA ATATTCCTGG AGAAGTAAAA AATTCGCCCA CCTTAACGGC 120

TGTTTTTTTA GAAGTAATTG ACGAAtCGAG CAACAATTGC TGGATCAAGT ACCGATTGTT 180

AATTTTTACC GGAATGAAAA AACAGGTCCT GAATATTATT TGGCGGTTTC GTTAGCCCCA 240

CAAGAATTAA AACAAAGAAT GGTCAAAATT GAAGAAACAC ATCCTTATGG TCGTTTAGTG 300

GACCTAGATG TTTTATGGGG AAtGAGGAGC TAAAAAGCCT TCATCGAGGA GATTTAGGCT 360

kGCCACCACG GCGCTGTTTT AtTkGCCAAG aAGTGGCcGA AAaGTCTGTn GaCGAAATCG 420

asGTCACAGT CTtGaAGCAt GCCAgaAgAA ATaCCggAAT AATTTgaCcC GaAAGGGcac 480

CAAGTGGGTA AAAAAATCCG TTTAACGGAA A 511



(2) INFORMATION FOR SEQ ID NO: 970: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 970: 



CGTTTGATGT CTTATCCGAT GAGGTCGTGA TGCAGTTGCC TTCTGCACCT AATTATCCTT 60

CAATTTATAG TTCTCTAAAC AAATTATTAA CCTTTGAGTC CTACAAATTA GAAGAGTTAA 120

CCCCGAATCG AATTTATAAT AAAGTTGTTG TTGCGATTGA TGAAGCGTAT TTAAATGAAC 180

GCATCAAGGA AATTCCAGCC TCTTTTTATG AGCGCTTTGA AATTATTAAA ACGCGAAATA 240

ACTTATTAGA ATTTATGCCA AAAGGGATTA CGAAAGCCTA TGGTATTTCC TTGTTAGCCA 300

AAGATTTAGG AATCCGTGCA GAAGAAATCA TGACCCTAGG GGACGArGAA AACgATTTaC 360

CGATGATCGA ATACGCTGGG CTTAGGGTGt CGCCATGGGC AAATGCGatT CCaTTAGGtT 420

AAAgAAGCCG CaGATGTCgT CACaGACmCg GATGrCCCAg AttGGCGkTG cCAAAGCGGG 480

TnGAAAATnT AnTTTAACCC CGTTGGGAAG GGGGCCATAA GA 522



(2) INFORMATION FOR SEQ ID NO: 971: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 971:

GGCGGACGAG CCTTCCCCAG GAAACCTTAG TCATACGGTG GACAGGATTC TCACCTGTCT 60

TTCGCTACTC ATACCGGCAT TCTCACTTCT AAGCGCTCCA GCCGTCCTCA CGATCGACCT 120 

TCAACGCCCT TAGAACGCTC TCCTACCACT ACACCTAATG GTGTAGTTCC ACAGCTTCGG 180 

TAATATGTTT AGCCCCGGTA CATTTTCGGC GCAGGtCACT CGACTAGTGA GCTATTACGC 240

ACTCTTTAAA TGGTGGCTGC TTCTAAGCCA ACATCCTAGT TGTCTGTGcA ACCCACATCC 300 

TTTTCCACTT AACATATATT TTGGGGACCT TAGCTGGTGG GTCTGGGGCT GGTTTTCCCT 360 

TTCGGACTAC GGGATCTTAA TCACTCGGCn GTCTGACTCC GGGATATAAA TGAATGGGCA 420

TCGGAGTTTA nCTGAATCGG TnACCGAGAT GGGGCCCAAG GTCAAAAGnG 470 



(2) INFORMATION FOR SEQ ID NO: 972: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 972:



CGCGTAGAGG ATCCCGCCTG ATGATCCAAA ACCAACCAAA GCCGTTCATA ACAAAAAAGG 60

GGAAGACATT AATCATGGAA AAGTGGCTCG TGGTGATGTT CTTTCTTATG AAATGACTTG 120

GGACTTAAAA GGGTACGATA AGGACTTTGC CTTTGATACA GTCGATCTTG CGACAGGCGT 180

TTCTTTCTTC GATGATTACG ATGAAACGAA GGTGACACCA ATCAAAGACT TACTTCGTGT 240

CAAAGATTCT AAAGGGGAAG ACATTACGAA CCAGTTCACG ATCTCTTGGG ATGATGCCAA 300

AGGcACGGtG ACGATTTCTG CCAAAGAcCC ACAAGCCTTT ATTTTGGCGC ATGGTGGGCA 360

AGAATTACGT GTAACTTTAC CAACAAAAGT TAAAGCCAAT GTTTCTGGTG ATGTGTATAA 420

TTTAGCGGAA CAAAATACAT TTGGTCmACG AATTAAAACC AwTACCGTTG TCAACCATAT 480

TCCAAAAGTG rACCCTAAAA AAGACGTGGT TATTAAAGTC GGTGATAAAC AAAGTCAAAA 540

TGGTGCCACA ATCAAATTAG GGGAGAAATT CTTCTATGAA TTTACAAGTA GTGACATTCC 600

TGCAGAATAC GCTGGTATTG TGGAAGAATG GTCGATTAGC GATAAACTAG ACGTCAAACA 660

TGACaAATTT AGTGGCCAAT GGTCTGTGTT TGCCAATTCT ACGTTTGTCT TaGCAGACGG 720

AACCAAAGTG AATAAAGGGG ACGACATTTC GAAACTATTC ACGATGACCT TTGAACaAGG 780

GGTAGTGAAA ATCACAGCCA GTCAAGCCTT TTTaGATGCG ATGaATCTAA AaGAAAaCCA 840

AAACGTTGcA CACTCATGGa AGCGTcATTG GTGTAGACGA tTGCGGcAGa GcGTtcACaC 900

aTCGAGAtCT TTCaCaTGGG aGTTaACyaT ccGGTTTGnC nCT 943



(2) INFORMATION FOR SEQ ID NO: 973: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 475 base pairs 

(B) TYPE: nucleic acid 




(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973: 

nCCAGGTACT AAGAkGTTTC AGTyCTCkGC GTCTACCTCT AATCAGCTAT GTATTCACTG 60 

AAAAGTAATA TCCTATAAAA GATATTGGGT TCCCCCATTC GGAAATCTCT GGATCATAGC 120 

TTACTTACAG CTCCCCAAAG CATATCGGTG TTAGTCCCGT CCTTCATCGG CTCCTAGTGC 180 

CAAGGCATCC ACCGTGCGCC CTTATTCACT TAACCTTATC AACcTTACGG TTGGGTCTTG 240 

ATCACTTCTT CTAGCGATAG AAGGTTAATC AATAAATAAG CAATTGAACT TATTAAAAAA 300 

CTCATTCAAC GCGGTGTTCT CGGTTTGTTT TGATTCTTTT TACTTCAATA TCCAGTTTTC 360 

AATGAACGAA TGTTTTGAGA GTAGACCTCT CAAAACTGAA CAAAGTAAAG ACGAAATGTG 420 

TATTTCCGTA ATATCCCTTA GAAAGGAGGT GATCCAGCCG CACCTTCCGA TACGG 475 



(2) INFORMATION FOR SEQ ID NO: 974: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 382 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974: 



TTGAAGACAG CCGATTCAGG TTACTTwACA CGTCGTTTAG TTGACGTTGC CCAAGACGTA 60 

ATCATtCGTG AAGACGACTG TGGCACTGAC CGTGGTCTTG AAATCGAAGC AATTCGTGAA 120 

GGTAACGAAA TTATCGAACC ATTAGACGAA CGTCTATTAG GTCGTTATAC ACGTAAATCA 180 

GTTGTTCATC CAGAAACTGG TGCAATCATC ATTGGTGCGG ATCAATTAAT CACTGAAGAT 240 

CTTGCCAGAG AAATCGTGGA CGCTGGTATT GAAAAAGTAA CCATCCGTTC TGTCTTTACA 300 

TGTAACACAA AACATGGTGT ATGTAAGCAC TGTTACGGAC GTAACCTTGC AACTGGTTCT 360 

GACGTTGAAG TCGGAGAAGC AG 382 



(2) INFORMATION FOR SEQ ID NO: 975: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 975: 



AACATAATAA TCTTTCGGAT ACAAATAATA AGAAATGTTA TTTTTAAAAA TAAAACAATA 60 

TTTTTTAAGT AATATTAAGA ATGTGAGAGT GTAAAATATT TTGTGTAAAT GAAAAAATCC 120

ATACAAAAAA GGAAGTCGCT TCTGTAGAAT AAAGTTAACG ACAACCAATT CACAGArAAG 180 

AGGACTTCCC TATGAATGAT TTTACTACAG AAATTGTGCA AACTCTAGTC ACTAAAGGCG 240

ATTTAAATGA ATTATTCCGT TCGCACTTAG AAAAAGCGAT AAACACACTC CTACGGACTG 300

AATTAACGGC TTTTTTAGAT TACGAAAAAT ATGATCGCAC TGGTTTTAAT TCAGGTAATT 360 

CGAGAAACGG TTCTTACTTT CGATCAATCA AAACCGAATA TGGTGAATTA ACATTGGAAA 420 

TACCTAGAGA TCGTAATGGT GAGTTTAAAC ACCAAACTTT ACCAGCCTCC AAAGACAACG 480



AnCATTGGAA CCCTAT 496

(2) INFORMATION FOR SEQ ID NO: 976:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 455 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 976: 

CTTGCATAGT AAACTTCATA ATATCCCCAG TAATCGAAAC TTTCGCTGGA TCAATAAAAC 60 

GACCACTTTC ATCTAAAAAC GACATTTTGT ACGTCTTCTC TGGAATATCG AATGTCTGCA 120 

CTGCTCGCTG CGCACGCAtG ATTTCTGGTG TATTAACTAT TATTGGACTC GTCACTGACG 180

CTTCAACTTG CCCGACTACT TGTTCAGTCT GTTCTGGTAG tCTTCATCAG CTGGCGGTAC 240

cTtACTTCCG TATCTGCCTC TGGAAAAGAA ACGGGCGCCC GTCCTTGgAA CCCCCTCTGg 300 

AcGGGgACTA CTGtTTGGGC GGGTCCTCCC CTTGGAACAT CTGGGGGnCG CCATGGGTTT 360 

TCAGCAATTG GCTGGGGGAT GGCCATTTTA ATGGAACTGG ATTAGGTnAA GGAACCACAC 420 

AAAGCGGATT GGCnAATAAA CGGGTTTCCT TTTTT 455
(2) INFORMATION FOR SEQ ID NO: 977: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 489 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 977: 

CTTGTTGTGC CTCTAAGTCT CTTAATACAC CTTGGTACTG CTGTAGCTCT AACTTCGCAT 60 

TAGAAAGGTC TTCTTGTGCT TGTGCATAAG CAAGTGTAGC AACGGCTTGT TGTTCTTTAA 120 

GTTCCTCCAA TGATTTCATA GAATTTTCAT AGGCTTCTTG TGCTTGAACC ACTGCCGCTT 180 

CTGCTTTGTT TAACTCGTTT AATGCTTTTT CGTATGTTGG TTGAGCATTT TCTAAACGAG 240

ACAATGAACG TGTACTTGTT TTAAGCGCTA CTTGTTTTTG ATTTAAGTTT TCTTTTGCTA 300 

AATCCAATAC TTCCTTCaAG CTATTCaATG TGtTTTCyTC TGCTTTTTGT GCnTCyTGtT 360 

TTCTTTCAGC ACTTtAGCAG ACGtTGkTCm ATCGCTGgtG CnTCCGmACG tTGgTTTcGg 420

CTTTyCaAAA CAGCAGCCgA TCcGaATAAC cCATGATGnG CATGACCTTG TTGTGAAGCT 480

TTCCGCGTG 489

(2) INFORMATION FOR SEQ ID NO: 978:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 978: 



AATTAATTAC ACAGTTATTT AGTCAATTTT GTTTAGGAAA ATAAGACAAG AGAGGAATTT 60

TTTATGAATC mATATCAAGC AGAATCATAT GATGTTATTG TCGTCGGTGC TGGACATGCT 120

GGCTCTGAAG CAGCCTTAGC AGCCGCACGA ATGGGCGTTA AAACGTTACT TTTAACGATT 180

AATTTAGaTA TGGTTGCTTT aTGCCATGTA ACCCTTCTGT TGGTGGACCT GCCAAAGGAG 240

TCGTTGTCCG AGAAATCGtG CATTAGGCGG CGAAATGGGT AAAAACATgA TAAAACCTAC 300

ATCCAAATGC GGATGCTAAA TACTGGTAAA GGTCCAGCAG gCGTGCGTgC GCGCGCAAcT 360

GgACAACATG CCAAGCGCCA nAATGAACAT ACCATGAAAA AGAGAATTTA CCCTCGCAAG 420

GATGTnGAGA TTAACGGGAA ATGCGTTTGC GGGGTGACTC ACAGGCCCAA CGn 473



(2) INFORMATION FOR SEQ ID NO: 979:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979: 



ACAACGTTCC AAACACCAGC GGATGAAGCA ACCGTAACCG CACCAACAAT CACAGGAGTG 60

ACAGGTAATT CAACAGCAGG TTACGAATTA AAGGAACTGC CGATGCCAAT GCCACGGTTG 120

AGATCCGAAA TGCAGGAGGT GCCGTGATAG GTACAGGTAC TGCTGATGGG ACAGGGGCAT 180

TTACAGTTAC CATTCCCGCA GTGAAGCAGG TGCGAATGAA ACGTTAACCG CCGTAGCGAA 240

AAACGCCAGC GGTACAGAAA GTACGCCAAC AACGTTCCaA ACGCCArCGG rTCCTaATTA 300

CGCCCGTGGG GACGCCAATT TGTTTGAGAC TTGTTAACCG GkTAGTTACC AcCAAAGGsT 360

TATgAGGkTC CAAGGGGACT TGCTTGAAGT TTTgGGCACC aCCATTTnGA GGGTTTCCGC 420

GATTTCAnCT TTGGCACGGT nCCTTTGGTT ACTTTGCAAC AACTTTGGAA CTTTACGGGA 480

AnT 483



(2) INFORMATION FOR SEQ ID NO: 980: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 537 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 980:

CCTnTGGGCC GCCCACGGAA TTGCCCCGGC GGAAATTGCC GCTATTAATT CACGTCCATT 60

CCTAAGGATT TGAACCCCCn GnCTTGACAA GCCGTTGGTG GAGCGAAAAA GGnTCCAAAC 120

TTTACCGTCG TGGAAAACCC AACTGGAGGA CTTAGGAATG TTTCTTCTAA AGATTTAGCT 180

GCAAArGAAA AAGAAGTAGA CCAACTACAA AAAGAACAAG CCCAAAAGAT TGCCCAACAA 240

GCAGCTGAAT TAAAAGCCAA AAATGAAAAA ATTGCCAAAG AAAATGCAGA AATTGCGGCA 300

AAAAATAAAG CGGAAAAAGA GCGCTACGAA AAAGAAGTGG CGGAATACAA CAAACACAAA 360

AATGACAAAG GCTATGTGAA TGAAGCAATC AGCAAAGACT TAGTTTTTGA TTCAAGCATT 420

GTGACTAAGG ATaCTAAAAT AGACaAGATT TACAGGTGGC CAGTTTATTA AAGCTTCCGA 480

TTTTAACAAG TAAACCCAAG GGACAATCCA nAGATATTTT TTACAAAATT TAAGTAA 537



(2) INFORMATION FOR SEQ ID NO: 981: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981: 

TGTGGGTAGC GGCGAAATTC CAAACGAACT TGGAGATAGC TGGTTCTCTC CGAAATAGCT 60 

TTAGGGnTAG CCTCGGAATT GAGaATGATG GAGGTAGAGC ACTGTTTGGA CTAGGGGCCC 120 

ATCTCGGGTT ACCGAATTCA GATAAACTCC GrATGCCATT CATTTATaTC CGGGAGTCAG 180

ACTGCGAGTG ATAAGATCCG TAGTCGAAAG GGAAACAGCC CAGACCACCA GCTAAGGTCC 240 

CnAAATATAT GTTAAGTGGA AAAGGATGTG GGGTTGCACA GACAACTAGG ATGTTGGCTT 300 

AGAAGCAGCC ACCATTTAAA GAGTG 325



(2) INFORMATION FOR SEQ ID NO: 982: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 982: 

GTAATATCCT ATAAAAGATA TTGGGTTCCC CCATTCGGAA ATCTCTGGAT CATAGCTTAC 60 

TTACAGsTCC CCAAAGCATA TCGGTGTTAG TCCCGTCCTT CATCGGCTCC TAgTGCCAAG 120 

GCATCCACCG TGCGcCCTTA TTCACTTAAc CTTATCAACC TTAcGGTTGG GTCTTGATCA 180 

CTTCTTCTAG CGATAGAAGG TTAATCAATA AATAAGCAAT TGaACTaTTA AAAAACTCAT 240

TCAACGCGGk GTTCCgGTTt GGTTTgAAyC TtTtACTCmA AAATCCAGTT TTCAATGAAC 300 

GAATGTTTgA GAGTAGACCC CCCCAAA 327 
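
By way of illustration only, the entries of the sequence listing above can be checked mechanically, since each entry states a LENGTH and each row ends in a running position count. The following Python sketch, which forms no part of the original disclosure, joins the rows of one entry and compares the number of bases found with the stated length. The function names `parse_listing_rows` and `check_entry` are hypothetical, and the sketch assumes that lowercase letters and the IUPAC ambiguity codes seen in the listing (for example n, k, m, r, w, s, y) each count as one base.

```python
# Hypothetical helpers for checking one sequence-listing entry.
# Assumption: every letter, upper or lower case, is one base, and any
# trailing all-digit tokens on a row are position counters, not sequence.

IUPAC = set("ACGTURYKMSWBDHVN")


def parse_listing_rows(rows):
    """Join the rows of one entry into a single sequence string."""
    parts = []
    for row in rows:
        tokens = row.split()
        # Drop trailing counters; a split counter such as "24 0" is two tokens.
        while tokens and tokens[-1].isdigit():
            tokens.pop()
        parts.append("".join(tokens))
    return "".join(parts)


def check_entry(declared_length, rows):
    """Return (length_matches, observed_length, unexpected_characters)."""
    seq = parse_listing_rows(rows)
    unexpected = sorted({c for c in seq if c.upper() not in IUPAC})
    return len(seq) == declared_length, len(seq), unexpected
```

A mismatch between the stated and observed length, or a non-empty list of unexpected characters, would suggest a transcription error in that entry rather than a feature of the sequence.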
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

A. The indications made below relate to the microorganism referred to in the description
on page 8, line 27

B. IDENTIFICATION OF DEPOSIT    Further deposits are identified on an additional sheet [ ]

Name of depositary institution: American Type Culture Collection

Address of depositary institution (including postal code and country)

10801 University Boulevard
Manassas, Virginia 20110-2209
United States of America

Date of deposit: May 2, 1997

Accession Number: 55969

C. ADDITIONAL INDICATIONS (leave blank if not applicable)    This information is continued on an additional sheet [ ]

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

For International Bureau use only

[ ] This sheet was received by the International Bureau on:

Authorized officer
What Is Claimed Is:

1. Computer readable medium having recorded thereon the nucleotide sequence
depicted in SEQ ID NOS: 1-982, a representative fragment thereof or a nucleotide
sequence at least 95% identical to a nucleotide sequence depicted in SEQ ID NOS: 1-982.

2. The computer readable medium of claim 1 having recorded thereon any one
of the fragments of SEQ ID NOS: 1-982 depicted in Tables 2 and 3 or a degenerate
variant thereof.

3. The computer readable medium of claim 1, wherein said medium is selected
from the group consisting of a floppy disc, a hard disc, random access memory (RAM),
read only memory (ROM), and CD-ROM.

4. The computer readable medium of claim 3, wherein said medium is selected
from the group consisting of a floppy disc, a hard disc, random access memory (RAM),
read only memory (ROM), and CD-ROM.

5. A computer-based system for identifying fragments of the Enterococcus
faecalis genome of commercial importance comprising the following elements:

a) a data storage means comprising the nucleotide sequence of SEQ ID NOS: 1-982,
a representative fragment thereof, or a nucleotide sequence at least 95% identical to a
nucleotide sequence of SEQ ID NOS: 1-982;

b) search means for comparing a target sequence to the nucleotide sequence of
the data storage means of step (a) to identify homologous sequence(s), and

c) retrieval means for obtaining said homologous sequence(s) of step (b).



6. A method for identifying commercially important nucleic acid fragments of
the Enterococcus faecalis genome comprising the step of comparing a database
comprising the nucleotide sequences depicted in SEQ ID NOS: 1-982, a representative
fragment thereof, or a nucleotide sequence at least 95% identical to a nucleotide sequence
of SEQ ID NOS: 1-982 with a target sequence to obtain a nucleic acid molecule comprised
of a complementary nucleotide sequence to said target sequence, wherein said target
sequence is not randomly selected.

7. A method for identifying an expression modulating fragment of Enterococcus
faecalis genome comprising the step of comparing a database comprising the nucleotide
sequences depicted in SEQ ID NOS: 1-982, a representative fragment thereof, or a
nucleotide sequence at least 95% identical to the nucleotide sequence of SEQ ID NOS: 1-982
with a target sequence to obtain a nucleic acid molecule comprised of a
complementary nucleotide sequence to said target sequence, wherein said target sequence
comprises sequences known to regulate gene expression.

8. An isolated protein-encoding nucleic acid fragment of the Enterococcus
faecalis genome, wherein said fragment consists of the nucleotide sequence of any one of
the fragments of SEQ ID NOS: 1-982 depicted in Tables 2 and 3, or a degenerate variant
thereof.

9. A vector comprising any one of the fragments of the Enterococcus faecalis
genome of claim 8.

10. An isolated fragment of the Enterococcus faecalis genome, wherein said
fragment modulates the expression of an operably linked open reading frame, wherein
said fragment consists of the nucleotide sequence from about 10 to 200 bases in length
which is 5' to any one of the open reading frames of claim 8.

11. A vector comprising any one of the fragments of the Enterococcus faecalis
genome of claim 8.

12. An organism which has been altered to contain any one of the fragments of
the Enterococcus faecalis genome of claim 8.

13. An organism which has been altered to contain any one of the fragments of
the Enterococcus faecalis genome of claim 10.

14. A method for regulating the expression of a nucleic acid molecule
comprising the step of covalently attaching to said nucleic acid molecule a nucleic
acid molecule of claim 10.

15. An isolated polypeptide encoded by any of the fragments of the
Enterococcus faecalis genome of claim 8.

16. An isolated polynucleotide molecule encoding any one of the polypeptides
of claim 15.

17. An antibody which selectively binds to any one of the polypeptides of claim 15.

18. A method for producing a polypeptide in a host cell comprising the steps of:

a) incubating a host containing a heterologous nucleic acid molecule whose
nucleotide sequence consists of any one of the fragments of the Enterococcus faecalis
genome of claim 8, under conditions where said heterologous nucleic acid molecule is
expressed to produce said protein, and

b) isolating said protein.
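
By way of illustration only, the three elements recited in claim 5 (data storage means, search means and retrieval means) and the comparisons recited in claims 6 and 7 might be sketched in Python as follows. This sketch forms no part of the original disclosure. The claims do not name a comparison algorithm, so the naive ungapped window scan below is an assumption made for brevity, as is the use of ungapped percent identity as a stand-in for "at least 95% identical". The names `reverse_complement`, `percent_identity` and `search` are hypothetical. Only A, C, G and T are complemented; ambiguity codes pass through unchanged, which is a simplification.

```python
# Hypothetical sketch of a storage/search/retrieval arrangement.
# Assumption: the "database" is a dict mapping an identifier to a sequence,
# and homology is approximated by ungapped identity over a sliding window.

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement; non-ACGT symbols are left as they are."""
    return seq.translate(COMPLEMENT)[::-1]


def percent_identity(a, b):
    """Ungapped, case-insensitive identity of two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def search(database, target, threshold=95.0):
    """Scan both strands; return (seq_id, offset, strand, identity) hits."""
    hits = []
    for strand, query in (("+", target), ("-", reverse_complement(target))):
        for seq_id, seq in database.items():
            for i in range(len(seq) - len(query) + 1):
                identity = percent_identity(seq[i:i + len(query)], query)
                if identity >= threshold:
                    hits.append((seq_id, i, strand, identity))
    return hits
```

In this arrangement the dictionary plays the part of the data storage means, `search` the search means, and a lookup of each returned identifier in the dictionary the retrieval means. A practical system would more likely use a gapped alignment tool, since a full window scan grows with the product of database size and target length.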
Claims Nos.:

because they relate to subject matter not required to be searched by this Authority, namely:

See Remark

2. [ ] Claims Nos.:

because they relate to parts of the International Application that do not comply with the prescribed requirements to such
an extent that no meaningful International Search can be carried out, specifically:

3. [ ] Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).

Box II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet)

This International Searching Authority found multiple inventions in this international application, as follows:

1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all
searchable claims.

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this International Search Report
covers only those claims for which fees were paid, specifically claims Nos.:

4. [X] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

1-7, see subject 1

FURTHER INFORMATION CONTINUED FROM PCT/ISA/210

This International Searching Authority found multiple (groups of)
inventions in this international application, as follows:

1. Claims: 1-7

Computer readable medium having recorded thereon the
nucleotide sequences depicted in SEQ ID nos. 1-982, a
representative fragment thereof or a nucleotide sequence at
least 95% identical to a nucleotide sequence depicted in SEQ
ID nos. 1-982; a computer-based system for identifying
fragments of the Enterococcus faecalis genome of
commercial importance comprising: a) a data storage means
comprising said nucleotide sequence(s); b) search means for
comparing a target sequence to the nucleotide sequences of
the data storage means of step (a) to identify homologous
sequence(s), and c) retrieval means for obtaining said
homologous sequence(s) of step (b); a method for identifying
commercially important nucleic acid fragments of the
Enterococcus faecalis genome comprising the step of
comparing a database comprising said nucleotide sequence(s)
with a target sequence to obtain a nucleic acid molecule
comprised of a complementary nucleotide sequence to said
target sequence, wherein said target sequence is not
randomly selected; a method for identifying an expression
modulating fragment of the Enterococcus faecalis genome
comprising the step of comparing a database comprising said
nucleotide sequence(s) with a target sequence to obtain a
nucleic acid molecule comprised of a complementary
nucleotide sequence to said target sequence, wherein said
target sequence comprises sequences known to regulate gene
expression;



2. Claims: (8-18) partially

An isolated protein-encoding nucleic acid fragment of the
Enterococcus faecalis genome, wherein said fragment
consists of the nucleotide sequence of any one of the
fragments of SEQ ID no. 1 depicted in Tables 2 and 3, or a
degenerate variant thereof; a vector comprising any one of
the fragments of SEQ ID no. 1 depicted in Tables 2 and 3; an
isolated fragment of the Enterococcus faecalis genome,
wherein said fragment modulates the expression of an
operably linked open reading frame, wherein said fragment
consists of the nucleotide sequence from about 10 to 200
bases in length which is 5' to any one of the open reading
frames of SEQ ID no. 1 depicted in Tables 2 and 3 or a
degenerate variant thereof; a method for regulating the
expression of a nucleic acid molecule comprising the step of
covalently attaching to said nucleic acid molecule a nucleic
acid molecule consisting of the nucleotide sequence from
about 10 to 100 bases 5' to any one of the open reading
frames of SEQ ID no. 1 depicted in Tables 2 and 3 or a
degenerate variant thereof; an isolated polypeptide encoded
by any one of the fragments of SEQ ID no. 1 depicted in Tables
2 and 3; an antibody which selectively binds to any one of
said polypeptides; a method for producing a polypeptide in a
host cell comprising a) incubating a host containing a
heterologous nucleic acid molecule whose nucleotide sequence
consists of any one of the fragments of SEQ ID no. 1 depicted
in Tables 2 and 3, under conditions where said
heterologous nucleic acid molecule is expressed to produce
said protein, and b) isolating said protein;



3-983. Claims: (8-18) partially 

Idem as subject 2 but limited to each of the sequences
of SEQ ID no. 2 to 982, i.e. invention 3 is limited to the
fragments of SEQ ID no. 2 depicted in Tables 2 and 3,
invention 4 is limited to the fragments of SEQ ID no. 3
depicted in Tables 2 and 3, and so on.

For the sake of conciseness, the second subject matter is 
explicitly defined, the other subject matters are defined by 
analogy hereto. 



REMARK:

Although claims 1-4 could, at least partially, be considered as a mere
presentation of information, Rule 39.1(v) PCT, and claims 5-7 at least
partially as a program for computers (Rule 39.1(vi) PCT), the search has
been carried out as far as possible in our systematic documentation.